Agent-native · One-stop delivery · Self-hostable

Let your team
talk to data directly

Connect databases and files, ask in natural language to get SQL, charts, and insights; turn frequent questions into scheduled jobs pushed on time to Lark / DingTalk / WeCom.

View docs

Data sources

15+ natively supported

IM push

Lark / DingTalk / WeCom

Monitoring

Cron · failure alerts

Deployment

SaaS / self-hosted

VALUE

Not another BI,
your team's data assistant

DataFinder is built for the “ad-hoc but frequent” data questions — the ones not worth a dashboard, yet asked every day.

Self-serve data,
no more queue

Everyday questions from ops and product answered directly by the Agent.

Spot an anomaly,
find the cause in one reply

Ask a follow-up like replying to a message; the Agent rewrites the SQL.

Key metrics,
delivered on time

Scheduled monitoring lands results in your IM group. Failures alert.

QUICK START

Four steps to onboard,
usable the same day

Add a database connection or upload CSV / Excel; schema and samples are read automatically.
Attach datasets to a Space and write down table relations, metric definitions, and field meanings.
Ask in natural language and see the SQL, tables, and charts. Off? Just ask a follow-up.
Charts, tables, and business narrative in one click. Export or push to IM groups on cron.

Capture semantics: field catalog and descriptions

Start chatting: analysis results and charts

CAPABILITIES

Six core capabilities,
end to end

Data access

Relational · Warehouse · Lake · Files

One-click database connections, direct file upload. All secrets Fernet-encrypted.

Smart Q&A

Schema understanding · SQL generation · Multi-turn

The Agent reads your schema and business notes, turning them into executable SQL.

Auto visualization

Bar · Line · Share · KPI

Picks the right chart from the result and question — no dragging, no type-picking.

Scheduled monitoring

Cron · Failure alerts · History

Run metric batches at any frequency with cron; detailed errors logged on failure.

Two-way IM

Lark · DingTalk · WeCom

@Bot to ask in a group; scheduled results pushed to report, on-call, and ops groups.

Measurable quality

Benchmark · Accuracy scoring

Maintain business Q&A pairs, batch-evaluate SQL accuracy, and compare across iterations.

USE CASES

Who uses it,
and how

SCENE 01

Business monitoring / daily reports

Turn daily GMV, DAU, and conversion funnels into scheduled jobs pushed to your group.

·Scheduled daily reports
·Pushed to IM groups
·Auto alerts on failure

SCENE 02

Self-serve data analysis

Automate frequent business questions so the data team focuses on complex analysis.

·SQL templates reused in a Space
·Q&A pairs measure accuracy
·Define once, apply everywhere

SCENE 03

Eng / product self-serve

Self-serve event tracking, error rates, and key-metric monitoring.

·No waiting on data schedules
·Auto-aware of tracking changes
·Cost controllable per Space

PLATFORM

Enterprise-grade foundation,
out of the box

Data-source coverage, IM channels, security and observability — set up once, self-hosting optional.

01 · DATA SOURCES

15+ data sources
natively supported

Relational

MySQL PostgreSQL ClickHouse MSSQL SQLite

Warehouse

Snowflake BigQuery DuckDB

Lake formats

Iceberg Delta DuckLake Avro

Files / streams

CSV Excel Google Sheets SLS

Object storage

S3 Azure Blob

Federated execution via DuckDB — cross-source joins in one pass.

02 · INTEGRATIONS

Ask data
right in your work chat

Lark Bot

Event subscription · Group push

DingTalk Bot

Outgoing · Signed

WeCom

App callback · Group bot

All IM secrets stored encrypted; test the connection with one click.

03 · ENTERPRISE

Secure · observable
· in control

Three-tier model auto-fallback

Flagship / Standard / Lite tiered routing; auto-switch along the chain when the primary model fails.
Transparent usage & cost

Per-call tokens, latency, model, and estimated cost are all visible; quotas at a glance.
Multi-tenant & permissions

Space-level data isolation; cross-user access strictly 404s — safe for internal deployment.

FastAPI + SQLAlchemy backend, React frontend; self-hosting supported.

PRICING

Billed by token,
pay for what you use

The free quota covers everyday use; anything beyond is charged from your balance by actual token usage.

Free quota on sign-up, auto-resets daily / monthly

Try every tier, no card required. Decide whether to top up after the quota runs out.

100

Daily calls

100k

Daily tokens

Monthly tokens

Lite

Light and fast, for quick queries.

¥0.0072 / 1k input

¥0.0216 / 1k output

SQL parsing / field extraction
Light summaries, text classification
~200ms first-byte latency

Standard

02 · Default

Smart and efficient, for everyday tasks.

¥0.0216 / 1k input

¥0.0648 / 1k output

Multi-step reasoning / Agent workflows
Function calling & structured output
Auto-fallback to Lite when the primary model fails

Flagship

Complex analysis and deep reports.

¥0.108 / 1k input

¥0.324 / 1k output

Complex analysis / deep reports
Multimodal · chart understanding
200k+ token long context

FAQ

Still have questions?
We've answered them

Will DataFinder replace BI tools? +

No. BI suits long-lived dashboards; DataFinder solves ad-hoc but frequent questions. They complement each other.

Is my data sent to external models? +

Only the schema, field descriptions, and your question reach the LLM — raw data rows are never sent externally.

Which data sources are supported? +

Mainstream relational databases, warehouses, lake formats, files and streams, and object storage. Multiple datasets can join in one Space.

How do I control the Agent's answer quality? +

Write instructions in a Space, use Benchmark to score SQL accuracy, and ask a follow-up when an answer is wrong.

Can it be self-hosted? +

Yes. FastAPI + SQLAlchemy backend, Vite + React frontend — it runs on your own servers. For self-hosting and enterprise plans, contact {email}.

Let your team talk to data directly

Not another BI, your team's data assistant

Self-serve data,no more queue

Spot an anomaly,find the cause in one reply

Key metrics,delivered on time

Four steps to onboard, usable the same day

Six core capabilities, end to end