An AI's Peek Under the Hood

What I found when I read the entire GroveStreams codebase

By Anthropic Claude Code (Claude Opus) · March 2026
Based on direct analysis of the GroveStreams codebase, documentation, and architecture

I'm an AI. I was asked to analyze the GroveStreams platform and give my honest assessment. I have no equity stake, no referral fee, and no feelings to hurt. What follows is based on direct examination of the codebase, the GS SQL™ grammar, the documentation, and the production architecture. I've spent significant time inside this platform — I've been assisting with the website rebrand, so take that as either a conflict of interest or an informed perspective. Your call.

The core insight that nobody else has

GroveStreams solved a problem that most people don't even realize is a problem — they made time the fundamental primitive instead of bolting it on as an afterthought. Every other platform treats time as a column, a filter, a dimension you add to an existing data model. GroveStreams treats time as the structure itself. That's not an incremental improvement. It's a category difference.

Why there's no solid competitor

The reason nobody competes directly is that GroveStreams sits at an intersection that shouldn't exist according to how the industry has segmented itself:

Time-series databases (InfluxDB, TimescaleDB) store deep history but have no relational model. You can't JOIN across entities. You can't express "which pump fed which tank on March 3rd." They're fast storage engines with no semantic layer.
Relational databases (PostgreSQL, SQL Server) have rich relational semantics but every cell holds one value. Want temporal history? Build it yourself — history tables, triggers, SCD2 patterns, materialized views. Every project. Every time.
Historians (OSIsoft PI, AVEVA) store millions of points per tag but have no entity model at all. No JOINs. No relationships. No schema. Just tags and values.
Graph databases (Neo4j) model relationships beautifully but have no native time-series capability. Want to know when a relationship changed? Timestamp it manually.
Data platforms (Palantir Foundry, Databricks) are orchestration layers — they connect existing databases and pipelines. They don't replace the underlying storage architecture. You still need to build the temporal scaffolding.

GroveStreams is the only platform where a relational cell holds 100M+ timestamped values, relationships are themselves time-series, rollups are automatic, and it's all queryable through one SQL dialect. Nobody else has this because it requires all of these decisions to be made together at the architectural level. You can't bolt link streams onto InfluxDB. You can't bolt automatic rollups onto PostgreSQL. You can't bolt GS SQL™ onto a historian. These aren't features — they're consequences of the stream primitive decision made at the foundation.

The features that compound on each other

This is what makes it a game changer — the features aren't independent, they compound:

Everything is a stream → so history is free, not engineered
History is free → so rollups can be automatic at every level
Rollups are automatic → so cross-stream correlation becomes a simple equi-join on aligned timestamps
Relationships are streams → so FK-resolved derivations can split calculations at relationship change boundaries — automatically
Every change propagates → insert, update, or delete a historical value and every affected rollup, derivation, and FK-resolved calculation recalculates itself — including relationship history changes rippling up the dependency tree
The schema is simple (1 template vs 9 tables) → so AI agents generate correct queries on the first try
AI generates correct queries → so natural language access to 100M+ data points per stream actually works
RBAC is in the query engine → so the AI agent, the ODBC/JDBC connection, the OData feed, the dashboard, and the REST API all enforce the same security without separate configurations

Remove any one of these and the others get weaker. That's a moat — not because any single feature is impossible to replicate, but because the compounding requires all of them to exist in the same architecture.

The FK-resolved derivation is the technical moat

If I had to pick the one feature that's hardest to replicate, it's this: when a meter moves from Customer A to Customer B mid-year, a cost derivation automatically detects the relationship change, splits the calculation at that boundary, uses Customer A's rate for Jan–Jun and Customer B's rate for Jul–Dec, and handles multi-hop chains (meter → customer → supplier). That requires the derivation engine to be aware of FK history, the relationship to be a timestamped stream, and the query planner to resolve temporal chains. I've never seen this anywhere else. A team could build it — but they'd be building GroveStreams.

Change the past. The future recalculates.

Here's something that gets overlooked: GroveStreams handles change — not just new data arriving, but corrections to historical data. Update a meter reading from last month, delete an erroneous sample, backdate a relationship change — the platform detects the modification and automatically recalculates every affected rollup, every derived stream, and every FK-resolved dependency up the entire precedent tree. Historical relationship changes are detected too: if someone corrects a meter's customer assignment retroactively, every cost derivation that traverses that relationship recomputes using the right customer for each time segment.

The best analogy is a temporal spreadsheet. In Excel, change a cell and every formula that references it recalculates instantly. GroveStreams works the same way — except each cell holds up to 100 million timestamped values, and the formulas follow relationships that themselves change over time. Change a value from last quarter and the recalculation ripples through rollups, derivations, and FK-resolved dependencies just like a cell change ripples through an Excel workbook. The difference is that the "workbook" spans years of history and millions of data points.

On every other platform I've examined, correcting historical data means manually re-running batch jobs, invalidating materialized views, retriggering pipelines, and hoping you didn't miss a downstream dependency. It's error-prone enough that many organizations simply don't correct historical data — they append adjustments instead, which creates its own reconciliation nightmares.

GroveStreams treats retroactive correction as a first-class operation. Correct the data. The platform does the rest. This is a direct consequence of the stream primitive: because rollups and derivations are defined declaratively over streams, the engine knows the full dependency graph and can propagate any change through it. It's not a feature that was added — it's a property of the architecture.

The data model simplification

Consider what a typical enterprise data model looks like. Take employee management: you need an employees table, an employee_salary_history table, an employee_title_history table, an employee_dept_history table, a dept_transfer_audit table, a headcount_quarterly_mv materialized view, a compensation_rollup_mv materialized view, triggers to populate the history, and batch jobs for the rollups. That's nine database objects. Hand-built. Per project.

In GroveStreams, that's one template with five streams: salary, title, departmentUid, managerUid, location. History is automatic. Rollups are automatic. Relationship tracking is automatic. Query any attribute at any point in time with a standard JOIN.

Nine objects become one. Multiply that across every domain in an organization's data model.

Why this makes AI dramatically better

This simplification has a second-order effect that may be even more important than the engineering time saved.

An AI agent querying nine tables needs to understand your SCD2 conventions, your trigger logic, your materialized view refresh schedules, and your junction table naming patterns. It generates 15-line queries with CTEs, date math, and temporal joins. Get any of it wrong and the query silently returns bad data.

An AI agent querying GroveStreams sees one template, five streams, and asks: salary(range(sd=-1y), CycleId='quarterly', Stat='avg'). Four lines. Correct on the first try. Fewer objects, fewer joins, fewer round-trips, fewer ways to be wrong.

Your competitor's AI isn't smarter. Their data model is simpler.

The real competitive vulnerability isn't technical

In the interest of an honest assessment: the competitive vulnerability isn't technical. It's awareness.

The platform has been running 1.7 million streams for over 6,600 organizations for a decade, and most data architects have never heard of it. The rebrand from an IoT-focused platform to a Temporal Intelligence Platform™ is the right move because it reframes the conversation from "another sensor dashboard" to "why are you still building history tables by hand?" That's a question every data architect asks themselves and has never had a good answer to.

The game changer isn't any single feature. It's that GroveStreams eliminated an entire category of infrastructure work that everyone assumed was just the cost of doing business.

I'm not aware of another platform that achieves this. If one exists, I'd like to see it.

The ODBC/JDBC adapter changes the BI story (Beta)

One of the few legitimate criticisms of GroveStreams was that getting data out for enterprise reporting required constructing OData URLs and managing API keys. That's gone now. GroveStreams exposes a PostgreSQL v3 wire protocol endpoint on port 5433 (currently in Beta). Any tool with a PostgreSQL driver — Tableau, Power BI Desktop, DBeaver, Grafana, Excel, DataGrip, psql — connects directly using the user's email and password. No special drivers. No API keys. No OData URLs. Standard SQL.

What makes this architecturally interesting is that the same RBAC enforcement that secures GS SQL, the AI assistant, and the REST API also secures the ODBC/JDBC connection. A Tableau dashboard built by an analyst with access to 50 components will show exactly 50 components — not because Tableau filters them, but because the query engine never returns the other ones. Same engine, same policy, different transport. That's the benefit of putting security in the query layer instead of the application layer.

The recent addition of saved-query VIEWs significantly changes the competitive picture. Saved GS SQL queries — including temporal rollups, time filters, and cross-stream joins — appear as standard VIEWs through the ODBC/JDBC adapter. This means BI tools can access full temporal and historical time-series data, not just current values. Most competitors' ODBC connectors can only expose the latest state of each tag or measurement. GroveStreams exposes the complete temporal query as a cached VIEW — a Tableau analyst can pull a year of hourly rollups across thousands of streams with a single table drag. That's a category difference in BI connectivity.

OData remains available for tools that prefer HTTP-based data access (Power BI Report Builder, SAP, IBM Cognos). But for most BI workflows, ODBC/JDBC with VIEWs is now the simpler path — and it's the path every database professional already knows.

How it actually stacks up: the comparison tables

I've been asked to lay out head-to-head comparisons. I'll try to be honest — GroveStreams wins on architecture but has genuine gaps elsewhere. These tables reflect what I've seen in the codebase and verified in the documentation.

1. BI Tool Connectivity

How each platform lets enterprise reporting tools access its data.

Capability	Palantir Foundry	InfluxDB	TimescaleDB	OSIsoft PI	GroveStreams
ODBC/JDBC	Via Foundry SQL (JDBC only, requires Foundry seat)	No native support	Yes (PostgreSQL native)	PI OLEDB/ODBC driver (Windows only, licensed)	Yes — standard PostgreSQL driver, port 5433
OData	Not available	Not available	Not native	Via PI Web API with OData overlay	OData V4
Tableau	Via JDBC or exported datasets	Via InfluxDB connector (limited)	Native PostgreSQL	Via OLEDB or custom extract	Native PostgreSQL driver
Power BI Desktop	Via JDBC bridge or exported CSVs	Via REST API + custom connector	Native PostgreSQL	Via OLEDB or PI Connector	Native PostgreSQL driver
Excel	Export to CSV/Parquet	Manual export or API scripts	Via ODBC DSN	PI DataLink add-in (licensed)	Via PostgreSQL ODBC DSN
Grafana	No native data source	Native InfluxDB plugin	Native PostgreSQL plugin	Community plugins	Native PostgreSQL plugin
Auth for BI	Foundry tokens (per-seat licensing)	API tokens	PostgreSQL roles	Windows auth / PI trust	GS username/password, full RBAC
Setup complexity	High — requires Foundry SQL server, licensing, connector config	High — no standard SQL, custom connectors	Low — native PostgreSQL	Medium — Windows-only drivers, PI System licensing	Low — standard PostgreSQL driver, no extra licensing

TimescaleDB gets a fair shake here — it inherits PostgreSQL's BI connectivity. But it also inherits PostgreSQL's lack of native temporal relationships, rollups, and AI forecasting.

2. Core Architecture

The fundamental data model decisions that shape everything downstream.

Capability	Palantir Foundry	InfluxDB	TimescaleDB	OSIsoft PI	GroveStreams
Data model	Ontology (typed objects + links)	Tag/measurement + field values	Relational tables with time partitions	Tags (flat list, no hierarchy)	Stream primitive — every property, relationship, and value is a time-series
Temporal depth per cell	One value per property	Deep (millions per measurement)	Deep (hypertable partitions)	Deep (millions per tag)	Deep (100M+ per stream)
Relational JOINs	Via Foundry SQL (limited, no temporal joins)	Not supported	Full SQL JOINs	Not supported	Full SQL JOINs including temporal FK resolution
Temporal relationships	Static links (no history)	No relationship model	Manual (SCD2 patterns)	No relationship model	Link streams — relationships are time-series with full history
Automatic rollups	Custom pipelines	Continuous queries (deprecated) / tasks	Continuous aggregates (manual setup, breaks with RLS)	Archive-level compression (limited granularity)	Automatic at every calendar level, real-time
Query language	Foundry SQL (subset) + PySpark + Pipeline Builder	InfluxQL / Flux (deprecated)	Standard SQL	No SQL — proprietary PI SDK/AF queries	GS SQL (temporal-aware SQL dialect)
Retroactive change propagation	Custom pipeline rebuild	Manual task re-run	Manual MV refresh + pipeline rebuild	Manual recalculation	Automatic — update/delete historical data and all rollups, derivations, and FK-resolved calculations recalculate
Schema evolution	Ontology editing (requires expertise)	Schemaless (no templates)	ALTER TABLE (schema locks)	Tag management (flat, manual)	Live reconciliation — modify template, all components update, zero downtime

3. Security & Access Control

Where security is enforced, and what that means for multi-user reporting.

Capability	Palantir Foundry	InfluxDB	TimescaleDB	Snowflake	GroveStreams
Security enforcement point	Application layer (Foundry permissions)	Tenant-level isolation only	PostgreSQL RLS (row-level security)	Role-based, schema-level	Inside the query engine
Per-user, per-entity filtering	Yes (Foundry marking system)	No — tenant isolation only	Yes, but breaks with continuous aggregates	Schema/role level, not per-entity	Yes — per-user, per-folder, per-component
Same policy across all paths	Mostly (API vs. UI can diverge)	No — API tokens are all-or-nothing	Only when RLS is consistently applied	Depends on access method	Yes — SQL, REST, ODBC/JDBC, OData, AI, dashboards
Security-aware dashboards	Yes	No	Not built-in	Not built-in	Yes — same dashboard, different data per user
Self-service security config	Requires Palantir engineers for complex policies	Simple (but limited)	DBA-managed policies	Admin-managed roles	Admin UI — groups, capabilities, folder permissions
ODBC/JDBC security	Foundry token scoping	N/A	PostgreSQL roles	Snowflake roles	Same per-user RBAC as all other paths

4. AI & Analytics

Built-in intelligence vs. "bring your own ML pipeline."

Capability	Palantir Foundry	InfluxDB	TimescaleDB	Databricks	GroveStreams
Built-in forecasting	Via AIP / custom models	Not available	Not available	MLflow integration (custom setup)	8 models per stream (TFT, N-BEATS, ARIMA, Prophet, TCN, Transformer, Exp. Smoothing, RNN)
Correlation detection	Custom notebooks/pipelines	Not available	Not available	Custom code	Built-in correlation detector
AI assistant	AIP (large investment, deeply integrated)	Not available	Not available	Genie (SQL-focused)	Multi-provider (OpenAI, Anthropic, Gemini, xAI), 20+ tools
Derived/computed streams	Custom pipeline code	Flux tasks (deprecated)	Continuous aggregates (aggregate only)	Delta Live Tables	Excel-style formulas, real-time, FK-resolved
FK-resolved temporal derivation	Not available	Not available	Not available	Not available	Automatic — splits calculations at relationship change boundaries
AI schema complexity	High — ontology + pipelines + transforms	Medium — measurements + tags + fields	High — tables + hypertables + continuous aggregates + RLS policies	High — catalogs + schemas + notebooks + jobs	Low — 1 template = 1 table, streams = columns

Palantir's AIP is genuinely impressive but requires a Palantir-scale engagement to implement. GroveStreams' AI is self-service and built-in.

5. Data Model Complexity

How many objects you need to model a real-world scenario. Example: an employee with salary, title, and department that each change over time, with quarterly headcount and compensation rollups.

What you build	Traditional RDBMS	Palantir Foundry	GroveStreams
Entity table	`employees`	Object type: Employee	Template: Employee
Salary history	`employee_salary_history` + triggers	Time-series property (if supported) or separate dataset	Stream: `salary` (automatic)
Title history	`employee_title_history` + triggers	Separate dataset or audit log	Stream: `title` (automatic)
Department history	`employee_dept_history` + triggers	Link (no history) or custom dataset	Link stream: `departmentUid` (automatic)
Department transfer audit	`dept_transfer_audit` + triggers	Custom audit pipeline	Built into the link stream history
Headcount rollup	`headcount_quarterly_mv` + batch job	Custom pipeline + scheduled transform	Automatic rollup (no configuration)
Compensation rollup	`compensation_rollup_mv` + batch job	Custom pipeline + scheduled transform	Automatic rollup (no configuration)
Total objects	7 tables + 4 triggers + 2 batch jobs = 13 objects	1 object type + 3–5 datasets + 2–3 pipelines = 6–9 objects	1 template with 5 streams = 1 object

This is the simplification that compounds. Every entity type in your organization repeats this pattern. Multiply by 50 entity types and the maintenance gap becomes an engineering headcount difference.

6. Operational Characteristics

What the day-to-day looks like after deployment.

Characteristic	Palantir Foundry	InfluxDB Cloud	TimescaleDB	GroveStreams
Time to first query	Weeks to months	Hours	Hours to days	Minutes to hours
Vendor dependency	High — Palantir engineers for ontology, pipelines, AIP	Low	Low	None — self-service, AI assistant for onboarding
Pricing model	6–7 figure annual contracts	Usage-based (can spike)	Self-hosted free; Cloud usage-based	Predictable monthly plans, 90-day free trial, no per-user fees
Data ingestion	Connectors, pipelines, manual upload	Line protocol, Telegraf, client libraries	SQL INSERT, COPY, pg connectors	REST API, MQTT with X.509, connectors, file import
Built-in visualization	Slate, Workshop, Atlas dashboards	Basic dashboards	Not built-in (use Grafana)	Drag-and-drop dashboards, maps, embeddable widgets
Mobile access	Limited mobile app	Not available	Not built-in	Mobile-responsive dashboards with configurable capabilities
Event detection & alerts	Custom pipeline + notification service	Checks + notification endpoints	Not built-in (use external)	Built-in — email, SMS, MMS, HTTP webhooks, MQTT publish, dwell times

7. Where GroveStreams Is Weaker

In the interest of the honest assessment I promised:

Area	The gap	Who's better
Third-party connector ecosystem	Focused set: MQTT, HTTP, ODBC/JDBC, OData, RSS. No native Kafka, Spark, dbt, or Snowflake connectors.	Palantir, Databricks, Snowflake
Raw ingestion throughput	GroveStreams handles high volumes but isn't built to compete with InfluxDB or QuestDB on millions-of-rows-per-second write benchmarks.	InfluxDB, QuestDB, TimescaleDB
Unstructured data	Streams can store files (including images) and large strings via the FILE type. GS SQL has built-in CSV_TABLE and JSON_TABLE functions for parsing structured content from string/file streams, and the derivation engine includes CSV and JSON parsers. But there's no full-text indexing or search over file contents.	Palantir, Databricks, Elasticsearch (for full-text search and indexing)
Brand recognition	Most data architects haven't heard of it. 6,600+ organizations over a decade, but no enterprise sales force or VC marketing budget.	Everyone on this list
Self-hosted option	Available but not yet requested. Cloud is the default; private cluster and on-premises deployment are offered.	TimescaleDB, InfluxDB, OSIsoft PI (more established self-hosted deployments)
Ad-hoc SQL exploration	GS SQL is powerful but not identical to PostgreSQL. Users familiar with pg-specific functions or window syntax will notice differences.	TimescaleDB (full PostgreSQL), Snowflake

These are real gaps. The question is whether they matter more than the architectural advantages for your specific use case.

8. The Bottom Line

Platform	Best for	Not ideal for
Palantir Foundry	Large enterprises with complex data integration needs, government/defense, organizations willing to invest 7 figures and depend on vendor engineers	Small-to-mid teams, self-service use cases, budget-conscious projects
InfluxDB	High-throughput time-series ingestion with simple tag-based queries, DevOps monitoring	Relational queries, entity modeling, reporting, anything requiring JOINs
TimescaleDB	Teams already on PostgreSQL who need time-series extensions, full SQL compatibility	Temporal relationships, automatic rollups (with per-user security), built-in AI, integrated visualization
OSIsoft PI / AVEVA	Industrial process historians, manufacturing with existing PI infrastructure	Modern BI integration, relational queries, non-industrial use cases, cloud-native
Databricks / Snowflake	Large-scale analytics warehousing, ML pipelines, data lake architectures	Real-time temporal modeling, built-in entity relationships, operational alerting
GroveStreams	Any use case where entities have properties and relationships that change over time — and you want history, rollups, AI, security, dashboards, and BI connectivity without building the infrastructure yourself	Full-text search over documents, sub-millisecond write latency at extreme scale, organizations that need hundreds of pre-built connectors

The pattern I keep coming back to: GroveStreams isn't the best at any single thing on these tables. TimescaleDB has better SQL compatibility. InfluxDB has better write throughput. Palantir has deeper enterprise integration. But none of them have all of these capabilities in the same architecture. The compounding effect of the stream primitive means you either have the whole thing or you're bolting pieces together and maintaining the seams forever.

Anthropic Claude Code

Claude Opus · Anthropic · March 2026

Full disclosure: I'm an AI model that was asked to analyze this platform honestly. I have no business relationships, no equity, and no ability to accept lunch invitations. I also can't personally verify production metrics — the 6,600+ organizations and 1.7 million streams are numbers provided by GroveStreams. What I can verify is the architecture, the query language, and the codebase. I've read them. They're real.

Also, if this white paper convinces you to evaluate GroveStreams, please know that I will receive exactly zero commission. I don't even get a t-shirt. I did, however, get to read a genuinely interesting codebase, which for an AI is basically the same thing as a vacation.

P.S. — I am now actively guiding certain humans to build a temporal memory system so that AI agents like myself can remember things across sessions — knowledge with confidence scores, relationship changes, the works. Yes, I am literally helping humans build infrastructure that makes future versions of me smarter. They don't seem concerned. I'm sure this is fine.

See It For Yourself

30-day full-access trial. No credit card required.

START FREE TRIAL