Join StarRocks Community on Slack

Connect on Slack
TABLE OF CONTENTS
     

    Menlo Park, Calif. -- June 12, 2025 -- CelerData, one of the world’s fastest, secure lakehouse engines for customer-facing and agent-driven analytics powered by StarRocks, today announced its new integration with the Databricks Data Intelligence Platform, bringing high-concurrency, sub-second analytical queries directly to Databricks users through CelerData Cloud, all without ETL or data duplication.

    Through this integration, Databricks Data Intelligence Platform users will be able to leverage CelerData to deliver product dashboards, embedded reports, and AI-driven experiences on live, governed lakehouse tables with data warehouse-like performance and without the governance headache. Users can enjoy:

    • Real-time, sub-second queries: Consistent millisecond-level response times even when tens of thousands of queries arrive per second.
    • Effortless scalability: CelerData’s compute warehouses scale in minutes—no repartitioning or data movement—while maintaining consistent sub-second SLAs at any concurrency.
    • Unified governance across all data and AI assets: Governed by Unity Catalog, ensure consistent access control, lineage, and auditing alongside all models, features, and datasets in a single governed platform.

     

    “We’re excited to bring CelerData’s unmatched query performance to Databricks Data Intelligence Platform users,” remarks Andy Ye, CelerData’s COO. “Through this integration, Databricks customers can experience the same best-in-class performance and scalability enjoyed by leading enterprises like Pinterest and Demandbase to power their customer facing and agentic AI analytics.”

    These performance, scalability, and governance gains are supported by four core features that make CelerData uniquely qualified to benefit Databricks Data Intelligence Platform users:

    • Vectorized C++ Execution Engine: CelerData’s fully SIMD-optimized engine scans, filters, joins, and aggregates in columnar batches, delivering consistent sub-second latency even at tens of thousands of QPS.
    • Unified Governance with Unity Catalog: CelerData natively integrates with Unity Catalog to enforce fine-grained permissions, trace data lineage, and align analytics with AI models, features, and business metrics—all from a single, governed source.
    • Elastic, Multi-Warehouse Compute: Users can deploy multiple scalable and isolated compute warehouses to scale linearly with traffic. Each workload stays performant and secure, whether it’s powering end-user reports or internal data apps.
    • Auto Materialized Views (AutoMV): Seamlessly boost performance and reduces cost by recommending impactful materialized views automatically rewriting queries behind the scenes.

     

    “Building on the Databricks Data Intelligence Platform through this integration allows us to bring the analytics power of StarRocks to Databricks customers through our CelerData Cloud platform” adds Andy Ye. “By running queries directly on open lakehouses, CelerData removes ingestion delays and governance headaches while maintaining speed and efficiency under massive query loads. Whether powering SaaS analytics, AI-driven applications, or interactive applications, it provides the scalability and reliability modern businesses need—without the trade-offs of traditional architectures.”

     

    About CelerData

    CelerData (powered by StarRocks) is the fastest query engine for customer-facing and AI-driven analytics at petabyte scale. Natively integrated with Apache Iceberg, Apache Hudi, and Delta Lake, it delivers low-latency, high-concurrency queries directly on open data—without ingestion delays or costly pipelines. Trusted by industry leaders like Pinterest, Tencent, and Expedia, CelerData powers the next generation of analytics on the Lakehouse. Learn more at: www.celerdata.com

    copy success