Data Lakehouse Features

Platform
No vendor lock-in
Built on open standards
Easy setup
Unlimited compute
Lakehouse
Clusters
Cluster Size
Complete SQL Data Lakehouse
Decoupled Compute & Storage
Auto-Scaling
Unlimited Time Travel
Full ANSI SQL
User Defined Functions (UDFs)
Query History and Profile
Unlimited
Unlimited
Lakehouse - Query Federation
Native support for structured and semi-structured file formats
Analyze data stored in RDBMS databases (e.g. MySQL, PostgreSQL)
Analyze data stored in NoSQL sources (e.g. MongoDB, Cassandra)
Lakehouse - Connectivity
Connecting from Java, Python, Go, Node.js
DBT Integration (data build tool)
BI Integrations - Tableau, Power BI, Metabase etc.
Notebooks Integration
Rest API
Spark Jobs
Number of Executors
Spark Jobs
Auto-Scaling
Advanced Metrics & Logs
Unlimited
SQL Editor
SQL Worksheets
Query History
Autocomplete and Syntax Highlighter
Data Governance - Data Access Control
Centralized Data ACL
RBAC (Role Based Access Control)
Data Masking
Data Governance - Data Catalog
Data Discovery
Data Monitoring & Observability
Automatic PII/PCI Detection
Lineage
Notebook service
Jupyter Notebook Service
Pyspark Kernel
Python Kernel with pre-installed Mongo libraries
Security and Compliance
SOC2
GDPR, HIPAA ready
Always-on Enterprise Grade Encryption (in transit and at rest)
Encryption with Customer-Managed Keys (BYOK)
Audit Log
Deployment options
On-Premise Deployment
Hybrid Deployment
Private Cloud Deployment
AWS Deployment
Azure Deployment
Google Cloud Deployment
Multi-Cloud Deployment
Multi-RegionDeployment
SLA/Authentication
SLA
Federated Authentication & SSO
Support
First Class Onboarding
Migration Assistance
Enterprise Level 24 x 7 Support
Dedicated Communication Channel (e.g. Slack, MS Teams, Discord)
Want to learn more about our product or book a demo?