ClaudeNew

48 AI Skills for Claude Code

Load & Transform Your Data. Supercharge your AI.

AI-Powered Development

The declarative data stack with AI-powered tooling. Ingest, transform, and orchestrate data pipelines from your IDE with intelligent assistance at every step.

Used in production across Banking, Energy, & Payments. From Gigabytes to 100+ Terabyte Scale

Starlake integrates effortlessly for maximum flexibility.

airflowdagsterbigquerysnowflakeredshiftdatabricksduckdbelasticmysqlpostgressql
airflowdagsterbigquerysnowflakeredshiftdatabricksduckdbelasticmysqlpostgressql
airflowdagsterbigquerysnowflakeredshiftdatabricksduckdbelasticmysqlpostgressql
airflowdagsterbigquerysnowflakeredshiftdatabricksduckdbelasticmysqlpostgressql

Features

Focus more on business value, less on pipelines

No-Code Ingestion icon

No-Code Ingestion

Through declarative configuration, data is validated, transformed and loaded into your data warehouse without writing a single line of code.

Low-Code Transformations icon

Low-Code Transformations

Declare the datasets you need and the transformations you want to apply, the write strategy and the rules you want to enforce, and let Starlake do the rest.

Automated Workflows icon

Automated Workflows

Let Starlake infer your model dependencies and apply predefined and custom Airflow® or Dagster® templates to automate your workflows.

AI-Powered Tooling icon

AI-Powered Tooling

Build pipelines from your IDE with an intelligent VSCode extension and 48 AI skills that turn Claude Code into your Starlake expert.

Revolutionize Your Data Workflows

Comprehensive Solutions for Every Stage of Your Pipeline

From no-code ingestion to low-code transformations, Starlake automates workflows and enforces data governance, giving you the power to manage data at scale with ease and accuracy.

No-Code Data Ingestion

Extract and load data from diverse sources into your data warehouse without writing a single line of code. Validate, transform, and secure your data effortlessly

Low-Code Transformations

Use YAML and SQL to define transformations without complex scripting. Apply rules, enforce schema, and process data at scale with ease.

Data Governance and Quality

Ensure data consistency and compliance with schema enforcement, validation rules, and automated quality checks at every stage.

Automated Workflow Orchestration

Automate dependencies and workflows with Airflow or Dagster templates. Focus on delivering insights while Starlake manages the execution.

Showcase

Discover Starlake in Action

Learn how to load, transform, test, and automate data workflows effortlessly with clear, actionable tutorials designed for every skill level.

No-Code Ingestion from Any Source

Learn how Starlake simplifies data ingestion by loading structured and unstructured data from diverse sources, enforcing schema, and ensuring data quality all without writing a single line of code

Low-Code Transformations Made Easy

Discover how to use YAML and SQL to apply complex transformations, enforce schema rules, and prepare analytics-ready datasets effortlessly.

Ensure Data Quality with Starlake Testing

Watch how Starlake automatically validates data integrity, enforces contracts, and tracks quality metrics to meet SLAs and compliance standards.

Revolutionize Your Data Workflows

Comprehensive Solutions for Every Stage of Your Pipeline

From no-code ingestion to low-code transformations, Starlake automates workflows and enforces data governance, giving you the power to manage data at scale with ease and accuracy.

Code less, deliver more

Declare the ingestion and transformation outcomes and let Starlake and your data warehouse take care of the underlying logic.

  • Don't code, declare your intent using YAML or our browser based UI
  • Infer dependencies and automate workflows
  • Reuse orchestration templates among models and projects

Software Engineering for Data

Develop and test your workload locally and deploy globally. Use your native SQL dialect both on your test and production environments.

  • Test your load and transformation logic locally
  • Validate pipelines on small datasets first
  • Support for major data warehouses

Complete Data Lifecycle Management

Starlake covers the entire data lifecycle, from data ingestion to data monitoring, including data validation, transformation and orchestration.

  • Extract from source database or middleware in full or incremental mode
  • Infer schema and data types from your inputs
  • Apply transformations using SQL SELECT statements

Data Governance with Contracts

Keep your lakehouse from becoming a dataswamp using automated testing, schema enforcement, and validation rules.

  • Schema enforcement ensuring data consistency
  • Transformation logic governed by rules
  • Data quality tracked and aligned with SLAs

SLA Commitments & Monitoring

Track Service Level Agreements (SLAs) to ensure data services meet the expected standards of quality, availability, and performance.

  • Clear availability and freshness metrics
  • Real time monitoring and auditing
  • Historical analysis of issues

Flexible Deployment Options

Deploy Starlake on your own infrastructure or use our SaaS offering. Focus on your business while we handle the infrastructure.

  • Serverless SaaS offering available
  • Self-hosted Docker deployment option
  • Ultra light infrastructure footprint

Developer Experience

Your AI-Powered Data Engineering Toolkit

Build data pipelines faster with an intelligent VSCode extension and 48 AI skills that turn Claude Code into your Starlake expert.

Starlake for VSCode

The full data platform, inside your editor

From schema inference to DAG deployment, manage your entire data pipeline without leaving VSCode. Supports BigQuery, Snowflake, Redshift, Databricks, and DuckDB.

Auto Schema Inference

Automatically infer schemas from data sources and generate YAML configurations. No manual schema writing required.

SQL Preview & Dry Run

Write, preview, and execute SQL transformations with Jinja2 support. Validate before running expensive queries.

Lineage & ER Diagrams

Generate interactive ER diagrams, explore data lineage across pipelines, and review access controls visually.

One-Click Orchestration

Generate, dry-run, and deploy workflow DAGs for Airflow and Dagster directly from your editor.

AI Skills for Claude Code

48 specialized knowledge modules for AI-assisted data engineering

Turn Claude Code into your Starlake expert. Each skill teaches the AI every CLI command, YAML pattern, and best practice so it generates correct configurations on the first try.

8 skills

Ingestion & Loading

autoloadloadingeststage
7 skills

Transform & Extract

transformextractextract-data
5 skills

Lineage & Quality

lineagecol-lineageexpectations
10 skills

Orchestration & Ops

dag-generatedag-deployserve
Correct-by-construction YAML & SQL output
Covers all engines: DuckDB, BigQuery, Snowflake, Redshift
Production-grade patterns including SCD2 & data quality

FAQ

Frequently asked questions

Still have questions? Email us at [email protected]