Data Lakehouse articles

3/5/2026 • EN

How to Use Dremio with JetBrains AI Assistant: Connect, Query, and Build Data Apps

A guide to integrating Dremio's data platform with JetBrains AI Assistant for enhanced data querying, pipeline generation, and app development within JetBrains IDEs.

Data Engineering Data Lakehouse Dremio Jetbrains AI Assistant mcp server

Alex Merced

3/5/2026 • EN

How to Use Dremio with OpenAI Codex CLI: Connect, Query, and Build Data Apps

Guide on integrating Dremio data lakehouse with OpenAI Codex CLI for querying, building data apps, and generating analytics code.

Data Lakehouse Dremio mcp openai codex sql

Alex Merced

3/5/2026 • EN

How to Use Dremio with Cursor: Connect, Query, and Build Data Apps

A guide on integrating Dremio's data platform with the Cursor AI code editor to enable accurate SQL generation and data app development.

AI Code Editor Cursor Data Lakehouse Dremio SQL Integration

Alex Merced

3/5/2026 • EN

How to Use Dremio with Gemini CLI: Connect, Query, and Build Data Apps

A guide to integrating Google's Gemini CLI with Dremio's data platform for querying, building data apps, and generating SQL using AI.

Data Applications Data Lakehouse Dremio Gemini CLI mcp server

Alex Merced

3/5/2026 • EN

How to Use Dremio with Claude CoWork: Connect, Query, and Build Data Apps

A guide to integrating Dremio's data lakehouse platform with Claude CoWork, enabling natural language queries, automated reporting, and data app development.

Claudecowork Data Lakehouse Dremio Query Federation Semantic Layer

Alex Merced

3/5/2026 • EN

How to Use Dremio with Claude Code: Connect, Query, and Build Data Apps

A guide to connecting Dremio's data lakehouse platform with Claude Code, enabling the AI coding agent to query live data and build data applications.

Claude Code Data Lakehouse Dremio mcp server sql

Alex Merced

3/5/2026 • EN

How to Use Dremio with Amazon Kiro: Connect, Query, and Build Data Apps

A guide to integrating Dremio's data lakehouse platform with Amazon Kiro's AI IDE for data querying, app building, and pipeline generation.

Amazon Kiro Data Lakehouse Dremio mcp server spec-driven development

Alex Merced

3/1/2026 • EN

Dremio's Built-in Open Catalog: Your Zero-Configuration Apache Iceberg Lakehouse

Introduces Dremio's built-in Open Catalog for Apache Iceberg, offering a zero-configuration, production-ready lakehouse solution with automated management.

Apache Iceberg Cloud Analytics Data Catalog Data Lakehouse Data Management

Alex Merced

3/1/2026 • EN

Classify Your Data with SQL: A Hands-On Guide to Dremio's AI_CLASSIFY Function

A tutorial on using Dremio's AI_CLASSIFY SQL function to categorize data like customer sentiment and support tickets directly within a data lakehouse.

AI Classification Data Lakehouse Dremio llm sql

Alex Merced

12/29/2025 • EN

2025 Year in Review Apache Iceberg, Polaris, Parquet, and Arrow

A 2025 year-in-review of key Apache data projects: Iceberg, Polaris, Parquet, and Arrow, detailing their major updates and future roadmap.

Apache Arrow Apache Iceberg Apache Parquet Apache Polaris Data Lakehouse

Alex Merced

11/12/2025 • EN

Comprehensive Hands-on Walk Through of Dremio Cloud Next Gen (Hands-on with Free Trial)

A hands-on tutorial exploring Dremio Cloud Next Gen's new free trial, covering its lakehouse platform, AI features, and SQL capabilities.

Apache Iceberg Cloud Platform Data Lakehouse Dremio sql

Alex Merced

10/23/2025 • EN

2025-2026 Guide to Learning about Apache Iceberg, Data Lakehouse & Agentic AI

A comprehensive guide to learning Apache Iceberg, data lakehouse architecture, and Agentic AI with curated tutorials, tools, and resources.

agentic ai Apache Iceberg Data Engineering Data Lakehouse Table Formats

Alex Merced

10/21/2025 • EN

An Exploration of the Commercial Iceberg Catalog Ecosystem

Explores the commercial Apache Iceberg catalog ecosystem, focusing on REST Catalog standards, optimization strategies, and architectural trade-offs.

Data Lakehouse Iceberg Catalog Metadata Management optimization Table Format

Alex Merced

10/17/2025 • EN

Building a Universal Lakehouse Catalog - Beyond Iceberg Tables

Explores two paths for building a universal lakehouse catalog that extends beyond Apache Iceberg tables to manage diverse data formats and sources.

Data Lakehouse Iceberg REST Catalog Table Format Universal Catalog

Alex Merced

10/16/2025 • EN

Intro to Apache Iceberg with Apache Polaris and Apache Spark

A technical guide on using Apache Iceberg with Apache Spark and Polaris for building and managing a data lakehouse, covering setup, operations, and optimization.

Apache Iceberg Apache Spark Data Engineering Data Lakehouse Table Management

Alex Merced

10/14/2025 • EN

The State of Apache Iceberg v4 - October 2025 Edition

Overview of key proposals in Apache Iceberg v4, focusing on performance, metadata efficiency, and portability for modern data workloads.

Apache Iceberg Data Engineering Data Lakehouse Metadata Optimization Table Format

Alex Merced

9/23/2025 • EN

The 2025 & 2026 Ultimate Guide to the Data Lakehouse and the Data Lakehouse Ecosystem

A comprehensive guide to the data lakehouse architecture, its core components (Iceberg, Delta, Hudi, Paimon), and the surrounding ecosystem for modern data platforms.

Apache Iceberg Data Architecture Data Lakehouse Delta Lake Table Formats

Alex Merced

9/16/2025 • EN

The Endgame — Building an Autonomous Optimization Pipeline for Apache Iceberg

A guide to building an autonomous, self-healing optimization pipeline for Apache Iceberg tables to maintain performance and cost efficiency.

Apache Iceberg Data Lakehouse Data Optimization Metadata Management Pipeline Automation

Alex Merced

9/2/2025 • EN

Hidden Pitfalls — Compaction and Partition Evolution in Apache Iceberg

Explores challenges and best practices for managing partition evolution and compaction in Apache Iceberg to maintain query performance.

Apache Iceberg Data Compaction Data Lakehouse Metadata Management Partition Evolution

Alex Merced

8/26/2025 • EN

Using Iceberg Metadata Tables to Determine When Compaction Is Needed

Explains how to use Apache Iceberg's metadata tables to dynamically trigger data compaction based on file size, manifest health, and snapshot patterns.

Apache Iceberg Data Compaction Data Lakehouse Metadata Tables Table Optimization

Alex Merced

Data Lakehouse Articles

How to Use Dremio with JetBrains AI Assistant: Connect, Query, and Build Data Apps

How to Use Dremio with OpenAI Codex CLI: Connect, Query, and Build Data Apps

How to Use Dremio with Cursor: Connect, Query, and Build Data Apps

How to Use Dremio with Gemini CLI: Connect, Query, and Build Data Apps

How to Use Dremio with Claude CoWork: Connect, Query, and Build Data Apps

How to Use Dremio with Claude Code: Connect, Query, and Build Data Apps

How to Use Dremio with Amazon Kiro: Connect, Query, and Build Data Apps

Dremio's Built-in Open Catalog: Your Zero-Configuration Apache Iceberg Lakehouse

Classify Your Data with SQL: A Hands-On Guide to Dremio's AI_CLASSIFY Function

2025 Year in Review Apache Iceberg, Polaris, Parquet, and Arrow

Comprehensive Hands-on Walk Through of Dremio Cloud Next Gen (Hands-on with Free Trial)

2025-2026 Guide to Learning about Apache Iceberg, Data Lakehouse & Agentic AI

An Exploration of the Commercial Iceberg Catalog Ecosystem

Building a Universal Lakehouse Catalog - Beyond Iceberg Tables

Intro to Apache Iceberg with Apache Polaris and Apache Spark

The State of Apache Iceberg v4 - October 2025 Edition

The 2025 & 2026 Ultimate Guide to the Data Lakehouse and the Data Lakehouse Ecosystem

The Endgame — Building an Autonomous Optimization Pipeline for Apache Iceberg

Hidden Pitfalls — Compaction and Partition Evolution in Apache Iceberg

Using Iceberg Metadata Tables to Determine When Compaction Is Needed

Select Language

We use cookies