Real-Time Lakehouse Patterns with Apache Flink and Iceberg
A technical guide on building real-time lakehouse architectures using Apache Flink 2.1 and the Dynamic Iceberg Sink, addressing schema drift, file proliferation, and operational rigidity.
A technical guide on building real-time lakehouse architectures using Apache Flink 2.1 and the Dynamic Iceberg Sink, addressing schema drift, file proliferation, and operational rigidity.
A curated list of interesting tech links for April 2026, covering data engineering, analytics, and AI integration.
Explores three streaming architectures for Apache Iceberg: Spark Structured Streaming, Flink, and Kafka Connect, focusing on trade-offs between latency and table maintenance.
Explores Materialized Tables in Apache Flink 1.20+, covering setup, SQL definition, and automated refreshes with examples.
A monthly roundup of interesting links focused on Kafka, event streaming, stream processing, and analytics in the tech industry.
A monthly tech link roundup focusing on Kafka, event streaming, and data engineering, including personal news about company acquisitions.
A monthly roundup of interesting links and resources for developers, focusing on Apache Kafka, event streaming, and tech talks.
Explores the role of Large Language Models (LLMs) in AI, covering major model families, providers, and concepts like hallucinations.
A technical guide on connecting Apache Flink SQL to a Kafka broker on Confluent Cloud, covering broker endpoints and API key configuration.
Explores building AI Agents as streaming SQL queries using platforms like Apache Flink for improved consistency, scalability, and developer experience.
Explores building AI Agents as streaming SQL queries using platforms like Apache Flink for improved consistency, scalability, and developer experience.
Explains the importance of time semantics (event vs. processing time) and watermarks for accurate data processing in Apache Flink and Kafka SQL.
A monthly roundup of curated links and articles on data engineering, Kafka, CDC, stream processing, and AI/ML topics.
A technical guide exploring the Confluent Cloud API for deploying and managing Apache Flink workloads, including authentication and deployment models.
A technical tutorial on using Apache Flink SQL to explore and process real-time flood monitoring data from a government API, demonstrating data wrangling techniques.
A technical guide on joining two data streams using Apache Flink SQL, including code examples and practical considerations.
A technical tutorial on using the UNNEST operator in Flink SQL to explode nested arrays of sensor data into separate rows.
Part 2 of a guide on running Apache Flink on Kubernetes, covering fault tolerance, high availability, savepoints, and observability.
Part 2 of a guide on running Apache Flink on Kubernetes, covering fault tolerance, high availability, savepoints, and observability.
A technical guide on installing Apache Flink's Kubernetes operator and deploying your first Flink job, with a focus on automation and setup.