Intro to Apache Iceberg with Apache Polaris and Apache Spark
A technical guide on using Apache Iceberg with Apache Spark and Polaris for building and managing a data lakehouse, covering setup, operations, and optimization.
A technical guide on using Apache Iceberg with Apache Spark and Polaris for building and managing a data lakehouse, covering setup, operations, and optimization.
A beginner-friendly introduction to using PySpark for big data processing with Apache Spark, covering the fundamentals.
Using GitHub Actions to trigger Airflow DAGs for orchestrating data pipelines across Spark, Dremio, and Snowflake.
A hands-on tutorial for building a Data Lakehouse on your laptop using Apache Iceberg, Spark, Nessie, Minio, and Dremio.
A weekly tech digest covering Microsoft Fabric, Power BI, Purview updates, and articles on Generative AI, Semantic Kernel, and the AutoGen framework.
A tutorial on building a local Data Lakehouse using Docker Compose with Apache Spark, Minio, Dremio, and Nessie.
A guide to configuring Apache Spark for use with the Apache Iceberg table format, covering packages, flags, and programmatic setup.
Troubleshooting an Azure Synapse Analytics error: 'LSRServiceException – Could not find Linked Service' when running AutoML.
Summary of key application-agnostic talks from Spark+AI Summit 2020, focusing on scaling and optimizing deep learning models.