Introduction to Data Engineering Concepts | Understanding Data Sources and Ingestion
An introduction to data engineering concepts, focusing on data sources and ingestion strategies like batch vs. streaming.
A technical guide exploring different Flink SQL connectors and formats for ingesting and processing Debezium CDC events from Apache Kafka topics.
Explores methods for ingesting Debezium CDC events from Kafka into Apache Flink using different SQL connectors and data formats.
A guide to setting up and using Dremio's Auto-Ingest feature for automated, event-driven data loading into Apache Iceberg tables from cloud storage.
A guide to building a real-time data pipeline using Azure services (Event Hub, Stream Analytics, ADLS, and ADX) and the Parquet file format.
A technical guide on setting up Azure Event Hub to ingest and route compressed data into Azure Data Explorer (ADX) for real-time analytics.
A guide to installing and using LightIngest on Ubuntu for data ingestion into Azure Data Explorer, including managed identity authentication.
Explains how to use Azure OpenAI's 'Add your data' feature programmatically via REST APIs to integrate custom documents into AI responses.
A guide to building a custom chatbot in Azure OpenAI using your own data, covering setup, costs, and integration with Azure Cognitive Search.
Explores methods for copying data into Azure Synapse Analytics (formerly SQL DW), focusing on the CTAS statement and the newer COPY INTO command.