Robin Moffatt 1/13/2017

Data Processing and Enrichment in Spark Streaming with Python and Kafka

Read Original

This article details a real-world proof of concept for low-latency data processing. It explains how to use Apache Spark Streaming, Python, and Kafka to ingest Twitter data, filter tweets for suspected copyright-infringing links using specific match criteria, enrich the data, and output results to different Kafka topics, including performance monitoring.

Data Processing and Enrichment in Spark Streaming with Python and Kafka

Comments

No comments yet

Be the first to share your thoughts!

Browser Extension

Get instant access to AllDevBlogs from your browser