ETL Offload with Spark and Amazon EMR - Part 3 - Running pySpark on EMR
Part 3 of a series on offloading ETL to Spark on Amazon EMR, focusing on running PySpark code on a provisioned EMR cluster.
Part 3 of a series on offloading ETL to Spark on Amazon EMR, focusing on running PySpark code on a provisioned EMR cluster.
Reflects on how social influence and online reputation, inspired by Black Mirror, are becoming a new currency in tech and hiring.
Part 2 of a guide on developing ETL processes using Apache Spark, Jupyter Notebooks, and Docker on Amazon EMR.
Explores using Apache Spark on Amazon EMR to offload and improve ETL processes, comparing it to traditional Oracle-based solutions.
Learn how to externally resolve JavaScript Promises using a clever technique with getters and setters for better code architecture.
Explores why reflection in .NET is slow, examining CLR design goals, internal data structures, and the multi-step invocation process.
Explains the importance of source control and automated deployment for OBIEE, detailing the 'why' and 'how' to prevent release issues.
Explores academic research papers referenced in the .NET runtime and compiler source code, focusing on data structures and optimization algorithms.
Open-source Enhanced Usage Tracking for OBIEE now available, capturing detailed user interaction data for performance and analytics.
A tutorial on debugging an ASP.NET Core application running inside a Docker container using Visual Studio Code.
A personal recap of attending the FrontInPOA 2016 tech conference, covering talks on front-end development, web apps, and tech careers.
Explains the benefits of upgrading from PHP 5 to PHP 7, including major performance improvements, reduced memory usage, and new features.
Explains a UWP bug where AdaptiveTriggers fail if a control's name matches a VisualState name, with code examples and a workaround.
A presentation on Lazada's machine learning framework for ranking products in catalog and search results to improve user experience.
Rittman Mead announces open-source release of key BI/DI tools including a JavaScript API for OBIEE frontends and visual plugins.
Explains how to use and extend the HttpClient's message handler pipeline in .NET for custom request/response processing and unit testing.
Cimbalino Toolkit 2.4.0 released for UWP, featuring new controls, converters, and navigation service improvements.
A beginner-friendly introduction to computer networking concepts, focusing on the OSI model and essential hardware.
A deep dive into debugging a memory allocation issue in an operating system, starting from a bug report about slow iter_content performance.
Explains how to modify Java final fields using reflection, discusses compile-time constants, and warns about unexpected behavior.