All About Parquet Part 08 - Reading and Writing Parquet Files in Python
A practical guide to reading and writing Parquet files in Python using PyArrow and FastParquet libraries.
A practical guide to reading and writing Parquet files in Python using PyArrow and FastParquet libraries.
A developer recounts an intense debugging session with the django-allauth package that turned a minor sign-up issue into a major investigation.
A tutorial on using PyArrow for data analytics in Python, covering core concepts, file I/O, and analytical operations.
Analyzes the viability of using uv's managed Python in production, covering portability, performance, and security implications.
A comprehensive guide to building interactive data applications using the Streamlit framework, covering setup, visualization, ML integration, and deployment.
Introduces rerankers, a lightweight Python library providing a unified interface for various document re-ranking models used in information retrieval pipelines.
The author begins migrating a SaaS application from Heroku to Digital Ocean, focusing on creating a Dockerfile and new deployment tools.
A developer explains the benefits of using Python virtual environments within Docker containers for predictability, standards, and team communication.
A developer explores using Kamal, a deployment tool, to deploy a web app to a Digital Ocean Droplet virtual machine.
A guide to building fast, production-ready Docker containers for Python applications using the uv tool, focusing on multi-stage builds and caching strategies.
A beginner-friendly guide explaining Python's built-in functions with simple definitions and examples for easy learning.
A developer shares their experience using ChatGPT as a coding assistant to boost productivity, generate tailored code, and troubleshoot.
A developer details automating a TradingView subscription service using n8n, Python, Telegram bots, and web crawlers to handle customer access and emails.
Learn how to extract text and data from PDFs using Python tools like pypdf, OCR, and table extraction techniques.
A guide for R users to learn basics of Python, HTML, CSS, JS, and C++ to enhance their data science and web development projects.
Building a Django job to send activation email reminders before user accounts expire, plus discussion on dependency management.
Python 3.12 deprecates datetime.utcnow(). This article explains why and how to migrate to timezone-aware datetime.now(timezone.utc).
A tutorial on using Plotnine, a Python data visualization library based on the grammar of graphics, comparing it to ggplot2 and using Polars.
Announcing skrub 0.2.0, a library update simplifying machine learning on complex dataframes with new features like tabular_learner.
This episode covers implementing email confirmations and customizing Django-allauth forms for password reset functionality in a SaaS project.