Running Big Data Discovery Shell and Jupyter Notebook on Big Data Lite VM 4.5
Guide to setting up Big Data Discovery Shell and Jupyter Notebooks on Oracle's Big Data Lite VM for advanced data science work.
Guide to setting up Big Data Discovery Shell and Jupyter Notebooks on Oracle's Big Data Lite VM for advanced data science work.
A guide to creating presentation slides using Jupyter Notebook and Reveal.js, including automation and hosting on GitHub Pages.
A summary of the author's experience and key takeaways from attending the PyData Berlin 2016 conference, including notable talks.
Explains improvements in joblib's compressed persistence for Python, focusing on reduced memory usage and single-file storage for large numpy arrays.
Explains how to handle conditional Python package dependencies based on Python version, covering PEP 508, setuptools versions, and workarounds.
An introduction to Python sorted collections, explaining the need for libraries like SortedContainers for efficient sorted data types.
The Intermediate Python book is now available in a Chinese translation, which quickly gained popularity on GitHub.
A technical critique of Sucuri Security's flawed analysis of TLS certificate verification, focusing on errors in their assessment of Python's Requests library.
A Python script called csv2vw converts CSV data into Vowpal Wabbit's input format for machine learning, with examples for label handling.
A guide for academics with math/physics backgrounds transitioning into data science, covering skills, learning paths, and practical advice.
Analysis of a cryptographic vulnerability in the Beaker Python library's session encryption due to nonce reuse in AES-CTR mode.
A curated list of resources for beginners to learn Python specifically for data science, including tutorials, courses, and books.
A tutorial on using Python, Tesseract, and Wand to perform OCR (Optical Character Recognition) on PDF files and extract text.
A technical guide demonstrating how to call the RSiteCatalyst R package from Python using the rpy2 library for data analysis.
A Fedora maintainer shares a Python script to scrape and email daily reports of failed live CD builds from Koji.
A technical tutorial on building a data product using Python, Markov chains, and a dataset of science questions to generate random quiz questions.
A technical guide on processing millions of small text files using GNU Parallel and stream processing, without needing Hadoop or a database.
A guide on using the moto library to mock AWS S3 interactions in Python tests, replacing complex boto mocks.
A technical guide on using Python to scrape public data, including answers to questions, from the European Parliament website.
A guide to using Python's tempfile.NamedTemporaryFile() for creating and managing temporary files with control over deletion.