MusicMood
A developer shares a data mining project that builds a machine learning model to classify songs as happy or sad based on their lyrics.
Sebastian Raschka, PhD, is an LLM Research Engineer and AI expert bridging academia and industry, specializing in large language models, high-performance AI systems, and practical, code-driven machine learning.
97 articles from this blog
A developer shares a data mining project that builds a machine learning model to classify songs as happy or sad based on their lyrics.
A Python tutorial showing how to download your Twitter timeline and visualize it as a word cloud using data science libraries.
Explores Naive Bayes classifiers for text classification, covering theory and applications like spam filtering and song lyric analysis.
Explains how to use the RBF kernel trick to perform nonlinear dimensionality reduction via Kernel PCA for complex datasets.
An overview of predictive modeling, supervised machine learning, and the core workflow for pattern classification tasks.
A technical guide to Linear Discriminant Analysis (LDA) for dimensionality reduction and classification in machine learning, with comparisons to PCA.
A technical guide to Dixon's Q test for identifying outliers in small datasets, including its method, application, and criticisms.
Explains feature scaling and normalization in machine learning, comparing standardization and Min-Max scaling, with examples using scikit-learn.
A tutorial on using Python tools for machine learning, covering data loading, visualization, preprocessing, and classification with scikit-learn.
A technical overview of molecular docking, focusing on using AutoDock 4.2 to estimate protein-ligand binding free energies and comparing scoring functions.
A guide to using Python's multiprocessing module for parallel programming to overcome the GIL and utilize multi-core CPUs.
A 5-step tutorial on converting Markdown to HTML with Python, adding syntax highlighting for code blocks using Python-Markdown and Pygments.
A tutorial on creating internal links and a table of contents in IPython Notebooks and Markdown documents using HTML anchors.
A technical guide on using OpenEye's command-line tools for molecular substructure alignment and low-energy conformer overlay workflows.
A guide to unit testing in Python, covering its benefits, components, and a practical walkthrough using the py.test framework.
A tutorial on creating customizable heat maps in R using the gplots package, covering installation, data preparation, and plot customization.
A technical guide on using SQLite and Python's sqlite3 module to efficiently manage and query large datasets, replacing slow text file processing.