Sessionizing Log Data Using SQL
A technical guide on using SQL window functions to group discrete time-series events into user sessions for data analysis.
A technical guide on using SQL window functions to group discrete time-series events into user sessions for data analysis.
Tips for improving Drupal performance by optimizing database indexes and SQL queries, inspired by Doom's difficulty levels.
Discusses the importance of naming conventions and coding style for readability and productivity, with examples from SQL and C++.
A data engineer shares five practical lessons and performance tips for working with Apache Hive, focusing on common pitfalls and optimizations.
A historical account of Oracle's poor performance in a 1983 database benchmark and their alleged attempt to have the professor behind it fired.
A technical guide on using SQL window functions, specifically LAG, to calculate month-over-month revenue growth percentages for SaaS or recurring billing analysis.
Explains how to use PostgreSQL's CTEs (WITH clauses) to write more readable, composable, and maintainable SQL queries.
A guide to using SQL queries and a simple Ruby script to send personalized, data-driven emails to users, avoiding complex marketing tools.
A tutorial on using Apache Hive to create tables and views from data loaded into a Hadoop cluster, continuing a multi-part series.
Explains how to use PostgreSQL's COMMENT feature to document tables, columns, and SQL for better data understanding and maintenance.
A tutorial on using PostgreSQL's crosstab function to pivot data directly in SQL, enabling easier analysis without external tools.
A summary of upcoming technical talks on statistical computing, rare DNA variant analysis, and handling large datasets with R and SQL.
A collection of practical tips and guides for effective indexing in PostgreSQL, covering unused indexes, costs, and performance considerations.
A developer shares common SQL bad habits like using column numbers in ORDER BY, implicit joins, and lacking comments, with examples and better practices.
A guide to using PostgreSQL's array_agg function to efficiently aggregate and format data, avoiding manual application-level processing.
A developer shares his preferred method for working with PostgreSQL, advocating for the psql command-line tool and sharing key tips and configurations.
Argues that SQL databases are inherently non-scalable, questioning the effectiveness of traditional scaling techniques like caching and sharding.
A tutorial video demonstrating how to execute SQL queries within the R programming language using the 'sqldf' package for data analysis.
A developer shares his approach to writing clean, readable SQL with practical examples and best practices for structuring queries.
Explains how to use the Array datatype in PostgreSQL for storing and querying data like tags or purchase items, with practical examples.