Install ClickHouse Faster
A guide to quickly install ClickHouse on macOS using a one-line shell command and demonstrates its use for converting CSV data to Parquet.
Mark Litwintschik is a Big Data, AI, GIS, and networking consultant with international experience, helping clients across the UK, USA, Europe, and beyond. He specializes in large-scale data analysis, geospatial insights, and technology consulting for major corporations and organizations.
97 articles from this blog
A guide to quickly install ClickHouse on macOS using a one-line shell command and demonstrates its use for converting CSV data to Parquet.
A technical benchmark comparing PostgreSQL, ClickHouse, and BigQuery for fast geospatial coordinate conversion using Uber's H3 hexagon system.
Explains how IPinfo's probe network maps the physical location and metadata of nearly every IPv4 address, detailing its uses and origins.
A developer compares performance of a Rust-based TLD extraction script rewritten in Go, analyzing processing times on a large reverse DNS dataset.
An analysis of the world's fastest FizzBuzz implementation, written in Assembler and optimized for AVX2, achieving 56 GB/s output.
ROAPI is an open-source API server built in Rust that automatically creates REST APIs from static data files like CSV, JSON, and Parquet.
An overview of the Actix web framework for Rust, covering its history, features, and a practical example of setting up a WebSocket chat application.
An overview of the Rocket web framework for Rust, covering its features, history, and a practical setup guide.
A technical guide on building high-performance PostgreSQL extensions using the Rust programming language and the pgx framework.
A developer explores using a Rust library to significantly speed up the process of extracting top-level domains from a massive reverse DNS dataset.
Learn how to use Git and external tools to track changes and diff binary files like Office documents, PDFs, images, and videos.
Explores S2, a faster extension of Google's Snappy compression library, focusing on performance trade-offs and practical setup.
An overview of MeiliSearch, a minimalist, Rust-based full-text search engine, highlighting its features, codebase, and performance.
Explores MinIO, an open-source, S3-compatible object storage solution for on-premises or private cloud deployments.
A technical guide on setting up Prometheus and Grafana to monitor a ClickHouse database server, including installation and configuration steps.
Introducing Data Fluent, an open-source Python package for analyzing and understanding PostgreSQL database structure, row counts, and growth trends.
A technical benchmark of the Hydrolix analytics platform on AWS, testing its performance on a 1.1 billion row NYC taxi dataset.