Emir U

SEO Short Description (2–3 lines): Emir U. is a research-focused software engineer applying mathematics, statistics, and computer science to real-world problems, with 18+ years in software and 7+ years in commercial research. A PhD candidate in astronomy with a background in applied maths and philosophy, he writes about machine learning, logic, and statistical modeling.

https://emiruz.com

RSS Feed

1/26/2026

data science research applied statistics machine learning methods logic programming computational science

Articles from this Blog

24 articles from this blog

1/17/2026 • EN

Explainable unsupervised query tagging

Explains an unsupervised method for tagging search queries using evidence theory and Python, demonstrated with map query examples.

Python Natural Language Processing Information Retrieval

1/4/2026 • EN

Snakes & ladders: a short statistical analysis

A statistical analysis of the classic board game Snakes & Ladders, modeling it as a Markov chain to calculate the expected game length.

statistics graph theory R

1/1/2026 • EN

pyevidence: practical evidence theory

Introduces pyevidence, a Python library for practical implementation of Dempster-Shafer evidence theory, addressing computational challenges.

Python Probability Theory Computational Methods

10/30/2025 • EN

Modelling beliefs about sets

Explores using Dempster-Shafer theory to model probabilistic beliefs about sets based on quantified logical statements, as an alternative to Bayesian methods.

set theory Probability Theory Dempster Shafer Theory

8/17/2025 • EN

A short statistical reasoning test

A statistical reasoning test with three practical problems on sorting uncertain fractions, highlighting anomalies, and estimating population sizes.

data analysis Estimation Probability

5/10/2025 • EN

Fitting models from noisy heuristic labels

Explains the 'data programming' weak supervision paradigm for training models using noisy heuristic labels, with a practical example.

Weak Supervision Maximum Likelihood Estimation Data Programming

3/30/2025 • EN

Bootstrapping ranking models with an LLM judge

Using an LLM to label Hacker News titles and train a Ridge regression model for personalized article ranking based on user preferences.

Machine Learning llm Sentence Transformers

$Kelly fractions for independent simultaneous bets$

1/12/2025 • EN

Kelly fractions for independent simultaneous bets

Explains the Kelly criterion for bet sizing and extends it to multiple simultaneous independent bets using mathematical derivation and Python code.

Python Kelly Criterion Betting Strategy

4/28/2024 • EN

RBF kernel approximation with random Fourier features

Explains kernel ridge regression and scaling RBF kernels using random Fourier features for efficient large-scale machine learning.

Machine Learning Kernel Methods Rbf Kernel

4/24/2024 • EN

Metric learning with linear methods

Explores a closed-form solution for linear metric learning, deriving a transformation matrix to align feature distances with response distances.

Machine Learning optimization Matrix Algebra

3/24/2024 • EN

The "Billion Row Challenge!" with Fortran

A developer documents their journey tackling the 'Billion Row Challenge' in Fortran, optimizing performance from over 2 minutes to under 6 seconds.

performance optimization benchmarking Fortran

2/2/2024 • EN

Advent of Code in Prolog, Haskell, Python and Scala

A developer compares solving Advent of Code puzzles in Prolog, Haskell, Python, and Scala, analyzing productivity, code style, and language ergonomics.

Python Advent Of Code Haskell

11/19/2023 • EN

Domicles: a novel logic puzzle using Dominoe tiles

Introduces 'Domicles,' a logic puzzle using domino tiles, with examples and a Prolog implementation for puzzle generation.

Prolog Constraint Programming Logic Puzzle

10/18/2023 • EN

A minimal probabilistic Prolog meta-interpreter

A technical exploration of a minimal probabilistic Prolog meta-interpreter for stochastic simulation.

Probabilistic Programming Prolog Logic Programming

10/15/2023 • EN

Better data analysis with logic programming

Explores using logic programming (Prolog) for data analysis, demonstrating its application on a diamond pricing dataset to build robust models.

data analysis R Statistical Modeling

8/12/2023 • EN

Analysis of the data job market using "Ask HN: Who is hiring?" posts

Analysis of Hacker News job posts shows the Data Scientist role declining while ML Engineer roles rise, indicating a shift in the data job market.

Data Science Data Engineering Job Market Analysis

7/30/2023 • EN

An optimal-stopping quant riddle

A detailed analysis of an optimal stopping problem involving drawing cards for reward, exploring mathematical strategies and first-principles reasoning.

statistics algorithm Probability

6/18/2023 • EN

Blocking, covariate adjustment and optimal experiment design

Explains blocking, covariate adjustment, and optimal design to improve statistical power in online experiments, with a Python implementation.

Python Statistical Power Experimental Design

5/12/2023 • EN

Semi-supervised clustering with logic programming

Explores using logic programming and Prolog for semi-supervised clustering, arguing it's more intuitive than traditional algorithms for rule-based problems.

artificial intelligence Clustering Semi Supervised Learning

4/30/2023 • EN

Prolog for data science

Explores using Prolog for symbolic reasoning in data science, integrating it with Python for tasks like piecewise regression analysis.

Data Science Regression Prolog

1 2 Next

Emir U

Articles from this Blog

Select Language