Mailbag: Parsing Fields from PDFs—When to Use Machine Learning?
A developer asks when to use ML for parsing PDF fields with typos, and receives advice on using Levenshtein distance and human-in-the-loop solutions.
Learn two methods to replace all occurrences of a string in JavaScript: using a regex with the global flag and the newer replaceAll method.
Explains how to implement responsive images in Jekyll using a pre-render hook and a custom regex to replace Markdown syntax with a Liquid tag.
A guide to replacing long chains of PowerShell 'like' operators with more efficient and scalable regular expressions (RegEx) for string matching.
A developer shares their journey learning Regex by applying it to normalize phone numbers in Active Directory, moving from copy-pasting to understanding patterns.
A guide to supercharging search and global replace functionality in Vim using plugins and built-in tools for maximum speed.
An exploration of ASCII art found in comments within major .NET open-source repositories, categorized by technical themes.
A guide to setting up and using Livegrep, a fast, regex-based code search tool for large repositories, to improve GitHub search performance.
A Haskell developer shares a solution for performing regex substitutions with Unicode, specifically for embedding YouTube links.
A developer shares a clever one-line JavaScript solution using String.replace() to highlight search terms in text results with HTML <mark> tags.
A developer's cautionary tale about building an MVP too late, using a recipe ingredient parsing service as a case study.
A guide to creating dynamic regular expressions in JavaScript using tagged template literals and Proxies to simplify string escaping.
Explains how flawed Regular Expressions can cause ReDoS attacks, crippling Node.js services with catastrophic backtracking and high CPU usage.
A practical guide to a robust regular expression pattern for accurately extracting quoted strings, avoiding common pitfalls like greedy matching and escaped characters.
A curated collection of resources and tools for learning and using Regular Expressions (RegEx) in the R programming language.
A visual cheat sheet for the stringr package in R, originally created in French and translated to English, for string manipulation tasks.
Explains why complex regexes are often wrong and unmaintainable, advocating for simpler, more robust validation methods.
Fixing a TypeScript error in AngularJS when initializing a RegExp object incorrectly, with two valid solutions.
A developer investigates a performance bottleneck caused by regex validation in AutoMapper and explores faster alternatives for string validation.
Overview of new and upcoming ECMAScript regular expression features like dotAll mode, lookbehind assertions, named capture groups, and Unicode property escapes.