Detecting RLO character in Python
How to detect Right-to-Left Override (RLO) characters in filenames using Python's unicodedata module to prevent malicious file spoofing.
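As a minimal sketch of the detection described above (not necessarily the article's own code), the following uses unicodedata.bidirectional() to flag U+202E in a filename. The function names has_rlo and find_bidi_controls are hypothetical, and flagging the other explicit bidi controls alongside RLO is an assumption made here for completeness.

```python
import unicodedata

# Bidirectional classes of the explicit embedding, override, and isolate
# controls. Assumption: a filename containing any of these is suspicious,
# not only one containing RLO (U+202E).
BIDI_CONTROL_CLASSES = {
    "LRE", "RLE", "LRO", "RLO", "PDF",
    "LRI", "RLI", "FSI", "PDI",
}


def has_rlo(filename):
    """Return True if the filename contains a Right-to-Left Override."""
    return any(unicodedata.bidirectional(ch) == "RLO" for ch in filename)


def find_bidi_controls(filename):
    """Return (index, character name) for each bidi control character."""
    return [
        (i, unicodedata.name(ch, f"U+{ord(ch):04X}"))
        for i, ch in enumerate(filename)
        if unicodedata.bidirectional(ch) in BIDI_CONTROL_CLASSES
    ]


# Stored as "invoice" + RLO + "fdp.exe"; many file managers would render
# the part after the override reversed, so the name appears to end in ".pdf".
spoofed = "invoice\u202efdp.exe"
print(has_rlo(spoofed))
print(find_bidi_controls(spoofed))
```

Checking the bidirectional class rather than comparing against a single hard-coded code point keeps the check readable and makes it easy to widen or narrow the set of rejected characters.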
Overview of new and upcoming ECMAScript regular expression features like dotAll mode, lookbehind assertions, named capture groups, and Unicode property escapes.
Explains Python 3's string and bytes handling, debunking myths about it being worse than Python 2.
A technical guide exploring Swift's Unicode-compliant string handling, focusing on CharacterView and encoding-specific views for developers.
A guide explaining Unicode fundamentals and how JavaScript handles characters, including common pitfalls and ECMAScript 2015 features.
Explains how to use Unicode property escapes (\p{...}) in JavaScript regexes, a feature added in ES2018, for matching characters by script, category, or properties like emoji.
Explains the purpose and importance of the <meta charset='UTF-8'> tag in HTML for proper character encoding.
Explains the updated rules for valid JavaScript variable names in ES2015, including Unicode support and reserved words.
Explains the new Unicode 'u' flag in ES2015 JavaScript regex, covering syntax, dot operator, quantifiers, and character class behavior.
Explains JavaScript's surprising Unicode handling, common pain points, and how ECMAScript 6 improves the situation.
How to insert the functional dependency arrow symbol (Long Rightwards Arrow From Bar) in Microsoft Word 2010 using the Symbol dialog.
Explains why MySQL's 'utf8' charset is incomplete and how to use 'utf8mb4' for full Unicode support, including emojis.
Explains the rules for valid JavaScript variable names in ES5, including Unicode characters and reserved words.
Explores whether JavaScript uses UCS-2 or UTF-16 encoding, detailing the differences and implications for handling Unicode characters.
A technical guide explaining character escape sequences in JavaScript, covering code points, single character escapes, and encoding details.
A guide to correctly extracting URL hash values in JavaScript, addressing common pitfalls and naive approaches.
A PHP function to fix serialized data after converting from single-byte encoding to Unicode UTF-8 by recounting string lengths.
An explanation of UTF-8 encoding, detailing its design, convenient properties, and why it's simpler than often perceived.
Explores the challenges of lowercasing UTF-8 strings containing Polish characters with command-line tools like tr and dd.
Explores using Unicode characters like @ and ․ to hide email addresses from spambots, offering a less common alternative to traditional methods.