Daniel Janus 2/16/2010

Downcasing strings

Read Original

The article details the author's experience trying to downcase a large UTF-8 text file with Polish characters using standard Unix tools like `tr` and `dd`. It highlights the poor or non-existent support for multibyte characters in these tools and contrasts them with working solutions in `gawk` and `sed`. An update notes that `tr` on OS X/BSD handles this correctly.

Downcasing strings

Comments

No comments yet

Be the first to share your thoughts!

Browser Extension

Get instant access to AllDevBlogs from your browser

Top of the Week

2
Designing Design Systems
TkDodo Dominik Dorfmeister 2 votes
3
Introducing RSC Explorer
Dan Abramov 1 votes
5
Fragments Dec 11
Martin Fowler 1 votes
6
Adding Type Hints to my Blog
Daniel Feldroy 1 votes
7
Refactoring English: Month 12
Michael Lynch 1 votes
9