Daniel Janus 2/16/2010

Downcasing strings

Read Original

The article details the author's experience trying to downcase a large UTF-8 text file with Polish characters using standard Unix tools like `tr` and `dd`. It highlights the poor or non-existent support for multibyte characters in these tools and contrasts them with working solutions in `gawk` and `sed`. An update notes that `tr` on OS X/BSD handles this correctly.

Downcasing strings

Comments

No comments yet

Be the first to share your thoughts!

Browser Extension

Get instant access to AllDevBlogs from your browser