As with the "CR LF" => "LF" policy... my pitch is to take "strong bets" on trends that are guaranteed to stay relevant, and to put the costs of edge cases on the few who demand them.
Be sure to read over this thread regarding Unicode normalization: