Monday, September 19, 2011

Convert Unicode to UTF-8

Here is a simple Ruby function that converts literal \uXXXX escape sequences (for example \u3232) in a given string into the corresponding UTF-8 characters:

def unicodeToUtf8(str)
    # Match a literal backslash-u followed by exactly four hex digits,
    # then pack the code point into its UTF-8 encoded character.
    return str.gsub(/\\u([0-9a-fA-F]{4})/) { [$1.to_i(16)].pack("U") }
end
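
As a quick check, here is a minimal usage sketch. The sample input string and the variable names `escaped` and `converted` are made up for illustration, and the function is repeated (restricted to hex digits) so the snippet runs on its own:

```ruby
# Same idea as the function above, repeated so this snippet is self-contained.
def unicodeToUtf8(str)
  str.gsub(/\\u([0-9a-fA-F]{4})/) { [$1.to_i(16)].pack("U") }
end

# Single quotes keep the backslashes literal, so this string really
# contains the six characters \u00e9 rather than an already-decoded "é".
escaped = 'caf\u00e9 \u3232'   # hypothetical sample input

converted = unicodeToUtf8(escaped)

puts converted                    # the decoded text
puts converted.encoding           # encoding of the result
p converted.codepoints            # code points after decoding
```

Note that this only handles four-digit escapes. Characters outside the Basic Multilingual Plane, which some encoders write as two \uXXXX surrogate escapes, would need extra handling.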
