html_meta_tags v1.4.1 does not truncate UTF-8 string well

One of Chinese language photo gallery using UTF-8 as encoding.

When I check the following image page by W3C validator

http://www.hkgimages.com/pic/people/theresa-fu-wing/hkgimages-20110705-131104.jpg.htm

http://validator.w3.org/check?uri=http://www.hkgimages.com/pic/people/theresa-fu-wing/hkgimages-20110705-131104.jpg.htm&charset=(detect+automatically)&doctype=Inline&group=0&user-agent=W3C_Validator/1.2

It return following error:

Sorry, I am unable to validate this document because on line 21 it contained one or more bytes that I cannot interpret as utf-8 (in other words, the bytes found are not valid values in the specified Character Encoding). Please check both the content of the file and the character encoding indication.

The error was: utf8 "\xE6" does not map to Unicode

http://img828.imageshack.us/img828/3200/descriptiont.jpg

It check line 21, which is meta description line and find the system seems does not truncate the last chinese character well.

Question : Does the plugin support UTF-8? which use PHP Multi-byte function, such as mb_strimwidth?

Comments

  • acrylian Administrator, Developer
    No, it uses currently the plain substr() function. It should actually use our own truncate function that probable is utf8 save. Please open a ticket as I have no time for this right now.
Sign In or Register to comment.