Defect #145
SimplePie does not output correctly forumed UTF-8
| Status: | Unconfirmed | Start: | 2009-09-01 | |
| Priority: | High | Due date: | ||
| Assigned to: | - | % Done: | 0% |
|
| Category: | Compatibility Test | |||
| Target version: | - | |||
| Affected Version: | 1.2 |
PHP Version: | 5.2.6 |
|
| mbstring enabled: | No |
iconv enabled: | No |
|
| cURL enabled: | No |
zlib enabled: | No |
|
Description
Certain invalid byte sequences are given as output even if UTF-8 is explicitly set e.g.:
0xe28022
0xa0
0xa3
0x88
Examples of these can be found by downloading these feeds with SimplePie:
http://feeds.feedburner.com/BronteCapital?format=xml
http://www.thisismoney.co.uk/xml/page-rss.xml?in_page_id=53946
http://cynicuseconomicus.blogspot.com/feeds/posts/default
When you then attempt to insert these into a Postgres database, an error is given due to the malformed UTF-8:
Warning: pg_query(): Query failed: ERROR: invalid byte sequence for encoding "UTF8": 0xe28022
HINT: This error can also happen if the byte sequence does not match the encoding expected by the server, which is controlled by "client_encoding".