Defect #145

SimplePie does not output correctly forumed UTF-8

Added by Anonymous 330 days ago.

Status:Unconfirmed Start:2009-09-01
Priority:High Due date:
Assigned to:- % Done:

0%

Category:Compatibility Test
Target version:-
Affected Version:

1.2

PHP Version:

5.2.6

mbstring enabled:

No

iconv enabled:

No

cURL enabled:

No

zlib enabled:

No


Description

Certain invalid byte sequences are given as output even if UTF-8 is explicitly set e.g.:

0xe28022
0xa0
0xa3
0x88

Examples of these can be found by downloading these feeds with SimplePie:

http://feeds.feedburner.com/BronteCapital?format=xml
http://www.thisismoney.co.uk/xml/page-rss.xml?in_page_id=53946
http://cynicuseconomicus.blogspot.com/feeds/posts/default

When you then attempt to insert these into a Postgres database, an error is given due to the malformed UTF-8:

Warning: pg_query(): Query failed: ERROR: invalid byte sequence for encoding "UTF8": 0xe28022
HINT: This error can also happen if the byte sequence does not match the encoding expected by the server, which is controlled by "client_encoding".

Also available in: Atom PDF