Loading...

XML

Word

Printable

Details

Type: Bug
Resolution: Invalid
Priority: P3: Somewhat important
Fix Version/s: None
Affects Version/s: 4.7.3
Component/s: Core: QString and Unicode
Labels:
None
Environment:
Gentoo Linux x86_64

Description

I have invalid unicode bytesequences in QByteArray

for example:

F0 9D 93 98 27 F0 9D 93 B6 20

which looks like 2 utf8 characters, but they aren't.

notice 27 and 20 bytes. they are invalid according to utf-8 spec.
then when I try to QString::fromUtf8(barray.constData(), barray.size()).toUtf8()
I get the same invalid sequences while according to documentation they should be somehow replaced.

However, invalid sequences are possible with UTF-8 and, if any such are found, they will be replaced with one or more "replacement characters", or suppressed. These include non-Unicode sequences, non-characters, overlong sequences or surrogate codepoints encoded into UTF-8.

Attachments

Gerrit Reviews

- Issue Only
- Show All Reviews
- Show Open Reviews
- Show All Issues
- Show Open Issues

No reviews matched the request. Check your Options in the drop-down menu of this sections header.

Activity

People

Assignee:: Thiago Macieira

Reporter:: Rion

Votes:: 0 Vote for this issue

Watchers:: 4 Start watching this issue

Dates

Created:: 13 Jul '11 17:49

Updated:: 30 Jul '13 01:15

Resolved:: 30 Jul '13 01:15

Gerrit Reviews

There are no open Gerrit changes