[jira] [Created] (IGNITE-3098) UTF-16 surrogate pairs are not properly serialized by BinaryMarshaller

classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view
|

[jira] [Created] (IGNITE-3098) UTF-16 surrogate pairs are not properly serialized by BinaryMarshaller

Anton Vinogradov (Jira)
Denis Magda created IGNITE-3098:
-----------------------------------

             Summary: UTF-16 surrogate pairs are not properly serialized by BinaryMarshaller
                 Key: IGNITE-3098
                 URL: https://issues.apache.org/jira/browse/IGNITE-3098
             Project: Ignite
          Issue Type: Bug
            Reporter: Denis Magda


BinaryMarshaller is unable to properly serialize surrogate symbols (read here [1] paragraph "invalid code points") because it relies on String.getBytes("UTF-8") method [2].

However Optimized and JDK marshalers can properly handle this symbols since they rely on {{ObjectOutputStream.writeUTF()}} method.

[1] https://en.wikipedia.org/wiki/UTF-8
[2] https://community.oracle.com/thread/1164397?start=0&tstart=0




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)