Mailing List Archive

[issue14923] Even faster UTF-8 decoding
Serhiy Storchaka <storchaka@gmail.com> added the comment:

Here is a patch that uses some sort of autodetection.

----------
Added file: http://bugs.python.org/file26098/decode_utf8_signed_byte-2.patch

_______________________________________
Python tracker <report@bugs.python.org>
<http://bugs.python.org/issue14923>
_______________________________________
_______________________________________________
Python-bugs-list mailing list
Unsubscribe: http://mail.python.org/mailman/options/python-bugs-list/list-python-bugs%40lists.gossamer-threads.com
[issue14923] Even faster UTF-8 decoding
Serhiy Storchaka <storchaka@gmail.com> added the comment:

Any chance to commit the patch before final feature freeze?

----------

[issue14923] Even faster UTF-8 decoding
Antoine Pitrou <pitrou@free.fr> added the comment:

> Any chance to commit the patch before final feature freeze?

I'll defer to Mark :-)

----------

[issue14923] Even faster UTF-8 decoding
Mark Dickinson <dickinsm@gmail.com> added the comment:

Okay, will look at this this afternoon.

----------
assignee: -> mark.dickinson

[issue14923] Even faster UTF-8 decoding
Mark Dickinson <dickinsm@gmail.com> added the comment:

I'm happy to apply the 'decode_utf8_range_check.patch'; I'll do that unless there are objections. The code is clearer than the original, and if we get a speedup into the bargain then I don't see a reason not to apply this.

I'm less comfortable with either the original patch, or the most recent one (decode_utf8_signed_byte-2.patch).

----------

[issue14923] Even faster UTF-8 decoding
Ezio Melotti <ezio.melotti@gmail.com> added the comment:

Serhiy, does this patch also fix #8271?
If so, can you also include the tests I wrote for it?

----------

[issue14923] Even faster UTF-8 decoding
Roundup Robot <devnull@psf.upfronthosting.co.za> added the comment:

New changeset 3214c9ebcf5e by Mark Dickinson in branch 'default':
Issue #14923: Optimize continuation-byte check in UTF-8 decoding. Patch by Serhiy Storchaka.
http://hg.python.org/cpython/rev/3214c9ebcf5e

----------

[issue14923] Even faster UTF-8 decoding
Mark Dickinson <dickinsm@gmail.com> added the comment:

Patch applied. Closing.

Ezio: the patch is pure optimization, with no change in semantics; I don't see how it could fix #8271.
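
The "no change in semantics" point can be checked exhaustively. A minimal sketch, assuming the committed optimization swaps a bit-mask test on continuation bytes for an equivalent range test; the function names below are hypothetical and not taken from the patch:

```python
def is_continuation_mask(byte: int) -> bool:
    """Mask test: the top two bits of the byte are 10."""
    return (byte & 0xC0) == 0x80


def is_continuation_range(byte: int) -> bool:
    """Range test: the byte lies in 0x80..0xBF inclusive."""
    return 0x80 <= byte < 0xC0


# Compare both tests over every possible byte value.
mismatches = [
    b for b in range(256)
    if is_continuation_mask(b) != is_continuation_range(b)
]

# Number of byte values that either test accepts as a continuation byte.
continuation_count = sum(is_continuation_range(b) for b in range(256))
```

If the two tests agree on all 256 byte values, replacing one with the other cannot change which inputs decode successfully, so it could not by itself fix a behavioral bug such as #8271.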

----------
resolution: -> fixed
status: open -> closed

[issue14923] Even faster UTF-8 decoding
Serhiy Storchaka <storchaka@gmail.com> added the comment:

> Serhiy, does this patch also fix #8271?

No, this patch does not change behavior. But the updated patch for issue 8271
now contains this patch (I hope this will help it get merged).

> If so, can you also include the tests I wrote for it?

Your tests are included in the patch for issue 8271.

----------
