summary |
shortlog | log |
commit |
commitdiff |
tree
first ⋅ prev ⋅ next
Joe Wreschnig [Mon, 22 Feb 2010 06:46:56 +0000 (22:46 -0800)]
Test quotes.
Joe Wreschnig [Mon, 22 Feb 2010 05:57:32 +0000 (21:57 -0800)]
Simplify algorithm. No changes to sort behavior or runtime.
Joe Wreschnig [Mon, 22 Feb 2010 04:28:46 +0000 (20:28 -0800)]
Different algorithm, gives better results for numbers following grouping numbres but worse results for numbers following other punctuation.
Joe Wreschnig [Mon, 22 Feb 2010 00:21:07 +0000 (16:21 -0800)]
sortemes: Simplify some logic.
Joe Wreschnig [Sun, 21 Feb 2010 02:00:33 +0000 (18:00 -0800)]
Test cases.
Joe Wreschnig [Fri, 19 Feb 2010 09:23:57 +0000 (01:23 -0800)]
More tweaks; notably try to insert paragraph breaks rather than a separate Python tuple when re-concatenating strings.
Joe Wreschnig [Thu, 18 Feb 2010 08:10:31 +0000 (00:10 -0800)]
Installation docs.
Joe Wreschnig [Thu, 18 Feb 2010 08:04:17 +0000 (00:04 -0800)]
Minor tweaks for better numeric-following-split-strings.
Joe Wreschnig [Wed, 17 Feb 2010 10:04:15 +0000 (02:04 -0800)]
New approach - find split points based on Unicode categories.
Joe Wreschnig [Tue, 16 Feb 2010 09:28:22 +0000 (01:28 -0800)]
Category-based splitting.
Joe Wreschnig [Tue, 16 Feb 2010 09:28:11 +0000 (01:28 -0800)]
Typo.
Joe Wreschnig [Tue, 16 Feb 2010 08:10:44 +0000 (00:10 -0800)]
Calculate sortemes using simply alnum splitting rather than word breaks. Faster and slightly more accurate for our purposes. Strip punctuation.
Joe Wreschnig [Tue, 16 Feb 2010 04:49:57 +0000 (20:49 -0800)]
'Advanced' sorteme functions.
Joe Wreschnig [Tue, 16 Feb 2010 01:03:52 +0000 (17:03 -0800)]
_strings: Numeric string extraction routines.
Joe Wreschnig [Mon, 15 Feb 2010 09:20:21 +0000 (01:20 -0800)]
Word-splitting.
Joe Wreschnig [Mon, 15 Feb 2010 08:05:24 +0000 (00:05 -0800)]
uca is not going to make the cut.
Joe Wreschnig [Mon, 15 Feb 2010 05:40:07 +0000 (21:40 -0800)]
pycollate: Options to set backend, locale, and encoding.
Joe Wreschnig [Mon, 15 Feb 2010 05:39:44 +0000 (21:39 -0800)]
syslocale: setlocale for LC_COLLATE requires encoding to be considered valid.
Joe Wreschnig [Mon, 15 Feb 2010 05:39:19 +0000 (21:39 -0800)]
uca: Share DUCET data; ensure it exists at import time.
Joe Wreschnig [Mon, 15 Feb 2010 03:04:26 +0000 (19:04 -0800)]
icu: Document and tab-fix.
Joe Wreschnig [Mon, 15 Feb 2010 03:04:01 +0000 (19:04 -0800)]
syslocale: Normalize encoding based on the locale.
Joe Wreschnig [Mon, 15 Feb 2010 01:06:44 +0000 (17:06 -0800)]
syslocale: Standardize API.
Joe Wreschnig [Sat, 13 Feb 2010 06:02:41 +0000 (22:02 -0800)]
Remove NAME, just use __name__.
Joe Wreschnig [Sat, 13 Feb 2010 06:02:30 +0000 (22:02 -0800)]
set_default: Forgot to break.
Joe Wreschnig [Sat, 13 Feb 2010 06:02:16 +0000 (22:02 -0800)]
Print locale, backend.
Joe Wreschnig [Sat, 13 Feb 2010 06:02:05 +0000 (22:02 -0800)]
set_default: Forgot to break.
Joe Wreschnig [Fri, 12 Feb 2010 10:06:56 +0000 (02:06 -0800)]
Raw codepoint test backend. Tweaks to constructor arguments. Try to handle the case when no locale or string codec information is available..
Joe Wreschnig [Thu, 11 Feb 2010 07:26:48 +0000 (23:26 -0800)]
Test script. Add NAME to backends. Some smarter system locale handling.
Joe Wreschnig [Thu, 11 Feb 2010 04:34:41 +0000 (20:34 -0800)]
ICU backend; uses Pyrex, based on zope.ucol.
Joe Wreschnig [Wed, 10 Feb 2010 09:04:18 +0000 (01:04 -0800)]
pycollate: Initial import. Unstable/unfinished uca and syslocale backends. No tests. No documentation. No ICU backend.