search for: codepoints

Displaying 20 results from an estimated 81 matches for "codepoints".

2011 Feb 09
6
[Bug 1856] New: Wrong QoS naming and obsolete defaults
https://bugzilla.mindrot.org/show_bug.cgi?id=1856 Summary: Wrong QoS naming and obsolete defaults Product: Portable OpenSSH Version: 5.8p1 Platform: All OS/Version: All Status: NEW Severity: normal Priority: P2 Component: Miscellaneous AssignedTo: unassigned-bugs at mindrot.org ReportedBy:
2024 Jan 08
1
Possible bug using FLAG_WORD_BREAKS with fullwidth Unicode codepoints
...t; least: Removing the whole block will cause word-breaker to not correctly handle halfwidth Katakana, such as "??????????" which it would treat as a single term, whereas it should be two: ??????and ????). My pull request causes word-breaker to only handle halfwidth Katakana and Hangul codepoints as unbroken script and treats Latin characters, numbers, symbols and punctuation as broken script. There's a couple of unit tests that check for this. diff --git a/xapian-core/queryparser/word-breaker.cc b/xapian-core/queryparser/word-breaker.cc index 8108523ccd53..6122dcdccc97 100644 --- a/xa...
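The distinction drawn in this snippet (halfwidth Katakana and Hangul on one side, fullwidth Latin letters, digits and punctuation on the other) can be sketched in a few lines. This is only an illustration in Python, not Xapian's word-breaker code: the function name `is_unbroken_halfwidth` is hypothetical, and classifying by Unicode character name is an assumption made for brevity.

```python
import unicodedata


def is_unbroken_halfwidth(ch: str) -> bool:
    """Return True for halfwidth Katakana or halfwidth Hangul characters.

    Hypothetical helper: it classifies by Unicode character name, which is
    an approximation and not how Xapian's word-breaker is implemented.
    """
    name = unicodedata.name(ch, "")  # "" for unnamed/unassigned codepoints
    return name.startswith(("HALFWIDTH KATAKANA", "HALFWIDTH HANGUL"))


def in_halfwidth_fullwidth_block(ch: str) -> bool:
    """Return True if ch lies in the Halfwidth and Fullwidth Forms block."""
    return 0xFF00 <= ord(ch) <= 0xFFEF


if __name__ == "__main__":
    # U+FF76 halfwidth Katakana KA, U+FFA1 halfwidth Hangul KIYEOK,
    # U+FF21 fullwidth Latin A, U+FF11 fullwidth digit one.
    for cp in (0xFF76, 0xFFA1, 0xFF21, 0xFF11):
        ch = chr(cp)
        print(f"U+{cp:04X}", unicodedata.name(ch), is_unbroken_halfwidth(ch))
```

All four sample codepoints sit in the same block, so a block-range check alone cannot separate them; that is why the sketch looks at the character name instead.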
2024 Jan 09
1
Possible bug using FLAG_WORD_BREAKS with fullwidth Unicode codepoints
...hole block will cause word-breaker to not correctly > handle halfwidth Katakana, such as "??????????" which it would treat > as a single term, whereas it should be two: ??????and ????). > > My pull request causes word-breaker to only handle halfwidth Katakana > and Hangul codepoints as unbroken script and treats Latin characters, > numbers, symbols and punctuation as broken script. There's a couple of > unit tests that check for this. Thanks, that looks good - now merged. I think we probably should backport this to 1.4 - it's a behaviour change, but limited to...
2013 Mar 13
2
patch - Some CJK codepoints are also punctuation
-- Greg. -------------- next part -------------- A non-text attachment was scrubbed... Name: xapian-some-cjk-codepoints-are-also-punctuation.patch Type: text/x-patch Size: 1499 bytes Desc: not available URL: <http://lists.xapian.org/pipermail/xapian-devel/attachments/20130313/4da8b0f9/attachment.bin>
2018 Jun 29
2
Released Pigeonhole v0.5.2 for Dovecot v2.3.2.
...r a vendor-defined IMAP capability called "FILTER=SIEVE". It adds the ability to manually invoke Sieve filtering in IMAP. More information can be found in doc/plugins/imap_filter_sieve.txt. - The Sieve address test caused an assertion panic for invalid addresses with UTF-8 codepoints in the localpart. Fixed by properly detecting invalid addresses with UTF-8 codepoints in the localpart and skipping these like other invalid addresses while iterating addresses for the address test. - Make the length of the subject header for the vacation response configurable and enfor...
2018 Jun 29
2
Released Pigeonhole v0.5.2 for Dovecot v2.3.2.
...r a vendor-defined IMAP capability called "FILTER=SIEVE". It adds the ability to manually invoke Sieve filtering in IMAP. More information can be found in doc/plugins/imap_filter_sieve.txt. - The Sieve address test caused an assertion panic for invalid addresses with UTF-8 codepoints in the localpart. Fixed by properly detecting invalid addresses with UTF-8 codepoints in the localpart and skipping these like other invalid addresses while iterating addresses for the address test. - Make the length of the subject header for the vacation response configurable and enfor...
2024 Jan 07
1
Possible bug using FLAG_WORD_BREAKS with fullwidth Unicode codepoints
On Thu, Jan 04, 2024 at 05:50:22PM +0100, Robert Stepanek wrote: > Since I am undecided yet if and how to fix this in Xapian I haven't > come up with a pull request. Because trac currently is offline, I > could not file a bug. I hope it's OK to post my analysis here first, > I'll be happy to follow up reporting that bug proper later (should we > conclude that it actually
2011 Dec 18
10
[Bug 1964] New: QoS/DSCP names false translated to ToS hex value
https://bugzilla.mindrot.org/show_bug.cgi?id=1964 Bug #: 1964 Summary: QoS/DSCP names false translated to ToS hex value Classification: Unclassified Product: Portable OpenSSH Version: 5.9p1 Platform: amd64 OS/Version: Linux Status: NEW Severity: normal Priority: P2 Component: ssh
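For context on the title of this report: a DSCP codepoint occupies the upper six bits of the byte that used to be the IPv4 ToS field, so the ToS-style hex value is the DSCP value shifted left by two bits. The sketch below is in Python and is not OpenSSH's code; the table and the helper name `dscp_to_tos` are illustrative assumptions, with the numeric values taken from the standard class selector, assured forwarding and expedited forwarding definitions.

```python
# DSCP name -> 6-bit codepoint. Class selectors are 8 * n.
DSCP = {f"cs{n}": 8 * n for n in range(8)}
DSCP["ef"] = 46  # expedited forwarding

# Assured forwarding: afXY = 8 * class + 2 * drop precedence.
for cls in range(1, 5):
    for drop in range(1, 4):
        DSCP[f"af{cls}{drop}"] = 8 * cls + 2 * drop


def dscp_to_tos(name: str) -> int:
    """Return the ToS/traffic-class byte for a DSCP name (hypothetical helper).

    The two low-order bits (ECN) are left at zero.
    """
    return DSCP[name.lower()] << 2


if __name__ == "__main__":
    for name in ("cs1", "af11", "af43", "ef"):
        print(f"{name}: dscp={DSCP[name]} tos={dscp_to_tos(name):#04x}")
```

Keeping the table in DSCP units and shifting in one place makes it harder to mix up the 6-bit codepoint with the 8-bit field value.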
2024 Jan 10
2
Possible bug using FLAG_WORD_BREAKS with fullwidth Unicode codepoints
On Tue, Jan 9, 2024, at 3:28 AM, Olly Betts wrote: > Thanks, that looks good - now merged. Thanks! > Did you already check the other ranges for cased letters? I can but if > you have already there's not much point. I did not. If you find time, that'd be great. Otherwise I can make room for it in the next days. > > The fullwidth "????? ??????" tests suggests to
2018 Jul 01
1
Released Pigeonhole v0.5.2 for Dovecot v2.3.2.
...lled >> "FILTER=SIEVE". It adds the ability to manually invoke Sieve filtering >> in IMAP. More information can be found in >> doc/plugins/imap_filter_sieve.txt. >> - The Sieve address test caused an assertion panic for invalid addresses >> with UTF-8 codepoints in the localpart. Fixed by properly detecting >> invalid addresses with UTF-8 codepoints in the localpart and skipping >> these like other invalid addresses while iterating addresses for the >> address test. >> - Make the length of the subject header for the vacation r...
2020 Mar 30
2
Plotmath on Fedora 31 broken with pango >= 1.44 - workarounds?
On Monday, 30 March 2020 at 15:24 +1300, Paul Murrell wrote: > Hi > > I have created an R branch that contains a potential fix ... > > https://svn.r-project.org/R/branches/R-symfam/ > > This allows, for example, ... > > cairo_pdf(symbolfamily="OpenSymbol") > > ... to specify that the OpenSymbol family should be used as the > "symbol" font
2024 Jan 04
1
Possible bug using FLAG_WORD_BREAKS with fullwidth Unicode codepoints
I think I found a bug in Xapian 1.5 when using FLAG_WORD_BREAKS for input that contains characters in Unicode Halfwidth and Fullwidth Forms (https://unicode.org/charts/PDF/UFF00.pdf). Since I am undecided yet if and how to fix this in Xapian I haven't come up with a pull request. Because trac currently is offline, I could not file a bug. I hope it's OK to post my analysis here first,
2016 Sep 19
2
Pull requests: CJK words and Snippet generator
...doesn't currently have a separate stage for > normalisation and for stemming. I'll investigate both options with tests and will merge them into Xapian's unit tests where it makes sense. I won't be able to come up with it until next week, though. > The main issue is that new codepoints get added (and the odd one changes > category) in each new Unicode version, so if you're using different > Unicode versions at index time and at search time, the terms you get > won't match each other. [...] If Xapian's CJK::codepoint_is_cjk() and ICU have different ideas of &...
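The point about codepoints being added between Unicode versions can be observed directly. The sketch below uses Python rather than Xapian or ICU; it compares the Unicode 3.2.0 database that Python ships as `unicodedata.ucd_3_2_0` with the interpreter's current database. The function name `newly_assigned` is hypothetical.

```python
import unicodedata


def newly_assigned(start: int = 0, stop: int = 0x10000) -> list[int]:
    """List codepoints unassigned in Unicode 3.2.0 but assigned today.

    Hypothetical helper: "unassigned" means general category "Cn".
    The default range covers the Basic Multilingual Plane only.
    """
    old = unicodedata.ucd_3_2_0
    return [
        cp
        for cp in range(start, stop)
        if old.category(chr(cp)) == "Cn"
        and unicodedata.category(chr(cp)) != "Cn"
    ]


if __name__ == "__main__":
    added = newly_assigned()
    print("current Unicode version:", unicodedata.unidata_version)
    print("BMP codepoints assigned since 3.2.0:", len(added))
```

Any tokenizer that branches on category would treat these codepoints differently under the two databases, which is the index-time versus search-time mismatch the snippet describes.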
2020 Mar 30
2
Plotmath on Fedora 31 broken with pango >= 1.44 - workarounds?
...though, > for example, ... > > F8EB E6 # LEFT PAREN TOP # parenlefttp (CUS) > F8EC E7 # LEFT PAREN EXTENDER # parenleftex (CUS) > F8ED E8 # LEFT PAREN BOTTOM # parenleftbt (CUS) > > Even OpenSymbol is missing a few (though perhaps not very common > ones) ... All the F8* codepoints are in the private use area. That means you can't rely on them existing in standard Unicode fonts. You need to use correct Unicode values for things to work: Ux239? for parentheses, brackets https://www.unicode.org/charts/PDF/U2300.pdf > F8E6 BD # VERTICAL ARROW EXTENDER # arrowv...
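Whether a codepoint is in a private use area can be checked from its general category, which is "Co" for private use. The sketch below is in Python, not R; the helper name `is_private_use` is hypothetical, and the U+239B to U+239D range is used as the standard counterpart of the three F8EB to F8ED entries quoted above.

```python
import unicodedata


def is_private_use(cp: int) -> bool:
    """Return True if the codepoint has general category Co (private use)."""
    return unicodedata.category(chr(cp)) == "Co"


if __name__ == "__main__":
    # Codepoints quoted in the snippet: no standard meaning, no names.
    for cp in (0xF8EB, 0xF8EC, 0xF8ED):
        print(f"U+{cp:04X} private use: {is_private_use(cp)}")

    # Standard parenthesis pieces in the Miscellaneous Technical block.
    for cp in (0x239B, 0x239C, 0x239D):
        print(f"U+{cp:04X} {unicodedata.name(chr(cp))}")
```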
2018 Jun 24
0
Released Pigeonhole v0.5.2.rc1 for Dovecot v2.3.2.rc1.
...r a vendor-defined IMAP capability called "FILTER=SIEVE". It adds the ability to manually invoke Sieve filtering in IMAP. More information can be found in doc/plugins/imap_filter_sieve.txt. - The Sieve address test caused an assertion panic for invalid addresses with UTF-8 codepoints in the localpart. Fixed by properly detecting invalid addresses with UTF-8 codepoints in the localpart and skipping these like other invalid addresses while iterating addresses for the address test. - Make the length of the subject header for the vacation response configurable and enfor...
2018 Jun 24
0
Released Pigeonhole v0.5.2.rc1 for Dovecot v2.3.2.rc1.
...r a vendor-defined IMAP capability called "FILTER=SIEVE". It adds the ability to manually invoke Sieve filtering in IMAP. More information can be found in doc/plugins/imap_filter_sieve.txt. - The Sieve address test caused an assertion panic for invalid addresses with UTF-8 codepoints in the localpart. Fixed by properly detecting invalid addresses with UTF-8 codepoints in the localpart and skipping these like other invalid addresses while iterating addresses for the address test. - Make the length of the subject header for the vacation response configurable and enfor...
2018 Jun 30
0
Released Pigeonhole v0.5.2 for Dovecot v2.3.2.
...d IMAP capability called > "FILTER=SIEVE". It adds the ability to manually invoke Sieve filtering > in IMAP. More information can be found in > doc/plugins/imap_filter_sieve.txt. > - The Sieve address test caused an assertion panic for invalid addresses > with UTF-8 codepoints in the localpart. Fixed by properly detecting > invalid addresses with UTF-8 codepoints in the localpart and skipping > these like other invalid addresses while iterating addresses for the > address test. > - Make the length of the subject header for the vacation response > c...
2001 Dec 23
0
Need Help Sambaserver is not accessible
Here is the packet trace - ICMP seems to be coming from my SAMBA Server - see frame 3 Does the netstat output look correct in my first request for help? Joel Morrow jiram@aol.com TRACE Frame 1 (92 on wire, 92 captured) Arrival Time: Dec 22, 2001 09:36:08.333676000 Time delta from previous packet: 1.999444000 seconds Time relative to first packet: 2.000088000 seconds Frame
2006 Nov 08
14
Increased memory requirements on 1.2
I just recently upgraded a rails app of mine to run on edge (and the 1-2-pre-release branch) and I noticed my fcgis required roughly 6-8MBs more memory after just a couple requests. For example, each fcgi on edge would start around 40MB and rise to ~46MBs after a couple requests. I downgraded my app back to 1.1.6 and each fcgi would start at around 33MBs and rise to ~38MB. As a result of the
2006 Mar 21
2
How do I get substring of utf-8 string?
I'm trying to get a substring from a UTF-8 encoded string (say, the first 50 characters of the string). String#[0..49] would give me the first 50 bytes, not 50 characters. I know there is the jcode library, but it only lets you count the number of characters in a UTF-8 string. The unicode gem doesn't seem to help much. The unicode_hacks gem seems to solve the problem, but it also seems to
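The byte-versus-character difference in this question can be illustrated as follows. The sketch is in Python rather than Ruby, so it is not an answer in terms of the gems mentioned; the helper name `first_chars` is hypothetical. It decodes the bytes first and then slices, so the cut falls on codepoint boundaries.

```python
def first_chars(data: bytes, n: int) -> str:
    """Return the first n codepoints of UTF-8 encoded data.

    Hypothetical helper: it slices by codepoint, not by grapheme cluster,
    so a base letter and a following combining mark can still be split.
    """
    return data.decode("utf-8")[:n]


if __name__ == "__main__":
    data = "héllo wörld".encode("utf-8")

    # Slicing bytes counts the two-byte "é" twice and yields one
    # character fewer than requested.
    print(data[:5].decode("utf-8"))

    # Slicing after decoding counts characters.
    print(first_chars(data, 5))

    # A byte slice can also end in the middle of a character.
    print(data[:2].decode("utf-8", errors="replace"))
```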