search for: codepoint

Displaying 20 results from an estimated 81 matches for "codepoint".

Did you mean: codepoints
2011 Feb 09
6
[Bug 1856] New: Wrong QoS naming and obsolete defaults
https://bugzilla.mindrot.org/show_bug.cgi?id=1856 Summary: Wrong QoS naming and obsolete defaults Product: Portable OpenSSH Version: 5.8p1 Platform: All OS/Version: All Status: NEW Severity: normal Priority: P2 Component: Miscellaneous AssignedTo: unassigned-bugs at mindrot.org ReportedBy:
2024 Jan 08
1
Possible bug using FLAG_WORD_BREAKS with fullwidth Unicode codepoints
...t; least: Removing the whole block will cause word-breaker to not correctly handle halfwidth Katakana, such as "??????????" which it would treat as a single term, whereas it should be two: ??????and ????). My pull request causes word-breaker to only handle halfwidth Katakana and Hangul codepoints as unbroken script and treats Latin characters, numbers, symbols and punctuation as broken script. There's a couple of unit tests that check for this. diff --git a/xapian-core/queryparser/word-breaker.cc b/xapian-core/queryparser/word-breaker.cc index 8108523ccd53..6122dcdccc97 100644 --- a/x...
2024 Jan 09
1
Possible bug using FLAG_WORD_BREAKS with fullwidth Unicode codepoints
...hole block will cause word-breaker to not correctly > handle halfwidth Katakana, such as "??????????" which it would treat > as a single term, whereas it should be two: ??????and ????). > > My pull request causes word-breaker to only handle halfwidth Katakana > and Hangul codepoints as unbroken script and treats Latin characters, > numbers, symbols and punctuation as broken script. There's a couple of > unit tests that check for this. Thanks, that looks good - now merged. I think we probably should backport this to 1.4 - it's a behaviour change, but limited to...
2013 Mar 13
2
patch - Some CJK codepoints are also punctuation
-- Greg. -------------- next part -------------- A non-text attachment was scrubbed... Name: xapian-some-cjk-codepoints-are-also-punctuation.patch Type: text/x-patch Size: 1499 bytes Desc: not available URL: <http://lists.xapian.org/pipermail/xapian-devel/attachments/20130313/4da8b0f9/attachment.bin>
2018 Jun 29
2
Released Pigeonhole v0.5.2 for Dovecot v2.3.2.
...r the a vendor-defined IMAP capability called "FILTER=SIEVE". It adds the ability to manually invoke Sieve filtering in IMAP. More information can be found in doc/plugins/imap_filter_sieve.txt. - The Sieve address test caused an assertion panic for invalid addresses with UTF-8 codepoints in the localpart. Fixed by properly detecting invalid addresses with UTF-8 codepoints in the localpart and skipping these like other invalid addresses while iterating addresses for the address test. - Make the length of the subject header for the vacation response configurable and enfo...
2018 Jun 29
2
Released Pigeonhole v0.5.2 for Dovecot v2.3.2.
...r the a vendor-defined IMAP capability called "FILTER=SIEVE". It adds the ability to manually invoke Sieve filtering in IMAP. More information can be found in doc/plugins/imap_filter_sieve.txt. - The Sieve address test caused an assertion panic for invalid addresses with UTF-8 codepoints in the localpart. Fixed by properly detecting invalid addresses with UTF-8 codepoints in the localpart and skipping these like other invalid addresses while iterating addresses for the address test. - Make the length of the subject header for the vacation response configurable and enfo...
2024 Jan 07
1
Possible bug using FLAG_WORD_BREAKS with fullwidth Unicode codepoints
On Thu, Jan 04, 2024 at 05:50:22PM +0100, Robert Stepanek wrote: > Since I am undecided yet if and how to fix this in Xapian I haven't > come up with a pull request. Because trac currently is offline, I > could not file a bug. I hope it's OK to post my analysis here first, > I'll be happy to follow up reporting that bug proper later (should we > conclude that it actually
2011 Dec 18
10
[Bug 1964] New: QoS/DSCP names false translated to ToS hex value
https://bugzilla.mindrot.org/show_bug.cgi?id=1964 Bug #: 1964 Summary: QoS/DSCP names false translated to ToS hex value Classification: Unclassified Product: Portable OpenSSH Version: 5.9p1 Platform: amd64 OS/Version: Linux Status: NEW Severity: normal Priority: P2 Component: ssh
2024 Jan 10
2
Possible bug using FLAG_WORD_BREAKS with fullwidth Unicode codepoints
On Tue, Jan 9, 2024, at 3:28 AM, Olly Betts wrote: > Thanks, that looks good - now merged. Thanks! > Did you already check the other ranges for cased letters? I can but if > you have already there's not much point. I did not. If you find time, that'd be great. Otherwise I can make room for it in the next days. > > The fullwidth "????? ??????" tests suggests to
2018 Jul 01
1
Released Pigeonhole v0.5.2 for Dovecot v2.3.2.
...lled >> ? "FILTER=SIEVE". It adds the ability to manually invoke Sieve filtering >> ? in IMAP. More information can be found in >> ? doc/plugins/imap_filter_sieve.txt. >> - The Sieve address test caused an assertion panic for invalid addresses >> ? with UTF-8 codepoints in the localpart. Fixed by properly detecting >> ? invalid addresses with UTF-8 codepoints in the localpart and skipping >> ? these like other invalid addresses while iterating addresses for the >> ? address test. >> - Make the length of the subject header for the vacation...
2020 Mar 30
2
Plotmath on Fedora 31 broken with with pango >= 1.44 - workarounds?
...OpenSymbol family should be used as the > "symbol" font (e.g., for "plotmath") in R. Thanks for looking at it! But, really, there is no such thing as a Symbol font on Linux anymore. Symbol is pre-unicode thinking. Most modern general-purpose unicode fonts will include every codepoint Symbol ever shipped, and fontconfig will fallback gracefully when that?s not the case (unless your fontconfig integration is broken). Just use the sans-serif or monospace fontconfig defaults. You don?t need Symbol, or OpenSymbol, or any special font setup. Symbol?s codepoint coverage is laughable...
2024 Jan 04
1
Possible bug using FLAG_WORD_BREAKS with fullwidth Unicode codepoints
I think I found a bug in Xapian 1.5 when using FLAG_WORD_BREAKS for input that contains characters in Unicode Halfwidth and Fullwidth Forms (https://unicode.org/charts/PDF/UFF00.pdf). Since I am undecided yet if and how to fix this in Xapian I haven't come up with a pull request. Because trac currently is offline, I could not file a bug. I hope it's OK to post my analysis here first,
2016 Sep 19
2
Pull requests: CJK words and Snippet generator
...doesn't currently have a separate stage for > normalisation and for stemming. I'll investigate both options with tests and will merge them into Xapian's unit tests where it makes sense. I won't be able to come up with it until next week, though. > The main issue is that new codepoints get added (and the odd one changes > category) in each new Unicode version, so if you're using different > Unicode versions at index time and at search time, the terms you get > won't match each other. [...] If Xapian's CJK::codepoint_is_cjk() and ICU have different ideas of...
2020 Mar 30
2
Plotmath on Fedora 31 broken with with pango >= 1.44 - workarounds?
...e.g., for "plotmath") in R. > > > > Thanks for looking at it! > > > > But, really, there is no such thing as a Symbol font on Linux > > anymore. > > Symbol is pre-unicode thinking. Most modern general-purpose unicode > > fonts will include every codepoint Symbol ever shipped, and > > fontconfig > > will fallback gracefully when that?s not the case (unless your > > fontconfig integration is broken). > > Yep, the "symbol" font is an (outdated) R "plotmath" concept, but > one > that would take a fair...
2018 Jun 24
0
Released Pigeonhole v0.5.2.rc1 for Dovecot v2.3.2.rc1.
...r the a vendor-defined IMAP capability called "FILTER=SIEVE". It adds the ability to manually invoke Sieve filtering in IMAP. More information can be found in doc/plugins/imap_filter_sieve.txt. - The Sieve address test caused an assertion panic for invalid addresses with UTF-8 codepoints in the localpart. Fixed by properly detecting invalid addresses with UTF-8 codepoints in the localpart and skipping these like other invalid addresses while iterating addresses for the address test. - Make the length of the subject header for the vacation response configurable and enfo...
2018 Jun 24
0
Released Pigeonhole v0.5.2.rc1 for Dovecot v2.3.2.rc1.
...r the a vendor-defined IMAP capability called "FILTER=SIEVE". It adds the ability to manually invoke Sieve filtering in IMAP. More information can be found in doc/plugins/imap_filter_sieve.txt. - The Sieve address test caused an assertion panic for invalid addresses with UTF-8 codepoints in the localpart. Fixed by properly detecting invalid addresses with UTF-8 codepoints in the localpart and skipping these like other invalid addresses while iterating addresses for the address test. - Make the length of the subject header for the vacation response configurable and enfo...
2018 Jun 30
0
Released Pigeonhole v0.5.2 for Dovecot v2.3.2.
...d IMAP capability called > "FILTER=SIEVE". It adds the ability to manually invoke Sieve filtering > in IMAP. More information can be found in > doc/plugins/imap_filter_sieve.txt. > - The Sieve address test caused an assertion panic for invalid addresses > with UTF-8 codepoints in the localpart. Fixed by properly detecting > invalid addresses with UTF-8 codepoints in the localpart and skipping > these like other invalid addresses while iterating addresses for the > address test. > - Make the length of the subject header for the vacation response >...
2001 Dec 23
0
Need Help Sambaserver is not accessible
...Telebit_75:3c:2f) Type: IP (0x0800) Internet Protocol, Src Addr: 192.168.1.102 (192.168.1.102), Dst Addr: 192.168.1.255 (192.168.1.255) Version: 4 Header length: 20 bytes Differentiated Services Field: 0x00 (DSCP 0x00: Default; ECN: 0x00) 0000 00.. = Differentiated Services Codepoint: Default (0x00) .... ..0. = ECN-Capable Transport (ECT): 0 .... ...0 = ECN-CE: 0 Total Length: 78 Identification: 0x0000 Flags: 0x04 .1.. = Don't fragment: Set ..0. = More fragments: Not set Fragment offset: 0 Time to live: 64 Protocol: UD...
2006 Nov 08
14
Increased memory requirements on 1.2
I just recently upgraded a rails app of mine to run on edge (and the 1-2-pre-release branch) and I noticed my fcgis required roughly 6-8MBs more memory after just a couple requests. For example, each fcgi on edge would start around 40MB and rise to ~46MBs after a couple requests. I downgraded my app back to 1.1.6 and each fcgi would start at around 33MBs and rise to ~38MB. As a result of the
2006 Mar 21
2
How do I get substring of utf-8 string?
I''m trying to get substring from a utf-8 encoded string. (say, first 50 characters of the string) String#[0..49] would give me the first 50 bytes not 50 characters.. I know there is jcode library, but it only let you count number of characters in utf-8 string. unicode gem doesn''t seem to help much. unicode_hacks gem seem to solve the problem, but it also seems to