Displaying 20 results from an estimated 81 matches for "codepoint".
Did you mean:
codepoints
2011 Feb 09
6
[Bug 1856] New: Wrong QoS naming and obsolete defaults
https://bugzilla.mindrot.org/show_bug.cgi?id=1856
Summary: Wrong QoS naming and obsolete defaults
Product: Portable OpenSSH
Version: 5.8p1
Platform: All
OS/Version: All
Status: NEW
Severity: normal
Priority: P2
Component: Miscellaneous
AssignedTo: unassigned-bugs at mindrot.org
ReportedBy:
2024 Jan 08
1
Possible bug using FLAG_WORD_BREAKS with fullwidth Unicode codepoints
...t; least:
Removing the whole block will cause word-breaker to not correctly handle halfwidth Katakana, such as "??????????" which it would treat as a single term, whereas it should be two: ??????and ????).
My pull request causes word-breaker to only handle halfwidth Katakana and Hangul codepoints as unbroken script and treats Latin characters, numbers, symbols and punctuation as broken script. There's a couple of unit tests that check for this.
diff --git a/xapian-core/queryparser/word-breaker.cc b/xapian-core/queryparser/word-breaker.cc
index 8108523ccd53..6122dcdccc97 100644
--- a/x...
2024 Jan 09
1
Possible bug using FLAG_WORD_BREAKS with fullwidth Unicode codepoints
...hole block will cause word-breaker to not correctly
> handle halfwidth Katakana, such as "??????????" which it would treat
> as a single term, whereas it should be two: ??????and ????).
>
> My pull request causes word-breaker to only handle halfwidth Katakana
> and Hangul codepoints as unbroken script and treats Latin characters,
> numbers, symbols and punctuation as broken script. There's a couple of
> unit tests that check for this.
Thanks, that looks good - now merged.
I think we probably should backport this to 1.4 - it's a behaviour
change, but limited to...
2013 Mar 13
2
patch - Some CJK codepoints are also punctuation
--
Greg.
-------------- next part --------------
A non-text attachment was scrubbed...
Name: xapian-some-cjk-codepoints-are-also-punctuation.patch
Type: text/x-patch
Size: 1499 bytes
Desc: not available
URL: <http://lists.xapian.org/pipermail/xapian-devel/attachments/20130313/4da8b0f9/attachment.bin>
2018 Jun 29
2
Released Pigeonhole v0.5.2 for Dovecot v2.3.2.
...r the a vendor-defined IMAP capability called
"FILTER=SIEVE". It adds the ability to manually invoke Sieve filtering
in IMAP. More information can be found in
doc/plugins/imap_filter_sieve.txt.
- The Sieve address test caused an assertion panic for invalid addresses
with UTF-8 codepoints in the localpart. Fixed by properly detecting
invalid addresses with UTF-8 codepoints in the localpart and skipping
these like other invalid addresses while iterating addresses for the
address test.
- Make the length of the subject header for the vacation response
configurable and enfo...
2018 Jun 29
2
Released Pigeonhole v0.5.2 for Dovecot v2.3.2.
...r the a vendor-defined IMAP capability called
"FILTER=SIEVE". It adds the ability to manually invoke Sieve filtering
in IMAP. More information can be found in
doc/plugins/imap_filter_sieve.txt.
- The Sieve address test caused an assertion panic for invalid addresses
with UTF-8 codepoints in the localpart. Fixed by properly detecting
invalid addresses with UTF-8 codepoints in the localpart and skipping
these like other invalid addresses while iterating addresses for the
address test.
- Make the length of the subject header for the vacation response
configurable and enfo...
2024 Jan 07
1
Possible bug using FLAG_WORD_BREAKS with fullwidth Unicode codepoints
On Thu, Jan 04, 2024 at 05:50:22PM +0100, Robert Stepanek wrote:
> Since I am undecided yet if and how to fix this in Xapian I haven't
> come up with a pull request. Because trac currently is offline, I
> could not file a bug. I hope it's OK to post my analysis here first,
> I'll be happy to follow up reporting that bug proper later (should we
> conclude that it actually
2011 Dec 18
10
[Bug 1964] New: QoS/DSCP names false translated to ToS hex value
https://bugzilla.mindrot.org/show_bug.cgi?id=1964
Bug #: 1964
Summary: QoS/DSCP names false translated to ToS hex value
Classification: Unclassified
Product: Portable OpenSSH
Version: 5.9p1
Platform: amd64
OS/Version: Linux
Status: NEW
Severity: normal
Priority: P2
Component: ssh
2024 Jan 10
2
Possible bug using FLAG_WORD_BREAKS with fullwidth Unicode codepoints
On Tue, Jan 9, 2024, at 3:28 AM, Olly Betts wrote:
> Thanks, that looks good - now merged.
Thanks!
> Did you already check the other ranges for cased letters? I can but if
> you have already there's not much point.
I did not. If you find time, that'd be great. Otherwise I can make room for it in the next days.
> > The fullwidth "????? ??????" tests suggests to
2018 Jul 01
1
Released Pigeonhole v0.5.2 for Dovecot v2.3.2.
...lled
>> ? "FILTER=SIEVE". It adds the ability to manually invoke Sieve filtering
>> ? in IMAP. More information can be found in
>> ? doc/plugins/imap_filter_sieve.txt.
>> - The Sieve address test caused an assertion panic for invalid addresses
>> ? with UTF-8 codepoints in the localpart. Fixed by properly detecting
>> ? invalid addresses with UTF-8 codepoints in the localpart and skipping
>> ? these like other invalid addresses while iterating addresses for the
>> ? address test.
>> - Make the length of the subject header for the vacation...
2020 Mar 30
2
Plotmath on Fedora 31 broken with with pango >= 1.44 - workarounds?
...OpenSymbol family should be used as the
> "symbol" font (e.g., for "plotmath") in R.
Thanks for looking at it!
But, really, there is no such thing as a Symbol font on Linux anymore.
Symbol is pre-unicode thinking. Most modern general-purpose unicode
fonts will include every codepoint Symbol ever shipped, and fontconfig
will fallback gracefully when that?s not the case (unless your
fontconfig integration is broken).
Just use the sans-serif or monospace fontconfig defaults. You don?t
need Symbol, or OpenSymbol, or any special font setup.
Symbol?s codepoint coverage is laughable...
2024 Jan 04
1
Possible bug using FLAG_WORD_BREAKS with fullwidth Unicode codepoints
I think I found a bug in Xapian 1.5 when using FLAG_WORD_BREAKS for input that contains characters in Unicode Halfwidth and Fullwidth Forms (https://unicode.org/charts/PDF/UFF00.pdf).
Since I am undecided yet if and how to fix this in Xapian I haven't come up with a pull request. Because trac currently is offline, I could not file a bug. I hope it's OK to post my analysis here first,
2016 Sep 19
2
Pull requests: CJK words and Snippet generator
...doesn't currently have a separate stage for
> normalisation and for stemming.
I'll investigate both options with tests and will merge them into
Xapian's unit tests where it makes sense. I won't be able to come up
with it until next week, though.
> The main issue is that new codepoints get added (and the odd one changes
> category) in each new Unicode version, so if you're using different
> Unicode versions at index time and at search time, the terms you get
> won't match each other. [...] If Xapian's CJK::codepoint_is_cjk() and ICU have different ideas of...
2020 Mar 30
2
Plotmath on Fedora 31 broken with with pango >= 1.44 - workarounds?
...e.g., for "plotmath") in R.
> >
> > Thanks for looking at it!
> >
> > But, really, there is no such thing as a Symbol font on Linux
> > anymore.
> > Symbol is pre-unicode thinking. Most modern general-purpose unicode
> > fonts will include every codepoint Symbol ever shipped, and
> > fontconfig
> > will fallback gracefully when that?s not the case (unless your
> > fontconfig integration is broken).
>
> Yep, the "symbol" font is an (outdated) R "plotmath" concept, but
> one
> that would take a fair...
2018 Jun 24
0
Released Pigeonhole v0.5.2.rc1 for Dovecot v2.3.2.rc1.
...r the a vendor-defined IMAP capability called
"FILTER=SIEVE". It adds the ability to manually invoke Sieve filtering
in IMAP. More information can be found in
doc/plugins/imap_filter_sieve.txt.
- The Sieve address test caused an assertion panic for invalid addresses
with UTF-8 codepoints in the localpart. Fixed by properly detecting
invalid addresses with UTF-8 codepoints in the localpart and skipping
these like other invalid addresses while iterating addresses for the
address test.
- Make the length of the subject header for the vacation response
configurable and enfo...
2018 Jun 24
0
Released Pigeonhole v0.5.2.rc1 for Dovecot v2.3.2.rc1.
...r the a vendor-defined IMAP capability called
"FILTER=SIEVE". It adds the ability to manually invoke Sieve filtering
in IMAP. More information can be found in
doc/plugins/imap_filter_sieve.txt.
- The Sieve address test caused an assertion panic for invalid addresses
with UTF-8 codepoints in the localpart. Fixed by properly detecting
invalid addresses with UTF-8 codepoints in the localpart and skipping
these like other invalid addresses while iterating addresses for the
address test.
- Make the length of the subject header for the vacation response
configurable and enfo...
2018 Jun 30
0
Released Pigeonhole v0.5.2 for Dovecot v2.3.2.
...d IMAP capability called
> "FILTER=SIEVE". It adds the ability to manually invoke Sieve filtering
> in IMAP. More information can be found in
> doc/plugins/imap_filter_sieve.txt.
> - The Sieve address test caused an assertion panic for invalid addresses
> with UTF-8 codepoints in the localpart. Fixed by properly detecting
> invalid addresses with UTF-8 codepoints in the localpart and skipping
> these like other invalid addresses while iterating addresses for the
> address test.
> - Make the length of the subject header for the vacation response
>...
2001 Dec 23
0
Need Help Sambaserver is not accessible
...Telebit_75:3c:2f)
Type: IP (0x0800)
Internet Protocol, Src Addr: 192.168.1.102 (192.168.1.102), Dst Addr:
192.168.1.255 (192.168.1.255)
Version: 4
Header length: 20 bytes
Differentiated Services Field: 0x00 (DSCP 0x00: Default; ECN: 0x00)
0000 00.. = Differentiated Services Codepoint: Default (0x00)
.... ..0. = ECN-Capable Transport (ECT): 0
.... ...0 = ECN-CE: 0
Total Length: 78
Identification: 0x0000
Flags: 0x04
.1.. = Don't fragment: Set
..0. = More fragments: Not set
Fragment offset: 0
Time to live: 64
Protocol: UD...
2006 Nov 08
14
Increased memory requirements on 1.2
I just recently upgraded a rails app of mine to run on edge (and the
1-2-pre-release branch) and I noticed my fcgis required roughly 6-8MBs
more memory after just a couple requests.
For example, each fcgi on edge would start around 40MB and rise to
~46MBs after a couple requests.
I downgraded my app back to 1.1.6 and each fcgi would start at around
33MBs and rise to ~38MB.
As a result of the
2006 Mar 21
2
How do I get substring of utf-8 string?
I''m trying to get substring from a utf-8 encoded string. (say, first
50 characters of the string) String#[0..49] would give me the first
50 bytes not 50 characters..
I know there is jcode library, but it only let you count number of
characters in utf-8 string.
unicode gem doesn''t seem to help much. unicode_hacks gem seem to
solve the problem, but it also seems to