Displaying 20 results from an estimated 81 matches for "codepoints".
2011 Feb 09
6
[Bug 1856] New: Wrong QoS naming and obsolete defaults
https://bugzilla.mindrot.org/show_bug.cgi?id=1856
Summary: Wrong QoS naming and obsolete defaults
Product: Portable OpenSSH
Version: 5.8p1
Platform: All
OS/Version: All
Status: NEW
Severity: normal
Priority: P2
Component: Miscellaneous
AssignedTo: unassigned-bugs at mindrot.org
ReportedBy:
2024 Jan 08
1
Possible bug using FLAG_WORD_BREAKS with fullwidth Unicode codepoints
...> least:
Removing the whole block will cause word-breaker to not correctly handle halfwidth Katakana, such as "??????????" which it would treat as a single term, whereas it should be two: ??????and ????).
My pull request causes word-breaker to only handle halfwidth Katakana and Hangul codepoints as unbroken script and treats Latin characters, numbers, symbols and punctuation as broken script. There's a couple of unit tests that check for this.
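The distinction drawn above can be sketched in Python. This is only an illustration, not Xapian's word-breaker code: the helper name `is_unbroken_halfwidth` is hypothetical, and matching on Unicode character names is an assumption made here for brevity rather than the range checks a C++ implementation would likely use.

```python
import unicodedata


def is_unbroken_halfwidth(ch: str) -> bool:
    """Return True for halfwidth Katakana and halfwidth Hangul characters.

    Hypothetical helper: it relies on the official Unicode character names,
    which begin with "HALFWIDTH KATAKANA" or "HALFWIDTH HANGUL" for these
    characters. Unassigned codepoints have no name and yield False.
    """
    name = unicodedata.name(ch, "")
    return name.startswith(("HALFWIDTH KATAKANA", "HALFWIDTH HANGUL"))


if __name__ == "__main__":
    # U+FF76 halfwidth Katakana KA, U+FFA1 halfwidth Hangul KIYEOK,
    # U+FF21 fullwidth Latin A, U+FF11 fullwidth digit one.
    for ch in ("\uFF76", "\uFFA1", "\uFF21", "\uFF11"):
        print(f"U+{ord(ch):04X}", unicodedata.name(ch), is_unbroken_halfwidth(ch))
```

Under this sketch, fullwidth Latin letters and digits fall outside the "unbroken script" set, which matches the behaviour described for the pull request.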
diff --git a/xapian-core/queryparser/word-breaker.cc b/xapian-core/queryparser/word-breaker.cc
index 8108523ccd53..6122dcdccc97 100644
--- a/xa...
2024 Jan 09
1
Possible bug using FLAG_WORD_BREAKS with fullwidth Unicode codepoints
...hole block will cause word-breaker to not correctly
> handle halfwidth Katakana, such as "??????????" which it would treat
> as a single term, whereas it should be two: ??????and ????).
>
> My pull request causes word-breaker to only handle halfwidth Katakana
> and Hangul codepoints as unbroken script and treats Latin characters,
> numbers, symbols and punctuation as broken script. There's a couple of
> unit tests that check for this.
Thanks, that looks good - now merged.
I think we probably should backport this to 1.4 - it's a behaviour
change, but limited to...
2013 Mar 13
2
patch - Some CJK codepoints are also punctuation
--
Greg.
-------------- next part --------------
A non-text attachment was scrubbed...
Name: xapian-some-cjk-codepoints-are-also-punctuation.patch
Type: text/x-patch
Size: 1499 bytes
Desc: not available
URL: <http://lists.xapian.org/pipermail/xapian-devel/attachments/20130313/4da8b0f9/attachment.bin>
2018 Jun 29
2
Released Pigeonhole v0.5.2 for Dovecot v2.3.2.
...r the a vendor-defined IMAP capability called
"FILTER=SIEVE". It adds the ability to manually invoke Sieve filtering
in IMAP. More information can be found in
doc/plugins/imap_filter_sieve.txt.
- The Sieve address test caused an assertion panic for invalid addresses
with UTF-8 codepoints in the localpart. Fixed by properly detecting
invalid addresses with UTF-8 codepoints in the localpart and skipping
these like other invalid addresses while iterating addresses for the
address test.
- Make the length of the subject header for the vacation response
configurable and enfor...
2018 Jun 29
2
Released Pigeonhole v0.5.2 for Dovecot v2.3.2.
...r the a vendor-defined IMAP capability called
"FILTER=SIEVE". It adds the ability to manually invoke Sieve filtering
in IMAP. More information can be found in
doc/plugins/imap_filter_sieve.txt.
- The Sieve address test caused an assertion panic for invalid addresses
with UTF-8 codepoints in the localpart. Fixed by properly detecting
invalid addresses with UTF-8 codepoints in the localpart and skipping
these like other invalid addresses while iterating addresses for the
address test.
- Make the length of the subject header for the vacation response
configurable and enfor...
2024 Jan 07
1
Possible bug using FLAG_WORD_BREAKS with fullwidth Unicode codepoints
On Thu, Jan 04, 2024 at 05:50:22PM +0100, Robert Stepanek wrote:
> Since I am undecided yet if and how to fix this in Xapian I haven't
> come up with a pull request. Because trac currently is offline, I
> could not file a bug. I hope it's OK to post my analysis here first,
> I'll be happy to follow up reporting that bug proper later (should we
> conclude that it actually
2011 Dec 18
10
[Bug 1964] New: QoS/DSCP names false translated to ToS hex value
https://bugzilla.mindrot.org/show_bug.cgi?id=1964
Bug #: 1964
Summary: QoS/DSCP names false translated to ToS hex value
Classification: Unclassified
Product: Portable OpenSSH
Version: 5.9p1
Platform: amd64
OS/Version: Linux
Status: NEW
Severity: normal
Priority: P2
Component: ssh
2024 Jan 10
2
Possible bug using FLAG_WORD_BREAKS with fullwidth Unicode codepoints
On Tue, Jan 9, 2024, at 3:28 AM, Olly Betts wrote:
> Thanks, that looks good - now merged.
Thanks!
> Did you already check the other ranges for cased letters? I can but if
> you have already there's not much point.
I did not. If you find time, that'd be great. Otherwise I can make room for it in the next days.
> > The fullwidth "????? ??????" tests suggests to
2018 Jul 01
1
Released Pigeonhole v0.5.2 for Dovecot v2.3.2.
...lled
>>   "FILTER=SIEVE". It adds the ability to manually invoke Sieve filtering
>>   in IMAP. More information can be found in
>>   doc/plugins/imap_filter_sieve.txt.
>> - The Sieve address test caused an assertion panic for invalid addresses
>>   with UTF-8 codepoints in the localpart. Fixed by properly detecting
>>   invalid addresses with UTF-8 codepoints in the localpart and skipping
>>   these like other invalid addresses while iterating addresses for the
>>   address test.
>> - Make the length of the subject header for the vacation r...
2020 Mar 30
2
Plotmath on Fedora 31 broken with with pango >= 1.44 - workarounds?
On Monday 30 March 2020 at 15:24 +1300, Paul Murrell wrote:
> Hi
>
> I have created an R branch that contains a potential fix ...
>
> https://svn.r-project.org/R/branches/R-symfam/
>
> This allows, for example, ...
>
> cairo_pdf(symbolfamily="OpenSymbol")
>
> ... to specify that the OpenSymbol family should be used as the
> "symbol" font
2024 Jan 04
1
Possible bug using FLAG_WORD_BREAKS with fullwidth Unicode codepoints
I think I found a bug in Xapian 1.5 when using FLAG_WORD_BREAKS for input that contains characters in Unicode Halfwidth and Fullwidth Forms (https://unicode.org/charts/PDF/UFF00.pdf).
Since I am undecided yet if and how to fix this in Xapian I haven't come up with a pull request. Because trac currently is offline, I could not file a bug. I hope it's OK to post my analysis here first,
2016 Sep 19
2
Pull requests: CJK words and Snippet generator
...doesn't currently have a separate stage for
> normalisation and for stemming.
I'll investigate both options with tests and will merge them into
Xapian's unit tests where it makes sense. I won't be able to come up
with it until next week, though.
> The main issue is that new codepoints get added (and the odd one changes
> category) in each new Unicode version, so if you're using different
> Unicode versions at index time and at search time, the terms you get
> won't match each other. [...] If Xapian's CJK::codepoint_is_cjk() and ICU have different ideas of
>...
2020 Mar 30
2
Plotmath on Fedora 31 broken with with pango >= 1.44 - workarounds?
...though,
> for example, ...
>
> F8EB E6 # LEFT PAREN TOP # parenlefttp (CUS)
> F8EC E7 # LEFT PAREN EXTENDER # parenleftex (CUS)
> F8ED E8 # LEFT PAREN BOTTOM # parenleftbt (CUS)
>
> Even OpenSymbol is missing a few (though perhaps not very common
> ones) ...
All the F8* codepoints are in the private use area. That means you
can't rely on them existing in standard Unicode fonts.
You need to use correct Unicode values for things to work:
Ux239? for parenthesis, brackets
https://www.unicode.org/charts/PDF/U2300.pdf
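A quick way to check this claim is Python's `unicodedata` module. The sketch below is illustrative only: the `PUA_TO_STANDARD` table is an assumption that pairs the three private-use codepoints quoted above with the U+239B to U+239D pieces from the Miscellaneous Technical block, and the helper name is hypothetical.

```python
import unicodedata


def in_private_use_area(cp: int) -> bool:
    """True if the codepoint has general category Co (private use)."""
    return unicodedata.category(chr(cp)) == "Co"


# Assumed pairing of the quoted F8xx values with standard codepoints.
PUA_TO_STANDARD = {
    0xF8EB: 0x239B,  # left paren top      -> LEFT PARENTHESIS UPPER HOOK
    0xF8EC: 0x239C,  # left paren extender -> LEFT PARENTHESIS EXTENSION
    0xF8ED: 0x239D,  # left paren bottom   -> LEFT PARENTHESIS LOWER HOOK
}

if __name__ == "__main__":
    for pua, std in PUA_TO_STANDARD.items():
        # Private-use codepoints have no standard name, so a font is free
        # to put any glyph (or none) there.
        print(f"U+{pua:04X} private use: {in_private_use_area(pua)}; "
              f"U+{std:04X} is {unicodedata.name(chr(std))}")
```

Private-use codepoints carry no standardised meaning, so only the U+239x values can be expected to render consistently across fonts that cover that block.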
> F8E6 BD # VERTICAL ARROW EXTENDER # arrowv...
2018 Jun 24
0
Released Pigeonhole v0.5.2.rc1 for Dovecot v2.3.2.rc1.
...r the a vendor-defined IMAP capability called
"FILTER=SIEVE". It adds the ability to manually invoke Sieve filtering
in IMAP. More information can be found in
doc/plugins/imap_filter_sieve.txt.
- The Sieve address test caused an assertion panic for invalid addresses
with UTF-8 codepoints in the localpart. Fixed by properly detecting
invalid addresses with UTF-8 codepoints in the localpart and skipping
these like other invalid addresses while iterating addresses for the
address test.
- Make the length of the subject header for the vacation response
configurable and enfor...
2018 Jun 24
0
Released Pigeonhole v0.5.2.rc1 for Dovecot v2.3.2.rc1.
...r the a vendor-defined IMAP capability called
"FILTER=SIEVE". It adds the ability to manually invoke Sieve filtering
in IMAP. More information can be found in
doc/plugins/imap_filter_sieve.txt.
- The Sieve address test caused an assertion panic for invalid addresses
with UTF-8 codepoints in the localpart. Fixed by properly detecting
invalid addresses with UTF-8 codepoints in the localpart and skipping
these like other invalid addresses while iterating addresses for the
address test.
- Make the length of the subject header for the vacation response
configurable and enfor...
2018 Jun 30
0
Released Pigeonhole v0.5.2 for Dovecot v2.3.2.
...d IMAP capability called
> "FILTER=SIEVE". It adds the ability to manually invoke Sieve filtering
> in IMAP. More information can be found in
> doc/plugins/imap_filter_sieve.txt.
> - The Sieve address test caused an assertion panic for invalid addresses
> with UTF-8 codepoints in the localpart. Fixed by properly detecting
> invalid addresses with UTF-8 codepoints in the localpart and skipping
> these like other invalid addresses while iterating addresses for the
> address test.
> - Make the length of the subject header for the vacation response
> c...
2001 Dec 23
0
Need Help Sambaserver is not accessible
Here is the packet trace - ICMP seems to be coming from my SAMBA Server -
see frame 3
Does the netstat output look correct in my first request for help?
Joel Morrow
jiram@aol.com
TRACE
Frame 1 (92 on wire, 92 captured)
Arrival Time: Dec 22, 2001 09:36:08.333676000
Time delta from previous packet: 1.999444000 seconds
Time relative to first packet: 2.000088000 seconds
Frame
2006 Nov 08
14
Increased memory requirements on 1.2
I just recently upgraded a rails app of mine to run on edge (and the
1-2-pre-release branch) and I noticed my fcgis required roughly 6-8MBs
more memory after just a couple requests.
For example, each fcgi on edge would start around 40MB and rise to
~46MBs after a couple requests.
I downgraded my app back to 1.1.6 and each fcgi would start at around
33MBs and rise to ~38MB.
As a result of the
2006 Mar 21
2
How do I get substring of utf-8 string?
I'm trying to get a substring from a UTF-8 encoded string (say, the first
50 characters of the string). String#[0..49] would give me the first
50 bytes, not 50 characters.
I know there is the jcode library, but it only lets you count the number of
characters in a UTF-8 string.
The unicode gem doesn't seem to help much. The unicode_hacks gem seems to
solve the problem, but it also seems to