Displaying 20 results from an estimated 10000 matches similar to: "pigeonhole, regex, UTF-8"
2023 Jul 25
1
Bug in perl=TRUE regexp matching?
On 7/24/23 4:10 AM, Duncan Murdoch wrote:
> On 23/07/2023 9:01 p.m., Brodie Gaslam wrote:
>>
>>
>> On 7/23/23 4:29 PM, Duncan Murdoch wrote:
>>> The help page for `?gsub` says (in the context of performance
>>> considerations):
>>>
>>>
>>> "... just one UTF-8 string will force all the matching to be done in
>>>
2024 Aug 01
1
Question about regexp edge case
Thanks Tomas. Do note that my original post also mentioned a bug or doc
error in the PCRE docs for this regexp:
> - perl = TRUE does *not* give the documented result on at least one
> system (which is "123456789", because "{,5}" is documented to not be a
> quantifier, so it should only match the literal string "{,5}").
Duncan
On 2024-08-01 6:49 a.m.,
2024 Aug 09
1
Question about regexp edge case
On 8/1/24 20:55, Duncan Murdoch wrote:
> Thanks Tomas. Do note that my original post also mentioned a bug or
> doc error in the PCRE docs for this regexp:
>
>> - perl = TRUE does *not* give the documented result on at least one
>> system (which is "123456789", because "{,5}" is documented to not be
>> a quantifier, so it should only match the
2023 Jun 01
1
why does [A-Z] include 'T' in an Estonian locale?
On 5/30/23 17:45, Ben Bolker wrote:
> Inspired by this old Stack Overflow question
>
> https://stackoverflow.com/questions/19765610/when-does-locale-affect-rs-regular-expressions
>
>
> I was wondering why this is TRUE:
>
> Sys.setlocale("LC_ALL", "et_EE")
> grepl("[A-Z]", "T")
>
> TRE's documentation at
>
2020 Jun 08
1
Potential issue with perl-based pattern matching with Unicode characters on Windows R 4.0 and above
Hi everyone,
I've noticed new behavior in `regexpr(..., perl = TRUE)` on Windows with
R 4.0 and above with Unicode characters. Here's a minimal example where I'd
expect to see a start value of `5` (as R 3.6.2 and below gives), but R
4.0.0 (and R 4.0.1) now returns:
```
> regexpr("b", "foo\U0001F937bar", perl = TRUE)
#> [1] 6
#>
2023 May 30
3
why does [A-Z] include 'T' in an Estonian locale?
Inspired by this old Stack Overflow question
https://stackoverflow.com/questions/19765610/when-does-locale-affect-rs-regular-expressions
I was wondering why this is TRUE:
Sys.setlocale("LC_ALL", "et_EE")
grepl("[A-Z]", "T")
TRE's documentation at
<https://laurikari.net/tre/documentation/regex-syntax/> says that a
range "is shorthand for
2019 Jul 16
3
pigeonhole question: filtering on delivered-to in case of fetchmail
So, one of the problems I am seeing is that people are trying to fake
users into revealing information by sending from an outside domain but
with an internal reply to address and claiming to be administration, IT
or what not.
I can set up something that will reject if from is outside the domain but
reply to is internal. The problem is in some setups, there are fetchmail
setups. I do not want to
2024 Jul 29
1
Question about regexp edge case
On StackOverflow (here:
https://stackoverflow.com/questions/78803652/why-does-gsub-in-r-match-one-character-too-many)
there was a question about this result:
> gsub("^([0-9]{,5}).*","\\1","123456789")
[1] "123456"
The OP expected "12345" as the result. Several points were raised:
- The R docs don't mention the case of {,5} for the
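
For comparison only, here is a minimal sketch of the same pattern in Python. It assumes Python's `re` module, which is neither of the engines R uses (TRE or PCRE), so it says nothing about what R should return. It only shows how an engine behaves when it documents `{,5}` as "zero to five repetitions".

```python
import re

# Assumption: Python's `re` engine, NOT R's TRE or PCRE.
# Python documents that omitting the lower bound in {m,n} means zero.
text = "123456789"

open_lower = re.sub(r"^([0-9]{,5}).*", r"\1", text)   # lower bound omitted
explicit = re.sub(r"^([0-9]{0,5}).*", r"\1", text)    # lower bound spelled out

print(open_lower, explicit)
```

Writing the lower bound explicitly as `{0,5}` avoids relying on how a given engine treats the omitted bound.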
2019 Jan 24
4
Discard mail with from date older than xxxx
On 1/24/19 12:15 AM, Per Jessen wrote:
> Trever L. Adams wrote:
>
>> I know that sieve doesn't do math. The file would be created
>> externally.
>>
>> Based on examples in section 4.4 of
>> https://tools.ietf.org/html/rfc5260#section-4
>>
>> I figured something like this would work.
>>
>> if date :value "le" :originalzone
2011 Apr 26
2
possible bug leading to lmtp crashes
On 04/25/2011 09:12 AM, Trever L. Adams wrote:
>
> As I look at the code in the hg reference above, I think there is a bug.
> If HOME should be preserved as first, and some of these others may be
> critical to proper operation, they should be preserved automatically, no
> matter what the configuration says. This seems to be contrary to the
> code and to the top of the page
2015 Aug 19
2
Samba 4 DC - no AES kerberos tickets - only arcfour
Hi Trever,
things improved after resetting user/machine passwords, however only the session key is using aes256 now, the ticket itself is still arcfour:
root@ubuntu1:~# kinit user09999
user09999@S4DOM.TEST's Password:
root@ubuntu1:~# klist -v
Credentials cache: FILE:/tmp/krb5cc_0
Principal: user09999@S4DOM.TEST
Cache version: 4
Server: krbtgt/S4DOM.TEST@
2011 Feb 14
0
[LLVMdev] Introducing LLBrowse: A graphical browser for LLVM modules
On 2011-02-14 20:58, Török Edwin wrote:
> On 2011-02-14 20:39, Talin wrote:
>>
>>
>> On Mon, Feb 14, 2011 at 1:17 AM, Renato Golin <rengolin at systemcall.org
>> <mailto:rengolin at systemcall.org>> wrote:
>>
>> I think this deserves a blog post...
>>
>> I'd like to wait until I get some feedback - I don't know yet if
2019 Feb 22
2
[PATCH 2/2] Cygwin: implement case-insensitive Unicode user and group name matching
On Feb 22 16:02, Darren Tucker wrote:
> On Fri, Feb 22, 2019 at 03:32:43PM +1100, Darren Tucker wrote:
> > On Wed, 20 Feb 2019 at 23:54, Corinna Vinschen <vinschen at redhat.com> wrote:
> > > The previous revert enabled case-insensitive user names again. This
> > > patch implements the case-insensitive user and group name matching.
> > > To allow Unicode
2009 Aug 24
0
[LLVMdev] Regular Expression lib support
On Sun, Aug 23, 2009 at 5:56 PM, Daniel Dunbar<daniel at zuster.org> wrote:
> We would like to have access to some kind of regular expression
> library inside LLVM. For example, we need this to extend the FileCheck
> test case checking tool to support regular expressions.
>
> There are three obvious options:
> 1. Roll our own library. Multiple unnamed individuals may even
2003 Dec 22
2
[LLVMdev] hello.bc & binary code
hi,
I tried to build hello.cpp using both llvmg++ and GNU g++.
The generated LLVM bytecode is about 960K,
while the binary generated by g++ is only 13K.
Could anyone explain the difference between the two results?
BTW:
I rebuilt the cfrontend on RH Linux 9.0, but when I build hello.cpp,
llvmg++ reports warnings too. It shows:
-----------------------------
[yue at RH9
2010 Nov 17
2
Bug in agrep computing edit distance?
I posted this yesterday to r-help and Ben Bolker suggested reposting it
here...
Dickison, Daniel <ddickison <at> carnegielearning.com> writes:
>
> The documentation for agrep says it uses the Levenshtein edit distance,
> but it seems to get this wrong in certain cases when there is a
> combination of deletions and substitutions. For example:
>
> >
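
As a reference point for the report above, here is a minimal sketch of the textbook Levenshtein distance in Python. This is not the code `agrep` uses; the function name `levenshtein` and the sample strings are illustrative assumptions. It only gives a baseline for checking cases that combine deletions and substitutions.

```python
def levenshtein(a: str, b: str) -> int:
    """Textbook dynamic-programming edit distance (insert, delete, substitute)."""
    # prev[j] holds the distance between the processed prefix of a and b[:j].
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]  # distance from a[:i] to the empty string
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


# Hypothetical inputs: one deletion plus one substitution.
print(levenshtein("abc", "bd"))
```

For these inputs the distance is 2: delete `a`, then substitute `c` with `d`.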
2017 Dec 22
1
Typos in ?regex
I found three little typos in the ?regex manual. Apologies for the lack of a diff, as the utility is not (yet) installed on this machine.
1. "There is a also" should probably be "There is also".
2. In the list of functions following "This section covers the regular expressions"..., "grep" appears twice. The other one should probably be "grepl".
3.
2011 Feb 14
5
[LLVMdev] Introducing LLBrowse: A graphical browser for LLVM modules
OK I figured out what the problem was - I assume you are trying to compile
under Snow Leopard? There is a problem with linking against wxWidgets
because the prebuilt binaries for wxWidgets are compiled in 32-bit mode, and
the default compilation mode in Snow Leopard is 64 bits. See this article
for an explanation:
http://wiki.wxwidgets.org/Development:_wxMac#Building_under_10.6_Snow_Leopard
2015 Jul 14
2
krbtgt user not showing aes types
I have found source4/scripting/devel/chgtdcpass for adding the aes types
to machines. I know you have to change the password of normal users.
How do you fix this for krbtgt? Can you just change the password? Is
there a recommended method?
Thank you for any help,
Trever
2011 Feb 02
1
LDAP and GSSAPI problems
This is a continuation of a problem I have been having. Samba 4 has
recently changed to require binds. I need LDAP to verify users exist. I
am using Kerberos (GSSAPI) as the passdb. Samba can handle
GSSAPI/Kerberos SASL binds.
I have the following in my dovecot-ldap setup for userdb:
dn = smtp/mailhost.example.org@EXAMPLE.ORG
sasl_bind = yes
sasl_mech = GSSAPI
sasl_realm = EXAMPLE.ORG