Displaying 20 results from an estimated 1000 matches similar to: "links causing dovecot endless search through user's homedirs"
2013 Nov 06
1
[LLVMdev] loop vectorizer
On 06/11/13 08:54, Arnold wrote:
>
>
> Sent from my iPhone
>
> On Nov 5, 2013, at 7:39 PM, Frank Winter <fwinter at jlab.org
> <mailto:fwinter at jlab.org>> wrote:
>
>> Good that you bring this up. I still have no solution to this
>> vectorization problem.
>>
>> However, I can rewrite the code and insert a second loop which
>>
2013 Oct 31
0
[LLVMdev] loop vectorizer misses opportunity, exploit
Hi Nadav,
that's the whole point of it. I can't in general make the index
calculation simpler. The example given is the simplest non-trivial index
function that is needed. It might well be that it's that simple that the
index calculation in this case can be thrown aways altogether and - as
you say - be replaced by the simple loop you mentioned. However, this
cannot be done in the
2013 Nov 15
0
[LLVMdev] Limit loop vectorizer to SSE
Hmm.. I don't quite understand. How can a module validator
catch this, when it's the pointers, i.e. the payload, you pass
as function arguments that need to be aligned.. ?!
Frank
On 15/11/13 15:16, Renato Golin wrote:
> On 15 November 2013 20:05, Frank Winter <fwinter at jlab.org
> <mailto:fwinter at jlab.org>> wrote:
>
> Good catch! That was the problem in my
2005 Jan 25
1
Box-Cox / data transformation question
Dear R users,
Is it reasonable to transform data (measurements of plant height) to the
power of 1/4? I?ve used boxcox(response~A*B) and lambda was close to 0.25.
Regards,
Christoph
2013 Nov 12
0
[LLVMdev] Limit loop vectorizer to SSE
.. forcing the vector size to 4 does not prevent using AVX. I just hit
the following:
LV: We can vectorize this loop!
LV: Found trip count: 4
LV: The Widest type: 64 bits.
LV: The Widest register is: 256 bits.
LV: Using user VF 4.
Looks like I have to disable AVX somehow. (Which is sad on its own.)
Frank
On 12/11/13 10:34, Renato Golin wrote:
> On 12 November 2013 15:14, Frank Winter
2013 Nov 12
0
[LLVMdev] Limit loop vectorizer to SSE
On 12/11/13 11:01, Renato Golin wrote:
> On 12 November 2013 15:53, Frank Winter <fwinter at jlab.org
> <mailto:fwinter at jlab.org>> wrote:
>
> .. forcing the vector size to 4 does not prevent using AVX.
>
>
> Sure. That's more for tests than anything else.
>
> So, there are ways of disabling stuf in Clang, for instance
> "-mattr=-avx"
2013 Nov 06
0
[LLVMdev] loop vectorizer
Sent from my iPhone
> On Nov 5, 2013, at 7:39 PM, Frank Winter <fwinter at jlab.org> wrote:
>
> Good that you bring this up. I still have no solution to this vectorization problem.
>
> However, I can rewrite the code and insert a second loop which eliminates the 'urem' and 'div' instructions in the index calculations. In this case, the inner loop's trip
2013 Oct 31
0
[LLVMdev] loop vectorizer
I tried the following on the hand-unrolled loop:
const std::uint64_t ir0 = i*8+0; // working
const std::uint64_t ir0 = i%4+0; // working
const std::uint64_t ir0 = (i+0)%4; // not working
'+0' means +1,+2,+3 in the unrolled iterations.
'Working' means the SLP vectorizer succeeded.
Thus, when working 'towards' the correct index function, auto
2013 Oct 31
0
[LLVMdev] loop vectorizer misses opportunity, exploit
A quite small but yet complete example function which all vectorization
passes fail to optimize:
#include <cstdint>
#include <iostream>
void bar(std::uint64_t start, std::uint64_t end, float * __restrict__
c, float * __restrict__ a, float * __restrict__ b)
{
for ( std::uint64_t i = start ; i < end ; i += 4 ) {
{
const std::uint64_t ir0 = (i+0)%4 + 8*((i+0)/4);
2005 Jan 05
0
Using windows homedirs as a linux homedirs
Hello,
I would like to have linux users to have same homedirs as in windows. Now home
directories are on windows 2000 server.
How I can map users homedirectory from windows 2000 server to linux workstation
with credentials user gives at logintime?
Or even better, how I can persistently map those homedirectories to linux
(without mapping them as an administrator)?
--
/\_/\
( o.o )
2013 Nov 11
2
[LLVMdev] loop vectorizer: JIT + AVX segfaults
It's not much.
(gdb) bt
#0 0x00007ffff7f6506b in ?? ()
#1 0x000000000045d01a in main () at main.cc:165
Line 165 is the call to the function that was compiled by the JIT'er.
Meaning that JIT'ing the function went well, but the code or the pointer
are somehow corrupt.
There is no particular reason why I am working with the legacy
interface. Would you recommend to use the MCJIT
2013 Nov 06
3
[LLVMdev] loop vectorizer
Good that you bring this up. I still have no solution to this
vectorization problem.
However, I can rewrite the code and insert a second loop which
eliminates the 'urem' and 'div' instructions in the index calculations.
In this case, the inner loop's trip count would be equal to the SIMD
length and the loop vectorizer ignores the loop. Unrolling the loop and
SLP is not an
2013 Nov 15
3
[LLVMdev] Limit loop vectorizer to SSE
On 15 November 2013 20:05, Frank Winter <fwinter at jlab.org> wrote:
> Good catch! That was the problem in my case too. I totally
> overlooked the alignment requirement for AVX.
I wonder if the validation mechanism shouldn't have caught it earlier... Do
you guys run validate on the modules before JIT-ing?
--renato
-------------- next part --------------
An HTML attachment was
2013 Nov 15
2
[LLVMdev] Limit loop vectorizer to SSE
Agreed, is there a pass that will insert a runtime alignment check? Also,
what's the easiest way to get at TargetTransformInfo::getRegisterBitWidth()
so I don't have to hard code 32? Thanks!
-Josh
On Fri, Nov 15, 2013 at 3:20 PM, Frank Winter <fwinter at jlab.org> wrote:
> Hmm.. I don't quite understand. How can a module validator
> catch this, when it's the
2013 Oct 31
3
[LLVMdev] loop vectorizer misses opportunity, exploit
Hi Frank,
This loop should be vectorized by the SLP-vectorizer. It has several scalars (C[0], C[1] … ) that can be merged into a vector. The SLP vectorizer can’t figure out that the stores are consecutive because SCEV can’t analyze the OR in the index calculation:
%2 = and i64 %i.04, 3
%3 = lshr i64 %i.04, 2
%4 = shl i64 %3, 3
%5 = or i64 %4, %2
%11 = getelementptr inbounds float*
2013 Nov 12
2
[LLVMdev] Limit loop vectorizer to SSE
On 12 November 2013 15:14, Frank Winter <fwinter at jlab.org> wrote:
> I am asking because the option 'force-vector-width' is too restrictive.
> I would like to leave open the possibility to use vector width 2.
I was about to say that, and you saved us both one cycle. ;)
What you could do is to force an architecture that doesn't have AVX, only
SSE. I'm not sure how
2013 Nov 12
2
[LLVMdev] Limit loop vectorizer to SSE
On 12 November 2013 15:53, Frank Winter <fwinter at jlab.org> wrote:
> .. forcing the vector size to 4 does not prevent using AVX.
>
Sure. That's more for tests than anything else.
So, there are ways of disabling stuf in Clang, for instance "-mattr=-avx"
or "-target-feature -avx", but I'm not sure how you're doing it in the JIT.
I'm also not sure
2015 Jul 01
3
[LLVMdev] SLP vectorizer on AVX feature
On 1 July 2015 at 21:22, Frank Winter <fwinter at jlab.org> wrote:
> there were two follow-up emails.
I only got one... weird...
> The issue is solved. The SLP vectorizer has
> a magic number built into the code which determines the max. vector length
> to search for. That was set to 128 bits. Increasing it to 256 bits solved
> the issue.
That looks like a simple fix. Is
2002 Feb 28
2
how to disable homedirs?
Hi,
how can i disable the homedir?
I have a Domain login configuration, but do not want server based
profiles. After I add a workstation to the domain, the user profiles,
created during the first login of a domain user, on that machine are
always server based. Sure, I can change this profile to a local profile,
but I think I can change this it in the samba configuration, that all
profiles are
2014 Aug 07
2
[LLVMdev] MCJIT generates MOVAPS on unaligned address
> On Aug 7, 2014, at 2:57 PM, Arnold Schwaighofer <aschwaighofer at apple.com> wrote:
>
> Your .ll file does not have a data layout. Opt will not initialize the DataLayoutPass. The SLP vectorizer will not vectorize because there is no DataLayoutPass.
>
> debug-cmake/bin/opt -default-data-layout="e-m:e-i64:64-f80:128-n8:16:32:64-S128" -basicaa -slp-vectorizer -S