thr3ads.net - similar to: "[LLVMdev] Another memory alignment issue with SSE operations"

Displaying 20 results from an estimated 90 matches similar to: "[LLVMdev] Another memory alignment issue with SSE operations"

[LLVMdev] llvm.x86.sse2.sqrt.pd not using sqrtpd, calling a function that modifies ECX

2013 Jul 19

[LLVMdev] llvm.x86.sse2.sqrt.pd not using sqrtpd, calling a function that modifies ECX

(Changing subject line as diagnosis has changed) I'm attaching the compiled code that I've been getting, both with CodeGenOpt::Default and CodeGenOpt::None . The crash isn't occurring with CodeGenOpt::None, but that seems to be because ECX isn't being used - it still gets set to 0x7fffffff by one of the calls to 76719BA1 I notice that X86::SQRTPD[m|r] appear in

[LLVMdev] SIMD instructions and memory alignment on X86

2013 Jul 19

[LLVMdev] SIMD instructions and memory alignment on X86

Hmm, I'm not able to get those .ll files to compile if I disable SSE and I end up with SSE instructions(including sqrtpd) if I don't disable it. On Thu, Jul 18, 2013 at 10:53 PM, Peter Newman <peter at uformia.com> wrote: > Is there something specifically required to enable SSE? If it's not > detected as available (based from the target triple?) then I don't think

Question about llvm vectors

2020 Aug 20

Question about llvm vectors

Hi Craig, Thank you very much for your answer. I did not want to discuss exactly the semantic and name of one operation but instead raise the question "would it be beneficial to have more vector builtins?". You wrote that the compiler will recognize a pattern and replace it by __builtin_ia32_haddps when possible, but how can I be sure of that? I would have to disassemble the generated

[LLVMdev] Proposal for a new LLVM concurrency memory model

2010 Apr 27

[LLVMdev] Proposal for a new LLVM concurrency memory model

On Monday 26 April 2010 16:09:48 Jeffrey Yasskin wrote: > > Vector atomics are extremely useful on architectures that support them. > > I'm curious about the architectures/instructions you're thinking of. > Something like 'lock; movdqa'? Don't think X86. Think traditional vector machines like the Cray X1/X2. Atomic vector adds and logicals are common

[LLVMdev] Proposal for a new LLVM concurrency memory model

2010 Apr 26

[LLVMdev] Proposal for a new LLVM concurrency memory model

On Mon, Apr 26, 2010 at 1:09 PM, David Greene <dag at cray.com> wrote: > On Monday 26 April 2010 14:03:35 Chandler Carruth wrote: >> We can allow the IR to represent vectors, but unless hardware supports it, >> I think lowering these by decomposing them is more than LLVM should try to >> do. Because of that, I'm not sure we should support vectors as elsewhere

[LLVMdev] Use of movupd instead of movapd for x86

2011 Feb 28

[LLVMdev] Use of movupd instead of movapd for x86

Understood for the aligned case, I want to measure performance degradation for unaligned case. I mean unaligned case versus aligned. I know this is stupid, but I want to try to pass a <4 x float>* as parameter of a routine and at the call site I want to pass a misaligned pointer. Since LLVM is generating movapd instruction it will raise an exception (SEGFAULT), I just want to know if there

[LLVMdev] Use of movupd instead of movapd for x86

2011 Feb 25

[LLVMdev] Use of movupd instead of movapd for x86

Hi all, Is there a way to force llc to generate movupd instruction instead of movapd for x86 target ? I know that movapd is more performant, but I would like to measure degradation when alignment constraints are not met. Best Regards Seb -------------- next part -------------- An HTML attachment was scrubbed... URL:

[LLVMdev] Use of movupd instead of movapd for x86

2011 Mar 01

[LLVMdev] Use of movupd instead of movapd for x86

On Feb 28, 2011, at 2:58 AM, Sebastien DELDON-GNB wrote: > Understood for the aligned case, I want to measure performance degradation for unaligned case. > I mean unaligned case versus aligned. I know this is stupid, but I want to try to pass a <4 x float>* as parameter of a routine and at the call site I want to pass a misaligned pointer. Since LLVM is generating movapd instruction

[LLVMdev] Use of movupd instead of movapd for x86

2011 Feb 25

[LLVMdev] Use of movupd instead of movapd for x86

Sebastien DELDON-GNB <sebastien.deldon at st.com> writes: > Hi all, > > Is there a way to force llc to generate movupd instruction instead of movapd for x86 target ? > > I know that movapd is more performant, but I would like to measure degradation when alignment constraints are not met. On modern processors a movupd on aligned data is going to be indistinguishable in

[LLVMdev] Enabling the SLP vectorizer by default for -O3

2013 Jul 15

[LLVMdev] Enabling the SLP vectorizer by default for -O3

On Jul 14, 2013, at 9:52 PM, Chris Lattner <clattner at apple.com> wrote: > > On Jul 13, 2013, at 11:30 PM, Nadav Rotem <nrotem at apple.com> wrote: > >> Hi, >> >> LLVM’s SLP-vectorizer is a new pass that combines similar independent instructions in a straight-line code. It is currently not enabled by default, and people who want to experiment with it

[LLVMdev] Enabling the SLP vectorizer by default for -O3

2013 Jul 23

[LLVMdev] Enabling the SLP vectorizer by default for -O3

Hi, Sorry for the delay in response. I measured the code size change and noticed small changes in both directions for individual programs. I found a 30k binary size growth for the entire testsuite + SPEC. I attached an updated performance report that includes both compile time and performance measurements. Thanks, Nadav On Jul 14, 2013, at 10:55 PM, Nadav Rotem <nrotem at apple.com>

[LLVMdev] Codegen for vector float->double cast fails on x86 above SSE3

2011 Dec 28

[LLVMdev] Codegen for vector float->double cast fails on x86 above SSE3

I've isolated a bug in SSE codegen to the attached example. define void @f(<2 x float>* %in, <2 x double>* %out) { entry: %0 = load <2 x float>* %in, align 8 %1 = fpext <2 x float> %0 to <2 x double> store <2 x double> %1, <2 x double>* %out, align 1 ret void } The code should load a <2 x float> vector from %in, fpext cast it to

[LLVMdev] buildbot with -vectorize

2012 Jun 28

[LLVMdev] buildbot with -vectorize

On 06/28/2012 02:51 PM, Hal Finkel wrote: > On Sun, 24 Jun 2012 14:44:45 +0200 > Tobias Grosser<tobias at grosser.es> wrote: > [..] > Also, since you're running these on an x86_64 machine, and I think they > don't have unaligned vector load/stores, you should probably add -mllvm > -bb-vectorize-aligned-only to the target flags. What about MOVUPS and MOVUPD? Tobi

[LLVMdev] buildbot with -vectorize

2012 Jun 28

[LLVMdev] buildbot with -vectorize

On Thu, 28 Jun 2012 16:04:38 +0200 Tobias Grosser <tobias at grosser.es> wrote: > On 06/28/2012 02:51 PM, Hal Finkel wrote: > > On Sun, 24 Jun 2012 14:44:45 +0200 > > Tobias Grosser<tobias at grosser.es> wrote: > > > [..] > > > Also, since you're running these on an x86_64 machine, and I think > > they don't have unaligned vector

[LLVMdev] buildbot with -vectorize

2012 Jun 28

[LLVMdev] buildbot with -vectorize

On Sun, 24 Jun 2012 14:44:45 +0200 Tobias Grosser <tobias at grosser.es> wrote: > On 06/24/2012 02:42 PM, Hal Finkel wrote: > > On Sun, 24 Jun 2012 08:17:32 +0200 > > Tobias Grosser<tobias at grosser.es> wrote: > > > >> On 06/24/2012 05:42 AM, Hal Finkel wrote: > >>> On Thu, 21 Jun 2012 16:25:13 +0200 > >>> Tobias

[LLVMdev] Enabling the SLP vectorizer by default for -O3

2013 Jul 15

[LLVMdev] Enabling the SLP vectorizer by default for -O3

On Jul 13, 2013, at 11:30 PM, Nadav Rotem <nrotem at apple.com> wrote: > Hi, > > LLVM’s SLP-vectorizer is a new pass that combines similar independent instructions in a straight-line code. It is currently not enabled by default, and people who want to experiment with it can use the clang command line flag “-fslp-vectorize”. I ran LLVM’s test suite with and without the SLP

Tables

2009 Feb 18

Tables

Are there any standards emerging for how to represent tables in Markdown? Regards, - Daniel -------------------------------------------- Dr Daniel Winterstein Winterwell Associates Ltd tel: 0772 5172 612 http://www.winterwell.com Registered in Scotland, company no. SC342991

[LLVMdev] Enabling the SLP vectorizer by default for -O3

2013 Jul 14

[LLVMdev] Enabling the SLP vectorizer by default for -O3

Hi, LLVM’s SLP-vectorizer is a new pass that combines similar independent instructions in a straight-line code. It is currently not enabled by default, and people who want to experiment with it can use the clang command line flag “-fslp-vectorize”. I ran LLVM’s test suite with and without the SLP vectorizer on a Sandybridge mac (using SSE4, w/o AVX). Based on my performance measurements

problem mounting ocfs2: heartbeat

2005 Jul 12

problem mounting ocfs2: heartbeat

When attempting to mount the OCFS2 file system I'm getting the following error message: ocfs2_hb_ctl: Internal logic failure while starting heartbeat mount.ocfs2: Error when attempting to run /sbin/ocfs2_hb_ctl: "Operation not permitted" I followed the steps given in the users_guide: modprobe ocfs2_dlmfs mount -t configfs none /config mount -t ocfs2_dlmfs none /dlm o2cb_ctl

rejected because extension not found in context 'introutingB'

2013 Jul 20

rejected because extension not found in context 'introutingB'

Dear All, I am trying to recieve call from inbound proxy then route to internal peer (localhost) and then route to outgoing sip proxy but it failing with subject error. Can any one please correct me what i am doing wrong in below config. SIP.conf [Inbound] type=peer context=introuting host=184.107.XXX.XXX disallow=all allow=all [astinside] type=peer context=introutingB host=localhost

similar to: [LLVMdev] Another memory alignment issue with SSE operations