similar to: [LLVMdev] Handling Masked Vector Operations

Displaying 20 results from an estimated 20000 matches similar to: "[LLVMdev] Handling Masked Vector Operations"

2013 May 02
0
[LLVMdev] Handling Masked Vector Operations
Hi David, > > It seems the only solution is to create an intrinsic: > > llvm_int_load_masked mask, [addr] > > But this unnecessarily shuts down optimization. > I think that using intrinsics is the right solution. I imagine that most interesting load/store optimizations happen before vectorization, so I am not sure how much we can gain by optimizing masked load/stores.
2013 May 02
2
[LLVMdev] Handling Masked Vector Operations
Nadav Rotem <nrotem at apple.com> writes: > For DIV/MOD you can blend the inputs BEFORE the operation. You can > place ones or zeros depending on the operation. Quick follow-up on this. What about using "undef" as the input for false items: tv1 = select mask, v1, undef tv2 = select mask, v2, undef tv3 = div tv1, tv2 v3 = select mask, tv3, undef I'm always confused
2013 May 02
4
[LLVMdev] Handling Masked Vector Operations
Nadav Rotem <nrotem at apple.com> writes: > > For DIV/MOD you can blend the inputs BEFORE the operation. You > can > place ones or zeros depending on the operation. > > Quick follow-up on this. What about using "undef" as the input for > false items: > > tv1 = select mask, v1, undef > tv2 = select
2013 May 02
0
[LLVMdev] Handling Masked Vector Operations
> >> For DIV/MOD you can blend the inputs BEFORE the operation. You can >> place ones or zeros depending on the operation. > > Quick follow-up on this. What about using "undef" as the input for > false items: > > tv1 = select mask, v1, undef > tv2 = select mask, v2, undef > tv3 = div tv1, tv2 > v3 = select mask, tv3, undef > > I'm
2013 May 03
0
[LLVMdev] Handling Masked Vector Operations
Hi David, On 02/05/13 17:57, dag at cray.com wrote: > We're looking at how to handle masked vector operations in architectures > like Knight's Corner. In our case, we have to translate from a fully > vectorized IR that has mask support to llvm IR which does not have mask > support. > > For non-trapping instructions this is fairly straightforward: > > ; Input >
2013 May 09
0
[LLVMdev] Handling Masked Vector Operations
On 2 May 2013 16:57, <dag at cray.com> wrote: > We're looking at how to handle masked vector operations in architectures > like Knight's Corner. In our case, we have to translate from a fully > vectorized IR that has mask support to llvm IR which does not have mask > support. > Has anyone done a comparision between the "fully vectorized IR and "LLVM
2013 May 06
2
[LLVMdev] Handling Masked Vector Operations
Duncan Sands <baldrick at free.fr> writes: > there seems to be a plan to get rid of the select instruction and just use > branches and phi nodes instead. Amongst other things this requires boosting > the power of codegen so that branches+phi nodes can be turned into cmov or > whatever when appropriate. This is a very BAD idea. Are you telling me that every predicated
2013 May 02
0
[LLVMdev] Handling Masked Vector Operations
On Thu, May 2, 2013 at 10:31 AM, <dag at cray.com> wrote: > Nadav Rotem <nrotem at apple.com> writes: > > > > > For DIV/MOD you can blend the inputs BEFORE the operation. You > > can > > place ones or zeros depending on the operation. > > > > Quick follow-up on this. What about using "undef" as the input
2019 Sep 28
9
[PATCH nbdkit v2 0/4] info: Add mode for sending back server time.
v1 was: https://www.redhat.com/archives/libguestfs/2019-September/thread.html#00361 v2: - Adds a patch to rename the reflection plugin to the info plugin. - Adds tests. Rich.
2013 May 07
6
[LLVMdev] Predicated Vector Operations
I'm trying to understand how predicated/masked instructions can be generated in llvm, specifically an instruction where a set bit in the mask will write the new result into the corresponding vector lane in the destination and a clear bit will cause the lane in the destination to remain what it was before the instruction executed. I've seen a few places that suggest 'select' is the
2012 Oct 22
2
[LLVMdev] Predication on SIMD architectures and LLVM
Dan Gohman <dan433584 at gmail.com> writes: > And, in part because a popular trend seems to be to have SIMD units > which don't trap or raise exception flags on arithmetic and which don't > go faster when predicated, such that there's no  reason to predicate > anything except stores and occasionally loads. On these architectures, > simply having intrinsics for
2019 Jan 31
4
[RFC] Vector Predication
On Thu, 31 Jan 2019 at 20:17, Philip Reames via llvm-dev < llvm-dev at lists.llvm.org> wrote: > > On 1/31/19 11:03 AM, David Greene wrote: > > Philip Reames <listmail at philipreames.com> writes: > > > >> Question 1 - Why do we need separate mask and lengths? Can't the > >> length be easily folded into the mask operand? > >> >
2019 Jan 31
6
[RFC] Vector Predication
Hi, There is now an RFC for a roadmap to native vector predication support in LLVM and a prototype implementation:   https://reviews.llvm.org/D57504 The prototype demonstrates: -  Predicated vector intrinsics with an explicit mask and vector length parameter on IR level. -  First-class predicated SDNodes on ISel level. Mask and vector length are value operands. -  An incremental strategy
2013 May 09
0
[LLVMdev] Predicated Vector Operations
On Thu, May 9, 2013 at 8:10 AM, <dag at cray.com> wrote: > Jeff Bush <jeffbush001 at gmail.com> writes: > >> %tx = select %mask, %x, <0.0, 0.0, 0.0 ...> >> %ty = select %mask, %y, <0.0, 0.0, 0.0 ...> >> %sum = fadd %tx, %ty >> %newvalue = select %mask, %sum, %oldvalue >> >> I believe the generated instructions depend on whether
2019 Jan 31
4
[RFC] Vector Predication
Philip Reames <listmail at philipreames.com> writes: > Question 1 - Why do we need separate mask and lengths? Can't the > length be easily folded into the mask operand? > > e.g. newmask = (<4 x i1>)((i4)%y & (1 << %L -1)) > and then pattern matched in the backend if needed I'm a little concerned about how difficult it will be to maintain enough
2006 May 26
8
Comparing two documents in the index
I want to compare two documents in the index (i.e. retrieve the cosine similarity/score between two documents term-vector''s). Is this possible using the standard Ferret functionality? Thanks in advance, Jeroen Bulters -- Posted via http://www.ruby-forum.com/.
2013 May 06
0
[LLVMdev] Handling Masked Vector Operations
On May 6, 2013, at 9:35 AM, dag at cray.com wrote: >>> It's easy enough to write a TableGen pattern to match add+select and >>> emit a masked instruction. Alternative, we can always resort to manual >>> lowering (ugh). >>> >>> For trapping operations this is problematic. Take a load. Here's the >>> same attempt with a load:
2012 Oct 24
0
[LLVMdev] Predication on SIMD architectures and LLVM
On Oct 22, 2012, at 10:15 AM, dag at cray.com wrote: > > It's true that a target-independent predicated IR isn't going to > translate well to a target that doesn't have predication. However, for > targets that do it's a godsend. Even for MIC (Xeon Phi), the predicated IR is not necessary. The instructions that really benefit from predication are loads and stores.
2021 Jan 04
4
Dovecot v2.3.13 released
We are pleased to release v2.3.13. Please find it from locations below: https://dovecot.org/releases/2.3/dovecot-2.3.13.tar.gz https://dovecot.org/releases/2.3/dovecot-2.3.13.tar.gz.sig Binary packages in https://repo.dovecot.org/ Docker images in https://hub.docker.com/r/dovecot/dovecot Aki Tuomi Open-Xchange oy --- * CVE-2020-24386: Specially crafted command can cause IMAP hibernate to
2021 Jan 04
4
Dovecot v2.3.13 released
We are pleased to release v2.3.13. Please find it from locations below: https://dovecot.org/releases/2.3/dovecot-2.3.13.tar.gz https://dovecot.org/releases/2.3/dovecot-2.3.13.tar.gz.sig Binary packages in https://repo.dovecot.org/ Docker images in https://hub.docker.com/r/dovecot/dovecot Aki Tuomi Open-Xchange oy --- * CVE-2020-24386: Specially crafted command can cause IMAP hibernate to