thr3ads.net - llvm dev - [LLVMdev] Proposal: New DAG node type for reciprocal operation [Sep 2012]

If this information is useful, please help other people find it:
Share via:

shahid shahid

2012-Sep-21 05:05 UTC

[LLVMdev] Proposal: New DAG node type for reciprocal operation

--- On Thu, 9/20/12, Jim Grosbach <grosbach at apple.com> wrote:

From: Jim Grosbach <grosbach at apple.com>
Subject: Re: [LLVMdev] Proposal: New DAG node type for reciprocal operation
To: "Weiming Zhao" <weimingz at codeaurora.org>
Cc: llvmdev at cs.uiuc.edu
Date: Thursday, September 20, 2012, 3:32 PM

Sounds like a reasonable fit for a target-specific DAG combine. I suspect a
target specific node wouldn't be necessary and the patterns could be matched
directly.
-Jim
Yes, a target specific node is not necessary, direct pattern matching would be
enough for the required transformation.Having reciprocal node may also give
opportunity for other target specific transformation.
-Shahid
On Sep 20, 2012, at 3:26 PM, Weiming Zhao <weimingz at codeaurora.org>
wrote:
Hi,  In relaxed/fast math mode, if we can convert a/b to a * (1/b), we may get
more performance when (1) “b” is loop invariant or (2) arch has faster
reciprocal instruction (e.g. recipe/recips on ARM)  or (3) arch has no vector
div, but has vector mul and recip.  So ,with this node type, a div node can be
converted to a mul and a recip when desired. Then, each arch can further lower
the recip node. Even if the arch has no recip support, allowing other passes to
hoist  “1/b” out of loop may still be profitable.  It this feasible?
 Thanks,Weiming  Qualcomm Innovation Center, Inc. is a member of Code Aurora
Forum, hosted by The Linux Foundation  
_______________________________________________
LLVM Developers mailing list
LLVMdev at cs.uiuc.edu         http://llvm.cs.uiuc.edu
http://lists.cs.uiuc.edu/mailman/listinfo/llvmdev


-----Inline Attachment Follows-----

_______________________________________________
LLVM Developers mailing list
LLVMdev at cs.uiuc.edu         http://llvm.cs.uiuc.edu
http://lists.cs.uiuc.edu/mailman/listinfo/llvmdev
-------------- next part --------------
An HTML attachment was scrubbed...
URL:
<http://lists.llvm.org/pipermail/llvm-dev/attachments/20120920/c9554c00/attachment.html>

Zhao, Weiming

2012-Sep-24 17:27 UTC

head link

[LLVMdev] Proposal: New DAG node type for reciprocal operation

Yes, what I mean is a target independent node in the ISD::NodeType enum.
I already did the node transformation DAGCombiner and target-specific lowering
in the first place. It worked. But introducing a specific node will make the
logic more clear.
For example, in ARM, FDIV is a scalar operation. So, after DAGCombiner and
Vector Type legalize, vectorized FDIV has been expanded into scalar versions,
which breaks the intention of utilizaing vectorizable mul/recip to implement a
vectorized fdiv. To fix that, one need to either combine them back or change the
logic of vector type legalize.

Thanks,
Weiming

From: llvmdev-bounces at cs.uiuc.edu [mailto:llvmdev-bounces at cs.uiuc.edu] On
Behalf Of shahid shahid
Sent: Thursday, September 20, 2012 10:06 PM
To: Weiming Zhao; Jim Grosbach
Cc: llvmdev at cs.uiuc.edu
Subject: Re: [LLVMdev] Proposal: New DAG node type for reciprocal operation



--- On Thu, 9/20/12, Jim Grosbach <grosbach at apple.com<mailto:grosbach
at apple.com>> wrote:

From: Jim Grosbach <grosbach at apple.com<mailto:grosbach at
apple.com>>
Subject: Re: [LLVMdev] Proposal: New DAG node type for reciprocal operation
To: "Weiming Zhao" <weimingz at codeaurora.org<mailto:weimingz
at codeaurora.org>>
Cc: llvmdev at cs.uiuc.edu<mailto:llvmdev at cs.uiuc.edu>
Date: Thursday, September 20, 2012, 3:32 PM
Sounds like a reasonable fit for a target-specific DAG combine. I suspect a
target specific node wouldn't be necessary and the patterns could be matched
directly.

-Jim

Yes, a target specific node is not necessary, direct pattern matching would be
enough for the required transformation.Having reciprocal node may also give
opportunity for other target specific transformation.

-Shahid

On Sep 20, 2012, at 3:26 PM, Weiming Zhao <weimingz at
codeaurora.org</mc/compose?to=weimingz at codeaurora.org>> wrote:


Hi,

In relaxed/fast math mode, if we can convert a/b to a * (1/b), we may get more
performance when (1) “b” is loop invariant or (2) arch has faster reciprocal
instruction (e.g. recipe/recips on ARM)  or (3) arch has no vector div, but has
vector mul and recip.

So ,with this node type, a div node can be converted to a mul and a recip when
desired. Then, each arch can further lower the recip node. Even if the arch has
no recip support, allowing other passes to hoist  “1/b” out of loop may still be
profitable.

It this feasible?

Thanks,
Weiming

Qualcomm Innovation Center, Inc. is a member of Code Aurora Forum, hosted by The
Linux Foundation

_______________________________________________
LLVM Developers mailing list
LLVMdev at cs.uiuc.edu</mc/compose?to=LLVMdev at cs.uiuc.edu>        
http://llvm.cs.uiuc.edu
http://lists.cs.uiuc.edu/mailman/listinfo/llvmdev


-----Inline Attachment Follows-----
_______________________________________________
LLVM Developers mailing list
LLVMdev at cs.uiuc.edu</mc/compose?to=LLVMdev at cs.uiuc.edu>        
http://llvm.cs.uiuc.edu
http://lists.cs.uiuc.edu/mailman/listinfo/llvmdev


-------------- next part --------------
An HTML attachment was scrubbed...
URL:
<http://lists.llvm.org/pipermail/llvm-dev/attachments/20120924/b0a96b3c/attachment.html>

shahid shahid

2012-Sep-25 05:45 UTC

head link

[LLVMdev] Proposal: New DAG node type for reciprocal operation

>  To fix that, one need to either combine them back or change the logic of vector
> type legalize. 
Combining them back is simple, however if a scalar operation has combined &
vectorized it should not have been expanded into scalar.So changing the logic of
vector type legalize seems better solution.
-Shahid

--- On Mon, 9/24/12, Zhao, Weiming <weimingz at quicinc.com> wrote:

From: Zhao, Weiming <weimingz at quicinc.com>
Subject: RE: [LLVMdev] Proposal: New DAG node type for reciprocal operation
To: "shahid shahid" <shahid77c at yahoo.com>, "Weiming
Zhao" <weimingz at codeaurora.org>, "Jim Grosbach"
<grosbach at apple.com>
Cc: "llvmdev at cs.uiuc.edu" <llvmdev at cs.uiuc.edu>
Date: Monday, September 24, 2012, 10:27 AM

Yes, what I mean is a target independent node in the ISD::NodeType enum. 
I already did the node transformation DAGCombiner and target-specific lowering
in the first place. It worked. But introducing a specific node will make the
 logic more clear.  
For example, in ARM, FDIV is a scalar operation. So, after DAGCombiner and
Vector Type legalize, vectorized FDIV has been expanded into scalar versions,
which
 breaks the intention of utilizaing vectorizable mul/recip to implement a
vectorized fdiv. To fix that, one need to either combine them back or change the
logic of vector type legalize.

Thanks, 
Weiming 

From: llvmdev-bounces at cs.uiuc.edu [mailto:llvmdev-bounces at cs.uiuc.edu]
On Behalf Of shahid shahid

Sent: Thursday, September 20, 2012 10:06 PM

To: Weiming Zhao; Jim Grosbach

Cc: llvmdev at cs.uiuc.edu

Subject: Re: [LLVMdev] Proposal: New DAG node type for reciprocal operation 

--- On Thu, 9/20/12, Jim Grosbach <grosbach at apple.com> wrote: 

From: Jim Grosbach <grosbach at apple.com>

Subject: Re: [LLVMdev] Proposal: New DAG node type for reciprocal operation

To: "Weiming Zhao" <weimingz at codeaurora.org>

Cc: llvmdev at cs.uiuc.edu

Date: Thursday, September 20, 2012, 3:32 PM 

Sounds like a reasonable fit for a target-specific DAG combine. I suspect a
target specific node wouldn't be necessary and the patterns could be matched
directly.

-Jim 

Yes, a target specific node is not necessary, direct pattern matching would be
enough for the required transformation.Having reciprocal node may also give
opportunity for other target specific transformation.

-Shahid 

On Sep 20, 2012, at 3:26 PM, Weiming Zhao <weimingz at codeaurora.org>
wrote:

Hi, 

In relaxed/fast math mode, if we can convert a/b to a * (1/b), we may get more
performance when (1) “b” is loop invariant or (2) arch has faster reciprocal
instruction (e.g.
 recipe/recips on ARM)  or (3) arch has no vector div, but has vector mul and
recip.

So ,with this node type, a div node can be converted to a mul and a recip when
desired. Then, each arch can further lower the recip node. Even if the arch has
no recip support,
 allowing other passes to hoist  “1/b” out of loop may still be profitable. 

It this feasible? 

Thanks, 

Weiming 

Qualcomm Innovation Center, Inc. is a member of Code Aurora Forum, hosted by The
Linux Foundation

_______________________________________________

LLVM Developers mailing list

LLVMdev at cs.uiuc.edu         http://llvm.cs.uiuc.edu

http://lists.cs.uiuc.edu/mailman/listinfo/llvmdev 

-----Inline Attachment Follows----- 

_______________________________________________

LLVM Developers mailing list

LLVMdev at cs.uiuc.edu         http://llvm.cs.uiuc.edu

http://lists.cs.uiuc.edu/mailman/listinfo/llvmdev 

-------------- next part --------------
An HTML attachment was scrubbed...
URL:
<http://lists.llvm.org/pipermail/llvm-dev/attachments/20120924/153fabf0/attachment.html>

Maybe Matching Threads

Search for more apparently analagous threads

llvm dev - Sep 2012 - [LLVMdev] Proposal: New DAG node type for reciprocal operation

[LLVMdev] Proposal: New DAG node type for reciprocal operation

[LLVMdev] Proposal: New DAG node type for reciprocal operation

[LLVMdev] Proposal: New DAG node type for reciprocal operation

Maybe Matching Threads