Displaying 2 results from an estimated 2 matches for "vec_body".
Did you mean:
get_body
2018 Aug 02
2
Vectorizing remainder loop
...s already busted. Typically, when VF is that large, what you'll get in the remainder is masked vector like below, and vec_remainder_body is reasonably hot as you say in your original mail. As such, remainder loop vectorization isn't a solution for that problem.
for (i=0;i<N;i+=2048){
Vec_body
}
for (i<M;i+=1024){ // where M is the smallest multiple of 1024 over N
If (I < N) {
Vec_Remainder_Body
}
}
If your HW designers insist that the compiler to generate
VF=2048 // main vector loop
VF=1024 // vectorized remainder 1
VF=512 // vectorized remainder 2
...
Remainder i...
2018 Aug 03
2
Vectorizing remainder loop
...ically, when VF is that large, what you'll get in the remainder is masked vector like below, and vec_remainder_body is reasonably hot as you say in your original mail. As such, remainder loop vectorization isn't a solution for that problem.
for (i=0;i<N;i+=2048){
Vec_body
}
for (i<M;i+=1024){ // where M is the smallest multiple of 1024 over N
If (I < N) {
Vec_Remainder_Body
}
}
If your HW designers insist that the compiler to generate
VF=2048 // main vector loop
VF...