Displaying 20 results from an estimated 1000 matches similar to: "[LLVMdev] ldmxcsr reordering issue"
2019 Sep 16
3
Handling of FP denormal values
Hi all,
While reviewing a recent clang documentation change, I became aware of an issue with the way that clang is handling FP denormals. There is currently some support for variations in the way denormals are handled, but it isn't consistent across architectures and generally feels kind of half-baked. I'd like to discuss possible solutions to this problem.
First, there is a clang
2012 Jun 14
1
High CPU usage
Hi Mark,
Code below:
int16_t* samples;
int16_t* fbSilenceFrame;
void *fSpeexState;
float eng(0.f);
int speexFrameSize(0);
speex_encoder_ctl(speexState, SPEEX_GET_FRAME_SIZE, &speexFrameSize);
for (int i = 0; i < speexFrameSize; i++)
{
eng += samples[i] * samples[i];
}
if (eng / speexFrameSize < 3.f)
{
memcpy(samples, silenceFrame, speexFrameSize * sizeof(int16_t));
}
where
2019 Sep 17
2
[cfe-dev] Handling of FP denormal values
On Mon, Sep 16, 2019 at 9:43 PM Matt Arsenault via cfe-dev <
cfe-dev at lists.llvm.org> wrote:
>
>
> On Sep 16, 2019, at 19:57, Kaylor, Andrew via llvm-dev <
> llvm-dev at lists.llvm.org> wrote:
>
>
> Do we need an ftz fast-math flag?
>
>
> This would be useful for matching a handful of AMDGPU instructions (a fmad
> that only always flushes being the
2009 Sep 23
1
High CPU usage
Hi Jeff,
Hi Jean-Marc,
I first modified the FPU control word to raise an exception whenever a denormal is used. Then I used the debugger to locate the exceptions and added VERY_SMALLs where they seem to fit well.
Although I got CPU usage as low as 10%, I seriously lack knowledge of how things work inside speex. So just changing some code is not the best idea for me.
My second attempt was to
2014 Jan 17
2
[LLVMdev] Offset overflow on calling __chkstc and __alloca
Hi,
Attempting to use LLVM in jitting mode for AMD64, we met a problem.
When the jitted routine needs a big stack frame (> 1 page), the system attempts to call __chkstk to probe the stack.
This attempt results in assertion in RuntimeDyldELF::resolveX86_64Relocation(), case ELF::R_X86_64_PC32,
because the RealOffset does not fit in 32 bits.
Same happens with __alloca (when
2019 Mar 18
3
[RFC] Making space for a flush-to-zero flag in FastMathFlags
We knew the day when we needed another FMF bit was coming back in:
https://reviews.llvm.org/D39304
...it was just a question of 'when'. :)
I'm guessing that an FTZ bit won't be the last new bit needed if we
consider permutations between strict FP and fast-math. Even without that,
denormals-as-zero (DAZ) might also be useful?
So rather than continuing to carve these out bit-by-bit,
2011 Jul 09
1
[LLVMdev] LLVM floating point rounding modes
Hi,
I am not sure if this is the right mailing list to ask my question, if not, please refer me to the proper one.
Is there any support for rounding modes in LLVM floating point? I looked in the assembler reference manual, and it doesn't seem so. I am thinking about choosing LLVM as one of the backends for my programming language Babel-17 (www.babel-17.com). Babel-17 features interval
2019 Mar 16
3
[RFC] Making space for a flush-to-zero flag in FastMathFlags
Hi,
I need to add a flush-denormals-to-zero (FTZ) flag to FastMathFlags,
but we've already used up the 7 bits available in
Value::SubclassOptionalData (the "backing storage" for
FPMathOperator::getFastMathFlags()). These are the possibilities I
can think of:
1. Increase the size of FPMathOperator. This gives us some additional
bits for FTZ and other fastmath flags we'd want
2017 Apr 20
4
[cfe-dev] FE_INEXACT being set for an exact conversion from float to unsigned long long
> This seems like it was done for perf reason (mispredict). Conditional-to-cmov transformation should keep
> from introducing additional observable side-effects, and it's clear that whatever did this did not account
> for floating point exception.
That’s a very reasonable statement, but I’m not sure it corresponds to the way we have typically approached this sort of problem.
In
2004 Mar 05
3
Lyapunov exponent code for time series
Dear all,
Has anyone worked on coding for calculating Lyapunov Exponent for a time
series data? or any package is available for computing Lyapunov?
Please advice and many thanks in advance.
Catherine X Wang
2005 May 26
3
Buidling R on Linux (Itanium) fails (PR#7897)
Full_Name: Reinhold Bader
Version: 2.1.0
OS: SLES 9
Submission from: (NULL) (129.187.48.1)
I tried to build R using the Intel compilers (icc, ifort) and the Math Kernel
Libraries
for BLAS/LAPACK
Optimizations used were -O3, and the
FPICFLAGS="-fpic"
was exported.
The build failed with the following error message:
...
dumping R code in package 'methods'
Saving namespace image
2013 Oct 16
2
[LLVMdev] Multiple modules JITting
Hi,
Can you please clarify?
We're looking for a way to use LLVM to JIT many modules, assuming that the full list of modules and their content are not available at a time when some of jitted pieces are already in use.
Is it feasible to destruct ExecutionEngine but keep jitted code alive?
Are jitted binary codes position independent? (or is there a way to relocate) If the solution requires
2017 Apr 19
3
[cfe-dev] FE_INEXACT being set for an exact conversion from float to unsigned long long
Changing the list from cfe-dev to llvm-dev
> On 20 Apr 2017, at 4:52 AM, Michael Clark <michaeljclark at mac.com> wrote:
>
> I’m getting close. I think it may be an issue with an individual intrinsic. I’m looking for the X86 lowering of Instruction::FPToUI.
>
> I found a comment around the rationale for using a conditional move versus a branch. I believe the predicate logic
2004 Apr 22
1
Lyapunov exponent?
Hello,
Does anybody know if there is somewhere in R a function to calculate the
Lyapunov exponent in a time series?
Thanks,
Philippe Grosjean
.......................................................<??}))><....
) ) ) ) )
( ( ( ( ( Prof. Philippe Grosjean
\ ___ )
\/ECO\ ( Numerical Ecology of Aquatic Systems
/\___/ ) Mons-Hainaut University, Pentagone
/ ___ /( 8, Av. du
2012 May 31
2
time-series statistics collection
Hello,
I am trying to collect several global measures or statistics for
time-series as well as packages of R that can compute them. I have found
several of them in papers and books, but the literature is so big i am sure
i am missing several of them.
skewness
kurtosis
min
max
mean
SD
trend
seasonality
periodicity
chaos (Lyapunov Exponent) / Largest Lyapunov Exponent (i think is the same
2006 May 18
2
help
Dear Sir,
I’am a frensh student and i’am a new user of the R software.
After using the command (x<-read.delim(“clipboard”) to read a spreadsheet of Excel, I want to run the bds test and calculate the Lyapunov exponent. I have charged the R software by the packages tseries and tseriesChaos. when i run bds.test(x,m=2) Unfortunately the R software displays “error in as.vector(x,mode= “double”) :
2010 Jan 29
1
Lyapunov Discrete Time Equation
Dear all,
I need to solve the following Lyapunov Matrix equation:
C=ACA' + B,
with A and B given square symmetric matrices. Does anyone knows of a
package that can solve the lyapunov matrix equation in R? Or even a
C/Fortran implementation? I did not find one on netlib.
Thank you.
2017 Jul 01
2
[PATCH 1/2] nv110/exa: Remove depbars
Removed explicit depar instructions as they're not used by the blob anymore.
Signed-off-by: Aaryaman Vasishta <jem456.vasishta at gmail.com>
---
src/shader/exac8nv110.fp | 5 ++---
src/shader/exac8nv110.fpc | 10 ++++------
src/shader/exacanv110.fp | 5 ++---
src/shader/exacanv110.fpc | 10 ++++------
src/shader/exacmnv110.fp | 5 ++---
src/shader/exacmnv110.fpc | 10 ++++------
2013 Oct 17
0
[LLVMdev] Multiple modules JITting
> From: llvmdev-bounces at cs.uiuc.edu [mailto:llvmdev-bounces at cs.uiuc.edu]
> On Behalf Of Mikhail Lyapunov
> Subject: [LLVMdev] Multiple modules JITting
> We're looking for a way to use LLVM to JIT many modules, assuming that
> the full list of modules and their content are not available at a time
> when some of jitted pieces are already in use.
We do this in our
2017 Jun 27
4
[PATCH v4] nv110/exa: update sched codes
v4: Updated the wait dependancy bars based on tex component masks.
This patch adds proper delays to maxwell exa shaders. Tested with
rendercheck -f a8r8g8b8.
I am still wondering whether the rd's are required. We could
still wait on the write bars instead. eg. see
"sched (st 0xf wr 0x1 wt 0x2) (st 0xf wr 0x1 wt 0x2) (st 0xf)" in exacmnv110.fp
Trello: