thr3ads.net - similar to: "[LLVMdev] ldmxcsr reordering issue"

Displaying 20 results from an estimated 1000 matches similar to: "[LLVMdev] ldmxcsr reordering issue"

2019 Sep 16

Handling of FP denormal values

Hi all, While reviewing a recent clang documentation change, I became aware of an issue with the way that clang is handling FP denormals. There is currently some support for variations in the way denormals are handled, but it isn't consistent across architectures and generally feels kind of half-baked. I'd like to discuss possible solutions to this problem. First, there is a clang

2012 Jun 14

High CPU usage

Hi Mark, Code below: int16_t* samples; int16_t* fbSilenceFrame; void *fSpeexState; float eng(0.f); int speexFrameSize(0); speex_encoder_ctl(speexState, SPEEX_GET_FRAME_SIZE, &speexFrameSize); for (int i = 0; i < speexFrameSize; i++) { eng += samples[i] * samples[i]; } if (eng / speexFrameSize < 3.f) { memcpy(samples, silenceFrame, speexFrameSize * sizeof(int16_t)); } where
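
A minimal Python sketch of the energy gate the quoted code appears to describe, assuming the intent is to replace a low-energy frame with a silence frame. The name `gate_frame` and the list-based frames are illustrative assumptions and not part of the Speex API; the threshold of 3.0 is taken from the quoted code.

```python
def gate_frame(samples, silence_frame, threshold=3.0):
    """Return a copy of silence_frame if the mean energy of samples is
    below threshold, otherwise a copy of samples (hypothetical helper)."""
    frame_size = len(samples)
    if frame_size == 0:
        return []  # nothing to measure; avoids dividing by zero

    # Sum of squared samples, accumulated in floating point as in the post.
    energy = 0.0
    for s in samples:
        energy += float(s) * float(s)

    if energy / frame_size < threshold:
        return list(silence_frame[:frame_size])
    return list(samples)
```

For example, `gate_frame([1, 1, -1, 0], [0, 0, 0, 0])` has a mean energy of 0.75 and so returns the silence frame.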

2019 Sep 17

[cfe-dev] Handling of FP denormal values

On Mon, Sep 16, 2019 at 9:43 PM Matt Arsenault via cfe-dev < cfe-dev at lists.llvm.org> wrote: > > > On Sep 16, 2019, at 19:57, Kaylor, Andrew via llvm-dev < > llvm-dev at lists.llvm.org> wrote: > > > Do we need an ftz fast-math flag? > > > This would be useful for matching a handful of AMDGPU instructions (a fmad > that only always flushes being the

2009 Sep 23

High CPU usage

Hi Jeff, Hi Jean-Marc, I first modified the FPU control word to raise an exception whenever a denormal is used. Then I used the debugger to locate the exceptions and added VERY_SMALLs where they seem to fit well. Although I got CPU usage as low as 10%, I seriously lack knowledge of how things work inside speex. So just changing some code is not the best idea for me. My second attempt was to

2014 Jan 17

[LLVMdev] Offset overflow on calling __chkstc and __alloca

Hi, Attempting to use LLVM in JIT mode for AMD64, we ran into a problem. When the jitted routine needs a big stack frame (> 1 page), the system attempts to call __chkstk to probe the stack. This attempt triggers an assertion in RuntimeDyldELF::resolveX86_64Relocation(), case ELF::R_X86_64_PC32, because the RealOffset does not fit in 32 bits. The same happens with __alloca (when

2019 Mar 18

[RFC] Making space for a flush-to-zero flag in FastMathFlags

We knew the day when we needed another FMF bit was coming back in: https://reviews.llvm.org/D39304 ...it was just a question of 'when'. :) I'm guessing that an FTZ bit won't be the last new bit needed if we consider permutations between strict FP and fast-math. Even without that, denormals-as-zero (DAZ) might also be useful? So rather than continuing to carve these out bit-by-bit,
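
The difference between FTZ and DAZ mentioned above can be illustrated with a small Python emulation. It assumes the usual x86 meaning of the two terms (FTZ flushes denormal results to zero, DAZ treats denormal inputs as zero). The helper names are hypothetical, and the code only models the semantics in software; it does not touch any hardware control register or LLVM flag.

```python
import math
import sys

MIN_NORMAL = sys.float_info.min  # smallest positive normal double


def is_denormal(x):
    """True for nonzero values smaller in magnitude than the smallest normal."""
    return x != 0.0 and abs(x) < MIN_NORMAL


def flush(x):
    """Replace a denormal with a zero of the same sign; pass others through."""
    return math.copysign(0.0, x) if is_denormal(x) else x


def mul_ftz(a, b):
    """Emulated FTZ multiply: only the result is flushed."""
    return flush(a * b)


def mul_daz(a, b):
    """Emulated DAZ multiply: the inputs are flushed before multiplying."""
    return flush(a) * flush(b)
```

With `tiny = MIN_NORMAL / 4`, the product `tiny * 4.0` is a normal number, so `mul_ftz(tiny, 4.0)` keeps it while `mul_daz(tiny, 4.0)` yields zero because the denormal input is discarded first.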

2011 Jul 09

[LLVMdev] LLVM floating point rounding modes

Hi, I am not sure if this is the right mailing list to ask my question, if not, please refer me to the proper one. Is there any support for rounding modes in LLVM floating point? I looked in the assembler reference manual, and it doesn't seem so. I am thinking about choosing LLVM as one of the backends for my programming language Babel-17 (www.babel-17.com). Babel-17 features interval

2019 Mar 16

[RFC] Making space for a flush-to-zero flag in FastMathFlags

Hi, I need to add a flush-denormals-to-zero (FTZ) flag to FastMathFlags, but we've already used up the 7 bits available in Value::SubclassOptionalData (the "backing storage" for FPMathOperator::getFastMathFlags()). These are the possibilities I can think of: 1. Increase the size of FPMathOperator. This gives us some additional bits for FTZ and other fastmath flags we'd want

2017 Apr 20

[cfe-dev] FE_INEXACT being set for an exact conversion from float to unsigned long long

> This seems like it was done for perf reason (mispredict). Conditional-to-cmov transformation should keep > from introducing additional observable side-effects, and it's clear that whatever did this did not account > for floating point exception. That’s a very reasonable statement, but I’m not sure it corresponds to the way we have typically approached this sort of problem. In

2004 Mar 05

Lyapunov exponent code for time series

Dear all, Has anyone worked on code for calculating the Lyapunov exponent of time series data? Or is any package available for computing Lyapunov exponents? Please advise, and many thanks in advance. Catherine X Wang

2005 May 26

Buidling R on Linux (Itanium) fails (PR#7897)

Full_Name: Reinhold Bader Version: 2.1.0 OS: SLES 9 Submission from: (NULL) (129.187.48.1) I tried to build R using the Intel compilers (icc, ifort) and the Math Kernel Libraries for BLAS/LAPACK Optimizations used were -O3, and the FPICFLAGS="-fpic" was exported. The build failed with the following error message: ... dumping R code in package 'methods' Saving namespace image

2013 Oct 16

[LLVMdev] Multiple modules JITting

Hi, Can you please clarify? We're looking for a way to use LLVM to JIT many modules, assuming that the full list of modules and their content are not available at a time when some of jitted pieces are already in use. Is it feasible to destruct ExecutionEngine but keep jitted code alive? Are jitted binary codes position independent? (or is there a way to relocate) If the solution requires

2017 Apr 19

[cfe-dev] FE_INEXACT being set for an exact conversion from float to unsigned long long

Changing the list from cfe-dev to llvm-dev > On 20 Apr 2017, at 4:52 AM, Michael Clark <michaeljclark at mac.com> wrote: > > I’m getting close. I think it may be an issue with an individual intrinsic. I’m looking for the X86 lowering of Instruction::FPToUI. > > I found a comment around the rationale for using a conditional move versus a branch. I believe the predicate logic

2004 Apr 22

Lyapunov exponent?

Hello, Does anybody know if there is somewhere in R a function to calculate the Lyapunov exponent in a time series? Thanks, Philippe Grosjean. Prof. Philippe Grosjean, Numerical Ecology of Aquatic Systems, Mons-Hainaut University, Pentagone, 8, Av. du

2012 May 31

time-series statistics collection

Hello, I am trying to collect several global measures or statistics for time series, as well as R packages that can compute them. I have found several of them in papers and books, but the literature is so large that I am sure I am missing some: skewness, kurtosis, min, max, mean, SD, trend, seasonality, periodicity, chaos (Lyapunov Exponent) / Largest Lyapunov Exponent (I think is the same

2006 May 18

help

Dear Sir, I am a French student and a new user of the R software. After using the command x<-read.delim(“clipboard”) to read an Excel spreadsheet, I want to run the BDS test and calculate the Lyapunov exponent. I have loaded the packages tseries and tseriesChaos. When I run bds.test(x,m=2), the R software unfortunately displays “error in as.vector(x,mode= “double”) :

2010 Jan 29

Lyapunov Discrete Time Equation

Dear all, I need to solve the following Lyapunov matrix equation: C=ACA' + B, with A and B given square symmetric matrices. Does anyone know of a package that can solve the Lyapunov matrix equation in R? Or even a C/Fortran implementation? I did not find one on netlib. Thank you.
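
One simple way to approach the equation above is fixed-point iteration, sketched here in plain Python. It assumes the spectral radius of A is below 1, so that repeatedly applying C <- A C A' + B converges. The function names are illustrative, and this is a sketch under that assumption, not a replacement for a library solver.

```python
def matmul(X, Y):
    """Multiply two matrices given as lists of rows."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))]
            for i in range(len(X))]


def transpose(X):
    """Return the transpose of a matrix given as a list of rows."""
    return [list(col) for col in zip(*X)]


def solve_discrete_lyapunov(A, B, tol=1e-12, max_iter=10000):
    """Iterate C <- A C A' + B, starting from C = B, until the largest
    entrywise change drops below tol. Assumes spectral radius of A < 1."""
    n = len(B)
    At = transpose(A)
    C = [row[:] for row in B]
    for _ in range(max_iter):
        ACAt = matmul(matmul(A, C), At)
        nxt = [[ACAt[i][j] + B[i][j] for j in range(n)] for i in range(n)]
        diff = max(abs(nxt[i][j] - C[i][j])
                   for i in range(n) for j in range(n))
        C = nxt
        if diff < tol:
            return C
    raise ValueError("no convergence; is the spectral radius of A below 1?")
```

For a diagonal A with entries 0.5 and 0.25 and B equal to the identity, the iteration approaches the diagonal solution with entries 1/(1 - 0.25) and 1/(1 - 0.0625).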

2017 Jul 01

[PATCH 1/2] nv110/exa: Remove depbars

Removed explicit depbar instructions as they're not used by the blob anymore. Signed-off-by: Aaryaman Vasishta <jem456.vasishta at gmail.com> --- src/shader/exac8nv110.fp | 5 ++--- src/shader/exac8nv110.fpc | 10 ++++------ src/shader/exacanv110.fp | 5 ++--- src/shader/exacanv110.fpc | 10 ++++------ src/shader/exacmnv110.fp | 5 ++--- src/shader/exacmnv110.fpc | 10 ++++------

2013 Oct 17

[LLVMdev] Multiple modules JITting

> From: llvmdev-bounces at cs.uiuc.edu [mailto:llvmdev-bounces at cs.uiuc.edu] > On Behalf Of Mikhail Lyapunov > Subject: [LLVMdev] Multiple modules JITting > We're looking for a way to use LLVM to JIT many modules, assuming that > the full list of modules and their content are not available at a time > when some of jitted pieces are already in use. We do this in our

2017 Jun 27

[PATCH v4] nv110/exa: update sched codes

v4: Updated the wait dependency bars based on tex component masks. This patch adds proper delays to maxwell exa shaders. Tested with rendercheck -f a8r8g8b8. I am still wondering whether the rd's are required. We could still wait on the write bars instead. eg. see "sched (st 0xf wr 0x1 wt 0x2) (st 0xf wr 0x1 wt 0x2) (st 0xf)" in exacmnv110.fp Trello:
