thr3ads.net - similar to: "boxplot: reverse y-axis order"

Displaying 20 results from an estimated 40000 matches similar to: "boxplot: reverse y-axis order"

[PATCH 05/14] arm: implement exception and hypercall entries.

2012 Feb 13

[PATCH 05/14] arm: implement exception and hypercall entries.

arm: implement exception and hypercall entries. xen/arch/arm/xen/Makefile | 3 + xen/arch/arm/xen/asm-offsets.c | 61 ++++++++ xen/arch/arm/xen/entry.S | 596 +++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ xen/arch/arm/xen/hypercalls.S | 67 +++++++++ xen/arch/arm/xen/physdev.c | 41 +++++ 5 files changed, 768 insertions(+), 0

MMX IDCT for theora-exp

2005 Jul 20

MMX IDCT for theora-exp

Hello, I'm attaching IDCT MMX patch. I reused IDCT from theora-a3-MMXd.zip. It should work on 64bit X86 platform too. Here is most used functions when playing video with jet aircrafts (gripen) Ogg logical stream 310b2968 is Theora 720x480 29.97 fps video Encoded frame content is 720x480 with 0x0 offset I can play this video with like 200-300 frame drops on Athlon XP 1700+ CPU load (with

Optimised qmf_synth and iir_mem16

2007 Dec 02

Optimised qmf_synth and iir_mem16

Hi all, I've taken preglows ARM versions of qmf_synth and iir_mem16 from rockboxes speex codec, and tweaked them a bit further for some more speed. I attach them here so you can review and take on any changes you want. Please let me know if you have questions etc. Thanks, Robin -- Robin Watts, Email: <mailto:Robin.Watts@wss.co.uk> Warm Silence Software, WWW:

[ARM] Register pressure with -mthumb forces register reload before each call

2020 Mar 31

[ARM] Register pressure with -mthumb forces register reload before each call

Hi, Compiling attached test-case, which is reduced version of of uECC_shared_secret from tinycrypt library [1], with --target=arm-linux-gnueabi -march=armv6-m -Oz -S results in reloading of register holding function's address before every call to blx: ldr r3, .LCPI0_0 blx r3 mov r0, r6 mov r1, r5 mov r2, r4 ldr r3,

[PATCH v1 01/27] x86/crypto: Adapt assembly for PIE support

2017 Oct 20

[PATCH v1 01/27] x86/crypto: Adapt assembly for PIE support

* Thomas Garnier <thgarnie at google.com> wrote: > Change the assembly code to use only relative references of symbols for the > kernel to be PIE compatible. > > Position Independent Executable (PIE) support will allow to extended the > KASLR randomization range below the -2G memory limit. > diff --git a/arch/x86/crypto/aes-x86_64-asm_64.S

[PATCH v1 01/27] x86/crypto: Adapt assembly for PIE support

2017 Oct 20

[PATCH v1 01/27] x86/crypto: Adapt assembly for PIE support

On 20 October 2017 at 09:24, Ingo Molnar <mingo at kernel.org> wrote: > > * Thomas Garnier <thgarnie at google.com> wrote: > >> Change the assembly code to use only relative references of symbols for the >> kernel to be PIE compatible. >> >> Position Independent Executable (PIE) support will allow to extended the >> KASLR randomization range below

[PATCH v1 01/27] x86/crypto: Adapt assembly for PIE support

2017 Oct 20

[PATCH v1 01/27] x86/crypto: Adapt assembly for PIE support

[PATCH] maxwell,pascal: add scheduling data to shaders

2018 Sep 08

[PATCH] maxwell,pascal: add scheduling data to shaders

Generated with envysched. Tested by running rendercheck from piglit, running mplayer -vo xv, and staring at gnome-shell. Signed-off-by: Rhys Perry <pendingchaos02 at gmail.com> --- src/shader/exac8nv110.fp | 11 ++++---- src/shader/exac8nv110.fpc | 22 ++++++++-------- src/shader/exacanv110.fp | 11 ++++---- src/shader/exacanv110.fpc | 22 ++++++++-------- src/shader/exacmnv110.fp | 10

[ARM] Register pressure with -mthumb forces register reload before each call

2020 Apr 15

[ARM] Register pressure with -mthumb forces register reload before each call

Hi, I have attached WIP patch for adding foldMemoryOperand to Thumb1InstrInfo. For the following case: void f(int x, int y, int z) { void bar(int, int, int); bar(x, y, z); bar(x, z, y); bar(y, x, z); bar(y, y, x); } it calls foldMemoryOperand twice, and thus converts two calls from blx to bl. callMI->dump() shows the function name "bar" correctly, however in generated

[LLVMdev] A question about StrongPhiElimination

2009 Aug 20

[LLVMdev] A question about StrongPhiElimination

Hello, I've encountered a problem similar to 'lost-copy' when using the StrongPhiElimination and wonder whether it is a incompatibility issue between the two different algorithms used in StrongPhiElimination.cpp. The StrongPhiElimination is mostly based on the algorithm in Zoran Budimilic et al's "Fast Copy Coalescing and Live-Range Identification" ([1]), while the

[PATCH v5 2/2] nv110/exa: update sched codes

2017 Jul 01

[PATCH v5 2/2] nv110/exa: update sched codes

v5: Rebased on depbar removal patch; removed a redundant read dep-bar. This patch adds proper delays to maxwell exa shaders. rendercheck tests seem consistent with/without this patch. I haven't extensively tested them though. Trello: https://trello.com/c/6LPB2EIS/174-update-maxwell-shaders-with-proper-delays Signed-off-by: Aaryaman Vasishta <jem456.vasishta at gmail.com> ---

[PATCH] nv110/exa: update sched codes

2017 Jun 03

[PATCH] nv110/exa: update sched codes

This patch adds proper delays to maxwell exa shaders. rendercheck tests seem consistent with/without this patch. I haven't extensively tested them though. Trello: https://trello.com/c/6LPB2EIS/174-update-maxwell-shaders-with-proper-delays Signed-off-by: Aaryaman Vasishta <jem456.vasishta at gmail.com> --- src/shader/exac8nv110.fp | 10 +++++----- src/shader/exac8nv110.fpc | 18

[PATCH v2] nv110/exa: update sched codes

2017 Jun 05

[PATCH v2] nv110/exa: update sched codes

Nice work! See my comments below, and double-check if some of them can be applied to the shaders I didn't review yet. I recommend you to test your work because if one sched code is wrong, you are likely going to kill your card and reboot your box. :-) On 06/03/2017 04:16 PM, Aaryaman Vasishta wrote: > v2: Add missing delays > > This patch adds proper delays to maxwell exa

[PATCH v2 1/7] exa: add GM10x acceleration support

2016 Oct 27

[PATCH v2 1/7] exa: add GM10x acceleration support

rendercheck -f a8r8g8b8 passes as much as on a GK208, and xv appears to work. Very lightly tested. Instead of sticking coordinates into pushbufs, the vertex shader is modified to read them from a constbuf, indexed by vertex id. This approach could be used for all nvc0 generations, but I didn't want to rock the boat. Signed-off-by: Ilia Mirkin <imirkin at alum.mit.edu> ---

[PATCH v3] nv110/exa: update sched codes

2017 Jun 10

[PATCH v3] nv110/exa: update sched codes

See the 'wt' on the first fmul in exacanv110.fp, exacmnv110.fp and exasanv110.fp. Any ideas on what could be causing the first fmul to require $r0 and/or $r1? Cheers, Aaryaman On Sat, Jun 10, 2017 at 4:10 PM, Aaryaman Vasishta < jem456.vasishta at gmail.com> wrote: > This patch adds proper delays to maxwell exa shaders. rendercheck tests > seem consistent with/without this

[PATCH v3] nv110/exa: update sched codes

2017 Jun 12

[PATCH v3] nv110/exa: update sched codes

On 06/10/2017 09:14 AM, Aaryaman Vasishta wrote: > See the 'wt' on the first fmul in exacanv110.fp, exacmnv110.fp and > exasanv110.fp. Any ideas on what could be causing the first fmul to > require $r0 and/or $r1? 'tex nodep $r4 $r2 0x0 0x1 t2d 0xf' is actually: 'tex nodep $r4:$r7 $r2 0x0 0x1 t2d 0xf' Very confusing, I know. > > Cheers, > Aaryaman

[PATCH v4] nv110/exa: update sched codes

2017 Jun 29

[PATCH v4] nv110/exa: update sched codes

Do you still have some glitches or does it work correctly now? Did you also remove the spurious wait dep bars between v3 and v4? On 06/27/2017 05:16 PM, Aaryaman Vasishta wrote: > v4: Updated the wait dependancy bars based on tex component masks. > > This patch adds proper delays to maxwell exa shaders. Tested with > rendercheck -f a8r8g8b8. > > I am still wondering whether

[PATCH] exa: add GM10x acceleration support

2016 Oct 17

[PATCH] exa: add GM10x acceleration support

Looks reasonable, some minor comments below. On 10/16/2016 02:06 AM, Ilia Mirkin wrote: > rendercheck -f a8r8g8b8 passes as much as on a GK208, and xv appears to > work. Very lightly tested. > > Instead of sticking coordinates into pushbufs, the vertex shader is > modified to read them from a constbuf, indexed by vertex id. This > approach could be used for all nvc0 generations,

[PATCH v4] nv110/exa: update sched codes

2017 Jun 28

[PATCH v4] nv110/exa: update sched codes

BTW, you can drop those explicit "depbar" ops. I think they're only needed when you're doing something weird with barriers. Blob doesn't use them (anymore) On Tue, Jun 27, 2017 at 11:16 AM, Aaryaman Vasishta <jem456.vasishta at gmail.com> wrote: > v4: Updated the wait dependancy bars based on tex component masks. > > This patch adds proper delays to maxwell

[PATCH] Blackfin: cleanup astat/cc/hardware loop asm clobbers

2009 Apr 24

[PATCH] Blackfin: cleanup astat/cc/hardware loop asm clobbers

Most asm statements clobber ASTAT bits (shifts, maxes, etc...) but do declare the register as clobbered. Same thing with CC in a few places. Some places make an attempt at clobbering some hardware loop registers, but it's very incomplete compared with how many asm statements actually use hardware loops. Signed-off-by: Mike Frysinger <vapier at gentoo.org> --- libspeex/bfin.h

similar to: boxplot: reverse y-axis order