Displaying 20 results from an estimated 40000 matches similar to: "boxplot: reverse y-axis order"
2012 Feb 13
0
[PATCH 05/14] arm: implement exception and hypercall entries.
arm: implement exception and hypercall entries.
xen/arch/arm/xen/Makefile | 3 +
xen/arch/arm/xen/asm-offsets.c | 61 ++++++++
xen/arch/arm/xen/entry.S | 596 +++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
xen/arch/arm/xen/hypercalls.S | 67 +++++++++
xen/arch/arm/xen/physdev.c | 41 +++++
5 files changed, 768 insertions(+), 0
2005 Jul 20
1
MMX IDCT for theora-exp
Hello,
I'm attaching IDCT MMX patch. I reused IDCT from theora-a3-MMXd.zip.
It should work on 64bit X86 platform too.
Here is most used functions when playing video with jet aircrafts (gripen)
Ogg logical stream 310b2968 is Theora 720x480 29.97 fps video
Encoded frame content is 720x480 with 0x0 offset
I can play this video with like 200-300 frame drops on Athlon XP 1700+
CPU load (with
2007 Dec 02
2
Optimised qmf_synth and iir_mem16
Hi all,
I've taken preglows ARM versions of qmf_synth and iir_mem16 from
rockboxes speex codec, and tweaked them a bit further for some more
speed.
I attach them here so you can review and take on any changes you
want.
Please let me know if you have questions etc.
Thanks,
Robin
--
Robin Watts, Email: <mailto:Robin.Watts@wss.co.uk>
Warm Silence Software, WWW:
2020 Mar 31
2
[ARM] Register pressure with -mthumb forces register reload before each call
Hi,
Compiling attached test-case, which is reduced version of of
uECC_shared_secret from tinycrypt library [1], with
--target=arm-linux-gnueabi -march=armv6-m -Oz -S
results in reloading of register holding function's address before
every call to blx:
ldr r3, .LCPI0_0
blx r3
mov r0, r6
mov r1, r5
mov r2, r4
ldr r3,
2017 Oct 20
0
[PATCH v1 01/27] x86/crypto: Adapt assembly for PIE support
* Thomas Garnier <thgarnie at google.com> wrote:
> Change the assembly code to use only relative references of symbols for the
> kernel to be PIE compatible.
>
> Position Independent Executable (PIE) support will allow to extended the
> KASLR randomization range below the -2G memory limit.
> diff --git a/arch/x86/crypto/aes-x86_64-asm_64.S
2017 Oct 20
1
[PATCH v1 01/27] x86/crypto: Adapt assembly for PIE support
On 20 October 2017 at 09:24, Ingo Molnar <mingo at kernel.org> wrote:
>
> * Thomas Garnier <thgarnie at google.com> wrote:
>
>> Change the assembly code to use only relative references of symbols for the
>> kernel to be PIE compatible.
>>
>> Position Independent Executable (PIE) support will allow to extended the
>> KASLR randomization range below
2017 Oct 20
1
[PATCH v1 01/27] x86/crypto: Adapt assembly for PIE support
On 20 October 2017 at 09:24, Ingo Molnar <mingo at kernel.org> wrote:
>
> * Thomas Garnier <thgarnie at google.com> wrote:
>
>> Change the assembly code to use only relative references of symbols for the
>> kernel to be PIE compatible.
>>
>> Position Independent Executable (PIE) support will allow to extended the
>> KASLR randomization range below
2018 Sep 08
0
[PATCH] maxwell,pascal: add scheduling data to shaders
Generated with envysched. Tested by running rendercheck from piglit, running
mplayer -vo xv, and staring at gnome-shell.
Signed-off-by: Rhys Perry <pendingchaos02 at gmail.com>
---
src/shader/exac8nv110.fp | 11 ++++----
src/shader/exac8nv110.fpc | 22 ++++++++--------
src/shader/exacanv110.fp | 11 ++++----
src/shader/exacanv110.fpc | 22 ++++++++--------
src/shader/exacmnv110.fp | 10
2020 Apr 15
4
[ARM] Register pressure with -mthumb forces register reload before each call
Hi,
I have attached WIP patch for adding foldMemoryOperand to Thumb1InstrInfo.
For the following case:
void f(int x, int y, int z)
{
void bar(int, int, int);
bar(x, y, z);
bar(x, z, y);
bar(y, x, z);
bar(y, y, x);
}
it calls foldMemoryOperand twice, and thus converts two calls from blx to bl.
callMI->dump() shows the function name "bar" correctly, however in
generated
2009 Aug 20
0
[LLVMdev] A question about StrongPhiElimination
Hello, I've encountered a problem similar to 'lost-copy' when using
the StrongPhiElimination and wonder whether it is a incompatibility
issue between the two different algorithms used in
StrongPhiElimination.cpp. The StrongPhiElimination is mostly based on
the algorithm in Zoran Budimilic et al's "Fast Copy Coalescing and
Live-Range Identification" ([1]), while the
2017 Jul 01
0
[PATCH v5 2/2] nv110/exa: update sched codes
v5: Rebased on depbar removal patch; removed a redundant read dep-bar.
This patch adds proper delays to maxwell exa shaders. rendercheck tests
seem consistent with/without this patch. I haven't extensively tested
them though.
Trello:
https://trello.com/c/6LPB2EIS/174-update-maxwell-shaders-with-proper-delays
Signed-off-by: Aaryaman Vasishta <jem456.vasishta at gmail.com>
---
2017 Jun 03
0
[PATCH] nv110/exa: update sched codes
This patch adds proper delays to maxwell exa shaders. rendercheck tests
seem consistent with/without this patch. I haven't extensively tested
them though.
Trello:
https://trello.com/c/6LPB2EIS/174-update-maxwell-shaders-with-proper-delays
Signed-off-by: Aaryaman Vasishta <jem456.vasishta at gmail.com>
---
src/shader/exac8nv110.fp | 10 +++++-----
src/shader/exac8nv110.fpc | 18
2017 Jun 05
0
[PATCH v2] nv110/exa: update sched codes
Nice work!
See my comments below, and double-check if some of them can be applied
to the shaders I didn't review yet.
I recommend you to test your work because if one sched code is wrong,
you are likely going to kill your card and reboot your box. :-)
On 06/03/2017 04:16 PM, Aaryaman Vasishta wrote:
> v2: Add missing delays
>
> This patch adds proper delays to maxwell exa
2016 Oct 27
0
[PATCH v2 1/7] exa: add GM10x acceleration support
rendercheck -f a8r8g8b8 passes as much as on a GK208, and xv appears to
work. Very lightly tested.
Instead of sticking coordinates into pushbufs, the vertex shader is
modified to read them from a constbuf, indexed by vertex id. This
approach could be used for all nvc0 generations, but I didn't want to
rock the boat.
Signed-off-by: Ilia Mirkin <imirkin at alum.mit.edu>
---
2017 Jun 10
0
[PATCH v3] nv110/exa: update sched codes
See the 'wt' on the first fmul in exacanv110.fp, exacmnv110.fp and
exasanv110.fp. Any ideas on what could be causing the first fmul to require
$r0 and/or $r1?
Cheers,
Aaryaman
On Sat, Jun 10, 2017 at 4:10 PM, Aaryaman Vasishta <
jem456.vasishta at gmail.com> wrote:
> This patch adds proper delays to maxwell exa shaders. rendercheck tests
> seem consistent with/without this
2017 Jun 12
2
[PATCH v3] nv110/exa: update sched codes
On 06/10/2017 09:14 AM, Aaryaman Vasishta wrote:
> See the 'wt' on the first fmul in exacanv110.fp, exacmnv110.fp and
> exasanv110.fp. Any ideas on what could be causing the first fmul to
> require $r0 and/or $r1?
'tex nodep $r4 $r2 0x0 0x1 t2d 0xf'
is actually:
'tex nodep $r4:$r7 $r2 0x0 0x1 t2d 0xf'
Very confusing, I know.
>
> Cheers,
> Aaryaman
2017 Jun 29
0
[PATCH v4] nv110/exa: update sched codes
Do you still have some glitches or does it work correctly now?
Did you also remove the spurious wait dep bars between v3 and v4?
On 06/27/2017 05:16 PM, Aaryaman Vasishta wrote:
> v4: Updated the wait dependancy bars based on tex component masks.
>
> This patch adds proper delays to maxwell exa shaders. Tested with
> rendercheck -f a8r8g8b8.
>
> I am still wondering whether
2016 Oct 17
0
[PATCH] exa: add GM10x acceleration support
Looks reasonable, some minor comments below.
On 10/16/2016 02:06 AM, Ilia Mirkin wrote:
> rendercheck -f a8r8g8b8 passes as much as on a GK208, and xv appears to
> work. Very lightly tested.
>
> Instead of sticking coordinates into pushbufs, the vertex shader is
> modified to read them from a constbuf, indexed by vertex id. This
> approach could be used for all nvc0 generations,
2017 Jun 28
0
[PATCH v4] nv110/exa: update sched codes
BTW, you can drop those explicit "depbar" ops. I think they're only
needed when you're doing something weird with barriers. Blob doesn't
use them (anymore)
On Tue, Jun 27, 2017 at 11:16 AM, Aaryaman Vasishta
<jem456.vasishta at gmail.com> wrote:
> v4: Updated the wait dependancy bars based on tex component masks.
>
> This patch adds proper delays to maxwell
2009 Apr 24
2
[PATCH] Blackfin: cleanup astat/cc/hardware loop asm clobbers
Most asm statements clobber ASTAT bits (shifts, maxes, etc...) but do
declare the register as clobbered. Same thing with CC in a few places.
Some places make an attempt at clobbering some hardware loop registers,
but it's very incomplete compared with how many asm statements actually
use hardware loops.
Signed-off-by: Mike Frysinger <vapier at gentoo.org>
---
libspeex/bfin.h