thr3ads.net - similar to: "[PATCH v2 0/6] powerpc: queued spinlocks and rwlocks"

[PATCH 0/8] powerpc: queued spinlocks and rwlocks

2020 Jul 02

12

[PATCH 0/8] powerpc: queued spinlocks and rwlocks

This series adds an option to use queued spinlocks for powerpc, and makes it the default for the Book3S-64 subarch. This effort starts with the generic code so it's very simple but still very performant. There are optimisations that can be made to slowpaths, but I think it's better to attack those incrementally if/when we find things, and try to add the improvements to generic code as

[PATCH v4 0/6] powerpc: queued spinlocks and rwlocks

2020 Jul 24

8

[PATCH v4 0/6] powerpc: queued spinlocks and rwlocks

Updated with everybody's feedback (thanks all), and more performance results. What I've found is I might have been measuring the worst load point for the paravirt case, and by looking at a range of loads it's clear that queued spinlocks are overall better even on PV, doubly so when you look at the generally much improved worst case latencies. I have defaulted it to N even though

[PATCH v3 0/6] powerpc: queued spinlocks and rwlocks

2020 Jul 06

13

[PATCH v3 0/6] powerpc: queued spinlocks and rwlocks

v3 is updated to use __pv_queued_spin_unlock, noticed by Waiman (thank you). Thanks, Nick Nicholas Piggin (6): powerpc/powernv: must include hvcall.h to get PAPR defines powerpc/pseries: move some PAPR paravirt functions to their own file powerpc: move spinlock implementation to simple_spinlock powerpc/64s: implement queued spinlocks and rwlocks powerpc/pseries: implement paravirt

[PATCH v3 0/6] powerpc: queued spinlocks and rwlocks

2020 Jul 06

13

[PATCH v3 0/6] powerpc: queued spinlocks and rwlocks

v3 is updated to use __pv_queued_spin_unlock, noticed by Waiman (thank you). Thanks, Nick Nicholas Piggin (6): powerpc/powernv: must include hvcall.h to get PAPR defines powerpc/pseries: move some PAPR paravirt functions to their own file powerpc: move spinlock implementation to simple_spinlock powerpc/64s: implement queued spinlocks and rwlocks powerpc/pseries: implement paravirt

[PATCH v3 3/6] powerpc: move spinlock implementation to simple_spinlock

2020 Jul 06

0

[PATCH v3 3/6] powerpc: move spinlock implementation to simple_spinlock

To prepare for queued spinlocks. This is a simple rename except to update preprocessor guard name and a file reference. Signed-off-by: Nicholas Piggin <npiggin at gmail.com> --- arch/powerpc/include/asm/simple_spinlock.h | 292 ++++++++++++++++++ .../include/asm/simple_spinlock_types.h | 21 ++ arch/powerpc/include/asm/spinlock.h | 285 +----------------

[PATCH v8 0/6] Implement qspinlock/pv-qspinlock on ppc

2016 Dec 05

9

[PATCH v8 0/6] Implement qspinlock/pv-qspinlock on ppc

Hi All, this is the fairlock patchset. You can apply them and build successfully. patches are based on linux-next qspinlock can avoid waiter starved issue. It has about the same speed in single-thread and it can be much faster in high contention situations especially when the spinlock is embedded within the data structure to be protected. v7 -> v8: add one patch to drop a function call

[PATCH v8 0/6] Implement qspinlock/pv-qspinlock on ppc

2016 Dec 05

9

[PATCH v8 0/6] Implement qspinlock/pv-qspinlock on ppc

Hi All, this is the fairlock patchset. You can apply them and build successfully. patches are based on linux-next qspinlock can avoid waiter starved issue. It has about the same speed in single-thread and it can be much faster in high contention situations especially when the spinlock is embedded within the data structure to be protected. v7 -> v8: add one patch to drop a function call

[PATCH v5 0/6] powerPC/pSeries use pv-qpsinlock as the default spinlock implemention

2016 Jun 02

8

[PATCH v5 0/6] powerPC/pSeries use pv-qpsinlock as the default spinlock implemention

From: root <root at ltcalpine2-lp13.aus.stglabs.ibm.com> change from v4: BUG FIX. thanks boqun reporting this issue. struct __qspinlock has different layout in bigendian mahcine. native_queued_spin_unlock() may write value to a wrong address. now fix it. change from v3: a big change in [PATCH v4 4/6] pv-qspinlock: powerpc support pv-qspinlock no other patch changed. and the patch

[PATCH v5 0/6] powerPC/pSeries use pv-qpsinlock as the default spinlock implemention

2016 Jun 02

8

[PATCH v5 0/6] powerPC/pSeries use pv-qpsinlock as the default spinlock implemention

From: root <root at ltcalpine2-lp13.aus.stglabs.ibm.com> change from v4: BUG FIX. thanks boqun reporting this issue. struct __qspinlock has different layout in bigendian mahcine. native_queued_spin_unlock() may write value to a wrong address. now fix it. change from v3: a big change in [PATCH v4 4/6] pv-qspinlock: powerpc support pv-qspinlock no other patch changed. and the patch

[PATCH v5 1/6] qspinlock: powerpc support qspinlock

2016 Jun 03

2

[PATCH v5 1/6] qspinlock: powerpc support qspinlock

On Thu, 2016-06-02 at 17:22 +0800, Pan Xinhui wrote: > Base code to enable qspinlock on powerpc. this patch add some #ifdef > here and there. Although there is no paravirt related code, we can > successfully build a qspinlock kernel after apply this patch. This is missing the IO_SYNC stuff ... It means we'll fail to do a full sync to order vs MMIOs. You need to add that back in the

[PATCH v5 1/6] qspinlock: powerpc support qspinlock

2016 Jun 03

2

[PATCH v5 1/6] qspinlock: powerpc support qspinlock

On Thu, 2016-06-02 at 17:22 +0800, Pan Xinhui wrote: > Base code to enable qspinlock on powerpc. this patch add some #ifdef > here and there. Although there is no paravirt related code, we can > successfully build a qspinlock kernel after apply this patch. This is missing the IO_SYNC stuff ... It means we'll fail to do a full sync to order vs MMIOs. You need to add that back in the

[PATCH v8 1/6] powerpc/qspinlock: powerpc support qspinlock

2016 Dec 06

1

[PATCH v8 1/6] powerpc/qspinlock: powerpc support qspinlock

On Mon, Dec 05, 2016 at 10:19:21AM -0500, Pan Xinhui wrote: > This patch add basic code to enable qspinlock on powerpc. qspinlock is > one kind of fairlock implementation. And seen some performance improvement > under some scenarios. > > queued_spin_unlock() release the lock by just one write of NULL to the > ::locked field which sits at different places in the two endianness

[PATCH v8 1/6] powerpc/qspinlock: powerpc support qspinlock

2016 Dec 06

1

[PATCH v8 1/6] powerpc/qspinlock: powerpc support qspinlock

On Mon, Dec 05, 2016 at 10:19:21AM -0500, Pan Xinhui wrote: > This patch add basic code to enable qspinlock on powerpc. qspinlock is > one kind of fairlock implementation. And seen some performance improvement > under some scenarios. > > queued_spin_unlock() release the lock by just one write of NULL to the > ::locked field which sits at different places in the two endianness

[PATCH v3 0/6] powerpc use pv-qpsinlock instead of spinlock

2016 May 17

6

[PATCH v3 0/6] powerpc use pv-qpsinlock instead of spinlock

change fome v1: separate into 6 pathes from one patch some minor code changes. benchmark test results are below. run 3 tests on pseries IBM,8408-E8E with 32cpus, 64GB memory perf bench futex hash perf bench futex lock-pi perf record -advRT || perf bench sched messaging -g 1000 || perf report summary: _____test________________spinlcok______________pv-qspinlcok_____ |futex hash | 556370 ops |

[PATCH v3 0/6] powerpc use pv-qpsinlock instead of spinlock

2016 May 17

6

[PATCH v3 0/6] powerpc use pv-qpsinlock instead of spinlock

change fome v1: separate into 6 pathes from one patch some minor code changes. benchmark test results are below. run 3 tests on pseries IBM,8408-E8E with 32cpus, 64GB memory perf bench futex hash perf bench futex lock-pi perf record -advRT || perf bench sched messaging -g 1000 || perf report summary: _____test________________spinlcok______________pv-qspinlcok_____ |futex hash | 556370 ops |

[PATCH v9 0/6] Implement qspinlock/pv-qspinlock on ppc

2016 Dec 06

6

[PATCH v9 0/6] Implement qspinlock/pv-qspinlock on ppc

Hi All, this is the fairlock patchset. You can apply them and build successfully. patches are based on linux-next qspinlock can avoid waiter starved issue. It has about the same speed in single-thread and it can be much faster in high contention situations especially when the spinlock is embedded within the data structure to be protected. v8 -> v9: mv qspinlocm config entry to

[PATCH v9 0/6] Implement qspinlock/pv-qspinlock on ppc

2016 Dec 06

6

[PATCH v9 0/6] Implement qspinlock/pv-qspinlock on ppc

Hi All, this is the fairlock patchset. You can apply them and build successfully. patches are based on linux-next qspinlock can avoid waiter starved issue. It has about the same speed in single-thread and it can be much faster in high contention situations especially when the spinlock is embedded within the data structure to be protected. v8 -> v9: mv qspinlocm config entry to

[PATCH v3 0/6] powerpc use pv-qpsinlock as the default spinlock implemention

2016 May 25

10

[PATCH v3 0/6] powerpc use pv-qpsinlock as the default spinlock implemention

change from v2: __spin_yeild_cpu() will yield slices to lpar if target cpu is running. remove unnecessary rmb() in __spin_yield/wake_cpu. __pv_wait() will check the *ptr == val. some commit message change change fome v1: separate into 6 pathes from one patch some minor code changes. I do several tests on pseries IBM,8408-E8E with 32cpus, 64GB memory. benchmark test results are below. 2

[PATCH v3 0/6] powerpc use pv-qpsinlock as the default spinlock implemention

2016 May 25

10

[PATCH v3 0/6] powerpc use pv-qpsinlock as the default spinlock implemention

change from v2: __spin_yeild_cpu() will yield slices to lpar if target cpu is running. remove unnecessary rmb() in __spin_yield/wake_cpu. __pv_wait() will check the *ptr == val. some commit message change change fome v1: separate into 6 pathes from one patch some minor code changes. I do several tests on pseries IBM,8408-E8E with 32cpus, 64GB memory. benchmark test results are below. 2

[PATCH v5 0/6] powerPC/pSeries use pv-qpsinlock as the default spinlock implemention

2016 Jun 02

9

[PATCH v5 0/6] powerPC/pSeries use pv-qpsinlock as the default spinlock implemention

change from v4: BUG FIX. thanks boqun reporting this issue. struct __qspinlock has different layout in bigendian mahcine. native_queued_spin_unlock() may write value to a wrong address. now fix it. sorry for not even doing a test on bigendian machine before!!! change from v3: a big change in [PATCH v4 4/6] pv-qspinlock: powerpc support pv-qspinlock no other patch changed. and the patch

similar to: [PATCH v2 0/6] powerpc: queued spinlocks and rwlocks