Ben Guthro
2013-Mar-27 13:13 UTC
[PATCH] x86/S3: Restore broken vcpu affinity on resume (v4)
When in SYS_STATE_suspend, and going through the cpu_disable_scheduler path, save a copy of the current cpu affinity, and mark a flag to restore it later. Later, in the resume process, when enabling nonboot cpus restore these affinities. v2: Formatting: Fix hard tabs. remove early return in cpu_disable_scheduler() path. v3: Formatting: Fix remaining errant tab. Move restore_vcpu_affinity() to thaw_domains(), eliminating the need to promote for_each_cpupool() v4: Formatting: Fix if statement spacing. Eliminate unnecessary if statement in thaw_domains() Suppress affinity related logging to XENLOG_DEBUG to reduce noise. Signed-off-by: Ben Guthro <benjamin.guthro@citrix.com> --- xen/arch/x86/acpi/power.c | 3 +++ xen/common/domain.c | 2 ++ xen/common/schedule.c | 45 ++++++++++++++++++++++++++++++++++++++++++--- xen/include/xen/sched.h | 6 ++++++ 4 files changed, 53 insertions(+), 3 deletions(-) diff --git a/xen/arch/x86/acpi/power.c b/xen/arch/x86/acpi/power.c index 3c2585c..f41f0de 100644 --- a/xen/arch/x86/acpi/power.c +++ b/xen/arch/x86/acpi/power.c @@ -96,7 +96,10 @@ static void thaw_domains(void) rcu_read_lock(&domlist_read_lock); for_each_domain ( d ) + { + restore_vcpu_affinity(d); domain_unpause(d); + } rcu_read_unlock(&domlist_read_lock); } diff --git a/xen/common/domain.c b/xen/common/domain.c index 64ee29d..590548e 100644 --- a/xen/common/domain.c +++ b/xen/common/domain.c @@ -126,6 +126,7 @@ struct vcpu *alloc_vcpu( if ( !zalloc_cpumask_var(&v->cpu_affinity) || !zalloc_cpumask_var(&v->cpu_affinity_tmp) || + !zalloc_cpumask_var(&v->cpu_affinity_saved) || !zalloc_cpumask_var(&v->vcpu_dirty_cpumask) ) goto fail_free; @@ -155,6 +156,7 @@ struct vcpu *alloc_vcpu( fail_free: free_cpumask_var(v->cpu_affinity); free_cpumask_var(v->cpu_affinity_tmp); + free_cpumask_var(v->cpu_affinity_saved); free_cpumask_var(v->vcpu_dirty_cpumask); free_vcpu_struct(v); return NULL; diff --git a/xen/common/schedule.c b/xen/common/schedule.c index 83fae4c..7364ff8 100644 --- a/xen/common/schedule.c +++ b/xen/common/schedule.c @@ -541,6 +541,38 @@ void vcpu_force_reschedule(struct vcpu *v) } } +void restore_vcpu_affinity(struct domain *d) +{ + struct vcpu *v; + + for_each_vcpu ( d, v ) + { + vcpu_schedule_lock_irq(v); + + if ( v->affinity_broken ) + { + printk(XENLOG_DEBUG "Restoring affinity for d%dv%d\n", + d->domain_id, v->vcpu_id); + cpumask_copy(v->cpu_affinity, v->cpu_affinity_saved); + v->affinity_broken = 0; + } + + if ( v->processor == smp_processor_id() ) + { + set_bit(_VPF_migrating, &v->pause_flags); + vcpu_schedule_unlock_irq(v); + vcpu_sleep_nosync(v); + vcpu_migrate(v); + } + else + { + vcpu_schedule_unlock_irq(v); + } + } + + domain_update_node_affinity(d); +} + /* * This function is used by cpu_hotplug code from stop_machine context * and from cpupools to switch schedulers on a cpu. @@ -554,7 +586,7 @@ int cpu_disable_scheduler(unsigned int cpu) int ret = 0; c = per_cpu(cpupool, cpu); - if ( (c == NULL) || (system_state == SYS_STATE_suspend) ) + if ( c == NULL ) return ret; for_each_domain_in_cpupool ( d, c ) @@ -567,8 +599,15 @@ int cpu_disable_scheduler(unsigned int cpu) if ( cpumask_empty(&online_affinity) && cpumask_test_cpu(cpu, v->cpu_affinity) ) { - printk("Breaking vcpu affinity for domain %d vcpu %d\n", - v->domain->domain_id, v->vcpu_id); + printk(XENLOG_DEBUG "Breaking affinity for d%dv%d\n", + d->domain_id, v->vcpu_id); + + if (system_state == SYS_STATE_suspend) + { + cpumask_copy(v->cpu_affinity_saved, v->cpu_affinity); + v->affinity_broken = 1; + } + cpumask_setall(v->cpu_affinity); } diff --git a/xen/include/xen/sched.h b/xen/include/xen/sched.h index cabaf27..d15d567 100644 --- a/xen/include/xen/sched.h +++ b/xen/include/xen/sched.h @@ -153,6 +153,9 @@ struct vcpu bool_t defer_shutdown; /* VCPU is paused following shutdown request (d->is_shutting_down)? */ bool_t paused_for_shutdown; + /* VCPU need affinity restored */ + bool_t affinity_broken; + /* * > 0: a single port is being polled; @@ -175,6 +178,8 @@ struct vcpu cpumask_var_t cpu_affinity; /* Used to change affinity temporarily. */ cpumask_var_t cpu_affinity_tmp; + /* Used to restore affinity across S3. */ + cpumask_var_t cpu_affinity_saved; /* Bitmask of CPUs which are holding onto this VCPU''s state. */ cpumask_var_t vcpu_dirty_cpumask; @@ -697,6 +702,7 @@ int schedule_cpu_switch(unsigned int cpu, struct cpupool *c); void vcpu_force_reschedule(struct vcpu *v); int cpu_disable_scheduler(unsigned int cpu); int vcpu_set_affinity(struct vcpu *v, const cpumask_t *affinity); +void restore_vcpu_affinity(struct domain *d); void vcpu_runstate_get(struct vcpu *v, struct vcpu_runstate_info *runstate); uint64_t get_cpu_idle_time(unsigned int cpu); -- 1.7.9.5
Jan Beulich
2013-Mar-28 08:19 UTC
Re: [PATCH] x86/S3: Restore broken vcpu affinity on resume (v4)
>>> On 27.03.13 at 14:13, Ben Guthro <benjamin.guthro@citrix.com> wrote: > When in SYS_STATE_suspend, and going through the cpu_disable_scheduler > path, save a copy of the current cpu affinity, and mark a flag to > restore it later. > > Later, in the resume process, when enabling nonboot cpus restore these > affinities. > > v2: > Formatting: Fix hard tabs. > remove early return in cpu_disable_scheduler() path. > > v3: > Formatting: Fix remaining errant tab. > Move restore_vcpu_affinity() to thaw_domains(), eliminating the need to > promote for_each_cpupool() > > v4: > Formatting: Fix if statement spacing. > Eliminate unnecessary if statement in thaw_domains() > Suppress affinity related logging to XENLOG_DEBUG to reduce noise. > > Signed-off-by: Ben Guthro <benjamin.guthro@citrix.com>Looks good to me now, but needs an ack by Keir (or George, considering the minor change to xen/common/domain.c is perhaps acceptable without formal ack). Thanks, Jan
George Dunlap
2013-Mar-28 12:18 UTC
Re: [PATCH] x86/S3: Restore broken vcpu affinity on resume (v4)
On Wed, Mar 27, 2013 at 1:13 PM, Ben Guthro <benjamin.guthro@citrix.com> wrote:> When in SYS_STATE_suspend, and going through the cpu_disable_scheduler > path, save a copy of the current cpu affinity, and mark a flag to > restore it later. > > Later, in the resume process, when enabling nonboot cpus restore these > affinities. > > v2: > Formatting: Fix hard tabs. > remove early return in cpu_disable_scheduler() path. > > v3: > Formatting: Fix remaining errant tab. > Move restore_vcpu_affinity() to thaw_domains(), eliminating the need to > promote for_each_cpupool() > > v4: > Formatting: Fix if statement spacing. > Eliminate unnecessary if statement in thaw_domains() > Suppress affinity related logging to XENLOG_DEBUG to reduce noise. > > Signed-off-by: Ben Guthro <benjamin.guthro@citrix.com>I''m not super-familiar with the save/restore paths; but it looks like a reasonable change to me: Acked-by: George Dunlap <george.dunlap@eu.citrix.com>
Ben Guthro
2013-Apr-01 19:47 UTC
Re: [PATCH] x86/S3: Restore broken vcpu affinity on resume (v4)
On 03/28/2013 08:18 AM, George Dunlap wrote:> On Wed, Mar 27, 2013 at 1:13 PM, Ben Guthro <benjamin.guthro@citrix.com> wrote: >> When in SYS_STATE_suspend, and going through the cpu_disable_scheduler >> path, save a copy of the current cpu affinity, and mark a flag to >> restore it later. >> >> Later, in the resume process, when enabling nonboot cpus restore these >> affinities. >> >> v2: >> Formatting: Fix hard tabs. >> remove early return in cpu_disable_scheduler() path. >> >> v3: >> Formatting: Fix remaining errant tab. >> Move restore_vcpu_affinity() to thaw_domains(), eliminating the need to >> promote for_each_cpupool() >> >> v4: >> Formatting: Fix if statement spacing. >> Eliminate unnecessary if statement in thaw_domains() >> Suppress affinity related logging to XENLOG_DEBUG to reduce noise. >> >> Signed-off-by: Ben Guthro <benjamin.guthro@citrix.com> > > I''m not super-familiar with the save/restore paths; but it looks like > a reasonable change to me: > > Acked-by: George Dunlap <george.dunlap@eu.citrix.com> >Thanks George. Jan, Is this a sufficient Ack, or does Kier need to weigh in as well? Thanks, Ben
Keir Fraser
2013-Apr-01 20:17 UTC
Re: [PATCH] x86/S3: Restore broken vcpu affinity on resume (v4)
On 01/04/2013 20:47, "Ben Guthro" <Benjamin.Guthro@citrix.com> wrote:>>> v4: >>> Formatting: Fix if statement spacing. >>> Eliminate unnecessary if statement in thaw_domains() >>> Suppress affinity related logging to XENLOG_DEBUG to reduce noise. >>> >>> Signed-off-by: Ben Guthro <benjamin.guthro@citrix.com> >> >> I''m not super-familiar with the save/restore paths; but it looks like >> a reasonable change to me: >> >> Acked-by: George Dunlap <george.dunlap@eu.citrix.com> >> > > Thanks George. > > Jan, > > Is this a sufficient Ack, or does Kier need to weigh in as well?Acked-by: Keir Fraser <keir@xen.org> I kind of feel this sort of thing should be done in dom0 userspace but, pragmatically, I know that''s not really going to happen! -- Keir
Jan Beulich
2013-Apr-02 07:49 UTC
Re: [PATCH] x86/S3: Restore broken vcpu affinity on resume (v4)
>>> On 01.04.13 at 22:17, Keir Fraser <keir.xen@gmail.com> wrote: > On 01/04/2013 20:47, "Ben Guthro" <Benjamin.Guthro@citrix.com> wrote: > >>>> v4: >>>> Formatting: Fix if statement spacing. >>>> Eliminate unnecessary if statement in thaw_domains() >>>> Suppress affinity related logging to XENLOG_DEBUG to reduce noise. >>>> >>>> Signed-off-by: Ben Guthro <benjamin.guthro@citrix.com> >>> >>> I''m not super-familiar with the save/restore paths; but it looks like >>> a reasonable change to me: >>> >>> Acked-by: George Dunlap <george.dunlap@eu.citrix.com> >>> >> >> Thanks George. >> >> Jan, >> >> Is this a sufficient Ack, or does Kier need to weigh in as well? > > Acked-by: Keir Fraser <keir@xen.org> > > I kind of feel this sort of thing should be done in dom0 userspace but, > pragmatically, I know that''s not really going to happen!How would that work, considering the Dom0''s vCPU affinities are equally affected, and considering that all domains get thawed at once? Minimally you''d have a burst of (overcommit) load on pCPU0 right after resume, and whether the system would survive that is impossible to predict. Jan
Keir Fraser
2013-Apr-03 14:29 UTC
Re: [PATCH] x86/S3: Restore broken vcpu affinity on resume (v4)
On 02/04/2013 08:49, "Jan Beulich" <JBeulich@suse.com> wrote:>>> Thanks George. >>> >>> Jan, >>> >>> Is this a sufficient Ack, or does Kier need to weigh in as well? >> >> Acked-by: Keir Fraser <keir@xen.org> >> >> I kind of feel this sort of thing should be done in dom0 userspace but, >> pragmatically, I know that''s not really going to happen! > > How would that work, considering the Dom0''s vCPU affinities are > equally affected, and considering that all domains get thawed at > once? Minimally you''d have a burst of (overcommit) load on pCPU0 > right after resume, and whether the system would survive that is > impossible to predict.Ha, yes, well you''d put freeze/thaw in dom0 too. Well, it wasn''t really a serious suggestion on my part, what we have works, this new patch will also work, and I''m happy. :) -- Keir