Timur Tabi
2024-Jan-31 21:39 UTC
[PATCH] drm/nouveau: nouveau_sched_fini() should check for init failure
If initialization fails, Nouveau can still call the _fini() function to clean up, with the expectation that the function can handle if its corresponding _init() function was never called or exited with error. Such is not the case with nouveau_sched_fini(), which still attempts to wait for jobs to finish even if the underlying data structures were never initialized. Fixes: 5f03a507b29e ("drm/nouveau: implement 1:1 scheduler - entity relationship") Signed-off-by: Timur Tabi <ttabi at nvidia.com> --- drivers/gpu/drm/nouveau/nouveau_sched.c | 5 +++++ drivers/gpu/drm/nouveau/nouveau_sched.h | 2 ++ 2 files changed, 7 insertions(+) diff --git a/drivers/gpu/drm/nouveau/nouveau_sched.c b/drivers/gpu/drm/nouveau/nouveau_sched.c index dd98f6910f9c..9c771bc0e332 100644 --- a/drivers/gpu/drm/nouveau/nouveau_sched.c +++ b/drivers/gpu/drm/nouveau/nouveau_sched.c @@ -443,6 +443,8 @@ nouveau_sched_init(struct nouveau_sched *sched, struct nouveau_drm *drm, INIT_LIST_HEAD(&sched->job.list.head); init_waitqueue_head(&sched->job.wq); + sched->initialized = true; + return 0; fail_sched: @@ -459,6 +461,9 @@ nouveau_sched_fini(struct nouveau_sched *sched) struct drm_gpu_scheduler *drm_sched = &sched->base; struct drm_sched_entity *entity = &sched->entity; + if (!sched->initialized) + return; + rmb(); /* for list_empty to work without lock */ wait_event(sched->job.wq, list_empty(&sched->job.list.head)); diff --git a/drivers/gpu/drm/nouveau/nouveau_sched.h b/drivers/gpu/drm/nouveau/nouveau_sched.h index a6528f5981e6..351931c706aa 100644 --- a/drivers/gpu/drm/nouveau/nouveau_sched.h +++ b/drivers/gpu/drm/nouveau/nouveau_sched.h @@ -109,6 +109,8 @@ struct nouveau_sched { } list; struct wait_queue_head wq; } job; + + bool initialized; }; int nouveau_sched_init(struct nouveau_sched *sched, struct nouveau_drm *drm, -- 2.34.1
Danilo Krummrich
2024-Feb-01 12:58 UTC
[PATCH] drm/nouveau: nouveau_sched_fini() should check for init failure
On 1/31/24 22:39, Timur Tabi wrote:> If initialization fails, Nouveau can still call the _fini() function > to clean up, with the expectation that the function can handle if its > corresponding _init() function was never called or exited with error. > > Such is not the case with nouveau_sched_fini(), which still attempts > to wait for jobs to finish even if the underlying data structures were > never initialized.Good catch!> > Fixes: 5f03a507b29e ("drm/nouveau: implement 1:1 scheduler - entity relationship") > Signed-off-by: Timur Tabi <ttabi at nvidia.com> > --- > drivers/gpu/drm/nouveau/nouveau_sched.c | 5 +++++ > drivers/gpu/drm/nouveau/nouveau_sched.h | 2 ++ > 2 files changed, 7 insertions(+) > > diff --git a/drivers/gpu/drm/nouveau/nouveau_sched.c b/drivers/gpu/drm/nouveau/nouveau_sched.c > index dd98f6910f9c..9c771bc0e332 100644 > --- a/drivers/gpu/drm/nouveau/nouveau_sched.c > +++ b/drivers/gpu/drm/nouveau/nouveau_sched.c > @@ -443,6 +443,8 @@ nouveau_sched_init(struct nouveau_sched *sched, struct nouveau_drm *drm, > INIT_LIST_HEAD(&sched->job.list.head); > init_waitqueue_head(&sched->job.wq); > > + sched->initialized = true;I wonder if we should allocate struct nouveau_sched dynamically instead and just check for NULL in the corresponding *_fini() functions. Actually, in nouveau_abi16_ioctl_channel_alloc() we can omit creating a scheduler instance entirely if !nouveau_cli_uvmm(). - Danilo> + > return 0; > > fail_sched: > @@ -459,6 +461,9 @@ nouveau_sched_fini(struct nouveau_sched *sched) > struct drm_gpu_scheduler *drm_sched = &sched->base; > struct drm_sched_entity *entity = &sched->entity; > > + if (!sched->initialized) > + return; > + > rmb(); /* for list_empty to work without lock */ > wait_event(sched->job.wq, list_empty(&sched->job.list.head)); > > diff --git a/drivers/gpu/drm/nouveau/nouveau_sched.h b/drivers/gpu/drm/nouveau/nouveau_sched.h > index a6528f5981e6..351931c706aa 100644 > --- a/drivers/gpu/drm/nouveau/nouveau_sched.h > +++ b/drivers/gpu/drm/nouveau/nouveau_sched.h > @@ -109,6 +109,8 @@ struct nouveau_sched { > } list; > struct wait_queue_head wq; > } job; > + > + bool initialized; > }; > > int nouveau_sched_init(struct nouveau_sched *sched, struct nouveau_drm *drm,