Greg KH
2024-Sep-26 09:14 UTC
[RFC 00/29] Introduce NVIDIA GPU Virtualization (vGPU) Support
On Mon, Sep 23, 2024 at 12:01:40PM -0300, Jason Gunthorpe wrote:> On Mon, Sep 23, 2024 at 10:49:07AM +0200, Danilo Krummrich wrote: > > > 2. Proposal for upstream > > > =======================> > > > What is the strategy in the mid / long term with this? > > > > As you know, we're trying to move to Nova and the blockers with the device / > > driver infrastructure have been resolved and we're able to move forward. Besides > > that, Dave made great progress on the firmware abstraction side of things. > > > > Is this more of a proof of concept? Do you plan to work on Nova in general and > > vGPU support for Nova? > > This is intended to be a real product that customers would use, it is > not a proof of concept. There is alot of demand for this kind of > simplified virtualization infrastructure in the host side. The series > here is the first attempt at making thin host infrastructure and > Zhi/etc are doing it with an upstream-first approach. > > >From the VFIO side I would like to see something like this merged in > nearish future as it would bring a previously out of tree approach to > be fully intree using our modern infrastructure. This is a big win for > the VFIO world. > > As a commercial product this will be backported extensively to many > old kernels and that is harder/impossible if it isn't exclusively in > C. So, I think nova needs to co-exist in some way.Please never make design decisions based on old ancient commercial kernels that have any relevance to upstream kernel development today. If you care about those kernels, work with the companies that get paid to support such things. Otherwise development upstream would just completely stall and never go forward, as you well know. As it seems that future support for this hardware is going to be in rust, just use those apis going forward and backport the small number of missing infrastructure patches to the relevant ancient kernels as well, it's not like that would even be noticed in the overall number of patches they take for normal subsystem improvements :) thanks, greg k-h
Jason Gunthorpe
2024-Sep-26 12:42 UTC
[RFC 00/29] Introduce NVIDIA GPU Virtualization (vGPU) Support
On Thu, Sep 26, 2024 at 11:14:27AM +0200, Greg KH wrote:> On Mon, Sep 23, 2024 at 12:01:40PM -0300, Jason Gunthorpe wrote: > > On Mon, Sep 23, 2024 at 10:49:07AM +0200, Danilo Krummrich wrote: > > > > 2. Proposal for upstream > > > > =======================> > > > > > What is the strategy in the mid / long term with this? > > > > > > As you know, we're trying to move to Nova and the blockers with the device / > > > driver infrastructure have been resolved and we're able to move forward. Besides > > > that, Dave made great progress on the firmware abstraction side of things. > > > > > > Is this more of a proof of concept? Do you plan to work on Nova in general and > > > vGPU support for Nova? > > > > This is intended to be a real product that customers would use, it is > > not a proof of concept. There is alot of demand for this kind of > > simplified virtualization infrastructure in the host side. The series > > here is the first attempt at making thin host infrastructure and > > Zhi/etc are doing it with an upstream-first approach. > > > > >From the VFIO side I would like to see something like this merged in > > nearish future as it would bring a previously out of tree approach to > > be fully intree using our modern infrastructure. This is a big win for > > the VFIO world. > > > > As a commercial product this will be backported extensively to many > > old kernels and that is harder/impossible if it isn't exclusively in > > C. So, I think nova needs to co-exist in some way. > > Please never make design decisions based on old ancient commercial > kernels that have any relevance to upstream kernel development > today.Greg, you are being too extreme. Those "ancient commercial kernels" have a huge relevance to alot of our community because they are the users that actually run the code we are building and pay for it to be created. Yes we usually (but not always!) push back on accommodations upstream, but taking hard dependencies on rust is currently a very different thing.> If you care about those kernels, work with the companies that get paid > to support such things. Otherwise development upstream would just > completely stall and never go forward, as you well know.They seem to be engaged, but upstream rust isn't even done yet. So what exactly do you expect them to do? Throw out whole architectures from their products? I know how things work, I just don't think we are ready to elevate Rust to the category of decisions where upstream can ignore the downstream side readiness. In my view the community needs to agree to remove the experimental status from Rust first.> As it seems that future support for this hardware is going to be in > rust, just use those apis going forward and backport the small number of"those apis" don't even exist yet! There is a big multi-year gap between when pure upstream would even be ready to put something like VFIO on top of Nova and Rust and where we are now with this series. This argument is *way too early*. I'm deeply hoping we never have to actually have it, that by the time Nova gets merged Rust will be 100% ready upstream and there will be no issue. Please? Can that happen? Otherwise, let's slow down here. Nova is still years away from being finished. Nouveau is the in-tree driver for this HW. This series improves on Nouveau. We are definitely not at the point of refusing new code because it is not writte in Rust, RIGHT? Jason