nix-gl-host

Commit Graph

Author	SHA1	Message	Date
Félix Baylac Jacqué	891dd0bf95	Cache cold build: prevent race condition We had a race condition on cold cache builds. When several nix-gl-host instances were called, they were trying to concurrently build the cache. This lead to some weird errors and busted caches. We introduce a file lock preventing any concurrent access to the cache. We take advantage of this bug to rethink the way we build the cache and do it in a more robust way. Instead of building it in place, we're now building it first in a temporary directory (while making sure to patchelf the DSOs according their final destination). We then move this directory to the actual cache destination iff the cache has been built successfully.	2023-01-17 16:14:16 +01:00
Félix Baylac Jacqué	4f51b37c29	nvidia_main: refactor out metadata generation in separate function This nvidia_main function got out of control. Let's chop the metadata generation out of it.	2023-01-17 12:41:07 +01:00
Félix Baylac Jacqué	c1680e0c7c	flake: add meta.mainProgram Allow us to run the program through `nix run`.	2023-01-05 16:48:38 +01:00
Félix Baylac Jacqué	3876bce7a1	Flake: overlays removal leftovers. I forgot to remove an instance of the nixpkgs overlay I removed two commits ago…	2022-12-22 19:14:23 +01:00
Félix Baylac Jacqué	37d4a1b248	Bugfix: do not try to copy and patch unexisting DSOs It seems like some drivers installations are missing some NVidia subsystems. We stumbled upon the case of somebody missing the Cuda libraries. It did end up making fail the patchelf call. Preventing the copying/patch routine to run when we do not have any DSO to copy/patch.	2022-12-22 19:01:31 +01:00
Félix Baylac Jacqué	09fda6b9bd	Remove unused nixpkgs overlay That nixpkgs overlay was used to inject a patched libglvnd. We're not using that patch anymore, there's no reason to keep this overlay around.	2022-12-22 10:15:46 +01:00
Félix Baylac Jacqué	eb782d3fd5	Bug: use mtime instead of atime as timestamp atime == last time a file has been accessed. mtime == last time a file has been modified. Facepalm on that one…	2022-12-19 20:53:14 +01:00
Félix Baylac Jacqué	c7c5f08cbc	Cold cache: optimize cache generation We manage to improve the cold cache generation from 98s to ~30s on my desktop. Two things have been done to improve that performance: 1. This one was stupid. I forgot a debug tracing routine that should have been removed in the code… This tracing routine was forcing us to cache the libraries… …twice. Massive facepalm. Addressing this reduced the cold runtime by 50%. 2. Instead of spinning up a patchelf subprocess for each library, we batch these operations as much as possible in a single subprocess. This trick shaves about 30% of the remaining runtime.	2022-12-14 19:20:31 +01:00
Félix Baylac Jacqué	97e35d20fa	Rename nixglhost_wrapper -> nixglhost	2022-12-14 19:20:31 +01:00
Félix Baylac Jacqué	3ff2f01812	Hot Cache: use the DSO last write time/size instead of content hash After profiling a nixglhost hot run, it turns out that we were spending more than 98% of the run time reading and sha256-hashing files. Let's give up on content hashing the files and assume that using their name, size and last write time is good enough. On a hot run, we reduce the run time from about 3s to 0.3s on a nvme-powered ryzen 7 desktop. I guess this 10x speedup probably worth the little cache corectness we lose on the way.	2022-12-14 19:20:10 +01:00
Félix Baylac Jacqué	61a5fcdae8	Re-thinking the host library auto detection mechanism. We made the incorrect assumption that the first DSO we'd stumble upon in the load path would be the most appropriate one for the host system. IE. it'd be the x86_64-gnu-linux on such a system. It turned out not being the case, meaning we can't take such a shortcut and have to handle the case where we end up with multiple same libraries for different archs. This forced us to do two things: 1. It forced us to rethink the cache directory structure. We now have to cache each library directory separately. Meaning we now have to inject the GLX/EGL/Cuda directories of all these library directories subpaths to the LD_LIBRARY_PATH. The dynamic linker will then try to open these DSOs (and fail) until it stumble upon the appropriate one. 2. We had to give up on the way we injected EGL libraries using asolute paths. We don't know which DSO is adapted to the wrapped program arch. Instead of injecting the absolute paths through the JSON configuration files, we just stipulate the libraries names in them. We then inject the various EGL DSOs we find through the LD_LIBRARY_PATH, similarly to what we already do for GLX and Cuda.	2022-12-14 14:32:54 +01:00
Félix Baylac Jacqué	19ae538413	Add a flag to print the nix-gl-host LD_LIBRARY_PATH In some situations, such as environment setup scripts, we'd like to print the nix-gl-host LD_LIBRARY_PATH instead of running a binary.	2022-12-06 18:06:41 +01:00
Ivor Wanders	95456b5ed1	Make the copied library writable. We need to make sure the library is writable before trying to patchelf it.	2022-12-06 10:54:21 +01:00
Félix Baylac Jacqué	29f82cbd52	Update doc	2022-12-05 15:38:58 +01:00
Félix Baylac Jacqué	8e93fe5dea	DSO search: Add escape hatch We add a -d/--driver-directory flag allowing the user to circomvent the DSO automatic lookup and instead force nix-gl-host to load its dynamic libraries from a specific directory.	2022-12-05 15:34:29 +01:00
Félix Baylac Jacqué	d020c8f4fe	Exec: search for executables in the PATH Using execvp instead of execv. It'll look for the binary in PATH if the bin_path provided is not an explicit path.	2022-12-05 15:30:43 +01:00
Félix Baylac Jacqué	375148c949	Introduce a cache system Copying & patching all the DSOs is a time consuming process (~10s on a slow hard drive computer). We definitely don't want to go through it for each process start, we need to introduce a cache. For this cache, we go the concervative way. We're going to "resolve" a DSO name (ie. find the DSO absolute path) and sha256-hash each DSO. We're then going to compare the fingerprints to determine whether or not we need to nuke and rebuild the DSO cache. The cache state is persisted through a JSON file saved in the cache dir.	2022-12-05 13:33:01 +01:00
Félix Baylac Jacqué	494bac3613	Types: add return types to function Removing two unused functions at the same time.	2022-12-04 09:21:02 +01:00
Félix Baylac Jacqué	4d4b5e2dad	GLX support via LD_LIBRARY_PATH	2022-12-04 09:21:02 +01:00
Félix Baylac Jacqué	314da66ae5	Introduce Cuda support	2022-12-04 09:16:40 +01:00
Félix Baylac Jacqué	8100ad24d6	DSO scanning: separately scan the cuda drivers We create a new function in charge of finding the CUDA DSOs. We also modify the find_nvidia_dsos function and remove the cuda-related libraries from its output. We take advantage of this new feature to factor out the file searching logic in its own function.	2022-12-04 09:12:43 +01:00
Félix Baylac Jacqué	6fb5f29175	EGL support for the Nvidia proprietary driver Good news: we did not have to patch libglvnd for the EGL support. All the low-level machinery was already here (but sadly undocumented). Bad news: properly supporting the EGL stack turned out being more involving than its GLX counterpart on the wrapper side. Not only you need the main EGL lib in charge of implementing the primitives (libEGL_nvidia.so), but also two other libraries implementing the wayland and gbm bindings. These DSOs in turn depend on some non-glibc but open source shared libraries. I decided to use the ones coming from the host system rather than the ones provided by Nixpkgs: it's best to assume that the host system did its homework to determine which version of these libraries the Nvidia driver is expecting to work with.	2022-11-29 17:55:39 +01:00
Félix Baylac-Jacqué	39d40f4922	Readme: status update	2022-11-24 17:12:22 +01:00
Félix Baylac-Jacqué	7e49ce6fbd	Nix: add patched libglvnd overlay	2022-11-24 17:03:46 +01:00
Félix Baylac-Jacqué	0ccc67bbb4	nix-gl-host: first relatively naive implementation First take. For now, we're only try to support the Nvidia proprietary driver. We also cut quite some corners :) We hardcode a list of DSOs we're looking for in the code. That's obviously the best long-term decision, we'll have to revise this particular approach later on. We're looking for these listed DSOs in the GL_VENDOR_PATH provided by the user. We'll need to patch these DSOs and we obviously don't want to alter the host OS configuration. So we have to first copy them to the user XDG cache directory. Once copied, we alter their runpath to point to the user cache dir: these DSO can depend on each other. Finally, we point the patched libglvnd GLX implementation to the cache dir and replace the current process with the target one.	2022-11-24 17:03:46 +01:00
Félix Baylac Jacqué	834d7d56c9	Add Nix build infrastructure	2022-11-24 17:03:42 +01:00
Félix Baylac Jacqué	91d1d09d89	Init: add short readme describing the project's goal	2022-11-22 10:28:40 +01:00

27 Commits All Branches Search

27 Commits

All Branches