Commit Graph

13 Commits

Author SHA1 Message Date
Félix Baylac Jacqué 891dd0bf95
Cache cold build: prevent race condition
We had a race condition on cold cache builds: when several nix-gl-host
instances were launched at the same time, they would try to build the
cache concurrently.

This led to some weird errors and busted caches.

We introduce a file lock preventing any concurrent access to the
cache.

We take advantage of this bugfix to rethink the way we build the cache
and make it more robust. Instead of building it in place, we now build
it first in a temporary directory (while making sure to patchelf the
DSOs according to their final destination). We then move this directory
to the actual cache destination iff the cache has been built
successfully.
2023-01-17 16:14:16 +01:00
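
A minimal sketch of the locking and build-then-move approach, assuming
a POSIX system; the paths, the `build_fn` callback and all names are
illustrative, not the actual nix-gl-host code:

```python
import fcntl
import os
import shutil
import tempfile

def build_cache_atomically(cache_root: str, cache_dir: str, build_fn) -> None:
    # Serialize concurrent nix-gl-host instances with an exclusive
    # file lock, build the cache in a temporary directory, and only
    # move it into place once the build fully succeeded.
    os.makedirs(cache_root, exist_ok=True)
    lock_path = os.path.join(cache_root, "cache.lock")
    with open(lock_path, "w") as lock:
        fcntl.flock(lock, fcntl.LOCK_EX)  # blocks until we own the lock
        try:
            # Same filesystem as the final destination, so the final
            # move is a cheap (and atomic) rename.
            tmp_dir = tempfile.mkdtemp(prefix="nix-gl-host-", dir=cache_root)
            # build_fn must patchelf the DSOs according to their
            # *final* destination, not the temporary one.
            build_fn(tmp_dir, final_dir=cache_dir)
            if os.path.isdir(cache_dir):
                shutil.rmtree(cache_dir)
            os.rename(tmp_dir, cache_dir)
        finally:
            fcntl.flock(lock, fcntl.LOCK_UN)
```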
Félix Baylac Jacqué 4f51b37c29
nvidia_main: refactor metadata generation out into a separate function
This nvidia_main function got out of control. Let's chop the metadata
generation out of it.
2023-01-17 12:41:07 +01:00
Félix Baylac Jacqué 37d4a1b248
Bugfix: do not try to copy and patch nonexistent DSOs
It seems like some driver installations are missing some NVidia
subsystems. We stumbled upon the case of somebody missing the Cuda
libraries.

This ended up making the patchelf call fail.

We now prevent the copy/patch routine from running when we do not have
any DSO to copy/patch.
2022-12-22 19:01:31 +01:00
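
A hedged sketch of the guard, with illustrative names (the real
copy/patch routine obviously does more than this):

```python
import shutil
import subprocess

def copy_and_patch_dsos(dsos: list[str], dest_dir: str, rpath: str) -> None:
    # Some driver installations miss whole subsystems (e.g. the Cuda
    # libraries), so the DSO list may be empty: bail out early instead
    # of handing patchelf a nonexistent file.
    if not dsos:
        return
    for dso in dsos:
        target = shutil.copy(dso, dest_dir)
        subprocess.run(["patchelf", "--set-rpath", rpath, target], check=True)
```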
Félix Baylac Jacqué eb782d3fd5 Bug: use mtime instead of atime as timestamp
atime == last time a file has been accessed.
mtime == last time a file has been modified.

Facepalm on that one…
2022-12-19 20:53:14 +01:00
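
For reference, the two timestamps as exposed by Python's os.stat (the
path is purely illustrative):

```python
import os

st = os.stat("/usr/lib/libGLX_nvidia.so.0")  # illustrative path
print(st.st_atime)  # last access: bumped by mere reads, useless here
print(st.st_mtime)  # last modification: what the cache should compare
```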
Félix Baylac Jacqué c7c5f08cbc Cold cache: optimize cache generation
We managed to improve the cold cache generation time from 98s to ~30s
on my desktop.

Two things have been done to improve that performance:

1. This one was stupid. I left a debug tracing routine in the code that
   should have been removed… This tracing routine was forcing us to
   cache the libraries… …twice. Massive facepalm. Addressing this
   reduced the cold runtime by 50%.
2. Instead of spinning up a patchelf subprocess for each library, we
   batch these operations as much as possible in a single subprocess
   (see the sketch below). This trick shaves about 30% off the
   remaining runtime.
2022-12-14 19:20:31 +01:00
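
A sketch of the batching trick, assuming a patchelf version that
accepts several files in a single invocation; names are illustrative:

```python
import subprocess

def batch_set_rpath(dsos: list[str], rpath: str) -> None:
    # One patchelf process for the whole batch instead of one per
    # library: the per-subprocess spawn overhead was dominating the
    # cold-cache runtime.
    if not dsos:
        return
    subprocess.run(["patchelf", "--set-rpath", rpath, *dsos], check=True)
```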
Félix Baylac Jacqué 97e35d20fa Rename nixglhost_wrapper -> nixglhost 2022-12-14 19:20:31 +01:00
Félix Baylac Jacqué 3ff2f01812 Hot Cache: use the DSO last write time/size instead of content hash
After profiling a nixglhost hot run, it turns out that we were
spending more than 98% of the run time reading and sha256-hashing
files.

Let's give up on content hashing the files and assume that using their
name, size and last write time is good enough.

On a hot run, we reduce the run time from about 3s to 0.3s on an
NVMe-powered Ryzen 7 desktop.

I guess this 10x speedup is probably worth the little cache correctness
we lose on the way.
2022-12-14 19:20:10 +01:00
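
Roughly, the fingerprint now has this shape (the exact format is
illustrative); a stat() per DSO replaces a full read-and-hash pass:

```python
import os

def dso_fingerprint(path: str) -> str:
    # Name + size + mtime: cheap to compute, good enough to detect a
    # driver update, and no file content is ever read.
    st = os.stat(path)
    return f"{os.path.basename(path)}-{st.st_size}-{st.st_mtime}"
```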
Félix Baylac Jacqué 61a5fcdae8 Re-thinking the host library auto detection mechanism.
We made the incorrect assumption that the first DSO we'd stumble upon
in the load path would be the most appropriate one for the host system,
i.e. that it'd be the x86_64-gnu-linux one on such a system. That
turned out not to be the case, meaning we can't take such a shortcut
and have to handle the case where we end up with multiple copies of the
same library built for different archs.

This forced us to do two things:

1. It forced us to rethink the cache directory structure. We now have
   to cache each library directory separately, meaning we have to
   inject the GLX/EGL/Cuda subdirectories of *all* these library
   directories into the LD_LIBRARY_PATH. The dynamic linker will then
   try to open these DSOs (and fail) until it stumbles upon the
   appropriate one (see the sketch after this entry).
2. We had to give up on the way we injected EGL libraries using
   absolute paths: we don't know which DSO is adapted to the wrapped
   program's arch. Instead of injecting the absolute paths through the
   JSON configuration files, we just list the library names in them. We
   then inject the various EGL DSOs we find through the
   LD_LIBRARY_PATH, similarly to what we already do for GLX and Cuda.
2022-12-14 14:32:54 +01:00
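
A sketch of the resulting LD_LIBRARY_PATH assembly; the subdirectory
names and the cache layout are illustrative assumptions:

```python
import os

def assemble_ld_library_path(cached_lib_dirs: list[str]) -> str:
    # Every cached library directory contributes its GLX/EGL/Cuda
    # subdirectories. The dynamic linker will try (and fail) to load
    # the wrong-arch DSOs until it finds a loadable one.
    paths = []
    for lib_dir in cached_lib_dirs:
        for subsystem in ("glx", "egl", "cuda"):
            paths.append(os.path.join(lib_dir, subsystem))
    previous = os.environ.get("LD_LIBRARY_PATH")
    if previous:
        paths.append(previous)
    return ":".join(paths)
```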
Félix Baylac Jacqué 19ae538413 Add a flag to print the nix-gl-host LD_LIBRARY_PATH
In some situations, such as environment setup scripts, we'd like to
print the nix-gl-host LD_LIBRARY_PATH instead of running a binary.
2022-12-06 18:06:41 +01:00
Ivor Wanders 95456b5ed1 Make the copied library writable.
We need to make sure the library is writable before trying to patchelf
it.
2022-12-06 10:54:21 +01:00
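
A sketch of the fix: shutil.copy preserves the source file's mode bits,
and host driver files are frequently not user-writable, so the user
write bit has to be added explicitly (names illustrative):

```python
import os
import stat

def make_writable(path: str) -> None:
    # patchelf rewrites the binary in place, so the copied DSO needs
    # the user write bit before we can patch it.
    mode = os.stat(path).st_mode
    os.chmod(path, mode | stat.S_IWUSR)
```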
Félix Baylac Jacqué 8e93fe5dea DSO search: Add escape hatch
We add a -d/--driver-directory flag allowing the user to circumvent
the DSO automatic lookup and instead force nix-gl-host to load its
dynamic libraries from a specific directory.
2022-12-05 15:34:29 +01:00
Félix Baylac Jacqué d020c8f4fe Exec: search for executables in the PATH
We now use execvp instead of execv: it'll search the PATH for the
binary if the bin_path provided is not an explicit path.
2022-12-05 15:30:43 +01:00
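
In Python terms (the argument handling is illustrative):

```python
import os
import sys

# os.execvp consults the PATH when the command has no directory
# component, so a bare binary name works as well as an absolute path.
bin_path = sys.argv[1]
os.execvp(bin_path, sys.argv[1:])  # argv[0] of the new process included
```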
Félix Baylac Jacqué 375148c949 Introduce a cache system
Copying & patching all the DSOs is a time-consuming process (~10s on a
computer with a slow hard drive). We definitely don't want to go
through it for each process start, so we need to introduce a cache.

For this cache, we go the conservative way. We're going to "resolve" a
DSO name (i.e. find the DSO's absolute path) and sha256-hash each DSO.
We then compare the fingerprints to determine whether or not we need to
nuke and rebuild the DSO cache.

The cache state is persisted through a JSON file saved in the cache dir.
2022-12-05 13:33:01 +01:00
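
A sketch of that conservative fingerprinting, with illustrative names;
the JSON layout is an assumption, not the actual state file format:

```python
import hashlib
import json
import os

def sha256_of(path: str) -> str:
    # Stream the DSO through sha256 in 1 MiB chunks.
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(1 << 20), b""):
            h.update(chunk)
    return h.hexdigest()

def cache_is_stale(resolved_dsos: list[str], state_file: str) -> bool:
    # Compare freshly computed fingerprints against the ones persisted
    # by the previous run; any difference means nuke and rebuild.
    fingerprints = {p: sha256_of(p) for p in resolved_dsos}
    try:
        with open(state_file) as f:
            return json.load(f) != fingerprints
    except FileNotFoundError:
        return True
```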