Message boards : Number crunching : External GPU - BOINC's device number does not match acemd-922-80.exe's device number
Author | Message |
---|---|
I've got a 1060 6GB in an Akitio Node Thunderbolt 3 enclosure, and during the night, I'd like it to help heat my room, so I use it for BOINC. | |
ID: 50993 | Rating: 0 | rate: / Reply Quote | |
With multiple manufactures they will each be device 0. You'll probably need the <type> command as well to exclude the Intel device at index 0. | |
ID: 50994 | Rating: 0 | rate: / Reply Quote | |
Hm, but the problem isn't that it's trying to run on the Intel card, but just that the index is different between BOINC (which calls GPUGrid), and GPUGrid itself. So BOINC thinks it's telling GPUGrid to run on one card, but it's really running on another. | |
ID: 50997 | Rating: 0 | rate: / Reply Quote | |
Just curious, what does the startup log says about all this? and what does your cc_config look like. Could you post both so we can look them over? | |
ID: 50998 | Rating: 0 | rate: / Reply Quote | |
Sure - here's my cc_config.xml, just enabling multiple GPUs: <cc_config> <options> <use_all_gpus>1</use_all_gpus> </options> </cc_config> And the startup log... 12/4/2018 1:20:42 AM | | Starting BOINC client version 7.14.2 for windows_x86_64 12/4/2018 1:20:42 AM | | log flags: file_xfer, sched_ops, task 12/4/2018 1:20:42 AM | | Libraries: libcurl/7.47.1 OpenSSL/1.0.2g zlib/1.2.8 12/4/2018 1:20:42 AM | | Data directory: C:\ProgramData\BOINC 12/4/2018 1:20:42 AM | | Running under account Falcon 12/4/2018 1:20:44 AM | | CUDA: NVIDIA GPU 0: GeForce GTX 1060 6GB (driver version 417.01, CUDA version 10.0, compute capability 6.1, 4096MB, 3564MB available, 4568 GFLOPS peak) 12/4/2018 1:20:44 AM | | CUDA: NVIDIA GPU 1: GeForce 940MX (driver version 417.01, CUDA version 10.0, compute capability 5.0, 2048MB, 1686MB available, 881 GFLOPS peak) 12/4/2018 1:20:44 AM | | OpenCL: NVIDIA GPU 0: GeForce GTX 1060 6GB (driver version 417.01, device version OpenCL 1.2 CUDA, 6144MB, 3564MB available, 4568 GFLOPS peak) 12/4/2018 1:20:44 AM | | OpenCL: NVIDIA GPU 0: GeForce GTX 1060 6GB (driver version 417.01, device version OpenCL 1.2 CUDA, 6144MB, 3564MB available, 4568 GFLOPS peak) 12/4/2018 1:20:44 AM | | OpenCL: NVIDIA GPU 1: GeForce 940MX (driver version 417.01, device version OpenCL 1.2 CUDA, 2048MB, 1686MB available, 881 GFLOPS peak) 12/4/2018 1:20:44 AM | | OpenCL: NVIDIA GPU 1: GeForce 940MX (driver version 417.01, device version OpenCL 1.2 CUDA, 2048MB, 1686MB available, 881 GFLOPS peak) 12/4/2018 1:20:44 AM | | OpenCL: Intel GPU 0: Intel(R) HD Graphics 630 (driver version 22.20.16.4799, device version OpenCL 2.1, 6489MB, 6489MB available, 211 GFLOPS peak) 12/4/2018 1:20:44 AM | | OpenCL CPU: Intel(R) Core(TM) i7-7820HQ CPU @ 2.90GHz (OpenCL driver vendor: Intel(R) Corporation, driver version 7.2.0.10, device version OpenCL 2.1 (Build 10)) 12/4/2018 1:20:44 AM | | Host name: DESKTOP-JKHBDQ2 12/4/2018 1:20:44 AM | | Processor: 8 GenuineIntel Intel(R) Core(TM) i7-7820HQ CPU @ 2.90GHz [Family 6 Model 158 Stepping 9] 12/4/2018 1:20:44 AM | | Processor features: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss htt tm pni ssse3 fma cx16 sse4_1 sse4_2 movebe popcnt aes f16c rdrandsyscall nx lm avx avx2 vmx smx tm2 pbe fsgsbase bmi1 hle smep bmi2 12/4/2018 1:20:44 AM | | OS: Microsoft Windows 10: Professional x64 Edition, (10.00.17763.00) 12/4/2018 1:20:44 AM | | Memory: 15.86 GB physical, 20.34 GB virtual 12/4/2018 1:20:44 AM | | Disk: 476.34 GB total, 177.30 GB free 12/4/2018 1:20:44 AM | | Local time is UTC -8 hours 12/4/2018 1:20:44 AM | | No WSL found. 12/4/2018 1:20:44 AM | GPUGRID | Found app_config.xml 12/4/2018 1:20:44 AM | GPUGRID | Missing <app_config> in app_config.xml 12/4/2018 1:20:44 AM | Milkyway@Home | Found app_config.xml 12/4/2018 1:20:44 AM | SETI@home | Found app_config.xml 12/4/2018 1:20:44 AM | | Config: use all coprocessors 12/4/2018 1:20:44 AM | climateprediction.net | URL http://climateprediction.net/; Computer ID 1385937; resource share 1500 12/4/2018 1:20:44 AM | Einstein@Home | URL http://einstein.phys.uwm.edu/; Computer ID 12162809; resource share 0 12/4/2018 1:20:44 AM | GPUGRID | URL http://www.gpugrid.net/; Computer ID 493460; resource share 750 12/4/2018 1:20:44 AM | Milkyway@Home | URL http://milkyway.cs.rpi.edu/milkyway/; Computer ID 715608; resource share 100 12/4/2018 1:20:44 AM | SETI@home | URL http://setiathome.berkeley.edu/; Computer ID 8365185; resource share 100 12/4/2018 1:20:44 AM | World Community Grid | URL http://www.worldcommunitygrid.org/; Computer ID 3461727; resource share 750 12/4/2018 1:20:44 AM | World Community Grid | General prefs: from World Community Grid (last modified 02-Jan-2018 18:29:34) 12/4/2018 1:20:44 AM | World Community Grid | Host location: none 12/4/2018 1:20:44 AM | World Community Grid | General prefs: using your defaults 12/4/2018 1:20:44 AM | | Reading preferences override file 12/4/2018 1:20:44 AM | | Preferences: 12/4/2018 1:20:44 AM | | max memory usage when active: 12181.12 MB 12/4/2018 1:20:44 AM | | max memory usage when idle: 14617.35 MB 12/4/2018 1:20:44 AM | | max disk usage: 180.25 GB 12/4/2018 1:20:44 AM | | max CPUs used: 2 12/4/2018 1:20:44 AM | | (to change preferences, visit a project web site or select Preferences in the Manager) 12/4/2018 1:20:44 AM | | Setting up project and slot directories 12/4/2018 1:20:44 AM | | Checking active tasks 12/4/2018 1:20:44 AM | | Setting up GUI RPC socket 12/4/2018 1:20:44 AM | | Checking presence of 696 project files 12/4/2018 1:20:44 AM | GPUGRID | Sending scheduler request: Requested by project. 12/4/2018 1:20:44 AM | GPUGRID | Requesting new tasks for Intel GPU Now... this is definitely odd. If I exclude the 1060... (man, it takes forever to shut down GPUGrid ;) ) 12/4/2018 1:23:25 AM | | Starting BOINC client version 7.14.2 for windows_x86_64 12/4/2018 1:23:25 AM | | log flags: file_xfer, sched_ops, task 12/4/2018 1:23:25 AM | | Libraries: libcurl/7.47.1 OpenSSL/1.0.2g zlib/1.2.8 12/4/2018 1:23:25 AM | | Data directory: C:\ProgramData\BOINC 12/4/2018 1:23:25 AM | | Running under account Falcon 12/4/2018 1:23:26 AM | | CUDA: NVIDIA GPU 0: GeForce 940MX (driver version 417.01, CUDA version 10.0, compute capability 5.0, 2048MB, 1686MB available, 881 GFLOPS peak) 12/4/2018 1:23:26 AM | | OpenCL: NVIDIA GPU 0: GeForce 940MX (driver version 417.01, device version OpenCL 1.2 CUDA, 2048MB, 1686MB available, 881 GFLOPS peak) 12/4/2018 1:23:26 AM | | OpenCL: Intel GPU 0: Intel(R) HD Graphics 630 (driver version 22.20.16.4799, device version OpenCL 2.1, 6489MB, 6489MB available, 211 GFLOPS peak) 12/4/2018 1:23:26 AM | | OpenCL CPU: Intel(R) Core(TM) i7-7820HQ CPU @ 2.90GHz (OpenCL driver vendor: Intel(R) Corporation, driver version 7.2.0.10, device version OpenCL 2.1 (Build 10)) 12/4/2018 1:23:26 AM | | Host name: DESKTOP-JKHBDQ2 12/4/2018 1:23:26 AM | | Processor: 8 GenuineIntel Intel(R) Core(TM) i7-7820HQ CPU @ 2.90GHz [Family 6 Model 158 Stepping 9] 12/4/2018 1:23:26 AM | | Processor features: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss htt tm pni ssse3 fma cx16 sse4_1 sse4_2 movebe popcnt aes f16c rdrandsyscall nx lm avx avx2 vmx smx tm2 pbe fsgsbase bmi1 hle smep bmi2 12/4/2018 1:23:26 AM | | OS: Microsoft Windows 10: Professional x64 Edition, (10.00.17763.00) 12/4/2018 1:23:26 AM | | Memory: 15.86 GB physical, 20.34 GB virtual 12/4/2018 1:23:26 AM | | Disk: 476.34 GB total, 177.30 GB free 12/4/2018 1:23:26 AM | | Local time is UTC -8 hours 12/4/2018 1:23:26 AM | | No WSL found. 12/4/2018 1:23:26 AM | GPUGRID | Found app_config.xml 12/4/2018 1:23:26 AM | GPUGRID | Missing <app_config> in app_config.xml 12/4/2018 1:23:26 AM | Milkyway@Home | Found app_config.xml 12/4/2018 1:23:26 AM | SETI@home | Found app_config.xml 12/4/2018 1:23:26 AM | | Config: use all coprocessors 12/4/2018 1:23:26 AM | climateprediction.net | URL http://climateprediction.net/; Computer ID 1385937; resource share 1500 12/4/2018 1:23:26 AM | Einstein@Home | URL http://einstein.phys.uwm.edu/; Computer ID 12162809; resource share 0 12/4/2018 1:23:26 AM | GPUGRID | URL http://www.gpugrid.net/; Computer ID 493460; resource share 750 12/4/2018 1:23:26 AM | Milkyway@Home | URL http://milkyway.cs.rpi.edu/milkyway/; Computer ID 715608; resource share 100 12/4/2018 1:23:26 AM | SETI@home | URL http://setiathome.berkeley.edu/; Computer ID 8365185; resource share 100 12/4/2018 1:23:26 AM | World Community Grid | URL http://www.worldcommunitygrid.org/; Computer ID 3461727; resource share 750 12/4/2018 1:23:26 AM | World Community Grid | General prefs: from World Community Grid (last modified 02-Jan-2018 18:29:34) 12/4/2018 1:23:26 AM | World Community Grid | Host location: none 12/4/2018 1:23:26 AM | World Community Grid | General prefs: using your defaults 12/4/2018 1:23:26 AM | | Reading preferences override file 12/4/2018 1:23:26 AM | | Preferences: 12/4/2018 1:23:26 AM | | max memory usage when active: 12181.12 MB 12/4/2018 1:23:26 AM | | max memory usage when idle: 14617.35 MB 12/4/2018 1:23:26 AM | | max disk usage: 180.25 GB 12/4/2018 1:23:26 AM | | max CPUs used: 2 12/4/2018 1:23:26 AM | | (to change preferences, visit a project web site or select Preferences in the Manager) 12/4/2018 1:23:26 AM | | Setting up project and slot directories 12/4/2018 1:23:26 AM | | Checking active tasks 12/4/2018 1:23:26 AM | | Setting up GUI RPC socket 12/4/2018 1:23:26 AM | | Checking presence of 696 project files 12/4/2018 1:23:26 AM | GPUGRID | Sending scheduler request: To fetch work. 12/4/2018 1:23:26 AM | GPUGRID | Requesting new tasks for Intel GPU 12/4/2018 1:23:28 AM | GPUGRID | Scheduler request completed: got 0 new tasks ... then of course GPU 0 becomes the 940MX. That's probably where I thought GPU 0 was the 940MX. Restarted it again, and they again shuffled so the 1060 is in "0" slot and 940mx in the "1" slot. But come to think of it, I hadn't really had concrete evidence of the relationship. So, say I'm running without the 1060 connected. Is there a way in cc_config.xml to exclude a GPU by its name, not by ID (which seems to change to stay sequential, no matter what they end up being)? So, sounds like there's no issue in the device ID assignment, but still leaves a hole in figuring out how to keep GPUGrid from trying to run on the 940mx, specifically... | |
ID: 50999 | Rating: 0 | rate: / Reply Quote | |
I see a couple of issues. | |
ID: 51000 | Rating: 0 | rate: / Reply Quote | |
BOINC is seeing 3 GPUs, two of which are device 0. 1 per device manufacture. Setup cc_config per the Wiki link with device 1 for the 940MX with type option. | |
ID: 51001 | Rating: 0 | rate: / Reply Quote | |
The intel GPU is not supported by GPUGrid so I don't think this will affect the GPU order | |
ID: 51002 | Rating: 0 | rate: / Reply Quote | |
Right, the Intel GPU was never part of the problem :P | |
ID: 51003 | Rating: 0 | rate: / Reply Quote | |
Ok, now that I have a better understanding of what you want to do, we should try to BOINC to ignore the built in GPU. Never tried this on GPUGrid but..... <cc_config> <options> <exclude_gpu> <url>http://einstein.phys.uwm.edu/</url> <device_num>2</device_num> <app>einsteinbinary_BRP4G</app> </exclude_gpu> <exclude_gpu> <url>http://einstein.phys.uwm.edu/</url> <device_num>0</device_num> <app>einsteinbinary_BRP5</app> </exclude_gpu> <exclude_gpu> <url>http://einstein.phys.uwm.edu/</url> <device_num>1</device_num> <app>einsteinbinary_BRP5</app> </exclude_gpu> </options> </cc_config> so I think we should be able to use it here. Will need to trim it. I think it might work without the <app></app> section so that it ignores all work from this website. So maybe something like <cc_config> <options> <exclude_gpu> <url>http://www.gpugrid.net/</url> <device_num>1</device_num> </exclude_gpu> </options> </cc_config> Not on a machine that is currently running GPUGrid so can't test it. Z ____________ | |
ID: 51004 | Rating: 0 | rate: / Reply Quote | |
That would just keep GPUGrid from using any device numbered "1", which would keep the 940MX from doing anything when the 1060 is connected (as 0), but with it disconnected (as now, writing this from bed :) ), it'd revert to using device 0, which would be the 940MX. | |
ID: 51005 | Rating: 0 | rate: / Reply Quote | |
That would just keep GPUGrid from using any device numbered "1", which would keep the 940MX from doing anything when the 1060 is connected (as 0), but with it disconnected (as now, writing this from bed :) ), it'd revert to using device 0, which would be the 940MX. That's why the <type> command is needed to ignore just the single manufacture of a certain ID which can only be one single card in any given system. This is what I used to ignore the 1st NV GPU listed in BOINC startup in my own system. <exclude_gpu>--> <url>http://xansons4cod.com/xansons4cod/</url> <device_num>0</device_num> <type>NVIDIA</type> </exclude_gpu> <exclude_gpu>--> <url>https://albertathome.org/</url> <device_num>0</device_num> <type>NVIDIA</type> </exclude_gpu> That is a 980Ti that I use for FAH and for awhile I had a 2nd NV 970 card I used for BOINC. Adding this to cc_config is the only way to keep BOINC from running on the 980Ti and only on the 970. Change the URL and device_num = 1 when its displayed as 'CUDA: NVIDIA GPU 1: GeForce 940MX" | |
ID: 51006 | Rating: 0 | rate: / Reply Quote | |
That's the thing, there are two nVidia cards here when the 1060 is connected. It shifts so that 0 = 1060, 1 = 940MX (BOTH are nVidia!), but when the 1060 is unplugged, it becomes 0 = 940MX (again, nVidia). So all I'd be doing with that config is excluding "whatever device 1 happens to be", which would still leave it with 0 = 940MX being enabled when the 1060 is unplugged. | |
ID: 51008 | Rating: 0 | rate: / Reply Quote | |
I'm going to be going this route myself and exclude the RTX 2080 from both Einstein and GPUGRid. Want to keep the other 3 Nvidia cards running those projects though. | |
ID: 51009 | Rating: 0 | rate: / Reply Quote | |
Glad I could help Keith. | |
ID: 51010 | Rating: 0 | rate: / Reply Quote | |
I'm not having any luck with excluding device 0 for both GPUGrid and Einstein. Doing so prevents all Seti cpu tasks from running and puts them into "waiting to run" status. | |
ID: 51011 | Rating: 0 | rate: / Reply Quote | |
Keith PM your cc_config so I can see what it looks like. | |
ID: 51012 | Rating: 0 | rate: / Reply Quote | |
Okay, after a while of crunching like this, I've determined that there IS definitely a bug of some kind in how GPUGrid and/or BOINC handles device ID assignments. <cc_config> <options> <use_all_gpus>1</use_all_gpus> <exclude_gpu> <url>http://www.gpugrid.net/</url> <device_num>1</device_num> </exclude_gpu> <exclude_gpu> <url>http://setiathome.berkeley.edu/</url> <type>NVIDIA</type> </exclude_gpu> <exclude_gpu> <url>http://milkyway.cs.rpi.edu/milkyway/</url> <type>NVIDIA</type> </exclude_gpu> <exclude_gpu> <url>http://einstein.phys.uwm.edu/</url> <type>NVIDIA</type> <type>intel_gpu</type> </exclude_gpu> </options> </cc_config> ... But as you can see, the Einstein@Home listing is out here giving zero f^ks about what I told it. Look at the log in that screenshot as well. The IDs are clearly assigned, and should not be overlapping on the same GPU. What can we do with this information? | |
ID: 51172 | Rating: 0 | rate: / Reply Quote | |
wow this forum's functionality is extremely limited (not being able to edit after a time period, not being able to post images inline without an external host, etc etc etc 2003 internet things) | |
ID: 51173 | Rating: 0 | rate: / Reply Quote | |
Since you are using windows, might want to look at BoincTasks and SIVx64. I like it as it gives better idea of what my system is doing. | |
ID: 51174 | Rating: 0 | rate: / Reply Quote | |
Have you tried Nvidia's built-in tool that is available? nvidia-smi?? It shows what is running on each gpu, the amount of utilization, memory usage, power usage in watts and each application running on each particular gpu. | |
ID: 51175 | Rating: 0 | rate: / Reply Quote | |
Have you tried??? <use_all_gpus>0</use_all_gpus> A 1060 (6.1) is more capable than a 940MX (5.0).<use_all_gpus>0|1</use_all_gpus> If 1, use all GPUs (otherwise only the most capable ones are used). Requires a client restart. https://developer.nvidia.com/cuda-gpus | |
ID: 51180 | Rating: 0 | rate: / Reply Quote | |
Message boards : Number crunching : External GPU - BOINC's device number does not match acemd-922-80.exe's device number