Message boards : Number crunching : ACEMD3: strange hanging at the end of crunching
Author | Message |
---|---|
Laptop on Win11, Intel i7 11th gen, discrete RTX3070, 16 GB RAM: ACEMD3 application is processed quickly, but every time BOINC finishes calculations the computer hangs. I noticed that behaviour only on ACEMD3. Temperature has nothing to do with it, it seems, as ATM and ATMML apps are more compute-intensive with higher temperatures for longer times, but there is no hanging. Once the computer is restarted, BOINC starts normally and continues uploading the results back to the server. | |
ID: 61990 | Rating: 0 | rate: / Reply Quote | |
Actually, i started noticing this behaviour quite recently. Not sure if it's related to this new version of ACEMD3 which takes some 20-25 minutes to finish - previous versions of the app were running for 2-3 hours and wouldn't hang. | |
ID: 61991 | Rating: 0 | rate: / Reply Quote | |
The app hasn't changed, just the data processed by the app has changed. So I suspect that it might be some conflicting app on your laptop (for example a GPU monitoring app). Is there anything about this error in the reliability history? | |
ID: 61992 | Rating: 0 | rate: / Reply Quote | |
The app hasn't changed, just the data processed by the app has changed. So I suspect that it might be some conflicting app on your laptop (for example a GPU monitoring app). Is there anything about this error in the reliability history? the app actually has changed. they used to just distribute a single binary file for the app, and now they distribute it as an archive containing the binary and many other things (probably some libraries or dependencies) and it's now called via the BOINC wrapper instead of directly. the most recent versions of the acemd3 app were released in September of this year. ____________ | |
ID: 61993 | Rating: 0 | rate: / Reply Quote | |
the app actually has changed. they used to just distribute a single binary file for the app, and now they distribute it as an archive containing the binary and many other things (probably some libraries or dependencies) and it's now called via the BOINC wrapper instead of directly. the most recent versions of the acemd3 app were released in September of this year. Thanks, Ian&Steve, for confirming. I think this weird behaviour started manifesting around that time. Before that, the laptop would occasionally hang, but that didn't depend on the app - more on the duration and complexity of a task. The app hasn't changed, just the data processed by the app has changed. So I suspect that it might be some conflicting app on your laptop (for example a GPU monitoring app). Is there anything about this error in the reliability history? Short-run ACEMD3 tasks never hang before. They were my favourites for 2 reasons: proper checkpointing and stability on my laptop. Now things have changed. Even running BOINC only, without any other user app, each ACEMD3 task ends with hanging. Could be something specific about the way resources are freed or, maybe, requested, and the AV software (which i can't change). But, again, that never happened before in such a consistent manner. ATMML that run much longer became far more reliable than ACEMD3 because i can set and forget about them, but each ACEMD3 requires hard restart of my laptop... | |
ID: 61994 | Rating: 0 | rate: / Reply Quote | |
I thought that your host was able to finish ACEMD3 (v2.32) tasks since september, but now I understand that you've processed only ATMML tasks in that period, that's why you've noticed just quite recently that your computer hangs with the ACEMD3 v2.32 app. c:\ProgramData\BOINC\slots\
c:\ProgramData\BOINC\projects\www.gpugrid.net\ Have you tried to reset the GPUGrid project in BOINC?Can you check your pc's reliability history? (click on start and type "reliability" in the search field on the top) | |
ID: 61995 | Rating: 0 | rate: / Reply Quote | |
The ACEMD3 v2.32 Windows app throws 195 (0xc3) EXIT_CHILD_FAILED errors quite frequently, not only on my Windows host. Luckily it happens very early. | |
ID: 61996 | Rating: 0 | rate: / Reply Quote | |
AV: WithSecure | |
ID: 61997 | Rating: 0 | rate: / Reply Quote | |
For the record I do not have such issue on my host. I see that your host uses Boinc 8.0.4, I'm still on 8.0.2, could the issue come from here? | |
ID: 61999 | Rating: 0 | rate: / Reply Quote | |
Disabled Gigabyte Control Center service, restarted the laptop, started BOINC: hanging after finishing the calculations. Checked various files, found this in stderrdae.txt: Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x00007FF7F17F60FA read attempt from address 0x0000000000000020 Engaging BOINC Windows Runtime Debugger... As for the version, i had an issue where BOINC manager suddenly became unable to connect to the client. I tried reinstalling 8.0, 8.02 - nothing helped. 8.04 worked, so i'm staying on it for now. But i'll try reinstalling BOINC and test again - thanks for the suggestion. | |
ID: 62000 | Rating: 0 | rate: / Reply Quote | |
I'm using the 8.04 BOINC manager on my Windows host, the ACEMD3 v2.32 is running fine with that version, when it doesn't throw the "195 (0xc3) EXIT_CHILD_FAILED" error. | |
ID: 62001 | Rating: 0 | rate: / Reply Quote | |
Message boards : Number crunching : ACEMD3: strange hanging at the end of crunching