Author |
Message |
mclaver
Joined: 9 Mar 09 Posts: 25 Credit: 3,321,711,931 RAC: 0 Level
Scientific publications
|
All of my GPUGRID tasks are failing with a computation error under Windows 7 Professional 64-bit, BOINC 6.12.26.
From the task
<core_client_version>6.12.26</core_client_version>
<![CDATA[
<message>
The system cannot find the path specified. (0x3) - exit code 3 (0x3)
</message>
<stderr_txt>
# Using device 0
# There is 1 device supporting CUDA
# Device 0: "GeForce GTX 285"
# Clock rate: 1.58 GHz
# Total amount of global memory: 1017839616 bytes
# Number of multiprocessors: 30
# Number of cores: 240
MDIO: cannot open file "restart.coor"
SWAN: FATAL : swanMemcpyDtoH failed
Assertion failed: 0, file swanlib_nv.c, line 390
This application has requested the Runtime to terminate it in an unusual way.
Please contact the application's support team for more information.
</stderr_txt>
]]>
From BOINC
6/26/2011 11:22:08 AM | GPUGRID | Computation for task A418-TONI_AGGsoup1-16-100-RND7971_0 finished
6/26/2011 11:22:08 AM | GPUGRID | Output file A418-TONI_AGGsoup1-16-100-RND7971_0_1 for task A418-TONI_AGGsoup1-16-100-RND7971_0 absent
6/26/2011 11:22:08 AM | GPUGRID | Output file A418-TONI_AGGsoup1-16-100-RND7971_0_2 for task A418-TONI_AGGsoup1-16-100-RND7971_0 absent
6/26/2011 11:22:08 AM | GPUGRID | Output file A418-TONI_AGGsoup1-16-100-RND7971_0_3 for task A418-TONI_AGGsoup1-16-100-RND7971_0 absent
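For comparing failures across hosts, the stderr text above can be reduced to the device, its memory size, and the lines that report the failure. This is only a minimal sketch: the function name `summarize_stderr` and the three substrings it searches for are assumptions made for illustration, based solely on the output pasted in this post, not on any documented BOINC or GPUGRID format.

```python
import re

def summarize_stderr(stderr_text):
    """Pull the device name, memory size and failure lines out of a task's stderr text."""
    device = re.search(r'# Device \d+: "([^"]+)"', stderr_text)
    memory = re.search(r"# Total amount of global memory: (\d+) bytes", stderr_text)
    # Substrings chosen from the log in this thread (assumption, not an official list).
    markers = ("FATAL", "cannot open file", "Assertion failed")
    fatal = [
        line.strip()
        for line in stderr_text.splitlines()
        if any(marker in line for marker in markers)
    ]
    return {
        "device": device.group(1) if device else None,
        "memory_mib": int(memory.group(1)) / 2**20 if memory else None,
        "fatal_lines": fatal,
    }

sample = """# Using device 0
# There is 1 device supporting CUDA
# Device 0: "GeForce GTX 285"
# Clock rate: 1.58 GHz
# Total amount of global memory: 1017839616 bytes
# Number of multiprocessors: 30
# Number of cores: 240
MDIO: cannot open file "restart.coor"
SWAN: FATAL : swanMemcpyDtoH failed
Assertion failed: 0, file swanlib_nv.c, line 390
"""

summary = summarize_stderr(sample)
print(summary["device"], summary["memory_mib"])
for line in summary["fatal_lines"]:
    print(line)
```

Running the same function over the stderr of a failed task from another host makes it easier to see whether the failure lines are identical.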
____________
|
|
|
|
Is your card overclocked at all? The reference clock rate appears to be a little lower than what is reported in output.
It took a while to find overclocking settings that would work with my GTX 570 so you may want to start at reference clock speeds. |
|
|
|
Please make sure you are set up in the GPUGRID preferences for shorter tasks only. Your successful tasks are all of the shorter type. The GTX 285 is no longer recommended for longer tasks because it may not return results in time to get credit.
I looked at your computer and you are running the newest drivers. Have you tested the GPU configuration on other projects?
Please let us know if you get the issues resolved and how.
|
|
|
mclaver
Joined: 9 Mar 09 Posts: 25 Credit: 3,321,711,931 RAC: 0 Level
Scientific publications
|
I do not overclock, and Collatz and SETI seem to work fine. For now I have stopped running GPUGRID on this card. I am running GPUGRID on a GTX 275 and it has never had an error, so I do not understand the comment that the GTX 285 is too slow. |
|
|
|
The GTX 285 is no longer recommended for longer tasks because it may not return results in time to get credit.
Hi. The GTX 285 is a perfectly valid card for any task of this project, including the long ones.
I am running a GTX 295 (effectively two downclocked GTX 285s), so tasks actually take longer on it, and a task still finishes in 15 hours without activating SWAN_SYNC=0. Greetings. |
|
|
mclaver
Joined: 9 Mar 09 Posts: 25 Credit: 3,321,711,931 RAC: 0 Level
Scientific publications
|
The GTX 285 is no longer recommended for longer tasks because it may not return results in time to get credit.
Hi. The GTX 285 is a perfectly valid card for any task of this project, including the long ones.
I am running a GTX 295 (effectively two downclocked GTX 285s), so tasks actually take longer on it, and a task still finishes in 15 hours without activating SWAN_SYNC=0. Greetings.
I have not done anything with SWAN_SYNC = 0. How do I check that and what do I need to do with it? |
|
|
mclaver
Joined: 9 Mar 09 Posts: 25 Credit: 3,321,711,931 RAC: 0 Level
Scientific publications
|
It looks like I am now getting 50% errors on Collatz. I have uninstalled and reinstalled the latest NVIDIA drivers.
Driver Version 27533, BOINC 6.12.26, Windows 7 Professional 64-bit. |
|
|
|
I have not done anything with SWAN_SYNC = 0. How do I check that and what do I need to do with it?
Hello: Setting SWAN_SYNC=0 dedicates one CPU core to each GPU. Performance is 10 to 15% better, but at the expense of a heavy CPU load; I personally do not recommend it.
It has been discussed in several threads on this website, see: http://www.gpugrid.net/forum_thread.php?id=2123 and http://www.gpugrid.net/forum_thread.php?id=2553 and others.
On the other hand, I see you are running the GTX 285 with a 7% overclock. I think it is best not to push it: set the nominal clock rate, or even a little less, to improve stability. Greetings.
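On the question of how to check it: the snippet below is a minimal sketch that assumes SWAN_SYNC is read as an ordinary environment variable. The helper name `swan_sync_status` is made up for illustration. Note that a script only sees the environment of its own process, which may differ from the one the BOINC client runs in, for example when BOINC is installed as a service.

```python
import os

def swan_sync_status(environ=None):
    """Report whether SWAN_SYNC is set in the given environment mapping."""
    # Default to this process's environment; a mapping can be passed in for testing.
    env = os.environ if environ is None else environ
    value = env.get("SWAN_SYNC")
    if value is None:
        return "SWAN_SYNC is not set"
    return f"SWAN_SYNC is set to {value!r}"

print(swan_sync_status())
```

If it reports that the variable is not set, nothing has been changed from the default behaviour on that account.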
|
|
|
mclaver
Joined: 9 Mar 09 Posts: 25 Credit: 3,321,711,931 RAC: 0 Level
Scientific publications
|
I have not done anything with SWAN_SYNC = 0. How do I check that and what do I need to do with it?
Hello: Setting SWAN_SYNC=0 dedicates one CPU core to each GPU. Performance is 10 to 15% better, but at the expense of a heavy CPU load; I personally do not recommend it.
It has been discussed in several threads on this website, see: http://www.gpugrid.net/forum_thread.php?id=2123 and http://www.gpugrid.net/forum_thread.php?id=2553 and others.
On the other hand, I see you are running the GTX 285 with a 7% overclock. I think it is best not to push it: set the nominal clock rate, or even a little less, to improve stability. Greetings.
That is not what I want to do. I have all my CPUs dedicated to WCG, and most of my GPUs dedicated to GPUGRID except this one.
I am actually ranked 19 in GPUGRID, and have GPUs from a GT 240 to multiple GTX 480s. The GTX 285 is the only one I am having problems with.
I am not that familiar with GPUs. I did not overclock this card, but I believe it was factory overclocked to 702 MHz, and that could be causing the problem.
How do I change the clock on a GPU (I know how to do it on a CPU), and what speed would you recommend for a GTX 285? |
|
|
Dagorath
Joined: 16 Mar 11 Posts: 509 Credit: 179,005,236 RAC: 0 Level
Scientific publications
|
I have heard some Windows users are using MSI Afterburner to adjust clock speeds. I don't use Windows so I can't tell you much more than that. But before you mess with clock speeds, have you looked at the temperature your GPU is running at? It was running fine before, right? Maybe it just needs to have the dust blown out of it?
It seems every summer there is always a surge in errors on hardware that was running fine. It's usually just due to dust building up over the winter. |
|
|
skgiven (Volunteer moderator, Volunteer tester)
Joined: 23 Apr 09 Posts: 3968 Credit: 1,995,359,260 RAC: 0 Level
Scientific publications
|
I would go along with what Dagorath says:
Check for dust; in the summer it's warmer, so any dust buildup often causes heat problems. While this may be visible in tools such as MSI Afterburner, EVGA Precision and GPU-Z, it might not always be so apparent; these tools only report GPU temperatures, not capacitor temperatures.
If you feel the need to reduce speeds, drop the GPU clock speed. Your GTX285 is a CC1.3 card, so you may not even see a drop in performance. Keep the shader frequency as is. For CC2.0 and CC2.1 cards the GPU is linked to the shaders (1:2) so you would see a drop. |
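The 1:2 link mentioned for CC2.0 and CC2.1 cards can be written out as a small calculation. This is only an illustration of the ratio stated above: the function name and the 800 MHz and 750 MHz figures are hypothetical and are not taken from any card in this thread.

```python
def linked_shader_clock_mhz(core_clock_mhz, ratio=2):
    """Shader clock implied by a core clock when the two are linked at 1:ratio."""
    return core_clock_mhz * ratio

# Hypothetical core clocks, chosen only to show the effect of the link.
before = linked_shader_clock_mhz(800)
after = linked_shader_clock_mhz(750)
print(before, after, before - after)
```

With a linked ratio, every MHz taken off the core clock takes two off the shader clock, which is why a drop in performance is expected on those cards and not necessarily on a CC1.3 card where the shader frequency is left as is.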
|
|
mclaver
Joined: 9 Mar 09 Posts: 25 Credit: 3,321,711,931 RAC: 0 Level
Scientific publications
|
The fan is clean and the card does not seem too hot. I reduced my GPU clock from 702 MHz to 650 MHz and I am still getting about 30% errors on Collatz. Once I get through the queue for Collatz I will go back to GPUGRID and see how I do there at the lower clock speed. |
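For reference, the size of that clock reduction can be worked out as a percentage. The helper name below is made up for illustration; the two clock values are the ones quoted in this post.

```python
def percent_change(old, new):
    """Relative change from old to new, in percent (negative means a reduction)."""
    return (new - old) / old * 100

# GPU clock before and after the change described above.
change = percent_change(702, 650)
print(f"{change:.1f}%")
```

The same function can be used to compare any other pair of clock settings tried later.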
|
|
skgiven (Volunteer moderator, Volunteer tester)
Joined: 23 Apr 09 Posts: 3968 Credit: 1,995,359,260 RAC: 0 Level
Scientific publications
|
30% is better than 50%, so you might be on to something.
I would suggest reducing the RAM frequency; it can often lead to better GPU stability. |
|
|
Mican
Joined: 1 Jan 09 Posts: 2 Credit: 39,430,277 RAC: 0 Level
Scientific publications
|
I have had the same 0x3 error message on every unit I tried this week, and I am not convinced that it is caused by overclocking or computer instability (this computer finished many tasks recently). Tasks are failing 5 seconds after start, reporting a missing or inaccessible file. Maybe something is wrong with the 275.33 NVIDIA driver?
<core_client_version>6.10.18</core_client_version>
<![CDATA[
<message>
Systém nemůže nalézt uvedenou cestu. (0x3) - exit code 3 (0x3)
</message>
<stderr_txt>
# Using device 0
# There is 1 device supporting CUDA
# Device 0: "GeForce GTX 560 Ti"
# Clock rate: 1.90 GHz
# Total amount of global memory: 1008271360 bytes
# Number of multiprocessors: 8
# Number of cores: 64
MDIO: cannot open file "restart.coor"
SWAN: FATAL : swanMemcpyDtoH failed
Assertion failed: 0, file swanlib_nv.c, line 390
This application has requested the Runtime to terminate it in an unusual way.
Please contact the application's support team for more information.
</stderr_txt>
]]>
|
|
|
skgiven (Volunteer moderator, Volunteer tester)
Joined: 23 Apr 09 Posts: 3968 Credit: 1,995,359,260 RAC: 0 Level
Scientific publications
|
I would suggest you install the driver again.
If that does not work reset the project from Boinc Manager.
You don't have to use SWAN_SYNC anymore. |
|
|
|
Same card and same problem on my only host, which has a GTX 570 and the GTX 285.
And no O/C on the 285.
A few weeks ago the system worked well, but now all WUs on the 285 end in errors with the output files absent.
And my temps are low, so that is not the problem :/ |
|
|
Hona
Joined: 21 Sep 10 Posts: 2 Credit: 530,432,306 RAC: 0 Level
Scientific publications
|
This seems to be the way to go.
After some weeks of testing, my GTX 285 now runs fine with the GPU and shaders at the factory settings of 648 MHz and 1476 MHz. I only lowered the memory frequency from 1242 MHz to 999 MHz, and there seems to be no loss of calculation speed, as the GPU utilization is still the same.
No more compute or validate errors here or at Einstein@Home.
Thanks for your suggestion.
Hona
|
|
|
|
I have the same error message. All my work units on GPUGRID, Einstein@home and SETI that use my GPU end in a computation error. Was your problem solved by reinstalling the graphics card drivers?
Let me know.
My error message:
<core_client_version>6.13.1</core_client_version>
<![CDATA[
<message>
Le chemin d'accès spécifié est introuvable. (0x3) - exit code 3 (0x3)
</message>
<stderr_txt>
# Using device 0
# There is 1 device supporting CUDA
# Device 0: "GeForce GTX 560 Ti"
# Clock rate: 1.80 GHz
# Total amount of global memory: 1073741824 bytes
# Number of multiprocessors: 8
# Number of cores: 64
MDIO: cannot open file "restart.coor"
SWAN: FATAL : swanMemcpyDtoH failed
Assertion failed: 0, file swanlib_nv.c, line 390
This application has requested the Runtime to terminate it in an unusual way.
Please contact the application's support team for more information.
</stderr_txt>
]]> |
|
|
skgiven (Volunteer moderator, Volunteer tester)
Joined: 23 Apr 09 Posts: 3968 Credit: 1,995,359,260 RAC: 0 Level
Scientific publications
|
Try a driver reinstall.
Your Boinc configuration could influence the chances of failures, as could your environment (temperature), and what else you use the system for.
____________
FAQ's
HOW TO:
- Opt out of Beta Tests
- Ask for Help |
|
|
|
I reinstalled all my graphics card drivers and I still get computation errors.
I have had this kind of error since I installed my new GTX 560 Ti graphics card. Before that I used a 9800 GT and did not have any problems.
Does anyone have a solution?
http://einstein.phys.uwm.edu/result.php?resultid=256463235 |
|
|
|
I reinstalled all my graphics card drivers and I still get computation errors.
I have had this kind of error since I installed my new GTX 560 Ti graphics card. Before that I used a 9800 GT and did not have any problems.
Does anyone have a solution?
Computation errors on my BOINC projects:
http://www.gpugrid.net/result.php?resultid=4525101
http://einstein.phys.uwm.edu/result.php?resultid=256463235 |
|
|
Dagorath
Joined: 16 Mar 11 Posts: 509 Credit: 179,005,236 RAC: 0 Level
Scientific publications
|
davidalary wrote: My error message:
<core_client_version>6.13.1</core_client_version>
I think we are not supposed to use any of the 6.13.xx clients because they cause problems creating WUs.
|
|
|