Author |
Message |
GDFVolunteer moderator Project administrator Project developer Project tester Volunteer developer Volunteer tester Project scientist Send message
Joined: 14 Mar 07 Posts: 1957 Credit: 629,356 RAC: 0 Level
Scientific publications
|
I have uploaded the new CUDA3.1 application for everyone.
Remember to use SWAN_SYNC=0 for maximum performance (at the price of one CPU core).
Let us know the performance improvement.
gdf |
|
|
nenymSend message
Joined: 31 Mar 09 Posts: 137 Credit: 1,308,230,581 RAC: 0 Level
Scientific publications
|
For 65nm GTX260 too? |
|
|
GDFVolunteer moderator Project administrator Project developer Project tester Volunteer developer Volunteer tester Project scientist Send message
Joined: 14 Mar 07 Posts: 1957 Credit: 629,356 RAC: 0 Level
Scientific publications
|
Anyone with a CUDA3.1 driver will receive it.
(Windows only for now)
gdf |
|
|
|
Anyone with a CUDA3.1 driver will receive it.
(Windows only for now)
gdf
Any testing with non-GTX 4X0 cards? It appears pretty good so far for these, but I haven't received any CUDA3.1 WU's on my slower / older cards. Just wondering how many of these beta's have been run on those systems. |
|
|
nenymSend message
Joined: 31 Mar 09 Posts: 137 Credit: 1,308,230,581 RAC: 0 Level
Scientific publications
|
Got 6.05 cuda tasks, what to do to recieve 3.1 tasks? Beta 3.1 run nice on my GTX260. |
|
|
|
Got 6.05 cuda tasks, what to do to recieve 3.1 tasks? Beta 3.1 run nice on my GTX260.
The best you can do is make sure you have "Run test applications" checked on your Prefences and then hope you get one.
From what I can see this is a fairly small beta (I think it was 20 WUs), most likely because it tested really well in the lab and so far I have not seen anyone posting an error for this beta. Congrats to the GPUGrid team on this and I look forward to this version of your app's release. Hopefully the performance increases will allow you to increase overall WU size to help the project but still remain about the same average runtime we crunchers see now.
:thumbs up:
____________
Thanks - Steve |
|
|
nenymSend message
Joined: 31 Mar 09 Posts: 137 Credit: 1,308,230,581 RAC: 0 Level
Scientific publications
|
Test 3.1 tasks I run OK http://www.gpugrid.net/forum_thread.php?id=2199&nowrap=true#17785, I have understood, that are available standard 3.1 tasks. Standard 6.05 2.2/2.3 cuda tasks crashes on my 65nm GTX260. |
|
|
|
Test 3.1 tasks I run OK.
I have understood, that are available standard 3.1 tasks.
I don't think a standard GPUGrid application that uses 3.1 has been released yet.
____________
Thanks - Steve |
|
|
GDFVolunteer moderator Project administrator Project developer Project tester Volunteer developer Volunteer tester Project scientist Send message
Joined: 14 Mar 07 Posts: 1957 Credit: 629,356 RAC: 0 Level
Scientific publications
|
Got 6.05 cuda tasks, what to do to recieve 3.1 tasks? Beta 3.1 run nice on my GTX260.
You just need to install cuda3.1 drivers.
gdf |
|
|
|
Test 3.1 tasks I run OK.
I have understood, that are available standard 3.1 tasks.
I don't think a standard GPUGrid application that uses 3.1 has been released yet.
And what about the 6.09 version in the apps? It's not among the ACEMD beta versions. Only the windows version available at the moment.
The first post says:
I have uploaded the new CUDA3.1 application for everyone.
Remember to use SWAN_SYNC=0 for maximum performance (at the price of one CPU core).
Let us know the performance improvement.
gdf
|
|
|
nenymSend message
Joined: 31 Mar 09 Posts: 137 Credit: 1,308,230,581 RAC: 0 Level
Scientific publications
|
I have installed WHQL 257.21 since 06/15/2010 at host ID 31329 and beta 3.1 tasks run OK....but still recieving (and deleting) standard tasks as 6.05 cuda. |
|
|
|
Test 3.1 tasks I run OK.
I have understood, that are available standard 3.1 tasks.
I don't think a standard GPUGrid application that uses 3.1 has been released yet.
And what about the 6.09 version in the apps? It's not among the ACEMD beta versions. Only the windows version available at the moment.
The first post says:
I have uploaded the new CUDA3.1 application for everyone.
Remember to use SWAN_SYNC=0 for maximum performance (at the price of one CPU core).
Let us know the performance improvement.
gdf
My bad ... I was not reading closely.
Might I suggest that we all just crank through whatever WUs GPUGrid gives us rather than cherry picking the new ones? This way we can finish up the old ones and move everything to the new as soon as possible.
For point hunters ... make sure you don't cancel too many or the server will shut you off for a while ... possibly a long while ~ many hours!!!
____________
Thanks - Steve |
|
|
nenymSend message
Joined: 31 Mar 09 Posts: 137 Credit: 1,308,230,581 RAC: 0 Level
Scientific publications
|
For point hunters ... make sure you don't cancel too many or the server will shut you off for a while ... possibly a long while ~ many hours!!!
I know that, but my 65nm GTX260 crashes any CUDA 2.2/2.3 task. It was usable a year ago and from that time I am crunching other project on it, but not GPUGRID. I load (and cancel) task time to time to not reach daily quota. Sometimes i recieve 3.1 beta task that rise daily quota.
GDF, are you sure that server should send standard tasks for non-Fermi class GPU with 257.21 drivers as CUDA 3.1?
|
|
|
|
For point hunters ... make sure you don't cancel too many or the server will shut you off for a while ... possibly a long while ~ many hours!!!
I know that, but my 65nm GTX260 crashes any CUDA 2.2/2.3 task. It was usable a year ago and from that time I am crunching other project on it, but not GPUGRID. I load (and cancel) task time to time to not reach daily quota. Sometimes i recieve 3.1 beta task that rise daily quota.
My post was not directed towards you, it was just a general warning for people who do not already know the consequences if they abort to many.
____________
Thanks - Steve |
|
|
GDFVolunteer moderator Project administrator Project developer Project tester Volunteer developer Volunteer tester Project scientist Send message
Joined: 14 Mar 07 Posts: 1957 Credit: 629,356 RAC: 0 Level
Scientific publications
|
For point hunters ... make sure you don't cancel too many or the server will shut you off for a while ... possibly a long while ~ many hours!!!
I know that, but my 65nm GTX260 crashes any CUDA 2.2/2.3 task. It was usable a year ago and from that time I am crunching other project on it, but not GPUGRID. I load (and cancel) task time to time to not reach daily quota. Sometimes i recieve 3.1 beta task that rise daily quota.
GDF, are you sure that server should send standard tasks for non-Fermi class GPU with 257.21 drivers as CUDA 3.1?
It should. I will check the log files.
gdf |
|
|
|
Got anybody any of these new v6.09 WUs?
I'm still getting only v6.05.
I guess we have to wait the queue to run out of v6.05 before we'll get v6.09. |
|
|
GDFVolunteer moderator Project administrator Project developer Project tester Volunteer developer Volunteer tester Project scientist Send message
Joined: 14 Mar 07 Posts: 1957 Credit: 629,356 RAC: 0 Level
Scientific publications
|
My fault. The server was not giving out cuda3.1 jobs to anyone.
Now it should be fixed.
gdf |
|
|
nenymSend message
Joined: 31 Mar 09 Posts: 137 Credit: 1,308,230,581 RAC: 0 Level
Scientific publications
|
Thanks GDF, it works now. Great. |
|
|
|
6.09 Cuda 3.1 beta is running good on my 55nm EVGA 260s. As did the previous versions also, with moderate overclock.
Just 1 WU completed so far but it seems to be 5 to 10 percent faster than 6.05 was.
Time per step seems 2x faster than I usualy see. |
|
|
|
For Linux, when ?
Please. |
|
|
|
My first return on a full size 3.1 is slower ... very disappointing.
No changes to system setup: GTX480 on WinXP
Examples are from the same WU type: TONI_CAPBIND*
Old version average runtime was 6550 (very little delta between runs)
# Time per step (avg over 650000 steps): 10.065 ms
# Approximate elapsed time for entire WU: 6541.937 s
New version 1 result runtime was 7768 seconds.
# Time per step (avg over 650000 steps): 11.947 ms
# Approximate elapsed time for entire WU: 7765.391 s
____________
Thanks - Steve |
|
|
|
next result on the same system is for HERG* type WU.
about the same as the old app version ...
Old App average runtime 6250 sec. (very little delta between WUs)
# Time per step (avg over 625000 steps): 10.001 ms
# Approximate elapsed time for entire WU: 6250.359 s
New App 1 result
# Time per step (avg over 625000 steps): 9.815 ms
# Approximate elapsed time for entire WU: 6134.672 s
____________
Thanks - Steve |
|
|
ftpd Send message
Joined: 6 Jun 08 Posts: 152 Credit: 328,250,382 RAC: 0 Level
Scientific publications
|
Two cuda 3.1 6.09 WU windows-xp-pro gtx480 takes more than 3 hours per wu.
More 18.000 ms over 625000 steps.
So this is not good??!!
____________
Ton (ftpd) Netherlands |
|
|
nenymSend message
Joined: 31 Mar 09 Posts: 137 Credit: 1,308,230,581 RAC: 0 Level
Scientific publications
|
My the first standard task succesfully finished after more than one year (65nm GTX 260, 64bit Win XP).
TONI_CAPBIND*
SWAN: Using synchronization method 0
# Time per step (avg over 325000 steps): 27.879 ms
# Approximate elapsed time for entire WU: 18121.656 s
Thanks GDF.
|
|
|
GDFVolunteer moderator Project administrator Project developer Project tester Volunteer developer Volunteer tester Project scientist Send message
Joined: 14 Mar 07 Posts: 1957 Credit: 629,356 RAC: 0 Level
Scientific publications
|
For Linux, when ?
Please.
Next week.
gdf |
|
|
|
Thank you.
My gpus drivers are updated to 256.35 linux version. |
|
|
|
I have completed four 6.09 tasks, and I'm a little disappointed.
There is a performance gain, but it's not the way I thought it would be.
Now my GPU is cooler with the new client, but that's not the important factor for me. :)
These 6.09 WUs take longer, beacuse they use less GPU.
According to GPU-Z v0.4.4 the GPU load is now around 60% to 74%.
Could you optimize the code so it would cause higher GPU load (on GTX 480)?
It was over 80% with the V6.05 client.
V6.09 m2-IBUCH_201b_pYEEI_100304-40-80-RND1404_1 . 15.422ms 9639.047s
V6.05 p41-IBUCH_501_pYEEI_100301-39-40-RND6587_1 _ 12.876ms 8047.344s
V6.09 h232f99r412-TONI_CAPBINDsp2-29-100-RND6253_0 16.624ms 10805.422s
V6.05 h232f99r558-TONI_CAPBINDsp2-22-100-RND1136_1 12.253ms 7964.406s
Also, one WU is failed approximately at 80%. |
|
|
|
While I am seeing higher GPU usage I am also seeing higher CPU usage which is causing more kernel thrashing which I believe is what is slowing this version of the app back. I have cut BOINC Manager down another thread to see if this helps. I was leaving one free so now I have two thread open for GPUGrid on each machine, not good for my WCG crunching. Overall I am seeing performance LOSSES of a few minutes on WinXP (480) and enormous (like at least an hour per WU) LOSSES on Win7 (a 285 and a 295). The only WU type that I am seeing an improvement on is TONI_HERG by a couple minutes on both systems. I am seriously considering rolling back.
Perhaps the betas proved the programming was stable and perhaps I was overly optimistic in think the beta runtimes / ms per step were indivcative of production performance.
Sorry if I sound harsh, I think I had my hopes up high and should crunch for aliens and esoteric math problems to regain my focus on GPUGrid which I really do think is a fantastic project. Time for a break from posting.
____________
Thanks - Steve |
|
|
|
Results seem to vary drastically with my WinXP
I392-TONI_KIDln-12-100-RND6196_0
# Device 0: "GeForce GTX 470"
SWAN: Using synchronization method 0
# Time per step (avg over 1300000 steps): 4.924 ms!!!
# Approximate elapsed time for entire WU: 6401.594 s
h232f99r440-TONI_CAPBINDsp2-25-100-RND7492_0
# Device 0: "GeForce GTX 470"
SWAN: Using synchronization method 0
# Time per step (avg over 650000 steps): 16.178 ms
# Approximate elapsed time for entire WU: 10515.719 s
Not sure what the difference between these 2 applications is but if the second can be optimized like first then that would be seriously impressive |
|
|
GDFVolunteer moderator Project administrator Project developer Project tester Volunteer developer Volunteer tester Project scientist Send message
Joined: 14 Mar 07 Posts: 1957 Credit: 629,356 RAC: 0 Level
Scientific publications
|
The beta workunit has a size of the simulation box of 64x64x64. Cuda3.1 optimizes FFT in multiple of 2. So it was a bit faster and other machines (WinXP) reproduced it. As they were overclocked even improved the reference speed we have found locally.
After these tests in gpugrid, it seems that other workunits are not any better than cuda3.0. I will do some tests next week on other workunit types.
Nevertheless, CUDA3.1 works well also on holder card, and seems to fix several bug. So it is a good release.
gdf |
|
|
|
@GDF Please read this and give me an optout of 3.1 if that's possible because the result will be the probable loss of two crunchers or more for this project.
____________
Radio Caroline, the world's most famous offshore pirate radio station.
Great music since April 1964. Support Radio Caroline Team -
Radio Caroline |
|
|
|
Serious note ... the majority of the degraded performance on my 2xx series cards was because the last time I switched hardware around I forgot to delete the SWAN_SYNC environ var and apparently the new version of the app recognizes and uses it. This is why I had an even greater amount of kernel usage than normal. While some WUs are faster and some are slower I hope the project gets the benefits from a more stable app.
____________
Thanks - Steve |
|
|
MarkJ Volunteer moderator Volunteer tester Send message
Joined: 24 Dec 08 Posts: 738 Credit: 200,909,904 RAC: 0 Level
Scientific publications
|
@GDF Please read this and give me an optout of 3.1 if that's possible because the result will be the probable loss of two crunchers or more for this project.
You could just go back to older drivers.
____________
BOINC blog |
|
|
|
I've just installed v258.19 driver, and it reports itself like this in Boinc Manager:
2010.07.03. 23:09:18 NVIDIA GPU 0: GeForce GTX 480 (driver version 25819, CUDA version 3020, compute capability 2.0, 1536MB, 1345 GFLOPS peak)
It means that soon(?) tere will be CUDA 3.2 |
|
|
skgivenVolunteer moderator Volunteer tester
Send message
Joined: 23 Apr 09 Posts: 3968 Credit: 1,995,359,260 RAC: 0 Level
Scientific publications
|
This CUDA 3010 WU look promising!
http://www.gpugrid.net/result.php?resultid=2611239
It finished about 40% faster than the CUDA 3000 tasks. So although there are some that are about 50% slower, there are some faster WU's as well.
GDF just needs to work out what the difference is, and make sure we get the fast ones rather than the slow ones.
Good luck with that,
Details:
The fast one,
2611239 1660106 3 Jul 2010 13:39:37 UTC 3 Jul 2010 18:51:08 UTC Completed and validated 6,601.92 6,598.16 4,727.93 7,091.89 ACEMD2: GPU molecular dynamics v6.09 (cuda31)
Name I531-TONI_KIDln-13-100-RND2019_0
Workunit 1660106
Created 3 Jul 2010 13:19:57 UTC
Sent 3 Jul 2010 13:39:37 UTC
Received 3 Jul 2010 18:51:08 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x0)
Computer ID 71363
Report deadline 8 Jul 2010 13:39:37 UTC
Run time 6601.921875
CPU time 6598.156
stderr out
<core_client_version>6.10.56</core_client_version>
<![CDATA[
<stderr_txt>
# Using device 0
# There is 1 device supporting CUDA
# Device 0: "GeForce GTX 470"
# Clock rate: 1.41 GHz
# Total amount of global memory: 1341718528 bytes
# Number of multiprocessors: 14
# Number of cores: 112
SWAN: Using synchronization method 0
MDIO ERROR: cannot open file "restart.coor"
# Time per step (avg over 1300000 steps): 5.077 ms
# Approximate elapsed time for entire WU: 6599.953 s
called boinc_finish
</stderr_txt>
]]>
Validate state Valid
Claimed credit 4727.92939814815
Granted credit 7091.89409722222
application version ACEMD2: GPU molecular dynamics v6.09 (cuda31)
An example of the older CUDA 3000 tasks,
2603160 1654062 2 Jul 2010 1:29:51 UTC 2 Jul 2010 6:15:33 UTC Completed and validated 8,218.06 8,105.92 4,428.01 6,642.02 ACEMD2: GPU molecular dynamics v6.05 (cuda30)
An example of the newer slow WUs,
2611961 1657969 3 Jul 2010 17:01:16 UTC 3 Jul 2010 22:26:14 UTC Completed and validated 12,871.38 12,867.41 4,535.61 6,803.41 ACEMD2: GPU molecular dynamics v6.09 (cuda31)
|
|
|
|
just want to suggest that everyone take a slow deep breath....
and remind everyone we are using bleeding edge drivers with bleeding edge developer tools.
Let's not prematurely flee like rats from a sinking ship just because every single GPU transistor is not working at 100% optimization and productivity.
Remember the main underlying goal is to help crunch protein simulations for medical research.
If some work units are slower now than before...so what? GPU crunching is still exponentially faster than crunching the same thing on CPU based applications.
If we keep our wits about us and work together we can help improve application productivity by providing constructive feedback. |
|
|
liveonc Send message
Joined: 1 Jan 10 Posts: 292 Credit: 41,567,650 RAC: 0 Level
Scientific publications
|
I second that!
I guess that I could run Prime95 or Linpack 24/7 in all I wanted to do was produce heat & waste electricity, especially in Summer. If I like getting points, I'm sure i could do that on Facebook for free w/o needing more then a netbook.
But I like GPUGRID. They do work, they have a friendly forum where stupid questions aren't answered back with a verbal gutting or impalement, & they're quick to adopt to all the new things which are still raw.
That one day, there would be other things then research in mental disorders, is something I can but hope. I'm more interested in cancer research, cures for STD's, & illness' that normally don't get enough funding, this is none of the above, but if GPUGRID shows the way, maybe the rest of the pack will follow one day...
____________
|
|
|
|
I got one of those fast work units:
# Time per step (avg over 1300000 steps): 4.617 ms
# Approximate elapsed time for entire WU: 6002.672 s
Claimed credit 4728
Granted credit 7092
This WU generated 94% CPU load, wich makes me contented.
But I know the other WUs did run almost at this 'efficiency level' before, and I'd like to have it back, or more :) |
|
|
skgivenVolunteer moderator Volunteer tester
Send message
Joined: 23 Apr 09 Posts: 3968 Credit: 1,995,359,260 RAC: 0 Level
Scientific publications
|
These CUDA 3.1 I531-TONI_KIDln WU’s are clearly much faster than the CUDA 3000 WUs, and the 3.1 TONI_HERG WU's are just the same as 3.0, but the TONI_CAPBIND, KASHIF_HIVPR are much slower.
TONI_KID fast, and again
Before last weekend’s downtime my Fermi on XP had a steady RAC of 65K. It has now dropped to 55K and is showing no signs of increasing.
If I just crunched the TONI_CAPBIND it would drop to 45K!
If I just crunched KASHIF_HIVPR it would drop to about 53K
If I could just crunch TONI_KIDln it would rise to 92K. Yes, these are twice as fast as TONI_CAPBIND under CUDA 3.1. So, as you can see there is a massive performance difference between tasks!
Before you (GPUGrid team) sit down to work out why this is, could you build the Work Units according to this, so the Fermi’s just get TONI_KIDln Work Units? The slower tasks could just be compiled for 2.2, as they are faster that way. This would immediately speed up the research on Fermi cards (almost double it).
Thanks, |
|
|
GDFVolunteer moderator Project administrator Project developer Project tester Volunteer developer Volunteer tester Project scientist Send message
Joined: 14 Mar 07 Posts: 1957 Credit: 629,356 RAC: 0 Level
Scientific publications
|
We can reproduce the problem so we can probably fix it.
http://www.gpugrid.net/forum_thread.php?id=2209&nowrap=true#17865
gdf |
|
|
|
The IBUCH_ _pYEEI units are also slower on my computer. |
|
|
WerkstattSend message
Joined: 23 May 09 Posts: 121 Credit: 327,240,386 RAC: 13,601 Level
Scientific publications
|
Got anybody any of these new v6.09 WUs?
Hi,
yes, today I updated to cuda 3.1, resetted the project. A couple of minutes later the download of all necessary files started. For abaout half an hour I received messages, that I have a wrong system, that a fermi-GPU is required aso.
But then - surprise, surprice - a 6.09WU started. Right now my old GTX260 is crunching it. Usually it faild after ~16%, now it reached 18% .. and it is still working!
I will give an update, whether it finished or failed.
Regards,
Alexander |
|
|
WerkstattSend message
Joined: 23 May 09 Posts: 121 Credit: 327,240,386 RAC: 13,601 Level
Scientific publications
|
I will give an update, whether it finished or failed.
No reason to be happy, wu failed after ~2 hours.
So CUDA3.1 + old 260 : Sorry, the problem is still there!
Regards
Alexander |
|
|
GDFVolunteer moderator Project administrator Project developer Project tester Volunteer developer Volunteer tester Project scientist Send message
Joined: 14 Mar 07 Posts: 1957 Credit: 629,356 RAC: 0 Level
Scientific publications
|
We can reproduce the problem so we can probably fix it.
http://www.gpugrid.net/forum_thread.php?id=2209&nowrap=true#17865
gdf
We know where is the problem. It is due to a change of data structure in CUDA3.1 which seems to be very bad in most cases.
A fix is on its way, but it might require a couple of days.
Be patient in the meanwhile.
gdf |
|
|
GDFVolunteer moderator Project administrator Project developer Project tester Volunteer developer Volunteer tester Project scientist Send message
Joined: 14 Mar 07 Posts: 1957 Credit: 629,356 RAC: 0 Level
Scientific publications
|
It's a problem of a routine in cuda3.1. We have patched it for now.
New application release coming up.
gdf |
|
|
GDFVolunteer moderator Project administrator Project developer Project tester Volunteer developer Volunteer tester Project scientist Send message
Joined: 14 Mar 07 Posts: 1957 Credit: 629,356 RAC: 0 Level
Scientific publications
|
We have found a very bad bug in CUDA3.1. I have removed the CUDA3.1 application until we can see the extent of the problem.
gdf |
|
|
|
Maybe NV will fix it in CUDA3.2 :)
I hope they will release it in a month. |
|
|
|
Are there any news about that very bad bug? |
|
|
GDFVolunteer moderator Project administrator Project developer Project tester Volunteer developer Volunteer tester Project scientist Send message
Joined: 14 Mar 07 Posts: 1957 Credit: 629,356 RAC: 0 Level
Scientific publications
|
On a machine with a mixed Fermi/g200 installation of cards, results on g200 for the FFT are simply wrong. This is a really bad bug.
The only solution is to deliver the CUDA3.1 application to Fermi cards only.
gdf |
|
|
GDFVolunteer moderator Project administrator Project developer Project tester Volunteer developer Volunteer tester Project scientist Send message
Joined: 14 Mar 07 Posts: 1957 Credit: 629,356 RAC: 0 Level
Scientific publications
|
New application is out for test in beta. Only fermi cards can get the cuda31 app due to the cuda3.1 bug.
This version maintains the level of performance of cuda3.1 for all workunits.
gdf |
|
|
skgivenVolunteer moderator Volunteer tester
Send message
Joined: 23 Apr 09 Posts: 3968 Credit: 1,995,359,260 RAC: 0 Level
Scientific publications
|
Picked up two, both failed in about 1 sec!
2599077 1651913 15 Jul 2010 14:35:06 UTC 15 Jul 2010 14:36:31 UTC Error while computing 1.45 0.00 0.00 --- ACEMD beta version v6.30 (cuda31)
2598716 1651698 15 Jul 2010 14:36:31 UTC 15 Jul 2010 14:40:05 UTC Error while computing 0.00 0.00 0.00 --- ACEMD beta version v6.30 (cuda31)
Name 100-GIANNI_TESTDHFR7-1-5-RND4717_0
Workunit 1651913
Created 1 Jul 2010 8:39:18 UTC
Sent 15 Jul 2010 14:35:06 UTC
Received 15 Jul 2010 14:36:31 UTC
Server state Over
Outcome Client error
Client state Compute error
Exit status -40 (0xffffffffffffffd8)
Computer ID 71363
Report deadline 20 Jul 2010 14:35:06 UTC
Run time 1.453125
CPU time 0
stderr out
<core_client_version>6.10.56</core_client_version>
<![CDATA[
<message>
- exit code -40 (0xffffffd8)
</message>
<stderr_txt>
# Using device 0
# There is 1 device supporting CUDA
# Device 0: "GeForce GTX 470"
# Clock rate: 1.41 GHz
# Total amount of global memory: 1341718528 bytes
# Number of multiprocessors: 14
# Number of cores: 112
SWAN: Using synchronization method 0
SWAN : Module load result [.nonbonded.cu.] [200]
SWAN: FATAL : Module load failed
</stderr_txt>
]]>
Validate state Invalid
Claimed credit 0
Granted credit 0
application version ACEMD beta version v6.30 (cuda31)
NVIDIA GeForce GTX 470 (1279MB) driver: 25721
Operating System Microsoft Windows XP |
|
|
|
Same results on my GTX 470...immediate failure
Sorry i can't post the error details ATM but I am fairly sure they identical to skgiven's |
|
|
skgivenVolunteer moderator Volunteer tester
Send message
Joined: 23 Apr 09 Posts: 3968 Credit: 1,995,359,260 RAC: 0 Level
Scientific publications
|
This one also failed,
2598612 1651618 15 Jul 2010 14:40:05 UTC 15 Jul 2010 16:10:07 UTC Error while computing 0.00 0.00 0.00 --- ACEMD beta version v6.30 (cuda31)
They are not even starting,
SWAN: FATAL : Module load failed
|
|
|
GDFVolunteer moderator Project administrator Project developer Project tester Volunteer developer Volunteer tester Project scientist Send message
Joined: 14 Mar 07 Posts: 1957 Credit: 629,356 RAC: 0 Level
Scientific publications
|
Removed, tomorrow another one.
gdf |
|
|
GDFVolunteer moderator Project administrator Project developer Project tester Volunteer developer Volunteer tester Project scientist Send message
Joined: 14 Mar 07 Posts: 1957 Credit: 629,356 RAC: 0 Level
Scientific publications
|
Trying now with acemdbeta 6.32 for windows.
gdf |
|
|
skgivenVolunteer moderator Volunteer tester
Send message
Joined: 23 Apr 09 Posts: 3968 Credit: 1,995,359,260 RAC: 0 Level
Scientific publications
|
Ran one Beta task. It worked on my GTX470 on XP x86, Boinc ver. 6.10.56
Cant find it online yet!
Here are the Boinc details:
16/07/2010 15:00:29 GPUGRID Starting 63-GIANNI_TESTDHFR7-1-5-RND3683_0
16/07/2010 15:00:30 GPUGRID Starting task 63-GIANNI_TESTDHFR7-1-5-RND3683_0 using acemdbeta version 632
16/07/2010 15:05:22 GPUGRID Computation for task 63-GIANNI_TESTDHFR7-1-5-RND3683_0 finished
16/07/2010 15:05:23 GPUGRID Started upload of 63-GIANNI_TESTDHFR7-1-5-RND3683_0_0
16/07/2010 15:05:23 GPUGRID Started upload of 63-GIANNI_TESTDHFR7-1-5-RND3683_0_1
16/07/2010 15:05:31 GPUGRID Finished upload of 63-GIANNI_TESTDHFR7-1-5-RND3683_0_0
16/07/2010 15:05:31 GPUGRID Started upload of 63-GIANNI_TESTDHFR7-1-5-RND3683_0_2
16/07/2010 15:05:36 GPUGRID Finished upload of 63-GIANNI_TESTDHFR7-1-5-RND3683_0_1
16/07/2010 15:05:36 GPUGRID Started upload of 63-GIANNI_TESTDHFR7-1-5-RND3683_0_3
16/07/2010 15:05:47 GPUGRID Finished upload of 63-GIANNI_TESTDHFR7-1-5-RND3683_0_2
16/07/2010 15:05:47 GPUGRID Started upload of 63-GIANNI_TESTDHFR7-1-5-RND3683_0_7
16/07/2010 15:05:51 GPUGRID Finished upload of 63-GIANNI_TESTDHFR7-1-5-RND3683_0_7
16/07/2010 15:05:52 GPUGRID Finished upload of 63-GIANNI_TESTDHFR7-1-5-RND3683_0_3
16/07/2010 15:05:52 GPUGRID Sending scheduler request: To report completed tasks.
16/07/2010 15:05:52 GPUGRID Reporting 1 completed tasks, not requesting new tasks
16/07/2010 15:05:53 GPUGRID Scheduler request completed
GDF, see here too - http://www.gpugrid.net/forum_thread.php?id=2227&nowrap=true#17998 |
|
|
|
I just finished 3 of these beta units, they all came out completed and validated. |
|
|
Beyond Send message
Joined: 23 Nov 08 Posts: 1112 Credit: 6,162,416,256 RAC: 0 Level
Scientific publications
|
These beta WUs are failing on the GTX 460:
<core_client_version>6.10.57</core_client_version>
<![CDATA[
<message>
- exit code -40 (0xffffffd8)
</message>
<stderr_txt>
# Using device 0
# There is 1 device supporting CUDA
# Device 0: "GeForce GTX 460"
# Clock rate: 0.81 GHz
# Total amount of global memory: 774307840 bytes
# Number of multiprocessors: 7
# Number of cores: 56
SWAN : Module load result [.fastfill.cu.] [200]
SWAN: FATAL : Module load failed
</stderr_txt>
While the clock speed is also incorrect above, it's reported correctly by MSI Afterburner v1.61. |
|
|
skgivenVolunteer moderator Volunteer tester
Send message
Joined: 23 Apr 09 Posts: 3968 Credit: 1,995,359,260 RAC: 0 Level
Scientific publications
|
The beta's are working well for GF100 cards, just not for GF104 cards (GTX460). The scientists are now aware of the problem because Beyond reported a difference in Compute Capability between the cards; CC2.0 for GF100 cards, CC2.1 for GF104 cards. GDF said they will try to create a new Beta for the GTX460 on Monday. So check back Monday evening. Apparently the GTX460 works on Collatz. Those wanting to test their card & tune the fans for a nice low temps, you could figure it out there. |
|
|
skgivenVolunteer moderator Volunteer tester
Send message
Joined: 23 Apr 09 Posts: 3968 Credit: 1,995,359,260 RAC: 0 Level
Scientific publications
|
GTX470 on XP running a Beta:
2678845 1704781 17 Jul 2010 8:32:57 UTC 17 Jul 2010 11:05:25 UTC
Completed and validated 280.06 276.80 187.28 280.92 ACEMD beta version v6.32 (cuda31) |
|
|
GDFVolunteer moderator Project administrator Project developer Project tester Volunteer developer Volunteer tester Project scientist Send message
Joined: 14 Mar 07 Posts: 1957 Credit: 629,356 RAC: 0 Level
Scientific publications
|
GTX470 on XP running a Beta:
2678845 1704781 17 Jul 2010 8:32:57 UTC 17 Jul 2010 11:05:25 UTC
Completed and validated 280.06 276.80 187.28 280.92 ACEMD beta version v6.32 (cuda31)
This is really good speed. A standard clocked GTX480 is 5ms/step under Linux. So only 10% slower for an overclocked GTX470. As good as it can get.
gdf |
|
|
|
GTX470 on XP running a Beta:
2678845 1704781 17 Jul 2010 8:32:57 UTC 17 Jul 2010 11:05:25 UTC
Completed and validated 280.06 276.80 187.28 280.92 ACEMD beta version v6.32 (cuda31)
Nearly identical results on my OCed GTX470 on XP
2681352 1706384 17 Jul 2010 15:17:24 UTC 17 Jul 2010 15:36:53 UTC
Completed and validated 269.16 266.86 187.28 280.92 ACEMD beta version v6.32 (cuda31)
Time per step: 5.344 ms
|
|
|
|
Standard clocked GTX480 is 5ms/step under Windows XP too:
2679396
2679141
2679030
And they ran with 93% GPU usage, that's great!
I can't hardly wait for the real workunits. :)
I hope they will run this fast and efficient too. |
|
|
skgivenVolunteer moderator Volunteer tester
Send message
Joined: 23 Apr 09 Posts: 3968 Credit: 1,995,359,260 RAC: 0 Level
Scientific publications
|
I ran a Folding@home task today while running this Beta. It ran very slow compaired to the other Betas, but at least it ran. You cant do this with non-Fermi cards; the GPUGrid tasks would just fail!
2686592 1708583 18 Jul 2010 Completed and validated 277.58 275.69 187.28 280.92 ACEMD beta version v6.32 (cuda31)
2686539 1707219 18 Jul 2010 Completed and validated 1,442.30 1,438.45 187.28 280.92 ACEMD beta version v6.32 (cuda31)
2686534 1707190 18 Jul 2010 Completed and validated 277.84 276.08 187.28 280.92 ACEMD beta version v6.32 (cuda31) |
|
|
|
Five more 6.32 test WUs on standard clocked GTX480 on Win XP x86:
2688266
2686600
2686535
2688266
2683261
The last one failed, during a BSOD, I'm not sure what caused this (I was browsing, but nothing serious). One of my rosetta WU failed too.
Every test WU are slightly under 5ms/step. |
|
|
GDFVolunteer moderator Project administrator Project developer Project tester Volunteer developer Volunteer tester Project scientist Send message
Joined: 14 Mar 07 Posts: 1957 Credit: 629,356 RAC: 0 Level
Scientific publications
|
New beta 6.34 is now out. Hopefully it should work with GTX460 cards, but we could not test it.
gdf |
|
|
|
The beta 6.34 is immediately errored out on my GTX480.
Task 2691577 |
|
|
GDFVolunteer moderator Project administrator Project developer Project tester Volunteer developer Volunteer tester Project scientist Send message
Joined: 14 Mar 07 Posts: 1957 Credit: 629,356 RAC: 0 Level
Scientific publications
|
Any more cases of this problem?
gdf |
|
|
|
Even GTX460 does not work by the same error.
http://www.gpugrid.net/result.php?resultid=2691593
http://www.gpugrid.net/result.php?resultid=2691594
<core_client_version>6.11.1</core_client_version>
<![CDATA[
<message>
- exit code -40 (0xffffffd8)
</message>
<stderr_txt>
# Using device 0
# There are 2 devices supporting CUDA
# Device 0: "GeForce GTX 460"
# Clock rate: 1.40 GHz
# Total amount of global memory: 1073283072 bytes
# Number of multiprocessors: 7
# Number of cores: 56
# Device 1: "GeForce GTX 460"
# Clock rate: 1.40 GHz
# Total amount of global memory: 1073414144 bytes
# Number of multiprocessors: 7
# Number of cores: 56
SWAN: Using synchronization method 0
SWAN: FATAL : swanMallocHost failed
</stderr_txt>
]]> |
|
|
GDFVolunteer moderator Project administrator Project developer Project tester Volunteer developer Volunteer tester Project scientist Send message
Joined: 14 Mar 07 Posts: 1957 Credit: 629,356 RAC: 0 Level
Scientific publications
|
Removed.
Another coming out now.
gdf |
|
|
|
It still becomes the error on my GTX460.
http://www.gpugrid.net/result.php?resultid=2691594
http://www.gpugrid.net/result.php?resultid=2692279
<core_client_version>6.11.1</core_client_version>
<![CDATA[
<message>
- exit code -40 (0xffffffd8)
</message>
<stderr_txt>
# Using device 0
# There are 2 devices supporting CUDA
# Device 0: "GeForce GTX 460"
# Clock rate: 1.40 GHz
# Total amount of global memory: 1073283072 bytes
# Number of multiprocessors: 7
# Number of cores: 56
# Device 1: "GeForce GTX 460"
# Clock rate: 1.40 GHz
# Total amount of global memory: 1073414144 bytes
# Number of multiprocessors: 7
# Number of cores: 56
SWAN : Module load result [.fastfill.cu.] [200]
SWAN: FATAL : Module load failed
</stderr_txt>
]]>
Specifications
M/B: ASUS P5E
CPU: Inel Xeon X3360
RAM: PC2-6400 2GBx2
GPU: Kuroutoshikou GF-GTX460-E1GHD (made by Sparkle Computer) x2
Driver: FW258.96
OS: Windows XP Pro SP3 x86
--
JG4KEZ |
|
|
GDFVolunteer moderator Project administrator Project developer Project tester Volunteer developer Volunteer tester Project scientist Send message
Joined: 14 Mar 07 Posts: 1957 Credit: 629,356 RAC: 0 Level
Scientific publications
|
application 6.36 is out
gdf |
|
|
|
Looks good!
http://www.gpugrid.net/result.php?resultid=2691597
|
|
|
GDFVolunteer moderator Project administrator Project developer Project tester Volunteer developer Volunteer tester Project scientist Send message
Joined: 14 Mar 07 Posts: 1957 Credit: 629,356 RAC: 0 Level
Scientific publications
|
Well,
ok it runs, we can start tests.
it is very slow at least on Windows 7.
Anyone with XP and SWAN_SYNC=0?
gdf |
|
|
skgivenVolunteer moderator Volunteer tester
Send message
Joined: 23 Apr 09 Posts: 3968 Credit: 1,995,359,260 RAC: 0 Level
Scientific publications
|
JG4KEZ(Koichi Soraku) has... |
|
|
|
6.36 worked in my GTX460 normally.
GPU load is around 95%.
http://www.gpugrid.net/result.php?resultid=2691958
http://www.gpugrid.net/result.php?resultid=2691640
<core_client_version>6.11.1</core_client_version>
<![CDATA[
<stderr_txt>
# Using device 1
# There are 2 devices supporting CUDA
# Device 0: "GeForce GTX 460"
# Clock rate: 1.40 GHz
# Total amount of global memory: 1073283072 bytes
# Number of multiprocessors: 7
# Number of cores: 56
# Device 1: "GeForce GTX 460"
# Clock rate: 1.40 GHz
# Total amount of global memory: 1073414144 bytes
# Number of multiprocessors: 7
# Number of cores: 56
SWAN: Using synchronization method 0
MDIO ERROR: cannot open file "restart.coor"
# Time per step (avg over 50000 steps): 11.216 ms
# Approximate elapsed time for entire WU: 560.783 s
called boinc_finish
</stderr_txt>
]]> |
|
|
GDFVolunteer moderator Project administrator Project developer Project tester Volunteer developer Volunteer tester Project scientist Send message
Joined: 14 Mar 07 Posts: 1957 Credit: 629,356 RAC: 0 Level
Scientific publications
|
half the speed of a GTX480.
gdf
6.36 worked in my GTX460 normally.
GPU load is around 95%.
http://www.gpugrid.net/result.php?resultid=2691958
http://www.gpugrid.net/result.php?resultid=2691640
<core_client_version>6.11.1</core_client_version>
<![CDATA[
<stderr_txt>
# Using device 1
# There are 2 devices supporting CUDA
# Device 0: "GeForce GTX 460"
# Clock rate: 1.40 GHz
# Total amount of global memory: 1073283072 bytes
# Number of multiprocessors: 7
# Number of cores: 56
# Device 1: "GeForce GTX 460"
# Clock rate: 1.40 GHz
# Total amount of global memory: 1073414144 bytes
# Number of multiprocessors: 7
# Number of cores: 56
SWAN: Using synchronization method 0
MDIO ERROR: cannot open file "restart.coor"
# Time per step (avg over 50000 steps): 11.216 ms
# Approximate elapsed time for entire WU: 560.783 s
called boinc_finish
</stderr_txt>
]]> |
|
|
skgivenVolunteer moderator Volunteer tester
Send message
Joined: 23 Apr 09 Posts: 3968 Credit: 1,995,359,260 RAC: 0 Level
Scientific publications
|
Perhaps the GTX460's are not using all of their shaders?
Anyway, half the speed of a GTX480 is not bad; a GTX460 costs less than half the price of a GTX480!
It would seem that under Win7 the 1GB version is 10% faster than the 786MB version:
http://www.gpugrid.net/result.php?resultid=2691759
http://www.gpugrid.net/result.php?resultid=2691597 |
|
|
GDFVolunteer moderator Project administrator Project developer Project tester Volunteer developer Volunteer tester Project scientist Send message
Joined: 14 Mar 07 Posts: 1957 Credit: 629,356 RAC: 0 Level
Scientific publications
|
Maybe we can order one and try to optimize for it, it should have done much better.
gdf |
|
|
Beyond Send message
Joined: 23 Nov 08 Posts: 1112 Credit: 6,162,416,256 RAC: 0 Level
Scientific publications
|
Well,
ok it runs, we can start tests.
it is very slow at least on Windows 7.
Anyone with XP and SWAN_SYNC=0?
gdf
Works here too but it's so slow that the 460 will be staying on Collatz for now. In Collatz the GPU usage is 99% and temp 56C. In GPUGRID the GPU usage is 46% and the temp 44C. Win7-32. Why can't we get an app that runs at 95+ percent when every other GPU project does that easily?
http://www.gpugrid.net/result.php?resultid=2691759
http://www.gpugrid.net/result.php?resultid=2691643
http://www.gpugrid.net/result.php?resultid=2691630 |
|
|
GDFVolunteer moderator Project administrator Project developer Project tester Volunteer developer Volunteer tester Project scientist Send message
Joined: 14 Mar 07 Posts: 1957 Credit: 629,356 RAC: 0 Level
Scientific publications
|
It's odd that your clocks are really low.
gdf
Well,
ok it runs, we can start tests.
it is very slow at least on Windows 7.
Anyone with XP and SWAN_SYNC=0?
gdf
Works here too but it's so slow that the 460 will be staying on Collatz for now. In Collatz the GPU usage is 99% and temp 56C. In GPUGRID the GPU usage is 46% and the temp 44C. Win7-32. Why can't we get an app that runs at 95+ percent when every other GPU project does that easily?
http://www.gpugrid.net/result.php?resultid=2691759
http://www.gpugrid.net/result.php?resultid=2691643
http://www.gpugrid.net/result.php?resultid=2691630
|
|
|
skgivenVolunteer moderator Volunteer tester
Send message
Joined: 23 Apr 09 Posts: 3968 Credit: 1,995,359,260 RAC: 0 Level
Scientific publications
|
Perhaps not just a reporting artefact, either from the driver or the Boinc Version:
Beyond (W7 x86)
NVIDIA GeForce GTX 460 (738MB) driver: 25856
Boinc 6.10.57
Clock rate: 0.81 GHz
JG4KEZ(Koichi Soraku) (XP x86)
[2] NVIDIA GeForce GTX 460 (1023MB) driver: 25896
Clock rate: 1.40 GHz
Bikermatt (W7 x64)
NVIDIA GeForce GTX 460 (993MB) driver: 25896
Clock rate: 1.40 GHz
W7 still reports less RAM than the card actually has.
As we can see from Bikermatt’s results, the Boinc/Driver upgrade would only get it to report the correct speed, and would not improve performance, however it does suggest something:
The difference in performance between an XP system and a W7 system is approximately 0.81/1.4 (to within a few percent) – an XP system takes about 58% of the time to do a similar task, ie 0.81/1.4 of the time.
This is also true of the older GF100 Fermi cards!
This could be a coincidence, but it makes me suspect the update (whatever it is; driver or Boinc version) that allows GPUGrid to display the correct frequency, is actually just displaying what the clock should be, rather than what they actually are, which is still 0.81 under W7 & Vista. This would also explain why the GF100 cards are also slower to about the same ratio. |
|
|
Beyond Send message
Joined: 23 Nov 08 Posts: 1112 Credit: 6,162,416,256 RAC: 0 Level
Scientific publications
|
Perhaps not just a reporting artefact, either from the driver or the Boinc Version:
Beyond (W7 x86)
NVIDIA GeForce GTX 460 (738MB) driver: 25856
Boinc 6.10.57
Clock rate: 0.81 GHz
JG4KEZ(Koichi Soraku) (XP x86)
[2] NVIDIA GeForce GTX 460 (1023MB) driver: 25896
Clock rate: 1.40 GHz
Bikermatt (W7 x64)
NVIDIA GeForce GTX 460 (993MB) driver: 25896
Clock rate: 1.40 GHz
W7 still reports less RAM than the card actually has.
Just updated the Win7-32 system from BOINC 6.10.57 to 6.10.58 and the NVidia drivers from 25856 to 25896. Still reports .81. Now runs at 49% and 45C, 200 watts system draw when running GPUGRID, 240+ when running Collatz (99% GPU). When I get a chance I'll try it with XP. Until then it's on Collatz or DNETC. We really shouldn't have to be downgrading OSes to run a project IMO though.
http://www.gpugrid.net/result.php?resultid=2692961
BTW, as stated before MSI Afterburner shows the card running correctly at: 800 core / 1600 shader / 1900 memory. |
|
|
skgivenVolunteer moderator Volunteer tester
Send message
Joined: 23 Apr 09 Posts: 3968 Credit: 1,995,359,260 RAC: 0 Level
Scientific publications
|
It looks like Boinc/GPUGrid/Drivers think it is a GF100 card.
Did you do a restart? |
|
|
|
Hi
Use 2x460 GPU's on this host, run "almost" fine
http://www.gpugrid.net/show_host_detail.php?hostid=75798 |
|
|
skgivenVolunteer moderator Volunteer tester
Send message
Joined: 23 Apr 09 Posts: 3968 Credit: 1,995,359,260 RAC: 0 Level
Scientific publications
|
Yes, they work, so that is a big +ve.
The downside is that the GTX460's are a little bit slower than expected, but these are only Betas, and that might change with improved applications. It was a good idea to put them on XP - the first XP x64 I have seen in a while ;)
I guess you are not using the swan_sync=0 variable? Try leaving a core free, to see if it improves performance (I expect it will).
Good Luck, |
|
|
GDFVolunteer moderator Project administrator Project developer Project tester Volunteer developer Volunteer tester Project scientist Send message
Joined: 14 Mar 07 Posts: 1957 Credit: 629,356 RAC: 0 Level
Scientific publications
|
I will be moving the beta to production today.
gdf |
|
|
nenymSend message
Joined: 31 Mar 09 Posts: 137 Credit: 1,308,230,581 RAC: 0 Level
Scientific publications
|
For the Fermi class GPU only? |
|
|
GDFVolunteer moderator Project administrator Project developer Project tester Volunteer developer Volunteer tester Project scientist Send message
Joined: 14 Mar 07 Posts: 1957 Credit: 629,356 RAC: 0 Level
Scientific publications
|
Yes, due to the bug in CUDA3.1.
gdf |
|
|