Message boards : Graphics cards (GPUs) : Temperatures
Author | Message |
---|---|
What temperatures do you run your cards at? I keep mine in the low 70s and 60s if I can. Have people been noticing a higher death rate with certain temps, from experience, not theoretical. | |
ID: 46215 | Rating: 0 | rate: / Reply Quote | |
86c for my 980ti 24/7 no errors due to that temp. Never had a card fail. | |
ID: 46216 | Rating: 0 | rate: / Reply Quote | |
60-65°C for my previous gtx980 (MSI) and my current gtx1070 and 1080, also 24/7. The latter Palit GameRock/Jetstream Cards are 2,5 slots high and therefore have extremely good cooling just by sheer radiator mass. Fast, cool and silent, but a little bulky and therefore maybe not everyones cup of tea. Have people been noticing a higher death rate with certain temps, from experience, not theoretical. Well... exactly that is difficult to say, you never know why a card was failing. You would have to run 1000 identical cards in parallel with different temps and try to get some statistics out of it. IMHO only a GPU manufacturer can possibly answer that. They surely do some load testing with their cards prior to release. ____________ I would love to see HCF1 protein folding and interaction simulations to help my little boy... someday. | |
ID: 46217 | Rating: 0 | rate: / Reply Quote | |
Try to keep mine at 75c or lower as from my experience this helps reduce errors, system crashes or bad data being created. | |
ID: 46218 | Rating: 0 | rate: / Reply Quote | |
My Maxwell GTX 980 Ti GPUs, which use GPU Boost v2.0, ship with the "Temp Target" set at 83*C. And I've noticed that the drivers will start dropping GPU clocks, when temps get near that. | |
ID: 46261 | Rating: 0 | rate: / Reply Quote | |
86c for my 980ti 24/7 no errors due to that temp. Never had a card fail. that's interesting to read. For how long have these cards been crunching? I also have two 980ti (Palit Jetstream) in one of my PCs, crunching 24/7, and with the NVIDIA Inspector I set the temp limit to 63°C. Am I too cautious? | |
ID: 46264 | Rating: 0 | rate: / Reply Quote | |
No. Regarding GPU temperatures the lower the better.86c for my 980ti 24/7 no errors due to that temp. Never had a card fail. Take a look at the pre-programmed behavior of the GPUs: as temperature rises, the GPU clock and voltage is reduced to preserve the chip. Not just the overclocking capabilities but the lifespan and temperature tolerance is a matter of silicon lottery. One chip could work for years at 86°C, another will fail after 6 months on that temperature. It's not just the temperature itself which is dangerous, but the expansion (of the chip and the PCB) caused by the change in temperature. Larger change in temperature causes lager expansion that is more wear. The chip and the soldering could withstand a limited number of thermal cycles, the less the expansion the more thermal cycles they could withstand (so the lifespan will be longer). The PCB has 6-8 layers, two of them is for the supply voltage, but the others have ten thousands of wires (and interconnects between the layers). If only one of them gets cut by the thermal expansion, the card will malfunction. The lifespan of capacitors is degraded by higher temperatures (that's why most of the manufacturers use "military standard" capacitors). Thermal cycle is one of the reasons for not to put the GPUs into sockets (the other is the limited width of the card). | |
ID: 46265 | Rating: 0 | rate: / Reply Quote | |
Thanks, Zoltan, for the thorough and informative technical explanations. | |
ID: 46271 | Rating: 0 | rate: / Reply Quote | |
Hello , I've been crunching occassionally with gigabyte geforce gtx-780. Would be nice to do it more but gpu-fans keep noise. How can I make the card silent (lower frequency or something)? I have ubuntu 16.04 , core2duo , driver is 340.101. (newer driver seems to cause computer start-up problems somehow). 'Nvidia x server settings' don't allow to adjust frequencies but is there a way to adjust that somehow? | |
ID: 46273 | Rating: 0 | rate: / Reply Quote | |
86c for my 980ti 24/7 no errors due to that temp. Never had a card fail. The 980ti has been running 24/7 for about a year but doesn't always hit my max 86c depends on WU. | |
ID: 46274 | Rating: 0 | rate: / Reply Quote | |
Hello , I've been crunching occassionally with gigabyte geforce gtx-780. Would be nice to do it more but gpu-fans keep noise. How can I make the card silent (lower frequency or something)? I have ubuntu 16.04 , core2duo , driver is 340.101. (newer driver seems to cause computer start-up problems somehow). 'Nvidia x server settings' don't allow to adjust frequencies but is there a way to adjust that somehow? As you already mentioned, there is http://www.phoronix.com/scan.php?px=MTY1OTM&page=news_item https://wiki.ubuntuusers.de/Overclocking/ From those instructions I would have assumed that you can lower the frequency for both the chip and the memory..? Another alternative would be installing a third Party cooler such as the Arctic Accelero. That one will keep the card cool and silent even at full crunching speed. ____________ I would love to see HCF1 protein folding and interaction simulations to help my little boy... someday. | |
ID: 46276 | Rating: 0 | rate: / Reply Quote | |
From a terminal,
Section "Screen"
Identifier "Screen0"
Device "Device0"
Monitor "Monitor0"
DefaultDepth 24
Option "coolbits" "12"
SubSection "Display"
Depth 24
EndSubSection
EndSection
Save the config file and restart. On restarting open NVIDIA X Server Settings Beneath your GPU (GPU0), select PowerMizer To reduce the GPU clock by 96MHz under Editable Performance Levels, Graphics Clock Offset enter -96. Similarly, to reduce the memory transfer clock by 100MHz enter -100. To set an audibly acceptable GPU fan speed click on Thermal Settings, Enable GPU Fan Settings and set the fan at something sensible (probably 60% or more) to test it. Keep an eye on the GPU Temperature, and adjust accordingly; so that it does not go too high. Up to 70C is usually fine (if not there's likely a problem with the GPU), if it's above that but below 80C adjust your settings or at least keep an eye on the temp and performance (look out for failures/system issues). If it's over 80C you could increase the fan speed further &/or reduce the GPU clock & memory clock further, testing as you go. Note that you need to reapply settings after restarting or create an .sh file and enter the settings and set them to run at startup. For multiple GPU's you need to add coolbits for each GPU (under a screen) & you might need specific drivers (375.20). The above is what I'm using for one GPU with 370.28 drivers. Didn't get anywhere with nvclock - might be defunct with 16.04. Your NV settings can be added to a .sh file, which can be set as an executable and added to the startup list: Right click on your desktop and Create a New Document, Empty Document and call it nv.sh (must end in .sh). Past in the following values (note these are for underclocking the GPU & memory and setting the fan), save and close the file, !/bin/bash nvidia-settings -a '[gpu:0]/GPUGraphicsClockOffset[3]=-96' nvidia-settings -a '[gpu:0]/GPUMemoryTransferRateOffset[3]=-100' nvidia-settings -a '[gpu:0]/GPUFanControlState=1' nvidia-settings -a '[fan:0]/GPUTargetFanSpeed=60' Right click on the nv.sh file and select Properties. Under the Permissions tab select Allow executing file as program and close it. Search your PC for Startup Applications and then Add the nv.sh file to the list (located on the desktop):
Command: /home/'username'/Desktop/nv.sh Comment: SetGPUandFanSpeeds
| |
ID: 46283 | Rating: 0 | rate: / Reply Quote | |
Another way to set the coolbits option is to open a terminal session and enter the following command: sudo nvidia-xconfig --cool-bits = 12 This will set the coolbits option in the xorg.conf file for you. Then restart and follow the instructions listed by skgiven. | |
ID: 46284 | Rating: 0 | rate: / Reply Quote | |
I have an EVGA 1080 FTW Hybrid. The waterblock keeps things COOL. Ambient room temp of up to 28-29C, I've never seen the GPU temp go above 51C. I have a 760 in the same box, it runs around 60-65C at the same time. Not worried about either, well within a good temp range for the cards. | |
ID: 46289 | Rating: 0 | rate: / Reply Quote | |
The GP104 chips have a low TDP, 140-180 watts, and since they throw just as big of a cooler on them as the 250 watt cards, they definitely stay cool. | |
ID: 46292 | Rating: 0 | rate: / Reply Quote | |
Currently running a EVGA GTX 1080 SC ACX 3.0. Previous card was EVGA 980Ti SC ACX 3.0. On GPUGrid, with both cards I rarely got much above 60C. The 1080 almost never goes above mid-50s. I do use a custom fan curve, but it never needs to go above 80% and usually runs about 60 - 70%. Part of that is due to having a very well ventilated case (Phanteks Enthoo Primo) and part of it is just due to the superior cooling ability of the ACX fans over the reference blower style. | |
ID: 46408 | Rating: 0 | rate: / Reply Quote | |
I have an EVGA 1080 FTW Hybrid. The waterblock keeps things COOL. Ambient room temp of up to 28-29C, I've never seen the GPU temp go above 51C. I have a 760 in the same box, it runs around 60-65C at the same time. Not worried about either, well within a good temp range for the cards. Does your card run at full boost on GPUGrid? My EVGA 1080 SC never wants to boost above base clock when running most BOINC projects. PrimeGrid has a few that always make it boost. I'm just trying to figure out if it's just my card or if BOINC projects are just not working the card hard enough to make it boost. Your temps are similar to mine, but on waterblock so I'm curious. | |
ID: 46409 | Rating: 0 | rate: / Reply Quote | |
Hello and Thank you all the coolbits-advisors. I somehow managed to get editable options to "Nvidia x-server settings" , obviously with a help of little luck. "Standing by" for further experiments. Thanks! | |
ID: 46429 | Rating: 0 | rate: / Reply Quote | |
96°C (205°F) is pretty much. <core_client_version>7.6.33</core_client_version>
<![CDATA[
<message>
(unknown error) - exit code -97 (0xffffff9f)
</message>
<stderr_txt>
# GPU [GeForce GTX 765M] Platform [Windows] Rev [3212] VERSION [65]
# SWAN Device 0 :
# Name : GeForce GTX 765M
# ECC : Disabled
# Global mem : 2048MB
# Capability : 3.0
# PCI ID : 0000:01:00.0
# Device clock : 862MHz
# Memory clock : 2004MHz
# Memory width : 128bit
# Driver version : r375_00 : 37633
# GPU 0 : 90C
# GPU 0 : 93C
# GPU 0 : 94C
# GPU 0 : 95C
# GPU [GeForce GTX 765M] Platform [Windows] Rev [3212] VERSION [65]
# SWAN Device 0 :
# Name : GeForce GTX 765M
# ECC : Disabled
# Global mem : 2048MB
# Capability : 3.0
# PCI ID : 0000:01:00.0
# Device clock : 862MHz
# Memory clock : 2004MHz
# Memory width : 128bit
# Driver version : r375_00 : 37633
# GPU 0 : 74C
# GPU 0 : 81C
# GPU 0 : 83C
# GPU 0 : 86C
# BOINC suspending at user request (exit)
# GPU [GeForce GTX 765M] Platform [Windows] Rev [3212] VERSION [65]
# SWAN Device 0 :
# Name : GeForce GTX 765M
# ECC : Disabled
# Global mem : 2048MB
# Capability : 3.0
# PCI ID : 0000:01:00.0
# Device clock : 862MHz
# Memory clock : 2004MHz
# Memory width : 128bit
# Driver version : r375_00 : 37633
# GPU 0 : 78C
# GPU 0 : 84C
# GPU 0 : 90C
# GPU 0 : 93C
# GPU 0 : 94C
# GPU 0 : 95C
# GPU 0 : 96C
# BOINC suspending at user request (exit)
# GPU [GeForce GTX 765M] Platform [Windows] Rev [3212] VERSION [65]
# SWAN Device 0 :
# Name : GeForce GTX 765M
# ECC : Disabled
# Global mem : 2048MB
# Capability : 3.0
# PCI ID : 0000:01:00.0
# Device clock : 862MHz
# Memory clock : 2004MHz
# Memory width : 128bit
# Driver version : r375_00 : 37633
# GPU 0 : 78C
# GPU 0 : 84C
# GPU 0 : 89C
# GPU 0 : 90C
# GPU 0 : 91C
# GPU 0 : 92C
# GPU [GeForce GTX 765M] Platform [Windows] Rev [3212] VERSION [65]
# SWAN Device 0 :
# Name : GeForce GTX 765M
# ECC : Disabled
# Global mem : 2048MB
# Capability : 3.0
# PCI ID : 0000:01:00.0
# Device clock : 862MHz
# Memory clock : 2004MHz
# Memory width : 128bit
# Driver version : r375_00 : 37633
# BOINC suspending at user request (exit)
# GPU [GeForce GTX 765M] Platform [Windows] Rev [3212] VERSION [65]
# SWAN Device 0 :
# Name : GeForce GTX 765M
# ECC : Disabled
# Global mem : 2048MB
# Capability : 3.0
# PCI ID : 0000:01:00.0
# Device clock : 862MHz
# Memory clock : 2004MHz
# Memory width : 128bit
# Driver version : r375_00 : 37633
# GPU 0 : 78C
# GPU 0 : 86C
# GPU 0 : 89C
# GPU 0 : 90C
# GPU 0 : 91C
# GPU 0 : 92C
# BOINC suspending at user request (exit)
# GPU [GeForce GTX 765M] Platform [Windows] Rev [3212] VERSION [65]
# SWAN Device 0 :
# Name : GeForce GTX 765M
# ECC : Disabled
# Global mem : 2048MB
# Capability : 3.0
# PCI ID : 0000:01:00.0
# Device clock : 862MHz
# Memory clock : 2004MHz
# Memory width : 128bit
# Driver version : r375_00 : 37633
# GPU 0 : 77C
# GPU 0 : 84C
# GPU 0 : 88C
# GPU 0 : 91C
# GPU 0 : 92C
# GPU 0 : 93C
# GPU 0 : 94C
# GPU 0 : 95C
# GPU 0 : 96C
# BOINC suspending at user request (exit)
# GPU [GeForce GTX 765M] Platform [Windows] Rev [3212] VERSION [65]
# SWAN Device 0 :
# Name : GeForce GTX 765M
# ECC : Disabled
# Global mem : 2048MB
# Capability : 3.0
# PCI ID : 0000:01:00.0
# Device clock : 862MHz
# Memory clock : 2004MHz
# Memory width : 128bit
# Driver version : r375_00 : 37633
# GPU 0 : 86C
# GPU 0 : 93C
# GPU 0 : 94C
# GPU 0 : 96C
# The simulation has become unstable. Terminating to avoid lock-up (1)
# Attempting restart (step 15255000)
# GPU [GeForce GTX 765M] Platform [Windows] Rev [3212] VERSION [65]
# SWAN Device 0 :
# Name : GeForce GTX 765M
# ECC : Disabled
# Global mem : 2048MB
# Capability : 3.0
# PCI ID : 0000:01:00.0
# Device clock : 862MHz
# Memory clock : 2004MHz
# Memory width : 128bit
# Driver version : r375_00 : 37633
# The simulation has become unstable. Terminating to avoid lock-up (1)
</stderr_txt>
]]> No, I was wrong. 96°C is too much. | |
ID: 46430 | Rating: 0 | rate: / Reply Quote | |
True ... so if one really, really wants to crunch on a Notebook I would strongly recommend to run TThrottle at least in order to keep temperatures below 80°C. | |
ID: 46431 | Rating: 0 | rate: / Reply Quote | |
Crunching on a laptop, even if the gpu is somehow magically cool, kills the fans in a matter of weeks, at most a matter of months. | |
ID: 46432 | Rating: 0 | rate: / Reply Quote | |
Hello folks , I think I got coolbits working , (gtx-780) , temperatures dropped down 5-6 degrees and fans got little more silent with underclocking that -105Mhz that the x-server setting page said to be minimum amount ,... but have anyone underclocked even more? Could it be harmfull to card to reduce the clock-frequency for example 300Mhz? | |
ID: 46450 | Rating: 0 | rate: / Reply Quote | |
This is a new world record: 99°C, and it's *not* a laptop. <core_client_version>7.6.22</core_client_version>
<![CDATA[
<message>
aborted by user
</message>
<stderr_txt>
# GPU [GeForce GT 530] Platform [Windows] Rev [3212] VERSION [65]
# SWAN Device 0 :
# Name : GeForce GT 530
# ECC : Disabled
# Global mem : 2048MB
# Capability : 2.1
# PCI ID : 0000:02:00.0
# Device clock : 1399MHz
# Memory clock : 793MHz
# Memory width : 128bit
# Driver version : r343_00 : 34520
# BOINC suspending at user request (exit)
# GPU [GeForce GT 530] Platform [Windows] Rev [3212] VERSION [65]
# SWAN Device 0 :
# Name : GeForce GT 530
# ECC : Disabled
# Global mem : 2048MB
# Capability : 2.1
# PCI ID : 0000:02:00.0
# Device clock : 1399MHz
# Memory clock : 793MHz
# Memory width : 128bit
# Driver version : r343_00 : 34520
# GPU 0 : 98C
# GPU 0 : 99C
# BOINC aborting at user request (exit)
</stderr_txt>
]]> | |
ID: 46501 | Rating: 0 | rate: / Reply Quote | |
Probably passive cooled, and did I real that right? 1399mhz? That can't be stock. | |
ID: 46502 | Rating: 0 | rate: / Reply Quote | |
This is a new world record: 99°C, and it's *not* a laptop. There was once a bug in EVGA Precision X, where, if you shutdown (with fast startup enabled, meaning it hibernates instead of shutting down), then started back up, it would forget your "Auto fan" setting. If your fan was at 0% at shutdown, it'd stay at 0% regardless of load or temp, since Auto was off. I woke up one morning to find my GPUs (GTX 660 Ti, GTX 460, I believe), all thermal-throttled at 100*C. They were happily crunching away, at some ridiculous clock like 320 MHz, but the thermal throttle saved the day. I'm glad I switched away from EVGA Precision X, and to MSI Afterburner. Said GPUs are still happily crunching at custom fan curves, at full clocks, no problems, to this day... with MSI Afterburner. | |
ID: 46504 | Rating: 0 | rate: / Reply Quote | |
I'm glad I switched away from EVGA Precision X, and to MSI Afterburner. Said GPUs are still happily crunching at custom fan curves, at full clocks, no problems, to this day... with MSI Afterburner. I'll drink to that. Never liked the EVGA app. Afterburner + TThrottle + BoincTasks all the way. | |
ID: 46513 | Rating: 0 | rate: / Reply Quote | |
Message boards : Graphics cards (GPUs) : Temperatures