GPU probelms-Bad work units
#1
I've recently started folding again and have been having a lot of bad work units, around 95% of work units are failing.

I've folded successfully in the past on this card (AMD r9 290X). Yesterday I tried updating driver and going back to an even older driver, uninstalled and reinstalled folding at home all without success.

Does anyone have any other suggestions or is my card dead?


20:36:49:WU00:FS00:0x22:Completed 40000 out of 1000000 steps (4%)
20:38:45:WU00:FS00:0x22:Completed 50000 out of 1000000 steps (5%)
20:38:58:WU00:FS00:0x22:Bad State detected... attempting to resume from last good checkpoint. Is your system overclocked?
20:38:58:WU00:FS00:0x22:Following exception occured: Particle coordinate is nan
20:39:03:WU00:FS00:0x22:Bad State detected... attempting to resume from last good checkpoint. Is your system overclocked?
20:39:03:WU00:FS00:0x22:Following exception occured: Particle coordinate is nan
20:39:09:WU00:FS00:0x22:Bad State detected... attempting to resume from last good checkpoint. Is your system overclocked?
20:39:09:WU00:FS00:0x22:Following exception occured: Particle coordinate is nan
20:39:10:WU00:FS00:0x22:ERROR:114: Max Retries Reached
20:39:10:WU00:FS00:0x22:Saving result file ..\logfile_01.txt
20:39:10:WU00:FS00:0x22:Saving result file badstate-0.xml
20:39:12:WU00:FS00:0x22:Saving result file badstate-1.xml
20:39:16:WU00:FS00:0x22:Saving result file badstate-2.xml
20:39:19:WU00:FS00:0x22:Saving result file checkpointState.xml
20:39:22:WU00:FS00:0x22:Saving result file checkpt.crc
20:39:22:WU00:FS00:0x22:Saving result file positions.xtc
20:39:22:WU00:FS00:0x22:Saving result file science.log
20:39:22:WU00:FS00:0x22:Folding@home Core Shutdown: BAD_WORK_UNIT
20:39:23:WARNING:WU00:FS00:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
20:39:23:WU00:FS00:Sending unit results: id:00 state:SEND error:FAULTY project:11744 run:0 clone:1445 gen:47 core:0x22 unit:0x0000004a8ca304f15e67e7f89db17c70
Reply
#2
Well, is your system overclocked?
Arrow
Reply
#3
Are you overcloking?

I had the same issue earlier this week and last week where I was getting bad wor units when they were like halfway through.

For one it said "Are you overclocking?" in the long and since then I turn off overclock when I fold and I haven't had any fail.
[Image: sigimage.php?u=714605&bg=1&c1=FFFFFF&c2=...&c4=0000CC][Image: sigimage.php?u=735689&bg=1&c1=FFFFFF&c2=...&c4=0000CC]
Reply
#4
Thanks for the replies. 

Its not overclocked.
Reply
#5
Did you uninstall your drivers before upgrading/downgrading? If you did, did you use DDU (DisplayDriverUninstaller) when uninstalling them? If you did it manually there could be traces of the old drivers left over so you should always use DDU to uninstall drivers, and if you didn't uninstall drivers at all you should definitely use DDU and reinstall the latest drivers

Something else you could try is limiting your GPU by lowering the power limit either through amd's software or something like msi afterburner, I would also check your temps (not sure if too high of temps could cause this issue or not but its always good to check especially on a old and power hungry card like the 290x)
The Following 1 User Says Thank You to David For This Useful Post:
  • edetarod
Reply
#6
(25th April 2020, 11:30 AM)David Wrote: Did you uninstall your drivers before upgrading/downgrading? If you did, did you use DDU (DisplayDriverUninstaller) when uninstalling them? If you did it manually there could be traces of the old drivers left over so you should always use DDU to uninstall drivers, and if you didn't uninstall drivers at all you should definitely use DDU and reinstall the latest drivers

Something else you could try is limiting your GPU by lowering the power limit either through amd's software or something like msi afterburner, I would also check your temps (not sure if too high of temps could cause this issue or not but its always good to check especially on a old and power hungry card like the 290x)

Ive left the drivers as they are, limited power down 5% in msi afterburner and saw an improvement in stability but not enough. Dropped it down a further 3%, limited clock speed and memory slightly,  and manually set fan speed to keep the temperature down and its been completely stable for almost 24 hours now. Completed 4 work units and around 400k points. 

I just find it strange that it used to fold fine without limiting the power, and now it wont. At least Im back folding
Reply


Forum Jump:


Users browsing this thread: 1 Guest(s)