Hardware Canucks


geokilla July 26, 2010 05:40 PM

*Updated* EUE Like Crazy!
 
Lately, my SMP client has been EUEing like crazy. I've already reinstalled the client, but that didn't fix much. I'm going to check my overclock next, but does anyone have a clue why I'm getting so many EUEs?

Code:

[22:29:13] Thank you for your contribution to Folding@Home.
[22:29:13] + Starting local stats count at 1
[22:29:19] Trying to send all finished work units
[22:29:19] + No unsent completed units remaining.
[22:29:19] - Preparing to get new work unit...
[22:29:19] Cleaning up work directory
[22:29:19] + Attempting to get work packet
[22:29:19] Passkey found
[22:29:19] - Will indicate memory of 4093 MB
[22:29:19] - Detect CPU. Vendor: AuthenticAMD, Family: 15, Model: 4, Stepping: 2
[22:29:19] - Connecting to assignment server
[22:29:19] Connecting to http://assign.stanford.edu:8080/
[22:29:20] Posted data.
[22:29:20] Initial: 40AB; - Successful: assigned to (171.64.65.54).
[22:29:20] + News From Folding@Home: Welcome to Folding@Home
[22:29:20] Loaded queue successfully.
[22:29:20] Sent data
[22:29:20] Connecting to http://171.64.65.54:8080/
[22:29:23] Posted data.
[22:29:23] Initial: 0000; - Receiving payload (expected size: 1780614)
[22:29:24] - Downloaded at ~1738 kB/s
[22:29:24] - Averaged speed for that direction ~1056 kB/s
[22:29:24] + Received work.
[22:29:24] Trying to send all finished work units
[22:29:24] + No unsent completed units remaining.
[22:29:24] + Closed connections
[22:29:24]
[22:29:24] + Processing work unit
[22:29:24] Core required: FahCore_a3.exe
[22:29:24] Core found.
[22:29:24] Working on queue slot 02 [July 26 22:29:24 UTC]
[22:29:24] + Working ...
[22:29:24] - Calling '.\FahCore_a3.exe -dir work/ -nice 19 -suffix 02 -np 4 -checkpoint 15 -verbose -lifeline 2960 -version 630'

[22:29:25]
[22:29:25] *------------------------------*
[22:29:25] Folding@Home Gromacs SMP Core
[22:29:25] Version 2.22 (Mar 12, 2010)
[22:29:25]
[22:29:25] Preparing to commence simulation
[22:29:25] - Looking at optimizations...
[22:29:25] - Created dyn
[22:29:25] - Files status OK
[22:29:25] - Expanded 1780102 -> 2059833 (decompressed 115.7 percent)
[22:29:25] Called DecompressByteArray: compressed_data_size=1780102 data_size=2059833, decompressed_data_size=2059833 diff=0
[22:29:25] - Digital signature verified
[22:29:25]
[22:29:25] Project: 6024 (Run 0, Clone 165, Gen 253)
[22:29:25]
[22:29:25] Assembly optimizations on if available.
[22:29:25] Entering M.D.
[22:29:31] Completed 0 out of 500000 steps  (0%)
[22:34:56] Completed 5000 out of 500000 steps  (1%)
[22:40:22] Completed 10000 out of 500000 steps  (2%)
[22:45:49] Completed 15000 out of 500000 steps  (3%)
[22:46:50] Gromacs cannot continue further.
[22:46:50] Going to send back what have done -- stepsTotalG=500000
[22:46:50] Work fraction=-1.#IND steps=500000.
[22:46:57] CoreStatus = C0000005 (-1073741819)
[22:46:57] Client-core communications error: ERROR 0xc0000005
[22:46:57] Deleting current work unit & continuing...
[22:47:09] Trying to send all finished work units
[22:47:09] + No unsent completed units remaining.
[22:47:09] - Preparing to get new work unit...
[22:47:09] Cleaning up work directory
[22:47:09] + Attempting to get work packet
[22:47:09] Passkey found
[22:47:09] - Will indicate memory of 4093 MB
[22:47:09] - Connecting to assignment server
[22:47:09] Connecting to http://assign.stanford.edu:8080/
[22:47:10] Posted data.
[22:47:10] Initial: ED82; - Successful: assigned to (130.237.232.140).
[22:47:10] + News From Folding@Home: Welcome to Folding@Home
[22:47:10] Loaded queue successfully.
[22:47:10] Sent data
[22:47:10] Connecting to http://130.237.232.140:8080/
[22:47:13] Posted data.
[22:47:13] Initial: 0000; - Receiving payload (expected size: 1799412)
[22:47:15] - Downloaded at ~878 kB/s
[22:47:15] - Averaged speed for that direction ~996 kB/s
[22:47:15] + Received work.
[22:47:15] + Closed connections
[22:47:20]
[22:47:20] + Processing work unit
[22:47:20] Core required: FahCore_a3.exe
[22:47:20] Core found.
[22:47:20] Working on queue slot 03 [July 26 22:47:20 UTC]
[22:47:20] + Working ...
[22:47:20] - Calling '.\FahCore_a3.exe -dir work/ -nice 19 -suffix 03 -np 4 -checkpoint 15 -verbose -lifeline 2960 -version 630'

[22:47:21]
[22:47:21] *------------------------------*
[22:47:21] Folding@Home Gromacs SMP Core
[22:47:21] Version 2.22 (Mar 12, 2010)
[22:47:21]
[22:47:21] Preparing to commence simulation
[22:47:21] - Looking at optimizations...
[22:47:21] - Created dyn
[22:47:21] - Files status OK
[22:47:21] - Expanded 1798900 -> 2396877 (decompressed 133.2 percent)
[22:47:21] Called DecompressByteArray: compressed_data_size=1798900 data_size=2396877, decompressed_data_size=2396877 diff=0
[22:47:21] - Digital signature verified
[22:47:21]
[22:47:21] Project: 6014 (Run 0, Clone 33, Gen 242)
[22:47:21]
[22:47:21] Assembly optimizations on if available.
[22:47:21] Entering M.D.
[22:47:27] Completed 0 out of 500000 steps  (0%)
[22:53:11] Completed 5000 out of 500000 steps  (1%)
[22:58:49] Completed 10000 out of 500000 steps  (2%)
[23:05:21] Completed 15000 out of 500000 steps  (3%)
[23:11:04] Completed 20000 out of 500000 steps  (4%)
[23:16:46] Completed 25000 out of 500000 steps  (5%)
[23:22:11] Completed 30000 out of 500000 steps  (6%)
[23:27:50] Completed 35000 out of 500000 steps  (7%)
[23:28:59] CoreStatus = C0000005 (-1073741819)
[23:28:59] Client-core communications error: ERROR 0xc0000005
[23:28:59] Deleting current work unit & continuing...
[23:29:12] Trying to send all finished work units
[23:29:12] + No unsent completed units remaining.
[23:29:12] - Preparing to get new work unit...
[23:29:12] Cleaning up work directory
[23:29:12] + Attempting to get work packet
[23:29:12] Passkey found
[23:29:12] - Will indicate memory of 4093 MB
[23:29:12] - Connecting to assignment server
[23:29:12] Connecting to http://assign.stanford.edu:8080/
[23:29:12] Posted data.
[23:29:12] Initial: 40AB; - Successful: assigned to (171.64.65.56).
[23:29:12] + News From Folding@Home: Welcome to Folding@Home
[23:29:12] Loaded queue successfully.
[23:29:12] Sent data
[23:29:12] Connecting to http://171.64.65.56:8080/
[23:29:15] Posted data.
[23:29:15] Initial: 0000; - Receiving payload (expected size: 764524)
[23:29:15] Conversation time very short, giving reduced weight in bandwidth avg
[23:29:15] - Downloaded at ~1493 kB/s
[23:29:15] - Averaged speed for that direction ~1067 kB/s
[23:29:15] + Received work.
[23:29:15] + Closed connections
[23:29:20]
[23:29:20] + Processing work unit
[23:29:20] Core required: FahCore_a3.exe
[23:29:20] Core found.
[23:29:20] Working on queue slot 04 [July 26 23:29:20 UTC]
[23:29:20] + Working ...
[23:29:20] - Calling '.\FahCore_a3.exe -dir work/ -nice 19 -suffix 04 -np 4 -checkpoint 15 -verbose -lifeline 2960 -version 630'

[23:29:20]
[23:29:20] *------------------------------*
[23:29:20] Folding@Home Gromacs SMP Core
[23:29:20] Version 2.22 (Mar 12, 2010)
[23:29:20]
[23:29:20] Preparing to commence simulation
[23:29:20] - Looking at optimizations...
[23:29:20] - Created dyn
[23:29:20] - Files status OK
[23:29:21] - Expanded 764012 -> 1404481 (decompressed 183.8 percent)
[23:29:21] Called DecompressByteArray: compressed_data_size=764012 data_size=1404481, decompressed_data_size=1404481 diff=0
[23:29:21] - Digital signature verified
[23:29:21]
[23:29:21] Project: 6702 (Run 8, Clone 55, Gen 11)
[23:29:21]
[23:29:21] Assembly optimizations on if available.
[23:29:21] Entering M.D.
[23:29:27] Completed 0 out of 2000000 steps  (0%)
[23:41:44] Completed 20000 out of 2000000 steps  (1%)
[23:54:11] Completed 40000 out of 2000000 steps  (2%)
[00:07:47] Completed 60000 out of 2000000 steps  (3%)
[00:20:31] CoreStatus = C0000005 (-1073741819)
[00:20:31] Client-core communications error: ERROR 0xc0000005
[00:20:31] - Attempting to download new core...
[00:20:31] + Downloading new core: FahCore_a3.exe
[00:20:31] Downloading core (/~pande/Win32/x86/Core_a3.fah from Stanford University)
[00:20:34] Initial: AFDE; + 10240 bytes downloaded
[00:20:34] Initial: D8B1; + 20480 bytes downloaded
[00:20:34] Initial: 7D98; + 30720 bytes downloaded
[00:20:34] Initial: FB47; + 40960 bytes downloaded
[00:20:34] Initial: C727; + 51200 bytes downloaded
[00:20:34] Initial: 3959; + 61440 bytes downloaded
core continues to download
[00:20:36] Initial: 3ABE; + 2711113 bytes downloaded
[00:20:36] Verifying core Core_a3.fah...
[00:20:36] Signature is VALID
[00:20:36]
[00:20:36] Trying to unzip core FahCore_a3.exe
[00:20:37] Decompressed FahCore_a3.exe (9325056 bytes) successfully
[00:20:42] + Core successfully engaged
[00:20:42] Deleting current work unit & continuing...
[00:20:54] Trying to send all finished work units
[00:20:54] + No unsent completed units remaining.
[00:20:54] - Preparing to get new work unit...
[00:20:54] Cleaning up work directory
[00:20:54] + Attempting to get work packet
[00:20:54] Passkey found
[00:20:54] - Will indicate memory of 4093 MB
[00:20:54] - Connecting to assignment server
[00:20:54] Connecting to http://assign.stanford.edu:8080/
[00:20:55] Posted data.
[00:20:55] Initial: 40AB; - Successful: assigned to (171.64.65.56).
[00:20:55] + News From Folding@Home: Welcome to Folding@Home
[00:20:55] Loaded queue successfully.
[00:20:55] Sent data
[00:20:55] Connecting to http://171.64.65.56:8080/
[00:20:58] Posted data.
[00:20:58] Initial: 0000; - Receiving payload (expected size: 763733)
[00:20:58] Conversation time very short, giving reduced weight in bandwidth avg
[00:20:58] - Downloaded at ~1491 kB/s
[00:20:58] - Averaged speed for that direction ~1114 kB/s
[00:20:58] + Received work.
[00:20:58] + Closed connections
[00:21:03]
[00:21:03] + Processing work unit
[00:21:03] Core required: FahCore_a3.exe
[00:21:03] Core found.
[00:21:03] Working on queue slot 05 [July 27 00:21:03 UTC]
[00:21:03] + Working ...
[00:21:03] - Calling '.\FahCore_a3.exe -dir work/ -nice 19 -suffix 05 -np 4 -checkpoint 15 -verbose -lifeline 2960 -version 630'

[00:21:03]
[00:21:03] *------------------------------*
[00:21:03] Folding@Home Gromacs SMP Core
[00:21:03] Version 2.22 (Mar 12, 2010)
[00:21:03]
[00:21:03] Preparing to commence simulation
[00:21:03] - Looking at optimizations...
[00:21:03] - Created dyn
[00:21:03] - Files status OK
[00:21:04] - Expanded 763221 -> 1404481 (decompressed 184.0 percent)
[00:21:04] Called DecompressByteArray: compressed_data_size=763221 data_size=1404481, decompressed_data_size=1404481 diff=0
[00:21:04] - Digital signature verified
[00:21:04]
[00:21:04] Project: 6701 (Run 88, Clone 39, Gen 24)
[00:21:04]
[00:21:04] Assembly optimizations on if available.
[00:21:04] Entering M.D.
[00:21:10] Completed 0 out of 2000000 steps  (0%)
[00:33:57] Completed 20000 out of 2000000 steps  (1%)
[00:36:12] Killing all core threads
[00:36:12] Could not get process id information.  Please kill core process manually

Folding@Home Client Shutdown at user request.
[00:36:12] ***** Got a SIGTERM signal (2)
[00:36:12] Killing all core threads
[00:36:12] Could not get process id information.  Please kill core process manually

Folding@Home Client Shutdown.
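
A log this long is easier to read once it is reduced to one row per work unit. The following is a minimal Python sketch, assuming the line shapes seen in the log above ("Project: ...", "Completed N out of M steps", "CoreStatus = ..."); other client versions may format these lines differently. The function name `summarize_core_exits` and the shortened `sample` text are illustrative only and not part of the Folding@Home client. It records every CoreStatus line it sees and does not try to tell normal exits from errors.

```python
import re
from collections import Counter

# Patterns follow the line shapes in the pasted log (assumption: other
# client versions may word these lines differently).
PROJECT_RE = re.compile(r"Project: (\d+) \(Run (\d+), Clone (\d+), Gen (\d+)\)")
STEP_RE = re.compile(r"Completed (\d+) out of (\d+) steps")
STATUS_RE = re.compile(r"CoreStatus = ([0-9A-Fa-f]+) \((-?\d+)\)")


def summarize_core_exits(log_text):
    """Return one dict per work unit that ended with a CoreStatus line."""
    exits = []
    project = None      # (project, run, clone, gen) of the current unit
    last_step = None    # (completed, total) from the latest progress line
    for line in log_text.splitlines():
        m = PROJECT_RE.search(line)
        if m:
            project = tuple(int(g) for g in m.groups())
            last_step = None
            continue
        m = STEP_RE.search(line)
        if m:
            last_step = (int(m.group(1)), int(m.group(2)))
            continue
        m = STATUS_RE.search(line)
        if m and project is not None:
            exits.append({
                "project": project,
                "status": m.group(1).upper(),
                "last_step": last_step,
            })
            project = None
    return exits


# Hypothetical, shortened excerpt in the same format as the log above.
sample = """\
[22:29:25] Project: 6024 (Run 0, Clone 165, Gen 253)
[22:45:49] Completed 15000 out of 500000 steps  (3%)
[22:46:57] CoreStatus = C0000005 (-1073741819)
[22:47:21] Project: 6014 (Run 0, Clone 33, Gen 242)
[23:27:50] Completed 35000 out of 500000 steps  (7%)
[23:28:59] CoreStatus = C0000005 (-1073741819)
"""

exits = summarize_core_exits(sample)
counts = Counter(e["status"] for e in exits)
for e in exits:
    print(e["project"], e["status"], e["last_step"])
print(dict(counts))
```

If several different projects stop with the same status code after only a few percent, as they do in the log above, that points more toward the machine than toward one bad work unit, though a summary like this cannot prove it.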


Sagath July 26, 2010 05:47 PM

Delete the core (just the core), and let it redownload the latest. That usually fixes EUE's. Unless your card is shot of course.

chrisk July 26, 2010 05:47 PM

Corestatus C0000005 can mean memory errors according to this link:
CoreStatus codes - FaHWiki
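
For reference, the two numbers on the CoreStatus log line are the same 32-bit value shown once in hex and once as a signed decimal. The Python sketch below shows the conversion. The helper name `to_signed32` and the one-entry lookup table are illustrative; the only mapping included is the standard Windows NTSTATUS name for 0xC0000005, and an access violation on its own does not prove that the RAM is faulty.

```python
def to_signed32(value):
    """Reinterpret an unsigned 32-bit value as a signed 32-bit integer."""
    value &= 0xFFFFFFFF
    return value - 0x100000000 if value & 0x80000000 else value


# Illustrative lookup with a single well-known Windows NTSTATUS code.
KNOWN_STATUS = {
    0xC0000005: "STATUS_ACCESS_VIOLATION",
}

code = int("C0000005", 16)   # hex field from the log line
signed = to_signed32(code)   # decimal field from the same log line
name = KNOWN_STATUS.get(code, "unknown")
print(hex(code), signed, name)
```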

Check your OC. Run memtest first and check for memory errors, then run a Prime95 blend test for a while.

Also, it never hurts to make sure you install outside the Program Files folder under Vista/7.

geokilla July 26, 2010 06:14 PM

Quote:

Originally Posted by Sagath (Post 407958)
Delete the core (just the core), and let it redownload the latest. That usually fixes EUE's. Unless your card is shot of course.

The program already did that, which leads me to believe it's more of an OC error.

Quote:

Originally Posted by chriskwarren (Post 407959)
Corestatus C0000005 can mean memory errors according to this link:
CoreStatus codes - FaHWiki

Check your OC. Run memtest first and check for memory errors, then run a Prime95 blend test for a while.

Also, it never hurts to make sure you install outside the Program Files folder under Vista/7.

Hmm, I ran it before when I first set up the computer. Time to run it again. I did change some settings in the BIOS though, maybe that's why I'm getting errors :blarg:

The install is in a folder of its own already.

chrisk July 26, 2010 06:17 PM

You can also try turning off UAC, and make sure you run as admin as outlined in Zero's guide in the stickies. Windows might be managing memory for F@H funny.

But start with memtest86+, and then Prime95 as I mentioned. Looks like a memory or memory controller issue.

sswilson July 26, 2010 06:29 PM

From personal experience, I put this kind of thing down to OC failure due to higher ambient temps in the summer. I've seen this kind of behaviour so many times now that I make it a point to dial my OC down a couple of notches after the first couple of EUEs I see in the spring/summer months.

_dangtx_ July 26, 2010 07:19 PM

Same here, watch ya temps. A 2°C difference will make it EUE.

ilya July 26, 2010 08:37 PM

Quote:

Originally Posted by _dangtx_ (Post 407994)
Same here, watch ya temps. A 2°C difference will make it EUE.

That would explain why I lose stability running GPU3 when I increase voltage. So hard to fine tune the OC for folding with this heat. ><

geokilla July 26, 2010 09:04 PM

It was the RAM.

I achieved my OC in this summer heat, so in the winter I might actually be able to push it to 4 GHz.

_dangtx_ July 26, 2010 11:11 PM

lately i gave up on clocks, focusing on stability. too many headaches for only 20% more points lol


All times are GMT -7.