Unrecoverable error

Joined
Jan 4, 2003
Messages
8,039
Reaction score
846
Not impressed

I've had 4 of the above in 15 mins. It gives me the MS error popup and tells me that the data may not be recoverable. Only happening on the FAAH data so far.

GRRRRRRRR
 
Joined
Jan 14, 2006
Messages
12,268
Reaction score
283
Just had one myself on a unit that I have crunched for 4.5 hours. May be a problem at WCG??
 
Joined
Jan 4, 2003
Messages
8,039
Reaction score
846
Phew, Cheers m8, Thought it was just me cos no one else had mentioned it

Makes me feel a little better ;) taaa
 
Joined
Jan 4, 2003
Messages
8,039
Reaction score
846
Just had another one.

Starting to get a little annoyed again. I wonder what is causing the problem.

Edit**

7 so far, anyone else?
 

Adywebb

Growing old....
Moderator
Joined
Jan 1, 2005
Messages
5,459
Reaction score
21
If you are getting an MS error pop-up, it would indicate to me that the problem is on your system and not at WCG - something like a conflict causing Windows to terminate the program.
 
Joined
Jan 4, 2003
Messages
8,039
Reaction score
846
Well if it is me it's happening all of a sudden out of the blue. I remain sceptical on that one m8. Plus the fact Feckit had one too

Nothing in my system has changed!
 

Adywebb

Growing old....
Moderator
Joined
Jan 1, 2005
Messages
5,459
Reaction score
21
What I'm saying is that the fact you are getting a Windows error pop-up indicates a problem with the actual program, i.e. Boinc - or the process of saving the progress to disc.

Errors processing an actual work unit would only show up within the messages tab, likewise problems at WCG.

Is feckit actually getting Windows error pop-ups, or is it just an error with a unit?

I am having no problems with FAAH units here.
 
Joined
Jan 4, 2003
Messages
8,039
Reaction score
846
Dunno m8, I made the post and he told me he had the same thing within this same thread. He has only registered one tho I have had several
 

Adywebb

Growing old....
Moderator
Joined
Jan 1, 2005
Messages
5,459
Reaction score
21
Perhaps feckit can clarify later....

TD - when you get the error message, do you have to re-start Boinc?
 

Adywebb

Growing old....
Moderator
Joined
Jan 1, 2005
Messages
5,459
Reaction score
21
Honestly mate, I would say that is a boinc/windows issue rather than anything to do with WCG.

Personally, if it kept happening to me I would completely uninstall Boinc, use CCleaner to get rid of any remnants lying around, and install afresh.

Obviously that's a decision only you can take....
 

BigJay

Kingsize Crunchie
Joined
Sep 13, 2004
Messages
615
Reaction score
10
Event log containing 2 x unrecoverable error?


25/06/2006 16:40:03|World Community Grid|Reason: To report results
25/06/2006 16:40:03|World Community Grid|Reporting 1 results
25/06/2006 16:40:08|World Community Grid|Scheduler request to https://secure.worldcommunitygrid.org/boinc/wcg_cgi/fcgi succeeded
25/06/2006 19:32:07|World Community Grid|Result za057_01000_3 exited with zero status but no 'finished' file
25/06/2006 19:32:07|World Community Grid|If this happens repeatedly you may need to reset the project.
25/06/2006 19:32:07||request_reschedule_cpus: process exited
25/06/2006 19:32:07|World Community Grid|Restarting result za057_01000_3 using hpf2 version 506
25/06/2006 23:49:37|World Community Grid|Unrecoverable error for result za057_01000_3 ( - exit code 1282 (0x502))
25/06/2006 23:49:37||request_reschedule_cpus: process exited
25/06/2006 23:49:37|World Community Grid|Computation for result za057_01000_3 finished
25/06/2006 23:50:40|World Community Grid|Sending scheduler request to https://secure.worldcommunitygrid.org/boinc/wcg_cgi/fcgi
25/06/2006 23:50:40|World Community Grid|Reason: To fetch work
25/06/2006 23:50:40|World Community Grid|Requesting 432 seconds of new work, and reporting 1 results
25/06/2006 23:50:45|World Community Grid|Scheduler request to https://secure.worldcommunitygrid.org/boinc/wcg_cgi/fcgi succeeded
25/06/2006 23:50:48|World Community Grid|Started download of za060_00114_za060.fasta.gz
25/06/2006 23:50:48|World Community Grid|Started download of za060_00114_za060.psipred.gz
25/06/2006 23:50:52|World Community Grid|Finished download of za060_00114_za060.fasta.gz
25/06/2006 23:50:52|World Community Grid|Throughput 89 bytes/sec
25/06/2006 23:50:52|World Community Grid|Finished download of za060_00114_za060.psipred.gz
25/06/2006 23:50:52|World Community Grid|Throughput 101 bytes/sec
25/06/2006 23:50:52|World Community Grid|Started download of za060_00114_za060.psipred_ss2.gz
25/06/2006 23:50:52|World Community Grid|Started download of za060_00114_aaza06003_05.075_v1_3.gz
25/06/2006 23:50:55|World Community Grid|Finished download of za060_00114_za060.psipred_ss2.gz
25/06/2006 23:50:55|World Community Grid|Throughput 526 bytes/sec
25/06/2006 23:50:55|World Community Grid|Started download of za060_00114_aaza06009_05.075_v1_3.gz
25/06/2006 23:50:58|World Community Grid|Finished download of za060_00114_aaza06003_05.075_v1_3.gz
25/06/2006 23:50:58|World Community Grid|Throughput 55048 bytes/sec
25/06/2006 23:51:05|World Community Grid|Finished download of za060_00114_aaza06009_05.075_v1_3.gz
25/06/2006 23:51:05|World Community Grid|Throughput 78017 bytes/sec
25/06/2006 23:51:06||request_reschedule_cpus: files downloaded
25/06/2006 23:51:06|World Community Grid|Starting result za060_00114_2 using hpf2 version 506
26/06/2006 03:30:04|World Community Grid|Unrecoverable error for result za060_00114_2 ( - exit code -1073741819 (0xc0000005))
26/06/2006 03:30:04||request_reschedule_cpus: process exited
26/06/2006 03:30:04|World Community Grid|Computation for result za060_00114_2 finished
26/06/2006 03:31:08|World Community Grid|Sending scheduler request to https://secure.worldcommunitygrid.org/boinc/wcg_cgi/fcgi
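For anyone who wants to pull just the failures out of a long Boinc message log like the one above, here is a minimal Python sketch. It assumes the log is pipe-delimited as `timestamp|project|message`, which is how the pasted lines look, and that the error wording matches the two "Unrecoverable error" lines above. The names `parse_line`, `unrecoverable_errors` and `ERROR_RE` are made up for illustration and are not part of Boinc.

```python
import re
from typing import List, NamedTuple, Optional, Tuple


class LogEntry(NamedTuple):
    timestamp: str
    project: str   # empty for client-level messages
    message: str


# Assumed wording, copied from the two error lines in the log above.
ERROR_RE = re.compile(
    r"Unrecoverable error for result (?P<result>\S+) "
    r"\( - exit code (?P<code>-?\d+) \((?P<hex>0x[0-9a-fA-F]+)\)\)"
)


def parse_line(line: str) -> Optional[LogEntry]:
    """Split one log line into timestamp, project and message."""
    parts = line.strip().split("|", 2)
    if len(parts) != 3:
        return None  # not a log line in the assumed format
    return LogEntry(*parts)


def unrecoverable_errors(lines: List[str]) -> List[Tuple[str, str, int]]:
    """Return (timestamp, result name, exit code) for each failed result."""
    found = []
    for line in lines:
        entry = parse_line(line)
        if entry is None:
            continue
        match = ERROR_RE.search(entry.message)
        if match:
            found.append(
                (entry.timestamp, match.group("result"), int(match.group("code")))
            )
    return found


# A few lines taken from the log above.
SAMPLE = [
    "25/06/2006 19:32:07|World Community Grid|Result za057_01000_3 exited with zero status but no 'finished' file",
    "25/06/2006 19:32:07||request_reschedule_cpus: process exited",
    "25/06/2006 23:49:37|World Community Grid|Unrecoverable error for result za057_01000_3 ( - exit code 1282 (0x502))",
    "26/06/2006 03:30:04|World Community Grid|Unrecoverable error for result za060_00114_2 ( - exit code -1073741819 (0xc0000005))",
]

if __name__ == "__main__":
    for timestamp, result, code in unrecoverable_errors(SAMPLE):
        print(timestamp, result, code)
```

Run against the sample lines, it picks out the two failed results (za057_01000_3 and za060_00114_2) with their exit codes, which makes it easier to compare against any list of known error codes from the project.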
 

floppybootstomp

sugar 'n spikes
Moderator
Joined
Mar 5, 2002
Messages
20,281
Reaction score
1,794
I spoke too soon.

Two hpf2 units have crashed, wasting me a total of 9 hours.

The Boinc Manager cited 'Computation Error' in each case.

I'm switching to AIDS only until it's sorted as still no probs with faah units.

Latest version of Boinc, no tweak applied.
 

Adywebb

Growing old....
Moderator
Joined
Jan 1, 2005
Messages
5,459
Reaction score
21
BigJay said:
Event log containing 2 x unrecoverable error?
The event log CaptHook was referring to was the Windows Event Viewer (Start>Settings>Control Panel>Admin Tools>Event Viewer) to see if it showed the cause of the crash and error codes etc.

The Boinc log you have posted is something different, however it does obviously show a problem you have with crunching the new HPF2 units.

As Flops has done, I would switch to FAAH units only for a while until the WCG Techs sort it out :thumb:
 

Adywebb

Growing old....
Moderator
Joined
Jan 1, 2005
Messages
5,459
Reaction score
21
I have now had a couple of these new HPF2 units abort before completion losing 8 hours of work.

I suspect there may be a bug within the 9MB science application that is downloaded to everyone at the commencement of a new project, or possibly some corruption with the initial download.

I have posted over at WCG regarding this, and the Techs are looking into it - apparently this is normal at the commencement of a new project.

In the meantime, if anyone is having problems, I would switch over to Aids stuff only (FAAH units) for a while.

However this still doesn't explain TD's initial problem of crashes with FAAH units :confused:
 

Adywebb

Growing old....
Moderator
Joined
Jan 1, 2005
Messages
5,459
Reaction score
21
This has been posted at WCG earlier today:
There are a couple of errors in the HPF2 code that we are working on a fix for now and we expect to start development testing within the next hour or two for that fix. We hope to deploy that version to production within 24 to 48 hours. If this fix works as expected it should significantly reduce the errors.

The errors that we hope to reduce are as follows:

On Windows:
Exit Code 1282
Exit Code 10
Exit Code -1073741819

On Linux
Exit Code 1

We will not be able to know how much improvement we see until we have released the code into production and let it run for a couple of days.

We apologize for the error. Although we ran testing for an extended period (about 6 weeks) the fact of the matter is that all of you members processed more work in the first few hours after we launched than we had processed during our entire 6 week testing period. As such there were some conditions that we did not find during our testing.

As always - we thank you for your contribution and appreciate your support. It is your incredible contribution that makes it possible to process more work in a few hours than we can do in 6 weeks!
http://www.worldcommunitygrid.org/fo...ad?thread=7721

This covers the three Windows errors suffered by BigJay and myself - so hopefully things will be running smoothly soon
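As a side note on how the codes in the WCG list line up with the Boinc log: the log prints each exit code twice, once as a signed decimal and once in hex. The sketch below shows the conversion, assuming the decimal value is a signed 32-bit number (which is what the log lines suggest). The helper names `to_unsigned32` and `as_hex` are made up for illustration.

```python
def to_unsigned32(code: int) -> int:
    """Reinterpret a signed 32-bit exit code as an unsigned value."""
    return code & 0xFFFFFFFF


def as_hex(code: int) -> str:
    """Format an exit code the way the hex part of the log line looks."""
    return f"0x{to_unsigned32(code):x}"


if __name__ == "__main__":
    # The three Windows exit codes from the WCG post.
    for code in (1282, 10, -1073741819):
        print(code, as_hex(code))
```

So -1073741819 and 0xc0000005 are the same code. 0xC0000005 is the standard Windows access violation status, which would fit with Windows throwing up its own error pop-up when the science application crashes.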
 
