MD5 check failed

Bernd Machenschalk
Bernd Machenschalk
Moderator
Administrator
Joined: 15 Oct 04
Posts: 4286
Credit: 245703935
RAC: 10921

The problem has been fixed

The problem has been fixed for now, i.e. newly generated workunits and tasks generated from old workunits should have the correct md5 checksum now.

The root cause of the problem has been identified and measurements are taken to prevent the same from happening in the future.

BM

Stranger7777
Stranger7777
Joined: 17 Mar 05
Posts: 436
Credit: 419982789
RAC: 37220

But I do have the same

But I do have the same promblem on a newly buit [HOST#12619532]

Moreover I have MD5 problem with lots of other files, like FGRP OpenCL NVidia-1k.exe.

Will try to change BOINC version. Let's see what happens.

bluestang
bluestang
Joined: 13 Apr 15
Posts: 34
Credit: 2492970228
RAC: 34

Having this problem again

Having this problem again today when adding a new host.  now it won't let me try and get new tasks because I reached my daily quota of 40 for the day because of this MD5 error.

New WUs/data files that were changed lately to blame?

Please fix (again)...thanks!

Gary Roberts
Gary Roberts
Moderator
Joined: 9 Feb 05
Posts: 5860
Credit: 111389868647
RAC: 33422322

bluestang wrote:... New

bluestang wrote:
... New WUs/data files that were changed lately to blame?

Unlikely I think.  There are no other reports I'm aware of.  I would expect to see other reports if it was happening to all new installs.  Last time, the problem was not caused by bad data or tasks - just a bad stored value for the checksum.

I'm also a bit surprised that with a GPU plus 32 CPUs, there is a daily limit of only 40.

Have you compared the file on the new host to the same file on one of your other hosts?  Have you checked the MD5sum stored in client_state.xml on each?  The previous occurrence was a bad MD5sum stored in the state file.  Do you have a bad MD5sum on the new host?  It should be the same on all hosts.  If you have, can you try editing the state file to change the bad value into the correct value?  That was the workaround used last time.

If all of those things turn out negative, I have seen examples of checksum failures being caused by bad RAM.  Since it's a new host, are you confident the RAM is good?

 

Cheers,
Gary.

Dr_Mabuse
Dr_Mabuse
Joined: 9 May 05
Posts: 11
Credit: 9364790
RAC: 93

Dear experts, I got

Dear experts,

I got repeatedly a checksum error trying to download new work unites.

See here:

28.09.2018 09:06:24 | Einstein@Home | Started download of LATeah0040F.dat 28.09.2018 09:06:24 | Einstein@Home | Started download of JPLEPH.405

28.09.2018 09:06:34 | Einstein@Home | Finished download of LATeah0040F.dat 28.09.2018 09:06:40 | Einstein@Home | work fetch suspended by user 28.09.2018 09:06:45 | Einstein@Home | task LATeah0040F_1192.0_114550_0.0_0 suspended by user 28.09.2018 09:06:46 | Einstein@Home | Finished download of JPLEPH.405 28.09.2018 09:06:46 | Einstein@Home | [error] MD5 check failed for JPLEPH.405 28.09.2018 09:06:46 | Einstein@Home | [error] expected d41d8cd98f00b204e9800998ecf8427e, got d6ce12bacd2a81a56423f5f238ba84eb 28.09.2018 09:06:46 | Einstein@Home | [error] Checksum or signature error for JPLEPH.405 28.09.2018 09:06:47 | Einstein@Home | task LATeah0040F_1192.0_114234_0.0_0 suspended by user 28.09.2018 09:06:48 | Einstein@Home | task LATeah0040F_1192.0_114155_0.0_0 suspended by user

 

Up to now I got 53 aborted tasks:

Task ID Workunit ID Computer Sent Time Reported or Deadline Status Run time CPU time Granted Credit  
 
LATeah0039F_1304.0_190864_0.0_2 370325765 12569019 26 Sep 2018 6:50:38 UTC 26 Sep 2018 6:52:41 UTC Aborted 0 0 0 Gamma-ray pulsar search #5 v1.08 () windows_intelx86

...

 LATeah0039F_1304.0_215354_0.0_1 370326075 12569019 26 Sep 2018 6:53:44 UTC 26 Sep 2018 6:57:42 UTC Error while downloading 0 0 0 Gamma-ray pulsar search #5 v1.08 () windows_intelx86
Atomic Booty
Atomic Booty
Joined: 10 Sep 05
Posts: 4
Credit: 401333
RAC: 0

I have also been getting

I have also been getting download errors on all my WU's here the past couple days. :( 

 

Task 789416778

Name: LATeah0041F_40.0_272_-9e-12_0
Workunit ID: 370784685
Created: 29 Sep 2018 5:30:10 GMT
Sent: 29 Sep 2018 5:56:30 GMT
Report deadline: 13 Oct 2018 5:56:30 GMT
Received: 29 Sep 2018 5:57:54 GMT
Server state: Over
Outcome: Computation error
Client state: Downloading
Exit status: -186 (0xFFFFFF46) ERR_RESULT_DOWNLOAD
Computer: 12669328
Run time (sec): 0.00
CPU time (sec): 0.00
Peak working set size (MB): 0
Peak swap size (MB): 0
Peak disk usage (MB): 0
Validation state: Invalid
Granted credit: 0
Application: Gamma-ray pulsar search #5 v1.08 (FGRPSSE) windows_intelx86

bluestang
bluestang
Joined: 13 Apr 15
Posts: 34
Credit: 2492970228
RAC: 34

Well, there is evidently an

Well, there is evidently an issue again with this as it's not just me.

Hopefully it will be fixed soon as I plan on adding more power for Einstein.

Atomic Booty
Atomic Booty
Joined: 10 Sep 05
Posts: 4
Credit: 401333
RAC: 0

I dug through some of last

I dug through some of last year's posts and seem to have it working. Here's what I did:

1. Exit BOINC manager, making sure all work has stopped.

2. Open your BOINC program files and look for "client_state.xml" (Mine was in C > Program Data > BOINC)

3. Open the file "client_state.xml" with a simple text editor like Notepad.

4. Search for the line:

<file>
    <name>hsgamma_FGRP5_XXX.exe</name>

The "XXX" part doesn't matter, it's just version number info. I did a Ctrl+F search for occurrences of "hsgamma" and found the right line almost immediately.

5. JUST BEFORE that ^ line you just found, paste in all this:

<file>
    <name>JPLEPH.405</name>
    <nbytes>9319680.000000</nbytes>
    <max_nbytes>0.000000</max_nbytes>
    <md5_cksum>d6ce12bacd2a81a56423f5f238ba84eb</md5_cksum>
    <status>1</status>
    <sticky/>
    <download_url>http://einstein8.aei.uni-hannover.de/EinsteinAtHome/download/29/JPLEPH.405</download_url>
</file>

 6. Save your newly edited "client_state.xml" file, overwriting the old one. Make sure it does NOT save as a Text Document (*.txt),  use the All Files (*.*) dropdown option. Also make sure it DOES save with ANSI coding (which seems pretty standard).

7. Restart BOINC

 

Thanks to everyone who had to work this out for themselves last time. Hope it helps somebody else!

Trotador
Trotador
Joined: 2 May 13
Posts: 58
Credit: 2122643213
RAC: 1

I've also got this error on a

I've also got this error on a new installation.

Bernd Machenschalk
Bernd Machenschalk
Moderator
Administrator
Joined: 15 Oct 04
Posts: 4286
Credit: 245703935
RAC: 10921

Sorry for the trouble, and

Sorry for the trouble, and thanks for the notification.

This is a result of a race condition between the two FGRP workunit generators. This should be fixed now (i.e. not happen again). Also too the broken md5sums were fixed in the database, i.e. new tasks should be sent out with the correct checksum even if they are instantiated from old workunits.

BM

Comment viewing options

Select your preferred way to display the comments and click "Save settings" to activate your changes.