Error while computing

Smrt_HB
Smrt_HB
Joined: 29 Jan 18
Posts: 2
Credit: 129633558
RAC: 2819
Topic 228541

Gamma-ray pulsar binary search #1 on GPUs v1.28 () windows_x86_64 - All these tasks will fail after 8-12 seconds. They used to work fine for me. Thanks for the tips.

Task 1369517922

Jméno: LATeah3012L12221008_892.0_0_0.0_10684719_1

ID pracovní jednotky: 680317302

Vytvořen: 20 Oct 2022 18:17:43 UTC

Odesláno: 20 Oct 2022 18:24:41 UTC

Odevzdat do : 3 Nov 2022 18:24:41 UTC

Přijato: 20 Oct 2022 18:37:56 UTC

Stav serveru: Over

Výsledek: Computation error

Stav klienta: Compute error

Stav ukončení: 69 (0x00000045) Unknown error code

Počítač: 12639340

Run time (sec): 9.21

CPU time (sec): 6.67

Peak working set size (MB): 73.73

Peak swap size (MB): 187.95

Peak disk usage (MB): 0.01

Stav validace: Invalid

Přidělený kredit: 0

Aplikace: Gamma-ray pulsar binary search #1 on GPUs v1.28 (FGRPopencl2-ati)
windows_x86_64


Stderr output

<core_client_version>7.16.11</core_client_version>
<![CDATA[
<message>
Byl p - exit code 69 (0x45)</message>
<stderr_txt>
20:24:54 (6852): [normal]: This Einstein@home App was built at: Aug 17 2021 14:12:21

20:24:55 (6852): [normal]: Start of BOINC application 'projects/einstein.phys.uwm.edu/hsgamma_FGRPB1G_1.28_windows_x86_64__FGRPopencl2-ati.exe'.
20:24:55 (6852): [debug]: 1e+016 fp, 4e+009 fp/s, 2632334 s, 731h12m13s53
20:24:55 (6852): [normal]: % CPU usage: 1.000000, GPU usage: 1.000000
command line: projects/einstein.phys.uwm.edu/hsgamma_FGRPB1G_1.28_windows_x86_64__FGRPopencl2-ati.exe --inputfile ../../projects/einstein.phys.uwm.edu/LATeah3012L12221008.dat --alpha 2.59819959601 --delta -0.694603692878 --skyRadius 1.890770e-06 --ldiBins 15 --f0start 884.0 --f0Band 8.0 --firstSkyPoint 0 --numSkyPoints 1 --f1dot -1e-13 --f1dotBand 1e-13 --df1dot 1.69860773e-15 --ephemdir ..\..\projects\einstein.phys.uwm.edu\JPLEPH --Tcoh 2097152.0 --toplist 10 --cohFollow 10 --numCells 1 --useWeights 1 --Srefinement 1 --CohSkyRef 1 --cohfullskybox 1 --mmfu 0.1 --reftime 56100 --model 0 --f0orbit 0.005 --mismatch 0.1 --demodbinary 1 --BinaryPointFile ../../projects/einstein.phys.uwm.edu/templates_LATeah3012L12221008_0892_10684719.dat --debug 0 -o LATeah3012L12221008_892.0_0_0.0_10684719_1_0.out
output files: 'LATeah3012L12221008_892.0_0_0.0_10684719_1_0.out' '../../projects/einstein.phys.uwm.edu/LATeah3012L12221008_892.0_0_0.0_10684719_1_0' 'LATeah3012L12221008_892.0_0_0.0_10684719_1_0.out.cohfu' '../../projects/einstein.phys.uwm.edu/LATeah3012L12221008_892.0_0_0.0_10684719_1_1'
20:24:55 (6852): [debug]: Flags: X64 SSE SSE2 GNUC X86 GNUX86
20:24:55 (6852): [debug]: Set up communication with graphics process.
boinc_get_opencl_ids returned [00000000037d30f0 , 00007fff4f073490]
Using OpenCL platform provided by: Advanced Micro Devices, Inc.
Using OpenCL device "Baffin" by: Advanced Micro Devices, Inc.
Max allocation limit: 3422552064
Global mem size: 0
read_checkpoint(): Couldn't open file 'LATeah3012L12221008_892.0_0_0.0_10684719_1_0.out.cpt': No error (0)
% fft length: 16777216 (0x1000000)
% Scratch buffer size: 136314880
Error during OpenCL FFT (error: -5)
ERROR: gen_fft_execute() returned with error -1519625920
20:25:01 (6852): [CRITICAL]: ERROR: MAIN() returned with error '5'
FPU status flags: PRECISION
20:25:01 (6852): [normal]: done. calling boinc_finish(69).
20:25:01 (6852): called boinc_finish(69)

</stderr_txt>
]]>




 

Ian&Steve C.
Ian&Steve C.
Joined: 19 Jan 20
Posts: 3745
Credit: 35544952780
RAC: 36748737

disable beta tasks in your

disable beta tasks in your project preferences.

_________________________________________________________________________

Smrt_HB
Smrt_HB
Joined: 29 Jan 18
Posts: 2
Credit: 129633558
RAC: 2819

Yes, that was it. Thank

Yes, that was it.

Thank you.

Dark Angel
Dark Angel
Joined: 3 Jan 12
Posts: 4
Credit: 20909231
RAC: 2226

Getting multiple errors on

Getting multiple errors on these tasks as well.

Typical example:

https://einsteinathome.org/task/1392505783

 

Stderr output

<core_client_version>7.21.0</core_client_version>
<![CDATA[
<message>
process exited with code 11 (0xb, -245)</message>
<stderr_txt>
21:20:46 (2088535): [normal]: This Einstein@home App was built at: Aug 17 2021 16:19:40

21:20:46 (2088535): [normal]: Start of BOINC application '../../projects/einstein.phys.uwm.edu/hsgamma_FGRPB1G_1.28_x86_64-pc-linux-gnu__FGRPopencl2Pup-nvidia'.
21:20:46 (2088535): [debug]: 1e+16 fp, 2.9e+09 fp/s, 3597021 s, 999h10m21s44
21:20:46 (2088535): [normal]: % CPU usage: 1.000000, GPU usage: 1.000000
command line: ../../projects/einstein.phys.uwm.edu/hsgamma_FGRPB1G_1.28_x86_64-pc-linux-gnu__FGRPopencl2Pup-nvidia --inputfile ../../projects/einstein.phys.uwm.edu/LATeah4021L02.dat --alpha 0.943218186562 --delta 1.30995332125 --skyRadius 8.726650e-08 --ldiBins 30 --f0start 852.0 --f0Band 8.0 --firstSkyPoint 0 --numSkyPoints 1 --f1dot -1e-13 --f1dotBand 1e-13 --df1dot 1.413729381e-15 --ephemdir ../../projects/einstein.phys.uwm.edu/JPLEPH --Tcoh 2097152.0 --toplist 10 --cohFollow 10 --numCells 1 --useWeights 1 --Srefinement 1 --CohSkyRef 1 --cohfullskybox 1 --mmfu 0.1 --reftime 56100 --model 0 --f0orbit 0.005 --mismatch 0.1 --demodbinary 1 --BinaryPointFile ../../projects/einstein.phys.uwm.edu/templates_LATeah4021L02_0860_28810383.dat --debug 0 -o LATeah4021L02_860.0_0_0.0_28810383_1_0.out
output files: 'LATeah4021L02_860.0_0_0.0_28810383_1_0.out' '../../projects/einstein.phys.uwm.edu/LATeah4021L02_860.0_0_0.0_28810383_1_0' 'LATeah4021L02_860.0_0_0.0_28810383_1_0.out.cohfu' '../../projects/einstein.phys.uwm.edu/LATeah4021L02_860.0_0_0.0_28810383_1_1'
21:20:46 (2088535): [debug]: Flags: X64 SSE SSE2 GNUC X86 GNUX86
21:20:46 (2088535): [debug]: glibc version/release: 2.31/stable
21:20:46 (2088535): [debug]: Set up communication with graphics process.

-- signal handler called: signal 1
2 stack frames obtained for this thread:
Frame 2:
Binary file: ../../projects/einstein.phys.uwm.edu/hsgamma_FGRPB1G_1.28_x86_64-pc-linux-gnu__FGRPopencl2Pup-nvidia (0x48e401)
Source file: hs_boinc_extras.c (Function: sighandler / Line: 290)
Frame 1:
Binary file: /lib/x86_64-linux-gnu/libpthread.so.0 (0x7f6abf2f3420)
Offset info: +0x14420

End of stcaktrace
21:20:46 (2088535): called boinc_finish(11)

</stderr_txt>
]]>

 

https://einsteinathome.org/task/1392505884

Stderr output

<core_client_version>7.21.0</core_client_version>
<![CDATA[
<message>
process exited with code 11 (0xb, -245)</message>
<stderr_txt>
18:57:55 (2078913): [normal]: This Einstein@home App was built at: Aug 17 2021 16:19:40

18:57:55 (2078913): [normal]: Start of BOINC application '../../projects/einstein.phys.uwm.edu/hsgamma_FGRPB1G_1.28_x86_64-pc-linux-gnu__FGRPopencl2Pup-nvidia'.
18:57:55 (2078913): [debug]: 1e+16 fp, 2.9e+09 fp/s, 3597021 s, 999h10m21s44
18:57:55 (2078913): [normal]: % CPU usage: 1.000000, GPU usage: 1.000000
command line: ../../projects/einstein.phys.uwm.edu/hsgamma_FGRPB1G_1.28_x86_64-pc-linux-gnu__FGRPopencl2Pup-nvidia --inputfile ../../projects/einstein.phys.uwm.edu/LATeah4021L02.dat --alpha 0.943218186562 --delta 1.30995332125 --skyRadius 8.726650e-08 --ldiBins 30 --f0start 852.0 --f0Band 8.0 --firstSkyPoint 0 --numSkyPoints 1 --f1dot -1e-13 --f1dotBand 1e-13 --df1dot 1.413729381e-15 --ephemdir ../../projects/einstein.phys.uwm.edu/JPLEPH --Tcoh 2097152.0 --toplist 10 --cohFollow 10 --numCells 1 --useWeights 1 --Srefinement 1 --CohSkyRef 1 --cohfullskybox 1 --mmfu 0.1 --reftime 56100 --model 0 --f0orbit 0.005 --mismatch 0.1 --demodbinary 1 --BinaryPointFile ../../projects/einstein.phys.uwm.edu/templates_LATeah4021L02_0860_28841463.dat --debug 0 -o LATeah4021L02_860.0_0_0.0_28841463_0_0.out
output files: 'LATeah4021L02_860.0_0_0.0_28841463_0_0.out' '../../projects/einstein.phys.uwm.edu/LATeah4021L02_860.0_0_0.0_28841463_0_0' 'LATeah4021L02_860.0_0_0.0_28841463_0_0.out.cohfu' '../../projects/einstein.phys.uwm.edu/LATeah4021L02_860.0_0_0.0_28841463_0_1'
18:57:55 (2078913): [debug]: Flags: X64 SSE SSE2 GNUC X86 GNUX86
18:57:55 (2078913): [debug]: glibc version/release: 2.31/stable
18:57:55 (2078913): [debug]: Set up communication with graphics process.

-- signal handler called: signal 1
2 stack frames obtained for this thread:
Frame 2:
Binary file: ../../projects/einstein.phys.uwm.edu/hsgamma_FGRPB1G_1.28_x86_64-pc-linux-gnu__FGRPopencl2Pup-nvidia (0x48e401)
Source file: hs_boinc_extras.c (Function: sighandler / Line: 290)
Frame 1:
Binary file: /lib/x86_64-linux-gnu/libpthread.so.0 (0x7f3385fcf420)
Offset info: +0x14420

End of stcaktrace
18:57:55 (2078913): called boinc_finish(11)

</stderr_txt>
]]>

 

If letting these run helps someone at the back end sort things out I will, otherwise I'll disable test work units in my profile.

 

Comment viewing options

Select your preferred way to display the comments and click "Save settings" to activate your changes.