climateprediction.net home page
Task 15680491

Task 15680491

Name hadam3p_eu_q5it_2004_1_008330292_2
Workunit 8481153
Created 23 Mar 2013, 20:52:15 UTC
Sent 23 Mar 2013, 20:52:19 UTC
Report deadline 6 Mar 2014, 2:12:19 UTC
Received 8 Apr 2013, 18:37:29 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1129146
Run time 3 days 7 hours 36 min 10 sec
CPU time 3 days 0 hours 45 min 29 sec
Validate state Invalid
Credit 1,988.94
Device peak FLOPS 3.23 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Europe v6.09
windows_intelx86
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<stderr_txt>
GlController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6028, selfPID=5084, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5852, selfPID=3796, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4064, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5708, selfPID=4988, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=932, selfPID=4964, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
20:40:03 (948): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:40:05 (948): No heartbeat from core client for 30 sec - exiting
20:40:06 (948): No heartbeat from core client for 30 sec - exiting
20:40:07 (948): No heartbeat from core client for 30 sec - exiting
20:40:08 (948): No heartbeat from core client for 30 sec - exiting
20:40:09 (948): No heartbeat from core client for 30 sec - exiting
20:40:10 (948): No heartbeat from core client for 30 sec - exiting
20:40:11 (948): No heartbeat from core client for 30 sec - exiting
20:40:12 (948): No heartbeat from core client for 30 sec - exiting
20:40:13 (948): No heartbeat from core client for 30 sec - exiting
20:40:14 (948): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7476, selfPID=1048, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5336, selfPID=4284, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5112, iMonCtr=2
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5364, selfPID=5072, iMonCtr=1
Model crash detected, will try to restart...
cpdnmonitor: error reading file D:\05_Climateprediction\02_Data/projects/climateprediction.net/hadam3p_eu_q5it_2004_1_008330292/dataout/atmos_restart.day

BUFFIN: Read Failed: Invalid argument
BUFFIN: C I/O Error ferror - Unit 21 - Return code = 32

Model crashed: READDUMP: BAD BUFFIN OF DATA                                                                                                                                                                                                                                    tmp/xaakm.pipe_dummy                                                            2048    
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5316, selfPID=5316, iMonCtr=2
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
<file_xfer_error>
  <file_name>hadam3p_eu_q5it_2004_1_008330292_2_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_q5it_2004_1_008330292_2_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
08 Apr 2013 17:39:34 1129146 15680491 hadam3p_eu_q5it_2004_1_008330292_2 115,296 259,641 2.2520
07 Apr 2013 10:58:37 1129146 15680491 hadam3p_eu_q5it_2004_1_008330292_2 103,776 233,943 2.2543
06 Apr 2013 17:01:30 1129146 15680491 hadam3p_eu_q5it_2004_1_008330292_2 92,256 207,583 2.2501
06 Apr 2013 09:23:52 1129146 15680491 hadam3p_eu_q5it_2004_1_008330292_2 80,745 181,714 2.2505
06 Apr 2013 08:23:30 1129146 15680491 hadam3p_eu_q5it_2004_1_008330292_2 80,736 181,338 2.2461
03 Apr 2013 15:40:46 1129146 15680491 hadam3p_eu_q5it_2004_1_008330292_2 69,216 155,597 2.2480
31 Mar 2013 19:32:51 1129146 15680491 hadam3p_eu_q5it_2004_1_008330292_2 57,696 130,789 2.2669
30 Mar 2013 14:05:20 1129146 15680491 hadam3p_eu_q5it_2004_1_008330292_2 46,176 104,260 2.2579
29 Mar 2013 16:50:57 1129146 15680491 hadam3p_eu_q5it_2004_1_008330292_2 34,656 77,915 2.2482
29 Mar 2013 10:01:22 1129146 15680491 hadam3p_eu_q5it_2004_1_008330292_2 23,136 52,245 2.2582
25 Mar 2013 21:15:41 1129146 15680491 hadam3p_eu_q5it_2004_1_008330292_2 11,616 26,164 2.2524


©2024 cpdn.org