Task 14877917

Name hadam3p_pnw_bild_1977_1_008032639_0
Workunit 8187753
Created 8 Jul 2012, 17:16:10 UTC
Sent 8 Jul 2012, 17:16:32 UTC
Report deadline 20 Jun 2013, 22:36:32 UTC
Received 11 Aug 2012, 18:40:48 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1007897
Run time 2 days 17 hours 36 min 33 sec
CPU time 2 days 15 hours 16 min 42 sec
Validate state Invalid
Credit 1,503.98
Device peak FLOPS 2.39 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Pacific North West v6.09 windows_intelx86
Stderr
<core_client_version>6.12.34</core_client_version>
<![CDATA[
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2632, iMonCtr=2
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=9524, selfPID=9524, iMonCtr=2
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4036, selfPID=4036, iMonCtr=2
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4428, iMonCtr=2
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8184, selfPID=8184, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6664, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2728, selfPID=2728, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8600, selfPID=8600, iMonCtr=2
Suspended CPDN Monitor - Suspend request from BOINC...
15:29:13 (5212): No heartbeat from core client for 30 sec - exiting
15:29:14 (5212): No heartbeat from core client for 30 sec - exiting
15:29:15 (5212): No heartbeat from core client for 30 sec - exiting
15:29:16 (5212): No heartbeat from core client for 30 sec - exiting
15:29:17 (5212): No heartbeat from core client for 30 sec - exiting
15:29:18 (5212): No heartbeat from core client for 30 sec - exiting
15:29:19 (5212): No heartbeat from core client for 30 sec - exiting
15:29:20 (5212): No heartbeat from core client for 30 sec - exiting
15:29:21 (5212): No heartbeat from core client for 30 sec - exiting
15:29:22 (5212): No heartbeat from core client for 30 sec - exiting
15:29:23 (5212): No heartbeat from core client for 30 sec - exiting
15:29:24 (5212): No heartbeat from core client for 30 sec - exiting
15:29:25 (5212): No heartbeat from core client for 30 sec - exiting
15:29:26 (5212): No heartbeat from core client for 30 sec - exiting
15:29:27 (5212): No heartbeat from core client for 30 sec - exiting
15:29:28 (5212): No heartbeat from core client for 30 sec - exiting
15:29:29 (5212): No heartbeat from core client for 30 sec - exiting
15:29:30 (5212): No heartbeat from core client for 30 sec - exiting
15:29:31 (5212): No heartbeat from core client for 30 sec - exiting
15:29:32 (5212): No heartbeat from core client for 30 sec - exiting
15:29:33 (5212): No heartbeat from core client for 30 sec - exiting
15:29:34 (5212): No heartbeat from core client for 30 sec - exiting
15:29:35 (5212): No heartbeat from core client for 30 sec - exiting
15:29:36 (5212): No heartbeat from core client for 30 sec - exiting
15:29:37 (5212): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4772, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
14:57:40 (4668): No heartbeat from core client for 30 sec - exiting
14:57:41 (4668): No heartbeat from core client for 30 sec - exiting
14:57:42 (4668): No heartbeat from core client for 30 sec - exiting
14:57:43 (4668): No heartbeat from core client for 30 sec - exiting
14:57:44 (4668): No heartbeat from core client for 30 sec - exiting
14:57:45 (4668): No heartbeat from core client for 30 sec - exiting
14:57:46 (4668): No heartbeat from core client for 30 sec - exiting
14:57:47 (4668): No heartbeat from core client for 30 sec - exiting
14:57:48 (4668): No heartbeat from core client for 30 sec - exiting
14:57:49 (4668): No heartbeat from core client for 30 sec - exiting
14:57:50 (4668): No heartbeat from core client for 30 sec - exiting
14:57:51 (4668): No heartbeat from core client for 30 sec - exiting
14:57:52 (4668): No heartbeat from core client for 30 sec - exiting
14:57:53 (4668): No heartbeat from core client for 30 sec - exiting
14:57:54 (4668): No heartbeat from core client for 30 sec - exiting
14:57:55 (4668): No heartbeat from core client for 30 sec - exiting
14:57:56 (4668): No heartbeat from core client for 30 sec - exiting
14:57:57 (4668): No heartbeat from core client for 30 sec - exiting
14:57:58 (4668): No heartbeat from core client for 30 sec - exiting
14:57:59 (4668): No heartbeat from core client for 30 sec - exiting
14:58:00 (4668): No heartbeat from core client for 30 sec - exiting
14:58:01 (4668): No heartbeat from core client for 30 sec - exiting
14:58:02 (4668): No heartbeat from core client for 30 sec - exiting
14:58:03 (4668): No heartbeat from core client for 30 sec - exiting
14:58:04 (4668): No heartbeat from core client for 30 sec - exiting
14:58:05 (4668): No heartbeat from core client for 30 sec - exiting
14:58:06 (4668): No heartbeat from core client for 30 sec - exiting
14:58:07 (4668): No heartbeat from core client for 30 sec - exiting
14:58:08 (4668): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6496, selfPID=6496, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2516, selfPID=2516, iMonCtr=2
16:07:46 (7888): No heartbeat from core client for 30 sec - exiting
16:07:47 (7888): No heartbeat from core client for 30 sec - exiting
16:07:48 (7888): No heartbeat from core client for 30 sec - exiting
16:07:49 (7888): No heartbeat from core client for 30 sec - exiting
16:07:50 (7888): No heartbeat from core client for 30 sec - exiting
16:07:51 (7888): No heartbeat from core client for 30 sec - exiting
16:07:52 (7888): No heartbeat from core client for 30 sec - exiting
16:07:53 (7888): No heartbeat from core client for 30 sec - exiting
16:07:54 (7888): No heartbeat from core client for 30 sec - exiting
16:07:55 (7888): No heartbeat from core client for 30 sec - exiting
16:07:56 (7888): No heartbeat from core client for 30 sec - exiting
16:07:57 (7888): No heartbeat from core client for 30 sec - exiting
16:07:58 (7888): No heartbeat from core client for 30 sec - exiting
16:07:59 (7888): No heartbeat from core client for 30 sec - exiting
16:08:00 (7888): No heartbeat from core client for 30 sec - exiting
16:08:01 (7888): No heartbeat from core client for 30 sec - exiting
16:08:02 (7888): No heartbeat from core client for 30 sec - exiting
16:08:03 (7888): No heartbeat from core client for 30 sec - exiting
16:08:04 (7888): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
10:56:51 (3348): No heartbeat from core client for 30 sec - exiting
10:56:52 (3348): No heartbeat from core client for 30 sec - exiting
10:56:53 (3348): No heartbeat from core client for 30 sec - exiting
10:56:54 (3348): No heartbeat from core client for 30 sec - exiting
10:56:55 (3348): No heartbeat from core client for 30 sec - exiting
10:56:56 (3348): No heartbeat from core client for 30 sec - exiting
10:56:57 (3348): No heartbeat from core client for 30 sec - exiting
10:56:58 (3348): No heartbeat from core client for 30 sec - exiting
Regional yearly means requires 12 input files got 6
10:56:59 (3348): No heartbeat from core client for 30 sec - exiting
10:57:00 (3348): No heartbeat from core client for 30 sec - exiting
10:57:01 (3348): No heartbeat from core client for 30 sec - exiting
10:57:02 (3348): No heartbeat from core client for 30 sec - exiting
10:57:03 (3348): No heartbeat from core client for 30 sec - exiting
10:57:04 (3348): No heartbeat from core client for 30 sec - exiting
10:57:05 (3348): No heartbeat from core client for 30 sec - exiting
10:57:06 (3348): No heartbeat from core client for 30 sec - exiting
10:57:07 (3348): No heartbeat from core client for 30 sec - exiting
10:57:08 (3348): No heartbeat from core client for 30 sec - exiting
10:57:09 (3348): No heartbeat from core client for 30 sec - exiting
10:57:10 (3348): No heartbeat from core client for 30 sec - exiting
10:57:11 (3348): No heartbeat from core client for 30 sec - exiting
10:57:12 (3348): No heartbeat from core client for 30 sec - exiting
10:57:13 (3348): No heartbeat from core client for 30 sec - exiting
10:57:14 (3348): No heartbeat from core client for 30 sec - exiting
10:57:15 (3348): No heartbeat from core client for 30 sec - exiting
10:57:16 (3348): No heartbeat from core client for 30 sec - exiting
10:57:17 (3348): No heartbeat from core client for 30 sec - exiting
10:57:18 (3348): No heartbeat from core client for 30 sec - exiting
10:57:19 (3348): No heartbeat from core client for 30 sec - exiting
10:57:20 (3348): No heartbeat from core client for 30 sec - exiting
10:57:21 (3348): No heartbeat from core client for 30 sec - exiting
10:57:22 (3348): No heartbeat from core client for 30 sec - exiting
10:57:23 (3348): No heartbeat from core client for 30 sec - exiting
10:57:24 (3348): No heartbeat from core client for 30 sec - exiting
10:57:25 (3348): No heartbeat from core client for 30 sec - exiting
10:57:26 (3348): No heartbeat from core client for 30 sec - exiting
10:57:27 (3348): No heartbeat from core client for 30 sec - exiting
10:57:28 (3348): No heartbeat from core client for 30 sec - exiting
10:57:29 (3348): No heartbeat from core client for 30 sec - exiting
10:57:30 (3348): No heartbeat from core client for 30 sec - exiting
10:57:31 (3348): No heartbeat from core client for 30 sec - exiting
10:57:32 (3348): No heartbeat from core client for 30 sec - exiting
10:57:33 (3348): No heartbeat from core client for 30 sec - exiting
10:57:35 (3348): No heartbeat from core client for 30 sec - exiting
10:57:36 (3348): No heartbeat from core client for 30 sec - exiting
10:57:37 (3348): No heartbeat from core client for 30 sec - exiting
10:57:38 (3348): No heartbeat from core client for 30 sec - exiting
10:57:39 (3348): No heartbeat from core client for 30 sec - exiting
10:57:40 (3348): No heartbeat from core client for 30 sec - exiting
10:57:41 (3348): No heartbeat from core client for 30 sec - exiting
10:57:42 (3348): No heartbeat from core client for 30 sec - exiting
10:57:43 (3348): No heartbeat from core client for 30 sec - exiting
10:57:44 (3348): No heartbeat from core client for 30 sec - exiting
10:57:45 (3348): No heartbeat from core client for 30 sec - exiting
10:57:46 (3348): No heartbeat from core client for 30 sec - exiting
10:57:47 (3348): No heartbeat from core client for 30 sec - exiting
10:57:48 (3348): No heartbeat from core client for 30 sec - exiting
10:57:49 (3348): No heartbeat from core client for 30 sec - exiting
10:57:50 (3348): No heartbeat from core client for 30 sec - exiting
10:57:51 (3348): No heartbeat from core client for 30 sec - exiting
10:57:52 (3348): No heartbeat from core client for 30 sec - exiting
10:57:53 (3348): No heartbeat from core client for 30 sec - exiting
10:57:54 (3348): No heartbeat from core client for 30 sec - exiting
10:57:55 (3348): No heartbeat from core client for 30 sec - exiting
10:57:56 (3348): No heartbeat from core client for 30 sec - exiting
10:57:57 (3348): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...

Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO tmp/xaakg.pipe_dummy 2048
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 6

zip error: Could not create output file (was replacing the original zip file)
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_pnw_bild_1977_1_008032639_0_7.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_bild_1977_1_008032639_0_8.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_bild_1977_1_008032639_0_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_bild_1977_1_008032639_0_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_bild_1977_1_008032639_0_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_bild_1977_1_008032639_0_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
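
The stderr output above consists mostly of a few repeating message types. The following is a minimal Python sketch for tallying them, assuming the log has been saved as plain text. The function name `tally_messages` and the category labels are hypothetical and are not part of any BOINC or CPDN tooling.

```python
import re
from collections import Counter

# Hypothetical category labels mapped to message fragments seen in the log above.
PATTERNS = {
    "suspend": re.compile(r"Suspend request from BOINC"),
    "quit": re.compile(r"Quit request from BOINC"),
    "no_heartbeat": re.compile(r"No heartbeat from core client"),
    "process_not_running": re.compile(r"CPDN process is not running"),
    "model_crash": re.compile(r"Model crash(ed| detected)"),
}

def tally_messages(log_text: str) -> Counter:
    """Count log lines per category; each line counts toward at most one category."""
    counts = Counter()
    for line in log_text.splitlines():
        for label, pattern in PATTERNS.items():
            if pattern.search(line):
                counts[label] += 1
                break
    return counts

# A short excerpt copied from the stderr text above.
sample = """\
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2632, iMonCtr=2
Model crash detected, will try to restart...
15:29:13 (5212): No heartbeat from core client for 30 sec - exiting
"""

print(tally_messages(sample))
```

Run against the full log, a tally like this makes it easier to see how often the task was suspended or stopped compared with how often the model process was found not running.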
Latest Trickles Received
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS)
09 Aug 2012 20:18:14 | 1007897 | 14877917 | hadam3p_pnw_bild_1977_1_008032639_0 | 69,216 | 226,997 | 3.2795
04 Aug 2012 19:28:20 | 1007897 | 14877917 | hadam3p_pnw_bild_1977_1_008032639_0 | 57,696 | 189,854 | 3.2906
28 Jul 2012 20:51:39 | 1007897 | 14877917 | hadam3p_pnw_bild_1977_1_008032639_0 | 46,176 | 151,514 | 3.2812
24 Jul 2012 18:37:39 | 1007897 | 14877917 | hadam3p_pnw_bild_1977_1_008032639_0 | 34,656 | 112,259 | 3.2392
20 Jul 2012 18:46:16 | 1007897 | 14877917 | hadam3p_pnw_bild_1977_1_008032639_0 | 23,136 | 75,538 | 3.2650
14 Jul 2012 18:46:42 | 1007897 | 14877917 | hadam3p_pnw_bild_1977_1_008032639_0 | 11,616 | 37,374 | 3.2175
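
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count. That is an assumption inferred from the figures, not something the page states. A minimal Python sketch that checks it against the six trickle rows above (the helper name `average_sec_per_timestep` is hypothetical):

```python
# (timestep, cpu_time_sec, reported_average) copied from the trickle rows above.
rows = [
    (69216, 226997, 3.2795),
    (57696, 189854, 3.2906),
    (46176, 151514, 3.2812),
    (34656, 112259, 3.2392),
    (23136, 75538, 3.2650),
    (11616, 37374, 3.2175),
]

def average_sec_per_timestep(cpu_time_sec: float, timestep: int) -> float:
    """Assumed definition: cumulative CPU seconds divided by timesteps completed."""
    return cpu_time_sec / timestep

for timestep, cpu_time_sec, reported in rows:
    computed = average_sec_per_timestep(cpu_time_sec, timestep)
    # Compare the recomputed ratio with the value shown on the page.
    print(f"{timestep:>6} steps: computed {computed:.4f}, reported {reported:.4f}")
```

Under this assumption, each recomputed value matches the reported one to within rounding at four decimal places.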