Name | hadam3p_pnw_bild_1977_1_008032639_0 |
Workunit | 8187753 |
Created | 8 Jul 2012, 17:16:10 UTC |
Sent | 8 Jul 2012, 17:16:32 UTC |
Report deadline | 20 Jun 2013, 22:36:32 UTC |
Received | 11 Aug 2012, 18:40:48 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1007897 |
Run time | 2 days 17 hours 36 min 33 sec |
CPU time | 2 days 15 hours 16 min 42 sec |
Validate state | Invalid |
Credit | 1,503.98 |
Device peak FLOPS | 2.39 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Pacific North West v6.09 windows_intelx86 |
Stderr | <core_client_version>6.12.34</core_client_version> <![CDATA[ <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2632, iMonCtr=2 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=9524, selfPID=9524, iMonCtr=2 Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CSuspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4036, selfPID=4036, iMonCtr=2 Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4428, iMonCtr=2 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8184, selfPID=8184, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6664, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2728, selfPID=2728, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8600, selfPID=8600, iMonCtr=2 Suspended CPDN Monitor - Suspend request from BOINC... 15:29:13 (5212): No heartbeat from core client for 30 sec - exiting 15:29:14 (5212): No heartbeat from core client for 30 sec - exiting 15:29:15 (5212): No heartbeat from core client for 30 sec - exiting 15:29:16 (5212): No heartbeat from core client for 30 sec - exiting 15:29:17 (5212): No heartbeat from core client for 30 sec - exiting 15:29:18 (5212): No heartbeat from core client for 30 sec - exiting 15:29:19 (5212): No heartbeat from core client for 30 sec - exiting 15:29:20 (5212): No heartbeat from core client for 30 sec - exiting 15:29:21 (5212): No heartbeat from core client for 30 sec - exiting 15:29:22 (5212): No heartbeat from core client for 30 sec - exiting 15:29:23 (5212): No heartbeat from core client for 30 sec - exiting 15:29:24 (5212): No heartbeat from core client for 30 sec - exiting 15:29:25 (5212): No heartbeat from core client for 30 sec - exiting 15:29:26 (5212): No heartbeat from core client for 30 sec - exiting 15:29:27 (5212): No heartbeat from core client for 30 sec - exiting 15:29:28 (5212): No heartbeat from core client for 30 sec - exiting 15:29:29 (5212): No heartbeat from core client for 30 sec - exiting 15:29:30 (5212): No heartbeat from core client for 30 sec - exiting 15:29:31 (5212): No heartbeat from core client for 30 sec - exiting 15:29:32 (5212): No heartbeat from core client for 30 sec - exiting 15:29:33 (5212): No heartbeat from core client for 30 sec - exiting 15:29:34 (5212): No heartbeat from core client for 30 sec - exiting 15:29:35 (5212): No heartbeat from core client for 30 sec - exiting 15:29:36 (5212): No heartbeat from core client for 30 sec - exiting 15:29:37 (5212): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4772, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 14:57:40 (4668): No heartbeat from core client for 30 sec - exiting 14:57:41 (4668): No heartbeat from core client for 30 sec - exiting 14:57:42 (4668): No heartbeat from core client for 30 sec - exiting 14:57:43 (4668): No heartbeat from core client for 30 sec - exiting 14:57:44 (4668): No heartbeat from core client for 30 sec - exiting 14:57:45 (4668): No heartbeat from core client for 30 sec - exiting 14:57:46 (4668): No heartbeat from core client for 30 sec - exiting 14:57:47 (4668): No heartbeat from core client for 30 sec - exiting 14:57:48 (4668): No heartbeat from core client for 30 sec - exiting 14:57:49 (4668): No heartbeat from core client for 30 sec - exiting 14:57:50 (4668): No heartbeat from core client for 30 sec - exiting 14:57:51 (4668): No heartbeat from core client for 30 sec - exiting 14:57:52 (4668): No heartbeat from core client for 30 sec - exiting 14:57:53 (4668): No heartbeat from core client for 30 sec - exiting 14:57:54 (4668): No heartbeat from core client for 30 sec - exiting 14:57:55 (4668): No heartbeat from core client for 30 sec - exiting 14:57:56 (4668): No heartbeat from core client for 30 sec - exiting 14:57:57 (4668): No heartbeat from core client for 30 sec - exiting 14:57:58 (4668): No heartbeat from core client for 30 sec - exiting 14:57:59 (4668): No heartbeat from core client for 30 sec - exiting 14:58:00 (4668): No heartbeat from core client for 30 sec - exiting 14:58:01 (4668): No heartbeat from core client for 30 sec - exiting 14:58:02 (4668): No heartbeat from core client for 30 sec - exiting 14:58:03 (4668): No heartbeat from core client for 30 sec - exiting 14:58:04 (4668): No heartbeat from core client for 30 sec - exiting 14:58:05 (4668): No heartbeat from core client for 30 sec - exiting 14:58:06 (4668): No heartbeat from core client for 30 sec - exiting 14:58:07 (4668): No heartbeat from core client for 30 sec - exiting 14:58:08 (4668): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6496, selfPID=6496, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2516, selfPID=2516, iMonCtr=2 16:07:46 (7888): No heartbeat from core client for 30 sec - exiting 16:07:47 (7888): No heartbeat from core client for 30 sec - exiting 16:07:48 (7888): No heartbeat from core client for 30 sec - exiting 16:07:49 (7888): No heartbeat from core client for 30 sec - exiting 16:07:50 (7888): No heartbeat from core client for 30 sec - exiting 16:07:51 (7888): No heartbeat from core client for 30 sec - exiting 16:07:52 (7888): No heartbeat from core client for 30 sec - exiting 16:07:53 (7888): No heartbeat from core client for 30 sec - exiting 16:07:54 (7888): No heartbeat from core client for 30 sec - exiting 16:07:55 (7888): No heartbeat from core client for 30 sec - exiting 16:07:56 (7888): No heartbeat from core client for 30 sec - exiting 16:07:57 (7888): No heartbeat from core client for 30 sec - exiting 16:07:58 (7888): No heartbeat from core client for 30 sec - exiting 16:07:59 (7888): No heartbeat from core client for 30 sec - exiting 16:08:00 (7888): No heartbeat from core client for 30 sec - exiting 16:08:01 (7888): No heartbeat from core client for 30 sec - exiting 16:08:02 (7888): No heartbeat from core client for 30 sec - exiting 16:08:03 (7888): No heartbeat from core client for 30 sec - exiting 16:08:04 (7888): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 10:56:51 (3348): No heartbeat from core client for 30 sec - exiting 10:56:52 (3348): No heartbeat from core client for 30 sec - exiting 10:56:53 (3348): No heartbeat from core client for 30 sec - exiting 10:56:54 (3348): No heartbeat from core client for 30 sec - exiting 10:56:55 (3348): No heartbeat from core client for 30 sec - exiting 10:56:56 (3348): No heartbeat from core client for 30 sec - exiting 10:56:57 (3348): No heartbeat from core client for 30 sec - exiting 10:56:58 (3348): No heartbeat from core client for 30 sec - exiting Regional yearly means requires 12 input files got 6 10:56:59 (3348): No heartbeat from core client for 30 sec - exiting 10:57:00 (3348): No heartbeat from core client for 30 sec - exiting 10:57:01 (3348): No heartbeat from core client for 30 sec - exiting 10:57:02 (3348): No heartbeat from core client for 30 sec - exiting 10:57:03 (3348): No heartbeat from core client for 30 sec - exiting 10:57:04 (3348): No heartbeat from core client for 30 sec - exiting 10:57:05 (3348): No heartbeat from core client for 30 sec - exiting 10:57:06 (3348): No heartbeat from core client for 30 sec - exiting 10:57:07 (3348): No heartbeat from core client for 30 sec - exiting 10:57:08 (3348): No heartbeat from core client for 30 sec - exiting 10:57:09 (3348): No heartbeat from core client for 30 sec - exiting 10:57:10 (3348): No heartbeat from core client for 30 sec - exiting 10:57:11 (3348): No heartbeat from core client for 30 sec - exiting 10:57:12 (3348): No heartbeat from core client for 30 sec - exiting 10:57:13 (3348): No heartbeat from core client for 30 sec - exiting 10:57:14 (3348): No heartbeat from core client for 30 sec - exiting 10:57:15 (3348): No heartbeat from core client for 30 sec - exiting 10:57:16 (3348): No heartbeat from core client for 30 sec - exiting 10:57:17 (3348): No heartbeat from core client for 30 sec - exiting 10:57:18 (3348): No heartbeat from core client for 30 sec - exiting 10:57:19 (3348): No heartbeat from core client for 30 sec - exiting 10:57:20 (3348): No heartbeat from core client for 30 sec - exiting 10:57:21 (3348): No heartbeat from core client for 30 sec - exiting 10:57:22 (3348): No heartbeat from core client for 30 sec - exiting 10:57:23 (3348): No heartbeat from core client for 30 sec - exiting 10:57:24 (3348): No heartbeat from core client for 30 sec - exiting 10:57:25 (3348): No heartbeat from core client for 30 sec - exiting 10:57:26 (3348): No heartbeat from core client for 30 sec - exiting 10:57:27 (3348): No heartbeat from core client for 30 sec - exiting 10:57:28 (3348): No heartbeat from core client for 30 sec - exiting 10:57:29 (3348): No heartbeat from core client for 30 sec - exiting 10:57:30 (3348): No heartbeat from core client for 30 sec - exiting 10:57:31 (3348): No heartbeat from core client for 30 sec - exiting 10:57:32 (3348): No heartbeat from core client for 30 sec - exiting 10:57:33 (3348): No heartbeat from core client for 30 sec - exiting 10:57:35 (3348): No heartbeat from core client for 30 sec - exiting 10:57:36 (3348): No heartbeat from core client for 30 sec - exiting 10:57:37 (3348): No heartbeat from core client for 30 sec - exiting 10:57:38 (3348): No heartbeat from core client for 30 sec - exiting 10:57:39 (3348): No heartbeat from core client for 30 sec - exiting 10:57:40 (3348): No heartbeat from core client for 30 sec - exiting 10:57:41 (3348): No heartbeat from core client for 30 sec - exiting 10:57:42 (3348): No heartbeat from core client for 30 sec - exiting 10:57:43 (3348): No heartbeat from core client for 30 sec - exiting 10:57:44 (3348): No heartbeat from core client for 30 sec - exiting 10:57:45 (3348): No heartbeat from core client for 30 sec - exiting 10:57:46 (3348): No heartbeat from core client for 30 sec - exiting 10:57:47 (3348): No heartbeat from core client for 30 sec - exiting 10:57:48 (3348): No heartbeat from core client for 30 sec - exiting 10:57:49 (3348): No heartbeat from core client for 30 sec - exiting 10:57:50 (3348): No heartbeat from core client for 30 sec - exiting 10:57:51 (3348): No heartbeat from core client for 30 sec - exiting 10:57:52 (3348): No heartbeat from core client for 30 sec - exiting 10:57:53 (3348): No heartbeat from core client for 30 sec - exiting 10:57:54 (3348): No heartbeat from core client for 30 sec - exiting 10:57:55 (3348): No heartbeat from core client for 30 sec - exiting 10:57:56 (3348): No heartbeat from core client for 30 sec - exiting 10:57:57 (3348): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO tmp/xaakg.pipe_dummy 2048 Leaving CPDN_Main::Monitor... Regional yearly means requires 12 input files got 6 zip error: Could not create output file (was replacing the original zip file) Called boinc_finish </stderr_txt> <message> upload failure: <file_xfer_error> <file_name>hadam3p_pnw_bild_1977_1_008032639_0_7.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_bild_1977_1_008032639_0_8.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_bild_1977_1_008032639_0_9.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_bild_1977_1_008032639_0_10.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_bild_1977_1_008032639_0_11.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_bild_1977_1_008032639_0_12.zip</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
09 Aug 2012 20:18:14 | 1007897 | 14877917 | hadam3p_pnw_bild_1977_1_008032639_0 | 69,216 | 226,997 | 3.2795 |
04 Aug 2012 19:28:20 | 1007897 | 14877917 | hadam3p_pnw_bild_1977_1_008032639_0 | 57,696 | 189,854 | 3.2906 |
28 Jul 2012 20:51:39 | 1007897 | 14877917 | hadam3p_pnw_bild_1977_1_008032639_0 | 46,176 | 151,514 | 3.2812 |
24 Jul 2012 18:37:39 | 1007897 | 14877917 | hadam3p_pnw_bild_1977_1_008032639_0 | 34,656 | 112,259 | 3.2392 |
20 Jul 2012 18:46:16 | 1007897 | 14877917 | hadam3p_pnw_bild_1977_1_008032639_0 | 23,136 | 75,538 | 3.2650 |
14 Jul 2012 18:46:42 | 1007897 | 14877917 | hadam3p_pnw_bild_1977_1_008032639_0 | 11,616 | 37,374 | 3.2175 |
©2024 cpdn.org