Name | hadcm3n_4goc_1940_40_008306167_1 |
Workunit | 8457302 |
Created | 7 Feb 2013, 10:03:18 UTC |
Sent | 7 Feb 2013, 10:16:38 UTC |
Report deadline | 9 May 2013, 17:43:49 UTC |
Received | 4 Apr 2013, 1:52:41 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1237173 |
Run time | 23 hours 5 min 37 sec |
CPU time | 19 hours 20 min 25 sec |
Validate state | Invalid |
Credit | 622.08 |
Device peak FLOPS | 3.70 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.0.28</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> 17:37:45 (38140): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:37:51 (38140): No heartbeat from core client for 30 sec - exiting 17:37:52 (38140): No heartbeat from core client for 30 sec - exiting 17:37:53 (38140): No heartbeat from core client for 30 sec - exiting 19:50:23 (39564): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:50:25 (39564): No heartbeat from core client for 30 sec - exiting 19:50:26 (39564): No heartbeat from core client for 30 sec - exiting 19:50:27 (39564): No heartbeat from core client for 30 sec - exiting 19:50:28 (39564): No heartbeat from core client for 30 sec - exiting 19:50:29 (39564): No heartbeat from core client for 30 sec - exiting 19:50:30 (39564): No heartbeat from core client for 30 sec - exiting 19:50:31 (39564): No heartbeat from core client for 30 sec - exiting 19:50:32 (39564): No heartbeat from core client for 30 sec - exiting 19:50:33 (39564): No heartbeat from core client for 30 sec - exiting 19:50:34 (39564): No heartbeat from core client for 30 sec - exiting 21:07:05 (40336): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:07:10 (40336): No heartbeat from core client for 30 sec - exiting 21:07:11 (40336): No heartbeat from core client for 30 sec - exiting 21:07:12 (40336): No heartbeat from core client for 30 sec - exiting 21:07:13 (40336): No heartbeat from core client for 30 sec - exiting 21:07:14 (40336): No heartbeat from core client for 30 sec - exiting 21:07:15 (40336): No heartbeat from core client for 30 sec - exiting 21:07:16 (40336): No heartbeat from core client for 30 sec - exiting Atmos Hold Restart file rename failed on atmos_restart.hold 23:25:30 (40196): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:54:40 (37080): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:18:23 (41744): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:18:16 (41188): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:18:21 (41188): No heartbeat from core client for 30 sec - exiting 02:18:22 (41188): No heartbeat from core client for 30 sec - exiting 02:18:23 (41188): No heartbeat from core client for 30 sec - exiting 02:18:24 (41188): No heartbeat from core client for 30 sec - exiting 02:18:25 (41188): No heartbeat from core client for 30 sec - exiting 02:18:26 (41188): No heartbeat from core client for 30 sec - exiting 02:18:27 (41188): No heartbeat from core client for 30 sec - exiting 02:18:28 (41188): No heartbeat from core client for 30 sec - exiting 02:18:29 (41188): No heartbeat from core client for 30 sec - exiting 02:18:30 (41188): No heartbeat from core client for 30 sec - exiting 02:22:10 (40872): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:02:11 (42712): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:02:17 (42712): No heartbeat from core client for 30 sec - exiting 03:02:18 (42712): No heartbeat from core client for 30 sec - exiting 03:02:19 (42712): No heartbeat from core client for 30 sec - exiting 03:02:20 (42712): No heartbeat from core client for 30 sec - exiting 03:02:21 (42712): No heartbeat from core client for 30 sec - exiting 03:02:22 (42712): No heartbeat from core client for 30 sec - exiting 03:02:23 (42712): No heartbeat from core client for 30 sec - exiting 03:02:24 (42712): No heartbeat from core client for 30 sec - exiting 03:02:25 (42712): No heartbeat from core client for 30 sec - exiting 03:02:26 (42712): No heartbeat from core client for 30 sec - exiting Atmos Hold Restart file rename failed on atmos_restart.hold 03:20:15 (42500): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:20:16 (42500): No heartbeat from core client for 30 sec - exiting 03:20:17 (42500): No heartbeat from core client for 30 sec - exiting 03:20:18 (42500): No heartbeat from core client for 30 sec - exiting 03:20:19 (42500): No heartbeat from core client for 30 sec - exiting 03:20:20 (42500): No heartbeat from core client for 30 sec - exiting 03:20:21 (42500): No heartbeat from core client for 30 sec - exiting 03:20:22 (42500): No heartbeat from core client for 30 sec - exiting 03:20:23 (42500): No heartbeat from core client for 30 sec - exiting 03:20:24 (42500): No heartbeat from core client for 30 sec - exiting 03:20:25 (42500): No heartbeat from core client for 30 sec - exiting 04:01:09 (42464): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:01:11 (42464): No heartbeat from core client for 30 sec - exiting 04:06:38 (43816): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 05:20:53 (43736): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:17:45 (41756): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:17:48 (41756): No heartbeat from core client for 30 sec - exiting 07:17:49 (41756): No heartbeat from core client for 30 sec - exiting 07:21:44 (44748): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:27:01 (43436): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:40:31 (43532): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:48:22 (34420): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:48:23 (34420): No heartbeat from core client for 30 sec - exiting 10:15:12 (46184): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:32:18 (45452): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:32:21 (45452): No heartbeat from core client for 30 sec - exiting 10:32:22 (45452): No heartbeat from core client for 30 sec - exiting 10:32:23 (45452): No heartbeat from core client for 30 sec - exiting 10:32:24 (45452): No heartbeat from core client for 30 sec - exiting 10:32:25 (45452): No heartbeat from core client for 30 sec - exiting 10:32:26 (45452): No heartbeat from core client for 30 sec - exiting 10:32:27 (45452): No heartbeat from core client for 30 sec - exiting 10:32:28 (45452): No heartbeat from core client for 30 sec - exiting 10:38:48 (44596): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:38:51 (44596): No heartbeat from core client for 30 sec - exiting 10:43:23 (45508): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:43:31 (45508): No heartbeat from core client for 30 sec - exiting 10:43:32 (45508): No heartbeat from core client for 30 sec - exiting 10:43:33 (45508): No heartbeat from core client for 30 sec - exiting 10:46:05 (41468): No heartbeat from core client for 30 sec - exiting 10:46:10 (41468): No heartbeat from core client for 30 sec - exiting 10:46:11 (41468): No heartbeat from core client for 30 sec - exiting 10:46:12 (41468): No heartbeat from core client for 30 sec - exiting 10:46:13 (41468): No heartbeat from core client for 30 sec - exiting 10:46:14 (41468): No heartbeat from core client for 30 sec - exiting 10:46:15 (41468): No heartbeat from core client for 30 sec - exiting 10:46:16 (41468): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: C I/O Error feof - Unit 63 - Return code = 16 BUFFIN: C I/O Error feof - Unit 64 - Return code = 16 BUFFIN: C I/O Error feof - Unit 65 - Return code = 16 BUFFIN: C I/O Error feof - Unit 66 - Return code = 16 BUFFIN: C I/O Error feof - Unit 67 - Return code = 16 BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 Error converting file to netcdf: dataout/4gocko.pje2c10 Error converting file to netcdf: dataout/4gocko.pie2c10 Error converting file to netcdf: dataout/4gocko.pfe2c10 Error converting file to netcdf: dataout/4gocka.phe2c10 Error converting file to netcdf: dataout/4gocka.pge2c10 Error converting file to netcdf: dataout/4gocka.pee2c10 Error converting file to netcdf: dataout/4gocka.pde2c10 11:10:27 (47028): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:10:28 (47028): No heartbeat from core client for 30 sec - exiting 11:10:29 (47028): No heartbeat from core client for 30 sec - exiting 11:10:30 (47028): No heartbeat from core client for 30 sec - exiting 11:10:31 (47028): No heartbeat from core client for 30 sec - exiting 11:10:32 (47028): No heartbeat from core client for 30 sec - exiting 11:10:33 (47028): No heartbeat from core client for 30 sec - exiting 11:10:34 (47028): No heartbeat from core client for 30 sec - exiting 11:10:35 (47028): No heartbeat from core client for 30 sec - exiting 11:10:36 (47028): No heartbeat from core client for 30 sec - exiting 11:14:54 (45488): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:15:01 (45488): No heartbeat from core client for 30 sec - exiting 11:15:02 (45488): No heartbeat from core client for 30 sec - exiting 11:15:03 (45488): No heartbeat from core client for 30 sec - exiting 11:15:04 (45488): No heartbeat from core client for 30 sec - exiting 11:15:05 (45488): No heartbeat from core client for 30 sec - exiting 11:20:33 (45524): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:45:25 (46888): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:08:27 (47588): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:50:19 (45356): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:41:49 (48440): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:41:57 (48440): No heartbeat from core client for 30 sec - exiting 14:41:58 (48440): No heartbeat from core client for 30 sec - exiting 15:53:44 (46748): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:58:34 (48860): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:22:49 (43108): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:28:03 (49580): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:31:56 (48248): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:35:49 (51528): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:38:48 (51304): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 18:49:01 (51572): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:28:59 (52172): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:47:43 (52600): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 04:01:59 (53880): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 05:08:30 (53828): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 05:18:21 (56016): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:22:21 (53076): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:21:11 (54744): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:50:27 (54492): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:06:16 (56584): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:00:27 (56736): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:45:49 (56668): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:21:36 (55988): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:05:11 (59000): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:47:42 (57796): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:55:41 (59560): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:06:10 (56860): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:13:11 (56480): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:21:26 (59740): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:30:18 (58792): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:02:49 (58816): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:10:11 (60988): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:19:54 (61308): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=59960, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=59960, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3692, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=21588, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=21588, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=21588, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
30 Mar 2013 15:45:40 | 1237173 | 15592803 | hadcm3n_4goc_1940_40_008306167_1 | 51,840 | 53,925 | 1.0402 |
30 Mar 2013 06:23:39 | 1237173 | 15592803 | hadcm3n_4goc_1940_40_008306167_1 | 25,920 | 27,443 | 1.0588 |
©2024 climateprediction.net