Name | hadcm3n_4827_1940_40_008310479_0 |
Workunit | 8461614 |
Created | 8 Feb 2013, 1:01:29 UTC |
Sent | 9 Feb 2013, 3:31:09 UTC |
Report deadline | 11 May 2013, 10:58:20 UTC |
Received | 4 Apr 2013, 15:42:48 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1237173 |
Run time | 15 hours 13 min 1 sec |
CPU time | 10 hours 41 min 56 sec |
Validate state | Invalid |
Credit | 311.04 |
Device peak FLOPS | 3.70 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.0.28</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> 17:37:45 (37072): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:37:51 (37072): No heartbeat from core client for 30 sec - exiting 17:37:52 (37072): No heartbeat from core client for 30 sec - exiting 19:50:22 (38960): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:50:25 (38960): No heartbeat from core client for 30 sec - exiting 19:50:26 (38960): No heartbeat from core client for 30 sec - exiting 19:50:27 (38960): No heartbeat from core client for 30 sec - exiting 19:50:28 (38960): No heartbeat from core client for 30 sec - exiting 19:50:29 (38960): No heartbeat from core client for 30 sec - exiting 19:50:30 (38960): No heartbeat from core client for 30 sec - exiting 19:50:31 (38960): No heartbeat from core client for 30 sec - exiting 19:50:32 (38960): No heartbeat from core client for 30 sec - exiting 19:50:33 (38960): No heartbeat from core client for 30 sec - exiting 19:50:34 (38960): No heartbeat from core client for 30 sec - exiting Atmos Hold Restart file rename failed on atmos_restart.hold 21:07:05 (40588): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
21:07:06 (40588): No heartbeat from core client for 30 sec - exiting 21:07:07 (40588): No heartbeat from core client for 30 sec - exiting 21:07:08 (40588): No heartbeat from core client for 30 sec - exiting 21:07:09 (40588): No heartbeat from core client for 30 sec - exiting 21:07:10 (40588): No heartbeat from core client for 30 sec - exiting 21:07:11 (40588): No heartbeat from core client for 30 sec - exiting 21:07:12 (40588): No heartbeat from core client for 30 sec - exiting 21:07:13 (40588): No heartbeat from core client for 30 sec - exiting 21:07:14 (40588): No heartbeat from core client for 30 sec - exiting 21:07:15 (40588): No heartbeat from core client for 30 sec - exiting 23:25:30 (39076): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:54:41 (41936): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:18:22 (41116): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:18:16 (37984): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:18:21 (37984): No heartbeat from core client for 30 sec - exiting 02:18:22 (37984): No heartbeat from core client for 30 sec - exiting 02:18:23 (37984): No heartbeat from core client for 30 sec - exiting 02:18:24 (37984): No heartbeat from core client for 30 sec - exiting 02:18:25 (37984): No heartbeat from core client for 30 sec - exiting 02:18:26 (37984): No heartbeat from core client for 30 sec - exiting 02:18:27 (37984): No heartbeat from core client for 30 sec - exiting 02:18:28 (37984): No heartbeat from core client for 30 sec - exiting 02:18:29 (37984): No heartbeat from core client for 30 sec - exiting 02:18:30 (37984): No heartbeat from core client for 30 sec - exiting 02:22:10 (39296): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
03:02:11 (41572): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:02:17 (41572): No heartbeat from core client for 30 sec - exiting 03:02:18 (41572): No heartbeat from core client for 30 sec - exiting 03:02:19 (41572): No heartbeat from core client for 30 sec - exiting 03:02:20 (41572): No heartbeat from core client for 30 sec - exiting 03:02:21 (41572): No heartbeat from core client for 30 sec - exiting 03:02:22 (41572): No heartbeat from core client for 30 sec - exiting 03:02:23 (41572): No heartbeat from core client for 30 sec - exiting 03:02:24 (41572): No heartbeat from core client for 30 sec - exiting 03:02:25 (41572): No heartbeat from core client for 30 sec - exiting 03:02:26 (41572): No heartbeat from core client for 30 sec - exiting 03:20:16 (42672): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Atmos Hold Restart file rename failed on atmos_restart.hold 04:01:08 (39108): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:01:09 (39108): No heartbeat from core client for 30 sec - exiting 04:01:10 (39108): No heartbeat from core client for 30 sec - exiting Atmos Hold Restart file rename failed on atmos_restart.hold 04:06:38 (44028): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:06:39 (44028): No heartbeat from core client for 30 sec - exiting 04:06:40 (44028): No heartbeat from core client for 30 sec - exiting 04:06:41 (44028): No heartbeat from core client for 30 sec - exiting 04:06:42 (44028): No heartbeat from core client for 30 sec - exiting 04:06:43 (44028): No heartbeat from core client for 30 sec - exiting 04:06:44 (44028): No heartbeat from core client for 30 sec - exiting Atmos Hold Restart file rename failed on atmos_restart.hold 05:20:53 (37404): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
05:20:54 (37404): No heartbeat from core client for 30 sec - exiting 05:20:55 (37404): No heartbeat from core client for 30 sec - exiting 05:20:56 (37404): No heartbeat from core client for 30 sec - exiting 05:20:57 (37404): No heartbeat from core client for 30 sec - exiting Atmos Hold Restart file rename failed on atmos_restart.hold 07:17:45 (43960): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 07:27:01 (45788): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:40:31 (45480): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:48:22 (45872): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:15:12 (43364): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:32:18 (46784): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:38:47 (41688): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:43:23 (46032): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:46:05 (47004): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:10:27 (39536): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:14:54 (45116): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:20:33 (44920): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
12:45:26 (46416): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:08:27 (45468): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:50:19 (45532): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:41:49 (49600): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:53:44 (48196): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:58:34 (45988): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:22:49 (49484): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:28:03 (50804): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:31:56 (50708): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:35:49 (52124): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:38:48 (50016): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=30732, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=30732, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=30732, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... 
Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=30732, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=30732, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=30732, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
30 Mar 2013 08:24:15 | 1237173 | 15597790 | hadcm3n_4827_1940_40_008310479_0 | 25,920 | 27,716 | 1.0693 |
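
The Average (sec/TS) figure in the trickle row above appears to be the CPU time divided by the timestep count. The page does not say so, so treat this as an assumption; the numbers in the row are at least consistent with it. A minimal sketch of that arithmetic, where `seconds_per_timestep` is a hypothetical helper and not part of BOINC or the CPDN software:

```python
def seconds_per_timestep(cpu_time_sec: float, timesteps: int) -> float:
    """Average CPU seconds spent per model timestep.

    Assumption: the "Average (sec/TS)" column is CPU Time / Timestep.
    """
    if timesteps <= 0:
        raise ValueError("timesteps must be positive")
    return cpu_time_sec / timesteps


# Values taken from the trickle row: 27,716 CPU seconds over 25,920 timesteps.
average = seconds_per_timestep(27_716, 25_920)
print(f"{average:.4f}")  # prints 1.0693
```

Rounded to four decimal places, the result matches the 1.0693 reported in the table.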
©2024 climateprediction.net