Name | hadcm3n_o4xw_1980_40_007538195_1 |
Workunit | 7735427 |
Created | 5 Nov 2011, 19:34:35 UTC |
Sent | 7 Nov 2011, 8:47:31 UTC |
Report deadline | 6 Feb 2012, 16:14:42 UTC |
Received | 5 Feb 2012, 2:38:06 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 193 (0x000000C1) EXIT_SIGNAL |
Computer ID | 1230193 |
Run time | 33 days 22 hours 43 min 7 sec |
CPU time | 30 days 12 hours 27 min 52 sec |
Validate state | Invalid |
Credit | 12,441.60 |
Device peak FLOPS | 1.56 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>6.12.34</core_client_version> <![CDATA[ <message> - exit code 193 (0xc1) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 11:18:07 (4848): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 11:16:59 (1268): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7120, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 04:24:32 (7392): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 12:59:08 (172): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:59:09 (172): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5996, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 02:02:31 (8892): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 01:56:27 (4464): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7184, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5940, iMonCtr=1 Model crash detected, will try to restart... 23:11:46 (3944): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11832, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11832, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11832, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 15:08:22 (3240): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Signal 11 received, exiting... Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
05 Feb 2012 01:17:02 | 1068203 | 13605859 | hadcm3n_o4xw_1980_40_007538195_1 | 1,036,800 | 2,636,771 | 2.5432 |
03 Feb 2012 16:54:07 | 1068203 | 13605859 | hadcm3n_o4xw_1980_40_007538195_1 | 1,010,880 | 2,565,981 | 2.5384 |
02 Feb 2012 13:22:56 | 1068203 | 13605859 | hadcm3n_o4xw_1980_40_007538195_1 | 984,960 | 2,495,469 | 2.5336 |
23 Jan 2012 06:59:48 | 1068203 | 13605859 | hadcm3n_o4xw_1980_40_007538195_1 | 959,040 | 2,443,835 | 2.5482 |
22 Jan 2012 13:47:11 | 1068203 | 13605859 | hadcm3n_o4xw_1980_40_007538195_1 | 933,120 | 2,404,446 | 2.5768 |
21 Jan 2012 22:03:37 | 1068203 | 13605859 | hadcm3n_o4xw_1980_40_007538195_1 | 907,200 | 2,362,387 | 2.6040 |
21 Jan 2012 05:29:50 | 1068203 | 13605859 | hadcm3n_o4xw_1980_40_007538195_1 | 881,280 | 2,316,570 | 2.6286 |
19 Jan 2012 19:15:39 | 1068203 | 13605859 | hadcm3n_o4xw_1980_40_007538195_1 | 855,360 | 2,271,566 | 2.6557 |
18 Jan 2012 22:19:16 | 1068203 | 13605859 | hadcm3n_o4xw_1980_40_007538195_1 | 829,440 | 2,246,492 | 2.7084 |
18 Jan 2012 08:59:12 | 1068203 | 13605859 | hadcm3n_o4xw_1980_40_007538195_1 | 803,520 | 2,200,177 | 2.7382 |
12 Jan 2012 03:31:48 | 1068203 | 13605859 | hadcm3n_o4xw_1980_40_007538195_1 | 777,600 | 2,146,840 | 2.7609 |
11 Jan 2012 01:57:21 | 1068203 | 13605859 | hadcm3n_o4xw_1980_40_007538195_1 | 751,680 | 2,090,657 | 2.7813 |
09 Jan 2012 08:23:13 | 1068203 | 13605859 | hadcm3n_o4xw_1980_40_007538195_1 | 725,760 | 2,044,461 | 2.8170 |
08 Jan 2012 19:24:02 | 1068203 | 13605859 | hadcm3n_o4xw_1980_40_007538195_1 | 699,840 | 1,999,516 | 2.8571 |
05 Jan 2012 20:19:24 | 1068203 | 13605859 | hadcm3n_o4xw_1980_40_007538195_1 | 673,920 | 1,953,140 | 2.8982 |
05 Jan 2012 05:13:38 | 1068203 | 13605859 | hadcm3n_o4xw_1980_40_007538195_1 | 648,000 | 1,900,470 | 2.9328 |
04 Jan 2012 13:47:21 | 1068203 | 13605859 | hadcm3n_o4xw_1980_40_007538195_1 | 622,080 | 1,847,562 | 2.9700 |
03 Jan 2012 21:02:36 | 1068203 | 13605859 | hadcm3n_o4xw_1980_40_007538195_1 | 596,160 | 1,789,460 | 3.0016 |
02 Jan 2012 17:37:11 | 1068203 | 13605859 | hadcm3n_o4xw_1980_40_007538195_1 | 570,240 | 1,726,789 | 3.0282 |
01 Jan 2012 10:57:44 | 1068203 | 13605859 | hadcm3n_o4xw_1980_40_007538195_1 | 544,320 | 1,660,410 | 3.0504 |
©2024 cpdn.org