Name | hadcm3n_zlpn_1880_40_008249695_2 |
Workunit | 8404819 |
Created | 3 Dec 2012, 22:42:27 UTC |
Sent | 3 Dec 2012, 22:42:33 UTC |
Report deadline | 5 Mar 2013, 6:09:44 UTC |
Received | 12 Feb 2013, 16:39:52 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 193 (0x000000C1) EXIT_SIGNAL |
Computer ID | 1096733 |
Run time | 8 days 19 hours 30 min 27 sec |
CPU time | 6 days 18 hours 45 min 48 sec |
Validate state | Invalid |
Credit | 3,110.40 |
Device peak FLOPS | 2.06 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.0.28</core_client_version> <![CDATA[ <message> - exit code 193 (0xc1) </message> <stderr_txt> Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1268, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5888, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5340, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... 18:59:47 (5632): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:24:21 (5820): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 11:55:31 (1368): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6244, iMonCtr=1 Model crash detected, will try to restart... 14:38:30 (3996): No heartbeat from core client for 30 sec - exiting 14:38:31 (3996): No heartbeat from core client for 30 sec - exiting 14:38:32 (3996): No heartbeat from core client for 30 sec - exiting 14:38:33 (3996): No heartbeat from core client for 30 sec - exiting 14:38:34 (3996): No heartbeat from core client for 30 sec - exiting 14:38:35 (3996): No heartbeat from core client for 30 sec - exiting 14:38:36 (3996): No heartbeat from core client for 30 sec - exiting 14:38:37 (3996): No heartbeat from core client for 30 sec - exiting 14:38:38 (3996): No heartbeat from core client for 30 sec - exiting 14:38:39 (3996): No heartbeat from core client for 30 sec - exiting 14:38:40 (3996): No heartbeat from core client for 30 sec - exiting 14:38:41 (3996): No heartbeat from core client for 30 sec - exiting 14:38:42 (3996): No heartbeat from core client for 30 sec - exiting 14:38:43 (3996): No heartbeat from core client for 30 sec - exiting 14:38:44 (3996): No heartbeat from core client for 30 sec - exiting 14:38:45 (3996): No heartbeat from core client for 30 sec - exiting 14:38:46 (3996): No heartbeat from core client for 30 sec - exiting 14:38:47 (3996): No heartbeat from core client for 30 sec - exiting 14:38:48 (3996): No heartbeat from core client for 30 sec - exiting 14:38:49 (3996): No heartbeat from core client for 30 sec - exiting 14:38:50 (3996): No heartbeat from core client for 30 sec - exiting 14:38:51 (3996): No heartbeat from core client for 30 sec - exiting 14:38:52 (3996): No heartbeat from core client for 30 sec - exiting 14:38:53 (3996): No heartbeat from core client for 30 sec - exiting 14:38:54 (3996): No heartbeat from core client for 30 sec - exiting 14:38:55 (3996): No heartbeat from core client for 30 sec - exiting 14:38:56 (3996): No heartbeat from core client for 30 sec - exiting 14:38:57 (3996): No heartbeat from core client for 30 sec - exiting 14:38:58 (3996): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:43:16 (992): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:35:47 (5604): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:47:53 (6044): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 10:53:47 (6012): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 16:22:08 (5880): No heartbeat from core client for 30 sec - exiting 16:22:09 (5880): No heartbeat from core client for 30 sec - exiting 16:22:10 (5880): No heartbeat from core client for 30 sec - exiting 16:22:11 (5880): No heartbeat from core client for 30 sec - exiting 16:22:12 (5880): No heartbeat from core client for 30 sec - exiting 16:22:13 (5880): No heartbeat from core client for 30 sec - exiting 16:22:14 (5880): No heartbeat from core client for 30 sec - exiting 16:22:15 (5880): No heartbeat from core client for 30 sec - exiting 16:22:16 (5880): No heartbeat from core client for 30 sec - exiting 16:22:17 (5880): No heartbeat from core client for 30 sec - exiting 16:22:18 (5880): No heartbeat from core client for 30 sec - exiting 16:22:19 (5880): No heartbeat from core client for 30 sec - exiting 16:22:20 (5880): No heartbeat from core client for 30 sec - exiting 16:22:21 (5880): No heartbeat from core client for 30 sec - exiting 16:22:22 (5880): No heartbeat from core client for 30 sec - exiting 16:22:23 (5880): No heartbeat from core client for 30 sec - exiting 16:22:24 (5880): No heartbeat from core client for 30 sec - exiting 16:22:25 (5880): No heartbeat from core client for 30 sec - exiting 16:22:26 (5880): No heartbeat from core client for 30 sec - exiting 16:22:27 (5880): No heartbeat from core client for 30 sec - exiting 16:22:28 (5880): No heartbeat from core client for 30 sec - exiting 16:22:29 (5880): No heartbeat from core client for 30 sec - exiting 16:22:30 (5880): No heartbeat from core client for 30 sec - exiting 16:22:31 (5880): No heartbeat from core client for 30 sec - exiting 16:22:32 (5880): No heartbeat from core client for 30 sec - exiting 16:22:33 (5880): No heartbeat from core client for 30 sec - exiting 16:22:34 (5880): No heartbeat from core client for 30 sec - exiting 16:22:35 (5880): No heartbeat from core client for 30 sec - exiting 16:22:36 (5880): No heartbeat from core client for 30 sec - exiting 16:22:37 (5880): No heartbeat from core client for 30 sec - exiting 16:22:38 (5880): No heartbeat from core client for 30 sec - exiting 16:22:39 (5880): No heartbeat from core client for 30 sec - exiting 16:22:40 (5880): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:53:47 (5776): No heartbeat from core client for 30 sec - exiting 12:53:48 (5776): No heartbeat from core client for 30 sec - exiting 12:53:49 (5776): No heartbeat from core client for 30 sec - exiting 12:53:50 (5776): No heartbeat from core client for 30 sec - exiting 12:53:51 (5776): No heartbeat from core client for 30 sec - exiting 12:53:52 (5776): No heartbeat from core client for 30 sec - exiting 12:53:53 (5776): No heartbeat from core client for 30 sec - exiting 12:53:54 (5776): No heartbeat from core client for 30 sec - exiting 12:53:55 (5776): No heartbeat from core client for 30 sec - exiting 12:53:56 (5776): No heartbeat from core client for 30 sec - exiting 12:53:57 (5776): No heartbeat from core client for 30 sec - exiting 12:53:58 (5776): No heartbeat from core client for 30 sec - exiting 12:53:59 (5776): No heartbeat from core client for 30 sec - exiting 12:54:00 (5776): No heartbeat from core client for 30 sec - exiting 12:54:01 (5776): No heartbeat from core client for 30 sec - exiting 12:54:02 (5776): No heartbeat from core client for 30 sec - exiting 12:54:03 (5776): No heartbeat from core client for 30 sec - exiting 12:54:04 (5776): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:54:05 (5776): No heartbeat from core client for 30 sec - exiting 12:54:06 (5776): No heartbeat from core client for 30 sec - exiting Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5408, iMonCtr=1 Model crash detected, will try to restart... 19:20:47 (724): No heartbeat from core client for 30 sec - exiting 19:20:49 (724): No heartbeat from core client for 30 sec - exiting 19:20:50 (724): No heartbeat from core client for 30 sec - exiting 19:20:51 (724): No heartbeat from core client for 30 sec - exiting 19:20:52 (724): No heartbeat from core client for 30 sec - exiting 19:20:53 (724): No heartbeat from core client for 30 sec - exiting 19:20:54 (724): No heartbeat from core client for 30 sec - exiting 19:20:55 (724): No heartbeat from core client for 30 sec - exiting 19:20:56 (724): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x77257373 read attempt to address 0x40B926AE Engaging BOINC Windows Runtime Debugger... Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x77183AB3 read attempt to address 0x40B926B6 Engaging BOINC Windows Runtime Debugger... Cannot serialize file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_zlpn_1880_40_008249695/dataout/shmem_restart.day Signal 11 received, exiting... Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
11 Feb 2013 16:51:03 | 1096733 | 15469904 | hadcm3n_zlpn_1880_40_008249695_2 | 259,200 | 556,191 | 2.1458 |
10 Feb 2013 14:35:38 | 1096733 | 15469904 | hadcm3n_zlpn_1880_40_008249695_2 | 233,280 | 496,257 | 2.1273 |
31 Dec 2012 23:09:13 | 1096733 | 15469904 | hadcm3n_zlpn_1880_40_008249695_2 | 207,360 | 437,047 | 2.1077 |
29 Dec 2012 14:26:56 | 1096733 | 15469904 | hadcm3n_zlpn_1880_40_008249695_2 | 181,440 | 375,607 | 2.0701 |
27 Dec 2012 21:31:20 | 1096733 | 15469904 | hadcm3n_zlpn_1880_40_008249695_2 | 155,520 | 318,783 | 2.0498 |
25 Dec 2012 23:01:38 | 1096733 | 15469904 | hadcm3n_zlpn_1880_40_008249695_2 | 129,600 | 265,909 | 2.0518 |
16 Dec 2012 16:41:15 | 1096733 | 15469904 | hadcm3n_zlpn_1880_40_008249695_2 | 103,680 | 209,162 | 2.0174 |
14 Dec 2012 20:04:04 | 1096733 | 15469904 | hadcm3n_zlpn_1880_40_008249695_2 | 77,760 | 156,086 | 2.0073 |
13 Dec 2012 19:01:39 | 1096733 | 15469904 | hadcm3n_zlpn_1880_40_008249695_2 | 51,840 | 104,212 | 2.0103 |
13 Dec 2012 19:01:39 | 1096733 | 15469904 | hadcm3n_zlpn_1880_40_008249695_2 | 25,920 | 52,495 | 2.0253 |
©2024 cpdn.org