Name | hadcm3n_zk2s_1960_40_008370074_0 |
Workunit | 8520933 |
Created | 20 May 2013, 16:38:53 UTC |
Sent | 20 May 2013, 16:39:04 UTC |
Report deadline | 20 Aug 2013, 0:06:15 UTC |
Received | 24 Jun 2013, 11:30:04 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | -1073741819 (0xC0000005) STATUS_ACCESS_VIOLATION |
Computer ID | 1129146 |
Run time | 5 days 7 hours 32 min 47 sec |
CPU time | 4 days 18 hours 55 min 21 sec |
Validate state | Invalid |
Credit | 3,110.40 |
Device peak FLOPS | 3.25 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>6.10.58</core_client_version> <![CDATA[ <message> - exit code -1073741819 (0xc0000005) </message> <stderr_txt> 14:18:39 (5380): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5400, iMonCtr=1 Model crash detected, will try to restart... 18:20:55 (5852): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:15:57 (4388): No heartbeat from core client for 30 sec - exiting 16:17:10 (4388): No heartbeat from core client for 30 sec - exiting 16:17:12 (4388): No heartbeat from core client for 30 sec - exiting 16:17:13 (4388): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:23:17 (2784): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:42:51 (4124): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:52:50 (1680): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:52:51 (1680): No heartbeat from core client for 30 sec - exiting 16:52:52 (1680): No heartbeat from core client for 30 sec - exiting 16:52:53 (1680): No heartbeat from core client for 30 sec - exiting 16:52:56 (1680): No heartbeat from core client for 30 sec - exiting 16:52:59 (1680): No heartbeat from core client for 30 sec - exiting 17:00:03 (6380): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5396, iMonCtr=1 Model crash detected, will try to restart... 18:46:26 (5476): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:46:32 (5476): No heartbeat from core client for 30 sec - exiting 18:46:33 (5476): No heartbeat from core client for 30 sec - exiting 18:46:34 (5476): No heartbeat from core client for 30 sec - exiting 18:46:35 (5476): No heartbeat from core client for 30 sec - exiting 18:46:36 (5476): No heartbeat from core client for 30 sec - exiting Atmos Hold Restart file rename failed on atmos_restart.hold 13:26:17 (5444): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:26:33 (5444): No heartbeat from core client for 30 sec - exiting 13:26:34 (5444): No heartbeat from core client for 30 sec - exiting 13:26:35 (5444): No heartbeat from core client for 30 sec - exiting 13:26:37 (5444): No heartbeat from core client for 30 sec - exiting 13:26:38 (5444): No heartbeat from core client for 30 sec - exiting 13:26:39 (5444): No heartbeat from core client for 30 sec - exiting 13:26:40 (5444): No heartbeat from core client for 30 sec - exiting 13:26:41 (5444): No heartbeat from core client for 30 sec - exiting 13:26:42 (5444): No heartbeat from core client for 30 sec - exiting 13:26:43 (5444): No heartbeat from core client for 30 sec - exiting Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6136, iMonCtr=1 Model crash detected, will try to restart... 20:31:39 (4896): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:07:59 (5512): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6424, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4296, iMonCtr=1 Model crash detected, will try to restart... Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x77C87373 read attempt to address 0xFFFFFFF8 Engaging BOINC Windows Runtime Debugger... Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x77073AB3 read attempt to address 0x00000000 Engaging BOINC Windows Runtime Debugger... </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
23 Jun 2013 16:15:29 | 1129146 | 15791246 | hadcm3n_zk2s_1960_40_008370074_0 | 259,200 | 396,471 | 1.5296 |
22 Jun 2013 17:26:29 | 1129146 | 15791246 | hadcm3n_zk2s_1960_40_008370074_0 | 233,280 | 356,581 | 1.5286 |
16 Jun 2013 20:02:45 | 1129146 | 15791246 | hadcm3n_zk2s_1960_40_008370074_0 | 207,360 | 317,646 | 1.5319 |
14 Jun 2013 09:12:42 | 1129146 | 15791246 | hadcm3n_zk2s_1960_40_008370074_0 | 181,440 | 278,367 | 1.5342 |
08 Jun 2013 12:40:20 | 1129146 | 15791246 | hadcm3n_zk2s_1960_40_008370074_0 | 155,520 | 239,115 | 1.5375 |
02 Jun 2013 13:35:27 | 1129146 | 15791246 | hadcm3n_zk2s_1960_40_008370074_0 | 129,600 | 199,591 | 1.5401 |
01 Jun 2013 12:35:50 | 1129146 | 15791246 | hadcm3n_zk2s_1960_40_008370074_0 | 103,680 | 158,455 | 1.5283 |
31 May 2013 10:57:07 | 1129146 | 15791246 | hadcm3n_zk2s_1960_40_008370074_0 | 77,760 | 119,012 | 1.5305 |
25 May 2013 13:48:47 | 1129146 | 15791246 | hadcm3n_zk2s_1960_40_008370074_0 | 51,840 | 79,406 | 1.5318 |
22 May 2013 18:22:03 | 1129146 | 15791246 | hadcm3n_zk2s_1960_40_008370074_0 | 25,920 | 39,638 | 1.5292 |
©2024 cpdn.org