Name | hadcm3n_ye2i_1940_40_007426850_2 |
Workunit | 7624353 |
Created | 23 Sep 2011, 7:12:54 UTC |
Sent | 23 Sep 2011, 7:16:57 UTC |
Report deadline | 23 Dec 2011, 14:44:08 UTC |
Received | 15 Nov 2011, 20:50:29 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | -1073741819 (0xC0000005) STATUS_ACCESS_VIOLATION |
Computer ID | 1164541 |
Run time | 22 days 22 hours 15 min 20 sec |
CPU time | 20 days 13 hours 26 min 5 sec |
Validate state | Invalid |
Credit | 9,331.20 |
Device peak FLOPS | 2.67 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>6.12.34</core_client_version> <![CDATA[ <message> - exit code -1073741819 (0xc0000005) </message> <stderr_txt> Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2100, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2100, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2588, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3724, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3724, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2604, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2084, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2392, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3932, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 06:49:19 (6136): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2376, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2424, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6044, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2368, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5164, iMonCtr=1 Model crash detected, will try to restart... Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x77143A93 read attempt to address 0x00000000 Engaging BOINC Windows Runtime Debugger... Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x77033A93 read attempt to address 0x00000000 Engaging BOINC Windows Runtime Debugger... </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
15 Nov 2011 19:52:05 | 1164541 | 13415006 | hadcm3n_ye2i_1940_40_007426850_2 | 777,600 | 1,769,283 | 2.2753 |
15 Nov 2011 19:52:05 | 1164541 | 13415006 | hadcm3n_ye2i_1940_40_007426850_2 | 751,680 | 1,711,663 | 2.2771 |
15 Nov 2011 19:52:05 | 1164541 | 13415006 | hadcm3n_ye2i_1940_40_007426850_2 | 725,760 | 1,654,096 | 2.2791 |
15 Nov 2011 19:52:05 | 1164541 | 13415006 | hadcm3n_ye2i_1940_40_007426850_2 | 699,840 | 1,595,086 | 2.2792 |
09 Nov 2011 21:51:23 | 1164541 | 13415006 | hadcm3n_ye2i_1940_40_007426850_2 | 673,920 | 1,534,060 | 2.2763 |
08 Nov 2011 07:34:50 | 1164541 | 13415006 | hadcm3n_ye2i_1940_40_007426850_2 | 648,000 | 1,474,187 | 2.2750 |
07 Nov 2011 04:35:51 | 1164541 | 13415006 | hadcm3n_ye2i_1940_40_007426850_2 | 622,080 | 1,417,031 | 2.2779 |
05 Nov 2011 22:48:24 | 1164541 | 13415006 | hadcm3n_ye2i_1940_40_007426850_2 | 596,160 | 1,356,304 | 2.2751 |
03 Nov 2011 09:46:22 | 1164541 | 13415006 | hadcm3n_ye2i_1940_40_007426850_2 | 570,240 | 1,296,134 | 2.2730 |
01 Nov 2011 09:10:21 | 1164541 | 13415006 | hadcm3n_ye2i_1940_40_007426850_2 | 544,320 | 1,235,911 | 2.2706 |
31 Oct 2011 18:33:36 | 1164541 | 13415006 | hadcm3n_ye2i_1940_40_007426850_2 | 518,400 | 1,179,419 | 2.2751 |
31 Oct 2011 17:16:13 | 1164541 | 13415006 | hadcm3n_ye2i_1940_40_007426850_2 | 492,480 | 1,121,705 | 2.2777 |
31 Oct 2011 16:49:23 | 1164541 | 13415006 | hadcm3n_ye2i_1940_40_007426850_2 | 466,560 | 1,063,173 | 2.2787 |
31 Oct 2011 14:55:30 | 1164541 | 13415006 | hadcm3n_ye2i_1940_40_007426850_2 | 440,640 | 1,007,189 | 2.2857 |
31 Oct 2011 14:55:30 | 1164541 | 13415006 | hadcm3n_ye2i_1940_40_007426850_2 | 414,720 | 949,136 | 2.2886 |
31 Oct 2011 14:55:30 | 1164541 | 13415006 | hadcm3n_ye2i_1940_40_007426850_2 | 388,800 | 890,588 | 2.2906 |
19 Oct 2011 05:22:23 | 1164541 | 13415006 | hadcm3n_ye2i_1940_40_007426850_2 | 362,880 | 831,226 | 2.2906 |
18 Oct 2011 10:44:24 | 1164541 | 13415006 | hadcm3n_ye2i_1940_40_007426850_2 | 336,960 | 775,381 | 2.3011 |
17 Oct 2011 00:28:30 | 1164541 | 13415006 | hadcm3n_ye2i_1940_40_007426850_2 | 311,040 | 719,429 | 2.3130 |
16 Oct 2011 07:05:36 | 1164541 | 13415006 | hadcm3n_ye2i_1940_40_007426850_2 | 285,120 | 663,430 | 2.3268 |
©2024 cpdn.org