Name | hadcm3n_ofx1_1900_40_008475752_0 |
Workunit | 8626591 |
Created | 27 Sep 2013, 10:39:26 UTC |
Sent | 27 Sep 2013, 12:08:07 UTC |
Report deadline | 27 Dec 2013, 19:35:18 UTC |
Received | 8 Jan 2014, 7:57:05 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | -1073741819 (0xC0000005) STATUS_ACCESS_VIOLATION |
Computer ID | 1244735 |
Run time | 16 days 3 hours 30 min 13 sec |
CPU time | 15 days 23 hours 28 min 32 sec |
Validate state | Invalid |
Credit | 12,130.56 |
Device peak FLOPS | 2.49 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.2.33</core_client_version> <![CDATA[ <message> (unknown error) - exit code -1073741819 (0xc0000005) </message> <stderr_txt> Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=507Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4772, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4984, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11948, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1368, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4616, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9068, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6076, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4816, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6136, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8796, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8188, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5628, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4720, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5748, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4732, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7752, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5016, iMonCtr=1 Model crash detected, will try to restart... 10:57:27 (6776): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2096, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5932, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4264, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2856, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1084, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4616, iMonCtr=1 Model crash detected, will try to restart... 
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=12020, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4588, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7404, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=784, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4472, iMonCtr=1 Model crash detected, will try to restart... 19:19:29 (6524): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x77877383 read attempt to address 0x409857CB Engaging BOINC Windows Runtime Debugger... Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x77273AC3 read attempt to address 0x409857D3 Engaging BOINC Windows Runtime Debugger... Cannot serialize file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_ofx1_1900_40_008475752/dataout/shmem_restart.day Signal 11 received, exiting... Called boinc_finish Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x77273AC3 read attempt to address 0x409857D3 Engaging BOINC Windows Runtime Debugger... </stderr_txt> ]]> |
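The Exit status row above gives the same code twice, as a signed decimal (-1073741819) and as hexadecimal (0xC0000005). A minimal sketch of how the two forms relate, assuming the value is a 32-bit Windows status code that has been reinterpreted as a signed integer; the helper names `to_unsigned32` and `to_signed32` are illustrative and not part of BOINC or CPDN:

```python
def to_unsigned32(code: int) -> int:
    """Reinterpret a (possibly negative) exit code as an unsigned 32-bit value."""
    return code & 0xFFFFFFFF


def to_signed32(code: int) -> int:
    """Reinterpret an unsigned 32-bit value as a signed 32-bit integer."""
    code &= 0xFFFFFFFF
    # If the top bit is set, the value is negative in two's complement.
    return code - 0x1_0000_0000 if code & 0x8000_0000 else code


exit_status = -1073741819  # value from the "Exit status" row
print(f"0x{to_unsigned32(exit_status):08X}")  # hexadecimal form of the same code
print(to_signed32(0xC0000005))                # back to the signed decimal form
```

Both directions are shown because result pages and logs may report either form of the same code.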
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
05 Jan 2014 08:04:49 | 1244735 | 16046466 | hadcm3n_ofx1_1900_40_008475752_0 | 1,010,880 | 1,346,131 | 1.3316 |
04 Jan 2014 22:17:32 | 1244735 | 16046466 | hadcm3n_ofx1_1900_40_008475752_0 | 984,960 | 1,311,314 | 1.3313 |
20 Dec 2013 07:15:35 | 1244735 | 16046466 | hadcm3n_ofx1_1900_40_008475752_0 | 959,040 | 1,277,532 | 1.3321 |
17 Dec 2013 10:37:04 | 1244735 | 16046466 | hadcm3n_ofx1_1900_40_008475752_0 | 933,120 | 1,243,333 | 1.3324 |
16 Dec 2013 10:14:20 | 1244735 | 16046466 | hadcm3n_ofx1_1900_40_008475752_0 | 907,200 | 1,208,891 | 1.3326 |
14 Dec 2013 10:31:16 | 1244735 | 16046466 | hadcm3n_ofx1_1900_40_008475752_0 | 881,280 | 1,172,568 | 1.3305 |
13 Dec 2013 04:37:11 | 1244735 | 16046466 | hadcm3n_ofx1_1900_40_008475752_0 | 855,360 | 1,136,500 | 1.3287 |
10 Dec 2013 09:36:44 | 1244735 | 16046466 | hadcm3n_ofx1_1900_40_008475752_0 | 829,440 | 1,102,154 | 1.3288 |
08 Dec 2013 08:04:32 | 1244735 | 16046466 | hadcm3n_ofx1_1900_40_008475752_0 | 803,520 | 1,067,563 | 1.3286 |
07 Dec 2013 07:40:01 | 1244735 | 16046466 | hadcm3n_ofx1_1900_40_008475752_0 | 777,600 | 1,032,700 | 1.3281 |
05 Dec 2013 10:58:06 | 1244735 | 16046466 | hadcm3n_ofx1_1900_40_008475752_0 | 751,680 | 997,514 | 1.3270 |
03 Dec 2013 09:18:29 | 1244735 | 16046466 | hadcm3n_ofx1_1900_40_008475752_0 | 725,760 | 963,115 | 1.3270 |
01 Dec 2013 03:07:28 | 1244735 | 16046466 | hadcm3n_ofx1_1900_40_008475752_0 | 699,840 | 928,221 | 1.3263 |
28 Nov 2013 11:13:03 | 1244735 | 16046466 | hadcm3n_ofx1_1900_40_008475752_0 | 673,920 | 893,784 | 1.3262 |
27 Nov 2013 05:35:00 | 1244735 | 16046466 | hadcm3n_ofx1_1900_40_008475752_0 | 648,000 | 859,796 | 1.3268 |
24 Nov 2013 06:01:03 | 1244735 | 16046466 | hadcm3n_ofx1_1900_40_008475752_0 | 622,080 | 824,645 | 1.3256 |
22 Nov 2013 08:46:11 | 1244735 | 16046466 | hadcm3n_ofx1_1900_40_008475752_0 | 596,160 | 789,551 | 1.3244 |
18 Nov 2013 09:50:00 | 1244735 | 16046466 | hadcm3n_ofx1_1900_40_008475752_0 | 570,240 | 755,176 | 1.3243 |
17 Nov 2013 01:42:16 | 1244735 | 16046466 | hadcm3n_ofx1_1900_40_008475752_0 | 544,320 | 720,362 | 1.3234 |
14 Nov 2013 08:03:09 | 1244735 | 16046466 | hadcm3n_ofx1_1900_40_008475752_0 | 518,400 | 685,874 | 1.3231 |
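The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the cumulative timestep count. A small sketch that checks this against three rows copied from the table; the function name `average_sec_per_timestep` is hypothetical, and the four-decimal rounding is an assumption based on how the values are displayed:

```python
def average_sec_per_timestep(cpu_time_sec: int, timestep: int) -> str:
    """Return CPU seconds per timestep, formatted to four decimals like the table."""
    return f"{cpu_time_sec / timestep:.4f}"


# (timestep, CPU time in seconds, average as listed) taken from the table rows
rows = [
    (1_010_880, 1_346_131, "1.3316"),  # 05 Jan 2014
    (984_960, 1_311_314, "1.3313"),    # 04 Jan 2014
    (518_400, 685_874, "1.3231"),      # 14 Nov 2013
]

for timestep, cpu_time, listed in rows:
    computed = average_sec_per_timestep(cpu_time, timestep)
    print(timestep, computed, "matches" if computed == listed else "differs")
```

The timestep values in the table also increase by 25,920 from one trickle to the next, which the same row data can be used to confirm.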
©2024 cpdn.org