Task 16046466

Name hadcm3n_ofx1_1900_40_008475752_0
Workunit 8626591
Created 27 Sep 2013, 10:39:26 UTC
Sent 27 Sep 2013, 12:08:07 UTC
Report deadline 27 Dec 2013, 19:35:18 UTC
Received 8 Jan 2014, 7:57:05 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status -1073741819 (0xC0000005) STATUS_ACCESS_VIOLATION
Computer ID 1244735
Run time 16 days 3 hours 30 min 13 sec
CPU time 15 days 23 hours 28 min 32 sec
Validate state Invalid
Credit 12,130.56
Device peak FLOPS 2.49 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
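
The exit status above appears both as a signed decimal (-1073741819) and as a hexadecimal code (0xC0000005). As a minimal sketch of how the two forms relate, assuming the decimal value is the same 32-bit pattern read as a signed integer, the conversion can be reproduced in Python. The function name `to_unsigned_hex` is hypothetical and is not part of BOINC or the CPDN application.

```python
def to_unsigned_hex(exit_status: int) -> str:
    """Reinterpret a signed 32-bit exit status as an unsigned hex code.

    Assumption: the status is a 32-bit value that the page prints
    as a signed decimal next to its hexadecimal form.
    """
    # Masking with 0xFFFFFFFF maps the negative value onto its
    # two's-complement 32-bit bit pattern.
    return f"0x{exit_status & 0xFFFFFFFF:08X}"


if __name__ == "__main__":
    print(to_unsigned_hex(-1073741819))
```

Masking rather than adding 2**32 keeps the helper correct for non-negative statuses as well.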
Stderr
<core_client_version>7.2.33</core_client_version>
<![CDATA[
<message>
(unknown error) - exit code -1073741819 (0xc0000005)
</message>
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=507
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4772, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4984, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11948, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1368, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4616, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9068, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6076, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4816, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6136, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8796, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8188, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5628, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4720, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5748, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4732, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7752, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5016, iMonCtr=1
Model crash detected, will try to restart...
10:57:27 (6776): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2096, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5932, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4264, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2856, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1084, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4616, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=12020, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4588, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7404, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=784, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4472, iMonCtr=1
Model crash detected, will try to restart...
19:19:29 (6524): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...


Unhandled Exception Detected...

- Unhandled Exception Record -
Reason: Access Violation (0xc0000005) at address 0x77877383 read attempt to address 0x409857CB

Engaging BOINC Windows Runtime Debugger...



Unhandled Exception Detected...

- Unhandled Exception Record -
Reason: Access Violation (0xc0000005) at address 0x77273AC3 read attempt to address 0x409857D3

Engaging BOINC Windows Runtime Debugger...

Cannot serialize file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_ofx1_1900_40_008475752/dataout/shmem_restart.day
Signal 11 received, exiting...
Called boinc_finish


Unhandled Exception Detected...

- Unhandled Exception Record -
Reason: Access Violation (0xc0000005) at address 0x77273AC3 read attempt to address 0x409857D3

Engaging BOINC Windows Runtime Debugger...


</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
05 Jan 2014 08:04:49 1244735 16046466 hadcm3n_ofx1_1900_40_008475752_0 1,010,880 1,346,131 1.3316
04 Jan 2014 22:17:32 1244735 16046466 hadcm3n_ofx1_1900_40_008475752_0 984,960 1,311,314 1.3313
20 Dec 2013 07:15:35 1244735 16046466 hadcm3n_ofx1_1900_40_008475752_0 959,040 1,277,532 1.3321
17 Dec 2013 10:37:04 1244735 16046466 hadcm3n_ofx1_1900_40_008475752_0 933,120 1,243,333 1.3324
16 Dec 2013 10:14:20 1244735 16046466 hadcm3n_ofx1_1900_40_008475752_0 907,200 1,208,891 1.3326
14 Dec 2013 10:31:16 1244735 16046466 hadcm3n_ofx1_1900_40_008475752_0 881,280 1,172,568 1.3305
13 Dec 2013 04:37:11 1244735 16046466 hadcm3n_ofx1_1900_40_008475752_0 855,360 1,136,500 1.3287
10 Dec 2013 09:36:44 1244735 16046466 hadcm3n_ofx1_1900_40_008475752_0 829,440 1,102,154 1.3288
08 Dec 2013 08:04:32 1244735 16046466 hadcm3n_ofx1_1900_40_008475752_0 803,520 1,067,563 1.3286
07 Dec 2013 07:40:01 1244735 16046466 hadcm3n_ofx1_1900_40_008475752_0 777,600 1,032,700 1.3281
05 Dec 2013 10:58:06 1244735 16046466 hadcm3n_ofx1_1900_40_008475752_0 751,680 997,514 1.3270
03 Dec 2013 09:18:29 1244735 16046466 hadcm3n_ofx1_1900_40_008475752_0 725,760 963,115 1.3270
01 Dec 2013 03:07:28 1244735 16046466 hadcm3n_ofx1_1900_40_008475752_0 699,840 928,221 1.3263
28 Nov 2013 11:13:03 1244735 16046466 hadcm3n_ofx1_1900_40_008475752_0 673,920 893,784 1.3262
27 Nov 2013 05:35:00 1244735 16046466 hadcm3n_ofx1_1900_40_008475752_0 648,000 859,796 1.3268
24 Nov 2013 06:01:03 1244735 16046466 hadcm3n_ofx1_1900_40_008475752_0 622,080 824,645 1.3256
22 Nov 2013 08:46:11 1244735 16046466 hadcm3n_ofx1_1900_40_008475752_0 596,160 789,551 1.3244
18 Nov 2013 09:50:00 1244735 16046466 hadcm3n_ofx1_1900_40_008475752_0 570,240 755,176 1.3243
17 Nov 2013 01:42:16 1244735 16046466 hadcm3n_ofx1_1900_40_008475752_0 544,320 720,362 1.3234
14 Nov 2013 08:03:09 1244735 16046466 hadcm3n_ofx1_1900_40_008475752_0 518,400 685,874 1.3231

