climateprediction.net home page
Task 12747514

Task 12747514

Name hadcm3n_o617_1900_40_007203150_1
Workunit 7401430
Created 28 Mar 2011, 14:15:40 UTC
Sent 29 Mar 2011, 11:24:31 UTC
Report deadline 28 Jun 2011, 18:51:42 UTC
Received 26 Apr 2011, 19:36:45 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status -1073741819 (0xC0000005) STATUS_ACCESS_VIOLATION
Computer ID 1096933
Run time 8 days 5 hours 54 min 1 sec
CPU time 8 days 3 hours 12 min 13 sec
Validate state Invalid
Credit 6,220.80
Device peak FLOPS 2.81 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
 - exit code -1073741819 (0xc0000005)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4068, iMonCtr=1
Model crash detected, will try to restart...
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1108, selfPID=1108, iMonCtr=1
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Error converting file to netcdf: dataout/o617ko.pja3c10
Error converting file to netcdf: dataout/o617ko.pia3c10
Error converting file to netcdf: dataout/o617ko.pfa3c10
Error converting file to netcdf: dataout/o617ka.pha3c10
Error converting file to netcdf: dataout/o617ka.pga3c10
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Atmos Hold Restart file rename failed on atmos_restart.hold
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
BUFFOUT: C I/O Error - Return code = 32

Model crashed: WRITDUMP: BAD BUFFOUT OF DATA                                                                                                                                                                                                                                   tmp/pipe_dummy                                                                  2048    
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3108, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1252, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3620, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
17:00:25 (6304): Can't acquire lockfile (32) - waiting 35s
No Process Handle
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7640, selfPID=7640, iMonCtr=1
18:44:54 (6304): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:44:55 (6304): No heartbeat from core client for 30 sec - exiting
18:44:56 (6304): No heartbeat from core client for 30 sec - exiting
18:44:57 (6304): No heartbeat from core client for 30 sec - exiting
18:44:58 (6304): No heartbeat from core client for 30 sec - exiting
18:44:59 (6304): No heartbeat from core client for 30 sec - exiting
18:45:00 (6304): No heartbeat from core client for 30 sec - exiting
18:45:01 (6304): No heartbeat from core client for 30 sec - exiting
18:45:02 (6304): No heartbeat from core client for 30 sec - exiting
18:45:03 (6304): No heartbeat from core client for 30 sec - exiting
18:45:04 (6304): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Atmos Hold Restart file rename failed on atmos_restart.hold
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3940, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4928, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3048, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...


Unhandled Exception Detected...

- Unhandled Exception Record -
Reason: Access Violation (0xc0000005) at address 0x76FB3232 read attempt to address 0x3EC97BB2

Engaging BOINC Windows Runtime Debugger...

No Process Handle
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2644, selfPID=2644, iMonCtr=1


Unhandled Exception Detected...

- Unhandled Exception Record -
Reason: Access Violation (0xc0000005) at address 0x76FB3232 read attempt to address 0x3EC97BB2

Engaging BOINC Windows Runtime Debugger...


</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
26 Apr 2011 17:32:10 1096933 12747514 hadcm3n_o617_1900_40_007203150_1 518,400 694,961 1.3406
25 Apr 2011 21:40:52 1096933 12747514 hadcm3n_o617_1900_40_007203150_1 492,480 660,116 1.3404
24 Apr 2011 19:29:34 1096933 12747514 hadcm3n_o617_1900_40_007203150_1 466,560 625,224 1.3401
21 Apr 2011 20:08:01 1096933 12747514 hadcm3n_o617_1900_40_007203150_1 440,640 590,627 1.3404
21 Apr 2011 07:05:25 1096933 12747514 hadcm3n_o617_1900_40_007203150_1 414,720 556,659 1.3423
20 Apr 2011 20:45:44 1096933 12747514 hadcm3n_o617_1900_40_007203150_1 388,800 522,907 1.3449
20 Apr 2011 20:45:44 1096933 12747514 hadcm3n_o617_1900_40_007203150_1 362,880 489,273 1.3483
20 Apr 2011 20:45:44 1096933 12747514 hadcm3n_o617_1900_40_007203150_1 336,960 455,748 1.3525
20 Apr 2011 20:45:44 1096933 12747514 hadcm3n_o617_1900_40_007203150_1 311,040 421,230 1.3543
20 Apr 2011 20:45:44 1096933 12747514 hadcm3n_o617_1900_40_007203150_1 285,120 385,423 1.3518
20 Apr 2011 20:45:44 1096933 12747514 hadcm3n_o617_1900_40_007203150_1 259,200 349,618 1.3488
20 Apr 2011 20:45:44 1096933 12747514 hadcm3n_o617_1900_40_007203150_1 233,280 313,915 1.3457
13 Apr 2011 05:56:52 1096933 12747514 hadcm3n_o617_1900_40_007203150_1 207,360 278,528 1.3432
12 Apr 2011 07:59:09 1096933 12747514 hadcm3n_o617_1900_40_007203150_1 181,440 242,972 1.3391
11 Apr 2011 12:54:23 1096933 12747514 hadcm3n_o617_1900_40_007203150_1 155,520 207,501 1.3342
09 Apr 2011 20:11:18 1096933 12747514 hadcm3n_o617_1900_40_007203150_1 129,600 171,657 1.3245
02 Apr 2011 12:23:03 1096933 12747514 hadcm3n_o617_1900_40_007203150_1 103,680 139,194 1.3425
01 Apr 2011 14:08:12 1096933 12747514 hadcm3n_o617_1900_40_007203150_1 77,760 104,177 1.3397
31 Mar 2011 18:24:46 1096933 12747514 hadcm3n_o617_1900_40_007203150_1 51,840 69,468 1.3400
30 Mar 2011 15:59:36 1096933 12747514 hadcm3n_o617_1900_40_007203150_1 25,920 34,947 1.3483


©2024 cpdn.org