climateprediction.net home page
Task 14612578

Task 14612578

Name hadcm3n_y7qh_1980_40_007865190_4
Workunit 8020302
Created 1 May 2012, 10:20:44 UTC
Sent 1 May 2012, 10:21:08 UTC
Report deadline 31 Jul 2012, 17:48:19 UTC
Received 26 Jul 2012, 10:42:58 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1019837
Run time 59 days 8 hours 9 min 23 sec
CPU time 25 days 18 hours 33 min 22 sec
Validate state Invalid
Credit 5,598.72
Device peak FLOPS 1.39 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.6.38</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5996, selfPID=5996, iMonCtr=1
CPDN Monitor - Quit request from BOINC...

Model crashed: DUMPCTL : Fail to open output dump - may already exist                                                                                                                                                                                                          tmp/pipe_dummy                                                                  2048    
09:02:49 (850504): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Atmos Hold Restart file rename failed on atmos_restart.hold
Atmos Hold Restart file rename failed on atmos_restart.hold
Atmos Hold Restart file rename failed on atmos_restart.hold
Atmos Hold Restart file rename failed on atmos_restart.hold
CPDN Monitor - Quit request from BOINC...
Atmos Hold Restart file rename failed on atmos_restart.hold

Model crashed: TEMPHIST: Failed in OPEN of history file                                                                                                                                                                                                                        tmp/pipe_dummy                                                                  2048    
06:57:08 (893108): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Atmos Restart file copy failed on y7qhka.daj61l0
Atmos Hold Restart file rename failed on atmos_restart.hold
Atmos Hold Restart file rename failed on atmos_restart.hold
Atmos Hold Restart file rename failed on atmos_restart.hold
Atmos Hold Restart file rename failed on atmos_restart.hold
Atmos Hold Restart file rename failed on atmos_restart.hold
Atmos Hold Restart file rename failed on atmos_restart.hold
Atmos Hold Restart file rename failed on atmos_restart.hold
Atmos Hold Restart file rename failed on atmos_restart.hold
Atmos Hold Restart file rename failed on atmos_restart.hold
Atmos Hold Restart file rename failed on atmos_restart.hold
Atmos Hold Restart file rename failed on atmos_restart.hold
Atmos Hold Restart file rename failed on atmos_restart.hold
Atmos Hold Restart file rename failed on atmos_restart.hold
Atmos Hold Restart file rename failed on atmos_restart.hold
Atmos Hold Restart file rename failed on atmos_restart.hold
CPDN Monitor - Quit request from BOINC...

Model crashed: WRITDUMP: BAD BUFFOUT OF DATA                                                                                                                                                                                                                                   tmp/pipe_dummy                                                                  2048    
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=864100, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=864100, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=864100, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=864100, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=864100, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
23 Jul 2012 17:00:58 1019837 14612578 hadcm3n_y7qh_1980_40_007865190_4 466,560 2,370,553 5.0809
21 Jul 2012 20:13:51 1019837 14612578 hadcm3n_y7qh_1980_40_007865190_4 440,640 2,239,537 5.0825
19 Jul 2012 06:20:47 1019837 14612578 hadcm3n_y7qh_1980_40_007865190_4 414,720 2,102,891 5.0706
10 Jul 2012 20:35:41 1019837 14612578 hadcm3n_y7qh_1980_40_007865190_4 388,800 1,954,619 5.0273
05 Jul 2012 10:18:43 1019837 14612578 hadcm3n_y7qh_1980_40_007865190_4 362,880 1,809,121 4.9855
02 Jul 2012 10:55:42 1019837 14612578 hadcm3n_y7qh_1980_40_007865190_4 336,960 1,667,018 4.9472
25 Jun 2012 14:46:49 1019837 14612578 hadcm3n_y7qh_1980_40_007865190_4 311,040 1,524,421 4.9010
20 Jun 2012 23:36:46 1019837 14612578 hadcm3n_y7qh_1980_40_007865190_4 285,120 1,375,473 4.8242
17 Jun 2012 23:01:38 1019837 14612578 hadcm3n_y7qh_1980_40_007865190_4 259,200 1,235,590 4.7669
12 Jun 2012 23:01:24 1019837 14612578 hadcm3n_y7qh_1980_40_007865190_4 233,280 1,089,725 4.6713
07 Jun 2012 21:04:30 1019837 14612578 hadcm3n_y7qh_1980_40_007865190_4 207,360 946,451 4.5643
27 May 2012 12:31:00 1019837 14612578 hadcm3n_y7qh_1980_40_007865190_4 181,440 894,370 4.9293
23 May 2012 17:50:50 1019837 14612578 hadcm3n_y7qh_1980_40_007865190_4 155,520 748,928 4.8156
17 May 2012 11:54:10 1019837 14612578 hadcm3n_y7qh_1980_40_007865190_4 129,600 689,855 5.3230
12 May 2012 19:35:33 1019837 14612578 hadcm3n_y7qh_1980_40_007865190_4 103,680 543,641 5.2435
07 May 2012 11:41:05 1019837 14612578 hadcm3n_y7qh_1980_40_007865190_4 77,760 403,688 5.1915
05 May 2012 20:27:30 1019837 14612578 hadcm3n_y7qh_1980_40_007865190_4 51,840 271,767 5.2424
03 May 2012 13:20:29 1019837 14612578 hadcm3n_y7qh_1980_40_007865190_4 25,920 133,755 5.1603


©2024 cpdn.org