Task 15449332

Name hadcm3n_zh9u_1880_40_008249440_1
Workunit 8404564
Created 21 Nov 2012, 22:56:55 UTC
Sent 21 Nov 2012, 22:56:56 UTC
Report deadline 21 Feb 2013, 6:24:07 UTC
Received 17 Dec 2012, 6:49:35 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1157390
Run time 17 days 16 hours 3 min 44 sec
CPU time 17 days 15 hours 8 min 29 sec
Validate state Invalid
Credit 6,220.80
Device peak FLOPS 1.38 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 (i686-pc-linux-gnu)
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
process exited with code 22 (0x16, -234)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
17:47:51 (18624): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
17:47:55 (18624): No heartbeat from core client for 30 sec - exiting
17:47:56 (18624): No heartbeat from core client for 30 sec - exiting
17:47:57 (18624): No heartbeat from core client for 30 sec - exiting
17:48:00 (18624): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
06:34:09 (25994): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
06:34:10 (25994): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
11:53:55 (3340): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
16:43:03 (6221): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:23:20 (6319): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:23:22 (6319):00:54:15 (6449): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
02:16:44 (6608): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
05:34:53 (6671): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
13:38:40 (6758): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
21:54:52 (8666): No heartbeat from core client for 30 sec - exiting
21:54:53 (8666): No heartbeat from core client for 30 sec - exiting
21:54:54 (8666): No heartbeat from core client for 30 sec - exiting
21:54:55 (8666): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_zh9u_1880_40_008249440/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_zh9u_1880_40_008249440/dataout/ocean_restart.day after 11 attempts

Model crashed: READ_FLH: I/O error                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_zh9u_1880_40_008249440/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_zh9u_1880_40_008249440/dataout/ocean_restart.day after 11 attempts

Model crashed: READ_FLH: I/O error                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_zh9u_1880_40_008249440/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_zh9u_1880_40_008249440/dataout/ocean_restart.day after 11 attempts

Model crashed: READ_FLH: I/O error                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_zh9u_1880_40_008249440/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_zh9u_1880_40_008249440/dataout/ocean_restart.day after 11 attempts

Model crashed: READ_FLH: I/O error                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_zh9u_1880_40_008249440/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_zh9u_1880_40_008249440/dataout/ocean_restart.day after 11 attempts

Model crashed: READ_FLH: I/O error                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_zh9u_1880_40_008249440/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_zh9u_1880_40_008249440/dataout/ocean_restart.day after 11 attempts

Model crashed: READ_FLH: I/O error                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
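The stderr above repeats a handful of message kinds many times, so a tally can make the failure sequence easier to read. The following is a minimal sketch, not part of BOINC or the CPDN tooling: the labels, the `PATTERNS` table, and the `tally` function are hypothetical names chosen for illustration, and the sample lines are excerpts copied from the log above (the crash line is truncated).

```python
import re
from collections import Counter

# Hypothetical labels mapped to substrings seen in the stderr above.
PATTERNS = {
    "suspend_request": re.compile(r"Suspend request from BOINC"),
    "no_heartbeat": re.compile(r"No heartbeat from core client"),
    "restart_file_unreadable": re.compile(r"cannot open input file .*_restart\.day"),
    "model_crash": re.compile(r"Model crashed:"),
}

def tally(lines):
    """Count how many lines match each label; a line counts once at most."""
    counts = Counter()
    for line in lines:
        for label, pattern in PATTERNS.items():
            if pattern.search(line):
                counts[label] += 1
                break
    return counts

# Excerpts from the log above (the "Model crashed" line is truncated).
sample = [
    "Suspended CPDN Monitor - Suspend request from BOINC...",
    "17:47:51 (18624): No heartbeat from core client for 30 sec - exiting",
    "CPDN Monitor - No 'heartbeat' from BOINC...",  # matches no pattern here
    "cpdnmonitor: cannot open input file /var/lib/boinc-client/projects/"
    "climateprediction.net/hadcm3n_zh9u_1880_40_008249440/dataout/"
    "atmos_restart.day after 11 attempts",
    "Model crashed: READ_FLH: I/O error",
]

for label, count in tally(sample).items():
    print(f"{label}: {count}")
```

Feeding the full stderr text through `tally(text.splitlines())` would give the totals for the whole run. The monitor's own `No 'heartbeat'` lines are deliberately left uncounted here so that each heartbeat loss is not counted twice.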
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
17 Dec 2012 03:58:50  1157390  15449332   hadcm3n_zh9u_1880_40_008249440_1  518,400   1,523,350       2.9386
13 Dec 2012 20:30:54  1157390  15449332   hadcm3n_zh9u_1880_40_008249440_1  492,480   1,449,866       2.9440
08 Dec 2012 02:29:55  1157390  15449332   hadcm3n_zh9u_1880_40_008249440_1  466,560   1,373,321       2.9435
07 Dec 2012 05:16:23  1157390  15449332   hadcm3n_zh9u_1880_40_008249440_1  440,640   1,296,950       2.9433
06 Dec 2012 07:53:12  1157390  15449332   hadcm3n_zh9u_1880_40_008249440_1  414,720   1,220,412       2.9427
05 Dec 2012 10:39:48  1157390  15449332   hadcm3n_zh9u_1880_40_008249440_1  388,800   1,144,151       2.9428
04 Dec 2012 13:23:12  1157390  15449332   hadcm3n_zh9u_1880_40_008249440_1  362,880   1,067,733       2.9424
03 Dec 2012 16:05:58  1157390  15449332   hadcm3n_zh9u_1880_40_008249440_1  336,960   991,223         2.9417
02 Dec 2012 18:53:46  1157390  15449332   hadcm3n_zh9u_1880_40_008249440_1  311,040   914,970         2.9416
01 Dec 2012 21:35:57  1157390  15449332   hadcm3n_zh9u_1880_40_008249440_1  285,120   838,465         2.9407
01 Dec 2012 00:19:20  1157390  15449332   hadcm3n_zh9u_1880_40_008249440_1  259,200   762,068         2.9401
30 Nov 2012 03:55:47  1157390  15449332   hadcm3n_zh9u_1880_40_008249440_1  233,280   685,534         2.9387
29 Nov 2012 06:42:04  1157390  15449332   hadcm3n_zh9u_1880_40_008249440_1  207,360   609,364         2.9387
28 Nov 2012 09:26:56  1157390  15449332   hadcm3n_zh9u_1880_40_008249440_1  181,440   533,389         2.9398
27 Nov 2012 12:24:23  1157390  15449332   hadcm3n_zh9u_1880_40_008249440_1  155,520   457,440         2.9414
26 Nov 2012 14:57:12  1157390  15449332   hadcm3n_zh9u_1880_40_008249440_1  129,600   381,406         2.9429
25 Nov 2012 17:43:37  1157390  15449332   hadcm3n_zh9u_1880_40_008249440_1  103,680   305,410         2.9457
24 Nov 2012 19:50:42  1157390  15449332   hadcm3n_zh9u_1880_40_008249440_1  77,760    229,017         2.9452
23 Nov 2012 20:03:00  1157390  15449332   hadcm3n_zh9u_1880_40_008249440_1  51,840    152,994         2.9513
22 Nov 2012 22:33:19  1157390  15449332   hadcm3n_zh9u_1880_40_008249440_1  25,920    76,444          2.9492
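The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the timestep count. That relationship is an assumption inferred from the figures, not something the page states. The sketch below checks it against three rows taken from the table; `average_sec_per_timestep` is a hypothetical helper name.

```python
def average_sec_per_timestep(cpu_time_sec, timestep):
    """Cumulative CPU seconds divided by the number of completed timesteps."""
    return cpu_time_sec / timestep

# (timestep, CPU time in seconds, reported average) copied from the table.
rows = [
    (518_400, 1_523_350, 2.9386),
    (259_200, 762_068, 2.9401),
    (25_920, 76_444, 2.9492),
]

for timestep, cpu_time_sec, reported in rows:
    computed = average_sec_per_timestep(cpu_time_sec, timestep)
    print(f"{timestep:>7}  computed={computed:.4f}  reported={reported:.4f}")
```

For these rows the computed value agrees with the reported one to four decimal places, which is consistent with the assumed formula.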


©2024 cpdn.org