climateprediction.net home page
Task 15698341

Name hadcm3n_4jwl_1940_40_008303711_2
Workunit 8454846
Created 1 Apr 2013, 6:16:19 UTC
Sent 1 Apr 2013, 6:16:42 UTC
Report deadline 1 Jul 2013, 13:43:53 UTC
Received 23 Apr 2013, 2:16:10 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1181321
Run time 15 days 5 hours 20 min 11 sec
CPU time 10 days 21 hours 35 min 41 sec
Validate state Invalid
Credit 6,220.80
Device peak FLOPS 2.14 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 (windows_intelx86)
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
07:44:18 (3972): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:44:20 (3972): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
09:14:00 (3260): No heartbeat from core client for 30 sec - exiting
09:14:01 (3260): No heartbeat from core client for 30 sec - exiting
09:14:02 (3260): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4704, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3920, iMonCtr=1
Model crash detected, will try to restart...
20:33:58 (4560): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:05:32 (3748): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:05:34 (3748): No heartbeat from core client for 30 sec - exiting
01:22:53 (5784): No heartbeat from core client for 30 sec - exiting
01:22:56 (5784): No heartbeat from core client for 30 sec - exiting
01:22:57 (5784): No heartbeat from core client for 30 sec - exiting
01:22:58 (5784): No heartbeat from core client for 30 sec - exiting
01:22:59 (5784): No heartbeat from core client for 30 sec - exiting
01:23:00 (5784): No heartbeat from core client for 30 sec - exiting
01:23:01 (5784): No heartbeat from core client for 30 sec - exiting
01:23:02 (5784): No heartbeat from core client for 30 sec - exiting
01:23:03 (5784): No heartbeat from core client for 30 sec - exiting
01:23:04 (5784): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...

zip error: Could not create output file (was replacing the original zip file)
Suspended CPDN Monitor - Suspend request from BOINC...
00:01:31 (5544): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:01:32 (5544): No heartbeat from core client for 30 sec - exiting
00:01:33 (5544): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4460, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4460, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4460, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4460, iMonCtr=1
Model crash detected, will try to restart...
21:36:10 (4100): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
BUFFOUT: C I/O Error - Return code = 32

Model crashed: WRITDUMP: BAD BUFFOUT OF DATA                                                                                                                                                                                                                                   tmp/pipe_dummy                                                                  2048    
BUFFOUT: C I/O Error - Return code = 32

Model crashed: WRITDUMP: BAD BUFFOUT OF DATA                                                                                                                                                                                                                                   tmp/pipe_dummy                                                                  2048    
BUFFOUT: C I/O Error - Return code = 32

Model crashed: WRITDUMP: BAD BUFFOUT OF DATA                                                                                                                                                                                                                                   tmp/pipe_dummy                                                                  2048    
00:01:32 (4224): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:01:36 (4224): No heartbeat from core client for 30 sec - exiting
00:01:37 (4224): No heartbeat from core client for 30 sec - exiting
00:04:23 (5148): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
18:59:35 (4064): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:59:37 (4064): No heartbeat from core client for 30 sec - exiting
18:59:38 (4064): No heartbeat from core client for 30 sec - exiting
18:59:39 (4064): No heartbeat from core client for 30 sec - exiting
18:59:40 (4064): No heartbeat from core client for 30 sec - exiting
05:37:18 (4972): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:35:59 (4376): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...

Model crashed: ATM_DYN : INVALID THETA DETECTED.                                                                                                                                                                                                                               tmp/pipe_dummy                                                                  2048    

Model crashed: ATM_DYN : INVALID THETA DETECTED.                                                                                                                                                                                                                               tmp/pipe_dummy                                                                  2048    

Model crashed: ATM_DYN : INVALID THETA DETECTED.                                                                                                                                                                                                                               tmp/pipe_dummy                                                                  2048    

Model crashed: ATM_DYN : INVALID THETA DETECTED.                                                                                                                                                                                                                               tmp/pipe_dummy                                                                  2048    

Model crashed: ATM_DYN : INVALID THETA DETECTED.                                                                                                                                                                                                                               tmp/pipe_dummy                                                                  2048    

Model crashed: ATM_DYN : INVALID THETA DETECTED.                                                                                                                                                                                                                               tmp/pipe_dummy                                                                  2048    
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
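The stderr output above reports repeated model crashes with two distinct reasons before the task gives up. As a rough aid for reading logs like this one, the sketch below tallies the `Model crashed:` lines by reason. The function name `count_crash_reasons` and the rule that the reason ends at the first run of two or more spaces are assumptions made for illustration; they are not part of BOINC or the CPDN software.

```python
import re
from collections import Counter

# Assumption: the crash reason is the text after "Model crashed:" up to the
# first run of two or more whitespace characters (the padding seen in the log).
CRASH_RE = re.compile(r"^Model crashed:\s*(.+?)(?:\s{2,}|$)")

def count_crash_reasons(stderr_text):
    """Return a Counter mapping each crash reason to its number of occurrences."""
    counts = Counter()
    for line in stderr_text.splitlines():
        match = CRASH_RE.match(line)
        if match:
            counts[match.group(1).strip()] += 1
    return counts

# A shortened sample in the same shape as the log lines above.
sample = "\n".join([
    "BUFFOUT: C I/O Error - Return code = 32",
    "Model crashed: WRITDUMP: BAD BUFFOUT OF DATA      tmp/pipe_dummy      2048",
    "Model crashed: ATM_DYN : INVALID THETA DETECTED.      tmp/pipe_dummy      2048",
    "Model crashed: ATM_DYN : INVALID THETA DETECTED.      tmp/pipe_dummy      2048",
    "Sorry, too many model crashes! :-(",
])

for reason, n in count_crash_reasons(sample).items():
    print(f"{n} x {reason}")
```

Run against the full stderr text above, this approach should find the three `WRITDUMP: BAD BUFFOUT OF DATA` crashes and the six `ATM_DYN : INVALID THETA DETECTED.` crashes that precede the "too many model crashes" exit.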
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
22 Apr 2013 21:29:59  1181321  15698341   hadcm3n_4jwl_1940_40_008303711_2  518,400   956,729         1.8455
22 Apr 2013 02:06:32  1181321  15698341   hadcm3n_4jwl_1940_40_008303711_2  492,480   901,062         1.8296
22 Apr 2013 00:25:58  1181321  15698341   hadcm3n_4jwl_1940_40_008303711_2  466,560   844,483         1.8100
22 Apr 2013 00:25:58  1181321  15698341   hadcm3n_4jwl_1940_40_008303711_2  440,640   788,370         1.7891
20 Apr 2013 05:07:45  1181321  15698341   hadcm3n_4jwl_1940_40_008303711_2  414,720   882,590         2.1282
19 Apr 2013 05:17:33  1181321  15698341   hadcm3n_4jwl_1940_40_008303711_2  388,800   826,802         2.1265
18 Apr 2013 06:15:15  1181321  15698341   hadcm3n_4jwl_1940_40_008303711_2  362,880   771,001         2.1247
18 Apr 2013 05:09:53  1181321  15698341   hadcm3n_4jwl_1940_40_008303711_2  336,960   715,977         2.1248
17 Apr 2013 05:05:34  1181321  15698341   hadcm3n_4jwl_1940_40_008303711_2  311,040   661,201         2.1258
16 Apr 2013 05:08:33  1181321  15698341   hadcm3n_4jwl_1940_40_008303711_2  285,120   606,438         2.1270
15 Apr 2013 06:24:37  1181321  15698341   hadcm3n_4jwl_1940_40_008303711_2  259,200   551,523         2.1278
15 Apr 2013 05:09:13  1181321  15698341   hadcm3n_4jwl_1940_40_008303711_2  233,280   496,940         2.1302
14 Apr 2013 05:05:23  1181321  15698341   hadcm3n_4jwl_1940_40_008303711_2  207,360   441,928         2.1312
13 Apr 2013 05:07:20  1181321  15698341   hadcm3n_4jwl_1940_40_008303711_2  181,440   386,787         2.1318
12 Apr 2013 06:13:29  1181321  15698341   hadcm3n_4jwl_1940_40_008303711_2  155,520   332,021         2.1349
12 Apr 2013 05:03:05  1181321  15698341   hadcm3n_4jwl_1940_40_008303711_2  129,600   276,059         2.1301
11 Apr 2013 05:07:54  1181321  15698341   hadcm3n_4jwl_1940_40_008303711_2  103,680   220,911         2.1307
10 Apr 2013 05:06:53  1181321  15698341   hadcm3n_4jwl_1940_40_008303711_2  77,760    166,058         2.1355
09 Apr 2013 06:29:10  1181321  15698341   hadcm3n_4jwl_1940_40_008303711_2  51,840    112,039         2.1612
09 Apr 2013 05:08:43  1181321  15698341   hadcm3n_4jwl_1940_40_008303711_2  25,920    56,529          2.1809
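
The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the timestep count. The short sketch below checks that reading against three rows copied from the table. The helper name `sec_per_timestep` is made up for illustration, and the formula is an inference from the numbers, not something stated by the page.

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_average)
rows = [
    (518_400, 956_729, 1.8455),
    (414_720, 882_590, 2.1282),
    (25_920, 56_529, 2.1809),
]

def sec_per_timestep(cpu_time_sec, timestep):
    """Assumed definition: cumulative CPU seconds per model timestep."""
    return cpu_time_sec / timestep

for timestep, cpu_time_sec, reported in rows:
    computed = sec_per_timestep(cpu_time_sec, timestep)
    # Compare at the table's four-decimal precision.
    agrees = abs(computed - reported) < 1e-4
    print(f"timestep {timestep:>7}: computed {computed:.4f}, reported {reported:.4f}, agrees={agrees}")
```

For these three rows the computed ratio matches the reported value to four decimal places, which is consistent with the assumed definition.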