Task 15723798

Name hadcm3n_zj3b_1920_40_008324548_3
Workunit 8475683
Created 13 Apr 2013, 20:37:40 UTC
Sent 13 Apr 2013, 20:38:05 UTC
Report deadline 14 Jul 2013, 4:05:16 UTC
Received 21 May 2013, 10:28:48 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1278437
Run time 15 days 19 hours 9 min 41 sec
CPU time 13 days 17 hours 29 min 14 sec
Validate state Invalid
Credit 11,197.44
Device peak FLOPS 2.17 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr
<core_client_version>6.12.34</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2792, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4376, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5060, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3172, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5588, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5588, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5644, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4112, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4112, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4112, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4112, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4112, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4112, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS)
20 May 2013 18:44:37 | 1278437 | 15723798 | hadcm3n_zj3b_1920_40_008324548_3 | 933,120 | 1,182,980 | 1.2678
20 May 2013 05:04:40 | 1278437 | 15723798 | hadcm3n_zj3b_1920_40_008324548_3 | 907,200 | 1,148,223 | 1.2657
19 May 2013 09:27:24 | 1278437 | 15723798 | hadcm3n_zj3b_1920_40_008324548_3 | 881,280 | 1,133,347 | 1.2860
18 May 2013 23:18:35 | 1278437 | 15723798 | hadcm3n_zj3b_1920_40_008324548_3 | 855,360 | 1,097,888 | 1.2835
18 May 2013 12:45:06 | 1278437 | 15723798 | hadcm3n_zj3b_1920_40_008324548_3 | 829,440 | 1,061,575 | 1.2799
18 May 2013 02:26:13 | 1278437 | 15723798 | hadcm3n_zj3b_1920_40_008324548_3 | 803,520 | 1,025,992 | 1.2769
17 May 2013 15:17:54 | 1278437 | 15723798 | hadcm3n_zj3b_1920_40_008324548_3 | 777,600 | 988,990 | 1.2718
17 May 2013 02:32:54 | 1278437 | 15723798 | hadcm3n_zj3b_1920_40_008324548_3 | 751,680 | 952,997 | 1.2678
16 May 2013 15:35:00 | 1278437 | 15723798 | hadcm3n_zj3b_1920_40_008324548_3 | 725,760 | 918,117 | 1.2650
15 May 2013 23:28:38 | 1278437 | 15723798 | hadcm3n_zj3b_1920_40_008324548_3 | 699,840 | 883,643 | 1.2626
15 May 2013 12:31:52 | 1278437 | 15723798 | hadcm3n_zj3b_1920_40_008324548_3 | 673,920 | 847,288 | 1.2573
14 May 2013 17:07:20 | 1278437 | 15723798 | hadcm3n_zj3b_1920_40_008324548_3 | 648,000 | 811,128 | 1.2517
14 May 2013 06:22:52 | 1278437 | 15723798 | hadcm3n_zj3b_1920_40_008324548_3 | 622,080 | 775,582 | 1.2468
13 May 2013 20:14:34 | 1278437 | 15723798 | hadcm3n_zj3b_1920_40_008324548_3 | 596,160 | 741,252 | 1.2434
13 May 2013 10:09:16 | 1278437 | 15723798 | hadcm3n_zj3b_1920_40_008324548_3 | 570,240 | 705,399 | 1.2370
12 May 2013 13:22:33 | 1278437 | 15723798 | hadcm3n_zj3b_1920_40_008324548_3 | 544,320 | 669,614 | 1.2302
12 May 2013 02:43:49 | 1278437 | 15723798 | hadcm3n_zj3b_1920_40_008324548_3 | 518,400 | 633,867 | 1.2227
11 May 2013 17:11:43 | 1278437 | 15723798 | hadcm3n_zj3b_1920_40_008324548_3 | 492,480 | 599,018 | 1.2163
11 May 2013 06:12:05 | 1278437 | 15723798 | hadcm3n_zj3b_1920_40_008324548_3 | 466,560 | 585,028 | 1.2539
10 May 2013 17:58:09 | 1278437 | 15723798 | hadcm3n_zj3b_1920_40_008324548_3 | 440,640 | 550,244 | 1.2487
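
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count at the moment each trickle was sent. The sketch below checks that reading against the three most recent rows of the table. The function name and the choice of rows are illustrative assumptions, not part of the CPDN server code.

```python
# Rows copied from the trickle table above:
# (timestep, cumulative CPU time in seconds, average reported by the server).
trickles = [
    (933_120, 1_182_980, 1.2678),
    (907_200, 1_148_223, 1.2657),
    (881_280, 1_133_347, 1.2860),
]


def seconds_per_timestep(cpu_seconds: int, timestep: int) -> float:
    """Cumulative CPU seconds divided by model timesteps completed."""
    return cpu_seconds / timestep


for timestep, cpu_seconds, reported in trickles:
    computed = seconds_per_timestep(cpu_seconds, timestep)
    # Print both values side by side so any mismatch is easy to spot.
    print(f"{timestep:>9,}  computed={computed:.4f}  reported={reported:.4f}")
```

For these rows the computed ratio agrees with the reported value to four decimal places, which supports (but does not prove) the assumed definition of the column.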

