Task 13128089

Name hadcm3n_ymkj_1900_40_007362109_0
Workunit 7559539
Created 6 Jul 2011, 15:23:33 UTC
Sent 7 Jul 2011, 8:44:21 UTC
Report deadline 6 Oct 2011, 16:11:32 UTC
Received 20 Sep 2011, 20:35:00 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1065107
Run time 18 days 5 hours 56 min 31 sec
CPU time 18 days 5 hours 56 min 31 sec
Validate state Invalid
Credit 9,331.20
Device peak FLOPS 2.33 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
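
The run time and exit status in the summary above can be cross-checked by hand. The following is a minimal sketch and not part of the CPDN or BOINC tooling; the helper names `duration_to_seconds` and `format_exit_status` are hypothetical. It converts the reported run time to seconds and shows that decimal 22 and 0x00000016 are the same value.

```python
import re

# Seconds per unit, keyed by the unit stems used on the task page.
UNIT_SECONDS = {"day": 86400, "hour": 3600, "min": 60, "sec": 1}

def duration_to_seconds(text: str) -> int:
    """Convert a string such as '18 days 5 hours 56 min 31 sec' to seconds."""
    total = 0
    # 'days' and 'hours' match on their stems 'day' and 'hour'.
    for value, unit in re.findall(r"(\d+)\s*(day|hour|min|sec)", text):
        total += int(value) * UNIT_SECONDS[unit]
    return total

def format_exit_status(code: int) -> str:
    """Render an exit status as decimal plus zero-padded 32-bit hex."""
    return f"{code} (0x{code:08x})"

run_time = duration_to_seconds("18 days 5 hours 56 min 31 sec")
print(run_time)                # 1576591
print(format_exit_status(22))  # 22 (0x00000016)
```

The result, 1,576,591 seconds, is slightly larger than the CPU time of 1,571,739 seconds reported in the most recent trickle, which would be consistent with the task running for a short while after that trickle was sent.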
Stderr
<core_client_version>6.10.18</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3112, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3576, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3456, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4988, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3680, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3344, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3516, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1684, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=952, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2684, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2744, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4208, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4092, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3344, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2756, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2756, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2756, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5944, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5944, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5944, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
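
The stderr text above repeats a small set of messages, so a tally makes the failure pattern easier to read. The sketch below is an illustration only: the function name `summarize_stderr` and the labels in `MARKERS` are hypothetical, and the search strings are copied from the log lines above. The sample input is the final eight lines of the stderr text.

```python
from collections import Counter

# Hypothetical labels mapped to substrings that appear in the stderr text.
MARKERS = {
    "crash": "Model crash detected",
    "quit_request": "Quit request from BOINC",
    "signal_22": "Signal 22 received",
    "gave_up": "too many model crashes",
}

def summarize_stderr(stderr_text: str) -> Counter:
    """Count how many lines contain each marker substring."""
    counts = Counter()
    for line in stderr_text.splitlines():
        for label, needle in MARKERS.items():
            if needle in line:
                counts[label] += 1
    return counts

# The last eight lines of the stderr text above.
sample = """\
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5944, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5944, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish
"""

summary = summarize_stderr(sample)
print(summary["crash"], summary["signal_22"], summary["gave_up"])  # 2 1 1
```

Run against the full stderr text instead of the sample, the same tally should report 20 crash messages, 6 "Signal 22" lines and 3 quit requests before the final "too many model crashes" line.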
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
18 Sep 2011 09:47:53  1065107  13128089   hadcm3n_ymkj_1900_40_007362109_0  777,600   1,571,739       2.0213
17 Sep 2011 09:25:33  1065107  13128089   hadcm3n_ymkj_1900_40_007362109_0  751,680   1,521,060       2.0235
15 Sep 2011 15:47:06  1065107  13128089   hadcm3n_ymkj_1900_40_007362109_0  725,760   1,469,036       2.0241
11 Sep 2011 03:24:35  1065107  13128089   hadcm3n_ymkj_1900_40_007362109_0  699,840   1,417,517       2.0255
05 Sep 2011 02:00:57  1065107  13128089   hadcm3n_ymkj_1900_40_007362109_0  673,920   1,362,591       2.0219
04 Sep 2011 09:30:40  1065107  13128089   hadcm3n_ymkj_1900_40_007362109_0  648,000   1,307,184       2.0173
03 Sep 2011 22:29:15  1065107  13128089   hadcm3n_ymkj_1900_40_007362109_0  622,080   1,251,739       2.0122
02 Sep 2011 08:54:58  1065107  13128089   hadcm3n_ymkj_1900_40_007362109_0  596,160   1,196,922       2.0077
28 Aug 2011 08:46:31  1065107  13128089   hadcm3n_ymkj_1900_40_007362109_0  570,240   1,141,918       2.0025
27 Aug 2011 10:15:59  1065107  13128089   hadcm3n_ymkj_1900_40_007362109_0  544,320   1,091,202       2.0047
25 Aug 2011 06:23:11  1065107  13128089   hadcm3n_ymkj_1900_40_007362109_0  518,400   1,040,481       2.0071
23 Aug 2011 03:53:59  1065107  13128089   hadcm3n_ymkj_1900_40_007362109_0  492,480   990,734         2.0117
21 Aug 2011 12:23:06  1065107  13128089   hadcm3n_ymkj_1900_40_007362109_0  466,560   942,679         2.0205
19 Aug 2011 08:13:03  1065107  13128089   hadcm3n_ymkj_1900_40_007362109_0  440,640   891,319         2.0228
13 Aug 2011 08:48:28  1065107  13128089   hadcm3n_ymkj_1900_40_007362109_0  414,720   838,124         2.0209
07 Aug 2011 11:47:13  1065107  13128089   hadcm3n_ymkj_1900_40_007362109_0  388,800   787,768         2.0262
06 Aug 2011 10:57:07  1065107  13128089   hadcm3n_ymkj_1900_40_007362109_0  362,880   736,119         2.0285
05 Aug 2011 11:26:09  1065107  13128089   hadcm3n_ymkj_1900_40_007362109_0  336,960   685,534         2.0345
31 Jul 2011 05:06:40  1065107  13128089   hadcm3n_ymkj_1900_40_007362109_0  311,040   629,413         2.0236
30 Jul 2011 13:14:50  1065107  13128089   hadcm3n_ymkj_1900_40_007362109_0  285,120   575,673         2.0191
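
The "Average (sec/TS)" column in the trickle table above appears to be the cumulative CPU time divided by the cumulative timestep count. The sketch below checks that reading against the three most recent rows; the function name `average_sec_per_timestep` is hypothetical, and rounding to four decimal places is an assumption based on how the values are displayed.

```python
def average_sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Cumulative CPU seconds per model timestep, rounded as displayed."""
    return round(cpu_time_sec / timestep, 4)

# (timestep, CPU time in seconds) from the three most recent trickles.
rows = [
    (777_600, 1_571_739),
    (751_680, 1_521_060),
    (725_760, 1_469_036),
]

for timestep, cpu_time in rows:
    print(timestep, average_sec_per_timestep(cpu_time, timestep))
# 777600 2.0213
# 751680 2.0235
# 725760 2.0241

# Timesteps between consecutive trickles in these rows.
gaps = [newer[0] - older[0] for newer, older in zip(rows, rows[1:])]
print(gaps)  # [25920, 25920]
```

The computed averages match the table, and the gap between these consecutive trickles is 25,920 timesteps in each case.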

