climateprediction.net home page
Task 16060642

Task 16060642

Name hadcm3n_o92x_1900_40_008466892_1
Workunit 8617731
Created 7 Oct 2013, 20:24:15 UTC
Sent 7 Oct 2013, 20:42:23 UTC
Report deadline 7 Jan 2014, 4:09:34 UTC
Received 12 Nov 2013, 1:19:06 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1296485
Run time 14 days 10 hours 56 min 11 sec
CPU time 13 days 9 hours 4 min 6 sec
Validate state Invalid
Credit 5,287.68
Device peak FLOPS 2.18 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.64</core_client_version>
<![CDATA[
<message>
Enheden genkender ikke kommandoen.
 (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4560, iMonCtr=1
Model crash detected, will try to restart...
09:49:55 (4468): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4156, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4136, iMonCtr=1
Model crash detected, will try to restart...
00:09:03 (4408): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3296, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5536, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3716, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
01:54:48 (4456): No heartbeat from core client for 30 sec - exiting
01:54:49 (4456): No heartbeat from core client for 30 sec - exiting
01:54:50 (4456): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:12:28 (4512): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:12:29 (4512): No heartbeat from core client for 30 sec - exiting
10:12:30 (4512): No heartbeat from core client for 30 sec - exiting
10:12:31 (4512): No heartbeat from core client for 30 sec - exiting
10:12:32 (4512): No heartbeat from core client for 30 sec - exiting
10:12:33 (4512): No heartbeat from core client for 30 sec - exiting
10:12:34 (4512): No heartbeat from core client for 30 sec - exiting
10:12:35 (4512): No heartbeat from core client for 30 sec - exiting
10:12:36 (4512): No heartbeat from core client for 30 sec - exiting
10:12:37 (4512): No heartbeat from core client for 30 sec - exiting
10:12:38 (4512): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4152, iMonCtr=1
Model crash detected, will try to restart...
12:18:58 (3740): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4148, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4148, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4148, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4148, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4148, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4148, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
09 Nov 2013 22:15:25 1296485 16060642 hadcm3n_o92x_1900_40_008466892_1 440,640 1,097,010 2.4896
08 Nov 2013 17:10:52 1296485 16060642 hadcm3n_o92x_1900_40_008466892_1 414,720 1,027,237 2.4769
07 Nov 2013 13:56:52 1296485 16060642 hadcm3n_o92x_1900_40_008466892_1 388,800 962,304 2.4751
06 Nov 2013 10:33:19 1296485 16060642 hadcm3n_o92x_1900_40_008466892_1 362,880 894,190 2.4641
03 Nov 2013 19:49:33 1296485 16060642 hadcm3n_o92x_1900_40_008466892_1 336,960 824,745 2.4476
02 Nov 2013 14:15:33 1296485 16060642 hadcm3n_o92x_1900_40_008466892_1 311,040 750,621 2.4133
31 Oct 2013 23:13:51 1296485 16060642 hadcm3n_o92x_1900_40_008466892_1 285,120 688,062 2.4132
30 Oct 2013 18:02:28 1296485 16060642 hadcm3n_o92x_1900_40_008466892_1 259,200 618,702 2.3870
29 Oct 2013 13:40:44 1296485 16060642 hadcm3n_o92x_1900_40_008466892_1 233,280 551,996 2.3662
28 Oct 2013 09:11:00 1296485 16060642 hadcm3n_o92x_1900_40_008466892_1 207,360 485,389 2.3408
26 Oct 2013 11:58:26 1296485 16060642 hadcm3n_o92x_1900_40_008466892_1 181,440 421,792 2.3247
25 Oct 2013 09:38:51 1296485 16060642 hadcm3n_o92x_1900_40_008466892_1 155,520 355,772 2.2876
20 Oct 2013 21:19:06 1296485 16060642 hadcm3n_o92x_1900_40_008466892_1 129,600 294,867 2.2752
19 Oct 2013 12:41:54 1296485 16060642 hadcm3n_o92x_1900_40_008466892_1 103,680 238,221 2.2977
15 Oct 2013 18:39:40 1296485 16060642 hadcm3n_o92x_1900_40_008466892_1 77,760 180,099 2.3161
13 Oct 2013 11:20:51 1296485 16060642 hadcm3n_o92x_1900_40_008466892_1 51,840 121,599 2.3457
12 Oct 2013 09:04:51 1296485 16060642 hadcm3n_o92x_1900_40_008466892_1 25,920 61,364 2.3674


©2024 climateprediction.net