climateprediction.net home page
Task 15597587

Task 15597587

Name hadcm3n_4dq6_1940_40_008310314_0
Workunit 8461449
Created 8 Feb 2013, 0:25:06 UTC
Sent 8 Feb 2013, 0:30:27 UTC
Report deadline 10 May 2013, 7:57:38 UTC
Received 2 Mar 2013, 3:59:54 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1266192
Run time 6 days 7 hours 17 min 21 sec
CPU time 6 days 6 hours 18 min 32 sec
Validate state Invalid
Credit 6,842.88
Device peak FLOPS 4.41 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
17:59:40 (2436): Can't acquire lockfile (32) - waiting 35s
17:59:54 (4284): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2752, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2752, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2752, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4492, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5072, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5072, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
26 Feb 2013 11:26:07 1266192 15597587 hadcm3n_4dq6_1940_40_008310314_0 570,240 536,717 0.9412
25 Feb 2013 21:56:25 1266192 15597587 hadcm3n_4dq6_1940_40_008310314_0 544,320 514,438 0.9451
25 Feb 2013 16:17:04 1266192 15597587 hadcm3n_4dq6_1940_40_008310314_0 518,400 492,678 0.9504
25 Feb 2013 09:40:03 1266192 15597587 hadcm3n_4dq6_1940_40_008310314_0 492,480 472,142 0.9587
25 Feb 2013 03:17:49 1266192 15597587 hadcm3n_4dq6_1940_40_008310314_0 466,560 450,317 0.9652
20 Feb 2013 18:21:42 1266192 15597587 hadcm3n_4dq6_1940_40_008310314_0 440,640 424,874 0.9642
20 Feb 2013 11:03:31 1266192 15597587 hadcm3n_4dq6_1940_40_008310314_0 414,720 398,917 0.9619
18 Feb 2013 23:26:07 1266192 15597587 hadcm3n_4dq6_1940_40_008310314_0 388,800 372,886 0.9591
18 Feb 2013 16:10:48 1266192 15597587 hadcm3n_4dq6_1940_40_008310314_0 362,880 346,154 0.9539
18 Feb 2013 08:38:17 1266192 15597587 hadcm3n_4dq6_1940_40_008310314_0 336,960 319,521 0.9482
17 Feb 2013 13:55:19 1266192 15597587 hadcm3n_4dq6_1940_40_008310314_0 311,040 292,829 0.9415
17 Feb 2013 06:36:32 1266192 15597587 hadcm3n_4dq6_1940_40_008310314_0 285,120 266,186 0.9336
14 Feb 2013 20:53:31 1266192 15597587 hadcm3n_4dq6_1940_40_008310314_0 259,200 239,427 0.9237
13 Feb 2013 23:03:25 1266192 15597587 hadcm3n_4dq6_1940_40_008310314_0 233,280 212,829 0.9123
13 Feb 2013 16:50:54 1266192 15597587 hadcm3n_4dq6_1940_40_008310314_0 207,360 190,773 0.9200
13 Feb 2013 10:06:37 1266192 15597587 hadcm3n_4dq6_1940_40_008310314_0 181,440 167,921 0.9255
10 Feb 2013 16:41:05 1266192 15597587 hadcm3n_4dq6_1940_40_008310314_0 155,520 143,530 0.9229
10 Feb 2013 09:12:15 1266192 15597587 hadcm3n_4dq6_1940_40_008310314_0 129,600 116,734 0.9007
09 Feb 2013 19:14:53 1266192 15597587 hadcm3n_4dq6_1940_40_008310314_0 103,680 89,420 0.8625
08 Feb 2013 21:28:12 1266192 15597587 hadcm3n_4dq6_1940_40_008310314_0 77,760 66,131 0.8505


©2024 cpdn.org