Task 13014724

Name hadcm3n_t2nz_1940_40_007311826_0
Workunit 7509256
Created 28 Jun 2011, 0:47:31 UTC
Sent 28 Jun 2011, 0:47:36 UTC
Report deadline 27 Sep 2011, 8:14:47 UTC
Received 8 Jul 2011, 17:37:58 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 917641
Run time 9 days 23 hours 13 min 41 sec
CPU time 9 days 12 hours 8 min 37 sec
Validate state Invalid
Credit 8,398.08
Device peak FLOPS 3.82 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
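
The Run time and CPU time fields above are given as day/hour/minute/second strings, which makes them awkward to compare directly. A minimal sketch, assuming only the field format shown on this page, that converts both to seconds and reports what share of the wall-clock run time was spent on the CPU. The helper name `duration_to_seconds` is hypothetical and not part of BOINC or CPDN.

```python
import re

# Seconds per unit, keyed by the unit words used on this page.
_UNIT_SECONDS = {"days": 86400, "hours": 3600, "min": 60, "sec": 1}


def duration_to_seconds(text: str) -> int:
    """Convert a string like '9 days 23 hours 13 min 41 sec' to seconds."""
    total = 0
    for amount, unit in re.findall(r"(\d+)\s+(days|hours|min|sec)", text):
        total += int(amount) * _UNIT_SECONDS[unit]
    return total


# Values copied from the Run time and CPU time fields of this task.
run_time = duration_to_seconds("9 days 23 hours 13 min 41 sec")
cpu_time = duration_to_seconds("9 days 12 hours 8 min 37 sec")

print(run_time, cpu_time)
print(f"CPU share of run time: {cpu_time / run_time:.1%}")
```

For this task the two fields work out to 861,221 and 821,317 seconds, so the model was on the CPU for most of the time it was running.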
Stderr
<core_client_version>6.10.60</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
19:02:49 (2664): No heartbeat from core client for 30 sec - exiting
19:02:50 (2664): No heartbeat from core client for 30 sec - exiting
19:02:51 (2664): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3768, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3768, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4868, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4868, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4868, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4868, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3684, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3684, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3684, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3684, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3684, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3684, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
08 Jul 2011 07:28:25  917641   13014724   hadcm3n_t2nz_1940_40_007311826_0  699,840   793,017         1.1331
07 Jul 2011 22:50:14  917641   13014724   hadcm3n_t2nz_1940_40_007311826_0  673,920   762,921         1.1321
07 Jul 2011 17:56:19  917641   13014724   hadcm3n_t2nz_1940_40_007311826_0  648,000   734,249         1.1331
07 Jul 2011 15:42:01  917641   13014724   hadcm3n_t2nz_1940_40_007311826_0  622,080   704,319         1.1322
07 Jul 2011 15:42:01  917641   13014724   hadcm3n_t2nz_1940_40_007311826_0  596,160   674,787         1.1319
07 Jul 2011 15:42:01  917641   13014724   hadcm3n_t2nz_1940_40_007311826_0  570,240   646,684         1.1341
07 Jul 2011 15:42:01  917641   13014724   hadcm3n_t2nz_1940_40_007311826_0  544,320   617,199         1.1339
05 Jul 2011 19:40:48  917641   13014724   hadcm3n_t2nz_1940_40_007311826_0  518,400   586,760         1.1319
05 Jul 2011 10:23:46  917641   13014724   hadcm3n_t2nz_1940_40_007311826_0  492,480   558,452         1.1340
05 Jul 2011 01:45:49  917641   13014724   hadcm3n_t2nz_1940_40_007311826_0  466,560   529,027         1.1339
04 Jul 2011 17:23:37  917641   13014724   hadcm3n_t2nz_1940_40_007311826_0  440,640   498,907         1.1322
04 Jul 2011 09:33:47  917641   13014724   hadcm3n_t2nz_1940_40_007311826_0  414,720   470,689         1.1350
03 Jul 2011 23:46:56  917641   13014724   hadcm3n_t2nz_1940_40_007311826_0  388,800   440,470         1.1329
03 Jul 2011 14:32:33  917641   13014724   hadcm3n_t2nz_1940_40_007311826_0  362,880   411,236         1.1333
03 Jul 2011 06:27:22  917641   13014724   hadcm3n_t2nz_1940_40_007311826_0  336,960   383,510         1.1381
02 Jul 2011 21:48:56  917641   13014724   hadcm3n_t2nz_1940_40_007311826_0  311,040   353,164         1.1354
02 Jul 2011 13:53:56  917641   13014724   hadcm3n_t2nz_1940_40_007311826_0  285,120   324,850         1.1393
02 Jul 2011 05:18:39  917641   13014724   hadcm3n_t2nz_1940_40_007311826_0  259,200   297,199         1.1466
01 Jul 2011 21:08:08  917641   13014724   hadcm3n_t2nz_1940_40_007311826_0  233,280   268,670         1.1517
01 Jul 2011 12:40:16  917641   13014724   hadcm3n_t2nz_1940_40_007311826_0  207,360   239,305         1.1541
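
The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the timestep count. A minimal sketch that checks this on the newest and oldest rows, assuming the last three whitespace-separated fields of each row are timestep, CPU time, and average, as in the header. The `Trickle` class and `parse_trickle` helper are hypothetical names used only for illustration.

```python
from dataclasses import dataclass


@dataclass
class Trickle:
    timestep: int        # model timesteps completed so far
    cpu_seconds: int     # cumulative CPU time in seconds
    reported_avg: float  # Average (sec/TS) as shown in the table


def parse_trickle(line: str) -> Trickle:
    """Read the last three fields of a trickle row, dropping thousands separators."""
    *_, timestep, cpu, avg = line.split()
    return Trickle(
        int(timestep.replace(",", "")),
        int(cpu.replace(",", "")),
        float(avg),
    )


# Newest and oldest rows copied from the table above.
rows = [
    "08 Jul 2011 07:28:25 917641 13014724 hadcm3n_t2nz_1940_40_007311826_0 699,840 793,017 1.1331",
    "01 Jul 2011 12:40:16 917641 13014724 hadcm3n_t2nz_1940_40_007311826_0 207,360 239,305 1.1541",
]
trickles = [parse_trickle(r) for r in rows]

for t in trickles:
    computed = t.cpu_seconds / t.timestep
    print(f"{t.timestep:>8} steps: {computed:.4f} sec/TS (reported {t.reported_avg})")
```

Both rows reproduce the reported value to four decimal places, which is consistent with the column being a running average rather than a per-interval figure.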