Task 15912843

Name hadcm3n_o4u0_2140_40_008269103_4
Workunit 8424227
Created 14 Aug 2013, 11:34:51 UTC
Sent 14 Aug 2013, 15:22:04 UTC
Report deadline 13 Nov 2013, 22:49:15 UTC
Received 22 Aug 2013, 2:04:01 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1266353
Run time 6 days 10 hours 46 min 38 sec
CPU time 6 days 4 hours 3 min 13 sec
Validate state Invalid
Credit 6,220.80
Device peak FLOPS 3.57 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr
<core_client_version>7.2.11</core_client_version>
<![CDATA[
<message>
The device does not recognize the command.
 (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
22:22:54 (16016): start_timer_thread(): CreateThread() failed, errno 0
09:00:13 (4828): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
MainError:	06:46:53 AM	No files match the supplied pattern.
MainError:	06:46:53 AM	No files match the supplied pattern.
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
04:09:29 (6448): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:09:30 (6448): No heartbeat from core client for 30 sec - exiting
04:09:31 (6448): No heartbeat from core client for 30 sec - exiting
04:09:32 (6448): No heartbeat from core client for 30 sec - exiting
04:10:15 (52208): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
04:14:20 (31508): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:14:21 (31508): No heartbeat from core client for 30 sec - exiting
04:14:22 (31508): No heartbeat from core client for 30 sec - exiting
04:14:23 (31508): No heartbeat from core client for 30 sec - exiting
04:14:24 (31508): No heartbeat from core client for 30 sec - exiting
04:14:25 (31508): No heartbeat from core client for 30 sec - exiting
04:14:26 (31508): No heartbeat from core client for 30 sec - exiting
04:14:27 (31508): No heartbeat from core client for 30 sec - exiting
04:14:28 (31508): No heartbeat from core client for 30 sec - exiting
04:14:29 (31508): No heartbeat from core client for 30 sec - exiting
04:14:30 (31508): No heartbeat from core client for 30 sec - exiting
04:15:06 (5584): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:15:12 (5584): No heartbeat from core client for 30 sec - exiting
04:15:13 (5584): No heartbeat from core client for 30 sec - exiting
04:15:14 (5584): No heartbeat from core client for 30 sec - exiting
04:15:15 (5584): No heartbeat from core client for 30 sec - exiting
04:15:16 (5584): No heartbeat from core client for 30 sec - exiting
04:15:17 (5584): No heartbeat from core client for 30 sec - exiting
04:15:18 (5584): No heartbeat from core client for 30 sec - exiting
04:15:19 (5584): No heartbeat from core client for 30 sec - exiting
04:15:20 (5584): No heartbeat from core client for 30 sec - exiting
04:15:21 (5584): No heartbeat from core client for 30 sec - exiting
04:16:00 (9812): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:17:01 (48676): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:18:18 (51588): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:18:24 (51588): No heartbeat from core client for 30 sec - exiting
04:18:25 (51588): No heartbeat from core client for 30 sec - exiting
04:19:59 (6528): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:20:04 (6528): No heartbeat from core client for 30 sec - exiting
04:20:06 (6528): No heartbeat from core client for 30 sec - exiting
04:20:07 (6528): No heartbeat from core client for 30 sec - exiting
04:20:08 (6528): No heartbeat from core client for 30 sec - exiting
04:20:09 (6528): No heartbeat from core client for 30 sec - exiting
04:20:10 (6528): No heartbeat from core client for 30 sec - exiting
04:20:11 (6528): No heartbeat from core client for 30 sec - exiting
04:22:01 (42260): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:27:36 (6500): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:36:56 (15840): start_timer_thread(): CreateThread() failed, errno 0
04:37:30 (38668): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:37:51 (51612): start_timer_thread(): CreateThread() failed, errno 0
04:38:24 (48620): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:39:18 (48576): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:40:17 (48500): start_timer_thread(): CreateThread() failed, errno 0
04:40:51 (50744): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:41:12 (47340): start_timer_thread(): CreateThread() failed, errno 0
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=51948, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=51948, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=51948, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=49924, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=49924, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=49924, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS)
21 Aug 2013 06:51:18 | 1266353 | 15912843 | hadcm3n_o4u0_2140_40_008269103_4 | 518,400 | 525,564 | 1.0138
20 Aug 2013 23:23:16 | 1266353 | 15912843 | hadcm3n_o4u0_2140_40_008269103_4 | 492,480 | 498,445 | 1.0121
20 Aug 2013 15:40:04 | 1266353 | 15912843 | hadcm3n_o4u0_2140_40_008269103_4 | 466,560 | 471,272 | 1.0101
20 Aug 2013 08:02:59 | 1266353 | 15912843 | hadcm3n_o4u0_2140_40_008269103_4 | 440,640 | 444,137 | 1.0079
20 Aug 2013 00:24:37 | 1266353 | 15912843 | hadcm3n_o4u0_2140_40_008269103_4 | 414,720 | 416,983 | 1.0055
19 Aug 2013 16:41:26 | 1266353 | 15912843 | hadcm3n_o4u0_2140_40_008269103_4 | 388,800 | 389,775 | 1.0025
19 Aug 2013 08:59:45 | 1266353 | 15912843 | hadcm3n_o4u0_2140_40_008269103_4 | 362,880 | 362,606 | 0.9992
19 Aug 2013 00:17:49 | 1266353 | 15912843 | hadcm3n_o4u0_2140_40_008269103_4 | 336,960 | 335,406 | 0.9954
18 Aug 2013 16:31:20 | 1266353 | 15912843 | hadcm3n_o4u0_2140_40_008269103_4 | 311,040 | 308,276 | 0.9911
18 Aug 2013 08:44:57 | 1266353 | 15912843 | hadcm3n_o4u0_2140_40_008269103_4 | 285,120 | 281,137 | 0.9860
18 Aug 2013 00:57:53 | 1266353 | 15912843 | hadcm3n_o4u0_2140_40_008269103_4 | 259,200 | 253,978 | 0.9799
17 Aug 2013 17:11:21 | 1266353 | 15912843 | hadcm3n_o4u0_2140_40_008269103_4 | 233,280 | 226,819 | 0.9723
17 Aug 2013 09:24:44 | 1266353 | 15912843 | hadcm3n_o4u0_2140_40_008269103_4 | 207,360 | 199,790 | 0.9635
17 Aug 2013 01:28:24 | 1266353 | 15912843 | hadcm3n_o4u0_2140_40_008269103_4 | 181,440 | 172,662 | 0.9516
16 Aug 2013 17:31:26 | 1266353 | 15912843 | hadcm3n_o4u0_2140_40_008269103_4 | 155,520 | 145,880 | 0.9380
16 Aug 2013 07:34:19 | 1266353 | 15912843 | hadcm3n_o4u0_2140_40_008269103_4 | 129,600 | 135,175 | 1.0430
15 Aug 2013 23:47:09 | 1266353 | 15912843 | hadcm3n_o4u0_2140_40_008269103_4 | 103,680 | 107,885 | 1.0406
15 Aug 2013 15:53:23 | 1266353 | 15912843 | hadcm3n_o4u0_2140_40_008269103_4 | 77,760 | 80,809 | 1.0392
15 Aug 2013 07:55:22 | 1266353 | 15912843 | hadcm3n_o4u0_2140_40_008269103_4 | 51,840 | 53,740 | 1.0367
15 Aug 2013 00:07:28 | 1266353 | 15912843 | hadcm3n_o4u0_2140_40_008269103_4 | 25,920 | 26,518 | 1.0231
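
The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the cumulative timestep count. The sketch below checks that reading against a few rows copied from the table. The function name `average_sec_per_timestep` and the choice of sample rows are illustrative assumptions, not part of the CPDN site or the BOINC client.

```python
# Sample rows copied from the trickle table:
# (timestep, cpu_time_sec, reported_average_sec_per_ts)
rows = [
    (518_400, 525_564, 1.0138),
    (362_880, 362_606, 0.9992),
    (155_520, 145_880, 0.9380),
    (25_920, 26_518, 1.0231),
]


def average_sec_per_timestep(cpu_time_sec, timestep):
    """Cumulative CPU seconds per model timestep (assumed definition)."""
    return cpu_time_sec / timestep


for timestep, cpu_time_sec, reported in rows:
    computed = average_sec_per_timestep(cpu_time_sec, timestep)
    # Print computed vs. reported values side by side, to four decimals.
    print(f"{timestep:>7}  computed={computed:.4f}  reported={reported:.4f}")
```

For the sampled rows the computed ratio agrees with the reported value to four decimal places, which is consistent with the assumed definition but does not confirm how the server actually derives the column.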


©2024 cpdn.org