climateprediction.net home page
Task 14105601

Task 14105601

Name hadcm3n_o274_1980_40_007753851_1
Workunit 7908960
Created 17 Feb 2012, 13:57:03 UTC
Sent 17 Feb 2012, 13:57:06 UTC
Report deadline 18 May 2012, 21:24:17 UTC
Received 5 Mar 2012, 10:29:12 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1105487
Run time 2 days 23 hours 41 min 5 sec
CPU time 2 days 16 hours 27 min 24 sec
Validate state Invalid
Credit 1,555.20
Device peak FLOPS 3.04 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.12.34</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
22:46:27 (4228): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:46:28 (4228): No heartbeat from core client for 30 sec - exiting
22:46:29 (4228): No heartbeat from core client for 30 sec - exiting
22:46:30 (4228): No heartbeat from core client for 30 sec - exiting
22:46:31 (4228): No heartbeat from core client for 30 sec - exiting
BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Error converting file to netcdf: dataout/o274ko.pji4c10
Error converting file to netcdf: dataout/o274ko.pii4c10
Error converting file to netcdf: dataout/o274ko.pfi4c10
Error converting file to netcdf: dataout/o274ka.phi4c10
Error converting file to netcdf: dataout/o274ka.pgi4c10
Error converting file to netcdf: dataout/o274ka.pei4c10
Error converting file to netcdf: dataout/o274ka.pdi4c10
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
15:01:09 (4400): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
23:48:18 (4184): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:48:20 (4184): No heartbeat from core client for 30 sec - exiting
23:48:21 (4184): No heartbeat from core client for 30 sec - exiting
23:48:22 (4184): No heartbeat from core client for 30 sec - exiting
23:48:23 (4184): No heartbeat from core client for 30 sec - exiting
23:48:25 (4184): No heartbeat from core client for 30 sec - exiting
23:48:26 (4184): No heartbeat from core client for 30 sec - exiting
23:48:27 (4184): No heartbeat from core client for 30 sec - exiting
23:48:28 (4184): No heartbeat from core client for 30 sec - exiting
23:48:29 (4184): No heartbeat from core client for 30 sec - exiting
23:48:30 (4184): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3724, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3724, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3724, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3724, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3968, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3968, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
04 Mar 2012 23:37:46 1105487 14105601 hadcm3n_o274_1980_40_007753851_1 129,600 223,892 1.7276
03 Mar 2012 23:33:34 1105487 14105601 hadcm3n_o274_1980_40_007753851_1 103,680 180,806 1.7439
02 Mar 2012 23:49:07 1105487 14105601 hadcm3n_o274_1980_40_007753851_1 77,760 137,069 1.7627
02 Mar 2012 10:39:31 1105487 14105601 hadcm3n_o274_1980_40_007753851_1 51,840 91,495 1.7649
01 Mar 2012 14:38:45 1105487 14105601 hadcm3n_o274_1980_40_007753851_1 25,920 45,751 1.7651


©2024 cpdn.org