climateprediction.net home page
Task 15653326

Task 15653326

Name hadcm3n_zamu_1880_40_008250771_4
Workunit 8405895
Created 7 Mar 2013, 17:22:58 UTC
Sent 7 Mar 2013, 17:23:05 UTC
Report deadline 7 Jun 2013, 0:50:16 UTC
Received 17 Mar 2013, 7:18:14 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1272565
Run time 6 days 7 hours 48 min 5 sec
CPU time 1 days 18 hours 18 min 33 sec
Validate state Invalid
Credit 933.12
Device peak FLOPS 1.46 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
i686-pc-linux-gnu
Stderr
<core_client_version>7.0.27</core_client_version>
<![CDATA[
<message>
process exited with code 22 (0x16, -234)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
08:18:02 (5368): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
10:08:56 (8656): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:08:02 (8661): No heartbeat from core client for 30 sec - exiting
12:08:05 (8661): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 63 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 64 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 65 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 66 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 67 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 1
Error: Input file: dataout/zamuko.pj81c10 is not a valid UM file.
Error converting file to netcdf: dataout/zamuko.pj81c10
Error: Input file: dataout/zamuko.pi81c10 is not a valid UM file.
Error converting file to netcdf: dataout/zamuko.pi81c10
Error: Input file: dataout/zamuko.pf81c10 is not a valid UM file.
Error converting file to netcdf: dataout/zamuko.pf81c10
Error: Input file: dataout/zamuka.ph81c10 is not a valid UM file.
Error converting file to netcdf: dataout/zamuka.ph81c10
Error: Input file: dataout/zamuka.pg81c10 is not a valid UM file.
Error converting file to netcdf: dataout/zamuka.pg81c10
Error: Input file: dataout/zamuka.pe81c10 is not a valid UM file.
Error converting file to netcdf: dataout/zamuka.pe81c10
Error: Input file: dataout/zamuka.pd81c10 is not a valid UM file.
Error converting file to netcdf: dataout/zamuka.pd81c10
07:59:55 (9428): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:00:01 (9428): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9927, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9927, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9927, iMonCtr=1
Model crash detected, will try to restart...
07:44:32 (9927): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:59:17 (11780): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:38:29 (11848): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11883, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11883, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11883, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11883, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11883, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11883, iMonCtr=1
Model crash detected, will try to restart...
08:14:23 (11883): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=12453, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
15 Mar 2013 23:34:45 1272565 15653326 hadcm3n_zamu_1880_40_008250771_4 77,760 184,288 2.3700
14 Mar 2013 04:18:34 1272565 15653326 hadcm3n_zamu_1880_40_008250771_4 51,840 198,267 3.8246
12 Mar 2013 11:12:13 1272565 15653326 hadcm3n_zamu_1880_40_008250771_4 25,920 99,245 3.8289


©2024 cpdn.org