climateprediction.net home page
Task 13090177

Task 13090177

Name hadcm3n_y7y2_1900_40_007343156_0
Workunit 7540586
Created 6 Jul 2011, 13:16:18 UTC
Sent 23 Jul 2011, 6:58:12 UTC
Report deadline 22 Oct 2011, 14:25:23 UTC
Received 29 Aug 2011, 15:24:49 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 193 (0x000000C1) EXIT_SIGNAL
Computer ID 1107625
Run time 17 days 17 hours 30 min 45 sec
CPU time 17 days 13 hours 22 min 2 sec
Validate state Invalid
Credit 3,110.40
Device peak FLOPS 2.43 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.12.33</core_client_version>
<![CDATA[
<message>
 - exit code 193 (0xc1)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
09:20:51 (2768): No heartbeat from core client for 30 sec - exiting
09:20:52 (2768): No heartbeat from core client for 30 sec - exiting
09:20:53 (2768): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
No Process Handle
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7112, selfPID=7112, iMonCtr=1
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Error converting file to netcdf: dataout/y7y2ko.pja2c10
Error converting file to netcdf: dataout/y7y2ko.pia2c10
Error converting file to netcdf: dataout/y7y2ko.pfa2c10
Error converting file to netcdf: dataout/y7y2ka.pha2c10
Error converting file to netcdf: dataout/y7y2ka.pga2c10
Error converting file to netcdf: dataout/y7y2ka.pea2c10
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
13:44:50 (5280): Can't acquire lockfile (32) - waiting 35s
13:45:00 (2880): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
09:00:38 (3768): No heartbeat from core client for 30 sec - exiting
09:00:39 (3832): Can't acquire lockfile (32) - waiting 35s
CPDN Monitor - No 'heartbeat' from BOINC...
09:00:40 (3768): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5436, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
00:38:13 (4432): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
10:11:31 (2560): Can't acquire lockfile (32) - waiting 35s
10:11:41 (5436): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:11:42 (5436): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
16:22:18 (4360): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
20:14:18 (6764): Can't acquire lockfile (32) - waiting 35s
20:14:31 (6028): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4452, iMonCtr=1
Model crash detected, will try to restart...
16:38:17 (4840): No heartbeat from core client for 30 sec - exiting
16:38:18 (4840): No heartbeat from core client for 30 sec - exiting
16:38:19 (4840): No heartbeat from core client for 30 sec - exiting
16:38:20 (4840): No heartbeat from core client for 30 sec - exiting
16:38:21 (4840): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:38:23 (4840): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
17:07:03 (5532): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
08:24:40 (5256): No heartbeat from core client for 30 sec - exiting
08:24:41 (5256): No heartbeat from core client for 30 sec - exiting
08:24:42 (5256): No heartbeat from core client for 30 sec - exiting
08:24:43 (5256): No heartbeat from core client for 30 sec - exiting
08:24:44 (5256): No heartbeat from core client for 30 sec - exiting
08:24:45 (5256): No heartbeat from core client for 30 sec - exiting
08:24:46 (5256): No heartbeat from core client for 30 sec - exiting
08:24:47 (5256): No heartbeat from core client for 30 sec - exiting
08:24:48 (5256): No heartbeat from core client for 30 sec - exiting
08:24:49 (5256): No heartbeat from core client for 30 sec - exiting
08:24:50 (5256): No heartbeat from core client for 30 sec - exiting
08:24:51 (5256): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3236, iMonCtr=1
Model crash detected, will try to restart...
09:48:16 (4884): No heartbeat from core client for 30 sec - exiting
09:48:17 (4884): No heartbeat from core client for 30 sec - exiting
09:48:18 (4884): No heartbeat from core client for 30 sec - exiting
09:48:19 (4884): No heartbeat from core client for 30 sec - exiting
09:48:20 (4884): No heartbeat from core client for 30 sec - exiting
09:48:21 (4884): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16
BUFFIN: C I/O Error feof - Unit 62 - Return code = 16
BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Error converting file to netcdf: dataout/y7y2ko.pjb0c10
Error converting file to netcdf: dataout/y7y2ko.pib0c10
Error converting file to netcdf: dataout/y7y2ko.pfb0c10
Error converting file to netcdf: dataout/y7y2ko.pcb0c10
Error converting file to netcdf: dataout/y7y2ko.pbb0c10
Error converting file to netcdf: dataout/y7y2ko.pab0c10
Error converting file to netcdf: dataout/y7y2ka.phb0c10
Error converting file to netcdf: dataout/y7y2ka.pgb0c10
Error converting file to netcdf: dataout/y7y2ka.peb0c10
Error converting file to netcdf: dataout/y7y2ka.pdb0c10
Signal 11 received, exiting...
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
29 Aug 2011 13:20:48 1107625 13090177 hadcm3n_y7y2_1900_40_007343156_0 259,200 1,516,914 5.8523
27 Aug 2011 19:32:37 1107625 13090177 hadcm3n_y7y2_1900_40_007343156_0 233,280 1,377,737 5.9059
22 Aug 2011 06:26:46 1107625 13090177 hadcm3n_y7y2_1900_40_007343156_0 207,360 1,188,071 5.7295
16 Aug 2011 06:40:54 1107625 13090177 hadcm3n_y7y2_1900_40_007343156_0 181,440 1,023,617 5.6416
13 Aug 2011 18:02:23 1107625 13090177 hadcm3n_y7y2_1900_40_007343156_0 155,520 874,798 5.6250
11 Aug 2011 06:26:48 1107625 13090177 hadcm3n_y7y2_1900_40_007343156_0 129,600 714,896 5.5162
05 Aug 2011 06:50:24 1107625 13090177 hadcm3n_y7y2_1900_40_007343156_0 103,680 557,710 5.3791
01 Aug 2011 07:07:59 1107625 13090177 hadcm3n_y7y2_1900_40_007343156_0 77,760 456,418 5.8696
29 Jul 2011 07:25:12 1107625 13090177 hadcm3n_y7y2_1900_40_007343156_0 51,840 298,303 5.7543
25 Jul 2011 23:00:04 1107625 13090177 hadcm3n_y7y2_1900_40_007343156_0 25,920 193,374 7.4604


©2024 cpdn.org