Name | hadcm3n_n3q6_1920_40_008407142_4 |
Workunit | 8557998 |
Created | 18 Feb 2014, 12:01:07 UTC |
Sent | 18 Feb 2014, 12:03:08 UTC |
Report deadline | 20 May 2014, 19:30:19 UTC |
Received | 17 Apr 2014, 11:29:06 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 25 (0x00000019) Unknown error code |
Computer ID | 1190093 |
Run time | 9 days 21 hours 40 min 38 sec |
CPU time | 9 days 10 hours 58 min 39 sec |
Validate state | Invalid |
Credit | 5,909.76 |
Device peak FLOPS | 2.42 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.2.42</core_client_version> <![CDATA[ <message> Das Laufwerk kann einen bestimmten Bereich oder eine bestimmte Spur nicht finden. (0x19) - exit code 25 (0x19) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 09:11:06 (2872): No heartbeat from core client for 30 sec - exiting 09:11:07 (2872): No heartbeat from core client for 30 sec - exiting 09:11:08 (2872): No heartbeat from core client for 30 sec - exiting 09:11:10 (2872): No heartbeat from core client for 30 sec - exiting 09:11:11 (2872): No heartbeat from core client for 30 sec - exiting 09:11:12 (2872): No heartbeat from core client for 30 sec - exiting 09:11:13 (2872): No heartbeat from core client for 30 sec - exiting 09:11:14 (2872): No heartbeat from core client for 30 sec - exiting 09:11:15 (2872): No heartbeat from core client for 30 sec - exiting 09:11:16 (2872): No heartbeat from core client for 30 sec - exiting 09:11:17 (2872): No heartbeat from core client for 30 sec - exiting 09:11:18 (2872): No heartbeat from core client for 30 sec - exiting 09:11:19 (2872): No heartbeat from core client for 30 sec - exiting 09:11:20 (2872): No heartbeat from core client for 30 sec - exiting 09:11:21 (2872): No heartbeat from core client for 30 sec - exiting 09:11:23 (2872): No heartbeat from core client for 30 sec - exiting 09:11:24 (2872): No heartbeat from core client for 30 sec - exiting 09:11:25 (2872): No heartbeat from core client for 30 sec - exiting 09:11:26 (2872): No heartbeat from core client for 30 sec - exiting 09:11:27 (2872): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3100, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1612, iMonCtr=1 Model crash detected, will try to restart... 04:49:41 (3876): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... BUFFIN: C I/O Error feof - Unit 63 - Return code = 16 BUFFIN: C I/O Error feof - Unit 64 - Return code = 16 BUFFIN: C I/O Error feof - Unit 65 - Return code = 16 BUFFIN: C I/O Error feof - Unit 66 - Return code = 16 BUFFIN: C I/O Error feof - Unit 67 - Return code = 16 BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 Error converting file to netcdf: dataout/n3q6ko.pjc8c10 Error converting file to netcdf: dataout/n3q6ko.pic8c10 Error converting file to netcdf: dataout/n3q6ko.pfc8c10 Error converting file to netcdf: dataout/n3q6ka.phc8c10 Error converting file to netcdf: dataout/n3q6ka.pgc8c10 Error converting file to netcdf: dataout/n3q6ka.pec8c10 Error converting file to netcdf: dataout/n3q6ka.pdc8c10 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4880, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1020, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4180, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
16 Apr 2014 20:31:54 | 1190093 | 16291641 | hadcm3n_n3q6_1920_40_008407142_4 | 492,480 | 808,255 | 1.6412 |
15 Apr 2014 19:12:58 | 1190093 | 16291641 | hadcm3n_n3q6_1920_40_008407142_4 | 466,560 | 763,806 | 1.6371 |
03 Apr 2014 04:50:00 | 1190093 | 16291641 | hadcm3n_n3q6_1920_40_008407142_4 | 440,640 | 718,385 | 1.6303 |
30 Mar 2014 11:57:28 | 1190093 | 16291641 | hadcm3n_n3q6_1920_40_008407142_4 | 414,720 | 675,765 | 1.6294 |
29 Mar 2014 19:14:30 | 1190093 | 16291641 | hadcm3n_n3q6_1920_40_008407142_4 | 388,800 | 633,502 | 1.6294 |
28 Mar 2014 21:04:46 | 1190093 | 16291641 | hadcm3n_n3q6_1920_40_008407142_4 | 362,880 | 590,731 | 1.6279 |
24 Mar 2014 00:51:39 | 1190093 | 16291641 | hadcm3n_n3q6_1920_40_008407142_4 | 336,960 | 547,962 | 1.6262 |
23 Mar 2014 15:04:50 | 1190093 | 16291641 | hadcm3n_n3q6_1920_40_008407142_4 | 311,040 | 504,066 | 1.6206 |
21 Mar 2014 20:17:41 | 1190093 | 16291641 | hadcm3n_n3q6_1920_40_008407142_4 | 285,120 | 458,241 | 1.6072 |
20 Mar 2014 14:52:01 | 1190093 | 16291641 | hadcm3n_n3q6_1920_40_008407142_4 | 259,200 | 411,371 | 1.5871 |
18 Mar 2014 19:24:49 | 1190093 | 16291641 | hadcm3n_n3q6_1920_40_008407142_4 | 233,280 | 369,955 | 1.5859 |
28 Feb 2014 18:57:00 | 1190093 | 16291641 | hadcm3n_n3q6_1920_40_008407142_4 | 207,360 | 328,075 | 1.5822 |
27 Feb 2014 13:10:38 | 1190093 | 16291641 | hadcm3n_n3q6_1920_40_008407142_4 | 181,440 | 286,729 | 1.5803 |
25 Feb 2014 19:17:11 | 1190093 | 16291641 | hadcm3n_n3q6_1920_40_008407142_4 | 155,520 | 244,367 | 1.5713 |
24 Feb 2014 19:37:30 | 1190093 | 16291641 | hadcm3n_n3q6_1920_40_008407142_4 | 129,600 | 202,609 | 1.5633 |
23 Feb 2014 13:26:22 | 1190093 | 16291641 | hadcm3n_n3q6_1920_40_008407142_4 | 103,680 | 162,440 | 1.5667 |
22 Feb 2014 17:03:04 | 1190093 | 16291641 | hadcm3n_n3q6_1920_40_008407142_4 | 77,760 | 121,679 | 1.5648 |
21 Feb 2014 21:41:03 | 1190093 | 16291641 | hadcm3n_n3q6_1920_40_008407142_4 | 51,840 | 81,549 | 1.5731 |
20 Feb 2014 23:36:14 | 1190093 | 16291641 | hadcm3n_n3q6_1920_40_008407142_4 | 25,920 | 40,901 | 1.5780 |
©2024 cpdn.org