Name | hadcm3n_o5nv_2140_40_008270445_2 |
Workunit | 8425569 |
Created | 31 Dec 2012, 10:04:16 UTC |
Sent | 31 Dec 2012, 10:04:36 UTC |
Report deadline | 1 Apr 2013, 17:31:47 UTC |
Received | 16 Feb 2013, 12:31:04 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 429009 |
Run time | 30 days 10 hours 20 min 37 sec |
CPU time | 26 days 20 hours 53 min 58 sec |
Validate state | Invalid |
Credit | 9,331.20 |
Device peak FLOPS | 1.93 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-pc-linux-gnu |
Stderr | <core_client_version>7.0.27</core_client_version> <![CDATA[ <message> process exited with code 22 (0x16, -234) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 10:19:15 (5887): No heartbeat from core client for 30 sec - exiting 10:19:23 (5887): No heartbeat from core client for 30 sec - exiting 10:19:24 (5887): No heartbeat from core client for 30 sec - exiting 10:19:25 (5887): No heartbeat from core client for 30 sec - exiting 10:19:26 (5887): No heartbeat from core client for 30 sec - exiting 10:19:27 (5887): No heartbeat from core client for 30 sec - exiting 10:19:30 (5887): No heartbeat from core client for 30 sec - exiting 10:19:31 (5887): No heartbeat from core client for 30 sec - exiting 10:19:32 (5887): No heartbeat from core client for 30 sec - exiting 10:19:33 (5887): No heartbeat from core client for 30 sec - exiting 10:19:34 (5887): No heartbeat from core client for 30 sec - exiting 10:19:35 (5887): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:19:36 (5887): No heartbeat from core client for 30 sec - exiting 10:19:37 (5887): No heartbeat from core client for 30 sec - exiting 10:19:38 (5887): No heartbeat from core client for 30 sec - exiting 10:19:39 (5887): No heartbeat from core client for 30 sec - exiting 10:19:40 (5887): No heartbeat from core client for 30 sec - exiting 10:19:41 (5887): No heartbeat from core client for 30 sec - exiting 18:02:46 (6938): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:32:00 (15352): No heartbeat from core client for 30 sec - exiting 07:32:04 (15352): No heartbeat from core client for 30 sec - exiting 07:32:05 (15352): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:32:06 (15352): No heartbeat from core client for 30 sec - exiting 07:32:23 (15352): No heartbeat from core client for 30 sec - exiting 07:32:24 (15352): No heartbeat from core client for 30 sec - exiting 07:32:25 (15352): No heartbeat from core client for 30 sec - exiting 07:32:26 (15352): No heartbeat from core client for 30 sec - exiting 07:32:27 (15352): No heartbeat from core client for 30 sec - exiting 07:32:28 (15352): No heartbeat from core client for 30 sec - exiting 07:32:31 (15352): No heartbeat from core client for 30 sec - exiting 07:32:32 (15352): No heartbeat from core client for 30 sec - exiting 07:32:33 (15352): No heartbeat from core client for 30 sec - exiting 07:32:34 (15352): No heartbeat from core client for 30 sec - exiting 07:33:42 (23360): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:32:47 (15690): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:32:52 (15690): No heartbeat from core client for 30 sec - exiting MainError: 11:25:30 AM No files match the supplied pattern. MainError: 11:25:30 AM No files match the supplied pattern. MainError: 05:42:12 PM No files match the supplied pattern. MainError: 05:42:12 PM No files match the supplied pattern. 13:08:30 (27128): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... MainError: 08:53:59 AM No files match the supplied pattern. MainError: 08:53:59 AM No files match the supplied pattern. 07:11:29 (20631): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:11:59 (20631): No heartbeat from core client for 30 sec - exiting 07:12:07 (20631): No heartbeat from core client for 30 sec - exiting 07:12:08 (20631): No heartbeat from core client for 30 sec - exiting 07:12:09 (20631): No heartbeat from core client for 30 sec - exiting MainError: 03:19:41 PM No files match the supplied pattern. MainError: 03:19:41 PM No files match the supplied pattern. 15:39:38 (13919): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:51:09 (21671): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:56:54 (21693): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:27:42 (21701): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:39:08 (22995): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:05:27 (23008): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:11:27 (24290): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:29:25 (24298): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:46:46 (24326): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:58:49 (24344): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:05:20 (24360): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:17:42 (25619): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:33:53 (25635): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:03:15 (25659): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... MainError: 09:49:35 PM No files match the supplied pattern. MainError: 09:49:35 PM No files match the supplied pattern. MainError: 10:16:02 AM No files match the supplied pattern. MainError: 10:16:02 AM No files match the supplied pattern. MainError: 10:42:07 AM No files match the supplied pattern. MainError: 10:42:07 AM No files match the supplied pattern. MainError: 06:32:37 PM No files match the supplied pattern. MainError: 06:32:37 PM No files match the supplied pattern. MainError: 07:25:07 AM No files match the supplied pattern. MainError: 07:25:07 AM No files match the supplied pattern. 07:02:49 (26945): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... MainError: 01:04:40 PM No files match the supplied pattern. MainError: 01:04:40 PM No files match the supplied pattern. Error converting file to netcdf: dataout/o5nvka.ph11c10 Error converting file to netcdf: dataout/o5nvka.pg11c10 Error converting file to netcdf: dataout/o5nvka.pe11c10 MainError: 10:23:49 AM No files match the supplied pattern. MainError: 10:23:49 AM No files match the supplied pattern. BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 67 - Return code = 1 Model crashed: STWORK : I/O error - PP fixed length header tmp/pipe_dummy 2048 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 64 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 66 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 67 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 BUFFIN: Read Failed: Numerical result out of range BUFFIN: C I/O Error feof - Unit 67 - Return code = 1 Model crashed: STWORK : I/O error - PP fixed length header tmp/pipe_dummy 2048 BUFFIN: Read Failed: Inappropriate ioctl for device BUFFIN: C I/O Error feof - Unit 64 - Return code = 1 BUFFIN: Read Failed: Invalid argument BUFFIN: C I/O Error feof - Unit 66 - Return code = 1 BUFFIN: Read Failed: Invalid argument BUFFIN: C I/O Error feof - Unit 67 - Return code = 1 BUFFIN: Read Failed: Invalid argument BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: Invalid argument BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 BUFFIN: Read Failed: Numerical result out of range BUFFIN: C I/O Error feof - Unit 67 - Return code = 1 Model crashed: STWORK : I/O error - PP fixed length header tmp/pipe_dummy 2048 BUFFIN: Read Failed: Inappropriate ioctl for device BUFFIN: C I/O Error feof - Unit 64 - Return code = 1 BUFFIN: Read Failed: Invalid argument BUFFIN: C I/O Error feof - Unit 66 - Return code = 1 BUFFIN: Read Failed: Invalid argument BUFFIN: C I/O Error feof - Unit 67 - Return code = 1 BUFFIN: Read Failed: Invalid argument BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: Invalid argument BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 BUFFIN: Read Failed: Numerical result out of range BUFFIN: C I/O Error feof - Unit 67 - Return code = 1 Model crashed: STWORK : I/O error - PP fixed length header tmp/pipe_dummy 2048 BUFFIN: Read Failed: Inappropriate ioctl for device BUFFIN: C I/O Error feof - Unit 64 - Return code = 1 BUFFIN: Read Failed: Invalid argument BUFFIN: C I/O Error feof - Unit 66 - Return code = 1 BUFFIN: Read Failed: Invalid argument BUFFIN: C I/O Error feof - Unit 67 - Return code = 1 BUFFIN: Read Failed: Invalid argument BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: Invalid argument BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 BUFFIN: Read Failed: Numerical result out of range BUFFIN: C I/O Error feof - Unit 67 - Return code = 1 Model crashed: STWORK : I/O error - PP fixed length header tmp/pipe_dummy 2048 BUFFIN: Read Failed: Inappropriate ioctl for device BUFFIN: C I/O Error feof - Unit 64 - Return code = 1 BUFFIN: Read Failed: Invalid argument BUFFIN: C I/O Error feof - Unit 66 - Return code = 1 BUFFIN: Read Failed: Invalid argument BUFFIN: C I/O Error feof - Unit 67 - Return code = 1 BUFFIN: Read Failed: Invalid argument BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: Invalid argument BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 BUFFIN: Read Failed: Numerical result out of range BUFFIN: C I/O Error feof - Unit 67 - Return code = 1 Model crashed: STWORK : I/O error - PP fixed length header tmp/pipe_dummy 2048 Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
16 Feb 2013 10:24:18 | 429009 | 15518484 | hadcm3n_o5nv_2140_40_008270445_2 | 777,600 | 2,321,680 | 2.9857 |
14 Feb 2013 13:05:11 | 429009 | 15518484 | hadcm3n_o5nv_2140_40_008270445_2 | 751,680 | 2,244,675 | 2.9862 |
13 Feb 2013 07:26:06 | 429009 | 15518484 | hadcm3n_o5nv_2140_40_008270445_2 | 725,760 | 2,167,949 | 2.9871 |
11 Feb 2013 18:36:49 | 429009 | 15518484 | hadcm3n_o5nv_2140_40_008270445_2 | 699,840 | 2,090,870 | 2.9876 |
10 Feb 2013 10:42:36 | 429009 | 15518484 | hadcm3n_o5nv_2140_40_008270445_2 | 673,920 | 2,013,660 | 2.9880 |
08 Feb 2013 10:17:37 | 429009 | 15518484 | hadcm3n_o5nv_2140_40_008270445_2 | 648,000 | 1,936,575 | 2.9885 |
06 Feb 2013 21:53:44 | 429009 | 15518484 | hadcm3n_o5nv_2140_40_008270445_2 | 622,080 | 1,859,462 | 2.9891 |
05 Feb 2013 15:20:43 | 429009 | 15518484 | hadcm3n_o5nv_2140_40_008270445_2 | 596,160 | 1,782,587 | 2.9901 |
04 Feb 2013 08:57:50 | 429009 | 15518484 | hadcm3n_o5nv_2140_40_008270445_2 | 570,240 | 1,705,250 | 2.9904 |
01 Feb 2013 17:42:28 | 429009 | 15518484 | hadcm3n_o5nv_2140_40_008270445_2 | 544,320 | 1,627,777 | 2.9905 |
31 Jan 2013 11:26:08 | 429009 | 15518484 | hadcm3n_o5nv_2140_40_008270445_2 | 518,400 | 1,550,428 | 2.9908 |
29 Jan 2013 21:41:42 | 429009 | 15518484 | hadcm3n_o5nv_2140_40_008270445_2 | 492,480 | 1,471,819 | 2.9886 |
28 Jan 2013 15:30:34 | 429009 | 15518484 | hadcm3n_o5nv_2140_40_008270445_2 | 466,560 | 1,392,859 | 2.9854 |
27 Jan 2013 00:24:24 | 429009 | 15518484 | hadcm3n_o5nv_2140_40_008270445_2 | 440,640 | 1,313,806 | 2.9816 |
25 Jan 2013 11:20:45 | 429009 | 15518484 | hadcm3n_o5nv_2140_40_008270445_2 | 414,720 | 1,235,048 | 2.9780 |
23 Jan 2013 19:35:58 | 429009 | 15518484 | hadcm3n_o5nv_2140_40_008270445_2 | 388,800 | 1,156,526 | 2.9746 |
22 Jan 2013 10:10:45 | 429009 | 15518484 | hadcm3n_o5nv_2140_40_008270445_2 | 362,880 | 1,077,878 | 2.9703 |
20 Jan 2013 17:01:14 | 429009 | 15518484 | hadcm3n_o5nv_2140_40_008270445_2 | 336,960 | 999,301 | 2.9656 |
18 Jan 2013 17:32:24 | 429009 | 15518484 | hadcm3n_o5nv_2140_40_008270445_2 | 311,040 | 920,946 | 2.9609 |
17 Jan 2013 08:14:30 | 429009 | 15518484 | hadcm3n_o5nv_2140_40_008270445_2 | 285,120 | 841,981 | 2.9531 |
©2024 cpdn.org