Name | hadcm3n_n4st_1920_40_008321634_1 |
Workunit | 8472769 |
Created | 26 Mar 2013, 10:52:30 UTC |
Sent | 26 Mar 2013, 10:52:52 UTC |
Report deadline | 25 Jun 2013, 18:20:03 UTC |
Received | 1 Jun 2013, 20:31:53 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1273614 |
Run time | 57 days 22 hours 44 min 5 sec |
CPU time | 53 days 19 hours 44 min 19 sec |
Validate state | Invalid |
Credit | 10,886.40 |
Device peak FLOPS | 1.24 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-pc-linux-gnu |
Stderr | <core_client_version>7.0.27</core_client_version> <![CDATA[ <message> process exited with code 22 (0x16, -234) </message> <stderr_txt> 11:57:25 (23242): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 11:21:06 (30543): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:45:37 (11746): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:47:56 (18587): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 12:12:22 (18688): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:18:45 (20499): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:21:26 (20673): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:24:41 (20786): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 22:10:45 (23165): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 20:29:53 (26327): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:29:46 (30950): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 63 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 64 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 65 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 66 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 67 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... 20:09:03 (3734): No heartbeat from core client for 30 sec - exiting 20:09:04 (3734): No heartbeat from core client for 30 sec - exiting 20:09:05 (3734): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:27:45 (3436): No heartbeat from core client for 30 sec - exiting 20:27:46 (3436): No heartbeat from core client for 30 sec - exiting 20:27:47 (3436): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 20:31:22 (3860): No heartbeat from core client for 30 sec - exiting 20:31:27 (3860): No heartbeat from core client for 30 sec - exiting 20:31:28 (3860): No heartbeat from core client for 30 sec - exiting 20:31:29 (3860): No heartbeat from core client for 30 sec - exiting 20:31:30 (3860): No heartbeat from core client for 30 sec - exiting 20:31:31 (3860): No heartbeat from core client for 30 sec - exiting 20:31:32 (3860): No heartbeat from core client for 30 sec - exiting 20:31:33 (3860): No heartbeat from core client for 30 sec - exiting 20:31:34 (3860): No heartbeat from core client for 30 sec - exiting 20:31:35 (3860): No heartbeat from core client for 30 sec - exiting 20:31:36 (3860): No heartbeat from core client for 30 sec - exiting 20:31:38 (3860): No heartbeat from core client for 30 sec - exiting 20:31:39 (3860): No heartbeat from core client for 30 sec - exiting 20:31:40 (3860): No heartbeat from core client for 30 sec - exiting 20:31:41 (3860): No heartbeat from core client for 30 sec - exiting 20:31:42 (3860): No heartbeat from core client for 30 sec - exiting 20:31:43 (3860): No heartbeat from core client for 30 sec - exiting 20:31:44 (3860): No heartbeat from core client for 30 sec - exiting 20:31:45 (3860): No heartbeat from core client for 30 sec - exiting 20:31:46 (3860): No heartbeat from core client for 30 sec - exiting 20:31:47 (3860): No heartbeat from core client for 30 sec - exiting 20:31:48 (3860): No heartbeat from core client for 30 sec - exiting 20:31:49 (3860): No heartbeat from core client for 30 sec - exiting 20:31:50 (3860): No heartbeat from core client for 30 sec - exiting 20:31:51 (3860): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 12:24:45 (17046): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:46:24 (4568): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:59:44 (7443): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:03:44 (7790): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:06:07 (8046): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:21:21 (8173): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:38:34 (8639): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:40:43 (9197): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:47:45 (9269): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:24:50 (9597): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:50:41 (10860): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:51:48 (11752): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:54:36 (11781): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:57:02 (11896): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:59:32 (11946): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:01:40 (12043): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:04:22 (12189): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:06:52 (12314): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:08:35 (12410): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:11:10 (12497): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:12:10 (12575): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:14:05 (12660): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:30:55 (4391): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:19:09 (5044): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:08:27 (6696): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:49:01 (9160): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:50:44 (16655): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:36:25 (8373): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... BUFFOUT: Write Failed: No space left on device BUFFOUT: C I/O Error - Return code = 1 Model crashed: WRITHEAD: I/O error tmp/pipe_dummy 2048 forrtl: No space left on device forrtl: severe (38): error during write, unit 6, file /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_n4st_1920_40_008321634/dataout/stdout_um.txt Image PC Routine Line Source hadcm3n_um_6.07_i 0848EB7D Unknown Unknown Unknown hadcm3n_um_6.07_i 0848D975 Unknown Unknown Unknown hadcm3n_um_6.07_i 0845F3CF Unknown Unknown Unknown hadcm3n_um_6.07_i 0841F90D Unknown Unknown Unknown hadcm3n_um_6.07_i 0841F257 Unknown Unknown Unknown hadcm3n_um_6.07_i 08451069 Unknown Unknown Unknown hadcm3n_um_6.07_i 0844E937 Unknown Unknown Unknown hadcm3n_um_6.07_i 0836D10D Unknown Unknown Unknown hadcm3n_um_6.07_i 082EB086 Unknown Unknown Unknown hadcm3n_um_6.07_i 0838F66D Unknown Unknown Unknown hadcm3n_um_6.07_i 0839BDF8 Unknown Unknown Unknown libc.so.6 B7574E46 Unknown Unknown Unknown hadcm3n_um_6.07_i 0804CB11 Unknown Unknown Unknown Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4325, iMonCtr=1 Model crash detected, will try to restart... forrtl: No spacCalled boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
30 May 2013 09:19:21 | 1273614 | 15683684 | hadcm3n_n4st_1920_40_008321634_1 | 907,200 | 4,622,277 | 5.0951 |
27 May 2013 05:47:55 | 1273614 | 15683684 | hadcm3n_n4st_1920_40_008321634_1 | 881,280 | 4,475,657 | 5.0786 |
24 May 2013 05:06:43 | 1273614 | 15683684 | hadcm3n_n4st_1920_40_008321634_1 | 855,360 | 4,344,407 | 5.0790 |
21 May 2013 18:44:33 | 1273614 | 15683684 | hadcm3n_n4st_1920_40_008321634_1 | 829,440 | 4,222,260 | 5.0905 |
20 May 2013 09:16:06 | 1273614 | 15683684 | hadcm3n_n4st_1920_40_008321634_1 | 803,520 | 4,104,388 | 5.1080 |
19 May 2013 00:09:00 | 1273614 | 15683684 | hadcm3n_n4st_1920_40_008321634_1 | 777,600 | 3,986,792 | 5.1270 |
17 May 2013 14:07:26 | 1273614 | 15683684 | hadcm3n_n4st_1920_40_008321634_1 | 751,680 | 3,866,994 | 5.1445 |
15 May 2013 22:23:01 | 1273614 | 15683684 | hadcm3n_n4st_1920_40_008321634_1 | 725,760 | 3,732,867 | 5.1434 |
13 May 2013 12:31:20 | 1273614 | 15683684 | hadcm3n_n4st_1920_40_008321634_1 | 699,840 | 3,596,818 | 5.1395 |
11 May 2013 16:26:22 | 1273614 | 15683684 | hadcm3n_n4st_1920_40_008321634_1 | 673,920 | 3,451,785 | 5.1220 |
09 May 2013 18:06:50 | 1273614 | 15683684 | hadcm3n_n4st_1920_40_008321634_1 | 648,000 | 3,299,117 | 5.0912 |
06 May 2013 15:15:37 | 1273614 | 15683684 | hadcm3n_n4st_1920_40_008321634_1 | 622,080 | 3,156,240 | 5.0737 |
04 May 2013 20:46:17 | 1273614 | 15683684 | hadcm3n_n4st_1920_40_008321634_1 | 596,160 | 3,016,502 | 5.0599 |
03 May 2013 02:26:46 | 1273614 | 15683684 | hadcm3n_n4st_1920_40_008321634_1 | 570,240 | 2,876,139 | 5.0437 |
01 May 2013 07:13:22 | 1273614 | 15683684 | hadcm3n_n4st_1920_40_008321634_1 | 544,320 | 2,735,533 | 5.0256 |
29 Apr 2013 12:43:57 | 1273614 | 15683684 | hadcm3n_n4st_1920_40_008321634_1 | 518,400 | 2,595,350 | 5.0065 |
27 Apr 2013 17:32:35 | 1273614 | 15683684 | hadcm3n_n4st_1920_40_008321634_1 | 492,480 | 2,459,613 | 4.9943 |
25 Apr 2013 22:19:17 | 1273614 | 15683684 | hadcm3n_n4st_1920_40_008321634_1 | 466,560 | 2,326,104 | 4.9856 |
23 Apr 2013 09:34:18 | 1273614 | 15683684 | hadcm3n_n4st_1920_40_008321634_1 | 440,640 | 2,194,105 | 4.9794 |
21 Apr 2013 23:30:26 | 1273614 | 15683684 | hadcm3n_n4st_1920_40_008321634_1 | 414,720 | 2,074,792 | 5.0029 |
©2024 cpdn.org