Name | hadcm3n_n6cu_1920_40_008378214_0 |
Workunit | 8529073 |
Created | 30 May 2013, 14:34:18 UTC |
Sent | 30 May 2013, 15:16:58 UTC |
Report deadline | 29 Aug 2013, 22:44:09 UTC |
Received | 20 Jun 2013, 2:54:22 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1240735 |
Run time | 16 days 14 hours 55 min 17 sec |
CPU time | 15 days 4 hours 31 min 34 sec |
Validate state | Invalid |
Credit | 11,508.48 |
Device peak FLOPS | 3.07 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-pc-linux-gnu |
Stderr | <core_client_version>7.0.65</core_client_version> <![CDATA[ <message> process exited with code 22 (0x16, -234) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... 21:32:24 (20761): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:35:42 (27027): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 10:15:19 (2345): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Atmos Hold Restart file rename failed on atmos_restart.hold 10:30:10 (27162): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:30:11 (27162): No heartbeat from core client for 30 sec - exiting 10:34:26 (32715): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 09:24:59 (1710): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:18:36 (20118): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:21:19 (1844): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:22:02 (1844): No heartbeat from core client for 30 sec - exiting 12:29:24 (2881): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:29:25 (2881): No heartbeat from core client for 30 sec - exiting 12:29:26 (2881): No heartbeat from core client for 30 sec - exiting 12:29:27 (2881): No heartbeat from core client for 30 sec - exiting 12:35:16 (6084): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:35:17 (6084): No heartbeat from core client for 30 sec - exiting 12:35:18 (6084): No heartbeat from core client for 30 sec - exiting 12:35:19 (6084): No heartbeat from core client for 30 sec - exiting 12:35:20 (6084): No heartbeat from core client for 30 sec - exiting 12:35:21 (6084): No heartbeat from core client for 30 sec - exiting 12:36:25 (8008): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Atmos Hold Restart file rename failed on atmos_restart.hold 12:46:46 (8372): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:46:47 (8372): No heartbeat from core client for 30 sec - exiting 12:47:09 (8372): No heartbeat from core client for 30 sec - exiting 12:47:10 (8372): No heartbeat from core client for 30 sec - exiting 12:47:11 (8372): No heartbeat from core client for 30 sec - exiting 12:47:12 (8372): No heartbeat from core client for 30 sec - exiting 12:47:13 (8372): No heartbeat from core client for 30 sec - exiting 12:47:14 (8372): No heartbeat from core client for 30 sec - exiting 12:47:15 (8372): No heartbeat from core client for 30 sec - exiting 12:47:16 (8372): No heartbeat from core client for 30 sec - exiting 12:47:17 (8372): No heartbeat from core client for 30 sec - exiting 12:47:18 (8372): No heartbeat from core client for 30 sec - exiting 12:47:19 (8372): No heartbeat from core client for 30 sec - exiting 12:47:20 (8372): No heartbeat from core client for 30 sec - exiting 12:47:21 (8372): No heartbeat from core client for 30 sec - exiting 12:47:22 (8372): No heartbeat from core client for 30 sec - exiting 12:47:23 (8372): No heartbeat from core client for 30 sec - exiting 12:47:24 (8372): No heartbeat from core client for 30 sec - exiting 12:47:25 (8372): No heartbeat from core client for 30 sec - exiting 12:49:41 (11043): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:49:42 (11043): No heartbeat from core client for 30 sec - exiting Atmos Hold Restart file rename failed on atmos_restart.hold 13:08:06 (11313): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:08:07 (11313): No heartbeat from core client for 30 sec - exiting 13:08:08 (11313): No heartbeat from core client for 30 sec - exiting Atmos Hold Restart file rename failed on atmos_restart.hold 13:59:15 (15810): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:59:16 (15810): No heartbeat from core client for 30 sec - exiting 13:59:17 (15810): No heartbeat from core client for 30 sec - exiting 13:59:18 (15810): No heartbeat from core client for 30 sec - exiting 14:30:41 (24216): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:30:42 (24216): No heartbeat from core client for 30 sec - exiting 14:30:43 (24216): No heartbeat from core client for 30 sec - exiting 14:30:44 (24216): No heartbeat from core client for 30 sec - exiting 14:30:45 (24216): No heartbeat from core client for 30 sec - exiting 14:30:46 (24216): No heartbeat from core client for 30 sec - exiting 14:30:47 (24216): No heartbeat from core client for 30 sec - exiting 14:30:48 (24216): No heartbeat from core client for 30 sec - exiting 14:30:49 (24216): No heartbeat from core client for 30 sec - exiting 14:30:50 (24216): No heartbeat from core client for 30 sec - exiting 14:30:51 (24216): No heartbeat from core client for 30 sec - exiting 14:30:52 (24216): No heartbeat from core client for 30 sec - exiting 14:30:53 (24216): No heartbeat from core client for 30 sec - exiting 14:30:54 (24216): No heartbeat from core client for 30 sec - exiting 14:30:55 (24216): No heartbeat from core client for 30 sec - exiting 14:30:56 (24216): No heartbeat from core client for 30 sec - exiting 14:30:57 (24216): No heartbeat from core client for 30 sec - exiting 14:30:58 (24216): No heartbeat from core client for 30 sec - exiting 14:30:59 (24216): No heartbeat from core client for 30 sec - exiting 14:31:00 (24216): No heartbeat from core client for 30 sec - exiting 14:31:01 (24216): No heartbeat from core client for 30 sec - exiting 14:31:02 (24216): No heartbeat from core client for 30 sec - exiting 14:31:03 (24216): No heartbeat from core client for 30 sec - exiting 14:31:04 (24216): No heartbeat from core client for 30 sec - exiting 14:31:05 (24216): No heartbeat from core client for 30 sec - exiting 14:34:34 (29340): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:34:35 (29340): No heartbeat from core client for 30 sec - exiting 14:34:36 (29340): No heartbeat from core client for 30 sec - exiting 14:34:37 (29340): No heartbeat from core client for 30 sec - exiting 15:46:09 (30354): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:46:10 (30354): No heartbeat from core client for 30 sec - exiting 15:46:11 (30354): No heartbeat from core client for 30 sec - exiting 15:46:12 (30354): No heartbeat from core client for 30 sec - exiting 15:46:13 (30354): No heartbeat from core client for 30 sec - exiting 15:46:14 (30354): No heartbeat from core client for 30 sec - exiting 15:46:15 (30354): No heartbeat from core client for 30 sec - exiting 15:46:16 (30354): No heartbeat from core client for 30 sec - exiting 15:46:17 (30354): No heartbeat from core client for 30 sec - exiting 15:46:18 (30354): No heartbeat from core client for 30 sec - exiting 15:46:19 (30354): No heartbeat from core client for 30 sec - exiting 15:46:20 (30354): No heartbeat from core client for 30 sec - exiting 16:04:23 (4155): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:04:24 (4155): No heartbeat from core client for 30 sec - exiting 16:04:25 (4155): No heartbeat from core client for 30 sec - exiting 16:04:26 (4155): No heartbeat from core client for 30 sec - exiting 16:07:33 (9018): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:07:34 (9018): No heartbeat from core client for 30 sec - exiting 16:07:35 (9018): No heartbeat from core client for 30 sec - exiting 16:07:36 (9018): No heartbeat from core client for 30 sec - exiting 16:07:37 (9018): No heartbeat from core client for 30 sec - exiting 16:07:38 (9018): No heartbeat from core client for 30 sec - exiting 16:07:39 (9018): No heartbeat from core client for 30 sec - exiting 16:07:40 (9018): No heartbeat from core client for 30 sec - exiting 16:07:41 (9018): No heartbeat from core client for 30 sec - exiting 16:07:42 (9018): No heartbeat from core client for 30 sec - exiting 16:07:43 (9018): No heartbeat from core client for 30 sec - exiting 16:07:44 (9018): No heartbeat from core client for 30 sec - exiting 16:07:45 (9018): No heartbeat from core client for 30 sec - exiting 16:07:46 (9018): No heartbeat from core client for 30 sec - exiting 16:07:47 (9018): No heartbeat from core client for 30 sec - exiting 16:07:48 (9018): No heartbeat from core client for 30 sec - exiting 16:07:49 (9018): No heartbeat from core client for 30 sec - exiting 16:07:50 (9018): No heartbeat from core client for 30 sec - exiting 16:07:51 (9018): No heartbeat from core client for 30 sec - exiting 16:07:52 (9018): No heartbeat from core client for 30 sec - exiting 16:07:53 (9018): No heartbeat from core client for 30 sec - exiting BUFFOUT: Write Failed: No space left on device BUFFOUT: C I/O Error - Return code = 1 Model crashed: WRITDUMP: BAD BUFFOUT OF DATA tmp/pipe_dummy 2048 BUFFOUT: Write Failed: No space left on device BUFFOUT: C I/O Error - Return code = 1 Model crashed: WRITDUMP: BAD BUFFOUT OF DATA tmp/pipe_dummy 2048 SETPOS: Seek Failed: No space left on device SETPOS: Unit 42 to Word Address 10520576 Failed with Error Code -1 Model crashed: SETPOS: Unit 42 to Word Address 10520576 Failed with Error Code -1 BUFFOUT: Write Failed: No space left on device BUFFOUT: C I/O Error - Return code = 1 Model crashed: WRITDUMP: BAD BUFFOUT OF DATA tmp/pipe_dummy 2048 BUFFOUT: Write Failed: No space left on device BUFFOUT: C I/O Error - Return code = 1 Model crashed: WRITDUMP: BAD BUFFOUT OF DATA tmp/pipe_dummy 2048 BUFFOUT: Write Failed: No space left on device BUFFOUT: C I/O Error - Return code = 1 Model crashed: WRITDUMP: BAD BUFFOUT OF DATA tmp/pipe_dummy 2048 Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
19 Jun 2013 23:08:11 | 1240735 | 15807734 | hadcm3n_n6cu_1920_40_008378214_0 | 959,040 | 1,402,828 | 1.4627 |
19 Jun 2013 11:22:35 | 1240735 | 15807734 | hadcm3n_n6cu_1920_40_008378214_0 | 933,120 | 1,365,070 | 1.4629 |
19 Jun 2013 00:22:25 | 1240735 | 15807734 | hadcm3n_n6cu_1920_40_008378214_0 | 907,200 | 1,326,257 | 1.4619 |
18 Jun 2013 12:52:38 | 1240735 | 15807734 | hadcm3n_n6cu_1920_40_008378214_0 | 881,280 | 1,288,312 | 1.4619 |
18 Jun 2013 02:00:48 | 1240735 | 15807734 | hadcm3n_n6cu_1920_40_008378214_0 | 855,360 | 1,249,494 | 1.4608 |
17 Jun 2013 15:23:36 | 1240735 | 15807734 | hadcm3n_n6cu_1920_40_008378214_0 | 829,440 | 1,210,684 | 1.4596 |
17 Jun 2013 03:43:52 | 1240735 | 15807734 | hadcm3n_n6cu_1920_40_008378214_0 | 803,520 | 1,172,460 | 1.4592 |
16 Jun 2013 17:00:09 | 1240735 | 15807734 | hadcm3n_n6cu_1920_40_008378214_0 | 777,600 | 1,134,146 | 1.4585 |
16 Jun 2013 06:04:39 | 1240735 | 15807734 | hadcm3n_n6cu_1920_40_008378214_0 | 751,680 | 1,095,766 | 1.4578 |
15 Jun 2013 19:32:40 | 1240735 | 15807734 | hadcm3n_n6cu_1920_40_008378214_0 | 725,760 | 1,058,038 | 1.4578 |
15 Jun 2013 08:34:36 | 1240735 | 15807734 | hadcm3n_n6cu_1920_40_008378214_0 | 699,840 | 1,020,163 | 1.4577 |
14 Jun 2013 21:54:05 | 1240735 | 15807734 | hadcm3n_n6cu_1920_40_008378214_0 | 673,920 | 982,137 | 1.4573 |
14 Jun 2013 11:13:37 | 1240735 | 15807734 | hadcm3n_n6cu_1920_40_008378214_0 | 648,000 | 944,103 | 1.4569 |
14 Jun 2013 00:28:15 | 1240735 | 15807734 | hadcm3n_n6cu_1920_40_008378214_0 | 622,080 | 905,606 | 1.4558 |
13 Jun 2013 13:45:48 | 1240735 | 15807734 | hadcm3n_n6cu_1920_40_008378214_0 | 596,160 | 867,499 | 1.4551 |
09 Jun 2013 19:54:55 | 1240735 | 15807734 | hadcm3n_n6cu_1920_40_008378214_0 | 570,240 | 829,618 | 1.4549 |
09 Jun 2013 09:19:53 | 1240735 | 15807734 | hadcm3n_n6cu_1920_40_008378214_0 | 544,320 | 791,716 | 1.4545 |
08 Jun 2013 22:35:32 | 1240735 | 15807734 | hadcm3n_n6cu_1920_40_008378214_0 | 518,400 | 753,383 | 1.4533 |
08 Jun 2013 11:44:50 | 1240735 | 15807734 | hadcm3n_n6cu_1920_40_008378214_0 | 492,480 | 714,964 | 1.4518 |
08 Jun 2013 01:00:09 | 1240735 | 15807734 | hadcm3n_n6cu_1920_40_008378214_0 | 466,560 | 676,794 | 1.4506 |
©2025 cpdn.org