Name | hadcm3n_zkzt_1880_40_008199111_4 |
Workunit | 8354235 |
Created | 21 Oct 2012, 21:53:46 UTC |
Sent | 21 Oct 2012, 21:54:00 UTC |
Report deadline | 21 Jan 2013, 5:21:11 UTC |
Received | 1 Dec 2012, 3:11:46 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1225550 |
Run time | 28 days 19 hours 31 min 14 sec |
CPU time | 22 days 23 hours 30 min 20 sec |
Validate state | Invalid |
Credit | 10,575.36 |
Device peak FLOPS | 1.87 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-pc-linux-gnu |
Stderr | <core_client_version>7.0.29</core_client_version> <![CDATA[ <message> process exited with code 22 (0x16, -234) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... 09:28:18 (22733): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 15:50:12 (13141): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... 15:50:14 (13141): No heartbeat from core client for 30 sec - exiting 15:50:15 (13141): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 14:11:28 (11308): No heartbeat from core client for 30 sec - exiting 14:11:31 (11308): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 17:38:09 (7521): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 01:17:42 (28621): No heartbeat from core client for 30 sec - exiting 01:17:44 (28621): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:17:45 (28621): No heartbeat from core client for 30 sec - exiting 01:17:46 (28621): No heartbeat from core client for 30 sec - exiting 01:17:47 (28621): No heartbeat from core client for 30 sec - exiting 01:17:48 (28621): No heartbeat from core client for 30 sec - exiting 01:17:49 (28621): No heartbeat from core client for 30 sec - exiting 01:17:50 (28621): No heartbeat from core client for 30 sec - exiting 01:17:51 (28621): No heartbeat from core client for 30 sec - exiting 01:17:52 (28621): No heartbeat from core client for 30 sec - exiting 01:17:53 (28621): No heartbeat from core client for 30 sec - exiting 01:17:54 (28621): No heartbeat from core client for 30 sec - exiting 01:17:55 (28621): No heartbeat from core client for 30 sec - exiting 01:17:56 (28621): No heartbeat from core client for 30 sec - exiting 01:17:57 (28621): No heartbeat from core client for 30 sec - exiting 01:17:58 (28621): No heartbeat from core client for 30 sec - exiting 01:17:59 (28621): No heartbeat from core client for 30 sec - exiting 01:18:00 (28621): No heartbeat from core client for 30 sec - exiting 01:18:01 (28621): No heartbeat from core client for 30 sec - exiting 01:18:02 (28621): No heartbeat from core client for 30 sec - exiting 01:18:03 (28621): No heartbeat from core client for 30 sec - exiting 01:18:04 (28621): No heartbeat from core client for 30 sec - exiting 01:18:05 (28621): No heartbeat from core client for 30 sec - exiting 01:18:06 (28621): No heartbeat from core client for 30 sec - exiting 01:18:07 (28621): No heartbeat from core client for 30 sec - exiting 01:18:08 (28621): No heartbeat from core client for 30 sec - exiting 01:18:09 (28621): No heartbeat from core client for 30 sec - exiting 01:18:10 (28621): No heartbeat from core client for 30 sec - exiting 01:18:11 (28621): No heartbeat from core client for 30 sec - exiting 01:18:12 (28621): No heartbeat from core client for 30 sec - exiting 01:18:13 (28621): No heartbeat from core client for 30 sec - exiting 01:18:14 (28621): No heartbeat from core client for 30 sec - exiting 01:18:15 (28621): No heartbeat from core client for 30 sec - exiting 01:18:16 (28621): No heartbeat from core client for 30 sec - exiting 01:18:17 (28621): No heartbeat from core client for 30 sec - exiting 01:18:18 (28621): No heartbeat from core client for 30 sec - exiting 01:18:19 (28621): No heartbeat from core client for 30 sec - exiting 01:18:20 (28621): No heartbeat from core client for 30 sec - exiting 01:18:21 (28621): No heartbeat from core client for 30 sec - exiting 01:18:22 (28621): No heartbeat from core client for 30 sec - exiting 01:18:23 (28621): No heartbeat from core client for 30 sec - exiting 01:18:24 (28621): No heartbeat from core client for 30 sec - exiting 01:18:25 (28621): No heartbeat from core client for 30 sec - exiting 01:18:26 (28621): No heartbeat from core client for 30 sec - exiting 01:18:27 (28621): No heartbeat from core client for 30 sec - exiting 01:18:28 (28621): No heartbeat from core client for 30 sec - exiting 01:18:29 (28621): No heartbeat from core client for 30 sec - exiting 01:18:30 (28621): No heartbeat from core client for 30 sec - exiting 01:18:31 (28621): No heartbeat from core client for 30 sec - exiting 01:18:32 (28621): No heartbeat from core client for 30 sec - exiting 01:18:33 (28621): No heartbeat from core client for 30 sec - exiting 01:18:34 (28621): No heartbeat from core client for 30 sec - exiting 01:18:35 (28621): No heartbeat from core client for 30 sec - exiting 01:18:36 (28621): No heartbeat from core client for 30 sec - exiting 01:18:37 (28621): No heartbeat from core client for 30 sec - exiting 01:18:38 (28621): No heartbeat from core client for 30 sec - exiting 01:18:39 (28621): No heartbeat from core client for 30 sec - exiting 01:18:40 (28621): No heartbeat from core client for 30 sec - exiting 01:18:41 (28621): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... BUFFOUT: Write Failed: No space left on device BUFFOUT: C I/O Error - Return code = 1 Model crashed: WRITDUMP: BAD BUFFOUT OF DATA tmp/pipe_dummy 2048 SETPOS: Seek Failed: No space left on device SETPOS: Unit 22 to Word Address 555008 Failed with Error Code -1 Model crashed: SETPOS: Unit 22 to Word Address 555008 Failed with Error Code -1 SETPOS: Seek Failed: No space left on device SETPOS: Unit 22 to Word Address 702464 Failed with Error Code -1 Model crashed: SETPOS: Unit 22 to Word Address 702464 Failed with Error Code -1 BUFFOUT: Write Failed: No space left on device BUFFOUT: C I/O Error - Return code = 1 Model crashed: WRITDUMP: BAD BUFFOUT OF DATA tmp/pipe_dummy 2048 SETPOS: Seek Failed: No space left on device SETPOS: Unit 22 to Word Address 845824 Failed with Error Code -1 Model crashed: SETPOS: Unit 22 to Word Address 845824 Failed with Error Code -1 BUFFOUT: Write Failed: No space left on device BUFFOUT: C I/O Error - Return code = 1 Model crashed: WRITDUMP: BAD BUFFOUT OF DATA Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
29 Nov 2012 08:27:24 | 1225550 | 15374213 | hadcm3n_zkzt_1880_40_008199111_4 | 881,280 | 2,021,067 | 2.2933 |
28 Nov 2012 00:59:56 | 1225550 | 15374213 | hadcm3n_zkzt_1880_40_008199111_4 | 855,360 | 1,963,389 | 2.2954 |
25 Nov 2012 21:27:17 | 1225550 | 15374213 | hadcm3n_zkzt_1880_40_008199111_4 | 829,440 | 1,900,202 | 2.2909 |
25 Nov 2012 02:34:08 | 1225550 | 15374213 | hadcm3n_zkzt_1880_40_008199111_4 | 803,520 | 1,840,649 | 2.2907 |
24 Nov 2012 05:07:38 | 1225550 | 15374213 | hadcm3n_zkzt_1880_40_008199111_4 | 777,600 | 1,780,277 | 2.2895 |
23 Nov 2012 03:29:28 | 1225550 | 15374213 | hadcm3n_zkzt_1880_40_008199111_4 | 751,680 | 1,720,532 | 2.2889 |
22 Nov 2012 06:27:05 | 1225550 | 15374213 | hadcm3n_zkzt_1880_40_008199111_4 | 725,760 | 1,657,523 | 2.2838 |
21 Nov 2012 04:33:03 | 1225550 | 15374213 | hadcm3n_zkzt_1880_40_008199111_4 | 699,840 | 1,596,166 | 2.2808 |
19 Nov 2012 09:23:46 | 1225550 | 15374213 | hadcm3n_zkzt_1880_40_008199111_4 | 673,920 | 1,538,573 | 2.2830 |
18 Nov 2012 13:45:26 | 1225550 | 15374213 | hadcm3n_zkzt_1880_40_008199111_4 | 648,000 | 1,482,576 | 2.2879 |
17 Nov 2012 18:48:07 | 1225550 | 15374213 | hadcm3n_zkzt_1880_40_008199111_4 | 622,080 | 1,424,821 | 2.2904 |
17 Nov 2012 00:19:07 | 1225550 | 15374213 | hadcm3n_zkzt_1880_40_008199111_4 | 596,160 | 1,367,015 | 2.2930 |
16 Nov 2012 04:50:18 | 1225550 | 15374213 | hadcm3n_zkzt_1880_40_008199111_4 | 570,240 | 1,306,568 | 2.2913 |
15 Nov 2012 09:14:26 | 1225550 | 15374213 | hadcm3n_zkzt_1880_40_008199111_4 | 544,320 | 1,246,519 | 2.2900 |
14 Nov 2012 05:18:38 | 1225550 | 15374213 | hadcm3n_zkzt_1880_40_008199111_4 | 518,400 | 1,161,685 | 2.2409 |
13 Nov 2012 11:18:46 | 1225550 | 15374213 | hadcm3n_zkzt_1880_40_008199111_4 | 492,480 | 1,103,575 | 2.2409 |
10 Nov 2012 21:30:58 | 1225550 | 15374213 | hadcm3n_zkzt_1880_40_008199111_4 | 466,560 | 1,046,656 | 2.2433 |
10 Nov 2012 02:30:02 | 1225550 | 15374213 | hadcm3n_zkzt_1880_40_008199111_4 | 440,640 | 986,002 | 2.2377 |
08 Nov 2012 14:30:28 | 1225550 | 15374213 | hadcm3n_zkzt_1880_40_008199111_4 | 414,720 | 928,578 | 2.2390 |
07 Nov 2012 13:23:14 | 1225550 | 15374213 | hadcm3n_zkzt_1880_40_008199111_4 | 388,800 | 864,900 | 2.2245 |
©2024 cpdn.org