Name | hadam3p_saf_05h8_2003_1_006777876_0 |
Workunit | 6981192 |
Created | 16 Nov 2010, 22:53:24 UTC |
Sent | 16 Nov 2010, 22:58:49 UTC |
Report deadline | 30 Oct 2011, 4:18:49 UTC |
Received | 22 Mar 2011, 11:42:07 UTC |
Server state | Over |
Outcome | No reply |
Client state | Done |
Exit status | 0 (0x00000000) |
Computer ID | 1114615 |
Run time | 9 days 12 hours 24 min |
CPU time | 7 days 17 hours 6 min 14 sec |
Validate state | Initial |
Credit | 2,242.53 |
Device peak FLOPS | 2.63 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Southern Africa v6.08 windows_intelx86 |
Stderr | <core_client_version>6.10.18</core_client_version> <![CDATA[ <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 22:17:55 (5264): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 16:41:03 (5520): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4916, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5132, selfPID=2124, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6224, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5624, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5308, selfPID=4356, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5304, selfPID=2620, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4788, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2348, selfPID=4352, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6328, selfPID=6088, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=748, selfPID=4872, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CCPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5760, selfPID=5300, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6116, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3028, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 18:19:49 (1908): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:19:51 (1908): No heartbeat from core client for 30 sec - exiting 18:19:52 (1908): No heartbeat from core client for 30 sec - exiting 18:19:53 (1908): No heartbeat from core client for 30 sec - exiting 18:19:54 (1908): No heartbeat from core client for 30 sec - exiting 18:19:55 (1908): No heartbeat from core client for 30 sec - exiting 18:19:56 (1908): No heartbeat from core client for 30 sec - exiting 18:19:57 (1908): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... 21:11:28 (6088): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 01:25:37 (6132): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:25:38 (6132): No heartbeat from core client for 30 sec - exiting 01:25:39 (6132): No heartbeat from core client for 30 sec - exiting 01:25:40 (6132): No heartbeat from core client for 30 sec - exiting 01:25:41 (6132): No heartbeat from core client for 30 sec - exiting 01:25:42 (6132): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 21:56:45 (4788): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 23:16:08 (1180): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2600, selfPID=2600, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5748, selfPID=5748, iMonCtr=2 CPDN Monitor - Quit request from BOINC... 22:44:36 (948): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4736, selfPID=5452, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6188, selfPID=5600, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2260, selfPID=4608, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5856, selfPID=5264, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5820, iMonCtr=2 Atmos Restart file copy failed on atmos_restart.day Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6364, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5204, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5176, selfPID=3040, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5992, selfPID=5336, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4368, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4468, selfPID=3708, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4872, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=216, selfPID=2456, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2300, selfPID=316, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3952, selfPID=3952, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4924, selfPID=4656, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5628, selfPID=5628, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1400, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5824, selfPID=6988, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5724, selfPID=5032, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4212, selfPID=3404, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1384, selfPID=4488, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6372, selfPID=5776, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7040, selfPID=2396, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 11:00:50 (1740): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:01:23 (6660): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1356, selfPID=3044, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2016, selfPID=5796, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5356, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3952, selfPID=5236, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4172, selfPID=4172, iMonCtr=2 Leaving CPDN_Main::Monitor... 15:40:00 (5528): called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
22 Mar 2011 03:50:12 | 1114615 | 12043285 | hadam3p_saf_05h8_2003_1_006777876_0 | 138,240 | 664,976 | 4.8103 |
18 Mar 2011 06:14:52 | 1114615 | 12043285 | hadam3p_saf_05h8_2003_1_006777876_0 | 126,720 | 610,449 | 4.8173 |
08 Mar 2011 23:55:05 | 1114615 | 12043285 | hadam3p_saf_05h8_2003_1_006777876_0 | 115,212 | 557,274 | 4.8369 |
08 Mar 2011 23:55:05 | 1114615 | 12043285 | hadam3p_saf_05h8_2003_1_006777876_0 | 115,203 | 556,339 | 4.8292 |
08 Mar 2011 23:55:05 | 1114615 | 12043285 | hadam3p_saf_05h8_2003_1_006777876_0 | 115,200 | 555,198 | 4.8194 |
25 Feb 2011 14:23:32 | 1114615 | 12043285 | hadam3p_saf_05h8_2003_1_006777876_0 | 103,680 | 504,075 | 4.8618 |
16 Feb 2011 06:56:20 | 1114615 | 12043285 | hadam3p_saf_05h8_2003_1_006777876_0 | 92,160 | 452,543 | 4.9104 |
28 Jan 2011 23:49:48 | 1114615 | 12043285 | hadam3p_saf_05h8_2003_1_006777876_0 | 80,352 | 394,260 | 4.9067 |
09 Jan 2011 12:09:52 | 1114615 | 12043285 | hadam3p_saf_05h8_2003_1_006777876_0 | 69,216 | 342,442 | 4.9474 |
02 Jan 2011 04:54:53 | 1114615 | 12043285 | hadam3p_saf_05h8_2003_1_006777876_0 | 57,696 | 289,299 | 5.0142 |
16 Dec 2010 22:31:52 | 1114615 | 12043285 | hadam3p_saf_05h8_2003_1_006777876_0 | 46,176 | 235,601 | 5.1022 |
06 Dec 2010 08:34:54 | 1114615 | 12043285 | hadam3p_saf_05h8_2003_1_006777876_0 | 34,656 | 180,111 | 5.1971 |
01 Dec 2010 11:42:15 | 1114615 | 12043285 | hadam3p_saf_05h8_2003_1_006777876_0 | 23,136 | 119,199 | 5.1521 |
26 Nov 2010 23:57:41 | 1114615 | 12043285 | hadam3p_saf_05h8_2003_1_006777876_0 | 11,616 | 58,623 | 5.0467 |
©2024 cpdn.org