Name | hadam3p_eu_f97w_2013_1_008763744_0 |
Workunit | 8909722 |
Created | 17 Jun 2014, 15:18:16 UTC |
Sent | 17 Jun 2014, 15:19:05 UTC |
Report deadline | 30 May 2015, 20:39:05 UTC |
Received | 6 Aug 2014, 15:34:47 UTC |
Server state | Over |
Outcome | Success |
Client state | Done |
Exit status | 0 (0x00000000) |
Computer ID | 1243381 |
Run time | 2 days 5 hours 41 min 24 sec |
CPU time | 2 days 5 hours 3 min 30 sec |
Validate state | Workunit error - check skipped |
Credit | 2,386.69 |
Device peak FLOPS | 2.85 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Europe v6.09 windows_intelx86 |
Stderr | <core_client_version>7.2.42</core_client_version> <![CDATA[ <stderr_txt> Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8860, selfPID=8488, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6736, selfPID=6832, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8984, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8992, selfPID=6500, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 19:21:37 (5224): No heartbeat from core client for 30 sec - exiting 19:21:38 (5224): No heartbeat from core client for 30 sec - exiting 19:21:39 (5224): No heartbeat from core client for 30 sec - exiting 19:21:40 (5224): No heartbeat from core client for 30 sec - exiting 19:21:41 (5224): No heartbeat from core client for 30 sec - exiting 19:21:42 (5224): No heartbeat from core client for 30 sec - exiting 19:21:43 (5224): No heartbeat from core client for 30 sec - exiting 19:21:44 (5224): No heartbeat from core client for 30 sec - exiting 19:21:45 (5224): No heartbeat from core client for 30 sec - exiting 19:21:46 (5224): No heartbeat from core client for 30 sec - exiting 19:21:47 (5224): No heartbeat from core client for 30 sec - exiting 19:21:48 (5224): No heartbeat from core client for 30 sec - exiting 19:21:49 (5224): No heartbeat from core client for 30 sec - exiting 19:21:50 (5224): No heartbeat from core client for 30 sec - exiting 19:21:51 (5224): No heartbeat from core client for 30 sec - exiting 19:21:52 (5224): No heartbeat from core client for 30 sec - exiting 19:21:53 (5224): No heartbeat from core client for 30 sec - exiting 19:21:54 (5224): No heartbeat from core client for 30 sec - exiting 19:21:55 (5224): No heartbeat from core client for 30 sec - exiting 19:21:56 (5224): No heartbeat from core client for 30 sec - exiting 19:21:57 (5224): No heartbeat from core client for 30 sec - exiting 19:21:58 (5224): No heartbeat from core client for 30 sec - exiting 19:21:59 (5224): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 10:23:58 (3800): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=11788, selfPID=17804, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6676, selfPID=7060, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 08:47:18 (9484): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:47:19 (9484): No heartbeat from core client for 30 sec - exiting Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7416, selfPID=7416, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 18:26:47 (9208): No heartbeat from core client for 30 sec - exiting 18:26:48 (9208): No heartbeat from core client for 30 sec - exiting 18:26:49 (9208): No heartbeat from core client for 30 sec - exiting 18:26:50 (9208): No heartbeat from core client for 30 sec - exiting 18:26:51 (9208): No heartbeat from core client for 30 sec - exiting 18:26:52 (9208): No heartbeat from core client for 30 sec - exiting 18:26:53 (9208): No heartbeat from core client for 30 sec - exiting 18:26:54 (9208): No heartbeat from core client for 30 sec - exiting 18:26:55 (9208): No heartbeat from core client for 30 sec - exiting 18:26:56 (9208): No heartbeat from core client for 30 sec - exiting 18:26:57 (9208): No heartbeat from core client for 30 sec - exiting 18:26:58 (9208): No heartbeat from core client for 30 sec - exiting 18:26:59 (9208): No heartbeat from core client for 30 sec - exiting 18:27:00 (9208): No heartbeat from core client for 30 sec - exiting 18:27:01 (9208): No heartbeat from core client for 30 sec - exiting 18:27:02 (9208): No heartbeat from core client for 30 sec - exiting 18:27:03 (9208): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=10464, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 10:04:09 (22020): No heartbeat from core client for 30 sec - exiting 10:04:10 (22020): No heartbeat from core client for 30 sec - exiting 10:04:11 (22020): No heartbeat from core client for 30 sec - exiting 10:04:12 (22020): No heartbeat from core client for 30 sec - exiting 10:04:13 (22020): No heartbeat from core client for 30 sec - exiting 10:04:14 (22020): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5448, selfPID=5448, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1704, selfPID=1704, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1624, selfPID=9440, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=9752, selfPID=9752, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
06 Aug 2014 15:36:07 | 1243381 | 16671113 | hadam3p_eu_f97w_2013_1_008763744_0 | 138,353 | 190,730 | 1.3786 |
06 Aug 2014 15:36:07 | 1243381 | 16671113 | hadam3p_eu_f97w_2013_1_008763744_0 | 138,336 | 190,533 | 1.3773 |
04 Aug 2014 08:21:30 | 1243381 | 16671113 | hadam3p_eu_f97w_2013_1_008763744_0 | 126,816 | 175,270 | 1.3821 |
24 Jul 2014 16:52:11 | 1243381 | 16671113 | hadam3p_eu_f97w_2013_1_008763744_0 | 115,296 | 159,598 | 1.3842 |
05 Jul 2014 12:47:14 | 1243381 | 16671113 | hadam3p_eu_f97w_2013_1_008763744_0 | 103,776 | 144,071 | 1.3883 |
02 Jul 2014 21:13:53 | 1243381 | 16671113 | hadam3p_eu_f97w_2013_1_008763744_0 | 92,256 | 128,654 | 1.3945 |
01 Jul 2014 07:56:08 | 1243381 | 16671113 | hadam3p_eu_f97w_2013_1_008763744_0 | 80,736 | 113,155 | 1.4015 |
27 Jun 2014 18:30:59 | 1243381 | 16671113 | hadam3p_eu_f97w_2013_1_008763744_0 | 69,216 | 97,780 | 1.4127 |
21 Jun 2014 17:06:49 | 1243381 | 16671113 | hadam3p_eu_f97w_2013_1_008763744_0 | 57,696 | 81,712 | 1.4163 |
20 Jun 2014 16:53:01 | 1243381 | 16671113 | hadam3p_eu_f97w_2013_1_008763744_0 | 46,176 | 65,039 | 1.4085 |
20 Jun 2014 12:17:08 | 1243381 | 16671113 | hadam3p_eu_f97w_2013_1_008763744_0 | 34,656 | 48,490 | 1.3992 |
19 Jun 2014 17:08:11 | 1243381 | 16671113 | hadam3p_eu_f97w_2013_1_008763744_0 | 23,136 | 31,895 | 1.3786 |
19 Jun 2014 12:22:12 | 1243381 | 16671113 | hadam3p_eu_f97w_2013_1_008763744_0 | 11,616 | 16,400 | 1.4118 |
©2024 cpdn.org