Name | hadcm3n_480p_1940_40_008308710_4 |
Workunit | 8459845 |
Created | 22 Oct 2013, 12:44:58 UTC |
Sent | 22 Oct 2013, 12:45:02 UTC |
Report deadline | 21 Jan 2014, 20:12:13 UTC |
Received | 21 Feb 2014, 0:33:38 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 25 (0x00000019) Unknown error code |
Computer ID | 1212486 |
Run time | 30 days 12 hours 42 min 26 sec |
CPU time | 17 days 7 hours 55 min 58 sec |
Validate state | Invalid |
Credit | 8,398.08 |
Device peak FLOPS | 2.00 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.2.33</core_client_version> <![CDATA[ <message> The drive cannot locate a specific area or track on the disk. (0x19) - exit code 25 (0x19) </message> <stderr_txt> Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5740, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=804, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3408, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=13884, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CCPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=25244, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5700, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4052, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8480, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8772, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... 
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=14772, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7176, iMonCtr=1 Model crash detected, will try to restart... 00:30:43 (6564): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 15:08:41 (29324): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5388, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7992, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... 08:08:30 (1208): No heartbeat from core client for 30 sec - exiting 08:08:31 (1208): No heartbeat from core client for 30 sec - exiting 08:08:32 (1208): No heartbeat from core client for 30 sec - exiting 08:08:33 (1208): No heartbeat from core client for 30 sec - exiting 08:08:34 (1208): No heartbeat from core client for 30 sec - exiting 08:08:35 (1208): No heartbeat from core client for 30 sec - exiting 08:08:36 (1208): No heartbeat from core client for 30 sec - exiting 08:08:37 (1208): No heartbeat from core client for 30 sec - exiting 08:08:38 (1208): No heartbeat from core client for 30 sec - exiting 08:08:39 (1208): No heartbeat from core client for 30 sec - exiting 08:08:40 (1208): No heartbeat from core client for 30 sec - exiting 08:08:41 (1208): No heartbeat from core client for 30 sec - exiting 08:08:42 (1208): No heartbeat from core client for 30 sec - exiting 08:08:43 (1208): No heartbeat from core client for 30 sec - exiting 08:08:44 (1208): No heartbeat from core client for 30 sec - exiting 08:08:45 (1208): No heartbeat from 
core client for 30 sec - exiting 08:08:46 (1208): No heartbeat from core client for 30 sec - exiting 08:08:47 (1208): No heartbeat from core client for 30 sec - exiting 08:08:48 (1208): No heartbeat from core client for 30 sec - exiting 08:08:49 (1208): No heartbeat from core client for 30 sec - exiting 08:08:50 (1208): No heartbeat from core client for 30 sec - exiting 08:08:51 (1208): No heartbeat from core client for 30 sec - exiting 08:08:52 (1208): No heartbeat from core client for 30 sec - exiting 08:08:53 (1208): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9752, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9752, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... 20:10:44 (4116): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:40:53 (10396): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:25:43 (29036): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:36:23 (18652): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:43:18 (21148): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:46:21 (22500): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:29:40 (36340): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:31:18 (32392): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
19:33:35 (38780): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4028, iMonCtr=1 Model crash detected, will try to restart... 11:36:49 (6804): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:36:50 (6804): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8028, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 13:02:25 (7464): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2720, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 23:58:41 (4156): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
23:58:42 (4156): No heartbeat from core client for 30 sec - exiting 23:58:43 (4156): No heartbeat from core client for 30 sec - exiting 23:58:44 (4156): No heartbeat from core client for 30 sec - exiting 23:58:45 (4156): No heartbeat from core client for 30 sec - exiting 23:58:46 (4156): No heartbeat from core client for 30 sec - exiting 23:58:47 (4156): No heartbeat from core client for 30 sec - exiting 23:58:49 (4156): No heartbeat from core client for 30 sec - exiting 23:58:50 (4156): No heartbeat from core client for 30 sec - exiting 23:58:51 (4156): No heartbeat from core client for 30 sec - exiting 23:58:52 (4156): No heartbeat from core client for 30 sec - exiting 23:58:53 (4156): No heartbeat from core client for 30 sec - exiting 23:58:54 (4156): No heartbeat from core client for 30 sec - exiting 00:00:24 (5172): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:00:25 (5172): No heartbeat from core client for 30 sec - exiting 00:00:26 (5172): No heartbeat from core client for 30 sec - exiting 00:00:27 (5172): No heartbeat from core client for 30 sec - exiting 00:00:28 (5172): No heartbeat from core client for 30 sec - exiting 00:00:29 (5172): No heartbeat from core client for 30 sec - exiting 00:00:30 (5172): No heartbeat from core client for 30 sec - exiting 00:00:31 (5172): No heartbeat from core client for 30 sec - exiting 00:00:32 (5172): No heartbeat from core client for 30 sec - exiting 00:00:33 (5172): No heartbeat from core client for 30 sec - exiting 00:00:34 (5172): No heartbeat from core client for 30 sec - exiting 00:00:35 (5172): No heartbeat from core client for 30 sec - exiting 00:00:36 (5172): No heartbeat from core client for 30 sec - exiting 00:01:44 (7804): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
00:01:45 (7804): No heartbeat from core client for 30 sec - exiting 00:01:46 (7804): No heartbeat from core client for 30 sec - exiting 00:01:47 (7804): No heartbeat from core client for 30 sec - exiting 00:01:48 (7804): No heartbeat from core client for 30 sec - exiting 00:01:49 (7804): No heartbeat from core client for 30 sec - exiting 00:01:50 (7804): No heartbeat from core client for 30 sec - exiting 00:01:51 (7804): No heartbeat from core client for 30 sec - exiting 00:01:52 (7804): No heartbeat from core client for 30 sec - exiting 00:01:53 (7804): No heartbeat from core client for 30 sec - exiting 00:01:54 (7804): No heartbeat from core client for 30 sec - exiting 00:03:54 (7256): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:03:55 (7256): No heartbeat from core client for 30 sec - exiting 00:03:56 (7256): No heartbeat from core client for 30 sec - exiting 00:03:57 (7256): No heartbeat from core client for 30 sec - exiting 00:03:58 (7256): No heartbeat from core client for 30 sec - exiting 00:03:59 (7256): No heartbeat from core client for 30 sec - exiting 00:04:00 (7256): No heartbeat from core client for 30 sec - exiting 00:04:01 (7256): No heartbeat from core client for 30 sec - exiting 00:04:02 (7256): No heartbeat from core client for 30 sec - exiting 00:04:03 (7256): No heartbeat from core client for 30 sec - exiting 00:04:04 (7256): No heartbeat from core client for 30 sec - exiting Called boinc_finish </stderr_txt> ]]> |
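The stderr text above is one long run of a few recurring messages, so a per-message tally is easier to read than the raw log. The following is a minimal sketch, not part of the CPDN or BOINC tooling: the `PATTERNS` labels and the `tally` helper are hypothetical names chosen for illustration, and `sample` is a short excerpt stitched from non-contiguous fragments of the log rather than the full text.

```python
import re
from collections import Counter

# Short excerpt assembled from fragments of the stderr text above
# (illustrative only; not contiguous and not the full log).
sample = (
    "Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, "
    "selfPID=5740, iMonCtr=1 Model crash detected, will try to restart... "
    "CPDN Monitor - Quit request from BOINC... "
    "CPDN Monitor - Quit request from BOINC... "
    "Suspended CPDN Monitor - Suspend request from BOINC... "
    "00:30:43 (6564): No heartbeat from core client for 30 sec - exiting "
    "CPDN Monitor - No 'heartbeat' from BOINC... "
)

# Hypothetical labels mapped to substrings that appear verbatim in the log.
PATTERNS = {
    "model_crash": r"Model crash detected",
    "quit_request": r"Quit request from BOINC",
    "suspend_request": r"Suspend request from BOINC",
    "no_heartbeat": r"No heartbeat from core client",
}

def tally(text):
    """Count non-overlapping occurrences of each known message in text."""
    return Counter({name: len(re.findall(pat, text)) for name, pat in PATTERNS.items()})

counts = tally(sample)
for name, n in counts.items():
    print(f"{name}: {n}")
```

Passing the full contents of the `<stderr_txt>` element to `tally` instead of `sample` would give the totals for this task.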
Latest Trickles Received
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
15 Jan 2014 20:00:19 | 1212486 | 16072090 | hadcm3n_480p_1940_40_008308710_4 | 699,840 | 1,457,621 | 2.0828 |
13 Jan 2014 01:36:22 | 1212486 | 16072090 | hadcm3n_480p_1940_40_008308710_4 | 673,920 | 1,405,575 | 2.0857 |
12 Jan 2014 17:03:24 | 1212486 | 16072090 | hadcm3n_480p_1940_40_008308710_4 | 648,000 | 1,353,581 | 2.0889 |
12 Jan 2014 17:03:24 | 1212486 | 16072090 | hadcm3n_480p_1940_40_008308710_4 | 622,080 | 1,304,097 | 2.0963 |
30 Dec 2013 18:16:04 | 1212486 | 16072090 | hadcm3n_480p_1940_40_008308710_4 | 596,160 | 1,247,712 | 2.0929 |
30 Dec 2013 13:08:03 | 1212486 | 16072090 | hadcm3n_480p_1940_40_008308710_4 | 570,240 | 1,199,174 | 2.1029 |
26 Dec 2013 14:00:12 | 1212486 | 16072090 | hadcm3n_480p_1940_40_008308710_4 | 544,320 | 1,144,982 | 2.1035 |
26 Dec 2013 14:00:12 | 1212486 | 16072090 | hadcm3n_480p_1940_40_008308710_4 | 518,400 | 1,088,872 | 2.1004 |
26 Dec 2013 14:00:12 | 1212486 | 16072090 | hadcm3n_480p_1940_40_008308710_4 | 492,480 | 1,035,074 | 2.1018 |
16 Dec 2013 12:38:38 | 1212486 | 16072090 | hadcm3n_480p_1940_40_008308710_4 | 466,560 | 977,181 | 2.0944 |
16 Dec 2013 12:38:38 | 1212486 | 16072090 | hadcm3n_480p_1940_40_008308710_4 | 440,640 | 921,596 | 2.0915 |
12 Dec 2013 12:37:27 | 1212486 | 16072090 | hadcm3n_480p_1940_40_008308710_4 | 414,720 | 863,845 | 2.0830 |
06 Dec 2013 14:41:50 | 1212486 | 16072090 | hadcm3n_480p_1940_40_008308710_4 | 388,800 | 805,403 | 2.0715 |
25 Nov 2013 15:12:49 | 1212486 | 16072090 | hadcm3n_480p_1940_40_008308710_4 | 362,880 | 747,536 | 2.0600 |
24 Nov 2013 16:32:56 | 1212486 | 16072090 | hadcm3n_480p_1940_40_008308710_4 | 336,960 | 695,110 | 2.0629 |
20 Nov 2013 20:10:03 | 1212486 | 16072090 | hadcm3n_480p_1940_40_008308710_4 | 311,040 | 645,478 | 2.0752 |
18 Nov 2013 10:15:06 | 1212486 | 16072090 | hadcm3n_480p_1940_40_008308710_4 | 285,120 | 580,854 | 2.0372 |
15 Nov 2013 15:33:28 | 1212486 | 16072090 | hadcm3n_480p_1940_40_008308710_4 | 259,200 | 527,803 | 2.0363 |
11 Nov 2013 14:45:59 | 1212486 | 16072090 | hadcm3n_480p_1940_40_008308710_4 | 233,280 | 467,204 | 2.0028 |
07 Nov 2013 16:46:30 | 1212486 | 16072090 | hadcm3n_480p_1940_40_008308710_4 | 207,360 | 407,428 | 1.9648 |
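The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the timestep count. The sketch below checks that reading against three rows copied from the table; the `sec_per_timestep` helper is a hypothetical name, and the formula is inferred from the numbers, not taken from CPDN documentation.

```python
# Rows copied from the trickle table: (timestep, cumulative CPU seconds, reported sec/TS).
rows = [
    (699_840, 1_457_621, 2.0828),
    (673_920, 1_405_575, 2.0857),
    (207_360, 407_428, 1.9648),
]

def sec_per_timestep(cpu_seconds, timestep):
    """Average CPU seconds per model timestep (assumed definition of the column)."""
    return cpu_seconds / timestep

for timestep, cpu_seconds, reported in rows:
    computed = sec_per_timestep(cpu_seconds, timestep)
    # Print the recomputed value next to the reported one for comparison.
    print(f"{timestep:>7}  computed={computed:.4f}  reported={reported:.4f}")
```

Consecutive trickles in the table are 25,920 timesteps apart, so the same division applied to the differences between adjacent rows gives the cost of each individual segment, not the running average.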
©2024 cpdn.org