Name | hadcm3n_o38x_2140_40_008269810_3 |
Workunit | 8424934 |
Created | 4 Jun 2013, 12:01:22 UTC |
Sent | 4 Jun 2013, 12:11:21 UTC |
Report deadline | 3 Sep 2013, 19:38:32 UTC |
Received | 1 Jul 2013, 17:16:42 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 25 (0x00000019) Unknown error code |
Computer ID | 1240097 |
Run time | 7 days 16 hours 37 min 35 sec |
CPU time | 7 days 13 hours 16 min 55 sec |
Validate state | Invalid |
Credit | 7,464.96 |
Device peak FLOPS | 3.63 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.0.64</core_client_version> <![CDATA[ <message> The drive cannot locate a specific area or track on the disk. (0x19) - exit code 25 (0x19) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 22:31:17 (5908): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5776, iMonCtr=1 Model crash detected, will try to restart... 08:42:12 (5904): No heartbeat from core client for 30 sec - exiting 08:42:13 (5904): No heartbeat from core client for 30 sec - exiting 08:42:14 (5904): No heartbeat from core client for 30 sec - exiting 08:42:15 (5904): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6724, iMonCtr=1 Model crash detected, will try to restart... 07:58:24 (6844): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 23:00:11 (6512): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:54:25 (4476): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 
07:56:00 (5132): No heartbeat from core client for 30 sec - exiting 07:56:01 (5132): No heartbeat from core client for 30 sec - exiting 07:56:02 (5132): No heartbeat from core client for 30 sec - exiting 07:56:03 (5132): No heartbeat from core client for 30 sec - exiting 07:56:04 (5132): No heartbeat from core client for 30 sec - exiting 07:56:05 (5132): No heartbeat from core client for 30 sec - exiting 07:56:06 (5132): No heartbeat from core client for 30 sec - exiting 07:56:07 (5132): No heartbeat from core client for 30 sec - exiting 07:56:08 (5132): No heartbeat from core client for 30 sec - exiting 07:56:09 (5132): No heartbeat from core client for 30 sec - exiting 07:56:10 (5132): No heartbeat from core client for 30 sec - exiting 07:56:11 (5132): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7104, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4404, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1572, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 
Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 08:01:34 (5516): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5816, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... MainError: 05:51:15 PM No files match the supplied pattern. MainError: 05:51:15 PM No files match the supplied pattern. Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5684, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 
Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... MainError: 03:24:37 PM No files match the supplied pattern. MainError: 03:24:37 PM No files match the supplied pattern. Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... MainError: 02:18:27 PM No files match the supplied pattern. MainError: 02:18:27 PM No files match the supplied pattern. Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5268, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... MainError: 04:11:53 PM No files match the supplied pattern. MainError: 04:11:53 PM No files match the supplied pattern. Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5836, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5844, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... MainError: 08:02:08 PM No files match the supplied pattern. MainError: 08:02:08 PM No files match the supplied pattern. 
Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4952, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
02 Jul 2013 10:56:52 | 1240097 | 15828651 | hadcm3n_o38x_2140_40_008269810_3 | 622,080 | 628,423 | 1.0102 |
02 Jul 2013 10:12:05 | 1240097 | 15828651 | hadcm3n_o38x_2140_40_008269810_3 | 596,160 | 601,406 | 1.0088 |
27 Jun 2013 14:21:20 | 1240097 | 15828651 | hadcm3n_o38x_2140_40_008269810_3 | 570,240 | 574,605 | 1.0077 |
26 Jun 2013 15:27:46 | 1240097 | 15828651 | hadcm3n_o38x_2140_40_008269810_3 | 544,320 | 547,283 | 1.0054 |
24 Jun 2013 17:58:28 | 1240097 | 15828651 | hadcm3n_o38x_2140_40_008269810_3 | 518,400 | 520,277 | 1.0036 |
23 Jun 2013 21:28:35 | 1240097 | 15828651 | hadcm3n_o38x_2140_40_008269810_3 | 492,480 | 493,548 | 1.0022 |
23 Jun 2013 14:04:02 | 1240097 | 15828651 | hadcm3n_o38x_2140_40_008269810_3 | 466,560 | 467,152 | 1.0013 |
21 Jun 2013 23:04:06 | 1240097 | 15828651 | hadcm3n_o38x_2140_40_008269810_3 | 440,640 | 441,120 | 1.0011 |
20 Jun 2013 07:47:33 | 1240097 | 15828651 | hadcm3n_o38x_2140_40_008269810_3 | 414,720 | 414,759 | 1.0001 |
19 Jun 2013 20:05:04 | 1240097 | 15828651 | hadcm3n_o38x_2140_40_008269810_3 | 388,800 | 387,530 | 0.9967 |
19 Jun 2013 12:17:59 | 1240097 | 15828651 | hadcm3n_o38x_2140_40_008269810_3 | 362,880 | 359,832 | 0.9916 |
19 Jun 2013 03:18:56 | 1240097 | 15828651 | hadcm3n_o38x_2140_40_008269810_3 | 336,960 | 332,523 | 0.9868 |
18 Jun 2013 19:27:33 | 1240097 | 15828651 | hadcm3n_o38x_2140_40_008269810_3 | 311,040 | 304,982 | 0.9805 |
18 Jun 2013 02:46:10 | 1240097 | 15828651 | hadcm3n_o38x_2140_40_008269810_3 | 285,120 | 278,293 | 0.9761 |
16 Jun 2013 18:45:46 | 1240097 | 15828651 | hadcm3n_o38x_2140_40_008269810_3 | 259,200 | 250,789 | 0.9676 |
15 Jun 2013 20:09:06 | 1240097 | 15828651 | hadcm3n_o38x_2140_40_008269810_3 | 233,280 | 223,436 | 0.9578 |
15 Jun 2013 12:06:07 | 1240097 | 15828651 | hadcm3n_o38x_2140_40_008269810_3 | 207,360 | 195,056 | 0.9407 |
14 Jun 2013 19:36:19 | 1240097 | 15828651 | hadcm3n_o38x_2140_40_008269810_3 | 181,440 | 165,882 | 0.9143 |
13 Jun 2013 23:52:50 | 1240097 | 15828651 | hadcm3n_o38x_2140_40_008269810_3 | 155,520 | 139,333 | 0.8959 |
13 Jun 2013 16:27:41 | 1240097 | 15828651 | hadcm3n_o38x_2140_40_008269810_3 | 129,600 | 117,789 | 0.9089 |
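
The Average (sec/TS) column appears to be the cumulative CPU time divided by the cumulative timestep count. The page does not state this definition, so treat it as an assumption. A minimal sketch that checks the relationship on three rows copied from the table above (the function name `seconds_per_timestep` is illustrative):

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_average).
ROWS = [
    (622_080, 628_423, 1.0102),
    (388_800, 387_530, 0.9967),
    (129_600, 117_789, 0.9089),
]


def seconds_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Cumulative CPU seconds per cumulative timestep (assumed definition)."""
    return cpu_time_sec / timestep


# Print the computed ratio next to the value reported on the page.
for timestep, cpu_time_sec, reported in ROWS:
    computed = seconds_per_timestep(cpu_time_sec, timestep)
    print(f"{timestep:>7}  computed={computed:.4f}  reported={reported:.4f}")
```

For these rows the computed ratio agrees with the reported value to four decimal places, which is consistent with the assumed definition.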
©2024 cpdn.org