Name | hadcm3n_zkvs_1880_40_008253876_1 |
Workunit | 8409000 |
Created | 26 Nov 2012, 13:15:01 UTC |
Sent | 26 Nov 2012, 13:16:10 UTC |
Report deadline | 25 Feb 2013, 20:43:21 UTC |
Received | 16 Dec 2012, 18:49:26 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 193 (0x000000C1) EXIT_SIGNAL |
Computer ID | 1104109 |
Run time | 11 days 12 hours 22 min 13 sec |
CPU time | 9 days 8 hours 11 min 40 sec |
Validate state | Invalid |
Credit | 6,220.80 |
Device peak FLOPS | 2.47 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.0.28</core_client_version> <![CDATA[ <message> - exit code 193 (0xc1) </message> <stderr_txt> Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=10136, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1944, iMonCtr=1 Model crash detected, will try to restart... 12:39:36 (3884): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:43:16 (10320): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 20:47:26 (8280): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 08:02:16 (4868): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... 08:02:19 (4868): No heartbeat from core client for 30 sec - exiting 08:02:20 (4868): No heartbeat from core client for 30 sec - exiting 08:02:21 (4868): No heartbeat from core client for 30 sec - exiting 08:02:22 (4868): No heartbeat from core client for 30 sec - exiting 08:02:23 (4868): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... 14:06:42 (9980): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:06:44 (9980): No heartbeat from core client for 30 sec - exiting 14:06:45 (9980): No heartbeat from core client for 30 sec - exiting 14:06:46 (9980): No heartbeat from core client for 30 sec - exiting 15:08:17 (7400): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:08:19 (7400): No heartbeat from core client for 30 sec - exiting 15:08:20 (7400): No heartbeat from core client for 30 sec - exiting 15:12:40 (8788): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:13:02 (8788): No heartbeat from core client for 30 sec - exiting 15:13:03 (8788): No heartbeat from core client for 30 sec - exiting 15:13:04 (8788): No heartbeat from core client for 30 sec - exiting 15:13:05 (8788): No heartbeat from core client for 30 sec - exiting 15:13:06 (8788): No heartbeat from core client for 30 sec - exiting 15:13:07 (8788): No heartbeat from core client for 30 sec - exiting 15:13:08 (8788): No heartbeat from core client for 30 sec - exiting 15:13:09 (8788): No heartbeat from core client for 30 sec - exiting 15:13:10 (8788): No heartbeat from core client for 30 sec - exiting 15:13:11 (8788): No heartbeat from core client for 30 sec - exiting 15:13:12 (8788): No heartbeat from core client for 30 sec - exiting 15:13:13 (8788): No heartbeat from core client for 30 sec - exiting 15:13:14 (8788): No heartbeat from core client for 30 sec - exiting 15:13:16 (8788): No heartbeat from core client for 30 sec - exiting 15:13:17 (8788): No heartbeat from core client for 30 sec - exiting 15:13:18 (8788): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... 22:41:27 (9000): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:41:29 (9000): No heartbeat from core client for 30 sec - exiting 00:43:59 (3188): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:44:01 (3188): No heartbeat from core client for 30 sec - exiting 01:14:52 (4952): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:14:54 (4952): No heartbeat from core client for 30 sec - exiting 01:14:55 (4952): No heartbeat from core client for 30 sec - exiting 01:14:56 (4952): No heartbeat from core client for 30 sec - exiting 01:14:57 (4952): No heartbeat from core client for 30 sec - exiting 01:14:58 (4952): No heartbeat from core client for 30 sec - exiting 01:14:59 (4952): No heartbeat from core client for 30 sec - exiting 01:15:00 (4952): No heartbeat from core client for 30 sec - exiting 01:15:01 (4952): No heartbeat from core client for 30 sec - exiting 01:15:02 (4952): No heartbeat from core client for 30 sec - exiting 01:15:03 (4952): No heartbeat from core client for 30 sec - exiting 04:19:39 (8580): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:19:41 (8580): No heartbeat from core client for 30 sec - exiting 04:19:42 (8580): No heartbeat from core client for 30 sec - exiting 04:19:43 (8580): No heartbeat from core client for 30 sec - exiting 04:24:29 (2712): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:24:33 (2712): No heartbeat from core client for 30 sec - exiting 04:24:34 (2712): No heartbeat from core client for 30 sec - exiting 04:24:35 (2712): No heartbeat from core client for 30 sec - exiting 04:24:36 (2712): No heartbeat from core client for 30 sec - exiting 04:30:13 (9660): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:30:18 (9660): No heartbeat from core client for 30 sec - exiting 04:30:19 (9660): No heartbeat from core client for 30 sec - exiting 04:30:20 (9660): No heartbeat from core client for 30 sec - exiting 04:30:21 (9660): No heartbeat from core client for 30 sec - exiting 04:51:02 (4744): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:51:03 (4744): No heartbeat from core client for 30 sec - exiting 05:22:00 (7188): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 05:22:01 (7188): No heartbeat from core client for 30 sec - exiting 05:22:02 (7188): No heartbeat from core client for 30 sec - exiting 05:55:06 (10184): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 05:55:08 (10184): No heartbeat from core client for 30 sec - exiting 07:27:05 (5572): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:27:07 (5572): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 03:24:24 (7436): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... 03:24:26 (7436): No heartbeat from core client for 30 sec - exiting 20:23:29 (7324): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:23:30 (7324): No heartbeat from core client for 30 sec - exiting 20:26:33 (5304): Can't acquire lockfile (32) - waiting 35s 20:26:57 (3476): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Ocean Restart file copy failed on zkvsko.da993h0 Ocean Restart file copy failed on zkvsko.da993i0 Ocean Restart file copy failed on zkvsko.da993j0 Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 12:02:15 (6380): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... 12:02:16 (6380): No heartbeat from core client for 30 sec - exiting 12:02:17 (6380): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... Signal 11 received, exiting... Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
16 Dec 2012 10:49:57 | 1104109 | 15462217 | hadcm3n_zkvs_1880_40_008253876_1 | 518,400 | 807,094 | 1.5569 |
15 Dec 2012 06:58:14 | 1104109 | 15462217 | hadcm3n_zkvs_1880_40_008253876_1 | 492,480 | 763,916 | 1.5512 |
14 Dec 2012 12:45:44 | 1104109 | 15462217 | hadcm3n_zkvs_1880_40_008253876_1 | 466,560 | 722,388 | 1.5483 |
14 Dec 2012 12:45:44 | 1104109 | 15462217 | hadcm3n_zkvs_1880_40_008253876_1 | 440,640 | 682,911 | 1.5498 |
14 Dec 2012 12:45:44 | 1104109 | 15462217 | hadcm3n_zkvs_1880_40_008253876_1 | 414,720 | 642,058 | 1.5482 |
14 Dec 2012 12:45:44 | 1104109 | 15462217 | hadcm3n_zkvs_1880_40_008253876_1 | 388,800 | 600,819 | 1.5453 |
07 Dec 2012 03:01:01 | 1104109 | 15462217 | hadcm3n_zkvs_1880_40_008253876_1 | 362,880 | 559,162 | 1.5409 |
05 Dec 2012 04:08:28 | 1104109 | 15462217 | hadcm3n_zkvs_1880_40_008253876_1 | 336,960 | 518,064 | 1.5375 |
04 Dec 2012 13:03:07 | 1104109 | 15462217 | hadcm3n_zkvs_1880_40_008253876_1 | 311,040 | 476,962 | 1.5334 |
03 Dec 2012 19:16:42 | 1104109 | 15462217 | hadcm3n_zkvs_1880_40_008253876_1 | 285,120 | 435,720 | 1.5282 |
02 Dec 2012 13:34:30 | 1104109 | 15462217 | hadcm3n_zkvs_1880_40_008253876_1 | 259,200 | 395,929 | 1.5275 |
01 Dec 2012 22:01:18 | 1104109 | 15462217 | hadcm3n_zkvs_1880_40_008253876_1 | 233,280 | 356,912 | 1.5300 |
01 Dec 2012 09:36:26 | 1104109 | 15462217 | hadcm3n_zkvs_1880_40_008253876_1 | 207,360 | 318,008 | 1.5336 |
30 Nov 2012 20:37:40 | 1104109 | 15462217 | hadcm3n_zkvs_1880_40_008253876_1 | 181,440 | 278,054 | 1.5325 |
30 Nov 2012 06:46:25 | 1104109 | 15462217 | hadcm3n_zkvs_1880_40_008253876_1 | 155,520 | 238,263 | 1.5320 |
29 Nov 2012 16:58:49 | 1104109 | 15462217 | hadcm3n_zkvs_1880_40_008253876_1 | 129,600 | 197,603 | 1.5247 |
29 Nov 2012 03:36:30 | 1104109 | 15462217 | hadcm3n_zkvs_1880_40_008253876_1 | 103,680 | 158,352 | 1.5273 |
28 Nov 2012 09:52:01 | 1104109 | 15462217 | hadcm3n_zkvs_1880_40_008253876_1 | 77,760 | 119,108 | 1.5317 |
27 Nov 2012 19:31:46 | 1104109 | 15462217 | hadcm3n_zkvs_1880_40_008253876_1 | 51,840 | 79,902 | 1.5413 |
27 Nov 2012 04:02:40 | 1104109 | 15462217 | hadcm3n_zkvs_1880_40_008253876_1 | 25,920 | 40,064 | 1.5457 |
©2025 cpdn.org