Name | hadcm3n_ydj7_1980_40_008182483_0 |
Workunit | 8337607 |
Created | 3 Sep 2012, 12:36:30 UTC |
Sent | 3 Sep 2012, 16:54:41 UTC |
Report deadline | 4 Dec 2012, 0:21:52 UTC |
Received | 31 Oct 2012, 15:22:25 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 193 (0x000000C1) EXIT_SIGNAL |
Computer ID | 1049657 |
Run time | 27 days 2 hours 57 min 25 sec |
CPU time | 23 days 12 hours 34 min 46 sec |
Validate state | Invalid |
Credit | 12,441.60 |
Device peak FLOPS | 2.79 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.0.28</core_client_version> <![CDATA[ <message> - exit code 193 (0xc1) </message> <stderr_txt> 01:30:05 (8012): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:50:53 (6164): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Atmos Hold Restart file rename failed on atmos_restart.hold CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=372, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... 00:39:50 (6500): No heartbeat from core client for 30 sec - exiting 00:39:51 (6500): No heartbeat from core client for 30 sec - exiting 00:39:52 (6500): No heartbeat from core client for 30 sec - exiting 00:39:53 (6500): No heartbeat from core client for 30 sec - exiting 00:39:54 (6500): No heartbeat from core client for 30 sec - exiting 00:39:55 (6500): No heartbeat from core client for 30 sec - exiting 00:39:57 (6500): No heartbeat from core client for 30 sec - exiting 00:39:58 (6500): No heartbeat from core client for 30 sec - exiting 00:39:59 (6500): No heartbeat from core client for 30 sec - exiting 00:40:00 (6500): No heartbeat from core client for 30 sec - exiting 00:40:01 (6500): No heartbeat from core client for 30 sec - exiting 00:40:02 (6500): No heartbeat from core client for 30 sec - exiting 00:40:03 (6500): No heartbeat from core client for 30 sec - exiting 00:40:04 (6500): No heartbeat from core client for 30 sec - exiting 00:40:05 (6500): No heartbeat from core client for 30 sec - exiting 00:40:06 (6500): No heartbeat from core client for 30 sec - exiting 00:40:07 (6500): No heartbeat from core client for 30 sec - exiting 00:40:09 (6500): No heartbeat from core client for 30 sec - exiting 00:40:10 (6500): No heartbeat from core client for 30 sec - exiting 00:40:11 (6500): No heartbeat from core client for 30 sec - exiting 00:40:12 (6500): No heartbeat from core client for 30 sec - exiting 00:40:13 (6500): No heartbeat from core client for 30 sec - exiting 00:40:14 (6500): No heartbeat from core client for 30 sec - exiting 00:40:15 (6500): No heartbeat from core client for 30 sec - exiting 00:40:16 (6500): No heartbeat from core client for 30 sec - exiting 00:40:17 (6500): No heartbeat from core client for 30 sec - exiting 00:40:18 (6500): No heartbeat from core client for 30 sec - exiting 00:40:20 (6500): No heartbeat from core client for 30 sec - exiting 00:40:21 (6500): No heartbeat from core client for 30 sec - exiting 00:40:22 (6500): No heartbeat from core client for 30 sec - exiting 00:40:23 (6500): No heartbeat from core client for 30 sec - exiting 00:40:24 (6500): No heartbeat from core client for 30 sec - exiting 00:40:25 (6500): No heartbeat from core client for 30 sec - exiting 00:40:26 (6500): No heartbeat from core client for 30 sec - exiting 00:40:27 (6500): No heartbeat from core client for 30 sec - exiting 00:40:28 (6500): No heartbeat from core client for 30 sec - exiting 00:40:29 (6500): No heartbeat from core client for 30 sec - exiting 00:40:30 (6500): No heartbeat from core client for 30 sec - exiting 00:40:32 (6500): No heartbeat from core client for 30 sec - exiting 00:40:33 (6500): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:56:41 (7704): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 00:49:02 (8040): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:49:03 (8040): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... 11:19:43 (4336): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... BUFFIN: C I/O Error feof - Unit 63 - Return code = 16 BUFFIN: C I/O Error feof - Unit 64 - Return code = 16 BUFFIN: C I/O Error feof - Unit 65 - Return code = 16 BUFFIN: C I/O Error feof - Unit 66 - Return code = 16 BUFFIN: C I/O Error feof - Unit 67 - Return code = 16 BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 Error converting file to netcdf: dataout/ydj7ko.pjk5c10 Error converting file to netcdf: dataout/ydj7ko.pik5c10 Error converting file to netcdf: dataout/ydj7ko.pfk5c10 Error converting file to netcdf: dataout/ydj7ka.phk5c10 Error converting file to netcdf: dataout/ydj7ka.pgk5c10 Error converting file to netcdf: dataout/ydj7ka.pek5c10 Error converting file to netcdf: dataout/ydj7ka.pdk5c10 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 00:48:12 (9708): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:48:13 (9708): No heartbeat from core client for 30 sec - exiting 00:49:15 (6248): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:03:44 (9008): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:26:45 (3976): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:41:13 (5980): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:47:17 (9000): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:04:03 (9548): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:05:04 (9600): No heartbeat from core client for 30 sec - exiting 02:05:05 (9600): No heartbeat from core client for 30 sec - exiting 02:05:06 (9600): No heartbeat from core client for 30 sec - exiting 02:05:08 (9600): No heartbeat from core client for 30 sec - exiting 02:05:09 (9600): No heartbeat from core client for 30 sec - exiting 02:05:10 (9600): No heartbeat from core client for 30 sec - exiting 02:05:11 (9600): No heartbeat from core client for 30 sec - exiting 02:05:12 (9600): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 00:19:11 (6912): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:19:12 (6912): No heartbeat from core client for 30 sec - exiting 00:19:13 (6912): No heartbeat from core client for 30 sec - exiting 00:20:50 (8488): No heartbeat from core client for 30 sec - exiting 00:20:52 (8488): No heartbeat from core client for 30 sec - exiting 00:20:53 (8488): No heartbeat from core client for 30 sec - exiting 00:20:54 (8488): No heartbeat from core client for 30 sec - exiting 00:20:55 (8488): No heartbeat from core client for 30 sec - exiting 00:20:56 (8488): No heartbeat from core client for 30 sec - exiting 00:20:57 (8488): No heartbeat from core client for 30 sec - exiting 00:20:58 (8488): No heartbeat from core client for 30 sec - exiting 00:20:59 (8488): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 00:15:52 (6392): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 01:10:39 (2536): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:13:38 (7204): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Signal 11 received, exiting... Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
31 Oct 2012 15:26:16 | 1049657 | 15228714 | hadcm3n_ydj7_1980_40_008182483_0 | 1,036,800 | 2,032,475 | 1.9603 |
30 Oct 2012 23:27:42 | 1049657 | 15228714 | hadcm3n_ydj7_1980_40_008182483_0 | 1,010,880 | 1,984,042 | 1.9627 |
30 Oct 2012 07:44:52 | 1049657 | 15228714 | hadcm3n_ydj7_1980_40_008182483_0 | 984,960 | 1,934,739 | 1.9643 |
27 Oct 2012 22:57:55 | 1049657 | 15228714 | hadcm3n_ydj7_1980_40_008182483_0 | 959,040 | 1,886,551 | 1.9671 |
27 Oct 2012 04:03:21 | 1049657 | 15228714 | hadcm3n_ydj7_1980_40_008182483_0 | 933,120 | 1,839,827 | 1.9717 |
25 Oct 2012 23:27:04 | 1049657 | 15228714 | hadcm3n_ydj7_1980_40_008182483_0 | 907,200 | 1,788,570 | 1.9715 |
24 Oct 2012 06:31:12 | 1049657 | 15228714 | hadcm3n_ydj7_1980_40_008182483_0 | 881,280 | 1,737,623 | 1.9717 |
23 Oct 2012 13:48:50 | 1049657 | 15228714 | hadcm3n_ydj7_1980_40_008182483_0 | 855,360 | 1,686,956 | 1.9722 |
22 Oct 2012 21:55:26 | 1049657 | 15228714 | hadcm3n_ydj7_1980_40_008182483_0 | 829,440 | 1,636,706 | 1.9733 |
22 Oct 2012 04:32:10 | 1049657 | 15228714 | hadcm3n_ydj7_1980_40_008182483_0 | 803,520 | 1,585,096 | 1.9727 |
21 Oct 2012 07:39:50 | 1049657 | 15228714 | hadcm3n_ydj7_1980_40_008182483_0 | 777,600 | 1,533,427 | 1.9720 |
20 Oct 2012 13:37:07 | 1049657 | 15228714 | hadcm3n_ydj7_1980_40_008182483_0 | 751,680 | 1,480,995 | 1.9702 |
19 Oct 2012 17:12:59 | 1049657 | 15228714 | hadcm3n_ydj7_1980_40_008182483_0 | 725,760 | 1,427,492 | 1.9669 |
18 Oct 2012 23:25:08 | 1049657 | 15228714 | hadcm3n_ydj7_1980_40_008182483_0 | 699,840 | 1,375,257 | 1.9651 |
18 Oct 2012 04:31:26 | 1049657 | 15228714 | hadcm3n_ydj7_1980_40_008182483_0 | 673,920 | 1,322,728 | 1.9627 |
17 Oct 2012 12:25:36 | 1049657 | 15228714 | hadcm3n_ydj7_1980_40_008182483_0 | 648,000 | 1,272,459 | 1.9637 |
15 Oct 2012 23:48:02 | 1049657 | 15228714 | hadcm3n_ydj7_1980_40_008182483_0 | 622,080 | 1,222,792 | 1.9657 |
15 Oct 2012 07:40:30 | 1049657 | 15228714 | hadcm3n_ydj7_1980_40_008182483_0 | 596,160 | 1,171,080 | 1.9644 |
14 Oct 2012 14:57:14 | 1049657 | 15228714 | hadcm3n_ydj7_1980_40_008182483_0 | 570,240 | 1,121,666 | 1.9670 |
14 Oct 2012 00:10:06 | 1049657 | 15228714 | hadcm3n_ydj7_1980_40_008182483_0 | 544,320 | 1,070,203 | 1.9661 |
©2024 cpdn.org