Name | hadcm3n_48kj_1940_40_008300921_2 |
Workunit | 8452056 |
Created | 30 May 2013, 16:30:01 UTC |
Sent | 20 Jun 2013, 6:18:51 UTC |
Report deadline | 19 Sep 2013, 13:46:02 UTC |
Received | 9 Jul 2013, 16:19:01 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1209437 |
Run time | 12 days 20 hours 24 min 54 sec |
CPU time | 12 days 4 hours 59 min |
Validate state | Invalid |
Credit | 12,130.56 |
Device peak FLOPS | 3.33 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.0.64</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 00:55:08 (12644): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
00:55:09 (12644): No heartbeat from core client for 30 sec - exiting 00:55:10 (12644): No heartbeat from core client for 30 sec - exiting 00:55:11 (12644): No heartbeat from core client for 30 sec - exiting 00:55:12 (12644): No heartbeat from core client for 30 sec - exiting 00:55:13 (12644): No heartbeat from core client for 30 sec - exiting 00:55:14 (12644): No heartbeat from core client for 30 sec - exiting 00:55:15 (12644): No heartbeat from core client for 30 sec - exiting 00:55:16 (12644): No heartbeat from core client for 30 sec - exiting 00:55:17 (12644): No heartbeat from core client for 30 sec - exiting 00:55:18 (12644): No heartbeat from core client for 30 sec - exiting 02:02:58 (13340): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:02:59 (13340): No heartbeat from core client for 30 sec - exiting 02:03:00 (13340): No heartbeat from core client for 30 sec - exiting 02:03:01 (13340): No heartbeat from core client for 30 sec - exiting 02:03:02 (13340): No heartbeat from core client for 30 sec - exiting 02:03:03 (13340): No heartbeat from core client for 30 sec - exiting 02:03:04 (13340): No heartbeat from core client for 30 sec - exiting 02:03:05 (13340): No heartbeat from core client for 30 sec - exiting 02:03:06 (13340): No heartbeat from core client for 30 sec - exiting 02:03:07 (13340): No heartbeat from core client for 30 sec - exiting 02:03:08 (13340): No heartbeat from core client for 30 sec - exiting 02:55:07 (12332): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
02:55:08 (12332): No heartbeat from core client for 30 sec - exiting 02:55:09 (12332): No heartbeat from core client for 30 sec - exiting 02:55:10 (12332): No heartbeat from core client for 30 sec - exiting 02:55:11 (12332): No heartbeat from core client for 30 sec - exiting 02:55:12 (12332): No heartbeat from core client for 30 sec - exiting 02:55:13 (12332): No heartbeat from core client for 30 sec - exiting 02:55:14 (12332): No heartbeat from core client for 30 sec - exiting 02:55:15 (12332): No heartbeat from core client for 30 sec - exiting 03:02:34 (16256): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:02:35 (16256): No heartbeat from core client for 30 sec - exiting 03:02:36 (16256): No heartbeat from core client for 30 sec - exiting 03:02:37 (16256): No heartbeat from core client for 30 sec - exiting 03:02:38 (16256): No heartbeat from core client for 30 sec - exiting 03:02:39 (16256): No heartbeat from core client for 30 sec - exiting 03:02:40 (16256): No heartbeat from core client for 30 sec - exiting 03:02:41 (16256): No heartbeat from core client for 30 sec - exiting 03:02:42 (16256): No heartbeat from core client for 30 sec - exiting 03:02:43 (16256): No heartbeat from core client for 30 sec - exiting 03:02:44 (16256): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 
CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 16:26:21 (20672): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:26:22 (20672): No heartbeat from core client for 30 sec - exiting 16:26:23 (20672): No heartbeat from core client for 30 sec - exiting 16:26:24 (20672): No heartbeat from core client for 30 sec - exiting 16:26:25 (20672): No heartbeat from core client for 30 sec - exiting 16:26:26 (20672): No heartbeat from core client for 30 sec - exiting 16:26:27 (20672): No heartbeat from core client for 30 sec - exiting 16:26:28 (20672): No heartbeat from core client for 30 sec - exiting 16:26:30 (20672): No heartbeat from core client for 30 sec - exiting 16:26:31 (20672): No heartbeat from core client for 30 sec - exiting 16:26:32 (20672): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 
CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=13280, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... 10:00:29 (4384): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:00:30 (4384): No heartbeat from core client for 30 sec - exiting 10:00:31 (4384): No heartbeat from core client for 30 sec - exiting 10:00:32 (4384): No heartbeat from core client for 30 sec - exiting 10:00:33 (4384): No heartbeat from core client for 30 sec - exiting 10:00:34 (4384): No heartbeat from core client for 30 sec - exiting 10:00:35 (4384): No heartbeat from core client for 30 sec - exiting 10:00:36 (4384): No heartbeat from core client for 30 sec - exiting 10:00:37 (4384): No heartbeat from core client for 30 sec - exiting 10:00:38 (4384): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... 10:06:58 (7052): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
10:06:59 (7052): No heartbeat from core client for 30 sec - exiting 10:07:00 (7052): No heartbeat from core client for 30 sec - exiting 10:07:01 (7052): No heartbeat from core client for 30 sec - exiting 10:07:02 (7052): No heartbeat from core client for 30 sec - exiting 10:07:03 (7052): No heartbeat from core client for 30 sec - exiting 10:07:04 (7052): No heartbeat from core client for 30 sec - exiting 10:07:05 (7052): No heartbeat from core client for 30 sec - exiting 10:07:06 (7052): No heartbeat from core client for 30 sec - exiting 10:07:07 (7052): No heartbeat from core client for 30 sec - exiting 10:07:08 (7052): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048 Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048 Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048 Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048 Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048 Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048 Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
09 Jul 2013 10:13:01 | 1209437 | 15807949 | hadcm3n_48kj_1940_40_008300921_2 | 1,010,880 | 1,094,710 | 1.0829 |
09 Jul 2013 01:24:36 | 1209437 | 15807949 | hadcm3n_48kj_1940_40_008300921_2 | 984,960 | 1,063,222 | 1.0795 |
08 Jul 2013 11:28:06 | 1209437 | 15807949 | hadcm3n_48kj_1940_40_008300921_2 | 959,040 | 1,035,027 | 1.0792 |
08 Jul 2013 04:26:21 | 1209437 | 15807949 | hadcm3n_48kj_1940_40_008300921_2 | 933,120 | 1,007,774 | 1.0800 |
07 Jul 2013 13:31:56 | 1209437 | 15807949 | hadcm3n_48kj_1940_40_008300921_2 | 907,200 | 978,858 | 1.0790 |
07 Jul 2013 05:10:23 | 1209437 | 15807949 | hadcm3n_48kj_1940_40_008300921_2 | 881,280 | 949,352 | 1.0772 |
06 Jul 2013 14:29:00 | 1209437 | 15807949 | hadcm3n_48kj_1940_40_008300921_2 | 855,360 | 920,014 | 1.0756 |
06 Jul 2013 05:03:49 | 1209437 | 15807949 | hadcm3n_48kj_1940_40_008300921_2 | 829,440 | 890,657 | 1.0738 |
06 Jul 2013 04:34:52 | 1209437 | 15807949 | hadcm3n_48kj_1940_40_008300921_2 | 803,520 | 861,375 | 1.0720 |
04 Jul 2013 14:28:29 | 1209437 | 15807949 | hadcm3n_48kj_1940_40_008300921_2 | 777,600 | 831,953 | 1.0699 |
04 Jul 2013 14:21:22 | 1209437 | 15807949 | hadcm3n_48kj_1940_40_008300921_2 | 751,680 | 801,755 | 1.0666 |
03 Jul 2013 14:03:03 | 1209437 | 15807949 | hadcm3n_48kj_1940_40_008300921_2 | 725,760 | 771,369 | 1.0628 |
03 Jul 2013 05:34:35 | 1209437 | 15807949 | hadcm3n_48kj_1940_40_008300921_2 | 699,840 | 741,408 | 1.0594 |
02 Jul 2013 15:05:44 | 1209437 | 15807949 | hadcm3n_48kj_1940_40_008300921_2 | 673,920 | 711,644 | 1.0560 |
02 Jul 2013 12:05:35 | 1209437 | 15807949 | hadcm3n_48kj_1940_40_008300921_2 | 648,000 | 681,901 | 1.0523 |
02 Jul 2013 11:49:30 | 1209437 | 15807949 | hadcm3n_48kj_1940_40_008300921_2 | 622,080 | 652,152 | 1.0483 |
02 Jul 2013 11:18:29 | 1209437 | 15807949 | hadcm3n_48kj_1940_40_008300921_2 | 596,160 | 622,746 | 1.0446 |
02 Jul 2013 11:03:23 | 1209437 | 15807949 | hadcm3n_48kj_1940_40_008300921_2 | 570,240 | 593,101 | 1.0401 |
02 Jul 2013 10:35:56 | 1209437 | 15807949 | hadcm3n_48kj_1940_40_008300921_2 | 544,320 | 563,456 | 1.0352 |
02 Jul 2013 10:26:42 | 1209437 | 15807949 | hadcm3n_48kj_1940_40_008300921_2 | 518,400 | 541,174 | 1.0439 |
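
The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the cumulative timestep count, rounded to four decimal places. The sketch below checks that reading against three rows copied from the table. The function name `average_sec_per_timestep` is hypothetical and is not part of BOINC or the CPDN software; the formula is an inference from the listed values, not something the page states.

```python
def average_sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Cumulative CPU seconds per model timestep, rounded to 4 decimals.

    Assumption: this is how the 'Average (sec/TS)' column is derived.
    """
    return round(cpu_time_sec / timestep, 4)


# (timestep, CPU time in seconds, average listed in the table)
rows = [
    (1_010_880, 1_094_710, 1.0829),  # 09 Jul 2013 10:13:01
    (984_960, 1_063_222, 1.0795),    # 09 Jul 2013 01:24:36
    (518_400, 541_174, 1.0439),      # 02 Jul 2013 10:26:42
]

for timestep, cpu_time_sec, listed in rows:
    computed = average_sec_per_timestep(cpu_time_sec, timestep)
    print(f"timestep={timestep:>9,}  computed={computed:.4f}  listed={listed:.4f}")
```

Because the figure is cumulative, its gradual rise from 1.0439 to 1.0829 suggests that later timesteps took somewhat more CPU time each than earlier ones.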