Name | hadcm3n_3kuh_1940_40_008266172_3 |
Workunit | 8421296 |
Created | 3 May 2013, 22:57:07 UTC |
Sent | 3 May 2013, 22:57:10 UTC |
Report deadline | 3 Aug 2013, 6:24:21 UTC |
Received | 25 Jun 2013, 11:55:49 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1077037 |
Run time | 30 days 4 hours 55 min 59 sec |
CPU time | 28 days 9 hours 34 min 22 sec |
Validate state | Invalid |
Credit | 6,220.80 |
Device peak FLOPS | 1.42 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.0.28</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 23:15:33 (11796): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:15:34 (11796): No heartbeat from core client for 30 sec - exiting 23:15:35 (11796): No heartbeat from core client for 30 sec - exiting 23:15:36 (11796): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... 19:15:12 (12036): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 23:49:34 (11236): No heartbeat from core client for 30 sec - exiting 23:49:35 (11236): No heartbeat from core client for 30 sec - exiting 23:49:36 (11236): No heartbeat from core client for 30 sec - exiting 23:49:37 (11236): No heartbeat from core client for 30 sec - exiting 23:49:38 (11236): No heartbeat from core client for 30 sec - exiting 23:49:39 (11236): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... zip error: Could not create output file (was replacing the original zip file) 09:56:22 (6344): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 22:36:52 (10132): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 01:50:11 (6320): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 01:30:11 (10856): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:30:12 (10856): No heartbeat from core client for 30 sec - exiting 02:31:51 (4220): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 10:27:42 (12480): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:27:44 (12480): No heartbeat from core client for 30 sec - exiting 10:27:45 (12480): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 11:03:06 (7264): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 02:02:40 (11680): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 01:12:40 (2808): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:12:42 (2808): No heartbeat from core client for 30 sec - exiting 06:04:02 (4024): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 23:18:29 (6840): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:18:31 (6840): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 04:03:17 (2360): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 20:20:50 (5660): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 07:01:56 (7132): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 23:50:59 (4456): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:51:00 (4456): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 00:58:34 (8920): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 06:54:29 (9296): No heartbeat from core client for 30 sec - exiting 06:54:30 (9296): No heartbeat from core client for 30 sec - exiting 06:54:31 (9296): No heartbeat from core client for 30 sec - exiting 06:54:32 (9296): No heartbeat from core client for 30 sec - exiting 06:54:33 (9296): No heartbeat from core client for 30 sec - exiting 06:54:34 (9296): No heartbeat from core client for 30 sec - exiting 06:54:35 (9296): No heartbeat from core client for 30 sec - exiting 06:54:36 (9296): No heartbeat from core client for 30 sec - exiting 06:54:37 (9296): No heartbeat from core client for 30 sec - exiting 06:54:38 (9296): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... zip error: Could not create output file (was replacing the original zip file) cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_3kuh_1940_40_008266172/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_3kuh_1940_40_008266172/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_3kuh_1940_40_008266172/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_3kuh_1940_40_008266172/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_3kuh_1940_40_008266172/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_3kuh_1940_40_008266172/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_3kuh_1940_40_008266172/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_3kuh_1940_40_008266172/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_3kuh_1940_40_008266172/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_3kuh_1940_40_008266172/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_3kuh_1940_40_008266172/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_3kuh_1940_40_008266172/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
25 Jun 2013 10:57:11 | 1077037 | 15761515 | hadcm3n_3kuh_1940_40_008266172_3 | 518,400 | 2,453,833 | 4.7335 |
22 Jun 2013 14:28:46 | 1077037 | 15761515 | hadcm3n_3kuh_1940_40_008266172_3 | 492,480 | 2,322,006 | 4.7149 |
19 Jun 2013 12:23:02 | 1077037 | 15761515 | hadcm3n_3kuh_1940_40_008266172_3 | 466,560 | 2,196,715 | 4.7083 |
15 Jun 2013 09:55:06 | 1077037 | 15761515 | hadcm3n_3kuh_1940_40_008266172_3 | 440,640 | 2,058,676 | 4.6720 |
11 Jun 2013 10:03:49 | 1077037 | 15761515 | hadcm3n_3kuh_1940_40_008266172_3 | 414,720 | 1,920,078 | 4.6298 |
07 Jun 2013 06:44:33 | 1077037 | 15761515 | hadcm3n_3kuh_1940_40_008266172_3 | 388,800 | 1,785,116 | 4.5913 |
02 Jun 2013 14:35:48 | 1077037 | 15761515 | hadcm3n_3kuh_1940_40_008266172_3 | 362,880 | 1,653,450 | 4.5565 |
29 May 2013 08:02:41 | 1077037 | 15761515 | hadcm3n_3kuh_1940_40_008266172_3 | 336,960 | 1,515,966 | 4.4989 |
26 May 2013 16:47:45 | 1077037 | 15761515 | hadcm3n_3kuh_1940_40_008266172_3 | 311,040 | 1,402,047 | 4.5076 |
24 May 2013 16:48:41 | 1077037 | 15761515 | hadcm3n_3kuh_1940_40_008266172_3 | 285,120 | 1,289,377 | 4.5222 |
21 May 2013 03:51:11 | 1077037 | 15761515 | hadcm3n_3kuh_1940_40_008266172_3 | 259,200 | 1,162,572 | 4.4852 |
19 May 2013 05:15:59 | 1077037 | 15761515 | hadcm3n_3kuh_1940_40_008266172_3 | 233,280 | 1,042,379 | 4.4684 |
17 May 2013 04:28:47 | 1077037 | 15761515 | hadcm3n_3kuh_1940_40_008266172_3 | 207,360 | 921,399 | 4.4435 |
15 May 2013 16:03:12 | 1077037 | 15761515 | hadcm3n_3kuh_1940_40_008266172_3 | 181,440 | 801,028 | 4.4148 |
13 May 2013 18:38:10 | 1077037 | 15761515 | hadcm3n_3kuh_1940_40_008266172_3 | 155,520 | 692,670 | 4.4539 |
12 May 2013 01:03:11 | 1077037 | 15761515 | hadcm3n_3kuh_1940_40_008266172_3 | 129,600 | 573,133 | 4.4223 |
09 May 2013 22:38:10 | 1077037 | 15761515 | hadcm3n_3kuh_1940_40_008266172_3 | 103,680 | 449,429 | 4.3348 |
08 May 2013 05:43:24 | 1077037 | 15761515 | hadcm3n_3kuh_1940_40_008266172_3 | 77,760 | 336,534 | 4.3279 |
06 May 2013 17:50:48 | 1077037 | 15761515 | hadcm3n_3kuh_1940_40_008266172_3 | 51,840 | 222,969 | 4.3011 |
05 May 2013 10:25:22 | 1077037 | 15761515 | hadcm3n_3kuh_1940_40_008266172_3 | 25,920 | 114,452 | 4.4156 |
©2024 climateprediction.net