Name | hadcm3n_u69a_1980_40_007832165_2 |
Workunit | 7987277 |
Created | 18 Mar 2012, 16:15:30 UTC |
Sent | 18 Mar 2012, 16:15:43 UTC |
Report deadline | 17 Jun 2012, 23:42:54 UTC |
Received | 9 Apr 2012, 1:23:31 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1189010 |
Run time | 10 days 23 hours 8 min 54 sec |
CPU time | 10 days 16 hours 23 min 47 sec |
Validate state | Invalid |
Credit | 6,220.80 |
Device peak FLOPS | 2.99 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-pc-linux-gnu |
Stderr | <core_client_version>6.12.34</core_client_version> <![CDATA[ <message> process exited with code 22 (0x16, -234) </message> <stderr_txt> 01:59:55 (9364): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:44:54 (9397): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:59:05 (9438): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:03:00 (9475): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:04:14 (9515): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:05:45 (9551): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:10:57 (9587): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:08:40 (9623): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:09:55 (9661): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:11:28 (9701): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:12:25 (9737): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:13:07 (24713): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:14:20 (24747): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:25:09 (24791): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:26:15 (24848): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:16:14 (24900): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
03:17:52 (25056): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:19:34 (25098): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:21:10 (25136): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:22:39 (25172): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:57:33 (25210): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:12:55 (25331): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:20:38 (27332): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:21:49 (27904): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:23:10 (27938): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:25:07 (27972): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:29:59 (28006): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:31:10 (28320): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:32:23 (28354): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:34:34 (28388): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:08:01 (2801): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:19:02 (7037): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:27:27 (7072): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
08:30:29 (11533): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:37:00 (11569): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Signal 15 received, exiting... Called boinc_finish 03:01:17 (2562): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:02:32 (9644): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:05:22 (9674): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Signal 15 received, exiting... Called boinc_finish Signal 15 received, exiting... Called boinc_finish Suspended CPDN Monitor - Suspend request from BOINC... 02:08:10 (2370): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... cpdnmonitor: cannot open input file /usr/BOINC/projects/climateprediction.net/hadcm3n_u69a_1980_40_007832165/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file /usr/BOINC/projects/climateprediction.net/hadcm3n_u69a_1980_40_007832165/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file /usr/BOINC/projects/climateprediction.net/hadcm3n_u69a_1980_40_007832165/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file /usr/BOINC/projects/climateprediction.net/hadcm3n_u69a_1980_40_007832165/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file /usr/BOINC/projects/climateprediction.net/hadcm3n_u69a_1980_40_007832165/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file /usr/BOINC/projects/climateprediction.net/hadcm3n_u69a_1980_40_007832165/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input 
file /usr/BOINC/projects/climateprediction.net/hadcm3n_u69a_1980_40_007832165/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file /usr/BOINC/projects/climateprediction.net/hadcm3n_u69a_1980_40_007832165/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file /usr/BOINC/projects/climateprediction.net/hadcm3n_u69a_1980_40_007832165/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file /usr/BOINC/projects/climateprediction.net/hadcm3n_u69a_1980_40_007832165/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file /usr/BOINC/projects/climateprediction.net/hadcm3n_u69a_1980_40_007832165/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file /usr/BOINC/projects/climateprediction.net/hadcm3n_u69a_1980_40_007832165/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
09 Apr 2012 01:26:30 | 1189010 | 14283494 | hadcm3n_u69a_1980_40_007832165_2 | 518,400 | 923,051 | 1.7806 |
08 Apr 2012 11:57:17 | 1189010 | 14283494 | hadcm3n_u69a_1980_40_007832165_2 | 492,480 | 876,952 | 1.7807 |
07 Apr 2012 22:57:59 | 1189010 | 14283494 | hadcm3n_u69a_1980_40_007832165_2 | 466,560 | 830,846 | 1.7808 |
07 Apr 2012 09:22:40 | 1189010 | 14283494 | hadcm3n_u69a_1980_40_007832165_2 | 440,640 | 784,351 | 1.7800 |
06 Apr 2012 20:48:36 | 1189010 | 14283494 | hadcm3n_u69a_1980_40_007832165_2 | 414,720 | 737,776 | 1.7790 |
06 Apr 2012 06:38:47 | 1189010 | 14283494 | hadcm3n_u69a_1980_40_007832165_2 | 388,800 | 691,084 | 1.7775 |
05 Apr 2012 17:32:00 | 1189010 | 14283494 | hadcm3n_u69a_1980_40_007832165_2 | 362,880 | 644,825 | 1.7770 |
05 Apr 2012 04:21:11 | 1189010 | 14283494 | hadcm3n_u69a_1980_40_007832165_2 | 336,960 | 598,374 | 1.7758 |
04 Apr 2012 15:14:07 | 1189010 | 14283494 | hadcm3n_u69a_1980_40_007832165_2 | 311,040 | 551,540 | 1.7732 |
04 Apr 2012 03:11:13 | 1189010 | 14283494 | hadcm3n_u69a_1980_40_007832165_2 | 285,120 | 508,520 | 1.7835 |
03 Apr 2012 14:10:10 | 1189010 | 14283494 | hadcm3n_u69a_1980_40_007832165_2 | 259,200 | 462,074 | 1.7827 |
03 Apr 2012 01:01:10 | 1189010 | 14283494 | hadcm3n_u69a_1980_40_007832165_2 | 233,280 | 415,722 | 1.7821 |
02 Apr 2012 11:51:37 | 1189010 | 14283494 | hadcm3n_u69a_1980_40_007832165_2 | 207,360 | 369,889 | 1.7838 |
01 Apr 2012 22:17:59 | 1189010 | 14283494 | hadcm3n_u69a_1980_40_007832165_2 | 181,440 | 324,100 | 1.7863 |
01 Apr 2012 09:30:23 | 1189010 | 14283494 | hadcm3n_u69a_1980_40_007832165_2 | 155,520 | 277,930 | 1.7871 |
31 Mar 2012 19:09:51 | 1189010 | 14283494 | hadcm3n_u69a_1980_40_007832165_2 | 129,600 | 231,441 | 1.7858 |
31 Mar 2012 06:37:37 | 1189010 | 14283494 | hadcm3n_u69a_1980_40_007832165_2 | 103,680 | 185,134 | 1.7856 |
30 Mar 2012 16:35:16 | 1189010 | 14283494 | hadcm3n_u69a_1980_40_007832165_2 | 77,760 | 139,151 | 1.7895 |
30 Mar 2012 03:25:41 | 1189010 | 14283494 | hadcm3n_u69a_1980_40_007832165_2 | 51,840 | 92,558 | 1.7855 |
29 Mar 2012 13:37:03 | 1189010 | 14283494 | hadcm3n_u69a_1980_40_007832165_2 | 25,920 | 46,362 | 1.7887 |
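
The Average (sec/TS) column appears to be the cumulative CPU time divided by the cumulative timestep count. The page does not state this; it is an inference from the figures. A minimal Python sketch that checks the inference against three rows copied from the table above (the function name `seconds_per_timestep` is hypothetical, not part of BOINC or CPDN):

```python
# Assumption: Average (sec/TS) = CPU Time (sec) / Timestep, reported to 4 decimals.
# The helper name is made up for illustration; it is not a BOINC or CPDN API.

def seconds_per_timestep(cpu_seconds: int, timestep: int) -> float:
    """Return the average CPU seconds spent per model timestep."""
    return cpu_seconds / timestep


# (timestep, cpu_seconds, reported_average) copied from the trickle table.
rows = [
    (25_920, 46_362, 1.7887),    # 29 Mar 2012 13:37:03
    (259_200, 462_074, 1.7827),  # 03 Apr 2012 14:10:10
    (518_400, 923_051, 1.7806),  # 09 Apr 2012 01:26:30
]

for timestep, cpu_seconds, reported in rows:
    computed = seconds_per_timestep(cpu_seconds, timestep)
    # Allow for the table value being rounded to 4 decimal places.
    matches = abs(computed - reported) <= 5e-5
    print(f"{timestep:>7} TS: computed {computed:.4f}, reported {reported:.4f}, match={matches}")
```

If the assumption holds, every row should report a match. The three sampled rows agree to within rounding.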