Name | hadcm3n_z9jk_1920_40_008281741_4 |
Workunit | 8432876 |
Created | 17 Feb 2013, 19:16:31 UTC |
Sent | 17 Feb 2013, 19:16:53 UTC |
Report deadline | 20 May 2013, 2:44:04 UTC |
Received | 8 Apr 2013, 10:54:54 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1172479 |
Run time | 7 days 15 hours 10 min |
CPU time | 7 days 15 hours 10 min |
Validate state | Invalid |
Credit | 3,421.44 |
Device peak FLOPS | 2.48 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-pc-linux-gnu |
Stderr | <core_client_version>6.10.58</core_client_version> <![CDATA[ <message> process exited with code 22 (0x16, -234) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7265, iMonCtr=1 Model crash detected, will try to restart... Atmos Hold Restart file rename failed on atmos_restart.hold Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7265, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7265, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7265, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7265, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7265, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7265, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=18387, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=18387, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=18387, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=18387, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=18387, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=18387, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=18387, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=18387, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=18387, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=18387, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=18387, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=18387, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
07 Apr 2013 06:42:13 | 1172479 | 15611662 | hadcm3n_z9jk_1920_40_008281741_4 | 285,120 | 609,618 | 2.1381 |
06 Apr 2013 08:43:38 | 1172479 | 15611662 | hadcm3n_z9jk_1920_40_008281741_4 | 259,200 | 561,013 | 2.1644 |
05 Apr 2013 14:30:54 | 1172479 | 15611662 | hadcm3n_z9jk_1920_40_008281741_4 | 233,280 | 505,921 | 2.1687 |
04 Apr 2013 17:01:29 | 1172479 | 15611662 | hadcm3n_z9jk_1920_40_008281741_4 | 207,360 | 450,805 | 2.1740 |
03 Apr 2013 19:38:29 | 1172479 | 15611662 | hadcm3n_z9jk_1920_40_008281741_4 | 181,440 | 395,988 | 2.1825 |
02 Apr 2013 22:06:09 | 1172479 | 15611662 | hadcm3n_z9jk_1920_40_008281741_4 | 155,520 | 340,613 | 2.1902 |
02 Apr 2013 01:30:55 | 1172479 | 15611662 | hadcm3n_z9jk_1920_40_008281741_4 | 129,600 | 285,600 | 2.2037 |
29 Mar 2013 22:15:32 | 1172479 | 15611662 | hadcm3n_z9jk_1920_40_008281741_4 | 103,680 | 229,164 | 2.2103 |
29 Mar 2013 03:18:11 | 1172479 | 15611662 | hadcm3n_z9jk_1920_40_008281741_4 | 77,760 | 172,345 | 2.2164 |
07 Mar 2013 12:42:11 | 1172479 | 15611662 | hadcm3n_z9jk_1920_40_008281741_4 | 51,840 | 114,927 | 2.2170 |
07 Mar 2013 12:42:11 | 1172479 | 15611662 | hadcm3n_z9jk_1920_40_008281741_4 | 25,920 | 57,467 | 2.2171 |
©2024 cpdn.org