Name | famous_u3fm_1799_200_006637293_4 |
Workunit | 6840665 |
Created | 10 Jun 2010, 11:39:44 UTC |
Sent | 21 Jul 2010, 10:07:17 UTC |
Report deadline | 20 Oct 2010, 17:34:28 UTC |
Received | 5 Aug 2010, 0:00:19 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1082616 |
Run time | 7 days 8 hours 53 min 38 sec |
CPU time | 6 days 19 hours 48 min 19 sec |
Validate state | Invalid |
Credit | 1,822.10 |
Device peak FLOPS | 1.64 GFLOPS |
Application version | UK Met Office FAMOUS v6.11 windows_intelx86 |
Stderr | <core_client_version>6.10.43</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... No Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5520, selfPID=5520, iMonCtr=1 CPDN Monitor - Quit request from BOINC... Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=712, selfPID=712, iMonCtr=1 CPDN Monitor - Quit request from BOINC... No Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5972, selfPID=5972, iMonCtr=1 22:57:46 (1736): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:57:47 (1736): No heartbeat from core client for 30 sec - exiting Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0 CPDN Monitor - Quit request from BOINC... Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1932, selfPID=1932, iMonCtr=1 06:48:47 (6352): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:48:48 (6352): No heartbeat from core client for 30 sec - exiting 06:48:49 (6352): No heartbeat from core client for 30 sec - exiting 06:48:50 (6352): No heartbeat from core client for 30 sec - exiting 06:48:51 (6352): No heartbeat from core client for 30 sec - exiting 06:48:52 (6352): No heartbeat from core client for 30 sec - exiting 06:48:53 (6352): No heartbeat from core client for 30 sec - exiting 06:48:54 (6352): No heartbeat from core client for 30 sec - exiting 06:48:56 (6352): No heartbeat from core client for 30 sec - exiting 06:48:57 (6352): No heartbeat from core client for 30 sec - exiting 06:48:58 (6352): No heartbeat from core client for 30 sec - exiting Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5196, selfPID=5196, iMonCtr=1 06:49:54 (6784): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:49:55 (6784): No heartbeat from core client for 30 sec - exiting Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7044, selfPID=7044, iMonCtr=1 06:53:00 (7132): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:53:01 (7132): No heartbeat from core client for 30 sec - exiting 06:53:02 (7132): No heartbeat from core client for 30 sec - exiting 06:53:03 (7132): No heartbeat from core client for 30 sec - exiting 06:53:05 (7132): No heartbeat from core client for 30 sec - exiting 06:53:06 (7132): No heartbeat from core client for 30 sec - exiting 06:53:07 (7132): No heartbeat from core client for 30 sec - exiting 06:53:08 (7132): No heartbeat from core client for 30 sec - exiting 06:53:09 (7132): No heartbeat from core client for 30 sec - exiting 06:53:10 (7132): No heartbeat from core client for 30 sec - exiting 06:53:11 (7132): No heartbeat from core client for 30 sec - exiting Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7000, selfPID=7000, iMonCtr=1 CPDN Monitor - Quit request from BOINC... Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5988, selfPID=5988, iMonCtr=1 CPDN Monitor - Quit request from BOINC... Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1988, selfPID=1988, iMonCtr=1 Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5708, selfPID=5708, iMonCtr=1 CPDN Monitor - Quit request from BOINC... No Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3984, selfPID=3984, iMonCtr=1 CPDN Monitor - Quit request from BOINC... No Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6632, selfPID=6632, iMonCtr=1 CPDN Monitor - Quit request from BOINC... Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2092, selfPID=2092, iMonCtr=1 CPDN Monitor - Quit request from BOINC... Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6056, selfPID=6056, iMonCtr=1 CPDN Monitor - Quit request from BOINC... No Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4768, selfPID=4768, iMonCtr=1 CPDN Monitor - Quit request from BOINC... No Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6024, selfPID=6024, iMonCtr=1 CPDN Monitor - Quit request from BOINC... Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3964, selfPID=3964, iMonCtr=1 CPDN Monitor - Quit request from BOINC... No Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4496, selfPID=4496, iMonCtr=1 CPDN Monitor - Quit request from BOINC... No Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5672, selfPID=5672, iMonCtr=1 Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... No Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3428, selfPID=3428, iMonCtr=1 CPDN Monitor - Quit request from BOINC... No Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5980, selfPID=5980, iMonCtr=1 CPDN Monitor - Quit request from BOINC... Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5660, selfPID=5660, iMonCtr=1 CPDN Monitor - Quit request from BOINC... No Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5976, selfPID=5976, iMonCtr=1 CPDN Monitor - Quit request from BOINC... No Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4196, selfPID=4196, iMonCtr=1 CPDN Monitor - Quit request from BOINC... No Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5656, selfPID=5656, iMonCtr=1 CPDN Monitor - Quit request from BOINC... No Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5208, selfPID=5208, iMonCtr=1 CPDN Monitor - Quit request from BOINC... No Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5908, selfPID=5908, iMonCtr=1 CPDN Monitor - Quit request from BOINC... No Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4668, selfPID=4668, iMonCtr=1 CPDN Monitor - Quit request from BOINC... No Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2532, selfPID=2532, iMonCtr=1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 CPDN Monitor - Quit request from BOINC... Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6492, selfPID=6492, iMonCtr=1 CPDN Monitor - Quit request from BOINC... No Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5728, selfPID=5728, iMonCtr=1 CPDN Monitor - Quit request from BOINC... No Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5080, selfPID=5080, iMonCtr=1 CPDN Monitor - Quit request from BOINC... Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4484, selfPID=4484, iMonCtr=1 Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... No Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6772, selfPID=6772, iMonCtr=1 15:46:15 (4748): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... forrtl: severe (47): write to READONLY file, unit 6, file C:\ProgramData\BOINC\projects\climateprediction.net\famous_u3fm_1799_200_006637293\dataout\stdout_um.txt Image PC Routine Line Source famous_um_6.11_wi 008846D2 Unknown Unknown Unknown Stack trace terminated abnormally. Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1468, iMonCtr=1 Model crash detected, will try to restart... forrtl: severe (47): write to READONLY file, unit 6, file C:\ProgramData\BOINC\projects\climateprediction.net\famous_u3fm_1799_200_006637293\dataout\stdout_um.txt Image PC Routine Line Source famous_um_6.11_wi 008846D2 Unknown Unknown Unknown Stack trace terminated abnormally. Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1468, iMonCtr=1 Model crash detected, will try to restart... forrtl: severe (47): write to READONLY file, unit 6, file C:\ProgramData\BOINC\projects\climateprediction.net\famous_u3fm_1799_200_006637293\dataout\stdout_um.txt Image PC Routine Line Source famous_um_6.11_wi 008846D2 Unknown Unknown Unknown Stack trace terminated abnormally. Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1468, iMonCtr=1 Model crash detected, will try to restart... forrtl: severe (47): write to READONLY file, unit 6, file C:\ProgramData\BOINC\projects\climateprediction.net\famous_u3fm_1799_200_006637293\dataout\stdout_um.txt Image PC Routine Line Source famous_um_6.11_wi 008846D2 Unknown Unknown Unknown Stack trace terminated abnormally. Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1468, iMonCtr=1 Model crash detected, will try to restart... forrtl: severe (47): write to READONLY file, unit 6, file C:\ProgramData\BOINC\projects\climateprediction.net\famous_u3fm_1799_200_006637293\dataout\stdout_um.txt Image PC Routine Line Source famous_um_6.11_wi 008846D2 Unknown Unknown Unknown Stack trace terminated abnormally. Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1468, iMonCtr=1 Model crash detected, will try to restart... forrtl: severe (47): write to READONLY file, unit 6, file C:\ProgramData\BOINC\projects\climateprediction.net\famous_u3fm_1799_200_006637293\dataout\stdout_um.txt Image PC Routine Line Source famous_um_6.11_wi 008846D2 Unknown Unknown Unknown Stack trace terminated abnormally. Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1468, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( No Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=756, selfPID=756, iMonCtr=1 cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/famous_u3fm_1799_200_006637293/jobs/afyel.ihist after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/famous_u3fm_1799_200_006637293/jobs/afyel.ihist after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/famous_u3fm_1799_200_006637293/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/famous_u3fm_1799_200_006637293/dataout/ocean_restart.day after 11 attempts Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO tmp/pipe_dummy cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/famous_u3fm_1799_200_006637293/jobs/afyel.ihist after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/famous_u3fm_1799_200_006637293/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/famous_u3fm_1799_200_006637293/dataout/ocean_restart.day after 11 attempts Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO tmp/pipe_dummy cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/famous_u3fm_1799_200_006637293/jobs/afyel.ihist after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/famous_u3fm_1799_200_006637293/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/famous_u3fm_1799_200_006637293/dataout/ocean_restart.day after 11 attempts Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO tmp/pipe_dummy cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/famous_u3fm_1799_200_006637293/jobs/afyel.ihist after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/famous_u3fm_1799_200_006637293/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/famous_u3fm_1799_200_006637293/dataout/ocean_restart.day after 11 attempts Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO tmp/pipe_dummy cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/famous_u3fm_1799_200_006637293/jobs/afyel.ihist after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/famous_u3fm_1799_200_006637293/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/famous_u3fm_1799_200_006637293/dataout/ocean_restart.day after 11 attempts Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO tmp/pipe_dummy cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/famous_u3fm_1799_200_006637293/jobs/afyel.ihist after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/famous_u3fm_1799_200_006637293/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/famous_u3fm_1799_200_006637293/dataout/ocean_restart.day after 11 attempts Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO tmp/pipe_dummy Sorry, too many model crashes! :-( 15:59:32 (3988): called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
03 Aug 2010 20:51:44 | 1082616 | 11437276 | famous_u3fm_1799_200_006637293_4 | 552,266 | 583,034 | 1.0557 |
03 Aug 2010 08:42:32 | 1082616 | 11437276 | famous_u3fm_1799_200_006637293_4 | 542,906 | 571,950 | 1.0535 |
03 Aug 2010 05:35:10 | 1082616 | 11437276 | famous_u3fm_1799_200_006637293_4 | 533,546 | 561,349 | 1.0521 |
03 Aug 2010 03:42:05 | 1082616 | 11437276 | famous_u3fm_1799_200_006637293_4 | 524,186 | 549,085 | 1.0475 |
02 Aug 2010 22:10:26 | 1082616 | 11437276 | famous_u3fm_1799_200_006637293_4 | 514,826 | 536,716 | 1.0425 |
02 Aug 2010 08:04:33 | 1082616 | 11437276 | famous_u3fm_1799_200_006637293_4 | 505,466 | 528,085 | 1.0447 |
02 Aug 2010 05:45:18 | 1082616 | 11437276 | famous_u3fm_1799_200_006637293_4 | 496,106 | 520,066 | 1.0483 |
02 Aug 2010 03:34:31 | 1082616 | 11437276 | famous_u3fm_1799_200_006637293_4 | 486,746 | 512,378 | 1.0527 |
02 Aug 2010 01:56:50 | 1082616 | 11437276 | famous_u3fm_1799_200_006637293_4 | 477,386 | 502,101 | 1.0518 |
01 Aug 2010 21:24:13 | 1082616 | 11437276 | famous_u3fm_1799_200_006637293_4 | 468,026 | 491,995 | 1.0512 |
01 Aug 2010 18:22:21 | 1082616 | 11437276 | famous_u3fm_1799_200_006637293_4 | 458,666 | 481,836 | 1.0505 |
01 Aug 2010 16:28:50 | 1082616 | 11437276 | famous_u3fm_1799_200_006637293_4 | 449,306 | 472,701 | 1.0521 |
01 Aug 2010 12:50:29 | 1082616 | 11437276 | famous_u3fm_1799_200_006637293_4 | 439,946 | 463,736 | 1.0541 |
01 Aug 2010 10:42:05 | 1082616 | 11437276 | famous_u3fm_1799_200_006637293_4 | 430,586 | 456,084 | 1.0592 |
01 Aug 2010 09:28:45 | 1082616 | 11437276 | famous_u3fm_1799_200_006637293_4 | 421,226 | 448,496 | 1.0647 |
01 Aug 2010 06:19:47 | 1082616 | 11437276 | famous_u3fm_1799_200_006637293_4 | 411,866 | 440,926 | 1.0706 |
01 Aug 2010 03:56:18 | 1082616 | 11437276 | famous_u3fm_1799_200_006637293_4 | 402,506 | 432,291 | 1.0740 |
31 Jul 2010 22:37:17 | 1082616 | 11437276 | famous_u3fm_1799_200_006637293_4 | 393,146 | 420,723 | 1.0701 |
31 Jul 2010 16:59:26 | 1082616 | 11437276 | famous_u3fm_1799_200_006637293_4 | 383,786 | 412,947 | 1.0760 |
31 Jul 2010 14:40:28 | 1082616 | 11437276 | famous_u3fm_1799_200_006637293_4 | 374,426 | 405,676 | 1.0835 |
©2024 cpdn.org