Name | hadam3p_eu_2m1e_1987_1_007424998_0 |
Workunit | 7622633 |
Created | 26 Aug 2011, 13:31:24 UTC |
Sent | 26 Aug 2011, 13:31:33 UTC |
Report deadline | 7 Aug 2012, 18:51:33 UTC |
Received | 18 Sep 2011, 12:12:20 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1159510 |
Run time | 11 days 23 hours 34 min 19 sec |
CPU time | 7 days 15 hours 34 min 7 sec |
Validate state | Invalid |
Credit | 1,392.75 |
Device peak FLOPS | 1.22 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Europe v6.09 windows_intelx86 |
Stderr | <core_client_version>6.12.33</core_client_version> <![CDATA[ <stderr_txt> Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1296, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3748, iMonCtr=2 Suspended CPDN Monitor - Suspend request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3524, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2680, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2156, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3140, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3260, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1784, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1768, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3400, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3188, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2420, selfPID=1656, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Global Worker:: CPDN procesController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2840, selfPID=3160, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3776, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3848, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1832, selfPID=3496, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3740, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3188, selfPID=3248, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5844, iMonCtr=2 ontroller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=884, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3132, selfPID=3248, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1508, selfPID=2628, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Suspended CPDN Monitor - Suspend request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2084, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2464, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3044, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3444, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=432, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3352, iMonCtr=2 Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2284, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=364, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3100, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2312, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3700, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=624, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2640, iMonCtr=2 Leaving CPDN_Main::Monitor... forrtl: severe (24): end-of-file during read, unit 9, file C:\ProgramData\BOINC\projects\climateprediction.net\hadam3p_eu_2m1e_1987_1_007424998\tmp\xaakm.namelists Image PC Routine Line Source hadam3p_eu_um_6.0 0159A39A Unknown Unknown Unknown hadam3p_eu_um_6.0 01542CD0 Unknown Unknown Unknown hadam3p_eu_um_6.0 01541E9A Unknown Unknown Unknown hadam3p_eu_um_6.0 01522819 Unknown Unknown Unknown hadam3p_eu_um_6.0 01422287 Unknown Unknown Unknown hadam3p_eu_um_6.0 014BE7B2 Unknown Unknown Unknown hadam3p_eu_um_6.0 014BF2DA Unknown Unknown Unknown hadam3p_eu_um_6.0 01239BD2 Unknown Unknown Unknown hadam3p_eu_um_6.0 0157E638 Unknown Unknown Unknown kernel32.dll 76D1D309 Unknown Unknown Unknown ntdll.dll 76DF16C3 Unknown Unknown Unknown ntdll.dll 76DF1696 Unknown Unknown Unknown rrtl: severe (24): end-of-file during read, unit 9, file C:\ProgramData\BOINC\projects\climateprediction.net\hadam3p_eu_2m1e_1987_1_007424998\tmp\xaakg.namelists Image PC Routine Line Source hadrm3p_eu_um_6.0 016CC52A Unknown Unknown Unknown hadrm3p_eu_um_6.0 01674460 Unknown Unknown Unknown hadrm3p_eu_um_6.0 0167362A Unknown Unknown Unknown hadrm3p_eu_um_6.0 01652469 Unknown Unknown Unknown hadrm3p_eu_um_6.0 015566EB Unknown Unknown Unknown hadrm3p_eu_um_6.0 015F2AE2 Unknown Unknown Unknown hadrm3p_eu_um_6.0 015F35AF Unknown Unknown Unknown hadrm3p_eu_um_6.0 01399860 Unknown Unknown Unknown hadrm3p_eu_um_6.0 016B0893 Unknown Unknown Unknown kernel32.dll 76D1D309 Unknown Unknown Unknown ntdll.dll 76DF16C3 Unknown Unknown Unknown ntdll.dll 76DF1696 Unknown Unknown Unknown Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2784, selfPID=2068, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> <message> upload failure: <file_xfer_error> <file_name>hadam3p_eu_2m1e_1987_1_007424998_0_8.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_2m1e_1987_1_007424998_0_9.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_2m1e_1987_1_007424998_0_10.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_2m1e_1987_1_007424998_0_11.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_2m1e_1987_1_007424998_0_12.zip</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
16 Sep 2011 08:56:59 | 1159510 | 13300556 | hadam3p_eu_2m1e_1987_1_007424998_0 | 80,736 | 584,827 | 7.2437 |
13 Sep 2011 18:13:58 | 1159510 | 13300556 | hadam3p_eu_2m1e_1987_1_007424998_0 | 69,216 | 500,180 | 7.2264 |
10 Sep 2011 17:09:03 | 1159510 | 13300556 | hadam3p_eu_2m1e_1987_1_007424998_0 | 57,696 | 415,714 | 7.2052 |
06 Sep 2011 18:57:56 | 1159510 | 13300556 | hadam3p_eu_2m1e_1987_1_007424998_0 | 46,176 | 332,817 | 7.2076 |
04 Sep 2011 03:07:23 | 1159510 | 13300556 | hadam3p_eu_2m1e_1987_1_007424998_0 | 34,656 | 250,748 | 7.2353 |
01 Sep 2011 18:51:49 | 1159510 | 13300556 | hadam3p_eu_2m1e_1987_1_007424998_0 | 23,136 | 170,264 | 7.3593 |
29 Aug 2011 11:53:49 | 1159510 | 13300556 | hadam3p_eu_2m1e_1987_1_007424998_0 | 11,616 | 85,549 | 7.3648 |
©2024 cpdn.org