climateprediction.net home page
Task 14883173

Task 14883173

Name hadam3p_pnw_ayja_1982_1_008035597_0
Workunit 8190711
Created 9 Jul 2012, 13:30:20 UTC
Sent 9 Jul 2012, 13:30:28 UTC
Report deadline 21 Jun 2013, 18:50:28 UTC
Received 24 Jul 2012, 14:57:19 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1168410
Run time 5 days 0 hours 13 min 56 sec
CPU time 3 days 13 hours 15 min 39 sec
Validate state Invalid
Credit 2,004.61
Device peak FLOPS 2.29 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Pacific North West v6.09
windows_intelx86
Stderr
<core_client_version>6.12.34</core_client_version>
<![CDATA[
<stderr_txt>
200:43:11 (5260)) : atart_timer_thread(): CreateThread() failed, errno 0
rt_timer_thread(): CreateThread() failed, errno 0
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6248, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1468, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 1
Called boinc_finish
diagnostics_init_unhandled_exception_monitor(): Creating hExceptionMonitorThread failed, errno 12
WARNING: BOINC Windows Runtime Debugger has been disabled.
11:52:19 (7384): start_timer_thread(): CreateThread() failed, errno 0
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7384, iMonCtr=2
17:56:39 (6596): start_timer_thread(): CreateThread() failed, errno 0
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6112, iMonCtr=2
Model crash detected, will try to restart...
11:10:43 (5132): start_timer_thread(): CreateThread() failed, errno 0
11:10:44 (4476): start_timer_thread(): CreateThread() failed, errno 0
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2876, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 0
Called boinc_finish
17:17:05 (3432): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7548, iMonCtr=2
Model crash detected, will try to restart...
20:51:20 (7076): start_timer_thread(): CreateThread() failed, errno 0
20:51:21 (5692): start_timer_thread(): CreateThread() failed, errno 0
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3556, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7824, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4320, selfPID=1480, iMonCtr=1
Model crash detected, will try to restart...
13:53:26 (4840): start_timer_thread(): CreateThread() failed, errno 0
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4840, selfPID=5236, iMonCtr=1
Model crash detected, will try to restart...
19:25:14 (848): start_timer_thread(): CreateThread() failed, errno 0
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=848, selfPID=5036, iMonCtr=1
Model crash detected, will try to restart...
11:41:27 (7976): start_timer_thread(): CreateThread() failed, errno 0
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5108, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 3
11:04:56 (7380): start_timer_thread(): CreateThread() failed, errno 0
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2680, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7380, selfPID=2972, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 3
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3768, selfPID=5184, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4428, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6376, selfPID=7472, iMonCtr=1
Model crash detected, will try to restart...
13:40:42 (7316): start_timer_thread(): CreateThread() failed, errno 0
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7252, iMonCtr=2
17:31:19 (2700): start_timer_thread(): CreateThread() failed, errno 0
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7980, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2700, selfPID=6376, iMonCtr=1
Model crash detected, will try to restart...
15:25:47 (7340): start_timer_thread(): CreateThread() failed, errno 0
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6508, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6512, iMonCtr=2
Model crash detected, will try to restart...
11:11:30 (5636): start_timer_thread(): CreateThread() failed, errno 0
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6912, iMonCtr=2
Model crash detected, will try to restart...
15:05:36 (1924): start_timer_thread(): CreateThread() failed, errno 0
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1924, selfPID=6792, iMonCtr=1
Model crash detected, will try to restart...
21:28:54 (3928): start_timer_thread(): CreateThread() failed, errno 0
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6196, iMonCtr=2
Model crash detected, will try to restart...
13:47:32 (7012): start_timer_thread(): CreateThread() failed, errno 0
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7012, selfPID=7272, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4916, selfPID=5844, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 7
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1872, iMonCtr=2
10:41:02 (4416): start_timer_thread(): CreateThread() failed, errno 0
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2448, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5280, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 8
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_pnw_ayja_1982_1_008035597_0_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_ayja_1982_1_008035597_0_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_ayja_1982_1_008035597_0_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_ayja_1982_1_008035597_0_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
23 Jul 2012 20:45:55 1168410 14883173 hadam3p_pnw_ayja_1982_1_008035597_0 92,256 288,535 3.1275
22 Jul 2012 13:37:53 1168410 14883173 hadam3p_pnw_ayja_1982_1_008035597_0 80,736 253,079 3.1346
20 Jul 2012 16:13:36 1168410 14883173 hadam3p_pnw_ayja_1982_1_008035597_0 69,216 215,641 3.1155
19 Jul 2012 12:37:00 1168410 14883173 hadam3p_pnw_ayja_1982_1_008035597_0 57,696 179,948 3.1189
16 Jul 2012 20:01:57 1168410 14883173 hadam3p_pnw_ayja_1982_1_008035597_0 46,176 143,583 3.1095
15 Jul 2012 20:28:06 1168410 14883173 hadam3p_pnw_ayja_1982_1_008035597_0 34,656 106,425 3.0709
14 Jul 2012 21:19:36 1168410 14883173 hadam3p_pnw_ayja_1982_1_008035597_0 23,136 71,062 3.0715
10 Jul 2012 20:15:29 1168410 14883173 hadam3p_pnw_ayja_1982_1_008035597_0 11,616 36,183 3.1149


©2024 cpdn.org