Name | hadam3p_anz_f1ad_2012_1_009265100_1 |
Workunit | 9358016 |
Created | 3 Dec 2014, 21:19:45 UTC |
Sent | 3 Dec 2014, 22:08:07 UTC |
Report deadline | 16 Nov 2015, 3:28:07 UTC |
Received | 11 Dec 2014, 13:31:53 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1228992 |
Run time | 1 days 23 hours 6 min 1 sec |
CPU time | 1 days 22 hours 18 min 25 sec |
Validate state | Invalid |
Credit | 1,006.54 |
Device peak FLOPS | 2.96 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10 windows_intelx86 |
Stderr | <core_client_version>7.4.27</core_client_version> <![CDATA[ <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 10:44:04 (5928): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 01:00:11 (15676): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 10:33:37 (3344): No heartbeat from core client for 30 sec - exiting 10:33:38 (3344): No heartbeat from core client for 30 sec - exiting 10:33:39 (3344): No heartbeat from core client for 30 sec - exiting 10:33:40 (3344): No heartbeat from core client for 30 sec - exiting 10:33:41 (3344): No heartbeat from core client for 30 sec - exiting 10:33:43 (3344): No heartbeat from core client for 30 sec - exiting 10:33:44 (3344): No heartbeat from core client for 30 sec - exiting 10:33:45 (3344): No heartbeat from core client for 30 sec - exiting 10:33:46 (3344): No heartbeat from core client for 30 sec - exiting 10:33:47 (3344): No heartbeat from core client for 30 sec - exiting 10:33:48 (3344): No heartbeat from core client for 30 sec - exiting 10:33:49 (3344): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6812, selfPID=6812, iMonCtr=2 CPDN Monitor - Quit request from BOINC... 08:21:34 (5024): No heartbeat from core client for 30 sec - exiting 08:21:35 (5024): No heartbeat from core client for 30 sec - exiting 08:21:36 (5024): No heartbeat from core client for 30 sec - exiting 08:21:37 (5024): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13328, selfPID=13328, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Unhandled Exception Detected... - Unhandled Exception Record - Reason: Out Of Memory (C++ Exception) (0xe06d7363) at address 0x7713C42D Engaging BOINC Windows Runtime Debugger... ******************** BOINC Windows Runtime Debugger Version 6.13.0 Dump Timestamp : 12/11/14 01:14:04 Install Directory : C:\Program Files (x86)\BOINC\ Data Directory : C:\ProgramData\BOINC Project Symstore : LoadLibraryA( C:\Program Files (x86)\BOINC\\dbghelp.dll ): GetLastError = 126 Loaded Library : dbghelp.dll LoadLibraryA( C:\Program Files (x86)\BOINC\\symsrv.dll ): GetLastError = 126 LoadLibraryA( symsrv.dll ): GetLastError = 126 LoadLibraryA( C:\Program Files (x86)\BOINC\\srcsrv.dll ): GetLastError = 126 LoadLibraryA( srcsrv.dll ): GetLastError = 126 LoadLibraryA( C:\Program Files (x86)\BOINC\\version.dll ): GetLastError = 126 Loaded Library : version.dll Debugger Engine : 4.0.5.0 Symbol Search Path: C:\ProgramData\BOINC\projects\climateprediction.net\hadam3p_anz_f1ad_2012_1_009265100;C:\ProgramData\BOINC\projects\climateprediction.net ModLoad: 003a0000 008e6000 C:\ProgramData\BOINC\projects\climateprediction.net\hadrm3p_anz_um_6.10_windows_intelx86.exe (-nosymbols- Symbols Loaded) Linked PDB Filename : d:\cpdn\cpdnboinc\cpdnprecis\test\projects\reqs.comlab.ox.ac.uk_cpdn\hadrm3p_anz_um_6.10_windows_intelx86.pdb ModLoad: 77d00000 00180000 C:\Windows\SysWOW64\ntdll.dll (6.1.7601.18247) (-exported- Symbols Loaded) Linked PDB Filename : wntdll.pdb File Version : 6.1.7600.16385 (win7_rtm.090713-1255) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 6.1.7600.16385 ModLoad: 77350000 00110000 C:\Windows\syswow64\kernel32.dll (6.1.7601.18409) (-exported- Symbols Loaded) Linked PDB Filename : wkernel32.pdb File Version : 6.1.7601.18409 (win7sp1_gdr.140303-2144) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 6.1.7601.18409 ModLoad: 77130000 00047000 C:\Windows\syswow64\KERNELBASE.dll (6.1.7601.18409) (-exported- Symbols Loaded) Linked PDB Filename : wkernelbase.pdb File Version : 6.1.7601.18409 (win7sp1_gdr.140303-2144) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 6.1.7601.18409 ModLoad: 76f20000 00100000 C:\Windows\syswow64\USER32.dll (6.1.7601.17514) (-exported- Symbols Loaded) Linked PDB Filename : wuser32.pdb File Version : 6.1.7601.17514 (win7sp1_rtm.101119-1850) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 6.1.7601.17514 ModLoad: 77460000 00090000 C:\Windows\syswow64\GDI32.dll (6.1.7601.18577) (-nosymbols- Symbols Loaded) Linked PDB Filename : wgdi32.pdb File Version : 6.1.7601.18577 (win7sp1_gdr.140822-1508) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 6.1.7601.18577 ModLoad: 76080000 0000a000 C:\Windows\syswow64\LPK.dll (6.1.7601.18177) (-nosymbols- Symbols Loaded) Linked PDB Filename : wlpk.pdb File Version : 6.1.7601.18177 (win7sp1_gdr.130605-1534) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 6.1.7601.18177 ModLoad: 75cc0000 0009d000 C:\Windows\syswow64\USP10.dll (1.626.7601.18454) (-nosymbols- Symbols Loaded) Linked PDB Filename : usp10.pdb File Version : 1.0626.7601.18454 (win7sp1_gdr.140424-1533) Company Name : Microsoft Corporation Product Name : Microsoft(R) Uniscribe Unicode script processor Product Version : 1.0626.7601.18454 ModLoad: 77040000 000ac000 C:\Windows\syswow64\msvcrt.dll (7.0.7601.17744) (-nosymbols- Symbols Loaded) Linked PDB Filename : msvcrt.pdb File Version : 7.0.7601.17744 (win7sp1_gdr.111215-1535) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 7.0.7601.17744 ModLoad: 771f0000 000a0000 C:\Windows\syswow64\ADVAPI32.dll (6.1.7601.18247) (-nosymbols- Symbols Loaded) Linked PDB Filename : advapi32.pdb File Version : 6.1.7600.16385 (win7_rtm.090713-1255) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 6.1.7600.16385 *** Dump of the Process Statistics: *** - I/O Operations Counters - Read: 0, Write: 0, Other 0 - I/O Transfers Counters - Read: 0, Write: 0, Other 0 - Paged Pool Usage - QuotaPagedPoolUsage: 0, QuotaPeakPagedPoolUsage: 0 QuotaNonPagedPoolUsage: 0, QuotaPeakNonPagedPoolUsage: 0 - Virtual Memory Usage - VirtualSize: 0, PeakVirtualSize: 0 - Pagefile Usage - PagefileUsage: 0, PeakPagefileUsage: 0 - Working Set Size - WorkingSetSize: 0, PeakWorkingSetSize: 0, PageFaultCount: 0 *** Dump of thread ID 9508 (state: Initialized): *** - Information - Status: Base Priority: Normal, Priority: Normal, , Kernel Time: 0.000000, User Time: 0.000000, Wait Time: 0.000000 - Unhandled Exception Record - Reason: Out Of Memory (C++ Exception) (0xe06d7363) at address 0x7713C42D - Registers - eax=123af508 ebx=00000001 ecx=00000003 edx=00000000 esi=00832428 edi=fffffffe eip=7713c42d esp=123af508 ebp=123af558 cs=0023 ss=002b ds=002b es=002b fs=0053 gs=002b efl=00000212 - Callstack - ChildEBP RetAddr Args to Child 123af558 006c1d43 e06d7363 00000001 00000003 123af584 KERNELBASE!RaiseException+0x0 123af590 006bef45 123af5a0 00759608 0072f15c 00755754 hadrm3p_anz_um_6.10_windows_int!+0x0 123af5ac 003a8361 00fd9f80 01000000 003a98a8 00000000 hadrm3p_anz_um_6.10_windows_int!+0x0 123af60c 003a95e0 123af620 12543190 123af620 12543111 hadrm3p_anz_um_6.10_windows_int!+0x0 123afabc 7736338a 7efde000 123afb08 77d39f72 7efde000 hadrm3p_anz_um_6.10_windows_int!+0x0 123afac8 77d39f72 7efde000 66438e11 00000000 00000000 kernel32!BaseThreadInitThunk+0x0 123afb08 77d39f45 006c0ea6 7efde000 00000000 00000000 ntdll!RtlInitializeExceptionChain+0x0 123afb20 00000000 006c0ea6 7efde000 00000000 00000000 ntdll!RtlInitializeExceptionChain+0x0 *** Dump of thread ID 11420 (state: Initialized): *** - Information - Status: Base Priority: Normal, Priority: Normal, , Kernel Time: 0.000000, User Time: 0.000000, Wait Time: 0.000000 - Registers - eax=0064bb20 ebx=00000000 ecx=00000000 edx=00000000 esi=3909feb0 edi=00000000 eip=77d1fd91 esp=3909fe6c ebp=3909fed4 cs=0023 ss=002b ds=002b es=002b fs=0053 gs=002b efl=00000246 - Callstack - ChildEBP RetAddr Args to Child 3909fed4 771444a5 00000064 00000000 3909fef0 0064bb34 ntdll!ZwDelayExecution+0x0 3909fee4 0064bb34 00000064 3909fefc 7736338a 00000000 KERNELBASE!Sleep+0x0 3909fef0 7736338a 00000000 3909ff3c 77d39f72 00000000 hadrm3p_anz_um_6.10_windows_int!+0x0 3909fefc 77d39f72 00000000 4d708a25 00000000 00000000 kernel32!BaseThreadInitThunk+0x0 3909ff3c 77d39f45 0064bb20 00000000 00000000 00000000 ntdll!RtlInitializeExceptionChain+0x0 3909ff54 00000000 0064bb20 00000000 00000000 00000000 ntdll!RtlInitializeExceptionChain+0x0 *** Debug Message Dump **** *** Foreground Window Data *** Window Name : Window Class : Window Process ID: 0 Window Thread ID : 0 Exiting... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=15952, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> <message> upload failure: <file_xfer_error> <file_name>hadam3p_anz_f1ad_2012_1_009265100_1_3.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_f1ad_2012_1_009265100_1_4.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_f1ad_2012_1_009265100_1_5.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_f1ad_2012_1_009265100_1_6.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_f1ad_2012_1_009265100_1_7.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_f1ad_2012_1_009265100_1_8.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_f1ad_2012_1_009265100_1_9.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_f1ad_2012_1_009265100_1_10.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_f1ad_2012_1_009265100_1_11.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_anz_f1ad_2012_1_009265100_1_12.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
10 Dec 2014 14:04:01 | 1228992 | 17542347 | hadam3p_anz_f1ad_2012_1_009265100_1 | 23,339 | 140,642 | 6.0261 |
06 Dec 2014 18:31:00 | 1228992 | 17542347 | hadam3p_anz_f1ad_2012_1_009265100_1 | 11,819 | 69,047 | 5.8420 |
©2024 cpdn.org