climateprediction.net home page
Task 15898318

Task 15898318

Name hadcm3n_n6lm_1880_40_008372899_2
Workunit 8523758
Created 20 Jul 2013, 16:05:10 UTC
Sent 20 Jul 2013, 16:05:23 UTC
Report deadline 19 Oct 2013, 23:32:34 UTC
Received 14 Aug 2013, 16:47:09 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 787844
Run time 4 days 17 hours 18 min 37 sec
CPU time 4 days 8 hours 59 min 16 sec
Validate state Invalid
Credit 1,555.20
Device peak FLOPS 2.00 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
i686-apple-darwin
Stderr
<core_client_version>7.0.65</core_client_version>
<![CDATA[
<message>
process exited with code 22 (0x16, -234)
</message>
<stderr_txt>
hadcm3n_6.07_i686-apple-darwin(208,0xa0922540) malloc: *** error for object 0x801004: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(208,0xa0922540) malloc: *** error for object 0x801000: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=208, iMonCtr=1
Model crash detected, will try to restart...
Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=208, iMonCtr=1
Model crash detected, will try to restart...
Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=208, iMonCtr=1
Model crash detected, will try to restart...
Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=208, iMonCtr=1
Model crash detected, will try to restart...
Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=208, iMonCtr=1
Model crash detected, will try to restart...
Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=208, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
SIGSEGV: segmentation violation
hadcm3n_6.07_i686-apple-darwin(183,0xa0922540) malloc: *** error for object 0x822c04: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(183,0xa0922540) malloc: *** error for object 0x800004: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=183, iMonCtr=1
Model crash detected, will try to restart...
Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=183, iMonCtr=1
Model crash detected, will try to restart...
Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=183, iMonCtr=1
Model crash detected, will try to restart...
Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=183, iMonCtr=1
Model crash detected, will try to restart...
Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=183, iMonCtr=1
Model crash detected, will try to restart...
Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=183, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
SIGSEGV: segmentation violation
hadcm3n_6.07_i686-apple-darwin(186,0xa0922540) malloc: *** error for object 0x811a04: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(186,0xa0922540) malloc: *** error for object 0x800804: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(186,0xa0922540) malloc: *** error for object 0x800800: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(186,0xa0922540) malloc: *** error for object 0x812a04: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=186, iMonCtr=1
Model crash detected, will try to restart...
Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=186, iMonCtr=1
Model crash detected, will try to restart...
Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=186, iMonCtr=1
Model crash detected, will try to restart...
Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=186, iMonCtr=1
Model crash detected, will try to restart...
Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=186, iMonCtr=1
Model crash detected, will try to restart...
Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=186, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
SIGSEGV: segmentation violation
hadcm3n_6.07_i686-apple-darwin(207,0xa0922540) malloc: *** error for object 0x800004: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(207,0xa0922540) malloc: *** error for object 0x812c04: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
hadcm3n_6.07_i686-apple-darwin(207,0xa0922540) malloc: *** error for object 0x812c00: incorrect checksum for freed object - object was probably modified after being freed.
*** set a breakpoint in malloc_error_break to debug
Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=207, iMonCtr=1
Model crash detected, will try to restart...
Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=207, iMonCtr=1
Model crash detected, will try to restart...
Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=207, iMonCtr=1
Model crash detected, will try to restart...
Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=207, iMonCtr=1
Model crash detected, will try to restart...
Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=207, iMonCtr=1
Model crash detected, will try to restart...
Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=207, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
25 Jul 2013 01:31:35 787844 15898318 hadcm3n_n6lm_1880_40_008372899_2 129,600 315,145 2.4317
24 Jul 2013 06:43:05 787844 15898318 hadcm3n_n6lm_1880_40_008372899_2 103,680 252,066 2.4312
23 Jul 2013 22:08:46 787844 15898318 hadcm3n_n6lm_1880_40_008372899_2 77,760 188,992 2.4305
23 Jul 2013 21:47:58 787844 15898318 hadcm3n_n6lm_1880_40_008372899_2 51,840 126,083 2.4322
23 Jul 2013 21:24:56 787844 15898318 hadcm3n_n6lm_1880_40_008372899_2 25,920 63,156 2.4366


©2024 cpdn.org