Name | hadcm3n_o6tk_2140_40_008269389_2 |
Workunit | 8424513 |
Created | 27 Mar 2013, 15:29:58 UTC |
Sent | 27 Mar 2013, 15:30:35 UTC |
Report deadline | 26 Jun 2013, 22:57:46 UTC |
Received | 22 May 2013, 12:38:08 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1221462 |
Run time | 11 days 11 hours 8 min 54 sec |
CPU time | 9 days 6 hours 22 min 58 sec |
Validate state | Invalid |
Credit | 9,331.20 |
Device peak FLOPS | 3.10 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.0.28</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4104, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2744, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5008, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... MainError: 11:20:41 AM No files match the supplied pattern. MainError: 11:20:41 AM No files match the supplied pattern. MainError: 09:06:50 PM No files match the supplied pattern. MainError: 09:06:50 PM No files match the supplied pattern. MainError: 06:00:37 AM No files match the supplied pattern. MainError: 06:00:37 AM No files match the supplied pattern. MainError: 02:04:03 PM No files match the supplied pattern. MainError: 02:04:03 PM No files match the supplied pattern. MainError: 02:02:13 PM No files match the supplied pattern. MainError: 02:02:13 PM No files match the supplied pattern. MainError: 09:01:39 PM No files match the supplied pattern. MainError: 09:01:39 PM No files match the supplied pattern. MainError: 03:57:14 AM No files match the supplied pattern. MainError: 03:57:14 AM No files match the supplied pattern. MainError: 11:03:14 AM No files match the supplied pattern. MainError: 11:03:14 AM No files match the supplied pattern. MainError: 06:15:07 PM No files match the supplied pattern. MainError: 06:15:07 PM No files match the supplied pattern. MainError: 01:30:52 AM No files match the supplied pattern. MainError: 01:30:52 AM No files match the supplied pattern. Error converting file to netcdf: dataout/o6tkka.ph11c10 Error converting file to netcdf: dataout/o6tkka.pg11c10 Error converting file to netcdf: dataout/o6tkka.pe11c10 MainError: 08:21:29 AM No files match the supplied pattern. MainError: 08:21:29 AM No files match the supplied pattern. BUFFIN: C I/O Error feof - Unit 67 - Return code = 16 Model crashed: STWORK : I/O error - PP fixed length header tmp/pipe_dummy 2048 BUFFIN: C I/O Error feof - Unit 64 - Return code = 16 BUFFIN: C I/O Error feof - Unit 66 - Return code = 16 BUFFIN: C I/O Error feof - Unit 67 - Return code = 16 BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 BUFFIN: C I/O Error feof - Unit 67 - Return code = 16 Model crashed: STWORK : I/O error - PP fixed length header tmp/pipe_dummy 2048 BUFFIN: C I/O Error feof - Unit 64 - Return code = 16 BUFFIN: C I/O Error feof - Unit 66 - Return code = 16 BUFFIN: C I/O Error feof - Unit 67 - Return code = 16 BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 BUFFIN: C I/O Error feof - Unit 67 - Return code = 16 Model crashed: STWORK : I/O error - PP fixed length header tmp/pipe_dummy 2048 BUFFIN: C I/O Error feof - Unit 64 - Return code = 16 BUFFIN: C I/O Error feof - Unit 66 - Return code = 16 BUFFIN: C I/O Error feof - Unit 67 - Return code = 16 BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 BUFFIN: C I/O Error feof - Unit 67 - Return code = 16 Model crashed: STWORK : I/O error - PP fixed length header tmp/pipe_dummy 2048 BUFFIN: C I/O Error feof - Unit 64 - Return code = 16 BUFFIN: C I/O Error feof - Unit 66 - Return code = 16 BUFFIN: C I/O Error feof - Unit 67 - Return code = 16 BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 BUFFIN: C I/O Error feof - Unit 67 - Return code = 16 Model crashed: STWORK : I/O error - PP fixed length header tmp/pipe_dummy 2048 BUFFIN: C I/O Error feof - Unit 64 - Return code = 16 BUFFIN: C I/O Error feof - Unit 66 - Return code = 16 BUFFIN: C I/O Error feof - Unit 67 - Return code = 16 BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 BUFFIN: C I/O Error feof - Unit 67 - Return code = 16 Model crashed: STWORK : I/O error - PP fixed length header tmp/pipe_dummy 2048 Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
22 May 2013 12:39:50 | 1221462 | 15686965 | hadcm3n_o6tk_2140_40_008269389_2 | 777,600 | 974,974 | 1.2538 |
22 May 2013 12:39:50 | 1221462 | 15686965 | hadcm3n_o6tk_2140_40_008269389_2 | 751,680 | 950,383 | 1.2643 |
22 May 2013 12:39:50 | 1221462 | 15686965 | hadcm3n_o6tk_2140_40_008269389_2 | 725,760 | 924,286 | 1.2735 |
22 May 2013 12:39:50 | 1221462 | 15686965 | hadcm3n_o6tk_2140_40_008269389_2 | 699,840 | 898,418 | 1.2837 |
22 May 2013 12:39:50 | 1221462 | 15686965 | hadcm3n_o6tk_2140_40_008269389_2 | 673,920 | 872,901 | 1.2953 |
22 May 2013 12:39:50 | 1221462 | 15686965 | hadcm3n_o6tk_2140_40_008269389_2 | 648,000 | 848,005 | 1.3086 |
22 May 2013 12:39:50 | 1221462 | 15686965 | hadcm3n_o6tk_2140_40_008269389_2 | 622,080 | 822,877 | 1.3228 |
22 May 2013 12:39:50 | 1221462 | 15686965 | hadcm3n_o6tk_2140_40_008269389_2 | 596,160 | 796,787 | 1.3365 |
22 May 2013 12:39:50 | 1221462 | 15686965 | hadcm3n_o6tk_2140_40_008269389_2 | 570,240 | 767,832 | 1.3465 |
22 May 2013 12:39:50 | 1221462 | 15686965 | hadcm3n_o6tk_2140_40_008269389_2 | 544,320 | 735,952 | 1.3521 |
22 May 2013 12:39:50 | 1221462 | 15686965 | hadcm3n_o6tk_2140_40_008269389_2 | 518,400 | 701,079 | 1.3524 |
22 May 2013 12:39:50 | 1221462 | 15686965 | hadcm3n_o6tk_2140_40_008269389_2 | 492,480 | 666,261 | 1.3529 |
22 May 2013 12:39:50 | 1221462 | 15686965 | hadcm3n_o6tk_2140_40_008269389_2 | 466,560 | 631,488 | 1.3535 |
22 May 2013 12:39:50 | 1221462 | 15686965 | hadcm3n_o6tk_2140_40_008269389_2 | 440,640 | 596,700 | 1.3542 |
22 May 2013 12:39:50 | 1221462 | 15686965 | hadcm3n_o6tk_2140_40_008269389_2 | 414,720 | 561,884 | 1.3549 |
22 May 2013 12:39:50 | 1221462 | 15686965 | hadcm3n_o6tk_2140_40_008269389_2 | 388,800 | 525,622 | 1.3519 |
22 May 2013 12:39:50 | 1221462 | 15686965 | hadcm3n_o6tk_2140_40_008269389_2 | 362,880 | 488,737 | 1.3468 |
12 May 2013 23:08:56 | 1221462 | 15686965 | hadcm3n_o6tk_2140_40_008269389_2 | 336,960 | 451,908 | 1.3411 |
12 May 2013 13:12:30 | 1221462 | 15686965 | hadcm3n_o6tk_2140_40_008269389_2 | 311,040 | 416,441 | 1.3389 |
11 May 2013 03:45:49 | 1221462 | 15686965 | hadcm3n_o6tk_2140_40_008269389_2 | 285,120 | 381,677 | 1.3387 |
©2024 cpdn.org