intermittent ssh connect error

Paul Archer parcher at masergy.com
Fri Oct 19 07:19:18 EST 2007


We've got set of servers that use AFD (http://www.dwd.de/AFD) to scp files 
to a set of receiving servers. There are about 25 sending machines, and 12 
receiving machines. All the servers are RHEL 4 and 5, with the senders 
running openssh 4.3p2, and the receivers running 3.9p1.

Most files are sent without a problem, but on a pretty regular basis, we 
get "ssh_exchange_identification: Connection closed by remote host" in our 
log files on the senders.
The odd/interesting thing is that while that error often shows up on a 
connection to one machine, we also get bursts, where the error happens on 
connections to several different machines at once. That leads us to think 
it's an issue on the sending side, rather than the receiving side.

I've tried tuning various network paramaters, both on the senders and the 
receivers, but no joy. I wasn't really sure that would help anyway, as 
netstat only reports between 20 and 30 connections (established & 
time_wait) at any given time--although that is almost exclusively ssh.

Any recommendations on where to go from here? What to look for, what 
diagnostics to try?

Paul


--------------------
     Paul Archer
Senior Unix SysAdmin
     Masergy, Inc.
     214-442-8827
parcher at masergy.com
--------------------


PS My apologies if this isn't the appropriate list for this.

PPS This is a sample of the errors we're getting in our AFD log files:
18 21:09:19 <I> hoard10 [0]: 4796416 Bytes send in 8 file(s).
18 21:09:19 <I> hoard09 [2]: 2170880 Bytes send in 26 file(s).
18 21:09:19 <I> hoard04 [2]: 2170880 Bytes send in 26 file(s).
18 21:09:36 <E> hoard03 [0]: Protocol error. ssh is complaining, see next 
message. #28b9df9d (ssh_common.c 427)
18 21:09:36 <E> hoard03 [0]: ssh_exchange_identification: Connection 
closed by remote host
18 21:09:36 <E> hoard03 [0]: SCP connection to 64.47.61.180 at port 22 
failed (-1). #28b9df9d (sf_scp.c 298)
18 21:09:36 <E> hoard01 [0]: Protocol error. ssh is complaining, see next 
message. #ba53f3db (ssh_common.c 427)
18 21:09:36 <E> hoard01 [0]: ssh_exchange_identification: Connection 
closed by remote host
18 21:09:36 <E> hoard01 [0]: SCP connection to 64.47.61.178 at port 22 
failed (-1). #ba53f3db (sf_scp.c 298)
18 21:09:36 <E> hoard11 [0]: Protocol error. ssh is complaining, see next 
message. #1a3b49e6 (ssh_common.c 427)
18 21:09:36 <E> hoard11 [0]: ssh_exchange_identification: Connection 
closed by remote host
18 21:09:36 <E> hoard11 [0]: SCP connection to 64.47.61.132 at port 22 
failed (-1). #1a3b49e6 (sf_scp.c 298)
18 21:09:36 <E> hoard04 [2]: Protocol error. ssh is complaining, see next 
message. #d83bb35 (ssh_common.c 427)
18 21:09:36 <E> hoard04 [2]: ssh_exchange_identification: Connection 
closed by remote host
18 21:09:36 <E> hoard04 [2]: SCP connection to 64.47.61.157 at port 22 
failed (-1). #d83bb35 (sf_scp.c 298)
18 21:09:36 <E> hoard05 [0]: Protocol error. ssh is complaining, see next 
message. #44f6ad16 (ssh_common.c 427)
18 21:09:36 <E> hoard05 [0]: ssh_exchange_identification: Connection 
closed by remote host
18 21:09:36 <E> hoard05 [0]: SCP connection to 64.47.61.156 at port 22 
failed (-1). #44f6ad16 (sf_scp.c 298)
18 21:09:38 <I> hoard12 [0]: 16384 Bytes send in 4 file(s).
18 21:09:38 <I> hoard06 [0]: 16384 Bytes send in 4 file(s).
18 21:09:38 <I> hoard07 [0]: 16384 Bytes send in 4 file(s).
18 21:09:39 <I> hoard08 [0]: 16384 Bytes send in 4 file(s).
18 21:09:39 <I> hoard02 [0]: 16384 Bytes send in 4 file(s).
18 21:09:47 <I> hoard09 [1]: 4812800 Bytes send in 100 file(s).



More information about the openssh-unix-dev mailing list