Instrumentation for metrics

Craig Miskell cmiskell at gitlab.com
Wed Jan 22 09:10:18 AEDT 2020


On 21/01/20 8:44 pm, Damien Miller wrote:
 > On Tue, 21 Jan 2020, Philipp Marek wrote:
 >
 >>> This makes me think that the syslog approach is probably the way to go
 >>
 >> Yeah, right.
 >> Another idea is to mirror the current preauth load via setproctitle()...
 >> That makes that data accessible even without a syscall (at least the
 >> writing of the data - quering needs syscalls, right), so that can be
 >> kept up-to-date and allows a high monitoring frequency as well.
 >>
 >> Multiple instances of SSHd (on different ports) are easily distinguished
 >> as well.
 >
 > That's a really, really good idea. Patch below.

That would certainly cover my use case.  I wonder if there's a case to 
be made to add options.max_startups_begin to the title status (per the 
amended patch below).  It's more informative, but also a bit more 
confusing e.g. when we're above max_startups_begin and beginning to drop 
we get a title like:

sshd: [listener] 12/10/100 startups

But my primary goal here is getting that first number exposed and we 
could reasonably make max_startups_begin and max_startups the same; for 
us, once we start dropping any we may as well be dropping them all, 
because it happening at all is a thing to be avoided, and we want 
to/should have been alerted before it happens.  So the original patch 
would certainly be sufficient.

Just an aside: I notice that the original patch got included in commit 
f8c11461aa6db168fc5e7eeae448b4cbbf59642a in the portable git repo, along 
with another change that matches the commit message.  Was that intentional?


-- Craig Miskell Site Reliability Engineer | GitLab | Dunedin, New Zealand

diff --git a/sshd.c b/sshd.c
index 6129b0a..debbdcb 100644
--- a/sshd.c
+++ b/sshd.c
@@ -1005,7 +1005,7 @@ server_accept_loop(int *sock_in, int *sock_out, 
int *newsock, int *config_s)
  {
  	fd_set *fdset;
  	int i, j, ret, maxfd;
-	int startups = 0, listening = 0, lameduck = 0;
+	int ostartups = -1, startups = 0, listening = 0, lameduck = 0;
  	int startup_p[2] = { -1 , -1 };
  	char c = 0;
  	struct sockaddr_storage from;
@@ -1029,6 +1029,11 @@ server_accept_loop(int *sock_in, int *sock_out, 
int *newsock, int *config_s)
  	 * the daemon is killed with a signal.
  	 */
  	for (;;) {
+		if (ostartups != startups) {
+			setproctitle("[listener] %d/%d/%d startups",
+			    startups, options.max_startups_begin, options.max_startups);
+			ostartups = startups;
+		}
  		if (received_sighup) {
  			if (!lameduck) {
  				debug("Received SIGHUP; waiting for children");


More information about the openssh-unix-dev mailing list