If 99% of people aren't on the list, and 1% are, if your check is super fast but makes 1% false positives, you still end up having to only do a full check on 2%. Which could be a huge huge huge win computationally.
Your post is really weird to me, talking about boarding times? You start skeptical of the example & I'm confused how you think this is anything but a fine example. Ultimately there's some service running in the cloud somewhere that needs to have checks run against it. 2.9m people fly a day in the US, and whether the servers doing that work can do it efficiently or whether they do it in a dogsbit bad manner seems like an obvious concern to me? https://www.faa.gov/air_traffic/by_the_numbers
I suspect the actual usage for this is for much broader higher traffic systems. For things that watch sizable chunks of the internet for patterns and traffic. But checking passengers against. I fly lists sounds like a pretty reasonable example use to me, and the criticism seems off base & weird in a number of dimensions that straight up don't make sense.
Assuming the airport runs from 6am to 11pm, 2.9m people a day works out to be about ~47 reqs/second. Which is not terribly much.
Even if we check them at both ends, and effectively double the load, thats only ~100reqs/second. A single machine would happily handle that.
> Assuming the airport runs from 6am to 11pm
That's a strange assumption. The airports that have significant traffic are operating 24 hours.
Under the assumption that airports close between 11 and 6, there would be no such thing as a redeye flight.
Congratulations on re-highlighting my greatest complaint about your qualm-making, a propensity for adding factors into the mix that have nothing to do with the big picture.
To me, 47 or 37 req/s seems like a fantastically immaterial difference. It's just not a big enough change in magnitude to really affect the situation.
Accurate qualm, and being technically correct. Personally I'd try to find a more liberal minded approach when trying to hold in my mind the question for what efficient set membership might be good for.
The change in magnitude is over 20%; if you're thinking about it in terms of "change in magnitude", it's huge.
As FridgeSeal points out, both numbers are very small, but that's not a reason you'd want to set up an inaccurate triage system on top of the accurate one. If you don't have very much work to do, you don't need to invest much in optimizing it.
Most airports are not open 24/7 in the sense that flights are departing 24/7 or that you can get through security checkpoints 24/7. They simply don't kick people out of the secure area when they shut down. You'll still have difficulties showing up to your 5am flight 3 hours early before the checkpoints open at most major airports.
You have difficulties showing up to your flight three hours early because checkin is not available that far in advance. But the airport obviously is open that far in advance. Planes are departing and landing at that time. The checkin counter may be open (or may not; volume does go down).
It it likely true that "most airports" are not operating 24/7, but how is that relevant? It could be just as true that "most airports" don't serve commercial flights at all. The airports that have a lot of passengers are operating 24/7. We're talking about a metric assessed per passenger.
I'm not talking about the check-in counter, I'm talking about general security that non-charter passengers always have to go through to board a flight. Security checkpoints operate limited hours at even the largest airports, like LAX where they're open from 4h00-22h00 max. If we look the departures for LAX today, we can see that no passenger planes departed between 2h00-5h00, except for one charter jet out of the private terminal. Yesterday had no such departures, only the usual nighttime cargo flights.