原文:Intro to Load Balancing for Developers – The Algorithms
转载:http://blog.gesha.net/archives/205/
Plain Programmer Description:The system builds an array of Servers being load balanced, and uses the random number generator to determine who gets the next connection… Far from an elegant solution, and most often found in large software packages that have thrown load balancing in as a feature.
Plain Programmer Description:The system builds a standard circular queue and walks through it, sending one request to each machine before getting to the start of the queue and doing it again. While I’ve never seen the code (or actual load balancer code for any of these for that matter), we’ve all written this queue with the modulus function before. In school if nowhere else.
Plain Programmer Description:The simplest way to explain for this one is that the system makes multiple entries in the Round Robin circular queue for servers with larger ratios. So if you set ratios at 3:2:1:1 for your four servers, that’s what the queue would look like – 3 entries for the first server, two for the second, one each for the third and fourth. In this version, the weights are set when the load balancing is configured for your application and never change, so the system will just keep looping through that circular queue. Different vendors use different weighting systems – whole numbers, decimals that must total 1.0 (100%), etc. but this is an implementation detail, they all end up in a circular queue style layout with more entries for larger ratings.
Plain Programmer Description:If you think of Weighted Round Robin where the circular queue is rebuilt with new (dynamic) weights whenever it has been fully traversed, you’ll be dead-on.
Plain Programmer Description:The load balancer looks at the response time of each attached server and chooses the one with the best response time. This is pretty straight-forward, but can lead to congestion because response time right now won’t necessarily be response time in 1 second or two seconds. Since connections are generally going through the load balancer, this algorithm is a lot easier to implement than you might think, as long as the numbers are kept up to date whenever a response comes through.
These next three I use the BIG-IP name for. They are variants of a generalized algorithm sometimes called Long Term Resource Monitoring.
Plain Programmer Description:This algorithm just keeps track of the number of connections attached to each server, and selects the one with the smallest number to receive the connection. Like fastest, this can cause congestion when the connections are all of different durations – like if one is loading a plain HTML page and another is running a JSP with a ton of database lookups. Connection counting just doesn’t account for that scenario very well.
Plain Programmer Description:This algorithm tries to merge Fastest and Least Connections, which does make it more appealing than either one of the above than alone. In this case, an array is built with the information indicated (how weighting is done will vary, and I don’t know even for F5, let alone our competitors), and the element with the highest value is chosen to receive the connection. This somewhat counters the weaknesses of both of the original algorithms, but does not account for when a server is about to be overloaded – like when three requests to that query-heavy JSP have just been submitted, but not yet hit the heavy work.
Plain Programmer Description:This method attempts to fix the one problem with Observed by watching what is happening with the server. If its response time has started going down, it is less likely to receive the packet. Again, no idea what the weightings are, but an array is built and the most desirable is chosen.
 
You can see with some of these algorithms that persistent connections 
would cause problems. Like Round Robin, if the connections persist to a 
server for as long as the user session is working, some servers will 
build a backlog of persistent connections that slow their response time.
 The Long Term Resource Monitoring algorithms are the best choice if you
 have a significant number of persistent connections. Fastest works okay
 in this scenario also if you don’t have access to any of the dynamic 
solutions.
你可以看到,有些算法遇到长连接可能会导致问题。像是轮询,如果长连接保持用户会话那么
久,当连接数积压到一定值时会导致服务器响应时间变慢。如果你有大量的长连接,LTRM( Long Term Resource Monitoring
 )算法是最好的选择。如果没有动态的解决方案,Fastest算法也比较适合这种场景。
That’s it for this week, next week we’ll start talking specifically 
about Application Delivery Controllers and what they offer – which is a 
whole lot – that can help your application in a variety of ways.
Until then!
Don.
原文:http://www.cnblogs.com/horizonli/p/5393583.html