Nginx 反向代理

发表于 2017-05-06 更新于 2025-02-12

什么是代理？

一张图了解两种代理模式的区别：

两种代理模式

常见应用场景：

正向代理：VPN 翻墙
反向代理：Web 站点服务

Nginx 反向代理

Nginx 代理功能由标准 HTTP 模块中内置的 ngx_http_proxy_module 提供，因此无需额外编译，常见配置如下：

http {
    server {
        server_name example.com;
        listen 80;
        
        location / {
            proxy_pass http://127.0.0.1:81/;
        }
    }
}

proxy_pass 指令用于配置被代理服务器的协议和地址。除了配置单机，还可以配置集群，详见 Nginx 负载均衡。

常见问题

To slash or not to slash

Here is a handy table that shows you how the request will be received by your WebApp, depending on how you write the location and proxy_pass declarations. Assume all requests go to http://localhost:8080:

location	proxy_pass	Request	Received by upstream
/webapp/	http://localhost:8080/api/	/webapp/foo?bar=baz	/api/foo?bar=baz ✅
/webapp/	http://localhost:8080/api	/webapp/foo?bar=baz	/apifoo?bar=baz
/webapp	http://localhost:8080/api/	/webapp/foo?bar=baz	/api//foo?bar=baz
/webapp	http://localhost:8080/api	/webapp/foo?bar=baz	/api/foo?bar=baz
/webapp	http://localhost:8080/api	/webappfoo?bar=baz	/apifoo?bar=baz

In other words: You usually always want a trailing slash, never want to mix with and without trailing slash, and only want without trailing slash when you want to concatenate a certain path component together (which I guess is quite rarely the case).

上游无法获取真实的访问来源信息

使用反向代理之后，上游服务器（如 Tomcat）无法获取真实的访问来源信息（如协议、域名、访问 IP），例如下面代码：

request.getScheme() // 总是 http，而不是实际的 http 或 https
request.isSecure() // 总是 false
request.getRemoteAddr() // Nginx IP
request.getServerName() // 127.0.0.1
request.getRequestURL() // http://127.0.0.1:81/index
response.sendRedirect(...) // 总是重定向到 http

这个问题需要在 Nginx 和 Tomcat 中做一些配置以解决问题。

Nginx 配置：

使用 proxy_set_header 指令为上游服务器添加请求头：

http {
    proxy_set_header Host              $host;
    proxy_set_header X-Real-IP         $remote_addr;
    proxy_set_header X-Forwarded-For   $proxy_add_x_forwarded_for;
    proxy_set_header X-Forwarded-Proto $scheme;

    server {
        ...
    }
}

Tomcat 配置：

在 Tomcat 的 server.xml 中配置 RemoteIpValve 让代码能够获取真实 IP 和协议：

1
2
3

<Valve className="org.apache.catalina.valves.RemoteIpValve" 
    remoteIpHeader="X-Forwarded-For" 
    protocolHeader="X-Forwarded-Proto" />

解决结果：

request.getScheme() // 实际的 http 或 https
request.isSecure() // 对应的 false 或 true
request.getRemoteAddr() // 用户 IP
request.getHeader("X-Real-IP") // 用户 IP
request.getServerName() // example.com
request.getRequestURL() // 对应的 http://example.com/index 或 https://example.com/index
response.sendRedirect(...) // 实际的 http 或 https

全局 proxy_set_header 失效

先来看下 proxy_set_header 的语法：

语法:	proxy_set_header field value;
默认值: proxy_set_header Host $proxy_host;
        proxy_set_header Connection close;
上下文: http, server, location

允许重新定义或者添加发往后端服务器的请求头。value 可以包含文本、变量或者它们的组合。当且仅当当前配置级别中没有定义 proxy_set_header 指令时，会从上面的级别继承配置。 默认情况下，只有两个请求头会被重新定义：

proxy_set_header Host       $proxy_host;
proxy_set_header Connection close;

这里隐含一个坑：如果当前配置级别中定义了 proxy_set_header 指令，哪怕只配置了一个，都会导致无法从上面的级别继承配置，即导致全局级别的 proxy_set_header 配置失效。例如下述 HTTP 长连接配置：

http {
    proxy_set_header Host              $host;
    proxy_set_header X-Real-IP         $remote_addr;
    proxy_set_header X-Forwarded-For   $proxy_add_x_forwarded_for;
    proxy_set_header X-Forwarded-Proto $scheme;

    upstream backend {
        server 127.0.0.1:8080;
        keepalive 16;
    }

    server {
        location / {
            proxy_pass http://backend;
            proxy_http_version 1.1;
            proxy_set_header Connection "";
            ...
        }
    }
}

导致全局级别的 proxy_set_header 配置失效：

proxy_set_header 失效

解决办法是在 location 中重新配置这四个请求头。

参考

How nginx processes a request ?

《万字多图，搞懂 Nginx 高性能网络工作原理！》