网络编程
位置:首页>> 网络编程>> Python编程>> Scrapy 配置 * IP的实现

Scrapy 配置 * IP的实现

作者:BradyCC  发布时间:2023-12-10 19:06:22 

标签:Scrapy, , IP

应用 Scrapy框架 ,配置动态IP处理反爬。


# settings 配置中间件
DOWNLOADER_MIDDLEWARES = {
 'text.middlewares.TextDownloaderMiddleware': 543,
 # 'text.middlewares.RandomUserAgentMiddleware': 544,
 # 'text.middlewares.CheckUserAgentMiddleware': 545,
 'text.middlewares.ProxyMiddleware': 546,
 'text.middlewares.CheckProxyMiddleware': 547
}

# settings 配置可用动态IP
PROXIES = [
 "http://101.231.104.82:80",
 "http://39.137.69.6:8080",
 "http://39.137.69.10:8080",
 "http://39.137.69.7:80",
 "http://39.137.77.66:8080",
 "http://117.191.11.102:80",
 "http://117.191.11.113:8080",
 "http://117.191.11.113:80",
 "http://120.210.219.103:8080",
 "http://120.210.219.104:80",
 "http://120.210.219.102:80",
 "http://119.41.236.180:8010",
 "http://117.191.11.80:8080"
]

# middlewares 配置中间件
import random

class ProxyMiddleware(object):

def process_request(self, request, spider):
   ip = random.choice(spider.settings.get('PROXIES'))
   print('测试IP:', ip)
   request.meta['proxy'] = ip

class CheckProxyMiddleware(object):

def process_response(self, request, response, spider):
   print(' * :', request.meta['proxy'])
   return response

来源:https://blog.csdn.net/BradyCC/article/details/90759341

0
投稿

猜你喜欢

手机版 网络编程 asp之家 www.aspxhome.com