How to get multiple Set-Cookie values from the response headers in Scrapy

无敌的白金之星 2018-10-17 01:50:20
As the title says. Here is response.headers:
{b'Content-Type': [b'text/html; charset=utf-8'], b'Cache-Control': [b'private, no-cache, no-store, must-revalidate'], b'Pragma': [b'no-cache'], b'Expires': [b'Sat, 01 Jan 2000 00:00:00 GMT'], b'Vary': [b'Accept-Language, Cookie, Accept-Encoding
'], b'Content-Language': [b'en'], b'Date': [b'Wed, 17 Oct 2018 05:39:47 GMT'], b'Strict-Transport-Security': [b'max-age=60'], b'X-Frame-Options': [b'SAMEORIGIN'], b'Content-Security-Policy': [b"report-uri https://www.instagram.com/security/csp_
report/; default-src 'self' https://www.instagram.com; img-src https: data: blob:; font-src https: data:; media-src 'self' https://www.instagram.com https://*.cdninstagram.com https://*.fbcdn.net; manifest-src 'self' https://www.instagram.com;
script-src 'self' https://instagram.com https://www.instagram.com https://*.www.instagram.com https://*.cdninstagram.com wss://www.instagram.com https://*.facebook.com https://*.fbcdn.net https://*.facebook.net 'unsafe-inline' 'unsafe-eval' blo
b:; style-src 'self' https://*.www.instagram.com https://www.instagram.com 'unsafe-inline'; connect-src 'self' https://instagram.com https://www.instagram.com https://*.www.instagram.com https://graph.instagram.com https://*.graph.instagram.com
https://*.cdninstagram.com https://api.instagram.com wss://www.instagram.com https://*.facebook.com https://*.fbcdn.net https://*.facebook.net chrome-extension://boadgeojelhgndaghljhdicfkmllpafd; worker-src 'self' https://www.instagram.com; fr
ame-src 'self' https://instagram.com https://www.instagram.com https://staticxx.facebook.com https://www.facebook.com https://web.facebook.com https://connect.facebook.net https://m.facebook.com; object-src 'none'; upgrade-insecure-requests"],
b'Content-Security-Policy-Report-Only': [b"report-uri https://www.instagram.com/security/csp_report/; default-src 'self' https://www.instagram.com; img-src https: data: blob:; font-src https: data:; media-src 'self' https://www.instagram.com ht
tps://*.cdninstagram.com https://*.fbcdn.net; manifest-src 'self' https://www.instagram.com; script-src 'self' https://instagram.com https://www.instagram.com https://*.www.instagram.com https://*.cdninstagram.com wss://www.instagram.com https:
//*.facebook.com https://*.fbcdn.net https://*.facebook.net 'unsafe-inline' 'unsafe-eval' blob:; style-src 'self' https://*.www.instagram.com https://www.instagram.com 'unsafe-inline'; connect-src 'self' https://instagram.com https://www.instag
ram.com https://*.www.instagram.com https://graph.instagram.com https://*.graph.instagram.com https://*.cdninstagram.com https://api.instagram.com wss://www.instagram.com https://*.facebook.com https://*.fbcdn.net https://*.facebook.net chrome-
extension://boadgeojelhgndaghljhdicfkmllpafd; worker-src 'self' https://www.instagram.com; frame-src 'self' https://instagram.com https://www.instagram.com https://staticxx.facebook.com https://www.facebook.com https://web.facebook.com https://
connect.facebook.net https://m.facebook.com; object-src 'none'; upgrade-insecure-requests"], b'Set-Cookie': [b'rur=FRC; Domain=.instagram.com; HttpOnly; Path=/; Secure', b'mid=W8bLIgALAAF7VCTnsHdOeJAA7IzL; Domain=.instagram.com; expires=Sat, 14
-Oct-2028 05:39:47 GMT; Max-Age=315360000; Path=/; Secure', b'urlgen="{\\"140.82.22.53\\": 20473}:1gCeYd:dhTaKnksZ3Yo7uF5yU_9v6IVs-g"; Domain=.instagram.com; HttpOnly; Path=/; Secure', b'mcd=3; Domain=.instagram.com; expires=Sat, 14-Oct-2028 05
:39:47 GMT; Max-Age=315360000; Path=/; Secure', b'csrftoken=0XLdwkUkSII40ea8N4BR19B3zWuFSLg0; Domain=.instagram.com; expires=Wed, 16-Oct-2019 05:39:47 GMT; Max-Age=31449600; Path=/; Secure']}

The Set-Cookie entry shown in it is:
b'Set-Cookie': [
    b'rur=FRC; Domain=.instagram.com; HttpOnly; Path=/; Secure',
    b'mid=W8bLIgALAAF7VCTnsHdOeJAA7IzL; Domain=.instagram.com; expires=Sat, 14-Oct-2028 05:39:47 GMT; Max-Age=315360000; Path=/; Secure',
    b'urlgen="{\\"140.82.22.53\\": 20473}:1gCeYd:dhTaKnksZ3Yo7uF5yU_9v6IVs-g"; Domain=.instagram.com; HttpOnly; Path=/; Secure',
    b'mcd=3; Domain=.instagram.com; expires=Sat, 14-Oct-2028 05:39:47 GMT; Max-Age=315360000; Path=/; Secure',
    b'csrftoken=0XLdwkUkSII40ea8N4BR19B3zWuFSLg0; Domain=.instagram.com; expires=Wed, 16-Oct-2019 05:39:47 GMT; Max-Age=31449600; Path=/; Secure'
]

But response.headers['set-cookie'] returns only the last one:
b'csrftoken=0XLdwkUkSII40ea8N4BR19B3zWuFSLg0; Domain=.instagram.com; expires=Wed, 16-Oct-2019 05:39:47 GMT; Max-Age=31449600; Path=/; Secure'

How can I get all of them?

For reference, here are the set-cookie headers as shown in Chrome:
set-cookie: rur=FRC; Domain=.instagram.com; HttpOnly; Path=/; Secure
set-cookie: urlgen="{\"139.180.218.220\": 20473\054 \"140.82.22.53\": 20473}:1gCeSl:J3gpMvE673WW0a8KXOPMl_D3A6k"; Domain=.instagram.com; HttpOnly; Path=/; Secure
set-cookie: csrftoken=is3aHClgPzEXi2bIzYGqlmbnqQrxmvrr; Domain=.instagram.com; expires=Wed, 16-Oct-2019 05:33:43 GMT; Max-Age=31449600; Path=/; Secure
街头小贩 2018-11-22
Find an HTTP parsing library; if there isn't one, write your own. Every part of the HTTP protocol is well defined.
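
A minimal sketch along the lines of the reply above, not a definitive solution. In Scrapy, response.headers['set-cookie'] returns only the last value of a multi-valued header, while response.headers.getlist('Set-Cookie') returns every value as a list of bytes. The example below assumes such a list is already in hand (three of the values from the question are hard-coded so it runs without Scrapy) and parses each one with the standard library's http.cookies.SimpleCookie. The helper name parse_set_cookies is hypothetical.

```python
from http.cookies import SimpleCookie

# In a Scrapy callback this list would come from (assumed usage):
#     raw_cookies = response.headers.getlist('Set-Cookie')
# Three values from the question are hard-coded so the sketch runs standalone.
raw_cookies = [
    b'rur=FRC; Domain=.instagram.com; HttpOnly; Path=/; Secure',
    b'mcd=3; Domain=.instagram.com; expires=Sat, 14-Oct-2028 05:39:47 GMT; '
    b'Max-Age=315360000; Path=/; Secure',
    b'csrftoken=0XLdwkUkSII40ea8N4BR19B3zWuFSLg0; Domain=.instagram.com; '
    b'expires=Wed, 16-Oct-2019 05:39:47 GMT; Max-Age=31449600; Path=/; Secure',
]


def parse_set_cookies(raw_values):
    """Return {name: value} for a list of raw Set-Cookie header values (bytes).

    Hypothetical helper: each header value is parsed on its own, so one
    cookie's attributes (Domain, Path, ...) never leak into another.
    """
    cookies = {}
    for raw in raw_values:
        jar = SimpleCookie()
        jar.load(raw.decode('latin-1'))  # header bytes -> str
        for name, morsel in jar.items():
            cookies[name] = morsel.value
    return cookies


cookies = parse_set_cookies(raw_cookies)
print(cookies)
```

If only the cookie names and values are needed, the resulting dict can be passed on as-is; attributes such as Domain or Max-Age remain available on each morsel (for example morsel['domain']) if the loop is adapted to keep them.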
