python获取微博热搜

2023-05-08 20:30:35

# 获取热搜源码
import json
import re

import requests as requests


def main(json_list=[]):
    response = requests.get('http://s.weibo.com/top/summary')
    html = response.text
    regex = re.compile(
        r'<tr class="">\s+<td class="td-01 ranktop">(\d+)</td>\s+<td class="td-02">\s+<a href="(\S+)" target="_blank" rel="external nofollow"  target="_blank">('
        r'.*?)</a>\s+<span>(.*?)</span>\s+</td>\s+<td class="td-03">.*</td>\s+</tr>')
    lists = regex.findall(html)
    [json_list.append(dict(num=vo[0], url="https://s.weibo.com" + vo[1], key=vo[2], hotNum=vo[3])) for vo in lists]
    print(json.dumps(json_list, indent=2, ensure_ascii=False))


if __name__ == '__main__':
    main()

python获取微博热搜

python获取微博热搜

继续阅读

无法解析的外部符号 wmain，该符号在函数 "void cdecl mainCRTStartupHelper(struct HINSTANCE *,unsigned short con......

TestLink导出用例转换工具(XML2Excel)

YAML简介和PyYAML安全操作YAML支持的类型YAML的优点：yaml的基本语法python操作

Small tricks

libsvm for python 安装

学习软件测试基础测试第七天

Zeppelin 配置访问 REST APIApache Zeppelin Configuration REST API

【Torch】最简洁logging使用指南

27. Remove Element(列表)题目代码

sort()函数到底是怎样进行数字排序的

Cloud Studio初体验

使用 ctypes 进行 Python 和 C 的混合编程

【python】【数据处理】画多维数据分布图

【python】netconf协议对接管理设备

「Python 网络自动化」NETCONF —— Python 使用 NETCONF 管理配置 H3C 网络设备

在python中创建excel并写入