Scraping Youdao Translate

2023-03-24 10:29:41

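I have just started learning web scraping, so I tried writing a short script that fetches translations from Youdao Translate.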

import urllib.request as ur
import urllib.parse as up
import chardet
import json
string = input('Please enter the text to translate: ')  # A Chinese prompt typed into input() with a Chinese input method raised an error for me; does anyone know why?
URL = 'http://fanyi.youdao.com/translate?smartresult=dict&smartresult=rule'
data = {}
data['i'] = string
data['from'] = 'AUTO'
data['to'] = 'AUTO'
data['smartresult'] = 'dict'
data['client'] = 'fanyideskweb'
data['salt'] = '1536587001028'
data['sign'] = '9fe501a15b60074aa1fbbdc15baeac93'
data['doctype'] = 'json'
data['version'] = '2.1'
data['keyfrom'] = 'fanyi.web'
data['action'] = 'FY_BY_REALTIME'
data['typoResult'] = 'false'
data = up.urlencode(data).encode('utf-8')
# header = {}   # Alternative: build the headers as a dict to disguise the client
# header['User-Agent'] = 'Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/68.0.3440.106 Safari/537.36'
request = ur.Request(URL, data)  # A header dict can only be supplied here, as ur.Request(URL, data, header)
request.add_header('User-Agent', 'Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/68.0.3440.106 Safari/537.36')
response = ur.urlopen(request)
html = response.read()
type_encode = chardet.detect(html)['encoding']  # Detect the encoding with chardet.detect()
html = html.decode(type_encode)
html = json.loads(html)  # JSON is a lightweight string-based data format
answer = html['translateResult'][0][0]['tgt']  # Take the first translated segment
print(answer)
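
The commented-out lines in the script mention a second way to set the User-Agent: passing a header dict straight to ur.Request. A minimal sketch of that variant follows, together with a helper that collects every translated segment rather than only the first. Both build_request and extract_translation are hypothetical names introduced here for illustration. The form is abbreviated to a few fields from the script above, and whether the server accepts the shortened form is not verified. The nested list-of-lists shape of 'translateResult' is assumed from the indexing used in the script.

```python
import json
import urllib.parse as up
import urllib.request as ur

URL = 'http://fanyi.youdao.com/translate?smartresult=dict&smartresult=rule'
USER_AGENT = ('Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 '
              '(KHTML, like Gecko) Chrome/68.0.3440.106 Safari/537.36')


def build_request(text):
    """Build the POST request with the headers passed as a dict (hypothetical helper)."""
    # Abbreviated form: assumption, the original script sends more fields.
    form = {'i': text, 'from': 'AUTO', 'to': 'AUTO', 'doctype': 'json'}
    body = up.urlencode(form).encode('utf-8')
    # The third positional argument of ur.Request is the headers dict.
    return ur.Request(URL, body, {'User-Agent': USER_AGENT})


def extract_translation(raw):
    """Join every 'tgt' segment of the JSON reply (hypothetical helper).

    Assumes 'translateResult' is a list of paragraphs, each a list of
    segments carrying a 'tgt' key, as the script's indexing suggests.
    """
    reply = json.loads(raw)
    return '\n'.join(
        ''.join(segment['tgt'] for segment in paragraph)
        for paragraph in reply['translateResult']
    )
```

With network access, extract_translation(ur.urlopen(build_request('hello')).read()) would tie the two together. json.loads accepts bytes in UTF-8, UTF-16, or UTF-32 on Python 3.6 and later, so this sketch skips the chardet step; if the reply used another encoding, the chardet approach from the script would still be needed.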
