Hi everyone, I'm 空空star. In this post I'll show how to query an article's quality score with Python + Selenium.
Browser: Chrome
ChromeDriver version: 110.0.5481.77
Python version: Python 3.8
selenium version: 4.8.2
For environment setup, see "Selenium Basics: Environment Preparation".
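import pprint
import time

from selenium import webdriver
from selenium.webdriver.common.by import By

# URL of the article to check (the same one that appears in the output)
blog_url = 'https://blog.csdn.net/weixin_38093452/article/details/130704860'

# Run Chrome in headless mode, i.e. without a visible window
options = webdriver.ChromeOptions()
options.add_argument('--headless')
driver = webdriver.Chrome(options=options)
driver.get('https://www.csdn.net/qc')
driver.maximize_window()
# Type the article URL into the input box and click the query button
driver.find_elements(By.CLASS_NAME, 'el-input__inner')[0].send_keys(blog_url)
driver.find_elements(By.CLASS_NAME, 'trends-input-box-btn')[0].click()
# Give the page a moment to render the result
time.sleep(1)
# The fields are picked by position, so the indexes depend on the current page layout
title = driver.find_elements(By.TAG_NAME, 'span')[3].text
posttime = driver.find_elements(By.TAG_NAME, 'span')[4].text
score = driver.find_elements(By.TAG_NAME, 'p')[1].text
remark = driver.find_elements(By.TAG_NAME, 'p')[2].text
pprint.pprint({'title': title, 'posttime': posttime, 'score': score, 'remark': remark, 'url': blog_url})
# Close the browser and release the driver process
driver.quit()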
{'posttime': '- 空空star · 2023-05-16 18:35:51 -',
 'remark': '文章质量良好',
 'score': '85',
 'title': '通过Python的wordcloud库将单词生成词云(心形形状)',
 'url': 'https://blog.csdn.net/weixin_38093452/article/details/130704860'}
Process finished with exit code 0
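The scraped values are all plain strings: `posttime` bundles the author and the timestamp together, and `score` is text rather than a number. If you want to sort or filter results later, a small clean-up step can help. The sketch below is only an illustration. The helper name `normalize_result` and the regular expression are my own assumptions, and the pattern is based solely on the single sample output above, so it may need adjusting if the page formats the line differently.

```python
import re
from datetime import datetime

# Assumed format, inferred from the one sample above:
# '- <author> · <YYYY-MM-DD HH:MM:SS> -'
POSTTIME_PATTERN = re.compile(
    r'^-\s*(?P<author>.+?)\s*·\s*'
    r'(?P<posted>\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2})\s*-$'
)


def normalize_result(raw):
    """Return a copy of the scraped dict with author, posted_at and a numeric score.

    Hypothetical helper, not part of the original script.
    """
    result = dict(raw)
    match = POSTTIME_PATTERN.match(raw['posttime'].strip())
    if match:
        result['author'] = match.group('author')
        result['posted_at'] = datetime.strptime(
            match.group('posted'), '%Y-%m-%d %H:%M:%S'
        )
    # The score is scraped as text; convert it only when it is purely digits
    score_text = raw['score'].strip()
    result['score'] = int(score_text) if score_text.isdigit() else None
    return result


# The sample result printed by the script above
sample = {
    'posttime': '- 空空star · 2023-05-16 18:35:51 -',
    'remark': '文章质量良好',
    'score': '85',
    'title': '通过Python的wordcloud库将单词生成词云(心形形状)',
    'url': 'https://blog.csdn.net/weixin_38093452/article/details/130704860',
}

cleaned = normalize_result(sample)
print(cleaned['author'], cleaned['posted_at'], cleaned['score'])
```

If the `posttime` string does not match the assumed pattern, the helper simply leaves `author` and `posted_at` out instead of guessing.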
def qc(blog_url):
    """Query the CSDN quality score page for one article and return the result as a dict."""
    options = webdriver.ChromeOptions()
    options.add_argument('--headless')
    driver = webdriver.Chrome(options=options)
    try:
        driver.get('https://www.csdn.net/qc')
        driver.maximize_window()
        # Type the article URL into the input box and click the query button
        driver.find_elements(By.CLASS_NAME, 'el-input__inner')[0].send_keys(blog_url)
        driver.find_elements(By.CLASS_NAME, 'trends-input-box-btn')[0].click()
        # Give the page a moment to render the result
        time.sleep(1)
        title = driver.find_elements(By.TAG_NAME, 'span')[3].text
        posttime = driver.find_elements(By.TAG_NAME, 'span')[4].text
        score = driver.find_elements(By.TAG_NAME, 'p')[1].text
        remark = driver.find_elements(By.TAG_NAME, 'p')[2].text
    finally:
        # Always close the browser, even if a lookup above fails
        driver.quit()
    result = {'title': title, 'posttime': posttime, 'score': score, 'remark': remark, 'url': blog_url}
    pprint.pprint(result)
    return result
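The fixed `time.sleep(1)` works when the page responds quickly, but on a slow connection the result may not be rendered yet and the indexed lookups can fail. One common alternative is to poll for a condition with a timeout. The sketch below shows the idea in plain Python. The name `wait_until` and its defaults are my own assumptions, not part of the original script. Selenium also ships its own explicit-wait class, `WebDriverWait` in `selenium.webdriver.support.ui`, which serves the same purpose.

```python
import time


def wait_until(condition, timeout=10.0, interval=0.2):
    """Call condition() repeatedly until it returns a truthy value.

    Returns that value, or raises TimeoutError once `timeout` seconds have passed.
    Hypothetical helper for illustration only.
    """
    deadline = time.monotonic() + timeout
    while True:
        value = condition()
        if value:
            return value
        if time.monotonic() >= deadline:
            raise TimeoutError(f'condition not met within {timeout} seconds')
        time.sleep(interval)


# Stand-in for a page whose result only shows up on the third check
attempts = {'count': 0}


def page_ready():
    attempts['count'] += 1
    return attempts['count'] >= 3


wait_until(page_ready, timeout=2.0, interval=0.01)
print(attempts['count'])
```

Inside `qc`, the condition could be something like `lambda: len(driver.find_elements(By.TAG_NAME, 'p')) > 2`, so that the script proceeds as soon as the result paragraphs exist instead of always sleeping for a full second. Whether that condition is sufficient depends on how the page renders its result, so treat it as a starting point.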
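That wraps up querying the quality score of a single article. The next post covers "How to quickly get the quality scores of all of an author's articles and save them to Excel".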