Scraping movie titles, actors, ratings, and more with Python and saving them to Excel

1. Locate the folder where Python is installed.
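If it is unclear where Python lives on the machine, the interpreter can report its own location. The following is a minimal sketch using only the standard library; the variable names are illustrative.

```python
import sys
from pathlib import Path

# Full path of the interpreter that is running this snippet
python_exe = Path(sys.executable)
# The folder that contains it; this is where the command prompt is opened in the next step
install_dir = python_exe.parent

print("Interpreter:", python_exe)
print("Install folder:", install_dir)
```

Run it from any Python prompt; the second line printed is the folder to open in the file explorer.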

2. In that folder's address bar, type cmd and press Enter to open a command prompt (the black window).
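3. Install Jupyter Notebook from the command prompt.
Any other packages you need while writing Python can be installed from the same command prompt.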
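Before moving on, it can help to check which of the packages used later are already installed. The sketch below is one way to do that with the standard library; the package list is an assumption based on the imports used in this post (note that the pip package beautifulsoup4 is imported as bs4), and missing_packages is a hypothetical helper name.

```python
import importlib.util

# Maps import name -> pip package name for what this post uses.
# Assumption: Jupyter Notebook is provided by the pip package "notebook".
REQUIRED = {
    "requests": "requests",
    "bs4": "beautifulsoup4",
    "notebook": "notebook",
}

def missing_packages(required):
    """Return the pip names of packages whose import name cannot be found."""
    return [
        pip_name
        for import_name, pip_name in required.items()
        if importlib.util.find_spec(import_name) is None
    ]

missing = missing_packages(REQUIRED)
if missing:
    print("Install with: pip install " + " ".join(missing))
else:
    print("All required packages are installed.")
```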
4. Create a folder for your Python practice, then type jupyter notebook in the console and press Enter to start working.
6. Write the code. Before doing so, learn to analyze how the movie page's source (HTML) is put together.
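To make the HTML analysis concrete, here is a small sketch that parses a hand-written fragment instead of the live page. The markup and the film data are hypothetical and simplified: one dd per film, an i tag for the rank, and four p tags for title, cast, release time, and score. parse_entry is an illustrative helper, not part of any library.

```python
from bs4 import BeautifulSoup

# Hypothetical, simplified markup standing in for the real page
SAMPLE_HTML = """
<dl>
  <dd>
    <i>1</i>
    <p>Example Film</p>
    <p>
        主演:Actor A, Actor B
    </p>
    <p>上映时间:2019-01-01</p>
    <p>9.5</p>
  </dd>
</dl>
"""

def parse_entry(dd):
    """Turn one <dd> element into a (rank, title, cast, release, score) tuple."""
    rank_tag = dd.find("i")
    paragraphs = dd.find_all("p")
    if rank_tag is None or len(paragraphs) < 4:
        return None  # layout differs from what we expect, so skip this entry
    rank = rank_tag.get_text(strip=True)
    title = paragraphs[0].get_text(strip=True)
    # The cast text is padded with newlines and spaces, so strip them all out
    cast = paragraphs[1].get_text().replace("\n", "").replace(" ", "")
    release = paragraphs[2].get_text(strip=True).replace("上映时间", "时间")
    score = paragraphs[3].get_text(strip=True)
    return rank, title, cast, release, score

soup = BeautifulSoup(SAMPLE_HTML, "html.parser")
entries = [parse_entry(dd) for dd in soup.find_all("dd")]
print(entries)
```

Checking the number of p tags before indexing means an entry with an unexpected layout is skipped rather than raising an IndexError.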

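import csv

import requests
from bs4 import BeautifulSoup

url = "https://maoyan.com/board/7"
# Send a browser-like User-Agent so the request looks like a normal visit
headers = {
    'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/77.0.3865.90 Safari/537.36'
}
html = requests.get(url=url, headers=headers, timeout=10)
# print(html.text)
soup = BeautifulSoup(html.text, "html.parser")

# Create the CSV file and write the header row (rank, title, starring, release time, score).
# The raw string (r'...') stops the backslash in the Windows path from being read as an escape;
# 'w' overwrites the file so that rerunning the script does not repeat the header.
with open(r'E:\yxm.csv', 'w', newline='', encoding='utf-8') as fp:
    writer = csv.writer(fp)
    writer.writerow(('排名', '片名', '主演', '上映时间', '评分'))

    # Each film on the board sits in its own <dd> element
    for i in soup.find_all("dd"):
        paragraphs = i.find_all("p")
        rank = i.find("i").get_text()
        filmName = paragraphs[0].get_text()
        # Strip the newlines and spaces that pad the cast text
        protagonist = paragraphs[1].get_text().replace("\n", "").replace(" ", "")
        releaseTime = paragraphs[2].get_text().replace("上映时间", "时间")
        grade = paragraphs[3].get_text()
        writer.writerow((rank, filmName, protagonist, releaseTime, grade))
        print(rank, '\t', filmName, '\t', protagonist, '\t', releaseTime, '\t', grade)
# The with block closes the file automatically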

7. Write the data to a CSV file

import csv

# This block reuses the soup object built in step 6.
# Create the CSV file and write the header row (rank, title, starring, release time, score).
# The raw string (r'...') stops the backslash in the Windows path from being read as an escape.
with open(r'G:\maoyanTop10.csv', 'w', newline='', encoding='utf-8') as fp:
    writer = csv.writer(fp)
    writer.writerow(('排名', '片名', '主演', '上映时间', '评分'))

    for i in soup.find_all("dd"):
        paragraphs = i.find_all("p")
        rank = i.find("i").get_text()
        filmName = paragraphs[0].get_text()
        protagonist = paragraphs[1].get_text().replace("\n", "").replace(" ", "")
        releaseTime = paragraphs[2].get_text().replace("上映时间", "时间")
        grade = paragraphs[3].get_text()
        # Write one row per film
        writer.writerow((rank, filmName, protagonist, releaseTime, grade))
# The with block closes the file automatically
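Since the goal is to open the result in Excel, the file encoding matters: Excel often shows Chinese text in a plain utf-8 CSV as garbled characters, while the utf-8-sig encoding adds a byte order mark that Excel uses to detect UTF-8. The sketch below shows the idea with hypothetical rows and a temporary output path, both of which are assumptions for illustration.

```python
import csv
import os
import tempfile

# Hypothetical rows standing in for the scraped results
rows = [
    ("1", "Example Film", "主演:ActorA,ActorB", "时间:2019-01-01", "9.5"),
]
header = ("排名", "片名", "主演", "上映时间", "评分")

# Written to the temp folder so the sketch runs on any machine
out_path = os.path.join(tempfile.gettempdir(), "maoyan_demo.csv")

# utf-8-sig prepends a byte order mark so Excel recognizes the file as UTF-8
with open(out_path, "w", newline="", encoding="utf-8-sig") as fp:
    writer = csv.writer(fp)
    writer.writerow(header)
    writer.writerows(rows)

# Read the file back to confirm what was stored
with open(out_path, newline="", encoding="utf-8-sig") as fp:
    stored = list(csv.reader(fp))
print(stored)
```

Swapping encoding='utf-8' for encoding='utf-8-sig' in the open call is the only change needed to apply this to the scraper.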

8. Output
[Figure 3: screenshot of the output]
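9. Summary
Learning to code takes patience. It took me a long time, but getting the result I wanted still made me really happy. Thanks to my classmate 阿优乐杨 (杨友) for the patient guidance; I am very grateful.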
