Python采集淘宝商品评论API接口,json数据示例参考

Python采集淘宝商品评论API接口及JSON数据示例

淘宝并没有公开的官方API供开发者直接采集商品评论数据,但可以通过以下几种方式获取评论数据:

方法一:使用淘宝开放平台API(需要申请权限)

淘宝开放平台提供了部分评论相关API,如taobao.traderate.list.add等,但需要企业资质和申请权限。

示例JSON响应结构

{
"traderate_list_add_response": {
"trade_rates": {
"trade_rate": [
{
"tid": 1234567890,
"oid": 1234567890,
"role": "buyer",
"result": "good",
"created": "2023-01-01 12:00:00",
"rated_nick": "买家昵称",
"item_title": "商品标题",
"item_price": 99.9,
"content": "商品质量很好,物流快",
"reply": "感谢您的支持",
"auction_sku": "颜色:红色;尺寸:XL"
}
]
},
"total_results": 100
}
}

方法二:模拟网页请求采集(反爬较严格)

淘宝网页版评论数据通过AJAX加载,可以分析其接口。

示例代码

import requests
import json
import time
import random
from urllib.parse import quote
def get_taobao_comments(item_id, page=1):
headers = {
'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/91.0.4472.124 Safari/537.36',
'Referer': f'https://item.taobao.com/item.htm?id={item_id}'
}
# 淘宝评论API接口(可能变化)
url = f'https://rate.taobao.com/feedRateList.htm?auctionNumId={item_id}¤tPageNum={page}'
try:
response = requests.get(url, headers=headers)
# 返回的是JSONP格式,需要处理
json_str = response.text.strip().lstrip('(').rstrip(')')
data = json.loads(json_str)
return data
except Exception as e:
print(f"获取评论失败: {e}")
return None
# 使用示例
item_id = "1234567890" # 替换为实际商品ID
comments_data = get_taobao_comments(item_id)
if comments_data:
print(json.dumps(comments_data, indent=2, ensure_ascii=False))

返回的JSON数据示例

{
"comments": [
{
"auction": {
"sku": "颜色:红色;尺寸:XL",
"price": "99.90"
},
"user": {
"nick": "买家昵称",
"vipLevel": 3,
"vip": "普通会员"
},
"rateContent": "商品质量很好,物流快",
"rateDate": "2023-01-01 12:00:00",
"reply": "感谢您的支持",
"pics": ["https://img.alicdn.com/imgextra/i1/123456789/O1CN01abc123.jpg"],
"appendComment": {
"content": "追加评论:用了一个月还是很满意",
"days": 15
}
}
],
"currentPageNum": 1,
"totalPage": 10,
"tags": [
{"tag": "质量好", "count": 50},
{"tag": "物流快", "count": 30}
]
}

方法三:使用第三方数据服务

一些第三方数据服务商提供淘宝评论采集API,如:

  • 淘宝客API
  • 某些数据采集平台

第三方API示例响应

{
"code": 200,
"message": "success",
"data": {
"item_id": "1234567890",
"total_comments": 1250,
"comments": [
{
"id": "123456789",
"user_name": "匿名用户",
"user_level": "V3",
"content": "包装完好,质量不错",
"star": 5,
"sku": "默认",
"images": ["https://img.example.com/1.jpg"],
"create_time": "2023-01-15 14:22:33",
"append_comment": {
"content": "使用良好",
"create_time": "2023-02-01 09:15:22"
}
}
]
}
}

你可能感兴趣的:(python,API,python,json,大数据)