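1. The page I want to scrape is 'https://mtj.baidu.com/data/mobile/device'.
2. Open a browser and enter the URL to load the page.
3. I want to scrape the table on the right-hand side for each of five categories: brand, model, OS, resolution, and network.
4. Using the PyCharm IDE, install the selenium module with pip install selenium (make sure pip is ticked when you install Python). The code also uses xlwt, which is installed the same way: pip install xlwt.
5. The code is as follows:

    import time

    import xlwt
    from selenium import webdriver
    from selenium.webdriver.chrome.options import Options
    from selenium.webdriver.common.by import By

    url = 'https://mtj.baidu.com/data/mobile/device'


    def click_class(class_name):
        # Click the first element with this class, retrying up to 10 times.
        for trytimes in range(0, 10):
            # noinspection PyBroadException
            try:
                browser.find_element(By.CLASS_NAME, class_name).click()
                return True
            except Exception:
                time.sleep(1)
        return False


    def find_class(class_name):
        # Return every element with this class, retrying up to 10 times.
        for trytimes in range(0, 10):
            # noinspection PyBroadException
            try:
                element = browser.find_elements(By.CLASS_NAME, class_name)
                if element:
                    return element
            except Exception:
                pass
            time.sleep(1)
        return []


    def save_excel(data_dict):
        # One sheet per category: headers in row 0, then name and rank columns.
        fileName = u'百度研究学院移动平台.xls'
        wb = xlwt.Workbook(encoding='utf-8')
        for d in data_dict:
            sheet = wb.add_sheet(d, cell_overwrite_ok=True)
            for col, head in enumerate(['name', 'rank']):
                sheet.write(0, col, head)
            for i, data in enumerate(data_dict[d]):
                sheet.write(i // 2 + 1, i % 2, data)
        wb.save(fileName)


    def refresh():
        try:
            browser.refresh()  # reload the page
            print('test pass: refresh successful')
        except Exception as e:
            print('Exception found', format(e))


    # Data for the 5 categories; list_button holds each button's class_name.
    list_button = ['icon-brand', 'icon-device', 'icon-os', 'icon-screen', 'icon-network']
    data_dict = {button: [] for button in list_button}

    ######################################################################################
    browser = webdriver.Chrome()
    browser.maximize_window()
    browser.get(url)

    # Click each of the 5 buttons and save its data: brand, model, OS, resolution, network.
    for button in list_button:
        print('************', button, '********************')
        click_class(button)
        time.sleep(2)  # let the table reload before reading it
        element_name = find_class('dtd1')
        element_rank = find_class('dtd3')
        listen = min(len(element_name), len(element_rank))
        name_list = [name.get_attribute('textContent') for name in element_name]
        rank_list = [rank.get_attribute('textContent') for rank in element_rank]
        for i in range(0, listen):
            data_dict[button].append(name_list[i])
            data_dict[button].append(rank_list[i])

    save_excel(data_dict)
    browser.quit()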
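The script stores each category as one flat list that alternates name and rank values, and it uses `i % 2` as the column index when writing the sheet. The following is a minimal sketch of that layout, not the script itself. It assumes one header row and that the row index is `i // 2` plus the header offset. The helper names `interleave` and `cell_positions` and the sample values are made up for illustration, and nothing here touches the browser or xlwt.

```python
def interleave(names, ranks):
    """Flatten two parallel lists into name, rank, name, rank, ..."""
    flat = []
    for name, rank in zip(names, ranks):
        flat.append(name)
        flat.append(rank)
    return flat


def cell_positions(flat, header_rows=1):
    """Map each item of the flat list to a (row, column, value) cell.

    Even indexes land in column 0 (names), odd indexes in column 1 (ranks).
    """
    return [(i // 2 + header_rows, i % 2, value) for i, value in enumerate(flat)]


# Made-up sample values, not data taken from the page.
names = ["brand-a", "brand-b", "brand-c"]
ranks = ["50%", "30%", "20%"]

cells = cell_positions(interleave(names, ranks))
for row, column, value in cells:
    print(row, column, value)
```

Each tuple can be passed straight to a worksheet's `write(row, column, value)` call, so a name and its rank end up side by side on the same row.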
6. The generated spreadsheet looks like this:
|