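Boss Zhipin Big-Data Job Crawler in Practice: Nationwide City Job Data, Salary and Education Analysis, and Skill-Map Visualization Source Code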

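Uploader: l1k9j8h7g6 | Uploaded: 2026-06-01 14:43:28 | File size: 246KB | File type: ZIP
This project uses Python and Scrapy to crawl Boss Zhipin job postings for big data, data analysis, data mining, machine learning, and artificial intelligence roles in Beijing, Shanghai, Guangzhou, Shenzhen, Hangzhou, and other popular cities, covering the main regions of the country. It produces a structured CSV data file (全国-热门城市岗位数据.csv) that can be read directly for analysis.

The code covers the full crawler workflow: request scheduling, anti-anti-crawling handling (customized in middlewares.py), data cleaning (the clean directory), field extraction (items.py), storage pipelines (pipelines.py), and a run script (runspider.py). The accompanying README.md describes the deployment steps and environment dependencies, and settings.py exposes parameters such as concurrency, request headers, and download delay.

The analysis dimensions are: average salary distribution per role, comparison of the prevailing education requirements, hiring demand in first-tier versus new first-tier cities, industry concentration (for example internet, finance, and e-commerce), a high-frequency skill word cloud (Python, SQL, Hadoop, Spark, TensorFlow, and others), and correlations among skill requirements. All source code, raw collected data, configuration files, and project documentation are included and ready to use, which makes the package suitable for course projects, job-application portfolios, or introductory data-analysis practice.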
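As a rough illustration of the parameters the description says settings.py exposes (concurrency, request headers, delay), here is a minimal sketch of such a file. The setting names are standard Scrapy settings and the package name `bosszp` comes from the archive, but every value and both class names (`BosszpDownloaderMiddleware`, `BosszpPipeline`) are assumptions and are not taken from the packaged settings.py.

```python
# Illustrative sketch of bosszp/settings.py; values are assumptions, not the packaged file.

BOT_NAME = "bosszp"
SPIDER_MODULES = ["bosszp.spiders"]
NEWSPIDER_MODULE = "bosszp.spiders"

# Keep concurrency low and space requests out to reduce load on the site.
CONCURRENT_REQUESTS = 4
DOWNLOAD_DELAY = 3                # seconds between requests to the same site
RANDOMIZE_DOWNLOAD_DELAY = True   # vary the wait around DOWNLOAD_DELAY

# Headers sent with every request unless a middleware overrides them.
DEFAULT_REQUEST_HEADERS = {
    "Accept": "text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8",
    "Accept-Language": "zh-CN,zh;q=0.9",
}

# Hypothetical class names: check middlewares.py and pipelines.py for the real ones.
DOWNLOADER_MIDDLEWARES = {
    "bosszp.middlewares.BosszpDownloaderMiddleware": 543,
}
ITEM_PIPELINES = {
    "bosszp.pipelines.BosszpPipeline": 300,
}
```

Lower concurrency and a longer delay trade crawl speed for a smaller chance of being blocked. The right values depend on the target site and should be tuned against the README's guidance.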

Resource details

(36 files, 246KB) Boss Zhipin Big-Data Job Crawler in Practice: Nationwide City Job Data, Salary and Education Analysis, and Skill-Map Visualization Source Code

zNR8uEwht3KLVDkfwWRT-master-ffd54b51d54765bd86bbe3ff42e21ca9136737e6/
    init_db.py  (1.72KB)
    scrapy.cfg  (255B)
    runspider.py  (149B)
    全国-热门城市岗位数据.csv  (37.79KB)
    requirements.txt  (70B)
    .gitignore  (1.16KB)
    .inscode  (217B)
    bosszp/
        __init__.py  (0B)
        pipelines.py  (812B)
        clean/
            __init__.py  (52B)
            dataclean.py  (3.66KB)
        web/
            __init__.py  (52B)
            dbutils.py  (3.22KB)
            templates/
                index.html  (3.38KB)
            run.py  (4.06KB)
            static/
                js/
                    pie.js  (1.73KB)
                    jquery-1.8.3.min.js  (91.44KB)
                    packgebubble.js  (1.92KB)
                    word.js  (1.36KB)
                    fan.js  (1.75KB)
                    order.js  (1.13KB)
                    cylindrical.js  (2.02KB)
                img/
                    bg.png  (38.50KB)
                    favicon.png  (2.19KB)
                css/
                    mystyle.css  (1.73KB)
                highcharts/
                    highcharts.js  (268.23KB)
                    dark-unica.js  (2.97KB)
                    oldie.js  (25.56KB)
                    wordcloud.js  (8.87KB)
                    highcharts-more.js  (81.73KB)
        spiders/
            __init__.py  (161B)
            boss.py  (6.06KB)
        items.py  (640B)
        settings.py  (3.27KB)
        middlewares.py  (3.56KB)
    README.md  (31.41KB)
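The archive ships the collected data as 全国-热门城市岗位数据.csv, and average salary is one of the stated analysis dimensions. The sketch below shows one way such an average could be computed with only the standard library. The column names (`city`, `title`, `salary`), the sample rows, and the salary format (a monthly range such as `15-25K`, optionally followed by a suffix like `·14薪`) are all assumptions for illustration; the real schema should be checked against the CSV header and dataclean.py.

```python
import csv
import io
import re
from collections import defaultdict
from statistics import mean

# Hypothetical sample rows; the real CSV's columns and values may differ.
SAMPLE_CSV = """city,title,salary
Beijing,Data Analyst,15-25K
Beijing,ML Engineer,30-50K·14薪
Shanghai,Data Engineer,20-30K
Shanghai,Data Mining Intern,200-300元/天
"""

# Matches a leading monthly range such as "15-25K" (assumed format).
SALARY_RE = re.compile(r"^(\d+)-(\d+)K", re.IGNORECASE)


def salary_midpoint_k(text):
    """Return the midpoint of a monthly range in thousands of yuan, or None."""
    match = SALARY_RE.match(text.strip())
    if match is None:
        return None  # e.g. daily rates, which are skipped rather than converted
    low, high = int(match.group(1)), int(match.group(2))
    return (low + high) / 2


def mean_salary_by_city(csv_file):
    """Average the salary midpoints per city, ignoring unparseable rows."""
    buckets = defaultdict(list)
    for row in csv.DictReader(csv_file):
        midpoint = salary_midpoint_k(row["salary"])
        if midpoint is not None:
            buckets[row["city"]].append(midpoint)
    return {city: mean(values) for city, values in buckets.items()}


result = mean_salary_by_city(io.StringIO(SAMPLE_CSV))
print(result)
```

To run this against the packaged file, the `io.StringIO` object could be replaced with an opened file handle. The file's encoding is not stated on this page, so it would need to be checked first.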
