【網站名稱】
https://www.bibloo.com/
https://www.next.co.uk/
https://www.johnlewis.com/
【爬蟲要求】
1類目:爬取網站女裝、男裝、童裝服飾圖片和信息,不是每個網站這三類都有,有什么爬什么。2內容:爬取每個商品的一組商品圖,商品信息以json格式記錄,包含商品標題、品牌、商品鏈接、圖片鏈接、圖片存儲路徑等內容,類似于{"title": "3 PACK - Basic T-shirt", "color": "white/camel/black", --如果選擇不同顏色,會出現不同的商品圖,那么一種顏色算作一組商品。"detail_url": "https://www.zalando.co.uk/anna-field-3-pack-basic-t-shirt-whitecamelblack-an621d0ph-a13.html", "image_url": ["https://img01.ztat.net/article/spp-media-p1/c0958b828c8d35d984a1786c1c707dc2/4bbff84f2d3c40e4badf3255c519aeac.jpg?imwidth=1800", "https://img01.ztat.net/article/spp-media-p1/ff3d9e6fa62c3910ae7c3f1ee36e2f60/4a1d618840044bf48050b61f3af6a20f.jpg?imwidth=1800", "https://img01.ztat.net/article/spp-media-p1/e3c5407bdfdd36f5a151d23df2abb9d8/1792f13991eb4f09b42e6aa5471dd78e.jpg?imwidth=1800", "https://img01.ztat.net/article/spp-media-p1/2682886df9ec33d8aaf0d87c343b1f33/1899c4648410408eb69cb60c3e22ef8a.jpg?imwidth=1800", "https://img01.ztat.net/article/spp-media-p1/d6f9ff08a9423d52b373cb14dc05a686/95f707cd34764deda00bf1782f6aacd6.jpg?imwidth=1800", "https://img01.ztat.net/article/spp-media-p1/ac4bb5cc243333dd9096580de36aa14c/a805e6a4136a4fde8ad391962948c357.jpg?imwidth=1800&filter=packshot"], "path": ["pachong/zalando/img/4bbff84f2d3c40e4badf3255c519aeac.jpg", "pachong/zalando/img/4a1d618840044bf48050b61f3af6a20f.jpg", "pachong/zalando/img/1792f13991eb4f09b42e6aa5471dd78e.jpg", "pachong/zalando/img/1899c4648410408eb69cb60c3e22ef8a.jpg", "pachong/zalando/img/95f707cd34764deda00bf1782f6aacd6.jpg", "pachong/zalando/img/a805e6a4136a4fde8ad391962948c357.jpg", "pachong/zalando/img/00212fca4b7e4cbfb067e1afd6b42dc3.jpg"], "timestamp": 1716957618}
【輸出要求】
數據存儲格式如右,按網站、類目和具體數據類型目錄結構存儲。最后如果網站女裝有1w個商品,那就是1w條json,和差不多4w+的圖片。爬取完成后
1 告知每個類目圖片和json數量,
2 將數據根據右側目錄結構整體打包成zip文件傳送交付