Scraping segmentfault recommendations

Templates · tesths · 2 weeks ago (10-06) · 953 views · 0 comments

1. Data fields

  • Title
  • Summary
  • Likes
  • Author
  • Publish time
  • Detail link
  • Tags
  • Views
  • Content
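
For post-processing an export, the fields above can be tied to the selector ids used in the sitemap in section 3. The following is a minimal sketch, not part of the original template: the English labels and the `relabel` helper are this example's own naming, and the `link-href` key assumes the Link selector exports the URL under that column name.

```python
# Selector ids come from the sitemap JSON in section 3.
# The labels and the "link-href" column name are assumptions of this sketch.
FIELD_LABELS = {
    "title": "Title",
    "intro": "Summary",
    "like": "Likes",
    "author": "Author",
    "publlsh": "Publish time",  # the id is spelled this way in the sitemap
    "link-href": "Detail link",
    "tag": "Tags",
    "read": "Views",
    "content": "Content",
}


def relabel(row):
    """Return a new dict keyed by field label, in the order listed above.

    Columns missing from the row are filled with an empty string.
    """
    return {label: row.get(key, "") for key, label in FIELD_LABELS.items()}


# Hypothetical exported row, for illustration only.
sample_row = {"title": "Hello", "like": "12", "author": "someone"}
print(relabel(sample_row))
```

Keeping the mapping in one dict means a renamed selector id only has to be updated in one place.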

2. Example result screenshot

(Screenshot: scraping segmentfault recommendations)

3. Sitemap JSON

{"_id":"segmentfault","startUrl":["https://segmentfault.com/"],"selectors":[{"id":"element","type":"SelectorElementScroll","parentSelectors":["_root"],"selector":"div.news-item:nth-of-type(-n+80)","multiple":true,"delay":"2000"},{"id":"title","type":"SelectorText","parentSelectors":["element"],"selector":"h4","multiple":false,"regex":"","delay":0},{"id":"intro","type":"SelectorText","parentSelectors":["element"],"selector":"div.article-excerpt","multiple":false,"regex":"","delay":0},{"id":"like","type":"SelectorText","parentSelectors":["element"],"selector":"span.votes-num","multiple":false,"regex":"","delay":0},{"id":"author","type":"SelectorText","parentSelectors":["element"],"selector":".author a","multiple":false,"regex":"","delay":0},{"id":"publlsh","type":"SelectorHTML","parentSelectors":["element"],"selector":"span.author","multiple":false,"regex":"(?<=(</span>)).*","delay":0},{"id":"link","type":"SelectorLink","parentSelectors":["element"],"selector":"a[target]","multiple":false,"delay":0},{"id":"tag","type":"SelectorText","parentSelectors":["link"],"selector":".tagPopup a","multiple":false,"regex":"","delay":0},{"id":"read","type":"SelectorText","parentSelectors":["link"],"selector":".content__tech span","multiple":false,"regex":"","delay":0},{"id":"content","type":"SelectorText","parentSelectors":["link"],"selector":"div.article","multiple":false,"regex":"","delay":0}]}
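
To see how the selectors nest, the sitemap can be loaded and printed as a tree. The sketch below embeds the sitemap above (reflowed one selector per line) and groups selector ids by parent. It also tries the `publlsh` regex on a sample HTML fragment; that fragment is hypothetical, and Python's `re` stands in for the JavaScript regex engine the browser extension would actually use.

```python
import json
import re
from collections import defaultdict

# The sitemap from this section, reflowed one selector per line.
SITEMAP_JSON = r"""
{"_id":"segmentfault","startUrl":["https://segmentfault.com/"],"selectors":[
{"id":"element","type":"SelectorElementScroll","parentSelectors":["_root"],"selector":"div.news-item:nth-of-type(-n+80)","multiple":true,"delay":"2000"},
{"id":"title","type":"SelectorText","parentSelectors":["element"],"selector":"h4","multiple":false,"regex":"","delay":0},
{"id":"intro","type":"SelectorText","parentSelectors":["element"],"selector":"div.article-excerpt","multiple":false,"regex":"","delay":0},
{"id":"like","type":"SelectorText","parentSelectors":["element"],"selector":"span.votes-num","multiple":false,"regex":"","delay":0},
{"id":"author","type":"SelectorText","parentSelectors":["element"],"selector":".author a","multiple":false,"regex":"","delay":0},
{"id":"publlsh","type":"SelectorHTML","parentSelectors":["element"],"selector":"span.author","multiple":false,"regex":"(?<=(</span>)).*","delay":0},
{"id":"link","type":"SelectorLink","parentSelectors":["element"],"selector":"a[target]","multiple":false,"delay":0},
{"id":"tag","type":"SelectorText","parentSelectors":["link"],"selector":".tagPopup a","multiple":false,"regex":"","delay":0},
{"id":"read","type":"SelectorText","parentSelectors":["link"],"selector":".content__tech span","multiple":false,"regex":"","delay":0},
{"id":"content","type":"SelectorText","parentSelectors":["link"],"selector":"div.article","multiple":false,"regex":"","delay":0}
]}
"""


def children_by_parent(sitemap):
    """Map each parent selector id to the ids of its child selectors."""
    tree = defaultdict(list)
    for sel in sitemap["selectors"]:
        for parent in sel["parentSelectors"]:
            tree[parent].append(sel["id"])
    return dict(tree)


def print_tree(tree, node="_root", depth=0):
    """Print the selector hierarchy, indenting one level per generation."""
    for child in tree.get(node, []):
        print("  " * depth + child)
        print_tree(tree, child, depth + 1)


sitemap = json.loads(SITEMAP_JSON)
tree = children_by_parent(sitemap)
print_tree(tree)

# Try the "publlsh" regex on a made-up fragment of span.author's inner HTML.
publish_regex = next(s["regex"] for s in sitemap["selectors"] if s["id"] == "publlsh")
sample_html = '<a href="/u/example">example</a><span class="split"> · </span>3 hours ago'
match = re.search(publish_regex, sample_html)
print(match.group(0) if match else None)
```

The tree shows `tag`, `read` and `content` under `link`, so those three values are read from the detail page that each link opens, while the remaining fields come from the list items on the start page.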
