Uses of Package
us.codecraft.webmagic.selector
Packages that use us.codecraft.webmagic.selector

Package                             Description
us.codecraft.webmagic               Main class "Spider" and models.
us.codecraft.webmagic.configurable
us.codecraft.webmagic.downloader    Downloader is the part that downloads web pages and stores them in a Page object.
us.codecraft.webmagic.model         Page model and annotations used to customize a crawler.
us.codecraft.webmagic.selector      Selectors for page extraction.
us.codecraft.webmagic.utils         Static utilities of webmagic.

Classes in us.codecraft.webmagic.selector used by us.codecraft.webmagic

Class                 Description
Html                  Selectable HTML.
Json                  Parses JSON.
Selectable            Selectable text.

Classes in us.codecraft.webmagic.selector used by us.codecraft.webmagic.configurable

Class                 Description
Selector              Selector (extractor) for text.

Classes in us.codecraft.webmagic.selector used by us.codecraft.webmagic.downloader

Class                 Description
Html                  Selectable HTML.

Classes in us.codecraft.webmagic.selector used by us.codecraft.webmagic.model

Class                 Description
Selector              Selector (extractor) for text.

Classes in us.codecraft.webmagic.selector used by us.codecraft.webmagic.selector

Class                 Description
AbstractSelectable
AndSelector           All selectors will be arranged as a pipeline.
BaseElementSelector
CssSelector           CSS selector.
ElementSelector       Selector (extractor) for HTML elements.
Html                  Selectable HTML.
HtmlNode
Json                  Parses JSON.
NodeSelector          Selector (extractor) for an HTML node.
OrSelector            All extractors will do their extracting separately, and the results of the extractors will be combined as the final result.
PlainText             Selectable plain text. Cannot be selected by XPath or CSS selector.
RegexSelector         Selector in regex.
Selectable            Selectable text.
Selector              Selector (extractor) for text.
SmartContentSelector  Borrowed from https://code.google.com/p/cx-extractor/
Xpath2Selector        Selector with XPath 2.0 support. Wraps HtmlCleaner and Saxon HE.
XpathSelector         XPath selector based on Xsoup.

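The difference between the AndSelector and OrSelector descriptions above (a pipeline versus separately extracted, combined results) may be easier to see in code. The following is only a minimal sketch of the two composition styles and does not use the webmagic classes themselves: the `TextSelector` interface and the `regex`, `and`, and `or` helpers are hypothetical stand-ins introduced for illustration, built on `java.util.regex` alone.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class SelectorCompositionSketch {

    /** Hypothetical stand-in for a text selector (not the webmagic Selector interface). */
    public interface TextSelector {
        List<String> selectList(String text);
    }

    /** Hypothetical helper: returns capture group 1 of every regex match in the input. */
    public static TextSelector regex(String pattern) {
        Pattern p = Pattern.compile(pattern);
        return text -> {
            List<String> out = new ArrayList<>();
            Matcher m = p.matcher(text);
            while (m.find()) {
                out.add(m.group(1));
            }
            return out;
        };
    }

    /** "And" style: a pipeline, where each selector runs on the previous selector's results. */
    public static TextSelector and(TextSelector... selectors) {
        return text -> {
            List<String> current = new ArrayList<>();
            current.add(text);
            for (TextSelector s : selectors) {
                List<String> next = new ArrayList<>();
                for (String piece : current) {
                    next.addAll(s.selectList(piece));
                }
                current = next;
            }
            return current;
        };
    }

    /** "Or" style: every selector runs on the original input; the results are concatenated. */
    public static TextSelector or(TextSelector... selectors) {
        return text -> {
            List<String> out = new ArrayList<>();
            for (TextSelector s : selectors) {
                out.addAll(s.selectList(text));
            }
            return out;
        };
    }

    public static void main(String[] args) {
        String html = "<a href=\"/p/1\">one</a> <a href=\"/q/2\">two</a>";
        TextSelector hrefs = regex("href=\"([^\"]+)\"");
        TextSelector digits = regex("(\\d+)");
        TextSelector linkText = regex(">(\\w+)<");

        // Pipeline: extract the href values first, then the digits inside each href.
        System.out.println(and(hrefs, digits).selectList(html));   // prints [1, 2]

        // Separate extraction: hrefs and link texts are both taken from the full input.
        System.out.println(or(hrefs, linkText).selectList(html));  // prints [/p/1, /q/2, one, two]
    }
}
```

In the pipeline case the second selector never sees the full page, only what the first one produced; in the combined case every selector sees the full page and the order of the arguments only affects the order of the results.
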
Classes in us.codecraft.webmagic.selector used by us.codecraft.webmagic.utils

Class                 Description
Selector              Selector (extractor) for text.