Package us.codecraft.webmagic.selector
Class HtmlNode
java.lang.Object
us.codecraft.webmagic.selector.AbstractSelectable
us.codecraft.webmagic.selector.HtmlNode
- All Implemented Interfaces:
Selectable
- Direct Known Subclasses:
Html
- Author:
- code4crafer@gmail.com
-
Constructor Summary
Constructors -
Method Summary
Modifier and TypeMethodDescriptionselect list with css selectorselect list with css selectorprotected List<org.jsoup.nodes.Element>links()select all linksnodes()get all nodesextract by custom selectorprotected SelectableselectElements(BaseElementSelector elementSelector) select elementsselectList(Selector selector) extract by custom selectorselect list with xpathMethods inherited from class us.codecraft.webmagic.selector.AbstractSelectable
all, css, css, get, getFirstSourceText, jsonPath, match, regex, regex, replace, select, selectList, toString
-
Constructor Details
-
HtmlNode
-
HtmlNode
public HtmlNode()
-
-
Method Details
-
getElements
-
smartContent
-
links
Description copied from interface:Selectableselect all links- Returns:
- all links
-
xpath
Description copied from interface:Selectableselect list with xpath- Parameters:
xpath- xpath- Returns:
- new Selectable after extract
-
selectList
Description copied from interface:Selectableextract by custom selector- Specified by:
selectListin interfaceSelectable- Overrides:
selectListin classAbstractSelectable- Parameters:
selector- selector- Returns:
- result
-
select
Description copied from interface:Selectableextract by custom selector- Specified by:
selectin interfaceSelectable- Overrides:
selectin classAbstractSelectable- Parameters:
selector- selector- Returns:
- result
-
selectElements
select elements- Parameters:
elementSelector- elementSelector- Returns:
- result
-
$
Description copied from interface:Selectableselect list with css selector- Parameters:
selector- css selector expression- Returns:
- new Selectable after extract
-
$
Description copied from interface:Selectableselect list with css selector- Parameters:
selector- css selector expressionattrName- attribute name of css selector- Returns:
- new Selectable after extract
-
nodes
Description copied from interface:Selectableget all nodes- Returns:
- result
-
getSourceTexts
- Specified by:
getSourceTextsin classAbstractSelectable
-