Package us.codecraft.webmagic.selector
Class HtmlNode
- java.lang.Object
-
- us.codecraft.webmagic.selector.AbstractSelectable
-
- us.codecraft.webmagic.selector.HtmlNode
-
- All Implemented Interfaces:
Selectable
- Direct Known Subclasses:
Html
public class HtmlNode extends AbstractSelectable
- Author:
- code4crafer@gmail.com
-
-
Method Summary
All Methods Instance Methods Concrete Methods Modifier and Type Method Description Selectable
$(java.lang.String selector)
select list with css selectorSelectable
$(java.lang.String selector, java.lang.String attrName)
select list with css selectorprotected java.util.List<org.jsoup.nodes.Element>
getElements()
protected java.util.List<java.lang.String>
getSourceTexts()
Selectable
links()
select all linksjava.util.List<Selectable>
nodes()
get all nodesSelectable
select(Selector selector)
extract by custom selectorprotected Selectable
selectElements(BaseElementSelector elementSelector)
select elementsSelectable
selectList(Selector selector)
extract by custom selectorSelectable
smartContent()
Selectable
xpath(java.lang.String xpath)
select list with xpath-
Methods inherited from class us.codecraft.webmagic.selector.AbstractSelectable
all, css, css, get, getFirstSourceText, jsonPath, match, regex, regex, replace, select, selectList, toString
-
-
-
-
Method Detail
-
getElements
protected java.util.List<org.jsoup.nodes.Element> getElements()
-
smartContent
public Selectable smartContent()
-
links
public Selectable links()
Description copied from interface:Selectable
select all links- Returns:
- all links
-
xpath
public Selectable xpath(java.lang.String xpath)
Description copied from interface:Selectable
select list with xpath- Parameters:
xpath
- xpath- Returns:
- new Selectable after extract
-
selectList
public Selectable selectList(Selector selector)
Description copied from interface:Selectable
extract by custom selector- Specified by:
selectList
in interfaceSelectable
- Overrides:
selectList
in classAbstractSelectable
- Parameters:
selector
- selector- Returns:
- result
-
select
public Selectable select(Selector selector)
Description copied from interface:Selectable
extract by custom selector- Specified by:
select
in interfaceSelectable
- Overrides:
select
in classAbstractSelectable
- Parameters:
selector
- selector- Returns:
- result
-
selectElements
protected Selectable selectElements(BaseElementSelector elementSelector)
select elements- Parameters:
elementSelector
- elementSelector- Returns:
- result
-
$
public Selectable $(java.lang.String selector)
Description copied from interface:Selectable
select list with css selector- Parameters:
selector
- css selector expression- Returns:
- new Selectable after extract
-
$
public Selectable $(java.lang.String selector, java.lang.String attrName)
Description copied from interface:Selectable
select list with css selector- Parameters:
selector
- css selector expressionattrName
- attribute name of css selector- Returns:
- new Selectable after extract
-
nodes
public java.util.List<Selectable> nodes()
Description copied from interface:Selectable
get all nodes- Returns:
- result
-
getSourceTexts
protected java.util.List<java.lang.String> getSourceTexts()
- Specified by:
getSourceTexts
in classAbstractSelectable
-
-