Package us.codecraft.webmagic.selector
Class HtmlNode
java.lang.Object
  us.codecraft.webmagic.selector.AbstractSelectable
    us.codecraft.webmagic.selector.HtmlNode
- All Implemented Interfaces: Selectable
- Direct Known Subclasses: Html
public class HtmlNode extends AbstractSelectable
- Author: code4crafer@gmail.com
Method Summary
| Modifier and Type | Method | Description |
|---|---|---|
| Selectable | $(java.lang.String selector) | Selects a list of results with a CSS selector. |
| Selectable | $(java.lang.String selector, java.lang.String attrName) | Selects a list of results with a CSS selector and extracts the named attribute. |
| protected java.util.List<org.jsoup.nodes.Element> | getElements() | |
| protected java.util.List<java.lang.String> | getSourceTexts() | |
| Selectable | links() | Selects all links. |
| java.util.List<Selectable> | nodes() | Gets all nodes. |
| Selectable | select(Selector selector) | Extracts by a custom selector. |
| protected Selectable | selectElements(BaseElementSelector elementSelector) | Selects elements. |
| Selectable | selectList(Selector selector) | Extracts by a custom selector. |
| Selectable | smartContent() | Selects the main content using the Readability algorithm. |
| Selectable | xpath(java.lang.String xpath) | Selects a list of results with an XPath expression. |
Methods inherited from class us.codecraft.webmagic.selector.AbstractSelectable
all, css, css, get, getFirstSourceText, jsonPath, match, regex, regex, replace, select, selectList, toString
Method Detail

getElements
protected java.util.List<org.jsoup.nodes.Element> getElements()

smartContent
public Selectable smartContent()
Description copied from interface: Selectable
Selects the main content using the Readability algorithm.
- Returns:
  the content

links
public Selectable links()
Description copied from interface: Selectable
Selects all links.
- Returns:
  all links

xpath
public Selectable xpath(java.lang.String xpath)
Description copied from interface: Selectable
Selects a list of results with an XPath expression.
- Parameters:
  xpath - the XPath expression
- Returns:
  a new Selectable holding the extracted results
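
A minimal sketch of calling xpath, assuming the WebMagic library is on the classpath and that the node is obtained through Html, the direct known subclass listed above. The class name XpathSketch, the helper hrefs, and the sample markup are hypothetical and only serve the illustration.

```java
import java.util.List;

import us.codecraft.webmagic.selector.Html;

public class XpathSketch {

    // Hypothetical helper: returns the href value of every <a> element in the markup.
    static List<String> hrefs(String markup) {
        // Html extends HtmlNode and parses the markup it is given.
        Html html = new Html(markup);
        // xpath(...) returns a new Selectable; all() (inherited) collects the results as strings.
        return html.xpath("//a/@href").all();
    }

    public static void main(String[] args) {
        // Sample markup, made up for this sketch.
        String markup = "<div><a href='/first'>One</a><a href='/second'>Two</a></div>";
        for (String href : hrefs(markup)) {
            System.out.println(href);
        }
    }
}
```

Because the call returns a Selectable rather than a plain list, further selections can be chained before the results are read with get() or all().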

selectList
public Selectable selectList(Selector selector)
Description copied from interface: Selectable
Extracts by a custom selector.
- Specified by:
  selectList in interface Selectable
- Overrides:
  selectList in class AbstractSelectable
- Parameters:
  selector - the selector to apply
- Returns:
  the result

select
public Selectable select(Selector selector)
Description copied from interface: Selectable
Extracts by a custom selector.
- Specified by:
  select in interface Selectable
- Overrides:
  select in class AbstractSelectable
- Parameters:
  selector - the selector to apply
- Returns:
  the result

selectElements
protected Selectable selectElements(BaseElementSelector elementSelector)
Selects elements.
- Parameters:
  elementSelector - the element selector to apply
- Returns:
  the result

$
public Selectable $(java.lang.String selector)
Description copied from interface: Selectable
Selects a list of results with a CSS selector.
- Parameters:
  selector - the CSS selector expression
- Returns:
  a new Selectable holding the extracted results

$
public Selectable $(java.lang.String selector, java.lang.String attrName)
Description copied from interface: Selectable
Selects a list of results with a CSS selector.
- Parameters:
  selector - the CSS selector expression
  attrName - the name of the attribute to extract from the matched elements
- Returns:
  a new Selectable holding the extracted results
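
As an illustration of the two-argument form, the sketch below reads one attribute from the first element matched by a CSS selector. It assumes WebMagic is on the classpath; the class name CssAttributeSketch, the helper firstTitle, and the sample markup are hypothetical.

```java
import us.codecraft.webmagic.selector.Html;

public class CssAttributeSketch {

    // Hypothetical helper: returns the title attribute of the first <a class="item">.
    static String firstTitle(String markup) {
        Html html = new Html(markup);
        // The second argument names the attribute to extract from each matched element;
        // get() (inherited) returns the first extracted value.
        return html.$("a.item", "title").get();
    }

    public static void main(String[] args) {
        // Sample markup, made up for this sketch.
        String markup = "<ul><li><a class='item' href='/x' title='Example'>x</a></li></ul>";
        System.out.println(firstTitle(markup));
    }
}
```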

nodes
public java.util.List<Selectable> nodes()
Description copied from interface: Selectable
Gets all nodes.
- Returns:
  the result
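
One possible use of nodes() is to treat each matched element as its own Selectable and run a further selection inside it. The sketch below assumes WebMagic is on the classpath; the class name NodesSketch, the helper linkPerItem, and the sample markup are hypothetical.

```java
import java.util.ArrayList;
import java.util.List;

import us.codecraft.webmagic.selector.Html;
import us.codecraft.webmagic.selector.Selectable;

public class NodesSketch {

    // Hypothetical helper: returns the first link target found inside each <li>.
    static List<String> linkPerItem(String markup) {
        Html html = new Html(markup);
        List<String> hrefs = new ArrayList<>();
        // $("li") selects the list items; nodes() yields one Selectable per item.
        for (Selectable item : html.$("li").nodes()) {
            // The nested selection only sees the current item.
            hrefs.add(item.$("a", "href").get());
        }
        return hrefs;
    }

    public static void main(String[] args) {
        // Sample markup, made up for this sketch.
        String markup = "<ul><li><a href='/a'>A</a></li><li><a href='/b'>B</a></li></ul>";
        System.out.println(linkPerItem(markup));
    }
}
```

Iterating per node keeps values from the same item together, which a single flat selection over the whole document would not guarantee.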

getSourceTexts
protected java.util.List<java.lang.String> getSourceTexts()
- Specified by:
  getSourceTexts in class AbstractSelectable