Package us.codecraft.webmagic.model
Page model and annotations used to customize a crawler.
-
Interface Summary Interface Description AfterExtractor Interface to be implemented by page models that need to do something after fields are extracted.HasKey Interface to be implemented by page mode.
Can be used to identify a page model, or be used as name of file storing the object. -
Class Summary Class Description ConsolePageModelPipeline Print page model in console.
Usually used in test.HttpRequestBody HttpRequestBody.ContentType OOSpider<T> The spider for page model extractor.
In webmagic, we call a POJO containing extract result as "page model".PageMapper<T>