public class HtmlDomDocNewsletter extends Object
Modifier and Type | Class and Description |
---|---|
protected static class |
HtmlDomDocNewsletter.ElementUrl
provide a description for the HTML elements to be parsed
|
Modifier and Type | Field and Description |
---|---|
static String |
CONSTANT_PROTOCOL_DELIMITER |
static String |
CONSTANT_STATIC_URL |
static HtmlDomDocNewsletter.ElementUrl |
ELEMENT_A
To define a anchor, a element must have:
"a" tag name
The url is contained in the attributed named "href"
|
static HtmlDomDocNewsletter.ElementUrl |
ELEMENT_CSS
To define a CSS, html element must have:
"link" tag name
"rel" attribute equal to "stylesheet"
The url is contained in the attributed named "href"
|
static HtmlDomDocNewsletter.ElementUrl |
ELEMENT_FORM
To define a form, form element must have:
"form" tag name
The url is contained in the attributed named "action"
|
static HtmlDomDocNewsletter.ElementUrl |
ELEMENT_IMG
To define an image, html element must have:
"img" tag name
The url is contained in the attributed named "src"
|
static HtmlDomDocNewsletter.ElementUrl |
ELEMENT_JAVASCRIPT
To define a javascript, html element must have:
"script" tag name
"type" attribute equal to "text/javascript"
The url is contained in the attributed named "src"
|
Constructor and Description |
---|
HtmlDomDocNewsletter(String strHtml,
String strBaseUrl)
Instantiates an HtmlDocument after having built the DOM tree.
|
Modifier and Type | Method and Description |
---|---|
void |
convertAllRelativesUrls(HtmlDomDocNewsletter.ElementUrl elementType)
Get the relatives urls of all html elements specified by elementType and
convert its to absolutes urls
|
protected String |
getBaseUrl()
Get the base url
|
String |
getContent()
Get the document content
|
protected Document |
getDomDocument()
Get the document used by this instance
|
public static final String CONSTANT_STATIC_URL
public static final String CONSTANT_PROTOCOL_DELIMITER
public static final HtmlDomDocNewsletter.ElementUrl ELEMENT_CSS
public static final HtmlDomDocNewsletter.ElementUrl ELEMENT_JAVASCRIPT
public static final HtmlDomDocNewsletter.ElementUrl ELEMENT_IMG
public static final HtmlDomDocNewsletter.ElementUrl ELEMENT_A
public static final HtmlDomDocNewsletter.ElementUrl ELEMENT_FORM
public void convertAllRelativesUrls(HtmlDomDocNewsletter.ElementUrl elementType)
elementType
- the type of element to getpublic String getContent()
protected Document getDomDocument()
protected String getBaseUrl()
Copyright © 2019 City of Paris. All rights reserved.