
Class Webcrawler.Crawler.HTMLConstants

java.lang.Object
   |
   +----Webcrawler.Crawler.HTMLConstants

public class HTMLConstants
extends Object
Defines useful constants for parsing HTML files.


Variable Index

 o charentity
HTML replaces certain characters with a named code (character entity).
 o linkTags
Tags such as A HREF and FRAME SRC
 o loadableTags
Tags such as BODY BACKGROUND and IMG SRC
 o titleElement
Miscellaneous Elements and tags
 o whiteSpaces
Contains all characters which have to be ignored by the HTMLParser

Constructor Index

 o HTMLConstants()

Method Index

 o <clinit>()
 o getAttributeForLinkElement(String)
 o getAttributeForLoadableElement(String)
 o getCharEntity(char)

Variables

 o loadableTags
 public static HTMLConstants.ConstantTag loadableTags[]
Tags such as BODY BACKGROUND and IMG SRC

 o linkTags
 public static HTMLConstants.ConstantTag linkTags[]
Tags such as A HREF and FRAME SRC

 o titleElement
 public static final String titleElement
Miscellaneous Elements and tags

 o charentity
 protected static String charentity[]
HTML replaces certain characters with a named code (character entity). For example, the German Ä is written as &Auml; and has Unicode code point 196. To find the entity name, convert the character to int (196) and look it up at charentity[196-160], which holds Auml.
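
The index arithmetic described above can be sketched as follows. This is a minimal illustration, not the class's actual source: the class name `CharEntitySketch`, the method `lookup`, and the `null` result for characters outside the table are assumptions, and the table contents are assumed to be the standard ISO 8859-1 entity names for code points 160 to 255.

```java
// Hypothetical stand-in for HTMLConstants; only the "code point minus 160"
// indexing scheme is taken from the documentation above.
class CharEntitySketch {
    // Code point of the first table entry (the non-breaking space).
    static final int FIRST = 160;

    // Assumed contents: ISO 8859-1 entity names for code points 160..255, in order.
    static final String[] CHARENTITY = {
        "nbsp", "iexcl", "cent", "pound", "curren", "yen", "brvbar", "sect",
        "uml", "copy", "ordf", "laquo", "not", "shy", "reg", "macr",
        "deg", "plusmn", "sup2", "sup3", "acute", "micro", "para", "middot",
        "cedil", "sup1", "ordm", "raquo", "frac14", "frac12", "frac34", "iquest",
        "Agrave", "Aacute", "Acirc", "Atilde", "Auml", "Aring", "AElig", "Ccedil",
        "Egrave", "Eacute", "Ecirc", "Euml", "Igrave", "Iacute", "Icirc", "Iuml",
        "ETH", "Ntilde", "Ograve", "Oacute", "Ocirc", "Otilde", "Ouml", "times",
        "Oslash", "Ugrave", "Uacute", "Ucirc", "Uuml", "Yacute", "THORN", "szlig",
        "agrave", "aacute", "acirc", "atilde", "auml", "aring", "aelig", "ccedil",
        "egrave", "eacute", "ecirc", "euml", "igrave", "iacute", "icirc", "iuml",
        "eth", "ntilde", "ograve", "oacute", "ocirc", "otilde", "ouml", "divide",
        "oslash", "ugrave", "uacute", "ucirc", "uuml", "yacute", "thorn", "yuml"
    };

    // Returns the entity name for c, or null if c lies outside the table
    // (the null fallback is an assumption of this sketch).
    static String lookup(char c) {
        int index = c - FIRST;
        if (index < 0 || index >= CHARENTITY.length) {
            return null;
        }
        return CHARENTITY[index];
    }

    public static void main(String[] args) {
        // U+00C4 is Ä: 196 - 160 = 36, and entry 36 is "Auml".
        System.out.println(lookup('\u00C4'));
    }
}
```

A caller that wants the full entity reference would wrap the result as `"&" + name + ";"`.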

 o whiteSpaces
 public static String whiteSpaces
Contains all characters which have to be ignored by the HTMLParser
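
A parser might consult such a string as sketched below. The concrete value of whiteSpaces is not documented here, so the characters used (space, tab, carriage return, line feed) are an assumption, as are the names `WhiteSpaceSketch` and `skipWhiteSpaces`.

```java
// Hypothetical illustration of how a parser could use a whitespace string.
class WhiteSpaceSketch {
    // Assumed contents; the real value of HTMLConstants.whiteSpaces may differ.
    static final String WHITE_SPACES = " \t\r\n";

    // Returns the index of the first character at or after 'from' that is not
    // listed in WHITE_SPACES, or text.length() if only ignorable characters remain.
    static int skipWhiteSpaces(String text, int from) {
        int i = from;
        while (i < text.length() && WHITE_SPACES.indexOf(text.charAt(i)) >= 0) {
            i++;
        }
        return i;
    }

    public static void main(String[] args) {
        String html = " \n\t<A HREF=\"index.html\">";
        // The first three characters are skipped, so parsing resumes at '<'.
        System.out.println(html.charAt(skipWhiteSpaces(html, 0)));
    }
}
```

Keeping the ignorable characters in one string makes the membership test a single `indexOf` call, which is sufficient for a handful of characters.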

Constructors

 o HTMLConstants
 public HTMLConstants()

Methods

 o getAttributeForLoadableElement
 public String getAttributeForLoadableElement(String element)
Returns:
the attribute (e.g. BACKGROUND) for the given element (e.g. BODY), or null if no element matches.
 o getAttributeForLinkElement
 public String getAttributeForLinkElement(String element)
Returns:
the attribute (e.g. SRC) for the given element (e.g. FRAME), or null if no element matches.
 o <clinit>
 static void <clinit>()
 o getCharEntity
 public String getCharEntity(char c)
Returns:
the HTML entity name for the given character (e.g. auml for ä)
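
The two attribute lookups, getAttributeForLoadableElement and getAttributeForLinkElement, can be pictured as a linear search over a tag table. The sketch below is hypothetical: the fields of `ConstantTag`, the helper `attributeFor`, and the case-insensitive comparison are assumptions, and the tables contain only the element/attribute pairs named in this documentation.

```java
// Hypothetical stand-in for the tag tables in HTMLConstants.
class TagLookupSketch {
    // Assumed shape of HTMLConstants.ConstantTag: an element name paired with
    // the attribute that carries its URL.
    static final class ConstantTag {
        final String element;
        final String attribute;

        ConstantTag(String element, String attribute) {
            this.element = element;
            this.attribute = attribute;
        }
    }

    // Only the pairs mentioned in the documentation; the real tables may hold more.
    static final ConstantTag[] LOADABLE_TAGS = {
        new ConstantTag("BODY", "BACKGROUND"),
        new ConstantTag("IMG", "SRC")
    };

    static final ConstantTag[] LINK_TAGS = {
        new ConstantTag("A", "HREF"),
        new ConstantTag("FRAME", "SRC")
    };

    // Returns the attribute registered for the element, or null if no entry matches.
    // Comparing without regard to case is an assumption of this sketch.
    static String attributeFor(ConstantTag[] tags, String element) {
        if (element == null) {
            return null;
        }
        for (ConstantTag tag : tags) {
            if (tag.element.equalsIgnoreCase(element)) {
                return tag.attribute;
            }
        }
        return null;
    }

    public static void main(String[] args) {
        System.out.println(attributeFor(LOADABLE_TAGS, "BODY")); // BACKGROUND
        System.out.println(attributeFor(LINK_TAGS, "FRAME"));    // SRC
    }
}
```

Because the tables are small, a linear scan is adequate; a map keyed by element name would be an alternative if many more tags were added.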
