This guide is for users of the webseer HTML analysis library. If you are interested in webseer, you probably want to do one of the following things:
Surveying a website (or a set of URLs that have something in common) to gather site statistics
Performing classification tasks on URLs (topical classification, for instance), either to build classifiers for use in another system or to check the usefulness of a certain set of features
Some other task that involves analyzing HTML, for which you'd rather learn one aggregate library than a bunch of smaller ones
This guide is designed to show you how to use webseer to help with web page analysis.
To begin with, we'll explain some of the general concepts in webseer and what it does and doesn't do. In the second part, we'll explain how to use the command-line tools to quickly do some useful things out of the box. Then we'll go over how to use the APIs to plug webseer into your own Java program. Finally, we'll show you how to develop your own plugins to create your own features and models.