HtmlUnit is a "browser for Java programs". It models HTML documents and provides an API that allows you to invoke pages, fill out forms, click links, etc... just like you do in your "normal" browser.

It has fairly good JavaScript support (which is constantly improving) and is able to work even with quite complex AJAX libraries, simulating either Firefox or Internet Explorer depending on the configuration you want to use.

It is typically used for testing purposes or to retrieve information from web sites.

HtmlUnit is not a generic unit testing framework. It is specifically a way to simulate a browser for testing purposes and is intended to be used within another testing framework such as JUnit or TestNG . Refer to the document "Getting Started with HtmlUnit" for an introduction.

HtmlUnit is used as the underlying "browser" by the Open Source testing tools Canoo WebTest and JWebUnit .

HtmlUnit was originally written by Mike Bowler of Gargoyle Software and is released under an Apache-style license . Since then, it has received many contributions from other developers , and would not be where it is today without their assistance.

Features

  • Support for the HTTP and HTTPS protocols.
  • Support for cookies
  • Ability to specify whether failing responses from the server should throw exceptions or should be returned as pages of the appropriate type (based on content type)
  • Support for submit methods POST and GET (as well as HEAD, DELETE, ...)
  • Ability to customize the request headers being sent to the server
  • Support for HTML responses
    • Wrapper for HTML pages that provides easy access to all information contained inside them
    • Support for submitting forms
    • Support for clicking links
    • Support for walking the DOM model of the HTML document
  • Proxy server support
  • Support for basic and NTLM authentication
  • Good support for JavaScript (see the JavaScript section below)

Installation

Place all the required jars in your classpath. All of these can be found in the lib directory of the HtmlUnit installation.

JavaScript support

HtmlUnit provides good JavaScript support, simulating the behavior of the configured browser (Firefox or Internet Explorer). It uses the Rhino JavaScript engine for the core language (plus workarounds for some Rhino bugs) and provides the implementation for the objects specific to the execution in a browser.

The support is already really good and many complex JavaScript libraries are well supported (the unit tests of some well known AJAX libraries are included in HtmlUnit's own unit tests). JavaScript support is the area where currently most of HtmlUnit's development activity occurs. If you encounter a problem it may be already have been fixed since the last release. Otherwise, don't hesitate to open an issue with an example of the problem.

To disable JavaScript processing for one WebClient, call WebClient.setJavaScriptEnabled(false).

JavaScript code is executed just like in normal browsers when the page loads or when an handler is triggered. Additionally HtmlUnit provides the ability to inject code into an existing page through HtmlPage.executeJavascript(String yourJsCode).

Refer to the changes document for details on what is being added.