Is there a way to extract the keywords and/or content from an HTML document? I've been experimenting with the java.net.URL class and its getContent method. However, that method returns an Object.

The question is: how do I retrieve the content from that Object? Ultimately, the content needs to end up as members of a Set, but simply knowing how to view the content of the returned Object would be a good start. Does anyone know how to do this? If so, what type is the content actually returned as (I'm assuming String...)?

Cheers in advance.
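
For anyone landing here later, below is a minimal sketch of one possible approach, not a definitive answer. It assumes that for an HTML page the Object returned by getContent is usually an InputStream (this depends on the content handler the JDK picks for the MIME type, so it is checked with instanceof rather than taken for granted). The class name ContentExtractor and the methods contentToString and extractKeywords are hypothetical names made up for illustration, and the regular expression only handles a meta keywords tag whose name attribute comes before content. A real HTML parser would be more robust.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.Charset;
import java.util.LinkedHashSet;
import java.util.Set;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper class; none of these names come from the JDK.
final class ContentExtractor {

    // Assumption: the tag looks like <meta name="keywords" content="a, b, c">
    // with the name attribute first. Other attribute orders are not matched.
    private static final Pattern META_KEYWORDS = Pattern.compile(
            "<meta\\s+name\\s*=\\s*[\"']keywords[\"']\\s+content\\s*=\\s*[\"']([^\"']*)[\"']",
            Pattern.CASE_INSENSITIVE);

    // Turns the Object handed back by URL.getContent() into text.
    // The caller supplies the charset; guessing it from headers is left out.
    static String contentToString(Object content, Charset charset) throws IOException {
        if (content instanceof String) {
            return (String) content;
        }
        if (!(content instanceof InputStream)) {
            throw new IllegalArgumentException(
                    "Unsupported content type: " + content.getClass().getName());
        }
        // Read the whole stream into memory, then decode it.
        try (InputStream in = (InputStream) content) {
            ByteArrayOutputStream buffer = new ByteArrayOutputStream();
            byte[] chunk = new byte[4096];
            int n;
            while ((n = in.read(chunk)) != -1) {
                buffer.write(chunk, 0, n);
            }
            return new String(buffer.toByteArray(), charset);
        }
    }

    // Collects the comma-separated meta keywords into a Set (duplicates dropped,
    // first-seen order kept). Returns an empty Set if no tag is found.
    static Set<String> extractKeywords(String html) {
        Set<String> keywords = new LinkedHashSet<>();
        Matcher m = META_KEYWORDS.matcher(html);
        if (m.find()) {
            for (String raw : m.group(1).split(",")) {
                String keyword = raw.trim();
                if (!keyword.isEmpty()) {
                    keywords.add(keyword);
                }
            }
        }
        return keywords;
    }
}
```

Usage would be along the lines of passing the result of url.getContent() and a charset such as StandardCharsets.UTF_8 to contentToString, then passing the resulting String to extractKeywords. Calling url.openStream() directly is an alternative that always gives an InputStream and avoids the instanceof check.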