Extracting Keywords and Content

DevX Home    Today's Headlines   Articles Archive   Tip Bank   Forums   

Results 1 to 2 of 2

Thread: Extracting Keywords and Content

  1. #1
    Join Date
    Mar 2004

    Extracting Keywords and Content

    Is there a way to extract the keywords and/or content from an HTML document? I've been playing with the URL class, and trying to use the getContent method. However, that method returns an Object.

    The question is: how do I retrieve the content from that object? Ultimately, the content needs to end up as members of a Set, but simply knowing how to view the content from the returned Object would be nice. Does anyone know how to do this? If so, what format are they returned in (I'm assuming String...)?

    Cheers in advance.

  2. #2
    Join Date
    Feb 2004
    do a search for posts here by me, containing word "URLConnection" or "URL" or "openStream"

    also, look at HTTPUnit
    The 6th edict:
    "A thing of reference thing can hold either a null thing or a thing to any thing whose thing is assignment compatible with the thing of the thing" - ArchAngel, www.dictionary.com et al.
    JAR tutorial GridBag tutorial Inherited Shapes Inheritance? String.split(); FTP?

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
HTML5 Development Center
Latest Articles
Questions? Contact us.
Web Development
Latest Tips
Open Source

   Development Centers

   -- Android Development Center
   -- Cloud Development Project Center
   -- HTML5 Development Center
   -- Windows Mobile Development Center