Extracting Keywords and Content


DevX Home    Today's Headlines   Articles Archive   Tip Bank   Forums   

Results 1 to 2 of 2

Thread: Extracting Keywords and Content

  1. #1
    Join Date
    Mar 2004
    Posts
    3

    Extracting Keywords and Content

    Is there a way to extract the keywords and/or content from an HTML document? I've been playing with the URL class, and trying to use the getContent method. However, that method returns an Object.

    The question is: how do I retrieve the content from that object? Ultimately, the content needs to end up as members of a Set, but simply knowing how to view the content from the returned Object would be nice. Does anyone know how to do this? If so, what format are they returned in (I'm assuming String...)?

    Cheers in advance.

  2. #2
    Join Date
    Feb 2004
    Posts
    808
    do a search for posts here by me, containing word "URLConnection" or "URL" or "openStream"

    also, look at HTTPUnit
    The 6th edict:
    "A thing of reference thing can hold either a null thing or a thing to any thing whose thing is assignment compatible with the thing of the thing" - ArchAngel, www.dictionary.com et al.
    JAR tutorial GridBag tutorial Inherited Shapes Inheritance? String.split(); FTP?

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •  
HTML5 Development Center
 
 
FAQ
Latest Articles
Java
.NET
XML
Database
Enterprise
Questions? Contact us.
C++
Web Development
Wireless
Latest Tips
Open Source


   Development Centers

   -- Android Development Center
   -- Cloud Development Project Center
   -- HTML5 Development Center
   -- Windows Mobile Development Center