Stripping stop words from a HTML document
I am currently doing some tests on the topic of Information Retrieval. However,
before I proceed to calculate the frequency of terms that occur in the HTML
document, I would need to first eliminate the stop words.
Does anyone have any idea how this can be implemented?
Your help will be greatly appreciated. Thanks!!
-- Android Development Center
-- Cloud Development Project Center
-- HTML5 Development Center
-- Windows Mobile Development Center