VC++ / MFC / C++

Results 1 to 1 of 1

Thread: Parsing HTML For Key Words

Thread Tools
- Show Printable Version
Display
- Linear Mode
- Switch to Hybrid Mode
- Switch to Threaded Mode

July 6th, 2011, 09:28 PM #1
Applellial

View Profile

View Forum Posts

Visit Homepage

Member
Join Date

Feb 2011

Posts

28
Parsing HTML For Key Words

cheers again guys for sorting my last thread out and sorry for posting so quick but by fixing it it showed im doing things wrong haha

im trying to parse a webpage of unknown structure to extract key terms using topia.termextract. Howeverim having trouble coping with the complexity of web pages and getting to the main textual content where ever it lies on the page.

Are there any ways of doing it effectivly. I tried reading the whole webpage line by line and scanning for terms, but html tags and spaces and what not just totally destroy that approach. Im stuck basiically,

any ideas to make a start guys?....or at least another start as my last attempt wasnt that good

cheers

clubwear

Reply With Quote

Quick Navigation Python Top

« Previous Thread | Next Thread »

Posting Permissions

You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
[VIDEO] code is On
HTML code is Off

Click Here to Expand Forum to Full Width

Featured

The Best Reasons to Target Windows 8

* Porting from Android to Windows 8: The Real Story
Do you have an Android application? How hard would it really be to port to Windows 8?
* Guide to Porting Android Applications to Windows 8
If you've already built for Android, learn what do you really need to know to port your application to Windows Phone 8.
* HTML5 Development Center
Our portal for articles, videos, and news on HTML5, CSS3, and JavaScript
* Windows App Gallery
See the Windows 8.x apps we've spotlighted or submit your own app to the gallery!

Terms and Conditions | About Us | Privacy Notice | Contact Us | Advertise | Sitemap| California - Do Not Sell My Info

Advertiser Disclosure: Some of the products that appear on this site are from companies from which TechnologyAdvice receives compensation. This compensation may impact how and where products appear on this site including, for example, the order in which they appear. TechnologyAdvice does not include all companies or all types of products available in the marketplace.

All times are GMT -5. The time now is 04:24 PM.

Copyright TechnologyAdvice