Customizable, context-based website content extraction software
- Summary
- This technology is a context-based website content extraction tool that removes superfluous clutter from a webpage and renders the essential content in a clean, easy-to-navigate format.
- Technology Benefits
- Can apply the same filter to multiple websites with similar content
- Highly customizable based on website’s genre and keywords
- Can be used to increase the font size for the visually impaired without increasing the size of the clutter
- Suitably reformats webpages for smart phones and tablets without input from website developer
- Customizable filters allow user to control webpage output
- Applicable to a number of markup languages including HTML, XML, and MathML
- Technology Application
- Content extraction and rendering for smart phones, tablets, and cell phones
- Ad-blocking software
- Rendering unique browser skins
- Speech rendering for the visually impaired
- Detailed Technology Description
- None
- *Abstract
- None
- *Inquiry
- Greg Maskel, Columbia Technology Ventures, Tel: (212) 854-8444, Email: TechTransfer@columbia.edu
- *IR
- M05-053
- *Principal Investigator
-
- *Publications
- Gupta, S., Kaiser, G., Neistadt, D., and P. Grimm. “DOM-based Content Extraction of HTML Documents” Proceedings of the 12th International Conference on the World Wide Web. Session: Adapting Content to Mobile Devices: 207-214 (2003)
- Gupta, S., Kaiser, G., Grimm, P., Chiang, M.F. and J. Starren. “Automating Content Extraction of HTML Documents” Conference of the World Wide Web, 8(2): 179-224 (2005)
- Gupta, S. and G. Kaiser. “Extracting Content from Accessible Web Pages” Proceedings of the 2005 International Cross-Disciplinary Workshop on Web Accessibility. Session: Engineering Client Systems: 26-30 (2005)
- Tech Ventures Reference: IR M05-053; Licensing Contact: Greg Maskel
- *Web Links
- USPTO: US 2007-0050708 A1
- Country/Region
- USA
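
The listing does not describe how the tool works internally, so the following is only a minimal illustrative sketch of the general idea in the Summary: walk a page's element structure, drop subtrees that a customizable filter marks as clutter, and keep the remaining text. It uses Python's standard `html.parser` module rather than a full DOM. The names `ContentExtractor`, `extract_content`, `CLUTTER_TAGS`, and `CLUTTER_KEYWORDS`, as well as the specific tags and keywords, are hypothetical and not taken from the actual software.

```python
from html.parser import HTMLParser

# Hypothetical filter settings (assumptions, not the tool's real filter format).
CLUTTER_TAGS = {"script", "style", "nav", "aside", "footer", "iframe", "form"}
CLUTTER_KEYWORDS = ("ad", "ads", "banner", "sponsor", "sidebar")
# Void elements never get a closing tag, so they are not tracked on the stack.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img", "input",
             "link", "meta", "source", "track", "wbr"}


class ContentExtractor(HTMLParser):
    """Collects text that lies outside any element flagged as clutter."""

    def __init__(self, clutter_tags=CLUTTER_TAGS, clutter_keywords=CLUTTER_KEYWORDS):
        super().__init__(convert_charrefs=True)
        self.clutter_tags = clutter_tags
        self.clutter_keywords = clutter_keywords
        self.stack = []       # (tag, is_clutter) for every open element
        self.skip_depth = 0   # number of open clutter elements
        self.chunks = []      # text pieces that survived the filter

    def _is_clutter(self, tag, attrs):
        if tag in self.clutter_tags:
            return True
        attr_map = dict(attrs)
        labels = f"{attr_map.get('class') or ''} {attr_map.get('id') or ''}".lower()
        # Compare whole tokens so that "header" does not match the keyword "ad".
        tokens = labels.replace("-", " ").replace("_", " ").split()
        return any(token in self.clutter_keywords for token in tokens)

    def handle_starttag(self, tag, attrs):
        if tag in VOID_TAGS:
            return
        clutter = self._is_clutter(tag, attrs)
        self.stack.append((tag, clutter))
        if clutter:
            self.skip_depth += 1

    def handle_endtag(self, tag):
        if not any(open_tag == tag for open_tag, _ in self.stack):
            return  # stray closing tag; ignore it
        # Pop until the matching open tag, closing any unclosed children too.
        while self.stack:
            open_tag, clutter = self.stack.pop()
            if clutter:
                self.skip_depth -= 1
            if open_tag == tag:
                break

    def handle_data(self, data):
        text = " ".join(data.split())
        if text and self.skip_depth == 0:
            self.chunks.append(text)


def extract_content(html, **filter_options):
    """Return the non-clutter text of an HTML string, one chunk per line."""
    parser = ContentExtractor(**filter_options)
    parser.feed(html)
    parser.close()
    return "\n".join(parser.chunks)


if __name__ == "__main__":
    sample = ('<body><nav><a href="/">Home</a></nav><h1>Headline</h1>'
              '<div class="sidebar ad">Sponsored link</div><p>Story text.</p></body>')
    print(extract_content(sample))
```

Because the filter is passed in as plain arguments, the same settings can be reused across several sites with similar markup, or swapped per site genre, for example `extract_content(page, clutter_keywords=("promo", "related"))`.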
