Last Updated: February 25, 2016
·
460
· ziyan

Spider: Content extraction through clustering

This project aims at learning site templates by clustering example pages to generate css selectors that would efficiently pinpoint the main content of the page.

http://ziyan.github.io/spider/