This dataset was downloaded from: http://www.cse.iitb.ac.in/soumen/doc/QCQ/ It was transformed into Stanbol's BDL (Benchmark Description Language) for evaluations by Iavor Jelev. The original dataset contains: - Web documents crawled for CSAW evaluation in SIGKDD 2009 paper. - Ground truth annotations on the above documents collected from volunteers. Project members for the original dataset (In approximate order of recency) Soumen Chakrabarti, Ganesh Ramakrishnan, Sasidhar Kasturi, Apoorv Sharma, Devshree Sane, Amit Singh, Sayali Kulkarni, Somnath Banerjee.