WSDM Workshop Program

9:00 - 12:15 Session A

9:00-9:10 Workshop overview by the organizers
9:10-10:00 Invited talk 1: Large-scale Structured Learning for Statistical Classification, Yiming Yang (CMU)
10:00-10:30 Marek Ciglan, Michal Laclavík and Alex Dorman, Reusing Knowledge Hidden in Wikipedia for Scalable Text Categorization
10:30-11:00 Coffee break
11:00-11:30  Cornelia Caragea, Jian Wu, Kyle Williams, Sujatha Das G., Madian Khabsa, Pradeep Teregowda and C. Lee Giles, Automatic Identification of Research Articles from Crawled Documents
11:30-12:00 Faizan Javed, Matt McNair, Ferosh Jacob and Meng Zhao, Towards a Job Title Classification System

12:00-14:00 Lunch break

14:00 - 17:30 Session B

14:00-14:50 Invited talk 2: Selected Machine Learning Reductions, Anna Choromanska (Columbia University)
14:50-15:20 Raphael Puget, Nicolas Baskiotis and Patrick Gallinari, Scalable Learnability Measure for Hierarchical Learning in Large Scale Multi-Class Classification
15:20-15:50 Klemens Muthmann and Alina Petrova, An Automatic Approach for Identifying Topical Near-Duplicate Relations between Questions from Social Media Q/A Sites
15:50-16:20 Coffee break
16:20-16:50 Guangyu Wu, Oisín Boydell and Pádraig Cunningham, High-Throughput, Web-Scale Data Stream Clustering
16:50-17:00 The LSHTC challenge series
17:00-17:10 The BIOASQ challenge
17:10 Closure