Schedule

The workshop will be held in the Gates Computer Science Building room 104.

Twitter hashtag: #wacsw

Friday (June 29)
4:00 pm Welcome and Introductions - Frank McCown (Harding University)
4:30 Memento and Source Code Repositories (Slides)
Harihar Shankar (LANL)
5:30 A Research Agenda for "Obsolete Data or Resources" (Slides)
Michael Nelson (Old Dominion University)
6:30 Dinner at Stanford - Student Research Presentations
 
Saturday (June 30)
6 - 8:15 am Breakfast at the Guest House
8:15 Shuttle leaves hotel for Gates
8:45 Building Full Text Indexes of Web Content using Open Source Tools (Slides)
Erik Hetzner (California Digital Library)
9:45 Issues in Preserving Scientific and Scholarly Data in Web Archiving
Laura Wynholds (UCLA)
10:30 Morning break
10:45 Whose Content is it Anyway? User Perspectives on Archiving Social Media
Cathy Marshall (Microsoft Research)
11:30 Panel: Legal Opportunities for Web Archiving (Slides)
Pamela Samuelson, Kathy Hashimoto, and David Hansen (Berkeley Digital Library Copyright Project)
12:30 pm Lunch at Stanford
1:30 ArcSpread: Familiar Concepts Towards Archive Analytics for Social Scientists
Andreas Paepcke (Stanford University)
2:30 Text-Entity-Time Analytics in a Temporal Coherent Web Archive (Slides)
Marc Spaniol (LAWA Project)
3:30 Afternoon break
4:00 Archiving Web Pages with Hadoop and Pig (Slides)
Aaron Binns (Internet Archive)
5:00 Beyond BigData: Challenges for Facebook's Data Infrastructure
Sameet Agarwal (Facebook)
7:00 Dinner at Buca di Beppo (Directions) - Workshop concludes