Navigation

HTRC UnCamp 2012 Final Agenda and Presentations

Original announcement

HathiTrust Research Center Wiki

Agenda

Monday September 10, 2012:  CyberInfrastructure Building, Indiana University (CIB), 2719 E 10th St., Bloomington, IN

Time

Room

Title

Presenter(s)

1:00 – 1:30

Main

Welcome

Brad Wheeler, VP for IT and CIO, Indiana University, HathiTrust Executive Committee
Beth Plale, HathiTrust Research Center Executive Chair

1:30 – 2:15

Main

Keynote:  HathiTrust: Putting Research in Context

John Wilkin, HathiTrust Executive Director

2:15 – 2:45

Main

Data and Collections Overview

Stacy Kowalczyk, HathiTrust Research Center  and Jeremy York, HathiTrust

2:45 – 3:00

Main

Architecture Overview

Robert H. McDonald, HathiTrust Research Center Executive Committee

3:00 – 3:30

Main

Break

 

3:30 – 4:30

Main

Demonstration of Capability (Application Screenshots)
i) Dickens in Depth,
ii) Analysis on 2.6M volumes using XSEDE,
iii) SEASR Analytics like topic modeling and entity extraction

Stacy Kowalczyk, Beth Plale, Loretta Auvil

4:30 – 5:00

Main

Post day discussion and identification of topics for next day

Beth Plale and Robert H. McDonald

5:30 – 7:00

Main

Reception:  HTRC Big Data Analytics
Images: 1 2 3 4 5 6 7 8

Loretta Auvil, HathiTrust Research Center Tech Team

 

Tuesday September 11, 2012:  CyberInfrastructure Building, Indiana University (CIB), 2719 E 10th St., Bloomington, IN

Time

Room

Title

Presenter(s)

8:30 – 9:00

Main

Digging Into Debating

Colin Allen, Indiana Univeristy, Cognitive Science, Director of InPhO Project

9:00 – 9:30

Main

Using HathiTrust Texts for Literary Research  

Ted Underwood is converting 500,000  eighteenth- and nineteenth- century volumes downloaded from HathiTrust into a  normalized collection that can be used for literary-historical research. He will briefly describe a workflow he and Mike Black have developed for normalizing the texts, and will encourage discussion about a range of other practical problems that confront researchers: assessing OCR quality, dividing texts into workparts, categorizing texts by genre, and identifying duplicate volumes, for instance.

Ted Underwood, University of Illinois at Urbana-Champaign, English, Associate Prof.

9:30 – 10:30 

Main

Topics I:  Data in Detail (repeated in Topics III)

Jeremy York, HathiTrust and J. Stephen Downie, HathiTrust Executive Committee

 

Flex Lab

Topics I:   API in Detail Part One 

Yiming Sun, HathiTrust Research Center Tech Team

10:30 – 11:00

Main

Break

 

11:00 – 12:00

Main

Topics II: Hands on: SEASR Analytics

Loretta Auvil, HathiTrust Research Center Tech Team

 

Flex Lab

Topics II: Hands on: API in detail Part Two - Developing algorithms with the API

 Yiming Sun, HathiTrust Research Center Tech Team

12:00 – 1:00

 

Lunch (topics report out)

 

1:00 – 2:00

Flex Lab

Topics III:  Hands on: Building Collections and Analyzing Data

Stacy Kowalczyk, HathiTrust Research Center Tech Team

 

Main

Topics III: Hands on: Data in Detail (repeated session)

Jeremy York, HathiTrust and J. Stephen Downie, HathiTrust Executive Committee

2:00 – 2:15

 

Break

 

2:15 – 3:00

Flex Lab

Topics IV: HTRC in the Classroom

 

 

Main

Topics IV: HTRC as Anchor for Digital Humanities Centers

 

 

CIB 325

Topics IV: OCR Correction (Group formed after UnCamp on OCR)

 

 

Main

Topics IV: Intertextual Analysis

 

3:00 – 4:00

Main

Town Hall Session, Next Steps

Robert McDonald, HathiTrust Research Center Executive Committee
J. Stephen Downie, HathiTrust Research Center Executive Committee
Tim Cole, HathiTrust Research Center Tech Team
Stacy Kowalczyk, HathiTrust Research Center Tech Team