Indiana University

D2I: Adaptable and Incremental Metadata Capture in e-Science

Wednesday, February 22, 2012 - 4:00pm - 5:00pm

Innovation Center, Room 105, 2719 E 10th St., Indiana University, Bloomington, IN

Scott Jensen, Post Doc Research Associate, Data to Insight Center, Indiana University

Abstract: Scientific communities are recognizing an increasing need to enable reuse of the deluge (or bonanza) of scientific data currently being generated. Detailed metadata, or “data about data”, is key to preserving the value of data, as well as enabling its sharing and reuse. Communities have developed detailed XML schemata to capture and communicate metadata describing scientific data. Historically, however, to the extent metadata has been captured at all, it was captured at the end of an experiment, when results are published and curated. This approach does not scale well with the increasing volume of data being generated, and much of the metadata needed to understand a data object is lost due to the ephemeral nature of metadata itself. To address these issues, we push metadata capture to the earliest stages of the scientific data lifecycle, when data objects are created and the scientist generating the data is its steward. However, scientists often see little benefit in documenting their data with metadata, in part due to a misalignment of incentives between those generating metadata and the future users who benefit from the reuse it enables. In this talk I will discuss how we are exploiting characteristics of scientific metadata schemata to enable the efficient, incremental, and automated capture of detailed metadata. This approach reduces the misalignment of incentives by reducing the burden on the scientist through automation, while also increasing the utility of the metadata to the original researcher by making it available during an experiment’s execution. It uses a generalized underlying architecture that can be applied across the schemata of different scientific communities and can “talk” the metadata schema of the community implementing the system.
Bio: Scott Jensen is currently a post-doctoral researcher in the Data to Insight Center at Indiana University. He received his Ph.D. from Indiana University, Bloomington in 2010. His research focuses on capturing the metadata and provenance needed to enable the reuse of scientific data and the leveraging of data across scientific disciplines. His research interests also include web services, XML, XML-relational data storage, data search, and the semantic web. Scott earned an M.S. in Computer Science at DePaul University in 1996, and a B.S. and MAcc from Southern Illinois University in 1984 and 1986, respectively.

Light refreshments will be served.  Sponsored by the Data to Insight Center.
