Building a Multi-Year Database of AAG Conference Abstracts

19
Building a Multi-Year Database of AAG Conference Abstracts André Skupin / Shujing Shu Dept. of Geography / Dept. of Computer Science University of New Orleans

description

Building a Multi-Year Database of AAG Conference Abstracts. André Skupin /Shujing Shu Dept. of Geography / Dept. of Computer Science University of New Orleans. Knowledge Domain Visualization. Why? understand knowledge domains in terms of: Past  how did this research area develop? - PowerPoint PPT Presentation

Transcript of Building a Multi-Year Database of AAG Conference Abstracts

Page 1: Building a Multi-Year Database of AAG Conference Abstracts

Building a Multi-Year Database of AAG Conference Abstracts

André Skupin / Shujing ShuDept. of Geography / Dept. of Computer Science

University of New Orleans

Page 2: Building a Multi-Year Database of AAG Conference Abstracts

Knowledge Domain Visualization

• Why?– understand knowledge domains in terms of:

• Past how did this research area develop?• Present who are the leading researchers and topics?• Future what will have high priority for federal funding?

• Who are the users?– non-specialists:

• teaching tool– specialists:

• find trends, emerging topics, potential collaborators– b/w specialists from different domains:

• enable communication about knowledge domains– funding agencies

• research impact analysis

Page 3: Building a Multi-Year Database of AAG Conference Abstracts

Knowledge Domain Visualization

Example: domain experts

Page 4: Building a Multi-Year Database of AAG Conference Abstracts

Knowledge Domain Visualization (KDV)

Page 5: Building a Multi-Year Database of AAG Conference Abstracts

Knowledge Domain Visualization

• Geographic/Cartographic/GIScience Perspectives on:– Metaphors

• map, landscape

– Technology• GIS

– Methods• objects vs. fields• visualizing change

– Challenges• computation• cognition

Page 6: Building a Multi-Year Database of AAG Conference Abstracts

Typical KDV versus KDV based on AAG

Typical Knowledge Domain

Geographic Knowledge Domain via AAG Meeting

Topic serious serious to whimsical

Peer review peer-reviewed not peer-reviewed

Project Phase final planning to final

Career Phase early to late pre- to mid/late-career

Abstract-to-Content Prox.

close medium to far

Co-Authorship

moderate to extensive

limited

Citations citations no citations

KDV methods structure-based content-based

Page 7: Building a Multi-Year Database of AAG Conference Abstracts

AAG abstracts 1993 – 2002from unstructured to structured data

• Issues:– parse documents

• begin/end

– parse document components• metadata

– author name, address, email

• content– title, abstract text, keywords

– variations in format between years– variations in format within years– author disambiguation

Page 8: Building a Multi-Year Database of AAG Conference Abstracts

AAG abstracts 1993 – 2002from unstructured to structured data

Page 9: Building a Multi-Year Database of AAG Conference Abstracts

AAG abstracts 1993 – 2002n abstracts by year

0 500 1000 1500 2000 2500 3000 3500

[Atlanta - 1993]

San Francisco - 1994

Chicago - 1995

Charlotte - 1996

Fort Worth - 1997

Boston - 1998

Honolulu - 1999

Pittsburgh - 2000

New York - 2001

Los Angeles - 2002

Page 10: Building a Multi-Year Database of AAG Conference Abstracts

AAG abstracts 1993 – 2002interesting facts

• abstracts: 22,089• authors: 30,589• unique authors: ~ 13,521

– disambiguation of author names difficult• same authors counted more than once

– “Andre Skupin” versus “André Skupin”

• multiple authors with same name– “Michael Smith” versus “Michael Smith”

• max authors for one abstract: 17• Knight et al. (1999) A framework for integrated regional assessment

of global climate change. (Penn State)

• max abstracts for one author: 29• Kam-biu Liu, Louisiana State University

Page 11: Building a Multi-Year Database of AAG Conference Abstracts

AAG Meeting 1992 – 2003: Term Dominance Landscape

Page 12: Building a Multi-Year Database of AAG Conference Abstracts

AAG Meeting 1992 – 2003: Term Dom. Landscape + Neuron Label Clusters

Page 13: Building a Multi-Year Database of AAG Conference Abstracts

AAG Meeting 1992 – 2003: Term Dom. Landscape + Neuron Label Clusters

Page 14: Building a Multi-Year Database of AAG Conference Abstracts

NAS Workshop on Uncertainty Visualization: 3 sessions w/ 3-4 speakers + 1 discussant

Page 15: Building a Multi-Year Database of AAG Conference Abstracts

2005 AAG Meeting: Sessions on “Mapping Humanity’s Knowledge in the Digital Domain”

Page 16: Building a Multi-Year Database of AAG Conference Abstracts

Author Change Surface – Michael Goodchild1970 - 1979 1975 - 1984

[’75-’84] minus [’70-’79]

Page 17: Building a Multi-Year Database of AAG Conference Abstracts

Author Change Surface – Michael Goodchild1980 - 1989 1985 - 1994

[’85-’94] minus [’80-’89]

Page 18: Building a Multi-Year Database of AAG Conference Abstracts

AAG Abstract Database – Future Work

– Tool development and deployment• AbstractMap System

– AbstractMap Manager

– AbstractMap Viewer

• scalability to large data sets?

– Tool application• time / geographic space / attribute space

– cluster emergence/growth/abandonment

– disciplinary evolution

– institutional evolution

– author trajectories

– links between conference site and topics

• integrate across disciplines– map of science

Page 19: Building a Multi-Year Database of AAG Conference Abstracts

THE END