Who’s who?

AAHEP6 1

Who’s who?

Author identification in INSPIRE-Heath O’Connell, Fermilab

November 2012

AAHEP6 2

What’s the problem?

• Author search is the most popular search• Names are not unique– Denis Bernard (theory), – Denis Bernard (BABAR),– Bernard Denis (accelerators)– David Nathan Brown and David Norvil Brown

(both BABAR)• 2,800+ authors on ATLAS

November 2012

AAHEP6 3

How do we deal with this?• HEPNAMES database to collect information on

scientists– Establish identity of author as a person– 99,000 records managed by 1 FTE– 34,000 INSPIRE ID numbers assigned.

• Record checked for duplicates, etc.

• Bib Author Identify (BAI): computer algorithm to identify author profile based on publication info such as affiliation and co-authors– Establish BAI profile, may or may not correspond to a

unique person

November 2012

AAHEP6 4

INSPIRE ID vs. ORCID ID

• INSPIRE ID gives us immediate control– New ATLAS member can be assigned an ID that day by us,

do not have to wait for person– HEPNames record curated for that person

• IDs are all “one-to-one” and an association can be made at a later date (ask users?)

• Mark Doyle:– ORCID-0000-0001-5919-8670 | INSPIRE-00077990

• Start promoting ORCID with button to ORCID in our system

November 2012

AAHEP6 5

Adding authors and affiliations to HEP records

• 1-10 authors– Add by hand using an auto-suggest script which guesses

the affiliation based on older records.• More than 10 authors (typically experimental)– Did they use an authors.xml file?

• Yes: extract authors and affs cleanly in a few seconds.• No: use script that extracts authors and affiliations from TeX

file and matches their ID number based on name and experiment.

• e.g. “d. denisov” + “FNAL-E-0823” = INSPIRE-00076696• Affiliations matched with INSTITUTIONS database

November 2012

AAHEP6 6

Authors.xml file

November 2012

Authors.xml file was proposed by INSPIRE and developed in partnership with arXiv.org and publishers such as the APS to enable collaborations to ensure all authors are properly specified.

AAHEP6 7

Helping the Smaller Collaborations

• 10-200 people– Big enough to be a problem– Small enough to have no system in place

• INSPIRE has created a system these collaborations can use to manage their authors and create author.xml and LaTeX files

November 2012

AAHEP6 8

Author management for collaborations

November 2012

AAHEP6 9

Let’s get automated

• Bib Author Identify (BAI): 12,000 lines of code that uses metadata to create likely author profiles to identify a person

• 6.7 M “signatures” on 1M papers in HEP • 270,000 author profiles created– cf. HEPNames: 100,000 records

• On average each profile has 25 papers

November 2012

AAHEP6 10November 2012

For people with very common names it naturally has some difficulties. These are cleaned by a combination of user and operator effort.

Algorithm will get smarter so A.J. Martin and A.D. Martin aren’t in same profile.

AAHEP6 11

How to reach users

• Use the HEPNames database to identify candidates for a mailout.

• Look for people who have verified their HEPNames record (know they respond).

• 10,000 emails have been sent out.

November 2012

AAHEP6 12

Author Publication Profile Page

November 2012

AAHEP6 13

Login page: arXiv or “guest”

November 2012

AAHEP6 14

Claim your papers, remove others

November 2012

AAHEP6 15

Claiming results versus totalPapers Signatures Author Profiles

Total in HEP 1,000,000 6,000,000 270,000

Claiming actions 151,000 350,000 (4,000,000) 5,000 (100,000)

November 2012

N.B. Very high number of signatures (4,000,000) on small number of papers (151,000). Probably an effect of newer papers being claimed, hence more signatures from big collaborations.

AAHEP6 16

Summary

• 98,000 records in HEPNAMES– 34,000 with INSPIRE ID (real, unique people)

• Will integrate ORCID and INSPIRE • Created author.xml format for collaborations

and system for them to manage authors• BAI algorithm created 270,000 author profiles• 10,000 solicitation emails 5,000 responses• 150,000 papers claimed (out of 1,000,000)

November 2012

Who’s who?

Documents

Transcript of Who’s who?

Who’s Who In Congress?

Who’s Who in CUSTOMER TEAMS

Who’s Who & services Buyers Guide

4 — WHO’S WHO? (A) WHO’S WHO? (A) — 5s3-eu-central-1.amazonaws.com/seriasmania/rand-content/uploads/... · 4 — WHO’S WHO? (A) WHO’S WHO? (A) ... BBC WORLDWIDE UK Louise

1983 Who’s who at Sinclair

Who’s Who at AXA XL

Who’s Who in the European Technology Platform on Logistics ...€¦ · Who’s Who in the European Technology Platform on Logistics ALICE Who’s Who in the ETP on Logistics Alice

WHO’s WHO 2020

Who’s Who Presentation

WHO’S WHO - hryvidhansabhahistory.comhryvidhansabhahistory.com/who/2009.pdf · Preface This publication of WHO’S WHO contains the biographical ... Union, Sirsa; Director, IFFCO;

Who’s Who In The Theatre

Who’s Who in Letterland (Shapes)

Who’s Who in the English Rev.?

Who’s Who Wednesday Categories - Classroom Guitar by ... Who Wednesday Complete.pdf · Who’s Who Wednesday Categories Royalty John Williams Julian Bream Christopher Parkening

Who’s Who in Fintech?

Who’s Who on Mount Olympus?

Who’s Who in Your School?

Who’s Who???

WHO’S WHO IN FRANCE e ÉDITION SÉLECTION2019 ANTOINE HÉBRARD Président ÉDITO ÉDITION SÉLECTION e WHO’S WHO IN FRANCE Chers amis, Le Who’s Who in France fête cette année

WHO’S WHO - OECD