Topic 5 - Patent Analytics for Empowering Business Decisions

46
WIPO Regional Workshop on Patent Analytics WIPO-IPO of the Philippines Manila, Philippines December 4, 2013 Cynthia Barcelon Yang Director Scientific Information & Patent Analysis Group Information & Analytics Sciences Patent Analytics for Empowering Business Decisions

Transcript of Topic 5 - Patent Analytics for Empowering Business Decisions

Page 1: Topic 5 - Patent Analytics for Empowering Business Decisions

WIPO Regional Workshop on Patent Analytics

WIPO-IPO of the Philippines

Manila, Philippines

December 4, 2013

Cynthia Barcelon Yang

Director

Scientific Information & Patent Analysis Group

Information & Analytics Sciences

Patent Analytics for Empowering Business Decisions

Page 2: Topic 5 - Patent Analytics for Empowering Business Decisions

Bristol-Myers Squibb at a glance Mission: To discover, develop and deliver innovative medicines that help patients prevail over serious diseases.

World-class science with global reach and experience

28,000 employees in >90 countries

$17.6 B Net Sales in 2012

~ 8,000 people in R&D worldwide (10 major sites)

$3.9 B R&D investments in 2012

125 year (1887-2012) History of Innovation

A leader in biopharmaceuticals

Benchmark BioPharma Company

Best Big Drug Company - Forbes (Dec. 26, 2011)

Page 3: Topic 5 - Patent Analytics for Empowering Business Decisions

Strong Track Record of Success

Schizophrenia, Depression

Cancer

Rheumatoid Arthritis

Cancer

Cancer

Hepatitis B

HIV / AIDS

Diabetes

HIV / AIDS

Diabetes

Cardiovascular Disease

Cancer

Transplant

2005 2007 2003 2004 2008 2009 2006 2010 2011 2012

Diabetes 14 new product approvals in past 10 years*

63 Compounds in development

3 *Forxiga is not approved in the U.S.

*

Page 4: Topic 5 - Patent Analytics for Empowering Business Decisions

Global Manufacturing

MEXICO

Tlalpan

PUERTO RICO

Humacao

Manati

CHINA

SASS-Shanghai

JAPAN

Aichi

FRANCE

Agen

UNITED STATES

Devens, MA

Mt. Vernon, IN

West Chester, OH

Syracuse, NY (R&D)

IRELAND

Swords

Cruiserath

Pharmaceutical (10)

Biological (3)

API Plant (Active Pharmaceutical Ingredient)

Finishing Plant

ITALY

Anagni

4

supplying

57 worldwide

markets

supporting

102 products in

portfolio

68 external

network

suppliers

>6,500 colleagues

worldwide

Page 5: Topic 5 - Patent Analytics for Empowering Business Decisions

String of Pearls Strategy

Data as of

July 2013

Complements

our Internal Pipeline

~20 alliances, partnerships and

acquisitions since 2007

Zymo- Genetics

Alder Adnexus

Medarex

Amira

Zymo- Genetics

Oncolys

Inhibitex

Medarex Amylin

Zymo- Genetics

Innate

Ono

Exelixis

Adnexus Medarex

AbbVie

KAI

Ambrx

Exelixis Teijin/ Nissan

Medarex

Metabolic Diseases

Neuroscience

Fibrotic Disease

Virology (HCV, HIV)

Immuno-science

Oncology

Cardiology

Kosan

Allergan

Santaris

5

• 40% of pipeline assets

• 50% of revenue

5

Page 6: Topic 5 - Patent Analytics for Empowering Business Decisions

The Importance of Intellectual Property (IP)

Patents are the

Lifeblood of the

Pharmaceutical

industry.

IP provides market

exclusivity and

hence the

incentive for

investing in R&D.

Competitiveness is

based on the ability to

provide high value-

added products and

services at a

competitive

price.

IP accounts for

74% of the

average purchase

price of

acquisitions

(Pricewaterhouse

Coopers, London).

Assays

6

Chemotypes

High-value pipeline

Targets

Patents

Assays

Page 7: Topic 5 - Patent Analytics for Empowering Business Decisions

Vital Role of Patent Analysis in Corporate R&D *

Enable innovation & strategic business decision-making by providing value-add analysis of the patent literature to support:

Scientific Research & Development

State-of-the-art patent landscape

IP Procurement & Protection

Patentability determinations

Freedom-to-operate assessments

Validity opinions

Business Development & Strategic Transactions

Competitive analyses

Due diligence – String of Pearls Strategy

7

Patent

Analyst Strategic

Transactions

R&D

Intellectual

Property

* Pharmaceutical Patent Analyst , March 2012, Vol. 1, No. 1, pp 5-7

http://www.future-science.com/doi/full/10.4155/ppa.12.1

Page 8: Topic 5 - Patent Analytics for Empowering Business Decisions

Increasing Trend in Patent Filings*

8

Page 9: Topic 5 - Patent Analytics for Empowering Business Decisions

Challenges

“how to “capture, explore and capitalize”

9

Patent Data

Challenges

Source: Bill Hayden

Praxeon, Inc.

“70-90% of information contained within

patents is never published anywhere else”.

- U.S. Office Tech Assessment & Forecast Report

Page 10: Topic 5 - Patent Analytics for Empowering Business Decisions

10

Emerging Technologies Opportunities

Set of tools to facilitate knowledge extraction from patents

Semantic „concept-based‟ vs keyword-based

Automated extraction/indexing & creation of virtual compounds from Markush structure claims in chemical patents

Tools for integrating multiple sources of data/information

Patent alerts from patenting authorities (eg, USPTO PAIR, EP Register) & commercial sources (eg, CAPLUS, Inpadoc)

See also Text Mining & Visualization Tools – Impressions of Emerging Capabilities, Yang et.al.,

World Patent Information, Vol. 30, (2008), pp 280-293.

Text Analytics TEMIS

Pipeline Pilot

Aureka

Page 11: Topic 5 - Patent Analytics for Empowering Business Decisions

11

Company Profiling* to identify:

Technology areas

Key researchers

Patenting trends

Potential for String-of-Pearl strategy implementation

Technology Assessment to identify:

Industry trends in kinase assay technology platforms

Competitive landscape of pharma organizations

Trends in therapeutic areas and kinase family groups

Business investment strategy direction & implementation

* Enhancing Patent Landscape Analysis with Visualization Output, Yang et. al. ,

World Patent Information, Vol. 32 (2010), pp. 203-220

Case Studies

Page 12: Topic 5 - Patent Analytics for Empowering Business Decisions

12

Case Study 1:Company Profiling of Kosan Biosciences

Main Objective:

Assess Kosan Biosciences patent assets and research activities for potential String of Pearls engagement

Questions:

Who are their top inventors? What are their research teams composed of?

What is their research focus in terms of Mechanism of Action?

What is their research focus in terms of Utility?

Page 13: Topic 5 - Patent Analytics for Empowering Business Decisions

13

Patent Analytics Tool: VantagePoint

Sources: Derwent World Patent Index Chemical Abstracts Services

Type of Search: Patent Assignee

Patents Retrieved : 123 patents

Case Study 1: Company Profiling of Kosan Biosciences

Page 14: Topic 5 - Patent Analytics for Empowering Business Decisions

14

Company Profiling Case Study: Kosan Biosciences

Who are Kosan’s top inventors? What are their research teams composed of?

Page 15: Topic 5 - Patent Analytics for Empowering Business Decisions

15

What research has Kosan focused on in terms of Mechanism of Action ?

1995 1996 1997 1998 1999 2000 2001 2002 2003 2004 2005 2006

0

2

4

6

8

10

12

14

None given.

HSP-90 Inhibitors

Cancer Cell Growth Inhibitors

Motilin Agonists

Peptide Inhibitors

Bacterial Growth Inhibitors

Tubulin Polymerization

Megalomicin Synthesis

Hydroxylase

Gene therapy.

GPCR agonist

Antibiotic

Page 16: Topic 5 - Patent Analytics for Empowering Business Decisions

16

What research has Kosan focused on in terms of Utility?

1995 1996 1997 1998 1999 2000 2001 2002 2003 2004 2005 2006

0

1

2

3

4

5

6

7

8

9

Polyketides

Hyperproliferative Disease

Cancer

Anti-Infective Agents

Gastric Motility Diseases

Recombinant DNA

Disease

For Treating Multiple Myeloma

For the Production of Synthetic Genes/Libraries

Chemical Deriv.

Epothilone Deriv.

Erythromycin Deriv.

Page 17: Topic 5 - Patent Analytics for Empowering Business Decisions

Research Landscape

17

Document Titles

Key Researchers Publication Year Trend

Clustering Concepts

4 Panel View

Page 18: Topic 5 - Patent Analytics for Empowering Business Decisions

18

Main Objectives:

Assess competitive landscape of kinase assay technology platforms for drug screening to guide future investment & strategy at BMS

Identify current assay technology trends used for different kinase groups/families in various therapeutic areas

Questions:

What are the kinase assay technology trends?

What assay technology platforms are being used by companies?

What are the trends for kinase groups/families?

What are the trends for different therapeutic areas?

What are the trends in therapeutic area vs. kinase groups/families?

Case Study 2: Competitive Kinase Assay Technology Platform Analysis

Page 19: Topic 5 - Patent Analytics for Empowering Business Decisions

19

Case Study 2: Kinase Technology Platform Analysis

Patent Analytics Tool: Linguamatics I2E

Sources: PatBase (Bibliographic data)

IBM computer-curated patents

( Full-text WO, EP, US patents)

Type of Search: Key Concepts

Patents Retrieved : > 7000 patents

Page 20: Topic 5 - Patent Analytics for Empowering Business Decisions

Patent Analytics Approach – Overall Process

20

Step 1:

Data Collection and

Optimization

Step 2:

Knowledge Extraction

Using Linguamatics I2E

Step 3:

Analysis and

Visualization

Create Patent Number List

Iteratively Revise Strategy for

Most Relevant Dataset

Queries

Index Full Text Patents with

Ontologies

Create Custom Macros

Query Patents for Satisfactory Results

Output – Excel

Data cleanup

Remove Duplicates

Client Review

Extract Technology Terms

Assay

Technology

Macro

Kinase

Macro

Patent

Assignee

Macro

Custom Macros

The Key to Data Extraction

Page 21: Topic 5 - Patent Analytics for Empowering Business Decisions

Step 1 : Data Collection & Optimization

21

Data Collection and

Optimization

Create Patent Number List

Iteratively Revise Strategy for

Most Relevant Dataset

Client Review

Extract Technology Terms

Page 22: Topic 5 - Patent Analytics for Empowering Business Decisions

Step 1: Data Collection & Optimization Clients Requirements & Search Strategies

•Collect client requirements

Data coverage - 2004 to 2011

Trends reviewed in 2-year blocks – 4 two-year blocks for this analysis

•Generate relevant patent dataset

PatBase - Search kinase AND inhibitor terms near each other

Limit to families with a US or EP or WO member

•Identify kinase assay technology terminologies

Search broad assay terms in Description section only

Get clients involved

Refine terms to get the most relevant retrieval

22

Page 23: Topic 5 - Patent Analytics for Empowering Business Decisions

Data Collection & Optimization Generating Patent Family Sets

23

# Search query (edited) Results

1 Title, Abstract, Claims=(kinas* w10 (inhibit* OR activat*

OR modulat* OR antagon*)) 12983

2 1 AND Country Code=(us OR ep OR wo) 11466

3 2 AND Description=(assay* OR bioassay* OR screen* OR

measur* OR detect*) 10751

4 Patent families in which 1st publication is in 2004 or 2005 1678

5 Patent families in which 1st publication is in 2006 or 2007 1822

6 Patent families in which 1st publication is in 2008 or 2009 1972

7 Patent families in which 1st publication is in 2010 or 2011 1597

Page 24: Topic 5 - Patent Analytics for Empowering Business Decisions

Data Collection & Optimization Getting Full Text XML from IBM to I2E

•Over 7000 PNs obtained from PatBase

US (79%)

PCT (20%)

Other (1%)

•Export 1 PN per patent family

Using PatBase family table format

•Retrieve patent full text xml from IBM internal Database

PatBase xml quality

Getting full text xml from IBM internal database

“Patent Handler” – In-house web-based utility that automatically pulls full-text xml from IBM internal database and index them into I2E server

24

Page 25: Topic 5 - Patent Analytics for Empowering Business Decisions

Step 2 : I2E Query Development

25

Knowledge Extraction

Using Linguamatics I2E

Tool

Queries

Index Full Text Patents with

Ontologies

Create Custom Macros

Query Patents for Satisfactory Results

Assay

Technology

Macro

Kinase

Macro

Patent

Assignee

Macro

Custom Macros

The Key to Data Extraction

Data Collection and

Optimization

Create Patent Number List

Iteratively Revise Strategy for

Most Relevant Dataset

Page 26: Topic 5 - Patent Analytics for Empowering Business Decisions

Actionable

Information

Linguamatics I2E - Interactive Information Extraction

Chemical Names EMR/

EHR Unstructured text

Decision Support

Structuring the

unstructured

information world

Information Extraction

Agile, Scalable, Real-time

Natural Language Processing -based text mining

Info extraction and knowledge synthesis

Patents News

feeds

Scientific

Literature

Internal

reports

Drug

labels Clinical

trials ...

Courtesy of: Linguamatics

Page 27: Topic 5 - Patent Analytics for Empowering Business Decisions

Designed Output Columns (partial) in

Excel report output

Query Development Designed and driven by our desired I2E Output

Linguamatics I2E Query Development

27

1672 patent documents for 2004 – 2005

1839 patent documents for 2006 – 2007

2002 patent documents for 2008 – 2009

1597 patent documents for 2010 – 2011

>7000 patents total

Indexed Patents

on I2E server

Page 28: Topic 5 - Patent Analytics for Empowering Business Decisions

Linguamatics I2E Query Development - Macros for synonyms

Kinase group macro

500 Kinases with over 10 K synonyms in 10 groups

Kinase groups for trends analysis

Technology cluster macro

Terms provided by clients

Multiple iterations to get the best possible results

Therapeutic area macro

5 therapeutic areas of interests

I2E Disease Ontology

Patent assignee (major pharma) macro

STN company thesaurus

28

Page 29: Topic 5 - Patent Analytics for Empowering Business Decisions

I2E Query Development - Technology Macro

29

Fluorescence-

Activity

Fluorescence-

Binding

Radioactive

ADP-Detection

Caliper

Technology Cluster

Names

Synonyms for

Caliper Technology

Page 30: Topic 5 - Patent Analytics for Empowering Business Decisions

Screenshot of I2E Query: Kinase Technology Terms

30

Kinase

Group 1

was

queried

in claims

Technology

terms were

queried in the

description

Terms in the

“radioactive”

technology cluster

were also optionally

searched within 3

sentences of other

term lists (on the right)

This query relates to Kinase

Group 1

The final multi query

includes 10 single queries

- one per kinase group

Page 31: Topic 5 - Patent Analytics for Empowering Business Decisions

Screenshot of I2E Query: Kinases by Therapeutic Area

31

Diseases were

queried in the

patent abstract

text

One therapeutic area =10 single queries

- one per kinase group

5 therapeutic areas of interest

= 50 queries for all therapeutic areas and all

kinase groups which are then combined

Kinase terms

were queried

in the patent

claims text

Page 32: Topic 5 - Patent Analytics for Empowering Business Decisions

Step 3: Analysis & Visualization of Results

32

Data Collection and

Optimization

Knowledge Extraction

Using Linguamatics I2E

Tool

Create Patent Number List

Iteratively Revise Strategy for

Most Relevant Dataset

Queries

Index Full Text Patents with

Ontologies

Create Custom Macros

Query Patents for Satisfactory Results

Analysis &

Visualization

Output –

Excel & Spotfire

Data cleanup

Remove Duplicates

Page 33: Topic 5 - Patent Analytics for Empowering Business Decisions

I2E Results: Table of “Assertions”

33

Technology Kinase Kinase Synonym

Highlighted

Kinase Hit Term

in patent full text

Patent Number

Kinase

Group Publication Year

Assignee

Title

Abstract

Page 34: Topic 5 - Patent Analytics for Empowering Business Decisions

I2E Results: Export to Excel

34

Columns added

using I2E

Output Editor

Column added

in Excel

Page 35: Topic 5 - Patent Analytics for Empowering Business Decisions

Analysis and Visualization - Deliverables

Fluorescence Activity

- Predominant technology and growing

Radioactive

- Being replaced (old?)

Caliper

- Not changing much

ADP

- Just starting to grow (new?)

35

Note: Percentages on Y-axis are calculated from Spotfire data

What are the kinase assay technology trends?

Page 36: Topic 5 - Patent Analytics for Empowering Business Decisions

Analysis and Visualization - Deliverables

36

Some clear

differences

between various

companies

What kinase assay technology platforms are being used by companies?

Page 37: Topic 5 - Patent Analytics for Empowering Business Decisions

Analysis and Visualization - Deliverables

Therapeutic Area

•Oncology is the major therapeutic area for kinase use and is increasing

•CV and Metabolics are decreasing

•Immunology and CNS did not significantly increase over time

Kinase Family

•No clear trend in kinase families

•Kinase Group 10 is the most important group and this is consistent with its role in cellular proliferation that is critical for Oncology and Immunology indications

37

What are the trends for kinase groups/families ?

What are the trends for different therapeutic areas?

Page 38: Topic 5 - Patent Analytics for Empowering Business Decisions

Analysis and Visualization - Deliverables Kinase Group Trends in Immunology

38

What are the trends in kinase groups/families for each therapeutic area?

Page 39: Topic 5 - Patent Analytics for Empowering Business Decisions

Case Study 2 – Data Summary

> 7000 Full Text Patents; estimate half million pages of full text

510 Kinases with >10000 kinase synonyms

>110 technology terms (including trademarks)

5 Therapeutic areas (CV, CNS, Met, Oncology, and Immunology)

3 Custom-built macros

60 Single I2E queries and 2 I2E multiqueries

12000 Rows of data in Excel (after cleaning up)

~ 2 GB interactive data in Excel with HTML source data plus Spotfire visualization packages

39

Page 40: Topic 5 - Patent Analytics for Empowering Business Decisions

Business Value & Impact

•Business Value

• Manual if possible – estimate 1 hour/patent

• 7000 hours or 875 days or 3.5 years for a FTE

• I2E Efficiency gain: 90%

•Business Impact – Client Feedback

• Analyzed results were relevant to the questions asked.

• Analysis provided key competitive assay technology platforms

• Allowed implementation of investment strategy 6 months ahead of schedule

40

Page 41: Topic 5 - Patent Analytics for Empowering Business Decisions

Success Factors

Collaboration with clients

Multiple iteration in refining searching strategy, technology terms

Tool - Strengths

Powerful term extraction with ontology in various regions (fields)

Use of macro to extract client defined data

Highly interactive Excel output for easy analysis and drill-down

Massive data extraction with multi-queries

Use of Spotfire for data visualization

Repetitive description for similar information was removed

41

Page 42: Topic 5 - Patent Analytics for Empowering Business Decisions

Outcomes

Provided key actionable insights that empowered business decisions.

Encouraged closer collaborations with R&D teams

Recognized with BMS 3-I Award (Innovate,

Integrate, Improve) Sept. 26, 2013 “Natural Language Processing to Mine Unstructured Text to Support R&D”

42

Page 43: Topic 5 - Patent Analytics for Empowering Business Decisions

43

Patent Analyst‟s Role

Select appropriate database sources and types of analysis and visualization tools that are most appropriate to the query and dataset based on:

Scientific expertise

Business knowledge

Understanding of clients‟ needs

Knowledge of tools and databases

Collaborate interactively with users to refine the query & analysis criteria and output

Guide users in navigating the dynamic reports to realize the full value of the report

Page 44: Topic 5 - Patent Analytics for Empowering Business Decisions

44

Subject domain expertise

Database & business knowledge

Searching-Analytics Skillset

Standard & Optimized

Workflow integration

Innovative Tools

IT Support

Achieving Patent Analytics Service Excellence

Best Practices & Standardized/Optimized Processes Make the Difference

Collaborative and Continuous Learning Culture Collaborative & Continuous Learning Culture

Business Value = Integration of technology, process & people

Courtesy of: K. Fickenscher, AMIA CEO

Page 45: Topic 5 - Patent Analytics for Empowering Business Decisions

Acknowledgements

BMS Patent Analysis Group Information & Analytics Sciences (IAS) Research IT & Automation (RITA) Lead Evaluation & Mechanistic Biochemistry (LEMB)

Linguamatics – I2E

Search Technology, Inc. – VantagePoint

Chemical Abstracts Service – STN Anavist

Tibco Software, Inc. - Spotfire

Minesoft, Inc. - PatBase

IBM Text Analytics Consortium

45

Page 46: Topic 5 - Patent Analytics for Empowering Business Decisions

46

Thank you!

Maraming Salamat Po!

Gracias!

Xie Xie!

Contact: Cynthia Barcelon Yang

Director, Scientific Information & Patent Analysis Group

Bristol-Myers Squibb

Princeton, New Jersey USA

Phone: 609-818-5515

Email: [email protected]