internet.ppt

Computers and Internet in Bioinformatics, Dr Tan Tin Wee, Director, Bioinformatics Centre

Transcript of internet.ppt

Page 1: internet.ppt

Computers and Internet in Bioinformatics

Dr Tan Tin Wee

Director

Bioinformatics Centre

Page 2: internet.ppt

Internet and Bioinformatics

• Computing Technology in Biology - biocomputing

• Molecular biology was one of the first fields to use the latest Internet technologies, such as mailing lists, newsgroups, WAIS, Gopher and the World Wide Web

• The Internet boom occurred at the same time as the Genome Project data explosion

• Close synergies between the two

Page 3: internet.ppt

Scope of Introductory Bioinformatics

• Database Searching

• Sequence Alignment

• Gene finding

• Functional Genomics

• Protein Classification

• Phylogenetic inference

Computing Technology

Internet

Page 4: internet.ppt

What is a Computer?

Hardware Peripherals

Printer, Speaker, Scanner, Disk Drive

Operating System

Software Applications

User Interface

CPU, Memory, Harddisk, I/O interface

Win95, Win98, Unix, VMS

E.g. Microsoft Office, PowerPoint, Eudora, Excel

Page 5: internet.ppt

What is a Computer Program

• A set of instructions that tells the computer what to do

• Machine language, e.g. 010011010010

• Assembly language, e.g. MOV AX,2 - commands for programming chips, e.g. Intel, SPARC, Digital Alpha, Z80, Motorola 6800

• Higher-level programming languages

Interpreted - BASIC, PERL

ByteCode - Java

Compilable - C, C++, COBOL, PASCAL etc
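The levels on this slide can be illustrated with a minimal sketch. Python is used here only as a convenient example language (it is not named on the slide), and the function name `add_two` is hypothetical. The sketch shows that a machine-language bit pattern is just a number, and that one high-level statement is translated into several lower-level instructions. The instructions listed are Python interpreter bytecode, not real machine code, and their exact names vary between Python versions.

```python
import dis

# A machine-language word is only a pattern of bits; read as base 2 it is a number.
machine_word = int("010011010010", 2)

def add_two(x):
    # One high-level statement ...
    return x + 2

# ... is translated into several lower-level bytecode instructions.
opnames = [instruction.opname for instruction in dis.get_instructions(add_two)]

print(machine_word)
print(opnames)
print(add_two(40))
```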

Page 6: internet.ppt

Programming Language

• 1GL - Machine

• 2GL - Assembly

• 3GL - Structured Programming - Fortran, Pascal, C, C++ (Object Oriented), PERL, BASIC, etc

• 4GL - Declarative Programming (functional and logic) - LISP, Standard ML, Prolog

Page 7: internet.ppt

Program Development Environment

• Visual Basic (BASIC)

• Visual C++ (C and C++ programming)

• Visual J++ (Java)

• Delphi (Pascal)

• These environments help software developers write programs faster.

Page 8: internet.ppt

Example

• Microsoft Word

• Developers use a variety of environments to write software for the Windows operating system

• They compile the code

• The end result is an executable (.exe) file that starts the application when you double-click it

• The application lets you compose a document and save it to hard disk or floppy disk

Page 9: internet.ppt

What is the Internet?

• A world wide collection of networks of computers

• A network of computer networks

• A network based on the TCP/IP protocol

Page 10: internet.ppt

Standalone Computer

A typical setup at home: PC, Speakers, Printer

Page 11: internet.ppt

LAN

A Small Local Area Network of two computers and one printer in your office

Page 12: internet.ppt

InterDepartmental Network

Page 13: internet.ppt

Campus Wide Network

Page 14: internet.ppt

Campus Network

Wide Area Network

National Network

InterCountry Network

Global Network

The INTERNET

Page 15: internet.ppt

How do you connect to Internet?

Modem -> Telephone Line -> Local Phone Company -> ISP / IAP -> ILC -> The INTERNET

ISP: Internet Service Provider

IAP: Internet Access Provider

ILC: International Leased Circuit

Page 16: internet.ppt

Office connection to Internet?

Office Local Area Network -> Router -> Leased Telephone Line -> Local Phone Company -> ISP / IAP -> ILC -> The INTERNET

ISP: Internet Service Provider

IAP: Internet Access Provider

ILC: International Leased Circuit

Page 17: internet.ppt

What can you do with the Internet?

INTERNET APPLICATIONS

• Electronic Mail (Email)

• Internet Talk/Chat (IRC)

• File Transfer (FTP)

• Remote Login (Telnet)

• Internet News (Usenet)

• Info retrieval (Gopher, World Wide Web)

• Virtual Reality (VRML)

• Audio/Video Conferencing (CU-SeeMe, Mbone)

• Internet Phone

Page 18: internet.ppt

Client and Server

Application

CPU/Harddisk

Same Machine

Client Application

Server software

Front end

Remote Back End

Separate Machines

Connected by a Network

Communicating by a Protocol
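The client and server arrangement on this slide can be sketched with two small functions. This is a toy illustration under stated assumptions: both ends run on one machine over the loopback address, the "protocol" (the server replies with HELLO plus whatever it received) is invented for the example, and the names `serve_once` and `ask_server` are hypothetical.

```python
import socket
import threading

def serve_once(server_sock):
    """Back end: accept one client, read its request, send a reply."""
    conn, _addr = server_sock.accept()
    with conn:
        request = conn.recv(1024).decode("utf-8")
        # Toy protocol (an assumption): reply with HELLO plus the request text.
        conn.sendall(("HELLO " + request).encode("utf-8"))

def ask_server(host, port, message):
    """Front end: connect over TCP, send a request, return the reply."""
    with socket.create_connection((host, port), timeout=5) as client:
        client.sendall(message.encode("utf-8"))
        return client.recv(1024).decode("utf-8")

# Port 0 asks the operating system for any free port.
server = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
server.bind(("127.0.0.1", 0))
server.listen(1)
port = server.getsockname()[1]

# Run the server in a background thread so the client can talk to it.
worker = threading.Thread(target=serve_once, args=(server,))
worker.start()
reply = ask_server("127.0.0.1", port, "tinwee")
worker.join()
server.close()

print(reply)
```

With separate machines, only the host passed to `ask_server` would change; the two sides still need to agree on the same protocol.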

Page 19: internet.ppt

Networks and Protocols

• Many networks - BITNET, SNA (for IBM) and, most famous, the de facto global information infrastructure - the INTERNET

• Many different protocols - most famous is TCP/IP - a set of protocols for transferring information packets through a network

• Transmission Control Protocol/Internet Protocol

Page 20: internet.ppt

Technical Notes

• IP number 137.132.9.61

• Userid tinwee

• Domain Name biomed.nus.sg

• Email address [email protected]

• URL http://biomed.nus.sg:80/welcome.html
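As a rough illustration of how these identifiers are structured, the sketch below takes the IP number and URL from this slide apart using Python's standard library. The variable names are only illustrative.

```python
import ipaddress
from urllib.parse import urlsplit

# Values taken from the slide.
ip = ipaddress.IPv4Address("137.132.9.61")
url = urlsplit("http://biomed.nus.sg:80/welcome.html")

# An IPv4 "IP number" is a 32-bit value written as four 8-bit decimal fields.
octets = [int(field) for field in str(ip).split(".")]
as_integer = int(ip)

print(octets)
print(as_integer)

# A URL names the protocol, the host (domain name), the port and the document path.
print(url.scheme, url.hostname, url.port, url.path)
```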

Page 21: internet.ppt

Internet Access in Singapore

For Dialup, you will need:

• An Internet account

• PC / Macintosh based computer

• Modem

• Phone line

• Communications Software

For NUS, you will need:

• Network card

• Built-in software, configured for the network

Page 22: internet.ppt

Internet Access in Singapore

Internet Providers

• Pacific Internet

• Cyberway

• Singnet

• Internet resellers

• Through Singapore ONE

• NUS, NTU and other educational institutions

Page 23: internet.ppt

Power of the Internet and Emergence of WWW

• Hypertext

• Ted Nelson’s Project Xanadu (1969)

• CDROMs and Hypermedia

• Distributed Hypertext

• Distributed Hypermedia

• Mosaic, Netscape, Internet Explorer

Page 24: internet.ppt

What is Hypertext?

• Non-Linear Text

• Links embedded in the text

• Jumps to other locations in the document/db

the quick brown fox jumps over the fence

Fence........................
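The idea of links embedded in text can be sketched with a tiny in-memory document store. Everything here is an assumption made for illustration: the `[[name]]` link syntax, the `documents` dictionary, and the function names `links_in` and `follow` are hypothetical and do not come from the slide.

```python
import re

# A toy document store. [[name]] marks a link to another entry (hypothetical syntax).
documents = {
    "fox": "the quick brown fox jumps over the [[fence]]",
    "fence": "Fence ........................",
}

def links_in(text):
    """Return the names of all links embedded in a piece of text."""
    return re.findall(r"\[\[(\w+)\]\]", text)

def follow(name, link_index=0):
    """Jump from one document to the target of one of its links."""
    target = links_in(documents[name])[link_index]
    return documents[target]

print(links_in(documents["fox"]))
print(follow("fox"))
```

Reading is non-linear because the reader chooses which link to follow rather than continuing to the next line.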

Page 25: internet.ppt

Hypermedia & CDROMs

• Ted Nelson’s visionary ideas in 1969

• Project Xanadu

• Combine Text with Graphics, Pictures, Audio, Video, Movie clips etc

• CDROMs

Page 26: internet.ppt

Distributed Hypertext

the quick brown fox leaps over the fence again.

Page 27: internet.ppt

Distributed Hypertext

the quick brown fox leaps over the fence again.

Client Netscape Web Browser application

World Wide Web of Information Servers

Page 28: internet.ppt

Distributed Hypertext/Hypermedia

Uniform Resource Locator:

http://www.sg/welcome.html

http://www.whitehouse.gov/welcome.au

http://biomed.nus.sg/logo.gif

URL = Internet Address + Document File Name

Tim Berners-Lee, CERN, Geneva
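The composition described on this slide, an Internet address plus a document file name, can be sketched as follows. The helper name `make_url` and its `scheme` parameter are hypothetical; the addresses and file names are the ones shown on the slide.

```python
from urllib.parse import urlunsplit

def make_url(internet_address, file_name, scheme="http"):
    """Combine an Internet address and a document file name into a URL."""
    path = "/" + file_name.lstrip("/")
    # urlunsplit takes (scheme, network location, path, query, fragment).
    return urlunsplit((scheme, internet_address, path, "", ""))

# Address and file-name pairs from the slide.
pairs = [
    ("www.sg", "welcome.html"),
    ("www.whitehouse.gov", "welcome.au"),
    ("biomed.nus.sg", "logo.gif"),
]
urls = [make_url(address, name) for address, name in pairs]

for url in urls:
    print(url)
```

The same scheme locates a text page, an audio file or an image, which is what lets distributed hypertext extend to hypermedia.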

Page 29: internet.ppt

Mosaic, Netscape, Internet Explorer WWW Browsers

Page 30: internet.ppt

Web and Documents

Netscape Web browser: form-filling front end

Apache Web server

Directory of files, etc.

Page 31: internet.ppt

Web and Databases

Netscape Web browser: form-filling front end

Apache Web server

Database Search Engine

Common Gateway Interface (CGI)

• Flat Files

• Relational Dbs

• Object Oriented Dbs
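The path from a web form through CGI to a database search can be sketched roughly as below. Under CGI, the web server passes the submitted form data to a program in the `QUERY_STRING` environment variable, and the program writes a header, a blank line and then the result. The flat-file records, the form field name `q`, and the function names are all made up for illustration; this is not the setup of any particular server or database.

```python
import os
from urllib.parse import parse_qs

# Hypothetical flat-file database: one "identifier<TAB>description" record per line.
FLAT_FILE = "P1\tprotein kinase\nP2\tDNA polymerase\nP3\tserine protease\n"

def search(records_text, term):
    """Return every record line that contains the search term (case-insensitive)."""
    term = term.lower()
    return [line for line in records_text.splitlines() if term in line.lower()]

def handle_request(query_string):
    """Turn a form submission such as 'q=kinase' into a CGI-style response."""
    params = parse_qs(query_string)
    term = params.get("q", [""])[0]
    hits = search(FLAT_FILE, term) if term else []
    body = "\n".join(hits) or "no matches"
    # A CGI response is a header block, a blank line, then the document body.
    return "Content-Type: text/plain\r\n\r\n" + body

# Under a real web server QUERY_STRING is set by the server; a default is used here.
print(handle_request(os.environ.get("QUERY_STRING", "q=protease")))
```

Replacing `search` with a query against a relational or object-oriented database would leave the browser and web server sides unchanged, which is the point of putting the gateway between them.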

Page 32: internet.ppt

Biological Databases

• DNA sequence databases

• Protein sequence databases

• Gene Map databases

• Motif databases

• Bibliographic databases

• Biochemical databases

• Enzyme databases, etc.

Page 33: internet.ppt

[Figure: timeline from the 1970s to the 2000s]

• Trends: Biocomputing; Genetic Revolution; Internet boom; Advanced networking; Bioinformatics Start-Ups in USA - market hype; Pharma’s Rush; ???

• Organisations and networks: NCBI, EBI, EMBnet/EMBL, ICGEBnet, IMCB, Australian Institutes, DDBJ, ANGIS, APBioNet, BIC, NABBINet, GenomeNet, PekingU, BC/HKBIC, India

Page 34: internet.ppt

Parallel Development

[Figure: two parallel timelines, 1994 to 2001]

• Tracks: Network Infrastructure Development; Bioinformatics Infrastructure Development

• Themes: Synergy; Push-Pull

• Labels: APNG, AI3, APAN, SINGAREN, TRANSPAC, APAN-APBioNet, NUS bionetwork, APEC Survey, BIC, APBioNet, EMBnet-APBioNet, ASTNET

Page 35: internet.ppt

Life Scientists

• Communication with each other through email, mailing lists, newsgroups and video conferencing

• Information when and where needed

• Rapid dissemination of information for global collaborations

• Access to software applications freely

• Access to computational resources freely

Page 36: internet.ppt

Conclusion

• Computer and Internet technologies have tremendous applications in the Life Sciences

• Tremendous impact on the growth and evolution of Bioinformatics