Syslog-NG and Centralized Logging - Open Science...

26
Syslog-NG and Centralized Logging Suchandra Thapa Computation Institute University of Chicago HPDC 2007 Monterey, California - June 25, 2007

Transcript of Syslog-NG and Centralized Logging - Open Science...

Page 1: Syslog-NG and Centralized Logging - Open Science …osg-docdb.opensciencegrid.org/cgi-bin/RetrieveFile?docid=...June 25, 2007 HPDC 2007 - Monterey, California More syslog-ng capabilities

Syslog-NG and Centralized Logging

Suchandra ThapaComputation InstituteUniversity of Chicago

HPDC 2007Monterey, California - June 25, 2007

Page 2: Syslog-NG and Centralized Logging - Open Science …osg-docdb.opensciencegrid.org/cgi-bin/RetrieveFile?docid=...June 25, 2007 HPDC 2007 - Monterey, California More syslog-ng capabilities

June 25, 2007 HPDC 2007 - Monterey, California

Three topics of interest

• Benefits of centralized logging• Syslog-ng• Common log formats

Page 3: Syslog-NG and Centralized Logging - Open Science …osg-docdb.opensciencegrid.org/cgi-bin/RetrieveFile?docid=...June 25, 2007 HPDC 2007 - Monterey, California More syslog-ng capabilities

June 25, 2007 HPDC 2007 - Monterey, California

Why centralized logging?

• Allows logs for a cluster to be checked on asingle system

• No need to log into multiple systems tofigure out what is happening

• Allows for more powerful data mining on logfiles (more on this later)

Page 4: Syslog-NG and Centralized Logging - Open Science …osg-docdb.opensciencegrid.org/cgi-bin/RetrieveFile?docid=...June 25, 2007 HPDC 2007 - Monterey, California More syslog-ng capabilities

June 25, 2007 HPDC 2007 - Monterey, California

Applications?

• What does centralizing system logs give you?• Why go through the trouble of implementing

and maintaining a centralized logging setup?

Page 5: Syslog-NG and Centralized Logging - Open Science …osg-docdb.opensciencegrid.org/cgi-bin/RetrieveFile?docid=...June 25, 2007 HPDC 2007 - Monterey, California More syslog-ng capabilities

June 25, 2007 HPDC 2007 - Monterey, California

Troubleshooting

• Log messages from several systems arecollated on a single system

• Related messages from different systems canbe correlated and queried on a single location

• Access to multiple machines are not needed,just access to the central logging host

• Can aggregate information into a database toallow for easy searching

Page 6: Syslog-NG and Centralized Logging - Open Science …osg-docdb.opensciencegrid.org/cgi-bin/RetrieveFile?docid=...June 25, 2007 HPDC 2007 - Monterey, California More syslog-ng capabilities

June 25, 2007 HPDC 2007 - Monterey, California

Troubleshooting before

• Identify log events on worker node, ce, orclient

• Check other machines (gatekeepers, computeelement, clients) in order to get other eventsthat might be relevant

• May need to involve other sysadmins andpeople in order to access log files

Page 7: Syslog-NG and Centralized Logging - Open Science …osg-docdb.opensciencegrid.org/cgi-bin/RetrieveFile?docid=...June 25, 2007 HPDC 2007 - Monterey, California More syslog-ng capabilities

June 25, 2007 HPDC 2007 - Monterey, California

Troubleshooting After

• Identify log events on compute element, gatekeeper,or client machine

• Search web interface to logging database for eventsthat might be related in time or by content

• Can potentially be done with just access to thelogging database (no need to involve multiplepeople to track down trivial problems)

Page 8: Syslog-NG and Centralized Logging - Open Science …osg-docdb.opensciencegrid.org/cgi-bin/RetrieveFile?docid=...June 25, 2007 HPDC 2007 - Monterey, California More syslog-ng capabilities

June 25, 2007 HPDC 2007 - Monterey, California

Security Applications

• Centralization of logs allows auditing to bedone more effectively

• Suspicious patterns can be more effectivelypicked up

• Postmortem assesments of breaches can bedone more easily

Page 9: Syslog-NG and Centralized Logging - Open Science …osg-docdb.opensciencegrid.org/cgi-bin/RetrieveFile?docid=...June 25, 2007 HPDC 2007 - Monterey, California More syslog-ng capabilities

June 25, 2007 HPDC 2007 - Monterey, California

Grid Site with Log Search Service

CE

SEGridapplication

VO Specapplication

Syslog-ngSyslog-ng

Log SearchService

OSG CentralFacility

AuditingService

AuditingServiceClient

Syslog-ng

Log SearchService

SE

VO Specapplication

Gridapplication

Syslog-ng

CEGrid Site without Log Search ServiceCatch-All Log Search

Service Host

Central Repository

SecurityOfficer

Site Central LogMonitoring Host

Auditing Project Architecture

VO Resource Site VO Services

Host

Gratiaprobes

Gratiaprobes

Gratiaprobes

VO Specapplication

Syslog-ng

site

cluster

host

application

log repository

auditing datarepository

flow of datarequest

flow of datastorage

Legend

T. LevshinaFermilab

Page 10: Syslog-NG and Centralized Logging - Open Science …osg-docdb.opensciencegrid.org/cgi-bin/RetrieveFile?docid=...June 25, 2007 HPDC 2007 - Monterey, California More syslog-ng capabilities

June 25, 2007 HPDC 2007 - Monterey, California

Why Syslog-ng?

• New system logging utility• Can replace regular syslog daemon or can be

used in parallel• More powerful facilities for filtering,

formatting, and redirecting log messages• Open source license -- can be redistributed

and modified if needed

Page 11: Syslog-NG and Centralized Logging - Open Science …osg-docdb.opensciencegrid.org/cgi-bin/RetrieveFile?docid=...June 25, 2007 HPDC 2007 - Monterey, California More syslog-ng capabilities

June 25, 2007 HPDC 2007 - Monterey, California

Syslog-ng logging facilities

• Can filter log messages based on log level,system host, facility, ip address or regularexpressions on the message

• Can reformat and modify messages usingtemplate facilities

• Inputs can be files or sockets• Outputs can be other hosts, files, or sockets

Page 12: Syslog-NG and Centralized Logging - Open Science …osg-docdb.opensciencegrid.org/cgi-bin/RetrieveFile?docid=...June 25, 2007 HPDC 2007 - Monterey, California More syslog-ng capabilities

June 25, 2007 HPDC 2007 - Monterey, California

More syslog-ng capabilities• Arbitary fan in and fan out for

forwarding logs• Messages can be sent to different

destinations based on originatinghost or other filter criteria(incoming connection details,message tags, regex, etc.)

• Connections to remote hosts canbe encrypted if needed usingstunnel

• Messages can be altered in flightallowing hostnames to be addedto ease in categorizing messages

Page 13: Syslog-NG and Centralized Logging - Open Science …osg-docdb.opensciencegrid.org/cgi-bin/RetrieveFile?docid=...June 25, 2007 HPDC 2007 - Monterey, California More syslog-ng capabilities

June 25, 2007 HPDC 2007 - Monterey, California

Syslog-ng for OSG

• Currently available in VDT 1.7.x releases• Will be available in VDT 1.8.0, the next stable

VDT release• Efforts are underway to incorporate it into

the OSG 0.8.0 release• Currently used in parallel with native syslog

Page 14: Syslog-NG and Centralized Logging - Open Science …osg-docdb.opensciencegrid.org/cgi-bin/RetrieveFile?docid=...June 25, 2007 HPDC 2007 - Monterey, California More syslog-ng capabilities

June 25, 2007 HPDC 2007 - Monterey, California

Current usage in OSG

• All log messages from compute elements inthe validation testbed (VTB) sent to centralserver

• Log messages redirected from this server to 2other servers (server hosting splunk at U of Cand CEDPS system at LBNL)

• Log messages also formatted and archived ina database

Page 15: Syslog-NG and Centralized Logging - Open Science …osg-docdb.opensciencegrid.org/cgi-bin/RetrieveFile?docid=...June 25, 2007 HPDC 2007 - Monterey, California More syslog-ng capabilities

June 25, 2007 HPDC 2007 - Monterey, California

DQ2 for ATLAS

• All messages get collected on central host• 6 of 8 us tier2 sites currently sending logs to

central host• Will eventually incorporate splunk and/or

php syslog-ng for log analysis and searchesover the web

Page 16: Syslog-NG and Centralized Logging - Open Science …osg-docdb.opensciencegrid.org/cgi-bin/RetrieveFile?docid=...June 25, 2007 HPDC 2007 - Monterey, California More syslog-ng capabilities

June 25, 2007 HPDC 2007 - Monterey, California

Php Syslog-ng Query Interface

Page 17: Syslog-NG and Centralized Logging - Open Science …osg-docdb.opensciencegrid.org/cgi-bin/RetrieveFile?docid=...June 25, 2007 HPDC 2007 - Monterey, California More syslog-ng capabilities

June 25, 2007 HPDC 2007 - Monterey, California

Php syslog-ng

• Php and mysql based addon to syslog-ng• Log messages are formatted and placed in

mysql database• PHP based web interface to allow messages

to be queried and displayed• Open source license so can be modified to

meet specific project needs

Page 18: Syslog-NG and Centralized Logging - Open Science …osg-docdb.opensciencegrid.org/cgi-bin/RetrieveFile?docid=...June 25, 2007 HPDC 2007 - Monterey, California More syslog-ng capabilities

June 25, 2007 HPDC 2007 - Monterey, California

Php syslog-ng interface

Page 19: Syslog-NG and Centralized Logging - Open Science …osg-docdb.opensciencegrid.org/cgi-bin/RetrieveFile?docid=...June 25, 2007 HPDC 2007 - Monterey, California More syslog-ng capabilities

June 25, 2007 HPDC 2007 - Monterey, California

Splunk

• Commercial softwre used to archive andquery log messages

• Web interface allows log messages to becategorized and correlated

• Messages can be queried and sorted based oncategorization and other parameters

• Used at fermilab as well for internal loggingcollection

Page 20: Syslog-NG and Centralized Logging - Open Science …osg-docdb.opensciencegrid.org/cgi-bin/RetrieveFile?docid=...June 25, 2007 HPDC 2007 - Monterey, California More syslog-ng capabilities

June 25, 2007 HPDC 2007 - Monterey, California

forwarded subscription.logfrom tier2’s

saved queries, eg:“failure” in last 10 mins

Splunk Interface

Page 21: Syslog-NG and Centralized Logging - Open Science …osg-docdb.opensciencegrid.org/cgi-bin/RetrieveFile?docid=...June 25, 2007 HPDC 2007 - Monterey, California More syslog-ng capabilities

June 25, 2007 HPDC 2007 - Monterey, California

Common Log Formats

• Why is this needed?

Page 22: Syslog-NG and Centralized Logging - Open Science …osg-docdb.opensciencegrid.org/cgi-bin/RetrieveFile?docid=...June 25, 2007 HPDC 2007 - Monterey, California More syslog-ng capabilities

June 25, 2007 HPDC 2007 - Monterey, California

Current situation

• Different applications format log messagesdifferently

• Difficult to extract information reliably• Need to use regular expressions to obtain

messages• Not scalable

Page 23: Syslog-NG and Centralized Logging - Open Science …osg-docdb.opensciencegrid.org/cgi-bin/RetrieveFile?docid=...June 25, 2007 HPDC 2007 - Monterey, California More syslog-ng capabilities

June 25, 2007 HPDC 2007 - Monterey, California

Work to rationalize log messages

• CEDPS logging best practices documentation– Defines standard layout– Defines location of information (e.g. timestamps, event

names) and formatting of information (e.g. timestamps inUTC with date followed by time, etc)

– Makes suggestions as to which events should be logged

• Currently working on syslog-ng approaches• Have scripts that transform messages from a globus

to common format

Page 24: Syslog-NG and Centralized Logging - Open Science …osg-docdb.opensciencegrid.org/cgi-bin/RetrieveFile?docid=...June 25, 2007 HPDC 2007 - Monterey, California More syslog-ng capabilities

June 25, 2007 HPDC 2007 - Monterey, California

Conclusion• Centralized logging provides benefits in managing

clusters• Troubleshooting and security analysis can be made

easier by having a central repository for log files• Even more benefits if log messages use a common

format• Implementation of centralized logging can done

relatively quickly and without disturbing existinglogging systems

• Questions?

Page 25: Syslog-NG and Centralized Logging - Open Science …osg-docdb.opensciencegrid.org/cgi-bin/RetrieveFile?docid=...June 25, 2007 HPDC 2007 - Monterey, California More syslog-ng capabilities

June 25, 2007 HPDC 2007 - Monterey, California

Links• Syslog-ng -

http://www.balabit.com/products/syslog-ng/• OSG -

https://twiki.grid.iu.edu/twiki/bin/view/Integration/SysLogNg

• CEDS Logging Best Practices -http://www.cedps.net/wiki/index.php/LoggingBestPractices

Page 26: Syslog-NG and Centralized Logging - Open Science …osg-docdb.opensciencegrid.org/cgi-bin/RetrieveFile?docid=...June 25, 2007 HPDC 2007 - Monterey, California More syslog-ng capabilities

June 25, 2007 HPDC 2007 - Monterey, California

Thank YouSpecial thanks to: Robert Gardner, Alain Roy,Brian Tierney