Web Crawlers - 情報セキュリティ株式会社 · •Web crawlers are known by a variety of...

12
Information Security Inc. Web Crawlers

Transcript of Web Crawlers - 情報セキュリティ株式会社 · •Web crawlers are known by a variety of...

Page 1: Web Crawlers - 情報セキュリティ株式会社 · •Web crawlers are known by a variety of names –industry jargon labels them spiders or bots but technically they are referred

Information Security Inc.

Web Crawlers

Page 2: Web Crawlers - 情報セキュリティ株式会社 · •Web crawlers are known by a variety of names –industry jargon labels them spiders or bots but technically they are referred

Information Security Confidential - Partner Use Only

Contents

2

• What are Web Crawlers?

• Ways to crawl a website

• References

Page 3: Web Crawlers - 情報セキュリティ株式会社 · •Web crawlers are known by a variety of names –industry jargon labels them spiders or bots but technically they are referred

Information Security Confidential - Partner Use Only

What are Web Crawlers?

3

• Web crawlers are known by a variety of names – industry jargon

labels them spiders or bots but technically they are referred to as

web crawlers

Page 4: Web Crawlers - 情報セキュリティ株式会社 · •Web crawlers are known by a variety of names –industry jargon labels them spiders or bots but technically they are referred

Information Security Confidential - Partner Use Only

Ways to crawl a website

4

• Metasploit

Page 5: Web Crawlers - 情報セキュリティ株式会社 · •Web crawlers are known by a variety of names –industry jargon labels them spiders or bots but technically they are referred

Information Security Confidential - Partner Use Only

Ways to crawl a website

5

• HTTrack

Page 6: Web Crawlers - 情報セキュリティ株式会社 · •Web crawlers are known by a variety of names –industry jargon labels them spiders or bots but technically they are referred

Information Security Confidential - Partner Use Only

Ways to crawl a website

6

• Black Widow

Page 7: Web Crawlers - 情報セキュリティ株式会社 · •Web crawlers are known by a variety of names –industry jargon labels them spiders or bots but technically they are referred

Information Security Confidential - Partner Use Only

Ways to crawl a website

7

• Burp Suite Spider

Page 8: Web Crawlers - 情報セキュリティ株式会社 · •Web crawlers are known by a variety of names –industry jargon labels them spiders or bots but technically they are referred

Information Security Confidential - Partner Use Only

Ways to crawl a website

8

• Scrapy framework

(https://doc.scrapy.org/en/master/intro/tutorial.html)

Page 9: Web Crawlers - 情報セキュリティ株式会社 · •Web crawlers are known by a variety of names –industry jargon labels them spiders or bots but technically they are referred

Information Security Confidential - Partner Use Only

Ways to crawl a website

9

• Scrapy framework

(https://doc.scrapy.org/en/master/intro/tutorial.html)

Page 10: Web Crawlers - 情報セキュリティ株式会社 · •Web crawlers are known by a variety of names –industry jargon labels them spiders or bots but technically they are referred

Information Security Confidential - Partner Use Only

Ways to crawl a website

10

• Scrapy framework

(https://doc.scrapy.org/en/master/intro/tutorial.html)

Page 11: Web Crawlers - 情報セキュリティ株式会社 · •Web crawlers are known by a variety of names –industry jargon labels them spiders or bots but technically they are referred

Information Security Confidential - Partner Use Only

Ways to crawl a website

11

• Scrapy framework

(https://doc.scrapy.org/en/master/intro/tutorial.html)

▲ Example Spider (extract all links and follow them)

Page 12: Web Crawlers - 情報セキュリティ株式会社 · •Web crawlers are known by a variety of names –industry jargon labels them spiders or bots but technically they are referred

Information Security Confidential - Partner Use Only

References

12

• Wikipedia

https://en.wikipedia.org/wiki/Web_crawler

• ScienceDaily

https://www.sciencedaily.com/terms/web_crawler.htm

• Metasploit

https://www.metasploit.com

• HTTrack

https://www.httrack.com

• Black Widow

http://softbytelabs.com/us/downloads.html

• Burp Suite

https://portswigger.net/burp

• Scrapy

https://scrapy.org/