MetricMiner: Supporting Researchers in Mining Software Repositories - SCAM 2013

11
MetricMiner: Supporting Researchers in Mining Software Repositories Francisco Zigmund Sokol , Maurício Finavaro Aniche, Marco Aurélio Gerosa {sokol,aniche,gerosa}@ime.usp.br

description

MetricMiner: Supporting Researchers in Mining Software Repositories - SCAM 2013

Transcript of MetricMiner: Supporting Researchers in Mining Software Repositories - SCAM 2013

Page 1: MetricMiner: Supporting Researchers in Mining Software Repositories - SCAM 2013

MetricMiner: Supporting Researchers in Mining Software Repositories

Francisco Zigmund Sokol, Maurício Finavaro Aniche, Marco Aurélio Gerosa

{sokol,aniche,gerosa}@ime.usp.br

Page 2: MetricMiner: Supporting Researchers in Mining Software Repositories - SCAM 2013

Motivation

Starting a study in MSR is hard as it involves expensive tasks:

➔ Download source code repositories.➔ Extract useful information from raw data.➔ Calculate metrics.➔ Typically, it’s required to install and configure

tools locally.

Page 3: MetricMiner: Supporting Researchers in Mining Software Repositories - SCAM 2013

MetricMiner

A web application for supporting common MSR studies steps and possibly making use of cloud computing to scale.

http://metricminer.org.br/

https://github.com/metricminer-msr/metricminer

Page 4: MetricMiner: Supporting Researchers in Mining Software Repositories - SCAM 2013

MetricMiner

Page 5: MetricMiner: Supporting Researchers in Mining Software Repositories - SCAM 2013
Page 6: MetricMiner: Supporting Researchers in Mining Software Repositories - SCAM 2013
Page 7: MetricMiner: Supporting Researchers in Mining Software Repositories - SCAM 2013
Page 8: MetricMiner: Supporting Researchers in Mining Software Repositories - SCAM 2013
Page 9: MetricMiner: Supporting Researchers in Mining Software Repositories - SCAM 2013

Replication study

To validate our tool, we replicated a study published by Soetens and Demeyer (2010).

Does refactoring improve code complexity?

Original study:

➔ 1 Java open source project➔ 800 commits

Extracting data from MetricMiner query interface:

➔ 256 Java projects from Apache➔ 500k commits

Page 10: MetricMiner: Supporting Researchers in Mining Software Repositories - SCAM 2013

Next steps

➔ Improve user interface➔ Improve scalability - maybe moving to Google

BigQuery➔ Gather more data!