Dr. Brand Niemann Director and Senior Data Scientist Semantic Community and Dr. Joan Aron
description
Transcript of Dr. Brand Niemann Director and Senior Data Scientist Semantic Community and Dr. Joan Aron
![Page 1: Dr. Brand Niemann Director and Senior Data Scientist Semantic Community and Dr. Joan Aron](https://reader036.fdocuments.net/reader036/viewer/2022070419/56815b28550346895dc8e6c7/html5/thumbnails/1.jpg)
1
Data Science for DataBayDataBay "Reclaim the Bay" Innovation Challenge:
August 1-3, 2014, Smithsonian Environmental Research Center, 647 Contees Wharf Rd, Edgewater, MD 21037http://databay.splashthat.com
Dr. Brand NiemannDirector and Senior Data Scientist
Semantic Communityand
Dr. Joan AronPrincipal
Aron ConsultingAugust 3, 2014
http://semanticommunity.info/Data_Science/Data_Science_for_DataBay
![Page 2: Dr. Brand Niemann Director and Senior Data Scientist Semantic Community and Dr. Joan Aron](https://reader036.fdocuments.net/reader036/viewer/2022070419/56815b28550346895dc8e6c7/html5/thumbnails/2.jpg)
2
So take a look at the new data catalogue & let us know what you think
• Can you work with this data?– Yes, especially 5 sites that provide spreadsheet downloads. There are 5 sites that
require a separate inventory. There are two sites that require browsing lots of data sets to make a selection. There is one site that does not appear to provide the actual data and one site that requires the user to have ArcGis software.
• Are you encountering any issues with the datasets?– There are two sites that return Error Messages and one site that requires
ArcGis software.• Are there relevant datasets or websites which are missing?
– Probably, but there is so much to work with and so little time that can come later.• What other information would you like to see?
– What I have done here as a Data Scientist to begin the Data Mining Process as follows: See next two slides.
![Page 3: Dr. Brand Niemann Director and Senior Data Scientist Semantic Community and Dr. Joan Aron](https://reader036.fdocuments.net/reader036/viewer/2022070419/56815b28550346895dc8e6c7/html5/thumbnails/3.jpg)
3
DataBay Bibliography Catalogue
http://semanticommunity.info/@api/deki/files/30279/Data-Bay-Bibliography-1.xlsx
![Page 4: Dr. Brand Niemann Director and Senior Data Scientist Semantic Community and Dr. Joan Aron](https://reader036.fdocuments.net/reader036/viewer/2022070419/56815b28550346895dc8e6c7/html5/thumbnails/4.jpg)
4
Data Mining Process Standard
Source: Data Science for Business (2013) at http://shop.oreilly.com/product/0636920028918.do
![Page 5: Dr. Brand Niemann Director and Senior Data Scientist Semantic Community and Dr. Joan Aron](https://reader036.fdocuments.net/reader036/viewer/2022070419/56815b28550346895dc8e6c7/html5/thumbnails/5.jpg)
5
Federal Big Data Working Group Meetup
• The Fourth Paradigm of Science (1):– Fourth Paradigm. Data-intensive science that exploits the large volumes of
data in new ways for scientific exploration, such as the International Virtual Observatory Alliance in astronomy.
• The Fourth Question of Big Data for Science (2):• How was the data collected?• Where is the data stored?• What are the data results?• Does the data story persuade?
• Data Science Data Publications:– In General-Open Government and Non-Government Research Data in Data
FAIRports (Findable, Accessible, Interoperable, and Reusable) or Commons (e.g. NIH BIG DATA Program)
– Specifically-Chesapeake Bay Program and EPA EnviroAltas(1) Bell G, Hey, T., & Szalay, A. (2009) Beyond the data deluge, Science 323, 6 March 2009, pp. 1297-1298.(2) de Waard, Anita, (2014) About Stories, that Persuade With Data, Federal Big Data Working Group Meetup, 20 May,, 41 slides.
![Page 6: Dr. Brand Niemann Director and Senior Data Scientist Semantic Community and Dr. Joan Aron](https://reader036.fdocuments.net/reader036/viewer/2022070419/56815b28550346895dc8e6c7/html5/thumbnails/6.jpg)
6
Data Science for DataBay:Knowledge Base in MindTouch (Wiki)
http://semanticommunity.info/Data_Science/Data_Science_for_DataBay
![Page 7: Dr. Brand Niemann Director and Senior Data Scientist Semantic Community and Dr. Joan Aron](https://reader036.fdocuments.net/reader036/viewer/2022070419/56815b28550346895dc8e6c7/html5/thumbnails/7.jpg)
7
Data Science for DataBay:Data Commons in MindTouch (Wiki)
http://semanticommunity.info/Data_Science/Data_Science_for_DataBay
![Page 8: Dr. Brand Niemann Director and Senior Data Scientist Semantic Community and Dr. Joan Aron](https://reader036.fdocuments.net/reader036/viewer/2022070419/56815b28550346895dc8e6c7/html5/thumbnails/8.jpg)
8
Data Science for DataBay:Data Commons in Spotfire (Business Intelligence) 1
My Note: Read Instructions to Execute These Dynamically Linked Visualizations.
![Page 9: Dr. Brand Niemann Director and Senior Data Scientist Semantic Community and Dr. Joan Aron](https://reader036.fdocuments.net/reader036/viewer/2022070419/56815b28550346895dc8e6c7/html5/thumbnails/9.jpg)
9
Data Science for DataBay:Data Commons in Spotfire (Business Intelligence) 2
My Note: Read Instructions to Execute These Dynamically Linked Visualizations.
![Page 10: Dr. Brand Niemann Director and Senior Data Scientist Semantic Community and Dr. Joan Aron](https://reader036.fdocuments.net/reader036/viewer/2022070419/56815b28550346895dc8e6c7/html5/thumbnails/10.jpg)
10
Some Conclusions and Next Steps
• We formed a team with a senior data scientist and a senior environmental scientist from members of the Federal Big Data Working Group Meetup.
• We looked at the new data catalogue & let you know what you thought about using it.
• We have built and deployed Knowledge Base, Data FAIRport-Data Commons, and Business Intelligence Applications on the Semantic Data Web.
• Our work on in-depth data science for the Chesapeake Bay Program and EPA EnviroAltas continues.