一個數學世界? 抑或多個數學世界? ––– STEM中 …...一個數學世界? 抑或多個數學世界? ––– STEM中的M 是個怎樣的數學世界? 蕭文強
G 社会と情報/情報世界と現実世界の融合 LODAC: 学術リソース … · 2017. 4....
Transcript of G 社会と情報/情報世界と現実世界の融合 LODAC: 学術リソース … · 2017. 4....
`
社会と学術をつなぐデータの世界
連絡先:武⽥英明 / 国⽴情報学研究所 情報学プリンシプル研究系TEL : 03-4212-2543 Email : [email protected] http://lod.ac
RDF4U: RDF Graph Visualization (武田英明, ラッタチャイチャウウタイ)
LODAC: 学術リソースのためのオープン・ソーシャル・セマンティックWeb基盤の構築
社会と情報/情報世界と現実世界の融合G
If we display a node-link diagram or a concept-map diagram to readersdirectly, we will face with the following problems.1.A Query Graphis too Complicated to Read
A lot of inferred data that create giant components in a graph diagram.2. Lacking of Reading Flow of RDF Data
Background Content and Main Pointare NOT structuredin anyRDF graphs.http ://rc.lodac.nii.ac.jp /rd f4u
bit.ly/rdf4u
Triple Ranking
s2
s1
owl:sameAs
p1 o1
s1p1
o1& fD(s1) > fD(s2)
p1 o2
s1
o1p1p1
rdf:type C2
rdfs:subClassOfx C1
rdf:type
C2
rdfs:subClassOfx C1
rdf:type
o2
s1
o1p1p1
& <p1 rdf:typeowl:transitivePropert y>
1
2
3
To merge same-as nodes
To remove inferred transitive links
To remove inferred rdf:type hierarchies読みやすくなるよ〜!
やあ〜読めない!
Well-prepared RDF repositories did reasoning on ontologies in orderto support a SPARQL service, so inferred triples create giantcomponents in a graph.
Thus, the power of Semantic Web rules is used to simplify a graph.
(High Value) Key Concept is an important term that is always found in the query result and not many in the whole dataset.
(Low Value) General Concept is a term that is commonly known and they are always found in the whole dataset.
Weight of a URI
Visualization-Weight of a Triple
Concept Level
Information Level
w(uri)=fQ(uri)
log( fD(uri) + 1)
triplew(s) + w(p) + w(o)
3vw(⟨ s,p,o⟩ ) =
fQ(uri) number of the URI in a query graph
fD(uri) number of the URI in the whole dataset
(High Value) Topic-Specific Information contains specific terms that are highly relevance to the article. (a lot of key concepts)
(Low Value) Common Information explains background knowledge that supports readers to understand the main content. (a lot of general concepts)
average
Graph Simplification
農業分野における用語標準化(武田英明, 朱成敏)どんな研究?
農業ITシステムの相互運用性
農業現場で発⽣したデータの連携と統合のために基準になる標準語彙を定義しています。
何ができる?
異なるシステムから発⽣したデータも統合・連携が可能になって農業分野全般において農作業計画の最適化、収穫量の予測など、より有⽤にデータを活⽤することが可能になります。
区画管理営農管理資材管理経営管理
…
データの発生
センサーなどによる現場観測
農業用語の多様性
#代かき #代掻き #しろかき #荒代かき #整地#均平化 #代掻き作業 #かじり システムB システムC
データ項目が標準化されていない!
代かき
整地作業
かじり
解決方法:農業分野の標準語彙を定義する
は種 : 種⼦繁殖のために圃場で種を播く作業
⽬的 : 種⼦繁殖場所 : 圃場対象 : 種⾏為 : 播く
対象分野 : 農作業、作物、肥料 ...定義⼿段 : オントロジーの構築
オントロジー(Ontology)とは?対象の概念と概念間の相互関係を定義する体系。- 上位クラスが持つ属性を下位が継承- 概念と表記を分離
は種の例
属性 属性の値
農作業概念の構造化
は種 : 播種
概念名(見出語)
表記(同意語)
1. ⽬的によって概念を細分化する作物⽣産作業 > 作物⽣育作業> 繁殖制御作業 > 種⼦繁殖作業
2.⾏為、対象、場所、⼿段、時期、機材、作物によって細分化
は種 苗箱直播(は種+場所:苗箱)湛⽔直播(は種+場所:⽔⽥)乾⽥直播(は種+場所:乾⽥)
記述論理(Description Logics)とは?理論的基盤となる概念階層の記述に特化した論理的⾔語- 概念の⽭盾や同⼀性を判断- 機械による推論処理が容易
http://cavoc.org/
農作業基本オントロジー(AAO : Agriculture Activity Ontology)
http://cavoc.org/aao/
3.記述論理を⽤いて表現
Linked Open Data を用いたオープンデータ・オープンサイエンス基盤に関する研究
〒101-8430 東京都千代田区一ツ橋 2-1-2 国立情報学研究所 武田研究室
LODAC Project 武田英明 大向一輝 加藤文彦 小出誠二 亀田 尭宙 松村冬子 嘉村哲郎 深見嘉明 高橋徹 上田洋 小林巌生〒101-8430 東京都千代田区一ツ橋2-1-2 国立情報学研究所 Tel: 03-4212-2543 Fax: 03-3556-1916 Email: [email protected] http://lod.ac
武田 英明 ( 情報学プリンシプル研究系 )・大向 一輝 ( コンテンツ科学研究系 )
社会と学術をつなぐデータの世界
Linked Open Data で創るデータの Web
DBpedia Japanese の効果
Data Extraction
Template
Mapping infobox to ontologyused for extraction
Infobox からの抽出処理イメージ
http://ja.dbpedia.org
DBpedia JapaneseDBpedia Japanese は LODクラウドの中心であるDBpedia の兄弟プロジェクトで、日本語Wikipedia をもとに汎用的な情報のデータセットを作成・提供しています。英語等他言語にもリンクしているため、多言語のハブにもなっています。
DBpedia Japnese の公開後、活用する研究やデータセットアプリケーションが増加。
研究活用推移 データセット分野別
DBpediaJapanese
Japanese WikipediaOntology
LSJ
NDL Authorities
LC
VIAF
DBpedia
LODACMuseum
saveMLAK
Yokohama Art
SOCIA
LODACSpecies
AozoraBunko
Kyoto Manga
Museum
CiNii
KAKEN
GeoLOD
EarthquakeArchives
Fukushima
Geonames
RIHN
LOD Cloud
Open license Fumihiro Kato, 2015-11-18
Publication
Life Science Cross-domainMedia Government
GeographicIndustry
User generated content
LSD
i-Scover
Allie
Senkyo
Statdb
Michishiru
N-ken
Open DATA METI
RNR
J-GLOBALknowledge
Geonames.jpEvaCva
ISIL
Linked Datasets as of August 2014
Uniprot
AlexandriaDigital Library
Gazetteer
lobidOrganizations
chem2bio2rdf
MultimediaLab University
Ghent
Open DataEcuador
GeoEcuador
Serendipity
UTPLLOD
GovAgriBusDenmark
DBpedialive
URIBurner
Linguistics
Social Networking
Life Sciences
Cross-Domain
Government
User-Generated Content
Publications
Geographic
Media
Identifiers
EionetRDF
lobidResources
WiktionaryDBpedia
Viaf
Umthes
RKBExplorer
Courseware
Opencyc
Olia
Gem.Thesaurus
AudiovisueleArchieven
DiseasomeFU-Berlin
Eurovocin
SKOS
DNBGND
Cornetto
Bio2RDFPubmed
Bio2RDFNDC
Bio2RDFMesh
IDS
OntosNewsPortal
AEMET
ineverycrea
LinkedUser
Feedback
MuseosEspaniaGNOSS
Europeana
NomenclatorAsturias
Red UnoInternacional
GNOSS
GeoWordnet
Bio2RDFHGNC
CticPublic
Dataset
Bio2RDFHomologene
Bio2RDFAffymetrix
MuninnWorld War I
CKAN
GovernmentWeb Integration
forLinkedData
Universidadde CuencaLinkeddata
Freebase
Linklion
Ariadne
OrganicEdunet
GeneExpressionAtlas RDF
ChemblRDF
BiosamplesRDF
IdentifiersOrg
BiomodelsRDF
ReactomeRDF
Disgenet
SemanticQuran
IATI asLinked Data
DutchShips and
Sailors
Verrijktkoninkrijk
IServe
Arago-dbpedia
LinkedTCGA
ABS270a.info
RDFLicense
EnvironmentalApplications
ReferenceThesaurus
Thist
JudaicaLink
BPR
OCD
ShoahVictimsNames
Reload
Data forTourists in
Castilla y Leon
2001SpanishCensusto RDF
RKBExplorer
Webscience
RKBExplorerEprintsHarvest
NVS
EU AgenciesBodies
EPO
LinkedNUTS
RKBExplorer
Epsrc
OpenMobile
Network
RKBExplorerLisbon
RKBExplorer
Italy
CE4R
EnvironmentAgency
Bathing WaterQuality
RKBExplorerKaunas
OpenData
Thesaurus
RKBExplorerWordnet
RKBExplorer
ECS
AustrianSki
Racers
Social-semweb
Thesaurus
DataOpenAc Uk
RKBExplorer
IEEE
RKBExplorer
LAAS
RKBExplorer
Wiki
RKBExplorer
JISC
RKBExplorerEprints
RKBExplorer
Pisa
RKBExplorer
Darmstadt
RKBExplorerunlocode
RKBExplorer
Newcastle
RKBExplorer
OS
RKBExplorer
Curriculum
RKBExplorer
Resex
RKBExplorer
Roma
RKBExplorerEurecom
RKBExplorer
IBM
RKBExplorer
NSF
RKBExplorer
kisti
RKBExplorer
DBLP
RKBExplorer
ACM
RKBExplorerCiteseer
RKBExplorer
Southampton
RKBExplorerDeepblue
RKBExplorerDeploy
RKBExplorer
Risks
RKBExplorer
ERA
RKBExplorer
OAI
RKBExplorer
FT
RKBExplorer
Ulm
RKBExplorer
Irit
RKBExplorerRAE2001
RKBExplorer
Dotac
RKBExplorerBudapest
SwedishOpen Cultural
Heritage
Radatana
CourtsThesaurus
GermanLabor LawThesaurus
GovUKTransport
Data
GovUKEducation
Data
EnaktingMortality
EnaktingEnergy
EnaktingCrime
EnaktingPopulation
EnaktingCO2Emission
EnaktingNHS
RKBExplorer
Crime
RKBExplorercordis
Govtrack
GeologicalSurvey of
AustriaThesaurus
GeoLinkedData
GesisThesoz
Bio2RDFPharmgkb
Bio2RDFSabiorkBio2RDF
Ncbigene
Bio2RDFIrefindex
Bio2RDFIproclass
Bio2RDFGOA
Bio2RDFDrugbank
Bio2RDFCTD
Bio2RDFBiomodels
Bio2RDFDBSNP
Bio2RDFClinicaltrials
Bio2RDFLSR
Bio2RDFOrphanet
Bio2RDFWormbase
BIS270a.info
DM2E
DBpediaPT
DBpediaES
DBpediaCS
DBnary
AlpinoRDF
YAGO
PdevLemon
Lemonuby
Isocat
Ietflang
Core
KUPKB
GettyAAT
SemanticWeb
Journal
OpenlinkSWDataspaces
MyOpenlinkDataspaces
Jugem
Typepad
AspireHarperAdams
NBNResolving
Worldcat
Bio2RDF
Bio2RDFECO
Taxon-conceptAssets
Indymedia
GovUKSocietal
WellbeingDeprivation imd
EmploymentRank La 2010
GNULicenses
GreekWordnet
DBpedia
CIPFA
Yso.fiAllars
Glottolog
StatusNetBonifaz
StatusNetshnoulle
Revyu
StatusNetKathryl
ChargingStations
AspireUCL
Tekord
Didactalia
ArtenueVosmedios
GNOSS
LinkedCrunchbase
ESDStandards
VIVOUniversityof Florida
Bio2RDFSGD
Resources
ProductOntology
DatosBne.es
StatusNetMrblog
Bio2RDFDataset
EUNIS
GovUKHousingMarket
LCSH
GovUKTransparencyImpact ind.Households
In temp.Accom.
UniprotKB
StatusNetTimttmy
SemanticWeb
Grundlagen
GovUKInput ind.
Local AuthorityFunding FromGovernment
Grant
StatusNetFcestrada
JITA
StatusNetSomsants
StatusNetIlikefreedom
DrugbankFU-Berlin
Semanlink
StatusNetDtdns
StatusNetStatus.net
DCSSheffield
AtheliaRFID
StatusNetTekk
ListaEncabezaMientosMateria
StatusNetFragdev
Morelab
DBTuneJohn PeelSessions
RDFizelast.fm
OpenData
Euskadi
GovUKTransparency
Input ind.Local auth.Funding f.
Gvmnt. Grant
MSC
Lexinfo
StatusNetEquestriarp
Asn.us
GovUKSocietal
WellbeingDeprivation ImdHealth Rank la
2010
StatusNetMacno
OceandrillingBorehole
AspireQmul
GovUKImpact
IndicatorsPlanning
ApplicationsGranted
Loius
Datahub.io
StatusNetMaymay
Prospectsand
TrendsGNOSS
GovUKTransparency
Impact IndicatorsEnergy Efficiency
new Builds
DBpediaEU
Bio2RDFTaxon
StatusNetTschlotfeldt
JamendoDBTune
AspireNTU
GovUKSocietal
WellbeingDeprivation Imd
Health Score2010
LoticoGNOSS
UniprotMetadata
LinkedEurostat
AspireSussex
Lexvo
LinkedGeoData
StatusNetSpip
SORS
GovUKHomeless-
nessAccept. per
1000
TWCIEEEvis
AspireBrunel
PlanetDataProject
Wiki
StatusNetFreelish
Statisticsdata.gov.uk
StatusNetMulestable
Enipedia
UKLegislation
API
LinkedMDB
StatusNetQth
SiderFU-Berlin
DBpediaDE
GovUKHouseholds
Social lettingsGeneral Needs
Lettings PrpNumber
Bedrooms
AgrovocSkos
MyExperiment
ProyectoApadrina
GovUKImd CrimeRank 2010
SISVU
GovUKSocietal
WellbeingDeprivation ImdHousing Rank la
2010
StatusNetUni
Siegen
OpendataScotland Simd
EducationRank
StatusNetKaimi
GovUKHouseholds
Accommodatedper 1000
StatusNetPlanetlibre
DBpediaEL
SztakiLOD
DBpediaLite
DrugInteractionKnowledge
BaseStatusNet
Qdnx
AmsterdamMuseum
AS EDN LOD
RDFOhloh
DBTuneartistslast.fm
AspireUclan
HellenicFire Brigade
Bibsonomy
NottinghamTrent
ResourceLists
OpendataScotland SimdIncome Rank
RandomnessGuide
London
OpendataScotland
Simd HealthRank
SouthamptonECS Eprints
FRB270a.info
StatusNetSebseb01
StatusNetBka
ESDToolkit
HellenicPolice
StatusNetCed117
OpenEnergy
Info Wiki
StatusNetLydiastench
OpenDataRISP
Taxon-concept
Occurences
Bio2RDFSGD
UIS270a.info
NYTimesLinked Open
Data
AspireKeele
GovUKHouseholdsProjectionsPopulation
W3C
OpendataScotland
Simd HousingRank
ZDB
StatusNet1w6
StatusNetAlexandre
Franke
DeweyDecimal
Classification
StatusNetStatus
StatusNetdoomicile
CurrencyDesignators
StatusNetHiico
LinkedEdgar
GovUKHouseholds
2008
DOI
StatusNetPandaid
BrazilianPoliticians
NHSJargon
Theses.fr
LinkedLifeData
Semantic WebDogFood
UMBEL
OpenlyLocal
StatusNetSsweeny
LinkedFood
InteractiveMaps
GNOSS
OECD270a.info
Sudoc.fr
GreenCompetitive-
nessGNOSS
StatusNetIntegralblue
WOLD
LinkedStockIndex
Apache
KDATA
LinkedOpenPiracy
GovUKSocietal
WellbeingDeprv. ImdEmpl. Rank
La 2010
BBCMusic
StatusNetQuitter
StatusNetScoffoni
OpenElection
DataProject
Referencedata.gov.uk
StatusNetJonkman
ProjectGutenbergFU-BerlinDBTropes
StatusNetSpraci
Libris
ECB270a.info
StatusNetThelovebug
Icane
GreekAdministrative
Geography
Bio2RDFOMIM
StatusNetOrangeseeds
NationalDiet Library
WEB NDLAuthorities
UniprotTaxonomy
DBpediaNL
L3SDBLP
FAOGeopolitical
Ontology
GovUKImpact
IndicatorsHousing Starts
DeutscheBiographie
StatusNetldnfai
StatusNetKeuser
StatusNetRusswurm
GovUK SocietalWellbeing
Deprivation ImdCrime Rank 2010
GovUKImd Income
Rank La2010
StatusNetDatenfahrt
StatusNetImirhil
Southamptonac.uk
LOD2Project
Wiki
DBpediaKO
DailymedFU-Berlin
WALS
DBpediaIT
StatusNetRecit
Livejournal
StatusNetExdc
Elviajero
Aves3D
OpenCalais
ZaragozaTurruta
AspireManchester
Wordnet(VU)
GovUKTransparency
Impact IndicatorsNeighbourhood
Plans
StatusNetDavid
Haberthuer
B3Kat
PubBielefeld
Prefix.cc
NALT
Vulnera-pedia
GovUKImpact
IndicatorsAffordable
Housing Starts
GovUKWellbeing lsoa
HappyYesterday
Mean
FlickrWrappr
Yso.fiYSA
OpenLibrary
AspirePlymouth
StatusNetJohndrink
Water
StatusNetGomertronic
Tags2conDelicious
StatusNettl1n
StatusNetProgval
Testee
WorldFactbookFU-Berlin
DBpediaJA
StatusNetCooleysekula
ProductDB
IMF270a.info
StatusNetPostblue
StatusNetSkilledtests
NextwebGNOSS
EurostatFU-Berlin
GovUKHouseholds
Social LettingsGeneral Needs
Lettings PrpHousehold
Composition
StatusNetFcac
DWSGroup
OpendataScotland
GraphSimd Rank
DNB
CleanEnergyData
Reegle
OpendataScotland SimdEmployment
Rank
ChroniclingAmerica
GovUKSocietal
WellbeingDeprivation
Imd Rank 2010
StatusNetBelfalas
AspireMMU
StatusNetLegadolibre
BlukBNB
StatusNetLebsanft
GADMGeovocab
GovUKImd Score
2010
SemanticXBRL
UKPostcodes
GeoNames
EEARodAspire
Roehampton
BFS270a.info
CameraDeputatiLinkedData
Bio2RDFGeneID
GovUKTransparency
Impact IndicatorsPlanning
ApplicationsGranted
StatusNetSweetie
Belle
O'Reilly
GNI
CityLichfield
GovUKImd
Rank 2010
BibleOntology
Idref.fr
StatusNetAtari
Frosch
Dev8d
NobelPrizes
StatusNetSoucy
ArchiveshubLinkedData
LinkedRailway
DataProject
FAO270a.info
GovUKWellbeing
WorthwhileMean
Bibbase
Semantic-web.org
BritishMuseum
Collection
GovUKDev LocalAuthorityServices
CodeHaus
Lingvoj
OrdnanceSurveyLinkedData
Wordpress
EurostatRDF
StatusNetKenzoid
GEMET
GovUKSocietal
WellbeingDeprv. imdScore '10
MisMuseosGNOSS
GovUKHouseholdsProjections
totalHouseolds
StatusNet20100
EEA
CiardRing
OpendataScotland Graph
EducationPupils by
School andDatazone
VIVOIndiana
University
Pokepedia
Transparency270a.info
StatusNetGlou
GovUKHomelessness
HouseholdsAccommodated
TemporaryHousing Types
STWThesaurus
forEconomics
DebianPackageTrackingSystem
DBTuneMagnatune
NUTSGeo-vocab
GovUKSocietal
WellbeingDeprivation ImdIncome Rank La
2010
BBCWildlifeFinder
StatusNetMystatus
MiguiadEviajesGNOSS
AcornSat
DataBnf.fr
GovUKimd env.
rank 2010
StatusNetOpensimchat
OpenFoodFacts
GovUKSocietal
WellbeingDeprivation Imd
Education Rank La2010
LODACBDLS
FOAF-Profiles
StatusNetSamnoble
GovUKTransparency
Impact IndicatorsAffordable
Housing Starts
StatusNetCoreyavisEnel
Shops
DBpediaFR
StatusNetRainbowdash
StatusNetMamalibre
PrincetonLibrary
Findingaids
WWWFoundation
Bio2RDFOMIM
Resources
OpendataScotland Simd
GeographicAccess Rank
Gutenberg
StatusNetOtbm
ODCLSOA
StatusNetOurcoffs
Colinda
WebNmasunoTraveler
StatusNetHackerposse
LOV
GarnicaPlywood
GovUKwellb. happy
yesterdaystd. dev.
StatusNetLudost
BBCProgram-
mes
GovUKSocietal
WellbeingDeprivation Imd
EnvironmentRank 2010
Bio2RDFTaxonomy
Worldbank270a.info
OSM
DBTuneMusic-brainz
LinkedMarkMail
StatusNetDeuxpi
GovUKTransparency
ImpactIndicators
Housing Starts
BizkaiSense
GovUKimpact
indicators energyefficiency new
builds
StatusNetMorphtown
GovUKTransparency
Input indicatorsLocal authorities
Working w. tr.Families
ISO 639Oasis
AspirePortsmouth
ZaragozaDatos
AbiertosOpendataScotland
SimdCrime Rank
Berlios
StatusNetpiana
GovUKNet Add.Dwellings
Bootsnall
StatusNetchromic
Geospecies
linkedct
Wordnet(W3C)
StatusNetthornton2
StatusNetmkuttner
StatusNetlinuxwrangling
EurostatLinkedData
GovUKsocietal
wellbeingdeprv. imd
rank '07
GovUKsocietal
wellbeingdeprv. imdrank la '10
LinkedOpen Data
ofEcology
StatusNetchickenkiller
StatusNetgegeweb
DeustoTech
StatusNetschiessle
GovUKtransparency
impactindicatorstr. families
Taxonconcept
GovUKservice
expenditure
GovUKsocietal
wellbeingdeprivation imd
employmentscore 2010
LOD とは、Webページが相互につながって巨大なサイバースペースがつくられたように、Data がオープンかつ相互につながり合うことで巨大なデータの世界ができる仕組みです。
どんな研究 ? 何ができる ?Linked Open Data (LOD) という形で学術資源を提供する実践的な研究です。汎用的な情報、博物館資料、生物種情報などを LOD化しています。
・学術データに気軽にアクセス・データ間をサーフィンして知識獲得・他データと柔軟に組み合わせて再利用再公開・データに基づくアプリ作成