FOSSMeet 2019

FOSSMeet is an annual event at NIT Calicut that brings together the Free and Open Source Community from around the country.

Structured Data Modelling for AI using Wikidata

Submitted by Ranjith Siji (@ranjithsiji) on Sunday, 23 December 2018

Section: Student Technical level: Intermediate

Abstract

Adoption of machine learning and artificial intelligence is critical for success in the era of rapid digital transformation, it’s even more important how institutions make structure data for driving insights. Nowadays all kind of industry is driven by data. Some 80% of all digital data is unstructured, Kesher said during a keynote at the AI Summit in San Francisco. The vast majority of data representing the digital universe, however, is and will remain unstructured. And although unstructured documents are widely used as key inputs and “systems” for core business activities. Structured, organized data is critical for accurate and dynamic adjustment of outputs in continuously changing environments. Unstructured data is, more or less, useless to analyse the current situation and make critical decisions. Data modelling is required to make structured data from unstructured data. Wikidata is the ultimate data modelling platform to make structured data for any kind of data. Wikidata is flexible enough to model various kinds of data sets using collaborative inputs. Through this introductory talk/workshop, I would like to introduce the process of data modelling for various kinds of items using Wikidata. Also how to get the structured data output from Wikidata using its Query service and APIs.

Outline

  1. Structured Data
  2. Open Data and Closed Data
  3. Data modelling for Meaning full output
  4. Wiki as in General
  5. Wikidata
  6. Components of Wikidata
  7. Data model in Wikidata
  8. How to model data using Wikidata
  9. How to get Data from Wikidata
  10. SPARQL Query service
  11. Tools associated with Wikidata
  12. Projects built upon Wikidata

Requirements

If it is a workshop all the participants to have computer access plus good internet conenction is required. If it is a talk then Projector, Internet connection is required.

Speaker bio

Ranjith Siji is one of the Bureaucrats in Malayalam Wikipedia, Member of IndicTechcom, CEO of Alphafork Technologies and a frequent speaker at Wikimedia events in Kerala. For the past ten years, Ranjith is contributing to Wikimedia projects mainly Wikipedia. Ranjith is one of the organizer of first India level Label-a-thon on Wikidata. He started editing Wikidata from its early years.

Links

Slides

https://docs.google.com/presentation/d/e/2PACX-1vQ-BjFfpdMMOux5bJIs_3Kvjy1pYulDwLkHICPlaAxKq037NBXQurEz5wXJoQYkfVmD3VAWwF49ykNx/pub?start=false&loop=false&delayms=60000

Comments

Login with Twitter or Google to leave a comment