Taming Big Data Variety with Semantic Graph Databases
Share this Session:
  Evren Sirin   Evren Sirin
Chief Technology Officer
Complexible Inc


Tuesday, August 18, 2015
01:00 PM - 01:45 PM

Level:  Technical - Intermediate

Three Vs are commonly used to describe challenges in dealing with Big Data: Volume, Velocity, and Variety. There has been a lot of focus on dealing with volume (e.g. horizontally scalable NoSQL solutions) and velocity (e.g. real-time streaming solutions) aspects of big data whereas there has been no satisfactory solution for handling the variety of big data. In this talk, we will explain how semantic technologies and graph databases can be used to address the schema variety problem by providing a flexible data integration framework. Graphs can be used to represent various kinds of heterogeneous data and the RDF graph model can be used to serialize, query and reason with semantic graphs using W3C standards. Our talk will describe how we extend the popular open-source MapReduce framework Spark with RDF and SPARQL support. This extension builds on top of the Stardog RDF database we develop but provides a unified view over RDF model and property graphs as they are supported in the TinkerPop stack.

Evren Sirin is co-founder and the Chief Technology Officer of Complexible Inc that develops Stardog, the leading semantic graph database. At Complexible, Evren leads research and development in the areas of semantic technologies, information management, and graph databases. Before joining Complexible, he received his Ph.D. in Computer Science from the University of Maryland, College Park, under the supervision of Jim Hendler where he published many papers in top-tier conferences and journals.

Close Window