=Paper=
{{Paper
|id=Vol-1353/Rajasekar2
|storemode=property
|title=None
|pdfUrl=https://ceur-ws.org/Vol-1353/Rajasekar2.pdf
|volume=Vol-1353
}}
==None==
Big Data & Data Science: A Practitioner’s Perspective Arcot Rajasekar Professor University of North Carolina Chapel Hill, NC rajasekar@unc.edu Biographical Sketch Arcot Rajasekar is a Professor in the School of Library and Information Sciences at the University of North Carolina at Chapel Hill, a Chief Scientist at the Renaissance Compu- ting Institute (RENCI) and co-Director of Data Intensive Cyber Environments (DICE) Center at the University of North Carolina at Chapel Hill. Previously he was at the San Diego Supercomputer Center at the University of Cali- fornia, San Diego, leading the Data Grids Technology Group. He has been involved in research and development of data grid middleware systems for over a decade and is a lead originator behind the concepts in the Storage Resource Broker (SRB) and the integrated Rule Oriented Data Sys- Abstract tems (iRODS), two premier data grid middleware devel- oped by the Data Intensive Cyber Environments Group. A When people talk about Big Data, they have a vision of large data - Volume and Velocity - data in Tera Bytes, Peta leading proponent of policy-oriented large-scale data man- Bytes and of data coming in at a fast clip - like tweets, you- agement, Rajasekar has several research projects funded by tube clips, etc. But there is another side to data - created the National Science Foundation, the National Archives, and maintained by individuals or small groups for their National Institute of Health and other federal agencies. own purpose - research or otherwise. The Volume and Va- Rajasekar has a PhD in Computer Science from the Uni- riety of these data, in toto, exceeds the size of the other Big versity of Maryland at College Park and has more than 100 Data. We call these data as "dark data" as they are there but publications in the areas of data grids, digital library, per- unknown to the world and suffer from a first mile and last sistent archives, logic programming and artificial intelli- mile data problem. The talk is geared towards examining gence. His latest projects include the Datanet Federation this aspect of big data phenomenon and its ramification for Consortium and the Data Bridge that is building a social data science. network platform for scientific data.