=Paper= {{Paper |id=Vol-1353/Rajasekar2 |storemode=property |title=None |pdfUrl=https://ceur-ws.org/Vol-1353/Rajasekar2.pdf |volume=Vol-1353 }} ==None== https://ceur-ws.org/Vol-1353/Rajasekar2.pdf
                 Big Data & Data Science: A Practitioner’s Perspective
                                                       Arcot Rajasekar
                                                             Professor
                                                    University of North Carolina
                                                          Chapel Hill, NC
                                                        rajasekar@unc.edu




                                                                                      Biographical Sketch

                                                                     Arcot Rajasekar is a Professor in the School of Library and
                                                                     Information Sciences at the University of North Carolina at
                                                                     Chapel Hill, a Chief Scientist at the Renaissance Compu-
                                                                     ting Institute (RENCI) and co-Director of Data Intensive
                                                                     Cyber Environments (DICE) Center at the University of
                                                                     North Carolina at Chapel Hill. Previously he was at the
                                                                     San Diego Supercomputer Center at the University of Cali-
                                                                     fornia, San Diego, leading the Data Grids Technology
                                                                     Group. He has been involved in research and development
                                                                     of data grid middleware systems for over a decade and is a
                                                                     lead originator behind the concepts in the Storage Resource
                                                                     Broker (SRB) and the integrated Rule Oriented Data Sys-
                         Abstract
                                                                     tems (iRODS), two premier data grid middleware devel-
                                                                     oped by the Data Intensive Cyber Environments Group. A
When people talk about Big Data, they have a vision of
large data - Volume and Velocity - data in Tera Bytes, Peta          leading proponent of policy-oriented large-scale data man-
Bytes and of data coming in at a fast clip - like tweets, you-       agement, Rajasekar has several research projects funded by
tube clips, etc. But there is another side to data - created         the National Science Foundation, the National Archives,
and maintained by individuals or small groups for their              National Institute of Health and other federal agencies.
own purpose - research or otherwise. The Volume and Va-              Rajasekar has a PhD in Computer Science from the Uni-
riety of these data, in toto, exceeds the size of the other Big      versity of Maryland at College Park and has more than 100
Data. We call these data as "dark data" as they are there but        publications in the areas of data grids, digital library, per-
unknown to the world and suffer from a first mile and last           sistent archives, logic programming and artificial intelli-
mile data problem. The talk is geared towards examining              gence. His latest projects include the Datanet Federation
this aspect of big data phenomenon and its ramification for          Consortium and the Data Bridge that is building a social
data science.                                                        network platform for scientific data.