Big Data or Big Garbage? A Tale of a Research Journey for Real-Time Business Intelligence

09:00
Mercredi
7
Juin
2017
Organisé par : 
Intervenant : 
Équipes : 

Bio : Rakesh Agrawal is the President and Founder of the Data Insights Laboratories. He is a member of National Academy of Engineering, a Fellow of ACM, and a Fellow of IEEE. He has been both an IBM Fellow and a Microsoft Fellow. ACM SIGKDD awarded him its inaugural Innovations Award and ACM SIGMOD the Edgar F. Codd Award. He was named to the Scientific American’s First list of top 50 Scientists. Rakesh has been granted 80+ patents and published 200+ papers, including the 1st and 2nd highest cited in databases and data mining. Four of his papers have received “test-of-time” awards. His research formed the nucleus of IBM Intelligent Miner that led the creation of data mining as a new software category. Besides Intelligent Miner, several other commercial products incorporate his work, including IBM DB2 and WebSphere and Microsoft Bing.

We present the story of a research expedition (code-named WaveFour) into building an enterprise-scale, real-time business intelligence system over social data. We discuss what drove us to undertake this journey and the system prototype we built. We also describe the investigation we carried out to assess the overlap between Google and Bing search results and whether including social data in the mix can produce different and useful results. We conclude with lessons learned and future directions.