Change language

Next-Generation Big Data


Next-Generation Big Data Butch Quinto

This book serves as a practical guide on how to utilize big data to store, process, and analyze structured data, focusing on three of the most popular Apache projects in the Hadoop ecosystem: Apache Spark, Apache Impala, and Apache Kudu (incubating). Together, these three Apache projects can rival most commercial data warehouse platforms in terms of performance and scalability at a fraction of the cost. Most next- generation big data and data science use cases are driven by structured data, and this book will serve as your guide.

About the Author Butch Quinto is Chief Data Officer at Lykuid, Inc. an advanced analytics company that provides an AI-powered infrastructure monitoring platform. As Chief Data Officer, Butch serves as the head of AI and data engineering, leading product innovation, strategy, research and development. Butch was previously Director of Analytics at Deloitte where he led strategy, solutions development and delivery, technology innovation, business development, vendor alliance and venture capital due diligence. While at Deloitte, Butch founded and developed several key big data, IoT and artificial intelligence applications including Deloitte’s IoT Framework, Smart City Platform and Geo-Distributed Telematics Platform. Butch was also the co-founder and lead lecturer of Deloitte’s national data science and big data training programs. Butch has more than 20 years of experience in various technical and leadership roles at start-ups and Global 2000 corporations in several industries including banking and finance, telecommunications, government, utilities, transportation, e-commerce, retail, technology, manufacturing, and bioinformatics. Butch is a recognized thought leader and a frequent speaker at conferences and events. Butch is a contributor to the Apache Spark and Apache Kudu open source projects, founder of the Cloudera Melbourne User Group and was Deloitte’s Director of Alliance for Cloudera. About the Technical Reviewer Irfan Elahi has years of multidisciplinary experience in Data Science and Machine Learning. He has worked in a number of verticals such as consultancy firms, his own start-ups, and academia research lab. Over the years he has worked on a number of data science and machine learning projects in different niches such as telecommunication, retail, Web, public sector, and energy with the goal to enable businesses to derive immense value from their data-assets.

See also

Learn programming in R: courses


Best Python online courses for 2022


Best laptop for Fortnite


Best laptop for Excel