Scale a Near Real-Time AI System by 4X and Beyond with Apache Spark Yan Li (Conviva) and Shubo Liu (Conviva) from worker nodes Watch Video
Preview(s):
Gallery
Play Video: (Note: The default playback of the video is HD VERSION. If your browser is buffering the video slowly, please play the REGULAR MP4 VERSION or Open The Video below for better experience. Thank you!)
⏲ Duration: 16 min 84 sec ✓ Published: 13-Jun-2018
Description: During last year’s Spark Summit, we presented a near real-time spark based application for video streaming quality analysis. Building a new system is always sweet with a lot of fun, productizing it sometimes can be bitter. At our scale, it was worth spending development time to reduce machine requirements.nnWe did this by: optimizing the number of splits; minimizing intra-job data shuffling; and customizing our SerDe. With these optimizations we were able to use 40% fewer machines. The Spark U
Play Video: (Note: The default playback of the video is HD VERSION. If your browser is buffering the video slowly, please play the REGULAR MP4 VERSION or Open The Video below for better experience. Thank you!)