Big Data Technology Beyond Hadoop: In-Memory Real-Time Big Data Analytics with Spark and Shark

  • Resource size: 1841 K
  • Upload date: 2023-09-27
  • Uploaded by: lostxc
  • Resource points: 2 download points
  • Tags: hadoop, big data

Resource Description


Big Data beyond Hadoop

Big Data today
  • The [...] is in the room

Big Data beyond Hadoop
  • Real-time analytical processing (RTAP)
    – Discover and explore data iteratively and interactively for real-time insights
  • Advanced machine learning and data mining (MLDM)
    – Graph-parallel predictive analytics (non-SQL)
  • Distributed in-memory analytics
    – Exploit available main memory in the entire cluster for >100x speedup


RTAP: Real-Time Analytical Processing

Real-Time Analytical Processing (RTAP)
  • Data ingested & processed in a streaming fashion
  • Real-time data queried and presented in an online fashion
  • Real-time and history data combined and mined interactively
  • Predominantly RAM-based processing
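
The four RTAP properties above can be illustrated with a minimal single-process sketch. This is not Spark or Shark code and is not taken from the slides: the class `RtapCounter`, its method names, and the sliding-window policy are hypothetical, chosen only to show streaming ingestion, online queries, and the combination of recent and historical data held entirely in RAM. It assumes events arrive with non-decreasing timestamps.

```python
from collections import defaultdict, deque


class RtapCounter:
    """Toy in-memory store (hypothetical): historical counts plus a sliding
    window of recent events, both kept in RAM."""

    def __init__(self, history, window_seconds):
        self.history = dict(history)           # key -> historical count, preloaded
        self.window_seconds = window_seconds   # how long an event counts as "recent"
        self.recent = deque()                  # (timestamp, key) in arrival order
        self.recent_counts = defaultdict(int)  # key -> count inside the window

    def ingest(self, timestamp, key):
        """Streaming ingestion: record one event, then age out old ones.
        Assumes timestamps are non-decreasing."""
        self.recent.append((timestamp, key))
        self.recent_counts[key] += 1
        self._expire(timestamp)

    def _expire(self, now):
        """Move events that fell out of the window into the historical counts."""
        cutoff = now - self.window_seconds
        while self.recent and self.recent[0][0] <= cutoff:
            _, old_key = self.recent.popleft()
            self.recent_counts[old_key] -= 1
            self.history[old_key] = self.history.get(old_key, 0) + 1

    def query(self, key):
        """Online query: combine recent and historical counts for one key."""
        recent = self.recent_counts.get(key, 0)
        historical = self.history.get(key, 0)
        return {"recent": recent, "historical": historical,
                "total": recent + historical}


counter = RtapCounter(history={"a": 10}, window_seconds=60)
counter.ingest(0, "a")
counter.ingest(30, "b")
counter.ingest(70, "a")   # the event at t=0 leaves the 60-second window here
result_a = counter.query("a")
result_b = counter.query("b")
```

A cluster-scale system would partition this state across the memory of many machines; the sketch only shows the per-key bookkeeping on one node.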
