Data Algorithm

Author: shamumu | Published: 2015-11-06 12:21 | Views: 19

MapReduce

Map-Reduce
  • MapReduce is a framework for developing distributed applications in Java, Scala, and other programming languages.
  • MapReduce's distributed filesystem (HDFS, in Hadoop's case) is not a replacement for a relational database management system.

Hadoop and Spark

Hadoop and Spark are two different distributed software frameworks.

  • A Spark job can be an arbitrary DAG (directed acyclic graph) of map and/or reduce/shuffle phases. Spark can run with or without Hadoop.
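
The idea of a job as a graph of map and reduce/shuffle phases can be sketched in plain Python. This is not the Spark API: `map_stage` and `reduce_by_key` are hypothetical helpers written for this example, the input is made up, and the pipeline is a linear chain, which is the simplest possible DAG. It counts words, then regroups the words by their count, so it needs two shuffle phases rather than the single map-then-reduce pass of classic MapReduce.

```python
from collections import defaultdict
from functools import reduce


def map_stage(records, fn):
    """Apply fn to every record; fn returns a list of output records."""
    return [out for record in records for out in fn(record)]


def reduce_by_key(pairs, fn):
    """Shuffle (key, value) pairs by key, then fold each key's values with fn."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return [(key, reduce(fn, values)) for key, values in groups.items()]


# Hypothetical sample input, chosen only for illustration.
lines = ["a b a", "b c", "a d"]

# Stage 1 (map): split each line into (word, 1) pairs.
pairs = map_stage(lines, lambda line: [(word, 1) for word in line.split()])

# Stage 2 (reduce/shuffle): sum the counts for each word.
counts = reduce_by_key(pairs, lambda x, y: x + y)

# Stage 3 (map): re-key each result by its count.
by_count = map_stage(counts, lambda kv: [(kv[1], [kv[0]])])

# Stage 4 (reduce/shuffle): collect the words that share a count.
words_by_count = dict(reduce_by_key(by_count, lambda x, y: x + y))
print(words_by_count)
```

Each stage depends only on the output of the stage before it, so a scheduler that knows the whole graph can plan all four stages as one job instead of as two separate map-reduce jobs.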

Benefits for big data processing

  • reliability
  • scalability
  • distributed processing
  • parallelism

Chapter 10: Content-Based Recommendation: Movies
