Top reasons why shift to spark

This blog has been moved to new address: http://www.trongkhoanguyen.com.

– Fast, in-memory (100x faster) or disk (2-10x faster). See Daytona GraySort contest and Official Result
– Usability: rich APIs (Scala, Java, Python), concise, interactive shell

Complexity

LoC of Spark in comparison with other projects

Complexity2

LoC of Spark core-framework and its integrated libraries

 

– Well designed, unified: Spark is a general platform and SparkSQL, SparkStreaming, GraphX, MLib are standard libraries included with Spark. These libraries provide a wide range of features that support multiple usages.
– Concrete foundation: Databricks & UC Berkeley AMPLab & Community
– Many adopters: Amazon, Yahoo!, Autodesk, Technicolor, Baidu, Celtra, eBay Inc., IBM Almaden, SamSung SDS, Sonny, …

Advertisements
This entry was posted in Spark. Bookmark the permalink.

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s