Spark Streaming: Common Pitfalls and Tips for Long-running Streaming Applications  11 Dec 2016    spark-streaming wip
Running Spark Streaming applications can introduce problems that you do not face when running Spark in batch mode. Here are a few things to take into account to keep long-running Spark Streaming jobs running smoothly.

ApplicationAttemptNotFoundException: Spark Application Stuck in ACCEPTED state on YARN  13 Nov 2016    emr yarn spark
ApplicationAttemptNotFoundException may be caused by the log directory becoming too crowded with data.

Using the AWS CLI to manage Spark Clusters on EMR: Examples and Reference  23 Mar 2016    emr cli wip spark
Update Java to JDK 8 on Amazon Elastic MapReduce  22 Mar 2016    emr spark java 8
Comparing Interactive Solutions for Running Scala and Spark: Zeppelin, Spark-notebook and Jupyter-scala  06 Mar 2016    notebook interactive scala spark zeppelin
Apache Zeppelin, Spark Streaming and Amazon Kinesis: Simple Guide and Examples  19 Feb 2016    emr spark zeppelin kinesis wip
Spark DataFrame UDFs: Examples using Scala and Python  11 Nov 2015    spark udf wip
Add an Apache Zeppelin UI to your Spark cluster on AWS EMR  10 Nov 2015    aws emr spark zeppelin wip
Creating a Spark Cluster on AWS EMR: a Tutorial  10 Nov 2015    aws emr spark
AWS now provides full support for Spark Clusters within Elastic MapReduce (EMR). It's very simple, and you need only a couple of minutes to learn how to do it.

Creating Scala Fat Jars for Spark on SBT with sbt-assembly Plugin  18 Sep 2015    sbt hadoop spark sbt-assembly
Spark-submit: Examples and Reference  13 Sep 2015    spark wip