bigdata - Apache Spark - Map Reduce Debugging Tips -

April 15, 2012

i trying implement kafka - spark environment. trying debug mapreduce logic (implemented in java). spark submit step making complicated debug break points in algorithms. incoming live data patterns complex. time consuming process simulate complex algorithms. better development environment developers validate map reduce logic on live stream data.

please suggest tips , tricks. possible have ide breakpoints, or remote debugging apache-spark.

i don't think matters if developing streaming or batch spark application. can use intellij idea graphical debugging of application.

also @ video: https://www.youtube.com/watch?v=kkog_aj9kjq. in end if want debug reactivity of application according data patterns suggest use spark's internal tools see example how dag being created or how it's being distributed.

one thing myself working on use spark debugging tools, , build log according application execution graph added information profilers (usual os tools iotop or iostat) find example not utilizing resources enough.

in end need these information make decision , ironically can become data intensive application.

Search This Blog

Call

bigdata - Apache Spark - Map Reduce Debugging Tips -

Comments

Post a Comment

Popular posts from this blog

node.js - Using Node without global install -

php - CakePHP HttpSockets send array of paramms -

angularjs - ADAL JS Angular- WebAPI add a new role claim to the token -