bigdata - Apache Spark - Map Reduce Debugging Tips -


i trying implement kafka - spark environment. trying debug mapreduce logic (implemented in java). spark submit step making complicated debug break points in algorithms. incoming live data patterns complex. time consuming process simulate complex algorithms. better development environment developers validate map reduce logic on live stream data.

please suggest tips , tricks. possible have ide breakpoints, or remote debugging apache-spark.

i don't think matters if developing streaming or batch spark application. can use intellij idea graphical debugging of application.

also @ video: https://www.youtube.com/watch?v=kkog_aj9kjq. in end if want debug reactivity of application according data patterns suggest use spark's internal tools see example how dag being created or how it's being distributed.

one thing myself working on use spark debugging tools, , build log according application execution graph added information profilers (usual os tools iotop or iostat) find example not utilizing resources enough.

in end need these information make decision , ironically can become data intensive application.


Comments

Popular posts from this blog

angularjs - ADAL JS Angular- WebAPI add a new role claim to the token -

php - CakePHP HttpSockets send array of paramms -

node.js - Using Node without global install -