While troubleshooting a performance issue, we had a need to graph the number of seconds between full collections. We used what is below. This assumes you have added -XX:+PrintGCDetails and -XX:+PrintGCDateStamps as well as a file location for -Xloggc: to…
Hive performance parser
With data in the hiveserver2.log file, this awk scriptlet prints the timestamp, SQL, and seconds to run. There is an issue where the parser thread hands off to the executor, and you can’t always tie the two together. However, at a…
Oracle SQL Developer and Apache Hive – Never the twain shall meet
If you have come trying to find out why you can’t connect when you know you haven’t fat fingered your password, take heart, you aren’t going crazy. You likely used instructions in the following post to connect SQL Developer to…
Deleting a service in ambari
Just a simple method to delete a service in Ambari using the REST API… [root@ambari2012 ~]# curl -u admin:admin -H “X-Requested-By: ambari” -X DELETE http://192.168.56.121:8080/api/v1/clusters/ambari2012/services/RANGER [root@ambari2012 ~]# curl -u admin:admin -H “X-Requested-By: ambari” -X DELETE http://192.168.56.121:8080/api/v1/clusters/ambari2012/services/RANGER { “status” :…
Troubleshooting oozie browser connection to secure cluster – Part 1
This is really a generic listing of using the spnego open source software to access a network service that requires authentication. In our case, it was useful to troubleshoot our inability to connect to the oozie web UI. We still…
Unable to download to any storage directory
We had an issue in which the secondary namenode was not checkpointing the edits file with updates from the primary namenode. In case you are unaware, the secondary namenode (not in an HA configuration) regularly transfers the edits file from…