Last week I had a bit of a trial by fire:
“Here’s a 7 node, Hortonworks Hadoop cluster, metrics is broken, fix it, go!”
The initial indication that metrics was broken was apparent in the Services tab for Ambari Metrics. Here it showed that there was an error and that Metrics Collector was Stopped. The error however wasn’t very informative:
Connection failed: [Errno 111] Connection refused…
That didn’t tell me much at all, and neither did googling.
(I hope the title of this blog helps someone else find this solution quicker.)
Jon includes the answer and some additional helpful details. Check it out.