The organization is a large multinational consumer electronics manufacturer.
The technology division of the marketing business unit is responsible for monitoring around 1,500 URLs used by various apps for marketing purposes. The technology environment included a Jenkins server that ran a job every 15 minutes to check on the applications' status. However, the system generated excessive false positives, wasting time and effort.
Many URLs that needed monitoring were generated dynamically, further compounding the problem. The company’s systems reliability engineering (SRE) team manually added all auto-generated URLs to the Jenkins job. This manual intervention was also prone to errors within the system.
In addition to the above, the business unit needed a centralized dashboard to provide the team with a single place to get an accurate picture of application availability.
The tools used by the organization’s SRE team include Kubernetes, Jenkins, Amazon Web Services (AWS), Vault, Consul, vCenter, NGINX Plus, NetScaler, FreeIPA, Argo CD, Spinnaker, GitHub, Jira, Confluence, and Artifactory.
In brief, the organization was looking for the following:
After extensive consultation with relevant stakeholders, our DevOps team decided to employ a comprehensive health analytics dashboard. This platform consists of several microservices developed by our engineers that can gather metrics, review Splunk logs, and produce analytics on application environment user access.
Our engineers developed a set of custom Prometheus exporters to adapt the health analytics dashboard to the organization’s systems. These exporters monitor application health and supply the information to Prometheus. Our team also developed a loader and other microservices that grabs data from Prometheus and pushes it to a PostgreSQL database.
We then implemented analytics that offers product tracking that the organization's internal teams require in each environment. The health analytics dashboard was integrated with Splunk and a custom alert script, allowing it to push accurate and timely notifications to the teams via email, Jira, Slack, and PagerDuty.
Key features of the solution include:
Key benefits the company enjoyed after project completion:
Unlock the power of the data insights buried deep within your diverse systems across the organization. We empower businesses to effectively collect, beautifully visualize, critically analyze, and intelligently interpret data to support organizational goals. Our team ensures good returns on the big data technology investments with the effective use of the latest data and analytics tools.