Client: ideaForge
Industry: Designing and manufacturing drones for mapping, security, and surveillance applications
AWS Services Used: AWS CloudWatch, AWS CloudTrail, AWS Lambda, AWS SNS, AWS EventBridge
The previous infrastructure lacked comprehensive monitoring and logging capabilities, resulting in difficulties in tracking application performance, identifying security issues, and maintaining compliance. This led to delayed incident response, manual monitoring, limited insight into changes, and difficulty diagnosing performance issues. To overcome these obstacles we prioritized the following objectives:
Implement AWS CloudWatch to provide real-time monitoring of the infrastructure and applications.
Implement AWS CloudTrail to log all API activities and track user actions for security and compliance.
Use the insights from monitoring and logging to optimize the performance of the infrastructure and applications.
Enhance the security posture by tracking and analysing access and activity logs.
Enable faster and more efficient troubleshooting by providing detailed logs and metrics.
Alarms were configured with thresholds at different levels for various servers: one alarm was set to trigger at greater than 90%, another at greater than 80%, and a third at 50%. This tiered approach allows for proactive management of server load and helps prevent potential performance degradation.
Alarms were established with thresholds at greater than 90% and greater than 80%. These alarms enable timely identification and resolution of memory-related issues, ensuring smooth operation of applications and services.
Root disk utilization alarms were set with thresholds at greater than 90% and greater than 80%. This ensures that disk usage is kept in check, preventing storage-related disruptions.
Additionally, alarms for HTTP errors were configured to monitor the health of web services. An alarm was set for 4XX errors with a threshold of 50 errors, and another for 5XX errors with a threshold of 10 errors. These alarms help quickly identify and address client-side and server-side issues, respectively.
EC2 Instance State Change Notification Automation using AWS CloudTrail API
We have implemented a sophisticated automation solution using Amazon EventBridge, AWS Lambda, and Amazon SNS. This setup ensures that any changes in the state of EC2 instances such as starting, stopping, or terminating—are promptly communicated to the relevant stakeholders via email.
The implementation of AWS CloudWatch and CloudTrail by Galaxy for ideaForge has greatly improved system monitoring, security, and operational efficiency. Real-time metrics, custom logs, and automated alerts now ensure high availability and optimal performance for ideaForge’s drone systems. Enhanced threat detection and compliance, coupled with reduced downtime and faster troubleshooting, showcase the effectiveness of these AWS solutions in maintaining robust IT infrastructures. This project highlights the value of comprehensive monitoring and logging in achieving superior system performance and security.