IT Infrastructure Audit and Optimization: Best Practices and Key Benefits

What is an IT infrastructure?

  • Operational Excellence
  • Security
  • Reliability
  • Performance Efficiency
  • Cost Optimization
  • Sustainability.

What type of IT infrastructure is optimal?

  • On-premises infrastructure is configured on local servers that the owner can physically access. Scalability is limited by the hardware's capabilities. Maintenance requires an in-house team of software engineers who know the setup well; otherwise, finding qualified support services becomes harder, slower, and more costly.
  • Cloud infrastructure is configured in a cloud environment, so the owner has no physical access to the servers. In return, it offers virtually unlimited scaling potential and fewer support issues, thanks to automated infrastructure monitoring and a large pool of professionals familiar with cloud platforms.
  • governance rules require all documents to be stored only in on-premises storage
  • only specific people may have access to the underlying hardware
  • there are specific hardware requirements that ensure all data on that particular hardware is encrypted.

Why is IT infrastructure optimization important?

  • a low-performance infrastructure which can’t scale and recover from disasters
  • a large number of provisioned servers and a high cloud bill at the end of the month.

What is an IT infrastructure audit?

  • the operations team
  • network operation center (NOC) engineers
  • DevOps engineers
  • basically, anyone involved in infrastructure configuration or monitoring.
  • After a new service launch, monitor its behavior: check how many resources it requires and whether it produces any errors
  • Check CPU, RAM, and disk performance on each server
  • Check the peak values of the server loads
  • Analyze the weekly incoming traffic and look for anomalies
  • Check application logs for errors and for unneeded stored data; pay attention to the log volume
  • Analyze the daily infrastructure cost throughout the week, note any increases and their causes, and configure daily and monthly budget alarms
  • Pay attention to security
  • Check the number of users added to AWS and their access rights (we follow the principle of least privilege, which means users have only the permissions they need)
  • Check whether the users have enabled multi-factor authentication (MFA)
  • Check the number of open ports on the servers, and make sure the SSH port is accessible only from a few IP addresses
  • Analyze average server load to understand if you need to increase the number of servers, or if the cluster has enough resources for now
  • Make a generalized IT infrastructure cost estimate and suggest ways to reduce expenses. This can usually be achieved by reserving Amazon servers for a 1-year or 3-year term (the 3-year reserved price is the lowest)
  • Determine the average and peak monthly traffic values.
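
Two of the checks above, the server load analysis and the SSH exposure check, can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the thresholds, the sample values, and the shape of the firewall rules (a `port` and a `cidr` field) are hypothetical, and in a real audit the data would come from your monitoring and cloud provider tooling.

```python
from ipaddress import ip_network
from statistics import mean

# Hypothetical thresholds; tune them to the application's real requirements.
AVG_LOAD_LIMIT = 70.0   # percent CPU, averaged over the period
PEAK_LOAD_LIMIT = 90.0  # percent CPU, highest single sample


def summarize_load(samples):
    """Return average and peak CPU load plus a hint about adding servers."""
    avg, peak = mean(samples), max(samples)
    return {
        "average": round(avg, 1),
        "peak": peak,
        "needs_more_servers": avg > AVG_LOAD_LIMIT or peak > PEAK_LOAD_LIMIT,
    }


def ssh_open_too_widely(rules, max_addresses=16):
    """Return the SSH (port 22) rules whose CIDR range covers too many addresses."""
    return [
        rule for rule in rules
        if rule["port"] == 22
        and ip_network(rule["cidr"]).num_addresses > max_addresses
    ]


if __name__ == "__main__":
    # Made-up daily CPU samples for one server over a week.
    weekly_cpu = [35.0, 42.5, 38.0, 91.0, 40.0, 36.5, 39.0]
    print(summarize_load(weekly_cpu))

    # Made-up firewall rules; the second one exposes SSH to the whole internet.
    firewall_rules = [
        {"port": 22, "cidr": "203.0.113.10/32"},
        {"port": 22, "cidr": "0.0.0.0/0"},
        {"port": 443, "cidr": "0.0.0.0/0"},
    ]
    print(ssh_open_too_widely(firewall_rules))
```

Checking both the average and the peak matters because a cluster can look healthy on average while a single daily spike already exceeds what the servers can handle.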

Prioritization: what needs to be optimized first?

  • lack of resources for the app to perform properly
  • errors which can cause an application outage, etc.
  • cost optimization
  • scalability and performance optimization
  • or both options.

How can IT infrastructure costs be optimized?

  1. Made dev environments less performant. How far performance can be lowered without affecting the UX depends strongly on the application requirements; e.g., applications with active image processing need more CPU and RAM resources.
  2. Disabled autoscaling.
  3. Configured development environments in only one Availability Zone, with no replication.
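
The three steps above can be pictured as deriving a development configuration from the production one. The sketch below is a hypothetical model, not the configuration format of any real tool: the `EnvironmentConfig` class, its field names, and the instance type values are assumptions chosen for illustration.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class EnvironmentConfig:
    # All field names and values are hypothetical, for illustration only.
    instance_type: str
    autoscaling_enabled: bool
    availability_zones: tuple
    replicas: int


def to_dev_config(prod, dev_instance_type):
    """Derive a cheaper development config from a production config."""
    return replace(
        prod,
        instance_type=dev_instance_type,                  # step 1: less performant servers
        autoscaling_enabled=False,                        # step 2: autoscaling disabled
        availability_zones=prod.availability_zones[:1],   # step 3: a single Availability Zone
        replicas=1,                                       # step 3: no replication
    )


if __name__ == "__main__":
    prod = EnvironmentConfig(
        instance_type="m5.xlarge",
        autoscaling_enabled=True,
        availability_zones=("eu-west-1a", "eu-west-1b", "eu-west-1c"),
        replicas=3,
    )
    print(to_dev_config(prod, dev_instance_type="t3.small"))
```

Keeping the development settings as an explicit transformation of the production settings makes it easy to see which savings were applied and to revisit them if an application turns out to need more resources.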

How to increase app scalability?

  • uses an independent storage service (e.g., S3 for static files) and does not keep any files it needs on local disk while handling the workload
  • is hosted across different Availability Zones (independent data centers), so if one data center goes down, the application is automatically restored in another one and recovers quickly from failures.
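
As a rough way to check the second point, the sketch below counts how many instances would remain if the busiest Availability Zone went down. The instance records and their `zone` field are hypothetical; a real check would read this data from the cloud provider's inventory.

```python
from collections import Counter


def instances_per_zone(instances):
    """Count the instances running in each Availability Zone."""
    return Counter(instance["zone"] for instance in instances)


def worst_case_remaining(instances):
    """Return how many instances survive if the most populated zone fails."""
    zones = instances_per_zone(instances)
    if not zones:
        return 0
    return len(instances) - max(zones.values())


if __name__ == "__main__":
    # Made-up inventory: two instances in one zone, one in another.
    inventory = [
        {"id": "app-1", "zone": "eu-west-1a"},
        {"id": "app-2", "zone": "eu-west-1a"},
        {"id": "app-3", "zone": "eu-west-1b"},
    ]
    print(instances_per_zone(inventory))
    print(worst_case_remaining(inventory))
```

A result of zero means the whole application lives in a single zone, so an outage there would take it down until it is restored elsewhere.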

What has changed for Pro.Con app after its IT infrastructure migration to AWS?

Apiko

Apiko is a software development company that enters markets with digital businesses, using a solid process and clever strategies.