Introduction

The following guides are just best practices and should help you with common tasks in a azeti SONARPLEX infrastructure. Please make small changes and monitor the performance of your device, adjust step by step. A lot of the success of optimizations depends on your configuration in detail. Make sure to test changes in a test environment first before deploying to your production system. You can use the free VAA (available on www.azeti.net) for testing.

This guide relates to SONARPLEX Generation 5.

Security Optimization

  • Agent Password: Use a strong azeti Agent Password, best is to use more than 16 letters, special chars and numbers ( > Configuration > Network > Agent Configuration)
  • IP Forward: Disable "IP Forwarding" (forwarding of ip packets between the network interfaces)
  • SNMP Community: Change the default SNMP community strings, use strong strings instead of "public" ( > Configuration > Network > SNMP Configuration and Configuration > System > Status Delivery Configuration for Distributed Monitoring)
  • HTTPS: Enable HTTPS instead of unencrypted HTTP ( > Configuration > Network > HTTP Configuration), optionally change the HTTPS Port to something different than 443,444
  • SSL Certificate: Use properly (signed by a trusted CA) signed SSL certificates instead of the default self-signed azeti certificate
  • Passwords: Change the Administrator password and use a strong password ( > Configuration > Accounts > Administrator (admin))
  • Accounts: Delete all unnecessary accounts or at least disable the web and admin GUI access for unprivileged users ( > Configuration > Accounts)
  • Updates: Keep your SONARPLEX up to date, check portal.azeti.net regularly or enable Auto Updates if your SONARPLEX does have an internet connection

 

On this page:

Performance Optimization

Troubleshooting Distributed Monitoring

The logging capabilities help you to identify issues in any Distributed Monitoring setup. Besides this you can use below check list to rule out possible errors.

  1. Check the logs for information about the delivery and receipt of the status:
  2. Check the network connectivity
  3. Monitor the azeti agent availability from the sattelites to the NOC SONARPLEX

SONARPLEX Performance Metrics

Beginning with SONARPLEX OS 3.7.0a default service checks for the performance of the SONARPLEX are added by default, find them at the default host –azeti-A-. These new checks help you to identify bottlenecks and to scale up in time.

CPU & MEM

The average CPU and MEM usage over time should range below 75%.

PROCS & LOAD

A high number of concurrent processes imply a configuration issue. Often this is caused by a service check command with a high execution time (5 seconds and more) which forces other processes to wait in the queue, this effect sums up and causes a large number of processes and a high load. The load is the number of processes, which are waiting for system resources (I/O). The load should range below 7 – 10.

Service-Check-Latency

This is the amount of time between the scheduled execution time and the actual execution time. Ideally every service check has a latency of 0 seconds. Make sure to have a latency below 10 seconds, better below 5. If you see high latencies than there are too much concurrent service checks, this can be adjusted slightly by decreasing the concurrent service check number (Configuration > System > Load Configuration) but the ideal solution is to scale up with an addition SONARPLEX. Recommendation: < 5 seconds

Service-Execution-Time-Avg

The average service check execution time is a important performance metric as it helps you to identify the average cost of your checks. The smaller the execution, the more service checks can be executed per minute. A high execution time implies slow service check commands, have a look into each different service to identify the slow and costly service checks. Either try to optimize the service check plug-in or increase the service check interval to lower the overall execution time. The service check execution time should ideally range below 3-5 seconds. Recommendation: < 3-5 seconds

SONARPLEX Virtual Appliance (VAA) Sizing

The appropriate sizing of a VAA highly depends on the used service check commands, service check interval and the “cost” (execution time) of the service checks. Try to start with a small machine setup and scale it up as the load increases. Make sure to keep an eye on the most important performance metrics (service check latency and service check execution time). Below is a table with sizing recommendations depending of the number of services.

Service ChecksCPURAM
up to 100Single Core, 500 MHz512 MB
300Dual Core, 1 GHz1 GB
1000Dual Core, 2 GHz2 GB
3000Multi CPU, Multi Core, 2,5 GHZ or better4 GB