Install the Amazon ElastiCache Redis ULM App and view the Dashboards
This page has instructions for installing the Amazon ElastiCache Redis ULM App, as well as providing examples and descriptions for each of the app dashboards. Click a link to jump to a topic.
- Install the app
- Dashboard filters
- Event Overview Dashboard
- Event Details Dashboard
- Performance Overview Dashboard
- Performance by Cluster Dashboard
- Performance by Host Dashboard
- Command Stats Dashboard
- Notifications Dashboard
Install the app
Now that you have set up log and metric collection for Amazon ElastiCache Redis, you can install the Sumo Logic App for Amazon ElastiCache Redis ULM and use its pre-configured searches and dashboards.
To install the app, do the following:
Dashboard filters
Each dashboard has a set of filters that you can apply to the entire dashboard, as shown in the following example. Click the funnel icon in the top dashboard menu bar to display a scrollable list of filters that narrow search results across the entire dashboard.
Each panel has a set of filters that are applied to the results for that panel only, as shown in the following example. Click the funnel icon in the top panel menu bar to display a list of panel-specific filters.
Event Overview Dashboard
Amazon ElastiCache Redis ULM - Event Overview Dashboard provides at-a-glance analytics for the locations of successful and failed events, event status trends and details, user activity, and failed event error codes and details.
Use this dashboard to:
- Get the location of successful and or failed activities to check if its within compliance zone.
- Get an overview of event trends to identify differences compared to typical patterns.
- Review users and type of authentication method being used.
- Review the top Replication Group IDs.
- Keep watch on the reasons for failed activities so you can take corrective actions.
Event Details Dashboard
Amazon ElastiCache Redis ULM - Event Details Dashboard provides analytics for creating, adding, and increasing ElastiCache events, as well as modifying, deleting, removing, and decreasing ElastiCache events.
Use this dashboard to:
- Review details about various cluster, shard, node related activities such as creation, modification (increase, decrease in numbers of shards or nodes in a cluster), deletion, removal and reboot of nodes. The improper configuration of clusters, shards, or nodes may have an adverse impact.
- Identify details about the ElastiCache specific events to decide next step.
Performance Overview Dashboard
Amazon ElastiCache Redis ULM - Performance Overview Dashboard provides at-a-glance metrics for cache activity and outliers, network connections, resource utilization, network traffic, performance, and performance comparisons over time.
Use this dashboard to:
- Monitor cache hit rate, maximum swap space used, number of current client connections, replication lag in seconds, and the maximum number of keys and items in cache.
- Monitor the current network throughput (bytes and packets) and view a comparison with yesterday's throughput.
- Monitor outliers for cache hits and misses. A negative outlier for cache hit is a bad signal, and a positive outlier for cache miss is a bad signal
- Monitor the cache hit ratio. The higher the cache hit ratio the better the performance.
Performance by Cluster Dashboard
Amazon ElastiCache Redis ULM - Performance Cluster Dashboard provides at-a-glance analytics for overall cluster statistics, persistence, replication, CPU utilization, memory allocation, and client connections.
Use this dashboard to:
- Monitor the number of current client connections and connections received during a specific time. You may want to investigate a sudden rise in connections or an inconsistent increase in connections.
- Monitor keys evictions. Determine a threshold based on application needs. An increase in eviction means there is not enough space, hence older items are removed resulting in a high latency for user requests.
- Monitor read-only lookups performed with cache hits and cache misses. A higher number of cache hits means better performance.
- Monitor the Redis process CPU utilization to gain better insights into your Redis workloads.
- Monitor bytes being replicated and the replication lags in seconds. A lower lag means better performance.
- Monitor how the background save process is working during snapshots and syncs. These operations can cause degraded performance. The SaveInProgress metric allows you to diagnose whether or not degraded performance was caused by a background save process.
- Monitor the total number of bytes allocated by Redis for all purposes, including the dataset, buffers, etc.
- Monitor the number of value reallocations per minute performed by the active defragmentation process.
Performance by Host Dashboard
Amazon ElastiCache Redis ULM - Performance by Host Dashboard provides at-a-glance analytics for host CPU and memory utilization, and network performance metrics.
Use this dashboard to:
- Monitor host level metrics for individual cache nodes in the cluster
- Monitor CPU Utilization. Generally speaking, you can set your threshold at 90% of your available CPU bandwidth. Because Redis is single-threaded, the actual threshold value should be calculated as a fraction of the node's total capacity.
- Monitor memory available on a node.
- Monitor SwapUsage. This metric should not exceed 50 MB.
- Monitor network throughput for bytes and packet (in and out).
Command Stats Dashboard
The Amazon ElastiCache Redis ULM - Command Stats Dashboard provides high-level visual analytics for current items in cache and commands used, such as Get, Set, string, list, hash, key, sorted Set, hyperlog, and stream based commands.
Use this dashboard to:
- Monitor the number of items in cache. Heavily used items are kept in cache to improve application performance.
- Monitor read-only (Get), write-only (Set) type of commands executed on a redis cluster.
- Monitor various data type commands executed on a redis cluster. Redis supports a variety of data structures to meet different application needs.
Notifications Dashboard
The Amazon ElastiCache Redis ULM - Notifications Dashboard provides an overview of event statistics for Amazon ElastiCache Notification with SNS. This dashboard gives you an at-a-glance event data for the top active nodes in the cluster, the number of successful and failed events, top events, recent events, and events status trends.
Use this dashboard to:
- Dig further into ElastiCache Event Audits, CloudTrail Event Overview, and CloudTrail Event Details for more granular analytics.
- Monitor events and their frequency to be able to quickly identify issues as they are arise.
- Review event trends and status, to identify when something is out of the ordinary.
- Look for the top active nodes and clusters.
- Keep watch on failed activities so you can take corrective actions as needed.