Originally published at The New Stack

Most log management solutions store log data in a database and enable search by maintaining an index of that data. As the database grows, so does the cost of managing the index. At a small scale, this isn’t a problem. But in large-scale deployments, organizations end up spending significant compute, storage and human resources just to manage their indexes, on top of the data itself. Once a company is handling terabytes of data every day, the database-backed log management system becomes untenable.

Another common issue is that most log solutions don’t store just one copy of the data. Many DIY log management implementations use popular databases such as MongoDB, Elasticsearch and Cassandra. Take Elasticsearch as an example. An Elasticsearch cluster keeps several replicas of the data in the hot storage tier to ensure high availability. Even with compression, the replication required to keep the data available dramatically increases the total amount of storage needed. The problem is magnified when you account for the storage needed for indexes.

Clustering also increases management complexity and requires users to understand how to handle node failures and data recovery. Even with replication, it is impossible to immediately spin up a new instance when one goes down; in most cases there is some downtime during which the log analytics system is unavailable. While this happens, data keeps arriving, because logs are generated in real time, and catching up requires provisioning additional resources. Because the stream never stops, it can be hard to get the log analytics system caught up at all. One-click elasticity is critical to managing this at scale.

The challenges outlined above are classic examples of the hidden “storage operations tax” that any DIY solution has to pay. The larger the scale, the higher the tax! A company ingesting around one terabyte of data per day would need multiple terabytes of storage, and a proportional amount of RAM, to keep 30 days’ worth of log data searchable.
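As a rough back-of-the-envelope sketch, the sizing adds up quickly. The ingest rate comes from the example above; the compression ratio, replication factor and index overhead are assumptions for illustration, not measurements:

```python
# Back-of-the-envelope sizing for a database-backed log store.
# All inputs are illustrative assumptions, not benchmarks.
daily_ingest_tb = 1.0      # raw log volume per day (from the example above)
retention_days = 30        # how long data must stay searchable
compression_ratio = 0.5    # stored size relative to raw size after compression
replication_factor = 2     # copies kept in the hot tier for high availability
index_overhead = 0.3       # index size as a fraction of the stored data

stored_tb = daily_ingest_tb * retention_days * compression_ratio
provisioned_tb = stored_tb * replication_factor * (1 + index_overhead)

print(f"Raw ingest over the retention window: {daily_ingest_tb * retention_days:.0f} TB")
print(f"Hot-tier storage actually provisioned: {provisioned_tb:.1f} TB")
```

Even with generous compression, the replicas and indexes push the provisioned footprint well past the raw volume, and the RAM needed to keep those indexes fast grows along the same curve.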

The way to solve this problem is to move away from databases and use a scalable API storage layer. An API storage layer like Amazon Web Services’ S3, which has traditionally been used for cold storage, fits this requirement well. It provides high availability and durability, effectively infinite scale and the lowest price per GB, and it takes your storage operations tax to essentially zero. However, to make this work, applications must not suffer the higher latency typically associated with cold storage.
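To make the “API storage layer” idea concrete, here is a minimal sketch, assuming the boto3 client and a hypothetical bucket named my-log-archive, that writes a compressed chunk of log lines to S3 and reads it back. Everything is an API call; there are no nodes, replicas or index files to operate:

```python
import gzip
import boto3  # AWS SDK for Python; also works with S3-compatible endpoints

s3 = boto3.client("s3")    # credentials and region come from the environment
BUCKET = "my-log-archive"  # hypothetical bucket name

# Write a compressed chunk of log lines as a single immutable object.
chunk = b"2024-05-01T12:00:01Z app=checkout level=error msg=timeout\n"
key = "logs/2024/05/01/120000.log.gz"
s3.put_object(Bucket=BUCKET, Key=key, Body=gzip.compress(chunk))

# Read it back later; durability, availability and scaling are S3's problem.
obj = s3.get_object(Bucket=BUCKET, Key=key)
print(gzip.decompress(obj["Body"].read()).decode())
```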

Are You Keeping 30 Days’ Worth of Data?

Enterprises think they are keeping 30 days’ worth of log data in hot storage, but in practice they usually aren’t. Most queries take the form of periodically run reports, not interactive sessions with a user sitting at the console. This is especially true at scale, where it is not uncommon to ingest hundreds of megabytes, or even gigabytes, of log data per minute. In such environments, interactive workflows focus on identifying relevant events and data patterns, which are then automated into timely, real-time notifications to the administrator. This means most data does not need to be in hot storage at all; it can be processed in-line during ingest or asynchronously at a later point.
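As a sketch of what “processed in-line during ingest” can look like (the patterns and the notification hook below are hypothetical), each log line is evaluated once as it arrives instead of being re-scanned later from 30 days of hot storage:

```python
import re

# Hypothetical patterns an operator identified interactively, then automated.
ALERT_PATTERNS = {
    "payment_failure": re.compile(r"level=error .*service=payments"),
    "auth_spike": re.compile(r"status=401"),
}

def notify(rule: str, line: str) -> None:
    """Stand-in for a real pager or chat integration."""
    print(f"[ALERT:{rule}] {line.strip()}")

def process_at_ingest(line: str) -> None:
    # Evaluate the line once, at ingest time; nothing needs to stay hot
    # just so a scheduled report can scan it again later.
    for rule, pattern in ALERT_PATTERNS.items():
        if pattern.search(line):
            notify(rule, line)

process_at_ingest("2024-05-01T12:00:02Z level=error service=payments msg=card_declined")
```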

There’s another good reason companies move data into S3-compatible or other cold storage quickly. Shortening how long data stays in the database separates storage from compute, which makes it easier for organizations to scale their storage and to recover from crashed clusters. Storing data in cold storage is dramatically cheaper than storing it in a database, and scaling cold storage is easier than scaling a database.

This approach, however, creates a new problem: data now has to be separated into multiple tiers, hot and cold. Moving and managing data between the two tiers requires expertise. Decisions about what to tier, how often to move data and when to rehydrate the hot tier with data from the cold tier become business as usual. The “storage operations tax” just went up.
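To give a flavor of what that tax looks like in practice, here is a sketch loosely modeled on an Elasticsearch index lifecycle policy; the phase names and thresholds are illustrative, and every one of them is a knob someone has to pick, monitor and revisit:

```python
# Illustrative lifecycle policy for a DIY hot/cold deployment (values are
# assumptions, not recommendations). Rehydrating cold data for an unplanned
# investigation is a separate, manual workflow on top of this.
lifecycle_policy = {
    "policy": {
        "phases": {
            "hot": {
                "actions": {"rollover": {"max_age": "1d", "max_size": "50gb"}},
            },
            "cold": {
                "min_age": "7d",
                # Move shards to cheaper nodes; searches against them get slower.
                "actions": {"allocate": {"require": {"data": "cold"}}},
            },
            "delete": {
                "min_age": "30d",
                "actions": {"delete": {}},
            },
        }
    }
}
```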

What If I Need Long-Term Data Retention?

In highly regulated environments, short-term retention is usually not an option: businesses must store data, index it and keep it searchable for several years. The same problems exist, just at an even larger scale. The choice is between vast amounts of expensive primary storage and a tiered storage architecture. With such requirements, it is not uncommon to see a tiered implementation in which most of the data sits in the cold tier but a significant amount still lives in the hot tier (e.g., 30 days’ worth). The “storage operations tax” isn’t going anywhere; it’s only increasing.

Eliminating Legacy Storage Architecture and Data Tiering

Companies use a tiered approach to storage because they fear losing the ability to search data once it lands in cold storage. When a search is necessary, an arduous request process makes accessing the logs slow and challenging, and running real-time searches on older data is impossible. For some application types, this isn’t a big deal. But for revenue-producing, critical-path applications, it’s crucial to have quick, real-time access to logs and the ability to pull information out of them at a moment’s notice. Having multiple data tiers, a “hot” store and a “cold” store, creates cost and management overhead, particularly for Day 2 operations. Moving everything to a hot store would be extremely expensive, so what if you could make cold storage your primary store?

Making S3 Searchable, or ‘Zero Storage Operations Tax’

What if we could make S3-compatible storage just as searchable as a database? The reason companies keep log data in a database is to enable real-time search, yet in practice most organizations are not keeping nearly as much historical data in databases as their official data retention policies dictate. If any S3-compatible store can be just as searchable as a database, organizations can dramatically cut down the amount of data stored in databases and the computing resources needed to manage it. The most recent data, say the last minute’s worth, can be stored on disk, and after that everything moves to S3. There is no longer a need to run multiple instances of a database for high availability, because if the cluster goes down, a new one can be spun up and pointed at the same S3-compatible bucket.
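A minimal sketch of that write path, again assuming boto3 and a hypothetical my-log-archive bucket: roughly a minute of recent data sits in a local buffer, gets flushed as a time-keyed object, and a replacement instance recovers simply by listing the same bucket rather than replaying replicas:

```python
import gzip
import time
import boto3

s3 = boto3.client("s3")
BUCKET = "my-log-archive"  # hypothetical; every instance points at the same bucket

buffer: list[bytes] = []   # holds roughly the last minute of data

def ingest(line: bytes) -> None:
    buffer.append(line)

def flush() -> None:
    """Called about once a minute: the buffer becomes an immutable S3 object."""
    if not buffer:
        return
    key = time.strftime("logs/%Y/%m/%d/%H%M.log.gz", time.gmtime())
    s3.put_object(Bucket=BUCKET, Key=key, Body=gzip.compress(b"".join(buffer)))
    buffer.clear()

def recover() -> list[str]:
    """A freshly started instance needs no replica: it just lists the bucket."""
    resp = s3.list_objects_v2(Bucket=BUCKET, Prefix="logs/")
    return [obj["Key"] for obj in resp.get("Contents", [])]
```

Both the flush on the write path and the recovery on failover are plain object-store calls, which is what removes the need to keep a standby database replica running.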

Moving log data directly to cold storage while keeping it searchable in real time makes the system easier to scale, increases the log data’s availability and dramatically decreases costs, for both storage and compute. When log data is accessed directly in cold storage, users don’t have to manage indexes across hot and cold tiers, rehydrate data or build complex tiering policies. It also means companies can actually follow their data retention plans and ensure developers can access logs and use them to debug critical applications.