Enterprise Managed OpenSearch
Updated January 2023 Anyone in charge of ensuring their company’s data pipeline has the following five priorities in mind: reliability, security, speed, cost, and ownership. In this article we discuss how enterprise managed OpenSearch provides peace of mind, especially having someone to call when a cluster fails in the middle of the night. And we … Continue reading Enterprise Managed OpenSearch
OpenSearch Shard Optimization
Published September 2022 Optimizing OpenSearch for shard size is an important part of achieving maximum performance from your cluster. OpenSearch shards enable parallelization of data processing both within a single node and across multiple OpenSearch nodes. OpenSearch automatically manages the allocation of shards within the nodes. However, choosing the number of shards needed is up to … Continue reading OpenSearch Shard Optimization
Vector Search for OpenSearch
Published August 2022 OpenSearch includes a plugin for vector search. In this post, we introduce vector search and compare the different methods available. We will also point you in the right direction for example code. For personalized help, contact us to learn more about our OpenSearch support services. What is vector search? Here’s the … Continue reading Vector Search for OpenSearch
Kafka vs Pulsar
Updated January 2023 Pulsar and Kafka achieve the same result. They both guarantee messages reach their intended destination(s). Yet, there are important differences between the two message queues. These differences can make one of the technologies a better fit, depending on your use case. In this post we cover 8 ways in which Apache Kafka … Continue reading Kafka vs Pulsar
Preparing for a Cloud Outage
Published August 2022 Nearly all of our clients and a majority of companies are using the cloud for at least a portion of their infrastructure. It’s important for companies to plan for cloud outages to minimize the damage caused by them. In this post we will cover how to minimize damage and recover quickly after … Continue reading Preparing for a Cloud Outage
OpenSearch vs. Elasticsearch
Updated January 2023 With OpenSearch originating as a fork of Elasticsearch, the two can appear nearly identical to the unacquainted. However, they are distinct, and they diverge further with each new update. Here we will discuss how the two search engines compare when it comes to security, licensing, core features, documentation, community support, dashboards, … Continue reading OpenSearch vs. Elasticsearch
Elasticsearch Support Services FAQ
Published July 2022 Our team of engineers has been architecting, optimizing, and managing Elasticsearch for over 6 years. We’ve found that new clients often have the same questions about Elasticsearch support services. Below are a few of the most common questions new clients ask when they reach out. Let us … Continue reading Elasticsearch Support Services FAQ
How to Save Money on Data Storage Costs
Published July 2022 With companies revisiting their budgets to brace for a possible recession, now is the time to review your data storage costs and find places to reduce those fees without sacrificing performance. In this article we consolidate our top tips for saving money on data storage costs. From the top we want to … Continue reading How to Save Money on Data Storage Costs
Data Engineering Study
Published June 27, 2022 Data engineering is the field dedicated to building data infrastructure to ingest, process, and store large amounts of data. This is a quickly growing field, with both the number of jobs in data engineering and the number of tools on the market steadily increasing. Despite the popularity of data engineering as … Continue reading Data Engineering Study
What is a Virtual CIO?
Published June 2022 Virtual CIOs provide the leadership and expertise to build, grow, and maintain reliable data architecture. They are often hired by midsized companies that are looking for a trusted authority to drive data architecture and the supporting team. Virtual CIOs are also referred to as vCIOs, fractional CIOs, part-time CIOs, and CIOs for … Continue reading What is a Virtual CIO?
What is OpenSearch?
Updated May 2022 OpenSearch is open source search and analytics software. It’s a community-led project with Amazon Web Services (AWS) leading the development. It was first created as a fork of Elasticsearch 7.10.2 and Kibana 7.10.2 in 2021. The OpenSearch search engine is simply referred to as OpenSearch, and the dashboard tool is … Continue reading What is OpenSearch?
How to fix the MEMBER_ID Error in Kafka
Updated May 2022 If you found this post, it’s likely because you got the Kafka member_id error. Let’s first cover why the error popped up and then go through two ways to resolve it. Reason for the Kafka Member ID Error When a new consumer joins a group, it enters with the member.id set … Continue reading How to fix the MEMBER_ID Error in Kafka
Elasticsearch Basics: What it is, Licensing, Languages, and Getting Help
Updated July 2022 Elasticsearch is a distributed search and analytics engine. It is built on top of Apache Lucene. Elasticsearch was first released in 2010 by the company now known as Elastic. It was originally completely open source, but recent license changes have limited its usage. More on that below. Elasticsearch is part of a … Continue reading Elasticsearch Basics: What it is, Licensing, Languages, and Getting Help
4 Approaches to Data Backup
We outline the four primary ways of backing up data, along with their benefits and drawbacks, to help you decide which approach best meets your company’s needs.
Dashboards for Data-Driven Decision Making: As Easy as Driving Your Car
When we are driving, we are routinely making data-driven decisions using the gauges on our dashboard to guide us. Data-driven decision making should be just as easy when it comes to business.
3 Business Questions to Guide Data Collection, Storage, and Insights
With this guide, you will be able to define the business and technical requirements for your data platform, making the implementation process efficient and successful.
Open Source Tools for Data Architecture, Business Analytics, and Operational Intelligence: Tech’s Little Free Library
When designing a custom data architecture, business analytics, or operational intelligence platform for a client, four benefits of open source tools make them undoubtedly a better option in the vast majority of cases.
Improving Your Company’s Relationship With Data: Centralized Reporting for Data-Driven Decision Making
Implementing a data handling platform, whether it is a centralized reporting system, a business analytics or operational intelligence platform, or a single point of truth for your company, will improve the way you make data-driven decisions.