Published May 2023 Small, medium, and large OpenSearch clusters require different approaches for optimization. Dattell’s engineers are expert at designing, optimizing, and maintaining OpenSearch. Find out more about our OpenSearch support services. Optimizing a Small OpenSearch Cluster The minimum number of nodes for a small, highly available OpenSearch cluster is three (3). Three nodes might … Continue reading OpenSearch Cluster Optimization for Small, Medium, and Large Clusters
Top 10 Reasons to Use Open Source Software
Published April 2023 Dattell specializes in support for open source software: OpenSearch, Kafka, and Pulsar. The benefits of using open source software are vast. Learn about the top 10 reasons companies choose open source software. #1 Lower Costs Open source software is often available for free, allowing companies to save on software licensing fees. For … Continue reading Top 10 Reasons to Use Open Source Software
Top 10 Reasons to Unify Data Into a Single Data Store
Published March 2023 Often companies find themselves with data stored in disparate data stores. The use of legacy systems and subdivisions within a company using their own systems are two common reasons for this scattered data storage. In this article we explain 10 reasons why it is important and beneficial for companies to unify data … Continue reading Top 10 Reasons to Unify Data Into a Single Data Store
Enterprise Managed OpenSearch
Updated January 2023 Anyone in charge of ensuring their company’s data pipeline has the following five priorities in mind: reliability, security, speed, cost, and ownership. In this article we discuss how enterprise managed OpenSearch provides peace of mind, especially having someone to call when a cluster fails in the middle of the night. And we … Continue reading Enterprise Managed OpenSearch
OpenSearch Shard Optimization
Published September 2022 Optimizing OpenSearch for shard size is an important component for achieving maximum performance from your cluster. OpenSearch shards enable parallelization of data processing across both a single node and multiple OpenSearch nodes. OpenSearch automatically manages the allocation of shards within the nodes. However, choosing the number of shards needed is up to … Continue reading OpenSearch Shard Optimization
Vector Search for OpenSearch
Published August 2022 OpenSearch includes a plugin for vector search. In this post, we introduce vector search and compare the different methods available. We will also point you in the right direction for example code. For personalized help, contact us to learn more about our OpenSearch support services. What is vector search? Here’s the … Continue reading Vector Search for OpenSearch
Kafka vs Pulsar
Updated January 2023 Pulsar and Kafka achieve the same result. They both guarantee messages reach their intended destination(s). Yet, there are important differences between the two message queues. These differences can make one of the technologies a better fit, depending on your use case. In this post we cover 8 ways in which Apache Kafka … Continue reading Kafka vs Pulsar
Preparing for a Cloud Outage
Published August 2022 Nearly all of our clients and a majority of companies are using the cloud for at least a portion of their infrastructure. It’s important for companies to plan for cloud outages to minimize the damage caused by them. In this post we will cover how to minimize damage and recover quickly after … Continue reading Preparing for a Cloud Outage
OpenSearch vs. Elasticsearch
Updated January 2023 With OpenSearch originating as a fork from Elasticsearch, the two databases can appear to be near-identical to the unacquainted. However, they are unique, becoming more so with each new update. Here we will discuss how the two search engines compare when it comes to security, licensing, core features, documentation, community support, dashboards, … Continue reading OpenSearch vs. Elasticsearch
Elasticsearch Support Services FAQ
Published July 2022 Our team of engineers has been architecting, optimizing, and managing Elasticsearch for over 6 years. We’ve found that there are common questions that new clients have about Elasticsearch support services. Below is a list of a few of the most common questions inquiring new clients have when they reach out. Let us … Continue reading Elasticsearch Support Services FAQ
How to Save Money on Data Storage Costs
Published July 2022 With companies revisiting their budgets to brace for a possible recession, now is the time to review your data storage costs and find places to reduce those fees without sacrificing performance. In this article we consolidate our top tips for saving money on data storage costs. From the top we want to … Continue reading How to Save Money on Data Storage Costs
Data Engineering Study
Published June 27, 2022 Data engineering is the field dedicated to building data infrastructure to ingest, process, and store large amounts of data. This is a quickly growing field, with both the number of jobs in data engineering and the number of tools on the market steadily increasing. Despite the popularity of data engineering as … Continue reading Data Engineering Study
What is a Virtual CIO?
Published June 2022 Virtual CIOs provide the leadership and expertise to build, grow, and maintain reliable data architecture. They are often hired by midsized companies that are looking for a trusted authority to drive data architecture and the supporting team. Virtual CIOs are also referred to as vCIOs, fractional CIOs, part-time CIOs, and CIOs for … Continue reading What is a Virtual CIO?
What is OpenSearch?
Updated May 2022 OpenSearch is an open source search and analytics software. It’s a community led project with Amazon Web Services (AWS) leading the development. It was first created as a fork from Elasticsearch 7.10.2 and Kibana 7.10.2 in 2021. The OpenSearch search engine is simply referred to as OpenSearch, and the dashboard tool is … Continue reading What is OpenSearch?
How to fix the MEMBER_ID Error in Kafka
Updated May 2022 If you found this post it’s likely because you got the Kafka member_id error. Let’s first cover why the error popped up and then go through two ways to resolve the error. Reason for the Kafka Member ID Error When a new consumer joins a group it enters with the member.id set … Continue reading How to fix the MEMBER_ID Error in Kafka
Elasticsearch Basics: What it is, Licensing, Languages, and Getting Help
Updated March 2023 Elasticsearch is a distributed search and analytics engine. It is built on top of Apache Lucene. Elasticsearch was first released in 2010 by the company now known as Elastic. It was originally completely open source, but license changes have limited its usage. More on that below. Elasticsearch is part of a group … Continue reading Elasticsearch Basics: What it is, Licensing, Languages, and Getting Help
4 Approaches to Data Backup
We outlined the four primary ways for backing up data and their benefits and drawbacks to help you decide on which approach best meets your company’s needs.
Dashboards for Data-Driven Decision Making: As Easy as Driving Your Car
When we are driving, we are routinely making data-driven decisions using the gauges on our dashboard to guide us. Data-driven decision making should be just as easy when it comes to business.
3 Business Questions to Guide Data Collection, Storage, and Insights
With this guide, you will be able to define the business and technical requirements for your data platform, making the implementation process efficient and successful.
Open Source Tools for Data Architecture, Business Analytics, and Operational Intelligence: Tech’s Little Free Library
When designing a custom data architecture, business analytics, or operational intelligence platform for a client, four benefits of open source tools make them undoubtedly a better option in the vast majority of cases.
Improving Your Company’s Relationship With Data: Centralized Reporting for Data-Driven Decision Making
The implementation of a data handling platform, whether it is a centralized reporting system, Business Analytics, Operational Intelligence, or single point of truth for your company, will improve the way you make data-driven decisions.