Blog

Previous: Packaging Containers Using Linux Subsystem for Windows
Azure Search vs. AWS CloudSearch vs. AWS Elasticsearch vs. Self-Hosted Elasticsearch

Azure Search vs. AWS CloudSearch vs. AWS Elasticsearch vs. Self-Hosted Elasticsearch

Several of Allcloud’s projects this year have focused on enterprise search and integration in the cloud.

When choosing a back-end search platform, the general requirements are common: the ability to ingest a lot of content, query its full text quickly with flexible options, and secure the data in transit and at rest.

Sometimes you plan to write all of the crawling and indexing logic yourself. Other times, you may prefer a pre-built connector to a storage tier. The amount of data in your search index, the granularity of that data, and your management requirements may vary wildly.

To help make decisions between cloud search platforms, I’ve organized common decision criteria into a simple table:

Factor (Last Assessed) Azure Search AWS CloudSearch AWS Elasticsearch Elasticsearch (Self-Hosted)
Homepage Azure Search AWS CloudSearch AWS Elasticsearch Elasticsearch
Underlying search platform Elasticsearch Solr Elasticsearch Elasticsearch
Disposition Platform-as-a-Service (PaaS) Platform-as-a-Service (PaaS) Managed Infrastructure-as-a-Service (IaaS) Infrastructure-as-a-Service (IaaS)
Built-in Data Source Connectors Azure SQL, Azure Cosmos DB, Azure BLOB Storage AWS DynamoDB, AWS S3 AWS CloudWatch, AWS DynamoDB, AWS Kinesis, AWS Kinesis Firehose, AWS S3 Many, with plugins
Built-in File Formats Supported CSV, EML, Microsoft Office (DOC/DOCX, XLS/XLSX, PPT/PPTX, MSG), HTML, JSON, PDF, RTF, TXT, XML, ZIP CSV, JSON, XML JSON Many, with plugins
Languages Supported 56 35 33 33
Plugins Supported No No Yes, some Yes, many
Application Programming Interfaces REST APIs and many SDKs REST APIs and many SDKs REST APIs and many SDKs REST APIs and many SDKs
Maximum Number of Instances 12 (36 with replicas) 50 20 No limit
Maximum Number of Documents 1.4 Billion Unknown (Up to 10,000 batches / day) 2 Billion / index 2 Billion / index
Maximum Number of Fields 1,000 200 1,000 1,000
Maximum Document Size 16 MB 1 MB 2 GB 2 GB
Maximum Bulk Operation Size 1,000 documents / 16 MB 1,000 documents / 5 MB 100 MB Configurable
Nexted Object Fields Support No No Yes Yes
TLS Encryption in Transit Yes Yes Yes Requires X-Pack add-on
Encryption at Rest Unclear No Yes Possible with third-party software
Automatic Scaling No Yes No No
Pricing Model Managed nodes (per hour) EC2 infrastructure instances (Per hour) EC2 Infrastructure Instances (per hour) Core software is free; X-Pack requires a license; cloud fees are up to you
Price Range Per Month, Per Instance (US) Free - $1,999.87 $43.90 - $559.49 $13.39 - $4,670.83 Variable
Contact Allcloud to design and build your search solutions:

/ 844-6-CLOUD-6

Previous: Packaging Containers Using Linux Subsystem for Windows