Amazon CloudSearch is a fully-managed service in the AWS Cloud that makes it easy to set up, manage, and scale a search solution for your website or application
Understanding a Search Engine
A search engine makes it possible to search large collections of mostly textual data items (called documents) to quickly find the best matching results. Search requests are usually a few words of unstructured text, such as “matt damon movies”. The returned results are usually ranked with the best matching, or most relevant, items listed first (the ones that are most “about” the search words).
Documents may be completely unstructured, or they can contain multiple fields that can optionally be searched individually. For example, a search service for movies might have documents with fields for title, director, actor, description, and reviews. Results returned by a search engine are typically proxies for the underlying documents, such as URLs that reference particular web pages.
Amazon CloudSearch provides several benefits over running your own self-managed search service including easy configuration, auto scaling for data and traffic, self-healing clusters, and high availability with Multi-AZ. With a few clicks in the AWS Management Console, you can create a search domain and upload the data you want to make searchable, and Amazon CloudSearch automatically provisions the required resources and deploys a highly tuned search index.
Amazon CloudSearch is a fully managed search service that automatically scales with the volume of data and complexity of search requests to deliver fast and accurate results. Amazon CloudSearch lets customers add search capability without needing to manage hosts, traffic and data scaling, redundancy, or software packages. Users pay low hourly rates only for the resources consumed. Amazon CloudSearch can offer significantly lower total cost of ownership compared to operating and managing your own search environment.
A search service and a storage service are complementary. A search service requires that your documents already be stored somewhere, whether it’s in files of a file system, data in Amazon S3, or records in an Amazon DynamoDB or Amazon RDS instance. The search service is a rapid retrieval system that makes those items searchable with sub-second latencies through a process called indexing.
Supported Additional Features
With this latest release Amazon CloudSearch supports several new search and administration features. The key new features include:
- Language support:
- 34 languages, plus “multiple” to handle mixed language fields
- Per-field language configuration
- Language-specific text analysis
- Multiple levels of algorithmic stemming are available for many languages, including “none”
- Enhanced search features:
- Geospatial search
- New data types: date, double, 64 bit signed int, latlon
- Sloppy phrase search
- Term boosting
- Enhanced range searching for all field types
- Support for multiple query parsers: simple, structured, lucene, dismax
- Query parser configuration options
- Administration features:
- High availability option
- IAM integration
- User configurable scaling
- Available in additional AWS Regions: Asia Pacific (Tokyo), Asia Pacific (Singapore), Asia Pacific (Sydney), Asia Pacific (Seoul), and South America (Sao Paulo)