MarkLogic is the first operational database technology for Big Data Applications. Learn how it works and how you can deploy it.
Architecture
Application Services
Search API
- Combines searching, search parsing, search grammar, faceting, snippeting, search term completion, and other search application features into a single API
Information Studio API
- Leverages the Content Processing Framework to manage document loads and simplify creation of custom document collectors, transformers, and load policies
Library Services API
- Provides document management services including check-in/check-out and versioning of documents
Evaluation Layer
Evaluator
XSLT | XPath | XQuery
- Evaluates and handles incoming requests to access or modify documents
Query Cache
- Stores documents and logic in memory for faster response and improve IO efficiency
Broadcaster | Aggregator
- Federates requests to access or modify documents across data nodes if not in memory
- Aggregates results returned for processing / transforming as necessary
Data Layer
Transaction Controller
Multiversion Concurrency Controller
- Enables zero-latency ingestion
- Supports point-in-time querying of system
Data Cache4>
- Handles query resolution as well as stores indexes in memory
- Supports background re-indexing and merging to maintain IO efficiency
Transaction Journal
- Stores all actions on-disk to maintain transactional integrity in case of system failure even with loaded documents initially stored in memory for immediate access
Indexes
Value | Structure | Text | Scalar | Metadata | Security | Geospatial | Reverse
- Enables requests to combine value, structure, text, scalar, metadata, role-based, and/or geospatial information for sophisticated capabilities
- Enables alerting, rules-based classification, and/or matchmaking based on criteria specified by reverse indexes
Compressed Storage
XML | Binary | Text
- Converts ingested documents from XML to a compressed binary fragment representation to reduce space requirements and improve IO efficiency
Deployment Options
Value | Structure | Text | Scalar | Metadata | Security | Geospatial | Reverse
XML | Binary | Text
MarkLogic leverages a shared-nothing architecture that enables the system to scale linearly as demand increases. Customers can choose from several deployment options depending on their requirements and environment. For reasonable-sized deployments, customers may choose to deploy MarkLogic on a single server node. For larger deployments, customers can deploy MarkLogic on a cluster of servers using commodity hardware. Other customers choose to deploy MarkLogic in a private or public cloud environment.
Learn more about MarkLogic 5 deployment