Architecture

Architecture 

MarkLogic is the first operational database technology for Big Data Applications. Learn how it works and how you can deploy it.

Architecture

Architecture

Application Services

Search API

  • Combines searching, search parsing, search grammar, faceting, snippeting, search term completion, and other search application features into a single API

Information Studio API

  • Leverages the Content Processing Framework to manage document loads and simplify creation of custom document collectors, transformers, and load policies

Library Services API

  • Provides document management services including check-in/check-out and versioning of documents
Architecture

Evaluation Layer

Evaluator
XSLT | XPath | XQuery

  • Evaluates and handles incoming requests to access or modify documents

Query Cache

  • Stores documents and logic in memory for faster response and improve IO efficiency

Broadcaster | Aggregator

  • Federates requests to access or modify documents across data nodes if not in memory
  • Aggregates results returned for processing / transforming as necessary
Architecture

Data Layer

Transaction Controller
Multiversion Concurrency Controller

  • Enables zero-latency ingestion
  • Supports point-in-time querying of system

Data Cache
  • Handles query resolution as well as stores indexes in memory
  • Supports background re-indexing and merging to maintain IO efficiency

Transaction Journal

  • Stores all actions on-disk to maintain transactional integrity in case of system failure even with loaded documents initially stored in memory for immediate access

Indexes
Value | Structure | Text | Scalar | Metadata | Security | Geospatial | Reverse

  • Enables requests to combine value, structure, text, scalar, metadata, role-based, and/or geospatial information for sophisticated capabilities
  • Enables alerting, rules-based classification, and/or matchmaking based on criteria specified by reverse indexes

Compressed Storage
XML | Binary | Text

  • Converts ingested documents from XML to a compressed binary fragment representation to reduce space requirements and improve IO efficiency

Deployment Options

MarkLogic leverages a shared-nothing architecture that enables the system to scale linearly as demand increases. Customers can choose from several deployment options depending on their requirements and environment. For reasonable-sized deployments, customers may choose to deploy MarkLogic on a single server node. For larger deployments, customers can deploy MarkLogic on a cluster of servers using commodity hardware. Other customers choose to deploy MarkLogic in a private or public cloud environment.

Learn more about MarkLogic 5 deployment