MarkLogic Server

MarkLogic Server is an XML Server that provides the agility you need to build and deploy next-generation information applications. As a key part of your information infrastructure, it gives your organization the functionality and flexibility required to adapt quickly to changing market conditions and new requirements.

There are many benefits to using MarkLogic Server as the core piece of your information infrastructure. With MarkLogic you can accelerate the development of your information applications to meet your sharing and delivery needs. The unified architecture lowers your development and maintenance costs. The information applications you build can include a wide range of flexible and dynamic delivery options, so you can expand the value of the information by delivering it to your users when and where they need it.

Speed Application Development and Lower Maintenance Costs

Faster, More Efficient Application Development

MarkLogic Server provides the single infrastructure necessary to build and deploy applications. It includes an XML repository, full-text and XML search capabilities, an XQuery engine, and an application server: everything you need to meet your XML content delivery needs. Because your applications run on a single infrastructure, you can develop them more quickly and efficiently, saving you time and money.

Easy Information Integration

MarkLogic Server supports loading information "as-is" so you can avoid making costly, time-consuming, and sometimes impossible information transformations. Instead, you can easily combine XML content, documents, books, messages, user-generated content, and more into a single, centralized repository. And you can get started more quickly when creating applications because you don't need to convert content to its final form before experimenting with new applications and business models.

Simple, Cost-Effective Growth

MarkLogic Server efficiently scales up and out on today's commodity hardware. Its advanced architecture is designed to scale to internet-size problems while delivering sub-second response times to user queries. As your deployment grows, you can support more users and information with fewer servers, resulting in reduced hardware and maintenance costs. And your data center costs can be lower since you require fewer servers to meet your information needs.

Comprehensive Functionality

With MarkLogic Server, organizations find they can replace two or three separate technologies in their information architecture with one platform, reducing development, maintenance, and administration costs. By providing the capability for storage, search, alerting, geospatial query, entity enrichment, and more, Mark Logic provides the most complete platform for XML applications.

Flexible and Dynamic Delivery

Build Location-Based Applications and Services

Use MarkLogic Server's geospatial capability to create new, dynamic location-based applications that help busy, on-the-go users access the information they need. Location-based services can help organizations create loyal users and generate new revenue streams from content that might otherwise go unused.

Update Users Instantly When Relevant Information Arrives

With MarkLogic Server's powerful alerting capabilities, users can set up alerts that notify them when information of interest is loaded. MarkLogic Server makes it easy to monitor dynamic information streams, such as RSS and Atom feeds, for relevant information. Users can combine keyword, geospatial, entity, and structure criteria to create highly targeted, relevant alerts.

Improved Information Reuse

MarkLogic Server gives you the flexibility to experiment with new business models, create prototypes of applications, and repurpose content for new and possibly unforeseen uses. Make your existing content go further by using it for multiple purposes: print, online delivery, syndication, dynamic assembly, and more. You can repackage existing content into new information products that drive new revenue streams and reach new users.

Deliver Information to Any Format or Device

MarkLogic Server allows you to query content, combine it on the fly, perform any transformations, and deliver it dynamically to users. So you can satisfy a larger set of users by delivering the content they want in the context they need.

Expand the Value of Information

Content Enrichment

With MarkLogic Server you can add tags to identify key entities or concepts, improving users' ability to find information. Editors can update documents with annotations and metadata, users can add comments or ratings, and automated tools can identify entities or add location information, all without reloading the content. MarkLogic Server lets you leverage meta-information to build new applications or features, without converting content to a standard schema.

Powerful Document Re-use

MarkLogic Server's modular document capabilities make it easy to re-use common content in many documents, so you avoid recreating information. This is especially useful for documents that need to be translated: when a part changes, you retranslate only that part. You can also better ensure content consistency, for example by including a standard product overview in every feature description. When the standard product overview changes, each feature description that includes it is updated.

Greater Understanding of Information/Use Patterns

Gain a better understanding of information, and of how users are working with it, via content analytics. MarkLogic Server lets you understand and find new patterns, relationships, and other details within your content. This knowledge of how content is related and how users interact with it lets you evolve and refine products to better meet user needs.

Making Information Easy to Find

MarkLogic Server lets you provide users with a rich navigation experience by letting them easily browse through content or refine search results using facets (based on categories or tags) to find information. This means users find what they need faster and are more satisfied with their search experience, increasing repeat visits and re-use.

MarkLogic Server differs from traditional technologies, such as relational databases and enterprise search engines, in that it was designed and optimized for storing, searching and delivering XML content. This fundamental architectural difference provides you with a sustainable, long-term advantage that can't be achieved with traditional technologies.

Performance

Superior Scalability

MarkLogic Server is designed to efficiently manage repositories from tens of gigabytes to hundreds of terabytes of XML content while providing sub-second response times under demanding user loads. Because it is based on a distributed architecture of commodity hardware servers, configurations are easily expanded as needs grow. Performance optimization for content loading and query evaluation lowers hardware costs by requiring fewer servers.

Large-Scale Alerting

MarkLogic Server includes large-scale alerting functionality (sometimes called triggers or profiles), designed to perform well across two dimensions: large numbers of alerts and extremely large amounts of information. Alerts can be defined using a wide range of factors, including keyword, structure, entity, and geospatial information, in any combination, so users learn about new, relevant information immediately.

Instant Information Availability

Search engines have a built-in lag between when information is available and when it's indexed (and returned by searches). With MarkLogic Server, as soon as information is added to the system or changed, it is immediately available and returned in all appropriate searches. So you do not have to worry about users getting incorrect results due to stale information.

Unified Architecture

Universal Indexing

MarkLogic Server's universal index is automatically populated with the full-text and XML structure within XML content. This single view of information assets leads to faster configuration, lower maintenance costs, and increased agility. Other systems often require three or more indexes to achieve the same functionality, drastically increasing storage and maintenance requirements.

Combined XML and Text Search

MarkLogic Server allows you to write powerful queries that combine XML and text constraints (e.g., find contracts with a "governing law" section and a paragraph in the section containing "California"). The combined search provides for more powerful search capabilities that can be exposed to users as needed. This unique blend of relevance-based models and database-style functionality gives you multiple ways to access content.
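To make the "governing law" example concrete, here is a minimal Python sketch of what a combined structural and text constraint means. It is not MarkLogic's API or XQuery: the sample documents, the element names (`contract`, `section`, `title`, `para`), and the function `find_contracts` are all hypothetical, and the sketch scans documents instead of using an index.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample contracts; element names are assumptions for illustration.
CONTRACTS = {
    "a.xml": """<contract>
      <section><title>Governing Law</title>
        <para>This agreement is governed by the laws of California.</para>
      </section>
    </contract>""",
    "b.xml": """<contract>
      <section><title>Governing Law</title>
        <para>This agreement is governed by the laws of New York.</para>
      </section>
      <section><title>Notices</title>
        <para>Send notices to our California office.</para>
      </section>
    </contract>""",
}


def find_contracts(docs, section_title, word):
    """Return URIs of documents that have a section titled `section_title`
    containing a paragraph that mentions `word` (case-insensitive)."""
    hits = []
    for uri, text in docs.items():
        root = ET.fromstring(text)
        for section in root.iter("section"):
            title = (section.findtext("title") or "").strip().lower()
            if title != section_title.lower():
                continue  # structural constraint not met
            paragraphs = ("".join(p.itertext()).lower() for p in section.iter("para"))
            if any(word.lower() in p for p in paragraphs):
                hits.append(uri)  # both constraints met in the same section
                break
    return hits


print(find_contracts(CONTRACTS, "governing law", "California"))  # ['a.xml']
```

A plain text search for "California" would match both documents. The combined query returns only the contract whose governing law section mentions it.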

One-Step Answers

MarkLogic Server provides one-step answers to queries by returning specified content elements instead of links to entire documents. Users no longer need to click through to each link in a list of results, repeating the process until they find the answer.

Feature Richness

Structure-Aware XML Search

MarkLogic Server can search XML elements, attributes, values, and combinations with a full understanding of hierarchy. Your search will examine document structure, metadata, and words, giving you more flexibility when designing a search strategy. This lets you build next-generation applications that better leverage your content.

Geospatial

MarkLogic Server includes geospatial support, which provides fast search, retrieval, and analysis of content marked up with geospatial data. By using the integrated full-text search and geospatial query, organizations can create high-performance, location-based services that fully leverage the value of their content. This allows them to deliver information to users with greater context based on physical location.

Powerful XQuery Implementation

MarkLogic Server provides the industry's most extensive implementation of the W3C-standard XML Query (XQuery) language and fully supports the XQuery 1.0 recommendation. This means you can write applications in a high-level, declarative language and take advantage of training courses and other open access resources when developing and maintaining applications.

XML Repository with Transactional Storage

MarkLogic Server can provide you with a single integrated and centralized version of your content and reduce storage costs resulting from duplication of an entire cache. It implements a transactional system designed for consistent information access, even when content is actively being updated or modified. Transactions are handled in a fully non-blocking manner, ensuring users will never have to wait for results and results will always reflect the latest changes.

Comprehensive Platform

MarkLogic Server provides the broadest set of functionality on the market for storing, managing, and delivering XML-based information. From ingest to update, enrichment, transformation, and delivery, it is architected to handle large amounts of rapidly changing XML. Unlike other systems which "bolt on" multiple products to achieve a similar level of capability, each feature is built-in and designed for use in combination with others. So organizations can imagine and build content applications to leverage information more fully than before.

MarkLogic Server was architected for XML with the goal of handling extremely large amounts of content and providing exceptionally fast access to that content using XQuery. This has resulted in a platform that provides a sustainable, long-term advantage when dealing with content. As your content grows, becomes more complex, or needs to be delivered in new ways, MarkLogic Server is in position to help you with those needs, no matter what they are.

Store

XML Information Store

MarkLogic Server stores and processes information as XML. Unlike search engines, it can serve as the system of record for information by providing persistent content storage. And information processing is performed in an extremely efficient manner, providing significant performance advantages over technologies that must convert to some other representation before processing.

Load Information "as-is"

MarkLogic Server loads XML documents without requiring adherence to any predefined DTD or XML schema. This gives you the flexibility to develop content applications faster than approaches requiring content to be standardized on a single schema.

Universal Index

Like search engines, MarkLogic Server indexes words within documents. Unlike search engines, it also indexes XML elements, their structure, and the text within them. This means MarkLogic Server can quickly evaluate text queries, structural queries, and queries that combine both text and structural constraints (e.g., find diagram captions that mention "engine" in articles whose title contains "Airbus").
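The idea of indexing both words and the elements that contain them can be illustrated with a toy inverted index in Python. This is only a sketch under simplifying assumptions and says nothing about how MarkLogic Server implements its index: the function `build_index`, the sample articles, and the choice to key entries on `(element tag, word)` are all hypothetical, and the structural query is resolved at document level only.

```python
import re
import xml.etree.ElementTree as ET
from collections import defaultdict

# Hypothetical sample articles for illustration.
DOCS = {
    "1.xml": "<article><title>Airbus A380 overview</title>"
             "<figure><caption>Engine cutaway diagram</caption></figure></article>",
    "2.xml": "<article><title>Boeing 747 overview</title>"
             "<figure><caption>Engine mount</caption></figure></article>",
    "3.xml": "<article><title>Airbus A320 cabin</title>"
             "<para>The engine is quiet.</para></article>",
}


def build_index(docs):
    """Build two maps: word -> URIs, and (element tag, word) -> URIs."""
    words = defaultdict(set)
    element_words = defaultdict(set)
    for uri, text in docs.items():
        for elem in ET.fromstring(text).iter():
            # itertext() includes text of descendants, so ancestors are indexed too.
            for token in re.findall(r"\w+", "".join(elem.itertext()).lower()):
                words[token].add(uri)
                element_words[(elem.tag, token)].add(uri)
    return words, element_words


words, element_words = build_index(DOCS)

# Text-only query: every document mentions "engine".
text_hits = words["engine"]

# Combined query: captions mentioning "engine" in documents whose title contains "airbus".
combined_hits = element_words[("caption", "engine")] & element_words[("title", "airbus")]
print(sorted(combined_hits))  # ['1.xml']
```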

High Availability

MarkLogic Server's architecture delivers superior scalability while also providing failover, hot backup, and other high-availability features. Database style journaling and transactional updates mean you can rely on it to reliably store and deliver high-value content to your users.

Auditing

MarkLogic Server makes it easy to monitor system activity by providing auditing functionality. Organizations can audit events such as document update, system shutdown, modifications of permissions, and user authentication to a log file. They can also filter the events they want to log by user, role, outcome (success/failure), event, and/or document, in order to speed analysis and understanding.

Automatic Content Conversion

MarkLogic Server offers integrated, automatic conversion of documents from popular formats (Microsoft Office, PDF, and HTML) to XML. It includes a content processing framework for customizable conversion and transformation services during content ingestion.

Modular Documents

XInclude and XPointer support let you efficiently re-use content by including pieces of any document in another document. When included content changes, the inclusion may (optionally) change, so modular documents are always up-to-date. You can nest inclusions to further increase reuse and central control of information. Modular documents are generated dynamically, and users can search each resulting document as an individual document and have it returned as a search result.
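As a rough illustration of XInclude-style modular documents, the Python sketch below expands an `xi:include` element using the standard library's `xml.etree.ElementInclude` module. It is not MarkLogic-specific: the in-memory store `SHARED`, the loader `load_shared`, and the sample documents are hypothetical, and Python's implementation does not cover XPointer.

```python
import xml.etree.ElementTree as ET
from xml.etree import ElementInclude

# Hypothetical in-memory store of shared fragments, keyed by href.
SHARED = {"overview.xml": "<overview>Standard product overview, v1.</overview>"}

# A feature description that includes the shared overview by reference.
FEATURE = """<feature xmlns:xi="http://www.w3.org/2001/XInclude">
  <title>Alerting</title>
  <xi:include href="overview.xml" parse="xml"/>
</feature>"""


def load_shared(href, parse, encoding=None):
    """Resolve an include from the in-memory store instead of the file system."""
    if parse == "xml":
        return ET.fromstring(SHARED[href])
    return SHARED[href]


def assemble(xml_text):
    """Parse a document and expand its xi:include elements in place."""
    root = ET.fromstring(xml_text)
    ElementInclude.include(root, loader=load_shared)
    return root


before = assemble(FEATURE).findtext("overview")

# Change the shared fragment once; every document assembled afterwards picks it up.
SHARED["overview.xml"] = "<overview>Standard product overview, v2.</overview>"
after = assemble(FEATURE).findtext("overview")

print(before)
print(after)
```

Because the overview is stored once and resolved at assembly time, only the shared fragment needs editing (or retranslating) when it changes.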

Search

XQuery

MarkLogic Server provides the industry's most extensive implementation of the W3C-standard XML Query (XQuery) language and fully supports the XQuery 1.0 recommendation. By providing a query language, as opposed to a search box and simple API, MarkLogic Server lets you write much more powerful queries than what's possible with search engines. And you can write queries in a high-level programming language that does not bind you to a vendor's proprietary syntax or API.

Full-Text Search

MarkLogic Server provides the standard capabilities you'd find in a search engine, such as phrase, Boolean, proximity, wildcard, parametric, and range search capabilities. It also supports language processing, such as stemming, thesauri, and spell-checking. With it you can expose simple "single box" query interfaces or more complex advanced query capabilities for users who want or need to perform content research.

Search and Query

MarkLogic Server goes well beyond basic search by providing not only standard full-text search capabilities, but also database-style queries against content. Not only can you restrict search to any XML element or attribute (leveraging hierarchical structure), but you can also return the content of any elements as a result.

Geospatial

MarkLogic Server supports geo-encoding formats including GML, KML, and GeoRSS. XQuery functions let you search for content within bounding boxes, circles, and arbitrarily complex polygons anywhere on the globe. High-performance geospatial indexing lets you perform real-time analytics and visually explore content by integrating with geographic mapping applications like Google Maps. This functionality can be easily combined with other advanced search and analytics functionality, such as co-occurrence analysis and full-text search.
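The three region types mentioned above (bounding box, circle, polygon) can be sketched as simple point tests in Python. These are generic textbook techniques and not MarkLogic's XQuery functions: the names `in_box`, `in_circle`, and `in_polygon` and the sample coordinates are hypothetical, the circle test assumes a spherical Earth of radius 6371 km, and the polygon test treats latitude and longitude as planar coordinates, which is only reasonable for small regions away from the poles and the antimeridian.

```python
import math

EARTH_RADIUS_KM = 6371.0  # assumption: spherical Earth


def in_box(lat, lon, south, west, north, east):
    """True if the point lies inside the bounding box."""
    return south <= lat <= north and west <= lon <= east


def in_circle(lat, lon, center_lat, center_lon, radius_km):
    """True if the great-circle (haversine) distance to the center is within the radius."""
    p1, p2 = math.radians(lat), math.radians(center_lat)
    dlat = p2 - p1
    dlon = math.radians(center_lon - lon)
    a = math.sin(dlat / 2) ** 2 + math.cos(p1) * math.cos(p2) * math.sin(dlon / 2) ** 2
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a)) <= radius_km


def in_polygon(lat, lon, vertices):
    """Ray-casting test against a list of (lat, lon) vertices (planar approximation)."""
    inside = False
    n = len(vertices)
    for i in range(n):
        lat1, lon1 = vertices[i]
        lat2, lon2 = vertices[(i + 1) % n]
        if (lon1 > lon) != (lon2 > lon):  # edge straddles the point's longitude
            crossing_lat = lat1 + (lon - lon1) * (lat2 - lat1) / (lon2 - lon1)
            if lat < crossing_lat:
                inside = not inside
    return inside


# Hypothetical documents tagged with a (lat, lon) point.
PLACES = {
    "sf.xml": (37.77, -122.42),
    "la.xml": (34.05, -118.24),
    "ny.xml": (40.71, -74.01),
}

west_coast = [u for u, (la, lo) in PLACES.items() if in_box(la, lo, 32, -125, 42, -114)]
near_sf = [u for u, (la, lo) in PLACES.items() if in_circle(la, lo, 37.77, -122.42, 100)]
triangle = [(36, -124), (36, -120), (40, -122)]
in_triangle = [u for u, (la, lo) in PLACES.items() if in_polygon(la, lo, triangle)]
```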

Analyze

Faceted Navigation

With MarkLogic Server, users can drill down based on any number of "facets" to quickly identify information. Facets can be a single value or a range of values providing users with buckets of related items, making it easier to explore information when there are a large number of results. Ranges can be dynamically defined at query time to provide maximum flexibility to users.
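The difference between value facets and range facets whose buckets are chosen at query time can be shown with a short Python sketch. It is a generic illustration, not MarkLogic's API: the item fields and the functions `value_facet`, `range_facet`, and `drill_down` are hypothetical.

```python
from collections import Counter

# Hypothetical search results with two facetable fields.
ITEMS = [
    {"title": "A", "category": "news", "year": 1998},
    {"title": "B", "category": "news", "year": 2003},
    {"title": "C", "category": "report", "year": 2005},
    {"title": "D", "category": "report", "year": 2008},
    {"title": "E", "category": "news", "year": 2009},
]


def value_facet(items, field):
    """Count results per distinct value of a field."""
    return Counter(item[field] for item in items)


def range_facet(items, field, buckets):
    """Count results per bucket; buckets are (label, low, high) with low <= value < high
    and are supplied by the caller at query time."""
    counts = {label: 0 for label, _, _ in buckets}
    for item in items:
        for label, low, high in buckets:
            if low <= item[field] < high:
                counts[label] += 1
                break
    return counts


def drill_down(items, field, value):
    """Narrow the result set to one facet value."""
    return [item for item in items if item[field] == value]


by_category = value_facet(ITEMS, "category")
by_period = range_facet(
    ITEMS, "year",
    [("before 2000", 0, 2000), ("2000-2004", 2000, 2005), ("2005 and later", 2005, 10000)],
)
news_only = drill_down(ITEMS, "category", "news")
```

Each drill-down produces a smaller result set on which the same facet functions can be run again, which is what lets users keep refining.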

Co-occurrence Analysis

With MarkLogic you can perform advanced co-occurrence analysis in real time against large datasets. Co-occurrence analysis identifies frequently occurring pairs of entities within content, which allows you to identify hidden or unknown relationships within your information.
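At its simplest, co-occurrence analysis counts how often each pair of entities appears in the same document. The Python sketch below shows that counting step only. The entity sets and the function `co_occurrences` are hypothetical, and this in-memory loop is not how MarkLogic Server computes co-occurrences.

```python
from collections import Counter
from itertools import combinations

# Hypothetical entity sets, one per document, as an extraction step might produce.
DOC_ENTITIES = [
    {"Acme Corp", "Jane Doe", "Paris"},
    {"Acme Corp", "Jane Doe"},
    {"Acme Corp", "Paris"},
    {"Jane Doe", "John Roe"},
]


def co_occurrences(doc_entities):
    """Count how many documents each unordered entity pair appears in together."""
    pairs = Counter()
    for entities in doc_entities:
        # Sorting gives each pair one canonical order, e.g. ("Acme Corp", "Paris").
        for pair in combinations(sorted(entities), 2):
            pairs[pair] += 1
    return pairs


pairs = co_occurrences(DOC_ENTITIES)
frequent = {pair for pair, count in pairs.items() if count >= 2}
```

Here `frequent` holds the pairs that appear together in at least two documents, which is the kind of signal that can point to a relationship worth investigating.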

Entity Enrichment

MarkLogic Server includes built-in entity enrichment capabilities that store entity information in-line directly with your content, enabling more contextual queries. It can identify 18 different entities within content, including people, places, and things. To ensure you have the content enrichment options and tools you need to develop agile content applications, Mark Logic created the Open Enrichment Framework (OEF) to enable rapid integration of third-party entity extraction engines with MarkLogic Server.

Deliver

Large-Scale Alerting

MarkLogic Server's alerting functionality lets users set up rules (queries) and actions that are triggered when relevant content is loaded. Efficient query serialization and high-performance indexing functionality let the system easily handle large-scale alerting. Rules can be based on keyword search, geographic constraints, and entities, and can even include constraints on information structure (e.g., tell me when the word "president" occurs in a footer or caption). Alerts can execute actions that deliver content to the web, email, and mobile devices.
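The rule-and-action model described above can be sketched in a few lines of Python. This is a conceptual illustration, not MarkLogic's alerting API: the class `AlertRule`, the function `load_document`, and the element names `footer`, `caption`, and `para` are hypothetical, and each rule is evaluated by scanning the incoming document, with no index involved.

```python
import re
import xml.etree.ElementTree as ET


class AlertRule:
    """A stored query (word within given elements) plus an action to run on a match."""

    def __init__(self, name, word, elements, action):
        self.name = name
        self.pattern = re.compile(r"\b%s\b" % re.escape(word), re.IGNORECASE)
        self.elements = elements
        self.action = action

    def matches(self, root):
        # Structural constraint: the word must occur inside one of the listed elements.
        return any(
            self.pattern.search("".join(elem.itertext()))
            for tag in self.elements
            for elem in root.iter(tag)
        )


def load_document(uri, xml_text, rules):
    """Evaluate every rule against a newly loaded document and fire matching actions."""
    root = ET.fromstring(xml_text)
    fired = []
    for rule in rules:
        if rule.matches(root):
            rule.action(rule.name, uri)
            fired.append(rule.name)
    return fired


notifications = []  # stand-in for email, web, or mobile delivery
rules = [
    AlertRule("president-in-caption", "president", ["footer", "caption"],
              lambda name, uri: notifications.append((name, uri))),
]

load_document("1.xml", "<doc><para>The president spoke.</para></doc>", rules)
load_document("2.xml", "<doc><caption>The President arrives.</caption></doc>", rules)
```

Only the second document triggers the alert, because the first mentions the word outside a footer or caption.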

Information Manipulation

MarkLogic Server is designed to process and manipulate information. The underlying capabilities include transactional updates, information processing pipelines, and flow control. MarkLogic Server enables granular insertions and updates of information that relational database management systems cannot easily handle.

Transformation

MarkLogic Server easily transforms information from one structure to another. The simple transformation process supports dynamic assembly and formatting to create multiple outputs of the same information. Information providers can broaden their reach by dynamically delivering target-specific formats to consumer devices such as web browsers, e-books, smart phones, and other mobile devices, all from one common platform.