Progress Acquires MarkLogic! Learn More

Search All Your Websites With a Master Data Hub

Back to blog
3 minute read
Back to blog
3 minute read

How many websites does your organization have? These days, it is common for one organization to own and manage multiple external websites. Take MarkLogic for example. We have our main website at, developer community at, and MarkLogic World at just to name a few, all serving one mission while targeting different audiences – business management, developers, event attendees, etc.

And like most companies infrastructure has evolved. These three websites run on different platforms, with siloed databases, providing a master search against them can be quite challenging. Like many companies, we have a tremendous amount of traffic that comes in from a “Google” search. But now that they are on the site, can they find all the information they are looking for? What if information resides on another external site — and not the one the visitors are on? You surely would not want them to go back to Google or go through each site of yours and conduct multiple searches — or worse, decide that you just don’t have what they are looking for.

To overcome this challenge and thus improve the end-user experiences of those visiting our sites, we knew the solution — build a master data hub on MarkLogic. I just had to have some free time to put it all together. I finally cleared the deck and got to it last fall.

NoSQL database MarkLogic makes it easy to bring separate siloed data into a single database and web developers managing multiple websites for a single organization or client should definitely take time to look into it. MarkLogic’s flexible data model allows data from different sources and formats provide a single platform for all your data.

Building a Master Data Hub

Our solution was to consolidate content from the different sites into a single MarkLogic instance and expose data back to each site via REST API where all search queries are made. Since MarkLogic indexes data as it is ingested and is completely transactional, any data flowed would be immediately available — so I didn’t have to worry about synchronizing databases.

My colleague Mike Wooldridge had built a MLPHP library – so integrating MarkLogic with websites or web applications built on PHP is extremely easy. However, since I was new to MarkLogic development, I did run into a few challenges including mapping different content types to different collections.

WordPress Plugin to MarkLogic

In another blog post, I will discuss technical details of our approach, what those challenges were, and how, we overcame them. Anyway… back to the solution part. Our developer site was already running on MarkLogic so it made a perfect sense to consolidate data there. We built a WordPress plugin that pushes data from the main site at www to developer site whenever new content is created or existing content is updated.

The end result is that visitors conducting searches on the main www site can now see results from the developer site, and those conducting searches on the developer site can see results from www site. Also, the search result page now has facets, making it easy to drill down on different content types, something that native WordPress search does not provide.

MarkLogic search on WordPress

Sample search results page on

Moreover, the WordPress plugin that we built is available to public for those who want to bring the same powerful search to your WordPress websites. The plugin has an option to replace native search with MarkLogic search for seamless integration.

What’s next for us? Consolidating the main site with developer site was just the first step. One of the challenges we still face is that the search does not include localized content. So we are looking at integrating our other public sites including the localized sites in German, Japanese, and French, into our data hub. We are also looking at fine-tuning the search results so that more relevant information is shown before others. All these customizations, fine-tuning, and data consolidation are possible now that the search is powered by MarkLogic. If this does not excite web developers out there like myself, I don’t know what would!

Stay tuned for a technical blog post on this topic which will be published on the developer blog. I will also add a link here when it is available.

Seong Bae

Seong Bae is a MarkLogic Alumnus passionate about anything related to websites and web technologies.

Read more by this author

Share this article

Read More

Related Posts

Like what you just read, here are a few more articles for you to check out or you can visit our blog overview page to see more.

Use Cases

Extracting Hidden Value from Your Content

Get more value from your content by improving discoverability. Learn about the key elements of a smarter content roadmap, and what steps most enterprises are missing.

All Blog Articles
Use Cases

Using MarkLogic for Cyber Threat Analytics and Awareness

MarkLogic helps agency decision-makers connect to their IT data quickly, make better decisions, become more agile, and solve their most difficult cyber challenges.

All Blog Articles

Standardizing Internal Data Models on FHIR

Learn about MarkLogic’s work on a FHIR-based standardized data model to support persisted payer data for our Medicaid Accelerators.

All Blog Articles

Sign up for a Demo

Don’t waste time stitching together components. MarkLogic combines the power of a multi-model database, search, and semantic AI technology in a single platform with mastering, metadata management, government-grade security and more.

Request a Demo