Progress Acquires MarkLogic! Learn More
Data Curation

Course Details

Learn to build a MarkLogic Data Hub powered by the MarkLogic database to help accelerate data integration projects and deliver faster time to value to your customers. This course is only recommended if you are using the MarkLogic Data Hub or want to learn the Hub Central interface.

Select Your Day and Time to Register

Course Objectives

By completing this course you will be able to:

  • Develop, test, debug and deploy custom code using a local IDE (Visual Studio Code)
  • Use custom code during ingest, mapping and mastering
  • Implement an entity model that includes nesting and relationships
  • Load data from a variety of sources
  • Load data using a variety of methods and describe the use cases and best practices for each method
  • Use custom code during data ingest
  • Implement mapping configurations for a more complex data model
  • Implement smart mastering configurations with more complexity and customization

Course Outline

Data Services First
  • Understand the high-level approach to data integration projects using the MarkLogic Data Hub
  • Understand the customer and business requirement for the course hands-on project
  • Understand the user stories and technical requirements for the course hands-on project
  • Understand the data sources available for the course hands-on project
  • Understand what it is
  • Understand what it does
  • Initialize and install a new MarkLogic Data Hub project
  • Create users and roles for both business users and members of the technical project team
  • Understand how to use Data Hub specific roles
  • Implement role hierarchies
  • Assign execute privileges necessary to meet project requirements
  • Deploy security configuration using QuickStart and ml-gradle
  • Create a new entity
  • Define properties
  • Configure Indexed
  • Protect access to PII (personally identifiable information)
  • Create flow pipelines
  • Configure ingestion steps
  • Understand the purpose and use of the staging and final databases in a MarkLogic Data Hub
  • Implement key data modeling concepts including document URIs, collections, document permissions, property naming best practices, geospatial data modeling patterns, denormalization, and the use of the envelope pattern
  • Configure mapping steps
  • Use pre-built mapping functions
  • Program, deploy and use a custom mapping function
  • Test and debug mapping steps
  • Understand key semantic data modeling concepts including triples, IRIs, ontology triples, managed and unmanaged triples
  • Load triples to a MarkLogic Data Hub
  • Program, deploy and use a custom harmonization step to add triples to the envelope of a document
  • Explore the use of JavaScript APIs
  • Explore the use of SPARQL
  • Validate that the curated data from the hub can be used to meet the business and technical requirements for the hands-on project
  • Ingest a new data source
  • Curate the new data so that it can be consumed in the same way as existing data
  • Configure a matching step
  • Configure a merging step
  • Test Smart Mastering
  • Explore mastered data
Learning Made Simple

Free Training Courses Will Make You a MarkLogician in No Time

Code editor with question bubbles

Self-Paced Training

Do you need a more flexible learning schedule? Learn on your own time! Take classes as time permits through our self-paced portal.

Get Started

On-Demand Tutorials

Watch the latest video tutorials with hands-on examples covering a wide variety of product features.

Start Watching
Code editor

Learning Assessments

Learning Assessments provide a way for you to demonstrate understanding of the learning objectives defined for a specific training course.

Take an Assessment