We’ve joined forces with Smartlogic to reveal smarter decisions—together.

You’re Solving the Wrong Problem

I remember an interesting meeting at a very large global company that I worked at years ago. Our product documentation needed to be translated into dozens of different languages – which was a very costly process. We had a great team that did the work and the team leader was very innovative. He was always looking for ways to lower the cost per word while maintaining high quality. In a staff meeting one day he was telling us about the latest refinement his team had developed which reduced the cost per word a little further; maybe another 0.5%. Every improvement helped – there were a lot of products and a lot of words.

Most of us were impressed by the continued progress, but I remember our boss saying, “We’re solving the wrong problem. We’re decreasing our cost per word by another 0.5%, but we should be focused on decreasing the number of words. Why are our products so complicated? Why does it require so many words to describe them? That’s the right problem to fix.” This reframing is something that really stuck with me. You can optimize the heck out of a system, but if you’re turning the wrong set of knobs, you’re never going to have the kind of impact that you really want – the kind you really need.

How does this relate to the database world? Substitute “transformation” for “translation.” You may have a superb team that builds your ETL processes – particularly the transformation part. They may be wringing every last efficiency out of the process. You allow them to spend a lot of time optimizing transformations because it’s costing you so much. That’s great, but they’re solving the wrong problem. The problem isn’t how to shave off another 0.5%, it’s that you have to do so much work in the first place.

Our approach with MarkLogic turns the normal process on its head. We aren’t out to make ETL simpler; we’re out to remove the need for it. We’re not focused on the next 0.5%, but rather on the other 99.5%. That’s where massive savings will arise. I know, it’s easy to say that we do that, but where’s the proof? Rather than try to include it all here, I’ll point you to another set of posts that describe what makes MarkLogic so different from relational and ETL:

Having said that, here’s the short answer. MarkLogic discovers structure rather than making you declare it up front as you must with relational. MarkLogic loads data “as-is” and accesses it with a universal index. Our flexible data model allows you to normalize and harmonize data as you need to. We don’t make you boil the ocean trying to create an uber-schema before you get any value.

The bottom line is that to get really big wins, you need to stop what you’re doing for a moment and ask yourself if you’re solving the right problem. If you’re not, and if your tools are creating the problems rather than helping you solve them, it’s time to move on.

Joe Pasqua - Executive Vice President, Products | MarkLogic

Joe Pasqua brings over three decades of experience as both an engineer and a leader. He has personally contributed to several game changing initiatives including the first personal computer at Xerox, the rise of RDBMS in the early days of Oracle, and the desktop publishing revolution at Adobe. In addition to his individual contributions, Joe has been a leader at companies ranging from small startups to the Fortune 500.

Most recently, Joe established Neustar Labs which is responsible for creating strategies, technologies, and services that enable entirely new markets. Prior to that, Joe held a number of leadership roles at Symantec and Veritas Software including VP of Strategy, VP of Global Research, and CTO of the $2B Data Center Management business.

Joe’s technical interests include system software, knowledge representation, and rights management. He has over 10 issued patents with others pending. Joe earned simultaneous Bachelor of Science Degrees in Computer Science and Mathematics from California Polytechnic State University San Luis Obispo where he is a member of the Computer Science Advisory Board.

Start a discussion

Connect with the community

STACK OVERFLOW

EVENTS

GITHUB COMMUNITY

Most Recent

View All

Unifying Data, Metadata, and Meaning

We're all drowning in data. Keeping up with our data - and our understanding of it - requires using tools in new ways to unify data, metadata, and meaning.
Read Article

How to Achieve Data Agility

Successfully responding to changes in the business landscape requires data agility. Learn what visionary organizations have done, and how you can start your journey.
Read Article

Scaling Memory in MarkLogic Server

This not-too-technical article covers a number of questions about MarkLogic Server and its use of memory. Learn more about how MarkLogic uses memory, why you might need more memory, when you need more memory, and how you can add more memory.
Read Article
This website uses cookies.

By continuing to use this website you are giving consent to cookies being used in accordance with the MarkLogic Privacy Statement.