
An introduction to data catalogs

Danger

data.world University!

Check out our What is a data catalog? video!

data.world is a cloud-native data catalog that makes it easy for everyone, not just the data people, to get clear, accurate, fast answers to any business question. A data catalog is a structured collection of data that allows an organization to find and manage its data. It includes metadata that describes the location, structure, and quality of data, as well as information about data usage, relationships, meaning, and lineage. A data catalog acts as a central repository that stores the key information about datasets and helps users discover, organize, access, understand, and use the available data.
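
To make the kinds of metadata listed above more concrete, here is a minimal Python sketch of what a single catalog record and a simple keyword search might look like. It is an illustration only and not the data.world API: `CatalogEntry`, `Catalog`, every field name, and the sample datasets are hypothetical.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class CatalogEntry:
    """Hypothetical metadata record describing one dataset."""
    name: str
    location: str                      # where the data lives
    columns: Dict[str, str]            # structure: column name -> type
    description: str = ""              # meaning, in business terms
    quality_score: Optional[float] = None          # quality, if measured
    upstream: List[str] = field(default_factory=list)  # lineage: source datasets
    tags: List[str] = field(default_factory=list)


class Catalog:
    """Hypothetical in-memory repository of catalog entries."""

    def __init__(self) -> None:
        self._entries: Dict[str, CatalogEntry] = {}

    def register(self, entry: CatalogEntry) -> None:
        self._entries[entry.name] = entry

    def search(self, term: str) -> List[str]:
        """Return sorted names of entries whose name, description, or tags mention term."""
        needle = term.lower()
        hits = []
        for entry in self._entries.values():
            haystack = " ".join([entry.name, entry.description, *entry.tags]).lower()
            if needle in haystack:
                hits.append(entry.name)
        return sorted(hits)


# Made-up sample data for illustration.
catalog = Catalog()
catalog.register(CatalogEntry(
    name="orders",
    location="warehouse.sales.orders",
    columns={"order_id": "int", "amount": "decimal"},
    description="One row per customer order, used for sales reporting.",
    upstream=["orders_raw"],
    tags=["sales", "finance"],
))
catalog.register(CatalogEntry(
    name="customers",
    location="warehouse.crm.customers",
    columns={"customer_id": "int", "region": "string"},
    description="Customer master data.",
    tags=["crm"],
))

matches = catalog.search("sales")
print(matches)
```

A real catalog stores far richer metadata and indexes it for search, but the idea is the same: users query descriptions of the data rather than the data itself.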

The data.world Data Catalog Platform is built on a knowledge graph architecture. It is the only platform that uncovers hidden relationships between assets, people, domains, and processes that can improve data literacy, decision-making, and company-wide adoption of data and analytics.

A knowledge graph is like a smart map that shows how different pieces of information are connected to each other. Imagine it as a web where each point (or node) represents a piece of data, and the lines (or edges) between them show how they're related. A knowledge graph for a data catalog can help make sense of all the information by showing relationships between different datasets, files, and data sources.
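
The node-and-edge picture described above can be sketched in a few lines of Python. This is a toy model under stated assumptions, not how the data.world platform is implemented: the `KnowledgeGraph` class, the relation names, and the sample assets are all hypothetical.

```python
from collections import defaultdict


class KnowledgeGraph:
    """Toy directed graph: nodes are assets, edges are labeled relationships."""

    def __init__(self):
        # Maps each source node to a list of (relation, target) pairs.
        self._edges = defaultdict(list)

    def add_edge(self, source, relation, target):
        self._edges[source].append((relation, target))

    def neighbors(self, node):
        """Return the direct (relation, target) pairs for a node."""
        return list(self._edges.get(node, []))

    def reachable(self, start):
        """Return every node that can be reached by following edges from start."""
        seen = set()
        stack = [start]
        while stack:
            node = stack.pop()
            for _relation, target in self._edges.get(node, []):
                if target not in seen:
                    seen.add(target)
                    stack.append(target)
        return seen


# Made-up assets and relationships for illustration.
graph = KnowledgeGraph()
graph.add_edge("sales_dashboard", "reads_from", "orders_table")
graph.add_edge("orders_table", "loaded_from", "orders.csv")
graph.add_edge("orders_table", "owned_by", "finance_team")

direct = graph.neighbors("sales_dashboard")
everything = graph.reachable("sales_dashboard")
print(direct)
print(sorted(everything))
```

Following edges from `sales_dashboard` surfaces not only the table it reads from but also the file that table was loaded from and the team that owns it, which is the kind of indirect relationship a knowledge graph makes visible.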

Note

Read more about data catalogs in our blog post.

[Image: knowledge_graph.png]