Glossary

What is Knowledge Graph?

Concept By David Hamilton Updated
Definition

Knowledge Graph is a structured representation of information as a network of entities (nodes) and relationships (edges), used to store, query, and reason over connected data.

What a knowledge graph is

A knowledge graph represents information as a network of entities and the relationships between them. Each entity is a node. Each relationship is an edge with a label. The classic format is a triple: subject, predicate, object.

“David authored Article-42.” That’s one triple. “Article-42 is about Topic-MCP.” Another triple. “Topic-MCP is related to Topic-AI-Agents.” A third. Stack thousands of these and you have a graph that captures structured knowledge in a way that’s both machine-readable and traversable.

Google’s Knowledge Graph (the panels you see on the right side of search results) is the most famous public example. Wikidata is the public, structured version of a knowledge graph at internet scale.

How they differ from databases

A relational database stores information in tables with rows, columns, and foreign keys. Joining tables to follow relationships gets expensive when relationships matter more than the data.

A graph database is built around the relationships. Traversing edges (A connects to B connects to C) is the primary operation, and it stays fast even as the graph grows. Graph databases like Neo4j, ArangoDB, and Amazon Neptune are optimized for this access pattern.

The choice between them isn’t ideological. If most of your queries are “fetch rows where condition X”, relational databases are simpler. If most of your queries are “find all entities connected to X within 3 hops via these relationship types”, graph databases win.

Why AI systems use them

Knowledge graphs solve two problems that pure vector-based retrieval struggles with.

Disambiguation. “Apple” is a company. “Apple” is a fruit. “Apple” is a record label. Free-text search treats them as the same token. A knowledge graph has them as distinct entities with distinct relationships. Searching for “Apple’s revenue” walks toward the company; searching for “Apple varieties” walks toward the fruit.

Reasoning paths. A model can answer “what cities did this author live in?” by following the graph from author to publication to publication-location to author-location, even when no single document mentions all that information together.

The current state of the art combines both: vector search for semantic similarity, graph traversal for structured reasoning. Hybrid retrieval often outperforms either alone.

Personal knowledge graphs

The big public knowledge graphs (Wikidata, Google’s KG) cover general knowledge. Personal knowledge graphs cover one person’s content: notes, bookmarks, papers, photos, calendar events.

Tools like Roam Research, Obsidian, and Logseq build personal knowledge graphs through bidirectional links. Every backlink between two notes creates an edge. Over time, your notes become a navigable graph, not just a folder of files.

Personal knowledge management is the broader practice. Knowledge graphs are one way to implement it. Other approaches include flat note collections, topic-clustered archives, and timeline-based logs. Different tools suit different thinkers.

Where ContextBolt fits

ContextBolt doesn’t expose an explicit graph view, but the underlying retrieval uses graph-like structure. Saves get clustered by topic. Topics connect to other topics through shared content. When you query, the system can traverse these implicit connections to surface saves that don’t match keywords directly but live in the same conceptual neighborhood.

This is closer to a topic-clustered archive than a strict knowledge graph, but the retrieval benefits overlap. You ask for content about a topic, you get content from connected topics too, without manually maintaining the connections.

For most personal use cases, this is the right trade-off. Maintaining an explicit graph (linking every note to every related note) is real work. The implicit graph emerging from automatic clustering and embeddings gives you most of the benefit with none of the maintenance.

Terms related to Knowledge Graph

Knowledge Graph: FAQs

What's a simple example of a knowledge graph? +
Think of it as 'X is connected to Y by relationship Z'. 'David authored Article-42'. 'Article-42 is about Topic-MCP'. 'Topic-MCP is related to Topic-AI-Agents'. Each entity (David, Article-42, Topic-MCP) is a node. Each relationship (authored, is-about, related-to) is a labeled edge. Stack thousands of these and you have a graph.
How is a knowledge graph different from a database? +
A relational database stores information in tables with rows and columns. A knowledge graph stores it as nodes connected by labeled edges. Graphs are better when relationships matter as much as the data itself: social networks, citation networks, ontologies, recommendation systems. Tables are better when most queries don't traverse relationships.
Why do AI systems use knowledge graphs? +
Two reasons. First, graphs encode reasoning paths the model can follow ('A is related to B, B is related to C, therefore A and C may be connected'). Second, graphs disambiguate entities ('Apple the company' vs 'apple the fruit') better than free text, which improves retrieval accuracy. Many modern RAG systems combine vector search with graph traversal.
What's a personal knowledge graph? +
A knowledge graph built from one person's notes, bookmarks, and content rather than a global encyclopaedia. Tools like Roam Research and Obsidian (with backlinks) are personal knowledge graphs. So is, in a sense, your bookmark collection once it's tagged and topic-clustered. The graph emerges from how your saves connect.
Does ContextBolt build a knowledge graph? +
Implicitly, through topic clustering and tagging. ContextBolt doesn't expose a literal graph view, but the underlying structure groups related saves and connects them through shared topics. When you ask Claude via MCP for content related to a topic, the system uses these connections to surface saves that aren't direct keyword matches.