Case Study: Verifiable Resumes with Verifiable Credentials

This Rockefeller Foundation’s mission is to promote the well-being of humanity throughout the world through advances in science, data, policy, and innovation to solve global challenges related to health, food, power, and economic mobility. Since 1913, the Foundation has hosted numerous convenings and awarded thousands of grants to achieve its mission, thus resulting in a lot of unstructured data.

The Foundation sought to connect their unstructured and structured data to external sources to extract and connect entities as input into a knowledge graph to produce insights, such as what people are at different events together? Who have we funded and where do they get their next round of funding from?

The Foundation engaged Predictive UX to develop a Proof of Concept (POC) Natural Language Processing (NLP) solution for Named Entity Resolution (NER) and a graph database to enable insight discovery against internal and external sources of data.

The hypothesis was that The Foundation could use the knowledge graph to gain intelligence about grants awarded issued by other organizations, grantees, convenings, and other events.

To achieve the goals of the Foundation, we proposed an NER pipeline architecture, led data modeling and NER pipeline development, entity extraction, and knowledge graph implementation.

The NER pipeline allowed us to extract entities such as people, place, organization, funding amounts, and more. The pipeline included a step for Named Entity Disambiguation (NED) to associate names even when they present differently across contexts (e.g., Elizabeth Baker, Liz Baker, and Liz M. Baker) with certainty, thus reducing duplicate data.

This project resulted in 1.2M entities being extracted and 9M queryable relationships and contributed to the foundation’s long-term knowledge graph strategy.

Entity Disambiguation and Knowledge Graph Ingestion Flow

This diagram shows the data pipeline designed during Predictive UX’s work with The Rockefeller Foundation. It outlines how raw documents are processed through a named entity recognition (NER) and coreference resolution pipeline, followed by topic and relationship extraction, before being integrated into a structured knowledge graph. The flow visualizes the architecture of the proof-of-concept used to link people, organizations, and topics across unstructured grant and news data.

Entity Disambiguation (NER), Entity Linking, and Knowledge Graph Creation

Entity disambiguation and entity linking on a corpus of historical data to connect it semantically in a knowledge graph.

Outcomes

1.2M

distinct entities extracted

9M

queryable realtionships

About The Project

Client

What We Did

Outcomes

Delivery Time

Client

What We Did

Outcomes

Data Goals

Selecting Better Residents

Maximizing Connections

Maximizing Connections

Identifying Insights, Key Themes

Our Approach

Predictive UX News and Insights.