Wikontic: Constructing Wikidata-Aligned, Ontology-Aware Knowledge Graphs with Large Language Models

Building ontology-verified knowledge graphs from text using LLMs.

Alla Chepurova Aydar Bulatov Mikhail Burtsev Yuri Kuratov
EACL 2026, AAAI 2026 (Demo)

Wikontic converts unstructured text into structured knowledge graphs (Chepurova et al., 2026). It uses an ontology to control how extracted knowledge is represented: which entity types are allowed, which relations are valid, and how facts should fit together.

The pipeline extracts candidate (subject, relation, object) triplets, refines entities and relations, validates them against an Ontology (we use ontology from WikiData), and stores the resulting graph for retrieval, question answering, visualization, and any other use cases.

Wikontic pipeline for extracting, ontology-aligning, and refining knowledge graphs from text.

What It Does

  • Extracts candidate triplets from raw text with an LLM.
  • Aligns and normalizes entities and relations using constraints from Wikidata ontology.
  • Supports both ontology-aware and ontology-free modes, can adapt to Wikidata-like ontologies, and has LangChain integration.
  • Supports English and Russian languages.

Results

  • On MuSiQue, the correct answer entity appears in 96% of generated triplets.
  • In triplets-only QA (without original context), Wikontic reaches 76.0 F1 on HotpotQA and 59.8 F1 on MuSiQue.
  • On MINE-1, it reaches 86% information retention.
  • KG construction uses about 3x fewer tokens than AriGraph and under 1/20 of GraphRAG.

Wikontic For Complex QA Data Generation

Wikontic’s KGs are also useful as an intermediate representation for generating complex QA datasets and synthetic data.

  • Benchmarking: DRAGOn (Chernogorskii et al., 2026) builds RAG benchmarks over periodically updated corpora. Its generation pipeline extracts KGs from text and samples graph structures to create QA pairs with different complexity levels.
  • Training: OCC-RAG (Savkin et al., 2026) uses Wikontic KGs as one component of its synthetic data generation pipeline for multi-context, multi-hop QA. The resulting training data substantially improves compact Qwen3 models: on HotpotQA, In-Acc rises from 34.8 to 57.6 for 0.6B, and from 47.7 to 60.9 for 1.7B.

Citation

  1. Wikontic: Constructing Wikidata-Aligned, Ontology-Aware Knowledge Graphs with Large Language Models
    Alla Chepurova, Aydar Bulatov, Mikhail Burtsev, and Yuri Kuratov
    In Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics (Volume 1: Long Papers), Mar 2026

References

  1. DRAGOn: Designing RAG On Periodically Updated Corpus
    Fedor Chernogorskii, Sergei Averkiev, Liliya Kudraleeva, Zaven Martirosian, Maria Tikhonova, Valentin Malykh, and Alena Fenogenova
    In Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics (Volume 4: Student Research Workshop), Mar 2026
  2. OCC-RAG: Optimal Cognitive Core for Faithful Question Answering
    Maksim Savkin, Mikhail Goncharov, Alexander Gambashidze, Alla Chepurova, Dmitrii Tarasov, Nikita Andriianov, Daria Pugacheva, Vasily Konovalov, Andrey Galichin, and Ivan Oseledets
    2026