Natural Language Queries to Wikidata: A Naïve Prototype

From semantic-mediawiki.org
SMWCon Fall 2023Natural Language Queries to Wikidata: A Naïve Prototype
SMWCon Fall 2023
Natural Language Queries to Wikidata: A Naïve Prototype
Talk details
Description: In this lightning talk, we'll delve into a prototype I've been tinkering with, which is all about enabling more intuitive interactions with Wikidata through natural language queries.
Speaker(s): Robert Timm
Type: Lightning talk
Audience: Everyone, Developers, Community
Event start: 2023/12/13 14:30:00
Length: 15 minutes
Video: click here
Keywords: llm, prototype, rag, wikidata
Give feedback

In this lightning talk, I'll be showcasing a prototype focused on enhancing user interaction with Wikidata through natural language processing. This prototype's aim is simple: to enable users to pose questions in a conversational manner and to receive responses backed by and directly linking to Wikidata statements.

The system uses a retrieval-augmented generation method, where the it initially retrieves data from Wikidata in textual format. This data is then assessed for its relevance to the user's query, and integrated into a large language model prompt. Currently, the project is in its experimental phase.

During the presentation, I'll demonstrate some code snippets, provide a basic walkthrough of the prototype, and discuss both its potential and the limitations I've faced with this approach. I'm eager to share ideas and receive feedback on this initial version of the tool.

Checkout the code: https://github.com/rti/askwikidata

Find the slides: https://docs.google.com/presentation/d/1YgDmcvoXaqnYdRyX5RxewVkeioEJ92nb8Sfb_halBsM

Try it out online in Google Colab: https://colab.research.google.com/drive/1yRZshpNj0kXwY0XuUYw5ziqjw_RffxH-