Augmenting cataloguers: planning an AI Agent to generate MARC21 records

Sheldon Korpet; Nathalie Rees

Authors

Sheldon Korpet Manchester Metropolitan University Library
Nathalie Rees Manchester Metropolitan University Library

Keywords:

Metadata enhancement, AI, cataloguing automation, Large Language Models (LLM), Retrieval Augmented Generation (RAG)

Abstract

This article outlines the planning and development of an AI-powered agent designed to assist with generating records at Manchester Metropolitan University’s Library and Cultural Services. The team explores the use of Large Language Models (LLMs) and Retrieval Augmented Generation (RAG) to automate and enhance cataloguing processes, particularly for unique and multilingual materials in the Special Collections Museum and Manchester Poetry Library.

The article discusses technical challenges, ethical considerations, and the importance of maintaining human oversight to ensure quality and transparency. It also details the system architecture and outlines future goals such as multilingual support and automated record enhancement. This project not only aims to improve efficiency but also empowers staff through upskilling and deeper engagement with emerging AI technologies.

Author Biographies

Sheldon Korpet, Manchester Metropolitan University Library

Library Systems and Discovery Advisor
Nathalie Rees, Manchester Metropolitan University Library

Library Systems and Discovery Librarian

References

Albada, Michael (2025) Building Applications with Agents. O'Reilly. Available at: https://learning.oreilly.com/library/view/building-applications-with/9781098176495/ [Accessed: 20 May 2025]

Gonuguntla, H., Meghana, K., Prajwal, T.S., Koundinya, N.V.S.S., Sai, K.N. and Jain, C. (2024) ‘Evaluating Modern Information Extraction Techniques for Complex Document Structures’, International Conference on Electronic, Computer and Energy Technologies (ICECET 2024), Sydney, 25-27 July 2024. Available at: https://doi.org/10.1109/ICECET61485.2024.10698618 [Accessed: 30 April 2025]

Google (2024) What are AI hallucinations? Available at: https://cloud.google.com/discover/what-are-ai-hallucinations [Accessed: 28 May 2025]

Huang, Yu (2024) Levels of AI Agents: from Rules to Large Language Models. Available at: https://doi.org/10.48550/arXiv.2405.06643 [Accessed: 28 May 2025]

Hugging Face (2025) Leaderboards and Evaluations. Available at: https://huggingface.co/docs/leaderboards/en/index [Accessed: 28 May 2025]

Kirmer, Stephanie (2024) Choosing and Implementing Hugging Face Models. Available at: https://www.stephaniekirmer.com/writing/choosingandimplementinghuggingfacemodels [Accessed: 28 May 2025]

Langchain (2025) Langgraph. Available at: https://langchain-ai.github.io/langgraph/ [Accessed: 19 May 2025]

Mendelevitch, Ofer and Bao, Forrest (2025) ‘Rag vs. fine-tuning’ in O. Mendelevitch and F. Bao Hands-On RAG for Production. Available at: https://learning.oreilly.com/library/view/hands-on-rag-for/9798341621701 [Accessed: 20 May 2025]

Resnik, Philip (2024) Large Language Models are Biased Because They Are Large Language Models. Available at: https://doi.org/10.48550/arXiv.2406.13138 [Accessed: 23 May 2025]

Taulli, Tom (2024) What is LangGraph? O'Reilly. Available at: https://learning.oreilly.com/videos/what-is-langgraph/0642572071776/ [Accessed: 20 May 2025]

Tay, Aaron (2025) ‘Deep Dive into Three AI Academic Search Tools’, Katina, (20 May). Available at: https://doi.org/10.1146/katina-052025-2 [Accessed: 20 May 2025]

Ubl, Malte (2020) ‘Design Docs at Google’, Industrial Empathy, 6 July. Available at: https://www.industrialempathy.com/posts/design-docs-at-google/ [Accessed: 28 May 2025]

Urban, Richard (2024) ‘Keeping up with next-generation metadata in archives and special collections’, Hanging Together, 17 December. Available at: https://hangingtogether.org/keeping-up-with-next-generation-metadata-in-archives-and-special-collections/ [Accessed: 8 May 2025]

Augmenting cataloguers

planning an AI Agent to generate MARC21 records

Authors

Keywords:

Abstract

Author Biographies

References

Downloads

Published

Issue

Section

License