HARC Course / Graduate Seminar Model

Computational Thinking in Archival Practice

Integrating computational methods, digital tools, and archival reasoning into modern archival workflows.

This course introduces students to computational thinking in archival science, emphasizing how abstraction, decomposition, algorithm design, systems thinking, NLP, machine learning, digital preservation, and user access tools can support archival work.

Computational Archival Science Digital Records NLP + Machine Learning Digital Preservation ArchivesSpace / GitHub / Python

Course rationale: Digital transformation has changed archival science. Archivists increasingly need to manage complex digital records, big data, metadata verification, automation, AI-assisted workflows, and scalable systems for access and preservation.

Course Type

  • Graduate-level course or intensive seminar
  • Adaptable for internship training
  • Online, hybrid, or workshop format
  • Hands-on weekly activities

Recommended Background

  • Introduction to Archival Studies
  • Digital Archives or Digital Curation
  • Equivalent archival or digital collections experience
  • No advanced programming required

Core Skills

  • Computational thinking
  • Metadata extraction
  • Digital preservation workflows
  • Search and retrieval design
  • Workflow documentation

Course Description

Computational Thinking in Archival Practice introduces the fundamental principles of computational thinking within archival science. As digital tools and methods play an increasingly significant role in archival work, future archivists need to understand how computational methods can support appraisal, arrangement, description, access, records management, and preservation.

Students participate in hands-on exercises that replicate real-world archival challenges. The course emphasizes practical problem-solving, ethical reflection, and the integration of computational tools into traditional archival workflows.

Learning Outcomes

Apply computational thinking in archival contexts

Students learn how abstraction, decomposition, algorithm design, and systems thinking can improve archival workflows.

Use digital tools and techniques

Students explore software, scripting, NLP, machine learning concepts, and digital tools for processing and analyzing archival collections.

Design computational archival workflows

Students design practical workflows that integrate computational tools into archival description, preservation, and records management.

Assess ethical and societal implications

Students evaluate issues such as privacy, algorithmic bias, access, transparency, and accountability in computational archival practice.

Core framing: computational thinking is not about replacing archival judgment. It is about giving archivists structured methods for solving complex information problems at scale.

Eight-Module Course Structure

Module 1 — Introduction to Computational Archival Science

Computational thinking, abstraction, decomposition, algorithm design, and systems thinking.

Focus

  • What is Computational Archival Science?
  • Why computational thinking matters in archival work
  • How archival problems can be broken into computational workflows

Activity

Students identify real-world applications of computational tools in archival science and write a short analysis of benefits and challenges.

Module 2 — Understanding Digital Records and Archives

Digital records, preservation challenges, format stability, and computational analysis.

Focus

  • Differences between digital and physical records
  • Strengths and vulnerabilities of digital records
  • Tools for analyzing and managing digital collections

Activity

Students compare digital and physical records and summarize three computational tools used for digital record analysis.

Module 3 — Handling Digital Archives Using Computational Tools

NLP, machine learning, batch file processing, OCR, and archival text extraction.

Focus

  • Introduction to NLP
  • Machine learning applications in archives
  • Batch renaming and OCR workflows

Activity

Students rename a batch of PDF files and apply OCR to generate text for metadata extraction and analysis.

Module 4 — Organizing and Describing Digital Archives

Metadata extraction, NER, topic modeling, and automated description support.

Focus

  • Metadata extraction techniques
  • Named Entity Recognition
  • Topic modeling for archival analysis

Activity

Students use basic topic modeling to identify themes in OCR-generated text and evaluate how useful the results are for archival description.

Module 5 — Digital Preservation and Computational Thinking

Digital preservation challenges, preservation packaging, integrity, and computational workflows.

Focus

  • Data corruption and obsolescence
  • Storage and packaging strategies
  • Computational preservation methods

Activity

Students create preservation packages and discuss how computational methods support integrity, storage, and long-term access.

Module 6 — Reference, Access, and User Interaction

Search interfaces, retrieval models, semantic search, and user-centered archival access.

Focus

  • Search and retrieval mechanisms
  • Question-and-answer archives
  • Human-computer interaction principles

Activity

Students design a simple search interface for a digital archive and evaluate how it improves user access.

Module 7 — Records and Information Management in the Digital Age

IDEF0 models, electronic records management, metadata verification, and AI-assisted workflows.

Focus

  • Records management standards
  • IDEF0 workflow modeling
  • Metadata verification and linked data

Activity

Students develop a computational records management plan using workflow modeling techniques.

Module 8 — Integrating Computational Thinking into Archival Workflows

Final project, GitHub documentation, automation, scalability, and workflow design.

Focus

  • Collaborative workflow design
  • Version control and documentation
  • Automation and scalability
  • AI-enhanced archival methods

Final Project

Students design, document, and present an archival workflow that integrates computational tools to address a real-world archival challenge.

Assignments and Evaluation

Component Purpose Approximate Weight
Participation Engagement in discussion, collaboration, and hands-on activities. 10%
Weekly Assignments and Activities Applied exercises demonstrating computational thinking and archival problem solving. 60%
Final Group Project Collaborative design of an archival workflow integrating computational tools. 30%

Why This Course Matters

Traditional archival techniques remain essential, but they are no longer sufficient by themselves for managing born-digital records, large-scale digitized collections, complex metadata environments, and computational discovery systems. This course helps students bridge foundational archival knowledge with emerging technical skills.

By integrating computational thinking into archival education, students become better prepared to lead in environments shaped by digital records, automation, AI-assisted metadata, preservation challenges, and user-centered access systems.

Connection to HARC Learning Programs

This course complements HARC short courses and seminars by providing a deeper framework for computational archival science, digital preservation, records management, and scalable archival workflow design.

🖱️ Return to Short Courses, Seminars and Lectures