Cleaning and validating ISBN numbers with Python

Abstract

This notebook demonstrates how to clean and validate ISBN numbers using Python.
It introduces a simple workflow to identify and correct formatting inconsistencies in ISBN-10 and ISBN-13 numbers.
You will learn to use string methods, regular expressions, and conditional logic to ensure that ISBN data is accurate and standardized.

Learning goals
After completing this notebook, you will be able to:

  • Identify and handle typical issues in ISBN datasets (missing digits, dashes, inconsistent lengths)
  • Understand the difference between ISBN-10 and ISBN-13 formats
  • Implement Python functions to clean and normalize ISBN strings