Data visualisation of Mozilla Common Voice v9 dataset metadata coverage in Observable

Covers how accents are represented in voice data, such as BCP-47 and ISO-639 standards, and what challenges this presents for machine learning.

Socially responsible representation of accents in voice data: Considerations for practitioners and policymakers

Covers how accents are represented in voice data, such as BCP-47 and ISO-639 standards, and what challenges this presents for machine learning.

Data, decisions and dynamics

Data, decisions and dynamics was an invited keynote for VALA22 - Bring IT on!, in which Kathy Reid covered machine learning in libraries.

Custodians and midwives cover

Custodians and midwives: the library of the future

Custodians & Midwives: The Library of the Future provides research and analysis on artificial intelligence and emerging technologies at the National Library of Australia.

Ken Behrens on a road traffic sign

Ken Behrens: A cybernetic detective story of algorithms, accents and augury

How can cybernetics help us investigate the story behind a Canberra meme?

Communities are systems presentation header

Communities are systems: invited keynote for linux.conf.au 2022

What can systems thinking teach us about building open source communities?

DeepSpeech PlayBook

I developed the DeepSpeech PlayBook to demystify the process of training speech recognition models with Mozilla's DeepSpeech engine.

More Voice Less Choice

More choice less voice: The rise of voice interfaces and the decline of open source voice

Voice is becoming ubiquitous. But the way voice has scaled means that our choice in voice technologies is constrained. Let's create voice for everyone, everywhere, in every language.

Ensuring Black voices matter: Why your voice assistant is racist, and what you can do about it

Does your voice assistant support multiple languages? At PyConAU 2020, Kathy Reid explores why not - and how we can fix it.