Analysis of voice-generative artificial intelligence software applied to podcasting

Indexing metadata

Dublin Core	PKP Metadata elements	Document metadata
1. Title	Document title	Analysis of voice-generative artificial intelligence software applied to podcasting
2. Creator	Author’s name, institution, country	María FITÓ-CARRERAS; Universidad Internacional de Catalunya
2. Creator	Author’s name, institution, country	Montserrat VIDAL-MESTRE; Universidad Autónoma de Barcelona
2. Creator	Author’s name, institution, country	Alfonso FREIRE-SÁNCHEZ; Universidad Abat Oliba CEU
3. Subject	Discipline
3. Subject	Keyword(s)
4. Description	Abstract
5. Publisher	Organizing institution, location	Universidad Francisco de Vitoria
6. Contributor	Sponsor(s)
7. Date	(YYYY-MM-DD)	2025-01-31
8. Type	Status and genre	Peer-reviewed article, double-blind
8. Type	Type
9. Format	File format	PDF
10. Identifier	Uniform resource identifier	https://comunicacionyhombre.com/en/article/analysis-of-voice-generative-artificial-intelligence-software-applied-to-podcasting/
11. Source	Title; vol., num. (year)	Comunicación y Hombre; Num. 21 (2025): Avatars and Replicants in Communication and Humanities
12. Language	English=en
13. Relationship	Complementary files
14. Coverage	Geo-spatial location, chronological period, research sample (sex, age, etc.)
15. Rights	Copyright and permissions	Copyright (c) 2017 Universidad Francisco de Vitoria. This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International license.

Voice AI can generate human-language speech through deep learning algorithms, such as convolutional neural networks (CNNs), which learn to imitate vocal patterns from speech data. This study provides an overview of voice AI applied to podcasting and asks whether current technological offerings threaten the jobs of audio professionals, particularly voice-over artists. First, the main software used by podcast creators for voice cloning is analyzed and a comparative framework is established. Second, creators’ perceptions of the results are gathered by analyzing 10 titles. The findings identify both the opportunities and the limitations this technology offers creators. The main software provides specific tools that can streamline workflows and reduce production costs, and the voice AI industry is adapting to the sector’s needs, offering multiple tools through specialized platforms for cloning voices, editing recordings, publishing podcasts, and distributing them in several languages. However, voice AI is not perceived as an immediate threat, because it reproduces prosody inaccurately and lacks paralinguistic elements.
