News / Blog

The Digital Age of Voice: How AI is Shaping Our World

Sunday 30th April 2023

The Digital Age of Voice: How AI is Shaping Our World

Artificial intelligence (AI) is a rapidly evolving field with emerging developments and applications. AI is changing how we learn and communicate, and it is highly likely that researchers, health professionals, teachers, and performers will need to reorient around it.

Many of us interact with Siri or Alexa on a day-to-day basis. They have helped to popularise the use of voice assistants and voice-activated technology. Another significant development in AI is the emergence of ChatGPT, an AI language model developed by OpenAI. ChatGPT-3 has demonstrated remarkable abilities in generating natural language processing (NLP) and can be used in various applications, including language translation and text completion tasks and chatbots (simulating human conversation through text interactions).

To engage with it, I asked how it can support singers and singing teachers. Here are the responses:

  • ChatGPT-3 can generate lyrics and chord progressions.
  • It can assist teachers in selecting appropriate songs for their students based on their skill level and vocal range.
  • It can generate vocal health advice and voice exercises tailored to your needs.
  • It can also assist with pronunciation and language learning for songs in foreign languages by providing guidance on proper enunciation and pronunciation of words and phrases.

There are many exciting prospects to discover! If you haven't tried it yet, you can test it and experience it for free here. Occasionally, there are access issues due to so many people using it; you can get an upgraded version called ChatGPT Plus,  available for $20/£16.20 a month, which avoids this issue. And, if that got you excited, for a small cost ($18/£14.50), you could also try out a variant of GPT-3 called DALL-E which you can use to create images for album covers, social media – such as the image I created for this post – and so much more.

In addition to crafting textual content and images, AI can fashion bespoke spoken voices that are tailored to fit the requirements of a particular brand, product, or service. The technology that underlies text-to-speech (TTS) systems can be customised to match the interests and preferences of individual listeners, making it especially valuable for developing marketing voiceovers and for individuals with visual impairments who rely on audio content. You can create spoken audio for free; there are many voices and styles at Voice Design by Elen Labs. 

AI has become a subject of ethical scrutiny, with recent events sparking debates about its impact on various industries. One example is the original song generated by AI using Singing Voice Clone software and featuring the voices of Drake and The Weeknd, which sparked discussions about legal frameworks. The controversy led to the song being removed from streaming services and raised questions about the possibility of AI replacing human voices altogether. This development could also lead to job losses in the voice-over and news-reading industries.

The use of AI-generated voices has also raised concerns in the television industry, particularly in documentaries where AI-generated voices have been labelled as "deepfake" technology. Calls for full disclosure and consent have been made to address these issues. Additionally, psychologists have warned about the possible negative impact on society if gender stereotypes are perpetuated through AI technology without careful consideration. Finally, to ensure academic integrity in higher education, universities and colleges are implementing natural language processing and machine learning algorithms to train models that can distinguish the distinctive characteristics of content produced by AI.

It is likely that AI will become increasingly prevalent in our field, and while there are concerns about the technology's potential negative impact, it is crucial to recognise the many benefits it can provide to society. As AI continues to evolve, it is necessary to balance leveraging its capabilities with ensuring it remains safe and ethical.

References.

Coburn, C., Williams, K and Stroud, S. R. (2022) ‘Enhanced Realism or A.I.-Generated Illusion? Synthetic Voice in the Documentary Film Roadrunner’, Journal of Media Ethics,  37:4, 282-284, Available at DOI: 10.1080/23736992.2022.2113883

Cotton, D.R.E., Cotton, P. A and Shipway, J. R. (2023) ‘Chatting and cheating: Ensuring academic integrity in the era of ChatGPT’, Innovations in Education and Teaching International, Available at DOI: 10.1080/14703297.2023.2190148

Moradbakhti, L., Schreibelmayr, S and Mara, M. (2022) ‘Do Men Have No Need for “Feminist” Artificial Intelligence? Agentic and Gendered Voice Assistants in the Light of Basic Psychological Needs’, Frontiers in Psychology, 13, Available at DOI: 10.3389/fpsyg.2022.855091

MUK, I. (2023) ‘Voices from Drake and The Weekend Goes Viral’, Mixmag.

Track featuring AI generated voices from Drake and The Weeknd goes viral - News - Mixmag [Accessed 18.04.23]


Recent Posts

Student & Alumni News

Alumna Chrysa Tsaltampasi’s 'Dysphonia' Nominated for Experimental Metal Album of the Year!

Voice Study Centre
Monday 23rd December 2024

Voice Study Centre Alumna, Chrysa Tsaltampasi's album Dysphonia has been nominated for Experimental Metal Album of the Year!

Read More...

Season’s Greetings and a Happy New Year!

Voice Study Centre
Friday 20th December 2024

As we approach the end of the year, we would like to extend our warmest wishes for a healthy, joyful, and prosperous 2025.

Read More...
Student & Alumni News

Voice Study Centre Alumna Claire Turner’s Paper Published in Australian Voice’s Special Issue

Voice Study Centre
Thursday 19th December 2024

We're delighted to share that Voice Study Centre Alumna Claire Turner has had a paper published in Australian Voice, which will be part of their special issue, ‘Insights and Innovations from Singing Practitioner-Research’.

Read More...