Book of Abstracts

Overview

Conference Overview About, Committees & Schedule

Keynotes

N. Katherine Hayles Co-creating with AI: Stimulating Human Creativity or Stifling It? June 5, 3:30–5:00 Eric Lyon How to Compose AI Music That Isn't Mid June 6, 9:30–11:00

Panel

AI and Musical Creativity Panel Discussion June 6, 11:30–1:00

Session 1 — Posthuman Voice and Distributed Agency

Jörg Holzmann Posthuman Vocality and the Infrastructural Reconfiguration of Opera Paolo Paradiso From Vocal Body to Vocal Network: AI and the Reconfiguration of Musical Co-Creativity Darren Woodland Jr. Material Synthesis Composition: Speculocultural Technopoiesis as a Framework for Human-Material-AI Co-Creativity

Session 2 — AI Systems and Co-Creative Practices

Garrison Gerard Ecosystemic Music: Building Systems for Improvisation and Musical Performance Yifeng Yvonne Yuan Glitch Voice: Real-Time Neural Deconstruction of Vocal Meaning Jeremy Francoeur No Truth, No Lies: Narrative Storytelling Through Memes and AI

Session 3 — Cultural, Political & Economic Implications

Sonnet Swire Prompt and Consequence: AI-Generated Music as 21st-Century Propaganda Alvaro E. Lopez Navigating the Convergence of Artificial Intelligence and Music Composition: Labor and Attribution

Book of Abstracts · UC Riverside · June 2026

Co-Creativity in Music, Sound, and AI

Improvisation, Interaction, Composition

UC Riverside Main Campus June 5–6, 2026 · INTS 1111–1113

Artificial intelligence is transforming contemporary music, sound, and audiovisual practices. This conference explores co-creativity as a dynamic interaction between human and computational agents, focusing on improvisation, interaction, and composition.

Programme

Conference Schedule

Session 1 Posthuman Voice and Distributed Agency

9:30–10:00

Jörg Holzmann

Posthuman Vocality and the Infrastructural Reconfiguration of Opera

→ Abstract

10:00–10:30

Paolo Paradiso

From Vocal Body to Vocal Network: AI and the Reconfiguration of Musical Co-Creativity

→ Abstract

10:30–11:00

Darren Woodland Jr.

Material Synthesis Composition: Speculocultural Technopoiesis as a Framework for Human-Material-AI Co-Creativity

→ Abstract

Break 11:00–11:30

Session 2 AI Systems and Co-Creative Practices

11:30–11:35

Amit Roy Chowdhury

Overview of UC Riverside AI Research and Education Institute

11:35–12:00

Garrison Gerard

Ecosystemic Music: Building Systems for Improvisation and Musical Performance Using Algorithmic Composition and AI

→ Abstract

12:00–12:30

Yifeng Yvonne Yuan

Glitch Voice: Real-Time Neural Deconstruction of Vocal Meaning

→ Abstract

12:30–1:00

Jeremy Francoeur

No Truth, No Lies: Narrative Storytelling Through Memes and AI

→ Abstract

Lunch 1:00–2:00

Session 3 Cultural, Political, and Economic Implications of AI Music

2:00–2:30

Sonnet Swire

Prompt and Consequence: AI-Generated Music as 21st-Century Propaganda

→ Abstract

2:30–3:00

Alvaro E. Lopez

Navigating the Convergence of Artificial Intelligence and Music Composition: Labor and Attribution

→ Abstract

Break 3:00–3:30

Keynote 3:30–5:00

3:30–5:00

N. Katherine Hayles

Co-creating with AI: Stimulating Human Creativity or Stifling It?

→ Abstract

Keynote 9:30–11:00

9:30–11:00

Eric Lyon

How to Compose AI Music That Isn't Mid

→ Abstract

Break 11:00–11:30

Panel AI and Musical Creativity

11:30–1:00

Panel Discussion

Amy Skjerseth, Liz Przybylski, Kathryn Agnes Huether, Mesmi, Frank Duchêne

→ Details

Featured Presentations

Keynote Lectures

Keynote · June 5, 3:30–5:00

N. Katherine Hayles

Distinguished Research Professor, UCLA · James B. Duke Professor Emerita, Duke University

Co-creating with AI: Stimulating Human Creativity or Stifling It?

Abstract

Artificial intelligence systems such as Large Language Models learn systems of representation by ingesting vast corpora of human-authored texts. Through attention mechanisms in Transformer architectures, these systems evaluate relationships among tokens, generating context-aware probabilistic structures embedded in high-dimensional semantic spaces.

This keynote examines how such processes enable AI systems to infer implicit rules governing complex representational systems. It compares human cognition with machine-based forms of cognition, asking whether AI can be considered cognitive and, if so, in what sense.

The talk explores the nature of creativity in AI systems in relation to human creativity, addressing both their potential and their limitations. It concludes by proposing strategies for engaging AI as a co-creative partner in ways that stimulate, rather than diminish, human creative capacities.

N. Katherine Hayles is Distinguished Research Professor at UCLA and James B. Duke Professor Emerita from Duke University. Her work focuses on the relations between literature, science, and technology in the 20th and 21st centuries.

She is the author of twelve books, including Postprint: Books and Becoming Computational (2021), Unthought: The Power of the Cognitive Nonconscious (2017), and How We Think: Digital Media and Contemporary Technogenesis (2015), along with over 100 peer-reviewed articles.

Her scholarship has received numerous awards, including the René Wellek Prize and the Suzanne Langer Award. She has held NEH, Guggenheim, and Rockefeller fellowships, among others, and is a member of the American Academy of Arts and Sciences. Her most recent book (2025) is Bacteria to AI: Human Futures with our Nonhuman Symbionts.

Keynote · June 6, 9:30–11:00

Eric Lyon

School of Performing Arts, Virginia Tech · Faculty Fellow, Institute for Creativity, Arts, and Technology

How to Compose AI Music That Isn't Mid

Abstract

Artificial intelligence in music has a long history, extending back to early experiments by Lejaren Hiller and Leonard Isaacson prior to the Dartmouth Conference of 1956. Recent technological developments—particularly the use of GPUs and pre-trained transformer models—have sparked a new wave of AI-based music practices, marking a renewed phase of exploration in co-creative systems.

At the same time, these developments have generated critical responses. Alongside ethical and environmental concerns, a central aesthetic critique has emerged: that AI-generated music tends toward the average, often described as "mid." This concern is especially evident in commercial AI music platforms such as Suno and Udio. A primary focus of this keynote is to examine compositional strategies that move beyond this tendency, exploring how artists can engage AI in ways that produce distinctive and compelling musical outcomes.

The talk will also address the shifting relationship between academic and industry-based research in AI music. Whereas earlier developments in computer music were largely driven by academic research, recent advances have been led by private-sector initiatives with access to large datasets and significant computational resources. In contrast, emerging academic practices emphasize smaller datasets, local computation, and critical engagement with the ethical implications of AI.

Finally, these perspectives will be situated within the presenter's long-term work in algorithmic sound design. Systems such as Mushroom and SLURP will be discussed as examples of generative approaches to sound processing, along with new possibilities for integrating AI into these established compositional frameworks.

Eric Lyon is a composer and audio researcher focused on high-density loudspeaker arrays, dynamic timbres, virtual drum machines, and performer-computer interactions. His audio signal processing software includes "FFTease" and "LyonPotpourri." He has authored two computer music books, Designing Audio Objects for Max/MSP and Pd, a guidebook for writing audio DSP code for live performance, and Automated Sound Design, a book that presents technical processes for implementing oracular synthesis and processing of sound across a wide domain of audio applications. He has written extensively about the possibilities of multichannel spatial audio. In 2016–17, Lyon was guest editor for the Computer Music Journal on Volume 40(4) and 41(1) covering various aspects of High-Density Loudspeaker Arrays (HDLAs).

In 2015–16, Lyon architected both the Spatial Music Workshop and Cube Fest at Virginia Tech to support the work of other artists working with HDLAs. In 2025 he co-created the Spatial Audio Tidepool to provide technical instruction for creative uses of high-density loudspeaker arrays. Lyon's compositional work has been recognized with a ZKM Giga-Hertz prize, MUSLAB award, the League ISCM World Music Days competition, and a Guggenheim Fellowship. Lyon teaches in the School of Performing Arts at Virginia Tech and is a Faculty Fellow at the Institute for Creativity, Arts, and Technology.

June 6, 11:30–1:00

Panel Discussion

Panel · June 6, 11:30–1:00

AI and Musical Creativity

This roundtable brings together scholars and music industry professionals to examine the evolving role of artificial intelligence in musical creativity. Topics include the impact of AI on institutions, industries, and labels, as well as its influence on collaboration, genre formation, and artistic practice. By bridging theoretical and practical perspectives, the panel aims to foster an open and dynamic exchange while encouraging active audience engagement.

Panelists

Kathryn Agnes Huether

Postdoctoral Research Associate, UCLA

Kathryn Agnes Huether is a Postdoctoral Research Associate in Antisemitism Studies at UCLA. Her research examines sound as a political and cultural force, connecting Holocaust and Genocide Studies with sound studies and media theory. She investigates how sonic practices mediate trauma, violence, and collective memory, and how listening becomes a site where ideologies are encoded and contested.

Her recent work extends to AI, authenticity, ethics, and voice, exploring how algorithmic systems reshape presence and testimony. She holds a Ph.D. in Musicology from the University of Minnesota and an M.A. in Religious Studies from the University of Colorado Boulder.

Frank Duchêne

University of Applied Sciences and Arts, Belgium

Frank Duchêne is a Belgian music producer, sound designer, and lecturer whose work blends musical practice, recording technology, and critical analysis. He began his career as a recording artist with Hooverphonic (Columbia Records) and served as an in-house engineer at Galaxy Studios. He then established a long-standing freelance practice as a producer, mixer, and engineer for artists, record labels, and audiovisual media.

For more than 18 years, Duchêne has been a key faculty member at PXL University of Applied Sciences and Arts in Belgium, teaching Music Production and overseeing international projects. His academic focus includes critical listening, production methods, and the evolving relationship between musical creativity and recording technologies—spanning analog, digital, and hybrid workflows.

He handles curriculum development and assessment and contributes to research on sound production as both an analytical and a creative practice. Outside academia, Duchêne remains active in music production and audio post-production, working on records, documentaries, podcasts, and radio plays. More recently, he has created location-based audio experiences for museums. His current interests focus on how emerging technologies—particularly AI-assisted tools—affect creative authorship, aesthetic judgment, and collaboration in modern music and sound production.

Mesmi

Artist, Producer, and Consultant · Los Angeles

Mesmi is an artist, songwriter, producer and consultant based in Los Angeles. Over the years, her role expanded from singer-songwriter origins into the recording studio, gaining skills in production, engineering and mixing. After achieving placements in competitions like the International Songwriting Competition and GRAMMY Amplifier, Mesmi released her self-produced/engineered album "Slow Bloom," which the press described as "powerful, epic, yet fragile and beautiful." Independently distributed and marketed, the LP has since racked up 325K+ plays on streaming worldwide and led to her first television show music placement.

Mesmi was recently honored to be personally chosen and mentored by producer 9th Wonder (Jay-Z, Kendrick Lamar) as part of Sophia Chang's Unlock Her Potential program, for which she currently co-leads the UP Music Industry Chapter, as well as selected for the inaugural cohort of Paramount/MTV Group's First Time Composers Program. Aside from her own projects, Mesmi offers specialized services such as vocal training, production and music consulting through her company VATOCA Studios; she also founded and runs SOUND OFFF, a digital space dedicated to highlighting Asian Americans in the modern music industry, borne out of the need to increase AsAm visibility and strengthen ties within the community.

Panel Organizers

Amy Skjerseth

Assistant Professor of Popular Music, University of California, Riverside

Amy Skjerseth's research explores intersections of music, media, material culture, and technology. Her forthcoming book Preprogrammed: How Electronic Presets Changed Music and Media (UC Press, 2026) examines the cultural impact of technological defaults from early radio to AI systems. She is also co-editor of The Routledge Companion to Voice and Identity and Principal Investigator of the UCHRI working group "Defying Defaults in Technology and Culture."

Liz Przybylski

Professor of Ethnomusicology, University of California, Riverside

Liz Przybylski is a scholar of hip hop and the global popular music industry. She is the author of Sonic Sovereignty: Hip Hop, Indigeneity and Shifting Popular Music Mainstreams (NYU Press), and Hybrid Ethnography. Her work addresses music, technology, labor, and identity across contemporary media environments.

A recipient of an NEH Faculty Fellowship, she serves on the Board of the Society for Ethnomusicology and teaches courses on ethnographic methods, popular music, and cultural studies.

Paper Abstracts

Session 1 — Posthuman Voice and Distributed Agency

Session 1 · June 5, 9:30–10:00

Jörg Holzmann

Independent Researcher

joergholzmann@gmx.de

Posthuman Vocality and the Infrastructural Reconfiguration of Opera

Abstract

This article reconfigures current debates on artificial intelligence (AI) in opera by shifting the focus from questions of authorship and machinic creativity to the infrastructural conditions that shape operatic experience. Rather than introducing mediation into opera, AI reveals mediation as its constitutive foundation, foregrounding the distributed systems that sustain voice, presence, liveness, and authority.

Drawing on media theory, performance studies, and posthuman thought, the study proposes a four-register model of vocality—voice as informational pattern, corporeal grain, iterable trace, and objectal surplus—to analyze how digital infrastructures redistribute agency across human and computational actors. Through this framework, opera emerges as a historically composite media system in which voice is never fully anchored in a singular body but circulates across technological, institutional, and perceptual networks.

The central case study, chasing waterfalls (Semperoper Dresden, 2022), stages AI as a performing subject capable of generating text and vocal material in real time. This transforms liveness from a condition of embodied presence into one of procedural contingency, dispersing aura across a networked ecology of performers, systems, and audiences. A comparative analysis with platform-native vocal systems such as Hatsune Miku highlights divergent regimes of posthuman vocality, contrasting operatic risk and instability with infrastructural iteration and reproducibility.

The article concludes that AI opera redefines sustainability not as the preservation of stable works, but as the maintenance of executable systems and perceptual ecologies. In this context, opera becomes a laboratory for posthuman performance, where voice, agency, and presence are continuously reconfigured within evolving technological environments.

Jörg Holzmann initially studied classical guitar in Stuttgart and received prizes at international competitions in Spain, India, Korea, and the United States. He subsequently studied musicology at the Leipzig University. He was a research assistant at the Musical Instruments Museum Leipzig (2018–2020) and the Bern University of the Arts (2020–2024).

His PhD thesis, affiliated with the University Salzburg, examines the intradiegetic act of music-making in early sound film. In parallel, he is completing a master's degree in German Literary and Art History at Martin Luther University Halle-Wittenberg.

Holzmann's work is situated at the intersection of media theory, performance and nostalgia studies. His research centres on the infrastructural conditions of contemporary opera and (dis)embodied vocality, with particular attention to aesthetic and conceptual counterparts in Romantic-era literature. In addition to his academic work, he performs on the player piano in hybrid settings that integrate holographic and computer-based media technologies.

Session 1 · June 5, 10:00–10:30

Paolo Paradiso

PhD Student, Free University of Bozen-Bolzano

paolo.paradiso@student.unibz.it

From Vocal Body to Vocal Network: AI and the Reconfiguration of Musical Co-Creativity

Abstract

Within the Euro-American art-music tradition, creativity has largely been framed through humanist paradigms privileging individual authorship, intentionality, and stylistic innovation. The growing use of generative AI (GenAI) and machine learning (ML) in composition and performance challenges these assumptions and calls for a rethinking of creative agency.

This paper asks: how does the integration of GenAI into contemporary musical practice reshape musicological concepts of authorship, performance, and agency?

The study develops a posthumanist framework drawing on Donna Haraway's situated hybrid subjectivities, Rosi Braidotti's posthuman subject, and Deleuze and Guattari's notion of assemblage. Dominic Pettman's reflections on vocal relationality further inform the analysis, foregrounding the voice as a site where species, technology, and affective proximity intersect.

Methodologically, the paper combines philosophical inquiry with musicological analysis, focusing on vocal technique, performer-interface interaction, improvisational structures, and the role of artificial neural networks (ANNs) in shaping musical form and timbre. Particular attention is given to how agency is distributed across composers, programmers, performers, ANNs, and technological infrastructure in live contexts.

The theoretical framework is applied to the analysis of two case studies: Tomomibot, by Tomomi Adachi, Andreas Dzialocha and Marcello Lussana, and ULTRACHUNK, by Jennifer Walshe and Memo Akten. In these vocal improvisations between humans and ANNs, voice and body are diffracted through a technologically mediated space and connect rhizomatically with each other, reconstituting themselves in a socio-technical assemblage of co-creation comprising humans, technology, and the shared environment.

The analysis focuses on the co-creative interaction between humans and computers: in these improvisations, real-time vocalizations intertwine with sound outputs generated by ANNs, in a distributed co-construction without primary and secondary roles, but rather interactive nodes within an interconnected network. This sort of hybridization between human and non-human actors interrogates whether "artificial creativity" can be understood not as simulation, but as a materially embedded process of distributed agency, with improvisational structures that take shape precisely from the human-AI interaction. While on one hand we find highly experienced performers of experimental extended vocality, on the other we find artificial voices produced by the computer via ML algorithms (using unsupervised training, GAN and variational autoencoders in ULTRACHUNK, and Long Short-Term Memory in Tomomibot) trained with pre-existing musical material, capable of operating as actants thanks to their non-human agency.

A specific role is given to the voice, which acts as a privileged bridge between biology and technology, which in turn (through 'posthuman listening') can be reimagined not as opposing and mutually exclusive poles, but as elements situated within a continuum.

The contribution of this study is to propose a posthuman redefinition of musical creativity that integrates philosophical theory with close analysis of contemporary experimental vocal practice, offering new conceptual tools for understanding AI-mediated composition and performance within musicology.

The argument put forward in this paper is that these performances represent an attempt to inhabit the extimacy constitutive of both voice and subjectivity: an inner-outer space within us that is constantly shared and traversed by others, with whom we interact to rethink and shape new ways of being together.

Paolo Paradiso is a PhD student in Education and Social Sciences from the Free University of Bozen-Bolzano, working on a project that aims to investigate the implications of using AI in music education in primary schools (supervisor: Prof. Paolo Somigli; co-supervisor: Prof. Michele Cagol). His studies began at the "N. Paganini" Conservatory of Music of Genoa (Italy), where he earned a first-level diploma in Jazz Singing and a second-level diploma in Music Teaching. He subsequently earned a master's degree in Musicology from the University of Pavia (Cremona campus, Italy). In addition to working as a music teacher at public middle schools in Italy, he continued his research activities, which focuses on investigating how emerging technologies reshape vocality and performance, fostering new dialogues between the human and the artificial. He took part in international conferences, seminars and workshops where he had the opportunity to meet and exchange ideas with experts from various scientific fields, thereby broadening his expertise. Passionate about music, reading, and audiovisual culture, he views research as a creative space where artistic sensibility, critical reflection, and technological innovation converge.

Session 1 · June 5, 10:30–11:00

Darren Woodland Jr.

PhD Candidate, Drexel University

dkw34@drexel.edu

Material Synthesis Composition: Speculocultural Technopoiesis as a Framework for Human-Material-AI Co-Creativity

Abstract

This paper introduces Material Synthesis Composition (MSC), a methodology for sonic co-creativity in which relational material substrates serve as primary compositional feed alongside human performers and artificial intelligence and machine learning (AI/ML) systems. Material substrates include organic matter and inorganic data derived from situated cultural accumulations. MSC emerged from Speculocultural Technopoiesis (ST), a framework developed through the author's doctoral research that examines how Black speculative traditions and sonic epistemologies can guide the modification and design of digital audio technologies.

MSC belongs to a longer lineage of Black creative-technological practice. Sun Ra's Arkestra, whose self-mythology fused Afrofuturist cosmology with experimental electronics, modeled how Black artists can occupy and redefine the space of technology on their own epistemic terms. Alvin Lucier's I Am Sitting in a Room demonstrated that material environments are themselves compositional agents. Black performance artists later extended this logic on cultural grounds. Okwui Okpokwasili's on the way, undone, a processional work responding to Simone Leigh's Brick House, stages the Black body moving through public space as both archive and instrument, encoding embodied cultural memory in the act of transit. More recently, Rashaad Newsome's practice has made these stakes legible at the level of AI. From Shade Compositions, which treats Black vernacular gesture as compositional system, to Being, an AI griot trained on texts by bell hooks, Audre Lorde, and Cornel West, Newsome shows that who trains AI, and on what, is already an aesthetic and ethical question. MSC takes up that question as a compositional one.

The case study is Organic Memory (Triptych), a spatial composition currently in development. Three interactive installations generate sound through shared material transformation: substrate vibrations via piezoelectric sensors, sonic residue from dissolution in water via hydrophones, and gestural properties of hair via computer vision. Each movement uses distinct AI/ML tools, including Somax2, for classification, corpus querying, and co-improvisation. These systems are trained on culturally specific corpora, including NASA data sonifications and African-American spirituals. A composer-defined motif threads through all three movements, transformed by interaction and AI elaboration. We draw on findings from early prototype testing and lay out the conceptual scaffolding guiding the triptych's completion.

We advance three contributions to co-creativity discourse. First, we show how material affordances, read through their cultural epistemic situatedness, generate compositional structure when abstracted via sensor data. We call this "material synthesis." The concept shares ground with spectral music's acoustic materialism, but where spectralism tends toward acoustic universalism, material synthesis insists on cultural specificity. Second, we show how AI systems trained on culturally grounded corpora mediate between heterogeneous material languages, translating from earth rhythm to water texture to gestural melody. Third, we examine how Julius Eastman's concept of Organic Music is realized through relational materiality and intra-actions that become computational and compositional input for distributed human-material-machine authorship.

Compositional intelligence emerges through negotiations among culture, material, and algorithm in acts of listening and transduction. Meaningful co-creativity can only exist because such entanglements do.

Darren Woodland Jr. (he/him) is a PhD Candidate in Digital Media, experimental media artist, and creative technologist at Drexel University. His doctoral research develops Speculocultural Technopoiesis, a methodology for modifying digital instruments and tools guided by Black epistemologies and sonic identity. His work explores the entanglements of data, material, and body within media arts and design, with a particular focus on how culturally contextual systems and embodied experience reshape our relationship with technology.

His work has been presented at international venues across Europe, North America, and Asia, including Ars Electronica (2023), the Atlantica Symposium (2024), and SIGGRAPH Asia (2024). He served as Art Director and Lead Technical Artist for Black Ice VR (SIGGRAPH, SXSW, BIFAN) and has held research positions at UNCSA and NC State University. He holds an MAD in Experimental Media Arts from North Carolina State University and a dual BA in Media Arts and Art Studio from the University of South Carolina. He is also an Adjunct Professor at Drexel, where he teaches at the intersection of games, design, technology, and critical media practice.

Paper Abstracts

Session 2 — AI Systems and Co-Creative Practices

Session 2 · June 5, 11:30–12:00

Garrison Gerard

Assistant Professor of Music, UNC Pembroke

garrison.gerard@uncp.edu

Ecosystemic Music: Building Systems for Improvisation and Musical Performance Using Algorithmic Composition and AI

Abstract

Field recordings and passive acoustic monitoring (PAM) generate large archives that provide acoustic windows into ecosystem interactions. Alongside their scientific and aesthetic value, these recordings provide a creative corpus from which to draw sound material that is intimately tied to specific locations. A key challenge is how such large recording sets can be used musically, particularly in a live improvisation setting. Here I explore two approaches to developing systems for co-creativity using algorithmic composition and AI to traverse large recording archives. The first approach is the use of algorithmic systems for recording analysis and playback that respond to real-time inputs such as performer audio or listener presence.

The second is the use of autoencoders, either in the preparation or performance stage, to facilitate real-time interaction with the recording corpus. These approaches build on the innovations in soundscape ecology (such as the use of acoustic indices and PAM) and leverage tools for real-time music creation such as RAVE to open new possibilities for both music composition and deeper listening to soundscape recordings.

Two projects will illustrate these approaches: Resonance Ecology takes an algorithmic approach to facilitating performer interaction with large recording sets. The system mirrors the design of an ecosystem with audible actions triggering reactions in the system (e.g., the sound of a performer influences the system to play different recordings or to process a sound differently). The algorithmic system analyzes frequency, amplitude, and a variety of timbral descriptors to track sonic change over time and to make probabilistic assessments about the current state of the sonic ecosystem; these data points then inform the choices of the algorithm as it navigates the recording archive. The performer interacts with the system through their musical performance; the score for the piece is open, giving the performer agency to respond to the sounds they hear and influence the algorithmic system. Sonifying the Arctic links large PAM datasets with weather data through autoencoders to extend the system beyond the original recording period. For this specific realization, eight months of PAM recordings from Iceland's national parks were used to create a sonification system that extends across more than two years. Autoencoders are also used during performance in Sonifying the Arctic through a RAVE model trained on PAM recordings and traversed by performer and data-driven instrument input such as a motion controller.

Together, these approaches demonstrate how AI-mediated systems can transform large environmental recording archives into interactive frameworks for improvisation and co-creative musical performance. These systems demonstrate the capability for sonic systems to aid in the processing, analysis, and understanding of large recording corpora.

Garrison Gerard is an American composer of electroacoustic and concert music and a soundscape ecologist. His work explores the interaction between nature and music through music composition, soundscape ecology, and audio technology. He has carried out acoustic surveys tracking the impact of human noise on natural ecosystems in Patagonia, the Chihuahuan Desert, Denali National Park, Iceland, and other locations, and is currently conducting a soundscape survey of Lumber River State Park in North Carolina.

His music has been presented internationally with performances by groups such as [Mod]ular Ensemble, Fort Worth Symphony, and Nu Atmospheres Ensemble. An ardent collaborator, he has been commissioned by ensembles and soloists such as Andrew Cook, Spencer Byrd, the Avenue C Project, Atelier Piano Quartet, and Amorsima String Trio. He has also collaborated with artists in other mediums in the creation of experimental works and performance art pieces, most recently including works with the choreographer Briana Less exploring the nature of communication and joint improvisation. Gerard served as Artist-in-Residence of Great Smoky Mountains and Rocky Mountain National Park where he created new works using recorded sound from within the parks.

Gerard completed his Doctoral degree in Music Composition from the University of North Texas and received a Master's in Music Composition from UNT and a Bachelors in Piano from Harding University in Searcy, Arkansas. In 2023 Gerard served as Fulbright Fellow at the University of Iceland. Gerard currently serves as Assistant Professor of Music at the University of North Carolina at Pembroke.

Session 2 · June 5, 12:00–12:30

Yifeng Yvonne Yuan

PhD Candidate, Computer Music, UC Santa Barbara

yifengyuan@ucsb.edu

Glitch Voice: Real-Time Neural Deconstruction of Vocal Meaning

Abstract

This paper introduces Glitch Voice, a real-time neural effect unit and aesthetic inquiry designed to deconstruct semantic speech into a non-meaning-making "glitched" vernacular. While current research in neural audio synthesis predominantly prioritizes high-fidelity replication and semantic clarity, these frameworks often erase the paralinguistic meaning and semiotic space inherent in human vocalization, such as the involuntary physiological tremors, the breathy sound, and the raw, unpolished textures of the vocal apparatus that resist linguistic organization. By utilizing IRCAM's RAVE architecture within Max/MSP, this aesthetic research seeks to develop a real-time effect unit that transforms semantic voice to a visceral and embodied sonic output.

Methodology

The system is trained on two carefully curated datasets of non-semantic vocal "outliers": "Flow" (the sustained drones) and "Burst" (emotional bursts). To facilitate intuitive control over the sonic output, a custom pressure-sensitive interface was developed. By modulating physical grip, the performer can easily morph between these two states, and effectively translate their semantic meaning into a visceral, glitched vernacular.

Contribution

This work proposes a framework for "Neural Transcoding," where the machine functions as a neural mirror that re-interprets the performer's vocal energy through latent space. By centering the aesthetic output on the outliers of vocal expression—the gasp, the friction, the stutter, the scream, etc.—the system intends to process voice based on the subconscious layer of language. This research contributes to the field of performance studies by providing a low-latency, performative tool that bridges the gap between generative audio models and live embodied expression. Building upon a lineage of radical vocal exploration (e.g., Yoko Ono, Trevor Wishart, Pamela Z), this project also serves as an aesthetic exploration of the embodiment of machine learning tools.

Yifeng Yvonne Yuan is a composer and music technologist. She is currently a PhD Candidate in Computer Music and pursuing a Master of Science in Media Arts and Technology at the University of California, Santa Barbara. Her interdisciplinary research operates at the intersection of sound, text, electronics, and performance studies. She bridges technical proficiency in audio DSP programming and C++/JUCE with experimental music composition and often deals with the fragility of human emotions and visceral feelings. Her works have been featured in the PianoSphere series, ICMC, Dance at the Odyssey Los Angeles, BlackHouse Collectives and many more.

Session 2 · June 5, 12:30–1:00

Jeremy Francoeur

Musicology PhD Student, University of Western Ontario

No Truth, No Lies: Narrative Storytelling Through Memes and AI

Abstract

For metal musician BOI WHAT, the world ends not with a bang or a whimper, but with the soundscape of SpongeBob SquarePants. The most famous of BOI WHAT's songs is "Neon Tide," which is chiefly "sung" by the SpongeBob character Plankton, and is about masterminding an apocalypse. BOI WHAT is one of many online creators using AI generated voices—Plankton in this case—in musical contexts. While most such creations are "AI covers," simulating a well-known song being sung by an equally well-known pop-culture character, "Neon Tide" is an example of how emerging AI technology can also be used to create original compositions that make use of these familiar media materials. BOI WHAT leverages the uncanniness of AI-assisted voice modulation and familiarity of the aesthetics of SpongeBob within his audience to craft a surreal apocalyptic narrative.

Art critics often see AI as an inherent threat to creative expression and personal narratives within art. In this talk, however, I will argue that the AI elements of "Neon Tide" demonstrate one way that use of AI technology can expand the narrative and expressive complexity of textual music. I demonstrate this claim by placing audio-based AI technology within the context of "remediation," as proposed by Jay David Bolter and Richard Grusin, and use this context to illustrate AI advancements as part of a growing trend of hypermediacy within both popular music and digital interaction with art as a whole. As artist and audience become closer than ever before online, it becomes all the more relevant to address the ways in which digital musicians place the audience's knowledge of culture, context, and even the artistic process at the forefront, and use the growing accessibility of technology to achieve this.

Additionally, through analysis of the vocal performance, stylistic choices, and lyrics, I assert that this technology and its online context is its own unique force in connecting the elements of the song together and helping to create a cohesive narrative that would not be possible without this technology. I prove this by laying out a conceptual integration network, as theorized by Nicholas Cook, to clarify the layers of meaning within the song. I then analyze the song within this framework by highlighting the use of breath in the vocal performance, production techniques within the instrumentation, and lyrical in-jokes to validate that the use of AI voice changers and generators is integral to the understanding of the song's overall meaning. I also draw upon the statements made by BOI WHAT himself about the song's process to show how, in his own words, the use of AI informs his own vocal delivery and production choices in his music. Though it is only one layer of his process, the use of AI technology both limits and expands the way he creates his sound.

AI in music is here to stay. Analyzing its real-world practice is essential to predicting its uses, both problematic and productive, as it develops further.

Jeremy Francoeur is a musicology PhD student at the University of Western Ontario. His research concerns the impact of the internet age and its technologies on music making and community-building, with a focus on queerness, self-identity, and creativity as resistance. He is the recipient of Clark University's 2022 Manero Prize for Musical Scholarship for his honors thesis on the hyperpop microgenre and its relation to queerness.

Paper Abstracts

Session 3 — Cultural, Political, and Economic Implications of AI Music

Session 3 · June 5, 2:00–2:30

Sonnet Swire

PhD Student in Musicology, UC Riverside

sswir001@ucr.edu

Prompt and Consequence: AI-Generated Music as 21st-Century Propaganda

Abstract

Drawing on music studies, media analysis, digital ethnography, and political theory, I analyze how users circulate AI-generated songs as symbolic content. "We Are Charlie Kirk," created by the anonymous act Spalexma, encodes Christian and nationalist values that function like coordinated messaging even without top-down coordination. Its genre choices — contemporary Christian worship and country anthems — carry racialized and class-coded meanings, historically functioning as sonic markers of white, rural, and working-class identity. AI tools, trained on datasets that reflect existing cultural stereotypes, replicate these associations, constructing an imagined audience as white, Christian, and economically aggrieved. Rather than neutralizing cultural bias, AI-generated music magnifies it, producing identity-coded content.

Ironic remixes of such songs can reinforce the narratives they appear to subvert: oppositional recuts retain the original melodic hooks while underscoring the shaping of cultural identity and group thinking. Timbre and arrangement carry ideological weight, a dynamic visible in historical parallels such as the Nazi promotion of martyrdom through song and country music's role in post-draft military recruitment — cases where musical familiarity lowered resistance to political messaging.

AI music generation now produces a new form of participatory mythmaking without direct state control, paralleling how 20th-century fascist governments fused messaging with expanding radio infrastructure. AI composition tools automate the replication of genre markers that musicologists identify as community-bonding devices. As tools such as Suno, AIVA, and Udio become normalized in classrooms and culture, debates over "responsible" inclusion obscure how AI is fundamentally reshaping identity construction and our sense of reality.

This dynamic is underscored by the circulation of Iran-aligned AI-generated LEGO rap videos, in which youthful aesthetics and rap genre conventions are deployed to deliver ideologically charged messaging to American audiences. The accessibility and low cost of AI-generated music further incentivize state-adjacent and independent actors to produce compelling content, lowering the barrier to sophisticated influence operations and making the cultural landscape increasingly difficult to navigate.

Sonnet Swire is a composer, musicologist, and journalist whose work connects modern challenges with art and media. She is currently a freelance editor for a top global news app and previously produced and wrote breaking news stories for one of the leading cable news networks. Sonnet focuses on political coverage and creates newsletters, alerts, and digital stories for national audiences. She started her career in investigative and data reporting, gaining experience at major national broadcast networks. In addition to journalism, she is an award-winning composer. She was a Composition Fellow at the Aspen Music Festival and received the Charles Ives Prize from the American Academy of Arts and Letters. Currently, she is a PhD student in Musicology at the University of California, Riverside. Her research examines how technology, storytelling, and messaging come together to reveal larger cultural and social trends.

Session 3 · June 5, 2:30–3:00

Alvaro E. Lopez

Electronic Musician, Technology Researcher, UC Riverside

alope083@ucr.edu

Navigating the Convergence of Artificial Intelligence and Music Composition: Labor and Attribution

Abstract

In current music production pipelines, composers are often requested to perform both style replication and quick production of convincing performance sequences ready for publishing. As those tasks involve largely mechanical and technical procedures, automated-music algorithms emerge as a potentially efficient solution. In film and video for example, when montage specificity achieved by temp clips or stock music leads directors to request equivalents, construction and style possibilities tend to be limited to the references' musical features. This re-elaboration of musical structures using genre constraints has occurred often in commercial and popular fields when musicians experiment through the inspiration of a particular piece or author. Analogously, emerging algorithms are able to perform style replication using references or caption. Private research funding into AI music is surging to fill a potential niche of AI music in the music production business.

As in other fields, human labor in music faces the impending possibility of replacement by automation. Business models of commissioned music may exploit vague definitions found on current authoring copyright laws in the field of synthetic music. Based on the legal framework, this paper explores possible scenarios for adaptation, assimilation, and revision of the music authorship concept in light of AI music. Starting by describing several perspectives of synthetic music reception that have had commercial viability, I examine current contractual frameworks for composers to find overlapping parameters. Then, I illustrate how style replication among human music delves in the blurry zone between copyright infringement and fair use through legal cases. Tying these perspectives, I gather judiciary and legal readings on copyright for AI materials and explore current and potential plagiarism scenarios to inquire our understandings of authorship. Finally, I formulate mechanisms and predictions of how AI technologies may be incorporated into business and authorship legal frameworks.

Alvaro Lopez, Ph.D, is an electronic musician, technology researcher, educator and composer. His research focuses on automated systems for music analysis, creativity, and education, and his invention the Progressive Adaptive Music Generator holds the patent US 12,427,419 B2. His studies involving procedural music generation in videogames, and real-time parametric scoring have been featured in the 12th ACM SIGPLAN International Workshop on Functional Art, Music, Modelling, and Design (FARM '24), the 5th North American Conference in Videogame Music at the University of Michigan, the Music and the Moving Image conference at New York University Steinhardt, the Art of Record Production Conference at Berklee College of Music, Boston, and The 2020 Joint Conference on AI Music Creativity at The Royal Institute of Technology (KTH), Stockholm, Sweden. His approach to interactive music generation is published in Sound Effects — An Interdisciplinary Journal of Sound and Sound Experience.

Organization

Committees & Support

Scientific / Program Committee

Paulo C. ChagasUniversity of California, Riverside

Gérard AssayagIRCAM, Paris

Nikolay MaslovUniversity of California, Riverside

Ivana Petković LozoUniversity of California, Riverside

Liz PrzybylskiUniversity of California, Riverside

Amy SkjersethUniversity of California, Riverside

Christophe KatribUniversity of California, Riverside

Steven LeffueUniversity of California, Riverside

Tatiana CatanzaroUniversity of California, Santa Barbara

Tae Hong ParkPurdue University, West Lafayette

Kerry HaganUniversity of Illinois Urbana-Champaign

Rodrigo SigalENES–UNAM, Morelia, Mexico

Marc BattierSorbonne University, Paris

Miriam AkkermannFreie Universität, Berlin

Patrick HartonoRMIT University, Ho Chi Minh City

Constantin BasicaCCRMA, Stanford University

Julie ZhuUniversity of Michigan

Celeste BetancurStanford University

Organizing Committee

Paulo C. Chagas (Chair)University of California, Riverside

Nikolay MaslovUniversity of California, Riverside

Ivana Petković LozoUniversity of California, Riverside

Liz PrzybylskiUniversity of California, Riverside

Amy SkjersethUniversity of California, Riverside

Christophe KatribUniversity of California, Riverside

Tatiana CatanzaroUniversity of California, Santa Barbara

Tae Hong ParkPurdue University, West Lafayette

Institutional Support

Center for Ideas and Society UCR ARTS RAISE@UCR Dept. of Music Dept. of Media & Cultural Studies Dept. of History of Art Dept. of Theater, Film, and Digital Production Dept. of Dance Dept. of Anthropology

↑ Back to top

Contents

Co-Creativity in Music, Sound, and AI

Conference Schedule

Keynote Lectures

Panel Discussion

Session 1 — Posthuman Voice and Distributed Agency

Session 2 — AI Systems and Co-Creative Practices

Session 3 — Cultural, Political, and Economic Implications of AI Music

Committees & Support