COMPRESS TO CREATE
COMPRESS TO CREATE
Jean-pierre Briot
30/09/2023
80-113
5
The current tsunami of deep learning has already conquered new areas, such as the generation of creative content (images, music, text). The motivation is in using the capacity of modern deep learning architectures and associated training and generation techniques to automatically learn styles from arbitrary corpora and then to generate samples from the estimated distribution, with some degree of control over the generation. In this article, we analyze the use of autoencoder architectures and how their ability for compressing information turns out to be an interesting source for generation of music. Autoencoders are good at representation learning, that is at extracting a compressed and abstract representation (a set of latent variables) common to the set of training examples. By choosing various instances of this abstract representation (i.e., by sampling the latent variables), we may efficiently generate various instances within the style which has been learnt. Furthermore, we may use various approaches for controlling the generation, such as interpolation, attribute vector arithmetics, recursion and objective optimization, as will be illustrated by various examples. Before concluding the article, we will discuss some limitations of autoencoders, introduce the concept of variational autoencoders and briefly compare their respective merits and limitations for generating music.
Ler mais...Deep learning, Autoencoder, Latent variables, Music generation, Control.
EDUCAÇÃO, MÚSICA E ARTES: REFLEXÕES E DESAFIOS CONTEMPORÂNEOS
Esta obra está licenciada com uma Licença Creative Commons Atribuição-NãoComercial-SemDerivações 4.0 Internacional .
O conteúdo dos capítulos e seus dados e sua forma, correção e confiabilidade, são de responsabilidade exclusiva do(s) autor(es). É permitido o download e compartilhamento desde que pela origem e no formato Acesso Livre (Open Access), com os créditos e citação atribuídos ao(s) respectivo(s) autor(es). Não é permitido: alteração de nenhuma forma, catalogação em plataformas de acesso restrito e utilização para fins comerciais. O(s) autor(es) mantêm os direitos autorais do texto.