Voyages dans les systèmes obscurs > Is there a text in this dataset?

/fr == path_langue == path_year

Edit 1

> 11:30

Is there a text in this dataset?

Niccolò Monti
Conférence

Places limitées...
reservations obligatoires !
( 23, 24, 25 novembre )

vendredi 24 novembre

> La Générale > Amphithéâtre

> 39 rue Gassendi, 75014 Paris

> Amphithéâtre

A large language model requires vast and diverse amounts of data to function as a credible natural language generator. One of the most used datasets to achieve it is The Pile, whose components include Books3, the largest repository of .txt files to train models on. Its adoption was meant to refine the prose writing of LLMs, to make them generate believable texts from textual data. But the labor and methods that went into composing it, together with the authorship and ownership of the texts, have to be inquired. This talk aims to peer into some material, technical and semiotic aspects of text-based datasets for NLP, to highlight the political—as well as artistic—issues that they raise.

Niccolò Monti is a PhD candidate in joint-supervision between the Universities of Turin and Paris 8. He adopts an historical and semiotic approach toward the study of automatic methods in literary writing, from the Surrealists to AI. He has written of electronic literature and prompting, of the creativity of automatism, of cybernetics and semantics. He also does research on literary European avant-gardes, especially the Modernist works of James Joyce and Samuel Beckett. He is a member of a literary collective, named Montag, experimenting with digital simultaneous writing, textual generation and speculative fiction.

https://linktr.ee/icareide

2023/11_24/niccolo-monti_is-there-a-text-in-this-dataset /2023/11_24/niccolo-monti_is-there-a-text-in-this-dataset/ /2023/11_24/niccolo-monti_is-there-a-text-in-this-dataset/cover/

path_year :: 2023 page.url :: /fr/2023/11_24/niccolo-monti_is-there-a-text-in-this-dataset IS SUB 1 ( of year == JOUR )

Programme (2023)#

Jeudi 23 novembre

“introduction” de 9h30 à 10h avec les organisateurs du colloque Pierre Cassou-Noguès, Stéphane Degoutin, Arnaud Regnaud et Gwenola Wagon

10:00 La panne et le pas :
les aventures de l'architecture von Neumann Baptiste Loreaux
Conférence

10:45 Bibliothèques de l'ombre Vincent Bonnefille
Conférence

11:30 Une matérialité sans visibilité ?
Une approche géopolitique et
socio-technique du cloud Clotilde Bômont
Conférence

12:30 Repas

jeudi 23 novembre
> La Générale > Amphithéâtre
> 39 rue Gassendi, 75014 Paris
> Amphithéâtre

14:00 Paper Scraps Lorena Lisembard
Conférence-performance

jeudi 23 novembre
> La Générale > Amphithéâtre
> 39 rue Gassendi, 75014 Paris
> Amphithéâtre

14:30 Price Per Slot Nicolas Bailleul
Performance

jeudi 23 novembre
> La Générale > Amphithéâtre
> 39 rue Gassendi
> Amphithéâtre

15:00 Ciel noir :
Histoires de collisions en espaces profonds Angelica Ceccato
Présentation vidéo

15:30 Des taches de soleil sur le Tout va mal Judith Deschamps
Conférence-performance

16:10 L’efficience planétaire et le démon inorganique Guillaume Boissinot
Conférence

16:30 Table Ronde Intervention collective
Discussion

jeudi 23 novembre
> La Générale > Amphithéâtre
> 39 rue Gassendi, 75014 Paris
> Amphithéâtre

17:00 Codex Spatium
(Session #5) Raphaël Costa et Julien Prévieux
Jeu

jeudi 23 novembre
> La Générale > Amphithéâtre
> 39 rue Gassendi, 75014 Paris
> Amphithéâtre

Places limitées...
reservations obligatoires !
( 23, 24, 25 novembre )

Vendredi 24 novembre

09:30 Opacity and Transparency in Games.
Metroidvania versus Free View
/ Open World Mathias Fuchs
Conférence

10:30 Créateurs des mondes persistants Hortense Boulais-Ifrène
Conférence

11:00 Systemes sociaux et
obscurantisme chez Philip K. Dick Antonin Premillieu
Conférence

11:30 Is there a text in this dataset? Niccolò Monti
Conférence

vendredi 24 novembre
> La Générale > Amphithéâtre
> 39 rue Gassendi, 75014 Paris
> Amphithéâtre

12:30 Repas

vendredi 24 novembre
> La Générale > Amphithéâtre
> 39 rue Gassendi, 75014 Paris
> Amphithéâtre

14:00 L'ADN en jeu.
Transparence et opacité de la génétique de loisir Tania Ruiz et Maria Hellström
Conférence

15:00 Non-musique / non-climat / non-pensée / non-lieux / non-images / non-souris Stéphane Degoutin et Aymeric Duriez
Conférence-performance

16:00 Voyage en télémarkette Diane Rabreau et Audrey Carmes
Création sonore

17:00 Une enquête aux confins de la métropole Clémence Seurat
Conférence

Samedi 25 novembre

10:00 Internet Tour Mario Santamaría accompagné de François Muzard et Adrien Tournier
Performance, sortie

samedi 25 novembre
> RDV SAMEDI 10H À LA SORTIE DU MÉTRO AIMÉ CÉSAIRE LIGNE 12
> Bus