S-131 | Analysis of Semantic Bias in ChatGPT: Generating Word Definitions by Extension

SAN 2024 Annual Meeting

Theoretical and Computational Neuroscience
Author: Facundo Ariel Totaro | Email: facutotaro@gmail.com


Facundo Ariel Totaro, Julieta Laurino, Laura Kaczer, Juan Kamienkowski¹,²,⁴, Bruno Bianchi¹,²

1. Universidad de Buenos Aires, Facultad de Ciencias Exactas y Naturales, Departamento de Computación, Buenos Aires, Argentina.
2. CONICET-Universidad de Buenos Aires, Instituto de Ciencias de la Computación (ICC), Buenos Aires, Argentina.
3. Universidad de Buenos Aires, Departamento de Fisiología, Biología Molecular y Celular, Buenos Aires, Argentina.
4. Universidad de Buenos Aires, Facultad de Ciencias Exactas y Naturales, Maestría en Explotación de Datos y Descubrimiento del Conocimiento, Buenos Aires, Argentina.

The development of the Transformer architecture in 2017 enabled the creation of Language Models (LMs) capable of interacting fluently with humans. This fluency prompts the question of whether LMs process language the way humans do. One process of interest is the mechanism by which these models assign meaning to words. Given that a large percentage of words have more than one meaning (i.e., are polysemous or homonymous), this study aimed to investigate the mechanisms by which LMs disambiguate word meaning. However, current LMs do not process words directly but smaller units called tokens, produced by a tokenization method called Byte-Pair Encoding (BPE), under which a word is generally represented by more than one token. This complicates word-level analyses. In this study, we proposed replacing GPT-2's BPE tokenization with word-level tokenization and analyzing how this change affects the results of behavioral experiments on meaning disambiguation. To this end, a Spanish pretrained model was fine-tuned on a new text corpus. Both models (the original and the fine-tuned one) were then analyzed, showing that the model with word-level tokenization disambiguates meanings to a greater extent than the model with the BPE tokenizer. We conclude that word-level tokenization significantly impacts the disambiguation of polysemous words, making these models better suited for studying such tasks.
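To make the tokenization issue concrete, the following minimal sketch (an illustration, not the authors' code) shows how a BPE tokenizer typically splits a single Spanish word into several subword tokens. It assumes the Hugging Face transformers library, and the checkpoint name DeepESP/gpt2-spanish is a stand-in, since the abstract does not name the exact pretrained model used:

    from transformers import AutoTokenizer

    # Load a Spanish GPT-2 BPE tokenizer (checkpoint name is illustrative).
    tokenizer = AutoTokenizer.from_pretrained("DeepESP/gpt2-spanish")

    word = "desambiguación"
    tokens = tokenizer.tokenize(word)
    print(tokens)
    # A BPE vocabulary typically yields several subword pieces for this word,
    # so any word-level measure must first aggregate over those pieces.
    # A word-level tokenizer would instead map the word to a single token,
    # which is what makes word-level behavioral analyses more direct.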
