Time to Hire a Robot Psychologist? Evaluating a Corporate RAG Application
Odorizzi, Andrea; Mazzini, Gianluca
2024
Abstract
In this work, we assess the quality of an AI Retrieval-Augmented Generation (RAG) application. After building this AI web application, we designed and generated a synthetic dataset of questions and answers based on a set of documents. We then compared the performance of two distinct language models powering the application, according to a predefined set of metrics, when responding to the dataset's questions. In this project, we opted for open Large Language Models and ran them locally, without relying on any cloud-based service.

| File | Description | Type | License | Access | Size | Format |
|---|---|---|---|---|---|---|
| 310.pdf | Pre-print | Pre-print | Not public (private/restricted access) | Archive managers only | 1.43 MB | Adobe PDF |
| Time_to_Hire_a_Robot_Psychologist_Evaluating_a_Corporate_RAG_Application.pdf | Publisher's full text | Full text (publisher's version) | Not public (private/restricted access) | Archive managers only | 966.27 kB | Adobe PDF |
Documents in SFERA are protected by copyright and all rights are reserved, unless otherwise indicated.


