Through the use of DNA microarray it is now possible to obtain quantitative measurements of the expression of thousands of genes present in a biological sample. DNA arrays yield a global view of gene expression and they can be used in a number of interesting ways. In this paper we are investigating two different approaches for analyzing the analysis of data obtained from microarray experiments. The first one is a supervised learning process: a classification of cancer tissue using decision trees and Support Vector Machines (SVM). We constructed decision trees using the c4.5 algorithm. SVM are linear models in the space of attributes that can discriminate two classes of examples. After that, we will analyze the results achieved by a unsupervised learning method: clustering. This algorithm regroups set of examples that present similar characteristics, in particular, hierarchical clustering does it iteratively, starting from considering each sample as a single set.

Exploiting supervised and unsupervised learning techniques for profiling cancer data

GAMBERONI, Giacomo;LAMMA, Evelina;STORARI, Sergio;ARCELLI, Diego;FRANCIOSO, Francesca;VOLINIA, Stefano
2004

Abstract

Through the use of DNA microarray it is now possible to obtain quantitative measurements of the expression of thousands of genes present in a biological sample. DNA arrays yield a global view of gene expression and they can be used in a number of interesting ways. In this paper we are investigating two different approaches for analyzing the analysis of data obtained from microarray experiments. The first one is a supervised learning process: a classification of cancer tissue using decision trees and Support Vector Machines (SVM). We constructed decision trees using the c4.5 algorithm. SVM are linear models in the space of attributes that can discriminate two classes of examples. After that, we will analyze the results achieved by a unsupervised learning method: clustering. This algorithm regroups set of examples that present similar characteristics, in particular, hierarchical clustering does it iteratively, starting from considering each sample as a single set.
2004
microarrays; data mining; clustering
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in SFERA sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11392/1189412
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact