ServicenavigationHauptnavigationTrailKarteikarten


Forschungsstelle
COST
Projektnummer
C02.0014
Projekttitel
Knowledge Discovery in Functional Proteomics (FunProt)
Projekttitel Englisch
Knowledge Discovery in Functional Proteomics (FunProt)

Texte zu diesem Projekt

 DeutschFranzösischItalienischEnglisch
Schlüsselwörter
-
-
-
Anzeigen
Forschungsprogramme
-
-
-
Anzeigen
Kurzbeschreibung
-
-
-
Anzeigen
Partner und Internationale Organisationen
-
-
-
Anzeigen
Abstract
-
-
-
Anzeigen
Datenbankreferenzen
-
-
-
Anzeigen

Erfasste Texte


KategorieText
Schlüsselwörter
(Englisch)
Knowledge discovery; machine learning; data and text mining; proteomics
Forschungsprogramme
(Englisch)
COST-Action 282 - Knowledge Exploration in Science and Technology
Kurzbeschreibung
(Englisch)
See abstract
Partner und Internationale Organisationen
(Englisch)
AT, BE, BG, CY, EE, FR, DE, IE, IT, MT, NO, PL, PT, SK, ES, CH, UK
Abstract
(Englisch)
A major research topic in functional genomics and proteomics is the search for biomarkers, or meaningful indicators of biological states or processes. Of special interest are disease biomarkers, which can be used for disease diagnosis, monitoring, and control. Genomics-based approaches have made it possible to identify changes in gene expression levels as markers of certain diseases. More recently, there has been increasing interest in the proteome which, contrary to the genome, reflects both the organism's genetic blueprint and the impact of its environment. As a consequence, mass spectrometry is emerging as a vital tool for biomarker discovery. Mass spectra of biological samples such as serum are analysed using advanced data mining techniques in view of extracting diagnostic protein patterns. To extract biomarker patterns from protein mass spectra, data miners must face a technological challenge which has become the focus of research in the FunProt project. This is the so-called high-dimensionality-for-small-sample (HDSS) or the p >> n problem -- that is, the number of variables (p) is much larger than the number of cases (n). Most data analysis algorithms fail to generalize or break down completely in the presence of datasets where p >> n. To tackle this challenge, two alternative approaches have been explored in FunProt. The first is preliminary dimensionality reduction: prior to data mining, the number of variables or features is reduced via feature (subset) selection, feature clustering, or feature transformation. The second is the investigation of learning algorithms that are either inherently insensitive to the HDSS problem (e.g. support vector machines), or that comprise built-in mechanisms for reducing the number of variables on the fly (e.g. shrunken centroids). Project work has led to the extraction of biomarker patterns for Alzheimer's disease and cerebrovascular accidents in collaboration with the Biomedical Proteomics Research Group of the Faculty of Medicine. Research results have been presented in artificial intelligence conferences and in major specialized journals such as Proteomics, Biosilico, and Mass Spectrometry Reviews.
Datenbankreferenzen
(Englisch)
Swiss Database: COST-DB of the State Secretariat for Education and Research Hallwylstrasse 4 CH-3003 Berne, Switzerland Tel. +41 31 322 74 82 Swiss Project-Number: C02.0014