Funded Projects › HORIZON
LifeLU · Understanding the Language of Life: Identifying and Characterizing the Language Units in Protein Sequences
Proteins play a key role in biological processes that govern and maintain life. Although they are three-dimensional entities, they can be represented in textual form as sequences of amino acids that largely determine their structures and functions. By analogy with natural (human) languages, we can consider proteins as written with a language, which we refer to in this proposal as the ""language of life"". Natural languages can be read and understood by humans. However, we cannot yet understand the language of life. We do not even know what the vocabulary is, i.e., what the basic language units are (analogous to words in human languages). Textual representation of proteins has enabled the application of natural language processing (NLP) techniques to the study of proteins, and breakthrough results have been achieved in various downstream tasks such as protein structure prediction. However, these efforts remain only at the ""processing level"" of the language of life. The main goal of this project is to go beyond the level of language processing and open new research horizons for understanding the language of life. Using my expertise in NLP and bioinformatics, I will pursue the following objectives: (i) develop innovative methods to determine the language units (i.e., the vocabulary) of the language of life
Consortium · 1 organisation
BOGAZICI UNIVERSITESI
TR · €1,982,800
Research fields
← Find collaborators and more funded projects
Source: CORDIS, Publications Office of the European Union. Global Research Partnerships surfaces open EU research data to help you find collaborators; we are not affiliated with the European Union.