Vidéo pédagogique

Notice

Lieu de réalisation

Grenoble

Sous-titrage

Sous-titre

Langue :

Anglais

Crédits

François Rechenmann (Intervention)

Conditions d'utilisation

Ces ressources de cours sont, sauf mention contraire, diffusées sous Licence Creative Commons. L’utilisateur doit mentionner le nom de l’auteur, il peut exploiter l’œuvre sauf dans un contexte commercial et il ne peut apporter de modifications à l’œuvre originale.

DOI : 10.60527/2xdj-pa82

Citer cette ressource :

François Rechenmann. Inria. (2015, 5 février). 3.10. Gene prediction in eukaryotic genomes , in 3. Gene prediction. [Vidéo]. Canal-U. https://doi.org/10.60527/2xdj-pa82. (Consultée le 25 janvier 2026)

3.10. Gene prediction in eukaryotic genomes

Réalisation : 5 février 2015 - Mise en ligne : 9 mai 2017

document 1 document 2 document 3
niveau 1 niveau 2 niveau 3

Descriptif

If it is possible to have verygood predictions for bacterial genes, it's certainly not the caseyet for eukaryotic genomes. Eukaryotic cells have manydifferences in comparison to prokaryotic cells. You rememberthe existence of a nucleus and you also remember on one ofthe schemes in the first week that there are more structureswithin a eukaryotic cell. But the differences lie also inthe organization of the genomes. In eukaryotic genomes, the so-calledintergenic regions are very long. Intergenic regions are theregions which separate genes. A bacterial genome is very denseindeed, if you put your fingers somewhere on the genome, if itwas possible of course, it would be on the gene. If you do the sameon a eukaryotic genome, the probability is very very very highthat it is on an intergenic region. Indeed if you take the exampleof the human genome, less than 5% of the sequences of a human genome are made up of genes, 95 % of the humangenomes are not genes. What are they? This isstill an open question. Years ago a biologist spoke about germDNA to say, well DNA which is useless. Now the feeling is somewhat different,it certainly has a reason to exist. We understand some of thesereasons but not all.

Intervention / Responsable scientifique

Rechenmann

François

Ingénieur. Auteur d'une thèse de docteur-ingénieur en sciences appliquées (Grenoble INPG, 1976). - HDR. Directeur de thèse à Grenoble INPG (1990-1994-) et à l'université de Grenoble 1. Directeur de recherche au centre Inria Grenoble – Rhône-Alpes (2002, 2015)

Thème

Disciplines :

Documentation

Liens

Support de présentation au format PDF

Dans la même collection

Vidéo pédagogique

00:04:45

Favoris
3.3. Searching for start and stop codons

Rechenmann

François

We have written an algorithm for finding genes. But you remember that we arestill to write the two functions for finding the next stop codonand the next start codon. Let's see how we can do that. We
Biologie application informatique
DNA
Genome
Algorithm
Cell
09.05.2017
document 1 document 2 document 3
niveau 1 niveau 2 niveau 3
Vidéo pédagogique

00:05:58

Favoris
3.6. Boyer-Moore algorithm

Rechenmann

François

We have seen how we can make gene predictions more reliable through searching for all the patterns,all the occurrences of patterns. We have seen, for example, howif we locate the RBS, Ribosome
Biologie application informatique
DNA
Genome
Algorithm
Cell
09.05.2017
document 1 document 2 document 3
niveau 1 niveau 2 niveau 3
Vidéo pédagogique

00:05:41

Favoris
3.1. All genes end on a stop codon

Rechenmann

François

Last week we studied genes and proteins and so how genes, portions of DNA, are translated into proteins. We also saw the very fast evolutionof the sequencing technology which allows for producing
Biologie application informatique
DNA
Genome
Algorithm
Cell
09.05.2017
document 1 document 2 document 3
niveau 1 niveau 2 niveau 3
Vidéo pédagogique

00:05:35

Favoris
3.9. Benchmarking the prediction methods

Rechenmann

François

It is necessary to underline that gene predictors produce predictions. Predictions mean that you have no guarantees that the coding sequences, the coding regions,the genes you get when applying your
Biologie application informatique
DNA
Genome
Algorithm
Cell
09.05.2017
document 1 document 2 document 3
niveau 1 niveau 2 niveau 3
Vidéo pédagogique

00:06:22

Favoris
3.4. Predicting all the genes in a sequence

Rechenmann

François

We have written an algorithm whichis able to locate potential genes on a sequence but only on one phase because we are looking triplets after triplets. Now remember that the genes maybe located on
Biologie application informatique
DNA
Genome
Algorithm
Cell
09.05.2017
document 1 document 2 document 3
niveau 1 niveau 2 niveau 3
Vidéo pédagogique

00:07:06

Favoris
3.7. Index and suffix trees

Rechenmann

François

We have seen with the Boyer-Moore algorithm how we can increase the efficiency of spin searching through the pre-processing of the pattern to be searched. Now we will see that an alternative way of
Biologie application informatique
DNA
Genome
Algorithm
Cell
09.05.2017
document 1 document 2 document 3
niveau 1 niveau 2 niveau 3
Vidéo pédagogique

00:05:13

Favoris
3.2. A simple algorithm for gene prediction

Rechenmann

François

Based on the principle we statedin the last session, we will now write in pseudo code a firstalgorithm for locating genes on a bacterial genome. Remember first how this algorithm should work, we first
Biologie application informatique
DNA
Genome
Algorithm
Cell
09.05.2017
document 1 document 2 document 3
niveau 1 niveau 2 niveau 3
Vidéo pédagogique

00:04:45

Favoris
3.5. Making the predictions more reliable

Rechenmann

François

We have got a bacterial gene predictor but the way this predictor works is rather crude and if we want to have more reliable results, we have to inject into this algorithmmore biological knowledge. We
Biologie application informatique
DNA
Genome
Algorithm
Cell
09.05.2017
document 1 document 2 document 3
niveau 1 niveau 2 niveau 3
Vidéo pédagogique

00:06:09

Favoris
3.8. Probabilistic methods

Rechenmann

François

Up to now, to predict our gene,we only rely on the process of searching certain strings or patterns. In order to further improve our gene predictor, the idea is to use, to rely onprobabilistic methods
Biologie application informatique
DNA
Genome
Algorithm
Cell
09.05.2017
document 1 document 2 document 3
niveau 1 niveau 2 niveau 3

Voir tout

Avec les mêmes intervenants et intervenantes

Vidéo pédagogique

00:06:24

Favoris
1.8. Compressing the DNA walk

Rechenmann

François

We have written the algorithm for the circle DNA walk. Just a precision here: the kind of drawing we get has nothing to do with the physical drawing of the DNA molecule. It is a symbolic
Biologie application informatique
DNA
Genome
Algorithm
Cell
09.05.2017
document 1 document 2 document 3
niveau 1 niveau 2 niveau 3
Vidéo pédagogique

00:06:57

Favoris
2.7. The algorithm design trade-off

Rechenmann

François

We saw how to increase the efficiencyof our algorithm through the introduction of a data structure. Now let's see if we can do even better. We had a table of index and weexplain how the use of these
Biologie application informatique
DNA
Genome
Algorithm
Cell
09.05.2017
document 1 document 2 document 3
niveau 1 niveau 2 niveau 3
Vidéo pédagogique

00:06:22

Favoris
3.4. Predicting all the genes in a sequence

Rechenmann

François

We have written an algorithm whichis able to locate potential genes on a sequence but only on one phase because we are looking triplets after triplets. Now remember that the genes maybe located on
Biologie application informatique
DNA
Genome
Algorithm
Cell
09.05.2017
document 1 document 2 document 3
niveau 1 niveau 2 niveau 3
Vidéo pédagogique

00:04:11

Favoris
4.6. A path is optimal if all its sub-paths are optimal

Rechenmann

François

A sequence alignment between two sequences is a path in a grid. So that, an optimal sequence alignmentis an optimal path in the same grid. We'll see now that a property of this optimal path provides
Biologie application informatique
DNA
Genome
Algorithm
Cell
09.05.2017
document 1 document 2 document 3
niveau 1 niveau 2 niveau 3
Vidéo pédagogique

00:05:16

Favoris
5.1. The tree of life

Rechenmann

François

Welcome to this fifth and last week of our course on genomes and algorithms that is the computer analysis of genetic information. During this week, we will firstsee what phylogenetic trees are and how
Biologie application informatique
DNA
Genome
Algorithm
Cell
09.05.2017
document 1 document 2 document 3
niveau 1 niveau 2 niveau 3
Vidéo pédagogique

00:07:21

Favoris
1.3. DNA codes for genetic information

Rechenmann

François

Remember at the heart of any cell,there is this very long molecule which is called a macromolecule for this reason, which is the DNA molecule. Now we will see that DNA molecules support what is called
Biologie application informatique
DNA
Genome
Algorithm
Cell
09.05.2017
document 1 document 2 document 3
niveau 1 niveau 2 niveau 3
Vidéo pédagogique

00:05:41

Favoris
2.1. The sequence as a model of DNA

Rechenmann

François

Welcome back to our course on genomes and algorithms that is a computer analysis ofgenetic information. Last week we introduced the very basic concept in biology that is cell, DNA, genome, genes
Biologie application informatique
DNA
Genome
Algorithm
Cell
09.05.2017
document 1 document 2 document 3
niveau 1 niveau 2 niveau 3
Vidéo pédagogique

00:04:54

Favoris
2.9. Whole genome sequencing

Rechenmann

François

Sequencing is anexponential technology. The progresses in this technologyallow now to a sequence whole genome, complete genome. What does it mean? Well let'stake two examples: some twenty years ago,
Biologie application informatique
DNA
Genome
Algorithm
Cell
09.05.2017
document 1 document 2 document 3
niveau 1 niveau 2 niveau 3
Vidéo pédagogique

00:07:06

Favoris
3.7. Index and suffix trees

Rechenmann

François

We have seen with the Boyer-Moore algorithm how we can increase the efficiency of spin searching through the pre-processing of the pattern to be searched. Now we will see that an alternative way of
Biologie application informatique
DNA
Genome
Algorithm
Cell
09.05.2017
document 1 document 2 document 3
niveau 1 niveau 2 niveau 3
Vidéo pédagogique

00:03:59

Favoris
4.3. Measuring sequence similarity

Rechenmann

François

So we understand why gene orprotein sequences may be similar. It's because they evolve togetherwith the species and they evolve in time, there aremodifications in the sequence and that the sequence
Biologie application informatique
DNA
Genome
Algorithm
Cell
09.05.2017
document 1 document 2 document 3
niveau 1 niveau 2 niveau 3
Vidéo pédagogique

00:04:49

Favoris
5.3. Building an array of distances

Rechenmann

François

So using the sequences of homologous gene between several species, our aim is to reconstruct phylogenetic tree of the corresponding species. For this, we have to comparesequences and compute distances
Biologie application informatique
DNA
Genome
Algorithm
Cell
09.05.2017
document 1 document 2 document 3
niveau 1 niveau 2 niveau 3
Vidéo pédagogique

00:04:28

Favoris
1.6. GC and AT contents of DNA sequence

Rechenmann

François

We have designed our first algorithmfor counting nucleotides. Remember, what we have writtenin pseudo code is first declaration of variables. We have several integer variables that are variables which
Biologie application informatique
DNA
Genome
Algorithm
Cell
09.05.2017
document 1 document 2 document 3
niveau 1 niveau 2 niveau 3