Vidéo pédagogique
Lieu de réalisation
Langue :
François Rechenmann (Intervention)
Conditions d'utilisation
Ces ressources de cours sont, sauf mention contraire, diffusées sous Licence Creative Commons. L’utilisateur doit mentionner le nom de l’auteur, il peut exploiter l’œuvre sauf dans un contexte commercial et il ne peut apporter de modifications à l’œuvre originale.
DOI : 10.60527/tjnv-nb73
Citer cette ressource :
François Rechenmann. Inria. (2015, 5 février). 4.3. Measuring sequence similarity , in 4. Sequences comparison. [Vidéo]. Canal-U. (Consultée le 24 juillet 2024)

4.3. Measuring sequence similarity

Réalisation : 5 février 2015 - Mise en ligne : 9 mai 2017
  • document 1 document 2 document 3
  • niveau 1 niveau 2 niveau 3

So we understand why gene orprotein sequences may be similar. It's because they evolve togetherwith the species and they evolve in time, there aremodifications in the sequence and that the sequence may still besimilar, similar enough again to retrieve information on onesequence to transfer it to another sequence of interest. So thequestion now is how can we measure this similarity between twosequences for the moment. The first approach to similarityis a very simple one is to apply a distance which is calledhere the Editing System or the Hamming Distance.The idea is very basic. You would take two sequences likethese two sequences here and you look at the differences and youcount the number of differences. Here, for example, you have twodifferences so you will say that the distance, the similaritybetween the two sequences, the distance is two. Here wehave another pair of sequences which are less similar becausethey are three differences. That's quite nice, it'sa hamming distance. Is it really a distance? A distance is a mathematicalconcept and to be a distance, it must satisfy three conditions:the distance between a sequence and itself must be zero, a sequencebetween a sequence and another one must be the same betweenthe last one and the first one and we must have this inequalitywhich is always verified.


Dans la même collection

Avec les mêmes intervenants et intervenantes