Homology
similarity that is the result of inheritance from a common ancestor
Orthologs
genes in different species that evolved from a common ancestral gene by speciation. Normally, orthologs retain the same function in the course of evolution.
Paralogs
genes related by duplication within a genome. paralogs evolve new functions, even if these are related to the original one.
An Alignment
an hypothesis of positional homology between bases or amino acids.
Homology versus Similarity
What are other set of related terms for sequence comparison
sequence similarity and sequence identity.
Sequence Identity
the percentage of matches of the same amino acid residues between two aligned sequences.
what are sequence similarity and sequence identity synonymous for
nucleotide sequences.
Sequence identity can be calculated in two different ways
1)I=[(Li ×2)/(La +Lb)]×100
La and Lb are the total lengths of each individual sequence.
Li is the number of aligned identical residues.
2) I(S)% = Li/La
where La is the length of the shorter of the two sequences.
What issues are associated with multiple sequence alignments
All sequences show some similarity (even random sequences, 25% for nucleotides, 5% for proteins).
Similarity levels might be high in some parts of the sequence and low in other parts.
Sequences might show substantial length variation and presence/absence of various domains.
What are the 3 main methods of alignment:
Why might manual alignment be carried out ?
– Alignment is easy.
– There is some extraneous information (structural).
– Automated alignment methods have encountered
the local minimum problem.
– An automated alignment method can be “improved”.
Progressive Alignment
a heuristic method and as such is not guaranteed to find the ‘optimal’ alignment.
Devised by Feng and Doolittle in1987.
What steps are involved in the ClustalW procedure
How is the ClustalW Pairwise Alignment done?
What is the Neighbor-joining method
*The neighbor-joining method is a greedy heuristic which joins at each step, the two closest sub-trees that are not already joined.
*One of the important concepts in the NJ method is neighbors, which are defined as two taxa that are connected by a single node in an unrooted tree
Multiple Alignment- First pair
What are the advantages and disadvantages of ClustalW