Term
|
Definition
| Variables that can store multiple values, each retrieved using an integer index |
|
|
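A minimal sketch of integer indexing, assuming a Python list as the multi-valued variable (the card does not name a language, and the variable names here are illustrative only):

```python
# A Python list stands in for the multi-valued variable described in the card.
codons = ["ATG", "GCC", "TAA"]

first = codons[0]    # indices start at 0, so this is the first element
last = codons[-1]    # negative indices count back from the end
codons[1] = "GCG"    # an index can also be used to overwrite a stored value
```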
Term
| BLAST |
Definition
| Basic Local Alignment Search Tool - an algorithm for comparing biological sequences, such as the nucleotide sequences of DNA or the amino-acid sequences of proteins |
|
|
Term
| National Center for Biotechnology Information |
|
Definition
| Organization that manages GenBank, BLAST, and Entrez |
|
|
Term
| 3 parts of a GenBank flat file |
|
Definition
Header - contains lookup info such as the definition, locus, and accession #
Features - contains features of the molecule, such as CDS and gene
Sequence - the actual sequence |
|
|
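The three parts can be separated with a few lines of Python. This is only a sketch: it assumes the usual flat-file layout in which a FEATURES line opens the feature table, an ORIGIN line opens the sequence, and // closes the record. The function name split_genbank_record and the sample record are hypothetical, made up for illustration.

```python
def split_genbank_record(text):
    """Split one GenBank flat-file record into header, features and sequence."""
    parts = {"header": [], "features": [], "sequence": []}
    section = "header"  # everything before FEATURES belongs to the header
    for line in text.splitlines():
        if line.startswith("FEATURES"):
            section = "features"
        elif line.startswith("ORIGIN"):
            section = "sequence"
            continue  # the ORIGIN line itself carries no bases
        elif line.startswith("//"):
            break  # end-of-record marker
        parts[section].append(line)
    # Sequence lines look like "        1 atggcctaa": keep only the letters.
    sequence = "".join(
        ch for line in parts["sequence"] for ch in line if ch.isalpha()
    )
    return "\n".join(parts["header"]), "\n".join(parts["features"]), sequence


# Hypothetical record, not a real GenBank entry.
record = """\
LOCUS       EXAMPLE1      9 bp    DNA     linear   UNA 01-JAN-2000
DEFINITION  Hypothetical example record.
ACCESSION   EXAMPLE1
FEATURES             Location/Qualifiers
     gene            1..9
     CDS             1..9
ORIGIN
        1 atggcctaa
//
"""

header, features, sequence = split_genbank_record(record)
```

For real work, an established parser such as Biopython's SeqIO module is a safer choice than hand-rolled splitting.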
Term
| PAM matrices (Percentage of Acceptable point Mutations per 10^8 years) |
|
Definition
| are based on global alignments of closely related proteins. |
|
|
Term
| BLOSUM (BLOcks SUbstitution Matrix) |
|
Definition
| based on local multiple alignments of more distantly related sequences |
|
|
Term
|
Definition
Comparisons of closely related sequences should use BLOSUM matrices with higher numbers and PAM matrices with lower numbers.
Comparisons of distantly related proteins should use BLOSUM matrices with lower numbers and PAM matrices with higher numbers. |
|
|
Term
| Steps of the BLAST algorithm |
|
Definition
1. Preprocessing of query: for each position p in the query, find the list of words that score greater than T when paired with the query word starting at p (the w_p word)
2. Generation of hits: for each word list, find all exact matches (hits) with the database sequences
3. Extension of hits: for each hit, extend the alignment in both directions to find alignments whose scores are greater than the threshold S |
|
|
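The three steps can be sketched as a toy Python program. Several things here are assumptions made for illustration and are not part of the card: a DNA alphabet, a +1/-1 match/mismatch score in place of a substitution matrix, ungapped extension that simply runs to the sequence ends and keeps the best score, and all function names and default parameter values.

```python
from itertools import product

ALPHABET = "ACGT"  # assumption: DNA sequences


def score(a, b, match=1, mismatch=-1):
    """Score two equal-length strings position by position (toy scheme)."""
    return sum(match if x == y else mismatch for x, y in zip(a, b))


def neighborhood(query, w, T):
    """Step 1: for each query position p, list the words scoring > T
    when paired with the query word starting at p."""
    words = ["".join(t) for t in product(ALPHABET, repeat=w)]
    return {
        p: [cand for cand in words if score(query[p:p + w], cand) > T]
        for p in range(len(query) - w + 1)
    }


def find_hits(word_lists, subject, w):
    """Step 2: find exact matches of the listed words in a database sequence."""
    hits = []
    for p, words in word_lists.items():
        wanted = set(words)
        for s in range(len(subject) - w + 1):
            if subject[s:s + w] in wanted:
                hits.append((p, s))
    return hits


def extend(query, subject, p, s, w):
    """Step 3: ungapped extension in both directions, keeping the best score."""
    best = total = score(query[p:p + w], subject[s:s + w])
    right = left = 0
    i = 1
    while p + w - 1 + i < len(query) and s + w - 1 + i < len(subject):
        total += score(query[p + w - 1 + i], subject[s + w - 1 + i])
        if total > best:
            best, right = total, i
        i += 1
    total = best  # continue leftwards from the best right-extended score
    i = 1
    while p - i >= 0 and s - i >= 0:
        total += score(query[p - i], subject[s - i])
        if total > best:
            best, left = total, i
        i += 1
    # (score, query start, subject start, alignment length)
    return best, p - left, s - left, w + left + right


def toy_blast(query, subject, w=3, T=0, S=4):
    """Run the three steps and keep alignments scoring above S."""
    results = set()
    for p, s in find_hits(neighborhood(query, w, T), subject, w):
        alignment = extend(query, subject, p, s, w)
        if alignment[0] > S:
            results.add(alignment)
    return sorted(results, reverse=True)


alignments = toy_blast("ACGTAC", "TTACGTACTT")
```

With these inputs the top-scoring entry in alignments is the full-length exact match of the query starting at subject position 2. Real BLAST stops an extension once the score drops too far below the best seen so far, which this sketch omits for brevity.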