Shared Flashcard Set

Details

Lecture 2
Remote computing
41
Biology
Graduate
01/20/2015

Additional Biology Flashcards

 


 

Cards

Term
Comman lines
Definition
  • some programs do not have GUI
  • keeping track of your work
  • almost all scientific servers run Linux/Unix
Term
paraellelism
Definition
multiple processsors to a single problem
Term
Access to remote server
Definition
r476519@inode1.hpc.kuleuven.be
Term
copying files to and from server
Definition
  • on linux or mac:stfp or scp
  • on windows putt: pscp or transfer window
Term
systeny
Definition
  • conservation of the order of functional elements in regions of the genome between species 
  • assign function and elucidate evolution
Term
DNAse sensitivity
Definition
chromatin sensitive to cleavage by the DNase 1 enzyme has lost its condensed structure exposing dna making it accessible
Term
CHip-seq for DNA binding proteins
Definition
  • transcription factors 
  • rna polymerase
  • histone modifications
Term
histones
Definition
  • protein that package and order DNA into structural units 
  • histones are subject to a wide variaty of post-translational mods
  • these are: enhancer regions promotors transctription and start sites
Term
Genetic Variation
Definition
  • functional effects: coding changes,splice sites, regulatory changes
  • non-function effects : synonymous, introns, intergenic
  • SNPs, Indels, CNVs/SVs
Term
Homology
Definition
  • similarity in sequence due to shared ancestry
Term
Orthology
Definition
  • gene that descended from the same ancestral gene seperated by a speciation even
Term
Speciation
Definition
mutations and adaptation drive sub-pop into sexual incompatablility
Term
paralogy
Definition
genes that arose from a duplication event within a genome
Term
Can amino acids replace each other?
Definition
  • residue charge
  • size 
  • hydrophobicity
  • rigidity in the an
  • a.a. chain
  • capacity to form ion and disulfide bonds
  • specific roles in active sites
Term
BLOSUM-L matrix
Definition
  • BLOcks SUbsitution Matrix
  • ungapped local alignments of protein families
  • group sequances with more than L% indentical amino acids
  • derive the sunsituiton matrix with the observed substituion of frequency of amino acids between different groups
Term
Point Accepted Mutation
Definition
  • PAM 
  • is a subsitution of an amino acid tolerated by natural selection as the protein function does not change
Term
PAM unit
Definition
time in which 1% of the amino acids in a protein sequence undergo a PAM
Term
Global alignment
Definition
  • finds the best alignment across the whole sequences
  • used for highly similar sequences of almost the same length
Term
Local alignment
Definition
  • finds regions of high similarity in parts of the sequences 
  • finding sub-sequences within sequences that may have a relation
  • best for sequences that share some degree of similarity 
Term
Gaps
Definition
  • compensate for insertions and deletions between the sequences
Term
Gap Penalty
Definition
  • serves to keep gaps at reasonable number 
  • a cost for openning a gap
  • cost for the extension or size of the gap
Term
Alignment algorithms
Definition
  • use dynamic programming to find the alignment with highest score
  • simth-waterman(local)
  • needleman-Wunsch(global)
Term
dynamic programming
Definition
  • split a problem into small problems, solve each subproblem one at a time, combine the solutions to the subproblem to an overall solution
  • advantage is that is very effcient to solve just a series of small problems
  • someitmes local solutions may not lead to global optimals
Term
Needleman-Wunsch
Definition
  • finds the globally optimal alignment for 2 sequences given a scoring function
  • best means highest score for aligning both whole sequences
Term
BLAST
Definition
  • basic local alignment search tool
  • calculates similarity for biological sequences
  • produces local alignment 
  • uses statistical theory to detemine if match is by chance
Term
blastp
Definition
  • protein-protein 
  • compares an amino acid sequence against a protein sequence database
Term
blastn
Definition
  • nucl-nucl
  • compares a nucleotide query sequence against a nucleotide sequence database(for speed not sensitivity)
Term
blastx
Definition
  • translated nucl-protein
  • compares the six-frame conceptual translation products of a nucleotide query against a protein sequence database
Term
tblastn
Definition
  • protein translated nucl
  • compares a protein query sequence against a sequence database dynamically translated in all six reading frames
Term
tblastx
Definition
  • translated nucl-translated-nucl
  • compares six frame translation of a nucleotide query sequence against the six frame translations of a nucleotide sequance db
Term
TSS
Definition
transcript start site
Term
TATA box
Definition
  • promotor region in higher eukaryites located 28-34 bp upstream of the TSS and is associated with strong tissue specific promoters
Term
sequence motif
Definition
a nucleotide or amino acid sequence pattern that is widespread and has  a biological significance
Term
phmmer
Definition
search one or more query protein sequence against a protein sequence database
Term
hhmmscan
Definition
search protein sequance against a collection of profiels
Term
hmmsearch
Definition
uses to search one or more profiles against a protein database
Term
jackhmmer
Definition

iteratively search a query protein sequence,

multiple-sequence alighnment or profile HMM against the target protein sequence DB

Term
PFAM
Definition
  • database of protein domain families
  • high quality manually curated multiple sequence alignments and profile HMMs
Term
domain
Definition
independent evolving unit (conserved subsequences)
Supporting users have an ad free experience!