| Term 
 | Definition 
 
        | an accurate sequence of the human genome was completed in 2003 by 2013 the genomes of >7000 species have been sequenced
 |  | 
        |  | 
        
        | Term 
 
        | THE GENERAL IDEAS BEHIND GENOME SEQUENCING |  | Definition 
 
        | fragment the genome, cloning DNA fragments, sequencing DNA fragments, reconstructing the genome sequences from fragments, and analyzing the infor found in genome |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | recognizes sequences of bases anywhere within the genomes.   cuts sugar-phosphate   restriction fragments are generated by digestion of DNA with restrict enzymes |  | 
        |  | 
        
        | Term 
 
        | RECOGNITION SITES FOR RESTRICTION ENZYMES |  | Definition 
 
        | are usually 4-8 bp of double-stranded DNA often palindromic- read 5'-3'
 cuts at same place relative to its specific recognition
 |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | cuts are straight through both DNA strands at the line of symmetry |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | cuts are displaced equally on either side of the line of symmetry |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | is 4^n where n is the number of bases in the recognition site |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | average restriction fragment is 256 bp |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | average restriction fragment size is 4100 bp |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | can be used to fragment DNA at random locations can break phosphodiester bonds
 pass DNA through a thin needle and sonication
 |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | distinguishes DNA fragments according to size place in buffered solution and send electrodes through gel so charged particles move negative to positive
 |  | 
        |  | 
        
        | Term 
 
        | AFTER GEL ELECTROPHORESIS |  | Definition 
 
        | visualize DNA fragments by staining gel with fluorescent dye, and photograph gel under uv light |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | genomes of animals, plants and microorganisms are too large to analyze all at once |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | is a means to purify a specific DNA fragment away from all other fragments and make many identical copies of that fragment |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | insert DNA fragments into cloning vectors to make a recombinant DNA molecule transport recombinant DNA into living cells to be copied
 |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | cuts are straight through both DNA strands at the line of symmetry |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | cuts are displaced equally on either side of the line of symmetry |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | is 4^n where n is the number of bases in the recognition site |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | average restriction fragment is 256 bp |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | average restriction fragment size is 4100 bp |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | can be used to fragment DNA at random locations can break phosphodiester bonds
 pass DNA through a thin needle and sonication
 |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | distinguishes DNA fragments according to size place in buffered solution and send electrodes through gel so charged particles move negative to positive
 |  | 
        |  | 
        
        | Term 
 
        | AFTER GEL ELECTROPHORESIS |  | Definition 
 
        | visualize DNA fragments by staining gel with fluorescent dye, and photograph gel under uv light |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | genomes of animals, plants and microorganisms are too large to analyze all at once |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | is a means to purify a specific DNA fragment away from all other fragments and make many identical copies of that fragment |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | insert DNA fragments into cloning vectors to make a recombinant DNA molecule transport recombinant DNA into living cells to be copied
 |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | cloning vectors have 2 main features: - origin of replication
 - restriction sites(s) for clonin insert DNA
 - selectable marker ex-> antibiotic resistance
 |  | 
        |  | 
        
        | Term 
 
        | ALTERNATE VECTORS THAT CAN CARRY LARGE INSERTS |  | Definition 
 
        | bacterial artificial chromosomes and yeast artificial chromosomes |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | of the vectors and human genomic DNA with a restriction enzyme results in complementary sticky ends |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | is used to seal the phosphodiester backbones between vector and inserts |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | host cells take up and amplify recombinant DNA |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | the process by which a cell organism takes up foreign DNA |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | only .1% of cells will be transformed with plasmid |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | only the cells with plasmid will grow on on media with ampicillin |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | multiplies to produce millions of genetically identical cells, each with recombinant plasmid |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | long-lived collection of cellular clones that contains copies of every sequence in the whole genome inserted into a suitable vector |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | contains a different recombinant plasmid, each with a part of the human genome |  | 
        |  | 
        
        | Term 
 
        | A PERFECT GENOMIC LIBRARY |  | Definition 
 
        | has one copy of every sequence in the entire genome. # of clones= length of genome/avg size of inserts
 |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | number of clones in a perfect library |  | 
        |  | 
        
        | Term 
 
        | IMPOSSIBLE TO OBTAIN A PERFECT LIBRARY |  | Definition 
 
        | -usually libraries are made that have 4-5 genomic eqivalents - gives an avg of 4-5 clones for each locus
 |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | uses DNA polymerase to make new DNA |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | a single strand of DNA to copy |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | -short single stranded DNA molecule that is complementary to the template - designed to be complementary to plasmid sequence adjacent to the unknown insert
 |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | -is a good template for sanger sequencing - template and primer interact through hybridization
 |  | 
        |  | 
        
        | Term 
 
        | SANGER SEQUENCING GENERATES |  | Definition 
 
        | -a series of single stranded DNA fragments - fragments include primer and nucleotides complementary to unknown DNA
 |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | they each differ in length from proceeding and succeeding fragment by one nucleotide |  | 
        |  | 
        
        | Term 
 
        | DIDEOXYNUCLEOTIDE (ddNTP) |  | Definition 
 
        | -terminates DNA synthesis -labeled with a different fluorescent dye for detection of the sequence
 |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | separated by size using electrophoresis |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | migrate quickly and appear at the bottom of the gel |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | after electrophoresis, fragments flow through a detector and the color of the fragment is digitally recorded |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | reads of sequence complementary to the template strand read 5'-3'
 ambiguity recorded as "N"
 |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | began using the hierarchical strategy BAC genomic library
 identify overlapping BAC clones
 Shear DNA from each BAC
 sequence DNA -assemble sequences 4 overlap
 |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | developed the whole-genome shotgun strategy for genome sequencing |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | to make shotgun (uncharacterized) clones shotgun approach can be highly automated
 |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | -was partially sequenced from both ends - pairs of sequence gives spatial information
 - the ends were sequenced from 3 libraries with different insert sizes
 |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | helps overcome problems due to repetitive DNA |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | -may be part of a protein-coding exon -DNA can be read in 6 reading frames
 -ORF is uninterrupted by stop codons
 |  | 
        |  | 
        
        | Term 
 
        | SPECIES RELATEDNESS AND GENOME CONSERVATION |  | Definition 
 
        | -branch points represent a series of nested common ancestors -# at each branch point is million years before the present
 |  | 
        |  | 
        
        | Term 
 
        | DNA SEQUENCES THAT ARE CONSERVED |  | Definition 
 
        | in highly divergent species may be part of a gene human vs zebrafish -> genome 2% conserved and protein coding is 82% conserved
 |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | can be discovered using a computerized visualization tool |  | 
        |  | 
        
        | Term 
 | Definition 
 | 
        |  | 
        
        | Term 
 | Definition 
 | 
        |  | 
        
        | Term 
 
        | LOCATING TRANSCRIBED REGIONS |  | Definition 
 
        | is a direct method of finding genes |  | 
        |  | 
        
        | Term 
 
        | mRNA's ARE NOT EASILY SEQUENCED |  | Definition 
 
        | to rare to purify sequencing technology not widely available
 |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | cDNA for sequencing retroviruses have RNA genomes
 they use reverse transcriptase to copy RNA into cDNA
 |  | 
        |  | 
        
        | Term 
 
        | CONVERTING RNA TRANSCRIPTS TO cDNA |  | Definition 
 
        | obtaining mRNA from red blood cell precursors eukaryotic mRNA poly A tails at 3' end
 |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | affinity to oligo (dT)- single strand DNA fragments of 20 nucleotides made of dT only |  | 
        |  | 
        
        | Term 
 
        | CONVERTING RNA TRANSCRIPTS TO cDNA |  | Definition 
 
        | synthesis of hybrid cDNA-mRNA molecule in vitro synthesis using reverse transcriptase + dATP+ dGTP+dTTP+cCTP
 |  | 
        |  | 
        
        | Term 
 
        | CREATING THE 2ND DNA STRAND COMPLEMENTARY TO THE FIRST cDNA STRAND |  | Definition 
 
        | mRNA digested with RNAse 3' end of cDNA folds back and acts as primer for 2nd strand
 |  | 
        |  | 
        
        | Term 
 
        | PRESENCE OF dNTP's AND DNA POLYMERASE |  | Definition 
 
        | the first cDNA strand acts as a template for synthesis of the second cDNA strand |  | 
        |  | 
        
        | Term 
 
        | THE ARRANGEMENT OF GENES ON GENOME |  | Definition 
 
        | human genome has about 25000 genes part of genome that corresponds to exons is exome
 |  | 
        |  | 
        
        | Term 
 
        | MOST OF A GENOME IS NON-CODING DNA |  | Definition 
 
        | exome= 2% introns
 centromeres, telomeres, transposable elements
 simple repeating sequences
 |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | -be transcribed in the same or opposite directions - both chromosomal strands are the template strand for some genes
 |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | chromosomal regions that have many more genes thatn expected form avg gene density over entire genome |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | regions of >1 Mb that have no identifiable gene 3% human genome is comprised of gene desert
 |  | 
        |  | 
        
        | Term 
 
        | CLASS III REGION OF THE HUMAN MAJOR HISTOCOMPATIBILITY |  | Definition 
 
        | MHC complex contains 60 genes within a 700kb region GC content is much higher than genome avg
 |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | protein domains- sequence of amino acids that fold into functional units |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | protein products have novel domain architechtures |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | are groups genes closely related in sequence and function |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | look like genes, but do not function as genes |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | arose from the same gene in the common ancestor, usually retain same function |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | arise by duplication, often refers to members of a gene family |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | colored segments contain at least 2 genes whose order is conserved in the mouse genome |  | 
        |  | 
        
        | Term 
 
        | CHROMOSOMAL REARRANGEMENTS |  | Definition 
 
        | have moved the blocks to different places in the mouse genome |  | 
        |  | 
        
        | Term 
 
        | HUMAN T CELL RECEPTOR FAMILY |  | Definition 
 
        | DNA rearrangement combines V,D, and J segments into a gene result is about 1000 different combinations
 |  | 
        |  | 
        
        | Term 
 
        | COMNINATIORIAL STRATEGIES AT THE RNA LEVEL |  | Definition 
 
        | may lead to gene amplification and diversity example-> 2 neurexin genes
 2 alternative promoters, 5 sites for alternative splicing
 can generate 2000 different mRNA's
 |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | database established by the NIH in 1982 still the most widely used repository for sequence data
 |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | single, complete, annotated version of a species'genome agreed upon standard form comparison
 maintained by the NCBI
 |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | find homologous sequences |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | carries oxygen in the blood adult hemoglobin
 2 alpha and 2 beta
 |  | 
        |  | 
        
        | Term 
 
        | EMBRYONIC AND FETAL HEMOGLOBINS |  | Definition 
 
        | bind more tightly to oxygen to facilitate transfer of oxygen from the mother to the embryo or fetus |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | binds to oxygen less tightly to allow delivery of oxygen to the organs |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | genes in order on chromosome genes oriented in same direction
 locus controls region-> controls sequential expression of globin genes
 |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | deletion of alpha and beta genes y genes continue to be expressed
 normal levels of health
 |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | change in amino acid sequence of a- or b-globin chain causes destruction of red blood cells
 example sickle cell
 |  | 
        |  | 
        
        | Term 
 | Definition 
 
        | mutations reduce or eliminate production of one of the two globin polypeptides range of phenotypes
 |  | 
        |  |