Chapter 18: Microbial Genomics Flashcards

(23 cards)

1
Q

DNA Sequencing Methods: Sanger DNA Sequencing

(1) DNA sequencing
Components:
* Template DNA
* Oligonucleotide primer
* DNA polymerase
* Deoxynucleoside triphosphates (dNTPs)
* (2)

In such a reaction mixture, DNA synthesis continues until a (2) , rather than a dNTP, is added to the growing chain.
* Without a (3) to attack the (4) of the next dNTP to be incorporated, synthesis stops.

A
  1. Chain-termination
  2. Dideoxynucleoside triphosphates (ddNTPs)
  3. 3’-OH group
  4. 5’-phosphate group
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Sanger Method

DNA synthesis continues until a ddNTP, rather than a dNTP, is added to the growing chain.
* Without a 3′-OH group, synthesis stops.

(1) reactions prepared, one for (2).
* Results in a collection of DNA fragments of varying lengths, each ending in the same ddNTP.

Samples prepared for automated sequencing.

A
  1. Four separate synthesis
  2. each ddNTP
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Automated Sanger DNA Sequencing

  • Uses four different colored (1).
  • (2) determines order.
  • Results are recorded on a (3).
A
  1. fluorescent labeled ddNTP
  2. Electrophoresis and laser beam
  3. chromatogram
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Next-Generation DNA Sequencing (NGS)

(1) sequencing techniques—thousands of identical DNA fragments are sequenced simultaneously.

Uses (2) templates attached to a solid substrate.

Makes genomic sequencing faster and cheaper.
* Avoids need to insert individual DNA fragments into vectors.

A
  1. Massively parallel
  2. short, sheared pieces of DNA
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Sequencing By Synthesis

Identifies each nucleotide as (1).
Synthesis uses a modified fluorescent nucleotide that stops the reaction because the (2).
* Nucleotide does not (3) until it is incorporated into growing DNA strand.

A
  1. it is incorporated
  2. 3′-OH is blocked
  3. fluoresce
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Whole-Genome Shotgun Sequencing

Four stage process:
* (1)—generates clones of portions of genome.
* (2)— determines sequences of genome fragments in vector.
* (3)—computer analysis joins overlap regions to form contig.
* (4)—proofreading ensures all reads of the same sequence are identical.

A
  1. Library construction
  2. Random sequencing
  3. Fragment alignment and gap closure
  4. Editing
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Next-Generation Genomic Sequencing

Depth of coverage and breadth of coverage improved.
* Coverage— (1) sequenced in a genome.
– 18-100 times is (2).
– (3)—high average number of reads.

Great depth of coverage increases accuracy.
* Decreases the chance that sequence data generated is from (4).

A
  1. average number of times each nucleotide is
  2. how often nucleotides are read
  3. Deep sequencing
  4. a random mistake in the polymerase activity
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Single-Cell Genomic Sequencing

Femtograms of DNA from a cell are (1).

(2)
* Occurs at single temperature and uses the DNA polymerase from bacteriophage phi29 to synthesize new DNA.

A
  1. amplified to mg of DNA required for sequencing
  2. Multiple displacement amplification (MDA)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Metagenomics Provides Access to Uncultured Microbes

Metagenomics
* Study of microbial genomes based on (1).
* Used to learn more about the diversity and metabolic potential of microbial communities.
– Each DNA fragment comes from a collection of genomes found in a (2).
* Established the Genomic Encyclopedia of Bacteria and Archaea project.
– Improves reference databases by sequencing genomes of a wide variety of cultured microorganisms.

A
  1. DNA extracted directly from the environment
  2. microbial community, not a single organism
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Bioinformatics

(1)
* Bioinformatics combines biology, mathematics, computer science, and statistics to convert raw nucleotide data into (2).

Further examination carried out using (3).

A
  1. Genome annotation
  2. the location and potential of genes
  3. in silico analysis
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Genome Annotation

Process that locates genes in the genome map.

Identifies (1).
* must be (2) that is not interrupted by a (3).
* must have (4) at the 5’ end and (5) at the 3’ end.

A
  1. open reading frame
  2. > 100 codons
  3. stop codon
  4. ribosomal binding site
  5. terminator sequences
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Bioinformatics—ORFs and BLAST

ORFs that appear to encode protein are called (1).

(2)
* Base-by-base comparison of 2+ gene sequences.
* Assign tentative function of gene or protein structure.

A
  1. coding sequences (CDS)
  2. BLAST (basic local alignment search tool)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Gene/ORF Terminology

(1)—genes from different organisms with similar ORFs.

(2)—two or more genes with very similar nucleotides sequences, likely from a duplication event.

(3)—naming standards for proteins and their motifs based on similarities among orthologous proteins.

Proteins that don’t align with known amino acid sequences fall into two categories.
* (4)—match known sequences in databases, but don’t have an assigned function (yet).
* (5)—products of genes unique to that organism.

A
  1. Orthologues
  2. Paralogues
  3. Gene ontology
  4. Conserved hypothetical proteins
  5. Proteins of unknown function
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Functional Genomics Links Genes to Phenotypes

Functional Genomics

Puts genomic information in a (1).

Provides information on:
* (2)
* (3)
* (4)

A
  1. biological context
  2. metabolic pathways
  3. transport systems
  4. regulatory and signal transduction mechanisms
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

DNA Microarray Analysis

Determines which genes are expressed at a specific time.

Arrays are (1) -> organized as a grid.

Each DNA spot (2) represents a single gene.
* (2) is usually a PCR product generated from a (3).

A
  1. solid supports to which DNA is attached
  2. probe
  3. gene of interest
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Analysis of Gene Expression Using Microarrays

Based on hybridization between the (1) and the (2) from the microbe of interest.
* Reverse transcriptase converts mRNA to cDNA.
* cDNAs are (3) and incubated with the microarray.
* (4) is washed off.
* The microarray is scanned with (5) which indicates that hybridization has
occurred.

A
  1. probe DNA
  2. complementary sequences
  3. labeled with fluorescent dyes
  4. Unbound cDNA
  5. laser optics to detect fluorescence
16
Q

RNA-Seq Method for Transcriptome Analysis

NGS method quantifying mRNA levels by measuring the “reads” matching each gene.

Cellular mRNA is converted to cDNA by (1).

cDNA are then (2).

Sequence data can be used in two ways:
* If sequenced genome exists, the cDNA are (3).
* If no reference genome, nucleotide sequence is (4).

A
  1. reverse transcriptase
  2. sequenced using NGS
  3. aligned to reference genome
  4. converted to amino acid sequences and compare to databases of known protein sequences
17
Q

Hierarchical Cluster Analysis

Organization of transcriptomic experiments.
Groups genes according to ().
E.g.
* Upregulated—red
* Downregulated—green
* Unchanged—black

A

level of expression

18
Q

Metatranscriptomics

Extraction of RNA (1), followed by sequencing and comparison to known sequences.
Describes transcriptome of (2).
* Yield transcripts that map to multiple reference genomes as well as some that fail to align with any reference genome.
* Novel transcripts represent newly discovered genes.

A
  1. directly from the environment
  2. an entire ecosystem
19
Q

Systems Biology: Making and Testing Complex Predictions

(1) with the molecular interactions that become pathways for catabolism, anabolism, regulation, behavior, environmental responses, etc.
* (2) of cells.
* May be important in studying (3).

A
  1. Integration of “parts lists” of cells
  2. Holistic study
  3. host microbe interactions
20
Q

Comparative Genomics

(1) inferred by studying similar nucleotide and amino acid sequences among organisms.

Comparisons of genomes of strains within species and among species.
* (2)—genes that are no longer functional.

Comparative genomics is the tool by which virologists and evolutionary biologist have tracked the development of new strains of SARS-CoV-2.

A
  1. Gene function and evolutionary relationships
  2. Pseudogenes
21
Q

Comparative Genomics In Microbial Groups

  • Horizontal gene transfer (HGT) found to be frequent.
  • (1)—permanently integrated mobile genetic elements.
  • Pathogenicity islands—virulence proteins.
  • (2)—order of genes on genome.
A
  1. Genomic islands
  2. Synteny
22
Q

HGT Impacts Microbial Evolution

Core genome—set of genes found (1).
* Often thought of as the (2).
Pan-genome—(3) in a strain of a species.
* More recently acquired genes that enable (4).

A
  1. in all members of a species
  2. minimum genes needed to survive
  3. every gene
  4. microbial colonization of new niches