- challenge of scale (very thin and no physical features that mark genes)
-challenge of numbers (1 gene- 1/millionth genome, some genes ecpressed in small fraction of cells)
locate a gene/ DNA sequence, remove/copy DNA sequence, Visualize DNA, store new DNA sequences, Edit any genome
locate a gene/ DNA sequence, remove/Copy DNA sequence, Visualize DNA, store new DNA sequences.
technique in molecular genetics where scientists combine DNA from two sources to a create new DNA. this is usually done using a vector like a plasmid.
genetic engineering
small circula double stranded DNA that can naturally exist in bacterial cells along side the bacterial genome . have the ability to replicate indepedent of bacterial cell genome using an (ori) encodded in a dna sequence inside plasma.
synthetic DNA fragments containing restriction sites
an ori, one or more selectable markers, recognition sites for one or more restriction enzymes
because it allows cells containing the vector to be selected or identified
plasmids need these sites for cleavage because this is where the restriction enzymes (restriction endonucleases) can cut the DNA at specific sequences. allows insertion/ removal of specific genes/ DNA fragments within the plasmid.
creates sticky ends (overhangs/blunt ends) that can be used to join other pieces of DNA. allowing precision when inserting new foreign DNA.
typically cleaved with the same restriction enzyme
they have highly engineered sequences to optimize various functions (high copy number, inducible, high protein expression).
activated by high stress enviornment, bacterium takes up foreign DNA from dead bacterium and incorporates it within its genome, survival mechanism, provides anatiobiotic resistance
no (hence recombinant DNA)
make it able to take up foreign DNA
after creating a recombinant plasmid with the gene of interest, the plasmid is introduced into bacteria under special conditons allowing them to take it up. once it is taken up the bacteria can replicate it and make copies of the plasmid.
no, that is why scientists use antibiotic selection when experimenting, since the plasmid generally has antiobiotic resitance, only bacteria that succesffully taken up the plasmid will survive when grown on agar plant with antiobiotic. allows for selection and isolation
bacterial cells with their resident plasmids can be quickly cheaply expanded
these transformed bacteria can be put in glycerol and frozen and stored at -80 celsisus with little to no effect on cell health of DNA degradation (ability to store new genetic seuqences)
yes circular DNA is very stable
using selectable markers on the plasmid (genes that encode a marker) which are typically visual or growth selective.
ampicillin resistance gene. when bacteria are spread on a lawn of media containing ampicillin only transfromants will grow into colonies
a single colony is picked and cultures into large isogenic (same genotype) bacterial cultures. plasmid DNA can be purified from these cultures
allow us to multiply and maintain DNA sequences indefinitely
enzymes that cleave the phosphodiester bonds within a polynucleotide chain. They cut DNA or RNA at specific internal sites, unlike exonucleases, which remove nucleotides from the ends of the chains.
DNA endonuclease enzymes
they were discovered in bacteria as a defense mechanism agaiant invading viruses by functioning as a part of a bacterial innante immune system by identifying and cutting foregin DNA. stop viral DNA from integrating into the bacterial genome
allows researchers to target and cut DNA at very specific sites. This enables precise genetic manipulation, such as cloning, gene editing, and the construction of recombinant DNA
5' overhang, 3' overhag, blunt end
5' leaves a cut in a way that leaves a single stranded 5' end, 3' leaves a single stranded 3' end. blunt end cuts straight through both strands leaving no overhangs
they need to be cut/digested with same enzyme and mixed for the overhang ends (sticky ends) to hybridize due to the correct homology
covalently seal (ligates) 'nicks' in sugar phosphate bonds on two ends. Thus, creates stable double stranded piece of DNA.
Recognition sequences occur at specific locations in the plasmid where the restriction enzyme cuts. also called restriction sites and only occur at this one location in plasmid.
Each restriction enzyme is named based on the species of bacteria in which it was discovered, with the first portion of the name (in italics) representing the bacterial genus or species.
through gel electrophoresis
gels (agarose or polacrylamide) where one side has small wells
gels are covered in ionic buffer and electric field is applied, DNA is made of acids and acids are negatively charged so they slowly migrate to th eother end of the gel (postive pole)
larger DNA fragments move slower through the gel matrix thus run slower than small DNA fragments (stay closer to the negative side)
DNA is stained with soemthing that binds to the nucleic acids (ex. ethidium bromide), this can be imaged with a cemra and a an appropriate light source (ex. UV light). certian DNA bands can be cut out of the gel and purified
polymerase chain reaction (PCR)
kary mullis 1983
1. single stranded DNA template (unwound by Helicases)
2. replication machinery (DNA polymerase)
3. RNA primers (created by primase)
4. free dNTPS
heat used to seperate double stranded DNA instead of helicase
dna polymerase supplied a test tube
dna primers (synthetically created)- small chains of nucleotides (17-25) that are complementary to the template and provide 3'-OH end for the polymerase to work with
free dNTPs supply in a test tube
polymerase chain reaction
1. DNA template ( double stranded)
2. primers (forward and reverse)
3. dNTPs
4. DNA polymerase (heat resistant)
5. buffer (ions)
heat to 90-100 celsisus for a few minutes
cooled 30-60 celsius rapidly for less than a minute. primers are in high conc, rapid cooling give advantage for small primer to bind template before larger single stranded DNA comes back together.
both forward and reverse primer
heat to 72 celsisu, polymerase tolerates the rapid heating in step 1
by the discovery of thermal resistant DNA polymerase in bacteria, thermus aquaticus
machine that allows repeated cycling of 3 primary steps (heating and cooling) 25-35 times
ideally doubles the number of copies of the target DNA.
copies of DNA= 2^N X inital copies of DNA. N is the number of PCR cycles.
Forward Primer: Binds to the beginning of the target sequence on the sense strand of DNA.
Reverse Primer: Binds to the end of the target sequence on the antisense strand of DNA.
By choosing primers that are complementary to specific regions flanking the target sequence, you ensure that only the desired segment of DNA is amplified
20nt long
Forward Primer: This should be designed to bind just upstream of the start codon (usually in the 5' untranslated region or UTR).
Reverse Primer: This should be designed to bind just downstream of the stop codon, in the 3' UTR or exon.
locate a gene/DNA sequence
true
no, on average, only 1/10th the genes in the genome is expressed in a cell
cells of different tissues express only a subset of genes
the first step is isolation of total RNA. first you collect cells or tissues of interest than extract the total RNA from the sample.
with oligo (dT) primers which are short sequences od deoxythumidine which bind to the poly-A tail of eukaryotic mRNA molecules. the poly A tail is unique to mRNA so the oligo primer ensure that only mRNA is reverse transcribed. (this is the starting point for the reverse transcription reaction)
RT is an enzyme that synthesizes cDNA from mRNA, it synthesizes the first strand by copying the mRNA template. it had both RNA- dependent DNA polymerase activity (mRNA--> cDNA) and RNase H activty (degrade RNA template after cDNA is made)
synthesizes cDNA from the 3' end of the mRNA and works toward the 5' end. results in a single stranded cDNA copy of the mRNA. RNase nicks the mRNA strand.
after RT and the first strand of cDNA being made, DNA polymerase I which comes from E.coli and has both polymerase and exonuclease activies is used to syntehsize the second strand of cDNA using dNTPS. adding complementary bases to the first cDNA strand.
since it also has RNase H activty, which removes the RNA strand from the original template, so it can be degraded after reverse transcription
double stranded cDNA molecules where one strand is complementary to the mRNA and the other strand is the reverse complement
it can be inserted into a cloning vector.
you will cut the vector (plasmid) with appropriate restriction enzymes to generate sticky or blunt ends that will allow for insertion of the cDNA.
the double stranded cDNA is ligated into the vector using DNA ligase which forms covalent bonds between the cDNA insert and the vector. now the cDNA is part of the plasmid vector and can be introduced into bacterial cells
how the ligated is introduced into competent E.coli, the bacteria that actually take up the plasmid will begin to replicate creating clones of the cDNA insert. the colonies can then be screened to identify the ones containing the desired cDNA insert
it is specific to a particular tissue, cell type, state or time. they can be created to capture gene expression profile of specific conditon or stage in the life cycle of an organism
helps us understand the protein sequence a gene product will make (gene structure/function) as well as for designing primers for downstream molecular cloning of these gene sequences into plasmids
sanger sequencing is the standard used for seuqencing DNA fragments in laboratories. created by frederick sanger and colleagues in 1970s, it is also called dideoxy sequencing.
1. DNA replication with DNA polymerase
2. DNA template
3. primers
1. number of primers
2. use of ddNTPS
3. concentration of ddNTPS
4.labeling for detection
5. amount of DNA required
pcr- 2 primers (F and R)
ss- 1 primer
PCR- uses dNTPS (deoxyribonucleotide triphosphate) to extend the DNA strand
ss: uses ddNTPS (dideoxyribonucleotide triphosphates), which lack the 3' hydroxyl group, causing DNA synthesis to terminate when incorportaed.
PCR: dNTPS are present in equal amounts
ss: ddNTPS are present at a 100 fold lower conc than dNTPS to ensure controlled termination of DNA synthesis
pcr: no special labelling is required
ss: primer or ddNTPS are labelled (radioactive or fluorescent) to visualize and read the sequence after electrophoresis
pcr: small amount of template DNA as the target region gets amplified
ss: large amount of DNA becuase PCR amplification or cloning comes before sequencing to produce a large amount of target DNA
the original version the primer is radioactively on the 5' end. the automated version the primer is not labelled
each ddNTP (A,C,G,T) is labelled with a different fluorescent probe (illustrated by green, purple, black, red etc.)
original: many copies of target DNA and primer are added to 4 tubes
automated: everything goes into one tube and one well on gel
1. lots of dNTPS
2.DNA polymerase
3. a small amount (1/100) of one of the 4 ddNTPS (A,C,G,T)- low conc to ensure random incorporation/chain termination at various points
gel electrophoresis to seperate by DNA length, very laborious, expensive
carried out by automated machines using laser scanners, imagers and software to read out DNA sequence
a peptide hormone, used in the treatment of diabetes
1.identify human tissues that expresses insluin (pancreas)
2. purify RNA from pancreatic tissue
3. use purified RNA to create cDNA from pancreatic mRNA
using known genetic sequence of insulin,design a forward (3' end) and reverse (complentary 5' end) DNA primer for inulin cDNA
then to the 5' ends of the forward and reverse primer add an extenstion of a few nucleotides which contain a restriction enzyme recognition site
in a PCR reaction they amplify the insulin cDNA
- load you PCR product on an agarose gel, then use gel electrophoresis to make sure your DNA band is the size you expect
cut your insulin cDNA band out of the gel and purify the DNA from the agarose gel
add restriction enzyme to your DNA fragment to cut at the restriction sites that you put on the ends, use the same enzyme to digest a plasmid optimized for expressing a protein in E.coli
we use the same RE to digest both becuase it will leave complentary DNA ends that will fit together. then ligate the insulin cDNA into the expression plasmid
it gets transformed into competent e.coli cells, then we allow it to grow then spread on a culture plate to obtain single colony isolates
we perform a PCR on anumber of different bacterial colonies (1-4) to confirm they contain the insert and not just a religated plasmid
designing one primer in the plasmid and one primer within the insulin cDNA would produce a product
use gel electrophoresis to visualize wheter you have a postive PCR product or not
After selecting colonies or cultures with the plasmid containing the insert, we purify the plasmid DNA. We then design primers that span both the insulin cDNA and the surrounding plasmid regions (flanking parts). The plasmid is sent for sequencing, and the resulting DNA sequence is analyzed to confirm the correct insertion of the insulin cDNA and ensure there are no mutations.
we grow a larger bacterial culture, and because we cloned the insulin cDNA into an expression plasmid we can add a chemical (IPTG) to induce the bacteria to transcribe and translate the human insulin cDNA on the plasmid. This allows the bacteria to produce insulin. We can then purify the insulin from the bacterial culture for further use.
CRISP: clustered regularly interspaced short palindromic repeats
CRISPR was initially thought to be a form of bacterial immunity but then it was found the spacers cointaied DNA that come from foreign sources such as bacteriophages (viruses that infect bacteria), archaeal viruses, and plasmids (small, circular DNA molecules in bacteria). These foreign DNA sequences are inserted into the bacterial genome as a sort of genetic "memory" of past infections, allowing the bacteria to recognize and defend against those specific invaders if they come back.
porteins associated with CRISPR, which were initially thought to be involved in general DNA repair, actually played a crucial role in this immune system. they recognize and cut foreign DNA sequences/ defend bacteria and repair dna
1. evolved in bacteria to combat invading DNA elements (ex. viruses), similar to an adaptive immune system
2. functions as a type of oragnismal memory of past invaders
3.leads to the targeted destruction of invading DNA
they create crispr arrays in the bacterial genome (acquisiton phase)
Protospacers: These are foreign DNA sequences (from viruses or plasmids) that match the unique spacer sequences in the CRISPR system. Essentially, they are the exact targets for CRISPR's defense mechanism.
Palindromic repeats: These are short DNA sequences that flank the spacers. They read the same forward and backward, which helps the CRISPR system recognize and target the corresponding foreign DNA when it reappears.
they express complementary RNA to similar invaders (expression phase)
The CRISPR array (the sequence of spacers and repeats) is transcribed into a long precursor RNA called pre-crRNA.
The pre-crRNA is then cut by Cas nuclease proteins (or RNase II) into shorter crRNAs, each containing one of the unique protospacer sequences that match the foreign DNA previously encountered. These crRNAs are crucial for recognizing and targeting the invading DNA.
crRNA + tracrRNA + Cas9 form complex
cas9/crRNA/tracrRNA complex binds to complementry DNA of new (but similar) invaders
complex cleaves causing double stranded breaks
1.crRNA contains 20-50nt of invader homology--> CRISPR-Cas can target very unique DNA sites (like PCR primers)
2. protospacer adjacent motif (PAM) site is required downstream of spacer- enables cas9-mediated recognition and cleavage of target DNA.
(for the crispr-cas9 system, this PAM sequences is: 5'-NGG-3' <-- short 3nt sequence, occurs very frequently throughout a genome (every 8bp in humans).
crispr-cas9
natural system required crRNA and tracrRNA to pair and bind with Cas9 to make effector complex. reasearchers combine crRNA and tracrRNA into one singlue guide RNA (sgRNA)
contains ~20nt "seed" sequence that can direct the effector complex to any specific DNA sequence
DNA repair, two DNA repair mechanisms exist to fix double stranded breaks: nonhomologous end joining (NHEJ) and Homology directed repair (HDR)
when: homologous template is NOT present
(Used when cells are in G1 and no sister
chromatid available as a template)
pro: (good for making random disruptive mutations to
your gene of interest)- good for when you want to disrupt a gene (like in gene editing)
con: Error prone often leading to deletions, insertions,
and translocations because it doesn't have a template to guide it
when: homologous template IS present – Normally, it
is the sister chromatid after DNA replication.
con: Disadvantage is HDR is not very efficient – success
rate for this strategy of gene editing is one in 100-
1000.
transgenics: researchers provide a donor DNA (a synthetic piece of DNA with the desired sequence/homolgous template) to guide the repair process. This allows them to insert desired new sequences into a gene, such as adding a fluorescent protein tag to track or study a protein.
• Somatic gene therapy – ethically more permissible
• Germ-line gene editing – Within the traditional research community, this
possibility is not currently on the table (for ethical reasons)
1. ethical challenges
delivery of materials into target cells
Viral Vector Delivery: Engineered viruses are used to deliver the CRISPR components (DNA, Cas9, sgRNA) into cells. Viruses are effective at entering cells, but they have risks, including immune reactions and potential unintended genetic changes.
Non-Viral Delivery Vectors: Alternatives like nanoparticles can carry the CRISPR materials into cells without using viruses. These can be safer, but often face challenges in efficiently getting the materials into cells.
The question of whether CRISPR components will effectively reach and enter the correct tissues remains a significant challenge. Delivery methods, like viral vectors and nanoparticles, can sometimes struggle with targeting the right tissues and cells efficiently. Even if the delivery method works, there’s no guarantee that the CRISPR components will reach enough cells in the target tissue to make a meaningful impact on gene therapy.
Homology Directed Repair (HDR), which is crucial for precise gene editing, is not very efficient (only successful in about 1 in 100 to 1,000 attempts).
For gene therapy to work, a significant number of cells in the target tissue need to be edited. If only a small fraction of cells are successfully edited, the therapy may not be effective enough to restore function.
immunological challenges
* immune response against cas9 protein (since it is a bacterial protein) which destroys the protein function, there is also a immune response to delivery mechanisms (delivery of CRISPR-Cas9 constructs)
this leads to autoimmune-like disease/symptoms
Immune-privileged organs are parts of the body where the immune system has limited access or activity such as the eyes (treatment of inherited retinal diseases)
forward and reverse genetics
• Forward genetics: begins with a phenotype due to random
mutations. Then researcher uncovers what gene is responsible for phenotype.
Reverse genetics: begins with interest in a gene of unknown
function. Then, induce mutations specifically in gene and check if phenotype occurs.
Forward genetics better
for discovering unknown
genes important for a
function/phenotype.
Reverse genetics good if
you have a strong
hypothesis that a gene is
important for a trait of
interest.
• Microsatellites: short tandem repeats (STRs), variable
number of copies of repeat sequences possessed by
many organisms
• Detected by PCR
• Fragments represented as peaks on a graph
• Homozygotes: single tall peak
• Heterozygotes: two shorter peaks
to identify people. take an individual dna, subject to pcr, the length of the dna fragment from the pcr depends on the # of copies of the microsatellite sequences. the fragments are seperated by gel electrophoresis, different sized fragments appear as different bands. these patterns of fragments produced by individuals differ
represents the pattern of DNA fragments produced by performing PCT on the STR loci. number below each peak= number of STRs in that DNA fragment
used to determine the presence of a suspect at a crime scene, ex. the dna profile of suspect 2 matches dna evidence collected at the crime scence, results from 4 str loci.
usual means of victim id were of little use with remains, but used dna fingerprinting and also carried out on mitochondrial DNA
Technique to identify different species using
DNA
• Use genes that have high interspecies
variability (between species) and low
intraspecies variability (within species)
Most common gene used for animals is the
mitochondrial gene cytochrome c oxidase I
(COI aka COX1)
Can and has been used to identify:
• The species of meats sold at markets/in restaurants, to make sure food labels are accurate
Species of plants, pollen, insects (esp. larval stage)
• The diet of an organism (DNA testing of stomach contents)
• Human matrilineal lineages
• Animal remains, fecal samples, monitoring for invasive species
for evidence of yeti,bigfoot, sasquatch etc.
1. extract mystery dna
2. amplify COI region with PCR
3. check on gel/ column purify
4. DNA (sanger) sequence product from column purify
5. obtain dna sequence
6. identify likely species using DNA sequence database
because circular DNA
is stable, whereas
linear DNA is not
(will be degraded
over time)
gene (DNA)--> RNA
under the electron microscope, dna moleculues undergoing trasncription exhibit christmas tree like structures
RNA: rNTPS (ribonucleoside triphosphates) contains ribose sugar, 2' carbon of sugar has a OH group makes RNA more unstable than DNA
DNA: dNTPS (deoxyribonucleoside triphosphates)
rna- uracil
DNA typically forms a double helix with a stable, predictable structure due to its double-stranded nature/more rigid/less flexible
RNA is usually single-stranded and can fold into more complex 3D shapes (like loops, hairpins, and bulges) due to its ability to base-pair within itself. more flexibile/less rigid- can adopt more shapes
allow some RNA to acts as ribozymes (proteins) or catalytic RNA
- storing genetic info for protein translation
ribozymes:
-cut/edit out own sequence
-connect RNA molecules together
-replicate other RNA
-catalyze peptide bond formation between amino acids
phosphodiester bonds
4 bil years ago only RNA, ribozymes
then RNA ribozymes--> protein enzymes
now: dna--> rna,ribozymes--> protein enzymes
ribosmal RNA (rRNA), Messenger RNA (mRNA), Transfer RNA (tRNA), CRISPR RNA (crRNA)
prokaryotic and eukaryotic
cytoplasm
structural and functional components of the ribosome
prokaryotic and eukaryotic
nucleus and cytoplasm
carries genetic code for proteins
prokaryotic and eukaryotic
cytoplasm
helps incorporate amino acids into polypeptide chain
prokarytoic
---
assists destruction of foreign DNA
- a DNA template
-the raw materials (ribonucleotide triphosphates) needed to build a new RNA molecule
- The transcription apparatus, consisting of the proteins necessary for catalyzing the synthesis of RNA
transcription is highly selective in what DNA sequences are transcribed. genes are transcribed from only one of the DNA strands (the template strand) either strand of DNA can be the template strand (depends on the gene)
the transcribed strand, the dna strand that is used as template to syntehsize the RNA molecule. RNA is complementary to this strand
coding strand. it has the same sequence as the RNA except RNA uses uracil instead of thymine, it is not used as the template for transcription
the RNA molecule is synthesized 5' to 3' direction and is complementary and antiparallel to DNA template strand
stretch of DNA encoding RNA molecule and sequences
necessary for its transcription (Promoter, RNA-coding region, Terminator).
1. Contains sequences that Transcriptional Apparatus recognize and bind
2. Dictates which DNA strand is template strand and Transcription Start Site
1.Contains sequences that signal to end transcription.
2. Usually part of the RNA coding sequence and resulting transcript
First nucleotide transcribed is numbered +1 and numbering increases downstream.
Nucleotides upstream of start site are numbered negative moving upstream (promoter and up)
RNA Polymerase and array of accessory proteins
RNA Polymerase is large multimeric enzyme (several
polypeptide chains)
Core enzyme catalyzes addition of rNTPs , thus
elongation of RNA molecule.
Core enzyme has 5 subunits:
a - alpha (2)
b - beta (1)
b’ - beta prime (1)
w - omega (1)
a, b, b’ are essential, while w is not but helps
stabilize enzyme.
Accessory proteins join and leave polymerase at
different phases of transcription.
Thus, different accessory proteins regulate special
transcriptional functions.
initation, elongation, termination
Transcriptional apparatus that assembles on promoter and begins RNA synthesis
1. Promoter recognition/binding
2. Formation of transcription bubble
3. Creation of first bonds between rNTPs
4. Escape of transcriptional apparatus from promoter
Promoters have different affinities for RNA polymerase
Critical step for regulating selective gene expression (when and how frequent)
Promoter sequence encodes information (that RNA Polymerase “reads”)
1. Where to start
2. Which strand is template strand
3. Which direction to move
Consensus Sequence – set of most commonly encountered nucleotides among sequences with considerable similarity, or consensus.
Similar location with respect to start site.
one located at -10 bp - Pribnow box TATAAT
another at -35 bp - TTGACA
Sequence between/around consensus vary greatly from promoter to promoter
(mutations there have little impact on transcription).
Sigma factor (s) is a bacterial accessory protein that binds to core RNA polymerase
(forming the RNA polymerase “holoenzyme”).
Sigma stabilizes polymerase binding to proper promoter start sites ensuring initiation of gene transcription.
Sigma required only for promoter binding and initiation and detaches after a few RNA nucleotides have been added.
Multiple types of Sigma exist that are specific for particular sets of promoters
RNA Polymerase Holoenzyme bind consensus sequences in promoter (initially weakly)
Holoenzyme changes structure leading to tighter binding and unwinding of DNA to form transcription bubble (nucleotides -12 to +2).
-RNA polymerase starts by adding the first ribonucleoside triphosphate (rNTP) to the DNA template.
-The first base added doesn’t use the rNTP's triphosphate bonds for energy.
-As a result, the RNA transcript has 3 phosphates on the 5' end.
-For subsequent rNTPs, two phosphates are cleaved off during the addition of each new base.
-After a few bases are added, the sigma factor detaches from the RNA polymerase and transcription continues.
Polymerase changes shape again such that it “lets loose” of consensus sequences in promoter and is free to move and transcribe downstream.
1.Unwinding DNA at downstream edge of bubble
2. Adding nucleotides to RNA molecule
3. Rewinding DNA at upstream edge
RNA polymerase pauses due to backtracking, where it moves backward along the DNA template.
This pausing is important for proofreading the RNA transcript.
DNA Supercoiling:
Supercoiling occurs ahead of the bubble (positive supercoiling) and behind it (negative supercoiling).
Topoisomerases are enzymes that help relieve supercoiling and "fix" the negative supercoiling behind the transcription bubble.
1.RNA Polymerase stops RNA synthesis
2. RNA molecule released from Polymerase and dissociates from DNA
3. Polymerase detaches from DNA
Rho-dependent or Rho-independent
These terminators only cause termination
when an ancillary protein, rho (r) is present.
1. RNA sequence where rho binds called Rut
(rho utilization) site
2. Terminator sequence causes Polymerase
to pause. Polymerase pause allows rho time to
advance towards paused transcription sit
Rho protein has helicase activity that unwinds the RNA:DNA hybrid thus terminating transcription.
Terminator sequence sufficient to cause termination
(called “intrinsic terminators”)
Features:
1. Inverted repeats – two sections of terminator
have complementary (inverted) sequences.
2. String of ‘A’s in the DNA template follows last
inverted repeat (leading to string of ’U’s in RNA). The string of Uracil (U) cause Polymerase to pause, allowing time for the inverted repeat hairpin to form. Hairpin breaks binding of DNA to RNA and terminates transcription.
Eukaryotic DNA is condensed into chromatin
structure and often inaccessible
Nucleosome modifications regulate access to
DNA
Chromatin-remodeling complexes: bind directly to DNA sites and reposition nucleosomes
Destabilize nucleosomes, making DNA accessible
chromatin remodeling complexes reposition the nucleosomes, allowing transcription facors and RNA polymerase to bind to promoters and intiate transcription
The acetylation of histone proteins alters
chromatin structure and permits some
transcription factors to bind to DNA.
Acetylation of histones controls flowering in
Arabidopsis
− Flowering locus C (FLC) gene
− Flowering locus D (FLD) gene
Flowering Locus C (FLC) gene: This gene represses flowering and happens when acetyl roups on histone proteins destablize chromatin strcture .
Flowering Locus D (FLD) gene: This gene promotes flowering (encodes a deacetylase enzyme) affecting histone modification by removing acetyl groups which then restore chromatid structure so FLC is not transcribed, flowering is not supressed and so flowering takes place
Each RNA Polymerase type recognize different promoter
sequences.
Core Promoter – just upstream of gene, similar to Bacterial promoter
Regulatory Promoter – upstream of core promoter, more varied consensus seq
• Transcription factors (TF) - any DNA binding protein that affects the levels
of transcription (estimated >1000 TFs in humans)
TFs bind regulatory promoter sequences and affect transcription by directly or indirectly contacting the basal transcriptional apparatus.
General transcription factors + RNA polymerase II forms the basal
transcription apparatus, which is sufficient to initiate minimal levels of
transcription (“basal transcription”
Many types of DNA binding proteins have evolved with functional parts (60-90amino acids), referred to as domains, that bind the DNA.
Many types have evolved that share characteristic domains called motifs (e.g. zing finger motif)
1. TFIID binds to TATA box in the core promoter, 2. the trancription factors and RNA poly II bind to the core promoter
The TATA-binding protein (TBP) binds to
the minor groove of DNA, straddling the
double helix of DNA like a saddle.
Enhancers – more distant from gene, But the DNA can loop over allowing interaction with DNA binding proteins and Polymerase
3. transcriptional activator proteins bind to sequences in enhancers
4. DNA loops out, allowing the proteins bound to the enhancer to interact with the basal transcription apparatus
5. transcriptional activator proteins bind to sequences in the regulatory promoter and interact with the basal transcription apparatus through the mediator
* 11-15 bp of DNA unwinds around the transcription start site
• Open complex - template DNA strand positioned in RNA
polymerase’s active site
• RNA synthesis begins
After the first ~30nt are polymerized, RNA polymerase II
moves off the promoter and proceeds downstream to
elongate the RNA molecule
~8nts of RNA remain paired with the DNA template strand as
transcription progresses downstream
exiting DNA out of rna poly II: upstream
entering DNA: downstream
Molecular structure of RNA
polymerase II and how it functions
during elongation have been
revealed through the work of Roger
Kornberg and colleagues.
The DNA double helix enters
RNA polymerase II through a
cleft in the enzyme and unwinds.
The DNA–RNA duplex is bent at
a right angle, which positions
the 3' end of the RNA at the
active site of the enzyme.
At the active site, new
nucleotides are added to the 3'
end of the growing RNA
molecule.
RNA Polymerase I, III similar mechanisms to
Bacteria
RNA Polymerase II continues to synthesize RNA past the end of the coding sequence, creating pre-mRNA.
The pre-mRNA is cleaved at a consensus sequence, producing mRNA that will be translated into a protein.
The RNA continues to be transcribed with a 5' hanging end.
Rat1 binds to the 5' hanging end of the RNA, "eats" its way toward the RNA polymerase, causing termination of transcription.
Changes in a relatively small number of regulatory
sequences help produce the large phenotypic
differences between humans and chimpanzees.
false
genotype: DNA sequence information
phenotype: anatomical, behavioural, phsyiological
- Process by which a gene creates a “gene products” (e.g.,
proteins, RNA) that have cellular/organismal functions (e.g. underlie a phenotype).
Gene regulation are mechanisms that have evolved to control gene expression
Gene expression - Provides the molecular basis for the relationship between
genotype and phenotype
Gene regulation – controls the flow
of information (fast, moderate, slow, stopped,etc.)
• Structural genes: encoding proteins
Regulatory genes: encoding products that
interact with other sequences and affect the
transcription and translation of these sequences
Regulatory elements: DNA sequences that
are not transcribed but play a role in regulating
other nucleotide sequences
Constitutive expression: continuously
expressed under normal cellular conditions
In bacteria, gene regulation maintains internal
flexibility, turning genes on and off in response to
environmental changes
In multicellular eukaryotic organisms, gene
regulation also brings about cell differentiation
Transcription in Bacterial Cells Is
Regulated by Operons
Operon: promoter + additional sequences that control transcription (operator) + structural genes
Regulator gene: DNA sequence–encoding products that affect the operon function but are not part of the operon
Negative and positive control: inducible and
repressible operons
The trp operon is a cluster of genes in E. coli that are involved in the biosynthesis of tryptophan.
The trp operon is regulated by repression. When tryptophan levels are high, the repressor protein binds to the operator and prevents transcription of the operon.
When tryptophan levels are low, the repressor protein is inactive, allowing transcription of the trp operon to occur, leading to the production of enzymes needed for tryptophan synthesis.
The trp operon consists of the promoter, operator, and five genes (trpE, trpD, trpC, trpB, trpA) that code for enzymes involved in tryptophan biosynthesis.
Attenuation is a second level of regulation that controls transcription based on the availability of tryptophan. It involves premature termination of transcription when tryptophan is abundant.
Negative control: A repressor protein binds to the operator (partially overlap the promoter or beginning of first gene) to prevent transcription/ blocks RNA polymerase, turning gene expression off (seen in repressible operons).
Repressible operons: Typically on, but can be turned off by a corepressor (e.g., trp operon for tryptophan biosynthesis).
Positive control: An activator protein enhances transcription, turning gene expression on (seen in inducible operons). Frequently bind promoter and facilitate RNA
Polymerase binding and transcription
Inducible operons: Typically off, but can be turned on by an inducer (e.g., lac operon for lactose metabolism).
Francois Jacob and Jacques Monod used classical genetics to
uncover the “Operon model” of how Bacteria regulate genes
An operon is a cluster of multiple structural genes controlled by a single promoter. These genes usually have a related function.
Operons allow the cell to coordinate the expression of multiple related genes (e.g., proteins) with one promoter, ensuring efficient regulation.
Regulatory genes encode proteins or RNA that regulate transcription, similar to transcription factors in eukaryotes.
Regulatory proteins are DNA-binding proteins that can bind to the operon DNA to regulate transcription. They can act negatively as repressors (inhibit transcription) or positively as activators (enhance transcription).
Regulatory proteins bind to small molecules, which alters their shape and changes how they interact with the operon DNA, either positively or negatively.
Allosteric proteins are proteins that change shape and function when they bind to other molecules, influencing their ability to bind DNA or other proteins.
The control at the operator
site is negative. Molecule binding is to the operator,
inhibiting transcription. Such operons are usually off and
need to be turned on, so the transcription is inducible.
Inducer: small molecule that turns on the transcription
the allosteric modulator/inducer, which inhibits the function of the repressor
Corepressor: a small molecule that binds to the repressor
and makes it capable of binding to the operator to turn off
transcription
Allosteric modulator (small molecule) causes decreased transcription and is
called a co-repressor. Co-repressor is required for repressor protein function
The inducer is a precursor for enzymes (D,E,F) produced by the operon, thus
transcription itself leads to more enzymes and thus effects levels of inducer.
This feedback system allows bacteria to adapt to their environment and function
more economically
Genes D-F (enzymes) are only made if precursor (inducer) is present (less
wasteful production of enzymes).
The lac operon is a negative inducible operon involved in lactose metabolism.
The lac operon is regulated by the lacI gene (which encodes a repressor), the lacP promoter, and the lacO operator.
The inducer of the lac operon is allolactose, which binds to the repressor and inactivates it, allowing transcription.
lacZ: Encodes β-galactosidase, which breaks down lactose.
lacY: Encodes permease, which helps transport lactose into the cell.
lacA: Encodes transacetylase, which modifies lactose derivatives.
No, the repression of the lac operon never completely shuts down transcription; there is always a low level of transcription even in the absence of lactose.
Glucose is the preferred energy source for E. coli because it requires no further processing to enter glycolysis, making it a more efficient energy source.
E. coli breaks down lactose through catabolism, producing glucose as one of the products, which can then be used as an energy source.
Bacterial cells use the lac operon to efficiently switch on and off lactose metabolism based on the availability of glucose and lactose in the environment.
β-galactosidase breaks down lactose into galactose and glucose (catbolize), allowing the cell to use lactose as fuel. The cell makes β-galactosidase and permease in a coordinated manner (called coordinate induction) using the lac operon. β-galactosidase also converts lactose into allolactose, which regulates lactose metabolism.
When lactose is not available, the lac operon is repressed. The repressor is constitutively made, preventing the transcription of enzymes. The lac operon is in its basal state with no enzyme production.
When lactose levels increase, β-galactosidase converts lactose into glucose (for fuel) and allolactose. Allolactose acts as an inducer and reflects lactose levels. This triggers the lac operon to be induced in a negative inducible operon system, allowing lac enzyme levels to respond to lactose availability.
When glucose levels are low, cAMP levels are high. and vice versa
When glucose levels are low, CAP binds to cAMP, and together, they bind just upstream of the lac operon promoter, promoting transcription.
Low glucose levels result in high cAMP, which activates CAP to promote transcription of the lac operon, turning on lactose metabolism.
When glucose is high, cAMP levels are low, leading to catabolite repression, which represses transcription of the lac operon.
Catabolite repression occurs when glucose represses the transcription of the lac operon, even in the presence of lactose.
CAP represents positive control of the lac operon because it enhances transcription in response to low glucose and high cAMP.
Jacob and Monod used partial-diploid strains of bacteria, which contained an F plasmid carrying a second copy of the lac operon.
The F plasmid carried a second copy of the lac operon, allowing Jacob and Monod to mutate the lac operon independently and study the effects.
By using different combinations of mutations in the bacterial genome and the F plasmid, Jacob and Monod could determine whether the mutations had dominant or recessive effects on lactose metabolism.
they used mutational analysis to assess how mutations in the genome and plasmid affected lactose metabolism and tested for dominant vs. recessive effects.
Trans-acting regulation occurs when the product of Gene A (on a separate piece of DNA) affects the transcription of Gene C.
cis-regulation involves a DNA sequence (such as Region B) located on the same DNA molecule that affects the transcription of Gene C.
If Gene A product affects Gene C transcription, it is considered a trans-acting factor.
If a DNA sequence affects Gene C transcription, it is considered a cis-regulating element.
Mutations in lacZ affect the amount of β-galactosidase produced, and mutations in lacY affect the amount of permease produced. These mutations affect the two enzymes independently.
Mutations in lacI affect both β-galactosidase and permease levels, with some mutations causing lac proteins to be produced constantly, regardless of lactose levels. This is called a constitutive mutant.
If mutations occur in only one copy of lacI, both lac operons function normally, indicating that the lacI+ gene is dominant over the lacI- gene.
It suggests that lacI is a trans-acting factor that encodes the regulator protein for the lac operon.
some constitutive mutations, lac proteins are produced regardless of lactose availability- evident only if the structural genes on the same DNA were not mutated. These mutations occur in the lacO operator, preventing the binding of the regulator protein.- cis regulating elements
lacO mutations are cis-regulating elements because they affect only the lac operon on the same DNA molecule (plasmid or genome), unlike lacI, which is a trans-acting factor.
The operon is ON in both low and high lactose conditions, exhibiting constitutive expression.
The operon is OFF when lactose is low, but ON when lactose is high, showing normal regulation of the lac operon.
lacO mutations cause the operon to be ON regardless of lactose availability, resulting in constitutive expression of lac proteins.