Genome Structure Flashcards Preview

Genomics > Genome Structure > Flashcards

Flashcards in Genome Structure Deck (60)
Loading flashcards...
1
Q

What is DNA?

A

DNA is deoxyribonucleic acid

Macromolecule consisting of a linear strand of nucleotides

2
Q

How does DNA differ from RNA?

A

deoxyribose does not have the 2’-hydroxyl group of ribose

3
Q

How is DNA double stranded?

A

Single linear strands bind to complementary strands to form double-stranded DNA

4
Q

Why does DNA have a negative charge?

A

The phosphate group is negatively charged, this is what gives nucleic acids their overall negative charge as sugars and bases are neutral

5
Q

How are the carbons on the deoxyribose sugar identified?

A

5’ and 3’ carbons are indicated – numbering starts at the carbon closest to the base

6
Q

Which direction is the DNA strand read?

A

Sequence is 5’->3’ by convention

7
Q

Why is the DNA strand 5’-3’?

A

5’ (5 prime) and 3’ (3 prime) are numbered based on the carbon atoms of the sugar

The base is attached to the first carbon, the phosphate links between the 3’ of one sugar and the 5’ of the adjacent sugar

8
Q

How does DNA maintain its double stranded structure?

A

Hydrogen bonding occurs between base pairs

A-T (2H bonds) , G-C (3H bonds)

9
Q

What direction do the 2 DNA strands of the duplex run?

A

The 2 strands run antiparallel (5’-3’) and the other (3’-5’)

10
Q

Describe the structure of DNA in 3D

A
  • Two antiparallel strands of DNA
  • Bases “stacked”
  • Two grooves
    - Major
    - Minor
11
Q

How much length of DNA is found in our body?

A
There is ~2m of DNA in a nucleated cell
37.2 trillion cells in your body
That is 7.44x1013 metres of DNA
This equals 250 journeys to the sun and back!
The average cell is 50µm in diameter
12
Q

How do we fit 2m of DNA in each 50µm cell?

A

DNA strand wound around histones to form nucleosomes.
Nucleosomes are wound further into chromatin and then into extended chromosomes. They’re then looped and into the full chromosome.

13
Q

What are histones?

A

Basic proteins that bind DNA

14
Q

Describe the histone structure DNA wraps around?

A

Eight histones form the nucleosome
2A,2B, 3 & 4 (2 copies of each)
Histone 1 binds the linker DNA (piece of DNA between nucleosomes)

15
Q

How is the structure of chromosomes determined?

A

Chromosomes come in different shapes depending on where the centrosome is found.

16
Q

What are the 3 types of chromosome structure?

A

Metacentric
Subcentric
Acrocentric

17
Q

What is a metacentric chromosome?

A

X-shaped chromosomes, with the centromere in the middle; two arms of the chromosomes are almost equal

18
Q

What is a submetacentric chromosome?

A

off centre, centromere located near middle; chromosomal arms (i.e. p and q arms) are slightly unequal in length and may form L-shape

19
Q

What is an acrocentric chromosome?

A

centromere is located quite near one end of the chromosome so don’t have the short arms

20
Q

What is karyotyping?

A

process of pairing and ordering all the chromosomes of an organism, thus providing a genome-wide snapshot of an individual’s chromosomes

21
Q

What is the exome?

A

The coding Genome
sum of all the gene sequences is called the exome
Some definitions just use the coding sequences (~37 Mbp – 1.2% of genome)
Some definitions use the whole gene sequences (~60Mbp – 2% of genome)

22
Q

What does the primary DNA sequence include?

A
  • encodes all the gene products necessary for an
    organism
  • includes a large number of regulatory signals
  • Much of the DNA sequence doesn’t have a function yet
23
Q

What is a gene?

A

A gene is all the DNA transcribed into RNA as well as all the cis-linked (local) control regions

24
Q

What is the role of cis-linked control regions?

A

Cis-linked (local) control regions are required to ensure quantitatively appropriate tissue-specific expression of the final protein

25
Q

What is meant by cis-linked?

A

Cis-linked just means that these are regions physically close to the exons on the DNA strand. Contrast with trans-regulatory regions that can be on different chromosomes

26
Q

Explain how genome differs from cell to cell

A

All nucleated somatic cells contain the same genome

27
Q

Describe the human genome

A
  • Human genome size is 3 x 109 base pairs – 3Gbp
  • Contains 19-20 000 genes.
  • > 2% of DNA is gene
  • Genome size isn’t strongly related to complexity of
    organisms,
    e.g., marbled lungfish is 130Gbp and Paris japonica is
    149Gbp
28
Q

How big are genes in size?

A

Genes are often very different in size
e.g.
globin = 1.8kb
dystrophin = 2.4Mb

29
Q

What is the role of intergenic regions?

A

Intergenic regions contain sequences of no known function

e.g. as repetitive DNA, endogenous retroviruses, pseudogenes

30
Q

How are genes organised within the genome?

A

Genes often cluster in families – e.g. globin clusters

- allows for coordinate gene regulation
- may just reflect evolutionary history
31
Q

What is required for transcription to occur?

A

Recognition sequences are required in DNA that lie outside the transcribed region

32
Q

What is the role of promoters during transcription?

A

Promoters recruit RNA polymerase to a DNA template

33
Q

What is the role of RNA Polymerase during transcription?

A

RNA polymerase binds asymmetrically and can only move 5’ to 3’

34
Q

Outline the structure of a gene from 5’- 3’ end

A

5’ end
Promoter
- CAAT box
- TATA box

Transcription initiation
5’ UTR
Translation initiation (ATG)

Exons + Introns

Translation termination
3’UTR
Transcription termination

3’ end

35
Q

What two sequences form the gene promoter?

A

Regulatory CAAT box

Recruting TATA box

36
Q

What is the role of the CAAT box?

A

Regulatory element

regulates the recruitment of RNA Polymerase

37
Q

What is the role of the TATA box?

A

Needed to recruit general Transcription factors and RNA polymerase

38
Q

Explain what is involved in transcription?

A

DNA is transcribed using RNA polymerases

39
Q

What 3 polymerases do eukaryotic cells use?

A

RNA polymerase I
- needed to transcribe rRNA genes

RNA polymerase II
- needed to transcribe mRNA

RNA polymerase III
- needed to transcribe tRNA and other small RNAs

40
Q

What is the significance of transcription factors?

A

Eukaryotic RNA polymerases are unable to recognise promoters efficiently without help.

41
Q

Outline the process of transcription

A
  1. RNA pol. recruited (closed complex)
  2. DNA helix locally unwound (open complex)
  3. RNA synthesis begins
  4. Elongation occurs
  5. Termination
  6. RNA polymerase dissociates
42
Q

How are introns distributed among genes?

A

Vary in number – 0 to 311
Vary in size - 30bp to 1Mbp
Some introns contain other genes

43
Q

How long does transcription take?

A

average RNAP II elongation rate of 60 bases per second so transcription of some long introns lasts many hours

44
Q

What regulatory regions are required during transcription?

A

Enhancers
Silencers
Insulators

45
Q

What is the role of enhancers?

A

upregulate gene expression – short sequences that can be in the gene or many kilobases distant
They are targets for transcription factors (activators)

46
Q

What is the role of Silencers?

A

downregulate gene expression. They are also position-independent and are also targets for transcription factors (repressors)

47
Q

What is the role of insulators in gene transcription?

A

short sequences that act to prevent enhancers/silencers influencing other genes

48
Q

Explain how eukaryotic mRNA is modified after transcription

A
  • Capped at 5’ end
  • Polyadenylated at 3’ end
  • Intervening sequences (introns) removed
49
Q

What is the 5’ cap?

A

Soon after RNA polymerase begins transcription a methylated cap (7-methylguanosine) is added to the 5’ end. Makes it resistant to digestion by nucleic enzymes.
Transcription then continues to completion.

50
Q

What modification structures are present at the 3’ end?

A

G/U is a Guanine/Uracil rich region

PAP is polyadenylate polymerase

51
Q

Outline the modifications that occur at the 3’ end of the mRNA strand

A

Cleavage factors bind to G/U region and PolyA sequences to signal cleavage should occur.
The Gu region and everything after it is removed
Polyadenylate polymerase adds many A bases ~250 to protect the end from degradation

52
Q

What is the final modification to mRNA after transcription occurs?

A

RNA is transcribed, cap and poly A tail added, then introns are spliced out by the spliceosome.

53
Q

What is the spliceosome?

A

large and complex molecular machine made up of ~150 proteins

removes introns from a transcribed pre-mRNA

54
Q

How does the spliceosome remove introns?

A

Spliceosome brings the ends of the exons together and cleaves the intron from in between then joining the adjacent exons together.

Lariat structure is then degraded

55
Q

What is alternative splicing?

A

Exons can be skipped or added so variations of a protein (called isoforms) can be produced from the same gene

56
Q

What is the significance of splicing?

A

Splicing targets mRNAs for nuclear export

Further signalling complexes join once the strand has been spliced

57
Q

What export signals bind to the spliced mRNA?

A

Exon Junction Complex
TREX export complex
These all signal the compound out of the nucleus and into the ER ready for translation by the ribosomes

58
Q

What are pseudogenes?

A

These are genes that have been at least partially inactivated by the loss or gain of sequence that disrupt their correct transcription and/or translation

59
Q

Give an example of pseudogenes

A

For example, glucocerebrosidase has an adjacent pseudogene that only differs in the coding region by one 55bp deletion and a few single base changes

60
Q

Why do psueodgenes lack promoters and exons?

A

Processed pseudogenes have no promoter or exons as they are copied from mRNA by retrotransposition