Everything about how your body is built and run traces back to a four-letter code written in your DNA. This deep dive starts at the foundation: how that code is structured, how it's read, and how it becomes the proteins that do the work of life. (You met this briefly in Foundations — here we go deeper.)
Learning Objectives
- •Understand DNA's structure and the genetic code
- •Distinguish genes, the genome, and proteins
- •Trace the 'central dogma': DNA → RNA → protein
DNA: a four-letter code in a double helix
DNA is a long molecule shaped like a twisted ladder (the double helix). Its rungs are made of four chemical 'letters' — the bases A, T, C, and G — that pair specifically (A with T, C with G). The SEQUENCE of these letters is information, like text written in a four-character alphabet. Your complete set of DNA — your GENOME — contains about 3 billion of these letter-pairs, copied in nearly every cell.
Genes: the meaningful segments
A GENE is a segment of DNA that contains the instructions to build a specific PROTEIN (or functional molecule). Humans have roughly 20,000 protein-coding genes — but strikingly, these make up only about 1–2% of the genome. Much of the rest, once dismissed as 'junk', includes crucial regulatory regions that control WHEN and WHERE genes are switched on. The genome is far more than just a list of protein recipes.
The central dogma: DNA → RNA → protein
How does a gene become a protein? Through the 'central dogma' of molecular biology, in two steps. TRANSCRIPTION copies a gene's DNA into a messenger molecule called RNA. TRANSLATION then reads that RNA, three letters (a 'codon') at a time, and assembles the corresponding chain of amino acids into a protein. DNA is the master blueprint kept safe in the nucleus; RNA is the working copy; protein is the finished product that does the job.
DNA (master blueprint, in the nucleus) │ TRANSCRIPTION (copy gene → messenger RNA) ▼ RNA (working copy) │ TRANSLATION (read codons → assemble amino acids) ▼ PROTEIN (does the work of the cell)
Why mRNA vaccines were a triumph of the central dogma
mRNA vaccines work by delivering a piece of RNA — the 'working copy' step of the central dogma — instructing your cells to briefly make one harmless viral protein, which your immune system then learns to recognize. It's a direct, elegant application of exactly this DNA→RNA→protein logic: skip the DNA, supply the RNA message, and let the cell's own machinery make the protein. Understanding the dogma is understanding how that technology works.
DNA & genes, by the numbers
- ▸DNA uses a four-letter code (A, T, C, G) that pairs A-T and C-G
- ▸Your genome is ~3 billion base pairs, copied in nearly every cell
- ▸Humans have ~20,000 protein-coding genes — only ~1–2% of the genome
- ▸Central dogma: DNA is transcribed to RNA, which is translated to protein
Almost all of your DNA codes for proteins.
Only about 1–2% of the human genome codes for proteins. Much of the rest, once called 'junk DNA', contains essential regulatory regions that control when and where genes are expressed — so the non-coding genome is far from useless.
Quick Check
What is a gene?
Quick Check
What is the correct order of the 'central dogma'?
True or False
Only about 1–2% of the human genome codes for proteins.
Summary
- →DNA is a four-letter code (A, T, C, G) in a double helix; your genome is ~3 billion base pairs
- →Genes are DNA segments coding for proteins — ~20,000 of them, just 1–2% of the genome
- →The central dogma: DNA → RNA → protein (transcription, then translation)
- →Non-coding DNA includes crucial gene-regulation regions
If we all share the same code, why are we so different? Next: genetic variation and heritability.