Molecular Biology Interview Questions
DNA, RNA, protein synthesis, gene expression, and molecular mechanisms
1 What is the structure of DNA and who discovered it?
Easy
What is the structure of DNA and who discovered it?
DNA is a double helix structure consisting of two antiparallel strands of nucleotides connected by hydrogen bonds between complementary base pairs (A-T with 2 bonds, G-C with 3 bonds). James Watson and Francis Crick proposed this model in 1953, building on X-ray crystallography data from Rosalind Franklin. The sugar-phosphate backbone runs on the outside while the bases face inward, creating major and minor grooves important for protein binding.
2 What are the key differences between DNA and RNA?
Easy
What are the key differences between DNA and RNA?
DNA contains deoxyribose sugar while RNA contains ribose (with an extra hydroxyl group at 2' position), making RNA less stable. DNA uses thymine (T) as a base while RNA uses uracil (U). DNA is typically double-stranded and serves as long-term genetic storage, while RNA is usually single-stranded and functions in protein synthesis, gene regulation, and catalysis. DNA is found primarily in the nucleus, while RNA is synthesized in the nucleus and functions largely in the cytoplasm.
3 What is the central dogma of molecular biology?
Easy
What is the central dogma of molecular biology?
The central dogma describes the flow of genetic information in cells: DNA is transcribed into RNA, which is then translated into protein (DNA -> RNA -> Protein). Francis Crick proposed this concept in 1958. While this represents the primary information flow, exceptions exist such as reverse transcription (RNA to DNA in retroviruses) and RNA replication in RNA viruses. The dogma emphasizes that information flows from nucleic acids to proteins but not in reverse.
4 What is transcription and what enzyme catalyzes it?
Easy
What is transcription and what enzyme catalyzes it?
Transcription is the process of synthesizing RNA from a DNA template, representing the first step in gene expression. RNA polymerase is the enzyme that catalyzes transcription by reading the template strand 3' to 5' and synthesizing RNA 5' to 3'. In prokaryotes, a single RNA polymerase handles all transcription, while eukaryotes have RNA Pol I (rRNA), RNA Pol II (mRNA), and RNA Pol III (tRNA and small RNAs). The process involves initiation at promoters, elongation, and termination.
5 What is translation and where does it occur?
Easy
What is translation and where does it occur?
Translation is the process of synthesizing proteins from mRNA, occurring on ribosomes in the cytoplasm. The ribosome reads the mRNA sequence in codons (three nucleotides) and matches them with complementary anticodons on tRNA molecules carrying specific amino acids. Translation proceeds through initiation (ribosome assembly at start codon AUG), elongation (amino acid chain growth), and termination (at stop codons UAA, UAG, UGA). The process requires mRNA, tRNA, ribosomes, and various protein factors.
Get IIT Jammu PG Certification
Master these concepts with 175+ hours of industry projects and hands-on training.
6 What are codons and how many amino acids do they encode?
Easy
What are codons and how many amino acids do they encode?
Codons are three-nucleotide sequences in mRNA that specify particular amino acids during translation. With 4 nucleotides possible at each position, there are 64 possible codons (4^3). These encode 20 standard amino acids plus 3 stop signals (UAA, UAG, UGA), making the genetic code degenerate (redundant) - multiple codons can specify the same amino acid. AUG serves as both the start codon and codes for methionine. The code is nearly universal across all organisms with minor variations in mitochondria and some organisms.
7 What is semiconservative DNA replication?
Easy
What is semiconservative DNA replication?
Semiconservative replication means that each new DNA molecule consists of one original (parental) strand and one newly synthesized strand. This was demonstrated by Meselson and Stahl in 1958 using density gradient centrifugation with heavy nitrogen isotopes. During replication, the double helix unwinds at origins of replication, and each strand serves as a template for synthesizing a complementary strand. This mechanism ensures accurate transmission of genetic information and was predicted by Watson and Crick based on DNA structure.
8 What is the role of primers in DNA replication?
Easy
What is the role of primers in DNA replication?
Primers are short RNA sequences (about 10 nucleotides) synthesized by primase that provide a free 3'-OH group for DNA polymerase to begin synthesis. DNA polymerase cannot initiate new strand synthesis de novo and can only add nucleotides to an existing 3' end. On the leading strand, only one primer is needed, while the lagging strand requires multiple primers for each Okazaki fragment. After DNA synthesis, primers are removed by RNase H or DNA polymerase I and replaced with DNA by repair mechanisms.
9 What post-transcriptional modifications occur to eukaryotic mRNA?
Easy
What post-transcriptional modifications occur to eukaryotic mRNA?
Eukaryotic pre-mRNA undergoes three major modifications: 5' capping (addition of 7-methylguanosine cap that protects from degradation and aids ribosome binding), 3' polyadenylation (addition of poly-A tail of 100-250 adenines for stability and export), and splicing (removal of introns and joining of exons by the spliceosome). These modifications occur in the nucleus before export to the cytoplasm and are essential for mRNA stability, nuclear export, and efficient translation.
10 What is the difference between introns and exons?
Easy
What is the difference between introns and exons?
Exons are the coding sequences of a gene that are retained in mature mRNA and translated into protein, while introns are intervening non-coding sequences that are removed during RNA splicing. Eukaryotic genes typically contain multiple introns and exons, with introns often much longer than exons. Alternative splicing allows different combinations of exons to produce multiple protein isoforms from a single gene. Prokaryotes generally lack introns, though some are found in archaeal and bacterial genes.
11 What is a promoter and what is its function?
Easy
What is a promoter and what is its function?
A promoter is a DNA sequence located upstream (5') of a gene that serves as the binding site for RNA polymerase and transcription factors to initiate transcription. In prokaryotes, common promoter elements include the -10 box (TATAAT, Pribnow box) and -35 box recognized by sigma factor. Eukaryotic promoters contain elements like the TATA box, initiator element, and various regulatory sequences. The strength of a promoter determines the frequency of transcription initiation and thus gene expression levels.
12 What is gel electrophoresis and how does it work?
Easy
What is gel electrophoresis and how does it work?
Gel electrophoresis is a technique for separating DNA, RNA, or proteins based on size and charge by applying an electric field across a gel matrix (agarose or polyacrylamide). Nucleic acids, being negatively charged due to phosphate groups, migrate toward the positive electrode (anode). Smaller molecules move faster through the gel pores, resulting in size-based separation. DNA is visualized using fluorescent dyes like ethidium bromide or SYBR Safe, and size is determined by comparison to molecular weight markers.
13 What are restriction enzymes and how are they used?
Easy
What are restriction enzymes and how are they used?
Restriction enzymes (restriction endonucleases) are bacterial enzymes that cut DNA at specific recognition sequences, typically 4-8 base pairs long. They serve as a bacterial defense mechanism against viral DNA. Type II restriction enzymes, most commonly used in molecular biology, cut at or near their recognition sites producing either blunt ends or sticky (cohesive) ends with overhangs. They are essential tools for DNA cloning, mapping, and genetic engineering, allowing precise cutting and recombination of DNA fragments.
14 What is DNA ligase and what is its function?
Easy
What is DNA ligase and what is its function?
DNA ligase is an enzyme that catalyzes the formation of phosphodiester bonds between the 3'-OH and 5'-phosphate ends of DNA strands, joining DNA fragments together. It is essential for DNA replication (joining Okazaki fragments), DNA repair, and recombinant DNA technology. T4 DNA ligase, derived from bacteriophage T4, is commonly used in molecular cloning to ligate DNA inserts into vectors. The enzyme requires ATP (or NAD+ in bacteria) as a cofactor for the ligation reaction.
15 What is an operon and how does the lac operon work?
Easy
What is an operon and how does the lac operon work?
An operon is a cluster of genes under control of a single promoter, common in prokaryotes, that are transcribed together as a polycistronic mRNA. The lac operon in E. coli contains genes for lactose metabolism (lacZ, lacY, lacA) and is regulated by a repressor protein and CAP activator. In the absence of lactose, the repressor binds the operator blocking transcription. When lactose is present, it is converted to allolactose which binds and inactivates the repressor, allowing transcription. This represents negative inducible regulation.
3,000+ Engineers Placed at Top Companies
Join Bosch, Tata Motors, L&T, Mahindra and 500+ hiring partners.
16 Explain the role of transcription factors in eukaryotic gene expression.
Medium
Explain the role of transcription factors in eukaryotic gene expression.
Eukaryotic transcription factors (TFs) are proteins that regulate gene expression by binding to specific DNA sequences. General TFs (TFIIA, TFIIB, TFIID, TFIIE, TFIIF, TFIIH) are required for basal transcription from all promoters and assemble with RNA Pol II at the promoter. Specific TFs (activators and repressors) bind enhancers, silencers, or response elements to modulate transcription rates. They work through recruitment of coactivators, chromatin remodelers, or the mediator complex. Combinatorial control by multiple TFs allows precise spatial and temporal gene regulation.
17 How does chromatin structure affect gene expression?
Medium
How does chromatin structure affect gene expression?
Chromatin structure regulates gene accessibility. Tightly packed heterochromatin is transcriptionally silent, while open euchromatin allows active transcription. Histone modifications play key roles: acetylation (by HATs) generally activates genes by loosening chromatin, while deacetylation (by HDACs) represses. Methylation effects depend on location - H3K4me3 activates, H3K27me3 represses. Chromatin remodeling complexes (SWI/SNF, ISWI) use ATP to reposition nucleosomes, exposing or hiding regulatory sequences. DNA methylation at CpG islands typically silences genes. This epigenetic regulation is heritable and crucial for development and disease.
18 Describe the key enzymes involved in DNA replication and their functions.
Medium
Describe the key enzymes involved in DNA replication and their functions.
DNA replication requires multiple enzymes working in coordination. Helicase unwinds the double helix at replication forks. Single-strand binding proteins (SSBs) stabilize unwound strands. Primase synthesizes RNA primers. DNA polymerase III is the main replicative enzyme, synthesizing DNA 5' to 3' with proofreading ability. DNA polymerase I removes RNA primers and fills gaps. DNA ligase seals nicks between Okazaki fragments. Topoisomerases relieve supercoiling ahead of the fork. Sliding clamp (PCNA in eukaryotes) increases processivity. This coordinated enzyme system ensures rapid, accurate replication.
19 Explain the mechanism of RNA interference (RNAi).
Medium
Explain the mechanism of RNA interference (RNAi).
RNA interference is a gene silencing mechanism triggered by double-stranded RNA. The enzyme Dicer cleaves dsRNA into small interfering RNAs (siRNAs) of 21-23 nucleotides. These siRNAs are loaded into the RNA-induced silencing complex (RISC), where Argonaute proteins unwind the duplex and retain the guide strand. The guide strand directs RISC to complementary mRNA targets, leading to mRNA cleavage or translational repression. MicroRNAs (miRNAs) work similarly but typically cause translational inhibition through partial complementarity. RNAi is used for gene knockdown studies and has therapeutic potential.
20 What are molecular chaperones and how do they assist protein folding?
Medium
What are molecular chaperones and how do they assist protein folding?
Molecular chaperones are proteins that assist in the proper folding of other proteins without being part of the final structure. The Hsp70 family binds to hydrophobic regions of nascent polypeptides, preventing aggregation and allowing proper folding. Chaperonins (like GroEL/GroES in bacteria, TRiC in eukaryotes) provide isolated chambers where proteins can fold without interference. Hsp90 assists in the maturation of signaling proteins. Chaperones use ATP hydrolysis to drive conformational changes. Misfolding despite chaperone assistance can lead to diseases like Alzheimer's and Parkinson's.
21 What is alternative splicing and why is it significant?
Medium
What is alternative splicing and why is it significant?
Alternative splicing is a process where different combinations of exons are joined together during mRNA processing, allowing one gene to produce multiple protein isoforms. Types include exon skipping, alternative 5' or 3' splice sites, intron retention, and mutually exclusive exons. Over 95% of human multi-exon genes undergo alternative splicing, greatly expanding proteome diversity beyond gene number. Splicing is regulated by splicing factors (SR proteins, hnRNPs) responding to cellular signals. Dysregulation causes diseases including cancer and neurological disorders. It explains how ~20,000 genes can encode >100,000 proteins.
22 What are the major DNA repair mechanisms?
Medium
What are the major DNA repair mechanisms?
Cells employ multiple DNA repair mechanisms to maintain genomic integrity. Base excision repair (BER) removes damaged bases via glycosylases. Nucleotide excision repair (NER) removes bulky lesions like thymine dimers by excising a ~30 nucleotide stretch. Mismatch repair (MMR) corrects replication errors using MutS/MutL proteins. Double-strand break repair occurs via non-homologous end joining (NHEJ), which is error-prone but fast, or homologous recombination (HR), which is accurate but requires a template. Defects in repair pathways cause cancer predisposition syndromes like xeroderma pigmentosum (NER) and Lynch syndrome (MMR).
23 What are telomeres and what is the role of telomerase?
Medium
What are telomeres and what is the role of telomerase?
Telomeres are repetitive DNA sequences (TTAGGG in humans) at chromosome ends that protect against degradation and fusion. Due to the end-replication problem, telomeres shorten with each cell division, eventually triggering senescence or apoptosis - this limits cell lifespan (Hayflick limit). Telomerase is a ribonucleoprotein enzyme containing an RNA template and reverse transcriptase (TERT) that extends telomeres. It is active in stem cells, germ cells, and ~90% of cancers, enabling unlimited proliferation. Telomere length serves as a biological clock and potential therapeutic target for aging and cancer.
24 Describe the structure and function of ribosomes.
Medium
Describe the structure and function of ribosomes.
Ribosomes are ribonucleoprotein complexes that catalyze protein synthesis. Prokaryotic 70S ribosomes consist of 30S (16S rRNA + 21 proteins) and 50S (23S, 5S rRNA + 34 proteins) subunits. Eukaryotic 80S ribosomes have 40S and 60S subunits. The ribosome has three tRNA binding sites: A (aminoacyl, incoming), P (peptidyl, growing chain), and E (exit). The small subunit decodes mRNA, while the large subunit catalyzes peptide bond formation via its peptidyl transferase center, which is an RNA enzyme (ribozyme). Structural differences between bacterial and eukaryotic ribosomes enable antibiotic targeting.
25 Explain the basic components of cell signal transduction pathways.
Medium
Explain the basic components of cell signal transduction pathways.
Signal transduction converts extracellular signals into cellular responses through sequential molecular interactions. Key components include: receptors (G protein-coupled receptors, receptor tyrosine kinases, nuclear receptors) that detect signals; second messengers (cAMP, Ca2+, IP3, DAG) that amplify signals intracellularly; protein kinases (PKA, PKC, MAPK) that phosphorylate target proteins; phosphatases that reverse phosphorylation; and transcription factors that alter gene expression. Signal cascades amplify signals, enable integration of multiple inputs, and allow for feedback regulation. Dysregulation of signaling pathways underlies many cancers and metabolic diseases.
Harshal
Fiat Chrysler
Abhishek
TATA ELXSI
Srinithin
Xitadel
Ranjith
Core Automotive
Gaurav
Automotive Company
Bino
Design Firm
Aseem
EV Company
Puneet
Automotive Company
Vishal
EV Startup
More Success Stories
26 What are the differences between Southern, Northern, and Western blotting?
Medium
What are the differences between Southern, Northern, and Western blotting?
These blotting techniques detect different molecules. Southern blotting (named after Edwin Southern) detects specific DNA sequences: DNA is restriction-digested, separated by gel electrophoresis, transferred to a membrane, and hybridized with a labeled probe. Northern blotting (named by analogy) similarly detects RNA to analyze gene expression. Western blotting (also named by analogy) detects proteins: samples are separated by SDS-PAGE, transferred to membrane, and detected using specific antibodies. Each technique combines electrophoretic separation with specific probe detection, enabling identification of target molecules in complex samples.
27 How does real-time PCR (qPCR) enable quantification of nucleic acids?
Medium
How does real-time PCR (qPCR) enable quantification of nucleic acids?
Real-time PCR monitors DNA amplification during each cycle using fluorescent reporters. SYBR Green intercalates into double-stranded DNA, fluorescing when bound. TaqMan probes use oligonucleotides with fluorophore and quencher that separate upon polymerase cleavage. The cycle threshold (Ct) - when fluorescence exceeds background - is inversely proportional to initial template amount. Absolute quantification uses standard curves, while relative quantification compares target to reference genes (delta-delta Ct method). qPCR is essential for gene expression analysis, pathogen detection, and validating other techniques like RNA-seq.
28 What are common post-translational modifications and their functions?
Medium
What are common post-translational modifications and their functions?
Post-translational modifications (PTMs) regulate protein function after synthesis. Phosphorylation (by kinases) is the most common, regulating enzyme activity and signaling. Glycosylation adds carbohydrates affecting folding, stability, and recognition. Ubiquitination tags proteins for proteasomal degradation or alters localization and activity. Acetylation regulates histones and metabolic enzymes. Methylation affects protein-protein interactions and gene regulation. SUMOylation alters protein localization and function. Lipidation (myristoylation, palmitoylation, prenylation) anchors proteins to membranes. PTMs expand functional diversity beyond genetic coding capacity.
29 Explain the MAPK signaling cascade and its biological significance.
Medium
Explain the MAPK signaling cascade and its biological significance.
The MAPK (Mitogen-Activated Protein Kinase) cascade is a three-tier kinase pathway: MAPKKK activates MAPKK, which activates MAPK. The classical ERK pathway begins with receptor tyrosine kinase activation, recruiting Ras GTPase, which activates Raf (MAPKKK), MEK (MAPKK), then ERK (MAPK). Activated ERK phosphorylates transcription factors and other targets regulating cell proliferation, differentiation, and survival. Parallel pathways (JNK, p38) respond to stress. Signal amplification occurs at each step. MAPK pathway mutations, particularly in Ras and Raf, drive many cancers, making them important therapeutic targets.
30 What are reporter genes and how are they used in research?
Medium
What are reporter genes and how are they used in research?
Reporter genes encode easily detectable proteins used to study gene expression and regulation. Common reporters include: GFP (green fluorescent protein) for live-cell imaging, luciferase for sensitive bioluminescence assays, beta-galactosidase (lacZ) for colorimetric assays, and CAT (chloramphenicol acetyltransferase). Reporters are placed under control of promoters of interest to measure transcriptional activity, screen for regulatory elements, track protein localization, or assess transfection efficiency. Fusion proteins with reporters enable protein tracking. Reporter assays are fundamental tools in molecular biology, drug screening, and gene therapy development.
31 How is mRNA stability regulated in cells?
Medium
How is mRNA stability regulated in cells?
mRNA stability determines gene expression levels post-transcriptionally. Stabilizing elements include the 5' cap (protects from 5'-3' degradation), poly-A tail (protects from 3'-5' degradation), and specific sequences bound by stabilizing proteins. Destabilizing elements include AU-rich elements (AREs) in 3' UTR that recruit decay machinery, and nonsense-mediated decay (NMD) that eliminates transcripts with premature stop codons. MicroRNAs target mRNAs for degradation or translational repression. RNA-binding proteins (HuR, TTP) regulate stability in response to cellular signals. mRNA half-lives range from minutes to days, providing another layer of gene regulation.
32 What are cell cycle checkpoints and how do they function?
Medium
What are cell cycle checkpoints and how do they function?
Cell cycle checkpoints are surveillance mechanisms ensuring proper cell division. The G1/S checkpoint (restriction point) verifies cell size, nutrients, and DNA integrity before S phase commitment - p53 and Rb are key regulators. The G2/M checkpoint confirms DNA replication completion and repairs damage before mitosis. The spindle assembly checkpoint (SAC) ensures proper chromosome attachment before anaphase. Checkpoints work through cyclin-dependent kinases (CDKs) regulated by cyclins, CDK inhibitors (p21, p27), and phosphatases. Checkpoint failure leads to genomic instability and cancer; DNA damage activates ATM/ATR kinases that halt the cycle.
33 How does chromatin immunoprecipitation (ChIP) work?
Medium
How does chromatin immunoprecipitation (ChIP) work?
ChIP identifies DNA sequences associated with specific proteins in vivo. The procedure involves: cross-linking proteins to DNA using formaldehyde, cell lysis and chromatin shearing (sonication or enzymatic), immunoprecipitation with antibody against target protein, reversing cross-links and purifying DNA, and analyzing precipitated DNA. Detection methods include ChIP-qPCR for specific loci, ChIP-chip (microarray), or ChIP-seq (next-generation sequencing) for genome-wide analysis. ChIP is essential for studying transcription factor binding, histone modifications, and chromatin-associated proteins. Controls include input DNA and IgG negative controls.
34 Explain bacterial two-component signaling systems.
Medium
Explain bacterial two-component signaling systems.
Two-component systems are the predominant signal transduction mechanism in bacteria, consisting of a sensor histidine kinase (HK) and a response regulator (RR). The HK, often membrane-bound, autophosphorylates a conserved histidine residue upon signal detection. This phosphate is transferred to a conserved aspartate on the RR, typically activating its DNA-binding domain to alter gene expression. Examples include EnvZ-OmpR (osmolarity response), PhoR-PhoB (phosphate limitation), and CheA-CheY (chemotaxis). These systems enable rapid environmental responses and are absent in mammals, making them potential antibiotic targets.
35 What factors affect nucleic acid hybridization stringency?
Medium
What factors affect nucleic acid hybridization stringency?
Hybridization stringency refers to conditions that favor formation of perfectly matched duplexes over mismatched ones. Key factors include: temperature (higher temperature increases stringency - Tm depends on GC content and length), salt concentration (lower ionic strength increases stringency by destabilizing duplexes), formamide concentration (lowers Tm, allowing stringent hybridization at lower temperatures), and probe length (longer probes are more stable). Stringency is adjusted through wash conditions in techniques like Southern blots, microarrays, and FISH. High stringency ensures specific detection, while low stringency detects related sequences.
36 How do you analyze and minimize CRISPR off-target effects in gene editing experiments?
Hard
How do you analyze and minimize CRISPR off-target effects in gene editing experiments?
Off-target analysis employs computational and experimental approaches. Computational tools (Cas-OFFinder, CRISPOR, CHOPCHOP) predict off-target sites based on sequence similarity, considering PAM proximity and position-dependent mismatch tolerance. Experimental validation includes targeted amplicon sequencing of predicted sites, GUIDE-seq (using dsODN integration), DISCOVER-seq (detecting DNA repair), CIRCLE-seq, or BLESS for unbiased genome-wide analysis. Minimization strategies include using high-fidelity Cas9 variants (eSpCas9, HiFi Cas9), truncated guides, paired nickases, or base editors. RNP delivery reduces exposure time. Validation in relevant cell types is essential as chromatin accessibility affects targeting.
37 Explain the workflow and analytical challenges in single-cell RNA sequencing.
Hard
Explain the workflow and analytical challenges in single-cell RNA sequencing.
scRNA-seq workflow involves cell isolation (FACS, microfluidics like 10x Genomics), lysis, reverse transcription with unique molecular identifiers (UMIs), amplification, library preparation, and sequencing. Computational analysis includes quality control (filtering low-quality cells, doublets), normalization (accounting for library size, dropout events), dimensionality reduction (PCA, t-SNE, UMAP), clustering (Louvain, Leiden algorithms), differential expression, trajectory inference, and cell-type annotation. Challenges include dropout events (technical zeros), batch effects, and computational scalability. Integration methods (Seurat, Harmony) combine datasets. Interpretation requires biological context and validation of novel cell populations.
38 How do you design and interpret epigenome-wide association studies (EWAS)?
Hard
How do you design and interpret epigenome-wide association studies (EWAS)?
EWAS design requires careful consideration of tissue relevance (target vs surrogate), sample size (effect sizes typically small), array vs sequencing platforms (Illumina EPIC for coverage, WGBS for resolution), and potential confounders. Analysis workflow includes quality control (detection p-values, beta value distribution), normalization (SWAN, functional normalization), batch effect correction (ComBat, SVA), statistical testing with multiple testing correction (FDR), and cell-type composition adjustment (reference-based deconvolution or RefFreeEWAS). Interpretation challenges include reverse causation (disease causing methylation changes), distinguishing cause from consequence, and functional validation. Replication in independent cohorts and integration with GWAS enhances findings.
39 How has AlphaFold changed protein structure prediction and what are its limitations?
Hard
How has AlphaFold changed protein structure prediction and what are its limitations?
AlphaFold2 achieves near-experimental accuracy for many protein structures using deep learning on sequence-structure relationships, multiple sequence alignments, and attention mechanisms. It revolutionized structural biology by providing predictions for most known proteins (AlphaFold DB). However, limitations include: uncertainty with proteins lacking homologs, difficulty with intrinsically disordered regions, inability to predict effects of post-translational modifications, challenges with protein-ligand complexes (though AlphaFold-Multimer addresses some protein-protein interactions), no direct dynamics or conformational flexibility information, and potential bias toward crystallographic conformations. Experimental validation remains essential for novel targets. Integration with molecular dynamics and cryo-EM fitting extends utility.
40 What are the advantages of long-read sequencing and how do you analyze such data?
Hard
What are the advantages of long-read sequencing and how do you analyze such data?
Long-read platforms (PacBio HiFi, Oxford Nanopore) produce reads >10kb, enabling resolution of structural variants, repetitive regions, full-length transcript isoforms, and direct methylation detection. PacBio HiFi achieves >Q30 accuracy through circular consensus sequencing; Nanopore enables real-time sequencing and longest reads (>1Mb). Analysis tools differ from short-read: alignment (minimap2, NGMLR), assembly (Flye, hifiasm), variant calling (PEPPER-Margin-DeepVariant, Sniffles for SVs), and phasing. Challenges include higher error rates (especially nanopore), cost per base, and computational requirements. Applications include de novo assembly, resolving complex genomic regions (HLA, centromeres), full-length isoform sequencing (Iso-Seq), and epigenetic profiling.
41 How does ribosome profiling work and what insights does it provide?
Hard
How does ribosome profiling work and what insights does it provide?
Ribosome profiling (Ribo-seq) provides genome-wide, nucleotide-resolution snapshots of translation. The technique involves arresting translation, RNase digestion to retain only ribosome-protected fragments (RPFs, ~28-32nt), ribosome isolation, RNA extraction, library preparation, and deep sequencing. Analysis reveals: actively translated ORFs (including uORFs, overlapping ORFs, novel peptides), translation efficiency (Ribo-seq/RNA-seq ratio), ribosome pausing sites, reading frame, and translation dynamics. Challenges include library preparation biases, distinguishing true signal from contamination, and statistical analysis of differential translation. Applications include identifying functional micropeptides, understanding translational regulation, and drug mechanism studies (cycloheximide vs other inhibitors reveal different aspects).
42 How does liquid-liquid phase separation contribute to gene regulation?
Hard
How does liquid-liquid phase separation contribute to gene regulation?
Biomolecular condensates formed through liquid-liquid phase separation (LLPS) create membraneless organelles that concentrate specific molecules. In gene regulation, transcription factors with intrinsically disordered regions (IDRs) phase separate to form transcriptional condensates at super-enhancers, concentrating the transcription machinery. Mediator and RNA Pol II CTD participate in these condensates. Phase separation also organizes heterochromatin (HP1 condensates), nucleoli (rRNA processing), Cajal bodies, and stress granules. Regulated by post-translational modifications, protein-RNA interactions, and cellular conditions. Disruption of phase separation properties by mutations can cause disease. Experimental approaches include FRAP, optogenetic tools, and in vitro reconstitution. This paradigm adds spatial organization to gene regulation understanding.
43 How do you design and optimize synthetic gene circuits?
Hard
How do you design and optimize synthetic gene circuits?
Synthetic gene circuit design involves defining function (toggle switch, oscillator, logic gate), selecting genetic parts (promoters, RBS, terminators, regulators from registries), mathematical modeling (ODEs for deterministic, stochastic for low copy number), simulation (COPASI, MATLAB), and iterative optimization. Key considerations include modularity (insulated parts), dynamic range, orthogonality (non-interfering components), context effects (metabolic burden, resource competition), and noise. Design-Build-Test-Learn cycle uses combinatorial assembly (Golden Gate, Gibson), high-throughput screening (FACS, microfluidics), and characterization. Advanced approaches include automated design (Cello), machine learning optimization, and cell-free prototyping. Applications span biosensors, metabolic engineering, and therapeutic cells.
44 How do you study 3D genome organization and its functional implications?
Hard
How do you study 3D genome organization and its functional implications?
3D genome organization is studied through chromosome conformation capture techniques: 3C (one-to-one), 4C (one-to-all), 5C (many-to-many), Hi-C (all-to-all genome-wide), and Micro-C (nucleosome resolution). Analysis reveals hierarchical organization: chromosome territories, A/B compartments (active/inactive), topologically associating domains (TADs), and chromatin loops. TAD boundaries often contain CTCF and cohesin. Integration with ChIP-seq, ATAC-seq, and expression data links structure to function. Enhancer-promoter loops within TADs regulate gene expression. Disruption of TAD boundaries can cause developmental disorders (limb malformations) and cancer by enabling aberrant enhancer-promoter contacts. Imaging approaches (FISH, live-cell imaging) complement sequencing methods for validation.
45 How does cryo-EM enable structure determination and what are the key considerations?
Hard
How does cryo-EM enable structure determination and what are the key considerations?
Cryo-electron microscopy determines structures by imaging frozen-hydrated samples and computationally reconstructing 3D density maps. Sample preparation involves rapid freezing in vitrified ice to preserve native state. Data collection on modern detectors (K3, Falcon) with dose fractionation and motion correction (MotionCor2) achieves high resolution. Image processing includes CTF estimation, particle picking (template or neural network-based), 2D/3D classification, and refinement (RELION, cryoSPARC). Challenges include preferred orientation, sample heterogeneity, small particle size (<150 kDa typically difficult), and beam-induced motion. Resolution is validated by FSC curves and map-model correlation. Cryo-EM excels for large complexes, membrane proteins, and conformational heterogeneity analysis, complementing X-ray crystallography.
46 How do you integrate multi-omics data to understand biological systems?
Hard
How do you integrate multi-omics data to understand biological systems?
Multi-omics integration combines genomics, transcriptomics, proteomics, metabolomics, and epigenomics to provide holistic understanding. Strategies include: late integration (analyze separately, combine at interpretation), intermediate integration (correlation networks, pathway enrichment across omics), and early integration (matrix factorization, machine learning on combined data). Methods include canonical correlation analysis (CCA), partial least squares (PLS), MOFA, DIABLO, and deep learning approaches. Network-based integration maps omics data onto interaction networks (STRING, Reactome). Challenges include data heterogeneity, missing values, different scales and distributions, and causal inference. Applications span disease subtyping, biomarker discovery, and understanding regulatory mechanisms. Validation requires functional experiments.
47 How is optogenetics used to study cell signaling dynamics?
Hard
How is optogenetics used to study cell signaling dynamics?
Optogenetics enables precise spatiotemporal control of signaling using light-activated proteins. Common tools include: LOV domains for dimerization (iLID, TULIP), CRY2-CIB for clustering, PhyB-PIF for reversible interactions, and optoRas/optoERK for direct pathway activation. Applications include dissecting pathway dynamics, determining input-output relationships, creating synthetic circuits, and studying spatially confined signals. Design considerations include activation wavelength (avoiding phototoxicity), reversibility kinetics, dark-state activity, and expression levels. Quantitative microscopy enables precise light delivery and response measurement. Optogenetics revealed importance of signaling dynamics (ERK pulse frequency encoding) and enabled synthetic morphogenesis studies. Combined with computational modeling, it allows testing mechanistic hypotheses.
48 Compare base editing and prime editing with traditional CRISPR-Cas9.
Hard
Compare base editing and prime editing with traditional CRISPR-Cas9.
Traditional CRISPR-Cas9 creates double-strand breaks (DSBs), relying on cellular repair (NHEJ for knockouts, HDR for precise edits requiring donor template). Base editors fuse catalytically impaired Cas9 to deaminases enabling C>T or A>G transitions without DSBs: cytosine base editors (CBE) deaminate C to U; adenine base editors (ABE) deaminate A to I (read as G). Prime editing uses nickase Cas9 fused to reverse transcriptase with pegRNA containing edit template, enabling all transition and transversion mutations, small insertions, and deletions without DSBs or donor templates. Efficiency varies by edit type and cell context. Base/prime editing have lower off-target risks and indel rates but limited edit scope compared to HDR. Selection depends on edit requirements and application.
49 How does proteostasis network dysfunction contribute to disease?
Hard
How does proteostasis network dysfunction contribute to disease?
The proteostasis network maintains protein homeostasis through coordinated protein synthesis, folding, trafficking, and degradation. Key components include molecular chaperones (Hsp70, Hsp90, chaperonins), the unfolded protein response (UPR in ER, mitochondrial UPR), the ubiquitin-proteasome system (UPS), and autophagy. Network decline with aging or genetic mutations causes protein aggregation diseases: Alzheimer's (amyloid-beta, tau), Parkinson's (alpha-synuclein), Huntington's (polyQ expansions), and ALS (SOD1, TDP-43). Cancer cells co-opt proteostasis for survival under stress. Therapeutic strategies include pharmacological chaperones, proteostasis regulators (HSF1 activators, UPR modulators), proteasome inhibitors (cancer), and autophagy modulators. Systems biology approaches map proteostasis network and identify intervention points.
50 How does spatial transcriptomics provide insights beyond single-cell RNA-seq?
Hard
How does spatial transcriptomics provide insights beyond single-cell RNA-seq?
Spatial transcriptomics retains spatial context lost in dissociated single-cell approaches. Technologies include: imaging-based methods (MERFISH, seqFISH+, FISH-based with multiplexed probes for ~1000 genes) and sequencing-based methods (10x Visium with 55um resolution, Slide-seq, HDST approaching single-cell resolution). Analysis involves image registration, spot/cell deconvolution for multi-cell spots, spatial clustering, identifying spatially variable genes, ligand-receptor interaction inference, and integration with scRNA-seq for cell-type annotation. Insights include tissue architecture, cell-cell communication in native context, and disease-specific microenvironments. Challenges include resolution-throughput tradeoff, depth of detection, and computational methods for spatial statistics. Applications span development, cancer microenvironment, and neuroanatomy.