NSD1

Chr 5AD

nuclear receptor binding SET domain protein 1

Also known as: ARA267, KMT3B, SOTOS, SOTOS1, STO

This gene encodes a protein containing a SET domain, 2 LXXLL motifs, 3 nuclear translocation signals (NLSs), 4 plant homeodomain (PHD) finger regions, and a proline-rich region. The encoded protein enhances androgen receptor (AR) transactivation, and this enhancement can be increased further in the presence of other androgen receptor associated coregulators. This protein may act as a nucleus-localized, basic transcriptional factor and also as a bifunctional transcriptional regulator. Mutations of this gene have been associated with Sotos syndrome and Weaver syndrome. One version of childhood acute myeloid leukemia is the result of a cryptic translocation with the breakpoints occurring within nuclear receptor-binding Su-var, enhancer of zeste, and trithorax domain protein 1 on chromosome 5 and nucleoporin, 98-kd on chromosome 11. Multiple transcript variants encoding distinct isoforms have been identified for this gene. [provided by RefSeq, Sep 2018]

Primary Disease Associations & Inheritance

Sotos syndromeMIM #117550
AD
UniProtBeckwith-Wiedemann syndrome
568
ClinVar variants
105
Pathogenic / LP
1.00
pLI score· haploinsufficient
1
Active trials
Clinical SummaryNSD1
🧬
Gene-Disease Validity (ClinGen)
Sotos syndrome · ADDefinitive

Definitive — sufficient evidence for diagnostic panels

Population Constraint (gnomAD)
Highly constrained gene — heterozygous loss-of-function variants are very rare in the population (pLI 1.00). One damaged copy is likely sufficient to cause disease.
📋
ClinVar Variants
105 Pathogenic / Likely Pathogenic· 259 VUS of 568 total submissions
💊
Clinical Trials
1 active or recruiting trial — potential therapeutic options may be available

Population Genetics & Constraint

gnomAD v4 — loss-of-function & missense intolerance

Dual constrained — LoF & missense intolerant
LoF Constraint?LOEUF (Loss-of-function Observed/Expected Upper bound Fraction) is the upper bound of the 90% CI for LoF OE — the preferred gnomAD v4 metric. Lower = more intolerant to LoF. LOEUF < 0.35 = highly constrained.
0.10LOEUF
pLI 1.000
Z-score 9.31
OE 0.05 (0.020.10)
Highly constrained

Among the most LoF-intolerant genes (~top 3%)

Missense Constraint?Missense Z-score: standard deviations fewer missense variants observed vs. expected. Z > 3.09 (p < 0.001) = gene does not tolerate missense variation. OE missense < 0.6 is also considered constrained.
3.41Z-score
OE missense 0.75 (0.710.79)
1065 obs / 1427.7 exp
Constrained

Highly missense-constrained (top ~0.1%)

Observed / Expected Ratios?Shaded band = 90% confidence interval. Vertical tick = point estimate. Grey threshold line = gnomAD constraint cutoff for that variant class.
LoF OE?Ratio of observed to expected LoF variants. Upper CI bound (LOEUF) ≤ 0.35 = strong LoF constraint signal.0.05 (0.020.10)
00.351.4
Missense OE?Ratio of observed to expected missense variants. OE ≤ 0.6 = fewer missense variants than expected by chance.0.75 (0.710.79)
00.61.4
Synonymous OE?Control metric — synonymous variants are largely neutral and expected near OE = 1.0. Significant deviation may indicate annotation issues.1.00
01.21.6
LoF obs/exp: 5 / 110.6Missense obs/exp: 1065 / 1427.7Syn Z: 0.08

ClinVar Variant Classifications

568 submitted variants in ClinVar

Classification Summary

Pathogenic65
Likely Pathogenic40
VUS259
Likely Benign171
Benign21
Conflicting12
65
Pathogenic
40
Likely Pathogenic
259
VUS
171
Likely Benign
21
Benign
12
Conflicting

Curated Variants Distribution

Classified variants from ClinVar · 5 ACMG categories

ClassificationLoFMissense + InframeNon-codingSynonymousTotal
Pathogenic
33
10
22
0
65
Likely Pathogenic
16
14
10
0
40
VUS
3
238
16
2
259
Likely Benign
0
52
26
93
171
Benign
0
16
1
4
21
Conflicting
12
Total523307599568

LoF = frameshift, stop gained/lost, canonical splice · Counts from ClinVar esearch · Updated hourly

View in ClinVar →

Protein Context — Lollipop Plot

NSD1 · protein map & ClinVar variants

Showing all ClinVar variants across the protein. Search a specific variant to highlight its position.

Gene2Phenotype Curations

NSD1-related Sotos syndrome

definitive
ADLoss Of FunctionAbsent Gene Product
Dev. DisordersSkinSkeletal
G2P ↗

NSD1-related Beckwith-Wiedemann syndrome

definitive
ADUndeterminedUncertain
Skin
G2P ↗

Gene2Phenotype curations · DECIPHER consortium patient cohort (public variants) · deciphergenomics.org

OMIM — Genotype-Phenotype Relationships

1 OMIM entry

Sotos syndrome

MIM #117550

Molecular basis of disorder known

Autosomal dominant
📖
GeneReview available — NSD1
Authoritative clinical overview · NCBI Bookshelf · Recommended first read
Open GeneReview ↗
Clinical Literature
Landmark / reviewRecent case evidence