Tandem repeat: Difference between revisions

From Wikipedia, the free encyclopedia
Jump to navigation Jump to search
imported>Vycl1994
 
imported>OAbot
m Open access bot: pmc updated in citation with #oabot.
 
(One intermediate revision by one other user not shown)
Line 6: Line 6:
Several [[protein domain]]s also form [[Protein tandem repeats|tandem repeats]] within their amino acid [[primary structure]], such as [[armadillo repeat]]s. However, in proteins, perfect tandem repeats are rare in naturally proteins, but they have been added to designed proteins.<ref>{{cite journal |vauthors=Jorda J, Xue B, Uversky VN, Kajava AV |title=Protein tandem repeats - the more perfect, the less structured |journal=The FEBS Journal |volume=277 |issue=12 |pages=2673–82 | date=June 2010 |pmid=20553501 |pmc=2928880 |doi=10.1111/j.1742-4658.2010.07684.x}}</ref>
Several [[protein domain]]s also form [[Protein tandem repeats|tandem repeats]] within their amino acid [[primary structure]], such as [[armadillo repeat]]s. However, in proteins, perfect tandem repeats are rare in naturally proteins, but they have been added to designed proteins.<ref>{{cite journal |vauthors=Jorda J, Xue B, Uversky VN, Kajava AV |title=Protein tandem repeats - the more perfect, the less structured |journal=The FEBS Journal |volume=277 |issue=12 |pages=2673–82 | date=June 2010 |pmid=20553501 |pmc=2928880 |doi=10.1111/j.1742-4658.2010.07684.x}}</ref>


Tandem repeats constitute about 8% of the [[human genome]].<ref>{{cite journal | vauthors = Duitama J, Zablotskaya A, Gemayel R, Jansen A, Belet S, Vermeesch JR, Verstrepen KJ, Froyen G | title = Large-scale analysis of tandem repeat variability in the human genome | journal = Nucleic Acids Research | volume = 42 | issue = 9 | pages = 5728–5741 | date = May 2014 | pmid = 24682812 | pmc = 4027155 | doi = 10.1093/nar/gku212 }}</ref> They are implicated in more than 50 lethal human [[disease]]s, including [[ALS|amyotrophic lateral sclerosis]], [[Huntington's disease]], and several [[cancer]]s.<ref>{{Cite journal |last1=Cui |first1=Ya |last2=Ye |first2=Wenbin |last3=Li |first3=Jason Sheng |last4=Li |first4=Jingyi Jessica |last5=Vilain |first5=Eric |last6=Sallam |first6=Tamer |last7=Li |first7=Wei |date=April 2024 |title=A genome-wide spectrum of tandem repeat expansions in 338,963 humans |journal=Cell |volume=187 |issue=9 |pages=2336–2341.e5 |doi=10.1016/j.cell.2024.03.004 |issn=0092-8674|doi-access=free |pmid=38582080 |pmc=11065452 }}</ref>
Tandem repeats constitute about 8% of the [[human genome]].<ref>{{cite journal | vauthors = Duitama J, Zablotskaya A, Gemayel R, Jansen A, Belet S, Vermeesch JR, Verstrepen KJ, Froyen G | title = Large-scale analysis of tandem repeat variability in the human genome | journal = Nucleic Acids Research | volume = 42 | issue = 9 | pages = 5728–5741 | date = May 2014 | pmid = 24682812 | pmc = 4027155 | doi = 10.1093/nar/gku212 }}</ref> They are implicated in more than 50 lethal human [[disease]]s, including [[ALS|amyotrophic lateral sclerosis]], [[Huntington's disease]], and several [[cancer]]s.<ref>{{Cite journal |last1=Cui |first1=Ya |last2=Ye |first2=Wenbin |last3=Li |first3=Jason Sheng |last4=Li |first4=Jingyi Jessica |author-link4 = Jingyi Jessica Li|last5=Vilain |first5=Eric |last6=Sallam |first6=Tamer |last7=Li |first7=Wei |date=April 2024 |title=A genome-wide spectrum of tandem repeat expansions in 338,963 humans |journal=Cell |volume=187 |issue=9 |pages=2336–2341.e5 |doi=10.1016/j.cell.2024.03.004 |issn=0092-8674|doi-access=free |pmid=38582080 |pmc=11065452 }}</ref>


==Terminology==
==Terminology==
Line 28: Line 28:


==Uses==
==Uses==
=== Inheritance signal ===
Tandem repeat describes a pattern that helps determine an individual's inherited traits.
Tandem repeat describes a pattern that helps determine an individual's inherited traits.


Line 34: Line 36:
Polymorphic tandem repeats (alias VNTRs) are also present in microorganisms and can be used to trace the origin of an outbreak. The corresponding assay in which a collection of VNTRs is typed to characterize a strain is most often called [[MLVA]] (Multiple Loci VNTR Analysis). Using tandem repeat polymorphism, recombination has been reported in the natural transmission of [[monkeypox]] ([[mpox]]) virus genome during 2022 pandemic.<ref>{{cite journal |last1=Yeh |first1=Ting-Yu |last2=Hsieh |first2=Zih-Yu |last3=Feehley |first3=Michael C. |last4=Feehley |first4=Patrick J. |last5=Contreras |first5=Gregory P. |last6=Su |first6=Ying-Chieh |last7=Hsieh |first7=Shang-Lin |last8=Lewis |first8=Dylan A. |title=Recombination shapes the 2022 monkeypox (mpox) outbreak |journal=Med |date=9 December 2022 |volume=3 |issue=12 |pages=824–826 |doi=10.1016/j.medj.2022.11.003 |pmid=36495863 |pmc=9733179 |language=en |issn=2666-6359}}</ref>
Polymorphic tandem repeats (alias VNTRs) are also present in microorganisms and can be used to trace the origin of an outbreak. The corresponding assay in which a collection of VNTRs is typed to characterize a strain is most often called [[MLVA]] (Multiple Loci VNTR Analysis). Using tandem repeat polymorphism, recombination has been reported in the natural transmission of [[monkeypox]] ([[mpox]]) virus genome during 2022 pandemic.<ref>{{cite journal |last1=Yeh |first1=Ting-Yu |last2=Hsieh |first2=Zih-Yu |last3=Feehley |first3=Michael C. |last4=Feehley |first4=Patrick J. |last5=Contreras |first5=Gregory P. |last6=Su |first6=Ying-Chieh |last7=Hsieh |first7=Shang-Lin |last8=Lewis |first8=Dylan A. |title=Recombination shapes the 2022 monkeypox (mpox) outbreak |journal=Med |date=9 December 2022 |volume=3 |issue=12 |pages=824–826 |doi=10.1016/j.medj.2022.11.003 |pmid=36495863 |pmc=9733179 |language=en |issn=2666-6359}}</ref>


=== Domesticated living beings ===
Studies in 2004 linked the unusual genetic plasticity of [[dog]]s to mutations in tandem repeats.<ref>{{cite journal |author=Pennisi E |author-link=Elizabeth Pennisi |title=Genetics. A ruff theory of evolution: gene stutters drive dog shape |journal=Science |volume=306 |issue=5705 |pages=2172 | date=December 2004 |pmid=15618495 |doi=10.1126/science.306.5705.2172|s2cid=10680162 |doi-access=free }}</ref>
== Detection and analysis ==
In the field of [[Computer Science|computer science]], tandem repeats in strings (e.g., DNA sequences) can be efficiently detected using [[suffix tree]]s or [[suffix array]]s.
In the field of [[Computer Science|computer science]], tandem repeats in strings (e.g., DNA sequences) can be efficiently detected using [[suffix tree]]s or [[suffix array]]s.


Studies in 2004 linked the unusual genetic plasticity of [[dog]]s to mutations in tandem repeats.<ref>{{cite journal |author=Pennisi E |author-link=Elizabeth Pennisi |title=Genetics. A ruff theory of evolution: gene stutters drive dog shape |journal=Science |volume=306 |issue=5705 |pages=2172 | date=December 2004 |pmid=15618495 |doi=10.1126/science.306.5705.2172|s2cid=10680162 |doi-access=free }}</ref>
Tandem repeats can themselves be divided into monomers and higher-order repeats. HORmon was developed for this purpose.<ref>{{cite journal |last1=Kunyavskaya |first1=Olga |last2=Dvorkina |first2=Tatiana |last3=Bzikadze |first3=Andrey V. |last4=Alexandrov |first4=Ivan A. |last5=Pevzner |first5=Pavel A. |title=Automated annotation of human centromeres with HORmon |journal=Genome Research |date=June 2022 |volume=32 |issue=6 |pages=1137–1151 |doi=10.1101/gr.276362.121|pmc=9248890 }}</ref>
 
The alignment of tandem repeats using a typical algorithm such as [[Smith-Waterman algorithm|Smith-Waterman]] tends to give biologically implausible results: these algorithms are unaware of the relatively high frequency of copy number changes at repeat sites and incorrectly penalize them as gaps. A more proper way to align tandem repeats from different individuals is to anchor the alignment on rare substrings.<ref>{{cite journal |last1=Bzikadze |first1=Andrey V. |last2=Pevzner |first2=Pavel A. |title=UniAligner: a parameter-free framework for fast sequence alignment |journal=Nature Methods |date=September 2023 |volume=20 |issue=9 |pages=1346–1354 |doi=10.1038/s41592-023-01970-4|biorxiv=10.1101/2022.09.15.507041}}</ref>


Nested tandem repeats are described as repeating unit lengths that are variable or unknown and frequently include an asymmetric hierarchy of smaller repeating units. These repeats are constructed from distinct groups of homologous-length monomers. An algorithm known as NTRprism was created by Oxford Nanopore Technologies researchers to enable for the annotation of repetitive structures in built satellite DNA arrays. The algorithm NTRprism is developed to find and display the satellite repeating periodicity.<ref>{{Cite journal |last1=Altemose |first1=Nicolas |last2=Logsdon |first2=Glennis A. |last3=Bzikadze |first3=Andrey V. |last4=Sidhwani |first4=Pragya |last5=Langley |first5=Sasha A. |last6=Caldas |first6=Gina V. |last7=Hoyt |first7=Savannah J. |last8=Uralsky |first8=Lev |last9=Ryabov |first9=Fedor D. |last10=Shew |first10=Colin J. |last11=Sauria |first11=Michael E. G. |last12=Borchers |first12=Matthew |last13=Gershman |first13=Ariel |last14=Mikheenko |first14=Alla |last15=Shepelev |first15=Valery A. |date=April 2022 |title=Complete genomic and epigenetic maps of human centromeres |journal=Science |language=en |volume=376 |issue=6588 |pages=eabl4178 |doi=10.1126/science.abl4178 |issn=0036-8075 |pmc=9233505 |pmid=35357911}}</ref>
Nested tandem repeats are described as repeating unit lengths that are variable or unknown and frequently include an asymmetric hierarchy of smaller repeating units. These repeats are constructed from distinct groups of homologous-length monomers. An algorithm known as NTRprism was created by Oxford Nanopore Technologies researchers to enable for the annotation of repetitive structures in built satellite DNA arrays. The algorithm NTRprism is developed to find and display the satellite repeating periodicity.<ref>{{Cite journal |last1=Altemose |first1=Nicolas |last2=Logsdon |first2=Glennis A. |last3=Bzikadze |first3=Andrey V. |last4=Sidhwani |first4=Pragya |last5=Langley |first5=Sasha A. |last6=Caldas |first6=Gina V. |last7=Hoyt |first7=Savannah J. |last8=Uralsky |first8=Lev |last9=Ryabov |first9=Fedor D. |last10=Shew |first10=Colin J. |last11=Sauria |first11=Michael E. G. |last12=Borchers |first12=Matthew |last13=Gershman |first13=Ariel |last14=Mikheenko |first14=Alla |last15=Shepelev |first15=Valery A. |date=April 2022 |title=Complete genomic and epigenetic maps of human centromeres |journal=Science |language=en |volume=376 |issue=6588 |pages=eabl4178 |doi=10.1126/science.abl4178 |issn=0036-8075 |pmc=9233505 |pmid=35357911}}</ref>

Latest revision as of 11:59, 24 June 2025

Template:Short description Script error: No such module "Distinguish".

In genetics, tandem repeats occur in DNA when a pattern of one or more nucleotides is repeated and the repetitions are directly adjacent to each other, e.g. ATTCG ATTCG ATTCG, in which the sequence ATTCG is repeated three times.[1]

Several protein domains also form tandem repeats within their amino acid primary structure, such as armadillo repeats. However, in proteins, perfect tandem repeats are rare in naturally proteins, but they have been added to designed proteins.[2]

Tandem repeats constitute about 8% of the human genome.[3] They are implicated in more than 50 lethal human diseases, including amyotrophic lateral sclerosis, Huntington's disease, and several cancers.[4]

Terminology

All tandem repeat arrays are classifiable as satellite DNA, a name originating from the fact that tandem DNA repeats, by nature of repeating the same nucleotide sequences repeatedly, have a unique ratio of the two possible nucleotide base pair combinations, conferring them a specific mass density that allows them to be separated from the rest of the genome with density-based laboratory techniques, thus appearing as "satellite bands". Albeit, a tandem repeat array could not show up as a satellite band if it had a nucleotide composition close to the average of the genome.[5]

When exactly two nucleotides are repeated, it is called a dinucleotide repeat (for example: ACACACAC...). The microsatellite instability in hereditary nonpolyposis colon cancer most commonly affects such regions.[6]

When three nucleotides are repeated, it is called a trinucleotide repeat (for example: CAGCAGCAGCAG...), and abnormalities in such regions can give rise to trinucleotide repeat disorders.

When between 10 and 60 nucleotides are repeated, it is called a minisatellite. Those with fewer are known as microsatellites or short tandem repeats.

When much larger lengths of nucleotides are repeated, on the order of 1,000 nucleotides, it is called a macrosatellite.

When the repeat unit copy number is variable in the population being considered, it is called a variable number tandem repeat (VNTR). MeSH classifies variable number tandem repeats under minisatellites.[7]

Mechanism

Tandem repeats can occur through different mechanisms. For example, slipped strand mispairing, (also known as replication slippage), is a mutation process which occurs during DNA replication. It involves denaturation and displacement of the DNA strands, resulting in mispairing of the complementary bases. Slipped strand mispairing is one explanation for the origin and evolution of repetitive DNA sequences.

Other mechanisms include unequal crossover and gene conversion.

Uses

Inheritance signal

Tandem repeat describes a pattern that helps determine an individual's inherited traits.

Tandem repeats can be very useful in determining parentage. Short tandem repeats are used for certain genealogical DNA tests. DNA is examined from microsatellites within the chromosomal DNA. Parentage can be determined through the similarity in these regions.

Polymorphic tandem repeats (alias VNTRs) are also present in microorganisms and can be used to trace the origin of an outbreak. The corresponding assay in which a collection of VNTRs is typed to characterize a strain is most often called MLVA (Multiple Loci VNTR Analysis). Using tandem repeat polymorphism, recombination has been reported in the natural transmission of monkeypox (mpox) virus genome during 2022 pandemic.[8]

Domesticated living beings

Studies in 2004 linked the unusual genetic plasticity of dogs to mutations in tandem repeats.[9]

Detection and analysis

In the field of computer science, tandem repeats in strings (e.g., DNA sequences) can be efficiently detected using suffix trees or suffix arrays.

Tandem repeats can themselves be divided into monomers and higher-order repeats. HORmon was developed for this purpose.[10]

The alignment of tandem repeats using a typical algorithm such as Smith-Waterman tends to give biologically implausible results: these algorithms are unaware of the relatively high frequency of copy number changes at repeat sites and incorrectly penalize them as gaps. A more proper way to align tandem repeats from different individuals is to anchor the alignment on rare substrings.[11]

Nested tandem repeats are described as repeating unit lengths that are variable or unknown and frequently include an asymmetric hierarchy of smaller repeating units. These repeats are constructed from distinct groups of homologous-length monomers. An algorithm known as NTRprism was created by Oxford Nanopore Technologies researchers to enable for the annotation of repetitive structures in built satellite DNA arrays. The algorithm NTRprism is developed to find and display the satellite repeating periodicity.[12]

Biotechnology

Kang. et al. successfully in vitro amplified up to 5kb of a sequence containing 36 identical 99bp tandem repeats and a 561bp sequence with 91% AT content using SHARP, which utilizes engineered superhelicases with enhanced processivity and speed.[13] SHARP combines single-stranded DNA binding protein (SSB) and superhelicases with standard PCR reagents to achieve isothermal amplification that mimics biological DNA replication. The method operates at a constant temperature, eliminating the need for thermal cycling, and has shown particular utility in cases where traditional PCR either fails to amplify target sequences or produces unwanted side products.

See also

References

Template:Reflist

External links

Template:Repeated sequence Template:Self-replicating organic structures Template:Organisms et al.

de:Tandemwiederholung

  1. Template:MeshName
  2. Script error: No such module "Citation/CS1".
  3. Script error: No such module "Citation/CS1".
  4. Script error: No such module "Citation/CS1".
  5. Script error: No such module "citation/CS1".
  6. Script error: No such module "Citation/CS1".
  7. Template:MeshName
  8. Script error: No such module "Citation/CS1".
  9. Script error: No such module "Citation/CS1".
  10. Script error: No such module "Citation/CS1".
  11. Script error: No such module "Citation/CS1".
  12. Script error: No such module "Citation/CS1".
  13. Script error: No such module "Citation/CS1".