Article Preview
TopIntroduction
In DNA computing, the double-helix structure of DNA and the complementary base pairing rules are used to encode information, the calculated objects are mapped into DNA molecular chains, various data pools are generated through the action of biological enzymes, and then according to certain rules, the original data computations of the problem map highly parallel to the controlled biochemical reaction process of the DNA molecule chain, and then the molecular biotechnology is used to detect the required computational results.
DNA calculations include DNA computations in molecular (Takahashi, Yaegashi, Asanuma et al., 2005), DNA computations between molecules (Adleman, 1994), and super-molecular DNA computations (Zhang, Zhao et al., 2007). Intra-molecular DNA calculations use a single DNA molecule to construct a programmable state machine, intra-molecular morphology is used to transfer operations, inter-molecular DNA computations focus on the hybridization reactions between different DNA molecules, it is an essential step in computations (Adleman, 1994). Super-molecular DNA calculations are calculated by using the self-assembly process of different sequences of original DNA molecules.
When exchanging between chromatids, an abnormal base pair is corrected between hybrid DNAs, a gene becomes its allele, and a gene irregularity occurs, which is called gene conversion.
From different dimensions, the same genetic mutation can have many different expressions. For example, differences in reference sequences and different levels of expression (DNA, RNA or protein levels) can lead to differences in the way in which mutations are expressed. Common reference sequences include genomic reference sequences (represented by the prefix “g.”), cDNA reference sequences (represented by the prefix “c.”), non-coding DNA reference sequences (represented by the prefix “n.”), RNA references Sequence (represented by the prefix “r.”), protein reference sequence (represented by the prefix “p.”) (Horaitis & Cotton, 2004).
The choice of reference sequence is very important. When describing mutations at the DNA level, the relationship between introns and adjacent exons is often very important for clinical studies. In order to better elucidate intron variation, cDNA is usually selected as a reference sequence because of cDNA. As a reference sequence, the relationship between the mutated base in the intron and the adjacent exon can be better described. In addition, genetic mutations are often described as changes in protein levels (Ogino, Gulley et al., 2007).
The mutant expression pattern of cDNA as a reference sequence has substitutions, deletions, deletion insertions, and repeats. In order to better understand the expression of base mutations in introns, the positions of the bases in the DNA sequence are shown in Figure 1.
Figure 1. Nucleotide coding schematic
As can be seen in the Figure 1, the number of exon sequences is continuous from the start of the Gene code to the termination, while the coding of the 5' untranslated region, the 3' untranslated region, and the intron region are closely related to the encoding of exon sequences.