MSH1 cDNA ORF clone, Amborella trichopoda

The following MSH1 cDNA ORF clone sequences were retrieved from the NCBI Reference Sequence Database (RefSeq). Each sequence represents the protein-coding region of an MSH1 transcript, that is, its open reading frame (ORF). ORF sequences can be delivered in our standard vector, pcDNA3.1+/C-(K)DYK, or in the vector of your choice, as expression/transfection-ready ORF clones.

***CloneID | Accession No. | Definition | **Vector | *Turnaround time | Price (USD)
OAm35126 | XM_006854264.3 (latest version) | Amborella trichopoda DNA mismatch repair protein MSH1, mitochondrial (MSH1), transcript variant X1, mRNA. | pcDNA3.1-C-(k)DYK or customized vector | 19-21 | $713.30 (list price $1019.00)
OAm35127 | XM_020673531.1 (latest version) | Amborella trichopoda DNA mismatch repair protein MSH1, mitochondrial (MSH1), transcript variant X2, mRNA. | pcDNA3.1-C-(k)DYK or customized vector | 19-21 | $713.30 (list price $1019.00)
OAm35128 | XM_020673532.1 (latest version) | Amborella trichopoda DNA mismatch repair protein MSH1, mitochondrial (MSH1), transcript variant X3, mRNA. | pcDNA3.1-C-(k)DYK or customized vector | 19-21 | $629.30 (list price $899.00)
OAm35128 | XM_020673533.1 (latest version) | Amborella trichopoda DNA mismatch repair protein MSH1, mitochondrial (MSH1), transcript variant X4, mRNA. | pcDNA3.1-C-(k)DYK or customized vector | 19-21 | $629.30 (list price $899.00)
OAm35128 | XM_020673534.1 (latest version) | Amborella trichopoda DNA mismatch repair protein MSH1, mitochondrial (MSH1), transcript variant X5, mRNA. | pcDNA3.1-C-(k)DYK or customized vector | 19-21 | $629.30 (list price $899.00)

ORF Online Only Promotion

Next-day Shipping ORF Clones (in default vector with tag)
1 clone: 30% OFF
2-4 clones: 40% OFF
5 or more clones: 50% OFF
All Other ORF Clones
30% OFF
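The two prices listed for each clone are consistent with the 30% discount for "All Other ORF Clones". A minimal sketch of that arithmetic follows; the function name `discounted_price` is hypothetical, and rounding to the nearest cent is an assumption about how the displayed price is produced.

```python
from decimal import Decimal, ROUND_HALF_UP


def discounted_price(list_price: str, percent_off: int) -> Decimal:
    """Apply a percentage discount to a list price and round to cents.

    Rounding half-up to two decimals is an assumption, not a documented rule.
    """
    price = Decimal(list_price)
    factor = (Decimal(100) - Decimal(percent_off)) / Decimal(100)
    return (price * factor).quantize(Decimal("0.01"), rounding=ROUND_HALF_UP)


# List prices taken from the table on this page.
print(discounted_price("1019.00", 30))
print(discounted_price("899.00", 30))
```

Using `Decimal` rather than `float` avoids binary rounding surprises when working with currency amounts.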

* Turnaround time is given in business days.

** You may select a custom vector to replace pcDNA3.1+/C-(K)DYK after the clone is added to the cart.

** GenScript guarantees 100% sequence accuracy of all synthetic DNA constructs we deliver, but we do not guarantee protein expression in your experimental system. Protein expression is influenced by many factors that may vary between experiments or laboratories. In addition, please pay attention to any signal peptide, propeptide, or transit peptide in the target ORF, which may affect the choice of vector (N- or C-terminal tag vector).

*** One clone ID may correspond to multiple accession numbers that share the same CDS.
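The one-to-many relationship between clone IDs and accession numbers can be illustrated with the rows of the table on this page. This is only a sketch: the helper name `accessions_by_clone` is hypothetical, and the pairs are typed in by hand from the listing rather than fetched from any service.

```python
from collections import defaultdict

# (clone ID, accession) pairs as listed in the table on this page.
ROWS = [
    ("OAm35126", "XM_006854264.3"),
    ("OAm35127", "XM_020673531.1"),
    ("OAm35128", "XM_020673532.1"),
    ("OAm35128", "XM_020673533.1"),
    ("OAm35128", "XM_020673534.1"),
]


def accessions_by_clone(pairs):
    """Group accession numbers under the clone ID they map to."""
    grouped = defaultdict(list)
    for clone_id, accession in pairs:
        grouped[clone_id].append(accession)
    return dict(grouped)


for clone_id, accessions in accessions_by_clone(ROWS).items():
    print(clone_id, "->", ", ".join(accessions))
```

Here OAm35128 collects three accessions (transcript variants X3, X4 and X5), which is why those three rows share one clone ID and one price.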

  • Reference Sequences (RefSeq)
    CloneID: OAm35126
    Related accessions (same CDS): XM_006854264.3
    Accession version: XM_006854264.3 (latest version)
    ORF nucleotide sequence length: 3591 bp
    Vector: pcDNA3.1-C-(k)DYK or customized vector
    Tag on pcDNA3.1+/C-(K)DYK: C-terminal DYKDDDDK tag
    ORF insert method: CloneEZ™ Seamless cloning technology
    Insert structure: linear
    Update date: 2017-04-03
    Organism: Amborella trichopoda
    Product: DNA mismatch repair protein MSH1, mitochondrial isoform X1
    Comment: MODEL REFSEQ: This record is predicted by automated computational analysis. This record is derived from a genomic sequence (NW_006499920.1) annotated using gene prediction method: Gnomon, supported by EST evidence. Also see: Documentation of NCBI's Annotation Process. On Apr 4, 2017 this sequence version replaced XM_006854264.2.
    Genome annotation data:
      Annotation Provider: NCBI
      Annotation Status: Full annotation
      Annotation Version: Amborella trichopoda Annotation Release 101
      Annotation Pipeline: NCBI eukaryotic genome annotation pipeline
      Annotation Software Version: 7.3
      Annotation Method: Best-placed RefSeq; Gnomon
      Features Annotated: Gene; mRNA; CDS; ncRNA

    ATGCATCGGT TTGCCTCCAA GTCCCTGCTA TTCTCTTTGC CCAAATGGAA AGCTCTCACT 
    GTGCTCTTTC GCCTCTCTAA TGGCCGCTTC AACGTCCCCC AAATGCCTCT ATACTGCAGT
    AGATATGGAG AGAGGACTAT TGCTTTCAGG GCACAAAGGC TTCAAAAGAG CATAATTAGA
    GCAACCAAAA GATCCAAAGC TTCTAAGTCT TTTCTTCAAG GAGAAGATCA TGCACATATA
    ATGTGGTGGC AAGAGAGAAT GGACAAGTGC AAAAAGCCTT CATGTGTTCA ACTTGTCAAA
    AGACTTAAAT ATTCTAATTT GCTTGGGCTA GATGAAAGTT TAAGAAGTGG GAGTTTGAAG
    GAAGGCACAC TCAATTGCGA GTTATTGCAA GTTAAATCAA AATTTCCCCG TGAAGTTCTA
    GTCTGCAGGG TTGGAGAATT TTATGAAGCT GTTGGCTTTG ATGCATGCAT CCTTGTTGAA
    CATGCTGGGT TGAACCCTAT GAGTGGTTTG AGATCAGACA CTGTCCCAAG AGCTGGTTGT
    CCTGTCATGA ACTTGCGGCA AACTTTGGAT GACTTGACTC GAAGTGGATA TTCTGTTTGT
    ATAGTAGAGG AAGTGCAGGG CCCAACTCAA GCTCGATCGC GAAAAGGTCG CTTCATATCA
    GGGCATGCAC ATCCTGGAAG CCCCTATGTA TATGGACTTG CAGGAGCCGA CCTTGAAGTT
    GATTTTCCTG AGCCAGTGCC TGTAGTTGGT GTATCACATT CAGCAAAAGG TTATTGCTTG
    ATATCAGTTA TTGAGACTCT GAAAACATAT TCCATAGAAG ATGGGCTAAC TGAAGAGGCT
    ATAGTCACAA AGCTCCGTAC CCGTCCCCAC CACCATTTGT TTCTGCATAT ATCATTACGG
    CATAATTGTA CAGGGACACA TCGTTGGGGA GAATTTGGTG AAGGTGGTTT GCTATGGGGG
    GAGTGTACTG ACAAGAACTT TGAGTGGTTC GATGGCAGTC CTGCTGATGG ACTTTTGTGC
    AAGGTGAGGG ACCTTTATGG TCTTGACAAA GAAGAAATTT TTCGAAATGT CACAGTTTCT
    TCAGAAAAAA GGCCTCAGCC ATTATACCTT GGAGCTGCAA CTCAAATTGG TTGCATACCT
    ACTGAAGGAA TTCCTAGCTT GTTGAAAATT TTGCTTCCAT CAAGCTGCAC TGGTCTCCCT
    GTGATATATA TGAGGGATCT TCTACTTAAT CCTCCCATGT ATCCTACTGC ATTGTCAATT
    CAAGAATCGT GCAAGCTAAT GAGCAATATA AGTTGCCCAA TTCCTGAATT TACAGTTATT
    CCGGCAACAA AGCTTGTGAA GCTTCTTGAG TCGAAGGAAG CAAACCACAT TGAATTTTGC
    AGAATAAAAA ATGTGGTCGA CGATGTTCTG TACTTGTATG GAAAATCAGA TCTTAATCCC
    ATCCTACAAT CGCTGCTAGA TCCTACTTGG GTTGCAACTG GCTTGAAGAT CGATTGCGAG
    TCCCTGGTGA ATGAGTGCAG ATTGGTTTCA CATAGAATTG GTGAAATGAT TTCCCTACAA
    GGAGAAAGAG TTCAAGCAAC AAGTTCTGTC CCTTGTATTC CAGGAGAATT TTTTGAAGAC
    GTTGAGTCAT CATGGAAGGG TCGCGTGAAA GAAATCCATA CAGAAAATGC AATCAAAGAA
    GTAGAGAGAG CGGCAGAATC CTTGTCTTTG GCTGTTACTG AAGACTTCCT TCCAATTGTT
    CAGAGAGTAA AAGCTGTCAT GTCTCCACAT GGTGGTCCTA GGGGTGATAT AACCTATGCC
    AAAGAGCATG AAGCTGTTTG GTTTAAAGGA AAGCGTTTTG TACCATCTGT TTGGGGTGGT
    ACTCCTGGTG AAGAACAAAT TAAGAGACTA ATACCTGCTA CAGATTCTAA AGGGAAAAAT
    GTTGGAGATG AATGGTTTAC AACTGAGAAG GTGGAGGTTG CTCTGAATAG GTACCATGAG
    GCAAATGCCC GAGCCAAGGC GGTTGTCATG GATATTTTGC GAGGACTTTC TACTGAAATG
    CAAATTAAGA TAAATGTTCT TGTCTACTCA TCCATGTTGC TGGTTATTGC AAAGGCATTA
    TTTGCTCATG TCAGTGAAGG AAGGAGGAGG AAATGGGTCT TTCCCACATT AAAGGAGTTC
    GATAGATCTG CAGGCAGCAC TTTGTCATGG GAAAATGACT ATATGGAGAT TGTGGGATTA
    TCACCATATT GGTTTGATGC TGCTCAAGGA AATGCTATAC AAAATACTGT TAAAATGCAT
    TCCTTATTTA TTTTGACGGG ACCAAATGGT GGTGGCAAAT CTAGCTTGCT TCGGTCGATT
    TGTGCTGCAG CATTACTTGG AATTTGTGGG TTGATGGTTC CGGCAGTCTC TGCTGTCATT
    CCACATTTTG ATTCTATCAT GCTTCATATG AAATCCTATG ATAGTCCTGC GGATGGGAAA
    AGCTCCTTCC AGATGGAAAT GTCAGAGCTG CGGTCAATAG TTACAAGAGC CACTGCAAGG
    AGCCTTGTGC TAGTGGATGA AATTTGCAGG GGAACGGAAA TGTGGAAAGG CACCTGCATT
    GCTGGAAGCA TATTGGAAAC TCTCGATAAT ATTGGTTGCC TGGGCATTGT ATCAACCCAC
    ATCCATGCCC TCTTCAATTT ACCACTAGCG ACAAAGAACA TTGTTTCTAA AGCAATGGGA
    ACGGAGAAAG TGAATGGTCG AACAAAACCG ACATGGAAGT TAATAGACGG AATTTGTAGA
    GAAAGCCTTG CATTTGAGAC GGCTCAAAAT GAAGGAATCC CTGAAGCTAT TATCAGAAGA
    GCTGAGGAGT TGTATCTCAC TATCAAAGAT GCTCAAACAG ATTTAGAAAA GATTGATGGT
    GCAAAGGGGC ATCGAACTCA TCAGTCTGGA ATTACTTGTT TGTATGATCG TAGTGATTGT
    TTGAGACCTG AAGGTAATGA ATCTAGTTCT TTTGAAAATG TGTCAATTAA GGGAACTGAT
    ATTGATGCTT TAACTAGTAA TGGACTATCC AAATATGAAG ATGATCATGC CCCACATGAT
    TCTATGAAAT TCCGAAGATT ATGTGTCACA AATGAAGGCT GTGGAGAAAG TTCAACGAGT
    TCCTCACATG TGATGTCTCT GTCTGCGGAG AGGCAGAGAT TATTGCAGAA TGCTGGAAGA
    GCTGTGACCA TCATTTGTCA GAGAAAGTTG AACGAACTTT ACGTGCAGAA ATGTATAGCT
    GCACTTGCAG AGATCTCTTG TGTCACTGTT GATTCTAAAG AACAACCTCC TCCCTCGACA
    ATAGGCACTT CAAGTGTATA TGTGTTAGTC AGGCCTGATT TGAAGTTGTA TGTTGGACAG
    ACAGATGACC TTATTGGTCG CGTCCGTACA CACCGTTCAT CAGAAGGCAT GCAGGATGTG
    CCTTTTCTTT ATGTTGTGGT CCCAGGGAGG AGTGTGGCTT GCCTATTAGA AACTCTGCTC
    ATTAACCAGC TTCCCCTTCA AGGATTTCAT CTTTCTAACA AAGCTGATGG TAAGCACCGC
    AACTTTGGCA CGTGTCATCT CACATTGGAA GGTTCAGCGT TGGCTCAATG A

    The stop codon will be removed if the pcDNA3.1+/C-(K)DYK vector is selected, so that translation can continue into the C-terminal DYKDDDDK tag.
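A minimal sketch of what removing the stop codon means for a sequence printed in 10-base blocks like the listing above. The helper names `join_blocks` and `strip_terminal_stop` are hypothetical and do not describe the vendor's actual cloning workflow; the input is the final 21 bases of the 3591 bp ORF, which fall on a codon boundary.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def join_blocks(text: str) -> str:
    """Collapse a listing printed in space-separated blocks into one string."""
    return "".join(text.split()).upper()


def strip_terminal_stop(orf: str) -> str:
    """Drop a trailing stop codon so a C-terminal tag can be read through."""
    if len(orf) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return orf[:-3] if orf[-3:] in STOP_CODONS else orf


# Last three blocks of the ORF listing above (in frame, ends with TGA).
tail = join_blocks("GGTTCAGCGT TGGCTCAATG A")
print(tail)
print(strip_terminal_stop(tail))
```

The length check guards against applying the function to a fragment that is out of frame, where the last three bases would not be a real codon.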

    RefSeq protein: XP_006854326.1
    CDS: 322..3912
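The CDS coordinates are 1-based and inclusive, so the ORF length is end minus start plus one. A short sketch that checks the coordinates on this page against the stated ORF lengths; the function name `cds_length` is hypothetical, and it only handles a simple `start..end` location, not joins or complements.

```python
def cds_length(location: str) -> int:
    """Length of a simple 1-based, inclusive 'start..end' location."""
    start, end = (int(part) for part in location.split(".."))
    return end - start + 1


# Coordinates and stated ORF lengths as given on this page.
for location, stated in [("322..3912", 3591), ("322..3903", 3582), ("217..3309", 3093)]:
    length = cds_length(location)
    print(location, length, length == stated, length % 3 == 0)
```

Each length is also a multiple of 3, as expected for a complete reading frame that includes its stop codon.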

    Target ORF information:

    RefSeq version: XM_006854264.3
    Organism: Amborella trichopoda
    Definition: Amborella trichopoda DNA mismatch repair protein MSH1, mitochondrial (MSH1), transcript variant X1, mRNA.

    Target ORF information:

    Epitope: DYKDDDDK
    Bacterial selection: AMPR
    Mammalian selection: NeoR
    Vector: pcDNA3.1+/C-(K)DYK
    XM_006854264.3

    CloneID: OAm35127
    Related accessions (same CDS): XM_020673531.1
    Accession version: XM_020673531.1 (latest version)
    ORF nucleotide sequence length: 3582 bp
    Vector: pcDNA3.1-C-(k)DYK or customized vector
    Tag on pcDNA3.1+/C-(K)DYK: C-terminal DYKDDDDK tag
    ORF insert method: CloneEZ™ Seamless cloning technology
    Insert structure: linear
    Update date: 2017-04-03
    Organism: Amborella trichopoda
    Product: DNA mismatch repair protein MSH1, mitochondrial isoform X2
    Comment: MODEL REFSEQ: This record is predicted by automated computational analysis. This record is derived from a genomic sequence (NW_006499920.1) annotated using gene prediction method: Gnomon, supported by EST evidence. Also see: Documentation of NCBI's Annotation Process.
    Genome annotation data:
      Annotation Provider: NCBI
      Annotation Status: Full annotation
      Annotation Version: Amborella trichopoda Annotation Release 101
      Annotation Pipeline: NCBI eukaryotic genome annotation pipeline
      Annotation Software Version: 7.3
      Annotation Method: Best-placed RefSeq; Gnomon
      Features Annotated: Gene; mRNA; CDS; ncRNA

    ATGCATCGGT TTGCCTCCAA GTCCCTGCTA TTCTCTTTGC CCAAATGGAA AGCTCTCACT 
    GTGCTCTTTC GCCTCTCTAA TGGCCGCTTC AACGTCCCCC AAATGCCTCT TAGATATGGA
    GAGAGGACTA TTGCTTTCAG GGCACAAAGG CTTCAAAAGA GCATAATTAG AGCAACCAAA
    AGATCCAAAG CTTCTAAGTC TTTTCTTCAA GGAGAAGATC ATGCACATAT AATGTGGTGG
    CAAGAGAGAA TGGACAAGTG CAAAAAGCCT TCATGTGTTC AACTTGTCAA AAGACTTAAA
    TATTCTAATT TGCTTGGGCT AGATGAAAGT TTAAGAAGTG GGAGTTTGAA GGAAGGCACA
    CTCAATTGCG AGTTATTGCA AGTTAAATCA AAATTTCCCC GTGAAGTTCT AGTCTGCAGG
    GTTGGAGAAT TTTATGAAGC TGTTGGCTTT GATGCATGCA TCCTTGTTGA ACATGCTGGG
    TTGAACCCTA TGAGTGGTTT GAGATCAGAC ACTGTCCCAA GAGCTGGTTG TCCTGTCATG
    AACTTGCGGC AAACTTTGGA TGACTTGACT CGAAGTGGAT ATTCTGTTTG TATAGTAGAG
    GAAGTGCAGG GCCCAACTCA AGCTCGATCG CGAAAAGGTC GCTTCATATC AGGGCATGCA
    CATCCTGGAA GCCCCTATGT ATATGGACTT GCAGGAGCCG ACCTTGAAGT TGATTTTCCT
    GAGCCAGTGC CTGTAGTTGG TGTATCACAT TCAGCAAAAG GTTATTGCTT GATATCAGTT
    ATTGAGACTC TGAAAACATA TTCCATAGAA GATGGGCTAA CTGAAGAGGC TATAGTCACA
    AAGCTCCGTA CCCGTCCCCA CCACCATTTG TTTCTGCATA TATCATTACG GCATAATTGT
    ACAGGGACAC ATCGTTGGGG AGAATTTGGT GAAGGTGGTT TGCTATGGGG GGAGTGTACT
    GACAAGAACT TTGAGTGGTT CGATGGCAGT CCTGCTGATG GACTTTTGTG CAAGGTGAGG
    GACCTTTATG GTCTTGACAA AGAAGAAATT TTTCGAAATG TCACAGTTTC TTCAGAAAAA
    AGGCCTCAGC CATTATACCT TGGAGCTGCA ACTCAAATTG GTTGCATACC TACTGAAGGA
    ATTCCTAGCT TGTTGAAAAT TTTGCTTCCA TCAAGCTGCA CTGGTCTCCC TGTGATATAT
    ATGAGGGATC TTCTACTTAA TCCTCCCATG TATCCTACTG CATTGTCAAT TCAAGAATCG
    TGCAAGCTAA TGAGCAATAT AAGTTGCCCA ATTCCTGAAT TTACAGTTAT TCCGGCAACA
    AAGCTTGTGA AGCTTCTTGA GTCGAAGGAA GCAAACCACA TTGAATTTTG CAGAATAAAA
    AATGTGGTCG ACGATGTTCT GTACTTGTAT GGAAAATCAG ATCTTAATCC CATCCTACAA
    TCGCTGCTAG ATCCTACTTG GGTTGCAACT GGCTTGAAGA TCGATTGCGA GTCCCTGGTG
    AATGAGTGCA GATTGGTTTC ACATAGAATT GGTGAAATGA TTTCCCTACA AGGAGAAAGA
    GTTCAAGCAA CAAGTTCTGT CCCTTGTATT CCAGGAGAAT TTTTTGAAGA CGTTGAGTCA
    TCATGGAAGG GTCGCGTGAA AGAAATCCAT ACAGAAAATG CAATCAAAGA AGTAGAGAGA
    GCGGCAGAAT CCTTGTCTTT GGCTGTTACT GAAGACTTCC TTCCAATTGT TCAGAGAGTA
    AAAGCTGTCA TGTCTCCACA TGGTGGTCCT AGGGGTGATA TAACCTATGC CAAAGAGCAT
    GAAGCTGTTT GGTTTAAAGG AAAGCGTTTT GTACCATCTG TTTGGGGTGG TACTCCTGGT
    GAAGAACAAA TTAAGAGACT AATACCTGCT ACAGATTCTA AAGGGAAAAA TGTTGGAGAT
    GAATGGTTTA CAACTGAGAA GGTGGAGGTT GCTCTGAATA GGTACCATGA GGCAAATGCC
    CGAGCCAAGG CGGTTGTCAT GGATATTTTG CGAGGACTTT CTACTGAAAT GCAAATTAAG
    ATAAATGTTC TTGTCTACTC ATCCATGTTG CTGGTTATTG CAAAGGCATT ATTTGCTCAT
    GTCAGTGAAG GAAGGAGGAG GAAATGGGTC TTTCCCACAT TAAAGGAGTT CGATAGATCT
    GCAGGCAGCA CTTTGTCATG GGAAAATGAC TATATGGAGA TTGTGGGATT ATCACCATAT
    TGGTTTGATG CTGCTCAAGG AAATGCTATA CAAAATACTG TTAAAATGCA TTCCTTATTT
    ATTTTGACGG GACCAAATGG TGGTGGCAAA TCTAGCTTGC TTCGGTCGAT TTGTGCTGCA
    GCATTACTTG GAATTTGTGG GTTGATGGTT CCGGCAGTCT CTGCTGTCAT TCCACATTTT
    GATTCTATCA TGCTTCATAT GAAATCCTAT GATAGTCCTG CGGATGGGAA AAGCTCCTTC
    CAGATGGAAA TGTCAGAGCT GCGGTCAATA GTTACAAGAG CCACTGCAAG GAGCCTTGTG
    CTAGTGGATG AAATTTGCAG GGGAACGGAA ATGTGGAAAG GCACCTGCAT TGCTGGAAGC
    ATATTGGAAA CTCTCGATAA TATTGGTTGC CTGGGCATTG TATCAACCCA CATCCATGCC
    CTCTTCAATT TACCACTAGC GACAAAGAAC ATTGTTTCTA AAGCAATGGG AACGGAGAAA
    GTGAATGGTC GAACAAAACC GACATGGAAG TTAATAGACG GAATTTGTAG AGAAAGCCTT
    GCATTTGAGA CGGCTCAAAA TGAAGGAATC CCTGAAGCTA TTATCAGAAG AGCTGAGGAG
    TTGTATCTCA CTATCAAAGA TGCTCAAACA GATTTAGAAA AGATTGATGG TGCAAAGGGG
    CATCGAACTC ATCAGTCTGG AATTACTTGT TTGTATGATC GTAGTGATTG TTTGAGACCT
    GAAGGTAATG AATCTAGTTC TTTTGAAAAT GTGTCAATTA AGGGAACTGA TATTGATGCT
    TTAACTAGTA ATGGACTATC CAAATATGAA GATGATCATG CCCCACATGA TTCTATGAAA
    TTCCGAAGAT TATGTGTCAC AAATGAAGGC TGTGGAGAAA GTTCAACGAG TTCCTCACAT
    GTGATGTCTC TGTCTGCGGA GAGGCAGAGA TTATTGCAGA ATGCTGGAAG AGCTGTGACC
    ATCATTTGTC AGAGAAAGTT GAACGAACTT TACGTGCAGA AATGTATAGC TGCACTTGCA
    GAGATCTCTT GTGTCACTGT TGATTCTAAA GAACAACCTC CTCCCTCGAC AATAGGCACT
    TCAAGTGTAT ATGTGTTAGT CAGGCCTGAT TTGAAGTTGT ATGTTGGACA GACAGATGAC
    CTTATTGGTC GCGTCCGTAC ACACCGTTCA TCAGAAGGCA TGCAGGATGT GCCTTTTCTT
    TATGTTGTGG TCCCAGGGAG GAGTGTGGCT TGCCTATTAG AAACTCTGCT CATTAACCAG
    CTTCCCCTTC AAGGATTTCA TCTTTCTAAC AAAGCTGATG GTAAGCACCG CAACTTTGGC
    ACGTGTCATC TCACATTGGA AGGTTCAGCG TTGGCTCAAT GA

    The stop codon will be removed if the pcDNA3.1+/C-(K)DYK vector is selected, so that translation can continue into the C-terminal DYKDDDDK tag.
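The 3582 bp ORF of clone OAm35127 is 9 bp shorter than the 3591 bp ORF of clone OAm35126. One simple way to locate such a difference is to trim the shared prefix and suffix of the two sequences, as sketched below. The function name `locate_indel` is hypothetical, and the inputs are short excerpts copied from the two listings around the point where they first diverge, not the full ORFs; whether the two sequences agree everywhere else has not been checked here.

```python
def locate_indel(longer: str, shorter: str):
    """Return (prefix_length, extra_segment) for two sequences that differ
    by one contiguous insertion in `longer`.

    The result describes a clean insertion only when the shared prefix and
    suffix together cover all of `shorter`.
    """
    prefix = 0
    while prefix < len(shorter) and longer[prefix] == shorter[prefix]:
        prefix += 1
    suffix = 0
    max_suffix = len(shorter) - prefix  # never let prefix and suffix overlap
    while suffix < max_suffix and longer[-1 - suffix] == shorter[-1 - suffix]:
        suffix += 1
    return prefix, longer[prefix:len(longer) - suffix]


# Excerpts from the OAm35126 (X1) and OAm35127 (X2) ORF listings.
x1 = "AAATGCCTCTATACTGCAGTAGATATGGAG"
x2 = "AAATGCCTCTTAGATATGGAG"
prefix, extra = locate_indel(x1, x2)
print(prefix, extra, len(extra))
```

On these excerpts the extra segment is 9 bases long, which matches the length difference and is a multiple of 3, so the reading frame downstream is unchanged.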

    RefSeq protein: XP_020529190.1
    CDS: 322..3903

    Target ORF information:

    RefSeq version: XM_020673531.1
    Organism: Amborella trichopoda
    Definition: Amborella trichopoda DNA mismatch repair protein MSH1, mitochondrial (MSH1), transcript variant X2, mRNA.

    Target ORF information:

    Epitope: DYKDDDDK
    Bacterial selection: AMPR
    Mammalian selection: NeoR
    Vector: pcDNA3.1+/C-(K)DYK
    XM_020673531.1

    CloneID: OAm35128
    Related accessions (same CDS): XM_020673533.1, XM_020673534.1, XM_020673532.1
    Accession version: XM_020673533.1 (latest version)
    ORF nucleotide sequence length: 3093 bp
    Vector: pcDNA3.1-C-(k)DYK or customized vector
    Tag on pcDNA3.1+/C-(K)DYK: C-terminal DYKDDDDK tag
    ORF insert method: CloneEZ™ Seamless cloning technology
    Insert structure: linear
    Update date: 2017-04-03
    Organism: Amborella trichopoda
    Product: DNA mismatch repair protein MSH1, mitochondrial isoform X3
    Comment: MODEL REFSEQ: This record is predicted by automated computational analysis. This record is derived from a genomic sequence (NW_006499920.1) annotated using gene prediction method: Gnomon, supported by EST evidence. Also see: Documentation of NCBI's Annotation Process.
    Genome annotation data:
      Annotation Provider: NCBI
      Annotation Status: Full annotation
      Annotation Version: Amborella trichopoda Annotation Release 101
      Annotation Pipeline: NCBI eukaryotic genome annotation pipeline
      Annotation Software Version: 7.3
      Annotation Method: Best-placed RefSeq; Gnomon
      Features Annotated: Gene; mRNA; CDS; ncRNA

    ATGAGTGGTT TGAGATCAGA CACTGTCCCA AGAGCTGGTT GTCCTGTCAT GAACTTGCGG 
    CAAACTTTGG ATGACTTGAC TCGAAGTGGA TATTCTGTTT GTATAGTAGA GGAAGTGCAG
    GGCCCAACTC AAGCTCGATC GCGAAAAGGT CGCTTCATAT CAGGGCATGC ACATCCTGGA
    AGCCCCTATG TATATGGACT TGCAGGAGCC GACCTTGAAG TTGATTTTCC TGAGCCAGTG
    CCTGTAGTTG GTGTATCACA TTCAGCAAAA GGTTATTGCT TGATATCAGT TATTGAGACT
    CTGAAAACAT ATTCCATAGA AGATGGGCTA ACTGAAGAGG CTATAGTCAC AAAGCTCCGT
    ACCCGTCCCC ACCACCATTT GTTTCTGCAT ATATCATTAC GGCATAATTG TACAGGGACA
    CATCGTTGGG GAGAATTTGG TGAAGGTGGT TTGCTATGGG GGGAGTGTAC TGACAAGAAC
    TTTGAGTGGT TCGATGGCAG TCCTGCTGAT GGACTTTTGT GCAAGGTGAG GGACCTTTAT
    GGTCTTGACA AAGAAGAAAT TTTTCGAAAT GTCACAGTTT CTTCAGAAAA AAGGCCTCAG
    CCATTATACC TTGGAGCTGC AACTCAAATT GGTTGCATAC CTACTGAAGG AATTCCTAGC
    TTGTTGAAAA TTTTGCTTCC ATCAAGCTGC ACTGGTCTCC CTGTGATATA TATGAGGGAT
    CTTCTACTTA ATCCTCCCAT GTATCCTACT GCATTGTCAA TTCAAGAATC GTGCAAGCTA
    ATGAGCAATA TAAGTTGCCC AATTCCTGAA TTTACAGTTA TTCCGGCAAC AAAGCTTGTG
    AAGCTTCTTG AGTCGAAGGA AGCAAACCAC ATTGAATTTT GCAGAATAAA AAATGTGGTC
    GACGATGTTC TGTACTTGTA TGGAAAATCA GATCTTAATC CCATCCTACA ATCGCTGCTA
    GATCCTACTT GGGTTGCAAC TGGCTTGAAG ATCGATTGCG AGTCCCTGGT GAATGAGTGC
    AGATTGGTTT CACATAGAAT TGGTGAAATG ATTTCCCTAC AAGGAGAAAG AGTTCAAGCA
    ACAAGTTCTG TCCCTTGTAT TCCAGGAGAA TTTTTTGAAG ACGTTGAGTC ATCATGGAAG
    GGTCGCGTGA AAGAAATCCA TACAGAAAAT GCAATCAAAG AAGTAGAGAG AGCGGCAGAA
    TCCTTGTCTT TGGCTGTTAC TGAAGACTTC CTTCCAATTG TTCAGAGAGT AAAAGCTGTC
    ATGTCTCCAC ATGGTGGTCC TAGGGGTGAT ATAACCTATG CCAAAGAGCA TGAAGCTGTT
    TGGTTTAAAG GAAAGCGTTT TGTACCATCT GTTTGGGGTG GTACTCCTGG TGAAGAACAA
    ATTAAGAGAC TAATACCTGC TACAGATTCT AAAGGGAAAA ATGTTGGAGA TGAATGGTTT
    ACAACTGAGA AGGTGGAGGT TGCTCTGAAT AGGTACCATG AGGCAAATGC CCGAGCCAAG
    GCGGTTGTCA TGGATATTTT GCGAGGACTT TCTACTGAAA TGCAAATTAA GATAAATGTT
    CTTGTCTACT CATCCATGTT GCTGGTTATT GCAAAGGCAT TATTTGCTCA TGTCAGTGAA
    GGAAGGAGGA GGAAATGGGT CTTTCCCACA TTAAAGGAGT TCGATAGATC TGCAGGCAGC
    ACTTTGTCAT GGGAAAATGA CTATATGGAG ATTGTGGGAT TATCACCATA TTGGTTTGAT
    GCTGCTCAAG GAAATGCTAT ACAAAATACT GTTAAAATGC ATTCCTTATT TATTTTGACG
    GGACCAAATG GTGGTGGCAA ATCTAGCTTG CTTCGGTCGA TTTGTGCTGC AGCATTACTT
    GGAATTTGTG GGTTGATGGT TCCGGCAGTC TCTGCTGTCA TTCCACATTT TGATTCTATC
    ATGCTTCATA TGAAATCCTA TGATAGTCCT GCGGATGGGA AAAGCTCCTT CCAGATGGAA
    ATGTCAGAGC TGCGGTCAAT AGTTACAAGA GCCACTGCAA GGAGCCTTGT GCTAGTGGAT
    GAAATTTGCA GGGGAACGGA AATGTGGAAA GGCACCTGCA TTGCTGGAAG CATATTGGAA
    ACTCTCGATA ATATTGGTTG CCTGGGCATT GTATCAACCC ACATCCATGC CCTCTTCAAT
    TTACCACTAG CGACAAAGAA CATTGTTTCT AAAGCAATGG GAACGGAGAA AGTGAATGGT
    CGAACAAAAC CGACATGGAA GTTAATAGAC GGAATTTGTA GAGAAAGCCT TGCATTTGAG
    ACGGCTCAAA ATGAAGGAAT CCCTGAAGCT ATTATCAGAA GAGCTGAGGA GTTGTATCTC
    ACTATCAAAG ATGCTCAAAC AGATTTAGAA AAGATTGATG GTGCAAAGGG GCATCGAACT
    CATCAGTCTG GAATTACTTG TTTGTATGAT CGTAGTGATT GTTTGAGACC TGAAGGTAAT
    GAATCTAGTT CTTTTGAAAA TGTGTCAATT AAGGGAACTG ATATTGATGC TTTAACTAGT
    AATGGACTAT CCAAATATGA AGATGATCAT GCCCCACATG ATTCTATGAA ATTCCGAAGA
    TTATGTGTCA CAAATGAAGG CTGTGGAGAA AGTTCAACGA GTTCCTCACA TGTGATGTCT
    CTGTCTGCGG AGAGGCAGAG ATTATTGCAG AATGCTGGAA GAGCTGTGAC CATCATTTGT
    CAGAGAAAGT TGAACGAACT TTACGTGCAG AAATGTATAG CTGCACTTGC AGAGATCTCT
    TGTGTCACTG TTGATTCTAA AGAACAACCT CCTCCCTCGA CAATAGGCAC TTCAAGTGTA
    TATGTGTTAG TCAGGCCTGA TTTGAAGTTG TATGTTGGAC AGACAGATGA CCTTATTGGT
    CGCGTCCGTA CACACCGTTC ATCAGAAGGC ATGCAGGATG TGCCTTTTCT TTATGTTGTG
    GTCCCAGGGA GGAGTGTGGC TTGCCTATTA GAAACTCTGC TCATTAACCA GCTTCCCCTT
    CAAGGATTTC ATCTTTCTAA CAAAGCTGAT GGTAAGCACC GCAACTTTGG CACGTGTCAT
    CTCACATTGG AAGGTTCAGC GTTGGCTCAA TGA

    The stop codon will be removed if the pcDNA3.1+/C-(K)DYK vector is selected, so that translation can continue into the C-terminal DYKDDDDK tag.
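The 3093 bp ORF of clone OAm35128 is 498 bp shorter than the 3591 bp ORF of clone OAm35126, and 498 is a multiple of 3. That is consistent with, though it does not by itself prove, the shorter ORF being the longer one read from a downstream in-frame start codon. A sketch of how that relationship could be tested on full sequences follows; the function name `in_frame_offset` is hypothetical and the sequences in the example are toy strings, not data from these records.

```python
def in_frame_offset(longer: str, shorter: str):
    """Return the offset at which `shorter` begins inside `longer` if it is
    an in-frame suffix of `longer`; otherwise return None."""
    if not longer.endswith(shorter):
        return None
    offset = len(longer) - len(shorter)
    return offset if offset % 3 == 0 else None


# Length difference between the two ORFs listed on this page.
difference = 3591 - 3093
print(difference, difference % 3 == 0)

# Toy example: the shorter ORF starts at the second ATG of the longer one.
print(in_frame_offset("ATGAAAATGCCCTGA", "ATGCCCTGA"))
```

A return value of None means either that the sequences differ somewhere in the shared region or that the downstream start is out of frame.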

    RefSeq protein: XP_020529192.1
    CDS: 217..3309

    Target ORF information:

    RefSeq version: XM_020673533.1
    Organism: Amborella trichopoda
    Definition: Amborella trichopoda DNA mismatch repair protein MSH1, mitochondrial (MSH1), transcript variant X4, mRNA.

    Target ORF information:

    Epitope: DYKDDDDK
    Bacterial selection: AMPR
    Mammalian selection: NeoR
    Vector: pcDNA3.1+/C-(K)DYK
    XM_020673533.1

    ORF Insert Sequence:

    ATGAGTGGTT TGAGATCAGA CACTGTCCCA AGAGCTGGTT GTCCTGTCAT GAACTTGCGG 
    CAAACTTTGG ATGACTTGAC TCGAAGTGGA TATTCTGTTT GTATAGTAGA GGAAGTGCAG
    GGCCCAACTC AAGCTCGATC GCGAAAAGGT CGCTTCATAT CAGGGCATGC ACATCCTGGA
    AGCCCCTATG TATATGGACT TGCAGGAGCC GACCTTGAAG TTGATTTTCC TGAGCCAGTG
    CCTGTAGTTG GTGTATCACA TTCAGCAAAA GGTTATTGCT TGATATCAGT TATTGAGACT
    CTGAAAACAT ATTCCATAGA AGATGGGCTA ACTGAAGAGG CTATAGTCAC AAAGCTCCGT
    ACCCGTCCCC ACCACCATTT GTTTCTGCAT ATATCATTAC GGCATAATTG TACAGGGACA
    CATCGTTGGG GAGAATTTGG TGAAGGTGGT TTGCTATGGG GGGAGTGTAC TGACAAGAAC
    TTTGAGTGGT TCGATGGCAG TCCTGCTGAT GGACTTTTGT GCAAGGTGAG GGACCTTTAT
    GGTCTTGACA AAGAAGAAAT TTTTCGAAAT GTCACAGTTT CTTCAGAAAA AAGGCCTCAG
    CCATTATACC TTGGAGCTGC AACTCAAATT GGTTGCATAC CTACTGAAGG AATTCCTAGC
    TTGTTGAAAA TTTTGCTTCC ATCAAGCTGC ACTGGTCTCC CTGTGATATA TATGAGGGAT
    CTTCTACTTA ATCCTCCCAT GTATCCTACT GCATTGTCAA TTCAAGAATC GTGCAAGCTA
    ATGAGCAATA TAAGTTGCCC AATTCCTGAA TTTACAGTTA TTCCGGCAAC AAAGCTTGTG
    AAGCTTCTTG AGTCGAAGGA AGCAAACCAC ATTGAATTTT GCAGAATAAA AAATGTGGTC
    GACGATGTTC TGTACTTGTA TGGAAAATCA GATCTTAATC CCATCCTACA ATCGCTGCTA
    GATCCTACTT GGGTTGCAAC TGGCTTGAAG ATCGATTGCG AGTCCCTGGT GAATGAGTGC
    AGATTGGTTT CACATAGAAT TGGTGAAATG ATTTCCCTAC AAGGAGAAAG AGTTCAAGCA
    ACAAGTTCTG TCCCTTGTAT TCCAGGAGAA TTTTTTGAAG ACGTTGAGTC ATCATGGAAG
    GGTCGCGTGA AAGAAATCCA TACAGAAAAT GCAATCAAAG AAGTAGAGAG AGCGGCAGAA
    TCCTTGTCTT TGGCTGTTAC TGAAGACTTC CTTCCAATTG TTCAGAGAGT AAAAGCTGTC
    ATGTCTCCAC ATGGTGGTCC TAGGGGTGAT ATAACCTATG CCAAAGAGCA TGAAGCTGTT
    TGGTTTAAAG GAAAGCGTTT TGTACCATCT GTTTGGGGTG GTACTCCTGG TGAAGAACAA
    ATTAAGAGAC TAATACCTGC TACAGATTCT AAAGGGAAAA ATGTTGGAGA TGAATGGTTT
    ACAACTGAGA AGGTGGAGGT TGCTCTGAAT AGGTACCATG AGGCAAATGC CCGAGCCAAG
    GCGGTTGTCA TGGATATTTT GCGAGGACTT TCTACTGAAA TGCAAATTAA GATAAATGTT
    CTTGTCTACT CATCCATGTT GCTGGTTATT GCAAAGGCAT TATTTGCTCA TGTCAGTGAA
    GGAAGGAGGA GGAAATGGGT CTTTCCCACA TTAAAGGAGT TCGATAGATC TGCAGGCAGC
    ACTTTGTCAT GGGAAAATGA CTATATGGAG ATTGTGGGAT TATCACCATA TTGGTTTGAT
    GCTGCTCAAG GAAATGCTAT ACAAAATACT GTTAAAATGC ATTCCTTATT TATTTTGACG
    GGACCAAATG GTGGTGGCAA ATCTAGCTTG CTTCGGTCGA TTTGTGCTGC AGCATTACTT
    GGAATTTGTG GGTTGATGGT TCCGGCAGTC TCTGCTGTCA TTCCACATTT TGATTCTATC
    ATGCTTCATA TGAAATCCTA TGATAGTCCT GCGGATGGGA AAAGCTCCTT CCAGATGGAA
    ATGTCAGAGC TGCGGTCAAT AGTTACAAGA GCCACTGCAA GGAGCCTTGT GCTAGTGGAT
    GAAATTTGCA GGGGAACGGA AATGTGGAAA GGCACCTGCA TTGCTGGAAG CATATTGGAA
    ACTCTCGATA ATATTGGTTG CCTGGGCATT GTATCAACCC ACATCCATGC CCTCTTCAAT
    TTACCACTAG CGACAAAGAA CATTGTTTCT AAAGCAATGG GAACGGAGAA AGTGAATGGT
    CGAACAAAAC CGACATGGAA GTTAATAGAC GGAATTTGTA GAGAAAGCCT TGCATTTGAG
    ACGGCTCAAA ATGAAGGAAT CCCTGAAGCT ATTATCAGAA GAGCTGAGGA GTTGTATCTC
    ACTATCAAAG ATGCTCAAAC AGATTTAGAA AAGATTGATG GTGCAAAGGG GCATCGAACT
    CATCAGTCTG GAATTACTTG TTTGTATGAT CGTAGTGATT GTTTGAGACC TGAAGGTAAT
    GAATCTAGTT CTTTTGAAAA TGTGTCAATT AAGGGAACTG ATATTGATGC TTTAACTAGT
    AATGGACTAT CCAAATATGA AGATGATCAT GCCCCACATG ATTCTATGAA ATTCCGAAGA
    TTATGTGTCA CAAATGAAGG CTGTGGAGAA AGTTCAACGA GTTCCTCACA TGTGATGTCT
    CTGTCTGCGG AGAGGCAGAG ATTATTGCAG AATGCTGGAA GAGCTGTGAC CATCATTTGT
    CAGAGAAAGT TGAACGAACT TTACGTGCAG AAATGTATAG CTGCACTTGC AGAGATCTCT
    TGTGTCACTG TTGATTCTAA AGAACAACCT CCTCCCTCGA CAATAGGCAC TTCAAGTGTA
    TATGTGTTAG TCAGGCCTGA TTTGAAGTTG TATGTTGGAC AGACAGATGA CCTTATTGGT
    CGCGTCCGTA CACACCGTTC ATCAGAAGGC ATGCAGGATG TGCCTTTTCT TTATGTTGTG
    GTCCCAGGGA GGAGTGTGGC TTGCCTATTA GAAACTCTGC TCATTAACCA GCTTCCCCTT
    CAAGGATTTC ATCTTTCTAA CAAAGCTGAT GGTAAGCACC GCAACTTTGG CACGTGTCAT
    CTCACATTGG AAGGTTCAGC GTTGGCTCAA TGA

    The stop codon will be deleted if the pcDNA3.1+/C-(K)DYK vector is selected.
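    To illustrate what removing the stop codon means for the listed sequence, here is a minimal sketch. It assumes the ORF is given 5' to 3' in frame and ends with one of the three standard stop codons. The function name `strip_stop_codon` is hypothetical, and the sketch does not model how the vendor joins the insert to the tag.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def strip_stop_codon(orf: str) -> str:
    """Return the ORF without its terminal stop codon.

    Whitespace (such as the 10-nt grouping used on this page) is ignored.
    """
    seq = "".join(orf.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("ORF length is not a multiple of 3")
    if seq[-3:] not in STOP_CODONS:
        raise ValueError("ORF does not end with a stop codon")
    return seq[:-3]


# Final line of the ORF listed above (33 nt, in frame, ending in TGA).
tail = "CTCACATTGG AAGGTTCAGC GTTGGCTCAA TGA"
trimmed = strip_stop_codon(tail)
print(len(trimmed), trimmed[-3:])
```

    Applied to the full 3093 bp ORF, the same step would leave 3090 nt, so that translation can continue into a C-terminal tag.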

    CloneID OAm35128
    Clone ID Related Accession (Same CDS sequence) XM_020673533.1, XM_020673534.1, XM_020673532.1
    Accession Version XM_020673534.1 Latest version!
    Sequence Information ORF Nucleotide Sequence (Length: 3093 bp)
    Vector pcDNA3.1-C-(k)DYK or customized vector
    Tag on pcDNA3.1+/C-(K)DYK C-terminal DYKDDDDK tag
    ORF Insert Method CloneEZ™ Seamless cloning technology
    Insert Structure linear
    Update Date 2017-04-03
    Organism Amborella trichopoda
    Product DNA mismatch repair protein MSH1, mitochondrial isoform X3
    Comment MODEL REFSEQ: This record is predicted by automated computational analysis. This record is derived from a genomic sequence (NW_006499920.1) annotated using gene prediction method: Gnomon, supported by EST evidence. Also see: Documentation of NCBI's Annotation Process.
    ##Genome-Annotation-Data-START##
    Annotation Provider :: NCBI
    Annotation Status :: Full annotation
    Annotation Version :: Amborella trichopoda Annotation Release 101
    Annotation Pipeline :: NCBI eukaryotic genome annotation pipeline
    Annotation Software Version :: 7.3
    Annotation Method :: Best-placed RefSeq; Gnomon
    Features Annotated :: Gene; mRNA; CDS; ncRNA
    ##Genome-Annotation-Data-END##

    ATGAGTGGTT TGAGATCAGA CACTGTCCCA AGAGCTGGTT GTCCTGTCAT GAACTTGCGG 
    CAAACTTTGG ATGACTTGAC TCGAAGTGGA TATTCTGTTT GTATAGTAGA GGAAGTGCAG
    GGCCCAACTC AAGCTCGATC GCGAAAAGGT CGCTTCATAT CAGGGCATGC ACATCCTGGA
    AGCCCCTATG TATATGGACT TGCAGGAGCC GACCTTGAAG TTGATTTTCC TGAGCCAGTG
    CCTGTAGTTG GTGTATCACA TTCAGCAAAA GGTTATTGCT TGATATCAGT TATTGAGACT
    CTGAAAACAT ATTCCATAGA AGATGGGCTA ACTGAAGAGG CTATAGTCAC AAAGCTCCGT
    ACCCGTCCCC ACCACCATTT GTTTCTGCAT ATATCATTAC GGCATAATTG TACAGGGACA
    CATCGTTGGG GAGAATTTGG TGAAGGTGGT TTGCTATGGG GGGAGTGTAC TGACAAGAAC
    TTTGAGTGGT TCGATGGCAG TCCTGCTGAT GGACTTTTGT GCAAGGTGAG GGACCTTTAT
    GGTCTTGACA AAGAAGAAAT TTTTCGAAAT GTCACAGTTT CTTCAGAAAA AAGGCCTCAG
    CCATTATACC TTGGAGCTGC AACTCAAATT GGTTGCATAC CTACTGAAGG AATTCCTAGC
    TTGTTGAAAA TTTTGCTTCC ATCAAGCTGC ACTGGTCTCC CTGTGATATA TATGAGGGAT
    CTTCTACTTA ATCCTCCCAT GTATCCTACT GCATTGTCAA TTCAAGAATC GTGCAAGCTA
    ATGAGCAATA TAAGTTGCCC AATTCCTGAA TTTACAGTTA TTCCGGCAAC AAAGCTTGTG
    AAGCTTCTTG AGTCGAAGGA AGCAAACCAC ATTGAATTTT GCAGAATAAA AAATGTGGTC
    GACGATGTTC TGTACTTGTA TGGAAAATCA GATCTTAATC CCATCCTACA ATCGCTGCTA
    GATCCTACTT GGGTTGCAAC TGGCTTGAAG ATCGATTGCG AGTCCCTGGT GAATGAGTGC
    AGATTGGTTT CACATAGAAT TGGTGAAATG ATTTCCCTAC AAGGAGAAAG AGTTCAAGCA
    ACAAGTTCTG TCCCTTGTAT TCCAGGAGAA TTTTTTGAAG ACGTTGAGTC ATCATGGAAG
    GGTCGCGTGA AAGAAATCCA TACAGAAAAT GCAATCAAAG AAGTAGAGAG AGCGGCAGAA
    TCCTTGTCTT TGGCTGTTAC TGAAGACTTC CTTCCAATTG TTCAGAGAGT AAAAGCTGTC
    ATGTCTCCAC ATGGTGGTCC TAGGGGTGAT ATAACCTATG CCAAAGAGCA TGAAGCTGTT
    TGGTTTAAAG GAAAGCGTTT TGTACCATCT GTTTGGGGTG GTACTCCTGG TGAAGAACAA
    ATTAAGAGAC TAATACCTGC TACAGATTCT AAAGGGAAAA ATGTTGGAGA TGAATGGTTT
    ACAACTGAGA AGGTGGAGGT TGCTCTGAAT AGGTACCATG AGGCAAATGC CCGAGCCAAG
    GCGGTTGTCA TGGATATTTT GCGAGGACTT TCTACTGAAA TGCAAATTAA GATAAATGTT
    CTTGTCTACT CATCCATGTT GCTGGTTATT GCAAAGGCAT TATTTGCTCA TGTCAGTGAA
    GGAAGGAGGA GGAAATGGGT CTTTCCCACA TTAAAGGAGT TCGATAGATC TGCAGGCAGC
    ACTTTGTCAT GGGAAAATGA CTATATGGAG ATTGTGGGAT TATCACCATA TTGGTTTGAT
    GCTGCTCAAG GAAATGCTAT ACAAAATACT GTTAAAATGC ATTCCTTATT TATTTTGACG
    GGACCAAATG GTGGTGGCAA ATCTAGCTTG CTTCGGTCGA TTTGTGCTGC AGCATTACTT
    GGAATTTGTG GGTTGATGGT TCCGGCAGTC TCTGCTGTCA TTCCACATTT TGATTCTATC
    ATGCTTCATA TGAAATCCTA TGATAGTCCT GCGGATGGGA AAAGCTCCTT CCAGATGGAA
    ATGTCAGAGC TGCGGTCAAT AGTTACAAGA GCCACTGCAA GGAGCCTTGT GCTAGTGGAT
    GAAATTTGCA GGGGAACGGA AATGTGGAAA GGCACCTGCA TTGCTGGAAG CATATTGGAA
    ACTCTCGATA ATATTGGTTG CCTGGGCATT GTATCAACCC ACATCCATGC CCTCTTCAAT
    TTACCACTAG CGACAAAGAA CATTGTTTCT AAAGCAATGG GAACGGAGAA AGTGAATGGT
    CGAACAAAAC CGACATGGAA GTTAATAGAC GGAATTTGTA GAGAAAGCCT TGCATTTGAG
    ACGGCTCAAA ATGAAGGAAT CCCTGAAGCT ATTATCAGAA GAGCTGAGGA GTTGTATCTC
    ACTATCAAAG ATGCTCAAAC AGATTTAGAA AAGATTGATG GTGCAAAGGG GCATCGAACT
    CATCAGTCTG GAATTACTTG TTTGTATGAT CGTAGTGATT GTTTGAGACC TGAAGGTAAT
    GAATCTAGTT CTTTTGAAAA TGTGTCAATT AAGGGAACTG ATATTGATGC TTTAACTAGT
    AATGGACTAT CCAAATATGA AGATGATCAT GCCCCACATG ATTCTATGAA ATTCCGAAGA
    TTATGTGTCA CAAATGAAGG CTGTGGAGAA AGTTCAACGA GTTCCTCACA TGTGATGTCT
    CTGTCTGCGG AGAGGCAGAG ATTATTGCAG AATGCTGGAA GAGCTGTGAC CATCATTTGT
    CAGAGAAAGT TGAACGAACT TTACGTGCAG AAATGTATAG CTGCACTTGC AGAGATCTCT
    TGTGTCACTG TTGATTCTAA AGAACAACCT CCTCCCTCGA CAATAGGCAC TTCAAGTGTA
    TATGTGTTAG TCAGGCCTGA TTTGAAGTTG TATGTTGGAC AGACAGATGA CCTTATTGGT
    CGCGTCCGTA CACACCGTTC ATCAGAAGGC ATGCAGGATG TGCCTTTTCT TTATGTTGTG
    GTCCCAGGGA GGAGTGTGGC TTGCCTATTA GAAACTCTGC TCATTAACCA GCTTCCCCTT
    CAAGGATTTC ATCTTTCTAA CAAAGCTGAT GGTAAGCACC GCAACTTTGG CACGTGTCAT
    CTCACATTGG AAGGTTCAGC GTTGGCTCAA TGA

    The stop codon will be deleted if the pcDNA3.1+/C-(K)DYK vector is selected.

    RefSeq XP_020529193.1
    CDS 894..3986
    Translation

    Target ORF information:

    RefSeq Version XM_020673534.1
    Organism Amborella trichopoda
    Definition Amborella trichopoda DNA mismatch repair protein MSH1, mitochondrial (MSH1), transcript variant X5, mRNA.

    Vector information:

    Epitope DYKDDDDK
    Bacterial selection AMPR
    Mammalian selection NeoR
    Vector pcDNA3.1+/C-(K)DYK
    XM_020673534.1

    ORF Insert Sequence:

    ATGAGTGGTT TGAGATCAGA CACTGTCCCA AGAGCTGGTT GTCCTGTCAT GAACTTGCGG 
    CAAACTTTGG ATGACTTGAC TCGAAGTGGA TATTCTGTTT GTATAGTAGA GGAAGTGCAG
    GGCCCAACTC AAGCTCGATC GCGAAAAGGT CGCTTCATAT CAGGGCATGC ACATCCTGGA
    AGCCCCTATG TATATGGACT TGCAGGAGCC GACCTTGAAG TTGATTTTCC TGAGCCAGTG
    CCTGTAGTTG GTGTATCACA TTCAGCAAAA GGTTATTGCT TGATATCAGT TATTGAGACT
    CTGAAAACAT ATTCCATAGA AGATGGGCTA ACTGAAGAGG CTATAGTCAC AAAGCTCCGT
    ACCCGTCCCC ACCACCATTT GTTTCTGCAT ATATCATTAC GGCATAATTG TACAGGGACA
    CATCGTTGGG GAGAATTTGG TGAAGGTGGT TTGCTATGGG GGGAGTGTAC TGACAAGAAC
    TTTGAGTGGT TCGATGGCAG TCCTGCTGAT GGACTTTTGT GCAAGGTGAG GGACCTTTAT
    GGTCTTGACA AAGAAGAAAT TTTTCGAAAT GTCACAGTTT CTTCAGAAAA AAGGCCTCAG
    CCATTATACC TTGGAGCTGC AACTCAAATT GGTTGCATAC CTACTGAAGG AATTCCTAGC
    TTGTTGAAAA TTTTGCTTCC ATCAAGCTGC ACTGGTCTCC CTGTGATATA TATGAGGGAT
    CTTCTACTTA ATCCTCCCAT GTATCCTACT GCATTGTCAA TTCAAGAATC GTGCAAGCTA
    ATGAGCAATA TAAGTTGCCC AATTCCTGAA TTTACAGTTA TTCCGGCAAC AAAGCTTGTG
    AAGCTTCTTG AGTCGAAGGA AGCAAACCAC ATTGAATTTT GCAGAATAAA AAATGTGGTC
    GACGATGTTC TGTACTTGTA TGGAAAATCA GATCTTAATC CCATCCTACA ATCGCTGCTA
    GATCCTACTT GGGTTGCAAC TGGCTTGAAG ATCGATTGCG AGTCCCTGGT GAATGAGTGC
    AGATTGGTTT CACATAGAAT TGGTGAAATG ATTTCCCTAC AAGGAGAAAG AGTTCAAGCA
    ACAAGTTCTG TCCCTTGTAT TCCAGGAGAA TTTTTTGAAG ACGTTGAGTC ATCATGGAAG
    GGTCGCGTGA AAGAAATCCA TACAGAAAAT GCAATCAAAG AAGTAGAGAG AGCGGCAGAA
    TCCTTGTCTT TGGCTGTTAC TGAAGACTTC CTTCCAATTG TTCAGAGAGT AAAAGCTGTC
    ATGTCTCCAC ATGGTGGTCC TAGGGGTGAT ATAACCTATG CCAAAGAGCA TGAAGCTGTT
    TGGTTTAAAG GAAAGCGTTT TGTACCATCT GTTTGGGGTG GTACTCCTGG TGAAGAACAA
    ATTAAGAGAC TAATACCTGC TACAGATTCT AAAGGGAAAA ATGTTGGAGA TGAATGGTTT
    ACAACTGAGA AGGTGGAGGT TGCTCTGAAT AGGTACCATG AGGCAAATGC CCGAGCCAAG
    GCGGTTGTCA TGGATATTTT GCGAGGACTT TCTACTGAAA TGCAAATTAA GATAAATGTT
    CTTGTCTACT CATCCATGTT GCTGGTTATT GCAAAGGCAT TATTTGCTCA TGTCAGTGAA
    GGAAGGAGGA GGAAATGGGT CTTTCCCACA TTAAAGGAGT TCGATAGATC TGCAGGCAGC
    ACTTTGTCAT GGGAAAATGA CTATATGGAG ATTGTGGGAT TATCACCATA TTGGTTTGAT
    GCTGCTCAAG GAAATGCTAT ACAAAATACT GTTAAAATGC ATTCCTTATT TATTTTGACG
    GGACCAAATG GTGGTGGCAA ATCTAGCTTG CTTCGGTCGA TTTGTGCTGC AGCATTACTT
    GGAATTTGTG GGTTGATGGT TCCGGCAGTC TCTGCTGTCA TTCCACATTT TGATTCTATC
    ATGCTTCATA TGAAATCCTA TGATAGTCCT GCGGATGGGA AAAGCTCCTT CCAGATGGAA
    ATGTCAGAGC TGCGGTCAAT AGTTACAAGA GCCACTGCAA GGAGCCTTGT GCTAGTGGAT
    GAAATTTGCA GGGGAACGGA AATGTGGAAA GGCACCTGCA TTGCTGGAAG CATATTGGAA
    ACTCTCGATA ATATTGGTTG CCTGGGCATT GTATCAACCC ACATCCATGC CCTCTTCAAT
    TTACCACTAG CGACAAAGAA CATTGTTTCT AAAGCAATGG GAACGGAGAA AGTGAATGGT
    CGAACAAAAC CGACATGGAA GTTAATAGAC GGAATTTGTA GAGAAAGCCT TGCATTTGAG
    ACGGCTCAAA ATGAAGGAAT CCCTGAAGCT ATTATCAGAA GAGCTGAGGA GTTGTATCTC
    ACTATCAAAG ATGCTCAAAC AGATTTAGAA AAGATTGATG GTGCAAAGGG GCATCGAACT
    CATCAGTCTG GAATTACTTG TTTGTATGAT CGTAGTGATT GTTTGAGACC TGAAGGTAAT
    GAATCTAGTT CTTTTGAAAA TGTGTCAATT AAGGGAACTG ATATTGATGC TTTAACTAGT
    AATGGACTAT CCAAATATGA AGATGATCAT GCCCCACATG ATTCTATGAA ATTCCGAAGA
    TTATGTGTCA CAAATGAAGG CTGTGGAGAA AGTTCAACGA GTTCCTCACA TGTGATGTCT
    CTGTCTGCGG AGAGGCAGAG ATTATTGCAG AATGCTGGAA GAGCTGTGAC CATCATTTGT
    CAGAGAAAGT TGAACGAACT TTACGTGCAG AAATGTATAG CTGCACTTGC AGAGATCTCT
    TGTGTCACTG TTGATTCTAA AGAACAACCT CCTCCCTCGA CAATAGGCAC TTCAAGTGTA
    TATGTGTTAG TCAGGCCTGA TTTGAAGTTG TATGTTGGAC AGACAGATGA CCTTATTGGT
    CGCGTCCGTA CACACCGTTC ATCAGAAGGC ATGCAGGATG TGCCTTTTCT TTATGTTGTG
    GTCCCAGGGA GGAGTGTGGC TTGCCTATTA GAAACTCTGC TCATTAACCA GCTTCCCCTT
    CAAGGATTTC ATCTTTCTAA CAAAGCTGAT GGTAAGCACC GCAACTTTGG CACGTGTCAT
    CTCACATTGG AAGGTTCAGC GTTGGCTCAA TGA

    The stop codon will be deleted if the pcDNA3.1+/C-(K)DYK vector is selected.

    CloneID OAm35128
    Clone ID Related Accession (Same CDS sequence) XM_020673533.1, XM_020673534.1, XM_020673532.1
    Accession Version XM_020673532.1 Latest version!
    Sequence Information ORF Nucleotide Sequence (Length: 3093 bp)
    Vector pcDNA3.1-C-(k)DYK or customized vector
    Tag on pcDNA3.1+/C-(K)DYK C-terminal DYKDDDDK tag
    ORF Insert Method CloneEZ™ Seamless cloning technology
    Insert Structure linear
    Update Date 2017-04-03
    Organism Amborella trichopoda
    Product DNA mismatch repair protein MSH1, mitochondrial isoform X3
    Comment MODEL REFSEQ: This record is predicted by automated computational analysis. This record is derived from a genomic sequence (NW_006499920.1) annotated using gene prediction method: Gnomon, supported by EST evidence. Also see: Documentation of NCBI's Annotation Process.
    ##Genome-Annotation-Data-START##
    Annotation Provider :: NCBI
    Annotation Status :: Full annotation
    Annotation Version :: Amborella trichopoda Annotation Release 101
    Annotation Pipeline :: NCBI eukaryotic genome annotation pipeline
    Annotation Software Version :: 7.3
    Annotation Method :: Best-placed RefSeq; Gnomon
    Features Annotated :: Gene; mRNA; CDS; ncRNA
    ##Genome-Annotation-Data-END##

    ATGAGTGGTT TGAGATCAGA CACTGTCCCA AGAGCTGGTT GTCCTGTCAT GAACTTGCGG 
    CAAACTTTGG ATGACTTGAC TCGAAGTGGA TATTCTGTTT GTATAGTAGA GGAAGTGCAG
    GGCCCAACTC AAGCTCGATC GCGAAAAGGT CGCTTCATAT CAGGGCATGC ACATCCTGGA
    AGCCCCTATG TATATGGACT TGCAGGAGCC GACCTTGAAG TTGATTTTCC TGAGCCAGTG
    CCTGTAGTTG GTGTATCACA TTCAGCAAAA GGTTATTGCT TGATATCAGT TATTGAGACT
    CTGAAAACAT ATTCCATAGA AGATGGGCTA ACTGAAGAGG CTATAGTCAC AAAGCTCCGT
    ACCCGTCCCC ACCACCATTT GTTTCTGCAT ATATCATTAC GGCATAATTG TACAGGGACA
    CATCGTTGGG GAGAATTTGG TGAAGGTGGT TTGCTATGGG GGGAGTGTAC TGACAAGAAC
    TTTGAGTGGT TCGATGGCAG TCCTGCTGAT GGACTTTTGT GCAAGGTGAG GGACCTTTAT
    GGTCTTGACA AAGAAGAAAT TTTTCGAAAT GTCACAGTTT CTTCAGAAAA AAGGCCTCAG
    CCATTATACC TTGGAGCTGC AACTCAAATT GGTTGCATAC CTACTGAAGG AATTCCTAGC
    TTGTTGAAAA TTTTGCTTCC ATCAAGCTGC ACTGGTCTCC CTGTGATATA TATGAGGGAT
    CTTCTACTTA ATCCTCCCAT GTATCCTACT GCATTGTCAA TTCAAGAATC GTGCAAGCTA
    ATGAGCAATA TAAGTTGCCC AATTCCTGAA TTTACAGTTA TTCCGGCAAC AAAGCTTGTG
    AAGCTTCTTG AGTCGAAGGA AGCAAACCAC ATTGAATTTT GCAGAATAAA AAATGTGGTC
    GACGATGTTC TGTACTTGTA TGGAAAATCA GATCTTAATC CCATCCTACA ATCGCTGCTA
    GATCCTACTT GGGTTGCAAC TGGCTTGAAG ATCGATTGCG AGTCCCTGGT GAATGAGTGC
    AGATTGGTTT CACATAGAAT TGGTGAAATG ATTTCCCTAC AAGGAGAAAG AGTTCAAGCA
    ACAAGTTCTG TCCCTTGTAT TCCAGGAGAA TTTTTTGAAG ACGTTGAGTC ATCATGGAAG
    GGTCGCGTGA AAGAAATCCA TACAGAAAAT GCAATCAAAG AAGTAGAGAG AGCGGCAGAA
    TCCTTGTCTT TGGCTGTTAC TGAAGACTTC CTTCCAATTG TTCAGAGAGT AAAAGCTGTC
    ATGTCTCCAC ATGGTGGTCC TAGGGGTGAT ATAACCTATG CCAAAGAGCA TGAAGCTGTT
    TGGTTTAAAG GAAAGCGTTT TGTACCATCT GTTTGGGGTG GTACTCCTGG TGAAGAACAA
    ATTAAGAGAC TAATACCTGC TACAGATTCT AAAGGGAAAA ATGTTGGAGA TGAATGGTTT
    ACAACTGAGA AGGTGGAGGT TGCTCTGAAT AGGTACCATG AGGCAAATGC CCGAGCCAAG
    GCGGTTGTCA TGGATATTTT GCGAGGACTT TCTACTGAAA TGCAAATTAA GATAAATGTT
    CTTGTCTACT CATCCATGTT GCTGGTTATT GCAAAGGCAT TATTTGCTCA TGTCAGTGAA
    GGAAGGAGGA GGAAATGGGT CTTTCCCACA TTAAAGGAGT TCGATAGATC TGCAGGCAGC
    ACTTTGTCAT GGGAAAATGA CTATATGGAG ATTGTGGGAT TATCACCATA TTGGTTTGAT
    GCTGCTCAAG GAAATGCTAT ACAAAATACT GTTAAAATGC ATTCCTTATT TATTTTGACG
    GGACCAAATG GTGGTGGCAA ATCTAGCTTG CTTCGGTCGA TTTGTGCTGC AGCATTACTT
    GGAATTTGTG GGTTGATGGT TCCGGCAGTC TCTGCTGTCA TTCCACATTT TGATTCTATC
    ATGCTTCATA TGAAATCCTA TGATAGTCCT GCGGATGGGA AAAGCTCCTT CCAGATGGAA
    ATGTCAGAGC TGCGGTCAAT AGTTACAAGA GCCACTGCAA GGAGCCTTGT GCTAGTGGAT
    GAAATTTGCA GGGGAACGGA AATGTGGAAA GGCACCTGCA TTGCTGGAAG CATATTGGAA
    ACTCTCGATA ATATTGGTTG CCTGGGCATT GTATCAACCC ACATCCATGC CCTCTTCAAT
    TTACCACTAG CGACAAAGAA CATTGTTTCT AAAGCAATGG GAACGGAGAA AGTGAATGGT
    CGAACAAAAC CGACATGGAA GTTAATAGAC GGAATTTGTA GAGAAAGCCT TGCATTTGAG
    ACGGCTCAAA ATGAAGGAAT CCCTGAAGCT ATTATCAGAA GAGCTGAGGA GTTGTATCTC
    ACTATCAAAG ATGCTCAAAC AGATTTAGAA AAGATTGATG GTGCAAAGGG GCATCGAACT
    CATCAGTCTG GAATTACTTG TTTGTATGAT CGTAGTGATT GTTTGAGACC TGAAGGTAAT
    GAATCTAGTT CTTTTGAAAA TGTGTCAATT AAGGGAACTG ATATTGATGC TTTAACTAGT
    AATGGACTAT CCAAATATGA AGATGATCAT GCCCCACATG ATTCTATGAA ATTCCGAAGA
    TTATGTGTCA CAAATGAAGG CTGTGGAGAA AGTTCAACGA GTTCCTCACA TGTGATGTCT
    CTGTCTGCGG AGAGGCAGAG ATTATTGCAG AATGCTGGAA GAGCTGTGAC CATCATTTGT
    CAGAGAAAGT TGAACGAACT TTACGTGCAG AAATGTATAG CTGCACTTGC AGAGATCTCT
    TGTGTCACTG TTGATTCTAA AGAACAACCT CCTCCCTCGA CAATAGGCAC TTCAAGTGTA
    TATGTGTTAG TCAGGCCTGA TTTGAAGTTG TATGTTGGAC AGACAGATGA CCTTATTGGT
    CGCGTCCGTA CACACCGTTC ATCAGAAGGC ATGCAGGATG TGCCTTTTCT TTATGTTGTG
    GTCCCAGGGA GGAGTGTGGC TTGCCTATTA GAAACTCTGC TCATTAACCA GCTTCCCCTT
    CAAGGATTTC ATCTTTCTAA CAAAGCTGAT GGTAAGCACC GCAACTTTGG CACGTGTCAT
    CTCACATTGG AAGGTTCAGC GTTGGCTCAA TGA

    The stop codon will be deleted if the pcDNA3.1+/C-(K)DYK vector is selected.

    RefSeq XP_020529191.1
    CDS 250..3342
    Translation

    Target ORF information:

    RefSeq Version XM_020673532.1
    Organism Amborella trichopoda
    Definition Amborella trichopoda DNA mismatch repair protein MSH1, mitochondrial (MSH1), transcript variant X3, mRNA.

    Vector information:

    Epitope DYKDDDDK
    Bacterial selection AMPR
    Mammalian selection NeoR
    Vector pcDNA3.1+/C-(K)DYK
    XM_020673532.1

    ORF Insert Sequence:

    ATGAGTGGTT TGAGATCAGA CACTGTCCCA AGAGCTGGTT GTCCTGTCAT GAACTTGCGG 
    CAAACTTTGG ATGACTTGAC TCGAAGTGGA TATTCTGTTT GTATAGTAGA GGAAGTGCAG
    GGCCCAACTC AAGCTCGATC GCGAAAAGGT CGCTTCATAT CAGGGCATGC ACATCCTGGA
    AGCCCCTATG TATATGGACT TGCAGGAGCC GACCTTGAAG TTGATTTTCC TGAGCCAGTG
    CCTGTAGTTG GTGTATCACA TTCAGCAAAA GGTTATTGCT TGATATCAGT TATTGAGACT
    CTGAAAACAT ATTCCATAGA AGATGGGCTA ACTGAAGAGG CTATAGTCAC AAAGCTCCGT
    ACCCGTCCCC ACCACCATTT GTTTCTGCAT ATATCATTAC GGCATAATTG TACAGGGACA
    CATCGTTGGG GAGAATTTGG TGAAGGTGGT TTGCTATGGG GGGAGTGTAC TGACAAGAAC
    TTTGAGTGGT TCGATGGCAG TCCTGCTGAT GGACTTTTGT GCAAGGTGAG GGACCTTTAT
    GGTCTTGACA AAGAAGAAAT TTTTCGAAAT GTCACAGTTT CTTCAGAAAA AAGGCCTCAG
    CCATTATACC TTGGAGCTGC AACTCAAATT GGTTGCATAC CTACTGAAGG AATTCCTAGC
    TTGTTGAAAA TTTTGCTTCC ATCAAGCTGC ACTGGTCTCC CTGTGATATA TATGAGGGAT
    CTTCTACTTA ATCCTCCCAT GTATCCTACT GCATTGTCAA TTCAAGAATC GTGCAAGCTA
    ATGAGCAATA TAAGTTGCCC AATTCCTGAA TTTACAGTTA TTCCGGCAAC AAAGCTTGTG
    AAGCTTCTTG AGTCGAAGGA AGCAAACCAC ATTGAATTTT GCAGAATAAA AAATGTGGTC
    GACGATGTTC TGTACTTGTA TGGAAAATCA GATCTTAATC CCATCCTACA ATCGCTGCTA
    GATCCTACTT GGGTTGCAAC TGGCTTGAAG ATCGATTGCG AGTCCCTGGT GAATGAGTGC
    AGATTGGTTT CACATAGAAT TGGTGAAATG ATTTCCCTAC AAGGAGAAAG AGTTCAAGCA
    ACAAGTTCTG TCCCTTGTAT TCCAGGAGAA TTTTTTGAAG ACGTTGAGTC ATCATGGAAG
    GGTCGCGTGA AAGAAATCCA TACAGAAAAT GCAATCAAAG AAGTAGAGAG AGCGGCAGAA
    TCCTTGTCTT TGGCTGTTAC TGAAGACTTC CTTCCAATTG TTCAGAGAGT AAAAGCTGTC
    ATGTCTCCAC ATGGTGGTCC TAGGGGTGAT ATAACCTATG CCAAAGAGCA TGAAGCTGTT
    TGGTTTAAAG GAAAGCGTTT TGTACCATCT GTTTGGGGTG GTACTCCTGG TGAAGAACAA
    ATTAAGAGAC TAATACCTGC TACAGATTCT AAAGGGAAAA ATGTTGGAGA TGAATGGTTT
    ACAACTGAGA AGGTGGAGGT TGCTCTGAAT AGGTACCATG AGGCAAATGC CCGAGCCAAG
    GCGGTTGTCA TGGATATTTT GCGAGGACTT TCTACTGAAA TGCAAATTAA GATAAATGTT
    CTTGTCTACT CATCCATGTT GCTGGTTATT GCAAAGGCAT TATTTGCTCA TGTCAGTGAA
    GGAAGGAGGA GGAAATGGGT CTTTCCCACA TTAAAGGAGT TCGATAGATC TGCAGGCAGC
    ACTTTGTCAT GGGAAAATGA CTATATGGAG ATTGTGGGAT TATCACCATA TTGGTTTGAT
    GCTGCTCAAG GAAATGCTAT ACAAAATACT GTTAAAATGC ATTCCTTATT TATTTTGACG
    GGACCAAATG GTGGTGGCAA ATCTAGCTTG CTTCGGTCGA TTTGTGCTGC AGCATTACTT
    GGAATTTGTG GGTTGATGGT TCCGGCAGTC TCTGCTGTCA TTCCACATTT TGATTCTATC
    ATGCTTCATA TGAAATCCTA TGATAGTCCT GCGGATGGGA AAAGCTCCTT CCAGATGGAA
    ATGTCAGAGC TGCGGTCAAT AGTTACAAGA GCCACTGCAA GGAGCCTTGT GCTAGTGGAT
    GAAATTTGCA GGGGAACGGA AATGTGGAAA GGCACCTGCA TTGCTGGAAG CATATTGGAA
    ACTCTCGATA ATATTGGTTG CCTGGGCATT GTATCAACCC ACATCCATGC CCTCTTCAAT
    TTACCACTAG CGACAAAGAA CATTGTTTCT AAAGCAATGG GAACGGAGAA AGTGAATGGT
    CGAACAAAAC CGACATGGAA GTTAATAGAC GGAATTTGTA GAGAAAGCCT TGCATTTGAG
    ACGGCTCAAA ATGAAGGAAT CCCTGAAGCT ATTATCAGAA GAGCTGAGGA GTTGTATCTC
    ACTATCAAAG ATGCTCAAAC AGATTTAGAA AAGATTGATG GTGCAAAGGG GCATCGAACT
    CATCAGTCTG GAATTACTTG TTTGTATGAT CGTAGTGATT GTTTGAGACC TGAAGGTAAT
    GAATCTAGTT CTTTTGAAAA TGTGTCAATT AAGGGAACTG ATATTGATGC TTTAACTAGT
    AATGGACTAT CCAAATATGA AGATGATCAT GCCCCACATG ATTCTATGAA ATTCCGAAGA
    TTATGTGTCA CAAATGAAGG CTGTGGAGAA AGTTCAACGA GTTCCTCACA TGTGATGTCT
    CTGTCTGCGG AGAGGCAGAG ATTATTGCAG AATGCTGGAA GAGCTGTGAC CATCATTTGT
    CAGAGAAAGT TGAACGAACT TTACGTGCAG AAATGTATAG CTGCACTTGC AGAGATCTCT
    TGTGTCACTG TTGATTCTAA AGAACAACCT CCTCCCTCGA CAATAGGCAC TTCAAGTGTA
    TATGTGTTAG TCAGGCCTGA TTTGAAGTTG TATGTTGGAC AGACAGATGA CCTTATTGGT
    CGCGTCCGTA CACACCGTTC ATCAGAAGGC ATGCAGGATG TGCCTTTTCT TTATGTTGTG
    GTCCCAGGGA GGAGTGTGGC TTGCCTATTA GAAACTCTGC TCATTAACCA GCTTCCCCTT
    CAAGGATTTC ATCTTTCTAA CAAAGCTGAT GGTAAGCACC GCAACTTTGG CACGTGTCAT
    CTCACATTGG AAGGTTCAGC GTTGGCTCAA TGA

    The stop codon will be deleted if the pcDNA3.1+/C-(K)DYK vector is selected.