BLASTX nr result

ID: Rehmannia27_contig00027235 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia27_contig00027235
         (979 letters)

Database: ./nr 
           84,704,028 sequences; 31,038,470,784 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007044383.1| DNA/RNA polymerases superfamily protein [The...   237   4e-92
gb|AIG55302.1| gag-pol, partial [Camellia sinensis]                   239   5e-91
ref|XP_007200265.1| hypothetical protein PRUPE_ppa015000mg [Prun...   233   2e-89
ref|XP_008244885.1| PREDICTED: uncharacterized protein LOC103342...   236   5e-89
ref|XP_012829796.1| PREDICTED: uncharacterized protein LOC105950...   236   7e-88
ref|XP_011085927.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri...   227   9e-88
emb|CAA73042.1| polyprotein [Ananas comosus]                          224   3e-87
ref|XP_007049837.1| DNA/RNA polymerases superfamily protein [The...   235   4e-87
ref|XP_012487705.1| PREDICTED: uncharacterized protein LOC105800...   228   5e-87
ref|XP_007050046.1| DNA/RNA polymerases superfamily protein [The...   234   1e-86
ref|XP_007198824.1| hypothetical protein PRUPE_ppb020037mg [Prun...   231   1e-86
ref|XP_007221234.1| hypothetical protein PRUPE_ppb019121mg [Prun...   232   1e-86
ref|XP_007220718.1| hypothetical protein PRUPE_ppa022673mg [Prun...   231   2e-86
ref|XP_015960510.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri...   223   3e-86
ref|XP_015944834.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri...   229   7e-86
ref|XP_007099735.1| Uncharacterized protein TCM_045699 [Theobrom...   221   7e-86
emb|CAN66189.1| hypothetical protein VITISV_006047 [Vitis vinifera]   223   9e-86
gb|AAO45752.1| pol protein [Cucumis melo subsp. melo]                 216   9e-86
ref|XP_007028165.1| Retrotransposon protein, Ty3-gypsy subclass,...   224   1e-85
ref|XP_007099730.1| DNA/RNA polymerases superfamily protein [The...   223   1e-85

>ref|XP_007044383.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
            gi|508708318|gb|EOY00215.1| DNA/RNA polymerases
            superfamily protein [Theobroma cacao]
          Length = 1537

 Score =  237 bits (604), Expect(2) = 4e-92
 Identities = 111/152 (73%), Positives = 127/152 (83%)
 Frame = +2

Query: 2    RCLTCQRVKAEHQRPSGLLQPLRIPEWKWEHITMDFVTDLPRTRRGDDAIWVVVDRLTKS 181
            +CLTCQ++KAEHQ+PSG LQPL IPEWKWEH+TMDFV  LPRT+ G DAIWV+VDRLTKS
Sbjct: 1141 KCLTCQQIKAEHQKPSGTLQPLSIPEWKWEHVTMDFVLGLPRTQSGKDAIWVIVDRLTKS 1200

Query: 182  AHFLPMKKTDSLQKLAQIYIREIIRLHGVPVDIVSDRDPRFVSRFWKCLHEELGTSLSFS 361
            AHFL +  T S+++LA++YI EI+RLHGVPV IVSDRD RF SRFW    E LGT L FS
Sbjct: 1201 AHFLAIHSTYSIERLARLYIDEIVRLHGVPVSIVSDRDLRFTSRFWPKFQEALGTKLRFS 1260

Query: 362  TAAHPQTDGQSVQTIQTLEDMLRACTLDFQGS 457
            TA HPQTDGQS +TIQTLEDMLRAC +DF GS
Sbjct: 1261 TAFHPQTDGQSERTIQTLEDMLRACVIDFIGS 1292



 Score =  130 bits (327), Expect(2) = 4e-92
 Identities = 66/132 (50%), Positives = 90/132 (68%)
 Frame = +1

Query: 439  IGFPRKLNSHLPLAEFAYNNSYQATIGMAPYEALYGRKCRSPLHWEIDDQWXXXXXXXXX 618
            I F    + HLPL EFAYNNS+Q++IGMAPYEALYGRKCR+PL W+              
Sbjct: 1287 IDFIGSWDRHLPLVEFAYNNSFQSSIGMAPYEALYGRKCRTPLCWD---------EVGER 1337

Query: 619  XXXXIDLVQEIMEKVRLIKRWIRTA*SRQKSYTDKRKKDLEFEVGDEVCLRLSPQKGLIN 798
                ++L+    +KV++I+  ++TA  RQK+Y+DKR+KDLEFEV D+V L++SP KG+I 
Sbjct: 1338 KLVNVELIDLTNDKVKVIRERLKTAQDRQKNYSDKRRKDLEFEVDDKVFLKVSPWKGVIR 1397

Query: 799  PKKGDKLSPRYV 834
              K  KL+PRY+
Sbjct: 1398 FAKRGKLNPRYI 1409


>gb|AIG55302.1| gag-pol, partial [Camellia sinensis]
          Length = 923

 Score =  239 bits (611), Expect(2) = 5e-91
 Identities = 111/152 (73%), Positives = 127/152 (83%)
 Frame = +2

Query: 2   RCLTCQRVKAEHQRPSGLLQPLRIPEWKWEHITMDFVTDLPRTRRGDDAIWVVVDRLTKS 181
           +CLTCQ+VKAEHQRP+GLLQPL I EWKWEHITMDFV  LPRT+RG DAIWVVVDRLTKS
Sbjct: 543 KCLTCQQVKAEHQRPAGLLQPLPIAEWKWEHITMDFVVGLPRTQRGSDAIWVVVDRLTKS 602

Query: 182 AHFLPMKKTDSLQKLAQIYIREIIRLHGVPVDIVSDRDPRFVSRFWKCLHEELGTSLSFS 361
           AHF+PM+  DS+  LA +YIR+++RLHGVPV IVSDRDP F +R W+ L   LGT L+FS
Sbjct: 603 AHFIPMRVRDSMDHLADLYIRDVVRLHGVPVTIVSDRDPCFTARLWQSLQSALGTKLTFS 662

Query: 362 TAAHPQTDGQSVQTIQTLEDMLRACTLDFQGS 457
           TA HPQTDGQS +TIQ LEDMLR C LDF G+
Sbjct: 663 TAYHPQTDGQSERTIQILEDMLRGCVLDFSGT 694



 Score =  124 bits (310), Expect(2) = 5e-91
 Identities = 63/124 (50%), Positives = 87/124 (70%), Gaps = 1/124 (0%)
 Frame = +1

Query: 466  HLPLAEFAYNNSYQATIGMAPYEALYGRKCRSPLHW-EIDDQWXXXXXXXXXXXXXIDLV 642
            HLPL EFAYNNS+Q++IGMAP+EALYGR CRSP+ W ++ D                +LV
Sbjct: 698  HLPLVEFAYNNSFQSSIGMAPFEALYGRPCRSPVFWADVGDA----------PLLGPELV 747

Query: 643  QEIMEKVRLIKRWIRTA*SRQKSYTDKRKKDLEFEVGDEVCLRLSPQKGLINPKKGDKLS 822
            +E  +K+ LI++ + TA SRQKSY D+RK+ + FEVGD V L++SP++GL+   K  KLS
Sbjct: 748  RETTKKIELIRKRLVTAQSRQKSYADRRKRAMVFEVGDHVFLKISPRRGLMRFGKSGKLS 807

Query: 823  PRYV 834
            PR++
Sbjct: 808  PRFI 811


>ref|XP_007200265.1| hypothetical protein PRUPE_ppa015000mg [Prunus persica]
            gi|462395665|gb|EMJ01464.1| hypothetical protein
            PRUPE_ppa015000mg [Prunus persica]
          Length = 1493

 Score =  233 bits (595), Expect(2) = 2e-89
 Identities = 109/152 (71%), Positives = 125/152 (82%)
 Frame = +2

Query: 2    RCLTCQRVKAEHQRPSGLLQPLRIPEWKWEHITMDFVTDLPRTRRGDDAIWVVVDRLTKS 181
            RCL CQ+VKAE Q+PSGL+QPL IPEWKWE ITMDFV  LPRT +G D IWV+VDRLTKS
Sbjct: 1114 RCLICQQVKAERQKPSGLMQPLPIPEWKWERITMDFVFKLPRTSKGHDGIWVIVDRLTKS 1173

Query: 182  AHFLPMKKTDSLQKLAQIYIREIIRLHGVPVDIVSDRDPRFVSRFWKCLHEELGTSLSFS 361
             HFLP+K+T SL KLA++++ EI+RLHG PV IVSDRD RF SRFWKCL E +GT L FS
Sbjct: 1174 THFLPIKETYSLTKLAKLFVDEIVRLHGAPVSIVSDRDARFTSRFWKCLQEAMGTRLQFS 1233

Query: 362  TAAHPQTDGQSVQTIQTLEDMLRACTLDFQGS 457
            TA HPQTDGQS +TIQTLEDMLR+C L  + S
Sbjct: 1234 TAFHPQTDGQSERTIQTLEDMLRSCVLQMKDS 1265



 Score =  124 bits (312), Expect(2) = 2e-89
 Identities = 66/126 (52%), Positives = 84/126 (66%), Gaps = 1/126 (0%)
 Frame = +1

Query: 460  NSHLPLAEFAYNNSYQATIGMAPYEALYGRKCRSPLHW-EIDDQWXXXXXXXXXXXXXID 636
            ++HL L EFAYNNSY A+I MAPYEALYGR+CR+P+ W E+ D+              +D
Sbjct: 1267 DTHLALVEFAYNNSYHASIKMAPYEALYGRQCRTPICWNEVGDK----------KLEKVD 1316

Query: 637  LVQEIMEKVRLIKRWIRTA*SRQKSYTDKRKKDLEFEVGDEVCLRLSPQKGLINPKKGDK 816
             +Q   EKV++IK  ++ A  RQKSY D R KDLEF VGD V L+LSP KG++   K  K
Sbjct: 1317 SIQATTEKVKMIKEKLKIAQDRQKSYADNRSKDLEFAVGDWVFLKLSPWKGVMRFGKRGK 1376

Query: 817  LSPRYV 834
            LSPRY+
Sbjct: 1377 LSPRYI 1382


>ref|XP_008244885.1| PREDICTED: uncharacterized protein LOC103342989 [Prunus mume]
          Length = 1162

 Score =  236 bits (602), Expect(2) = 5e-89
 Identities = 108/152 (71%), Positives = 129/152 (84%)
 Frame = +2

Query: 2    RCLTCQRVKAEHQRPSGLLQPLRIPEWKWEHITMDFVTDLPRTRRGDDAIWVVVDRLTKS 181
            +CLTCQ+VKAEHQ+PSG LQPL + EWKW+HITMDFVT LPR+ +G DAIWV+VDRLTKS
Sbjct: 615  KCLTCQQVKAEHQKPSGSLQPLPVAEWKWDHITMDFVTGLPRSPKGRDAIWVIVDRLTKS 674

Query: 182  AHFLPMKKTDSLQKLAQIYIREIIRLHGVPVDIVSDRDPRFVSRFWKCLHEELGTSLSFS 361
            AHFLP+K T+S + L ++Y+REI+RLHG+PV IVSDRD +F S+FW  L + LGT L+FS
Sbjct: 675  AHFLPVKTTESTENLGKLYVREIVRLHGIPVSIVSDRDSKFTSKFWGSLQKALGTQLNFS 734

Query: 362  TAAHPQTDGQSVQTIQTLEDMLRACTLDFQGS 457
            TA HPQTDGQS +TIQ LEDMLRAC LDF GS
Sbjct: 735  TAFHPQTDGQSERTIQILEDMLRACILDFGGS 766



 Score =  120 bits (302), Expect(2) = 5e-89
 Identities = 67/133 (50%), Positives = 86/133 (64%), Gaps = 1/133 (0%)
 Frame = +1

Query: 439  IGFPRKLNSHLPLAEFAYNNSYQATIGMAPYEALYGRKCRSPLHW-EIDDQWXXXXXXXX 615
            + F      HL LAEFAYNNSYQ++I MAPYEALYGR CRSP+ W E+ +          
Sbjct: 761  LDFGGSWEDHLILAEFAYNNSYQSSIQMAPYEALYGRPCRSPVCWTEVGETVLLGP---- 816

Query: 616  XXXXXIDLVQEIMEKVRLIKRWIRTA*SRQKSYTDKRKKDLEFEVGDEVCLRLSPQKGLI 795
                  DLVQE  EKV+LIK  + TA SRQK+Y D+R+K L F VGD V L++SP++G+ 
Sbjct: 817  ------DLVQETTEKVKLIKEHLLTAQSRQKNYADRRRKPLSFNVGDYVFLKVSPRRGVK 870

Query: 796  NPKKGDKLSPRYV 834
               K  KL+PR++
Sbjct: 871  RFGKTGKLAPRFI 883


>ref|XP_012829796.1| PREDICTED: uncharacterized protein LOC105950954 [Erythranthe guttata]
          Length = 1316

 Score =  236 bits (601), Expect(2) = 7e-88
 Identities = 107/151 (70%), Positives = 127/151 (84%)
 Frame = +2

Query: 5    CLTCQRVKAEHQRPSGLLQPLRIPEWKWEHITMDFVTDLPRTRRGDDAIWVVVDRLTKSA 184
            CL CQ++K EHQRP GLLQ   IPEWKWE +TMDFV   P+T +G D+IWV+VDRLTKSA
Sbjct: 866  CLICQQIKTEHQRPGGLLQSNHIPEWKWESVTMDFVQGFPKTLKGSDSIWVIVDRLTKSA 925

Query: 185  HFLPMKKTDSLQKLAQIYIREIIRLHGVPVDIVSDRDPRFVSRFWKCLHEELGTSLSFST 364
            HFLP+K T SL+KLA++YI EI+RLHGVP+ I+SDRDPRF S+FWK LHE +GT LSFST
Sbjct: 926  HFLPVKTTFSLEKLAELYIGEIVRLHGVPISIISDRDPRFTSKFWKRLHEAMGTRLSFST 985

Query: 365  AAHPQTDGQSVQTIQTLEDMLRACTLDFQGS 457
            A HPQTDGQS +TI+TLEDMLRAC +DF G+
Sbjct: 986  AYHPQTDGQSERTIKTLEDMLRACIMDFGGN 1016



 Score =  117 bits (293), Expect(2) = 7e-88
 Identities = 57/124 (45%), Positives = 83/124 (66%)
 Frame = +1

Query: 463  SHLPLAEFAYNNSYQATIGMAPYEALYGRKCRSPLHWEIDDQWXXXXXXXXXXXXXIDLV 642
            S LPL EF+YNNS+Q++IGMAPYEALYGRKC SP+HW+                   +LV
Sbjct: 1019 SRLPLIEFSYNNSFQSSIGMAPYEALYGRKCHSPIHWD---------EVGERRLLGPELV 1069

Query: 643  QEIMEKVRLIKRWIRTA*SRQKSYTDKRKKDLEFEVGDEVCLRLSPQKGLINPKKGDKLS 822
            Q  ++ ++ I+  +RTA  RQ+ Y +KR+++LEF+ GD V L+++P KG++   K  KLS
Sbjct: 1070 QHTVDIIKNIREKMRTAQDRQQKYANKRRRELEFQAGDHVFLKVAPLKGIMRFGKRGKLS 1129

Query: 823  PRYV 834
            PR++
Sbjct: 1130 PRFI 1133


>ref|XP_011085927.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC105167812
            [Sesamum indicum]
          Length = 980

 Score =  227 bits (578), Expect(2) = 9e-88
 Identities = 108/152 (71%), Positives = 126/152 (82%)
 Frame = +2

Query: 2    RCLTCQRVKAEHQRPSGLLQPLRIPEWKWEHITMDFVTDLPRTRRGDDAIWVVVDRLTKS 181
            +C+TCQ+VKAEHQ P+G L+PL IPEWKWE ITMDFV  LPR  R  DAIWV+VDRLTKS
Sbjct: 629  KCMTCQQVKAEHQGPTGKLRPLLIPEWKWEKITMDFVVGLPRIFRKHDAIWVIVDRLTKS 688

Query: 182  AHFLPMKKTDSLQKLAQIYIREIIRLHGVPVDIVSDRDPRFVSRFWKCLHEELGTSLSFS 361
            AHFLP++ TDSL KLA +YI EI+RLHGVP+ IVS RDPRF SRF + L   LGT L FS
Sbjct: 689  AHFLPVRITDSLIKLAGLYISEIVRLHGVPISIVSXRDPRFTSRFLESLQRALGTKLHFS 748

Query: 362  TAAHPQTDGQSVQTIQTLEDMLRACTLDFQGS 457
            TA HPQTDGQS +TIQTLEDM+RACT++F+G+
Sbjct: 749  TAFHPQTDGQSERTIQTLEDMMRACTMEFKGN 780



 Score =  125 bits (315), Expect(2) = 9e-88
 Identities = 57/130 (43%), Positives = 91/130 (70%)
 Frame = +1

Query: 445  FPRKLNSHLPLAEFAYNNSYQATIGMAPYEALYGRKCRSPLHWEIDDQWXXXXXXXXXXX 624
            F    + HLPL EFAYNNS+ ++IGMAPYEALYGR+CRSP+ W+I+              
Sbjct: 777  FKGNWDDHLPLMEFAYNNSFHSSIGMAPYEALYGRRCRSPICWDIEG------------- 823

Query: 625  XXIDLVQEIMEKVRLIKRWIRTA*SRQKSYTDKRKKDLEFEVGDEVCLRLSPQKGLINPK 804
                 +++ +EKV+++K+ ++ A  RQKSY D+ ++++E+EVGD+V L++SP +G++  +
Sbjct: 824  -----LRQTVEKVQVVKKCLKAAQDRQKSYVDQHRREMEYEVGDKVFLKISPWRGILRFR 878

Query: 805  KGDKLSPRYV 834
            + +KLSPRY+
Sbjct: 879  RQEKLSPRYI 888


>emb|CAA73042.1| polyprotein [Ananas comosus]
          Length = 871

 Score =  224 bits (571), Expect(2) = 3e-87
 Identities = 105/151 (69%), Positives = 125/151 (82%)
 Frame = +2

Query: 2    RCLTCQRVKAEHQRPSGLLQPLRIPEWKWEHITMDFVTDLPRTRRGDDAIWVVVDRLTKS 181
            +CLTCQ+VKAEH+ P+G LQ L IP WKWE ITMDFVT LPR++ G DAIWV+VDRLTKS
Sbjct: 568  KCLTCQQVKAEHRVPAGKLQSLPIPVWKWEKITMDFVTGLPRSQAGHDAIWVIVDRLTKS 627

Query: 182  AHFLPMKKTDSLQKLAQIYIREIIRLHGVPVDIVSDRDPRFVSRFWKCLHEELGTSLSFS 361
            AHF+P+  T + ++LAQ+Y+ EI+RLHGVP  IVSDRD RFVS FW+ L + LGT L FS
Sbjct: 628  AHFIPIHTTWTGERLAQVYLDEIVRLHGVPTSIVSDRDTRFVSHFWRSLQDALGTRLDFS 687

Query: 362  TAAHPQTDGQSVQTIQTLEDMLRACTLDFQG 454
            TA HPQ+DGQS +TIQTLEDMLRAC +DFQG
Sbjct: 688  TAFHPQSDGQSERTIQTLEDMLRACVIDFQG 718



 Score =  127 bits (318), Expect(2) = 3e-87
 Identities = 67/133 (50%), Positives = 89/133 (66%), Gaps = 1/133 (0%)
 Frame = +1

Query: 439  IGFPRKLNSHLPLAEFAYNNSYQATIGMAPYEALYGRKCRSPLHW-EIDDQWXXXXXXXX 615
            I F    + HLP+AEFAYNNSYQA+I MAP+EALYGRKCRSPLHW E+ +          
Sbjct: 714  IDFQGGWSQHLPMAEFAYNNSYQASIKMAPFEALYGRKCRSPLHWSEVGESLALGP---- 769

Query: 616  XXXXXIDLVQEIMEKVRLIKRWIRTA*SRQKSYTDKRKKDLEFEVGDEVCLRLSPQKGLI 795
                  D++QE   KVR+ +  + TA SRQ+SY D+R++DLEF+VGD V L++SP +G+ 
Sbjct: 770  ------DVLQEAEVKVRIARERLLTAQSRQRSYADRRRRDLEFQVGDHVFLKVSPTRGIK 823

Query: 796  NPKKGDKLSPRYV 834
                  KLSPR++
Sbjct: 824  RFGIRGKLSPRFI 836


>ref|XP_007049837.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
           gi|508702098|gb|EOX93994.1| DNA/RNA polymerases
           superfamily protein [Theobroma cacao]
          Length = 811

 Score =  235 bits (600), Expect(2) = 4e-87
 Identities = 108/152 (71%), Positives = 127/152 (83%)
 Frame = +2

Query: 2   RCLTCQRVKAEHQRPSGLLQPLRIPEWKWEHITMDFVTDLPRTRRGDDAIWVVVDRLTKS 181
           +CLTCQ++KAEHQ+ SG LQPL IPEWKWEH+TMDFV  LPRT+ G DAIWV+VDRLTKS
Sbjct: 542 KCLTCQQIKAEHQKSSGTLQPLPIPEWKWEHVTMDFVLGLPRTQSGKDAIWVIVDRLTKS 601

Query: 182 AHFLPMKKTDSLQKLAQIYIREIIRLHGVPVDIVSDRDPRFVSRFWKCLHEELGTSLSFS 361
           AHFL +  T S+++LA++YI E++RLHGVP+ IVSDRDPRF SRFW    E LGT L FS
Sbjct: 602 AHFLAIHSTYSIERLARLYIDEVVRLHGVPISIVSDRDPRFTSRFWPKFQEALGTKLRFS 661

Query: 362 TAAHPQTDGQSVQTIQTLEDMLRACTLDFQGS 457
           T+ HPQTDGQS +TIQTLEDMLRAC +DF GS
Sbjct: 662 TSFHPQTDGQSERTIQTLEDMLRACVIDFIGS 693



 Score =  115 bits (288), Expect(2) = 4e-87
 Identities = 58/117 (49%), Positives = 79/117 (67%)
 Frame = +1

Query: 439  IGFPRKLNSHLPLAEFAYNNSYQATIGMAPYEALYGRKCRSPLHWEIDDQWXXXXXXXXX 618
            I F    + HLPL EFAYNNS+Q++IGMAPYEALYGRKCR+PL W+              
Sbjct: 688  IDFIGSWDRHLPLVEFAYNNSFQSSIGMAPYEALYGRKCRTPLCWD---------EVGER 738

Query: 619  XXXXIDLVQEIMEKVRLIKRWIRTA*SRQKSYTDKRKKDLEFEVGDEVCLRLSPQKG 789
                ++L+    +KV++I+  ++T   RQK+Y+DKR+KDLEFEV D+V L++SP KG
Sbjct: 739  KLVNVELIDLTNDKVKVIQERLKTTQDRQKNYSDKRRKDLEFEVDDKVFLKVSPWKG 795


>ref|XP_012487705.1| PREDICTED: uncharacterized protein LOC105800880, partial [Gossypium
            raimondii]
          Length = 1085

 Score =  228 bits (582), Expect(2) = 5e-87
 Identities = 104/152 (68%), Positives = 125/152 (82%)
 Frame = +2

Query: 2    RCLTCQRVKAEHQRPSGLLQPLRIPEWKWEHITMDFVTDLPRTRRGDDAIWVVVDRLTKS 181
            RCL CQ+VKAEHQ P+GLLQP+ IPEWKWEH+TMDFV+ LP T +  D+IWV+VDRLTKS
Sbjct: 693  RCLICQQVKAEHQVPTGLLQPIMIPEWKWEHVTMDFVSGLPVTPKKKDSIWVIVDRLTKS 752

Query: 182  AHFLPMKKTDSLQKLAQIYIREIIRLHGVPVDIVSDRDPRFVSRFWKCLHEELGTSLSFS 361
            AHF+P++    L+KLA++Y+ EI+RLHGVP+ I+SDRDPRF SRFW  L E LGT L+FS
Sbjct: 753  AHFIPVRTDYQLEKLAELYVSEIVRLHGVPISIISDRDPRFTSRFWSKLQEALGTKLNFS 812

Query: 362  TAAHPQTDGQSVQTIQTLEDMLRACTLDFQGS 457
            TA HPQTDGQS + IQ LEDMLR C L+F GS
Sbjct: 813  TAFHPQTDGQSERVIQILEDMLRCCILEFGGS 844



 Score =  122 bits (305), Expect(2) = 5e-87
 Identities = 61/124 (49%), Positives = 86/124 (69%), Gaps = 1/124 (0%)
 Frame = +1

Query: 466  HLPLAEFAYNNSYQATIGMAPYEALYGRKCRSPLHW-EIDDQWXXXXXXXXXXXXXIDLV 642
            +LPLAEFAYNNSYQ +I MAP+EALYGRKCR+PL+W E+ +               +DL+
Sbjct: 848  YLPLAEFAYNNSYQTSIKMAPFEALYGRKCRTPLYWTELSES----------KLVGVDLI 897

Query: 643  QEIMEKVRLIKRWIRTA*SRQKSYTDKRKKDLEFEVGDEVCLRLSPQKGLINPKKGDKLS 822
            +E  EKVR+I+  ++ A  RQKSY D +++D+EF VGD V L++SP K ++   +  KLS
Sbjct: 898  RETEEKVRIIRDCLKAASDRQKSYADLKRRDIEFSVGDRVFLKVSPWKKVLRFGRKGKLS 957

Query: 823  PRYV 834
            PR++
Sbjct: 958  PRFI 961


>ref|XP_007050046.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
            gi|508702307|gb|EOX94203.1| DNA/RNA polymerases
            superfamily protein [Theobroma cacao]
          Length = 1336

 Score =  234 bits (597), Expect(2) = 1e-86
 Identities = 110/152 (72%), Positives = 125/152 (82%)
 Frame = +2

Query: 2    RCLTCQRVKAEHQRPSGLLQPLRIPEWKWEHITMDFVTDLPRTRRGDDAIWVVVDRLTKS 181
            +CL CQ+VKAEHQRP+G LQ L +PEWKWEH+TMDFV  L RT+RG D IWV+VD+LTKS
Sbjct: 1046 KCLICQQVKAEHQRPAGTLQSLPVPEWKWEHVTMDFVLGLSRTQRGKDVIWVIVDQLTKS 1105

Query: 182  AHFLPMKKTDSLQKLAQIYIREIIRLHGVPVDIVSDRDPRFVSRFWKCLHEELGTSLSFS 361
            AHFL +  T S++KLAQ+YI EI+RLHGVPV IVSDRDPRF SRFW    E LGT L FS
Sbjct: 1106 AHFLAVHSTYSIEKLAQLYIDEIVRLHGVPVSIVSDRDPRFTSRFWPKFQEALGTKLKFS 1165

Query: 362  TAAHPQTDGQSVQTIQTLEDMLRACTLDFQGS 457
            TA HPQTDGQS +TIQTLEDMLRAC +DF GS
Sbjct: 1166 TAFHPQTDGQSERTIQTLEDMLRACVIDFIGS 1197



 Score =  115 bits (287), Expect(2) = 1e-86
 Identities = 57/117 (48%), Positives = 78/117 (66%)
 Frame = +1

Query: 439  IGFPRKLNSHLPLAEFAYNNSYQATIGMAPYEALYGRKCRSPLHWEIDDQWXXXXXXXXX 618
            I F    + HLPL EFAYNNS+Q++IGMAPYEALYGRKCR+PL W+              
Sbjct: 1192 IDFIGSWDRHLPLVEFAYNNSFQSSIGMAPYEALYGRKCRTPLCWD---------EVGER 1242

Query: 619  XXXXIDLVQEIMEKVRLIKRWIRTA*SRQKSYTDKRKKDLEFEVGDEVCLRLSPQKG 789
                + L++   +K+++I+  ++ A  RQKSY DKR+KDLEFE+ D+V L++SP KG
Sbjct: 1243 KLVSVKLIELTNDKIKVIRERLKVAQDRQKSYADKRRKDLEFEIDDKVFLKVSPWKG 1299


>ref|XP_007198824.1| hypothetical protein PRUPE_ppb020037mg [Prunus persica]
            gi|462394119|gb|EMJ00023.1| hypothetical protein
            PRUPE_ppb020037mg [Prunus persica]
          Length = 1279

 Score =  231 bits (590), Expect(2) = 1e-86
 Identities = 111/151 (73%), Positives = 123/151 (81%)
 Frame = +2

Query: 2    RCLTCQRVKAEHQRPSGLLQPLRIPEWKWEHITMDFVTDLPRTRRGDDAIWVVVDRLTKS 181
            RCL CQ+VKAE Q+PSGLLQPL IPEWKWE ITMDFV  LPRT    D +WV+VDRLTKS
Sbjct: 929  RCLICQQVKAERQKPSGLLQPLPIPEWKWERITMDFVFKLPRTHSKHDGVWVIVDRLTKS 988

Query: 182  AHFLPMKKTDSLQKLAQIYIREIIRLHGVPVDIVSDRDPRFVSRFWKCLHEELGTSLSFS 361
            AHFLP++   SL KLA+I+I EI+RLHGVPV IVSDRDPRF SRFW  L+E  GT L FS
Sbjct: 989  AHFLPVRANYSLNKLAKIFIDEIVRLHGVPVSIVSDRDPRFTSRFWTKLNEAFGTQLQFS 1048

Query: 362  TAAHPQTDGQSVQTIQTLEDMLRACTLDFQG 454
            TA HPQTDGQS +TIQTLEDMLRAC L F+G
Sbjct: 1049 TAFHPQTDGQSERTIQTLEDMLRACALQFRG 1079



 Score =  117 bits (294), Expect(2) = 1e-86
 Identities = 59/122 (48%), Positives = 82/122 (67%)
 Frame = +1

Query: 469  LPLAEFAYNNSYQATIGMAPYEALYGRKCRSPLHWEIDDQWXXXXXXXXXXXXXIDLVQE 648
            LPL EFAYNNSYQ +IGM+P++ALYGR+CR+P +W+   +                 V+ 
Sbjct: 1085 LPLMEFAYNNSYQVSIGMSPFDALYGRQCRTPFYWDAVGEHRLVVSED---------VEL 1135

Query: 649  IMEKVRLIKRWIRTA*SRQKSYTDKRKKDLEFEVGDEVCLRLSPQKGLINPKKGDKLSPR 828
              ++V++I+  ++TA  RQKSY D R+KDL+FEVGD V L+LSP KG++   K  KLSPR
Sbjct: 1136 TKKQVQIIRERLKTAQDRQKSYADNRRKDLQFEVGDWVFLKLSPWKGVVRFGKRGKLSPR 1195

Query: 829  YV 834
            Y+
Sbjct: 1196 YI 1197


>ref|XP_007221234.1| hypothetical protein PRUPE_ppb019121mg [Prunus persica]
           gi|462417788|gb|EMJ22433.1| hypothetical protein
           PRUPE_ppb019121mg [Prunus persica]
          Length = 552

 Score =  232 bits (591), Expect(2) = 1e-86
 Identities = 111/151 (73%), Positives = 124/151 (82%)
 Frame = +2

Query: 2   RCLTCQRVKAEHQRPSGLLQPLRIPEWKWEHITMDFVTDLPRTRRGDDAIWVVVDRLTKS 181
           RCL CQ+VKAE Q+PSGLLQPL IPEWKWE ITMDFV  LPRT+   D +WV+VDRLTKS
Sbjct: 174 RCLICQQVKAERQKPSGLLQPLPIPEWKWERITMDFVFKLPRTQSKHDGVWVIVDRLTKS 233

Query: 182 AHFLPMKKTDSLQKLAQIYIREIIRLHGVPVDIVSDRDPRFVSRFWKCLHEELGTSLSFS 361
           AHFLP++   SL KLA+I+I EI+RLHGVPV IVSDRDPRF SRFW  L+E  GT L FS
Sbjct: 234 AHFLPVRANYSLNKLAKIFIDEIVRLHGVPVSIVSDRDPRFTSRFWTKLNEAFGTQLQFS 293

Query: 362 TAAHPQTDGQSVQTIQTLEDMLRACTLDFQG 454
           TA HPQTDGQS +TIQTLEDMLRAC L F+G
Sbjct: 294 TAFHPQTDGQSERTIQTLEDMLRACALQFRG 324



 Score =  117 bits (292), Expect(2) = 1e-86
 Identities = 59/122 (48%), Positives = 82/122 (67%)
 Frame = +1

Query: 469 LPLAEFAYNNSYQATIGMAPYEALYGRKCRSPLHWEIDDQWXXXXXXXXXXXXXIDLVQE 648
           LPL EFAYNNSYQ +IGM+P++ALYGR+CR+P +W+                   + V+ 
Sbjct: 330 LPLMEFAYNNSYQVSIGMSPFDALYGRQCRTPFYWD---------EVGEHRLVVSEDVKL 380

Query: 649 IMEKVRLIKRWIRTA*SRQKSYTDKRKKDLEFEVGDEVCLRLSPQKGLINPKKGDKLSPR 828
             ++V++I+  ++TA  RQKSY D R+KDL+FEVGD V L+LSP KG++   K  KLSPR
Sbjct: 381 TKKQVQIIRERLKTAQDRQKSYADNRRKDLQFEVGDWVFLKLSPWKGVVRFGKRGKLSPR 440

Query: 829 YV 834
           Y+
Sbjct: 441 YI 442


>ref|XP_007220718.1| hypothetical protein PRUPE_ppa022673mg [Prunus persica]
            gi|462417180|gb|EMJ21917.1| hypothetical protein
            PRUPE_ppa022673mg [Prunus persica]
          Length = 1506

 Score =  231 bits (588), Expect(2) = 2e-86
 Identities = 111/151 (73%), Positives = 123/151 (81%)
 Frame = +2

Query: 2    RCLTCQRVKAEHQRPSGLLQPLRIPEWKWEHITMDFVTDLPRTRRGDDAIWVVVDRLTKS 181
            RCL CQ+VKAE Q+PSGLLQPL IPEWKWE ITMDFV  LPRT+   D +WV+VDRLTKS
Sbjct: 1128 RCLICQQVKAERQKPSGLLQPLPIPEWKWERITMDFVCKLPRTQSKHDGVWVIVDRLTKS 1187

Query: 182  AHFLPMKKTDSLQKLAQIYIREIIRLHGVPVDIVSDRDPRFVSRFWKCLHEELGTSLSFS 361
            +HFLP+    SL KLA+I++ EI+RLHGVPV IVSDRDPRF SRFW  LHE  GT L FS
Sbjct: 1188 SHFLPVIANYSLNKLAKIFLDEIMRLHGVPVFIVSDRDPRFTSRFWTKLHEAFGTQLQFS 1247

Query: 362  TAAHPQTDGQSVQTIQTLEDMLRACTLDFQG 454
            TA HPQTDGQS +TIQTLEDMLRAC L FQG
Sbjct: 1248 TAFHPQTDGQSERTIQTLEDMLRACALQFQG 1278



 Score =  117 bits (293), Expect(2) = 2e-86
 Identities = 59/122 (48%), Positives = 82/122 (67%)
 Frame = +1

Query: 469  LPLAEFAYNNSYQATIGMAPYEALYGRKCRSPLHWEIDDQWXXXXXXXXXXXXXIDLVQE 648
            LPL EFAYNNSYQ +IGM+P++ALYGR+CR+P +W+                   + V+ 
Sbjct: 1284 LPLMEFAYNNSYQVSIGMSPFDALYGRQCRTPFYWD---------EVGEHRLVVSEDVEL 1334

Query: 649  IMEKVRLIKRWIRTA*SRQKSYTDKRKKDLEFEVGDEVCLRLSPQKGLINPKKGDKLSPR 828
              ++V++I+  ++TA  RQKSY D R+KDL+FEVGD V L+LSP KG++   K  KLSPR
Sbjct: 1335 TKKQVQIIRERLKTAQDRQKSYADNRRKDLQFEVGDWVFLKLSPWKGVVRFGKRGKLSPR 1394

Query: 829  YV 834
            Y+
Sbjct: 1395 YI 1396


>ref|XP_015960510.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC107484454
            [Arachis duranensis]
          Length = 1333

 Score =  223 bits (568), Expect(2) = 3e-86
 Identities = 102/151 (67%), Positives = 122/151 (80%)
 Frame = +2

Query: 5    CLTCQRVKAEHQRPSGLLQPLRIPEWKWEHITMDFVTDLPRTRRGDDAIWVVVDRLTKSA 184
            CLTCQ+VKAEHQRP+GLLQ + IPEWKWE ITMDFVT LPR+ +G D+IWV+VDR+TKSA
Sbjct: 945  CLTCQQVKAEHQRPAGLLQQIEIPEWKWERITMDFVTGLPRSFKGFDSIWVIVDRMTKSA 1004

Query: 185  HFLPMKKTDSLQKLAQIYIREIIRLHGVPVDIVSDRDPRFVSRFWKCLHEELGTSLSFST 364
            HFLP+K T S  + AQ+Y+ EI++LHG+PV I+SDR P+F S FWK   + LGT L  ST
Sbjct: 1005 HFLPVKTTFSAARYAQLYVDEIVKLHGIPVSIISDRGPQFTSHFWKSFQKALGTRLDLST 1064

Query: 365  AAHPQTDGQSVQTIQTLEDMLRACTLDFQGS 457
            A HPQTDGQS +TIQ LEDMLR C LDF G+
Sbjct: 1065 AFHPQTDGQSERTIQILEDMLRCCVLDFGGN 1095



 Score =  124 bits (312), Expect(2) = 3e-86
 Identities = 65/133 (48%), Positives = 91/133 (68%), Gaps = 1/133 (0%)
 Frame = +1

Query: 439  IGFPRKLNSHLPLAEFAYNNSYQATIGMAPYEALYGRKCRSPLHW-EIDDQWXXXXXXXX 615
            + F    +S+LPL EF+YNNSYQA+I MAP+EALYGR+CRSP+ W E+ +          
Sbjct: 1090 LDFGGNWDSYLPLIEFSYNNSYQASIQMAPFEALYGRRCRSPIGWFEVGE---------- 1139

Query: 616  XXXXXIDLVQEIMEKVRLIKRWIRTA*SRQKSYTDKRKKDLEFEVGDEVCLRLSPQKGLI 795
                  +LVQ+ +EKVR+I+  +  A SRQK+Y D R+++LEF VGD+V L++SP KG++
Sbjct: 1140 VKLLGPNLVQDAVEKVRIIRERLLAAQSRQKAYVDNRRRNLEFSVGDQVFLKVSPMKGVM 1199

Query: 796  NPKKGDKLSPRYV 834
               K  KLSPRY+
Sbjct: 1200 RFGKRGKLSPRYI 1212


>ref|XP_015944834.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC107469968
            [Arachis duranensis]
          Length = 1201

 Score =  229 bits (583), Expect(2) = 7e-86
 Identities = 106/152 (69%), Positives = 124/152 (81%)
 Frame = +2

Query: 2    RCLTCQRVKAEHQRPSGLLQPLRIPEWKWEHITMDFVTDLPRTRRGDDAIWVVVDRLTKS 181
            +CLTCQ+VK EHQ+PSG LQPL IP+WKWE ITMDFV  LPRT  G DAIWV+VD LTKS
Sbjct: 791  KCLTCQKVKVEHQKPSGTLQPLEIPQWKWEQITMDFVMGLPRTSTGHDAIWVIVDMLTKS 850

Query: 182  AHFLPMKKTDSLQKLAQIYIREIIRLHGVPVDIVSDRDPRFVSRFWKCLHEELGTSLSFS 361
            AHFLP++   +L++LA+IYI+EI+RLHG+P  IVSDRDPRF SRFW    + LGT L  S
Sbjct: 851  AHFLPIRVDYTLERLARIYIQEIVRLHGIPSSIVSDRDPRFTSRFWGAFQKALGTELHMS 910

Query: 362  TAAHPQTDGQSVQTIQTLEDMLRACTLDFQGS 457
            TA HPQTDGQS +TIQTLEDMLR+C +D QGS
Sbjct: 911  TAYHPQTDGQSERTIQTLEDMLRSCVMDNQGS 942



 Score =  117 bits (294), Expect(2) = 7e-86
 Identities = 61/125 (48%), Positives = 81/125 (64%)
 Frame = +1

Query: 460  NSHLPLAEFAYNNSYQATIGMAPYEALYGRKCRSPLHWEIDDQWXXXXXXXXXXXXXIDL 639
            + +LPL EF YNNSYQ +I MAPYEALYGR+C++PL W  D +               DL
Sbjct: 944  DKYLPLVEFVYNNSYQQSIEMAPYEALYGRRCQTPLCWNDDGEASVLGP---------DL 994

Query: 640  VQEIMEKVRLIKRWIRTA*SRQKSYTDKRKKDLEFEVGDEVCLRLSPQKGLINPKKGDKL 819
            VQE  EK++ I++ I+TA SRQKSY D R++ LEF  GD V L+++P  G+    K  KL
Sbjct: 995  VQETTEKIKGIRQKIQTAQSRQKSYADNRRRPLEFSEGDHVFLKVTPTTGIGRALKTKKL 1054

Query: 820  SPRYV 834
            +PRY+
Sbjct: 1055 NPRYI 1059


>ref|XP_007099735.1| Uncharacterized protein TCM_045699 [Theobroma cacao]
           gi|508728383|gb|EOY20280.1| Uncharacterized protein
           TCM_045699 [Theobroma cacao]
          Length = 415

 Score =  221 bits (564), Expect(2) = 7e-86
 Identities = 102/148 (68%), Positives = 121/148 (81%)
 Frame = +2

Query: 2   RCLTCQRVKAEHQRPSGLLQPLRIPEWKWEHITMDFVTDLPRTRRGDDAIWVVVDRLTKS 181
           +CL CQ+VKAEHQ+P+GLLQPL +PEWKWEHI MDFVT LPRT  G D+IW+VVDRLTKS
Sbjct: 36  KCLVCQQVKAEHQKPTGLLQPLPVPEWKWEHIAMDFVTGLPRTSGGYDSIWIVVDRLTKS 95

Query: 182 AHFLPMKKTDSLQKLAQIYIREIIRLHGVPVDIVSDRDPRFVSRFWKCLHEELGTSLSFS 361
           AHFL +K T    + A++Y+ EI+RLHG+P+ IVSDR+ +F SRFW  L E LGT L FS
Sbjct: 96  AHFLLVKTTYGAAQYARVYVDEIVRLHGIPISIVSDREAQFTSRFWGKLQEALGTKLDFS 155

Query: 362 TAAHPQTDGQSVQTIQTLEDMLRACTLD 445
           TA HPQTDGQS +TIQTLEDMLRAC +D
Sbjct: 156 TAFHPQTDGQSERTIQTLEDMLRACVID 183



 Score =  125 bits (313), Expect(2) = 7e-86
 Identities = 64/129 (49%), Positives = 88/129 (68%), Gaps = 1/129 (0%)
 Frame = +1

Query: 454 KLNSHLPLAEFAYNNSYQATIGMAPYEALYGRKCRSPLHW-EIDDQWXXXXXXXXXXXXX 630
           K   +LPL EFAYNNS+Q +I MAP+EALYGR+CRSP+ W E+ ++              
Sbjct: 187 KWEQYLPLVEFAYNNSFQTSIQMAPFEALYGRRCRSPIGWLEVGERKLLGP--------- 237

Query: 631 IDLVQEIMEKVRLIKRWIRTA*SRQKSYTDKRKKDLEFEVGDEVCLRLSPQKGLINPKKG 810
            +LVQ+  EK+ +I++ + T  SRQKSY D R++DLEF+VGD V L++SP KG++   K 
Sbjct: 238 -ELVQDATEKIHMIRQKMLTTQSRQKSYADNRRRDLEFQVGDHVFLKVSPTKGVMRFGKK 296

Query: 811 DKLSPRYVR 837
            KLSPRY+R
Sbjct: 297 GKLSPRYIR 305


>emb|CAN66189.1| hypothetical protein VITISV_006047 [Vitis vinifera]
          Length = 1573

 Score =  223 bits (569), Expect(2) = 9e-86
 Identities = 103/151 (68%), Positives = 123/151 (81%)
 Frame = +2

Query: 5    CLTCQRVKAEHQRPSGLLQPLRIPEWKWEHITMDFVTDLPRTRRGDDAIWVVVDRLTKSA 184
            C  CQ+VKAEHQRP+ LLQPL IP+WKW++ITMDFV  LPRTR   + +WV+VDRLTKSA
Sbjct: 1240 CQICQQVKAEHQRPAELLQPLPIPKWKWDNITMDFVIGLPRTRSKKNGVWVIVDRLTKSA 1299

Query: 185  HFLPMKKTDSLQKLAQIYIREIIRLHGVPVDIVSDRDPRFVSRFWKCLHEELGTSLSFST 364
            HFL MK TDS+  LA++YI+EI+RLHG+PV IVSDRDP+F S+FW+ L   LGT L+FST
Sbjct: 1300 HFLAMKTTDSMNSLAKLYIQEIVRLHGIPVSIVSDRDPKFTSQFWQSLQRALGTQLNFST 1359

Query: 365  AAHPQTDGQSVQTIQTLEDMLRACTLDFQGS 457
              HPQTDGQS + IQ LEDMLRAC LDF G+
Sbjct: 1360 VFHPQTDGQSERVIQILEDMLRACVLDFGGN 1390



 Score =  122 bits (307), Expect(2) = 9e-86
 Identities = 64/124 (51%), Positives = 86/124 (69%), Gaps = 1/124 (0%)
 Frame = +1

Query: 466  HLPLAEFAYNNSYQATIGMAPYEALYGRKCRSPLHW-EIDDQWXXXXXXXXXXXXXIDLV 642
            +LPLAEFAYNN YQ++IGMAPYEALYGR CRSPL W E+ +                ++V
Sbjct: 1394 YLPLAEFAYNNXYQSSIGMAPYEALYGRPCRSPLCWIEMGESHLLGP----------EIV 1443

Query: 643  QEIMEKVRLIKRWIRTA*SRQKSYTDKRKKDLEFEVGDEVCLRLSPQKGLINPKKGDKLS 822
            QE  EK++LIK  ++TA  RQK+Y DKR++ LEFE GD V +++SP++G+    K  KL+
Sbjct: 1444 QETTEKIQLIKEKLKTAQDRQKNYADKRRRPLEFEEGDWVFVKVSPRRGIFRFGKKGKLA 1503

Query: 823  PRYV 834
            PR+V
Sbjct: 1504 PRFV 1507


>gb|AAO45752.1| pol protein [Cucumis melo subsp. melo]
          Length = 923

 Score =  216 bits (551), Expect(2) = 9e-86
 Identities = 102/152 (67%), Positives = 120/152 (78%)
 Frame = +2

Query: 2   RCLTCQRVKAEHQRPSGLLQPLRIPEWKWEHITMDFVTDLPRTRRGDDAIWVVVDRLTKS 181
           +CL CQ+VKA  Q+P+GLLQPL IPEWKWE+++MDF+T LPRT RG   IWVVVDRLTKS
Sbjct: 544 KCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKS 603

Query: 182 AHFLPMKKTDSLQKLAQIYIREIIRLHGVPVDIVSDRDPRFVSRFWKCLHEELGTSLSFS 361
           AHF+P K T +  K AQ+Y+ EI+RLHGVPV IVSDRD RF S+FWK L   +GT L FS
Sbjct: 604 AHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFS 663

Query: 362 TAAHPQTDGQSVQTIQTLEDMLRACTLDFQGS 457
           TA HPQTDGQ+ +  Q LEDMLRAC L+F GS
Sbjct: 664 TAFHPQTDGQTERLNQVLEDMLRACALEFPGS 695



 Score =  129 bits (325), Expect(2) = 9e-86
 Identities = 70/131 (53%), Positives = 90/131 (68%), Gaps = 1/131 (0%)
 Frame = +1

Query: 445  FPRKLNSHLPLAEFAYNNSYQATIGMAPYEALYGRKCRSPLHW-EIDDQWXXXXXXXXXX 621
            FP   +SHL L EFAYNNSYQATIGMAP+EALYGR CRSP+ W E+ +Q           
Sbjct: 692  FPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGRCCRSPVCWGEVGEQ----------R 741

Query: 622  XXXIDLVQEIMEKVRLIKRWIRTA*SRQKSYTDKRKKDLEFEVGDEVCLRLSPQKGLINP 801
                +LVQ   E ++ I+  + TA SRQKSY D R+KDLEFEVGD+V L+++P KG++  
Sbjct: 742  LMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMKGVLRF 801

Query: 802  KKGDKLSPRYV 834
            ++  KLSPR+V
Sbjct: 802  ERRGKLSPRFV 812


>ref|XP_007028165.1| Retrotransposon protein, Ty3-gypsy subclass, putative [Theobroma
           cacao] gi|508716770|gb|EOY08667.1| Retrotransposon
           protein, Ty3-gypsy subclass, putative [Theobroma cacao]
          Length = 521

 Score =  224 bits (571), Expect(2) = 1e-85
 Identities = 103/148 (69%), Positives = 121/148 (81%)
 Frame = +2

Query: 2   RCLTCQRVKAEHQRPSGLLQPLRIPEWKWEHITMDFVTDLPRTRRGDDAIWVVVDRLTKS 181
           +CL CQ+VKAEHQ+P+GLLQPL +PEWKWEHI MDFVT LPRT  G D+IW+VVDRLTKS
Sbjct: 142 KCLVCQQVKAEHQKPAGLLQPLPVPEWKWEHIAMDFVTGLPRTSGGYDSIWIVVDRLTKS 201

Query: 182 AHFLPMKKTDSLQKLAQIYIREIIRLHGVPVDIVSDRDPRFVSRFWKCLHEELGTSLSFS 361
           AHFLP+K T    + A++Y+ EI+RLHG+P+ IVSDR  +F SRFW  L E LGT L FS
Sbjct: 202 AHFLPVKTTYGAAQYARVYVDEIVRLHGIPISIVSDRGAQFTSRFWGKLQEALGTKLDFS 261

Query: 362 TAAHPQTDGQSVQTIQTLEDMLRACTLD 445
           TA HPQTDGQS +TIQTLEDMLRAC +D
Sbjct: 262 TAFHPQTDGQSERTIQTLEDMLRACVID 289



 Score =  121 bits (304), Expect(2) = 1e-85
 Identities = 62/124 (50%), Positives = 86/124 (69%), Gaps = 1/124 (0%)
 Frame = +1

Query: 466 HLPLAEFAYNNSYQATIGMAPYEALYGRKCRSPLHW-EIDDQWXXXXXXXXXXXXXIDLV 642
           +LPL EFAYNNS+Q +I MAP+EALYGR+CRSP+ W E+ ++               +LV
Sbjct: 297 YLPLVEFAYNNSFQTSIQMAPFEALYGRRCRSPIGWLEVGERKLLGP----------ELV 346

Query: 643 QEIMEKVRLIKRWIRTA*SRQKSYTDKRKKDLEFEVGDEVCLRLSPQKGLINPKKGDKLS 822
           Q+  EK+ +I++ + TA SR KSY D R++DLEF+VGD V L++SP KG++   K  KLS
Sbjct: 347 QDATEKIHMIRQRMLTAQSRHKSYADNRRRDLEFQVGDHVFLKVSPTKGVMRFGKKGKLS 406

Query: 823 PRYV 834
           PRY+
Sbjct: 407 PRYI 410


>ref|XP_007099730.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
           gi|508728378|gb|EOY20275.1| DNA/RNA polymerases
           superfamily protein [Theobroma cacao]
          Length = 562

 Score =  223 bits (567), Expect(2) = 1e-85
 Identities = 102/148 (68%), Positives = 121/148 (81%)
 Frame = +2

Query: 2   RCLTCQRVKAEHQRPSGLLQPLRIPEWKWEHITMDFVTDLPRTRRGDDAIWVVVDRLTKS 181
           +CL CQ+VKAEHQ+P+GLLQPL  PEWKWEHI MDFVT LPRT  G D+IW+V+DRLTKS
Sbjct: 249 KCLVCQQVKAEHQKPAGLLQPLPAPEWKWEHIAMDFVTGLPRTSGGYDSIWIVMDRLTKS 308

Query: 182 AHFLPMKKTDSLQKLAQIYIREIIRLHGVPVDIVSDRDPRFVSRFWKCLHEELGTSLSFS 361
           AHFLP+K T    + A++Y+ EI+RLHG+P+ IVSDR+ +F SRFW  L E LGT L FS
Sbjct: 309 AHFLPVKTTYGAAQYARVYVDEIVRLHGIPISIVSDREAQFTSRFWGKLQEALGTKLDFS 368

Query: 362 TAAHPQTDGQSVQTIQTLEDMLRACTLD 445
           TA HPQTDGQS +TIQTLEDMLRAC +D
Sbjct: 369 TAFHPQTDGQSERTIQTLEDMLRACVID 396



 Score =  122 bits (307), Expect(2) = 1e-85
 Identities = 63/124 (50%), Positives = 86/124 (69%), Gaps = 1/124 (0%)
 Frame = +1

Query: 466 HLPLAEFAYNNSYQATIGMAPYEALYGRKCRSPLHW-EIDDQWXXXXXXXXXXXXXIDLV 642
           +LPL EFAYNNS+Q +I MAP+EALYGR+CRSP+ W E+ ++               +LV
Sbjct: 404 YLPLVEFAYNNSFQTSIQMAPFEALYGRRCRSPIGWLEVGERKLLGP----------ELV 453

Query: 643 QEIMEKVRLIKRWIRTA*SRQKSYTDKRKKDLEFEVGDEVCLRLSPQKGLINPKKGDKLS 822
           Q+  EK+ +I + + TA SRQKSY D R++DLEF+VGD V L++SP KG++   K  KLS
Sbjct: 454 QDATEKIHMISQKMLTAQSRQKSYADNRRRDLEFQVGDHVFLKVSPTKGVMRFGKKGKLS 513

Query: 823 PRYV 834
           PRY+
Sbjct: 514 PRYI 517

