BLASTX nr result

ID: Jatropha_contig00015797 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Jatropha_contig00015797
         (789 letters)

Database: NCBI-nr (updated 2014/02/11) 
           35,149,712 sequences; 12,374,887,350 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

dbj|BAJ53209.1| JHL23C09.1 [Jatropha curcas]                          349   2e-94
ref|XP_004506193.1| PREDICTED: uncharacterized protein LOC101494...   233   7e-59
ref|XP_004490487.1| PREDICTED: uncharacterized protein LOC101489...   233   7e-59
gb|EOY01054.1| RNA-directed DNA polymerase (Reverse transcriptas...   232   1e-58
gb|EOY30260.1| RNA-directed DNA polymerase (Reverse transcriptas...   231   3e-58
gb|EOY15970.1| RNA-directed DNA polymerase, putative [Theobroma ...   230   3e-58
gb|EOX94716.1| RNA-directed DNA polymerase (Reverse transcriptas...   230   3e-58
gb|EOY09277.1| RNA-directed DNA polymerase (Reverse transcriptas...   230   5e-58
ref|XP_004510004.1| PREDICTED: uncharacterized protein LOC101511...   229   8e-58
gb|EOY10681.1| Uncharacterized protein TCM_025982 [Theobroma cacao]   226   6e-57
gb|EOY28051.1| RNA-directed DNA polymerase (Reverse transcriptas...   226   8e-57
ref|XP_004490844.1| PREDICTED: uncharacterized protein LOC101509...   225   1e-56
ref|XP_003551524.1| PREDICTED: uncharacterized protein LOC100805...   224   2e-56
ref|XP_003518330.1| PREDICTED: uncharacterized protein LOC100818...   224   2e-56
gb|ABD28291.1| Integrase, catalytic region; Ribonuclease H [Medi...   224   3e-56
ref|XP_003555077.1| PREDICTED: uncharacterized protein LOC100811...   224   3e-56
ref|XP_003553327.1| PREDICTED: uncharacterized protein LOC100814...   224   3e-56
ref|XP_003544290.1| PREDICTED: uncharacterized protein LOC100815...   224   3e-56
ref|XP_003530624.1| PREDICTED: uncharacterized protein LOC100785...   223   4e-56
ref|XP_003522184.1| PREDICTED: uncharacterized protein LOC100786...   223   7e-56

>dbj|BAJ53209.1| JHL23C09.1 [Jatropha curcas]
          Length = 525

 Score =  349 bits (896), Expect(2) = 2e-94
 Identities = 170/199 (85%), Positives = 183/199 (91%)
 Frame = +1

Query: 175 ACIKGLEAALEKEIKVLRVFGDSNLIVSQALRKWKIKEEHLVPYLQRLNELAQQFEDLSF 354
           ACI+GLEAALEKEIK+L+VFGDSNLIVSQALRKWKIKEE LVPYLQRL+ELAQQF+ LSF
Sbjct: 1   ACIRGLEAALEKEIKILKVFGDSNLIVSQALRKWKIKEERLVPYLQRLDELAQQFDGLSF 60

Query: 355 HYLPRAKK*FVDALATLVSMVNVEGNQIIRPFTVRLQKQPAHIMNLVDDKPWFWDVQNYL 534
           HYLPRAK  F DALATL SMVNVE ++IIRP TVRLQKQPAHIMNLVDDKPW+WD+QNYL
Sbjct: 61  HYLPRAKNQFADALATLASMVNVEEDRIIRPLTVRLQKQPAHIMNLVDDKPWYWDIQNYL 120

Query: 535 QNEAYPDGSSKTDQRTSRRLASGYFLTGGVLYKRSWSGLHLRCVDEGEAQTIMDSLHNGE 714
           QNE YP+GS+KTDQRT R+LAS YFLTGGVLYKRSW+GLHLRCVDE EAQTIMDSLHNGE
Sbjct: 121 QNEVYPEGSTKTDQRTLRQLASEYFLTGGVLYKRSWNGLHLRCVDEEEAQTIMDSLHNGE 180

Query: 715 SGPHMHGIALARKS*TWGY 771
           SGPHMHGIALARK    GY
Sbjct: 181 SGPHMHGIALARKIMNLGY 199



 Score = 24.3 bits (51), Expect(2) = 2e-94
 Identities = 9/10 (90%), Positives = 9/10 (90%)
 Frame = +3

Query: 750 KIMNLGLYWS 779
           KIMNLG YWS
Sbjct: 193 KIMNLGYYWS 202


>ref|XP_004506193.1| PREDICTED: uncharacterized protein LOC101494924 [Cicer arietinum]
          Length = 2008

 Score =  233 bits (593), Expect = 7e-59
 Identities = 110/267 (41%), Positives = 170/267 (63%), Gaps = 10/267 (3%)
 Frame = +1

Query: 1    PDVDLLNVENDV-------WEMYFDGASNYHGNGIGVVFKTPCGEYVPIAVKLDFDCTNN 159
            PD D++N+  ++       W + FDGASN  G+GIG +  +P  ++ P   +L FDCTNN
Sbjct: 1415 PDEDIMNLVEEIESSDKEKWRLVFDGASNALGHGIGAILISPENQFTPFTARLCFDCTNN 1474

Query: 160  EAEYEACIKGLEAALEKEIKVLRVFGDSNLIVSQALRKWKIKEEHLVPYLQRLNELAQQF 339
             AEYEAC+ G++AA+E  +K L V+GDS L++ Q    W+ ++  L+PY   + EL +QF
Sbjct: 1475 IAEYEACVMGIKAAIESNVKFLEVYGDSLLVIHQTKGDWETRDSKLIPYHTHIKELTEQF 1534

Query: 340  EDLSFHYLPRAKK*FVDALATLVSMVNVEGNQIIRPFTVRLQKQPAHIMNL---VDDKPW 510
            E ++FH++PR +    DALATL SM  +  NQ +    ++ + +PA+ +++   +D KPW
Sbjct: 1535 EKITFHHIPREENQLADALATLSSMFKITTNQDVPVIKIQQRDKPAYCLSIEEELDGKPW 1594

Query: 511  FWDVQNYLQNEAYPDGSSKTDQRTSRRLASGYFLTGGVLYKRSWSGLHLRCVDEGEAQTI 690
            F+D+++Y++N+ YP G S+ D+R  RRL+  +FL G VLYKR+   + LRCVD+ EA  I
Sbjct: 1595 FYDIKSYVKNKEYPLGISENDKRVLRRLSMNFFLNGDVLYKRNHDMVLLRCVDKAEAGKI 1654

Query: 691  MDSLHNGESGPHMHGIALARKS*TWGY 771
            +  +H G  G H +G  +ARK    GY
Sbjct: 1655 IQEVHEGSFGTHANGHTMARKILRAGY 1681


>ref|XP_004490487.1| PREDICTED: uncharacterized protein LOC101489483 [Cicer arietinum]
          Length = 973

 Score =  233 bits (593), Expect = 7e-59
 Identities = 110/267 (41%), Positives = 170/267 (63%), Gaps = 10/267 (3%)
 Frame = +1

Query: 1    PDVDLLNVENDV-------WEMYFDGASNYHGNGIGVVFKTPCGEYVPIAVKLDFDCTNN 159
            PD D++N+  ++       W + FDGASN  G+GIG +  +P  ++ P   +L FDCTNN
Sbjct: 380  PDEDIMNLVEEIESSDKEKWRLVFDGASNALGHGIGAILISPENQFTPFTARLCFDCTNN 439

Query: 160  EAEYEACIKGLEAALEKEIKVLRVFGDSNLIVSQALRKWKIKEEHLVPYLQRLNELAQQF 339
             AEYEAC+ G++AA+E  +K L V+GDS L++ Q    W+ ++  L+PY   + EL +QF
Sbjct: 440  IAEYEACVMGIKAAIESNVKFLEVYGDSLLVIHQTKGDWETRDSKLIPYHTHIKELTEQF 499

Query: 340  EDLSFHYLPRAKK*FVDALATLVSMVNVEGNQIIRPFTVRLQKQPAHIMNL---VDDKPW 510
            E ++FH++PR +    DALATL SM  +  NQ +    ++ + +PA+ +++   +D KPW
Sbjct: 500  EKITFHHIPREENQLADALATLSSMFKITTNQDVPVIKIQQRDKPAYCLSIEEELDGKPW 559

Query: 511  FWDVQNYLQNEAYPDGSSKTDQRTSRRLASGYFLTGGVLYKRSWSGLHLRCVDEGEAQTI 690
            F+D+++Y++N+ YP G S+ D+R  RRL+  +FL G VLYKR+   + LRCVD+ EA  I
Sbjct: 560  FYDIKSYVKNKEYPLGISENDKRVLRRLSMNFFLNGDVLYKRNHDMVLLRCVDKAEAGKI 619

Query: 691  MDSLHNGESGPHMHGIALARKS*TWGY 771
            +  +H G  G H +G  +ARK    GY
Sbjct: 620  IQEVHEGSFGTHANGHTMARKILRAGY 646


>gb|EOY01054.1| RNA-directed DNA polymerase (Reverse transcriptase), Ribonuclease H
            [Theobroma cacao]
          Length = 1047

 Score =  232 bits (591), Expect = 1e-58
 Identities = 115/268 (42%), Positives = 175/268 (65%), Gaps = 11/268 (4%)
 Frame = +1

Query: 1    PDVDLLNV--------ENDVWEMYFDGASNYHGNGIGVVFKTPCGEYVPIAVKLDFDCTN 156
            PD DL+++        E + W+M+FDGASN  G+GIGVV  +P G++ P+  KL+F CTN
Sbjct: 425  PDEDLMSICQTSGEESEKENWKMFFDGASNALGHGIGVVLVSPEGDHYPVIAKLNFYCTN 484

Query: 157  NEAEYEACIKGLEAALEKEIKVLRVFGDSNLIVSQALRKWKIKEEHLVPYLQRLNELAQQ 336
            N AEYEAC+ G++AA+E++I +L V+GDS L++ Q   +W+ ++  LV Y + +++L + 
Sbjct: 485  NVAEYEACVMGIQAAIERKIHILEVYGDSALVIYQLRGEWETRDSKLVRYHKYVSKLIEN 544

Query: 337  FEDLSFHYLPRAKK*FVDALATLVSMVNVEGNQIIRPFTVRLQKQPAHIMNL---VDDKP 507
            F+++ F++LPR +    DALA L +M  V  N  I+P  + L++ PAH  ++   +D KP
Sbjct: 545  FDEICFNHLPREENQMADALAMLAAMFKVGTNVKIQPIMINLRECPAHCFSVEEEIDGKP 604

Query: 508  WFWDVQNYLQNEAYPDGSSKTDQRTSRRLASGYFLTGGVLYKRSWSGLHLRCVDEGEAQT 687
            W+ D+ +YL+ + YPD SS+ D++T RRLA  +FL G +LYKRS     LRCVD  EA+ 
Sbjct: 605  WYHDIVHYLKFQQYPDQSSENDKKTIRRLAMNFFLDGNILYKRSRDQTLLRCVDSTEARR 664

Query: 688  IMDSLHNGESGPHMHGIALARKS*TWGY 771
            I++ +H G  G H  G  LAR+    GY
Sbjct: 665  IVEEVHEGVCGAHASGHKLARQVMRAGY 692


>gb|EOY30260.1| RNA-directed DNA polymerase (Reverse transcriptase), Ribonuclease
           H, putative [Theobroma cacao]
          Length = 508

 Score =  231 bits (588), Expect = 3e-58
 Identities = 115/267 (43%), Positives = 175/267 (65%), Gaps = 11/267 (4%)
 Frame = +1

Query: 4   DVDLLNV--------ENDVWEMYFDGASNYHGNGIGVVFKTPCGEYVPIAVKLDFDCTNN 159
           D DL+++        E + W+M+FDGASN  G+GIGVV  +P G++ P+  KL+F CTNN
Sbjct: 15  DEDLMSICQTSGEESEKENWKMFFDGASNALGHGIGVVLVSPEGDHYPVIAKLNFYCTNN 74

Query: 160 EAEYEACIKGLEAALEKEIKVLRVFGDSNLIVSQALRKWKIKEEHLVPYLQRLNELAQQF 339
            AEYEAC+ G++AA+E++I +L V+ DS L++ Q  R+W+ ++  LV Y + +++L + F
Sbjct: 75  VAEYEACVMGIQAAIERKIHILEVYEDSALVIYQLRREWETRDSKLVRYHKYVSKLVENF 134

Query: 340 EDLSFHYLPRAKK*FVDALATLVSMVNVEGNQIIRPFTVRLQKQPAHIMNL---VDDKPW 510
           +++ F++LPR +    DALATL +M  V  N  I+P  + L++ PAH  ++   +D KPW
Sbjct: 135 DEICFNHLPREENQMADALATLAAMFKVGTNVKIQPIMINLRECPAHCSSVEEEIDGKPW 194

Query: 511 FWDVQNYLQNEAYPDGSSKTDQRTSRRLASGYFLTGGVLYKRSWSGLHLRCVDEGEAQTI 690
           + D+ +YL+ + YPD SS+ D++T RRLA  +FL G +LYKRS     LRCVD  EA+ I
Sbjct: 195 YHDIVHYLKFQQYPDQSSENDKKTIRRLAMNFFLDGNILYKRSRDQTLLRCVDSTEARRI 254

Query: 691 MDSLHNGESGPHMHGIALARKS*TWGY 771
           ++ +H G  G H  G  LAR+    GY
Sbjct: 255 VEEVHEGICGAHASGHKLARQVMRAGY 281


>gb|EOY15970.1| RNA-directed DNA polymerase, putative [Theobroma cacao]
          Length = 1685

 Score =  230 bits (587), Expect = 3e-58
 Identities = 116/269 (43%), Positives = 175/269 (65%), Gaps = 12/269 (4%)
 Frame = +1

Query: 1    PDVDL---LNVEN------DVWEMYFDGASNYHGNGIGVVFKTPCGEYVPIAVKLDFDCT 153
            PD DL   L++E       + W++YFDGASN  G+ IG V  +P G+Y P   +L+F+CT
Sbjct: 1058 PDEDLMAFLHIEEVSPNELNPWKVYFDGASNALGHEIGAVLISPNGKYYPATTRLNFNCT 1117

Query: 154  NNEAEYEACIKGLEAALEKEIKVLRVFGDSNLIVSQALRKWKIKEEHLVPYLQRLNELAQ 333
            NN AEYEA + GL+AA+E ++  + V+GDS L++ Q   +W+ ++  LVPY + + EL++
Sbjct: 1118 NNMAEYEALVMGLQAAIEMKVDAIDVYGDSALVICQIKGEWETRDPKLVPYKKLVTELSK 1177

Query: 334  QFEDLSFHYLPRAKK*FVDALATLVSMVNVEGNQIIRPFTVRLQKQPAHIMNL---VDDK 504
            QF+++SF++LPR +    DALATL +M  ++    +RPF + +++  AH +N+   VD K
Sbjct: 1178 QFKEISFNHLPREENQIADALATLAAMFKIKEAADVRPFDLEVREVSAHCLNVEQEVDGK 1237

Query: 505  PWFWDVQNYLQNEAYPDGSSKTDQRTSRRLASGYFLTGGVLYKRSWSGLHLRCVDEGEAQ 684
            PW+ D+  Y++++ YP+  +  D+RT RRLA G+FL+G VLYKRS   + LRCVD  EA 
Sbjct: 1238 PWYHDIMQYIKHQTYPENVTDNDKRTLRRLAMGFFLSGEVLYKRSRDQVLLRCVDVAEAN 1297

Query: 685  TIMDSLHNGESGPHMHGIALARKS*TWGY 771
             IM  +H G  G H +G  LAR+    GY
Sbjct: 1298 KIMKEVHEGTCGAHANGHMLARQIMRAGY 1326


>gb|EOX94716.1| RNA-directed DNA polymerase (Reverse transcriptase), Ribonuclease H
           [Theobroma cacao]
          Length = 642

 Score =  230 bits (587), Expect = 3e-58
 Identities = 115/268 (42%), Positives = 175/268 (65%), Gaps = 11/268 (4%)
 Frame = +1

Query: 1   PDVDLLNV--------ENDVWEMYFDGASNYHGNGIGVVFKTPCGEYVPIAVKLDFDCTN 156
           PD DL+++        E + W+M+FDGASN  G+GIGVV  +P G++ P+  KL+F CTN
Sbjct: 80  PDEDLMSICQTSGEESEKENWKMFFDGASNALGHGIGVVLVSPEGDHYPVIAKLNFYCTN 139

Query: 157 NEAEYEACIKGLEAALEKEIKVLRVFGDSNLIVSQALRKWKIKEEHLVPYLQRLNELAQQ 336
           N AEYEAC+ G++AA+E++I +L V+GDS L++ Q   +W+ ++  LV Y + +++L + 
Sbjct: 140 NVAEYEACVMGIQAAIERKIHILEVYGDSALVIYQLRGEWETRDSKLVRYHKYVSKLIEN 199

Query: 337 FEDLSFHYLPRAKK*FVDALATLVSMVNVEGNQIIRPFTVRLQKQPAHIMNL---VDDKP 507
           F+++ F++LPR +    DALATL ++  V  N  I+P  + L++ PAH  ++   +D KP
Sbjct: 200 FDEICFNHLPREENQMADALATLAAIFKVGTNVKIQPIMINLRECPAHCSSVEEEIDGKP 259

Query: 508 WFWDVQNYLQNEAYPDGSSKTDQRTSRRLASGYFLTGGVLYKRSWSGLHLRCVDEGEAQT 687
           W+ D+ +YL+ + YPD SS+ D++T RRLA  +FL G +LYKRS     LRCVD  EA+ 
Sbjct: 260 WYHDIVHYLKFQQYPDQSSENDKKTIRRLAMNFFLDGNILYKRSRDQTLLRCVDSIEARR 319

Query: 688 IMDSLHNGESGPHMHGIALARKS*TWGY 771
           I+  +H G  G H  G  LAR+    GY
Sbjct: 320 IVKEVHEGVCGAHASGHKLARQVMRAGY 347


>gb|EOY09277.1| RNA-directed DNA polymerase (Reverse transcriptase), Ribonuclease H,
            putative [Theobroma cacao]
          Length = 1560

 Score =  230 bits (586), Expect = 5e-58
 Identities = 115/269 (42%), Positives = 174/269 (64%), Gaps = 12/269 (4%)
 Frame = +1

Query: 1    PDVDLLNV---------ENDVWEMYFDGASNYHGNGIGVVFKTPCGEYVPIAVKLDFDCT 153
            PD D++ V         E + W++YFDGASN  G+GIG V   P G+Y P   +L+F+C 
Sbjct: 963  PDEDMMAVLHIEEVGPNELNPWKVYFDGASNAFGHGIGAVLIFPNGKYYPATTRLNFNCN 1022

Query: 154  NNEAEYEACIKGLEAALEKEIKVLRVFGDSNLIVSQALRKWKIKEEHLVPYLQRLNELAQ 333
            NN AEYEA + GL+AA+E +   + V+GDS L++ Q   +W+ ++  LVPY + + EL++
Sbjct: 1023 NNMAEYEALVMGLQAAIEMKADAIDVYGDSALVICQMKGEWETRDPKLVPYKKLVIELSK 1082

Query: 334  QFEDLSFHYLPRAKK*FVDALATLVSMVNVEGNQIIRPFTVRLQKQPAHIMNL---VDDK 504
            QF+++SF++LPR +    DALATL +M  ++    +RPF + +++  AH +N+   VD +
Sbjct: 1083 QFKEISFNHLPREENRIADALATLAAMFKIKEAADVRPFDLEVREVSAHCLNVEEEVDGR 1142

Query: 505  PWFWDVQNYLQNEAYPDGSSKTDQRTSRRLASGYFLTGGVLYKRSWSGLHLRCVDEGEAQ 684
            PW+ D++ Y++++AYP+  +  D+RT RRLA G+FL+G VLYKRS   + LRCVD  EA 
Sbjct: 1143 PWYHDIRQYIKHQAYPENVTDNDKRTLRRLAMGFFLSGEVLYKRSRDQVLLRCVDVAEAN 1202

Query: 685  TIMDSLHNGESGPHMHGIALARKS*TWGY 771
             IM  +H G  G H +G  LAR+    GY
Sbjct: 1203 KIMKEVHEGTCGAHANGHMLARQIMRAGY 1231


>ref|XP_004510004.1| PREDICTED: uncharacterized protein LOC101511496 [Cicer arietinum]
          Length = 2016

 Score =  229 bits (584), Expect = 8e-58
 Identities = 109/267 (40%), Positives = 169/267 (63%), Gaps = 10/267 (3%)
 Frame = +1

Query: 1    PDVDLLNVENDV-------WEMYFDGASNYHGNGIGVVFKTPCGEYVPIAVKLDFDCTNN 159
            PD D++N+  ++       W + FDGASN  G+GIG +  +P  ++ P   K+ F CTNN
Sbjct: 1423 PDEDIMNLVEEIESSDKEKWRLVFDGASNALGHGIGAILISPENQFTPFTAKVCFYCTNN 1482

Query: 160  EAEYEACIKGLEAALEKEIKVLRVFGDSNLIVSQALRKWKIKEEHLVPYLQRLNELAQQF 339
             AEYEAC+ G++AA+E  +K L V+GDS L++ Q    W+ ++  L+PY   + EL +QF
Sbjct: 1483 IAEYEACVMGIKAAIESNVKFLEVYGDSLLVIHQTKGDWETRDSKLIPYHTHIKELTEQF 1542

Query: 340  EDLSFHYLPRAKK*FVDALATLVSMVNVEGNQIIRPFTVRLQKQPAHIMNL---VDDKPW 510
            E ++FH++PR +    DALATL SM  +  NQ +    ++ + +PA+ +++   +D KPW
Sbjct: 1543 EKITFHHIPREENQLADALATLSSMFKITTNQDVPVIKIQQRDKPAYCLSIEEELDGKPW 1602

Query: 511  FWDVQNYLQNEAYPDGSSKTDQRTSRRLASGYFLTGGVLYKRSWSGLHLRCVDEGEAQTI 690
            F+D+++Y++N+ YP G S+ D+R  RRL+  +FL G VLYKR+   + LRCVD+ EA  I
Sbjct: 1603 FYDIKSYVKNKEYPLGISENDKRVLRRLSMNFFLNGDVLYKRNHDMVLLRCVDKAEAGKI 1662

Query: 691  MDSLHNGESGPHMHGIALARKS*TWGY 771
            +  +H G  G H +G  +ARK    GY
Sbjct: 1663 IQEVHEGSFGTHANGHTMARKILRAGY 1689


>gb|EOY10681.1| Uncharacterized protein TCM_025982 [Theobroma cacao]
          Length = 828

 Score =  226 bits (576), Expect = 6e-57
 Identities = 112/263 (42%), Positives = 171/263 (65%), Gaps = 12/263 (4%)
 Frame = +1

Query: 1    PDVDLLNV---------ENDVWEMYFDGASNYHGNGIGVVFKTPCGEYVPIAVKLDFDCT 153
            PD DL+ V         E + W++YFDGASN  G+GIG V  +P G+Y P   +L+F+CT
Sbjct: 291  PDEDLMAVLHIEKVGPNELNPWKVYFDGASNALGHGIGAVLISPNGKYYPATARLNFNCT 350

Query: 154  NNEAEYEACIKGLEAALEKEIKVLRVFGDSNLIVSQALRKWKIKEEHLVPYLQRLNELAQ 333
            NN AEYEA + GL+AA++ +   + V+GDS L++ Q   +W+ ++  LVPY + + EL++
Sbjct: 351  NNMAEYEALVLGLQAAIDIKADAIDVYGDSVLVICQMKGEWETRDPKLVPYKKLVTELSK 410

Query: 334  QFEDLSFHYLPRAKK*FVDALATLVSMVNVEGNQIIRPFTVRLQKQPAHIMNL---VDDK 504
            QF+++SF++LPR +    DALATL +M  ++    +RPF + +++  AH +N+   VD K
Sbjct: 411  QFKEISFNHLPREENQIADALATLAAMFKIKEAADVRPFDLEVREVSAHCLNVEEEVDGK 470

Query: 505  PWFWDVQNYLQNEAYPDGSSKTDQRTSRRLASGYFLTGGVLYKRSWSGLHLRCVDEGEAQ 684
            PW+ ++  Y++++ YP+  +  D+RT RRLA G+FL+G VLYKRS   + LRCVD  EA 
Sbjct: 471  PWYHNIMQYIKHQTYPENVTDNDKRTLRRLAMGFFLSGEVLYKRSRDQVLLRCVDVAEAN 530

Query: 685  TIMDSLHNGESGPHMHGIALARK 753
             IM  +H G  G H +G  L R+
Sbjct: 531  KIMKEVHEGTCGAHANGHMLVRQ 553


>gb|EOY28051.1| RNA-directed DNA polymerase (Reverse transcriptase), Ribonuclease
            H-like protein [Theobroma cacao]
          Length = 1630

 Score =  226 bits (575), Expect = 8e-57
 Identities = 111/257 (43%), Positives = 167/257 (64%), Gaps = 12/257 (4%)
 Frame = +1

Query: 1    PDVDLLNV---------ENDVWEMYFDGASNYHGNGIGVVFKTPCGEYVPIAVKLDFDCT 153
            PD DL+ V         E + W+++FDGASN  G+GIG V  +P G+Y P   +L+F+CT
Sbjct: 1303 PDEDLMAVLHVEKVGPNELNPWKVFFDGASNALGHGIGAVLISPNGKYYPATARLNFNCT 1362

Query: 154  NNEAEYEACIKGLEAALEKEIKVLRVFGDSNLIVSQALRKWKIKEEHLVPYLQRLNELAQ 333
            NN AEYEA + GL+AA++ +   + V+GDS L++ Q   +W+ ++  LVPY + + EL++
Sbjct: 1363 NNMAEYEALVLGLQAAIDMKADAIDVYGDSALVICQMKGEWETRDPKLVPYKKLVTELSK 1422

Query: 334  QFEDLSFHYLPRAKK*FVDALATLVSMVNVEGNQIIRPFTVRLQKQPAHIMNL---VDDK 504
            QF+++SF++LPR +    DALATL +M  ++    +RPF +  ++  AH +N+   VD K
Sbjct: 1423 QFKEISFNHLPREENQIADALATLAAMFQIKEAADVRPFDLEAREVSAHCLNVEEEVDGK 1482

Query: 505  PWFWDVQNYLQNEAYPDGSSKTDQRTSRRLASGYFLTGGVLYKRSWSGLHLRCVDEGEAQ 684
            PW+ D+  Y++++AYP+  +  D+RT RRLA G+FL G VLYKRS   + LRCVD  EA 
Sbjct: 1483 PWYHDIMQYIKHQAYPENVTDNDKRTLRRLAMGFFLNGEVLYKRSRDQVLLRCVDVAEAN 1542

Query: 685  TIMDSLHNGESGPHMHG 735
             IM  +H G  G H +G
Sbjct: 1543 KIMKEVHEGTCGAHANG 1559


>ref|XP_004490844.1| PREDICTED: uncharacterized protein LOC101509363 [Cicer arietinum]
          Length = 1955

 Score =  225 bits (573), Expect = 1e-56
 Identities = 103/252 (40%), Positives = 164/252 (65%), Gaps = 3/252 (1%)
 Frame = +1

Query: 25   ENDVWEMYFDGASNYHGNGIGVVFKTPCGEYVPIAVKLDFDCTNNEAEYEACIKGLEAAL 204
            + + W + F GASN  G+GIG +  +P  ++ P   +L FDCTNN AEYEAC+ G++AA+
Sbjct: 1377 DKEKWRLVFVGASNALGHGIGAILISPENQFTPFTARLCFDCTNNIAEYEACVMGIKAAI 1436

Query: 205  EKEIKVLRVFGDSNLIVSQALRKWKIKEEHLVPYLQRLNELAQQFEDLSFHYLPRAKK*F 384
            E  +K L V+GDS L++ Q   +W+ ++  L+PY   + EL + FE ++F+++PR +   
Sbjct: 1437 ESNVKFLEVYGDSLLVIHQTKGEWETRDSKLIPYHTHIKELTEHFEKITFNHIPREENQL 1496

Query: 385  VDALATLVSMVNVEGNQIIRPFTVRLQKQPAHIMNL---VDDKPWFWDVQNYLQNEAYPD 555
             DALATL SM  +  NQ +    ++ + +PA+ +++   +D KPWF+D+++Y++N  YP 
Sbjct: 1497 ADALATLSSMFKITTNQDVPVIKIQQRDKPAYCLSIEEELDSKPWFYDIKSYVKNREYPS 1556

Query: 556  GSSKTDQRTSRRLASGYFLTGGVLYKRSWSGLHLRCVDEGEAQTIMDSLHNGESGPHMHG 735
            G S+ D+R  RRL+  +FL G VLYKR+ + + LRC+D+ EA+ I+  +H G  G H +G
Sbjct: 1557 GISENDKRVLRRLSMNFFLNGDVLYKRNHNMVLLRCLDKAEAEKIIQEVHEGSFGTHANG 1616

Query: 736  IALARKS*TWGY 771
             A+ARK    GY
Sbjct: 1617 HAMARKILRAGY 1628


>ref|XP_003551524.1| PREDICTED: uncharacterized protein LOC100805548 [Glycine max]
          Length = 2323

 Score =  224 bits (572), Expect = 2e-56
 Identities = 110/248 (44%), Positives = 156/248 (62%), Gaps = 3/248 (1%)
 Frame = +1

Query: 37   WEMYFDGASNYHGNGIGVVFKTPCGEYVPIAVKLDFDCTNNEAEYEACIKGLEAALEKEI 216
            W + FDGA N  GNGIG V  TP G ++P A +L FDCTNN AEYEACI G+E A++ +I
Sbjct: 1749 WGLIFDGAVNVFGNGIGAVIITPEGNHLPFAARLQFDCTNNVAEYEACILGIEKAIDLKI 1808

Query: 217  KVLRVFGDSNLIVSQALRKWKIKEEHLVPYLQRLNELAQQFEDLSFHYLPRAKK*FVDAL 396
            K L ++GDS L+++Q   +W+ +   L+PY      L   F  +  H++PR +    DAL
Sbjct: 1809 KNLDIYGDSALVINQIKGEWETRHPGLIPYKDYARRLLTFFNKVELHHIPRDENQMADAL 1868

Query: 397  ATLVSMVNVEGNQIIRPFTVRLQKQPAHIM---NLVDDKPWFWDVQNYLQNEAYPDGSSK 567
            ATL SM  V     +    ++  ++PAH+     +VDDKPWF D++ +LQ++ YP G+S 
Sbjct: 1869 ATLSSMYEVSHRNNLPTIRIQRLERPAHVFAVEEVVDDKPWFHDIKYFLQSQEYPPGASN 1928

Query: 568  TDQRTSRRLASGYFLTGGVLYKRSWSGLHLRCVDEGEAQTIMDSLHNGESGPHMHGIALA 747
             D+RT RRL+  +FL G VLYKR++  + LRCVD+ EA+ +M  +H G  G H +G A+A
Sbjct: 1929 KDRRTLRRLSGNFFLNGDVLYKRNFDMVLLRCVDKQEAEFLMHEIHEGSFGTHSNGHAMA 1988

Query: 748  RKS*TWGY 771
            RK    GY
Sbjct: 1989 RKLLRAGY 1996


>ref|XP_003518330.1| PREDICTED: uncharacterized protein LOC100818337 [Glycine max]
          Length = 2323

 Score =  224 bits (571), Expect = 2e-56
 Identities = 110/248 (44%), Positives = 156/248 (62%), Gaps = 3/248 (1%)
 Frame = +1

Query: 37   WEMYFDGASNYHGNGIGVVFKTPCGEYVPIAVKLDFDCTNNEAEYEACIKGLEAALEKEI 216
            W + FDGA N  GNGIG V  TP G ++P A +L FDCTNN AEYEACI G+E A++ +I
Sbjct: 1749 WGLIFDGAVNVFGNGIGAVIITPEGNHLPFAARLQFDCTNNVAEYEACILGIEKAIDLKI 1808

Query: 217  KVLRVFGDSNLIVSQALRKWKIKEEHLVPYLQRLNELAQQFEDLSFHYLPRAKK*FVDAL 396
            K L ++GDS L+++Q   +W+ +   L+PY      L   F  +  H++PR +    DAL
Sbjct: 1809 KNLDIYGDSALVINQIKGEWETRHPGLIPYKDYAKHLLTFFNKVELHHIPRDENQMADAL 1868

Query: 397  ATLVSMVNVEGNQIIRPFTVRLQKQPAHIM---NLVDDKPWFWDVQNYLQNEAYPDGSSK 567
            ATL SM  V     +    ++  ++PAH+     +VDDKPWF D++ +LQ++ YP G+S 
Sbjct: 1869 ATLSSMYEVSHRNNLPTIRIQRLERPAHVFAVEEVVDDKPWFHDIKCFLQSQEYPPGASN 1928

Query: 568  TDQRTSRRLASGYFLTGGVLYKRSWSGLHLRCVDEGEAQTIMDSLHNGESGPHMHGIALA 747
             D+RT RRL+  +FL G VLYKR++  + LRCVD+ EA+ +M  +H G  G H +G A+A
Sbjct: 1929 KDRRTLRRLSGNFFLNGDVLYKRNFDMVLLRCVDKQEAEFLMHEIHEGSFGTHSNGHAMA 1988

Query: 748  RKS*TWGY 771
            RK    GY
Sbjct: 1989 RKLLRAGY 1996


>gb|ABD28291.1| Integrase, catalytic region; Ribonuclease H [Medicago truncatula]
          Length = 981

 Score =  224 bits (570), Expect = 3e-56
 Identities = 110/257 (42%), Positives = 161/257 (62%), Gaps = 6/257 (2%)
 Frame = +1

Query: 37   WEMYFDGASNYHGNGIGVVFKTPCGEYVPIAVKLDFDCTNNEAEYEACIKGLEAALEKEI 216
            W + FDGA N +G+GIG V  TP G ++P   +L FDCTNN AEYEACI G+E A++  I
Sbjct: 407  WGLIFDGAVNVYGSGIGAVLITPKGTHIPFTARLRFDCTNNIAEYEACIMGIEEAIDLRI 466

Query: 217  KVLRVFGDSNLIVSQALRKWKIKEEHLVPYLQRLNELAQQFEDLSFHYLPRAKK*FVDAL 396
            K + ++GDS L+++Q   +W+ +   L+PY      L   F  +  H++PR +    DAL
Sbjct: 467  KKIVIYGDSALVINQIKGEWETRHPGLIPYRDYARRLLTFFNKVELHHVPRDENQMADAL 526

Query: 397  ATLVSMVNVEGNQIIRPFTVRLQKQPAHIM---NLVDDKPWFWDVQNYLQNEAYPDGSSK 567
            ATL SM+NV G+ I+    V+   +PA++     + DDKPW+ D+Q +LQ + YP G+S 
Sbjct: 527  ATLSSMINVNGHNIVPVINVQFLDRPAYVFVAEAIDDDKPWYHDIQVFLQTQKYPPGASN 586

Query: 568  TDQRTSRRLASGYFLTGGVLYKRSWSGLHLRCVDEGEAQTIMDSLHNGESGPHMHGIALA 747
             D++T R+L+S +FL   VLYKR++ G+ LRCVD+ EA+ +M  +H G  G H  G A+A
Sbjct: 587  KDKKTLRKLSSRFFLNEDVLYKRNFDGVLLRCVDKHEAEKLMREIHEGSFGTHSCGHAMA 646

Query: 748  RKS*TWGY---TGHYEC 789
            +K    GY   T H +C
Sbjct: 647  KKILRAGYYWITMHADC 663


>ref|XP_003555077.1| PREDICTED: uncharacterized protein LOC100811111 [Glycine max]
          Length = 2265

 Score =  224 bits (570), Expect = 3e-56
 Identities = 110/248 (44%), Positives = 156/248 (62%), Gaps = 3/248 (1%)
 Frame = +1

Query: 37   WEMYFDGASNYHGNGIGVVFKTPCGEYVPIAVKLDFDCTNNEAEYEACIKGLEAALEKEI 216
            W + FDGA N  GNGIG V  TP G ++P A +L FDCTNN AEYEACI G+E A++ +I
Sbjct: 1691 WGLIFDGAVNVFGNGIGAVIITPEGSHLPFAARLQFDCTNNVAEYEACILGIEKAIDLKI 1750

Query: 217  KVLRVFGDSNLIVSQALRKWKIKEEHLVPYLQRLNELAQQFEDLSFHYLPRAKK*FVDAL 396
            K L ++GDS L+++Q   +W+ +   L+PY      L   F  +  H++PR +    DAL
Sbjct: 1751 KNLDIYGDSALVINQIKGEWETRHPGLIPYKDYAKHLLTFFNKVELHHIPRDENQMADAL 1810

Query: 397  ATLVSMVNVEGNQIIRPFTVRLQKQPAHIM---NLVDDKPWFWDVQNYLQNEAYPDGSSK 567
            ATL SM  V     +    ++  ++PAH+     +VDDKPWF D++ +LQ++ YP G+S 
Sbjct: 1811 ATLSSMYEVSHRNNLPTIRIQRLERPAHVFAVEEVVDDKPWFHDIKCFLQSQEYPPGASN 1870

Query: 568  TDQRTSRRLASGYFLTGGVLYKRSWSGLHLRCVDEGEAQTIMDSLHNGESGPHMHGIALA 747
             D+RT RRL+  +FL G VLYKR++  + LRCVD+ EA+ +M  +H G  G H +G A+A
Sbjct: 1871 KDRRTLRRLSGNFFLNGDVLYKRNFDMVLLRCVDKQEAEFLMHEVHEGSFGTHSNGHAMA 1930

Query: 748  RKS*TWGY 771
            RK    GY
Sbjct: 1931 RKLLRAGY 1938


>ref|XP_003553327.1| PREDICTED: uncharacterized protein LOC100814838 [Glycine max]
          Length = 2284

 Score =  224 bits (570), Expect = 3e-56
 Identities = 110/248 (44%), Positives = 155/248 (62%), Gaps = 3/248 (1%)
 Frame = +1

Query: 37   WEMYFDGASNYHGNGIGVVFKTPCGEYVPIAVKLDFDCTNNEAEYEACIKGLEAALEKEI 216
            W + FDGA N  GNGIG V  TP G ++P A +L FDCTNN AEYEACI G+E A++  I
Sbjct: 1710 WGLIFDGAVNVFGNGIGAVIITPEGNHLPFAARLQFDCTNNMAEYEACILGIEKAIDLRI 1769

Query: 217  KVLRVFGDSNLIVSQALRKWKIKEEHLVPYLQRLNELAQQFEDLSFHYLPRAKK*FVDAL 396
            K L ++GDS L+++Q   +W+ +   L+PY      L   F  +  H++PR +    DAL
Sbjct: 1770 KNLDIYGDSALVINQIKGEWETRHPGLIPYKDYARRLLTFFNKVELHHIPRDENQMADAL 1829

Query: 397  ATLVSMVNVEGNQIIRPFTVRLQKQPAHIM---NLVDDKPWFWDVQNYLQNEAYPDGSSK 567
            ATL SM  V     +    ++  ++PAH+     +VDDKPWF D++ +LQ++ YP G+S 
Sbjct: 1830 ATLSSMYEVSHRNNLPTIRIQRLERPAHVFAVEEIVDDKPWFHDIKCFLQSQEYPPGASN 1889

Query: 568  TDQRTSRRLASGYFLTGGVLYKRSWSGLHLRCVDEGEAQTIMDSLHNGESGPHMHGIALA 747
             D+RT RRL+  +FL G VLYKR++  + LRCVD+ EA+ +M  +H G  G H +G A+A
Sbjct: 1890 KDRRTLRRLSGNFFLNGDVLYKRNFDMVLLRCVDKQEAEFLMHEVHEGSFGTHSNGHAMA 1949

Query: 748  RKS*TWGY 771
            RK    GY
Sbjct: 1950 RKLLRAGY 1957


>ref|XP_003544290.1| PREDICTED: uncharacterized protein LOC100815788 [Glycine max]
          Length = 2270

 Score =  224 bits (570), Expect = 3e-56
 Identities = 110/248 (44%), Positives = 155/248 (62%), Gaps = 3/248 (1%)
 Frame = +1

Query: 37   WEMYFDGASNYHGNGIGVVFKTPCGEYVPIAVKLDFDCTNNEAEYEACIKGLEAALEKEI 216
            W + FDGA N  GNGIG V  TP G ++P A +L FDCTNN AEYEACI G+E A++  I
Sbjct: 1696 WGLIFDGAVNVFGNGIGAVIITPEGNHLPFAARLQFDCTNNMAEYEACILGIEKAIDLRI 1755

Query: 217  KVLRVFGDSNLIVSQALRKWKIKEEHLVPYLQRLNELAQQFEDLSFHYLPRAKK*FVDAL 396
            K L ++GDS L+++Q   +W+ +   L+PY      L   F  +  H++PR +    DAL
Sbjct: 1756 KNLDIYGDSALVINQIKGEWETRHPGLIPYKDYARRLLTFFNKVELHHIPRDENQMADAL 1815

Query: 397  ATLVSMVNVEGNQIIRPFTVRLQKQPAHIM---NLVDDKPWFWDVQNYLQNEAYPDGSSK 567
            ATL SM  V     +    ++  ++PAH+     +VDDKPWF D++ +LQ++ YP G+S 
Sbjct: 1816 ATLSSMYEVSHRNNLPTIRIQRLERPAHVFAVEEVVDDKPWFHDIKCFLQSQEYPPGASN 1875

Query: 568  TDQRTSRRLASGYFLTGGVLYKRSWSGLHLRCVDEGEAQTIMDSLHNGESGPHMHGIALA 747
             D+RT RRL+  +FL G VLYKR++  + LRCVD+ EA+ +M  +H G  G H +G A+A
Sbjct: 1876 KDRRTLRRLSGNFFLNGDVLYKRNFDMVLLRCVDKQEAELLMHEVHEGSFGTHSNGHAMA 1935

Query: 748  RKS*TWGY 771
            RK    GY
Sbjct: 1936 RKLLRAGY 1943


>ref|XP_003530624.1| PREDICTED: uncharacterized protein LOC100785887 [Glycine max]
          Length = 2320

 Score =  223 bits (569), Expect = 4e-56
 Identities = 110/248 (44%), Positives = 156/248 (62%), Gaps = 3/248 (1%)
 Frame = +1

Query: 37   WEMYFDGASNYHGNGIGVVFKTPCGEYVPIAVKLDFDCTNNEAEYEACIKGLEAALEKEI 216
            W + FDGA N  GNGIG V  TP G ++P A +L FDCTNN AEYEACI G+E A++ +I
Sbjct: 1746 WGLIFDGAVNVFGNGIGAVIITPEGNHLPFAARLRFDCTNNVAEYEACILGIEKAIDLKI 1805

Query: 217  KVLRVFGDSNLIVSQALRKWKIKEEHLVPYLQRLNELAQQFEDLSFHYLPRAKK*FVDAL 396
            K L ++GDS L+++Q   +W+ +   L+PY      L   F  +  H++PR +    DAL
Sbjct: 1806 KNLDIYGDSALVINQIKGEWETRHPGLIPYKDYAKHLLTFFNKVELHHIPRDENQMADAL 1865

Query: 397  ATLVSMVNVEGNQIIRPFTVRLQKQPAHIM---NLVDDKPWFWDVQNYLQNEAYPDGSSK 567
            ATL SM  V     +    ++  ++PAH+     +VDDKPWF D++ +LQ++ YP G+S 
Sbjct: 1866 ATLSSMYEVSHRNNLPTIRIQRLERPAHVFAVEEVVDDKPWFHDIKCFLQSQEYPPGASN 1925

Query: 568  TDQRTSRRLASGYFLTGGVLYKRSWSGLHLRCVDEGEAQTIMDSLHNGESGPHMHGIALA 747
             D+RT RRL+  +FL G VLYKR++  + LRCVD+ EA+ +M  +H G  G H +G A+A
Sbjct: 1926 KDRRTLRRLSGNFFLNGDVLYKRNFDMVLLRCVDKQEAEFLMHEIHEGSFGTHSNGHAMA 1985

Query: 748  RKS*TWGY 771
            RK    GY
Sbjct: 1986 RKLLRAGY 1993


>ref|XP_003522184.1| PREDICTED: uncharacterized protein LOC100786848 [Glycine max]
          Length = 2243

 Score =  223 bits (567), Expect = 7e-56
 Identities = 110/248 (44%), Positives = 154/248 (62%), Gaps = 3/248 (1%)
 Frame = +1

Query: 37   WEMYFDGASNYHGNGIGVVFKTPCGEYVPIAVKLDFDCTNNEAEYEACIKGLEAALEKEI 216
            W + FDGA N  GNGIG V  TP G ++P A +L FDCTNN AEYEACI G+E A++  I
Sbjct: 1669 WGLIFDGAVNVFGNGIGAVIITPEGNHLPFAARLQFDCTNNMAEYEACILGIEKAIDLRI 1728

Query: 217  KVLRVFGDSNLIVSQALRKWKIKEEHLVPYLQRLNELAQQFEDLSFHYLPRAKK*FVDAL 396
            K L ++GDS L++ Q   +W+ +   L+PY      L   F  +  H++PR +    DAL
Sbjct: 1729 KNLDIYGDSALVIYQIKGEWETRHPGLIPYKDYARHLLTFFNKVELHHIPRDENQMADAL 1788

Query: 397  ATLVSMVNVEGNQIIRPFTVRLQKQPAHIM---NLVDDKPWFWDVQNYLQNEAYPDGSSK 567
            ATL SM  V     +    ++  ++PAH+     +VDDKPWF D++ +LQ++ YP G+S 
Sbjct: 1789 ATLSSMYEVSHRNNLPTIRIQRLERPAHVFAVEEVVDDKPWFHDIKCFLQSQEYPPGASN 1848

Query: 568  TDQRTSRRLASGYFLTGGVLYKRSWSGLHLRCVDEGEAQTIMDSLHNGESGPHMHGIALA 747
             D+RT RRL+  +FL G VLYKR++  + LRCVD+ EA+ +M  +H G  G H +G A+A
Sbjct: 1849 KDRRTLRRLSGNFFLNGDVLYKRNFDMVLLRCVDKQEAELLMHEVHEGSFGTHSNGHAMA 1908

Query: 748  RKS*TWGY 771
            RK    GY
Sbjct: 1909 RKLLRAGY 1916


Top