BLASTX nr result

ID: Rehmannia28_contig00020769 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia28_contig00020769
         (1961 letters)

Database: ./nr 
           84,704,028 sequences; 31,038,470,784 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_008779954.1| PREDICTED: uncharacterized protein LOC103699...   394   e-127
ref|XP_008777304.1| PREDICTED: uncharacterized protein LOC103697...   391   e-126
ref|XP_007024403.1| Uncharacterized protein TCM_028976 [Theobrom...   280   9e-86
ref|XP_007017136.1| Uncharacterized protein TCM_033758 [Theobrom...   265   5e-80
gb|KHN02608.1| hypothetical protein glysoja_043563, partial [Gly...   251   1e-72
gb|KHN07990.1| hypothetical protein glysoja_045923, partial [Gly...   250   2e-72
ref|XP_014630525.1| PREDICTED: uncharacterized protein LOC106798...   243   7e-71
ref|XP_012835096.1| PREDICTED: uncharacterized protein LOC105955...   243   2e-69
gb|KHN34741.1| Retrovirus-related Pol polyprotein from transposo...   241   1e-68
gb|KYP75905.1| Retrovirus-related Pol polyprotein from transposo...   243   2e-68
ref|XP_012833844.1| PREDICTED: uncharacterized protein LOC105954...   242   1e-67
ref|XP_006576053.1| PREDICTED: uncharacterized protein LOC102662...   236   2e-67
gb|KYP31881.1| Putative transposon Ty5-1 protein YCL075W family ...   234   9e-67
gb|KYP74100.1| Retrovirus-related Pol polyprotein from transposo...   231   1e-65
gb|KYP46603.1| hypothetical protein KK1_031768 [Cajanus cajan]        232   2e-65
ref|XP_009774775.1| PREDICTED: uncharacterized protein LOC104224...   229   1e-64
ref|XP_007044837.1| Uncharacterized protein TCM_010591 [Theobrom...   224   2e-64
ref|XP_012453130.1| PREDICTED: uncharacterized protein LOC105775...   228   9e-64
gb|KYP49735.1| Retrovirus-related Pol polyprotein from transposo...   226   9e-64
ref|XP_008350470.1| PREDICTED: uncharacterized protein LOC103413...   226   1e-63

>ref|XP_008779954.1| PREDICTED: uncharacterized protein LOC103699729, partial [Phoenix
            dactylifera]
          Length = 490

 Score =  394 bits (1011), Expect = e-127
 Identities = 212/447 (47%), Positives = 286/447 (63%), Gaps = 46/447 (10%)
 Frame = -2

Query: 1207 AAQITQATA---NRSPFPTEDPNKPFFLPPSDNSHTIEIRPPLDGTNYTAWSRAFSLSIS 1037
            A+ I+ A+A   N    P+EDPN PFFL  +DN+ T+ + PPL G+NY +WSR+FSL+IS
Sbjct: 4    ASHISSASATSPNHVFTPSEDPNSPFFLHHTDNAQTVIVTPPLVGSNYLSWSRSFSLAIS 63

Query: 1036 VKNKQGFLDGTIPTPDFSDPLYIPWLRCNNLILTWLINSVTKEIASSIIYMNSAKDVWDT 857
            +KNK GFLDG+I TP+ +DPLYIPWLRCNNLIL WL+NS++KEIAS+++++ SAK+VW+ 
Sbjct: 64   IKNKLGFLDGSISTPEVTDPLYIPWLRCNNLILAWLLNSISKEIASNVLFIKSAKEVWNK 123

Query: 856  LKLRYSQPDSVRIXXXXXXXXXXXXXXXXXSDYFTSLNTIWEELRNYRPIPHCSCGQCTC 677
            LK R++QPD+VRI                 S+YFT LN IWEELRNYRP+P+CSCG C C
Sbjct: 124  LKSRFAQPDNVRIYQLKQQLSSITQRSLSVSEYFTQLNAIWEELRNYRPLPYCSCGHCIC 183

Query: 676  QAIKSVGEIQLSDYTFKFLMGLNETYDSTRGQILLMNPIPSLDTVYAMLLQEERQREARL 497
             A+K VGE    D+ F+FLMGLN+TYD+ RGQI+LM+P+PSLD  ++++LQEERQR+AR 
Sbjct: 184  DALKGVGEDLELDHIFQFLMGLNDTYDTVRGQIILMSPLPSLDKTFSLVLQEERQRQARA 243

Query: 496  SFMPSSESSALAVGAHPSKKKFKLDIICQHCGKPGHSIDKCFRLIGFPPNFKFTKGK--N 323
               P+ ESSALA  A  +K K + +I C HCGK GH+ +KC+RLIGFPPNFKFTK K  +
Sbjct: 244  IIFPAPESSALA--AVLNKSKNRAEITCYHCGKSGHTKEKCYRLIGFPPNFKFTKTKFPS 301

Query: 322  AAGKGIGQNHSANCIPPPEIPAASSDK---TKHFSFTQEQVQKLMTLLNGDPMEVSQPS- 155
               K +   HSAN     ++ +++  K       S +Q Q+Q+L+ L+N    ++S  S 
Sbjct: 302  VNNKSVAP-HSAN-----QVISSTQGKGLSAPQLSLSQTQIQQLLALVNSGIPQMSLNSA 355

Query: 154  ------------PAPDNPSNTSHFSNMAG------NITLNSQFKS--------------- 74
                        P  +  +N++  SNMAG      NIT      S               
Sbjct: 356  STQQEPILPMVTPTTETGNNSAPSSNMAGIDLCLSNITHVPDSSSTKHYSHLAYIMDHRP 415

Query: 73   ----KFSWIIDTGASDHIVCCSSLFTS 5
                +  WIIDTGA+DH+VC +   TS
Sbjct: 416  HKIHEVPWIIDTGATDHMVCSTKFLTS 442


>ref|XP_008777304.1| PREDICTED: uncharacterized protein LOC103697258 [Phoenix dactylifera]
          Length = 514

 Score =  391 bits (1005), Expect = e-126
 Identities = 209/445 (46%), Positives = 277/445 (62%), Gaps = 44/445 (9%)
 Frame = -2

Query: 1207 AAQITQATA---NRSPFPTEDPNKPFFLPPSDNSHTIEIRPPLDGTNYTAWSRAFSLSIS 1037
            A+ I+ A A   N    P+EDPN PFFL  +DN+ T+ + PPL G+NY +WSR+FSL+IS
Sbjct: 4    ASHISSALATSPNHVFTPSEDPNSPFFLHRTDNAQTVIVTPPLIGSNYLSWSRSFSLAIS 63

Query: 1036 VKNKQGFLDGTIPTPDFSDPLYIPWLRCNNLILTWLINSVTKEIASSIIYMNSAKDVWDT 857
            +KNK GFLDG+IPTP+ +DPLY+PWLRCNNLIL WL+NS++KEIAS+++++ S K+VW+ 
Sbjct: 64   IKNKLGFLDGSIPTPEVTDPLYVPWLRCNNLILAWLLNSISKEIASNVLFIKSTKEVWNK 123

Query: 856  LKLRYSQPDSVRIXXXXXXXXXXXXXXXXXSDYFTSLNTIWEELRNYRPIPHCSCGQCTC 677
            LK R++QPD+VRI                 S+YFT LN IWEELRNYRP+P+CSCG C C
Sbjct: 124  LKSRFAQPDNVRIYQLKQQLSSITQGTLSVSEYFTQLNAIWEELRNYRPLPYCSCGHCIC 183

Query: 676  QAIKSVGEIQLSDYTFKFLMGLNETYDSTRGQILLMNPIPSLDTVYAMLLQEERQREARL 497
             A+K VGE    DY F+FLM LN T+DS RGQI+LM+P+PSLD  ++++LQEERQR+AR 
Sbjct: 184  DALKGVGENLELDYIFQFLMELNNTFDSVRGQIILMSPLPSLDKTFSLVLQEERQRQARA 243

Query: 496  SFMPSSESSALAVGAHPSKKKFKLDIICQHCGKPGHSIDKCFRLIGFPPNFKFTKGKNAA 317
               P+ ESSALA  A  +K K K  I C HCGKPGH+ +KC+RLIGFPPNFKFTK K+ +
Sbjct: 244  IIFPAPESSALA--AVLNKPKNKAKITCYHCGKPGHTREKCYRLIGFPPNFKFTKTKSPS 301

Query: 316  --GKGIGQNHSANCIPPPEIPAASSDKTKHFSFTQEQVQKLMTLLNGDPMEV------SQ 161
               K +  +HSAN +  P             S +Q QVQ+L  L+N    ++      SQ
Sbjct: 302  VNNKSVA-SHSANQVISP--TQGKGLAAPQLSLSQAQVQQLFALVNSGITQLNLNSASSQ 358

Query: 160  PSPAP-------DNPSN--------------------------TSHFSNMAGNITLNSQF 80
              P P       +  SN                          T H S++   +      
Sbjct: 359  QEPIPPMMKPITETGSNSTSTNMADIDLCLSSITRVPDTSLCSTKHHSHLTYLMDHRPHR 418

Query: 79   KSKFSWIIDTGASDHIVCCSSLFTS 5
              +  WI+DTGA+DH+VC ++  TS
Sbjct: 419  THEVPWIVDTGATDHMVCSTTFLTS 443


>ref|XP_007024403.1| Uncharacterized protein TCM_028976 [Theobroma cacao]
            gi|508779769|gb|EOY27025.1| Uncharacterized protein
            TCM_028976 [Theobroma cacao]
          Length = 318

 Score =  280 bits (716), Expect = 9e-86
 Identities = 133/302 (44%), Positives = 188/302 (62%)
 Frame = -2

Query: 1213 EVAAQITQATANRSPFPTEDPNKPFFLPPSDNSHTIEIRPPLDGTNYTAWSRAFSLSISV 1034
            ++ +QI+QA          DP  P++L  +D+  ++ + P L   NY AWSR+F L++S+
Sbjct: 17   QLTSQISQAN---------DPPSPYYLHHTDHLGSVVVNPKLTTNNYVAWSRSFLLALSI 67

Query: 1033 KNKQGFLDGTIPTPDFSDPLYIPWLRCNNLILTWLINSVTKEIASSIIYMNSAKDVWDTL 854
            +NK GF++G+IP P  +D L+  W RCNNLI++WL+NS+++ IAS+I +M S  ++W+TL
Sbjct: 68   RNKVGFINGSIPKPSITDDLHPIWNRCNNLIVSWLLNSISQPIASTIFFMESVAEIWNTL 127

Query: 853  KLRYSQPDSVRIXXXXXXXXXXXXXXXXXSDYFTSLNTIWEELRNYRPIPHCSCGQCTCQ 674
            KL Y+QPD+  +                   YF  L  IWEELRNYRP+PHC CG+C   
Sbjct: 128  KLNYAQPDNTCVCNLQYTLGSVTQRVKIVYAYFIELKCIWEELRNYRPLPHCECGKCNAN 187

Query: 673  AIKSVGEIQLSDYTFKFLMGLNETYDSTRGQILLMNPIPSLDTVYAMLLQEERQREARLS 494
              K   +    D  F+FL GLNE++ + R QI+LM+PIPSLD VY+M+L+EE Q+   L 
Sbjct: 188  CFKKFSDQYQKDMVFRFLNGLNESFSAIRSQIILMDPIPSLDKVYSMVLREESQKNMFLQ 247

Query: 493  FMPSSESSALAVGAHPSKKKFKLDIICQHCGKPGHSIDKCFRLIGFPPNFKFTKGKNAAG 314
              P  ES A+    +  KK  K D+ C HCGK GH  +KC+R+I FP +FKFTKGK    
Sbjct: 248  SQPFLESLAMLAATNVKKKPMK-DLTCTHCGKKGHVKEKCYRIIRFPEDFKFTKGKPYVK 306

Query: 313  KG 308
            KG
Sbjct: 307  KG 308


>ref|XP_007017136.1| Uncharacterized protein TCM_033758 [Theobroma cacao]
            gi|508722464|gb|EOY14361.1| Uncharacterized protein
            TCM_033758 [Theobroma cacao]
          Length = 328

 Score =  265 bits (678), Expect = 5e-80
 Identities = 133/294 (45%), Positives = 183/294 (62%), Gaps = 3/294 (1%)
 Frame = -2

Query: 1057 AFSLSISVKNKQGFLDGTIPTPDFSDPLYIPWLRCNNLILTWLINSVTKEIASSIIYMNS 878
            +F L++S++NK  F+DG+IP PD SD L++P  RCN+LIL WL+ S++  IAS++ Y+  
Sbjct: 23   SFLLALSIQNKSRFIDGSIPEPDVSDKLFVPCTRCNSLILAWLLESISPPIASTVFYIRK 82

Query: 877  AKDVWDTLKLRYSQPDSVRIXXXXXXXXXXXXXXXXXSDYFTSLNTIWEELRNYRPIPHC 698
            A +VW+TLK R+SQPD  RI                   YFT LN IWEELRNYRP+PHC
Sbjct: 83   AYEVWETLKERFSQPDDARICNLQFNLYNISQGTRSVDAYFTELNCIWEELRNYRPLPHC 142

Query: 697  SCGQCTCQAIKSVGEIQLSDYTFKFLMGLNETYDSTRGQILLMNPIPSLDTVYAMLLQEE 518
            SCG C     ++  +    D  F+FL GLNE++ + R QIL+M P PSL+  Y +++++E
Sbjct: 143  SCGICNSACFQTYIDQYQKDSVFRFLNGLNESFSALRSQILMMKPFPSLNKAYNLVIRDE 202

Query: 517  RQREARLSFMPSSESSALAVGAHPSKKKFKLDIICQHCGKPGHSIDKCFRLIGFPPNFKF 338
             QR   L  MP  ESSA+A      K K K+D++C +C K GH+ DKC+RLIGFPP+FKF
Sbjct: 203  SQRNLYLHTMPIIESSAMATMTE-GKVKSKVDVVCSYCHKKGHTKDKCYRLIGFPPDFKF 261

Query: 337  TKGKNAAGKGIGQNHSANCIPPPEIPAASSDKTKHFS---FTQEQVQKLMTLLN 185
             KGK+   K  G   S N + P        + TK  S    ++ Q+QKLM+L+N
Sbjct: 262  LKGKSPLKK--GNVWSINNVGPVTSKEECDESTKSLSSLTLSKHQIQKLMSLIN 313


>gb|KHN02608.1| hypothetical protein glysoja_043563, partial [Glycine soja]
          Length = 484

 Score =  251 bits (641), Expect = 1e-72
 Identities = 139/407 (34%), Positives = 215/407 (52%), Gaps = 19/407 (4%)
 Frame = -2

Query: 1168 FPTEDPNKPFFLPPSDNSHTIEIRPPLDGTNYTAWSRAFSLSISVKNKQGFLDGTIPTPD 989
            F T   N P++L P++N   + + P L   NY  WSR+  +++  KNK  F+DG++P P 
Sbjct: 1    FSTNSAN-PYYLHPNENPALVLVSPSLTAKNYHTWSRSMHIALISKNKDKFIDGSLPKPP 59

Query: 988  FSDPLYIPWLRCNNLILTWLINSVTKEIASSIIYMNSAKDVWDTLKLRYSQPDSVRIXXX 809
             SDPLY PW+RCN ++L W+  S++  IA S++++++A  VW  L++R+SQ D  RI   
Sbjct: 60   VSDPLYAPWIRCNTMVLAWIHRSISDSIAKSVLWIDTAAGVWKNLRIRFSQSDIFRISDL 119

Query: 808  XXXXXXXXXXXXXXSDYFTSLNTIWEELRNYRPIPHCSCG-QCTCQAIKSVGEIQLSDYT 632
                          SDYFT L   W+EL NYRPIPHC C   C+C  I SV   +  DY 
Sbjct: 120  QEDLYRFRQGTLDVSDYFTQLKIYWDELENYRPIPHCKCSIPCSCGGIDSVRVYREQDYV 179

Query: 631  FKFLMGLNETYDSTRGQILLMNPIPSLDTVYAMLLQEERQR--EARLSFMPSSESSALAV 458
             +FL GLN+ +  ++ QI++MNP+P +D V+++++Q+ER+       S   ++  SA+A+
Sbjct: 180  IRFLKGLNDRFSHSKSQIMMMNPLPDIDHVFSLVIQQEREMLGSNSDSVSEATSDSAMAM 239

Query: 457  GAHPSKKKFK---------------LDIICQHCGKPGHSIDKCFRLIGFPPNFKFTKGKN 323
              + ++  F                 + +C HCGK  H +D CF  IG+PP +K  K KN
Sbjct: 240  QVNSNQSNFNGKGGYYNKGKGSSKGGNRVCTHCGKTNHIVDNCFEKIGYPPGYKTNKSKN 299

Query: 322  AAGKGIGQNHS-ANCIPPPEIPAASSDKTKHFSFTQEQVQKLMTLLNGDPMEVSQPSPAP 146
            ++      N S A+ +   E     S     F FTQE  Q ++  L    +  SQP    
Sbjct: 300  SSSSSQANNTSNASAL---ESTQQGSSAQSSFQFTQEMYQGILEALQQSKVG-SQPKA-- 353

Query: 145  DNPSNTSHFSNMAGNITLNSQFKSKFSWIIDTGASDHIVCCSSLFTS 5
             N   TS F+    + + N   K+   WI+DT ++++   C   FT+
Sbjct: 354  -NSVTTSPFA--LHSPSSNPNGKNPSLWILDTASTNN---CHLSFTT 394


>gb|KHN07990.1| hypothetical protein glysoja_045923, partial [Glycine soja]
          Length = 484

 Score =  250 bits (639), Expect = 2e-72
 Identities = 139/407 (34%), Positives = 215/407 (52%), Gaps = 19/407 (4%)
 Frame = -2

Query: 1168 FPTEDPNKPFFLPPSDNSHTIEIRPPLDGTNYTAWSRAFSLSISVKNKQGFLDGTIPTPD 989
            F T   N P++L P++N   + + P L   NY  WSR+  +++  KNK  F+DG++P P 
Sbjct: 1    FSTNSAN-PYYLHPNENPALVLVSPSLTAKNYHTWSRSMHIALISKNKDKFIDGSLPKPP 59

Query: 988  FSDPLYIPWLRCNNLILTWLINSVTKEIASSIIYMNSAKDVWDTLKLRYSQPDSVRIXXX 809
             SDPLY PW+RCN ++L W+  S++  IA S++++++A  VW  L++R+SQ D  RI   
Sbjct: 60   VSDPLYAPWIRCNTMVLAWIHRSISDSIAKSVLWIDTAAGVWKNLRIRFSQSDIFRISDL 119

Query: 808  XXXXXXXXXXXXXXSDYFTSLNTIWEELRNYRPIPHCSCG-QCTCQAIKSVGEIQLSDYT 632
                          SDYFT L   W+EL NYRPIPHC C   C+C  I SV   +  DY 
Sbjct: 120  QEDLYRFRQGTLDVSDYFTQLKIYWDELENYRPIPHCKCSIPCSCGGIDSVRVYREQDYV 179

Query: 631  FKFLMGLNETYDSTRGQILLMNPIPSLDTVYAMLLQEERQR--EARLSFMPSSESSALAV 458
             +FL GLN+ +  ++ QI++MNP+P +D V+++++Q+ER+       S   ++  SA+A+
Sbjct: 180  IRFLKGLNDRFSHSKSQIMMMNPLPDIDHVFSLVIQQEREMLGSNSDSVSEATSDSAMAM 239

Query: 457  GAHPSKKKFK---------------LDIICQHCGKPGHSIDKCFRLIGFPPNFKFTKGKN 323
              + ++  F                 + +C HCGK  H +D CF  IG+PP +K  K KN
Sbjct: 240  QVNSNQSNFNGKGGYYNKGKGSSKGGNRVCTHCGKTNHIVDNCFEKIGYPPGYKTNKSKN 299

Query: 322  AAGKGIGQNHS-ANCIPPPEIPAASSDKTKHFSFTQEQVQKLMTLLNGDPMEVSQPSPAP 146
            ++      N S A+ +   E     S     F FTQE  Q ++  L    +  SQP    
Sbjct: 300  SSSSSQANNTSNASAL---ESTQQGSSAQSSFQFTQEMYQGILEALQQSKVG-SQPKA-- 353

Query: 145  DNPSNTSHFSNMAGNITLNSQFKSKFSWIIDTGASDHIVCCSSLFTS 5
             N   TS F+    + + N   K+   WI+DT ++++   C   FT+
Sbjct: 354  -NLVTTSPFA--LHSPSSNPNGKNPSLWILDTASTNN---CHLSFTT 394


>ref|XP_014630525.1| PREDICTED: uncharacterized protein LOC106798459 [Glycine max]
          Length = 389

 Score =  243 bits (621), Expect = 7e-71
 Identities = 133/389 (34%), Positives = 202/389 (51%), Gaps = 19/389 (4%)
 Frame = -2

Query: 1189 ATANRSPFPTEDPNKPFFLPPSDNSHTIEIRPPLDGTNYTAWSRAFSLSISVKNKQGFLD 1010
            A  N   F T   N P++L P++N   + + P L   NY  WS +  +++  KNK  F+D
Sbjct: 2    ALQNFVDFSTNSAN-PYYLHPNENPALVLVSPSLTAKNYHTWSHSMHIALISKNKDKFID 60

Query: 1009 GTIPTPDFSDPLYIPWLRCNNLILTWLINSVTKEIASSIIYMNSAKDVWDTLKLRYSQPD 830
            G++P P  SDPLY PW+RCN ++L W+  S++  IA S++++++A  VW  L++R+SQ D
Sbjct: 61   GSLPKPPVSDPLYAPWIRCNTMVLAWIHRSISDSIAKSVLWIDTAAGVWKNLRIRFSQSD 120

Query: 829  SVRIXXXXXXXXXXXXXXXXXSDYFTSLNTIWEELRNYRPIPHCSCG-QCTCQAIKSVGE 653
              RI                 SDYFT L   W+EL NYRPIPHC C   C+C  I SV  
Sbjct: 121  IFRISDLQEDLYRFRQGTLDVSDYFTQLKIYWDELENYRPIPHCKCSIPCSCGGIDSVRV 180

Query: 652  IQLSDYTFKFLMGLNETYDSTRGQILLMNPIPSLDTVYAMLLQEERQR--EARLSFMPSS 479
             +  DY  +FL GLN+ +  ++ QI++MNP+P +D V+++++Q+ER+       S   ++
Sbjct: 181  YREQDYVVRFLKGLNDRFSHSKSQIMMMNPLPDIDHVFSLVIQQEREMLGSNSDSVSEAT 240

Query: 478  ESSALAVGAHPSKKKFK---------------LDIICQHCGKPGHSIDKCFRLIGFPPNF 344
              SA+A+  + ++  F                 + +C HCGK  H +D CF  IG+PP +
Sbjct: 241  SDSAMAMQVNSNQSNFNGKGGYYNKGKGSSKGGNRVCTHCGKTNHIVDNCFEKIGYPPGY 300

Query: 343  KFTKGKNAAGKGIGQNHS-ANCIPPPEIPAASSDKTKHFSFTQEQVQKLMTLLNGDPMEV 167
            K  K KN++      N S A+ +   E     S     F FTQE  Q ++  L    +  
Sbjct: 301  KTNKSKNSSSSSQANNTSNASAL---ESTQQGSSAQSSFQFTQEMYQGILEALQQSKVG- 356

Query: 166  SQPSPAPDNPSNTSHFSNMAGNITLNSQF 80
            SQP     N   TS F+  + +   N  F
Sbjct: 357  SQPKA---NSVTTSPFALHSPSSNPNESF 382


>ref|XP_012835096.1| PREDICTED: uncharacterized protein LOC105955841, partial [Erythranthe
            guttata]
          Length = 514

 Score =  243 bits (621), Expect = 2e-69
 Identities = 146/407 (35%), Positives = 214/407 (52%), Gaps = 22/407 (5%)
 Frame = -2

Query: 1165 PTEDPNKPFFLPPSDNSHTIEIRPPLDGTNYTAWSRAFSLSISVKNKQGFLDGTIPTPDF 986
            P  D + P FL PSD  + I +       NY +WSRA ++S++VKNK GF+DGTI  P  
Sbjct: 8    PLGDVSHPMFLHPSDGPNLILVSQLFTEDNYASWSRAMTISLTVKNKIGFIDGTISEPA- 66

Query: 985  SDPLYI--PWLRCNNLILTWLINSVTKEIASSIIYMNSAKDVWDTLKLRYSQPDSVRIXX 812
            +D L +   W+R NN++++W+INSV+K+I  SI+Y NS+K++WD LK R+SQ +  RI  
Sbjct: 67   ADELVMRNAWIRNNNIVMSWIINSVSKDIQGSIMYSNSSKEIWDDLKTRFSQTNGPRIFQ 126

Query: 811  XXXXXXXXXXXXXXXSDYFTSLNTIWEELRNYRPIPHCSCGQCTCQAIKSVGEIQLSDYT 632
                           + YFT +  IW+EL NYRP   CSCG+C C   + +      +Y 
Sbjct: 127  LRRDLANLTQGSQSVNVYFTKVKAIWDELVNYRPC--CSCGKCDCGGFEKLQAHYNQEYV 184

Query: 631  FKFLMGLNETYDSTRGQILLMNPIPSLDTVYAMLLQEERQREARLSFMPSSESSALAVGA 452
              FLMGLNE+  STRGQILLM+P+P +  V+A + QEERQR    S + SS  S  +V  
Sbjct: 185  MSFLMGLNESLASTRGQILLMDPLPPISKVFAFVSQEERQRSVVSSHVESS-GSVFSVKN 243

Query: 451  HPSKK-----------KFKLDIICQHCGKPGHSIDKCFRLIGFPPNFKFTKGKNAAGKGI 305
               K+           K K    C HC   GH+++KC++L G+PP++K  K + ++    
Sbjct: 244  EGFKRSINNQFYNTGFKKKERSFCTHCNMQGHTVEKCYKLHGYPPSYKPQKSRFSSPANQ 303

Query: 304  GQNHSANCIPPPEIPAASSDKTKHF--SFTQEQVQKLMTLLNGDPMEVSQPSPAPDNPSN 131
                 ++          SS     +  S T  Q Q+ M++ +       Q S A   P +
Sbjct: 304  VSGFDSSLDSHSSDSGVSSQHVDGYLQSMTPSQCQQFMSMFSSHMAAQQQQSAASAQPQS 363

Query: 130  TSHFSNMA------GNITLN-SQFKSKFSWIIDTGASDHIVCCSSLF 11
            ++H ++ A      G   L+ +   S   WI+D+GAS HI     LF
Sbjct: 364  SAHGADTATVSCVTGTCALSGAPSLSSTDWILDSGASKHICHDKQLF 410


>gb|KHN34741.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial
            [Glycine soja]
          Length = 495

 Score =  241 bits (615), Expect = 1e-68
 Identities = 128/398 (32%), Positives = 201/398 (50%), Gaps = 18/398 (4%)
 Frame = -2

Query: 1144 PFFLPPSDNSHTIEIRPPLDGTNYTAWSRAFSLSISVKNKQGFLDGTIPTPDFSDPLYIP 965
            P++L P++N   + + P L   NY  WSR+  +++  KNK  F+DG++P P  SDPLY P
Sbjct: 5    PYYLHPNENPALVLVSPSLTAKNYHTWSRSMHIALISKNKDKFIDGSLPKPPVSDPLYAP 64

Query: 964  WLRCNNLILTWLINSVTKEIASSIIYMNSAKDVWDTLKLRYSQPDSVRIXXXXXXXXXXX 785
            W+RCN ++L W+  S++  IA S++++++A  VW  L++R+S  D  RI           
Sbjct: 65   WIRCNTMVLAWIHRSISDSIAKSVLWIDTAAGVWKNLRIRFSHSDIFRISDLQEDLYRFR 124

Query: 784  XXXXXXSDYFTSLNTIWEELRNYRPIPHCSCG-QCTCQAIKSVGEIQLSDYTFKFLMGLN 608
                  SDYFT L   W+EL NYRPIP+C C   C+C  I SV   +  DY  +FL GLN
Sbjct: 125  QGTLDVSDYFTQLKIYWDELENYRPIPYCKCSIPCSCGGIDSVRVYREQDYVIRFLKGLN 184

Query: 607  ETYDSTRGQILLMNPIPSLDTVYAMLLQEERQR--EARLSFMPSSESSALAVGAHPSKKK 434
            + +  ++ QI++MNP+P +D V+++++Q+ER+       S   ++  SA+A+  + ++  
Sbjct: 185  DRFSHSKSQIMMMNPLPDIDHVFSLVIQQEREMLGSNSDSVSEATSDSAMAMQVNSNQSN 244

Query: 433  FK---------------LDIICQHCGKPGHSIDKCFRLIGFPPNFKFTKGKNAAGKGIGQ 299
            F                 + +C HCGK  H +D CF  IG+PP +K  K KN++      
Sbjct: 245  FNGKGGYYNKGKGSSKGGNRVCTHCGKTNHIVDNCFEKIGYPPGYKTNKSKNSSSSSQAN 304

Query: 298  NHSANCIPPPEIPAASSDKTKHFSFTQEQVQKLMTLLNGDPMEVSQPSPAPDNPSNTSHF 119
            N S          A++ + T+  S  Q         +   P  +  PS  P+        
Sbjct: 305  NTS---------NASALESTQQGSSAQS--------ITTSPFALHSPSSNPNG------- 340

Query: 118  SNMAGNITLNSQFKSKFSWIIDTGASDHIVCCSSLFTS 5
                         K+   WI+DTGA+DHI    S  T+
Sbjct: 341  -------------KNPSLWILDTGATDHITFDLSSLTT 365


>gb|KYP75905.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus
            cajan]
          Length = 594

 Score =  243 bits (620), Expect = 2e-68
 Identities = 142/407 (34%), Positives = 214/407 (52%), Gaps = 14/407 (3%)
 Frame = -2

Query: 1189 ATANRSPFPTEDPNKPFFLPPSDNSHTIEIRPPLDGTNYTAWSRAFSLSISVKNKQGFLD 1010
            +T N S  PT+D   P+FL PSDN     +  PL+G NY +WSRA  +++  KNK GF+D
Sbjct: 3    STNNSSSLPTDDYANPYFLHPSDNPGAFIVSQPLNGDNYNSWSRAILMALGEKNKIGFVD 62

Query: 1009 GTIPTPDFSDPLYIPWLRCNNLILTWLINSVTKEIASSIIYMNSAKDVWDTLKLRYSQPD 830
            GTIP P  +D  Y  W R NN++ +WL+N ++K++ +S+IY +SA  +W+ L++R+ Q +
Sbjct: 63   GTIPKPLPTDKSYHSWQRNNNIVASWLLNFISKDLQASVIYSSSATAIWNDLRIRFQQHN 122

Query: 829  SVRIXXXXXXXXXXXXXXXXXSDYFTSLNTIWEELRNYRPIPHCSCGQCTCQAIKSVGEI 650
              R+                 + YFT +  +WEEL  Y+P   C+CG      IK   + 
Sbjct: 123  GPRVFQLRRDLVTLKQGSLNITHYFTKIKALWEELAEYQPSHACTCG-----GIKPWIDH 177

Query: 649  QLSDYTFKFLMGLNETYDSTRGQILLMNPIPSLDTVYAMLLQEERQREARL--------- 497
              S+Y   FLMGLNE Y   RGQILLM+PIP ++  ++++LQEE+Q+E  +         
Sbjct: 178  HQSEYAMLFLMGLNEGYSHIRGQILLMDPIPPIEKGFSLVLQEEKQQELGIPTNSNDTPT 237

Query: 496  SFMPSSESSALAVGAHPSKKKFKLDIICQHCGKPGHSIDKCFRLIGFPPNFKFTKGKNAA 317
            +F   S + A +   +P+K++ K    C+HCGK GH  DKCF+L G+P + K        
Sbjct: 238  AFAYKSGNDAKSRTNNPTKERPK----CEHCGKLGHIKDKCFKLHGYPTHLK-------- 285

Query: 316  GKGIGQNHSANCIPPPEIPAASSDKTKHFSFTQEQVQKLMTLLNGDPMEVSQPSPAPDNP 137
                 Q +S        +   S    K F FT +Q  ++++LL              + P
Sbjct: 286  -----QGNSNT-----NVNQVSDKSAKAFQFTTDQYHQILSLLQ-------------NQP 322

Query: 136  SNTSHFSNMAGN---ITLNSQFKS--KFSWIIDTGASDHIVCCSSLF 11
            S+    SN   N   +++   F S     WI+D+GAS H+ C  SLF
Sbjct: 323  SSNCIESNPIVNGLLLSIRPSFNSIPSTKWILDSGASTHVACSLSLF 369


>ref|XP_012833844.1| PREDICTED: uncharacterized protein LOC105954710 [Erythranthe guttata]
          Length = 659

 Score =  242 bits (618), Expect = 1e-67
 Identities = 145/406 (35%), Positives = 215/406 (52%), Gaps = 21/406 (5%)
 Frame = -2

Query: 1165 PTEDPNKPFFLPPSDNSHTIEIRPPLDGTNYTAWSRAFSLSISVKNKQGFLDGTIPTPDF 986
            P +D + P FL PSD  + I +   L   NY +WSRA ++S++VKNK GF+DGTI  P  
Sbjct: 8    PLDDVSHPMFLHPSDGPNLILVSQLLTEDNYASWSRAMTISLTVKNKIGFIDGTISEPP- 66

Query: 985  SDPLYI--PWLRCNNLILTWLINSVTKEIASSIIYMNSAKDVWDTLKLRYSQPDSVRIXX 812
            +D L +   W+R NN++++W+INSV+K+I  SI+Y NS+K++WD LK R+SQ +  RI  
Sbjct: 67   ADELIMRNAWIRNNNIVMSWIINSVSKDIQGSIMYSNSSKEIWDDLKTRFSQTNGPRIFQ 126

Query: 811  XXXXXXXXXXXXXXXSDYFTSLNTIWEELRNYRPIPHCSCGQCTCQAIKSVGEIQLSDYT 632
                           + YFT +  IW+EL NYRP   CSCG+C C   + +      +Y 
Sbjct: 127  LRRDLANLTQGSQSVNVYFTKVKAIWDELANYRPC--CSCGKCDCGGFEKLQAHYNQEYV 184

Query: 631  FKFLMGLNETYDSTRGQILLMNPIPSLDTVYAMLLQEERQREARLSFMPSSESSALAVGA 452
              FLMGLN++  STRGQILLM+P+P +  V+A + QEERQR    S + SS  S  +V  
Sbjct: 185  MSFLMGLNDSLASTRGQILLMDPLPPISKVFAFISQEERQRSVVSSHVDSS-GSVFSVKN 243

Query: 451  HPSKK-----------KFKLDIICQHCGKPGHSIDKCFRLIGFPPNFKFTKGKNAAGKGI 305
               K+           K +    C HC   GH+++KC++L G+PP++K  K + ++    
Sbjct: 244  EGFKRSINNQFYNPGLKKRERSFCTHCNMQGHTVEKCYKLHGYPPSYKPQKSRFSSHVNQ 303

Query: 304  GQNHSANCIPPPEIPAASSDKTKHF--SFTQEQVQKLMTLLNGDPMEVSQPSPAPDNP-- 137
                 ++          SS +   +  S T  Q Q+ M++ +       Q S A   P  
Sbjct: 304  VSGFDSSLDSHSSDAGVSSQQVDGYLQSMTPSQCQQFMSMFSSHMAAQQQQSTASIQPQS 363

Query: 136  ---SNTSHFSNMAGNITLNS-QFKSKFSWIIDTGASDHIVCCSSLF 11
               ++T+  S + G   L+     S   WI+D+GAS HI     LF
Sbjct: 364  AHGADTATVSCVTGICALSGVPSLSSADWILDSGASKHICHDKQLF 409


>ref|XP_006576053.1| PREDICTED: uncharacterized protein LOC102662412 [Glycine max]
          Length = 424

 Score =  236 bits (601), Expect = 2e-67
 Identities = 136/416 (32%), Positives = 214/416 (51%), Gaps = 26/416 (6%)
 Frame = -2

Query: 1174 SPFPTEDPNKPFFLPPSDNSHTIEIRPPLDGTNYTAWSRAFSLSISVKNKQGFLDGTIPT 995
            S F T +P+ P+++ P++N   I ++P LD  NY  W R+  +++  KNK  F+DGT+  
Sbjct: 6    SDFAT-NPSNPYYMHPNENPSLILVQPVLDNKNYQIWCRSMKVALISKNKVKFVDGTLSP 64

Query: 994  PDFSDPLYIPWLRCNNLILTWLINSVTKEIASSIIYMNSAKDVWDTLKLRYSQPDSVRIX 815
            P  SDPLY PWLRCNNL+L+WL  S ++EIA S+++ + A  VW +L+ R+SQ D  R+ 
Sbjct: 65   PPISDPLYEPWLRCNNLVLSWLQRSTSEEIAKSLLWCDRASFVWKSLENRFSQGDIFRVA 124

Query: 814  XXXXXXXXXXXXXXXXSDYFTSLNTIWEELRNYRPIPHCSCG-QCTCQAIKSVGEIQLSD 638
                            S YFT L T+WEE+ N+RPI  C+C   C+C A   + + +  D
Sbjct: 125  DIQEEVACLQQGTLDISSYFTKLMTLWEEIENFRPIRDCTCAIPCSCGAATDLRKFKEQD 184

Query: 637  YTFKFLMGLNETYDSTRGQILLMNPIPSLDTVYAMLLQEERQREARLSFMPSSE--SSAL 464
               KFL GL + Y   R QI+LM+P+P+LD  + ++LQ+ERQ     +   S E  SS  
Sbjct: 185  KVIKFLKGLGDQYSHVRSQIMLMSPLPTLDNAFNLILQQERQFNLPSTTDSSIENQSSVN 244

Query: 463  AVGAHPSKKKFKLDI--------------ICQHCGKPGHSIDKCFRLIGFPPNFKFTKGK 326
                 PS+                     +C HC +  H+++ CF   G+PP F+  K  
Sbjct: 245  HFSQTPSRPSNNSGCGRGRGYSSGGRGNRLCTHCNRTNHTVETCFIKHGYPPGFQHRKSN 304

Query: 325  NAAGKGIGQN----HSANCIPPPEIPAASSDKTKHFSFTQEQVQKLMTLLNGDPMEVSQP 158
            ++    +  +     SA+         +++  +   S  QEQ  +++ LL    ++ + P
Sbjct: 305  SSGNASVVNSVQDAGSAHISSSSSASTSTNGSSASLSTIQEQYTQILQLLQQSNLQSTSP 364

Query: 157  SP-----APDNPSNTSHFSNMAGNITLNSQFKSKFSWIIDTGASDHIVCCSSLFTS 5
            S      A ++ S+TS   +   N++ N    +   WI+DTGA+DHI      F+S
Sbjct: 365  SSVNSVFATNSVSHTSPSPSSGKNLSNN----TSHWWIVDTGATDHITHIFDSFSS 416


>gb|KYP31881.1| Putative transposon Ty5-1 protein YCL075W family [Cajanus cajan]
          Length = 437

 Score =  234 bits (597), Expect = 9e-67
 Identities = 135/392 (34%), Positives = 207/392 (52%), Gaps = 14/392 (3%)
 Frame = -2

Query: 1165 PTEDPNKPFFLPPSDNSHTIEIRPPLDGTNYTAWSRAFSLSISVKNKQGFLDGTIPTPDF 986
            P+ DP  P FL  SD         PLD  NYT WSRA  +++ VKNK  F+DG++P P  
Sbjct: 8    PSSDPTNPLFLHHSDGPGLFLTSQPLDNKNYTTWSRAMLVALGVKNKIPFVDGSLPRPAA 67

Query: 985  SDPLYIPWLRCNNLILTWLINSVTKEIASSIIYMNSAKDVWDTLKLRYSQPDSVRIXXXX 806
             DP Y  W+  NN++++WL NSV+KEI +SI++ N AK++WD LK R+S+ +  RI    
Sbjct: 68   DDPTYAAWIHGNNVVISWLYNSVSKEIITSILFANIAKEIWDDLKSRFSRKNGPRIFQLR 127

Query: 805  XXXXXXXXXXXXXSDYFTSLNTIWEELRNYRPIPHCSCGQCTCQAIKSVGEIQLSDYTFK 626
                         S Y+T L +IWE+L  Y+P   C+CG    Q ++   ++   +Y   
Sbjct: 128  RQLTSLQQGTDDVSTYYTKLKSIWEDLSGYKPSFPCTCG--GLQHLQVYNDL---EYVMS 182

Query: 625  FLMGLNETYDSTRGQILLMNPIPSLDTVYAMLLQEERQRE--ARLSFMPSSESSALAVGA 452
            FLMGLN+++   RGQILL +P+P +  V++++LQEE QRE    ++  PS  S  +A   
Sbjct: 183  FLMGLNDSFSQIRGQILLSDPLPPIGNVFSLVLQEETQREIGTAVTHTPSINSDNMAFDV 242

Query: 451  HPSKKKFKLDII---------CQHCGKPGHSIDKCFRLIGFPPNFKFTKGKNAAGKGIGQ 299
            + S K    D           C +CG  GH+ DKC++L+G+PPN+ F   +      + +
Sbjct: 243  NSSTKSSAADHYKFNRRERPKCAYCGLLGHTKDKCYKLVGYPPNYNFKNRQTPVANQVLE 302

Query: 298  NHSANCIPPPEIPAASSDKTKHFSFTQEQVQKLMTLLNGDPMEVSQPSPAPDNPSNTSHF 119
            +      P P       ++ K  + T  Q Q+L+  L  + M++  P  A   P+N +  
Sbjct: 303  S------PEP------LNQNKPDNLTPAQCQQLINFLT-NQMKLDNPDEAV--PTNVT-- 345

Query: 118  SNMAGNITLNSQF---KSKFSWIIDTGASDHI 32
                  I +N+ F      + W+ID+GA+ HI
Sbjct: 346  -----GICMNTHFLLHNITYRWVIDSGATSHI 372


>gb|KYP74100.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus
            cajan]
          Length = 444

 Score =  231 bits (590), Expect = 1e-65
 Identities = 130/401 (32%), Positives = 217/401 (54%), Gaps = 8/401 (1%)
 Frame = -2

Query: 1183 ANRSPFPTEDPNKPFFLPPSDNSHTIEIRPPLDGTNYTAWSRAFSLSISVKNKQGFLDGT 1004
            A+++  P++D + P FL  SD    +    PLD  NYT WSRA  +++ VKNK  F+DGT
Sbjct: 2    ADQAKDPSQDVSNPLFLHHSDGPGLVLTSQPLDHKNYTTWSRAMQVALFVKNKLAFIDGT 61

Query: 1003 IPTPDFSDPLYIPWLRCNNLILTWLINSVTKEIASSIIYMNSAKDVWDTLKLRYSQPDSV 824
            +P P  +D  ++ W   NN++++WL NSV+K+I +SI++ ++A+++W  LK R+S+ +  
Sbjct: 62   LPKPASTDSTFVAWNHANNVVISWLYNSVSKDIITSILFASTAQEIWHDLKTRFSKKNGS 121

Query: 823  RIXXXXXXXXXXXXXXXXXSDYFTSLNTIWEELRNYRPIPHCSCGQCTCQAIKSVGEIQL 644
            RI                 S Y+T L +IWEEL  Y+P       QCTC  ++ +     
Sbjct: 122  RIFQLRRQLMSLHQGMDDISTYYTKLKSIWEELSGYKP-----TFQCTCGGLQQLQSFTE 176

Query: 643  SDYTFKFLMGLNETYDSTRGQILLMNPIPSLDTVYAMLLQEERQREARLSFMP---SSES 473
            S+Y   FLMGLN++    RGQILL +P+PS+  V++++LQ+E QRE  ++  P   +S++
Sbjct: 177  SEYVMSFLMGLNDSISQIRGQILLSDPLPSIGNVFSLVLQDEAQREIAVTSSPPVANSDN 236

Query: 472  SALAVGAH---PSKKKF--KLDIICQHCGKPGHSIDKCFRLIGFPPNFKFTKGKNAAGKG 308
                V +     S+ +F  K    C HC   GH+ D C++L+G+PPN+            
Sbjct: 237  IVFTVNSSQPATSRNRFTKKERPRCAHCNILGHTKDTCYKLVGYPPNY------------ 284

Query: 307  IGQNHSANCIPPPEIPAASSDKTKHFSFTQEQVQKLMTLLNGDPMEVSQPSPAPDNPSNT 128
              +NH+ N +      + +   ++  + T +Q Q+L+  L       +Q        + T
Sbjct: 285  -FKNHTTNTVNQVTGSSDNVLTSQSSNLTPDQRQQLINFL------TNQMQADTTLDAIT 337

Query: 127  SHFSNMAGNITLNSQFKSKFSWIIDTGASDHIVCCSSLFTS 5
            ++ + +  N+ L++ +    +WIID+GA+ HI C   LF S
Sbjct: 338  TNVTGICMNVALDNNY---HTWIIDSGATSHICCFKHLFHS 375


>gb|KYP46603.1| hypothetical protein KK1_031768 [Cajanus cajan]
          Length = 483

 Score =  232 bits (591), Expect = 2e-65
 Identities = 134/409 (32%), Positives = 213/409 (52%), Gaps = 10/409 (2%)
 Frame = -2

Query: 1204 AQITQATANRSPFPTEDPNK---PFFLPPSDNSHTIEIRPPLDG-TNYTAWSRAFSLSIS 1037
            A  +Q  ++       DPN     +F+ P++N     +   L G +NY  W+RA   ++ 
Sbjct: 7    ASSSQNASSSQGADLSDPNNRLSEYFIHPNENPSASLVAKLLIGLSNYHIWARAMRRNLI 66

Query: 1036 VKNKQGFLDGTIPTPDFSDPLYIPWLRCNNLILTWLINSVTKEIASSIIYMNSAKDVWDT 857
             KNK  F+DG+   PD  DPLY  W RCNNL+ +W+++SV+  IA SI YM  A DVW  
Sbjct: 67   TKNKFRFVDGSNLVPDRFDPLYGAWERCNNLVNSWILSSVSPTIADSIDYMEYASDVWKD 126

Query: 856  LKLRYSQPDSVRIXXXXXXXXXXXXXXXXXSDYFTSLNTIWEELRNYRPIPHCSC-GQCT 680
            L+ R++Q D VRI                  +YFT L T+WEEL NY P+P+C C  +C 
Sbjct: 127  LRERFAQSDLVRISELQYEIFSHKQGNFSVIEYFTHLKTLWEELENYIPVPYCPCRTKCA 186

Query: 679  CQAIKSVGEIQLSDYTFKFLMGLNETYDSTRGQILLMNPIPSLDTVYAMLLQEERQ---- 512
            C A++ +   +  DY  +FL GLN+ Y++ + QILL + +PSL+  ++M++Q ERQ    
Sbjct: 187  CPALRDIKSYRDEDYVIRFLQGLNDDYNALKSQILLKDNLPSLNKAFSMVVQHERQYGLE 246

Query: 511  REARLSFMPSSESSALAVGAHPSKKKFKLDIICQHCGKPGHSIDKCFRLIGFPPNFKFTK 332
             E     + +  +S    G++    K   +  C HC K GH+I+ C++  G PPN +F  
Sbjct: 247  PENDNQVLVNYSNSRRGKGSYSGSSKSYNERYCTHCKKHGHTIEVCYQKHGLPPNLRFK- 305

Query: 331  GKNAAGKGIGQNHSANCIPPPEIPAASSDKTKHFSFTQEQVQKLMTLL-NGDPMEVSQPS 155
              N++   + Q+ + N        A  + K +  +FT+E+ + L+ LL N     +   +
Sbjct: 306  -TNSSANVVSQDGNQNESEDEITDATGTGKDEVPTFTKEEYKSLLALLHNSQSQGIHVAN 364

Query: 154  PAPDNPSNTSHFSNMAGNITLNSQFKSKFSWIIDTGASDHIVCCSSLFT 8
                   +    S  +G + + S+  ++  WI+D GA+DHI C   LF+
Sbjct: 365  QFKTVSISALSESAESGKLLMFSKCSNEVLWILDFGATDHICCSLDLFS 413


>ref|XP_009774775.1| PREDICTED: uncharacterized protein LOC104224769 [Nicotiana
            sylvestris]
          Length = 446

 Score =  229 bits (583), Expect = 1e-64
 Identities = 135/397 (34%), Positives = 208/397 (52%), Gaps = 10/397 (2%)
 Frame = -2

Query: 1165 PTEDPNKPFFLPPSDNSHTIEIRPPLDGTNYTAWSRAFSLSISVKNKQGFLDGTIPTPDF 986
            PT D + P+FL PSD+     +    DG  Y  W R+  +++S K K GF+DG+   P F
Sbjct: 18   PTIDASHPYFLYPSDSPGMTLVTSVFDGWGYGGWRRSLLIALSTKYKLGFIDGSCSAPAF 77

Query: 985  SDPLYIPWLRCNNLILTWLINSVTKEIASSIIYMNSAKDVWDTLKLRYSQPDSVRIXXXX 806
                +  W RCN++I +WL+NS++KEI +S +Y  SA+ +W  L+ R+ Q +  ++    
Sbjct: 78   DSTSFSLWTRCNDMITSWLLNSLSKEIVASALYSKSAQALWTDLEDRFGQSNGAKLYHLQ 137

Query: 805  XXXXXXXXXXXXXSDYFTSLNTIWEELRNYRPIPHCSCGQCTCQA-IKSVGEIQLSDYTF 629
                         + YFT L   W+EL         SC  CTC   +K V  +Q ++   
Sbjct: 138  KEISDLMQGSSDIAGYFTKLKLSWDELDAIYTTVTYSCA-CTCSGKVKLVKSLQ-NERLI 195

Query: 628  KFLMGLNETYDSTRGQILLMNPIPSLDTVYAMLLQEERQREARLS-FMPSSESSALAVGA 452
            +FLMGLN+TY   R  IL+M+P+PS++  Y++L+Q+E+QRE  ++   P   SS LA   
Sbjct: 196  QFLMGLNDTYSPVRSNILMMSPLPSINIAYSLLVQDEKQREVYVNPQFPGDFSSFLATHQ 255

Query: 451  HPSKKKF--------KLDIICQHCGKPGHSIDKCFRLIGFPPNFKFTKGKNAAGKGIGQN 296
            + S +K         K ++IC HC KPGHS+DKC+R+IGFP +FKFTK     G     +
Sbjct: 256  NISGQKSQSSDFKGRKNNLICSHCKKPGHSVDKCYRIIGFPSDFKFTKTPKLHG-----S 310

Query: 295  HSANCIPPPEIPAASSDKTKHFSFTQEQVQKLMTLLNGDPMEVSQPSPAPDNPSNTSHFS 116
              +N I      A  +  T     TQ+Q  +L+ LLN   +  +       N +      
Sbjct: 311  VKSNAI--LSFHAQPTGNTGGNPITQDQFSQLIHLLNNAQLGHTGSPTTKVNANVVQCVG 368

Query: 115  NMAGNITLNSQFKSKFSWIIDTGASDHIVCCSSLFTS 5
            N+  N ++   + +  SWIID+GAS+H+   +  F +
Sbjct: 369  NIFNNPSIYLTYANTHSWIIDSGASEHMSYDTKFFAT 405


>ref|XP_007044837.1| Uncharacterized protein TCM_010591 [Theobroma cacao]
            gi|508708772|gb|EOY00669.1| Uncharacterized protein
            TCM_010591 [Theobroma cacao]
          Length = 336

 Score =  224 bits (572), Expect = 2e-64
 Identities = 123/328 (37%), Positives = 186/328 (56%), Gaps = 1/328 (0%)
 Frame = -2

Query: 1165 PTEDPNKPFFLPPSDNSHTIEIRPPLDGTNYTAWSRAFSLSISVKNKQGFLDGTIPTPDF 986
            P E+    +++  SD   ++ I P L   NY +WSRAF L++S+  K+GF+DGTI  P  
Sbjct: 12   PAENLLSSYYIHHSDLHGSVVINPKLAVANYMSWSRAFLLALSICKKRGFIDGTIKKPSE 71

Query: 985  SDPLYIPWLRCNNLILTWLINSVTKEIASSIIYMNSAKDVWDTLKLRYSQPDSVRIXXXX 806
            ++ L+  W RCN LI+TWL+ S+T +IAS+++ M+SAK++ +TLK R+SQP    I    
Sbjct: 72   ANSLFEDWSRCNILIVTWLLESLTPKIASNVLDMDSAKEILETLKNRFSQPYETIICNLQ 131

Query: 805  XXXXXXXXXXXXXSDYFTSLNTIWEELRNYRPIPHCSCGQCTCQAIKSVGEIQLSDYTFK 626
                         + YFT LN++W+EL+N+RP+P C          K   + Q  D  F 
Sbjct: 132  FQLRNILQGTRSVNTYFTELNSVWQELKNFRPLPQCDYEGRKNNCYKKYADQQNKDAVFC 191

Query: 625  FLMGLNETYDSTRGQILLMNPIPSLDTVYAMLLQEERQREARLSFMPSSESSALAVGAHP 446
            FL GLNE++   R  IL++ P  S+D  Y++++++  QR   L      E+S +A     
Sbjct: 192  FLNGLNESFSCLRSHILMLKPFLSIDQAYSLVIKKMLQRS--LILQSPVENSTMATVITE 249

Query: 445  SKKKFKLDIICQHCGKPGHSIDKCFRLIGFPPNFKFTKGKNAAGKGIGQNHSA-NCIPPP 269
             K+K   +++C HCGK GHS +K + +IGFP NFKFTK K    KG    +SA +     
Sbjct: 250  EKRK-NTNLVCSHCGKKGHSKEKYYCIIGFPENFKFTKLKRNMRKGGSSVNSAISGSEQD 308

Query: 268  EIPAASSDKTKHFSFTQEQVQKLMTLLN 185
            E     ++     S T+ Q+QKLMTL++
Sbjct: 309  EYDETVTNSISQLSLTKAQIQKLMTLIS 336


>ref|XP_012453130.1| PREDICTED: uncharacterized protein LOC105775144 [Gossypium raimondii]
          Length = 513

 Score =  228 bits (582), Expect = 9e-64
 Identities = 129/404 (31%), Positives = 207/404 (51%), Gaps = 21/404 (5%)
 Frame = -2

Query: 1153 PNKPFFLPPSDNSHTIEIRPPLDGTNYTAWSRAFSLSISVKNKQGFLDGTIPTPDFSDPL 974
            P+ P+FL P++N   + + P L   NY +WSRA  +++  KNK  F+DG+I  P  +D +
Sbjct: 9    PSSPYFLHPNENPSLVLVTPTLTSLNYNSWSRAMRMALLSKNKLKFVDGSILPPATTDSI 68

Query: 973  YIPWLRCNNLILTWLINSVTKEIASSIIYMNSAKDVWDTLKLRYSQPDSVRIXXXXXXXX 794
            Y  W RCNN++++WL +S+++ I +SI+++++A D+W  L  R+SQ D  RI        
Sbjct: 69   YPAWERCNNMVISWLHHSISQSIVNSILWIDTAHDIWRDLHKRFSQGDVFRISDLQDEIS 128

Query: 793  XXXXXXXXXSDYFTSLNTIWEELRNYRPIPHCSCG-QCTCQAIKSVGEIQLSDYTFKFLM 617
                     +DYFT L  +W+EL N+RP+P CSC  QC+C A  ++ +   +DY  +FL 
Sbjct: 129  VFKQEERSVTDYFTELKVLWDELLNFRPLPSCSCRVQCSCGAFTTIRKYHNNDYVIRFLK 188

Query: 616  GLNETYDSTRGQILLMNPIPSLDTVYAMLLQEERQREARLSFM--------PSSESSALA 461
            GLNE Y S R QI+L++P+P+++  ++M++Q+ RQ  A  S +        PS   S  +
Sbjct: 189  GLNERYASIRSQIMLLDPLPTINKAFSMVIQQGRQLLAPSSTVFASNAVRQPSKRPSQAS 248

Query: 460  VGAHPSKKKFKLDI-ICQHCGKPGHSIDKCFRLIGFPPNFKFTKGKNAAGKGIGQNHSAN 284
                      K+D   C  CG   H++D C+   GFPP +   K  N+  +        +
Sbjct: 249  SQVSSRSSDSKIDTRKCTFCGGLRHTVDTCYHKNGFPPGY---KSHNSTSRVHNMFEEID 305

Query: 283  CIPPPEIPAASSDKTKHFS---FTQEQVQKLMTLLNGDPMEVSQPSPAPDNPSNTSH--- 122
                      S   T   S    TQEQ+ +L+ LL     + + P+ +   P  T+    
Sbjct: 306  ADTVDSFTGYSQSVTSQGSGVTLTQEQITQLLALLPSSSNQSTNPTHSQPTPHLTNQVLA 365

Query: 121  -----FSNMAGNITLNSQFKSKFSWIIDTGASDHIVCCSSLFTS 5
                  ++  G  +    F S + WI+DT A+DHI    + F S
Sbjct: 366  TPSLTLASTEGIFSTPISFHSPY-WIVDTSATDHITHTLTSFAS 408


>gb|KYP49735.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus
            cajan]
          Length = 434

 Score =  226 bits (576), Expect = 9e-64
 Identities = 125/357 (35%), Positives = 194/357 (54%), Gaps = 11/357 (3%)
 Frame = -2

Query: 1048 LSISVKNKQGFLDGTIPTPDFSDPLYIPWLRCNNLILTWLINSVTKEIASSIIYMNSAKD 869
            ++++VKNK  F+DGT+P PD  DP ++PW R NN++++W+ NSV+KEI +SI++  +AK+
Sbjct: 3    VALTVKNKLSFIDGTLPKPDIEDPTFVPWNRENNVVISWIYNSVSKEIITSILFATTAKE 62

Query: 868  VWDTLKLRYSQPDSVRIXXXXXXXXXXXXXXXXXSDYFTSLNTIWEELRNYRPIPHCSCG 689
             WD LK R+S+ +  RI                 S Y+T L +IWEEL  Y+P       
Sbjct: 63   NWDDLKTRFSRKNGPRIFHLKRQLMSLQQGSDDVSTYYTKLKSIWEELAGYKP-----NF 117

Query: 688  QCTCQAIKSVGEIQLSDYTFKFLMGLNETYDSTRGQILLMNPIPSLDTVYAMLLQEERQR 509
            QCTC  ++S+ +   S+Y   FLMGLN+++   RGQILL +P+PS+  V++++LQEE Q+
Sbjct: 118  QCTCGGLESLHKHTQSEYVMSFLMGLNDSFSQIRGQILLSDPLPSIGNVFSLILQEETQK 177

Query: 508  EARLSFMPSSESSALAVGAHP--------SKKKF--KLDIICQHCGKPGHSIDKCFRLIG 359
            E  ++   S+ S  +A   +         +K KF  K  + C HC   GH+ DKC++L+G
Sbjct: 178  EIAVTHATSAHSDDMAFAVNQCSKTNFDNNKGKFVKKDRLKCAHCEMFGHTKDKCYKLVG 237

Query: 358  FPPNFKFTKGKNAAGKGIGQNHSANCIPPPEIPAASSDKTKHFSFTQEQVQKLMTLLNGD 179
            +PPN+              +N     +   +I   SS      + T  Q Q+LMTLLN  
Sbjct: 238  YPPNY-------------FKNRQPQVVNQVDISHESSTSNTALNLTPAQCQQLMTLLNNQ 284

Query: 178  PMEVSQPSPAPDNPSNTSHFSNMAGNITLNSQFKSK-FSWIIDTGASDHIVCCSSLF 11
                        + +N +  +     I +N  F  K  +WIID+GA+ HI C  +L+
Sbjct: 285  ----------IQSDNNLNAIATNVTGICMNVDFSDKNHTWIIDSGATSHICCSKTLY 331


>ref|XP_008350470.1| PREDICTED: uncharacterized protein LOC103413804 [Malus domestica]
          Length = 451

 Score =  226 bits (577), Expect = 1e-63
 Identities = 134/396 (33%), Positives = 197/396 (49%), Gaps = 10/396 (2%)
 Frame = -2

Query: 1189 ATANRSPFPT-EDPNKPFFLPPSDNSHTIEIRPPLDGTNYTAWSRAFSLSISVKNKQGFL 1013
            +T  ++P  T  D + PF L PSD    I +   L G NY  W RA  +S+S KNK G +
Sbjct: 9    STDTQNPMETIXDVSNPFILHPSDQPGNILVSKTLQGDNYNTWXRAMRISLSAKNKLGMV 68

Query: 1012 DGTIPTPDFSDPLYIPWLRCNNLILTWLINSVTKEIASSIIYMNSAKDVWDTLKLRYSQP 833
            DGTI  P  +D  +  W RCN+++L W++NSV  +IASS+ Y  +A DVW  L+ R+SQ 
Sbjct: 69   DGTIDPPSETDKQFASWXRCNDMVLAWILNSVHDDIASSVSYYTTATDVWADLRDRFSQG 128

Query: 832  DSVRIXXXXXXXXXXXXXXXXXSDYFTSLNTIWEELRNYRPIPHCSCGQCTCQAIKSVGE 653
            +  RI                 S Y+T L  +W+EL +Y   P C+CG      +K + +
Sbjct: 129  NDSRIYQIKREIVEHRQEQQSISVYYTKLKALWDELASYNETPTCTCG-----GLKKIND 183

Query: 652  IQLSDYTFKFLMGLNETYDSTRGQILLMNPIPSLDTVYAMLLQEERQREARLSFMPSSES 473
                +   +FLMGLN++Y + RGQILLM P+P     Y+++LQ+E+Q E  L+    +  
Sbjct: 184  RDEKERVMQFLMGLNDSYAAVRGQILLMQPLPDTRRAYSLVLQQEKQVEVSLNRNNINLH 243

Query: 472  SALAVGAHPSKKKFKLDIICQHCGKPGHSIDKCFRLIGFPPNFKF--------TKGKNAA 317
            +        +       + C +C    H++D+CF L GFPP  K+         K K AA
Sbjct: 244  AMNITRNRXTAAPKGNTJQCSYCDXKYHTVDRCFYLYGFPPGHKYHGKSVKPPNKRKPAA 303

Query: 316  GKGIGQNHSANCIPPPEIPAASSDKTKHFSFTQEQVQKLMTLLNGDPMEVSQPSPAPDNP 137
             +   +  +   +      A SSD  K   FT E+  +LM +L              +  
Sbjct: 304  NQVTVETETTKGVDSRH-KATSSDGPK---FTTEEYNQLMAMLK-----------KSNXD 348

Query: 136  SNTSHFSNMAGNITLNSQFKSK-FSWIIDTGASDHI 32
             N  HF+N  G IT +S    K   WIID+GA+DH+
Sbjct: 349  GNPQHFANATGTITPSSBLSEKTLYWIIDSGATDHV 384


Top