BLASTX nr result

ID: Rehmannia31_contig00009167

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia31_contig00009167
         (1167 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_017415208.1| PREDICTED: uncharacterized protein LOC108326...   453   e-155
ref|XP_019442216.1| PREDICTED: uncharacterized protein LOC109346...   454   e-155
gb|PNX75692.1| hypothetical protein L195_g031632 [Trifolium prat...   446   e-153
gb|PNY16257.1| copia protein (gag-int-pol protein) [Trifolium pr...   471   e-150
gb|PNX80172.1| hypothetical protein L195_g036169, partial [Trifo...   440   e-148
gb|PNX82965.1| hypothetical protein L195_g039002, partial [Trifo...   436   e-147
gb|PNY12475.1| retroelement pol polyprotein-like [Trifolium prat...   457   e-146
gb|PNY12308.1| retrovirus-related Pol polyprotein from transposo...   454   e-144
ref|XP_017415202.1| PREDICTED: retrovirus-related Pol polyprotei...   453   e-144
gb|PNX93622.1| retrovirus-related Pol polyprotein from transposo...   452   e-143
dbj|GAU43467.1| hypothetical protein TSUD_141050 [Trifolium subt...   413   e-141
dbj|GAU34891.1| hypothetical protein TSUD_144220 [Trifolium subt...   438   e-140
ref|XP_020970369.1| uncharacterized protein LOC107621341 [Arachi...   412   e-139
ref|XP_012847972.1| PREDICTED: uncharacterized protein LOC105967...   409   e-138
gb|PNX83470.1| hypothetical protein L195_g039513 [Trifolium prat...   409   e-137
gb|KYP36798.1| hypothetical protein KK1_042036 [Cajanus cajan]        401   e-136
ref|XP_021886856.1| uncharacterized protein LOC110806350 isoform...   397   e-134
ref|XP_021886855.1| uncharacterized protein LOC110806350 isoform...   397   e-133
gb|KZV52705.1| hypothetical protein F511_23168 [Dorcoceras hygro...   396   e-133
ref|XP_015385530.1| PREDICTED: uncharacterized protein LOC107176...   396   e-131

>ref|XP_017415208.1| PREDICTED: uncharacterized protein LOC108326303 isoform X6 [Vigna
            angularis]
          Length = 399

 Score =  453 bits (1165), Expect = e-155
 Identities = 219/380 (57%), Positives = 278/380 (73%)
 Frame = +2

Query: 26   MASEPHKTEKYEESSNRELAKKMISPYDLNSNDNPGNLITQVKLRGDENYDEWACAVRTS 205
            + SE  K  K  E S  E+ KKM SPYDL+++DNPGN+ITQV+L+G ENY+EWA AV+ S
Sbjct: 14   VTSELLKMAKEGEKSESEVVKKMSSPYDLSASDNPGNVITQVQLKG-ENYEEWAKAVKIS 72

Query: 206  LRARRKWGFVDGTISEPEKESSDFEDWWTVQSMLVSWIRNTIEPELRSTISHIENAKDLW 385
            LRARRKWGF+DGT +EPE ++S  EDWWT+QSMLVSWI NTIEP LRSTI+++ENAKDLW
Sbjct: 73   LRARRKWGFIDGTHTEPETDTSKIEDWWTIQSMLVSWILNTIEPNLRSTIAYMENAKDLW 132

Query: 386  EDIRERFSVANGPRIQQIKTELSECKQRGMSIVAYYGKLKSLWDDLANYDQIPTCTCTGC 565
            +DI+ERFS+ NGPRIQQ+K++L+ECKQ+GM++VAYYGKLK LWD+LANY+QIP C C GC
Sbjct: 133  DDIKERFSIVNGPRIQQLKSKLAECKQQGMTMVAYYGKLKILWDELANYEQIPQCKCGGC 192

Query: 566  KCNFSAKLEQRREEEKVHQFLMGLDEVSYGTVRSNLLATEPLPSLNKVYGVLVREERVKT 745
            KCN + KLE+RREEE+VHQFLMGLD+  YGT RSN+LAT+PLPSLN+VY  +V+EERV+ 
Sbjct: 193  KCNIATKLEKRREEERVHQFLMGLDDEGYGTTRSNVLATDPLPSLNRVYATMVQEERVRM 252

Query: 746  ITRVAEERGEIMGLAAHAGGRLKGRGEAKDKTAICSICSRTGHDSSTCFQVIGYPDWWXX 925
            ITR  EERG I+G+      + K R E K+K+ +C+ C RTGHD   CF++IGYPDWW  
Sbjct: 253  ITRSKEERGMIVGMVVQTETKGKLRNEVKEKSIVCTHCGRTGHDKRNCFEIIGYPDWWGE 312

Query: 926  XXXXXXXXXXXXXXXXXXXXXXXXXXIVRANAAQTLGENVTVATGSDLEKSGLAGLTNEQ 1105
                                        R N A T     +  + SD +K  +AGL+NEQ
Sbjct: 313  RPRNENKSGGRHQQRTTFFRGKGVTP--RVNIAHT--STSSSDSKSDTKKPEVAGLSNEQ 368

Query: 1106 WQNILAILNTHKTSTGEKMT 1165
            W+ +  +LN+HK +T EKMT
Sbjct: 369  WEILATMLNSHKANTTEKMT 388


>ref|XP_019442216.1| PREDICTED: uncharacterized protein LOC109346932 [Lupinus
            angustifolius]
          Length = 473

 Score =  454 bits (1169), Expect = e-155
 Identities = 220/367 (59%), Positives = 276/367 (75%), Gaps = 4/367 (1%)
 Frame = +2

Query: 77   ELAKKMISPYDLNSNDNPGNLITQVKLRGDENYDEWACAVRTSLRARRKWGFVDGTISEP 256
            E  +K+ SPYDL+S+DNP ++ITQV+LRG ENY+EWA A+RTSLRARRKWGF+DGTI +P
Sbjct: 6    ESVRKISSPYDLSSSDNPRSVITQVQLRG-ENYEEWARAMRTSLRARRKWGFIDGTIGKP 64

Query: 257  EKESSDFEDWWTVQSMLVSWIRNTIEPELRSTISHIENAKDLWEDIRERFSVANGPRIQQ 436
            E+ESS+ EDWWTVQSMLVSWI NT+EP LRSTIS++ENA+DLW+DI+ERFSV NGPRIQQ
Sbjct: 65   EEESSEMEDWWTVQSMLVSWILNTVEPNLRSTISYMENARDLWDDIKERFSVVNGPRIQQ 124

Query: 437  IKTELSECKQRGMSIVAYYGKLKSLWDDLANYDQIPTCTCTGCKCNFSAKLEQRREEEKV 616
            +K+EL+ CKQ  +S+V YYGK+KSLWD+LANY+QI  CTC GC C+ ++KLE+R+EEE+V
Sbjct: 125  LKSELAGCKQGAVSMVTYYGKMKSLWDELANYEQIHICTCRGCTCDIASKLEKRQEEERV 184

Query: 617  HQFLMGLDEVSYGTVRSNLLATEPLPSLNKVYGVLVREERVKTITRVAEERGEIMGLAAH 796
            HQFLMGLD+V YGTVRSNLLA +PLPSLN VY  L++EER+KTITR  EERG+I+G A  
Sbjct: 185  HQFLMGLDDVIYGTVRSNLLAADPLPSLNMVYSTLIQEERMKTITRAKEERGDIVGFAVQ 244

Query: 797  AGGRLKGRGEAKDKTAICSICSRTGHDSSTCFQVIGYPDWW--XXXXXXXXXXXXXXXXX 970
             G + + R + KDK  +CS C+R+GHD+  CFQ+IGYP+WW                   
Sbjct: 245  IGAKSREREDTKDKGDVCSHCNRSGHDTRNCFQLIGYPEWWADRPRNEGRSSGRGKGQQR 304

Query: 971  XXXXXXXXXXXIVRANAAQTL--GENVTVATGSDLEKSGLAGLTNEQWQNILAILNTHKT 1144
                        +RANAAQ +  G  V  A   D EKSGL GL +EQWQ ++ +LNT K 
Sbjct: 305  TGTGMGRGRGGTMRANAAQVISTGGGVAGAACVDAEKSGLTGLDDEQWQTLIEMLNTRKQ 364

Query: 1145 STGEKMT 1165
            S  E+MT
Sbjct: 365  SISERMT 371


>gb|PNX75692.1| hypothetical protein L195_g031632 [Trifolium pratense]
          Length = 389

 Score =  446 bits (1147), Expect = e-153
 Identities = 213/377 (56%), Positives = 275/377 (72%)
 Frame = +2

Query: 35   EPHKTEKYEESSNRELAKKMISPYDLNSNDNPGNLITQVKLRGDENYDEWACAVRTSLRA 214
            +P  +EK E+  N    K+  SP+DL SNDNPGNLITQV+LRG ENYDEW+ A++ SLRA
Sbjct: 12   KPVNSEKSEKRQNSP-GKRKTSPFDLTSNDNPGNLITQVQLRG-ENYDEWSKAMKVSLRA 69

Query: 215  RRKWGFVDGTISEPEKESSDFEDWWTVQSMLVSWIRNTIEPELRSTISHIENAKDLWEDI 394
            RRKWGF++GTI +P   + + EDWWTVQSM+VSWI NT+EP LRSTIS+ ENA+DLWEDI
Sbjct: 70   RRKWGFIEGTIDKPSDGTPELEDWWTVQSMIVSWILNTVEPNLRSTISYYENARDLWEDI 129

Query: 395  RERFSVANGPRIQQIKTELSECKQRGMSIVAYYGKLKSLWDDLANYDQIPTCTCTGCKCN 574
            +ER SVANGPRI Q+KT+L+ CKQ GM++ AYYGKLK LWD+LANY+QIP C+C GC C 
Sbjct: 130  KERLSVANGPRIHQLKTDLAACKQAGMTVAAYYGKLKVLWDELANYEQIPVCSCNGCSCR 189

Query: 575  FSAKLEQRREEEKVHQFLMGLDEVSYGTVRSNLLATEPLPSLNKVYGVLVREERVKTITR 754
             + KLE+RREE++VHQFLMGLD+V YGTVRSNLLA +PLPSLN++Y  +++EERV+TITR
Sbjct: 190  ITLKLEKRREEQRVHQFLMGLDDVVYGTVRSNLLAVDPLPSLNRIYSTMIQEERVRTITR 249

Query: 755  VAEERGEIMGLAAHAGGRLKGRGEAKDKTAICSICSRTGHDSSTCFQVIGYPDWWXXXXX 934
              EERGE+MGLA     + +GRG+ KDK   C+ C+++GHDS+ CF++IGYPDWW     
Sbjct: 250  AKEERGEVMGLAVQI-EKNRGRGDFKDK---CTNCNKSGHDSANCFELIGYPDWWGDRPK 305

Query: 935  XXXXXXXXXXXXXXXXXXXXXXXIVRANAAQTLGENVTVATGSDLEKSGLAGLTNEQWQN 1114
                                    VRANAAQ        ++  ++EK+G  G+T+EQWQ 
Sbjct: 306  SESKSGARGKSQHRGTGRGRGSVTVRANAAQASAS----SSAGNVEKNGFPGITSEQWQK 361

Query: 1115 ILAILNTHKTSTGEKMT 1165
            ++ +LN   + T +KMT
Sbjct: 362  LMEVLNISPSDTEDKMT 378


>gb|PNY16257.1| copia protein (gag-int-pol protein) [Trifolium pratense]
          Length = 1461

 Score =  471 bits (1211), Expect = e-150
 Identities = 224/370 (60%), Positives = 275/370 (74%), Gaps = 2/370 (0%)
 Frame = +2

Query: 62   ESSNRELAKKMISPYDLNSNDNPGNLITQVKLRGDENYDEWACAVRTSLRARRKWGFVDG 241
            E    +L K+  SPYDL+SNDNPG++ITQV+LRGDENYDEW  A+RTSLRARRKWGF+DG
Sbjct: 17   ERLKSDLGKRNSSPYDLHSNDNPGSVITQVQLRGDENYDEWTRAMRTSLRARRKWGFIDG 76

Query: 242  TISEPEKESSDFEDWWTVQSMLVSWIRNTIEPELRSTISHIENAKDLWEDIRERFSVANG 421
             I++PE  S + EDWWTVQSMLVSWI NTIEP LRSTI++ ENAK+LWEDI++RFSV NG
Sbjct: 77   AITQPEDGSPEIEDWWTVQSMLVSWILNTIEPSLRSTITYFENAKELWEDIKDRFSVVNG 136

Query: 422  PRIQQIKTELSECKQRGMSIVAYYGKLKSLWDDLANYDQIPTCTCTGCKCNFSAKLEQRR 601
            PRIQQ+K++L+ECKQ GM++VAYYGKLK LWD+LANY+Q  TCTC GCKCN + KLE+RR
Sbjct: 137  PRIQQLKSDLAECKQGGMTMVAYYGKLKVLWDELANYEQALTCTCGGCKCNIATKLEKRR 196

Query: 602  EEEKVHQFLMGLDEVSYGTVRSNLLATEPLPSLNKVYGVLVREERVKTITRVAEERGEIM 781
            EEEK HQFLMGLD+  YGTVRSNLLAT+PLPS+NK+Y  LV+EER+K +TR  E   EI+
Sbjct: 197  EEEKAHQFLMGLDDALYGTVRSNLLATDPLPSVNKIYSTLVQEERMKIVTRSKEGSKEIV 256

Query: 782  GLAAHAGGRLKGRGEAKDKTAICSICSRTGHDSSTCFQVIGYPDWWXXXXXXXXXXXXXX 961
            G+A   G R K  GEAK+K+  CS C+R+GHD + CF++IGYPDWW              
Sbjct: 257  GMAVQTGVRFKDPGEAKNKSTPCSHCNRSGHDEAGCFEIIGYPDWWGDRPRGVNKSRGRG 316

Query: 962  XXXXXXXXXXXXXXIVRANAAQTLGENVTVATGSDL--EKSGLAGLTNEQWQNILAILNT 1135
                          +VRANAAQT G  V  + G++   E SGL  L+NEQWQ ++ +L  
Sbjct: 317  KGVQRAGNNTGRGQMVRANAAQTAGAGVGTSVGTNFPAENSGLTSLSNEQWQTLMELLKN 376

Query: 1136 HKTSTGEKMT 1165
             K ST E+MT
Sbjct: 377  SKPSTNERMT 386


>gb|PNX80172.1| hypothetical protein L195_g036169, partial [Trifolium pratense]
          Length = 531

 Score =  440 bits (1131), Expect = e-148
 Identities = 220/376 (58%), Positives = 268/376 (71%), Gaps = 1/376 (0%)
 Frame = +2

Query: 41   HKTEKYEESSNRELAKKMISPYDLNSNDNPGNLITQVKLRGDENYDEWACAVRTSLRARR 220
            H ++K E   N    KK  SPYDLNSNDNPGNLITQV+LRG ENYDEW+ A++ SLRARR
Sbjct: 14   HASKKEETIHNT--VKKTPSPYDLNSNDNPGNLITQVQLRG-ENYDEWSRAMKRSLRARR 70

Query: 221  KWGFVDGTISEPEKESSDFEDWWTVQSMLVSWIRNTIEPELRSTISHIENAKDLWEDIRE 400
            KWGF++GTI  P++ S + EDWWTVQSMLVSWI NTIE  LRST+S+ ENA+DLW DI+E
Sbjct: 71   KWGFIEGTIETPDENSPEIEDWWTVQSMLVSWILNTIEANLRSTMSYAENARDLWLDIKE 130

Query: 401  RFSVANGPRIQQIKTELSECKQRGMSIVAYYGKLKSLWDDLANYDQIPTCTCTGCKCNFS 580
            RFSV NGPRIQQ+K +L+ CKQ GMS+V YYGKLK LWDDLANYDQIP C+C  CKC+ S
Sbjct: 131  RFSVVNGPRIQQLKLDLARCKQDGMSVVTYYGKLKLLWDDLANYDQIPVCSCGRCKCDIS 190

Query: 581  AKLEQRREEEKVHQFLMGLDEVSYGTVRSNLLATEPLPSLNKVYGVLVREERVKTITRVA 760
            +KLE+RREEE+VHQFLMGLD+  YGTVRSNLLAT+PLP+LN+VY  +++EERVKT+TR  
Sbjct: 191  SKLEKRREEERVHQFLMGLDDAIYGTVRSNLLATDPLPNLNRVYSTMIQEERVKTMTRTT 250

Query: 761  EERGEIMGLAAHAGGR-LKGRGEAKDKTAICSICSRTGHDSSTCFQVIGYPDWWXXXXXX 937
            EER E+M LA    GR +KG  + KDK   C+ C R GH++  CFQ++GYPDWW      
Sbjct: 251  EERREVMSLAVQTNGRAVKGSWDGKDK---CTHCHREGHEAGGCFQLVGYPDWWGDRPKI 307

Query: 938  XXXXXXXXXXXXXXXXXXXXXXIVRANAAQTLGENVTVATGSDLEKSGLAGLTNEQWQNI 1117
                                    RANA  T G +   A  S  + SGLAG+T EQ Q +
Sbjct: 308  EGKYGGRGKPIQRSGTGRGRRGTARANAVHTGGTSSETAI-SQGDGSGLAGITAEQLQTL 366

Query: 1118 LAILNTHKTSTGEKMT 1165
            + +LN  KT+  +KMT
Sbjct: 367  VGLLNAQKTNCNDKMT 382


>gb|PNX82965.1| hypothetical protein L195_g039002, partial [Trifolium pratense]
          Length = 507

 Score =  436 bits (1122), Expect = e-147
 Identities = 215/363 (59%), Positives = 263/363 (72%), Gaps = 1/363 (0%)
 Frame = +2

Query: 80   LAKKMISPYDLNSNDNPGNLITQVKLRGDENYDEWACAVRTSLRARRKWGFVDGTISEPE 259
            +AKK  SPYDLNSNDNPGNLITQV+LRG ENYDEW+ A++ SLRARRKW F++GTI  P+
Sbjct: 1    MAKKTPSPYDLNSNDNPGNLITQVRLRG-ENYDEWSRAMKISLRARRKWCFIEGTIQTPD 59

Query: 260  KESSDFEDWWTVQSMLVSWIRNTIEPELRSTISHIENAKDLWEDIRERFSVANGPRIQQI 439
            + S + EDWWTVQSMLVSWI NTIE +LRST+S+ ENA+DLW DI+ER SV NGPRIQQ+
Sbjct: 60   ENSPEIEDWWTVQSMLVSWILNTIEADLRSTVSYAENARDLWLDIKERLSVVNGPRIQQL 119

Query: 440  KTELSECKQRGMSIVAYYGKLKSLWDDLANYDQIPTCTCTGCKCNFSAKLEQRREEEKVH 619
            K +L+ CKQ GMS+V YYGKLK LWDDLANYDQIP C+C  CKC+ S+KLE+RREEE+VH
Sbjct: 120  KLDLARCKQDGMSVVTYYGKLKLLWDDLANYDQIPVCSCGRCKCDISSKLEKRREEERVH 179

Query: 620  QFLMGLDEVSYGTVRSNLLATEPLPSLNKVYGVLVREERVKTITRVAEERGEIMGLAAHA 799
            QFLMGLD+  YGTVRSNLLAT+PLP+LN+VY  +++EERVKT+TR  EER E+M LA   
Sbjct: 180  QFLMGLDDAIYGTVRSNLLATDPLPNLNRVYSTMIQEERVKTMTRTTEERREVMSLAVQT 239

Query: 800  GGR-LKGRGEAKDKTAICSICSRTGHDSSTCFQVIGYPDWWXXXXXXXXXXXXXXXXXXX 976
             GR +KG  + KDK   C+ C R GH++  CFQ++GYPDWW                   
Sbjct: 240  NGRAVKGSWDGKDK---CTHCHREGHEAGGCFQLVGYPDWWGDRPKIEGKYGGRGKPIQR 296

Query: 977  XXXXXXXXXIVRANAAQTLGENVTVATGSDLEKSGLAGLTNEQWQNILAILNTHKTSTGE 1156
                       RANA  T G +   A  S  + SGLAG+T EQ Q ++ +LN  KT+  +
Sbjct: 297  SGTGRGRRGTARANAVHTGGTSSETAI-SQGDGSGLAGITAEQLQTLVGLLNAQKTNCND 355

Query: 1157 KMT 1165
            KMT
Sbjct: 356  KMT 358


>gb|PNY12475.1| retroelement pol polyprotein-like [Trifolium pratense]
          Length = 1423

 Score =  457 bits (1177), Expect = e-146
 Identities = 221/366 (60%), Positives = 274/366 (74%), Gaps = 4/366 (1%)
 Frame = +2

Query: 80   LAKKMISPYDLNSNDNPGNLITQVKLRGDENYDEWACAVRTSLRARRKWGFVDGTISEPE 259
            + KK  SPYDL+S+DNPG++ITQV+LRG ENYDEWA A++TSLRARRKWGFV+G I +P+
Sbjct: 9    IVKKTSSPYDLSSHDNPGSVITQVQLRG-ENYDEWAKAMKTSLRARRKWGFVEGNIPQPK 67

Query: 260  KESSDFEDWWTVQSMLVSWIRNTIEPELRSTISHIENAKDLWEDIRERFSVANGPRIQQI 439
            + S++ EDWWTVQSMLVSWI NTIEP LRSTIS++ENAK+LWEDI+ER SV NGPRIQQ+
Sbjct: 68   EGSTEMEDWWTVQSMLVSWILNTIEPTLRSTISYMENAKELWEDIKERLSVVNGPRIQQL 127

Query: 440  KTELSECKQRGMSIVAYYGKLKSLWDDLANYDQIPTCTCTGCKCNFSAKLEQRREEEKVH 619
            K++L++CKQ GM++V YYGKLK LWD+LANY QIP C C GCKC+  AKLE++REEEKVH
Sbjct: 128  KSDLAQCKQEGMTMVNYYGKLKMLWDELANYQQIPICNCGGCKCDVKAKLEKQREEEKVH 187

Query: 620  QFLMGLDEVSYGTVRSNLLATEPLPSLNKVYGVLVREERVKTITRVAEERGEIMGLAAHA 799
            QFLMGLD+  YGTVRSNLLAT+PLPSLNK+Y  L++EERVK+I R  EERGEI+GLA   
Sbjct: 188  QFLMGLDDALYGTVRSNLLATDPLPSLNKMYATLIQEERVKSIARTKEERGEIVGLAVQT 247

Query: 800  GGRLKGRGEAKDKTAICSICSRTGHDSSTCFQVIGYPDWW----XXXXXXXXXXXXXXXX 967
            GGR +GRG  K+K ++CS C++ GHD + CFQ+IGYPDWW                    
Sbjct: 248  GGRARGRGNTKEKDSVCSHCNQPGHDVAGCFQIIGYPDWWGDRPRYEAKTGAGRGKGQQQ 307

Query: 968  XXXXXXXXXXXXIVRANAAQTLGENVTVATGSDLEKSGLAGLTNEQWQNILAILNTHKTS 1147
                        +VRANA     E  T  + +D +  GL GL+NEQ Q ++ +LNTHK S
Sbjct: 308  TRGSNHGRGRSTLVRANAVHA-HEGGTTVSNTDRDIGGLVGLSNEQLQTLMELLNTHKGS 366

Query: 1148 TGEKMT 1165
              E+MT
Sbjct: 367  NTERMT 372


>gb|PNY12308.1| retrovirus-related Pol polyprotein from transposon TNT 1-94
            [Trifolium pratense]
          Length = 1449

 Score =  454 bits (1169), Expect = e-144
 Identities = 227/385 (58%), Positives = 284/385 (73%), Gaps = 5/385 (1%)
 Frame = +2

Query: 26   MASE-PHKTEKYEESSNR---ELAKKMISPYDLNSNDNPGNLITQVKLRGDENYDEWACA 193
            MASE  H   K  E  +    E+ KK  SPYDLNSNDNPG++ITQV+LRG ENYDEWA A
Sbjct: 1    MASEHSHDEGKPVEGKSEKKIEMVKKTPSPYDLNSNDNPGSIITQVQLRG-ENYDEWAKA 59

Query: 194  VRTSLRARRKWGFVDGTISEPEKESSDFEDWWTVQSMLVSWIRNTIEPELRSTISHIENA 373
            +RTSLRARRKWGFV+GT+ +P++ S + EDWWTVQ+M+VSWI NTIE  LRSTIS++ENA
Sbjct: 60   MRTSLRARRKWGFVEGTVQQPDENSPEMEDWWTVQAMVVSWILNTIEASLRSTISYMENA 119

Query: 374  KDLWEDIRERFSVANGPRIQQIKTELSECKQRGMSIVAYYGKLKSLWDDLANYDQIPTCT 553
            K+LW+DI++R SV NGPRIQQ+K+EL+ CKQ GM++V YYGKLK+LWD+L NY QIP CT
Sbjct: 120  KELWDDIKDRLSVVNGPRIQQLKSELASCKQEGMTMVNYYGKLKALWDELGNYQQIPICT 179

Query: 554  CTGCKCNFSAKLEQRREEEKVHQFLMGLDEVSYGTVRSNLLATEPLPSLNKVYGVLVREE 733
            C GC C+  AKLE++REEEKVHQFLMGLD+V YGT RS+LLA++PLPSLN+VY  L++EE
Sbjct: 180  CKGCTCDIKAKLEKQREEEKVHQFLMGLDDVLYGTTRSSLLASDPLPSLNRVYATLIQEE 239

Query: 734  RVKTITRVAEERGEIMGLAAHAGGRLKGRGEAKDKTAICSICSRTGHDSSTCFQVIGYPD 913
            RVKTI R  EER EI+GLA   GGR +GRG+AKDKT  CS C++TGH+ + CFQ++GYPD
Sbjct: 240  RVKTIARSKEERTEIVGLAVKTGGRTRGRGDAKDKT--CSNCNQTGHEIAGCFQIVGYPD 297

Query: 914  WWXXXXXXXXXXXXXXXXXXXXXXXXXXXXIVRANAAQ-TLGENVTVATGSDLEKSGLAG 1090
            WW                             VRANAAQ T+G      T  +++K GL G
Sbjct: 298  WWGDRPRHDTKGVARVKGQHSQGRGRGMG--VRANAAQATIGRTTEAIT--EVDKGGLIG 353

Query: 1091 LTNEQWQNILAILNTHKTSTGEKMT 1165
            L+ +QWQ ++ +LN  K ++ EKMT
Sbjct: 354  LSADQWQTLVEMLNNQKGNSNEKMT 378


>ref|XP_017415202.1| PREDICTED: retrovirus-related Pol polyprotein from transposon TNT
            1-94 isoform X1 [Vigna angularis]
          Length = 1472

 Score =  453 bits (1165), Expect = e-144
 Identities = 219/380 (57%), Positives = 278/380 (73%)
 Frame = +2

Query: 26   MASEPHKTEKYEESSNRELAKKMISPYDLNSNDNPGNLITQVKLRGDENYDEWACAVRTS 205
            + SE  K  K  E S  E+ KKM SPYDL+++DNPGN+ITQV+L+G ENY+EWA AV+ S
Sbjct: 14   VTSELLKMAKEGEKSESEVVKKMSSPYDLSASDNPGNVITQVQLKG-ENYEEWAKAVKIS 72

Query: 206  LRARRKWGFVDGTISEPEKESSDFEDWWTVQSMLVSWIRNTIEPELRSTISHIENAKDLW 385
            LRARRKWGF+DGT +EPE ++S  EDWWT+QSMLVSWI NTIEP LRSTI+++ENAKDLW
Sbjct: 73   LRARRKWGFIDGTHTEPETDTSKIEDWWTIQSMLVSWILNTIEPNLRSTIAYMENAKDLW 132

Query: 386  EDIRERFSVANGPRIQQIKTELSECKQRGMSIVAYYGKLKSLWDDLANYDQIPTCTCTGC 565
            +DI+ERFS+ NGPRIQQ+K++L+ECKQ+GM++VAYYGKLK LWD+LANY+QIP C C GC
Sbjct: 133  DDIKERFSIVNGPRIQQLKSKLAECKQQGMTMVAYYGKLKILWDELANYEQIPQCKCGGC 192

Query: 566  KCNFSAKLEQRREEEKVHQFLMGLDEVSYGTVRSNLLATEPLPSLNKVYGVLVREERVKT 745
            KCN + KLE+RREEE+VHQFLMGLD+  YGT RSN+LAT+PLPSLN+VY  +V+EERV+ 
Sbjct: 193  KCNIATKLEKRREEERVHQFLMGLDDEGYGTTRSNVLATDPLPSLNRVYATMVQEERVRM 252

Query: 746  ITRVAEERGEIMGLAAHAGGRLKGRGEAKDKTAICSICSRTGHDSSTCFQVIGYPDWWXX 925
            ITR  EERG I+G+      + K R E K+K+ +C+ C RTGHD   CF++IGYPDWW  
Sbjct: 253  ITRSKEERGMIVGMVVQTETKGKLRNEVKEKSIVCTHCGRTGHDKRNCFEIIGYPDWWGE 312

Query: 926  XXXXXXXXXXXXXXXXXXXXXXXXXXIVRANAAQTLGENVTVATGSDLEKSGLAGLTNEQ 1105
                                        R N A T     +  + SD +K  +AGL+NEQ
Sbjct: 313  RPRNENKSGGRHQQRTTFFRGKGVTP--RVNIAHT--STSSSDSKSDTKKPEVAGLSNEQ 368

Query: 1106 WQNILAILNTHKTSTGEKMT 1165
            W+ +  +LN+HK +T EKMT
Sbjct: 369  WEILATMLNSHKANTTEKMT 388


>gb|PNX93622.1| retrovirus-related Pol polyprotein from transposon TNT 1-94
            [Trifolium pratense]
          Length = 1454

 Score =  452 bits (1163), Expect = e-143
 Identities = 218/383 (56%), Positives = 281/383 (73%), Gaps = 3/383 (0%)
 Frame = +2

Query: 26   MASEPH--KTEKYEESS-NRELAKKMISPYDLNSNDNPGNLITQVKLRGDENYDEWACAV 196
            MASE    K EK  ESS N+   K+  SP+DLNSNDNPGNLITQV+LRG+ NYDEW  A+
Sbjct: 1    MASELKDLKDEKKPESSENQHQGKRKSSPFDLNSNDNPGNLITQVQLRGENNYDEWTRAM 60

Query: 197  RTSLRARRKWGFVDGTISEPEKESSDFEDWWTVQSMLVSWIRNTIEPELRSTISHIENAK 376
            +TSLRARRKWGF++GT+ +P++ +++ EDWWTVQSMLVSWI NT+EP LRST++++ENA+
Sbjct: 61   KTSLRARRKWGFIEGTVKKPDEGTAEIEDWWTVQSMLVSWILNTVEPNLRSTMTYMENAR 120

Query: 377  DLWEDIRERFSVANGPRIQQIKTELSECKQRGMSIVAYYGKLKSLWDDLANYDQIPTCTC 556
            DLWEDI+ERFSVANGP+I Q+K +L  CKQ GM+I AYYGKLK LWD+LANY+Q+P C+C
Sbjct: 121  DLWEDIKERFSVANGPKIHQLKADLVACKQAGMTIAAYYGKLKLLWDELANYEQVPVCSC 180

Query: 557  TGCKCNFSAKLEQRREEEKVHQFLMGLDEVSYGTVRSNLLATEPLPSLNKVYGVLVREER 736
             GC C  + KLE+RREEE+VHQFLMGLD+V YGT RSNLLA++PLP+LN++Y V+++EER
Sbjct: 181  EGCSCRITTKLEKRREEERVHQFLMGLDDVVYGTARSNLLASDPLPNLNRIYSVMIQEER 240

Query: 737  VKTITRVAEERGEIMGLAAHAGGRLKGRGEAKDKTAICSICSRTGHDSSTCFQVIGYPDW 916
            V+TI R  EERG++MGLA   GG+ +GR E KDK   C+ C+R GH ++ CFQ+IGYPDW
Sbjct: 241  VRTIARNKEERGDVMGLAVQIGGKNRGRDEFKDK---CTNCNRDGHVAANCFQLIGYPDW 297

Query: 917  WXXXXXXXXXXXXXXXXXXXXXXXXXXXXIVRANAAQTLGENVTVATGSDLEKSGLAGLT 1096
            W                            IVRANAAQ  G     ++  + E  G  G+T
Sbjct: 298  WGDRPRGEGKSGTRGRSQNRGAGRGKGAAIVRANAAQAGGN----SSAREAESHGFPGIT 353

Query: 1097 NEQWQNILAILNTHKTSTGEKMT 1165
            ++QWQ ++ ILN  +  T E+MT
Sbjct: 354  SDQWQKLMEILNI-QPDTAERMT 375


>dbj|GAU43467.1| hypothetical protein TSUD_141050 [Trifolium subterraneum]
          Length = 333

 Score =  413 bits (1062), Expect = e-141
 Identities = 194/298 (65%), Positives = 243/298 (81%)
 Frame = +2

Query: 26  MASEPHKTEKYEESSNRELAKKMISPYDLNSNDNPGNLITQVKLRGDENYDEWACAVRTS 205
           M SE  K E+ +     +L KK+ SPYDL+++DNPG LITQV+L+G ENYDEW+ A+RTS
Sbjct: 1   MESEDKKYEETKTEKTIDLVKKIPSPYDLHTSDNPGILITQVQLKG-ENYDEWSKAMRTS 59

Query: 206 LRARRKWGFVDGTISEPEKESSDFEDWWTVQSMLVSWIRNTIEPELRSTISHIENAKDLW 385
           LRARRKWGFV+G+I +P K+SS+ EDWWTVQSMLVSWI NT+EP LRSTIS+ ENAKDLW
Sbjct: 60  LRARRKWGFVEGSIPQPTKDSSEMEDWWTVQSMLVSWILNTVEPSLRSTISYQENAKDLW 119

Query: 386 EDIRERFSVANGPRIQQIKTELSECKQRGMSIVAYYGKLKSLWDDLANYDQIPTCTCTGC 565
           EDI+ERFSV NGPRIQQIK EL+EC+Q  M++VAYYG LK+LWD+L NY QIP CTC GC
Sbjct: 120 EDIKERFSVVNGPRIQQIKAELAECRQTKMTMVAYYGMLKTLWDELTNYQQIPRCTCGGC 179

Query: 566 KCNFSAKLEQRREEEKVHQFLMGLDEVSYGTVRSNLLATEPLPSLNKVYGVLVREERVKT 745
           KC+  +K+E++REEEKVHQFLMGLD+  YGTVRSNLLAT+PLPSLN+VY  +V+EERV+ 
Sbjct: 180 KCDIGSKMEKQREEEKVHQFLMGLDDALYGTVRSNLLATDPLPSLNRVYATMVQEERVRV 239

Query: 746 ITRVAEERGEIMGLAAHAGGRLKGRGEAKDKTAICSICSRTGHDSSTCFQVIGYPDWW 919
           I++  EERGE++G +A    R +G  E K+K   CS C+RT H++  CF++IGYP+WW
Sbjct: 240 ISKGKEERGEVVGFSAQTHTRGRGLTEIKEKDK-CSNCNRTRHEAGNCFELIGYPEWW 296


>dbj|GAU34891.1| hypothetical protein TSUD_144220 [Trifolium subterraneum]
          Length = 1218

 Score =  438 bits (1127), Expect = e-140
 Identities = 211/363 (58%), Positives = 265/363 (73%)
 Frame = +2

Query: 77   ELAKKMISPYDLNSNDNPGNLITQVKLRGDENYDEWACAVRTSLRARRKWGFVDGTISEP 256
            E+ KK  SPYDLNSNDNPG++ITQV+LRG ENYDEWA A+RTSLRARRKWGFV+GT+ +P
Sbjct: 22   EMVKKTPSPYDLNSNDNPGSIITQVQLRG-ENYDEWAKAIRTSLRARRKWGFVEGTVKQP 80

Query: 257  EKESSDFEDWWTVQSMLVSWIRNTIEPELRSTISHIENAKDLWEDIRERFSVANGPRIQQ 436
            ++ S + EDWWTVQ+M+VSWI NTIE  LRSTIS++ENAK+LW+DI++R SV NGPRIQQ
Sbjct: 81   DENSPEMEDWWTVQAMVVSWILNTIEASLRSTISYMENAKELWDDIKDRLSVVNGPRIQQ 140

Query: 437  IKTELSECKQRGMSIVAYYGKLKSLWDDLANYDQIPTCTCTGCKCNFSAKLEQRREEEKV 616
            +K+EL+ CKQ GM++V YYGKLK+LWD+L NY QIP CTC GC C+   KLE+RREEEKV
Sbjct: 141  LKSELASCKQEGMTMVNYYGKLKALWDELGNYQQIPICTCKGCTCDIKTKLEKRREEEKV 200

Query: 617  HQFLMGLDEVSYGTVRSNLLATEPLPSLNKVYGVLVREERVKTITRVAEERGEIMGLAAH 796
            HQFLMGLD+V YGT RS+LLAT+PLPSLN+VY  L++EERVKTI R  EER EI+GL   
Sbjct: 201  HQFLMGLDDVLYGTTRSSLLATDPLPSLNRVYATLIQEERVKTIARSKEERIEIVGLTVK 260

Query: 797  AGGRLKGRGEAKDKTAICSICSRTGHDSSTCFQVIGYPDWWXXXXXXXXXXXXXXXXXXX 976
             GGR++ RG+ KDK   CS C ++GH+++ CFQ+IGYPDWW                   
Sbjct: 261  TGGRMRERGDPKDKA--CSNCKQSGHEATGCFQLIGYPDWW------------------G 300

Query: 977  XXXXXXXXXIVRANAAQTLGENVTVATGSDLEKSGLAGLTNEQWQNILAILNTHKTSTGE 1156
                     + R    Q   ++     G +++  GL GL+ EQWQ ++ ILN  K +  E
Sbjct: 301  DRPRHEAKGVARVKGQQ---QSQGRGRGMEVDNGGLIGLSAEQWQKLMEILNNQKGNNSE 357

Query: 1157 KMT 1165
            KMT
Sbjct: 358  KMT 360


>ref|XP_020970369.1| uncharacterized protein LOC107621341 [Arachis ipaensis]
          Length = 386

 Score =  412 bits (1059), Expect = e-139
 Identities = 203/373 (54%), Positives = 269/373 (72%)
 Frame = +2

Query: 47   TEKYEESSNRELAKKMISPYDLNSNDNPGNLITQVKLRGDENYDEWACAVRTSLRARRKW 226
            ++K E+ SN     K  SPYDL+ +DNPGN+ITQV+L+G ENY+EWA AV+ SLRARRKW
Sbjct: 14   SKKDEDGSN-----KGPSPYDLSVSDNPGNVITQVQLQG-ENYEEWARAVKVSLRARRKW 67

Query: 227  GFVDGTISEPEKESSDFEDWWTVQSMLVSWIRNTIEPELRSTISHIENAKDLWEDIRERF 406
            GF+DGT  +P++ +S+ EDWWTVQSMLVSW+ NTIEP+L STIS+ E+AKDLWE+I+ERF
Sbjct: 68   GFLDGTHKKPQEGASEMEDWWTVQSMLVSWVMNTIEPQLCSTISYTEDAKDLWEEIKERF 127

Query: 407  SVANGPRIQQIKTELSECKQRGMSIVAYYGKLKSLWDDLANYDQIPTCTCTGCKCNFSAK 586
            S  NGPRIQQ+K EL+ECKQ+G+++V YYGKLK+LWD+LAN++ +  C+C GCKC+  ++
Sbjct: 128  SNVNGPRIQQLKAELAECKQQGLAMVEYYGKLKTLWDELANHEPVLRCSCGGCKCDIGSR 187

Query: 587  LEQRREEEKVHQFLMGLDEVSYGTVRSNLLATEPLPSLNKVYGVLVREERVKTITRVAEE 766
            L++RREEEKVHQFLMGL++VSYGTVRSN+LAT+PLPSLN+VY  LV+EER+K I+R  E+
Sbjct: 188  LDKRREEEKVHQFLMGLEDVSYGTVRSNILATDPLPSLNRVYATLVQEERMKMISRTKED 247

Query: 767  RGEIMGLAAHAGGRLKGRGEAKDKTAICSICSRTGHDSSTCFQVIGYPDWWXXXXXXXXX 946
            +G +MGLA H G + K R E   K  +CS C RTGH+   CFQ+IGYP+WW         
Sbjct: 248  KGSLMGLAVHTGYKHKSRNEI--KPLVCSHCGRTGHEIKGCFQLIGYPEWW--GDRSRDE 303

Query: 947  XXXXXXXXXXXXXXXXXXXIVRANAAQTLGENVTVATGSDLEKSGLAGLTNEQWQNILAI 1126
                               ++RAN AQ   E  +     ++  S + GL+ +QWQ +L +
Sbjct: 304  KGSNTKTSHKQGGRLRGGAVIRANTAQATTEK-SSNHEEEIGGSAVTGLSQKQWQTLLEM 362

Query: 1127 LNTHKTSTGEKMT 1165
            LN +  S  E MT
Sbjct: 363  LNGNNGSRTESMT 375


>ref|XP_012847972.1| PREDICTED: uncharacterized protein LOC105967929 [Erythranthe guttata]
          Length = 386

 Score =  409 bits (1051), Expect = e-138
 Identities = 200/361 (55%), Positives = 256/361 (70%), Gaps = 1/361 (0%)
 Frame = +2

Query: 86   KKMISPYDLNSNDNPGNLITQVKLRGDENYDEWACAVRTSLRARRKWGFVDGTISEPEKE 265
            KKM  PY L SNDNPGN+ITQV+L+G ENYDEWA AVRTSLRA++K+GFVDGTI  P  +
Sbjct: 18   KKMSGPYTLTSNDNPGNVITQVRLKG-ENYDEWARAVRTSLRAKKKYGFVDGTIERPTDD 76

Query: 266  SSDFEDWWTVQSMLVSWIRNTIEPELRSTISHIENAKDLWEDIRERFSVANGPRIQQIKT 445
            S D EDWW+V SMLVSWI NTIEP LRSTIS++E+ KDLWE+I++RFSV+NGPR+QQI++
Sbjct: 77   SPDIEDWWSVNSMLVSWIFNTIEPTLRSTISYMEDVKDLWEEIKQRFSVSNGPRVQQIRS 136

Query: 446  ELSECKQRGMSIVAYYGKLKSLWDDLANYDQIPTCTCTGCKCNFSAKLEQRREEEKVHQF 625
            +L+ CKQ G SIV YYG+LKSLWD+L NYD IP C C GCKCN + KL ++REEE++HQF
Sbjct: 137  DLANCKQNGQSIVTYYGRLKSLWDELNNYDPIPVCECAGCKCNVTTKLNKKREEERIHQF 196

Query: 626  LMGLDEVSYGTVRSNLLATEPLPSLNKVYGVLVREERVKTITRVAEERGEIMGLAAHAGG 805
            LMGLDE  Y TVRSN+L+ E LP+LN+VY ++V++E+V+ +T   EERG  M     AG 
Sbjct: 197  LMGLDEGGYETVRSNILSAESLPNLNRVYAMVVQQEQVQIMTSTKEERGNPMSFVVQAG- 255

Query: 806  RLKGRGEAKDKTAICSICSRTGHDSSTCFQVIGYPDWWXXXXXXXXXXXXXXXXXXXXXX 985
            R  G GE K+K + CSIC R GHD+  CFQ IGYP+WW                      
Sbjct: 256  RNSG-GERKEKPSTCSICKRKGHDAENCFQRIGYPEWWGERPRTTTGGRGSTNGRGTQQN 314

Query: 986  XXXXXX-IVRANAAQTLGENVTVATGSDLEKSGLAGLTNEQWQNILAILNTHKTSTGEKM 1162
                     RA+AAQT   + T    ++ +++G+ GL+NEQW  +L IL++HK    E++
Sbjct: 315  YGRGRGGAARAHAAQTPSFDGTRNIVTNSDRTGITGLSNEQWSALLNILDSHKDGNTERL 374

Query: 1163 T 1165
            T
Sbjct: 375  T 375


>gb|PNX83470.1| hypothetical protein L195_g039513 [Trifolium pratense]
          Length = 499

 Score =  409 bits (1051), Expect = e-137
 Identities = 196/371 (52%), Positives = 259/371 (69%), Gaps = 2/371 (0%)
 Frame = +2

Query: 59   EESSNRELAKKMISPYDLNSNDNPGNLITQVKLRGDENYDEWACAVRTSLRARRKWGFVD 238
            +ES++   A+K ISPYD+  NDNPG+L+TQV+L+G ENYDEWA ++RT+LRAR+K+GFVD
Sbjct: 23   KESADGGKARKTISPYDITPNDNPGSLVTQVQLKG-ENYDEWASSLRTALRARKKFGFVD 81

Query: 239  GTISEPEKESSDFEDWWTVQSMLVSWIRNTIEPELRSTISHIENAKDLWEDIRERFSVAN 418
            GTI +P+++S D EDWWT  S+LVSWI NTIEP +RST+SH+E A DLWEDI+ERFSV N
Sbjct: 82   GTIKKPDEDSPDLEDWWTNNSLLVSWIMNTIEPSVRSTMSHMEVAHDLWEDIKERFSVVN 141

Query: 419  GPRIQQIKTELSECKQRGMSIVAYYGKLKSLWDDLANYDQIPTCTCTGCKCNFSAKLEQR 598
            GPRIQQ+K EL++CKQ+G +I+AY+GK+K LW++LANY+QIP+C C  C CN    L+++
Sbjct: 142  GPRIQQLKAELADCKQKGSTILAYFGKMKKLWEELANYEQIPSCKCGKCTCNIGVVLQKK 201

Query: 599  REEEKVHQFLMGLDEVSYGTVRSNLLATEPLPSLNKVYGVLVREERVKTITRVAEERGEI 778
            REEE+VHQFLMGLD+ SYGTVRSNLL  +PLP+LN+VY VLV+EERV+TITR  E+ GE+
Sbjct: 202  REEERVHQFLMGLDDTSYGTVRSNLLTQDPLPALNRVYSVLVQEERVRTITRGKEDTGEV 261

Query: 779  MGLAAHAGGRLKGRGEAKDKTAICSICSRTGHDSSTCFQVIGYPDWW--XXXXXXXXXXX 952
            M  A  A  + +GR E   KT  CS C+R GH+   CF++IGYPDWW             
Sbjct: 262  MSFAVQARNQTRGRSEGPSKTTPCSHCNRPGHEPDGCFELIGYPDWWGERPRGARQGAKR 321

Query: 953  XXXXXXXXXXXXXXXXXIVRANAAQTLGENVTVATGSDLEKSGLAGLTNEQWQNILAILN 1132
                              ++AN  QT+  N      S         L+NEQW+ +L ++ 
Sbjct: 322  GKEGMTVGGRGRGGRGGPIKANVMQTVSNNTAENVNS---------LSNEQWEVLLNLVR 372

Query: 1133 THKTSTGEKMT 1165
              +T   EK+T
Sbjct: 373  NVQTGATEKLT 383


>gb|KYP36798.1| hypothetical protein KK1_042036 [Cajanus cajan]
          Length = 347

 Score =  401 bits (1030), Expect = e-136
 Identities = 182/278 (65%), Positives = 224/278 (80%)
 Frame = +2

Query: 86  KKMISPYDLNSNDNPGNLITQVKLRGDENYDEWACAVRTSLRARRKWGFVDGTISEPEKE 265
           +K ISPYD+ +NDNPG+LITQV+L+G ENYDEW  ++RT+LRAR+K+GFVDGTI +P + 
Sbjct: 10  RKTISPYDITANDNPGSLITQVQLKG-ENYDEWVRSIRTALRARKKFGFVDGTIQKPTEN 68

Query: 266 SSDFEDWWTVQSMLVSWIRNTIEPELRSTISHIENAKDLWEDIRERFSVANGPRIQQIKT 445
           S D EDWWT  S+LVSWI NTIEP LR TI H+E A+DLW DI+ERFSVANGPRIQQ+K 
Sbjct: 69  SPDIEDWWTNNSLLVSWIMNTIEPSLRFTILHMEVAQDLWNDIKERFSVANGPRIQQLKA 128

Query: 446 ELSECKQRGMSIVAYYGKLKSLWDDLANYDQIPTCTCTGCKCNFSAKLEQRREEEKVHQF 625
           EL ECKQ+GM+IVAYYGKLK LW++L N+DQIPTC+C    CNF A LE++REEEK+HQF
Sbjct: 129 ELVECKQKGMTIVAYYGKLKKLWEELGNFDQIPTCSCGLYTCNFHAVLERKREEEKIHQF 188

Query: 626 LMGLDEVSYGTVRSNLLATEPLPSLNKVYGVLVREERVKTITRVAEERGEIMGLAAHAGG 805
           LMGLD+ +YG +RSNLLA +PL +LN +Y  LV+EERV T++R  EERGE+M  A   G 
Sbjct: 189 LMGLDDTTYGIIRSNLLAQDPLLNLNNIYSTLVQEERVHTVSRAKEERGEMMAFAVQTGT 248

Query: 806 RLKGRGEAKDKTAICSICSRTGHDSSTCFQVIGYPDWW 919
           R + +GE KDK  IC  C R+GHDS  CFQ++GYPDWW
Sbjct: 249 RTRDKGEGKDKNTICGHCHRSGHDSDNCFQILGYPDWW 286


>ref|XP_021886856.1| uncharacterized protein LOC110806350 isoform X2 [Carica papaya]
          Length = 392

 Score =  397 bits (1021), Expect = e-134
 Identities = 197/365 (53%), Positives = 245/365 (67%), Gaps = 5/365 (1%)
 Frame = +2

Query: 86   KKMISPYDLNSNDNPGNLITQVKLRGDENYDEWACAVRTSLRARRKWGFVDGTISEPEKE 265
            +K ISPYD+ SNDNP  +IT V+L+GD NYDEWA ++RT+LRAR+K+GF+DGTI +P++E
Sbjct: 18   RKTISPYDIISNDNPRIVITHVQLKGD-NYDEWARSMRTALRARKKFGFIDGTIKQPDEE 76

Query: 266  SSDFEDWWTVQSMLVSWIRNTIEPELRSTISHIENAKDLWEDIRERFSVANGPRIQQIKT 445
            S D EDWWT+ S+LVSWIRNTIE  LRS ISH+E A+DLW DIRE FSVANG RIQQ+K 
Sbjct: 77   SPDLEDWWTINSLLVSWIRNTIESTLRSIISHMEVAQDLWIDIRECFSVANGSRIQQLKA 136

Query: 446  ELSECKQRGMSIVAYYGKLKSLWDDLANYDQIPTCTCTGCKCNFSAKLEQRREEEKVHQF 625
            EL++CKQ+G++IV YYGKLK LW++L N+DQIPTC    C C+  + LE++REEEK+H F
Sbjct: 137  ELAKCKQKGLAIVDYYGKLKKLWEELGNFDQIPTCKRGKCTCDLGSVLEKKREEEKMHLF 196

Query: 626  LMGLDEVSYGTVRSNLLATEPLPSLNKVYGVLVREERVKTITRVAEERGEIMGLAAHAGG 805
            LMGLDE  YGTVRSN+LA +PLPSLN+VY  LV+EER+K I R  EER ++M        
Sbjct: 197  LMGLDETIYGTVRSNILAQDPLPSLNRVYATLVQEERMKIIARGKEERSDVMAFVVQGTA 256

Query: 806  RLKGRGEAKDKTAICSICSRTGHDSSTCFQVIGYPDWW-----XXXXXXXXXXXXXXXXX 970
               GR E KDK  ICS C RT H+S  CFQ+IG+PD W                      
Sbjct: 257  TYCGRSEGKDKNMICSNCKRTSHESDNCFQLIGFPDLWGDRPRGDGKMGNRGRGQQQQQK 316

Query: 971  XXXXXXXXXXXIVRANAAQTLGENVTVATGSDLEKSGLAGLTNEQWQNILAILNTHKTST 1150
                         RAN  QT+G +       D E+ G  GL N+QWQ +L +LN  K   
Sbjct: 317  IGTGGGRGRGGASRANVVQTVGGSSLATLAPDEERKGTIGLNNKQWQTLLQMLNNKKPYV 376

Query: 1151 GEKMT 1165
             EKMT
Sbjct: 377  NEKMT 381


>ref|XP_021886855.1| uncharacterized protein LOC110806350 isoform X1 [Carica papaya]
          Length = 411

 Score =  397 bits (1021), Expect = e-133
 Identities = 197/365 (53%), Positives = 245/365 (67%), Gaps = 5/365 (1%)
 Frame = +2

Query: 86   KKMISPYDLNSNDNPGNLITQVKLRGDENYDEWACAVRTSLRARRKWGFVDGTISEPEKE 265
            +K ISPYD+ SNDNP  +IT V+L+GD NYDEWA ++RT+LRAR+K+GF+DGTI +P++E
Sbjct: 18   RKTISPYDIISNDNPRIVITHVQLKGD-NYDEWARSMRTALRARKKFGFIDGTIKQPDEE 76

Query: 266  SSDFEDWWTVQSMLVSWIRNTIEPELRSTISHIENAKDLWEDIRERFSVANGPRIQQIKT 445
            S D EDWWT+ S+LVSWIRNTIE  LRS ISH+E A+DLW DIRE FSVANG RIQQ+K 
Sbjct: 77   SPDLEDWWTINSLLVSWIRNTIESTLRSIISHMEVAQDLWIDIRECFSVANGSRIQQLKA 136

Query: 446  ELSECKQRGMSIVAYYGKLKSLWDDLANYDQIPTCTCTGCKCNFSAKLEQRREEEKVHQF 625
            EL++CKQ+G++IV YYGKLK LW++L N+DQIPTC    C C+  + LE++REEEK+H F
Sbjct: 137  ELAKCKQKGLAIVDYYGKLKKLWEELGNFDQIPTCKRGKCTCDLGSVLEKKREEEKMHLF 196

Query: 626  LMGLDEVSYGTVRSNLLATEPLPSLNKVYGVLVREERVKTITRVAEERGEIMGLAAHAGG 805
            LMGLDE  YGTVRSN+LA +PLPSLN+VY  LV+EER+K I R  EER ++M        
Sbjct: 197  LMGLDETIYGTVRSNILAQDPLPSLNRVYATLVQEERMKIIARGKEERSDVMAFVVQGTA 256

Query: 806  RLKGRGEAKDKTAICSICSRTGHDSSTCFQVIGYPDWW-----XXXXXXXXXXXXXXXXX 970
               GR E KDK  ICS C RT H+S  CFQ+IG+PD W                      
Sbjct: 257  TYCGRSEGKDKNMICSNCKRTSHESDNCFQLIGFPDLWGDRPRGDGKMGNRGRGQQQQQK 316

Query: 971  XXXXXXXXXXXIVRANAAQTLGENVTVATGSDLEKSGLAGLTNEQWQNILAILNTHKTST 1150
                         RAN  QT+G +       D E+ G  GL N+QWQ +L +LN  K   
Sbjct: 317  IGTGGGRGRGGASRANVVQTVGGSSLATLAPDEERKGTIGLNNKQWQTLLQMLNNKKPYV 376

Query: 1151 GEKMT 1165
             EKMT
Sbjct: 377  NEKMT 381


>gb|KZV52705.1| hypothetical protein F511_23168 [Dorcoceras hygrometricum]
          Length = 422

 Score =  396 bits (1018), Expect = e-133
 Identities = 186/355 (52%), Positives = 255/355 (71%), Gaps = 3/355 (0%)
 Frame = +2

Query: 86   KKMISPYDLNSNDNPGNLITQVKLRGDENYDEWACAVRTSLRARRKWGFVDGTISEPEKE 265
            K  ISPY L++NDNPGN+ITQV+L+G ENY+EWA A+RT+LRA++K+GF+DG++ EP ++
Sbjct: 14   KLTISPYFLSTNDNPGNIITQVQLKG-ENYEEWARAIRTALRAKKKYGFIDGSLKEPSED 72

Query: 266  SSDFEDWWTVQSMLVSWIRNTIEPELRSTISHIENAKDLWEDIRERFSVANGPRIQQIKT 445
            SS+ EDWWTV SM+VSWI NTIEP LRSTI+ +E AKDLW+DI+ERFS  NGPRI Q+KT
Sbjct: 73   SSEQEDWWTVNSMVVSWILNTIEPTLRSTITFMEIAKDLWDDIKERFSAGNGPRIHQLKT 132

Query: 446  ELSECKQRGMSIVAYYGKLKSLWDDLANYDQIPTCTCTGCKCNFSAKLEQRREEEKVHQF 625
            EL ECKQRGM+IV YYGKLK +W++L NY+Q P C C  CKCN SA+L+++REEE++HQF
Sbjct: 133  ELVECKQRGMTIVNYYGKLKMIWEELGNYEQNPVCKCGSCKCNISAELDKKREEERLHQF 192

Query: 626  LMGLDEVSYGTVRSNLLATEPLPSLNKVYGVLVREERVKTITRVAEERGEIMGLAAHAGG 805
            L+GLD+  YGTVRSN+L+ +PL +LN+ Y ++++EERV+ ITR  E+R E M  A   G 
Sbjct: 193  LIGLDDSIYGTVRSNILSADPLLNLNRAYAMMIQEERVRNITRGKEQRSEHMAFAVQTGS 252

Query: 806  RLKGRGEAKDKTAICSICSRTGHDSSTCFQVIGYPDWW---XXXXXXXXXXXXXXXXXXX 976
              KGR ++KD + +C  C++ GH++ +CF++IGYPDWW                      
Sbjct: 253  TSKGRTDSKDTSMVCPNCNKPGHNAESCFKLIGYPDWWGDRPKGSGRGSGRGQGGSRQSI 312

Query: 977  XXXXXXXXXIVRANAAQTLGENVTVATGSDLEKSGLAGLTNEQWQNILAILNTHK 1141
                      +RANAA    + +     ++ +K GL+GL++EQW  +L +LNT K
Sbjct: 313  GSGGKGRGGPIRANAAIVTSQEIKT---NETDKGGLSGLSSEQWNTLLNLLNTPK 364


>ref|XP_015385530.1| PREDICTED: uncharacterized protein LOC107176911 [Citrus sinensis]
          Length = 557

 Score =  396 bits (1017), Expect = e-131
 Identities = 188/358 (52%), Positives = 254/358 (70%), Gaps = 5/358 (1%)
 Frame = +2

Query: 83   AKKMISPYDLNSNDNPGNLITQVKLRGDENYDEWACAVRTSLRARRKWGFVDGTISEPEK 262
            +K   SPY L++NDNPGN+ITQV+L+GD NYDEWA A+RT+LRA++K+GF+DG++ +P  
Sbjct: 11   SKSATSPYFLSANDNPGNIITQVQLKGD-NYDEWARAMRTALRAKKKFGFIDGSVIQPSD 69

Query: 263  ESSDFEDWWTVQSMLVSWIRNTIEPELRSTISHIENAKDLWEDIRERFSVANGPRIQQIK 442
            +S   EDWWTV SML+SWI NTIEP LRSTI++ E AK+LW+DI+ERFS  NGPR+ Q+K
Sbjct: 70   DSMTQEDWWTVNSMLISWILNTIEPTLRSTITYREVAKELWDDIKERFSAGNGPRVHQLK 129

Query: 443  TELSECKQRGMSIVAYYGKLKSLWDDLANYDQIPTCTCTGCKCNFSAKLEQRREEEKVHQ 622
            +EL+ECKQ+G+++++YYGKLK +W++L NY+Q PTC C GC CN  A+L++RREEE++HQ
Sbjct: 130  SELAECKQQGITVMSYYGKLKMIWEELGNYEQYPTCRCGGCACNIGAELDKRREEERLHQ 189

Query: 623  FLMGLDEVSYGTVRSNLLATEPLPSLNKVYGVLVREERVKTITRVAEERGEIMGLAAHAG 802
            F MGLD+ +YGTVRSN+L+TEPLP+LN+ Y ++++EERV +ITR  E++ E M  A    
Sbjct: 190  FFMGLDDSTYGTVRSNILSTEPLPTLNRAYAMIIQEERVCSITRGKEQQVEAMAFAVQIA 249

Query: 803  GRLKGRGEAKDKTAICSICSRTGHDSSTCFQVIGYPDWW----XXXXXXXXXXXXXXXXX 970
              LKGR E+KDKT +CS   RTGHD+ +CFQ+I YPDWW                     
Sbjct: 250  TSLKGRTESKDKTVLCSNYKRTGHDAESCFQLIRYPDWWGDRPRGGGGRGTGRGQGGQKQ 309

Query: 971  XXXXXXXXXXXIVRANAAQTLGENVTVATGS-DLEKSGLAGLTNEQWQNILAILNTHK 1141
                        +RAN AQ   +  T      + +KSGL GL+NEQW  +L +LN  K
Sbjct: 310  PAASGGRGRGGQIRANVAQVTTQGSTAQEQRVEADKSGLNGLSNEQWNLLLNLLNGQK 367

