BLASTX nr result

ID: Rehmannia32_contig00000969 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia32_contig00000969
         (973 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011073037.1| uncharacterized protein LOC105158097 [Sesamu...   310   2e-99
gb|KZV22124.1| hypothetical protein F511_11652 [Dorcoceras hygro...   204   1e-60
ref|XP_012849618.1| PREDICTED: uncharacterized protein LOC105969...   170   2e-43
gb|PON54809.1| LOW QUALITY PROTEIN: Gag-Pol-related retrotranspo...   161   8e-43
gb|KZV56298.1| hypothetical protein F511_00295 [Dorcoceras hygro...   167   2e-42
gb|PON41343.1| Zinc finger, CCHC-type [Parasponia andersonii]         156   9e-42
ref|XP_011100917.1| uncharacterized protein LOC105179028 [Sesamu...   151   4e-41
gb|AAD23679.1| putative retroelement pol polyprotein [Arabidopsi...   160   3e-40
dbj|BAQ19356.1| putative gag-pol polyprotein [Torenia fournieri]      154   8e-39
gb|PKA49510.1| Retrovirus-related Pol polyprotein from transposo...   145   6e-37
emb|CAN80490.1| hypothetical protein VITISV_004703 [Vitis vinifera]   150   8e-37
emb|CAN67309.1| hypothetical protein VITISV_028165 [Vitis vinifera]   144   1e-36
gb|PPR84446.1| hypothetical protein GOBAR_AA36262 [Gossypium bar...   148   5e-36
gb|PPS20553.1| hypothetical protein GOBAR_AA00016 [Gossypium bar...   148   6e-36
gb|AAK29467.1| polyprotein-like [Solanum chilense]                    145   6e-35
gb|KYP36396.1| Retrovirus-related Pol polyprotein from transposo...   142   9e-35
gb|KYP64673.1| Retrovirus-related Pol polyprotein from transposo...   142   7e-34
sp|P10978.1|POLX_TOBAC RecName: Full=Retrovirus-related Pol poly...   142   9e-34
emb|CAN84102.1| hypothetical protein VITISV_041248 [Vitis vinifera]   141   1e-33
gb|ABA98804.1| retrotransposon protein, putative, Ty1-copia subc...   141   2e-33

>ref|XP_011073037.1| uncharacterized protein LOC105158097 [Sesamum indicum]
          Length = 472

 Score =  310 bits (794), Expect = 2e-99
 Identities = 177/330 (53%), Positives = 223/330 (67%), Gaps = 8/330 (2%)
 Frame = +1

Query: 1    LQELYTETSLPSXXXXXXXXXXXXXDLNKEIDENLDVFTKLIQDIKLTGDKNIDDYTPIV 180
            L+EL+TE SLP+             DL+K IDENLD FTKLIQDIKL GDK ID+Y+PIV
Sbjct: 84   LEELFTEISLPNKLFLLEKIFRYKLDLSKNIDENLDDFTKLIQDIKLAGDKYIDEYSPIV 143

Query: 181  LLNAIPESYSDVKSAIKYGRDSITLDTVVNGLKSKELDLKVNKGGRQNSGEVMHVRGRSQ 360
            LLNAIPES+SDVK+AIKYGRDSI L+TVVNGLKSKELDLKVNK   Q+  E+  VRGR+ 
Sbjct: 144  LLNAIPESFSDVKAAIKYGRDSINLETVVNGLKSKELDLKVNKPS-QSHYEINSVRGRT- 201

Query: 361  HRFGN-----HQKSDNHQNQNFHXXXXXXXXXXXXXXXXXXCYKCGEVGHYIREC--PNK 519
             RFGN     + +S +    N                    CY CG  GHYI++C  P +
Sbjct: 202  -RFGNFNSRYNSRSRSKTKTNRSKSRPRETNLRDDKIRDRRCYNCGTKGHYIKDCRKPRR 260

Query: 520  KGNQNFKNDQANMAS-SSENVGEIFMVSDLCEKHAVNSVKSNSVIDNEWLIDSACTFHMS 696
            +      +D+  +++ S E+ GE+F+V      +  NSV +  +  +EWLIDS CTFHMS
Sbjct: 261  ENRDRNYDDKEKVSNVSIESNGEVFVV------YEANSVSTFDM--HEWLIDSGCTFHMS 312

Query: 697  PFKNLFSSYSEVKNCHVSMANEKKCEVLGIGDICLKFESGYAYTLKNVRHVPDLCNNLIS 876
            PFK++F++        VSMANEKKCE+ G+GDI L F+ GY   LKNVR+VPDL +NLIS
Sbjct: 313  PFKDIFTNLKYEHAGFVSMANEKKCEIKGLGDISLCFD-GYKMLLKNVRYVPDLSHNLIS 371

Query: 877  CAALEDDGLEGRFGNGVMKILKGSLVIFKA 966
            CAALE++GLEGR+G G+MKI+KGSLV+FKA
Sbjct: 372  CAALEENGLEGRWGKGLMKIMKGSLVVFKA 401


>gb|KZV22124.1| hypothetical protein F511_11652 [Dorcoceras hygrometricum]
          Length = 277

 Score =  204 bits (519), Expect = 1e-60
 Identities = 106/196 (54%), Positives = 129/196 (65%)
 Frame = +1

Query: 1   LQELYTETSLPSXXXXXXXXXXXXXDLNKEIDENLDVFTKLIQDIKLTGDKNIDDYTPIV 180
           LQELYTETSLPS             DLNK++D NLDVFTKLIQDIKLTGDKNIDDYTPIV
Sbjct: 88  LQELYTETSLPSKMFLLEKFFRFKLDLNKDLDGNLDVFTKLIQDIKLTGDKNIDDYTPIV 147

Query: 181 LLNAIPESYSDVKSAIKYGRDSITLDTVVNGLKSKELDLKVNKGGRQNSGEVMHVRGRSQ 360
           LLNAIP+ Y+DV+SAIKYGRD ITL+TV++GLKSKELDLK  KG + N GEVMHV GRS+
Sbjct: 148 LLNAIPDDYADVRSAIKYGRDKITLETVISGLKSKELDLKAYKGTKPNGGEVMHVGGRSK 207

Query: 361 HRFGNHQKSDNHQNQNFHXXXXXXXXXXXXXXXXXXCYKCGEVGHYIRECPNKKGNQNFK 540
            R+ NH  ++   +++ +                  CY CGE GHY  +CP+ K    +K
Sbjct: 208 TRYRNHMGNNGDNSRSKY-------------IGNRTCYNCGEKGHYKADCPHPK-EDKYK 253

Query: 541 NDQANMASSSENVGEI 588
            D   +   S N  ++
Sbjct: 254 RDNTLLTEQSNNATDL 269


>ref|XP_012849618.1| PREDICTED: uncharacterized protein LOC105969411 [Erythranthe
           guttata]
          Length = 1213

 Score =  170 bits (430), Expect = 2e-43
 Identities = 86/131 (65%), Positives = 103/131 (78%)
 Frame = +1

Query: 1   LQELYTETSLPSXXXXXXXXXXXXXDLNKEIDENLDVFTKLIQDIKLTGDKNIDDYTPIV 180
           L ELYTETSLPS             DL K+IDEN+D FT+L+QDIKLTGDK+ID+YTPIV
Sbjct: 527 LDELYTETSLPSKLFLLEKFFRFKLDLTKDIDENIDQFTRLVQDIKLTGDKSIDNYTPIV 586

Query: 181 LLNAIPESYSDVKSAIKYGRDSITLDTVVNGLKSKELDLKVNKGGRQNSGEVMHVRGRSQ 360
           LLNAIP+SY+D+KSAIKYGRD+I+LDTV+NGLKSKE+DL+VNK  + + GEV  VRGR Q
Sbjct: 587 LLNAIPDSYNDLKSAIKYGRDNISLDTVINGLKSKEMDLRVNKSNK-SFGEVNFVRGRQQ 645

Query: 361 HRFGNHQKSDN 393
           +RF N   S N
Sbjct: 646 NRFSNKPSSSN 656


>gb|PON54809.1| LOW QUALITY PROTEIN: Gag-Pol-related retrotransposon family protein
           [Trema orientalis]
          Length = 380

 Score =  161 bits (407), Expect = 8e-43
 Identities = 104/300 (34%), Positives = 147/300 (49%)
 Frame = +1

Query: 1   LQELYTETSLPSXXXXXXXXXXXXXDLNKEIDENLDVFTKLIQDIKLTGDKNIDDYTPIV 180
           L+ELY   +LP              D +K I+ENLD +TKL+ D++  G K  D    I+
Sbjct: 94  LEELYRAKTLPGRIYLKERFFGFKMDKSKSIEENLDDYTKLVLDLENLGIKVDDKDKAII 153

Query: 181 LLNAIPESYSDVKSAIKYGRDSITLDTVVNGLKSKELDLKVNKGGRQNSGEVMHVRGRSQ 360
           LLN++P +  + K  +KYGR +IT+D V N L+SK LD+K ++   Q  GE +H+RGR+ 
Sbjct: 154 LLNSLPRNLKNFKETLKYGRQTITVDEVQNALESKLLDMKGSEKNAQ--GEGLHIRGRT- 210

Query: 361 HRFGNHQKSDNHQNQNFHXXXXXXXXXXXXXXXXXXCYKCGEVGHYIRECPNKKGNQNFK 540
                  K DNH  +                      Y C + GH  R  P ++     K
Sbjct: 211 ------TKQDNHDGKG--KSQSRSKSRGKKDYSKVKYYHCNKNGHIRRLRPERQNKDAGK 262

Query: 541 NDQANMASSSENVGEIFMVSDLCEKHAVNSVKSNSVIDNEWLIDSACTFHMSPFKNLFSS 720
            D           G+  +V D  E   V S+ S S    EW++DS C++HM P ++ F  
Sbjct: 263 LD-----------GDAVIVDDGYESSEVLSI-SESENSKEWVMDSGCSYHMCPREDWFMD 310

Query: 721 YSEVKNCHVSMANEKKCEVLGIGDICLKFESGYAYTLKNVRHVPDLCNNLISCAALEDDG 900
           Y EV    V M N   C+V+GIG I ++   G    LKNVRHVP+L  +LIS   L+  G
Sbjct: 311 YQEVDGGKVLMGNNMACKVMGIGSISIRMFDGVTRILKNVRHVPELKRSLISLGTLDKSG 370


>gb|KZV56298.1| hypothetical protein F511_00295 [Dorcoceras hygrometricum]
          Length = 1309

 Score =  167 bits (423), Expect = 2e-42
 Identities = 96/321 (29%), Positives = 167/321 (52%)
 Frame = +1

Query: 1   LQELYTETSLPSXXXXXXXXXXXXXDLNKEIDENLDVFTKLIQDIKLTGDKNIDDYTPIV 180
           L+ LY + SL +             +  K++ +++D F K+I D+K    K  D+   I+
Sbjct: 87  LESLYLKRSLANRLYLKKSLYTIHLEEGKDLKKHMDEFNKIILDLKNVDIKITDEDCAIL 146

Query: 181 LLNAIPESYSDVKSAIKYGRDSITLDTVVNGLKSKELDLKVNKGGRQNSGEVMHVRGRSQ 360
           +L+++P SY      + YG++++T+  V + L SKEL  K N+   +++GE ++VRGR+ 
Sbjct: 147 MLSSLPRSYEHFVDTMLYGKETLTMAEVKSALNSKELHKK-NETKMESTGEGLNVRGRTY 205

Query: 361 HRFGNHQKSDNHQNQNFHXXXXXXXXXXXXXXXXXXCYKCGEVGHYIRECPNKKGNQNFK 540
            R   ++K   H++Q+                    C+ C + GH+ ++CP+++      
Sbjct: 206 KRESRNEKGGKHRSQS-------------RTRGKLKCFVCHKEGHFKKDCPDRR------ 246

Query: 541 NDQANMASSSENVGEIFMVSDLCEKHAVNSVKSNSVIDNEWLIDSACTFHMSPFKNLFSS 720
              A      ++ G+  +VSD  E   V  V   +  D  W++DS C+FHM P K+ F +
Sbjct: 247 ---ARNPERRKDPGDAAVVSDGYESAEVLVVSRTNKQDC-WVMDSGCSFHMCPIKSWFQN 302

Query: 721 YSEVKNCHVSMANEKKCEVLGIGDICLKFESGYAYTLKNVRHVPDLCNNLISCAALEDDG 900
             E ++ HV + N ++C+V+GIG + LK   G   T+  VR+VPDL  NL+S   L+  G
Sbjct: 303 LVEEESGHVLLGNNRECKVMGIGSVLLKMHDGCVRTITEVRYVPDLRRNLLSIGMLDSKG 362

Query: 901 LEGRFGNGVMKILKGSLVIFK 963
              +   G MK++KGSL + +
Sbjct: 363 FNVKIEGGTMKVIKGSLTVMR 383


>gb|PON41343.1| Zinc finger, CCHC-type [Parasponia andersonii]
          Length = 297

 Score =  156 bits (394), Expect = 9e-42
 Identities = 96/300 (32%), Positives = 157/300 (52%), Gaps = 4/300 (1%)
 Frame = +1

Query: 85  KEIDENLDVFTKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIKYGRD-SITLDT 261
           K I + +D F K+I D++    K  D+   ++LLNA+P++Y   K A+ YGR+ +ITLD 
Sbjct: 24  KSISDQIDKFNKIIDDLENIEIKLEDEDKALILLNALPKAYEHFKDAMLYGREQTITLDE 83

Query: 262 VVNGLKSKELDLKVNKGGRQNSGEVMHVRGRSQHRFGNHQKSDNHQNQNFHXXXXXXXXX 441
           V + +K+KEL  K  +G  +N+GE +  RGRS+       K DN   +N           
Sbjct: 84  VQSAVKAKELPRK-KEGKEENTGEGLMARGRSE-------KCDNKAPRN---------ES 126

Query: 442 XXXXXXXXXCYKCGEVGHYIRECPNKKGNQNFKND---QANMASSSENVGEIFMVSDLCE 612
                    C+ C + GH+ R+CP++K   + K     +A++AS   +  E+ +V+D   
Sbjct: 127 RSKSKGRLKCFHCHKEGHFKRDCPDRKKKVHEKPKDPGEASVASDGYDSAEVLVVTD--- 183

Query: 613 KHAVNSVKSNSVIDNEWLIDSACTFHMSPFKNLFSSYSEVKNCHVSMANEKKCEVLGIGD 792
                          EW++DS C+FHM P K+ F +  +     V + N K C+V GIG 
Sbjct: 184 ----------EDSSKEWIMDSGCSFHMCPTKSWFENLEKTDGGSVLLGNNKPCKVAGIGS 233

Query: 793 ICLKFESGYAYTLKNVRHVPDLCNNLISCAALEDDGLEGRFGNGVMKILKGSLVIFKAVK 972
           + ++   G    L+ VR+VP+L  NLIS   L+ +G   +  +G +K+ KGSL++ K ++
Sbjct: 234 VRIRMFDGMERILQQVRYVPELKRNLISLGMLDMNGYSFKAEHGSLKVSKGSLIVRKGIR 293


>ref|XP_011100917.1| uncharacterized protein LOC105179028 [Sesamum indicum]
          Length = 188

 Score =  151 bits (381), Expect = 4e-41
 Identities = 76/103 (73%), Positives = 86/103 (83%)
 Frame = +1

Query: 1   LQELYTETSLPSXXXXXXXXXXXXXDLNKEIDENLDVFTKLIQDIKLTGDKNIDDYTPIV 180
           L++LYTETSLPS             DL+K IDENLD FTKLIQDIKLTGDKNID+Y+PIV
Sbjct: 84  LEDLYTETSLPSKLFLLEKKFHYKLDLSKSIDENLDDFTKLIQDIKLTGDKNIDEYSPIV 143

Query: 181 LLNAIPESYSDVKSAIKYGRDSITLDTVVNGLKSKELDLKVNK 309
           LLNAIP+SYSD K+AIKYGRDS+ LDTVVNGLKSKE+DLKV+K
Sbjct: 144 LLNAIPKSYSDAKAAIKYGRDSVNLDTVVNGLKSKEMDLKVSK 186


>gb|AAD23679.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
          Length = 838

 Score =  160 bits (405), Expect = 3e-40
 Identities = 98/331 (29%), Positives = 169/331 (51%), Gaps = 7/331 (2%)
 Frame = +1

Query: 1    LQELYTETSLPSXXXXXXXXXXXXXDLNKEIDENLDVFTKLIQDIKLTGDKNIDDYTPIV 180
            L +L+   SLP+               +  I+EN++ F KLI D++       D+   IV
Sbjct: 100  LDKLFMAKSLPNRIYLKQRLYGYKMSDSMTIEENVNDFFKLISDLENVKVSVPDEDQAIV 159

Query: 181  LLNAIPESYSDVKSAIKYGRDSITLDTVVNGLKSKELDLKVNKGGRQNSGEVMHV--RGR 354
            LL ++P+ +  +K  +KYG+ ++ LD +   ++SK L+L  +    +NS + + V  RGR
Sbjct: 160  LLMSLPKQFDQLKDTLKYGKTTLALDEITGAIRSKVLELGASGKMLKNSSDALFVQDRGR 219

Query: 355  SQHRFGNHQKSDNHQNQNFHXXXXXXXXXXXXXXXXXXCYKCGEVGHYIREC-----PNK 519
            S+ R    + S+ +++Q+                    C+ CG+ GH+ ++C      NK
Sbjct: 220  SEKR---DKSSERNKSQS-----------RSKSREKKVCWVCGKEGHFKKQCYVWKEKNK 265

Query: 520  KGNQNFKNDQANMASSSENVGEIFMVSDLCEKHAVNSVKSNSVIDNEWLIDSACTFHMSP 699
            KGN + K + +N+      +G+    + L  +   N+   N  +DNEW++D+ C+FHM+P
Sbjct: 266  KGNNSEKGESSNV------IGQAADAAALAVREESNA--DNQEVDNEWIMDTGCSFHMTP 317

Query: 700  FKNLFSSYSEVKNCHVSMANEKKCEVLGIGDICLKFESGYAYTLKNVRHVPDLCNNLISC 879
             ++ F  + E +   V MAN+   E+ GIG I ++ +      LKNVR+VP +  NLIS 
Sbjct: 318  RRDWFVEFDESQTGRVKMANQTYSEIKGIGSIRIQNDDNTTVLLKNVRYVPSMSKNLISM 377

Query: 880  AALEDDGLEGRFGNGVMKILKGSLVIFKAVK 972
              LED G   +   G +K++KG + + K  K
Sbjct: 378  GTLEDQGCWFQSKAGTLKVVKGCMTLLKGKK 408


>dbj|BAQ19356.1| putative gag-pol polyprotein [Torenia fournieri]
          Length = 605

 Score =  154 bits (390), Expect = 8e-39
 Identities = 92/319 (28%), Positives = 161/319 (50%)
 Frame = +1

Query: 1   LQELYTETSLPSXXXXXXXXXXXXXDLNKEIDENLDVFTKLIQDIKLTGDKNIDDYTPIV 180
           L+ELY   SL +                K IDE +D F KLI D++    K  D+   ++
Sbjct: 106 LEELYMTKSLANRLYLKQALYSFKMIEEKAIDEQMDQFIKLILDLENIEVKIEDEDQALL 165

Query: 181 LLNAIPESYSDVKSAIKYGRDSITLDTVVNGLKSKELDLKVNKGGRQNSGEVMHVRGRSQ 360
           L+ A+P SY+  K  + YGR+++TL  V   LKSK+L+ +++     ++ E ++V+G+ +
Sbjct: 166 LVCALPRSYNTFKDTLLYGRETLTLKEVQAALKSKQLNTRIDNKAVGSTSEALYVKGKGE 225

Query: 361 HRFGNHQKSDNHQNQNFHXXXXXXXXXXXXXXXXXXCYKCGEVGHYIRECPNKKGNQNFK 540
            +  + ++ +  + +                     C+ C E GH  + CP K+ ++  K
Sbjct: 226 EKKTHKERKNKSKKK-------------------VKCFYCDEEGHMCKNCPKKERDKGKK 266

Query: 541 NDQANMASSSENVGEIFMVSDLCEKHAVNSVKSNSVIDNEWLIDSACTFHMSPFKNLFSS 720
            +Q   A + E+     +++   E   V   + +     +WL+DSA +FH++  K+    
Sbjct: 267 VEQGEAAMACESYESADVLAVTHEDQDVTKSEKSG----KWLLDSASSFHVTCVKSWIKD 322

Query: 721 YSEVKNCHVSMANEKKCEVLGIGDICLKFESGYAYTLKNVRHVPDLCNNLISCAALEDDG 900
           +     C VS+  EK+ ++LG G + ++ ++G    L+NV+ +PDL  NLIS   L+  G
Sbjct: 323 FKGCDGCLVSVGEEKQYKILGFGTVKIRLKTGGVRILRNVKFIPDLGRNLISVGLLDVQG 382

Query: 901 LEGRFGNGVMKILKGSLVI 957
            +   GNGVMK+ KGS VI
Sbjct: 383 FKCVAGNGVMKVFKGSKVI 401


>gb|PKA49510.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94
           [Apostasia shenzhenica]
          Length = 365

 Score =  145 bits (366), Expect = 6e-37
 Identities = 95/299 (31%), Positives = 155/299 (51%), Gaps = 7/299 (2%)
 Frame = +1

Query: 97  ENLDVFTKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIKYGRDSITLDTVVNGL 276
           E+++VF+K+I  ++    K  ++   ++LL+++P+SY  + + I YG+D++ ++ V   L
Sbjct: 9   EHMNVFSKMISQLRSINVKLEEENEALLLLSSLPKSYDHLVTTILYGKDTLKVEEVNATL 68

Query: 277 KSKELDLKVNKGGRQNSGEVMHV-----RGRSQHRFGNHQKSDNHQNQNFHXXXXXXXXX 441
            S E+  K      Q++GE + V     RGRS+++FGN  +  +   +N +         
Sbjct: 69  LSNEVRNK------QSTGESLTVKTSQDRGRSKNKFGNQYRYRSISKENDNR-------- 114

Query: 442 XXXXXXXXXCYKCGEVGHYIRECPNKKGNQNFKN--DQANMASSSENVGEIFMVSDLCEK 615
                    CY C + GH+ R+CP K   Q  K   ++A++AS  E   E      LC  
Sbjct: 115 ---------CYYCKKEGHWKRDCPKKSKQQQQKKSGEEASVASRLEKDSET-----LCTF 160

Query: 616 HAVNSVKSNSVIDNEWLIDSACTFHMSPFKNLFSSYSEVKNCHVSMANEKKCEVLGIGDI 795
             ++S  S       W++DS C++HM PF++ FS+YS      V M N  +C+ +GIG I
Sbjct: 161 SCMDSSDS-------WILDSDCSYHMCPFRDWFSTYSIHDGGRVIMGNNSECKSVGIGTI 213

Query: 796 CLKFESGYAYTLKNVRHVPDLCNNLISCAALEDDGLEGRFGNGVMKILKGSLVIFKAVK 972
            +K   G   TL  VRHVPDL   LIS   L+  G      +G++K+ KG+ V+ K  K
Sbjct: 214 KIKMFDGVIRTLTEVRHVPDLRKGLISLGTLDASGCTFIGSDGIIKVKKGAPVVMKGEK 272


>emb|CAN80490.1| hypothetical protein VITISV_004703 [Vitis vinifera]
          Length = 777

 Score =  150 bits (379), Expect = 8e-37
 Identities = 92/297 (30%), Positives = 144/297 (48%)
 Frame = +1

Query: 82  NKEIDENLDVFTKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIKYGRDSITLDT 261
           +K ID+ LD F KL+ D++    K  D+   I+LLN++P+S    K  +KYG+D IT D 
Sbjct: 27  SKPIDDALDEFNKLVLDLESLDIKVEDEDKAIILLNSLPKSLKHFKETLKYGKDDITFDD 86

Query: 262 VVNGLKSKELDLKVNKGGRQNSGEVMHVRGRSQHRFGNHQKSDNHQNQNFHXXXXXXXXX 441
           V N L +K LD+K     + N G+             +  KS +   +++          
Sbjct: 87  VQNALNAKVLDMK--SSDKTNGGK-------------SRSKSKSKGKKDYRNVK------ 125

Query: 442 XXXXXXXXXCYKCGEVGHYIRECPNKKGNQNFKNDQANMASSSENVGEIFMVSDLCEKHA 621
                    CY C ++G   R CP+++  +            ++  G   ++ D  +   
Sbjct: 126 ---------CYHCNKIGQIRRICPDRQQEEK-----------TQAQGSAAIIDDGYDSTE 165

Query: 622 VNSVKSNSVIDNEWLIDSACTFHMSPFKNLFSSYSEVKNCHVSMANEKKCEVLGIGDICL 801
           V +++ N   + EW++DS CT+HM P ++ FSSY EV    + + N   C V+GIG + +
Sbjct: 166 VLTIRLNPNHE-EWVLDSGCTYHMCPRRDWFSSYQEVNGGKLLLGNNMSCNVVGIGTMAI 224

Query: 802 KFESGYAYTLKNVRHVPDLCNNLISCAALEDDGLEGRFGNGVMKILKGSLVIFKAVK 972
               G   TLK VRHVPDL  NLIS   L+  G   +  NG + I K ++V+ K  K
Sbjct: 225 NMHDGKTRTLKEVRHVPDLKRNLISLGTLDKSGYNFKAKNGKLTISKXAMVVMKGQK 281


>emb|CAN67309.1| hypothetical protein VITISV_028165 [Vitis vinifera]
          Length = 344

 Score =  144 bits (363), Expect = 1e-36
 Identities = 89/297 (29%), Positives = 145/297 (48%), Gaps = 3/297 (1%)
 Frame = +1

Query: 91  IDENLDVFTKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIKYGRDSITLDTVVN 270
           I ++ + F K I D++    K   +   I L+ ++P SY      + YGR ++ +  V  
Sbjct: 72  IKDHFNEFNKTIXDLRNIDVKVNYEDQAIFLMCSLPNSYEHFVDIMMYGRGTLFIKDVRV 131

Query: 271 GLKSKELDLKVNKGGRQNSGEVMHVRGRSQHRFGNHQKSDNHQNQNFHXXXXXXXXXXXX 450
            L S+EL   V K  + +SGE +  RGR++ +    +     +++  +            
Sbjct: 132 ALNSRELKKMVFKSRKYDSGEGLVARGRTRKKNNGRRGRSRSKSRGNNK----------- 180

Query: 451 XXXXXXCYKCGEVGHYIRECPNKKGNQNFKN---DQANMASSSENVGEIFMVSDLCEKHA 621
                 C+KC + GHY++  P++KG +N +N     A  A  + N  ++ +V        
Sbjct: 181 ------CFKCKKEGHYVKNXPDRKGKENKRNYNSGDATFAKENSNTTDVLLVX------V 228

Query: 622 VNSVKSNSVIDNEWLIDSACTFHMSPFKNLFSSYSEVKNCHVSMANEKKCEVLGIGDICL 801
            NS       D+EW++DS C++HMSP  + FS+Y  +    V M N+  C+V+GI  I +
Sbjct: 229 TNS-------DDEWILDSGCSYHMSPNGDWFSTYQPIDGGKVLMGNKVACKVVGIHXIQI 281

Query: 802 KFESGYAYTLKNVRHVPDLCNNLISCAALEDDGLEGRFGNGVMKILKGSLVIFKAVK 972
           K   G   TL NVRHVP L  NLI    L+ +G   + G GV+++ KG LV+    K
Sbjct: 282 KMHGGIIRTLTNVRHVPKLNKNLIFLRTLDSNGCIYKAGGGVLRVSKGGLVVMNGKK 338


>gb|PPR84446.1| hypothetical protein GOBAR_AA36262 [Gossypium barbadense]
          Length = 1841

 Score =  148 bits (374), Expect = 5e-36
 Identities = 94/297 (31%), Positives = 146/297 (49%), Gaps = 6/297 (2%)
 Frame = +1

Query: 91   IDENLDVFTKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIKYGRDSITLDTVVN 270
            + ++LD F  +I D+    +K  D+   I++L ++P SY +    + YGRD +TL+ V N
Sbjct: 642  VSQHLDKFNSIIMDLNNIDNKIDDEDQAIIVLCSLPPSYENFVDTMMYGRDDLTLEEVKN 701

Query: 271  GLKSKELDLKVN-KGGRQNSGEVMHVRGRSQHRFGNHQKSDNHQNQNFHXXXXXXXXXXX 447
             L S EL  K+  K    N GE +  RGRS+ + G+  KS                    
Sbjct: 702  ALSSSELRKKITGKVVENNEGEGLVARGRSKAKGGSSSKSHPRSQSK------------- 748

Query: 448  XXXXXXXCYKCGEVGHYIRECPNKKG---NQNFKNDQANMAS--SSENVGEIFMVSDLCE 612
                   CY C + GH   +CP +K    +Q  +ND+AN+A   SS +   +  VSD   
Sbjct: 749  ---KRIQCYYCKKYGHMKVDCPKRKEKSESQEQQNDRANVADADSSSDAEIVLAVSD--- 802

Query: 613  KHAVNSVKSNSVIDNEWLIDSACTFHMSPFKNLFSSYSEVKNCHVSMANEKKCEVLGIGD 792
                      S     W++D+  TFH+S  K+ FS+Y E  +  V M N+  C+V+GIG 
Sbjct: 803  ----------SYAGGRWILDTGATFHISTSKDAFSTY-EKHSGSVLMGNDHACQVMGIGT 851

Query: 793  ICLKFESGYAYTLKNVRHVPDLCNNLISCAALEDDGLEGRFGNGVMKILKGSLVIFK 963
            + +K   G   TL +VRH+P++  NLIS + L+  G       GV+K+  G+L + +
Sbjct: 852  VRIKMFDGIVRTLTDVRHIPEMKKNLISLSTLDKKGFRYSAEGGVLKVFSGALTVIR 908


>gb|PPS20553.1| hypothetical protein GOBAR_AA00016 [Gossypium barbadense]
          Length = 2351

 Score =  148 bits (374), Expect = 6e-36
 Identities = 94/297 (31%), Positives = 146/297 (49%), Gaps = 6/297 (2%)
 Frame = +1

Query: 91   IDENLDVFTKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIKYGRDSITLDTVVN 270
            + ++LD F  +I D+    +K  D+   I++L ++P SY +    + YGRD +TL+ V N
Sbjct: 621  VSQHLDKFNSIIMDLNNIDNKIDDEDQAIIVLCSLPPSYENFVDTMMYGRDDLTLEEVKN 680

Query: 271  GLKSKELDLKVN-KGGRQNSGEVMHVRGRSQHRFGNHQKSDNHQNQNFHXXXXXXXXXXX 447
             L S EL  K+  K    N GE +  RGRS+ + G+  KS                    
Sbjct: 681  ALSSSELRKKITGKVVENNEGEGLVARGRSKAKGGSSSKSHPRSQSK------------- 727

Query: 448  XXXXXXXCYKCGEVGHYIRECPNKKG---NQNFKNDQANMAS--SSENVGEIFMVSDLCE 612
                   CY C + GH   +CP +K    +Q  +ND+AN+A   SS +   +  VSD   
Sbjct: 728  ---KRIQCYYCKKYGHMKVDCPKRKEKSESQEQQNDRANVADADSSSDAEIVLAVSD--- 781

Query: 613  KHAVNSVKSNSVIDNEWLIDSACTFHMSPFKNLFSSYSEVKNCHVSMANEKKCEVLGIGD 792
                      S     W++D+  TFH+S  K+ FS+Y E  +  V M N+  C+V+GIG 
Sbjct: 782  ----------SYAGGRWILDTGATFHISTSKDAFSTY-EKHSGSVLMGNDHACQVMGIGT 830

Query: 793  ICLKFESGYAYTLKNVRHVPDLCNNLISCAALEDDGLEGRFGNGVMKILKGSLVIFK 963
            + +K   G   TL +VRH+P++  NLIS + L+  G       GV+K+  G+L + +
Sbjct: 831  VRIKMFDGIVRTLTDVRHIPEMKKNLISLSTLDKKGFRYSAEGGVLKVFSGALTVIR 887


>gb|AAK29467.1| polyprotein-like [Solanum chilense]
          Length = 1328

 Score =  145 bits (366), Expect = 6e-35
 Identities = 103/329 (31%), Positives = 152/329 (46%), Gaps = 6/329 (1%)
 Frame = +1

Query: 1    LQELYTETSLPSXXXXXXXXXXXXXDLNKEIDENLDVFTKLIQDIKLTGDKNIDDYTPIV 180
            L+ LY   +L +             D       +L+V   LI  +   G K  ++   IV
Sbjct: 89   LENLYMSKTLTNKLYLKKQLYTLHMDEGTNFLSHLNVLNGLITQLANLGVKIEEEDKRIV 148

Query: 181  LLNAIPESYSDVKSAIKYGRDSITLDTVVNGLKSKELDLKVNKGGRQNSGEVM--HVRGR 354
            LLN++P SY  + + I +G+DSI L  V + L   E   K+ K   +N G+V     RGR
Sbjct: 149  LLNSLPSSYDTLSTTILHGKDSIQLKDVTSALLLNE---KMRKKP-ENHGQVFITESRGR 204

Query: 355  SQHRFGNHQKSDNHQNQNFHXXXXXXXXXXXXXXXXXXCYKCGEVGHYIRECPNKKGNQN 534
            S  R           + N+                   CY C + GH+ R+CPN K  + 
Sbjct: 205  SYQR----------SSSNYGRSGARGKSKVRSKSKARNCYNCDQPGHFKRDCPNPKRGKG 254

Query: 535  F----KNDQANMASSSENVGEIFMVSDLCEKHAVNSVKSNSVIDNEWLIDSACTFHMSPF 702
                 KND    A    N   + ++++  E+  ++   + S    EW++D+A ++H +P 
Sbjct: 255  ESSGQKNDDNTAAMVQNNDDVVLLINE--EEECMHLAGTES----EWVVDTAASYHATPV 308

Query: 703  KNLFSSYSEVKNCHVSMANEKKCEVLGIGDICLKFESGYAYTLKNVRHVPDLCNNLISCA 882
            ++LF  Y      +V M N    ++ GIGDIC K   G    LK+VRHVPDL  NLIS  
Sbjct: 309  RDLFCRYVAGDYGNVKMGNTSYSKIAGIGDICFKTNVGCTLVLKDVRHVPDLRMNLISGI 368

Query: 883  ALEDDGLEGRFGNGVMKILKGSLVIFKAV 969
            AL+ DG E  F N   ++ KG+LVI K V
Sbjct: 369  ALDQDGYENYFANQKWRLTKGALVIAKGV 397


>gb|KYP36396.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94
           [Cajanus cajan]
          Length = 485

 Score =  142 bits (357), Expect = 9e-35
 Identities = 95/325 (29%), Positives = 155/325 (47%), Gaps = 1/325 (0%)
 Frame = +1

Query: 1   LQELYTETSLPSXXXXXXXXXXXXXDLNKEIDENLDVFTKLIQDIKLTGDKNIDDYTPIV 180
           L+ LY   SL                  K I + L  F K++ D++    +  D+   ++
Sbjct: 34  LESLYMTKSLAHRLCLKQRLYSFKMTETKSIVDQLAEFNKILDDLENIEVQLEDEDKALL 93

Query: 181 LLNAIPESYSDVKSAIKYGRDS-ITLDTVVNGLKSKELDLKVNKGGRQNSGEVMHVRGRS 357
           LLN++P +Y   K AI YG++  ITLD V   +++KEL  + +     N   +   RGRS
Sbjct: 94  LLNSLPRNYEHFKDAILYGKEQDITLDEVQTSIRTKELQRQQDNKTDDNGESLNVSRGRS 153

Query: 358 QHRFGNHQKSDNHQNQNFHXXXXXXXXXXXXXXXXXXCYKCGEVGHYIRECPNKKGNQNF 537
           + + G  QK    ++++                    C+ C +VGH+ + CP +  +Q  
Sbjct: 154 EKK-GQSQKGKKARSKS-----------KIGDRSKFKCFYCHKVGHFKKNCPERNRDQKS 201

Query: 538 KNDQANMASSSENVGEIFMVSDLCEKHAVNSVKSNSVIDNEWLIDSACTFHMSPFKNLFS 717
             D A++A+ S+      ++           V + S    +W++DS C++HM P K+ F 
Sbjct: 202 SADSADIAAISDGYESADVL-----------VVTTSQTQKDWVMDSGCSYHMCPKKDYFE 250

Query: 718 SYSEVKNCHVSMANEKKCEVLGIGDICLKFESGYAYTLKNVRHVPDLCNNLISCAALEDD 897
           +    +   V + ++  C+V GIG + LK      Y LK+VR+VPDL  NLIS +  +  
Sbjct: 251 TLKLKEGGTVLLGDDHPCQVQGIGTVRLKMFDNREYILKDVRYVPDLKRNLISISMFDSL 310

Query: 898 GLEGRFGNGVMKILKGSLVIFKAVK 972
           G   +  +GV+KIL GSLVI K  K
Sbjct: 311 GYATKTQHGVLKILNGSLVIAKGNK 335


>gb|KYP64673.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94
           [Cajanus cajan]
          Length = 780

 Score =  142 bits (357), Expect = 7e-34
 Identities = 95/325 (29%), Positives = 155/325 (47%), Gaps = 1/325 (0%)
 Frame = +1

Query: 1   LQELYTETSLPSXXXXXXXXXXXXXDLNKEIDENLDVFTKLIQDIKLTGDKNIDDYTPIV 180
           L+ LY   SL                  K I + L  F K++ D++    +  D+   ++
Sbjct: 88  LESLYMTKSLAHRLCLKQRLYSFKMTETKSIVDQLAEFNKILDDLENIEVQLEDEDKALL 147

Query: 181 LLNAIPESYSDVKSAIKYGRDS-ITLDTVVNGLKSKELDLKVNKGGRQNSGEVMHVRGRS 357
           LLN++P +Y   K AI YG++  ITLD V   +++KEL  + +     N   +   RGRS
Sbjct: 148 LLNSLPRNYEHFKDAILYGKEQDITLDEVQTSIRTKELQRQQDNKTDDNGESLNVSRGRS 207

Query: 358 QHRFGNHQKSDNHQNQNFHXXXXXXXXXXXXXXXXXXCYKCGEVGHYIRECPNKKGNQNF 537
           + + G  QK    ++++                    C+ C +VGH+ + CP +  +Q  
Sbjct: 208 EKK-GQSQKGKKARSKS-----------KIGDRSKFKCFYCHKVGHFKKNCPERNRDQKS 255

Query: 538 KNDQANMASSSENVGEIFMVSDLCEKHAVNSVKSNSVIDNEWLIDSACTFHMSPFKNLFS 717
             D A++A+ S+      ++           V + S    +W++DS C++HM P K+ F 
Sbjct: 256 SADSADIAAISDGYESADVL-----------VVTTSQTQKDWVMDSGCSYHMCPKKDYFE 304

Query: 718 SYSEVKNCHVSMANEKKCEVLGIGDICLKFESGYAYTLKNVRHVPDLCNNLISCAALEDD 897
           +    +   V + ++  C+V GIG + LK      Y LK+VR+VPDL  NLIS +  +  
Sbjct: 305 TLKLKEGGTVLLGDDHPCQVQGIGTVRLKMFDNREYILKDVRYVPDLKRNLISISMFDSL 364

Query: 898 GLEGRFGNGVMKILKGSLVIFKAVK 972
           G   +  +GV+KIL GSLVI K  K
Sbjct: 365 GYATKTQHGVLKILNGSLVIAKGNK 389


>sp|P10978.1|POLX_TOBAC RecName: Full=Retrovirus-related Pol polyprotein from transposon
           TNT 1-94; Includes: RecName: Full=Protease; Includes:
           RecName: Full=Reverse transcriptase; Includes: RecName:
           Full=Endonuclease
 emb|CAA32025.1| unnamed protein product [Nicotiana tabacum]
          Length = 1328

 Score =  142 bits (357), Expect = 9e-34
 Identities = 97/294 (32%), Positives = 146/294 (49%), Gaps = 4/294 (1%)
 Frame = +1

Query: 100 NLDVFTKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIKYGRDSITLDTVVNGLK 279
           +L+VF  LI  +   G K  ++   I+LLN++P SY ++ + I +G+ +I L  V + L 
Sbjct: 121 HLNVFNGLITQLANLGVKIEEEDKAILLLNSLPSSYDNLATTILHGKTTIELKDVTSALL 180

Query: 280 SKELDLKVNKGGRQNSGEVMHVRGRSQHRFGNHQKSDNHQNQNFHXXXXXXXXXXXXXXX 459
             E   K+ K   +N G+ +   GR +    ++Q+S N    N+                
Sbjct: 181 LNE---KMRKKP-ENQGQALITEGRGR----SYQRSSN----NYGRSGARGKSKNRSKSR 228

Query: 460 XXXCYKCGEVGHYIRECPN-KKGN---QNFKNDQANMASSSENVGEIFMVSDLCEKHAVN 627
              CY C + GH+ R+CPN +KG       KND    A    N   +  +++  E   ++
Sbjct: 229 VRNCYNCNQPGHFKRDCPNPRKGKGETSGQKNDDNTAAMVQNNDNVVLFINEEEECMHLS 288

Query: 628 SVKSNSVIDNEWLIDSACTFHMSPFKNLFSSYSEVKNCHVSMANEKKCEVLGIGDICLKF 807
             +S      EW++D+A + H +P ++LF  Y       V M N    ++ GIGDIC+K 
Sbjct: 289 GPES------EWVVDTAASHHATPVRDLFCRYVAGDFGTVKMGNTSYSKIAGIGDICIKT 342

Query: 808 ESGYAYTLKNVRHVPDLCNNLISCAALEDDGLEGRFGNGVMKILKGSLVIFKAV 969
             G    LK+VRHVPDL  NLIS  AL+ DG E  F N   ++ KGSLVI K V
Sbjct: 343 NVGCTLVLKDVRHVPDLRMNLISGIALDRDGYESYFANQKWRLTKGSLVIAKGV 396


>emb|CAN84102.1| hypothetical protein VITISV_041248 [Vitis vinifera]
          Length = 894

 Score =  141 bits (355), Expect = 1e-33
 Identities = 94/326 (28%), Positives = 160/326 (49%), Gaps = 5/326 (1%)
 Frame = +1

Query: 1   LQELYTETSLPSXXXXXXXXXXXXXDLNKEIDENLDVFTKLIQDIKLTGDKNIDDYTPIV 180
           L+ LY   SL +               +  I+E+LD F K+I D+K       ++   I+
Sbjct: 88  LESLYMTKSLANRLHKXIKLYTFKMTPSMSIEEHLDHFNKIILDLKNIDIAVSNEDKAIL 147

Query: 181 LLNAIPESYSDVKSAIKYGRDSITLDTVVNGLKSKELDLKVNKGGRQNSGEVMHVRGRSQ 360
           LL ++  SY+++K AI YGRD +T D V + L ++EL  +  +  ++  GE +++RG+S+
Sbjct: 148 LLTSLDASYTNMKEAIMYGRDILTFDEVQSILHARELHKQ--EESKEELGEGLNIRGKSK 205

Query: 361 HRF---GNHQKSDNHQNQNFHXXXXXXXXXXXXXXXXXXCYKCGEVGHYIRECPNKKGNQ 531
            R    GN+ KS +                         C+ C + GH+ ++CP+ + N 
Sbjct: 206 KREKKKGNNSKSRSKSKTK-----------------KFKCFICHKEGHFKKDCPDMRQNT 248

Query: 532 NFKNDQANMASSSENVGEIFMVSDLCEKHAVNSVKSNSVIDN--EWLIDSACTFHMSPFK 705
             K         + N G+  M+ D  +   V +V     +D+  EW++DS C+FHM P K
Sbjct: 249 XKK---------TMNEGDATMILDGYDNAGVLNVAE---VDSGKEWILDSGCSFHMCPIK 296

Query: 706 NLFSSYSEVKNCHVSMANEKKCEVLGIGDICLKFESGYAYTLKNVRHVPDLCNNLISCAA 885
             F  + E    HV + N K C++LG G + +K   G    L+++R++P+L  NLIS   
Sbjct: 297 AWFEDFKEANGGHVLLGNNKHCKILGTGTVKIKHYDGIERVLEDIRYIPELKMNLISLGM 356

Query: 886 LEDDGLEGRFGNGVMKILKGSLVIFK 963
           L+  G   +     +++ +GSL + K
Sbjct: 357 LDKLGYTFKSEPNSLRVARGSLTVMK 382


>gb|ABA98804.1| retrotransposon protein, putative, Ty1-copia subclass [Oryza sativa
           Japonica Group]
          Length = 1333

 Score =  141 bits (355), Expect = 2e-33
 Identities = 91/293 (31%), Positives = 146/293 (49%), Gaps = 9/293 (3%)
 Frame = +1

Query: 112 FTKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIKYGRDSITLDTVVNGLKSKEL 291
           F K++ D+     K  D+   ++LL ++P SY++ +  I   RD +TL  V + L++KE 
Sbjct: 122 FKKIVADLVSMEVKYDDEDLGLLLLCSLPNSYANFRDTILLSRDELTLKEVYDALQNKE- 180

Query: 292 DLKV---NKGGRQNSGEVMHVRGRSQHRFGNHQKSDNHQNQNFHXXXXXXXXXXXXXXXX 462
            +K+   N G   + GE +HVRGR+++R  N +  D                        
Sbjct: 181 KMKIMVQNDGSSSSKGEALHVRGRTENRTSNEKNYDRRGRSK-----------SKPPGNK 229

Query: 463 XXCYKCGEVGHYIRECPN-----KKGNQNFKNDQANMASSSENVGEIFMVSDLCEKHAVN 627
             C  C    H I EC       +K  ++ K   A+ A+S ++ G+  +V   C      
Sbjct: 230 KFCVYCKLKNHNIDECKKVQAKERKNKKDGKVSVASAAASDDDSGDCLVVFAGCVAG--- 286

Query: 628 SVKSNSVIDNEWLIDSACTFHMSPFKNLFSSYSEV-KNCHVSMANEKKCEVLGIGDICLK 804
                    +EW++DSAC+FH+   +N FSSY  V K   V M ++  C ++GIG + +K
Sbjct: 287 --------HDEWILDSACSFHICTKRNWFSSYKPVQKGDVVRMGDDNPCAIVGIGSVQIK 338

Query: 805 FESGYAYTLKNVRHVPDLCNNLISCAALEDDGLEGRFGNGVMKILKGSLVIFK 963
            + G   TLKNVR++P +  NLIS + L+ +G +    +GV+K+ KGSLV  K
Sbjct: 339 TDDGMTRTLKNVRYIPGMSRNLISLSTLDAEGYKYSGSDGVLKVSKGSLVCLK 391

