BLASTX nr result

ID: Rehmannia29_contig00002715 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia29_contig00002715
         (821 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011073037.1| uncharacterized protein LOC105158097 [Sesamu...   259   1e-80
gb|KZV22124.1| hypothetical protein F511_11652 [Dorcoceras hygro...   202   9e-61
ref|XP_012849618.1| PREDICTED: uncharacterized protein LOC105969...   170   3e-44
gb|PON54809.1| LOW QUALITY PROTEIN: Gag-Pol-related retrotranspo...   159   1e-42
ref|XP_011100917.1| uncharacterized protein LOC105179028 [Sesamu...   145   2e-39
gb|PON41343.1| Zinc finger, CCHC-type [Parasponia andersonii]         140   3e-36
gb|KZV56298.1| hypothetical protein F511_00295 [Dorcoceras hygro...   143   6e-35
gb|AAD23679.1| putative retroelement pol polyprotein [Arabidopsi...   140   6e-34
gb|PPR84446.1| hypothetical protein GOBAR_AA36262 [Gossypium bar...   134   8e-32
gb|PPS20553.1| hypothetical protein GOBAR_AA00016 [Gossypium bar...   134   8e-32
emb|CAN84102.1| hypothetical protein VITISV_041248 [Vitis vinifera]   133   2e-31
emb|CAN80490.1| hypothetical protein VITISV_004703 [Vitis vinifera]   132   3e-31
dbj|BAQ19356.1| putative gag-pol polyprotein [Torenia fournieri]      130   8e-31
gb|PKA49510.1| Retrovirus-related Pol polyprotein from transposo...   127   9e-31
gb|KHN13665.1| Retrovirus-related Pol polyprotein from transposo...   125   3e-30
emb|CAN67309.1| hypothetical protein VITISV_028165 [Vitis vinifera]   124   9e-30
gb|AAC62132.1| copia-like retroelement pol polyprotein [Arabidop...   127   3e-29
gb|AAD32759.1| putative retroelement pol polyprotein [Arabidopsi...   126   4e-29
gb|KYP36396.1| Retrovirus-related Pol polyprotein from transposo...   124   7e-29
gb|KYP64673.1| Retrovirus-related Pol polyprotein from transposo...   124   2e-28

>ref|XP_011073037.1| uncharacterized protein LOC105158097 [Sesamum indicum]
          Length = 472

 Score =  259 bits (663), Expect = 1e-80
 Identities = 150/281 (53%), Positives = 189/281 (67%), Gaps = 8/281 (2%)
 Frame = -1

Query: 821 EKFFRFKLDLNKEIDENLDVFTKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIK 642
           EK FR+KLDL+K IDENLD FTKLIQDIKL GDK ID+Y+PIVLLNAIPES+SDVK+AIK
Sbjct: 101 EKIFRYKLDLSKNIDENLDDFTKLIQDIKLAGDKYIDEYSPIVLLNAIPESFSDVKAAIK 160

Query: 641 YGRDSITLDTVVNGLKSKELDLKVNKGGRQNSGEVMHVRGRSQHRFGN-----HQKSDNH 477
           YGRDSI L+TVVNGLKSKELDLKVNK   Q+  E+  VRGR+  RFGN     + +S + 
Sbjct: 161 YGRDSINLETVVNGLKSKELDLKVNKPS-QSHYEINSVRGRT--RFGNFNSRYNSRSRSK 217

Query: 476 QNQNFHXXXXXXXXXXXXXXXXXKCYKCGEVGHYIREC--PNKKGNQNFKNDQANMAS-S 306
              N                   +CY CG  GHYI++C  P ++      +D+  +++ S
Sbjct: 218 TKTNRSKSRPRETNLRDDKIRDRRCYNCGTKGHYIKDCRKPRRENRDRNYDDKEKVSNVS 277

Query: 305 SENVGEIFMVSDLCEKHAVNSVKSNSVIDNEWLIDSACTFHMSPFKNLFSSYSEVKNCHV 126
            E+ GE+F+V      +  NSV +  +  +EWLIDS CTFHMSPFK++F++        V
Sbjct: 278 IESNGEVFVV------YEANSVSTFDM--HEWLIDSGCTFHMSPFKDIFTNLKYEHAGFV 329

Query: 125 SMANEKKCEVLGIGDICLKFESGYAYTLKNVRHVPDLCNNL 3
           SMANEKKCE+ G+GDI L F+ GY   LKNVR+VPDL +NL
Sbjct: 330 SMANEKKCEIKGLGDISLCFD-GYKMLLKNVRYVPDLSHNL 369


>gb|KZV22124.1| hypothetical protein F511_11652 [Dorcoceras hygrometricum]
          Length = 277

 Score =  202 bits (515), Expect = 9e-61
 Identities = 102/179 (56%), Positives = 125/179 (69%)
 Frame = -1

Query: 821 EKFFRFKLDLNKEIDENLDVFTKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIK 642
           EKFFRFKLDLNK++D NLDVFTKLIQDIKLTGDKNIDDYTPIVLLNAIP+ Y+DV+SAIK
Sbjct: 105 EKFFRFKLDLNKDLDGNLDVFTKLIQDIKLTGDKNIDDYTPIVLLNAIPDDYADVRSAIK 164

Query: 641 YGRDSITLDTVVNGLKSKELDLKVNKGGRQNSGEVMHVRGRSQHRFGNHQKSDNHQNQNF 462
           YGRD ITL+TV++GLKSKELDLK  KG + N GEVMHV GRS+ R+ NH  ++   +++ 
Sbjct: 165 YGRDKITLETVISGLKSKELDLKAYKGTKPNGGEVMHVGGRSKTRYRNHMGNNGDNSRSK 224

Query: 461 HXXXXXXXXXXXXXXXXXKCYKCGEVGHYIRECPNKKGNQNFKNDQANMASSSENVGEI 285
           +                  CY CGE GHY  +CP+ K    +K D   +   S N  ++
Sbjct: 225 Y-------------IGNRTCYNCGEKGHYKADCPHPK-EDKYKRDNTLLTEQSNNATDL 269


>ref|XP_012849618.1| PREDICTED: uncharacterized protein LOC105969411 [Erythranthe
           guttata]
          Length = 1213

 Score =  170 bits (431), Expect = 3e-44
 Identities = 83/114 (72%), Positives = 100/114 (87%)
 Frame = -1

Query: 821 EKFFRFKLDLNKEIDENLDVFTKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIK 642
           EKFFRFKLDL K+IDEN+D FT+L+QDIKLTGDK+ID+YTPIVLLNAIP+SY+D+KSAIK
Sbjct: 544 EKFFRFKLDLTKDIDENIDQFTRLVQDIKLTGDKSIDNYTPIVLLNAIPDSYNDLKSAIK 603

Query: 641 YGRDSITLDTVVNGLKSKELDLKVNKGGRQNSGEVMHVRGRSQHRFGNHQKSDN 480
           YGRD+I+LDTV+NGLKSKE+DL+VNK  + + GEV  VRGR Q+RF N   S N
Sbjct: 604 YGRDNISLDTVINGLKSKEMDLRVNKSNK-SFGEVNFVRGRQQNRFSNKPSSSN 656


>gb|PON54809.1| LOW QUALITY PROTEIN: Gag-Pol-related retrotransposon family protein
           [Trema orientalis]
          Length = 380

 Score =  159 bits (401), Expect = 1e-42
 Identities = 100/273 (36%), Positives = 142/273 (52%)
 Frame = -1

Query: 821 EKFFRFKLDLNKEIDENLDVFTKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIK 642
           E+FF FK+D +K I+ENLD +TKL+ D++  G K  D    I+LLN++P +  + K  +K
Sbjct: 111 ERFFGFKMDKSKSIEENLDDYTKLVLDLENLGIKVDDKDKAIILLNSLPRNLKNFKETLK 170

Query: 641 YGRDSITLDTVVNGLKSKELDLKVNKGGRQNSGEVMHVRGRSQHRFGNHQKSDNHQNQNF 462
           YGR +IT+D V N L+SK LD+K ++   Q  GE +H+RGR+        K DNH  +  
Sbjct: 171 YGRQTITVDEVQNALESKLLDMKGSEKNAQ--GEGLHIRGRT-------TKQDNHDGKG- 220

Query: 461 HXXXXXXXXXXXXXXXXXKCYKCGEVGHYIRECPNKKGNQNFKNDQANMASSSENVGEIF 282
                             K Y C + GH  R  P ++     K D           G+  
Sbjct: 221 -KSQSRSKSRGKKDYSKVKYYHCNKNGHIRRLRPERQNKDAGKLD-----------GDAV 268

Query: 281 MVSDLCEKHAVNSVKSNSVIDNEWLIDSACTFHMSPFKNLFSSYSEVKNCHVSMANEKKC 102
           +V D  E   V S+ S S    EW++DS C++HM P ++ F  Y EV    V M N   C
Sbjct: 269 IVDDGYESSEVLSI-SESENSKEWVMDSGCSYHMCPREDWFMDYQEVDGGKVLMGNNMAC 327

Query: 101 EVLGIGDICLKFESGYAYTLKNVRHVPDLCNNL 3
           +V+GIG I ++   G    LKNVRHVP+L  +L
Sbjct: 328 KVMGIGSISIRMFDGVTRILKNVRHVPELKRSL 360


>ref|XP_011100917.1| uncharacterized protein LOC105179028 [Sesamum indicum]
          Length = 188

 Score =  145 bits (366), Expect = 2e-39
 Identities = 71/86 (82%), Positives = 80/86 (93%)
 Frame = -1

Query: 821 EKFFRFKLDLNKEIDENLDVFTKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIK 642
           EK F +KLDL+K IDENLD FTKLIQDIKLTGDKNID+Y+PIVLLNAIP+SYSD K+AIK
Sbjct: 101 EKKFHYKLDLSKSIDENLDDFTKLIQDIKLTGDKNIDEYSPIVLLNAIPKSYSDAKAAIK 160

Query: 641 YGRDSITLDTVVNGLKSKELDLKVNK 564
           YGRDS+ LDTVVNGLKSKE+DLKV+K
Sbjct: 161 YGRDSVNLDTVVNGLKSKEMDLKVSK 186


>gb|PON41343.1| Zinc finger, CCHC-type [Parasponia andersonii]
          Length = 297

 Score =  140 bits (352), Expect = 3e-36
 Identities = 88/277 (31%), Positives = 143/277 (51%), Gaps = 4/277 (1%)
 Frame = -1

Query: 821 EKFFRFKLDLNKEIDENLDVFTKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIK 642
           ++   FK++  K I + +D F K+I D++    K  D+   ++LLNA+P++Y   K A+ 
Sbjct: 13  QRLHSFKMNEEKSISDQIDKFNKIIDDLENIEIKLEDEDKALILLNALPKAYEHFKDAML 72

Query: 641 YGRD-SITLDTVVNGLKSKELDLKVNKGGRQNSGEVMHVRGRSQHRFGNHQKSDNHQNQN 465
           YGR+ +ITLD V + +K+KEL  K  +G  +N+GE +  RGRS+       K DN   +N
Sbjct: 73  YGREQTITLDEVQSAVKAKELPRK-KEGKEENTGEGLMARGRSE-------KCDNKAPRN 124

Query: 464 FHXXXXXXXXXXXXXXXXXKCYKCGEVGHYIRECPNKKGNQNFKND---QANMASSSENV 294
                              KC+ C + GH+ R+CP++K   + K     +A++AS   + 
Sbjct: 125 ---------ESRSKSKGRLKCFHCHKEGHFKRDCPDRKKKVHEKPKDPGEASVASDGYDS 175

Query: 293 GEIFMVSDLCEKHAVNSVKSNSVIDNEWLIDSACTFHMSPFKNLFSSYSEVKNCHVSMAN 114
            E+ +V+D                  EW++DS C+FHM P K+ F +  +     V + N
Sbjct: 176 AEVLVVTD-------------EDSSKEWIMDSGCSFHMCPTKSWFENLEKTDGGSVLLGN 222

Query: 113 EKKCEVLGIGDICLKFESGYAYTLKNVRHVPDLCNNL 3
            K C+V GIG + ++   G    L+ VR+VP+L  NL
Sbjct: 223 NKPCKVAGIGSVRIRMFDGMERILQQVRYVPELKRNL 259


>gb|KZV56298.1| hypothetical protein F511_00295 [Dorcoceras hygrometricum]
          Length = 1309

 Score =  143 bits (361), Expect = 6e-35
 Identities = 83/273 (30%), Positives = 146/273 (53%)
 Frame = -1

Query: 821 EKFFRFKLDLNKEIDENLDVFTKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIK 642
           +  +   L+  K++ +++D F K+I D+K    K  D+   I++L+++P SY      + 
Sbjct: 104 KSLYTIHLEEGKDLKKHMDEFNKIILDLKNVDIKITDEDCAILMLSSLPRSYEHFVDTML 163

Query: 641 YGRDSITLDTVVNGLKSKELDLKVNKGGRQNSGEVMHVRGRSQHRFGNHQKSDNHQNQNF 462
           YG++++T+  V + L SKEL  K N+   +++GE ++VRGR+  R   ++K   H++Q+ 
Sbjct: 164 YGKETLTMAEVKSALNSKELHKK-NETKMESTGEGLNVRGRTYKRESRNEKGGKHRSQS- 221

Query: 461 HXXXXXXXXXXXXXXXXXKCYKCGEVGHYIRECPNKKGNQNFKNDQANMASSSENVGEIF 282
                             KC+ C + GH+ ++CP+++         A      ++ G+  
Sbjct: 222 ------------RTRGKLKCFVCHKEGHFKKDCPDRR---------ARNPERRKDPGDAA 260

Query: 281 MVSDLCEKHAVNSVKSNSVIDNEWLIDSACTFHMSPFKNLFSSYSEVKNCHVSMANEKKC 102
           +VSD  E   V  V   +  D  W++DS C+FHM P K+ F +  E ++ HV + N ++C
Sbjct: 261 VVSDGYESAEVLVVSRTNKQDC-WVMDSGCSFHMCPIKSWFQNLVEEESGHVLLGNNREC 319

Query: 101 EVLGIGDICLKFESGYAYTLKNVRHVPDLCNNL 3
           +V+GIG + LK   G   T+  VR+VPDL  NL
Sbjct: 320 KVMGIGSVLLKMHDGCVRTITEVRYVPDLRRNL 352


>gb|AAD23679.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
          Length = 838

 Score =  140 bits (353), Expect = 6e-34
 Identities = 82/280 (29%), Positives = 149/280 (53%), Gaps = 7/280 (2%)
 Frame = -1

Query: 821 EKFFRFKLDLNKEIDENLDVFTKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIK 642
           ++ + +K+  +  I+EN++ F KLI D++       D+   IVLL ++P+ +  +K  +K
Sbjct: 117 QRLYGYKMSDSMTIEENVNDFFKLISDLENVKVSVPDEDQAIVLLMSLPKQFDQLKDTLK 176

Query: 641 YGRDSITLDTVVNGLKSKELDLKVNKGGRQNSGEVMHV--RGRSQHRFGNHQKSDNHQNQ 468
           YG+ ++ LD +   ++SK L+L  +    +NS + + V  RGRS+ R    + S+ +++Q
Sbjct: 177 YGKTTLALDEITGAIRSKVLELGASGKMLKNSSDALFVQDRGRSEKR---DKSSERNKSQ 233

Query: 467 NFHXXXXXXXXXXXXXXXXXKCYKCGEVGHYIREC-----PNKKGNQNFKNDQANMASSS 303
           +                    C+ CG+ GH+ ++C      NKKGN + K + +N+    
Sbjct: 234 S-----------RSKSREKKVCWVCGKEGHFKKQCYVWKEKNKKGNNSEKGESSNV---- 278

Query: 302 ENVGEIFMVSDLCEKHAVNSVKSNSVIDNEWLIDSACTFHMSPFKNLFSSYSEVKNCHVS 123
             +G+    + L  +   N+   N  +DNEW++D+ C+FHM+P ++ F  + E +   V 
Sbjct: 279 --IGQAADAAALAVREESNA--DNQEVDNEWIMDTGCSFHMTPRRDWFVEFDESQTGRVK 334

Query: 122 MANEKKCEVLGIGDICLKFESGYAYTLKNVRHVPDLCNNL 3
           MAN+   E+ GIG I ++ +      LKNVR+VP +  NL
Sbjct: 335 MANQTYSEIKGIGSIRIQNDDNTTVLLKNVRYVPSMSKNL 374


>gb|PPR84446.1| hypothetical protein GOBAR_AA36262 [Gossypium barbadense]
          Length = 1841

 Score =  134 bits (338), Expect = 8e-32
 Identities = 86/279 (30%), Positives = 137/279 (49%), Gaps = 6/279 (2%)
 Frame = -1

Query: 821  EKFFRFKLDLNKEIDENLDVFTKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIK 642
            ++ +  K++    + ++LD F  +I D+    +K  D+   I++L ++P SY +    + 
Sbjct: 629  QRLYALKMEEGTPVSQHLDKFNSIIMDLNNIDNKIDDEDQAIIVLCSLPPSYENFVDTMM 688

Query: 641  YGRDSITLDTVVNGLKSKELDLKVN-KGGRQNSGEVMHVRGRSQHRFGNHQKSDNHQNQN 465
            YGRD +TL+ V N L S EL  K+  K    N GE +  RGRS+ + G+  KS       
Sbjct: 689  YGRDDLTLEEVKNALSSSELRKKITGKVVENNEGEGLVARGRSKAKGGSSSKSHPRSQSK 748

Query: 464  FHXXXXXXXXXXXXXXXXXKCYKCGEVGHYIRECPNKKG---NQNFKNDQANMAS--SSE 300
                               +CY C + GH   +CP +K    +Q  +ND+AN+A   SS 
Sbjct: 749  ----------------KRIQCYYCKKYGHMKVDCPKRKEKSESQEQQNDRANVADADSSS 792

Query: 299  NVGEIFMVSDLCEKHAVNSVKSNSVIDNEWLIDSACTFHMSPFKNLFSSYSEVKNCHVSM 120
            +   +  VSD             S     W++D+  TFH+S  K+ FS+Y E  +  V M
Sbjct: 793  DAEIVLAVSD-------------SYAGGRWILDTGATFHISTSKDAFSTY-EKHSGSVLM 838

Query: 119  ANEKKCEVLGIGDICLKFESGYAYTLKNVRHVPDLCNNL 3
             N+  C+V+GIG + +K   G   TL +VRH+P++  NL
Sbjct: 839  GNDHACQVMGIGTVRIKMFDGIVRTLTDVRHIPEMKKNL 877


>gb|PPS20553.1| hypothetical protein GOBAR_AA00016 [Gossypium barbadense]
          Length = 2351

 Score =  134 bits (338), Expect = 8e-32
 Identities = 86/279 (30%), Positives = 137/279 (49%), Gaps = 6/279 (2%)
 Frame = -1

Query: 821  EKFFRFKLDLNKEIDENLDVFTKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIK 642
            ++ +  K++    + ++LD F  +I D+    +K  D+   I++L ++P SY +    + 
Sbjct: 608  QRLYALKMEEGTPVSQHLDKFNSIIMDLNNIDNKIDDEDQAIIVLCSLPPSYENFVDTMM 667

Query: 641  YGRDSITLDTVVNGLKSKELDLKVN-KGGRQNSGEVMHVRGRSQHRFGNHQKSDNHQNQN 465
            YGRD +TL+ V N L S EL  K+  K    N GE +  RGRS+ + G+  KS       
Sbjct: 668  YGRDDLTLEEVKNALSSSELRKKITGKVVENNEGEGLVARGRSKAKGGSSSKSHPRSQSK 727

Query: 464  FHXXXXXXXXXXXXXXXXXKCYKCGEVGHYIRECPNKKG---NQNFKNDQANMAS--SSE 300
                               +CY C + GH   +CP +K    +Q  +ND+AN+A   SS 
Sbjct: 728  ----------------KRIQCYYCKKYGHMKVDCPKRKEKSESQEQQNDRANVADADSSS 771

Query: 299  NVGEIFMVSDLCEKHAVNSVKSNSVIDNEWLIDSACTFHMSPFKNLFSSYSEVKNCHVSM 120
            +   +  VSD             S     W++D+  TFH+S  K+ FS+Y E  +  V M
Sbjct: 772  DAEIVLAVSD-------------SYAGGRWILDTGATFHISTSKDAFSTY-EKHSGSVLM 817

Query: 119  ANEKKCEVLGIGDICLKFESGYAYTLKNVRHVPDLCNNL 3
             N+  C+V+GIG + +K   G   TL +VRH+P++  NL
Sbjct: 818  GNDHACQVMGIGTVRIKMFDGIVRTLTDVRHIPEMKKNL 856


>emb|CAN84102.1| hypothetical protein VITISV_041248 [Vitis vinifera]
          Length = 894

 Score =  133 bits (334), Expect = 2e-31
 Identities = 84/277 (30%), Positives = 143/277 (51%), Gaps = 5/277 (1%)
 Frame = -1

Query: 818 KFFRFKLDLNKEIDENLDVFTKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIKY 639
           K + FK+  +  I+E+LD F K+I D+K       ++   I+LL ++  SY+++K AI Y
Sbjct: 106 KLYTFKMTPSMSIEEHLDHFNKIILDLKNIDIAVSNEDKAILLLTSLDASYTNMKEAIMY 165

Query: 638 GRDSITLDTVVNGLKSKELDLKVNKGGRQNSGEVMHVRGRSQHRF---GNHQKSDNHQNQ 468
           GRD +T D V + L ++EL  +  +  ++  GE +++RG+S+ R    GN+ KS +    
Sbjct: 166 GRDILTFDEVQSILHARELHKQ--EESKEELGEGLNIRGKSKKREKKKGNNSKSRSKSKT 223

Query: 467 NFHXXXXXXXXXXXXXXXXXKCYKCGEVGHYIRECPNKKGNQNFKNDQANMASSSENVGE 288
                               KC+ C + GH+ ++CP+ + N   K         + N G+
Sbjct: 224 K-----------------KFKCFICHKEGHFKKDCPDMRQNTXKK---------TMNEGD 257

Query: 287 IFMVSDLCEKHAVNSVKSNSVIDN--EWLIDSACTFHMSPFKNLFSSYSEVKNCHVSMAN 114
             M+ D  +   V +V     +D+  EW++DS C+FHM P K  F  + E    HV + N
Sbjct: 258 ATMILDGYDNAGVLNVAE---VDSGKEWILDSGCSFHMCPIKAWFEDFKEANGGHVLLGN 314

Query: 113 EKKCEVLGIGDICLKFESGYAYTLKNVRHVPDLCNNL 3
            K C++LG G + +K   G    L+++R++P+L  NL
Sbjct: 315 NKHCKILGTGTVKIKHYDGIERVLEDIRYIPELKMNL 351


>emb|CAN80490.1| hypothetical protein VITISV_004703 [Vitis vinifera]
          Length = 777

 Score =  132 bits (333), Expect = 3e-31
 Identities = 81/266 (30%), Positives = 129/266 (48%)
 Frame = -1

Query: 800 LDLNKEIDENLDVFTKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIKYGRDSIT 621
           ++ +K ID+ LD F KL+ D++    K  D+   I+LLN++P+S    K  +KYG+D IT
Sbjct: 24  IEESKPIDDALDEFNKLVLDLESLDIKVEDEDKAIILLNSLPKSLKHFKETLKYGKDDIT 83

Query: 620 LDTVVNGLKSKELDLKVNKGGRQNSGEVMHVRGRSQHRFGNHQKSDNHQNQNFHXXXXXX 441
            D V N L +K LD+K     + N G+             +  KS +   +++       
Sbjct: 84  FDDVQNALNAKVLDMK--SSDKTNGGK-------------SRSKSKSKGKKDYRNVK--- 125

Query: 440 XXXXXXXXXXXKCYKCGEVGHYIRECPNKKGNQNFKNDQANMASSSENVGEIFMVSDLCE 261
                       CY C ++G   R CP+++  +            ++  G   ++ D  +
Sbjct: 126 ------------CYHCNKIGQIRRICPDRQQEEK-----------TQAQGSAAIIDDGYD 162

Query: 260 KHAVNSVKSNSVIDNEWLIDSACTFHMSPFKNLFSSYSEVKNCHVSMANEKKCEVLGIGD 81
              V +++ N   + EW++DS CT+HM P ++ FSSY EV    + + N   C V+GIG 
Sbjct: 163 STEVLTIRLNPNHE-EWVLDSGCTYHMCPRRDWFSSYQEVNGGKLLLGNNMSCNVVGIGT 221

Query: 80  ICLKFESGYAYTLKNVRHVPDLCNNL 3
           + +    G   TLK VRHVPDL  NL
Sbjct: 222 MAINMHDGKTRTLKEVRHVPDLKRNL 247


>dbj|BAQ19356.1| putative gag-pol polyprotein [Torenia fournieri]
          Length = 605

 Score =  130 bits (328), Expect = 8e-31
 Identities = 75/273 (27%), Positives = 141/273 (51%)
 Frame = -1

Query: 821 EKFFRFKLDLNKEIDENLDVFTKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIK 642
           +  + FK+   K IDE +D F KLI D++    K  D+   ++L+ A+P SY+  K  + 
Sbjct: 123 QALYSFKMIEEKAIDEQMDQFIKLILDLENIEVKIEDEDQALLLVCALPRSYNTFKDTLL 182

Query: 641 YGRDSITLDTVVNGLKSKELDLKVNKGGRQNSGEVMHVRGRSQHRFGNHQKSDNHQNQNF 462
           YGR+++TL  V   LKSK+L+ +++     ++ E ++V+G+ + +  + ++ +  + +  
Sbjct: 183 YGRETLTLKEVQAALKSKQLNTRIDNKAVGSTSEALYVKGKGEEKKTHKERKNKSKKK-- 240

Query: 461 HXXXXXXXXXXXXXXXXXKCYKCGEVGHYIRECPNKKGNQNFKNDQANMASSSENVGEIF 282
                             KC+ C E GH  + CP K+ ++  K +Q   A + E+     
Sbjct: 241 -----------------VKCFYCDEEGHMCKNCPKKERDKGKKVEQGEAAMACESYESAD 283

Query: 281 MVSDLCEKHAVNSVKSNSVIDNEWLIDSACTFHMSPFKNLFSSYSEVKNCHVSMANEKKC 102
           +++   E   V    + S    +WL+DSA +FH++  K+    +     C VS+  EK+ 
Sbjct: 284 VLAVTHEDQDV----TKSEKSGKWLLDSASSFHVTCVKSWIKDFKGCDGCLVSVGEEKQY 339

Query: 101 EVLGIGDICLKFESGYAYTLKNVRHVPDLCNNL 3
           ++LG G + ++ ++G    L+NV+ +PDL  NL
Sbjct: 340 KILGFGTVKIRLKTGGVRILRNVKFIPDLGRNL 372


>gb|PKA49510.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94
           [Apostasia shenzhenica]
          Length = 365

 Score =  127 bits (319), Expect = 9e-31
 Identities = 84/265 (31%), Positives = 137/265 (51%), Gaps = 7/265 (2%)
 Frame = -1

Query: 776 ENLDVFTKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIKYGRDSITLDTVVNGL 597
           E+++VF+K+I  ++    K  ++   ++LL+++P+SY  + + I YG+D++ ++ V   L
Sbjct: 9   EHMNVFSKMISQLRSINVKLEEENEALLLLSSLPKSYDHLVTTILYGKDTLKVEEVNATL 68

Query: 596 KSKELDLKVNKGGRQNSGEVMHV-----RGRSQHRFGNHQKSDNHQNQNFHXXXXXXXXX 432
            S E+  K      Q++GE + V     RGRS+++FGN  +  +   +N +         
Sbjct: 69  LSNEVRNK------QSTGESLTVKTSQDRGRSKNKFGNQYRYRSISKENDNR-------- 114

Query: 431 XXXXXXXXKCYKCGEVGHYIRECPNKKGNQNFKN--DQANMASSSENVGEIFMVSDLCEK 258
                    CY C + GH+ R+CP K   Q  K   ++A++AS  E   E      LC  
Sbjct: 115 ---------CYYCKKEGHWKRDCPKKSKQQQQKKSGEEASVASRLEKDSET-----LCTF 160

Query: 257 HAVNSVKSNSVIDNEWLIDSACTFHMSPFKNLFSSYSEVKNCHVSMANEKKCEVLGIGDI 78
             ++S  S       W++DS C++HM PF++ FS+YS      V M N  +C+ +GIG I
Sbjct: 161 SCMDSSDS-------WILDSDCSYHMCPFRDWFSTYSIHDGGRVIMGNNSECKSVGIGTI 213

Query: 77  CLKFESGYAYTLKNVRHVPDLCNNL 3
            +K   G   TL  VRHVPDL   L
Sbjct: 214 KIKMFDGVIRTLTEVRHVPDLRKGL 238


>gb|KHN13665.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94,
           partial [Glycine soja]
          Length = 337

 Score =  125 bits (314), Expect = 3e-30
 Identities = 75/258 (29%), Positives = 129/258 (50%), Gaps = 2/258 (0%)
 Frame = -1

Query: 821 EKFFRFKLDLNKEIDENLDVFTKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIK 642
           ++  + K++    I E++ +FTK + D+K    +  ++   ++LL ++P S+ ++   + 
Sbjct: 104 KRLHQLKMEEGSSIKEHVSLFTKAVLDLKSVDVRIDEEDQAVMLLCSLPSSFENLVDTML 163

Query: 641 YGRDSITLDTVVNGLKSKELDLKVNKG-GRQNSGEVMHVRGRSQHRFGNHQKSDNHQNQN 465
           +GRD++TL+ V   L S+EL  K+ +  G     E +  RGR + R     KS N +   
Sbjct: 164 FGRDTLTLEEVKATLNSRELKKKITENKGEGGDPEALMARGRLEKR---DSKSKNKRRSK 220

Query: 464 FHXXXXXXXXXXXXXXXXXKCYKCGEVGHYIRECPNKKGNQNFK-NDQANMASSSENVGE 288
           +                   CY C + GH+ +ECP +K   N K ND++++A  ++    
Sbjct: 221 YKNEKA--------------CYYCKKEGHFRKECPERKKKNNGKYNDESDIAVVADGYES 266

Query: 287 IFMVSDLCEKHAVNSVKSNSVIDNEWLIDSACTFHMSPFKNLFSSYSEVKNCHVSMANEK 108
             ++S   +KH+            EW++DS C+FHM+P    FSSY E+    V M N  
Sbjct: 267 AEVLSISTKKHS-----------EEWILDSGCSFHMTPNLEWFSSYKEIDGGKVLMGNNM 315

Query: 107 KCEVLGIGDICLKFESGY 54
            C V+GIG I LK + G+
Sbjct: 316 VCNVIGIGTIKLKVQDGF 333


>emb|CAN67309.1| hypothetical protein VITISV_028165 [Vitis vinifera]
          Length = 344

 Score =  124 bits (311), Expect = 9e-30
 Identities = 78/263 (29%), Positives = 127/263 (48%), Gaps = 3/263 (1%)
 Frame = -1

Query: 782 IDENLDVFTKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIKYGRDSITLDTVVN 603
           I ++ + F K I D++    K   +   I L+ ++P SY      + YGR ++ +  V  
Sbjct: 72  IKDHFNEFNKTIXDLRNIDVKVNYEDQAIFLMCSLPNSYEHFVDIMMYGRGTLFIKDVRV 131

Query: 602 GLKSKELDLKVNKGGRQNSGEVMHVRGRSQHRFGNHQKSDNHQNQNFHXXXXXXXXXXXX 423
            L S+EL   V K  + +SGE +  RGR++ +    +     +++  +            
Sbjct: 132 ALNSRELKKMVFKSRKYDSGEGLVARGRTRKKNNGRRGRSRSKSRGNNK----------- 180

Query: 422 XXXXXKCYKCGEVGHYIRECPNKKGNQNFKN---DQANMASSSENVGEIFMVSDLCEKHA 252
                 C+KC + GHY++  P++KG +N +N     A  A  + N  ++ +V        
Sbjct: 181 ------CFKCKKEGHYVKNXPDRKGKENKRNYNSGDATFAKENSNTTDVLLVX------V 228

Query: 251 VNSVKSNSVIDNEWLIDSACTFHMSPFKNLFSSYSEVKNCHVSMANEKKCEVLGIGDICL 72
            NS       D+EW++DS C++HMSP  + FS+Y  +    V M N+  C+V+GI  I +
Sbjct: 229 TNS-------DDEWILDSGCSYHMSPNGDWFSTYQPIDGGKVLMGNKVACKVVGIHXIQI 281

Query: 71  KFESGYAYTLKNVRHVPDLCNNL 3
           K   G   TL NVRHVP L  NL
Sbjct: 282 KMHGGIIRTLTNVRHVPKLNKNL 304


>gb|AAC62132.1| copia-like retroelement pol polyprotein [Arabidopsis thaliana]
          Length = 1137

 Score =  127 bits (318), Expect = 3e-29
 Identities = 79/252 (31%), Positives = 131/252 (51%), Gaps = 5/252 (1%)
 Frame = -1

Query: 818 KFFRFKLDLNKEIDENLDVFTKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIKY 639
           K + +++  +K ++EN+D F K+I D+     +  D+   I++L+A+P+SY  +K  +KY
Sbjct: 91  KVYNYRMQDSKTLEENVDEFQKMISDLNNLQIQVPDEVQAILILSALPDSYDMLKETLKY 150

Query: 638 GRDSITLDTVVNGLKSKELDLKVNKGGRQNSGEVMHVRGRSQHRFGNHQKSDNHQNQNFH 459
           GR+ I LD V++  KSKEL+L+ + GG +  GE ++VRG+SQ R  +  KS   +     
Sbjct: 151 GREGIKLDDVISAAKSKELELRDSSGGSRPVGEGLYVRGKSQARGSDGPKSTEGKK---- 206

Query: 458 XXXXXXXXXXXXXXXXXKCYKCGEVGHYIRECPNKKGNQNFKNDQANMASSSENVGEIFM 279
                             C+ CG+ GH+ R+C        +K  + N A+ +   GE  +
Sbjct: 207 -----------------VCWICGKEGHFKRQC--------YKWLEKNKANGA---GETAL 238

Query: 278 VSDLCEK---HAVNSVKSNSVIDN--EWLIDSACTFHMSPFKNLFSSYSEVKNCHVSMAN 114
           V D  +       + V  +   D+  EW++D+ C+FHM+P K     + E K+  V MAN
Sbjct: 239 VKDDAQDLVGLVASEVNMSEGKDDQEEWIMDTGCSFHMTPRKEYLMDFVEAKSGKVRMAN 298

Query: 113 EKKCEVLGIGDI 78
               EV GIG +
Sbjct: 299 NSFSEVKGIGKV 310


>gb|AAD32759.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
          Length = 1356

 Score =  126 bits (317), Expect = 4e-29
 Identities = 76/275 (27%), Positives = 135/275 (49%), Gaps = 2/275 (0%)
 Frame = -1

Query: 821 EKFFRFKLDLNKEIDENLDVFTKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIK 642
           +K + FK+  N  ++ N+D F ++I D++       D+   I+LL A+P+++  +K  +K
Sbjct: 120 QKLYSFKMSENLSVEGNIDEFLQIITDLENMNVIISDEDQAILLLTALPKAFDQLKDTLK 179

Query: 641 Y--GRDSITLDTVVNGLKSKELDLKVNKGGRQNSGEVMHVRGRSQHRFGNHQKSDNHQNQ 468
           Y  G+  +TLD V   + SKEL+L   K   +   E ++V+ +++++    QK      +
Sbjct: 180 YSSGKSILTLDEVAAAIYSKELELGSVKKSIKVQAEGLYVKDKNENKGKGEQKGKGKGKK 239

Query: 467 NFHXXXXXXXXXXXXXXXXXKCYKCGEVGHYIRECPNKKGNQNFKNDQANMASSSENVGE 288
                                C+ CGE GH+   CPN+   Q FK  Q     SS   G 
Sbjct: 240 G-------------KSKKKPGCWTCGEEGHFRSSCPNQNKPQ-FKQSQVVKGESSGGKGN 285

Query: 287 IFMVSDLCEKHAVNSVKSNSVIDNEWLIDSACTFHMSPFKNLFSSYSEVKNCHVSMANEK 108
           +   +      A++S + +  +++EW++D+ C++HM+  +  F  ++E     V M N+ 
Sbjct: 286 LAEAAGYYVSEALSSTEVH--LEDEWILDTGCSYHMTYKREWFHEFNEDAGGSVRMGNKT 343

Query: 107 KCEVLGIGDICLKFESGYAYTLKNVRHVPDLCNNL 3
              V G+G I +K   G    L NVR++PD+  NL
Sbjct: 344 VSRVRGVGTIRVKNSDGLTIVLTNVRYIPDMDRNL 378


>gb|KYP36396.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94
           [Cajanus cajan]
          Length = 485

 Score =  124 bits (311), Expect = 7e-29
 Identities = 78/274 (28%), Positives = 136/274 (49%), Gaps = 1/274 (0%)
 Frame = -1

Query: 821 EKFFRFKLDLNKEIDENLDVFTKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIK 642
           ++ + FK+   K I + L  F K++ D++    +  D+   ++LLN++P +Y   K AI 
Sbjct: 51  QRLYSFKMTETKSIVDQLAEFNKILDDLENIEVQLEDEDKALLLLNSLPRNYEHFKDAIL 110

Query: 641 YGRDS-ITLDTVVNGLKSKELDLKVNKGGRQNSGEVMHVRGRSQHRFGNHQKSDNHQNQN 465
           YG++  ITLD V   +++KEL  + +     N   +   RGRS+ + G  QK    ++++
Sbjct: 111 YGKEQDITLDEVQTSIRTKELQRQQDNKTDDNGESLNVSRGRSEKK-GQSQKGKKARSKS 169

Query: 464 FHXXXXXXXXXXXXXXXXXKCYKCGEVGHYIRECPNKKGNQNFKNDQANMASSSENVGEI 285
                              KC+ C +VGH+ + CP +  +Q    D A++A+ S+     
Sbjct: 170 -----------KIGDRSKFKCFYCHKVGHFKKNCPERNRDQKSSADSADIAAISDGYESA 218

Query: 284 FMVSDLCEKHAVNSVKSNSVIDNEWLIDSACTFHMSPFKNLFSSYSEVKNCHVSMANEKK 105
            ++           V + S    +W++DS C++HM P K+ F +    +   V + ++  
Sbjct: 219 DVL-----------VVTTSQTQKDWVMDSGCSYHMCPKKDYFETLKLKEGGTVLLGDDHP 267

Query: 104 CEVLGIGDICLKFESGYAYTLKNVRHVPDLCNNL 3
           C+V GIG + LK      Y LK+VR+VPDL  NL
Sbjct: 268 CQVQGIGTVRLKMFDNREYILKDVRYVPDLKRNL 301


>gb|KYP64673.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94
           [Cajanus cajan]
          Length = 780

 Score =  124 bits (311), Expect = 2e-28
 Identities = 78/274 (28%), Positives = 136/274 (49%), Gaps = 1/274 (0%)
 Frame = -1

Query: 821 EKFFRFKLDLNKEIDENLDVFTKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIK 642
           ++ + FK+   K I + L  F K++ D++    +  D+   ++LLN++P +Y   K AI 
Sbjct: 105 QRLYSFKMTETKSIVDQLAEFNKILDDLENIEVQLEDEDKALLLLNSLPRNYEHFKDAIL 164

Query: 641 YGRDS-ITLDTVVNGLKSKELDLKVNKGGRQNSGEVMHVRGRSQHRFGNHQKSDNHQNQN 465
           YG++  ITLD V   +++KEL  + +     N   +   RGRS+ + G  QK    ++++
Sbjct: 165 YGKEQDITLDEVQTSIRTKELQRQQDNKTDDNGESLNVSRGRSEKK-GQSQKGKKARSKS 223

Query: 464 FHXXXXXXXXXXXXXXXXXKCYKCGEVGHYIRECPNKKGNQNFKNDQANMASSSENVGEI 285
                              KC+ C +VGH+ + CP +  +Q    D A++A+ S+     
Sbjct: 224 -----------KIGDRSKFKCFYCHKVGHFKKNCPERNRDQKSSADSADIAAISDGYESA 272

Query: 284 FMVSDLCEKHAVNSVKSNSVIDNEWLIDSACTFHMSPFKNLFSSYSEVKNCHVSMANEKK 105
            ++           V + S    +W++DS C++HM P K+ F +    +   V + ++  
Sbjct: 273 DVL-----------VVTTSQTQKDWVMDSGCSYHMCPKKDYFETLKLKEGGTVLLGDDHP 321

Query: 104 CEVLGIGDICLKFESGYAYTLKNVRHVPDLCNNL 3
           C+V GIG + LK      Y LK+VR+VPDL  NL
Sbjct: 322 CQVQGIGTVRLKMFDNREYILKDVRYVPDLKRNL 355


Top