BLASTX nr result

ID: Rehmannia32_contig00000970 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia32_contig00000970
         (1055 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011073037.1| uncharacterized protein LOC105158097 [Sesamu...   339   e-110
gb|KZV22124.1| hypothetical protein F511_11652 [Dorcoceras hygro...   315   e-104
ref|XP_011100917.1| uncharacterized protein LOC105179028 [Sesamu...   269   7e-87
ref|XP_012849618.1| PREDICTED: uncharacterized protein LOC105969...   244   8e-69
gb|PON54809.1| LOW QUALITY PROTEIN: Gag-Pol-related retrotranspo...   174   2e-47
ref|XP_022895327.1| uncharacterized protein LOC111409515 [Olea e...   162   7e-43
gb|KZV56298.1| hypothetical protein F511_00295 [Dorcoceras hygro...   165   2e-41
gb|PON95322.1| hypothetical protein TorRG33x02_088440, partial [...   152   1e-40
gb|KYP67041.1| Retrovirus-related Pol polyprotein from transposo...   154   1e-40
gb|PON64464.1| hypothetical protein TorRG33x02_273130 [Trema ori...   150   3e-40
gb|PKU72844.1| Retrovirus-related Pol polyprotein from transposo...   160   1e-39
gb|PPR84446.1| hypothetical protein GOBAR_AA36262 [Gossypium bar...   159   2e-39
gb|PPS20553.1| hypothetical protein GOBAR_AA00016 [Gossypium bar...   159   2e-39
ref|XP_022881005.1| uncharacterized protein LOC111398320 [Olea e...   147   3e-39
gb|KHN13665.1| Retrovirus-related Pol polyprotein from transposo...   146   2e-37
emb|CAN84102.1| hypothetical protein VITISV_041248 [Vitis vinifera]   150   3e-36
gb|KHN13199.1| Retrovirus-related Pol polyprotein from transposo...   142   1e-35
gb|KHN13198.1| Retrovirus-related Pol polyprotein from transposo...   141   2e-35
gb|KYP64673.1| Retrovirus-related Pol polyprotein from transposo...   147   3e-35
ref|XP_022855577.1| uncharacterized protein LOC111376806 [Olea e...   138   3e-35

>ref|XP_011073037.1| uncharacterized protein LOC105158097 [Sesamum indicum]
          Length = 472

 Score =  339 bits (869), Expect = e-110
 Identities = 185/341 (54%), Positives = 238/341 (69%), Gaps = 8/341 (2%)
 Frame = +1

Query: 55   MSGYNLEPFNGKTDFSIWQQKMKGILIQQKVYKAITNTYVDTDTKEIKAETDEYAYSSII 234
            M+GY+L+PF+GKTDFSIWQQKMKGILIQQKV+KAI   Y +  T+E K E DE+AYSSI+
Sbjct: 1    MAGYSLQPFDGKTDFSIWQQKMKGILIQQKVFKAIDGKYAENITEEKKLENDEFAYSSIV 60

Query: 235  LNLSDSVLRKVGKQKSAKELWDKLQELYTETSLPSXXXXXXXXXXXXXDLNKDVDENLDV 414
            LNLSD+VLRKVGK +S+K LWDKL+EL+TE SLP+             DL+K++DENLD 
Sbjct: 61   LNLSDTVLRKVGKLESSKALWDKLEELFTEISLPNKLFLLEKIFRYKLDLSKNIDENLDD 120

Query: 415  FTKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIKYGRDSITLDTVVNGLKSKEL 594
            FTKLIQDIKL GDK ID+Y+PIVLLNAIPES+SDVK+AIKYGRDSI L+TVVNGLKSKEL
Sbjct: 121  FTKLIQDIKLAGDKYIDEYSPIVLLNAIPESFSDVKAAIKYGRDSINLETVVNGLKSKEL 180

Query: 595  DLKVNKGGRQNSGEVMHVRGRSQY-----RFDNQQKFDNHQNQNSNXXXXXXXXXXXXXX 759
            DLKVNK   Q+  E+  VRGR+++     R++++ +     N++ +              
Sbjct: 181  DLKVNKPS-QSHYEINSVRGRTRFGNFNSRYNSRSRSKTKTNRSKS--RPRETNLRDDKI 237

Query: 760  XXXXCYNCGEVGHYIREC--PNKKGNQNQSNDQANLASTS-ENAGDIFMVTGICDVPIVN 930
                CYNCG  GHYI++C  P ++      +D+  +++ S E+ G++F+V         N
Sbjct: 238  RDRRCYNCGTKGHYIKDCRKPRRENRDRNYDDKEKVSNVSIESNGEVFVVYE------AN 291

Query: 931  SVHSSTVCENEWLIDSACTFHMSPFKNLFSNYKEMKNSFVS 1053
            SV  ST   +EWLIDS CTFHMSPFK++F+N K     FVS
Sbjct: 292  SV--STFDMHEWLIDSGCTFHMSPFKDIFTNLKYEHAGFVS 330


>gb|KZV22124.1| hypothetical protein F511_11652 [Dorcoceras hygrometricum]
          Length = 277

 Score =  315 bits (808), Expect = e-104
 Identities = 171/280 (61%), Positives = 193/280 (68%), Gaps = 1/280 (0%)
 Frame = +1

Query: 55  MSGYNLEPFNGKTDFSIWQQKMKGILIQQKVYKAIT-NTYVDTDTKEIKAETDEYAYSSI 231
           M+ Y+LEPFNGKTDFSIWQQKMKGILIQQKVYKAI    Y +  +   + E DEYAYSSI
Sbjct: 4   MTAYHLEPFNGKTDFSIWQQKMKGILIQQKVYKAIDPEAYAEDVSAAKRKEDDEYAYSSI 63

Query: 232 ILNLSDSVLRKVGKQKSAKELWDKLQELYTETSLPSXXXXXXXXXXXXXDLNKDVDENLD 411
           ILNLSD+VLRK GK  +AK LW+KLQELYTETSLPS             DLNKD+D NLD
Sbjct: 64  ILNLSDAVLRKCGKLDTAKLLWEKLQELYTETSLPSKMFLLEKFFRFKLDLNKDLDGNLD 123

Query: 412 VFTKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIKYGRDSITLDTVVNGLKSKE 591
           VFTKLIQDIKLTGDKNIDDYTPIVLLNAIP+ Y+DV+SAIKYGRD ITL+TV++GLKSKE
Sbjct: 124 VFTKLIQDIKLTGDKNIDDYTPIVLLNAIPDDYADVRSAIKYGRDKITLETVISGLKSKE 183

Query: 592 LDLKVNKGGRQNSGEVMHVRGRSQYRFDNQQKFDNHQNQNSNXXXXXXXXXXXXXXXXXX 771
           LDLK  KG + N GEVMHV GRS+ R+ N    +N  N  S                   
Sbjct: 184 LDLKAYKGTKPNGGEVMHVGGRSKTRYRNHMG-NNGDNSRSK------------YIGNRT 230

Query: 772 CYNCGEVGHYIRECPNKKGNQNQSNDQANLASTSENAGDI 891
           CYNCGE GHY  +CP+ K       D   L   S NA D+
Sbjct: 231 CYNCGEKGHYKADCPHPK-EDKYKRDNTLLTEQSNNATDL 269


>ref|XP_011100917.1| uncharacterized protein LOC105179028 [Sesamum indicum]
          Length = 188

 Score =  269 bits (687), Expect = 7e-87
 Identities = 133/186 (71%), Positives = 157/186 (84%)
 Frame = +1

Query: 55  MSGYNLEPFNGKTDFSIWQQKMKGILIQQKVYKAITNTYVDTDTKEIKAETDEYAYSSII 234
           M GY+L+ F+GK+DFSIWQQKMKGILIQQKV+KAI + Y D  + E K + DE+AYSSII
Sbjct: 1   MDGYSLQSFDGKSDFSIWQQKMKGILIQQKVFKAIDSKYTDNISDEKKIQNDEFAYSSII 60

Query: 235 LNLSDSVLRKVGKQKSAKELWDKLQELYTETSLPSXXXXXXXXXXXXXDLNKDVDENLDV 414
           LNLSD+VLRKVGKQ S+K+LW+KL++LYTETSLPS             DL+K +DENLD 
Sbjct: 61  LNLSDNVLRKVGKQSSSKDLWEKLEDLYTETSLPSKLFLLEKKFHYKLDLSKSIDENLDD 120

Query: 415 FTKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIKYGRDSITLDTVVNGLKSKEL 594
           FTKLIQDIKLTGDKNID+Y+PIVLLNAIP+SYSD K+AIKYGRDS+ LDTVVNGLKSKE+
Sbjct: 121 FTKLIQDIKLTGDKNIDEYSPIVLLNAIPKSYSDAKAAIKYGRDSVNLDTVVNGLKSKEM 180

Query: 595 DLKVNK 612
           DLKV+K
Sbjct: 181 DLKVSK 186


>ref|XP_012849618.1| PREDICTED: uncharacterized protein LOC105969411 [Erythranthe guttata]
          Length = 1213

 Score =  244 bits (622), Expect = 8e-69
 Identities = 128/202 (63%), Positives = 152/202 (75%)
 Frame = +1

Query: 112  QKMKGILIQQKVYKAITNTYVDTDTKEIKAETDEYAYSSIILNLSDSVLRKVGKQKSAKE 291
            QKMKG+LIQQ+ Y AI  +Y    T   K E DE AYS+IILNLSDSV+RKVG   SAK 
Sbjct: 463  QKMKGVLIQQRCYVAIDESYAAETTASKKIELDELAYSAIILNLSDSVIRKVGMHDSAKG 522

Query: 292  LWDKLQELYTETSLPSXXXXXXXXXXXXXDLNKDVDENLDVFTKLIQDIKLTGDKNIDDY 471
            LW+KL ELYTETSLPS             DL KD+DEN+D FT+L+QDIKLTGDK+ID+Y
Sbjct: 523  LWEKLDELYTETSLPSKLFLLEKFFRFKLDLTKDIDENIDQFTRLVQDIKLTGDKSIDNY 582

Query: 472  TPIVLLNAIPESYSDVKSAIKYGRDSITLDTVVNGLKSKELDLKVNKGGRQNSGEVMHVR 651
            TPIVLLNAIP+SY+D+KSAIKYGRD+I+LDTV+NGLKSKE+DL+VNK  + + GEV  VR
Sbjct: 583  TPIVLLNAIPDSYNDLKSAIKYGRDNISLDTVINGLKSKEMDLRVNKSNK-SFGEVNFVR 641

Query: 652  GRSQYRFDNQQKFDNHQNQNSN 717
            GR Q RF N+    N QN + N
Sbjct: 642  GRQQNRFSNKPSSSN-QNVSQN 662


>gb|PON54809.1| LOW QUALITY PROTEIN: Gag-Pol-related retrotransposon family protein
            [Trema orientalis]
          Length = 380

 Score =  174 bits (440), Expect = 2e-47
 Identities = 113/331 (34%), Positives = 173/331 (52%), Gaps = 7/331 (2%)
 Frame = +1

Query: 64   YNLEPFNGKTDFSIWQQKMKGILIQQKVYKAITNTYVDTDTKEIKAETDEY-------AY 222
            +++E F GK DF +W+ KM+ IL+QQ + KA+ +  +    KE  AE  +        AY
Sbjct: 7    FDIEKFTGKNDFELWKMKMEAILVQQGLEKALLSEDLTATDKESLAEMKKKIEEVSPKAY 66

Query: 223  SSIILNLSDSVLRKVGKQKSAKELWDKLQELYTETSLPSXXXXXXXXXXXXXDLNKDVDE 402
            S+IIL+LSD VLRKV ++K    +W KL+ELY   +LP              D +K ++E
Sbjct: 67   SAIILSLSDQVLRKVLREKIISGIWIKLEELYRAKTLPGRIYLKERFFGFKMDKSKSIEE 126

Query: 403  NLDVFTKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIKYGRDSITLDTVVNGLK 582
            NLD +TKL+ D++  G K  D    I+LLN++P +  + K  +KYGR +IT+D V N L+
Sbjct: 127  NLDDYTKLVLDLENLGIKVDDKDKAIILLNSLPRNLKNFKETLKYGRQTITVDEVQNALE 186

Query: 583  SKELDLKVNKGGRQNSGEVMHVRGRSQYRFDNQQKFDNHQNQNSNXXXXXXXXXXXXXXX 762
            SK LD+K ++   Q  GE +H+RGR+        K DNH  +  +               
Sbjct: 187  SKLLDMKGSEKNAQ--GEGLHIRGRT-------TKQDNHDGKGKSQSRSKSRGKKDYSKV 237

Query: 763  XXXCYNCGEVGHYIRECPNKKGNQNQSNDQANLASTSENAGDIFMVTGICDVPIVNSVHS 942
                Y+C + GH  R  P     + Q+ D   L       GD  +V    +   V S+ S
Sbjct: 238  KY--YHCNKNGHIRRLRP-----ERQNKDAGKL------DGDAVIVDDGYESSEVLSI-S 283

Query: 943  STVCENEWLIDSACTFHMSPFKNLFSNYKEM 1035
             +    EW++DS C++HM P ++ F +Y+E+
Sbjct: 284  ESENSKEWVMDSGCSYHMCPREDWFMDYQEV 314


>ref|XP_022895327.1| uncharacterized protein LOC111409515 [Olea europaea var.
           sylvestris]
          Length = 370

 Score =  162 bits (409), Expect = 7e-43
 Identities = 83/204 (40%), Positives = 133/204 (65%)
 Frame = +1

Query: 49  MAMSGYNLEPFNGKTDFSIWQQKMKGILIQQKVYKAITNTYVDTDTKEIKAETDEYAYSS 228
           MA + + +  ++G+ DF+IW QKM+ ILIQ K  KA+ NT+    +   K E +E A+S+
Sbjct: 1   MATAKFEMSMYDGRWDFNIWSQKMRTILIQMKCAKALDNTWPAEMSAGKKTELEEIAWST 60

Query: 229 IILNLSDSVLRKVGKQKSAKELWDKLQELYTETSLPSXXXXXXXXXXXXXDLNKDVDENL 408
           I L +SDSV+R +G+ K+A+ELW KL+  Y   ++P+             D + D+DENL
Sbjct: 61  IFLYISDSVIRTIGETKTAEELWTKLKAQYEPKTIPNKCFLLKQFFSFKMDPSVDLDENL 120

Query: 409 DVFTKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIKYGRDSITLDTVVNGLKSK 588
           D FTKL QD+    +K ++D   +VLLN+I + Y D+K+A+KYGRD++T+D ++N L++K
Sbjct: 121 DRFTKLTQDLANCDEKLLEDQLAVVLLNSISDRYRDLKNALKYGRDNLTIDIIINTLRNK 180

Query: 589 ELDLKVNKGGRQNSGEVMHVRGRS 660
            L+LK +    Q SGE + ++G++
Sbjct: 181 VLELKSDSINHQ-SGENLLLKGKN 203


>gb|KZV56298.1| hypothetical protein F511_00295 [Dorcoceras hygrometricum]
          Length = 1309

 Score =  165 bits (418), Expect = 2e-41
 Identities = 102/338 (30%), Positives = 177/338 (52%), Gaps = 4/338 (1%)
 Frame = +1

Query: 49   MAMSGYNLEPFNGKTDFSIWQQKMKGILIQQKVYKAIT-NTYVDTDTKEIKAETDEYAYS 225
            M+ + ++LE F G  DFS+W+ KMK +L+   +  A+      DT  K+   ETD  A+S
Sbjct: 1    MSTTKFDLEKFTGSNDFSLWRIKMKALLVHTGLGGALNPEPQDDTIDKKKIVETDSKAFS 60

Query: 226  SIILNLSDSVLRKVGKQKSAKELWDKLQELYTETSLPSXXXXXXXXXXXXXDLNKDVDEN 405
            +I+L L D VLR+V ++ SA  LW+KL+ LY + SL +             +  KD+ ++
Sbjct: 61   AILLCLGDEVLREVAEEVSALSLWNKLESLYLKRSLANRLYLKKSLYTIHLEEGKDLKKH 120

Query: 406  LDVFTKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIKYGRDSITLDTVVNGLKS 585
            +D F K+I D+K    K  D+   I++L+++P SY      + YG++++T+  V + L S
Sbjct: 121  MDEFNKIILDLKNVDIKITDEDCAILMLSSLPRSYEHFVDTMLYGKETLTMAEVKSALNS 180

Query: 586  KELDLKVNKGGRQNSGEVMHVRGRSQYRFDNQQKFDNHQNQNSNXXXXXXXXXXXXXXXX 765
            KEL  K N+   +++GE ++VRGR+  R    +K   H++Q+                  
Sbjct: 181  KELH-KKNETKMESTGEGLNVRGRTYKRESRNEKGGKHRSQSRT-------------RGK 226

Query: 766  XXCYNCGEVGHYIRECPNKKG-NQNQSNDQANLASTSE--NAGDIFMVTGICDVPIVNSV 936
              C+ C + GH+ ++CP+++  N  +  D  + A  S+   + ++ +V            
Sbjct: 227  LKCFVCHKEGHFKKDCPDRRARNPERRKDPGDAAVVSDGYESAEVLVV------------ 274

Query: 937  HSSTVCENEWLIDSACTFHMSPFKNLFSNYKEMKNSFV 1050
             S T  ++ W++DS C+FHM P K+ F N  E ++  V
Sbjct: 275  -SRTNKQDCWVMDSGCSFHMCPIKSWFQNLVEEESGHV 311


>gb|PON95322.1| hypothetical protein TorRG33x02_088440, partial [Trema orientalis]
          Length = 240

 Score =  152 bits (384), Expect = 1e-40
 Identities = 90/248 (36%), Positives = 143/248 (57%), Gaps = 3/248 (1%)
 Frame = +1

Query: 64  YNLEPFNGKTDFSIWQQKMKGILIQQKVYKAITN--TYVDTDTKEIKAETDEYAYSSIIL 237
           + L PF+G  DFS W++KMK +L+Q K++KA+ +  T   T T   + E  E AYS IIL
Sbjct: 6   FELSPFDGSGDFSSWRKKMKALLVQHKLHKALEDPTTLPTTMTDVQRLELQENAYSIIIL 65

Query: 238 NLSDSVLRKVGKQKSAKELWDKLQELYTETSLPSXXXXXXXXXXXXXDLNKDVDENLDVF 417
            L+D+VLR++  + +A + W+KL++LY   SL +             D NK++++NLD F
Sbjct: 66  YLADNVLRQIDGEDTAFKAWNKLEQLYLTKSLTNRILLKEKFFGFRMDTNKNLEQNLDDF 125

Query: 418 TKLIQDIKLTGDKNI-DDYTPIVLLNAIPESYSDVKSAIKYGRDSITLDTVVNGLKSKEL 594
            K+   +    ++ I D+   I+LLN++P+SY +VK+AIK+GR SITLD V   L+S EL
Sbjct: 126 KKIAITLASIDEEKIGDESQAIILLNSLPDSYREVKAAIKFGRKSITLDEVTAALRSWEL 185

Query: 595 DLKVNKGGRQNSGEVMHVRGRSQYRFDNQQKFDNHQNQNSNXXXXXXXXXXXXXXXXXXC 774
           ++K ++     +GE ++VRGRS+ R    +     +++N +                  C
Sbjct: 186 EMK-SEAKSSGNGESLNVRGRSKDRNSKGRGKSRSKSKNGS--------------KSFKC 230

Query: 775 YNCGEVGH 798
           Y+C E GH
Sbjct: 231 YHCHEEGH 238


>gb|KYP67041.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus
            cajan]
          Length = 325

 Score =  154 bits (390), Expect = 1e-40
 Identities = 100/343 (29%), Positives = 172/343 (50%), Gaps = 8/343 (2%)
 Frame = +1

Query: 49   MAMSGYNLEPFNGKTDFSIWQQKMKGILIQQKVYKAI--TNTYVDTDTKEIKAETDEYAY 222
            M+   ++++ FNG  DF++W+ KMK +L+ Q    A+   +    T T ++K    E A+
Sbjct: 1    MSSYKFDIDKFNGSNDFTLWKLKMKAVLVHQGCAAALEGADKLPTTMTDDVKKAMLEKAH 60

Query: 223  SSIILNLSDSVLRKVGKQKSAKELWDKLQELYTETSLPSXXXXXXXXXXXXXDLNKDVDE 402
            S I+L+L+D VLR+VG++ +A  +W  L++ + + SL +               N  V +
Sbjct: 61   SLILLSLTDEVLREVGEETTAAGMWKMLEDKFQKKSLTNRLYQKQRLYTLQMSENMSVRD 120

Query: 403  NLDVFTKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIKYGRDSITLDTVVNGLK 582
            +LD F ++I D++  G K  D+   I+LL ++P+SY +    + YGRDSITL+ V + L+
Sbjct: 121  HLDNFNRIILDLQSIGVKVDDEDLAIILLCSLPKSYENFIDTMLYGRDSITLNNVKDSLQ 180

Query: 583  SKELDLKVNKGGRQNSGEVMHVRGRSQYRFDNQQKFDNHQNQNSNXXXXXXXXXXXXXXX 762
            SK+L  +V      +   +   RGRS  R ++ +    H   NS                
Sbjct: 181  SKKLKRRVVSSSNVDDVGLTVSRGRSMERGNSSK---GHTRSNS-----------LSKSK 226

Query: 763  XXXCYNCGEVGHYIRECPNKKGNQNQS------NDQANLASTSENAGDIFMVTGICDVPI 924
               CY C EVGH  + CP  K N+N +         A ++S S + GD   V  +  +  
Sbjct: 227  KVRCYKCKEVGHIRKNCPQLKKNRNSNASAAVVRSSATVSSESSDEGDGGDVLTVSTIGF 286

Query: 925  VNSVHSSTVCENEWLIDSACTFHMSPFKNLFSNYKEMKNSFVS 1053
             ++          W+ID+  ++HM+  + LF+++KEM + + S
Sbjct: 287  ADT----------WVIDTGASYHMTFNRKLFNSFKEMGHEYWS 319


>gb|PON64464.1| hypothetical protein TorRG33x02_273130 [Trema orientalis]
          Length = 225

 Score =  150 bits (380), Expect = 3e-40
 Identities = 81/209 (38%), Positives = 129/209 (61%), Gaps = 2/209 (0%)
 Frame = +1

Query: 49  MAMSGYNLEPFNGKTDFSIWQQKMKGILIQQKVYKAITNT--YVDTDTKEIKAETDEYAY 222
           M  S   +E F+GK DF++W++KMK +L+QQK  K + +     +T     K E  E AY
Sbjct: 1   MGTSKIEIEKFDGKGDFNMWKKKMKAVLVQQKCAKVLGDASGLPETMKPSEKEELLETAY 60

Query: 223 SSIILNLSDSVLRKVGKQKSAKELWDKLQELYTETSLPSXXXXXXXXXXXXXDLNKDVDE 402
           S +ILNL+D+VLR+V +Q +A ++W KL  LY   +L +             D  K +++
Sbjct: 61  SLLILNLADNVLRQVDEQDTAAKVWSKLDSLYLTKTLSNKIYLKEQLFGFKMDSTKSLED 120

Query: 403 NLDVFTKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIKYGRDSITLDTVVNGLK 582
           NLD F ++   +    +K  D+   I++LN++PESY D+KS IKYGR+S++LD V+  L+
Sbjct: 121 NLDDFKRITVSLANIDEKINDENQAIIILNSLPESYKDLKSTIKYGRESLSLDDVLRALR 180

Query: 583 SKELDLKVNKGGRQNSGEVMHVRGRSQYR 669
           S +L++K+ K   +++GE + VRGR+Q R
Sbjct: 181 SHDLEVKIEK---RSNGEGLQVRGRTQKR 206


>gb|PKU72844.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94
           [Dendrobium catenatum]
          Length = 993

 Score =  160 bits (404), Expect = 1e-39
 Identities = 101/284 (35%), Positives = 154/284 (54%), Gaps = 10/284 (3%)
 Frame = +1

Query: 115 KMKGILIQQKVYKAIT--NTYVDTDTKEIKAETDEYAYSSIILNLSDSVLRKVGKQKSAK 288
           K++ ILIQQ V KA+   +    T + + K    + A+SSIIL L+D VLRKV   K+  
Sbjct: 2   KLEAILIQQGVEKALLPESELPSTMSDQEKLSIQKKAFSSIILCLADQVLRKVSHVKTVS 61

Query: 289 ELWDKLQELYTETSLPSXXXXXXXXXXXXXDLNKDVDENLDVFTKLIQDIKLTGDKNIDD 468
           ELW KL+ELY + +LP+             D  K +D+NLD F KLI D++    K  D+
Sbjct: 62  ELWKKLEELYRQKTLPNRIYLKEKFFGYKMDEAKSIDDNLDEFNKLILDLENLEVKIEDE 121

Query: 469 YTPIVLLNAIPESYSDVKSAIKYGRDSITLDTVVNGLKSKELDLKVNKGGRQNSGEVMHV 648
              I+LLN++P+S  + K  +KYGR++IT+D V N L SK LD+K+++  + +SGE +HV
Sbjct: 122 DKAIILLNSLPKSLRNFKETLKYGRETITVDEVQNALSSKILDMKISE--KNHSGEGLHV 179

Query: 649 RGRSQYRFDNQQKFDNHQNQNSNXXXXXXXXXXXXXXXXXXCYNCGEVGHYIRECPNKK- 825
           RGRSQ R  +Q+K+ +     S                   C+ C + GH  R CP K  
Sbjct: 180 RGRSQKRGTSQKKWKSKSRSKS---------ASKKDYKNVKCWQCNKTGHIRRFCPEKNP 230

Query: 826 GNQNQSNDQANLASTSENAGDIFMVTGI-------CDVPIVNSV 936
            +++QS   A +   + ++ D+  V+ +       CDV  + S+
Sbjct: 231 KDKSQSQGDAAIVGENYDSADVLNVSDLLLGNNKACDVVGIGSI 274


>gb|PPR84446.1| hypothetical protein GOBAR_AA36262 [Gossypium barbadense]
          Length = 1841

 Score =  159 bits (402), Expect = 2e-39
 Identities = 105/333 (31%), Positives = 166/333 (49%), Gaps = 6/333 (1%)
 Frame = +1

Query: 64   YNLEPFNGKTDFSIWQQKMKGILIQQKVYKAIT--NTYVDTDTKEIKAETDEYAYSSIIL 237
            Y++E F GK  FS+W+ KM+ +L+QQ + KA++  +    T ++E K +  E A+S+I+L
Sbjct: 530  YDVEKFTGKNSFSLWRIKMRAVLVQQGLLKALSGKDKLPSTLSEEQKDDMLERAHSAILL 589

Query: 238  NLSDSVLRKVGKQKSAKELWDKLQELYTETSLPSXXXXXXXXXXXXXDLNKDVDENLDVF 417
             L D VLR+V  +K+A  LW +L+  Y   SL +             +    V ++LD F
Sbjct: 590  CLGDEVLREVADEKTASGLWLRLESKYMTKSLTNRLYLKQRLYALKMEEGTPVSQHLDKF 649

Query: 418  TKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIKYGRDSITLDTVVNGLKSKELD 597
              +I D+    +K  D+   I++L ++P SY +    + YGRD +TL+ V N L S EL 
Sbjct: 650  NSIIMDLNNIDNKIDDEDQAIIVLCSLPPSYENFVDTMMYGRDDLTLEEVKNALSSSELR 709

Query: 598  LKV-NKGGRQNSGEVMHVRGRSQYRFDNQQKFDNHQNQNSNXXXXXXXXXXXXXXXXXXC 774
             K+  K    N GE +  RGRS+ +  +  K  +H    S                   C
Sbjct: 710  KKITGKVVENNEGEGLVARGRSKAKGGSSSK--SHPRSQSK--------------KRIQC 753

Query: 775  YNCGEVGHYIRECPNKK---GNQNQSNDQANLASTSENAGDIFMVTGICDVPIVNSVHSS 945
            Y C + GH   +CP +K    +Q Q ND+AN+A    ++          D  IV +V S 
Sbjct: 754  YYCKKYGHMKVDCPKRKEKSESQEQQNDRANVADADSSS----------DAEIVLAV-SD 802

Query: 946  TVCENEWLIDSACTFHMSPFKNLFSNYKEMKNS 1044
            +     W++D+  TFH+S  K+ FS Y++   S
Sbjct: 803  SYAGGRWILDTGATFHISTSKDAFSTYEKHSGS 835


>gb|PPS20553.1| hypothetical protein GOBAR_AA00016 [Gossypium barbadense]
          Length = 2351

 Score =  159 bits (402), Expect = 2e-39
 Identities = 105/333 (31%), Positives = 166/333 (49%), Gaps = 6/333 (1%)
 Frame = +1

Query: 64   YNLEPFNGKTDFSIWQQKMKGILIQQKVYKAIT--NTYVDTDTKEIKAETDEYAYSSIIL 237
            Y++E F GK  FS+W+ KM+ +L+QQ + KA++  +    T ++E K +  E A+S+I+L
Sbjct: 509  YDVEKFTGKNSFSLWRIKMRAVLVQQGLLKALSGKDKLPSTLSEEQKDDMLERAHSAILL 568

Query: 238  NLSDSVLRKVGKQKSAKELWDKLQELYTETSLPSXXXXXXXXXXXXXDLNKDVDENLDVF 417
             L D VLR+V  +K+A  LW +L+  Y   SL +             +    V ++LD F
Sbjct: 569  CLGDEVLREVADEKTASGLWLRLESKYMTKSLTNRLYLKQRLYALKMEEGTPVSQHLDKF 628

Query: 418  TKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIKYGRDSITLDTVVNGLKSKELD 597
              +I D+    +K  D+   I++L ++P SY +    + YGRD +TL+ V N L S EL 
Sbjct: 629  NSIIMDLNNIDNKIDDEDQAIIVLCSLPPSYENFVDTMMYGRDDLTLEEVKNALSSSELR 688

Query: 598  LKV-NKGGRQNSGEVMHVRGRSQYRFDNQQKFDNHQNQNSNXXXXXXXXXXXXXXXXXXC 774
             K+  K    N GE +  RGRS+ +  +  K  +H    S                   C
Sbjct: 689  KKITGKVVENNEGEGLVARGRSKAKGGSSSK--SHPRSQSK--------------KRIQC 732

Query: 775  YNCGEVGHYIRECPNKK---GNQNQSNDQANLASTSENAGDIFMVTGICDVPIVNSVHSS 945
            Y C + GH   +CP +K    +Q Q ND+AN+A    ++          D  IV +V S 
Sbjct: 733  YYCKKYGHMKVDCPKRKEKSESQEQQNDRANVADADSSS----------DAEIVLAV-SD 781

Query: 946  TVCENEWLIDSACTFHMSPFKNLFSNYKEMKNS 1044
            +     W++D+  TFH+S  K+ FS Y++   S
Sbjct: 782  SYAGGRWILDTGATFHISTSKDAFSTYEKHSGS 814


>ref|XP_022881005.1| uncharacterized protein LOC111398320 [Olea europaea var.
           sylvestris]
          Length = 200

 Score =  147 bits (371), Expect = 3e-39
 Identities = 70/185 (37%), Positives = 117/185 (63%)
 Frame = +1

Query: 49  MAMSGYNLEPFNGKTDFSIWQQKMKGILIQQKVYKAITNTYVDTDTKEIKAETDEYAYSS 228
           MA + + +  F+GK D+++W QKM  IL+Q +  +A+ +T+        KAE +E A+S+
Sbjct: 1   MATTKFEVARFDGKIDYNMWSQKMNAILMQMRCVRALDDTWPQDMNPTRKAELEEIAWST 60

Query: 229 IILNLSDSVLRKVGKQKSAKELWDKLQELYTETSLPSXXXXXXXXXXXXXDLNKDVDENL 408
           I L LS++V+R +G+ K+A ELW KL++ Y   ++P+             D + ++DENL
Sbjct: 61  IFLYLSENVIRTIGETKTASELWTKLEKQYVTKTIPNKCYMLKQLFSFKMDPSTNLDENL 120

Query: 409 DVFTKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIKYGRDSITLDTVVNGLKSK 588
           + FTKLIQ++    +K   D   ++LLN+I E Y D+K A++YGRD +T + ++N LK+K
Sbjct: 121 NTFTKLIQNLNNCDEKLSQDQLAVILLNSISERYKDIKVALEYGRDELTTEIIINALKNK 180

Query: 589 ELDLK 603
            L++K
Sbjct: 181 ALEIK 185


>gb|KHN13665.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial
            [Glycine soja]
          Length = 337

 Score =  146 bits (369), Expect = 2e-37
 Identities = 94/328 (28%), Positives = 164/328 (50%), Gaps = 4/328 (1%)
 Frame = +1

Query: 64   YNLEPFNGKTDFSIWQQKMKGILIQQKVYKAITNTYVDTDTKEIKAETD--EYAYSSIIL 237
            +++E F G+ DFS+ + KM+ +L+ Q +  A+        T   K + D    A+S+IIL
Sbjct: 5    FDVEKFTGENDFSLRRIKMQALLVHQGLDDALQGASKLPSTLSDKEKKDLLSKAHSTIIL 64

Query: 238  NLSDSVLRKVGKQKSAKELWDKLQELYTETSLPSXXXXXXXXXXXXXDLNKDVDENLDVF 417
            +L D VLR+V ++KSA  +W KL+ LY   SL +             +    + E++ +F
Sbjct: 65   SLGDEVLREVAEEKSAAGIWLKLESLYMTKSLTNKLYLKKRLHQLKMEEGSSIKEHVSLF 124

Query: 418  TKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIKYGRDSITLDTVVNGLKSKELD 597
            TK + D+K    +  ++   ++LL ++P S+ ++   + +GRD++TL+ V   L S+EL 
Sbjct: 125  TKAVLDLKSVDVRIDEEDQAVMLLCSLPSSFENLVDTMLFGRDTLTLEEVKATLNSRELK 184

Query: 598  LKVNKG-GRQNSGEVMHVRGRSQYRFDNQQKFDNHQNQNSNXXXXXXXXXXXXXXXXXXC 774
             K+ +  G     E +  RGR + R    +     + +N                    C
Sbjct: 185  KKITENKGEGGDPEALMARGRLEKRDSKSKNKRRSKYKNEK-----------------AC 227

Query: 775  YNCGEVGHYIRECP-NKKGNQNQSNDQANLASTSENAGDIFMVTGICDVPIVNSVHSSTV 951
            Y C + GH+ +ECP  KK N  + ND++++A  ++      +++      I    HS   
Sbjct: 228  YYCKKEGHFRKECPERKKKNNGKYNDESDIAVVADGYESAEVLS------ISTKKHS--- 278

Query: 952  CENEWLIDSACTFHMSPFKNLFSNYKEM 1035
               EW++DS C+FHM+P    FS+YKE+
Sbjct: 279  --EEWILDSGCSFHMTPNLEWFSSYKEI 304


>emb|CAN84102.1| hypothetical protein VITISV_041248 [Vitis vinifera]
          Length = 894

 Score =  150 bits (378), Expect = 3e-36
 Identities = 101/337 (29%), Positives = 174/337 (51%), Gaps = 8/337 (2%)
 Frame = +1

Query: 64   YNLEPFNGKTDFSIWQQKMKGILIQQKVYKAITN--TYVDTDTKEIKAETDEYAYSSIIL 237
            +++E F GK DF +W+ KM+ +L+QQ +  A+        T  ++ K E  E A+ +IIL
Sbjct: 6    FDVEKFTGKNDFGLWRLKMRALLVQQGLQDALLGEKNLPXTMQEKHKIELLEKAHGAIIL 65

Query: 238  NLSDSVLRKVGKQKSAKELWDKLQELYTETSLPSXXXXXXXXXXXXXDLNKDVDENLDVF 417
            +L D+ LR+V K KSA +L  KL+ LY   SL +               +  ++E+LD F
Sbjct: 66   SLGDTXLREVAKAKSAAKLLLKLESLYMTKSLANRLHKXIKLYTFKMTPSMSIEEHLDHF 125

Query: 418  TKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIKYGRDSITLDTVVNGLKSKELD 597
             K+I D+K       ++   I+LL ++  SY+++K AI YGRD +T D V + L ++EL 
Sbjct: 126  NKIILDLKNIDIAVSNEDKAILLLTSLDASYTNMKEAIMYGRDILTFDEVQSILHARELH 185

Query: 598  LKVNKGGRQNSGEVMHVRGRSQYRFDNQQKFDNHQNQNSNXXXXXXXXXXXXXXXXXXCY 777
             +  +  ++  GE +++RG+S+ R   ++K +N ++++ +                  C+
Sbjct: 186  KQ--EESKEELGEGLNIRGKSKKR--EKKKGNNSKSRSKS------------KTKKFKCF 229

Query: 778  NCGEVGHYIRECPNKKGNQNQSNDQANLASTSENAGDIFMV------TGICDVPIVNSVH 939
             C + GH+ ++CP+ + N  +          + N GD  M+       G+ +V  V+S  
Sbjct: 230  ICHKEGHFKKDCPDMRQNTXKK---------TMNEGDATMILDGYDNAGVLNVAEVDS-- 278

Query: 940  SSTVCENEWLIDSACTFHMSPFKNLFSNYKEMKNSFV 1050
                   EW++DS C+FHM P K  F ++KE     V
Sbjct: 279  -----GKEWILDSGCSFHMCPIKAWFEDFKEANGGHV 310


>gb|KHN13199.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial
            [Glycine soja]
          Length = 344

 Score =  142 bits (358), Expect = 1e-35
 Identities = 90/334 (26%), Positives = 169/334 (50%), Gaps = 5/334 (1%)
 Frame = +1

Query: 49   MAMSGYNLEPFNGKTDFSIWQQKMKGILIQQKVYKAITNTYVDTD--TKEIKAETDEYAY 222
            M+ + +++E F GK +F++W+ KM  +L QQ+   A+    +     T   K    + AY
Sbjct: 1    MSSTKFDVEKFTGKNNFNLWRVKMLALLTQQECELALEGEEMLPAELTAAQKRVIMKKAY 60

Query: 223  SSIILNLSDSVLRKVGKQKSAKELWDKLQELYTETSLPSXXXXXXXXXXXXXDLNKDVDE 402
            S+I+L+L D VL ++  +K+A +LW KL+  Y   SL +                + + +
Sbjct: 61   SAILLSLGDEVLGEISGEKTADKLWAKLESRYMTKSLHNRLCLKKQLYTMQMHEGESIHK 120

Query: 403  NLDVFTKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIKYGRDSITLDTVVNGLK 582
            ++D F +++  +K       D+   ++LL+++P +Y +    I +GR S++++ V   L+
Sbjct: 121  HIDNFNQVVLSLKNIDVAVDDEDQAVLLLSSLPRAYDNFVDTIIFGRSSLSMEEVKTALQ 180

Query: 583  SKELDLKVNKG-GRQNSGEVMHVRGRSQYRFDNQQKFDNHQNQNSNXXXXXXXXXXXXXX 759
            S EL  ++    G  +SGE + VRGR   R   Q++    +++N N              
Sbjct: 181  SWELKRRITDSYGGTSSGEGLMVRGRMDERKSFQRRRSKSRSKNKNNNK----------- 229

Query: 760  XXXXCYNCGEVGHYIRECPNKKGNQNQSNDQANLASTSENA--GDIFMVTGICDVPIVNS 933
                C+NC + GH+ R CP  K ++  +++    A  SE +  G++  V           
Sbjct: 230  ----CHNCQKEGHWKRNCPELKKDKVSTSEFGGAAVVSEESDGGNVLFV----------- 274

Query: 934  VHSSTVCENEWLIDSACTFHMSPFKNLFSNYKEM 1035
              SS V +++W++DSACTFHM+P ++ F+ ++ +
Sbjct: 275  --SSNVNDDDWILDSACTFHMTPNRDWFATFQNV 306


>gb|KHN13198.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial
            [Glycine soja]
          Length = 341

 Score =  141 bits (356), Expect = 2e-35
 Identities = 90/329 (27%), Positives = 166/329 (50%), Gaps = 5/329 (1%)
 Frame = +1

Query: 64   YNLEPFNGKTDFSIWQQKMKGILIQQKVYKAITNTYV--DTDTKEIKAETDEYAYSSIIL 237
            +++E F GK +F++W+ KM  +L QQ+   A+    +     T   K    + AYS+I+L
Sbjct: 3    FDVEKFTGKNNFNLWRVKMLALLTQQECELALEGEEMLPAEMTAAQKRVIMKKAYSAILL 62

Query: 238  NLSDSVLRKVGKQKSAKELWDKLQELYTETSLPSXXXXXXXXXXXXXDLNKDVDENLDVF 417
            +L D VL +V  +K+A +LW KL+  Y   SL +                + + +++D F
Sbjct: 63   SLGDEVLGEVSGEKTADKLWAKLESRYMTKSLHNRLCLKKQLYTMQMHEGESIHKHIDNF 122

Query: 418  TKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIKYGRDSITLDTVVNGLKSKELD 597
             +++  +K       D+   ++LL+++P +Y +    I +GR S++++ V   L+S EL 
Sbjct: 123  NQVVLSLKNIDVAVDDEDQAVLLLSSLPRAYDNFVDTIIFGRSSLSMEEVKTALQSWELK 182

Query: 598  LKVNKG-GRQNSGEVMHVRGRSQYRFDNQQKFDNHQNQNSNXXXXXXXXXXXXXXXXXXC 774
             ++    G  +SGE + VRGR   R   Q++    +++N N                  C
Sbjct: 183  RRITDSYGGTSSGEGLMVRGRMDERKSFQRRRSKSRSKNKNNNK---------------C 227

Query: 775  YNCGEVGHYIRECPNKKGNQNQSNDQANLASTSENA--GDIFMVTGICDVPIVNSVHSST 948
            +NC + GH+ R CP  K ++  +++    A  SE +  G++  V             SS 
Sbjct: 228  HNCQKEGHWKRNCPELKKDKVSTSEFGGAAVVSEESDGGNVLFV-------------SSN 274

Query: 949  VCENEWLIDSACTFHMSPFKNLFSNYKEM 1035
            V +++W++DSACTFHM+P ++ F+ ++ +
Sbjct: 275  VNDDDWILDSACTFHMTPNRDWFATFQNV 303


>gb|KYP64673.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus
            cajan]
          Length = 780

 Score =  147 bits (370), Expect = 3e-35
 Identities = 95/332 (28%), Positives = 162/332 (48%), Gaps = 5/332 (1%)
 Frame = +1

Query: 49   MAMSGYNLEPFNGKTDFSIWQQKMKGILIQQKVYKAITNTYVDTDTKEIKAETD--EYAY 222
            M  + Y++E F+G+ DF +W+ KM+ ILIQQ   +AI      + +   K +T+  E A 
Sbjct: 1    MGNTKYDIEKFSGENDFGLWRIKMEAILIQQGCAEAIKGEEKMSSSLTQKEKTNMIEKAR 60

Query: 223  SSIILNLSDSVLRKVGKQKSAKELWDKLQELYTETSLPSXXXXXXXXXXXXXDLNKDVDE 402
            S+IIL L D  LR+V ++K+A  +W KL+ LY   SL                  K + +
Sbjct: 61   SAIILCLGDKALREVAREKTAAAMWLKLESLYMTKSLAHRLCLKQRLYSFKMTETKSIVD 120

Query: 403  NLDVFTKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIKYGRD-SITLDTVVNGL 579
             L  F K++ D++    +  D+   ++LLN++P +Y   K AI YG++  ITLD V   +
Sbjct: 121  QLAEFNKILDDLENIEVQLEDEDKALLLLNSLPRNYEHFKDAILYGKEQDITLDEVQTSI 180

Query: 580  KSKELDLKVNKGGRQNSGEVMHVRGRSQYRFDNQQKFDNHQNQNSNXXXXXXXXXXXXXX 759
            ++KEL  + +     N   +   RGRS+ +  +Q+                         
Sbjct: 181  RTKELQRQQDNKTDDNGESLNVSRGRSEKKGQSQKGKKARSKSK------------IGDR 228

Query: 760  XXXXCYNCGEVGHYIRECPNKKGNQNQSNDQANLASTSE--NAGDIFMVTGICDVPIVNS 933
                C+ C +VGH+ + CP +  +Q  S D A++A+ S+   + D+ +VT          
Sbjct: 229  SKFKCFYCHKVGHFKKNCPERNRDQKSSADSADIAAISDGYESADVLVVT---------- 278

Query: 934  VHSSTVCENEWLIDSACTFHMSPFKNLFSNYK 1029
               ++  + +W++DS C++HM P K+ F   K
Sbjct: 279  ---TSQTQKDWVMDSGCSYHMCPKKDYFETLK 307


>ref|XP_022855577.1| uncharacterized protein LOC111376806 [Olea europaea var.
           sylvestris]
          Length = 249

 Score =  138 bits (348), Expect = 3e-35
 Identities = 73/203 (35%), Positives = 125/203 (61%)
 Frame = +1

Query: 52  AMSGYNLEPFNGKTDFSIWQQKMKGILIQQKVYKAITNTYVDTDTKEIKAETDEYAYSSI 231
           A + + +  F+G+ DFS+W QKMK IL+QQ+  + + N++        + E +E A+SSI
Sbjct: 3   ATAKFEMSMFDGRGDFSMWSQKMKVILMQQRCARVLDNSWPAELPPGRRTELEETAWSSI 62

Query: 232 ILNLSDSVLRKVGKQKSAKELWDKLQELYTETSLPSXXXXXXXXXXXXXDLNKDVDENLD 411
            L LS++V+R +G+ K+  ELW+KL+  Y   ++P+             D + D++ENL+
Sbjct: 63  FLYLSNNVIRTIGETKTTSELWNKLKAQYEPKTVPNKCFLLKQFFSFKIDPSIDLEENLN 122

Query: 412 VFTKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIKYGRDSITLDTVVNGLKSKE 591
            FTKL QD+    +K   D   +VLLN+I + + D+K+A++YGR++ T D + N L+++ 
Sbjct: 123 RFTKLTQDLANCDEKLSQDQLAVVLLNSISDRHRDLKNALEYGRENFTTDIITNALRNEV 182

Query: 592 LDLKVNKGGRQNSGEVMHVRGRS 660
           L+LK +    Q SGE + +RG++
Sbjct: 183 LELK-SDSINQQSGENLLLRGKN 204


Top