BLASTX nr result
ID: Rehmannia32_contig00000970
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Rehmannia32_contig00000970 (1055 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 149,584,005 sequences; 54,822,741,787 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_011073037.1| uncharacterized protein LOC105158097 [Sesamu... 339 e-110 gb|KZV22124.1| hypothetical protein F511_11652 [Dorcoceras hygro... 315 e-104 ref|XP_011100917.1| uncharacterized protein LOC105179028 [Sesamu... 269 7e-87 ref|XP_012849618.1| PREDICTED: uncharacterized protein LOC105969... 244 8e-69 gb|PON54809.1| LOW QUALITY PROTEIN: Gag-Pol-related retrotranspo... 174 2e-47 ref|XP_022895327.1| uncharacterized protein LOC111409515 [Olea e... 162 7e-43 gb|KZV56298.1| hypothetical protein F511_00295 [Dorcoceras hygro... 165 2e-41 gb|PON95322.1| hypothetical protein TorRG33x02_088440, partial [... 152 1e-40 gb|KYP67041.1| Retrovirus-related Pol polyprotein from transposo... 154 1e-40 gb|PON64464.1| hypothetical protein TorRG33x02_273130 [Trema ori... 150 3e-40 gb|PKU72844.1| Retrovirus-related Pol polyprotein from transposo... 160 1e-39 gb|PPR84446.1| hypothetical protein GOBAR_AA36262 [Gossypium bar... 159 2e-39 gb|PPS20553.1| hypothetical protein GOBAR_AA00016 [Gossypium bar... 159 2e-39 ref|XP_022881005.1| uncharacterized protein LOC111398320 [Olea e... 147 3e-39 gb|KHN13665.1| Retrovirus-related Pol polyprotein from transposo... 146 2e-37 emb|CAN84102.1| hypothetical protein VITISV_041248 [Vitis vinifera] 150 3e-36 gb|KHN13199.1| Retrovirus-related Pol polyprotein from transposo... 142 1e-35 gb|KHN13198.1| Retrovirus-related Pol polyprotein from transposo... 141 2e-35 gb|KYP64673.1| Retrovirus-related Pol polyprotein from transposo... 147 3e-35 ref|XP_022855577.1| uncharacterized protein LOC111376806 [Olea e... 138 3e-35 >ref|XP_011073037.1| uncharacterized protein LOC105158097 [Sesamum indicum] Length = 472 Score = 339 bits (869), Expect = e-110 Identities = 185/341 (54%), Positives = 238/341 (69%), Gaps = 8/341 (2%) Frame = +1 Query: 55 MSGYNLEPFNGKTDFSIWQQKMKGILIQQKVYKAITNTYVDTDTKEIKAETDEYAYSSII 234 M+GY+L+PF+GKTDFSIWQQKMKGILIQQKV+KAI Y + T+E K E DE+AYSSI+ Sbjct: 1 MAGYSLQPFDGKTDFSIWQQKMKGILIQQKVFKAIDGKYAENITEEKKLENDEFAYSSIV 60 Query: 235 LNLSDSVLRKVGKQKSAKELWDKLQELYTETSLPSXXXXXXXXXXXXXDLNKDVDENLDV 414 LNLSD+VLRKVGK +S+K LWDKL+EL+TE SLP+ DL+K++DENLD Sbjct: 61 LNLSDTVLRKVGKLESSKALWDKLEELFTEISLPNKLFLLEKIFRYKLDLSKNIDENLDD 120 Query: 415 FTKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIKYGRDSITLDTVVNGLKSKEL 594 FTKLIQDIKL GDK ID+Y+PIVLLNAIPES+SDVK+AIKYGRDSI L+TVVNGLKSKEL Sbjct: 121 FTKLIQDIKLAGDKYIDEYSPIVLLNAIPESFSDVKAAIKYGRDSINLETVVNGLKSKEL 180 Query: 595 DLKVNKGGRQNSGEVMHVRGRSQY-----RFDNQQKFDNHQNQNSNXXXXXXXXXXXXXX 759 DLKVNK Q+ E+ VRGR+++ R++++ + N++ + Sbjct: 181 DLKVNKPS-QSHYEINSVRGRTRFGNFNSRYNSRSRSKTKTNRSKS--RPRETNLRDDKI 237 Query: 760 XXXXCYNCGEVGHYIREC--PNKKGNQNQSNDQANLASTS-ENAGDIFMVTGICDVPIVN 930 CYNCG GHYI++C P ++ +D+ +++ S E+ G++F+V N Sbjct: 238 RDRRCYNCGTKGHYIKDCRKPRRENRDRNYDDKEKVSNVSIESNGEVFVVYE------AN 291 Query: 931 SVHSSTVCENEWLIDSACTFHMSPFKNLFSNYKEMKNSFVS 1053 SV ST +EWLIDS CTFHMSPFK++F+N K FVS Sbjct: 292 SV--STFDMHEWLIDSGCTFHMSPFKDIFTNLKYEHAGFVS 330 >gb|KZV22124.1| hypothetical protein F511_11652 [Dorcoceras hygrometricum] Length = 277 Score = 315 bits (808), Expect = e-104 Identities = 171/280 (61%), Positives = 193/280 (68%), Gaps = 1/280 (0%) Frame = +1 Query: 55 MSGYNLEPFNGKTDFSIWQQKMKGILIQQKVYKAIT-NTYVDTDTKEIKAETDEYAYSSI 231 M+ Y+LEPFNGKTDFSIWQQKMKGILIQQKVYKAI Y + + + E DEYAYSSI Sbjct: 4 MTAYHLEPFNGKTDFSIWQQKMKGILIQQKVYKAIDPEAYAEDVSAAKRKEDDEYAYSSI 63 Query: 232 ILNLSDSVLRKVGKQKSAKELWDKLQELYTETSLPSXXXXXXXXXXXXXDLNKDVDENLD 411 ILNLSD+VLRK GK +AK LW+KLQELYTETSLPS DLNKD+D NLD Sbjct: 64 ILNLSDAVLRKCGKLDTAKLLWEKLQELYTETSLPSKMFLLEKFFRFKLDLNKDLDGNLD 123 Query: 412 VFTKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIKYGRDSITLDTVVNGLKSKE 591 VFTKLIQDIKLTGDKNIDDYTPIVLLNAIP+ Y+DV+SAIKYGRD ITL+TV++GLKSKE Sbjct: 124 VFTKLIQDIKLTGDKNIDDYTPIVLLNAIPDDYADVRSAIKYGRDKITLETVISGLKSKE 183 Query: 592 LDLKVNKGGRQNSGEVMHVRGRSQYRFDNQQKFDNHQNQNSNXXXXXXXXXXXXXXXXXX 771 LDLK KG + N GEVMHV GRS+ R+ N +N N S Sbjct: 184 LDLKAYKGTKPNGGEVMHVGGRSKTRYRNHMG-NNGDNSRSK------------YIGNRT 230 Query: 772 CYNCGEVGHYIRECPNKKGNQNQSNDQANLASTSENAGDI 891 CYNCGE GHY +CP+ K D L S NA D+ Sbjct: 231 CYNCGEKGHYKADCPHPK-EDKYKRDNTLLTEQSNNATDL 269 >ref|XP_011100917.1| uncharacterized protein LOC105179028 [Sesamum indicum] Length = 188 Score = 269 bits (687), Expect = 7e-87 Identities = 133/186 (71%), Positives = 157/186 (84%) Frame = +1 Query: 55 MSGYNLEPFNGKTDFSIWQQKMKGILIQQKVYKAITNTYVDTDTKEIKAETDEYAYSSII 234 M GY+L+ F+GK+DFSIWQQKMKGILIQQKV+KAI + Y D + E K + DE+AYSSII Sbjct: 1 MDGYSLQSFDGKSDFSIWQQKMKGILIQQKVFKAIDSKYTDNISDEKKIQNDEFAYSSII 60 Query: 235 LNLSDSVLRKVGKQKSAKELWDKLQELYTETSLPSXXXXXXXXXXXXXDLNKDVDENLDV 414 LNLSD+VLRKVGKQ S+K+LW+KL++LYTETSLPS DL+K +DENLD Sbjct: 61 LNLSDNVLRKVGKQSSSKDLWEKLEDLYTETSLPSKLFLLEKKFHYKLDLSKSIDENLDD 120 Query: 415 FTKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIKYGRDSITLDTVVNGLKSKEL 594 FTKLIQDIKLTGDKNID+Y+PIVLLNAIP+SYSD K+AIKYGRDS+ LDTVVNGLKSKE+ Sbjct: 121 FTKLIQDIKLTGDKNIDEYSPIVLLNAIPKSYSDAKAAIKYGRDSVNLDTVVNGLKSKEM 180 Query: 595 DLKVNK 612 DLKV+K Sbjct: 181 DLKVSK 186 >ref|XP_012849618.1| PREDICTED: uncharacterized protein LOC105969411 [Erythranthe guttata] Length = 1213 Score = 244 bits (622), Expect = 8e-69 Identities = 128/202 (63%), Positives = 152/202 (75%) Frame = +1 Query: 112 QKMKGILIQQKVYKAITNTYVDTDTKEIKAETDEYAYSSIILNLSDSVLRKVGKQKSAKE 291 QKMKG+LIQQ+ Y AI +Y T K E DE AYS+IILNLSDSV+RKVG SAK Sbjct: 463 QKMKGVLIQQRCYVAIDESYAAETTASKKIELDELAYSAIILNLSDSVIRKVGMHDSAKG 522 Query: 292 LWDKLQELYTETSLPSXXXXXXXXXXXXXDLNKDVDENLDVFTKLIQDIKLTGDKNIDDY 471 LW+KL ELYTETSLPS DL KD+DEN+D FT+L+QDIKLTGDK+ID+Y Sbjct: 523 LWEKLDELYTETSLPSKLFLLEKFFRFKLDLTKDIDENIDQFTRLVQDIKLTGDKSIDNY 582 Query: 472 TPIVLLNAIPESYSDVKSAIKYGRDSITLDTVVNGLKSKELDLKVNKGGRQNSGEVMHVR 651 TPIVLLNAIP+SY+D+KSAIKYGRD+I+LDTV+NGLKSKE+DL+VNK + + GEV VR Sbjct: 583 TPIVLLNAIPDSYNDLKSAIKYGRDNISLDTVINGLKSKEMDLRVNKSNK-SFGEVNFVR 641 Query: 652 GRSQYRFDNQQKFDNHQNQNSN 717 GR Q RF N+ N QN + N Sbjct: 642 GRQQNRFSNKPSSSN-QNVSQN 662 >gb|PON54809.1| LOW QUALITY PROTEIN: Gag-Pol-related retrotransposon family protein [Trema orientalis] Length = 380 Score = 174 bits (440), Expect = 2e-47 Identities = 113/331 (34%), Positives = 173/331 (52%), Gaps = 7/331 (2%) Frame = +1 Query: 64 YNLEPFNGKTDFSIWQQKMKGILIQQKVYKAITNTYVDTDTKEIKAETDEY-------AY 222 +++E F GK DF +W+ KM+ IL+QQ + KA+ + + KE AE + AY Sbjct: 7 FDIEKFTGKNDFELWKMKMEAILVQQGLEKALLSEDLTATDKESLAEMKKKIEEVSPKAY 66 Query: 223 SSIILNLSDSVLRKVGKQKSAKELWDKLQELYTETSLPSXXXXXXXXXXXXXDLNKDVDE 402 S+IIL+LSD VLRKV ++K +W KL+ELY +LP D +K ++E Sbjct: 67 SAIILSLSDQVLRKVLREKIISGIWIKLEELYRAKTLPGRIYLKERFFGFKMDKSKSIEE 126 Query: 403 NLDVFTKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIKYGRDSITLDTVVNGLK 582 NLD +TKL+ D++ G K D I+LLN++P + + K +KYGR +IT+D V N L+ Sbjct: 127 NLDDYTKLVLDLENLGIKVDDKDKAIILLNSLPRNLKNFKETLKYGRQTITVDEVQNALE 186 Query: 583 SKELDLKVNKGGRQNSGEVMHVRGRSQYRFDNQQKFDNHQNQNSNXXXXXXXXXXXXXXX 762 SK LD+K ++ Q GE +H+RGR+ K DNH + + Sbjct: 187 SKLLDMKGSEKNAQ--GEGLHIRGRT-------TKQDNHDGKGKSQSRSKSRGKKDYSKV 237 Query: 763 XXXCYNCGEVGHYIRECPNKKGNQNQSNDQANLASTSENAGDIFMVTGICDVPIVNSVHS 942 Y+C + GH R P + Q+ D L GD +V + V S+ S Sbjct: 238 KY--YHCNKNGHIRRLRP-----ERQNKDAGKL------DGDAVIVDDGYESSEVLSI-S 283 Query: 943 STVCENEWLIDSACTFHMSPFKNLFSNYKEM 1035 + EW++DS C++HM P ++ F +Y+E+ Sbjct: 284 ESENSKEWVMDSGCSYHMCPREDWFMDYQEV 314 >ref|XP_022895327.1| uncharacterized protein LOC111409515 [Olea europaea var. sylvestris] Length = 370 Score = 162 bits (409), Expect = 7e-43 Identities = 83/204 (40%), Positives = 133/204 (65%) Frame = +1 Query: 49 MAMSGYNLEPFNGKTDFSIWQQKMKGILIQQKVYKAITNTYVDTDTKEIKAETDEYAYSS 228 MA + + + ++G+ DF+IW QKM+ ILIQ K KA+ NT+ + K E +E A+S+ Sbjct: 1 MATAKFEMSMYDGRWDFNIWSQKMRTILIQMKCAKALDNTWPAEMSAGKKTELEEIAWST 60 Query: 229 IILNLSDSVLRKVGKQKSAKELWDKLQELYTETSLPSXXXXXXXXXXXXXDLNKDVDENL 408 I L +SDSV+R +G+ K+A+ELW KL+ Y ++P+ D + D+DENL Sbjct: 61 IFLYISDSVIRTIGETKTAEELWTKLKAQYEPKTIPNKCFLLKQFFSFKMDPSVDLDENL 120 Query: 409 DVFTKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIKYGRDSITLDTVVNGLKSK 588 D FTKL QD+ +K ++D +VLLN+I + Y D+K+A+KYGRD++T+D ++N L++K Sbjct: 121 DRFTKLTQDLANCDEKLLEDQLAVVLLNSISDRYRDLKNALKYGRDNLTIDIIINTLRNK 180 Query: 589 ELDLKVNKGGRQNSGEVMHVRGRS 660 L+LK + Q SGE + ++G++ Sbjct: 181 VLELKSDSINHQ-SGENLLLKGKN 203 >gb|KZV56298.1| hypothetical protein F511_00295 [Dorcoceras hygrometricum] Length = 1309 Score = 165 bits (418), Expect = 2e-41 Identities = 102/338 (30%), Positives = 177/338 (52%), Gaps = 4/338 (1%) Frame = +1 Query: 49 MAMSGYNLEPFNGKTDFSIWQQKMKGILIQQKVYKAIT-NTYVDTDTKEIKAETDEYAYS 225 M+ + ++LE F G DFS+W+ KMK +L+ + A+ DT K+ ETD A+S Sbjct: 1 MSTTKFDLEKFTGSNDFSLWRIKMKALLVHTGLGGALNPEPQDDTIDKKKIVETDSKAFS 60 Query: 226 SIILNLSDSVLRKVGKQKSAKELWDKLQELYTETSLPSXXXXXXXXXXXXXDLNKDVDEN 405 +I+L L D VLR+V ++ SA LW+KL+ LY + SL + + KD+ ++ Sbjct: 61 AILLCLGDEVLREVAEEVSALSLWNKLESLYLKRSLANRLYLKKSLYTIHLEEGKDLKKH 120 Query: 406 LDVFTKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIKYGRDSITLDTVVNGLKS 585 +D F K+I D+K K D+ I++L+++P SY + YG++++T+ V + L S Sbjct: 121 MDEFNKIILDLKNVDIKITDEDCAILMLSSLPRSYEHFVDTMLYGKETLTMAEVKSALNS 180 Query: 586 KELDLKVNKGGRQNSGEVMHVRGRSQYRFDNQQKFDNHQNQNSNXXXXXXXXXXXXXXXX 765 KEL K N+ +++GE ++VRGR+ R +K H++Q+ Sbjct: 181 KELH-KKNETKMESTGEGLNVRGRTYKRESRNEKGGKHRSQSRT-------------RGK 226 Query: 766 XXCYNCGEVGHYIRECPNKKG-NQNQSNDQANLASTSE--NAGDIFMVTGICDVPIVNSV 936 C+ C + GH+ ++CP+++ N + D + A S+ + ++ +V Sbjct: 227 LKCFVCHKEGHFKKDCPDRRARNPERRKDPGDAAVVSDGYESAEVLVV------------ 274 Query: 937 HSSTVCENEWLIDSACTFHMSPFKNLFSNYKEMKNSFV 1050 S T ++ W++DS C+FHM P K+ F N E ++ V Sbjct: 275 -SRTNKQDCWVMDSGCSFHMCPIKSWFQNLVEEESGHV 311 >gb|PON95322.1| hypothetical protein TorRG33x02_088440, partial [Trema orientalis] Length = 240 Score = 152 bits (384), Expect = 1e-40 Identities = 90/248 (36%), Positives = 143/248 (57%), Gaps = 3/248 (1%) Frame = +1 Query: 64 YNLEPFNGKTDFSIWQQKMKGILIQQKVYKAITN--TYVDTDTKEIKAETDEYAYSSIIL 237 + L PF+G DFS W++KMK +L+Q K++KA+ + T T T + E E AYS IIL Sbjct: 6 FELSPFDGSGDFSSWRKKMKALLVQHKLHKALEDPTTLPTTMTDVQRLELQENAYSIIIL 65 Query: 238 NLSDSVLRKVGKQKSAKELWDKLQELYTETSLPSXXXXXXXXXXXXXDLNKDVDENLDVF 417 L+D+VLR++ + +A + W+KL++LY SL + D NK++++NLD F Sbjct: 66 YLADNVLRQIDGEDTAFKAWNKLEQLYLTKSLTNRILLKEKFFGFRMDTNKNLEQNLDDF 125 Query: 418 TKLIQDIKLTGDKNI-DDYTPIVLLNAIPESYSDVKSAIKYGRDSITLDTVVNGLKSKEL 594 K+ + ++ I D+ I+LLN++P+SY +VK+AIK+GR SITLD V L+S EL Sbjct: 126 KKIAITLASIDEEKIGDESQAIILLNSLPDSYREVKAAIKFGRKSITLDEVTAALRSWEL 185 Query: 595 DLKVNKGGRQNSGEVMHVRGRSQYRFDNQQKFDNHQNQNSNXXXXXXXXXXXXXXXXXXC 774 ++K ++ +GE ++VRGRS+ R + +++N + C Sbjct: 186 EMK-SEAKSSGNGESLNVRGRSKDRNSKGRGKSRSKSKNGS--------------KSFKC 230 Query: 775 YNCGEVGH 798 Y+C E GH Sbjct: 231 YHCHEEGH 238 >gb|KYP67041.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 325 Score = 154 bits (390), Expect = 1e-40 Identities = 100/343 (29%), Positives = 172/343 (50%), Gaps = 8/343 (2%) Frame = +1 Query: 49 MAMSGYNLEPFNGKTDFSIWQQKMKGILIQQKVYKAI--TNTYVDTDTKEIKAETDEYAY 222 M+ ++++ FNG DF++W+ KMK +L+ Q A+ + T T ++K E A+ Sbjct: 1 MSSYKFDIDKFNGSNDFTLWKLKMKAVLVHQGCAAALEGADKLPTTMTDDVKKAMLEKAH 60 Query: 223 SSIILNLSDSVLRKVGKQKSAKELWDKLQELYTETSLPSXXXXXXXXXXXXXDLNKDVDE 402 S I+L+L+D VLR+VG++ +A +W L++ + + SL + N V + Sbjct: 61 SLILLSLTDEVLREVGEETTAAGMWKMLEDKFQKKSLTNRLYQKQRLYTLQMSENMSVRD 120 Query: 403 NLDVFTKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIKYGRDSITLDTVVNGLK 582 +LD F ++I D++ G K D+ I+LL ++P+SY + + YGRDSITL+ V + L+ Sbjct: 121 HLDNFNRIILDLQSIGVKVDDEDLAIILLCSLPKSYENFIDTMLYGRDSITLNNVKDSLQ 180 Query: 583 SKELDLKVNKGGRQNSGEVMHVRGRSQYRFDNQQKFDNHQNQNSNXXXXXXXXXXXXXXX 762 SK+L +V + + RGRS R ++ + H NS Sbjct: 181 SKKLKRRVVSSSNVDDVGLTVSRGRSMERGNSSK---GHTRSNS-----------LSKSK 226 Query: 763 XXXCYNCGEVGHYIRECPNKKGNQNQS------NDQANLASTSENAGDIFMVTGICDVPI 924 CY C EVGH + CP K N+N + A ++S S + GD V + + Sbjct: 227 KVRCYKCKEVGHIRKNCPQLKKNRNSNASAAVVRSSATVSSESSDEGDGGDVLTVSTIGF 286 Query: 925 VNSVHSSTVCENEWLIDSACTFHMSPFKNLFSNYKEMKNSFVS 1053 ++ W+ID+ ++HM+ + LF+++KEM + + S Sbjct: 287 ADT----------WVIDTGASYHMTFNRKLFNSFKEMGHEYWS 319 >gb|PON64464.1| hypothetical protein TorRG33x02_273130 [Trema orientalis] Length = 225 Score = 150 bits (380), Expect = 3e-40 Identities = 81/209 (38%), Positives = 129/209 (61%), Gaps = 2/209 (0%) Frame = +1 Query: 49 MAMSGYNLEPFNGKTDFSIWQQKMKGILIQQKVYKAITNT--YVDTDTKEIKAETDEYAY 222 M S +E F+GK DF++W++KMK +L+QQK K + + +T K E E AY Sbjct: 1 MGTSKIEIEKFDGKGDFNMWKKKMKAVLVQQKCAKVLGDASGLPETMKPSEKEELLETAY 60 Query: 223 SSIILNLSDSVLRKVGKQKSAKELWDKLQELYTETSLPSXXXXXXXXXXXXXDLNKDVDE 402 S +ILNL+D+VLR+V +Q +A ++W KL LY +L + D K +++ Sbjct: 61 SLLILNLADNVLRQVDEQDTAAKVWSKLDSLYLTKTLSNKIYLKEQLFGFKMDSTKSLED 120 Query: 403 NLDVFTKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIKYGRDSITLDTVVNGLK 582 NLD F ++ + +K D+ I++LN++PESY D+KS IKYGR+S++LD V+ L+ Sbjct: 121 NLDDFKRITVSLANIDEKINDENQAIIILNSLPESYKDLKSTIKYGRESLSLDDVLRALR 180 Query: 583 SKELDLKVNKGGRQNSGEVMHVRGRSQYR 669 S +L++K+ K +++GE + VRGR+Q R Sbjct: 181 SHDLEVKIEK---RSNGEGLQVRGRTQKR 206 >gb|PKU72844.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Dendrobium catenatum] Length = 993 Score = 160 bits (404), Expect = 1e-39 Identities = 101/284 (35%), Positives = 154/284 (54%), Gaps = 10/284 (3%) Frame = +1 Query: 115 KMKGILIQQKVYKAIT--NTYVDTDTKEIKAETDEYAYSSIILNLSDSVLRKVGKQKSAK 288 K++ ILIQQ V KA+ + T + + K + A+SSIIL L+D VLRKV K+ Sbjct: 2 KLEAILIQQGVEKALLPESELPSTMSDQEKLSIQKKAFSSIILCLADQVLRKVSHVKTVS 61 Query: 289 ELWDKLQELYTETSLPSXXXXXXXXXXXXXDLNKDVDENLDVFTKLIQDIKLTGDKNIDD 468 ELW KL+ELY + +LP+ D K +D+NLD F KLI D++ K D+ Sbjct: 62 ELWKKLEELYRQKTLPNRIYLKEKFFGYKMDEAKSIDDNLDEFNKLILDLENLEVKIEDE 121 Query: 469 YTPIVLLNAIPESYSDVKSAIKYGRDSITLDTVVNGLKSKELDLKVNKGGRQNSGEVMHV 648 I+LLN++P+S + K +KYGR++IT+D V N L SK LD+K+++ + +SGE +HV Sbjct: 122 DKAIILLNSLPKSLRNFKETLKYGRETITVDEVQNALSSKILDMKISE--KNHSGEGLHV 179 Query: 649 RGRSQYRFDNQQKFDNHQNQNSNXXXXXXXXXXXXXXXXXXCYNCGEVGHYIRECPNKK- 825 RGRSQ R +Q+K+ + S C+ C + GH R CP K Sbjct: 180 RGRSQKRGTSQKKWKSKSRSKS---------ASKKDYKNVKCWQCNKTGHIRRFCPEKNP 230 Query: 826 GNQNQSNDQANLASTSENAGDIFMVTGI-------CDVPIVNSV 936 +++QS A + + ++ D+ V+ + CDV + S+ Sbjct: 231 KDKSQSQGDAAIVGENYDSADVLNVSDLLLGNNKACDVVGIGSI 274 >gb|PPR84446.1| hypothetical protein GOBAR_AA36262 [Gossypium barbadense] Length = 1841 Score = 159 bits (402), Expect = 2e-39 Identities = 105/333 (31%), Positives = 166/333 (49%), Gaps = 6/333 (1%) Frame = +1 Query: 64 YNLEPFNGKTDFSIWQQKMKGILIQQKVYKAIT--NTYVDTDTKEIKAETDEYAYSSIIL 237 Y++E F GK FS+W+ KM+ +L+QQ + KA++ + T ++E K + E A+S+I+L Sbjct: 530 YDVEKFTGKNSFSLWRIKMRAVLVQQGLLKALSGKDKLPSTLSEEQKDDMLERAHSAILL 589 Query: 238 NLSDSVLRKVGKQKSAKELWDKLQELYTETSLPSXXXXXXXXXXXXXDLNKDVDENLDVF 417 L D VLR+V +K+A LW +L+ Y SL + + V ++LD F Sbjct: 590 CLGDEVLREVADEKTASGLWLRLESKYMTKSLTNRLYLKQRLYALKMEEGTPVSQHLDKF 649 Query: 418 TKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIKYGRDSITLDTVVNGLKSKELD 597 +I D+ +K D+ I++L ++P SY + + YGRD +TL+ V N L S EL Sbjct: 650 NSIIMDLNNIDNKIDDEDQAIIVLCSLPPSYENFVDTMMYGRDDLTLEEVKNALSSSELR 709 Query: 598 LKV-NKGGRQNSGEVMHVRGRSQYRFDNQQKFDNHQNQNSNXXXXXXXXXXXXXXXXXXC 774 K+ K N GE + RGRS+ + + K +H S C Sbjct: 710 KKITGKVVENNEGEGLVARGRSKAKGGSSSK--SHPRSQSK--------------KRIQC 753 Query: 775 YNCGEVGHYIRECPNKK---GNQNQSNDQANLASTSENAGDIFMVTGICDVPIVNSVHSS 945 Y C + GH +CP +K +Q Q ND+AN+A ++ D IV +V S Sbjct: 754 YYCKKYGHMKVDCPKRKEKSESQEQQNDRANVADADSSS----------DAEIVLAV-SD 802 Query: 946 TVCENEWLIDSACTFHMSPFKNLFSNYKEMKNS 1044 + W++D+ TFH+S K+ FS Y++ S Sbjct: 803 SYAGGRWILDTGATFHISTSKDAFSTYEKHSGS 835 >gb|PPS20553.1| hypothetical protein GOBAR_AA00016 [Gossypium barbadense] Length = 2351 Score = 159 bits (402), Expect = 2e-39 Identities = 105/333 (31%), Positives = 166/333 (49%), Gaps = 6/333 (1%) Frame = +1 Query: 64 YNLEPFNGKTDFSIWQQKMKGILIQQKVYKAIT--NTYVDTDTKEIKAETDEYAYSSIIL 237 Y++E F GK FS+W+ KM+ +L+QQ + KA++ + T ++E K + E A+S+I+L Sbjct: 509 YDVEKFTGKNSFSLWRIKMRAVLVQQGLLKALSGKDKLPSTLSEEQKDDMLERAHSAILL 568 Query: 238 NLSDSVLRKVGKQKSAKELWDKLQELYTETSLPSXXXXXXXXXXXXXDLNKDVDENLDVF 417 L D VLR+V +K+A LW +L+ Y SL + + V ++LD F Sbjct: 569 CLGDEVLREVADEKTASGLWLRLESKYMTKSLTNRLYLKQRLYALKMEEGTPVSQHLDKF 628 Query: 418 TKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIKYGRDSITLDTVVNGLKSKELD 597 +I D+ +K D+ I++L ++P SY + + YGRD +TL+ V N L S EL Sbjct: 629 NSIIMDLNNIDNKIDDEDQAIIVLCSLPPSYENFVDTMMYGRDDLTLEEVKNALSSSELR 688 Query: 598 LKV-NKGGRQNSGEVMHVRGRSQYRFDNQQKFDNHQNQNSNXXXXXXXXXXXXXXXXXXC 774 K+ K N GE + RGRS+ + + K +H S C Sbjct: 689 KKITGKVVENNEGEGLVARGRSKAKGGSSSK--SHPRSQSK--------------KRIQC 732 Query: 775 YNCGEVGHYIRECPNKK---GNQNQSNDQANLASTSENAGDIFMVTGICDVPIVNSVHSS 945 Y C + GH +CP +K +Q Q ND+AN+A ++ D IV +V S Sbjct: 733 YYCKKYGHMKVDCPKRKEKSESQEQQNDRANVADADSSS----------DAEIVLAV-SD 781 Query: 946 TVCENEWLIDSACTFHMSPFKNLFSNYKEMKNS 1044 + W++D+ TFH+S K+ FS Y++ S Sbjct: 782 SYAGGRWILDTGATFHISTSKDAFSTYEKHSGS 814 >ref|XP_022881005.1| uncharacterized protein LOC111398320 [Olea europaea var. sylvestris] Length = 200 Score = 147 bits (371), Expect = 3e-39 Identities = 70/185 (37%), Positives = 117/185 (63%) Frame = +1 Query: 49 MAMSGYNLEPFNGKTDFSIWQQKMKGILIQQKVYKAITNTYVDTDTKEIKAETDEYAYSS 228 MA + + + F+GK D+++W QKM IL+Q + +A+ +T+ KAE +E A+S+ Sbjct: 1 MATTKFEVARFDGKIDYNMWSQKMNAILMQMRCVRALDDTWPQDMNPTRKAELEEIAWST 60 Query: 229 IILNLSDSVLRKVGKQKSAKELWDKLQELYTETSLPSXXXXXXXXXXXXXDLNKDVDENL 408 I L LS++V+R +G+ K+A ELW KL++ Y ++P+ D + ++DENL Sbjct: 61 IFLYLSENVIRTIGETKTASELWTKLEKQYVTKTIPNKCYMLKQLFSFKMDPSTNLDENL 120 Query: 409 DVFTKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIKYGRDSITLDTVVNGLKSK 588 + FTKLIQ++ +K D ++LLN+I E Y D+K A++YGRD +T + ++N LK+K Sbjct: 121 NTFTKLIQNLNNCDEKLSQDQLAVILLNSISERYKDIKVALEYGRDELTTEIIINALKNK 180 Query: 589 ELDLK 603 L++K Sbjct: 181 ALEIK 185 >gb|KHN13665.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Glycine soja] Length = 337 Score = 146 bits (369), Expect = 2e-37 Identities = 94/328 (28%), Positives = 164/328 (50%), Gaps = 4/328 (1%) Frame = +1 Query: 64 YNLEPFNGKTDFSIWQQKMKGILIQQKVYKAITNTYVDTDTKEIKAETD--EYAYSSIIL 237 +++E F G+ DFS+ + KM+ +L+ Q + A+ T K + D A+S+IIL Sbjct: 5 FDVEKFTGENDFSLRRIKMQALLVHQGLDDALQGASKLPSTLSDKEKKDLLSKAHSTIIL 64 Query: 238 NLSDSVLRKVGKQKSAKELWDKLQELYTETSLPSXXXXXXXXXXXXXDLNKDVDENLDVF 417 +L D VLR+V ++KSA +W KL+ LY SL + + + E++ +F Sbjct: 65 SLGDEVLREVAEEKSAAGIWLKLESLYMTKSLTNKLYLKKRLHQLKMEEGSSIKEHVSLF 124 Query: 418 TKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIKYGRDSITLDTVVNGLKSKELD 597 TK + D+K + ++ ++LL ++P S+ ++ + +GRD++TL+ V L S+EL Sbjct: 125 TKAVLDLKSVDVRIDEEDQAVMLLCSLPSSFENLVDTMLFGRDTLTLEEVKATLNSRELK 184 Query: 598 LKVNKG-GRQNSGEVMHVRGRSQYRFDNQQKFDNHQNQNSNXXXXXXXXXXXXXXXXXXC 774 K+ + G E + RGR + R + + +N C Sbjct: 185 KKITENKGEGGDPEALMARGRLEKRDSKSKNKRRSKYKNEK-----------------AC 227 Query: 775 YNCGEVGHYIRECP-NKKGNQNQSNDQANLASTSENAGDIFMVTGICDVPIVNSVHSSTV 951 Y C + GH+ +ECP KK N + ND++++A ++ +++ I HS Sbjct: 228 YYCKKEGHFRKECPERKKKNNGKYNDESDIAVVADGYESAEVLS------ISTKKHS--- 278 Query: 952 CENEWLIDSACTFHMSPFKNLFSNYKEM 1035 EW++DS C+FHM+P FS+YKE+ Sbjct: 279 --EEWILDSGCSFHMTPNLEWFSSYKEI 304 >emb|CAN84102.1| hypothetical protein VITISV_041248 [Vitis vinifera] Length = 894 Score = 150 bits (378), Expect = 3e-36 Identities = 101/337 (29%), Positives = 174/337 (51%), Gaps = 8/337 (2%) Frame = +1 Query: 64 YNLEPFNGKTDFSIWQQKMKGILIQQKVYKAITN--TYVDTDTKEIKAETDEYAYSSIIL 237 +++E F GK DF +W+ KM+ +L+QQ + A+ T ++ K E E A+ +IIL Sbjct: 6 FDVEKFTGKNDFGLWRLKMRALLVQQGLQDALLGEKNLPXTMQEKHKIELLEKAHGAIIL 65 Query: 238 NLSDSVLRKVGKQKSAKELWDKLQELYTETSLPSXXXXXXXXXXXXXDLNKDVDENLDVF 417 +L D+ LR+V K KSA +L KL+ LY SL + + ++E+LD F Sbjct: 66 SLGDTXLREVAKAKSAAKLLLKLESLYMTKSLANRLHKXIKLYTFKMTPSMSIEEHLDHF 125 Query: 418 TKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIKYGRDSITLDTVVNGLKSKELD 597 K+I D+K ++ I+LL ++ SY+++K AI YGRD +T D V + L ++EL Sbjct: 126 NKIILDLKNIDIAVSNEDKAILLLTSLDASYTNMKEAIMYGRDILTFDEVQSILHARELH 185 Query: 598 LKVNKGGRQNSGEVMHVRGRSQYRFDNQQKFDNHQNQNSNXXXXXXXXXXXXXXXXXXCY 777 + + ++ GE +++RG+S+ R ++K +N ++++ + C+ Sbjct: 186 KQ--EESKEELGEGLNIRGKSKKR--EKKKGNNSKSRSKS------------KTKKFKCF 229 Query: 778 NCGEVGHYIRECPNKKGNQNQSNDQANLASTSENAGDIFMV------TGICDVPIVNSVH 939 C + GH+ ++CP+ + N + + N GD M+ G+ +V V+S Sbjct: 230 ICHKEGHFKKDCPDMRQNTXKK---------TMNEGDATMILDGYDNAGVLNVAEVDS-- 278 Query: 940 SSTVCENEWLIDSACTFHMSPFKNLFSNYKEMKNSFV 1050 EW++DS C+FHM P K F ++KE V Sbjct: 279 -----GKEWILDSGCSFHMCPIKAWFEDFKEANGGHV 310 >gb|KHN13199.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Glycine soja] Length = 344 Score = 142 bits (358), Expect = 1e-35 Identities = 90/334 (26%), Positives = 169/334 (50%), Gaps = 5/334 (1%) Frame = +1 Query: 49 MAMSGYNLEPFNGKTDFSIWQQKMKGILIQQKVYKAITNTYVDTD--TKEIKAETDEYAY 222 M+ + +++E F GK +F++W+ KM +L QQ+ A+ + T K + AY Sbjct: 1 MSSTKFDVEKFTGKNNFNLWRVKMLALLTQQECELALEGEEMLPAELTAAQKRVIMKKAY 60 Query: 223 SSIILNLSDSVLRKVGKQKSAKELWDKLQELYTETSLPSXXXXXXXXXXXXXDLNKDVDE 402 S+I+L+L D VL ++ +K+A +LW KL+ Y SL + + + + Sbjct: 61 SAILLSLGDEVLGEISGEKTADKLWAKLESRYMTKSLHNRLCLKKQLYTMQMHEGESIHK 120 Query: 403 NLDVFTKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIKYGRDSITLDTVVNGLK 582 ++D F +++ +K D+ ++LL+++P +Y + I +GR S++++ V L+ Sbjct: 121 HIDNFNQVVLSLKNIDVAVDDEDQAVLLLSSLPRAYDNFVDTIIFGRSSLSMEEVKTALQ 180 Query: 583 SKELDLKVNKG-GRQNSGEVMHVRGRSQYRFDNQQKFDNHQNQNSNXXXXXXXXXXXXXX 759 S EL ++ G +SGE + VRGR R Q++ +++N N Sbjct: 181 SWELKRRITDSYGGTSSGEGLMVRGRMDERKSFQRRRSKSRSKNKNNNK----------- 229 Query: 760 XXXXCYNCGEVGHYIRECPNKKGNQNQSNDQANLASTSENA--GDIFMVTGICDVPIVNS 933 C+NC + GH+ R CP K ++ +++ A SE + G++ V Sbjct: 230 ----CHNCQKEGHWKRNCPELKKDKVSTSEFGGAAVVSEESDGGNVLFV----------- 274 Query: 934 VHSSTVCENEWLIDSACTFHMSPFKNLFSNYKEM 1035 SS V +++W++DSACTFHM+P ++ F+ ++ + Sbjct: 275 --SSNVNDDDWILDSACTFHMTPNRDWFATFQNV 306 >gb|KHN13198.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Glycine soja] Length = 341 Score = 141 bits (356), Expect = 2e-35 Identities = 90/329 (27%), Positives = 166/329 (50%), Gaps = 5/329 (1%) Frame = +1 Query: 64 YNLEPFNGKTDFSIWQQKMKGILIQQKVYKAITNTYV--DTDTKEIKAETDEYAYSSIIL 237 +++E F GK +F++W+ KM +L QQ+ A+ + T K + AYS+I+L Sbjct: 3 FDVEKFTGKNNFNLWRVKMLALLTQQECELALEGEEMLPAEMTAAQKRVIMKKAYSAILL 62 Query: 238 NLSDSVLRKVGKQKSAKELWDKLQELYTETSLPSXXXXXXXXXXXXXDLNKDVDENLDVF 417 +L D VL +V +K+A +LW KL+ Y SL + + + +++D F Sbjct: 63 SLGDEVLGEVSGEKTADKLWAKLESRYMTKSLHNRLCLKKQLYTMQMHEGESIHKHIDNF 122 Query: 418 TKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIKYGRDSITLDTVVNGLKSKELD 597 +++ +K D+ ++LL+++P +Y + I +GR S++++ V L+S EL Sbjct: 123 NQVVLSLKNIDVAVDDEDQAVLLLSSLPRAYDNFVDTIIFGRSSLSMEEVKTALQSWELK 182 Query: 598 LKVNKG-GRQNSGEVMHVRGRSQYRFDNQQKFDNHQNQNSNXXXXXXXXXXXXXXXXXXC 774 ++ G +SGE + VRGR R Q++ +++N N C Sbjct: 183 RRITDSYGGTSSGEGLMVRGRMDERKSFQRRRSKSRSKNKNNNK---------------C 227 Query: 775 YNCGEVGHYIRECPNKKGNQNQSNDQANLASTSENA--GDIFMVTGICDVPIVNSVHSST 948 +NC + GH+ R CP K ++ +++ A SE + G++ V SS Sbjct: 228 HNCQKEGHWKRNCPELKKDKVSTSEFGGAAVVSEESDGGNVLFV-------------SSN 274 Query: 949 VCENEWLIDSACTFHMSPFKNLFSNYKEM 1035 V +++W++DSACTFHM+P ++ F+ ++ + Sbjct: 275 VNDDDWILDSACTFHMTPNRDWFATFQNV 303 >gb|KYP64673.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 780 Score = 147 bits (370), Expect = 3e-35 Identities = 95/332 (28%), Positives = 162/332 (48%), Gaps = 5/332 (1%) Frame = +1 Query: 49 MAMSGYNLEPFNGKTDFSIWQQKMKGILIQQKVYKAITNTYVDTDTKEIKAETD--EYAY 222 M + Y++E F+G+ DF +W+ KM+ ILIQQ +AI + + K +T+ E A Sbjct: 1 MGNTKYDIEKFSGENDFGLWRIKMEAILIQQGCAEAIKGEEKMSSSLTQKEKTNMIEKAR 60 Query: 223 SSIILNLSDSVLRKVGKQKSAKELWDKLQELYTETSLPSXXXXXXXXXXXXXDLNKDVDE 402 S+IIL L D LR+V ++K+A +W KL+ LY SL K + + Sbjct: 61 SAIILCLGDKALREVAREKTAAAMWLKLESLYMTKSLAHRLCLKQRLYSFKMTETKSIVD 120 Query: 403 NLDVFTKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIKYGRD-SITLDTVVNGL 579 L F K++ D++ + D+ ++LLN++P +Y K AI YG++ ITLD V + Sbjct: 121 QLAEFNKILDDLENIEVQLEDEDKALLLLNSLPRNYEHFKDAILYGKEQDITLDEVQTSI 180 Query: 580 KSKELDLKVNKGGRQNSGEVMHVRGRSQYRFDNQQKFDNHQNQNSNXXXXXXXXXXXXXX 759 ++KEL + + N + RGRS+ + +Q+ Sbjct: 181 RTKELQRQQDNKTDDNGESLNVSRGRSEKKGQSQKGKKARSKSK------------IGDR 228 Query: 760 XXXXCYNCGEVGHYIRECPNKKGNQNQSNDQANLASTSE--NAGDIFMVTGICDVPIVNS 933 C+ C +VGH+ + CP + +Q S D A++A+ S+ + D+ +VT Sbjct: 229 SKFKCFYCHKVGHFKKNCPERNRDQKSSADSADIAAISDGYESADVLVVT---------- 278 Query: 934 VHSSTVCENEWLIDSACTFHMSPFKNLFSNYK 1029 ++ + +W++DS C++HM P K+ F K Sbjct: 279 ---TSQTQKDWVMDSGCSYHMCPKKDYFETLK 307 >ref|XP_022855577.1| uncharacterized protein LOC111376806 [Olea europaea var. sylvestris] Length = 249 Score = 138 bits (348), Expect = 3e-35 Identities = 73/203 (35%), Positives = 125/203 (61%) Frame = +1 Query: 52 AMSGYNLEPFNGKTDFSIWQQKMKGILIQQKVYKAITNTYVDTDTKEIKAETDEYAYSSI 231 A + + + F+G+ DFS+W QKMK IL+QQ+ + + N++ + E +E A+SSI Sbjct: 3 ATAKFEMSMFDGRGDFSMWSQKMKVILMQQRCARVLDNSWPAELPPGRRTELEETAWSSI 62 Query: 232 ILNLSDSVLRKVGKQKSAKELWDKLQELYTETSLPSXXXXXXXXXXXXXDLNKDVDENLD 411 L LS++V+R +G+ K+ ELW+KL+ Y ++P+ D + D++ENL+ Sbjct: 63 FLYLSNNVIRTIGETKTTSELWNKLKAQYEPKTVPNKCFLLKQFFSFKIDPSIDLEENLN 122 Query: 412 VFTKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIKYGRDSITLDTVVNGLKSKE 591 FTKL QD+ +K D +VLLN+I + + D+K+A++YGR++ T D + N L+++ Sbjct: 123 RFTKLTQDLANCDEKLSQDQLAVVLLNSISDRHRDLKNALEYGRENFTTDIITNALRNEV 182 Query: 592 LDLKVNKGGRQNSGEVMHVRGRS 660 L+LK + Q SGE + +RG++ Sbjct: 183 LELK-SDSINQQSGENLLLRGKN 204