BLASTX nr result
ID: Ophiopogon23_contig00035171
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Ophiopogon23_contig00035171 (993 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 149,584,005 sequences; 54,822,741,787 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|KYP39598.1| Retrovirus-related Pol polyprotein from transposo... 105 2e-30 gb|KYP67484.1| Retrovirus-related Pol polyprotein from transposo... 99 2e-28 emb|CDP17738.1| unnamed protein product [Coffea canephora] 99 4e-27 ref|XP_022931734.1| uncharacterized protein LOC111437896 [Cucurb... 97 4e-24 gb|AAR13295.1| retrovirus-related pol polyprotein, partial [Phas... 78 1e-21 gb|PNX71218.1| putative retrotransposon Ty1-copia subclass prote... 90 1e-19 gb|KYP53542.1| Retrovirus-related Pol polyprotein from transposo... 95 1e-19 gb|ONK68196.1| uncharacterized protein A4U43_C05F8640 [Asparagus... 100 2e-19 sp|P10978.1|POLX_TOBAC RecName: Full=Retrovirus-related Pol poly... 88 6e-19 gb|KYP77941.1| Retrovirus-related Pol polyprotein from transposo... 94 2e-18 gb|PPY92954.1| hypothetical protein C5P31_25600, partial [Escher... 71 4e-18 ref|XP_012070606.1| uncharacterized protein LOC105632772 [Jatrop... 95 5e-18 gb|KHN13198.1| Retrovirus-related Pol polyprotein from transposo... 71 5e-18 gb|ESQ46603.1| hypothetical protein EUTSA_v10000703mg, partial [... 89 8e-18 gb|KHN13199.1| Retrovirus-related Pol polyprotein from transposo... 70 1e-17 gb|PPS20553.1| hypothetical protein GOBAR_AA00016 [Gossypium bar... 78 1e-17 gb|PPR84446.1| hypothetical protein GOBAR_AA36262 [Gossypium bar... 78 1e-17 gb|PON41859.1| hypothetical protein PanWU01x14_286250 [Parasponi... 77 2e-17 gb|ABA96825.1| retrotransposon protein, putative, Ty1-copia subc... 91 2e-16 ref|XP_020266969.1| uncharacterized protein LOC109842514 [Aspara... 85 2e-16 >gb|KYP39598.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 350 Score = 105 bits (262), Expect(2) = 2e-30 Identities = 54/132 (40%), Positives = 83/132 (62%), Gaps = 9/132 (6%) Frame = +2 Query: 50 FNGKTNFNTWRCEVYDALLAQGLKKTIDEETKPAEMSENNWIRMNKMACRLMRSCLTQDI 229 F+G +F W+ EV DAL QGL TI E KP M E +W +N++AC +RSCL+++ Sbjct: 5 FDGSGHFGMWQSEVLDALFQQGLDITI-EGKKPESMGEEDWKTLNRLACGTIRSCLSREQ 63 Query: 230 KY---------RMITEMPAKYRMKSIENRLHLKWCLYRFQLQRSTSIDNHINAYTKLLAK 382 KY +++TE+ K+ KS +N+LH+K L+RF ++++HI ++ +L+A Sbjct: 64 KYAFCKETSASKLMTELEEKFLKKSSQNKLHMKKRLFRFTYIPGATMNDHITSFNQLVAD 123 Query: 383 LLNVDVDIEDED 418 LLN+DV EDED Sbjct: 124 LLNLDVTFEDED 135 Score = 56.6 bits (135), Expect(2) = 2e-30 Identities = 31/101 (30%), Positives = 47/101 (46%) Frame = +3 Query: 426 ILLRSLLDENYKTFMPTLINGRASLSHEEVTAALTNHELRMKDKQSSRNTPGEALSEXXX 605 ++L L E ++ TL++G+ ++S EV AL ++ELR KDK+ S EAL Sbjct: 138 LMLLGSLPEEFEFLETTLLHGKVAVSLNEVCGALYSYELRRKDKKDSSEKANEAL----V 193 Query: 606 XXXXXXXXXXXXXXXXXXXXXIAKDRCRRCKKHGHWEDDCP 728 + KD C C++ GHW+ DCP Sbjct: 194 ARGRPVIQAKGKKKRSKSKAKVGKDECAFCREKGHWKKDCP 234 >gb|KYP67484.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 561 Score = 99.0 bits (245), Expect(2) = 2e-28 Identities = 51/123 (41%), Positives = 78/123 (63%), Gaps = 9/123 (7%) Frame = +2 Query: 77 WRCEVYDALLAQGLKKTIDEETKPAEMSENNWIRMNKMACRLMRSCLTQDIKY------- 235 W+ EV DAL QGL TI E KP M E +W +N++AC +RSCL+++ KY Sbjct: 2 WQSEVLDALFQQGLDITI-EGKKPESMGEEDWKTLNRLACGTIRSCLSREQKYAFCKETS 60 Query: 236 --RMITEMPAKYRMKSIENRLHLKWCLYRFQLQRSTSIDNHINAYTKLLAKLLNVDVDIE 409 +++TE+ K+ KS +N+LH+K L+RF ++++HI ++ +L+A LLN+DV E Sbjct: 61 ASKLMTELEEKFLKKSSQNKLHMKKRLFRFTYIPGATMNDHITSFNQLVADLLNLDVTFE 120 Query: 410 DED 418 DED Sbjct: 121 DED 123 Score = 56.6 bits (135), Expect(2) = 2e-28 Identities = 31/101 (30%), Positives = 47/101 (46%) Frame = +3 Query: 426 ILLRSLLDENYKTFMPTLINGRASLSHEEVTAALTNHELRMKDKQSSRNTPGEALSEXXX 605 ++L L E ++ TL++G+ ++S EV AL ++ELR KDK+ S EAL Sbjct: 126 LMLLGSLPEEFEFLETTLLHGKVAVSLNEVCGALYSYELRRKDKKDSSEKANEAL----V 181 Query: 606 XXXXXXXXXXXXXXXXXXXXXIAKDRCRRCKKHGHWEDDCP 728 + KD C C++ GHW+ DCP Sbjct: 182 ARGRPVIQAKGKKKRSKSKAKVGKDECAFCREKGHWKKDCP 222 >emb|CDP17738.1| unnamed protein product [Coffea canephora] Length = 798 Score = 98.6 bits (244), Expect(2) = 4e-27 Identities = 54/138 (39%), Positives = 81/138 (58%), Gaps = 9/138 (6%) Frame = +2 Query: 32 KIEVVKFNGKTNFNTWRCEVYDALLAQGLKKTIDEETKPAEMSENNWIRMNKMACRLMRS 211 K+ V F+G +F W+ EV D+L QGL I EE K ++ E W +N++AC +RS Sbjct: 34 KVAVETFDGTGHFGMWQGEVMDSLFQQGLDIAI-EEKKSDDIEEKEWSTINRLACGTIRS 92 Query: 212 CLTQDIKYRMITEMPA---------KYRMKSIENRLHLKWCLYRFQLQRSTSIDNHINAY 364 CL+++ KY E A K+ KS +N+L +K L+RF Q T+++ HI + Sbjct: 93 CLSKEQKYAFKNETSAWKLWKALEGKFLKKSGQNKLLMKKILFRFDYQPGTTMNEHITIF 152 Query: 365 TKLLAKLLNVDVDIEDED 418 +L+A LLN+DV+ EDED Sbjct: 153 NQLVADLLNLDVNFEDED 170 Score = 52.8 bits (125), Expect(2) = 4e-27 Identities = 32/102 (31%), Positives = 50/102 (49%) Frame = +3 Query: 423 LILLRSLLDENYKTFMPTLINGRASLSHEEVTAALTNHELRMKDKQSSRNTPGEALSEXX 602 L+LL SL DE ++ TL++G+ ++S + V +AL + ELR +DK+ + A E Sbjct: 173 LMLLSSLPDE-FEHLETTLLHGKENVSLDAVCSALYSRELRKQDKKKKKVA---AADEAL 228 Query: 603 XXXXXXXXXXXXXXXXXXXXXXIAKDRCRRCKKHGHWEDDCP 728 +AKD C C++ GHW+ DCP Sbjct: 229 VARGRQQSQSKGRRGWSKSISRVAKDECAFCREKGHWKKDCP 270 >ref|XP_022931734.1| uncharacterized protein LOC111437896 [Cucurbita moschata] Length = 1893 Score = 96.7 bits (239), Expect(2) = 4e-24 Identities = 53/140 (37%), Positives = 82/140 (58%), Gaps = 9/140 (6%) Frame = +2 Query: 29 AKIEVVKFNGKTNFNTWRCEVYDALLAQGLKKTIDEETKPAEMSENNWIRMNKMACRLMR 208 + +EV KFNG NF WR EV D L+ Q L T+ +E +M+E W ++N A ++R Sbjct: 1489 SNLEVEKFNGTNNFGVWRGEVSDLLVMQDLDATL-QEVMLEDMTEAEWTKLNWQASGIIR 1547 Query: 209 SCLTQDIKYRMIT---------EMPAKYRMKSIENRLHLKWCLYRFQLQRSTSIDNHINA 361 SCL +D KY + ++ AKY KS+EN+L+LK L+RF + S+ +H++ Sbjct: 1548 SCLGKDQKYPFMKVTMAKELWDKLEAKYMQKSVENKLYLKKKLFRFYYKEGISMADHLDD 1607 Query: 362 YTKLLAKLLNVDVDIEDEDK 421 + K++ L+N+D DEDK Sbjct: 1608 FNKIITDLINLD----DEDK 1623 Score = 44.3 bits (103), Expect(2) = 4e-24 Identities = 26/111 (23%), Positives = 52/111 (46%), Gaps = 4/111 (3%) Frame = +3 Query: 405 LKTKTRLILLRSLLDENYKTFMPTLINGRASLSHEEVTAALTNHELRMKDKQSSRNTPGE 584 L + + +LL + L E+YK + TL++G ++ E+V+ AL N+E++ K+K + +++ Sbjct: 1618 LDDEDKALLLLNSLPESYKFLVTTLLHGAYDINFEDVSNALMNNEVQKKEKGTYQDSSSN 1677 Query: 585 ALS----EXXXXXXXXXXXXXXXXXXXXXXXXIAKDRCRRCKKHGHWEDDC 725 L+ + K+ C ++ GHW+ DC Sbjct: 1678 VLTARERTSTWKRSECGESRSKSRGKYGNWIKLDKNECAYGRQKGHWKKDC 1728 >gb|AAR13295.1| retrovirus-related pol polyprotein, partial [Phaseolus vulgaris] Length = 324 Score = 78.2 bits (191), Expect(2) = 1e-21 Identities = 44/112 (39%), Positives = 69/112 (61%), Gaps = 9/112 (8%) Frame = +2 Query: 110 QGLKKTIDEETKPAEMSENNWIRMNKMACRLMRSCLTQDIKYRMITEMPA---------K 262 +GL I+ E KP E+ E +W +N++AC +RSCL+++ KY + E A K Sbjct: 13 KGLDIAIEGE-KPKEVEEKDWSIINRLACGTIRSCLSREQKYVVKNETSAHKLWKALEDK 71 Query: 263 YRMKSIENRLHLKWCLYRFQLQRSTSIDNHINAYTKLLAKLLNVDVDIEDED 418 + KS +N+L +K L+RF Q+ T+++ HI + +L+A LLN+DV EDED Sbjct: 72 FLKKSGQNKLLMKKRLFRFDYQQGTTMNAHITMFNQLVAYLLNLDVKFEDED 123 Score = 54.7 bits (130), Expect(2) = 1e-21 Identities = 32/102 (31%), Positives = 51/102 (50%) Frame = +3 Query: 423 LILLRSLLDENYKTFMPTLINGRASLSHEEVTAALTNHELRMKDKQSSRNTPGEALSEXX 602 L+L+ SL DE ++ TL++G+ ++S + V +AL +HELR +DK +++T E Sbjct: 126 LMLVSSLPDE-FEHLETTLLHGKDNVSLDVVCSALYSHELRKQDKMKTKSTTSE--EALV 182 Query: 603 XXXXXXXXXXXXXXXXXXXXXXIAKDRCRRCKKHGHWEDDCP 728 +AKD C C + GHW+ DCP Sbjct: 183 TRGGQQSQTKERRGMCKSKGRVVAKDECAFCHEKGHWKKDCP 224 >gb|PNX71218.1| putative retrotransposon Ty1-copia subclass protein [Trifolium pratense] gb|PNY02521.1| putative retrotransposon Ty1-copia subclass protein [Trifolium pratense] Length = 257 Score = 89.7 bits (221), Expect(2) = 1e-19 Identities = 42/138 (30%), Positives = 79/138 (57%), Gaps = 9/138 (6%) Frame = +2 Query: 32 KIEVVKFNGKTNFNTWRCEVYDALLAQGLKKTIDEETKPAEMSENNWIRMNKMACRLMRS 211 K EV +F+G NF W V D L QGL+K + ETKP +M++++W+ + + A L+R Sbjct: 8 KFEVERFDGTGNFRLWERRVKDLLAQQGLQKAL-RETKPTDMADDDWLELQEKAAGLIRL 66 Query: 212 CLTQDIKYRMIT---------EMPAKYRMKSIENRLHLKWCLYRFQLQRSTSIDNHINAY 364 C++ ++ Y ++ ++ ++Y K+ NRL K LY +++ + + H+N + Sbjct: 67 CVSDEVMYHILDLTSPKEVLDKLESQYISKTRMNRLFTKMRLYSLKMREGSDLQQHVNTF 126 Query: 365 TKLLAKLLNVDVDIEDED 418 ++ +L+ + V I+DED Sbjct: 127 NNIITELVKLGVKIDDED 144 Score = 36.2 bits (82), Expect(2) = 1e-19 Identities = 28/102 (27%), Positives = 44/102 (43%), Gaps = 1/102 (0%) Frame = +3 Query: 426 ILLRSLLDENYKTFMPTLINGRASLSHEEVTAALTNHELRMKDKQSSRNTPGEAL-SEXX 602 I+L L +YK + TLI G+ ++S +TA L +H ++ + T G+ L + Sbjct: 147 IMLLCSLPSSYKHLVNTLIYGKDTISLNVITATLLSHSRMSQNVEV--GTQGKGLYVKGS 204 Query: 603 XXXXXXXXXXXXXXXXXXXXXXIAKDRCRRCKKHGHWEDDCP 728 IA+ C CK+ GHW+ DCP Sbjct: 205 QDHGQIKGKADSGKMSKSKNRKIAE--CYSCKQIGHWKRDCP 244 >gb|KYP53542.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 197 Score = 95.1 bits (235), Expect = 1e-19 Identities = 52/145 (35%), Positives = 87/145 (60%), Gaps = 9/145 (6%) Frame = +2 Query: 14 QNVQPAKIEVVKFNGKTNFNTWRCEVYDALLAQGLKKTIDEETKPAEMSENNWIRMNKMA 193 +N+Q K E+ F+GKTNF W+C V D L+ QGL + +++E KP+ ++E W ++ K A Sbjct: 3 RNIQ--KFEIPLFDGKTNFMIWQCTVQDILVQQGLDQALEDE-KPSNINEREWSQIQKKA 59 Query: 194 CRLMRSCLTQDIKYRMITEMPAK---------YRMKSIENRLHLKWCLYRFQLQRSTSID 346 +R LT +IK ++ E K Y KS+ NRL LK LY+ +++ ++ Sbjct: 60 VSTIRLALTPEIKCNVLKETTPKALWEKLESIYASKSLTNRLCLKMELYQLKMEIGENLH 119 Query: 347 NHINAYTKLLAKLLNVDVDIEDEDK 421 +HIN + +L+ +LLNV+ I +E++ Sbjct: 120 DHINHFNQLVCQLLNVNEKIYNEEQ 144 >gb|ONK68196.1| uncharacterized protein A4U43_C05F8640 [Asparagus officinalis] Length = 631 Score = 100 bits (248), Expect = 2e-19 Identities = 55/121 (45%), Positives = 73/121 (60%), Gaps = 9/121 (7%) Frame = +2 Query: 86 EVYDALLAQGLKKTIDEETKPAEMSENNWIRMNKMACRLMRSCLTQDIKYRMITEMPA-- 259 +V DAL AQ L++TI +P + E W +MN+ AC ++RSCL Q++KY +I Sbjct: 3 DVQDALNAQNLEETIMVHERPPKYDEAIWEKMNRNACGVIRSCLEQELKYNVIGVTSVMK 62 Query: 260 -------KYRMKSIENRLHLKWCLYRFQLQRSTSIDNHINAYTKLLAKLLNVDVDIEDED 418 KY KS+ENRLHL L+ FQ+ R TS+ H+N YTKLL+ L NVD I DE Sbjct: 63 MWEILNNKYLTKSVENRLHLLRRLFGFQMSRGTSLATHVNNYTKLLSNLANVDEKISDEY 122 Query: 419 K 421 K Sbjct: 123 K 123 Score = 79.7 bits (195), Expect = 1e-12 Identities = 57/168 (33%), Positives = 82/168 (48%), Gaps = 3/168 (1%) Frame = +3 Query: 327 RGALLSIIISMPTRSYSQNC*MSMWILKTKTRLILLRSLLDENYKTFMPTLINGRASLSH 506 RG L+ ++ T+ S + I + LL SL DE Y TF+ L+NGR+S+ + Sbjct: 93 RGTSLATHVNNYTKLLSNLANVDEKISDEYKAIFLLGSLPDEEYDTFVLILLNGRSSIIY 152 Query: 507 EEVTAALTNHELRMKDKQSSR-NTPGEALSEXXXXXXXXXXXXXXXXXXXXXXXXIAKDR 683 EVT ALTN++LR KDK++SR +T GE LS +AK++ Sbjct: 153 SEVTNALTNYDLRRKDKETSRASTSGETLS--TRGRNPYPKGGNRGRSKSRSHHQLAKNQ 210 Query: 684 CRRCKKHGHWEDDCPXXXXXXXXXXXXXXXXXI-VTEGNDS-DTCVSL 821 C CK+ HW+ DCP + V +G+DS D+C SL Sbjct: 211 CAFCKEQEHWKRDCPKLKEKENCKKKGPKANEVNVAKGDDSDDSCFSL 258 >sp|P10978.1|POLX_TOBAC RecName: Full=Retrovirus-related Pol polyprotein from transposon TNT 1-94; Includes: RecName: Full=Protease; Includes: RecName: Full=Reverse transcriptase; Includes: RecName: Full=Endonuclease emb|CAA32025.1| unnamed protein product [Nicotiana tabacum] Length = 1328 Score = 88.2 bits (217), Expect(2) = 6e-19 Identities = 48/140 (34%), Positives = 77/140 (55%), Gaps = 10/140 (7%) Frame = +2 Query: 32 KIEVVKFNGKTNFNTWRCEVYDALLAQGLKKTIDEETK-PAEMSENNWIRMNKMACRLMR 208 K EV KFNG F+TW+ + D L+ QGL K +D ++K P M +W +++ A +R Sbjct: 5 KYEVAKFNGDNGFSTWQRRMRDLLIQQGLHKVLDVDSKKPDTMKAEDWADLDERAASAIR 64 Query: 209 SCLTQDIKYRMITEMPAK---------YRMKSIENRLHLKWCLYRFQLQRSTSIDNHINA 361 L+ D+ +I E A+ Y K++ N+L+LK LY + T+ +H+N Sbjct: 65 LHLSDDVVNNIIDEDTARGIWTRLESLYMSKTLTNKLYLKKQLYALHMSEGTNFLSHLNV 124 Query: 362 YTKLLAKLLNVDVDIEDEDK 421 + L+ +L N+ V IE+EDK Sbjct: 125 FNGLITQLANLGVKIEEEDK 144 Score = 35.4 bits (80), Expect(2) = 6e-19 Identities = 27/112 (24%), Positives = 53/112 (47%), Gaps = 4/112 (3%) Frame = +3 Query: 405 LKTKTRLILLRSLLDENYKTFMPTLINGRASLSHEEVTAALTNHELRMKDKQSSRNTPGE 584 ++ + + ILL + L +Y T+++G+ ++ ++VT+AL +E +M+ K ++ G+ Sbjct: 139 IEEEDKAILLLNSLPSSYDNLATTILHGKTTIELKDVTSALLLNE-KMRKKPENQ---GQ 194 Query: 585 AL-SEXXXXXXXXXXXXXXXXXXXXXXXXIAKDR---CRRCKKHGHWEDDCP 728 AL +E +K R C C + GH++ DCP Sbjct: 195 ALITEGRGRSYQRSSNNYGRSGARGKSKNRSKSRVRNCYNCNQPGHFKRDCP 246 >gb|KYP77941.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 270 Score = 93.6 bits (231), Expect = 2e-18 Identities = 44/139 (31%), Positives = 81/139 (58%), Gaps = 9/139 (6%) Frame = +2 Query: 32 KIEVVKFNGKTNFNTWRCEVYDALLAQGLKKTIDEETKPAEMSENNWIRMNKMACRLMRS 211 K EV F+G+T+F W+C + D L+ QGL + +E KP+E+ E++W + K A +R Sbjct: 5 KFEVPLFDGRTDFTLWQCTIQDYLVQQGLDLALQDEKKPSEIKESDWSTIQKKAVSTIRL 64 Query: 212 CLTQDIKYRMITE---------MPAKYRMKSIENRLHLKWCLYRFQLQRSTSIDNHINAY 364 L IK ++ E + +K+ K++ NRL +K LY +++ ++ +HIN + Sbjct: 65 ALAPQIKVTVLKETSPKKLWDTLESKFASKTLTNRLMMKMDLYSLKMEEGGNVTDHINKF 124 Query: 365 TKLLAKLLNVDVDIEDEDK 421 +++++LLN I+DE++ Sbjct: 125 NEMVSRLLNAGETIKDEEQ 143 >gb|PPY92954.1| hypothetical protein C5P31_25600, partial [Escherichia coli] Length = 292 Score = 71.2 bits (173), Expect(2) = 4e-18 Identities = 41/123 (33%), Positives = 66/123 (53%), Gaps = 9/123 (7%) Frame = +2 Query: 77 WRCEVYDALLAQGLKKTIDEETKPAEMSENNWIRMNKMACRLMRSCLTQDIKYRMITEMP 256 W+ EV D L QGL I EE +P + E +W +N++AC +RS L ++ KY E Sbjct: 2 WQDEVLDVLFQQGLDLAI-EEKRPDVIGEEDWKIINRVACGTIRSYLAREQKYPYTKETS 60 Query: 257 A---------KYRMKSIENRLHLKWCLYRFQLQRSTSIDNHINAYTKLLAKLLNVDVDIE 409 A K+ K+ +N+L++K L+ F T+++ HI ++ KL+ L N+D + Sbjct: 61 ASKLWKALEDKFLKKNSQNKLYMKKRLFHFTYVPGTTMNEHITSFNKLVTDLQNMDTTYD 120 Query: 410 DED 418 D D Sbjct: 121 DGD 123 Score = 49.7 bits (117), Expect(2) = 4e-18 Identities = 34/102 (33%), Positives = 49/102 (48%) Frame = +3 Query: 423 LILLRSLLDENYKTFMPTLINGRASLSHEEVTAALTNHELRMKDKQSSRNTPGEALSEXX 602 L+LL SL DE Y+ TL++G +S EV +AL ++E R ++KQ + GEAL Sbjct: 126 LMLLASLPDE-YEHLETTLLHGNDEISLREVCSALYSYEQRKREKQ--KGGEGEALFVRG 182 Query: 603 XXXXXXXXXXXXXXXXXXXXXXIAKDRCRRCKKHGHWEDDCP 728 +KD C C++ GHW+ DCP Sbjct: 183 RPQNQTRTKKGRSKSRSRP----SKDECAFCREKGHWKKDCP 220 >ref|XP_012070606.1| uncharacterized protein LOC105632772 [Jatropha curcas] Length = 411 Score = 94.7 bits (234), Expect = 5e-18 Identities = 52/142 (36%), Positives = 81/142 (57%), Gaps = 9/142 (6%) Frame = +2 Query: 20 VQPAKIEVVKFNGKTNFNTWRCEVYDALLAQGLKKTIDEETKPAEMSENNWIRMNKMACR 199 V A+ V F+G +F W+ E+ DAL QGL I+EE KP +M E+ W +N++AC Sbjct: 21 VSNARFTVEIFDGTGHFGIWQSELLDALFQQGLDVAIEEE-KPTKMEESEWRTINRLACG 79 Query: 200 LMRSCLTQDIKYRMITEMPA---------KYRMKSIENRLHLKWCLYRFQLQRSTSIDNH 352 +RSCL+++ KY E A K+ K+ +N+L +K LYRF T+++ + Sbjct: 80 TIRSCLSREQKYVFSKETSANKLWKALEEKFLKKNNQNKLFMKNKLYRFNYVSGTTMNEY 139 Query: 353 INAYTKLLAKLLNVDVDIEDED 418 I + +++A L N+DV DED Sbjct: 140 ITKFNQMVADLFNLDVTFADED 161 >gb|KHN13198.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Glycine soja] Length = 341 Score = 71.2 bits (173), Expect(2) = 5e-18 Identities = 44/140 (31%), Positives = 72/140 (51%), Gaps = 10/140 (7%) Frame = +2 Query: 32 KIEVVKFNGKTNFNTWRCEVYDALLAQGLKKTID-EETKPAEMSENNWIRMNKMACRLMR 208 K +V KF GK NFN WR ++ L Q + ++ EE PAEM+ + K A + Sbjct: 2 KFDVEKFTGKNNFNLWRVKMLALLTQQECELALEGEEMLPAEMTAAQKRVIMKKAYSAIL 61 Query: 209 SCLTQDI---------KYRMITEMPAKYRMKSIENRLHLKWCLYRFQLQRSTSIDNHINA 361 L ++ ++ ++ ++Y KS+ NRL LK LY Q+ SI HI+ Sbjct: 62 LSLGDEVLGEVSGEKTADKLWAKLESRYMTKSLHNRLCLKKQLYTMQMHEGESIHKHIDN 121 Query: 362 YTKLLAKLLNVDVDIEDEDK 421 + +++ L N+DV ++DED+ Sbjct: 122 FNQVVLSLKNIDVAVDDEDQ 141 Score = 49.3 bits (116), Expect(2) = 5e-18 Identities = 33/102 (32%), Positives = 45/102 (44%), Gaps = 1/102 (0%) Frame = +3 Query: 426 ILLRSLLDENYKTFMPTLINGRASLSHEEVTAALTNHELRMKDKQSSRNT-PGEALSEXX 602 +LL S L Y F+ T+I GR+SLS EEV AL + EL+ + S T GE L Sbjct: 143 VLLLSSLPRAYDNFVDTIIFGRSSLSMEEVKTALQSWELKRRITDSYGGTSSGEGLMVRG 202 Query: 603 XXXXXXXXXXXXXXXXXXXXXXIAKDRCRRCKKHGHWEDDCP 728 ++C C+K GHW+ +CP Sbjct: 203 RMDERKSFQRRRSKSRSKNKN---NNKCHNCQKEGHWKRNCP 241 >gb|ESQ46603.1| hypothetical protein EUTSA_v10000703mg, partial [Eutrema salsugineum] Length = 162 Score = 89.4 bits (220), Expect = 8e-18 Identities = 47/133 (35%), Positives = 76/133 (57%) Frame = +2 Query: 32 KIEVVKFNGKTNFNTWRCEVYDALLAQGLKKTIDEETKPAEMSENNWIRMNKMACRLMRS 211 K ++ +FNGK NF W+ V D L QG+KK + E+ KP ++ +++W M A +R Sbjct: 8 KFDIPRFNGKGNFGLWQSRVKDLLTQQGMKKALLEK-KPDKIKQDDWDDMQDQAVSTIRF 66 Query: 212 CLTQDIKYRMITEMPAKYRMKSIENRLHLKWCLYRFQLQRSTSIDNHINAYTKLLAKLLN 391 CL+ DI ++ Y KS+ ++L+LK L+ ++ S + HIN + +++ L Sbjct: 67 CLSDDIT----NQLEKMYMSKSLSSKLYLKQKLFGLKMVESGDLIAHINNFNQIIGDLTR 122 Query: 392 VDVDIEDEDKTHI 430 VD+ IEDEDK I Sbjct: 123 VDMKIEDEDKAMI 135 >gb|KHN13199.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Glycine soja] Length = 344 Score = 70.1 bits (170), Expect(2) = 1e-17 Identities = 43/140 (30%), Positives = 72/140 (51%), Gaps = 10/140 (7%) Frame = +2 Query: 32 KIEVVKFNGKTNFNTWRCEVYDALLAQGLKKTID-EETKPAEMSENNWIRMNKMACRLMR 208 K +V KF GK NFN WR ++ L Q + ++ EE PAE++ + K A + Sbjct: 5 KFDVEKFTGKNNFNLWRVKMLALLTQQECELALEGEEMLPAELTAAQKRVIMKKAYSAIL 64 Query: 209 SCLTQDI---------KYRMITEMPAKYRMKSIENRLHLKWCLYRFQLQRSTSIDNHINA 361 L ++ ++ ++ ++Y KS+ NRL LK LY Q+ SI HI+ Sbjct: 65 LSLGDEVLGEISGEKTADKLWAKLESRYMTKSLHNRLCLKKQLYTMQMHEGESIHKHIDN 124 Query: 362 YTKLLAKLLNVDVDIEDEDK 421 + +++ L N+DV ++DED+ Sbjct: 125 FNQVVLSLKNIDVAVDDEDQ 144 Score = 49.3 bits (116), Expect(2) = 1e-17 Identities = 33/102 (32%), Positives = 45/102 (44%), Gaps = 1/102 (0%) Frame = +3 Query: 426 ILLRSLLDENYKTFMPTLINGRASLSHEEVTAALTNHELRMKDKQSSRNT-PGEALSEXX 602 +LL S L Y F+ T+I GR+SLS EEV AL + EL+ + S T GE L Sbjct: 146 VLLLSSLPRAYDNFVDTIIFGRSSLSMEEVKTALQSWELKRRITDSYGGTSSGEGLMVRG 205 Query: 603 XXXXXXXXXXXXXXXXXXXXXXIAKDRCRRCKKHGHWEDDCP 728 ++C C+K GHW+ +CP Sbjct: 206 RMDERKSFQRRRSKSRSKNKN---NNKCHNCQKEGHWKRNCP 244 >gb|PPS20553.1| hypothetical protein GOBAR_AA00016 [Gossypium barbadense] Length = 2351 Score = 77.8 bits (190), Expect(2) = 1e-17 Identities = 46/148 (31%), Positives = 76/148 (51%), Gaps = 10/148 (6%) Frame = +2 Query: 17 NVQPAKIEVVKFNGKTNFNTWRCEVYDALLAQGLKKTIDEETK-PAEMSENNWIRMNKMA 193 +V K +V KF GK +F+ WR ++ L+ QGL K + + K P+ +SE M + A Sbjct: 503 SVSSTKYDVEKFTGKNSFSLWRIKMRAVLVQQGLLKALSGKDKLPSTLSEEQKDDMLERA 562 Query: 194 CRLMRSCLTQDIKYRMITEMPA---------KYRMKSIENRLHLKWCLYRFQLQRSTSID 346 + CL ++ + E A KY KS+ NRL+LK LY +++ T + Sbjct: 563 HSAILLCLGDEVLREVADEKTASGLWLRLESKYMTKSLTNRLYLKQRLYALKMEEGTPVS 622 Query: 347 NHINAYTKLLAKLLNVDVDIEDEDKTHI 430 H++ + ++ L N+D I+DED+ I Sbjct: 623 QHLDKFNSIIMDLNNIDNKIDDEDQAII 650 Score = 41.2 bits (95), Expect(2) = 1e-17 Identities = 30/102 (29%), Positives = 45/102 (44%), Gaps = 1/102 (0%) Frame = +3 Query: 426 ILLRSLLDENYKTFMPTLINGRASLSHEEVTAALTNHELRMK-DKQSSRNTPGEALSEXX 602 I++ L +Y+ F+ T++ GR L+ EEV AL++ ELR K + N GE L Sbjct: 649 IIVLCSLPPSYENFVDTMMYGRDDLTLEEVKNALSSSELRKKITGKVVENNEGEGLVARG 708 Query: 603 XXXXXXXXXXXXXXXXXXXXXXIAKDRCRRCKKHGHWEDDCP 728 + +C CKK+GH + DCP Sbjct: 709 RSKAKGGSSSKSHPRSQSK----KRIQCYYCKKYGHMKVDCP 746 >gb|PPR84446.1| hypothetical protein GOBAR_AA36262 [Gossypium barbadense] Length = 1841 Score = 77.8 bits (190), Expect(2) = 1e-17 Identities = 46/148 (31%), Positives = 76/148 (51%), Gaps = 10/148 (6%) Frame = +2 Query: 17 NVQPAKIEVVKFNGKTNFNTWRCEVYDALLAQGLKKTIDEETK-PAEMSENNWIRMNKMA 193 +V K +V KF GK +F+ WR ++ L+ QGL K + + K P+ +SE M + A Sbjct: 524 SVSSTKYDVEKFTGKNSFSLWRIKMRAVLVQQGLLKALSGKDKLPSTLSEEQKDDMLERA 583 Query: 194 CRLMRSCLTQDIKYRMITEMPA---------KYRMKSIENRLHLKWCLYRFQLQRSTSID 346 + CL ++ + E A KY KS+ NRL+LK LY +++ T + Sbjct: 584 HSAILLCLGDEVLREVADEKTASGLWLRLESKYMTKSLTNRLYLKQRLYALKMEEGTPVS 643 Query: 347 NHINAYTKLLAKLLNVDVDIEDEDKTHI 430 H++ + ++ L N+D I+DED+ I Sbjct: 644 QHLDKFNSIIMDLNNIDNKIDDEDQAII 671 Score = 41.2 bits (95), Expect(2) = 1e-17 Identities = 30/102 (29%), Positives = 45/102 (44%), Gaps = 1/102 (0%) Frame = +3 Query: 426 ILLRSLLDENYKTFMPTLINGRASLSHEEVTAALTNHELRMK-DKQSSRNTPGEALSEXX 602 I++ L +Y+ F+ T++ GR L+ EEV AL++ ELR K + N GE L Sbjct: 670 IIVLCSLPPSYENFVDTMMYGRDDLTLEEVKNALSSSELRKKITGKVVENNEGEGLVARG 729 Query: 603 XXXXXXXXXXXXXXXXXXXXXXIAKDRCRRCKKHGHWEDDCP 728 + +C CKK+GH + DCP Sbjct: 730 RSKAKGGSSSKSHPRSQSK----KRIQCYYCKKYGHMKVDCP 767 >gb|PON41859.1| hypothetical protein PanWU01x14_286250 [Parasponia andersonii] Length = 200 Score = 77.0 bits (188), Expect(2) = 2e-17 Identities = 41/99 (41%), Positives = 59/99 (59%), Gaps = 9/99 (9%) Frame = +2 Query: 155 MSENNWIRMNKMACRLMRSCLTQDIKYRMITEMPA---------KYRMKSIENRLHLKWC 307 M + W R+N+ AC +R CL ++ KY + E A KY KS ENRL+LK Sbjct: 1 MDDKEWERINRQACDTIRLCLCREQKYPFMRETFASKLWKAIENKYMKKSNENRLYLKKR 60 Query: 308 LYRFQLQRSTSIDNHINAYTKLLAKLLNVDVDIEDEDKT 424 L+ QL+ T+I +HI+ + +L+A LLN+D +DEDKT Sbjct: 61 LFHIQLKPGTTISDHIDTFNQLIADLLNLDETFKDEDKT 99 Score = 42.0 bits (97), Expect(2) = 2e-17 Identities = 20/62 (32%), Positives = 35/62 (56%) Frame = +3 Query: 408 KTKTRLILLRSLLDENYKTFMPTLINGRASLSHEEVTAALTNHELRMKDKQSSRNTPGEA 587 K + + +LL + TL++G L +EV+A L NHE+R KD++ +++ P E Sbjct: 94 KDEDKTMLLIGSFPDELDHLCITLLHGNEKLFFDEVSATLYNHEIRKKDQKENKDVPAEV 153 Query: 588 LS 593 L+ Sbjct: 154 LT 155 >gb|ABA96825.1| retrotransposon protein, putative, Ty1-copia subclass [Oryza sativa Japonica Group] Length = 424 Score = 90.5 bits (223), Expect = 2e-16 Identities = 48/143 (33%), Positives = 77/143 (53%), Gaps = 9/143 (6%) Frame = +2 Query: 29 AKIEVVKFNGKTNFNTWRCEVYDALLAQGLKKTIDEETKPAEMSENNWIRMNKMACRLMR 208 +K EVVKF+G NF W+ + D L QG+ K + EET P +M W M A +R Sbjct: 7 SKFEVVKFDGTGNFVLWQMRLKDLLAQQGISKAL-EETMPEKMDAGKWEEMKAQAAATIR 65 Query: 209 SCLTQDIKYRMITEMPAK---------YRMKSIENRLHLKWCLYRFQLQRSTSIDNHINA 361 L+ + Y+++ E +K Y KS+ ++L+LK LY Q+Q + + H++ Sbjct: 66 LSLSDSVMYQVMDEKTSKEIWVKLTSLYMSKSLTSKLYLKQQLYGLQMQEESDLRKHVDV 125 Query: 362 YTKLLAKLLNVDVDIEDEDKTHI 430 + +L+ L +DV ++DEDK I Sbjct: 126 FNQLVVDLSKLDVKLDDEDKAII 148 >ref|XP_020266969.1| uncharacterized protein LOC109842514 [Asparagus officinalis] Length = 129 Score = 84.7 bits (208), Expect(2) = 2e-16 Identities = 41/97 (42%), Positives = 61/97 (62%), Gaps = 9/97 (9%) Frame = +2 Query: 155 MSENNWIRMNKMACRLMRSCLTQDIKYRMITE---------MPAKYRMKSIENRLHLKWC 307 M E W ++N AC ++RSCLT D+ Y ++ E + KY K+IEN+LHLK Sbjct: 1 MEEKVWKKVNSRACGIIRSCLTLDLIYDVMNETLTKRLWEILNNKYLTKNIENQLHLKKI 60 Query: 308 LYRFQLQRSTSIDNHINAYTKLLAKLLNVDVDIEDED 418 Y+F++ R SI H+N +TKLL+ + NVD+ ++DED Sbjct: 61 FYQFKMNRGVSIREHVNNFTKLLSDMANVDIMVDDED 97 Score = 30.8 bits (68), Expect(2) = 2e-16 Identities = 13/29 (44%), Positives = 23/29 (79%) Frame = +3 Query: 423 LILLRSLLDENYKTFMPTLINGRASLSHE 509 ++LL SL +++Y TF+ T+INGR +S++ Sbjct: 100 MLLLCSLSEDDYGTFVLTMINGRTIVSYK 128