BLASTX nr result
ID: Perilla23_contig00027964
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Perilla23_contig00027964 (963 letters)

Database: ./nr
77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done

Sequences producing significant alignments:  Score (bits)  E Value
ref|XP_011073037.1| PREDICTED: retrovirus-related Pol polyprotei... 179 3e-42
ref|XP_012849618.1| PREDICTED: uncharacterized protein LOC105969... 176 2e-41
ref|XP_011100917.1| PREDICTED: uncharacterized protein LOC105179... 155 6e-35
emb|CAA37918.1| unnamed protein product [Arabidopsis thaliana] 116 2e-23
emb|CAN61630.1| hypothetical protein VITISV_003191 [Vitis vinifera] 114 9e-23
dbj|BAQ19356.1| putative gag-pol polyprotein [Torenia fournieri] 112 3e-22
emb|CAA31653.1| polyprotein [Arabidopsis thaliana] 104 9e-20
emb|CAN71583.1| hypothetical protein VITISV_043292 [Vitis vinifera] 101 1e-18
dbj|BAA96887.1| copia-like retroelement pol polyprotein [Arabido... 99 4e-18
ref|XP_011070172.1| PREDICTED: glutamate receptor 2.8-like [Sesa... 97 2e-17
gb|AAC62132.1| copia-like retroelement pol polyprotein [Arabidop... 96 3e-17
gb|AAW57815.1| putative polyprotein [Oryza sativa Japonica Group] 96 5e-17
gb|ABD96963.1| hypothetical protein [Cleome spinosa] 96 5e-17
ref|XP_012849888.1| PREDICTED: retrovirus-related Pol polyprotei... 94 1e-16
emb|CAN66576.1| hypothetical protein VITISV_016964 [Vitis vinifera] 92 6e-16
gb|AAL31047.1|AC078893_10 putative polyprotein [Oryza sativa Jap... 90 2e-15
pir||S23319 hypothetical protein 2 - Arabidopsis thaliana retrot... 90 3e-15
emb|CAN84102.1| hypothetical protein VITISV_041248 [Vitis vinifera] 90 3e-15
ref|XP_013614156.1| PREDICTED: uncharacterized protein LOC106320...
89 4e-15 gb|KHN28812.1| Retrovirus-related Pol polyprotein from transposo... 88 9e-15 >ref|XP_011073037.1| PREDICTED: retrovirus-related Pol polyprotein from transposon TNT 1-94 [Sesamum indicum] Length = 472 Score = 179 bits (454), Expect = 3e-42 Identities = 97/213 (45%), Positives = 137/213 (64%), Gaps = 7/213 (3%) Frame = -3 Query: 961 TEEEKDETNQLALATIILNLSDAVCRKVYHVKSAKELWDKLNDELYAQTSLPSRMFLLDE 782 TEE+K E ++ A ++I+LNLSD V RKV ++S+K LWDKL +EL+ + SLP+++FLL++ Sbjct: 44 TEEKKLENDEFAYSSIVLNLSDTVLRKVGKLESSKALWDKL-EELFTEISLPNKLFLLEK 102 Query: 781 FINFKLDISKDMDENLYIFNKLVQDIKLSGDKGIDKYTAAVLLNAIPESYSDVKSAIKNG 602 +KLD+SK++DENL F KL+QDIKL+GDK ID+Y+ VLLNAIPES+SDVK+AIK G Sbjct: 103 IFRYKLDLSKNIDENLDDFTKLIQDIKLAGDKYIDEYSPIVLLNAIPESFSDVKAAIKYG 162 Query: 601 GNDVAFDIIVDALKNKENDSKHNGSGKYSSKLKHVRGRTQTRSQTDDRNESGGGKTKFN- 425 + + + +V+ LK+KE D K N + ++ VRGRT+ + N KTK N Sbjct: 163 RDSINLETVVNGLKSKELDLKVNKPSQSHYEINSVRGRTRFGNFNSRYNSRSRSKTKTNR 222 Query: 424 ------XXXXXXXXXXXXKCYNCNEPGHLAKEC 344 +CYNC GH K+C Sbjct: 223 SKSRPRETNLRDDKIRDRRCYNCGTKGHYIKDC 255 Score = 112 bits (279), Expect = 6e-22 Identities = 51/79 (64%), Positives = 65/79 (82%) Frame = -3 Query: 238 NATKCPVAGIGDICLKFDSGYEYTLKNVRHVPDLCRNFLSCASLEEDGLDGIWGKGVMKI 59 N KC + G+GDI L FD GY+ LKNVR+VPDL N +SCA+LEE+GL+G WGKG+MKI Sbjct: 333 NEKKCEIKGLGDISLCFD-GYKMLLKNVRYVPDLSHNLISCAALEENGLEGRWGKGLMKI 391 Query: 58 LKGSLVIFKAEKENNMYIC 2 +KGSLV+FKAE++ N+YIC Sbjct: 392 MKGSLVVFKAERKRNLYIC 410 >ref|XP_012849618.1| PREDICTED: uncharacterized protein LOC105969411 [Erythranthe guttatus] Length = 1213 Score = 176 bits (446), Expect = 2e-41 Identities = 91/162 (56%), Positives = 122/162 (75%) Frame = -3 Query: 961 TEEEKDETNQLALATIILNLSDAVCRKVYHVKSAKELWDKLNDELYAQTSLPSRMFLLDE 782 T +K E ++LA + IILNLSD+V RKV SAK LW+KL DELY +TSLPS++FLL++ Sbjct: 487 TASKKIELDELAYSAIILNLSDSVIRKVGMHDSAKGLWEKL-DELYTETSLPSKLFLLEK 545 Query: 781 FINFKLDISKDMDENLYIFNKLVQDIKLSGDKGIDKYTAAVLLNAIPESYSDVKSAIKNG 602 F FKLD++KD+DEN+ F +LVQDIKL+GDK ID YT 
VLLNAIP+SY+D+KSAIK G Sbjct: 546 FFRFKLDLTKDIDENIDQFTRLVQDIKLTGDKSIDNYTPIVLLNAIPDSYNDLKSAIKYG 605 Query: 601 GNDVAFDIIVDALKNKENDSKHNGSGKYSSKLKHVRGRTQTR 476 ++++ D +++ LK+KE D + N S K ++ VRGR Q R Sbjct: 606 RDNISLDTVINGLKSKEMDLRVNKSNKSFGEVNFVRGRQQNR 647 >ref|XP_011100917.1| PREDICTED: uncharacterized protein LOC105179028 [Sesamum indicum] Length = 188 Score = 155 bits (391), Expect = 6e-35 Identities = 79/141 (56%), Positives = 109/141 (77%) Frame = -3 Query: 961 TEEEKDETNQLALATIILNLSDAVCRKVYHVKSAKELWDKLNDELYAQTSLPSRMFLLDE 782 ++E+K + ++ A ++IILNLSD V RKV S+K+LW+KL D LY +TSLPS++FLL++ Sbjct: 44 SDEKKIQNDEFAYSSIILNLSDNVLRKVGKQSSSKDLWEKLED-LYTETSLPSKLFLLEK 102 Query: 781 FINFKLDISKDMDENLYIFNKLVQDIKLSGDKGIDKYTAAVLLNAIPESYSDVKSAIKNG 602 ++KLD+SK +DENL F KL+QDIKL+GDK ID+Y+ VLLNAIP+SYSD K+AIK G Sbjct: 103 KFHYKLDLSKSIDENLDDFTKLIQDIKLTGDKNIDEYSPIVLLNAIPKSYSDAKAAIKYG 162 Query: 601 GNDVAFDIIVDALKNKENDSK 539 + V D +V+ LK+KE D K Sbjct: 163 RDSVNLDTVVNGLKSKEMDLK 183 >emb|CAA37918.1| unnamed protein product [Arabidopsis thaliana] Length = 560 Score = 116 bits (291), Expect = 2e-23 Identities = 88/356 (24%), Positives = 161/356 (45%), Gaps = 38/356 (10%) Frame = -3 Query: 958 EEEKDETNQLALATIILNLSDAVCRKVYHVKSAKELWDKLNDELYAQTSLPSRMFLLDEF 779 ++ K E ++ A+ II ++ DAV RK+ H KSA E+W LN + Y +TSLP+R+++ +F Sbjct: 68 DQVKIEKSENAMNIIITHVGDAVLRKIDHCKSAAEMWKTLNKQ-YMETSLPNRIYVQLKF 126 Query: 778 INFKLDISKDMDENLYIFNKLVQDIKLSGDKGIDKYTAAVLLNAIPESYSDVKSAIKNGG 599 +FK++ SK ++EN+ F K+V ++ +++ A + LN + YS +K +K G Sbjct: 127 YSFKMNDSKSINENVNEFLKIVAELSSLEINVVEEVRAILFLNGLSSRYSQLKHTLKYGN 186 Query: 598 NDVAFDIIVDALKNKE---NDSKHNGSGKYSSKLKHVRGRTQTRSQTDDRNESGGGKTKF 428 ++ ++ + ++ E ++ K + + RGR TR+Q ++ G G++K Sbjct: 187 KALSLQDVISSARSLERELDEQKETDKNTSTVLYTNERGRPLTRNQNQNKGGQGRGRSKS 246 Query: 427 NXXXXXXXXXXXXKCYNCNEPGHLAKEC----TXXXXXXXXXXXXXKEHAVAGKCYTI-I 263 N C+ C + GH+ K+C E V + ++ Sbjct: 247 N-------SNAKLTCWYCKKEGHVKKDCFARKRKLESENPGEAGVITEKLVFSEALSVND 299 Query: 262 
LCLSQMRQNATKCP----------------------------VAGIGDICLKFDS--GYE 173 L + + + + CP V G +K D+ G Sbjct: 300 LAVRDIWELDSGCPSHMSARKDWFCNFREDGGTTILLGDDHSVKSQGQGSIKIDTHGGTI 359 Query: 172 YTLKNVRHVPDLCRNFLSCASLEEDGLDGIWGKGVMKILKGSLVIFKAEKENNMYI 5 L+NV++VP+L RN +S +L++ G G G ++ K + E N +YI Sbjct: 360 TVLENVKYVPELRRNLISTGTLDKRGYKHEGGDGKVRYFKNQKTALRGEIVNGLYI 415 >emb|CAN61630.1| hypothetical protein VITISV_003191 [Vitis vinifera] Length = 1208 Score = 114 bits (286), Expect = 9e-23 Identities = 88/335 (26%), Positives = 155/335 (46%), Gaps = 18/335 (5%) Frame = -3 Query: 958 EEEKDETNQLALATIILNLSDAVCRKVYHVKSAKELWDKLNDELYAQTSLPSRMFLLDEF 779 E++K E + A + IIL+L D V R+ KSA E+W KL + LY SL +R+ + Sbjct: 49 EKQKIELLEKAHSAIILSLGDTVLREXAKAKSAAEVWLKL-ESLYMTKSLANRLHKKIKL 107 Query: 778 INFKLDISKDMDENLYIFNKLVQDIKLSGDKGIDKYTAAVLLNAIPESYSDVKSAIKNGG 599 FK+ ++ +L FNK++ D++ D+ A +LL ++ SY+++K AI G Sbjct: 108 YTFKMTPGMSIEXHLDHFNKIILDLENIDITISDEDKAILLLTSLDASYTNMKDAIMYGR 167 Query: 598 NDVAFDIIVDALKNKENDSKHNGSGKYSSKLKHVRGRTQTRSQTDDRNES-GGGKTKFNX 422 + + FD + L +E K S + S + ++RGR++ R + ++S KTK Sbjct: 168 DSLTFDEVQSILHAREL-QKQEESKEESGEGLNIRGRSEKREKKGKNSKSRSKSKTK--- 223 Query: 421 XXXXXXXXXXXKCYNCNEPGHLAKEC-----TXXXXXXXXXXXXXKEHAVAGKCYTIILC 257 KC+ C++ GH K+C + + G +T +L Sbjct: 224 ---------KFKCFICHKEGHFKKDCPDRRQNTVKKTVNRWTRVRSGYLIQGALFTCVLS 274 Query: 256 LSQMRQ------------NATKCPVAGIGDICLKFDSGYEYTLKNVRHVPDLCRNFLSCA 113 ++ N C + G G + +K G E L++VR++P+L RN +S Sbjct: 275 KLGLKTFKEADGGYVLLGNNKHCKILGTGTVRIKHYDGIERVLEDVRYIPELKRNLISLG 334 Query: 112 SLEEDGLDGIWGKGVMKILKGSLVIFKAEKENNMY 8 L++ G +++ +GSL + K +N +Y Sbjct: 335 MLDKSGYTFKSEPNSLRVARGSLTVMKGTIKNGLY 369 >dbj|BAQ19356.1| putative gag-pol polyprotein [Torenia fournieri] Length = 605 Score = 112 bits (281), Expect = 3e-22 Identities = 95/358 (26%), Positives = 152/358 (42%), Gaps = 45/358 (12%) Frame = -3 Query: 952 EKDETNQLALATIILNLSDAVCRKVYHVKSAKELWDKLNDELYAQTSLPSRMFLLDEFIN 773 EK + A + IIL+L D V R+V H +A LW KL +ELY SL +R++L + Sbjct: 69 
EKKILDSKAHSVIILSLGDRVLRQVSHESTALGLWKKL-EELYMTKSLANRLYLKQALYS 127 Query: 772 FKLDISKDMDENLYIFNKLVQDIKLSGDKGIDKYTAAVLLNAIPESYSDVKSAIKNGGND 593 FK+ K +DE + F KL+ D++ K D+ A +L+ A+P SY+ K + G Sbjct: 128 FKMIEEKAIDEQMDQFIKLILDLENIEVKIEDEDQALLLVCALPRSYNTFKDTLLYGRET 187 Query: 592 VAFDIIVDALKNKE-NDSKHNGSGKYSSKLKHVRGRTQTRSQTDDRNESGGGKTKFNXXX 416 + + ALK+K+ N N + +S+ +V+G+ + + +R K K Sbjct: 188 LTLKEVQAALKSKQLNTRIDNKAVGSTSEALYVKGKGEEKKTHKERKNKSKKKVK----- 242 Query: 415 XXXXXXXXXKCYNCNEPGHLAKECTXXXXXXXXXXXXXKEHAVAGKCY--TIILCLSQMR 242 C+ C+E GH+ K C E A+A + Y +L ++ Sbjct: 243 ----------CFYCDEEGHMCKNC-PKKERDKGKKVEQGEAAMACESYESADVLAVTHED 291 Query: 241 QNATKC------------------------------------------PVAGIGDICLKF 188 Q+ TK + G G + ++ Sbjct: 292 QDVTKSEKSGKWLLDSASSFHVTCVKSWIKDFKGCDGCLVSVGEEKQYKILGFGTVKIRL 351 Query: 187 DSGYEYTLKNVRHVPDLCRNFLSCASLEEDGLDGIWGKGVMKILKGSLVIFKAEKENN 14 +G L+NV+ +PDL RN +S L+ G + G GVMK+ KGS VI + N Sbjct: 352 KTGGVRILRNVKFIPDLGRNLISVGLLDVQGFKCVAGNGVMKVFKGSKVIMSGTLQKN 409 >emb|CAA31653.1| polyprotein [Arabidopsis thaliana] Length = 1291 Score = 104 bits (260), Expect = 9e-20 Identities = 60/204 (29%), Positives = 110/204 (53%), Gaps = 3/204 (1%) Frame = -3 Query: 949 KDETNQLALATIILNLSDAVCRKVYHVKSAKELWDKLNDELYAQTSLPSRMFLLDEFINF 770 K E ++ A+ II ++ DAV RK+ H KSA E+W+ LN + Y +TSLP+R+++ +F +F Sbjct: 83 KIEKSENAMNIIIAHVGDAVLRKIDHCKSAAEMWETLNKQ-YMETSLPNRIYVQLKFYSF 141 Query: 769 KLDISKDMDENLYIFNKLVQDIKLSGDKGIDKYTAAVLLNAIPESYSDVKSAIKNGGNDV 590 K++ +K ++EN+ F K+V ++ +++ A + LN + YS +K +K G + Sbjct: 142 KMNDTKSINENVNEFLKIVAELSSLEINVVEEVRAILFLNRLSSRYSQLKHTLKYGNKAL 201 Query: 589 AFDIIVDALKNKE---NDSKHNGSGKYSSKLKHVRGRTQTRSQTDDRNESGGGKTKFNXX 419 + ++ A ++ E N+ K + + R R QTR+Q ++ G G++K N Sbjct: 202 SLKDVISAARSLERELNEQKETDKNTSTVLYTNERSRPQTRNQNHNKGGQGRGRSKSN-- 259 Query: 418 XXXXXXXXXXKCYNCNEPGHLAKE 347 C+ C + GH+ K+ Sbjct: 260 -----SNAKLTCWYCKKEGHVKKD 278 >emb|CAN71583.1| hypothetical protein VITISV_043292 [Vitis vinifera] Length = 531 Score = 101 bits (251), Expect 
= 1e-18 Identities = 89/318 (27%), Positives = 141/318 (44%), Gaps = 23/318 (7%) Frame = -3 Query: 892 VCRKVYHVKSAKELWDKLNDELYAQTSLPSRMFLLDEFINFKLDISKDMDENLYIFNKLV 713 V RKV A +W KL +E+Y SLP+R++L +F FK+ K +DE + F KLV Sbjct: 15 VLRKVSREILASTIWKKL-EEIYMLKSLPNRIYLKQKFFGFKMHEKKSIDEIIDEFTKLV 73 Query: 712 QDIKLSGDKGIDKYTAAVLLNAIPESYSDVKSAIKNGGNDVAFDIIVDALKNKENDSKHN 533 D++ K D+ A +LLN++P+ + ++ +K + ++ + +V A+ K+ D + Sbjct: 74 VDLESLCVKIEDEDQAVLLLNSLPKVFDQLRDTLKYSKDSLSLEGVVSAIHAKKMDIRVE 133 Query: 532 GSGKYSSKLKHVRGRTQTRSQT-----DDRNESGGGKTKFNXXXXXXXXXXXXKCYNCNE 368 G SS+ VRGR + R+ D K+KF KC++C++ Sbjct: 134 RIGTTSSEGLVVRGRPKKRNNNQKGNHDKSKSKSKSKSKFT-----------RKCFHCHK 182 Query: 367 PGHLAKECTXXXXXXXXXXXXXKEHAVAGKCYTIILCLSQMRQNATKCPV---------- 218 GH + VA K T L + CP+ Sbjct: 183 EGHYREXXDVAIAFEGYDXVDVL--MVAKKDLTSEWILDSGXKXFHMCPIKSWFSRYMEL 240 Query: 217 ------AGIGDICLKFDSG--YEYTLKNVRHVPDLCRNFLSCASLEEDGLDGIWGKGVMK 62 G + C K +G E TL NVR+V DL RN + L+ G G +K Sbjct: 241 KEGKVLMGXNEAC-KVTNGMAMERTLTNVRYVLDLKRNLIXMGMLDSMGCTIKIXNGELK 299 Query: 61 ILKGSLVIFKAEKENNMY 8 ILKG+ ++ K ++N +Y Sbjct: 300 ILKGAKIVMKGIRKNGLY 317 >dbj|BAA96887.1| copia-like retroelement pol polyprotein [Arabidopsis thaliana] Length = 1140 Score = 99.4 bits (246), Expect = 4e-18 Identities = 62/207 (29%), Positives = 108/207 (52%), Gaps = 5/207 (2%) Frame = -3 Query: 949 KDETNQLALATIILNLSDAVCRKVYHVKSAKELWDKLNDELYAQTSLPSRMFLLDEFINF 770 K E ++ A+ II ++SD V RKV H K+A LW+ LN ELY +T LP+R++ +F +F Sbjct: 79 KIEKSEQAMNIIINHISDTVLRKVNHCKTAATLWELLN-ELYMETLLPNRIYAQLKFYSF 137 Query: 769 KLDISKDMDENLYIFNKLVQDIKLSGDKGIDKYTAAVLLNAIPESYSDVKSAIKNGGNDV 590 ++ SK +D+N+ F ++V ++ K ++ A ++LN++P +Y +K +K G + Sbjct: 138 RMMTSKTIDQNVDDFLRIVAELGSLDIKVAEEVQAILILNSLPVTYDQLKHTLKYGNKTL 197 Query: 589 AFDIIVDALKNKENDS---KHNGSGKYSSKLKHVRGRTQTRSQTDDRNESGGGKTKFN-- 425 + +V + K+ E + K N ++ RGR QTR+Q + + G N Sbjct: 198 SVKDVVSSSKSLEREMAELKENTKVVNTTLYTAERGRPQTRNQNGSQGNNQGNNQGKNQG 257 Query: 424 XXXXXXXXXXXXKCYNCNEPGHLAKEC 344 C+ C + GH+ K+C 
Sbjct: 258 KGKSRSNSKSRVTCWFCKKEGHVKKDC 284 >ref|XP_011070172.1| PREDICTED: glutamate receptor 2.8-like [Sesamum indicum] Length = 1131 Score = 96.7 bits (239), Expect = 2e-17 Identities = 42/67 (62%), Positives = 54/67 (80%) Frame = -3 Query: 238 NATKCPVAGIGDICLKFDSGYEYTLKNVRHVPDLCRNFLSCASLEEDGLDGIWGKGVMKI 59 N +C + G+GDI + F GY+ TLKNVR+VPDL N +SCA+LEEDGL+G WGKG+MKI Sbjct: 369 NEKRCEIEGLGDISMCFKDGYKMTLKNVRYVPDLSHNLISCAALEEDGLEGRWGKGLMKI 428 Query: 58 LKGSLVI 38 +KGSLV+ Sbjct: 429 MKGSLVV 435 >gb|AAC62132.1| copia-like retroelement pol polyprotein [Arabidopsis thaliana] Length = 1137 Score = 96.3 bits (238), Expect = 3e-17 Identities = 59/208 (28%), Positives = 107/208 (51%), Gaps = 3/208 (1%) Frame = -3 Query: 958 EEEKDETNQ--LALATIILNLSDAVCRKVYHVKSAKELWDKLNDELYAQTSLPSRMFLLD 785 EEEK +Q A+ I +N+ D V R + + K+A E W L D+LY SLP+R++L Sbjct: 32 EEEKARIDQDEKAMDMIFINVGDKVLRNIENSKTAAEAWATL-DKLYLVKSLPNRVYLQL 90 Query: 784 EFINFKLDISKDMDENLYIFNKLVQDIKLSGDKGIDKYTAAVLLNAIPESYSDVKSAIKN 605 + N+++ SK ++EN+ F K++ D+ + D+ A ++L+A+P+SY +K +K Sbjct: 91 KVYNYRMQDSKTLEENVDEFQKMISDLNNLQIQVPDEVQAILILSALPDSYDMLKETLKY 150 Query: 604 GGNDVAFDIIVDALKNKENDSK-HNGSGKYSSKLKHVRGRTQTRSQTDDRNESGGGKTKF 428 G + D ++ A K+KE + + +G + + +VRG++Q R ++ G Sbjct: 151 GREGIKLDDVISAAKSKELELRDSSGGSRPVGEGLYVRGKSQARGSDGPKSTEG------ 204 Query: 427 NXXXXXXXXXXXXKCYNCNEPGHLAKEC 344 C+ C + GH ++C Sbjct: 205 -----------KKVCWICGKEGHFKRQC 221 >gb|AAW57815.1| putative polyprotein [Oryza sativa Japonica Group] Length = 1109 Score = 95.5 bits (236), Expect = 5e-17 Identities = 77/327 (23%), Positives = 148/327 (45%), Gaps = 17/327 (5%) Frame = -3 Query: 961 TEEEK--DETNQLALATIILNLSDAVCRKVYHVKSAKELWDKLNDELYAQTSLPSRMFLL 788 T+++K +E L + I+ LSD + H+ AKELWD LN + + T ++++ Sbjct: 48 TDQQKQFEEATTLFVGCILSVLSDRLVEVYMHMTDAKELWDALNTK-FGATDASIDVYIM 106 Query: 787 DEFINFKLDISKDMDENLYIFNKLVQDIKLSGDKGIDKYTAAVLLNAIPESYSDVKSAIK 608 ++F ++++ ++ + E + + ++++L DK+ A ++ +P S+ +A+K Sbjct: 107 
EQFHDYRMADNRSVVEQAHEIQTMAKELELLKCVLPDKFVAGCIIAKLPPSWRSFGTALK 166 Query: 607 NGGNDVAFDIIVDAL----KNKENDSKHNGSGKYSSKLKHVRGRTQTRSQTDDRNESGGG 440 + + + + ++ +L K +E D+ G G SS + +++ + + + Sbjct: 167 HKRQEYSIEGLIASLDIEEKAREKDAASKGDGGQSSANVVHKAHNKSKGKYKAQQTTNFK 226 Query: 439 KTKFNXXXXXXXXXXXXKCYNCNEPGHLAKECTXXXXXXXXXXXXXKEHAVA------GK 278 K K N C+ C +PGHLA++C K V G Sbjct: 227 KQKKN---NNNPNQDEGTCFVCGQPGHLARKCLQRMGMKAPAGQTSKFDNVTIGSTGDGS 283 Query: 277 CY-----TIILCLSQMRQNATKCPVAGIGDICLKFDSGYEYTLKNVRHVPDLCRNFLSCA 113 Y T+++ N + V G+G + LKF SG LKNV+HVP + RN +S + Sbjct: 284 GYGRRGSTVLM------GNGSHASVHGVGTVDLKFTSGKIVQLKNVQHVPSIDRNLVSGS 337 Query: 112 SLEEDGLDGIWGKGVMKILKGSLVIFK 32 L +DG ++ + + K I K Sbjct: 338 RLTKDGFKLVFESNKVVVSKHGYFIGK 364 >gb|ABD96963.1| hypothetical protein [Cleome spinosa] Length = 408 Score = 95.5 bits (236), Expect = 5e-17 Identities = 61/204 (29%), Positives = 103/204 (50%) Frame = -3 Query: 955 EEKDETNQLALATIILNLSDAVCRKVYHVKSAKELWDKLNDELYAQTSLPSRMFLLDEFI 776 +E+ E ++ A I+L L+D V RKV ++A +W KL + L+ + SLP+RM+L+ Sbjct: 89 KERQERSRRARNLIVLALADQVLRKVISERTAFGIWRKL-ERLHIEQSLPNRMYLMQRVS 147 Query: 775 NFKLDISKDMDENLYIFNKLVQDIKLSGDKGIDKYTAAVLLNAIPESYSDVKSAIKNGGN 596 F++D S+ ++ENL IF KL+ D+ K ++Y A LLN++P +Y ++ +K Sbjct: 148 GFRMDSSRTIEENLDIFQKLLSDLHSLNVKVEEEYQAVYLLNSLPPAYEQLREVLKYSRA 207 Query: 595 DVAFDIIVDALKNKENDSKHNGSGKYSSKLKHVRGRTQTRSQTDDRNESGGGKTKFNXXX 416 ++ + + A + KE + G+ RG + +SGGGK K Sbjct: 208 TISVEEVKAAARMKELELLAQGT--------LTRGTGEGLVVKGKPEKSGGGKKK----- 254 Query: 415 XXXXXXXXXKCYNCNEPGHLAKEC 344 +C+ C + GH KEC Sbjct: 255 ----AKDQVECWYCGKKGHYKKEC 274 >ref|XP_012849888.1| PREDICTED: retrovirus-related Pol polyprotein from transposon TNT 1-94 [Erythranthe guttatus] Length = 598 Score = 94.4 bits (233), Expect = 1e-16 Identities = 45/79 (56%), Positives = 53/79 (67%) Frame = -3 Query: 238 NATKCPVAGIGDICLKFDSGYEYTLKNVRHVPDLCRNFLSCASLEEDGLDGIWGKGVMKI 59 + T V GIG +CLKF SG TLKNVRHVP L N +SCA LE+DG G WG G M I Sbjct: 49 
DGTMIVVNGIGTVCLKFVSGSVLTLKNVRHVPTLSHNLISCAVLEDDGFRGDWGDGCMNI 108 Query: 58 LKGSLVIFKAEKENNMYIC 2 +KGS +FKA + NMY+C Sbjct: 109 MKGSRYLFKALRMGNMYVC 127 >emb|CAN66576.1| hypothetical protein VITISV_016964 [Vitis vinifera] Length = 1316 Score = 92.0 bits (227), Expect = 6e-16 Identities = 78/340 (22%), Positives = 151/340 (44%), Gaps = 25/340 (7%) Frame = -3 Query: 952 EKDETNQLALATI-ILNLSDAVCRK---------VYHV----KSAKELWDKLNDELYAQT 815 E+ ETN+ +A + SD +CR +Y+V K+AKELWD L D+ Y Sbjct: 247 EEGETNKEKVAVVDAWKHSDFLCRNYMLNGLDNTLYNVYCSLKTAKELWDSL-DKKYKTE 305 Query: 814 SLPSRMFLLDEFINFKLDISKDMDENLYIFNKLVQDIKLSGDKGIDKYTAAVLLNAIPES 635 + F++ +F++FK+ SK + + ++ +I G D + A ++ +P Sbjct: 306 DAGIKKFIVGKFLDFKMIDSKTVISQVQELQVILHEIHSEGMSFSDSFQVAAVIEKLPPL 365 Query: 634 YSDVKSAIKNGGNDVAFDIIVDALK----NKENDSKHNGSGKYSSKLKHVRGRTQTRSQT 467 + D K+ +K+ ++ + ++ L+ N++++ K N S + + + +T + + Sbjct: 366 WKDFKNYLKHKRKEMNLEELIVRLRIEEDNRKSEKKGNNSMEAKANVIEQGPKTNKKRKH 425 Query: 466 DDRNESGGGKT--KFNXXXXXXXXXXXXKCYNCNEPGHLAKECTXXXXXXXXXXXXXKEH 293 D+N++ KF KCYNC + GH K H Sbjct: 426 GDQNQNQESNVAKKFK-----------GKCYNCGKTGH--KSNDYRNTKEWWVDTGATRH 472 Query: 292 AVAGK-CYTIILCLSQMRQ----NATKCPVAGIGDICLKFDSGYEYTLKNVRHVPDLCRN 128 + K ++ + Q + N++ + G G + LK SG E TL +V HVPD+C+N Sbjct: 473 ICSNKWMFSTYKPVEQNEELFMGNSSSSKIEGRGKVILKMTSGKELTLNDVLHVPDICKN 532 Query: 127 FLSCASLEEDGLDGIWGKGVMKILKGSLVIFKAEKENNMY 8 +S + L ++G ++ + K + + K + ++ Sbjct: 533 LVSGSLLSKNGFKLVFVSDKFVLTKNEMFVGKGYLSDGLF 572 >gb|AAL31047.1|AC078893_10 putative polyprotein [Oryza sativa Japonica Group] Length = 866 Score = 90.1 bits (222), Expect = 2e-15 Identities = 74/316 (23%), Positives = 141/316 (44%), Gaps = 7/316 (2%) Frame = -3 Query: 958 EEEKDETNQLALATIILNLSDAVCRKVYHVKSAKELWDKLNDELYAQTSLPSRMFLLDEF 779 ++E DE L + I+ L D + H+ AKELWD LN + + T + ++++++F Sbjct: 20 QKEFDEATTLFVGCILSVLGDRLVEVYMHMTDAKELWDALNTK-FGATDASNDLYIMEQF 78 Query: 778 INFKLDISKDMDENLYIFNKLVQDIKLSGDKGIDKYTAAVLLNAIPESYSDVKSAIKNGG 599 ++K+ ++ + E + + ++ +L DK+ ++ +P S+ +A+K+ Sbjct: 79 
HDYKMADNRSVVEQAHEIQTMAKEFELLKCVLPDKFVDGCIIAKLPPSWRGFGTALKHKR 138 Query: 598 NDVAFDIIVDAL----KNKENDSKHNGSGKYSSKLKHVRGRTQTRSQTDDRNESGGGKTK 431 + + + ++ +L K +E D G G S+ +V Q +S+ + + K Sbjct: 139 QEYSVEGLIASLDVEEKAREKDVAFKGDGGQSTA--NVVHTAQNKSKGKYKAQQTTNFKK 196 Query: 430 FNXXXXXXXXXXXXKCYNCNEPGHLAKECTXXXXXXXXXXXXXK---EHAVAGKCYTIIL 260 N C+ C +PGHLA++C + + G T+ Sbjct: 197 KNNNNNPNQDERT--CFVC-QPGHLARKCPPAGQTSKSANVTIGNTGDGSGYGNLPTVSR 253 Query: 259 CLSQMRQNATKCPVAGIGDICLKFDSGYEYTLKNVRHVPDLCRNFLSCASLEEDGLDGIW 80 + + N + V G+G + LKF SG LKNV+HVP + RN +S + L DG ++ Sbjct: 254 GSTVLMGNGSHASVHGVGMVDLKFTSGKIVQLKNVQHVPSIDRNLVSGSRLTRDGFKLVF 313 Query: 79 GKGVMKILKGSLVIFK 32 + + K I K Sbjct: 314 ESNKVVVSKHGYFIGK 329 >pir||S23319 hypothetical protein 2 - Arabidopsis thaliana retrotransposon Ta1-2 (strain Landsberg) (fragment) gi|16384|emb|CAA37924.1| unnamed protein product [Arabidopsis thaliana] Length = 1084 Score = 89.7 bits (221), Expect = 3e-15 Identities = 50/180 (27%), Positives = 95/180 (52%), Gaps = 3/180 (1%) Frame = -3 Query: 874 HVKSAKELWDKLNDELYAQTSLPSRMFLLDEFINFKLDISKDMDENLYIFNKLVQDIKLS 695 + KSA E+W+ LN + Y +TSLP+R+++ +F +FK++ SK ++EN+ F K+V ++ Sbjct: 8 YCKSAAEMWETLNKQ-YMETSLPNRIYVQLKFYSFKMNDSKSINENVNEFLKIVAELSSL 66 Query: 694 GDKGIDKYTAAVLLNAIPESYSDVKSAIKNGGNDVAFDIIVDALKNKE---NDSKHNGSG 524 +++ A + LN + YS +K +K G ++ ++ + ++ E ++ K Sbjct: 67 EINVVEEVRAILFLNGLSSRYSQLKHTLKYGNKALSLQDVISSARSLERELDEQKETDKN 126 Query: 523 KYSSKLKHVRGRTQTRSQTDDRNESGGGKTKFNXXXXXXXXXXXXKCYNCNEPGHLAKEC 344 + + RGR QTR+Q ++ G G +K N C+ C + GH+ K+C Sbjct: 127 TSTVLYTNERGRPQTRNQNQNKEGQGRGISKSN-------SNAKLTCWYCKKEGHVKKDC 179 >emb|CAN84102.1| hypothetical protein VITISV_041248 [Vitis vinifera] Length = 894 Score = 89.7 bits (221), Expect = 3e-15 Identities = 82/349 (23%), Positives = 147/349 (42%), Gaps = 37/349 (10%) Frame = -3 Query: 958 EEEKDETNQLALATIILNLSDAVCRKVYHVKSAKELWDKLNDELYAQTSLPSRMFLLDEF 779 E+ K E + A IIL+L D R+V KSA +L KL + LY SL +R+ + Sbjct: 49 
EKHKIELLEKAHGAIILSLGDTXLREVAKAKSAAKLLLKL-ESLYMTKSLANRLHKXIKL 107 Query: 778 INFKLDISKDMDENLYIFNKLVQDIKLSGDKGIDKYTAAVLLNAIPESYSDVKSAIKNGG 599 FK+ S ++E+L FNK++ D+K ++ A +LL ++ SY+++K AI G Sbjct: 108 YTFKMTPSMSIEEHLDHFNKIILDLKNIDIAVSNEDKAILLLTSLDASYTNMKEAIMYGR 167 Query: 598 NDVAFDIIVDALKNKENDSKHNGSGKYSSKLKHVRGRTQTRSQTDDRNESGGGKTKFNXX 419 + + FD + L +E + + L ++RG+++ R + N K+K Sbjct: 168 DILTFDEVQSILHARELHKQEESKEELGEGL-NIRGKSKKREKKKGNNSKSRSKSK---- 222 Query: 418 XXXXXXXXXXKCYNCNEPGHLAKECTXXXXXXXXXXXXXKEHAVAGKCYTIILCLSQMRQ 239 KC+ C++ GH K+C + + Y L+ Sbjct: 223 ------TKKFKCFICHKEGHFKKDCPDMRQNTXKKTMNEGDATMILDGYDNAGVLNVAEV 276 Query: 238 NATK------------CP-------------------------VAGIGDICLKFDSGYEY 170 ++ K CP + G G + +K G E Sbjct: 277 DSGKEWILDSGCSFHMCPIKAWFEDFKEANGGHVLLGNNKHCKILGTGTVKIKHYDGIER 336 Query: 169 TLKNVRHVPDLCRNFLSCASLEEDGLDGIWGKGVMKILKGSLVIFKAEK 23 L+++R++P+L N +S L++ G +++ +GSL + K ++ Sbjct: 337 VLEDIRYIPELKMNLISLGMLDKLGYTFKSEPNSLRVARGSLTVMKHQR 385 >ref|XP_013614156.1| PREDICTED: uncharacterized protein LOC106320348 [Brassica oleracea var. 
oleracea] Length = 453 Score = 89.4 bits (220), Expect = 4e-15 Identities = 54/172 (31%), Positives = 93/172 (54%) Frame = -3 Query: 949 KDETNQLALATIILNLSDAVCRKVYHVKSAKELWDKLNDELYAQTSLPSRMFLLDEFINF 770 K E ++ A I+LN+ + V RK+ H ++A +W LN+ LY +TSLP+R++L F F Sbjct: 55 KFEKSEKAKDLIVLNVGNKVLRKIQHCETAVAMWSTLNN-LYTETSLPNRIYLQLRFYTF 113 Query: 769 KLDISKDMDENLYIFNKLVQDIKLSGDKGIDKYTAAVLLNAIPESYSDVKSAIKNGGNDV 590 K+ SK +D N+ F K V D+ + +++ A +LL+++P Y +K +K G + + Sbjct: 114 KMVDSKSIDVNVDDFLKFVADLNNLQVEVLEEVQAILLLSSLPAKYDQLKETLKYGRDTL 173 Query: 589 AFDIIVDALKNKENDSKHNGSGKYSSKLKHVRGRTQTRSQTDDRNESGGGKT 434 + + A K+KE + +G S G R +++ R+ G GKT Sbjct: 174 SLAEVTGAAKSKERELIESGKFTRSGG----EGLLVDRGRSEQRSGKGNGKT 221 >gb|KHN28812.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Glycine soja] Length = 257 Score = 88.2 bits (217), Expect = 9e-15 Identities = 62/214 (28%), Positives = 103/214 (48%), Gaps = 8/214 (3%) Frame = -3 Query: 961 TEEEKDETNQLALATIILNLSDAVCRKVYHVKSAKELWDKLNDELYAQTSLPSRMFLLDE 782 + +EK E A + IIL L D R+V ++ +W KL + LY SL +R+ L Sbjct: 45 SSKEKSEMIAKARSAIILCLGDKALREVARERTVVSMWLKL-ESLYMTKSLANRLCLRQH 103 Query: 781 FINFKLDISKDMDENLYIFNKLVQDIKLSGDKGIDKYTAAVLLNAIPESYSDVKSAIKNG 602 FK+ S+ E L FNK++ D++ K ++ A +LLN++P+S+ K AI G Sbjct: 104 LYTFKMIESRTTTEQLVDFNKIIDDLENIEVKLEEEDKALLLLNSLPKSFEHFKDAILYG 163 Query: 601 -GNDVAFDIIVDALKNKE----NDSKHNGSGK---YSSKLKHVRGRTQTRSQTDDRNESG 446 D+ + + +++ KE DSK +G+ S +G + +S++ + Sbjct: 164 KDQDITLEEVQTSIRTKEMQKQQDSKSEDNGESLNISRGRNEKKGTRRKKSRSRSTDSKN 223 Query: 445 GGKTKFNXXXXXXXXXXXXKCYNCNEPGHLAKEC 344 G KTKF C+NC++ GH K+C Sbjct: 224 GQKTKFK-------------CFNCHKTGHFKKDC 244