BLASTX nr result
ID: Mentha29_contig00009729
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha29_contig00009729 (1694 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_006582615.1| PREDICTED: uncharacterized protein LOC102665...   303   1e-79
ref|XP_006584238.1| PREDICTED: uncharacterized protein LOC102664...   284   1e-73
emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulga...   274   8e-71
ref|XP_006579104.1| PREDICTED: uncharacterized protein LOC102661...   260   1e-66
gb|AAD08951.1| putative reverse transcriptase [Arabidopsis thali...   256   2e-65
ref|XP_006605183.1| PREDICTED: uncharacterized protein LOC102663...   254   8e-65
emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulga...   248   6e-63
ref|XP_004293181.1| PREDICTED: uncharacterized protein LOC101298...   243   1e-61
gb|AAF87143.1|AC002423_8 T23E23.16 [Arabidopsis thaliana]             238   5e-60
ref|XP_004253224.1| PREDICTED: uncharacterized protein LOC101268...   233   2e-58
gb|AAC63678.1| putative non-LTR retroelement reverse transcripta...   230   1e-57
gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]       227   1e-56
gb|AAG50886.1|AC025294_24 hypothetical protein [Arabidopsis thal...   221   8e-55
gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transc...   219   3e-54
gb|AAC95175.1| putative non-LTR retroelement reverse transcripta...   211   6e-52
ref|XP_006579213.1| PREDICTED: uncharacterized protein LOC102670...   209   3e-51
gb|AAF98181.1|AC000107_4 F17F8.5 [Arabidopsis thaliana]               208   7e-51
dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like ...   207   9e-51
dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like ...
207 1e-50 dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana] 207 1e-50 >ref|XP_006582615.1| PREDICTED: uncharacterized protein LOC102665746 [Glycine max] Length = 506 Score = 303 bits (777), Expect = 1e-79 Identities = 167/458 (36%), Positives = 249/458 (54%), Gaps = 7/458 (1%) Frame = +2 Query: 2 NGGSHGFVRGQRGLRQGDPMSPTLFLLCMEYLSHLIHARTHDSTFMHHPKCSTTDTTHLA 181 NG + +RGLRQGDP+SP LF++ ME L+ ++ D F +HPKC T+L Sbjct: 14 NGYKTEIMGARRGLRQGDPISPMLFVIVMECLNRYLYKMQKDGDFNYHPKCDKLKITNLC 73 Query: 182 FADDLLLFGRGDPDSMRVLRDALDEFTATSGLTINKSKSHIFLGGVRPFEKRAILELFGF 361 FADDLLLF RGD S+ ++ A + F+ +GL +N K + G+ KR ILE+ GF Sbjct: 74 FADDLLLFSRGDKISVGMMMRAYESFSKATGLLVNPQKCSLLCAGIDAVTKREILEVSGF 133 Query: 362 PEGTLPVKYLGLPLASKSLTTLDYSLLLAQISNFI*RWSNSNLSRVGRLELIRSVLQGVE 541 EG LP KYLG+P+ SK L+T+ YS L+ +I I W+ LS GRL+L+ SV+ + Sbjct: 134 QEGQLPFKYLGVPVTSKKLSTIHYSPLIDKIVGKIKHWTARLLSYAGRLQLVNSVMFALT 193 Query: 542 CYWLQVLPLQGTVIATITKMLRKFLWC-----DSQCPVSWKTVCLPRDEGGLGLRDLAVW 706 YWL P +V+ I + R FLW + PV+WK +C PR GGL + D+ +W Sbjct: 194 NYWLNCFPFPKSVLQKIEAICRIFLWTGGFEGSRKSPVAWKQICSPRSCGGLNIIDIDIW 253 Query: 707 NKALHSKTLWNIHAKADSLWIKWIHAEYLRGRDIWEFPYPRRDAPHMTNILRIRDQLIFD 886 NKA K LWN+ +K DSLW+KWI A Y++ ++ D+ M IL+ R+ L Sbjct: 254 NKANLMKLLWNLSSKEDSLWVKWIQAYYVKRSELMHIEMKNTDSWIMKAILKQREDL--- 310 Query: 887 CEGNLNDAKAKLVGWFAGKGTSEAYEHFRAKGEKKFWYKAIWRSYIPPKFSVILWLAMHG 1066 +++ + ++ G + Y + G++K W ++ + P+ + ILWLA HG Sbjct: 311 --EKIDNMEELMIRGSINMG--KLYRKLQDCGQRKEWKNLLYGNTARPRANFILWLACHG 366 Query: 1067 RLKTFDRL-KHSDI-ARGCVLCESADETHDHLFFKCDKAMAVWSGICSWLRCRNQMTTIP 1240 RL T DRL K+ I + C C S +E+ +HLFF CD + VW + W++ R+ + P Sbjct: 367 RLSTKDRLCKYGMIDDKSCCFC-SEEESMNHLFFVCDNSKRVWMEVLQWVQIRHDPSDWP 425 Query: 1241 SAVRRFQREKAGSGIIRKAKWVALGATVQYLWQARNLK 1354 + + G G +A+ T+ +W RN K Sbjct: 426 NELHWLTHHTKGKGTRAAVLKMAIAETIYEIWNIRNNK 463 >ref|XP_006584238.1| PREDICTED: uncharacterized protein LOC102664824 [Glycine max] Length = 939 Score = 284 bits (726), Expect = 1e-73 Identities = 159/457 (34%), 
Positives = 243/457 (53%), Gaps = 8/457 (1%) Frame = +2 Query: 2 NGGSHGFVRGQRGLRQGDPMSPTLFLLCMEYLSHLIHARTHDSTFMHHPKCSTTDTTHLA 181 NG + +RG+RQGDP+SP LF+L MEYL+ ++ F +H KC T+L Sbjct: 456 NGRFTRRLEARRGIRQGDPISPLLFILVMEYLNRILSQLDKIPNFNYHSKCEKMKITNLC 515 Query: 182 FADDLLLFGRGDPDSMRVLRDALDEFTATSGLTINKSKSHIFLGGVRPFEKRAILELFGF 361 FADDLLLF RGD S++++ D + F + GL +N SK +I+ G V K +L + GF Sbjct: 516 FADDLLLFSRGDIGSVQIMLDKFNTFLRSMGLHVNPSKCNIYCGSVDINVKEQLLLISGF 575 Query: 362 PEGTLPVKYLGLPLASKSLTTLDYSLLLAQISNFI*RWSNSNLSRVGRLELIRSVLQGVE 541 EG +P +YLG+PL+SK L Y +L+ +I I WS LS GR++LI+SV+ Sbjct: 576 KEGKMPFRYLGIPLSSKKLNIKHYQVLIDKIVGRITHWSAGLLSYAGRVQLIQSVIFATI 635 Query: 542 CYWLQVLPLQGTVIATITKMLRKFLWCDS-----QCPVSWKTVCLPRDEGGLGLRDLAVW 706 +W+Q LPL VI I + R FLW + + P++W+ VC P+ GGL + +LA+W Sbjct: 636 NFWMQCLPLPKFVIMRINAICRSFLWIGNSNISRKSPIAWEKVCSPKINGGLNIINLAIW 695 Query: 707 NKALHSKTLWNIHAKADSLWIKWIHAEYLRGRDIWEFPYPRRDAPHMTNILRIRDQLIFD 886 NK K LWN+ K+D+LWIKW+H Y+RG+ IW + + M++++++R L+ Sbjct: 696 NKISILKLLWNVCNKSDNLWIKWLHTYYIRGQSIWSMVLKKSHSWIMSSMMKLR-PLLLQ 754 Query: 887 CEGNLNDA-KAKLVGWFAGKGTSEAYEHFRAKGEKKFWYKAIWRSYIPPKFSVILWLAMH 1063 + + D K K + Y + EK W + + P+ LW A H Sbjct: 755 YQSRMQDVFKMKKI-----------YLALFEESEKMSWRTLMCNNLARPRALFCLWQACH 803 Query: 1064 GRLKTFDRLKH--SDIARGCVLCESADETHDHLFFKCDKAMAVWSGICSWLRCRNQMTTI 1237 RL + DRL ++ C C S E+H+HLFF C + +W+ + +WL+ + +T Sbjct: 804 FRLASKDRLIKFGLNVDANCAFCSSM-ESHEHLFFGCIELKTIWTAVLNWLQIIHMPSTW 862 Query: 1238 PSAVRRFQREKAGSGIIRKAKWVALGATVQYLWQARN 1348 + R+ G G A T+ ++W RN Sbjct: 863 SEELNWITRKCKGKGWRAMLLKCAFTETIYHIWAYRN 899 >emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. 
vulgaris] Length = 1110 Score = 274 bits (701), Expect = 8e-71 Identities = 165/470 (35%), Positives = 239/470 (50%), Gaps = 11/470 (2%) Frame = +2 Query: 26 RGQRGLRQGDPMSPTLFLLCMEYLSHLIHARTHDSTFMHHPKCSTTDTTHLAFADDLLLF 205 + ++GLRQGDPMSP LF LCMEYLS + F HPKC + THL FADDLL+F Sbjct: 636 QARKGLRQGDPMSPFLFALCMEYLSRCLEELKGSPDFNFHPKCERLNITHLMFADDLLMF 695 Query: 206 GRGDPDSMRVLRDALDEFTATSGLTINKSKSHIFLGGVRPFEKRAILELFGFPEGTLPVK 385 R D S+ + A +F+ SGL + KS+I+ GV R + + G LP + Sbjct: 696 CRADKSSLDHMNVAFQKFSHASGLAASHEKSNIYFCGVDDETARELADYVHMQLGELPFR 755 Query: 386 YLGLPLASKSLTTLDYSLLLAQISNFI*RWSNSNLSRVGRLELIRSVLQGVECYWLQVLP 565 YLG+PL SK LT L+ I+N W LS GRL+LI+S+L ++ YW + P Sbjct: 756 YLGVPLTSKKLTYAQCKPLVEMITNRAQTWMAKLLSYAGRLQLIKSILSSMQNYWAHIFP 815 Query: 566 LQGTVIATITKMLRKFLWC-----DSQCPVSWKTVCLPRDEGGLGLRDLAVWNKALHSKT 730 L VI + K+ RKFLW + PV+W T+ P+ GG + ++ WN+A K Sbjct: 816 LSKKVIQAVEKVCRKFLWTGKTEETKKAPVAWATIQRPKSRGGWNVINMKYWNRAAMLKL 875 Query: 731 LWNIHAKADSLWIKWIHAEYLRGRDIWEFPYPRRDAPHMTNILRIRDQLIFDCEGNLNDA 910 LW I K D LW++WIH+ Y++ +DI + + I++ RD L N+ D Sbjct: 876 LWAIEFKRDKLWVRWIHSYYIKRQDILTVNISNQTTWILRKIVKARDHL-----SNIGDW 930 Query: 911 KAKLVG-WFAGKGTSEAYEHFRAKGEKKFWYKAIWRSYIPPKFSVILWLAMHGRLKTFDR 1087 +G F+ K +AY+ GE+ W + I +Y PK ILW+ +H RL T DR Sbjct: 931 DEICIGDKFSMK---KAYKKISENGERVRWRRLICNNYATPKSKFILWMMLHERLPTVDR 987 Query: 1088 LKHSDIA--RGCVLCESADETHDHLFFKCDKAMAVWSGICSWLRCRNQMTT---IPSAVR 1252 + + LC + ET HLFF C + VWS IC +R N + I S+V Sbjct: 988 ISRWGVQCDLNYRLCRNDGETIQHLFFSCSYSAGVWSKICYIMRFPNSGVSHQEIISSVC 1047 Query: 1253 RFQREKAGSGIIRKAKWVALGATVQYLWQARNLKYVEKKPFEASHVIKEI 1402 R+K G I+ + V +W+ RN + + + + V+++I Sbjct: 1048 GQARKKKGKLIV-----MLYTEFVYAIWKQRNKRTFTGENKDENEVLRKI 1092 >ref|XP_006579104.1| PREDICTED: uncharacterized protein LOC102661523 [Glycine max] Length = 947 Score = 260 bits (665), Expect = 1e-66 Identities = 140/404 (34%), Positives = 221/404 (54%), Gaps = 12/404 (2%) Frame = +2 Query: 23 VRGQRGLRQGDPMSPTLFLLCMEYLSHLIHARTHDSTFMHHPKCSTTDTTHLAFADDLLL 202 + 
+RG+RQGDP+SP LF++ MEYL+ L+ D F HH KC THL FADD+LL Sbjct: 463 IAAKRGIRQGDPISPLLFVVMMEYLNRLLVKLQLDLNFNHHAKCEKLGITHLTFADDVLL 522 Query: 203 FGRGDPDSMRVLRDALDEFTATSGLTINKSKSHIFLGGVRPFEKRAILELFGFPEGTLPV 382 F RGD S+ ++ +++F+AT+GL +N +K I+ GGV K I ++ + EG LPV Sbjct: 523 FCRGDVMSVEMMLHVINKFSATTGLVVNPNKCRIYFGGVDGTTKNKIQQISSYEEGQLPV 582 Query: 383 KYLGLPLASKSLTTLDYSLLLAQISNFI*RWSNSNLSRVGRLELIRSVLQGVECYWLQVL 562 +YLG+PL SK L Y L+ +I+ I W++ L+ GR++++ + + +W+Q L Sbjct: 583 RYLGVPLTSKKLNIKYYLPLIDKITTRIRHWTSKLLNMTGRVQMVNCTITAIVQFWMQCL 642 Query: 563 PLQGTVIATITKMLRKFLWCDS-----QCPVSWKTVCLPRDEGGLGLRDLAVWNKALHSK 727 P+ +VI I M R F+W S + P++W +VC P+ +GGL + +L VWN Sbjct: 643 PIPMSVIKKIDSMCRSFVWSRSTEITRKSPIAWNSVCRPKGQGGLNIFNLKVWNHITVLN 702 Query: 728 TLWNIHAKADSLWIKWIHAEYLRGRDIWEFPYPRRDAPHMTNILRIRD-----QLIFDCE 892 LWN+ K D+LW+KWIHA Y++ + + + N+L R+ Q ++D Sbjct: 703 CLWNLCKKVDNLWVKWIHAHYIKNSSVMNTMVTNNFSWVLKNVLSQREYIHTLQPVWD-- 760 Query: 893 GNLNDAKAKLVGWFAGKGTSEAYEHFRAKGEKKFWYKAIWRSYIPPKFSVILWLAMHGRL 1072 LN + K+ +AY+ + ++ W + ++ P+ WLA HGRL Sbjct: 761 ELLNSERFKM---------KKAYDKM-MEADRVHWSGLMRKNCARPRAIHTTWLACHGRL 810 Query: 1073 KTFDRLKHSDIARGCV--LCESADETHDHLFFKCDKAMAVWSGI 1198 T DRL + + LC+ +ET +H+ F C A +WS + Sbjct: 811 GTKDRLVRFGMITDKIWSLCKEVEETQNHILFSCKVATDIWSNV 854 >gb|AAD08951.1| putative reverse transcriptase [Arabidopsis thaliana] gi|20197043|gb|AAM14892.1| putative reverse transcriptase [Arabidopsis thaliana] Length = 1412 Score = 256 bits (655), Expect = 2e-65 Identities = 163/469 (34%), Positives = 232/469 (49%), Gaps = 14/469 (2%) Frame = +2 Query: 38 GLRQGDPMSPTLFLLCMEYLSHLIHARTHDSTFMHHPKCSTTDTTHLAFADDLLLFGRGD 217 GLRQG +SP LF++CM LS ++ + F +HP+C THL FADD+++F G Sbjct: 910 GLRQGCSLSPYLFVICMNVLSAMLDKGAVEKRFGYHPRCRNMGLTHLCFADDIMVFSAGS 969 Query: 218 PDSMRVLRDALDEFTATSGLTINKSKSHIFLGGVRPFEKRAILELFGFPEGTLPVKYLGL 397 S+ + +F A SGL I+ KS +F+ + +IL F F G+LPV+YLGL Sbjct: 970 AHSLEGVLAIFKDFAAFSGLNISLEKSTLFMASISSETCASILARFPFDSGSLPVRYLGL 1029 Query: 398 
PLASKSLTTLDYSLLLAQISNFI*RWSNSNLSRVGRLELIRSVLQGVECYWLQVLPLQGT 577 PL +K +T D LL +I + I W N LS GRL+L+ SV+ + +W+ L Sbjct: 1030 PLMTKRMTLADCLPLLEKIRSRISSWKNRFLSYAGRLQLLNSVISSLTKFWISAFRLPRA 1089 Query: 578 VIATITKMLRKFLWCDS-----QCPVSWKTVCLPRDEGGLGLRDLAVWNKALHSKTLWNI 742 I I ++ FLW + + V+W VC P+ EGGLGLR L NK K +W + Sbjct: 1090 CIREIEQISAAFLWSGTDLNPHKAKVAWHDVCKPKSEGGLGLRSLVDANKICCFKLIWRL 1149 Query: 743 HAKADSLWIKWIHAEYLRGRDIWEFPYPRRDAPHMTNILR-IRDQL-IFDCEGNLNDAKA 916 + SLW+ WI + R + E R H +IL I ++L C G + Sbjct: 1150 VSAKHSLWVNWIQNNLI--RTVAEALSSHRRRSHRDDILNDIEEELEKLLCRGICTEQDR 1207 Query: 917 KLV----GWFAGKGTS-EAYEHFRAKGEKKFWYKAIWRSYIPPKFSVILWLAMHGRLKTF 1081 L G F K S E + R +G K W+KAIW S PKF+ I WLA H RL T Sbjct: 1208 SLCRSIGGQFKAKFFSPEIWHQIREQGLVKQWHKAIWFSGATPKFTFISWLAAHDRLTTG 1267 Query: 1082 DRLK--HSDIARGCVLCESADETHDHLFFKCDKAMAVWSGICSWLRCRNQMTTIPSAVRR 1255 D++ + I+ CVLC + E+ DHLFF C+ + +W + L T P+ + Sbjct: 1268 DKMASWNRGISSVCVLCNISAESRDHLFFSCNFSSHIWDRLTRRLLLCRYTTNFPALLLL 1327 Query: 1256 FQREKAGSGIIRKAKWVALGATVQYLWQARNLKYVEKKPFEASHVIKEI 1402 + SG R AT+ LW+ RN + P + H+IK I Sbjct: 1328 LSGQDF-SGTKRFLLRYVFQATIHTLWRERNKRRHGDLPIPSDHIIKFI 1375 >ref|XP_006605183.1| PREDICTED: uncharacterized protein LOC102663533 [Glycine max] Length = 514 Score = 254 bits (649), Expect = 8e-65 Identities = 141/405 (34%), Positives = 217/405 (53%), Gaps = 9/405 (2%) Frame = +2 Query: 2 NGGSHGFVRGQRGLRQGDPMSPTLFLLCMEYLSHLIHARTHDSTFMHHPKCSTTDTTHLA 181 NG + + G+ QGDP+SP LF+L MEY + ++ + +F HH +C THL+ Sbjct: 117 NGELSNVLETKIGIWQGDPISPLLFVLMMEYFNRIMVKMQRNPSFNHHSQCERLGITHLS 176 Query: 182 FADDLLLFGRGDPDSMRVLRDALDEFTATSGLTINKSKSHIFLGGVRPFEKRAILELFGF 361 FADD+ L RGD S++++ A F+ ++GL IN +K +F GG+ + I ++ GF Sbjct: 177 FADDVFLLCRGDKKSIKMIIKAFSFFSKSTGLQINPAKCKVFCGGLNCDSIQVITKITGF 236 Query: 362 PEGTLPVKYLGLPLASKSLTTLDYSLLLAQISNFI*RWSNSNLSRVGRLELIRSVLQGVE 541 EGTLPV+YLG+PL+ K L Y L+ +I I WS+ LS GR++L+RS++ + Sbjct: 237 EEGTLPVRYLGVPLSCKKLNVHHYLPLVEKIVGKIRHWSSKLLSIAGRIQLVRSIITAIA 296 Query: 542 
CYWLQVLPLQGTVIATITKMLRKFLWCDS-----QCPVSWKTVCLPRDEGGLGLRDLAVW 706 YW+ V P+ VI I + R F+W S + V+WK VC P GGL L +L +W Sbjct: 297 QYWMSVFPMPKKVIQKIDSICRSFIWSGSAEVKRKSLVAWKQVCKPARCGGLNLINLELW 356 Query: 707 NKALHSKTLWNIHAKADSLWIKWIHAEYLRGRDIWEFPYPRRDAPHMTNILRIRDQLIFD 886 N K LWNI +K D+LW+KWIHA +L+G ++ + ++++ R Q Sbjct: 357 NVTAMLKCLWNICSKEDNLWVKWIHAYFLKGDNVMSATIKSNSTWILKSVMKQRPQ---- 412 Query: 887 CEGNLNDAKAKLVGWFAGKGTS--EAYEHFRAKGEKKFWYKAIWRSYIPPKFSVILWLAM 1060 +N+ + + + S + Y K W++ + + P+ +V LWLA Sbjct: 413 ----VNNLQLVWIEMLRKRKFSMKQVYMELVEDHNKIDWFRLLRYNRARPRANVTLWLAC 468 Query: 1061 HGRLKTFDRLKHSDIARG--CVLCESADETHDHLFFKCDKAMAVW 1189 RL T RLK+ ++ + C LC+ DE DHL F C A+W Sbjct: 469 QNRLATKTRLKNMNMIQCSLCSLCKEQDEDLDHLMFSCRVTKAIW 513 >emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulgaris] Length = 1114 Score = 248 bits (633), Expect = 6e-63 Identities = 154/470 (32%), Positives = 228/470 (48%), Gaps = 13/470 (2%) Frame = +2 Query: 32 QRGLRQGDPMSPTLFLLCMEYLSHLIHARTHDSTFMHHPKCSTTDTTHLAFADDLLLFGR 211 Q+GLRQGDP+SP LF L MEYLS + D F HPKC THL FADDLL+F R Sbjct: 641 QKGLRQGDPLSPFLFALSMEYLSRCMGNMCKDPEFNFHPKCERIKLTHLMFADDLLMFAR 700 Query: 212 GDPDSMRVLRDALDEFTATSGLTINKSKSHIFLGGVRPFEKRAILELFGFPEGTLPVKYL 391 D S+ + A + F+ SGL + KS I+ GGV E + + P G+LP +YL Sbjct: 701 ADASSISKIMAAFNSFSKASGLQASIEKSCIYFGGVCHEEAEQLADRIQMPIGSLPFRYL 760 Query: 392 GLPLASKSLTTLDYSLLLAQISNFI*RWSNSNLSRVGRLELIRSVLQGVECYWLQVLPLQ 571 G+PLASK L L+ +I+ W LS GRL+L++++L ++ YW Q+ PL Sbjct: 761 GVPLASKKLNFSQCKPLIDKITTRAQGWVAHLLSYAGRLQLVKTILYSMQNYWGQIFPLP 820 Query: 572 GTVIATITKMLRKFLWCDS-----QCPVSWKTVCLPRDEGGLGLRDLAVWNKALHSKTLW 736 +I + RKFLW + + PV+W + P+ GGL + ++ +WNKA K LW Sbjct: 821 KKLIKAVETTCRKFLWTGTVDTSYKAPVAWDFLQQPKSTGGLNVTNMVLWNKAAILKLLW 880 Query: 737 NIHAKADSLWIKWIHAEYLRGRDIWEFPYPRRDAPHMTNILRIRDQLIFDCEGNLNDAKA 916 I K D LW++W++A Y++ ++I + + I R+ L Sbjct: 881 AITFKQDKLWVRWVNAYYIKRQNIENVTVSSNTSWILRKIFESRELL------------T 928 Query: 917 KLVGWFA-----GKGTSEAYEHFRAKGEKKFWYKAIWRSYIPPKFSVILWLAMHGRLKTF 1081 + 
GW A + Y+ + E W + I + PK ILWLAM RL T Sbjct: 929 RTGGWEAVSNHMNFSIKKTYKLLQEDYENVVWKRLICNNKATPKSQFILWLAMLNRLATA 988 Query: 1082 DRLK--HSDIARGCVLCESADETHDHLFFKCDKAMAVWSGICSWLRCRNQMTTIPSAVRR 1255 +R+ + D++ C +C + ET HLFF C + +W + +L + Q A + Sbjct: 989 ERVSRWNRDVSPLCKMCGNEIETIQHLFFNCIYSKEIWGKVLLYLNLQPQADA--QAKKE 1046 Query: 1256 FQREKAGSGIIRKAKWVAL-GATVQYLWQARNLKYVEKKPFEASHVIKEI 1402 +KA S R +V + +V +W RN K + +K I Sbjct: 1047 LAIKKARSTKDRNKLYVMMFTESVYAIWLLRNAKVFRGIEINQNQAVKSI 1096 >ref|XP_004293181.1| PREDICTED: uncharacterized protein LOC101298394 [Fragaria vesca subsp. vesca] Length = 958 Score = 243 bits (621), Expect = 1e-61 Identities = 160/475 (33%), Positives = 235/475 (49%), Gaps = 8/475 (1%) Frame = +2 Query: 2 NGGSHGFVRGQRGLRQGDPMSPTLFLLCMEYLSHLIHARTHDST-FMHHPKCSTTDTTHL 178 NG GF +RGLRQGDP+SP LF++ ME LS I R + S F +H +C + +HL Sbjct: 464 NGELAGFFARRRGLRQGDPLSPYLFVIAMEVLSLCIQRRINCSPCFRYHWRCDQLNLSHL 523 Query: 179 AFADDLLLFGRGDPDSMRVLRDALDEFTATSGLTINKSKSHIFLGGVRPFEKRAILELFG 358 FADDLL+F GD +S+R L DA F + S L N S+S IFL GV ++L++ Sbjct: 524 CFADDLLMFCNGDENSVRTLHDAFSNFESLSSLKANVSESKIFLAGVDGNSSDSVLQVTN 583 Query: 359 FPEGTLPVKYLGLPLASKSLTTLDYSLLLAQISNFI*RWSNSNLSRVGRLELIRSVLQGV 538 F GT PV+YLG+PL + L D S LL +I I W N LS GRL+LI+SVL + Sbjct: 584 FSLGTCPVRYLGIPLITSKLRMQDCSPLLDRIETRIKSWENKVLSFAGRLQLIQSVLSSI 643 Query: 539 ECYWLQVLPLQGTVIATITKMLRKFLW---CDSQC--PVSWKTVCLPRDEGGLGLRDLAV 703 + YW L L V+ I K LR FLW C + V+W +CLP+ EGGLG++DL Sbjct: 644 QVYWASHLILPKKVLKDIEKRLRCFLWAGNCSGRAATKVAWSEICLPKCEGGLGIKDLHC 703 Query: 704 WNKALHSKTLWNIHAKADSLWIKWIHAEYLRGRDIWEFPYPRRDAPHMTNILRIRDQLIF 883 WNKAL +WN+ + + + W W+ L+G W P P + + +L+IR+ Sbjct: 704 WNKALMISHIWNLVSSSSNFWTDWVKVYLLKGNSFWNAPLPSICSWNWRKLLKIRE---L 760 Query: 884 DCEGNLNDAKAKLVGWFAGKGTSEAYEHFRAKGEKKF-WYKAIWRSYIPPKFSVILWLAM 1060 C +N ++G G+ TS ++++ G W I K +++ Sbjct: 761 CCSFFVN-----IIG--DGRATSLWFDNWHPLGPLTLRWSSNIIGESGLSKSAMLTPNGF 813 Query: 1061 HGRLKTFDRLKHSD-IARGCVLCESADETHDHLFFKCDKAMAVWSGICSWLRCRNQMTTI 1237 + ++ L+ S I L 
ETH+HLFF C + +W+ + S + Sbjct: 814 YSTSSAWNTLRPSRFIVPWYRLVWFVAETHNHLFFDCAYSFGIWTHVLSKCDVSKPLLPW 873 Query: 1238 PSAVRRFQREKAGSGIIRKAKWVALGATVQYLWQARNLKYVEKKPFEASHVIKEI 1402 + G+ + +AL A V +W+ RN + + + V K I Sbjct: 874 SDFIFWVATNWKGNSLPVVILKLALQAVVYAIWRERNNRRFRNESLPPAVVFKGI 928 >gb|AAF87143.1|AC002423_8 T23E23.16 [Arabidopsis thaliana] Length = 653 Score = 238 bits (608), Expect = 5e-60 Identities = 155/474 (32%), Positives = 223/474 (47%), Gaps = 10/474 (2%) Frame = +2 Query: 2 NGGSHGFVRGQRGLRQGDPMSPTLFLLCMEYLSHLIHARTHDSTFMHHPKCSTTDTTHLA 181 NG GF + +RGLRQG +SP LF++ M+ LS L+ F +H +C THL+ Sbjct: 170 NGELVGFFQSKRGLRQGCSLSPYLFVMSMDVLSKLLDQAASAKKFGYHSRCKELSLTHLS 229 Query: 182 FADDLLLFGRGDPDSMRVLRDALDEFTATSGLTINKSKSHIFLGGVRPFEKRAILELFGF 361 FADDL++ G S+ + + D F SGL I+ KS I+L GV I + F Sbjct: 230 FADDLMVLSDGKVRSIDGIVEVFDIFAKFSGLKISMEKSTIYLAGVTEDVYHEIQNRYQF 289 Query: 362 PEGTLPVKYLGLPLASKSLTTLDYSLLLAQISNFI*RWSNSNLSRVGRLELIRSVLQGVE 541 G LPV+YLGLPL +K LT DYS LL I I W+ LS GRL LI SVL + Sbjct: 290 DVGQLPVRYLGLPLVTKRLTATDYSPLLEHIKKKIGTWTTRYLSYAGRLNLITSVLWSIC 349 Query: 542 CYWLQVLPLQGTVIATITKMLRKFLWC-----DSQCPVSWKTVCLPRDEGGLGLRDLAVW 706 +WL L I I K+ FLW + V W VC P+ EGGLGLR L Sbjct: 350 NFWLAAFRLPRECIREIDKICSAFLWSGPDLNPRKTRVCWGDVCKPKQEGGLGLRSLKEM 409 Query: 707 NKALHSKTLWNIHAKADSLWIKWIHAEYLRGRDIWEFPYPRRDAPHMTNILRIRDQLIFD 886 N+ K +W I + +SLW++WI L+ W + +M ++L Sbjct: 410 NEVSCLKLIWRIVSHTNSLWVRWIEQYLLKHDTFWSV----QTTTNMDSVL--------- 456 Query: 887 CEGNLNDAKAKLVGWFAGKGTSEAYEHFRAKGEKKFWYKAIWRSYIPPKFSVILWLAMHG 1066 G ++ K T + + R W+ IW ++ PKFS WLA+ Sbjct: 457 WRGRNDEYMPKF-------STRDTWNQTRNTSTPVTWHMGIWFAHATPKFSFCAWLAVQN 509 Query: 1067 RLKTFDRLK--HSDIARGCVLCESADETHDHLFFKCDKAMAVWSGICSWL---RCRNQMT 1231 RL T D++ + ++ CVLC + ET +HLFF C +W + + + + Sbjct: 510 RLSTGDKMLQWNRRLSPTCVLCNNNIETRNHLFFSCCYTAEIWENLAKNIYKAKFSTNWS 569 Query: 1232 TIPSAVRRFQREKAGSGIIRKAKWVALGATVQYLWQARNLKYVEKKPFEASHVI 1393 TI ++V R + S + R AT+ +W RN + ++ A+H+I Sbjct: 570 
TILTSVSTTWRNRTESFLAR----YIFQATIHTIWHERNGRRHGERSNSATHLI 619 >ref|XP_004253224.1| PREDICTED: uncharacterized protein LOC101268376 [Solanum lycopersicum] Length = 717 Score = 233 bits (594), Expect = 2e-58 Identities = 113/265 (42%), Positives = 165/265 (62%), Gaps = 5/265 (1%) Frame = +2 Query: 35 RGLRQGDPMSPTLFLLCMEYLSHLIHARTHDSTFMHHPKCSTTDTTHLAFADDLLLFGRG 214 +GLRQGDPMSP LF + MEYLS L+ D +F +HPK + D THL FADDLLLF RG Sbjct: 447 KGLRQGDPMSPFLFAIAMEYLSRLLKGLKEDKSFKYHPKYAKLDVTHLCFADDLLLFSRG 506 Query: 215 DPDSMRVLRDALDEFTATSGLTINKSKSHIFLGGVRPFEKRAILELFGFPEGTLPVKYLG 394 D +S++ L+ EF+ SGL N +KS I+ GGV+ ++ I++ G+ LP KYLG Sbjct: 507 DLNSIKALQKCFTEFSQASGLQANLNKSSIYCGGVQMEVRQQIIQQLGYTIEELPFKYLG 566 Query: 395 LPLASKSLTTLDYSLLLAQISNFI*RWSNSNLSRVGRLELIRSVLQGVECYWLQVLPLQG 574 +PL+SK L T+ + L+ ++ I W+ LS GR +L+++VL GV+ W Q+ + Sbjct: 567 VPLSSKKLNTIQWYPLIEKVMARINSWTAKKLSYAGRAQLVKTVLFGVQALWAQLFIIPA 626 Query: 575 TVIATITKMLRKFLW-----CDSQCPVSWKTVCLPRDEGGLGLRDLAVWNKALHSKTLWN 739 +I I + R +LW + ++W VC P+ EGGLGL +L +WN++ +K W+ Sbjct: 627 KIIKLIEGLCRSYLWSGVGYVTKKALIAWDKVCSPKYEGGLGLINLKIWNRSAVTKLCWD 686 Query: 740 IHAKADSLWIKWIHAEYLRGRDIWE 814 + K D LWIKWIHA Y++G+ W+ Sbjct: 687 LANKEDKLWIKWIHAYYIKGQREWK 711 >gb|AAC63678.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 1216 Score = 230 bits (587), Expect = 1e-57 Identities = 160/553 (28%), Positives = 242/553 (43%), Gaps = 86/553 (15%) Frame = +2 Query: 2 NGGSHGFVRGQRGLRQGDPMSPTLFLLCMEYLSHLIHARTHDSTFMHHPKCSTTDTTHLA 181 NG G+ R RGLRQG +SP LF++ M+ LS ++ F +HP+C T THL Sbjct: 364 NGELAGYFRSARGLRQGCSLSPYLFVISMDVLSRMLDKAAGAREFGYHPRCKTLGLTHLC 423 Query: 182 FADDLLLFGRGDPDSMRVLRDALDEFTATSGLTINKSKSHIFLGGVRPFEKRAILELFGF 361 FADDL++ G S+ + L++F A GL I K+ ++L GV ++ + + F Sbjct: 424 FADDLMILTDGKIRSVDGIVKVLNQFAAKLGLKICMEKTTLYLAGVSDHSRQLMSSRYSF 483 Query: 362 PEGTLPVKYLGLPLASKSLTTLDYSLLLAQISNFI*RWSNSNLSRVGRLELIRSVLQGVE 541 G LPV+YLGLPL +K LTT DYS L+ QI I W++ LS GRL LI SVL + Sbjct: 484 
GVGKLPVRYLGLPLVTKRLTTSDYSPLIDQIRRRIGMWTSRYLSFAGRLSLINSVLWSIT 543 Query: 542 CYWLQVLPLQGTVIATITKMLRKFLWCDSQ-----CPVSWKTVCLPRDEGGLGLRDLAVW 706 +W+ L I I ++ LW + VSW +C P+ EGGLGL+ L Sbjct: 544 NFWMNAFRLPRECINEINRISSALLWSGPELNPKKAKVSWDEICKPKKEGGLGLQSLREA 603 Query: 707 NKALHSKTLWNIHA---------------KADSLWI--------KWIHAEYLRGRDI--- 808 NK K +W + + K +S W WI L+ R++ Sbjct: 604 NKVSSLKLIWRLLSCQDSLWVKWTRMNLLKKESFWSIGTHSTLGSWIWRRLLKHREVAKS 663 Query: 809 ------------------WEFPYP---------------------------RRDAPHMTN 853 W P RR H Sbjct: 664 FCKIEVNNGVNTSFWFDNWSEKGPLINLTGARGAIDMGISRHMTLAEAWSRRRRKRHRVE 723 Query: 854 ILRIRDQLIFDCEGNLNDAKAKLVGWFAGK--------GTSEAYEHFRAKGEKKFWYKAI 1009 IL ++++ + N + W GK T + + H R ++ W+K + Sbjct: 724 ILNEFEEILLQKYQHRNIELEDAILW-RGKEDVFKARFSTKDTWNHIRTSSNQRAWHKGV 782 Query: 1010 WRSYIPPKFSVILWLAMHGRLKTFDRLK--HSDIARGCVLCESADETHDHLFFKCDKAMA 1183 W ++ PKFS WLA+ RL T DR+ ++ CV C S ET DHLFF+C + Sbjct: 783 WFAHATPKFSFCAWLAIRNRLSTGDRMMTWNNGTPTTCVFCSSPMETRDHLFFQCCYSSE 842 Query: 1184 VWSGICSWLRCRNQMTTIPSAVRRFQREKAGSGIIRKAKWVALGATVQYLWQARNLKYVE 1363 +W+ I + +++ +T SAV + + I ++ +W+ RN + Sbjct: 843 IWTSIAKNV-YKDRFSTKWSAVVNYISDSQPDRIQSFLSRYTFQVSIHSIWRERNSRRHG 901 Query: 1364 KKPFEASHVIKEI 1402 +K AS++I++I Sbjct: 902 EKSRSASNLIRQI 914 >gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana] Length = 1213 Score = 227 bits (579), Expect = 1e-56 Identities = 117/294 (39%), Positives = 165/294 (56%), Gaps = 5/294 (1%) Frame = +2 Query: 2 NGGSHGFVRGQRGLRQGDPMSPTLFLLCMEYLSHLIHARTHDSTFMHHPKCSTTDTTHLA 181 NGG+ GF + +GLRQGDP+SP LF+L ME S+L+H+R +HPK S +HL Sbjct: 637 NGGNGGFFKSTKGLRQGDPLSPYLFVLAMEAFSNLLHSRYESGLIHYHPKASNLSISHLM 696 Query: 182 FADDLLLFGRGDPDSMRVLRDALDEFTATSGLTINKSKSHIFLGGVRPFEKRAILELFGF 361 FADD+++F G S+ + + LD+F + SGL +NK KSH++L G+ E A +GF Sbjct: 697 FADDVMIFFDGGSFSLHGICETLDDFASWSGLKVNKDKSHLYLAGLNQLESNA-NAAYGF 755 Query: 362 PEGTLPVKYLGLPLASKSLTTLDYSLLLAQISNFI*RWSNSNLSRVGRLELIRSVLQGVE 541 P GTLP++YLGLPL ++ L +Y LL +I+ W N LS GR++LI SV+ G Sbjct: 756 
PIGTLPIRYLGLPLMNRKLRIAEYEPLLEKITARFRSWVNKCLSFAGRIQLISSVIFGSI 815 Query: 542 CYWLQVLPLQGTVIATITKMLRKFLWCDS-----QCPVSWKTVCLPRDEGGLGLRDLAVW 706 +W+ L I I + +FLW + VSW +CLP+ EGGLGLR L W Sbjct: 816 NFWMSTFLLPKGCIKRIESLCSRFLWSGNIEQAKGIKVSWAALCLPKSEGGLGLRRLLEW 875 Query: 707 NKALHSKTLWNIHAKADSLWIKWIHAEYLRGRDIWEFPYPRRDAPHMTNILRIR 868 NK L + +W + DSLW W H +L W + D+ +L +R Sbjct: 876 NKTLSMRLIWRLFVAKDSLWADWQHLHHLSRGSFWAVEGGQSDSWTWKRLLSLR 929 >gb|AAG50886.1|AC025294_24 hypothetical protein [Arabidopsis thaliana] Length = 629 Score = 221 bits (563), Expect = 8e-55 Identities = 163/555 (29%), Positives = 240/555 (43%), Gaps = 91/555 (16%) Frame = +2 Query: 2 NGGSHGFVRGQRGLRQGDPMSPTLFLLCMEYLSHLIHARTHDSTFMHHPKCSTTDTTHLA 181 NG G+ R RG+RQG +SP LF++ ME LS ++ F HPKC THL Sbjct: 48 NGELAGYFRSARGIRQGCALSPYLFVISMEVLSKMLDQAAGGKRFGFHPKCKNLGLTHLC 107 Query: 182 FADDLLLFGRGDPDSMRVLRDALDEFTATSGLTINKSKSHIFLGGVRPFEKRAILELFGF 361 FADDL++ G S+ + + ++ F SGL IN K+ ++ GV + ++ + F Sbjct: 108 FADDLMILTDGKVRSVDGIVEVMNLFAKRSGLQINMEKTTLYTAGVSDHNRYMMISRYPF 167 Query: 362 PEGTLPVKYLGLPLASKSLTTLDYSLLLAQISNFI*RWSNSNLSRVGRLELIRSVLQGVE 541 G LPV+YLGLPL +K LT D S L QI N I W++ LS GRL LI SVL Sbjct: 168 GLGQLPVRYLGLPLVTKRLTKEDLSPLFEQIRNRIGTWTSRYLSFAGRLNLISSVLWSTM 227 Query: 542 CYWLQVLPLQGTVIATITKMLRKFLWCDSQ-----CPVSWKTVCLPRDEGGLGLRDLAVW 706 +W+ L + I + FLW + VSW +C P+ EGGLGLR L Sbjct: 228 NFWMSAFRLPSACLKEINSICSAFLWSGPELHRRKAKVSWDDICKPKQEGGLGLRSLTEA 287 Query: 707 NKALHSKTLWNIHAKADSL---WIK--------------------WIHAEYLRGRDIWEF 817 N K +W + + DSL W K W+ + L+ R+ + Sbjct: 288 NVVSVLKLIWRVTSNDDSLWVKWSKMNLLKQESFWSLTPNSSLGSWMWKKMLKYRETAK- 346 Query: 818 PYPRRDAP----------------HMTNILRIRDQLIFDCEGN----------------- 898 P+ R + H+ ++ R Q+ N Sbjct: 347 PFSRVEVNNGARTSFWFDNWSGMGHLMDVTGQRGQIDLGISRNKTVAEAWSNRRRRKHRT 406 Query: 899 --LNDAKAKL-------------VGWFAGKG--------TSEAYEHFRAKGEKKFWYKAI 1009 LND +A L + GKG T + + R K + WYK + Sbjct: 407 EQLNDIEAALNQKYQTRNLLREDATLWRGKGDVFKTSFSTKDTWNQVRKKSNEVAWYKGV 466 Query: 1010 
WRSYIPPKFSVILWLAMHGRLKTFDRLK----HSDIARGCVLCESADETHDHLFFKCDKA 1177 W S+ PK+ WLA+ RL T R++ SD+ C C ++ ET DHLFF C A Sbjct: 467 WFSHSTPKYQFCTWLALRNRLSTGYRMQLWNNGSDVK--CTFCSTSIETRDHLFFSCSYA 524 Query: 1178 MAVWSGICSWL---RCRNQMTTIPSAVRRFQREKAGSGIIRKAKWVALGATVQYLWQARN 1348 A+W+ I + R TI + + Q ++ S + R TV +W+ RN Sbjct: 525 SAIWTAIAKNVLQHRFSTDWQTIVNYISETQTDRIRSFLSR----YIFQLTVHTVWKERN 580 Query: 1349 LKYVEKKPFEASHVI 1393 + ++P ++++I Sbjct: 581 DRRHGEEPRTSANLI 595 >gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transcriptase [Brassica napus] Length = 1214 Score = 219 bits (558), Expect = 3e-54 Identities = 112/275 (40%), Positives = 159/275 (57%), Gaps = 5/275 (1%) Frame = +2 Query: 2 NGGSHGFVRGQRGLRQGDPMSPTLFLLCMEYLSHLIHARTHDSTFMHHPKCSTTDTTHLA 181 +G G+ +G +GLRQGDP+SP+LF++ ME LS L+ + D + +HPK S + LA Sbjct: 636 SGSLCGYFKGSKGLRQGDPLSPSLFVIAMEILSRLLENKFSDGSIGYHPKASEVRISSLA 695 Query: 182 FADDLLLFGRGDPDSMRVLRDALDEFTATSGLTINKSKSHIFLGGVRPFEKRAILELFGF 361 FADDL++F G S+R ++ L+ F SGL +N KS ++ G+ +K L FGF Sbjct: 696 FADDLMIFYDGKASSLRGIKSVLESFKNLSGLEMNTEKSAVYTAGLEDTDKEDTL-AFGF 754 Query: 362 PEGTLPVKYLGLPLASKSLTTLDYSLLLAQISNFI*RWSNSNLSRVGRLELIRSVLQGVE 541 GT P +YLGLPL + L DYS L+ +I+ W+ LS GRL+LI SV+ Sbjct: 755 VNGTFPFRYLGLPLLHRKLRRSDYSQLIDKIAARFNHWATKTLSFAGRLQLISSVIYSTV 814 Query: 542 CYWLQVLPLQGTVIATITKMLRKFLWCD-----SQCPVSWKTVCLPRDEGGLGLRDLAVW 706 +WL L + TI +M +FLW + VSW+ CLP+ EGGLGLR+ W Sbjct: 815 NFWLSSFILPKCCLKTIEQMCNRFLWGNDITRRGDIKVSWQNSCLPKAEGGLGLRNFWTW 874 Query: 707 NKALHSKTLWNIHAKADSLWIKWIHAEYLRGRDIW 811 NK L+ + +W + A+ DSLW+ W HA LR + W Sbjct: 875 NKTLNLRLIWMLFARRDSLWVAWNHANRLRHVNFW 909 >gb|AAC95175.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 1352 Score = 211 bits (538), Expect = 6e-52 Identities = 110/275 (40%), Positives = 156/275 (56%), Gaps = 5/275 (1%) Frame = +2 Query: 2 NGGSHGFVRGQRGLRQGDPMSPTLFLLCMEYLSHLIHARTHDSTFMHHPKCSTTDTTHLA 181 NG GF R +RGLRQG +SP L+++CM LS ++ + +HP+C + THL Sbjct: 790 
NGELSGFFRSERGLRQGCSLSPYLYVICMNVLSCMLDKAAVEKKISYHPRCRNMNLTHLC 849 Query: 182 FADDLLLFGRGDPDSMRVLRDALDEFTATSGLTINKSKSHIFLGGVRPFEKRAILELFGF 361 FADD+++F G S++ ++F A S L I+ KS IF+ G+ P K +IL+ F F Sbjct: 850 FADDIMVFSDGTSKSIQGTLAIFEKFAAMSWLKISLEKSTIFMAGISPNAKTSILQQFPF 909 Query: 362 PEGTLPVKYLGLPLASKSLTTLDYSLLLAQISNFI*RWSNSNLSRVGRLELIRSVLQGVE 541 GTLPVKYLGLPL +K +T DY L+ +I I W+N LS GRL+LI+SVL + Sbjct: 910 ELGTLPVKYLGLPLLTKRMTQSDYLPLVEKIRARITSWTNRFLSFAGRLQLIKSVLSSIT 969 Query: 542 CYWLQVLPLQGTVIATITKMLRKFLWC-----DSQCPVSWKTVCLPRDEGGLGLRDLAVW 706 +WL V L + I KM FLW + ++W VC ++EGGLGL+ L Sbjct: 970 NFWLSVFRLPKACLQEIEKMFSAFLWSGPDLNTKKAKIAWSEVCKLKEEGGLGLKPLKEA 1029 Query: 707 NKALHSKTLWNIHAKADSLWIKWIHAEYLRGRDIW 811 N+ K +W I + DSLW+KW++ +R W Sbjct: 1030 NEVSLLKLIWRILSARDSLWVKWVNKHLIRKETFW 1064 Score = 67.4 bits (163), Expect = 2e-08 Identities = 34/90 (37%), Positives = 49/90 (54%), Gaps = 2/90 (2%) Frame = +2 Query: 947 TSEAYEHFRAKGEKKFWYKAIWRSYIPPKFSVILWLAMHGRLKTFDRL-KHSDIAR-GCV 1120 +S+ ++ R+ + WY+ +W S PK+S + WLA H RL T D++ K + AR CV Sbjct: 1187 SSKTWQQIRSISLRCDWYRGVWFSASTPKYSFVTWLAFHNRLTTSDKICKWNSGARYDCV 1246 Query: 1121 LCESADETHDHLFFKCDKAMAVWSGICSWL 1210 C ET DHLFF C + VW + L Sbjct: 1247 FCGEELETRDHLFFSCPYSSHVWFSLTKGL 1276 >ref|XP_006579213.1| PREDICTED: uncharacterized protein LOC102670237 [Glycine max] Length = 383 Score = 209 bits (532), Expect = 3e-51 Identities = 136/433 (31%), Positives = 202/433 (46%), Gaps = 5/433 (1%) Frame = +2 Query: 2 NGGSHGFVRGQRGLRQGDPMSPTLFLLCMEYLSHLIHARTHDSTFMHHPKCSTTDTTHLA 181 NG +G +GQRGLRQ DP+SP LF+L +EY + I + ++ F +P C+ T +HL Sbjct: 14 NGSIYGHFKGQRGLRQWDPLSPYLFVLYIEYFARDIQSLKDNANFQFNPNCAVTQLSHLT 73 Query: 182 FADDLLLFGRGDPDSMRVLRDALDEFTATSGLTIN---KSKSHIFLGGVRPFEKRAILEL 352 FADD++L RGD S+ + L F SGL+I+ KS + G V RA+++ Sbjct: 74 FADDIMLLSRGDLPSVSAIYAKLQHFCNVSGLSISSRWSRKSLSYAGKVELI--RAVIQ- 130 Query: 353 FGFPEGTLPVKYLGLPLASKSLTTLDYSLLLAQISNFI*RWSNSNLSRVGRLELIRSVLQ 532 G + + PL L T ++A NF+ W ++ ++ L Sbjct: 131 
-GIANFWMSI----FPLPQSVLDT-----IIATCRNFL--WGKADGGKIKPL-------- 170 Query: 533 GVECYWLQVLPLQGTVIATITKMLRKFLWCDSQCPVSWKTVCLPRDEGGLGLRDLAVWNK 712 V+W VC P+ EGGLGL +L WN Sbjct: 171 -----------------------------------VAWSEVCTPKKEGGLGLFNLKDWNI 195 Query: 713 ALHSKTLWNIHAKADSLWIKWIHAEYLRGRDIWEFPYPRRDAPHMTNILRIRDQLIFDCE 892 AL S LW++H+K DSLW++ +H Y +G ++W+F D+ + IRD +I E Sbjct: 196 ALLSCILWDLHSKKDSLWVRLVHHYYFKGGNVWDFISSSSDSV----FIHIRD-IIISKE 250 Query: 893 GNLNDAKAKLVGWFAGKGT--SEAYEHFRAKGEKKFWYKAIWRSYIPPKFSVILWLAMHG 1066 N+ AK L W + T + Y++ R W IW IP K S ILWLA Sbjct: 251 ENIEVAKLMLNSWGCNEQTLAGKMYDYIRGTRPVVHWSSIIWNPVIPSKMSFILWLATKN 310 Query: 1067 RLKTFDRLKHSDIARGCVLCESADETHDHLFFKCDKAMAVWSGICSWLRCRNQMTTIPSA 1246 RL DR + C LC + E+H HLFF C ++ VW+ I W+ + Q ++ + Sbjct: 311 RLLALDRAAFLNKGFLCPLCTNEAESHAHLFFSCRTSLRVWAHIRDWIPLKRQSISLQHS 370 Query: 1247 VRRFQREKAGSGI 1285 + R +A SG+ Sbjct: 371 ISALIRRRATSGV 383 >gb|AAF98181.1|AC000107_4 F17F8.5 [Arabidopsis thaliana] Length = 872 Score = 208 bits (529), Expect = 7e-51 Identities = 118/296 (39%), Positives = 159/296 (53%), Gaps = 6/296 (2%) Frame = +2 Query: 2 NGGSHGFVRGQRGLRQGDPMSPTLFLLCMEYLSHLIHARTHDSTFMHHPKCSTTDTTHLA 181 NG G+ + +RGLRQG +SP LF++CM+ LS ++ F HPKC THL+ Sbjct: 290 NGDLVGYFQSKRGLRQGCSLSPYLFVICMDVLSKMLDKAAGVRKFGFHPKCQRLGLTHLS 349 Query: 182 FADDLLLFGRGDPDSMRVLRDALDEFTATSGLTINKSKSHIFLGGVRPFEKRAILELFGF 361 FADDL++ G S+ + + DEF SGL I+ KS +++ GV P K+ I F F Sbjct: 350 FADDLMVLSDGKTRSIEGILEVFDEFCKRSGLRISLEKSTLYMAGVSPIIKQEIAAKFLF 409 Query: 362 PEGTLPVKYLGLPLASKSLTTLDYSLLLAQISNFI*RWSNSNLSRVGRLELIRSVLQGVE 541 G LPV+YLGLPL +K LT+ DYS LL QI I W+ S GR LI+SVL + Sbjct: 410 DVGQLPVRYLGLPLVTKRLTSADYSPLLEQIKKRIATWTFRFFSFAGRFNLIKSVLWSIC 469 Query: 542 CYWLQVLPLQGTVIATITKMLRKFLWCDSQ-----CPVSWKTVCLPRDEGGLGLRDLAVW 706 +WL L I I K+ FLW S+ +SW VC P+ EGGLGLR+L Sbjct: 470 NFWLAAFRLPRQCIREIDKLCSSFLWSGSEMSSHKAKISWDIVCKPKAEGGLGLRNLKEA 529 Query: 707 NKALHSKTLWNIHAKADSLWIKWIHAEYLRGRDIWEFPYPRRDAPHM-TNILRIRD 871 N K +W I + ++SLW KW+ +R + IW + IL+IRD 
Sbjct: 530 NDVSCLKLVWRIISNSNSLWTKWVAEYLIRKKSIWSLKQSTSMGSWIWRKILKIRD 585 Score = 71.2 bits (173), Expect = 1e-09 Identities = 39/156 (25%), Positives = 74/156 (47%), Gaps = 7/156 (4%) Frame = +2 Query: 947 TSEAYEHFRAKGEKKFWYKAIWRSYIPPKFSVILWLAMHGRLKTFDRL----KHSDIARG 1114 T + + +A W+K +W + PK+++ WLA+H RL T DR+ ++ Sbjct: 687 TRDTWHLIKATSSTVSWHKGVWFRHATPKYALCTWLAIHNRLPTGDRMLKWNSSGSVSGN 746 Query: 1115 CVLCESADETHDHLFFKCDKAMAVWSGICSWL---RCRNQMTTIPSAVRRFQREKAGSGI 1285 CVLC + +T +HLFF C A VW+ + + R + + + + + +++ + Sbjct: 747 CVLCTNNSKTLEHLFFSCSYASTVWAALAKGIWKTRYSTRWSHLLTHISTHFQDRVEGFL 806 Query: 1286 IRKAKWVALGATVQYLWQARNLKYVEKKPFEASHVI 1393 R AT+ ++W+ RN + + P + VI Sbjct: 807 TR----YIFQATIYHVWRERNGRRHDAAPNTPATVI 838 >dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis thaliana] Length = 1223 Score = 207 bits (528), Expect = 9e-51 Identities = 111/276 (40%), Positives = 151/276 (54%), Gaps = 5/276 (1%) Frame = +2 Query: 2 NGGSHGFVRGQRGLRQGDPMSPTLFLLCMEYLSHLIHARTHDSTFMHHPKCSTTDTTHLA 181 NG G+ + RGLRQG +SP LF++CM+ LS ++ F +HPKC T THL+ Sbjct: 643 NGELAGYFQSSRGLRQGCALSPYLFVICMDVLSKMLDKAAAARHFGYHPKCKTMGLTHLS 702 Query: 182 FADDLLLFGRGDPDSMRVLRDALDEFTATSGLTINKSKSHIFLGGVRPFEKRAILELFGF 361 FADDL++ G S+ + DEF SGL I+ KS ++L G+ + + + F F Sbjct: 703 FADDLMVLSDGKIRSIERIIKVFDEFAKWSGLRISLEKSTVYLAGLSATARNEVADRFPF 762 Query: 362 PEGTLPVKYLGLPLASKSLTTLDYSLLLAQISNFI*RWSNSNLSRVGRLELIRSVLQGVE 541 G LPV+YLGLPL +K L+T D LL Q+ I W++ LS GRL LI SVL + Sbjct: 763 SSGQLPVRYLGLPLITKRLSTTDCLPLLEQVRKRIGSWTSRFLSYAGRLNLISSVLWSIC 822 Query: 542 CYWLQVLPLQGTVIATITKMLRKFLWC-----DSQCPVSWKTVCLPRDEGGLGLRDLAVW 706 +WL L I + KM FLW ++ +SW VC P+DEGGLGLR L Sbjct: 823 NFWLAAFRLPRKCIRELEKMCSAFLWSGTEMNSNKAKISWHMVCKPKDEGGLGLRSLKEA 882 Query: 707 NKALHSKTLWNIHAKADSLWIKWIHAEYLRGRDIWE 814 N K +W I + ++SLW+KW+ LR WE Sbjct: 883 NDVCCLKLVWKIVSHSNSLWVKWVDQHLLRNASFWE 918 Score = 81.6 bits (200), Expect = 1e-12 Identities = 49/168 (29%), Positives = 79/168 (47%), Gaps = 5/168 (2%) Frame = +2 
Query: 947 TSEAYEHFRAKGEKKFWYKAIWRSYIPPKFSVILWLAMHGRLKTFDRLKH--SDIARGCV 1120 T + + H R+ + W+K IW S+ PK+S WLA HGRL T DR+ + + IA C+ Sbjct: 1040 TRDTWHHTRSTSARVPWHKVIWFSHATPKYSFCSWLAAHGRLPTGDRMINWANGIATDCI 1099 Query: 1121 LCESADETHDHLFFKCDKAMAVWSGICSWL---RCRNQMTTIPSAVRRFQREKAGSGIIR 1291 C+ ET DHLFF C +W + + + + +I A+ Q + + R Sbjct: 1100 FCQGTLETRDHLFFTCSFTSVIWVDLARGIFKTQYTSHWQSIIEAITNSQHHRVEWFLRR 1159 Query: 1292 KAKWVALGATVQYLWQARNLKYVEKKPFEASHVIKEIKLDVYRVLYSL 1435 AT+ +W+ RN + + P AS ++ I + L S+ Sbjct: 1160 ----YVFQATIYIVWRERNGRRHGEPPNTASQLVGWIDKQIRNQLSSI 1203 >dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like [Arabidopsis thaliana] Length = 1072 Score = 207 bits (527), Expect = 1e-50 Identities = 111/277 (40%), Positives = 153/277 (55%), Gaps = 6/277 (2%) Frame = +2 Query: 2 NGGSHGFVRGQRGLRQGDPMSPTLFLLCMEYLSHLIHARTHDSTFMH-HPKCSTTDTTHL 178 NG + GF R +GLRQGDP+SP LF+L ME S L+++R +DS ++H HPK +HL Sbjct: 497 NGATGGFFRSTKGLRQGDPLSPYLFVLAMEVFSKLLYSR-YDSGYIHYHPKAGDLSISHL 555 Query: 179 AFADDLLLFGRGDPDSMRVLRDALDEFTATSGLTINKSKSHIFLGGVRPFEKRAILELFG 358 FADD+++F G SM + + LD+F SGL +NK KS +F G+ +R +G Sbjct: 556 MFADDVMIFFDGGSSSMHGICETLDDFADWSGLKVNKDKSQLFQAGL-DLSERITSAAYG 614 Query: 359 FPEGTLPVKYLGLPLASKSLTTLDYSLLLAQISNFI*RWSNSNLSRVGRLELIRSVLQGV 538 FP GT P++YLGLPL + L DY LL ++S + W + LS GR +LI SV+ G+ Sbjct: 615 FPAGTFPIRYLGLPLMCRKLRIADYGPLLEKLSARLRSWVSKALSFAGRTQLISSVIFGL 674 Query: 539 ECYWLQVLPLQGTVIATITKMLRKFLWCDS-----QCPVSWKTVCLPRDEGGLGLRDLAV 703 +W+ L I I + KFLW S VSW CLP+ EGGLG R Sbjct: 675 INFWMSTFLLPKGCIKKIESLCSKFLWAGSIDGRKSSKVSWVDCCLPKSEGGLGFRSFGE 734 Query: 704 WNKALHSKTLWNIHAKADSLWIKWIHAEYLRGRDIWE 814 WNK L + +W + + SLW +W L W+ Sbjct: 735 WNKTLLLRLIWVLFDRDTSLWAQWQRHHRLGHASFWQ 771 >dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana] Length = 1072 Score = 207 bits (527), Expect = 1e-50 Identities = 111/277 (40%), Positives = 153/277 (55%), Gaps = 6/277 (2%) Frame = +2 Query: 2 NGGSHGFVRGQRGLRQGDPMSPTLFLLCMEYLSHLIHARTHDSTFMH-HPKCSTTDTTHL 178 NG + GF 
R +GLRQGDP+SP LF+L ME S L+++R +DS ++H HPK +HL Sbjct: 497 NGATGGFFRSTKGLRQGDPLSPYLFVLAMEVFSKLLYSR-YDSGYIHYHPKAGDLSISHL 555 Query: 179 AFADDLLLFGRGDPDSMRVLRDALDEFTATSGLTINKSKSHIFLGGVRPFEKRAILELFG 358 FADD+++F G SM + + LD+F SGL +NK KS +F G+ +R +G Sbjct: 556 MFADDVMIFFDGGSSSMHGICETLDDFADWSGLKVNKDKSQLFQAGL-DLSERITSAAYG 614 Query: 359 FPEGTLPVKYLGLPLASKSLTTLDYSLLLAQISNFI*RWSNSNLSRVGRLELIRSVLQGV 538 FP GT P++YLGLPL + L DY LL ++S + W + LS GR +LI SV+ G+ Sbjct: 615 FPAGTFPIRYLGLPLMCRKLRIADYGPLLEKLSARLRSWVSKALSFAGRTQLISSVIFGL 674 Query: 539 ECYWLQVLPLQGTVIATITKMLRKFLWCDS-----QCPVSWKTVCLPRDEGGLGLRDLAV 703 +W+ L I I + KFLW S VSW CLP+ EGGLG R Sbjct: 675 INFWMSTFLLPKGCIKKIESLCSKFLWAGSIDGRKSSKVSWVDCCLPKSEGGLGFRSFGE 734 Query: 704 WNKALHSKTLWNIHAKADSLWIKWIHAEYLRGRDIWE 814 WNK L + +W + + SLW +W L W+ Sbjct: 735 WNKTLLLRLIWVLFDRDTSLWAQWQRHHRLGHASFWQ 771