BLASTX nr result
ID: Rehmannia22_contig00016196
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia22_contig00016196
         (4036 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_006576082.1| PREDICTED: uncharacterized protein LOC102659...   392   e-106
emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulga...   377   e-101
emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulga...   373   e-100
gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transc...   359   5e-96
gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]       339   5e-90
gb|AAC67331.1| putative non-LTR retroelement reverse transcripta...   323   4e-85
dbj|BAF00918.1| putative reverse transcriptase [Arabidopsis thal...   317   4e-83
dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like ...   308   2e-80
dbj|BAB01845.1| non-LTR retroelement reverse transcriptase-like ...   307   2e-80
emb|CAA66812.1| non-ltr retrotransposon reverse transcriptase-li...   307   2e-80
gb|AAD32866.1|AC005489_4 F14N23.4 [Arabidopsis thaliana]              303   5e-79
emb|CCA65995.1| hypothetical protein [Beta vulgaris subsp. vulga...   302   9e-79
emb|CAA18234.1| putative protein [Arabidopsis thaliana] gi|72694...   293   6e-76
gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao]   286   7e-74
gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao]   285   9e-74
dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like ...   284   2e-73
dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana]           284   2e-73
emb|CCA66222.1| hypothetical protein [Beta vulgaris subsp. vulga...
283 4e-73 gb|EOY25454.1| Uncharacterized protein TCM_026877 [Theobroma cacao] 283 6e-73 gb|EOY02236.1| Uncharacterized protein TCM_011923 [Theobroma cacao] 281 1e-72 >ref|XP_006576082.1| PREDICTED: uncharacterized protein LOC102659506 [Glycine max] Length = 964 Score = 392 bits (1007), Expect = e-106 Identities = 205/515 (39%), Positives = 299/515 (58%) Frame = -3 Query: 1664 VHVDILFISSQLIHCKIICNISHATFITSFVYGLYSVGDRRLLWDDLYNIGHSLIDPWLI 1485 +H +L ++QLIHC I C + F SF+YGL+S+ RR LW +L +I ++ PWL+ Sbjct: 450 IHHSVLESNAQLIHCAIDCKTTAKRFQVSFIYGLHSIMARRSLWINLNSINANMNCPWLL 509 Query: 1484 LGDFNCIKSPDEKLNGAAINNYDLKDFQDICLTLGLNDVQTTGCFFTWTNNSVWCRLDRA 1305 +GDFN I SP ++ NGA +N Y+L+DF D LGL + T G +TWTN+ VW +LDRA Sbjct: 510 IGDFNSILSPTDRFNGAELNAYELQDFVDCYSDLGLGSINTHGPLYTWTNSRVWSKLDRA 569 Query: 1304 LVNSAWAQLGLQISAHAPVPGAISDHXXXXXXXXXXXXXXSKPFKFFNMWTSHPSFLNVV 1125 L N AW + +ISDH + PFKF N+ HP+FL +V Sbjct: 570 LCNQAWFNSFGNSACEVMEFISISDHTPLVVTTELVVPRGNSPFKFNNLIVDHPNFLRIV 629 Query: 1124 QNAWFNPVWGTAQFMXXXXXXXXXXXXXXXNSQHFSHISSRVKEAKDTLDSLQNQLHLDP 945 + W + G + F Q FS+IS+RV+ A+ +S+ N + +P Sbjct: 630 ADGWKQNIHGCSMFKVCKKLKALKAPLKNLFKQEFSNISNRVELAEAEYNSVLNSIKQNP 689 Query: 944 LNSSLSFEVKSQILKVISLTKAEKMFLSQKAKCSFLNLSDKNTKFFHSIIKRNALKRQLS 765 + SL + I L KAE M +Q K +L +DK +KFFH++IKRN R ++ Sbjct: 690 QDPSLLALANRTRGQTIMLRKAESMKFAQLIKNKYLLQADKCSKFFHALIKRNKHSRFIA 749 Query: 764 SVSLLDGTQTTSLKELSDAFVNFYKGLFGTYYSTNPIDQSVINHGPCLTEEHANLLSAPV 585 ++ L DG T+S E++ AFVN ++ F + T S+ N GP + + L P Sbjct: 750 AIRLEDGHNTSSQDEIALAFVNHFRNFFSAHELTQTPSISICNRGPKVPTDCFAALLCPT 809 Query: 584 LPSDIKAAIFNIDDDRSPGPDGYSSAFFKKSWDIVGNDVINAIQEFFSNAKLLKQINHTS 405 + I + ++++PGPDG++ FFKK+W+IVG+D+ A+ EFF+ K+LKQ+NH Sbjct: 810 SKQKVWNIISVMANNKAPGPDGFNVLFFKKAWNIVGDDIFAAVNEFFTTGKILKQLNHAI 869 Query: 404 ITLIPKTDHSPSVADFRPIACCNVVYKAITKIIAARLEKVLPSLINPAQAAFVGGRNITD 225 I LIPK D + V FRPI+CCN++YK ++KI+A R+ VL ++I Q AF+ R + D Sbjct: 870 IVLIPKHDQASQVNHFRPISCCNLLYKIVSKILANRIAPVLETIIGETQTAFIKNRKMMD 929 Query: 224 
NIFLAQEIIRNYARKRISPRCTIKVDIKKAYDTVS 120 NIFL QEI+R YARKR SPRC +K+D+ KAYD +S Sbjct: 930 NIFLVQEILRKYARKRPSPRCLLKIDLHKAYDFIS 964 Score = 166 bits (421), Expect = 6e-38 Identities = 105/320 (32%), Positives = 159/320 (49%), Gaps = 5/320 (1%) Frame = -2 Query: 3273 LKIEPIHSEGDIVILEEEDLCNVEKDWGHCLLGCFAGKWPGMNNIKNLVKYWNVDCQVLP 3094 +K P S+ D V+LEE DL +E+ WGH L+G AG++PG + + K W V Sbjct: 151 MKFSPPPSD-DEVLLEETDLQPLEEAWGHSLIGYVAGRFPGKKALLDCCKKWGVKFSYSA 209 Query: 3093 YVNGCVVFRFKDYDTMEKILQGGPFHCLGRTLMLKIFSNDTVFSKNDFSTVPIWVKFPFL 2914 + +G +VF+F+ D + ++L GP+ R L+LK+ F + S +P+WVK L Sbjct: 210 HESGWLVFKFESEDDLNQVLSAGPYFIFQRPLLLKVMPAFFDFGNEELSKIPVWVKLRNL 269 Query: 2913 PAPCWQSKALGKICDKIGSPICCDRQTFDKSRPAFARILVDIDPTKKPPDSVQIQFSGGK 2734 P W +ALGKI KIGSPI D T K +FAR LV++D + + D V+ + GK Sbjct: 270 PLELWNPQALGKILSKIGSPIRSDHLTASKGSISFARALVEVDASLELIDEVRFRLPTGK 329 Query: 2733 SFDQKVEYEFIPKYCKKCACFGHYFDNCSATNAPSKTAKGVNQIAKNIAHPAAKNHVNLP 2554 +F QK+EYE P +C C GH NC A + + P + P Sbjct: 330 TFVQKIEYENRPSFCTHCKMTGHRLTNCKTITANKPI---LITACPTLDQPQVGDSAMPP 386 Query: 2553 SEPVIDPIDKGKNVMQTVIHVPESENFQKDKLIANN-----SSKNLEGSLDSTKDPCPGD 2389 P D KNV QT + VP +K L+AN+ +S +L+ + ++ D + Sbjct: 387 HTNTAGPSDNPKNV-QTNLSVPP---HRKHVLVANHVPVLQNSSDLKETGNTELDDPIEE 442 Query: 2388 SSFYNDPAIIALLETNTSVI 2329 +D ++LE+N +I Sbjct: 443 GFVQHDKIHHSVLESNAQLI 462 >emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. 
vulgaris] Length = 1114 Score = 377 bits (967), Expect = e-101 Identities = 211/641 (32%), Positives = 341/641 (53%), Gaps = 7/641 (1%) Frame = -3 Query: 1913 MKIAVWNVRGFKNPIKHNQLLAFIKEHNLNILCLLETKLEDDICFDSALCNFIVKTKFPG 1734 MKI WNVRG +PIK ++ F+ +++ L ET++ ++ KF Sbjct: 1 MKITTWNVRGLNDPIKVKEVKHFLHSQKISLCSLFETRVRQQNSGK-------IQKKFGN 53 Query: 1733 -WLYSNNFHEYRNGRMLLIWNPSSVHVDILFISSQLIHCKIICNISHATFITSFVYGLYS 1557 W + NN+ GR+ + W + V++++L ++ Q+I ++ + F + VYGL++ Sbjct: 54 RWSWINNYACSPRGRIWVGWLNNDVNINVLSVTEQVITMEVKNSYGLNMFKMAAVYGLHT 113 Query: 1556 VGDRRLLWDDLYNIGHSLIDPWLILGDFNCIKSPDEKLNGAAINNYDLKDFQDICLTLGL 1377 + DR++LW++LYN +P +++GD+N + S ++LNG ++ + D + L L Sbjct: 114 IADRKVLWEELYNFVSVCHEPCILIGDYNAVYSAQDRLNGNDVSEAETSDLRSFVLKAQL 173 Query: 1376 NDVQTTGCFFTWTNNSVWC-----RLDRALVNSAWAQLGLQISAHAPVPGAISDHXXXXX 1212 + TTG F++W N S+ R+D++ VN AW + G ISDH Sbjct: 174 LEAPTTGLFYSWNNKSIGADRISSRIDKSFVNVAWINQYPDVVVEYREAG-ISDHSPLIF 232 Query: 1211 XXXXXXXXXSKPFKFFNMWTSHPSFLNVVQNAWFNPVWGTAQFMXXXXXXXXXXXXXXXN 1032 +PFKF N F+ VV+ AW + + Sbjct: 233 NLATQHDEGGRPFKFLNFLADQNGFVEVVKEAWGSANHRFKMKNIWVRLQAVKRALKSFH 292 Query: 1031 SQHFSHISSRVKEAKDTLDSLQNQLHLDPLNSSLSFEVKSQILKVISLTKAEKMFLSQKA 852 S+ FS +V+E + L ++Q + + S L E K I ++ + ++ L QK+ Sbjct: 293 SKKFSKAHCQVEELRRKLAAVQALPEVSQV-SELQEEEKDLIAQLRKWSTIDESILKQKS 351 Query: 851 KCSFLNLSDKNTKFFHSIIKRNALKRQLSSVSLLDGTQTTSLKELSDAFVNFYKGLFGTY 672 + +L+L D N+KFF + IK + ++ + G Q T E+ + NFY+ L GT Sbjct: 352 RIQWLSLGDSNSKFFFTAIKVRKARNKIVLLQNDRGDQLTENTEIQNEICNFYRRLLGTS 411 Query: 671 YST-NPIDQSVINHGPCLTEEHANLLSAPVLPSDIKAAIFNIDDDRSPGPDGYSSAFFKK 495 S ID V+ G L+ L P+ +I A+ +IDD ++PG DG++S FFKK Sbjct: 412 SSQLEAIDLHVVRVGAKLSATSCAQLVQPITIQEIDQALADIDDTKAPGLDGFNSVFFKK 471 Query: 494 SWDIVGNDVINAIQEFFSNAKLLKQINHTSITLIPKTDHSPSVADFRPIACCNVVYKAIT 315 SW ++ ++ I +FF N + K IN T++TLIPK D + D+RPIACC+ +YK I+ Sbjct: 472 SWLVIKQEIYEGILDFFENGFMHKPINCTAVTLIPKIDEAKHAKDYRPIACCSTLYKIIS 531 Query: 314 KIIAARLEKVLPSLINPAQAAFVGGRNITDNIFLAQEIIRNYARKRISPRCTIKVDIKKA 135 KI+ RL+ V+ +++ 
AQ F+ R+I DNI LA E+IR Y R+ +SPRC IKVDI+KA Sbjct: 532 KILTKRLQAVITEVVDCAQTGFIPERHIGDNILLATELIRGYNRRHVSPRCVIKVDIRKA 591 Query: 134 YDTVSWSFLHGILAGMGFPNLFISWVMECVTSASFSLCING 12 YD+V W FL +L +GFP++FI W+M CV + S+S+ +NG Sbjct: 592 YDSVEWVFLESMLKELGFPSMFIRWIMACVKTVSYSILLNG 632 >emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulgaris] Length = 1110 Score = 373 bits (957), Expect = e-100 Identities = 212/637 (33%), Positives = 339/637 (53%), Gaps = 8/637 (1%) Frame = -3 Query: 1898 WNVRGFKNPIKHNQLLAFIKEHNLNILCLLETKLEDDICFDSALCNFIVKTKF-PGWLYS 1722 WNVRG +P K ++ F+ H + + LLET++ + V+ K W + Sbjct: 6 WNVRGMNDPFKIKEIKNFLYSHKIVVCALLETRVREQNASK-------VQGKLGKDWKWL 58 Query: 1721 NNFHEYRNGRMLLIWNPSSVHVDILFISSQLIHCKIICNISHATFITSFVYGLYSVGDRR 1542 NN+ R+ + W P+ V+V + QL+ C I + SH + + VYGL+++ DR+ Sbjct: 59 NNYSHSARERIWIGWRPAWVNVTLTHTQEQLMVCDIQ-DQSHKLKMVA-VYGLHTIADRK 116 Query: 1541 LLWDDLYNIGHSLIDPWLILGDFNCIKSPDEKLNGAAINNYDLKDFQDICLTLGLNDVQT 1362 LW L DP +I+GDFN + +++L G + + + +DFQ L L + ++ Sbjct: 117 SLWSGLLQCVQQQ-DPMIIIGDFNAVCHSNDRLYGTLVTDAETEDFQQFLLQSNLIESRS 175 Query: 1361 TGCFFTWTNNS-----VWCRLDRALVNSAWAQLGLQISAHAPVPGAISDHXXXXXXXXXX 1197 T +++W+N+S V R+D+A VN W + ++S PG ISDH Sbjct: 176 TWSYYSWSNSSIGRDRVLSRIDKAYVNLVWLGMYAEVSVQYLPPG-ISDHSPLLFNLMTG 234 Query: 1196 XXXXSKPFKFFNMWTSHPSFLNVVQNAWFNPVWGTAQFMXXXXXXXXXXXXXXXN-SQHF 1020 KPFKF N+ FL V+ AW N V G + +Q Sbjct: 235 RPQGGKPFKFMNVMAEQGEFLETVEKAW-NSVNGRFKLQAIWLNLKAVKRELKQMKTQKI 293 Query: 1019 SHISSRVKEAKDTLDSLQNQLHLDPLNSSLSFEVKSQILKVISLTKAEKMFLSQKAKCSF 840 +VK + L LQ+Q D N + + KS + + + E L QK++ ++ Sbjct: 294 GLAHEKVKNLRHQLQDLQSQDDFDH-NDIMQTDAKSIMNDLRHWSHIEDSILQQKSRITW 352 Query: 839 LNLSDKNTKFFHSIIKRNALKRQLSSVSLLDGTQTTSLKELSDAFVNFYKGLFGTYYST- 663 L D N+K F + +K ++ ++ DG E+ + + FYK L GT ST Sbjct: 353 LQQGDTNSKLFFTAVKARHAINRIDMLNTEDGRVIQDADEVQEEILEFYKKLLGTRASTL 412 Query: 662 NPIDQSVINHGPCLTEEHANLLSAPVLPSDIKAAIFNIDDDRSPGPDGYSSAFFKKSWDI 483 +D + + G CL+ + L V ++I A+ I +D++PG DG+++ FFKKSW Sbjct: 413 
MGVDLNTVRGGKCLSAQAKESLIREVASTEIDEALAGIGNDKAPGLDGFNAYFFKKSWGS 472 Query: 482 VGNDVINAIQEFFSNAKLLKQINHTSITLIPKTDHSPSVADFRPIACCNVVYKAITKIIA 303 + ++ IQEFF+N+++ + IN +TL+PK H+ V +FRPIACC V+YK I+K++ Sbjct: 473 IKQEIYAGIQEFFNNSRMHRPINCIVVTLLPKVQHATRVKEFRPIACCTVIYKIISKMLT 532 Query: 302 ARLEKVLPSLINPAQAAFVGGRNITDNIFLAQEIIRNYARKRISPRCTIKVDIKKAYDTV 123 R++ ++ ++N AQ+ F+ GR+I DNI LA E+IR Y RK +SPRC +KVDI+KAYD+V Sbjct: 533 NRMKGIIGEVVNEAQSGFIPGRHIADNILLASELIRGYTRKHMSPRCIMKVDIRKAYDSV 592 Query: 122 SWSFLHGILAGMGFPNLFISWVMECVTSASFSLCING 12 WSFL +L GFP+ F+ W+MECV++ S+S+ +NG Sbjct: 593 EWSFLETLLYEFGFPSRFVGWIMECVSTVSYSVLVNG 629 >gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transcriptase [Brassica napus] Length = 1214 Score = 359 bits (922), Expect = 5e-96 Identities = 215/644 (33%), Positives = 337/644 (52%), Gaps = 13/644 (2%) Frame = -3 Query: 1898 WNVRGFKNPIKHNQLLAFIKEHNLNILCLLETKLEDDICFDSALCNFIVKTKFPGWLYSN 1719 WNVRGF N ++ + K +LET++++ S L + FPGW Sbjct: 7 WNVRGFNNSVRRRNFRKWFKLSKALFGSILETRVKEHRARRSLL------SSFPGWKSVC 60 Query: 1718 NFHEYRNGRMLLIWNPSSVHVDILFISSQLIHCKIICNISHATFITSFVYGLYSVGDRRL 1539 N+ GR+ ++W+P+ V V +L S Q I C + F+ +FVY + RR Sbjct: 61 NYEFAALGRIWVVWDPA-VEVTVLSKSDQTISCTVKLPHISTEFVVTFVYAVNCRYGRRR 119 Query: 1538 LWDDLYNIGHSLID---PWLILGDFNCIKSPDEKLNGAAINNYDLKDFQDICLTLGLNDV 1368 LW +L + + PW+ILGDFN P + G + +++F++ LT ++D+ Sbjct: 120 LWSELELLAANQTTSDKPWIILGDFNQSLDPVDASTGGSRITRGMEEFRECLLTSNISDL 179 Query: 1367 QTTGCFFTWTNNS----VWCRLDRALVNSAWAQLGLQISAHAPVPGAISDHXXXXXXXXX 1200 G +TW NN + ++DR LVN +W + +S + SDH Sbjct: 180 PFRGNHYTWWNNQENNPIAKKIDRILVNDSWL-IASPLSYGSFCAMEFSDHCPSCVNISN 238 Query: 1199 XXXXXSKPFKFFNMWTSHPSFLNVVQNAWFNPVW-GTAQFMXXXXXXXXXXXXXXXNSQH 1023 +KPFK N HP F+ ++ W + G+A F N +H Sbjct: 239 QSGGRNKPFKLSNFLMHHPEFIEKIRVTWDRLAYQGSAMFTLSKKSKFLKGTIRTFNREH 298 Query: 1022 FSHISSRVKEAKDTLDSLQNQLHLDPLNSSLSFEVKSQILKVISLTKAEKMFLSQKAKCS 843 +S + RV +A L + QN L P +S L+ K L AE+ FL QK++ Sbjct: 299 YSGLEKRVVQAAQNLKTCQNNLLAAP-SSYLAGLEKEAHRSWAELALAEERFLCQKSRVL 357 
Query: 842 FLNLSDKNTKFFHSIIKRNALKRQLSSVSLLDGTQTTSLKELSDAFVNFYKGLFGTYYST 663 +L D NT FFH ++ ++ + G + + EL V+F+K LFG+ S+ Sbjct: 358 WLKCGDSNTTFFHRMMTARRAINEIHYLLDQTGRRIENTDELQTHCVDFFKELFGS--SS 415 Query: 662 NPIDQSVINHGPCLT-----EEHANLLSAPVLPSDIKAAIFNIDDDRSPGPDGYSSAFFK 498 + I I+ LT E LL A V +DIK+ F + ++SPGPDGY+S FFK Sbjct: 416 HLISAEGISQINSLTRFKCDENTRQLLEAEVSEADIKSEFFALPSNKSPGPDGYTSEFFK 475 Query: 497 KSWDIVGNDVINAIQEFFSNAKLLKQINHTSITLIPKTDHSPSVADFRPIACCNVVYKAI 318 K+W IVG +I A+QEFF + +LL Q N T++T++PK ++ + +FRPI+CCN +YK I Sbjct: 476 KTWSIVGPSLIAAVQEFFRSGRLLGQWNSTAVTMVPKKPNADRITEFRPISCCNAIYKVI 535 Query: 317 TKIIAARLEKVLPSLINPAQAAFVGGRNITDNIFLAQEIIRNYARKRISPRCTIKVDIKK 138 +K++A RLE +LP I+P+Q+AFV GR +T+N+ LA E+++ + + IS R +KVD++K Sbjct: 536 SKLLARRLENILPLWISPSQSAFVKGRLLTENVLLATELVQGFGQANISSRGVLKVDLRK 595 Query: 137 AYDTVSWSFLHGILAGMGFPNLFISWVMECVTSASFSLCINGSL 6 A+D+V W F+ L P F++W+ +C+TS SFS+ ++GSL Sbjct: 596 AFDSVGWGFIIETLKAANAPPRFVNWIKQCITSTSFSINVSGSL 639 >gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana] Length = 1213 Score = 339 bits (870), Expect = 5e-90 Identities = 212/645 (32%), Positives = 347/645 (53%), Gaps = 16/645 (2%) Frame = -3 Query: 1898 WNVRGFKNPIKHNQLLAFIKEHNLNILCLLETKLED--DICFDSALCNFIVKTKFPGWLY 1725 WN+RGF N + ++K + ++ET ++ D F +AL PGW + Sbjct: 8 WNIRGFNNVSHRSGFKKWVKANKPIFGGVIETHVKQPKDRKFINAL--------LPGWSF 59 Query: 1724 SNNFHEYRNGRMLLIWNPSSVHVDILFISSQLIHCKIICNISHATFITSFVYGLYSVGDR 1545 N+ G++ ++W+PS V V ++ S Q+I C+++ S + I S VY V R Sbjct: 60 VENYAFSDLGKIWVMWDPS-VQVVVVAKSLQMITCEVLLPGSPSWIIVSVVYAANEVASR 118 Query: 1544 RLLWDDLYNIGHSLI---DPWLILGDFNCIKSPDEKLNGAAIN-NYDLKDFQDICLTLGL 1377 + LW ++ N+ S I PWL+LGDFN + +P E N ++N + +++DF+D L L Sbjct: 119 KELWIEIVNMVVSGIIGDRPWLVLGDFNQVLNPQEHSNPVSLNVDINMRDFRDCLLAAEL 178 Query: 1376 NDVQTTGCFFTWTNNS----VWCRLDRALVNSAWAQLGLQISAHAPVPGAI--SDHXXXX 1215 +D++ G FTW N S V ++DR LVN +W L + + G++ SDH Sbjct: 179 SDLRYKGNTFTWWNKSHTTPVAKKIDRILVNDSWNAL---FPSSLGIFGSLDFSDHVSCG 235 Query: 1214 
XXXXXXXXXXSKPFKFFNMWTSHPSFLNVVQNAWFN-PVWGTAQFMXXXXXXXXXXXXXX 1038 +PFKFFN + FLN+V++ WF V G++ F Sbjct: 236 VVLEETSIKAKRPFKFFNYLLKNLDFLNLVRDNWFTLNVVGSSMFRVSKKLKALKKPIKD 295 Query: 1037 XNSQHFSHISSRVKEAKDTLDSLQNQLHLDPLNSSLSFEVKSQILKVISLTKAEKMFLSQ 858 + ++S + R KEA D L Q++ DP + SFE++++ K LT AE+ F Q Sbjct: 296 FSRLNYSELEKRTKEAHDFLIGCQDRTLADPTPINASFELEAE-RKWHILTAAEESFFRQ 354 Query: 857 KAKCSFLNLSDKNTKFFHSIIKRNALKRQLSSVSLLDGTQTTSLKELSDAFVNFYKGLFG 678 K++ S+ D NTK+FH + +S++ +G S + + D +++ L G Sbjct: 355 KSRISWFAEGDGNTKYFHRMADARNSSNSISALYDGNGKLVDSQEGILDLCASYFGSLLG 414 Query: 677 TY---YSTNPIDQSVINHGPCLTEEHANLLSAPVLPSDIKAAIFNIDDDRSPGPDGYSSA 507 Y D +++ C + L S DI+AA+F++ ++S GPDG+++ Sbjct: 415 DEVDPYLMEQNDMNLLLSYRCSPAQVCELEST-FSNEDIRAALFSLPRNKSCGPDGFTAE 473 Query: 506 FFKKSWDIVGNDVINAIQEFFSNAKLLKQINHTSITLIPKTDHSPSVADFRPIACCNVVY 327 FF SW IVG +V +AI+EFFS+ LLKQ N T+I LIPK + +DFRPI+C N +Y Sbjct: 474 FFIDSWSIVGAEVTDAIKEFFSSGCLLKQWNATTIVLIPKIVNPTCTSDFRPISCLNTLY 533 Query: 326 KAITKIIAARLEKVLPSLINPAQAAFVGGRNITDNIFLAQEIIRNYARKRISPRCTIKVD 147 K I +++ RL+++L +I+ AQ+AF+ GR++ +N+ LA +++ Y ISPR +KVD Sbjct: 534 KVIARLLTDRLQRLLSGVISSAQSAFLPGRSLAENVLLATDLVHGYNWSNISPRGMLKVD 593 Query: 146 IKKAYDTVSWSFLHGILAGMGFPNLFISWVMECVTSASFSLCING 12 +KKA+D+V W F+ L + P FI+W+ +C+++ +F++ ING Sbjct: 594 LKKAFDSVRWEFVIAALRALAIPEKFINWISQCISTPTFTVSING 638 >gb|AAC67331.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 1449 Score = 323 bits (828), Expect = 4e-85 Identities = 198/633 (31%), Positives = 318/633 (50%), Gaps = 19/633 (3%) Frame = -3 Query: 1847 FIKEHNLNILCLLETKLEDDICFDSALCNFIVKTKFPGWLYSNNFHEYRNGRMLLIWNPS 1668 ++ E N CL+ET+++++ ++ F W N+ R GR+ ++W Sbjct: 438 WVDEQNFQFGCLIETRVKEENS------QWLGSKLFKDWSMLTNYEFNRRGRLWVVWR-E 490 Query: 1667 SVHVDILFISSQLIHCKIICNISHATFITSFVYGLYSVGDRRLLWDDLYNIGHSLI---D 1497 +V + S QLI C + F SFVY +R++LW+DL + S I Sbjct: 491 NVRFTPFYKSDQLITCSVKLESQEEEFFYSFVYASNFAEERKILWNDLRDHMDSPIIRDK 550 Query: 1496 
PWLILGDFNCIKSPDE--KLNGAAINNYDLKDFQDICLTLGLNDVQTTGCFFTWTN---- 1335 PW+I GDFN I DE ++ ++DFQ + +D+ + G FTW N Sbjct: 551 PWIIFGDFNEILDMDEHSRMEDHPAVTSGMRDFQSLVNYCSFSDLASHGPLFTWCNKRDN 610 Query: 1334 NSVWCRLDRALVNSAWAQLGLQISAHAPVPGAISDHXXXXXXXXXXXXXXS---KPFKFF 1164 + +W +LDR +VN AW + Q S + G SDH KPFKF Sbjct: 611 DPIWKKLDRVMVNEAWKMVYPQ-SYNVFEAGGCSDHLRCRINLNMNSGAQVRGNKPFKFV 669 Query: 1163 NMWTSHPSFLNVVQNAW--FNPVWGTAQ--FMXXXXXXXXXXXXXXXNSQHFSHISSRVK 996 N F +V+N W P+ + F + ++ R + Sbjct: 670 NAVADMEEFKPLVENFWRETEPIHMSTSSLFRFTKKLKALKPKLRGLAKEKMGNLVKRTR 729 Query: 995 EAKDTLDSLQNQLHLDPLNSSLSFEVKSQILKVISLTKAEKMFLSQKAKCSFLNLSDKNT 816 EA +L Q +P ++ E ++ + + + E+ +L Q +K +L + DKN Sbjct: 730 EAYLSLCQAQQSNSQNPSQRAMEIESEAYV-RWDRIASIEEKYLKQVSKLHWLKVGDKNN 788 Query: 815 KFFHSIIKRNALKRQLSSVSLLDGTQTTS---LKELSDAFVNFYKGLFGTYYSTNPIDQS 645 K FH A + + + DG+ T+ +K ++ F + L Y +++ Sbjct: 789 KTFHRAATARAAQNSIREIQKEDGSTATTKDDIKNETERFFQEFLQLIPNDYEGITVEKL 848 Query: 644 VINHGPCLTEEHANLLSAPVLPSDIKAAIFNIDDDRSPGPDGYSSAFFKKSWDIVGNDVI 465 + ++L+A V +I+ A+F++ +D+SPGPDGY+S F+K++WDI+G + + Sbjct: 849 TSLLPYHCSPAEKDMLTASVSAKEIRGALFSMPNDKSPGPDGYTSEFYKRAWDIIGAEFV 908 Query: 464 NAIQEFFSNAKLLKQINHTSITLIPKTDHSPSVADFRPIACCNVVYKAITKIIAARLEKV 285 A++ FF L K +N T + LIPK + + D+RPI+CCNV+YK I+KIIA RL+ V Sbjct: 909 LAVKSFFEKGFLPKGVNTTILALIPKKLEAKEMKDYRPISCCNVIYKVISKIIANRLKHV 968 Query: 284 LPSLINPAQAAFVGGRNITDNIFLAQEIIRNYARKRISPRCTIKVDIKKAYDTVSWSFLH 105 LP+ I Q+AFV R + +N+ LA E++++Y + IS RC IK+DI KA+D+V WSFL Sbjct: 969 LPNFIAGNQSAFVKDRLLIENLLLATELVKDYHKDTISGRCAIKIDISKAFDSVQWSFLK 1028 Query: 104 GILAGMGFPNLFISWVMECVTSASFSLCINGSL 6 +L+ + FP F+ WVM CVT+ASFS+ +NG L Sbjct: 1029 NVLSALDFPPEFVHWVMLCVTTASFSVQVNGEL 1061 >dbj|BAF00918.1| putative reverse transcriptase [Arabidopsis thaliana] Length = 910 Score = 317 bits (811), Expect = 4e-83 Identities = 200/650 (30%), Positives = 329/650 (50%), Gaps = 14/650 (2%) Frame = -3 Query: 1913 MKIAVWNVRGFKNPIKHNQLLAFIKEHNLNILCLLETKLEDDICFDSALCNFIVKTKFPG 1734 MK+ WN+RG + + + ++I +NL + C 
LET + + N ++ + PG Sbjct: 1 MKVFCWNIRGLNSRNRQRVVRSWIASNNLLVGCFLETHVAQENA------NSVLASTLPG 54 Query: 1733 WLYSNNFHEYRNGRMLLIWNPSSVHVDILFISSQLIHCKIICNISHATFITSFVYGLYSV 1554 W +N+ GR+ ++W+PS + V + + Q++ C I +F +FVYG S Sbjct: 55 WRMDSNYCCSELGRIWIVWDPS-ISVLVFKRTDQIMFCSIKIPSLLQSFAVAFVYGRNSE 113 Query: 1553 GDRRLLWDDLYNIGHSL---IDPWLILGDFNCIKSPDE--KLNGAAINNYDLKDFQDICL 1389 DRR LW+D+ + + + PWL+LGDFN I + E +N + +N ++D Q Sbjct: 114 LDRRSLWEDILVLSRTSPLSVTPWLLLGDFNQIAAASEHYSINQSLLNLRGMEDLQCCLR 173 Query: 1388 TLGLNDVQTTGCFFTWTN----NSVWCRLDRALVNSAWAQLGLQISAHAPVPGAISDHXX 1221 L+D+ + G FFTW+N N + +LDRAL N W + A PG SDH Sbjct: 174 DSQLSDLPSRGVFFTWSNHQQDNPILRKLDRALANGEWFAVFPSALAVFDPPGD-SDHAP 232 Query: 1220 XXXXXXXXXXXXSKPFKFFNMWTSHPSFLNVVQNAW-FNPVWGTAQFMXXXXXXXXXXXX 1044 K FK+F+ +SHPS+L + AW N + G+ F Sbjct: 233 CIILIDNQPPPSKKSFKYFSFLSSHPSYLAALSTAWEANTLVGSHMFSLRQHLKVAKLCC 292 Query: 1043 XXXNSQHFSHISSRVKEAKDTLDSLQNQLHLDPLNSSLSFEVKSQILKVISLTKAEKMFL 864 N FS+I R ++ L+ +Q +L P ++ E ++ + I A + F Sbjct: 293 RTLNRLRFSNIQQRTAQSLTRLEDIQVELLTSPSDTLFRREHVAR-KQWIFFAAALESFF 351 Query: 863 SQKAKCSFLNLSDKNTKFFHSIIKRNALKRQLSSVSLLDGTQTTSLKELSDAFVNFYKGL 684 QK++ +L+ D NT+FFH + + + + DG + ++ ++ + +Y L Sbjct: 352 RQKSRIRWLHEGDANTRFFHRAVIAHQATNLIKFLRGDDGFRVENVDQIKGMLIAYYSHL 411 Query: 683 FGTYYSTNPIDQSVINHGPCLTEEHANLLSAPV--LPSD--IKAAIFNIDDDRSPGPDGY 516 G S N SV L + L++ + +PS+ I +F++ +++PGPDG+ Sbjct: 412 LGIP-SENVTPFSVEKIKGLLPFRCDSFLASQLTTIPSEEEITQVLFSMPRNKAPGPDGF 470 Query: 515 SSAFFKKSWDIVGNDVINAIQEFFSNAKLLKQINHTSITLIPKTDHSPSVADFRPIACCN 336 FF ++W IV + V+ AI+EFF + L + N T+ITLIPK + + FRP+ACC Sbjct: 471 PVEFFIEAWAIVKSSVVAAIREFFISGNLPRGFNATAITLIPKVTGADRLTQFRPVACCT 530 Query: 335 VVYKAITKIIAARLEKVLPSLINPAQAAFVGGRNITDNIFLAQEIIRNYARKRISPRCTI 156 +YK IT+II+ RL+ + + Q F+ GR + +N+ LA E++ N+ + R + Sbjct: 531 TIYKVITRIISRRLKLFIDQAVQANQVGFIKGRLLCENVLLASELVDNFEADGETTRGCL 590 Query: 155 KVDIKKAYDTVSWSFLHGILAGMGFPNLFISWVMECVTSASFSLCINGSL 6 +VDI KAYD V+W FL IL + P +FI W+ C++SAS+S+ NG L Sbjct: 591 
QVDISKAYDNVNWEFLINILKALDLPLVFIHWIWVCISSASYSIAFNGEL 640 >dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis thaliana] Length = 1223 Score = 308 bits (788), Expect = 2e-80 Identities = 199/651 (30%), Positives = 309/651 (47%), Gaps = 20/651 (3%) Frame = -3 Query: 1898 WNVRGFKNPIKHNQLLAFIKEHNLNILCLLETKLEDDICFDSALCNFIVKTKFPGWLYSN 1719 WNVRG KH+ + +I+E+N CL+ET++++ + + +V F W Sbjct: 6 WNVRGLNKSSKHSVIKKWIEENNFQFGCLVETRVKE------SKVSQLVGKLFKDWSILT 59 Query: 1718 NFHEYRNGRMLLIWNPSSVHVDILFISSQLIHCKIICNISHATFITSFVYGLYSVGDRRL 1539 N+ R GR+ ++W +V + ++ S QL+ C + F SFVY V +R++ Sbjct: 60 NYEHNRRGRIWVLWR-KNVRLSPIYKSCQLLTCSVKLEDRQDEFFCSFVYASNYVEERKV 118 Query: 1538 LWDDLYNIGHSLI---DPWLILGDFNCIKSPDEKLNGAA--INNYDLKDFQDICLTLGLN 1374 LW +L + S I PW +LGDFN E + ++DFQ + L Sbjct: 119 LWSELKDHYDSPIIRHKPWTLLGDFNETLDIAEHSQSFVHPMVTPGMRDFQQVINYCSLT 178 Query: 1373 DVQTTGCFFTWTNNS----VWCRLDRALVNSAWAQLGLQISAHAPVPGAISDHXXXXXXX 1206 D+ G FTW N + +LDR L+N W Q Q S G SDH Sbjct: 179 DMAAQGPLFTWCNKREHGLIMKKLDRVLINDCWNQTFSQ-SYSVFEAGGCSDHLRCRISL 237 Query: 1205 XXXXXXXS---KPFKFFNMWTSHPSFLNVVQNAWFNP----VWGTAQFMXXXXXXXXXXX 1047 KPFKF N T F +V W + + + F Sbjct: 238 NSEAGNKVQGLKPFKFVNALTDMEDFKPMVSTYWKDTEPLILSTSTLFRFSKNLKGLKPK 297 Query: 1046 XXXXNSQHFSHISSRVKEAKDTLDSLQNQLHLDPLNSSLSFEVKSQILKVISLTKAEKMF 867 ++S + EA L + Q+ +P + ++ E + + + E+ + Sbjct: 298 IRSMARDRLGNLSKKANEAYKILCAKQHVNLTNPSSMAME-EENAAYSRWDRVAILEEKY 356 Query: 866 LSQKAKCSFLNLSDKNTKFFHSIIKRNALKRQLSSVSLLDGTQTTSLKELSDAFVNFYKG 687 L QK+K + + D+NTK FH + + DG T E+ F++ Sbjct: 357 LKQKSKLHWCQVGDQNTKAFHRAAAAREAHNTIREILSNDGIVKTKGDEIKAEAERFFRE 416 Query: 686 LF----GTYYSTNPIDQSVINHGPCLTEEHANLLSAPVLPSDIKAAIFNIDDDRSPGPDG 519 + + + C + +L+ PV +I+ +F + D+SPGPDG Sbjct: 417 FLQLIPNDFEGVTITELQQLLPVRCSDADQQSLIR-PVTAEEIRKVLFRMPSDKSPGPDG 475 Query: 518 YSSAFFKKSWDIVGNDVINAIQEFFSNAKLLKQINHTSITLIPKTDHSPSVADFRPIACC 339 Y+S FFK +W+I+G++ A+Q FF+ L K IN T + LIPK + + D+RPI+CC Sbjct: 476 
YTSEFFKATWEIIGDEFTLAVQSFFTKGFLPKGINSTILALIPKKTEAREMKDYRPISCC 535 Query: 338 NVVYKAITKIIAARLEKVLPSLINPAQAAFVGGRNITDNIFLAQEIIRNYARKRISPRCT 159 NV+YK I+KIIA RL+ VLP I Q+AFV R + +N+ LA E++++Y + IS RC Sbjct: 536 NVLYKVISKIIANRLKLVLPKFIAGNQSAFVKDRLLIENLLLATELVKDYHKDTISTRCA 595 Query: 158 IKVDIKKAYDTVSWSFLHGILAGMGFPNLFISWVMECVTSASFSLCINGSL 6 IK+DI KA+D+V W FL + +GFP FI W+ C+T+ASFS+ +NG L Sbjct: 596 IKIDISKAFDSVQWPFLINVFTILGFPREFIHWINICITTASFSVQVNGEL 646 >dbj|BAB01845.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis thaliana] Length = 893 Score = 307 bits (787), Expect = 2e-80 Identities = 205/655 (31%), Positives = 334/655 (50%), Gaps = 21/655 (3%) Frame = -3 Query: 1910 KIAVWNVRGFKNPIKHNQLLAFIKEHNLNILC---LLETKLEDDICFDSALCNFIVKTKF 1740 K+ WNVRGF N H + F K LN L+ET ++ + Sbjct: 4 KLFCWNVRGF-NISSHRR--GFKKWFLLNKPLFGGLIETHVKQP------KEKKFISNLL 54 Query: 1739 PGWLYSNNFHEYRNGRMLLIWNPSSVHVDILFISSQLIHCKIICNISHATFITSFVYGLY 1560 PGW + N+ G++ ++W+PS V V ++ S Q+I C+++ S + F+ S VY Sbjct: 55 PGWSFVENYEFSVLGKIWVLWDPS-VKVVVIGRSLQMITCELLLPDSPSWFVVSIVYASN 113 Query: 1559 SVGDRRLLWDDLYNIGHSLI---DPWLILGDFNCIKSPDEKLNGAAINNYDLKDFQDICL 1389 G R+ LW++L + S + W++LGDFN I +P+ +N A ++ F+ L Sbjct: 114 EEGTRKELWNELVQLALSPVVVGRSWIVLGDFNQILNPESAIN--ANIGRKIRAFRSCLL 171 Query: 1388 TLGLNDVQTTGCFFTWTNNS----VWCRLDRALVNSAWAQLGLQISAHAPVPGAISDHXX 1221 L D+ G +TW N + ++DR LVN W L A+ P SDH Sbjct: 172 DSDLYDLVYKGSSYTWWNKCSSRPLAKKIDRILVNDHWNTLFPSAYANFGEPD-FSDHSS 230 Query: 1220 XXXXXXXXXXXXSKPFKFFNMWTSHPSFLNVVQNAWFN-PVWGTAQFMXXXXXXXXXXXX 1044 +PF+FFN + +P FL +++ W++ V G+A + Sbjct: 231 CEVVLDPAVLKAKRPFRFFNYFLHNPDFLQLIRENWYSCNVSGSAMYRVSKKLKHLKLPI 290 Query: 1043 XXXNSQHFSHISSRVKEAKDTLDSLQNQLHLDPLNSSLSFEVKSQILKVISLTKAEKMFL 864 + +++S I RV EA + Q +P + E+++ K L KAE+ F Sbjct: 291 CCFSRENYSDIEKRVSEAHAIVLHRQRITLTNPSVVHATLELEAT-RKWQILAKAEESFF 349 Query: 863 SQKAKCSFLNLSDKNTKFFHSIIKRNALKRQLSSVSLLDG-------TQTTSLKELSDAF 705 QK+ S+L D NT +FH K +++ +++++ L TQ + + + Sbjct: 350 
CQKSSISWLYEGDNNTAYFH---KMADMRKSINTINFLIDDFGERIETQQGIKEGIKEHS 406 Query: 704 VNFYKGLFGTYYSTNPIDQSVIN---HGPCLTEEHANLLSAPVLPSDIKAAIFNIDDDRS 534 NF++ L N + QS +N C ++ N L DI+ A F++ +++ Sbjct: 407 CNFFESLLCGVEGENSLAQSDMNLLLSFRCSVDQ-INDLERSFSDLDIQEAFFSLPRNKA 465 Query: 533 PGPDGYSSAFFKKSWDIVGNDVINAIQEFFSNAKLLKQINHTSITLIPKTDHSPSVADFR 354 GPDGYSS FFK W +VG +V A+QEFF + +LLKQ N T++ LIPK +S + DFR Sbjct: 466 SGPDGYSSEFFKGVWFVVGPEVTEAVQEFFRSGQLLKQWNATTLVLIPKITNSSKMTDFR 525 Query: 353 PIACCNVVYKAITKIIAARLEKVLPSLINPAQAAFVGGRNITDNIFLAQEIIRNYARKRI 174 PI+C N +YK I K++ +RL+K+L +I+P+Q+AF+ GR +++N+ LA EI+ Y K I Sbjct: 526 PISCLNTLYKVIAKLLTSRLKKLLNEVISPSQSAFLPGRLLSENVLLATEIVHGYNTKNI 585 Query: 173 SPRCTIKVDIKKAYDTVSWSFLHGILAGMGFPNLFISWVMECVTSASFSLCINGS 9 S R +KVD++KA+D+V W F+ + P F+ W+ +C+++ FS+ +NGS Sbjct: 586 SSRGMLKVDLRKAFDSVRWDFIISAFRALAVPEKFVCWINQCISTPYFSVMVNGS 640 >emb|CAA66812.1| non-ltr retrotransposon reverse transcriptase-like protein [Arabidopsis thaliana] Length = 893 Score = 307 bits (787), Expect = 2e-80 Identities = 205/655 (31%), Positives = 334/655 (50%), Gaps = 21/655 (3%) Frame = -3 Query: 1910 KIAVWNVRGFKNPIKHNQLLAFIKEHNLNILC---LLETKLEDDICFDSALCNFIVKTKF 1740 K+ WNVRGF N H + F K LN L+ET ++ + Sbjct: 4 KLFCWNVRGF-NISSHRR--GFKKWFLLNKPLFGGLIETHVKQP------KEKKFISNLL 54 Query: 1739 PGWLYSNNFHEYRNGRMLLIWNPSSVHVDILFISSQLIHCKIICNISHATFITSFVYGLY 1560 PGW + N+ G++ ++W+PS V V ++ S Q+I C+++ S + F+ S VY Sbjct: 55 PGWSFVENYEFSVLGKIWVLWDPS-VKVVVIGRSLQMITCELLLPDSPSWFVVSIVYASN 113 Query: 1559 SVGDRRLLWDDLYNIGHSLI---DPWLILGDFNCIKSPDEKLNGAAINNYDLKDFQDICL 1389 G R+ LW++L + S + W++LGDFN I +P+ +N A ++ F+ L Sbjct: 114 EEGTRKELWNELVQLALSPVVVGRSWIVLGDFNQILNPESAIN--ANIGRKIRAFRSCLL 171 Query: 1388 TLGLNDVQTTGCFFTWTNNS----VWCRLDRALVNSAWAQLGLQISAHAPVPGAISDHXX 1221 L D+ G +TW N + ++DR LVN W L A+ P SDH Sbjct: 172 DSDLYDLVYKGSSYTWWNKCSSRPLAKKIDRILVNDHWNTLFPSAYANFGEPD-FSDHSS 230 Query: 1220 XXXXXXXXXXXXSKPFKFFNMWTSHPSFLNVVQNAWFN-PVWGTAQFMXXXXXXXXXXXX 1044 +PF+FFN + +P FL +++ W++ V G+A + Sbjct: 231 
CEVVLDPAVLKAKRPFRFFNYFLHNPDFLQLIRENWYSCNVSGSAMYRVSKKLKHLKLPI 290 Query: 1043 XXXNSQHFSHISSRVKEAKDTLDSLQNQLHLDPLNSSLSFEVKSQILKVISLTKAEKMFL 864 + +++S I RV EA + Q +P + E+++ K L KAE+ F Sbjct: 291 CCFSRENYSDIEKRVSEAHAIVLHRQRITLTNPSVVHATLELEAT-RKWQILAKAEESFF 349 Query: 863 SQKAKCSFLNLSDKNTKFFHSIIKRNALKRQLSSVSLLDG-------TQTTSLKELSDAF 705 QK+ S+L D NT +FH K +++ +++++ L TQ + + + Sbjct: 350 CQKSSISWLYEGDNNTAYFH---KMADMRKSINTINFLIDDFGERIETQQGIKEGIKEHS 406 Query: 704 VNFYKGLFGTYYSTNPIDQSVIN---HGPCLTEEHANLLSAPVLPSDIKAAIFNIDDDRS 534 NF++ L N + QS +N C ++ N L DI+ A F++ +++ Sbjct: 407 CNFFESLLCGVEGENSLAQSDMNLLLSFRCSVDQ-INDLERSFSDLDIQEAFFSLPRNKA 465 Query: 533 PGPDGYSSAFFKKSWDIVGNDVINAIQEFFSNAKLLKQINHTSITLIPKTDHSPSVADFR 354 GPDGYSS FFK W +VG +V A+QEFF + +LLKQ N T++ LIPK +S + DFR Sbjct: 466 SGPDGYSSEFFKGVWFVVGPEVTEAVQEFFRSGQLLKQWNATTLVLIPKITNSSKMTDFR 525 Query: 353 PIACCNVVYKAITKIIAARLEKVLPSLINPAQAAFVGGRNITDNIFLAQEIIRNYARKRI 174 PI+C N +YK I K++ +RL+K+L +I+P+Q+AF+ GR +++N+ LA EI+ Y K I Sbjct: 526 PISCLNTLYKVIAKLLTSRLKKLLNEVISPSQSAFLPGRLLSENVLLATEIVHGYNTKNI 585 Query: 173 SPRCTIKVDIKKAYDTVSWSFLHGILAGMGFPNLFISWVMECVTSASFSLCINGS 9 S R +KVD++KA+D+V W F+ + P F+ W+ +C+++ FS+ +NGS Sbjct: 586 SSRGMLKVDLRKAFDSVRWDFIISAFRALAVPEKFVCWINQCISTPYFSVMVNGS 640 >gb|AAD32866.1|AC005489_4 F14N23.4 [Arabidopsis thaliana] Length = 1161 Score = 303 bits (775), Expect = 5e-79 Identities = 195/641 (30%), Positives = 322/641 (50%), Gaps = 14/641 (2%) Frame = -3 Query: 1886 GFKNPIKHNQLLAFIKEHNLNILCLLETKLEDDICFDSALCNFIVKTKFPGWLYSNNFHE 1707 G + + + ++I +NL + C LET + + N ++ + PGW +N+ Sbjct: 53 GLNSRNRQRVVRSWIASNNLLVGCFLETHVAQENA------NSVLASTLPGWRMDSNYCC 106 Query: 1706 YRNGRMLLIWNPSSVHVDILFISSQLIHCKIICNISHATFITSFVYGLYSVGDRRLLWDD 1527 GR+ ++W+PS + V + + Q++ C I +F +FVYG S DRR LW+D Sbjct: 107 SELGRIWIVWDPS-ISVLVFKRTDQIMFCSIKIPSLLQSFAVAFVYGRNSELDRRSLWED 165 Query: 1526 LYNIGHSL---IDPWLILGDFNCIKSPDE--KLNGAAINNYDLKDFQDICLTLGLNDVQT 1362 + + + + PWL+LGDFN I + E +N + +N ++D Q L+D+ + Sbjct: 166 
ILVLSRTSPLSVTPWLLLGDFNQIAAASEHYSINQSLLNLRGMEDLQCCLRDSQLSDLPS 225 Query: 1361 TGCFFTWTN----NSVWCRLDRALVNSAWAQLGLQISAHAPVPGAISDHXXXXXXXXXXX 1194 G FFTW+N N + +LDRAL N W + A PG SDH Sbjct: 226 RGVFFTWSNHQQDNPILRKLDRALANGEWFAVFPSALAVFDPPGD-SDHAPCIILIDNQP 284 Query: 1193 XXXSKPFKFFNMWTSHPSFLNVVQNAWF-NPVWGTAQFMXXXXXXXXXXXXXXXNSQHFS 1017 K FK+F+ +SHPS+L + AW N + G+ F N FS Sbjct: 285 PPSKKSFKYFSFLSSHPSYLAALSTAWEENTLVGSHMFSLRQHLKVAKLCCRTLNRLRFS 344 Query: 1016 HISSRVKEAKDTLDSLQNQLHLDPLNSSLSFEVKSQILKVISLTKAEKMFLSQKAKCSFL 837 +I R ++ L+ +Q +L P ++ E ++ + I A + F QK++ +L Sbjct: 345 NIQQRTAQSLTRLEDIQVELLTSPSDTLFRREHVAR-KQWIFFAAALESFFRQKSRIRWL 403 Query: 836 NLSDKNTKFFHSIIKRNALKRQLSSVSLLDGTQTTSLKELSDAFVNFYKGLFGTYYSTNP 657 + D NT+FFH + + + + DG + ++ ++ + +Y L G S N Sbjct: 404 HEGDANTRFFHRAVIAHQATNLIKFLRGDDGFRVENVDQIKGMLIAYYSHLLGIP-SENV 462 Query: 656 IDQSVINHGPCLTEEHANLLSAPV--LPSD--IKAAIFNIDDDRSPGPDGYSSAFFKKSW 489 SV L + L++ + +PS+ I +F++ +++PGPDG+ FF ++W Sbjct: 463 TPFSVEKIKGLLPFRCDSFLASQLTTIPSEEEITQVLFSMPRNKAPGPDGFPVEFFIEAW 522 Query: 488 DIVGNDVINAIQEFFSNAKLLKQINHTSITLIPKTDHSPSVADFRPIACCNVVYKAITKI 309 IV + V+ AI+EFF + L + N T+ITLIPK + + FRP+ACC +YK IT+I Sbjct: 523 AIVKSSVVAAIREFFISGNLPRGFNATAITLIPKVTGADRLTQFRPVACCTTIYKVITRI 582 Query: 308 IAARLEKVLPSLINPAQAAFVGGRNITDNIFLAQEIIRNYARKRISPRCTIKVDIKKAYD 129 I+ RL+ + + Q F+ GR + +N+ LA E++ N+ + R ++VDI KAYD Sbjct: 583 ISRRLKLFIDQAVQANQVGFIKGRLLCENVLLASELVDNFEADGETTRGCLQVDISKAYD 642 Query: 128 TVSWSFLHGILAGMGFPNLFISWVMECVTSASFSLCINGSL 6 V+W FL IL + P +FI W+ C++SAS+S+ NG L Sbjct: 643 NVNWEFLINILKALDLPLVFIHWIWVCISSASYSIAFNGEL 683 >emb|CCA65995.1| hypothetical protein [Beta vulgaris subsp. 
vulgaris] Length = 1389 Score = 302 bits (773), Expect = 9e-79 Identities = 196/656 (29%), Positives = 319/656 (48%), Gaps = 22/656 (3%) Frame = -3 Query: 1913 MKIAVWNVRGFKNPIKHNQLLAFIKEHNLNILCLLETKLEDDIC-FDSALCNFIVKTKFP 1737 M IA WNVRG + F K +N+ IL L ETK + + F+ P Sbjct: 1 MSIAFWNVRGGCRKNVMEECSDFCKNNNIKILMLCETKSQSPPSQLAVSAAGFLHHDSIP 60 Query: 1736 GWLYSNNFHEYRNGRMLLIW-----NPSSVHVDILFISSQLIHCKIICNISHATFITSFV 1572 YS G + L W NP S+ V ++ S + I C I + F+ F+ Sbjct: 61 AMGYS--------GGLWLFWRDCILNPFSLVV--IYKSVRFIACSINLLNQNLQFVAIFI 110 Query: 1571 YGLYSVGDRRLLWDDLYNIGHSLIDPWLILGDFNCIKSPDEKLNGAAINNYDLKDFQDIC 1392 Y + WD+L SL P++ILGDFN I SP +KL GA ++ Q++ Sbjct: 111 YAPAQKEFKSSFWDELIAYVSSLSFPFIILGDFNEINSPSDKLGGAPFSSSRAYYMQNLF 170 Query: 1391 LTLGLNDVQTTGCFFTWTN-----NSVWCRLDRALVNSAWAQLGLQISAHAPVPGAI--- 1236 + ++ TG FTW N++ RLDR + +++W L + HA + I Sbjct: 171 SQVDCTEISFTGQIFTWRKKKDGPNNIHERLDRGVASTSW----LMLFPHAFLKHHIFTS 226 Query: 1235 SDHXXXXXXXXXXXXXXSKPFKFFNMWTSHPSFLNVVQNAWFNPVWGTAQFMXXXXXXXX 1056 SDH + PF+F MW + + ++V+ W +G+ F Sbjct: 227 SDHCQISLEYLANNKSKAPPFRFEKMWCTRKDYDSLVKRTWCTKFYGSHMFNFVQKCKLV 286 Query: 1055 XXXXXXXNSQHFSHISSRVKEAKDTLDSLQNQLHLDPLNSSLSFEVKSQILKVISLTKAE 876 N F +I ++++ + L+ +Q L +D N+SL + + + K L + Sbjct: 287 KINSKEWNKTQFGNIFRQLRQVDERLEEIQRNLLIDHNNTSLKTQQELFLAKRNKLLEYN 346 Query: 875 KMFLSQKAKCSFLNLSDKNTKFFHSIIKRNALKRQLSSVSLLDGTQTTSLKELSDAFVNF 696 + QK K F+ L D N+KF+H+ + Q+ + D Q + +L + + Sbjct: 347 TTYWKQKCKSDFMVLGDTNSKFYHTHASIRKYRNQIKEF-IPDNAQPITQPDLIEKEITL 405 Query: 695 YKGLFGTYYSTNP-------IDQSVINHGPCLTEEHANLLSAPVLPSDIKAAIFNIDDDR 537 F + +NP +D ++++ P ++E L++ V P +IK A+F++ D+ Sbjct: 406 ---AFKKRFISNPACKFNQNVDFNLLS--PIVSEADNAYLTSAVSPEEIKNAVFDLAPDK 460 Query: 536 SPGPDGYSSAFFKKSWDIVGNDVINAIQEFFSNAKLLKQINHTSITLIPKTDHSPSVADF 357 SPGPDG+ FF+K W ++G V A+Q FF + +LK++NHT + LIPK D + F Sbjct: 461 SPGPDGFPPYFFQKYWTLIGKSVCRAVQAFFHSGYMLKEVNHTFLALIPKVDKPVNANHF 520 Query: 356 RPIACCNVVYKAITKIIAARLEKVLPSLINPAQAAFVGGRNITDNIFLAQEIIRNYARKR 177 RPI+ C+ +YK I+KII RL+ L +I+P Q 
AF+ R I DNI +A E+ ++ K Sbjct: 521 RPISLCSTIYKVISKIITNRLKITLGKIIHPLQGAFIPERLIQDNILIAHEVFHSFKNKT 580 Query: 176 -ISPRCTIKVDIKKAYDTVSWSFLHGILAGMGFPNLFISWVMECVTSASFSLCING 12 IK+D++KAYD + W +++ + MGF ++I W+ C++SASFS+ +NG Sbjct: 581 GRGGWIAIKLDMEKAYDRLEWKYIYTTMDKMGFSPIWIEWIRSCISSASFSVLVNG 636 >emb|CAA18234.1| putative protein [Arabidopsis thaliana] gi|7269488|emb|CAB79491.1| putative protein [Arabidopsis thaliana] Length = 1141 Score = 293 bits (749), Expect = 6e-76 Identities = 182/593 (30%), Positives = 309/593 (52%), Gaps = 12/593 (2%) Frame = -3 Query: 1754 VKTKFPGWLYSNNFHEYRNGRMLLIWNPSSVHVDILFISSQLIHCKIICNISHATFITSF 1575 + PGW + N+ G++ ++W+PS V V I+ S Q+I C+++ S + S Sbjct: 42 INALLPGWFFDENYGFSDLGKIWVLWDPS-VEVVIVAKSLQMITCEVLFPNSRTWIVISV 100 Query: 1574 VYGLYSVGDRRLLWDDLYNIGHSLID---PWLILGDFNCIKSPDEKLNGAAIN-NYDLKD 1407 VY R+ LW ++ + S + PW++LGDFN + P E ++N + ++D Sbjct: 101 VYAANEDDKRKELWREITALVASPVTFNRPWILLGDFNQVLHPHEHSRHVSLNVDRRIRD 160 Query: 1406 FQDICLTLGLNDVQTTGCFFTWTNNS----VWCRLDRALVNSAWAQLGLQISAHAPVPGA 1239 F++ L L+D+ G FTW N S V ++DR LVN +W+ L S P Sbjct: 161 FRECLLDAELSDLVYKGSSFTWWNKSKTRPVAKKIDRILVNESWSNL-FPSSFGLFGPPD 219 Query: 1238 ISDHXXXXXXXXXXXXXXSKPFKFFNMWTSHPSFLNVVQNAWFNP-VWGTAQFMXXXXXX 1062 SDH +PFKFFN +P FLN+V + W++ V G++ F Sbjct: 220 FSDHASCGVVLELDPIKAKRPFKFFNFLLKNPEFLNLVWDVWYSTNVVGSSMFRVSKKLK 279 Query: 1061 XXXXXXXXXNSQHFSHISSRVKEAKDTLDSLQNQLHLDPLNSSLSFEVKSQILKVISLTK 882 + ++S++ R +EA +TL S QN +P + + E+++Q K L Sbjct: 280 ALKKPIKDFSRLNYSNLEKRTEEAHETLLSFQNLTLDNPSLENAAHELEAQ-RKWQILAT 338 Query: 881 AEKMFLSQKAKCSFLNLSDKNTKFFHSIIKRNALKRQLSSVSLLDGTQTTSLKELSDAFV 702 AE+ F Q+++ ++ D NT++FH + ++++ GTQ S + ++D Sbjct: 339 AEESFFRQRSRVTWFAEGDGNTRYFHRMADSRKSVNTITTLVDDSGTQIDSQQGIADHCA 398 Query: 701 NFYKGLFGTY---YSTNPIDQSVINHGPCLTEEHANLLSAPVLPSDIKAAIFNIDDDRSP 531 +++ L YS D +++ C + A+L A DIKAA F + +++ Sbjct: 399 LYFENLLSDDNDPYSLEQDDMNLLLTYRCPYSQVADL-EAMFSDEDIKAAFFGLPSNKAC 457 Query: 530 GPDGYSSAFFKKSWDIVGNDVINAIQEFFSNAKLLKQINHTSITLIPKTDHSPSVADFRP 351 GPDG+ V 
A++EFF + LLKQ N T+I LIPK ++ +DFRP Sbjct: 458 GPDGFP--------------VTAAVREFFISGNLLKQWNATTIVLIPKFPNASCTSDFRP 503 Query: 350 IACCNVVYKAITKIIAARLEKVLPSLINPAQAAFVGGRNITDNIFLAQEIIRNYARKRIS 171 I+C N +YK I +++ RL+K+L +I+P+Q+AF+ GR + +N+ LA E++ Y + IS Sbjct: 504 ISCMNTLYKVIARLLTDRLQKLLSCVISPSQSAFLPGRLLAENVLLATEMVHGYNWRNIS 563 Query: 170 PRCTIKVDIKKAYDTVSWSFLHGILAGMGFPNLFISWVMECVTSASFSLCING 12 R +KVD++KA+D+V W F+ L +G P FI+W+ +C+++ +F++ +NG Sbjct: 564 LRGMLKVDLRKAFDSVRWEFIIAALLALGVPTKFINWIHQCISTPTFTVSVNG 616 >gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao] Length = 2215 Score = 286 bits (731), Expect = 7e-74 Identities = 168/564 (29%), Positives = 281/564 (49%), Gaps = 1/564 (0%) Frame = -3 Query: 1700 NGRMLLIWNPSSVHVDILFISSQLIHCKIICNISHATFITSFVYGLYSVGDRRLLWDDLY 1521 N + + +++ ++L Q +H ++ T+FVY + +R LW+ L Sbjct: 909 NSQKIWLFHSVEFICEVLLDHPQCLHVRVTIPWLDLPIFTTFVYAKCTRSERTPLWNCLR 968 Query: 1520 NIGHSLIDPWLILGDFNCIKSPDEKLNGAAINNYDLKDFQDICLTLGLNDVQTTGCFFTW 1341 N+ + PW++ GDFN I +E+L GA + ++DF + L GL D G FTW Sbjct: 969 NLAADMEGPWIVGGDFNIILKREERLYGADPHEGSIEDFASVLLDCGLLDGGFEGNPFTW 1028 Query: 1340 TNNSVWCRLDRALVNSAWA-QLGLQISAHAPVPGAISDHXXXXXXXXXXXXXXSKPFKFF 1164 TNN ++ RLDR + N W + + H G SDH F+F Sbjct: 1029 TNNRMFQRLDRMVYNQQWINKFPITRIQHLNRDG--SDHCPLLLSCSNSSEKAPSSFRFL 1086 Query: 1163 NMWTSHPSFLNVVQNAWFNPVWGTAQFMXXXXXXXXXXXXXXXNSQHFSHISSRVKEAKD 984 + W H +F V+ W P+ G+ N F I S +KEA+ Sbjct: 1087 HAWALHHNFNASVEGNWNLPINGSGLMAFWSKQKRLKQHLKWWNKTVFGDIFSNIKEAEK 1146 Query: 983 TLDSLQNQLHLDPLNSSLSFEVKSQILKVISLTKAEKMFLSQKAKCSFLNLSDKNTKFFH 804 ++ + LH ++ ++ E++F QK+ ++ ++NTKFFH Sbjct: 1147 RVEECEI-LHQQEQTIGSRIQLNKSYAQLNKQLSMEEIFWKQKSGVKWVVEGERNTKFFH 1205 Query: 803 SIIKRNALKRQLSSVSLLDGTQTTSLKELSDAFVNFYKGLFGTYYSTNPIDQSVINHGPC 624 +++ ++ + + DG ++L + ++F+ L + QS + Sbjct: 1206 MRMQKKRIRSHIFKIQEQDGNWIEDPEQLQQSAIDFFSSLLKAESCDDTRFQSSLCPSII 1265 Query: 623 LTEEHANLLSAPVLPSDIKAAIFNIDDDRSPGPDGYSSAFFKKSWDIVGNDVINAIQEFF 444 ++ L + P L ++K A+F ID + + GPDG+SS F+++ WDI+ +D+ A++EFF Sbjct: 
1266 SDTDNGFLCAEPTL-QEVKEAVFGIDPESAAGPDGFSSHFYQQCWDIIAHDLFEAVKEFF 1324 Query: 443 SNAKLLKQINHTSITLIPKTDHSPSVADFRPIACCNVVYKAITKIIAARLEKVLPSLINP 264 A + + + T++ LIPKT + ++FRPI+ C V+ K ITKI+A RL K+LPS+I Sbjct: 1325 HGADIPQGMTSTTLVLIPKTTSASKWSEFRPISLCTVMNKIITKILANRLAKILPSIITE 1384 Query: 263 AQAAFVGGRNITDNIFLAQEIIRNYARKRISPRCTIKVDIKKAYDTVSWSFLHGILAGMG 84 Q+ FVGGR I+DNI LAQE+I +K +K+D+ KAYD + WSFL +L +G Sbjct: 1385 NQSGFVGGRLISDNILLAQELIGKLDQKNRGGNVALKLDMMKAYDRLDWSFLFKVLQHLG 1444 Query: 83 FPNLFISWVMECVTSASFSLCING 12 F +I + +C+++ FSL +NG Sbjct: 1445 FNAQWIGMIQKCISNCWFSLLLNG 1468 Score = 72.4 bits (176), Expect = 2e-09 Identities = 40/125 (32%), Positives = 62/125 (49%), Gaps = 7/125 (5%) Frame = -2 Query: 3003 TLMLKIFSNDTVFS-KNDFSTVPIWVKFPFLPAPCWQSKALGKICDKIGSPICCDRQTFD 2827 T +++F F + + + VP+W+ FP L A ++ AL I +G P+ D T + Sbjct: 53 TQKMRVFKWTPEFEPEKESAVVPVWISFPNLKAHLFEKSALLLIAKTVGKPLFVDEATAN 112 Query: 2826 KSRPAFARILVDIDPTKKPPDSVQIQFSGGKS------FDQKVEYEFIPKYCKKCACFGH 2665 SRP+ AR+ V+ D K P D V I K+ + Q+VE+ +P YC C GH Sbjct: 113 GSRPSVARVCVEYDCRKSPVDQVWIVVQNRKTGEVMNGYSQRVEFAQMPAYCDHCCHVGH 172 Query: 2664 YFDNC 2650 +C Sbjct: 173 KETDC 177 >gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao] Length = 2251 Score = 285 bits (730), Expect = 9e-74 Identities = 167/569 (29%), Positives = 284/569 (49%), Gaps = 3/569 (0%) Frame = -3 Query: 1700 NGRMLLIWNPSSVHVDILFISSQLIHCKIICNISHATFITSFVYGLYSVGDRRLLWDDLY 1521 + + + +++ +H DI+ Q +H ++ F +FVY + +R LLWD L Sbjct: 946 SSQKIWLFHSLELHSDIILDHPQCLHVRLTSPWLEKPFFATFVYAKCTRSERTLLWDCLR 1005 Query: 1520 NIGHSLIDPWLILGDFNCIKSPDEKLNGAAINNYDLKDFQDICLTLGLNDVQTTGCFFTW 1341 + +PWL+ GDFN I +E+L G+A + ++DF + L GL D G FTW Sbjct: 1006 RLAADNEEPWLVGGDFNIILKREERLYGSAPHEGSMEDFASVLLDCGLLDGGFEGNPFTW 1065 Query: 1340 TNNSVWCRLDRALVNSAWAQLGLQISAHAPVPGAISDHXXXXXXXXXXXXXXSKPFKFFN 1161 TNN ++ RLDR + N W + I+ + SDH F+F + Sbjct: 1066 TNNRMFQRLDRVVYNHQWINM-FPITRIQHLNRDGSDHCPLLISCFISSEKSPSSFRFQH 1124 Query: 1160 
MWTSHPSFLNVVQNAWFNPVWGTAQFMXXXXXXXXXXXXXXXNSQHFSHISSRVKEAKDT 981 W H F V+ W P+ G+ N F I S++KEA+ Sbjct: 1125 AWVLHHDFKTSVEGNWNLPINGSGLQAFWIKQHRLKQHLKWWNKAVFGDIFSKLKEAEKR 1184 Query: 980 LDSLQNQLHLDPLNSSLSFEVKSQILKVISLTKAEKMFLSQKAKCSFLNLSDKNTKFFHS 801 ++ + LH + ++ E++F QK+ ++ ++NTKFFH Sbjct: 1185 VEECEI-LHQQEQTVGSRINLNKSYAQLNKQLNVEEIFWKQKSGVKWVVEGERNTKFFHM 1243 Query: 800 IIKRNALKRQLSSVSLLDGTQTTSLKELSDAFVNFYKGLFGTYYSTNPIDQSVINHG--P 627 +++ ++ + V DG ++L + + ++ L P D S + P Sbjct: 1244 RMQKKRIRSHIFKVQEPDGRWIEDQEQLKQSAIEYFSSLL----KAEPCDISRFQNSLIP 1299 Query: 626 CLTEEHAN-LLSAPVLPSDIKAAIFNIDDDRSPGPDGYSSAFFKKSWDIVGNDVINAIQE 450 + N LL A ++K A+F+ID + + GPDG+SS F+++ W+ + +D+++A+++ Sbjct: 1300 SIISNSENELLCAEPNLQEVKDAVFDIDPESAAGPDGFSSYFYQQCWNTIAHDLLDAVRD 1359 Query: 449 FFSNAKLLKQINHTSITLIPKTDHSPSVADFRPIACCNVVYKAITKIIAARLEKVLPSLI 270 FF A + + + T++ L+PK + ++FRPI+ C V+ K ITK+++ RL K+LPS+I Sbjct: 1360 FFHGANIPRGVTSTTLVLLPKKSSASKWSEFRPISLCTVMNKIITKLLSNRLAKILPSII 1419 Query: 269 NPAQAAFVGGRNITDNIFLAQEIIRNYARKRISPRCTIKVDIKKAYDTVSWSFLHGILAG 90 Q+ FVGGR I+DNI LAQE+IR K +K+D+ KAYD + WSFL +L Sbjct: 1420 TENQSGFVGGRLISDNILLAQELIRKLDTKSRGGNLALKLDMMKAYDRLDWSFLIKVLQH 1479 Query: 89 MGFPNLFISWVMECVTSASFSLCINGSLK 3 GF +I + +C+++ FSL +NG ++ Sbjct: 1480 FGFNEQWIGMIQKCISNCWFSLLLNGRIE 1508 Score = 67.4 bits (163), Expect = 5e-08 Identities = 34/110 (30%), Positives = 56/110 (50%), Gaps = 6/110 (5%) Frame = -2 Query: 2961 KNDFSTVPIWVKFPFLPAPCWQSKALGKICDKIGSPICCDRQTFDKSRPAFARILVDIDP 2782 + + + VP+W+ FP L A ++ AL I +G P+ D T + SRP+ AR+ ++ D Sbjct: 68 EKESAVVPVWIAFPNLKAHLFEKSALLLIAKTVGKPLFVDEATANGSRPSVARVCIEYDC 127 Query: 2781 TKKPPDSVQIQFSGGKS------FDQKVEYEFIPKYCKKCACFGHYFDNC 2650 + P D V I ++ + Q+VE+ +P YC C GH +C Sbjct: 128 RRSPIDQVWIVVQNRETGTVTSGYPQRVEFSQMPAYCDHCCHVGHKEIDC 177 >dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like [Arabidopsis thaliana] Length = 1072 Score = 284 bits (727), Expect = 2e-73 Identities = 166/504 (32%), Positives = 279/504 (55%), Gaps = 11/504 (2%) Frame 
= -3 Query: 1487 ILGDFNCIKSPDEKLNGAAIN-NYDLKDFQDICLTLGLNDVQTTGCFFTWTNNS----VW 1323 +LGDFN + P E N ++N + ++DF + L+D+ G FTW N S + Sbjct: 1 MLGDFNQVLLPQEHSNPPSLNIDRRMRDFGSCLSEMELSDLVFKGNSFTWWNKSSIRPIA 60 Query: 1322 CRLDRALVNSAWAQLGLQISAHAPVPGA-ISDHXXXXXXXXXXXXXXSKPFKFFNMWTSH 1146 +LDR L N +W L S+H SDH +PFKFFN + Sbjct: 61 KKLDRILANDSWCNL--YPSSHGLFGNLDFSDHVSCGVVLEANGISAKRPFKFFNFLLKN 118 Query: 1145 PSFLNVVQNAWFNP-VWGTAQFMXXXXXXXXXXXXXXXNSQHFSHISSRVKEAKDTLDSL 969 FLNVV + WF+ V G++ + + ++S I R KEA + L + Sbjct: 119 EDFLNVVMDNWFSTNVVGSSMYRVSKKLKAMKKPIKDFSRLNYSGIELRTKEAHELLITC 178 Query: 968 QNQLHLDPLNSSLSFEVKSQILKVISLTKAEKMFLSQKAKCSFLNLSDKNTKFFHSIIKR 789 QN +P S+ + E+++Q K + L+ AE+ F Q+++ S+ D NT +FH ++ Sbjct: 179 QNLTLANPSVSNAALELEAQ-RKWVLLSCAEESFFHQRSRVSWFAEGDSNTHYFHRMVDS 237 Query: 788 NALKRQLSSVSLLDGTQTTSLKELSDAFVNFYKGLFGTYYSTNPIDQSVINHGPCLT--- 618 ++S+ +G S + + D V +Y+ L G+ S ++Q +N LT Sbjct: 238 RKSFNTINSLVDSNGLLIDSQQGILDHCVTYYERLLGSIESPFSMEQEDMNL--LLTYRC 295 Query: 617 -EEHANLLSAPVLPSDIKAAIFNIDDDRSPGPDGYSSAFFKKSWDIVGNDVINAIQEFFS 441 ++ + L +IKAA ++ +++ GPDGYS FF+ +W I+G +V+ AI EFF Sbjct: 296 SQDQCSELEKSFTDDEIKAAFKSLPRNKTSGPDGYSVEFFRDTWSIIGPEVLAAIHEFFD 355 Query: 440 NAKLLKQINHTSITLIPKTDHSPSVADFRPIACCNVVYKAITKIIAARLEKVLPSLINPA 261 + +LLKQ N T++ LIPKT ++ ++++FRPI+C N +YK I+K++ +RL+ +L ++I + Sbjct: 356 SGQLLKQWNATTLVLIPKTSNACTISEFRPISCLNTLYKVISKLLTSRLQGLLSAVIGHS 415 Query: 260 QAAFVGGRNITDNIFLAQEIIRNYARKRISPRCTIKVDIKKAYDTVSWSFLHGILAGMGF 81 Q+AF+ GR++ +N+ LA E++ Y R ISPR +KVD+KKA+D+V W F+ L + Sbjct: 416 QSAFLPGRSLAENVLLATEMVHGYNRLNISPRGMLKVDLKKAFDSVKWEFVTAALRALAI 475 Query: 80 PNLFISWVMECVTSASFSLCINGS 9 P +I+W+ +C+T+ SF++ +NG+ Sbjct: 476 PERYINWIHQCITTPSFTISVNGA 499 >dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana] Length = 1072 Score = 284 bits (727), Expect = 2e-73 Identities = 166/504 (32%), Positives = 279/504 (55%), Gaps = 11/504 (2%) Frame = -3 Query: 1487 ILGDFNCIKSPDEKLNGAAIN-NYDLKDFQDICLTLGLNDVQTTGCFFTWTNNS----VW 1323 +LGDFN + P E N ++N + ++DF + L+D+ 
G FTW N S + Sbjct: 1 MLGDFNQVLLPQEHSNPPSLNIDRRMRDFGSCLSEMELSDLVFKGNSFTWWNKSSIRPIA 60 Query: 1322 CRLDRALVNSAWAQLGLQISAHAPVPGA-ISDHXXXXXXXXXXXXXXSKPFKFFNMWTSH 1146 +LDR L N +W L S+H SDH +PFKFFN + Sbjct: 61 KKLDRILANDSWCNL--YPSSHGLFGNLDFSDHVSCGVVLEANGISAKRPFKFFNFLLKN 118 Query: 1145 PSFLNVVQNAWFNP-VWGTAQFMXXXXXXXXXXXXXXXNSQHFSHISSRVKEAKDTLDSL 969 FLNVV + WF+ V G++ + + ++S I R KEA + L + Sbjct: 119 EDFLNVVMDNWFSTNVVGSSMYRVSKKLKAMKKPIKDFSRLNYSGIELRTKEAHELLITC 178 Query: 968 QNQLHLDPLNSSLSFEVKSQILKVISLTKAEKMFLSQKAKCSFLNLSDKNTKFFHSIIKR 789 QN +P S+ + E+++Q K + L+ AE+ F Q+++ S+ D NT +FH ++ Sbjct: 179 QNLTLANPSVSNAALELEAQ-RKWVLLSCAEESFFHQRSRVSWFAEGDSNTHYFHRMVDS 237 Query: 788 NALKRQLSSVSLLDGTQTTSLKELSDAFVNFYKGLFGTYYSTNPIDQSVINHGPCLT--- 618 ++S+ +G S + + D V +Y+ L G+ S ++Q +N LT Sbjct: 238 RKSFNTINSLVDSNGLLIDSQQGILDHCVTYYERLLGSIESPFSMEQEDMNL--LLTYRC 295 Query: 617 -EEHANLLSAPVLPSDIKAAIFNIDDDRSPGPDGYSSAFFKKSWDIVGNDVINAIQEFFS 441 ++ + L +IKAA ++ +++ GPDGYS FF+ +W I+G +V+ AI EFF Sbjct: 296 SQDQCSELEKSFTDDEIKAAFKSLPRNKTSGPDGYSVEFFRDTWSIIGPEVLAAIHEFFD 355 Query: 440 NAKLLKQINHTSITLIPKTDHSPSVADFRPIACCNVVYKAITKIIAARLEKVLPSLINPA 261 + +LLKQ N T++ LIPKT ++ ++++FRPI+C N +YK I+K++ +RL+ +L ++I + Sbjct: 356 SGQLLKQWNATTLVLIPKTSNACTISEFRPISCLNTLYKVISKLLTSRLQGLLSAVIGHS 415 Query: 260 QAAFVGGRNITDNIFLAQEIIRNYARKRISPRCTIKVDIKKAYDTVSWSFLHGILAGMGF 81 Q+AF+ GR++ +N+ LA E++ Y R ISPR +KVD+KKA+D+V W F+ L + Sbjct: 416 QSAFLPGRSLAENVLLATEMVHGYNRLNISPRGMLKVDLKKAFDSVKWEFVTAALRALAI 475 Query: 80 PNLFISWVMECVTSASFSLCINGS 9 P +I+W+ +C+T+ SF++ +NG+ Sbjct: 476 PERYINWIHQCITTPSFTISVNGA 499 >emb|CCA66222.1| hypothetical protein [Beta vulgaris subsp. 
vulgaris] Length = 1383 Score = 283 bits (724), Expect = 4e-73 Identities = 191/642 (29%), Positives = 317/642 (49%), Gaps = 7/642 (1%) Frame = -3 Query: 1913 MKIAVWNVRGFKNPIKHNQLLAFIKEHNLNILCLLETKLEDDICFDSALCNFIVKTKFPG 1734 M + WN RG K +Q I H + L + E+K E+ + + I Sbjct: 1 MSLLSWNCRGIGAREKRSQTRKLINTHKPSFLFIQESKSEN---INPKIIKTIWHNDDIE 57 Query: 1733 WLYSNNFHEYRNGRMLLIWNPSSVHVDILFISSQLIHCKIICNISHATF--ITSFVYGLY 1560 WL+S + +G ++ IW S+ ++ I I I +I H F + +Y Sbjct: 58 WLFSPSVGN--SGGLISIWEKSAFQMESSHIQRNWI--AIQGSIVHPRFRCLLINIYNPC 113 Query: 1559 SVGDRRLLWDDLYNIGHSLIDPWLILGDFNCIKSPDEKLNGAAINNYDLKDFQDICLTLG 1380 ++ R ++W+D+ I P LI+GDFN + S E+ +G + + ++DF++ +LG Sbjct: 114 NIEGRAVVWNDISEFCRINIFPTLIMGDFNEVLSSSERGSGLS-SQEGVEDFRNFIQSLG 172 Query: 1379 LNDVQTTGCFFTWTNNSVWCRLDRALVNSAWAQ----LGLQISAHAPVPGAISDHXXXXX 1212 L D+ + FTW + + RLDR LV S W Q L LQI + +SDH Sbjct: 173 LIDISSANGRFTWFHGNRKSRLDRCLVTSDWIQQYPNLSLQI-----LNRTVSDHCPILA 227 Query: 1211 XXXXXXXXXSKPFKFFNMWTSHPSFLNVVQNAWFNPVWGTAQFMXXXXXXXXXXXXXXXN 1032 KPF+F N W SHP+FL + AW N AQ + Sbjct: 228 HSPATNWGP-KPFRFLNCWVSHPNFLPTISLAWAN-----AQNLPLPDKLKQLKLKLKEW 281 Query: 1031 SQ-HFSHISSRVKEAKDTLDSLQNQLHLDPLNSSLSFEVKSQILKVISLTKAEKMFLSQK 855 ++ F I +++KE +D + + + L+ S KS + + S K + + +Q Sbjct: 282 NKSEFGAIDTKIKELEDLIQHFDDIANDRTLSDSELDSRKSVQMDLWSWLKKREAYWAQV 341 Query: 854 AKCSFLNLSDKNTKFFHSIIKRNALKRQLSSVSLLDGTQTTSLKELSDAFVNFYKGLFGT 675 ++ +L D+NTKFFH++ K +SS+ L+D T + V++++ +F Sbjct: 342 SRSKWLKEGDRNTKFFHTLASIRRQKNSISSI-LIDNTNLVDCAGIKSEAVSYFQKIFQE 400 Query: 674 YYSTNPIDQSVINHGPCLTEEHANLLSAPVLPSDIKAAIFNIDDDRSPGPDGYSSAFFKK 495 P +++ L N+L P +I AA+ + D +++PGPDG++ F K Sbjct: 401 DIKNRPKFENL--EFKKLIPSQTNMLCEPFSLDEIDAAVASCDGNKAPGPDGFNFNFIKS 458 Query: 494 SWDIVGNDVINAIQEFFSNAKLLKQINHTSITLIPKTDHSPSVADFRPIACCNVVYKAIT 315 +W+++ DV + ++ F++ L K N I LIPK + S D+RPI+ VYK ++ Sbjct: 459 AWEVIKQDVYDMVRRFWNTGYLPKGCNTAFIALIPKVESPMSFKDYRPISMVGCVYKIVS 518 Query: 314 KIIAARLEKVLPSLINPAQAAFVGGRNITDNIFLAQEIIRNYARKRISPRCTIKVDIKKA 135 KI+A RL++V+ L+ Q++F+GGR I D 
+A EII + R + + +K+D KA Sbjct: 519 KILARRLQRVMDHLVGTLQSSFIGGRQILDGALVAGEIIDSCKRLK-TEAVLLKLDFHKA 577 Query: 134 YDTVSWSFLHGILAGMGFPNLFISWVMECVTSASFSLCINGS 9 +D++SW +L +L MGFP+L+ +W+ CV SAS S+ INGS Sbjct: 578 FDSISWDYLDWVLEQMGFPDLWRAWMKSCVMSASASILINGS 619 >gb|EOY25454.1| Uncharacterized protein TCM_026877 [Theobroma cacao] Length = 2367 Score = 283 bits (723), Expect = 6e-73 Identities = 169/558 (30%), Positives = 279/558 (50%), Gaps = 4/558 (0%) Frame = -3 Query: 1673 PSSVHVDILFISSQLIHCKIICNISHATFITSFVYGLYSVGDRRLLWDDLYNIGHSLIDP 1494 P +H D++F Q +H ++ +FVY + +R LLWD L + + P Sbjct: 1125 PRELHSDVIFDHPQCLHVRLTSPWLEFPIFVTFVYAKCTRSERTLLWDCLRRLAADIEVP 1184 Query: 1493 WLILGDFNCIKSPDEKLNGAAINNYDLKDFQDICLTLGLNDVQTTGCFFTWTNNSVWCRL 1314 WL+ GDFN I +E+L G+A + ++DF L GL D G FTWTNN ++ RL Sbjct: 1185 WLVGGDFNIILKREERLYGSAPHEGAMEDFASTLLDCGLLDGGFEGNPFTWTNNRMFQRL 1244 Query: 1313 DRALVNSAWA-QLGLQISAHAPVPGAISDHXXXXXXXXXXXXXXSKPFKFFNMWTSHPSF 1137 DR + N W + + H G SDH F+F + W H F Sbjct: 1245 DRIVYNHHWINKFPITRIQHLNRDG--SDHCPLLISCFNSSEKAPSSFRFQHAWVLHHDF 1302 Query: 1136 LNVVQNAWFNPVWGTAQFMXXXXXXXXXXXXXXXNSQHFSHISSRVKEAKDTLDSLQNQL 957 V++ W P+ G+ N F I S++KEA+ ++ + L Sbjct: 1303 KTSVESNWNLPINGSGLQAFWSKQHRLKQHLKWWNKVMFGDIFSKLKEAEKRVEECEI-L 1361 Query: 956 HLDPLNSSLSFEVKSQILKVISLTKAEKMFLSQKAKCSFLNLSDKNTKFFHSIIKRNALK 777 H + ++ ++ E++F QK+ ++ ++NTKFFH+ +++ ++ Sbjct: 1362 HQNEQTVESIIKLNKSYAQLNKQLNIEEIFWKQKSGVKWVVEGERNTKFFHTRMQKKRIR 1421 Query: 776 RQLSSVSLLDGTQTTSLKELSDAFVNFYKGLFGTYYSTNPIDQSVINHG--PCLTEEHAN 603 + V DG ++L + + ++ L P D S P + N Sbjct: 1422 SHIFKVQEPDGRWIEDQEQLKQSAIKYFSSLL----KFEPCDDSRFQRSLIPSIISNSEN 1477 Query: 602 -LLSAPVLPSDIKAAIFNIDDDRSPGPDGYSSAFFKKSWDIVGNDVINAIQEFFSNAKLL 426 LL A ++K A+F ID + + GPDG+SS F+++ W+I+ +D+++A+++FF A + Sbjct: 1478 ELLCAEPNLQEVKDAVFGIDPESAAGPDGFSSYFYQQCWNIIAHDLLDAVRDFFHGANIP 1537 Query: 425 KQINHTSITLIPKTDHSPSVADFRPIACCNVVYKAITKIIAARLEKVLPSLINPAQAAFV 246 + + T++ L+PK + +DFRPI+ C V+ K ITK+++ RL K+LPS+I Q+ FV Sbjct: 1538 
RGVTSTTLILLPKKPSASKWSDFRPISLCTVMNKIITKLLSNRLAKILPSIITENQSGFV 1597 Query: 245 GGRNITDNIFLAQEIIRNYARKRISPRCTIKVDIKKAYDTVSWSFLHGILAGMGFPNLFI 66 GGR I+DNI LAQE+I K +K+D+ KAYD + WSFL +L GF + +I Sbjct: 1598 GGRLISDNILLAQELIGKLNTKSRGGNLALKLDMMKAYDRLDWSFLIKVLQHFGFNDQWI 1657 Query: 65 SWVMECVTSASFSLCING 12 + +C+++ FSL +NG Sbjct: 1658 GMIQKCISNCWFSLLLNG 1675 Score = 68.9 bits (167), Expect = 2e-08 Identities = 36/110 (32%), Positives = 56/110 (50%), Gaps = 6/110 (5%) Frame = -2 Query: 2961 KNDFSTVPIWVKFPFLPAPCWQSKALGKICDKIGSPICCDRQTFDKSRPAFARILVDIDP 2782 + + + VP+W+ FP L A ++ AL I +G P+ D T + SRP+ AR+ ++ D Sbjct: 212 EKESAMVPVWIAFPNLKAHLFEKSALLLIAKTVGKPLFVDEATANGSRPSVARVCIEYDC 271 Query: 2781 TKKPPDSVQIQFSGGKS------FDQKVEYEFIPKYCKKCACFGHYFDNC 2650 K P D V I ++ + QKVE+ +P YC C GH +C Sbjct: 272 RKPPIDQVWIVVQNRETGTVTSGYPQKVEFSQMPAYCDHCCHVGHKEIDC 321 >gb|EOY02236.1| Uncharacterized protein TCM_011923 [Theobroma cacao] Length = 1954 Score = 281 bits (720), Expect = 1e-72 Identities = 172/563 (30%), Positives = 287/563 (50%), Gaps = 4/563 (0%) Frame = -3 Query: 1682 IWNPSSVHVDILFISSQLIHCKIICNISHATFITSFVYGLYSVGDRRLLWDDLYNIGHSL 1503 I++ V+ ++L Q +H ++ +FVY + +R LW+ L ++ + Sbjct: 655 IFSSMEVNCEVLMDHIQCLHVRLSLPWLPHPISATFVYAKCTRQERLELWNCLRSLSSDM 714 Query: 1502 IDPWLILGDFNCIKSPDEKLNGAAINNYDLKDFQDICLTLGLNDVQTTGCFFTWTNNSVW 1323 PW++ GDFN I S E+LNGA + ++DF GL D G FTWTNN ++ Sbjct: 715 QGPWMVGGDFNTIVSCAERLNGAPPHGGSMEDFVATLFDCGLIDAGFEGNSFTWTNNHMF 774 Query: 1322 CRLDRALVNSAWAQLGLQISA-HAPVPGAISDHXXXXXXXXXXXXXXSKPFKFFNMWTSH 1146 RLDR + N WA H G SDH F+F + WT H Sbjct: 775 QRLDRVVYNPEWAHCFSSTRVQHLNRDG--SDHCPLLISCATASQKGPSTFRFLHAWTKH 832 Query: 1145 PSFLNVVQNAWFNPVWGTAQFMXXXXXXXXXXXXXXXNSQHFSHISSRVKEAKDTLDSLQ 966 FL V+ +W P+ + N Q F I ++K A+ + + Sbjct: 833 HDFLPFVERSWQVPLNSSGLTAFWIKQQRLKRDLKWWNKQIFGDIFEKLKRAEIEAEKRE 892 Query: 965 NQLHLDP--LNSSLSFEVKSQILKVISLTKAEKMFLSQKAKCSFLNLSDKNTKFFHSIIK 792 + DP +N +L + +++ + +S+ E++F QK+ +L ++NTKFFH ++ Sbjct: 893 
KEFQQDPSSINRNLMNKAYAKLNRQLSI---EELFWQQKSGVKWLVEGERNTKFFHLRMR 949 Query: 791 RNALKRQLSSVSLLDGTQTTSLKELSDAFVNFYKGLF-GTYYSTNPIDQSVINHGPCLTE 615 + ++ + + +G + + ++ V +++ L + D S+I +T+ Sbjct: 950 KKRVRNNIFRIQDSEGNIYEDPQYIQNSAVQYFQNLLTAEQCDFSRFDPSLIPRTISITD 1009 Query: 614 EHANLLSAPVLPSDIKAAIFNIDDDRSPGPDGYSSAFFKKSWDIVGNDVINAIQEFFSNA 435 L +AP L +IK +FNID D GPDG+SS F++ WDI+ D++ A+ +FF+ Sbjct: 1010 NEF-LCAAPSL-KEIKEVVFNIDKDSVAGPDGFSSLFYQHCWDIIKQDLLEAVLDFFNGT 1067 Query: 434 KLLKQINHTSITLIPKTDHSPSVADFRPIACCNVVYKAITKIIAARLEKVLPSLINPAQA 255 + + + T++ L+PK +S +DFRPI+ C V+ K +TK +A RL K+LPS+I+ Q+ Sbjct: 1068 PMPQGVTSTTLVLLPKKPNSCQWSDFRPISLCTVLNKIVTKTLANRLSKILPSIISENQS 1127 Query: 254 AFVGGRNITDNIFLAQEIIRNYARKRISPRCTIKVDIKKAYDTVSWSFLHGILAGMGFPN 75 FV GR I+DNI LAQE++ K +K+D+ KAYD ++W FL+ ++ GF + Sbjct: 1128 GFVNGRLISDNILLAQELVGKLDAKARGGNVVLKLDMAKAYDRLNWDFLYLMMKQFGFND 1187 Query: 74 LFISWVMECVTSASFSLCINGSL 6 +IS + C+++ FSL INGSL Sbjct: 1188 RWISMIKACISNCWFSLLINGSL 1210 Score = 76.3 bits (186), Expect = 1e-10 Identities = 41/120 (34%), Positives = 60/120 (50%), Gaps = 7/120 (5%) Frame = -2 Query: 2961 KNDFSTVPIWVKFPFLPAPCWQSKALGKICDKIGSPICCDRQTFDKSRPAFARILVDIDP 2782 + + S VP+W+ FP LPA + AL + +G P+ D T ++SRP+ AR+ V+ D Sbjct: 13 EKESSVVPVWISFPNLPAHLHEKSALMMVARTVGKPLFVDEATANRSRPSVARVCVEYDC 72 Query: 2781 TKKPPDSVQIQFSGGKS------FDQKVEYEFIPKYCKKCACFGHYFDNCSAT-NAPSKT 2623 K P D V I K+ Q+VE+ +P+YC+ C GH C N P T Sbjct: 73 QKPPLDHVWIVSRNRKTETMTGGLSQRVEFAKLPEYCQHCCHVGHAVTECMVLGNKPVST 132