BLASTX nr result
ID: Akebia23_contig00021951
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Akebia23_contig00021951 (2091 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_007208312.1| hypothetical protein PRUPE_ppa003035mg [Prun... 697 0.0 ref|XP_007047763.1| Transcription initiation factor TFIID subuni... 691 0.0 ref|XP_002533519.1| conserved hypothetical protein [Ricinus comm... 650 0.0 ref|XP_004288527.1| PREDICTED: uncharacterized protein LOC101303... 643 0.0 gb|EXC35477.1| hypothetical protein L484_026784 [Morus notabilis] 634 e-179 ref|XP_004143440.1| PREDICTED: uncharacterized protein LOC101223... 626 e-176 ref|XP_002310863.2| hypothetical protein POPTR_0007s14190g [Popu... 607 e-171 ref|XP_006380803.1| hypothetical protein POPTR_0007s14190g [Popu... 602 e-169 ref|XP_006466330.1| PREDICTED: uncharacterized protein LOC102616... 585 e-164 ref|XP_006466329.1| PREDICTED: uncharacterized protein LOC102616... 580 e-162 ref|NP_201357.2| uncharacterized protein [Arabidopsis thaliana] ... 552 e-154 ref|XP_006394014.1| hypothetical protein EUTSA_v10003865mg [Eutr... 548 e-153 ref|XP_006280200.1| hypothetical protein CARUB_v10026105mg [Caps... 543 e-151 emb|CAN70982.1| hypothetical protein VITISV_027119 [Vitis vinifera] 542 e-151 dbj|BAA98173.1| unnamed protein product [Arabidopsis thaliana] 538 e-150 ref|XP_006426252.1| hypothetical protein CICLE_v10025202mg [Citr... 528 e-147 ref|XP_002864962.1| hypothetical protein ARALYDRAFT_496788 [Arab... 528 e-147 gb|EYU30927.1| hypothetical protein MIMGU_mgv1a003113mg [Mimulus... 516 e-143 ref|XP_007047764.1| Transcription initiation factor TFIID subuni... 506 e-140 ref|XP_007047765.1| Transcription initiation factor TFIID subuni... 502 e-139 >ref|XP_007208312.1| hypothetical protein PRUPE_ppa003035mg [Prunus persica] gi|462403954|gb|EMJ09511.1| hypothetical protein PRUPE_ppa003035mg [Prunus persica] Length = 610 Score = 697 bits (1799), Expect = 0.0 Identities = 376/621 (60%), Positives = 441/621 (71%), Gaps = 9/621 (1%) Frame = -2 Query: 2081 MALLGEDGLGFELARKLETCGVWRSWLGDSNYSNFVHVLSSPLTWESFMKPQDSKTRAQI 1902 MALLG+DG G+ELA KLE+C VWRSWLGDS Y+NF L+SP TWE+FM DSK+RA + Sbjct: 1 MALLGDDGRGYELACKLESCNVWRSWLGDSTYANFAPFLNSPSTWEAFM---DSKSRAHL 57 Query: 1901 QLQLRVRALLFDKASVSLFLRXXXXXXXXXXXXXXXSNL-----NPNYLQLHGDDVYFSL 1737 LQLR RALLFDKA VSLFLR S+L NP YLQLH DDVYF+L Sbjct: 58 HLQLRARALLFDKACVSLFLRPHSNSSSSSSSSSSSSSLAVSKLNPYYLQLHPDDVYFTL 117 Query: 1736 EDFGSSSQDGVQHQEGSTSSNSIQSKVVQIQPKLGFSVGSRYVEPEVDNISERPRPDDFP 1557 E+ SSQDGVQ Q+ S +S +IQ K F VGSRY E E+DN R + D+ P Sbjct: 118 EN---SSQDGVQVQQRDPSVSS------KIQSKAAFGVGSRYGESEIDNKPSRFKNDELP 168 Query: 1556 ETWYKQFIEKYSTSRQQRLPFGDREALKRTSEGMSTYLKLHERHKRSRQPFKEDQYTGFG 1377 ETWY QF+E+Y S+ RL DRE+ KRT E MS YLKL ERHK+ R FKEDQY G+G Sbjct: 169 ETWYNQFMERYRISKPYRLSSADRESEKRTPEEMSAYLKLLERHKKRRLAFKEDQYMGYG 228 Query: 1376 NPIWENGSNIHSKSVSDGNNSIDDETCFFPEMMFPLNCVPDSALPSINRLEDNQKVECYG 1197 NPI EN S+++ SV DG+NS+D E FFPE MF NCVPDSALP +NR EDNQKVECYG Sbjct: 229 NPILENVSHMNPNSVLDGSNSVDSEISFFPETMFTFNCVPDSALPPLNREEDNQKVECYG 288 Query: 1196 VLDSLPHVMTRSPAMLERFGIRPEYLRIGLERSKYRGKNGLEGNGKVLGPEQASQMSQKV 1017 VLD LP +MTRSP MLER GIRPEYL + +RGKNG GN K L EQA+Q+SQ V Sbjct: 289 VLDMLPQIMTRSPVMLERLGIRPEYLSMEQGGILHRGKNGSGGNRKCLSKEQAAQLSQTV 348 Query: 1016 ISRVLVSVGFEGTTEVPMEVLSQFLGCHICKLGRILKVLTDNYRKQCSAIEILRMFLQTL 837 I+R+L S+GFE TEVP++V SQ L CHI KLG LKVLTD+YRKQCSAIE+L+MFLQT+ Sbjct: 349 IARMLTSIGFESATEVPIDVFSQMLSCHISKLGGSLKVLTDSYRKQCSAIELLKMFLQTI 408 Query: 836 GYSNLGNLAEHVKDGSRNFSHPTQQPHLRGLQSGFQSQHPSPNLQTQQIPR-XXXXXXXX 660 GYSN G L E VKDGSRNF QQ H G QS Q QH +P QQ R Sbjct: 409 GYSNFGPLMEQVKDGSRNFQQTQQQIH--GSQSQLQPQHQNPIRLPQQTSRQMLPQMQQV 466 Query: 659 XXXQNLAFPQQPQLERMRRRQPSTPRAGMTLDKDRPMVEVKIENTSELPIDSNAFNPIHA 480 +N+ F QQ LERMRRRQPSTPRAGM +DKDRPMV+VKIE SELP+D NAF ++ Sbjct: 467 ALSKNVPFQQQQPLERMRRRQPSTPRAGMDMDKDRPMVQVKIEAPSELPMDGNAFYGLNN 526 Query: 479 RHPQIQFRQQSMAAA---MANLNPQSNHQFKQLASLQVPQLQTQNMGTTRAPPVKVEGFQ 309 R+ Q+QFRQQ A + M N++PQS +QF+Q+ASLQ+PQ+Q QN G RAPPVKVEGFQ Sbjct: 527 RNLQMQFRQQIPAMSNLTMPNVHPQSGNQFRQMASLQIPQMQAQNAGVLRAPPVKVEGFQ 586 Query: 308 ELMGGDTTLKHDSEEHKLTSP 246 ELMGGD + KHDS+E++LTSP Sbjct: 587 ELMGGDASSKHDSDENRLTSP 607 >ref|XP_007047763.1| Transcription initiation factor TFIID subunit 8, putative isoform 1 [Theobroma cacao] gi|508700024|gb|EOX91920.1| Transcription initiation factor TFIID subunit 8, putative isoform 1 [Theobroma cacao] Length = 593 Score = 691 bits (1782), Expect = 0.0 Identities = 369/616 (59%), Positives = 445/616 (72%), Gaps = 4/616 (0%) Frame = -2 Query: 2081 MALLGEDGLGFELARKLETCGVWRSWLGDSNYSNFVHVLSSPLTWESFMKPQDSKTRAQI 1902 MALLG+DG G++LAR+LE+CGVWR+WLGDS Y++F+H LSSP WESFM+ DSK+R+QI Sbjct: 1 MALLGDDGRGYDLARRLESCGVWRAWLGDSTYASFIHFLSSPSAWESFMRVDDSKSRSQI 60 Query: 1901 QLQLRVRALLFDKASVSLFLRXXXXXXXXXXXXXXXS--NLNPNYLQLHGDDVYFSLEDF 1728 LQLR RALLFDKA+V+LFLR + LNPNYLQLHGDDVYF+LE Sbjct: 61 HLQLRARALLFDKATVALFLRSNSSNPANNTSSSSVAVSKLNPNYLQLHGDDVYFTLE-- 118 Query: 1727 GSSSQDGVQHQEGSTSSNSIQSKVVQIQPKLGFSVGSRYVEPEVDNISERPRPDDFPETW 1548 GS Q+G ++N+ SK K FS GSRY E E D++S+R R ++ PETW Sbjct: 119 GSL-------QDGGAAANAAPSK-----SKSSFSAGSRYGESEFDSLSQRYRKEELPETW 166 Query: 1547 YKQFIEKYSTSRQQRLPFGDREALKRTSEGMSTYLKLHERHKRSRQPFKEDQYTGFGNPI 1368 Y QFIEKY SR +L GDRE+ KRT E M+TYL++ E+HKR R F+EDQY G+G+ Sbjct: 167 YNQFIEKYRLSRPYKLFLGDRESEKRTPEEMTTYLRIVEKHKRRRVAFQEDQYMGYGS-- 224 Query: 1367 WENGSNIHSKSVSDGNNSIDDETCFFPEMMFPLNCVPDSALPSINRLEDNQKVECYGVLD 1188 + + S SV DGNNS DDE FFPE+M +NCVPDSALP R+ D + +E YGVLD Sbjct: 225 ----TGLESNSVLDGNNSGDDEIPFFPEIMSMMNCVPDSALPPATRVWDKKTIEFYGVLD 280 Query: 1187 SLPHVMTRSPAMLERFGIRPEYLRIGLERSKYRGKNGLEGNGKVLGPEQASQMSQKVISR 1008 +LP V TRSP M+ER GIRPEYL + + +RGKN N K+LG EQASQMS+KVI+R Sbjct: 281 TLPQVSTRSPVMIERLGIRPEYLNMEQGGNTHRGKN----NRKLLGQEQASQMSRKVIAR 336 Query: 1007 VLVSVGFEGTTEVPMEVLSQFLGCHICKLGRILKVLTDNYRKQCSAIEILRMFLQTLGYS 828 +L VGFEG TE P+EV SQFL CHIC+LGR +KVLTDNYRKQCSAIE++RMFLQT GYS Sbjct: 337 LLNGVGFEGATEAPVEVFSQFLSCHICRLGRNIKVLTDNYRKQCSAIELIRMFLQTSGYS 396 Query: 827 NLGNLAEHVKDGSRNFSHPTQQPHLRGLQSGFQSQHPSPNLQTQQIP--RXXXXXXXXXX 654 N G LAE VKD +RN T Q + G+QS Q QH + QQ+P + Sbjct: 397 NFGTLAELVKDSTRNVVQQTPQ-QMHGIQSQLQPQHQNALRMAQQLPMRQMHPQMQQMVH 455 Query: 653 XQNLAFPQQPQLERMRRRQPSTPRAGMTLDKDRPMVEVKIENTSELPIDSNAFNPIHARH 474 QNL F QQ QLER+RRR PSTPR M +DKDRPMV+VKIEN SELP+DSNAFNPI+ RH Sbjct: 456 PQNLTFQQQQQLERIRRRHPSTPRPVMDMDKDRPMVQVKIENPSELPMDSNAFNPINTRH 515 Query: 473 PQIQFRQQSMAAAMANLNPQSNHQFKQLASLQVPQLQTQNMGTTRAPPVKVEGFQELMGG 294 Q+QFRQQ AA++NL+ Q ++QF+QL S Q+ Q+QTQNMG RAPPVKVEGFQELMGG Sbjct: 516 SQMQFRQQQF-AAISNLHAQPSNQFRQLMSPQIHQMQTQNMGIVRAPPVKVEGFQELMGG 574 Query: 293 DTTLKHDSEEHKLTSP 246 DTTLKHDSEE+KLTSP Sbjct: 575 DTTLKHDSEENKLTSP 590 >ref|XP_002533519.1| conserved hypothetical protein [Ricinus communis] gi|223526616|gb|EEF28863.1| conserved hypothetical protein [Ricinus communis] Length = 573 Score = 650 bits (1676), Expect = 0.0 Identities = 353/615 (57%), Positives = 436/615 (70%), Gaps = 2/615 (0%) Frame = -2 Query: 2081 MALLGEDGLGFELARKLETCGVWRSWLGDSNYSNFVHVLSSPLTWESFMKPQDSKTRAQI 1902 M+LLG+DG G++LARKLE+ G WR+WLGDS YSNFVH LSSP +W+SFM+ DSK++AQI Sbjct: 1 MSLLGDDGNGYDLARKLESLGTWRTWLGDSLYSNFVHFLSSPSSWDSFMRTDDSKSKAQI 60 Query: 1901 QLQLRVRALLFDKASVSLFLRXXXXXXXXXXXXXXXSNLNPNYLQLHGDDVYFSLEDFGS 1722 LQLR RALLFDKA+VSLF+ LNP+YLQLHGDDVYF+LED Sbjct: 61 HLQLRARALLFDKATVSLFISNNNNSCSALAVS----KLNPSYLQLHGDDVYFTLED--- 113 Query: 1721 SSQDGVQHQEGSTSSNSIQSKVVQIQPKLGFSVGSRYVEPEVDNISERPRPDDFPETWYK 1542 G Q Q + S + +S FS+GSRY EPE++ +++R R ++FPE+WY Sbjct: 114 ----GDQRQNAALSKSHSKS---------AFSIGSRYGEPEMEGLTQRFRNEEFPESWYN 160 Query: 1541 QFIEKYSTSRQQRLPFGDREALKRTSEGMSTYLKLHERHKRSRQPFKEDQYTGFGNPIWE 1362 QFIEKY SR RL G+RE+ KR+ E MS+YL+L ++HKR R Sbjct: 161 QFIEKYKVSRPYRLSVGERESDKRSPEEMSSYLRLVDKHKRRRI---------------S 205 Query: 1361 NGSNIHSKSVSDGNNSIDDETC-FFPEMMFPLNCVPDSALPSINRLEDNQKVECYGVLDS 1185 + ++HS SV DG+NS DD+ FFPE MF LNCVPDSALP I R +DNQK+E +GVLDS Sbjct: 206 STPSMHSSSVLDGSNSTDDDDLSFFPETMFMLNCVPDSALPLIIRPQDNQKIEFHGVLDS 265 Query: 1184 LPHVMTRSPAMLERFGIRPEYLRIGLERSKYRGKNGLEGNGKVLGPEQASQMSQKVISRV 1005 LP TRS ++ER GI E S +R KNG EGN K++ EQASQM QKV++R+ Sbjct: 266 LPQ--TRSSVVIERLGISVEQ-----GGSLHRAKNGSEGNKKLISQEQASQMCQKVVARM 318 Query: 1004 LVSVGFEGTTEVPMEVLSQFLGCHICKLGRILKVLTDNYRKQCSAIEILRMFLQTLGYSN 825 L VGF+ TE+P+EVLSQ L CHI +LGR LK+L DNYRKQCSAI++L+MFLQT G++N Sbjct: 319 LARVGFDSATELPVEVLSQALRCHISELGRNLKILADNYRKQCSAIDLLKMFLQTAGFNN 378 Query: 824 LGNLAEHVKDGSRNFSHPTQQPHLRGLQSGFQSQHPSPNLQTQQIPR-XXXXXXXXXXXQ 648 LG L E VKDG+RN PTQQ + +QS Q+QH S QQIPR Q Sbjct: 379 LGGLMELVKDGTRNVVQPTQQ-QMHAIQSQLQAQHQSTLRLPQQIPRQMHPQMQQMVHPQ 437 Query: 647 NLAFPQQPQLERMRRRQPSTPRAGMTLDKDRPMVEVKIENTSELPIDSNAFNPIHARHPQ 468 NLAF QQ QLERMRRRQPSTPR M +DKDRPMV+VKIEN SELP+D NAFNP+H+RHPQ Sbjct: 438 NLAFQQQQQLERMRRRQPSTPRPAMDIDKDRPMVQVKIENPSELPMDGNAFNPMHSRHPQ 497 Query: 467 IQFRQQSMAAAMANLNPQSNHQFKQLASLQVPQLQTQNMGTTRAPPVKVEGFQELMGGDT 288 +QFRQQ + AA+++L QS++QF+QLAS+QVPQ+Q+ NMG RAPPVKVEGFQELMGGD Sbjct: 498 MQFRQQQL-AAISSLQAQSSNQFRQLASMQVPQVQSPNMGIVRAPPVKVEGFQELMGGDA 556 Query: 287 TLKHDSEEHKLTSPS 243 ++KHD EE+KLTSPS Sbjct: 557 SVKHDPEENKLTSPS 571 >ref|XP_004288527.1| PREDICTED: uncharacterized protein LOC101303161 [Fragaria vesca subsp. vesca] Length = 596 Score = 643 bits (1659), Expect = 0.0 Identities = 355/623 (56%), Positives = 427/623 (68%), Gaps = 11/623 (1%) Frame = -2 Query: 2081 MALLGEDGLGFELARKLETCGVWRSWLGDSNYSNFVHVLSSPLTWESFMKPQDSKTRAQI 1902 MALLG+DG G+ELA KLE+C VWR+WLGDS+YS FVH L+SP TW+SFM+ SK+RAQI Sbjct: 1 MALLGDDGRGYELACKLESCNVWRTWLGDSSYSTFVHFLTSPSTWDSFMRSDPSKSRAQI 60 Query: 1901 QLQLRVRALLFDKASVSLFLRXXXXXXXXXXXXXXXSNLNPNYLQLHGDDVYFSLEDFGS 1722 LQLR RALLFDKASVSLFLR NLNPNYLQLH DDVYFSLE+ Sbjct: 61 LLQLRARALLFDKASVSLFLRPDSASNSSAVS-----NLNPNYLQLHADDVYFSLEN--- 112 Query: 1721 SSQDGVQHQEGSTSSNSIQSKVVQIQPKLGFSVGSRYVEPEVDNISERPRPDDFPETWYK 1542 SS +GVQ Q+ S +IQ K F GSRY E E+DN S R + ++ PETWY Sbjct: 113 SSAEGVQAQQRDAS---------KIQSKTNFGFGSRYGESEIDNKSARFKNEELPETWYN 163 Query: 1541 QFIEKYSTSRQQRLPFGDREALKRTSEGMSTYLKLHERHKRSRQPFKEDQYTGFGNPIWE 1362 Q E++ SR RL DRE+ +RT E M Y+KL +HK+ FKE+Q G+ NP+ E Sbjct: 164 QVSERHRVSRTHRLSSADRESERRTPEEMCAYIKLAMKHKKRCIAFKEEQPVGYRNPLLE 223 Query: 1361 NGSNIHSKSVSDGNNSIDDETCFFPEMMFPLNCVPDSALPSINRLEDNQKVECYGVLDSL 1182 N S + S DG+NS+D E FFPE MF NCVPDSALP +NR +D+QKVE GVLD+L Sbjct: 224 NASQ-NPHSGLDGSNSVDHEAPFFPETMFTFNCVPDSALPPMNREQDDQKVEFCGVLDTL 282 Query: 1181 PHVMTRSPAMLERFGIRPEYLRIGLERSKYRGKNGLEGNGKVLGPEQASQMSQKVISRVL 1002 P VMTRSP MLER GIRPEYL S RGKNG GN L EQA+Q+SQKVI+R+L Sbjct: 283 PQVMTRSPVMLERLGIRPEYL------SMDRGKNGSAGNKSCLTHEQAAQLSQKVIARIL 336 Query: 1001 VSVGFEGTTEVPMEVLSQFLGCHICKLGRILKVLTDNYRKQCSAIEILRMFLQTLGYSNL 822 +VGFEG++EVP+EV SQ L CHI KLG LKVLTD+YRKQCSAIE+L+MFLQT+GY N Sbjct: 337 TNVGFEGSSEVPIEVFSQLLSCHIRKLGSCLKVLTDSYRKQCSAIELLKMFLQTVGYRNF 396 Query: 821 GNLAEHVKDGSRNFSHPTQQPHLRGLQSGFQSQHPSPNLQTQQIPR-------XXXXXXX 663 G LA+ VKDGSR+ H Q + G+QS Q QH +P QQI R Sbjct: 397 GPLADQVKDGSRSV-HQQNQQQIHGMQSQLQPQHQNPIRLPQQISRQMLPQMQQIQQMQQ 455 Query: 662 XXXXQNLAFPQQPQLERMRRRQPSTPRAGMTLDKDRPMVEVKIENTSELPIDSNAFNPIH 483 +NL F QQ Q+ERMRRRQPSTPRAGM + ++RPMV+VKIE SELP+DSNAFN + Sbjct: 456 MAQSKNLPFQQQQQIERMRRRQPSTPRAGMDMVQERPMVQVKIEAPSELPMDSNAFNNFN 515 Query: 482 ARHPQIQFRQQSMAA----AMANLNPQSNHQFKQLASLQVPQLQTQNMGTTRAPPVKVEG 315 R+PQ+QFRQQ + A M N+ QS +QF+Q Q+ Q+Q+QN G RA PVKVEG Sbjct: 516 NRNPQMQFRQQQIPAMSNPTMQNVPAQSGNQFRQ---TQIAQIQSQNAGVLRARPVKVEG 572 Query: 314 FQELMGGDTTLKHDSEEHKLTSP 246 F ELMGGD + KHDS+E++LTSP Sbjct: 573 FSELMGGDASSKHDSDENRLTSP 595 >gb|EXC35477.1| hypothetical protein L484_026784 [Morus notabilis] Length = 647 Score = 634 bits (1636), Expect = e-179 Identities = 357/662 (53%), Positives = 434/662 (65%), Gaps = 49/662 (7%) Frame = -2 Query: 2081 MALLGEDGLGFELARKLETCGVWRSWLGDSNYSNFVHVLSSPLTWESFMKPQDSKTRAQI 1902 MALLG+DG GFELARKLETCGVWR WLGDS Y NF L+SP TWE+FM+ +K+RAQI Sbjct: 1 MALLGDDGRGFELARKLETCGVWRKWLGDSCYGNFAPYLNSPTTWEAFMRVDGTKSRAQI 60 Query: 1901 QLQLRVRALLFDKASVSLFLRXXXXXXXXXXXXXXXSN--------LNPNYLQLHGDDVY 1746 LQLRVRALLFDKASVSLFLR ++ LNPNYL LHGDDVY Sbjct: 61 HLQLRVRALLFDKASVSLFLRSNPSSSSSSSSSSSSASRSSVAISKLNPNYLNLHGDDVY 120 Query: 1745 FSLEDFGSSSQDGVQHQEGSTSSNSIQSKVVQIQPKLGFSVGSRYVEPEVDNISERPRPD 1566 F+LE+ SS D SSN+ SK IQ K F VGS Y E E+DN+ + R D Sbjct: 121 FTLEN---SSSD--------VSSNTASSK---IQSKASFGVGSGYGESEIDNVHQMFRND 166 Query: 1565 DFPETWYKQFIEKYSTSRQQRLPFGDREALKRTSEGMSTYLKLHERHKRSRQPFKEDQYT 1386 PETWY QFIE Y TSR RL GD+E KR+ E M Y+KL E+HK+ R +KEDQY Sbjct: 167 VLPETWYNQFIENYRTSRPYRLSLGDQEPDKRSPEEMCAYIKLLEKHKKRRVAYKEDQYM 226 Query: 1385 GFGNPIWENGSNIHSKSVSDGNNSIDDETCFFPEMMFPLNCVPDSALPSINRLEDNQKVE 1206 G+GNP+ EN S + S+SD NS DDE+ FFPE+MF LN VPDSAL NR+E+ +K+E Sbjct: 227 GYGNPVLENSSYMRPNSISDAINSDDDESTFFPEIMFTLNSVPDSALSVANRVEERRKIE 286 Query: 1205 CYGVLDSLPHVMTRSPAMLERFGIRPEYLRIGLERSKYRGKNGLEGNGKVLGPEQASQMS 1026 YGVLD LP VMT+SP M+ERFGI P +L + + + KNG N K LG EQA ++S Sbjct: 287 FYGVLDGLPRVMTKSPVMIERFGINP-FLGMEHGGNVHHVKNGSVVNKKCLGQEQALELS 345 Query: 1025 QKVISRVLVSVGFEGTTEVPMEVLSQFLGCHICKLGRILKVLTDNYRKQCSAIEILRMFL 846 QKVI+R+L S+GFEG+TEVP+EV SQ + CHI +LGRILKVL+D+YRKQC+A+E+L+MFL Sbjct: 346 QKVIARMLASIGFEGSTEVPVEVFSQLMSCHITELGRILKVLSDSYRKQCTAVELLKMFL 405 Query: 845 QTLGYSNLGNLAEHVKDGSRNFSHPTQQPHLRGLQSGFQSQ---------------HP-- 717 Q L + G+L EHVKDGSR S Q + G+QS SQ HP Sbjct: 406 QRL-KCDFGSLVEHVKDGSRT-SVQQSQSQVHGIQSQMMSQAQAALRLQQQMSRQMHPQM 463 Query: 716 ---------------------SPNLQTQQIPRXXXXXXXXXXXQNLAFPQQPQLERMRRR 600 LQ QQ + Q L QQ QLERMRRR Sbjct: 464 QQFVHSQNMAFQQQQQQHHQQQQQLQQQQQQQLQQQQLQQQQQQQLQQQQQQQLERMRRR 523 Query: 599 QPSTPRAGMTLDKDRPMVEVKIENTSELPIDSNAFNPIHARHPQIQFRQQSMAAA---MA 429 QPSTPR+GM +DKDRP+V+VKIE SELP+DSN+ N + R Q+ +RQQ A + M+ Sbjct: 524 QPSTPRSGMDVDKDRPLVQVKIEQPSELPMDSNSLNNFNNRISQMHYRQQMAAMSNYTMS 583 Query: 428 NLNPQSNHQFKQLASLQVPQLQTQNMGTTRAPPVKVEGFQELMGGDTTLKHDSEEHKLTS 249 N++ QSN+QF+Q+AS Q+PQ+Q+QNMG RAPPVKVEGFQELMGGD KHDSEE++LTS Sbjct: 584 NVHGQSNNQFRQMASGQIPQMQSQNMGVVRAPPVKVEGFQELMGGDAASKHDSEENRLTS 643 Query: 248 PS 243 PS Sbjct: 644 PS 645 >ref|XP_004143440.1| PREDICTED: uncharacterized protein LOC101223185 [Cucumis sativus] gi|449499810|ref|XP_004160923.1| PREDICTED: uncharacterized protein LOC101224095 [Cucumis sativus] Length = 612 Score = 626 bits (1615), Expect = e-176 Identities = 337/627 (53%), Positives = 439/627 (70%), Gaps = 14/627 (2%) Frame = -2 Query: 2081 MALLGEDGLGFELARKLETCGVWRSWLGDSNYSNFVHVLSSPLTWESFMKPQDSKTRAQI 1902 MALLG+DG G+ELARKL+T GVW++WLGD +YS FV L+S TW++FM+ DSK+RAQI Sbjct: 1 MALLGDDGRGYELARKLDTLGVWQTWLGDLSYSIFVPFLASTSTWDTFMRTDDSKSRAQI 60 Query: 1901 QLQLRVRALLFDKASVSLFLRXXXXXXXXXXXXXXXSN--------LNPNYLQLHGDDVY 1746 QLQLR RALLFDKASVSLFLR + L+PNYLQLHGDDVY Sbjct: 61 QLQLRARALLFDKASVSLFLRSTPSPSSPSYSTGNPLSSSSLAISKLSPNYLQLHGDDVY 120 Query: 1745 FSLEDFGSSSQDGVQHQEGSTSSNSIQSKVVQIQPKLGFSVGSRYVEPEVDNISERPRPD 1566 F+LE+ SS+DGVQ +EG SSN K IQPK + G R E ++ + S+R + + Sbjct: 121 FTLEN---SSKDGVQQREGHVSSNKASGK---IQPKAASTAGPRSRESDIGDSSQRLK-N 173 Query: 1565 DFPETWYKQFIEKYSTSRQQRLPFGDREALKRTSEGMSTYLKLHERHKRSRQPFKEDQYT 1386 + PETWY QFIEKY + RL G+ A KRTSE MS+YL+L E+HK+ R FK+D T Sbjct: 174 ELPETWYSQFIEKYRVKQPYRLSHGNNVAEKRTSEEMSSYLRLLEKHKKRRMVFKDDLLT 233 Query: 1385 GFGNPIWENGSNIHSKSVSDGNNSIDDETCFFPEMMFPLNCVPDSALPSINRLEDNQKVE 1206 FGN + N S+ SV D +NS++D+ FFPE+MF NCVP+SALP + ++DN++ E Sbjct: 234 NFGNSVSANASS----SVFDFSNSVEDDANFFPEIMFTFNCVPESALPPPDDMKDNRRPE 289 Query: 1205 CYGVLDSLPHVMTRSPAMLERFGIRPEYLRIGLERSKYRGKNGLEGNGKVLGPEQASQMS 1026 GV+D+LP +TR+ AM+ER G++P+Y+ + +R K+G GN K LG EQ+ QMS Sbjct: 290 VPGVIDTLPQPITRNSAMMERLGVKPDYVSTERGVNVHRAKSGSGGNRKSLGQEQSFQMS 349 Query: 1025 QKVISRVLVSVGFEGTTEVPMEVLSQFLGCHICKLGRILKVLTDNYRKQCSAIEILRMFL 846 QKV++R+L+S+GFEG TEVP+EV SQFL CHICKLG L+VL D+YRKQCSA+++LRMFL Sbjct: 350 QKVVARMLMSLGFEGATEVPLEVFSQFLSCHICKLGSTLRVLADSYRKQCSAVDLLRMFL 409 Query: 845 QTLGYSNLGNLAEHVKDGSRNFSHPTQQPHLRGLQSGFQSQHPSPNLQTQQIPRXXXXXX 666 +T+GYSN G LA+ VKDGSRN+ +Q G+Q Q+QH + QQ+PR Sbjct: 410 KTMGYSNFGPLADIVKDGSRNY---VRQSMHHGVQPQLQAQHQTLLQVPQQVPR-QMHPQ 465 Query: 665 XXXXXQNLAFPQQPQ------LERMRRRQPSTPRAGMTLDKDRPMVEVKIENTSELPIDS 504 + AF QQ Q LE+MRRRQ +TPRA M +KDRP+++VK+ENT ELP+D Sbjct: 466 MQQMVNSQAFQQQQQQQQQFVLEKMRRRQAATPRAVMEANKDRPLLQVKVENT-ELPMDG 524 Query: 503 NAFNPIHARHPQIQFRQQSMAAAMANLNPQSNHQFKQLASLQVPQLQTQNMGTTRAPPVK 324 NA N ++ RHPQ+QFRQQ + AAM+N++ +QF+Q+ S+Q+PQ+QT N RAPPVK Sbjct: 525 NALNALNIRHPQLQFRQQQI-AAMSNIHASPGNQFRQIPSMQMPQIQTPNTNVVRAPPVK 583 Query: 323 VEGFQELMGGDTTLKHDSEEHKLTSPS 243 VEGFQELMGGDT+ KHDSEE +LTSPS Sbjct: 584 VEGFQELMGGDTSSKHDSEEARLTSPS 610 >ref|XP_002310863.2| hypothetical protein POPTR_0007s14190g [Populus trichocarpa] gi|550334854|gb|EEE91313.2| hypothetical protein POPTR_0007s14190g [Populus trichocarpa] Length = 577 Score = 607 bits (1565), Expect = e-171 Identities = 341/615 (55%), Positives = 420/615 (68%), Gaps = 2/615 (0%) Frame = -2 Query: 2081 MALLGEDGLGFELARKLETCGVWRSWLGDSNYSNFVHVLSSPLTWESFMKPQDSKTRAQI 1902 M++LG+DGLG++LARKLET G+WR+WLGDS YSNF+H LSSP +W+SFM+ DSK+++ Sbjct: 1 MSVLGDDGLGYDLARKLETLGMWRAWLGDSLYSNFLHSLSSPASWQSFMRTDDSKSKSHF 60 Query: 1901 QLQLRVRALLFDKASVSLFLRXXXXXXXXXXXXXXXSNLNPNYLQLHGDDVYFSLEDFGS 1722 QLQLR RALLFDKASVSLFLR NLNPNYLQLHGDDVYF+LED Sbjct: 61 QLQLRARALLFDKASVSLFLRSNTVAAVS--------NLNPNYLQLHGDDVYFTLEDEDQ 112 Query: 1721 SSQDGVQHQEGSTSSNSIQSKVVQIQPKLGFSVGSRYVEPEVDNISERPRPDDFPETWYK 1542 + G G+T+ ++ +L F V S +V + +R + ++ PETWY Sbjct: 113 RREGG---GVGATT---------KVCSRLSFRV-SNFV---LYICCQRYKNEELPETWYT 156 Query: 1541 QFIEKYSTSRQQRLPFGDREALKRTSEGMSTYLKLHERHKRSRQPFKEDQYTGFGNPIWE 1362 QF+EK R RL FGDRE+ KR+ E MSTY +L RHKR QY G GN E Sbjct: 157 QFMEKRKLKRPYRLSFGDRESDKRSPEQMSTYFRLVARHKR------RCQYLGSGNSNLE 210 Query: 1361 NGSNIHSKSVSDGNNSIDDETCFFPEMMFPLNCVPDSALPSINRLEDNQKVECYGVLDSL 1182 + SN+ S SV DG++S+DD+ FFPE MF NCVPDSA+P I R DNQK+E G DSL Sbjct: 211 STSNMRSGSVLDGSHSVDDDFVFFPETMFMFNCVPDSAIPPIIRARDNQKIEFRGAFDSL 270 Query: 1181 PHVMTRSPAMLERFGIRPEYLRIGLERSKYRGKNGLEGNGKVLGPEQASQMSQKVISRVL 1002 P TR+P M+ER GI E S RGKNG EG+ K L EQA QMSQKV++ +L Sbjct: 271 PQ--TRNPVMIERLGISVEQ-----GGSLNRGKNGSEGHKK-LSEEQALQMSQKVVACLL 322 Query: 1001 VSVGFEGTTEVPMEVLSQFLGCHICKLGRILKVLTDNYRKQCSAIEILRMFLQTLGYSNL 822 VGF+G +E+PMEV SQ L CHI KLGRIL+VL D+YRKQCSA+E+L+MFLQT G+SNL Sbjct: 323 TRVGFDGASEIPMEVFSQLLRCHISKLGRILRVLADSYRKQCSAVELLKMFLQTAGFSNL 382 Query: 821 GNLAEHVKDGSRNFSHPTQQPHLRGLQSGFQSQHPSPNLQTQQIPR-XXXXXXXXXXXQN 645 +L + VK+G+RN + PT Q G+QS F SQH + QQIPR QN Sbjct: 383 VHLMKIVKEGARNTAEPTHQ-QAHGIQSQFHSQHQNLLRLPQQIPRQMHPQMQPMVHSQN 441 Query: 644 LAFPQQPQ-LERMRRRQPSTPRAGMTLDKDRPMVEVKIENTSELPIDSNAFNPIHARHPQ 468 L F QQ Q ER+RRR STPR GM +DKD+P+V+VK+EN ELP+D+NA N H+R PQ Sbjct: 442 LTFQQQQQHFERLRRRHTSTPRPGMDVDKDKPLVQVKVENPPELPLDNNAVNAFHSRQPQ 501 Query: 467 IQFRQQSMAAAMANLNPQSNHQFKQLASLQVPQLQTQNMGTTRAPPVKVEGFQELMGGDT 288 +Q R Q + AAM+NL+ Q N+Q +QLASLQVPQ+QT NMG RAPPVKVEGFQELMGGD Sbjct: 502 MQMRHQQI-AAMSNLHAQPNNQLRQLASLQVPQMQTSNMGMVRAPPVKVEGFQELMGGDA 560 Query: 287 TLKHDSEEHKLTSPS 243 LKHD+EE+KLTSPS Sbjct: 561 ALKHDTEENKLTSPS 575 >ref|XP_006380803.1| hypothetical protein POPTR_0007s14190g [Populus trichocarpa] gi|550334853|gb|ERP58600.1| hypothetical protein POPTR_0007s14190g [Populus trichocarpa] Length = 558 Score = 602 bits (1553), Expect = e-169 Identities = 337/615 (54%), Positives = 412/615 (66%), Gaps = 2/615 (0%) Frame = -2 Query: 2081 MALLGEDGLGFELARKLETCGVWRSWLGDSNYSNFVHVLSSPLTWESFMKPQDSKTRAQI 1902 M++LG+DGLG++LARKLET G+WR+WLGDS YSNF+H LSSP +W+SFM+ DSK+++ Sbjct: 1 MSVLGDDGLGYDLARKLETLGMWRAWLGDSLYSNFLHSLSSPASWQSFMRTDDSKSKSHF 60 Query: 1901 QLQLRVRALLFDKASVSLFLRXXXXXXXXXXXXXXXSNLNPNYLQLHGDDVYFSLEDFGS 1722 QLQLR RALLFDKASVSLFLR NLNPNYLQLHGDDVYF+LED Sbjct: 61 QLQLRARALLFDKASVSLFLRSNTVAAVS--------NLNPNYLQLHGDDVYFTLED--- 109 Query: 1721 SSQDGVQHQEGSTSSNSIQSKVVQIQPKLGFSVGSRYVEPEVDNISERPRPDDFPETWYK 1542 + Q + G VG+ ++R + ++ PETWY Sbjct: 110 -----------------------EDQRREGGGVGAT---------TKRYKNEELPETWYT 137 Query: 1541 QFIEKYSTSRQQRLPFGDREALKRTSEGMSTYLKLHERHKRSRQPFKEDQYTGFGNPIWE 1362 QF+EK R RL FGDRE+ KR+ E MSTY +L RHKR QY G GN E Sbjct: 138 QFMEKRKLKRPYRLSFGDRESDKRSPEQMSTYFRLVARHKR------RCQYLGSGNSNLE 191 Query: 1361 NGSNIHSKSVSDGNNSIDDETCFFPEMMFPLNCVPDSALPSINRLEDNQKVECYGVLDSL 1182 + SN+ S SV DG++S+DD+ FFPE MF NCVPDSA+P I R DNQK+E G DSL Sbjct: 192 STSNMRSGSVLDGSHSVDDDFVFFPETMFMFNCVPDSAIPPIIRARDNQKIEFRGAFDSL 251 Query: 1181 PHVMTRSPAMLERFGIRPEYLRIGLERSKYRGKNGLEGNGKVLGPEQASQMSQKVISRVL 1002 P TR+P M+ER GI E S RGKNG EG+ K L EQA QMSQKV++ +L Sbjct: 252 PQ--TRNPVMIERLGISVEQ-----GGSLNRGKNGSEGHKK-LSEEQALQMSQKVVACLL 303 Query: 1001 VSVGFEGTTEVPMEVLSQFLGCHICKLGRILKVLTDNYRKQCSAIEILRMFLQTLGYSNL 822 VGF+G +E+PMEV SQ L CHI KLGRIL+VL D+YRKQCSA+E+L+MFLQT G+SNL Sbjct: 304 TRVGFDGASEIPMEVFSQLLRCHISKLGRILRVLADSYRKQCSAVELLKMFLQTAGFSNL 363 Query: 821 GNLAEHVKDGSRNFSHPTQQPHLRGLQSGFQSQHPSPNLQTQQIPR-XXXXXXXXXXXQN 645 +L + VK+G+RN + PT Q G+QS F SQH + QQIPR QN Sbjct: 364 VHLMKIVKEGARNTAEPTHQ-QAHGIQSQFHSQHQNLLRLPQQIPRQMHPQMQPMVHSQN 422 Query: 644 LAFPQQPQ-LERMRRRQPSTPRAGMTLDKDRPMVEVKIENTSELPIDSNAFNPIHARHPQ 468 L F QQ Q ER+RRR STPR GM +DKD+P+V+VK+EN ELP+D+NA N H+R PQ Sbjct: 423 LTFQQQQQHFERLRRRHTSTPRPGMDVDKDKPLVQVKVENPPELPLDNNAVNAFHSRQPQ 482 Query: 467 IQFRQQSMAAAMANLNPQSNHQFKQLASLQVPQLQTQNMGTTRAPPVKVEGFQELMGGDT 288 +Q R Q + AAM+NL+ Q N+Q +QLASLQVPQ+QT NMG RAPPVKVEGFQELMGGD Sbjct: 483 MQMRHQQI-AAMSNLHAQPNNQLRQLASLQVPQMQTSNMGMVRAPPVKVEGFQELMGGDA 541 Query: 287 TLKHDSEEHKLTSPS 243 LKHD+EE+KLTSPS Sbjct: 542 ALKHDTEENKLTSPS 556 >ref|XP_006466330.1| PREDICTED: uncharacterized protein LOC102616625 isoform X2 [Citrus sinensis] Length = 610 Score = 585 bits (1507), Expect = e-164 Identities = 335/659 (50%), Positives = 408/659 (61%), Gaps = 47/659 (7%) Frame = -2 Query: 2081 MALLGEDGLGFELARKLETCGVWRSWLGDSNYSNFVHVLSSPLTWESFMKPQDSKTRAQI 1902 MALLG+DG G+ELA KLE+CGVWR+WLGDS YS F H LS+P +WESFM+ DSK+RAQI Sbjct: 1 MALLGDDGRGYELALKLESCGVWRTWLGDSCYSTFHHALSTPASWESFMRTDDSKSRAQI 60 Query: 1901 QLQLRVRALLFDKASVSLFLRXXXXXXXXXXXXXXXSNLNPNYLQLHGDDVYFSLEDFGS 1722 LQLR RALLFDKA++SLFL S LNPNYLQL G DVYF+LE S Sbjct: 61 HLQLRARALLFDKATISLFL-----PSNQPPSSVAVSKLNPNYLQLDGGDVYFTLE---S 112 Query: 1721 SSQDGVQHQEGSTSSNSIQSKVVQIQPKLGFSVGSRYVEPEVDNISERPRPDDFPETWYK 1542 SSQDGVQH+E S +S++ K R R ++ PETWY Sbjct: 113 SSQDGVQHRESSAASSTTSGK--------------------------RFRNEELPETWYD 146 Query: 1541 QFIEKYSTSRQQRLPFGDREALKRTSEGMSTYLKLHERHKRSRQPFKEDQYTGFGNPIWE 1362 QFIEKY SRQ +L GDRE +RT+EGMS+YL+ E++KR R PF+ D Sbjct: 147 QFIEKYRVSRQYKLSLGDRELDRRTAEGMSSYLRHLEKYKRRRVPFQND----------- 195 Query: 1361 NGSNIHSKSVSDGNNSIDDETCFFPEMMFPLNCVPDSALPSINRLEDNQKVECYGVLDSL 1182 HS S D NS D + FFPE MF LN VP+ A+P I E Q +E GVLD+L Sbjct: 196 -----HSNSALDVINSTDSDV-FFPETMFTLNSVPEIAVPQIIVEETKQNIEFNGVLDTL 249 Query: 1181 PHVMTRSPAMLERFGIRPEYLRIGLERSKYRGKNGLEGNGKVLGPEQASQMSQKVISRVL 1002 P MT+SP M+ER GIRPEYL + E + + G + LEGN K EQASQ+SQKVI+R+L Sbjct: 250 PQCMTKSPVMIERLGIRPEYLGMEQEGNSHHGNSALEGNKKCFSEEQASQISQKVIARML 309 Query: 1001 VSVGFEGTTEVPMEVLSQFLGCHICKLGRILKVLTDNYRKQCSAIEILRMFLQTLGYSNL 822 GFEG TEVP+EVLS+ LG HICKLGRILKVL+DNYRKQCSA+E+L+MFLQ G+SNL Sbjct: 310 TGGGFEGATEVPLEVLSEMLGSHICKLGRILKVLSDNYRKQCSALELLKMFLQAAGHSNL 369 Query: 821 GNLAEHVKDGSRNFSHPTQQ---------------------------------------- 762 G LAE +KDG+RN +Q+ Sbjct: 370 GILAELIKDGTRNVVQQSQELIKDGSRNIVQQSQELIKDGTRNIVQQNQELVKEGTRNFV 429 Query: 761 ----PHLRGLQSGFQSQHPSPNLQTQQIPR-XXXXXXXXXXXQNLAFP--QQPQLERMRR 603 + G QS QS SP QQ+PR QNLAF QQ LER R Sbjct: 430 QQSPQQVHGAQSQLQSHQQSPVKLPQQVPRQMHQQMQQMVQPQNLAFQQMQQQHLERSRM 489 Query: 602 RQPSTPRAGMTLDKDRPMVEVKIENTSELPIDSNAFNPIHARHPQIQFRQQSMAAAMANL 423 RQPSTPR GM +DKDR M +V EN+S+LP+D+NA N +A+ Q+QF QQ + M+NL Sbjct: 490 RQPSTPRPGMDMDKDRSMSQVNAENSSKLPMDANALNASNAKQSQMQFHQQQL-NTMSNL 548 Query: 422 NPQSNHQFKQLASLQVPQLQTQNMGTTRAPPVKVEGFQELMGGDTTLKHDSEEHKLTSP 246 QS++QFKQ +Q+PQ+ + NMG RAPPVKV+GFQELMGGD ++KHDSEE+KLTSP Sbjct: 549 QAQSSNQFKQSTPVQIPQMHSPNMGVVRAPPVKVDGFQELMGGDASMKHDSEENKLTSP 607 >ref|XP_006466329.1| PREDICTED: uncharacterized protein LOC102616625 isoform X1 [Citrus sinensis] Length = 612 Score = 580 bits (1494), Expect = e-162 Identities = 335/661 (50%), Positives = 408/661 (61%), Gaps = 49/661 (7%) Frame = -2 Query: 2081 MALLGEDGLGFELARKLETCGVWRSWLGDSNYSNFVHVLSSPLTWESFMKPQDSKTRAQI 1902 MALLG+DG G+ELA KLE+CGVWR+WLGDS YS F H LS+P +WESFM+ DSK+RAQI Sbjct: 1 MALLGDDGRGYELALKLESCGVWRTWLGDSCYSTFHHALSTPASWESFMRTDDSKSRAQI 60 Query: 1901 QLQLRVRALLFDKASVSLFLRXXXXXXXXXXXXXXXSNLNPNYLQLHGDDVYFSLEDFGS 1722 LQLR RALLFDKA++SLFL S LNPNYLQL G DVYF+LE S Sbjct: 61 HLQLRARALLFDKATISLFL-----PSNQPPSSVAVSKLNPNYLQLDGGDVYFTLE---S 112 Query: 1721 SSQDGVQHQEGSTSSNSIQSKVVQIQPKLGFSVGSRYVEPEVDNISERPRPDDFPETWYK 1542 SSQDGVQH+E S +S++ K R R ++ PETWY Sbjct: 113 SSQDGVQHRESSAASSTTSGK--------------------------RFRNEELPETWYD 146 Query: 1541 QFIEKYSTSRQQRLPFGDREALKRTSEGMSTYLKLHERHKRSRQPFKEDQYTGFGNPIWE 1362 QFIEKY SRQ +L GDRE +RT+EGMS+YL+ E++KR R PF+ D Sbjct: 147 QFIEKYRVSRQYKLSLGDRELDRRTAEGMSSYLRHLEKYKRRRVPFQND----------- 195 Query: 1361 NGSNIHSKSVSDGNNSIDDETCFFPEMMFPLNCVPDSALPSINRLEDNQKVECYGVLDSL 1182 HS S D NS D + FFPE MF LN VP+ A+P I E Q +E GVLD+L Sbjct: 196 -----HSNSALDVINSTDSDV-FFPETMFTLNSVPEIAVPQIIVEETKQNIEFNGVLDTL 249 Query: 1181 PHVMTRSPAMLERFGIRPEYLRIGLERSKYRGKNGLEGNGKVLGPEQASQMSQKVISRVL 1002 P MT+SP M+ER GIRPEYL + E + + G + LEGN K EQASQ+SQKVI+R+L Sbjct: 250 PQCMTKSPVMIERLGIRPEYLGMEQEGNSHHGNSALEGNKKCFSEEQASQISQKVIARML 309 Query: 1001 VSVGFEGTTEVPMEVLSQFLGCHICKLGRILKVLTDNYRKQCSAIEILRMFLQTLGYSNL 822 GFEG TEVP+EVLS+ LG HICKLGRILKVL+DNYRKQCSA+E+L+MFLQ G+SNL Sbjct: 310 TGGGFEGATEVPLEVLSEMLGSHICKLGRILKVLSDNYRKQCSALELLKMFLQAAGHSNL 369 Query: 821 GNLAEHVKDGSRNFSHPTQQ---------------------------------------- 762 G LAE +KDG+RN +Q+ Sbjct: 370 GILAELIKDGTRNVVQQSQELIKDGSRNIVQQSQELIKDGTRNIVQQNQELVKEGTRNFV 429 Query: 761 ----PHLRGLQSGFQSQHPSPNL--QTQQIPR-XXXXXXXXXXXQNLAFP--QQPQLERM 609 + G QS QS SP Q Q+PR QNLAF QQ LER Sbjct: 430 QQSPQQVHGAQSQLQSHQQSPVKLPQQLQVPRQMHQQMQQMVQPQNLAFQQMQQQHLERS 489 Query: 608 RRRQPSTPRAGMTLDKDRPMVEVKIENTSELPIDSNAFNPIHARHPQIQFRQQSMAAAMA 429 R RQPSTPR GM +DKDR M +V EN+S+LP+D+NA N +A+ Q+QF QQ + M+ Sbjct: 490 RMRQPSTPRPGMDMDKDRSMSQVNAENSSKLPMDANALNASNAKQSQMQFHQQQL-NTMS 548 Query: 428 NLNPQSNHQFKQLASLQVPQLQTQNMGTTRAPPVKVEGFQELMGGDTTLKHDSEEHKLTS 249 NL QS++QFKQ +Q+PQ+ + NMG RAPPVKV+GFQELMGGD ++KHDSEE+KLTS Sbjct: 549 NLQAQSSNQFKQSTPVQIPQMHSPNMGVVRAPPVKVDGFQELMGGDASMKHDSEENKLTS 608 Query: 248 P 246 P Sbjct: 609 P 609 >ref|NP_201357.2| uncharacterized protein [Arabidopsis thaliana] gi|26451238|dbj|BAC42721.1| unknown protein [Arabidopsis thaliana] gi|28973345|gb|AAO63997.1| unknown protein [Arabidopsis thaliana] gi|332010686|gb|AED98069.1| uncharacterized protein AT5G65540 [Arabidopsis thaliana] Length = 605 Score = 552 bits (1422), Expect = e-154 Identities = 311/629 (49%), Positives = 415/629 (65%), Gaps = 15/629 (2%) Frame = -2 Query: 2081 MALLGEDGLGFELARKLETCGVWRSWLGDSNYSNFVHVLSSPLTWESFMKPQDSKTRAQI 1902 MALLG+DG GF+LARKLE GVWR+WLGDS YS+F H LSSP TWE+FM+ +SK+RAQI Sbjct: 1 MALLGDDGRGFDLARKLEVSGVWRTWLGDSIYSSFHHYLSSPSTWEAFMRVDESKSRAQI 60 Query: 1901 QLQLRVRALLFDKASVSLFLR-------XXXXXXXXXXXXXXXSNLNPNYLQLHGDDVYF 1743 QLQLRVRALLFDKA+VSLFLR S LNPNYLQLHGDDVY+ Sbjct: 61 QLQLRVRALLFDKATVSLFLRSNTIAASSSSSASISDVSSVAVSKLNPNYLQLHGDDVYY 120 Query: 1742 SLEDFGSSSQDGVQHQEGSTSSNSIQSKVVQIQPKLGFSVGSRYVEPEVDNISERPRPDD 1563 +LE+ +S + G Q + G + S+ + K F+ G+R E + N+S+R R ++ Sbjct: 121 TLEN--ASLESGFQREGGIRHNPSLTKSL----SKPSFTSGTRGSESDFSNLSQRSRFEE 174 Query: 1562 FPETWYKQFIEKYSTSRQQRLPFGDREALKRTSEGMSTYLKLHERHKRSRQPFKEDQYTG 1383 P+TWY QFI +Y + + G +E+ KRT EGMSTYL++ + HKR R PF ED+ Sbjct: 175 LPDTWYTQFISRYGF--KYGMSVGGQESDKRTPEGMSTYLRVVDTHKRKRAPFLEDRSLA 232 Query: 1382 FGNPIWENGSNIHSKSVSDGNNSIDDETCFFPEMMFPLNCVPDSALPSINRLEDNQKVEC 1203 + S+ H S DG+ S +D+ F PE MF +NCVP++AL I R +DN K E Sbjct: 233 H-----MSRSSTHPSSGFDGSTS-EDDILFLPETMFRMNCVPETALSPITRTQDNLKTEF 286 Query: 1202 YGVLDSLPHVMTRSPAMLERFGIRPEYLRI---GLERSKYRGKNGLEGNGKVLGPEQASQ 1032 YGVLD+LP V TRS M+ER G+ PEY R+ G+ RS+ K G +QA+ Sbjct: 287 YGVLDTLPQVTTRSHIMIERLGLMPEYHRMEERGVLRSRKAEKMG-------FSDDQAAL 339 Query: 1031 MSQKVISRVLVSVGFEGTTEVPMEVLSQFLGCHICKLGRILKVLTDNYRKQCSAIEILRM 852 +S+KV++R+L+++GFEG TEVP++V SQ + H+ KLGRILK+LTD+Y+K+CSA+++++M Sbjct: 340 VSRKVVARMLLTMGFEGATEVPIDVFSQLVSRHMSKLGRILKLLTDSYKKECSAMQLIKM 399 Query: 851 FLQTLGYSNLGNLAEHVKDGSRNFSHPTQ-QPHLRGLQSGFQSQHPSPNLQTQQIPRXXX 675 FL T GYSNLG+LAE VKDG+RN P Q QP + LQ Q + QQI R Sbjct: 400 FLNTTGYSNLGSLAEIVKDGTRNHPPPNQKQPQV--LQQQLHLQQQASLRLPQQIQRQMH 457 Query: 674 XXXXXXXXQNLAFPQQPQLERMRRRQPSTPRAGMTLDKDRPMVEVKIENTSELPIDSNAF 495 F QQ QLERMRRR ++PR M ++KDRP+V+VK+EN SE+ +D NAF Sbjct: 458 PQMQQMVNPQ-NFQQQQQLERMRRRPVTSPRPNMDMEKDRPLVQVKLENPSEMAVDGNAF 516 Query: 494 NPIHARHP---QIQFRQQSMAAAMANLNPQSNH-QFKQLASLQVPQLQTQNMGTTRAPPV 327 NP++ RH Q Q RQQ AAM+N+ Q + QF+QLAS+Q+PQ+QT +GT RA PV Sbjct: 517 NPMNPRHQQQLQQQLRQQQQIAAMSNMQQQPGYNQFRQLASMQIPQMQTPTLGTVRAQPV 576 Query: 326 KVEGFQELMGGDTTLKHDSEEHKLTSPSK 240 KVEGF++LMGGD++LKHDS++ + P+K Sbjct: 577 KVEGFEQLMGGDSSLKHDSDDKLRSPPTK 605 >ref|XP_006394014.1| hypothetical protein EUTSA_v10003865mg [Eutrema salsugineum] gi|557090653|gb|ESQ31300.1| hypothetical protein EUTSA_v10003865mg [Eutrema salsugineum] Length = 598 Score = 548 bits (1412), Expect = e-153 Identities = 309/622 (49%), Positives = 415/622 (66%), Gaps = 8/622 (1%) Frame = -2 Query: 2081 MALLGEDGLGFELARKLETCGVWRSWLGDSNYSNFVHVLSSPLTWESFMKPQDSKTRAQI 1902 MALLG+DG GF+LAR+LE GVWR+WLGDS Y +F H LSSP +WESFM+ DSK+RAQI Sbjct: 1 MALLGDDGRGFDLARRLEVSGVWRTWLGDSTYLSFHHYLSSPSSWESFMRVDDSKSRAQI 60 Query: 1901 QLQLRVRALLFDKASVSLFLR--XXXXXXXXXXXXXXXSNLNPNYLQLHGDDVYFSLEDF 1728 QLQLRVRALLFDKA+VSLFLR S LNPNYLQLHGDDVY++LE Sbjct: 61 QLQLRVRALLFDKATVSLFLRSNTIPASPSSDASSVAVSKLNPNYLQLHGDDVYYTLE-- 118 Query: 1727 GSSSQDGVQHQEGSTSSNSIQSKVVQIQPKLGFSVGSRYVEPEVDNISERPRPDDFPETW 1548 ++S +G ++G+ N K + K F+ G+R E + N+S+R R ++ P+TW Sbjct: 119 -NASLEGGFQRDGAIRHNPSLPKSLS---KPSFASGARGSESDFSNLSQRSRFEELPDTW 174 Query: 1547 YKQFIEKYSTSRQQRLPFGDREALKRTSEGMSTYLKLHERHKRSRQPFKEDQYTGFGNPI 1368 Y QFI +Y + + G +E+ KRT EGMSTYL++ + HKR R PF +D + Sbjct: 175 YTQFISRYGF--KYGMSVGGQESDKRTPEGMSTYLRVVDSHKRKRAPFLQDPSP--ASSA 230 Query: 1367 WENGSNIHSKSVSDGNNSIDDETCFFPEMMFPLNCVPDSALPSINRLEDNQKVECYGVLD 1188 + S+ H S DG+ S +D+ F PE MF +NCVP++AL + R DN K E YGVLD Sbjct: 231 HMSRSSTHPSSGFDGSTS-EDDILFLPETMFRMNCVPETALSPVARTHDNLKTEFYGVLD 289 Query: 1187 SLPHVMTRSPAMLERFGIRPEYLRI---GLERSKYRGKNGLEGNGKVLGPEQASQMSQKV 1017 +LP V TR+ M+ER G+ PEY R+ G+ R K K G EQA+Q+S+KV Sbjct: 290 TLPQVTTRNHVMIERLGMVPEYFRMEERGVLRRKKAEKLG-------FSDEQAAQVSRKV 342 Query: 1016 ISRVLVSVGFEGTTEVPMEVLSQFLGCHICKLGRILKVLTDNYRKQCSAIEILRMFLQTL 837 ++R+L+++G EG TEVP++V SQ + HICKLGRILK+LTD+Y+K+CSAI++++MFL T Sbjct: 343 VARILLTMGCEGATEVPIDVFSQLVSRHICKLGRILKLLTDSYKKECSAIQLIKMFLNTT 402 Query: 836 GYSNLGNLAEHVKDGSRNFSHPTQ-QPHLRGLQSGFQSQHPSPNLQTQQIPRXXXXXXXX 660 GYSNLG+LAE VKDG+RN HP Q Q + LQ Q +P QQ+ R Sbjct: 403 GYSNLGDLAELVKDGTRN--HPPQNQKQPQVLQQQLHLQQQNPLRLPQQMQRQMHPQMQQ 460 Query: 659 XXXQNLAFPQQPQLERMRRRQPSTPRAGMTLDKDRPMVEVKIENTSELPIDSNAFNPIHA 480 + F QQ Q+ERMRRRQ ++PR + ++KDRP+V+VK+EN SE+ +D NAFNP++ Sbjct: 461 MVNPH-TFQQQQQMERMRRRQVTSPRPNIDMEKDRPLVQVKLENPSEMAVDGNAFNPMNP 519 Query: 479 RHPQIQFRQQSMAAAMANLNPQSNH-QFKQLASLQVPQLQTQN-MGTTRAPPVKVEGFQE 306 RH QI +Q AAM+NL Q + QF+QLAS+Q+PQ+QT N GT RA PVKVEGF++ Sbjct: 520 RHQQI---RQQQIAAMSNLQQQPGYNQFRQLASMQIPQMQTPNTTGTVRAQPVKVEGFEQ 576 Query: 305 LMGGDTTLKHDSEEHKLTSPSK 240 LMGGD++LKH+S++ + P+K Sbjct: 577 LMGGDSSLKHESDDKLRSPPTK 598 >ref|XP_006280200.1| hypothetical protein CARUB_v10026105mg [Capsella rubella] gi|482548904|gb|EOA13098.1| hypothetical protein CARUB_v10026105mg [Capsella rubella] Length = 606 Score = 543 bits (1399), Expect = e-151 Identities = 305/627 (48%), Positives = 415/627 (66%), Gaps = 13/627 (2%) Frame = -2 Query: 2081 MALLGEDGLGFELARKLETCGVWRSWLGDSNYSNFVHVLSSPLTWESFMKPQDSKTRAQI 1902 MALLG+DG GF+LAR+LE GVWR+WLGDS YS+F H LSSP TWE+FM+ +SK R+QI Sbjct: 1 MALLGDDGRGFDLARRLEVSGVWRTWLGDSIYSSFHHYLSSPSTWEAFMRVDESKPRSQI 60 Query: 1901 QLQLRVRALLFDKASVSLFLRXXXXXXXXXXXXXXXSN------LNPNYLQLHGDDVYFS 1740 QLQLRVRALLFDKA+VSLFLR + LNPNYLQLHGDDVY++ Sbjct: 61 QLQLRVRALLFDKATVSLFLRSNSIAASSSSTSVSDVSSVAVSKLNPNYLQLHGDDVYYT 120 Query: 1739 LEDFGSSSQDGVQHQEGSTSSNSIQSKVVQIQPKLGFSVGSRYVEPEVDNISERPRPDDF 1560 LE+ +S + G Q G + S+ + K F+ G+R E + N+S+R R ++ Sbjct: 121 LEN--ASLEGGFQRDGGIRLNPSLTKSL----SKPSFTSGTRGSESDFSNLSQRSRFEEL 174 Query: 1559 PETWYKQFIEKYSTSRQQRLPFGDREALKRTSEGMSTYLKLHERHKRSRQPFKEDQYTGF 1380 P+TWY QFI +Y + + G +E+ KRT EGMSTYL++ + HKR R PF ED+ + Sbjct: 175 PDTWYTQFISRYGF--KYGMSVGGQESDKRTPEGMSTYLRVVDTHKRKRAPFLEDRNS-- 230 Query: 1379 GNPIWENGSNIHSKSVSDGNNSIDDETCFFPEMMFPLNCVPDSALPSINRLEDNQKVECY 1200 G+ + S+ H S DG++S +D+ F PE MF +NCVP++ALP I R +DN K E Y Sbjct: 231 GSSAHMSRSSTHPSSGFDGSSS-EDDILFLPETMFRMNCVPETALPPITRTQDNLKTEFY 289 Query: 1199 GVLDSLPHVMTRSPAMLERFGIRPEYLRI---GLERSKYRGKNGLEGNGKVLGPEQASQM 1029 GVLD+LP V TRS M+ER G+ PEY R+ G+ R + K G +QA+Q+ Sbjct: 290 GVLDTLPQVTTRSHVMIERLGVMPEYHRMEERGVLRRRKAEKLG-------FSDDQAAQV 342 Query: 1028 SQKVISRVLVSVGFEGTTEVPMEVLSQFLGCHICKLGRILKVLTDNYRKQCSAIEILRMF 849 S+KV++R+L+++GFEG TEVP++V SQ + HI KLGRIL++LTD+Y+K+CSA ++++MF Sbjct: 343 SRKVVARMLLTMGFEGATEVPVDVFSQLVSRHISKLGRILRLLTDSYKKECSATQLIKMF 402 Query: 848 LQTLGYSNLGNLAEHVKDGSRNFSHPTQQPHLRGLQSGFQSQHPSPNLQTQQIPRXXXXX 669 L T GYSNLG+LAE VKDG+RN P Q + LQ Q + QQI R Sbjct: 403 LNTTGYSNLGSLAELVKDGTRNHP-PLNQKQPQMLQQQLHLQQQASLRLPQQIQR-QMHP 460 Query: 668 XXXXXXQNLAFPQQPQLERMRRRQPSTPRAGMTLDKDRPMVEVKIENTSELPIDSNAFNP 489 + F QQ QLER+RRRQ ++PR M ++KDRP+V+VK+EN SE+ +D NAFNP Sbjct: 461 QMQQMVNSPTFQQQQQLERLRRRQVTSPRPNMDMEKDRPLVQVKLENPSEMAVDGNAFNP 520 Query: 488 IHARHP---QIQFRQQSMAAAMANLNPQSNH-QFKQLASLQVPQLQTQNMGTTRAPPVKV 321 ++ RH Q Q RQQ + AAM+N+ Q + QF+QLAS+Q+PQ+QT T RA PVKV Sbjct: 521 MNPRHQQQIQHQLRQQHI-AAMSNMQQQPGYNQFRQLASMQIPQMQTPTPATVRAQPVKV 579 Query: 320 EGFQELMGGDTTLKHDSEEHKLTSPSK 240 EGF++LMGGD++LKH+ ++ + P+K Sbjct: 580 EGFEQLMGGDSSLKHELDDKLRSPPTK 606 >emb|CAN70982.1| hypothetical protein VITISV_027119 [Vitis vinifera] Length = 405 Score = 542 bits (1396), Expect = e-151 Identities = 281/417 (67%), Positives = 319/417 (76%) Frame = -2 Query: 2081 MALLGEDGLGFELARKLETCGVWRSWLGDSNYSNFVHVLSSPLTWESFMKPQDSKTRAQI 1902 MALLG+DG GFELARKLE+CGVWRSWLGD+ YSNFV LSSP TWESFM+ DSK+RAQI Sbjct: 1 MALLGDDGRGFELARKLESCGVWRSWLGDALYSNFVQYLSSPNTWESFMRSDDSKSRAQI 60 Query: 1901 QLQLRVRALLFDKASVSLFLRXXXXXXXXXXXXXXXSNLNPNYLQLHGDDVYFSLEDFGS 1722 QLQLR RALLFDKASVSLFLR LNP+YLQLHGDDVYF+LE Sbjct: 61 QLQLRARALLFDKASVSLFLRSPSTPTSSLPVS----KLNPSYLQLHGDDVYFTLE---- 112 Query: 1721 SSQDGVQHQEGSTSSNSIQSKVVQIQPKLGFSVGSRYVEPEVDNISERPRPDDFPETWYK 1542 QD VQ +EG +SN+ SK IQPK FSVG RY E E+DNIS+R R ++FPETWY Sbjct: 113 --QDVVQQREGVVASNTAPSK---IQPKAAFSVGXRYAESEIDNISQRFRHEEFPETWYN 167 Query: 1541 QFIEKYSTSRQQRLPFGDREALKRTSEGMSTYLKLHERHKRSRQPFKEDQYTGFGNPIWE 1362 FIEKY SR +L FG+RE+ KRT MS Y+KL E+HK+ R FKEDQ+ GFGNPI E Sbjct: 168 LFIEKYKASRPYKLSFGERESDKRTPRDMSVYIKLLEKHKKRRVAFKEDQHMGFGNPIVE 227 Query: 1361 NGSNIHSKSVSDGNNSIDDETCFFPEMMFPLNCVPDSALPSINRLEDNQKVECYGVLDSL 1182 N S+++ SV DG NS+DD+T FFPE MF LNCVPDSAL INR+EDNQKVE YGVLD+L Sbjct: 228 NKSSMYPSSVLDGKNSVDDDTYFFPETMFTLNCVPDSALLPINRVEDNQKVEFYGVLDTL 287 Query: 1181 PHVMTRSPAMLERFGIRPEYLRIGLERSKYRGKNGLEGNGKVLGPEQASQMSQKVISRVL 1002 P VMTRSP M+ER GIRPEY + S+YR KNG EGN K+LG EQA QMSQKVI+R+L Sbjct: 288 PQVMTRSPIMIERLGIRPEYHSMEQGGSQYRNKNGTEGNRKLLGQEQALQMSQKVIARML 347 Query: 1001 VSVGFEGTTEVPMEVLSQFLGCHICKLGRILKVLTDNYRKQCSAIEILRMFLQTLGY 831 +GFE TEVPMEVLSQ L CHICKLGRILKVL+DNYRKQCSA E+L+MFLQT GY Sbjct: 348 TKMGFEVATEVPMEVLSQLLSCHICKLGRILKVLSDNYRKQCSATELLKMFLQTTGY 404 >dbj|BAA98173.1| unnamed protein product [Arabidopsis thaliana] Length = 595 Score = 538 bits (1386), Expect = e-150 Identities = 303/622 (48%), Positives = 410/622 (65%), Gaps = 8/622 (1%) Frame = -2 Query: 2081 MALLGEDGLGFELARKLETCGVWRSWLGDSNYSNFVHVLSSPLTWESFMKPQDSKTRAQI 1902 MALLG+DG GF+LARKLE GVWR+WLGDS YS+F H LSSP TWE+FM+ +SK+RAQI Sbjct: 1 MALLGDDGRGFDLARKLEVSGVWRTWLGDSIYSSFHHYLSSPSTWEAFMRVDESKSRAQI 60 Query: 1901 QLQLRVRALLFDKASVSLFLRXXXXXXXXXXXXXXXSNLNPNYLQLHGDDVYFSLEDFGS 1722 QLQLRVRALLFDKA+VSLFLR + + + LHGDDVY++LE+ + Sbjct: 61 QLQLRVRALLFDKATVSLFLRSNTIAASSSSSASIS---DVSSVALHGDDVYYTLEN--A 115 Query: 1721 SSQDGVQHQEGSTSSNSIQSKVVQIQPKLGFSVGSRYVEPEVDNISERPRPDDFPETWYK 1542 S + G Q + G + S+ + K F+ G+R E + N+S+R R ++ P+TWY Sbjct: 116 SLESGFQREGGIRHNPSLTKSL----SKPSFTSGTRGSESDFSNLSQRSRFEELPDTWYT 171 Query: 1541 QFIEKYSTSRQQRLPFGDREALKRTSEGMSTYLKLHERHKRSRQPFKEDQYTGFGNPIWE 1362 QFI +Y + + G +E+ KRT EGMSTYL++ + HKR R PF ED+ Sbjct: 172 QFISRYGF--KYGMSVGGQESDKRTPEGMSTYLRVVDTHKRKRAPFLEDRSLAH-----M 224 Query: 1361 NGSNIHSKSVSDGNNSIDDETCFFPEMMFPLNCVPDSALPSINRLEDNQKVECYGVLDSL 1182 + S+ H S DG+ S +D+ F PE MF +NCVP++AL I R +DN K E YGVLD+L Sbjct: 225 SRSSTHPSSGFDGSTS-EDDILFLPETMFRMNCVPETALSPITRTQDNLKTEFYGVLDTL 283 Query: 1181 PHVMTRSPAMLERFGIRPEYLRI---GLERSKYRGKNGLEGNGKVLGPEQASQMSQKVIS 1011 P V TRS M+ER G+ PEY R+ G+ RS+ K G +QA+ +S+KV++ Sbjct: 284 PQVTTRSHIMIERLGLMPEYHRMEERGVLRSRKAEKMG-------FSDDQAALVSRKVVA 336 Query: 1010 RVLVSVGFEGTTEVPMEVLSQFLGCHICKLGRILKVLTDNYRKQCSAIEILRMFLQTLGY 831 R+L+++GFEG TEVP++V SQ + H+ KLGRILK+LTD+Y+K+CSA+++++MFL T GY Sbjct: 337 RMLLTMGFEGATEVPIDVFSQLVSRHMSKLGRILKLLTDSYKKECSAMQLIKMFLNTTGY 396 Query: 830 SNLGNLAEHVKDGSRNFSHPTQ-QPHLRGLQSGFQSQHPSPNLQTQQIPRXXXXXXXXXX 654 SNLG+LAE VKDG+RN P Q QP + LQ Q + QQI R Sbjct: 397 SNLGSLAEIVKDGTRNHPPPNQKQPQV--LQQQLHLQQQASLRLPQQIQRQMHPQMQQMV 454 Query: 653 XQNLAFPQQPQLERMRRRQPSTPRAGMTLDKDRPMVEVKIENTSELPIDSNAFNPIHARH 474 F QQ QLERMRRR ++PR M ++KDRP+V+VK+EN SE+ +D NAFNP++ RH Sbjct: 455 NPQ-NFQQQQQLERMRRRPVTSPRPNMDMEKDRPLVQVKLENPSEMAVDGNAFNPMNPRH 513 Query: 473 P---QIQFRQQSMAAAMANLNPQSNH-QFKQLASLQVPQLQTQNMGTTRAPPVKVEGFQE 306 Q Q RQQ AAM+N+ Q + QF+QLAS+Q+PQ+QT +GT RA PVKVEGF++ Sbjct: 514 QQQLQQQLRQQQQIAAMSNMQQQPGYNQFRQLASMQIPQMQTPTLGTVRAQPVKVEGFEQ 573 Query: 305 LMGGDTTLKHDSEEHKLTSPSK 240 LMGGD++LKHDS++ + P+K Sbjct: 574 LMGGDSSLKHDSDDKLRSPPTK 595 >ref|XP_006426252.1| hypothetical protein CICLE_v10025202mg [Citrus clementina] gi|557528242|gb|ESR39492.1| hypothetical protein CICLE_v10025202mg [Citrus clementina] Length = 604 Score = 528 bits (1361), Expect = e-147 Identities = 313/653 (47%), Positives = 397/653 (60%), Gaps = 41/653 (6%) Frame = -2 Query: 2081 MALLGEDGLGFELARKLETCGVWRSWLGDSNYSNFVHVLSSPLTWESFMKPQDSKTRAQI 1902 MALLG+DG G++LA KLE+CGVWR+WLGDS YS F H LS+P +WESFM+ DSK+RAQI Sbjct: 1 MALLGDDGRGYQLALKLESCGVWRTWLGDSCYSTFHHALSTPASWESFMRTDDSKSRAQI 60 Query: 1901 QLQLRVRALLFDKASVSLFLRXXXXXXXXXXXXXXXSNLNPNYLQLHGDDVYFSLEDFGS 1722 LQLR RALLFDKA++SLFL S LNPNYLQL G DVYF+LE S Sbjct: 61 HLQLRARALLFDKATISLFL-----PSNQPPSSVAVSKLNPNYLQLDGGDVYFTLE---S 112 Query: 1721 SSQDGVQHQEGSTSSNSIQSKVVQIQPKLGFSVGSRYVEPEVDNISERPRPDDFPETWYK 1542 SSQDGVQH+E S +S++ K R R ++ PETWY Sbjct: 113 SSQDGVQHRESSAASSTTSGK--------------------------RFRNEELPETWYD 146 Query: 1541 QFIEKYSTSRQQRLPFGDREALKRTSEGMSTYLKLHERHKRSRQPFKEDQYTGFGNPIWE 1362 QFIEKY SRQ +L GDRE +RT+EGMS+YL+ E++K R PF+ D Sbjct: 147 QFIEKYRVSRQYKLSLGDRELDRRTAEGMSSYLRHLEKYKIRRVPFQND----------- 195 Query: 1361 NGSNIHSKSVSDGNNSIDDETCFFPEMMFPLNCVPDSALPSINRLEDNQKVECYGVLDSL 1182 HS S D NS D + FFPE MF LN VP+ A+P I E Q +E GVLD+L Sbjct: 196 -----HSNSALDVINSTDSDV-FFPETMFTLNSVPEIAVPQIIVEETKQNIEFNGVLDTL 249 Query: 1181 PHVMTRSPAMLERFGIRPEYLRIGLERSKYRGKNGLEGNGKVLGPEQASQMSQKVISRVL 1002 P MT+SP M+ER GIRPEYL + E + + G + LEGN K EQASQ+SQKVI+R+L Sbjct: 250 PQCMTKSPVMIERLGIRPEYLGMEQEGNSHHGNSALEGNKKCFSEEQASQISQKVIARML 309 Query: 1001 VSVGFEGTTEVPMEVLSQFLGCHICKLGRILKVLTDNYRKQCSAIEILRMFLQTLGYSNL 822 GFEG TEVP+EVLS+ LG HICKLGRILKVL+DNYRKQCSA+E+L+MFLQ G+SN Sbjct: 310 TGGGFEGATEVPLEVLSEMLGSHICKLGRILKVLSDNYRKQCSALELLKMFLQAAGHSNF 369 Query: 821 GNLAEHVKDGSRNFSHPTQQPHLRGLQSGFQSQ------------HPSPNL--------- 705 G LAE +KDG+RN +Q+ G ++ Q S L Sbjct: 370 GILAELIKDGNRNAVQQSQELIKDGSRNIVQQSQELIKDGARNVVQQSQELIKDGTRNIV 429 Query: 704 -QTQQIPRXXXXXXXXXXXQNL--------AFPQQP-QLERMRRRQPSTPRAGMT----- 570 Q Q++ + Q + + Q P +L + + +Q R+ M Sbjct: 430 QQNQELVKEGTRNFVQQSPQQVHGAQSQLQSHQQSPVKLPQQQMQQQHLERSRMRQPSTP 489 Query: 569 ---LDKD--RPMVEVKIENTSELPIDSNAFNPIHARHPQIQFRQQSMAAAMANLNPQSNH 405 +D D R M +V EN+S+LP+D+NA N +A+ Q+QF QQ + M+NL QS++ Sbjct: 490 RPGMDMDKDRSMSQVNAENSSKLPMDANALNASNAKQSQMQFHQQQL-NTMSNLQAQSSN 548 Query: 404 QFKQLASLQVPQLQTQNMGTTRAPPVKVEGFQELMGGDTTLKHDSEEHKLTSP 246 QFKQ +Q+PQ+ + NMG RAPPVKV+GFQELMGGD ++KHDSEE+KLTSP Sbjct: 549 QFKQSTPVQIPQMHSPNMGVVRAPPVKVDGFQELMGGDASMKHDSEENKLTSP 601 >ref|XP_002864962.1| hypothetical protein ARALYDRAFT_496788 [Arabidopsis lyrata subsp. lyrata] gi|297310797|gb|EFH41221.1| hypothetical protein ARALYDRAFT_496788 [Arabidopsis lyrata subsp. lyrata] Length = 603 Score = 528 bits (1360), Expect = e-147 Identities = 303/627 (48%), Positives = 408/627 (65%), Gaps = 15/627 (2%) Frame = -2 Query: 2081 MALLGEDGLGFELARKLETCGVWRSWLGDSNYSNFVHVLSSPLTWESFMKPQDSKTRAQI 1902 MALLG+DG GF+LAR+LE GVWR+WLGDS YS+F H L+SP WE+FM+ +SK RAQI Sbjct: 1 MALLGDDGRGFDLARRLELSGVWRTWLGDSIYSSFHHYLTSPSNWEAFMRVDESKCRAQI 60 Query: 1901 QLQLRVRALLFDKASVSLFLR-------XXXXXXXXXXXXXXXSNLNPNYLQLHGDDVYF 1743 QLQLRVRALLFDKA+VSLFLR S LNPNYLQLHGDDVY+ Sbjct: 61 QLQLRVRALLFDKATVSLFLRSNTIAASSSSSASISDVSSVAVSKLNPNYLQLHGDDVYY 120 Query: 1742 SLEDFGSSSQDGVQHQEGSTSSNSIQSKVVQIQPKLGFSVGSRYVEPEVDNISERPRPDD 1563 +LE+ +S + G Q G + S+ + K F G+R E + N+S+R R ++ Sbjct: 121 TLEN--ASLESGFQRDGGIRHNQSLTKSL----SKPSFISGTRGSESDFSNLSQRSRFEE 174 Query: 1562 FPETWYKQFIEKYSTSRQQRLPFGDREALKRTSEGMSTYLKLHERHKRSRQPFKEDQYTG 1383 P+TWY QFI +Y + + G +E+ KRT EGMSTYL++ + HKR R PF ED+ Sbjct: 175 LPDTWYTQFISRYGF--KYGMSVGGQESDKRTPEGMSTYLRVVDTHKRKRAPFLEDRSLA 232 Query: 1382 FGNPIWENGSNIHSKSVSDGNNSIDDETCFFPEMMFPLNCVPDSALPSINRLEDNQKVEC 1203 + S+ H S DG +S +D+ F PE MF +NCVP++AL + R +DN K E Sbjct: 233 H-----MSRSSTHPSSGFDGRSS-EDDILFLPETMFRMNCVPETALSPVTRTQDNLKTEF 286 Query: 1202 YGVLDSLPHVMTRSPAMLERFGIRPEYLRI---GLERSKYRGKNGLEGNGKVLGPEQASQ 1032 YGVLD+LP V TRS M+ER G+ PEY R+ G+ R + K G +QA+ Sbjct: 287 YGVLDTLPQVTTRSHIMIERLGMMPEYHRMEDRGVLRRRKAEKLG-------FSDDQAAL 339 Query: 1031 MSQKVISRVLVSVGFEGTTEVPMEVLSQFLGCHICKLGRILKVLTDNYRKQCSAIEILRM 852 +S+KV++R+L+++GFEG TEVP++V SQ + H+ KLG ILK+L+D+Y+K+CSA+++++M Sbjct: 340 VSRKVVARMLLTMGFEGATEVPIDVFSQLVSRHMSKLGHILKLLSDSYKKECSAMQLIKM 399 Query: 851 FLQTLGYSNLGNLAEHVKDGSRNFSHPTQ-QPHLRGLQSGFQSQHPSPNLQTQQIPRXXX 675 FL T GYSNLG+LAE VKDG+RN P Q QP + LQ Q + QQI R Sbjct: 400 FLNTTGYSNLGSLAELVKDGTRNHPPPNQKQPQV--LQQQLHLQQQASLRLPQQIQRQMH 457 Query: 674 XXXXXXXXQNLAFPQQPQLERMRRRQPSTPRAGMTLDKDRPMVEVKIENTSELPIDSNAF 495 F QQ QLERMRRR ++PR M ++KDRP+V+VK+EN S++ +D NAF Sbjct: 458 PQMQQMVNPQ-NFQQQQQLERMRRRPVTSPRPNMDMEKDRPLVQVKLENPSDMAVDGNAF 516 Query: 494 NPIHARHP---QIQFRQQSMAAAMANLNPQSNH-QFKQLASLQVPQLQTQNMGTTRAPPV 327 NP++ RH Q Q RQQ + AA +N+ Q + QF+QLAS+Q+PQ+QT GT RA PV Sbjct: 517 NPMNPRHQQQMQQQLRQQQI-AAKSNMQQQPGYSQFRQLASMQIPQMQTPTPGTVRAQPV 575 Query: 326 KVEGFQELMGGDTTLKHDSEEHKLTSP 246 KVEGF++LMGGD++LKH+S++ KL SP Sbjct: 576 KVEGFEQLMGGDSSLKHESDD-KLRSP 601 >gb|EYU30927.1| hypothetical protein MIMGU_mgv1a003113mg [Mimulus guttatus] Length = 607 Score = 516 bits (1329), Expect = e-143 Identities = 312/650 (48%), Positives = 409/650 (62%), Gaps = 36/650 (5%) Frame = -2 Query: 2081 MALLGEDGLGFELARKLETCGVWRSWLGDSNYSNFVHVLSSPLTWESFMKPQDSKTRAQI 1902 MALLG+DG GFELARKLE+ GVWR WLGD++YS F++ L+SP W+ FM+ SKT+ QI Sbjct: 1 MALLGDDGRGFELARKLESHGVWRPWLGDAHYSAFINFLASPEKWDIFMRADKSKTKDQI 60 Query: 1901 QLQLRVRALLFDKASVSLFLRXXXXXXXXXXXXXXXSNLNPNYLQLHGDDVYFSLEDFGS 1722 LQLR RALLFDKASVSLF + S LNPNYL+LHGDDVYF+ ED Sbjct: 61 YLQLRARALLFDKASVSLFTQ--------SPPPAPVSKLNPNYLELHGDDVYFTFED--- 109 Query: 1721 SSQDGVQHQEGSTSSNSIQSKVVQIQPKLGFSVGSRYVEPEVDNISERPRPDDFPETWYK 1542 ++D Q Q +SN+ SK K VGSR+ E E +E + ++ PETWY Sbjct: 110 GAKDVDQRQPSLAASNTTSSKGYS---KTSVGVGSRFNETE----TETDKLEELPETWYS 162 Query: 1541 QFIEKYSTSRQQRLPFGDREALKRTSEGMSTYLKLHERHKRSRQPFKEDQYTGFGNPIWE 1362 QF EKY S+ RL FGDRE+ KRT E MSTYL++ E HKR R F + Sbjct: 163 QFFEKYRASKSYRLIFGDRESEKRTPEQMSTYLRVLENHKRRRVAFV------------D 210 Query: 1361 NGSNIHSKSVSDGNNSIDDETCFFPEMMFPLNCVPDSALPSINRLEDNQKVECYGVLDSL 1182 N SN+ S+S+ D+ FPE MF LNCVPDSA+ + LE++QK++ GVLD+L Sbjct: 211 NTSNLRPNSLSE-----LDDIPLFPETMFTLNCVPDSAVLQTSGLENHQKLQFNGVLDNL 265 Query: 1181 PHVMTR----SPAMLERFGIRPEYLRIGLERSKYRGKNGLEGNGKVLGPEQASQMSQKVI 1014 P +MT+ SP M+ER GIRPE+L + + RG+N G+ ++ G EQA Q+S+KV+ Sbjct: 266 PQIMTKSTMISPIMIERLGIRPEFLNM----EQTRGRN---GSMRIRGEEQAVQISKKVV 318 Query: 1013 SRVLVSVGFEGTTEVPMEVLSQFLGCHICKLGRILKVLTDNYRKQCSAIEILRMFLQTLG 834 +R+L +VGFE +++ +EVL Q L CHI KLGR LK+L+D+YRKQCSA E+++MFLQT G Sbjct: 319 ARLLTNVGFESCSDLSLEVLPQLLSCHIGKLGRTLKLLSDSYRKQCSANELVKMFLQTAG 378 Query: 833 YS-NLGNLAEHVKDGSRNFSHPTQQPHLRGLQSGFQSQHPSPNLQTQQIPR--XXXXXXX 663 YS N+G L + +KD ++N P QQ ++ +Q+ Q Q L +QQIPR Sbjct: 379 YSNNMGALVQIIKDNTKNGVQPNQQ-QVQAIQAQLQLQQQPSILPSQQIPRQINPQMQQQ 437 Query: 662 XXXXQNLAF-PQQPQLERMRRRQPS-TPRAGMT-------------LDKD-RPMVEVKIE 531 Q LAF QQ Q ERMRRRQ PR GM +DKD RP+V+VK+E Sbjct: 438 MNNAQYLAFQQQQQQWERMRRRQQQPAPRPGMNTNVNMNMNTNTNMIDKDNRPLVQVKME 497 Query: 530 NTSELPIDSNAFNPIHARHPQI----------QFRQQSMA---AAMANLNPQSNHQFKQL 390 N SE P+D+NAF +++RHPQ+ Q QQ +A A N N +N+ F+ + Sbjct: 498 NPSEFPLDANAFAAVNSRHPQLLQIRHQQEQQQLAQQQLAQQVQANNNNNNNNNNVFRPM 557 Query: 389 ASLQVPQLQTQNMGTTRAPPVKVEGFQELMGGDTTLKHDSEEHKLTSPSK 240 SLQ+PQ+ + +M RAPPVKVEGFQELMGGD+++KHDSEE+KL SP K Sbjct: 558 TSLQIPQILSPSMSMPRAPPVKVEGFQELMGGDSSIKHDSEENKLLSPQK 607 >ref|XP_007047764.1| Transcription initiation factor TFIID subunit 8, putative isoform 2 [Theobroma cacao] gi|508700025|gb|EOX91921.1| Transcription initiation factor TFIID subunit 8, putative isoform 2 [Theobroma cacao] Length = 489 Score = 506 bits (1302), Expect = e-140 Identities = 269/467 (57%), Positives = 329/467 (70%), Gaps = 2/467 (0%) Frame = -2 Query: 2081 MALLGEDGLGFELARKLETCGVWRSWLGDSNYSNFVHVLSSPLTWESFMKPQDSKTRAQI 1902 MALLG+DG G++LAR+LE+CGVWR+WLGDS Y++F+H LSSP WESFM+ DSK+R+QI Sbjct: 1 MALLGDDGRGYDLARRLESCGVWRAWLGDSTYASFIHFLSSPSAWESFMRVDDSKSRSQI 60 Query: 1901 QLQLRVRALLFDKASVSLFLRXXXXXXXXXXXXXXXS--NLNPNYLQLHGDDVYFSLEDF 1728 LQLR RALLFDKA+V+LFLR + LNPNYLQLHGDDVYF+LE Sbjct: 61 HLQLRARALLFDKATVALFLRSNSSNPANNTSSSSVAVSKLNPNYLQLHGDDVYFTLE-- 118 Query: 1727 GSSSQDGVQHQEGSTSSNSIQSKVVQIQPKLGFSVGSRYVEPEVDNISERPRPDDFPETW 1548 GS Q+G ++N+ SK K FS GSRY E E D++S+R R ++ PETW Sbjct: 119 GSL-------QDGGAAANAAPSK-----SKSSFSAGSRYGESEFDSLSQRYRKEELPETW 166 Query: 1547 YKQFIEKYSTSRQQRLPFGDREALKRTSEGMSTYLKLHERHKRSRQPFKEDQYTGFGNPI 1368 Y QFIEKY SR +L GDRE+ KRT E M+TYL++ E+HKR R F+EDQY G+G+ Sbjct: 167 YNQFIEKYRLSRPYKLFLGDRESEKRTPEEMTTYLRIVEKHKRRRVAFQEDQYMGYGS-- 224 Query: 1367 WENGSNIHSKSVSDGNNSIDDETCFFPEMMFPLNCVPDSALPSINRLEDNQKVECYGVLD 1188 + + S SV DGNNS DDE FFPE+M +NCVPDSALP R+ D + +E YGVLD Sbjct: 225 ----TGLESNSVLDGNNSGDDEIPFFPEIMSMMNCVPDSALPPATRVWDKKTIEFYGVLD 280 Query: 1187 SLPHVMTRSPAMLERFGIRPEYLRIGLERSKYRGKNGLEGNGKVLGPEQASQMSQKVISR 1008 +LP V TRSP M+ER GIRPEYL + + +RGKN N K+LG EQASQMS+KVI+R Sbjct: 281 TLPQVSTRSPVMIERLGIRPEYLNMEQGGNTHRGKN----NRKLLGQEQASQMSRKVIAR 336 Query: 1007 VLVSVGFEGTTEVPMEVLSQFLGCHICKLGRILKVLTDNYRKQCSAIEILRMFLQTLGYS 828 +L VGFEG TE P+EV SQFL CHIC+LGR +KVLTDNYRKQCSAIE++RMFLQT GYS Sbjct: 337 LLNGVGFEGATEAPVEVFSQFLSCHICRLGRNIKVLTDNYRKQCSAIELIRMFLQTSGYS 396 Query: 827 NLGNLAEHVKDGSRNFSHPTQQPHLRGLQSGFQSQHPSPNLQTQQIP 687 N G LAE VKD +RN T Q + G+QS Q QH + QQ+P Sbjct: 397 NFGTLAELVKDSTRNVVQQTPQ-QMHGIQSQLQPQHQNALRMAQQLP 442 Score = 88.6 bits (218), Expect = 1e-14 Identities = 43/74 (58%), Positives = 50/74 (67%) Frame = -2 Query: 467 IQFRQQSMAAAMANLNPQSNHQFKQLASLQVPQLQTQNMGTTRAPPVKVEGFQELMGGDT 288 +Q Q M + L PQ + + L + Q+ QNMG RAPPVKVEGFQELMGGDT Sbjct: 413 VQQTPQQMHGIQSQLQPQHQNALRMAQQLPMRQMHPQNMGIVRAPPVKVEGFQELMGGDT 472 Query: 287 TLKHDSEEHKLTSP 246 TLKHDSEE+KLTSP Sbjct: 473 TLKHDSEENKLTSP 486 >ref|XP_007047765.1| Transcription initiation factor TFIID subunit 8, putative isoform 3 [Theobroma cacao] gi|508700026|gb|EOX91922.1| Transcription initiation factor TFIID subunit 8, putative isoform 3 [Theobroma cacao] Length = 445 Score = 502 bits (1293), Expect = e-139 Identities = 268/465 (57%), Positives = 327/465 (70%), Gaps = 2/465 (0%) Frame = -2 Query: 2081 MALLGEDGLGFELARKLETCGVWRSWLGDSNYSNFVHVLSSPLTWESFMKPQDSKTRAQI 1902 MALLG+DG G++LAR+LE+CGVWR+WLGDS Y++F+H LSSP WESFM+ DSK+R+QI Sbjct: 1 MALLGDDGRGYDLARRLESCGVWRAWLGDSTYASFIHFLSSPSAWESFMRVDDSKSRSQI 60 Query: 1901 QLQLRVRALLFDKASVSLFLRXXXXXXXXXXXXXXXS--NLNPNYLQLHGDDVYFSLEDF 1728 LQLR RALLFDKA+V+LFLR + LNPNYLQLHGDDVYF+LE Sbjct: 61 HLQLRARALLFDKATVALFLRSNSSNPANNTSSSSVAVSKLNPNYLQLHGDDVYFTLE-- 118 Query: 1727 GSSSQDGVQHQEGSTSSNSIQSKVVQIQPKLGFSVGSRYVEPEVDNISERPRPDDFPETW 1548 GS Q+G ++N+ SK K FS GSRY E E D++S+R R ++ PETW Sbjct: 119 GSL-------QDGGAAANAAPSK-----SKSSFSAGSRYGESEFDSLSQRYRKEELPETW 166 Query: 1547 YKQFIEKYSTSRQQRLPFGDREALKRTSEGMSTYLKLHERHKRSRQPFKEDQYTGFGNPI 1368 Y QFIEKY SR +L GDRE+ KRT E M+TYL++ E+HKR R F+EDQY G+G+ Sbjct: 167 YNQFIEKYRLSRPYKLFLGDRESEKRTPEEMTTYLRIVEKHKRRRVAFQEDQYMGYGS-- 224 Query: 1367 WENGSNIHSKSVSDGNNSIDDETCFFPEMMFPLNCVPDSALPSINRLEDNQKVECYGVLD 1188 + + S SV DGNNS DDE FFPE+M +NCVPDSALP R+ D + +E YGVLD Sbjct: 225 ----TGLESNSVLDGNNSGDDEIPFFPEIMSMMNCVPDSALPPATRVWDKKTIEFYGVLD 280 Query: 1187 SLPHVMTRSPAMLERFGIRPEYLRIGLERSKYRGKNGLEGNGKVLGPEQASQMSQKVISR 1008 +LP V TRSP M+ER GIRPEYL + + +RGKN N K+LG EQASQMS+KVI+R Sbjct: 281 TLPQVSTRSPVMIERLGIRPEYLNMEQGGNTHRGKN----NRKLLGQEQASQMSRKVIAR 336 Query: 1007 VLVSVGFEGTTEVPMEVLSQFLGCHICKLGRILKVLTDNYRKQCSAIEILRMFLQTLGYS 828 +L VGFEG TE P+EV SQFL CHIC+LGR +KVLTDNYRKQCSAIE++RMFLQT GYS Sbjct: 337 LLNGVGFEGATEAPVEVFSQFLSCHICRLGRNIKVLTDNYRKQCSAIELIRMFLQTSGYS 396 Query: 827 NLGNLAEHVKDGSRNFSHPTQQPHLRGLQSGFQSQHPSPNLQTQQ 693 N G LAE VKD +RN T Q + G+QS Q QH + QQ Sbjct: 397 NFGTLAELVKDSTRNVVQQTPQ-QMHGIQSQLQPQHQNALRMAQQ 440