BLASTX nr result
ID: Akebia24_contig00008814
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia24_contig00008814 (2245 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_007208312.1| hypothetical protein PRUPE_ppa003035mg [Prun...    698   0.0
ref|XP_007047763.1| Transcription initiation factor TFIID subuni...    692   0.0
ref|XP_002533519.1| conserved hypothetical protein [Ricinus comm...    651   0.0
ref|XP_004288527.1| PREDICTED: uncharacterized protein LOC101303...    646   0.0
gb|EXC35477.1| hypothetical protein L484_026784 [Morus notabilis]      635   e-179
ref|XP_004143440.1| PREDICTED: uncharacterized protein LOC101223...    627   e-177
ref|XP_002310863.2| hypothetical protein POPTR_0007s14190g [Popu...    608   e-171
ref|XP_006380803.1| hypothetical protein POPTR_0007s14190g [Popu...    603   e-170
ref|XP_006466330.1| PREDICTED: uncharacterized protein LOC102616...    586   e-164
ref|XP_006466329.1| PREDICTED: uncharacterized protein LOC102616...    581   e-163
ref|NP_201357.2| uncharacterized protein [Arabidopsis thaliana] ...    553   e-154
ref|XP_006394014.1| hypothetical protein EUTSA_v10003865mg [Eutr...    549   e-153
ref|XP_006280200.1| hypothetical protein CARUB_v10026105mg [Caps...    544   e-152
emb|CAN70982.1| hypothetical protein VITISV_027119 [Vitis vinifera]    543   e-151
dbj|BAA98173.1| unnamed protein product [Arabidopsis thaliana]         539   e-150
ref|XP_006426252.1| hypothetical protein CICLE_v10025202mg [Citr...    530   e-147
ref|XP_002864962.1| hypothetical protein ARALYDRAFT_496788 [Arab...    529   e-147
gb|EYU30927.1| hypothetical protein MIMGU_mgv1a003113mg [Mimulus...    517   e-144
ref|XP_007047764.1| Transcription initiation factor TFIID subuni...
507 e-140 ref|XP_007047765.1| Transcription initiation factor TFIID subuni... 503 e-139 >ref|XP_007208312.1| hypothetical protein PRUPE_ppa003035mg [Prunus persica] gi|462403954|gb|EMJ09511.1| hypothetical protein PRUPE_ppa003035mg [Prunus persica] Length = 610 Score = 698 bits (1802), Expect = 0.0 Identities = 376/621 (60%), Positives = 442/621 (71%), Gaps = 9/621 (1%) Frame = -3 Query: 2225 MALLGEDGLGFELARKLETCGVWRSWLGDSNYSNFVHVLSSPLTWESFMKPQDSKTRAQI 2046 MALLG+DG G+ELA KLE+C VWRSWLGDS Y+NF L+SP TWE+FM DSK+RA + Sbjct: 1 MALLGDDGRGYELACKLESCNVWRSWLGDSTYANFAPFLNSPSTWEAFM---DSKSRAHL 57 Query: 2045 QLQLRVRALLFDKASVSLFLRXXXXXXXXXXXXXXXSNL-----NPNYLQLHGDDVYFSL 1881 LQLR RALLFDKA VSLFLR S+L NP YLQLH DDVYF+L Sbjct: 58 HLQLRARALLFDKACVSLFLRPHSNSSSSSSSSSSSSSLAVSKLNPYYLQLHPDDVYFTL 117 Query: 1880 EDFGSSSQDGVQHQEGSTSSNSIQSKVVQIQPKLGFSVGSRYVEPEVDNISERPRPDDFP 1701 E+ SSQDGVQ Q+ S +S +IQ K F VGSRY E E+DN R + D+ P Sbjct: 118 EN---SSQDGVQVQQRDPSVSS------KIQSKAAFGVGSRYGESEIDNKPSRFKNDELP 168 Query: 1700 ETWYKQFIEKYSTSRQHRLPFGDREALKRTSEGMSTYLKLHERHKRSRQPFKEDQYTGFG 1521 ETWY QF+E+Y S+ +RL DRE+ KRT E MS YLKL ERHK+ R FKEDQY G+G Sbjct: 169 ETWYNQFMERYRISKPYRLSSADRESEKRTPEEMSAYLKLLERHKKRRLAFKEDQYMGYG 228 Query: 1520 NPIWENGSNIHSKSVSDGNNSIDDETCFFPEMMFPLNCVPDSALPSINRLEDNQKVECYG 1341 NPI EN S+++ SV DG+NS+D E FFPE MF NCVPDSALP +NR EDNQKVECYG Sbjct: 229 NPILENVSHMNPNSVLDGSNSVDSEISFFPETMFTFNCVPDSALPPLNREEDNQKVECYG 288 Query: 1340 VLDSLPHVMTRSPAMLERFGIRPEYLRIGLERSKYRGKNGLEGNGKVLGPEQASQMSQKV 1161 VLD LP +MTRSP MLER GIRPEYL + +RGKNG GN K L EQA+Q+SQ V Sbjct: 289 VLDMLPQIMTRSPVMLERLGIRPEYLSMEQGGILHRGKNGSGGNRKCLSKEQAAQLSQTV 348 Query: 1160 ISRVLVSVGFEGTTEVPMEVLSQFLGCHICKLGRILKVLTDNYRKQCSAIEILRMFLQTL 981 I+R+L S+GFE TEVP++V SQ L CHI KLG LKVLTD+YRKQCSAIE+L+MFLQT+ Sbjct: 349 IARMLTSIGFESATEVPIDVFSQMLSCHISKLGGSLKVLTDSYRKQCSAIELLKMFLQTI 408 Query: 980 GYSNLGNLAEHVKDGSRNFSHPTQQPHLRGLQSGFQSQHPSPNLQTQQIPR-XXXXXXXX 804 GYSN G L E VKDGSRNF QQ H G QS Q QH +P QQ R Sbjct: 409 
GYSNFGPLMEQVKDGSRNFQQTQQQIH--GSQSQLQPQHQNPIRLPQQTSRQMLPQMQQV 466 Query: 803 XXXQNLAFPQQPQLERMRRRQPSTPRAGMTLDKDRPMVEVKIENTSELPIDSNAFNPIHA 624 +N+ F QQ LERMRRRQPSTPRAGM +DKDRPMV+VKIE SELP+D NAF ++ Sbjct: 467 ALSKNVPFQQQQPLERMRRRQPSTPRAGMDMDKDRPMVQVKIEAPSELPMDGNAFYGLNN 526 Query: 623 RHPQIQFRQQSMAAA---MANLNPQSNHQFKQLASLQVPQLQTQNMGTTRAPPVKVEGFQ 453 R+ Q+QFRQQ A + M N++PQS +QF+Q+ASLQ+PQ+Q QN G RAPPVKVEGFQ Sbjct: 527 RNLQMQFRQQIPAMSNLTMPNVHPQSGNQFRQMASLQIPQMQAQNAGVLRAPPVKVEGFQ 586 Query: 452 ELMGGDTTLKHDSEEHKLTSP 390 ELMGGD + KHDS+E++LTSP Sbjct: 587 ELMGGDASSKHDSDENRLTSP 607 >ref|XP_007047763.1| Transcription initiation factor TFIID subunit 8, putative isoform 1 [Theobroma cacao] gi|508700024|gb|EOX91920.1| Transcription initiation factor TFIID subunit 8, putative isoform 1 [Theobroma cacao] Length = 593 Score = 692 bits (1785), Expect = 0.0 Identities = 369/616 (59%), Positives = 446/616 (72%), Gaps = 4/616 (0%) Frame = -3 Query: 2225 MALLGEDGLGFELARKLETCGVWRSWLGDSNYSNFVHVLSSPLTWESFMKPQDSKTRAQI 2046 MALLG+DG G++LAR+LE+CGVWR+WLGDS Y++F+H LSSP WESFM+ DSK+R+QI Sbjct: 1 MALLGDDGRGYDLARRLESCGVWRAWLGDSTYASFIHFLSSPSAWESFMRVDDSKSRSQI 60 Query: 2045 QLQLRVRALLFDKASVSLFLRXXXXXXXXXXXXXXXS--NLNPNYLQLHGDDVYFSLEDF 1872 LQLR RALLFDKA+V+LFLR + LNPNYLQLHGDDVYF+LE Sbjct: 61 HLQLRARALLFDKATVALFLRSNSSNPANNTSSSSVAVSKLNPNYLQLHGDDVYFTLE-- 118 Query: 1871 GSSSQDGVQHQEGSTSSNSIQSKVVQIQPKLGFSVGSRYVEPEVDNISERPRPDDFPETW 1692 GS Q+G ++N+ SK K FS GSRY E E D++S+R R ++ PETW Sbjct: 119 GSL-------QDGGAAANAAPSK-----SKSSFSAGSRYGESEFDSLSQRYRKEELPETW 166 Query: 1691 YKQFIEKYSTSRQHRLPFGDREALKRTSEGMSTYLKLHERHKRSRQPFKEDQYTGFGNPI 1512 Y QFIEKY SR ++L GDRE+ KRT E M+TYL++ E+HKR R F+EDQY G+G+ Sbjct: 167 YNQFIEKYRLSRPYKLFLGDRESEKRTPEEMTTYLRIVEKHKRRRVAFQEDQYMGYGS-- 224 Query: 1511 WENGSNIHSKSVSDGNNSIDDETCFFPEMMFPLNCVPDSALPSINRLEDNQKVECYGVLD 1332 + + S SV DGNNS DDE FFPE+M +NCVPDSALP R+ D + +E YGVLD Sbjct: 225 ----TGLESNSVLDGNNSGDDEIPFFPEIMSMMNCVPDSALPPATRVWDKKTIEFYGVLD 280 Query: 1331 
SLPHVMTRSPAMLERFGIRPEYLRIGLERSKYRGKNGLEGNGKVLGPEQASQMSQKVISR 1152 +LP V TRSP M+ER GIRPEYL + + +RGKN N K+LG EQASQMS+KVI+R Sbjct: 281 TLPQVSTRSPVMIERLGIRPEYLNMEQGGNTHRGKN----NRKLLGQEQASQMSRKVIAR 336 Query: 1151 VLVSVGFEGTTEVPMEVLSQFLGCHICKLGRILKVLTDNYRKQCSAIEILRMFLQTLGYS 972 +L VGFEG TE P+EV SQFL CHIC+LGR +KVLTDNYRKQCSAIE++RMFLQT GYS Sbjct: 337 LLNGVGFEGATEAPVEVFSQFLSCHICRLGRNIKVLTDNYRKQCSAIELIRMFLQTSGYS 396 Query: 971 NLGNLAEHVKDGSRNFSHPTQQPHLRGLQSGFQSQHPSPNLQTQQIP--RXXXXXXXXXX 798 N G LAE VKD +RN T Q + G+QS Q QH + QQ+P + Sbjct: 397 NFGTLAELVKDSTRNVVQQTPQ-QMHGIQSQLQPQHQNALRMAQQLPMRQMHPQMQQMVH 455 Query: 797 XQNLAFPQQPQLERMRRRQPSTPRAGMTLDKDRPMVEVKIENTSELPIDSNAFNPIHARH 618 QNL F QQ QLER+RRR PSTPR M +DKDRPMV+VKIEN SELP+DSNAFNPI+ RH Sbjct: 456 PQNLTFQQQQQLERIRRRHPSTPRPVMDMDKDRPMVQVKIENPSELPMDSNAFNPINTRH 515 Query: 617 PQIQFRQQSMAAAMANLNPQSNHQFKQLASLQVPQLQTQNMGTTRAPPVKVEGFQELMGG 438 Q+QFRQQ AA++NL+ Q ++QF+QL S Q+ Q+QTQNMG RAPPVKVEGFQELMGG Sbjct: 516 SQMQFRQQQF-AAISNLHAQPSNQFRQLMSPQIHQMQTQNMGIVRAPPVKVEGFQELMGG 574 Query: 437 DTTLKHDSEEHKLTSP 390 DTTLKHDSEE+KLTSP Sbjct: 575 DTTLKHDSEENKLTSP 590 >ref|XP_002533519.1| conserved hypothetical protein [Ricinus communis] gi|223526616|gb|EEF28863.1| conserved hypothetical protein [Ricinus communis] Length = 573 Score = 651 bits (1679), Expect = 0.0 Identities = 353/615 (57%), Positives = 437/615 (71%), Gaps = 2/615 (0%) Frame = -3 Query: 2225 MALLGEDGLGFELARKLETCGVWRSWLGDSNYSNFVHVLSSPLTWESFMKPQDSKTRAQI 2046 M+LLG+DG G++LARKLE+ G WR+WLGDS YSNFVH LSSP +W+SFM+ DSK++AQI Sbjct: 1 MSLLGDDGNGYDLARKLESLGTWRTWLGDSLYSNFVHFLSSPSSWDSFMRTDDSKSKAQI 60 Query: 2045 QLQLRVRALLFDKASVSLFLRXXXXXXXXXXXXXXXSNLNPNYLQLHGDDVYFSLEDFGS 1866 LQLR RALLFDKA+VSLF+ LNP+YLQLHGDDVYF+LED Sbjct: 61 HLQLRARALLFDKATVSLFISNNNNSCSALAVS----KLNPSYLQLHGDDVYFTLED--- 113 Query: 1865 SSQDGVQHQEGSTSSNSIQSKVVQIQPKLGFSVGSRYVEPEVDNISERPRPDDFPETWYK 1686 G Q Q + S + +S FS+GSRY EPE++ +++R R ++FPE+WY Sbjct: 114 ----GDQRQNAALSKSHSKS---------AFSIGSRYGEPEMEGLTQRFRNEEFPESWYN 160 Query: 
1685 QFIEKYSTSRQHRLPFGDREALKRTSEGMSTYLKLHERHKRSRQPFKEDQYTGFGNPIWE 1506 QFIEKY SR +RL G+RE+ KR+ E MS+YL+L ++HKR R Sbjct: 161 QFIEKYKVSRPYRLSVGERESDKRSPEEMSSYLRLVDKHKRRRI---------------S 205 Query: 1505 NGSNIHSKSVSDGNNSIDDETC-FFPEMMFPLNCVPDSALPSINRLEDNQKVECYGVLDS 1329 + ++HS SV DG+NS DD+ FFPE MF LNCVPDSALP I R +DNQK+E +GVLDS Sbjct: 206 STPSMHSSSVLDGSNSTDDDDLSFFPETMFMLNCVPDSALPLIIRPQDNQKIEFHGVLDS 265 Query: 1328 LPHVMTRSPAMLERFGIRPEYLRIGLERSKYRGKNGLEGNGKVLGPEQASQMSQKVISRV 1149 LP TRS ++ER GI E S +R KNG EGN K++ EQASQM QKV++R+ Sbjct: 266 LPQ--TRSSVVIERLGISVEQ-----GGSLHRAKNGSEGNKKLISQEQASQMCQKVVARM 318 Query: 1148 LVSVGFEGTTEVPMEVLSQFLGCHICKLGRILKVLTDNYRKQCSAIEILRMFLQTLGYSN 969 L VGF+ TE+P+EVLSQ L CHI +LGR LK+L DNYRKQCSAI++L+MFLQT G++N Sbjct: 319 LARVGFDSATELPVEVLSQALRCHISELGRNLKILADNYRKQCSAIDLLKMFLQTAGFNN 378 Query: 968 LGNLAEHVKDGSRNFSHPTQQPHLRGLQSGFQSQHPSPNLQTQQIPR-XXXXXXXXXXXQ 792 LG L E VKDG+RN PTQQ + +QS Q+QH S QQIPR Q Sbjct: 379 LGGLMELVKDGTRNVVQPTQQ-QMHAIQSQLQAQHQSTLRLPQQIPRQMHPQMQQMVHPQ 437 Query: 791 NLAFPQQPQLERMRRRQPSTPRAGMTLDKDRPMVEVKIENTSELPIDSNAFNPIHARHPQ 612 NLAF QQ QLERMRRRQPSTPR M +DKDRPMV+VKIEN SELP+D NAFNP+H+RHPQ Sbjct: 438 NLAFQQQQQLERMRRRQPSTPRPAMDIDKDRPMVQVKIENPSELPMDGNAFNPMHSRHPQ 497 Query: 611 IQFRQQSMAAAMANLNPQSNHQFKQLASLQVPQLQTQNMGTTRAPPVKVEGFQELMGGDT 432 +QFRQQ + AA+++L QS++QF+QLAS+QVPQ+Q+ NMG RAPPVKVEGFQELMGGD Sbjct: 498 MQFRQQQL-AAISSLQAQSSNQFRQLASMQVPQVQSPNMGIVRAPPVKVEGFQELMGGDA 556 Query: 431 TLKHDSEEHKLTSPS 387 ++KHD EE+KLTSPS Sbjct: 557 SVKHDPEENKLTSPS 571 >ref|XP_004288527.1| PREDICTED: uncharacterized protein LOC101303161 [Fragaria vesca subsp. 
vesca] Length = 596 Score = 646 bits (1667), Expect = 0.0 Identities = 356/623 (57%), Positives = 428/623 (68%), Gaps = 11/623 (1%) Frame = -3 Query: 2225 MALLGEDGLGFELARKLETCGVWRSWLGDSNYSNFVHVLSSPLTWESFMKPQDSKTRAQI 2046 MALLG+DG G+ELA KLE+C VWR+WLGDS+YS FVH L+SP TW+SFM+ SK+RAQI Sbjct: 1 MALLGDDGRGYELACKLESCNVWRTWLGDSSYSTFVHFLTSPSTWDSFMRSDPSKSRAQI 60 Query: 2045 QLQLRVRALLFDKASVSLFLRXXXXXXXXXXXXXXXSNLNPNYLQLHGDDVYFSLEDFGS 1866 LQLR RALLFDKASVSLFLR NLNPNYLQLH DDVYFSLE+ Sbjct: 61 LLQLRARALLFDKASVSLFLRPDSASNSSAVS-----NLNPNYLQLHADDVYFSLEN--- 112 Query: 1865 SSQDGVQHQEGSTSSNSIQSKVVQIQPKLGFSVGSRYVEPEVDNISERPRPDDFPETWYK 1686 SS +GVQ Q+ S +IQ K F GSRY E E+DN S R + ++ PETWY Sbjct: 113 SSAEGVQAQQRDAS---------KIQSKTNFGFGSRYGESEIDNKSARFKNEELPETWYN 163 Query: 1685 QFIEKYSTSRQHRLPFGDREALKRTSEGMSTYLKLHERHKRSRQPFKEDQYTGFGNPIWE 1506 Q E++ SR HRL DRE+ +RT E M Y+KL +HK+ FKE+Q G+ NP+ E Sbjct: 164 QVSERHRVSRTHRLSSADRESERRTPEEMCAYIKLAMKHKKRCIAFKEEQPVGYRNPLLE 223 Query: 1505 NGSNIHSKSVSDGNNSIDDETCFFPEMMFPLNCVPDSALPSINRLEDNQKVECYGVLDSL 1326 N S + S DG+NS+D E FFPE MF NCVPDSALP +NR +D+QKVE GVLD+L Sbjct: 224 NASQ-NPHSGLDGSNSVDHEAPFFPETMFTFNCVPDSALPPMNREQDDQKVEFCGVLDTL 282 Query: 1325 PHVMTRSPAMLERFGIRPEYLRIGLERSKYRGKNGLEGNGKVLGPEQASQMSQKVISRVL 1146 P VMTRSP MLER GIRPEYL S RGKNG GN L EQA+Q+SQKVI+R+L Sbjct: 283 PQVMTRSPVMLERLGIRPEYL------SMDRGKNGSAGNKSCLTHEQAAQLSQKVIARIL 336 Query: 1145 VSVGFEGTTEVPMEVLSQFLGCHICKLGRILKVLTDNYRKQCSAIEILRMFLQTLGYSNL 966 +VGFEG++EVP+EV SQ L CHI KLG LKVLTD+YRKQCSAIE+L+MFLQT+GY N Sbjct: 337 TNVGFEGSSEVPIEVFSQLLSCHIRKLGSCLKVLTDSYRKQCSAIELLKMFLQTVGYRNF 396 Query: 965 GNLAEHVKDGSRNFSHPTQQPHLRGLQSGFQSQHPSPNLQTQQIPR-------XXXXXXX 807 G LA+ VKDGSR+ H Q + G+QS Q QH +P QQI R Sbjct: 397 GPLADQVKDGSRSV-HQQNQQQIHGMQSQLQPQHQNPIRLPQQISRQMLPQMQQIQQMQQ 455 Query: 806 XXXXQNLAFPQQPQLERMRRRQPSTPRAGMTLDKDRPMVEVKIENTSELPIDSNAFNPIH 627 +NL F QQ Q+ERMRRRQPSTPRAGM + ++RPMV+VKIE SELP+DSNAFN + Sbjct: 456 MAQSKNLPFQQQQQIERMRRRQPSTPRAGMDMVQERPMVQVKIEAPSELPMDSNAFNNFN 515 Query: 626 
ARHPQIQFRQQSMAA----AMANLNPQSNHQFKQLASLQVPQLQTQNMGTTRAPPVKVEG 459 R+PQ+QFRQQ + A M N+ QS +QF+Q Q+ Q+Q+QN G RA PVKVEG Sbjct: 516 NRNPQMQFRQQQIPAMSNPTMQNVPAQSGNQFRQ---TQIAQIQSQNAGVLRARPVKVEG 572 Query: 458 FQELMGGDTTLKHDSEEHKLTSP 390 F ELMGGD + KHDS+E++LTSP Sbjct: 573 FSELMGGDASSKHDSDENRLTSP 595 >gb|EXC35477.1| hypothetical protein L484_026784 [Morus notabilis] Length = 647 Score = 635 bits (1639), Expect = e-179 Identities = 357/662 (53%), Positives = 435/662 (65%), Gaps = 49/662 (7%) Frame = -3 Query: 2225 MALLGEDGLGFELARKLETCGVWRSWLGDSNYSNFVHVLSSPLTWESFMKPQDSKTRAQI 2046 MALLG+DG GFELARKLETCGVWR WLGDS Y NF L+SP TWE+FM+ +K+RAQI Sbjct: 1 MALLGDDGRGFELARKLETCGVWRKWLGDSCYGNFAPYLNSPTTWEAFMRVDGTKSRAQI 60 Query: 2045 QLQLRVRALLFDKASVSLFLRXXXXXXXXXXXXXXXSN--------LNPNYLQLHGDDVY 1890 LQLRVRALLFDKASVSLFLR ++ LNPNYL LHGDDVY Sbjct: 61 HLQLRVRALLFDKASVSLFLRSNPSSSSSSSSSSSSASRSSVAISKLNPNYLNLHGDDVY 120 Query: 1889 FSLEDFGSSSQDGVQHQEGSTSSNSIQSKVVQIQPKLGFSVGSRYVEPEVDNISERPRPD 1710 F+LE+ SS D SSN+ SK IQ K F VGS Y E E+DN+ + R D Sbjct: 121 FTLEN---SSSD--------VSSNTASSK---IQSKASFGVGSGYGESEIDNVHQMFRND 166 Query: 1709 DFPETWYKQFIEKYSTSRQHRLPFGDREALKRTSEGMSTYLKLHERHKRSRQPFKEDQYT 1530 PETWY QFIE Y TSR +RL GD+E KR+ E M Y+KL E+HK+ R +KEDQY Sbjct: 167 VLPETWYNQFIENYRTSRPYRLSLGDQEPDKRSPEEMCAYIKLLEKHKKRRVAYKEDQYM 226 Query: 1529 GFGNPIWENGSNIHSKSVSDGNNSIDDETCFFPEMMFPLNCVPDSALPSINRLEDNQKVE 1350 G+GNP+ EN S + S+SD NS DDE+ FFPE+MF LN VPDSAL NR+E+ +K+E Sbjct: 227 GYGNPVLENSSYMRPNSISDAINSDDDESTFFPEIMFTLNSVPDSALSVANRVEERRKIE 286 Query: 1349 CYGVLDSLPHVMTRSPAMLERFGIRPEYLRIGLERSKYRGKNGLEGNGKVLGPEQASQMS 1170 YGVLD LP VMT+SP M+ERFGI P +L + + + KNG N K LG EQA ++S Sbjct: 287 FYGVLDGLPRVMTKSPVMIERFGINP-FLGMEHGGNVHHVKNGSVVNKKCLGQEQALELS 345 Query: 1169 QKVISRVLVSVGFEGTTEVPMEVLSQFLGCHICKLGRILKVLTDNYRKQCSAIEILRMFL 990 QKVI+R+L S+GFEG+TEVP+EV SQ + CHI +LGRILKVL+D+YRKQC+A+E+L+MFL Sbjct: 346 QKVIARMLASIGFEGSTEVPVEVFSQLMSCHITELGRILKVLSDSYRKQCTAVELLKMFL 405 Query: 989 
QTLGYSNLGNLAEHVKDGSRNFSHPTQQPHLRGLQSGFQSQ---------------HP-- 861 Q L + G+L EHVKDGSR S Q + G+QS SQ HP Sbjct: 406 QRL-KCDFGSLVEHVKDGSRT-SVQQSQSQVHGIQSQMMSQAQAALRLQQQMSRQMHPQM 463 Query: 860 ---------------------SPNLQTQQIPRXXXXXXXXXXXQNLAFPQQPQLERMRRR 744 LQ QQ + Q L QQ QLERMRRR Sbjct: 464 QQFVHSQNMAFQQQQQQHHQQQQQLQQQQQQQLQQQQLQQQQQQQLQQQQQQQLERMRRR 523 Query: 743 QPSTPRAGMTLDKDRPMVEVKIENTSELPIDSNAFNPIHARHPQIQFRQQSMAAA---MA 573 QPSTPR+GM +DKDRP+V+VKIE SELP+DSN+ N + R Q+ +RQQ A + M+ Sbjct: 524 QPSTPRSGMDVDKDRPLVQVKIEQPSELPMDSNSLNNFNNRISQMHYRQQMAAMSNYTMS 583 Query: 572 NLNPQSNHQFKQLASLQVPQLQTQNMGTTRAPPVKVEGFQELMGGDTTLKHDSEEHKLTS 393 N++ QSN+QF+Q+AS Q+PQ+Q+QNMG RAPPVKVEGFQELMGGD KHDSEE++LTS Sbjct: 584 NVHGQSNNQFRQMASGQIPQMQSQNMGVVRAPPVKVEGFQELMGGDAASKHDSEENRLTS 643 Query: 392 PS 387 PS Sbjct: 644 PS 645 >ref|XP_004143440.1| PREDICTED: uncharacterized protein LOC101223185 [Cucumis sativus] gi|449499810|ref|XP_004160923.1| PREDICTED: uncharacterized protein LOC101224095 [Cucumis sativus] Length = 612 Score = 627 bits (1618), Expect = e-177 Identities = 337/627 (53%), Positives = 440/627 (70%), Gaps = 14/627 (2%) Frame = -3 Query: 2225 MALLGEDGLGFELARKLETCGVWRSWLGDSNYSNFVHVLSSPLTWESFMKPQDSKTRAQI 2046 MALLG+DG G+ELARKL+T GVW++WLGD +YS FV L+S TW++FM+ DSK+RAQI Sbjct: 1 MALLGDDGRGYELARKLDTLGVWQTWLGDLSYSIFVPFLASTSTWDTFMRTDDSKSRAQI 60 Query: 2045 QLQLRVRALLFDKASVSLFLRXXXXXXXXXXXXXXXSN--------LNPNYLQLHGDDVY 1890 QLQLR RALLFDKASVSLFLR + L+PNYLQLHGDDVY Sbjct: 61 QLQLRARALLFDKASVSLFLRSTPSPSSPSYSTGNPLSSSSLAISKLSPNYLQLHGDDVY 120 Query: 1889 FSLEDFGSSSQDGVQHQEGSTSSNSIQSKVVQIQPKLGFSVGSRYVEPEVDNISERPRPD 1710 F+LE+ SS+DGVQ +EG SSN K IQPK + G R E ++ + S+R + + Sbjct: 121 FTLEN---SSKDGVQQREGHVSSNKASGK---IQPKAASTAGPRSRESDIGDSSQRLK-N 173 Query: 1709 DFPETWYKQFIEKYSTSRQHRLPFGDREALKRTSEGMSTYLKLHERHKRSRQPFKEDQYT 1530 + PETWY QFIEKY + +RL G+ A KRTSE MS+YL+L E+HK+ R FK+D T Sbjct: 174 ELPETWYSQFIEKYRVKQPYRLSHGNNVAEKRTSEEMSSYLRLLEKHKKRRMVFKDDLLT 233 Query: 1529 
GFGNPIWENGSNIHSKSVSDGNNSIDDETCFFPEMMFPLNCVPDSALPSINRLEDNQKVE 1350 FGN + N S+ SV D +NS++D+ FFPE+MF NCVP+SALP + ++DN++ E Sbjct: 234 NFGNSVSANASS----SVFDFSNSVEDDANFFPEIMFTFNCVPESALPPPDDMKDNRRPE 289 Query: 1349 CYGVLDSLPHVMTRSPAMLERFGIRPEYLRIGLERSKYRGKNGLEGNGKVLGPEQASQMS 1170 GV+D+LP +TR+ AM+ER G++P+Y+ + +R K+G GN K LG EQ+ QMS Sbjct: 290 VPGVIDTLPQPITRNSAMMERLGVKPDYVSTERGVNVHRAKSGSGGNRKSLGQEQSFQMS 349 Query: 1169 QKVISRVLVSVGFEGTTEVPMEVLSQFLGCHICKLGRILKVLTDNYRKQCSAIEILRMFL 990 QKV++R+L+S+GFEG TEVP+EV SQFL CHICKLG L+VL D+YRKQCSA+++LRMFL Sbjct: 350 QKVVARMLMSLGFEGATEVPLEVFSQFLSCHICKLGSTLRVLADSYRKQCSAVDLLRMFL 409 Query: 989 QTLGYSNLGNLAEHVKDGSRNFSHPTQQPHLRGLQSGFQSQHPSPNLQTQQIPRXXXXXX 810 +T+GYSN G LA+ VKDGSRN+ +Q G+Q Q+QH + QQ+PR Sbjct: 410 KTMGYSNFGPLADIVKDGSRNY---VRQSMHHGVQPQLQAQHQTLLQVPQQVPR-QMHPQ 465 Query: 809 XXXXXQNLAFPQQPQ------LERMRRRQPSTPRAGMTLDKDRPMVEVKIENTSELPIDS 648 + AF QQ Q LE+MRRRQ +TPRA M +KDRP+++VK+ENT ELP+D Sbjct: 466 MQQMVNSQAFQQQQQQQQQFVLEKMRRRQAATPRAVMEANKDRPLLQVKVENT-ELPMDG 524 Query: 647 NAFNPIHARHPQIQFRQQSMAAAMANLNPQSNHQFKQLASLQVPQLQTQNMGTTRAPPVK 468 NA N ++ RHPQ+QFRQQ + AAM+N++ +QF+Q+ S+Q+PQ+QT N RAPPVK Sbjct: 525 NALNALNIRHPQLQFRQQQI-AAMSNIHASPGNQFRQIPSMQMPQIQTPNTNVVRAPPVK 583 Query: 467 VEGFQELMGGDTTLKHDSEEHKLTSPS 387 VEGFQELMGGDT+ KHDSEE +LTSPS Sbjct: 584 VEGFQELMGGDTSSKHDSEEARLTSPS 610 >ref|XP_002310863.2| hypothetical protein POPTR_0007s14190g [Populus trichocarpa] gi|550334854|gb|EEE91313.2| hypothetical protein POPTR_0007s14190g [Populus trichocarpa] Length = 577 Score = 608 bits (1568), Expect = e-171 Identities = 341/615 (55%), Positives = 421/615 (68%), Gaps = 2/615 (0%) Frame = -3 Query: 2225 MALLGEDGLGFELARKLETCGVWRSWLGDSNYSNFVHVLSSPLTWESFMKPQDSKTRAQI 2046 M++LG+DGLG++LARKLET G+WR+WLGDS YSNF+H LSSP +W+SFM+ DSK+++ Sbjct: 1 MSVLGDDGLGYDLARKLETLGMWRAWLGDSLYSNFLHSLSSPASWQSFMRTDDSKSKSHF 60 Query: 2045 QLQLRVRALLFDKASVSLFLRXXXXXXXXXXXXXXXSNLNPNYLQLHGDDVYFSLEDFGS 1866 QLQLR RALLFDKASVSLFLR NLNPNYLQLHGDDVYF+LED Sbjct: 61 
QLQLRARALLFDKASVSLFLRSNTVAAVS--------NLNPNYLQLHGDDVYFTLEDEDQ 112 Query: 1865 SSQDGVQHQEGSTSSNSIQSKVVQIQPKLGFSVGSRYVEPEVDNISERPRPDDFPETWYK 1686 + G G+T+ ++ +L F V S +V + +R + ++ PETWY Sbjct: 113 RREGG---GVGATT---------KVCSRLSFRV-SNFV---LYICCQRYKNEELPETWYT 156 Query: 1685 QFIEKYSTSRQHRLPFGDREALKRTSEGMSTYLKLHERHKRSRQPFKEDQYTGFGNPIWE 1506 QF+EK R +RL FGDRE+ KR+ E MSTY +L RHKR QY G GN E Sbjct: 157 QFMEKRKLKRPYRLSFGDRESDKRSPEQMSTYFRLVARHKR------RCQYLGSGNSNLE 210 Query: 1505 NGSNIHSKSVSDGNNSIDDETCFFPEMMFPLNCVPDSALPSINRLEDNQKVECYGVLDSL 1326 + SN+ S SV DG++S+DD+ FFPE MF NCVPDSA+P I R DNQK+E G DSL Sbjct: 211 STSNMRSGSVLDGSHSVDDDFVFFPETMFMFNCVPDSAIPPIIRARDNQKIEFRGAFDSL 270 Query: 1325 PHVMTRSPAMLERFGIRPEYLRIGLERSKYRGKNGLEGNGKVLGPEQASQMSQKVISRVL 1146 P TR+P M+ER GI E S RGKNG EG+ K L EQA QMSQKV++ +L Sbjct: 271 PQ--TRNPVMIERLGISVEQ-----GGSLNRGKNGSEGHKK-LSEEQALQMSQKVVACLL 322 Query: 1145 VSVGFEGTTEVPMEVLSQFLGCHICKLGRILKVLTDNYRKQCSAIEILRMFLQTLGYSNL 966 VGF+G +E+PMEV SQ L CHI KLGRIL+VL D+YRKQCSA+E+L+MFLQT G+SNL Sbjct: 323 TRVGFDGASEIPMEVFSQLLRCHISKLGRILRVLADSYRKQCSAVELLKMFLQTAGFSNL 382 Query: 965 GNLAEHVKDGSRNFSHPTQQPHLRGLQSGFQSQHPSPNLQTQQIPR-XXXXXXXXXXXQN 789 +L + VK+G+RN + PT Q G+QS F SQH + QQIPR QN Sbjct: 383 VHLMKIVKEGARNTAEPTHQ-QAHGIQSQFHSQHQNLLRLPQQIPRQMHPQMQPMVHSQN 441 Query: 788 LAFPQQPQ-LERMRRRQPSTPRAGMTLDKDRPMVEVKIENTSELPIDSNAFNPIHARHPQ 612 L F QQ Q ER+RRR STPR GM +DKD+P+V+VK+EN ELP+D+NA N H+R PQ Sbjct: 442 LTFQQQQQHFERLRRRHTSTPRPGMDVDKDKPLVQVKVENPPELPLDNNAVNAFHSRQPQ 501 Query: 611 IQFRQQSMAAAMANLNPQSNHQFKQLASLQVPQLQTQNMGTTRAPPVKVEGFQELMGGDT 432 +Q R Q + AAM+NL+ Q N+Q +QLASLQVPQ+QT NMG RAPPVKVEGFQELMGGD Sbjct: 502 MQMRHQQI-AAMSNLHAQPNNQLRQLASLQVPQMQTSNMGMVRAPPVKVEGFQELMGGDA 560 Query: 431 TLKHDSEEHKLTSPS 387 LKHD+EE+KLTSPS Sbjct: 561 ALKHDTEENKLTSPS 575 >ref|XP_006380803.1| hypothetical protein POPTR_0007s14190g [Populus trichocarpa] gi|550334853|gb|ERP58600.1| hypothetical protein POPTR_0007s14190g [Populus trichocarpa] Length = 558 Score = 603 bits (1556), Expect = 
e-170 Identities = 337/615 (54%), Positives = 413/615 (67%), Gaps = 2/615 (0%) Frame = -3 Query: 2225 MALLGEDGLGFELARKLETCGVWRSWLGDSNYSNFVHVLSSPLTWESFMKPQDSKTRAQI 2046 M++LG+DGLG++LARKLET G+WR+WLGDS YSNF+H LSSP +W+SFM+ DSK+++ Sbjct: 1 MSVLGDDGLGYDLARKLETLGMWRAWLGDSLYSNFLHSLSSPASWQSFMRTDDSKSKSHF 60 Query: 2045 QLQLRVRALLFDKASVSLFLRXXXXXXXXXXXXXXXSNLNPNYLQLHGDDVYFSLEDFGS 1866 QLQLR RALLFDKASVSLFLR NLNPNYLQLHGDDVYF+LED Sbjct: 61 QLQLRARALLFDKASVSLFLRSNTVAAVS--------NLNPNYLQLHGDDVYFTLED--- 109 Query: 1865 SSQDGVQHQEGSTSSNSIQSKVVQIQPKLGFSVGSRYVEPEVDNISERPRPDDFPETWYK 1686 + Q + G VG+ ++R + ++ PETWY Sbjct: 110 -----------------------EDQRREGGGVGAT---------TKRYKNEELPETWYT 137 Query: 1685 QFIEKYSTSRQHRLPFGDREALKRTSEGMSTYLKLHERHKRSRQPFKEDQYTGFGNPIWE 1506 QF+EK R +RL FGDRE+ KR+ E MSTY +L RHKR QY G GN E Sbjct: 138 QFMEKRKLKRPYRLSFGDRESDKRSPEQMSTYFRLVARHKR------RCQYLGSGNSNLE 191 Query: 1505 NGSNIHSKSVSDGNNSIDDETCFFPEMMFPLNCVPDSALPSINRLEDNQKVECYGVLDSL 1326 + SN+ S SV DG++S+DD+ FFPE MF NCVPDSA+P I R DNQK+E G DSL Sbjct: 192 STSNMRSGSVLDGSHSVDDDFVFFPETMFMFNCVPDSAIPPIIRARDNQKIEFRGAFDSL 251 Query: 1325 PHVMTRSPAMLERFGIRPEYLRIGLERSKYRGKNGLEGNGKVLGPEQASQMSQKVISRVL 1146 P TR+P M+ER GI E S RGKNG EG+ K L EQA QMSQKV++ +L Sbjct: 252 PQ--TRNPVMIERLGISVEQ-----GGSLNRGKNGSEGHKK-LSEEQALQMSQKVVACLL 303 Query: 1145 VSVGFEGTTEVPMEVLSQFLGCHICKLGRILKVLTDNYRKQCSAIEILRMFLQTLGYSNL 966 VGF+G +E+PMEV SQ L CHI KLGRIL+VL D+YRKQCSA+E+L+MFLQT G+SNL Sbjct: 304 TRVGFDGASEIPMEVFSQLLRCHISKLGRILRVLADSYRKQCSAVELLKMFLQTAGFSNL 363 Query: 965 GNLAEHVKDGSRNFSHPTQQPHLRGLQSGFQSQHPSPNLQTQQIPR-XXXXXXXXXXXQN 789 +L + VK+G+RN + PT Q G+QS F SQH + QQIPR QN Sbjct: 364 VHLMKIVKEGARNTAEPTHQ-QAHGIQSQFHSQHQNLLRLPQQIPRQMHPQMQPMVHSQN 422 Query: 788 LAFPQQPQ-LERMRRRQPSTPRAGMTLDKDRPMVEVKIENTSELPIDSNAFNPIHARHPQ 612 L F QQ Q ER+RRR STPR GM +DKD+P+V+VK+EN ELP+D+NA N H+R PQ Sbjct: 423 LTFQQQQQHFERLRRRHTSTPRPGMDVDKDKPLVQVKVENPPELPLDNNAVNAFHSRQPQ 482 Query: 611 IQFRQQSMAAAMANLNPQSNHQFKQLASLQVPQLQTQNMGTTRAPPVKVEGFQELMGGDT 432 +Q R Q + AAM+NL+ 
Q N+Q +QLASLQVPQ+QT NMG RAPPVKVEGFQELMGGD Sbjct: 483 MQMRHQQI-AAMSNLHAQPNNQLRQLASLQVPQMQTSNMGMVRAPPVKVEGFQELMGGDA 541 Query: 431 TLKHDSEEHKLTSPS 387 LKHD+EE+KLTSPS Sbjct: 542 ALKHDTEENKLTSPS 556 >ref|XP_006466330.1| PREDICTED: uncharacterized protein LOC102616625 isoform X2 [Citrus sinensis] Length = 610 Score = 586 bits (1510), Expect = e-164 Identities = 335/659 (50%), Positives = 409/659 (62%), Gaps = 47/659 (7%) Frame = -3 Query: 2225 MALLGEDGLGFELARKLETCGVWRSWLGDSNYSNFVHVLSSPLTWESFMKPQDSKTRAQI 2046 MALLG+DG G+ELA KLE+CGVWR+WLGDS YS F H LS+P +WESFM+ DSK+RAQI Sbjct: 1 MALLGDDGRGYELALKLESCGVWRTWLGDSCYSTFHHALSTPASWESFMRTDDSKSRAQI 60 Query: 2045 QLQLRVRALLFDKASVSLFLRXXXXXXXXXXXXXXXSNLNPNYLQLHGDDVYFSLEDFGS 1866 LQLR RALLFDKA++SLFL S LNPNYLQL G DVYF+LE S Sbjct: 61 HLQLRARALLFDKATISLFL-----PSNQPPSSVAVSKLNPNYLQLDGGDVYFTLE---S 112 Query: 1865 SSQDGVQHQEGSTSSNSIQSKVVQIQPKLGFSVGSRYVEPEVDNISERPRPDDFPETWYK 1686 SSQDGVQH+E S +S++ K R R ++ PETWY Sbjct: 113 SSQDGVQHRESSAASSTTSGK--------------------------RFRNEELPETWYD 146 Query: 1685 QFIEKYSTSRQHRLPFGDREALKRTSEGMSTYLKLHERHKRSRQPFKEDQYTGFGNPIWE 1506 QFIEKY SRQ++L GDRE +RT+EGMS+YL+ E++KR R PF+ D Sbjct: 147 QFIEKYRVSRQYKLSLGDRELDRRTAEGMSSYLRHLEKYKRRRVPFQND----------- 195 Query: 1505 NGSNIHSKSVSDGNNSIDDETCFFPEMMFPLNCVPDSALPSINRLEDNQKVECYGVLDSL 1326 HS S D NS D + FFPE MF LN VP+ A+P I E Q +E GVLD+L Sbjct: 196 -----HSNSALDVINSTDSDV-FFPETMFTLNSVPEIAVPQIIVEETKQNIEFNGVLDTL 249 Query: 1325 PHVMTRSPAMLERFGIRPEYLRIGLERSKYRGKNGLEGNGKVLGPEQASQMSQKVISRVL 1146 P MT+SP M+ER GIRPEYL + E + + G + LEGN K EQASQ+SQKVI+R+L Sbjct: 250 PQCMTKSPVMIERLGIRPEYLGMEQEGNSHHGNSALEGNKKCFSEEQASQISQKVIARML 309 Query: 1145 VSVGFEGTTEVPMEVLSQFLGCHICKLGRILKVLTDNYRKQCSAIEILRMFLQTLGYSNL 966 GFEG TEVP+EVLS+ LG HICKLGRILKVL+DNYRKQCSA+E+L+MFLQ G+SNL Sbjct: 310 TGGGFEGATEVPLEVLSEMLGSHICKLGRILKVLSDNYRKQCSALELLKMFLQAAGHSNL 369 Query: 965 GNLAEHVKDGSRNFSHPTQQ---------------------------------------- 906 G LAE +KDG+RN +Q+ Sbjct: 370 
GILAELIKDGTRNVVQQSQELIKDGSRNIVQQSQELIKDGTRNIVQQNQELVKEGTRNFV 429 Query: 905 ----PHLRGLQSGFQSQHPSPNLQTQQIPR-XXXXXXXXXXXQNLAFP--QQPQLERMRR 747 + G QS QS SP QQ+PR QNLAF QQ LER R Sbjct: 430 QQSPQQVHGAQSQLQSHQQSPVKLPQQVPRQMHQQMQQMVQPQNLAFQQMQQQHLERSRM 489 Query: 746 RQPSTPRAGMTLDKDRPMVEVKIENTSELPIDSNAFNPIHARHPQIQFRQQSMAAAMANL 567 RQPSTPR GM +DKDR M +V EN+S+LP+D+NA N +A+ Q+QF QQ + M+NL Sbjct: 490 RQPSTPRPGMDMDKDRSMSQVNAENSSKLPMDANALNASNAKQSQMQFHQQQL-NTMSNL 548 Query: 566 NPQSNHQFKQLASLQVPQLQTQNMGTTRAPPVKVEGFQELMGGDTTLKHDSEEHKLTSP 390 QS++QFKQ +Q+PQ+ + NMG RAPPVKV+GFQELMGGD ++KHDSEE+KLTSP Sbjct: 549 QAQSSNQFKQSTPVQIPQMHSPNMGVVRAPPVKVDGFQELMGGDASMKHDSEENKLTSP 607 >ref|XP_006466329.1| PREDICTED: uncharacterized protein LOC102616625 isoform X1 [Citrus sinensis] Length = 612 Score = 581 bits (1497), Expect = e-163 Identities = 335/661 (50%), Positives = 409/661 (61%), Gaps = 49/661 (7%) Frame = -3 Query: 2225 MALLGEDGLGFELARKLETCGVWRSWLGDSNYSNFVHVLSSPLTWESFMKPQDSKTRAQI 2046 MALLG+DG G+ELA KLE+CGVWR+WLGDS YS F H LS+P +WESFM+ DSK+RAQI Sbjct: 1 MALLGDDGRGYELALKLESCGVWRTWLGDSCYSTFHHALSTPASWESFMRTDDSKSRAQI 60 Query: 2045 QLQLRVRALLFDKASVSLFLRXXXXXXXXXXXXXXXSNLNPNYLQLHGDDVYFSLEDFGS 1866 LQLR RALLFDKA++SLFL S LNPNYLQL G DVYF+LE S Sbjct: 61 HLQLRARALLFDKATISLFL-----PSNQPPSSVAVSKLNPNYLQLDGGDVYFTLE---S 112 Query: 1865 SSQDGVQHQEGSTSSNSIQSKVVQIQPKLGFSVGSRYVEPEVDNISERPRPDDFPETWYK 1686 SSQDGVQH+E S +S++ K R R ++ PETWY Sbjct: 113 SSQDGVQHRESSAASSTTSGK--------------------------RFRNEELPETWYD 146 Query: 1685 QFIEKYSTSRQHRLPFGDREALKRTSEGMSTYLKLHERHKRSRQPFKEDQYTGFGNPIWE 1506 QFIEKY SRQ++L GDRE +RT+EGMS+YL+ E++KR R PF+ D Sbjct: 147 QFIEKYRVSRQYKLSLGDRELDRRTAEGMSSYLRHLEKYKRRRVPFQND----------- 195 Query: 1505 NGSNIHSKSVSDGNNSIDDETCFFPEMMFPLNCVPDSALPSINRLEDNQKVECYGVLDSL 1326 HS S D NS D + FFPE MF LN VP+ A+P I E Q +E GVLD+L Sbjct: 196 -----HSNSALDVINSTDSDV-FFPETMFTLNSVPEIAVPQIIVEETKQNIEFNGVLDTL 249 Query: 1325 PHVMTRSPAMLERFGIRPEYLRIGLERSKYRGKNGLEGNGKVLGPEQASQMSQKVISRVL 1146 P MT+SP M+ER GIRPEYL + E 
+ + G + LEGN K EQASQ+SQKVI+R+L Sbjct: 250 PQCMTKSPVMIERLGIRPEYLGMEQEGNSHHGNSALEGNKKCFSEEQASQISQKVIARML 309 Query: 1145 VSVGFEGTTEVPMEVLSQFLGCHICKLGRILKVLTDNYRKQCSAIEILRMFLQTLGYSNL 966 GFEG TEVP+EVLS+ LG HICKLGRILKVL+DNYRKQCSA+E+L+MFLQ G+SNL Sbjct: 310 TGGGFEGATEVPLEVLSEMLGSHICKLGRILKVLSDNYRKQCSALELLKMFLQAAGHSNL 369 Query: 965 GNLAEHVKDGSRNFSHPTQQ---------------------------------------- 906 G LAE +KDG+RN +Q+ Sbjct: 370 GILAELIKDGTRNVVQQSQELIKDGSRNIVQQSQELIKDGTRNIVQQNQELVKEGTRNFV 429 Query: 905 ----PHLRGLQSGFQSQHPSPNL--QTQQIPR-XXXXXXXXXXXQNLAFP--QQPQLERM 753 + G QS QS SP Q Q+PR QNLAF QQ LER Sbjct: 430 QQSPQQVHGAQSQLQSHQQSPVKLPQQLQVPRQMHQQMQQMVQPQNLAFQQMQQQHLERS 489 Query: 752 RRRQPSTPRAGMTLDKDRPMVEVKIENTSELPIDSNAFNPIHARHPQIQFRQQSMAAAMA 573 R RQPSTPR GM +DKDR M +V EN+S+LP+D+NA N +A+ Q+QF QQ + M+ Sbjct: 490 RMRQPSTPRPGMDMDKDRSMSQVNAENSSKLPMDANALNASNAKQSQMQFHQQQL-NTMS 548 Query: 572 NLNPQSNHQFKQLASLQVPQLQTQNMGTTRAPPVKVEGFQELMGGDTTLKHDSEEHKLTS 393 NL QS++QFKQ +Q+PQ+ + NMG RAPPVKV+GFQELMGGD ++KHDSEE+KLTS Sbjct: 549 NLQAQSSNQFKQSTPVQIPQMHSPNMGVVRAPPVKVDGFQELMGGDASMKHDSEENKLTS 608 Query: 392 P 390 P Sbjct: 609 P 609 >ref|NP_201357.2| uncharacterized protein [Arabidopsis thaliana] gi|26451238|dbj|BAC42721.1| unknown protein [Arabidopsis thaliana] gi|28973345|gb|AAO63997.1| unknown protein [Arabidopsis thaliana] gi|332010686|gb|AED98069.1| uncharacterized protein AT5G65540 [Arabidopsis thaliana] Length = 605 Score = 553 bits (1425), Expect = e-154 Identities = 311/629 (49%), Positives = 416/629 (66%), Gaps = 15/629 (2%) Frame = -3 Query: 2225 MALLGEDGLGFELARKLETCGVWRSWLGDSNYSNFVHVLSSPLTWESFMKPQDSKTRAQI 2046 MALLG+DG GF+LARKLE GVWR+WLGDS YS+F H LSSP TWE+FM+ +SK+RAQI Sbjct: 1 MALLGDDGRGFDLARKLEVSGVWRTWLGDSIYSSFHHYLSSPSTWEAFMRVDESKSRAQI 60 Query: 2045 QLQLRVRALLFDKASVSLFLR-------XXXXXXXXXXXXXXXSNLNPNYLQLHGDDVYF 1887 QLQLRVRALLFDKA+VSLFLR S LNPNYLQLHGDDVY+ Sbjct: 61 QLQLRVRALLFDKATVSLFLRSNTIAASSSSSASISDVSSVAVSKLNPNYLQLHGDDVYY 120 Query: 1886 
SLEDFGSSSQDGVQHQEGSTSSNSIQSKVVQIQPKLGFSVGSRYVEPEVDNISERPRPDD 1707 +LE+ +S + G Q + G + S+ + K F+ G+R E + N+S+R R ++ Sbjct: 121 TLEN--ASLESGFQREGGIRHNPSLTKSL----SKPSFTSGTRGSESDFSNLSQRSRFEE 174 Query: 1706 FPETWYKQFIEKYSTSRQHRLPFGDREALKRTSEGMSTYLKLHERHKRSRQPFKEDQYTG 1527 P+TWY QFI +Y ++ + G +E+ KRT EGMSTYL++ + HKR R PF ED+ Sbjct: 175 LPDTWYTQFISRYGF--KYGMSVGGQESDKRTPEGMSTYLRVVDTHKRKRAPFLEDRSLA 232 Query: 1526 FGNPIWENGSNIHSKSVSDGNNSIDDETCFFPEMMFPLNCVPDSALPSINRLEDNQKVEC 1347 + S+ H S DG+ S +D+ F PE MF +NCVP++AL I R +DN K E Sbjct: 233 H-----MSRSSTHPSSGFDGSTS-EDDILFLPETMFRMNCVPETALSPITRTQDNLKTEF 286 Query: 1346 YGVLDSLPHVMTRSPAMLERFGIRPEYLRI---GLERSKYRGKNGLEGNGKVLGPEQASQ 1176 YGVLD+LP V TRS M+ER G+ PEY R+ G+ RS+ K G +QA+ Sbjct: 287 YGVLDTLPQVTTRSHIMIERLGLMPEYHRMEERGVLRSRKAEKMG-------FSDDQAAL 339 Query: 1175 MSQKVISRVLVSVGFEGTTEVPMEVLSQFLGCHICKLGRILKVLTDNYRKQCSAIEILRM 996 +S+KV++R+L+++GFEG TEVP++V SQ + H+ KLGRILK+LTD+Y+K+CSA+++++M Sbjct: 340 VSRKVVARMLLTMGFEGATEVPIDVFSQLVSRHMSKLGRILKLLTDSYKKECSAMQLIKM 399 Query: 995 FLQTLGYSNLGNLAEHVKDGSRNFSHPTQ-QPHLRGLQSGFQSQHPSPNLQTQQIPRXXX 819 FL T GYSNLG+LAE VKDG+RN P Q QP + LQ Q + QQI R Sbjct: 400 FLNTTGYSNLGSLAEIVKDGTRNHPPPNQKQPQV--LQQQLHLQQQASLRLPQQIQRQMH 457 Query: 818 XXXXXXXXQNLAFPQQPQLERMRRRQPSTPRAGMTLDKDRPMVEVKIENTSELPIDSNAF 639 F QQ QLERMRRR ++PR M ++KDRP+V+VK+EN SE+ +D NAF Sbjct: 458 PQMQQMVNPQ-NFQQQQQLERMRRRPVTSPRPNMDMEKDRPLVQVKLENPSEMAVDGNAF 516 Query: 638 NPIHARHP---QIQFRQQSMAAAMANLNPQSNH-QFKQLASLQVPQLQTQNMGTTRAPPV 471 NP++ RH Q Q RQQ AAM+N+ Q + QF+QLAS+Q+PQ+QT +GT RA PV Sbjct: 517 NPMNPRHQQQLQQQLRQQQQIAAMSNMQQQPGYNQFRQLASMQIPQMQTPTLGTVRAQPV 576 Query: 470 KVEGFQELMGGDTTLKHDSEEHKLTSPSK 384 KVEGF++LMGGD++LKHDS++ + P+K Sbjct: 577 KVEGFEQLMGGDSSLKHDSDDKLRSPPTK 605 >ref|XP_006394014.1| hypothetical protein EUTSA_v10003865mg [Eutrema salsugineum] gi|557090653|gb|ESQ31300.1| hypothetical protein EUTSA_v10003865mg [Eutrema salsugineum] Length = 598 Score = 549 bits (1415), Expect = e-153 Identities = 309/622 (49%), Positives = 
416/622 (66%), Gaps = 8/622 (1%) Frame = -3 Query: 2225 MALLGEDGLGFELARKLETCGVWRSWLGDSNYSNFVHVLSSPLTWESFMKPQDSKTRAQI 2046 MALLG+DG GF+LAR+LE GVWR+WLGDS Y +F H LSSP +WESFM+ DSK+RAQI Sbjct: 1 MALLGDDGRGFDLARRLEVSGVWRTWLGDSTYLSFHHYLSSPSSWESFMRVDDSKSRAQI 60 Query: 2045 QLQLRVRALLFDKASVSLFLR--XXXXXXXXXXXXXXXSNLNPNYLQLHGDDVYFSLEDF 1872 QLQLRVRALLFDKA+VSLFLR S LNPNYLQLHGDDVY++LE Sbjct: 61 QLQLRVRALLFDKATVSLFLRSNTIPASPSSDASSVAVSKLNPNYLQLHGDDVYYTLE-- 118 Query: 1871 GSSSQDGVQHQEGSTSSNSIQSKVVQIQPKLGFSVGSRYVEPEVDNISERPRPDDFPETW 1692 ++S +G ++G+ N K + K F+ G+R E + N+S+R R ++ P+TW Sbjct: 119 -NASLEGGFQRDGAIRHNPSLPKSLS---KPSFASGARGSESDFSNLSQRSRFEELPDTW 174 Query: 1691 YKQFIEKYSTSRQHRLPFGDREALKRTSEGMSTYLKLHERHKRSRQPFKEDQYTGFGNPI 1512 Y QFI +Y ++ + G +E+ KRT EGMSTYL++ + HKR R PF +D + Sbjct: 175 YTQFISRYGF--KYGMSVGGQESDKRTPEGMSTYLRVVDSHKRKRAPFLQDPSP--ASSA 230 Query: 1511 WENGSNIHSKSVSDGNNSIDDETCFFPEMMFPLNCVPDSALPSINRLEDNQKVECYGVLD 1332 + S+ H S DG+ S +D+ F PE MF +NCVP++AL + R DN K E YGVLD Sbjct: 231 HMSRSSTHPSSGFDGSTS-EDDILFLPETMFRMNCVPETALSPVARTHDNLKTEFYGVLD 289 Query: 1331 SLPHVMTRSPAMLERFGIRPEYLRI---GLERSKYRGKNGLEGNGKVLGPEQASQMSQKV 1161 +LP V TR+ M+ER G+ PEY R+ G+ R K K G EQA+Q+S+KV Sbjct: 290 TLPQVTTRNHVMIERLGMVPEYFRMEERGVLRRKKAEKLG-------FSDEQAAQVSRKV 342 Query: 1160 ISRVLVSVGFEGTTEVPMEVLSQFLGCHICKLGRILKVLTDNYRKQCSAIEILRMFLQTL 981 ++R+L+++G EG TEVP++V SQ + HICKLGRILK+LTD+Y+K+CSAI++++MFL T Sbjct: 343 VARILLTMGCEGATEVPIDVFSQLVSRHICKLGRILKLLTDSYKKECSAIQLIKMFLNTT 402 Query: 980 GYSNLGNLAEHVKDGSRNFSHPTQ-QPHLRGLQSGFQSQHPSPNLQTQQIPRXXXXXXXX 804 GYSNLG+LAE VKDG+RN HP Q Q + LQ Q +P QQ+ R Sbjct: 403 GYSNLGDLAELVKDGTRN--HPPQNQKQPQVLQQQLHLQQQNPLRLPQQMQRQMHPQMQQ 460 Query: 803 XXXQNLAFPQQPQLERMRRRQPSTPRAGMTLDKDRPMVEVKIENTSELPIDSNAFNPIHA 624 + F QQ Q+ERMRRRQ ++PR + ++KDRP+V+VK+EN SE+ +D NAFNP++ Sbjct: 461 MVNPH-TFQQQQQMERMRRRQVTSPRPNIDMEKDRPLVQVKLENPSEMAVDGNAFNPMNP 519 Query: 623 RHPQIQFRQQSMAAAMANLNPQSNH-QFKQLASLQVPQLQTQN-MGTTRAPPVKVEGFQE 450 RH QI +Q AAM+NL Q + QF+QLAS+Q+PQ+QT N GT RA 
PVKVEGF++ Sbjct: 520 RHQQI---RQQQIAAMSNLQQQPGYNQFRQLASMQIPQMQTPNTTGTVRAQPVKVEGFEQ 576 Query: 449 LMGGDTTLKHDSEEHKLTSPSK 384 LMGGD++LKH+S++ + P+K Sbjct: 577 LMGGDSSLKHESDDKLRSPPTK 598 >ref|XP_006280200.1| hypothetical protein CARUB_v10026105mg [Capsella rubella] gi|482548904|gb|EOA13098.1| hypothetical protein CARUB_v10026105mg [Capsella rubella] Length = 606 Score = 544 bits (1402), Expect = e-152 Identities = 305/627 (48%), Positives = 416/627 (66%), Gaps = 13/627 (2%) Frame = -3 Query: 2225 MALLGEDGLGFELARKLETCGVWRSWLGDSNYSNFVHVLSSPLTWESFMKPQDSKTRAQI 2046 MALLG+DG GF+LAR+LE GVWR+WLGDS YS+F H LSSP TWE+FM+ +SK R+QI Sbjct: 1 MALLGDDGRGFDLARRLEVSGVWRTWLGDSIYSSFHHYLSSPSTWEAFMRVDESKPRSQI 60 Query: 2045 QLQLRVRALLFDKASVSLFLRXXXXXXXXXXXXXXXSN------LNPNYLQLHGDDVYFS 1884 QLQLRVRALLFDKA+VSLFLR + LNPNYLQLHGDDVY++ Sbjct: 61 QLQLRVRALLFDKATVSLFLRSNSIAASSSSTSVSDVSSVAVSKLNPNYLQLHGDDVYYT 120 Query: 1883 LEDFGSSSQDGVQHQEGSTSSNSIQSKVVQIQPKLGFSVGSRYVEPEVDNISERPRPDDF 1704 LE+ +S + G Q G + S+ + K F+ G+R E + N+S+R R ++ Sbjct: 121 LEN--ASLEGGFQRDGGIRLNPSLTKSL----SKPSFTSGTRGSESDFSNLSQRSRFEEL 174 Query: 1703 PETWYKQFIEKYSTSRQHRLPFGDREALKRTSEGMSTYLKLHERHKRSRQPFKEDQYTGF 1524 P+TWY QFI +Y ++ + G +E+ KRT EGMSTYL++ + HKR R PF ED+ + Sbjct: 175 PDTWYTQFISRYGF--KYGMSVGGQESDKRTPEGMSTYLRVVDTHKRKRAPFLEDRNS-- 230 Query: 1523 GNPIWENGSNIHSKSVSDGNNSIDDETCFFPEMMFPLNCVPDSALPSINRLEDNQKVECY 1344 G+ + S+ H S DG++S +D+ F PE MF +NCVP++ALP I R +DN K E Y Sbjct: 231 GSSAHMSRSSTHPSSGFDGSSS-EDDILFLPETMFRMNCVPETALPPITRTQDNLKTEFY 289 Query: 1343 GVLDSLPHVMTRSPAMLERFGIRPEYLRI---GLERSKYRGKNGLEGNGKVLGPEQASQM 1173 GVLD+LP V TRS M+ER G+ PEY R+ G+ R + K G +QA+Q+ Sbjct: 290 GVLDTLPQVTTRSHVMIERLGVMPEYHRMEERGVLRRRKAEKLG-------FSDDQAAQV 342 Query: 1172 SQKVISRVLVSVGFEGTTEVPMEVLSQFLGCHICKLGRILKVLTDNYRKQCSAIEILRMF 993 S+KV++R+L+++GFEG TEVP++V SQ + HI KLGRIL++LTD+Y+K+CSA ++++MF Sbjct: 343 SRKVVARMLLTMGFEGATEVPVDVFSQLVSRHISKLGRILRLLTDSYKKECSATQLIKMF 402 Query: 992 LQTLGYSNLGNLAEHVKDGSRNFSHPTQQPHLRGLQSGFQSQHPSPNLQTQQIPRXXXXX 813 L 
T GYSNLG+LAE VKDG+RN P Q + LQ Q + QQI R Sbjct: 403 LNTTGYSNLGSLAELVKDGTRNHP-PLNQKQPQMLQQQLHLQQQASLRLPQQIQR-QMHP 460 Query: 812 XXXXXXQNLAFPQQPQLERMRRRQPSTPRAGMTLDKDRPMVEVKIENTSELPIDSNAFNP 633 + F QQ QLER+RRRQ ++PR M ++KDRP+V+VK+EN SE+ +D NAFNP Sbjct: 461 QMQQMVNSPTFQQQQQLERLRRRQVTSPRPNMDMEKDRPLVQVKLENPSEMAVDGNAFNP 520 Query: 632 IHARHP---QIQFRQQSMAAAMANLNPQSNH-QFKQLASLQVPQLQTQNMGTTRAPPVKV 465 ++ RH Q Q RQQ + AAM+N+ Q + QF+QLAS+Q+PQ+QT T RA PVKV Sbjct: 521 MNPRHQQQIQHQLRQQHI-AAMSNMQQQPGYNQFRQLASMQIPQMQTPTPATVRAQPVKV 579 Query: 464 EGFQELMGGDTTLKHDSEEHKLTSPSK 384 EGF++LMGGD++LKH+ ++ + P+K Sbjct: 580 EGFEQLMGGDSSLKHELDDKLRSPPTK 606 >emb|CAN70982.1| hypothetical protein VITISV_027119 [Vitis vinifera] Length = 405 Score = 543 bits (1399), Expect = e-151 Identities = 281/417 (67%), Positives = 320/417 (76%) Frame = -3 Query: 2225 MALLGEDGLGFELARKLETCGVWRSWLGDSNYSNFVHVLSSPLTWESFMKPQDSKTRAQI 2046 MALLG+DG GFELARKLE+CGVWRSWLGD+ YSNFV LSSP TWESFM+ DSK+RAQI Sbjct: 1 MALLGDDGRGFELARKLESCGVWRSWLGDALYSNFVQYLSSPNTWESFMRSDDSKSRAQI 60 Query: 2045 QLQLRVRALLFDKASVSLFLRXXXXXXXXXXXXXXXSNLNPNYLQLHGDDVYFSLEDFGS 1866 QLQLR RALLFDKASVSLFLR LNP+YLQLHGDDVYF+LE Sbjct: 61 QLQLRARALLFDKASVSLFLRSPSTPTSSLPVS----KLNPSYLQLHGDDVYFTLE---- 112 Query: 1865 SSQDGVQHQEGSTSSNSIQSKVVQIQPKLGFSVGSRYVEPEVDNISERPRPDDFPETWYK 1686 QD VQ +EG +SN+ SK IQPK FSVG RY E E+DNIS+R R ++FPETWY Sbjct: 113 --QDVVQQREGVVASNTAPSK---IQPKAAFSVGXRYAESEIDNISQRFRHEEFPETWYN 167 Query: 1685 QFIEKYSTSRQHRLPFGDREALKRTSEGMSTYLKLHERHKRSRQPFKEDQYTGFGNPIWE 1506 FIEKY SR ++L FG+RE+ KRT MS Y+KL E+HK+ R FKEDQ+ GFGNPI E Sbjct: 168 LFIEKYKASRPYKLSFGERESDKRTPRDMSVYIKLLEKHKKRRVAFKEDQHMGFGNPIVE 227 Query: 1505 NGSNIHSKSVSDGNNSIDDETCFFPEMMFPLNCVPDSALPSINRLEDNQKVECYGVLDSL 1326 N S+++ SV DG NS+DD+T FFPE MF LNCVPDSAL INR+EDNQKVE YGVLD+L Sbjct: 228 NKSSMYPSSVLDGKNSVDDDTYFFPETMFTLNCVPDSALLPINRVEDNQKVEFYGVLDTL 287 Query: 1325 PHVMTRSPAMLERFGIRPEYLRIGLERSKYRGKNGLEGNGKVLGPEQASQMSQKVISRVL 1146 P VMTRSP M+ER GIRPEY + S+YR KNG EGN K+LG EQA QMSQKVI+R+L Sbjct: 
288 PQVMTRSPIMIERLGIRPEYHSMEQGGSQYRNKNGTEGNRKLLGQEQALQMSQKVIARML 347 Query: 1145 VSVGFEGTTEVPMEVLSQFLGCHICKLGRILKVLTDNYRKQCSAIEILRMFLQTLGY 975 +GFE TEVPMEVLSQ L CHICKLGRILKVL+DNYRKQCSA E+L+MFLQT GY Sbjct: 348 TKMGFEVATEVPMEVLSQLLSCHICKLGRILKVLSDNYRKQCSATELLKMFLQTTGY 404 >dbj|BAA98173.1| unnamed protein product [Arabidopsis thaliana] Length = 595 Score = 539 bits (1389), Expect = e-150 Identities = 303/622 (48%), Positives = 411/622 (66%), Gaps = 8/622 (1%) Frame = -3 Query: 2225 MALLGEDGLGFELARKLETCGVWRSWLGDSNYSNFVHVLSSPLTWESFMKPQDSKTRAQI 2046 MALLG+DG GF+LARKLE GVWR+WLGDS YS+F H LSSP TWE+FM+ +SK+RAQI Sbjct: 1 MALLGDDGRGFDLARKLEVSGVWRTWLGDSIYSSFHHYLSSPSTWEAFMRVDESKSRAQI 60 Query: 2045 QLQLRVRALLFDKASVSLFLRXXXXXXXXXXXXXXXSNLNPNYLQLHGDDVYFSLEDFGS 1866 QLQLRVRALLFDKA+VSLFLR + + + LHGDDVY++LE+ + Sbjct: 61 QLQLRVRALLFDKATVSLFLRSNTIAASSSSSASIS---DVSSVALHGDDVYYTLEN--A 115 Query: 1865 SSQDGVQHQEGSTSSNSIQSKVVQIQPKLGFSVGSRYVEPEVDNISERPRPDDFPETWYK 1686 S + G Q + G + S+ + K F+ G+R E + N+S+R R ++ P+TWY Sbjct: 116 SLESGFQREGGIRHNPSLTKSL----SKPSFTSGTRGSESDFSNLSQRSRFEELPDTWYT 171 Query: 1685 QFIEKYSTSRQHRLPFGDREALKRTSEGMSTYLKLHERHKRSRQPFKEDQYTGFGNPIWE 1506 QFI +Y ++ + G +E+ KRT EGMSTYL++ + HKR R PF ED+ Sbjct: 172 QFISRYGF--KYGMSVGGQESDKRTPEGMSTYLRVVDTHKRKRAPFLEDRSLAH-----M 224 Query: 1505 NGSNIHSKSVSDGNNSIDDETCFFPEMMFPLNCVPDSALPSINRLEDNQKVECYGVLDSL 1326 + S+ H S DG+ S +D+ F PE MF +NCVP++AL I R +DN K E YGVLD+L Sbjct: 225 SRSSTHPSSGFDGSTS-EDDILFLPETMFRMNCVPETALSPITRTQDNLKTEFYGVLDTL 283 Query: 1325 PHVMTRSPAMLERFGIRPEYLRI---GLERSKYRGKNGLEGNGKVLGPEQASQMSQKVIS 1155 P V TRS M+ER G+ PEY R+ G+ RS+ K G +QA+ +S+KV++ Sbjct: 284 PQVTTRSHIMIERLGLMPEYHRMEERGVLRSRKAEKMG-------FSDDQAALVSRKVVA 336 Query: 1154 RVLVSVGFEGTTEVPMEVLSQFLGCHICKLGRILKVLTDNYRKQCSAIEILRMFLQTLGY 975 R+L+++GFEG TEVP++V SQ + H+ KLGRILK+LTD+Y+K+CSA+++++MFL T GY Sbjct: 337 RMLLTMGFEGATEVPIDVFSQLVSRHMSKLGRILKLLTDSYKKECSAMQLIKMFLNTTGY 396 Query: 974 SNLGNLAEHVKDGSRNFSHPTQ-QPHLRGLQSGFQSQHPSPNLQTQQIPRXXXXXXXXXX 798 SNLG+LAE VKDG+RN 
P Q QP + LQ Q + QQI R Sbjct: 397 SNLGSLAEIVKDGTRNHPPPNQKQPQV--LQQQLHLQQQASLRLPQQIQRQMHPQMQQMV 454 Query: 797 XQNLAFPQQPQLERMRRRQPSTPRAGMTLDKDRPMVEVKIENTSELPIDSNAFNPIHARH 618 F QQ QLERMRRR ++PR M ++KDRP+V+VK+EN SE+ +D NAFNP++ RH Sbjct: 455 NPQ-NFQQQQQLERMRRRPVTSPRPNMDMEKDRPLVQVKLENPSEMAVDGNAFNPMNPRH 513 Query: 617 P---QIQFRQQSMAAAMANLNPQSNH-QFKQLASLQVPQLQTQNMGTTRAPPVKVEGFQE 450 Q Q RQQ AAM+N+ Q + QF+QLAS+Q+PQ+QT +GT RA PVKVEGF++ Sbjct: 514 QQQLQQQLRQQQQIAAMSNMQQQPGYNQFRQLASMQIPQMQTPTLGTVRAQPVKVEGFEQ 573 Query: 449 LMGGDTTLKHDSEEHKLTSPSK 384 LMGGD++LKHDS++ + P+K Sbjct: 574 LMGGDSSLKHDSDDKLRSPPTK 595 >ref|XP_006426252.1| hypothetical protein CICLE_v10025202mg [Citrus clementina] gi|557528242|gb|ESR39492.1| hypothetical protein CICLE_v10025202mg [Citrus clementina] Length = 604 Score = 530 bits (1364), Expect = e-147 Identities = 313/653 (47%), Positives = 398/653 (60%), Gaps = 41/653 (6%) Frame = -3 Query: 2225 MALLGEDGLGFELARKLETCGVWRSWLGDSNYSNFVHVLSSPLTWESFMKPQDSKTRAQI 2046 MALLG+DG G++LA KLE+CGVWR+WLGDS YS F H LS+P +WESFM+ DSK+RAQI Sbjct: 1 MALLGDDGRGYQLALKLESCGVWRTWLGDSCYSTFHHALSTPASWESFMRTDDSKSRAQI 60 Query: 2045 QLQLRVRALLFDKASVSLFLRXXXXXXXXXXXXXXXSNLNPNYLQLHGDDVYFSLEDFGS 1866 LQLR RALLFDKA++SLFL S LNPNYLQL G DVYF+LE S Sbjct: 61 HLQLRARALLFDKATISLFL-----PSNQPPSSVAVSKLNPNYLQLDGGDVYFTLE---S 112 Query: 1865 SSQDGVQHQEGSTSSNSIQSKVVQIQPKLGFSVGSRYVEPEVDNISERPRPDDFPETWYK 1686 SSQDGVQH+E S +S++ K R R ++ PETWY Sbjct: 113 SSQDGVQHRESSAASSTTSGK--------------------------RFRNEELPETWYD 146 Query: 1685 QFIEKYSTSRQHRLPFGDREALKRTSEGMSTYLKLHERHKRSRQPFKEDQYTGFGNPIWE 1506 QFIEKY SRQ++L GDRE +RT+EGMS+YL+ E++K R PF+ D Sbjct: 147 QFIEKYRVSRQYKLSLGDRELDRRTAEGMSSYLRHLEKYKIRRVPFQND----------- 195 Query: 1505 NGSNIHSKSVSDGNNSIDDETCFFPEMMFPLNCVPDSALPSINRLEDNQKVECYGVLDSL 1326 HS S D NS D + FFPE MF LN VP+ A+P I E Q +E GVLD+L Sbjct: 196 -----HSNSALDVINSTDSDV-FFPETMFTLNSVPEIAVPQIIVEETKQNIEFNGVLDTL 249 Query: 1325 PHVMTRSPAMLERFGIRPEYLRIGLERSKYRGKNGLEGNGKVLGPEQASQMSQKVISRVL 1146 P MT+SP M+ER 
GIRPEYL + E + + G + LEGN K EQASQ+SQKVI+R+L Sbjct: 250 PQCMTKSPVMIERLGIRPEYLGMEQEGNSHHGNSALEGNKKCFSEEQASQISQKVIARML 309 Query: 1145 VSVGFEGTTEVPMEVLSQFLGCHICKLGRILKVLTDNYRKQCSAIEILRMFLQTLGYSNL 966 GFEG TEVP+EVLS+ LG HICKLGRILKVL+DNYRKQCSA+E+L+MFLQ G+SN Sbjct: 310 TGGGFEGATEVPLEVLSEMLGSHICKLGRILKVLSDNYRKQCSALELLKMFLQAAGHSNF 369 Query: 965 GNLAEHVKDGSRNFSHPTQQPHLRGLQSGFQSQ------------HPSPNL--------- 849 G LAE +KDG+RN +Q+ G ++ Q S L Sbjct: 370 GILAELIKDGNRNAVQQSQELIKDGSRNIVQQSQELIKDGARNVVQQSQELIKDGTRNIV 429 Query: 848 -QTQQIPRXXXXXXXXXXXQNL--------AFPQQP-QLERMRRRQPSTPRAGMT----- 714 Q Q++ + Q + + Q P +L + + +Q R+ M Sbjct: 430 QQNQELVKEGTRNFVQQSPQQVHGAQSQLQSHQQSPVKLPQQQMQQQHLERSRMRQPSTP 489 Query: 713 ---LDKD--RPMVEVKIENTSELPIDSNAFNPIHARHPQIQFRQQSMAAAMANLNPQSNH 549 +D D R M +V EN+S+LP+D+NA N +A+ Q+QF QQ + M+NL QS++ Sbjct: 490 RPGMDMDKDRSMSQVNAENSSKLPMDANALNASNAKQSQMQFHQQQL-NTMSNLQAQSSN 548 Query: 548 QFKQLASLQVPQLQTQNMGTTRAPPVKVEGFQELMGGDTTLKHDSEEHKLTSP 390 QFKQ +Q+PQ+ + NMG RAPPVKV+GFQELMGGD ++KHDSEE+KLTSP Sbjct: 549 QFKQSTPVQIPQMHSPNMGVVRAPPVKVDGFQELMGGDASMKHDSEENKLTSP 601 >ref|XP_002864962.1| hypothetical protein ARALYDRAFT_496788 [Arabidopsis lyrata subsp. lyrata] gi|297310797|gb|EFH41221.1| hypothetical protein ARALYDRAFT_496788 [Arabidopsis lyrata subsp. 
lyrata] Length = 603 Score = 529 bits (1363), Expect = e-147 Identities = 303/627 (48%), Positives = 409/627 (65%), Gaps = 15/627 (2%) Frame = -3 Query: 2225 MALLGEDGLGFELARKLETCGVWRSWLGDSNYSNFVHVLSSPLTWESFMKPQDSKTRAQI 2046 MALLG+DG GF+LAR+LE GVWR+WLGDS YS+F H L+SP WE+FM+ +SK RAQI Sbjct: 1 MALLGDDGRGFDLARRLELSGVWRTWLGDSIYSSFHHYLTSPSNWEAFMRVDESKCRAQI 60 Query: 2045 QLQLRVRALLFDKASVSLFLR-------XXXXXXXXXXXXXXXSNLNPNYLQLHGDDVYF 1887 QLQLRVRALLFDKA+VSLFLR S LNPNYLQLHGDDVY+ Sbjct: 61 QLQLRVRALLFDKATVSLFLRSNTIAASSSSSASISDVSSVAVSKLNPNYLQLHGDDVYY 120 Query: 1886 SLEDFGSSSQDGVQHQEGSTSSNSIQSKVVQIQPKLGFSVGSRYVEPEVDNISERPRPDD 1707 +LE+ +S + G Q G + S+ + K F G+R E + N+S+R R ++ Sbjct: 121 TLEN--ASLESGFQRDGGIRHNQSLTKSL----SKPSFISGTRGSESDFSNLSQRSRFEE 174 Query: 1706 FPETWYKQFIEKYSTSRQHRLPFGDREALKRTSEGMSTYLKLHERHKRSRQPFKEDQYTG 1527 P+TWY QFI +Y ++ + G +E+ KRT EGMSTYL++ + HKR R PF ED+ Sbjct: 175 LPDTWYTQFISRYGF--KYGMSVGGQESDKRTPEGMSTYLRVVDTHKRKRAPFLEDRSLA 232 Query: 1526 FGNPIWENGSNIHSKSVSDGNNSIDDETCFFPEMMFPLNCVPDSALPSINRLEDNQKVEC 1347 + S+ H S DG +S +D+ F PE MF +NCVP++AL + R +DN K E Sbjct: 233 H-----MSRSSTHPSSGFDGRSS-EDDILFLPETMFRMNCVPETALSPVTRTQDNLKTEF 286 Query: 1346 YGVLDSLPHVMTRSPAMLERFGIRPEYLRI---GLERSKYRGKNGLEGNGKVLGPEQASQ 1176 YGVLD+LP V TRS M+ER G+ PEY R+ G+ R + K G +QA+ Sbjct: 287 YGVLDTLPQVTTRSHIMIERLGMMPEYHRMEDRGVLRRRKAEKLG-------FSDDQAAL 339 Query: 1175 MSQKVISRVLVSVGFEGTTEVPMEVLSQFLGCHICKLGRILKVLTDNYRKQCSAIEILRM 996 +S+KV++R+L+++GFEG TEVP++V SQ + H+ KLG ILK+L+D+Y+K+CSA+++++M Sbjct: 340 VSRKVVARMLLTMGFEGATEVPIDVFSQLVSRHMSKLGHILKLLSDSYKKECSAMQLIKM 399 Query: 995 FLQTLGYSNLGNLAEHVKDGSRNFSHPTQ-QPHLRGLQSGFQSQHPSPNLQTQQIPRXXX 819 FL T GYSNLG+LAE VKDG+RN P Q QP + LQ Q + QQI R Sbjct: 400 FLNTTGYSNLGSLAELVKDGTRNHPPPNQKQPQV--LQQQLHLQQQASLRLPQQIQRQMH 457 Query: 818 XXXXXXXXQNLAFPQQPQLERMRRRQPSTPRAGMTLDKDRPMVEVKIENTSELPIDSNAF 639 F QQ QLERMRRR ++PR M ++KDRP+V+VK+EN S++ +D NAF Sbjct: 458 PQMQQMVNPQ-NFQQQQQLERMRRRPVTSPRPNMDMEKDRPLVQVKLENPSDMAVDGNAF 516 Query: 638 
NPIHARHP---QIQFRQQSMAAAMANLNPQSNH-QFKQLASLQVPQLQTQNMGTTRAPPV 471 NP++ RH Q Q RQQ + AA +N+ Q + QF+QLAS+Q+PQ+QT GT RA PV Sbjct: 517 NPMNPRHQQQMQQQLRQQQI-AAKSNMQQQPGYSQFRQLASMQIPQMQTPTPGTVRAQPV 575 Query: 470 KVEGFQELMGGDTTLKHDSEEHKLTSP 390 KVEGF++LMGGD++LKH+S++ KL SP Sbjct: 576 KVEGFEQLMGGDSSLKHESDD-KLRSP 601 >gb|EYU30927.1| hypothetical protein MIMGU_mgv1a003113mg [Mimulus guttatus] Length = 607 Score = 517 bits (1332), Expect = e-144 Identities = 312/650 (48%), Positives = 410/650 (63%), Gaps = 36/650 (5%) Frame = -3 Query: 2225 MALLGEDGLGFELARKLETCGVWRSWLGDSNYSNFVHVLSSPLTWESFMKPQDSKTRAQI 2046 MALLG+DG GFELARKLE+ GVWR WLGD++YS F++ L+SP W+ FM+ SKT+ QI Sbjct: 1 MALLGDDGRGFELARKLESHGVWRPWLGDAHYSAFINFLASPEKWDIFMRADKSKTKDQI 60 Query: 2045 QLQLRVRALLFDKASVSLFLRXXXXXXXXXXXXXXXSNLNPNYLQLHGDDVYFSLEDFGS 1866 LQLR RALLFDKASVSLF + S LNPNYL+LHGDDVYF+ ED Sbjct: 61 YLQLRARALLFDKASVSLFTQ--------SPPPAPVSKLNPNYLELHGDDVYFTFED--- 109 Query: 1865 SSQDGVQHQEGSTSSNSIQSKVVQIQPKLGFSVGSRYVEPEVDNISERPRPDDFPETWYK 1686 ++D Q Q +SN+ SK K VGSR+ E E +E + ++ PETWY Sbjct: 110 GAKDVDQRQPSLAASNTTSSKGYS---KTSVGVGSRFNETE----TETDKLEELPETWYS 162 Query: 1685 QFIEKYSTSRQHRLPFGDREALKRTSEGMSTYLKLHERHKRSRQPFKEDQYTGFGNPIWE 1506 QF EKY S+ +RL FGDRE+ KRT E MSTYL++ E HKR R F + Sbjct: 163 QFFEKYRASKSYRLIFGDRESEKRTPEQMSTYLRVLENHKRRRVAFV------------D 210 Query: 1505 NGSNIHSKSVSDGNNSIDDETCFFPEMMFPLNCVPDSALPSINRLEDNQKVECYGVLDSL 1326 N SN+ S+S+ D+ FPE MF LNCVPDSA+ + LE++QK++ GVLD+L Sbjct: 211 NTSNLRPNSLSE-----LDDIPLFPETMFTLNCVPDSAVLQTSGLENHQKLQFNGVLDNL 265 Query: 1325 PHVMTR----SPAMLERFGIRPEYLRIGLERSKYRGKNGLEGNGKVLGPEQASQMSQKVI 1158 P +MT+ SP M+ER GIRPE+L + + RG+N G+ ++ G EQA Q+S+KV+ Sbjct: 266 PQIMTKSTMISPIMIERLGIRPEFLNM----EQTRGRN---GSMRIRGEEQAVQISKKVV 318 Query: 1157 SRVLVSVGFEGTTEVPMEVLSQFLGCHICKLGRILKVLTDNYRKQCSAIEILRMFLQTLG 978 +R+L +VGFE +++ +EVL Q L CHI KLGR LK+L+D+YRKQCSA E+++MFLQT G Sbjct: 319 ARLLTNVGFESCSDLSLEVLPQLLSCHIGKLGRTLKLLSDSYRKQCSANELVKMFLQTAG 378 Query: 977 
YS-NLGNLAEHVKDGSRNFSHPTQQPHLRGLQSGFQSQHPSPNLQTQQIPR--XXXXXXX 807 YS N+G L + +KD ++N P QQ ++ +Q+ Q Q L +QQIPR Sbjct: 379 YSNNMGALVQIIKDNTKNGVQPNQQ-QVQAIQAQLQLQQQPSILPSQQIPRQINPQMQQQ 437 Query: 806 XXXXQNLAF-PQQPQLERMRRRQPS-TPRAGMT-------------LDKD-RPMVEVKIE 675 Q LAF QQ Q ERMRRRQ PR GM +DKD RP+V+VK+E Sbjct: 438 MNNAQYLAFQQQQQQWERMRRRQQQPAPRPGMNTNVNMNMNTNTNMIDKDNRPLVQVKME 497 Query: 674 NTSELPIDSNAFNPIHARHPQI----------QFRQQSMA---AAMANLNPQSNHQFKQL 534 N SE P+D+NAF +++RHPQ+ Q QQ +A A N N +N+ F+ + Sbjct: 498 NPSEFPLDANAFAAVNSRHPQLLQIRHQQEQQQLAQQQLAQQVQANNNNNNNNNNVFRPM 557 Query: 533 ASLQVPQLQTQNMGTTRAPPVKVEGFQELMGGDTTLKHDSEEHKLTSPSK 384 SLQ+PQ+ + +M RAPPVKVEGFQELMGGD+++KHDSEE+KL SP K Sbjct: 558 TSLQIPQILSPSMSMPRAPPVKVEGFQELMGGDSSIKHDSEENKLLSPQK 607 >ref|XP_007047764.1| Transcription initiation factor TFIID subunit 8, putative isoform 2 [Theobroma cacao] gi|508700025|gb|EOX91921.1| Transcription initiation factor TFIID subunit 8, putative isoform 2 [Theobroma cacao] Length = 489 Score = 507 bits (1305), Expect = e-140 Identities = 269/467 (57%), Positives = 330/467 (70%), Gaps = 2/467 (0%) Frame = -3 Query: 2225 MALLGEDGLGFELARKLETCGVWRSWLGDSNYSNFVHVLSSPLTWESFMKPQDSKTRAQI 2046 MALLG+DG G++LAR+LE+CGVWR+WLGDS Y++F+H LSSP WESFM+ DSK+R+QI Sbjct: 1 MALLGDDGRGYDLARRLESCGVWRAWLGDSTYASFIHFLSSPSAWESFMRVDDSKSRSQI 60 Query: 2045 QLQLRVRALLFDKASVSLFLRXXXXXXXXXXXXXXXS--NLNPNYLQLHGDDVYFSLEDF 1872 LQLR RALLFDKA+V+LFLR + LNPNYLQLHGDDVYF+LE Sbjct: 61 HLQLRARALLFDKATVALFLRSNSSNPANNTSSSSVAVSKLNPNYLQLHGDDVYFTLE-- 118 Query: 1871 GSSSQDGVQHQEGSTSSNSIQSKVVQIQPKLGFSVGSRYVEPEVDNISERPRPDDFPETW 1692 GS Q+G ++N+ SK K FS GSRY E E D++S+R R ++ PETW Sbjct: 119 GSL-------QDGGAAANAAPSK-----SKSSFSAGSRYGESEFDSLSQRYRKEELPETW 166 Query: 1691 YKQFIEKYSTSRQHRLPFGDREALKRTSEGMSTYLKLHERHKRSRQPFKEDQYTGFGNPI 1512 Y QFIEKY SR ++L GDRE+ KRT E M+TYL++ E+HKR R F+EDQY G+G+ Sbjct: 167 YNQFIEKYRLSRPYKLFLGDRESEKRTPEEMTTYLRIVEKHKRRRVAFQEDQYMGYGS-- 224 Query: 1511 WENGSNIHSKSVSDGNNSIDDETCFFPEMMFPLNCVPDSALPSINRLEDNQKVECYGVLD 
1332 + + S SV DGNNS DDE FFPE+M +NCVPDSALP R+ D + +E YGVLD Sbjct: 225 ----TGLESNSVLDGNNSGDDEIPFFPEIMSMMNCVPDSALPPATRVWDKKTIEFYGVLD 280 Query: 1331 SLPHVMTRSPAMLERFGIRPEYLRIGLERSKYRGKNGLEGNGKVLGPEQASQMSQKVISR 1152 +LP V TRSP M+ER GIRPEYL + + +RGKN N K+LG EQASQMS+KVI+R Sbjct: 281 TLPQVSTRSPVMIERLGIRPEYLNMEQGGNTHRGKN----NRKLLGQEQASQMSRKVIAR 336 Query: 1151 VLVSVGFEGTTEVPMEVLSQFLGCHICKLGRILKVLTDNYRKQCSAIEILRMFLQTLGYS 972 +L VGFEG TE P+EV SQFL CHIC+LGR +KVLTDNYRKQCSAIE++RMFLQT GYS Sbjct: 337 LLNGVGFEGATEAPVEVFSQFLSCHICRLGRNIKVLTDNYRKQCSAIELIRMFLQTSGYS 396 Query: 971 NLGNLAEHVKDGSRNFSHPTQQPHLRGLQSGFQSQHPSPNLQTQQIP 831 N G LAE VKD +RN T Q + G+QS Q QH + QQ+P Sbjct: 397 NFGTLAELVKDSTRNVVQQTPQ-QMHGIQSQLQPQHQNALRMAQQLP 442 Score = 88.6 bits (218), Expect = 1e-14 Identities = 43/74 (58%), Positives = 50/74 (67%) Frame = -3 Query: 611 IQFRQQSMAAAMANLNPQSNHQFKQLASLQVPQLQTQNMGTTRAPPVKVEGFQELMGGDT 432 +Q Q M + L PQ + + L + Q+ QNMG RAPPVKVEGFQELMGGDT Sbjct: 413 VQQTPQQMHGIQSQLQPQHQNALRMAQQLPMRQMHPQNMGIVRAPPVKVEGFQELMGGDT 472 Query: 431 TLKHDSEEHKLTSP 390 TLKHDSEE+KLTSP Sbjct: 473 TLKHDSEENKLTSP 486 >ref|XP_007047765.1| Transcription initiation factor TFIID subunit 8, putative isoform 3 [Theobroma cacao] gi|508700026|gb|EOX91922.1| Transcription initiation factor TFIID subunit 8, putative isoform 3 [Theobroma cacao] Length = 445 Score = 503 bits (1296), Expect = e-139 Identities = 268/465 (57%), Positives = 328/465 (70%), Gaps = 2/465 (0%) Frame = -3 Query: 2225 MALLGEDGLGFELARKLETCGVWRSWLGDSNYSNFVHVLSSPLTWESFMKPQDSKTRAQI 2046 MALLG+DG G++LAR+LE+CGVWR+WLGDS Y++F+H LSSP WESFM+ DSK+R+QI Sbjct: 1 MALLGDDGRGYDLARRLESCGVWRAWLGDSTYASFIHFLSSPSAWESFMRVDDSKSRSQI 60 Query: 2045 QLQLRVRALLFDKASVSLFLRXXXXXXXXXXXXXXXS--NLNPNYLQLHGDDVYFSLEDF 1872 LQLR RALLFDKA+V+LFLR + LNPNYLQLHGDDVYF+LE Sbjct: 61 HLQLRARALLFDKATVALFLRSNSSNPANNTSSSSVAVSKLNPNYLQLHGDDVYFTLE-- 118 Query: 1871 GSSSQDGVQHQEGSTSSNSIQSKVVQIQPKLGFSVGSRYVEPEVDNISERPRPDDFPETW 1692 GS Q+G ++N+ SK K FS GSRY E E D++S+R R ++ PETW Sbjct: 119 
GSL-------QDGGAAANAAPSK-----SKSSFSAGSRYGESEFDSLSQRYRKEELPETW 166 Query: 1691 YKQFIEKYSTSRQHRLPFGDREALKRTSEGMSTYLKLHERHKRSRQPFKEDQYTGFGNPI 1512 Y QFIEKY SR ++L GDRE+ KRT E M+TYL++ E+HKR R F+EDQY G+G+ Sbjct: 167 YNQFIEKYRLSRPYKLFLGDRESEKRTPEEMTTYLRIVEKHKRRRVAFQEDQYMGYGS-- 224 Query: 1511 WENGSNIHSKSVSDGNNSIDDETCFFPEMMFPLNCVPDSALPSINRLEDNQKVECYGVLD 1332 + + S SV DGNNS DDE FFPE+M +NCVPDSALP R+ D + +E YGVLD Sbjct: 225 ----TGLESNSVLDGNNSGDDEIPFFPEIMSMMNCVPDSALPPATRVWDKKTIEFYGVLD 280 Query: 1331 SLPHVMTRSPAMLERFGIRPEYLRIGLERSKYRGKNGLEGNGKVLGPEQASQMSQKVISR 1152 +LP V TRSP M+ER GIRPEYL + + +RGKN N K+LG EQASQMS+KVI+R Sbjct: 281 TLPQVSTRSPVMIERLGIRPEYLNMEQGGNTHRGKN----NRKLLGQEQASQMSRKVIAR 336 Query: 1151 VLVSVGFEGTTEVPMEVLSQFLGCHICKLGRILKVLTDNYRKQCSAIEILRMFLQTLGYS 972 +L VGFEG TE P+EV SQFL CHIC+LGR +KVLTDNYRKQCSAIE++RMFLQT GYS Sbjct: 337 LLNGVGFEGATEAPVEVFSQFLSCHICRLGRNIKVLTDNYRKQCSAIELIRMFLQTSGYS 396 Query: 971 NLGNLAEHVKDGSRNFSHPTQQPHLRGLQSGFQSQHPSPNLQTQQ 837 N G LAE VKD +RN T Q + G+QS Q QH + QQ Sbjct: 397 NFGTLAELVKDSTRNVVQQTPQ-QMHGIQSQLQPQHQNALRMAQQ 440