BLASTX nr result
ID: Akebia22_contig00012778
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Akebia22_contig00012778 (2129 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_007208312.1| hypothetical protein PRUPE_ppa003035mg [Prun... 694 0.0 ref|XP_007047763.1| Transcription initiation factor TFIID subuni... 691 0.0 ref|XP_002533519.1| conserved hypothetical protein [Ricinus comm... 650 0.0 ref|XP_004288527.1| PREDICTED: uncharacterized protein LOC101303... 640 0.0 gb|EXC35477.1| hypothetical protein L484_026784 [Morus notabilis] 635 e-179 ref|XP_004143440.1| PREDICTED: uncharacterized protein LOC101223... 627 e-177 ref|XP_002310863.2| hypothetical protein POPTR_0007s14190g [Popu... 607 e-171 ref|XP_006380803.1| hypothetical protein POPTR_0007s14190g [Popu... 603 e-169 ref|XP_006466330.1| PREDICTED: uncharacterized protein LOC102616... 579 e-162 ref|XP_006466329.1| PREDICTED: uncharacterized protein LOC102616... 579 e-162 ref|NP_201357.2| uncharacterized protein [Arabidopsis thaliana] ... 554 e-155 ref|XP_006426252.1| hypothetical protein CICLE_v10025202mg [Citr... 553 e-155 ref|XP_006394014.1| hypothetical protein EUTSA_v10003865mg [Eutr... 548 e-153 ref|XP_006280200.1| hypothetical protein CARUB_v10026105mg [Caps... 545 e-152 emb|CAN70982.1| hypothetical protein VITISV_027119 [Vitis vinifera] 542 e-151 dbj|BAA98173.1| unnamed protein product [Arabidopsis thaliana] 540 e-150 ref|XP_002864962.1| hypothetical protein ARALYDRAFT_496788 [Arab... 530 e-147 gb|EYU30927.1| hypothetical protein MIMGU_mgv1a003113mg [Mimulus... 518 e-144 ref|XP_007047764.1| Transcription initiation factor TFIID subuni... 506 e-140 ref|XP_007047765.1| Transcription initiation factor TFIID subuni... 503 e-139 >ref|XP_007208312.1| hypothetical protein PRUPE_ppa003035mg [Prunus persica] gi|462403954|gb|EMJ09511.1| hypothetical protein PRUPE_ppa003035mg [Prunus persica] Length = 610 Score = 694 bits (1791), Expect = 0.0 Identities = 375/621 (60%), Positives = 440/621 (70%), Gaps = 9/621 (1%) Frame = -1 Query: 2120 MALLGEDGLGFELARKLETCGVWRSWLGDSNYSNFVHVLSSPLTWESFMKPQDSKTRAQI 1941 MALLG+DG G+ELA KLE+C VWRSWLGDS Y+NF L+SP TWE+FM DSK+RA + Sbjct: 1 MALLGDDGRGYELACKLESCNVWRSWLGDSTYANFAPFLNSPSTWEAFM---DSKSRAHL 57 Query: 1940 QLQLRVRALLFDKASVSLFLRXXXXXXXXXXXXXXXSNL-----NPNYLQLHGDDVYFSL 1776 LQLR RALLFDKA VSLFLR S+L NP YLQLH DDVYF+L Sbjct: 58 HLQLRARALLFDKACVSLFLRPHSNSSSSSSSSSSSSSLAVSKLNPYYLQLHPDDVYFTL 117 Query: 1775 EDFGSSSQDGVQHQEGSTSSNSIQSKVVQIQPKLGFSVGSRYVEPEVDNISERPRPDDFP 1596 E+ SSQDGVQ Q+ S +S +IQ K F VGSRY E E+DN R + D+ P Sbjct: 118 EN---SSQDGVQVQQRDPSVSS------KIQSKAAFGVGSRYGESEIDNKPSRFKNDELP 168 Query: 1595 ETWYKQFIEKYSTSRQQRLPFGDREALKRTSEGMSTYLKLHERHKRSRQPFKEDQYTGFG 1416 ETWY QF+E+Y S+ RL DRE+ KRT E MS YLKL ERHK+ R FKEDQY G+G Sbjct: 169 ETWYNQFMERYRISKPYRLSSADRESEKRTPEEMSAYLKLLERHKKRRLAFKEDQYMGYG 228 Query: 1415 NPIWENGSNIHSKSVSDGNNSIDDETCFFPEMMFPLNCVPDSALPSINRLEDNQKVECYG 1236 NPI EN S+++ SV DG+NS+D E FFPE MF NCVPDSALP +NR EDNQKVECYG Sbjct: 229 NPILENVSHMNPNSVLDGSNSVDSEISFFPETMFTFNCVPDSALPPLNREEDNQKVECYG 288 Query: 1235 VLDSLPHVMTRSPAMLERFGIRPEYLRIGLERSKYRGKNGLEGNGKVLGPEQASQMSQKV 1056 VLD LP +MTRSP MLER GIRPEYL + +RGKNG GN K L EQA+Q+SQ V Sbjct: 289 VLDMLPQIMTRSPVMLERLGIRPEYLSMEQGGILHRGKNGSGGNRKCLSKEQAAQLSQTV 348 Query: 1055 ISRVLVSVGFEGTTEVPMEVLSQFLGCHICKLGRILKVLTDNYRKQCSAIEILRMFLQTL 876 I+R+L S+GFE TEVP++V SQ L CHI KLG LKVLTD+YRKQCSAIE+L+MFLQT+ Sbjct: 349 IARMLTSIGFESATEVPIDVFSQMLSCHISKLGGSLKVLTDSYRKQCSAIELLKMFLQTI 408 Query: 875 GYSNLGNLAEHVKDGSRNFSHPTQQPHLRGLQSGFQSQHPSSNLQTQQIPR-XXXXXXXX 699 GYSN G L E VKDGSRNF QQ H G QS Q QH + QQ R Sbjct: 409 GYSNFGPLMEQVKDGSRNFQQTQQQIH--GSQSQLQPQHQNPIRLPQQTSRQMLPQMQQV 466 Query: 698 XXXQNLAFPQQPQLERMRRRQPSTPRAGMTLDKDRPMVEVKIENTSELPIDSNAFNPIHA 519 +N+ F QQ LERMRRRQPSTPRAGM +DKDRPMV+VKIE SELP+D NAF ++ Sbjct: 467 ALSKNVPFQQQQPLERMRRRQPSTPRAGMDMDKDRPMVQVKIEAPSELPMDGNAFYGLNN 526 Query: 518 RHPQIQFRQQSMAAA---MANLNPQSNHQFKQLASLQVPQLQTQNMGTTRAPPVKVEGFQ 348 R+ Q+QFRQQ A + M N++PQS +QF+Q+ASLQ+PQ+Q QN G RAPPVKVEGFQ Sbjct: 527 RNLQMQFRQQIPAMSNLTMPNVHPQSGNQFRQMASLQIPQMQAQNAGVLRAPPVKVEGFQ 586 Query: 347 ELMGGDTTLKHDSEEHKLTSP 285 ELMGGD + KHDS+E++LTSP Sbjct: 587 ELMGGDASSKHDSDENRLTSP 607 >ref|XP_007047763.1| Transcription initiation factor TFIID subunit 8, putative isoform 1 [Theobroma cacao] gi|508700024|gb|EOX91920.1| Transcription initiation factor TFIID subunit 8, putative isoform 1 [Theobroma cacao] Length = 593 Score = 691 bits (1784), Expect = 0.0 Identities = 369/616 (59%), Positives = 446/616 (72%), Gaps = 4/616 (0%) Frame = -1 Query: 2120 MALLGEDGLGFELARKLETCGVWRSWLGDSNYSNFVHVLSSPLTWESFMKPQDSKTRAQI 1941 MALLG+DG G++LAR+LE+CGVWR+WLGDS Y++F+H LSSP WESFM+ DSK+R+QI Sbjct: 1 MALLGDDGRGYDLARRLESCGVWRAWLGDSTYASFIHFLSSPSAWESFMRVDDSKSRSQI 60 Query: 1940 QLQLRVRALLFDKASVSLFLRXXXXXXXXXXXXXXXS--NLNPNYLQLHGDDVYFSLEDF 1767 LQLR RALLFDKA+V+LFLR + LNPNYLQLHGDDVYF+LE Sbjct: 61 HLQLRARALLFDKATVALFLRSNSSNPANNTSSSSVAVSKLNPNYLQLHGDDVYFTLE-- 118 Query: 1766 GSSSQDGVQHQEGSTSSNSIQSKVVQIQPKLGFSVGSRYVEPEVDNISERPRPDDFPETW 1587 GS Q+G ++N+ SK K FS GSRY E E D++S+R R ++ PETW Sbjct: 119 GSL-------QDGGAAANAAPSK-----SKSSFSAGSRYGESEFDSLSQRYRKEELPETW 166 Query: 1586 YKQFIEKYSTSRQQRLPFGDREALKRTSEGMSTYLKLHERHKRSRQPFKEDQYTGFGNPI 1407 Y QFIEKY SR +L GDRE+ KRT E M+TYL++ E+HKR R F+EDQY G+G+ Sbjct: 167 YNQFIEKYRLSRPYKLFLGDRESEKRTPEEMTTYLRIVEKHKRRRVAFQEDQYMGYGS-- 224 Query: 1406 WENGSNIHSKSVSDGNNSIDDETCFFPEMMFPLNCVPDSALPSINRLEDNQKVECYGVLD 1227 + + S SV DGNNS DDE FFPE+M +NCVPDSALP R+ D + +E YGVLD Sbjct: 225 ----TGLESNSVLDGNNSGDDEIPFFPEIMSMMNCVPDSALPPATRVWDKKTIEFYGVLD 280 Query: 1226 SLPHVMTRSPAMLERFGIRPEYLRIGLERSKYRGKNGLEGNGKVLGPEQASQMSQKVISR 1047 +LP V TRSP M+ER GIRPEYL + + +RGKN N K+LG EQASQMS+KVI+R Sbjct: 281 TLPQVSTRSPVMIERLGIRPEYLNMEQGGNTHRGKN----NRKLLGQEQASQMSRKVIAR 336 Query: 1046 VLVSVGFEGTTEVPMEVLSQFLGCHICKLGRILKVLTDNYRKQCSAIEILRMFLQTLGYS 867 +L VGFEG TE P+EV SQFL CHIC+LGR +KVLTDNYRKQCSAIE++RMFLQT GYS Sbjct: 337 LLNGVGFEGATEAPVEVFSQFLSCHICRLGRNIKVLTDNYRKQCSAIELIRMFLQTSGYS 396 Query: 866 NLGNLAEHVKDGSRNFSHPTQQPHLRGLQSGFQSQHPSSNLQTQQIP--RXXXXXXXXXX 693 N G LAE VKD +RN T Q + G+QS Q QH ++ QQ+P + Sbjct: 397 NFGTLAELVKDSTRNVVQQTPQ-QMHGIQSQLQPQHQNALRMAQQLPMRQMHPQMQQMVH 455 Query: 692 XQNLAFPQQPQLERMRRRQPSTPRAGMTLDKDRPMVEVKIENTSELPIDSNAFNPIHARH 513 QNL F QQ QLER+RRR PSTPR M +DKDRPMV+VKIEN SELP+DSNAFNPI+ RH Sbjct: 456 PQNLTFQQQQQLERIRRRHPSTPRPVMDMDKDRPMVQVKIENPSELPMDSNAFNPINTRH 515 Query: 512 PQIQFRQQSMAAAMANLNPQSNHQFKQLASLQVPQLQTQNMGTTRAPPVKVEGFQELMGG 333 Q+QFRQQ AA++NL+ Q ++QF+QL S Q+ Q+QTQNMG RAPPVKVEGFQELMGG Sbjct: 516 SQMQFRQQQF-AAISNLHAQPSNQFRQLMSPQIHQMQTQNMGIVRAPPVKVEGFQELMGG 574 Query: 332 DTTLKHDSEEHKLTSP 285 DTTLKHDSEE+KLTSP Sbjct: 575 DTTLKHDSEENKLTSP 590 >ref|XP_002533519.1| conserved hypothetical protein [Ricinus communis] gi|223526616|gb|EEF28863.1| conserved hypothetical protein [Ricinus communis] Length = 573 Score = 650 bits (1678), Expect = 0.0 Identities = 353/615 (57%), Positives = 437/615 (71%), Gaps = 2/615 (0%) Frame = -1 Query: 2120 MALLGEDGLGFELARKLETCGVWRSWLGDSNYSNFVHVLSSPLTWESFMKPQDSKTRAQI 1941 M+LLG+DG G++LARKLE+ G WR+WLGDS YSNFVH LSSP +W+SFM+ DSK++AQI Sbjct: 1 MSLLGDDGNGYDLARKLESLGTWRTWLGDSLYSNFVHFLSSPSSWDSFMRTDDSKSKAQI 60 Query: 1940 QLQLRVRALLFDKASVSLFLRXXXXXXXXXXXXXXXSNLNPNYLQLHGDDVYFSLEDFGS 1761 LQLR RALLFDKA+VSLF+ LNP+YLQLHGDDVYF+LED Sbjct: 61 HLQLRARALLFDKATVSLFISNNNNSCSALAVS----KLNPSYLQLHGDDVYFTLED--- 113 Query: 1760 SSQDGVQHQEGSTSSNSIQSKVVQIQPKLGFSVGSRYVEPEVDNISERPRPDDFPETWYK 1581 G Q Q + S + +S FS+GSRY EPE++ +++R R ++FPE+WY Sbjct: 114 ----GDQRQNAALSKSHSKS---------AFSIGSRYGEPEMEGLTQRFRNEEFPESWYN 160 Query: 1580 QFIEKYSTSRQQRLPFGDREALKRTSEGMSTYLKLHERHKRSRQPFKEDQYTGFGNPIWE 1401 QFIEKY SR RL G+RE+ KR+ E MS+YL+L ++HKR R Sbjct: 161 QFIEKYKVSRPYRLSVGERESDKRSPEEMSSYLRLVDKHKRRRI---------------S 205 Query: 1400 NGSNIHSKSVSDGNNSIDDETC-FFPEMMFPLNCVPDSALPSINRLEDNQKVECYGVLDS 1224 + ++HS SV DG+NS DD+ FFPE MF LNCVPDSALP I R +DNQK+E +GVLDS Sbjct: 206 STPSMHSSSVLDGSNSTDDDDLSFFPETMFMLNCVPDSALPLIIRPQDNQKIEFHGVLDS 265 Query: 1223 LPHVMTRSPAMLERFGIRPEYLRIGLERSKYRGKNGLEGNGKVLGPEQASQMSQKVISRV 1044 LP TRS ++ER GI E S +R KNG EGN K++ EQASQM QKV++R+ Sbjct: 266 LPQ--TRSSVVIERLGISVEQ-----GGSLHRAKNGSEGNKKLISQEQASQMCQKVVARM 318 Query: 1043 LVSVGFEGTTEVPMEVLSQFLGCHICKLGRILKVLTDNYRKQCSAIEILRMFLQTLGYSN 864 L VGF+ TE+P+EVLSQ L CHI +LGR LK+L DNYRKQCSAI++L+MFLQT G++N Sbjct: 319 LARVGFDSATELPVEVLSQALRCHISELGRNLKILADNYRKQCSAIDLLKMFLQTAGFNN 378 Query: 863 LGNLAEHVKDGSRNFSHPTQQPHLRGLQSGFQSQHPSSNLQTQQIPR-XXXXXXXXXXXQ 687 LG L E VKDG+RN PTQQ + +QS Q+QH S+ QQIPR Q Sbjct: 379 LGGLMELVKDGTRNVVQPTQQ-QMHAIQSQLQAQHQSTLRLPQQIPRQMHPQMQQMVHPQ 437 Query: 686 NLAFPQQPQLERMRRRQPSTPRAGMTLDKDRPMVEVKIENTSELPIDSNAFNPIHARHPQ 507 NLAF QQ QLERMRRRQPSTPR M +DKDRPMV+VKIEN SELP+D NAFNP+H+RHPQ Sbjct: 438 NLAFQQQQQLERMRRRQPSTPRPAMDIDKDRPMVQVKIENPSELPMDGNAFNPMHSRHPQ 497 Query: 506 IQFRQQSMAAAMANLNPQSNHQFKQLASLQVPQLQTQNMGTTRAPPVKVEGFQELMGGDT 327 +QFRQQ + AA+++L QS++QF+QLAS+QVPQ+Q+ NMG RAPPVKVEGFQELMGGD Sbjct: 498 MQFRQQQL-AAISSLQAQSSNQFRQLASMQVPQVQSPNMGIVRAPPVKVEGFQELMGGDA 556 Query: 326 TLKHDSEEHKLTSPS 282 ++KHD EE+KLTSPS Sbjct: 557 SVKHDPEENKLTSPS 571 >ref|XP_004288527.1| PREDICTED: uncharacterized protein LOC101303161 [Fragaria vesca subsp. vesca] Length = 596 Score = 640 bits (1651), Expect = 0.0 Identities = 354/623 (56%), Positives = 426/623 (68%), Gaps = 11/623 (1%) Frame = -1 Query: 2120 MALLGEDGLGFELARKLETCGVWRSWLGDSNYSNFVHVLSSPLTWESFMKPQDSKTRAQI 1941 MALLG+DG G+ELA KLE+C VWR+WLGDS+YS FVH L+SP TW+SFM+ SK+RAQI Sbjct: 1 MALLGDDGRGYELACKLESCNVWRTWLGDSSYSTFVHFLTSPSTWDSFMRSDPSKSRAQI 60 Query: 1940 QLQLRVRALLFDKASVSLFLRXXXXXXXXXXXXXXXSNLNPNYLQLHGDDVYFSLEDFGS 1761 LQLR RALLFDKASVSLFLR NLNPNYLQLH DDVYFSLE+ Sbjct: 61 LLQLRARALLFDKASVSLFLRPDSASNSSAVS-----NLNPNYLQLHADDVYFSLEN--- 112 Query: 1760 SSQDGVQHQEGSTSSNSIQSKVVQIQPKLGFSVGSRYVEPEVDNISERPRPDDFPETWYK 1581 SS +GVQ Q+ S +IQ K F GSRY E E+DN S R + ++ PETWY Sbjct: 113 SSAEGVQAQQRDAS---------KIQSKTNFGFGSRYGESEIDNKSARFKNEELPETWYN 163 Query: 1580 QFIEKYSTSRQQRLPFGDREALKRTSEGMSTYLKLHERHKRSRQPFKEDQYTGFGNPIWE 1401 Q E++ SR RL DRE+ +RT E M Y+KL +HK+ FKE+Q G+ NP+ E Sbjct: 164 QVSERHRVSRTHRLSSADRESERRTPEEMCAYIKLAMKHKKRCIAFKEEQPVGYRNPLLE 223 Query: 1400 NGSNIHSKSVSDGNNSIDDETCFFPEMMFPLNCVPDSALPSINRLEDNQKVECYGVLDSL 1221 N S + S DG+NS+D E FFPE MF NCVPDSALP +NR +D+QKVE GVLD+L Sbjct: 224 NASQ-NPHSGLDGSNSVDHEAPFFPETMFTFNCVPDSALPPMNREQDDQKVEFCGVLDTL 282 Query: 1220 PHVMTRSPAMLERFGIRPEYLRIGLERSKYRGKNGLEGNGKVLGPEQASQMSQKVISRVL 1041 P VMTRSP MLER GIRPEYL S RGKNG GN L EQA+Q+SQKVI+R+L Sbjct: 283 PQVMTRSPVMLERLGIRPEYL------SMDRGKNGSAGNKSCLTHEQAAQLSQKVIARIL 336 Query: 1040 VSVGFEGTTEVPMEVLSQFLGCHICKLGRILKVLTDNYRKQCSAIEILRMFLQTLGYSNL 861 +VGFEG++EVP+EV SQ L CHI KLG LKVLTD+YRKQCSAIE+L+MFLQT+GY N Sbjct: 337 TNVGFEGSSEVPIEVFSQLLSCHIRKLGSCLKVLTDSYRKQCSAIELLKMFLQTVGYRNF 396 Query: 860 GNLAEHVKDGSRNFSHPTQQPHLRGLQSGFQSQHPSSNLQTQQIPR-------XXXXXXX 702 G LA+ VKDGSR+ H Q + G+QS Q QH + QQI R Sbjct: 397 GPLADQVKDGSRSV-HQQNQQQIHGMQSQLQPQHQNPIRLPQQISRQMLPQMQQIQQMQQ 455 Query: 701 XXXXQNLAFPQQPQLERMRRRQPSTPRAGMTLDKDRPMVEVKIENTSELPIDSNAFNPIH 522 +NL F QQ Q+ERMRRRQPSTPRAGM + ++RPMV+VKIE SELP+DSNAFN + Sbjct: 456 MAQSKNLPFQQQQQIERMRRRQPSTPRAGMDMVQERPMVQVKIEAPSELPMDSNAFNNFN 515 Query: 521 ARHPQIQFRQQSMAA----AMANLNPQSNHQFKQLASLQVPQLQTQNMGTTRAPPVKVEG 354 R+PQ+QFRQQ + A M N+ QS +QF+Q Q+ Q+Q+QN G RA PVKVEG Sbjct: 516 NRNPQMQFRQQQIPAMSNPTMQNVPAQSGNQFRQ---TQIAQIQSQNAGVLRARPVKVEG 572 Query: 353 FQELMGGDTTLKHDSEEHKLTSP 285 F ELMGGD + KHDS+E++LTSP Sbjct: 573 FSELMGGDASSKHDSDENRLTSP 595 >gb|EXC35477.1| hypothetical protein L484_026784 [Morus notabilis] Length = 647 Score = 635 bits (1637), Expect = e-179 Identities = 357/662 (53%), Positives = 434/662 (65%), Gaps = 49/662 (7%) Frame = -1 Query: 2120 MALLGEDGLGFELARKLETCGVWRSWLGDSNYSNFVHVLSSPLTWESFMKPQDSKTRAQI 1941 MALLG+DG GFELARKLETCGVWR WLGDS Y NF L+SP TWE+FM+ +K+RAQI Sbjct: 1 MALLGDDGRGFELARKLETCGVWRKWLGDSCYGNFAPYLNSPTTWEAFMRVDGTKSRAQI 60 Query: 1940 QLQLRVRALLFDKASVSLFLRXXXXXXXXXXXXXXXSN--------LNPNYLQLHGDDVY 1785 LQLRVRALLFDKASVSLFLR ++ LNPNYL LHGDDVY Sbjct: 61 HLQLRVRALLFDKASVSLFLRSNPSSSSSSSSSSSSASRSSVAISKLNPNYLNLHGDDVY 120 Query: 1784 FSLEDFGSSSQDGVQHQEGSTSSNSIQSKVVQIQPKLGFSVGSRYVEPEVDNISERPRPD 1605 F+LE+ SS D SSN+ SK IQ K F VGS Y E E+DN+ + R D Sbjct: 121 FTLEN---SSSD--------VSSNTASSK---IQSKASFGVGSGYGESEIDNVHQMFRND 166 Query: 1604 DFPETWYKQFIEKYSTSRQQRLPFGDREALKRTSEGMSTYLKLHERHKRSRQPFKEDQYT 1425 PETWY QFIE Y TSR RL GD+E KR+ E M Y+KL E+HK+ R +KEDQY Sbjct: 167 VLPETWYNQFIENYRTSRPYRLSLGDQEPDKRSPEEMCAYIKLLEKHKKRRVAYKEDQYM 226 Query: 1424 GFGNPIWENGSNIHSKSVSDGNNSIDDETCFFPEMMFPLNCVPDSALPSINRLEDNQKVE 1245 G+GNP+ EN S + S+SD NS DDE+ FFPE+MF LN VPDSAL NR+E+ +K+E Sbjct: 227 GYGNPVLENSSYMRPNSISDAINSDDDESTFFPEIMFTLNSVPDSALSVANRVEERRKIE 286 Query: 1244 CYGVLDSLPHVMTRSPAMLERFGIRPEYLRIGLERSKYRGKNGLEGNGKVLGPEQASQMS 1065 YGVLD LP VMT+SP M+ERFGI P +L + + + KNG N K LG EQA ++S Sbjct: 287 FYGVLDGLPRVMTKSPVMIERFGINP-FLGMEHGGNVHHVKNGSVVNKKCLGQEQALELS 345 Query: 1064 QKVISRVLVSVGFEGTTEVPMEVLSQFLGCHICKLGRILKVLTDNYRKQCSAIEILRMFL 885 QKVI+R+L S+GFEG+TEVP+EV SQ + CHI +LGRILKVL+D+YRKQC+A+E+L+MFL Sbjct: 346 QKVIARMLASIGFEGSTEVPVEVFSQLMSCHITELGRILKVLSDSYRKQCTAVELLKMFL 405 Query: 884 QTLGYSNLGNLAEHVKDGSRNFSHPTQQPHLRGLQSGFQSQ---------------HP-- 756 Q L + G+L EHVKDGSR S Q + G+QS SQ HP Sbjct: 406 QRL-KCDFGSLVEHVKDGSRT-SVQQSQSQVHGIQSQMMSQAQAALRLQQQMSRQMHPQM 463 Query: 755 ---------------------SSNLQTQQIPRXXXXXXXXXXXQNLAFPQQPQLERMRRR 639 LQ QQ + Q L QQ QLERMRRR Sbjct: 464 QQFVHSQNMAFQQQQQQHHQQQQQLQQQQQQQLQQQQLQQQQQQQLQQQQQQQLERMRRR 523 Query: 638 QPSTPRAGMTLDKDRPMVEVKIENTSELPIDSNAFNPIHARHPQIQFRQQSMAAA---MA 468 QPSTPR+GM +DKDRP+V+VKIE SELP+DSN+ N + R Q+ +RQQ A + M+ Sbjct: 524 QPSTPRSGMDVDKDRPLVQVKIEQPSELPMDSNSLNNFNNRISQMHYRQQMAAMSNYTMS 583 Query: 467 NLNPQSNHQFKQLASLQVPQLQTQNMGTTRAPPVKVEGFQELMGGDTTLKHDSEEHKLTS 288 N++ QSN+QF+Q+AS Q+PQ+Q+QNMG RAPPVKVEGFQELMGGD KHDSEE++LTS Sbjct: 584 NVHGQSNNQFRQMASGQIPQMQSQNMGVVRAPPVKVEGFQELMGGDAASKHDSEENRLTS 643 Query: 287 PS 282 PS Sbjct: 644 PS 645 >ref|XP_004143440.1| PREDICTED: uncharacterized protein LOC101223185 [Cucumis sativus] gi|449499810|ref|XP_004160923.1| PREDICTED: uncharacterized protein LOC101224095 [Cucumis sativus] Length = 612 Score = 627 bits (1616), Expect = e-177 Identities = 337/627 (53%), Positives = 439/627 (70%), Gaps = 14/627 (2%) Frame = -1 Query: 2120 MALLGEDGLGFELARKLETCGVWRSWLGDSNYSNFVHVLSSPLTWESFMKPQDSKTRAQI 1941 MALLG+DG G+ELARKL+T GVW++WLGD +YS FV L+S TW++FM+ DSK+RAQI Sbjct: 1 MALLGDDGRGYELARKLDTLGVWQTWLGDLSYSIFVPFLASTSTWDTFMRTDDSKSRAQI 60 Query: 1940 QLQLRVRALLFDKASVSLFLRXXXXXXXXXXXXXXXSN--------LNPNYLQLHGDDVY 1785 QLQLR RALLFDKASVSLFLR + L+PNYLQLHGDDVY Sbjct: 61 QLQLRARALLFDKASVSLFLRSTPSPSSPSYSTGNPLSSSSLAISKLSPNYLQLHGDDVY 120 Query: 1784 FSLEDFGSSSQDGVQHQEGSTSSNSIQSKVVQIQPKLGFSVGSRYVEPEVDNISERPRPD 1605 F+LE+ SS+DGVQ +EG SSN K IQPK + G R E ++ + S+R + + Sbjct: 121 FTLEN---SSKDGVQQREGHVSSNKASGK---IQPKAASTAGPRSRESDIGDSSQRLK-N 173 Query: 1604 DFPETWYKQFIEKYSTSRQQRLPFGDREALKRTSEGMSTYLKLHERHKRSRQPFKEDQYT 1425 + PETWY QFIEKY + RL G+ A KRTSE MS+YL+L E+HK+ R FK+D T Sbjct: 174 ELPETWYSQFIEKYRVKQPYRLSHGNNVAEKRTSEEMSSYLRLLEKHKKRRMVFKDDLLT 233 Query: 1424 GFGNPIWENGSNIHSKSVSDGNNSIDDETCFFPEMMFPLNCVPDSALPSINRLEDNQKVE 1245 FGN + N S+ SV D +NS++D+ FFPE+MF NCVP+SALP + ++DN++ E Sbjct: 234 NFGNSVSANASS----SVFDFSNSVEDDANFFPEIMFTFNCVPESALPPPDDMKDNRRPE 289 Query: 1244 CYGVLDSLPHVMTRSPAMLERFGIRPEYLRIGLERSKYRGKNGLEGNGKVLGPEQASQMS 1065 GV+D+LP +TR+ AM+ER G++P+Y+ + +R K+G GN K LG EQ+ QMS Sbjct: 290 VPGVIDTLPQPITRNSAMMERLGVKPDYVSTERGVNVHRAKSGSGGNRKSLGQEQSFQMS 349 Query: 1064 QKVISRVLVSVGFEGTTEVPMEVLSQFLGCHICKLGRILKVLTDNYRKQCSAIEILRMFL 885 QKV++R+L+S+GFEG TEVP+EV SQFL CHICKLG L+VL D+YRKQCSA+++LRMFL Sbjct: 350 QKVVARMLMSLGFEGATEVPLEVFSQFLSCHICKLGSTLRVLADSYRKQCSAVDLLRMFL 409 Query: 884 QTLGYSNLGNLAEHVKDGSRNFSHPTQQPHLRGLQSGFQSQHPSSNLQTQQIPRXXXXXX 705 +T+GYSN G LA+ VKDGSRN+ +Q G+Q Q+QH + QQ+PR Sbjct: 410 KTMGYSNFGPLADIVKDGSRNY---VRQSMHHGVQPQLQAQHQTLLQVPQQVPR-QMHPQ 465 Query: 704 XXXXXQNLAFPQQPQ------LERMRRRQPSTPRAGMTLDKDRPMVEVKIENTSELPIDS 543 + AF QQ Q LE+MRRRQ +TPRA M +KDRP+++VK+ENT ELP+D Sbjct: 466 MQQMVNSQAFQQQQQQQQQFVLEKMRRRQAATPRAVMEANKDRPLLQVKVENT-ELPMDG 524 Query: 542 NAFNPIHARHPQIQFRQQSMAAAMANLNPQSNHQFKQLASLQVPQLQTQNMGTTRAPPVK 363 NA N ++ RHPQ+QFRQQ + AAM+N++ +QF+Q+ S+Q+PQ+QT N RAPPVK Sbjct: 525 NALNALNIRHPQLQFRQQQI-AAMSNIHASPGNQFRQIPSMQMPQIQTPNTNVVRAPPVK 583 Query: 362 VEGFQELMGGDTTLKHDSEEHKLTSPS 282 VEGFQELMGGDT+ KHDSEE +LTSPS Sbjct: 584 VEGFQELMGGDTSSKHDSEEARLTSPS 610 >ref|XP_002310863.2| hypothetical protein POPTR_0007s14190g [Populus trichocarpa] gi|550334854|gb|EEE91313.2| hypothetical protein POPTR_0007s14190g [Populus trichocarpa] Length = 577 Score = 607 bits (1566), Expect = e-171 Identities = 341/615 (55%), Positives = 420/615 (68%), Gaps = 2/615 (0%) Frame = -1 Query: 2120 MALLGEDGLGFELARKLETCGVWRSWLGDSNYSNFVHVLSSPLTWESFMKPQDSKTRAQI 1941 M++LG+DGLG++LARKLET G+WR+WLGDS YSNF+H LSSP +W+SFM+ DSK+++ Sbjct: 1 MSVLGDDGLGYDLARKLETLGMWRAWLGDSLYSNFLHSLSSPASWQSFMRTDDSKSKSHF 60 Query: 1940 QLQLRVRALLFDKASVSLFLRXXXXXXXXXXXXXXXSNLNPNYLQLHGDDVYFSLEDFGS 1761 QLQLR RALLFDKASVSLFLR NLNPNYLQLHGDDVYF+LED Sbjct: 61 QLQLRARALLFDKASVSLFLRSNTVAAVS--------NLNPNYLQLHGDDVYFTLEDEDQ 112 Query: 1760 SSQDGVQHQEGSTSSNSIQSKVVQIQPKLGFSVGSRYVEPEVDNISERPRPDDFPETWYK 1581 + G G+T+ ++ +L F V S +V + +R + ++ PETWY Sbjct: 113 RREGG---GVGATT---------KVCSRLSFRV-SNFV---LYICCQRYKNEELPETWYT 156 Query: 1580 QFIEKYSTSRQQRLPFGDREALKRTSEGMSTYLKLHERHKRSRQPFKEDQYTGFGNPIWE 1401 QF+EK R RL FGDRE+ KR+ E MSTY +L RHKR QY G GN E Sbjct: 157 QFMEKRKLKRPYRLSFGDRESDKRSPEQMSTYFRLVARHKR------RCQYLGSGNSNLE 210 Query: 1400 NGSNIHSKSVSDGNNSIDDETCFFPEMMFPLNCVPDSALPSINRLEDNQKVECYGVLDSL 1221 + SN+ S SV DG++S+DD+ FFPE MF NCVPDSA+P I R DNQK+E G DSL Sbjct: 211 STSNMRSGSVLDGSHSVDDDFVFFPETMFMFNCVPDSAIPPIIRARDNQKIEFRGAFDSL 270 Query: 1220 PHVMTRSPAMLERFGIRPEYLRIGLERSKYRGKNGLEGNGKVLGPEQASQMSQKVISRVL 1041 P TR+P M+ER GI E S RGKNG EG+ K L EQA QMSQKV++ +L Sbjct: 271 PQ--TRNPVMIERLGISVEQ-----GGSLNRGKNGSEGHKK-LSEEQALQMSQKVVACLL 322 Query: 1040 VSVGFEGTTEVPMEVLSQFLGCHICKLGRILKVLTDNYRKQCSAIEILRMFLQTLGYSNL 861 VGF+G +E+PMEV SQ L CHI KLGRIL+VL D+YRKQCSA+E+L+MFLQT G+SNL Sbjct: 323 TRVGFDGASEIPMEVFSQLLRCHISKLGRILRVLADSYRKQCSAVELLKMFLQTAGFSNL 382 Query: 860 GNLAEHVKDGSRNFSHPTQQPHLRGLQSGFQSQHPSSNLQTQQIPR-XXXXXXXXXXXQN 684 +L + VK+G+RN + PT Q G+QS F SQH + QQIPR QN Sbjct: 383 VHLMKIVKEGARNTAEPTHQ-QAHGIQSQFHSQHQNLLRLPQQIPRQMHPQMQPMVHSQN 441 Query: 683 LAFPQQPQ-LERMRRRQPSTPRAGMTLDKDRPMVEVKIENTSELPIDSNAFNPIHARHPQ 507 L F QQ Q ER+RRR STPR GM +DKD+P+V+VK+EN ELP+D+NA N H+R PQ Sbjct: 442 LTFQQQQQHFERLRRRHTSTPRPGMDVDKDKPLVQVKVENPPELPLDNNAVNAFHSRQPQ 501 Query: 506 IQFRQQSMAAAMANLNPQSNHQFKQLASLQVPQLQTQNMGTTRAPPVKVEGFQELMGGDT 327 +Q R Q + AAM+NL+ Q N+Q +QLASLQVPQ+QT NMG RAPPVKVEGFQELMGGD Sbjct: 502 MQMRHQQI-AAMSNLHAQPNNQLRQLASLQVPQMQTSNMGMVRAPPVKVEGFQELMGGDA 560 Query: 326 TLKHDSEEHKLTSPS 282 LKHD+EE+KLTSPS Sbjct: 561 ALKHDTEENKLTSPS 575 >ref|XP_006380803.1| hypothetical protein POPTR_0007s14190g [Populus trichocarpa] gi|550334853|gb|ERP58600.1| hypothetical protein POPTR_0007s14190g [Populus trichocarpa] Length = 558 Score = 603 bits (1554), Expect = e-169 Identities = 337/615 (54%), Positives = 412/615 (66%), Gaps = 2/615 (0%) Frame = -1 Query: 2120 MALLGEDGLGFELARKLETCGVWRSWLGDSNYSNFVHVLSSPLTWESFMKPQDSKTRAQI 1941 M++LG+DGLG++LARKLET G+WR+WLGDS YSNF+H LSSP +W+SFM+ DSK+++ Sbjct: 1 MSVLGDDGLGYDLARKLETLGMWRAWLGDSLYSNFLHSLSSPASWQSFMRTDDSKSKSHF 60 Query: 1940 QLQLRVRALLFDKASVSLFLRXXXXXXXXXXXXXXXSNLNPNYLQLHGDDVYFSLEDFGS 1761 QLQLR RALLFDKASVSLFLR NLNPNYLQLHGDDVYF+LED Sbjct: 61 QLQLRARALLFDKASVSLFLRSNTVAAVS--------NLNPNYLQLHGDDVYFTLED--- 109 Query: 1760 SSQDGVQHQEGSTSSNSIQSKVVQIQPKLGFSVGSRYVEPEVDNISERPRPDDFPETWYK 1581 + Q + G VG+ ++R + ++ PETWY Sbjct: 110 -----------------------EDQRREGGGVGAT---------TKRYKNEELPETWYT 137 Query: 1580 QFIEKYSTSRQQRLPFGDREALKRTSEGMSTYLKLHERHKRSRQPFKEDQYTGFGNPIWE 1401 QF+EK R RL FGDRE+ KR+ E MSTY +L RHKR QY G GN E Sbjct: 138 QFMEKRKLKRPYRLSFGDRESDKRSPEQMSTYFRLVARHKR------RCQYLGSGNSNLE 191 Query: 1400 NGSNIHSKSVSDGNNSIDDETCFFPEMMFPLNCVPDSALPSINRLEDNQKVECYGVLDSL 1221 + SN+ S SV DG++S+DD+ FFPE MF NCVPDSA+P I R DNQK+E G DSL Sbjct: 192 STSNMRSGSVLDGSHSVDDDFVFFPETMFMFNCVPDSAIPPIIRARDNQKIEFRGAFDSL 251 Query: 1220 PHVMTRSPAMLERFGIRPEYLRIGLERSKYRGKNGLEGNGKVLGPEQASQMSQKVISRVL 1041 P TR+P M+ER GI E S RGKNG EG+ K L EQA QMSQKV++ +L Sbjct: 252 PQ--TRNPVMIERLGISVEQ-----GGSLNRGKNGSEGHKK-LSEEQALQMSQKVVACLL 303 Query: 1040 VSVGFEGTTEVPMEVLSQFLGCHICKLGRILKVLTDNYRKQCSAIEILRMFLQTLGYSNL 861 VGF+G +E+PMEV SQ L CHI KLGRIL+VL D+YRKQCSA+E+L+MFLQT G+SNL Sbjct: 304 TRVGFDGASEIPMEVFSQLLRCHISKLGRILRVLADSYRKQCSAVELLKMFLQTAGFSNL 363 Query: 860 GNLAEHVKDGSRNFSHPTQQPHLRGLQSGFQSQHPSSNLQTQQIPR-XXXXXXXXXXXQN 684 +L + VK+G+RN + PT Q G+QS F SQH + QQIPR QN Sbjct: 364 VHLMKIVKEGARNTAEPTHQ-QAHGIQSQFHSQHQNLLRLPQQIPRQMHPQMQPMVHSQN 422 Query: 683 LAFPQQPQ-LERMRRRQPSTPRAGMTLDKDRPMVEVKIENTSELPIDSNAFNPIHARHPQ 507 L F QQ Q ER+RRR STPR GM +DKD+P+V+VK+EN ELP+D+NA N H+R PQ Sbjct: 423 LTFQQQQQHFERLRRRHTSTPRPGMDVDKDKPLVQVKVENPPELPLDNNAVNAFHSRQPQ 482 Query: 506 IQFRQQSMAAAMANLNPQSNHQFKQLASLQVPQLQTQNMGTTRAPPVKVEGFQELMGGDT 327 +Q R Q + AAM+NL+ Q N+Q +QLASLQVPQ+QT NMG RAPPVKVEGFQELMGGD Sbjct: 483 MQMRHQQI-AAMSNLHAQPNNQLRQLASLQVPQMQTSNMGMVRAPPVKVEGFQELMGGDA 541 Query: 326 TLKHDSEEHKLTSPS 282 LKHD+EE+KLTSPS Sbjct: 542 ALKHDTEENKLTSPS 556 >ref|XP_006466330.1| PREDICTED: uncharacterized protein LOC102616625 isoform X2 [Citrus sinensis] Length = 610 Score = 579 bits (1492), Expect = e-162 Identities = 335/659 (50%), Positives = 407/659 (61%), Gaps = 47/659 (7%) Frame = -1 Query: 2120 MALLGEDGLGFELARKLETCGVWRSWLGDSNYSNFVHVLSSPLTWESFMKPQDSKTRAQI 1941 MALLG+DG G+ELA KLE+CGVWR+WLGDS YS F H LS+P +WESFM+ DSK+RAQI Sbjct: 1 MALLGDDGRGYELALKLESCGVWRTWLGDSCYSTFHHALSTPASWESFMRTDDSKSRAQI 60 Query: 1940 QLQLRVRALLFDKASVSLFLRXXXXXXXXXXXXXXXSNLNPNYLQLHGDDVYFSLEDFGS 1761 LQLR RALLFDKA++SLFL S LNPNYLQL G DVYF+LE S Sbjct: 61 HLQLRARALLFDKATISLFL-----PSNQPPSSVAVSKLNPNYLQLDGGDVYFTLE---S 112 Query: 1760 SSQDGVQHQEGSTSSNSIQSKVVQIQPKLGFSVGSRYVEPEVDNISERPRPDDFPETWYK 1581 SSQDGVQH+E S +S++ K R R ++ PETWY Sbjct: 113 SSQDGVQHRESSAASSTTSGK--------------------------RFRNEELPETWYD 146 Query: 1580 QFIEKYSTSRQQRLPFGDREALKRTSEGMSTYLKLHERHKRSRQPFKEDQYTGFGNPIWE 1401 QFIEKY SRQ +L GDRE +RT+EGMS+YL+ E++KR R PF+ D Sbjct: 147 QFIEKYRVSRQYKLSLGDRELDRRTAEGMSSYLRHLEKYKRRRVPFQND----------- 195 Query: 1400 NGSNIHSKSVSDGNNSIDDETCFFPEMMFPLNCVPDSALPSINRLEDNQKVECYGVLDSL 1221 HS S D NS D + FFPE MF LN VP+ A+P I E Q +E GVLD+L Sbjct: 196 -----HSNSALDVINSTDSDV-FFPETMFTLNSVPEIAVPQIIVEETKQNIEFNGVLDTL 249 Query: 1220 PHVMTRSPAMLERFGIRPEYLRIGLERSKYRGKNGLEGNGKVLGPEQASQMSQKVISRVL 1041 P MT+SP M+ER GIRPEYL + E + + G + LEGN K EQASQ+SQKVI+R+L Sbjct: 250 PQCMTKSPVMIERLGIRPEYLGMEQEGNSHHGNSALEGNKKCFSEEQASQISQKVIARML 309 Query: 1040 VSVGFEGTTEVPMEVLSQFLGCHICKLGRILKVLTDNYRKQCSAIEILRMFLQTLGYSNL 861 GFEG TEVP+EVLS+ LG HICKLGRILKVL+DNYRKQCSA+E+L+MFLQ G+SNL Sbjct: 310 TGGGFEGATEVPLEVLSEMLGSHICKLGRILKVLSDNYRKQCSALELLKMFLQAAGHSNL 369 Query: 860 GNLAEHVKDG---------------SRNFSHPTQQ------------------------- 801 G LAE +KDG SRN +Q+ Sbjct: 370 GILAELIKDGTRNVVQQSQELIKDGSRNIVQQSQELIKDGTRNIVQQNQELVKEGTRNFV 429 Query: 800 ----PHLRGLQSGFQSQHPSSNLQTQQIPR-XXXXXXXXXXXQNLAFP--QQPQLERMRR 642 + G QS QS S QQ+PR QNLAF QQ LER R Sbjct: 430 QQSPQQVHGAQSQLQSHQQSPVKLPQQVPRQMHQQMQQMVQPQNLAFQQMQQQHLERSRM 489 Query: 641 RQPSTPRAGMTLDKDRPMVEVKIENTSELPIDSNAFNPIHARHPQIQFRQQSMAAAMANL 462 RQPSTPR GM +DKDR M +V EN+S+LP+D+NA N +A+ Q+QF QQ + M+NL Sbjct: 490 RQPSTPRPGMDMDKDRSMSQVNAENSSKLPMDANALNASNAKQSQMQFHQQQL-NTMSNL 548 Query: 461 NPQSNHQFKQLASLQVPQLQTQNMGTTRAPPVKVEGFQELMGGDTTLKHDSEEHKLTSP 285 QS++QFKQ +Q+PQ+ + NMG RAPPVKV+GFQELMGGD ++KHDSEE+KLTSP Sbjct: 549 QAQSSNQFKQSTPVQIPQMHSPNMGVVRAPPVKVDGFQELMGGDASMKHDSEENKLTSP 607 >ref|XP_006466329.1| PREDICTED: uncharacterized protein LOC102616625 isoform X1 [Citrus sinensis] Length = 612 Score = 579 bits (1492), Expect = e-162 Identities = 335/661 (50%), Positives = 408/661 (61%), Gaps = 49/661 (7%) Frame = -1 Query: 2120 MALLGEDGLGFELARKLETCGVWRSWLGDSNYSNFVHVLSSPLTWESFMKPQDSKTRAQI 1941 MALLG+DG G+ELA KLE+CGVWR+WLGDS YS F H LS+P +WESFM+ DSK+RAQI Sbjct: 1 MALLGDDGRGYELALKLESCGVWRTWLGDSCYSTFHHALSTPASWESFMRTDDSKSRAQI 60 Query: 1940 QLQLRVRALLFDKASVSLFLRXXXXXXXXXXXXXXXSNLNPNYLQLHGDDVYFSLEDFGS 1761 LQLR RALLFDKA++SLFL S LNPNYLQL G DVYF+LE S Sbjct: 61 HLQLRARALLFDKATISLFL-----PSNQPPSSVAVSKLNPNYLQLDGGDVYFTLE---S 112 Query: 1760 SSQDGVQHQEGSTSSNSIQSKVVQIQPKLGFSVGSRYVEPEVDNISERPRPDDFPETWYK 1581 SSQDGVQH+E S +S++ K R R ++ PETWY Sbjct: 113 SSQDGVQHRESSAASSTTSGK--------------------------RFRNEELPETWYD 146 Query: 1580 QFIEKYSTSRQQRLPFGDREALKRTSEGMSTYLKLHERHKRSRQPFKEDQYTGFGNPIWE 1401 QFIEKY SRQ +L GDRE +RT+EGMS+YL+ E++KR R PF+ D Sbjct: 147 QFIEKYRVSRQYKLSLGDRELDRRTAEGMSSYLRHLEKYKRRRVPFQND----------- 195 Query: 1400 NGSNIHSKSVSDGNNSIDDETCFFPEMMFPLNCVPDSALPSINRLEDNQKVECYGVLDSL 1221 HS S D NS D + FFPE MF LN VP+ A+P I E Q +E GVLD+L Sbjct: 196 -----HSNSALDVINSTDSDV-FFPETMFTLNSVPEIAVPQIIVEETKQNIEFNGVLDTL 249 Query: 1220 PHVMTRSPAMLERFGIRPEYLRIGLERSKYRGKNGLEGNGKVLGPEQASQMSQKVISRVL 1041 P MT+SP M+ER GIRPEYL + E + + G + LEGN K EQASQ+SQKVI+R+L Sbjct: 250 PQCMTKSPVMIERLGIRPEYLGMEQEGNSHHGNSALEGNKKCFSEEQASQISQKVIARML 309 Query: 1040 VSVGFEGTTEVPMEVLSQFLGCHICKLGRILKVLTDNYRKQCSAIEILRMFLQTLGYSNL 861 GFEG TEVP+EVLS+ LG HICKLGRILKVL+DNYRKQCSA+E+L+MFLQ G+SNL Sbjct: 310 TGGGFEGATEVPLEVLSEMLGSHICKLGRILKVLSDNYRKQCSALELLKMFLQAAGHSNL 369 Query: 860 GNLAEHVKDGSRNFSHPTQQ---------------------------------------- 801 G LAE +KDG+RN +Q+ Sbjct: 370 GILAELIKDGTRNVVQQSQELIKDGSRNIVQQSQELIKDGTRNIVQQNQELVKEGTRNFV 429 Query: 800 ----PHLRGLQSGFQS--QHPSSNLQTQQIPR-XXXXXXXXXXXQNLAFP--QQPQLERM 648 + G QS QS Q P Q Q+PR QNLAF QQ LER Sbjct: 430 QQSPQQVHGAQSQLQSHQQSPVKLPQQLQVPRQMHQQMQQMVQPQNLAFQQMQQQHLERS 489 Query: 647 RRRQPSTPRAGMTLDKDRPMVEVKIENTSELPIDSNAFNPIHARHPQIQFRQQSMAAAMA 468 R RQPSTPR GM +DKDR M +V EN+S+LP+D+NA N +A+ Q+QF QQ + M+ Sbjct: 490 RMRQPSTPRPGMDMDKDRSMSQVNAENSSKLPMDANALNASNAKQSQMQFHQQQL-NTMS 548 Query: 467 NLNPQSNHQFKQLASLQVPQLQTQNMGTTRAPPVKVEGFQELMGGDTTLKHDSEEHKLTS 288 NL QS++QFKQ +Q+PQ+ + NMG RAPPVKV+GFQELMGGD ++KHDSEE+KLTS Sbjct: 549 NLQAQSSNQFKQSTPVQIPQMHSPNMGVVRAPPVKVDGFQELMGGDASMKHDSEENKLTS 608 Query: 287 P 285 P Sbjct: 609 P 609 >ref|NP_201357.2| uncharacterized protein [Arabidopsis thaliana] gi|26451238|dbj|BAC42721.1| unknown protein [Arabidopsis thaliana] gi|28973345|gb|AAO63997.1| unknown protein [Arabidopsis thaliana] gi|332010686|gb|AED98069.1| uncharacterized protein AT5G65540 [Arabidopsis thaliana] Length = 605 Score = 554 bits (1427), Expect = e-155 Identities = 312/629 (49%), Positives = 416/629 (66%), Gaps = 15/629 (2%) Frame = -1 Query: 2120 MALLGEDGLGFELARKLETCGVWRSWLGDSNYSNFVHVLSSPLTWESFMKPQDSKTRAQI 1941 MALLG+DG GF+LARKLE GVWR+WLGDS YS+F H LSSP TWE+FM+ +SK+RAQI Sbjct: 1 MALLGDDGRGFDLARKLEVSGVWRTWLGDSIYSSFHHYLSSPSTWEAFMRVDESKSRAQI 60 Query: 1940 QLQLRVRALLFDKASVSLFLR-------XXXXXXXXXXXXXXXSNLNPNYLQLHGDDVYF 1782 QLQLRVRALLFDKA+VSLFLR S LNPNYLQLHGDDVY+ Sbjct: 61 QLQLRVRALLFDKATVSLFLRSNTIAASSSSSASISDVSSVAVSKLNPNYLQLHGDDVYY 120 Query: 1781 SLEDFGSSSQDGVQHQEGSTSSNSIQSKVVQIQPKLGFSVGSRYVEPEVDNISERPRPDD 1602 +LE+ +S + G Q + G + S+ + K F+ G+R E + N+S+R R ++ Sbjct: 121 TLEN--ASLESGFQREGGIRHNPSLTKSL----SKPSFTSGTRGSESDFSNLSQRSRFEE 174 Query: 1601 FPETWYKQFIEKYSTSRQQRLPFGDREALKRTSEGMSTYLKLHERHKRSRQPFKEDQYTG 1422 P+TWY QFI +Y + + G +E+ KRT EGMSTYL++ + HKR R PF ED+ Sbjct: 175 LPDTWYTQFISRYGF--KYGMSVGGQESDKRTPEGMSTYLRVVDTHKRKRAPFLEDRSLA 232 Query: 1421 FGNPIWENGSNIHSKSVSDGNNSIDDETCFFPEMMFPLNCVPDSALPSINRLEDNQKVEC 1242 + S+ H S DG+ S +D+ F PE MF +NCVP++AL I R +DN K E Sbjct: 233 H-----MSRSSTHPSSGFDGSTS-EDDILFLPETMFRMNCVPETALSPITRTQDNLKTEF 286 Query: 1241 YGVLDSLPHVMTRSPAMLERFGIRPEYLRI---GLERSKYRGKNGLEGNGKVLGPEQASQ 1071 YGVLD+LP V TRS M+ER G+ PEY R+ G+ RS+ K G +QA+ Sbjct: 287 YGVLDTLPQVTTRSHIMIERLGLMPEYHRMEERGVLRSRKAEKMG-------FSDDQAAL 339 Query: 1070 MSQKVISRVLVSVGFEGTTEVPMEVLSQFLGCHICKLGRILKVLTDNYRKQCSAIEILRM 891 +S+KV++R+L+++GFEG TEVP++V SQ + H+ KLGRILK+LTD+Y+K+CSA+++++M Sbjct: 340 VSRKVVARMLLTMGFEGATEVPIDVFSQLVSRHMSKLGRILKLLTDSYKKECSAMQLIKM 399 Query: 890 FLQTLGYSNLGNLAEHVKDGSRNFSHPTQ-QPHLRGLQSGFQSQHPSSNLQTQQIPRXXX 714 FL T GYSNLG+LAE VKDG+RN P Q QP + LQ Q +S QQI R Sbjct: 400 FLNTTGYSNLGSLAEIVKDGTRNHPPPNQKQPQV--LQQQLHLQQQASLRLPQQIQRQMH 457 Query: 713 XXXXXXXXQNLAFPQQPQLERMRRRQPSTPRAGMTLDKDRPMVEVKIENTSELPIDSNAF 534 F QQ QLERMRRR ++PR M ++KDRP+V+VK+EN SE+ +D NAF Sbjct: 458 PQMQQMVNPQ-NFQQQQQLERMRRRPVTSPRPNMDMEKDRPLVQVKLENPSEMAVDGNAF 516 Query: 533 NPIHARHP---QIQFRQQSMAAAMANLNPQSNH-QFKQLASLQVPQLQTQNMGTTRAPPV 366 NP++ RH Q Q RQQ AAM+N+ Q + QF+QLAS+Q+PQ+QT +GT RA PV Sbjct: 517 NPMNPRHQQQLQQQLRQQQQIAAMSNMQQQPGYNQFRQLASMQIPQMQTPTLGTVRAQPV 576 Query: 365 KVEGFQELMGGDTTLKHDSEEHKLTSPSK 279 KVEGF++LMGGD++LKHDS++ + P+K Sbjct: 577 KVEGFEQLMGGDSSLKHDSDDKLRSPPTK 605 >ref|XP_006426252.1| hypothetical protein CICLE_v10025202mg [Citrus clementina] gi|557528242|gb|ESR39492.1| hypothetical protein CICLE_v10025202mg [Citrus clementina] Length = 604 Score = 553 bits (1426), Expect = e-155 Identities = 317/653 (48%), Positives = 398/653 (60%), Gaps = 41/653 (6%) Frame = -1 Query: 2120 MALLGEDGLGFELARKLETCGVWRSWLGDSNYSNFVHVLSSPLTWESFMKPQDSKTRAQI 1941 MALLG+DG G++LA KLE+CGVWR+WLGDS YS F H LS+P +WESFM+ DSK+RAQI Sbjct: 1 MALLGDDGRGYQLALKLESCGVWRTWLGDSCYSTFHHALSTPASWESFMRTDDSKSRAQI 60 Query: 1940 QLQLRVRALLFDKASVSLFLRXXXXXXXXXXXXXXXSNLNPNYLQLHGDDVYFSLEDFGS 1761 LQLR RALLFDKA++SLFL S LNPNYLQL G DVYF+LE S Sbjct: 61 HLQLRARALLFDKATISLFL-----PSNQPPSSVAVSKLNPNYLQLDGGDVYFTLE---S 112 Query: 1760 SSQDGVQHQEGSTSSNSIQSKVVQIQPKLGFSVGSRYVEPEVDNISERPRPDDFPETWYK 1581 SSQDGVQH+E S +S++ K R R ++ PETWY Sbjct: 113 SSQDGVQHRESSAASSTTSGK--------------------------RFRNEELPETWYD 146 Query: 1580 QFIEKYSTSRQQRLPFGDREALKRTSEGMSTYLKLHERHKRSRQPFKEDQYTGFGNPIWE 1401 QFIEKY SRQ +L GDRE +RT+EGMS+YL+ E++K R PF+ D Sbjct: 147 QFIEKYRVSRQYKLSLGDRELDRRTAEGMSSYLRHLEKYKIRRVPFQND----------- 195 Query: 1400 NGSNIHSKSVSDGNNSIDDETCFFPEMMFPLNCVPDSALPSINRLEDNQKVECYGVLDSL 1221 HS S D NS D + FFPE MF LN VP+ A+P I E Q +E GVLD+L Sbjct: 196 -----HSNSALDVINSTDSDV-FFPETMFTLNSVPEIAVPQIIVEETKQNIEFNGVLDTL 249 Query: 1220 PHVMTRSPAMLERFGIRPEYLRIGLERSKYRGKNGLEGNGKVLGPEQASQMSQKVISRVL 1041 P MT+SP M+ER GIRPEYL + E + + G + LEGN K EQASQ+SQKVI+R+L Sbjct: 250 PQCMTKSPVMIERLGIRPEYLGMEQEGNSHHGNSALEGNKKCFSEEQASQISQKVIARML 309 Query: 1040 VSVGFEGTTEVPMEVLSQFLGCHICKLGRILKVLTDNYRKQCSAIEILRMFLQTLGYSNL 861 GFEG TEVP+EVLS+ LG HICKLGRILKVL+DNYRKQCSA+E+L+MFLQ G+SN Sbjct: 310 TGGGFEGATEVPLEVLSEMLGSHICKLGRILKVLSDNYRKQCSALELLKMFLQAAGHSNF 369 Query: 860 GNLAEHVKDGSRNFSHPTQQPHLRG----------------------------------L 783 G LAE +KDG+RN +Q+ G + Sbjct: 370 GILAELIKDGNRNAVQQSQELIKDGSRNIVQQSQELIKDGARNVVQQSQELIKDGTRNIV 429 Query: 782 QSGFQSQHPSSNLQTQQIPRXXXXXXXXXXXQNLAFPQQPQLERMRR-------RQPSTP 624 Q + + QQ P+ + + PQ + ++ RQPSTP Sbjct: 430 QQNQELVKEGTRNFVQQSPQQVHGAQSQLQSHQQSPVKLPQQQMQQQHLERSRMRQPSTP 489 Query: 623 RAGMTLDKDRPMVEVKIENTSELPIDSNAFNPIHARHPQIQFRQQSMAAAMANLNPQSNH 444 R GM +DKDR M +V EN+S+LP+D+NA N +A+ Q+QF QQ + M+NL QS++ Sbjct: 490 RPGMDMDKDRSMSQVNAENSSKLPMDANALNASNAKQSQMQFHQQQL-NTMSNLQAQSSN 548 Query: 443 QFKQLASLQVPQLQTQNMGTTRAPPVKVEGFQELMGGDTTLKHDSEEHKLTSP 285 QFKQ +Q+PQ+ + NMG RAPPVKV+GFQELMGGD ++KHDSEE+KLTSP Sbjct: 549 QFKQSTPVQIPQMHSPNMGVVRAPPVKVDGFQELMGGDASMKHDSEENKLTSP 601 >ref|XP_006394014.1| hypothetical protein EUTSA_v10003865mg [Eutrema salsugineum] gi|557090653|gb|ESQ31300.1| hypothetical protein EUTSA_v10003865mg [Eutrema salsugineum] Length = 598 Score = 548 bits (1412), Expect = e-153 Identities = 310/627 (49%), Positives = 416/627 (66%), Gaps = 13/627 (2%) Frame = -1 Query: 2120 MALLGEDGLGFELARKLETCGVWRSWLGDSNYSNFVHVLSSPLTWESFMKPQDSKTRAQI 1941 MALLG+DG GF+LAR+LE GVWR+WLGDS Y +F H LSSP +WESFM+ DSK+RAQI Sbjct: 1 MALLGDDGRGFDLARRLEVSGVWRTWLGDSTYLSFHHYLSSPSSWESFMRVDDSKSRAQI 60 Query: 1940 QLQLRVRALLFDKASVSLFLR--XXXXXXXXXXXXXXXSNLNPNYLQLHGDDVYFSLEDF 1767 QLQLRVRALLFDKA+VSLFLR S LNPNYLQLHGDDVY++LE Sbjct: 61 QLQLRVRALLFDKATVSLFLRSNTIPASPSSDASSVAVSKLNPNYLQLHGDDVYYTLE-- 118 Query: 1766 GSSSQDGVQHQEGSTSSNSIQSKVVQIQPKLGFSVGSRYVEPEVDNISERPRPDDFPETW 1587 ++S +G ++G+ N K + K F+ G+R E + N+S+R R ++ P+TW Sbjct: 119 -NASLEGGFQRDGAIRHNPSLPKSLS---KPSFASGARGSESDFSNLSQRSRFEELPDTW 174 Query: 1586 YKQFIEKYSTSRQQRLPFGDREALKRTSEGMSTYLKLHERHKRSRQPFKEDQYTGFGNPI 1407 Y QFI +Y + + G +E+ KRT EGMSTYL++ + HKR R PF +D + Sbjct: 175 YTQFISRYGF--KYGMSVGGQESDKRTPEGMSTYLRVVDSHKRKRAPFLQDPSP--ASSA 230 Query: 1406 WENGSNIHSKSVSDGNNSIDDETCFFPEMMFPLNCVPDSALPSINRLEDNQKVECYGVLD 1227 + S+ H S DG+ S +D+ F PE MF +NCVP++AL + R DN K E YGVLD Sbjct: 231 HMSRSSTHPSSGFDGSTS-EDDILFLPETMFRMNCVPETALSPVARTHDNLKTEFYGVLD 289 Query: 1226 SLPHVMTRSPAMLERFGIRPEYLRI---GLERSKYRGKNGLEGNGKVLGPEQASQMSQKV 1056 +LP V TR+ M+ER G+ PEY R+ G+ R K K G EQA+Q+S+KV Sbjct: 290 TLPQVTTRNHVMIERLGMVPEYFRMEERGVLRRKKAEKLG-------FSDEQAAQVSRKV 342 Query: 1055 ISRVLVSVGFEGTTEVPMEVLSQFLGCHICKLGRILKVLTDNYRKQCSAIEILRMFLQTL 876 ++R+L+++G EG TEVP++V SQ + HICKLGRILK+LTD+Y+K+CSAI++++MFL T Sbjct: 343 VARILLTMGCEGATEVPIDVFSQLVSRHICKLGRILKLLTDSYKKECSAIQLIKMFLNTT 402 Query: 875 GYSNLGNLAEHVKDGSRNFSHPTQ---QPHLRGLQSGFQSQHP---SSNLQTQQIPRXXX 714 GYSNLG+LAE VKDG+RN HP Q QP + Q Q Q+P +Q Q P+ Sbjct: 403 GYSNLGDLAELVKDGTRN--HPPQNQKQPQVLQQQLHLQQQNPLRLPQQMQRQMHPQMQQ 460 Query: 713 XXXXXXXXQNLAFPQQPQLERMRRRQPSTPRAGMTLDKDRPMVEVKIENTSELPIDSNAF 534 F QQ Q+ERMRRRQ ++PR + ++KDRP+V+VK+EN SE+ +D NAF Sbjct: 461 MVNPH------TFQQQQQMERMRRRQVTSPRPNIDMEKDRPLVQVKLENPSEMAVDGNAF 514 Query: 533 NPIHARHPQIQFRQQSMAAAMANLNPQSNH-QFKQLASLQVPQLQTQN-MGTTRAPPVKV 360 NP++ RH QI +Q AAM+NL Q + QF+QLAS+Q+PQ+QT N GT RA PVKV Sbjct: 515 NPMNPRHQQI---RQQQIAAMSNLQQQPGYNQFRQLASMQIPQMQTPNTTGTVRAQPVKV 571 Query: 359 EGFQELMGGDTTLKHDSEEHKLTSPSK 279 EGF++LMGGD++LKH+S++ + P+K Sbjct: 572 EGFEQLMGGDSSLKHESDDKLRSPPTK 598 >ref|XP_006280200.1| hypothetical protein CARUB_v10026105mg [Capsella rubella] gi|482548904|gb|EOA13098.1| hypothetical protein CARUB_v10026105mg [Capsella rubella] Length = 606 Score = 545 bits (1404), Expect = e-152 Identities = 306/627 (48%), Positives = 416/627 (66%), Gaps = 13/627 (2%) Frame = -1 Query: 2120 MALLGEDGLGFELARKLETCGVWRSWLGDSNYSNFVHVLSSPLTWESFMKPQDSKTRAQI 1941 MALLG+DG GF+LAR+LE GVWR+WLGDS YS+F H LSSP TWE+FM+ +SK R+QI Sbjct: 1 MALLGDDGRGFDLARRLEVSGVWRTWLGDSIYSSFHHYLSSPSTWEAFMRVDESKPRSQI 60 Query: 1940 QLQLRVRALLFDKASVSLFLRXXXXXXXXXXXXXXXSN------LNPNYLQLHGDDVYFS 1779 QLQLRVRALLFDKA+VSLFLR + LNPNYLQLHGDDVY++ Sbjct: 61 QLQLRVRALLFDKATVSLFLRSNSIAASSSSTSVSDVSSVAVSKLNPNYLQLHGDDVYYT 120 Query: 1778 LEDFGSSSQDGVQHQEGSTSSNSIQSKVVQIQPKLGFSVGSRYVEPEVDNISERPRPDDF 1599 LE+ +S + G Q G + S+ + K F+ G+R E + N+S+R R ++ Sbjct: 121 LEN--ASLEGGFQRDGGIRLNPSLTKSL----SKPSFTSGTRGSESDFSNLSQRSRFEEL 174 Query: 1598 PETWYKQFIEKYSTSRQQRLPFGDREALKRTSEGMSTYLKLHERHKRSRQPFKEDQYTGF 1419 P+TWY QFI +Y + + G +E+ KRT EGMSTYL++ + HKR R PF ED+ + Sbjct: 175 PDTWYTQFISRYGF--KYGMSVGGQESDKRTPEGMSTYLRVVDTHKRKRAPFLEDRNS-- 230 Query: 1418 GNPIWENGSNIHSKSVSDGNNSIDDETCFFPEMMFPLNCVPDSALPSINRLEDNQKVECY 1239 G+ + S+ H S DG++S +D+ F PE MF +NCVP++ALP I R +DN K E Y Sbjct: 231 GSSAHMSRSSTHPSSGFDGSSS-EDDILFLPETMFRMNCVPETALPPITRTQDNLKTEFY 289 Query: 1238 GVLDSLPHVMTRSPAMLERFGIRPEYLRI---GLERSKYRGKNGLEGNGKVLGPEQASQM 1068 GVLD+LP V TRS M+ER G+ PEY R+ G+ R + K G +QA+Q+ Sbjct: 290 GVLDTLPQVTTRSHVMIERLGVMPEYHRMEERGVLRRRKAEKLG-------FSDDQAAQV 342 Query: 1067 SQKVISRVLVSVGFEGTTEVPMEVLSQFLGCHICKLGRILKVLTDNYRKQCSAIEILRMF 888 S+KV++R+L+++GFEG TEVP++V SQ + HI KLGRIL++LTD+Y+K+CSA ++++MF Sbjct: 343 SRKVVARMLLTMGFEGATEVPVDVFSQLVSRHISKLGRILRLLTDSYKKECSATQLIKMF 402 Query: 887 LQTLGYSNLGNLAEHVKDGSRNFSHPTQQPHLRGLQSGFQSQHPSSNLQTQQIPRXXXXX 708 L T GYSNLG+LAE VKDG+RN P Q + LQ Q +S QQI R Sbjct: 403 LNTTGYSNLGSLAELVKDGTRNHP-PLNQKQPQMLQQQLHLQQQASLRLPQQIQR-QMHP 460 Query: 707 XXXXXXQNLAFPQQPQLERMRRRQPSTPRAGMTLDKDRPMVEVKIENTSELPIDSNAFNP 528 + F QQ QLER+RRRQ ++PR M ++KDRP+V+VK+EN SE+ +D NAFNP Sbjct: 461 QMQQMVNSPTFQQQQQLERLRRRQVTSPRPNMDMEKDRPLVQVKLENPSEMAVDGNAFNP 520 Query: 527 IHARHP---QIQFRQQSMAAAMANLNPQSNH-QFKQLASLQVPQLQTQNMGTTRAPPVKV 360 ++ RH Q Q RQQ + AAM+N+ Q + QF+QLAS+Q+PQ+QT T RA PVKV Sbjct: 521 MNPRHQQQIQHQLRQQHI-AAMSNMQQQPGYNQFRQLASMQIPQMQTPTPATVRAQPVKV 579 Query: 359 EGFQELMGGDTTLKHDSEEHKLTSPSK 279 EGF++LMGGD++LKH+ ++ + P+K Sbjct: 580 EGFEQLMGGDSSLKHELDDKLRSPPTK 606 >emb|CAN70982.1| hypothetical protein VITISV_027119 [Vitis vinifera] Length = 405 Score = 542 bits (1396), Expect = e-151 Identities = 281/417 (67%), Positives = 319/417 (76%) Frame = -1 Query: 2120 MALLGEDGLGFELARKLETCGVWRSWLGDSNYSNFVHVLSSPLTWESFMKPQDSKTRAQI 1941 MALLG+DG GFELARKLE+CGVWRSWLGD+ YSNFV LSSP TWESFM+ DSK+RAQI Sbjct: 1 MALLGDDGRGFELARKLESCGVWRSWLGDALYSNFVQYLSSPNTWESFMRSDDSKSRAQI 60 Query: 1940 QLQLRVRALLFDKASVSLFLRXXXXXXXXXXXXXXXSNLNPNYLQLHGDDVYFSLEDFGS 1761 QLQLR RALLFDKASVSLFLR LNP+YLQLHGDDVYF+LE Sbjct: 61 QLQLRARALLFDKASVSLFLRSPSTPTSSLPVS----KLNPSYLQLHGDDVYFTLE---- 112 Query: 1760 SSQDGVQHQEGSTSSNSIQSKVVQIQPKLGFSVGSRYVEPEVDNISERPRPDDFPETWYK 1581 QD VQ +EG +SN+ SK IQPK FSVG RY E E+DNIS+R R ++FPETWY Sbjct: 113 --QDVVQQREGVVASNTAPSK---IQPKAAFSVGXRYAESEIDNISQRFRHEEFPETWYN 167 Query: 1580 QFIEKYSTSRQQRLPFGDREALKRTSEGMSTYLKLHERHKRSRQPFKEDQYTGFGNPIWE 1401 FIEKY SR +L FG+RE+ KRT MS Y+KL E+HK+ R FKEDQ+ GFGNPI E Sbjct: 168 LFIEKYKASRPYKLSFGERESDKRTPRDMSVYIKLLEKHKKRRVAFKEDQHMGFGNPIVE 227 Query: 1400 NGSNIHSKSVSDGNNSIDDETCFFPEMMFPLNCVPDSALPSINRLEDNQKVECYGVLDSL 1221 N S+++ SV DG NS+DD+T FFPE MF LNCVPDSAL INR+EDNQKVE YGVLD+L Sbjct: 228 NKSSMYPSSVLDGKNSVDDDTYFFPETMFTLNCVPDSALLPINRVEDNQKVEFYGVLDTL 287 Query: 1220 PHVMTRSPAMLERFGIRPEYLRIGLERSKYRGKNGLEGNGKVLGPEQASQMSQKVISRVL 1041 P VMTRSP M+ER GIRPEY + S+YR KNG EGN K+LG EQA QMSQKVI+R+L Sbjct: 288 PQVMTRSPIMIERLGIRPEYHSMEQGGSQYRNKNGTEGNRKLLGQEQALQMSQKVIARML 347 Query: 1040 VSVGFEGTTEVPMEVLSQFLGCHICKLGRILKVLTDNYRKQCSAIEILRMFLQTLGY 870 +GFE TEVPMEVLSQ L CHICKLGRILKVL+DNYRKQCSA E+L+MFLQT GY Sbjct: 348 TKMGFEVATEVPMEVLSQLLSCHICKLGRILKVLSDNYRKQCSATELLKMFLQTTGY 404 >dbj|BAA98173.1| unnamed protein product [Arabidopsis thaliana] Length = 595 Score = 540 bits (1391), Expect = e-150 Identities = 304/622 (48%), Positives = 411/622 (66%), Gaps = 8/622 (1%) Frame = -1 Query: 2120 MALLGEDGLGFELARKLETCGVWRSWLGDSNYSNFVHVLSSPLTWESFMKPQDSKTRAQI 1941 MALLG+DG GF+LARKLE GVWR+WLGDS YS+F H LSSP TWE+FM+ +SK+RAQI Sbjct: 1 MALLGDDGRGFDLARKLEVSGVWRTWLGDSIYSSFHHYLSSPSTWEAFMRVDESKSRAQI 60 Query: 1940 QLQLRVRALLFDKASVSLFLRXXXXXXXXXXXXXXXSNLNPNYLQLHGDDVYFSLEDFGS 1761 QLQLRVRALLFDKA+VSLFLR + + + LHGDDVY++LE+ + Sbjct: 61 QLQLRVRALLFDKATVSLFLRSNTIAASSSSSASIS---DVSSVALHGDDVYYTLEN--A 115 Query: 1760 SSQDGVQHQEGSTSSNSIQSKVVQIQPKLGFSVGSRYVEPEVDNISERPRPDDFPETWYK 1581 S + G Q + G + S+ + K F+ G+R E + N+S+R R ++ P+TWY Sbjct: 116 SLESGFQREGGIRHNPSLTKSL----SKPSFTSGTRGSESDFSNLSQRSRFEELPDTWYT 171 Query: 1580 QFIEKYSTSRQQRLPFGDREALKRTSEGMSTYLKLHERHKRSRQPFKEDQYTGFGNPIWE 1401 QFI +Y + + G +E+ KRT EGMSTYL++ + HKR R PF ED+ Sbjct: 172 QFISRYGF--KYGMSVGGQESDKRTPEGMSTYLRVVDTHKRKRAPFLEDRSLAH-----M 224 Query: 1400 NGSNIHSKSVSDGNNSIDDETCFFPEMMFPLNCVPDSALPSINRLEDNQKVECYGVLDSL 1221 + S+ H S DG+ S +D+ F PE MF +NCVP++AL I R +DN K E YGVLD+L Sbjct: 225 SRSSTHPSSGFDGSTS-EDDILFLPETMFRMNCVPETALSPITRTQDNLKTEFYGVLDTL 283 Query: 1220 PHVMTRSPAMLERFGIRPEYLRI---GLERSKYRGKNGLEGNGKVLGPEQASQMSQKVIS 1050 P V TRS M+ER G+ PEY R+ G+ RS+ K G +QA+ +S+KV++ Sbjct: 284 PQVTTRSHIMIERLGLMPEYHRMEERGVLRSRKAEKMG-------FSDDQAALVSRKVVA 336 Query: 1049 RVLVSVGFEGTTEVPMEVLSQFLGCHICKLGRILKVLTDNYRKQCSAIEILRMFLQTLGY 870 R+L+++GFEG TEVP++V SQ + H+ KLGRILK+LTD+Y+K+CSA+++++MFL T GY Sbjct: 337 RMLLTMGFEGATEVPIDVFSQLVSRHMSKLGRILKLLTDSYKKECSAMQLIKMFLNTTGY 396 Query: 869 SNLGNLAEHVKDGSRNFSHPTQ-QPHLRGLQSGFQSQHPSSNLQTQQIPRXXXXXXXXXX 693 SNLG+LAE VKDG+RN P Q QP + LQ Q +S QQI R Sbjct: 397 SNLGSLAEIVKDGTRNHPPPNQKQPQV--LQQQLHLQQQASLRLPQQIQRQMHPQMQQMV 454 Query: 692 XQNLAFPQQPQLERMRRRQPSTPRAGMTLDKDRPMVEVKIENTSELPIDSNAFNPIHARH 513 F QQ QLERMRRR ++PR M ++KDRP+V+VK+EN SE+ +D NAFNP++ RH Sbjct: 455 NPQ-NFQQQQQLERMRRRPVTSPRPNMDMEKDRPLVQVKLENPSEMAVDGNAFNPMNPRH 513 Query: 512 P---QIQFRQQSMAAAMANLNPQSNH-QFKQLASLQVPQLQTQNMGTTRAPPVKVEGFQE 345 Q Q RQQ AAM+N+ Q + QF+QLAS+Q+PQ+QT +GT RA PVKVEGF++ Sbjct: 514 QQQLQQQLRQQQQIAAMSNMQQQPGYNQFRQLASMQIPQMQTPTLGTVRAQPVKVEGFEQ 573 Query: 344 LMGGDTTLKHDSEEHKLTSPSK 279 LMGGD++LKHDS++ + P+K Sbjct: 574 LMGGDSSLKHDSDDKLRSPPTK 595 >ref|XP_002864962.1| hypothetical protein ARALYDRAFT_496788 [Arabidopsis lyrata subsp. lyrata] gi|297310797|gb|EFH41221.1| hypothetical protein ARALYDRAFT_496788 [Arabidopsis lyrata subsp. lyrata] Length = 603 Score = 530 bits (1365), Expect = e-147 Identities = 304/627 (48%), Positives = 409/627 (65%), Gaps = 15/627 (2%) Frame = -1 Query: 2120 MALLGEDGLGFELARKLETCGVWRSWLGDSNYSNFVHVLSSPLTWESFMKPQDSKTRAQI 1941 MALLG+DG GF+LAR+LE GVWR+WLGDS YS+F H L+SP WE+FM+ +SK RAQI Sbjct: 1 MALLGDDGRGFDLARRLELSGVWRTWLGDSIYSSFHHYLTSPSNWEAFMRVDESKCRAQI 60 Query: 1940 QLQLRVRALLFDKASVSLFLR-------XXXXXXXXXXXXXXXSNLNPNYLQLHGDDVYF 1782 QLQLRVRALLFDKA+VSLFLR S LNPNYLQLHGDDVY+ Sbjct: 61 QLQLRVRALLFDKATVSLFLRSNTIAASSSSSASISDVSSVAVSKLNPNYLQLHGDDVYY 120 Query: 1781 SLEDFGSSSQDGVQHQEGSTSSNSIQSKVVQIQPKLGFSVGSRYVEPEVDNISERPRPDD 1602 +LE+ +S + G Q G + S+ + K F G+R E + N+S+R R ++ Sbjct: 121 TLEN--ASLESGFQRDGGIRHNQSLTKSL----SKPSFISGTRGSESDFSNLSQRSRFEE 174 Query: 1601 FPETWYKQFIEKYSTSRQQRLPFGDREALKRTSEGMSTYLKLHERHKRSRQPFKEDQYTG 1422 P+TWY QFI +Y + + G +E+ KRT EGMSTYL++ + HKR R PF ED+ Sbjct: 175 LPDTWYTQFISRYGF--KYGMSVGGQESDKRTPEGMSTYLRVVDTHKRKRAPFLEDRSLA 232 Query: 1421 FGNPIWENGSNIHSKSVSDGNNSIDDETCFFPEMMFPLNCVPDSALPSINRLEDNQKVEC 1242 + S+ H S DG +S +D+ F PE MF +NCVP++AL + R +DN K E Sbjct: 233 H-----MSRSSTHPSSGFDGRSS-EDDILFLPETMFRMNCVPETALSPVTRTQDNLKTEF 286 Query: 1241 YGVLDSLPHVMTRSPAMLERFGIRPEYLRI---GLERSKYRGKNGLEGNGKVLGPEQASQ 1071 YGVLD+LP V TRS M+ER G+ PEY R+ G+ R + K G +QA+ Sbjct: 287 YGVLDTLPQVTTRSHIMIERLGMMPEYHRMEDRGVLRRRKAEKLG-------FSDDQAAL 339 Query: 1070 MSQKVISRVLVSVGFEGTTEVPMEVLSQFLGCHICKLGRILKVLTDNYRKQCSAIEILRM 891 +S+KV++R+L+++GFEG TEVP++V SQ + H+ KLG ILK+L+D+Y+K+CSA+++++M Sbjct: 340 VSRKVVARMLLTMGFEGATEVPIDVFSQLVSRHMSKLGHILKLLSDSYKKECSAMQLIKM 399 Query: 890 FLQTLGYSNLGNLAEHVKDGSRNFSHPTQ-QPHLRGLQSGFQSQHPSSNLQTQQIPRXXX 714 FL T GYSNLG+LAE VKDG+RN P Q QP + LQ Q +S QQI R Sbjct: 400 FLNTTGYSNLGSLAELVKDGTRNHPPPNQKQPQV--LQQQLHLQQQASLRLPQQIQRQMH 457 Query: 713 XXXXXXXXQNLAFPQQPQLERMRRRQPSTPRAGMTLDKDRPMVEVKIENTSELPIDSNAF 534 F QQ QLERMRRR ++PR M ++KDRP+V+VK+EN S++ +D NAF Sbjct: 458 PQMQQMVNPQ-NFQQQQQLERMRRRPVTSPRPNMDMEKDRPLVQVKLENPSDMAVDGNAF 516 Query: 533 NPIHARHP---QIQFRQQSMAAAMANLNPQSNH-QFKQLASLQVPQLQTQNMGTTRAPPV 366 NP++ RH Q Q RQQ + AA +N+ Q + QF+QLAS+Q+PQ+QT GT RA PV Sbjct: 517 NPMNPRHQQQMQQQLRQQQI-AAKSNMQQQPGYSQFRQLASMQIPQMQTPTPGTVRAQPV 575 Query: 365 KVEGFQELMGGDTTLKHDSEEHKLTSP 285 KVEGF++LMGGD++LKH+S++ KL SP Sbjct: 576 KVEGFEQLMGGDSSLKHESDD-KLRSP 601 >gb|EYU30927.1| hypothetical protein MIMGU_mgv1a003113mg [Mimulus guttatus] Length = 607 Score = 518 bits (1334), Expect = e-144 Identities = 313/650 (48%), Positives = 410/650 (63%), Gaps = 36/650 (5%) Frame = -1 Query: 2120 MALLGEDGLGFELARKLETCGVWRSWLGDSNYSNFVHVLSSPLTWESFMKPQDSKTRAQI 1941 MALLG+DG GFELARKLE+ GVWR WLGD++YS F++ L+SP W+ FM+ SKT+ QI Sbjct: 1 MALLGDDGRGFELARKLESHGVWRPWLGDAHYSAFINFLASPEKWDIFMRADKSKTKDQI 60 Query: 1940 QLQLRVRALLFDKASVSLFLRXXXXXXXXXXXXXXXSNLNPNYLQLHGDDVYFSLEDFGS 1761 LQLR RALLFDKASVSLF + S LNPNYL+LHGDDVYF+ ED Sbjct: 61 YLQLRARALLFDKASVSLFTQ--------SPPPAPVSKLNPNYLELHGDDVYFTFED--- 109 Query: 1760 SSQDGVQHQEGSTSSNSIQSKVVQIQPKLGFSVGSRYVEPEVDNISERPRPDDFPETWYK 1581 ++D Q Q +SN+ SK K VGSR+ E E +E + ++ PETWY Sbjct: 110 GAKDVDQRQPSLAASNTTSSKGYS---KTSVGVGSRFNETE----TETDKLEELPETWYS 162 Query: 1580 QFIEKYSTSRQQRLPFGDREALKRTSEGMSTYLKLHERHKRSRQPFKEDQYTGFGNPIWE 1401 QF EKY S+ RL FGDRE+ KRT E MSTYL++ E HKR R F + Sbjct: 163 QFFEKYRASKSYRLIFGDRESEKRTPEQMSTYLRVLENHKRRRVAFV------------D 210 Query: 1400 NGSNIHSKSVSDGNNSIDDETCFFPEMMFPLNCVPDSALPSINRLEDNQKVECYGVLDSL 1221 N SN+ S+S+ D+ FPE MF LNCVPDSA+ + LE++QK++ GVLD+L Sbjct: 211 NTSNLRPNSLSE-----LDDIPLFPETMFTLNCVPDSAVLQTSGLENHQKLQFNGVLDNL 265 Query: 1220 PHVMTR----SPAMLERFGIRPEYLRIGLERSKYRGKNGLEGNGKVLGPEQASQMSQKVI 1053 P +MT+ SP M+ER GIRPE+L + + RG+N G+ ++ G EQA Q+S+KV+ Sbjct: 266 PQIMTKSTMISPIMIERLGIRPEFLNM----EQTRGRN---GSMRIRGEEQAVQISKKVV 318 Query: 1052 SRVLVSVGFEGTTEVPMEVLSQFLGCHICKLGRILKVLTDNYRKQCSAIEILRMFLQTLG 873 +R+L +VGFE +++ +EVL Q L CHI KLGR LK+L+D+YRKQCSA E+++MFLQT G Sbjct: 319 ARLLTNVGFESCSDLSLEVLPQLLSCHIGKLGRTLKLLSDSYRKQCSANELVKMFLQTAG 378 Query: 872 YS-NLGNLAEHVKDGSRNFSHPTQQPHLRGLQSGFQSQHPSSNLQTQQIPR--XXXXXXX 702 YS N+G L + +KD ++N P QQ ++ +Q+ Q Q S L +QQIPR Sbjct: 379 YSNNMGALVQIIKDNTKNGVQPNQQ-QVQAIQAQLQLQQQPSILPSQQIPRQINPQMQQQ 437 Query: 701 XXXXQNLAF-PQQPQLERMRRRQPS-TPRAGMT-------------LDKD-RPMVEVKIE 570 Q LAF QQ Q ERMRRRQ PR GM +DKD RP+V+VK+E Sbjct: 438 MNNAQYLAFQQQQQQWERMRRRQQQPAPRPGMNTNVNMNMNTNTNMIDKDNRPLVQVKME 497 Query: 569 NTSELPIDSNAFNPIHARHPQI----------QFRQQSMA---AAMANLNPQSNHQFKQL 429 N SE P+D+NAF +++RHPQ+ Q QQ +A A N N +N+ F+ + Sbjct: 498 NPSEFPLDANAFAAVNSRHPQLLQIRHQQEQQQLAQQQLAQQVQANNNNNNNNNNVFRPM 557 Query: 428 ASLQVPQLQTQNMGTTRAPPVKVEGFQELMGGDTTLKHDSEEHKLTSPSK 279 SLQ+PQ+ + +M RAPPVKVEGFQELMGGD+++KHDSEE+KL SP K Sbjct: 558 TSLQIPQILSPSMSMPRAPPVKVEGFQELMGGDSSIKHDSEENKLLSPQK 607 >ref|XP_007047764.1| Transcription initiation factor TFIID subunit 8, putative isoform 2 [Theobroma cacao] gi|508700025|gb|EOX91921.1| Transcription initiation factor TFIID subunit 8, putative isoform 2 [Theobroma cacao] Length = 489 Score = 506 bits (1304), Expect = e-140 Identities = 269/467 (57%), Positives = 330/467 (70%), Gaps = 2/467 (0%) Frame = -1 Query: 2120 MALLGEDGLGFELARKLETCGVWRSWLGDSNYSNFVHVLSSPLTWESFMKPQDSKTRAQI 1941 MALLG+DG G++LAR+LE+CGVWR+WLGDS Y++F+H LSSP WESFM+ DSK+R+QI Sbjct: 1 MALLGDDGRGYDLARRLESCGVWRAWLGDSTYASFIHFLSSPSAWESFMRVDDSKSRSQI 60 Query: 1940 QLQLRVRALLFDKASVSLFLRXXXXXXXXXXXXXXXS--NLNPNYLQLHGDDVYFSLEDF 1767 LQLR RALLFDKA+V+LFLR + LNPNYLQLHGDDVYF+LE Sbjct: 61 HLQLRARALLFDKATVALFLRSNSSNPANNTSSSSVAVSKLNPNYLQLHGDDVYFTLE-- 118 Query: 1766 GSSSQDGVQHQEGSTSSNSIQSKVVQIQPKLGFSVGSRYVEPEVDNISERPRPDDFPETW 1587 GS Q+G ++N+ SK K FS GSRY E E D++S+R R ++ PETW Sbjct: 119 GSL-------QDGGAAANAAPSK-----SKSSFSAGSRYGESEFDSLSQRYRKEELPETW 166 Query: 1586 YKQFIEKYSTSRQQRLPFGDREALKRTSEGMSTYLKLHERHKRSRQPFKEDQYTGFGNPI 1407 Y QFIEKY SR +L GDRE+ KRT E M+TYL++ E+HKR R F+EDQY G+G+ Sbjct: 167 YNQFIEKYRLSRPYKLFLGDRESEKRTPEEMTTYLRIVEKHKRRRVAFQEDQYMGYGS-- 224 Query: 1406 WENGSNIHSKSVSDGNNSIDDETCFFPEMMFPLNCVPDSALPSINRLEDNQKVECYGVLD 1227 + + S SV DGNNS DDE FFPE+M +NCVPDSALP R+ D + +E YGVLD Sbjct: 225 ----TGLESNSVLDGNNSGDDEIPFFPEIMSMMNCVPDSALPPATRVWDKKTIEFYGVLD 280 Query: 1226 SLPHVMTRSPAMLERFGIRPEYLRIGLERSKYRGKNGLEGNGKVLGPEQASQMSQKVISR 1047 +LP V TRSP M+ER GIRPEYL + + +RGKN N K+LG EQASQMS+KVI+R Sbjct: 281 TLPQVSTRSPVMIERLGIRPEYLNMEQGGNTHRGKN----NRKLLGQEQASQMSRKVIAR 336 Query: 1046 VLVSVGFEGTTEVPMEVLSQFLGCHICKLGRILKVLTDNYRKQCSAIEILRMFLQTLGYS 867 +L VGFEG TE P+EV SQFL CHIC+LGR +KVLTDNYRKQCSAIE++RMFLQT GYS Sbjct: 337 LLNGVGFEGATEAPVEVFSQFLSCHICRLGRNIKVLTDNYRKQCSAIELIRMFLQTSGYS 396 Query: 866 NLGNLAEHVKDGSRNFSHPTQQPHLRGLQSGFQSQHPSSNLQTQQIP 726 N G LAE VKD +RN T Q + G+QS Q QH ++ QQ+P Sbjct: 397 NFGTLAELVKDSTRNVVQQTPQ-QMHGIQSQLQPQHQNALRMAQQLP 442 Score = 88.6 bits (218), Expect = 1e-14 Identities = 43/74 (58%), Positives = 50/74 (67%) Frame = -1 Query: 506 IQFRQQSMAAAMANLNPQSNHQFKQLASLQVPQLQTQNMGTTRAPPVKVEGFQELMGGDT 327 +Q Q M + L PQ + + L + Q+ QNMG RAPPVKVEGFQELMGGDT Sbjct: 413 VQQTPQQMHGIQSQLQPQHQNALRMAQQLPMRQMHPQNMGIVRAPPVKVEGFQELMGGDT 472 Query: 326 TLKHDSEEHKLTSP 285 TLKHDSEE+KLTSP Sbjct: 473 TLKHDSEENKLTSP 486 >ref|XP_007047765.1| Transcription initiation factor TFIID subunit 8, putative isoform 3 [Theobroma cacao] gi|508700026|gb|EOX91922.1| Transcription initiation factor TFIID subunit 8, putative isoform 3 [Theobroma cacao] Length = 445 Score = 503 bits (1295), Expect = e-139 Identities = 268/465 (57%), Positives = 328/465 (70%), Gaps = 2/465 (0%) Frame = -1 Query: 2120 MALLGEDGLGFELARKLETCGVWRSWLGDSNYSNFVHVLSSPLTWESFMKPQDSKTRAQI 1941 MALLG+DG G++LAR+LE+CGVWR+WLGDS Y++F+H LSSP WESFM+ DSK+R+QI Sbjct: 1 MALLGDDGRGYDLARRLESCGVWRAWLGDSTYASFIHFLSSPSAWESFMRVDDSKSRSQI 60 Query: 1940 QLQLRVRALLFDKASVSLFLRXXXXXXXXXXXXXXXS--NLNPNYLQLHGDDVYFSLEDF 1767 LQLR RALLFDKA+V+LFLR + LNPNYLQLHGDDVYF+LE Sbjct: 61 HLQLRARALLFDKATVALFLRSNSSNPANNTSSSSVAVSKLNPNYLQLHGDDVYFTLE-- 118 Query: 1766 GSSSQDGVQHQEGSTSSNSIQSKVVQIQPKLGFSVGSRYVEPEVDNISERPRPDDFPETW 1587 GS Q+G ++N+ SK K FS GSRY E E D++S+R R ++ PETW Sbjct: 119 GSL-------QDGGAAANAAPSK-----SKSSFSAGSRYGESEFDSLSQRYRKEELPETW 166 Query: 1586 YKQFIEKYSTSRQQRLPFGDREALKRTSEGMSTYLKLHERHKRSRQPFKEDQYTGFGNPI 1407 Y QFIEKY SR +L GDRE+ KRT E M+TYL++ E+HKR R F+EDQY G+G+ Sbjct: 167 YNQFIEKYRLSRPYKLFLGDRESEKRTPEEMTTYLRIVEKHKRRRVAFQEDQYMGYGS-- 224 Query: 1406 WENGSNIHSKSVSDGNNSIDDETCFFPEMMFPLNCVPDSALPSINRLEDNQKVECYGVLD 1227 + + S SV DGNNS DDE FFPE+M +NCVPDSALP R+ D + +E YGVLD Sbjct: 225 ----TGLESNSVLDGNNSGDDEIPFFPEIMSMMNCVPDSALPPATRVWDKKTIEFYGVLD 280 Query: 1226 SLPHVMTRSPAMLERFGIRPEYLRIGLERSKYRGKNGLEGNGKVLGPEQASQMSQKVISR 1047 +LP V TRSP M+ER GIRPEYL + + +RGKN N K+LG EQASQMS+KVI+R Sbjct: 281 TLPQVSTRSPVMIERLGIRPEYLNMEQGGNTHRGKN----NRKLLGQEQASQMSRKVIAR 336 Query: 1046 VLVSVGFEGTTEVPMEVLSQFLGCHICKLGRILKVLTDNYRKQCSAIEILRMFLQTLGYS 867 +L VGFEG TE P+EV SQFL CHIC+LGR +KVLTDNYRKQCSAIE++RMFLQT GYS Sbjct: 337 LLNGVGFEGATEAPVEVFSQFLSCHICRLGRNIKVLTDNYRKQCSAIELIRMFLQTSGYS 396 Query: 866 NLGNLAEHVKDGSRNFSHPTQQPHLRGLQSGFQSQHPSSNLQTQQ 732 N G LAE VKD +RN T Q + G+QS Q QH ++ QQ Sbjct: 397 NFGTLAELVKDSTRNVVQQTPQ-QMHGIQSQLQPQHQNALRMAQQ 440