BLASTX nr result
ID: Catharanthus22_contig00007859
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00007859
         (2598 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

gb|EXB22546.1| hypothetical protein L484_002900 [Morus notabilis]     386   e-104
emb|CBI19274.3| unnamed protein product [Vitis vinifera]              356   3e-95
ref|XP_004252718.1| PREDICTED: uncharacterized protein LOC101249...   350   2e-93
ref|XP_006366421.1| PREDICTED: uncharacterized protein LOC102582...   348   7e-93
ref|XP_002283801.2| PREDICTED: uncharacterized protein LOC100245...   348   9e-93
ref|XP_006338056.1| PREDICTED: uncharacterized protein LOC102605...   340   2e-90
ref|XP_006433817.1| hypothetical protein CICLE_v10000622mg [Citr...   338   5e-90
ref|XP_006338055.1| PREDICTED: uncharacterized protein LOC102605...   337   1e-89
gb|EOY15457.1| Homeodomain-like superfamily protein isoform 1 [T...   337   1e-89
ref|XP_006472453.1| PREDICTED: putative GPI-anchored protein PB1...   336   3e-89
gb|EMJ23216.1| hypothetical protein PRUPE_ppa002943mg [Prunus pe...   333   2e-88
ref|XP_002302346.1| myb family transcription factor family prote...   328   6e-87
gb|EOY15458.1| Homeodomain-like superfamily protein isoform 2 [T...   327   1e-86
gb|EMJ25790.1| hypothetical protein PRUPE_ppa1027142mg [Prunus p...   324   1e-85
ref|XP_004237998.1| PREDICTED: uncharacterized protein LOC101255...   324   1e-85
ref|XP_002514048.1| DNA binding protein, putative [Ricinus commu...   311   1e-81
ref|XP_006844749.1| hypothetical protein AMTR_s00016p00255950 [A...   300   2e-78
emb|CAN60243.1| hypothetical protein VITISV_010188 [Vitis vinifera]   300   2e-78
ref|XP_004152740.1| PREDICTED: uncharacterized protein LOC101206...
294 1e-76 ref|XP_004163958.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri... 294 2e-76 >gb|EXB22546.1| hypothetical protein L484_002900 [Morus notabilis] Length = 854 Score = 386 bits (992), Expect = e-104 Identities = 282/726 (38%), Positives = 382/726 (52%), Gaps = 49/726 (6%) Frame = -3 Query: 2302 MVERSAKRQKKNFVSEEDISTVLQRYTATTXXXXXXXXXXXXXXVKIDWNALVKKTTTGI 2123 M+E+++K+QKK VSEED+ ++LQRYTATT KIDWN LV+K++TGI Sbjct: 1 MIEKASKKQKKGSVSEEDVVSLLQRYTATTVLTLLNEVANCTDV-KIDWNVLVEKSSTGI 59 Query: 2122 SSAREYQMLWRHLAYRDSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXACVKV 1943 S+A EYQMLWRHLAYR S ACVKV Sbjct: 60 SNASEYQMLWRHLAYRHSFLEKFEDGAQPLDDDSDLEYELEASPVVNNETSNEAAACVKV 119 Query: 1942 LIASGVSND--PTGSTVEGPLTISMPRGQTSKARENSQSTNCTLGTNITVPISVQKQPMP 1769 LIASG+ +D P+GST+E PLTI++P GQ S A E Q + T GTNI VP+SVQKQP P Sbjct: 120 LIASGLPSDTNPSGSTIEAPLTINIPNGQPSGALE--QPSCSTQGTNIIVPVSVQKQPAP 177 Query: 1768 FNANAEGLDSNGAANPNLPRRRRKPWSHAEDMELIAAVQKCGEGNWANILKGDFKGDRTA 1589 E LD+NG+A+ NL +R+RKPWS AED+ELIAAVQKCGEGNWANIL+GDFKGDRTA Sbjct: 178 AVTVVEPLDTNGSASGNLLKRKRKPWSEAEDLELIAAVQKCGEGNWANILRGDFKGDRTA 237 Query: 1588 SQLSQRWNIIKKKNGNLNVGT---GSQISDVHLATRRAVDMALGKP--TMPSCSIANAGV 1424 SQLSQRW II+K++GNLN+G+ G+Q+S+ LA R A+ +AL P + + +I++AG Sbjct: 238 SQLSQRWAIIRKRHGNLNLGSSSNGTQLSEAQLAARHAMSLALNMPVKNLTANTISHAG- 296 Query: 1423 NSNAAQSSLGQPTPGTETGRTQPSQHDSVPAGLGPLGSSKARVAPPKKPSTKTTLSPEST 1244 + A +S+G T T S + AG G+S ++ + + + SP + Sbjct: 297 -TTALNNSMG-------TNSTNKSAGTNAAAG----GNSSLQLQNQSQENLASKESPVGS 344 Query: 1243 ---VTKSNLNPDSVASKSTLSPDSLVKATAVIVGARIGTASDAASLVKAAQSKNAVHIMP 1073 +TK+ + KST S D++V+ATAV GARI + SDAASL+KAAQ+KNA+HI P Sbjct: 345 LGPITKARIPMKKPLVKSTPSSDAMVRATAVAAGARIASPSDAASLLKAAQAKNAIHIRP 404 Query: 1072 GGS-LIKSSVAGS----SNSFPSNVHYICTGLVSRPTSSYSSAPPNASQVGGTHQMQGPS 908 GS IKSS+ G S + P NVHYI TGL S P S+Y++A P+ S Sbjct: 405 TGSGSIKSSMPGGLPAPSEAHP-NVHYIRTGLASAPVSNYAAATPSVPCPA--------S 455 Query: 907 TKSAASVVQPSLGGATASDLSGLAETKGGANSDASSGHPD----APAVEKSSSNAAKTIK 740 KS +S VQ + T G + D SS + PA E 
AKT++ Sbjct: 456 VKSISSPVQQT-------------PTSNGTSLDVSSKQKNYVSCTPAHELPLKQEAKTVE 502 Query: 739 ELVLEGQTDIKGKLTNKQIEGDQNAIAGSTP----MDVDASRNSPRDKVEGCQTA--VLS 578 E+ + G +QI+GD ++ ++ D + P +++G +S Sbjct: 503 EI----KVPASGSAAKQQIQGDGACVSANSQDGLVQDNKVAAPDPDAELKGTSDVGKPVS 558 Query: 577 KSLEDQAEGDKVSA---APDAASKAQGHIDSSSSGQAAGNGDHNI--------------- 452 E AE D++ D S+ I SS G + NI Sbjct: 559 TLNERTAENDRLIVDIKFKDRESEKGNEIISSLVGAGENSEHQNIYKMQEDHAVGENVEP 618 Query: 451 KGNDKTMSLAAEHNGEIPSAIEKIHEN------GSSSAAKEGAEEMVVDGSAGEKCPSQQ 290 + DK LA NGE P I+K+ E+ S A E +E +D + + Sbjct: 619 QNIDKMQDLAVGENGE-PHKIDKMQEDHAVGIISSLVGAGENSEHQNIDKMQEDHAVGEN 677 Query: 289 GTSNGI 272 G I Sbjct: 678 GEPRNI 683 >emb|CBI19274.3| unnamed protein product [Vitis vinifera] Length = 641 Score = 356 bits (913), Expect = 3e-95 Identities = 255/645 (39%), Positives = 337/645 (52%), Gaps = 19/645 (2%) Frame = -3 Query: 2284 KRQKKNFVSEEDISTVLQRYTATTXXXXXXXXXXXXXXVKIDWNALVKKTTTGISSAREY 2105 K +KK +SEED+S +LQRYT T KIDWNALV KT+TGIS+AREY Sbjct: 6 KMRKKGTISEEDVSALLQRYTPTAVLALLQEVAQLPDV-KIDWNALVNKTSTGISNAREY 64 Query: 2104 QMLWRHLAYRDSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXACVKVLIASGV 1925 QMLWRHLAY + ACVKVLIAS + Sbjct: 65 QMLWRHLAYGHALLEKLEDGAQPLDDDSDLEYDLEAFPSISTEASAEATACVKVLIASSL 124 Query: 1924 SND---PTGSTVEGPLTISMPRGQTSKA-RENSQSTNCTLGTNITVPISVQKQPMPFNAN 1757 +D P S VE PLTI++P GQ+S+A E S+ + GTNIT+P+SVQK Sbjct: 125 PSDSSLPNSSMVEAPLTINIPCGQSSRAPSEYSRLSGSMQGTNITIPVSVQK-------- 176 Query: 1756 AEGLDSNGAANPNLP-RRRRKPWSHAEDMELIAAVQKCGEGNWANILKGDFKGDRTASQL 1580 +EG D+NG+ + +LP R++RKPWS ED ELIAAVQKCGEGNWANILKGDFKGDR+ASQL Sbjct: 177 SEGFDANGSTSGSLPARKKRKPWSSDEDKELIAAVQKCGEGNWANILKGDFKGDRSASQL 236 Query: 1579 SQRWNIIKKKNGNLNVG----TGSQISDVHLATRRAVDMALGKPTMP-SCSIANAGVNSN 1415 SQRW II+KK+ NLNVG GSQ+S+ LA R A+ +AL P + S + AG N N Sbjct: 237 SQRWTIIRKKHKNLNVGGANSNGSQLSEAQLAARHAMSLALDMPVKNLTTSSSIAGTNPN 296 Query: 1414 AAQSSLGQPTPGTETGRTQPSQHDSVPAGLGPLGSSKARVAPPKKP-STKTTLSPESTVT 1238 A S+ P E ++PA S+A+ + P ST + + + Sbjct: 297 
ATSSNSAFPATPAE----------ALPAS---TNISQAQQLSQQGPVSTLSQMGSLGSAP 343 Query: 1237 KSNLNPDSVASKSTLSPDSLVKATAVIVGARIGTASDAASLVKAAQSKNAVHIMPGGS-L 1061 KS ++KST S S++KATAV GARI T S AASL+K AQS+NAVHIMPGGS L Sbjct: 344 KSRATSKKTSAKSTFSSQSMLKATAVAAGARIATPSAAASLLKDAQSRNAVHIMPGGSTL 403 Query: 1060 IKSSVAGSSNSFPS-------NVHYICTGLVSRPTSSYSSAPPNASQVGGTHQMQGPSTK 902 IKSSVAG +N P+ NVHY C G + S+YS+ P+ S+ G S K Sbjct: 404 IKSSVAGGANPLPANHLGAHPNVHYKCAGPPTTSLSTYSAVAPSVSRTG--------SAK 455 Query: 901 SAASVVQPSLGGATASDLSGLAETKGGANSDASSGHPDAPAVEKSSSNAAKTIKELVLEG 722 AA GG A S T +S+ ++ + AVE + KT +E Sbjct: 456 PAAP------GGQLAPSPSA---TSVNISSEQTNAATTSLAVEYPAKQETKTSEET---- 502 Query: 721 QTDIKGKLTNKQIEGDQNAIAGSTPMDVDASRNSPRDKVEGCQTAVLSKSLEDQAEGDKV 542 + I G + ++ DQ ++ +T AS D+ T V+ ++ + K Sbjct: 503 KVPISGNVPKAKVLEDQACVSSNT-----ASEQVQEDQATLSNTEVVLENKKAMVSDTKC 557 Query: 541 SAAPDAASKAQGHIDSSSSGQAAGNGDHNIKGNDKTMSLAAEHNG 407 + A G + S + D + G + S+A E++G Sbjct: 558 LLKTETAEN-DGEVAESQNVNDNKIMDFRVAGECENQSVANENSG 601 >ref|XP_004252718.1| PREDICTED: uncharacterized protein LOC101249442 [Solanum lycopersicum] Length = 569 Score = 350 bits (897), Expect = 2e-93 Identities = 226/509 (44%), Positives = 277/509 (54%), Gaps = 18/509 (3%) Frame = -3 Query: 2281 RQKKNFVSEEDISTVLQRYTATTXXXXXXXXXXXXXXVKIDWNALVKKTTTGISSAREYQ 2102 +++K F+SEEDI+ +LQRY+ +T KIDWN +V+K+TTGI++AREYQ Sbjct: 6 KKQKCFISEEDIAILLQRYSVSTVLAILREVGQVADE-KIDWNVMVRKSTTGITNAREYQ 64 Query: 2101 MLWRHLAYRDSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXACVKVLIASGVS 1922 MLWRHLAYR A K+LIASG Sbjct: 65 MLWRHLAYRHDLIDKFDDEAQPLDDDSDLEFELEAFPAVSSEASAEAAASAKMLIASGAP 124 Query: 1921 NDPT---GSTVEGPLTISMPRGQTSKA-RENSQSTNCTLGTNITVPISVQKQPMPFNANA 1754 ND GST+E PLTI++P GQTS+ +NS GTNITVP++VQKQP+ A Sbjct: 125 NDANMLNGSTIEAPLTINIPNGQTSRTGMDNSFQGTSMHGTNITVPVAVQKQPLSTVVAA 184 Query: 1753 EGLDSNGAANPNLP-RRRRKPWSHAEDMELIAAVQKCGEGNWANILKGDFKGDRTASQLS 1577 EGLD++G NLP RR+RKPWS AED+ELIAAVQKCGEGNWANILKGDFKGDRTASQLS Sbjct: 185 
EGLDTHGPGCTNLPPRRKRKPWSEAEDVELIAAVQKCGEGNWANILKGDFKGDRTASQLS 244 Query: 1576 QRWNIIKKKNGNLNVGTGSQISDVHLATRRAVDMALGKPTMPSCSIANAGVNSNAAQSSL 1397 QRW II+K+ G + VG GSQ+S+ LA R A+ AL P A+ G NS S+ Sbjct: 245 QRWAIIRKRQGTM-VGNGSQLSEAQLAARHAMSHALNMPIG-----ASVGPNSGGGSSNS 298 Query: 1396 GQPTPGTETGRTQPSQHDSVPAGLGPLGSSKARVAPPKKPSTKTTLSPESTVTKSNLNPD 1217 P SQH P SSK R+ P K Sbjct: 299 SLPVTADLASGGAQSQHQQDPL------SSKPRIVPQKP--------------------- 331 Query: 1216 SVASKSTLSPDSLVKATAVIVGARIGTASDAASLVKAAQSKNAVHIMPGGSLIKSSVAGS 1037 A K T S DS+VK TAV GARI T+S++AS VK AQ K + I GGS +KSSV GS Sbjct: 332 --APKPTTSSDSMVKVTAVAAGARIATSSNSASQVKLAQPKTPLQIPGGGSAVKSSVLGS 389 Query: 1036 SNSFPSNVHYICTGLVSR---PTSSYSSAPPNASQVGGTHQMQGPSTKSAASVVQP---- 878 +N PSNVH+I TGLVS P + SA P+ + GT Q S K A+ VQP Sbjct: 390 TNGLPSNVHFIRTGLVSHSAGPPKAVHSAGPSHASRPGTQQGLSHSLKPASPTVQPKPIG 449 Query: 877 ------SLGGATASDLSGLAETKGGANSD 809 +L TA + +AE K N + Sbjct: 450 NSSKPNALAVPTAPTSTPVAELKVNTNQE 478 >ref|XP_006366421.1| PREDICTED: uncharacterized protein LOC102582625 [Solanum tuberosum] Length = 574 Score = 348 bits (893), Expect = 7e-93 Identities = 227/531 (42%), Positives = 285/531 (53%), Gaps = 9/531 (1%) Frame = -3 Query: 2281 RQKKNFVSEEDISTVLQRYTATTXXXXXXXXXXXXXXVKIDWNALVKKTTTGISSAREYQ 2102 +++K F+SEEDI+ +LQRY+ +T KIDWNA+V+K+ TGI++AREYQ Sbjct: 6 KKQKCFISEEDIAILLQRYSVSTVLAILQEVGQVADE-KIDWNAMVRKSATGITNAREYQ 64 Query: 2101 MLWRHLAYRDSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXACVKVLIASGVS 1922 MLWRHLAYR A K+LIA G Sbjct: 65 MLWRHLAYRHGLVDKFDDEAQPLDDDSDLEYELEAFPAVSSEASAEAAASAKMLIAYGAP 124 Query: 1921 NDPT---GSTVEGPLTISMPRGQTSKA-RENSQSTNCTLGTNITVPISVQKQPMPFNANA 1754 ND GST+E PLTI++P GQTS+ +NS GTNITVP++VQKQP+ A Sbjct: 125 NDANMLNGSTIEAPLTINIPNGQTSRTGMDNSFQGTSMHGTNITVPVAVQKQPLSTVVAA 184 Query: 1753 EGLDSNGAANPNLP-RRRRKPWSHAEDMELIAAVQKCGEGNWANILKGDFKGDRTASQLS 1577 EGLD++G NLP RR+RKPWS AED+ELIAAVQKCGEGNWANILKGDFKGDRTASQLS Sbjct: 185 EGLDTHGPGCTNLPPRRKRKPWSEAEDVELIAAVQKCGEGNWANILKGDFKGDRTASQLS 244 Query: 1576 
QRWNIIKKKNGNLNVGTGSQISDVHLATRRAVDMALGKPTMPSCSIANAGVNSNAAQSSL 1397 QRW II+K+ G + VG GSQ+S+ LA R A+ AL P A G NS + S+ Sbjct: 245 QRWAIIRKRQGTM-VGNGSQLSEAQLAARHAMSHALNMPIG-----AGVGPNSGSGPSNS 298 Query: 1396 GQPTPGTETGRTQPSQHDSVPAGLGPLGSSKARVAPPKKPSTKTTLSPESTVTKSNLNPD 1217 P SQH P SSK R+ P K Sbjct: 299 SHPVTADLASGGAQSQHQQDPL------SSKPRIVPQKP--------------------- 331 Query: 1216 SVASKSTLSPDSLVKATAVIVGARIGTASDAASLVKAAQSKNAVHIMPGGSLIKSSVAGS 1037 A K T SPDS++K AV GARI T+S++AS VK AQ K + I GG +KSSV GS Sbjct: 332 --APKPTTSPDSMIKVAAVAAGARIATSSNSASQVKLAQPKTPLQIPGGGPAVKSSVLGS 389 Query: 1036 SNSFPSNVHYICTGLVSR----PTSSYSSAPPNASQVGGTHQMQGPSTKSAASVVQPSLG 869 +N PSNVH+I TGLVS P +S+ P NAS+ GT Q+ S K A+ VQP Sbjct: 390 TNGLPSNVHFIRTGLVSHSAGPPKVVHSAVPSNASR-PGTPQVLSHSLKPASPTVQPKPI 448 Query: 868 GATASDLSGLAETKGGANSDASSGHPDAPAVEKSSSNAAKTIKELVLEGQT 716 G ++ N+ A P + V + N + + + V + QT Sbjct: 449 GNSSK-----------PNALAERNSPTSTPVAELKVNTNQEVLQKVQQDQT 488 >ref|XP_002283801.2| PREDICTED: uncharacterized protein LOC100245507 [Vitis vinifera] Length = 606 Score = 348 bits (892), Expect = 9e-93 Identities = 255/676 (37%), Positives = 345/676 (51%), Gaps = 17/676 (2%) Frame = -3 Query: 2284 KRQKKNFVSEEDISTVLQRYTATTXXXXXXXXXXXXXXVKIDWNALVKKTTTGISSAREY 2105 K +KK +SEED+S +LQRYT T KIDWNALV KT+TGIS+AREY Sbjct: 6 KMRKKGTISEEDVSALLQRYTPTAVLALLQEVAQLPDV-KIDWNALVNKTSTGISNAREY 64 Query: 2104 QMLWRHLAYRDSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXACVKVLIASGV 1925 QMLWRHLAY + ACVKVLIAS + Sbjct: 65 QMLWRHLAYGHALLEKLEDGAQPLDDDSDLEYDLEAFPSISTEASAEATACVKVLIASSL 124 Query: 1924 SND---PTGSTVEGPLTISMPRGQTSKA-RENSQSTNCTLGTNITVPISVQKQPMPFNAN 1757 +D P S VE PLTI++P GQ+S+A E S+ + GTNIT+P+SVQK Sbjct: 125 PSDSSLPNSSMVEAPLTINIPCGQSSRAPSEYSRLSGSMQGTNITIPVSVQK-------- 176 Query: 1756 AEGLDSNGAANPNLP-RRRRKPWSHAEDMELIAAVQKCGEGNWANILKGDFKGDRTASQL 1580 +EG D+NG+ + +LP R++RKPWS ED ELIAAVQKCGEGNWANILKGDFKGDR+ASQL Sbjct: 177 SEGFDANGSTSGSLPARKKRKPWSSDEDKELIAAVQKCGEGNWANILKGDFKGDRSASQL 236 Query: 1579 
SQRWNIIKKKNGNLNVG----TGSQISDVHLATRRAVDMALGKPTMPSCSIANAGVNSNA 1412 SQRW II+KK+ NLNVG GSQ+S+ LA R A+ +AL P + N + + Sbjct: 237 SQRWTIIRKKHKNLNVGGANSNGSQLSEAQLAARHAMSLALDMP------VKNLTTTNIS 290 Query: 1411 AQSSLGQPTPGTETGRTQPSQHDSVPAGLGPLGSSKARVAPPKKPSTKTTLSPESTVTKS 1232 L Q P S + +G LGS AP + ++K T Sbjct: 291 QAQQLSQQGP------------VSTLSQMGSLGS-----APKSRATSKKT---------- 323 Query: 1231 NLNPDSVASKSTLSPDSLVKATAVIVGARIGTASDAASLVKAAQSKNAVHIMPGGS-LIK 1055 ++KST S S++KATAV GARI T S AASL+K AQS+NAVHIMPGGS LIK Sbjct: 324 -------SAKSTFSSQSMLKATAVAAGARIATPSAAASLLKDAQSRNAVHIMPGGSTLIK 376 Query: 1054 SSVAGSSNSFPS-------NVHYICTGLVSRPTSSYSSAPPNASQVGGTHQMQGPSTKSA 896 SSVAG +N P+ NVHY C G + S+YS+ P+ S+ G S K A Sbjct: 377 SSVAGGANPLPANHLGAHPNVHYKCAGPPTTSLSTYSAVAPSVSRTG--------SAKPA 428 Query: 895 ASVVQPSLGGATASDLSGLAETKGGANSDASSGHPDAPAVEKSSSNAAKTIKELVLEGQT 716 A GG A S T +S+ ++ + AVE + KT +E + Sbjct: 429 AP------GGQLAPSPSA---TSVNISSEQTNAATTSLAVEYPAKQETKTSEET----KV 475 Query: 715 DIKGKLTNKQIEGDQNAIAGSTPMDVDASRNSPRDKVEGCQTAVLSKSLEDQAEGDKVSA 536 I G + ++ DQ ++ +T AS D+ T V+ ++ + K Sbjct: 476 PISGNVPKAKVLEDQACVSSNT-----ASEQVQEDQATLSNTEVVLENKKAMVSDTKCLL 530 Query: 535 APDAASKAQGHIDSSSSGQAAGNGDHNIKGNDKTMSLAAEHNGEIPSAIEKIHENGSSSA 356 + A G + S + D + G + S+A E++G + ++ +++ Sbjct: 531 KTETAEN-DGEVAESQNVNDNKIMDFRVAGECENQSVANENSGNQNANEKQTDLPNTATD 589 Query: 355 AKEGAEEMVVDGSAGE 308 E ++E++ +AGE Sbjct: 590 CGEKSDEVLYKATAGE 605 >ref|XP_006338056.1| PREDICTED: uncharacterized protein LOC102605794 isoform X2 [Solanum tuberosum] Length = 544 Score = 340 bits (871), Expect = 2e-90 Identities = 244/590 (41%), Positives = 309/590 (52%), Gaps = 11/590 (1%) Frame = -3 Query: 2302 MVERSAKRQKKNFVSEEDISTVLQRYTATTXXXXXXXXXXXXXXVKIDWNALVKKTTTGI 2123 MVE+ K K+ FV+E+D+ST+LQRYTA T KIDWN LVKKT TGI Sbjct: 1 MVEKR-KLNKRGFVTEDDMSTLLQRYTAFTMLTLLQEVGQVNGS-KIDWNDLVKKTATGI 58 Query: 2122 SSAREYQMLWRHLAYRDSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXACVKV 1943 +SAREYQM+WRHLAYR A KV Sbjct: 59 
TSAREYQMVWRHLAYRKVLLDKFDDDAQPMDDDSDLEYELESFPPVSSEASTEAAAWGKV 118 Query: 1942 LIASGV---SNDPTGSTVEGPLTISMPRGQTS-KARENSQSTNCTLGTNITVPISVQKQP 1775 IASG SN GSTVE LTI +P GQTS NS GT +TVP++VQ QP Sbjct: 119 FIASGALHDSNMSNGSTVEASLTIQIPNGQTSGTVAANSLQGISAYGTKLTVPVTVQTQP 178 Query: 1774 MPFNANAEGLDSNGAANPNLPRRRRKPWSHAEDMELIAAVQKCGEGNWANILKGDFKGDR 1595 MP + AEG+D++G A+ NLPRRRRK W+ AEDMELI AVQKCGEGNWANILK DFKGDR Sbjct: 179 MPSVSAAEGVDTSGPASANLPRRRRKAWTGAEDMELITAVQKCGEGNWANILKTDFKGDR 238 Query: 1594 TASQLSQRWNIIKKKNGNLNVGTGSQISDVHLATRRAVDMALGKPTMPSCSIA-NAGVNS 1418 TASQLSQRW I+K++ + VG GS +S+ LATR AV MA G +C I+ NAG NS Sbjct: 239 TASQLSQRWATIRKQH-VMMVGNGSHLSEAQLATRHAVSMAFGDNVRAACPISPNAGPNS 297 Query: 1417 NAAQSSLGQPTPGTETGRTQPSQHDSVPAGLGPLGSSKARVAPPKKPSTKTTLSPESTVT 1238 + S+ S H + A + G P +K + V Sbjct: 298 GSGPSN---------------SSHFAAAANVASAG-----------PQSK---HQQDLVP 328 Query: 1237 KSNLNPDSVASKSTLSPDSLVKATAVIVGARIGTASD-AASLVKAAQSKNAVHIMPGGS- 1064 + P K ++PD +VKA A+ +R+ T S AASL KAAQSK VHIMPGG+ Sbjct: 329 SKPIIPKIPLPKPAINPDPMVKAAAMAASSRVATHSGAAASLQKAAQSKKGVHIMPGGTP 388 Query: 1063 LIKSSVAGSSNSFPSNVHYICTGLVSRPTSSYSSAPPNASQVGGTHQMQGPSTKSAASVV 884 +KSSV GS N PSNVH+I TGLVS P P N SQ GT Q+Q P +S + V Sbjct: 389 AVKSSVPGSFNGLPSNVHFIRTGLVSCPAD-----PSNTSQ-SGTQQLQAP--RSVSPAV 440 Query: 883 QPSLGGATASDLSGLAETKGGANSDASSGHPDAPAVEKSSSNAAKTIKELVLEGQTDIKG 704 QP T + ++ASSG AP+ ++ K+ + E Q + Sbjct: 441 QPK-------------PTTVPSRTNASSGVRSAPSSYPTTVLEVKSKAAVSQENQIAVLS 487 Query: 703 KLTNKQIEGDQNAIAGSTPMDV----DASRNSPRDKVEGCQTAVLSKSLE 566 +++ + + A +TP N KV+G QT+VL +++ Sbjct: 488 NTRSEKTQVIRAASLANTPQQQVPKDQTFGNLLSGKVDG-QTSVLGDTVK 536 >ref|XP_006433817.1| hypothetical protein CICLE_v10000622mg [Citrus clementina] gi|557535939|gb|ESR47057.1| hypothetical protein CICLE_v10000622mg [Citrus clementina] Length = 612 Score = 338 bits (868), Expect = 5e-90 Identities = 258/689 (37%), Positives = 358/689 (51%), Gaps = 19/689 (2%) Frame = -3 Query: 2302 MVERSAKRQKKNFVSEEDISTVLQRYTATTXXXXXXXXXXXXXXVKIDWNALVKKTTTGI 2123 
MVE + K+QKK +SE D+S++LQRYTA T K+DWNALVKKT+TGI Sbjct: 1 MVENTNKKQKKGSISEGDVSSLLQRYTANTVLALLQEVAQFPDV-KLDWNALVKKTSTGI 59 Query: 2122 SSAREYQMLWRHLAYRDSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXACVKV 1943 S+AREYQMLWRHLAYR++ ACVKV Sbjct: 60 SNAREYQMLWRHLAYRNTLFDKLEDNAQPLDDDSDLEYELEAFPEVSSEASTEAAACVKV 119 Query: 1942 LIASGVSND---PTGSTVEGPLTISMPRGQTSKAR-ENSQSTNCTLGTNITVPISVQKQP 1775 LIASG+ +D P S VE PLTI++P GQ+ +A ENSQ ++ G NITVP++VQK P Sbjct: 120 LIASGLPSDSSLPNSSMVEAPLTINIPNGQSLRASTENSQPSSLMQGMNITVPVAVQKVP 179 Query: 1774 MPFNANAEGLDSNGAANPNLP-RRRRKPWSHAEDMELIAAVQKCGEGNWANILKGDFKGD 1598 +P E LD+NG ++P R++RKPW+ ED+ELI+AVQKCGEGNWANIL+GDFK D Sbjct: 180 LPA-PTPEVLDANGLIGGSMPPRKKRKPWTAEEDLELISAVQKCGEGNWANILRGDFKWD 238 Query: 1597 RTASQLSQRWNIIKKKNGNLNVG---TGSQISDVHLATRRAVDMALGKPT---MPSCSIA 1436 RTASQLSQRWNI++KK+GN+ +G +GSQ+S+ LA R A+ +AL P SC+ Sbjct: 239 RTASQLSQRWNILRKKHGNVILGSNSSGSQLSEAQLAARHAMSLALDMPVKNITASCTNT 298 Query: 1435 NAGVNSNAAQSSLGQPTPGTETGRTQPSQHDSVPAGLGPLGSS-KARVAPPKKPSTKTTL 1259 AG S+A ++ P P T + S + +G GS+ K+RV K P Sbjct: 299 TAGTTSSA---TMNNPVPSTANAEASSVANQSKLSPVGSPGSAVKSRVPLKKMP------ 349 Query: 1258 SPESTVTKSNLNPDSVASKSTLSPDSLVKATAVIVGARIGTASDAASLVKAAQSKNAVHI 1079 +KS DS ++A AV GARI T SDAASL+K AQ+K A+HI Sbjct: 350 -----------------AKSNFGADSSIRAAAVAAGARIVTPSDAASLLKVAQAKKAIHI 392 Query: 1078 MPGG-SLIKSSVAGSSNSFPSNVHYICTGLVSRPTSSYSSAPPNASQVGGTHQMQGPSTK 902 MP G S IKS AGS ++VH L + PT+ Y P+ V PS+ Sbjct: 393 MPSGVSSIKSPSAGS-----ASVH-----LEASPTTRY--VRPSLPVV--------PSSS 432 Query: 901 SAASVVQPSLGGATASDLSGLAETKGGANSDASSGHPDAPAVEKSSSNAAKTIKELVLEG 722 S A S G + L + + + + ++ P E K +E+ + G Sbjct: 433 SPAVTSSASHPGLVK---AALPKVQHNTSCEQTNAVVSVPGTELQLKPEVKAGEEIKVSG 489 Query: 721 QTDIKGKLTNKQIEGDQNAIAGSTPMDVDASRNSPRDKVEGCQTAVLSKSLEDQAEGDKV 542 + + G +K+I+ D P+ E A +++ E+QA V Sbjct: 490 GS-VSGNEPSKEIQLD-----------------LPKLDAEFKNQAAVAE-FENQA---AV 527 Query: 541 SAAPDAASKAQ----GHIDSSSSGQAAGNGDHNIKGNDKTMSLAAEHNGEIPSAIEKIHE 374 + PD++S + G + S+ + Q GNG+ N GND M + NGE +A+++ + Sbjct: 
528 AENPDSSSNMEIVENGQVQSNGN-QPEGNGNQN--GNDDKMVDSPVANGENQAAVKQKNS 584 Query: 373 NGSSSAAKEGAE--EMVVDGSAGEKCPSQ 293 S+ E AE +V+D KC S+ Sbjct: 585 GLPQSSNNEEAELPTLVID-----KCSSK 608 >ref|XP_006338055.1| PREDICTED: uncharacterized protein LOC102605794 isoform X1 [Solanum tuberosum] Length = 550 Score = 337 bits (865), Expect = 1e-89 Identities = 245/596 (41%), Positives = 310/596 (52%), Gaps = 17/596 (2%) Frame = -3 Query: 2302 MVERSAKRQKKNFVSEEDISTVLQRYTATTXXXXXXXXXXXXXXVKIDWNALVKKTTTGI 2123 MVE+ K K+ FV+E+D+ST+LQRYTA T KIDWN LVKKT TGI Sbjct: 1 MVEKR-KLNKRGFVTEDDMSTLLQRYTAFTMLTLLQEVGQVNGS-KIDWNDLVKKTATGI 58 Query: 2122 SSAREYQMLWRHLAYRDSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXACVKV 1943 +SAREYQM+WRHLAYR A KV Sbjct: 59 TSAREYQMVWRHLAYRKVLLDKFDDDAQPMDDDSDLEYELESFPPVSSEASTEAAAWGKV 118 Query: 1942 LIASGV---SNDPTGSTVEGPLTISMPRGQTS-KARENSQSTNCTLGTNITVPISVQKQP 1775 IASG SN GSTVE LTI +P GQTS NS GT +TVP++VQ QP Sbjct: 119 FIASGALHDSNMSNGSTVEASLTIQIPNGQTSGTVAANSLQGISAYGTKLTVPVTVQTQP 178 Query: 1774 MPFNANAEGLDSNGAANPNLPRRRRKPWSHAEDMELIAAVQKCGEGNWANILKGDFKGDR 1595 MP + AEG+D++G A+ NLPRRRRK W+ AEDMELI AVQKCGEGNWANILK DFKGDR Sbjct: 179 MPSVSAAEGVDTSGPASANLPRRRRKAWTGAEDMELITAVQKCGEGNWANILKTDFKGDR 238 Query: 1594 TASQLSQRWNIIKKKNGNLNVGTGSQISDVHLATRRAVDMALGK------PTMPS-CSIA 1436 TASQLSQRW I+K++ + VG GS +S+ LATR AV MA G P P+ C I Sbjct: 239 TASQLSQRWATIRKQH-VMMVGNGSHLSEAQLATRHAVSMAFGDNVRAACPISPNGCGIV 297 Query: 1435 NAGVNSNAAQSSLGQPTPGTETGRTQPSQHDSVPAGLGPLGSSKARVAPPKKPSTKTTLS 1256 +AG NS + S+ S H + A + G P +K Sbjct: 298 SAGPNSGSGPSN---------------SSHFAAAANVASAG-----------PQSK---H 328 Query: 1255 PESTVTKSNLNPDSVASKSTLSPDSLVKATAVIVGARIGTASD-AASLVKAAQSKNAVHI 1079 + V + P K ++PD +VKA A+ +R+ T S AASL KAAQSK VHI Sbjct: 329 QQDLVPSKPIIPKIPLPKPAINPDPMVKAAAMAASSRVATHSGAAASLQKAAQSKKGVHI 388 Query: 1078 MPGGS-LIKSSVAGSSNSFPSNVHYICTGLVSRPTSSYSSAPPNASQVGGTHQMQGPSTK 902 MPGG+ +KSSV GS N PSNVH+I TGLVS P P N SQ GT Q+Q P + Sbjct: 389 MPGGTPAVKSSVPGSFNGLPSNVHFIRTGLVSCPAD-----PSNTSQ-SGTQQLQAP--R 440 Query: 
901 SAASVVQPSLGGATASDLSGLAETKGGANSDASSGHPDAPAVEKSSSNAAKTIKELVLEG 722 S + VQP T + ++ASSG AP+ ++ K+ + E Sbjct: 441 SVSPAVQPK-------------PTTVPSRTNASSGVRSAPSSYPTTVLEVKSKAAVSQEN 487 Query: 721 QTDIKGKLTNKQIEGDQNAIAGSTPMDV----DASRNSPRDKVEGCQTAVLSKSLE 566 Q + +++ + + A +TP N KV+G QT+VL +++ Sbjct: 488 QIAVLSNTRSEKTQVIRAASLANTPQQQVPKDQTFGNLLSGKVDG-QTSVLGDTVK 542 >gb|EOY15457.1| Homeodomain-like superfamily protein isoform 1 [Theobroma cacao] Length = 674 Score = 337 bits (865), Expect = 1e-89 Identities = 255/680 (37%), Positives = 332/680 (48%), Gaps = 78/680 (11%) Frame = -3 Query: 2302 MVERSAKRQKKNFVSEEDISTVLQRYTATTXXXXXXXXXXXXXXVKIDWNALVKKTTTGI 2123 M+E++ K+QKK VSEEDIS++LQRYTATT K++WNALVKKT+TGI Sbjct: 1 MIEKT-KKQKKGSVSEEDISSLLQRYTATTVLALLQEVAQFPGV-KLNWNALVKKTSTGI 58 Query: 2122 SSAREYQMLWRHLAYRDSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXACVKV 1943 S+AREYQMLWRHLAYRD ACVKV Sbjct: 59 SNAREYQMLWRHLAYRDVLLEKLEDGAEPLDDESDLEYELEPCPSVSSEASAEAAACVKV 118 Query: 1942 LIASGVSND---PTGSTVEGPLTISMPRGQTSKAR-ENSQSTNCTLGTNITVPISVQKQP 1775 LIASG+ +D P STVE PLTI++P GQ+ +A ENSQ T G NITVP+SVQKQ Sbjct: 119 LIASGLPSDSSLPNSSTVEAPLTINIPNGQSFRASSENSQPTCSMRGMNITVPVSVQKQI 178 Query: 1774 MPFNANAE-GLDSNGAANPNLP-RRRRKPWSHAEDMELIAAVQKCGEGNWANILKGDFKG 1601 +P +AE L+ NG + NLP RR+RKPWS AED ELIAAVQKCG GNWANIL+GDFKG Sbjct: 179 LPAVTSAETSLEGNGLSGANLPARRKRKPWSEAEDRELIAAVQKCGVGNWANILRGDFKG 238 Query: 1600 DRTASQLSQRWNIIKKKNGNLNV---GTGSQISDVHLATRRAVDMALGKP--TMPSCSIA 1436 DR+ASQL+QRW IIKK+ GNLNV T Q+S+ LATR A+ +AL P + S + Sbjct: 239 DRSASQLAQRWTIIKKRLGNLNVEGNSTIPQLSEAQLATRSALSLALDMPDKNLTSACPS 298 Query: 1435 NAGVNSNAAQSSLGQPTPGTETGRTQPSQHDSVPAGL----------------------- 1325 N + + ++ S+L P T + P+Q + Sbjct: 299 NPALKTTSSNSAL----PSTSGEASVPAQSQFQQGNIASVQAQNLPQQGHIASVQGQNQS 354 Query: 1324 --GPLGSSKARVAPPKKP-----------------------------STKTTLSPESTVT 1238 GP+ S A P K P TKT+ + Sbjct: 355 QQGPITSVSAHNQPQKGPITSVPAQNLSQQGPVASLQVSNQSQQGPMITKTSPGSSGSTL 414 Query: 1237 KSNLNPDSVASKSTLSPDSLVKATAVIVGARIGTASDAASLVKAAQSKNAVHIMPGGSLI 
1058 KS + +KS S S++ ATAV GARIG AASL+KAAQSKNA+HIM Sbjct: 415 KSRVGLKKPPAKSFSSTGSILDATAVAAGARIGGPKAAASLLKAAQSKNAIHIMTSSGSS 474 Query: 1057 KSSVAGSSNSFPSNVHYICTGLVSRPTS-SYSSAPPNASQVGGTHQM--QGPSTKSAASV 887 + S SNV Y+CTGL + P S +S+ N V Q PS S++ Sbjct: 475 AKPLMPSGKEVHSNVQYVCTGLTTEPLSCPVTSSTLNPGSVKSPIQRVEHTPSASSSSLN 534 Query: 886 V----------QPSLGGATASDLSGLAETKGGANSDASSGHPDAPAVEKSSSNAAKTIKE 737 V P++ G +L E K S S G P E N A K Sbjct: 535 VSIQQCNTVTSSPTVDGTLKEELDAAGENK----SFMSDGLPK----ELVKENGACVSKN 586 Query: 736 LVLEGQTDIKGKLTNKQIEGDQNAIAGSTPMDVDASRNSPRDKVEGCQTAVLSKSLEDQA 557 EG + K ++N + E S ++V A+ ++ + VEG Q ++ +E+ Sbjct: 587 EQGEGVREDKPAVSNLESE--------SKNLEVVAAHSNEKSMVEGNQLDAITNPVEESQ 638 Query: 556 EGDKVSAAPDAASKAQGHID 497 S + S+ + I+ Sbjct: 639 NAIDCSLIKKSDSQPEASIN 658 >ref|XP_006472453.1| PREDICTED: putative GPI-anchored protein PB15E9.01c-like [Citrus sinensis] Length = 603 Score = 336 bits (862), Expect = 3e-89 Identities = 253/697 (36%), Positives = 356/697 (51%), Gaps = 27/697 (3%) Frame = -3 Query: 2302 MVERSAKRQKKNFVSEEDISTVLQRYTATTXXXXXXXXXXXXXXVKIDWNALVKKTTTGI 2123 MVE + K+QKK +SE D+S++LQRYTA T K+DWNALVKKT+TGI Sbjct: 1 MVENTNKKQKKGSISEGDVSSLLQRYTANTVLALLQEVAQFPDV-KLDWNALVKKTSTGI 59 Query: 2122 SSAREYQMLWRHLAYRDSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXACVKV 1943 S+AREYQMLWRHLAYR++ ACVKV Sbjct: 60 SNAREYQMLWRHLAYRNTLLDKLEDNAQPLDDDSDLEYELEAFPEVSSEASTEAAACVKV 119 Query: 1942 LIASGVSND---PTGSTVEGPLTISMPRGQTSKAR-ENSQSTNCTLGTNITVPISVQKQP 1775 LIASG+ +D P S VE PLTI++P GQ+ +A ENSQ ++ G NITVP++VQK P Sbjct: 120 LIASGLPSDSSLPNSSMVEAPLTINIPNGQSLRASTENSQPSSLMQGMNITVPVAVQKVP 179 Query: 1774 MPFNANAEGLDSNGAANPNLP-RRRRKPWSHAEDMELIAAVQKCGEGNWANILKGDFKGD 1598 +P E LD+NG ++P R++RKPW+ ED+ELI+AVQKCGEGNWANIL+GDFK D Sbjct: 180 LPA-PTPEVLDANGLIGGSMPPRKKRKPWTAEEDLELISAVQKCGEGNWANILRGDFKWD 238 Query: 1597 RTASQLSQRWNIIKKKNGNLNVG---TGSQISDVHLATRRAVDMALGKPT---MPSCSIA 1436 RTASQLSQRWNI++KK+GN+ +G +GSQ+S+ LA R A+ +AL P SC+ Sbjct: 239 
RTASQLSQRWNILRKKHGNVILGSNSSGSQLSEAQLAARHAMSLALDMPVKNITASCTNT 298 Query: 1435 NAGVNSNAAQSSLGQPTPGTETGRTQPSQHDSVPAGLGPLGSSKARVAPPKKPSTKTTLS 1256 AG S+A ++ P P T + S + +G GS+ P KK Sbjct: 299 TAGTTSSA---TMNNPVPSTANAEASSVANQSKLSPVGSPGSAAKSRVPLKK-------- 347 Query: 1255 PESTVTKSNLNPDSVASKSTLSPDSLVKATAVIVGARIGTASDAASLVKAAQSKNAVHIM 1076 + +KS DS ++A AV GARI T SDAASL+K AQ+K A+HIM Sbjct: 348 --------------MPAKSNFGADSSIRAAAVAAGARIVTPSDAASLLKVAQAKKAIHIM 393 Query: 1075 PGG-SLIKSSVAGSSNSFPSNVHYICTGLVSRPTSSYSSAPPNASQVGGTHQMQGPSTKS 899 P G S IKS AGS+++ L + PT+ Y Sbjct: 394 PSGVSSIKSPSAGSASAH----------LEASPTTRY----------------------- 420 Query: 898 AASVVQPSLGGATASDLSGLAETKGGANSDASSGHPD-----APAVEKSSS----NAAKT 746 V+PSL +S + +S+ HP P V+ ++S NA + Sbjct: 421 ----VRPSLPAVPSSSSPAVT---------SSASHPGLVKAALPKVQHNTSCEQTNAVVS 467 Query: 745 IKELVLEGQTDIKGKLTNKQIEGDQNAIAGSTPMDVDASRNSPRDKVEGCQTAVLSKSLE 566 + L+ + ++K G++ ++G + S N P +++ L + Sbjct: 468 VPATELQLKPEVKA--------GEEIKVSGCS-----VSGNEPSKEIQ-LDLPKLDAEFK 513 Query: 565 DQAEGDKVSAAPDAASKAQ----GHIDSSSSGQAAGNGDHNIKGNDKTMSLAAEHNGEIP 398 +QA V+ PD++S + G + S+ + Q GNG+ N GND M + NGE Sbjct: 514 NQA---AVAENPDSSSNMEIVENGQVQSNGN-QPEGNGNQN--GNDDKMVDSPVANGENQ 567 Query: 397 SAIEKIHENGSSSAAKEGAE--EMVVDGSAGEKCPSQ 293 +A+++ + S+ E AE +V+D KC S+ Sbjct: 568 AAVKQKNSGLPQSSNNEEAELPTLVID-----KCSSK 599 >gb|EMJ23216.1| hypothetical protein PRUPE_ppa002943mg [Prunus persica] Length = 619 Score = 333 bits (855), Expect = 2e-88 Identities = 235/613 (38%), Positives = 328/613 (53%), Gaps = 53/613 (8%) Frame = -3 Query: 2302 MVERSAKRQKKNFVSEEDISTVLQRYTATTXXXXXXXXXXXXXXVKIDWNALVKKTTTGI 2123 MVE++ K +K++++EED + +LQRY A KIDWN LV+KT+TGI Sbjct: 1 MVEKT-KDPEKSYITEEDTANLLQRYQAANVLHLLQEVAHSQDV-KIDWNRLVEKTSTGI 58 Query: 2122 SSAREYQMLWRHLAYRDSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXACVKV 1943 S+AREYQMLWRHLAY ++ ACVKV Sbjct: 59 SNAREYQMLWRHLAYSEAFVDNFDNGAQPVDDDSDLEHELEAFPAVIGEDSTEAAACVKV 118 Query: 1942 LIASGVSNDPT---GSTVEGPLTISMPRGQTSKARENSQSTNCTLGTNITVPISVQKQPM 1772 L+ASG+ 
+D T G+TVE PLTI++P GQ S+ +NSQ G NITVP+SVQKQP+ Sbjct: 119 LMASGLPSDSTHRSGATVEAPLTINIPNGQPSRTHQNSQPPCSMQGMNITVPVSVQKQPL 178 Query: 1771 -----PFNANAEGLDSNGAANPNL-PRRRRKPWSHAEDMELIAAVQKCGEGNWANILKGD 1610 A AEG D+NG+A+ N+ PR++RK WS AED+ELIA V++ GEGNWANIL+GD Sbjct: 179 LAMTTSTGATAEGGDANGSASNNMAPRKKRKKWSEAEDLELIAGVRRYGEGNWANILRGD 238 Query: 1609 FKGDRTASQLSQRWNIIKK-KNGNLNVG--TGSQISDVHLATRRAVDMALGKPTMPSCSI 1439 FKG+RTA+QLSQRW I+K + +LNVG + +++S+ LATR A+ +AL P++ + +I Sbjct: 239 FKGERTANQLSQRWKYIRKHHHQDLNVGGNSSNKLSEAQLATRHAMSLALNMPSITANTI 298 Query: 1438 ANAGVN-------SNAAQSSLGQPTPGTETGRTQPSQHDSVPAGLGPLGSSKARVAPPKK 1280 AG N +NA +SL T E ++Q + P +G LGS Sbjct: 299 GTAGTNTHSKFGGTNATTNSL-PSTAAEEELQSQQGLKPAKPYQMGLLGS---------- 347 Query: 1279 PSTKTTLSPESTVTKSNLNPDSVASKSTLSPDSLVKATAVIVGARIGTASDAASLVKAAQ 1100 ++K+ L+ + T+TK N N D +V+ATAV GARI + SDAASL+KAAQ Sbjct: 348 -TSKSQLTSKKTLTKPNSN-----------TDGMVRATAVAAGARIASPSDAASLLKAAQ 395 Query: 1099 SKNAVHIMP-GGSLIKSSVAGSSNSFPS---NVHYICTGLVSRPTSSYSS---------- 962 +KNAVH++P GGS I+SS+ GS + P N+HY+ TGL + P S+ S Sbjct: 396 AKNAVHVLPTGGSSIQSSLPGSMRTHPEPHPNLHYMHTGLAATPVSTPLSTAVTPSATHP 455 Query: 961 ----APPNASQVGGTHQ-MQGPSTKSAASVVQPSLGGATASDL-SGLAETKGGANSDASS 800 A P SQ T+ + K + + LG + G ++ G N + Sbjct: 456 GSLKALPQTSQHAPTNSTLLSKQIKDVSCSLDSELGCTPTEQVQDGAVISENGQNEEGQK 515 Query: 799 GHPDAPAVEKSSSNAAKTIKELVLEGQTDIKGKLTNK------QIEGDQNA--------I 662 D+P + N + + + LV G DIKG T+ Q E Q+A + Sbjct: 516 DKVDSPDQKAELKNLSTSAENLV--GSLDIKGDETDNIAGIGVQSEERQSAKDNETLCSL 573 Query: 661 AGSTPMDVDASRN 623 G P D+ N Sbjct: 574 KGDDPFAADSCEN 586 >ref|XP_002302346.1| myb family transcription factor family protein [Populus trichocarpa] gi|222844072|gb|EEE81619.1| myb family transcription factor family protein [Populus trichocarpa] Length = 677 Score = 328 bits (842), Expect = 6e-87 Identities = 254/704 (36%), Positives = 353/704 (50%), Gaps = 43/704 (6%) Frame = -3 Query: 2302 MVERSAKRQKKNFVSEEDISTVLQRYTATTXXXXXXXXXXXXXXVKIDWNALVKKTTTGI 2123 M+E+S K+ KK +SEED+ST+LQRYTATT 
KIDWNALVKKT+TGI Sbjct: 1 MIEKS-KKNKKGVISEEDVSTLLQRYTATTLLALLQEVAQFDGA-KIDWNALVKKTSTGI 58 Query: 2122 SSAREYQMLWRHLAYRDSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXA-CVK 1946 S+AREYQMLWRHLAYR A CVK Sbjct: 59 SNAREYQMLWRHLAYRHVLPEKFDDGAHPLDDDDSDLESELEAFPSVTSEASTEAAACVK 118 Query: 1945 VLIASGVSND---PTGSTVEGPLTISMPRGQTSKARENSQSTNCTLGTNITVPISVQKQP 1775 VLIASG+ +D P +TVE PLTI++P G++ +A + ++ G NI VP+SVQK Sbjct: 119 VLIASGLPSDSTHPNNTTVEAPLTINIPNGRSLRATSENSQSDVMRGVNIRVPVSVQKLS 178 Query: 1774 MPFNAN---AEGLDSNGAANPNLP-RRRRKPWSHAEDMELIAAVQKCGEGNWANILKGDF 1607 +P + +E D+NG+ + P RR+RKPWS AEDMELIAAVQK GEGNWA+I++G+F Sbjct: 179 LPAVMSCPASEVYDANGSGSGTFPPRRKRKPWSEAEDMELIAAVQKLGEGNWASIVRGEF 238 Query: 1606 KGDRTASQLSQRWNIIKKKNGNLNVGTGS---QISDVHLATRRAVDMALG-KPTMPSCSI 1439 KGDRTASQLSQRW II+K++GNLNVGT S Q+S+ A R AV MAL P S Sbjct: 239 KGDRTASQLSQRWAIIRKRHGNLNVGTVSSAPQLSETQRAARDAVKMALDPHPAAKSLIA 298 Query: 1438 ANAGVNSNAAQSSLGQPTPGTETGRTQPSQHDSVPAGLGPLGSSKARVAPPKKPSTKTTL 1259 ++AG S ++ P T T P+QH S + SS + Sbjct: 299 SSAGTTSTKTPNNCASP---TITAEASPAQHQSQQRTMMTKSSS---------------I 340 Query: 1258 SPESTVTKSNLNPDSVASKSTLSPDSLVKATAVIVGARIGTASDAASLVKAAQSKNAVHI 1079 P KS + + KS LS D V+A AV GARI T SDAASL+KAAQ+KNAVHI Sbjct: 341 WPVGPAAKSQVMLAKASEKSILSSDP-VRAAAVAAGARIATQSDAASLLKAAQAKNAVHI 399 Query: 1078 MP-GGSLIKSSVAGSSNS---FPSNVHYICTGLVSRPTSS-------------YSSAPP- 953 MP G S IKSS+ G ++ N +I +G+ + PT++ +S PP Sbjct: 400 MPTGSSSIKSSMTGGISTHLDVNPNTRFISSGMATAPTTTRPPASGPCPGLPKATSPPPQ 459 Query: 952 -NASQVGGTHQMQGPSTKSAASVVQPSLGGATASDL-----SGLAETKGGANSDASSGHP 791 AS H P T A Q + A A+ L + T+ ++ +S P Sbjct: 460 MKASSSTAQHTQSTPVTSFNAQSEQTNSVLAKATVLPPQMKASSMTTQNTLSTPITSSTP 519 Query: 790 -DAPAVEKSSSNAAKTIKELVLEGQTDIKGKLTNKQIEGDQNAIAGSTPMDVDASRNSPR 614 + E S TIK+ G ++ N Q++ D ++ +V A+ + Sbjct: 520 SEQTNAESSPKQGIVTIKDTKAFGSQEV----ANGQVQRDGAHVSSEHVQEVKAALTNQE 575 Query: 613 DKVEGCQTAVLSKSLED----QAEGDKVSAAPDAASKAQGHIDSSSSGQAAGNGDHN--I 452 +++ Q A L S E V+ + +Q D+ + ++ + Sbjct: 576 
AELKS-QVAALESSNGSPKLIMNESGLVNVTGNQVDGSQNADDNKMTCSPIKEAENQSAV 634 Query: 451 KGNDKTMSLAAEHNGEIPSAIEKIHENGSSSAAKEGAEEMVVDG 320 + ND+ S+ +E ++PS++ S +K A + ++DG Sbjct: 635 QENDENQSV-SERQADLPSSVSNESCIKVDSISKTEASDGMMDG 677 >gb|EOY15458.1| Homeodomain-like superfamily protein isoform 2 [Theobroma cacao] Length = 606 Score = 327 bits (839), Expect = 1e-86 Identities = 248/632 (39%), Positives = 327/632 (51%), Gaps = 30/632 (4%) Frame = -3 Query: 2302 MVERSAKRQKKNFVSEEDISTVLQRYTATTXXXXXXXXXXXXXXVKIDWNALVKKTTTGI 2123 M+E++ K+QKK VSEEDIS++LQRYTATT K++WNALVKKT+TGI Sbjct: 1 MIEKT-KKQKKGSVSEEDISSLLQRYTATTVLALLQEVAQFPGV-KLNWNALVKKTSTGI 58 Query: 2122 SSAREYQMLWRHLAYRDSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXACVKV 1943 S+AREYQMLWRHLAYRD ACVKV Sbjct: 59 SNAREYQMLWRHLAYRDVLLEKLEDGAEPLDDESDLEYELEPCPSVSSEASAEAAACVKV 118 Query: 1942 LIASGVSND---PTGSTVEGPLTISMPRGQTSKAR-ENSQSTNCTLGTNITVPISVQKQP 1775 LIASG+ +D P STVE PLTI++P GQ+ +A ENSQ T G NITVP+SVQKQ Sbjct: 119 LIASGLPSDSSLPNSSTVEAPLTINIPNGQSFRASSENSQPTCSMRGMNITVPVSVQKQI 178 Query: 1774 MPFNANAE-GLDSNGAANPNLP-RRRRKPWSHAEDMELIAAVQKCGEGNWANILKGDFKG 1601 +P +AE L+ NG + NLP RR+RKPWS AED ELIAAVQKCG GNWANIL+GDFKG Sbjct: 179 LPAVTSAETSLEGNGLSGANLPARRKRKPWSEAEDRELIAAVQKCGVGNWANILRGDFKG 238 Query: 1600 DRTASQLSQRWNIIKKKNGNLNV---GTGSQISDVHLATRRAVDMALGKP--TMPSCSIA 1436 DR+ASQL+QRW IIKK+ GNLNV T Q+S+ LATR A+ +AL P + S + Sbjct: 239 DRSASQLAQRWTIIKKRLGNLNVEGNSTIPQLSEAQLATRSALSLALDMPDKNLTSACPS 298 Query: 1435 NAGVNSNAAQSSLGQPTPGTETGRTQPSQHD--------------SVPA----GLGPLGS 1310 N + + ++ S+L P T + P+Q SVPA GP+ S Sbjct: 299 NPALKTTSSNSAL----PSTSGEASVPAQSQFQQAHNQPQKGPITSVPAQNLSQQGPVAS 354 Query: 1309 SKARVAPPKKPS-TKTTLSPESTVTKSNLNPDSVASKSTLSPDSLVKATAVIVGARIGTA 1133 + + P TKT+ + KS + +KS S S++ ATAV GARIG Sbjct: 355 LQVSNQSQQGPMITKTSPGSSGSTLKSRVGLKKPPAKSFSSTGSILDATAVAAGARIGGP 414 Query: 1132 SDAASLVKAAQSKNAVHIMPGGSLIKSSVAGSSNSFPSNVHYICTGLVSRPTSSYSSAPP 953 AASL+KAAQSKNA+HIM + S S V + T S SS+ Sbjct: 415 KAAASLLKAAQSKNAIHIMTSSGSSAKPLMPSVKSPIQRVEH---------TPSASSSSL 
465 Query: 952 NASQVGGTHQMQGPSTKSAASVVQPSLGGATASDLSGLAETKGGANSDASSGHPDAPAVE 773 N S +Q +T +++ P++ G +L E K S S G P E Sbjct: 466 NVS-------IQQCNTVTSS----PTVDGTLKEELDAAGENK----SFMSDGLPK----E 506 Query: 772 KSSSNAAKTIKELVLEGQTDIKGKLTNKQIEGDQNAIAGSTPMDVDASRNSPRDKVEGCQ 593 N A K EG + K ++N + E S ++V A+ ++ + VEG Q Sbjct: 507 LVKENGACVSKNEQGEGVREDKPAVSNLESE--------SKNLEVVAAHSNEKSMVEGNQ 558 Query: 592 TAVLSKSLEDQAEGDKVSAAPDAASKAQGHID 497 ++ +E+ S + S+ + I+ Sbjct: 559 LDAITNPVEESQNAIDCSLIKKSDSQPEASIN 590 >gb|EMJ25790.1| hypothetical protein PRUPE_ppa1027142mg [Prunus persica] Length = 639 Score = 324 bits (830), Expect = 1e-85 Identities = 211/490 (43%), Positives = 278/490 (56%), Gaps = 14/490 (2%) Frame = -3 Query: 2302 MVERSAKRQKKNFVSEEDISTVLQRYTATTXXXXXXXXXXXXXXVKIDWNALVKKTTTGI 2123 MVE++ K KK ++EED +T+LQRYTATT KIDW LV KT+TGI Sbjct: 1 MVEKT-KDPKKCSITEEDTATLLQRYTATTVLALLQEVAHWPEA-KIDWIRLVAKTSTGI 58 Query: 2122 SSAREYQMLWRHLAYRDSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXACVKV 1943 S+AREYQMLWRHLAYR++ ACVKV Sbjct: 59 SNAREYQMLWRHLAYREALVDKFDNGSQPLDDDSDLEYELEAFPAVCGEASTEAAACVKV 118 Query: 1942 LIASGVSNDPT---GSTVEGPLTISMPRGQTSKARENSQSTNCTLGTNITVPISVQKQPM 1772 LIASG+ +D + G+TVE PLTI++P GQ S+ ENS+ T G NITVP+SV+KQP+ Sbjct: 119 LIASGLPSDSSHRNGTTVEAPLTINIPNGQPSRTHENSEPTCSMQGKNITVPVSVKKQPL 178 Query: 1771 PFN-----ANAEGLDSNGAANPNL-PRRRRKPWSHAEDMELIAAVQKCGEGNWANILKGD 1610 P A A+G D+NG+A+ ++ PR++RK WS AED ELIAAVQKCGEGNWANIL+ D Sbjct: 179 PSATTSSVATADGGDANGSASNSMAPRKKRKKWSEAEDFELIAAVQKCGEGNWANILRAD 238 Query: 1609 FKGDRTASQLSQRWNIIKKKNGNLNVGTGS--QISDVHLATRRAVDMALGKPTMPSCSIA 1436 FKGDRTA QLSQRW IIKK+N LN+G S ++S+ LA R ++ +AL P + + +I Sbjct: 239 FKGDRTAGQLSQRWAIIKKRNQELNLGGNSSGKLSEAQLAARHSLSVALNMPNLTAKTIG 298 Query: 1435 NAGVNS-NAAQSSLGQPTPGTETGRTQPSQHDSVPAGLGPLGSSKARVAPPKKPSTKTTL 1259 AG N+ N + P TG Q S+ + P KKP L Sbjct: 299 TAGTNAHNKFARKVATSNPVLTTGAKAEPQ-------------SQQDLKPTKKPYQMELL 345 Query: 1258 SPESTVTKSNLNPDSVASKSTLSPDSLVKATAVIVGARIGTASDAASLVKAAQSKNAVHI 1079 + TKS + + +K + D +V+A AV 
GARI + SDAASL+KAAQ+KNAVHI Sbjct: 346 ---GSTTKSQVTSKNTLTKPNCNDDDIVRAIAVAAGARIASPSDAASLLKAAQAKNAVHI 402 Query: 1078 MPGGSLIKSSVAG--SSNSFPSNVHYICTGLVSRPTSSYSSAPPNASQVGGTHQMQGPST 905 MP I+SS+ G S++S P ++ TGL + S+ PP H S+ Sbjct: 403 MPTSGSIQSSLPGGMSTHSEPHPNLHMRTGLAG---ITLSTPPPTDVTPSAVHP---GSS 456 Query: 904 KSAASVVQPS 875 K+ + QP+ Sbjct: 457 KALPPMSQPT 466 >ref|XP_004237998.1| PREDICTED: uncharacterized protein LOC101255687 [Solanum lycopersicum] Length = 571 Score = 324 bits (830), Expect = 1e-85 Identities = 244/612 (39%), Positives = 308/612 (50%), Gaps = 24/612 (3%) Frame = -3 Query: 2302 MVERSAKRQKKNFVSEEDISTVLQRYTATTXXXXXXXXXXXXXXVKIDWNALVKKTTTGI 2123 MVE+ K K+ FV+E+D+ST+LQRYTA T KIDWN LVKKT TGI Sbjct: 1 MVEKR-KVNKRGFVTEDDMSTLLQRYTAFTMLTLLQEVGQVNGS-KIDWNDLVKKTATGI 58 Query: 2122 SSAREYQMLWRHLAYRDSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXACVKV 1943 +SAREYQM+WRHLAYR A KV Sbjct: 59 TSAREYQMVWRHLAYRKVLLDKFDDNAQPMDDDSDLEYELESFPPVSSEASTEAAAWGKV 118 Query: 1942 LIASGV---SNDPTGSTVEGPLTISMPRGQTS-KARENSQSTNCTLGTNITVPISVQKQP 1775 IASG SN G+TVE LTI +P GQTS NS G +TVP++VQ QP Sbjct: 119 FIASGALRDSNMSNGNTVEASLTIQIPNGQTSGTVAANSLQGISAFGKKLTVPVTVQTQP 178 Query: 1774 MPFNANAEGLDSNGAANPNLPRRRRKPWSHAEDMELIAAVQKCGEGNWANILKGDFKGDR 1595 MP + AEGLD++G A NLPRRRRK W+ AEDMELI AVQK GEGNWANILK DFKGDR Sbjct: 179 MPSVSAAEGLDTSGPATANLPRRRRKAWTGAEDMELITAVQKYGEGNWANILKTDFKGDR 238 Query: 1594 TASQLSQRWNIIKKKNGNLNVGTGSQISDVHLATRRAVDMALGKPTMPSCSIA-NAGVNS 1418 TASQLSQRW I+K++ + VG GS +S+ LA R AV MA +C I+ NAG NS Sbjct: 239 TASQLSQRWATIRKQH-VMMVGNGSHLSEAQLAARHAVSMAFRDNVRAACPISPNAGTNS 297 Query: 1417 NAAQSSLGQPTPGTETGRTQPSQHDSVP--AGLGPLGSSKARVAPPKKPSTKTTLSPEST 1244 + S+ S H + A GP +P + L P Sbjct: 298 GSGPSN---------------SSHFAAADVASAGP------------QPKHQQDLVPSKP 330 Query: 1243 VTKSNLNPDSVASKSTLSPDSLVKATAVIVGARIGTAS-DAASLVKAAQSKNAVHIMPGG 1067 + P K ++PD +VK A+ +R+ T S AASL KAA SK VHIMPGG Sbjct: 331 II-----PKIPLPKPAINPDLMVKTAAMAASSRVATHSGTAASLQKAALSKKGVHIMPGG 385 Query: 1066 
S-LIKSSVAGSSNSFPSNVHYICTGLVSRPTSSYSSAPPNASQVGGTHQMQGPST----- 905 + +KSSV GS N PSNVH++ TGLVSRP + P NA Q GT Q+ P T Sbjct: 386 TPAVKSSVPGSFNGLPSNVHFMRTGLVSRP-----AGPSNAPQ-SGTQQLHAPRTQQLQA 439 Query: 904 -KSAASVVQPSLGGATASDLSGLAETKGGANSDASSGHPDAPAVEKSSSNAAKTIKELVL 728 +S + VQP T + ++ASSG AP+ ++ K+ + Sbjct: 440 PRSVSPAVQPK-------------PTTVPSRTNASSGVRSAPSSYPTTVLDVKSKAAVSQ 486 Query: 727 EGQTDIKGKLTNKQIEGDQNAIAGSTPMDVDASRNSPRD---------KVEGCQTAVLSK 575 E Q + ++ + Q A +TP + P+D KVEG QT+VL Sbjct: 487 ENQIAVLSNTRGEKTQVIQAASLANTP-----QQQVPKDQNFGDLLSGKVEG-QTSVLCD 540 Query: 574 SLEDQAEGDKVS 539 +++ K S Sbjct: 541 TVKKLGGESKAS 552 >ref|XP_002514048.1| DNA binding protein, putative [Ricinus communis] gi|223547134|gb|EEF48631.1| DNA binding protein, putative [Ricinus communis] Length = 608 Score = 311 bits (796), Expect = 1e-81 Identities = 250/666 (37%), Positives = 338/666 (50%), Gaps = 26/666 (3%) Frame = -3 Query: 2302 MVERSAKRQ-KKNFVSEEDISTVLQRYTATTXXXXXXXXXXXXXXVKIDWNALVKKTTTG 2126 M+E+S K +K +SEEDIS++LQRYTA T KIDWNALVKKTTTG Sbjct: 1 MIEKSKKHNSRKGLISEEDISSLLQRYTANTVLALLQEVAQFEGV-KIDWNALVKKTTTG 59 Query: 2125 ISSAREYQMLWRHLAYRDSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXACVK 1946 I + REYQMLWRHLAY+ + ACVK Sbjct: 60 IKNVREYQMLWRHLAYKHTLIDNLDDGAQPLDDDSDLEYELEAFPDVSSEASAEAAACVK 119 Query: 1945 VLIASGVSND---PTGSTVEGPLTISMPRGQTSKA-RENSQSTNCTLGTNITVPISVQKQ 1778 VLIASG ++D P +TVE PLTI++P GQ+++A ENSQ G NITVP+S+QKQ Sbjct: 120 VLIASGATSDSTHPNSATVEAPLTINIPNGQSARAISENSQPATMR-GMNITVPVSIQKQ 178 Query: 1777 PMPFNANAEGLDSNGAANPNLP-RRRRKPWSHAEDMELIAAVQKCGEGNWANILKGDFKG 1601 P+P A+ E D NG N N+P RR+RKPWS AED+ELIAAVQK GEGNWANIL+ +F Sbjct: 179 PLPTVASTEVFDGNGLGNGNIPPRRKRKPWSEAEDLELIAAVQKYGEGNWANILRSEFTW 238 Query: 1600 DRTASQLSQRWNIIKKKNGNLN-VG--TGSQISDVHLATRRAVDMALGKPTMPSCSIANA 1430 DRTASQLSQRW II+K++GN N VG +G Q+S+ A R A+++AL P Sbjct: 239 DRTASQLSQRWAIIRKRHGNWNPVGNTSGVQLSEEWRAARHAMNLALDPP---------- 288 Query: 1429 GVNSNAAQSSLGQPTPGTETGRTQPSQHDSVPAGLGPLGSSKARVAPPKKPSTKTTLSPE 1250 V + + G+ TP + +P S P + 
PLGS+ K+P Sbjct: 289 -VKNKFTNNISGEATPAQHQSQ-RPFAAKSSP--MVPLGSAPKSQIAVKRP--------- 335 Query: 1249 STVTKSNLNPDSVASKSTLSPDSLVKATAVIVGARIGTASDAASLVKAAQSKNAVHIMP- 1073 +K LS D V+ATAV GARI T SDAASL+KAAQ+KNAVHIMP Sbjct: 336 --------------AKPDLSSDP-VRATAVAAGARIATQSDAASLLKAAQAKNAVHIMPT 380 Query: 1072 GGSLIKSSVAGSSNSFPSNVHYICTGLVSRPTSSYSSAPPNASQVGGTHQMQGPSTKSAA 893 GGS +KS++ G + S++S A PN T+ + S +S Sbjct: 381 GGSSMKSALPGGA-------------------SNHSEAHPNVH----TNDLAAGS-RSTL 416 Query: 892 SVVQPS----LGGATASDLSGLAETKGGANSDASSGHPDAPA-VEKSSSNAAKTIKELVL 728 VV PS +T + +++T N A + + PA + ++ A K + E Sbjct: 417 PVVSPSAIRPAASSTVQHIPSISDT--AKNISAKQFNAELPARKDTETAGAIKILSEDAK 474 Query: 727 EGQTD-----IKGKLTNKQIEGDQNAIAG-----STPMDVDASRNSPRDKVEGCQTAVLS 578 E Q + G +KQ++ ++ A T + V S +S K+E + ++ Sbjct: 475 EQQVKEHGACVSGNELSKQVQEEKAAFPNREAECKTQLAVSES-SSAASKLEMADSNMMD 533 Query: 577 KSLEDQAEGDKVSAAPDAASKAQGHIDSSSSGQAAGNGDHNIKGN-DKTMSLAAEHNGEI 401 L AEG + S + + DS S+ Q NGD I + +S+A + E Sbjct: 534 -VLGKPAEGSQNSNSNIITCLSVKTEDSMSAIQV--NGDKQITSDKPDRISMAIDKFSEK 590 Query: 400 PSAIEK 383 A+ K Sbjct: 591 IEAVSK 596 >ref|XP_006844749.1| hypothetical protein AMTR_s00016p00255950 [Amborella trichopoda] gi|548847220|gb|ERN06424.1| hypothetical protein AMTR_s00016p00255950 [Amborella trichopoda] Length = 661 Score = 300 bits (768), Expect = 2e-78 Identities = 222/562 (39%), Positives = 285/562 (50%), Gaps = 29/562 (5%) Frame = -3 Query: 2278 QKKNFVSEEDISTVLQRYTATTXXXXXXXXXXXXXXVKIDWNALVKKTTTGISSAREYQM 2099 +KK +SEED S +LQRYTATT K+DWN LVKKT+TGIS+AREYQM Sbjct: 36 KKKGLISEEDASLLLQRYTATTILALLQEVAQFAGP-KVDWNVLVKKTSTGISNAREYQM 94 Query: 2098 LWRHLAYRDSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXACVKVLIASGVSN 1919 LWRHLAYR + ACVKVLIAS Sbjct: 95 LWRHLAYRTALAEKLEDDAEPMDDDSDLEFEVEASPTPSNEALAEATACVKVLIASSDPG 154 Query: 1918 DPTGSTVEGPLTISMPRG-QTSKARENSQSTNCT-LGTNITVPISVQKQPMPFNANAEGL 1745 + +E PLTI++P QT A+ +++++CT GTNITVP+SVQKQP+P +AEGL Sbjct: 155 PSNRTIIEAPLTINVPNNAQTLPAQSENRNSSCTGQGTNITVPVSVQKQPLPTVTSAEGL 214 Query: 1744 
DSNGAANPNLPRRRRKPWSHAEDMELIAAVQKCGEGNWANILKGDFKGDRTASQLSQRWN 1565 +SNG A LPRR+RKPW+ ED ELIAAVQKCGEGNWANILKGDFK DRTASQLSQRW+ Sbjct: 215 NSNGVAG--LPRRKRKPWTSEEDKELIAAVQKCGEGNWANILKGDFKHDRTASQLSQRWS 272 Query: 1564 IIKKKNGNLNVGTG-----SQISDVHLATRRAVDMALGKPTMPSCSIANAGVNSNAAQSS 1400 IIKKK N + G S +++ ATR+AV +AL P + S ++++ G S S Sbjct: 273 IIKKKQANSDSKVGGSSNSSALTEAQQATRQAVSIALNMP-ISSNTLSSGG--SGTFSSI 329 Query: 1399 LGQPTPGTETGRTQPSQHDSVPAGLGPLGSSKARVAPPKKPSTKTTLSPESTVTKSNLNP 1220 + P P +Q Q A GP SKAR PP K +T T Sbjct: 330 VRPPAPLF----SQVPQQGPDQAHRGP---SKAR--PPAKKATPT--------------Q 366 Query: 1219 DSVASKSTLSPDSLVKATAVIVGARIGTASDAASLVKAAQSKNAVHIMP----------- 1073 K T P+ LV+A AV GARI AS ASL+KAAQS N VH P Sbjct: 367 GQAQMKPTNGPNPLVQAAAVAAGARIAPASTVASLLKAAQSGNVVHFGPPKPLAGPSGPV 426 Query: 1072 --GGSLIKSSVAGS---SNSFPSNVHYICTGLVSRPTSS-YSSAPPNASQVGGTHQMQGP 911 G+ S + G+ + P+NVHYI T PT Y+ P + G+ + + Sbjct: 427 KLSGTRPASGINGTTMFTGPRPANVHYITTS--DNPTPPVYTGMTPTFQRPNGSGRGRTQ 484 Query: 910 STKSAASVVQPSLGGAT-----ASDLSGLAETKGGANSDASSGHPDAPAVEKSSSNAAKT 746 + A + LG A +S SG+ E G + E + Sbjct: 485 TRPMNADMGPVGLGSARMVSIGSSSTSGVGE--GVKGEECVKVGLAEELKETPTEKNQSM 542 Query: 745 IKELVLEGQTDIKGKLTNKQIE 680 I+ +E D++ LT +QI+ Sbjct: 543 IESTSMESSGDLERDLTKEQIQ 564 >emb|CAN60243.1| hypothetical protein VITISV_010188 [Vitis vinifera] Length = 598 Score = 300 bits (768), Expect = 2e-78 Identities = 218/568 (38%), Positives = 304/568 (53%), Gaps = 19/568 (3%) Frame = -3 Query: 1954 CVKVLIASGVSND---PTGSTVEGPLTISMPRGQTSKA-RENSQSTNCTLGTNITVPISV 1787 CVKVLIAS + +D P S VE PLTI++P GQ+S+A E S+ + GTNIT+P+SV Sbjct: 78 CVKVLIASSLPSDSSLPNSSMVEAPLTINIPCGQSSRAPSEYSRLSGSMQGTNITIPVSV 137 Query: 1786 QKQPMPFNANAEGLDSNGAANPNLP-RRRRKPWSHAEDMELIAAVQKCGEGNWANILKGD 1610 QK +EG D+NG+ + +LP R++RKPWS ED ELIAAVQKCGEGNWANILKGD Sbjct: 138 QK--------SEGFDANGSTSGSLPARKKRKPWSSDEDKELIAAVQKCGEGNWANILKGD 189 Query: 1609 FKGDRTASQLSQRWNIIKKKNGNLNVG----TGSQISDVHLATRRAVDMALGKPTMP-SC 1445 FKGDR+ASQLSQRW II+KK+ NLNVG GSQ+S+ LA R A+ +AL P + 
Sbjct: 190 FKGDRSASQLSQRWTIIRKKHKNLNVGGANSNGSQLSEAQLAARHAMSLALDMPVKNLTT 249 Query: 1444 SIANAGVNSNAAQSSLGQPTPGTETGRTQPSQHDSVPAGLGPLGSSKARVAPPKKP-STK 1268 S + AG N NA S+ P E ++PA S+A+ + P ST Sbjct: 250 SSSIAGTNPNATSSNSAFPATPAE----------ALPAS---TNISQAQQLSQQGPVSTL 296 Query: 1267 TTLSPESTVTKSNLNPDSVASKSTLSPDSLVKATAVIVGARIGTASDAASLVKAAQSKNA 1088 + + + KS ++KST S S++KATAV GARI T S AASL+K AQS+NA Sbjct: 297 SQMGSLGSAPKSRATSKKTSAKSTFSSQSMLKATAVAAGARIATPSAAASLLKDAQSRNA 356 Query: 1087 VHIMPGGS-LIKSSVAGSSNSFPS-------NVHYICTGLVSRPTSSYSSAPPNASQVGG 932 VHIMPGGS LIKSSVAG +N P+ NVHY C G + S+YS+ P+ S+ G Sbjct: 357 VHIMPGGSTLIKSSVAGGANPLPANHLGAHPNVHYKCAGPPTTSLSTYSAVAPSVSRTG- 415 Query: 931 THQMQGPSTKSAASVVQPSLGGATASDLSGLAETKGGANSDASSGHPDAPAVEKSSSNAA 752 S K AA GG A S T +S+ ++ + AVE + Sbjct: 416 -------SAKPAAP------GGQLAPSPSA---TSVNISSEQTNAATTSLAVEYPAKQET 459 Query: 751 KTIKELVLEGQTDIKGKLTNKQIEGDQNAIAGSTPMDVDASRNSPRDKVEGCQTAVLSKS 572 KT +E + I G + ++ DQ ++ +T AS D+ T V+ ++ Sbjct: 460 KTSEET----KVPISGNVPKAKVLEDQACVSSNT-----ASEQVQEDQATLSNTEVVLEN 510 Query: 571 LEDQAEGDKVSAAPDAASKAQGHIDSSSSGQAAGNGDHNIKGNDKTMSLAAEHNGEIPSA 392 + K + A G + S + D + G + S+A E++G + Sbjct: 511 KKAMVSDTKCLLKTETAEN-DGEVAESQNVNDNKIMDFRVAGECENQSVANENSGNQNAN 569 Query: 391 IEKIHENGSSSAAKEGAEEMVVDGSAGE 308 ++ +++ E ++E++ +AGE Sbjct: 570 EKQTDLPNTATDCGEKSDEVLYKATAGE 597 >ref|XP_004152740.1| PREDICTED: uncharacterized protein LOC101206820 [Cucumis sativus] Length = 659 Score = 294 bits (753), Expect = 1e-76 Identities = 233/679 (34%), Positives = 328/679 (48%), Gaps = 30/679 (4%) Frame = -3 Query: 2275 KKNFVSEEDISTVLQRYTATTXXXXXXXXXXXXXXVKIDWNALVKKTTTGISSAREYQML 2096 KK V+E+D S++L+RY+ TT KIDWN LVK T+TGIS+ REYQML Sbjct: 4 KKQSVTEKDFSSLLRRYSPTTVLALLQEVAQAPDA-KIDWNDLVKNTSTGISNPREYQML 62 Query: 2095 WRHLAYRDSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXACVKVLIASGVSND 1916 WRHLAYR + AC KV I+SG +D Sbjct: 63 WRHLAYRHALLDDLEDEKAPLEDDSDLECDLEPFPSVSCETLTEAAACAKVFISSGSPSD 122 Query: 1915 
---PTGSTVEGPLTISMPRGQTSKARENSQSTNCTL-GTNITVPISVQKQPMPFNANAEG 1748 P S +E PLTIS+PR T + + C++ G ITVP+SVQ+QP+ +AEG Sbjct: 123 LNVPNSSIIEAPLTISLPRSYTDGVQFENVDPACSVKGAIITVPVSVQRQPVLAPPSAEG 182 Query: 1747 LDSNGAA-NPNLPRRRRKPWSHAEDMELIAAVQKCGEGNWANILKGDFKGDRTASQLSQR 1571 L++NG N RR+RKPWS AED+EL+AAV+KCGEGNWANI++GDF DRTASQLSQR Sbjct: 183 LNTNGPTYGNNASRRKRKPWSEAEDLELMAAVKKCGEGNWANIIRGDFLSDRTASQLSQR 242 Query: 1570 WNIIKKKNGNLNVG---TGSQISDVHLATRRAVDMALGKPTMPSCSIANAGVNSNAAQSS 1400 W IIKKK+GNLNVG G+Q+S+V LA R A+ +ALG+ A +N +A+ S+ Sbjct: 243 WAIIKKKHGNLNVGVNTAGTQLSEVQLAARHAMSVALGR----HVGSLKARINGSASTST 298 Query: 1399 LGQPTPGTETGRTQPSQHDSVPAGLGPLGSSKARVAPPKKPSTKTTLSPESTVTKSNLNP 1220 +G + T ++ Q L S P S+ T + T +K Sbjct: 299 IGNGSSLTTVATSEQVQ--------DKLHQSPTHAKPSSIGSSSLTAKTQVTTSK----- 345 Query: 1219 DSVASKSTLSPDSLVKATAVIVGARIGTASDAASLVKAAQSKNAVHIMPGGSLIKSSVAG 1040 + KS+ D +V+A AV GARI + +DAASL+KAAQSKNA+HIM + S+ Sbjct: 346 -KMVPKSSFDSDCIVRAAAVAAGARIASPADAASLLKAAQSKNAIHIM--AKVPASTKTL 402 Query: 1039 SSNSFPSNVHYICTGLVSRPTSSYSSAPPNASQVGGTHQMQGPSTKSAASVVQPSLGGAT 860 + PS H + PT S+ P GG ++ P+T +S VQ A Sbjct: 403 TPGRGPS--HLEAHPSIKLPT--LSTTPTVVPSRGGPLKITSPTTAKLSS-VQTDQNTAV 457 Query: 859 ASDLSGLAETKGGANSDASSGHPDAPAVEKSSSNAAKTIKELVLEG--QTDIKGK--LTN 692 AS + A + AS+ D ++ + A+ I+ L G T KG+ L+ Sbjct: 458 ASATASTASATDQNTAVASTASAD--SLSEKEIKIAEEIRGRSLAGVQATSQKGEHCLSK 515 Query: 691 KQIEGDQNAIAGSTPMDVDAS-RNSPRDKVEGCQTAVLSKSLEDQA-EGDKVSAAPDAAS 518 + + G + P D+ + +V+ + A L L+ QA E S++ Sbjct: 516 QSLSG---RVQQEKPADLGPPFKRQSSGRVQEEKPAELGPPLKRQATETSNCSSSSQNMP 572 Query: 517 KAQGHI-----------DSSSSGQAAGNGDHNIKGNDKTMSLAAEHNGEIPS-----AIE 386 A G+ S++ G+ D N + A + +I S I Sbjct: 573 MADGNTKVETCNQAEERQKSNANMVTGSSDQQGIMNQSQVERAEPQDMDINSDGKDRPIT 632 Query: 385 KIHENGSSSAAKEGAEEMV 329 K +S KE A E++ Sbjct: 633 KTDRCSENSRHKEAASEIL 651 >ref|XP_004163958.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC101223883 [Cucumis sativus] Length = 659 Score = 294 bits (752), Expect = 2e-76 Identities = 233/679 
(34%), Positives = 328/679 (48%), Gaps = 30/679 (4%) Frame = -3 Query: 2275 KKNFVSEEDISTVLQRYTATTXXXXXXXXXXXXXXVKIDWNALVKKTTTGISSAREYQML 2096 KK V+E+D S++L+RY+ TT KIDWN LVK T+TGIS+ REYQML Sbjct: 4 KKQSVTEKDFSSLLRRYSPTTVLALLQEVAQAPDA-KIDWNDLVKXTSTGISNPREYQML 62 Query: 2095 WRHLAYRDSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXACVKVLIASGVSND 1916 WRHLAYR + AC KV I+SG +D Sbjct: 63 WRHLAYRHALLDDLEDEKAPLEDDSDLECDLEPFPSVSCETLTEAAACAKVFISSGSPSD 122 Query: 1915 ---PTGSTVEGPLTISMPRGQTSKARENSQSTNCTL-GTNITVPISVQKQPMPFNANAEG 1748 P S +E PLTIS+PR T + + C++ G ITVP+SVQ+QP+ +AEG Sbjct: 123 LNVPNSSIIEAPLTISLPRSYTDGVQFENVDPACSVKGAIITVPVSVQRQPVLAPPSAEG 182 Query: 1747 LDSNGAA-NPNLPRRRRKPWSHAEDMELIAAVQKCGEGNWANILKGDFKGDRTASQLSQR 1571 L++NG N RR+RKPWS AED+EL+AAV+KCGEGNWANI++GDF DRTASQLSQR Sbjct: 183 LNTNGPTYGNNASRRKRKPWSEAEDLELMAAVKKCGEGNWANIIRGDFLSDRTASQLSQR 242 Query: 1570 WNIIKKKNGNLNVG---TGSQISDVHLATRRAVDMALGKPTMPSCSIANAGVNSNAAQSS 1400 W IIKKK+GNLNVG G+Q+S+V LA R A+ +ALG+ A +N +A+ S+ Sbjct: 243 WAIIKKKHGNLNVGVNTAGTQLSEVQLAARHAMSVALGR----HVGSLKARINGSASTST 298 Query: 1399 LGQPTPGTETGRTQPSQHDSVPAGLGPLGSSKARVAPPKKPSTKTTLSPESTVTKSNLNP 1220 +G + T ++ Q L S P S+ T + T +K Sbjct: 299 IGNGSSLTTVATSEQVQ--------DKLHQSPTHAKPSSIGSSSLTAKTQVTTSK----- 345 Query: 1219 DSVASKSTLSPDSLVKATAVIVGARIGTASDAASLVKAAQSKNAVHIMPGGSLIKSSVAG 1040 + KS+ D +V+A AV GARI + +DAASL+KAAQSKNA+HIM + S+ Sbjct: 346 -KMVPKSSFDSDCIVRAAAVAAGARIASPADAASLLKAAQSKNAIHIM--AKVPASTKTL 402 Query: 1039 SSNSFPSNVHYICTGLVSRPTSSYSSAPPNASQVGGTHQMQGPSTKSAASVVQPSLGGAT 860 + PS H + PT S+ P GG ++ P+T +S VQ A Sbjct: 403 TPGRGPS--HLEAHPSIKLPT--LSTTPTVVPSRGGPLKITSPTTAKLSS-VQTDQNTAV 457 Query: 859 ASDLSGLAETKGGANSDASSGHPDAPAVEKSSSNAAKTIKELVLEG--QTDIKGK--LTN 692 AS + A + AS+ D ++ + A+ I+ L G T KG+ L+ Sbjct: 458 ASATASTASATDQNTAVASTASAD--SLSEKEIKIAEEIRGRSLAGVQATSQKGEHCLSK 515 Query: 691 KQIEGDQNAIAGSTPMDVDAS-RNSPRDKVEGCQTAVLSKSLEDQA-EGDKVSAAPDAAS 518 + + G + P D+ + +V+ + A L L+ QA E S++ Sbjct: 516 QSLSG---RVQQEKPADLGPPFKRQSSGRVQEEKPAELGPPLKRQATETSNCSSSSQNMP 
572 Query: 517 KAQGHI-----------DSSSSGQAAGNGDHNIKGNDKTMSLAAEHNGEIPS-----AIE 386 A G+ S++ G+ D N + A + +I S I Sbjct: 573 MADGNTKVETCNQAEERQKSNANMVTGSSDQQGIMNQSQVERAEPQDMDINSDGKDRPIT 632 Query: 385 KIHENGSSSAAKEGAEEMV 329 K +S KE A E++ Sbjct: 633 KTDRCSENSRHKEAASEIL 651