BLASTX nr result
ID: Rehmannia22_contig00012848
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia22_contig00012848 (1302 letters)

Database: ./nr
37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits)  Value

ref|XP_006340456.1| PREDICTED: dentin sialophosphoprotein-like [...   438   e-120
ref|XP_004237664.1| PREDICTED: uncharacterized protein LOC101249...   423   e-116
emb|CAN75603.1| hypothetical protein VITISV_016382 [Vitis vinifera]   381   e-103
ref|XP_006485937.1| PREDICTED: uncharacterized protein LOC102624...   375   e-101
ref|XP_006485936.1| PREDICTED: uncharacterized protein LOC102624...   375   e-101
ref|XP_006485935.1| PREDICTED: uncharacterized protein LOC102624...   375   e-101
ref|XP_006436204.1| hypothetical protein CICLE_v10030525mg [Citr...   375   e-101
ref|XP_006436203.1| hypothetical protein CICLE_v10030525mg [Citr...   375   e-101
gb|EOY18533.1| Tudor/PWWP/MBT superfamily protein isoform 6, par...   375   e-101
gb|EOY18532.1| Tudor/PWWP/MBT superfamily protein isoform 5 [The...   375   e-101
gb|EOY18530.1| Tudor/PWWP/MBT superfamily protein isoform 3 [The...   375   e-101
gb|EOY18528.1| Tudor/PWWP/MBT superfamily protein isoform 1 [The...   375   e-101
ref|XP_002312039.2| hypothetical protein POPTR_0008s04420g [Popu...   374   e-101
ref|XP_002315275.2| dentin sialophosphoprotein [Populus trichoca...   366   1e-98
ref|XP_002523905.1| hypothetical protein RCOM_1068550 [Ricinus c...   355   2e-95
gb|EMJ20098.1| hypothetical protein PRUPE_ppa000448mg [Prunus pe...   353   9e-95
ref|XP_004143691.1| PREDICTED: uncharacterized protein LOC101204...
349 2e-93 emb|CBI31518.3| unnamed protein product [Vitis vinifera] 347 8e-93 gb|EXC19485.1| hypothetical protein L484_014115 [Morus notabilis] 337 5e-90 ref|XP_003535180.1| PREDICTED: uncharacterized protein LOC100784... 333 1e-88 >ref|XP_006340456.1| PREDICTED: dentin sialophosphoprotein-like [Solanum tuberosum] Length = 1656 Score = 438 bits (1127), Expect = e-120 Identities = 243/432 (56%), Positives = 296/432 (68%), Gaps = 24/432 (5%) Frame = -1 Query: 1224 SETDNFKLSNEETMKSASSMRTNQSGYLLPPENEGHFAESDLVWGKVRSHPWWPGQIFDP 1045 SET + + NE+ + S+ GYL+PPENEG ++ SDLVWGKVRSHPWWPGQIFDP Sbjct: 1025 SETSHTVMLNEKPV----SLLNMHPGYLIPPENEGEYSISDLVWGKVRSHPWWPGQIFDP 1080 Query: 1044 ADASEKAVKYYKKDSYLVAYFGDRTFAWNDKSLLKPFRSYFSQIEKQSKSEAFQNAVSSA 865 +DASEKA+KY+KKD +LVAYFGDRTFAWND S+L+PF S+FSQIEKQS SE FQNA+SSA Sbjct: 1081 SDASEKAIKYHKKDGFLVAYFGDRTFAWNDASVLRPFCSHFSQIEKQSNSETFQNAISSA 1140 Query: 864 LEEVSRRVELGLACSCTPKGAYGKIEAQVVENTGIREESSRRYGVDHSSRASYFEPDKFL 685 LEEVSRRVELGLACSCTP +Y +I Q+VENTGIREESS+RYGVD S+ + F PDK L Sbjct: 1141 LEEVSRRVELGLACSCTPGDSYDEISCQIVENTGIREESSKRYGVDKSTGVTSFVPDKLL 1200 Query: 684 EYVTELAPHASSRADRLDLVIARAQLSAFYRFKGYHSPTEFSPSGELLENNAD------- 526 Y+ LA + RADRLDL IARAQL AF RFKGY P +FS SGE LEN+AD Sbjct: 1201 HYMKALALSPTCRADRLDLTIARAQLVAFCRFKGYRLPPQFSLSGEFLENDADIPHVDSA 1260 Query: 525 ----------TEKISNKMLDSEKWKHTPKDGSQSR-KKRSLMELMGDR--EYSPDAEDVG 385 +E+ + + K KH+ KD SQ++ K+RSL ELM + EYSPD ED Sbjct: 1261 IDDNGHASEGSEQHPTSKVSARKRKHSSKDSSQNKLKERSLSELMDNMECEYSPDGEDDL 1320 Query: 384 XXXXXXXXXXXKTFDFQADGSNKRVSIHAAKVSTSTSQTPKPSFKIGECIRRVASKLTGS 205 K D + DGS+K+ S +AAKVST+ S +PKPSF+IGECI+RVAS+LT S Sbjct: 1321 DEKSFTSSKKRKAVDSRTDGSDKKTSAYAAKVSTTASVSPKPSFRIGECIQRVASQLTRS 1380 Query: 204 TLSVKGSNDEMVID----DSPKIYDISEKQSVVVSAESFSVDEILSQLQTVAQNPKKGCN 37 +KGS+D+ D DSP K VV+ E S +E+LSQLQ VA+ P K N Sbjct: 1381 ASLLKGSSDQSGADVQSQDSP-------KGKVVIPTELPSANELLSQLQLVARAPLKSYN 1433 Query: 36 FQNNTRTFFTGF 1 F + TFF+GF Sbjct: 1434 FLKTSTTFFSGF 1445 >ref|XP_004237664.1| PREDICTED: 
uncharacterized protein LOC101249817 [Solanum lycopersicum] Length = 1654 Score = 423 bits (1088), Expect = e-116 Identities = 239/432 (55%), Positives = 293/432 (67%), Gaps = 24/432 (5%) Frame = -1 Query: 1224 SETDNFKLSNEETMKSASSMRTNQSGYLLPPENEGHFAESDLVWGKVRSHPWWPGQIFDP 1045 SET + + +E+ + S+ GYL+PPENEG ++ SDLVWGKVRSHPWWPGQIFDP Sbjct: 1024 SETSHTLMFSEKPV----SLLNMHPGYLIPPENEGDYSISDLVWGKVRSHPWWPGQIFDP 1079 Query: 1044 ADASEKAVKYYKKDSYLVAYFGDRTFAWNDKSLLKPFRSYFSQIEKQSKSEAFQNAVSSA 865 +DASEKA+KY+KKD +LVAYFGDRTFAWND S+L+PF SYFSQIEKQS SE FQNA+SSA Sbjct: 1080 SDASEKAIKYHKKDGFLVAYFGDRTFAWNDASVLRPFCSYFSQIEKQSNSETFQNAISSA 1139 Query: 864 LEEVSRRVELGLACSCTPKGAYGKIEAQVVENTGIREESSRRYGVDHSSRASYFEPDKFL 685 LEEVSRRVELGLACSCTPK +Y +I Q+VENTGIREE+S+RYGVD S+ + F PDK L Sbjct: 1140 LEEVSRRVELGLACSCTPKDSYDEISCQIVENTGIREEASKRYGVDKSTGVTSFVPDKLL 1199 Query: 684 EYVTELAPHASSRADRLDLVIARAQLSAFYRFKGYHSPTEFSPSGELLENNAD------- 526 Y+ LA + RADRLDL IARAQL AF RFKGY P +F SGELLEN+AD Sbjct: 1200 HYMKALALSPTCRADRLDLTIARAQLVAFCRFKGYRLPPQFLLSGELLENDADIPHVDSA 1259 Query: 525 ----------TEKISNKMLDSEKWKHTPKDGSQSR-KKRSLMELMGDR--EYSPDAEDVG 385 +E+ + + K KH+ KD SQ++ K+RSL ELM + EYSPD ED Sbjct: 1260 IDDNGHASEGSEQHPTSKVSARKRKHSSKDSSQNKLKERSLSELMDNMECEYSPDGEDDL 1319 Query: 384 XXXXXXXXXXXKTFDFQADGSNKRVSIHAAKVSTSTSQTPKPSFKIGECIRRVASKLTGS 205 K D + D S+K+ S +A KV T+ S +PK SF+IGECI+RVAS+LT S Sbjct: 1320 DEKSFTSSKKRKGVDSRTDRSDKKTSAYAPKVLTTASVSPKTSFRIGECIQRVASQLTRS 1379 Query: 204 TLSVKGSNDEMVID----DSPKIYDISEKQSVVVSAESFSVDEILSQLQTVAQNPKKGCN 37 +KGS+D+ D DSP K VV+ E S +E+LSQLQ VA+ P KG N Sbjct: 1380 ASLLKGSSDQSGADVQSQDSP-------KGKVVIPTELPSANELLSQLQLVARAPMKGYN 1432 Query: 36 FQNNTRTFFTGF 1 + T FF+GF Sbjct: 1433 LKTIT-NFFSGF 1443 >emb|CAN75603.1| hypothetical protein VITISV_016382 [Vitis vinifera] Length = 1887 Score = 381 bits (978), Expect = e-103 Identities = 215/441 (48%), Positives = 273/441 (61%), Gaps = 39/441 (8%) Frame = -1 Query: 1206 KLSNEETMKSASSMRTNQSGYLLPPENEGHFAESDLVWGKVRSHPWWPGQIFDPADASEK 1027 K+ T+K + +R 
+Q+ Y LPPE+EG F+ SDLVWGKVRSHPWWPGQIFDP+DASEK Sbjct: 1224 KMVKRATLKPGNLIRGHQATYQLPPESEGEFSVSDLVWGKVRSHPWWPGQIFDPSDASEK 1283 Query: 1026 AVKYYKKDSYLVAYFGDRTFAWNDKSLLKPFRSYFSQIEKQSKSEAFQNAVSSALEEVSR 847 A+KY+KKD +LVAYFGDRTFAWN+ SLLKPFR++FSQI KQS SE F NAV AL+EVSR Sbjct: 1284 AMKYHKKDCFLVAYFGDRTFAWNEASLLKPFRTHFSQIVKQSNSEVFHNAVDCALDEVSR 1343 Query: 846 RVELGLACSCTPKGAYGKIEAQVVENTGIREESSRRYGVDHSSRASYFEPDKFLEYVTEL 667 RVELGLACSC PK Y +I+ Q+VENTGIR ESSRR GVD S+ S EPD F+EY+ L Sbjct: 1344 RVELGLACSCIPKDDYDEIKCQIVENTGIRPESSRRDGVDKSATMSLLEPDTFVEYIKAL 1403 Query: 666 APHASSRADRLDLVIARAQLSAFYRFKGYHSPTEFSPSGELLENNADTEKISNKM----- 502 A S AD+L+LVIA+AQL AF R KGYH EF G L EN+AD + M Sbjct: 1404 AQFPSGGADQLELVIAKAQLLAFSRLKGYHRLPEFQYCGGLQENDADISCFNEMMEHETD 1463 Query: 501 -------------LDSEKWKHTPKDGSQSRKK-RSLMELMGDREYSPDAEDVG------- 385 S K KH KD + RKK RSL ELM YSPD E+ Sbjct: 1464 VLMGDDGKFKIQNSSSHKRKHNLKDSAYPRKKERSLSELMSGMAYSPDDENDSDGKATSK 1523 Query: 384 -XXXXXXXXXXXKTFDFQADGSNKRVSIHAAKVSTSTSQTPKPSFKIGECIRRVASKLTG 208 +F ++ ++ SI AKVS +++ +P+ SFK+G+CIRR AS+LTG Sbjct: 1524 PVSSSGRKRKVVDSFGNDSEVQDRTESIFVAKVSNTSAPSPRQSFKVGDCIRRAASQLTG 1583 Query: 207 --STLSVKGSNDEMVIDDS----------PKIYDISEKQSVVVSAESFSVDEILSQLQTV 64 S L G + V+D S + + Q +++ E S+DE+LSQL+ Sbjct: 1584 SPSILKCSGERPQKVVDGSIGKLGGPGSDVSLMSPEDPQRMIIPMEYPSLDEMLSQLRLA 1643 Query: 63 AQNPKKGCNFQNNTRTFFTGF 1 A++P KG +F + +FF+ F Sbjct: 1644 ARDPMKGYSFLDTIVSFFSEF 1664 >ref|XP_006485937.1| PREDICTED: uncharacterized protein LOC102624524 isoform X3 [Citrus sinensis] Length = 1372 Score = 375 bits (962), Expect = e-101 Identities = 217/453 (47%), Positives = 285/453 (62%), Gaps = 40/453 (8%) Frame = -1 Query: 1239 MEGHSSETDNFKLSNEE-----TMKSASSMRTNQSGYLLPPENEGHFAESDLVWGKVRSH 1075 +EG S+T+ + + E+ T + S ++ ++ LLP E+EG F SDLVWGKVRSH Sbjct: 707 VEGQDSDTEQTETNEEKFVHRVTARGGSLVKPHRVSCLLPLEDEGEFFVSDLVWGKVRSH 766 Query: 1074 PWWPGQIFDPADASEKAVKYYKKDSYLVAYFGDRTFAWNDKSLLKPFRSYFSQIEKQSKS 895 PWWPGQI+DP+DASEKA+KY+KKD +LVAYFGDRTFAW D S L+ F S+FSQ+EKQS + Sbjct: 
767 PWWPGQIYDPSDASEKAMKYHKKDCFLVAYFGDRTFAWVDASQLRAFYSHFSQVEKQSNA 826 Query: 894 EAFQNAVSSALEEVSRRVELGLACSCTPKGAYGKIEAQVVENTGIREESSRRYGVDHSSR 715 E FQNAV+ ALEEVSRR+ELGLAC C PK AY KI Q+VEN GIR+ESS R GVD + Sbjct: 827 EVFQNAVNCALEEVSRRIELGLACPCIPKDAYDKIRLQIVENAGIRQESSEREGVDKCAS 886 Query: 714 ASYFEPDKFLEYVTELAPHASSRADRLDLVIARAQLSAFYRFKGYHSPTEFSPSGELLEN 535 A F+PDK +E++ A S ADRL+LVIA+AQL +FY FKGY EF G L E+ Sbjct: 887 AQSFQPDKLVEFMKAFALSPSGGADRLELVIAKAQLLSFYHFKGYSELPEFQFCGGLAED 946 Query: 534 NADTEKISNKM------LDSE------------KWKHTPKDGS-QSRKKRSLMELM---- 424 DT + KM +D E K KH KD S+K++SL ELM Sbjct: 947 GVDTSHFAEKMHTTPVSMDDEHIYSETQRSSHHKRKHNLKDSMYPSKKEKSLSELMTGSF 1006 Query: 423 ---GDREYSPDAEDVGXXXXXXXXXXXKTFDFQADGSNK--RVSIHAAKVSTSTSQTPKP 259 D E+ D + G K DF D S++ R +I AKVS ST+ PKP Sbjct: 1007 DSLDDDEFDSDGKAGGKLVSPSSIKKRKVVDFAGDDSSQDGRKTISLAKVSISTANIPKP 1066 Query: 258 SFKIGECIRRVASKLTGSTLSVKGSNDEMV-------IDDSPKIYDISEKQSVVVSAESF 100 SFKIGECIRRVAS++TGS+ SV SN E + DDS + ++ +E + +++ + Sbjct: 1067 SFKIGECIRRVASQMTGSS-SVLKSNSERLQKLDADGSDDSFENFEDAEGKRMILPTDYS 1125 Query: 99 SVDEILSQLQTVAQNPKKGCNFQNNTRTFFTGF 1 S+D++LSQL + A++P +G +F N +FF+ F Sbjct: 1126 SLDDLLSQLHSAAKDPMRGYSFLNMIISFFSDF 1158 >ref|XP_006485936.1| PREDICTED: uncharacterized protein LOC102624524 isoform X2 [Citrus sinensis] Length = 1390 Score = 375 bits (962), Expect = e-101 Identities = 217/453 (47%), Positives = 285/453 (62%), Gaps = 40/453 (8%) Frame = -1 Query: 1239 MEGHSSETDNFKLSNEE-----TMKSASSMRTNQSGYLLPPENEGHFAESDLVWGKVRSH 1075 +EG S+T+ + + E+ T + S ++ ++ LLP E+EG F SDLVWGKVRSH Sbjct: 725 VEGQDSDTEQTETNEEKFVHRVTARGGSLVKPHRVSCLLPLEDEGEFFVSDLVWGKVRSH 784 Query: 1074 PWWPGQIFDPADASEKAVKYYKKDSYLVAYFGDRTFAWNDKSLLKPFRSYFSQIEKQSKS 895 PWWPGQI+DP+DASEKA+KY+KKD +LVAYFGDRTFAW D S L+ F S+FSQ+EKQS + Sbjct: 785 PWWPGQIYDPSDASEKAMKYHKKDCFLVAYFGDRTFAWVDASQLRAFYSHFSQVEKQSNA 844 Query: 894 EAFQNAVSSALEEVSRRVELGLACSCTPKGAYGKIEAQVVENTGIREESSRRYGVDHSSR 715 E FQNAV+ ALEEVSRR+ELGLAC C PK AY KI Q+VEN GIR+ESS R GVD + Sbjct: 845 
EVFQNAVNCALEEVSRRIELGLACPCIPKDAYDKIRLQIVENAGIRQESSEREGVDKCAS 904 Query: 714 ASYFEPDKFLEYVTELAPHASSRADRLDLVIARAQLSAFYRFKGYHSPTEFSPSGELLEN 535 A F+PDK +E++ A S ADRL+LVIA+AQL +FY FKGY EF G L E+ Sbjct: 905 AQSFQPDKLVEFMKAFALSPSGGADRLELVIAKAQLLSFYHFKGYSELPEFQFCGGLAED 964 Query: 534 NADTEKISNKM------LDSE------------KWKHTPKDGS-QSRKKRSLMELM---- 424 DT + KM +D E K KH KD S+K++SL ELM Sbjct: 965 GVDTSHFAEKMHTTPVSMDDEHIYSETQRSSHHKRKHNLKDSMYPSKKEKSLSELMTGSF 1024 Query: 423 ---GDREYSPDAEDVGXXXXXXXXXXXKTFDFQADGSNK--RVSIHAAKVSTSTSQTPKP 259 D E+ D + G K DF D S++ R +I AKVS ST+ PKP Sbjct: 1025 DSLDDDEFDSDGKAGGKLVSPSSIKKRKVVDFAGDDSSQDGRKTISLAKVSISTANIPKP 1084 Query: 258 SFKIGECIRRVASKLTGSTLSVKGSNDEMV-------IDDSPKIYDISEKQSVVVSAESF 100 SFKIGECIRRVAS++TGS+ SV SN E + DDS + ++ +E + +++ + Sbjct: 1085 SFKIGECIRRVASQMTGSS-SVLKSNSERLQKLDADGSDDSFENFEDAEGKRMILPTDYS 1143 Query: 99 SVDEILSQLQTVAQNPKKGCNFQNNTRTFFTGF 1 S+D++LSQL + A++P +G +F N +FF+ F Sbjct: 1144 SLDDLLSQLHSAAKDPMRGYSFLNMIISFFSDF 1176 >ref|XP_006485935.1| PREDICTED: uncharacterized protein LOC102624524 isoform X1 [Citrus sinensis] Length = 1409 Score = 375 bits (962), Expect = e-101 Identities = 217/453 (47%), Positives = 285/453 (62%), Gaps = 40/453 (8%) Frame = -1 Query: 1239 MEGHSSETDNFKLSNEE-----TMKSASSMRTNQSGYLLPPENEGHFAESDLVWGKVRSH 1075 +EG S+T+ + + E+ T + S ++ ++ LLP E+EG F SDLVWGKVRSH Sbjct: 744 VEGQDSDTEQTETNEEKFVHRVTARGGSLVKPHRVSCLLPLEDEGEFFVSDLVWGKVRSH 803 Query: 1074 PWWPGQIFDPADASEKAVKYYKKDSYLVAYFGDRTFAWNDKSLLKPFRSYFSQIEKQSKS 895 PWWPGQI+DP+DASEKA+KY+KKD +LVAYFGDRTFAW D S L+ F S+FSQ+EKQS + Sbjct: 804 PWWPGQIYDPSDASEKAMKYHKKDCFLVAYFGDRTFAWVDASQLRAFYSHFSQVEKQSNA 863 Query: 894 EAFQNAVSSALEEVSRRVELGLACSCTPKGAYGKIEAQVVENTGIREESSRRYGVDHSSR 715 E FQNAV+ ALEEVSRR+ELGLAC C PK AY KI Q+VEN GIR+ESS R GVD + Sbjct: 864 EVFQNAVNCALEEVSRRIELGLACPCIPKDAYDKIRLQIVENAGIRQESSEREGVDKCAS 923 Query: 714 ASYFEPDKFLEYVTELAPHASSRADRLDLVIARAQLSAFYRFKGYHSPTEFSPSGELLEN 535 A F+PDK +E++ A S ADRL+LVIA+AQL +FY FKGY EF G L E+ Sbjct: 924 
AQSFQPDKLVEFMKAFALSPSGGADRLELVIAKAQLLSFYHFKGYSELPEFQFCGGLAED 983 Query: 534 NADTEKISNKM------LDSE------------KWKHTPKDGS-QSRKKRSLMELM---- 424 DT + KM +D E K KH KD S+K++SL ELM Sbjct: 984 GVDTSHFAEKMHTTPVSMDDEHIYSETQRSSHHKRKHNLKDSMYPSKKEKSLSELMTGSF 1043 Query: 423 ---GDREYSPDAEDVGXXXXXXXXXXXKTFDFQADGSNK--RVSIHAAKVSTSTSQTPKP 259 D E+ D + G K DF D S++ R +I AKVS ST+ PKP Sbjct: 1044 DSLDDDEFDSDGKAGGKLVSPSSIKKRKVVDFAGDDSSQDGRKTISLAKVSISTANIPKP 1103 Query: 258 SFKIGECIRRVASKLTGSTLSVKGSNDEMV-------IDDSPKIYDISEKQSVVVSAESF 100 SFKIGECIRRVAS++TGS+ SV SN E + DDS + ++ +E + +++ + Sbjct: 1104 SFKIGECIRRVASQMTGSS-SVLKSNSERLQKLDADGSDDSFENFEDAEGKRMILPTDYS 1162 Query: 99 SVDEILSQLQTVAQNPKKGCNFQNNTRTFFTGF 1 S+D++LSQL + A++P +G +F N +FF+ F Sbjct: 1163 SLDDLLSQLHSAAKDPMRGYSFLNMIISFFSDF 1195 >ref|XP_006436204.1| hypothetical protein CICLE_v10030525mg [Citrus clementina] gi|567887366|ref|XP_006436205.1| hypothetical protein CICLE_v10030525mg [Citrus clementina] gi|557538400|gb|ESR49444.1| hypothetical protein CICLE_v10030525mg [Citrus clementina] gi|557538401|gb|ESR49445.1| hypothetical protein CICLE_v10030525mg [Citrus clementina] Length = 1409 Score = 375 bits (962), Expect = e-101 Identities = 217/453 (47%), Positives = 285/453 (62%), Gaps = 40/453 (8%) Frame = -1 Query: 1239 MEGHSSETDNFKLSNEE-----TMKSASSMRTNQSGYLLPPENEGHFAESDLVWGKVRSH 1075 +EG S+T+ + + E+ T + S ++ ++ LLP E+EG F SDLVWGKVRSH Sbjct: 744 VEGQDSDTEQTETNEEKFVHRVTARGGSLVKPHRVSCLLPLEDEGEFFVSDLVWGKVRSH 803 Query: 1074 PWWPGQIFDPADASEKAVKYYKKDSYLVAYFGDRTFAWNDKSLLKPFRSYFSQIEKQSKS 895 PWWPGQI+DP+DASEKA+KY+KKD +LVAYFGDRTFAW D S L+ F S+FSQ+EKQS + Sbjct: 804 PWWPGQIYDPSDASEKAMKYHKKDCFLVAYFGDRTFAWVDASQLRAFYSHFSQVEKQSNA 863 Query: 894 EAFQNAVSSALEEVSRRVELGLACSCTPKGAYGKIEAQVVENTGIREESSRRYGVDHSSR 715 E FQNAV+ ALEEVSRR+ELGLAC C PK AY KI Q+VEN GIR+ESS R GVD + Sbjct: 864 EVFQNAVNCALEEVSRRIELGLACPCIPKDAYDKIRLQIVENAGIRQESSEREGVDKCAS 923 Query: 714 ASYFEPDKFLEYVTELAPHASSRADRLDLVIARAQLSAFYRFKGYHSPTEFSPSGELLEN 535 A F+PDK +E++ A S ADRL+LVIA+AQL 
+FY FKGY EF G L E+ Sbjct: 924 AQSFQPDKLVEFMKAFALSPSGGADRLELVIAKAQLLSFYHFKGYSELPEFQFCGGLAED 983 Query: 534 NADTEKISNKM------LDSE------------KWKHTPKDGS-QSRKKRSLMELM---- 424 DT + KM +D E K KH KD S+K++SL ELM Sbjct: 984 GVDTSHFAEKMHTTPVSMDDEHIYSETQRSSHHKRKHNLKDSMYPSKKEKSLSELMTGSF 1043 Query: 423 ---GDREYSPDAEDVGXXXXXXXXXXXKTFDFQADGSNK--RVSIHAAKVSTSTSQTPKP 259 D E+ D + G K DF D S++ R +I AKVS ST+ PKP Sbjct: 1044 DSLDDDEFDSDGKAGGKLVSPSSIKKRKVVDFAGDDSSQDGRKTISLAKVSISTANIPKP 1103 Query: 258 SFKIGECIRRVASKLTGSTLSVKGSNDEMV-------IDDSPKIYDISEKQSVVVSAESF 100 SFKIGECIRRVAS++TGS+ SV SN E + DDS + ++ +E + +++ + Sbjct: 1104 SFKIGECIRRVASQMTGSS-SVLKSNSERLQKLDADGSDDSFENFEDAEGKRMILPTDYS 1162 Query: 99 SVDEILSQLQTVAQNPKKGCNFQNNTRTFFTGF 1 S+D++LSQL + A++P +G +F N +FF+ F Sbjct: 1163 SLDDLLSQLHSAAKDPMRGYSFLNMIISFFSDF 1195 >ref|XP_006436203.1| hypothetical protein CICLE_v10030525mg [Citrus clementina] gi|567887368|ref|XP_006436206.1| hypothetical protein CICLE_v10030525mg [Citrus clementina] gi|557538399|gb|ESR49443.1| hypothetical protein CICLE_v10030525mg [Citrus clementina] gi|557538402|gb|ESR49446.1| hypothetical protein CICLE_v10030525mg [Citrus clementina] Length = 1372 Score = 375 bits (962), Expect = e-101 Identities = 217/453 (47%), Positives = 285/453 (62%), Gaps = 40/453 (8%) Frame = -1 Query: 1239 MEGHSSETDNFKLSNEE-----TMKSASSMRTNQSGYLLPPENEGHFAESDLVWGKVRSH 1075 +EG S+T+ + + E+ T + S ++ ++ LLP E+EG F SDLVWGKVRSH Sbjct: 707 VEGQDSDTEQTETNEEKFVHRVTARGGSLVKPHRVSCLLPLEDEGEFFVSDLVWGKVRSH 766 Query: 1074 PWWPGQIFDPADASEKAVKYYKKDSYLVAYFGDRTFAWNDKSLLKPFRSYFSQIEKQSKS 895 PWWPGQI+DP+DASEKA+KY+KKD +LVAYFGDRTFAW D S L+ F S+FSQ+EKQS + Sbjct: 767 PWWPGQIYDPSDASEKAMKYHKKDCFLVAYFGDRTFAWVDASQLRAFYSHFSQVEKQSNA 826 Query: 894 EAFQNAVSSALEEVSRRVELGLACSCTPKGAYGKIEAQVVENTGIREESSRRYGVDHSSR 715 E FQNAV+ ALEEVSRR+ELGLAC C PK AY KI Q+VEN GIR+ESS R GVD + Sbjct: 827 EVFQNAVNCALEEVSRRIELGLACPCIPKDAYDKIRLQIVENAGIRQESSEREGVDKCAS 886 Query: 714 ASYFEPDKFLEYVTELAPHASSRADRLDLVIARAQLSAFYRFKGYHSPTEFSPSGELLEN 535 A 
F+PDK +E++ A S ADRL+LVIA+AQL +FY FKGY EF G L E+ Sbjct: 887 AQSFQPDKLVEFMKAFALSPSGGADRLELVIAKAQLLSFYHFKGYSELPEFQFCGGLAED 946 Query: 534 NADTEKISNKM------LDSE------------KWKHTPKDGS-QSRKKRSLMELM---- 424 DT + KM +D E K KH KD S+K++SL ELM Sbjct: 947 GVDTSHFAEKMHTTPVSMDDEHIYSETQRSSHHKRKHNLKDSMYPSKKEKSLSELMTGSF 1006 Query: 423 ---GDREYSPDAEDVGXXXXXXXXXXXKTFDFQADGSNK--RVSIHAAKVSTSTSQTPKP 259 D E+ D + G K DF D S++ R +I AKVS ST+ PKP Sbjct: 1007 DSLDDDEFDSDGKAGGKLVSPSSIKKRKVVDFAGDDSSQDGRKTISLAKVSISTANIPKP 1066 Query: 258 SFKIGECIRRVASKLTGSTLSVKGSNDEMV-------IDDSPKIYDISEKQSVVVSAESF 100 SFKIGECIRRVAS++TGS+ SV SN E + DDS + ++ +E + +++ + Sbjct: 1067 SFKIGECIRRVASQMTGSS-SVLKSNSERLQKLDADGSDDSFENFEDAEGKRMILPTDYS 1125 Query: 99 SVDEILSQLQTVAQNPKKGCNFQNNTRTFFTGF 1 S+D++LSQL + A++P +G +F N +FF+ F Sbjct: 1126 SLDDLLSQLHSAAKDPMRGYSFLNMIISFFSDF 1158 >gb|EOY18533.1| Tudor/PWWP/MBT superfamily protein isoform 6, partial [Theobroma cacao] Length = 1622 Score = 375 bits (962), Expect = e-101 Identities = 232/504 (46%), Positives = 295/504 (58%), Gaps = 70/504 (13%) Frame = -1 Query: 1302 DEVKKSREAGENSSKTNGFYI-----------------MEGHSSETDNFKLSN--EET-- 1186 D++ KS + ++SS Y+ ME +TD+ + +N E+T Sbjct: 442 DQLAKSSVSEDDSSVGQDLYVEEQVTGAEQDGLDQVQEMEVEEHDTDSEQPTNIDEKTVK 501 Query: 1185 ---MKSASSMRTNQSGYLLPPENEGHFAESDLVWGKVRSHPWWPGQIFDPADASEKAVKY 1015 +K AS+++ +Q+ YLL E EG F+ S LVWGKVRSHPWWPGQIFDP+DASEKAVKY Sbjct: 502 RTVLKCASAVKVHQAKYLLLSEEEGEFSVSGLVWGKVRSHPWWPGQIFDPSDASEKAVKY 561 Query: 1014 YKKDSYLVAYFGDRTFAWNDKSLLKPFRSYFSQIEKQSKSEAFQNAVSSALEEVSRRVEL 835 +KKD +LVAYFGDRTFAWN+ SLLKPFR++FSQIEKQS SE+FQNAV+ ALEEVSRR EL Sbjct: 562 HKKDCFLVAYFGDRTFAWNEASLLKPFRTHFSQIEKQSNSESFQNAVNCALEEVSRRAEL 621 Query: 834 GLACSCTPKGAYGKIEAQVVENTGIREESSRRYGVDHSSRASYFEPDKFLEYVTELAPHA 655 GLACSC P+ AY KI+ Q VENTG+R+ESS R GVD S AS FEPDK ++Y+ LA Sbjct: 622 GLACSCMPQDAYDKIKFQKVENTGVRQESSIRDGVDVSLSASSFEPDKLVDYMKALAESP 681 Query: 654 SSRADRLDLVIARAQLSAFYRFKGYHSPTEFSPSGELLENNADTEKISNKMLDSE----- 490 + DRLDLVI +AQL AFYR KGYH EF G L EN A+T 
M E Sbjct: 682 AGGGDRLDLVIVKAQLLAFYRLKGYHQLPEFQSCGGLSENEANTSHSEENMYFGEEIEHT 741 Query: 489 -------------------------KWKHTPKDGSQ-SRKKRSLMELMGDREYSPDAED- 391 K KH KDG S+K+RSL ELM + SPD E+ Sbjct: 742 TPMDTDAEQISTGQETSMSQRSSYLKRKHNLKDGLYPSKKERSLSELMDETFDSPDVENG 801 Query: 390 -------VGXXXXXXXXXXXKTFDFQADGSNKRVSIHAAKVSTSTSQTPKPSFKIGECIR 232 + +FD ++ +I AKVS +T PKPSFKIGECIR Sbjct: 802 TDGIANRLPSSSSGKKRKAVDSFDDSVVQEGRK-TISLAKVSLTTPHFPKPSFKIGECIR 860 Query: 231 RVASKLTGSTLSVKGSNDEMVIDDSPKIYDI-------SEKQSVVVSAESFSVDEILSQL 73 R AS++TGS L KG D + + YD+ ++++ + V+AE S+DE+LSQL Sbjct: 861 RAASQMTGSPLIPKGKLDGGSENTAADGYDVPFDNSEDAQRKRMNVTAEYSSLDELLSQL 920 Query: 72 QTVAQNPKKGCNFQNNTRTFFTGF 1 A +P K + N +FF+ F Sbjct: 921 HLAACDPMKSYSSFNIFISFFSDF 944 >gb|EOY18532.1| Tudor/PWWP/MBT superfamily protein isoform 5 [Theobroma cacao] Length = 1618 Score = 375 bits (962), Expect = e-101 Identities = 232/504 (46%), Positives = 295/504 (58%), Gaps = 70/504 (13%) Frame = -1 Query: 1302 DEVKKSREAGENSSKTNGFYI-----------------MEGHSSETDNFKLSN--EET-- 1186 D++ KS + ++SS Y+ ME +TD+ + +N E+T Sbjct: 442 DQLAKSSVSEDDSSVGQDLYVEEQVTGAEQDGLDQVQEMEVEEHDTDSEQPTNIDEKTVK 501 Query: 1185 ---MKSASSMRTNQSGYLLPPENEGHFAESDLVWGKVRSHPWWPGQIFDPADASEKAVKY 1015 +K AS+++ +Q+ YLL E EG F+ S LVWGKVRSHPWWPGQIFDP+DASEKAVKY Sbjct: 502 RTVLKCASAVKVHQAKYLLLSEEEGEFSVSGLVWGKVRSHPWWPGQIFDPSDASEKAVKY 561 Query: 1014 YKKDSYLVAYFGDRTFAWNDKSLLKPFRSYFSQIEKQSKSEAFQNAVSSALEEVSRRVEL 835 +KKD +LVAYFGDRTFAWN+ SLLKPFR++FSQIEKQS SE+FQNAV+ ALEEVSRR EL Sbjct: 562 HKKDCFLVAYFGDRTFAWNEASLLKPFRTHFSQIEKQSNSESFQNAVNCALEEVSRRAEL 621 Query: 834 GLACSCTPKGAYGKIEAQVVENTGIREESSRRYGVDHSSRASYFEPDKFLEYVTELAPHA 655 GLACSC P+ AY KI+ Q VENTG+R+ESS R GVD S AS FEPDK ++Y+ LA Sbjct: 622 GLACSCMPQDAYDKIKFQKVENTGVRQESSIRDGVDVSLSASSFEPDKLVDYMKALAESP 681 Query: 654 SSRADRLDLVIARAQLSAFYRFKGYHSPTEFSPSGELLENNADTEKISNKMLDSE----- 490 + DRLDLVI +AQL AFYR KGYH EF G L EN A+T M E Sbjct: 682 AGGGDRLDLVIVKAQLLAFYRLKGYHQLPEFQSCGGLSENEANTSHSEENMYFGEEIEHT 741 Query: 489 
-------------------------KWKHTPKDGSQ-SRKKRSLMELMGDREYSPDAED- 391 K KH KDG S+K+RSL ELM + SPD E+ Sbjct: 742 TPMDTDAEQISTGQETSMSQRSSYLKRKHNLKDGLYPSKKERSLSELMDETFDSPDVENG 801 Query: 390 -------VGXXXXXXXXXXXKTFDFQADGSNKRVSIHAAKVSTSTSQTPKPSFKIGECIR 232 + +FD ++ +I AKVS +T PKPSFKIGECIR Sbjct: 802 TDGIANRLPSSSSGKKRKAVDSFDDSVVQEGRK-TISLAKVSLTTPHFPKPSFKIGECIR 860 Query: 231 RVASKLTGSTLSVKGSNDEMVIDDSPKIYDI-------SEKQSVVVSAESFSVDEILSQL 73 R AS++TGS L KG D + + YD+ ++++ + V+AE S+DE+LSQL Sbjct: 861 RAASQMTGSPLIPKGKLDGGSENTAADGYDVPFDNSEDAQRKRMNVTAEYSSLDELLSQL 920 Query: 72 QTVAQNPKKGCNFQNNTRTFFTGF 1 A +P K + N +FF+ F Sbjct: 921 HLAACDPMKSYSSFNIFISFFSDF 944 >gb|EOY18530.1| Tudor/PWWP/MBT superfamily protein isoform 3 [Theobroma cacao] Length = 1345 Score = 375 bits (962), Expect = e-101 Identities = 232/504 (46%), Positives = 295/504 (58%), Gaps = 70/504 (13%) Frame = -1 Query: 1302 DEVKKSREAGENSSKTNGFYI-----------------MEGHSSETDNFKLSN--EET-- 1186 D++ KS + ++SS Y+ ME +TD+ + +N E+T Sbjct: 442 DQLAKSSVSEDDSSVGQDLYVEEQVTGAEQDGLDQVQEMEVEEHDTDSEQPTNIDEKTVK 501 Query: 1185 ---MKSASSMRTNQSGYLLPPENEGHFAESDLVWGKVRSHPWWPGQIFDPADASEKAVKY 1015 +K AS+++ +Q+ YLL E EG F+ S LVWGKVRSHPWWPGQIFDP+DASEKAVKY Sbjct: 502 RTVLKCASAVKVHQAKYLLLSEEEGEFSVSGLVWGKVRSHPWWPGQIFDPSDASEKAVKY 561 Query: 1014 YKKDSYLVAYFGDRTFAWNDKSLLKPFRSYFSQIEKQSKSEAFQNAVSSALEEVSRRVEL 835 +KKD +LVAYFGDRTFAWN+ SLLKPFR++FSQIEKQS SE+FQNAV+ ALEEVSRR EL Sbjct: 562 HKKDCFLVAYFGDRTFAWNEASLLKPFRTHFSQIEKQSNSESFQNAVNCALEEVSRRAEL 621 Query: 834 GLACSCTPKGAYGKIEAQVVENTGIREESSRRYGVDHSSRASYFEPDKFLEYVTELAPHA 655 GLACSC P+ AY KI+ Q VENTG+R+ESS R GVD S AS FEPDK ++Y+ LA Sbjct: 622 GLACSCMPQDAYDKIKFQKVENTGVRQESSIRDGVDVSLSASSFEPDKLVDYMKALAESP 681 Query: 654 SSRADRLDLVIARAQLSAFYRFKGYHSPTEFSPSGELLENNADTEKISNKMLDSE----- 490 + DRLDLVI +AQL AFYR KGYH EF G L EN A+T M E Sbjct: 682 AGGGDRLDLVIVKAQLLAFYRLKGYHQLPEFQSCGGLSENEANTSHSEENMYFGEEIEHT 741 Query: 489 -------------------------KWKHTPKDGSQ-SRKKRSLMELMGDREYSPDAED- 391 K KH KDG S+K+RSL ELM + SPD E+ Sbjct: 742 
TPMDTDAEQISTGQETSMSQRSSYLKRKHNLKDGLYPSKKERSLSELMDETFDSPDVENG 801 Query: 390 -------VGXXXXXXXXXXXKTFDFQADGSNKRVSIHAAKVSTSTSQTPKPSFKIGECIR 232 + +FD ++ +I AKVS +T PKPSFKIGECIR Sbjct: 802 TDGIANRLPSSSSGKKRKAVDSFDDSVVQEGRK-TISLAKVSLTTPHFPKPSFKIGECIR 860 Query: 231 RVASKLTGSTLSVKGSNDEMVIDDSPKIYDI-------SEKQSVVVSAESFSVDEILSQL 73 R AS++TGS L KG D + + YD+ ++++ + V+AE S+DE+LSQL Sbjct: 861 RAASQMTGSPLIPKGKLDGGSENTAADGYDVPFDNSEDAQRKRMNVTAEYSSLDELLSQL 920 Query: 72 QTVAQNPKKGCNFQNNTRTFFTGF 1 A +P K + N +FF+ F Sbjct: 921 HLAACDPMKSYSSFNIFISFFSDF 944 >gb|EOY18528.1| Tudor/PWWP/MBT superfamily protein isoform 1 [Theobroma cacao] gi|508726632|gb|EOY18529.1| Tudor/PWWP/MBT superfamily protein isoform 1 [Theobroma cacao] gi|508726634|gb|EOY18531.1| Tudor/PWWP/MBT superfamily protein isoform 1 [Theobroma cacao] Length = 1619 Score = 375 bits (962), Expect = e-101 Identities = 232/504 (46%), Positives = 295/504 (58%), Gaps = 70/504 (13%) Frame = -1 Query: 1302 DEVKKSREAGENSSKTNGFYI-----------------MEGHSSETDNFKLSN--EET-- 1186 D++ KS + ++SS Y+ ME +TD+ + +N E+T Sbjct: 442 DQLAKSSVSEDDSSVGQDLYVEEQVTGAEQDGLDQVQEMEVEEHDTDSEQPTNIDEKTVK 501 Query: 1185 ---MKSASSMRTNQSGYLLPPENEGHFAESDLVWGKVRSHPWWPGQIFDPADASEKAVKY 1015 +K AS+++ +Q+ YLL E EG F+ S LVWGKVRSHPWWPGQIFDP+DASEKAVKY Sbjct: 502 RTVLKCASAVKVHQAKYLLLSEEEGEFSVSGLVWGKVRSHPWWPGQIFDPSDASEKAVKY 561 Query: 1014 YKKDSYLVAYFGDRTFAWNDKSLLKPFRSYFSQIEKQSKSEAFQNAVSSALEEVSRRVEL 835 +KKD +LVAYFGDRTFAWN+ SLLKPFR++FSQIEKQS SE+FQNAV+ ALEEVSRR EL Sbjct: 562 HKKDCFLVAYFGDRTFAWNEASLLKPFRTHFSQIEKQSNSESFQNAVNCALEEVSRRAEL 621 Query: 834 GLACSCTPKGAYGKIEAQVVENTGIREESSRRYGVDHSSRASYFEPDKFLEYVTELAPHA 655 GLACSC P+ AY KI+ Q VENTG+R+ESS R GVD S AS FEPDK ++Y+ LA Sbjct: 622 GLACSCMPQDAYDKIKFQKVENTGVRQESSIRDGVDVSLSASSFEPDKLVDYMKALAESP 681 Query: 654 SSRADRLDLVIARAQLSAFYRFKGYHSPTEFSPSGELLENNADTEKISNKMLDSE----- 490 + DRLDLVI +AQL AFYR KGYH EF G L EN A+T M E Sbjct: 682 AGGGDRLDLVIVKAQLLAFYRLKGYHQLPEFQSCGGLSENEANTSHSEENMYFGEEIEHT 741 Query: 489 
-------------------------KWKHTPKDGSQ-SRKKRSLMELMGDREYSPDAED- 391 K KH KDG S+K+RSL ELM + SPD E+ Sbjct: 742 TPMDTDAEQISTGQETSMSQRSSYLKRKHNLKDGLYPSKKERSLSELMDETFDSPDVENG 801 Query: 390 -------VGXXXXXXXXXXXKTFDFQADGSNKRVSIHAAKVSTSTSQTPKPSFKIGECIR 232 + +FD ++ +I AKVS +T PKPSFKIGECIR Sbjct: 802 TDGIANRLPSSSSGKKRKAVDSFDDSVVQEGRK-TISLAKVSLTTPHFPKPSFKIGECIR 860 Query: 231 RVASKLTGSTLSVKGSNDEMVIDDSPKIYDI-------SEKQSVVVSAESFSVDEILSQL 73 R AS++TGS L KG D + + YD+ ++++ + V+AE S+DE+LSQL Sbjct: 861 RAASQMTGSPLIPKGKLDGGSENTAADGYDVPFDNSEDAQRKRMNVTAEYSSLDELLSQL 920 Query: 72 QTVAQNPKKGCNFQNNTRTFFTGF 1 A +P K + N +FF+ F Sbjct: 921 HLAACDPMKSYSSFNIFISFFSDF 944 >ref|XP_002312039.2| hypothetical protein POPTR_0008s04420g [Populus trichocarpa] gi|550332411|gb|EEE89406.2| hypothetical protein POPTR_0008s04420g [Populus trichocarpa] Length = 1360 Score = 374 bits (959), Expect = e-101 Identities = 224/453 (49%), Positives = 285/453 (62%), Gaps = 45/453 (9%) Frame = -1 Query: 1224 SETDNFKLSNEETMKSASSMRTNQSGYLLPPENEGHFAESDLVWGKVRSHPWWPGQIFDP 1045 +E+D +L E K SS + +Q+ YLLPP NEG + SDLVWGKVRSHPWWPGQIFDP Sbjct: 708 AESDQ-QLKVAEASKPGSSEKADQACYLLPPNNEGELSVSDLVWGKVRSHPWWPGQIFDP 766 Query: 1044 ADASEKAVKYYKKDSYLVAYFGDRTFAWNDKSLLKPFRSYFSQIEKQSKSEAFQNAVSSA 865 +DASEKAVKY KKD YLVAYFGDRTFAWN+ SLLKPFRS+FSQ+EKQS SE FQNAV A Sbjct: 767 SDASEKAVKYNKKDCYLVAYFGDRTFAWNEASLLKPFRSHFSQVEKQSNSEVFQNAVDCA 826 Query: 864 LEEVSRRVELGLACSCTPKGAYGKIEAQVVENTGIREESSRRYGVDHSSRASYFEPDKFL 685 LEEVSRRVELGLACSC P+ AY +I+ QV+E+ GIR E+S R GVD + A F+PDK + Sbjct: 827 LEEVSRRVELGLACSCVPEDAYDEIKFQVLESAGIRPEASTRDGVDKDTSADLFQPDKLV 886 Query: 684 EYVTELAPHASSRADRLDLVIARAQLSAFYRFKGYHSPTEFSPSGELLENNADTEKISNK 505 Y+ LA + A+RL+LVIA++QL AFYR KGY E+ G LLE N+DT + ++ Sbjct: 887 GYMKALAQTPAGGANRLELVIAKSQLLAFYRLKGYSELPEYQFYGGLLE-NSDTLRFEDE 945 Query: 504 MLD-------------------------SEKWKHTPKDGSQSRKK-RSLMELMGDREYSP 403 ++D S K KH KD RKK R+L +LMGD S Sbjct: 946 VIDHAPAVYEDHGQISSGEEILQTQRRSSRKCKHNLKDCISPRKKERNLSDLMGDSWDSL 1005 Query: 402 
DAE---------DVGXXXXXXXXXXXKTFDFQADGSNKRVSIHAAKVSTSTSQTPKPSFK 250 D E + TF A + R +I AKVS ST+ PKPSFK Sbjct: 1006 DDEIASDGKANNKLVSPSSGKKRKGADTFADDASMTEGRKTISFAKVS-STTTLPKPSFK 1064 Query: 249 IGECIRRVASKLTGS-------TLSVKGSNDEMVID--DSPKIY-DISEKQSVVVSAESF 100 IGECI+RVAS++TGS + V+GS+D ++ D D+ ++ + +E + ++V +E Sbjct: 1065 IGECIQRVASQMTGSPSILKCNSQKVEGSSDGLIGDGSDTSSVHPEDAEIKKMIVPSEYS 1124 Query: 99 SVDEILSQLQTVAQNPKKGCNFQNNTRTFFTGF 1 S+DE+LSQL AQ+P KG F N +FF+ F Sbjct: 1125 SLDELLSQLHLTAQDPSKGFGFLNIIISFFSDF 1157 >ref|XP_002315275.2| dentin sialophosphoprotein [Populus trichocarpa] gi|550330363|gb|EEF01446.2| dentin sialophosphoprotein [Populus trichocarpa] Length = 1404 Score = 366 bits (940), Expect = 1e-98 Identities = 221/463 (47%), Positives = 279/463 (60%), Gaps = 50/463 (10%) Frame = -1 Query: 1239 MEGHSSETDNFKLSNEE-------TMKSASSMRTNQSGYLLPPENEGHFAESDLVWGKVR 1081 ME +TD +L+ E +K SS + +Q+ YLLPP+NEG F+ SDLVWGKVR Sbjct: 742 MEVEEQDTDTEQLNTMEEKSSKLSVLKPGSSEKEDQACYLLPPDNEGEFSVSDLVWGKVR 801 Query: 1080 SHPWWPGQIFDPADASEKAVKYYKKDSYLVAYFGDRTFAWNDKSLLKPFRSYFSQIEKQS 901 SHPWWPGQIFDP+DASEKA++Y+KKD YLVAYFGDRTFAWN+ SLLKPFRS+FSQ+EKQS Sbjct: 802 SHPWWPGQIFDPSDASEKAMRYHKKDCYLVAYFGDRTFAWNEASLLKPFRSHFSQVEKQS 861 Query: 900 KSEAFQNAVSSALEEVSRRVELGLACSCTPKGAYGKIEAQVVENTGIREESSRRYGVDHS 721 SE FQNAV +LEEVSRRVELGLACSC PK AY +I+ QVVENTGIR E+S R GVD Sbjct: 862 NSEVFQNAVDCSLEEVSRRVELGLACSCLPKDAYDEIKCQVVENTGIRPEASTRDGVDKD 921 Query: 720 SRASYFEPDKFLEYVTELAPHASSRADRLDLVIARAQLSAFYRFKGYHSPTEFSPSGELL 541 A F+PDK ++Y+ LA S A+RL+ VIA++QL AFYR KGY E+ G LL Sbjct: 922 MSADLFQPDKLVDYMKALAQSPSGGANRLEFVIAKSQLLAFYRLKGYSELPEYQFCGGLL 981 Query: 540 EN------------------------NADTEKISNKMLDSEKWKHTPKDGSQSRKK-RSL 436 E ++ E + + S K KH KD RKK R+L Sbjct: 982 EKSDALQFEDGSIDHTSAVYEDHGQISSGEEILQTQRGSSHKRKHNLKDSIYPRKKERNL 1041 Query: 435 MELMGDR------EYSPD--AEDVGXXXXXXXXXXXKTFDFQADGSNKRVSIHAAKVSTS 280 +L+ D E D A + TF A + +R +I AKVS Sbjct: 1042 SDLISDSWDSVGDEIGSDGKANSMLVSPSGKKRKGSDTFADDAYMTGRRKTISFAKVS-- 1099 Query: 279 
TSQTPKPSFKIGECIRRVASKLTGS-------TLSVKGSNDEMVIDDSPKIY---DISEK 130 S KPSFKIGECI+RVAS++TGS + V GS+D +V D S + + +E Sbjct: 1100 -STALKPSFKIGECIQRVASQMTGSPSILKCNSPKVDGSSDGLVGDGSDASFLHSEDAEI 1158 Query: 129 QSVVVSAESFSVDEILSQLQTVAQNPKKGCNFQNNTRTFFTGF 1 + ++V E S+D++LSQL AQ+P KG F N +FF+ F Sbjct: 1159 KRIIVPTEYSSLDDLLSQLHLTAQDPLKGYGFLNIIISFFSDF 1201 >ref|XP_002523905.1| hypothetical protein RCOM_1068550 [Ricinus communis] gi|223536835|gb|EEF38474.1| hypothetical protein RCOM_1068550 [Ricinus communis] Length = 1557 Score = 355 bits (912), Expect = 2e-95 Identities = 218/473 (46%), Positives = 280/473 (59%), Gaps = 49/473 (10%) Frame = -1 Query: 1272 ENSSKTNGFYIMEGHSSETDNFKLSN---EETMKSASSMRTNQSGYLLPPENEGHFAESD 1102 E+ ++ + EG E + K ++ E + ++++ Q+ Y LPP++EG F+ SD Sbjct: 881 EHDAEVQQIALHEGQEIEAEQPKTTDDKQEAALPPENTVKAYQATYQLPPDDEGEFSVSD 940 Query: 1101 LVWGKVRSHPWWPGQIFDPADASEKAVKYYKKDSYLVAYFGDRTFAWNDKSLLKPFRSYF 922 LVWGKVRSHPWWPGQIFDP+DASEKA+KYYK+D +LVAYFGDRTFAWN+ SLLKPFRS F Sbjct: 941 LVWGKVRSHPWWPGQIFDPSDASEKAMKYYKRDCFLVAYFGDRTFAWNEASLLKPFRSNF 1000 Query: 921 SQIEKQSKSEAFQNAVSSALEEVSRRVELGLACSCTPKGAYGKIEAQVVENTGIREESSR 742 S +EKQS SE FQNAV ALEEVSRRVE GLACSC P+ Y KI+ Q+VEN GIR+ESS Sbjct: 1001 SLVEKQSNSEIFQNAVDCALEEVSRRVEFGLACSCLPRNMYDKIKFQIVENAGIRQESSV 1060 Query: 741 RYGVDHSSRASYFEPDKFLEYVTELAPHASSRADRLDLVIARAQLSAFYRFKGYHSPTEF 562 R VD S A F PDK +EY+ L + ADRL+LVIA++QL +FYR KGY EF Sbjct: 1061 RDSVDESLHADVFGPDKLVEYMKALGQSPAGGADRLELVIAKSQLLSFYRLKGYSQLPEF 1120 Query: 561 SPSGELLENNADTEKISNKMLDS-------------------------EKWKHTPKDGSQ 457 G LLE NADT + +++ + K KH KD Sbjct: 1121 QFCGGLLE-NADTLPVEDEVTEGASALYKDDGQSSSGQEILQTQRSSYHKRKHNLKDTIY 1179 Query: 456 SRKK-RSLMELMGDREYSPDAEDVGXXXXXXXXXXXKTFDFQADGSNK----------RV 310 RKK RSL ELM D S D +++G + + GS+ R Sbjct: 1180 PRKKERSLSELMDDSWDSVD-DEIGADGKPSNKLLSPSSGKKRRGSDSFADDAAMIEGRK 1238 Query: 309 SIHAAKVSTSTSQTPKPSFKIGECIRRVASKLTGSTLSVK-------GSNDEMVIDDSPK 151 +I AKVST + PKPSFKIGECIRRVAS++TGS ++ G +D +V D S Sbjct: 1239 
TISLAKVSTPVT-LPKPSFKIGECIRRVASQMTGSPSILRPNSQKPDGGSDGLVGDGSDI 1297

Query: 150  IYDIS---EKQSVVVSAESFSVDEILSQLQTVAQNPKKGCNFQNNTRTFFTGF 1
+ S E + + V E S+DE+LSQL A++P KG +F +FF+ F
Sbjct: 1298 LIQHSEDLEMRRMNVPTEYSSLDELLSQLLLAARDPLKGYSFLTVIISFFSDF 1350

>gb|EMJ20098.1| hypothetical protein PRUPE_ppa000448mg [Prunus persica]
Length = 1170

Score = 353 bits (906), Expect = 9e-95
Identities = 220/461 (47%), Positives = 274/461 (59%), Gaps = 43/461 (9%)
Frame = -1

Query: 1254 NGFYIMEGHSSETDNFKLSNEE-----TMKSASSMRTNQSGYLLPPENEGHFAESDLVWG 1090
+G + E + T+ K S EE M+ SS Q Y LPPENEG F+ SDLVWG
Sbjct: 422  HGGHYTEVETEATEQPKFSEEEIIMEEAMQPGSSDILLQPRYELPPENEGLFSASDLVWG 481

Query: 1089 KVRSHPWWPGQIFDPADASEKAVKYYKKDSYLVAYFGDRTFAWNDKSLLKPFRSYFSQIE 910
KV+SHPWWPGQIFD ASEKA+KY+KKD +LVAYFGDRTFAWN+ S LKPFRSYF Q E
Sbjct: 482  KVKSHPWWPGQIFDYTVASEKAMKYHKKDCFLVAYFGDRTFAWNEPSSLKPFRSYFPQAE 541

Query: 909 KQSKSEAFQNAVSSALEEVSRRVELGLACSCTPKGAYGKIEAQVVENTGIREESSRRYGV 730
KQ SEAFQNAV+ ALEEVSRRVELGLACSC P+ Y KI Q+V N GI +ESSRR V
Sbjct: 542 KQCNSEAFQNAVNCALEEVSRRVELGLACSCIPEDVYEKIRFQIVGNAGICQESSRRDEV 601

Query: 729 DHSSRASYFEPDKFLEYVTELAPHASSRADRLDLVIARAQLSAFYRFKGYHSPTEFSPSG 550
D S+ AS E +K LEY+ LA S +D+L+LVIA+A L AFYR KGY S EF G
Sbjct: 602 DESASASSLECNKLLEYIKALARFPSGGSDQLELVIAKAHLLAFYRLKGYCSLPEFQFCG 661

Query: 549 ELLENNADTEKISNKM-------------------------LDSEKWKHTPKDGSQSR-K 448
+LLEN D+ +K+ +S K KH +DG S+ K
Sbjct: 662 DLLENRTDSSLSEDKINVGERDEHTIEKVTFSGPDIVKVQSSNSNKRKHNLRDGVYSKIK 721

Query: 447 KRSLMELMG------DREYSPDAEDVGXXXXXXXXXXXKTFDFQADG---SNKRVSIHAA 295
+RSL ELM D + D +D G K F++ AD + R + A
Sbjct: 722 ERSLSELMEGGIDSLDGDDWLDGKDSGGLVSPSSGKRRKGFEYHADDLTVQDGRKGLSVA 781

Query: 294 KVSTSTSQTPKPSFKIGECIRRVASKLTGSTLSVKGSNDEMVIDDSPKIYDIS---EKQS 124
KVS +T+ PK SFKIGECI+RVAS+LTGS + VK ++D D S + S +
Sbjct: 782 KVS-NTTHVPKQSFKIGECIQRVASQLTGSPI-VKSNSDRPAGDTSDVAFQSSGDGHRGR 839

Query: 123 VVVSAESFSVDEILSQLQTVAQNPKKGCNFQNNTRTFFTGF 1
+ E S+ E+LSQLQ+ A++P+ +F N +FFT F
Sbjct: 840 AIDPTEYASLGELLSQLQSAAEDPRNEYHFLNTIVSFFTDF 880

>ref|XP_004143691.1| PREDICTED: uncharacterized protein LOC101204371 [Cucumis sativus]
Length = 1936

Score = 349 bits (895), Expect = 2e-93
Identities = 214/447 (47%), Positives = 271/447 (60%), Gaps = 56/447 (12%)
Frame = -1

Query: 1173 SSMRTNQSGYLLPPENEGHFAESDLVWGKVRSHPWWPGQIFDPADASEKAVKYYKKDSYL 994
SS++ +Q+ Y LP ENEG F+ SDLVWGKVRSHPWWPGQIFDP+D+S++A+KYYKKD YL
Sbjct: 534  SSVQLHQACYHLPSENEGDFSVSDLVWGKVRSHPWWPGQIFDPSDSSDQAMKYYKKDFYL 593

Query: 993 VAYFGDRTFAWNDKSLLKPFRSYFSQIEKQSKSEAFQNAVSSALEEVSRRVELGLACSCT 814
VAYFGDRTFAWN+ S LKPFR++FSQ E QS SEAFQN+V ALEEVSRR ELGLAC+CT
Sbjct: 594 VAYFGDRTFAWNEVSHLKPFRTHFSQEEMQSHSEAFQNSVECALEEVSRRAELGLACACT 653

Query: 813 PKGAYGKIEAQVVENTGIREESSRRYGVDHSSRASYFEPDKFLEYVTELAPHASSRADRL 634
PK AY ++ Q++EN GIREESSRRYGVD S+ A+ FEP K +EY+ +LA S +DRL
Sbjct: 654 PKEAYDMVKCQIIENAGIREESSRRYGVDKSASATSFEPAKLIEYIRDLAKFPSDGSDRL 713

Query: 633 DLVIARAQLSAFYRFKGY--------HSPTEFSPSGELLENNADT--------------- 523
+LVIA+AQL+AFYR KGY +F G L +N D+
Sbjct: 714 ELVIAKAQLTAFYRLKGYCGLPQFQFGGLPQFQFCGGLADNELDSLGIEMQSSDFDHHAA 773

Query: 522 ------------EKISNKMLDSEKWKHTPKDGSQSRKK-RSLMELMGDREYSPDAEDVG- 385
E + + K KH KDG +KK +SL ELMG+ + D E+
Sbjct: 774 PCQDDAQASPSKENVEVRSSSYHKRKHNLKDGLYPKKKEKSLYELMGENFDNIDGENWSD 833

Query: 384 ---XXXXXXXXXXXKTFDFQADGS---NKRVSIHAAKVSTSTSQTPKPSFKIGECIRRVA 223
KT + DGS + R +I AKVS + S K SFKIG+CIRRVA
Sbjct: 834 ARTSTLVSPSCKRRKTVEHPIDGSGAPDGRKTISVAKVSGTASL--KQSFKIGDCIRRVA 891

Query: 222 SKLTGSTLSVK----------GSNDEMVIDDSP---KIYDISEKQSVVVSAESFSVDEIL 82
S+LTG T +K GS D + +S + +D +++ V E S+DE+L
Sbjct: 892 SQLTG-TPPIKSTCERFQKPDGSFDGNALHESDVFLQNFDDAQRGKVNFPPEYSSLDELL 950

Query: 81  SQLQTVAQNPKKGCNFQNNTRTFFTGF 1
QLQ VA +P K +F N +FFT F
Sbjct: 951 DQLQLVASDPMKEYSFLNVIVSFFTDF 977

>emb|CBI31518.3| unnamed protein product [Vitis vinifera]
Length = 1275

Score = 347 bits (889), Expect = 8e-93
Identities = 200/406 (49%), Positives = 243/406 (59%), Gaps = 19/406 (4%)
Frame = -1

Query: 1206 KLSNEETMKSASSMRTNQSGYLLPPENEGHFAESDLVWGKVRSHPWWPGQIFDPADASEK 1027
K T+K + +R +Q+ Y LPPE+EG F+ SDLVWGKVRSHPWWPGQIFDP+DASEK
Sbjct: 861  KTVKRATLKPGNLIRGHQATYQLPPESEGEFSVSDLVWGKVRSHPWWPGQIFDPSDASEK 920

Query: 1026 AVKYYKKDSYLVAYFGDRTFAWNDKSLLKPFRSYFSQIEKQSKSEAFQNAVSSALEEVSR 847
A+KY+KKD +LVAYFGDRTFAWN+ SLLKPFR++FSQI KQS SE F NAV AL+EVSR
Sbjct: 921  AMKYHKKDCFLVAYFGDRTFAWNEASLLKPFRTHFSQIVKQSNSEVFHNAVDCALDEVSR 980

Query: 846 RVELGLACSCTPKGAYGKIEAQVVENTGIREESSRRYGVDHSSRASYFEPDKFLEYVTEL 667
RVELGLACSC PK Y +I+ Q+VENTGIR ESSRR GVD S+ S EPD F+EY+ L
Sbjct: 981 RVELGLACSCIPKDDYDEIKCQIVENTGIRSESSRRDGVDKSATMSLLEPDTFVEYIKAL 1040

Query: 666  APHASSRADRLDLVIARAQLSAFYRFKGYHSPTEFSPSGELLENNADTEKISNKM----- 502
A S AD+L+LVIA+AQL AF R KGYH EF G L EN+AD + M
Sbjct: 1041 AQFPSGGADQLELVIAKAQLLAFSRLKGYHRLPEFQYCGGLQENDADISCFNEMMEHETD 1100

Query: 501  -------------LDSEKWKHTPKDGSQSRKK-RSLMELMGDREYSPDAEDVGXXXXXXX 364
S K KH KD + RKK RSL ELM YSPD E+
Sbjct: 1101 VLMGDDGKFKIQNSSSHKRKHNLKDSAYPRKKERSLSELMSGMAYSPDDEN--------- 1151

Query: 363  XXXXKTFDFQADGSNKRVSIHAAKVSTSTSQTPKPSFKIGECIRRVASKLTGSTLSVKGS 184
D ++K VS K +P+ SFK+G+CIRR AS+LTGS +K
Sbjct: 1152 -------DSDGKATSKPVSSSGRK--RKVVDSPRQSFKVGDCIRRAASQLTGSPSILK-- 1200

Query: 183  NDEMVIDDSPKIYDISEKQSVVVSAESFSVDEILSQLQTVAQNPKK 46
+++ E S+DE+ + VA N +K
Sbjct: 1201 --------------------MIIPMEYPSLDEMFLTMDKVAGNRRK 1226

>gb|EXC19485.1| hypothetical protein L484_014115 [Morus notabilis]
Length = 1347

Score = 337 bits (865), Expect = 5e-90
Identities = 201/454 (44%), Positives = 265/454 (58%), Gaps = 40/454 (8%)
Frame = -1

Query: 1242 IMEGHSSETDNFKLSNEETMKSASSMRTNQSGYLLPPENEGHFAESDLVWGKVRSHPWWP 1063
+ +G +T K++N E+++ SS Q Y LPPE+EG F+ DLVWGKV+SHPWWP
Sbjct: 656  VTDGEQPDTSEDKITNWESLEPGSSSTLQQPSYGLPPEDEGVFSVPDLVWGKVKSHPWWP 715

Query: 1062 GQIFDPADASEKAVKYYKKDSYLVAYFGDRTFAWNDKSLLKPFRSYFSQIEKQSKSEAFQ 883
GQIFD DAS+KA+K++KKD YLVAYFGDR+FAWN+ S LKPFR++F+Q+EKQ +E FQ
Sbjct: 716  GQIFDFTDASDKAMKHHKKDCYLVAYFGDRSFAWNESSTLKPFRTHFTQMEKQGNAETFQ 775

Query: 882 NAVSSALEEVSRRVELGLACSCTPKGAYGKIEAQVVENTGIREESSRRYGVDHSSRASYF 703
AV+ ALEEVSRRVELGLACSC K +Y +I+ Q+VEN GIR ESS+R VD S+ A +F
Sbjct: 776 KAVNCALEEVSRRVELGLACSCISKDSYDRIKHQIVENAGIRPESSKRKSVDESASAHFF 835

Query: 702 EPDKFLEYVTELAPHASSRADRLDLVIARAQLSAFYRFKGYHSPTEFSPSGELLENNA-- 529
+ DK EY+ LA S +D L+LVIA+AQL AF RF+G+ S EF G+L+EN+
Sbjct: 836 QADKLAEYLKALAWSPSGGSDHLELVIAKAQLLAFGRFRGFSSLPEFQFCGDLVENDTAG 895

Query: 528 ------------------------------DTEKISNKMLDSEKWKHTPKDGSQSR-KKR 442
+T+K+ N K KH +DG+ + K++
Sbjct: 896 PRFQDDVYPGEVIEHASLFSKDDERTASDQETQKVHNS--SYHKRKHNLRDGAYPKIKEK 953

Query: 441 SLMELMGDREYSPDAEDVGXXXXXXXXXXXKTFDFQADGSNKRVSIHAAKVSTSTSQTPK 262
SL ELMG S D +D+ +D ++ H + S S PK
Sbjct: 954 SLTELMGGAVDSLD-DDIPSGKRRKG----------SDNHVDDLTTHDGRKKVSNSTPPK 1002

Query: 261  PSFKIGECIRRVASKLTGSTLSVKGSNDEMVIDDSP----KIYDI---SEKQSVVVSAES 103
SFKIGECIRRVAS+LTGS + S +D S YD S + VV E
Sbjct: 1003 QSFKIGECIRRVASQLTGSPTAKGNSERVQKLDGSSDRPGDEYDASFHSPEGRVVDPTEY 1062

Query: 102  FSVDEILSQLQTVAQNPKKGCNFQNNTRTFFTGF 1
S+DE+L QLQ +AQ+P +F N FF+ F
Sbjct: 1063 SSLDELLLQLQFIAQDPLNEYSFSNVIVNFFSDF 1096

>ref|XP_003535180.1| PREDICTED: uncharacterized protein LOC100784689 isoform X1 [Glycine max]
 gi|571482663|ref|XP_006589021.1| PREDICTED: uncharacterized protein LOC100784689 isoform X2 [Glycine max]
Length = 1019

Score = 333 bits (853), Expect = 1e-88
Identities = 202/436 (46%), Positives = 252/436 (57%), Gaps = 41/436 (9%)
Frame = -1

Query: 1185 MKSASSMRTNQSGYLLPPENEGHFAESDLVWGKVRSHPWWPGQIFDPADASEKAVKYYKK 1006
MKS + + YLLP E EG F+ SD+VWGKVRSHPWWPGQIFDP+D+SEKA+K+YKK
Sbjct: 344  MKSMCLESLHNARYLLPIEKEGEFSVSDMVWGKVRSHPWWPGQIFDPSDSSEKAMKHYKK 403

Query: 1005 DSYLVAYFGDRTFAWNDKSLLKPFRSYFSQIEKQSKSEAFQNAVSSALEEVSRRVELGLA 826
D +LVAYFGDRTFAWN++S LKPFR++FS IEKQS SE+FQNAV A++EV+RR E GLA
Sbjct: 404  DCHLVAYFGDRTFAWNEESQLKPFRTHFSSIEKQSTSESFQNAVDCAVDEVTRRAEYGLA 463

Query: 825 CSCTPKGAYGKIEAQVVENTGIREESSRRYGVDHSSRASYFEPDKFLEYVTELAPHASSR 646
CSC PK Y I+ Q VENTGIR E S R+GVD S AS F P +EY+ L+ +
Sbjct: 464 CSCIPKDTYDSIKFQTVENTGIRSELSARHGVDESLNASSFSPGNLVEYLKTLSALPTGG 523

Query: 645 ADRLDLVIARAQLSAFYRFKGYHSPTEFSPSGEL----------LENN-----------A 529
DRL+L IA+AQL +FYRFKGY E G ENN A
Sbjct: 524 FDRLELEIAKAQLLSFYRFKGYSCLPELQYCGGFDDDMDSLVHDDENNHAAPVSKNYGQA 583

Query: 528 DTEKISNKMLDSEKWKHTPKD-GSQSRKKRSLMELMGDREYSPDAE------DVGXXXXX 370
+ + N+ K KH KD +++K+RSL ELMG SPD + +
Sbjct: 584 GSGNLKNQSSSHRKRKHNLKDIMHETKKERSLSELMGGTPDSPDGDYWSEEKVIDNLVSP 643

Query: 369 XXXXXXKTFDFQADGSNK---RVSIHAAKVSTSTSQTPKPSFKIGECIRRVASKLTGSTL 199
+T D AD K R +I AKVS +T KPSF IG+ IRRVASKLTGS
Sbjct: 644 GRSKKRRTVDHYADDFGKPDGRKTISVAKVSNTT----KPSFLIGDRIRRVASKLTGSPS 699

Query: 198 SVKGSNDEMVIDDSPK----------IYDISEKQSVVVSAESFSVDEILSQLQTVAQNPK 49
+VK S D D ++ +++ S+ E S+D +LS L VAQ P
Sbjct: 700 TVKSSGDRSQKTDGSTDGFSGNGTDFSFEEAQRSSMAAPTEYSSLDNLLSSLHLVAQEPL 759

Query: 48  KGCNFQNNTRTFFTGF 1
NF N +FF+ F
Sbjct: 760 GDYNFLNPIVSFFSDF 775