BLASTX nr result
ID: Catharanthus22_contig00021544
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00021544 (2171 letters)

Database: ./nr
37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_004252856.1| PREDICTED: uncharacterized protein LOC101246...  377   e-101
ref|XP_006365820.1| PREDICTED: uncharacterized protein LOC102578...  375   e-101
gb|EOY32612.1| Sequence-specific DNA binding transcription facto...  349   2e-93
gb|EXB93287.1| hypothetical protein L484_015274 [Morus notabilis]    336   3e-89
ref|XP_002273284.1| PREDICTED: uncharacterized protein LOC100250...  325   4e-86
ref|XP_004291801.1| PREDICTED: uncharacterized protein LOC101300...  319   4e-84
ref|XP_002513474.1| transcription factor, putative [Ricinus comm...  319   4e-84
gb|ESW22374.1| hypothetical protein PHAVU_005G148400g [Phaseolus...  313   1e-82
gb|EMJ13730.1| hypothetical protein PRUPE_ppa019358mg, partial [...  305   4e-80
ref|XP_003547125.1| PREDICTED: uncharacterized protein LOC100798...  301   6e-79
ref|XP_006470589.1| PREDICTED: uncharacterized protein LOC102621...  299   4e-78
ref|XP_006446099.1| hypothetical protein CICLE_v10015189mg [Citr...  299   4e-78
ref|XP_003597439.1| hypothetical protein MTR_2g098080 [Medicago ...  298   6e-78
ref|XP_004486956.1| PREDICTED: uncharacterized protein LOC101493...  289   3e-75
ref|XP_002299233.2| hypothetical protein POPTR_0001s05670g, part...  288   5e-75
ref|XP_006470590.1| PREDICTED: uncharacterized protein LOC102621...  286   2e-74
ref|XP_004161693.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri...  286   3e-74
ref|XP_004142119.1| PREDICTED: uncharacterized protein LOC101205...
286 3e-74 ref|XP_003542048.1| PREDICTED: uncharacterized protein LOC100801... 286 3e-74 ref|XP_002303892.2| hypothetical protein POPTR_0003s20370g [Popu... 285 7e-74 >ref|XP_004252856.1| PREDICTED: uncharacterized protein LOC101246904 [Solanum lycopersicum] Length = 460 Score = 377 bits (967), Expect = e-101 Identities = 232/506 (45%), Positives = 282/506 (55%), Gaps = 15/506 (2%) Frame = +2 Query: 554 MNNSGTGSGFLSGYTGGFLGMN--RQPQNPGG--FLGLNRNHQQTTNNPAHTQKENVQSS 721 MN+SG G GFLSGY GGFLGMN +QP++ LG NH NVQSS Sbjct: 1 MNSSGMGGGFLSGYNGGFLGMNMLQQPESTSNNAVLGHQMNHN------------NVQSS 48 Query: 722 MDIEPAPVQRPTAVREVKDSNPRGFTMNFGKEKTFGASNVTINFNDD-----STDDEPSY 886 + + T + + +G M +GK+K G SNV N +++ S +DEPS+ Sbjct: 49 VSAKMGLEHEKT----IGLMDAQG-CMAYGKDKAVGPSNVVYNTSNNPNSNTSDEDEPSF 103 Query: 887 -EEGNEEGTSGVKGKKDAPWQRMKWTDNSVRLLIQVVASVGDDGSLESPD--RXXXXXXX 1057 E+GN E G GKK +PWQRMKWTDN VRLLIQVVA VGDDGSLE P Sbjct: 104 NEDGNGENNGGAPGKKGSPWQRMKWTDNVVRLLIQVVACVGDDGSLEGPGVGLKRKSACL 163 Query: 1058 XXXXXXXXXSKILISKGYHVSPQQCEDKFNDLNKRYKKLNDILGRGMSCKVVENPGLLFG 1237 S+I++S G HVSPQQCEDKFNDLNKRYKKLNDILGRG SC VVENP L+ Sbjct: 164 QKKGKWKTVSRIMMSNGCHVSPQQCEDKFNDLNKRYKKLNDILGRGTSCAVVENPVLMDS 223 Query: 1238 MHHLSEKAKEDVKKLLGSKHLFYREICAYHNGQKIPDCSDLELPVHPATVAEQIWNHSDV 1417 M LS KAK+ VKK+L SKHLFYRE+CAYHNGQKIPDC+DLE P H Sbjct: 224 MPQLSAKAKDTVKKILNSKHLFYREMCAYHNGQKIPDCNDLEFPAHS------------- 270 Query: 1418 SPAAANCSKDTDESPVAPGCLKGNNVXXXXXXXXXXXXXXXXXXXXXXNNACGSIGKMGE 1597 SP AA C+KD + S +G+ N+ G K+G+ Sbjct: 271 SPVAAPCAKDHNGS-------QGDEAEENDESDDDDDESDD-------NHGDGDARKIGD 316 Query: 1598 YRDRERMKPVEN---IRTQFSQKDNFEAEIAGIFDDPTKSQWERKQWIKKRMLQLQEERI 1768 + ERM+ VE Q + DNF AEI F DPTKSQW++K WIKKRMLQL+EE+I Sbjct: 317 FD--ERMRRVEENGYFLPQINGNDNFLAEINEFFHDPTKSQWDKKMWIKKRMLQLEEEKI 374 Query: 1769 GIQAEAVELEKRRLKWLRFCNXXXXXXXXXXXXXDRLMXXXXXXXXXXXXXXXXXXSRRP 1948 GIQAEA ELEKR++KW RFC +LM S +P Sbjct: 375 GIQAEAFELEKRQVKWQRFCRKKDREFEIERLENKKLMLENEHMALQLKHKQHELDSTKP 434 Query: 1949 EASFSLTSFSIDIMGGRDHMNSARFH 2026 SF+ 
S+D GRD +++AR+H Sbjct: 435 NISFNSAPLSLDRPMGRDQLDAARYH 460 >ref|XP_006365820.1| PREDICTED: uncharacterized protein LOC102578195 [Solanum tuberosum] Length = 459 Score = 375 bits (962), Expect = e-101 Identities = 231/501 (46%), Positives = 277/501 (55%), Gaps = 10/501 (1%) Frame = +2 Query: 554 MNNSGTGSGFLSGYTGGFLGMNRQPQNPGGFLGLNRNHQQTTNNPAHTQKENVQSSMDIE 733 MN+SG G GFLSGY GGFLGMN Q HQ NN VQSS+ + Sbjct: 1 MNSSGMGGGFLSGYNGGFLGMNMLQQPEPMSNNAVHGHQMNHNN--------VQSSVSAK 52 Query: 734 PAPVQRPTAVREVKDSNPRGFTMNFGKEKTFGASNVTINFNDD----STDDEPSY-EEGN 898 T V + +G M +GK+K G SNV N+ S +DEPS+ E+GN Sbjct: 53 LGLEHEKT----VGLMDAQG-CMAYGKDKAVGPSNVVNTSNNPNSNTSDEDEPSFNEDGN 107 Query: 899 EEGTSGVKGKKDAPWQRMKWTDNSVRLLIQVVASVGDDGSLESPD--RXXXXXXXXXXXX 1072 E G +GKK +PWQRMKWTDN VRLLIQVVA VGDDGSLE P Sbjct: 108 GENNGGAQGKKGSPWQRMKWTDNVVRLLIQVVACVGDDGSLEGPGVGLKRKSACLQKKGK 167 Query: 1073 XXXXSKILISKGYHVSPQQCEDKFNDLNKRYKKLNDILGRGMSCKVVENPGLLFGMHHLS 1252 S+I++S G HVSPQQCEDKFNDLNKRYKKLNDILGRG SC VVENP L+ M LS Sbjct: 168 WKTVSRIMMSNGCHVSPQQCEDKFNDLNKRYKKLNDILGRGTSCAVVENPVLMDSMPQLS 227 Query: 1253 EKAKEDVKKLLGSKHLFYREICAYHNGQKIPDCSDLELPVHPATVAEQIWNHSDVSPAAA 1432 KAK+ VKK+L SKHLFYRE+CAYHNGQKIPDC+DLE P H SP AA Sbjct: 228 AKAKDTVKKILNSKHLFYREMCAYHNGQKIPDCNDLEFPAHS-------------SPVAA 274 Query: 1433 NCSKDTDESPVAPGCLKGNNVXXXXXXXXXXXXXXXXXXXXXXNNACGSIGKMGEYRDRE 1612 C+KD + S +G+ N+ G K+G++ E Sbjct: 275 LCAKDHNGS-------QGDEAEENDESDDDDDESDD-------NHGDGDARKIGDFD--E 318 Query: 1613 RMKPVE---NIRTQFSQKDNFEAEIAGIFDDPTKSQWERKQWIKKRMLQLQEERIGIQAE 1783 RM+ VE + Q + DNF AEI F DPTKSQW++K WIKKRMLQL+EE+IGIQAE Sbjct: 319 RMRRVEENGSFLPQINGNDNFLAEINEFFQDPTKSQWDQKMWIKKRMLQLEEEKIGIQAE 378 Query: 1784 AVELEKRRLKWLRFCNXXXXXXXXXXXXXDRLMXXXXXXXXXXXXXXXXXXSRRPEASFS 1963 A ELEKR++KW RFC +LM S +P SF+ Sbjct: 379 AFELEKRQVKWQRFCRKKDREFEIERLENKKLMLENEHMALQLKHKQHELDSTKPNISFN 438 Query: 1964 LTSFSIDIMGGRDHMNSARFH 2026 S+D GRD +++AR+H Sbjct: 439 SAPLSLDRPMGRDQLDAARYH 459 >gb|EOY32612.1| Sequence-specific DNA binding transcription factors 
[Theobroma cacao] Length = 454 Score = 349 bits (896), Expect = 2e-93 Identities = 211/495 (42%), Positives = 267/495 (53%), Gaps = 6/495 (1%) Frame = +2 Query: 554 MNNSGTGSGFLSGYTGGFLGMNRQPQNPGGFLGLNRNHQQTTNNPAHTQKENVQSSMDIE 733 M NSG G GFLSG GG + +NR + P+ N+ + + Sbjct: 1 MENSGLGGGFLSGPNGGLFDLESS---------INRQQKPQLGQPSLIPHHNMVLMSEND 51 Query: 734 PAPVQRPTAVREVKDSNPRGFTMNFGKEKTFGASNVT-INFNDDSTDDEPSY-EEGNEEG 907 R T V E K NP+GF MNFGK K G S ++ +N S +DEPSY E+GN E Sbjct: 52 ----HRSTGVMEAKGCNPKGFPMNFGKGK--GVSPISAMNNGSMSEEDEPSYIEDGNGEN 105 Query: 908 TSGVKGKKDAPWQRMKWTDNSVRLLIQVVASVGDDGSLESPDRXXXXXXXXXXXXXXXX- 1084 + G KGKK +PWQRMKWTDN VRLLI VVA VGDDG +E + Sbjct: 106 SIGGKGKKGSPWQRMKWTDNVVRLLIAVVACVGDDGMIEGVEGPKRKSGILQKKGKWKTV 165 Query: 1085 SKILISKGYHVSPQQCEDKFNDLNKRYKKLNDILGRGMSCKVVENPGLLFGMHHLSEKAK 1264 SKI+ISKG HVSPQQCEDKFNDLNKRYKKLNDILGRG SC+VVENP L+ M HLS KAK Sbjct: 166 SKIMISKGCHVSPQQCEDKFNDLNKRYKKLNDILGRGTSCRVVENPSLMDSMPHLSAKAK 225 Query: 1265 EDVKKLLGSKHLFYREICAYHNGQKIPDCSDLELP--VHPATVAEQIWNHSDVSPAAANC 1438 +DVKK+L SKHLFY E+CAYHNGQ+IP+C DL+L P + N SD A N Sbjct: 226 DDVKKILSSKHLFYPEMCAYHNGQRIPNCQDLDLQGCFVPLDRCLKDNNGSDEEEAEGND 285 Query: 1439 SKDTDESPVAPGCLKGNNVXXXXXXXXXXXXXXXXXXXXXXNNACGSIGKMGEYRDRERM 1618 + D+ NNA G ++GE R++ Sbjct: 286 DSEDDDE----------------------------MDNEDDNNADGDDERIGELNKRKKA 317 Query: 1619 KPVE-NIRTQFSQKDNFEAEIAGIFDDPTKSQWERKQWIKKRMLQLQEERIGIQAEAVEL 1795 E + +Q +++D+ + E+AGIF DPT+S ERK+WIK+++LQLQEER+ +Q E EL Sbjct: 318 SAEEGHFWSQSAEQDSLKVEMAGIFHDPTRSSLERKEWIKRQILQLQEERVNLQVEGFEL 377 Query: 1796 EKRRLKWLRFCNXXXXXXXXXXXXXDRLMXXXXXXXXXXXXXXXXXXSRRPEASFSLTSF 1975 EK+R KWLR+CN +R+ R +AS TS Sbjct: 378 EKQRFKWLRYCNKKGRELERLRLENERMRLENERSLLQLRQKELEVGFRSSDASLDPTSL 437 Query: 1976 SIDIMGGRDHMNSAR 2020 ID + RD ++ R Sbjct: 438 GIDRLQSRDQIDLGR 452 >gb|EXB93287.1| hypothetical protein L484_015274 [Morus notabilis] Length = 452 Score = 336 bits (861), Expect = 3e-89 Identities = 203/493 (41%), Positives = 274/493 (55%), Gaps = 4/493 (0%) Frame = +2 Query: 
554 MNNSGTGSGFLSGYTGGFLGMNRQPQNPGGFLGLNRNHQQTTNNPAHTQKENVQSSMDIE 733 M++SG G GFLSG GG L + ++P + ++ Q ++ AH +V ++ E Sbjct: 1 MDSSGLGGGFLSGPNGGILDL----ESP---MHRHQTRQLGHSSLAHQHHMSVMGGVEDE 53 Query: 734 PAPVQRPTAVREVKDSNPRGFTMNFGKEKTFGASNVTINFNDDSTDDEPSY-EEGNEEGT 910 P P+ ++ EVK S +G +M FGK K N +N S +DEPSY E+G+ E Sbjct: 54 PHPI----SLMEVKGSASKGVSM-FGKGKGVAPINGNHGYNL-SEEDEPSYMEDGSGENF 107 Query: 911 SGVKGKKDAPWQRMKWTDNSVRLLIQVVASVGDDGSLESPDRXXXXXXXXXXXXXXXX-S 1087 GVKGKK +PWQRMKWTDN V+LLI VVA VGDDG + + S Sbjct: 108 DGVKGKKGSPWQRMKWTDNVVKLLIAVVACVGDDGMVAGGEGLKRKSGILQKKGKWKTVS 167 Query: 1088 KILISKGYHVSPQQCEDKFNDLNKRYKKLNDILGRGMSCKVVENPGLLFGMHHLSEKAKE 1267 KI+ISKG HVSPQQCEDKFNDLNKRYK+LNDILGRG SC+VVENP L+ M HLS KAK+ Sbjct: 168 KIMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGTSCRVVENPALMDSMPHLSAKAKD 227 Query: 1268 DVKKLLGSKHLFYREICAYHNGQKIPDCSDLELPVH--PATVAEQIWNHSDVSPAAANCS 1441 DV+K+L SKHLFY+E+CAYHNGQ+I DC D++L + P + N SD A N Sbjct: 228 DVRKILSSKHLFYKEMCAYHNGQRILDCHDIDLQGYSLPLERCSKDNNGSDEEEAEENHD 287 Query: 1442 KDTDESPVAPGCLKGNNVXXXXXXXXXXXXXXXXXXXXXXNNACGSIGKMGEYRDRERMK 1621 + D+ NN +M E+ DR+++ Sbjct: 288 SEDDD-----------------------------LDNEDDNNDDNDGERMTEFGDRKKVN 318 Query: 1622 PVENIRTQFSQKDNFEAEIAGIFDDPTKSQWERKQWIKKRMLQLQEERIGIQAEAVELEK 1801 E+ S +D+FE E+A F DPTKS WER++W+KK+M+QL+ +R+ +QAEA+ELEK Sbjct: 319 E-EDFHFWTSPQDSFEGEMARFFQDPTKSLWERREWVKKQMMQLEVQRVNVQAEALELEK 377 Query: 1802 RRLKWLRFCNXXXXXXXXXXXXXDRLMXXXXXXXXXXXXXXXXXXSRRPEASFSLTSFSI 1981 +R +WLR+C+ +R+ RR EAS +S I Sbjct: 378 QRFRWLRYCSKKDRELERLRLENERMKLENERKVLQLRQKELELDLRRSEASLEHSSLGI 437 Query: 1982 DIMGGRDHMNSAR 2020 D + GRD ++ R Sbjct: 438 DRLQGRDQIDLGR 450 >ref|XP_002273284.1| PREDICTED: uncharacterized protein LOC100250855 [Vitis vinifera] Length = 451 Score = 325 bits (834), Expect = 4e-86 Identities = 195/486 (40%), Positives = 264/486 (54%), Gaps = 5/486 (1%) Frame = +2 Query: 578 GFLSGYTGGFLGMNRQPQNPGGFLGLNRNHQQTTNNPAHTQKE--NVQSSMDIEPAPVQR 751 GFLSG + G L + Q R+ Q + P HT NV + + + + Sbjct: 6 
GFLSGTSAGILELETSIQ---------RHQQAQMSIPPHTHHHHMNVMTGFENDHCSI-- 54 Query: 752 PTAVREVKDSNPRGFTMNFGKEKTFGASNVTINFNDDSTD-DEPSYEEGNEEGTSGVKGK 928 E K S P+G +N+GK K +V N N++++D DEPSY E +E SG KGK Sbjct: 55 --GTLETKGSTPKGIPINYGKGKGIAPVSVANNDNNNTSDEDEPSYTE--DENFSGAKGK 110 Query: 929 KDAPWQRMKWTDNSVRLLIQVVASVGDDGSLESPDRXXXXXXXXXXXXXXXX-SKILISK 1105 K +PWQRMKWTDN VRLLI VVA VGDDG+LE + SKI+ISK Sbjct: 111 KGSPWQRMKWTDNVVRLLIAVVACVGDDGTLEGVEGLKRKSGILQKKGKWKTVSKIMISK 170 Query: 1106 GYHVSPQQCEDKFNDLNKRYKKLNDILGRGMSCKVVENPGLLFGMHHLSEKAKEDVKKLL 1285 G VSPQQCEDKFNDLNKRYK+LN+ILGRG +C+VVENP L+ M LS K K+DVKK+L Sbjct: 171 GCFVSPQQCEDKFNDLNKRYKRLNEILGRGTTCRVVENPALMDSMPQLSAKMKDDVKKIL 230 Query: 1286 GSKHLFYREICAYHNGQKIPDCSDLELPVHPATVAEQIWNHSDVSPAAANCSKDTDESPV 1465 SKHLFY+E+CAYHNG+ IP+C D++L + + +A +++ A ++D D+ + Sbjct: 231 SSKHLFYQEMCAYHNGKSIPNCHDIDLQGYFSPLARSSKDNNGSEEEEAEENEDFDDDEL 290 Query: 1466 APGCLKGNNVXXXXXXXXXXXXXXXXXXXXXXNNACGSIGKMGEYRDRERMKPVE-NIRT 1642 +N +MG++ R ++ + + Sbjct: 291 ---------------------------DNEEYDNVDVHAQRMGQFHQRRKVNQEDCSFWP 323 Query: 1643 QFSQKDNFEAEIAGIFDDPTKSQWERKQWIKKRMLQLQEERIGIQAEAVELEKRRLKWLR 1822 Q + +D+FE E+AGIF+DPTKS WE+K+WIK RMLQLQE+R+ I A+ ELEK+R KWLR Sbjct: 324 QDACQDSFEVEMAGIFEDPTKSLWEQKEWIKNRMLQLQEQRVTIMAQGFELEKQRFKWLR 383 Query: 1823 FCNXXXXXXXXXXXXXDRLMXXXXXXXXXXXXXXXXXXSRRPEASFSLTSFSIDIMGGRD 2002 + + +RL S+RPEAS S ID + GRD Sbjct: 384 YSSKKGRDLENSRLENERLGLENERMVLELKQKELELDSKRPEASLDPASLGIDRLQGRD 443 Query: 2003 HMNSAR 2020 + R Sbjct: 444 QIELGR 449 >ref|XP_004291801.1| PREDICTED: uncharacterized protein LOC101300312 [Fragaria vesca subsp. 
vesca] Length = 451 Score = 319 bits (817), Expect = 4e-84 Identities = 201/487 (41%), Positives = 254/487 (52%), Gaps = 4/487 (0%) Frame = +2 Query: 554 MNNSGTGSGFLSGYTGGFLGMNRQPQNPGGFLGLNRNHQQTTNNP--AHTQKENVQSSMD 727 M+ SG G GFLSG +GG L + + R+ ++ +P AH Q+ NV S + Sbjct: 1 MDGSGLGGGFLSGPSGGTLDLESS---------IPRHQEKQMGHPVLAHHQRVNVMSGVG 51 Query: 728 IEPAPVQRPTAVREVKDSNPRGFTMNFGKEKTFGASNVTINFNDDSTDDEPSY-EEGNEE 904 ++ PV + EVK S PRG +MN K K A T N + S +DEPSY E+GN+E Sbjct: 52 VDRCPV----GLVEVKGSIPRGASMNVVKGKGV-APFYTSNAYNLSEEDEPSYNEDGNDE 106 Query: 905 GTSGVKGKKDAPWQRMKWTDNSVRLLIQVVASVGDDGSLESPDRXXXXXXXXXXXXXXXX 1084 GVKGKK + WQRMKWTD VRLLI VVA VGDDG Sbjct: 107 SLEGVKGKKGSTWQRMKWTDRVVRLLISVVACVGDDGVEGGEGLKRKSAVLQKKGKWKTV 166 Query: 1085 SKILISKGYHVSPQQCEDKFNDLNKRYKKLNDILGRGMSCKVVENPGLLFGMHHLSEKAK 1264 SKI+I KG HVSPQQCEDKFNDLNKRYK+LNDILGRG SC+VVENP L+ M HLS KAK Sbjct: 167 SKIMIGKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGTSCRVVENPSLMDSMPHLSAKAK 226 Query: 1265 EDVKKLLGSKHLFYREICAYHNGQKIPDCSDLELPVHPATVAEQIWNHSDVSPAAANCSK 1444 +DV+K+L SK LFY+E+CAYHNGQ IP C DL+L Sbjct: 227 DDVRKILSSKQLFYKEMCAYHNGQWIPGCHDLDL-------------------------- 260 Query: 1445 DTDESPVAPGCLKGNNVXXXXXXXXXXXXXXXXXXXXXXNNACGS-IGKMGEYRDRERMK 1621 P+ PG +N NNA GS +MG DR+R Sbjct: 261 --QGDPLRPGRCSTDNNVSEEEGVEEHHDTEDDGSDNEDNNADGSDRDRMGRTGDRKRSS 318 Query: 1622 PVENIRTQFSQKDNFEAEIAGIFDDPTKSQWERKQWIKKRMLQLQEERIGIQAEAVELEK 1801 ++ Q D FE E+A + DPTKS WER++WI K+ L+LQ++R+ +AE +ELEK Sbjct: 319 EEDDHFWQHVSLDTFEVEMAEVLQDPTKSLWERREWINKQKLRLQQQRVNCEAEDLELEK 378 Query: 1802 RRLKWLRFCNXXXXXXXXXXXXXDRLMXXXXXXXXXXXXXXXXXXSRRPEASFSLTSFSI 1981 +R +WLR+ + +R+ RR EAS S I Sbjct: 379 QRYRWLRYRSKKDRELERLRLENERMKLENEKRVLQLRQKELEIDVRRSEASLE-PSVGI 437 Query: 1982 DIMGGRD 2002 D + GRD Sbjct: 438 DRLLGRD 444 >ref|XP_002513474.1| transcription factor, putative [Ricinus communis] gi|223547382|gb|EEF48877.1| transcription factor, putative [Ricinus communis] Length = 454 Score = 319 bits (817), Expect = 4e-84 Identities = 204/498 (40%), Positives = 264/498 
(53%), Gaps = 9/498 (1%) Frame = +2 Query: 554 MNNSGTGSGFLSGYTGGFLGM----NRQPQNPGGFLGLNRNHQQTTNNPAHTQKENVQSS 721 M+ SG G FLS GG L + +RQ Q G L AH + N+ S Sbjct: 1 MDASGLGGQFLSSPVGGLLDLESPIHRQQQAQSGHSSL-----------AHQRHMNLISG 49 Query: 722 MDIEPAPVQRPTAVREVKDSNPRGFTMNFGKEKTFGASNVTINFNDDSTDDEPSY-EEGN 898 + + P+ + EVK S+PR ++ N K K + T N + S DDEPS+ E+GN Sbjct: 50 LGNDHQPI----GLMEVKGSSPRNYSTNLSKGKGVSHFSPT-NDGNVSEDDEPSFTEDGN 104 Query: 899 EEGTSGVKGKKDAPWQRMKWTDNSVRLLIQVVASVGDDGSLESPDRXXXXXXXXXXXXXX 1078 + +SG K KK +PWQRMKWTDN VRLLI VVA VGDDG+ + + Sbjct: 105 GDNSSGAKSKKGSPWQRMKWTDNVVRLLIAVVACVGDDGAFDGVEGLKRKSGILQKKGKW 164 Query: 1079 XX-SKILISKGYHVSPQQCEDKFNDLNKRYKKLNDILGRGMSCKVVENPGLLFGMHHLSE 1255 SKILISKG HVSPQQ EDKFNDLNKRYK+LNDILGRG SC+VVENP L+ M LS Sbjct: 165 KTVSKILISKGCHVSPQQSEDKFNDLNKRYKRLNDILGRGTSCRVVENPALMDSM-PLSA 223 Query: 1256 KAKEDVKKLLGSKHLFYREICAYHNGQKIPDCSDLELPVH--PATVAEQIWNHSDVSPAA 1429 KAKEDV+K+L SKHLFY+E+CAYHNGQ IP+C DL+L P + N S+ A Sbjct: 224 KAKEDVRKILSSKHLFYKEMCAYHNGQMIPNCQDLDLQGFSLPLERCSRDNNGSEEEEAE 283 Query: 1430 ANCSKDTDESPVAPGCLKGNNVXXXXXXXXXXXXXXXXXXXXXXNNACGSIGKMGEYRDR 1609 + D DES NNA +MG Y +R Sbjct: 284 GHDDSDEDES-----------------------------DNEDDNNAIEEGERMGMYAER 314 Query: 1610 ERMKPVE-NIRTQFSQKDNFEAEIAGIFDDPTKSQWERKQWIKKRMLQLQEERIGIQAEA 1786 ++ + ++ Q ++FE E+AGIF DP+ S WE+K+WI K+ LQL E+R+ IQA+A Sbjct: 315 NKVNEEDAHLWPQSGGCNSFEVEMAGIFQDPSVSLWEKKEWINKQKLQLLEQRVSIQAQA 374 Query: 1787 VELEKRRLKWLRFCNXXXXXXXXXXXXXDRLMXXXXXXXXXXXXXXXXXXSRRPEASFSL 1966 ELEK+R KWLR+C+ +R+ R E+S Sbjct: 375 FELEKQRFKWLRYCSKKDKEFEKLRLENERMRLENEQSVLQLRQKQLEMDLRSSESSRDP 434 Query: 1967 TSFSIDIMGGRDHMNSAR 2020 TS ID + GRD ++ R Sbjct: 435 TSLGIDRLQGRDQIDLGR 452 >gb|ESW22374.1| hypothetical protein PHAVU_005G148400g [Phaseolus vulgaris] Length = 448 Score = 313 bits (803), Expect = 1e-82 Identities = 201/494 (40%), Positives = 262/494 (53%), Gaps = 5/494 (1%) Frame = +2 Query: 554 MNNSGTGSGFLSGYTGGFLGMNRQPQNPGGFLGLNRNHQQTTNNPAHT--QKENVQSSMD 727 MN SG G GFLSG +GG L + +R+ Q 
+P+ T Q+ N+ S ++ Sbjct: 1 MNGSGLGGGFLSGPSGGILDLESS---------FHRHQQTQLGHPSITGQQQLNIMSGLE 51 Query: 728 IEPAPVQRPTAVREVKDSNPRGFTMNFGKEKTFGASNVTINFNDDSTDDEPSY-EEGNEE 904 + P + EVK+ N +NFGK K S+ N+ S +DEPSY EEGN E Sbjct: 52 SD-----HPIGLIEVKNLNA---ALNFGKGKAIAPSDS----NELSDEDEPSYAEEGNCE 99 Query: 905 GTSGVKGKKDAPWQRMKWTDNSVRLLIQVVASVGDDGSLESPD-RXXXXXXXXXXXXXXX 1081 G KGKK +PWQRMKWTDN VRLLI VV+ VGDDG++ D Sbjct: 100 NLDGGKGKKGSPWQRMKWTDNVVRLLITVVSCVGDDGTIAGMDGHKRKSGVLQKKGKWKT 159 Query: 1082 XSKILISKGYHVSPQQCEDKFNDLNKRYKKLNDILGRGMSCKVVENPGLLFGMHHLSEKA 1261 SKI+ISKG HVSPQQCEDKFNDLNKRYK+LNDILGRG C+VVENP L+ + +LS K Sbjct: 160 VSKIMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGTCCQVVENPALMDSIPNLSAKM 219 Query: 1262 KEDVKKLLGSKHLFYREICAYHNGQKIPDCSDLELPVHPATVAEQIWNHSDVSPAAANCS 1441 K+DV+K+L SKHLFY+E+CAYHNGQ+IP+C +L+L + H S N S Sbjct: 220 KDDVRKILSSKHLFYKEMCAYHNGQRIPNCHELDLQGYS-------MEHGRDSTRDNNGS 272 Query: 1442 KDTDESPVAPGCLKGNNVXXXXXXXXXXXXXXXXXXXXXXNNACGSIGKMGEYRDRERMK 1621 +D DE+ NN NA G+M E DR + Sbjct: 273 EDEDEN---------NN-----------DSEEDELDDDININAHEDGGRMQELCDRNILS 312 Query: 1622 PVE-NIRTQFSQKDNFEAEIAGIFDDPTKSQWERKQWIKKRMLQLQEERIGIQAEAVELE 1798 + + Q S+ D FE E+A +F DPTKS E+++WIK +MLQLQE+ I QA+ +ELE Sbjct: 313 EEDGHFGPQASRMDKFEIEMARVFQDPTKSLREQREWIKIQMLQLQEQNISYQAQTLELE 372 Query: 1799 KRRLKWLRFCNXXXXXXXXXXXXXDRLMXXXXXXXXXXXXXXXXXXSRRPEASFSLTSFS 1978 K+RLKWLR+C+ R+ E S S Sbjct: 373 KQRLKWLRYCSKKDRELERLRLENKRMKLENEHRILKLKQKELEVDFSTSEMSLDPASIG 432 Query: 1979 IDIMGGRDHMNSAR 2020 I+ GR+H++ R Sbjct: 433 INRPQGREHVSLGR 446 >gb|EMJ13730.1| hypothetical protein PRUPE_ppa019358mg, partial [Prunus persica] Length = 402 Score = 305 bits (782), Expect = 4e-80 Identities = 184/431 (42%), Positives = 228/431 (52%), Gaps = 5/431 (1%) Frame = +2 Query: 554 MNNSGTGSGFLSGYTGGFLGMNRQPQNPGGFLGLNRNHQQTTNNPA---HTQKENVQSSM 724 M++SG G GFLSG +GG L + + R + +P H NV S + Sbjct: 1 MDSSGLGGGFLSGPSGGILDLESS---------IPRRQEIQLGHPLFAHHQHHMNVMSGV 51 Query: 725 DIEPAPVQRPTAVREVKDSNPRGFTMNFGKEKTFGASNVTINFNDDSTDDEPSYEEGNEE 
904 K K N +N S +DEPSY + E Sbjct: 52 -----------------------------KGKAVAPFNANNGYNL-SDEDEPSYTDDGGE 81 Query: 905 GTSGVKGKKDAPWQRMKWTDNSVRLLIQVVASVGDDGSLESPDRXXXXXXXXXXXXXXXX 1084 G KGKK +PWQRMKWTDN VRLLI +VA VGDDG+LES + Sbjct: 82 NFEGAKGKKGSPWQRMKWTDNVVRLLIAIVACVGDDGTLESGEGLKRKSGILQKKGKWKT 141 Query: 1085 -SKILISKGYHVSPQQCEDKFNDLNKRYKKLNDILGRGMSCKVVENPGLLFGMHHLSEKA 1261 SKI+ISKG HVSPQQCEDKFNDLNKRYK+LNDILGRG SC+VVENP L+ M HLS KA Sbjct: 142 VSKIMISKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGTSCRVVENPALMDSMPHLSAKA 201 Query: 1262 KEDVKKLLGSKHLFYREICAYHNGQKIPDCSDLELPVHPATVAEQIWNHSDVSPAAANCS 1441 K+DV+K+L SKHLFY+E+CAYHNGQ+IPDC DL+L S CS Sbjct: 202 KDDVRKILSSKHLFYKEMCAYHNGQRIPDCHDLDL--------------QGYSLPLGRCS 247 Query: 1442 KDTDESPVAPGCLKGNNVXXXXXXXXXXXXXXXXXXXXXXNNACGSIGKMGEYRDRERMK 1621 KD NN N+A +M DR++ Sbjct: 248 KD-------------NNGSDEEEAEEHHDTEDDELDNEDDNDADNDRERMRRMGDRKKAS 294 Query: 1622 PVE-NIRTQFSQKDNFEAEIAGIFDDPTKSQWERKQWIKKRMLQLQEERIGIQAEAVELE 1798 + + Q+ D+FE E+ IF DPTKS WER+ WIKK+ LQL+E+R+G QAEA+ELE Sbjct: 295 EEDGHFWPQYVLLDSFEVEMGEIFQDPTKSLWERRDWIKKQKLQLEEQRVGFQAEALELE 354 Query: 1799 KRRLKWLRFCN 1831 K+R KWLR+C+ Sbjct: 355 KQRYKWLRYCS 365 >ref|XP_003547125.1| PREDICTED: uncharacterized protein LOC100798932 isoform X1 [Glycine max] gi|571515450|ref|XP_006597254.1| PREDICTED: uncharacterized protein LOC100798932 isoform X2 [Glycine max] Length = 447 Score = 301 bits (772), Expect = 6e-79 Identities = 195/492 (39%), Positives = 259/492 (52%), Gaps = 3/492 (0%) Frame = +2 Query: 554 MNNSGTGSGFLSGYTGGFLGMNRQPQNPGGFLGLNRNHQQTTNNPAHTQKENVQSSMDIE 733 MN+SG G GFLSG +G L + ++P +R+ +P+ T ++++ +E Sbjct: 1 MNSSGLGGGFLSGPSGEILDL----ESP-----FHRHQHTQLGHPSITGQQHMNMMSGLE 51 Query: 734 PAPVQRPTAVREVKDSNPRGFTMNFGKEKTFGASNVTINFNDDSTDDEPSY-EEGNEEGT 910 P + EVK N +NFGK K SN N+ S +DEPSY EEGN E Sbjct: 52 S---DHPIGLIEVKSLNA---ALNFGKGKAVAPSNS----NELSEEDEPSYAEEGNCENL 101 Query: 911 SGVKGKKDAPWQRMKWTDNSVRLLIQVVASVGDDGSLESPD-RXXXXXXXXXXXXXXXXS 1087 G K KK +PWQRMKWTDN VRLLI VV+ VGDDG++ D S Sbjct: 102 
DGGKSKKGSPWQRMKWTDNVVRLLITVVSCVGDDGTIGGMDCHKRKSGVLQKKGKWKTVS 161 Query: 1088 KILISKGYHVSPQQCEDKFNDLNKRYKKLNDILGRGMSCKVVENPGLLFGMHHLSEKAKE 1267 KI+I KG HVSPQQCEDKFNDLNKRYK+LNDILGRG C+VVENP L+ M +LS K K+ Sbjct: 162 KIMIGKGCHVSPQQCEDKFNDLNKRYKRLNDILGRGTCCQVVENPVLMDSMPNLSAKMKD 221 Query: 1268 DVKKLLGSKHLFYREICAYHNGQKIPDCSDLELPVHPATVAEQIWNHSDVSPAAANCSKD 1447 DV+K+L SKHLFY+E+CAYHNGQ+IP+ +L+L + ++ N S+D Sbjct: 222 DVRKILSSKHLFYKEMCAYHNGQRIPNSHELDLQGYSLEHGRDSRDN--------NGSED 273 Query: 1448 TDESPVAPGCLKGNNVXXXXXXXXXXXXXXXXXXXXXXNNACGSIGKMGEYRDRERMKPV 1627 DE NN NA G+M + DR ++ Sbjct: 274 EDED---------NN-----------DSEDDESDDEININAHEDGGRMQQLCDRNKLSEE 313 Query: 1628 E-NIRTQFSQKDNFEAEIAGIFDDPTKSQWERKQWIKKRMLQLQEERIGIQAEAVELEKR 1804 + + Q S+ D FE E+A +F DPTKS E+++WIK +MLQLQE+ I QA+A+ELEK+ Sbjct: 314 DVHFGPQTSRMDKFEVEMARVFQDPTKSLHEQREWIKIQMLQLQEQNISYQAQALELEKQ 373 Query: 1805 RLKWLRFCNXXXXXXXXXXXXXDRLMXXXXXXXXXXXXXXXXXXSRRPEASFSLTSFSID 1984 RLKWLR+C+ R+ E S S I+ Sbjct: 374 RLKWLRYCSKKDRELEKLRLENKRMKLENERRILKLKQKELEADFSTSEMSLDPASLGIN 433 Query: 1985 IMGGRDHMNSAR 2020 GR+H++ R Sbjct: 434 RPQGREHISLGR 445 >ref|XP_006470589.1| PREDICTED: uncharacterized protein LOC102621074 isoform X1 [Citrus sinensis] Length = 458 Score = 299 bits (765), Expect = 4e-78 Identities = 181/434 (41%), Positives = 238/434 (54%), Gaps = 9/434 (2%) Frame = +2 Query: 557 NNSGTGSGFLSGYTGGFLGMN----RQPQNPGGFLGLNRNHQQTTNNPAHTQKENVQSSM 724 N+SG G FLSG G L + R QN G +R HQ + ++ Sbjct: 3 NSSGLGGRFLSGQNVGLLDLESSIPRNQQNQLGHPSFSRPHQMNMMHGLENDHHHI---- 58 Query: 725 DIEPAPVQRPTAVREVKDSNPRGFTMNFGKEKTFGASNVTINFNDDSTDDEPSY-EEGNE 901 + EVK S+ +G MNFG+ K N + N N S +DEPSY +EGN Sbjct: 59 -----------GLLEVKGSSRKGLPMNFGRGKMVSPINASNNGNT-SEEDEPSYTDEGNG 106 Query: 902 EGTSGVKGKKDAPWQRMKWTDNSVRLLIQVVASVGDDGSLESPDRXXXXXXXXXXXXXXX 1081 E ++G + KK + W RMKWTDN VRLLI VA VGDDG+++ + Sbjct: 107 ENSNGGRDKKGSMWHRMKWTDNVVRLLIAAVACVGDDGTIDGVEGLKRKSGILQKKGKWK 166 Query: 1082 X-SKILISKGYHVSPQQCEDKFNDLNKRYKKLNDILGRGMSCKVVENPGLLFGM--HHLS 1252 SKI+IS+G 
VSPQQCEDKFNDLNKRYKKLNDILG+G++C+VVENP L+ M HLS Sbjct: 167 TVSKIMISRGCQVSPQQCEDKFNDLNKRYKKLNDILGKGLTCQVVENPALIDTMSCSHLS 226 Query: 1253 EKAKEDVKKLLGSKHLFYREICAYHNGQKIPDCSDLELPVHPATVAEQIWNHSDVSPAAA 1432 KAK+DV+K+LGSKHLFY+E+ AYHNG+KIP+C D++L Sbjct: 227 AKAKDDVRKILGSKHLFYKEMFAYHNGKKIPNCHDIDL---------------------Q 265 Query: 1433 NCSKDTDESPVAPGCLKGNNVXXXXXXXXXXXXXXXXXXXXXXNNACGSIGKMGEYRDRE 1612 CS +D SP KGN+ N G+M E +R Sbjct: 266 GCSVPSDRSP------KGNDESEEEEADRNNDTDDDESDNEDDRNDDEDEGRMAEIGERG 319 Query: 1613 RMKPVE-NIRTQFSQKDNFEAEIAGIFDDPTKSQWERKQWIKKRMLQLQEERIGIQAEAV 1789 + + + Q + + FE E+A IF DP+KS WER++WIKK+ L LQ +R+ IQA+A Sbjct: 320 KGNEEDGHSWPQSAGRSVFEMEMARIFQDPSKSMWERREWIKKQKLALQNQRVSIQAQAF 379 Query: 1790 ELEKRRLKWLRFCN 1831 ELEK+ LKWLR+C+ Sbjct: 380 ELEKQHLKWLRYCS 393 >ref|XP_006446099.1| hypothetical protein CICLE_v10015189mg [Citrus clementina] gi|557548710|gb|ESR59339.1| hypothetical protein CICLE_v10015189mg [Citrus clementina] Length = 458 Score = 299 bits (765), Expect = 4e-78 Identities = 181/434 (41%), Positives = 238/434 (54%), Gaps = 9/434 (2%) Frame = +2 Query: 557 NNSGTGSGFLSGYTGGFLGMN----RQPQNPGGFLGLNRNHQQTTNNPAHTQKENVQSSM 724 N+SG G FLSG G L + R QN G +R HQ + ++ Sbjct: 3 NSSGLGGRFLSGQNVGLLDLESSIPRNQQNQLGHPSFSRPHQMNMMHGLENDHHHI---- 58 Query: 725 DIEPAPVQRPTAVREVKDSNPRGFTMNFGKEKTFGASNVTINFNDDSTDDEPSY-EEGNE 901 + EVK S+ +G MNFG+ K N + N N S +DEPSY +EGN Sbjct: 59 -----------GLLEVKGSSRKGLPMNFGRGKMVSPINASNNGNT-SEEDEPSYTDEGNG 106 Query: 902 EGTSGVKGKKDAPWQRMKWTDNSVRLLIQVVASVGDDGSLESPDRXXXXXXXXXXXXXXX 1081 E ++G + KK + W RMKWTDN VRLLI VA VGDDG+++ + Sbjct: 107 ENSNGGRDKKGSMWHRMKWTDNVVRLLIAAVACVGDDGTIDGVEGLKRKSGILQKKGKWK 166 Query: 1082 X-SKILISKGYHVSPQQCEDKFNDLNKRYKKLNDILGRGMSCKVVENPGLLFGM--HHLS 1252 SKI+IS+G VSPQQCEDKFNDLNKRYKKLNDILG+G++C+VVENP L+ M HLS Sbjct: 167 TVSKIMISRGCQVSPQQCEDKFNDLNKRYKKLNDILGKGLTCQVVENPALIDTMSCSHLS 226 Query: 1253 EKAKEDVKKLLGSKHLFYREICAYHNGQKIPDCSDLELPVHPATVAEQIWNHSDVSPAAA 1432 KAK+DV+K+LGSKHLFY+E+ AYHNG+KIP+C D++L Sbjct: 227 
PKAKDDVRKILGSKHLFYKEMFAYHNGKKIPNCHDIDL---------------------Q 265 Query: 1433 NCSKDTDESPVAPGCLKGNNVXXXXXXXXXXXXXXXXXXXXXXNNACGSIGKMGEYRDRE 1612 CS +D SP KGN+ N G+M E +R Sbjct: 266 GCSVPSDRSP------KGNDESEEEEADRNNDTDDDESDNEDDRNDDEDEGRMAEIGERG 319 Query: 1613 RMKPVE-NIRTQFSQKDNFEAEIAGIFDDPTKSQWERKQWIKKRMLQLQEERIGIQAEAV 1789 + + + Q + + FE E+A IF DP+KS WER++WIKK+ L LQ +R+ IQA+A Sbjct: 320 KGNEEDGHSWPQSAGRSVFEMEMARIFQDPSKSMWERREWIKKQKLALQNQRVSIQAQAF 379 Query: 1790 ELEKRRLKWLRFCN 1831 ELEK+ LKWLR+C+ Sbjct: 380 ELEKQHLKWLRYCS 393 >ref|XP_003597439.1| hypothetical protein MTR_2g098080 [Medicago truncatula] gi|355486487|gb|AES67690.1| hypothetical protein MTR_2g098080 [Medicago truncatula] Length = 445 Score = 298 bits (763), Expect = 6e-78 Identities = 199/498 (39%), Positives = 265/498 (53%), Gaps = 7/498 (1%) Frame = +2 Query: 554 MNNSGTGSGFLSGYTGGFLGMNRQPQNPGGFLGLNRNHQQTTNNPAHTQKE-NVQSSMDI 730 MN SG G GFLSG TGG L + ++P NR+ QQT AH Q N+ + ++ Sbjct: 1 MNGSGLGGGFLSGPTGGILDL----ESP-----FNRHQQQT--QLAHGQHHMNMITGLEN 49 Query: 731 EPAPVQRPTAVREVKDSNPRGFTMNFGKEKTFGASNVTINFNDDSTDDEPSY-EEGN-EE 904 + + + EVK+ +NFGK K +SN N ND S DDE Y E+GN E Sbjct: 50 DSNQI----GLIEVKN-------LNFGKGKGIASSNHD-NSNDMSEDDEHGYGEDGNCEN 97 Query: 905 GTSGVKGKKDAPWQRMKWTDNSVRLLIQVVASVGDDGSLESPDRXXXXXXXXXXXXXXXX 1084 G KGKK +PWQRMKWTDN V LLI VV+ VG+DG++ D Sbjct: 98 FFDGGKGKKGSPWQRMKWTDNVVGLLIAVVSCVGEDGTISGVDGVKRKSGVVQKKGKWKT 157 Query: 1085 -SKILISKGYHVSPQQCEDKFNDLNKRYKKLNDILGRGMSCKVVENPGLLFGMHHLSEKA 1261 SKI+ISKG HVSPQQCEDKFNDLNKRYK+LN+ILGRG C+VVENP L+ M +LS KA Sbjct: 158 VSKIMISKGCHVSPQQCEDKFNDLNKRYKRLNEILGRGTCCQVVENPALMDSMVNLSAKA 217 Query: 1262 KEDVKKLLGSKHLFYREICAYHNGQKIPDCSDLELPVHPATVAEQIWNH--SDVSPAAAN 1435 K+DV+K+L SKHLFY+E+CAYHNGQ+IP+ DL+L + + +H SD N Sbjct: 218 KDDVRKILSSKHLFYKEMCAYHNGQRIPNSHDLDLHSYSLEHGKDSRDHDGSDDEDEDNN 277 Query: 1436 CSKDTDESPVAPGCLKGNNVXXXXXXXXXXXXXXXXXXXXXXNNACGSIGKMGEYRDRER 1615 S+D + G N+ NA G G+M + DR + Sbjct: 278 ESEDDELD-------NGINI-----------------------NARGDGGRMEGFCDRNK 307 
Query: 1616 MKPVE-NIRTQFSQKDNFEAEIAGIFDDPTKSQWERKQWIKKRMLQLQEERIGIQAEAVE 1792 + + + Q E+E+A +F DP KS WE+++WIK+++LQLQE+ + QA+A E Sbjct: 308 LSEEDGHFWPQSIGMKKLESEMARVFQDPVKSPWEKREWIKQQLLQLQEQNVDFQAKAFE 367 Query: 1793 LEKRRLKWLRFCNXXXXXXXXXXXXXDRLMXXXXXXXXXXXXXXXXXXSRRPEASFSLTS 1972 L+K++ KWLR+ + R+ R E S TS Sbjct: 368 LQKQQFKWLRYRSKKDRELEKLAMENKRMKFENEHRILKLKQREQEAEFSRSEMSLDPTS 427 Query: 1973 FSIDIMGGRDHMNSARFH 2026 I GR+H+N AR H Sbjct: 428 IGIKRPQGREHINLARQH 445 >ref|XP_004486956.1| PREDICTED: uncharacterized protein LOC101493778 isoform X1 [Cicer arietinum] Length = 444 Score = 289 bits (740), Expect = 3e-75 Identities = 186/492 (37%), Positives = 254/492 (51%), Gaps = 3/492 (0%) Frame = +2 Query: 554 MNNSGTGSGFLSGYTGGFLGMNRQPQNPGGFLGLNRNHQQTTNNPAHTQKENVQSSMDIE 733 MN SG G+GFLSG +GG L + ++P +R+ Q +P T + ++ +E Sbjct: 1 MNGSGLGAGFLSGPSGGILDL----ESP-----FHRHQQTQLGHPNVTGQHHMNIMTGLE 51 Query: 734 PAPVQRPTAVREVKDSNPRGFTMNFGKEKTFGASNVTINFNDDSTDDEPSY-EEGNEEGT 910 + EVK+ N ++FGK K +D S DDE Y E+GN E Sbjct: 52 N---DNRIGLIEVKNLNA---ALSFGKGKAIAE-------DDLSEDDEHGYAEDGNCENF 98 Query: 911 SGVKGKKDAPWQRMKWTDNSVRLLIQVVASVGDDGSLESPDRXXXXXXXXXXXXXXXX-S 1087 G KGKK +PWQRMKWTDN V LLI VV+ VGDDG++ D S Sbjct: 99 DGGKGKKGSPWQRMKWTDNVVGLLIAVVSCVGDDGTIGGVDGVKRKSGVLQKKGKWKTVS 158 Query: 1088 KILISKGYHVSPQQCEDKFNDLNKRYKKLNDILGRGMSCKVVENPGLLFGMHHLSEKAKE 1267 KI+ISKG HVSPQQCEDKFNDLNKRYK+LN+ILGRG C+VVENP L+ M +L+ K+K+ Sbjct: 159 KIMISKGCHVSPQQCEDKFNDLNKRYKRLNEILGRGTCCQVVENPALMDSMPNLTAKSKD 218 Query: 1268 DVKKLLGSKHLFYREICAYHNGQKIPDCSDLELPVHPATVAEQIWNHSDVSPAAANCSKD 1447 DV+K+L SKHLFY+E+CAYHNGQ+IP+ DL+L + + ++ S+D Sbjct: 219 DVRKILSSKHLFYKEMCAYHNGQRIPNSHDLDLHSYSLEHGKDSRDNDG--------SED 270 Query: 1448 TDESPVAPGCLKGNNVXXXXXXXXXXXXXXXXXXXXXXNNACGSIGKMGEYRDRERMKPV 1627 DE NN NA G G+M E DR ++ Sbjct: 271 EDED---------NNESEDDELDNEINI-----------NAHGHGGRMEEVYDRNKLSEE 310 Query: 1628 E-NIRTQFSQKDNFEAEIAGIFDDPTKSQWERKQWIKKRMLQLQEERIGIQAEAVELEKR 1804 + + + + E E+A +F DP S WER++WIK ++LQLQE+ +G Q A+EL+K+ Sbjct: 
311 DGHFWPRSVAMEKLEVEMARVFQDPAMSPWERREWIKVQLLQLQEQNVGYQVRALELQKQ 370 Query: 1805 RLKWLRFCNXXXXXXXXXXXXXDRLMXXXXXXXXXXXXXXXXXXSRRPEASFSLTSFSID 1984 R KWLR+C+ R+ R E S +S I+ Sbjct: 371 RFKWLRYCSKKDRELEKLRMENKRMKLENERRILKLKQRELEADVSRAEVSLDPSSIGIN 430 Query: 1985 IMGGRDHMNSAR 2020 GR+H+N R Sbjct: 431 RPQGREHINLGR 442 >ref|XP_002299233.2| hypothetical protein POPTR_0001s05670g, partial [Populus trichocarpa] gi|550346585|gb|EEE84038.2| hypothetical protein POPTR_0001s05670g, partial [Populus trichocarpa] Length = 417 Score = 288 bits (738), Expect = 5e-75 Identities = 173/429 (40%), Positives = 228/429 (53%), Gaps = 3/429 (0%) Frame = +2 Query: 554 MNNSGTGSGFLSGYTGGFLGMNRQPQNPGGFLGLNRNHQQTTNNP--AHTQKENVQSSMD 727 M+NSG G FLSG G L + ++P ++R+ Q +P AH + N+ Sbjct: 1 MDNSGLGGRFLSGPNSGLLDL----ESP-----IHRHQQSQLGHPSLAHQHQVNLVG--- 48 Query: 728 IEPAPVQRPTAVREVKDSNPRGFTMNFGKEKTFGASNVTINFNDDSTDDEPSYEEGNEEG 907 K N + N DD+ E+GN E Sbjct: 49 ------------------------------KAVSPFNCASSGNASEDDDQSFMEDGNGEN 78 Query: 908 TSGVKGKKDAPWQRMKWTDNSVRLLIQVVASVGDDGSLESPDRXXXXXXXXXXXXXXXX- 1084 ++GVKGKK +PWQRMKWTDN VRLLI VVA VGDDG+L + + Sbjct: 79 STGVKGKKGSPWQRMKWTDNVVRLLIAVVACVGDDGTLNAVEGLKRKSGLLQKKGKWKMV 138 Query: 1085 SKILISKGYHVSPQQCEDKFNDLNKRYKKLNDILGRGMSCKVVENPGLLFGMHHLSEKAK 1264 SK++ISKG HVSPQQCEDKFNDLNKRYK+LN+ILGRG +C+VVENP L+ M HLS KAK Sbjct: 139 SKLMISKGCHVSPQQCEDKFNDLNKRYKRLNEILGRGTTCRVVENPVLMDSMPHLSAKAK 198 Query: 1265 EDVKKLLGSKHLFYREICAYHNGQKIPDCSDLELPVHPATVAEQIWNHSDVSPAAANCSK 1444 +DV+K+LGSKHLFY+E+CAYHNGQ+IP+C DL+L S SK Sbjct: 199 DDVRKILGSKHLFYKEMCAYHNGQRIPNCQDLDL--------------QGCSLPLERSSK 244 Query: 1445 DTDESPVAPGCLKGNNVXXXXXXXXXXXXXXXXXXXXXXNNACGSIGKMGEYRDRERMKP 1624 D + S G + NNA ++G+ + Sbjct: 245 DNNGS--------GEDEAEGNGDSDDGDDDDDESDNEENNNADEDGERVGQLCEGRVNDE 296 Query: 1625 VENIRTQFSQKDNFEAEIAGIFDDPTKSQWERKQWIKKRMLQLQEERIGIQAEAVELEKR 1804 ++ +Q ++ F+ E+A IF DP S WERK+WIKK+ LQL E+R+ IQA+ ELEK+ Sbjct: 297 HAHLWSQSGGRNGFDVEMAAIFQDPAVSPWERKEWIKKQRLQLLEQRVSIQAQTFELEKQ 356 Query: 1805 
RLKWLRFCN 1831 R KWLR+C+ Sbjct: 357 RFKWLRYCS 365 >ref|XP_006470590.1| PREDICTED: uncharacterized protein LOC102621074 isoform X2 [Citrus sinensis] Length = 414 Score = 286 bits (733), Expect = 2e-74 Identities = 164/360 (45%), Positives = 214/360 (59%), Gaps = 5/360 (1%) Frame = +2 Query: 767 EVKDSNPRGFTMNFGKEKTFGASNVTINFNDDSTDDEPSY-EEGNEEGTSGVKGKKDAPW 943 EVK S+ +G MNFG+ K N + N N S +DEPSY +EGN E ++G + KK + W Sbjct: 18 EVKGSSRKGLPMNFGRGKMVSPINASNNGNT-SEEDEPSYTDEGNGENSNGGRDKKGSMW 76 Query: 944 QRMKWTDNSVRLLIQVVASVGDDGSLESPDRXXXXXXXXXXXXXXXX-SKILISKGYHVS 1120 RMKWTDN VRLLI VA VGDDG+++ + SKI+IS+G VS Sbjct: 77 HRMKWTDNVVRLLIAAVACVGDDGTIDGVEGLKRKSGILQKKGKWKTVSKIMISRGCQVS 136 Query: 1121 PQQCEDKFNDLNKRYKKLNDILGRGMSCKVVENPGLLFGM--HHLSEKAKEDVKKLLGSK 1294 PQQCEDKFNDLNKRYKKLNDILG+G++C+VVENP L+ M HLS KAK+DV+K+LGSK Sbjct: 137 PQQCEDKFNDLNKRYKKLNDILGKGLTCQVVENPALIDTMSCSHLSAKAKDDVRKILGSK 196 Query: 1295 HLFYREICAYHNGQKIPDCSDLELPVHPATVAEQIWNHSDVSPAAANCSKDTDESPVAPG 1474 HLFY+E+ AYHNG+KIP+C D++L CS +D SP Sbjct: 197 HLFYKEMFAYHNGKKIPNCHDIDL---------------------QGCSVPSDRSP---- 231 Query: 1475 CLKGNNVXXXXXXXXXXXXXXXXXXXXXXNNACGSIGKMGEYRDRERMKPVE-NIRTQFS 1651 KGN+ N G+M E +R + + + Q + Sbjct: 232 --KGNDESEEEEADRNNDTDDDESDNEDDRNDDEDEGRMAEIGERGKGNEEDGHSWPQSA 289 Query: 1652 QKDNFEAEIAGIFDDPTKSQWERKQWIKKRMLQLQEERIGIQAEAVELEKRRLKWLRFCN 1831 + FE E+A IF DP+KS WER++WIKK+ L LQ +R+ IQA+A ELEK+ LKWLR+C+ Sbjct: 290 GRSVFEMEMARIFQDPSKSMWERREWIKKQKLALQNQRVSIQAQAFELEKQHLKWLRYCS 349 >ref|XP_004161693.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized LOC101205501 [Cucumis sativus] Length = 443 Score = 286 bits (732), Expect = 3e-74 Identities = 179/432 (41%), Positives = 233/432 (53%), Gaps = 6/432 (1%) Frame = +2 Query: 554 MNNSGTGSGFLSGYTGGFLGMNRQPQNPGGFLGLNRNHQQTTNNPAHTQKENVQSSMDIE 733 M++SG G GFLSG GG L + ++P + R + NP+ TQ+ + + E Sbjct: 1 MDSSGLGGGFLSG-NGGLLDL----ESP-----IRRPQKTQLVNPSLTQRHQLNMMNNFE 50 Query: 734 PAPVQRPTAVREVKDSNPRGFTMNFGKEKTFGASNVTINFNDDSTDDEPSYEEGNEEGTS 913 + + + K + M F + K + 
+T N+ S +DEPSY E E + Sbjct: 51 GD--HQSIGILDSKSLGQKDLLMAFNRGKAIASGCITNNYT--SEEDEPSYTEDGE-CSE 105 Query: 914 GVKGKKDAPWQRMKWTDNSVRLLIQVVASVGDDGSLESPDRXXXXXXXXXXXXXXXXSKI 1093 +KGKK +PWQRMKWTD VRLLI VVA VGDDG + SKI Sbjct: 106 FLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGMGSKRNLGXLHKKGKWKTV-SKI 164 Query: 1094 LISKGYHVSPQQCEDKFNDLNKRYKKLNDILGRGMSCKVVENPGLLFGMHHLSEKAKEDV 1273 + SKG HVSPQQCEDKFNDLNKRYK+LNDILG+G SC+VVENP L+ M HLS KAK+DV Sbjct: 165 MQSKGCHVSPQQCEDKFNDLNKRYKRLNDILGKGTSCRVVENPALMDSMPHLSSKAKDDV 224 Query: 1274 KKLLGSKHLFYREICAYHNGQKIPDCSDLE-----LPVHPATVAEQIWNHSDVSPAAANC 1438 +K+L SKHLFY+E+CAYHNGQ IP C D++ LPV + SD + Sbjct: 225 RKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPVANFSKGNNESEDSDSDSDSGES 284 Query: 1439 SKDTDESPVAPGCLKGNNVXXXXXXXXXXXXXXXXXXXXXXNNACGSIGKMGEYRDRERM 1618 + D SPV N S E R R+++ Sbjct: 285 DNEDDHSPV--------------------------------ENRLWS----SESRGRDKV 308 Query: 1619 KPVEN-IRTQFSQKDNFEAEIAGIFDDPTKSQWERKQWIKKRMLQLQEERIGIQAEAVEL 1795 + + + K+ FE +I DPTKS WERK WIKK+MLQLQE+ QA++VEL Sbjct: 309 SADDGPLWSNSVGKNEFEGQIDVFLSDPTKSHWERKVWIKKQMLQLQEQCNSFQAQSVEL 368 Query: 1796 EKRRLKWLRFCN 1831 EK+R KWLR+C+ Sbjct: 369 EKQRFKWLRYCS 380 >ref|XP_004142119.1| PREDICTED: uncharacterized protein LOC101205501 [Cucumis sativus] Length = 443 Score = 286 bits (732), Expect = 3e-74 Identities = 178/432 (41%), Positives = 232/432 (53%), Gaps = 6/432 (1%) Frame = +2 Query: 554 MNNSGTGSGFLSGYTGGFLGMNRQPQNPGGFLGLNRNHQQTTNNPAHTQKENVQSSMDIE 733 M++SG G GFLSG GG L + ++P + R + NP+ TQ+ + + E Sbjct: 1 MDSSGLGGGFLSG-NGGLLDL----ESP-----IRRPQKTQLVNPSLTQRHQLNMMNNFE 50 Query: 734 PAPVQRPTAVREVKDSNPRGFTMNFGKEKTFGASNVTINFNDDSTDDEPSYEEGNEEGTS 913 + + + K + M F + K + +T N+ S +DEPSY E E + Sbjct: 51 GD--HQSIGILDSKSLGQKDLLMAFNRGKAIASGCITNNYT--SEEDEPSYTEDGE-CSE 105 Query: 914 GVKGKKDAPWQRMKWTDNSVRLLIQVVASVGDDGSLESPDRXXXXXXXXXXXXXXXXSKI 1093 +KGKK +PWQRMKWTD VRLLI VVA VGDDG + SKI Sbjct: 106 FLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKWKTV-SKI 164 Query: 1094 
LISKGYHVSPQQCEDKFNDLNKRYKKLNDILGRGMSCKVVENPGLLFGMHHLSEKAKEDV 1273 + SKG HVSPQQCEDKFNDLNKRYK+LNDILG+G SC+VVENP L+ M HLS KAK+DV Sbjct: 165 MQSKGCHVSPQQCEDKFNDLNKRYKRLNDILGKGTSCRVVENPALMDSMPHLSSKAKDDV 224 Query: 1274 KKLLGSKHLFYREICAYHNGQKIPDCSDLE-----LPVHPATVAEQIWNHSDVSPAAANC 1438 +K+L SKHLFY+E+CAYHNGQ IP C D++ LPV + SD + Sbjct: 225 RKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPVANFSKGNNESEDSDSDSDSGES 284 Query: 1439 SKDTDESPVAPGCLKGNNVXXXXXXXXXXXXXXXXXXXXXXNNACGSIGKMGEYRDRERM 1618 + D SPV N S E R R+++ Sbjct: 285 DNEDDHSPV--------------------------------ENRLWS----SESRGRDKV 308 Query: 1619 KPVEN-IRTQFSQKDNFEAEIAGIFDDPTKSQWERKQWIKKRMLQLQEERIGIQAEAVEL 1795 + + + K+ FE +I DPTKS WERK WIKK+MLQLQE+ QA++VEL Sbjct: 309 SADDGPLWSNSVGKNEFEGQIDVFLSDPTKSHWERKVWIKKQMLQLQEQCNSFQAQSVEL 368 Query: 1796 EKRRLKWLRFCN 1831 EK+R KWLR+C+ Sbjct: 369 EKQRFKWLRYCS 380 >ref|XP_003542048.1| PREDICTED: uncharacterized protein LOC100801014 [Glycine max] Length = 439 Score = 286 bits (732), Expect = 3e-74 Identities = 187/470 (39%), Positives = 241/470 (51%), Gaps = 9/470 (1%) Frame = +2 Query: 638 GGFLGLN---RNHQQTT---NNPAHTQKENVQSSMDIEPAPVQRPTAVREVKDSNPRGFT 799 GG L L HQ T + Q N+ S ++ + P + EVK N Sbjct: 8 GGILDLESPFHRHQHTQLGQQSITGQQHINIMSGLESD-----HPIGLIEVKSLN---VA 59 Query: 800 MNFGKEKTFGASNVTINFNDDSTDDEPSY-EEGNEEGTSGVKGKKDAPWQRMKWTDNSVR 976 +NFGK K SN N+ S +DEPSY EEGN E G KK +PWQRMKW DN VR Sbjct: 60 LNFGKAKALAPSNS----NELSEEDEPSYAEEGNCENLDGGNSKKGSPWQRMKWADNVVR 115 Query: 977 LLIQVVASVGDDGSLESPD-RXXXXXXXXXXXXXXXXSKILISKGYHVSPQQCEDKFNDL 1153 LLI VV+ VGDDG++ D SKI+I KG HVSPQQCEDKFNDL Sbjct: 116 LLITVVSCVGDDGTIGGMDGHKRKSGVLQKKGKWKMVSKIMIGKGCHVSPQQCEDKFNDL 175 Query: 1154 NKRYKKLNDILGRGMSCKVVENPGLLFGMHHLSEKAKEDVKKLLGSKHLFYREICAYHNG 1333 NKRYK+LNDILGRG C+VVENP L+ M +LS K K+DV+K+L SKHLFY+E+CAYHNG Sbjct: 176 NKRYKRLNDILGRGTCCQVVENPVLMDSMPNLSAKMKDDVRKILSSKHLFYKEMCAYHNG 235 Query: 1334 QKIPDCSDLELPVHPATVAEQIWNHSDVSPAAANCSKDTDESPVAPGCLKGNNVXXXXXX 1513 Q+IP+ +L+LP + ++ N S+D DE NN Sbjct: 236 
QRIPNSHELDLPGYSLEHGRDSRDN--------NGSEDEDED---------NN------- 271 Query: 1514 XXXXXXXXXXXXXXXXNNACGSIGKMGEYRDRERMKPVE-NIRTQFSQKDNFEAEIAGIF 1690 NA G+M E DR ++ + + Q S+ D FE E+A +F Sbjct: 272 ----DSEDDESDDEINTNAHEDGGRMQELCDRNKLSDEDVHFGPQTSRMDKFEVEMARVF 327 Query: 1691 DDPTKSQWERKQWIKKRMLQLQEERIGIQAEAVELEKRRLKWLRFCNXXXXXXXXXXXXX 1870 DPTK E+++WIK +MLQLQE+ I QA+A+ELEK+RLKWLR+C+ Sbjct: 328 QDPTKLLREQREWIKIQMLQLQEQNISYQAQALELEKQRLKWLRYCSKKDRELGKLRLEN 387 Query: 1871 DRLMXXXXXXXXXXXXXXXXXXSRRPEASFSLTSFSIDIMGGRDHMNSAR 2020 R+ E S S I+ GR+H++ R Sbjct: 388 KRMKLENEHRILKLKQKELEADFSTSEMSLDPASLGINRTQGREHISLGR 437 >ref|XP_002303892.2| hypothetical protein POPTR_0003s20370g [Populus trichocarpa] gi|550343621|gb|EEE78871.2| hypothetical protein POPTR_0003s20370g [Populus trichocarpa] Length = 425 Score = 285 bits (728), Expect = 7e-74 Identities = 185/497 (37%), Positives = 249/497 (50%), Gaps = 6/497 (1%) Frame = +2 Query: 554 MNNSGTGSGFLSGYTGGFLGMNRQPQNPGGFLGLNRNHQQTTNNP--AHTQKENVQSSMD 727 M+NSG FLS +GG L + ++P ++R+ Q +P AH + NV Sbjct: 1 MDNSGLRGRFLSDSSGGLLDL----ESP-----IHRHQQSQLGHPSLAHQHQMNVMG--- 48 Query: 728 IEPAPVQRPTAVREVKDSNPRGFTMNFGKEKTFGASNVTINFNDDSTDDEPSY-EEGNEE 904 R N GA + D S DD+ S+ E+GN E Sbjct: 49 ------------RAASPFN--------------GAHS-----GDGSEDDDQSFMEDGNGE 77 Query: 905 GTSGVKGKKDAPWQRMKWTDNSVRLLIQVVASVGDDGSLESPDRXXXXXXXXXXXXXXXX 1084 ++G KGK+ +PWQRMKWTDN VRLLI VVA VGDD + + Sbjct: 78 NSTGAKGKQGSPWQRMKWTDNIVRLLISVVACVGDDDTFDGTGGLKRKSGLLQKKGKWKT 137 Query: 1085 -SKILISKGYHVSPQQCEDKFNDLNKRYKKLNDILGRGMSCKVVENPGLLFGMHHLSEKA 1261 SK++I KG HVSPQQCEDKFNDLNKRYK+LN+ILGRG SC+VVENP L+ M HLS KA Sbjct: 138 VSKLMIGKGCHVSPQQCEDKFNDLNKRYKRLNEILGRGTSCRVVENPALMDSMPHLSAKA 197 Query: 1262 KEDVKKLLGSKHLFYREICAYHNGQKIPDCSDLELP--VHPATVAEQIWNHSDVSPAAAN 1435 K+DV+K+L SKHLFY+EICAYHNGQ+IP+C D +L P + N S N Sbjct: 198 KDDVRKILSSKHLFYKEICAYHNGQRIPNCQDFDLQGCSLPLERCSKDMNGSGGDEVEGN 257 Query: 1436 CSKDTDESPVAPGCLKGNNVXXXXXXXXXXXXXXXXXXXXXXNNACGSIGKMGEYRDRER 1615 D DES NN NNA + +G+ +R Sbjct: 258 
DDSDDDES---------NN--------------------EADNNADENGESVGQLCERIV 288 Query: 1616 MKPVENIRTQFSQKDNFEAEIAGIFDDPTKSQWERKQWIKKRMLQLQEERIGIQAEAVEL 1795 + ++ +Q ++++F E+ IF D S WERK+WIKK+ LQL E+R+ I+A+A EL Sbjct: 289 NEEHSHLCSQSGRQNSFGVEMTAIFQDTNVSPWERKEWIKKQRLQLLEQRVNIKAQAFEL 348 Query: 1796 EKRRLKWLRFCNXXXXXXXXXXXXXDRLMXXXXXXXXXXXXXXXXXXSRRPEASFSLTSF 1975 EK++ KWLR+C+ + + R E ++ TS Sbjct: 349 EKQQFKWLRYCSKKDKEFERLRLENEMMRLENRQSAFQLRQKQLEMGLRSSEPAYDPTSL 408 Query: 1976 SIDIMGGRDHMNSARFH 2026 ID + GRD ++ R H Sbjct: 409 GIDRVQGRDQIDLGRHH 425