BLASTX nr result
ID: Ephedra28_contig00003295
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Ephedra28_contig00003295 (1721 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002263384.1| PREDICTED: uncharacterized protein LOC100245... 162 5e-37 gb|EOY05193.1| Uncharacterized protein isoform 1 [Theobroma cacao] 159 4e-36 gb|EOY05196.1| Uncharacterized protein isoform 4, partial [Theob... 158 6e-36 ref|XP_006493066.1| PREDICTED: uncharacterized protein LOC102620... 155 5e-35 ref|XP_006493067.1| PREDICTED: uncharacterized protein LOC102620... 151 7e-34 ref|XP_002518043.1| conserved hypothetical protein [Ricinus comm... 149 5e-33 ref|XP_006418827.1| hypothetical protein EUTSA_v10002763mg, part... 139 5e-30 ref|XP_006297761.1| hypothetical protein CARUB_v10013795mg [Caps... 139 5e-30 ref|NP_189033.1| uncharacterized protein [Arabidopsis thaliana] ... 137 1e-29 ref|XP_002885604.1| hypothetical protein ARALYDRAFT_342541 [Arab... 137 2e-29 gb|EOY05198.1| Uncharacterized protein isoform 6 [Theobroma cacao] 134 1e-28 gb|EOY05197.1| Uncharacterized protein isoform 5 [Theobroma cacao] 133 3e-28 ref|NP_189068.2| RNA-directed DNA polymerase (reverse transcript... 132 4e-28 ref|NP_001154643.1| RNA-directed DNA polymerase (reverse transcr... 132 5e-28 dbj|BAB02924.1| unnamed protein product [Arabidopsis thaliana] 132 6e-28 ref|XP_004133985.1| PREDICTED: uncharacterized protein LOC101211... 128 7e-27 gb|EMJ26895.1| hypothetical protein PRUPE_ppa006350mg [Prunus pe... 127 1e-26 ref|XP_002300157.1| hypothetical protein POPTR_0001s32530g [Popu... 125 7e-26 gb|EOY05194.1| Uncharacterized protein isoform 2 [Theobroma caca... 122 4e-25 gb|EOY05199.1| Uncharacterized protein isoform 7, partial [Theob... 122 6e-25 >ref|XP_002263384.1| PREDICTED: uncharacterized protein LOC100245254 [Vitis vinifera] gi|298205214|emb|CBI17273.3| unnamed protein product [Vitis vinifera] Length = 425 Score = 162 bits (409), Expect = 5e-37 Identities = 119/427 (27%), Positives = 199/427 (46%) Frame = +1 Query: 178 SKLDLGNRLRTKLVKLQDDLQGLEEEVTSVSNFDTFNDSHLLRLQETLADMPLEYLKFSG 357 S++ NR+ T + D + S S F F+ R+ + L+ ++S Sbjct: 17 SRMSELNRIHTNYSHISDS-----NPLDSRSLFQEFSHHLQSRVNQILS-------QYSD 64 Query: 358 SDDNVNDDFDMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKDTNNLTEDIELLTTYIE 537 + DD D + +K EL +VE E AK NEIE L +D+N L D+E+L ++ Sbjct: 65 VESLEADDLDAYLGHLKKELNLVESENAKISNEIEALTRTYVEDSNQLESDLEVLKHSVD 124 Query: 538 FWQNDDNVGGGTDKNAQIKELENKIEQQKSEESKYDDISRFEICLLSGENFMFEALELED 717 F + G + + + + +E Q + + D + FE L+L Sbjct: 125 FVASQ---GLKRAEAGALVDYSSSVEDQLDSRTAHGDNN-------------FEILDLNY 168 Query: 718 QLIRKRQKYGELRMLDDKSKRVDALSMIEESLSDIKDIGLLENCIKLTLKTFIPAKGGLI 897 Q + + L+ LD KR +A+ IE++L+ +K I NCI+L+L TFIP GL+ Sbjct: 169 QTQKNKITLKSLQDLDYTFKRFEAIEKIEDALTGLKVIDFEGNCIRLSLSTFIPNLEGLL 228 Query: 898 CQFEINNIGESVMVEHVLKVKVEKDSTTFLDAELSXXXXXXXXXXHMSKYSSDASDKNIS 1077 C+ +I + E + H L ++V S + E+ +K S Sbjct: 229 CEEKIEAVNEPSELNHELLIEVMDQSMELKNVEIFPNDVYLGEIIDAAKSSRKLFSHMSI 288 Query: 1078 LNVNGQLNSLVREIQYRIFCYTLRNSVLEHDGKASRNLFQYSSRDQVITVNVFGGIKAFI 1257 L L VR++Q +I LR S+++ K SR+ +Y RD++I ++ GG+ A+I Sbjct: 289 LETRSSLEWFVRKVQDKIILCALRQSIVKGANK-SRHSLEYLDRDEIIVAHMVGGVDAYI 347 Query: 1258 KLPQXXXXXXXXXXXXXXKQVDESAGDISLNMLCRAMEILNELELSRRHRLLPFIDAVEE 1437 K+ Q K D+ + ISL+ LC+ E+ N L++S R + F+DA+EE Sbjct: 348 KVCQGWPVSNNALKLKSLKSSDQQSKGISLSFLCKVEEMANSLDVSIRKNISSFVDAIEE 407 Query: 1438 ILLQQVK 1458 IL+QQ++ Sbjct: 408 ILVQQMQ 414 >gb|EOY05193.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 430 Score = 159 bits (402), Expect = 4e-36 Identities = 130/447 (29%), Positives = 212/447 (47%), Gaps = 6/447 (1%) Frame = +1 Query: 151 EEMAHSNAGSKLDLGNRLRTKLVKLQD----DLQGLEEEVTSVSNFDTFNDSHLLRLQET 318 E M S++ LDL + +R+++ +L + D E E S+++ D L + Sbjct: 3 EPMEISSSSEALDL-HSIRSRINELSEIHRIDKNKDEGEALSLNSEKLLKDCSL-HFESK 60 Query: 319 LADMPLEY--LKFSGSDDNVNDDFDMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKDT 492 + + EY + F G +D D + +K EL VE E AK NEIE+L+ +++ Sbjct: 61 VKQIIEEYSDVGFLGIED-----LDEYLAHLKEELNQVEAESAKISNEIEDLSRNHIEES 115 Query: 493 NNLTEDIELLTTYIEFWQNDDNVGGGTDKNAQIKELENKIEQQKSEESKYDDISRFEICL 672 N L ++E L ++ + G +E+ +S +D + + + Sbjct: 116 NILEGNLEGLKYALDSIASQGMEG---------------VEEDPCLDSSMNDEDQSNL-M 159 Query: 673 LSGENFMFEALELEDQLIRKRQKYGELRMLDDKSKRVDALSMIEESLSDIKDIGLLENCI 852 S E FE +ELE Q+ + L+ LD KR+D L IE++L+ +K IG NCI Sbjct: 160 HSNEEQKFEIMELESQIEKNNIILKSLQDLDSMFKRLDTLEQIEDALTGLKVIGFDGNCI 219 Query: 853 KLTLKTFIPAKGGLICQFEINNIGESVMVEHVLKVKVEKDSTTFLDAELSXXXXXXXXXX 1032 +L+L+T+IP GL+CQ I +I E + H L V++ + + E+ Sbjct: 220 RLSLQTYIPKLEGLLCQKTIEDISEPSEMNHELLVEIVDGTMEIKNVEMFPNDVYLGDII 279 Query: 1033 HMSKYSSDASDKNISLNVNGQLNSLVREIQYRIFCYTLRNSVLEHDGKASRNLFQYSSRD 1212 +K S L V ++Q RI TLR +++ K SR+ F+Y RD Sbjct: 280 DAAKSFRQLSSNLTVQQTQSSLEWFVGKVQDRIILSTLRRFIVKSTNK-SRHSFEYLERD 338 Query: 1213 QVITVNVFGGIKAFIKLPQXXXXXXXXXXXXXXKQVDESAGDISLNMLCRAMEILNELEL 1392 + I ++ GGI AFIKL Q K D + ISL++LC+A E+ N L++ Sbjct: 339 ETIVAHLVGGIDAFIKLSQGWPLSKSPLKLLSIKSSDHHSRGISLSLLCKAEEMANSLDM 398 Query: 1393 SRRHRLLPFIDAVEEILLQQVKKSSKS 1473 R L F+DAVE++LL+Q++ +S Sbjct: 399 HIRQNLSAFVDAVEKLLLEQMRLDLQS 425 >gb|EOY05196.1| Uncharacterized protein isoform 4, partial [Theobroma cacao] Length = 372 Score = 158 bits (400), Expect = 6e-36 Identities = 112/366 (30%), Positives = 178/366 (48%) Frame = +1 Query: 376 DDFDMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKDTNNLTEDIELLTTYIEFWQNDD 555 +D D + +K EL VE E AK NEIE+L+ +++N L ++E L ++ + Sbjct: 19 EDLDEYLAHLKEELNQVEAESAKISNEIEDLSRNHIEESNILEGNLEGLKYALDSIASQG 78 Query: 556 NVGGGTDKNAQIKELENKIEQQKSEESKYDDISRFEICLLSGENFMFEALELEDQLIRKR 735 G +E+ +S +D + + + S E FE +ELE Q+ + Sbjct: 79 MEG---------------VEEDPCLDSSMNDEDQSNL-MHSNEEQKFEIMELESQIEKNN 122 Query: 736 QKYGELRMLDDKSKRVDALSMIEESLSDIKDIGLLENCIKLTLKTFIPAKGGLICQFEIN 915 L+ LD KR+D L IE++L+ +K IG NCI+L+L+T+IP GL+CQ I Sbjct: 123 IILKSLQDLDSMFKRLDTLEQIEDALTGLKVIGFDGNCIRLSLQTYIPKLEGLLCQKTIE 182 Query: 916 NIGESVMVEHVLKVKVEKDSTTFLDAELSXXXXXXXXXXHMSKYSSDASDKNISLNVNGQ 1095 +I E + H L V++ + + E+ +K S Sbjct: 183 DISEPSEMNHELLVEIVDGTMEIKNVEMFPNDVYLGDIIDAAKSFRQLSSNLTVQQTQSS 242 Query: 1096 LNSLVREIQYRIFCYTLRNSVLEHDGKASRNLFQYSSRDQVITVNVFGGIKAFIKLPQXX 1275 L V ++Q RI TLR +++ K SR+ F+Y RD+ I ++ GGI AFIKL Q Sbjct: 243 LEWFVGKVQDRIILSTLRRFIVKSTNK-SRHSFEYLERDETIVAHLVGGIDAFIKLSQGW 301 Query: 1276 XXXXXXXXXXXXKQVDESAGDISLNMLCRAMEILNELELSRRHRLLPFIDAVEEILLQQV 1455 K D + ISL++LC+A E+ N L++ R L F+DAVE++LL+Q+ Sbjct: 302 PLSKSPLKLLSIKSSDHHSRGISLSLLCKAEEMANSLDMHIRQNLSAFVDAVEKLLLEQM 361 Query: 1456 KKSSKS 1473 + +S Sbjct: 362 RLDLQS 367 >ref|XP_006493066.1| PREDICTED: uncharacterized protein LOC102620884 isoform X1 [Citrus sinensis] Length = 447 Score = 155 bits (392), Expect = 5e-35 Identities = 127/446 (28%), Positives = 214/446 (47%), Gaps = 9/446 (2%) Frame = +1 Query: 148 VEEMAHSNAGSKLDLGNRLRTKLVKLQD-DLQGLEEEVTSVSNFDTFNDSHLLR-----L 309 VE A ++ S LDL + LR+++ +L + G+E+E +VS+ + +LL+ Sbjct: 11 VEATATPSSSSPLDL-HSLRSEVKELMEIHRSGIEDEPNTVSS----DSENLLKEYAHDF 65 Query: 310 QETLADMPLEY--LKFSGSDDNVNDDFDMEIQSVKNELAIVEREMAKSVNEIENLAAAIA 483 + + ++ EY + F G +D D ++ +K EL VE E +K NEIE L Sbjct: 66 ESKVKEIITEYADVSFLGIED-----LDAYLEHLKEELKTVEAESSKISNEIETLTRTQV 120 Query: 484 KDTNNLTEDIELLTTYIEFWQNDDNVGGGTDKNAQIKEL-ENKIEQQKSEESKYDDISRF 660 +D++ L D+E L I+ ++ + D+ A E+++ +E+ D+ + Sbjct: 121 EDSDRLESDLEELNCAIDLIVSEGSQNAKEDRQAVCPARGEDQVCPTHTEDQS--DLIKI 178 Query: 661 EICLLSGENFMFEALELEDQLIRKRQKYGELRMLDDKSKRVDALSMIEESLSDIKDIGLL 840 E+ FE LELE Q+ + + L+ LD KR DA+ IE+SL+ +K I Sbjct: 179 H------EDHRFEILELESQIEKNKIILNSLQDLDFVLKRFDAVEQIEDSLTGLKVIDFD 232 Query: 841 ENCIKLTLKTFIPAKGGLICQFEINNIGESVMVEHVLKVKVEKDSTTFLDAELSXXXXXX 1020 C +L+++T+IP Q +I ++ E V H L ++V + + E+ Sbjct: 233 GKCFRLSMQTYIPTLEESSFQHKIEDVIEPSEVNHELLIEVIDGTMEIKNVEMFPNDVHI 292 Query: 1021 XXXXHMSKYSSDASDKNISLNVNGQLNSLVREIQYRIFCYTLRNSVLEHDGKASRNLFQY 1200 +K + + SL + L +R +Q RI TLR V++ K SR+ F+Y Sbjct: 293 SDLVDAAKSFRQSGTQLDSLETSSSLQWFIRNVQDRIILSTLRRFVVKTANK-SRHFFEY 351 Query: 1201 SSRDQVITVNVFGGIKAFIKLPQXXXXXXXXXXXXXXKQVDESAGDISLNMLCRAMEILN 1380 RD++I ++ GG+ AFIK Q K D + ISL+ CR E N Sbjct: 352 FERDEMIVAHLVGGVDAFIKPSQGWPLSNSPLKVISLKNSDHHSKGISLSFFCRVEEAAN 411 Query: 1381 ELELSRRHRLLPFIDAVEEILLQQVK 1458 L++ R L F+D VE+ILL+Q++ Sbjct: 412 SLDVHIRQNLSSFVDGVEKILLEQMR 437 >ref|XP_006493067.1| PREDICTED: uncharacterized protein LOC102620884 isoform X2 [Citrus sinensis] Length = 444 Score = 151 bits (382), Expect = 7e-34 Identities = 126/446 (28%), Positives = 211/446 (47%), Gaps = 9/446 (2%) Frame = +1 Query: 148 VEEMAHSNAGSKLDLGNRLRTKLVKLQD-DLQGLEEEVTSVSNFDTFNDSHLLR-----L 309 VE A ++ S LDL + LR+++ +L + G+E+E +VS+ + +LL+ Sbjct: 11 VEATATPSSSSPLDL-HSLRSEVKELMEIHRSGIEDEPNTVSS----DSENLLKEYAHDF 65 Query: 310 QETLADMPLEY--LKFSGSDDNVNDDFDMEIQSVKNELAIVEREMAKSVNEIENLAAAIA 483 + + ++ EY + F G +D D ++ +K EL VE E +K NEIE L Sbjct: 66 ESKVKEIITEYADVSFLGIED-----LDAYLEHLKEELKTVEAESSKISNEIETLTRTQV 120 Query: 484 KDTNNLTEDIELLTTYIEFWQNDDNVGGGTDKNAQIKELENKIEQQKSEESKYDDISRFE 663 +D++ L D+E L I+ +++ KE + + E+ + + Sbjct: 121 EDSDRLESDLEELNCAIDLIVSEN-----------AKEDRQAVCPARGEDQVCPTHTEDQ 169 Query: 664 ICLLS-GENFMFEALELEDQLIRKRQKYGELRMLDDKSKRVDALSMIEESLSDIKDIGLL 840 L+ E+ FE LELE Q+ + + L+ LD KR DA+ IE+SL+ +K I Sbjct: 170 SDLIKIHEDHRFEILELESQIEKNKIILNSLQDLDFVLKRFDAVEQIEDSLTGLKVIDFD 229 Query: 841 ENCIKLTLKTFIPAKGGLICQFEINNIGESVMVEHVLKVKVEKDSTTFLDAELSXXXXXX 1020 C +L+++T+IP Q +I ++ E V H L ++V + + E+ Sbjct: 230 GKCFRLSMQTYIPTLEESSFQHKIEDVIEPSEVNHELLIEVIDGTMEIKNVEMFPNDVHI 289 Query: 1021 XXXXHMSKYSSDASDKNISLNVNGQLNSLVREIQYRIFCYTLRNSVLEHDGKASRNLFQY 1200 +K + + SL + L +R +Q RI TLR V++ K SR+ F+Y Sbjct: 290 SDLVDAAKSFRQSGTQLDSLETSSSLQWFIRNVQDRIILSTLRRFVVKTANK-SRHFFEY 348 Query: 1201 SSRDQVITVNVFGGIKAFIKLPQXXXXXXXXXXXXXXKQVDESAGDISLNMLCRAMEILN 1380 RD++I ++ GG+ AFIK Q K D + ISL+ CR E N Sbjct: 349 FERDEMIVAHLVGGVDAFIKPSQGWPLSNSPLKVISLKNSDHHSKGISLSFFCRVEEAAN 408 Query: 1381 ELELSRRHRLLPFIDAVEEILLQQVK 1458 L++ R L F+D VE+ILL+Q++ Sbjct: 409 SLDVHIRQNLSSFVDGVEKILLEQMR 434 >ref|XP_002518043.1| conserved hypothetical protein [Ricinus communis] gi|223542639|gb|EEF44176.1| conserved hypothetical protein [Ricinus communis] Length = 415 Score = 149 bits (375), Expect = 5e-33 Identities = 118/416 (28%), Positives = 196/416 (47%), Gaps = 13/416 (3%) Frame = +1 Query: 250 EEVTSVSNFDT-FNDSHLLRLQETLA---DMPLEYLKFSGSDDNVN--DDFDMEIQSVKN 411 EE+ S N DT SH ++ E A + ++ + SD N +D D ++ +K Sbjct: 18 EEIYSGCNGDTEMLSSHSDQVLEDCALHLESKVQQIMSECSDFNFLGIEDLDAFVEHLKE 77 Query: 412 ELAIVEREMAKSVNEIENLAAAIAKDTNNLTEDIELLTTYIEFWQNDDNVGGGTDKNAQI 591 EL+ E AK EIE L +D L DIE+L ++F + Sbjct: 78 ELSTTMSETAKISTEIEALNRNHMEDFTRLESDIEMLKCSLDFISS-------------- 123 Query: 592 KELENKIEQQKSEESKYDDISRFEICLLSGENFMFEALELEDQLIRKRQKYGELRMLDDK 771 K++E + E E+ D R ++ FE +L+DQ+ + + L+ D Sbjct: 124 KDVEKEKEVACREDLYSTDAHR---------DYEFEISKLDDQIAKSKMILKSLQDFDSV 174 Query: 772 SKRVDALSMIEESLSDIKDIGLLENCIKLTLKTFIPAKGGLICQFEINNIGESVMVEHVL 951 KRVDA+ IEE+LS +K I +CI+L+L+T++P ++CQ + + E V H L Sbjct: 175 FKRVDAVEQIEEALSGLKVIEFDGSCIRLSLRTYLPKLDDVMCQHKTEDTAEPSEVNHEL 234 Query: 952 KVKVEKDSTTFLDAELSXXXXXXXXXXHMSK-------YSSDASDKNISLNVNGQLNSLV 1110 ++V + + E+ +K YS+ + S L LV Sbjct: 235 LIEVVSGTMELKNVEIFPNDIYISDIVDAAKSFRKEFLYSALTESETRS-----SLGWLV 289 Query: 1111 REIQYRIFCYTLRNSVLEHDGKASRNLFQYSSRDQVITVNVFGGIKAFIKLPQXXXXXXX 1290 R++Q RI +TLR V++ K SR F+Y RD+ + ++ GG+ AFIKL Q Sbjct: 290 RKVQDRIIQFTLRRLVVKSSNK-SRYSFEYLDRDETVVAHLVGGVDAFIKLSQGWPVSRS 348 Query: 1291 XXXXXXXKQVDESAGDISLNMLCRAMEILNELELSRRHRLLPFIDAVEEILLQQVK 1458 K + + +ISL+ LCR E++N L++ R LL F++ +E++L++Q++ Sbjct: 349 PLKLISLKSSNHHSKEISLSFLCRVEEVVNSLDIQMRLNLLSFVEVIEKLLVEQMR 404 >ref|XP_006418827.1| hypothetical protein EUTSA_v10002763mg, partial [Eutrema salsugineum] gi|557096755|gb|ESQ37263.1| hypothetical protein EUTSA_v10002763mg, partial [Eutrema salsugineum] Length = 355 Score = 139 bits (349), Expect = 5e-30 Identities = 102/368 (27%), Positives = 172/368 (46%), Gaps = 5/368 (1%) Frame = +1 Query: 385 DMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKDTNNLTEDIELLTTYIEFWQNDDNVG 564 D ++ ++ EL VE E AK EIE L+++ A+D++ L D+E L ++F + Sbjct: 5 DAYLEYLRKELHSVEAESAKVSEEIERLSSSHAEDSSRLDRDLEGLLLSLDFLSSQ---- 60 Query: 565 GGTDKNAQIKELENKIEQQKSEESKYDDISRFEICLLS-----GENFMFEALELEDQLIR 729 E QKS+E+ S E C S ++ F+ ELE+Q+ Sbjct: 61 ----------------EVQKSKENP-PSTSSMERCDASTWIDVNDDEKFKMFELENQIEE 103 Query: 730 KRQKYGELRMLDDKSKRVDALSMIEESLSDIKDIGLLENCIKLTLKTFIPAKGGLICQFE 909 KR+ L LD KR DA +E++L+ +K + N I+L L+T+IP GL+ Q + Sbjct: 104 KRRILKSLENLDSVCKRFDAAEQVEDALTGLKVLEFDGNFIRLQLRTYIPKLDGLLGQHK 163 Query: 910 INNIGESVMVEHVLKVKVEKDSTTFLDAELSXXXXXXXXXXHMSKYSSDASDKNISLNVN 1089 + + E + H L + ++ +T E+ + + L+ Sbjct: 164 LLHNTEPSELIHELLIDLKDKTTEITKVEMLPNDVYIGDITDAADSFRQIRLHSALLDTR 223 Query: 1090 GQLNSLVREIQYRIFCYTLRNSVLEHDGKASRNLFQYSSRDQVITVNVFGGIKAFIKLPQ 1269 L LV ++Q RI LR +++ K R+ F+Y +D+ I ++ GGI AF+K+ Sbjct: 224 SSLQWLVAKVQERIITTNLRKHIVK-SSKTIRHTFEYYDKDETIVAHITGGIDAFLKVSV 282 Query: 1270 XXXXXXXXXXXXXXKQVDESAGDISLNMLCRAMEILNELELSRRHRLLPFIDAVEEILLQ 1449 K D + ISL+++C+ E+ N L+L R L F+DA+E+IL+Q Sbjct: 283 GWPLLSTPLKLTSLKNSDNQSNGISLSLICKVEELANSLDLQTRQNLSGFMDAIEKILVQ 342 Query: 1450 QVKKSSKS 1473 Q ++ S Sbjct: 343 QTREELHS 350 >ref|XP_006297761.1| hypothetical protein CARUB_v10013795mg [Capsella rubella] gi|482566470|gb|EOA30659.1| hypothetical protein CARUB_v10013795mg [Capsella rubella] Length = 420 Score = 139 bits (349), Expect = 5e-30 Identities = 115/446 (25%), Positives = 206/446 (46%), Gaps = 4/446 (0%) Frame = +1 Query: 148 VEEMAHSNAGSKLDLGNRLRTKLVKLQDDLQGLEEEVTSVSNFDTFN--DSHLLRLQETL 321 +EE H + LDL ++R+++ +L+ + + E D+ N +L+ + + Sbjct: 1 MEEDTHDGS---LDL-QQIRSRVKELESIHRNCKYEPGESCTSDSENLVQDFVLQFETKV 56 Query: 322 ADMPLEYLKFSGSDDNVND--DFDMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKDTN 495 ++ +Y SD ++ D D D ++ ++ EL VE E AK EIE L+ + A+D++ Sbjct: 57 NEIVEDY-----SDVDILDVEDSDAYLEYLRKELHSVEAESAKVSEEIERLSRSHAEDSS 111 Query: 496 NLTEDIELLTTYIEFWQNDDNVGGGTDKNAQIKELENKIEQQKSEESKYDDISRFEICLL 675 L D+E L ++ + D KS+ES S E+C + Sbjct: 112 RLERDLEGLLLSLDSMSSQD--------------------VNKSKESP-PSCSSMEVCEV 150 Query: 676 SGENFMFEALELEDQLIRKRQKYGELRMLDDKSKRVDALSMIEESLSDIKDIGLLENCIK 855 + ++ F+ ELE+Q+ KR L LD KR DA +E++L+ +K + N I+ Sbjct: 151 NDDD-KFKMFELENQMEEKRMILKSLEDLDSLRKRFDAAEQVEDALTGLKVLEFDGNFIR 209 Query: 856 LTLKTFIPAKGGLICQFEINNIGESVMVEHVLKVKVEKDSTTFLDAELSXXXXXXXXXXH 1035 L L+T+IP GL Q + + + + H L + ++ +T E+ Sbjct: 210 LQLRTYIPELDGLPAQHKFEHTTKPSELIHELLIYLKDKTTEITKLEMFPNDVYIGDIIE 269 Query: 1036 MSKYSSDASDKNISLNVNGQLNSLVREIQYRIFCYTLRNSVLEHDGKASRNLFQYSSRDQ 1215 + + L+ + +V ++Q RI TLR ++ K R+ F+Y +D+ Sbjct: 270 AADSFRQVRLHSAVLDTRSSVQWVVAKVQDRIITTTLRKYIVT-SSKTMRHTFKYYDKDE 328 Query: 1216 VITVNVFGGIKAFIKLPQXXXXXXXXXXXXXXKQVDESAGDISLNMLCRAMEILNELELS 1395 I ++ GGI AF+K+ K D + ISL+++C+ E+ N L+L Sbjct: 329 TIVAHIAGGIDAFLKVSDGWPLLNSPLKLASLKNSDNQSKGISLSLICKVEELANSLDLQ 388 Query: 1396 RRHRLLPFIDAVEEILLQQVKKSSKS 1473 R L FIDA+E+IL+ Q ++ +S Sbjct: 389 TRQNLSGFIDAIEKILVHQTREELQS 414 >ref|NP_189033.1| uncharacterized protein [Arabidopsis thaliana] gi|1742965|emb|CAA70756.1| HAPp48,5 protein [Arabidopsis thaliana] gi|9294659|dbj|BAB03008.1| HAPp48,5 protein [Arabidopsis thaliana] gi|20259510|gb|AAM13875.1| putative HAPp48,5 protein [Arabidopsis thaliana] gi|21436469|gb|AAM51435.1| putative HAPp48,5 protein [Arabidopsis thaliana] gi|332643310|gb|AEE76831.1| uncharacterized protein AT3G23910 [Arabidopsis thaliana] Length = 421 Score = 137 bits (345), Expect = 1e-29 Identities = 98/366 (26%), Positives = 171/366 (46%) Frame = +1 Query: 376 DDFDMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKDTNNLTEDIELLTTYIEFWQNDD 555 +D D ++ ++NEL VE E AK EIE L+ + A+D++ L D+E L ++ + D Sbjct: 73 EDSDAYLEYLRNELQSVEAESAKVSEEIERLSQSHAQDSSRLQRDLEGLLLSLDSMSSQD 132 Query: 556 NVGGGTDKNAQIKELENKIEQQKSEESKYDDISRFEICLLSGENFMFEALELEDQLIRKR 735 +E E Q S S E+C + ++ F+ ELE+Q+ KR Sbjct: 133 --------------VEKSKENQPSSSS-------MEVCEVIDDD-KFKMFELENQMEEKR 170 Query: 736 QKYGELRMLDDKSKRVDALSMIEESLSDIKDIGLLENCIKLTLKTFIPAKGGLICQFEIN 915 L LD KR DA +E++L+ +K + N I+L L+T+I G + Q + + Sbjct: 171 MILKSLEDLDSLRKRFDAAEQVEDALTGLKVLEFDGNFIRLQLRTYIQKLDGFLGQHKFD 230 Query: 916 NIGESVMVEHVLKVKVEKDSTTFLDAELSXXXXXXXXXXHMSKYSSDASDKNISLNVNGQ 1095 +I E + H L + ++ +T E+ + + L+ Sbjct: 231 HITEPSELIHELLIYLKDKTTEITKFEMFPNDIYIGDIIEAADSFRQVRLHSAVLDTRSS 290 Query: 1096 LNSLVREIQYRIFCYTLRNSVLEHDGKASRNLFQYSSRDQVITVNVFGGIKAFIKLPQXX 1275 + +V ++Q +I TLR ++ K R F+Y +D+ I ++ GGI AF+K+ Sbjct: 291 VQWVVAKVQDKIISTTLRKYIVM-SSKTIRYTFEYYDKDETIVAHIAGGIDAFLKVSDGW 349 Query: 1276 XXXXXXXXXXXXKQVDESAGDISLNMLCRAMEILNELELSRRHRLLPFIDAVEEILLQQV 1455 K D + ISL+++C+ E+ N L+L R L F+DA+E+IL++Q Sbjct: 350 PLLNTPLKLASLKNSDNQSKGISLSLICKVEELANSLDLETRQNLSGFMDAIEKILVEQT 409 Query: 1456 KKSSKS 1473 ++ +S Sbjct: 410 REELQS 415 >ref|XP_002885604.1| hypothetical protein ARALYDRAFT_342541 [Arabidopsis lyrata subsp. lyrata] gi|297331444|gb|EFH61863.1| hypothetical protein ARALYDRAFT_342541 [Arabidopsis lyrata subsp. lyrata] Length = 421 Score = 137 bits (344), Expect = 2e-29 Identities = 114/442 (25%), Positives = 206/442 (46%), Gaps = 4/442 (0%) Frame = +1 Query: 148 VEEMAHSNAGSKLDLGNRLRTKLVKLQDDLQGLEEEV--TSVSNFDTFNDSHLLRLQETL 321 +EE H LDL +R+++ +L+ + +E + S+ +T +L+ + + Sbjct: 1 MEEETHDGP---LDL-QEIRSRVKELEFIHRNCRDEPGESCSSDSETLVQDFVLQFEPKV 56 Query: 322 ADMPLEYLKFSGSDDNVND--DFDMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKDTN 495 ++ +Y SD ++ D D D ++ ++ EL VE E AK EIE L+ + A+D++ Sbjct: 57 KEIVEDY-----SDVDLLDVEDSDAYLEYLRKELQSVEAESAKVSEEIERLSKSHAQDSS 111 Query: 496 NLTEDIELLTTYIEFWQNDDNVGGGTDKNAQIKELENKIEQQKSEESKYDDISRFEICLL 675 L D+E L ++ + D +E E Q S S E+C + Sbjct: 112 RLERDLEGLLLSLDSMSSQD--------------VEKSKENQPSSSS-------MEVCEV 150 Query: 676 SGENFMFEALELEDQLIRKRQKYGELRMLDDKSKRVDALSMIEESLSDIKDIGLLENCIK 855 + ++ F+ ELE+Q+ KR L LD KR DA +E++L+ +K + N I+ Sbjct: 151 NDDD-KFKMFELENQMEEKRSILKSLEDLDSLRKRFDAAEQVEDALTGLKVLEFDGNFIR 209 Query: 856 LTLKTFIPAKGGLICQFEINNIGESVMVEHVLKVKVEKDSTTFLDAELSXXXXXXXXXXH 1035 L L+T+IP L+ Q + + E + H L + ++ +T E+ Sbjct: 210 LQLQTYIPKLDSLLGQQKFEHTTEPSELIHELLIYLKDKTTEITKFEMFPNDVYIGDIIE 269 Query: 1036 MSKYSSDASDKNISLNVNGQLNSLVREIQYRIFCYTLRNSVLEHDGKASRNLFQYSSRDQ 1215 + S + L+ + +V ++Q RI TLR ++ K R+ F+Y +D+ Sbjct: 270 AADSFRQVSLHSAVLDTRSSVQWVVAKVQDRIISSTLRKYLVT-SSKTIRHTFEYYEKDE 328 Query: 1216 VITVNVFGGIKAFIKLPQXXXXXXXXXXXXXXKQVDESAGDISLNMLCRAMEILNELELS 1395 I ++ GGI AF+K+ K D + ISL+++C+ ++ N L+L Sbjct: 329 TIVGHIAGGIDAFLKVSNGWPLLNTPLKLESLKNSDNQSKGISLSLICKVEDLANSLDLQ 388 Query: 1396 RRHRLLPFIDAVEEILLQQVKK 1461 R L F+DA+E+IL+QQ ++ Sbjct: 389 TRQNLSGFMDAIEKILVQQTRE 410 >gb|EOY05198.1| Uncharacterized protein isoform 6 [Theobroma cacao] Length = 432 Score = 134 bits (337), Expect = 1e-28 Identities = 117/415 (28%), Positives = 190/415 (45%), Gaps = 6/415 (1%) Frame = +1 Query: 151 EEMAHSNAGSKLDLGNRLRTKLVKLQD----DLQGLEEEVTSVSNFDTFNDSHLLRLQET 318 E M S++ LDL + +R+++ +L + D E E S+++ D L + Sbjct: 3 EPMEISSSSEALDL-HSIRSRINELSEIHRIDKNKDEGEALSLNSEKLLKDCSL-HFESK 60 Query: 319 LADMPLEY--LKFSGSDDNVNDDFDMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKDT 492 + + EY + F G +D D + +K EL VE E AK NEIE+L+ +++ Sbjct: 61 VKQIIEEYSDVGFLGIED-----LDEYLAHLKEELNQVEAESAKISNEIEDLSRNHIEES 115 Query: 493 NNLTEDIELLTTYIEFWQNDDNVGGGTDKNAQIKELENKIEQQKSEESKYDDISRFEICL 672 N L ++E L ++ + G +E+ +S +D + + + Sbjct: 116 NILEGNLEGLKYALDSIASQGMEG---------------VEEDPCLDSSMNDEDQSNL-M 159 Query: 673 LSGENFMFEALELEDQLIRKRQKYGELRMLDDKSKRVDALSMIEESLSDIKDIGLLENCI 852 S E FE +ELE Q+ + L+ LD KR+D L IE++L+ +K IG NCI Sbjct: 160 HSNEEQKFEIMELESQIEKNNIILKSLQDLDSMFKRLDTLEQIEDALTGLKVIGFDGNCI 219 Query: 853 KLTLKTFIPAKGGLICQFEINNIGESVMVEHVLKVKVEKDSTTFLDAELSXXXXXXXXXX 1032 +L+L+T+IP GL+CQ I +I E + H L V++ + + E+ Sbjct: 220 RLSLQTYIPKLEGLLCQKTIEDISEPSEMNHELLVEIVDGTMEIKNVEMFPNDVYLGDII 279 Query: 1033 HMSKYSSDASDKNISLNVNGQLNSLVREIQYRIFCYTLRNSVLEHDGKASRNLFQYSSRD 1212 +K S L V ++Q RI TLR +++ K SR+ F+Y RD Sbjct: 280 DAAKSFRQLSSNLTVQQTQSSLEWFVGKVQDRIILSTLRRFIVKSTNK-SRHSFEYLERD 338 Query: 1213 QVITVNVFGGIKAFIKLPQXXXXXXXXXXXXXXKQVDESAGDISLNMLCRAMEIL 1377 + I ++ GGI AFIKL Q K D + ISL++LC+A E + Sbjct: 339 ETIVAHLVGGIDAFIKLSQGWPLSKSPLKLLSIKSSDHHSRGISLSLLCKAEEAI 393 >gb|EOY05197.1| Uncharacterized protein isoform 5 [Theobroma cacao] Length = 392 Score = 133 bits (334), Expect = 3e-28 Identities = 116/414 (28%), Positives = 189/414 (45%), Gaps = 6/414 (1%) Frame = +1 Query: 151 EEMAHSNAGSKLDLGNRLRTKLVKLQD----DLQGLEEEVTSVSNFDTFNDSHLLRLQET 318 E M S++ LDL + +R+++ +L + D E E S+++ D L + Sbjct: 3 EPMEISSSSEALDL-HSIRSRINELSEIHRIDKNKDEGEALSLNSEKLLKDCSL-HFESK 60 Query: 319 LADMPLEY--LKFSGSDDNVNDDFDMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKDT 492 + + EY + F G +D D + +K EL VE E AK NEIE+L+ +++ Sbjct: 61 VKQIIEEYSDVGFLGIED-----LDEYLAHLKEELNQVEAESAKISNEIEDLSRNHIEES 115 Query: 493 NNLTEDIELLTTYIEFWQNDDNVGGGTDKNAQIKELENKIEQQKSEESKYDDISRFEICL 672 N L ++E L ++ + G +E+ +S +D + + + Sbjct: 116 NILEGNLEGLKYALDSIASQGMEG---------------VEEDPCLDSSMNDEDQSNL-M 159 Query: 673 LSGENFMFEALELEDQLIRKRQKYGELRMLDDKSKRVDALSMIEESLSDIKDIGLLENCI 852 S E FE +ELE Q+ + L+ LD KR+D L IE++L+ +K IG NCI Sbjct: 160 HSNEEQKFEIMELESQIEKNNIILKSLQDLDSMFKRLDTLEQIEDALTGLKVIGFDGNCI 219 Query: 853 KLTLKTFIPAKGGLICQFEINNIGESVMVEHVLKVKVEKDSTTFLDAELSXXXXXXXXXX 1032 +L+L+T+IP GL+CQ I +I E + H L V++ + + E+ Sbjct: 220 RLSLQTYIPKLEGLLCQKTIEDISEPSEMNHELLVEIVDGTMEIKNVEMFPNDVYLGDII 279 Query: 1033 HMSKYSSDASDKNISLNVNGQLNSLVREIQYRIFCYTLRNSVLEHDGKASRNLFQYSSRD 1212 +K S L V ++Q RI TLR +++ K SR+ F+Y RD Sbjct: 280 DAAKSFRQLSSNLTVQQTQSSLEWFVGKVQDRIILSTLRRFIVKSTNK-SRHSFEYLERD 338 Query: 1213 QVITVNVFGGIKAFIKLPQXXXXXXXXXXXXXXKQVDESAGDISLNMLCRAMEI 1374 + I ++ GGI AFIKL Q K D + ISL++LC+A + Sbjct: 339 ETIVAHLVGGIDAFIKLSQGWPLSKSPLKLLSIKSSDHHSRGISLSLLCKAERV 392 >ref|NP_189068.2| RNA-directed DNA polymerase (reverse transcriptase)-related protein [Arabidopsis thaliana] gi|332643359|gb|AEE76880.1| RNA-directed DNA polymerase (reverse transcriptase)-related protein [Arabidopsis thaliana] Length = 746 Score = 132 bits (333), Expect = 4e-28 Identities = 101/372 (27%), Positives = 170/372 (45%) Frame = +1 Query: 358 SDDNVNDDFDMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKDTNNLTEDIELLTTYIE 537 SD N+ D + ++ ++NEL VE E AK EIE L+ + A D++ L D+E L ++ Sbjct: 395 SDGNLTDAY---LEYLRNELQSVEAESAKVSEEIERLSQSHALDSSRLQRDLEGLLLSLD 451 Query: 538 FWQNDDNVGGGTDKNAQIKELENKIEQQKSEESKYDDISRFEICLLSGENFMFEALELED 717 + D +E E Q S S E+C + ++ F+ ELE+ Sbjct: 452 SMSSQD--------------VEKSKENQPSSSS-------MEVCEVIDDD-KFKMFELEN 489 Query: 718 QLIRKRQKYGELRMLDDKSKRVDALSMIEESLSDIKDIGLLENCIKLTLKTFIPAKGGLI 897 Q+ KR L LD KR DA +E++L+ +K + N I+L L+T+I G + Sbjct: 490 QMEEKRMILKSLEDLDSLRKRFDAAEQVEDALTGLKVLEFDGNFIRLQLRTYIQKLDGFL 549 Query: 898 CQFEINNIGESVMVEHVLKVKVEKDSTTFLDAELSXXXXXXXXXXHMSKYSSDASDKNIS 1077 Q + ++I E + H L + ++ +T E+ + + Sbjct: 550 GQHKFDHITEPSELIHELLIYLKDKTTEITKFEMFPNDIYIGDIIEAADSFRQVRLHSAV 609 Query: 1078 LNVNGQLNSLVREIQYRIFCYTLRNSVLEHDGKASRNLFQYSSRDQVITVNVFGGIKAFI 1257 L+ + +V ++Q +I TLR + K R F+Y +D+ I ++ GGI AF+ Sbjct: 610 LDTRSSVQWVVAKVQDKIISTTLRKDFVM-SSKTIRYTFEYYDKDETIVAHIAGGIDAFL 668 Query: 1258 KLPQXXXXXXXXXXXXXXKQVDESAGDISLNMLCRAMEILNELELSRRHRLLPFIDAVEE 1437 K+ K D + SL+++ + E+ N L+L R L F+DAVE+ Sbjct: 669 KVSDGWPLLNTPLKLASLKNSDNQSKGFSLSLISKLEELANSLDLETRQNLSGFMDAVEK 728 Query: 1438 ILLQQVKKSSKS 1473 IL+QQ ++ KS Sbjct: 729 ILVQQTREELKS 740 >ref|NP_001154643.1| RNA-directed DNA polymerase (reverse transcriptase)-related protein [Arabidopsis thaliana] gi|332643360|gb|AEE76881.1| RNA-directed DNA polymerase (reverse transcriptase)-related protein [Arabidopsis thaliana] Length = 428 Score = 132 bits (332), Expect = 5e-28 Identities = 106/407 (26%), Positives = 182/407 (44%) Frame = +1 Query: 253 EVTSVSNFDTFNDSHLLRLQETLADMPLEYLKFSGSDDNVNDDFDMEIQSVKNELAIVER 432 E V +F + + + E D+ L + + D N+ D + ++ ++NEL VE Sbjct: 42 ETLVVQDFVLQFEPKVKEIVEDYGDVDLLDVDHTLVDGNLTDAY---LEYLRNELQSVEA 98 Query: 433 EMAKSVNEIENLAAAIAKDTNNLTEDIELLTTYIEFWQNDDNVGGGTDKNAQIKELENKI 612 E AK EIE L+ + A D++ L D+E L ++ + D +E Sbjct: 99 ESAKVSEEIERLSQSHALDSSRLQRDLEGLLLSLDSMSSQD--------------VEKSK 144 Query: 613 EQQKSEESKYDDISRFEICLLSGENFMFEALELEDQLIRKRQKYGELRMLDDKSKRVDAL 792 E Q S S E+C + ++ F+ ELE+Q+ KR L LD KR DA Sbjct: 145 ENQPSSSS-------MEVCEVIDDD-KFKMFELENQMEEKRMILKSLEDLDSLRKRFDAA 196 Query: 793 SMIEESLSDIKDIGLLENCIKLTLKTFIPAKGGLICQFEINNIGESVMVEHVLKVKVEKD 972 +E++L+ +K + N I+L L+T+I G + Q + ++I E + H L + ++ Sbjct: 197 EQVEDALTGLKVLEFDGNFIRLQLRTYIQKLDGFLGQHKFDHITEPSELIHELLIYLKDK 256 Query: 973 STTFLDAELSXXXXXXXXXXHMSKYSSDASDKNISLNVNGQLNSLVREIQYRIFCYTLRN 1152 +T E+ + + L+ + +V ++Q +I TLR Sbjct: 257 TTEITKFEMFPNDIYIGDIIEAADSFRQVRLHSAVLDTRSSVQWVVAKVQDKIISTTLRK 316 Query: 1153 SVLEHDGKASRNLFQYSSRDQVITVNVFGGIKAFIKLPQXXXXXXXXXXXXXXKQVDESA 1332 + K R F+Y +D+ I ++ GGI AF+K+ K D + Sbjct: 317 DFVM-SSKTIRYTFEYYDKDETIVAHIAGGIDAFLKVSDGWPLLNTPLKLASLKNSDNQS 375 Query: 1333 GDISLNMLCRAMEILNELELSRRHRLLPFIDAVEEILLQQVKKSSKS 1473 SL+++ + E+ N L+L R L F+DAVE+IL+QQ ++ KS Sbjct: 376 KGFSLSLISKLEELANSLDLETRQNLSGFMDAVEKILVQQTREELKS 422 >dbj|BAB02924.1| unnamed protein product [Arabidopsis thaliana] Length = 421 Score = 132 bits (331), Expect = 6e-28 Identities = 99/366 (27%), Positives = 166/366 (45%) Frame = +1 Query: 376 DDFDMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKDTNNLTEDIELLTTYIEFWQNDD 555 D D ++ ++NEL VE E AK EIE L+ + A D++ L D+E L ++ + D Sbjct: 73 DQTDAYLEYLRNELQSVEAESAKVSEEIERLSQSHALDSSRLQRDLEGLLLSLDSMSSQD 132 Query: 556 NVGGGTDKNAQIKELENKIEQQKSEESKYDDISRFEICLLSGENFMFEALELEDQLIRKR 735 +E E Q S S E+C + ++ F+ ELE+Q+ KR Sbjct: 133 --------------VEKSKENQPSSSS-------MEVCEVIDDD-KFKMFELENQMEEKR 170 Query: 736 QKYGELRMLDDKSKRVDALSMIEESLSDIKDIGLLENCIKLTLKTFIPAKGGLICQFEIN 915 L LD KR DA +E++L+ +K + N I+L L+T+I G + Q + + Sbjct: 171 MILKSLEDLDSLRKRFDAAEQVEDALTGLKVLEFDGNFIRLQLRTYIQKLDGFLGQHKFD 230 Query: 916 NIGESVMVEHVLKVKVEKDSTTFLDAELSXXXXXXXXXXHMSKYSSDASDKNISLNVNGQ 1095 +I E + H L + ++ +T E+ + + L+ Sbjct: 231 HITEPSELIHELLIYLKDKTTEITKFEMFPNDIYIGDIIEAADSFRQVRLHSAVLDTRSS 290 Query: 1096 LNSLVREIQYRIFCYTLRNSVLEHDGKASRNLFQYSSRDQVITVNVFGGIKAFIKLPQXX 1275 + +V ++Q +I TLR + K R F+Y +D+ I ++ GGI AF+K+ Sbjct: 291 VQWVVAKVQDKIISTTLRKDFVM-SSKTIRYTFEYYDKDETIVAHIAGGIDAFLKVSDGW 349 Query: 1276 XXXXXXXXXXXXKQVDESAGDISLNMLCRAMEILNELELSRRHRLLPFIDAVEEILLQQV 1455 K D + SL+++ + E+ N L+L R L F+DAVE+IL+QQ Sbjct: 350 PLLNTPLKLASLKNSDNQSKGFSLSLISKLEELANSLDLETRQNLSGFMDAVEKILVQQT 409 Query: 1456 KKSSKS 1473 ++ KS Sbjct: 410 REELKS 415 >ref|XP_004133985.1| PREDICTED: uncharacterized protein LOC101211137 [Cucumis sativus] gi|449527675|ref|XP_004170835.1| PREDICTED: uncharacterized protein LOC101229419 [Cucumis sativus] Length = 414 Score = 128 bits (322), Expect = 7e-27 Identities = 112/425 (26%), Positives = 191/425 (44%), Gaps = 1/425 (0%) Frame = +1 Query: 184 LDLGNRLRTKLVKLQDDLQGLEEEVTSVSNFDTFNDSHLLRLQETLADMPLEYLKFSGSD 363 LDL +R++L +LQ L+ EE T + L L+ + + EY S D Sbjct: 16 LDL-QAVRSELEELQRSLEENEESTTDSLGSEKLLRECALHLESRIQQVLSEY---SNVD 71 Query: 364 DNVN-DDFDMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKDTNNLTEDIELLTTYIEF 540 + DD D ++ +K EL VE E +K NEIE L +D+N L D+E+L ++ Sbjct: 72 SFLGIDDLDAYVEHMKEELVAVEAESSKISNEIEVLKRTNIEDSNKLKMDLEVLKLSLDR 131 Query: 541 WQNDDNVGGGTDKNAQIKELENKIEQQKSEESKYDDISRFEICLLSGENFMFEALELEDQ 720 + + D +E E+ ++R E FE LELE Q Sbjct: 132 FPSQDP-----------EEATFNCSSMNGEDPMNVIVNR--------ECNAFEVLELESQ 172 Query: 721 LIRKRQKYGELRMLDDKSKRVDALSMIEESLSDIKDIGLLENCIKLTLKTFIPAKGGLIC 900 + + ++ L+ +D+ K +D + +E ++ +K I + +N I+L+L T IP Sbjct: 173 IEKNKKILKSLQEVDEIFKSLDVIEQVEGTIGGMKVIDVADNSIRLSLHTHIPNVEDFST 232 Query: 901 QFEINNIGESVMVEHVLKVKVEKDSTTFLDAELSXXXXXXXXXXHMSKYSSDASDKNISL 1080 + + E ++H L ++V + +AE+ + SK S++S Sbjct: 233 LQRLEGLIEKSELDHELIIEVLDGTMELKNAEIFPADVHLHDIINASKSISNSS------ 286 Query: 1081 NVNGQLNSLVREIQYRIFCYTLRNSVLEHDGKASRNLFQYSSRDQVITVNVFGGIKAFIK 1260 L VR++Q RI TLR ++ K+ + F+Y +D++I ++ GGI A IK Sbjct: 287 -----LEWFVRKVQDRIVLCTLRRFAVKSANKSCHS-FEYLDQDEMIMCSMIGGIDACIK 340 Query: 1261 LPQXXXXXXXXXXXXXXKQVDESAGDISLNMLCRAMEILNELELSRRHRLLPFIDAVEEI 1440 + Q K D +SL+++C+ ++ N L+ R L F DAVE+I Sbjct: 341 VSQGWPLADSPLKLISLKSSDHYTKGVSLSLICKVEKMANSLDAHIRRNLSSFADAVEKI 400 Query: 1441 LLQQV 1455 L +Q+ Sbjct: 401 LKEQM 405 >gb|EMJ26895.1| hypothetical protein PRUPE_ppa006350mg [Prunus persica] Length = 416 Score = 127 bits (320), Expect = 1e-26 Identities = 108/426 (25%), Positives = 199/426 (46%), Gaps = 2/426 (0%) Frame = +1 Query: 184 LDLGNRLRTKLVKLQDDLQGLEEEVTSVSNFDTFNDSHLLRLQETLADMPLEYLKFSGSD 363 LDL N ++ ++ +L++ ++ ++ S + L+R L +E + SD Sbjct: 12 LDL-NTIQRQVRELEEIIESCRQD--DASELSPSDSDDLIRNCGLLLQSRVEQIVSECSD 68 Query: 364 DNVNDD--FDMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKDTNNLTEDIELLTTYIE 537 + +D F+ + + EL VE E K N IE+L +D N L D+ L ++ Sbjct: 69 VGLLEDQEFEAYVGRFEQELNSVEAESTKVSNGIEDLIRTHGEDFNRLGTDLAQLKCSLD 128 Query: 538 FWQNDDNVGGGTDKNAQIKELENKIEQQKSEESKYDDISRFEICLLSGENFMFEALELED 717 F + +K+ + +L ++ K + D ++ ++ + F E LELE+ Sbjct: 129 FVE---------EKDLEKAKLGADVDYHKCGKDLLDPMN------VNADKF--ELLELEN 171 Query: 718 QLIRKRQKYGELRMLDDKSKRVDALSMIEESLSDIKDIGLLENCIKLTLKTFIPAKGGLI 897 Q+ + L+ L+ K +D IE++++ +K I NC++L+L+T+IP L Sbjct: 172 QIEKNNIILKSLQDLECTLKWLDNTEQIEDAVTGLKVIAFEGNCVRLSLRTYIPKLEDLF 231 Query: 898 CQFEINNIGESVMVEHVLKVKVEKDSTTFLDAELSXXXXXXXXXXHMSKYSSDASDKNIS 1077 ++ + E V H L +++ + + + E+ Y +D D S Sbjct: 232 SPKKVGDATEPSEVNHELLIELLEGTMGLRNVEIFPNDV----------YINDILDAAKS 281 Query: 1078 LNVNGQLNSLVREIQYRIFCYTLRNSVLEHDGKASRNLFQYSSRDQVITVNVFGGIKAFI 1257 L L V ++Q RI T+R V++++ K SR+ +Y +D+ + +V GG+ AFI Sbjct: 282 LR-KSSLQWFVTKVQDRIVLCTMRRLVVKNENK-SRHSLEYLDKDETVVAHVVGGVDAFI 339 Query: 1258 KLPQXXXXXXXXXXXXXXKQVDESAGDISLNMLCRAMEILNELELSRRHRLLPFIDAVEE 1437 K+PQ K D+ + ISL+ LC E+ N L + R L F+DA+E+ Sbjct: 340 KVPQGWPLLSSPLKLIYLKSSDQHSKGISLSFLCTVQELANSLAVRIRQTLSSFVDAIEK 399 Query: 1438 ILLQQV 1455 IL++Q+ Sbjct: 400 ILVEQM 405 >ref|XP_002300157.1| hypothetical protein POPTR_0001s32530g [Populus trichocarpa] gi|222847415|gb|EEE84962.1| hypothetical protein POPTR_0001s32530g [Populus trichocarpa] Length = 429 Score = 125 bits (313), Expect = 7e-26 Identities = 106/436 (24%), Positives = 197/436 (45%), Gaps = 2/436 (0%) Frame = +1 Query: 154 EMAHSNAGSKLDLGNRLRTKLVKLQDDLQGLEEEVTSVSNFDTFNDSHLLR--LQETLAD 327 E++ S L+L N +R+++ +L++ + + S S ++ + L++ Q+ ++ Sbjct: 2 EISPSTTQESLNL-NTIRSRINELEEIYRDCNAD--SFSEINSSDSDELMKDSAQQLVSK 58 Query: 328 MPLEYLKFSGSDDNVNDDFDMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKDTNNLTE 507 + ++S +D D + +K EL E E AK NEIE L +D++ L Sbjct: 59 VSQTVTEYSDFSFLGIEDLDAYLAHLKEELDAAEAESAKISNEIELLNRTCMEDSSELEN 118 Query: 508 DIELLTTYIEFWQNDDNVGGGTDKNAQIKELENKIEQQKSEESKYDDISRFEICLLSGEN 687 D+E + ++ + ++ + ++ + ++E S E++ + I+ + E Sbjct: 119 DLEWMKCSLDLISSQ--------RDREKEKGDEQMEHFSSGENQSNLIN-------TNEE 163 Query: 688 FMFEALELEDQLIRKRQKYGELRMLDDKSKRVDALSMIEESLSDIKDIGLLENCIKLTLK 867 FE L+L++Q+ + ++ LD K DA+ IE+ LS +K I CI+L+L+ Sbjct: 164 NKFEILKLDNQIEESTRILKSMQDLDSVCKWYDAIEQIEDVLSGLKVIEFDGTCIRLSLR 223 Query: 868 TFIPAKGGLICQFEINNIGESVMVEHVLKVKVEKDSTTFLDAELSXXXXXXXXXXHMSKY 1047 T+IP + L Q +I + H ++V S E+ +K Sbjct: 224 TYIPKQDVLFLQ-KIEETNVPYEINHEFLIEVTNGSMEIKKVEMFPNDIYIGDIVDAAKS 282 Query: 1048 SSDASDKNISLNVNGQLNSLVREIQYRIFCYTLRNSVLEHDGKASRNLFQYSSRDQVITV 1227 + + L VR+ Q RI TLR V SR +Y RD++I Sbjct: 283 FRQMFLHLALMETSSSLEWFVRKAQDRIIQSTLRRLVAR-SASTSRQSIEYLDRDEIIVA 341 Query: 1228 NVFGGIKAFIKLPQXXXXXXXXXXXXXXKQVDESAGDISLNMLCRAMEILNELELSRRHR 1407 ++ GG+ AF+++ Q K + A +ISL LC+ E N L++ R Sbjct: 342 HMVGGVDAFMEVSQGWPITNSPLKLVSLKNSNHHAKEISLGFLCKVEEAANSLDVHTRQN 401 Query: 1408 LLPFIDAVEEILLQQV 1455 L F+D+VE+IL++Q+ Sbjct: 402 LSSFVDSVEKILVEQM 417 >gb|EOY05194.1| Uncharacterized protein isoform 2 [Theobroma cacao] gi|508713298|gb|EOY05195.1| Uncharacterized protein isoform 2 [Theobroma cacao] Length = 369 Score = 122 bits (307), Expect = 4e-25 Identities = 108/379 (28%), Positives = 176/379 (46%), Gaps = 6/379 (1%) Frame = +1 Query: 151 EEMAHSNAGSKLDLGNRLRTKLVKLQD----DLQGLEEEVTSVSNFDTFNDSHLLRLQET 318 E M S++ LDL + +R+++ +L + D E E S+++ D L + Sbjct: 3 EPMEISSSSEALDL-HSIRSRINELSEIHRIDKNKDEGEALSLNSEKLLKDCSL-HFESK 60 Query: 319 LADMPLEY--LKFSGSDDNVNDDFDMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKDT 492 + + EY + F G +D D + +K EL VE E AK NEIE+L+ +++ Sbjct: 61 VKQIIEEYSDVGFLGIED-----LDEYLAHLKEELNQVEAESAKISNEIEDLSRNHIEES 115 Query: 493 NNLTEDIELLTTYIEFWQNDDNVGGGTDKNAQIKELENKIEQQKSEESKYDDISRFEICL 672 N L ++E L ++ + G +E+ +S +D + + + Sbjct: 116 NILEGNLEGLKYALDSIASQGMEG---------------VEEDPCLDSSMNDEDQSNL-M 159 Query: 673 LSGENFMFEALELEDQLIRKRQKYGELRMLDDKSKRVDALSMIEESLSDIKDIGLLENCI 852 S E FE +ELE Q+ + L+ LD KR+D L IE++L+ +K IG NCI Sbjct: 160 HSNEEQKFEIMELESQIEKNNIILKSLQDLDSMFKRLDTLEQIEDALTGLKVIGFDGNCI 219 Query: 853 KLTLKTFIPAKGGLICQFEINNIGESVMVEHVLKVKVEKDSTTFLDAELSXXXXXXXXXX 1032 +L+L+T+IP GL+CQ I +I E + H L V++ + + E+ Sbjct: 220 RLSLQTYIPKLEGLLCQKTIEDISEPSEMNHELLVEIVDGTMEIKNVEMFPNDVYLGDII 279 Query: 1033 HMSKYSSDASDKNISLNVNGQLNSLVREIQYRIFCYTLRNSVLEHDGKASRNLFQYSSRD 1212 +K S L V ++Q RI TLR +++ K SR+ F+Y RD Sbjct: 280 DAAKSFRQLSSNLTVQQTQSSLEWFVGKVQDRIILSTLRRFIVKSTNK-SRHSFEYLERD 338 Query: 1213 QVITVNVFGGIKAFIKLPQ 1269 + I ++ GGI AFIKL Q Sbjct: 339 ETIVAHLVGGIDAFIKLSQ 357 >gb|EOY05199.1| Uncharacterized protein isoform 7, partial [Theobroma cacao] Length = 343 Score = 122 bits (305), Expect = 6e-25 Identities = 90/298 (30%), Positives = 142/298 (47%) Frame = +1 Query: 376 DDFDMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKDTNNLTEDIELLTTYIEFWQNDD 555 +D D + +K EL VE E AK NEIE+L+ +++N L ++E L ++ + Sbjct: 51 EDLDEYLAHLKEELNQVEAESAKISNEIEDLSRNHIEESNILEGNLEGLKYALDSIASQG 110 Query: 556 NVGGGTDKNAQIKELENKIEQQKSEESKYDDISRFEICLLSGENFMFEALELEDQLIRKR 735 G +E+ +S +D + + + S E FE +ELE Q+ + Sbjct: 111 MEG---------------VEEDPCLDSSMNDEDQSNL-MHSNEEQKFEIMELESQIEKNN 154 Query: 736 QKYGELRMLDDKSKRVDALSMIEESLSDIKDIGLLENCIKLTLKTFIPAKGGLICQFEIN 915 L+ LD KR+D L IE++L+ +K IG NCI+L+L+T+IP GL+CQ I Sbjct: 155 IILKSLQDLDSMFKRLDTLEQIEDALTGLKVIGFDGNCIRLSLQTYIPKLEGLLCQKTIE 214 Query: 916 NIGESVMVEHVLKVKVEKDSTTFLDAELSXXXXXXXXXXHMSKYSSDASDKNISLNVNGQ 1095 +I E + H L V++ + + E+ +K S Sbjct: 215 DISEPSEMNHELLVEIVDGTMEIKNVEMFPNDVYLGDIIDAAKSFRQLSSNLTVQQTQSS 274 Query: 1096 LNSLVREIQYRIFCYTLRNSVLEHDGKASRNLFQYSSRDQVITVNVFGGIKAFIKLPQ 1269 L V ++Q RI TLR +++ K SR+ F+Y RD+ I ++ GGI AFIKL Q Sbjct: 275 LEWFVGKVQDRIILSTLRRFIVKSTNK-SRHSFEYLERDETIVAHLVGGIDAFIKLSQ 331