BLASTX nr result
ID: Catharanthus22_contig00035392
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00035392 (686 letters)

Database: ./nr
37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_006353183.1| PREDICTED: uncharacterized protein LOC102580...   132   1e-28
ref|XP_002272480.2| PREDICTED: uncharacterized protein LOC100242...   115   2e-23
emb|CAN71438.1| hypothetical protein VITISV_011330 [Vitis vinifera]   114   3e-23
ref|XP_002530698.1| conserved hypothetical protein [Ricinus comm...   110   4e-22
ref|XP_004301356.1| PREDICTED: uncharacterized protein LOC101306...   104   3e-20
ref|XP_006466291.1| PREDICTED: uncharacterized protein LOC102607...   100   7e-19
ref|XP_006426288.1| hypothetical protein CICLE_v10024732mg [Citr...    99   1e-18
gb|EPS72022.1| hypothetical protein M569_02736, partial [Genlise...    91   3e-16
ref|XP_003544237.1| PREDICTED: uncharacterized protein LOC100779...    86   8e-15
ref|XP_003615261.1| hypothetical protein MTR_5g065900 [Medicago ...    85   2e-14
ref|XP_004250519.1| PREDICTED: uncharacterized protein LOC101261...    82   1e-13
gb|EXC16674.1| hypothetical protein L484_007720 [Morus notabilis]      80   6e-13
gb|EMJ28274.1| hypothetical protein PRUPE_ppa000370mg [Prunus pe...    79   2e-12
ref|XP_004490429.1| PREDICTED: uncharacterized protein LOC101498...    78   2e-12
gb|EOY15415.1| Uncharacterized protein isoform 3, partial [Theob...    77   7e-12
gb|EOY15414.1| Uncharacterized protein isoform 2 [Theobroma cacao]     77   7e-12
gb|EOY15413.1| Uncharacterized protein isoform 1 [Theobroma cacao]     77   7e-12
gb|EOX91966.1| Ribosomal protein L10 family protein isoform 2 [T...    77   7e-12
ref|XP_006575347.1| PREDICTED: uncharacterized protein LOC100813...
74 6e-11 gb|EOX91967.1| Uncharacterized protein isoform 3, partial [Theob... 74 6e-11 >ref|XP_006353183.1| PREDICTED: uncharacterized protein LOC102580091 [Solanum tuberosum] Length = 1175 Score = 132 bits (332), Expect = 1e-28 Identities = 94/261 (36%), Positives = 120/261 (45%), Gaps = 33/261 (12%) Frame = +1 Query: 1 IGLKNHGSGFGLPSATKFQSGHLPSGIIPLSRAIPVSG--GNDIGAGXXXXXXXXXXXXV 174 +G KN+G GFG+ + FQ+GHLPSG+IP SR IPVSG G D G G V Sbjct: 12 LGRKNNGLGFGVICGSNFQAGHLPSGVIPGSRTIPVSGSGGYDNGWGSDMDIGFDSDDEV 71 Query: 175 YGGRYSIETSPQDDKFSN--------------GSVARHAYQT---NRTSEVYYFNVHTQP 303 Y G +S+ETSPQDDKF N G+ Q N + VY NV Sbjct: 72 YDGHHSVETSPQDDKFPNVGTSKREDSFNKHIGNATNDELQQKMWNHSESVYPGNVVKSS 131 Query: 304 NVKVA--------------RQTSLLQDFHGARMQNQKAADDDIPSAPPVLSSVLGSNLAG 441 + VA + S + + Q K DIPSAPP+ S+ + Sbjct: 132 SNSVASSKTTTSLPFSIGNKSASSWESNVKSSRQRLKLFKSDIPSAPPLGGSLQECDQVA 191 Query: 442 EQLKNSGARNPACLGTSDGSANADMPSTWNGNTLGAGERKESGPSVRTAAVSSSSLPARI 621 Q K A S S D T+ T G+ + +S PS R V S+S A Sbjct: 192 VQRKTFVADEIPFPEISGCSVAMDEAKTYKTATAGSTKDGQSDPSGRAGGVPSNSSSALF 251 Query: 622 PTFHASGLGSWYGFISYEACV 684 PT+HASG GSW GF++YEAC+ Sbjct: 252 PTYHASGRGSWQGFVAYEACI 272 >ref|XP_002272480.2| PREDICTED: uncharacterized protein LOC100242393 [Vitis vinifera] Length = 1400 Score = 115 bits (287), Expect = 2e-23 Identities = 87/281 (30%), Positives = 128/281 (45%), Gaps = 55/281 (19%) Frame = +1 Query: 7 LKNHGSGFGLPSATKFQSGHLPSGIIPLSRAIPVSGGNDIGAGXXXXXXXXXXXXVYGGR 186 L+N G GFGLP + KF+SG++PSGIIP+S AIP SG +D G+G V+ G+ Sbjct: 222 LRNGGRGFGLPPSDKFRSGYMPSGIIPVSHAIPRSG-DDSGSGSDMDIGTDSEDDVHIGQ 280 Query: 187 YSIETSPQDDKF----------------------------SNGSVARHAYQTNRTSE--- 273 S+++SPQD++ SV RH + TS+ Sbjct: 281 DSLDSSPQDNRIPVSAGPKYPTPLQKHRCTEDVERMGDGGGGFSVGRHGCTEDGTSDSAA 340 Query: 274 ---------------VYYFNVHT-QPNVKVARQTSLL--------QDFHGARMQNQKAAD 381 + + ++T + NV + T + QD + MQ + + D Sbjct: 341 GSGVSSTQFRSLGGVMPHRAMNTSESNVSLRTDTEMAAEQLVEWPQDVYARGMQ-KLSGD 399 Query: 382 
DDIPSAPPVLSSVLGSNLAGEQLKNSGARNPACLGTSDGSANADMPSTWNGNTLGAGERK 561 DDIPSAPP + S L N +Q+ S + ++ + ++PS+ Sbjct: 400 DDIPSAPPFVGSSLEINQDRDQISGS------TVTINEPNTTKNIPSSTTAQENSGNRIP 453 Query: 562 ESGPSVRTAAVSSSSLPARIPTFHASGLGSWYGFISYEACV 684 + S+ SS SLPAR+PTFHASG G W ISY+ACV Sbjct: 454 DPSASIAETTASSGSLPARLPTFHASGQGPWCAVISYDACV 494 >emb|CAN71438.1| hypothetical protein VITISV_011330 [Vitis vinifera] Length = 1484 Score = 114 bits (285), Expect = 3e-23 Identities = 85/281 (30%), Positives = 125/281 (44%), Gaps = 55/281 (19%) Frame = +1 Query: 7 LKNHGSGFGLPSATKFQSGHLPSGIIPLSRAIPVSGGNDIGAGXXXXXXXXXXXXVYGGR 186 L+N G GFGLP + KF+SG++PSGIIP+S AIP SG +D G+G ++ G+ Sbjct: 683 LRNGGRGFGLPPSDKFRSGYMPSGIIPVSHAIPRSG-DDSGSGSDMDIGTDSEDDIHIGQ 741 Query: 187 YSIETSPQDDKF----------------------------SNGSVARHAYQTNRTSEV-- 276 S+++SPQD++ SV RH + TS+ Sbjct: 742 DSLDSSPQDNRIPVSAGPKYPTPLQKHRCTEDVERMGDGGGGFSVGRHGCTEDGTSDSAA 801 Query: 277 -----------------YYFNVHTQPNVKVARQTSLL--------QDFHGARMQNQKAAD 381 + ++ NV + T + QD + MQ + + D Sbjct: 802 GSGVSXTQFRSLGGVMPHRAMNXSESNVSLRTDTEMAAEQLVEWPQDVYARGMQ-KLSGD 860 Query: 382 DDIPSAPPVLSSVLGSNLAGEQLKNSGARNPACLGTSDGSANADMPSTWNGNTLGAGERK 561 DDIPSAPP + S L N +Q+ S + ++ + ++PS+ Sbjct: 861 DDIPSAPPFVGSSLEINQDRDQISXS------TVTINEPNTTKNIPSSTTAQENSGNRIP 914 Query: 562 ESGPSVRTAAVSSSSLPARIPTFHASGLGSWYGFISYEACV 684 + S+ SS SLPAR+PTFHASG G W ISY+ACV Sbjct: 915 DPSASIAETTASSGSLPARLPTFHASGQGPWCAVISYDACV 955 >ref|XP_002530698.1| conserved hypothetical protein [Ricinus communis] gi|223529754|gb|EEF31693.1| conserved hypothetical protein [Ricinus communis] Length = 1041 Score = 110 bits (275), Expect = 4e-22 Identities = 88/266 (33%), Positives = 119/266 (44%), Gaps = 44/266 (16%) Frame = +1 Query: 19 GSGFGLPSATKFQSGHLPSGIIPLSRAIPVSGGNDIGAGXXXXXXXXXXXXVYGGRYSIE 198 G GFGLPS KF+SGH+ IP+SRAIPV G + G+G VYG +YS++ Sbjct: 4 GGGFGLPSPAKFRSGHMAFDAIPVSRAIPVRGKSR-GSGSDMDTSSDSEDEVYGDQYSLD 62 Query: 199 TSPQDDKFSNGSVARHA--------------------------YQTNRTSEVYYFNVHTQ 300 +SPQDD SN +RH Q + + 
Y+ V+T Sbjct: 63 SSPQDDNISNIVASRHTSPMKRNGNYNVDELSDSCYSTKGSYMQQKSMNNSHYHSGVYTS 122 Query: 301 ----PNV--KVARQTSLLQDFHGARMQNQKAADDDIPSAPPVLSSVLGSNLAGE--QLKN 456 P+V + + QD+ M+ +K D+PSAPP+ S ++ ++ Sbjct: 123 NSYSPSVTSQAKPDVTAKQDYSETTMKIRKFVYKDMPSAPPISSGPEIEHMTENISTFED 182 Query: 457 SGARNPACLGT------SDGSANADMPST----WNGNTLGAGERKESGPSVRTAAVSSSS 606 +G A L S S + ST T ER +G V V SSS Sbjct: 183 NGIPRLANLNNLPATYESKSSNHVHFSSTILDGTRNGTPNPAERIAAGKEVN---VPSSS 239 Query: 607 LPARIPTFHASGLGSWYGFISYEACV 684 LPAR+PTFHAS G W ISY+ACV Sbjct: 240 LPARLPTFHASAQGPWCAVISYDACV 265 >ref|XP_004301356.1| PREDICTED: uncharacterized protein LOC101306532 [Fragaria vesca subsp. vesca] Length = 1240 Score = 104 bits (259), Expect = 3e-20 Identities = 89/277 (32%), Positives = 128/277 (46%), Gaps = 55/277 (19%) Frame = +1 Query: 19 GSGFGLPSATKFQSGHLPSGIIPLSRAIPVSGGNDIGAGXXXXXXXXXXXXVYGGRYSIE 198 G GFGLP A+KF+SGHLPS IP+SRAIP G++ G+ VYGGRYS++ Sbjct: 63 GRGFGLPPASKFRSGHLPSNAIPVSRAIP-GDGDESGSASDNDRTTDSEDGVYGGRYSLD 121 Query: 199 TSPQDDKFSNGSVARHAY------QTNRTSEVYYFNVHTQPNVKVARQTSLLQDF-HGAR 357 +SPQD++ + + A H Y Q +S+ Y +V + + V R + + G+ Sbjct: 122 SSPQDERVPSAASA-HRYGKPSNGQPRYSSDYMYSDVSSSMDTVVGRHKPVAERLARGSE 180 Query: 358 M----QNQKAADDDIPSA------------PPVLSSVLGSNLAGEQLKNSGARNPACLGT 489 QN A D+ SA + S+V + NS ++ LG+ Sbjct: 181 RYPVGQNGYAEDESSDSAGSSEFSTSQAGGGSINSAVPHGRAYASEGYNSSVQSKRNLGS 240 Query: 490 SDG-------------SANADMPST-----------WNGNTLGAGERKESGPS-----VR 582 +D S + D+PS N + R + PS VR Sbjct: 241 TDEKGLRSRILQSEKLSDDDDVPSAPPFCGAAQEIKQNQQSPARIHRTQHTPSSSDQFVR 300 Query: 583 TAAVS---SSSLPARIPTFHASGLGSWYGFISYEACV 684 TA S +SS PA +PTF+AS LG W+G I+Y+ACV Sbjct: 301 TANTSEAAASSCPAPVPTFYASALGPWHGVIAYDACV 337 >ref|XP_006466291.1| PREDICTED: uncharacterized protein LOC102607095 [Citrus sinensis] Length = 1221 Score = 99.8 bits (247), Expect = 7e-19 Identities = 91/285 (31%), Positives = 119/285 (41%), Gaps = 59/285 (20%) Frame = +1 Query: 7 LKNHGSGFGLPSATKFQSGHLPSGIIPLSRAIPVSGGNDIGAGXXXXXXXXXXXXVYGGR 186 L+N G GL 
A+KF+SGH SG++P+S+ + V ND G+G VY G+ Sbjct: 40 LRNGGRDIGLAQASKFRSGHSSSGVVPVSQTVHVRE-NDSGSGSDMDISPDSDDEVYRGK 98 Query: 187 YSIETSPQDDKFSN-------------GSVARHAYQTNRTSEVYYFNVHTQPNVKVARQT 327 YS+++ QD K N G V H+ + E +++ V+ R Sbjct: 99 YSVKSPRQDHKIGNDAATKPGHKQADYGKVGNHSISSLSRKEAMQRQMNSAVRVERGRGG 158 Query: 328 SLL------------------------------QDFHGARMQN-----------QKAADD 384 LL G + + A Sbjct: 159 ILLGKPGTAEEELPYSATRTEVVFAHSGSNNGCDSLRGTYTSDSYSCVTSGSNLETTAKQ 218 Query: 385 DIPSAPPVLSSVLGSNLAGEQLKNSGARNPACLGTSDGSANADMPSTWNGNTLGAGERKE 564 DIPSAPP +SS GS A EQ+ + A T A +PST G G K Sbjct: 219 DIPSAPPFVSS--GS--AMEQVVGQSSAFSAT--TYVPKATGSIPSTAPGK--GCTGYKV 270 Query: 565 SGPSVRTAA-----VSSSSLPARIPTFHASGLGSWYGFISYEACV 684 S S RTAA S+SSLPAR+PTFHASGLG W ISY+ACV Sbjct: 271 SDVSNRTAAGIQRDTSASSLPARLPTFHASGLGPWCAVISYDACV 315 >ref|XP_006426288.1| hypothetical protein CICLE_v10024732mg [Citrus clementina] gi|557528278|gb|ESR39528.1| hypothetical protein CICLE_v10024732mg [Citrus clementina] Length = 1221 Score = 99.0 bits (245), Expect = 1e-18 Identities = 90/285 (31%), Positives = 120/285 (42%), Gaps = 59/285 (20%) Frame = +1 Query: 7 LKNHGSGFGLPSATKFQSGHLPSGIIPLSRAIPVSGGNDIGAGXXXXXXXXXXXXVYGGR 186 L+N G GL A+KF+SGH SG++P+S+ + V ND G+G VY G+ Sbjct: 40 LRNGGRDIGLAQASKFRSGHSSSGVVPVSQTVHVRE-NDSGSGSDMDISPDSDDQVYRGK 98 Query: 187 YSIETSPQDDKFSN-------------GSVARHAYQTNRTSEVYYFNVHTQPNVK----- 312 YS+++ QD K N G V H+ + E +++ V+ Sbjct: 99 YSVKSPRQDHKIGNDAATKPGHKQADYGKVGNHSISSLSRKEAMQRQMNSAVRVERGGGG 158 Query: 313 ---------------VARQTSLLQDFHGAR---------------------MQNQKAADD 384 A T ++ G+ +K A Sbjct: 159 ILLGKPGTAEEELPYSATSTEVVFAHSGSNNGCDSLRGTYTSDSYSCVTSGSNLEKTAKQ 218 Query: 385 DIPSAPPVLSSVLGSNLAGEQLKNSGARNPACLGTSDGSANADMPSTWNGNTLGAGERKE 564 DIPSAPP +SS GS A EQ+ + A T +PST G G K Sbjct: 219 DIPSAPPFVSS--GS--AMEQVVGQSSAFSAT--TYVPKTTGSIPSTAPGK--GCTGYKV 270 Query: 565 SGPSVRTAA-----VSSSSLPARIPTFHASGLGSWYGFISYEACV 684 S S RTAA S+SSLPAR+PTFHASGLG W ISY+ACV Sbjct: 271 
SDVSNRTAAGIQSDTSASSLPARLPTFHASGLGPWCAVISYDACV 315 >gb|EPS72022.1| hypothetical protein M569_02736, partial [Genlisea aurea] Length = 700 Score = 90.9 bits (224), Expect = 3e-16 Identities = 81/261 (31%), Positives = 114/261 (43%), Gaps = 34/261 (13%) Frame = +1 Query: 4 GLKNHGSGFGLPSATKFQSGHLPSGIIPLSRAIPVSGGNDIGAGXXXXXXXXXXXXVYGG 183 GL+ G GLPS ++F+SG+LPSG + + R G+D+ YG Sbjct: 10 GLRYRGGSSGLPSVSRFRSGYLPSG-MNVGRVTDNLSGSDMDT------CSDSEGECYGA 62 Query: 184 RYSIETSPQDDKFSNGSVARHAYQTNRTSE---------------------------VYY 282 RYS E+SPQDDK NG+ R A+ R S+ V Sbjct: 63 RYSPESSPQDDKIQNGA-RRAAFLNARISDSGDLGSYLERQGARARGYSNDYESSESVSS 121 Query: 283 FNVHTQPNVKVARQT-----SLLQDFHGARMQNQKAADDDIPSAPPVLSSVLGS-NLAGE 444 + + P +T L A ++ D+DIPSAPP+ + L + A E Sbjct: 122 SEISSAPAKPTGTETVSGNKVFLSTDDSANPVSRNKFDEDIPSAPPLAAGSLHHVHQASE 181 Query: 445 QLKNSGARNPACLGTSDGSANADMPSTWNGNTLGAGERKESGPSVRTAAVSSSSLPA-RI 621 + + A + G+S SA +T G+ + TAA S S+ PA R Sbjct: 182 TRQQARADSKFSSGSSKVSAVEPDLNTQKNKIRGSTD-------FNTAADSISAAPAPRY 234 Query: 622 PTFHASGLGSWYGFISYEACV 684 PTFHASGLG W+ +SY+ACV Sbjct: 235 PTFHASGLGYWHAVLSYDACV 255 >ref|XP_003544237.1| PREDICTED: uncharacterized protein LOC100779084 isoform X1 [Glycine max] gi|571511098|ref|XP_006596368.1| PREDICTED: uncharacterized protein LOC100779084 isoform X2 [Glycine max] Length = 1233 Score = 86.3 bits (212), Expect = 8e-15 Identities = 81/290 (27%), Positives = 115/290 (39%), Gaps = 68/290 (23%) Frame = +1 Query: 19 GSGFGLPSATKFQSGHLPSGIIPLSRAIPVSGGNDIGAGXXXXXXXXXXXXVYGGRYSIE 198 G GFGLP +KF+SGHLP+ IP+S + + D G+ VYGGRYS++ Sbjct: 41 GRGFGLPPPSKFRSGHLPANAIPVS-TVMLGETGDSGSNSDNDDSIESEEEVYGGRYSLD 99 Query: 199 TSPQDDKFSNGSVARHAYQT--NRTSEVYYFNVHTQPNVKVARQTSLLQD-FHGARMQNQ 369 +SPQD + NG+ R+ T S+ Y V + V R ++ GA Q Sbjct: 100 SSPQDRRVPNGAARRYGNLTGPRYASDYTYSEVSSSRETLVGRPGTVRDPLMRGATNVRQ 159 Query: 370 KAADDDIPSAPPVLSSVLGSNLAGEQLKNSGARNPACLGTSDGSANA------------- 510 +D S+ SS + G + + R L S+G A++ Sbjct: 160 SGFTED-DSSDSAASSEFSTTQVGGSINGALPRGRTYL--SEGYASSVPSRMNVKSAAEK 
216 Query: 511 ----------DMPST--WNGNTLGAGERKESGPSVRTAA----VSSSSL----------- 609 D+PS + G+T + E P+ R A SSSL Sbjct: 217 NGRISDDEEDDIPSAPPFAGSTQEIRQTHEEIPASRVDATPNKAESSSLKSMSGDKIENH 276 Query: 610 -------------------------PARIPTFHASGLGSWYGFISYEACV 684 P R+PTFHAS LG W+G I+Y+ACV Sbjct: 277 VENGSPDQFARTATGSEAATSSNSHPPRLPTFHASALGPWHGVIAYDACV 326 >ref|XP_003615261.1| hypothetical protein MTR_5g065900 [Medicago truncatula] gi|355516596|gb|AES98219.1| hypothetical protein MTR_5g065900 [Medicago truncatula] Length = 1237 Score = 84.7 bits (208), Expect = 2e-14 Identities = 84/297 (28%), Positives = 124/297 (41%), Gaps = 71/297 (23%) Frame = +1 Query: 4 GLKNHGS-GFGLPSATKFQSGHLPSGIIPLS--RAIPVSGGNDIGAGXXXXXXXXXXXXV 174 G+K+ G GFGLP +KF+SGHLP+ +P+S +D+ A V Sbjct: 34 GMKSGGGRGFGLPPPSKFRSGHLPANKLPVSAVETFDSRSNSDMDAS------VDSEEEV 87 Query: 175 YGGRYSIETSPQDDKFSNGSVARHAYQTNRTSEVYYFNVHTQPNVKVARQT-----SLLQ 339 YGGRYS+++SPQD + NG+ R+ + Y + +T +V +R+T + + Sbjct: 88 YGGRYSLDSSPQDSRVPNGAAKRYG-NVAQMPRSRYASDYTFSDVSSSRETLTGRQGMAR 146 Query: 340 D--FHGARMQNQKAADDD--------------------------------------IPSA 399 D GA Q +D +PS Sbjct: 147 DPVMRGAANGRQNGFTEDESSDSAASSEFSTTQVGSSINGTLPKRRAYMSAGYASSVPSR 206 Query: 400 PPVLSSVLGS-NLAGEQLKNSGARNPACLGTSD---------GSANADMPSTWNGNTLGA 549 V SS S L+ ++ ++ + P C T + SA P+ +TL + Sbjct: 207 MNVQSSAEKSGRLSDDEDEDFPSAPPFCGSTQEIRQTNEEIPTSAARSTPNKAESSTLKS 266 Query: 550 GERKE--------SGPSVRTA-----AVSSSSLPARIPTFHASGLGSWYGFISYEAC 681 R + S VRTA A SS+S P R+PTFHAS LG WY I+Y+AC Sbjct: 267 VSRDKLENHGDASSEKFVRTATGSEGAASSNSQPPRLPTFHASALGPWYAVIAYDAC 323 >ref|XP_004250519.1| PREDICTED: uncharacterized protein LOC101261773 [Solanum lycopersicum] Length = 206 Score = 82.4 bits (202), Expect = 1e-13 Identities = 43/89 (48%), Positives = 54/89 (60%), Gaps = 2/89 (2%) Frame = +1 Query: 1 IGLKNHGSGFGLPSATKFQSGHLPSGIIPLSRAIPV--SGGNDIGAGXXXXXXXXXXXXV 174 +G KN+G GFG+ + FQ+GHLPSG+IP SR IPV SGG D G G V Sbjct: 32 LGRKNNGLGFGVICGSNFQAGHLPSGVIPGSRTIPVSGSGGYDNGWGSDMDIGFDSDDEV 91 Query: 175 
YGGRYSIETSPQDDKFSNGSVARHAYQTN 261 Y G +S+ETSPQDDKF N ++ + N Sbjct: 92 YDGYHSVETSPQDDKFPNVGTSKRKHSFN 120 >gb|EXC16674.1| hypothetical protein L484_007720 [Morus notabilis] Length = 1222 Score = 80.1 bits (196), Expect = 6e-13 Identities = 52/154 (33%), Positives = 77/154 (50%), Gaps = 2/154 (1%) Frame = +1 Query: 229 GSVARHAYQTNRTSEVYYFNVHTQPNVKVARQTSLLQDFHGARMQNQKAAD-DDIPSAPP 405 GS+ A + NR SE Y ++ + NV+ A + L H ++QN K +D DD+PSAPP Sbjct: 187 GSINGGAARRNRFSEGYASSIPSTINVESAAEKGL----HSRKLQNGKFSDEDDVPSAPP 242 Query: 406 VLSSVLGSNLAGEQLKNSGARN-PACLGTSDGSANADMPSTWNGNTLGAGERKESGPSVR 582 S +A E S + P + D+P GN G+ ++ S Sbjct: 243 FGGSTQEIKVASESSPASKVQGTPKTTDLPEAKNTTDIPEAKGGN----GKSEQFARSTN 298 Query: 583 TAAVSSSSLPARIPTFHASGLGSWYGFISYEACV 684 + + SS AR+PTFHAS LG W+ ++Y+ACV Sbjct: 299 GSEAAPSSGAARVPTFHASALGPWHAIVAYDACV 332 Score = 73.2 bits (178), Expect = 7e-11 Identities = 58/187 (31%), Positives = 88/187 (47%), Gaps = 12/187 (6%) Frame = +1 Query: 7 LKNHGSGFGLPSATKFQSGHLPSGIIPLSRAIPVSGGNDIGAGXXXXXXXXXXXXVYGGR 186 L+ G GFGLP KF+SGHLP+ IP+SR IP +D +G VYGGR Sbjct: 36 LRGGGRGFGLPPPAKFRSGHLPATAIPVSRTIP---RDDSASGSENDMSTDSEEDVYGGR 92 Query: 187 YSIETSPQDDKFSNGSVARHAYQTNRTSEVYYFNVHTQPNVKVARQT--SLLQDFHGARM 360 YS+++SPQ NG+ R+ + R S+ +Y + +T +V + +T L + A+ Sbjct: 93 YSLDSSPQR---PNGTAYRYGNPSKRDSQSHYSSDYTYSDVGSSMETVAGLTKHLMAAQR 149 Query: 361 QNQKAADDDIP----------SAPPVLSSVLGSNLAGEQLKNSGARNPACLGTSDGSANA 510 + +A + P S SS + G + AR S+G A++ Sbjct: 150 RAAEAGNGRYPVAQNGFTEDESYDSAASSEFSTTQVGGSINGGAARRNR---FSEGYASS 206 Query: 511 DMPSTWN 531 +PST N Sbjct: 207 -IPSTIN 212 >gb|EMJ28274.1| hypothetical protein PRUPE_ppa000370mg [Prunus persica] Length = 1235 Score = 78.6 bits (192), Expect = 2e-12 Identities = 56/187 (29%), Positives = 92/187 (49%), Gaps = 19/187 (10%) Frame = +1 Query: 181 GRYSIETSPQDDKFSNGSVARHAYQTNRT---------------SEVYYFNVHTQPNVKV 315 G+Y + + + S+ S A Y T++ SE Y +V +Q N+ Sbjct: 157 GKYPVARNGYTEDESSDSAASSEYSTSQAGGSINSGVPRNRAYVSEGYASSVPSQRNL-- 214 Query: 316 
ARQTSLLQDFHGARMQNQKAADDDIPSAPPVLSSVLGSNLAGEQLKNSGARNPACLG-TS 492 ++S ++F+ Q++K +DDD+PSAPP A +++K +P+ + T Sbjct: 215 --ESSAKKNFNSTNQQSEKLSDDDVPSAPPFCG-------ATQEIKQDDEISPSRVHRTP 265 Query: 493 DGSANADMPSTWNGNTLGAGERKESGPSVRTAAVSSS---SLPARIPTFHASGLGSWYGF 663 +A+++ +T G E G VRT S + S PAR+PTF+AS LGSW+ Sbjct: 266 HATASSEFKTTPGRKQEGNIENGNLGQFVRTTTSSEAAVPSCPARLPTFYASALGSWHAV 325 Query: 664 ISYEACV 684 I+Y+ACV Sbjct: 326 IAYDACV 332 Score = 70.9 bits (172), Expect = 4e-10 Identities = 58/208 (27%), Positives = 101/208 (48%), Gaps = 6/208 (2%) Frame = +1 Query: 19 GSGFGLPSATKFQSGHLPSGIIPLSRAIPVSGGNDIGAGXXXXXXXXXXXXVYGGRYSIE 198 G GFGLP +KF+SGHLPS IP+ R IP + G++ G+ +YGGRYS++ Sbjct: 42 GRGFGLPPPSKFRSGHLPSNAIPV-RTIP-ADGDESGSASDNDRTTDSEDGIYGGRYSLD 99 Query: 199 TSPQDDKFSNGSVARHAY----QTNRTSEVYYFNVHTQPNVKVARQTSLLQDFHGARMQN 366 +SPQDD+ + S R+ Q + S+ Y +V + + V R + + Sbjct: 100 SSPQDDRVPSASAHRYGKPSQGQPHYGSDCTYSDVSSSMDTVVGRHKPAAEKLVRGTGKY 159 Query: 367 QKAAD--DDIPSAPPVLSSVLGSNLAGEQLKNSGARNPACLGTSDGSANADMPSTWNGNT 540 A + + S+ SS ++ AG + + RN A + S+G A++ +PS N Sbjct: 160 PVARNGYTEDESSDSAASSEYSTSQAGGSINSGVPRNRAYV--SEGYASS-VPS--QRNL 214 Query: 541 LGAGERKESGPSVRTAAVSSSSLPARIP 624 + ++ + + ++ +S +P+ P Sbjct: 215 ESSAKKNFNSTNQQSEKLSDDDVPSAPP 242 >ref|XP_004490429.1| PREDICTED: uncharacterized protein LOC101498131 [Cicer arietinum] Length = 1233 Score = 78.2 bits (191), Expect = 2e-12 Identities = 42/109 (38%), Positives = 60/109 (55%), Gaps = 1/109 (0%) Frame = +1 Query: 4 GLKN-HGSGFGLPSATKFQSGHLPSGIIPLSRAIPVSGGNDIGAGXXXXXXXXXXXXVYG 180 G+K+ G GFGLP KF+SGHLP+ P+S IP + D G+ VYG Sbjct: 35 GMKSGSGRGFGLPPPAKFRSGHLPANAFPVSTVIPPAETGDSGSNTDMDVSVESEEEVYG 94 Query: 181 GRYSIETSPQDDKFSNGSVARHAYQTNRTSEVYYFNVHTQPNVKVARQT 327 GRYS+++SPQD + NG+ R+ T R Y + +T +V +R+T Sbjct: 95 GRYSLDSSPQDSRIPNGAAGRYENHTQRRPR--YASDYTFSDVSSSRET 141 Score = 58.2 bits (139), Expect = 2e-06 Identities = 32/101 (31%), Positives = 46/101 (45%) Frame = +1 Query: 379 DDDIPSAPPVLSSVLGSNLAGEQLKNSGARNPACLGTSDGSANADMPSTWNGNTLGAGER 
558 D+D+PSAPP S E++ S A + S + N + E+ Sbjct: 227 DEDVPSAPPFCGSTPEIRQTTEEIPTSRAHSTQNKAESSTVKSVSKDIKLENNGCASSEQ 286 Query: 559 KESGPSVRTAAVSSSSLPARIPTFHASGLGSWYGFISYEAC 681 + A SS+ P R+PTFHAS LG W+ I+Y+AC Sbjct: 287 FVRTATGSEGAASSNPQPPRLPTFHASALGPWHAVIAYDAC 327 >gb|EOY15415.1| Uncharacterized protein isoform 3, partial [Theobroma cacao] Length = 1110 Score = 76.6 bits (187), Expect = 7e-12 Identities = 53/157 (33%), Positives = 76/157 (48%), Gaps = 4/157 (2%) Frame = +1 Query: 226 NGSVARHAYQTNRTSEVYYFNVHTQPNVKVARQTSLLQDFHGARMQNQKAADDDIPSAPP 405 NG + R SE Y +V ++ NV+ A +D + ++Q++K +DDDIPSAPP Sbjct: 193 NGRIPR---SRTYVSEGYASSVPSRVNVESAAG----KDLNSRKLQHEKFSDDDIPSAPP 245 Query: 406 VLSSVLGSNLAGEQLK----NSGARNPACLGTSDGSANADMPSTWNGNTLGAGERKESGP 573 SV E + +S R L + + + N + + E SG Sbjct: 246 FSGSVQEVKQDAEHIAASEIHSTPRAADSLDPKKFKSISGVKPEQNMSNRKSDEFVRSGA 305 Query: 574 SVRTAAVSSSSLPARIPTFHASGLGSWYGFISYEACV 684 TA SS PAR+PTFHAS LG W+ I+Y+ACV Sbjct: 306 GAETATASSGVHPARVPTFHASALGPWHAVIAYDACV 342 Score = 70.1 bits (170), Expect = 6e-10 Identities = 56/215 (26%), Positives = 87/215 (40%), Gaps = 3/215 (1%) Frame = +1 Query: 7 LKNHGSGFGLPSATKFQSGHLPSGIIPLSRAIPVSGGNDIGAGXXXXXXXXXXXXVYGGR 186 + N G GLP KF+SGHLP IP++ G + A VYGGR Sbjct: 36 ISNGGRNIGLPPPAKFRSGHLPVTAIPVTSTSLTGGDDSASASENDVTTDSEDDTVYGGR 95 Query: 187 YSIETSPQDDKFSNGSVARHAYQTNRTSEVYYFNVHTQPNVKVARQTSLLQDFHGARMQN 366 YS+++SPQD++ NG+ R+ R + +T +V +R+T Sbjct: 96 YSLDSSPQDERIPNGTALRYGNPVQRRPRYATASDYTYSDVSSSRET------------- 142 Query: 367 QKAADDDIPSAPPVLSSVLGSNLAGEQLKNSGARNPACL-GTSDGSANADMPSTWNGNTL 543 L +G NL G++L R P G ++ ++D + +T Sbjct: 143 --------------LMGGIGGNL-GDRLGRGNGRYPVGRDGFTEEDESSDSAGSSEFSTT 187 Query: 544 GAGERKESGPSVRTAAVS--SSSLPARIPTFHASG 642 G P RT +SS+P+R+ A+G Sbjct: 188 QVGSINGRIPRSRTYVSEGYASSVPSRVNVESAAG 222 >gb|EOY15414.1| Uncharacterized protein isoform 2 [Theobroma cacao] Length = 1118 Score = 76.6 bits (187), Expect = 7e-12 Identities = 53/157 (33%), Positives = 76/157 (48%), Gaps = 4/157 (2%) Frame = +1 Query: 
226 NGSVARHAYQTNRTSEVYYFNVHTQPNVKVARQTSLLQDFHGARMQNQKAADDDIPSAPP 405 NG + R SE Y +V ++ NV+ A +D + ++Q++K +DDDIPSAPP Sbjct: 193 NGRIPR---SRTYVSEGYASSVPSRVNVESAAG----KDLNSRKLQHEKFSDDDIPSAPP 245 Query: 406 VLSSVLGSNLAGEQLK----NSGARNPACLGTSDGSANADMPSTWNGNTLGAGERKESGP 573 SV E + +S R L + + + N + + E SG Sbjct: 246 FSGSVQEVKQDAEHIAASEIHSTPRAADSLDPKKFKSISGVKPEQNMSNRKSDEFVRSGA 305 Query: 574 SVRTAAVSSSSLPARIPTFHASGLGSWYGFISYEACV 684 TA SS PAR+PTFHAS LG W+ I+Y+ACV Sbjct: 306 GAETATASSGVHPARVPTFHASALGPWHAVIAYDACV 342 Score = 70.1 bits (170), Expect = 6e-10 Identities = 56/215 (26%), Positives = 87/215 (40%), Gaps = 3/215 (1%) Frame = +1 Query: 7 LKNHGSGFGLPSATKFQSGHLPSGIIPLSRAIPVSGGNDIGAGXXXXXXXXXXXXVYGGR 186 + N G GLP KF+SGHLP IP++ G + A VYGGR Sbjct: 36 ISNGGRNIGLPPPAKFRSGHLPVTAIPVTSTSLTGGDDSASASENDVTTDSEDDTVYGGR 95 Query: 187 YSIETSPQDDKFSNGSVARHAYQTNRTSEVYYFNVHTQPNVKVARQTSLLQDFHGARMQN 366 YS+++SPQD++ NG+ R+ R + +T +V +R+T Sbjct: 96 YSLDSSPQDERIPNGTALRYGNPVQRRPRYATASDYTYSDVSSSRET------------- 142 Query: 367 QKAADDDIPSAPPVLSSVLGSNLAGEQLKNSGARNPACL-GTSDGSANADMPSTWNGNTL 543 L +G NL G++L R P G ++ ++D + +T Sbjct: 143 --------------LMGGIGGNL-GDRLGRGNGRYPVGRDGFTEEDESSDSAGSSEFSTT 187 Query: 544 GAGERKESGPSVRTAAVS--SSSLPARIPTFHASG 642 G P RT +SS+P+R+ A+G Sbjct: 188 QVGSINGRIPRSRTYVSEGYASSVPSRVNVESAAG 222 >gb|EOY15413.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 1249 Score = 76.6 bits (187), Expect = 7e-12 Identities = 53/157 (33%), Positives = 76/157 (48%), Gaps = 4/157 (2%) Frame = +1 Query: 226 NGSVARHAYQTNRTSEVYYFNVHTQPNVKVARQTSLLQDFHGARMQNQKAADDDIPSAPP 405 NG + R SE Y +V ++ NV+ A +D + ++Q++K +DDDIPSAPP Sbjct: 193 NGRIPR---SRTYVSEGYASSVPSRVNVESAAG----KDLNSRKLQHEKFSDDDIPSAPP 245 Query: 406 VLSSVLGSNLAGEQLK----NSGARNPACLGTSDGSANADMPSTWNGNTLGAGERKESGP 573 SV E + +S R L + + + N + + E SG Sbjct: 246 FSGSVQEVKQDAEHIAASEIHSTPRAADSLDPKKFKSISGVKPEQNMSNRKSDEFVRSGA 305 Query: 574 SVRTAAVSSSSLPARIPTFHASGLGSWYGFISYEACV 684 TA SS PAR+PTFHAS LG W+ I+Y+ACV Sbjct: 306 
GAETATASSGVHPARVPTFHASALGPWHAVIAYDACV 342 Score = 70.1 bits (170), Expect = 6e-10 Identities = 56/215 (26%), Positives = 87/215 (40%), Gaps = 3/215 (1%) Frame = +1 Query: 7 LKNHGSGFGLPSATKFQSGHLPSGIIPLSRAIPVSGGNDIGAGXXXXXXXXXXXXVYGGR 186 + N G GLP KF+SGHLP IP++ G + A VYGGR Sbjct: 36 ISNGGRNIGLPPPAKFRSGHLPVTAIPVTSTSLTGGDDSASASENDVTTDSEDDTVYGGR 95 Query: 187 YSIETSPQDDKFSNGSVARHAYQTNRTSEVYYFNVHTQPNVKVARQTSLLQDFHGARMQN 366 YS+++SPQD++ NG+ R+ R + +T +V +R+T Sbjct: 96 YSLDSSPQDERIPNGTALRYGNPVQRRPRYATASDYTYSDVSSSRET------------- 142 Query: 367 QKAADDDIPSAPPVLSSVLGSNLAGEQLKNSGARNPACL-GTSDGSANADMPSTWNGNTL 543 L +G NL G++L R P G ++ ++D + +T Sbjct: 143 --------------LMGGIGGNL-GDRLGRGNGRYPVGRDGFTEEDESSDSAGSSEFSTT 187 Query: 544 GAGERKESGPSVRTAAVS--SSSLPARIPTFHASG 642 G P RT +SS+P+R+ A+G Sbjct: 188 QVGSINGRIPRSRTYVSEGYASSVPSRVNVESAAG 222 >gb|EOX91966.1| Ribosomal protein L10 family protein isoform 2 [Theobroma cacao] Length = 1151 Score = 76.6 bits (187), Expect = 7e-12 Identities = 67/234 (28%), Positives = 99/234 (42%), Gaps = 8/234 (3%) Frame = +1 Query: 7 LKNHGSGFGLPSATKFQSGHLPSGIIPLSRAIPVSGGNDIGAGXXXXXXXXXXXXVYGGR 186 L+N G GLP A KF +GH+ SG+IP+S I VS GND G+G Y + Sbjct: 42 LRNAGWHSGLPPA-KFHNGHISSGVIPVSGGISVS-GNDGGSGSDMDTSSDSDECPYDRQ 99 Query: 187 YSIETSPQDDKFSNGSVARHAYQTNRTSEVYYFNVHTQPNVKVARQTSLLQDFHGARMQN 366 YS +SPQDDK + A A + + + + R + + + Sbjct: 100 YSFISSPQDDKVPTVAAATRAASSQKLEACGSSKIELKLGNSAQRPARVCGGNPFGKPDS 159 Query: 367 QKAADDDIPSAPPVLSSVLGSNLAGEQLKNSGARNPACLGTSDGSANADMPSTWNGNTLG 546 Q+ + +S + ++ Q ++S P + S ++ + S Sbjct: 160 QE---------EQLSNSASSTEVSFMQYRSSDGVAPHREAYNTESYSSTVTSRVRNEITS 210 Query: 547 AGERKES---GPSVRTA-----AVSSSSLPARIPTFHASGLGSWYGFISYEACV 684 + PSVRTA S+SSL P FHASGLG W +SY+ACV Sbjct: 211 KQDNTRDEILNPSVRTADSGGVDESASSLTTHHPIFHASGLGPWCAVLSYDACV 264 >ref|XP_006575347.1| PREDICTED: uncharacterized protein LOC100813198 isoform X1 [Glycine max] gi|571441127|ref|XP_006575348.1| PREDICTED: uncharacterized protein LOC100813198 isoform X2 [Glycine max] 
gi|571441129|ref|XP_006575349.1| PREDICTED: uncharacterized protein LOC100813198 isoform X3 [Glycine max] Length = 1234 Score = 73.6 bits (179), Expect = 6e-11 Identities = 80/291 (27%), Positives = 115/291 (39%), Gaps = 69/291 (23%) Frame = +1 Query: 19 GSGFGLPSATKFQSGHLPSGIIPLSRAIPVSGGNDIGAGXXXXXXXXXXXXVYGGRYSIE 198 G GFGLP KF+SGHLP+ IP+S +P G D G+ VYGGRYS++ Sbjct: 41 GRGFGLPPPAKFRSGHLPANAIPVSTVMPGETG-DSGSNSDNDDSIESEEEVYGGRYSLD 99 Query: 199 TSPQDDKF-SNGSVARHAYQTN-RTSEVYYFNVHTQPNVKVARQTSLLQD--FHGARMQN 366 +SPQD + NG+ R+ T R + Y ++ + + + ++D GA Sbjct: 100 SSPQDRRVPPNGAARRYGNLTRPRYASDYTYSEVSSSRETLVGKPGTVRDPLMRGAANVR 159 Query: 367 QKAADDDIPSAPPVLSSVLGSNLAGEQLKNSGARNPACLGTSDGSANA------------ 510 Q +D S+ SS + G + + R L S+G A++ Sbjct: 160 QSGFTED-DSSDSAASSEFSTTQVGGSINGALPRGRTYL--SEGYASSVPSRMNVKSTAE 216 Query: 511 -----------DMPST--WNGNTLGAGERKESGPSVRTAA----VSSSSLP--------- 612 D+PS + G+T + E + R A SSSL Sbjct: 217 KNGRISDDEDDDIPSAPPFVGSTQEIRQTHEETAASRVHATPNKAESSSLKSMSGDKIEN 276 Query: 613 ----------ARIPT-----------------FHASGLGSWYGFISYEACV 684 ARI T FHAS LG W+G I+Y+ACV Sbjct: 277 HVENGSPDQFARIATGSEAATSSNSHPPRLPTFHASALGPWHGVIAYDACV 327 >gb|EOX91967.1| Uncharacterized protein isoform 3, partial [Theobroma cacao] Length = 886 Score = 73.6 bits (179), Expect = 6e-11 Identities = 73/251 (29%), Positives = 107/251 (42%), Gaps = 25/251 (9%) Frame = +1 Query: 7 LKNHGSGFGLPSATKFQSGHLPSGIIPLSRAIPVSGGNDIGAGXXXXXXXXXXXXVYGGR 186 L+N G GLP A KF +GH+ SG+IP+S I VS GND G+G Y + Sbjct: 42 LRNAGWHSGLPPA-KFHNGHISSGVIPVSGGISVS-GNDGGSGSDMDTSSDSDECPYDRQ 99 Query: 187 YSIETSPQDDKFSNGSVARHAYQT-------------------NRTSEVYYFNVHTQPNV 309 YS +SPQDDK + A A + R + V N +P+ Sbjct: 100 YSFISSPQDDKVPTVAAATRAASSQKLEACGSSKIELKLGNSAQRPARVCGGNPFGKPDS 159 Query: 310 KVARQTSLLQDFHGARMQNQKAADDDIPSAPPVLSSVLGSNLAGEQLKNSGARNPACLGT 489 + + ++ + MQ + + D + ++ S+ +++N Sbjct: 160 QEEQLSNSASSTEVSFMQYR--SSDGVAPHREAYNTESYSSTVTSRVRNEITSKQV---F 214 Query: 490 SDGSANADMPSTWNGNTLGAGERKE-SGPSVRTA-----AVSSSSLPARIPTFHASGLGS 651 
+G PS ++ + R E PSVRTA S+SSL P FHASGLG Sbjct: 215 HNGRMQKKKPS-YDDTIVQDNTRDEILNPSVRTADSGGVDESASSLTTHHPIFHASGLGP 273 Query: 652 WYGFISYEACV 684 W +SY+ACV Sbjct: 274 WCAVLSYDACV 284