BLASTX nr result
ID: Akebia23_contig00030988
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia23_contig00030988 (918 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_002273695.1| PREDICTED: uncharacterized protein LOC100252...   206   8e-51
ref|XP_007026443.1| Family of Uncharacterized protein function, ...   195   2e-47
ref|XP_002525247.1| conserved hypothetical protein [Ricinus comm...   193   9e-47
ref|XP_006467255.1| PREDICTED: protein ENDOSPERM DEFECTIVE 1-lik...   182   2e-43
gb|EXB64638.1| hypothetical protein L484_017970 [Morus notabilis]     181   3e-43
ref|XP_003546223.2| PREDICTED: protein ENDOSPERM DEFECTIVE 1-lik...   181   3e-43
ref|XP_006449954.1| hypothetical protein CICLE_v10014837mg [Citr...   181   4e-43
ref|XP_006586809.1| PREDICTED: protein ENDOSPERM DEFECTIVE 1-lik...   173   7e-41
ref|XP_004486407.1| PREDICTED: uncharacterized protein LOC101508...   173   7e-41
ref|XP_007159284.1| hypothetical protein PHAVU_002G225000g [Phas...   172   1e-40
ref|XP_006373542.1| hypothetical protein POPTR_0016s00230g [Popu...   171   4e-40
gb|EYU33814.1| hypothetical protein MIMGU_mgv1a004982mg [Mimulus...   170   6e-40
ref|XP_004295340.1| PREDICTED: uncharacterized protein LOC101295...   170   6e-40
ref|XP_006857539.1| hypothetical protein AMTR_s00061p00037320 [A...   168   2e-39
ref|XP_007213944.1| hypothetical protein PRUPE_ppa003523mg [Prun...   165   2e-38
ref|XP_004231230.1| PREDICTED: uncharacterized protein LOC101250...   164   5e-38
ref|XP_006347698.1| PREDICTED: protein ENDOSPERM DEFECTIVE 1-lik...   163   8e-38
ref|XP_007026444.1| Family of Uncharacterized protein function, ...   157   7e-36
ref|XP_007026447.1| Family of Uncharacterized protein function, ...
154 4e-35 ref|XP_003594402.1| hypothetical protein MTR_2g028220 [Medicago ... 154 4e-35 >ref|XP_002273695.1| PREDICTED: uncharacterized protein LOC100252686 [Vitis vinifera] Length = 614 Score = 206 bits (525), Expect = 8e-51 Identities = 135/310 (43%), Positives = 175/310 (56%), Gaps = 8/310 (2%) Frame = -2 Query: 908 IICDSPLKSNCKMRTLPDFRSSMPEVDL----SSRLLQAPAPHRGKALLGGEYSKVTTSP 741 I SP++ NCK R+ + RSSMPE D+ S+RLL RG + G E SK + SP Sbjct: 282 ISSSSPVQ-NCKTRSHNELRSSMPEADMLPTMSTRLLAERNCGRGN-VNGAESSKFSASP 339 Query: 740 CHRSL----TEHXXXXXXXXXXXXXXXSALSKHHHHHPSTTFKSLPPHPSNTKSGGTEAA 573 RSL + S LSK H + LPP P TK G A Sbjct: 340 FSRSLNLTPSSSDQSLFHSIKTSEKLASVLSKPHTNSMKNGSIYLPPLPPCTKPGTD--A 397 Query: 572 RKGRKISCHQEDVHALRLLQNRYLQWRYANAKAEVAMNSQLITAQVSFHALVAKISELSD 393 RKGRK+S HQEDVH+L+LL N YLQWR+ANAKAE M +Q + + ++L KIS+L + Sbjct: 398 RKGRKVSGHQEDVHSLKLLHNHYLQWRFANAKAEATMQAQRRETEKALYSLGVKISDLYE 457 Query: 392 SVXXXXXXXXXXXXXXXXXTIIESQMPYLDQWXXXXXXXXXXXXGAIKALQDVSIRLPII 213 V TI+E+QMP+LD+W AI+AL + S++LPI Sbjct: 458 LVKGKRIELGLLQRTNILVTILEAQMPFLDEWSILEGDYSVSLSEAIQALVNASLQLPIN 517 Query: 212 GNVRTDVREVGGALNSASNVMEIMSSHVENFLPKAGEMDTLISELAGVVSRERALIEECG 33 GN+R D+REV AL SA+ +MEI+ S+V+ F+PKA EMD L+SELA V ER L EECG Sbjct: 518 GNIRADIREVNEALYSATKMMEIIISNVQMFMPKAEEMDNLVSELARVTGGERTLSEECG 577 Query: 32 DLLTKTRMLQ 3 LL++T LQ Sbjct: 578 YLLSRTHALQ 587 >ref|XP_007026443.1| Family of Uncharacterized protein function, putative isoform 1 [Theobroma cacao] gi|508781809|gb|EOY29065.1| Family of Uncharacterized protein function, putative isoform 1 [Theobroma cacao] Length = 511 Score = 195 bits (495), Expect = 2e-47 Identities = 126/298 (42%), Positives = 166/298 (55%), Gaps = 13/298 (4%) Frame = -2 Query: 869 RTLPDFRSSMPEVDL----SSRLLQAPAPHRGKALLGGEYSKVTTSPCHRSLTEHXXXXX 702 R+L +F SSMPE DL S+RLL + + SK+ SP RSL Sbjct: 205 RSLVNFCSSMPEADLLPSVSTRLLTDRNVNNVV-----DSSKLPASPLSRSLNSPLSICE 259 Query: 701 XXXXXXXXXXSALSKHHHHHPSTTFK---------SLPPHPSNTKSGGTEAARKGRKISC 549 HHP+ K SLPP PS+TK+G T+A R+ +KIS 
Sbjct: 260 PSLF--------------HHPNPPIKGVSTRMGPLSLPPVPSHTKAG-TDAIRRPKKISS 304 Query: 548 HQEDVHALRLLQNRYLQWRYANAKAEVAMNSQLITAQVSFHALVAKISELSDSVXXXXXX 369 HQED+H+L+LL N YLQWRYANAKAE +M Q + + ++L KI+EL+D V Sbjct: 305 HQEDLHSLKLLHNYYLQWRYANAKAEASMQIQKGETERTLYSLEVKIAELNDCVRRKRIE 364 Query: 368 XXXXXXXXXXXTIIESQMPYLDQWXXXXXXXXXXXXGAIKALQDVSIRLPIIGNVRTDVR 189 I+E+QMPYL++W AI++L + S RLPI GNV+ D R Sbjct: 365 LELLQRMKTLSKILEAQMPYLEEWSAFQGDYLNSLAEAIQSLLNTSHRLPISGNVKADTR 424 Query: 188 EVGGALNSASNVMEIMSSHVENFLPKAGEMDTLISELAGVVSRERALIEECGDLLTKT 15 +VG A+ SA ME++ HV++F+PKA EM+ LISELA V ERALI+ECGDLL+KT Sbjct: 425 KVGEAMKSAIKWMEMILCHVQSFMPKAEEMERLISELARVAVGERALIDECGDLLSKT 482 >ref|XP_002525247.1| conserved hypothetical protein [Ricinus communis] gi|223535544|gb|EEF37213.1| conserved hypothetical protein [Ricinus communis] Length = 504 Score = 193 bits (490), Expect = 9e-47 Identities = 127/297 (42%), Positives = 163/297 (54%), Gaps = 5/297 (1%) Frame = -2 Query: 890 LKSNCKMRTLPDFRSSMPEVDLSSRLLQAPAPHRGKALLGGEYSKVTTSPCHRSLTEHXX 711 L + ++ LPD RSSMPE SSRLL R + + SK + SPC RSL Sbjct: 187 LARSSSIQILPDIRSSMPEA--SSRLLI----DRNISNRKNDSSKFSASPCSRSLD---- 236 Query: 710 XXXXXXXXXXXXXSALSKHHHHHPSTTFKSLPPHPSNTKSGG-----TEAARKGRKISCH 546 LS H +FK++ P+ K G +RKGRK+ H Sbjct: 237 -------------FPLSDTCEHSLIHSFKTVDK-PAIKKIAGLPLPPAHVSRKGRKVPSH 282 Query: 545 QEDVHALRLLQNRYLQWRYANAKAEVAMNSQLITAQVSFHALVAKISELSDSVXXXXXXX 366 QEDV +LRLL N YLQWRYANAKAEV + +Q + + ++L KISEL DSV Sbjct: 283 QEDVQSLRLLHNHYLQWRYANAKAEVCIKAQRRETERTLYSLGLKISELYDSVKQKRIEH 342 Query: 365 XXXXXXXXXXTIIESQMPYLDQWXXXXXXXXXXXXGAIKALQDVSIRLPIIGNVRTDVRE 186 TI+E+QMPYL++W AI+A +VS+RLPI GN R+DVRE Sbjct: 343 SLLQRIKALSTILEAQMPYLEEWSTLEEDYSVSLTEAIQAFMNVSLRLPISGNFRSDVRE 402 Query: 185 VGGALNSASNVMEIMSSHVENFLPKAGEMDTLISELAGVVSRERALIEECGDLLTKT 15 +G ALNSA+ +ME + H++ +PKA EM+ ISELA V+ ERALIEECGD L T Sbjct: 403 LGEALNSATKLMESIVIHIQGLMPKAEEMEYFISELARVIGGERALIEECGDQLFMT 459 >ref|XP_006467255.1| PREDICTED: protein ENDOSPERM DEFECTIVE 1-like [Citrus 
sinensis] Length = 536 Score = 182 bits (462), Expect = 2e-43 Identities = 111/259 (42%), Positives = 148/259 (57%), Gaps = 9/259 (3%) Frame = -2 Query: 752 TTSPCHRSL---------TEHXXXXXXXXXXXXXXXSALSKHHHHHPSTTFKSLPPHPSN 600 T +PC RSL +H ALS+ + LPP P Sbjct: 247 TATPCSRSLHLQPQLSNSQQHNGGFFSSIKGGEKPTPALSRSFSNSAKIGGLPLPPIPPP 306 Query: 599 TKSGGTEAARKGRKISCHQEDVHALRLLQNRYLQWRYANAKAEVAMNSQLITAQVSFHAL 420 T GT++ RKGRK+S HQED+H+L+LL N YLQWR+ANAKA+ + +Q + S ++L Sbjct: 307 T---GTDS-RKGRKVSSHQEDLHSLKLLHNHYLQWRFANAKADSSTLTQRKETEKSLYSL 362 Query: 419 VAKISELSDSVXXXXXXXXXXXXXXXXXTIIESQMPYLDQWXXXXXXXXXXXXGAIKALQ 240 K+SEL SV TI+E+QMPYLD+W AI+AL Sbjct: 363 GVKMSELYASVKRKRIELEILKRIKTLSTILEAQMPYLDEWCAFEGDYSSSLSEAIQALL 422 Query: 239 DVSIRLPIIGNVRTDVREVGGALNSASNVMEIMSSHVENFLPKAGEMDTLISELAGVVSR 60 SI+LPI GNV+ DVREVG ALNSA+ +ME++ H+++F+P+A E++ LISELA V Sbjct: 423 GASIQLPIGGNVKADVREVGEALNSAAKLMEMIILHLQSFMPEAEEVEMLISELAKVTGG 482 Query: 59 ERALIEECGDLLTKTRMLQ 3 ERAL+EECG LL+KT Q Sbjct: 483 ERALVEECGGLLSKTHTFQ 501 >gb|EXB64638.1| hypothetical protein L484_017970 [Morus notabilis] Length = 570 Score = 181 bits (460), Expect = 3e-43 Identities = 112/295 (37%), Positives = 155/295 (52%), Gaps = 6/295 (2%) Frame = -2 Query: 881 NCKMRTLPDFRSSMPEVDLSSRLLQAPAPH------RGKALLGGEYSKVTTSPCHRSLTE 720 +C ++LPD RS MPE D+ + P G G + K++T P R L Sbjct: 251 SCSTQSLPDLRSPMPETDMLPTVSGRPRNSIRGGGGHGSTTAGSDSLKLSTFPSSRYLNS 310 Query: 719 HXXXXXXXXXXXXXXXSALSKHHHHHPSTTFKSLPPHPSNTKSGGTEAARKGRKISCHQE 540 +A+ K LPP P + RK +K+S E Sbjct: 311 ---PFNSATDGGEKPANAVFKSCVSLAKMGGLCLPPVPPCVNAKPGTEMRKAKKVSRQPE 367 Query: 539 DVHALRLLQNRYLQWRYANAKAEVAMNSQLITAQVSFHALVAKISELSDSVXXXXXXXXX 360 D+H+L+LL NRYLQWR+ANA+AE ++ +Q AQ ++L ISEL DSV Sbjct: 368 DIHSLKLLHNRYLQWRFANARAEASVQAQQREAQTKVNSLRVSISELYDSVTRKRIELGI 427 Query: 359 XXXXXXXXTIIESQMPYLDQWXXXXXXXXXXXXGAIKALQDVSIRLPIIGNVRTDVREVG 180 I+E+Q+P+LDQW +I+AL + SI+LP+ GNVR D++E+ Sbjct: 428 LRRTKAVSAILEAQVPHLDQWSTLEGDYSISLAESIQALLNASIQLPMGGNVRVDIKELQ 487 Query: 179 
GALNSASNVMEIMSSHVENFLPKAGEMDTLISELAGVVSRERALIEECGDLLTKT 15 ALNSA+ VME SHV++ +PKA E + L+SELA +V ERAL+EECGDLLT T Sbjct: 488 EALNSATKVMETTVSHVKHLMPKAEETEILMSELAKIVGGERALVEECGDLLTTT 542 >ref|XP_003546223.2| PREDICTED: protein ENDOSPERM DEFECTIVE 1-like [Glycine max] Length = 494 Score = 181 bits (460), Expect = 3e-43 Identities = 119/292 (40%), Positives = 158/292 (54%), Gaps = 3/292 (1%) Frame = -2 Query: 881 NCKMRTLPDFRSSMPEVDLSSRLLQAPAPHRGKALLGGEYSKVTTSPCHRSLTEHXXXXX 702 NC +++LP+ E+ L S L G GG+ SP RS+T Sbjct: 185 NCSIQSLPELGER--EMLLQSNLSAGEKIGSGNGG-GGDLKFRHPSPLSRSVT------L 235 Query: 701 XXXXXXXXXXSALSKHH---HHHPSTTFKSLPPHPSNTKSGGTEAARKGRKISCHQEDVH 531 +++SK H + + SLPP P + RKG+K S HQEDVH Sbjct: 236 PSSGGENKPPASVSKQHGSGNQLAKSGGLSLPPVPPQCGKPAVDV-RKGKKGSSHQEDVH 294 Query: 530 ALRLLQNRYLQWRYANAKAEVAMNSQLITAQVSFHALVAKISELSDSVXXXXXXXXXXXX 351 +LRLL NRYLQWR+ANAKA M +Q +Q + ++ +ISE+ DSV Sbjct: 295 SLRLLYNRYLQWRFANAKAHSVMKAQQTESQKALYSQAMRISEMRDSVNKKRIELELLRR 354 Query: 350 XXXXXTIIESQMPYLDQWXXXXXXXXXXXXGAIKALQDVSIRLPIIGNVRTDVREVGGAL 171 TI+E+Q+PYLD+W AI+AL + S RLP+ GNVR DVR++G AL Sbjct: 355 SKTLSTILEAQIPYLDEWSTMMEEYSVSITEAIQALVNASERLPVGGNVRVDVRQLGEAL 414 Query: 170 NSASNVMEIMSSHVENFLPKAGEMDTLISELAGVVSRERALIEECGDLLTKT 15 NSAS +ME M S+++ F+PKA E D ISELA V ERAL+ ECGDLL+KT Sbjct: 415 NSASKMMETMISNIQRFMPKAEETDVSISELARVAGGERALVGECGDLLSKT 466 >ref|XP_006449954.1| hypothetical protein CICLE_v10014837mg [Citrus clementina] gi|557552565|gb|ESR63194.1| hypothetical protein CICLE_v10014837mg [Citrus clementina] Length = 539 Score = 181 bits (459), Expect = 4e-43 Identities = 111/259 (42%), Positives = 147/259 (56%), Gaps = 9/259 (3%) Frame = -2 Query: 752 TTSPCHRSL---------TEHXXXXXXXXXXXXXXXSALSKHHHHHPSTTFKSLPPHPSN 600 T +PC RSL +H ALS+ + LPP P Sbjct: 247 TATPCSRSLHLQPQLSNSQQHNGGFFSSIKGGEKPTPALSRSFSNSAKIGGLPLPPIPPP 306 Query: 599 TKSGGTEAARKGRKISCHQEDVHALRLLQNRYLQWRYANAKAEVAMNSQLITAQVSFHAL 420 T GT++ RKGRK+S HQED+H+L+LL N YLQWR+ANAKA+ + +Q + S ++L Sbjct: 307 
T---GTDS-RKGRKVSSHQEDLHSLKLLHNHYLQWRFANAKADSSSLTQREETEKSLYSL 362 Query: 419 VAKISELSDSVXXXXXXXXXXXXXXXXXTIIESQMPYLDQWXXXXXXXXXXXXGAIKALQ 240 KISEL SV TI+E+QMPYLD+W AI+AL Sbjct: 363 GVKISELYASVKRKRIELEILKRIKTLSTILEAQMPYLDEWCAFEGDYSSSLSEAIQALL 422 Query: 239 DVSIRLPIIGNVRTDVREVGGALNSASNVMEIMSSHVENFLPKAGEMDTLISELAGVVSR 60 SI+LPI GNV+ DVREVG ALNSA+ +ME++ H+++F+P+A E++ LISELA V Sbjct: 423 GASIQLPIGGNVKADVREVGEALNSAAKLMEMIILHLQSFMPEAEEVEMLISELAKVTGG 482 Query: 59 ERALIEECGDLLTKTRMLQ 3 ERAL+EECG LL+K Q Sbjct: 483 ERALVEECGGLLSKRHTFQ 501 >ref|XP_006586809.1| PREDICTED: protein ENDOSPERM DEFECTIVE 1-like [Glycine max] Length = 495 Score = 173 bits (439), Expect = 7e-41 Identities = 97/202 (48%), Positives = 123/202 (60%) Frame = -2 Query: 623 SLPPHPSNTKSGGTEAARKGRKISCHQEDVHALRLLQNRYLQWRYANAKAEVAMNSQLIT 444 SLPP P + RKG+K S QEDVH+LRLL NRYLQWR+ANAKA M +Q Sbjct: 261 SLPPVPPQCGRPAVDV-RKGKKGSSQQEDVHSLRLLYNRYLQWRFANAKAHSTMKAQQTE 319 Query: 443 AQVSFHALVAKISELSDSVXXXXXXXXXXXXXXXXXTIIESQMPYLDQWXXXXXXXXXXX 264 Q + ++ +ISE+ DSV TI+E Q+PYLD+W Sbjct: 320 IQKALYSQAMRISEMRDSVNKKRIELELLQKSKILSTILEPQIPYLDEWSTMMEEYSVSI 379 Query: 263 XGAIKALQDVSIRLPIIGNVRTDVREVGGALNSASNVMEIMSSHVENFLPKAGEMDTLIS 84 I+AL + ++RLP+ GNVR DVRE+G ALNSAS +ME M S+++ F+PKA E D IS Sbjct: 380 TEVIQALVNATVRLPVGGNVRLDVRELGEALNSASKMMETMISNIQRFMPKAEETDISIS 439 Query: 83 ELAGVVSRERALIEECGDLLTK 18 ELA V ERAL+ ECGDLL+K Sbjct: 440 ELARVAGGERALVGECGDLLSK 461 >ref|XP_004486407.1| PREDICTED: uncharacterized protein LOC101508486 [Cicer arietinum] Length = 452 Score = 173 bits (439), Expect = 7e-41 Identities = 95/203 (46%), Positives = 128/203 (63%), Gaps = 1/203 (0%) Frame = -2 Query: 620 LPP-HPSNTKSGGTEAARKGRKISCHQEDVHALRLLQNRYLQWRYANAKAEVAMNSQLIT 444 LPP P KSG AR+G+K+S HQEDVH+LRLL NRY+QWR+ NA+A +M +Q Sbjct: 223 LPPVAPQFAKSG--VGARRGKKVSSHQEDVHSLRLLYNRYMQWRFCNARAAFSMKAQQKE 280 Query: 443 AQVSFHALVAKISELSDSVXXXXXXXXXXXXXXXXXTIIESQMPYLDQWXXXXXXXXXXX 264 + + +++ KISE+ +S T++E+Q+PYLD+W Sbjct: 281 
CEKALYSVATKISEMRESTIRKRIELELLRRSKTLSTVLEAQIPYLDEWSAMEEDYSVSI 340 Query: 263 XGAIKALQDVSIRLPIIGNVRTDVREVGGALNSASNVMEIMSSHVENFLPKAGEMDTLIS 84 AI+AL + S +LP +GNVR DVRE+G +LNSA +ME + S+ + +PKA E DT IS Sbjct: 341 TEAIQALLNASAQLPTVGNVRVDVRELGESLNSALKMMETIVSNTQRLMPKAEETDTSIS 400 Query: 83 ELAGVVSRERALIEECGDLLTKT 15 ELA VV ERALI ECGDLL+KT Sbjct: 401 ELARVVGGERALIGECGDLLSKT 423 >ref|XP_007159284.1| hypothetical protein PHAVU_002G225000g [Phaseolus vulgaris] gi|561032699|gb|ESW31278.1| hypothetical protein PHAVU_002G225000g [Phaseolus vulgaris] Length = 487 Score = 172 bits (437), Expect = 1e-40 Identities = 91/203 (44%), Positives = 126/203 (62%) Frame = -2 Query: 623 SLPPHPSNTKSGGTEAARKGRKISCHQEDVHALRLLQNRYLQWRYANAKAEVAMNSQLIT 444 SLPP P + + R+G+K S HQED+H++RLL NRYLQWR+ANA+A +Q + Sbjct: 258 SLPPVPPQSLKPSVDV-RRGKKGSGHQEDMHSIRLLYNRYLQWRFANARAHSTTKAQQVE 316 Query: 443 AQVSFHALVAKISELSDSVXXXXXXXXXXXXXXXXXTIIESQMPYLDQWXXXXXXXXXXX 264 +Q + ++ ISE+ DSV I+E+Q+PYLD+W Sbjct: 317 SQKALYSQAMTISEMRDSVNKKRIELEFLRRSETLSRILETQIPYLDEWSTMTEEYSVSI 376 Query: 263 XGAIKALQDVSIRLPIIGNVRTDVREVGGALNSASNVMEIMSSHVENFLPKAGEMDTLIS 84 I+AL + S++LP+ GNVR DVREVG ALNSA ++E M S+++ F+PKA E+D IS Sbjct: 377 TEVIQALVNASVQLPVGGNVRVDVREVGDALNSALKMLETMISNIQRFVPKAEEIDVSIS 436 Query: 83 ELAGVVSRERALIEECGDLLTKT 15 ELA + ERAL+ ECGDLL+KT Sbjct: 437 ELARIAGGERALVGECGDLLSKT 459 >ref|XP_006373542.1| hypothetical protein POPTR_0016s00230g [Populus trichocarpa] gi|550320452|gb|ERP51339.1| hypothetical protein POPTR_0016s00230g [Populus trichocarpa] Length = 462 Score = 171 bits (433), Expect = 4e-40 Identities = 90/189 (47%), Positives = 124/189 (65%) Frame = -2 Query: 581 EAARKGRKISCHQEDVHALRLLQNRYLQWRYANAKAEVAMNSQLITAQVSFHALVAKISE 402 +A+RK RK+S HQEDV +L+LL N YLQWRY NAKA+ + +Q + + ++L KI+E Sbjct: 244 DASRKTRKVSSHQEDVQSLKLLHNHYLQWRYVNAKAQASAQAQRRETERNLYSLGVKITE 303 Query: 401 LSDSVXXXXXXXXXXXXXXXXXTIIESQMPYLDQWXXXXXXXXXXXXGAIKALQDVSIRL 222 L DSV TI+E+QMPYLD+W AI+AL + S+++ Sbjct: 304 
LYDSVKRKRAELGLLQRLKILWTIVEAQMPYLDEWAAFEMDYSVSLSEAIQALLNASLQV 363 Query: 221 PIIGNVRTDVREVGGALNSASNVMEIMSSHVENFLPKAGEMDTLISELAGVVSRERALIE 42 PI GNVR D+REVG ALNSA+ +M+ ++ ++E+ +PKA E + LISELA V E+ALIE Sbjct: 364 PISGNVRVDIREVGEALNSATKLMDTVAFNIESLMPKAEETEHLISELARVTGGEKALIE 423 Query: 41 ECGDLLTKT 15 ECGDLL+ T Sbjct: 424 ECGDLLSMT 432 >gb|EYU33814.1| hypothetical protein MIMGU_mgv1a004982mg [Mimulus guttatus] Length = 502 Score = 170 bits (431), Expect = 6e-40 Identities = 96/207 (46%), Positives = 128/207 (61%) Frame = -2 Query: 623 SLPPHPSNTKSGGTEAARKGRKISCHQEDVHALRLLQNRYLQWRYANAKAEVAMNSQLIT 444 SLPPHPS+ G + RKG+K S QEDVH L++L N YLQWR+ANAKAE ++ +Q Sbjct: 267 SLPPHPSSCIRSGLDL-RKGKKGSNCQEDVHCLKMLSNHYLQWRFANAKAESSVLAQKQE 325 Query: 443 AQVSFHALVAKISELSDSVXXXXXXXXXXXXXXXXXTIIESQMPYLDQWXXXXXXXXXXX 264 + ++L KIS++ ++V TI+E+QMPYLD W Sbjct: 326 VERKLYSLNGKISDIRENVKRKHSELAVLRRIKTLSTIVEAQMPYLDGWADMEEDYSASL 385 Query: 263 XGAIKALQDVSIRLPIIGNVRTDVREVGGALNSASNVMEIMSSHVENFLPKAGEMDTLIS 84 G AL + S RLPI G VR DV E+G AL+SA V+E++ SH++ F+PKA EMDT +S Sbjct: 386 IGTTNALVNSSTRLPISGEVRVDVGELGEALHSAFKVVELIGSHIQGFIPKAEEMDTSVS 445 Query: 83 ELAGVVSRERALIEECGDLLTKTRMLQ 3 ELA + E AL+EECGDLL+KT + Q Sbjct: 446 ELARMARGEIALVEECGDLLSKTSISQ 472 >ref|XP_004295340.1| PREDICTED: uncharacterized protein LOC101295210 [Fragaria vesca subsp. 
vesca] Length = 468 Score = 170 bits (431), Expect = 6e-40 Identities = 116/295 (39%), Positives = 154/295 (52%), Gaps = 6/295 (2%) Frame = -2 Query: 869 RTLPDFRSSMPEVDLSSRLLQAPAPHRGKALLGGEYSKVTTSPCHRSL------TEHXXX 708 R L D RSSMPE ++ + + GG +SK++ SPC RSL +EH Sbjct: 165 RCLSDIRSSMPESNMLPTVSSRQTSEKS-CNNGGGFSKISASPCSRSLNLPLSSSEHLHF 223 Query: 707 XXXXXXXXXXXXSALSKHHHHHPSTTFKSLPPHPSNTKSGGTEAARKGRKISCHQEDVHA 528 ALSK + +T LPP P + RKG+K EDVH+ Sbjct: 224 QSVKGSEKLAS--ALSKPCTNAVNTGGLCLPPVPPCATAKPASEFRKGKK-----EDVHS 276 Query: 527 LRLLQNRYLQWRYANAKAEVAMNSQLITAQVSFHALVAKISELSDSVXXXXXXXXXXXXX 348 LRLL NRYLQWRYANA AE ++ +Q + + ++L +I+EL DSV Sbjct: 277 LRLLHNRYLQWRYANATAEASLRAQQRETERTLYSLALQITELYDSVKRKRIELGVLQRT 336 Query: 347 XXXXTIIESQMPYLDQWXXXXXXXXXXXXGAIKALQDVSIRLPIIGNVRTDVREVGGALN 168 TI+E+Q+PYLDQW A +AL + S+RLPI NV+ + +EV ALN Sbjct: 337 KNLSTILEAQIPYLDQWFSLKEDYLISLAEATQALLNTSLRLPISANVKANKQEVEEALN 396 Query: 167 SASNVMEIMSSHVENFLPKAGEMDTLISELAGVVSRERALIEECGDLLTKTRMLQ 3 SA V+E + HV+ +PKA E D LISELA V ER L+E G+LL+KT Q Sbjct: 397 SAMEVIEKIVFHVQQSMPKAEETDHLISELARVAGGERTLVEVSGNLLSKTYTTQ 451 >ref|XP_006857539.1| hypothetical protein AMTR_s00061p00037320 [Amborella trichopoda] gi|548861635|gb|ERN19006.1| hypothetical protein AMTR_s00061p00037320 [Amborella trichopoda] Length = 646 Score = 168 bits (426), Expect = 2e-39 Identities = 123/322 (38%), Positives = 164/322 (50%), Gaps = 21/322 (6%) Frame = -2 Query: 905 ICDSP-----LKSNCKMRTLPDF-RSSMPEVDLSSRLLQAPAPHRGKALLGGEYSKVTTS 744 ICDSP ++S+ +RT RSSMPE DL P G GG Sbjct: 323 ICDSPPIAANIQSSKILRTASSACRSSMPEADL--------LPTMGGRDGGG-------- 366 Query: 743 PCHRSLTEHXXXXXXXXXXXXXXXSALSKHHHHHPSTTFK----SLPPHPSNTKSGGT-- 582 PC RSL S +S++ S++ K SLPPHPS+ ++ + Sbjct: 367 PC-RSLNS-AFSSCVAAKSMSKQASVMSRYLVCSSSSSSKGSGVSLPPHPSHMRTASSLS 424 Query: 581 ---------EAARKGRKISCHQEDVHALRLLQNRYLQWRYANAKAEVAMNSQLITAQVSF 429 + RK +K S H +DVH ++LLQNRYLQWRYAN KAE A+ +Q ++A+ Sbjct: 425 TSMIKAAQPDTMRKVKKASSHDDDVHMMKLLQNRYLQWRYANVKAESALQAQGVSAEKCL 484 Query: 428 
HALVAKISELSDSVXXXXXXXXXXXXXXXXXTIIESQMPYLDQWXXXXXXXXXXXXGAIK 249 AL AKI +L DSV I+E QMP+LD W G Sbjct: 485 IALWAKIVKLHDSVERKRIELVQLKMAERLSKILEGQMPFLDSWENMEESYSNSLLGVSG 544 Query: 248 ALQDVSIRLPIIGNVRTDVREVGGALNSASNVMEIMSSHVENFLPKAGEMDTLISELAGV 69 AL + ++RLP+ GNVR D EV AL SA++V+E + +FLPK EM++L S +AG+ Sbjct: 545 ALHNATLRLPVTGNVRADTGEVEQALQSAADVLEATYPSLCSFLPKTEEMNSLASAVAGI 604 Query: 68 VSRERALIEECGDLLTKTRMLQ 3 V++ERALIEECG LL LQ Sbjct: 605 VTKERALIEECGQLLMMANKLQ 626 >ref|XP_007213944.1| hypothetical protein PRUPE_ppa003523mg [Prunus persica] gi|462409809|gb|EMJ15143.1| hypothetical protein PRUPE_ppa003523mg [Prunus persica] Length = 568 Score = 165 bits (418), Expect = 2e-38 Identities = 110/308 (35%), Positives = 153/308 (49%), Gaps = 11/308 (3%) Frame = -2 Query: 893 PLKSNCKMRTLPDFRSSMPEVDL-----SSRLLQAPAPHRGKALL--GGEYSKVTTSPCH 735 P+ +C R LPD RSSMPE DL S +L+ + RG A + ++ K + SPC Sbjct: 267 PVVPSCSTRCLPDIRSSMPEADLLPSVSSRQLVDKNSSSRGNATVTVSDDFLKCSASPCS 326 Query: 734 RSL----TEHXXXXXXXXXXXXXXXSALSKHHHHHPSTTFKSLPPHPSNTKSGGTEAARK 567 RSL + S +SK + + LPP P +T + + R+ Sbjct: 327 RSLKLPLSSSDILSFQPNKGSERLTSVVSKPYTNTGKMGGLCLPPVPPSTSAKLSSDTRR 386 Query: 566 GRKISCHQEDVHALRLLQNRYLQWRYANAKAEVAMNSQLITAQVSFHALVAKISELSDSV 387 G+K+S H EDVH+LR+L NRYLQWRY NA+AE +M +Q + + ++L KI+EL DSV Sbjct: 387 GKKVSGHLEDVHSLRVLHNRYLQWRYTNARAEASMRAQQRETERTLYSLAVKIAELYDSV 446 Query: 386 XXXXXXXXXXXXXXXXXTIIESQMPYLDQWXXXXXXXXXXXXGAIKALQDVSIRLPIIGN 207 I+++Q+PYLDQW A +AL + S +LPI GN Sbjct: 447 KRKRIELGILQRTETLSAILDAQIPYLDQWFALQGDHSSSLAEATQALSNASFQLPISGN 506 Query: 206 VRTDVREVGGALNSASNVMEIMSSHVENFLPKAGEMDTLISELAGVVSRERALIEECGDL 27 VR + F+ +A E + LISELA V ERALIEECG++ Sbjct: 507 VR------------------------KAFVIQAEETENLISELARVTGGERALIEECGNM 542 Query: 26 LTKTRMLQ 3 L+KT Q Sbjct: 543 LSKTYTTQ 550 >ref|XP_004231230.1| PREDICTED: uncharacterized protein LOC101250937 [Solanum lycopersicum] Length = 587 Score = 164 bits (415), Expect = 5e-38 Identities = 114/305 (37%), Positives = 163/305 (53%), Gaps = 7/305 (2%) 
Frame = -2 Query: 896 SPL-KSNCKMRTLPDFRSSMPEVD--LSSRLLQAPAPHRGKALLGGEYSKVTTSPCHRSL 726 SPL N K RTL RSS E+D L+ R +R ++ K S C RSL Sbjct: 272 SPLCPQNNKTRTLSAMRSSTSEIDRCLTER-------NRDSSVDECSSYKSAFSTCARSL 324 Query: 725 TEHXXXXXXXXXXXXXXXSALSKHHHHHPSTTFKS----LPPHPSNTKSGGTEAARKGRK 558 +L ++ ++FK LPPHP++ K G +AARKGRK Sbjct: 325 -----HLPTANNENSSSWLSLKQNDMFASRSSFKMGGLCLPPHPTSNKLGA-DAARKGRK 378 Query: 557 ISCHQEDVHALRLLQNRYLQWRYANAKAEVAMNSQLITAQVSFHALVAKISELSDSVXXX 378 Q +VH+L+LL N +LQWR+ANAKAE +M+SQ +Q ++ ++S+L SV Sbjct: 379 GFSDQGEVHSLKLLYNHHLQWRFANAKAEASMHSQRHESQSKLYSFAQQLSDLRKSVSQK 438 Query: 377 XXXXXXXXXXXXXXTIIESQMPYLDQWXXXXXXXXXXXXGAIKALQDVSIRLPIIGNVRT 198 TI+ESQ+P L +W G AL++ S+RLPI V Sbjct: 439 RAELGVLRRIKTLSTIVESQLPCLGEWANLEEDYSTSLSGTTDALRNCSLRLPIGTEVHV 498 Query: 197 DVREVGGALNSASNVMEIMSSHVENFLPKAGEMDTLISELAGVVSRERALIEECGDLLTK 18 D++E+G AL+S++ VME++ ++NF+ KA E ++L+SELA V E+AL+EECGDLL K Sbjct: 499 DIKELGDALSSSTKVMEMIGLQIQNFMQKAEETESLVSELARVSGGEKALVEECGDLLMK 558 Query: 17 TRMLQ 3 T + Q Sbjct: 559 TYISQ 563 >ref|XP_006347698.1| PREDICTED: protein ENDOSPERM DEFECTIVE 1-like [Solanum tuberosum] Length = 587 Score = 163 bits (413), Expect = 8e-38 Identities = 115/301 (38%), Positives = 160/301 (53%), Gaps = 7/301 (2%) Frame = -2 Query: 896 SPL-KSNCKMRTLPDFRSSMPEVD--LSSRLLQAPAPHRGKALLGGEYSKVTTSPCHRSL 726 SPL N K R L RSSM E+D L+ R +R ++ K +S C RSL Sbjct: 272 SPLCTQNNKTRMLSAMRSSMSEIDRCLTER-------NRDSSVDECSSYKSASSTCARSL 324 Query: 725 TEHXXXXXXXXXXXXXXXSALSKHHHHHPSTTFKS----LPPHPSNTKSGGTEAARKGRK 558 +L ++ ++FK LPPHP++ K G ARKGRK Sbjct: 325 -----HLPTANNENSSSWLSLKQNDMFASRSSFKMGSLCLPPHPTSNKLGAD--ARKGRK 377 Query: 557 ISCHQEDVHALRLLQNRYLQWRYANAKAEVAMNSQLITAQVSFHALVAKISELSDSVXXX 378 Q DVH+L+LL N +LQWR+ANAKAE +M++Q +Q ++ ++S+L SV Sbjct: 378 GFSDQGDVHSLKLLYNHHLQWRFANAKAEASMHTQRHDSQSKLYSFAQQLSDLRKSVSQK 437 Query: 377 XXXXXXXXXXXXXXTIIESQMPYLDQWXXXXXXXXXXXXGAIKALQDVSIRLPIIGNVRT 198 TI+ESQ+P L +W G AL++ S+RLPI V Sbjct: 438 RAELGVLRRIKTLSTILESQLPCLGEWANLEEDYSTSLSGTTDALRNCSLRLPIGTEVHV 
497 Query: 197 DVREVGGALNSASNVMEIMSSHVENFLPKAGEMDTLISELAGVVSRERALIEECGDLLTK 18 DVRE+G AL+S++ VME++ ++NF+ KA E ++LISELA V E+AL+EECGDLL K Sbjct: 498 DVRELGDALSSSTKVMEMIGLQIQNFMQKAEETESLISELARVSGGEKALVEECGDLLMK 557 Query: 17 T 15 T Sbjct: 558 T 558 >ref|XP_007026444.1| Family of Uncharacterized protein function, putative isoform 2 [Theobroma cacao] gi|508781810|gb|EOY29066.1| Family of Uncharacterized protein function, putative isoform 2 [Theobroma cacao] Length = 472 Score = 157 bits (396), Expect = 7e-36 Identities = 104/268 (38%), Positives = 141/268 (52%), Gaps = 13/268 (4%) Frame = -2 Query: 869 RTLPDFRSSMPEVDL----SSRLLQAPAPHRGKALLGGEYSKVTTSPCHRSLTEHXXXXX 702 R+L +F SSMPE DL S+RLL + + SK+ SP RSL Sbjct: 205 RSLVNFCSSMPEADLLPSVSTRLLTDRNVNNVV-----DSSKLPASPLSRSLNSPLSICE 259 Query: 701 XXXXXXXXXXSALSKHHHHHPSTTFK---------SLPPHPSNTKSGGTEAARKGRKISC 549 HHP+ K SLPP PS+TK+G T+A R+ +KIS Sbjct: 260 PSLF--------------HHPNPPIKGVSTRMGPLSLPPVPSHTKAG-TDAIRRPKKISS 304 Query: 548 HQEDVHALRLLQNRYLQWRYANAKAEVAMNSQLITAQVSFHALVAKISELSDSVXXXXXX 369 HQED+H+L+LL N YLQWRYANAKAE +M Q + + ++L KI+EL+D V Sbjct: 305 HQEDLHSLKLLHNYYLQWRYANAKAEASMQIQKGETERTLYSLEVKIAELNDCVRRKRIE 364 Query: 368 XXXXXXXXXXXTIIESQMPYLDQWXXXXXXXXXXXXGAIKALQDVSIRLPIIGNVRTDVR 189 I+E+QMPYL++W AI++L + S RLPI GNV+ D R Sbjct: 365 LELLQRMKTLSKILEAQMPYLEEWSAFQGDYLNSLAEAIQSLLNTSHRLPISGNVKADTR 424 Query: 188 EVGGALNSASNVMEIMSSHVENFLPKAG 105 +VG A+ SA ME++ HV++F+PK G Sbjct: 425 KVGEAMKSAIKWMEMILCHVQSFMPKVG 452 >ref|XP_007026447.1| Family of Uncharacterized protein function, putative isoform 5 [Theobroma cacao] gi|508781813|gb|EOY29069.1| Family of Uncharacterized protein function, putative isoform 5 [Theobroma cacao] Length = 455 Score = 154 bits (390), Expect = 4e-35 Identities = 103/266 (38%), Positives = 140/266 (52%), Gaps = 13/266 (4%) Frame = -2 Query: 869 RTLPDFRSSMPEVDL----SSRLLQAPAPHRGKALLGGEYSKVTTSPCHRSLTEHXXXXX 702 R+L +F SSMPE DL S+RLL + + SK+ SP RSL Sbjct: 205 
RSLVNFCSSMPEADLLPSVSTRLLTDRNVNNVV-----DSSKLPASPLSRSLNSPLSICE 259 Query: 701 XXXXXXXXXXSALSKHHHHHPSTTFK---------SLPPHPSNTKSGGTEAARKGRKISC 549 HHP+ K SLPP PS+TK+G T+A R+ +KIS Sbjct: 260 PSLF--------------HHPNPPIKGVSTRMGPLSLPPVPSHTKAG-TDAIRRPKKISS 304 Query: 548 HQEDVHALRLLQNRYLQWRYANAKAEVAMNSQLITAQVSFHALVAKISELSDSVXXXXXX 369 HQED+H+L+LL N YLQWRYANAKAE +M Q + + ++L KI+EL+D V Sbjct: 305 HQEDLHSLKLLHNYYLQWRYANAKAEASMQIQKGETERTLYSLEVKIAELNDCVRRKRIE 364 Query: 368 XXXXXXXXXXXTIIESQMPYLDQWXXXXXXXXXXXXGAIKALQDVSIRLPIIGNVRTDVR 189 I+E+QMPYL++W AI++L + S RLPI GNV+ D R Sbjct: 365 LELLQRMKTLSKILEAQMPYLEEWSAFQGDYLNSLAEAIQSLLNTSHRLPISGNVKADTR 424 Query: 188 EVGGALNSASNVMEIMSSHVENFLPK 111 +VG A+ SA ME++ HV++F+PK Sbjct: 425 KVGEAMKSAIKWMEMILCHVQSFMPK 450 >ref|XP_003594402.1| hypothetical protein MTR_2g028220 [Medicago truncatula] gi|355483450|gb|AES64653.1| hypothetical protein MTR_2g028220 [Medicago truncatula] Length = 519 Score = 154 bits (390), Expect = 4e-35 Identities = 92/214 (42%), Positives = 116/214 (54%), Gaps = 24/214 (11%) Frame = -2 Query: 572 RKGRKISCHQEDVHALRLLQNRYLQWRYANAKAEVAMNSQLITAQVSFHALVAKISELSD 393 RKG+K S HQEDVH+LR+ NRYLQWR+ANA+A AM Q + + + KISE+ D Sbjct: 278 RKGKKGSSHQEDVHSLRMFYNRYLQWRFANARAVNAMKVQQKECEKALFSRAMKISEMRD 337 Query: 392 SVXXXXXXXXXXXXXXXXXTIIESQ------------------------MPYLDQWXXXX 285 SV ++E+Q +PYLD+W Sbjct: 338 SVHRKRLELELLRRSKTLSIVLEAQSNAPPWAAIQNFFLLDDFCGMKVEIPYLDEWSAME 397 Query: 284 XXXXXXXXGAIKALQDVSIRLPIIGNVRTDVREVGGALNSASNVMEIMSSHVENFLPKAG 105 AI+AL + S+RLP GN+R DVREVG +LNSA VME + S+ + +PKA Sbjct: 398 EDYSVSINEAIQALLNASVRLPTGGNIRVDVREVGESLNSALKVMETIISNTQRLMPKAE 457 Query: 104 EMDTLISELAGVVSRERALIEECGDLLTKTRMLQ 3 E DT ISELA VV ERALIEECG L+KT Q Sbjct: 458 ETDTSISELARVVGGERALIEECGGFLSKTHKSQ 491