BLASTX nr result
ID: Catharanthus23_contig00021198
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00021198 (1747 letters)

Database: ./nr
37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value

ref|XP_002265116.1| PREDICTED: uncharacterized protein LOC100243...  244   7e-62
ref|XP_006358138.1| PREDICTED: caldesmon-like [Solanum tuberosum]    239   4e-60
ref|XP_004233852.1| PREDICTED: uncharacterized protein LOC101253...  211   8e-52
gb|EMJ15076.1| hypothetical protein PRUPE_ppa006822mg [Prunus pe...  209   4e-51
gb|EOY27918.1| Ovate family protein 5, putative [Theobroma cacao]    206   2e-50
ref|XP_002265079.1| PREDICTED: pentatricopeptide repeat-containi...  197   1e-47
emb|CBI30729.3| unnamed protein product [Vitis vinifera]             197   1e-47
gb|EXB31128.1| hypothetical protein L484_004602 [Morus notabilis]    193   2e-46
ref|XP_006358091.1| PREDICTED: pentatricopeptide repeat-containi...  193   2e-46
ref|XP_004163359.1| PREDICTED: uncharacterized protein LOC101232...  186   3e-44
gb|EOY27913.1| Pentatricopeptide repeat (PPR-like) superfamily p...  184   8e-44
ref|XP_002522269.1| conserved hypothetical protein [Ricinus comm...  184   1e-43
ref|XP_002305721.2| ovate family protein [Populus trichocarpa] g...  181   1e-42
ref|XP_006283500.1| hypothetical protein CARUB_v10004552mg [Caps...  177   2e-41
ref|XP_006414048.1| hypothetical protein EUTSA_v10024877mg [Eutr...  169   3e-39
ref|XP_002867972.1| pentatricopeptide repeat-containing protein ...  169   3e-39
emb|CBI30728.3| unnamed protein product [Vitis vinifera]             160   1e-36
gb|ESW31484.1| hypothetical protein PHAVU_002G241900g [Phaseolus...  158   8e-36
ref|XP_004309635.1| PREDICTED: uncharacterized protein LOC101311...
156 3e-35 ref|XP_006395137.1| hypothetical protein EUTSA_v10003805mg [Eutr... 153 3e-34 >ref|XP_002265116.1| PREDICTED: uncharacterized protein LOC100243022 [Vitis vinifera] Length = 444 Score = 244 bits (624), Expect = 7e-62 Identities = 174/435 (40%), Positives = 228/435 (52%), Gaps = 34/435 (7%) Frame = +1 Query: 1 PHHHHHPSLITRVFPSSWFSKFKQKGSSFEP-PKSAKAK--QKVPPSCSLSHAPWKEGRF 171 P S ++RVFP SWFSKFKQ G S P P+ K K + P S HA +GRF Sbjct: 11 PSSSSRASSMSRVFPVSWFSKFKQMGGSSRPQPERVKPKGRKNSPSRSSWQHASCGDGRF 70 Query: 172 YCGDDDPYWRLSFGEEG---MEQSKVWLNSVWDDPDDNVPEVTIPNSGSSKPEAV----- 327 Y G DD +WRLSFGEE + + L SV D DD V E+ + + S + + Sbjct: 71 YGGGDDGFWRLSFGEEDDVMKRRDRCILRSVLYDSDDEV-ELPLSSCRSCRSSSTKVGER 129 Query: 328 ---QMFNHMV------REMQEKKESLLENXXXXXXXXXXXXXXXSTCRQTIEGRRSRKLN 480 Q F+ MV R++ E L N ++T++ RRSRK N Sbjct: 130 GESQKFSDMVSDERKMRKLHGDVEISLGNGAHKGEKGRQETKFKIPRQETVKERRSRKAN 189 Query: 481 RRTLEEKQEKVAN----------TAEKIVFEVQPEKIIHTREDFSKSGVSESR-YQH--S 621 R L+EK + N + EK + +P I T+ + SK S R ++H S Sbjct: 190 GRVLKEKWSEFENELDAAKKSTKSVEKHSLKPEPVSRIQTKGEHSKLTTSHPRKHRHAAS 249 Query: 622 FSFPNSNLRTIEEECTLKTRNLETSNAFSKVXXXXXXXISLQCKRLEEMMLKTERQRESV 801 + +S L TIEE+CT + NLE +A SK L+ ++E+M K+E QR+S+ Sbjct: 250 MNLRSSILGTIEEDCTFASLNLEEPDAPSKEEKR-----KLKEMDIKELMSKSENQRKSI 304 Query: 802 YVDRNCXXXXXXXXXXXXCFSPRTAAKIE-CKIRALEDXXXXXXXXXXXXXXXLREAAIL 978 ++ R SPRT +K+E CKI+ALED L + Sbjct: 305 HLSRELQSRTKQRSKIRV-HSPRTPSKVEICKIKALEDMKAKMKMKKKIEERILEGRTQI 363 Query: 979 DSYAVAKSSFNPQKDFRDSMIEMIIEKGIETPEELEELLACYLTLNCDEYHDLIIEVFRQ 1158 +S+AV KSS +PQKDFRDSMIEMI+EKGI PEELEELLACYLTLN DEYHDLII+VFRQ Sbjct: 364 ESFAVVKSSLDPQKDFRDSMIEMIMEKGISQPEELEELLACYLTLNSDEYHDLIIKVFRQ 423 Query: 1159 VYFELNQVYLASELQ 1203 V+F LN+ Y ELQ Sbjct: 424 VWFGLNRAYFDPELQ 438 >ref|XP_006358138.1| PREDICTED: caldesmon-like [Solanum tuberosum] Length = 491 Score = 239 bits (609), Expect = 4e-60 Identities = 161/409 (39%), Positives = 223/409 (54%), Gaps = 15/409 (3%) Frame = +1 Query: 22 
SLITRVFPSSWFSKFKQKGSS-FEPPKSAKAKQK----VPPSCSLSHAPWKEGRFYCGDD 186 SLIT VFP SW SKFKQK E + AK K K + S ++ KEGRFY DD Sbjct: 11 SLITHVFPVSWLSKFKQKKDGRSEDQEGAKMKHKGKVDLTSYTSRTNVCLKEGRFY--DD 68 Query: 187 DPYWRLSFGEEGMEQSKVWLNSVWDDPDDNVPEVTIPNSGSSKPEAVQMFNHMV-REMQE 363 DPYWR+SF E+ E N +W D + + NS SS E FN MV R++ E Sbjct: 69 DPYWRISFSEDRFEAHPQ--NPLWCGSYDECDQDSASNSKSSLGEENHKFNDMVSRKISE 126 Query: 364 KKESLLENXXXXXXXXXXXXXXXSTCRQTIEGRRSRKLNRRTLEEK------QEKVANTA 525 K + + + R +++ + RKL+R+ LEE+ +E A Sbjct: 127 KPKKMWNSKNEAEFSNRK--------RNSVKDEKLRKLSRKALEERIAENAREEIAAEVI 178 Query: 526 EKIVFEVQPEKIIHTREDFSKSGVSESRYQHSFSFPNSNLRTIEEE-CTLKTRNLETSNA 702 EK +FE++PE + K SR S S+ +S+L ++EE T K+ NLE Sbjct: 179 EKDIFEIEPENEKVMKRGKEKPTAYNSRKTRSLSYTDSSLNSMEESYMTFKSLNLEEEAD 238 Query: 703 FSKVXXXXXXXISLQCKRLEEMMLKTE-RQRESVYVDRNCXXXXXXXXXXXXCFSPRTAA 879 + ++ +++EM K+ +QR+SVY+++ +SPRTA Sbjct: 239 ALSEEEFESECLEMKDMKIKEMSEKSGCQQRKSVYINQK---RRRKHGIKVRAYSPRTA- 294 Query: 880 KIECKIRALEDXXXXXXXXXXXXXXXLR-EAAILDSYAVAKSSFNPQKDFRDSMIEMIIE 1056 K+EC+I+ALED R + + DSYA+ KSSF+P DFRDSMIEMI + Sbjct: 295 KMECRIKALEDMKKARMKTRHETKESFRGDRTVFDSYAIMKSSFDPFSDFRDSMIEMITQ 354 Query: 1057 KGIETPEELEELLACYLTLNCDEYHDLIIEVFRQVYFELNQVYLASELQ 1203 +GI++ EELEELLACYLTLNCDEYHD+II+VFRQV+FELNQV + +EL+ Sbjct: 355 RGIKSSEELEELLACYLTLNCDEYHDIIIKVFRQVWFELNQVNIGAELE 403 >ref|XP_004233852.1| PREDICTED: uncharacterized protein LOC101253557 [Solanum lycopersicum] Length = 13995 Score = 211 bits (537), Expect = 8e-52 Identities = 149/405 (36%), Positives = 216/405 (53%), Gaps = 7/405 (1%) Frame = +1 Query: 22 SLITRVFPSSWFSKFKQKGSSFEPPKSAKAKQKVPPSCSLSHAPWKEGRFYCGDDDPYWR 201 SL+T VFP SW SKFKQK + +KV ++ K+GRFY +DDPYWR Sbjct: 9005 SLMTHVFPVSWLSKFKQKKVCRSEDQEGAKMRKVDLRTNVC---LKQGRFY--EDDPYWR 9059 Query: 202 LSFGEEGMEQSKVWLNSVWDDPDDNVPEVTIPNSGSSKPEAVQMFNHMV-REMQEKKESL 378 +SF EE Q+ +W NS SS E FN MV R++ EK ++ Sbjct: 9060 ISFSEENHPQNPLWCGECDQ------------NSKSSLGEENHKFNDMVSRKISEKPKNE 9107 Query: 379 
LENXXXXXXXXXXXXXXXSTCRQTIEGRRSRKLNRRTLEEKQEKVAN--TAEKIVFEVQP 552 E + R +++ + RKL+R+ LEE+ + A EK +FE++P Sbjct: 9108 AE--------------FSNRKRNSVKDEKLRKLSRKALEERIAENAREEVTEKDIFEIEP 9153 Query: 553 EKIIHTREDFSKSGVSESRYQHSFSFPNSNLRTIEEECTLKTR-NLET-SNAFSKVXXXX 726 E + K +SR S S+ +S+ ++EE C + T NLE ++A S+ Sbjct: 9154 EDEKVMKRGKEKPTAYKSRKARSLSYNDSSPNSVEESCMMFTSLNLEEEADALSE----- 9208 Query: 727 XXXISLQCKRLEEMMLKTE-RQRESVYVDRNCXXXXXXXXXXXXCFSPRTAAKIECKIRA 903 +C +++EM K+ +QR+SVY+++ +SPRTA K+EC+I+A Sbjct: 9209 -EEFESECLKIKEMSEKSGCQQRKSVYINQK---RRRKHGIKVRAYSPRTA-KMECRIKA 9263 Query: 904 LEDXXXXXXXXXXXXXXXLR-EAAILDSYAVAKSSFNPQKDFRDSMIEMIIEKGIETPEE 1080 LED + + DSYA+ KSSF+P DFRDSMIEMI ++GI++ EE Sbjct: 9264 LEDMKKARMKTRHETKESFTGDRTVFDSYAIMKSSFDPFSDFRDSMIEMITQRGIKSSEE 9323 Query: 1081 LEELLACYLTLNCDEYHDLIIEVFRQVYFELNQVYLASELQTYLL 1215 LEELLACYLTLNCDEYHD+II+VFRQ++ +AS+L +L+ Sbjct: 9324 LEELLACYLTLNCDEYHDIIIKVFRQIHPTAANNLVASQLDQHLV 9368 Score = 104 bits (260), Expect = 1e-19 Identities = 55/126 (43%), Positives = 74/126 (58%) Frame = -1 Query: 1747 YGIKPSIEHYGCMVDLLGRFGLLEEAKELVEKMPVKETPLIWESLLSSCRNHGNIELARK 1568 Y I PS E Y CMVDLLGR G LEEA L++ M T +W SL +CR H NI++A Sbjct: 2508 YSITPSCERYACMVDLLGRAGRLEEAFLLIKGMKENVTVEMWGSLFEACRMHNNIKIAGC 2567 Query: 1567 IAGRLLDLDPQDSSGYVQLSNIHASMGRWSDVLEIRRKMRAKQISKDPGGSMIEIDGIVR 1388 +LL+L+P S+ V LSN++A +GRW DV +R M+ + PG S +E + Sbjct: 2568 AIEKLLELEPHTSTNLVVLSNMYAELGRWGDVERVRETMKKSGAGRLPGCSWVEDRNQLL 2627 Query: 1387 EFLAGE 1370 FL G+ Sbjct: 2628 VFLCGD 2633 >gb|EMJ15076.1| hypothetical protein PRUPE_ppa006822mg [Prunus persica] Length = 394 Score = 209 bits (531), Expect = 4e-51 Identities = 155/413 (37%), Positives = 214/413 (51%), Gaps = 18/413 (4%) Frame = +1 Query: 19 PSLITRVFPSSWFSKFKQKGSSFEPPKSA---KAKQKVPPSCSLSHAPWKEG-RFYC--- 177 P LI+ VFP+SW SKFKQKG + EP S K KQ P S A K G RFY Sbjct: 23 PFLISHVFPTSWLSKFKQKGGNSEPKPSKVNRKGKQNSPSLGSPRFAAAKGGGRFYGVVD 82 Query: 178 GDDDPYWRLSFGEEGME--QSKVWLNSVWDDPDDNVPEVTIPNSGSSKPEAVQMFNHMV- 348 DDD +WRLSFGE+ E +++ 
L SVW D DD EV + GS Q + +V Sbjct: 83 DDDDAFWRLSFGEDSAEVKKNRGVLRSVWFDSDDEF-EVQPSSRGSC-----QTSDRIVK 136 Query: 349 -REMQEKKESLLENXXXXXXXXXXXXXXXSTCRQTIEGRRSRKLNRRTLEEKQEKVANTA 525 R++ +K + + ++ + RK NR +K K+ A Sbjct: 137 GRDLSQKLKGMQKSGNAKEW-------------------KLRKENRELEGKKLLKLERDA 177 Query: 526 EKIVFEVQPEKIIHTREDFSKSGVSESRYQHSFSFPNSNLRTIEEECTLKTRNLETSNAF 705 +K E+ S V Y S + NS+L+TI+EE NLET Sbjct: 178 DKA-------------EETSTETVESDEYVSSLNSRNSSLKTIQEEAL----NLETEEPS 220 Query: 706 SKVXXXXXXXISLQCKRLEEMMLKTERQRESVYVDRNCXXXXXXXXXXXXCFSPRTAAKI 885 + L+ ++++E+ K+E+ R+S+Y+ R SPRTA+++ Sbjct: 221 EEKQPSDWQ--KLKERKIQEVKSKSEKHRKSLYISRELQRRRTKRSSKVRVCSPRTASRV 278 Query: 886 E-CKIRALEDXXXXXXXXXXXXXXXLREAAI------LDSYAVAKSSFNPQKDFRDSMIE 1044 E CKI+ALE+ +E A+ L+S+AV K SF+PQ+DFRDSM+E Sbjct: 279 EICKIKALENMTKAKMKMKRVA----KEGAVQQVRTGLNSFAVVKCSFDPQQDFRDSMVE 334 Query: 1045 MIIEKGIETPEELEELLACYLTLNCDEYHDLIIEVFRQVYFELNQVYLASELQ 1203 MI+EK I PE+LEELLACYLTLN DEYHDLI++VFRQV+F+LNQ ++ELQ Sbjct: 335 MIVEKKITQPEDLEELLACYLTLNSDEYHDLIVKVFRQVWFDLNQASFSTELQ 387 >gb|EOY27918.1| Ovate family protein 5, putative [Theobroma cacao] Length = 438 Score = 206 bits (525), Expect = 2e-50 Identities = 144/428 (33%), Positives = 224/428 (52%), Gaps = 33/428 (7%) Frame = +1 Query: 19 PSLITRVFPSSWFSKFKQKGSSFEPPKSAKAKQKVPPSCSLSHAPWKEG---RFYCGDDD 189 PSL +RV P++W S FK+ + EP K AK +QK + + G RFY GD + Sbjct: 19 PSL-SRVLPTAWLSTFKRMSINSEP-KPAKDRQKGMSNAVPGRSSKFAGGGARFYGGDGE 76 Query: 190 PYWRLSFGEEGME--QSKVWLNSVWDDPDDNVPEV--TIPNSGSS-----KPEAVQMFNH 342 +WRLSFGE+ + SK L S W D DD + + + GS+ + E Q F++ Sbjct: 77 AFWRLSFGEDSADGKTSKSLLRSAWYDSDDELDFAPSSCQSCGSNATRTKEKEETQKFSN 136 Query: 343 MVREMQEKKE-----SLLENXXXXXXXXXXXXXXXSTCRQTIEGRRSRKLNRRTLEEK-- 501 M ++++ KE +L + + T + + +K N R +EEK Sbjct: 137 MACDVKKMKEFRRDTQILPDVNMYKEEKATVVKTPRSRTITEKDLKLKKTNERAMEEKRV 196 Query: 502 ---------QEKVANTAEKIVFEVQPEKIIHT--REDFSKSGVSESRYQH--SFSFPNSN 642 Q+K A + K + +P + I RE+ +G + ++QH + + SN Sbjct: 197 
KRQNKSGEAQQKSAKSVGKNTLDPEPMRTIPMTERENLKLTGNYQRKHQHLSTMNLRTSN 256 Query: 643 LRTIEEECTLKTRNLETSNAFSKVXXXXXXXISLQCKRLEEMMLKTERQRESVYVDRNCX 822 L TI+E+C+ + L ++ FS ++L ++ +K+++QR+S+Y+ R Sbjct: 257 LTTIKEDCSFTAQKLLETDVFSP-------------EKLSKVKVKSDKQRKSLYMSRELP 303 Query: 823 XXXXXXXXXXXCFSPRTAAKIE-CKIRALEDXXXXXXXXXXXXXXXLREAAILDSYAVAK 999 FSPRTA+++E CKI+ALED + L+++A+ K Sbjct: 304 RRRMKQNNKVRVFSPRTASRVEICKIKALEDMKKAKLKMKAAKQKTISRRTGLENFAMVK 363 Query: 1000 SSFNPQKDFRDSMIEMIIEKGIETPEELEELLACYLTLNCDEYHDLIIEVFRQVYFELNQ 1179 SF+P+KDFRDSM+EMI+EK I PEELEELLACYLTLN D YHDLII+VF+QV+ +L+Q Sbjct: 364 CSFDPEKDFRDSMVEMIMEKRISQPEELEELLACYLTLNSDAYHDLIIKVFQQVWLDLDQ 423 Query: 1180 VYLASELQ 1203 ++L+ Sbjct: 424 ASSDTDLR 431 >ref|XP_002265079.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18840-like [Vitis vinifera] Length = 536 Score = 197 bits (501), Expect = 1e-47 Identities = 90/130 (69%), Positives = 114/130 (87%) Frame = -1 Query: 1747 YGIKPSIEHYGCMVDLLGRFGLLEEAKELVEKMPVKETPLIWESLLSSCRNHGNIELARK 1568 +GI+P+IEHYGCMVDLLGR GLLEEA+ELV+KMP KE ++WESLL +CRNHGN+ELA + Sbjct: 402 HGIQPTIEHYGCMVDLLGRVGLLEEAEELVQKMPQKEASVVWESLLGACRNHGNVELAER 461 Query: 1567 IAGRLLDLDPQDSSGYVQLSNIHASMGRWSDVLEIRRKMRAKQISKDPGGSMIEIDGIVR 1388 +A +LL+L PQ+SS +VQLSN++ASMGRW DV+E+R+KMRA+ + KDPG SMIE+DG V Sbjct: 462 VAQKLLELSPQESSSFVQLSNMYASMGRWKDVMEVRQKMRAQGVRKDPGCSMIEVDGTVY 521 Query: 1387 EFLAGEGIFS 1358 EFLAGEG+ S Sbjct: 522 EFLAGEGLVS 531 >emb|CBI30729.3| unnamed protein product [Vitis vinifera] Length = 506 Score = 197 bits (501), Expect = 1e-47 Identities = 90/130 (69%), Positives = 114/130 (87%) Frame = -1 Query: 1747 YGIKPSIEHYGCMVDLLGRFGLLEEAKELVEKMPVKETPLIWESLLSSCRNHGNIELARK 1568 +GI+P+IEHYGCMVDLLGR GLLEEA+ELV+KMP KE ++WESLL +CRNHGN+ELA + Sbjct: 372 HGIQPTIEHYGCMVDLLGRVGLLEEAEELVQKMPQKEASVVWESLLGACRNHGNVELAER 431 Query: 1567 IAGRLLDLDPQDSSGYVQLSNIHASMGRWSDVLEIRRKMRAKQISKDPGGSMIEIDGIVR 1388 +A +LL+L PQ+SS +VQLSN++ASMGRW DV+E+R+KMRA+ + KDPG SMIE+DG V Sbjct: 432 
VAQKLLELSPQESSSFVQLSNMYASMGRWKDVMEVRQKMRAQGVRKDPGCSMIEVDGTVY 491 Query: 1387 EFLAGEGIFS 1358 EFLAGEG+ S Sbjct: 492 EFLAGEGLVS 501 >gb|EXB31128.1| hypothetical protein L484_004602 [Morus notabilis] Length = 382 Score = 193 bits (491), Expect = 2e-46 Identities = 149/417 (35%), Positives = 203/417 (48%), Gaps = 23/417 (5%) Frame = +1 Query: 22 SLITRVFPSSWFSKFKQKGSSFEP-PKSAKAKQK--------VPPSCSLSHAPWKEGRFY 174 SLI+ VFP SW SKFK K + EP P+ K K P C+ S+ G+FY Sbjct: 25 SLISHVFPVSWLSKFKHKSGNSEPKPRKGNPKGKWNSPPLISSPRYCASSNG---NGQFY 81 Query: 175 C--GDDDPYWRLSFGEEGMEQSK--VWLNSVWDDPDDNVPEVTIPNSGSSKPEAVQMFNH 342 DDD +WRLSFG++G E+ K + + SVW DP+D + +P+ + + + Q N Sbjct: 82 GLEADDDAFWRLSFGQDGAEEKKKILGMKSVWHDPNDEL-RTPVPSCPNCRRKEAQKSN- 139 Query: 343 MVREMQEKKESLLENXXXXXXXXXXXXXXXSTCRQTIEGRRSRKLNRRTLEEKQEKVANT 522 KE L + T +T + R+SR Sbjct: 140 --------KERELRSP---------------TSNRTAKERKSR----------------- 159 Query: 523 AEKIVFEVQPEKIIHTREDFSKSGVSESRYQHSFSFPNSNLRTIEEECTLKTRNLETSNA 702 E FE + + RY N L+TIEEE NLE S Sbjct: 160 GESGAFETR----------------RKCRYVSPVGARNPGLKTIEEEGL----NLENSEE 199 Query: 703 FSKVXXXXXXXISLQCKRLEEMMLKTERQRESVYVDR-NCXXXXXXXXXXXXCFSPRTAA 879 S+ + + K ++E+ + E+QR+SVY+ R +SPRTA+ Sbjct: 200 HSEENTGFHCKVMKELK-MKEVKSRNEKQRKSVYLSREEQQRKRTKQINKARAYSPRTAS 258 Query: 880 KIE-CKIRALED-XXXXXXXXXXXXXXXLREAAI-------LDSYAVAKSSFNPQKDFRD 1032 ++E CK++ALED ++E LDS+AV KSSF+PQKDF+D Sbjct: 259 RLEVCKVKALEDMKKAKLKMKRKKKKKEVKERGTVEQSKTGLDSFAVVKSSFDPQKDFKD 318 Query: 1033 SMIEMIIEKGIETPEELEELLACYLTLNCDEYHDLIIEVFRQVYFELNQVYLASELQ 1203 SM+EMI+EK + PEELEELLACYLTLN DEYHD+II+VFRQV+FEL Q +SELQ Sbjct: 319 SMVEMIVEKKMSKPEELEELLACYLTLNSDEYHDMIIKVFRQVWFELGQSCFSSELQ 375 >ref|XP_006358091.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18840-like [Solanum tuberosum] Length = 536 Score = 193 bits (491), Expect = 2e-46 Identities = 90/128 (70%), Positives = 113/128 (88%) Frame = -1 Query: 1747 YGIKPSIEHYGCMVDLLGRFGLLEEAKELVEKMPVKETPLIWESLLSSCRNHGNIELARK 1568 Y I+P++ HYGCMVDLLGRFGLLEEA+ELV 
K+PVKE P IWESLLS+ R+H ++ELA + Sbjct: 407 YRIQPTLVHYGCMVDLLGRFGLLEEAEELVSKLPVKEAPAIWESLLSASRSHNDVELAER 466 Query: 1567 IAGRLLDLDPQDSSGYVQLSNIHASMGRWSDVLEIRRKMRAKQISKDPGGSMIEIDGIVR 1388 IA +LL++DP+DS+GYVQLSN+ ASMGRW DV E+RRKMR++ I+K+PG SMIE+DG+V Sbjct: 467 IATKLLEVDPRDSAGYVQLSNVLASMGRWDDVREVRRKMRSEGITKEPGCSMIEVDGVVH 526 Query: 1387 EFLAGEGI 1364 EFLAGEGI Sbjct: 527 EFLAGEGI 534 >ref|XP_004163359.1| PREDICTED: uncharacterized protein LOC101232237 [Cucumis sativus] Length = 441 Score = 186 bits (472), Expect = 3e-44 Identities = 138/430 (32%), Positives = 206/430 (47%), Gaps = 39/430 (9%) Frame = +1 Query: 28 ITRVFPSSWFSKFKQKGSSFEP-PKSAKAKQKVPPSC-----------SLSHAPWKEGRF 171 ++ + P+SW SK KQK S+ E P+ K +K C SL R Sbjct: 1 MSNILPASWLSKLKQKKSNQEARPRKVKGTEKGSSPCIQSPDFANVTPSLGQVNGNRNRL 60 Query: 172 YCGDDDPYWRLSFGEEGME--QSKVWLNSVW--DDPDDNVPEVTIPNSGSSKPE-----A 324 + GD+ +W+L FG E ++ +S L SVW + + ++P + + + E Sbjct: 61 FTGDNGEFWKLPFGGEDIDVKKSSEILRSVWYNSENEHDLPRTSCRSCRTKYTEFEGNEE 120 Query: 325 VQMFNHMVREMQEKKESLLENXXXXXXXXXXXXXXXSTCRQTIEG-------------RR 465 +Q + MV M ++ E +T R ++ Sbjct: 121 IQNLDDMVSRMTRRRRRRREAPIQVKLLRRESETESTTPRSKYRENGNFGNFGKKGVEKK 180 Query: 466 SRKLNRRTLEEKQEKVANTAEKIVFEVQPEKIIHTRE-DFSK-SGVSESRYQHSFSFPNS 639 K R T + K+ + K + V+ E + E D +K + + RY S +S Sbjct: 181 GFKPERETDKGKEIRARRLVGKKMLGVEEESGVRKNERDKTKLTNSRKHRYVPSTMSKSS 240 Query: 640 NLRTIEEECTLKTRNLETSNAFSKVXXXXXXXIS-LQCKRLEEMMLKTERQRESVYVDRN 816 NL TIEE C + E S+ + ++ ++EE+ L+ E+QR+ +Y+ ++ Sbjct: 241 NLGTIEENCVFSSMKAEESDGHDTLGIEIDSDWERMKELKIEELKLRYEKQRQPLYIRKD 300 Query: 817 CXXXXXXXXXXXXCFSPRTAAKIE-CKIRALEDXXXXXXXXXXXXXXX-LREAAILDSYA 990 +SPRTA KIE CKI+ALED + + L+S+A Sbjct: 301 SNEKNPKGRRKIRVYSPRTANKIEICKIKALEDMKKAKLKMKKKVKESTVEDDTDLESFA 360 Query: 991 VAKSSFNPQKDFRDSMIEMIIEKGIETPEELEELLACYLTLNCDEYHDLIIEVFRQVYFE 1170 V KSSF+PQ+DFRDSM+EMI+E+ I EELEELLACYLTLN D+YHDLII+VFRQV+F+ Sbjct: 361 VVKSSFDPQQDFRDSMVEMIMERRISKAEELEELLACYLTLNSDQYHDLIIKVFRQVWFD 420 Query: 1171 LNQVYLASEL 1200 LNQ L SEL Sbjct: 421 
LNQAALESEL 430 >gb|EOY27913.1| Pentatricopeptide repeat (PPR-like) superfamily protein, putative isoform 1 [Theobroma cacao] gi|508780658|gb|EOY27914.1| Pentatricopeptide repeat (PPR-like) superfamily protein, putative isoform 1 [Theobroma cacao] gi|508780659|gb|EOY27915.1| Pentatricopeptide repeat (PPR-like) superfamily protein, putative isoform 1 [Theobroma cacao] gi|508780660|gb|EOY27916.1| Pentatricopeptide repeat (PPR-like) superfamily protein, putative isoform 1 [Theobroma cacao] gi|508780661|gb|EOY27917.1| Pentatricopeptide repeat (PPR-like) superfamily protein, putative isoform 1 [Theobroma cacao] Length = 535 Score = 184 bits (468), Expect = 8e-44 Identities = 81/128 (63%), Positives = 111/128 (86%) Frame = -1 Query: 1747 YGIKPSIEHYGCMVDLLGRFGLLEEAKELVEKMPVKETPLIWESLLSSCRNHGNIELARK 1568 YGI+P+IEH+GCMVDLLG+ GLLEEA +LV+K P+KE P++WESLLS+C+ HGN+E+A Sbjct: 403 YGIQPTIEHFGCMVDLLGQVGLLEEALDLVKKRPLKEAPVLWESLLSACKKHGNVEMAEH 462 Query: 1567 IAGRLLDLDPQDSSGYVQLSNIHASMGRWSDVLEIRRKMRAKQISKDPGGSMIEIDGIVR 1388 +A +LL+L+PQDS+GYVQLSN +A++ RW DV+ +R KM+A +I K+PG SMIE+DG+V Sbjct: 463 VARKLLELNPQDSAGYVQLSNTYAALQRWDDVMNVRSKMKALKIKKEPGCSMIEVDGVVH 522 Query: 1387 EFLAGEGI 1364 EFL+GEG+ Sbjct: 523 EFLSGEGM 530 >ref|XP_002522269.1| conserved hypothetical protein [Ricinus communis] gi|223538522|gb|EEF40127.1| conserved hypothetical protein [Ricinus communis] Length = 417 Score = 184 bits (467), Expect = 1e-43 Identities = 149/413 (36%), Positives = 206/413 (49%), Gaps = 28/413 (6%) Frame = +1 Query: 25 LITRVFPSSWFSKFKQKGSSFEPPKSAKAKQK-------VPPSCSLSHAPWKEGRFYCGD 183 L ++V P+SW +KFKQ + K AK KQK PS ++ G+FY GD Sbjct: 15 LTSQVLPTSWLTKFKQMSMN-SGEKQAKMKQKGKWNSVTTNPSSYATNTTGV-GKFYGGD 72 Query: 184 DDPYWRLSFGEEGMEQSK-------VWLNSVWDDPDDNVPEVTIPNSGSSKPEAVQMFNH 342 D +WRLSFGE+ +E K V NS DD D P + + + Q F+ Sbjct: 73 GDAFWRLSFGEDSLEGMKSRGVFKSVRYNSDDDDNDLEFPPSSFQSYSRVNEKEAQKFSD 132 Query: 343 MVREMQEKK--ESLLENXXXXXXXXXXXXXXXSTCRQTIEGRRS-RKLNRRTLEEKQ--- 504 MV ++ + +E T R +E ++ RK N R E+KQ Sbjct: 
133 MVSHARKMRGLPKEIEIFPRVQTCIREKVAEIRTPRLGVEREKTLRKGNYRVFEDKQLEG 192 Query: 505 -----EKVANTA-EKIVFEVQPEKIIHTREDFSKSGVSESRYQHSFSFPNSNLRTIEEEC 666 EK A K ++E +P K + + K ++SR +S LR IEE+C Sbjct: 193 RQAEAEKHPRKAVAKNMYERKPGKFVEGED--VKLAAADSR--------DSYLREIEEDC 242 Query: 667 TLKTRNLETSNAFSKVXXXXXXXISLQCKRLEEMMLKTERQRESVYVDRNCXXXXXXXXX 846 +L E+ +++ L+ +++EE+ + E QR+SVY+ R+ Sbjct: 243 SLCAEK-ESDGFYAENHSYKWQ--KLKERKIEEVKSRKEEQRKSVYISRDVERKTKQNNK 299 Query: 847 XXXCFSPRTAAKIE-CKIRALEDXXXXXXXXXXXXXXX-LREAAILDSYAVAKSSFNPQK 1020 SPRTA+K E CKI+ALED + E L+S+AV K S++PQK Sbjct: 300 VKVN-SPRTASKAEICKIKALEDMKKAKLKAKKKAKGKTVEEFQGLESFAVVKCSYDPQK 358 Query: 1021 DFRDSMIEMIIEKGIETPEELEELLACYLTLNCDEYHDLIIEVFRQVYFELNQ 1179 DFRDSM+EMI E+ I EELEELLACYLTLN DEYHDLII VFRQV+F+LNQ Sbjct: 359 DFRDSMVEMIKEQNISRSEELEELLACYLTLNSDEYHDLIIRVFRQVWFDLNQ 411 >ref|XP_002305721.2| ovate family protein [Populus trichocarpa] gi|550340428|gb|EEE86232.2| ovate family protein [Populus trichocarpa] Length = 431 Score = 181 bits (458), Expect = 1e-42 Identities = 146/424 (34%), Positives = 213/424 (50%), Gaps = 29/424 (6%) Frame = +1 Query: 19 PSLITRVFPSSWFSKFKQKGSSFEPPKS-AKAKQKVP-PSCSLSHAPWKEG----RFYCG 180 PSLI+ VFP+SW +KFK S P + AKAKQK S S S P+ G RFY G Sbjct: 14 PSLISHVFPTSWLTKFKHM--SINPGQEHAKAKQKGKWNSVSASPLPFARGEGGGRFYGG 71 Query: 181 DDDPYWRLSFGEEGMEQSKVWLNSVWDDPDDNVPEVTIPNSG-----------SSKPEAV 327 D D +WRLSFG+E L+S +D D E+ P S +++ E Sbjct: 72 DGDAFWRLSFGDESASTGA--LSSFHNDLDS---ELQAPPSSCHSCRSNATRVNNRKEDK 126 Query: 328 QMFNHMVREMQEKK--ESLLENXXXXXXXXXXXXXXXSTCRQTIEGRRS-RKLNRRTLEE 498 F++ V E ++ + +E T R +E RK ++R E Sbjct: 127 IRFSNKVSEARKMRGLPREIEILPEMDACISEKVAEIRTPRLRVEREEKLRKTDQRVFEA 186 Query: 499 KQEKV---ANTAEKIVFEVQPEKIIHTREDFSKSGVSESRYQ----HSFSFPNSNLRTIE 657 +Q K+ + AE++ + + I T + + + + HS +++LR + Sbjct: 187 QQFKLDGESYEAERVSRKETSKNISETESERTIGRIEREDCKLTASHSKKDFSTHLRKTK 246 Query: 658 EECTLKTRNLETSNAFSKVXXXXXXXISLQCKRLEEMMLKTERQRESVYVDRNCXXXXXX 837 ++ +N A + +L+ ++EE+ K E+QR+S+Y++R Sbjct: 247 
KDFVFAAQNESDGFAAENLSSEWQ---TLKDMKIEELKTKREKQRKSLYINRELQRKKKS 303 Query: 838 XXXXXXCFSPRTAAKIE-CKIRALEDXXXXXXXXXXXXXXXLREAAI-LDSYAVAKSSFN 1011 SPRTA+K+E C+I+ALED E L+++AV K+SF+ Sbjct: 304 KVR---AISPRTASKVEICRIKALEDMKKAKMKKKKKAREKKMEGFTGLENFAVVKTSFD 360 Query: 1012 PQKDFRDSMIEMIIEKGIETPEELEELLACYLTLNCDEYHDLIIEVFRQVYFELNQVYLA 1191 PQKDFRDSMIEMI EK I EELEELLACYLTLN DEYHDLI++VFRQV+F+LN+ Sbjct: 361 PQKDFRDSMIEMIEEKRISRSEELEELLACYLTLNADEYHDLIVKVFRQVWFDLNEACSD 420 Query: 1192 SELQ 1203 +EL+ Sbjct: 421 TELE 424 >ref|XP_006283500.1| hypothetical protein CARUB_v10004552mg [Capsella rubella] gi|482552205|gb|EOA16398.1| hypothetical protein CARUB_v10004552mg [Capsella rubella] Length = 537 Score = 177 bits (448), Expect = 2e-41 Identities = 79/128 (61%), Positives = 107/128 (83%) Frame = -1 Query: 1747 YGIKPSIEHYGCMVDLLGRFGLLEEAKELVEKMPVKETPLIWESLLSSCRNHGNIELARK 1568 YGI+P++EHYGCMVDLLGR G +EEA+ELV ++P E ++ ESLL SC+ G +E A + Sbjct: 407 YGIEPTVEHYGCMVDLLGRMGKIEEAEELVNEIPADEASMLLESLLGSCKRFGKLEQAER 466 Query: 1567 IAGRLLDLDPQDSSGYVQLSNIHASMGRWSDVLEIRRKMRAKQISKDPGGSMIEIDGIVR 1388 IA RLL+L+P +SSGYVQ+SN++AS GRW +V+E+RRKMRA++++K PG SMIE+DG+V Sbjct: 467 IANRLLELNPHESSGYVQMSNLYASNGRWDEVMEVRRKMRAERVNKKPGCSMIEVDGVVH 526 Query: 1387 EFLAGEGI 1364 EFLAGEG+ Sbjct: 527 EFLAGEGL 534 >ref|XP_006414048.1| hypothetical protein EUTSA_v10024877mg [Eutrema salsugineum] gi|557115218|gb|ESQ55501.1| hypothetical protein EUTSA_v10024877mg [Eutrema salsugineum] Length = 535 Score = 169 bits (429), Expect = 3e-39 Identities = 77/128 (60%), Positives = 102/128 (79%) Frame = -1 Query: 1747 YGIKPSIEHYGCMVDLLGRFGLLEEAKELVEKMPVKETPLIWESLLSSCRNHGNIELARK 1568 YG++P+IEHYGCMVDLLGR G EEA+ELV + P E ++ ESLL +C+ G +E A Sbjct: 405 YGVEPTIEHYGCMVDLLGRMGKFEEAEELVNETPADEASVLLESLLGACKRFGRMEQAES 464 Query: 1567 IAGRLLDLDPQDSSGYVQLSNIHASMGRWSDVLEIRRKMRAKQISKDPGGSMIEIDGIVR 1388 IA RLL+L+P ++SGYVQ+SN++AS GRW V+E+RRKMRA+++ K PG SMIE+DG+V Sbjct: 465 IANRLLELNPGETSGYVQMSNLYASNGRWDQVMEVRRKMRAERVKKKPGCSMIEVDGVVH 524 Query: 1387 
EFLAGEGI 1364 EFLAGEG+ Sbjct: 525 EFLAGEGL 532 >ref|XP_002867972.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] gi|297313808|gb|EFH44231.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] Length = 535 Score = 169 bits (429), Expect = 3e-39 Identities = 76/128 (59%), Positives = 104/128 (81%) Frame = -1 Query: 1747 YGIKPSIEHYGCMVDLLGRFGLLEEAKELVEKMPVKETPLIWESLLSSCRNHGNIELARK 1568 YGI+P+IEHYGCMVDLLGR G EEA+ELV ++P E ++ ESLL +C+ G +E A + Sbjct: 405 YGIEPTIEHYGCMVDLLGRMGKFEEAEELVNEVPADEASILLESLLGACKRFGKLEQAER 464 Query: 1567 IAGRLLDLDPQDSSGYVQLSNIHASMGRWSDVLEIRRKMRAKQISKDPGGSMIEIDGIVR 1388 IA RLL+ +P++SSGYVQ+SN++AS GRW + +E+R KMRA+++ K+PG SMIE+DG+V Sbjct: 465 IANRLLESNPRESSGYVQMSNLYASHGRWDEAMEVRGKMRAERVKKNPGCSMIEVDGVVH 524 Query: 1387 EFLAGEGI 1364 EFLAGEG+ Sbjct: 525 EFLAGEGL 532 >emb|CBI30728.3| unnamed protein product [Vitis vinifera] Length = 195 Score = 160 bits (406), Expect = 1e-36 Identities = 95/191 (49%), Positives = 119/191 (62%), Gaps = 1/191 (0%) Frame = +1 Query: 634 NSNLRTIEEECTLKTRNLETSNAFSKVXXXXXXXISLQCKRLEEMMLKTERQRESVYVDR 813 +S L TIEE+CT + NLE +A SK L+ ++E+M K+E QR+S+++ R Sbjct: 5 SSILGTIEEDCTFASLNLEEPDAPSKEEKR-----KLKEMDIKELMSKSENQRKSIHLSR 59 Query: 814 NCXXXXXXXXXXXXCFSPRTAAKIE-CKIRALEDXXXXXXXXXXXXXXXLREAAILDSYA 990 SPRT +K+E CKI+ALED L ++S+A Sbjct: 60 ELQSRTKQRSKIRV-HSPRTPSKVEICKIKALEDMKAKMKMKKKIEERILEGRTQIESFA 118 Query: 991 VAKSSFNPQKDFRDSMIEMIIEKGIETPEELEELLACYLTLNCDEYHDLIIEVFRQVYFE 1170 V KSS +PQKDFRDSMIEMI+EKGI PEELEELLACYLTLN DEYHDLII+VFRQV+F Sbjct: 119 VVKSSLDPQKDFRDSMIEMIMEKGISQPEELEELLACYLTLNSDEYHDLIIKVFRQVWFG 178 Query: 1171 LNQVYLASELQ 1203 LN+ Y ELQ Sbjct: 179 LNRAYFDPELQ 189 >gb|ESW31484.1| hypothetical protein PHAVU_002G241900g [Phaseolus vulgaris] Length = 399 Score = 158 bits (399), Expect = 8e-36 Identities = 136/413 (32%), Positives = 185/413 (44%), Gaps = 26/413 (6%) Frame = +1 Query: 19 PSLITRVFPSSWFSKFKQKGSSFEPPKSA---KAKQKVPPSCSLSHAPW--KEGRFYCGD 183 PS I+ V P 
SW SKFK + EP A AKQ PS S + GRFY GD Sbjct: 20 PSFISHVSPFSWLSKFKHMRINSEPKPGALKQNAKQNSTPSESSPYYACGNNRGRFYGGD 79 Query: 184 DDPYWRLSFGEEGMEQSKVWLNSVWDD---PDDNVPEVTIPNSGSSKP----EAVQMFNH 342 D+ +WRLSFGEEG E K DD P + S SS P A + Sbjct: 80 DEAFWRLSFGEEGNEHRKS------DDILKPLKYQMDAEHVISSSSFPGGLNNAKRQGRR 133 Query: 343 MVREMQEKKESLLENXXXXXXXXXXXXXXXSTCRQTIEGRRSRKLNRRTLE-EKQEKVAN 519 + ++K++ L + R+ E + R L + L+ EK E+ A Sbjct: 134 EATQKSKQKDTGLREETTLLNEAARSVKELESLRRRYERKAQRVLQEQLLKLEKAEEEAE 193 Query: 520 TA-----EKIVFEVQPEKII-----HTREDFSKSGVSESRYQHSFSFPNSNLRTIEEECT 669 A E V + + + I H DF SG+ R R + Sbjct: 194 FASSPFLENYVLQYESPRTICTPRKHLFADFKSSGLGNLR----------EARVCSPQLD 243 Query: 670 LKTRNLETSNAFSKVXXXXXXXISLQCKRLEEMML--KTERQRESVYVDRNCXXXXXXXX 843 + NL K+ EE+ L K+ +Q +S++V R Sbjct: 244 SEWHNL---------------------KQTEELKLKAKSNQQMQSLHVSRENQRRKPKHN 282 Query: 844 XXXXCFSPRTAAKIEC-KIRALEDXXXXXXXXXXXXXXXLREAAILDSYAVAKSSFNPQK 1020 +SPR +K+E KI+A+E+ + E LDS+AV K S +PQK Sbjct: 283 SKVKVYSPRIGSKVEVRKIKAIEE-KKKAKLKMKKEEEIVEETEGLDSFAVVKCSLDPQK 341 Query: 1021 DFRDSMIEMIIEKGIETPEELEELLACYLTLNCDEYHDLIIEVFRQVYFELNQ 1179 DFRDSMIEMI EK I PEE+++LLACYLTLN +EYHDLII+VF+QV+ ++Q Sbjct: 342 DFRDSMIEMITEKQISQPEEMQDLLACYLTLNSNEYHDLIIQVFKQVWLCMSQ 394 >ref|XP_004309635.1| PREDICTED: uncharacterized protein LOC101311721 [Fragaria vesca subsp. 
vesca] Length = 212 Score = 156 bits (394), Expect = 3e-35 Identities = 94/219 (42%), Positives = 135/219 (61%), Gaps = 3/219 (1%) Frame = +1 Query: 556 KIIHTRE-DFSKSGVSESRYQHSFSFPNSNLRTIEEECTLKTRNLETSNAFSKVXXXXXX 732 +II T+E D +K G++ ++ ++ +SNL+TIEEE + + + Sbjct: 5 RIILTKEKDSNKHGLTSGARKYRYA--SSNLKTIEEEVSEEKPTSDWQK----------- 51 Query: 733 XISLQCKRLEEMMLKTERQRESVYVDRNCXXXXXXXXXXXX-CFSPRTAAKIE-CKIRAL 906 L+ +++E+M K+E+ R+SVY+ R+ SPRTA+++E CKI+AL Sbjct: 52 ---LKEMKIQEVMAKSEKHRKSVYISRDLQRKSRPKRSNKVRVCSPRTASRVEICKIKAL 108 Query: 907 EDXXXXXXXXXXXXXXXLREAAILDSYAVAKSSFNPQKDFRDSMIEMIIEKGIETPEELE 1086 +D +R LDS+AV K SF+PQ+DFRDSM+EMI+EK + P++LE Sbjct: 109 QDMAKAKKAAKERKVQQVRTG--LDSFAVVKCSFDPQQDFRDSMVEMIVEKKLTRPDDLE 166 Query: 1087 ELLACYLTLNCDEYHDLIIEVFRQVYFELNQVYLASELQ 1203 ELLACYLTLN DEYHDLII+VFRQV+F+LNQ Y SELQ Sbjct: 167 ELLACYLTLNSDEYHDLIIKVFRQVWFDLNQTYFGSELQ 205 >ref|XP_006395137.1| hypothetical protein EUTSA_v10003805mg [Eutrema salsugineum] gi|557091776|gb|ESQ32423.1| hypothetical protein EUTSA_v10003805mg [Eutrema salsugineum] Length = 645 Score = 153 bits (386), Expect = 3e-34 Identities = 68/122 (55%), Positives = 93/122 (76%) Frame = -1 Query: 1744 GIKPSIEHYGCMVDLLGRFGLLEEAKELVEKMPVKETPLIWESLLSSCRNHGNIELARKI 1565 G++P IEHYGCMVDLLGR GLLEEA+E + MP+K +IW++LL +CR HGN+E+ +++ Sbjct: 405 GLEPRIEHYGCMVDLLGRSGLLEEAEEFILNMPIKPDDVIWKALLGACRMHGNVEMGKRV 464 Query: 1564 AGRLLDLDPQDSSGYVQLSNIHASMGRWSDVLEIRRKMRAKQISKDPGGSMIEIDGIVRE 1385 A L+D+ P DS YV LSN++AS G WS+V E+R +M+ I KDPG S I+IDG++ E Sbjct: 465 ANILMDMVPNDSGAYVALSNMYASQGNWSEVSEMRLRMKEMDIRKDPGCSWIDIDGVLHE 524 Query: 1384 FL 1379 FL Sbjct: 525 FL 526