BLASTX nr result
ID: Catharanthus22_contig00042384
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00042384 (724 letters)

Database: ./nr
37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

Sequences producing significant alignments:  Score (bits)  E Value

ref|XP_004235997.1| PREDICTED: pentatricopeptide repeat-containi...  154  4e-35
ref|XP_002282049.1| PREDICTED: pentatricopeptide repeat-containi...  146  8e-33
ref|XP_002304264.2| hypothetical protein POPTR_0003s07210g [Popu...  145  1e-32
ref|XP_006364594.1| PREDICTED: pentatricopeptide repeat-containi...  145  1e-32
ref|XP_004301147.1| PREDICTED: pentatricopeptide repeat-containi...  134  3e-29
ref|XP_006477459.1| PREDICTED: pentatricopeptide repeat-containi...  131  2e-28
gb|EXB44694.1| hypothetical protein L484_015951 [Morus notabilis]  131  3e-28
gb|EMJ11463.1| hypothetical protein PRUPE_ppa003212mg [Prunus pe...  129  7e-28
ref|XP_004155062.1| PREDICTED: pentatricopeptide repeat-containi...  127  5e-27
ref|NP_190337.1| pentatricopeptide repeat-containing protein [Ar...  123  7e-26
gb|EOY21868.1| Pentatricopeptide repeat superfamily protein isof...  122  2e-25
gb|EOY21867.1| Pentatricopeptide repeat superfamily protein isof...  122  2e-25
ref|XP_002877566.1| pentatricopeptide repeat-containing protein ...  121  2e-25
ref|XP_006440604.1| hypothetical protein CICLE_v10018999mg [Citr...  120  3e-25
ref|XP_003525465.1| PREDICTED: pentatricopeptide repeat-containi...  120  6e-25
ref|XP_006292869.1| hypothetical protein CARUB_v10019129mg [Caps...  119  8e-25
ref|XP_004508732.1| PREDICTED: pentatricopeptide repeat-containi...  117  4e-24
gb|ESW27301.1| hypothetical protein PHAVU_003G189800g [Phaseolus...  115  2e-23
ref|XP_006404369.1| hypothetical protein EUTSA_v10010230mg [Eutr...  112  1e-22
gb|EOY30986.1| Tetratricopeptide repeat (TPR)-like superfamily p...  94  6e-17

>ref|XP_004235997.1| PREDICTED: pentatricopeptide repeat-containing protein At3g47530-like [Solanum lycopersicum]
Length = 621

Score = 154 bits (388), Expect = 4e-35
Identities = 87/201 (43%), Positives = 113/201 (56%)
Frame = +2

Query: 119 STNSTTRSFWSTVAISLQHHHAPPSAPLYLRTASKSEAENTLISLIKSCARISHLCQIHC 298
S+ S R S A+ + +H + P + +E LISLIKS + HL QIH
Sbjct: 9 SSVSAFRWLSSHAAVRVDNHRQCLTGPTNHTSHLHTEKTEPLISLIKSTSSKPHLLQIHA 68

Query: 299 YLIRTSFLLNPTIFLCFLSRIALPPFQNIPYSYRTFLNYPNHNISLYNTMIRAFSLSDSS 478
+LIR S +P F FL IALPPF ++ Y+ + F + ++ YN MIRA+ +SDS
Sbjct: 69 HLIRKSLFQDPIFFSPFLFGIALPPFHDLGYASQVFSKFRKPDVFQYNIMIRAYGMSDSP 128

Query: 479 FLGFELYREMVHFGIXXXXXXXXXXXKCSIKMGSLLNGLQVHGMIFRDGHISDCFLSSAL 658
GF LY+EM+ G+ C IK+GSL GLQ+H I RDGH SD L + L
Sbjct: 129 GNGFMLYQEMLRSGVSPNSLTSSFVTNCCIKIGSLFGGLQIHARILRDGHQSDGRLLTTL 188

Query: 659 VDFYSVNRKYDEACKVFAEMS 721
+DFYS N KY EACKVF EMS
Sbjct: 189 MDFYSSNEKYTEACKVFDEMS 209

>ref|XP_002282049.1| PREDICTED: pentatricopeptide repeat-containing protein At3g47530 [Vitis vinifera]
Length = 643

Score = 146 bits (368), Expect = 8e-33
Identities = 80/167 (47%), Positives = 105/167 (62%)
Frame = +2

Query: 218 SKSEAENTLISLIKSCARISHLCQIHCYLIRTSFLLNPTIFLCFLSRIALPPFQNIPYSY 397
S+ E+EN LISLIKSC++ +HL QIH ++IRTS + N I L FLSR AL P +++ YS
Sbjct: 63 SRDESENQLISLIKSCSKKTHLLQIHAHIIRTSLIQNHFISLQFLSRAALSPSRDMGYSS 122

Query: 398 RTFLNYPNHNISLYNTMIRAFSLSDSSFLGFELYREMVHFGIXXXXXXXXXXXKCSIKMG 577
+ F + S YN MIRA+S+S S GF LYREM G+ K I++
Sbjct: 123 QVFSQIMKPSGSQYNVMIRAYSMSHSPEQGFYLYREMRRRGVPPNPLSSSFVMKSCIRIS 182

Query: 578 SLLNGLQVHGMIFRDGHISDCFLSSALVDFYSVNRKYDEACKVFAEM 718
SL+ GLQ+H I RDGH SD L + L+D YS K++EACKVF E+
Sbjct: 183 SLMGGLQIHARILRDGHQSDNLLLTTLMDLYSCCDKFEEACKVFDEI 229

>ref|XP_002304264.2| hypothetical protein POPTR_0003s07210g [Populus trichocarpa]
 gi|550342611|gb|EEE79243.2| hypothetical
protein POPTR_0003s07210g [Populus trichocarpa]
Length = 636

Score = 145 bits (367), Expect = 1e-32
Identities = 90/222 (40%), Positives = 123/222 (55%), Gaps = 1/222 (0%)
Frame = +2

Query: 56 MKTIFSLFHYHQYSIATARIPSTNSTTRSFWSTVAISLQHHHAPPSAPLY-LRTASKSEA 232
M F LF+ ++ S+ + T + ++A Q HH L ++ + ++
Sbjct: 1 MTPAFHLFNSNRSSLNLQHYLCLSHYTTTTPPSIAKQFQEHHRHQQNQTNPLLSSLERKS 60

Query: 233 ENTLISLIKSCARISHLCQIHCYLIRTSFLLNPTIFLCFLSRIALPPFQNIPYSYRTFLN 412
LISLIKSC + SHL QIH YLIR S L P I L FLSR+AL P ++I YS + F
Sbjct: 61 HQPLISLIKSCTQKSHLLQIHGYLIRNSLLHYPAISLPFLSRMALSPIRDISYSRQFFSQ 120

Query: 413 YPNHNISLYNTMIRAFSLSDSSFLGFELYREMVHFGIXXXXXXXXXXXKCSIKMGSLLNG 592
PN ++ LYNT+IRA+S+S S GF +Y+EM G+ +C I++ SL+
Sbjct: 121 IPNPSVFLYNTLIRAYSMSSSPTEGFFMYQEMRKKGLRADPVSLSFVIRCYIRICSLIGC 180

Query: 593 LQVHGMIFRDGHISDCFLSSALVDFYSVNRKYDEACKVFAEM 718
QVH I DGH SD L + L+D YS+ K EACKVF EM
Sbjct: 181 EQVHARILSDGHQSDSLLLTNLMDLYSLCDKGSEACKVFDEM 222

>ref|XP_006364594.1| PREDICTED: pentatricopeptide repeat-containing protein At3g47530-like [Solanum tuberosum]
Length = 621

Score = 145 bits (366), Expect = 1e-32
Identities = 79/166 (47%), Positives = 97/166 (58%)
Frame = +2

Query: 224 SEAENTLISLIKSCARISHLCQIHCYLIRTSFLLNPTIFLCFLSRIALPPFQNIPYSYRT 403
+E LISLIKS + HL QIH +LIR S +P F FL IAL P ++ Y+ R
Sbjct: 44 TEKTEPLISLIKSTSSKPHLLQIHAHLIRNSLFQHPIFFSPFLFGIALHPLHDLGYACRV 103

Query: 404 FLNYPNHNISLYNTMIRAFSLSDSSFLGFELYREMVHFGIXXXXXXXXXXXKCSIKMGSL 583
F + ++ YN MIRA+ +SDS GF LY+EM+ G+ C IK GSL
Sbjct: 104 FSKFSKPDVFQYNIMIRAYGMSDSPGNGFMLYQEMLRSGVSPNSLTSSFVTNCCIKSGSL 163

Query: 584 LNGLQVHGMIFRDGHISDCFLSSALVDFYSVNRKYDEACKVFAEMS 721
GLQ+H I RDGH SD L + L+DFYS N KY EACKVF EMS
Sbjct: 164 FGGLQIHARILRDGHPSDGRLLTTLMDFYSSNEKYTEACKVFDEMS 209

>ref|XP_004301147.1| PREDICTED: pentatricopeptide repeat-containing protein At3g47530-like [Fragaria vesca subsp.
vesca]
Length = 643

Score = 134 bits (337), Expect = 3e-29
Identities = 80/218 (36%), Positives = 119/218 (54%), Gaps = 5/218 (2%)
Frame = +2

Query: 80 HYHQYSIATARI----PSTNSTTRSFWSTVAISLQHHHAPPSAPLYLRTASKSEAENTLI 247
H ++I++ R+ PS +T ++ I H H + P+ + A + ++L+
Sbjct: 14 HLQHHNISSTRLTSTFPSLFTTNQTSLDHSQIQTPHDHQNQTKPIIISYAQTRK--DSLL 71

Query: 248 SLIKSCARISHLCQIHCYLIRTSFLLNPTIFLCFLSRIAL-PPFQNIPYSYRTFLNYPNH 424
SLIKSC SHL QIH ++++TS +L+ +I FLS ++L PP +N+ YS P
Sbjct: 72 SLIKSCTHKSHLLQIHAHILQTSLILDSSICFHFLSLLSLSPPLKNLTYSRHFLAQIPKP 131

Query: 425 NISLYNTMIRAFSLSDSSFLGFELYREMVHFGIXXXXXXXXXXXKCSIKMGSLLNGLQVH 604
N YNT+IRA+S SDS G LYR+ G+ +C +KM L G+QV
Sbjct: 132 NAIHYNTLIRAYSTSDSPEQGIHLYRDFRRRGLHCNSLSSFFVIQCCVKMQCLSVGIQVQ 191

Query: 605 GMIFRDGHISDCFLSSALVDFYSVNRKYDEACKVFAEM 718
I RDGH SD L +AL++ YS +Y +ACKVF E+
Sbjct: 192 TRIVRDGHHSDSRLLTALMNLYSTCGEYHDACKVFDEI 229

>ref|XP_006477459.1| PREDICTED: pentatricopeptide repeat-containing protein At3g47530-like [Citrus sinensis]
Length = 580

Score = 131 bits (330), Expect = 2e-28
Identities = 71/159 (44%), Positives = 94/159 (59%)
Frame = +2

Query: 242 LISLIKSCARISHLCQIHCYLIRTSFLLNPTIFLCFLSRIALPPFQNIPYSYRTFLNYPN 421
LISLIK C R HL QI ++I TS + +PT+ L LSR ALPPF+ PYS + + P
Sbjct: 8 LISLIKLCTRRPHLLQIQAHIIVTSLIQDPTVSLHILSRFALPPFRETPYSRQILDHIPR 67

Query: 422 HNISLYNTMIRAFSLSDSSFLGFELYREMVHFGIXXXXXXXXXXXKCSIKMGSLLNGLQV 601
N+S YNTM+RA+S+S S GF L+ +M I KC +K SL+ GLQ+
Sbjct: 68 PNVSHYNTMVRAYSMSSSPEEGFYLFEKMRQKRIPTNPFACSFAIKCCMKFCSLMGGLQI 127

Query: 602 HGMIFRDGHISDCFLSSALVDFYSVNRKYDEACKVFAEM 718
H + RDG+ D L + L+D YS K EACK+F E+
Sbjct: 128 HARVLRDGYQLDSQLMTTLMDLYSTFEKSFEACKLFDEI 166

>gb|EXB44694.1| hypothetical protein L484_015951 [Morus notabilis]
Length = 640

Score = 131 bits (329), Expect = 3e-28
Identities = 68/167 (40%), Positives = 102/167 (61%)
Frame = +2

Query: 218 SKSEAENTLISLIKSCARISHLCQIHCYLIRTSFLLNPTIFLCFLSRIALPPFQNIPYSY 397
+K E+ LIS+IKSC+ +HL QIH +L+RTS +PTI L FLS IAL ++I YS
Sbjct: 60 TKQIQEHPLISIIKSCSHNTHLRQIHAHLLRTSLAQDPTISLKFLSCIALSSLRDIGYSR 119

Query: 398 RTFLNYPNHNISLYNTMIRAFSLSDSSFLGFELYREMVHFGIXXXXXXXXXXXKCSIKMG 577
+ F + +N MIRA+S++D G +Y++M+ G+ KC +++
Sbjct: 120 KFFAQIKRPSFLHHNAMIRAYSVTDKPDEGLRMYQDMIRRGVWANSFSSSFAVKCCVRIS 179

Query: 578 SLLNGLQVHGMIFRDGHISDCFLSSALVDFYSVNRKYDEACKVFAEM 718
S + G+QVHG I RDG++SDC L + L++ YS ++ +A KVF EM
Sbjct: 180 SFVGGVQVHGRILRDGNLSDCRLLTTLMELYSGCERFGDALKVFDEM 226

>gb|EMJ11463.1| hypothetical protein PRUPE_ppa003212mg [Prunus persica]
Length = 592

Score = 129 bits (325), Expect = 7e-28
Identities = 68/148 (45%), Positives = 90/148 (60%)
Frame = +2

Query: 275 SHLCQIHCYLIRTSFLLNPTIFLCFLSRIALPPFQNIPYSYRTFLNYPNHNISLYNTMIR 454
SHL QIH +++RTS +L PTI L FLS + L P ++I YS R F YNTM+R
Sbjct: 31 SHLLQIHAHIVRTSLVLEPTICLQFLSLVGLSPLKSISYSRRFFDQIAKPTAFQYNTMVR 90

Query: 455 AFSLSDSSFLGFELYREMVHFGIXXXXXXXXXXXKCSIKMGSLLNGLQVHGMIFRDGHIS 634
A+S+SDS GF +YR+++ G+ K I++ SLL G+QVH I R GH S
Sbjct: 91 AYSISDSPEEGFSMYRDLLRRGLRADALASSFVIKSCIRVSSLLGGIQVHARILRGGHES 150

Query: 635 DCFLSSALVDFYSVNRKYDEACKVFAEM 718
D L + L+D YS+ K DEACK+F EM
Sbjct: 151 DSRLLTTLMDLYSICGKCDEACKLFDEM 178

>ref|XP_004155062.1| PREDICTED: pentatricopeptide repeat-containing protein At3g47530-like [Cucumis sativus]
Length = 602

Score = 127 bits (318), Expect = 5e-27
Identities = 77/192 (40%), Positives = 108/192 (56%)
Frame = +2

Query: 143 FWSTVAISLQHHHAPPSAPLYLRTASKSEAENTLISLIKSCARISHLCQIHCYLIRTSFL 322
F S +SL++HH S + R LISLIKSC S L QIH ++I TS +
Sbjct: 5 FRSPSILSLKYHHHSISFSHFER--------EPLISLIKSCTHKSQLLQIHAHIITTSSI 56

Query: 323 LNPTIFLCFLSRIALPPFQNIPYSYRTFLNYPNHNISLYNTMIRAFSLSDSSFLGFELYR 502
+P + L FL+R A PF+++ YS R F N +S YN M+RA+SLS S G +YR
Sbjct: 57 QDPIVSLRFLTRTASAPFRDLGYSRRLFDLLTNPFVSHYNAMLRAYSLSRSPLEGLYMYR 116

Query: 503 EMVHFGIXXXXXXXXXXXKCSIKMGSLLNGLQVHGMIFRDGHISDCFLSSALVDFYSVNR 682
+M G+ K IK+ SLL G+Q+H IF +GH +D L ++++D YS
Sbjct: 117 DMERQGVRADPLSSSFAVKSCIKLLSLLFGIQIHARIFINGHQADSLLLTSMMDLYSHCG 176

Query: 683 KYDEACKVFAEM 718
K +EACK+F E+
Sbjct: 177 KPEEACKLFDEV 188

>ref|NP_190337.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
 gi|75206890|sp|Q9SN85.1|PP267_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At3g47530
 gi|6522536|emb|CAB61979.1| putative protein [Arabidopsis thaliana]
 gi|62320272|dbj|BAD94558.1| hypothetical protein [Arabidopsis thaliana]
 gi|332644772|gb|AEE78293.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
Length = 591

Score = 123 bits (308), Expect = 7e-26
Identities = 75/173 (43%), Positives = 98/173 (56%), Gaps = 2/173 (1%)
Frame = +2

Query: 206 LRTASKSEAENTLISLIKSCARISHLCQIHCYLIRTSFLLNPTIFLCFLSRIALPPF-QN 382
L++ S S ++ L+SLI S HL QIH L+RTS + N +F FLSR+AL ++
Sbjct: 2 LKSISSSSGDDHLLSLIVSSTGKLHLRQIHALLLRTSLIRNSDVFHHFLSRLALSLIPRD 61

Query: 383 IPYSYRTFLNYPNHNISLYNTMIRAFSLSDSSFLGFELYREMV-HFGIXXXXXXXXXXXK 559
I YS R F N +S NTMIRAFSLS + GF L+R + + + K
Sbjct: 62 INYSCRVFSQRLNPTLSHCNTMIRAFSLSQTPCEGFRLFRSLRRNSSLPANPLSSSFALK 121

Query: 560 CSIKMGSLLNGLQVHGMIFRDGHISDCFLSSALVDFYSVNRKYDEACKVFAEM 718
C IK G LL GLQ+HG IF DG +SD L + L+D YS +ACKVF E+
Sbjct: 122 CCIKSGDLLGGLQIHGKIFSDGFLSDSLLMTTLMDLYSTCENSTDACKVFDEI 174

>gb|EOY21868.1| Pentatricopeptide repeat superfamily protein isoform 2 [Theobroma cacao]
Length = 625

Score = 122 bits (305), Expect = 2e-25
Identities = 74/163 (45%), Positives = 94/163 (57%)
Frame = +2

Query: 233 ENTLISLIKSCARISHLCQIHCYLIRTSFLLNPTIFLCFLSRIALPPFQNIPYSYRTFLN 412
+ LISLIKS + S L QIH +LIRTS L NPT L FLS + PF+++ YS F
Sbjct: 67 QQNLISLIKSGTQNS-LLQIHAHLIRTSLLQNPTFSLHFLSCLCFSPFRDLRYSRHFFSQ 125

Query: 413 YPNHNISLYNTMIRAFSLSDSSFLGFELYREMVHFGIXXXXXXXXXXXKCSIKMGSLLNG 592
+ S Y+T+IRA+S S+S F LY+EM G+ K +K SL+ G
Sbjct: 126 IDKPSASHYSTLIRAYSSSNSPKDAFFLYKEMTQKGLKPDPVSSSFVLKSCMKFSSLVCG 185

Query: 593 LQVHGMIFRDGHISDCFLSSALVDFYSVNRKYDEACKVFAEMS 721
LQ+HG I DG SD L + L+DFYS DEACKVF E+S
Sbjct: 186 LQIHGRILGDGFQSDSLLLTTLMDFYSSFASRDEACKVFDEIS 228

>gb|EOY21867.1| Pentatricopeptide repeat superfamily protein isoform 1 [Theobroma cacao]
Length = 640

Score = 122 bits (305), Expect = 2e-25
Identities = 74/163 (45%), Positives = 94/163 (57%)
Frame = +2

Query: 233 ENTLISLIKSCARISHLCQIHCYLIRTSFLLNPTIFLCFLSRIALPPFQNIPYSYRTFLN 412
+ LISLIKS + S L QIH +LIRTS L NPT L FLS + PF+++ YS F
Sbjct: 67 QQNLISLIKSGTQNS-LLQIHAHLIRTSLLQNPTFSLHFLSCLCFSPFRDLRYSRHFFSQ 125

Query: 413 YPNHNISLYNTMIRAFSLSDSSFLGFELYREMVHFGIXXXXXXXXXXXKCSIKMGSLLNG 592
+ S Y+T+IRA+S S+S F LY+EM G+ K +K SL+ G
Sbjct: 126 IDKPSASHYSTLIRAYSSSNSPKDAFFLYKEMTQKGLKPDPVSSSFVLKSCMKFSSLVCG 185

Query: 593 LQVHGMIFRDGHISDCFLSSALVDFYSVNRKYDEACKVFAEMS 721
LQ+HG I DG SD L + L+DFYS DEACKVF E+S
Sbjct: 186 LQIHGRILGDGFQSDSLLLTTLMDFYSSFASRDEACKVFDEIS 228

>ref|XP_002877566.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
 gi|297323404|gb|EFH53825.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 591

Score = 121 bits (304), Expect = 2e-25
Identities = 74/173 (42%), Positives = 97/173 (56%), Gaps = 2/173 (1%)
Frame = +2

Query: 206 LRTASKSEAENTLISLIKSCARISHLCQIHCYLIRTSFLLNPTIFLCFLSRIALPPF-QN 382
L++ S S +++ L+SLI S HL QIH L+RTS + N +F F SR+AL ++
Sbjct: 2 LKSISSSSSDDHLLSLIVSSTGKLHLRQIHAVLLRTSLIRNSDVFHHFFSRLALSLIPRD 61

Query: 383 IPYSYRTFLNYPNHNISLYNTMIRAFSLSDSSFLGFELYREMV-HFGIXXXXXXXXXXXK 559
I YS R F N +S NTMIRAFSLS + GF L+R + + K
Sbjct: 62 INYSCRVFSQRLNPTLSHCNTMIRAFSLSQTPCEGFRLFRALRRNISFPANPLSSSFALK 121

Query: 560 CSIKMGSLLNGLQVHGMIFRDGHISDCFLSSALVDFYSVNRKYDEACKVFAEM 718
C IK G LL GLQ+HG IF DG +SD L + L+D YS +ACKVF E+
Sbjct: 122 CCIKSGDLLGGLQIHGKIFSDGFLSDSLLMTTLMDLYSTCENSTDACKVFDEI 174

>ref|XP_006440604.1| hypothetical protein CICLE_v10018999mg [Citrus clementina]
 gi|557542866|gb|ESR53844.1| hypothetical protein CICLE_v10018999mg [Citrus clementina]
Length = 745

Score = 120 bits (302), Expect = 3e-25
Identities = 66/160 (41%), Positives = 91/160 (56%)
Frame = +2

Query: 239 TLISLIKSCARISHLCQIHCYLIRTSFLLNPTIFLCFLSRIALPPFQNIPYSYRTFLNYP 418
+LI + R HL QI ++I TS + +PT+ L LSR ALPPF+ PYS + + P
Sbjct: 172 SLIVIATDVHREPHLLQIQAHIIVTSLIQDPTVSLHILSRFALPPFRETPYSRQILDHIP 231

Query: 419 NHNISLYNTMIRAFSLSDSSFLGFELYREMVHFGIXXXXXXXXXXXKCSIKMGSLLNGLQ 598
N+S YNTM+RA+S+S S GF L+ +M I KC +K SL+ GLQ
Sbjct: 232 RPNVSHYNTMVRAYSMSSSPEEGFYLFEKMRQKRIPTNPFACSFAIKCCMKFCSLMGGLQ 291

Query: 599 VHGMIFRDGHISDCFLSSALVDFYSVNRKYDEACKVFAEM 718
+H + RDG+ D L + L+D YS K EACK+F E+
Sbjct: 292 IHARVLRDGYQLDSQLMTTLMDLYSTFEKSFEACKLFDEI 331

>ref|XP_003525465.1| PREDICTED: pentatricopeptide repeat-containing protein At3g47530-like [Glycine max]
Length = 579

Score = 120 bits (300), Expect = 6e-25
Identities = 74/164 (45%), Positives = 97/164 (59%), Gaps = 1/164 (0%)
Frame = +2

Query: 230 AENTLISLIKSCARISHLCQIHCYLIRTSFLLNPTIFLCFLSRIALP-PFQNIPYSYRTF 406
A T+IS IKS + + L QIH ++IRT+ + PT+ L FLSRIAL P Q+ YS R F
Sbjct: 2 ALETVISAIKSVSHKTRLLQIHAHIIRTTLIQYPTVSLQFLSRIALSGPLQDASYSQRFF 61

Query: 407 LNYPNHNISLYNTMIRAFSLSDSSFLGFELYREMVHFGIXXXXXXXXXXXKCSIKMGSLL 586
+ +S YNTMIRA S+SDS G LYR+M GI K I+ L
Sbjct: 62 GQLSHPLVSHYNTMIRACSMSDSPQKGLLLYRDMRRRGIAADPLSSSFAVKSCIRFLYLP 121

Query: 587 NGLQVHGMIFRDGHISDCFLSSALVDFYSVNRKYDEACKVFAEM 718
G+QVH IF+DGH D L +A++D YS+ ++ +ACKVF EM
Sbjct: 122 GGVQVHCNIFKDGHQWDTLLLTAVMDLYSLCQRGGDACKVFDEM 165

>ref|XP_006292869.1| hypothetical protein CARUB_v10019129mg [Capsella rubella]
 gi|482561576|gb|EOA25767.1| hypothetical protein CARUB_v10019129mg [Capsella rubella]
Length = 589

Score = 119 bits (299), Expect = 8e-25
Identities = 73/169 (43%), Positives = 97/169 (57%), Gaps = 2/169 (1%)
Frame = +2

Query: 218 SKSEAENTLISLIKSCARISHLCQIHCYLIRTSFLLNPTIFLCFLSRIALPPF-QNIPYS 394
S S +++ LISLI S HL QIH L+RTS + N +F FLSR++L ++I YS
Sbjct: 4 SISSSDDHLISLIVSSTGKLHLRQIHAVLLRTSLIRNSDVFHHFLSRLSLSLIPRDINYS 63

Query: 395 YRTFLNYPNHNISLYNTMIRAFSLSDSSFLGFELYREMV-HFGIXXXXXXXXXXXKCSIK 571
R F N +S NTMIRAFSLS + GF L+R + + + KC IK
Sbjct: 64 CRVFSQRSNPTLSHSNTMIRAFSLSKNPIEGFRLFRALRRNSSLPPNPLSSSFALKCCIK 123

Query: 572 MGSLLNGLQVHGMIFRDGHISDCFLSSALVDFYSVNRKYDEACKVFAEM 718
G LL GLQ+HG I+ DG +SD L + L+D YS ++ACKVF E+
Sbjct: 124 SGDLLGGLQIHGKIYSDGFLSDSLLLTTLMDLYSACENSNDACKVFDEI 172

>ref|XP_004508732.1| PREDICTED: pentatricopeptide repeat-containing protein At3g47530-like [Cicer arietinum]
Length = 633

Score = 117 bits (293), Expect = 4e-24
Identities = 79/230 (34%), Positives = 115/230 (50%), Gaps = 1/230 (0%)
Frame = +2

Query: 32 SQKSRTTAMKTIFSLFHYHQYSIATARIPSTNSTTRSFWSTVAISLQHHHAPPSAPLYLR 211
S++++T+ F+L H H Y+ T+ HH
Sbjct: 26 SRRNQTSFAAATFTLIHLHHYN------------------TIPQPYPHH----------- 56

Query: 212 TASKSEAENTLISLIKSCARISHLCQIHCYLIRTSFLLNPTIFLCFLSRIALP-PFQNIP 388
+ ++IS IKS + +HL QIH +++ T+ + +PTI L FLSR+AL P Q+
Sbjct: 57 -------KYSVISAIKSSSHKTHLLQIHAHILTTTLIQHPTISLHFLSRLALSGPLQDPT 109

Query: 389 YSYRTFLNYPNHNISLYNTMIRAFSLSDSSFLGFELYREMVHFGIXXXXXXXXXXXKCSI 568
YS+R F N + YNTMIRA+SLSDS LYR+M GI K I
Sbjct: 110 YSHRFFDQISNPFVFHYNTMIRAYSLSDSPQKALFLYRDMRRKGIASDPLSSSFAVKSCI 169

Query: 569 KMGSLLNGLQVHGMIFRDGHISDCFLSSALVDFYSVNRKYDEACKVFAEM 718
+ L GLQVH + ++GH SD L ++L+D YS ++ D+A KVF E+
Sbjct: 170 RFLYLFGGLQVHCNVLKEGHQSDTLLLTSLMDLYSQCQRCDDASKVFDEI 219

>gb|ESW27301.1| hypothetical protein PHAVU_003G189800g [Phaseolus vulgaris]
Length = 579

Score = 115 bits (287), Expect = 2e-23
Identities = 68/160 (42%), Positives = 96/160 (60%), Gaps = 1/160 (0%)
Frame = +2

Query: 242 LISLIKSCARISHLCQIHCYLIRTSFLLNPTIFLCFLSRIALP-PFQNIPYSYRTFLNYP 418
+IS IKS ++ + L QIH ++IRT+ + P + + FLSRIAL P Q+ YS+R F ++
Sbjct: 6 VISAIKSVSQKTQLLQIHAHIIRTNLIQYPPVSIQFLSRIALSGPLQDANYSHRFFEHFT 65

Query: 419 NHNISLYNTMIRAFSLSDSSFLGFELYREMVHFGIXXXXXXXXXXXKCSIKMGSLLNGLQ 598
+ +S YNTMIRA S+SDS G LYR+M GI K I++ L G+Q
Sbjct: 66 HPLVSHYNTMIRACSMSDSPRKGLLLYRDMRRRGIAADPVSASFAVKSCIRLLYFLGGVQ 125

Query: 599 VHGMIFRDGHISDCFLSSALVDFYSVNRKYDEACKVFAEM 718
VH I +DGH D L + ++D YS ++ +ACKVF EM
Sbjct: 126 VHCNILKDGHQWDTLLLTVVMDLYSQCQRGGDACKVFDEM 165

>ref|XP_006404369.1| hypothetical protein EUTSA_v10010230mg [Eutrema salsugineum]
 gi|557105488|gb|ESQ45822.1| hypothetical protein EUTSA_v10010230mg [Eutrema salsugineum]
Length = 590
Score = 112 bits (280), Expect = 1e-22
Identities = 71/170 (41%), Positives = 91/170 (53%), Gaps = 3/170 (1%)
Frame = +2

Query: 218 SKSEAENTLISLIKSCARISHLCQIHCYLIRTSFLLNPTIFLCFLSRIALPPF-QNIPYS 394
S + + LISLI S HL QIH L+RTS + N +F FLSR+AL ++I YS
Sbjct: 4 SMRSSNDHLISLIVSSTAKLHLRQIHAILLRTSLIRNSDVFHHFLSRLALSLVPRDIDYS 63

Query: 395 YRTFLNYPNHNISLYNTMIRAFSLSDSSFLGFELYREMVH--FGIXXXXXXXXXXXKCSI 568
R F N +S NTMIRAFS+S++ GF L+R + KC I
Sbjct: 64 RRVFSRRSNPTVSHCNTMIRAFSVSETPVEGFRLFRALRRRRSSRPANPLSSSFALKCCI 123

Query: 569 KMGSLLNGLQVHGMIFRDGHISDCFLSSALVDFYSVNRKYDEACKVFAEM 718
K G L GLQ+HG I DG +SD L + L+D YS + ACKVF E+
Sbjct: 124 KSGDFLGGLQIHGKIISDGFLSDSLLLTTLMDLYSTCENSNYACKVFDEI 173

>gb|EOY30986.1| Tetratricopeptide repeat (TPR)-like superfamily protein, putative [Theobroma cacao]
Length = 847

Score = 93.6 bits (231), Expect = 6e-17
Identities = 73/246 (29%), Positives = 110/246 (44%), Gaps = 11/246 (4%)
Frame = +2

Query: 5 TAIQFFFHRSQKSRTTAMKTIFSLFHYH---QYSIATARIPSTNSTTRSFWSTVAISLQH 175
TAI+ F+ R T K + H + I + IP S + ++S+
Sbjct: 11 TAIRHFYCCFPFWRVTKKKNLSMNDQRHFKKKKKIISTLIPLKTSKREMALPSTSVSISP 70

Query: 176 ---HHAPPSAPLYLRTASKSEAENTLISLIKSCARISHLCQIHCYLIRTS-----FLLNP 331
H P S P Y K + +SL+ C I L Q+HC++I+T F L+
Sbjct: 71 FPLHLLPSSDPPY-----KLLQNHPSLSLLSKCRTIQTLKQVHCHIIKTGLHHTQFALSK 125

Query: 332 TIFLCFLSRIALPPFQNIPYSYRTFLNYPNHNISLYNTMIRAFSLSDSSFLGFELYREMV 511
I C A+ PF ++PY+ F + N ++NTMIR FSLS S L E Y +M+
Sbjct: 126 LIEFC-----AVSPFGDLPYALLLFESIDEPNQVIWNTMIRGFSLSSSPGLTLEFYVKMI 180

Query: 512 HFGIXXXXXXXXXXXKCSIKMGSLLNGLQVHGMIFRDGHISDCFLSSALVDFYSVNRKYD 691
GI K K S G Q+HG + + G SD F+ ++L++ Y+ N ++
Sbjct: 181 WSGIVPNSYTFPFVLKSCAKTASTQEGKQIHGQVLKLGLESDAFVHTSLINMYAQNGEFG 240

Query: 692 EACKVF 709
A VF
Sbjct: 241 NARLVF 246
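
Each hit in this report carries a "Score = ... Expect = ... Identities = ..." summary. As a rough sketch of how such a summary might be read programmatically, the Python below pulls out the bit score, raw score, E-value and identity counts with a regular expression. The pattern and the helper name `parse_summary` are illustrative assumptions based only on the layout seen in this report. They are not part of BLAST or of any BLAST parsing library.

```python
import re

# Assumed layout, taken from the hit summaries in this report:
#   Score = 154 bits (388), Expect = 4e-35 Identities = 87/201 (43%), ...
# \s+ between the fields accepts either a space or a line break.
SUMMARY = re.compile(
    r"Score = \s*([\d.]+) bits \((\d+)\), Expect = (\S+)\s+"
    r"Identities = (\d+)/(\d+) \((\d+)%\)"
)

def parse_summary(text):
    """Hypothetical helper: return (bits, raw, e_value, identical, aligned, percent) or None."""
    match = SUMMARY.search(text)
    if match is None:
        return None
    bits, raw, e_value, identical, aligned, percent = match.groups()
    return (float(bits), int(raw), float(e_value),
            int(identical), int(aligned), int(percent))

# Summary of the first hit (ref|XP_004235997.1|), copied from the report.
example = ("Score = 154 bits (388), Expect = 4e-35 "
           "Identities = 87/201 (43%), Positives = 113/201 (56%)")
hit = parse_summary(example)
```

In the summaries listed here, the printed identity percentage agrees with integer (floor) division of the two counts, for example 87 * 100 // 201 gives 43, so that relation can serve as a quick sanity check on a parsed record.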