BLASTX nr result

ID: Catharanthus22_contig00042384 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00042384
         (724 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004235997.1| PREDICTED: pentatricopeptide repeat-containi...   154   4e-35
ref|XP_002282049.1| PREDICTED: pentatricopeptide repeat-containi...   146   8e-33
ref|XP_002304264.2| hypothetical protein POPTR_0003s07210g [Popu...   145   1e-32
ref|XP_006364594.1| PREDICTED: pentatricopeptide repeat-containi...   145   1e-32
ref|XP_004301147.1| PREDICTED: pentatricopeptide repeat-containi...   134   3e-29
ref|XP_006477459.1| PREDICTED: pentatricopeptide repeat-containi...   131   2e-28
gb|EXB44694.1| hypothetical protein L484_015951 [Morus notabilis]     131   3e-28
gb|EMJ11463.1| hypothetical protein PRUPE_ppa003212mg [Prunus pe...   129   7e-28
ref|XP_004155062.1| PREDICTED: pentatricopeptide repeat-containi...   127   5e-27
ref|NP_190337.1| pentatricopeptide repeat-containing protein [Ar...   123   7e-26
gb|EOY21868.1| Pentatricopeptide repeat superfamily protein isof...   122   2e-25
gb|EOY21867.1| Pentatricopeptide repeat superfamily protein isof...   122   2e-25
ref|XP_002877566.1| pentatricopeptide repeat-containing protein ...   121   2e-25
ref|XP_006440604.1| hypothetical protein CICLE_v10018999mg [Citr...   120   3e-25
ref|XP_003525465.1| PREDICTED: pentatricopeptide repeat-containi...   120   6e-25
ref|XP_006292869.1| hypothetical protein CARUB_v10019129mg [Caps...   119   8e-25
ref|XP_004508732.1| PREDICTED: pentatricopeptide repeat-containi...   117   4e-24
gb|ESW27301.1| hypothetical protein PHAVU_003G189800g [Phaseolus...   115   2e-23
ref|XP_006404369.1| hypothetical protein EUTSA_v10010230mg [Eutr...   112   1e-22
gb|EOY30986.1| Tetratricopeptide repeat (TPR)-like superfamily p...    94   6e-17

>ref|XP_004235997.1| PREDICTED: pentatricopeptide repeat-containing protein
           At3g47530-like [Solanum lycopersicum]
          Length = 621

 Score =  154 bits (388), Expect = 4e-35
 Identities = 87/201 (43%), Positives = 113/201 (56%)
 Frame = +2

Query: 119 STNSTTRSFWSTVAISLQHHHAPPSAPLYLRTASKSEAENTLISLIKSCARISHLCQIHC 298
           S+ S  R   S  A+ + +H    + P    +   +E    LISLIKS +   HL QIH 
Sbjct: 9   SSVSAFRWLSSHAAVRVDNHRQCLTGPTNHTSHLHTEKTEPLISLIKSTSSKPHLLQIHA 68

Query: 299 YLIRTSFLLNPTIFLCFLSRIALPPFQNIPYSYRTFLNYPNHNISLYNTMIRAFSLSDSS 478
           +LIR S   +P  F  FL  IALPPF ++ Y+ + F  +   ++  YN MIRA+ +SDS 
Sbjct: 69  HLIRKSLFQDPIFFSPFLFGIALPPFHDLGYASQVFSKFRKPDVFQYNIMIRAYGMSDSP 128

Query: 479 FLGFELYREMVHFGIXXXXXXXXXXXKCSIKMGSLLNGLQVHGMIFRDGHISDCFLSSAL 658
             GF LY+EM+  G+            C IK+GSL  GLQ+H  I RDGH SD  L + L
Sbjct: 129 GNGFMLYQEMLRSGVSPNSLTSSFVTNCCIKIGSLFGGLQIHARILRDGHQSDGRLLTTL 188

Query: 659 VDFYSVNRKYDEACKVFAEMS 721
           +DFYS N KY EACKVF EMS
Sbjct: 189 MDFYSSNEKYTEACKVFDEMS 209


>ref|XP_002282049.1| PREDICTED: pentatricopeptide repeat-containing protein At3g47530
           [Vitis vinifera]
          Length = 643

 Score =  146 bits (368), Expect = 8e-33
 Identities = 80/167 (47%), Positives = 105/167 (62%)
 Frame = +2

Query: 218 SKSEAENTLISLIKSCARISHLCQIHCYLIRTSFLLNPTIFLCFLSRIALPPFQNIPYSY 397
           S+ E+EN LISLIKSC++ +HL QIH ++IRTS + N  I L FLSR AL P +++ YS 
Sbjct: 63  SRDESENQLISLIKSCSKKTHLLQIHAHIIRTSLIQNHFISLQFLSRAALSPSRDMGYSS 122

Query: 398 RTFLNYPNHNISLYNTMIRAFSLSDSSFLGFELYREMVHFGIXXXXXXXXXXXKCSIKMG 577
           + F      + S YN MIRA+S+S S   GF LYREM   G+           K  I++ 
Sbjct: 123 QVFSQIMKPSGSQYNVMIRAYSMSHSPEQGFYLYREMRRRGVPPNPLSSSFVMKSCIRIS 182

Query: 578 SLLNGLQVHGMIFRDGHISDCFLSSALVDFYSVNRKYDEACKVFAEM 718
           SL+ GLQ+H  I RDGH SD  L + L+D YS   K++EACKVF E+
Sbjct: 183 SLMGGLQIHARILRDGHQSDNLLLTTLMDLYSCCDKFEEACKVFDEI 229


>ref|XP_002304264.2| hypothetical protein POPTR_0003s07210g [Populus trichocarpa]
           gi|550342611|gb|EEE79243.2| hypothetical protein
           POPTR_0003s07210g [Populus trichocarpa]
          Length = 636

 Score =  145 bits (367), Expect = 1e-32
 Identities = 90/222 (40%), Positives = 123/222 (55%), Gaps = 1/222 (0%)
 Frame = +2

Query: 56  MKTIFSLFHYHQYSIATARIPSTNSTTRSFWSTVAISLQHHHAPPSAPLY-LRTASKSEA 232
           M   F LF+ ++ S+        +  T +   ++A   Q HH         L ++ + ++
Sbjct: 1   MTPAFHLFNSNRSSLNLQHYLCLSHYTTTTPPSIAKQFQEHHRHQQNQTNPLLSSLERKS 60

Query: 233 ENTLISLIKSCARISHLCQIHCYLIRTSFLLNPTIFLCFLSRIALPPFQNIPYSYRTFLN 412
              LISLIKSC + SHL QIH YLIR S L  P I L FLSR+AL P ++I YS + F  
Sbjct: 61  HQPLISLIKSCTQKSHLLQIHGYLIRNSLLHYPAISLPFLSRMALSPIRDISYSRQFFSQ 120

Query: 413 YPNHNISLYNTMIRAFSLSDSSFLGFELYREMVHFGIXXXXXXXXXXXKCSIKMGSLLNG 592
            PN ++ LYNT+IRA+S+S S   GF +Y+EM   G+           +C I++ SL+  
Sbjct: 121 IPNPSVFLYNTLIRAYSMSSSPTEGFFMYQEMRKKGLRADPVSLSFVIRCYIRICSLIGC 180

Query: 593 LQVHGMIFRDGHISDCFLSSALVDFYSVNRKYDEACKVFAEM 718
            QVH  I  DGH SD  L + L+D YS+  K  EACKVF EM
Sbjct: 181 EQVHARILSDGHQSDSLLLTNLMDLYSLCDKGSEACKVFDEM 222


>ref|XP_006364594.1| PREDICTED: pentatricopeptide repeat-containing protein
           At3g47530-like [Solanum tuberosum]
          Length = 621

 Score =  145 bits (366), Expect = 1e-32
 Identities = 79/166 (47%), Positives = 97/166 (58%)
 Frame = +2

Query: 224 SEAENTLISLIKSCARISHLCQIHCYLIRTSFLLNPTIFLCFLSRIALPPFQNIPYSYRT 403
           +E    LISLIKS +   HL QIH +LIR S   +P  F  FL  IAL P  ++ Y+ R 
Sbjct: 44  TEKTEPLISLIKSTSSKPHLLQIHAHLIRNSLFQHPIFFSPFLFGIALHPLHDLGYACRV 103

Query: 404 FLNYPNHNISLYNTMIRAFSLSDSSFLGFELYREMVHFGIXXXXXXXXXXXKCSIKMGSL 583
           F  +   ++  YN MIRA+ +SDS   GF LY+EM+  G+            C IK GSL
Sbjct: 104 FSKFSKPDVFQYNIMIRAYGMSDSPGNGFMLYQEMLRSGVSPNSLTSSFVTNCCIKSGSL 163

Query: 584 LNGLQVHGMIFRDGHISDCFLSSALVDFYSVNRKYDEACKVFAEMS 721
             GLQ+H  I RDGH SD  L + L+DFYS N KY EACKVF EMS
Sbjct: 164 FGGLQIHARILRDGHPSDGRLLTTLMDFYSSNEKYTEACKVFDEMS 209


>ref|XP_004301147.1| PREDICTED: pentatricopeptide repeat-containing protein
           At3g47530-like [Fragaria vesca subsp. vesca]
          Length = 643

 Score =  134 bits (337), Expect = 3e-29
 Identities = 80/218 (36%), Positives = 119/218 (54%), Gaps = 5/218 (2%)
 Frame = +2

Query: 80  HYHQYSIATARI----PSTNSTTRSFWSTVAISLQHHHAPPSAPLYLRTASKSEAENTLI 247
           H   ++I++ R+    PS  +T ++      I   H H   + P+ +  A   +  ++L+
Sbjct: 14  HLQHHNISSTRLTSTFPSLFTTNQTSLDHSQIQTPHDHQNQTKPIIISYAQTRK--DSLL 71

Query: 248 SLIKSCARISHLCQIHCYLIRTSFLLNPTIFLCFLSRIAL-PPFQNIPYSYRTFLNYPNH 424
           SLIKSC   SHL QIH ++++TS +L+ +I   FLS ++L PP +N+ YS       P  
Sbjct: 72  SLIKSCTHKSHLLQIHAHILQTSLILDSSICFHFLSLLSLSPPLKNLTYSRHFLAQIPKP 131

Query: 425 NISLYNTMIRAFSLSDSSFLGFELYREMVHFGIXXXXXXXXXXXKCSIKMGSLLNGLQVH 604
           N   YNT+IRA+S SDS   G  LYR+    G+           +C +KM  L  G+QV 
Sbjct: 132 NAIHYNTLIRAYSTSDSPEQGIHLYRDFRRRGLHCNSLSSFFVIQCCVKMQCLSVGIQVQ 191

Query: 605 GMIFRDGHISDCFLSSALVDFYSVNRKYDEACKVFAEM 718
             I RDGH SD  L +AL++ YS   +Y +ACKVF E+
Sbjct: 192 TRIVRDGHHSDSRLLTALMNLYSTCGEYHDACKVFDEI 229


>ref|XP_006477459.1| PREDICTED: pentatricopeptide repeat-containing protein
           At3g47530-like [Citrus sinensis]
          Length = 580

 Score =  131 bits (330), Expect = 2e-28
 Identities = 71/159 (44%), Positives = 94/159 (59%)
 Frame = +2

Query: 242 LISLIKSCARISHLCQIHCYLIRTSFLLNPTIFLCFLSRIALPPFQNIPYSYRTFLNYPN 421
           LISLIK C R  HL QI  ++I TS + +PT+ L  LSR ALPPF+  PYS +   + P 
Sbjct: 8   LISLIKLCTRRPHLLQIQAHIIVTSLIQDPTVSLHILSRFALPPFRETPYSRQILDHIPR 67

Query: 422 HNISLYNTMIRAFSLSDSSFLGFELYREMVHFGIXXXXXXXXXXXKCSIKMGSLLNGLQV 601
            N+S YNTM+RA+S+S S   GF L+ +M    I           KC +K  SL+ GLQ+
Sbjct: 68  PNVSHYNTMVRAYSMSSSPEEGFYLFEKMRQKRIPTNPFACSFAIKCCMKFCSLMGGLQI 127

Query: 602 HGMIFRDGHISDCFLSSALVDFYSVNRKYDEACKVFAEM 718
           H  + RDG+  D  L + L+D YS   K  EACK+F E+
Sbjct: 128 HARVLRDGYQLDSQLMTTLMDLYSTFEKSFEACKLFDEI 166


>gb|EXB44694.1| hypothetical protein L484_015951 [Morus notabilis]
          Length = 640

 Score =  131 bits (329), Expect = 3e-28
 Identities = 68/167 (40%), Positives = 102/167 (61%)
 Frame = +2

Query: 218 SKSEAENTLISLIKSCARISHLCQIHCYLIRTSFLLNPTIFLCFLSRIALPPFQNIPYSY 397
           +K   E+ LIS+IKSC+  +HL QIH +L+RTS   +PTI L FLS IAL   ++I YS 
Sbjct: 60  TKQIQEHPLISIIKSCSHNTHLRQIHAHLLRTSLAQDPTISLKFLSCIALSSLRDIGYSR 119

Query: 398 RTFLNYPNHNISLYNTMIRAFSLSDSSFLGFELYREMVHFGIXXXXXXXXXXXKCSIKMG 577
           + F      +   +N MIRA+S++D    G  +Y++M+  G+           KC +++ 
Sbjct: 120 KFFAQIKRPSFLHHNAMIRAYSVTDKPDEGLRMYQDMIRRGVWANSFSSSFAVKCCVRIS 179

Query: 578 SLLNGLQVHGMIFRDGHISDCFLSSALVDFYSVNRKYDEACKVFAEM 718
           S + G+QVHG I RDG++SDC L + L++ YS   ++ +A KVF EM
Sbjct: 180 SFVGGVQVHGRILRDGNLSDCRLLTTLMELYSGCERFGDALKVFDEM 226


>gb|EMJ11463.1| hypothetical protein PRUPE_ppa003212mg [Prunus persica]
          Length = 592

 Score =  129 bits (325), Expect = 7e-28
 Identities = 68/148 (45%), Positives = 90/148 (60%)
 Frame = +2

Query: 275 SHLCQIHCYLIRTSFLLNPTIFLCFLSRIALPPFQNIPYSYRTFLNYPNHNISLYNTMIR 454
           SHL QIH +++RTS +L PTI L FLS + L P ++I YS R F          YNTM+R
Sbjct: 31  SHLLQIHAHIVRTSLVLEPTICLQFLSLVGLSPLKSISYSRRFFDQIAKPTAFQYNTMVR 90

Query: 455 AFSLSDSSFLGFELYREMVHFGIXXXXXXXXXXXKCSIKMGSLLNGLQVHGMIFRDGHIS 634
           A+S+SDS   GF +YR+++  G+           K  I++ SLL G+QVH  I R GH S
Sbjct: 91  AYSISDSPEEGFSMYRDLLRRGLRADALASSFVIKSCIRVSSLLGGIQVHARILRGGHES 150

Query: 635 DCFLSSALVDFYSVNRKYDEACKVFAEM 718
           D  L + L+D YS+  K DEACK+F EM
Sbjct: 151 DSRLLTTLMDLYSICGKCDEACKLFDEM 178


>ref|XP_004155062.1| PREDICTED: pentatricopeptide repeat-containing protein
           At3g47530-like [Cucumis sativus]
          Length = 602

 Score =  127 bits (318), Expect = 5e-27
 Identities = 77/192 (40%), Positives = 108/192 (56%)
 Frame = +2

Query: 143 FWSTVAISLQHHHAPPSAPLYLRTASKSEAENTLISLIKSCARISHLCQIHCYLIRTSFL 322
           F S   +SL++HH   S   + R          LISLIKSC   S L QIH ++I TS +
Sbjct: 5   FRSPSILSLKYHHHSISFSHFER--------EPLISLIKSCTHKSQLLQIHAHIITTSSI 56

Query: 323 LNPTIFLCFLSRIALPPFQNIPYSYRTFLNYPNHNISLYNTMIRAFSLSDSSFLGFELYR 502
            +P + L FL+R A  PF+++ YS R F    N  +S YN M+RA+SLS S   G  +YR
Sbjct: 57  QDPIVSLRFLTRTASAPFRDLGYSRRLFDLLTNPFVSHYNAMLRAYSLSRSPLEGLYMYR 116

Query: 503 EMVHFGIXXXXXXXXXXXKCSIKMGSLLNGLQVHGMIFRDGHISDCFLSSALVDFYSVNR 682
           +M   G+           K  IK+ SLL G+Q+H  IF +GH +D  L ++++D YS   
Sbjct: 117 DMERQGVRADPLSSSFAVKSCIKLLSLLFGIQIHARIFINGHQADSLLLTSMMDLYSHCG 176

Query: 683 KYDEACKVFAEM 718
           K +EACK+F E+
Sbjct: 177 KPEEACKLFDEV 188


>ref|NP_190337.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
           gi|75206890|sp|Q9SN85.1|PP267_ARATH RecName:
           Full=Pentatricopeptide repeat-containing protein
           At3g47530 gi|6522536|emb|CAB61979.1| putative protein
           [Arabidopsis thaliana] gi|62320272|dbj|BAD94558.1|
           hypothetical protein [Arabidopsis thaliana]
           gi|332644772|gb|AEE78293.1| pentatricopeptide
           repeat-containing protein [Arabidopsis thaliana]
          Length = 591

 Score =  123 bits (308), Expect = 7e-26
 Identities = 75/173 (43%), Positives = 98/173 (56%), Gaps = 2/173 (1%)
 Frame = +2

Query: 206 LRTASKSEAENTLISLIKSCARISHLCQIHCYLIRTSFLLNPTIFLCFLSRIALPPF-QN 382
           L++ S S  ++ L+SLI S     HL QIH  L+RTS + N  +F  FLSR+AL    ++
Sbjct: 2   LKSISSSSGDDHLLSLIVSSTGKLHLRQIHALLLRTSLIRNSDVFHHFLSRLALSLIPRD 61

Query: 383 IPYSYRTFLNYPNHNISLYNTMIRAFSLSDSSFLGFELYREMV-HFGIXXXXXXXXXXXK 559
           I YS R F    N  +S  NTMIRAFSLS +   GF L+R +  +  +           K
Sbjct: 62  INYSCRVFSQRLNPTLSHCNTMIRAFSLSQTPCEGFRLFRSLRRNSSLPANPLSSSFALK 121

Query: 560 CSIKMGSLLNGLQVHGMIFRDGHISDCFLSSALVDFYSVNRKYDEACKVFAEM 718
           C IK G LL GLQ+HG IF DG +SD  L + L+D YS      +ACKVF E+
Sbjct: 122 CCIKSGDLLGGLQIHGKIFSDGFLSDSLLMTTLMDLYSTCENSTDACKVFDEI 174


>gb|EOY21868.1| Pentatricopeptide repeat superfamily protein isoform 2 [Theobroma
           cacao]
          Length = 625

 Score =  122 bits (305), Expect = 2e-25
 Identities = 74/163 (45%), Positives = 94/163 (57%)
 Frame = +2

Query: 233 ENTLISLIKSCARISHLCQIHCYLIRTSFLLNPTIFLCFLSRIALPPFQNIPYSYRTFLN 412
           +  LISLIKS  + S L QIH +LIRTS L NPT  L FLS +   PF+++ YS   F  
Sbjct: 67  QQNLISLIKSGTQNS-LLQIHAHLIRTSLLQNPTFSLHFLSCLCFSPFRDLRYSRHFFSQ 125

Query: 413 YPNHNISLYNTMIRAFSLSDSSFLGFELYREMVHFGIXXXXXXXXXXXKCSIKMGSLLNG 592
               + S Y+T+IRA+S S+S    F LY+EM   G+           K  +K  SL+ G
Sbjct: 126 IDKPSASHYSTLIRAYSSSNSPKDAFFLYKEMTQKGLKPDPVSSSFVLKSCMKFSSLVCG 185

Query: 593 LQVHGMIFRDGHISDCFLSSALVDFYSVNRKYDEACKVFAEMS 721
           LQ+HG I  DG  SD  L + L+DFYS     DEACKVF E+S
Sbjct: 186 LQIHGRILGDGFQSDSLLLTTLMDFYSSFASRDEACKVFDEIS 228


>gb|EOY21867.1| Pentatricopeptide repeat superfamily protein isoform 1 [Theobroma
           cacao]
          Length = 640

 Score =  122 bits (305), Expect = 2e-25
 Identities = 74/163 (45%), Positives = 94/163 (57%)
 Frame = +2

Query: 233 ENTLISLIKSCARISHLCQIHCYLIRTSFLLNPTIFLCFLSRIALPPFQNIPYSYRTFLN 412
           +  LISLIKS  + S L QIH +LIRTS L NPT  L FLS +   PF+++ YS   F  
Sbjct: 67  QQNLISLIKSGTQNS-LLQIHAHLIRTSLLQNPTFSLHFLSCLCFSPFRDLRYSRHFFSQ 125

Query: 413 YPNHNISLYNTMIRAFSLSDSSFLGFELYREMVHFGIXXXXXXXXXXXKCSIKMGSLLNG 592
               + S Y+T+IRA+S S+S    F LY+EM   G+           K  +K  SL+ G
Sbjct: 126 IDKPSASHYSTLIRAYSSSNSPKDAFFLYKEMTQKGLKPDPVSSSFVLKSCMKFSSLVCG 185

Query: 593 LQVHGMIFRDGHISDCFLSSALVDFYSVNRKYDEACKVFAEMS 721
           LQ+HG I  DG  SD  L + L+DFYS     DEACKVF E+S
Sbjct: 186 LQIHGRILGDGFQSDSLLLTTLMDFYSSFASRDEACKVFDEIS 228


>ref|XP_002877566.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
           subsp. lyrata] gi|297323404|gb|EFH53825.1|
           pentatricopeptide repeat-containing protein [Arabidopsis
           lyrata subsp. lyrata]
          Length = 591

 Score =  121 bits (304), Expect = 2e-25
 Identities = 74/173 (42%), Positives = 97/173 (56%), Gaps = 2/173 (1%)
 Frame = +2

Query: 206 LRTASKSEAENTLISLIKSCARISHLCQIHCYLIRTSFLLNPTIFLCFLSRIALPPF-QN 382
           L++ S S +++ L+SLI S     HL QIH  L+RTS + N  +F  F SR+AL    ++
Sbjct: 2   LKSISSSSSDDHLLSLIVSSTGKLHLRQIHAVLLRTSLIRNSDVFHHFFSRLALSLIPRD 61

Query: 383 IPYSYRTFLNYPNHNISLYNTMIRAFSLSDSSFLGFELYREMV-HFGIXXXXXXXXXXXK 559
           I YS R F    N  +S  NTMIRAFSLS +   GF L+R +  +              K
Sbjct: 62  INYSCRVFSQRLNPTLSHCNTMIRAFSLSQTPCEGFRLFRALRRNISFPANPLSSSFALK 121

Query: 560 CSIKMGSLLNGLQVHGMIFRDGHISDCFLSSALVDFYSVNRKYDEACKVFAEM 718
           C IK G LL GLQ+HG IF DG +SD  L + L+D YS      +ACKVF E+
Sbjct: 122 CCIKSGDLLGGLQIHGKIFSDGFLSDSLLMTTLMDLYSTCENSTDACKVFDEI 174


>ref|XP_006440604.1| hypothetical protein CICLE_v10018999mg [Citrus clementina]
           gi|557542866|gb|ESR53844.1| hypothetical protein
           CICLE_v10018999mg [Citrus clementina]
          Length = 745

 Score =  120 bits (302), Expect = 3e-25
 Identities = 66/160 (41%), Positives = 91/160 (56%)
 Frame = +2

Query: 239 TLISLIKSCARISHLCQIHCYLIRTSFLLNPTIFLCFLSRIALPPFQNIPYSYRTFLNYP 418
           +LI +     R  HL QI  ++I TS + +PT+ L  LSR ALPPF+  PYS +   + P
Sbjct: 172 SLIVIATDVHREPHLLQIQAHIIVTSLIQDPTVSLHILSRFALPPFRETPYSRQILDHIP 231

Query: 419 NHNISLYNTMIRAFSLSDSSFLGFELYREMVHFGIXXXXXXXXXXXKCSIKMGSLLNGLQ 598
             N+S YNTM+RA+S+S S   GF L+ +M    I           KC +K  SL+ GLQ
Sbjct: 232 RPNVSHYNTMVRAYSMSSSPEEGFYLFEKMRQKRIPTNPFACSFAIKCCMKFCSLMGGLQ 291

Query: 599 VHGMIFRDGHISDCFLSSALVDFYSVNRKYDEACKVFAEM 718
           +H  + RDG+  D  L + L+D YS   K  EACK+F E+
Sbjct: 292 IHARVLRDGYQLDSQLMTTLMDLYSTFEKSFEACKLFDEI 331


>ref|XP_003525465.1| PREDICTED: pentatricopeptide repeat-containing protein
           At3g47530-like [Glycine max]
          Length = 579

 Score =  120 bits (300), Expect = 6e-25
 Identities = 74/164 (45%), Positives = 97/164 (59%), Gaps = 1/164 (0%)
 Frame = +2

Query: 230 AENTLISLIKSCARISHLCQIHCYLIRTSFLLNPTIFLCFLSRIALP-PFQNIPYSYRTF 406
           A  T+IS IKS +  + L QIH ++IRT+ +  PT+ L FLSRIAL  P Q+  YS R F
Sbjct: 2   ALETVISAIKSVSHKTRLLQIHAHIIRTTLIQYPTVSLQFLSRIALSGPLQDASYSQRFF 61

Query: 407 LNYPNHNISLYNTMIRAFSLSDSSFLGFELYREMVHFGIXXXXXXXXXXXKCSIKMGSLL 586
               +  +S YNTMIRA S+SDS   G  LYR+M   GI           K  I+   L 
Sbjct: 62  GQLSHPLVSHYNTMIRACSMSDSPQKGLLLYRDMRRRGIAADPLSSSFAVKSCIRFLYLP 121

Query: 587 NGLQVHGMIFRDGHISDCFLSSALVDFYSVNRKYDEACKVFAEM 718
            G+QVH  IF+DGH  D  L +A++D YS+ ++  +ACKVF EM
Sbjct: 122 GGVQVHCNIFKDGHQWDTLLLTAVMDLYSLCQRGGDACKVFDEM 165


>ref|XP_006292869.1| hypothetical protein CARUB_v10019129mg [Capsella rubella]
           gi|482561576|gb|EOA25767.1| hypothetical protein
           CARUB_v10019129mg [Capsella rubella]
          Length = 589

 Score =  119 bits (299), Expect = 8e-25
 Identities = 73/169 (43%), Positives = 97/169 (57%), Gaps = 2/169 (1%)
 Frame = +2

Query: 218 SKSEAENTLISLIKSCARISHLCQIHCYLIRTSFLLNPTIFLCFLSRIALPPF-QNIPYS 394
           S S +++ LISLI S     HL QIH  L+RTS + N  +F  FLSR++L    ++I YS
Sbjct: 4   SISSSDDHLISLIVSSTGKLHLRQIHAVLLRTSLIRNSDVFHHFLSRLSLSLIPRDINYS 63

Query: 395 YRTFLNYPNHNISLYNTMIRAFSLSDSSFLGFELYREMV-HFGIXXXXXXXXXXXKCSIK 571
            R F    N  +S  NTMIRAFSLS +   GF L+R +  +  +           KC IK
Sbjct: 64  CRVFSQRSNPTLSHSNTMIRAFSLSKNPIEGFRLFRALRRNSSLPPNPLSSSFALKCCIK 123

Query: 572 MGSLLNGLQVHGMIFRDGHISDCFLSSALVDFYSVNRKYDEACKVFAEM 718
            G LL GLQ+HG I+ DG +SD  L + L+D YS     ++ACKVF E+
Sbjct: 124 SGDLLGGLQIHGKIYSDGFLSDSLLLTTLMDLYSACENSNDACKVFDEI 172


>ref|XP_004508732.1| PREDICTED: pentatricopeptide repeat-containing protein
           At3g47530-like [Cicer arietinum]
          Length = 633

 Score =  117 bits (293), Expect = 4e-24
 Identities = 79/230 (34%), Positives = 115/230 (50%), Gaps = 1/230 (0%)
 Frame = +2

Query: 32  SQKSRTTAMKTIFSLFHYHQYSIATARIPSTNSTTRSFWSTVAISLQHHHAPPSAPLYLR 211
           S++++T+     F+L H H Y+                  T+     HH           
Sbjct: 26  SRRNQTSFAAATFTLIHLHHYN------------------TIPQPYPHH----------- 56

Query: 212 TASKSEAENTLISLIKSCARISHLCQIHCYLIRTSFLLNPTIFLCFLSRIALP-PFQNIP 388
                  + ++IS IKS +  +HL QIH +++ T+ + +PTI L FLSR+AL  P Q+  
Sbjct: 57  -------KYSVISAIKSSSHKTHLLQIHAHILTTTLIQHPTISLHFLSRLALSGPLQDPT 109

Query: 389 YSYRTFLNYPNHNISLYNTMIRAFSLSDSSFLGFELYREMVHFGIXXXXXXXXXXXKCSI 568
           YS+R F    N  +  YNTMIRA+SLSDS      LYR+M   GI           K  I
Sbjct: 110 YSHRFFDQISNPFVFHYNTMIRAYSLSDSPQKALFLYRDMRRKGIASDPLSSSFAVKSCI 169

Query: 569 KMGSLLNGLQVHGMIFRDGHISDCFLSSALVDFYSVNRKYDEACKVFAEM 718
           +   L  GLQVH  + ++GH SD  L ++L+D YS  ++ D+A KVF E+
Sbjct: 170 RFLYLFGGLQVHCNVLKEGHQSDTLLLTSLMDLYSQCQRCDDASKVFDEI 219


>gb|ESW27301.1| hypothetical protein PHAVU_003G189800g [Phaseolus vulgaris]
          Length = 579

 Score =  115 bits (287), Expect = 2e-23
 Identities = 68/160 (42%), Positives = 96/160 (60%), Gaps = 1/160 (0%)
 Frame = +2

Query: 242 LISLIKSCARISHLCQIHCYLIRTSFLLNPTIFLCFLSRIALP-PFQNIPYSYRTFLNYP 418
           +IS IKS ++ + L QIH ++IRT+ +  P + + FLSRIAL  P Q+  YS+R F ++ 
Sbjct: 6   VISAIKSVSQKTQLLQIHAHIIRTNLIQYPPVSIQFLSRIALSGPLQDANYSHRFFEHFT 65

Query: 419 NHNISLYNTMIRAFSLSDSSFLGFELYREMVHFGIXXXXXXXXXXXKCSIKMGSLLNGLQ 598
           +  +S YNTMIRA S+SDS   G  LYR+M   GI           K  I++   L G+Q
Sbjct: 66  HPLVSHYNTMIRACSMSDSPRKGLLLYRDMRRRGIAADPVSASFAVKSCIRLLYFLGGVQ 125

Query: 599 VHGMIFRDGHISDCFLSSALVDFYSVNRKYDEACKVFAEM 718
           VH  I +DGH  D  L + ++D YS  ++  +ACKVF EM
Sbjct: 126 VHCNILKDGHQWDTLLLTVVMDLYSQCQRGGDACKVFDEM 165


>ref|XP_006404369.1| hypothetical protein EUTSA_v10010230mg [Eutrema salsugineum]
           gi|557105488|gb|ESQ45822.1| hypothetical protein
           EUTSA_v10010230mg [Eutrema salsugineum]
          Length = 590

 Score =  112 bits (280), Expect = 1e-22
 Identities = 71/170 (41%), Positives = 91/170 (53%), Gaps = 3/170 (1%)
 Frame = +2

Query: 218 SKSEAENTLISLIKSCARISHLCQIHCYLIRTSFLLNPTIFLCFLSRIALPPF-QNIPYS 394
           S   + + LISLI S     HL QIH  L+RTS + N  +F  FLSR+AL    ++I YS
Sbjct: 4   SMRSSNDHLISLIVSSTAKLHLRQIHAILLRTSLIRNSDVFHHFLSRLALSLVPRDIDYS 63

Query: 395 YRTFLNYPNHNISLYNTMIRAFSLSDSSFLGFELYREMVH--FGIXXXXXXXXXXXKCSI 568
            R F    N  +S  NTMIRAFS+S++   GF L+R +                  KC I
Sbjct: 64  RRVFSRRSNPTVSHCNTMIRAFSVSETPVEGFRLFRALRRRRSSRPANPLSSSFALKCCI 123

Query: 569 KMGSLLNGLQVHGMIFRDGHISDCFLSSALVDFYSVNRKYDEACKVFAEM 718
           K G  L GLQ+HG I  DG +SD  L + L+D YS     + ACKVF E+
Sbjct: 124 KSGDFLGGLQIHGKIISDGFLSDSLLLTTLMDLYSTCENSNYACKVFDEI 173


>gb|EOY30986.1| Tetratricopeptide repeat (TPR)-like superfamily protein, putative
           [Theobroma cacao]
          Length = 847

 Score = 93.6 bits (231), Expect = 6e-17
 Identities = 73/246 (29%), Positives = 110/246 (44%), Gaps = 11/246 (4%)
 Frame = +2

Query: 5   TAIQFFFHRSQKSRTTAMKTIFSLFHYH---QYSIATARIPSTNSTTRSFWSTVAISLQH 175
           TAI+ F+      R T  K +      H   +  I +  IP   S       + ++S+  
Sbjct: 11  TAIRHFYCCFPFWRVTKKKNLSMNDQRHFKKKKKIISTLIPLKTSKREMALPSTSVSISP 70

Query: 176 ---HHAPPSAPLYLRTASKSEAENTLISLIKSCARISHLCQIHCYLIRTS-----FLLNP 331
              H  P S P Y     K    +  +SL+  C  I  L Q+HC++I+T      F L+ 
Sbjct: 71  FPLHLLPSSDPPY-----KLLQNHPSLSLLSKCRTIQTLKQVHCHIIKTGLHHTQFALSK 125

Query: 332 TIFLCFLSRIALPPFQNIPYSYRTFLNYPNHNISLYNTMIRAFSLSDSSFLGFELYREMV 511
            I  C     A+ PF ++PY+   F +    N  ++NTMIR FSLS S  L  E Y +M+
Sbjct: 126 LIEFC-----AVSPFGDLPYALLLFESIDEPNQVIWNTMIRGFSLSSSPGLTLEFYVKMI 180

Query: 512 HFGIXXXXXXXXXXXKCSIKMGSLLNGLQVHGMIFRDGHISDCFLSSALVDFYSVNRKYD 691
             GI           K   K  S   G Q+HG + + G  SD F+ ++L++ Y+ N ++ 
Sbjct: 181 WSGIVPNSYTFPFVLKSCAKTASTQEGKQIHGQVLKLGLESDAFVHTSLINMYAQNGEFG 240

Query: 692 EACKVF 709
            A  VF
Sbjct: 241 NARLVF 246


Top