BLASTX nr result

ID: Rauwolfia21_contig00033175 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rauwolfia21_contig00033175
         (848 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006338488.1| PREDICTED: putative pentatricopeptide repeat...   320   3e-85
emb|CBI35029.3| unnamed protein product [Vitis vinifera]              313   6e-83
ref|XP_002276684.1| PREDICTED: putative pentatricopeptide repeat...   313   6e-83
gb|EMJ17608.1| hypothetical protein PRUPE_ppa022709mg, partial [...   306   6e-81
ref|XP_004233685.1| PREDICTED: putative pentatricopeptide repeat...   302   9e-80
ref|XP_004304947.1| PREDICTED: pentatricopeptide repeat-containi...   280   6e-73
ref|NP_189505.2| putative pentatricopeptide repeat-containing pr...   250   5e-64
ref|NP_189507.2| pentatricopeptide repeat-containing protein [Ar...   249   7e-64
ref|XP_006290938.1| hypothetical protein CARUB_v10017051mg [Caps...   248   3e-63
ref|XP_006395353.1| hypothetical protein EUTSA_v10005682mg [Eutr...   234   3e-59
gb|EOY03349.1| Tetratricopeptide repeat (TPR)-like superfamily p...   231   2e-58
gb|EXB36666.1| hypothetical protein L484_002079 [Morus notabilis]     219   7e-55
ref|XP_006847904.1| hypothetical protein AMTR_s00029p00110030 [A...   176   1e-41
emb|CAN66581.1| hypothetical protein VITISV_030261 [Vitis vinifera]   176   1e-41
ref|XP_006425654.1| hypothetical protein CICLE_v10025166mg [Citr...   174   3e-41
ref|XP_002877120.1| pentatricopeptide repeat-containing protein ...   174   5e-41
gb|EMJ05156.1| hypothetical protein PRUPE_ppa022872mg [Prunus pe...   172   1e-40
ref|XP_002521565.1| pentatricopeptide repeat-containing protein,...   169   1e-39
ref|XP_006838670.1| hypothetical protein AMTR_s00002p00243230 [A...   168   3e-39
gb|EMJ11381.1| hypothetical protein PRUPE_ppa020166mg [Prunus pe...   168   3e-39

>ref|XP_006338488.1| PREDICTED: putative pentatricopeptide repeat-containing protein
           At3g28640-like [Solanum tuberosum]
          Length = 512

 Score =  320 bits (821), Expect = 3e-85
 Identities = 153/279 (54%), Positives = 204/279 (73%), Gaps = 2/279 (0%)
 Frame = -2

Query: 832 MNRLTNLHDGLRAVETITDRSIKAWSWCMSLAQRCSSMDQLKTIHGIFVAHGIHRNSYAA 653
           MN L+N    +RA     + S + W WCMS+A++C+++ QLK IH I++  G+HRN+YA 
Sbjct: 1   MNCLSNARVLVRA-----NNSFQIWKWCMSMAEKCTNIGQLKAIHAIYITLGLHRNTYAV 55

Query: 652 GKLVSFCALSEKGDLSYASLLFSQLSSPNSFIYNTLIRAYSRSPQPELSLKYFNLML--S 479
            KL+ FCALS  GDLSYAS +F+Q+ +PN+F+YN LIRAYS S QP+ SL YFNLML  S
Sbjct: 56  SKLLDFCALSNTGDLSYASRIFAQVQTPNTFLYNALIRAYSSSSQPQFSLNYFNLMLQTS 115

Query: 478 NLLHPDGHTFPFVLMACANRFLRFEGQKLHSWVVKNGLFESNAHVQSALVRFYSEHKALA 299
           N   PD  TFPF+++ACAN  L  EG+++HSWV+KN    SNAHVQ+AL+RFY+  KAL 
Sbjct: 116 NAAAPDSFTFPFLIIACANGPLEVEGKQIHSWVIKNSFSASNAHVQTALIRFYTNCKALD 175

Query: 298 DARKVFDDINSVDAVQCNILINGFVKCGMALEALDVFRNMLVRGIEADEFXXXXXXXXXT 119
           DARKVFD+I  +D +QCN+L++G ++ G+A EAL +F++ML RG+  DE+          
Sbjct: 176 DARKVFDEITDIDVIQCNVLMSGHLQSGLAKEALSIFQDMLGRGVGPDEYCVTTALGACA 235

Query: 118 HLGALEQGKWIHEYVKKTNLMVSDVFVGTALVDMYAKCG 2
            LGALEQGKWIHE+V K+  +  DVF+G+ALVDMYAKCG
Sbjct: 236 QLGALEQGKWIHEHVTKSEWLEYDVFIGSALVDMYAKCG 274


>emb|CBI35029.3| unnamed protein product [Vitis vinifera]
          Length = 1596

 Score =  313 bits (801), Expect = 6e-83
 Identities = 149/257 (57%), Positives = 193/257 (75%), Gaps = 1/257 (0%)
 Frame = -2

Query: 769 IKAWSWCMSLAQRCSSMDQLKTIHGIFVAHGIHRNSYAAGKLVSFCALSEKGDLSYASLL 590
           ++AW  C+SLAQ CS+M Q K IH +F+ +G+H N+YA  KL+SFCALS  G LSYASL+
Sbjct: 1   MEAWKRCISLAQSCSNMRQFKAIHALFIVNGLHLNNYAISKLISFCALSNSGSLSYASLI 60

Query: 589 FSQLSSPNSFIYNTLIRAYSRSPQPELSLKYFNLMLSNL-LHPDGHTFPFVLMACANRFL 413
           FSQ+ +PN F YNTLIRAYSRS  P+L+L YF LML +  + PD HTFPF++ AC N   
Sbjct: 61  FSQIQNPNLFAYNTLIRAYSRSSTPQLALHYFQLMLDDENVGPDQHTFPFIISACTNSLW 120

Query: 412 RFEGQKLHSWVVKNGLFESNAHVQSALVRFYSEHKALADARKVFDDINSVDAVQCNILIN 233
              G+++H+WV+KNG+  S+ HVQ+ALVRFY+E  A+ DARK+FD+I ++D VQ N+L+N
Sbjct: 121 MLLGKQIHNWVLKNGVASSDRHVQTALVRFYAECCAMGDARKLFDEIPNLDVVQWNVLLN 180

Query: 232 GFVKCGMALEALDVFRNMLVRGIEADEFXXXXXXXXXTHLGALEQGKWIHEYVKKTNLMV 53
           G+V+ G+A EAL+ FRNMLV G+E DEF           LGAL+QGKWIHEYV K   + 
Sbjct: 181 GYVRRGLAPEALNAFRNMLVSGVEPDEFCLTTALKGCAQLGALQQGKWIHEYVTKRKWLE 240

Query: 52  SDVFVGTALVDMYAKCG 2
           +DVF+GTALVDMYAKCG
Sbjct: 241 ADVFIGTALVDMYAKCG 257



 Score = 59.7 bits (143), Expect = 1e-06
 Identities = 53/238 (22%), Positives = 101/238 (42%), Gaps = 2/238 (0%)
 Frame = -2

Query: 709 KTIHGIFVAHGIHRNS-YAAGKLVSFCALSEKGDLSYASLLFSQLSSPNSFIYNTLIRAY 533
           K IH   + +G+  +  +    LV F A  E   +  A  LF ++ + +   +N L+  Y
Sbjct: 125 KQIHNWVLKNGVASSDRHVQTALVRFYA--ECCAMGDARKLFDEIPNLDVVQWNVLLNGY 182

Query: 532 SRSPQPELSLKYFNLMLSNLLHPDGHTFPFVLMACANRFLRFEGQKLHSWVVKNGLFESN 353
            R      +L  F  ML + + PD       L  CA      +G+ +H +V K    E++
Sbjct: 183 VRRGLAPEALNAFRNMLVSGVEPDEFCLTTALKGCAQLGALQQGKWIHEYVTKRKWLEAD 242

Query: 352 AHVQSALVRFYSEHKALADARKVFDDINSVDAVQCNILINGFVKCGMALEALDVFRNMLV 173
             + +ALV  Y++   +  + +VF+ +   +    + +I GF   G   +A+     M V
Sbjct: 243 VFIGTALVDMYAKCGCIDRSVEVFEGMTKRNVFSWSAMIGGFALHGHVRKAMQCLERMQV 302

Query: 172 R-GIEADEFXXXXXXXXXTHLGALEQGKWIHEYVKKTNLMVSDVFVGTALVDMYAKCG 2
             G+  D            H G  E+G+++ E ++    ++      + +VD+  + G
Sbjct: 303 EDGLRPDGVVLLGVIMACAHAGLQEEGQFLLENMEARYGILPKHEHYSCMVDLLCRAG 360


>ref|XP_002276684.1| PREDICTED: putative pentatricopeptide repeat-containing protein
           At3g28640-like [Vitis vinifera]
          Length = 511

 Score =  313 bits (801), Expect = 6e-83
 Identities = 149/257 (57%), Positives = 193/257 (75%), Gaps = 1/257 (0%)
 Frame = -2

Query: 769 IKAWSWCMSLAQRCSSMDQLKTIHGIFVAHGIHRNSYAAGKLVSFCALSEKGDLSYASLL 590
           ++AW  C+SLAQ CS+M Q K IH +F+ +G+H N+YA  KL+SFCALS  G LSYASL+
Sbjct: 1   MEAWKRCISLAQSCSNMRQFKAIHALFIVNGLHLNNYAISKLISFCALSNSGSLSYASLI 60

Query: 589 FSQLSSPNSFIYNTLIRAYSRSPQPELSLKYFNLMLSNL-LHPDGHTFPFVLMACANRFL 413
           FSQ+ +PN F YNTLIRAYSRS  P+L+L YF LML +  + PD HTFPF++ AC N   
Sbjct: 61  FSQIQNPNLFAYNTLIRAYSRSSTPQLALHYFQLMLDDENVGPDQHTFPFIISACTNSLW 120

Query: 412 RFEGQKLHSWVVKNGLFESNAHVQSALVRFYSEHKALADARKVFDDINSVDAVQCNILIN 233
              G+++H+WV+KNG+  S+ HVQ+ALVRFY+E  A+ DARK+FD+I ++D VQ N+L+N
Sbjct: 121 MLLGKQIHNWVLKNGVASSDRHVQTALVRFYAECCAMGDARKLFDEIPNLDVVQWNVLLN 180

Query: 232 GFVKCGMALEALDVFRNMLVRGIEADEFXXXXXXXXXTHLGALEQGKWIHEYVKKTNLMV 53
           G+V+ G+A EAL+ FRNMLV G+E DEF           LGAL+QGKWIHEYV K   + 
Sbjct: 181 GYVRRGLAPEALNAFRNMLVSGVEPDEFCLTTALKGCAQLGALQQGKWIHEYVTKRKWLE 240

Query: 52  SDVFVGTALVDMYAKCG 2
           +DVF+GTALVDMYAKCG
Sbjct: 241 ADVFIGTALVDMYAKCG 257



 Score = 59.7 bits (143), Expect = 1e-06
 Identities = 53/238 (22%), Positives = 101/238 (42%), Gaps = 2/238 (0%)
 Frame = -2

Query: 709 KTIHGIFVAHGIHRNS-YAAGKLVSFCALSEKGDLSYASLLFSQLSSPNSFIYNTLIRAY 533
           K IH   + +G+  +  +    LV F A  E   +  A  LF ++ + +   +N L+  Y
Sbjct: 125 KQIHNWVLKNGVASSDRHVQTALVRFYA--ECCAMGDARKLFDEIPNLDVVQWNVLLNGY 182

Query: 532 SRSPQPELSLKYFNLMLSNLLHPDGHTFPFVLMACANRFLRFEGQKLHSWVVKNGLFESN 353
            R      +L  F  ML + + PD       L  CA      +G+ +H +V K    E++
Sbjct: 183 VRRGLAPEALNAFRNMLVSGVEPDEFCLTTALKGCAQLGALQQGKWIHEYVTKRKWLEAD 242

Query: 352 AHVQSALVRFYSEHKALADARKVFDDINSVDAVQCNILINGFVKCGMALEALDVFRNMLV 173
             + +ALV  Y++   +  + +VF+ +   +    + +I GF   G   +A+     M V
Sbjct: 243 VFIGTALVDMYAKCGCIDRSVEVFEGMTKRNVFSWSAMIGGFALHGHVRKAMQCLERMQV 302

Query: 172 R-GIEADEFXXXXXXXXXTHLGALEQGKWIHEYVKKTNLMVSDVFVGTALVDMYAKCG 2
             G+  D            H G  E+G+++ E ++    ++      + +VD+  + G
Sbjct: 303 EDGLRPDGVVLLGVIMACAHAGLQEEGQFLLENMEARYGILPKHEHYSCMVDLLCRAG 360


>gb|EMJ17608.1| hypothetical protein PRUPE_ppa022709mg, partial [Prunus persica]
          Length = 541

 Score =  306 bits (784), Expect = 6e-81
 Identities = 151/250 (60%), Positives = 189/250 (75%), Gaps = 1/250 (0%)
 Frame = -2

Query: 748 MSLAQRCSSMDQLKTIHGIFVAHGIHRNSYAAGKLVSFCALSEKGDLSYASLLFSQLSSP 569
           MSLAQ CS+M +LK  H IF+ +G+H N+YA  KL++FCALS  GDLSYASLLF+Q+ +P
Sbjct: 1   MSLAQGCSNMRKLKATHAIFITNGLHLNNYAISKLIAFCALSNSGDLSYASLLFNQIQTP 60

Query: 568 NSFIYNTLIRAYSRSPQPELSLKYFNLMLS-NLLHPDGHTFPFVLMACANRFLRFEGQKL 392
           NS++YNTLIRAYSRS QP L++ YF LML  + L PD +TF FV++ACAN      G+++
Sbjct: 61  NSYLYNTLIRAYSRSSQPHLAVHYFLLMLKQSSLGPDNYTFNFVILACANCSWLVSGRQI 120

Query: 391 HSWVVKNGLFESNAHVQSALVRFYSEHKALADARKVFDDINSVDAVQCNILINGFVKCGM 212
           H+WVVKNGLF  +AHVQ+ALVR Y+E K L D++KVFD+I   D +Q N+L+NG+V+CG+
Sbjct: 121 HNWVVKNGLFLVDAHVQTALVRLYAECKVLDDSKKVFDEIPERDVIQWNVLMNGYVRCGL 180

Query: 211 ALEALDVFRNMLVRGIEADEFXXXXXXXXXTHLGALEQGKWIHEYVKKTNLMVSDVFVGT 32
           A EAL VFR+MLV G E D F          HLGAL QGKWI EYVKK   + SDVF+GT
Sbjct: 181 ASEALKVFRDMLVTGFEPDNFCVATGLAACAHLGALRQGKWIDEYVKKRTGLKSDVFIGT 240

Query: 31  ALVDMYAKCG 2
           ALVDMYAKCG
Sbjct: 241 ALVDMYAKCG 250



 Score = 59.3 bits (142), Expect = 2e-06
 Identities = 60/245 (24%), Positives = 103/245 (42%), Gaps = 2/245 (0%)
 Frame = -2

Query: 730 CSSMDQLKTIHGIFVAHGIHR-NSYAAGKLVSFCALSEKGDLSYASLLFSQLSSPNSFIY 554
           CS +   + IH   V +G+   +++    LV   A  E   L  +  +F ++   +   +
Sbjct: 111 CSWLVSGRQIHNWVVKNGLFLVDAHVQTALVRLYA--ECKVLDDSKKVFDEIPERDVIQW 168

Query: 553 NTLIRAYSRSPQPELSLKYFNLMLSNLLHPDGHTFPFVLMACANRFLRFEGQKLHSWVVK 374
           N L+  Y R      +LK F  ML     PD       L ACA+     +G+ +  +V K
Sbjct: 169 NVLMNGYVRCGLASEALKVFRDMLVTGFEPDNFCVATGLAACAHLGALRQGKWIDEYVKK 228

Query: 373 NGLFESNAHVQSALVRFYSEHKALADARKVFDDINSVDAVQCNILINGFVKCGMALEALD 194
               +S+  + +ALV  Y++   +  A + F+ +   + V    +I GF   G A  A+ 
Sbjct: 229 RTGLKSDVFIGTALVDMYAKCGCIDLAVEAFEGMPKRNVVSWAAMIGGFAAHGCATNAIH 288

Query: 193 VFRNMLV-RGIEADEFXXXXXXXXXTHLGALEQGKWIHEYVKKTNLMVSDVFVGTALVDM 17
               M V  G+  D           TH G LE+GK + + +K    +V      + ++D+
Sbjct: 289 SLERMQVDDGLRPDGVVLLVVLMACTHAGLLEKGKLLLDNMKTQYGIVPKHEHYSCVIDL 348

Query: 16  YAKCG 2
             K G
Sbjct: 349 LCKAG 353


>ref|XP_004233685.1| PREDICTED: putative pentatricopeptide repeat-containing protein
           At3g28640-like [Solanum lycopersicum]
          Length = 487

 Score =  302 bits (774), Expect = 9e-80
 Identities = 141/249 (56%), Positives = 189/249 (75%), Gaps = 2/249 (0%)
 Frame = -2

Query: 742 LAQRCSSMDQLKTIHGIFVAHGIHRNSYAAGKLVSFCALSEKGDLSYASLLFSQLSSPNS 563
           +A++C++M QLK IH I++  G+ RN+YA  KL+ FCALS  GDLSYAS +F+Q+ +PN+
Sbjct: 1   MAEKCNNMRQLKAIHAIYITLGLQRNTYAVSKLLDFCALSNSGDLSYASRIFAQVQTPNA 60

Query: 562 FIYNTLIRAYSRSPQPELSLKYFNLML--SNLLHPDGHTFPFVLMACANRFLRFEGQKLH 389
           F+YN LIRAYS SPQP++SL YFNLM+  SN   PD  TFPF+L+ACAN  L  EG+++H
Sbjct: 61  FLYNALIRAYSSSPQPQVSLNYFNLMVQTSNAAAPDSFTFPFLLIACANGPLEVEGKQIH 120

Query: 388 SWVVKNGLFESNAHVQSALVRFYSEHKALADARKVFDDINSVDAVQCNILINGFVKCGMA 209
           SW++KN    SNAHVQ+AL+RFY+  KAL DARKVFD+I  +D +QCN+L++G ++ G+A
Sbjct: 121 SWIIKNSFSASNAHVQTALIRFYTNCKALDDARKVFDEITDIDVIQCNVLMSGHLQSGLA 180

Query: 208 LEALDVFRNMLVRGIEADEFXXXXXXXXXTHLGALEQGKWIHEYVKKTNLMVSDVFVGTA 29
            EAL +F++ML RG+  DE+           LGALEQGKWIHE+V K+  +  DVF+G+A
Sbjct: 181 KEALSIFQDMLGRGVGPDEYCVTTALGACAQLGALEQGKWIHEHVTKSEWLEYDVFIGSA 240

Query: 28  LVDMYAKCG 2
           LVDMYAKCG
Sbjct: 241 LVDMYAKCG 249


>ref|XP_004304947.1| PREDICTED: pentatricopeptide repeat-containing protein
           At3g28660-like [Fragaria vesca subsp. vesca]
          Length = 501

 Score =  280 bits (715), Expect = 6e-73
 Identities = 139/257 (54%), Positives = 180/257 (70%), Gaps = 1/257 (0%)
 Frame = -2

Query: 769 IKAWSWCMSLAQRCSSMDQLKTIHGIFVAHGIHRNSYAAGKLVSFCALSEKGDLSYASLL 590
           I+AW  CMSLAQ C++M  LK  H +F+ HG+H N++A  KL++FCALS+ G L YASL+
Sbjct: 9   IQAWKRCMSLAQCCTTMRSLKPTHAVFITHGLHLNNFAVSKLLAFCALSDSGSLRYASLI 68

Query: 589 FSQLSSPNSFIYNTLIRAYSRSPQPELSLKYFNLMLSNL-LHPDGHTFPFVLMACANRFL 413
           F Q+ +PN+++YNTLIRA+S S  P L++ YF LM   + L PD  TF F ++ C N   
Sbjct: 69  FHQVPAPNAYMYNTLIRAHSASSDPHLAMYYFQLMSKQIDLEPDNFTFHFAILGCVNCGW 128

Query: 412 RFEGQKLHSWVVKNGLFESNAHVQSALVRFYSEHKALADARKVFDDINSVDAVQCNILIN 233
              G+++H  VVKNGL  ++AHVQ+A+VR Y E   L DA KVFD+I   D VQ N+++N
Sbjct: 129 IGPGRQMHCLVVKNGLVAADAHVQTAVVRLYVECGVLGDAHKVFDEIPERDMVQWNVIMN 188

Query: 232 GFVKCGMALEALDVFRNMLVRGIEADEFXXXXXXXXXTHLGALEQGKWIHEYVKKTNLMV 53
           G+VK G+A EAL VF++MLVRG E D F          HLGAL QGKWIHEYV+K   + 
Sbjct: 189 GYVKRGLASEALRVFQDMLVRGFEPDGFCVATGLAACAHLGALWQGKWIHEYVRKREGLN 248

Query: 52  SDVFVGTALVDMYAKCG 2
           SDVF+GTALVDMYAKCG
Sbjct: 249 SDVFIGTALVDMYAKCG 265



 Score = 62.8 bits (151), Expect = 2e-07
 Identities = 50/208 (24%), Positives = 91/208 (43%), Gaps = 1/208 (0%)
 Frame = -2

Query: 622 EKGDLSYASLLFSQLSSPNSFIYNTLIRAYSRSPQPELSLKYFNLMLSNLLHPDGHTFPF 443
           E G L  A  +F ++   +   +N ++  Y +      +L+ F  ML     PDG     
Sbjct: 161 ECGVLGDAHKVFDEIPERDMVQWNVIMNGYVKRGLASEALRVFQDMLVRGFEPDGFCVAT 220

Query: 442 VLMACANRFLRFEGQKLHSWVVKNGLFESNAHVQSALVRFYSEHKALADARKVFDDINSV 263
            L ACA+    ++G+ +H +V K     S+  + +ALV  Y++   +  A + F+ +   
Sbjct: 221 GLAACAHLGALWQGKWIHEYVRKREGLNSDVFIGTALVDMYAKCGCIDLAVEAFEGMGKR 280

Query: 262 DAVQCNILINGFVKCGMALEALDVFRNMLV-RGIEADEFXXXXXXXXXTHLGALEQGKWI 86
           + V  + +I  +   G A EA+     M V  G++ D            H G LE+GK +
Sbjct: 281 NVVSWSAMIGAYGVHGYATEAISCLERMQVDDGVKPDGVVLLGVLTACNHGGLLEKGKAL 340

Query: 85  HEYVKKTNLMVSDVFVGTALVDMYAKCG 2
            + +K    +V      + ++D+  K G
Sbjct: 341 LDNMKAKYGIVPKHEHYSCVIDLLCKAG 368


>ref|NP_189505.2| putative pentatricopeptide repeat-containing protein [Arabidopsis
           thaliana] gi|75273576|sp|Q9LJJ1.1|PP259_ARATH RecName:
           Full=Putative pentatricopeptide repeat-containing
           protein At3g28640 gi|9294278|dbj|BAB02180.1| unnamed
           protein product [Arabidopsis thaliana]
           gi|332643948|gb|AEE77469.1| putative pentatricopeptide
           repeat-containing protein [Arabidopsis thaliana]
          Length = 504

 Score =  250 bits (638), Expect = 5e-64
 Identities = 125/260 (48%), Positives = 175/260 (67%), Gaps = 5/260 (1%)
 Frame = -2

Query: 766 KAWSWCMSLAQRCSSMDQLKTIHGIFVAHGIHRNSYAAGKLVS-FCALSEKGD-LSYASL 593
           ++W   +  +QRC+++ Q+K+ H +F+ HG+HRN+YA  KL++ F  L        YAS 
Sbjct: 9   QSWKSLILASQRCNTVKQIKSTHSLFIIHGLHRNTYAISKLLTAFLHLPNLNKHFHYASS 68

Query: 592 LFSQLSSPNSFIYNTLIRAYSRSPQPELSLKYFNLMLSNL---LHPDGHTFPFVLMACAN 422
           +F  +  PNSF+Y+T+IR  SRS QP L L+YF LM+      + P   TF F+++AC  
Sbjct: 69  IFDSIEIPNSFVYDTMIRICSRSSQPHLGLRYFLLMVKEEEEDIAPSYLTFHFLIVACLK 128

Query: 421 RFLRFEGQKLHSWVVKNGLFESNAHVQSALVRFYSEHKALADARKVFDDINSVDAVQCNI 242
                 G+++H WVVKNG+F S++HVQ+ ++R Y E K L DARKVFD+I   D V+ ++
Sbjct: 129 ACFFSVGKQIHCWVVKNGVFLSDSHVQTGVLRIYVEDKLLLDARKVFDEIPQPDVVKWDV 188

Query: 241 LINGFVKCGMALEALDVFRNMLVRGIEADEFXXXXXXXXXTHLGALEQGKWIHEYVKKTN 62
           L+NG+V+CG+  E L+VFR MLV+G+E DEF           +GAL QGKWIHE+VKK +
Sbjct: 189 LMNGYVRCGLGSEGLEVFREMLVKGLEPDEFSVTTALTACAQVGALAQGKWIHEFVKKKS 248

Query: 61  LMVSDVFVGTALVDMYAKCG 2
            + SDVFVGTALVDMYAKCG
Sbjct: 249 WIESDVFVGTALVDMYAKCG 268



 Score = 60.8 bits (146), Expect = 6e-07
 Identities = 49/201 (24%), Positives = 85/201 (42%), Gaps = 1/201 (0%)
 Frame = -2

Query: 601 ASLLFSQLSSPNSFIYNTLIRAYSRSPQPELSLKYFNLMLSNLLHPDGHTFPFVLMACAN 422
           A  +F ++  P+   ++ L+  Y R       L+ F  ML   L PD  +    L ACA 
Sbjct: 171 ARKVFDEIPQPDVVKWDVLMNGYVRCGLGSEGLEVFREMLVKGLEPDEFSVTTALTACAQ 230

Query: 421 RFLRFEGQKLHSWVVKNGLFESNAHVQSALVRFYSEHKALADARKVFDDINSVDAVQCNI 242
                +G+ +H +V K    ES+  V +ALV  Y++   +  A +VF  +   +      
Sbjct: 231 VGALAQGKWIHEFVKKKSWIESDVFVGTALVDMYAKCGCIETAVEVFKKLTRRNVFSWAA 290

Query: 241 LINGFVKCGMALEALDVFRNM-LVRGIEADEFXXXXXXXXXTHLGALEQGKWIHEYVKKT 65
           LI G+   G A +A+     +    GI+ D            H G LE+G+ + E ++  
Sbjct: 291 LIGGYAAYGYAKKAMTCLERLEREDGIKPDSVVLLGVLAACAHGGFLEEGRSMLENMEAR 350

Query: 64  NLMVSDVFVGTALVDMYAKCG 2
             +       + +VD+  + G
Sbjct: 351 YEITPKHEHYSCIVDLMCRAG 371


>ref|NP_189507.2| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
           gi|75273574|sp|Q9LJI9.1|PP260_ARATH RecName:
           Full=Pentatricopeptide repeat-containing protein
           At3g28660 gi|9294280|dbj|BAB02182.1| unnamed protein
           product [Arabidopsis thaliana]
           gi|20259531|gb|AAM13885.1| unknown protein [Arabidopsis
           thaliana] gi|24030460|gb|AAN41382.1| unknown protein
           [Arabidopsis thaliana] gi|332643950|gb|AEE77471.1|
           pentatricopeptide repeat-containing protein [Arabidopsis
           thaliana]
          Length = 504

 Score =  249 bits (637), Expect = 7e-64
 Identities = 126/260 (48%), Positives = 173/260 (66%), Gaps = 5/260 (1%)
 Frame = -2

Query: 766 KAWSWCMSLAQRCSSMDQLKTIHGIFVAHGIHRNSYAAGKLVS-FCALSEKGD-LSYASL 593
           ++W   +  +QRC+++ Q+K+ H +F+ HG+HRN+YA  KL++ F  L        YAS 
Sbjct: 9   QSWKSLILASQRCNTVKQIKSTHSLFIIHGLHRNTYAISKLLTAFLHLPNLNKHFHYASS 68

Query: 592 LFSQLSSPNSFIYNTLIRAYSRSPQPELSLKYFNLMLSNL---LHPDGHTFPFVLMACAN 422
           +F  +  PNSF+Y+T+IR  SRS QP L L+YF LM+      + P   TF F+++AC  
Sbjct: 69  IFDSIEIPNSFVYDTMIRICSRSSQPHLGLRYFLLMVKEEEEDITPSYLTFHFLIVACLK 128

Query: 421 RFLRFEGQKLHSWVVKNGLFESNAHVQSALVRFYSEHKALADARKVFDDINSVDAVQCNI 242
                 G+++H WVVKNG+F S+ HVQ+ ++R Y E K L DARKVFD+I   D V+ ++
Sbjct: 129 ACFFSVGKQIHCWVVKNGVFLSDGHVQTGVLRIYVEDKLLFDARKVFDEIPQPDVVKWDV 188

Query: 241 LINGFVKCGMALEALDVFRNMLVRGIEADEFXXXXXXXXXTHLGALEQGKWIHEYVKKTN 62
           L+NG+V+CG+  E L+VF+ MLVRGIE DEF           +GAL QGKWIHE+VKK  
Sbjct: 189 LMNGYVRCGLGSEGLEVFKEMLVRGIEPDEFSVTTALTACAQVGALAQGKWIHEFVKKKR 248

Query: 61  LMVSDVFVGTALVDMYAKCG 2
            + SDVFVGTALVDMYAKCG
Sbjct: 249 WIESDVFVGTALVDMYAKCG 268



 Score = 58.5 bits (140), Expect = 3e-06
 Identities = 49/201 (24%), Positives = 85/201 (42%), Gaps = 1/201 (0%)
 Frame = -2

Query: 601 ASLLFSQLSSPNSFIYNTLIRAYSRSPQPELSLKYFNLMLSNLLHPDGHTFPFVLMACAN 422
           A  +F ++  P+   ++ L+  Y R       L+ F  ML   + PD  +    L ACA 
Sbjct: 171 ARKVFDEIPQPDVVKWDVLMNGYVRCGLGSEGLEVFKEMLVRGIEPDEFSVTTALTACAQ 230

Query: 421 RFLRFEGQKLHSWVVKNGLFESNAHVQSALVRFYSEHKALADARKVFDDINSVDAVQCNI 242
                +G+ +H +V K    ES+  V +ALV  Y++   +  A +VF+ +   +      
Sbjct: 231 VGALAQGKWIHEFVKKKRWIESDVFVGTALVDMYAKCGCIETAVEVFEKLTRRNVFSWAA 290

Query: 241 LINGFVKCGMALEALDVF-RNMLVRGIEADEFXXXXXXXXXTHLGALEQGKWIHEYVKKT 65
           LI G+   G A +A     R     GI+ D            H G LE+G+ + E ++  
Sbjct: 291 LIGGYAAYGYAKKATTCLDRIEREDGIKPDSVVLLGVLAACAHGGFLEEGRTMLENMEAR 350

Query: 64  NLMVSDVFVGTALVDMYAKCG 2
             +       + +VD+  + G
Sbjct: 351 YGITPKHEHYSCIVDLMCRAG 371


>ref|XP_006290938.1| hypothetical protein CARUB_v10017051mg [Capsella rubella]
           gi|482559645|gb|EOA23836.1| hypothetical protein
           CARUB_v10017051mg [Capsella rubella]
          Length = 507

 Score =  248 bits (632), Expect = 3e-63
 Identities = 125/263 (47%), Positives = 174/263 (66%), Gaps = 8/263 (3%)
 Frame = -2

Query: 766 KAWSWCMSLAQRCSSMDQLKTIHGIFVAHGIHRNSYAAGKLVS-FCALSEKGD-LSYASL 593
           ++W   +  +QRC+++ Q+K+ H +F+ HG+HRN+YA  KL++ F  L        YAS 
Sbjct: 9   QSWKTLILASQRCNTVKQIKSTHALFIIHGLHRNTYAISKLLTAFLHLPNLNKHFHYAST 68

Query: 592 LFSQLSSPNSFIYNTLIRAYSRSPQPELSLKYFNLMLSNL------LHPDGHTFPFVLMA 431
           +F  +   N+F+Y+T+IR  SRS  P+L L+YF LM+S        + P   TF F+++A
Sbjct: 69  IFDSIEIRNTFVYDTMIRICSRSSLPQLGLRYFRLMVSEDEKEEEDIAPSYLTFHFLIVA 128

Query: 430 CANRFLRFEGQKLHSWVVKNGLFESNAHVQSALVRFYSEHKALADARKVFDDINSVDAVQ 251
           C    L   G+++H WVVKNG+F S+ HVQ+ ++R Y E + L DARKVFD+I   D V+
Sbjct: 129 CLKACLFSVGKQIHCWVVKNGVFLSDGHVQTGVLRIYVEDRVLVDARKVFDEIPQPDVVK 188

Query: 250 CNILINGFVKCGMALEALDVFRNMLVRGIEADEFXXXXXXXXXTHLGALEQGKWIHEYVK 71
            ++L+NG+V+CG+  E L+VFR MLVRGIE DEF           +GAL QGKWIHE+VK
Sbjct: 189 WDVLMNGYVRCGLGSEGLEVFREMLVRGIEPDEFSVTTALTACAQVGALAQGKWIHEFVK 248

Query: 70  KTNLMVSDVFVGTALVDMYAKCG 2
           K   + SDVFVGTALVDMYAKCG
Sbjct: 249 KKKWVKSDVFVGTALVDMYAKCG 271


>ref|XP_006395353.1| hypothetical protein EUTSA_v10005682mg [Eutrema salsugineum]
           gi|557091992|gb|ESQ32639.1| hypothetical protein
           EUTSA_v10005682mg [Eutrema salsugineum]
          Length = 505

 Score =  234 bits (597), Expect = 3e-59
 Identities = 119/261 (45%), Positives = 170/261 (65%), Gaps = 6/261 (2%)
 Frame = -2

Query: 766 KAWSWCMSLAQRCSSMDQLKTIHGIFVAHGIHRNSYAAGKLVS-FCALSEKGD-LSYASL 593
           ++W   +  +QRC+++ Q+K+ H +F+ HGIHRN+YA  KL++ F  L        YAS+
Sbjct: 9   QSWRSLILASQRCTTLRQIKSTHALFIIHGIHRNTYAISKLLTAFLPLPNLDKHFHYASI 68

Query: 592 LFSQLSSPNSFIYNTLIRAYSRSPQPELSLKYFNLMLSNL----LHPDGHTFPFVLMACA 425
           +F  +   NSF+Y+T+IR  SRS +P L ++YF LML+      + P   TF F+L+A  
Sbjct: 69  IFDSIELRNSFVYDTMIRICSRSSRPHLGVRYFRLMLTEDDEEDIAPSYLTFHFLLVAFL 128

Query: 424 NRFLRFEGQKLHSWVVKNGLFESNAHVQSALVRFYSEHKALADARKVFDDINSVDAVQCN 245
           N  L   G+++H WV+KNG+  S+ HVQ+ ++R Y E K L DARKVFD+I   D V+ +
Sbjct: 129 NASLFSVGKQIHCWVIKNGVLSSDGHVQTGIIRLYIEGKVLPDARKVFDEIPHPDVVKWD 188

Query: 244 ILINGFVKCGMALEALDVFRNMLVRGIEADEFXXXXXXXXXTHLGALEQGKWIHEYVKKT 65
           +L+NG+V+CG+  E L VFR ML RG E D+F           +GAL QGK IH+ +KK 
Sbjct: 189 VLMNGYVRCGLGSEGLHVFREMLARGTEPDKFSVTTALTACAQVGALAQGKLIHKLLKKK 248

Query: 64  NLMVSDVFVGTALVDMYAKCG 2
            L+ SD++VGTALVDMYAKCG
Sbjct: 249 KLLESDIYVGTALVDMYAKCG 269



 Score = 61.2 bits (147), Expect = 4e-07
 Identities = 44/171 (25%), Positives = 77/171 (45%), Gaps = 1/171 (0%)
 Frame = -2

Query: 601 ASLLFSQLSSPNSFIYNTLIRAYSRSPQPELSLKYFNLMLSNLLHPDGHTFPFVLMACAN 422
           A  +F ++  P+   ++ L+  Y R       L  F  ML+    PD  +    L ACA 
Sbjct: 172 ARKVFDEIPHPDVVKWDVLMNGYVRCGLGSEGLHVFREMLARGTEPDKFSVTTALTACAQ 231

Query: 421 RFLRFEGQKLHSWVVKNGLFESNAHVQSALVRFYSEHKALADARKVFDDINSVDAVQCNI 242
                +G+ +H  + K  L ES+ +V +ALV  Y++   +  A +VF++++  +     +
Sbjct: 232 VGALAQGKLIHKLLKKKKLLESDIYVGTALVDMYAKCGCIETALEVFENLSRRNVFSWAV 291

Query: 241 LINGFVKCGMALEALDVFRNM-LVRGIEADEFXXXXXXXXXTHLGALEQGK 92
           LI G+   G A +A+     M    GI+ D            H G L++G+
Sbjct: 292 LIGGYAAYGYAKKAIMCLDQMEREDGIKPDSVVLLTVLAACAHGGFLQEGR 342


>gb|EOY03349.1| Tetratricopeptide repeat (TPR)-like superfamily protein, putative
           [Theobroma cacao]
          Length = 499

 Score =  231 bits (589), Expect = 2e-58
 Identities = 113/257 (43%), Positives = 166/257 (64%), Gaps = 2/257 (0%)
 Frame = -2

Query: 766 KAWSWCMSLAQRCSSMDQLKTIHGIFVAHGIHRNSYAAGKLVSFCALSEKGDLSYASLLF 587
           + W+ C++L QRC+   Q++ IH + +  G+HRN     KL+SF + S   +L Y+SLLF
Sbjct: 10  QCWTRCLTLLQRCTKASQIEPIHALLITQGLHRNPCIISKLISFLS-SPPTNLHYSSLLF 68

Query: 586 SQLSSPNSFIYNTLIRAYSRSPQPELSLKYFNLMLSN-LLHPDGHTFPFVLMACANRFLR 410
           +QL     FIYNTLI+A+S SP P+ S  YFN +L    + P+  T  F+L++CA     
Sbjct: 69  NQLHKSTLFIYNTLIKAHSNSPHPQTSFHYFNHLLEEETIRPNCQTLNFILVSCAKTCSL 128

Query: 409 FEGQKLHSWVVKNGLFESNAHVQSALVRFYSEHKALADARKVFDDINSVDAVQCNILING 230
             G+++ +WV KNG+F S+++VQ+ ++R Y E +   DARKVFD+I  VD V+ N+L++G
Sbjct: 129 RSGKQIQNWVFKNGMFSSDSYVQTGVIRLYVEARLWVDARKVFDEIAYVDVVKWNVLMSG 188

Query: 229 FVKCGMALEALDVFRNMLVRGIEADEFXXXXXXXXXTHLGALEQGKWIHEYVKKTNLMVS 50
             +C +  +AL VF+ +LV GI+ DEF            G+L +GKWIHEY++K    + 
Sbjct: 189 LARCRLGTQALSVFKELLVFGIQPDEFCLTTALTACAQNGSLREGKWIHEYLRKREKCLE 248

Query: 49  -DVFVGTALVDMYAKCG 2
            DVF+GTALVDMYAKCG
Sbjct: 249 LDVFIGTALVDMYAKCG 265


>gb|EXB36666.1| hypothetical protein L484_002079 [Morus notabilis]
          Length = 487

 Score =  219 bits (559), Expect = 7e-55
 Identities = 123/259 (47%), Positives = 161/259 (62%), Gaps = 2/259 (0%)
 Frame = -2

Query: 772 SIKAWSW--CMSLAQRCSSMDQLKTIHGIFVAHGIHRNSYAAGKLVSFCALSEKGDLSYA 599
           SI++W+W   +SLAQRC++M QLK IH +F+  G+H N+YA  KL++FCALS+ GDL +A
Sbjct: 7   SIQSWNWKRFISLAQRCANMRQLKPIHALFITTGLHLNNYAISKLIAFCALSDSGDLRHA 66

Query: 598 SLLFSQLSSPNSFIYNTLIRAYSRSPQPELSLKYFNLMLSNLLHPDGHTFPFVLMACANR 419
           SL+F+Q+ +PNSFIYNTLIRAYSRS QP L+L+YF   + + +  D  TF FVL+AC N 
Sbjct: 67  SLMFNQIQTPNSFIYNTLIRAYSRSSQPHLALRYFQPTVKDKV-ADNLTFSFVLLACVNG 125

Query: 418 FLRFEGQKLHSWVVKNGLFESNAHVQSALVRFYSEHKALADARKVFDDINSVDAVQCNIL 239
            L  EG ++H +    G  ES        VR +                   D  Q N L
Sbjct: 126 GLVLEGTQVHCY---GGCKES--------VRGHR------------------DLFQWNAL 156

Query: 238 INGFVKCGMALEALDVFRNMLVRGIEADEFXXXXXXXXXTHLGALEQGKWIHEYVKKTNL 59
           ++G+++C +A EAL VFR+ML  G+E DE             GAL  GKWIHEY++K   
Sbjct: 157 MDGYIRCSLASEALGVFRDMLKFGVELDECCAVTALTACAQSGALWWGKWIHEYIEKREG 216

Query: 58  MVSDVFVGTALVDMYAKCG 2
             SDVFVGTALVDMY KCG
Sbjct: 217 FESDVFVGTALVDMYTKCG 235


>ref|XP_006847904.1| hypothetical protein AMTR_s00029p00110030 [Amborella trichopoda]
           gi|548851209|gb|ERN09485.1| hypothetical protein
           AMTR_s00029p00110030 [Amborella trichopoda]
          Length = 305

 Score =  176 bits (445), Expect = 1e-41
 Identities = 95/256 (37%), Positives = 149/256 (58%), Gaps = 1/256 (0%)
 Frame = -2

Query: 769 IKAWSW-CMSLAQRCSSMDQLKTIHGIFVAHGIHRNSYAAGKLVSFCALSEKGDLSYASL 593
           +  WS  C+ L +RC+++ Q+  IH + +  G+   ++A  K++ FCA+S+  +L YA  
Sbjct: 2   VAGWSHQCLFLIKRCTTIKQVHQIHSLMITTGLSHCNFAMSKIIHFCAVSDPKNLEYALS 61

Query: 592 LFSQLSSPNSFIYNTLIRAYSRSPQPELSLKYFNLMLSNLLHPDGHTFPFVLMACANRFL 413
           LF+Q+++P +FI+NT+IR +S S  P+ ++  F  ML   L PD HTFPFVL AC N   
Sbjct: 62  LFNQVTNPTNFIWNTMIRGFSISQNPQKAILIFTKMLQKSLSPDKHTFPFVLRACVN--- 118

Query: 412 RFEGQKLHSWVVKNGLFESNAHVQSALVRFYSEHKALADARKVFDDINSVDAVQCNILIN 233
             +G  +++ V+KNGL   +  V ++L+  YS+  AL  A +VFD+    D V    LI+
Sbjct: 119 SKQGNVIYTHVLKNGLVH-DTFVCNSLIAMYSKCDALDCAYRVFDETPQRDVVTWTALID 177

Query: 232 GFVKCGMALEALDVFRNMLVRGIEADEFXXXXXXXXXTHLGALEQGKWIHEYVKKTNLMV 53
           G+V+   A   LD+F  M + GIE DE            +GAL  G+ +H +  +   ++
Sbjct: 178 GYVRANRATMGLDLFAKMRLVGIEPDEITMVSVLCAIGLVGALRLGRCVHAHFIEPKKVI 237

Query: 52  SDVFVGTALVDMYAKC 5
            D  +G AL+DMYAKC
Sbjct: 238 YDSILGCALLDMYAKC 253


>emb|CAN66581.1| hypothetical protein VITISV_030261 [Vitis vinifera]
          Length = 622

 Score =  176 bits (445), Expect = 1e-41
 Identities = 98/280 (35%), Positives = 153/280 (54%), Gaps = 31/280 (11%)
 Frame = -2

Query: 748 MSLAQRCSSMDQLKTIHGIFVAHGIHRNSYAAGKLVSFCALSEKGDLSYASLLFSQLSSP 569
           + L QRCS+M++L+ IHG  +  G+  +   A KL++FCA    G L+YA  +F ++  P
Sbjct: 22  LHLLQRCSNMEELRQIHGQMLKTGLILDEIPASKLLAFCASPNSGSLAYARTVFDRIFRP 81

Query: 568 NSFIYNTLIRAYSRSPQPELSLKYFNLMLSNLLHPDGHTFPFVLMACANRFLRFEGQKLH 389
           N+F++NT+IR YS S +PE +L  ++ ML + +  + +TFPF+L AC++     E Q++H
Sbjct: 82  NTFMWNTMIRGYSNSKEPEEALLLYHHMLYHSVPHNAYTFPFLLKACSSMSASEETQQIH 141

Query: 388 SWVVKNGLFESNAHVQSALVRFYSEHKALADARKVFDDINSVDAVQCNILINGFVKCG-- 215
           + ++K G F S  +  ++L+  YS+   +  AR +FD ++  D V  N +I+G+ KCG  
Sbjct: 142 AHIIKMG-FGSEIYTTNSLLNVYSKSGDIKSARLLFDQVDQRDTVSWNSMIDGYTKCGEI 200

Query: 214 -MALE----------------------------ALDVFRNMLVRGIEADEFXXXXXXXXX 122
            MA E                            AL++F  M   GI+ D           
Sbjct: 201 EMAYEIFNHMPERNIISWTSMISGCVGAGKPKEALNLFHRMQTAGIKLDNVALVSTLQAC 260

Query: 121 THLGALEQGKWIHEYVKKTNLMVSDVFVGTALVDMYAKCG 2
             LG L+QGKWIH Y+KK  + + D  +G  L+DMYAKCG
Sbjct: 261 ADLGVLDQGKWIHAYIKKHEIEI-DPILGCVLIDMYAKCG 299



 Score = 68.6 bits (166), Expect = 3e-09
 Identities = 60/281 (21%), Positives = 118/281 (41%), Gaps = 34/281 (12%)
 Frame = -2

Query: 742 LAQRCSSM---DQLKTIHGIFVAHGIHRNSYAAGKLVSFCALSEKGDLSYASLLFSQLSS 572
           L + CSSM   ++ + IH   +  G     Y    L++    S+ GD+  A LLF Q+  
Sbjct: 124 LLKACSSMSASEETQQIHAHIIKMGFGSEIYTTNSLLN--VYSKSGDIKSARLLFDQVDQ 181

Query: 571 PNSFIYNTLIRAYSRSPQPELSLKYFNLMLS-----------------------NLLHP- 464
            ++  +N++I  Y++  + E++ + FN M                         NL H  
Sbjct: 182 RDTVSWNSMIDGYTKCGEIEMAYEIFNHMPERNIISWTSMISGCVGAGKPKEALNLFHRM 241

Query: 463 -------DGHTFPFVLMACANRFLRFEGQKLHSWVVKNGLFESNAHVQSALVRFYSEHKA 305
                  D       L ACA+  +  +G+ +H+++ K+ + E +  +   L+  Y++   
Sbjct: 242 QTAGIKLDNVALVSTLQACADLGVLDQGKWIHAYIKKHEI-EIDPILGCVLIDMYAKCGD 300

Query: 304 LADARKVFDDINSVDAVQCNILINGFVKCGMALEALDVFRNMLVRGIEADEFXXXXXXXX 125
           L +A +VF  +          +I+G+   G   EAL+ F  M   G+E ++         
Sbjct: 301 LEEAIEVFRKMEEKGVSVWTAMISGYAIHGRGREALEWFMKMQTAGVEPNQMTFTGILTA 360

Query: 124 XTHLGALEQGKWIHEYVKKTNLMVSDVFVGTALVDMYAKCG 2
            +H G + + K + E +++ +     +     +VD+  + G
Sbjct: 361 CSHAGLVHEAKLLFESMERIHGFKPSIEHYGCMVDLLGRAG 401


>ref|XP_006425654.1| hypothetical protein CICLE_v10025166mg [Citrus clementina]
           gi|568824869|ref|XP_006466814.1| PREDICTED:
           pentatricopeptide repeat-containing protein
           At5g66520-like [Citrus sinensis]
           gi|557527644|gb|ESR38894.1| hypothetical protein
           CICLE_v10025166mg [Citrus clementina]
          Length = 622

 Score =  174 bits (442), Expect = 3e-41
 Identities = 94/280 (33%), Positives = 148/280 (52%), Gaps = 31/280 (11%)
 Frame = -2

Query: 748 MSLAQRCSSMDQLKTIHGIFVAHGIHRNSYAAGKLVSFCALSEKGDLSYASLLFSQLSSP 569
           +SL  RCS M++LK IH      G+  N+    +L++FC  S  G L+YA ++F ++  P
Sbjct: 22  LSLLNRCSYMEELKQIHAQMFKKGLTVNTILVSRLLAFCTFSNSGSLAYAQMVFDRIIKP 81

Query: 568 NSFIYNTLIRAYSRSPQPELSLKYFNLMLSNLLHPDGHTFPFVLMACANRFLRFEGQKLH 389
           N+F++NT++R Y+ S +PE +L  +  MLS+ +  + +TFPF+L AC+      E Q++H
Sbjct: 82  NTFMWNTMVRGYADSSEPEQALLLYRQMLSHSVSHNAYTFPFLLKACSRLSALEETQQIH 141

Query: 388 SWVVKNGLFESNAHVQSALVRFYSEHKALADARKVFDDINSVDAVQCNILINGFVKCG-- 215
           + ++K G F S     ++L+  Y+   ++  AR +FD +   D V  N +I+G+ KCG  
Sbjct: 142 AQIIKFG-FSSEVFATNSLLHAYAISGSIKSARLIFDHMPQRDTVSWNSMIDGYTKCGEM 200

Query: 214 -----------------------------MALEALDVFRNMLVRGIEADEFXXXXXXXXX 122
                                        M  EAL +F  M   G++ D           
Sbjct: 201 ELACEFFKDMKEKNVISWTTLISGYVGAGMDKEALHLFHEMQTAGVKPDNVALVSAVSAC 260

Query: 121 THLGALEQGKWIHEYVKKTNLMVSDVFVGTALVDMYAKCG 2
            HLGAL+QG+WI EY+K   + + D  +G AL+DMYAKCG
Sbjct: 261 AHLGALDQGRWIDEYIKHLGIKI-DPILGCALIDMYAKCG 299


>ref|XP_002877120.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
           subsp. lyrata] gi|297322958|gb|EFH53379.1|
           pentatricopeptide repeat-containing protein [Arabidopsis
           lyrata subsp. lyrata]
          Length = 399

 Score =  174 bits (440), Expect = 5e-41
 Identities = 84/155 (54%), Positives = 108/155 (69%)
 Frame = -2

Query: 466 PDGHTFPFVLMACANRFLRFEGQKLHSWVVKNGLFESNAHVQSALVRFYSEHKALADARK 287
           P   TF F+++AC    L   G+++H WVVKNG+F S+ HVQ+ ++R Y E K L DA K
Sbjct: 9   PSYLTFYFLIVACFKACLFSVGKQIHCWVVKNGVFLSDGHVQTGILRIYVEDKVLLDAHK 68

Query: 286 VFDDINSVDAVQCNILINGFVKCGMALEALDVFRNMLVRGIEADEFXXXXXXXXXTHLGA 107
           VFD+I   D V+ ++L+NG+V+CG+  E L+VFR MLVRG+E DEF           +GA
Sbjct: 69  VFDEIPKPDVVKWDVLMNGYVRCGLGSEGLEVFREMLVRGVEPDEFSVTTALTACAQVGA 128

Query: 106 LEQGKWIHEYVKKTNLMVSDVFVGTALVDMYAKCG 2
           L QGKWIHE+VKK   + SDVFVGTALVDMYAKCG
Sbjct: 129 LAQGKWIHEFVKKKRWIESDVFVGTALVDMYAKCG 163



 Score = 58.2 bits (139), Expect = 4e-06
 Identities = 44/171 (25%), Positives = 75/171 (43%), Gaps = 1/171 (0%)
 Frame = -2

Query: 601 ASLLFSQLSSPNSFIYNTLIRAYSRSPQPELSLKYFNLMLSNLLHPDGHTFPFVLMACAN 422
           A  +F ++  P+   ++ L+  Y R       L+ F  ML   + PD  +    L ACA 
Sbjct: 66  AHKVFDEIPKPDVVKWDVLMNGYVRCGLGSEGLEVFREMLVRGVEPDEFSVTTALTACAQ 125

Query: 421 RFLRFEGQKLHSWVVKNGLFESNAHVQSALVRFYSEHKALADARKVFDDINSVDAVQCNI 242
                +G+ +H +V K    ES+  V +ALV  Y++   +  A +VF+ ++  +      
Sbjct: 126 VGALAQGKWIHEFVKKKRWIESDVFVGTALVDMYAKCGCIEMAVEVFEKLSRRNVFSWAA 185

Query: 241 LINGFVKCGMALEALDVFRNM-LVRGIEADEFXXXXXXXXXTHLGALEQGK 92
           LI G+   G A +A+     M    GI+ D            H G L++G+
Sbjct: 186 LIGGYAAYGYAKKAMTCLDRMEREDGIKPDSVVLLGVLAACAHGGFLQEGR 236


>gb|EMJ05156.1| hypothetical protein PRUPE_ppa022872mg [Prunus persica]
          Length = 714

 Score =  172 bits (436), Expect = 1e-40
 Identities = 92/255 (36%), Positives = 149/255 (58%), Gaps = 6/255 (2%)
 Frame = -2

Query: 748 MSLAQRCSSMDQLKTIHGIFVAHGIHRNSYAAGKLVSFCALSEKGDLSYASLLFSQLSSP 569
           ++L  +C SM  LK +H   +  G+H   +A  KLV FCA+S  GDLSYA L+F  + +P
Sbjct: 37  LTLLSKCKSMQNLKQVHAHIIKTGLHNTHFALSKLVEFCAISPFGDLSYALLVFQSIENP 96

Query: 568 NSFIYNTLIRAYSRSPQPELSLKYFNLMLSNLLHPDGHTFPFVLMACANRFLRFEGQKLH 389
           N  I+NT+IR +S S +   +++++ LML + + P+ +TFPF+L +CA      EG+++H
Sbjct: 97  NQIIWNTIIRGFSLSSKSIQAVEFYVLMLLSGVEPNSYTFPFLLKSCAKFAASHEGKQIH 156

Query: 388 SWVVKNGLFESNAHVQSALVRFYSEH------KALADARKVFDDINSVDAVQCNILINGF 227
             V+K GL +S+A V ++L+  Y+++        + DAR +FD+I   D V  N +I+G+
Sbjct: 157 GHVLKLGL-DSDAFVHTSLINMYAQNVLSEMWGCMDDARYLFDEIPGRDVVSWNAMISGY 215

Query: 226 VKCGMALEALDVFRNMLVRGIEADEFXXXXXXXXXTHLGALEQGKWIHEYVKKTNLMVSD 47
            + G   EAL +F  M    +  +E             G+LE GKW+  +++   L  S+
Sbjct: 216 AQSGRFEEALALFSEMRKANVSPNESTMVVVLSACAQSGSLELGKWVGSWIENRGL-GSN 274

Query: 46  VFVGTALVDMYAKCG 2
           + +  AL+DMYAKCG
Sbjct: 275 LRLVNALIDMYAKCG 289



 Score =  118 bits (296), Expect = 2e-24
 Identities = 77/240 (32%), Positives = 124/240 (51%), Gaps = 4/240 (1%)
 Frame = -2

Query: 709 KTIHGIFVAHGIHRNSYAAGKLVSFCA---LSEK-GDLSYASLLFSQLSSPNSFIYNTLI 542
           K IHG  +  G+  +++    L++  A   LSE  G +  A  LF ++   +   +N +I
Sbjct: 153 KQIHGHVLKLGLDSDAFVHTSLINMYAQNVLSEMWGCMDDARYLFDEIPGRDVVSWNAMI 212

Query: 541 RAYSRSPQPELSLKYFNLMLSNLLHPDGHTFPFVLMACANRFLRFEGQKLHSWVVKNGLF 362
             Y++S + E +L  F+ M    + P+  T   VL ACA       G+ + SW+   GL 
Sbjct: 213 SGYAQSGRFEEALALFSEMRKANVSPNESTMVVVLSACAQSGSLELGKWVGSWIENRGL- 271

Query: 361 ESNAHVQSALVRFYSEHKALADARKVFDDINSVDAVQCNILINGFVKCGMALEALDVFRN 182
            SN  + +AL+  Y++  AL  AR +FD +   D +  N++I G+       EAL +FR 
Sbjct: 272 GSNLRLVNALIDMYAKCGALDTARSLFDGLQQRDVISWNVMIGGYTHKSHYKEALALFRL 331

Query: 181 MLVRGIEADEFXXXXXXXXXTHLGALEQGKWIHEYVKKTNLMVSDVFVGTALVDMYAKCG 2
           ML    + ++          +HLGAL+ GKWIH Y+ K    +++  + T+L+DMYAKCG
Sbjct: 332 MLRSNADPNDVTFLGILPACSHLGALDLGKWIHAYIDKNFQSLTNTSLWTSLIDMYAKCG 391



 Score = 68.9 bits (167), Expect = 2e-09
 Identities = 52/219 (23%), Positives = 97/219 (44%)
 Frame = -2

Query: 748 MSLAQRCSSMDQLKTIHGIFVAHGIHRNSYAAGKLVSFCALSEKGDLSYASLLFSQLSSP 569
           +S   +  S++  K +       G+  N      L+   A  + G L  A  LF  L   
Sbjct: 247 LSACAQSGSLELGKWVGSWIENRGLGSNLRLVNALIDMYA--KCGALDTARSLFDGLQQR 304

Query: 568 NSFIYNTLIRAYSRSPQPELSLKYFNLMLSNLLHPDGHTFPFVLMACANRFLRFEGQKLH 389
           +   +N +I  Y+     + +L  F LML +   P+  TF  +L AC++      G+ +H
Sbjct: 305 DVISWNVMIGGYTHKSHYKEALALFRLMLRSNADPNDVTFLGILPACSHLGALDLGKWIH 364

Query: 388 SWVVKNGLFESNAHVQSALVRFYSEHKALADARKVFDDINSVDAVQCNILINGFVKCGMA 209
           +++ KN    +N  + ++L+  Y++   +  A++VF+ + +      N +I+G    G A
Sbjct: 365 AYIDKNFQSLTNTSLWTSLIDMYAKCGNIEAAKQVFNGMEAKSLASWNAMISGLAMHGHA 424

Query: 208 LEALDVFRNMLVRGIEADEFXXXXXXXXXTHLGALEQGK 92
             AL++F  M   G + DE           H G ++ G+
Sbjct: 425 HTALELFSKMADEGFKPDEITFVGVLSACNHGGLVDLGR 463


>ref|XP_002521565.1| pentatricopeptide repeat-containing protein, putative [Ricinus
           communis] gi|223539243|gb|EEF40836.1| pentatricopeptide
           repeat-containing protein, putative [Ricinus communis]
          Length = 338

 Score =  169 bits (428), Expect = 1e-39
 Identities = 97/281 (34%), Positives = 155/281 (55%), Gaps = 32/281 (11%)
 Frame = -2

Query: 748 MSLAQRCSSMDQLKTIHGIFVAHGIHRNSYAAGKLVSFCALSEKGDLSYASLLFSQLSS- 572
           +SL ++CSSM +LK IH      G    +    +L +F A    G+L+YA ++F  LSS 
Sbjct: 14  LSLLEKCSSMMELKQIHAQMFKTGSVLETITISELQAFAASPNSGNLTYAKIVFDSLSSR 73

Query: 571 PNSFIYNTLIRAYSRSPQPELSLKYFNLMLSNLLHPDGHTFPFVLMACANRFLRFEGQKL 392
           PN++I+N ++R Y+ S +PE +L  ++ ML + +  +G+TFPF+L AC++     + Q++
Sbjct: 74  PNTYIWNAMLRGYADSNKPEEALILYHQMLCHSVPHNGYTFPFLLKACSSLSAIEKAQQV 133

Query: 391 HSWVVKNGLFESNAHVQSALVRFYSEHKALADARKVFDDINSVDAVQCNILINGFVKCG- 215
           H+ ++K G F S+ +  ++L+  Y+    +  AR +FD I   D V  N +I+G+VKCG 
Sbjct: 134 HAQIIKLG-FGSDVYTTNSLLHAYAASGFIESARIIFDRIPHPDTVSWNSIIDGYVKCGE 192

Query: 214 ------------------------------MALEALDVFRNMLVRGIEADEFXXXXXXXX 125
                                         +  EALD+F+ M + GI+ D+         
Sbjct: 193 TETAYELFKDMPEKNAISFTVMISGHVQAGLDKEALDLFQEMQIAGIKPDKIVLTNVLSA 252

Query: 124 XTHLGALEQGKWIHEYVKKTNLMVSDVFVGTALVDMYAKCG 2
             HLGAL+QG+WIH Y+KK ++ + D  +G AL DMYAKCG
Sbjct: 253 CAHLGALDQGRWIHTYIKKNDVQI-DPMLGCALTDMYAKCG 292


>ref|XP_006838670.1| hypothetical protein AMTR_s00002p00243230 [Amborella trichopoda]
           gi|548841176|gb|ERN01239.1| hypothetical protein
           AMTR_s00002p00243230 [Amborella trichopoda]
          Length = 479

 Score =  168 bits (425), Expect = 3e-39
 Identities = 97/251 (38%), Positives = 146/251 (58%), Gaps = 2/251 (0%)
 Frame = -2

Query: 748 MSLAQRCSSMDQLKTIHGIFVAHGIHRNSYAAGKLVSFCALSEKGDLSYASLLFSQLSSP 569
           MSL     ++  L   H + +   +H+N+YA  KL++   LS    L+YA+ LF ++ +P
Sbjct: 1   MSLIASARTLPHLMAAHALLITSDLHQNNYALSKLLA--PLSSL-HLNYATSLFLRIQNP 57

Query: 568 NSFIYNTLIRAYSRSPQPELSLKYFNLMLSNLLHPDGHTFPFVLMACANRFLRFEGQKLH 389
           N+F+ NT+IRA +  P P  +L  F    ++  H D H F F L+ACAN    +EG++LH
Sbjct: 58  NAFLSNTIIRARANGPDPRSALSLF----AHSPHHDAHAFAFALIACANSTSLWEGRQLH 113

Query: 388 SWVVKNGLFESNAHVQSALVRFYSEHKALADARKVFDDINSV--DAVQCNILINGFVKCG 215
           S V +NG+   +  ++S LVR Y +   LADAR VFD+      D V  + +I+G+V+ G
Sbjct: 114 SQVTRNGMACGDCFLRSNLVRLYVQCGRLADARLVFDETPGCVRDNVIWHAIIHGYVREG 173

Query: 214 MALEALDVFRNMLVRGIEADEFXXXXXXXXXTHLGALEQGKWIHEYVKKTNLMVSDVFVG 35
              +A  +F +M + G+  D F          H+GAL QG+W+HE +  + + V D +VG
Sbjct: 174 RCTDAFALFNDMQIEGVGVDRFVATTMLTACAHVGALRQGRWVHELL-SSGVEVDD-YVG 231

Query: 34  TALVDMYAKCG 2
           TALVDMYAKCG
Sbjct: 232 TALVDMYAKCG 242


>gb|EMJ11381.1| hypothetical protein PRUPE_ppa020166mg [Prunus persica]
          Length = 443

 Score =  168 bits (425), Expect = 3e-39
 Identities = 86/240 (35%), Positives = 141/240 (58%)
 Frame = -2

Query: 721 MDQLKTIHGIFVAHGIHRNSYAAGKLVSFCALSEKGDLSYASLLFSQLSSPNSFIYNTLI 542
           M ++K  HG  + +G+ +N +  GK++SFCA+SE+GD++YA+ +FS + +P+ F++NT+I
Sbjct: 1   MKEVKQTHGHIIRNGVDQNPFVLGKIISFCAVSERGDMNYAASVFSSIENPDGFLWNTMI 60

Query: 541 RAYSRSPQPELSLKYFNLMLSNLLHPDGHTFPFVLMACANRFLRFEGQKLHSWVVKNGLF 362
           R + ++ +PE + +++  M       D  T  F+L AC        G+++H   +K GL 
Sbjct: 61  RGFGKTRKPEKAFEFYKRMQEKGEVADNFTLSFLLKACGQLGSYLLGKQMHCATLKLGL- 119

Query: 361 ESNAHVQSALVRFYSEHKALADARKVFDDINSVDAVQCNILINGFVKCGMALEALDVFRN 182
           ES+  V++ L+  Y   +    A K+F++I S D V  N +I+  V CG   EALD+F  
Sbjct: 120 ESHVFVRNTLIHIYGVLRDDQTATKLFEEIPSPDLVAWNTIIDSHVNCGKCKEALDLFLR 179

Query: 181 MLVRGIEADEFXXXXXXXXXTHLGALEQGKWIHEYVKKTNLMVSDVFVGTALVDMYAKCG 2
           +L  G+E DE          + LGAL+ G+W+H  + + NL    V V  +L+DMYAKCG
Sbjct: 180 LLQSGVEPDEATVVVTLSACSTLGALDFGRWVHSCIDQVNL-GDIVTVSNSLIDMYAKCG 238


Top