BLASTX nr result

ID: Catharanthus23_contig00029537 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00029537
         (552 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002323442.1| hypothetical protein POPTR_0016s08300g [Popu...   264   1e-68
ref|XP_002273989.2| PREDICTED: pentatricopeptide repeat-containi...   263   2e-68
emb|CBI32643.3| unnamed protein product [Vitis vinifera]              263   2e-68
gb|EOY07667.1| Tetratricopeptide repeat-like superfamily protein...   256   3e-66
ref|XP_002525213.1| pentatricopeptide repeat-containing protein,...   252   5e-65
ref|XP_006428957.1| hypothetical protein CICLE_v10011437mg [Citr...   244   8e-63
gb|EMJ06150.1| hypothetical protein PRUPE_ppa000364mg [Prunus pe...   243   2e-62
ref|XP_006410915.1| hypothetical protein EUTSA_v10017783mg [Eutr...   233   2e-59
ref|XP_004142220.1| PREDICTED: pentatricopeptide repeat-containi...   231   1e-58
ref|XP_006345691.1| PREDICTED: pentatricopeptide repeat-containi...   230   1e-58
ref|XP_004492962.1| PREDICTED: pentatricopeptide repeat-containi...   230   1e-58
ref|XP_003624377.1| Pentatricopeptide repeat-containing protein ...   227   1e-57
ref|NP_181269.1| pentatricopeptide repeat-containing protein [Ar...   226   2e-57
gb|ESW26688.1| hypothetical protein PHAVU_003G140000g [Phaseolus...   225   6e-57
ref|XP_002879661.1| pentatricopeptide repeat-containing protein ...   224   1e-56
gb|EXB31275.1| hypothetical protein L484_014760 [Morus notabilis]     221   9e-56
ref|XP_003550640.2| PREDICTED: pentatricopeptide repeat-containi...   220   2e-55
ref|XP_006294030.1| hypothetical protein CARUB_v10023022mg [Caps...   215   7e-54
ref|XP_002462364.1| hypothetical protein SORBIDRAFT_02g024422 [S...   186   2e-45
ref|XP_003565649.1| PREDICTED: pentatricopeptide repeat-containi...   184   2e-44

>ref|XP_002323442.1| hypothetical protein POPTR_0016s08300g [Populus trichocarpa]
           gi|222868072|gb|EEF05203.1| hypothetical protein
           POPTR_0016s08300g [Populus trichocarpa]
          Length = 526

 Score =  264 bits (674), Expect = 1e-68
 Identities = 126/183 (68%), Positives = 148/183 (80%)
 Frame = +2

Query: 2   LIIDPSVLSNALSICGSERALRMGIQFHCLVFVNGFISNIYVGSSLISFYGKCGILNDAY 181
           L  D SVLSNA+S C S R LR GIQ+HCL    GFI+N Y+GSSL++FYGKCG L++AY
Sbjct: 124 LSFDASVLSNAVSSCASTRDLRGGIQYHCLAISAGFIANAYIGSSLVTFYGKCGELDNAY 183

Query: 182 KMFDEMPLKNVVSWTAMINGFAQENQVDRSLQLYNEMRNLKLKPNDFTFTSFLSLCTACG 361
           K+F EMP++NVVSWTA+I+GFAQ+ QVD  LQLY  MRN  LKPNDFTFTS LS CT  G
Sbjct: 184 KVFKEMPVRNVVSWTAIISGFAQDWQVDMCLQLYCLMRNSTLKPNDFTFTSLLSACTGSG 243

Query: 362 SLGQGRSIHCQTIHLGFDSHIHIANALISMYCKCGNIEDAFFIFRNIDNKDLVSWNSMIA 541
           +LGQGRS HCQ I +GF S++HIANAL+SMYCKCGN+EDAF IF N+  KD+VSWNSMIA
Sbjct: 244 ALGQGRSAHCQIIEMGFVSYLHIANALVSMYCKCGNVEDAFHIFENMVGKDIVSWNSMIA 303

Query: 542 AYA 550
            YA
Sbjct: 304 GYA 306



 Score =  103 bits (256), Expect = 4e-20
 Identities = 52/174 (29%), Positives = 95/174 (54%), Gaps = 1/174 (0%)
 Frame = +2

Query: 26  SNALSICGSERALRMGIQFHCLVFVNGFISNIYVGSSLISFYGKCGILNDAYKMFDEMPL 205
           ++ LS C    AL  G   HC +   GF+S +++ ++L+S Y KCG + DA+ +F+ M  
Sbjct: 233 TSLLSACTGSGALGQGRSAHCQIIEMGFVSYLHIANALVSMYCKCGNVEDAFHIFENMVG 292

Query: 206 KNVVSWTAMINGFAQENQVDRSLQLYNEMRNLKLKPNDFTFTSFLSLCTACGSLGQGRSI 385
           K++VSW +MI G+AQ     + + L+  M++  +KP+  TF   LS C   G +  GR+ 
Sbjct: 293 KDIVSWNSMIAGYAQHGLAVQGIGLFERMKSQGVKPDAITFLGVLSSCRHAGFVEGGRNY 352

Query: 386 HCQTIHLGFDSHIHIANALISMYCKCGNIEDA-FFIFRNIDNKDLVSWNSMIAA 544
               +  G    +   + ++ +  + G +E+A +FI R   + + V W S++++
Sbjct: 353 FNSMVEYGVKPELDHYSCIVDLLGRAGLLEEAQYFIERMPVSPNAVIWGSLVSS 406


>ref|XP_002273989.2| PREDICTED: pentatricopeptide repeat-containing protein
           At2g37320-like [Vitis vinifera]
          Length = 510

 Score =  263 bits (673), Expect = 2e-68
 Identities = 121/182 (66%), Positives = 147/182 (80%)
 Frame = +2

Query: 5   IIDPSVLSNALSICGSERALRMGIQFHCLVFVNGFISNIYVGSSLISFYGKCGILNDAYK 184
           I+D S LS+ALS+C S R+L+ G+QFHCL    GF+ N+YVGS LISFY KCG L  AY+
Sbjct: 110 IVDASALSHALSLCASSRSLKSGVQFHCLAIRTGFVGNVYVGSCLISFYSKCGELCHAYR 169

Query: 185 MFDEMPLKNVVSWTAMINGFAQENQVDRSLQLYNEMRNLKLKPNDFTFTSFLSLCTACGS 364
           +F+EMP+KNVVSWTA+I GFAQE  VD  L+LY+ MRN  LKPND TFT  LS CT  GS
Sbjct: 170 VFEEMPVKNVVSWTAIIAGFAQEWLVDGCLELYSRMRNSTLKPNDLTFTCLLSTCTGGGS 229

Query: 365 LGQGRSIHCQTIHLGFDSHIHIANALISMYCKCGNIEDAFFIFRNIDNKDLVSWNSMIAA 544
           LG+GRS HCQTI +GFDS++H+ANALISMYCKCGN+EDA++IF  +D KD+VSWNSMIA 
Sbjct: 230 LGRGRSAHCQTIEMGFDSYVHVANALISMYCKCGNVEDAYYIFERMDGKDIVSWNSMIAG 289

Query: 545 YA 550
           +A
Sbjct: 290 HA 291



 Score = 98.6 bits (244), Expect = 9e-19
 Identities = 55/171 (32%), Positives = 90/171 (52%), Gaps = 1/171 (0%)
 Frame = +2

Query: 35  LSICGSERALRMGIQFHCLVFVNGFISNIYVGSSLISFYGKCGILNDAYKMFDEMPLKNV 214
           LS C    +L  G   HC     GF S ++V ++LIS Y KCG + DAY +F+ M  K++
Sbjct: 221 LSTCTGGGSLGRGRSAHCQTIEMGFDSYVHVANALISMYCKCGNVEDAYYIFERMDGKDI 280

Query: 215 VSWTAMINGFAQENQVDRSLQLYNEMRNLKLKPNDFTFTSFLSLCTACGSLGQGRSIHCQ 394
           VSW +MI G AQ     +++ L+ EM+  KLKP+  TF   LS C   G + QG+     
Sbjct: 281 VSWNSMIAGHAQHGLAVQAIDLFEEMKKQKLKPDAITFLGVLSSCRHVGLVKQGQFYFNS 340

Query: 395 TIHLGFDSHIHIANALISMYCKCGNIEDA-FFIFRNIDNKDLVSWNSMIAA 544
            +  G    +     ++ +  + G +E+A  FI +   + + + W S++++
Sbjct: 341 MVEHGVKPELDHFACVVDLLGRAGLLEEARDFIVKMPIHPNAIIWGSLLSS 391


>emb|CBI32643.3| unnamed protein product [Vitis vinifera]
          Length = 1400

 Score =  263 bits (673), Expect = 2e-68
 Identities = 121/182 (66%), Positives = 147/182 (80%)
 Frame = +2

Query: 5    IIDPSVLSNALSICGSERALRMGIQFHCLVFVNGFISNIYVGSSLISFYGKCGILNDAYK 184
            I+D S LS+ALS+C S R+L+ G+QFHCL    GF+ N+YVGS LISFY KCG L  AY+
Sbjct: 971  IVDASALSHALSLCASSRSLKSGVQFHCLAIRTGFVGNVYVGSCLISFYSKCGELCHAYR 1030

Query: 185  MFDEMPLKNVVSWTAMINGFAQENQVDRSLQLYNEMRNLKLKPNDFTFTSFLSLCTACGS 364
            +F+EMP+KNVVSWTA+I GFAQE  VD  L+LY+ MRN  LKPND TFT  LS CT  GS
Sbjct: 1031 VFEEMPVKNVVSWTAIIAGFAQEWLVDGCLELYSRMRNSTLKPNDLTFTCLLSTCTGGGS 1090

Query: 365  LGQGRSIHCQTIHLGFDSHIHIANALISMYCKCGNIEDAFFIFRNIDNKDLVSWNSMIAA 544
            LG+GRS HCQTI +GFDS++H+ANALISMYCKCGN+EDA++IF  +D KD+VSWNSMIA 
Sbjct: 1091 LGRGRSAHCQTIEMGFDSYVHVANALISMYCKCGNVEDAYYIFERMDGKDIVSWNSMIAG 1150

Query: 545  YA 550
            +A
Sbjct: 1151 HA 1152



 Score = 98.6 bits (244), Expect = 9e-19
 Identities = 55/171 (32%), Positives = 90/171 (52%), Gaps = 1/171 (0%)
 Frame = +2

Query: 35   LSICGSERALRMGIQFHCLVFVNGFISNIYVGSSLISFYGKCGILNDAYKMFDEMPLKNV 214
            LS C    +L  G   HC     GF S ++V ++LIS Y KCG + DAY +F+ M  K++
Sbjct: 1082 LSTCTGGGSLGRGRSAHCQTIEMGFDSYVHVANALISMYCKCGNVEDAYYIFERMDGKDI 1141

Query: 215  VSWTAMINGFAQENQVDRSLQLYNEMRNLKLKPNDFTFTSFLSLCTACGSLGQGRSIHCQ 394
            VSW +MI G AQ     +++ L+ EM+  KLKP+  TF   LS C   G + QG+     
Sbjct: 1142 VSWNSMIAGHAQHGLAVQAIDLFEEMKKQKLKPDAITFLGVLSSCRHVGLVKQGQFYFNS 1201

Query: 395  TIHLGFDSHIHIANALISMYCKCGNIEDA-FFIFRNIDNKDLVSWNSMIAA 544
             +  G    +     ++ +  + G +E+A  FI +   + + + W S++++
Sbjct: 1202 MVEHGVKPELDHFACVVDLLGRAGLLEEARDFIVKMPIHPNAIIWGSLLSS 1252


>gb|EOY07667.1| Tetratricopeptide repeat-like superfamily protein, putative
           [Theobroma cacao]
          Length = 516

 Score =  256 bits (653), Expect = 3e-66
 Identities = 121/181 (66%), Positives = 144/181 (79%), Gaps = 1/181 (0%)
 Frame = +2

Query: 11  DPSVLSNALSICGSERALRMGIQFHCLVFVNGFISNIYVGSSLISFYGKCGILNDAYKMF 190
           +P VLSNA+S CGS+R L  GIQ+HCL    GF  N+YVGSSL++ YGKCG L +AYK+F
Sbjct: 116 NPIVLSNAISSCGSKRNLYGGIQYHCLAIKTGFFPNVYVGSSLVTSYGKCGELEEAYKVF 175

Query: 191 DEMPLKNVVSWTAMINGFAQENQVDRSLQLYNEMRNLKLKPNDFTFTSFLSLCTACGSLG 370
           DEMP+KNVVSWT +I GFAQE Q+D  L+LYN M+NL LKPNDFT TS L  CT  G+LG
Sbjct: 176 DEMPVKNVVSWTTIIAGFAQEWQIDMCLELYNMMKNLTLKPNDFTLTSLLRACTGSGALG 235

Query: 371 QGRSIHCQTIHLGFDSHIHIANALISMYCKCGNIEDAFFIFRNI-DNKDLVSWNSMIAAY 547
           QGRS HCQ I +GFDS+ +I NALISMYCKCG++EDA FIF+ +  NKD+VSWNSMIA Y
Sbjct: 236 QGRSAHCQVIQMGFDSYSYICNALISMYCKCGSVEDAMFIFKKMAGNKDIVSWNSMIAGY 295

Query: 548 A 550
           A
Sbjct: 296 A 296



 Score = 92.0 bits (227), Expect = 8e-17
 Identities = 55/176 (31%), Positives = 92/176 (52%), Gaps = 2/176 (1%)
 Frame = +2

Query: 23  LSNALSICGSERALRMGIQFHCLVFVNGFISNIYVGSSLISFYGKCGILNDAYKMFDEMP 202
           L++ L  C    AL  G   HC V   GF S  Y+ ++LIS Y KCG + DA  +F +M 
Sbjct: 221 LTSLLRACTGSGALGQGRSAHCQVIQMGFDSYSYICNALISMYCKCGSVEDAMFIFKKMA 280

Query: 203 -LKNVVSWTAMINGFAQENQVDRSLQLYNEMRNLKLKPNDFTFTSFLSLCTACGSLGQGR 379
             K++VSW +MI G+AQ      ++ L+ +M+  K+KP+  TF   LS C   G + QGR
Sbjct: 281 GNKDIVSWNSMIAGYAQHGLAMEAIDLFEKMKEEKIKPDSITFLGVLSSCRHAGLVEQGR 340

Query: 380 SIHCQTIHLGFDSHIHIANALISMYCKCGNIEDA-FFIFRNIDNKDLVSWNSMIAA 544
                 +  G +  +   + ++ +  + G +++A  FI +     + V W S++++
Sbjct: 341 VYFDSMVVHGVEPALDHYSCVVDLLGRAGLLKEAREFILKMPICPNAVIWGSLLSS 396



 Score = 59.3 bits (142), Expect = 6e-07
 Identities = 26/81 (32%), Positives = 49/81 (60%)
 Frame = +2

Query: 308 KPNDFTFTSFLSLCTACGSLGQGRSIHCQTIHLGFDSHIHIANALISMYCKCGNIEDAFF 487
           K N    ++ +S C +  +L  G   HC  I  GF  ++++ ++L++ Y KCG +E+A+ 
Sbjct: 114 KFNPIVLSNAISSCGSKRNLYGGIQYHCLAIKTGFFPNVYVGSSLVTSYGKCGELEEAYK 173

Query: 488 IFRNIDNKDLVSWNSMIAAYA 550
           +F  +  K++VSW ++IA +A
Sbjct: 174 VFDEMPVKNVVSWTTIIAGFA 194


>ref|XP_002525213.1| pentatricopeptide repeat-containing protein, putative [Ricinus
           communis] gi|223535510|gb|EEF37179.1| pentatricopeptide
           repeat-containing protein, putative [Ricinus communis]
          Length = 501

 Score =  252 bits (643), Expect = 5e-65
 Identities = 120/183 (65%), Positives = 147/183 (80%)
 Frame = +2

Query: 2   LIIDPSVLSNALSICGSERALRMGIQFHCLVFVNGFISNIYVGSSLISFYGKCGILNDAY 181
           LI D S LS+A+S C S R L  GIQFHCL   +GFISN YVGSSLI+ YGKCG L++A+
Sbjct: 102 LIFDASSLSSAVSSCASSRDLPGGIQFHCLAITSGFISNSYVGSSLITLYGKCGKLDNAH 161

Query: 182 KMFDEMPLKNVVSWTAMINGFAQENQVDRSLQLYNEMRNLKLKPNDFTFTSFLSLCTACG 361
           K+F EMP++NVV+WTA+I+GFAQE QVD  L+L++ MRN  LKPNDFTFTS LS CT  G
Sbjct: 162 KLFHEMPVRNVVTWTAIISGFAQECQVDVCLELFSVMRNSTLKPNDFTFTSLLSACTGSG 221

Query: 362 SLGQGRSIHCQTIHLGFDSHIHIANALISMYCKCGNIEDAFFIFRNIDNKDLVSWNSMIA 541
           +LGQG S HCQ I +GF S++H+ANALISMYCK G++ DAF+IF NI +KD+VSWNSMI+
Sbjct: 222 ALGQGTSAHCQIIQMGFHSYLHVANALISMYCKSGSVHDAFYIFNNIYSKDIVSWNSMIS 281

Query: 542 AYA 550
            YA
Sbjct: 282 GYA 284



 Score = 96.3 bits (238), Expect = 4e-18
 Identities = 53/174 (30%), Positives = 92/174 (52%), Gaps = 1/174 (0%)
 Frame = +2

Query: 26  SNALSICGSERALRMGIQFHCLVFVNGFISNIYVGSSLISFYGKCGILNDAYKMFDEMPL 205
           ++ LS C    AL  G   HC +   GF S ++V ++LIS Y K G ++DA+ +F+ +  
Sbjct: 211 TSLLSACTGSGALGQGTSAHCQIIQMGFHSYLHVANALISMYCKSGSVHDAFYIFNNIYS 270

Query: 206 KNVVSWTAMINGFAQENQVDRSLQLYNEMRNLKLKPNDFTFTSFLSLCTACGSLGQGRSI 385
           K++VSW +MI+G+AQ     +++ L+ +M  L +KP+  TF   LS C   G +  GR+ 
Sbjct: 271 KDIVSWNSMISGYAQHGLAMQAIDLFEKMTKLGVKPDSITFLGVLSACRHAGFVQGGRNY 330

Query: 386 HCQTIHLGFDSHIHIANALISMYCKCGNIEDAF-FIFRNIDNKDLVSWNSMIAA 544
               +       +   + L+ +  + G IE+A   I R     + V W S++++
Sbjct: 331 FNSMVEYHLRPQLDHYSCLVDLLGRAGLIEEALDIILRMPILPNAVIWGSLLSS 384


>ref|XP_006428957.1| hypothetical protein CICLE_v10011437mg [Citrus clementina]
           gi|568853804|ref|XP_006480530.1| PREDICTED:
           pentatricopeptide repeat-containing protein
           At2g37320-like [Citrus sinensis]
           gi|557531014|gb|ESR42197.1| hypothetical protein
           CICLE_v10011437mg [Citrus clementina]
          Length = 539

 Score =  244 bits (624), Expect = 8e-63
 Identities = 114/183 (62%), Positives = 142/183 (77%)
 Frame = +2

Query: 2   LIIDPSVLSNALSICGSERALRMGIQFHCLVFVNGFISNIYVGSSLISFYGKCGILNDAY 181
           L +D S LS A++ CGS R +R G  + CL    GFI+N+YVGSSLI+ Y KC ++ DAY
Sbjct: 131 LKVDASFLSTAVTSCGSTRNIRGGAPYQCLAIRTGFIANVYVGSSLITLYSKCRVIIDAY 190

Query: 182 KMFDEMPLKNVVSWTAMINGFAQENQVDRSLQLYNEMRNLKLKPNDFTFTSFLSLCTACG 361
           K+F+EMP++NVVSWTA+I  FAQE QVD  L+LY  MRN  L+PNDFTFTS LS CT  G
Sbjct: 191 KVFEEMPVRNVVSWTAIIAAFAQEWQVDMCLELYRMMRNSMLEPNDFTFTSILSACTGSG 250

Query: 362 SLGQGRSIHCQTIHLGFDSHIHIANALISMYCKCGNIEDAFFIFRNIDNKDLVSWNSMIA 541
           +LGQGRS HCQTI +GF S+I +AN+LISMYCKCGN+E+A ++F N+  KD+VSWNSMIA
Sbjct: 251 ALGQGRSAHCQTIRMGFFSYIQVANSLISMYCKCGNVEEAVYVFNNMHGKDIVSWNSMIA 310

Query: 542 AYA 550
            YA
Sbjct: 311 GYA 313



 Score = 92.0 bits (227), Expect = 8e-17
 Identities = 54/174 (31%), Positives = 89/174 (51%), Gaps = 1/174 (0%)
 Frame = +2

Query: 26  SNALSICGSERALRMGIQFHCLVFVNGFISNIYVGSSLISFYGKCGILNDAYKMFDEMPL 205
           ++ LS C    AL  G   HC     GF S I V +SLIS Y KCG + +A  +F+ M  
Sbjct: 240 TSILSACTGSGALGQGRSAHCQTIRMGFFSYIQVANSLISMYCKCGNVEEAVYVFNNMHG 299

Query: 206 KNVVSWTAMINGFAQENQVDRSLQLYNEMRNLKLKPNDFTFTSFLSLCTACGSLGQGRSI 385
           K++VSW +MI G+AQ     R++ L+ EM   ++KP+  TF   +S C   G + +G+  
Sbjct: 300 KDIVSWNSMIAGYAQHGLAVRAIDLFEEMMKQRVKPDAITFLGVISSCRHGGLVEEGKVY 359

Query: 386 HCQTIHLGFDSHIHIANALISMYCKCGNIEDA-FFIFRNIDNKDLVSWNSMIAA 544
                  G    +   + ++ +  + G +E+A  FI +     + V W S++++
Sbjct: 360 FDSMAKHGLKPELDHYSCVVDLLGRAGLLEEARDFIKQMPIYPNAVIWGSLLSS 413



 Score = 60.1 bits (144), Expect = 3e-07
 Identities = 32/101 (31%), Positives = 62/101 (61%), Gaps = 4/101 (3%)
 Frame = +2

Query: 260 VDRSLQLYNEMRNLKLKPNDFTFTSFLSLC-TACGS---LGQGRSIHCQTIHLGFDSHIH 427
           V++ + +++++   +LK +     SFLS   T+CGS   +  G    C  I  GF ++++
Sbjct: 116 VEKLISMHHDLHRERLKVD----ASFLSTAVTSCGSTRNIRGGAPYQCLAIRTGFIANVY 171

Query: 428 IANALISMYCKCGNIEDAFFIFRNIDNKDLVSWNSMIAAYA 550
           + ++LI++Y KC  I DA+ +F  +  +++VSW ++IAA+A
Sbjct: 172 VGSSLITLYSKCRVIIDAYKVFEEMPVRNVVSWTAIIAAFA 212


>gb|EMJ06150.1| hypothetical protein PRUPE_ppa000364mg [Prunus persica]
          Length = 1243

 Score =  243 bits (620), Expect = 2e-62
 Identities = 114/181 (62%), Positives = 144/181 (79%)
 Frame = +2

Query: 8    IDPSVLSNALSICGSERALRMGIQFHCLVFVNGFISNIYVGSSLISFYGKCGILNDAYKM 187
            ID SVLS+A+S  GS R L  GI +HC    +G ++N+Y+GSSL+SFYG+C  L +AY++
Sbjct: 792  IDASVLSHAISSYGSSRNLHGGIPYHCAAIRSGLVANVYIGSSLVSFYGRCNELQNAYRV 851

Query: 188  FDEMPLKNVVSWTAMINGFAQENQVDRSLQLYNEMRNLKLKPNDFTFTSFLSLCTACGSL 367
            F+EMP++NVVSWTA+I+GFAQE QVD  LQL++EMR+   KPNDFT+ S LS CT  G+L
Sbjct: 852  FEEMPVRNVVSWTAIISGFAQEWQVDACLQLFSEMRHSS-KPNDFTYASILSACTGSGAL 910

Query: 368  GQGRSIHCQTIHLGFDSHIHIANALISMYCKCGNIEDAFFIFRNIDNKDLVSWNSMIAAY 547
            G GRS HC TI +GFD +IHIANALISMYCKCG+++DA  IF+N+D KD VSWNSMIA Y
Sbjct: 911  GHGRSAHCHTIRMGFDLYIHIANALISMYCKCGDVKDALCIFKNLDGKDNVSWNSMIAGY 970

Query: 548  A 550
            A
Sbjct: 971  A 971



 Score = 84.3 bits (207), Expect = 2e-14
 Identities = 52/175 (29%), Positives = 88/175 (50%), Gaps = 2/175 (1%)
 Frame = +2

Query: 26   SNALSICGSERALRMGIQFHCLVFVNGFISNIYVGSSLISFYGKCGILNDAYKMFDEMPL 205
            ++ LS C    AL  G   HC     GF   I++ ++LIS Y KCG + DA  +F  +  
Sbjct: 898  ASILSACTGSGALGHGRSAHCHTIRMGFDLYIHIANALISMYCKCGDVKDALCIFKNLDG 957

Query: 206  KNVVSWTAMINGFAQENQVDRSLQLYNEMRNLKLKPNDFTFTSFLSLCTACGSLGQGRSI 385
            K+ VSW +MI G+AQ     +++ L+ EM+   ++P+  T    LS C   G + +GRS 
Sbjct: 958  KDNVSWNSMIAGYAQHGLASQAIDLFEEMKQQCVEPDAITLLGVLSSCRHAGLVQEGRSY 1017

Query: 386  HCQTI-HLGFDSHIHIANALISMYCKCGNIEDA-FFIFRNIDNKDLVSWNSMIAA 544
                I   G    +   + +I +  + G +E+A  FI +     + + W S++++
Sbjct: 1018 FNSMIKEHGIQPELDHYSCVIDLLGRAGCLEEAQCFIEKMPIRPNAIIWGSLLSS 1072


>ref|XP_006410915.1| hypothetical protein EUTSA_v10017783mg [Eutrema salsugineum]
           gi|557112084|gb|ESQ52368.1| hypothetical protein
           EUTSA_v10017783mg [Eutrema salsugineum]
          Length = 502

 Score =  233 bits (594), Expect = 2e-59
 Identities = 110/176 (62%), Positives = 138/176 (78%)
 Frame = +2

Query: 23  LSNALSICGSERALRMGIQFHCLVFVNGFISNIYVGSSLISFYGKCGILNDAYKMFDEMP 202
           LS+A+S CGS R  R G  FHC+    GF S++YVGSSL+  Y   G L+DA+K+F+EMP
Sbjct: 123 LSSAVSSCGSNRDFRSGSGFHCVALKCGFTSDVYVGSSLVVLYRGSGDLDDAHKVFEEMP 182

Query: 203 LKNVVSWTAMINGFAQENQVDRSLQLYNEMRNLKLKPNDFTFTSFLSLCTACGSLGQGRS 382
            KNVVSWTAMI+GFAQE +VD  L LY++MRN +  PND+TFT+ LS CT  G+LGQGRS
Sbjct: 183 EKNVVSWTAMISGFAQEWRVDICLNLYSKMRNSRSFPNDYTFTALLSACTGSGALGQGRS 242

Query: 383 IHCQTIHLGFDSHIHIANALISMYCKCGNIEDAFFIFRNIDNKDLVSWNSMIAAYA 550
           +HCQT+ +GF S++HI+NALISMYCKCG+++DAF IF    NKD+VSWNSMIA YA
Sbjct: 243 VHCQTLQMGFKSYLHISNALISMYCKCGDLKDAFCIFDQFSNKDVVSWNSMIAGYA 298



 Score = 92.4 bits (228), Expect = 6e-17
 Identities = 54/171 (31%), Positives = 89/171 (52%), Gaps = 3/171 (1%)
 Frame = +2

Query: 35  LSICGSERALRMGIQFHCLVFVNGFISNIYVGSSLISFYGKCGILNDAYKMFDEMPLKNV 214
           LS C    AL  G   HC     GF S +++ ++LIS Y KCG L DA+ +FD+   K+V
Sbjct: 228 LSACTGSGALGQGRSVHCQTLQMGFKSYLHISNALISMYCKCGDLKDAFCIFDQFSNKDV 287

Query: 215 VSWTAMINGFAQENQVDRSLQLYNEMRNLK--LKPNDFTFTSFLSLCTACGSLGQGRSIH 388
           VSW +MI G+AQ     ++++L+ E+  LK   KP+  T+   LS C   G + +GR + 
Sbjct: 288 VSWNSMIAGYAQHGLASQAIELF-EVMMLKSGTKPDAITYLGVLSSCRHAGLVKEGRKVF 346

Query: 389 CQTIHLGFDSHIHIANALISMYCKCGNIEDAFFIFRNID-NKDLVSWNSMI 538
                 G    +   + L+ +  + G +++A  +  N+    + V W S++
Sbjct: 347 DSMEEHGLKPELSHYSCLVDLLGRFGLLQEALELIENMPMEPNPVIWGSLL 397


>ref|XP_004142220.1| PREDICTED: pentatricopeptide repeat-containing protein
           At2g37320-like [Cucumis sativus]
           gi|449502632|ref|XP_004161699.1| PREDICTED:
           pentatricopeptide repeat-containing protein
           At2g37320-like [Cucumis sativus]
          Length = 524

 Score =  231 bits (588), Expect = 1e-58
 Identities = 104/176 (59%), Positives = 140/176 (79%)
 Frame = +2

Query: 23  LSNALSICGSERALRMGIQFHCLVFVNGFISNIYVGSSLISFYGKCGILNDAYKMFDEMP 202
           +S+ LS+C S+R LR GIQ+H +    GFI+N+YVGSSL+S YGKCG L++AY++FDEMP
Sbjct: 133 ISSVLSLCNSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNAYRVFDEMP 192

Query: 203 LKNVVSWTAMINGFAQENQVDRSLQLYNEMRNLKLKPNDFTFTSFLSLCTACGSLGQGRS 382
           ++NVVSWTA+I GFA E QV+  L+L+ EM+ + L+PN+FTF + L+ CT  G+LG GRS
Sbjct: 193 VRNVVSWTAIIAGFAVEWQVNMCLELFQEMKRMALQPNEFTFVTILTACTGSGALGVGRS 252

Query: 383 IHCQTIHLGFDSHIHIANALISMYCKCGNIEDAFFIFRNIDNKDLVSWNSMIAAYA 550
           +HCQT+ +GF S++H+ANALISMYCKCG +  A +IF  ++ KD VSWNSMIA YA
Sbjct: 253 LHCQTVKMGFHSYLHVANALISMYCKCGALNFALYIFEAMEVKDTVSWNSMIAGYA 308



 Score = 91.7 bits (226), Expect = 1e-16
 Identities = 55/172 (31%), Positives = 90/172 (52%), Gaps = 2/172 (1%)
 Frame = +2

Query: 35  LSICGSERALRMGIQFHCLVFVNGFISNIYVGSSLISFYGKCGILNDAYKMFDEMPLKNV 214
           L+ C    AL +G   HC     GF S ++V ++LIS Y KCG LN A  +F+ M +K+ 
Sbjct: 238 LTACTGSGALGVGRSLHCQTVKMGFHSYLHVANALISMYCKCGALNFALYIFEAMEVKDT 297

Query: 215 VSWTAMINGFAQENQVDRSLQLYNEMRNLK-LKPNDFTFTSFLSLCTACGSLGQGRSIHC 391
           VSW +MI G+AQ     R++ L+  MR  K ++ +  TF   LS C   G + +GR    
Sbjct: 298 VSWNSMIAGYAQHGLSLRAIDLFKAMRKQKQVEADAITFLGVLSSCRHAGFVEEGRHYFN 357

Query: 392 QTIHLGFDSHIHIANALISMYCKCGNIEDA-FFIFRNIDNKDLVSWNSMIAA 544
             + LG    +   + +I +  + G +++A  FI +     + + W S+++A
Sbjct: 358 LMVELGLKPELDHYSCVIDLLGRAGLLKEAQNFIEKMPITPNSIVWGSLLSA 409



 Score = 60.1 bits (144), Expect = 3e-07
 Identities = 28/83 (33%), Positives = 51/83 (61%)
 Frame = +2

Query: 302 KLKPNDFTFTSFLSLCTACGSLGQGRSIHCQTIHLGFDSHIHIANALISMYCKCGNIEDA 481
           K   ND +  S LSLC +  +L  G   H   I  GF +++++ ++L+S+Y KCG + +A
Sbjct: 127 KFNANDIS--SVLSLCNSQRNLRGGIQYHSVAIRTGFIANVYVGSSLVSLYGKCGELSNA 184

Query: 482 FFIFRNIDNKDLVSWNSMIAAYA 550
           + +F  +  +++VSW ++IA +A
Sbjct: 185 YRVFDEMPVRNVVSWTAIIAGFA 207


>ref|XP_006345691.1| PREDICTED: pentatricopeptide repeat-containing protein
           At2g37320-like [Solanum tuberosum]
          Length = 483

 Score =  230 bits (587), Expect = 1e-58
 Identities = 105/180 (58%), Positives = 137/180 (76%)
 Frame = +2

Query: 8   IDPSVLSNALSICGSERALRMGIQFHCLVFVNGFISNIYVGSSLISFYGKCGILNDAYKM 187
           I  S++SNA+S+C S+R   +GIQ HCLV VNGF+SN+Y+GSSLI+FY   G + DAY+ 
Sbjct: 93  IGVSLVSNAMSLCASKRVFNVGIQVHCLVIVNGFLSNVYIGSSLITFYSNFGAIVDAYQA 152

Query: 188 FDEMPLKNVVSWTAMINGFAQENQVDRSLQLYNEMRNLKLKPNDFTFTSFLSLCTACGSL 367
           FDEM ++NVVSW+AM+NGFA+EN++   L+LY  M  L LK N+F FTS LS+C   G  
Sbjct: 153 FDEMSVRNVVSWSAMLNGFAKENELGMCLKLYKGMMGLGLKVNEFVFTSLLSVCMGSGCF 212

Query: 368 GQGRSIHCQTIHLGFDSHIHIANALISMYCKCGNIEDAFFIFRNIDNKDLVSWNSMIAAY 547
           G GRSIHCQ I +GF+S++H+ANA++SMY KCG ++DA  IF N  +KDLVSWNSMI  Y
Sbjct: 213 GHGRSIHCQIIVMGFESYVHVANAILSMYSKCGEVKDAMCIFDNTKSKDLVSWNSMICGY 272



 Score = 96.7 bits (239), Expect = 3e-18
 Identities = 50/180 (27%), Positives = 94/180 (52%), Gaps = 1/180 (0%)
 Frame = +2

Query: 2   LIIDPSVLSNALSICGSERALRMGIQFHCLVFVNGFISNIYVGSSLISFYGKCGILNDAY 181
           L ++  V ++ LS+C        G   HC + V GF S ++V ++++S Y KCG + DA 
Sbjct: 192 LKVNEFVFTSLLSVCMGSGCFGHGRSIHCQIIVMGFESYVHVANAILSMYSKCGEVKDAM 251

Query: 182 KMFDEMPLKNVVSWTAMINGFAQENQVDRSLQLYNEMRNLKLKPNDFTFTSFLSLCTACG 361
            +FD    K++VSW +MI G+ Q+    ++++L+ EM+  +++P+  TF   LS C   G
Sbjct: 252 CIFDNTKSKDLVSWNSMICGYGQQGHAIQAIELFEEMKKQEVRPDSITFLGVLSSCRHAG 311

Query: 362 SLGQGRSIHCQTIHLGFDSHIHIANALISMYCKCGNIEDA-FFIFRNIDNKDLVSWNSMI 538
            + +G S     +  G    +   + ++ +  +   +E+A  FI +     + V W S++
Sbjct: 312 FVKEGMSYFNSMVDYGVKPEVDHYSCIVDLLGRARLLEEAREFIKKMPIQPNGVIWGSLL 371


>ref|XP_004492962.1| PREDICTED: pentatricopeptide repeat-containing protein
           At2g37320-like [Cicer arietinum]
          Length = 512

 Score =  230 bits (587), Expect = 1e-58
 Identities = 107/181 (59%), Positives = 138/181 (76%)
 Frame = +2

Query: 8   IDPSVLSNALSICGSERALRMGIQFHCLVFVNGFISNIYVGSSLISFYGKCGILNDAYKM 187
           ID   LS ALS CGS+R L  GIQ+HCL    GFI+N+YVG+SLIS Y +CG+L DAY++
Sbjct: 115 IDVCFLSLALSSCGSKRDLYGGIQYHCLAITTGFIANVYVGTSLISLYSRCGLLGDAYRV 174

Query: 188 FDEMPLKNVVSWTAMINGFAQENQVDRSLQLYNEMRNLKLKPNDFTFTSFLSLCTACGSL 367
           FDEM  +NVVSWTA+I GFAQE + D  L+L+++MR+L+LKPN FT+TS LS C   G+L
Sbjct: 175 FDEMSERNVVSWTAIIAGFAQEWRADMCLELFHKMRDLELKPNYFTYTSLLSACMGSGAL 234

Query: 368 GQGRSIHCQTIHLGFDSHIHIANALISMYCKCGNIEDAFFIFRNIDNKDLVSWNSMIAAY 547
           G GR +HCQ I  GF  ++H+ NALI+MY KCG I+DA +IF N+ N+D+V+WNSMI  Y
Sbjct: 235 GHGRGVHCQIIQTGFHCYLHVYNALIAMYSKCGAIDDALYIFENMVNRDVVTWNSMIVGY 294

Query: 548 A 550
           A
Sbjct: 295 A 295



 Score = 85.1 bits (209), Expect = 1e-14
 Identities = 46/174 (26%), Positives = 87/174 (50%), Gaps = 1/174 (0%)
 Frame = +2

Query: 26  SNALSICGSERALRMGIQFHCLVFVNGFISNIYVGSSLISFYGKCGILNDAYKMFDEMPL 205
           ++ LS C    AL  G   HC +   GF   ++V ++LI+ Y KCG ++DA  +F+ M  
Sbjct: 222 TSLLSACMGSGALGHGRGVHCQIIQTGFHCYLHVYNALIAMYSKCGAIDDALYIFENMVN 281

Query: 206 KNVVSWTAMINGFAQENQVDRSLQLYNEMRNLKLKPNDFTFTSFLSLCTACGSLGQGRSI 385
           ++VV+W +MI G+A       ++ L+ EM    + P+  TF   LS C   G + +G+  
Sbjct: 282 RDVVTWNSMIVGYAHHGLAQEAISLFEEMTKQGVNPDAVTFLGILSSCRHGGLVKEGQVY 341

Query: 386 HCQTIHLGFDSHIHIANALISMYCKCGNIEDAFFIFRNID-NKDLVSWNSMIAA 544
               +  G    +   + ++ +  + G + +A    +N+    + V W S++++
Sbjct: 342 FNSMVDHGLQPKLDHYSCIVDLLGRAGLLLEALDFIKNMPVCPNAVIWGSLLSS 395


>ref|XP_003624377.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
           gi|355499392|gb|AES80595.1| Pentatricopeptide
           repeat-containing protein [Medicago truncatula]
          Length = 487

 Score =  227 bits (579), Expect = 1e-57
 Identities = 106/177 (59%), Positives = 136/177 (76%)
 Frame = +2

Query: 8   IDPSVLSNALSICGSERALRMGIQFHCLVFVNGFISNIYVGSSLISFYGKCGILNDAYKM 187
           ID   LS+ALS+CGS+R    GIQ+HCL    GFI+N+YVGSSLIS Y +CG+L DAY++
Sbjct: 106 IDVCFLSHALSLCGSKRDFYGGIQYHCLAIRIGFIANVYVGSSLISLYSRCGLLGDAYRV 165

Query: 188 FDEMPLKNVVSWTAMINGFAQENQVDRSLQLYNEMRNLKLKPNDFTFTSFLSLCTACGSL 367
           FDEM ++NVVSWTA+I GFAQE +VD  L+L+  MR L+LKPN FT+TS LS C   G+L
Sbjct: 166 FDEMSVRNVVSWTAIIAGFAQEWRVDMCLELFRRMRGLELKPNYFTYTSLLSACMGSGAL 225

Query: 368 GQGRSIHCQTIHLGFDSHIHIANALISMYCKCGNIEDAFFIFRNIDNKDLVSWNSMI 538
           G GR +HCQ I +GF  ++H+ NALI+MY KCG I DA +IF N+ +KD+V+WNSMI
Sbjct: 226 GHGRGVHCQIIQMGFHCYLHVENALIAMYSKCGVIVDALYIFENMVSKDVVTWNSMI 282



 Score = 71.6 bits (174), Expect = 1e-10
 Identities = 45/174 (25%), Positives = 82/174 (47%), Gaps = 1/174 (0%)
 Frame = +2

Query: 26  SNALSICGSERALRMGIQFHCLVFVNGFISNIYVGSSLISFYGKCGILNDAYKMFDEMPL 205
           ++ LS C    AL  G   HC +   GF   ++V ++LI+ Y KCG++ DA  +F+ M  
Sbjct: 213 TSLLSACMGSGALGHGRGVHCQIIQMGFHCYLHVENALIAMYSKCGVIVDALYIFENMVS 272

Query: 206 KNVVSWTAMINGFAQENQVDRSLQLYNEMRNLKLKPNDFTFTSFLSLCTACGSLGQGRSI 385
           K+VV+W +MI G                MR + + P+  TF   LS C   G + +G+  
Sbjct: 273 KDVVTWNSMIVG----------------MRIMGVNPDAVTFLGILSSCRHGGLVKEGQVY 316

Query: 386 HCQTIHLGFDSHIHIANALISMYCKCGNIEDAFFIFRNID-NKDLVSWNSMIAA 544
               +  G    +   + ++ +  + G + +A    +N+    + V W S++++
Sbjct: 317 FSSMVDHGLQPELDHYSCIVDLLGRAGLLLEALDFIQNMPVCPNAVIWGSLLSS 370



 Score = 59.7 bits (143), Expect = 5e-07
 Identities = 25/71 (35%), Positives = 46/71 (64%)
 Frame = +2

Query: 338 LSLCTACGSLGQGRSIHCQTIHLGFDSHIHIANALISMYCKCGNIEDAFFIFRNIDNKDL 517
           LSLC +      G   HC  I +GF +++++ ++LIS+Y +CG + DA+ +F  +  +++
Sbjct: 115 LSLCGSKRDFYGGIQYHCLAIRIGFIANVYVGSSLISLYSRCGLLGDAYRVFDEMSVRNV 174

Query: 518 VSWNSMIAAYA 550
           VSW ++IA +A
Sbjct: 175 VSWTAIIAGFA 185


>ref|NP_181269.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
           gi|75216847|sp|Q9ZUT4.1|PP192_ARATH RecName:
           Full=Pentatricopeptide repeat-containing protein
           At2g37320 gi|4056486|gb|AAC98052.1| hypothetical protein
           [Arabidopsis thaliana] gi|37202040|gb|AAQ89635.1|
           At2g37320 [Arabidopsis thaliana]
           gi|51969760|dbj|BAD43572.1| hypothetical protein
           [Arabidopsis thaliana] gi|330254289|gb|AEC09383.1|
           pentatricopeptide repeat-containing protein [Arabidopsis
           thaliana]
          Length = 500

 Score =  226 bits (577), Expect = 2e-57
 Identities = 105/176 (59%), Positives = 135/176 (76%)
 Frame = +2

Query: 23  LSNALSICGSERALRMGIQFHCLVFVNGFISNIYVGSSLISFYGKCGILNDAYKMFDEMP 202
           LS+A+  CG  R  R G  FHCL    GFIS++Y+GSSL+  Y   G + +AYK+F+EMP
Sbjct: 123 LSSAVRSCGLNRDFRTGSGFHCLALKGGFISDVYLGSSLVVLYRDSGEVENAYKVFEEMP 182

Query: 203 LKNVVSWTAMINGFAQENQVDRSLQLYNEMRNLKLKPNDFTFTSFLSLCTACGSLGQGRS 382
            +NVVSWTAMI+GFAQE +VD  L+LY++MR     PND+TFT+ LS CT  G+LGQGRS
Sbjct: 183 ERNVVSWTAMISGFAQEWRVDICLKLYSKMRKSTSDPNDYTFTALLSACTGSGALGQGRS 242

Query: 383 IHCQTIHLGFDSHIHIANALISMYCKCGNIEDAFFIFRNIDNKDLVSWNSMIAAYA 550
           +HCQT+H+G  S++HI+N+LISMYCKCG+++DAF IF    NKD+VSWNSMIA YA
Sbjct: 243 VHCQTLHMGLKSYLHISNSLISMYCKCGDLKDAFRIFDQFSNKDVVSWNSMIAGYA 298



 Score = 92.4 bits (228), Expect = 6e-17
 Identities = 53/170 (31%), Positives = 87/170 (51%), Gaps = 2/170 (1%)
 Frame = +2

Query: 35  LSICGSERALRMGIQFHCLVFVNGFISNIYVGSSLISFYGKCGILNDAYKMFDEMPLKNV 214
           LS C    AL  G   HC     G  S +++ +SLIS Y KCG L DA+++FD+   K+V
Sbjct: 228 LSACTGSGALGQGRSVHCQTLHMGLKSYLHISNSLISMYCKCGDLKDAFRIFDQFSNKDV 287

Query: 215 VSWTAMINGFAQENQVDRSLQLYN-EMRNLKLKPNDFTFTSFLSLCTACGSLGQGRSIHC 391
           VSW +MI G+AQ     ++++L+   M     KP+  T+   LS C   G + +GR    
Sbjct: 288 VSWNSMIAGYAQHGLAMQAIELFELMMPKSGTKPDAITYLGVLSSCRHAGLVKEGRKFFN 347

Query: 392 QTIHLGFDSHIHIANALISMYCKCGNIEDAFFIFRNIDNK-DLVSWNSMI 538
                G    ++  + L+ +  + G +++A  +  N+  K + V W S++
Sbjct: 348 LMAEHGLKPELNHYSCLVDLLGRFGLLQEALELIENMPMKPNSVIWGSLL 397


>gb|ESW26688.1| hypothetical protein PHAVU_003G140000g [Phaseolus vulgaris]
          Length = 520

 Score =  225 bits (573), Expect = 6e-57
 Identities = 102/181 (56%), Positives = 138/181 (76%)
 Frame = +2

Query: 8   IDPSVLSNALSICGSERALRMGIQFHCLVFVNGFISNIYVGSSLISFYGKCGILNDAYKM 187
           +D   LS A+S CGS+R L  GIQ+HCL    GFI+N+YVGSSLISFY +C +L+DAY+M
Sbjct: 120 VDVCFLSQAVSSCGSKRDLWGGIQYHCLAITTGFIANVYVGSSLISFYSRCALLDDAYRM 179

Query: 188 FDEMPLKNVVSWTAMINGFAQENQVDRSLQLYNEMRNLKLKPNDFTFTSFLSLCTACGSL 367
           F EMP++NVVSWTA+I GF+QE +VD  ++L+ +M    L+PN FT+TS LS C   G+L
Sbjct: 180 FGEMPVRNVVSWTAIIAGFSQEWRVDMCVELFRQMMGSDLRPNYFTYTSLLSACMGSGAL 239

Query: 368 GQGRSIHCQTIHLGFDSHIHIANALISMYCKCGNIEDAFFIFRNIDNKDLVSWNSMIAAY 547
             GR  HCQ IHLGF S++HI NALI+MY KCG I++A +IF N++ +D+V+WN++I+ Y
Sbjct: 240 EYGRCAHCQIIHLGFHSYLHIDNALIAMYSKCGAIDEALYIFENMEGRDIVTWNTVISGY 299

Query: 548 A 550
           A
Sbjct: 300 A 300



 Score = 85.1 bits (209), Expect = 1e-14
 Identities = 45/174 (25%), Positives = 90/174 (51%), Gaps = 1/174 (0%)
 Frame = +2

Query: 26  SNALSICGSERALRMGIQFHCLVFVNGFISNIYVGSSLISFYGKCGILNDAYKMFDEMPL 205
           ++ LS C    AL  G   HC +   GF S +++ ++LI+ Y KCG +++A  +F+ M  
Sbjct: 227 TSLLSACMGSGALEYGRCAHCQIIHLGFHSYLHIDNALIAMYSKCGAIDEALYIFENMEG 286

Query: 206 KNVVSWTAMINGFAQENQVDRSLQLYNEMRNLKLKPNDFTFTSFLSLCTACGSLGQGRSI 385
           +++V+W  +I+G+AQ      ++ L+ EM    + P+  T+ S LS C   G + +G+  
Sbjct: 287 RDIVTWNTVISGYAQHGLAQEAISLFEEMLKQGVNPDAVTYLSVLSSCRHGGLIKEGQVY 346

Query: 386 HCQTIHLGFDSHIHIANALISMYCKCGNIEDAFFIFRNID-NKDLVSWNSMIAA 544
               I  G    +   + ++ +  + G + +A    +N+    + V W S++++
Sbjct: 347 FNSMIEHGVQPELDHYSCIVDLLGRAGLLVEARDFVQNMPVFPNAVVWGSLLSS 400


>ref|XP_002879661.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
           subsp. lyrata] gi|297325500|gb|EFH55920.1|
           pentatricopeptide repeat-containing protein [Arabidopsis
           lyrata subsp. lyrata]
          Length = 500

 Score =  224 bits (571), Expect = 1e-56
 Identities = 105/176 (59%), Positives = 134/176 (76%)
 Frame = +2

Query: 23  LSNALSICGSERALRMGIQFHCLVFVNGFISNIYVGSSLISFYGKCGILNDAYKMFDEMP 202
           LS+A+  CGS R  R G  FHCL    GFIS++Y+GSSL+  Y   G + +A+K+F EMP
Sbjct: 123 LSSAVRSCGSNRDFRTGSGFHCLALKGGFISDVYLGSSLVVLYRDSGEVENAHKVFAEMP 182

Query: 203 LKNVVSWTAMINGFAQENQVDRSLQLYNEMRNLKLKPNDFTFTSFLSLCTACGSLGQGRS 382
             NVVSWTAMI+GFAQE +VD  ++LY+EMRN    PND+TFT+ LS CT  G+LGQGRS
Sbjct: 183 DNNVVSWTAMISGFAQEWRVDICMKLYSEMRNSTSDPNDYTFTALLSACTGSGALGQGRS 242

Query: 383 IHCQTIHLGFDSHIHIANALISMYCKCGNIEDAFFIFRNIDNKDLVSWNSMIAAYA 550
           +HCQT+ +G  S++HI+N+LISMYCKCG+++DAF IF    NKD+VSWNSMIA YA
Sbjct: 243 VHCQTLQMGLKSYLHISNSLISMYCKCGDLKDAFRIFDQFSNKDVVSWNSMIAGYA 298



 Score = 94.0 bits (232), Expect = 2e-17
 Identities = 53/170 (31%), Positives = 88/170 (51%), Gaps = 2/170 (1%)
 Frame = +2

Query: 35  LSICGSERALRMGIQFHCLVFVNGFISNIYVGSSLISFYGKCGILNDAYKMFDEMPLKNV 214
           LS C    AL  G   HC     G  S +++ +SLIS Y KCG L DA+++FD+   K+V
Sbjct: 228 LSACTGSGALGQGRSVHCQTLQMGLKSYLHISNSLISMYCKCGDLKDAFRIFDQFSNKDV 287

Query: 215 VSWTAMINGFAQENQVDRSLQLYN-EMRNLKLKPNDFTFTSFLSLCTACGSLGQGRSIHC 391
           VSW +MI G+AQ     ++++L+   M    +KP+  T+   LS C   G + +GR    
Sbjct: 288 VSWNSMIAGYAQYGLATQAIELFELMMPKSGIKPDAITYLGLLSSCRHAGLVIEGRKFFN 347

Query: 392 QTIHLGFDSHIHIANALISMYCKCGNIEDAFFIFRNIDNK-DLVSWNSMI 538
                G    ++  + L+ +  + G +++A  +  N+  K + V W S++
Sbjct: 348 LMAERGLKPELNHYSCLVDLLGRFGLLQEALELIENMPMKPNSVIWGSLL 397


>gb|EXB31275.1| hypothetical protein L484_014760 [Morus notabilis]
          Length = 495

 Score =  221 bits (563), Expect = 9e-56
 Identities = 105/184 (57%), Positives = 137/184 (74%), Gaps = 2/184 (1%)
 Frame = +2

Query: 5   IIDPSVLSNALSICGSERALRMGIQFHCLVFVNGFISNIYVGSSLISFYGKCGILNDAYK 184
           I+D +VLS ALS CGS R LR G+Q HC    +GF++N+YVGSSL+S Y KC  L++AY 
Sbjct: 90  IMDATVLSLALSSCGSTRNLRAGVQHHCAAIRHGFVANVYVGSSLVSLYCKCNELDNAYL 149

Query: 185 MFDEMPLKNVVSWTAMINGFAQENQVDRSLQLYNEMRNLKLKPNDFTFTSFLSLCTACGS 364
           +F+EMP++NVVSWTA+I GFAQE +++  L+L+  M N   +PN FTF S LS C+  G+
Sbjct: 150 VFEEMPVRNVVSWTAIIAGFAQEWRINVCLELFQRMENSDSRPNHFTFASLLSACSGSGA 209

Query: 365 LGQGRSIHCQTIHLGFDSHIHIANALISMYCKCGNIEDAFFIFRNID--NKDLVSWNSMI 538
           LG GRS+H Q I +GF ++IHI+NALISMYCKCG + DA  +FR +D   +D VSWNSMI
Sbjct: 210 LGLGRSVHSQVIQMGFHAYIHISNALISMYCKCGALRDALHVFRTLDGHGRDTVSWNSMI 269

Query: 539 AAYA 550
           A YA
Sbjct: 270 AGYA 273



 Score = 85.1 bits (209), Expect = 1e-14
 Identities = 49/176 (27%), Positives = 89/176 (50%), Gaps = 3/176 (1%)
 Frame = +2

Query: 26  SNALSICGSERALRMGIQFHCLVFVNGFISNIYVGSSLISFYGKCGILNDAYKMFDEMPL 205
           ++ LS C    AL +G   H  V   GF + I++ ++LIS Y KCG L DA  +F  +  
Sbjct: 198 ASLLSACSGSGALGLGRSVHSQVIQMGFHAYIHISNALISMYCKCGALRDALHVFRTLDG 257

Query: 206 --KNVVSWTAMINGFAQENQVDRSLQLYNEMRNLKLKPNDFTFTSFLSLCTACGSLGQGR 379
             ++ VSW +MI G+AQ   V +++ L+ EM+    K +  TF   LS C   G + +GR
Sbjct: 258 HGRDTVSWNSMIAGYAQHGLVLQAIDLFEEMKQQGAKSDAITFLGVLSSCRHAGLVKEGR 317

Query: 380 SIHCQTIHLGFDSHIHIANALISMYCKCGNIEDAFFIFRNID-NKDLVSWNSMIAA 544
                 +  G +  +   + ++ +  + G +E+A  +   +    + + W S++++
Sbjct: 318 LYFDSMVEHGVEPELDHYSCIVDLLGRAGLLEEARDVIAKMPIRPNAIIWGSLLSS 373


>ref|XP_003550640.2| PREDICTED: pentatricopeptide repeat-containing protein
           At2g37320-like [Glycine max]
          Length = 511

 Score =  220 bits (560), Expect = 2e-55
 Identities = 102/181 (56%), Positives = 134/181 (74%)
 Frame = +2

Query: 8   IDPSVLSNALSICGSERALRMGIQFHCLVFVNGFISNIYVGSSLISFYGKCGILNDAYKM 187
           +D   LS A+S CGS+R L  GIQ+HCL    GF++++YVGSSLIS Y +C  L DA ++
Sbjct: 115 VDVFFLSQAVSSCGSKRDLWGGIQYHCLAITTGFVASVYVGSSLISLYSRCAFLGDACRV 174

Query: 188 FDEMPLKNVVSWTAMINGFAQENQVDRSLQLYNEMRNLKLKPNDFTFTSFLSLCTACGSL 367
           F+EMP++NVVSWTA+I GFAQE  VD  L+L+ +MR   L+PN FT+TS LS C   G+L
Sbjct: 175 FEEMPVRNVVSWTAIIAGFAQEWHVDMCLELFQQMRGSDLRPNYFTYTSLLSACMGSGAL 234

Query: 368 GQGRSIHCQTIHLGFDSHIHIANALISMYCKCGNIEDAFFIFRNIDNKDLVSWNSMIAAY 547
           G GR  HCQ I +GF S++HI NALISMY KCG I+DA  IF N+ ++D+V+WN+MI+ Y
Sbjct: 235 GHGRCAHCQIIRMGFHSYLHIENALISMYSKCGAIDDALHIFENMVSRDVVTWNTMISGY 294

Query: 548 A 550
           A
Sbjct: 295 A 295



 Score = 85.9 bits (211), Expect = 6e-15
 Identities = 47/174 (27%), Positives = 89/174 (51%), Gaps = 1/174 (0%)
 Frame = +2

Query: 26  SNALSICGSERALRMGIQFHCLVFVNGFISNIYVGSSLISFYGKCGILNDAYKMFDEMPL 205
           ++ LS C    AL  G   HC +   GF S +++ ++LIS Y KCG ++DA  +F+ M  
Sbjct: 222 TSLLSACMGSGALGHGRCAHCQIIRMGFHSYLHIENALISMYSKCGAIDDALHIFENMVS 281

Query: 206 KNVVSWTAMINGFAQENQVDRSLQLYNEMRNLKLKPNDFTFTSFLSLCTACGSLGQGRSI 385
           ++VV+W  MI+G+AQ      ++ L+ EM    + P+  T+   LS C   G + +G+  
Sbjct: 282 RDVVTWNTMISGYAQHGLAQEAINLFEEMIKQGVNPDAVTYLGVLSSCRHGGLVKEGQVY 341

Query: 386 HCQTIHLGFDSHIHIANALISMYCKCGNIEDAFFIFRNID-NKDLVSWNSMIAA 544
               +  G    +   + ++ +  + G + +A    +N+    + V W S++++
Sbjct: 342 FNSMVEHGVQPGLDHYSCIVDLLGRAGLLLEARDFIQNMPIFPNAVVWGSLLSS 395


>ref|XP_006294030.1| hypothetical protein CARUB_v10023022mg [Capsella rubella]
           gi|482562738|gb|EOA26928.1| hypothetical protein
           CARUB_v10023022mg [Capsella rubella]
          Length = 514

 Score =  215 bits (547), Expect = 7e-54
 Identities = 102/176 (57%), Positives = 132/176 (75%)
 Frame = +2

Query: 23  LSNALSICGSERALRMGIQFHCLVFVNGFISNIYVGSSLISFYGKCGILNDAYKMFDEMP 202
           LS+A+  CGS    R G  FHCL F  G IS++YVG+SL+  Y   G L+ A+K+F+EMP
Sbjct: 137 LSSAVRSCGSSGDFRTGSGFHCLAFKGGHISDVYVGNSLVVLYRDSGDLDSAHKVFEEMP 196

Query: 203 LKNVVSWTAMINGFAQENQVDRSLQLYNEMRNLKLKPNDFTFTSFLSLCTACGSLGQGRS 382
            K+VVSW AMI+GFAQ+ +VD  L+LY++MRN    PND+TFT+ LS CT  G+LGQGRS
Sbjct: 197 EKDVVSWAAMISGFAQKWRVDFCLKLYSKMRNSNSDPNDYTFTALLSACTGSGALGQGRS 256

Query: 383 IHCQTIHLGFDSHIHIANALISMYCKCGNIEDAFFIFRNIDNKDLVSWNSMIAAYA 550
           +HCQT+  GF +++HI+NALISMYCK G+++DA  IF    NKD+VSWNSMIA YA
Sbjct: 257 VHCQTLQTGFKTYLHISNALISMYCKSGDLKDALRIFDQFTNKDVVSWNSMIAGYA 312



 Score = 84.0 bits (206), Expect = 2e-14
 Identities = 49/170 (28%), Positives = 83/170 (48%), Gaps = 2/170 (1%)
 Frame = +2

Query: 35  LSICGSERALRMGIQFHCLVFVNGFISNIYVGSSLISFYGKCGILNDAYKMFDEMPLKNV 214
           LS C    AL  G   HC     GF + +++ ++LIS Y K G L DA ++FD+   K+V
Sbjct: 242 LSACTGSGALGQGRSVHCQTLQTGFKTYLHISNALISMYCKSGDLKDALRIFDQFTNKDV 301

Query: 215 VSWTAMINGFAQENQVDRSLQLYN-EMRNLKLKPNDFTFTSFLSLCTACGSLGQGRSIHC 391
           VSW +MI G+AQ     ++ +L+   M     KP+  T+   LS C   G + +GR    
Sbjct: 302 VSWNSMIAGYAQHGLATQAFELFEIMMPKSGTKPDAITYLGVLSSCRHAGLVKEGRKFFN 361

Query: 392 QTIHLGFDSHIHIANALISMYCKCGNIEDAFFIFRNID-NKDLVSWNSMI 538
                G    ++  + L+ +  + G +++A  +   +    + V W S++
Sbjct: 362 LMEEQGLKPELNHYSCLVDLLGRFGLLQEALKLIEGMPMEPNSVIWGSLL 411


>ref|XP_002462364.1| hypothetical protein SORBIDRAFT_02g024422 [Sorghum bicolor]
           gi|241925741|gb|EER98885.1| hypothetical protein
           SORBIDRAFT_02g024422 [Sorghum bicolor]
          Length = 520

 Score =  186 bits (473), Expect = 2e-45
 Identities = 88/180 (48%), Positives = 125/180 (69%)
 Frame = +2

Query: 11  DPSVLSNALSICGSERALRMGIQFHCLVFVNGFISNIYVGSSLISFYGKCGILNDAYKMF 190
           D S+L++ALS C   + L  G+QFH L+   G++S+I +GSSLISFY +CG L  A+++F
Sbjct: 132 DASILASALSYCADGKTLTAGVQFHALLVKVGYVSSIPIGSSLISFYSRCGQLEIAHRVF 191

Query: 191 DEMPLKNVVSWTAMINGFAQENQVDRSLQLYNEMRNLKLKPNDFTFTSFLSLCTACGSLG 370
             M  KN V+WTA+I+G+AQ+NQV+  L L+  MR    KPND TF +  S+CT    L 
Sbjct: 192 QNMTAKNTVTWTALISGYAQDNQVEPCLHLFALMRRSVCKPNDITFATIFSVCTNHAFLV 251

Query: 371 QGRSIHCQTIHLGFDSHIHIANALISMYCKCGNIEDAFFIFRNIDNKDLVSWNSMIAAYA 550
            G+S+    + +GFDS++H++NALISMY KCG+I +A  +F +I  KDLVSWNS+I  Y+
Sbjct: 252 LGKSVQALQMRMGFDSYVHVSNALISMYAKCGSIGEARAVFESITCKDLVSWNSLIFGYS 311



 Score = 76.6 bits (187), Expect = 4e-12
 Identities = 45/170 (26%), Positives = 86/170 (50%), Gaps = 1/170 (0%)
 Frame = +2

Query: 38  SICGSERALRMGIQFHCLVFVNGFISNIYVGSSLISFYGKCGILNDAYKMFDEMPLKNVV 217
           S+C +   L +G     L    GF S ++V ++LIS Y KCG + +A  +F+ +  K++V
Sbjct: 242 SVCTNHAFLVLGKSVQALQMRMGFDSYVHVSNALISMYAKCGSIGEARAVFESITCKDLV 301

Query: 218 SWTAMINGFAQENQVDRSLQLYNEMRNLKLKPNDFTFTSFLSLCTACGSLGQGRSIHCQT 397
           SW ++I G++Q    +  L L  EM    + P+  +F   LS C     + +GR      
Sbjct: 302 SWNSLIFGYSQHGLAEHCLGLLKEMEG-HIIPDAISFLGVLSSCRHACLVAEGRRCFRAM 360

Query: 398 IHLGFDSHIHIANALISMYCKCGNIEDAFFIFRNID-NKDLVSWNSMIAA 544
           I  G    I   + ++ ++ + G +++A+ + + +    + V W S++A+
Sbjct: 361 IEHGVIPEIDHYSCMVDLFGRAGLLDEAWDLIQTMPMPPNGVIWGSLLAS 410



 Score = 58.5 bits (140), Expect = 1e-06
 Identities = 31/109 (28%), Positives = 56/109 (51%)
 Frame = +2

Query: 224 TAMINGFAQENQVDRSLQLYNEMRNLKLKPNDFTFTSFLSLCTACGSLGQGRSIHCQTIH 403
           T++     +  + D  +Q+    R  ++  +     S LS C    +L  G   H   + 
Sbjct: 102 TSVCPSITKLTKEDMFMQIVELHRRGQISSDASILASALSYCADGKTLTAGVQFHALLVK 161

Query: 404 LGFDSHIHIANALISMYCKCGNIEDAFFIFRNIDNKDLVSWNSMIAAYA 550
           +G+ S I I ++LIS Y +CG +E A  +F+N+  K+ V+W ++I+ YA
Sbjct: 162 VGYVSSIPIGSSLISFYSRCGQLEIAHRVFQNMTAKNTVTWTALISGYA 210


>ref|XP_003565649.1| PREDICTED: pentatricopeptide repeat-containing protein
           At2g37320-like [Brachypodium distachyon]
          Length = 567

 Score =  184 bits (466), Expect = 2e-44
 Identities = 88/180 (48%), Positives = 126/180 (70%)
 Frame = +2

Query: 11  DPSVLSNALSICGSERALRMGIQFHCLVFVNGFISNIYVGSSLISFYGKCGILNDAYKMF 190
           D S+ ++A+S C  ++++R G Q H L+   G+   +  G+SLIS Y +C  L +AY++F
Sbjct: 179 DVSIFASAISFCAVKQSIRGGGQLHALLVKVGYDLAVLSGTSLISLYARCYQLENAYQVF 238

Query: 191 DEMPLKNVVSWTAMINGFAQENQVDRSLQLYNEMRNLKLKPNDFTFTSFLSLCTACGSLG 370
             MP++NVVSWTA+I+G+AQ+NQV+  LQ++  MR    +PND TF +  S+CT    LG
Sbjct: 239 QNMPVRNVVSWTALISGYAQDNQVEPCLQVFQLMRQSACRPNDITFATIFSVCTNHALLG 298

Query: 371 QGRSIHCQTIHLGFDSHIHIANALISMYCKCGNIEDAFFIFRNIDNKDLVSWNSMIAAYA 550
            GRS+H   + +GFD  +HI NALISMY KCG+I++A FIF++I  KDLVSWNSMI  Y+
Sbjct: 299 LGRSVHGLELRMGFDLCVHILNALISMYAKCGSIDEAQFIFQSIACKDLVSWNSMIFGYS 358



 Score = 80.9 bits (198), Expect = 2e-13
 Identities = 44/170 (25%), Positives = 86/170 (50%), Gaps = 1/170 (0%)
 Frame = +2

Query: 38  SICGSERALRMGIQFHCLVFVNGFISNIYVGSSLISFYGKCGILNDAYKMFDEMPLKNVV 217
           S+C +   L +G   H L    GF   +++ ++LIS Y KCG +++A  +F  +  K++V
Sbjct: 289 SVCTNHALLGLGRSVHGLELRMGFDLCVHILNALISMYAKCGSIDEAQFIFQSIACKDLV 348

Query: 218 SWTAMINGFAQENQVDRSLQLYNEMRNLKLKPNDFTFTSFLSLCTACGSLGQGRSIHCQT 397
           SW +MI G++Q    +  L+L  EM    + P+  +F   LS C     + +GR      
Sbjct: 349 SWNSMIFGYSQYGLAEHCLKLLKEMEKEHIVPDVISFLGILSSCRHACLVEEGRRCFKAM 408

Query: 398 IHLGFDSHIHIANALISMYCKCGNIEDAFFIFRNID-NKDLVSWNSMIAA 544
           + LG +  +   + ++ +  + G +++A  +   +    + V W S+++A
Sbjct: 409 LKLGIEPELDHYSCMVDLLGRAGLLDEACDLIHTMSMTPNAVIWGSLLSA 458



 Score = 65.1 bits (157), Expect = 1e-08
 Identities = 35/120 (29%), Positives = 67/120 (55%)
 Frame = +2

Query: 191 DEMPLKNVVSWTAMINGFAQENQVDRSLQLYNEMRNLKLKPNDFTFTSFLSLCTACGSLG 370
           D + L+++ +  +M N F +EN+      L  EMR   +  +   F S +S C    S+ 
Sbjct: 143 DIIDLRDISACYSM-NKFKKENK----FMLLAEMRRRGISADVSIFASAISFCAVKQSIR 197

Query: 371 QGRSIHCQTIHLGFDSHIHIANALISMYCKCGNIEDAFFIFRNIDNKDLVSWNSMIAAYA 550
            G  +H   + +G+D  +    +LIS+Y +C  +E+A+ +F+N+  +++VSW ++I+ YA
Sbjct: 198 GGGQLHALLVKVGYDLAVLSGTSLISLYARCYQLENAYQVFQNMPVRNVVSWTALISGYA 257

