BLASTX nr result

ID: Akebia24_contig00025915

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia24_contig00025915
         (458 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002270963.2| PREDICTED: pentatricopeptide repeat-containi...   196   3e-48
emb|CBI29222.3| unnamed protein product [Vitis vinifera]              196   3e-48
ref|XP_007217153.1| hypothetical protein PRUPE_ppa001463mg [Prun...   195   5e-48
ref|XP_006351033.1| PREDICTED: pentatricopeptide repeat-containi...   194   1e-47
ref|XP_004249905.1| PREDICTED: pentatricopeptide repeat-containi...   191   9e-47
ref|XP_006432869.1| hypothetical protein CICLE_v10000274mg [Citr...   191   1e-46
ref|XP_002303480.2| pentatricopeptide repeat-containing family p...   173   3e-41
ref|XP_004149000.1| PREDICTED: pentatricopeptide repeat-containi...   168   8e-40
ref|XP_002519901.1| pentatricopeptide repeat-containing protein,...   162   5e-38
ref|XP_003548529.2| PREDICTED: pentatricopeptide repeat-containi...   161   8e-38
ref|XP_004166658.1| PREDICTED: LOW QUALITY PROTEIN: pentatricope...   161   8e-38
gb|AHB18408.1| pentatricopeptide repeat-containing protein [Goss...   161   1e-37
ref|XP_007040906.1| Tetratricopeptide repeat-like superfamily pr...   157   1e-36
sp|Q940A6.2|PP325_ARATH RecName: Full=Pentatricopeptide repeat-c...   156   2e-36
emb|CAA18631.1| putative protein [Arabidopsis thaliana] gi|72687...   156   2e-36
ref|NP_567587.1| pentatricopeptide repeat-containing protein [Ar...   156   2e-36
ref|XP_004509525.1| PREDICTED: pentatricopeptide repeat-containi...   156   3e-36
gb|EYU29134.1| hypothetical protein MIMGU_mgv1a001281mg [Mimulus...   155   6e-36
ref|XP_007156329.1| hypothetical protein PHAVU_003G277400g [Phas...   155   6e-36
ref|XP_006413978.1| hypothetical protein EUTSA_v10024401mg [Eutr...   152   4e-35

>ref|XP_002270963.2| PREDICTED: pentatricopeptide repeat-containing protein At4g19440,
           chloroplastic [Vitis vinifera]
          Length = 1022

 Score =  196 bits (498), Expect = 3e-48
 Identities = 97/153 (63%), Positives = 120/153 (78%), Gaps = 1/153 (0%)
 Frame = -3

Query: 456 LIDGKLPALFEKSKDRHIEISLAIAD-SSVDESELTMASFDLLVHVYCTQFKNLGLGFAF 280
           LID KLP LF   K+RHIEI+ A+AD + V ES + +A+ DLL+HVYCTQF+N+G   A 
Sbjct: 205 LIDRKLPVLFGDPKNRHIEIASAMADLNEVGESGVAVAAVDLLIHVYCTQFRNVGFRNAI 264

Query: 279 DVFRLLTNRGLFPSLKTCNFLLSSLVKANELDRSYEVFAIICRGFSPDVYSFSTAINAFC 100
            VFR L N+G+FP++KTC FLLSSLVKANEL++SY VF  + +G SPDVY FSTAINAFC
Sbjct: 265 GVFRFLANKGVFPTVKTCTFLLSSLVKANELEKSYWVFETMRQGVSPDVYLFSTAINAFC 324

Query: 99  KGGRIEVAAQLFYKMEVFGISPTVITYNTLIHG 1
           KGG++E A QLF+ ME  G+SP V+TYN LIHG
Sbjct: 325 KGGKVEDAIQLFFDMEKLGVSPNVVTYNNLIHG 357


>emb|CBI29222.3| unnamed protein product [Vitis vinifera]
          Length = 826

 Score =  196 bits (498), Expect = 3e-48
 Identities = 97/153 (63%), Positives = 120/153 (78%), Gaps = 1/153 (0%)
 Frame = -3

Query: 456 LIDGKLPALFEKSKDRHIEISLAIAD-SSVDESELTMASFDLLVHVYCTQFKNLGLGFAF 280
           LID KLP LF   K+RHIEI+ A+AD + V ES + +A+ DLL+HVYCTQF+N+G   A 
Sbjct: 138 LIDRKLPVLFGDPKNRHIEIASAMADLNEVGESGVAVAAVDLLIHVYCTQFRNVGFRNAI 197

Query: 279 DVFRLLTNRGLFPSLKTCNFLLSSLVKANELDRSYEVFAIICRGFSPDVYSFSTAINAFC 100
            VFR L N+G+FP++KTC FLLSSLVKANEL++SY VF  + +G SPDVY FSTAINAFC
Sbjct: 198 GVFRFLANKGVFPTVKTCTFLLSSLVKANELEKSYWVFETMRQGVSPDVYLFSTAINAFC 257

Query: 99  KGGRIEVAAQLFYKMEVFGISPTVITYNTLIHG 1
           KGG++E A QLF+ ME  G+SP V+TYN LIHG
Sbjct: 258 KGGKVEDAIQLFFDMEKLGVSPNVVTYNNLIHG 290


>ref|XP_007217153.1| hypothetical protein PRUPE_ppa001463mg [Prunus persica]
           gi|462413303|gb|EMJ18352.1| hypothetical protein
           PRUPE_ppa001463mg [Prunus persica]
          Length = 821

 Score =  195 bits (496), Expect = 5e-48
 Identities = 91/153 (59%), Positives = 118/153 (77%), Gaps = 1/153 (0%)
 Frame = -3

Query: 456 LIDGKLPALFEKSKDRHIEISLAIAD-SSVDESELTMASFDLLVHVYCTQFKNLGLGFAF 280
           LIDG +P L+     RH+EI++A+ D ++V    L + + DLL+HVYCTQFKN+G G+A 
Sbjct: 142 LIDGNVPVLYANHNQRHMEIAIAMLDLNTVSTQGLGVQALDLLIHVYCTQFKNMGFGYAI 201

Query: 279 DVFRLLTNRGLFPSLKTCNFLLSSLVKANELDRSYEVFAIICRGFSPDVYSFSTAINAFC 100
           D F + + +G+FPSLKTCNFLLSSLVKANEL +SY+VF ++CRG SPDVY F+TAINAFC
Sbjct: 202 DAFVIFSKKGVFPSLKTCNFLLSSLVKANELHKSYDVFEVMCRGVSPDVYLFTTAINAFC 261

Query: 99  KGGRIEVAAQLFYKMEVFGISPTVITYNTLIHG 1
           KGG+++ A  LF KME  GI P V+TYN +IHG
Sbjct: 262 KGGKVDDAIGLFSKMEGLGIVPNVVTYNNIIHG 294


>ref|XP_006351033.1| PREDICTED: pentatricopeptide repeat-containing protein At4g19440,
           chloroplastic-like [Solanum tuberosum]
          Length = 928

 Score =  194 bits (493), Expect = 1e-47
 Identities = 94/153 (61%), Positives = 121/153 (79%), Gaps = 1/153 (0%)
 Frame = -3

Query: 456 LIDGKLPALFEKSKDRHIEISLAIAD-SSVDESELTMASFDLLVHVYCTQFKNLGLGFAF 280
           LIDGKLPALF+ S+ +H+E+++++A+ S V +  + + +FDLL+H+ CTQFKN+G   A 
Sbjct: 240 LIDGKLPALFDTSQQKHVEVAVSLAELSGVSDFGVAVRTFDLLLHLCCTQFKNVGFDAAL 299

Query: 279 DVFRLLTNRGLFPSLKTCNFLLSSLVKANELDRSYEVFAIICRGFSPDVYSFSTAINAFC 100
           DVFR L +RG++PSLKTCNFLLSSLVK NEL +SYEVF I+  G  PDVY FSTAINAFC
Sbjct: 300 DVFRSLASRGVYPSLKTCNFLLSSLVKENELWKSYEVFGILKDGVEPDVYLFSTAINAFC 359

Query: 99  KGGRIEVAAQLFYKMEVFGISPTVITYNTLIHG 1
           KGG+++ A +LF KME  GI P V+TYN LIHG
Sbjct: 360 KGGKVDEAKELFRKMENIGIVPNVVTYNNLIHG 392



 Score = 60.1 bits (144), Expect = 3e-07
 Identities = 30/87 (34%), Positives = 55/87 (63%), Gaps = 1/87 (1%)
 Frame = -3

Query: 258 NRGLFPSLKTCNFLLSSLVKANELDRSYEVFAIICR-GFSPDVYSFSTAINAFCKGGRIE 82
           ++GL   + T   L++ L KA++L++  ++F  + R G +P++  ++T I AFC+ G ++
Sbjct: 691 SKGLVCDIYTYGALINGLCKADQLEKGRDLFHEMLRQGLAPNLIIYNTLIGAFCRNGNVK 750

Query: 81  VAAQLFYKMEVFGISPTVITYNTLIHG 1
            A +L   +   GI P V+TY++LIHG
Sbjct: 751 EALKLRDDIRSRGILPNVVTYSSLIHG 777



 Score = 58.5 bits (140), Expect = 9e-07
 Identities = 33/119 (27%), Positives = 62/119 (52%), Gaps = 1/119 (0%)
 Frame = -3

Query: 360 ELTMASFDLLVHVYCTQFKNLGLGFAFDVFRLLTNRGLFPSLKTCNFLLSSLVKANELDR 181
           ++   +++ L+  +C   K   L  AF +   +  +G+ P + T N LL  L +  + D 
Sbjct: 625 QIDSMTYNTLICAFC---KEGNLDGAFMLREEMVKQGIAPDVSTYNVLLHGLGEKGKTDE 681

Query: 180 SYEVF-AIICRGFSPDVYSFSTAINAFCKGGRIEVAAQLFYKMEVFGISPTVITYNTLI 7
           +  ++   + +G   D+Y++   IN  CK  ++E    LF++M   G++P +I YNTLI
Sbjct: 682 ALLLWDECLSKGLVCDIYTYGALINGLCKADQLEKGRDLFHEMLRQGLAPNLIIYNTLI 740


>ref|XP_004249905.1| PREDICTED: pentatricopeptide repeat-containing protein At4g19440,
           chloroplastic-like [Solanum lycopersicum]
          Length = 839

 Score =  191 bits (485), Expect = 9e-47
 Identities = 93/153 (60%), Positives = 120/153 (78%), Gaps = 1/153 (0%)
 Frame = -3

Query: 456 LIDGKLPALFEKSKDRHIEISLAIAD-SSVDESELTMASFDLLVHVYCTQFKNLGLGFAF 280
           LIDGKLPALF+  + +H+E+++++A+ S V +  + + +FDLL+H+ CTQFK++G   A 
Sbjct: 151 LIDGKLPALFDSLQQKHVEVAVSLAELSGVSDFGVAVRTFDLLLHLCCTQFKSVGFDAAL 210

Query: 279 DVFRLLTNRGLFPSLKTCNFLLSSLVKANELDRSYEVFAIICRGFSPDVYSFSTAINAFC 100
           DVFR L +RG++PSLKTCNFLLSSLVK NEL +SYEVF I+  G  PDVY FSTAINAFC
Sbjct: 211 DVFRSLASRGVYPSLKTCNFLLSSLVKENELWKSYEVFEILKDGVKPDVYLFSTAINAFC 270

Query: 99  KGGRIEVAAQLFYKMEVFGISPTVITYNTLIHG 1
           KGG++E A +LF KME  GI P V+TYN LIHG
Sbjct: 271 KGGKVEEAQELFRKMENMGILPNVVTYNNLIHG 303



 Score = 60.1 bits (144), Expect = 3e-07
 Identities = 30/87 (34%), Positives = 55/87 (63%), Gaps = 1/87 (1%)
 Frame = -3

Query: 258 NRGLFPSLKTCNFLLSSLVKANELDRSYEVFAIICR-GFSPDVYSFSTAINAFCKGGRIE 82
           ++GL   + T   L++ L KA++L++  ++F  + R G +P++  ++T I AFC+ G ++
Sbjct: 602 SKGLVCDIYTYGALINGLCKADQLEKGRDLFHEMLRQGLAPNLIVYNTLIGAFCRNGNVK 661

Query: 81  VAAQLFYKMEVFGISPTVITYNTLIHG 1
            A +L   +   GI P V+TY++LIHG
Sbjct: 662 EALKLRDDIRSRGILPNVVTYSSLIHG 688


>ref|XP_006432869.1| hypothetical protein CICLE_v10000274mg [Citrus clementina]
           gi|568835123|ref|XP_006471629.1| PREDICTED:
           pentatricopeptide repeat-containing protein At4g19440,
           chloroplastic-like isoform X1 [Citrus sinensis]
           gi|568835125|ref|XP_006471630.1| PREDICTED:
           pentatricopeptide repeat-containing protein At4g19440,
           chloroplastic-like isoform X2 [Citrus sinensis]
           gi|557534991|gb|ESR46109.1| hypothetical protein
           CICLE_v10000274mg [Citrus clementina]
          Length = 833

 Score =  191 bits (484), Expect = 1e-46
 Identities = 96/154 (62%), Positives = 116/154 (75%), Gaps = 2/154 (1%)
 Frame = -3

Query: 456 LIDGKLPALFEKSKD-RHIEISLAIADSSV-DESELTMASFDLLVHVYCTQFKNLGLGFA 283
           LIDGK+P L+  +   RHIEI+  + D +V  E  L +   DLLVHVYCTQFKNLG G+A
Sbjct: 143 LIDGKMPVLYASNPSIRHIEIASQMVDLNVTSEPALGVQIADLLVHVYCTQFKNLGFGYA 202

Query: 282 FDVFRLLTNRGLFPSLKTCNFLLSSLVKANELDRSYEVFAIICRGFSPDVYSFSTAINAF 103
            DVF + +N+G+FPSLKTCNFLL+SLVKANE+ +  EVF  +CRG SPDV+ FSTAINAF
Sbjct: 203 IDVFSIFSNKGIFPSLKTCNFLLNSLVKANEVQKGIEVFETMCRGVSPDVFLFSTAINAF 262

Query: 102 CKGGRIEVAAQLFYKMEVFGISPTVITYNTLIHG 1
           CK GRIE A  LF KME  GI+P V+TYN +IHG
Sbjct: 263 CKRGRIEDAIGLFTKMEELGIAPNVVTYNNIIHG 296



 Score = 60.5 bits (145), Expect = 2e-07
 Identities = 36/118 (30%), Positives = 66/118 (55%), Gaps = 1/118 (0%)
 Frame = -3

Query: 351 MASFDLLVHVYCTQFKNLGLGFAFDVFRLLTNRGLFPSLKTCNFLLSSLVKANELD-RSY 175
           + +++ ++H  C   +N  L  AF +   +  R + PSL T + L++ L+K  + D  ++
Sbjct: 287 VVTYNNIIHGLC---RNGRLYEAFHLKEKMVLREVEPSLITYSILINGLIKLEKFDDANF 343

Query: 174 EVFAIICRGFSPDVYSFSTAINAFCKGGRIEVAAQLFYKMEVFGISPTVITYNTLIHG 1
            +  +  RGF P+   ++T I+ +CK G I  A ++   M   G+SP  +T+N+LIHG
Sbjct: 344 VLKEMSVRGFVPNYVVYNTLIDGYCKKGNISEALKIRDDMVSKGMSPNSVTFNSLIHG 401


>ref|XP_002303480.2| pentatricopeptide repeat-containing family protein [Populus
           trichocarpa] gi|550342907|gb|EEE78459.2|
           pentatricopeptide repeat-containing family protein
           [Populus trichocarpa]
          Length = 842

 Score =  173 bits (438), Expect = 3e-41
 Identities = 93/155 (60%), Positives = 113/155 (72%), Gaps = 3/155 (1%)
 Frame = -3

Query: 456 LIDGKLPALFEKS-KDRHIEISLAIADSS-VDESELTMASFDLLVHVYCTQFKNLGLGFA 283
           LIDGK+PA + ++ + RH EI+  +AD + V E  + +   DLLVHVY TQFK+LG GFA
Sbjct: 152 LIDGKVPAFYARNFESRHFEIAQIMADFNLVFEPVIGVKIADLLVHVYSTQFKHLGFGFA 211

Query: 282 FDVFRLLTNRGLFPSLKTCNFLLSSLVKANELDRSYEVFAIIC-RGFSPDVYSFSTAINA 106
            DVF LL  +GLFPSLKTC FLLSSLVKANEL +SYEV+  IC  G  PDV+ FST INA
Sbjct: 212 ADVFSLLAKKGLFPSLKTCTFLLSSLVKANELKKSYEVYDFICLGGIIPDVHLFSTMINA 271

Query: 105 FCKGGRIEVAAQLFYKMEVFGISPTVITYNTLIHG 1
           FCKG R + A  LF KME  G++P V+TYN +IHG
Sbjct: 272 FCKGHREDDAIGLFSKMEKLGVAPNVVTYNNIIHG 306


>ref|XP_004149000.1| PREDICTED: pentatricopeptide repeat-containing protein At4g19440,
           chloroplastic-like [Cucumis sativus]
          Length = 822

 Score =  168 bits (425), Expect = 8e-40
 Identities = 90/154 (58%), Positives = 108/154 (70%), Gaps = 2/154 (1%)
 Frame = -3

Query: 456 LIDGKLPALFEKSKDRHIEISLAI--ADSSVDESELTMASFDLLVHVYCTQFKNLGLGFA 283
           LIDG LP L   S+  HIEI+ A+    S V   E T A FDLL+HVY TQF+NLG   A
Sbjct: 135 LIDGNLPVLNLDSEKFHIEIANALFGLTSVVGRFEWTQA-FDLLIHVYSTQFRNLGFSCA 193

Query: 282 FDVFRLLTNRGLFPSLKTCNFLLSSLVKANELDRSYEVFAIICRGFSPDVYSFSTAINAF 103
            DVF LL  +G FPSLKTCNFLLSSLVKANE ++  EVF ++  G  PDV+SF+  INA 
Sbjct: 194 VDVFYLLARKGTFPSLKTCNFLLSSLVKANEFEKCCEVFRVMSEGACPDVFSFTNVINAL 253

Query: 102 CKGGRIEVAAQLFYKMEVFGISPTVITYNTLIHG 1
           CKGG++E A +LF KME  GISP V+TYN +I+G
Sbjct: 254 CKGGKMENAIELFMKMEKLGISPNVVTYNCIING 287



 Score = 59.3 bits (142), Expect = 5e-07
 Identities = 37/118 (31%), Positives = 67/118 (56%), Gaps = 1/118 (0%)
 Frame = -3

Query: 351 MASFDLLVHVYCTQFKNLGLGFAFDVFRLLTNRGLFPSLKTCNFLLSSLVKANELDRSYE 172
           + +++ +++  C   +N  L  AF++   +T +G+ P+LKT   L++ L+K N  D+   
Sbjct: 278 VVTYNCIINGLC---QNGRLDNAFELKEKMTVKGVQPNLKTYGALINGLIKLNFFDKVNH 334

Query: 171 VF-AIICRGFSPDVYSFSTAINAFCKGGRIEVAAQLFYKMEVFGISPTVITYNTLIHG 1
           V   +I  GF+P+V  F+  I+ +CK G IE A ++   M    I+PT +T  +L+ G
Sbjct: 335 VLDEMIGSGFNPNVVVFNNLIDGYCKMGNIEGALKIKDVMISKNITPTSVTLYSLMQG 392


>ref|XP_002519901.1| pentatricopeptide repeat-containing protein, putative [Ricinus
           communis] gi|223540947|gb|EEF42505.1| pentatricopeptide
           repeat-containing protein, putative [Ricinus communis]
          Length = 777

 Score =  162 bits (410), Expect = 5e-38
 Identities = 77/129 (59%), Positives = 98/129 (75%), Gaps = 1/129 (0%)
 Frame = -3

Query: 384 ADSSVDESELTMASFDLLVHVYCTQFKNLGLGFAFDVFRLLTNRGLFPSLKTCNFLLSSL 205
           A  ++ E  + +   DLL+HVY TQFK+LG G  F++F LL N+GLFPSLKTCNFLLSSL
Sbjct: 113 ASETLFEPAVAVTVVDLLIHVYSTQFKHLGFGVVFELFSLLANKGLFPSLKTCNFLLSSL 172

Query: 204 VKANELDRSYEVFAIICR-GFSPDVYSFSTAINAFCKGGRIEVAAQLFYKMEVFGISPTV 28
           VKANE+  SY+VF I+C  G +PDVY FST +NAFC GGR++ A +LF KME  G++P V
Sbjct: 173 VKANEVKMSYQVFDIMCHCGVTPDVYLFSTMVNAFCTGGRVDDAIELFRKMEKVGVAPNV 232

Query: 27  ITYNTLIHG 1
           +TYN +IHG
Sbjct: 233 VTYNNIIHG 241


>ref|XP_003548529.2| PREDICTED: pentatricopeptide repeat-containing protein At4g19440,
           chloroplastic-like [Glycine max]
          Length = 840

 Score =  161 bits (408), Expect = 8e-38
 Identities = 81/156 (51%), Positives = 107/156 (68%), Gaps = 4/156 (2%)
 Frame = -3

Query: 456 LIDGKLPALFEKSK----DRHIEISLAIADSSVDESELTMASFDLLVHVYCTQFKNLGLG 289
           LIDG +P    K+     DR  EI+ ++ + +    E  +   DLL+H+ C+QFK LG  
Sbjct: 148 LIDGHVPTWSSKTTTSFHDRLREIASSMLELNQGSDEQRLGELDLLLHILCSQFKCLGSR 207

Query: 288 FAFDVFRLLTNRGLFPSLKTCNFLLSSLVKANELDRSYEVFAIICRGFSPDVYSFSTAIN 109
            AFD+F + + RG+FP LKTCN LLSSLVKANEL +SYEVF + C+G +PDV++F+TAIN
Sbjct: 208 CAFDIFVMFSKRGVFPCLKTCNLLLSSLVKANELHKSYEVFDLACQGVAPDVFTFTTAIN 267

Query: 108 AFCKGGRIEVAAQLFYKMEVFGISPTVITYNTLIHG 1
           AFCKGGR+  A  LF KME  G+ P V+TYN +I G
Sbjct: 268 AFCKGGRVGDAVDLFCKMEGLGVFPNVVTYNNVIDG 303



 Score = 57.0 bits (136), Expect = 3e-06
 Identities = 32/127 (25%), Positives = 64/127 (50%), Gaps = 4/127 (3%)
 Frame = -3

Query: 369  DESELTMASFDLLVHVYCTQFKNLGLGFAFDVFRL---LTNRGLFPSLKTCNFLLSSLVK 199
            ++ EL+   +++L+  YC       +G   + F+L   + +RG+ P+  T + L+  +  
Sbjct: 639  EKVELSSVVYNILIAAYCR------IGNVTEAFKLRDAMKSRGILPTCATYSSLIHGMCC 692

Query: 198  ANELDRSYEVFAIICR-GFSPDVYSFSTAINAFCKGGRIEVAAQLFYKMEVFGISPTVIT 22
               +D + E+F  +   G  P+V+ ++  I   CK G++++   +  +M   GI P  IT
Sbjct: 693  IGRVDEAKEIFEEMRNEGLLPNVFCYTALIGGHCKLGQMDIVGSILLEMSSNGIRPNKIT 752

Query: 21   YNTLIHG 1
            Y  +I G
Sbjct: 753  YTIMIDG 759


>ref|XP_004166658.1| PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing
           protein At4g19440, chloroplastic-like [Cucumis sativus]
          Length = 799

 Score =  161 bits (408), Expect = 8e-38
 Identities = 86/154 (55%), Positives = 106/154 (68%), Gaps = 2/154 (1%)
 Frame = -3

Query: 456 LIDGKLPALFEKSKDRHIEISLAI--ADSSVDESELTMASFDLLVHVYCTQFKNLGLGFA 283
           ++ G LP L   S+  HIEI+ A+    S V   E T A FDLL+HVY TQF+NLG   A
Sbjct: 112 IVYGNLPVLNLDSEKFHIEIANALFGLTSVVGRFEWTQA-FDLLIHVYSTQFRNLGFSCA 170

Query: 282 FDVFRLLTNRGLFPSLKTCNFLLSSLVKANELDRSYEVFAIICRGFSPDVYSFSTAINAF 103
            DVF LL  +G FPSLKTCNF LSSLVKANE ++  EVF ++  G  PDV+SF+  INA 
Sbjct: 171 VDVFYLLARKGTFPSLKTCNFXLSSLVKANEFEKCCEVFRVMSEGACPDVFSFTNVINAL 230

Query: 102 CKGGRIEVAAQLFYKMEVFGISPTVITYNTLIHG 1
           CKGG++E A +LF KME  GISP V+TYN +I+G
Sbjct: 231 CKGGKMENAIELFMKMEKLGISPNVVTYNCIING 264



 Score = 58.9 bits (141), Expect = 7e-07
 Identities = 36/118 (30%), Positives = 67/118 (56%), Gaps = 1/118 (0%)
 Frame = -3

Query: 351 MASFDLLVHVYCTQFKNLGLGFAFDVFRLLTNRGLFPSLKTCNFLLSSLVKANELDRSYE 172
           + +++ +++  C   +N  L  AF++   +T +G+ P+LKT   L++ L+K N  D+   
Sbjct: 255 VVTYNCIINGLC---QNGRLDNAFELKEKMTVKGVQPNLKTYGALINGLIKLNFFDKVNH 311

Query: 171 VF-AIICRGFSPDVYSFSTAINAFCKGGRIEVAAQLFYKMEVFGISPTVITYNTLIHG 1
           +   +I  GF+P+V  F+  I+ +CK G IE A ++   M    I+PT +T  +L+ G
Sbjct: 312 ILDEMIGAGFNPNVVVFNNLIDGYCKMGNIEGALKIKDVMISKNITPTSVTLYSLMQG 369


>gb|AHB18408.1| pentatricopeptide repeat-containing protein [Gossypium hirsutum]
          Length = 846

 Score =  161 bits (407), Expect = 1e-37
 Identities = 84/155 (54%), Positives = 110/155 (70%), Gaps = 3/155 (1%)
 Frame = -3

Query: 456 LIDGKLPALFEKSKD---RHIEISLAIADSSVDESELTMASFDLLVHVYCTQFKNLGLGF 286
           LIDGKLP LF  +      HI+I++A+AD  ++ S   +A  DLL+H+YCTQFKN+G  +
Sbjct: 168 LIDGKLP-LFSPNNPPTVNHIQIAIALAD--LNTSFKGVAGVDLLLHLYCTQFKNVGFTY 224

Query: 285 AFDVFRLLTNRGLFPSLKTCNFLLSSLVKANELDRSYEVFAIICRGFSPDVYSFSTAINA 106
           A DVF  L  +G+FPS KTCNF L+SL+KANE+ ++Y+VF  + R  S DVY  +T IN 
Sbjct: 225 AIDVFFTLAYKGIFPSTKTCNFFLNSLLKANEVRKTYQVFETLSRSVSLDVYLCTTMING 284

Query: 105 FCKGGRIEVAAQLFYKMEVFGISPTVITYNTLIHG 1
           FCKGGRI+ A  LF +ME  GISP V+TYN +IHG
Sbjct: 285 FCKGGRIQDAMALFSRMENLGISPNVVTYNNIIHG 319



 Score = 58.5 bits (140), Expect = 9e-07
 Identities = 32/118 (27%), Positives = 69/118 (58%), Gaps = 1/118 (0%)
 Frame = -3

Query: 351 MASFDLLVHVYCTQFKNLGLGFAFDVFRLLTNRGLFPSLKTCNFLLSSLVKANELDRSYE 172
           + +++ ++H  C   K+  L  AF + + +T +G+  SL T + L++ L+K ++ + +  
Sbjct: 310 VVTYNNIIHGLC---KSGRLDEAFQIKQNMTKQGVDHSLITYSVLINGLIKLDKFEEANS 366

Query: 171 VFAIIC-RGFSPDVYSFSTAINAFCKGGRIEVAAQLFYKMEVFGISPTVITYNTLIHG 1
           V   +  +GF+P+ + ++T I  +CK   I+ A ++ ++M   G+ P  +T+N L+HG
Sbjct: 367 VLKEMSDKGFAPNEFVYNTLIAGYCKMENIDEALRIKHQMLSNGMKPNSVTFNLLMHG 424


>ref|XP_007040906.1| Tetratricopeptide repeat-like superfamily protein, putative isoform
           1 [Theobroma cacao] gi|590680604|ref|XP_007040907.1|
           Tetratricopeptide repeat-like superfamily protein,
           putative isoform 1 [Theobroma cacao]
           gi|590680608|ref|XP_007040908.1| Tetratricopeptide
           repeat-like superfamily protein, putative isoform 1
           [Theobroma cacao] gi|590680612|ref|XP_007040909.1|
           Tetratricopeptide repeat-like superfamily protein,
           putative isoform 1 [Theobroma cacao]
           gi|590680616|ref|XP_007040910.1| Tetratricopeptide
           repeat-like superfamily protein, putative isoform 1
           [Theobroma cacao] gi|590680620|ref|XP_007040911.1|
           Tetratricopeptide repeat-like superfamily protein,
           putative isoform 1 [Theobroma cacao]
           gi|508778151|gb|EOY25407.1| Tetratricopeptide
           repeat-like superfamily protein, putative isoform 1
           [Theobroma cacao] gi|508778152|gb|EOY25408.1|
           Tetratricopeptide repeat-like superfamily protein,
           putative isoform 1 [Theobroma cacao]
           gi|508778153|gb|EOY25409.1| Tetratricopeptide
           repeat-like superfamily protein, putative isoform 1
           [Theobroma cacao] gi|508778154|gb|EOY25410.1|
           Tetratricopeptide repeat-like superfamily protein,
           putative isoform 1 [Theobroma cacao]
           gi|508778155|gb|EOY25411.1| Tetratricopeptide
           repeat-like superfamily protein, putative isoform 1
           [Theobroma cacao] gi|508778156|gb|EOY25412.1|
           Tetratricopeptide repeat-like superfamily protein,
           putative isoform 1 [Theobroma cacao]
          Length = 845

 Score =  157 bits (397), Expect = 1e-36
 Identities = 82/154 (53%), Positives = 107/154 (69%), Gaps = 2/154 (1%)
 Frame = -3

Query: 456 LIDGKLPALFEKSKD-RHIEISLAIAD-SSVDESELTMASFDLLVHVYCTQFKNLGLGFA 283
           LIDGKLP     +    HI+I+ A+AD +++ +    +   D+L+H+YCTQFKN G   A
Sbjct: 156 LIDGKLPLSSPNNTTIDHIQITTALADLNTLSKGVPRVMGVDMLLHLYCTQFKNAGFTSA 215

Query: 282 FDVFRLLTNRGLFPSLKTCNFLLSSLVKANELDRSYEVFAIICRGFSPDVYSFSTAINAF 103
            DVF  L ++G+FPS KTCNF LSSLVKANEL ++Y+VF  + R  S DVY  +T INAF
Sbjct: 216 IDVFFTLADKGMFPSSKTCNFFLSSLVKANELQKTYQVFETLSRFVSLDVYLCTTMINAF 275

Query: 102 CKGGRIEVAAQLFYKMEVFGISPTVITYNTLIHG 1
           CKGGRI+ A  LF +ME  GI+P V+TYN +IHG
Sbjct: 276 CKGGRIQDAFALFSRMENLGIAPNVVTYNNIIHG 309


>sp|Q940A6.2|PP325_ARATH RecName: Full=Pentatricopeptide repeat-containing protein
           At4g19440, chloroplastic; Flags: Precursor
          Length = 838

 Score =  156 bits (395), Expect = 2e-36
 Identities = 78/153 (50%), Positives = 106/153 (69%), Gaps = 1/153 (0%)
 Frame = -3

Query: 456 LIDGKLPALFEKSKDRHIEISLAIADSSVD-ESELTMASFDLLVHVYCTQFKNLGLGFAF 280
           LI+G +P L    +D  + I+ A+A  S+  + E+     DLL+ VYCTQFK  G   A 
Sbjct: 165 LINGNVPVLPCGLRDSRVAIADAMASLSLCFDEEIRRKMSDLLIEVYCTQFKRDGCYLAL 224

Query: 279 DVFRLLTNRGLFPSLKTCNFLLSSLVKANELDRSYEVFAIICRGFSPDVYSFSTAINAFC 100
           DVF +L N+G+FPS  TCN LL+SLV+ANE  +  E F ++C+G SPDVY F+TAINAFC
Sbjct: 225 DVFPVLANKGMFPSKTTCNILLTSLVRANEFQKCCEAFDVVCKGVSPDVYLFTTAINAFC 284

Query: 99  KGGRIEVAAQLFYKMEVFGISPTVITYNTLIHG 1
           KGG++E A +LF KME  G++P V+T+NT+I G
Sbjct: 285 KGGKVEEAVKLFSKMEEAGVAPNVVTFNTVIDG 317


>emb|CAA18631.1| putative protein [Arabidopsis thaliana] gi|7268739|emb|CAB78946.1|
           putative protein [Arabidopsis thaliana]
          Length = 814

 Score =  156 bits (395), Expect = 2e-36
 Identities = 78/153 (50%), Positives = 106/153 (69%), Gaps = 1/153 (0%)
 Frame = -3

Query: 456 LIDGKLPALFEKSKDRHIEISLAIADSSVD-ESELTMASFDLLVHVYCTQFKNLGLGFAF 280
           LI+G +P L    +D  + I+ A+A  S+  + E+     DLL+ VYCTQFK  G   A 
Sbjct: 141 LINGNVPVLPCGLRDSRVAIADAMASLSLCFDEEIRRKMSDLLIEVYCTQFKRDGCYLAL 200

Query: 279 DVFRLLTNRGLFPSLKTCNFLLSSLVKANELDRSYEVFAIICRGFSPDVYSFSTAINAFC 100
           DVF +L N+G+FPS  TCN LL+SLV+ANE  +  E F ++C+G SPDVY F+TAINAFC
Sbjct: 201 DVFPVLANKGMFPSKTTCNILLTSLVRANEFQKCCEAFDVVCKGVSPDVYLFTTAINAFC 260

Query: 99  KGGRIEVAAQLFYKMEVFGISPTVITYNTLIHG 1
           KGG++E A +LF KME  G++P V+T+NT+I G
Sbjct: 261 KGGKVEEAVKLFSKMEEAGVAPNVVTFNTVIDG 293


>ref|NP_567587.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
           gi|334186696|ref|NP_001190771.1| pentatricopeptide
           repeat-containing protein [Arabidopsis thaliana]
           gi|15810161|gb|AAL07224.1| unknown protein [Arabidopsis
           thaliana] gi|332658782|gb|AEE84182.1| pentatricopeptide
           repeat-containing protein [Arabidopsis thaliana]
           gi|332658783|gb|AEE84183.1| pentatricopeptide
           repeat-containing protein [Arabidopsis thaliana]
          Length = 825

 Score =  156 bits (395), Expect = 2e-36
 Identities = 78/153 (50%), Positives = 106/153 (69%), Gaps = 1/153 (0%)
 Frame = -3

Query: 456 LIDGKLPALFEKSKDRHIEISLAIADSSVD-ESELTMASFDLLVHVYCTQFKNLGLGFAF 280
           LI+G +P L    +D  + I+ A+A  S+  + E+     DLL+ VYCTQFK  G   A 
Sbjct: 152 LINGNVPVLPCGLRDSRVAIADAMASLSLCFDEEIRRKMSDLLIEVYCTQFKRDGCYLAL 211

Query: 279 DVFRLLTNRGLFPSLKTCNFLLSSLVKANELDRSYEVFAIICRGFSPDVYSFSTAINAFC 100
           DVF +L N+G+FPS  TCN LL+SLV+ANE  +  E F ++C+G SPDVY F+TAINAFC
Sbjct: 212 DVFPVLANKGMFPSKTTCNILLTSLVRANEFQKCCEAFDVVCKGVSPDVYLFTTAINAFC 271

Query: 99  KGGRIEVAAQLFYKMEVFGISPTVITYNTLIHG 1
           KGG++E A +LF KME  G++P V+T+NT+I G
Sbjct: 272 KGGKVEEAVKLFSKMEEAGVAPNVVTFNTVIDG 304


>ref|XP_004509525.1| PREDICTED: pentatricopeptide repeat-containing protein At4g19440,
           chloroplastic-like isoform X1 [Cicer arietinum]
           gi|502153968|ref|XP_004509526.1| PREDICTED:
           pentatricopeptide repeat-containing protein At4g19440,
           chloroplastic-like isoform X2 [Cicer arietinum]
           gi|502153970|ref|XP_004509527.1| PREDICTED:
           pentatricopeptide repeat-containing protein At4g19440,
           chloroplastic-like isoform X3 [Cicer arietinum]
           gi|502153972|ref|XP_004509528.1| PREDICTED:
           pentatricopeptide repeat-containing protein At4g19440,
           chloroplastic-like isoform X4 [Cicer arietinum]
           gi|502153974|ref|XP_004509529.1| PREDICTED:
           pentatricopeptide repeat-containing protein At4g19440,
           chloroplastic-like isoform X5 [Cicer arietinum]
           gi|502153976|ref|XP_004509530.1| PREDICTED:
           pentatricopeptide repeat-containing protein At4g19440,
           chloroplastic-like isoform X6 [Cicer arietinum]
           gi|502153978|ref|XP_004509531.1| PREDICTED:
           pentatricopeptide repeat-containing protein At4g19440,
           chloroplastic-like isoform X7 [Cicer arietinum]
           gi|502153980|ref|XP_004509532.1| PREDICTED:
           pentatricopeptide repeat-containing protein At4g19440,
           chloroplastic-like isoform X8 [Cicer arietinum]
           gi|502153982|ref|XP_004509533.1| PREDICTED:
           pentatricopeptide repeat-containing protein At4g19440,
           chloroplastic-like isoform X9 [Cicer arietinum]
           gi|502153984|ref|XP_004509534.1| PREDICTED:
           pentatricopeptide repeat-containing protein At4g19440,
           chloroplastic-like isoform X10 [Cicer arietinum]
          Length = 835

 Score =  156 bits (394), Expect = 3e-36
 Identities = 86/156 (55%), Positives = 108/156 (69%), Gaps = 4/156 (2%)
 Frame = -3

Query: 456 LIDGKLPALFEKSKDRHIEISLAIADSSVDESELTMAS---FDLLVHVYCTQFKNLGLGF 286
           LIDG +        DR  E+    A S ++ S LT  S    DLL+H+ C+QF++LG  +
Sbjct: 145 LIDGNVSTPLLNRDDRLSEM----ASSFLELSRLTERSHGELDLLLHILCSQFQHLGFHW 200

Query: 285 AFDVFRLLTNRGLFPSLKTCNFLLSSLVKANELDRSYEVFAIICR-GFSPDVYSFSTAIN 109
           AFD+F L T+ G+FPSLKTCNFLLSSLVK+NEL +SY VF ++CR G S DVY+FSTAIN
Sbjct: 201 AFDIFTLFTSNGVFPSLKTCNFLLSSLVKSNELHKSYRVFDVVCRGGVSLDVYTFSTAIN 260

Query: 108 AFCKGGRIEVAAQLFYKMEVFGISPTVITYNTLIHG 1
           AF KGG+I+ A  LF KME  G+ P V+TYN LI G
Sbjct: 261 AFSKGGKIDDAVGLFSKMEEQGVLPNVVTYNNLIDG 296


>gb|EYU29134.1| hypothetical protein MIMGU_mgv1a001281mg [Mimulus guttatus]
          Length = 847

 Score =  155 bits (392), Expect = 6e-36
 Identities = 81/155 (52%), Positives = 109/155 (70%), Gaps = 3/155 (1%)
 Frame = -3

Query: 456 LIDGKLP-ALFEKSKDRHIEISLAIADSSVDESELTMAS--FDLLVHVYCTQFKNLGLGF 286
           LID KLP +L +   + H EI++ +AD+     +    +  FD+LVHVY T+FK+LGL  
Sbjct: 161 LIDRKLPVSLRDNVVNLHNEIAIVLADTFSGSEKFRSGNRGFDMLVHVYATEFKSLGLDA 220

Query: 285 AFDVFRLLTNRGLFPSLKTCNFLLSSLVKANELDRSYEVFAIICRGFSPDVYSFSTAINA 106
           A DVFRLL  R L PS KTCNFL+S+LVKA+E ++SYE+F I+ R   PDVY +STAINA
Sbjct: 221 AMDVFRLLAGRRLVPSFKTCNFLMSTLVKADEHEKSYEIFLIVSRESLPDVYLYSTAINA 280

Query: 105 FCKGGRIEVAAQLFYKMEVFGISPTVITYNTLIHG 1
            CKGG+++ AA LF  M   G++P V+TYN L++G
Sbjct: 281 LCKGGKVDEAAMLFKVMGNSGVAPNVVTYNNLMNG 315



 Score = 56.6 bits (135), Expect = 4e-06
 Identities = 31/96 (32%), Positives = 51/96 (53%), Gaps = 1/96 (1%)
 Frame = -3

Query: 285 AFDVFRLLTNRGLFPSLKTCNFLLSSLVKANELDRSYEVFAIICR-GFSPDVYSFSTAIN 109
           A  +F  + NRG+ P+L T + L+  L  A  L+ S  +F  + + G  PDV  ++  I 
Sbjct: 675 ALKLFDDMKNRGVKPTLATYSSLIHGLSNAGRLNDSKVLFDEMRKEGLMPDVVCYTALIG 734

Query: 108 AFCKGGRIEVAAQLFYKMEVFGISPTVITYNTLIHG 1
            +CK G ++ A  L  +M +F +    IT+  +IHG
Sbjct: 735 GYCKLGHMDEARNLLQEMSLFNVKANKITFTVIIHG 770


>ref|XP_007156329.1| hypothetical protein PHAVU_003G277400g [Phaseolus vulgaris]
           gi|561029683|gb|ESW28323.1| hypothetical protein
           PHAVU_003G277400g [Phaseolus vulgaris]
          Length = 837

 Score =  155 bits (392), Expect = 6e-36
 Identities = 75/152 (49%), Positives = 105/152 (69%)
 Frame = -3

Query: 456 LIDGKLPALFEKSKDRHIEISLAIADSSVDESELTMASFDLLVHVYCTQFKNLGLGFAFD 277
           LIDG +P  F   ++R  EI+ ++ + +    +      DLL+++ C+++K+ G   AFD
Sbjct: 150 LIDGHVPTSFHDRENRLREIASSMLELN-QVLDTRHGELDLLLYILCSRYKDFGFRCAFD 208

Query: 276 VFRLLTNRGLFPSLKTCNFLLSSLVKANELDRSYEVFAIICRGFSPDVYSFSTAINAFCK 97
           +F + + RG+FP LKTCNFLLSSLV ANEL +SYEVF + C+G  PDV+ F+ AINAFCK
Sbjct: 209 IFIMFSKRGVFPCLKTCNFLLSSLVTANELHKSYEVFDVTCQGVVPDVFMFTAAINAFCK 268

Query: 96  GGRIEVAAQLFYKMEVFGISPTVITYNTLIHG 1
           GGR+  A  LF+KME  G+SP V+TYN +I G
Sbjct: 269 GGRVGDAVDLFHKMEKLGVSPNVVTYNNVIDG 300


>ref|XP_006413978.1| hypothetical protein EUTSA_v10024401mg [Eutrema salsugineum]
           gi|557115148|gb|ESQ55431.1| hypothetical protein
           EUTSA_v10024401mg [Eutrema salsugineum]
          Length = 837

 Score =  152 bits (385), Expect = 4e-35
 Identities = 76/155 (49%), Positives = 103/155 (66%), Gaps = 3/155 (1%)
 Frame = -3

Query: 456 LIDGKLPALFEKSKDRHIEISLAIADSSVD---ESELTMASFDLLVHVYCTQFKNLGLGF 286
           LI+G +P L   +  R   +++A A +S+    + E+ M   DLL+ VYCTQFK  G   
Sbjct: 162 LINGNVPVLPSANDSRDGRVAIADAMASLSLCFDPEIRMRISDLLIEVYCTQFKRAGCYL 221

Query: 285 AFDVFRLLTNRGLFPSLKTCNFLLSSLVKANELDRSYEVFAIICRGFSPDVYSFSTAINA 106
           A D+F LL N+GLFPS  TCN LL+SLV+ANE  +  E F  +C+G SPDVY F+T INA
Sbjct: 222 ALDIFPLLANKGLFPSRTTCNILLTSLVRANEFQKCCEAFEAVCKGVSPDVYLFTTVINA 281

Query: 105 FCKGGRIEVAAQLFYKMEVFGISPTVITYNTLIHG 1
           +CK G++  A +LF KME  G++P V+TYNT+I G
Sbjct: 282 YCKRGKVGEAIELFSKMEEAGVAPNVVTYNTVIDG 316

