BLASTX nr result

ID: Zingiber25_contig00018834 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Zingiber25_contig00018834
         (803 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006832839.1| hypothetical protein AMTR_s00095p00039180 [A...   192   9e-47
emb|CBI34375.3| unnamed protein product [Vitis vinifera]              189   1e-45
ref|XP_002299515.2| hypothetical protein POPTR_0001s09740g, part...   188   2e-45
ref|XP_002281998.2| PREDICTED: pentatricopeptide repeat-containi...   185   2e-44
emb|CBI20738.3| unnamed protein product [Vitis vinifera]              185   2e-44
emb|CAN76239.1| hypothetical protein VITISV_016538 [Vitis vinifera]   185   2e-44
ref|XP_006285430.1| hypothetical protein CARUB_v10006847mg [Caps...   184   4e-44
ref|XP_002867196.1| pentatricopeptide repeat-containing protein ...   183   5e-44
ref|XP_004157408.1| PREDICTED: LOW QUALITY PROTEIN: pentatricope...   183   7e-44
ref|XP_004141438.1| PREDICTED: pentatricopeptide repeat-containi...   183   7e-44
ref|XP_004160754.1| PREDICTED: pentatricopeptide repeat-containi...   181   3e-43
ref|XP_006574752.1| PREDICTED: pentatricopeptide repeat-containi...   181   3e-43
ref|XP_004138557.1| PREDICTED: pentatricopeptide repeat-containi...   181   3e-43
gb|EOY24832.1| Pentatricopeptide repeat superfamily protein, put...   180   6e-43
ref|XP_002308660.2| hypothetical protein POPTR_0006s26860g [Popu...   179   8e-43
ref|NP_195043.1| pentatricopeptide repeat-containing protein [Ar...   179   8e-43
gb|EOY11208.1| Pentatricopeptide repeat (PPR) superfamily protei...   179   1e-42
gb|EOY11207.1| Pentatricopeptide repeat superfamily protein isof...   179   1e-42
gb|EMJ26738.1| hypothetical protein PRUPE_ppa026705mg [Prunus pe...   178   2e-42
ref|XP_004291465.1| PREDICTED: pentatricopeptide repeat-containi...   177   3e-42

>ref|XP_006832839.1| hypothetical protein AMTR_s00095p00039180 [Amborella trichopoda]
            gi|548837339|gb|ERM98117.1| hypothetical protein
            AMTR_s00095p00039180 [Amborella trichopoda]
          Length = 819

 Score =  192 bits (489), Expect = 9e-47
 Identities = 98/213 (46%), Positives = 137/213 (64%)
 Frame = -1

Query: 803  EEDKYIGSALIDMYCKCGSIGCALRYFSTASKHDIILWNALLSGYAQHGNVLEMLKAYEK 624
            + D +I S+L+DMY KCGS+  A+  F  A K DI++WN++L+G A+HGN  E+L  +EK
Sbjct: 602  QTDSFIASSLLDMYSKCGSLEKAVSCFEEAPKGDIVIWNSILAGQARHGNGEEVLGLFEK 661

Query: 623  MIGHGIEPDSITFLSVLSGCSHGGLLDKVFQYFASMRDEYGIIPQMEHHACVVDALGRAD 444
            M   G++PD ++FLSVLSGCSHG L+D+   +F  M+ EYGI P+ EH+ACVVDALGRA 
Sbjct: 662  MKECGMKPDHVSFLSVLSGCSHGRLVDETMFWFRRMKVEYGINPKEEHYACVVDALGRAG 721

Query: 443  LLKDAINFIDEMGVTPGSTVLRTLISFCIVHGHVKLGLASIAKMILLGQMDSSAFVLLSN 264
            +L +A+ F   M   P   VLR+L+S C  H  + LGLA  A+++ L   D +  VLLSN
Sbjct: 722  MLNEAVKFAGSMPFGPELEVLRSLLSSCKTHACIALGLAVAARIMKLEPHDPATLVLLSN 781

Query: 263  LYAIEKKWHARTKVRDAMEVDIKNAKKVAVSWI 165
            LYA   +W     VR+ +       K+   SW+
Sbjct: 782  LYASHGRWEEAEWVREVIGKTWMMRKEAGQSWL 814



 Score = 63.9 bits (154), Expect = 6e-08
 Identities = 53/183 (28%), Positives = 87/183 (47%), Gaps = 2/183 (1%)
 Frame = -1

Query: 788 IGSALIDMYCKCGSIGCALRYFSTASKH-DIILWNALLSGYAQHGNVLEMLKAYEKMIGH 612
           +  +L++MY KC S+  A+R F+      D++ W  LLSGY   G  +E L+ + KM  +
Sbjct: 404 VQDSLVNMYAKCSSMYDAVRAFNEIQGGCDLLSWTTLLSGYVYCGFSVEALRTFSKMRDN 463

Query: 611 GIEPDSITFLSVLSGCSHGGLLDKVFQYFA-SMRDEYGIIPQMEHHACVVDALGRADLLK 435
           G++PDS+  + VL+GC+    ++   Q  A +++  Y +  Q+E    ++        L 
Sbjct: 464 GVKPDSVACVGVLAGCTSIQFINHGRQVHAYAVKCGYDLNIQVE--TALLSLYAECGRLD 521

Query: 434 DAINFIDEMGVTPGSTVLRTLISFCIVHGHVKLGLASIAKMILLGQMDSSAFVLLSNLYA 255
            AI    +M   P       LIS  +   H +  L  + KM+  G   +  F L S L A
Sbjct: 522 FAIELFSKM-FEPDVVSWTALISAHVKLDHNQDALIWLVKMVREGTKPNK-FTLASALTA 579

Query: 254 IEK 246
             K
Sbjct: 580 SAK 582


>emb|CBI34375.3| unnamed protein product [Vitis vinifera]
          Length = 814

 Score =  189 bits (480), Expect = 1e-45
 Identities = 90/212 (42%), Positives = 141/212 (66%)
 Frame = -1

Query: 800  EDKYIGSALIDMYCKCGSIGCALRYFSTASKHDIILWNALLSGYAQHGNVLEMLKAYEKM 621
            +D ++ SA+ID+YCKCG++  A + F   SK++++ WNA++ GYAQHG   E+ + + KM
Sbjct: 552  QDNFVESAVIDVYCKCGTVDEAAKTFMNVSKNNLVAWNAMVMGYAQHGCYHEVFELFNKM 611

Query: 620  IGHGIEPDSITFLSVLSGCSHGGLLDKVFQYFASMRDEYGIIPQMEHHACVVDALGRADL 441
            +  GI+PD IT+L VL+ C H GL+++   Y +SM + +G++P +EH+AC++D  GR  L
Sbjct: 612  LELGIQPDEITYLGVLNSCCHAGLVNEAHTYLSSMLELHGVVPCLEHYACMIDLFGRVGL 671

Query: 440  LKDAINFIDEMGVTPGSTVLRTLISFCIVHGHVKLGLASIAKMILLGQMDSSAFVLLSNL 261
            L+DA   ID+M + P + + + L+S C +HG+V LG  +  K+I L   + SA+VLLSNL
Sbjct: 672  LEDAKRTIDQMPIMPDAQIWQILLSGCNIHGNVDLGEVAAKKLIELQPENDSAYVLLSNL 731

Query: 260  YAIEKKWHARTKVRDAMEVDIKNAKKVAVSWI 165
            YA   +W+A  K+R  M+  I   K+   SWI
Sbjct: 732  YASAGRWNAVGKLRRVMKKKI-ICKEPGSSWI 762



 Score = 61.2 bits (147), Expect = 4e-07
 Identities = 37/143 (25%), Positives = 73/143 (51%), Gaps = 3/143 (2%)
 Frame = -1

Query: 788 IGSALIDMYCKCGSIGCALRYFSTASKHDIILWNALLSGYAQHGNVLEMLKAYEKMIGHG 609
           + +ALI MY KCG +  A   F      D + WN+L++GYA++G + + LK + +M  + 
Sbjct: 354 VNNALIFMYGKCGEMVAARHIFDEMLCGDSVSWNSLIAGYAENGLMKQALKVFSQMRDYL 413

Query: 608 IEPDSITFLSVLSGCSHGGLLDKVFQYFASMRDEYGIIPQMEHHACVVDALGRADLL--- 438
           ++P+  T  S+L   ++    ++  Q   S   + G I      +C++ A G+ +++   
Sbjct: 414 LQPNKYTLASILEVAANSNFPEQAMQ-IHSYIVKLGFIVDDSMLSCLITAYGKCNMICES 472

Query: 437 KDAINFIDEMGVTPGSTVLRTLI 369
           K   + I ++ V   + +  TL+
Sbjct: 473 KRVYSDISQINVLHLNAMAATLV 495


>ref|XP_002299515.2| hypothetical protein POPTR_0001s09740g, partial [Populus trichocarpa]
            gi|550346914|gb|EEE84320.2| hypothetical protein
            POPTR_0001s09740g, partial [Populus trichocarpa]
          Length = 706

 Score =  188 bits (478), Expect = 2e-45
 Identities = 94/214 (43%), Positives = 138/214 (64%), Gaps = 1/214 (0%)
 Frame = -1

Query: 803  EEDKYIGSALIDMYCKCGSIGCALRYFSTASKHDIILWNALLSGYAQHGNVLEMLKAYEK 624
            ++D ++ S++ID+YCKCGSIG A + F ++S + +  WNA++ GYA HG   E+   + K
Sbjct: 494  DQDSFVESSVIDIYCKCGSIGQAEKAFRSSSMNSLAAWNAMMMGYAHHGCYQEVFDLFNK 553

Query: 623  MIGHGIEPDSITFLSVLSGCSHGGLLDKVFQYFASMRDEYGIIPQMEHHACVVDALGRAD 444
            M   GIEPD IT+L VLS C HGGL+ +   Y  SM + +GIIP +EH+AC++D LGR  
Sbjct: 554  MSQFGIEPDEITYLGVLSSCCHGGLVKQARHYLDSMFELHGIIPHLEHYACMIDLLGRVG 613

Query: 443  LLKDAINFIDEMGVTPGSTVLRTLISFCIVHGHVKLGLASIAKMILLGQMDSSAFVLLSN 264
            LL+DA   ID M + P   + + L+S C +HGHV+LG  +  K++ +   + SA++LLSN
Sbjct: 614  LLEDAKKTIDHMPIQPDVHIWQILLSACNIHGHVELGRVAARKLLEIHPENESAYILLSN 673

Query: 263  LYAIEKKWHARTKVRDAMEVDIKNAKK-VAVSWI 165
            LYA    W+A  ++R  M+   KN +K    SWI
Sbjct: 674  LYASVGMWNAVGRLRKEMKE--KNLRKEPGSSWI 705



 Score = 58.2 bits (139), Expect = 3e-06
 Identities = 39/143 (27%), Positives = 71/143 (49%), Gaps = 3/143 (2%)
 Frame = -1

Query: 788 IGSALIDMYCKCGSIGCALRYFSTASKHDIILWNALLSGYAQHGNVLEMLKAYEKMIGHG 609
           + +AL+ MY KCG I  A R F      D + WN+L+S  +++G V + L+ + +M    
Sbjct: 297 VSNALVSMYGKCGQICDACRVFYNMIIRDSVSWNSLISACSENGFVNQALEVFYQMRELS 356

Query: 608 IEPDSITFLSVLSGCSHGGLLDKVFQYFASMRDEYGIIPQMEHHACVVDALGRA---DLL 438
           ++P   T  S+L   S+     +V Q   S+  + G +  +   +C++ A GR    D  
Sbjct: 357 LQPTIHTLASILEAVSNSNNTKQVIQ-IHSLVVKCGFMFDVSMISCLITAYGRCNSMDES 415

Query: 437 KDAINFIDEMGVTPGSTVLRTLI 369
           K     ID++ +   +T++ T +
Sbjct: 416 KRVFAEIDKVNLVHLNTMITTFV 438


>ref|XP_002281998.2| PREDICTED: pentatricopeptide repeat-containing protein At4g33170
            [Vitis vinifera]
          Length = 1580

 Score =  185 bits (469), Expect = 2e-44
 Identities = 91/212 (42%), Positives = 134/212 (63%), Gaps = 1/212 (0%)
 Frame = -1

Query: 797  DKYIGSALIDMYCKCGSIGCALRYFSTASKHDIILWNALLSGYAQHGNVLEMLKAYEKMI 618
            D ++G++L+DMY KCG+I  A R F   +  +I LWNA+L G AQHGN  E +  ++ M 
Sbjct: 1242 DPFVGTSLVDMYAKCGNIEDAYRLFKKMNVRNIALWNAMLVGLAQHGNAEEAVNLFKSMK 1301

Query: 617  GHGIEPDSITFLSVLSGCSHGGLLDKVFQYFASMRDEYGIIPQMEHHACVVDALGRADLL 438
             HGIEPD ++F+ +LS CSH GL  + ++Y  SM ++YGI P++EH++C+VDALGRA L+
Sbjct: 1302 SHGIEPDRVSFIGILSACSHAGLTSEAYEYLHSMPNDYGIEPEIEHYSCLVDALGRAGLV 1361

Query: 437  KDAINFIDEMGVTPGSTVLRTLISFCIVHGHVKLGLASIAKMILLGQMDSSAFVLLSNLY 258
            ++A   I+ M     +++ R L+  C + G V+ G    A++  L   DS+A+VLLSN+Y
Sbjct: 1362 QEADKVIETMPFKASASINRALLGACRIQGDVETGKRVAARLFALEPFDSAAYVLLSNIY 1421

Query: 257  AIEKKWHARTKVRDAMEVDIKNAKK-VAVSWI 165
            A   +W   T  R  M+   KN KK    SWI
Sbjct: 1422 AAANRWDDVTDARKMMK--RKNVKKDPGFSWI 1451



 Score = 57.4 bits (137), Expect = 6e-06
 Identities = 31/102 (30%), Positives = 50/102 (49%), Gaps = 6/102 (5%)
 Frame = -1

Query: 803  EEDKYIGSALIDMYCKCGSIGCALRYFSTASKHDIILWNALLSGYAQHGNVLEMLKAYEK 624
            E D ++  AL+++Y KCG +  A   F    + D++LWN +L GY Q G   E  + + +
Sbjct: 759  EWDVFVSGALVNIYSKCGRMRDARLLFDWMRERDVVLWNMMLKGYVQLGLEKEAFQLFSE 818

Query: 623  MIGHGIEPDSITFLSVLSGCSHGG------LLDKVFQYFASM 516
                G+ PD  +   +L+G S         L D+V  Y A +
Sbjct: 819  FHRSGLRPDEFSVQLILNGVSEVNWDEGKWLADQVQAYAAKL 860


>emb|CBI20738.3| unnamed protein product [Vitis vinifera]
          Length = 865

 Score =  185 bits (469), Expect = 2e-44
 Identities = 91/212 (42%), Positives = 134/212 (63%), Gaps = 1/212 (0%)
 Frame = -1

Query: 797  DKYIGSALIDMYCKCGSIGCALRYFSTASKHDIILWNALLSGYAQHGNVLEMLKAYEKMI 618
            D ++G++L+DMY KCG+I  A R F   +  +I LWNA+L G AQHGN  E +  ++ M 
Sbjct: 527  DPFVGTSLVDMYAKCGNIEDAYRLFKKMNVRNIALWNAMLVGLAQHGNAEEAVNLFKSMK 586

Query: 617  GHGIEPDSITFLSVLSGCSHGGLLDKVFQYFASMRDEYGIIPQMEHHACVVDALGRADLL 438
             HGIEPD ++F+ +LS CSH GL  + ++Y  SM ++YGI P++EH++C+VDALGRA L+
Sbjct: 587  SHGIEPDRVSFIGILSACSHAGLTSEAYEYLHSMPNDYGIEPEIEHYSCLVDALGRAGLV 646

Query: 437  KDAINFIDEMGVTPGSTVLRTLISFCIVHGHVKLGLASIAKMILLGQMDSSAFVLLSNLY 258
            ++A   I+ M     +++ R L+  C + G V+ G    A++  L   DS+A+VLLSN+Y
Sbjct: 647  QEADKVIETMPFKASASINRALLGACRIQGDVETGKRVAARLFALEPFDSAAYVLLSNIY 706

Query: 257  AIEKKWHARTKVRDAMEVDIKNAKK-VAVSWI 165
            A   +W   T  R  M+   KN KK    SWI
Sbjct: 707  AAANRWDDVTDARKMMK--RKNVKKDPGFSWI 736



 Score = 60.5 bits (145), Expect = 7e-07
 Identities = 28/87 (32%), Positives = 45/87 (51%)
 Frame = -1

Query: 803 EEDKYIGSALIDMYCKCGSIGCALRYFSTASKHDIILWNALLSGYAQHGNVLEMLKAYEK 624
           E D ++  AL+++Y KCG +  A   F    + D++LWN +L GY Q G   E  + + +
Sbjct: 215 EWDVFVSGALVNIYSKCGRMRDARLLFDWMRERDVVLWNMMLKGYVQLGLEKEAFQLFSE 274

Query: 623 MIGHGIEPDSITFLSVLSGCSHGGLLD 543
               G+ PD  +   +L+GC   G  D
Sbjct: 275 FHRSGLRPDEFSVQLILNGCLWAGTDD 301


>emb|CAN76239.1| hypothetical protein VITISV_016538 [Vitis vinifera]
          Length = 503

 Score =  185 bits (469), Expect = 2e-44
 Identities = 91/212 (42%), Positives = 134/212 (63%), Gaps = 1/212 (0%)
 Frame = -1

Query: 797 DKYIGSALIDMYCKCGSIGCALRYFSTASKHDIILWNALLSGYAQHGNVLEMLKAYEKMI 618
           D ++G++L+DMY KCG+I  A R F   +  +I LWNA+L G AQHGN  E +  ++ M 
Sbjct: 165 DPFVGTSLVDMYAKCGNIEDAYRLFKKMNVRNIALWNAMLVGLAQHGNAEEAVNLFKSMK 224

Query: 617 GHGIEPDSITFLSVLSGCSHGGLLDKVFQYFASMRDEYGIIPQMEHHACVVDALGRADLL 438
            HGIEPD ++F+ +LS CSH GL  + ++Y  SM ++YGI P++EH++C+VDALGRA L+
Sbjct: 225 SHGIEPDRVSFIGILSACSHAGLTSEAYEYLHSMPNDYGIEPEIEHYSCLVDALGRAGLV 284

Query: 437 KDAINFIDEMGVTPGSTVLRTLISFCIVHGHVKLGLASIAKMILLGQMDSSAFVLLSNLY 258
           ++A   I+ M     +++ R L+  C + G V+ G    A++  L   DS+A+VLLSN+Y
Sbjct: 285 QEADKVIETMPFKASASINRALLGACRIQGDVEXGKRVAARLFALEPFDSAAYVLLSNIY 344

Query: 257 AIEKKWHARTKVRDAMEVDIKNAKK-VAVSWI 165
           A   +W   T  R  M+   KN KK    SWI
Sbjct: 345 AAANRWDDVTDARKMMK--RKNVKKDPGFSWI 374


>ref|XP_006285430.1| hypothetical protein CARUB_v10006847mg [Capsella rubella]
            gi|482554135|gb|EOA18328.1| hypothetical protein
            CARUB_v10006847mg [Capsella rubella]
          Length = 996

 Score =  184 bits (466), Expect = 4e-44
 Identities = 90/211 (42%), Positives = 132/211 (62%)
 Frame = -1

Query: 797  DKYIGSALIDMYCKCGSIGCALRYFSTASKHDIILWNALLSGYAQHGNVLEMLKAYEKMI 618
            D ++G++L+DMY KCGSI  A   F      +I  WNA+L G AQHG   E+L+ +++M 
Sbjct: 658  DPFVGTSLVDMYAKCGSIDDAYSLFKRIEMRNIAAWNAMLLGLAQHGEGKEVLQLFKQMK 717

Query: 617  GHGIEPDSITFLSVLSGCSHGGLLDKVFQYFASMRDEYGIIPQMEHHACVVDALGRADLL 438
              GI PD +TF+ VLS CSH GL+ + +++  SM  +YGI P++EH++C+ DALGRA  L
Sbjct: 718  SLGINPDKVTFIGVLSACSHSGLVSEAYKHIGSMHRDYGIKPEIEHYSCLADALGRAGFL 777

Query: 437  KDAINFIDEMGVTPGSTVLRTLISFCIVHGHVKLGLASIAKMILLGQMDSSAFVLLSNLY 258
            K+A N I+ M +   +++ RTL++ C V G  + G    +K++ L  +DSSA+VLLSN+Y
Sbjct: 778  KEAENLIESMSMEASASMYRTLLAACRVKGDTETGKRVASKLLELDPLDSSAYVLLSNMY 837

Query: 257  AIEKKWHARTKVRDAMEVDIKNAKKVAVSWI 165
            A   KW      R  M+   K  K   +SWI
Sbjct: 838  AAASKWDEMKLARRMMKGQ-KVKKDPGISWI 867



 Score = 63.5 bits (153), Expect = 8e-08
 Identities = 27/76 (35%), Positives = 48/76 (63%)
 Frame = -1

Query: 788 IGSALIDMYCKCGSIGCALRYFSTASKHDIILWNALLSGYAQHGNVLEMLKAYEKMIGHG 609
           + ++LI+MYCK   IG A   F T S+ D+I WN++++G++Q G  +E +  + +++ +G
Sbjct: 358 VANSLINMYCKLRKIGFARTVFHTMSERDLISWNSVIAGFSQSGLEMEAVCLFMQLLRYG 417

Query: 608 IEPDSITFLSVLSGCS 561
           + PD  T  S+L   S
Sbjct: 418 LTPDQYTMTSILKAAS 433


>ref|XP_002867196.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297313032|gb|EFH43455.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 997

 Score =  183 bits (465), Expect = 5e-44
 Identities = 92/211 (43%), Positives = 131/211 (62%)
 Frame = -1

Query: 797  DKYIGSALIDMYCKCGSIGCALRYFSTASKHDIILWNALLSGYAQHGNVLEMLKAYEKMI 618
            D ++G++L+DMY KCGSI  A   F      +I  WNA+L G AQHG   E L+ +++M 
Sbjct: 659  DPFVGTSLVDMYAKCGSIDDAYCLFKRIEMMNITAWNAMLVGLAQHGEGKEALQLFKQME 718

Query: 617  GHGIEPDSITFLSVLSGCSHGGLLDKVFQYFASMRDEYGIIPQMEHHACVVDALGRADLL 438
              GI+PD +TF+ VLS CSH GL+ + ++Y  SM  +YGI P++EH++C+ DALGRA L+
Sbjct: 719  SLGIKPDKVTFIGVLSACSHSGLVSEAYKYIRSMHRDYGIKPEIEHYSCLADALGRAGLV 778

Query: 437  KDAINFIDEMGVTPGSTVLRTLISFCIVHGHVKLGLASIAKMILLGQMDSSAFVLLSNLY 258
            K+A N ID M +   +++ RTL++ C V G  + G     K++ L  +DSSA+VLLSN+Y
Sbjct: 779  KEAENLIDSMSMEASASMYRTLLAACRVQGDTETGKRVATKLLELEPLDSSAYVLLSNMY 838

Query: 257  AIEKKWHARTKVRDAMEVDIKNAKKVAVSWI 165
            A   KW      R  M+   K  K    SWI
Sbjct: 839  AAASKWDEMKLARTMMK-GHKVKKDPGFSWI 868



 Score = 58.2 bits (139), Expect = 3e-06
 Identities = 27/76 (35%), Positives = 46/76 (60%)
 Frame = -1

Query: 788 IGSALIDMYCKCGSIGCALRYFSTASKHDIILWNALLSGYAQHGNVLEMLKAYEKMIGHG 609
           + ++LI+MYCK   IG A   F+  S+ D+I WN++++G AQ    +E +  + +++  G
Sbjct: 359 VSNSLINMYCKLRKIGLARTVFNNMSERDLISWNSVIAGIAQSDLEVEAVCLFMQLLRCG 418

Query: 608 IEPDSITFLSVLSGCS 561
           ++PD  T  SVL   S
Sbjct: 419 LKPDHYTMTSVLKAAS 434


>ref|XP_004157408.1| PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing
            protein At4g33170-like [Cucumis sativus]
          Length = 1573

 Score =  183 bits (464), Expect = 7e-44
 Identities = 90/212 (42%), Positives = 131/212 (61%), Gaps = 1/212 (0%)
 Frame = -1

Query: 797  DKYIGSALIDMYCKCGSIGCALRYFSTASKHDIILWNALLSGYAQHGNVLEMLKAYEKMI 618
            D ++G++L+DMYCKCGS+  A R F       ++ WNA+L G AQHG+V E L  +  M 
Sbjct: 1234 DHFVGTSLVDMYCKCGSVQDAYRVFRKMDVRKVVFWNAMLLGLAQHGHVDEALNLFRTMQ 1293

Query: 617  GHGIEPDSITFLSVLSGCSHGGLLDKVFQYFASMRDEYGIIPQMEHHACVVDALGRADLL 438
             +GI+PD +TF+ VLS CSH GL  + ++YF +M   YGI P++EH++C+VDALGRA  +
Sbjct: 1294 SNGIQPDKVTFIGVLSACSHSGLFSEAYKYFDAMFKTYGITPEIEHYSCLVDALGRAGRI 1353

Query: 437  KDAINFIDEMGVTPGSTVLRTLISFCIVHGHVKLGLASIAKMILLGQMDSSAFVLLSNLY 258
            ++A N I  M     +++ R L+  C   G  +       K++ L   DSSA+VLLSN+Y
Sbjct: 1354 QEAENVIASMPFKASASMYRALLGACRTKGDAETAKRVADKLLALDPSDSSAYVLLSNIY 1413

Query: 257  AIEKKWHARTKVRDAMEVDIKNAKK-VAVSWI 165
            A  ++W   T  R+ M+  +KN KK    SWI
Sbjct: 1414 AASRQWDDVTDARNMMK--LKNVKKDPGFSWI 1443



 Score = 63.2 bits (152), Expect = 1e-07
 Identities = 48/178 (26%), Positives = 81/178 (45%), Gaps = 4/178 (2%)
 Frame = -1

Query: 797  DKYIGSALIDMYCKCGSIGCALRYFSTASKHDIILWNALLSGYAQHGNVLEMLKAYEKMI 618
            D ++ S ++DMY KCG +  AL  F   S+ D + W  ++SGY ++G+    L  Y  M 
Sbjct: 1133 DLWVSSGVLDMYIKCGDMPNALELFGEISRPDEVAWTTMISGYIENGDEDHALSVYHLMR 1192

Query: 617  GHGIEPDSITFLSVLSGCSHGGLLDKVFQYFAS-MRDEYGIIPQMEHH--ACVVDALGRA 447
              G++PD  TF +++   S    L++  Q  A+ ++ +Y     ++H     +VD   + 
Sbjct: 1193 VSGVQPDEYTFATLIKASSCLTALEQGKQIHANVVKLDY----SLDHFVGTSLVDMYCKC 1248

Query: 446  DLLKDAINFIDEMGVTPGSTVLRTLISFCIVHGHVKLGLASIAKMILLG-QMDSSAFV 276
              ++DA     +M V         ++     HGHV   L     M   G Q D   F+
Sbjct: 1249 GSVQDAYRVFRKMDVRK-VVFWNAMLLGLAQHGHVDEALNLFRTMQSNGIQPDKVTFI 1305



 Score = 59.7 bits (143), Expect = 1e-06
 Identities = 26/78 (33%), Positives = 46/78 (58%)
 Frame = -1

Query: 788  IGSALIDMYCKCGSIGCALRYFSTASKHDIILWNALLSGYAQHGNVLEMLKAYEKMIGHG 609
            + ++L++MY K G +  A + F  + + D+I WN ++S YAQ+   +E +  +  ++  G
Sbjct: 931  VSNSLMNMYSKAGVVYAAEKTFINSPELDLISWNTMISSYAQNNLEMEAICTFRDLLRDG 990

Query: 608  IEPDSITFLSVLSGCSHG 555
            ++PD  T  SVL  CS G
Sbjct: 991  LKPDQFTLASVLRACSTG 1008


>ref|XP_004141438.1| PREDICTED: pentatricopeptide repeat-containing protein At4g33170-like
            [Cucumis sativus]
          Length = 1573

 Score =  183 bits (464), Expect = 7e-44
 Identities = 90/212 (42%), Positives = 131/212 (61%), Gaps = 1/212 (0%)
 Frame = -1

Query: 797  DKYIGSALIDMYCKCGSIGCALRYFSTASKHDIILWNALLSGYAQHGNVLEMLKAYEKMI 618
            D ++G++L+DMYCKCGS+  A R F       ++ WNA+L G AQHG+V E L  +  M 
Sbjct: 1234 DHFVGTSLVDMYCKCGSVQDAYRVFRKMDVRKVVFWNAMLLGLAQHGHVDEALNLFRTMQ 1293

Query: 617  GHGIEPDSITFLSVLSGCSHGGLLDKVFQYFASMRDEYGIIPQMEHHACVVDALGRADLL 438
             +GI+PD +TF+ VLS CSH GL  + ++YF +M   YGI P++EH++C+VDALGRA  +
Sbjct: 1294 SNGIQPDKVTFIGVLSACSHSGLFSEAYKYFDAMFKTYGITPEIEHYSCLVDALGRAGRI 1353

Query: 437  KDAINFIDEMGVTPGSTVLRTLISFCIVHGHVKLGLASIAKMILLGQMDSSAFVLLSNLY 258
            ++A N I  M     +++ R L+  C   G  +       K++ L   DSSA+VLLSN+Y
Sbjct: 1354 QEAENVIASMPFKASASMYRALLGACRTKGDAETAKRVADKLLALDPSDSSAYVLLSNIY 1413

Query: 257  AIEKKWHARTKVRDAMEVDIKNAKK-VAVSWI 165
            A  ++W   T  R+ M+  +KN KK    SWI
Sbjct: 1414 AASRQWDDVTDARNMMK--LKNVKKDPGFSWI 1443



 Score = 63.2 bits (152), Expect = 1e-07
 Identities = 48/178 (26%), Positives = 81/178 (45%), Gaps = 4/178 (2%)
 Frame = -1

Query: 797  DKYIGSALIDMYCKCGSIGCALRYFSTASKHDIILWNALLSGYAQHGNVLEMLKAYEKMI 618
            D ++ S ++DMY KCG +  AL  F   S+ D + W  ++SGY ++G+    L  Y  M 
Sbjct: 1133 DLWVSSGVLDMYIKCGDMPNALELFGEISRPDEVAWTTMISGYIENGDEDHALSVYHLMR 1192

Query: 617  GHGIEPDSITFLSVLSGCSHGGLLDKVFQYFAS-MRDEYGIIPQMEHH--ACVVDALGRA 447
              G++PD  TF +++   S    L++  Q  A+ ++ +Y     ++H     +VD   + 
Sbjct: 1193 VSGVQPDEYTFATLIKASSCLTALEQGKQIHANVVKLDY----SLDHFVGTSLVDMYCKC 1248

Query: 446  DLLKDAINFIDEMGVTPGSTVLRTLISFCIVHGHVKLGLASIAKMILLG-QMDSSAFV 276
              ++DA     +M V         ++     HGHV   L     M   G Q D   F+
Sbjct: 1249 GSVQDAYRVFRKMDVRK-VVFWNAMLLGLAQHGHVDEALNLFRTMQSNGIQPDKVTFI 1305



 Score = 59.7 bits (143), Expect = 1e-06
 Identities = 26/78 (33%), Positives = 46/78 (58%)
 Frame = -1

Query: 788  IGSALIDMYCKCGSIGCALRYFSTASKHDIILWNALLSGYAQHGNVLEMLKAYEKMIGHG 609
            + ++L++MY K G +  A + F  + + D+I WN ++S YAQ+   +E +  +  ++  G
Sbjct: 931  VSNSLMNMYSKAGVVYAAEKTFINSPELDLISWNTMISSYAQNNLEMEAICTFRDLLRDG 990

Query: 608  IEPDSITFLSVLSGCSHG 555
            ++PD  T  SVL  CS G
Sbjct: 991  LKPDQFTLASVLRACSTG 1008


>ref|XP_004160754.1| PREDICTED: pentatricopeptide repeat-containing protein At2g27610-like
            [Cucumis sativus]
          Length = 766

 Score =  181 bits (459), Expect = 3e-43
 Identities = 92/213 (43%), Positives = 134/213 (62%), Gaps = 1/213 (0%)
 Frame = -1

Query: 797  DKYIGSALIDMYCKCGSIGCALRYFSTASKHDIILWNALLSGYAQHGNVLEMLKAYEKMI 618
            DK I SAL+DMY KCG +G A + F+  S  D + W A+++G+AQHG V + L+ + +M+
Sbjct: 511  DKCIESALVDMYAKCGCLGDAKKVFNRISNADTVSWTAIIAGHAQHGIVDDALQLFRRMV 570

Query: 617  GHGIEPDSITFLSVLSGCSHGGLLDKVFQYFASMRDEYGIIPQMEHHACVVDALGRADLL 438
              G+EP+++TFL VL  CSHGGL+++  QYF  M+  YG++P+MEH+AC+VD L R   L
Sbjct: 571  QLGVEPNAVTFLCVLFACSHGGLVEEGLQYFKLMKKTYGLVPEMEHYACIVDLLSRVGHL 630

Query: 437  KDAINFIDEMGVTPGSTVLRTLISFCIVHGHVKLGLASIAKMILLGQMDSSAFVLLSNLY 258
             DA+ FI  M V P   V +TL+  C VHG+V+LG  +  K++     +S+ +VLLSN Y
Sbjct: 631  NDAMEFISRMPVEPNEMVWQTLLGACRVHGNVELGELAAQKILSFKAENSATYVLLSNTY 690

Query: 257  AIEKKWHARTKVRDAM-EVDIKNAKKVAVSWIS 162
                 +     +R  M E  +K  K+   SWIS
Sbjct: 691  IESGSYKDGLSLRHVMKEQGVK--KEPGCSWIS 721



 Score = 58.2 bits (139), Expect = 3e-06
 Identities = 48/179 (26%), Positives = 84/179 (46%), Gaps = 1/179 (0%)
 Frame = -1

Query: 788 IGSALIDMYCKCGSIGCALRYFSTASKHDIILWNALLSGYAQHGNVLEMLKAYEKMIGHG 609
           I +A+ + Y KCGS+    + F+     D+I W +L++ Y+Q     + ++ +  M   G
Sbjct: 413 ISNAVANAYAKCGSLEDVRKVFNRMEDRDLISWTSLVTAYSQCSEWDKAIEIFSNMRAEG 472

Query: 608 IEPDSITFLSVLSGCSHGGLLDKVFQYFASMRDEYGIIPQMEHHACVVDALGRADLLKDA 429
           I P+  TF SVL  C++  LL+   Q    +  + G+       + +VD   +   L DA
Sbjct: 473 IAPNQFTFSSVLVSCANLCLLE-YGQQVHGIICKVGLDMDKCIESALVDMYAKCGCLGDA 531

Query: 428 INFIDEMGVTPGSTVLRT-LISFCIVHGHVKLGLASIAKMILLGQMDSSAFVLLSNLYA 255
               +   ++   TV  T +I+    HG V   L    +M+ LG ++ +A   L  L+A
Sbjct: 532 KKVFNR--ISNADTVSWTAIIAGHAQHGIVDDALQLFRRMVQLG-VEPNAVTFLCVLFA 587


>ref|XP_006574752.1| PREDICTED: pentatricopeptide repeat-containing protein At4g13650-like
            isoform X1 [Glycine max] gi|571439084|ref|XP_006574753.1|
            PREDICTED: pentatricopeptide repeat-containing protein
            At4g13650-like isoform X2 [Glycine max]
            gi|571439086|ref|XP_006574754.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At4g13650-like isoform X3 [Glycine max]
            gi|571439088|ref|XP_006574755.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At4g13650-like isoform X4 [Glycine max]
            gi|571439090|ref|XP_006574756.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At4g13650-like isoform X5 [Glycine max]
          Length = 1082

 Score =  181 bits (458), Expect = 3e-43
 Identities = 90/213 (42%), Positives = 137/213 (64%)
 Frame = -1

Query: 803  EEDKYIGSALIDMYCKCGSIGCALRYFSTASKHDIILWNALLSGYAQHGNVLEMLKAYEK 624
            + +  + + LI +Y KCG+I  A R F    + + I WNA+L+GY+QHG+  + L  +E 
Sbjct: 742  DSETEVSNVLITLYAKCGNIDDAERQFFEMPEKNEISWNAMLTGYSQHGHGFKALSLFED 801

Query: 623  MIGHGIEPDSITFLSVLSGCSHGGLLDKVFQYFASMRDEYGIIPQMEHHACVVDALGRAD 444
            M   G+ P+ +TF+ VLS CSH GL+D+  +YF SMR+ +G++P+ EH+ACVVD LGR+ 
Sbjct: 802  MKQLGVLPNHVTFVGVLSACSHVGLVDEGIKYFQSMREVHGLVPKPEHYACVVDLLGRSG 861

Query: 443  LLKDAINFIDEMGVTPGSTVLRTLISFCIVHGHVKLGLASIAKMILLGQMDSSAFVLLSN 264
            LL  A  F++EM + P + V RTL+S CIVH ++ +G  + + ++ L   DS+ +VLLSN
Sbjct: 862  LLSRARRFVEEMPIQPDAMVCRTLLSACIVHKNIDIGEFAASHLLELEPKDSATYVLLSN 921

Query: 263  LYAIEKKWHARTKVRDAMEVDIKNAKKVAVSWI 165
            +YA+  KW  R + R  M+ D    K+   SWI
Sbjct: 922  MYAVTGKWGCRDRTRQMMK-DRGVKKEPGRSWI 953



 Score = 74.3 bits (181), Expect = 5e-11
 Identities = 49/165 (29%), Positives = 79/165 (47%)
 Frame = -1

Query: 791  YIGSALIDMYCKCGSIGCALRYFSTASKHDIILWNALLSGYAQHGNVLEMLKAYEKMIGH 612
            Y+ S LIDMY K G +  AL+ F    + D++ W A+++GYAQH    E L  +++M   
Sbjct: 544  YVSSVLIDMYAKLGKLDHALKIFRRLKEKDVVSWTAMIAGYAQHEKFAEALNLFKEMQDQ 603

Query: 611  GIEPDSITFLSVLSGCSHGGLLDKVFQYFASMRDEYGIIPQMEHHACVVDALGRADLLKD 432
            GI  D+I F S +S C+    L++  Q  A      G    +     +V    R   ++D
Sbjct: 604  GIHSDNIGFASAISACAGIQALNQGQQIHAQACVS-GYSDDLSVGNALVSLYARCGKVRD 662

Query: 431  AINFIDEMGVTPGSTVLRTLISFCIVHGHVKLGLASIAKMILLGQ 297
            A    D++  +  +    +LIS     GH +  L+  ++M   GQ
Sbjct: 663  AYFAFDKI-FSKDNISWNSLISGFAQSGHCEEALSLFSQMSKAGQ 706



 Score = 64.3 bits (155), Expect = 5e-08
 Identities = 64/246 (26%), Positives = 100/246 (40%), Gaps = 34/246 (13%)
 Frame = -1

Query: 797  DKYIGSALIDMYCKCGSIGCALRYFSTASKHDIILWNALLSGYAQHGNVLEMLKAYEKMI 618
            D  +  AL+D+Y KC  I  A  +F +    +++LWN +L  Y    N+ E  K + +M 
Sbjct: 441  DIILEGALLDLYVKCSDIKTAHEFFLSTETENVVLWNVMLVAYGLLDNLNESFKIFTQMQ 500

Query: 617  GHGIEPDSITFLSVLSGCS-----------HGGLLDKVFQY----FASMRDEYGIIPQME 483
              GIEP+  T+ S+L  CS           H  +L   FQ+     + + D Y  + +++
Sbjct: 501  MEGIEPNQFTYPSILRTCSSLRAVDLGEQIHTQVLKTGFQFNVYVSSVLIDMYAKLGKLD 560

Query: 482  HHACVVDALGRADLLK---------------DAINFIDEM---GVTPGSTVLRTLISFCI 357
            H   +   L   D++                +A+N   EM   G+   +    + IS C 
Sbjct: 561  HALKIFRRLKEKDVVSWTAMIAGYAQHEKFAEALNLFKEMQDQGIHSDNIGFASAISACA 620

Query: 356  VHGHVKLGLASIAKMILLGQMDS-SAFVLLSNLYAIEKKWHARTKVRDAMEVDIKNAKKV 180
                +  G    A+  + G  D  S    L +LYA         KVRDA     K   K 
Sbjct: 621  GIQALNQGQQIHAQACVSGYSDDLSVGNALVSLYA------RCGKVRDAYFAFDKIFSKD 674

Query: 179  AVSWIS 162
             +SW S
Sbjct: 675  NISWNS 680


>ref|XP_004138557.1| PREDICTED: pentatricopeptide repeat-containing protein At2g27610-like
            [Cucumis sativus]
          Length = 766

 Score =  181 bits (458), Expect = 3e-43
 Identities = 92/213 (43%), Positives = 134/213 (62%), Gaps = 1/213 (0%)
 Frame = -1

Query: 797  DKYIGSALIDMYCKCGSIGCALRYFSTASKHDIILWNALLSGYAQHGNVLEMLKAYEKMI 618
            DK I SAL+DMY KCG +G A + F+  S  D + W A+++G+AQHG V + L+ + +M+
Sbjct: 511  DKCIESALVDMYAKCGCLGDAKKVFNRISNADTVSWTAIIAGHAQHGIVDDALQLFRRMV 570

Query: 617  GHGIEPDSITFLSVLSGCSHGGLLDKVFQYFASMRDEYGIIPQMEHHACVVDALGRADLL 438
              G+EP+++TFL VL  CSHGGL+++  QYF  M+  YG++P+MEH+AC+VD L R   L
Sbjct: 571  QLGVEPNAVTFLCVLFACSHGGLVEEGLQYFKLMKKTYGLVPEMEHYACIVDLLSRVGHL 630

Query: 437  KDAINFIDEMGVTPGSTVLRTLISFCIVHGHVKLGLASIAKMILLGQMDSSAFVLLSNLY 258
             DA+ FI  M V P   V +TL+  C VHG+V+LG  +  K++     +S+ +VLLSN Y
Sbjct: 631  NDAMEFISRMPVEPNEMVWQTLLGACRVHGNVELGELAAQKILSFKAENSATYVLLSNTY 690

Query: 257  AIEKKWHARTKVRDAM-EVDIKNAKKVAVSWIS 162
                 +     +R  M E  +K  K+   SWIS
Sbjct: 691  IESGSYKDGLSLRHLMKEQGVK--KEPGCSWIS 721



 Score = 58.2 bits (139), Expect = 3e-06
 Identities = 48/179 (26%), Positives = 84/179 (46%), Gaps = 1/179 (0%)
 Frame = -1

Query: 788 IGSALIDMYCKCGSIGCALRYFSTASKHDIILWNALLSGYAQHGNVLEMLKAYEKMIGHG 609
           I +A+ + Y KCGS+    + F+     D+I W +L++ Y+Q     + ++ +  M   G
Sbjct: 413 ISNAVANAYAKCGSLEDVRKVFNRMEDRDLISWTSLVTAYSQCSEWDKAIEIFSNMRAEG 472

Query: 608 IEPDSITFLSVLSGCSHGGLLDKVFQYFASMRDEYGIIPQMEHHACVVDALGRADLLKDA 429
           I P+  TF SVL  C++  LL+   Q    +  + G+       + +VD   +   L DA
Sbjct: 473 IAPNQFTFSSVLVSCANLCLLE-YGQQVHGIICKVGLDMDKCIESALVDMYAKCGCLGDA 531

Query: 428 INFIDEMGVTPGSTVLRT-LISFCIVHGHVKLGLASIAKMILLGQMDSSAFVLLSNLYA 255
               +   ++   TV  T +I+    HG V   L    +M+ LG ++ +A   L  L+A
Sbjct: 532 KKVFNR--ISNADTVSWTAIIAGHAQHGIVDDALQLFRRMVQLG-VEPNAVTFLCVLFA 587


>gb|EOY24832.1| Pentatricopeptide repeat superfamily protein, putative [Theobroma
            cacao]
          Length = 811

 Score =  180 bits (456), Expect = 6e-43
 Identities = 91/211 (43%), Positives = 137/211 (64%)
 Frame = -1

Query: 797  DKYIGSALIDMYCKCGSIGCALRYFSTASKHDIILWNALLSGYAQHGNVLEMLKAYEKMI 618
            D ++ +A+ID+YCKCGSIG A + F  AS  ++  WNA+++GYAQHG   E  + Y+KM 
Sbjct: 553  DCFVETAVIDLYCKCGSIGDAEKAFRYASMDNLAAWNAMITGYAQHGCYSEAFELYDKMT 612

Query: 617  GHGIEPDSITFLSVLSGCSHGGLLDKVFQYFASMRDEYGIIPQMEHHACVVDALGRADLL 438
              GI+PD IT+L VL+ C H GL+ +   Y  SM + +G+IP +EH+AC++D LGR  LL
Sbjct: 613  ECGIKPDEITYLGVLTSCCHTGLVLEAQYYMNSMVECHGLIPHLEHYACMIDLLGRVGLL 672

Query: 437  KDAINFIDEMGVTPGSTVLRTLISFCIVHGHVKLGLASIAKMILLGQMDSSAFVLLSNLY 258
            +DA   ID+M + P + + + L+S C +HG+V +G  + +K++ L   + SA+VLLSNL 
Sbjct: 673  EDAKRTIDQMPIGPDARIWQILLSACSIHGNVDMGRIAASKLLELQPNNESAYVLLSNLC 732

Query: 257  AIEKKWHARTKVRDAMEVDIKNAKKVAVSWI 165
            A    W+A  K+R  M+  +   K+   SWI
Sbjct: 733  ASAGMWNAVRKLRREMKEKLL-CKEPGSSWI 762


>ref|XP_002308660.2| hypothetical protein POPTR_0006s26860g [Populus trichocarpa]
           gi|550337158|gb|EEE92183.2| hypothetical protein
           POPTR_0006s26860g [Populus trichocarpa]
          Length = 487

 Score =  179 bits (455), Expect = 8e-43
 Identities = 84/203 (41%), Positives = 129/203 (63%)
 Frame = -1

Query: 788 IGSALIDMYCKCGSIGCALRYFSTASKHDIILWNALLSGYAQHGNVLEMLKAYEKMIGHG 609
           + +ALIDM+ KCG +  A   F +  + +I+ W +++ G A HG  +E +  +E+M+  G
Sbjct: 152 LSNALIDMFAKCGDVDKATNLFRSMRERNIVSWTSVIGGLAMHGRGVEAVAVFEEMVRSG 211

Query: 608 IEPDSITFLSVLSGCSHGGLLDKVFQYFASMRDEYGIIPQMEHHACVVDALGRADLLKDA 429
           + PD + F+ +LS CSH GL+DK   YF SMR ++ I+P++EH+ C+VD L RA L+K+A
Sbjct: 212 VTPDDVVFIGLLSACSHSGLVDKGKGYFDSMRKDFSIVPKIEHYGCMVDMLCRAGLVKEA 271

Query: 428 INFIDEMGVTPGSTVLRTLISFCIVHGHVKLGLASIAKMILLGQMDSSAFVLLSNLYAIE 249
           + F+ EM + P   V RTLI+ C  HG +KLG     ++I    M  S +VLLSN+YA  
Sbjct: 272 LKFVQEMPIDPNPVVWRTLINACRAHGELKLGEKITRQLIRNEPMHESNYVLLSNIYAKM 331

Query: 248 KKWHARTKVRDAMEVDIKNAKKV 180
             W  +T++R+AM  D+K  KK+
Sbjct: 332 SDWEKKTRIREAM--DMKGMKKI 352



 Score = 57.8 bits (138), Expect = 4e-06
 Identities = 51/194 (26%), Positives = 74/194 (38%), Gaps = 42/194 (21%)
 Frame = -1

Query: 788 IGSALIDMYCKC----GSIGCALRYFSTASKHDIILWNALLSGYAQHGNVLEMLKAYEKM 621
           + + L+ MYC C    G I  A + F    K D + W+A++ GY + G   + +  + +M
Sbjct: 47  VQNTLVHMYCCCRGGEGGIEFARKVFDEMYKSDSVSWSAMIGGYVRVGRSSDAINLFREM 106

Query: 620 IGHGIEPDSITFLSVLSGCSHGGLL----------------------------------- 546
              G+ PD IT +SVLS C+  G L                                   
Sbjct: 107 QIKGVCPDEITMVSVLSACTGLGALELGKWVESYVEKERVQKNVELSNALIDMFAKCGDV 166

Query: 545 DKVFQYFASMRDEYGIIPQMEHHACVVDALGRADLLKDAINFIDEM---GVTPGSTVLRT 375
           DK    F SMR+       +     V+  L       +A+   +EM   GVTP   V   
Sbjct: 167 DKATNLFRSMRER-----NIVSWTSVIGGLAMHGRGVEAVAVFEEMVRSGVTPDDVVFIG 221

Query: 374 LISFCIVHGHVKLG 333
           L+S C   G V  G
Sbjct: 222 LLSACSHSGLVDKG 235


>ref|NP_195043.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|75206840|sp|Q9SMZ2.1|PP347_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At4g33170 gi|4455331|emb|CAB36791.1| putative protein
            [Arabidopsis thaliana] gi|7270265|emb|CAB80034.1|
            putative protein [Arabidopsis thaliana]
            gi|332660786|gb|AEE86186.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 990

 Score =  179 bits (455), Expect = 8e-43
 Identities = 90/211 (42%), Positives = 130/211 (61%)
 Frame = -1

Query: 797  DKYIGSALIDMYCKCGSIGCALRYFSTASKHDIILWNALLSGYAQHGNVLEMLKAYEKMI 618
            D ++G++L+DMY KCGSI  A   F      +I  WNA+L G AQHG   E L+ +++M 
Sbjct: 652  DPFVGTSLVDMYAKCGSIDDAYCLFKRIEMMNITAWNAMLVGLAQHGEGKETLQLFKQMK 711

Query: 617  GHGIEPDSITFLSVLSGCSHGGLLDKVFQYFASMRDEYGIIPQMEHHACVVDALGRADLL 438
              GI+PD +TF+ VLS CSH GL+ + +++  SM  +YGI P++EH++C+ DALGRA L+
Sbjct: 712  SLGIKPDKVTFIGVLSACSHSGLVSEAYKHMRSMHGDYGIKPEIEHYSCLADALGRAGLV 771

Query: 437  KDAINFIDEMGVTPGSTVLRTLISFCIVHGHVKLGLASIAKMILLGQMDSSAFVLLSNLY 258
            K A N I+ M +   +++ RTL++ C V G  + G     K++ L  +DSSA+VLLSN+Y
Sbjct: 772  KQAENLIESMSMEASASMYRTLLAACRVQGDTETGKRVATKLLELEPLDSSAYVLLSNMY 831

Query: 257  AIEKKWHARTKVRDAMEVDIKNAKKVAVSWI 165
            A   KW      R  M+   K  K    SWI
Sbjct: 832  AAASKWDEMKLARTMMK-GHKVKKDPGFSWI 861



 Score = 59.7 bits (143), Expect = 1e-06
 Identities = 27/76 (35%), Positives = 46/76 (60%)
 Frame = -1

Query: 788 IGSALIDMYCKCGSIGCALRYFSTASKHDIILWNALLSGYAQHGNVLEMLKAYEKMIGHG 609
           + ++LI+MYCK    G A   F   S+ D+I WN++++G AQ+G  +E +  + +++  G
Sbjct: 352 VSNSLINMYCKLRKFGFARTVFDNMSERDLISWNSVIAGIAQNGLEVEAVCLFMQLLRCG 411

Query: 608 IEPDSITFLSVLSGCS 561
           ++PD  T  SVL   S
Sbjct: 412 LKPDQYTMTSVLKAAS 427


>gb|EOY11208.1| Pentatricopeptide repeat (PPR) superfamily protein isoform 2
            [Theobroma cacao]
          Length = 1072

 Score =  179 bits (454), Expect = 1e-42
 Identities = 91/207 (43%), Positives = 135/207 (65%), Gaps = 1/207 (0%)
 Frame = -1

Query: 782  SALIDMYCKCGSIGCALRYFSTASKHDIILWNALLSGYAQHGNVLEMLKAYEKMIGHGIE 603
            + LI +Y KCGSI  A + F    + + + WNA+++GY+QHG  +E +  +EKM   G+ 
Sbjct: 739  NVLITLYAKCGSIDDAKKEFLEIPEKNEVSWNAMITGYSQHGYGIEAIDLFEKMKQVGVT 798

Query: 602  PDSITFLSVLSGCSHGGLLDKVFQYFASMRDEYGIIPQMEHHACVVDALGRADLLKDAIN 423
            P+ +T + VLS CSH GL+D+   YF SM  E+G++P+ EH+ACVVD LGRA LL  A  
Sbjct: 799  PNPVTLVGVLSACSHVGLVDEGLDYFDSMSKEHGLVPKPEHYACVVDLLGRAGLLCRARK 858

Query: 422  FIDEMGVTPGSTVLRTLISFCIVHGHVKLGLASIAKMILLGQMDSSAFVLLSNLYAIEKK 243
            F+++M + P + + RTL+S C VH +V +G  +   ++ L   DS+++VLLSNLYA+ KK
Sbjct: 859  FVEDMPIEPDAIIWRTLLSACAVHKNVDIGEFAAHHLLKLEPQDSASYVLLSNLYAVSKK 918

Query: 242  WHARTKVRDAM-EVDIKNAKKVAVSWI 165
            W +R + R  M E  +K  K+ A SWI
Sbjct: 919  WDSRDQTRQMMKERGVK--KEPAQSWI 943



 Score = 64.7 bits (156), Expect = 4e-08
 Identities = 28/77 (36%), Positives = 44/77 (57%)
 Frame = -1

Query: 791 YIGSALIDMYCKCGSIGCALRYFSTASKHDIILWNALLSGYAQHGNVLEMLKAYEKMIGH 612
           Y+ S LIDMY K G +  AL       + D++ W A+++GY QH    E L+ + +M+  
Sbjct: 534 YVCSVLIDMYAKLGKLETALEILRKLPEEDVVSWTAMIAGYTQHDMFYEALELFGEMLNR 593

Query: 611 GIEPDSITFLSVLSGCS 561
           GI+ D+I   S +S C+
Sbjct: 594 GIQSDNIGLSSAISACA 610



 Score = 64.3 bits (155), Expect = 5e-08
 Identities = 29/85 (34%), Positives = 46/85 (54%)
 Frame = -1

Query: 797 DKYIGSALIDMYCKCGSIGCALRYFSTASKHDIILWNALLSGYAQHGNVLEMLKAYEKMI 618
           D  +  +L+D+Y KC  I  A  +FST    +++LWN +L  Y Q  N+ E    + +M 
Sbjct: 431 DIIVEGSLLDLYLKCSDIETAYEFFSTTETENVVLWNVMLVAYGQLDNLSESFHIFRQMQ 490

Query: 617 GHGIEPDSITFLSVLSGCSHGGLLD 543
             G+ P+  T+ S+L  C+  G LD
Sbjct: 491 IEGLVPNQFTYPSILRTCTSLGALD 515


>gb|EOY11207.1| Pentatricopeptide repeat superfamily protein isoform 1 [Theobroma
            cacao]
          Length = 1389

 Score =  179 bits (454), Expect = 1e-42
 Identities = 91/207 (43%), Positives = 135/207 (65%), Gaps = 1/207 (0%)
 Frame = -1

Query: 782  SALIDMYCKCGSIGCALRYFSTASKHDIILWNALLSGYAQHGNVLEMLKAYEKMIGHGIE 603
            + LI +Y KCGSI  A + F    + + + WNA+++GY+QHG  +E +  +EKM   G+ 
Sbjct: 739  NVLITLYAKCGSIDDAKKEFLEIPEKNEVSWNAMITGYSQHGYGIEAIDLFEKMKQVGVT 798

Query: 602  PDSITFLSVLSGCSHGGLLDKVFQYFASMRDEYGIIPQMEHHACVVDALGRADLLKDAIN 423
            P+ +T + VLS CSH GL+D+   YF SM  E+G++P+ EH+ACVVD LGRA LL  A  
Sbjct: 799  PNPVTLVGVLSACSHVGLVDEGLDYFDSMSKEHGLVPKPEHYACVVDLLGRAGLLCRARK 858

Query: 422  FIDEMGVTPGSTVLRTLISFCIVHGHVKLGLASIAKMILLGQMDSSAFVLLSNLYAIEKK 243
            F+++M + P + + RTL+S C VH +V +G  +   ++ L   DS+++VLLSNLYA+ KK
Sbjct: 859  FVEDMPIEPDAIIWRTLLSACAVHKNVDIGEFAAHHLLKLEPQDSASYVLLSNLYAVSKK 918

Query: 242  WHARTKVRDAM-EVDIKNAKKVAVSWI 165
            W +R + R  M E  +K  K+ A SWI
Sbjct: 919  WDSRDQTRQMMKERGVK--KEPAQSWI 943



 Score = 64.7 bits (156), Expect = 4e-08
 Identities = 28/77 (36%), Positives = 44/77 (57%)
 Frame = -1

Query: 791 YIGSALIDMYCKCGSIGCALRYFSTASKHDIILWNALLSGYAQHGNVLEMLKAYEKMIGH 612
           Y+ S LIDMY K G +  AL       + D++ W A+++GY QH    E L+ + +M+  
Sbjct: 534 YVCSVLIDMYAKLGKLETALEILRKLPEEDVVSWTAMIAGYTQHDMFYEALELFGEMLNR 593

Query: 611 GIEPDSITFLSVLSGCS 561
           GI+ D+I   S +S C+
Sbjct: 594 GIQSDNIGLSSAISACA 610



 Score = 64.3 bits (155), Expect = 5e-08
 Identities = 29/85 (34%), Positives = 46/85 (54%)
 Frame = -1

Query: 797 DKYIGSALIDMYCKCGSIGCALRYFSTASKHDIILWNALLSGYAQHGNVLEMLKAYEKMI 618
           D  +  +L+D+Y KC  I  A  +FST    +++LWN +L  Y Q  N+ E    + +M 
Sbjct: 431 DIIVEGSLLDLYLKCSDIETAYEFFSTTETENVVLWNVMLVAYGQLDNLSESFHIFRQMQ 490

Query: 617 GHGIEPDSITFLSVLSGCSHGGLLD 543
             G+ P+  T+ S+L  C+  G LD
Sbjct: 491 IEGLVPNQFTYPSILRTCTSLGALD 515


>gb|EMJ26738.1| hypothetical protein PRUPE_ppa026705mg [Prunus persica]
          Length = 484

 Score =  178 bits (452), Expect = 2e-42
 Identities = 85/203 (41%), Positives = 129/203 (63%)
 Frame = -1

Query: 788 IGSALIDMYCKCGSIGCALRYFSTASKHDIILWNALLSGYAQHGNVLEMLKAYEKMIGHG 609
           + +ALIDM+ KCG +  AL+ F   S   I+ W +++ G A HG  +E +  +E+MIG G
Sbjct: 149 LSNALIDMFSKCGDVEKALKLFRNMSGRTIVSWTSVIDGLAMHGRGMEAMSLFEEMIGTG 208

Query: 608 IEPDSITFLSVLSGCSHGGLLDKVFQYFASMRDEYGIIPQMEHHACVVDALGRADLLKDA 429
           + PD + F+ + S CSH GL++K   YF+SM +++ I+P++EH+ C+VD L RA L+K+A
Sbjct: 209 VAPDDVAFIGLFSACSHSGLVEKGKSYFSSMVEKFHIVPKIEHYGCMVDMLCRAGLVKEA 268

Query: 428 INFIDEMGVTPGSTVLRTLISFCIVHGHVKLGLASIAKMILLGQMDSSAFVLLSNLYAIE 249
           + FI +M + P   VLRTLIS C  HG +KLG +   ++I    M  S +VLLSN+YA  
Sbjct: 269 LEFIQKMPIEPNPIVLRTLISACRAHGELKLGESITKELIRNEPMQESNYVLLSNIYAKM 328

Query: 248 KKWHARTKVRDAMEVDIKNAKKV 180
             W  + K+R+ M  D +  KK+
Sbjct: 329 THWEKKAKIREVM--DKRGMKKI 349



 Score = 68.2 bits (165), Expect = 3e-09
 Identities = 48/180 (26%), Positives = 89/180 (49%), Gaps = 4/180 (2%)
 Frame = -1

Query: 803 EEDKYIGSALIDMYCKC-GSIGCALRYFSTASKHDIILWNALLSGYAQHGNVLEMLKAYE 627
           ++D ++ + ++ MYC C G I  A + F    K D + W+A++ GY + G   + +  + 
Sbjct: 42  DDDVHVCNTMVHMYCCCSGGIESARKVFDEMPKLDSVSWSAMIGGYVRVGWSTDAVDLFR 101

Query: 626 KMIGHGIEPDSITFLSVLSGCSHGGLLDKVFQYFASMRDEYGIIPQMEHHACVVDALGRA 447
           +M   G+ PD IT +SVLS C+  G L+ + ++  S  D+ GI   +E    ++D   + 
Sbjct: 102 EMQMVGVRPDEITMVSVLSACTDLGALE-LGKWVESYIDKEGIQKTVELSNALIDMFSKC 160

Query: 446 DLLKDAINFIDEMGVTPGSTVL--RTLISFCIVHGHVKLGLASIAKMILLG-QMDSSAFV 276
             ++ A+     M    G T++   ++I    +HG     ++   +MI  G   D  AF+
Sbjct: 161 GDVEKALKLFRNMS---GRTIVSWTSVIDGLAMHGRGMEAMSLFEEMIGTGVAPDDVAFI 217


>ref|XP_004291465.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g21065-like [Fragaria vesca subsp. vesca]
          Length = 588

 Score =  177 bits (450), Expect = 3e-42
 Identities = 87/201 (43%), Positives = 128/201 (63%)
 Frame = -1

Query: 782 SALIDMYCKCGSIGCALRYFSTASKHDIILWNALLSGYAQHGNVLEMLKAYEKMIGHGIE 603
           +ALIDM+ KCG++  AL+ F   S   I+ W +++ G A HG   E +  +E+MIG G+E
Sbjct: 255 NALIDMFAKCGNVDKALKLFRRMSGRTIVSWTSVIDGLAMHGRGEEAVGLFEEMIGDGVE 314

Query: 602 PDSITFLSVLSGCSHGGLLDKVFQYFASMRDEYGIIPQMEHHACVVDALGRADLLKDAIN 423
           PD + F+ +LS CSH GL+D+  +YF+ M + +  +P++EH+ C+VD L RA  +K+A+ 
Sbjct: 315 PDDVAFIGLLSACSHSGLVDEGKRYFSMMVERFSCVPKIEHYGCMVDMLCRAGRVKEALE 374

Query: 422 FIDEMGVTPGSTVLRTLISFCIVHGHVKLGLASIAKMILLGQMDSSAFVLLSNLYAIEKK 243
           FI +M + P   VLRTLIS C  HG +KLG +   ++I    M  S +VLLSN+YA    
Sbjct: 375 FIRKMPIKPNPIVLRTLISACRAHGELKLGESITKELIRAEPMHESNYVLLSNIYAKMNH 434

Query: 242 WHARTKVRDAMEVDIKNAKKV 180
           W  +TK R+AM  D K  KK+
Sbjct: 435 WEKKTKTREAM--DKKGMKKI 453



 Score = 62.0 bits (149), Expect = 2e-07
 Identities = 43/179 (24%), Positives = 87/179 (48%), Gaps = 4/179 (2%)
 Frame = -1

Query: 800 EDKYIGSALIDMYC-KCGSIGCALRYFSTASKHDIILWNALLSGYAQHGNVLEMLKAYEK 624
           +D ++ + ++ MYC   G +  A + F      D + W+A++ GY + G   + ++ + +
Sbjct: 147 DDVHVRNTMVHMYCCSGGGVESARKVFDEMPSSDSVAWSAMIGGYVRVGWSSDAVEMFRE 206

Query: 623 MIGHGIEPDSITFLSVLSGCSHGGLLDKVFQYFASMRDEYGIIPQMEHHACVVDALGRAD 444
           M   G+ PD +T +SVLS C+  G L+ + ++  S  ++ GI   +E    ++D   +  
Sbjct: 207 MQMRGVRPDEVTMVSVLSACTDLGALE-LGKWVESYIEKEGIQKSVELCNALIDMFAKCG 265

Query: 443 LLKDAINFIDEMGVTPGSTVL--RTLISFCIVHGHVKLGLASIAKMILLG-QMDSSAFV 276
            +  A+     M    G T++   ++I    +HG  +  +    +MI  G + D  AF+
Sbjct: 266 NVDKALKLFRRMS---GRTIVSWTSVIDGLAMHGRGEEAVGLFEEMIGDGVEPDDVAFI 321

