BLASTX nr result

ID: Catharanthus22_contig00017323 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00017323
         (1734 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006358787.1| PREDICTED: pentatricopeptide repeat-containi...   708   0.0  
ref|XP_004248026.1| PREDICTED: pentatricopeptide repeat-containi...   700   0.0  
ref|XP_002268375.1| PREDICTED: pentatricopeptide repeat-containi...   621   e-175
gb|EPS58676.1| hypothetical protein M569_16136, partial [Genlise...   615   e-173
gb|EXB93122.1| hypothetical protein L484_024459 [Morus notabilis]     597   e-168
ref|XP_004142520.1| PREDICTED: pentatricopeptide repeat-containi...   595   e-167
gb|EOY10792.1| Pentatricopeptide repeat (PPR) superfamily protei...   593   e-166
gb|EMJ07970.1| hypothetical protein PRUPE_ppa025361mg [Prunus pe...   584   e-164
ref|XP_004296694.1| PREDICTED: pentatricopeptide repeat-containi...   559   e-156
ref|XP_006830661.1| hypothetical protein AMTR_s00210p00017530 [A...   556   e-156
ref|XP_004504788.1| PREDICTED: pentatricopeptide repeat-containi...   548   e-153
ref|XP_002522032.1| pentatricopeptide repeat-containing protein,...   548   e-153
ref|XP_006479008.1| PREDICTED: pentatricopeptide repeat-containi...   541   e-151
ref|XP_006400384.1| hypothetical protein EUTSA_v10013477mg [Eutr...   537   e-150
ref|XP_006287650.1| hypothetical protein CARUB_v10000860mg, part...   535   e-149
ref|XP_002871814.1| pentatricopeptide repeat-containing protein ...   533   e-149
ref|NP_197340.1| pentatricopeptide repeat-containing protein [Ar...   532   e-148
gb|ESW31089.1| hypothetical protein PHAVU_002G208300g [Phaseolus...   519   e-144
ref|XP_003524064.1| PREDICTED: pentatricopeptide repeat-containi...   515   e-143
ref|XP_002325369.2| hypothetical protein POPTR_0019s04240g [Popu...   481   e-133

>ref|XP_006358787.1| PREDICTED: pentatricopeptide repeat-containing protein At5g18390,
            mitochondrial-like isoform X1 [Solanum tuberosum]
            gi|565385886|ref|XP_006358788.1| PREDICTED:
            pentatricopeptide repeat-containing protein At5g18390,
            mitochondrial-like isoform X2 [Solanum tuberosum]
            gi|565385889|ref|XP_006358789.1| PREDICTED:
            pentatricopeptide repeat-containing protein At5g18390,
            mitochondrial-like isoform X3 [Solanum tuberosum]
          Length = 472

 Score =  708 bits (1827), Expect = 0.0
 Identities = 350/467 (74%), Positives = 388/467 (83%), Gaps = 8/467 (1%)
 Frame = +3

Query: 177  ARYILIRITSPHLISPTNGMLKYLRTIMTTSDRV-------GNC-KNTSNDDYFAAIHHI 332
            AR    R       +    + + L+TI  + D+        G+C ++  NDDYFA IHH+
Sbjct: 6    ARIFFQRTLYSATFTNNTSIFRCLKTIAPSIDQCSNPFSGKGSCGRSVPNDDYFATIHHV 65

Query: 333  SNIVRRDIYMERTLNKMCISNIVNSELVYRVLRSCCRCGIESFRFFNWARTNHPNYDPXX 512
            SNIVRRDIY+ERTLNKM IS+IVNSELVYRVLRSCC+ GIESFRFFNWART HP YDP  
Sbjct: 66   SNIVRRDIYLERTLNKMHISSIVNSELVYRVLRSCCQHGIESFRFFNWARTQHPQYDPTT 125

Query: 513  XXXXXXXXXXSRTGHWETMWKVAHQMKVQNXXXXXXXXXXXXEHYGKHGLIDQAVELFNR 692
                      +RT HWETMWKV  QMK QN            EHYGK GLIDQAVELFNR
Sbjct: 126  VEFEELLKTLARTAHWETMWKVVQQMKAQNIPISPSIVSFIIEHYGKRGLIDQAVELFNR 185

Query: 693  LKNFNCPQTTEVYNSLLFALCEVKNFQGAYALIRRMLRKDAVPDKRTYAILVNGWCSAGK 872
            LKNF+CPQTTEVYN++LFALCEVKNFQGAYALIRRM+RK  VPDK+TY+ILVNGWCSAGK
Sbjct: 186  LKNFDCPQTTEVYNAMLFALCEVKNFQGAYALIRRMIRKGTVPDKQTYSILVNGWCSAGK 245

Query: 873  MREAQNFLEEMSRKGFNPPVRGRDLLIDGLLNAGYLEAAKGLVRKMTKEGFVPDVGTFNS 1052
            MREAQ FLEEMSRKGFNPPVRGRDLLIDGLL+AGYLE+AKGLVRKMTKEGFVPDVGTFNS
Sbjct: 246  MREAQEFLEEMSRKGFNPPVRGRDLLIDGLLSAGYLESAKGLVRKMTKEGFVPDVGTFNS 305

Query: 1053 LAEAICKSGEIDFCIDLFNDVCRLGLSPDVDSYKIMVTAASKVGRIDEAFHILHRSIEDG 1232
            LAEAICK+GEIDFCIDLFNDVCR GL PD+++YKI++TAASKVGRIDEAF ILHRSIE G
Sbjct: 306  LAEAICKTGEIDFCIDLFNDVCRSGLFPDIETYKIVITAASKVGRIDEAFQILHRSIEAG 365

Query: 1233 LRPFPSLYAPILKALCRKGQFDDAFSFFSDMKVKGHPPNRPVYTMLIKMCGRGGRFLEAA 1412
             RPFPSLYAPILKA  R+GQFDDAFSFFS+MK+KGHPPNRP+YTMLIKMC RGGRF+EA+
Sbjct: 366  HRPFPSLYAPILKAFFRRGQFDDAFSFFSEMKLKGHPPNRPLYTMLIKMCSRGGRFVEAS 425

Query: 1413 NYLVEMTELNLMPLSRNFDMVTDGLKACGKLDLAKRIEQLEISFRGI 1553
            NYLVEMTELNL+P+SR+FDMVTDGLK CGK DLAKRIEQLEIS +GI
Sbjct: 426  NYLVEMTELNLLPMSRSFDMVTDGLKNCGKHDLAKRIEQLEISVKGI 472


>ref|XP_004248026.1| PREDICTED: pentatricopeptide repeat-containing protein At5g18390,
            mitochondrial-like [Solanum lycopersicum]
          Length = 470

 Score =  700 bits (1806), Expect = 0.0
 Identities = 341/426 (80%), Positives = 369/426 (86%), Gaps = 1/426 (0%)
 Frame = +3

Query: 279  GNC-KNTSNDDYFAAIHHISNIVRRDIYMERTLNKMCISNIVNSELVYRVLRSCCRCGIE 455
            G+C ++  NDDYFA IHH+SNIVRRDIY+ERTLNKM IS IVNSELVYRVLRSCC+ GIE
Sbjct: 45   GSCGRSVPNDDYFATIHHVSNIVRRDIYLERTLNKMHISRIVNSELVYRVLRSCCQHGIE 104

Query: 456  SFRFFNWARTNHPNYDPXXXXXXXXXXXXSRTGHWETMWKVAHQMKVQNXXXXXXXXXXX 635
            SFRFFNWART HP YDP            +RT HWETMWKV  QMK QN           
Sbjct: 105  SFRFFNWARTQHPQYDPTTVEFEELLKTLARTAHWETMWKVVQQMKAQNIPISPSIVSFI 164

Query: 636  XEHYGKHGLIDQAVELFNRLKNFNCPQTTEVYNSLLFALCEVKNFQGAYALIRRMLRKDA 815
             EHYGK GLIDQAVELFNRLKNF C QTTEVYN++LFALCEVKNFQGAYALIRRM+RK  
Sbjct: 165  IEHYGKRGLIDQAVELFNRLKNFGCSQTTEVYNAMLFALCEVKNFQGAYALIRRMIRKGT 224

Query: 816  VPDKRTYAILVNGWCSAGKMREAQNFLEEMSRKGFNPPVRGRDLLIDGLLNAGYLEAAKG 995
            VPDK TY+ILVNGWCSAGKMREAQ FLEEMSRKGFNPPVRGRDLLIDGLL+AGYLE+AKG
Sbjct: 225  VPDKLTYSILVNGWCSAGKMREAQEFLEEMSRKGFNPPVRGRDLLIDGLLSAGYLESAKG 284

Query: 996  LVRKMTKEGFVPDVGTFNSLAEAICKSGEIDFCIDLFNDVCRLGLSPDVDSYKIMVTAAS 1175
            LVRKMTKEGFVPDVGTFNSLAEA+CK+GEIDFCIDLFNDVCRLGL PD ++YKI++TAA+
Sbjct: 285  LVRKMTKEGFVPDVGTFNSLAEAVCKTGEIDFCIDLFNDVCRLGLCPDTETYKIVITAAA 344

Query: 1176 KVGRIDEAFHILHRSIEDGLRPFPSLYAPILKALCRKGQFDDAFSFFSDMKVKGHPPNRP 1355
            K GRIDEAF ILHRSIE G RPFPSLYAPILKA  R+GQFDDAFSFFSDMKVKGHPPNRP
Sbjct: 345  KAGRIDEAFQILHRSIEAGHRPFPSLYAPILKAFFRRGQFDDAFSFFSDMKVKGHPPNRP 404

Query: 1356 VYTMLIKMCGRGGRFLEAANYLVEMTELNLMPLSRNFDMVTDGLKACGKLDLAKRIEQLE 1535
            +YTMLIKMC RGGRF+EA+NYLVEMTELNL+P+SR+FD VTDGLK CGK DLAKRIEQLE
Sbjct: 405  LYTMLIKMCCRGGRFVEASNYLVEMTELNLLPMSRSFDTVTDGLKNCGKHDLAKRIEQLE 464

Query: 1536 ISFRGI 1553
            IS +GI
Sbjct: 465  ISVKGI 470


>ref|XP_002268375.1| PREDICTED: pentatricopeptide repeat-containing protein At5g18390,
            mitochondrial [Vitis vinifera]
            gi|296085168|emb|CBI28663.3| unnamed protein product
            [Vitis vinifera]
          Length = 454

 Score =  621 bits (1601), Expect = e-175
 Identities = 311/462 (67%), Positives = 365/462 (79%), Gaps = 1/462 (0%)
 Frame = +3

Query: 171  RPARYILIRITSPHLISPTNGMLKYLRTIMTTSDRVGNCKNTSNDDYFAAIHHISNIVRR 350
            RP  ++LIR       SP + +L +L+T+ T +  + N   +  DDYFA +HHIS IVRR
Sbjct: 6    RPIFHLLIRN------SPASPILTHLKTLTTITTHLQNTITSKKDDYFAVVHHISAIVRR 59

Query: 351  DIYMERTLNKMCISNIVNSELVYRVLRSCCRCGIESFRFFNWARTNHPNYDPXXXXXXXX 530
            D Y+ERTLNK+ IS  V S+LVYRVLRSC   G ES RFFNWAR+ H +Y P        
Sbjct: 60   DFYLERTLNKLPIS--VTSDLVYRVLRSCPNSGTESLRFFNWARS-HLSYQPTTLEYEEL 116

Query: 531  XXXXSRTGHWETMWKVAHQMKVQNXXXXXXXXXXXXEHYGKHGLIDQAVELFNRLKN-FN 707
                +RT  ++ MWK+AHQM+  +            E +GKHGL+DQAVE+FN+ K+  N
Sbjct: 117  LKTLARTKQFQPMWKIAHQMQTLSPTVVSSII----EEFGKHGLVDQAVEVFNKAKSALN 172

Query: 708  CPQTTEVYNSLLFALCEVKNFQGAYALIRRMLRKDAVPDKRTYAILVNGWCSAGKMREAQ 887
            CPQT EVYNSLLFALCEVK F GAYALIRRM+RK   P+K+TY++LVNGWC+AGKM+EAQ
Sbjct: 173  CPQTIEVYNSLLFALCEVKYFHGAYALIRRMIRKGVTPNKQTYSVLVNGWCAAGKMKEAQ 232

Query: 888  NFLEEMSRKGFNPPVRGRDLLIDGLLNAGYLEAAKGLVRKMTKEGFVPDVGTFNSLAEAI 1067
            +FLEEMSRKGFNPPVRGRDLL+DGLLNAGYLEAAK +VRKMTKEG  PDV T NS+ EAI
Sbjct: 233  DFLEEMSRKGFNPPVRGRDLLVDGLLNAGYLEAAKEMVRKMTKEGCAPDVETLNSMLEAI 292

Query: 1068 CKSGEIDFCIDLFNDVCRLGLSPDVDSYKIMVTAASKVGRIDEAFHILHRSIEDGLRPFP 1247
            CK+GE +FCID++NDVCRLG+SP+V +YKIM+ AA K GRIDEAF ILHRSIEDG RPFP
Sbjct: 293  CKAGEAEFCIDIYNDVCRLGVSPNVGTYKIMIPAACKEGRIDEAFRILHRSIEDGHRPFP 352

Query: 1248 SLYAPILKALCRKGQFDDAFSFFSDMKVKGHPPNRPVYTMLIKMCGRGGRFLEAANYLVE 1427
            SLYAPI+KALCR GQFDDAF FFSDMKVKGHPPNRPVYTMLI MCGRGGRF++AANYLVE
Sbjct: 353  SLYAPIIKALCRNGQFDDAFCFFSDMKVKGHPPNRPVYTMLITMCGRGGRFVDAANYLVE 412

Query: 1428 MTELNLMPLSRNFDMVTDGLKACGKLDLAKRIEQLEISFRGI 1553
            MTELNL P+SR FDMVTDGLK CGK DLA++IEQLE+S RG+
Sbjct: 413  MTELNLTPISRCFDMVTDGLKNCGKHDLARKIEQLEVSLRGV 454


>gb|EPS58676.1| hypothetical protein M569_16136, partial [Genlisea aurea]
          Length = 419

 Score =  615 bits (1587), Expect = e-173
 Identities = 298/419 (71%), Positives = 342/419 (81%), Gaps = 2/419 (0%)
 Frame = +3

Query: 303  DDYFAAIHHISNIVRRDIYMERTLNKMCISNIVNSELVYRVLRSCCRCGIESFRFFNWAR 482
            DDYFA IHHISNIVRRDIY+ERTL KM IS++VNSELVYRV+ +C   GIESFRFFNWAR
Sbjct: 1    DDYFATIHHISNIVRRDIYLERTLMKMNISSLVNSELVYRVINNCSSSGIESFRFFNWAR 60

Query: 483  TNHPNYDPXXXXXXXXXXXXSRTGHWETMWKVAHQMKVQNXXXXXXXXXXXXEHYGKHGL 662
            + HPNY+P            ++T HWETMWKV H MK Q             E Y KHGL
Sbjct: 61   SCHPNYEPTTLEFEALLKVLAQTKHWETMWKVVHTMKSQQSPISPGIMSFIIEQYAKHGL 120

Query: 663  IDQAVELFNRLKNFNCPQTTEVYNSLLFALCEVKNFQGAYALIRRMLRKDAVPDKRTYAI 842
            ID+AVELFN LKN +CPQT EVYNSLLFALCEVKNFQGAYAL+RRM+RK  VPDKRTY+I
Sbjct: 121  IDKAVELFNGLKNLDCPQTIEVYNSLLFALCEVKNFQGAYALVRRMIRKGNVPDKRTYSI 180

Query: 843  LVNGWCSAGKMREAQNFLEEMSRKGFNPPVRGRDLLIDGLLNAGYLEAAKGLVRKMTKEG 1022
            LVN WC AGK+ EAQ FLEEMS+KG+NPPVRGRDLLIDGLLNAGYLE AKGLVRKMTK G
Sbjct: 181  LVNAWCRAGKLIEAQEFLEEMSKKGYNPPVRGRDLLIDGLLNAGYLECAKGLVRKMTKIG 240

Query: 1023 FVPDVGTFNSLAEAICKSGEIDFCIDLFNDVCRLGLSPDVDSYKIMVTAASKVGRIDEAF 1202
             +PD+ TFNSLAEA+CK+GE D CI  F+D+C LG  P+ D+YKIM+TAAS+ GRIDEA 
Sbjct: 241  SIPDIATFNSLAEALCKNGETDACIASFDDICELGFCPNSDTYKIMITAASRDGRIDEAV 300

Query: 1203 HILHRSIEDGLRPFPSLYAPILKALCRKGQFDDAFSFFSDMKVKGHPPNRPVYTMLIKMC 1382
             ++ RSIE+G RPFPSLYAPILK   R+GQFDDAFSFFS+MKVKGH PNRP+YTML+K+C
Sbjct: 301  RMIQRSIEEGQRPFPSLYAPILKGFIRRGQFDDAFSFFSEMKVKGHVPNRPIYTMLVKLC 360

Query: 1383 GRGGRFLEAANYLVEMTELNLMPLSRNFDMVTDGLKACGKLDLAKRIEQLE--ISFRGI 1553
             RGGRF+EAANYLVEM ELNL+P+S++FDMV DGLK CGK DLA+RI+++E  IS RGI
Sbjct: 361  VRGGRFVEAANYLVEMIELNLLPMSKSFDMVCDGLKKCGKYDLAQRIQRMEMDISLRGI 419


>gb|EXB93122.1| hypothetical protein L484_024459 [Morus notabilis]
          Length = 470

 Score =  597 bits (1539), Expect = e-168
 Identities = 293/446 (65%), Positives = 354/446 (79%), Gaps = 7/446 (1%)
 Frame = +3

Query: 237  LKYLRTIMTTSDRVGNCKNT-----SNDDYFAAIHHISNIVRRDIYMERTLNKMCISNIV 401
            L+Y +T+ +T+    + +N+     S D+YFAAIHHISNIV+RD YMERTLNK+ I+  V
Sbjct: 26   LRYFQTLPSTAVNGDHYQNSTKPSSSKDNYFAAIHHISNIVQRDFYMERTLNKLRIA-AV 84

Query: 402  NSELVYRVLRSCCRCGIESFRFFNWARTNHPNYDPXXXXXXXXXXXXSRTGHWETMWKVA 581
            +S+LV+RVLR+C + G ES RFFNWAR++ P+Y P            +RT  +E+MWK+ 
Sbjct: 85   DSDLVFRVLRACHKFGPESLRFFNWARSHQPSYRPTSVELEELAKNLARTKKYESMWKIL 144

Query: 582  HQMKVQNXXXXXXXXXXXX-EHYGKHGLIDQAVELFNRL-KNFNCPQTTEVYNSLLFALC 755
             QMK  N             E YGK GL+DQA E+FNR+ K FNC QT EVYNSLLFALC
Sbjct: 145  QQMKTNNNLIISSETLCFIIEEYGKQGLVDQAAEVFNRVPKIFNCSQTVEVYNSLLFALC 204

Query: 756  EVKNFQGAYALIRRMLRKDAVPDKRTYAILVNGWCSAGKMREAQNFLEEMSRKGFNPPVR 935
            EVK F GAYAL+RRM+RK+ VPDKRTY+ILVN WCSAGKMREAQNFL EMS+KGFNPPVR
Sbjct: 205  EVKLFHGAYALVRRMIRKEVVPDKRTYSILVNAWCSAGKMREAQNFLSEMSKKGFNPPVR 264

Query: 936  GRDLLIDGLLNAGYLEAAKGLVRKMTKEGFVPDVGTFNSLAEAICKSGEIDFCIDLFNDV 1115
            GRDLLI+GLLNAGY+E+AK +VRKM KEGF+PDV TFNSL E ICKS E++FCIDL++ V
Sbjct: 265  GRDLLIEGLLNAGYIESAKEMVRKMVKEGFLPDVSTFNSLVEVICKSEEVEFCIDLYHQV 324

Query: 1116 CRLGLSPDVDSYKIMVTAASKVGRIDEAFHILHRSIEDGLRPFPSLYAPILKALCRKGQF 1295
            C LGL PD+++YK+++ A SK G+IDEAF +LH SIEDG +PFPSLYAPI+K +CRKGQF
Sbjct: 325  CGLGLCPDINTYKVLIPAVSKAGQIDEAFRLLHSSIEDGHKPFPSLYAPIIKGMCRKGQF 384

Query: 1296 DDAFSFFSDMKVKGHPPNRPVYTMLIKMCGRGGRFLEAANYLVEMTELNLMPLSRNFDMV 1475
            DDA  FF +MKVKGHPPNRPVYTMLI MCGRGGRF++AANYLVEMTE+ L P+SR FD+V
Sbjct: 385  DDALCFFGEMKVKGHPPNRPVYTMLITMCGRGGRFVDAANYLVEMTEIGLTPISRCFDLV 444

Query: 1476 TDGLKACGKLDLAKRIEQLEISFRGI 1553
            TDGLK CGK DLA+RIEQLE+S RG+
Sbjct: 445  TDGLKNCGKHDLARRIEQLEVSARGM 470


>ref|XP_004142520.1| PREDICTED: pentatricopeptide repeat-containing protein At5g18390,
            mitochondrial-like [Cucumis sativus]
            gi|449518358|ref|XP_004166209.1| PREDICTED:
            pentatricopeptide repeat-containing protein At5g18390,
            mitochondrial-like [Cucumis sativus]
          Length = 455

 Score =  595 bits (1533), Expect = e-167
 Identities = 289/436 (66%), Positives = 341/436 (78%), Gaps = 1/436 (0%)
 Frame = +3

Query: 249  RTIMTTSDRVGNCKNTSNDDYFAAIHHISNIVRRDIYMERTLNKMCISNIVNSELVYRVL 428
            R    ++  V      S DDYFAAIHHIS+IVRRD YMERTLNK+ ISN+ NSELV+RVL
Sbjct: 21   RHFSNSNSLVNGSTAPSKDDYFAAIHHISHIVRRDFYMERTLNKLRISNL-NSELVFRVL 79

Query: 429  RSCCRCGIESFRFFNWARTNHPNYDPXXXXXXXXXXXXSRTGHWETMWKVAHQMKVQNXX 608
            R+C   G ESFRFFNWA +++P+Y P            +RT  + TMWKV  QMK QN  
Sbjct: 80   RACSNSGTESFRFFNWACSHNPSYQPTTLEFEELVKTLARTRKYTTMWKVLLQMKTQNLK 139

Query: 609  XXXXXXXXXXEHYGKHGLIDQAVELFNRL-KNFNCPQTTEVYNSLLFALCEVKNFQGAYA 785
                      + YGK GL+D AV +FN+  K+ +CPQT EVYN+LLFALCEVK F GAYA
Sbjct: 140  ISPETISFIIQEYGKQGLVDNAVTIFNQCSKSIDCPQTVEVYNALLFALCEVKMFHGAYA 199

Query: 786  LIRRMLRKDAVPDKRTYAILVNGWCSAGKMREAQNFLEEMSRKGFNPPVRGRDLLIDGLL 965
            LIRRM+RK   PDK+TY  LV GWCSAGKM+EAQ FLEEMS+KGFNPP+RGRDLL++GLL
Sbjct: 200  LIRRMIRKGVTPDKKTYGTLVTGWCSAGKMKEAQEFLEEMSQKGFNPPLRGRDLLVEGLL 259

Query: 966  NAGYLEAAKGLVRKMTKEGFVPDVGTFNSLAEAICKSGEIDFCIDLFNDVCRLGLSPDVD 1145
            NAGYLE+AK +VRKMTKEG VPD+GTFNSL + IC SGE+DFCI++F++VC+LGL PD++
Sbjct: 260  NAGYLESAKDMVRKMTKEGSVPDIGTFNSLIDVICNSGEVDFCINIFHEVCKLGLCPDIN 319

Query: 1146 SYKIMVTAASKVGRIDEAFHILHRSIEDGLRPFPSLYAPILKALCRKGQFDDAFSFFSDM 1325
            +YKI++ A SKVGRIDEAF +LH  IEDG  PFPSLY PILK +C++GQFDDAF FF DM
Sbjct: 320  TYKILIPATSKVGRIDEAFRLLHCCIEDGHVPFPSLYGPILKGMCKRGQFDDAFCFFGDM 379

Query: 1326 KVKGHPPNRPVYTMLIKMCGRGGRFLEAANYLVEMTELNLMPLSRNFDMVTDGLKACGKL 1505
            K KGHPPNRPVYTMLI MCGRGGRF++AANYL+EM EL L P+SR FDMVTDGLK CGK 
Sbjct: 380  KHKGHPPNRPVYTMLITMCGRGGRFVDAANYLMEMAELGLPPISRCFDMVTDGLKNCGKH 439

Query: 1506 DLAKRIEQLEISFRGI 1553
            DLAK+IEQLE+S RGI
Sbjct: 440  DLAKKIEQLEVSIRGI 455


>gb|EOY10792.1| Pentatricopeptide repeat (PPR) superfamily protein [Theobroma cacao]
          Length = 455

 Score =  593 bits (1528), Expect = e-166
 Identities = 291/419 (69%), Positives = 335/419 (79%)
 Frame = +3

Query: 297  SNDDYFAAIHHISNIVRRDIYMERTLNKMCISNIVNSELVYRVLRSCCRCGIESFRFFNW 476
            S DDYFAAIHHISN VRR+++ ERTLN+M IS  VNSELV+RVLRSC     ES RFF+W
Sbjct: 42   SKDDYFAAIHHISNTVRREVHPERTLNRMNIS--VNSELVFRVLRSCSNSPTESLRFFSW 99

Query: 477  ARTNHPNYDPXXXXXXXXXXXXSRTGHWETMWKVAHQMKVQNXXXXXXXXXXXXEHYGKH 656
            AR +   Y P             R   +E+MWK   QM+ QN            E YGK+
Sbjct: 100  ARAH---YVPTSVEFEELVKILIRHRKYESMWKTIQQMQKQNLSLSCDTLSFIIEEYGKN 156

Query: 657  GLIDQAVELFNRLKNFNCPQTTEVYNSLLFALCEVKNFQGAYALIRRMLRKDAVPDKRTY 836
            GL+DQAVE+FN+  +  C QT  VYNSLLFALCEVK F GAYALIRRM+RK  VPDKRTY
Sbjct: 157  GLVDQAVEVFNKSTSLGCKQTVSVYNSLLFALCEVKMFHGAYALIRRMIRKGEVPDKRTY 216

Query: 837  AILVNGWCSAGKMREAQNFLEEMSRKGFNPPVRGRDLLIDGLLNAGYLEAAKGLVRKMTK 1016
            AILVNGWCS GKMREAQ FLEEMS+ GFNPPVRGRDLL++GLLNAGYLE+AK +VR+MTK
Sbjct: 217  AILVNGWCSGGKMREAQEFLEEMSKMGFNPPVRGRDLLVEGLLNAGYLESAKEMVRRMTK 276

Query: 1017 EGFVPDVGTFNSLAEAICKSGEIDFCIDLFNDVCRLGLSPDVDSYKIMVTAASKVGRIDE 1196
            EGFVPD+GTFNSL E IC SGE+DFCI++++ VC+LGL PD+++YKI++ AASKVGRIDE
Sbjct: 277  EGFVPDIGTFNSLVETICSSGEVDFCINMYHSVCKLGLCPDINTYKILIPAASKVGRIDE 336

Query: 1197 AFHILHRSIEDGLRPFPSLYAPILKALCRKGQFDDAFSFFSDMKVKGHPPNRPVYTMLIK 1376
            AF +L+ S+EDG RPFPSLYAPI+KA+CRKGQFDDAFSFF +MKVKGH PNRPVYTMLI 
Sbjct: 337  AFRLLNNSVEDGYRPFPSLYAPIIKAMCRKGQFDDAFSFFGEMKVKGHSPNRPVYTMLIT 396

Query: 1377 MCGRGGRFLEAANYLVEMTELNLMPLSRNFDMVTDGLKACGKLDLAKRIEQLEISFRGI 1553
            MCGRGGRF+EAANYLVEMTEL L P+SR FDMV DGLK CGK DLAKRIEQLE+S RG+
Sbjct: 397  MCGRGGRFVEAANYLVEMTELGLAPISRCFDMVIDGLKNCGKHDLAKRIEQLEVSLRGV 455


>gb|EMJ07970.1| hypothetical protein PRUPE_ppa025361mg [Prunus persica]
          Length = 460

 Score =  584 bits (1506), Expect = e-164
 Identities = 290/437 (66%), Positives = 340/437 (77%), Gaps = 2/437 (0%)
 Frame = +3

Query: 246  LRTIMTTSDRVGNCKNTSNDDYFAAIHHISNIVRRDIYMERTLNKMCISNIVNSELVYRV 425
            LR + T +    N    + DDYF+AI HI+NIVRRD +MERTLNK+ I+  V+SELVYRV
Sbjct: 25   LRHLATVNAAPQNRVVPTKDDYFSAIQHITNIVRRDHFMERTLNKLRIT--VDSELVYRV 82

Query: 426  LRSCCRCGIESFRFFNWARTNHPNYDPXXXXXXXXXXXXSRTGHWETMWKVAHQMKVQNX 605
            LR+C   G ES RFFNWART+HP Y P            +RT  +E+MWK+   M+  + 
Sbjct: 83   LRACSAAGTESLRFFNWARTHHPTYHPTTLELEELVKTLARTKKYESMWKLLQSMQTHHG 142

Query: 606  XXXXXXXXXXX-EHYGKHGLIDQAVELFNRL-KNFNCPQTTEVYNSLLFALCEVKNFQGA 779
                        E YG HGL+DQAVELFNR  K FNC QT EVYN+LLF+LC+ K F  A
Sbjct: 143  LTLSQESLCFVIEEYGNHGLVDQAVELFNRAPKTFNCLQTVEVYNALLFSLCQAKLFHAA 202

Query: 780  YALIRRMLRKDAVPDKRTYAILVNGWCSAGKMREAQNFLEEMSRKGFNPPVRGRDLLIDG 959
            YAL+RRM+RK  VPDKRTY+ILVN WCS GKMREAQ FLEEMS KGFNPPVRGRDLL++G
Sbjct: 203  YALVRRMIRKGLVPDKRTYSILVNAWCSNGKMREAQLFLEEMSSKGFNPPVRGRDLLVEG 262

Query: 960  LLNAGYLEAAKGLVRKMTKEGFVPDVGTFNSLAEAICKSGEIDFCIDLFNDVCRLGLSPD 1139
            LLNAGY+EAAK +VRKM KEGFVPDV TFNSL EAICK GE++FCIDL+ +   LGL PD
Sbjct: 263  LLNAGYIEAAKEMVRKMVKEGFVPDVSTFNSLMEAICKCGEVEFCIDLYWEANGLGLCPD 322

Query: 1140 VDSYKIMVTAASKVGRIDEAFHILHRSIEDGLRPFPSLYAPILKALCRKGQFDDAFSFFS 1319
            +++YK+++ A SKVGRID+AF +LH SIEDG RPFPSLYAPI+K +CR+GQFDDAF FFS
Sbjct: 323  INTYKVLIPAVSKVGRIDDAFRLLHNSIEDGHRPFPSLYAPIIKGMCRRGQFDDAFCFFS 382

Query: 1320 DMKVKGHPPNRPVYTMLIKMCGRGGRFLEAANYLVEMTELNLMPLSRNFDMVTDGLKACG 1499
            +MKVKGHPPNRPVYTMLI M GRGGRF+EAANYLVEMTE+ LMP+SR FD+VTDGLK CG
Sbjct: 383  EMKVKGHPPNRPVYTMLITMSGRGGRFVEAANYLVEMTEMGLMPISRCFDLVTDGLKNCG 442

Query: 1500 KLDLAKRIEQLEISFRG 1550
            K D+AKRIEQLE+S RG
Sbjct: 443  KHDMAKRIEQLEVSLRG 459


>ref|XP_004296694.1| PREDICTED: pentatricopeptide repeat-containing protein At5g18390,
            mitochondrial-like [Fragaria vesca subsp. vesca]
          Length = 444

 Score =  559 bits (1440), Expect = e-156
 Identities = 272/421 (64%), Positives = 330/421 (78%), Gaps = 2/421 (0%)
 Frame = +3

Query: 294  TSNDDYFAAIHHISNIVRRDIYMERTLNKMCISNIVNSELVYRVLRSCCRCGIESFRFFN 473
            T+ DDYF+AIHHI+NIVRRD +MERTLNK+ I   ++S+LV+RVLR+      ES RFFN
Sbjct: 25   TTKDDYFSAIHHITNIVRRDHFMERTLNKLRIP--IDSDLVFRVLRASSSSPTESLRFFN 82

Query: 474  WARTNHPNYDPXXXXXXXXXXXXSRTGHWETMWKVAHQMKVQNXXXXXXXXXXXX-EHYG 650
            WART+HP+Y P            +R+  +E+MWK+   MK  +             + YG
Sbjct: 83   WARTHHPSYHPTSLETEELVKTLARSKKYESMWKILDSMKTHHALTLSESTLCFIIQEYG 142

Query: 651  KHGLIDQAVELFNRLKN-FNCPQTTEVYNSLLFALCEVKNFQGAYALIRRMLRKDAVPDK 827
            KH LIDQAVELFNR  N FNC Q+ +VYN+LLF+LCE K F GAYAL+RR++RK  VP+K
Sbjct: 143  KHALIDQAVELFNRAPNTFNCLQSVQVYNALLFSLCETKLFHGAYALVRRLIRKGMVPNK 202

Query: 828  RTYAILVNGWCSAGKMREAQNFLEEMSRKGFNPPVRGRDLLIDGLLNAGYLEAAKGLVRK 1007
             TY+ILVN WCS GKM+EAQ FLEEMS KGFNPPVRGRDLL++GLLNAGY+E AK +VRK
Sbjct: 203  MTYSILVNAWCSNGKMKEAQLFLEEMSEKGFNPPVRGRDLLVEGLLNAGYIEGAKDMVRK 262

Query: 1008 MTKEGFVPDVGTFNSLAEAICKSGEIDFCIDLFNDVCRLGLSPDVDSYKIMVTAASKVGR 1187
            M KE  VP+V TFN+L EAICKSGE++FCI L+ +   LGL PD+++YK+M+ A SK+GR
Sbjct: 263  MVKENCVPEVSTFNALLEAICKSGEVEFCIALYWEATGLGLCPDINTYKVMIPAVSKIGR 322

Query: 1188 IDEAFHILHRSIEDGLRPFPSLYAPILKALCRKGQFDDAFSFFSDMKVKGHPPNRPVYTM 1367
            +D+AF +LH SIEDG RPFPSLYAPI+K +CRKGQFDDAF FFS+MKVKGHPPNRPVYTM
Sbjct: 323  MDDAFRLLHNSIEDGHRPFPSLYAPIVKGMCRKGQFDDAFCFFSEMKVKGHPPNRPVYTM 382

Query: 1368 LIKMCGRGGRFLEAANYLVEMTELNLMPLSRNFDMVTDGLKACGKLDLAKRIEQLEISFR 1547
            LI M GRGGRF+EAANYL+EMTE+ L+P+SR FD VTDGLK CGK DLAKRIEQ+E+S R
Sbjct: 383  LITMAGRGGRFVEAANYLIEMTEVGLVPISRCFDFVTDGLKNCGKHDLAKRIEQIEVSLR 442

Query: 1548 G 1550
            G
Sbjct: 443  G 443


>ref|XP_006830661.1| hypothetical protein AMTR_s00210p00017530 [Amborella trichopoda]
            gi|548837251|gb|ERM98077.1| hypothetical protein
            AMTR_s00210p00017530 [Amborella trichopoda]
          Length = 459

 Score =  556 bits (1434), Expect = e-156
 Identities = 282/456 (61%), Positives = 340/456 (74%), Gaps = 1/456 (0%)
 Frame = +3

Query: 186  ILIRITSPHLISPTNGMLKYLRTIMTTSDRVGNCK-NTSNDDYFAAIHHISNIVRRDIYM 362
            +L+R+   H  +P    L+ +RT    S        + S DDYFA +HHISNIVRRD ++
Sbjct: 6    LLLRLHLNHNRAPFLLFLRAMRTTPLPSKPDHEVTIHGSKDDYFAVVHHISNIVRRDYFL 65

Query: 363  ERTLNKMCISNIVNSELVYRVLRSCCRCGIESFRFFNWARTNHPNYDPXXXXXXXXXXXX 542
            ERTL K+ ++  +  ELVYRVLRSC + GIESFRFFNWART H +Y P            
Sbjct: 66   ERTLQKLNLT--LTPELVYRVLRSCNKNGIESFRFFNWART-HASYHPTTIEFEELIKTL 122

Query: 543  SRTGHWETMWKVAHQMKVQNXXXXXXXXXXXXEHYGKHGLIDQAVELFNRLKNFNCPQTT 722
             +T +WETMWKVA  MK+              + YGK GL+D+AVE+FNR+K+F+CPQTT
Sbjct: 123  GQTKNWETMWKVADHMKILGFPLSPETFSAVMDSYGKAGLLDRAVEVFNRMKHFDCPQTT 182

Query: 723  EVYNSLLFALCEVKNFQGAYALIRRMLRKDAVPDKRTYAILVNGWCSAGKMREAQNFLEE 902
             VYNSLL ALC VKNFQGAYALIRRM+RK   PDK+TYAILVNGWCS+GK+ EA+ FLEE
Sbjct: 183  GVYNSLLSALCMVKNFQGAYALIRRMIRKGGHPDKQTYAILVNGWCSSGKLGEAREFLEE 242

Query: 903  MSRKGFNPPVRGRDLLIDGLLNAGYLEAAKGLVRKMTKEGFVPDVGTFNSLAEAICKSGE 1082
            MS+KGFNPPVRGRDLLIDGLLNAGYLE+AK LV+KMTKEGF+PD+ TFNSL EA+C SGE
Sbjct: 243  MSKKGFNPPVRGRDLLIDGLLNAGYLESAKELVKKMTKEGFLPDISTFNSLLEALCNSGE 302

Query: 1083 IDFCIDLFNDVCRLGLSPDVDSYKIMVTAASKVGRIDEAFHILHRSIEDGLRPFPSLYAP 1262
             +FCI+L   V  L L  D+ +YKI++ A SK G+IDEAF +LH SIEDG +PFPSLYAP
Sbjct: 303  TEFCIELLRVVTELSLVLDIGTYKILIPAVSKSGQIDEAFRLLHASIEDGHKPFPSLYAP 362

Query: 1263 ILKALCRKGQFDDAFSFFSDMKVKGHPPNRPVYTMLIKMCGRGGRFLEAANYLVEMTELN 1442
            +LK LC++GQF DAFS F+DMK +GH PNRPVYTML++MC RGGR ++AANYLVEM E  
Sbjct: 363  LLKVLCKRGQFGDAFSLFADMKAEGHAPNRPVYTMLMRMCCRGGRCVDAANYLVEMVERG 422

Query: 1443 LMPLSRNFDMVTDGLKACGKLDLAKRIEQLEISFRG 1550
            L P S +FDMV DGLK  GK DLAKRI+ +EIS RG
Sbjct: 423  LAPRSESFDMVIDGLKNAGKHDLAKRIDHMEISLRG 458


>ref|XP_004504788.1| PREDICTED: pentatricopeptide repeat-containing protein At5g18390,
            mitochondrial-like isoform X1 [Cicer arietinum]
            gi|502142093|ref|XP_004504789.1| PREDICTED:
            pentatricopeptide repeat-containing protein At5g18390,
            mitochondrial-like isoform X2 [Cicer arietinum]
          Length = 453

 Score =  548 bits (1413), Expect = e-153
 Identities = 271/451 (60%), Positives = 336/451 (74%), Gaps = 5/451 (1%)
 Frame = +3

Query: 216  ISPTNGMLKYLRTIMTTSDRVGNCKNTSNDDYFAAIHHISNIVRRDIYMERTLNKMCISN 395
            +S    +L + +T+ T +        TS D+YFAAI H++NIVRRD Y+ERTLNK+ I+ 
Sbjct: 13   LSKPKPLLHHHKTLTTAT--------TSKDEYFAAIQHVANIVRRDFYLERTLNKLRIT- 63

Query: 396  IVNSELVYRVLRSCCRCGIESFRFFNWARTNH--PNYDPXXXXXXXXXXXXSRTGHWETM 569
             +  ELV+RVLR+C     ES RFFNWAR++H  P Y P            +   +++TM
Sbjct: 64   -ITPELVFRVLRACSSSPTESLRFFNWARSHHHHPPYTPTSVEFEQIVTILANANNYQTM 122

Query: 570  WKVAHQMKVQ-NXXXXXXXXXXXXEHYGKHGLIDQAVELFNRLKNFNCPQTTEVYNSLLF 746
            W + HQM    N            E YG+H  IDQ+V+LFN+ K FNCPQ   +YNSLLF
Sbjct: 123  WSIIHQMTHNHNLSLSPSAVSSLIESYGRHRHIDQSVQLFNKCKVFNCPQNLNLYNSLLF 182

Query: 747  ALCEVKNFQGAYALIRRMLRKDAVPDKRTYAILVNGWCSAGKMREAQNFLEEMSRKGFNP 926
            ALCE K F  AYALIRRM+RK   PDKRTYA+LVN WCS GKMREAQ FL+EMS KGF P
Sbjct: 183  ALCESKLFHAAYALIRRMIRKGINPDKRTYALLVNAWCSTGKMREAQQFLKEMSDKGFTP 242

Query: 927  PVRGRDLLIDGLLNAGYLEAAKGLVRKMTKEGFVPDVGTFNSLAEAICKSG--EIDFCID 1100
            PVRGRDLLI+GLLNAGY+E+AKG+VRKM KEG +PDVGTFN+L E+ICK G  EI FCID
Sbjct: 243  PVRGRDLLIEGLLNAGYIESAKGMVRKMVKEGIIPDVGTFNALMESICKCGDDEIKFCID 302

Query: 1101 LFNDVCRLGLSPDVDSYKIMVTAASKVGRIDEAFHILHRSIEDGLRPFPSLYAPILKALC 1280
            L++++C LG+ PDV++YKI+V A SK+G +DEAF +L+   E+G RPFPSLYAP++K L 
Sbjct: 303  LYHELCSLGMVPDVNTYKILVPAVSKIGLMDEAFKLLNNFTEEGNRPFPSLYAPVMKGLF 362

Query: 1281 RKGQFDDAFSFFSDMKVKGHPPNRPVYTMLIKMCGRGGRFLEAANYLVEMTELNLMPLSR 1460
            ++GQFDDAF FF+DMKVKGHPPNRP+YTMLI MCGRGGRF++AANYL EMTE+  +P+SR
Sbjct: 363  KRGQFDDAFCFFADMKVKGHPPNRPLYTMLITMCGRGGRFVDAANYLFEMTEIGFVPISR 422

Query: 1461 NFDMVTDGLKACGKLDLAKRIEQLEISFRGI 1553
             FDMVTDGLK CGK DLAKR++QLE+S RG+
Sbjct: 423  CFDMVTDGLKNCGKHDLAKRVQQLEVSIRGV 453


>ref|XP_002522032.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223538836|gb|EEF40436.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 451

 Score =  548 bits (1412), Expect = e-153
 Identities = 275/459 (59%), Positives = 339/459 (73%), Gaps = 2/459 (0%)
 Frame = +3

Query: 171  RPARYILIRITSPHLISPTNGMLKYLRTIMTTSDRVGNCKNTSNDDYFAAIHHISNIVRR 350
            R   +IL + T+   + P    L++L+ + +TS       N + D YFA IHHI+NIVRR
Sbjct: 4    RTKLFILPKTTATVTLQP----LRHLKVLASTST------NNTKDAYFALIHHITNIVRR 53

Query: 351  DIYMERTLNKMCISNIVNSELVYRVLRSCCRCGIESFRFFNWARTNHPNYDPXXXXXXXX 530
            D Y ERTLNK+  +  V SELV+RVLR+C R   ES RFFNW+R     Y P        
Sbjct: 54   DFYPERTLNKL--NAPVTSELVFRVLRACSRSPTESLRFFNWSRAY---YTPTSIEYEEL 108

Query: 531  XXXXSRTGHWETMWKVAHQMKVQNXXXXXXXXXXXX--EHYGKHGLIDQAVELFNRLKNF 704
                +++  + +MWK+  QMK QN              E YG+ GLIDQAVE+FN+  + 
Sbjct: 109  IKILAKSKRYSSMWKLITQMKDQNPQFSISSETVRSIIEEYGRSGLIDQAVEVFNQCNSL 168

Query: 705  NCPQTTEVYNSLLFALCEVKNFQGAYALIRRMLRKDAVPDKRTYAILVNGWCSAGKMREA 884
            NC Q  ++YNSLLFALCEVK F GAYAL+RR++RK   P+K TY++LVNGWCS GK +EA
Sbjct: 169  NCEQNVDIYNSLLFALCEVKLFHGAYALVRRLIRKGLAPNKTTYSVLVNGWCSNGKFKEA 228

Query: 885  QNFLEEMSRKGFNPPVRGRDLLIDGLLNAGYLEAAKGLVRKMTKEGFVPDVGTFNSLAEA 1064
            Q FLEEMS+KGFNPPVRGRDLLI+GLLNAGY E+AK +V KM+KEGFVPDV TFN L EA
Sbjct: 229  QLFLEEMSKKGFNPPVRGRDLLIEGLLNAGYFESAKEMVFKMSKEGFVPDVNTFNCLIEA 288

Query: 1065 ICKSGEIDFCIDLFNDVCRLGLSPDVDSYKIMVTAASKVGRIDEAFHILHRSIEDGLRPF 1244
            IC SGE+DFC+D++  + +LG  PD++SYKI++ A SKVG+IDEAF +L+ SIEDG +PF
Sbjct: 289  ICNSGEVDFCVDMYYSLRKLGFCPDINSYKILIPAVSKVGKIDEAFKLLNNSIEDGHKPF 348

Query: 1245 PSLYAPILKALCRKGQFDDAFSFFSDMKVKGHPPNRPVYTMLIKMCGRGGRFLEAANYLV 1424
            P LYAPI+K +CR+GQFDDAF FF +MKVKGHPPNRPVYTMLI MCGRGG+++EAANYLV
Sbjct: 349  PGLYAPIIKGMCRRGQFDDAFCFFGEMKVKGHPPNRPVYTMLITMCGRGGKYVEAANYLV 408

Query: 1425 EMTELNLMPLSRNFDMVTDGLKACGKLDLAKRIEQLEIS 1541
            EMTE+ L P+SR FDMVTDGLK CGK DLAKRIEQLE+S
Sbjct: 409  EMTEMGLTPISRCFDMVTDGLKNCGKHDLAKRIEQLEVS 447


>ref|XP_006479008.1| PREDICTED: pentatricopeptide repeat-containing protein At5g18390,
            mitochondrial-like [Citrus sinensis]
          Length = 445

 Score =  541 bits (1393), Expect = e-151
 Identities = 266/421 (63%), Positives = 318/421 (75%), Gaps = 1/421 (0%)
 Frame = +3

Query: 294  TSNDDYFAAIHHISNIVRRDIYMERTLNKMCISNIVNSELVYRVLRSCCRCG-IESFRFF 470
            TS DDYFAA++HI+NIVR DIY ERTLN++ ++  + SELVYRVLR C      ES RFF
Sbjct: 28   TSKDDYFAAVNHIANIVRHDIYPERTLNRLNLT--LTSELVYRVLRVCHTTSPSESLRFF 85

Query: 471  NWARTNHPNYDPXXXXXXXXXXXXSRTGHWETMWKVAHQMKVQNXXXXXXXXXXXXEHYG 650
             WAR+  P Y P            +    +++MWK    MK  N            E +G
Sbjct: 86   TWARSQ-PQYSPTSLEFEPLILTLAHHKRYQSMWKTIELMKPYNLSVSPQTLSLIIEEFG 144

Query: 651  KHGLIDQAVELFNRLKNFNCPQTTEVYNSLLFALCEVKNFQGAYALIRRMLRKDAVPDKR 830
            KHGL+D AVE+FN+   FNC Q   +YNSLLFALCEVK F GAYALIRRM+RK  VPDKR
Sbjct: 145  KHGLVDNAVEVFNKCTAFNCQQCVLLYNSLLFALCEVKLFHGAYALIRRMIRKGFVPDKR 204

Query: 831  TYAILVNGWCSAGKMREAQNFLEEMSRKGFNPPVRGRDLLIDGLLNAGYLEAAKGLVRKM 1010
            TYAILVN WCS+ KMREAQ FL+EMS KGFNPPVRGRDLL+ GLLNAGYLE+AK +V KM
Sbjct: 205  TYAILVNAWCSSWKMREAQEFLQEMSDKGFNPPVRGRDLLVQGLLNAGYLESAKQMVNKM 264

Query: 1011 TKEGFVPDVGTFNSLAEAICKSGEIDFCIDLFNDVCRLGLSPDVDSYKIMVTAASKVGRI 1190
             K+G VPD+ TFNSL E ICKSGE++FC++++  VC+LGL  DV +YKI++ A SK G I
Sbjct: 265  IKQGSVPDLETFNSLIETICKSGEVEFCVEMYYSVCKLGLCADVSTYKILIPAVSKAGMI 324

Query: 1191 DEAFHILHRSIEDGLRPFPSLYAPILKALCRKGQFDDAFSFFSDMKVKGHPPNRPVYTML 1370
            DEAF +LH  +EDG +PFPSLYAPI+K + R+GQFDDAF FFS+MK+KGHPPNRPVYTML
Sbjct: 325  DEAFRLLHNLVEDGHKPFPSLYAPIIKGMFRRGQFDDAFCFFSEMKIKGHPPNRPVYTML 384

Query: 1371 IKMCGRGGRFLEAANYLVEMTELNLMPLSRNFDMVTDGLKACGKLDLAKRIEQLEISFRG 1550
            I MCGRGGRF+EAANYLVEMTE+ L P+SR FD+VTDGLK CGK DLA++IEQLE+S R 
Sbjct: 385  ITMCGRGGRFVEAANYLVEMTEMGLTPISRCFDLVTDGLKNCGKHDLAEKIEQLEVSLRS 444

Query: 1551 I 1553
            +
Sbjct: 445  V 445


>ref|XP_006400384.1| hypothetical protein EUTSA_v10013477mg [Eutrema salsugineum]
            gi|557101474|gb|ESQ41837.1| hypothetical protein
            EUTSA_v10013477mg [Eutrema salsugineum]
          Length = 461

 Score =  537 bits (1383), Expect = e-150
 Identities = 264/416 (63%), Positives = 321/416 (77%), Gaps = 1/416 (0%)
 Frame = +3

Query: 306  DYFAAIHHISNIVRRDIYMERTLNKMCISNIVNSELVYRVLRSCCRCGIESFRFFNWART 485
            DYFAAI+H+ NIVRR+++ ER+LN++ +   V SE V+RVLR+  R   +S RFFNWAR+
Sbjct: 48   DYFAAINHVVNIVRREVHPERSLNRLRLP--VTSEFVFRVLRATSRSANDSLRFFNWARS 105

Query: 486  NHPNYDPXXXXXXXXXXXXSRTGHWETMWKVAHQMKVQNXXXXXXXXXXXXEHYGKHGLI 665
            + PNY P            +    +E+MWKV  QMK  +            E YGK+G +
Sbjct: 106  S-PNYTPTSIEYEQLAKSLASHKKYESMWKVLKQMKDLSLDISGETLCFIIEQYGKNGHV 164

Query: 666  DQAVELFNRL-KNFNCPQTTEVYNSLLFALCEVKNFQGAYALIRRMLRKDAVPDKRTYAI 842
            DQAVELFN + K   C QT EVYNSLL ALCEVK F GAYALIRRM+RK   PDKRTY++
Sbjct: 165  DQAVELFNGVPKTLGCQQTVEVYNSLLHALCEVKMFHGAYALIRRMIRKGLKPDKRTYSV 224

Query: 843  LVNGWCSAGKMREAQNFLEEMSRKGFNPPVRGRDLLIDGLLNAGYLEAAKGLVRKMTKEG 1022
            LVNGWCSAGKM+EAQ FL+EMSRKGFNPP RGRDLLI+GLLNAGYLE+AK +V+KMTK G
Sbjct: 225  LVNGWCSAGKMKEAQEFLDEMSRKGFNPPARGRDLLIEGLLNAGYLESAKEMVKKMTKGG 284

Query: 1023 FVPDVGTFNSLAEAICKSGEIDFCIDLFNDVCRLGLSPDVDSYKIMVTAASKVGRIDEAF 1202
            FVPD+ TFN+L EAI KSGE+DFCI+++   C+LGL  D+D+YK ++ A SK+G+IDEAF
Sbjct: 285  FVPDIHTFNTLIEAISKSGEVDFCIEMYYTACKLGLCVDIDTYKTLIPAVSKIGKIDEAF 344

Query: 1203 HILHRSIEDGLRPFPSLYAPILKALCRKGQFDDAFSFFSDMKVKGHPPNRPVYTMLIKMC 1382
             +L+  +EDG +PFPSLYAPI+K +CR G FDDAFSFFSDMKVK HPPNRPVYTMLI MC
Sbjct: 345  RLLNNCVEDGHKPFPSLYAPIIKGMCRNGMFDDAFSFFSDMKVKAHPPNRPVYTMLITMC 404

Query: 1383 GRGGRFLEAANYLVEMTELNLMPLSRNFDMVTDGLKACGKLDLAKRIEQLEISFRG 1550
            GRGG+F++AANYLVEMTE+ L+P+SR FD+VTDGLK  GK DLA RIEQLE+  RG
Sbjct: 405  GRGGKFVDAANYLVEMTEMGLVPISRCFDIVTDGLKNSGKHDLAMRIEQLEVQLRG 460


>ref|XP_006287650.1| hypothetical protein CARUB_v10000860mg, partial [Capsella rubella]
            gi|482556356|gb|EOA20548.1| hypothetical protein
            CARUB_v10000860mg, partial [Capsella rubella]
          Length = 477

 Score =  535 bits (1379), Expect = e-149
 Identities = 264/421 (62%), Positives = 322/421 (76%), Gaps = 1/421 (0%)
 Frame = +3

Query: 294  TSNDDYFAAIHHISNIVRRDIYMERTLNKMCISNIVNSELVYRVLRSCCRCGIESFRFFN 473
            ++  DYFAAI+H+ NIVRR+I+ ER+LN + +   V SE V+RVLR+  R   +S RFFN
Sbjct: 60   STKGDYFAAINHVVNIVRREIHPERSLNSLRLP--VTSEFVFRVLRATSRSANDSLRFFN 117

Query: 474  WARTNHPNYDPXXXXXXXXXXXXSRTGHWETMWKVAHQMKVQNXXXXXXXXXXXXEHYGK 653
            WAR+N P+Y P            +    +E+MWK+  QMK  +            E YGK
Sbjct: 118  WARSN-PSYTPTSMEYEQLAKSLASHKKYESMWKILKQMKDLSLDISGETLCFIIEQYGK 176

Query: 654  HGLIDQAVELFNRL-KNFNCPQTTEVYNSLLFALCEVKNFQGAYALIRRMLRKDAVPDKR 830
            +G +DQAVELFN + K   C QT +VYNSLL ALC+VK F GAYALIRRM+RK   PDKR
Sbjct: 177  NGHVDQAVELFNGVPKTLGCQQTVDVYNSLLHALCDVKMFHGAYALIRRMIRKGLKPDKR 236

Query: 831  TYAILVNGWCSAGKMREAQNFLEEMSRKGFNPPVRGRDLLIDGLLNAGYLEAAKGLVRKM 1010
            TYAILVNGWCSAGKM+EAQ FL+EMSRKGFNPP RGRDLLI+GLLNAGYLE+AK +V KM
Sbjct: 237  TYAILVNGWCSAGKMKEAQEFLDEMSRKGFNPPARGRDLLIEGLLNAGYLESAKEMVSKM 296

Query: 1011 TKEGFVPDVGTFNSLAEAICKSGEIDFCIDLFNDVCRLGLSPDVDSYKIMVTAASKVGRI 1190
            TK GFVPD+ TFN+L EAI KSGE++FCI+++   C+LGL  D+D+YK ++ A SK+G+I
Sbjct: 297  TKGGFVPDIQTFNTLIEAISKSGEVEFCIEMYYTACKLGLCVDIDTYKTLIPAVSKIGKI 356

Query: 1191 DEAFHILHRSIEDGLRPFPSLYAPILKALCRKGQFDDAFSFFSDMKVKGHPPNRPVYTML 1370
            DEAF +L+  +EDG +PFPSLYAPI+K +CR G FDDAFSFFSDMKVK HPPNRPVYTML
Sbjct: 357  DEAFRLLNNCVEDGHKPFPSLYAPIVKGMCRNGMFDDAFSFFSDMKVKAHPPNRPVYTML 416

Query: 1371 IKMCGRGGRFLEAANYLVEMTELNLMPLSRNFDMVTDGLKACGKLDLAKRIEQLEISFRG 1550
            I MCGRGG+F++AANYLVEMTE+ L+P+SR FDMVTDGLK  GK DLA RIEQLE+  RG
Sbjct: 417  ITMCGRGGKFVDAANYLVEMTEMGLVPISRCFDMVTDGLKNSGKHDLAMRIEQLEVQLRG 476

Query: 1551 I 1553
            +
Sbjct: 477  V 477


>ref|XP_002871814.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297317651|gb|EFH48073.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 459

 Score =  533 bits (1373), Expect = e-149
 Identities = 263/417 (63%), Positives = 320/417 (76%), Gaps = 1/417 (0%)
 Frame = +3

Query: 306  DYFAAIHHISNIVRRDIYMERTLNKMCISNIVNSELVYRVLRSCCRCGIESFRFFNWART 485
            DYFAAI+H+ NIVRR+I+ ER+LN + +   V SE V+RVLR+  R   +S RFFNWAR+
Sbjct: 46   DYFAAINHVVNIVRREIHPERSLNSLRLP--VTSEFVFRVLRATSRSANDSLRFFNWARS 103

Query: 486  NHPNYDPXXXXXXXXXXXXSRTGHWETMWKVAHQMKVQNXXXXXXXXXXXXEHYGKHGLI 665
            N P+Y P            +    +E+MWK+  QMK  +            E YGK+G +
Sbjct: 104  N-PSYTPTSMEYEELAKSLASHKKYESMWKILKQMKDLSLDISGETLCFIIEQYGKNGHV 162

Query: 666  DQAVELFNRL-KNFNCPQTTEVYNSLLFALCEVKNFQGAYALIRRMLRKDAVPDKRTYAI 842
            DQAVELFN + K   C QT +VYN+LL ALC+VK F GAYALIRRM+RK   PDKRTYAI
Sbjct: 163  DQAVELFNGVPKTLGCQQTVDVYNALLHALCDVKMFHGAYALIRRMIRKGLKPDKRTYAI 222

Query: 843  LVNGWCSAGKMREAQNFLEEMSRKGFNPPVRGRDLLIDGLLNAGYLEAAKGLVRKMTKEG 1022
            LVNGWCSAGKM+EAQ FL+EMSRKGFNPP RGRDLLI+GLLNAGYLE+AK +V KMTK G
Sbjct: 223  LVNGWCSAGKMKEAQEFLDEMSRKGFNPPARGRDLLIEGLLNAGYLESAKEIVDKMTKGG 282

Query: 1023 FVPDVGTFNSLAEAICKSGEIDFCIDLFNDVCRLGLSPDVDSYKIMVTAASKVGRIDEAF 1202
            FVPD+ TFN+L EAI KSGE++FCI+++   C+LGL  D+D+YK ++ A SK+G+IDEAF
Sbjct: 283  FVPDILTFNTLIEAISKSGEVEFCIEMYYTACKLGLCVDIDTYKTLIPAVSKIGKIDEAF 342

Query: 1203 HILHRSIEDGLRPFPSLYAPILKALCRKGQFDDAFSFFSDMKVKGHPPNRPVYTMLIKMC 1382
             +L+  +EDG +PFPSLYAPI+K +CR G FDDAFSFFSDMKVK HPPNRPVYTMLI MC
Sbjct: 343  RLLNNCVEDGHKPFPSLYAPIIKGMCRNGMFDDAFSFFSDMKVKAHPPNRPVYTMLITMC 402

Query: 1383 GRGGRFLEAANYLVEMTELNLMPLSRNFDMVTDGLKACGKLDLAKRIEQLEISFRGI 1553
            GRGG+F++AANYLVEMTE+ L+P+SR FDMVTDGLK  GK DLA RIEQLE+  RG+
Sbjct: 403  GRGGKFVDAANYLVEMTEMGLVPISRCFDMVTDGLKNSGKHDLAMRIEQLEVQLRGV 459


>ref|NP_197340.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|223635760|sp|Q94JX6.2|PP391_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At5g18390, mitochondrial; Flags: Precursor
            gi|332005166|gb|AED92549.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 459

 Score =  532 bits (1371), Expect = e-148
 Identities = 263/417 (63%), Positives = 319/417 (76%), Gaps = 1/417 (0%)
 Frame = +3

Query: 306  DYFAAIHHISNIVRRDIYMERTLNKMCISNIVNSELVYRVLRSCCRCGIESFRFFNWART 485
            DYFAAI+H+ NIVRR+I+ ER+LN + +   V SE V+RVLR+  R   +S RFFNWAR+
Sbjct: 46   DYFAAINHVVNIVRREIHPERSLNSLRLP--VTSEFVFRVLRATSRSSNDSLRFFNWARS 103

Query: 486  NHPNYDPXXXXXXXXXXXXSRTGHWETMWKVAHQMKVQNXXXXXXXXXXXXEHYGKHGLI 665
            N P+Y P            +    +E+MWK+  QMK  +            E YGK+G +
Sbjct: 104  N-PSYTPTSMEYEELAKSLASHKKYESMWKILKQMKDLSLDISGETLCFIIEQYGKNGHV 162

Query: 666  DQAVELFNRL-KNFNCPQTTEVYNSLLFALCEVKNFQGAYALIRRMLRKDAVPDKRTYAI 842
            DQAVELFN + K   C QT +VYNSLL ALC+VK F GAYALIRRM+RK   PDKRTYAI
Sbjct: 163  DQAVELFNGVPKTLGCQQTVDVYNSLLHALCDVKMFHGAYALIRRMIRKGLKPDKRTYAI 222

Query: 843  LVNGWCSAGKMREAQNFLEEMSRKGFNPPVRGRDLLIDGLLNAGYLEAAKGLVRKMTKEG 1022
            LVNGWCSAGKM+EAQ FL+EMSR+GFNPP RGRDLLI+GLLNAGYLE+AK +V KMTK G
Sbjct: 223  LVNGWCSAGKMKEAQEFLDEMSRRGFNPPARGRDLLIEGLLNAGYLESAKEMVSKMTKGG 282

Query: 1023 FVPDVGTFNSLAEAICKSGEIDFCIDLFNDVCRLGLSPDVDSYKIMVTAASKVGRIDEAF 1202
            FVPD+ TFN L EAI KSGE++FCI+++   C+LGL  D+D+YK ++ A SK+G+IDEAF
Sbjct: 283  FVPDIQTFNILIEAISKSGEVEFCIEMYYTACKLGLCVDIDTYKTLIPAVSKIGKIDEAF 342

Query: 1203 HILHRSIEDGLRPFPSLYAPILKALCRKGQFDDAFSFFSDMKVKGHPPNRPVYTMLIKMC 1382
             +L+  +EDG +PFPSLYAPI+K +CR G FDDAFSFFSDMKVK HPPNRPVYTMLI MC
Sbjct: 343  RLLNNCVEDGHKPFPSLYAPIIKGMCRNGMFDDAFSFFSDMKVKAHPPNRPVYTMLITMC 402

Query: 1383 GRGGRFLEAANYLVEMTELNLMPLSRNFDMVTDGLKACGKLDLAKRIEQLEISFRGI 1553
            GRGG+F++AANYLVEMTE+ L+P+SR FDMVTDGLK  GK DLA RIEQLE+  RG+
Sbjct: 403  GRGGKFVDAANYLVEMTEMGLVPISRCFDMVTDGLKNGGKHDLAMRIEQLEVQLRGV 459


>gb|ESW31089.1| hypothetical protein PHAVU_002G208300g [Phaseolus vulgaris]
          Length = 448

 Score =  519 bits (1336), Expect = e-144
 Identities = 256/448 (57%), Positives = 324/448 (72%), Gaps = 1/448 (0%)
 Frame = +3

Query: 213  LISPTNGMLKYLRTIMTTSDRVGNCKNTSNDDYFAAIHHISNIVRRDIYMERTLNKMCIS 392
            +++PT  +L    +I  T     + +    D YFA IHHISNIVRRD Y+ERTLNK+ I 
Sbjct: 9    ILTPTKTLLLNFHSIPKTLTTAASAR----DQYFAVIHHISNIVRRDFYLERTLNKLRIH 64

Query: 393  NIVNSELVYRVLRSCCRCGIESFRFFNWARTNHPNYDPXXXXXXXXXXXXSRTGHWETMW 572
              V  ELV+RVLR+C      S RFFNWAR+ HP+Y P            +R  +++TMW
Sbjct: 65   --VTPELVFRVLRACSTAPTPSLRFFNWARS-HPSYTPTSLEFEQIVTTLARANNYQTMW 121

Query: 573  KVAHQMKVQNXXXXXXXXXXXX-EHYGKHGLIDQAVELFNRLKNFNCPQTTEVYNSLLFA 749
             +  Q+ + +             + YG H  IDQAVE+FN+    NCPQT  +YN+LL +
Sbjct: 122  SLIRQVTLHHRLSLSPAAVATLIDAYGHHRHIDQAVEVFNKAPILNCPQTLPLYNALLKS 181

Query: 750  LCEVKNFQGAYALIRRMLRKDAVPDKRTYAILVNGWCSAGKMREAQNFLEEMSRKGFNPP 929
            LC  + F GAYAL+RRMLRK   PDK TYA+LVN WCS+GK+REA+ FL EMS KGFNPP
Sbjct: 182  LCHNRLFHGAYALLRRMLRKGLHPDKATYAVLVNAWCSSGKLREAKLFLREMSEKGFNPP 241

Query: 930  VRGRDLLIDGLLNAGYLEAAKGLVRKMTKEGFVPDVGTFNSLAEAICKSGEIDFCIDLFN 1109
            +RGRDLL++GLLNAGY+E+AKG+VRKM KEG VPDV TFN++ E +CK  E+ FC+DL++
Sbjct: 242  LRGRDLLVEGLLNAGYVESAKGMVRKMIKEGIVPDVETFNAVVETVCKE-EVQFCVDLYH 300

Query: 1110 DVCRLGLSPDVDSYKIMVTAASKVGRIDEAFHILHRSIEDGLRPFPSLYAPILKALCRKG 1289
            +VC LG+ PDV++YKI++ A SK   IDEAF +L+  +EDG RPFPSLYAP++KALCR+G
Sbjct: 301  EVCALGMVPDVNTYKILIPAVSKSDFIDEAFRLLNNFVEDGNRPFPSLYAPVIKALCRRG 360

Query: 1290 QFDDAFSFFSDMKVKGHPPNRPVYTMLIKMCGRGGRFLEAANYLVEMTELNLMPLSRNFD 1469
            QFDDAF FF DMK K HPPNRP+YTMLI MCGR G+F+EAANYL EMTE+ L+P+SR FD
Sbjct: 361  QFDDAFCFFGDMKAKAHPPNRPLYTMLITMCGRAGKFVEAANYLFEMTEMGLVPISRCFD 420

Query: 1470 MVTDGLKACGKLDLAKRIEQLEISFRGI 1553
            MVTDGLK  GK DLA R++QLE+S RG+
Sbjct: 421  MVTDGLKNSGKHDLASRVQQLEVSIRGV 448


>ref|XP_003524064.1| PREDICTED: pentatricopeptide repeat-containing protein At5g18390,
            mitochondrial-like isoform X1 [Glycine max]
            gi|571455122|ref|XP_006579993.1| PREDICTED:
            pentatricopeptide repeat-containing protein At5g18390,
            mitochondrial-like isoform X2 [Glycine max]
            gi|571455124|ref|XP_006579994.1| PREDICTED:
            pentatricopeptide repeat-containing protein At5g18390,
            mitochondrial-like isoform X3 [Glycine max]
          Length = 450

 Score =  515 bits (1326), Expect = e-143
 Identities = 255/465 (54%), Positives = 334/465 (71%), Gaps = 2/465 (0%)
 Frame = +3

Query: 165  MFRPARYILIRITSPHLISPTNGMLKYLRTIMTTSDRVGNCKNTSNDDYFAAIHHISNIV 344
            M +    +++R + P L+   + + K L T            ++S D+YFA IHH+SNIV
Sbjct: 1    MLQTCSKLILRHSKPRLLLNLHSITKTLTTA-----------SSSRDEYFAVIHHVSNIV 49

Query: 345  RRDIYMERTLNKMCISNIVNSELVYRVLRSCCRCGIESFRFFNWARTNHPNYDPXXXXXX 524
            RRD Y+ERTLNK+ I+  V  ELV+RVLR+C     ES RFFNWART HP+Y P      
Sbjct: 50   RRDFYLERTLNKLRIT--VTPELVFRVLRACSNNPTESLRFFNWART-HPSYSPTSLEFE 106

Query: 525  XXXXXXSRTGHWETMWKVAHQMKVQNXXXXXXXXXXXX-EHYGKHGLIDQAVELFNRLKN 701
                  +R   +++MW +  Q+ + +             E YG +  +DQ+V++FN+   
Sbjct: 107  QIVTTLARANTYQSMWALIRQVTLHHRLSLSPSAVASVIEAYGDNRHVDQSVQVFNKSPL 166

Query: 702  F-NCPQTTEVYNSLLFALCEVKNFQGAYALIRRMLRKDAVPDKRTYAILVNGWCSAGKMR 878
              NCPQT  +YN+LL +LC  K F GAYAL+RRMLRK   PDK TYA+LVN WCS GK+R
Sbjct: 167  LLNCPQTLPLYNALLRSLCHNKLFHGAYALVRRMLRKGLRPDKTTYAVLVNAWCSNGKLR 226

Query: 879  EAQNFLEEMSRKGFNPPVRGRDLLIDGLLNAGYLEAAKGLVRKMTKEGFVPDVGTFNSLA 1058
            EA+ FLEEMS KGFNPPVRGRDLL++GLLNAGY+E+AKG+VR M K+G VPDVGTFN++ 
Sbjct: 227  EAKLFLEEMSEKGFNPPVRGRDLLVEGLLNAGYVESAKGMVRNMIKQGSVPDVGTFNAVV 286

Query: 1059 EAICKSGEIDFCIDLFNDVCRLGLSPDVDSYKIMVTAASKVGRIDEAFHILHRSIEDGLR 1238
            E + K  ++ FC+ L+++VC LG++PDV++YKI+V A SK G +DEAF +L+  IEDG +
Sbjct: 287  ETVSKE-DVQFCVGLYHEVCALGMAPDVNTYKILVPAVSKSGMVDEAFRLLNNFIEDGHK 345

Query: 1239 PFPSLYAPILKALCRKGQFDDAFSFFSDMKVKGHPPNRPVYTMLIKMCGRGGRFLEAANY 1418
            PFPSLYAP++KALCR+GQFDDAF FF DMK K HPPNRP+YTMLI MCGR G+F+EAANY
Sbjct: 346  PFPSLYAPVIKALCRRGQFDDAFCFFGDMKAKAHPPNRPLYTMLITMCGRAGKFVEAANY 405

Query: 1419 LVEMTELNLMPLSRNFDMVTDGLKACGKLDLAKRIEQLEISFRGI 1553
            + EMTE+ L+P+SR FDMVTDGLK CGK DLA+R+++LE+S RG+
Sbjct: 406  IFEMTEMGLVPISRCFDMVTDGLKNCGKHDLARRVQELEVSIRGV 450


>ref|XP_002325369.2| hypothetical protein POPTR_0019s04240g [Populus trichocarpa]
            gi|550316751|gb|EEE99750.2| hypothetical protein
            POPTR_0019s04240g [Populus trichocarpa]
          Length = 333

 Score =  481 bits (1239), Expect = e-133
 Identities = 230/332 (69%), Positives = 272/332 (81%), Gaps = 4/332 (1%)
 Frame = +3

Query: 567  MWKVAHQMKVQ---NXXXXXXXXXXXXEHYGKHGLIDQAVELFNRL-KNFNCPQTTEVYN 734
            MWK+  Q+K                  E YGKHGLIDQAVE+FN+  ++ NC     VYN
Sbjct: 1    MWKLIAQVKDNLGDKFSVSSDTVCSIIEEYGKHGLIDQAVEVFNKCSRSLNCQHNVCVYN 60

Query: 735  SLLFALCEVKNFQGAYALIRRMLRKDAVPDKRTYAILVNGWCSAGKMREAQNFLEEMSRK 914
            SLLFALCEVK F GAYAL+RRM+RK  VPDKRTY +LVNGWCS+GK+REA+ FLEEMS+K
Sbjct: 61   SLLFALCEVKMFHGAYALVRRMIRKGIVPDKRTYGVLVNGWCSSGKLREAKGFLEEMSKK 120

Query: 915  GFNPPVRGRDLLIDGLLNAGYLEAAKGLVRKMTKEGFVPDVGTFNSLAEAICKSGEIDFC 1094
            GFNPPVRGRDLLI+GLLNAGYLE+AK +VR+M KEG VPDV TFNS+ EAIC +GE+DFC
Sbjct: 121  GFNPPVRGRDLLIEGLLNAGYLESAKDMVRRMMKEGLVPDVNTFNSMVEAICNAGEVDFC 180

Query: 1095 IDLFNDVCRLGLSPDVDSYKIMVTAASKVGRIDEAFHILHRSIEDGLRPFPSLYAPILKA 1274
            +D+++ VC+LG  PD++SYKI++ A SKVGRIDEAF +LH  IEDG +PFPSLYAPI+K 
Sbjct: 181  VDMYHSVCKLGFCPDINSYKILIPAVSKVGRIDEAFRLLHNLIEDGHKPFPSLYAPIIKG 240

Query: 1275 LCRKGQFDDAFSFFSDMKVKGHPPNRPVYTMLIKMCGRGGRFLEAANYLVEMTELNLMPL 1454
            + R+GQFDDAF FFS+MKVKGHPPNRPVYTM+I MCGRGG+ +EAANYLVEMTE+ L+P+
Sbjct: 241  MFRRGQFDDAFCFFSEMKVKGHPPNRPVYTMMITMCGRGGKHVEAANYLVEMTEIGLVPI 300

Query: 1455 SRNFDMVTDGLKACGKLDLAKRIEQLEISFRG 1550
            SR FDMVTDGLK CGK DLAKRIEQLE+S RG
Sbjct: 301  SRCFDMVTDGLKNCGKHDLAKRIEQLEVSLRG 332


Top