BLASTX nr result

ID: Catharanthus23_contig00023647 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00023647
         (1115 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006358787.1| PREDICTED: pentatricopeptide repeat-containi...   492   e-136
ref|XP_004248026.1| PREDICTED: pentatricopeptide repeat-containi...   485   e-134
gb|EPS58676.1| hypothetical protein M569_16136, partial [Genlise...   424   e-116
ref|XP_002268375.1| PREDICTED: pentatricopeptide repeat-containi...   401   e-109
ref|XP_004142520.1| PREDICTED: pentatricopeptide repeat-containi...   395   e-107
gb|EXB93122.1| hypothetical protein L484_024459 [Morus notabilis]     392   e-106
gb|EOY10792.1| Pentatricopeptide repeat (PPR) superfamily protei...   381   e-103
ref|XP_006830661.1| hypothetical protein AMTR_s00210p00017530 [A...   378   e-102
gb|EMJ07970.1| hypothetical protein PRUPE_ppa025361mg [Prunus pe...   375   e-101
ref|XP_004504788.1| PREDICTED: pentatricopeptide repeat-containi...   357   5e-96
ref|XP_004296694.1| PREDICTED: pentatricopeptide repeat-containi...   352   1e-94
ref|XP_002522032.1| pentatricopeptide repeat-containing protein,...   348   3e-93
ref|XP_006479008.1| PREDICTED: pentatricopeptide repeat-containi...   342   1e-91
ref|XP_006400384.1| hypothetical protein EUTSA_v10013477mg [Eutr...   342   1e-91
ref|XP_006287650.1| hypothetical protein CARUB_v10000860mg, part...   338   2e-90
ref|NP_197340.1| pentatricopeptide repeat-containing protein [Ar...   336   1e-89
ref|XP_002871814.1| pentatricopeptide repeat-containing protein ...   336   1e-89
gb|ESW31089.1| hypothetical protein PHAVU_002G208300g [Phaseolus...   330   5e-88
ref|XP_003524064.1| PREDICTED: pentatricopeptide repeat-containi...   322   2e-85
emb|CAN80394.1| hypothetical protein VITISV_001596 [Vitis vinifera]   308   3e-81

>ref|XP_006358787.1| PREDICTED: pentatricopeptide repeat-containing protein At5g18390,
            mitochondrial-like isoform X1 [Solanum tuberosum]
            gi|565385886|ref|XP_006358788.1| PREDICTED:
            pentatricopeptide repeat-containing protein At5g18390,
            mitochondrial-like isoform X2 [Solanum tuberosum]
            gi|565385889|ref|XP_006358789.1| PREDICTED:
            pentatricopeptide repeat-containing protein At5g18390,
            mitochondrial-like isoform X3 [Solanum tuberosum]
          Length = 472

 Score =  492 bits (1267), Expect = e-136
 Identities = 245/343 (71%), Positives = 272/343 (79%), Gaps = 8/343 (2%)
 Frame = +1

Query: 109  ARYILIRITSPHLISPTNGMLKYLRTIMTTSDRV-------GNC-KNTSNDDYFAAIHHI 264
            AR    R       +    + + L+TI  + D+        G+C ++  NDDYFA IHH+
Sbjct: 6    ARIFFQRTLYSATFTNNTSIFRCLKTIAPSIDQCSNPFSGKGSCGRSVPNDDYFATIHHV 65

Query: 265  SNIVRRDIYMERTLNKMCISNIVNSELVYRVLRSCCRCGIESFRFFNWARTNHPNYDPXX 444
            SNIVRRDIY+ERTLNKM IS+IVNSELVYRVLRSCC+ GIESFRFFNWART HP YDP  
Sbjct: 66   SNIVRRDIYLERTLNKMHISSIVNSELVYRVLRSCCQHGIESFRFFNWARTQHPQYDPTT 125

Query: 445  XXXXXXXXXXSRTGHWETMWKVAHQMKVQNXXXXXXXXXXXXEHYGKHGLIDQAVELFNR 624
                      +RT HWETMWKV  QMK QN            EHYGK GLIDQAVELFNR
Sbjct: 126  VEFEELLKTLARTAHWETMWKVVQQMKAQNIPISPSIVSFIIEHYGKRGLIDQAVELFNR 185

Query: 625  LKNFNCPQTTEVYNSLLFALCEVKNFQGAYALIRRMLRKDAVPDKRTYAILVNGWCSAGK 804
            LKNF+CPQTTEVYN++LFALCEVKNFQGAYALIRRM+RK  VPDK+TY+ILVNGWCSAGK
Sbjct: 186  LKNFDCPQTTEVYNAMLFALCEVKNFQGAYALIRRMIRKGTVPDKQTYSILVNGWCSAGK 245

Query: 805  MREAQNFLEEMSRKGFNPPVRGRDLLIDGLLNAGYLEAAKGLVRKMTKEGFVPDVGTFNS 984
            MREAQ FLEEMSRKGFNPPVRGRDLLIDGLL+AGYLE+AKGLVRKMTKEGFVPDVGTFNS
Sbjct: 246  MREAQEFLEEMSRKGFNPPVRGRDLLIDGLLSAGYLESAKGLVRKMTKEGFVPDVGTFNS 305

Query: 985  LAEAICKSGEIDFCIDLFNDVCRLGLSPDVD*YKIMVTAASKV 1113
            LAEAICK+GEIDFCIDLFNDVCR GL PD++ YKI++TAASKV
Sbjct: 306  LAEAICKTGEIDFCIDLFNDVCRSGLFPDIETYKIVITAASKV 348


>ref|XP_004248026.1| PREDICTED: pentatricopeptide repeat-containing protein At5g18390,
            mitochondrial-like [Solanum lycopersicum]
          Length = 470

 Score =  485 bits (1248), Expect = e-134
 Identities = 235/301 (78%), Positives = 254/301 (84%), Gaps = 1/301 (0%)
 Frame = +1

Query: 211  GNC-KNTSNDDYFAAIHHISNIVRRDIYMERTLNKMCISNIVNSELVYRVLRSCCRCGIE 387
            G+C ++  NDDYFA IHH+SNIVRRDIY+ERTLNKM IS IVNSELVYRVLRSCC+ GIE
Sbjct: 45   GSCGRSVPNDDYFATIHHVSNIVRRDIYLERTLNKMHISRIVNSELVYRVLRSCCQHGIE 104

Query: 388  SFRFFNWARTNHPNYDPXXXXXXXXXXXXSRTGHWETMWKVAHQMKVQNXXXXXXXXXXX 567
            SFRFFNWART HP YDP            +RT HWETMWKV  QMK QN           
Sbjct: 105  SFRFFNWARTQHPQYDPTTVEFEELLKTLARTAHWETMWKVVQQMKAQNIPISPSIVSFI 164

Query: 568  XEHYGKHGLIDQAVELFNRLKNFNCPQTTEVYNSLLFALCEVKNFQGAYALIRRMLRKDA 747
             EHYGK GLIDQAVELFNRLKNF C QTTEVYN++LFALCEVKNFQGAYALIRRM+RK  
Sbjct: 165  IEHYGKRGLIDQAVELFNRLKNFGCSQTTEVYNAMLFALCEVKNFQGAYALIRRMIRKGT 224

Query: 748  VPDKRTYAILVNGWCSAGKMREAQNFLEEMSRKGFNPPVRGRDLLIDGLLNAGYLEAAKG 927
            VPDK TY+ILVNGWCSAGKMREAQ FLEEMSRKGFNPPVRGRDLLIDGLL+AGYLE+AKG
Sbjct: 225  VPDKLTYSILVNGWCSAGKMREAQEFLEEMSRKGFNPPVRGRDLLIDGLLSAGYLESAKG 284

Query: 928  LVRKMTKEGFVPDVGTFNSLAEAICKSGEIDFCIDLFNDVCRLGLSPDVD*YKIMVTAAS 1107
            LVRKMTKEGFVPDVGTFNSLAEA+CK+GEIDFCIDLFNDVCRLGL PD + YKI++TAA+
Sbjct: 285  LVRKMTKEGFVPDVGTFNSLAEAVCKTGEIDFCIDLFNDVCRLGLCPDTETYKIVITAAA 344

Query: 1108 K 1110
            K
Sbjct: 345  K 345


>gb|EPS58676.1| hypothetical protein M569_16136, partial [Genlisea aurea]
          Length = 419

 Score =  424 bits (1090), Expect = e-116
 Identities = 205/292 (70%), Positives = 231/292 (79%)
 Frame = +1

Query: 235  DDYFAAIHHISNIVRRDIYMERTLNKMCISNIVNSELVYRVLRSCCRCGIESFRFFNWAR 414
            DDYFA IHHISNIVRRDIY+ERTL KM IS++VNSELVYRV+ +C   GIESFRFFNWAR
Sbjct: 1    DDYFATIHHISNIVRRDIYLERTLMKMNISSLVNSELVYRVINNCSSSGIESFRFFNWAR 60

Query: 415  TNHPNYDPXXXXXXXXXXXXSRTGHWETMWKVAHQMKVQNXXXXXXXXXXXXEHYGKHGL 594
            + HPNY+P            ++T HWETMWKV H MK Q             E Y KHGL
Sbjct: 61   SCHPNYEPTTLEFEALLKVLAQTKHWETMWKVVHTMKSQQSPISPGIMSFIIEQYAKHGL 120

Query: 595  IDQAVELFNRLKNFNCPQTTEVYNSLLFALCEVKNFQGAYALIRRMLRKDAVPDKRTYAI 774
            ID+AVELFN LKN +CPQT EVYNSLLFALCEVKNFQGAYAL+RRM+RK  VPDKRTY+I
Sbjct: 121  IDKAVELFNGLKNLDCPQTIEVYNSLLFALCEVKNFQGAYALVRRMIRKGNVPDKRTYSI 180

Query: 775  LVNGWCSAGKMREAQNFLEEMSRKGFNPPVRGRDLLIDGLLNAGYLEAAKGLVRKMTKEG 954
            LVN WC AGK+ EAQ FLEEMS+KG+NPPVRGRDLLIDGLLNAGYLE AKGLVRKMTK G
Sbjct: 181  LVNAWCRAGKLIEAQEFLEEMSKKGYNPPVRGRDLLIDGLLNAGYLECAKGLVRKMTKIG 240

Query: 955  FVPDVGTFNSLAEAICKSGEIDFCIDLFNDVCRLGLSPDVD*YKIMVTAASK 1110
             +PD+ TFNSLAEA+CK+GE D CI  F+D+C LG  P+ D YKIM+TAAS+
Sbjct: 241  SIPDIATFNSLAEALCKNGETDACIASFDDICELGFCPNSDTYKIMITAASR 292


>ref|XP_002268375.1| PREDICTED: pentatricopeptide repeat-containing protein At5g18390,
            mitochondrial [Vitis vinifera]
            gi|296085168|emb|CBI28663.3| unnamed protein product
            [Vitis vinifera]
          Length = 454

 Score =  401 bits (1030), Expect = e-109
 Identities = 205/337 (60%), Positives = 250/337 (74%), Gaps = 1/337 (0%)
 Frame = +1

Query: 103  RPARYILIRITSPHLISPTNGMLKYLRTIMTTSDRVGNCKNTSNDDYFAAIHHISNIVRR 282
            RP  ++LIR       SP + +L +L+T+ T +  + N   +  DDYFA +HHIS IVRR
Sbjct: 6    RPIFHLLIRN------SPASPILTHLKTLTTITTHLQNTITSKKDDYFAVVHHISAIVRR 59

Query: 283  DIYMERTLNKMCISNIVNSELVYRVLRSCCRCGIESFRFFNWARTNHPNYDPXXXXXXXX 462
            D Y+ERTLNK+ IS  V S+LVYRVLRSC   G ES RFFNWAR+ H +Y P        
Sbjct: 60   DFYLERTLNKLPIS--VTSDLVYRVLRSCPNSGTESLRFFNWARS-HLSYQPTTLEYEEL 116

Query: 463  XXXXSRTGHWETMWKVAHQMKVQNXXXXXXXXXXXXEHYGKHGLIDQAVELFNRLKN-FN 639
                +RT  ++ MWK+AHQM+  +            E +GKHGL+DQAVE+FN+ K+  N
Sbjct: 117  LKTLARTKQFQPMWKIAHQMQTLSPTVVSSII----EEFGKHGLVDQAVEVFNKAKSALN 172

Query: 640  CPQTTEVYNSLLFALCEVKNFQGAYALIRRMLRKDAVPDKRTYAILVNGWCSAGKMREAQ 819
            CPQT EVYNSLLFALCEVK F GAYALIRRM+RK   P+K+TY++LVNGWC+AGKM+EAQ
Sbjct: 173  CPQTIEVYNSLLFALCEVKYFHGAYALIRRMIRKGVTPNKQTYSVLVNGWCAAGKMKEAQ 232

Query: 820  NFLEEMSRKGFNPPVRGRDLLIDGLLNAGYLEAAKGLVRKMTKEGFVPDVGTFNSLAEAI 999
            +FLEEMSRKGFNPPVRGRDLL+DGLLNAGYLEAAK +VRKMTKEG  PDV T NS+ EAI
Sbjct: 233  DFLEEMSRKGFNPPVRGRDLLVDGLLNAGYLEAAKEMVRKMTKEGCAPDVETLNSMLEAI 292

Query: 1000 CKSGEIDFCIDLFNDVCRLGLSPDVD*YKIMVTAASK 1110
            CK+GE +FCID++NDVCRLG+SP+V  YKIM+ AA K
Sbjct: 293  CKAGEAEFCIDIYNDVCRLGVSPNVGTYKIMIPAACK 329


>ref|XP_004142520.1| PREDICTED: pentatricopeptide repeat-containing protein At5g18390,
            mitochondrial-like [Cucumis sativus]
            gi|449518358|ref|XP_004166209.1| PREDICTED:
            pentatricopeptide repeat-containing protein At5g18390,
            mitochondrial-like [Cucumis sativus]
          Length = 455

 Score =  395 bits (1014), Expect = e-107
 Identities = 193/312 (61%), Positives = 234/312 (75%), Gaps = 1/312 (0%)
 Frame = +1

Query: 181  RTIMTTSDRVGNCKNTSNDDYFAAIHHISNIVRRDIYMERTLNKMCISNIVNSELVYRVL 360
            R    ++  V      S DDYFAAIHHIS+IVRRD YMERTLNK+ ISN+ NSELV+RVL
Sbjct: 21   RHFSNSNSLVNGSTAPSKDDYFAAIHHISHIVRRDFYMERTLNKLRISNL-NSELVFRVL 79

Query: 361  RSCCRCGIESFRFFNWARTNHPNYDPXXXXXXXXXXXXSRTGHWETMWKVAHQMKVQNXX 540
            R+C   G ESFRFFNWA +++P+Y P            +RT  + TMWKV  QMK QN  
Sbjct: 80   RACSNSGTESFRFFNWACSHNPSYQPTTLEFEELVKTLARTRKYTTMWKVLLQMKTQNLK 139

Query: 541  XXXXXXXXXXEHYGKHGLIDQAVELFNRL-KNFNCPQTTEVYNSLLFALCEVKNFQGAYA 717
                      + YGK GL+D AV +FN+  K+ +CPQT EVYN+LLFALCEVK F GAYA
Sbjct: 140  ISPETISFIIQEYGKQGLVDNAVTIFNQCSKSIDCPQTVEVYNALLFALCEVKMFHGAYA 199

Query: 718  LIRRMLRKDAVPDKRTYAILVNGWCSAGKMREAQNFLEEMSRKGFNPPVRGRDLLIDGLL 897
            LIRRM+RK   PDK+TY  LV GWCSAGKM+EAQ FLEEMS+KGFNPP+RGRDLL++GLL
Sbjct: 200  LIRRMIRKGVTPDKKTYGTLVTGWCSAGKMKEAQEFLEEMSQKGFNPPLRGRDLLVEGLL 259

Query: 898  NAGYLEAAKGLVRKMTKEGFVPDVGTFNSLAEAICKSGEIDFCIDLFNDVCRLGLSPDVD 1077
            NAGYLE+AK +VRKMTKEG VPD+GTFNSL + IC SGE+DFCI++F++VC+LGL PD++
Sbjct: 260  NAGYLESAKDMVRKMTKEGSVPDIGTFNSLIDVICNSGEVDFCINIFHEVCKLGLCPDIN 319

Query: 1078 *YKIMVTAASKV 1113
             YKI++ A SKV
Sbjct: 320  TYKILIPATSKV 331



 Score = 63.2 bits (152), Expect = 2e-07
 Identities = 41/176 (23%), Positives = 80/176 (45%), Gaps = 2/176 (1%)
 Frame = +1

Query: 589  GLIDQAVELFNRL--KNFNCPQTTEVYNSLLFALCEVKNFQGAYALIRRMLRKDAVPDKR 762
            G + +A E    +  K FN P      + L+  L      + A  ++R+M ++ +VPD  
Sbjct: 227  GKMKEAQEFLEEMSQKGFNPPLRGR--DLLVEGLLNAGYLESAKDMVRKMTKEGSVPDIG 284

Query: 763  TYAILVNGWCSAGKMREAQNFLEEMSRKGFNPPVRGRDLLIDGLLNAGYLEAAKGLVRKM 942
            T+  L++  C++G++    N   E+ + G  P +    +LI      G ++ A  L+   
Sbjct: 285  TFNSLIDVICNSGEVDFCINIFHEVCKLGLCPDINTYKILIPATSKVGRIDEAFRLLHCC 344

Query: 943  TKEGFVPDVGTFNSLAEAICKSGEIDFCIDLFNDVCRLGLSPDVD*YKIMVTAASK 1110
             ++G VP    +  + + +CK G+ D     F D+   G  P+   Y +++T   +
Sbjct: 345  IEDGHVPFPSLYGPILKGMCKRGQFDDAFCFFGDMKHKGHPPNRPVYTMLITMCGR 400


>gb|EXB93122.1| hypothetical protein L484_024459 [Morus notabilis]
          Length = 470

 Score =  392 bits (1008), Expect = e-106
 Identities = 197/321 (61%), Positives = 243/321 (75%), Gaps = 7/321 (2%)
 Frame = +1

Query: 169  LKYLRTIMTTSDRVGNCKNT-----SNDDYFAAIHHISNIVRRDIYMERTLNKMCISNIV 333
            L+Y +T+ +T+    + +N+     S D+YFAAIHHISNIV+RD YMERTLNK+ I+  V
Sbjct: 26   LRYFQTLPSTAVNGDHYQNSTKPSSSKDNYFAAIHHISNIVQRDFYMERTLNKLRIA-AV 84

Query: 334  NSELVYRVLRSCCRCGIESFRFFNWARTNHPNYDPXXXXXXXXXXXXSRTGHWETMWKVA 513
            +S+LV+RVLR+C + G ES RFFNWAR++ P+Y P            +RT  +E+MWK+ 
Sbjct: 85   DSDLVFRVLRACHKFGPESLRFFNWARSHQPSYRPTSVELEELAKNLARTKKYESMWKIL 144

Query: 514  HQMKVQNXXXXXXXXXXXX-EHYGKHGLIDQAVELFNRL-KNFNCPQTTEVYNSLLFALC 687
             QMK  N             E YGK GL+DQA E+FNR+ K FNC QT EVYNSLLFALC
Sbjct: 145  QQMKTNNNLIISSETLCFIIEEYGKQGLVDQAAEVFNRVPKIFNCSQTVEVYNSLLFALC 204

Query: 688  EVKNFQGAYALIRRMLRKDAVPDKRTYAILVNGWCSAGKMREAQNFLEEMSRKGFNPPVR 867
            EVK F GAYAL+RRM+RK+ VPDKRTY+ILVN WCSAGKMREAQNFL EMS+KGFNPPVR
Sbjct: 205  EVKLFHGAYALVRRMIRKEVVPDKRTYSILVNAWCSAGKMREAQNFLSEMSKKGFNPPVR 264

Query: 868  GRDLLIDGLLNAGYLEAAKGLVRKMTKEGFVPDVGTFNSLAEAICKSGEIDFCIDLFNDV 1047
            GRDLLI+GLLNAGY+E+AK +VRKM KEGF+PDV TFNSL E ICKS E++FCIDL++ V
Sbjct: 265  GRDLLIEGLLNAGYIESAKEMVRKMVKEGFLPDVSTFNSLVEVICKSEEVEFCIDLYHQV 324

Query: 1048 CRLGLSPDVD*YKIMVTAASK 1110
            C LGL PD++ YK+++ A SK
Sbjct: 325  CGLGLCPDINTYKVLIPAVSK 345


>gb|EOY10792.1| Pentatricopeptide repeat (PPR) superfamily protein [Theobroma cacao]
          Length = 455

 Score =  381 bits (979), Expect = e-103
 Identities = 190/295 (64%), Positives = 223/295 (75%)
 Frame = +1

Query: 229  SNDDYFAAIHHISNIVRRDIYMERTLNKMCISNIVNSELVYRVLRSCCRCGIESFRFFNW 408
            S DDYFAAIHHISN VRR+++ ERTLN+M IS  VNSELV+RVLRSC     ES RFF+W
Sbjct: 42   SKDDYFAAIHHISNTVRREVHPERTLNRMNIS--VNSELVFRVLRSCSNSPTESLRFFSW 99

Query: 409  ARTNHPNYDPXXXXXXXXXXXXSRTGHWETMWKVAHQMKVQNXXXXXXXXXXXXEHYGKH 588
            AR +   Y P             R   +E+MWK   QM+ QN            E YGK+
Sbjct: 100  ARAH---YVPTSVEFEELVKILIRHRKYESMWKTIQQMQKQNLSLSCDTLSFIIEEYGKN 156

Query: 589  GLIDQAVELFNRLKNFNCPQTTEVYNSLLFALCEVKNFQGAYALIRRMLRKDAVPDKRTY 768
            GL+DQAVE+FN+  +  C QT  VYNSLLFALCEVK F GAYALIRRM+RK  VPDKRTY
Sbjct: 157  GLVDQAVEVFNKSTSLGCKQTVSVYNSLLFALCEVKMFHGAYALIRRMIRKGEVPDKRTY 216

Query: 769  AILVNGWCSAGKMREAQNFLEEMSRKGFNPPVRGRDLLIDGLLNAGYLEAAKGLVRKMTK 948
            AILVNGWCS GKMREAQ FLEEMS+ GFNPPVRGRDLL++GLLNAGYLE+AK +VR+MTK
Sbjct: 217  AILVNGWCSGGKMREAQEFLEEMSKMGFNPPVRGRDLLVEGLLNAGYLESAKEMVRRMTK 276

Query: 949  EGFVPDVGTFNSLAEAICKSGEIDFCIDLFNDVCRLGLSPDVD*YKIMVTAASKV 1113
            EGFVPD+GTFNSL E IC SGE+DFCI++++ VC+LGL PD++ YKI++ AASKV
Sbjct: 277  EGFVPDIGTFNSLVETICSSGEVDFCINMYHSVCKLGLCPDINTYKILIPAASKV 331



 Score = 62.8 bits (151), Expect = 2e-07
 Identities = 38/174 (21%), Positives = 75/174 (43%)
 Frame = +1

Query: 589  GLIDQAVELFNRLKNFNCPQTTEVYNSLLFALCEVKNFQGAYALIRRMLRKDAVPDKRTY 768
            G + +A E    +            + L+  L      + A  ++RRM ++  VPD  T+
Sbjct: 227  GKMREAQEFLEEMSKMGFNPPVRGRDLLVEGLLNAGYLESAKEMVRRMTKEGFVPDIGTF 286

Query: 769  AILVNGWCSAGKMREAQNFLEEMSRKGFNPPVRGRDLLIDGLLNAGYLEAAKGLVRKMTK 948
              LV   CS+G++    N    + + G  P +    +LI      G ++ A  L+    +
Sbjct: 287  NSLVETICSSGEVDFCINMYHSVCKLGLCPDINTYKILIPAASKVGRIDEAFRLLNNSVE 346

Query: 949  EGFVPDVGTFNSLAEAICKSGEIDFCIDLFNDVCRLGLSPDVD*YKIMVTAASK 1110
            +G+ P    +  + +A+C+ G+ D     F ++   G SP+   Y +++T   +
Sbjct: 347  DGYRPFPSLYAPIIKAMCRKGQFDDAFSFFGEMKVKGHSPNRPVYTMLITMCGR 400



 Score = 58.5 bits (140), Expect = 5e-06
 Identities = 36/114 (31%), Positives = 53/114 (46%)
 Frame = +1

Query: 583 KHGLIDQAVELFNRLKNFNCPQTTEVYNSLLFALCEVKNFQGAYALIRRMLRKDAVPDKR 762
           K G ID+A  L N            +Y  ++ A+C    F  A++    M  K   P++ 
Sbjct: 330 KVGRIDEAFRLLNNSVEDGYRPFPSLYAPIIKAMCRKGQFDDAFSFFGEMKVKGHSPNRP 389

Query: 763 TYAILVNGWCSAGKMREAQNFLEEMSRKGFNPPVRGRDLLIDGLLNAGYLEAAK 924
            Y +L+      G+  EA N+L EM+  G  P  R  D++IDGL N G  + AK
Sbjct: 390 VYTMLITMCGRGGRFVEAANYLVEMTELGLAPISRCFDMVIDGLKNCGKHDLAK 443


>ref|XP_006830661.1| hypothetical protein AMTR_s00210p00017530 [Amborella trichopoda]
            gi|548837251|gb|ERM98077.1| hypothetical protein
            AMTR_s00210p00017530 [Amborella trichopoda]
          Length = 459

 Score =  378 bits (970), Expect = e-102
 Identities = 196/332 (59%), Positives = 238/332 (71%), Gaps = 1/332 (0%)
 Frame = +1

Query: 118  ILIRITSPHLISPTNGMLKYLRTIMTTSDRVGNCK-NTSNDDYFAAIHHISNIVRRDIYM 294
            +L+R+   H  +P    L+ +RT    S        + S DDYFA +HHISNIVRRD ++
Sbjct: 6    LLLRLHLNHNRAPFLLFLRAMRTTPLPSKPDHEVTIHGSKDDYFAVVHHISNIVRRDYFL 65

Query: 295  ERTLNKMCISNIVNSELVYRVLRSCCRCGIESFRFFNWARTNHPNYDPXXXXXXXXXXXX 474
            ERTL K+ ++  +  ELVYRVLRSC + GIESFRFFNWART H +Y P            
Sbjct: 66   ERTLQKLNLT--LTPELVYRVLRSCNKNGIESFRFFNWART-HASYHPTTIEFEELIKTL 122

Query: 475  SRTGHWETMWKVAHQMKVQNXXXXXXXXXXXXEHYGKHGLIDQAVELFNRLKNFNCPQTT 654
             +T +WETMWKVA  MK+              + YGK GL+D+AVE+FNR+K+F+CPQTT
Sbjct: 123  GQTKNWETMWKVADHMKILGFPLSPETFSAVMDSYGKAGLLDRAVEVFNRMKHFDCPQTT 182

Query: 655  EVYNSLLFALCEVKNFQGAYALIRRMLRKDAVPDKRTYAILVNGWCSAGKMREAQNFLEE 834
             VYNSLL ALC VKNFQGAYALIRRM+RK   PDK+TYAILVNGWCS+GK+ EA+ FLEE
Sbjct: 183  GVYNSLLSALCMVKNFQGAYALIRRMIRKGGHPDKQTYAILVNGWCSSGKLGEAREFLEE 242

Query: 835  MSRKGFNPPVRGRDLLIDGLLNAGYLEAAKGLVRKMTKEGFVPDVGTFNSLAEAICKSGE 1014
            MS+KGFNPPVRGRDLLIDGLLNAGYLE+AK LV+KMTKEGF+PD+ TFNSL EA+C SGE
Sbjct: 243  MSKKGFNPPVRGRDLLIDGLLNAGYLESAKELVKKMTKEGFLPDISTFNSLLEALCNSGE 302

Query: 1015 IDFCIDLFNDVCRLGLSPDVD*YKIMVTAASK 1110
             +FCI+L   V  L L  D+  YKI++ A SK
Sbjct: 303  TEFCIELLRVVTELSLVLDIGTYKILIPAVSK 334



 Score = 61.6 bits (148), Expect = 5e-07
 Identities = 37/114 (32%), Positives = 55/114 (48%)
 Frame = +1

Query: 583 KHGLIDQAVELFNRLKNFNCPQTTEVYNSLLFALCEVKNFQGAYALIRRMLRKDAVPDKR 762
           K G ID+A  L +            +Y  LL  LC+   F  A++L   M  +   P++ 
Sbjct: 334 KSGQIDEAFRLLHASIEDGHKPFPSLYAPLLKVLCKRGQFGDAFSLFADMKAEGHAPNRP 393

Query: 763 TYAILVNGWCSAGKMREAQNFLEEMSRKGFNPPVRGRDLLIDGLLNAGYLEAAK 924
            Y +L+   C  G+  +A N+L EM  +G  P     D++IDGL NAG  + AK
Sbjct: 394 VYTMLMRMCCRGGRCVDAANYLVEMVERGLAPRSESFDMVIDGLKNAGKHDLAK 447


>gb|EMJ07970.1| hypothetical protein PRUPE_ppa025361mg [Prunus persica]
          Length = 460

 Score =  375 bits (962), Expect = e-101
 Identities = 191/314 (60%), Positives = 228/314 (72%), Gaps = 2/314 (0%)
 Frame = +1

Query: 178  LRTIMTTSDRVGNCKNTSNDDYFAAIHHISNIVRRDIYMERTLNKMCISNIVNSELVYRV 357
            LR + T +    N    + DDYF+AI HI+NIVRRD +MERTLNK+ I+  V+SELVYRV
Sbjct: 25   LRHLATVNAAPQNRVVPTKDDYFSAIQHITNIVRRDHFMERTLNKLRIT--VDSELVYRV 82

Query: 358  LRSCCRCGIESFRFFNWARTNHPNYDPXXXXXXXXXXXXSRTGHWETMWKVAHQMKVQNX 537
            LR+C   G ES RFFNWART+HP Y P            +RT  +E+MWK+   M+  + 
Sbjct: 83   LRACSAAGTESLRFFNWARTHHPTYHPTTLELEELVKTLARTKKYESMWKLLQSMQTHHG 142

Query: 538  XXXXXXXXXXX-EHYGKHGLIDQAVELFNRL-KNFNCPQTTEVYNSLLFALCEVKNFQGA 711
                        E YG HGL+DQAVELFNR  K FNC QT EVYN+LLF+LC+ K F  A
Sbjct: 143  LTLSQESLCFVIEEYGNHGLVDQAVELFNRAPKTFNCLQTVEVYNALLFSLCQAKLFHAA 202

Query: 712  YALIRRMLRKDAVPDKRTYAILVNGWCSAGKMREAQNFLEEMSRKGFNPPVRGRDLLIDG 891
            YAL+RRM+RK  VPDKRTY+ILVN WCS GKMREAQ FLEEMS KGFNPPVRGRDLL++G
Sbjct: 203  YALVRRMIRKGLVPDKRTYSILVNAWCSNGKMREAQLFLEEMSSKGFNPPVRGRDLLVEG 262

Query: 892  LLNAGYLEAAKGLVRKMTKEGFVPDVGTFNSLAEAICKSGEIDFCIDLFNDVCRLGLSPD 1071
            LLNAGY+EAAK +VRKM KEGFVPDV TFNSL EAICK GE++FCIDL+ +   LGL PD
Sbjct: 263  LLNAGYIEAAKEMVRKMVKEGFVPDVSTFNSLMEAICKCGEVEFCIDLYWEANGLGLCPD 322

Query: 1072 VD*YKIMVTAASKV 1113
            ++ YK+++ A SKV
Sbjct: 323  INTYKVLIPAVSKV 336


>ref|XP_004504788.1| PREDICTED: pentatricopeptide repeat-containing protein At5g18390,
            mitochondrial-like isoform X1 [Cicer arietinum]
            gi|502142093|ref|XP_004504789.1| PREDICTED:
            pentatricopeptide repeat-containing protein At5g18390,
            mitochondrial-like isoform X2 [Cicer arietinum]
          Length = 453

 Score =  357 bits (916), Expect = 5e-96
 Identities = 183/327 (55%), Positives = 228/327 (69%), Gaps = 5/327 (1%)
 Frame = +1

Query: 148  ISPTNGMLKYLRTIMTTSDRVGNCKNTSNDDYFAAIHHISNIVRRDIYMERTLNKMCISN 327
            +S    +L + +T+ T +        TS D+YFAAI H++NIVRRD Y+ERTLNK+ I+ 
Sbjct: 13   LSKPKPLLHHHKTLTTAT--------TSKDEYFAAIQHVANIVRRDFYLERTLNKLRIT- 63

Query: 328  IVNSELVYRVLRSCCRCGIESFRFFNWARTNH--PNYDPXXXXXXXXXXXXSRTGHWETM 501
             +  ELV+RVLR+C     ES RFFNWAR++H  P Y P            +   +++TM
Sbjct: 64   -ITPELVFRVLRACSSSPTESLRFFNWARSHHHHPPYTPTSVEFEQIVTILANANNYQTM 122

Query: 502  WKVAHQMKVQ-NXXXXXXXXXXXXEHYGKHGLIDQAVELFNRLKNFNCPQTTEVYNSLLF 678
            W + HQM    N            E YG+H  IDQ+V+LFN+ K FNCPQ   +YNSLLF
Sbjct: 123  WSIIHQMTHNHNLSLSPSAVSSLIESYGRHRHIDQSVQLFNKCKVFNCPQNLNLYNSLLF 182

Query: 679  ALCEVKNFQGAYALIRRMLRKDAVPDKRTYAILVNGWCSAGKMREAQNFLEEMSRKGFNP 858
            ALCE K F  AYALIRRM+RK   PDKRTYA+LVN WCS GKMREAQ FL+EMS KGF P
Sbjct: 183  ALCESKLFHAAYALIRRMIRKGINPDKRTYALLVNAWCSTGKMREAQQFLKEMSDKGFTP 242

Query: 859  PVRGRDLLIDGLLNAGYLEAAKGLVRKMTKEGFVPDVGTFNSLAEAICKSG--EIDFCID 1032
            PVRGRDLLI+GLLNAGY+E+AKG+VRKM KEG +PDVGTFN+L E+ICK G  EI FCID
Sbjct: 243  PVRGRDLLIEGLLNAGYIESAKGMVRKMVKEGIIPDVGTFNALMESICKCGDDEIKFCID 302

Query: 1033 LFNDVCRLGLSPDVD*YKIMVTAASKV 1113
            L++++C LG+ PDV+ YKI+V A SK+
Sbjct: 303  LYHELCSLGMVPDVNTYKILVPAVSKI 329


>ref|XP_004296694.1| PREDICTED: pentatricopeptide repeat-containing protein At5g18390,
            mitochondrial-like [Fragaria vesca subsp. vesca]
          Length = 444

 Score =  352 bits (904), Expect = 1e-94
 Identities = 175/298 (58%), Positives = 219/298 (73%), Gaps = 2/298 (0%)
 Frame = +1

Query: 226  TSNDDYFAAIHHISNIVRRDIYMERTLNKMCISNIVNSELVYRVLRSCCRCGIESFRFFN 405
            T+ DDYF+AIHHI+NIVRRD +MERTLNK+ I   ++S+LV+RVLR+      ES RFFN
Sbjct: 25   TTKDDYFSAIHHITNIVRRDHFMERTLNKLRIP--IDSDLVFRVLRASSSSPTESLRFFN 82

Query: 406  WARTNHPNYDPXXXXXXXXXXXXSRTGHWETMWKVAHQMKVQNXXXXXXXXXXXX-EHYG 582
            WART+HP+Y P            +R+  +E+MWK+   MK  +             + YG
Sbjct: 83   WARTHHPSYHPTSLETEELVKTLARSKKYESMWKILDSMKTHHALTLSESTLCFIIQEYG 142

Query: 583  KHGLIDQAVELFNRLKN-FNCPQTTEVYNSLLFALCEVKNFQGAYALIRRMLRKDAVPDK 759
            KH LIDQAVELFNR  N FNC Q+ +VYN+LLF+LCE K F GAYAL+RR++RK  VP+K
Sbjct: 143  KHALIDQAVELFNRAPNTFNCLQSVQVYNALLFSLCETKLFHGAYALVRRLIRKGMVPNK 202

Query: 760  RTYAILVNGWCSAGKMREAQNFLEEMSRKGFNPPVRGRDLLIDGLLNAGYLEAAKGLVRK 939
             TY+ILVN WCS GKM+EAQ FLEEMS KGFNPPVRGRDLL++GLLNAGY+E AK +VRK
Sbjct: 203  MTYSILVNAWCSNGKMKEAQLFLEEMSEKGFNPPVRGRDLLVEGLLNAGYIEGAKDMVRK 262

Query: 940  MTKEGFVPDVGTFNSLAEAICKSGEIDFCIDLFNDVCRLGLSPDVD*YKIMVTAASKV 1113
            M KE  VP+V TFN+L EAICKSGE++FCI L+ +   LGL PD++ YK+M+ A SK+
Sbjct: 263  MVKENCVPEVSTFNALLEAICKSGEVEFCIALYWEATGLGLCPDINTYKVMIPAVSKI 320


>ref|XP_002522032.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223538836|gb|EEF40436.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 451

 Score =  348 bits (892), Expect = 3e-93
 Identities = 181/339 (53%), Positives = 231/339 (68%), Gaps = 2/339 (0%)
 Frame = +1

Query: 103  RPARYILIRITSPHLISPTNGMLKYLRTIMTTSDRVGNCKNTSNDDYFAAIHHISNIVRR 282
            R   +IL + T+   + P    L++L+ + +TS       N + D YFA IHHI+NIVRR
Sbjct: 4    RTKLFILPKTTATVTLQP----LRHLKVLASTST------NNTKDAYFALIHHITNIVRR 53

Query: 283  DIYMERTLNKMCISNIVNSELVYRVLRSCCRCGIESFRFFNWARTNHPNYDPXXXXXXXX 462
            D Y ERTLNK+  +  V SELV+RVLR+C R   ES RFFNW+R     Y P        
Sbjct: 54   DFYPERTLNKL--NAPVTSELVFRVLRACSRSPTESLRFFNWSRAY---YTPTSIEYEEL 108

Query: 463  XXXXSRTGHWETMWKVAHQMKVQNXXXXXXXXXXXX--EHYGKHGLIDQAVELFNRLKNF 636
                +++  + +MWK+  QMK QN              E YG+ GLIDQAVE+FN+  + 
Sbjct: 109  IKILAKSKRYSSMWKLITQMKDQNPQFSISSETVRSIIEEYGRSGLIDQAVEVFNQCNSL 168

Query: 637  NCPQTTEVYNSLLFALCEVKNFQGAYALIRRMLRKDAVPDKRTYAILVNGWCSAGKMREA 816
            NC Q  ++YNSLLFALCEVK F GAYAL+RR++RK   P+K TY++LVNGWCS GK +EA
Sbjct: 169  NCEQNVDIYNSLLFALCEVKLFHGAYALVRRLIRKGLAPNKTTYSVLVNGWCSNGKFKEA 228

Query: 817  QNFLEEMSRKGFNPPVRGRDLLIDGLLNAGYLEAAKGLVRKMTKEGFVPDVGTFNSLAEA 996
            Q FLEEMS+KGFNPPVRGRDLLI+GLLNAGY E+AK +V KM+KEGFVPDV TFN L EA
Sbjct: 229  QLFLEEMSKKGFNPPVRGRDLLIEGLLNAGYFESAKEMVFKMSKEGFVPDVNTFNCLIEA 288

Query: 997  ICKSGEIDFCIDLFNDVCRLGLSPDVD*YKIMVTAASKV 1113
            IC SGE+DFC+D++  + +LG  PD++ YKI++ A SKV
Sbjct: 289  ICNSGEVDFCVDMYYSLRKLGFCPDINSYKILIPAVSKV 327


>ref|XP_006479008.1| PREDICTED: pentatricopeptide repeat-containing protein At5g18390,
            mitochondrial-like [Citrus sinensis]
          Length = 445

 Score =  342 bits (878), Expect = 1e-91
 Identities = 174/296 (58%), Positives = 209/296 (70%), Gaps = 1/296 (0%)
 Frame = +1

Query: 226  TSNDDYFAAIHHISNIVRRDIYMERTLNKMCISNIVNSELVYRVLRSCCRCG-IESFRFF 402
            TS DDYFAA++HI+NIVR DIY ERTLN++ ++  + SELVYRVLR C      ES RFF
Sbjct: 28   TSKDDYFAAVNHIANIVRHDIYPERTLNRLNLT--LTSELVYRVLRVCHTTSPSESLRFF 85

Query: 403  NWARTNHPNYDPXXXXXXXXXXXXSRTGHWETMWKVAHQMKVQNXXXXXXXXXXXXEHYG 582
             WAR+  P Y P            +    +++MWK    MK  N            E +G
Sbjct: 86   TWARSQ-PQYSPTSLEFEPLILTLAHHKRYQSMWKTIELMKPYNLSVSPQTLSLIIEEFG 144

Query: 583  KHGLIDQAVELFNRLKNFNCPQTTEVYNSLLFALCEVKNFQGAYALIRRMLRKDAVPDKR 762
            KHGL+D AVE+FN+   FNC Q   +YNSLLFALCEVK F GAYALIRRM+RK  VPDKR
Sbjct: 145  KHGLVDNAVEVFNKCTAFNCQQCVLLYNSLLFALCEVKLFHGAYALIRRMIRKGFVPDKR 204

Query: 763  TYAILVNGWCSAGKMREAQNFLEEMSRKGFNPPVRGRDLLIDGLLNAGYLEAAKGLVRKM 942
            TYAILVN WCS+ KMREAQ FL+EMS KGFNPPVRGRDLL+ GLLNAGYLE+AK +V KM
Sbjct: 205  TYAILVNAWCSSWKMREAQEFLQEMSDKGFNPPVRGRDLLVQGLLNAGYLESAKQMVNKM 264

Query: 943  TKEGFVPDVGTFNSLAEAICKSGEIDFCIDLFNDVCRLGLSPDVD*YKIMVTAASK 1110
             K+G VPD+ TFNSL E ICKSGE++FC++++  VC+LGL  DV  YKI++ A SK
Sbjct: 265  IKQGSVPDLETFNSLIETICKSGEVEFCVEMYYSVCKLGLCADVSTYKILIPAVSK 320


>ref|XP_006400384.1| hypothetical protein EUTSA_v10013477mg [Eutrema salsugineum]
            gi|557101474|gb|ESQ41837.1| hypothetical protein
            EUTSA_v10013477mg [Eutrema salsugineum]
          Length = 461

 Score =  342 bits (878), Expect = 1e-91
 Identities = 173/293 (59%), Positives = 214/293 (73%), Gaps = 1/293 (0%)
 Frame = +1

Query: 238  DYFAAIHHISNIVRRDIYMERTLNKMCISNIVNSELVYRVLRSCCRCGIESFRFFNWART 417
            DYFAAI+H+ NIVRR+++ ER+LN++ +   V SE V+RVLR+  R   +S RFFNWAR+
Sbjct: 48   DYFAAINHVVNIVRREVHPERSLNRLRLP--VTSEFVFRVLRATSRSANDSLRFFNWARS 105

Query: 418  NHPNYDPXXXXXXXXXXXXSRTGHWETMWKVAHQMKVQNXXXXXXXXXXXXEHYGKHGLI 597
            + PNY P            +    +E+MWKV  QMK  +            E YGK+G +
Sbjct: 106  S-PNYTPTSIEYEQLAKSLASHKKYESMWKVLKQMKDLSLDISGETLCFIIEQYGKNGHV 164

Query: 598  DQAVELFNRL-KNFNCPQTTEVYNSLLFALCEVKNFQGAYALIRRMLRKDAVPDKRTYAI 774
            DQAVELFN + K   C QT EVYNSLL ALCEVK F GAYALIRRM+RK   PDKRTY++
Sbjct: 165  DQAVELFNGVPKTLGCQQTVEVYNSLLHALCEVKMFHGAYALIRRMIRKGLKPDKRTYSV 224

Query: 775  LVNGWCSAGKMREAQNFLEEMSRKGFNPPVRGRDLLIDGLLNAGYLEAAKGLVRKMTKEG 954
            LVNGWCSAGKM+EAQ FL+EMSRKGFNPP RGRDLLI+GLLNAGYLE+AK +V+KMTK G
Sbjct: 225  LVNGWCSAGKMKEAQEFLDEMSRKGFNPPARGRDLLIEGLLNAGYLESAKEMVKKMTKGG 284

Query: 955  FVPDVGTFNSLAEAICKSGEIDFCIDLFNDVCRLGLSPDVD*YKIMVTAASKV 1113
            FVPD+ TFN+L EAI KSGE+DFCI+++   C+LGL  D+D YK ++ A SK+
Sbjct: 285  FVPDIHTFNTLIEAISKSGEVDFCIEMYYTACKLGLCVDIDTYKTLIPAVSKI 337


>ref|XP_006287650.1| hypothetical protein CARUB_v10000860mg, partial [Capsella rubella]
            gi|482556356|gb|EOA20548.1| hypothetical protein
            CARUB_v10000860mg, partial [Capsella rubella]
          Length = 477

 Score =  338 bits (868), Expect = 2e-90
 Identities = 172/297 (57%), Positives = 214/297 (72%), Gaps = 1/297 (0%)
 Frame = +1

Query: 226  TSNDDYFAAIHHISNIVRRDIYMERTLNKMCISNIVNSELVYRVLRSCCRCGIESFRFFN 405
            ++  DYFAAI+H+ NIVRR+I+ ER+LN + +   V SE V+RVLR+  R   +S RFFN
Sbjct: 60   STKGDYFAAINHVVNIVRREIHPERSLNSLRLP--VTSEFVFRVLRATSRSANDSLRFFN 117

Query: 406  WARTNHPNYDPXXXXXXXXXXXXSRTGHWETMWKVAHQMKVQNXXXXXXXXXXXXEHYGK 585
            WAR+N P+Y P            +    +E+MWK+  QMK  +            E YGK
Sbjct: 118  WARSN-PSYTPTSMEYEQLAKSLASHKKYESMWKILKQMKDLSLDISGETLCFIIEQYGK 176

Query: 586  HGLIDQAVELFNRL-KNFNCPQTTEVYNSLLFALCEVKNFQGAYALIRRMLRKDAVPDKR 762
            +G +DQAVELFN + K   C QT +VYNSLL ALC+VK F GAYALIRRM+RK   PDKR
Sbjct: 177  NGHVDQAVELFNGVPKTLGCQQTVDVYNSLLHALCDVKMFHGAYALIRRMIRKGLKPDKR 236

Query: 763  TYAILVNGWCSAGKMREAQNFLEEMSRKGFNPPVRGRDLLIDGLLNAGYLEAAKGLVRKM 942
            TYAILVNGWCSAGKM+EAQ FL+EMSRKGFNPP RGRDLLI+GLLNAGYLE+AK +V KM
Sbjct: 237  TYAILVNGWCSAGKMKEAQEFLDEMSRKGFNPPARGRDLLIEGLLNAGYLESAKEMVSKM 296

Query: 943  TKEGFVPDVGTFNSLAEAICKSGEIDFCIDLFNDVCRLGLSPDVD*YKIMVTAASKV 1113
            TK GFVPD+ TFN+L EAI KSGE++FCI+++   C+LGL  D+D YK ++ A SK+
Sbjct: 297  TKGGFVPDIQTFNTLIEAISKSGEVEFCIEMYYTACKLGLCVDIDTYKTLIPAVSKI 353


>ref|NP_197340.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|223635760|sp|Q94JX6.2|PP391_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At5g18390, mitochondrial; Flags: Precursor
            gi|332005166|gb|AED92549.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 459

 Score =  336 bits (861), Expect = 1e-89
 Identities = 171/293 (58%), Positives = 211/293 (72%), Gaps = 1/293 (0%)
 Frame = +1

Query: 238  DYFAAIHHISNIVRRDIYMERTLNKMCISNIVNSELVYRVLRSCCRCGIESFRFFNWART 417
            DYFAAI+H+ NIVRR+I+ ER+LN + +   V SE V+RVLR+  R   +S RFFNWAR+
Sbjct: 46   DYFAAINHVVNIVRREIHPERSLNSLRLP--VTSEFVFRVLRATSRSSNDSLRFFNWARS 103

Query: 418  NHPNYDPXXXXXXXXXXXXSRTGHWETMWKVAHQMKVQNXXXXXXXXXXXXEHYGKHGLI 597
            N P+Y P            +    +E+MWK+  QMK  +            E YGK+G +
Sbjct: 104  N-PSYTPTSMEYEELAKSLASHKKYESMWKILKQMKDLSLDISGETLCFIIEQYGKNGHV 162

Query: 598  DQAVELFNRL-KNFNCPQTTEVYNSLLFALCEVKNFQGAYALIRRMLRKDAVPDKRTYAI 774
            DQAVELFN + K   C QT +VYNSLL ALC+VK F GAYALIRRM+RK   PDKRTYAI
Sbjct: 163  DQAVELFNGVPKTLGCQQTVDVYNSLLHALCDVKMFHGAYALIRRMIRKGLKPDKRTYAI 222

Query: 775  LVNGWCSAGKMREAQNFLEEMSRKGFNPPVRGRDLLIDGLLNAGYLEAAKGLVRKMTKEG 954
            LVNGWCSAGKM+EAQ FL+EMSR+GFNPP RGRDLLI+GLLNAGYLE+AK +V KMTK G
Sbjct: 223  LVNGWCSAGKMKEAQEFLDEMSRRGFNPPARGRDLLIEGLLNAGYLESAKEMVSKMTKGG 282

Query: 955  FVPDVGTFNSLAEAICKSGEIDFCIDLFNDVCRLGLSPDVD*YKIMVTAASKV 1113
            FVPD+ TFN L EAI KSGE++FCI+++   C+LGL  D+D YK ++ A SK+
Sbjct: 283  FVPDIQTFNILIEAISKSGEVEFCIEMYYTACKLGLCVDIDTYKTLIPAVSKI 335


>ref|XP_002871814.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297317651|gb|EFH48073.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 459

 Score =  336 bits (861), Expect = 1e-89
 Identities = 171/293 (58%), Positives = 212/293 (72%), Gaps = 1/293 (0%)
 Frame = +1

Query: 238  DYFAAIHHISNIVRRDIYMERTLNKMCISNIVNSELVYRVLRSCCRCGIESFRFFNWART 417
            DYFAAI+H+ NIVRR+I+ ER+LN + +   V SE V+RVLR+  R   +S RFFNWAR+
Sbjct: 46   DYFAAINHVVNIVRREIHPERSLNSLRLP--VTSEFVFRVLRATSRSANDSLRFFNWARS 103

Query: 418  NHPNYDPXXXXXXXXXXXXSRTGHWETMWKVAHQMKVQNXXXXXXXXXXXXEHYGKHGLI 597
            N P+Y P            +    +E+MWK+  QMK  +            E YGK+G +
Sbjct: 104  N-PSYTPTSMEYEELAKSLASHKKYESMWKILKQMKDLSLDISGETLCFIIEQYGKNGHV 162

Query: 598  DQAVELFNRL-KNFNCPQTTEVYNSLLFALCEVKNFQGAYALIRRMLRKDAVPDKRTYAI 774
            DQAVELFN + K   C QT +VYN+LL ALC+VK F GAYALIRRM+RK   PDKRTYAI
Sbjct: 163  DQAVELFNGVPKTLGCQQTVDVYNALLHALCDVKMFHGAYALIRRMIRKGLKPDKRTYAI 222

Query: 775  LVNGWCSAGKMREAQNFLEEMSRKGFNPPVRGRDLLIDGLLNAGYLEAAKGLVRKMTKEG 954
            LVNGWCSAGKM+EAQ FL+EMSRKGFNPP RGRDLLI+GLLNAGYLE+AK +V KMTK G
Sbjct: 223  LVNGWCSAGKMKEAQEFLDEMSRKGFNPPARGRDLLIEGLLNAGYLESAKEIVDKMTKGG 282

Query: 955  FVPDVGTFNSLAEAICKSGEIDFCIDLFNDVCRLGLSPDVD*YKIMVTAASKV 1113
            FVPD+ TFN+L EAI KSGE++FCI+++   C+LGL  D+D YK ++ A SK+
Sbjct: 283  FVPDILTFNTLIEAISKSGEVEFCIEMYYTACKLGLCVDIDTYKTLIPAVSKI 335


>gb|ESW31089.1| hypothetical protein PHAVU_002G208300g [Phaseolus vulgaris]
          Length = 448

 Score =  330 bits (847), Expect = 5e-88
 Identities = 168/323 (52%), Positives = 219/323 (67%), Gaps = 1/323 (0%)
 Frame = +1

Query: 145  LISPTNGMLKYLRTIMTTSDRVGNCKNTSNDDYFAAIHHISNIVRRDIYMERTLNKMCIS 324
            +++PT  +L    +I  T     + +    D YFA IHHISNIVRRD Y+ERTLNK+ I 
Sbjct: 9    ILTPTKTLLLNFHSIPKTLTTAASAR----DQYFAVIHHISNIVRRDFYLERTLNKLRIH 64

Query: 325  NIVNSELVYRVLRSCCRCGIESFRFFNWARTNHPNYDPXXXXXXXXXXXXSRTGHWETMW 504
              V  ELV+RVLR+C      S RFFNWAR+ HP+Y P            +R  +++TMW
Sbjct: 65   --VTPELVFRVLRACSTAPTPSLRFFNWARS-HPSYTPTSLEFEQIVTTLARANNYQTMW 121

Query: 505  KVAHQMKVQNXXXXXXXXXXXX-EHYGKHGLIDQAVELFNRLKNFNCPQTTEVYNSLLFA 681
             +  Q+ + +             + YG H  IDQAVE+FN+    NCPQT  +YN+LL +
Sbjct: 122  SLIRQVTLHHRLSLSPAAVATLIDAYGHHRHIDQAVEVFNKAPILNCPQTLPLYNALLKS 181

Query: 682  LCEVKNFQGAYALIRRMLRKDAVPDKRTYAILVNGWCSAGKMREAQNFLEEMSRKGFNPP 861
            LC  + F GAYAL+RRMLRK   PDK TYA+LVN WCS+GK+REA+ FL EMS KGFNPP
Sbjct: 182  LCHNRLFHGAYALLRRMLRKGLHPDKATYAVLVNAWCSSGKLREAKLFLREMSEKGFNPP 241

Query: 862  VRGRDLLIDGLLNAGYLEAAKGLVRKMTKEGFVPDVGTFNSLAEAICKSGEIDFCIDLFN 1041
            +RGRDLL++GLLNAGY+E+AKG+VRKM KEG VPDV TFN++ E +CK  E+ FC+DL++
Sbjct: 242  LRGRDLLVEGLLNAGYVESAKGMVRKMIKEGIVPDVETFNAVVETVCKE-EVQFCVDLYH 300

Query: 1042 DVCRLGLSPDVD*YKIMVTAASK 1110
            +VC LG+ PDV+ YKI++ A SK
Sbjct: 301  EVCALGMVPDVNTYKILIPAVSK 323


>ref|XP_003524064.1| PREDICTED: pentatricopeptide repeat-containing protein At5g18390,
            mitochondrial-like isoform X1 [Glycine max]
            gi|571455122|ref|XP_006579993.1| PREDICTED:
            pentatricopeptide repeat-containing protein At5g18390,
            mitochondrial-like isoform X2 [Glycine max]
            gi|571455124|ref|XP_006579994.1| PREDICTED:
            pentatricopeptide repeat-containing protein At5g18390,
            mitochondrial-like isoform X3 [Glycine max]
          Length = 450

 Score =  322 bits (824), Expect = 2e-85
 Identities = 168/340 (49%), Positives = 226/340 (66%), Gaps = 2/340 (0%)
 Frame = +1

Query: 97   MFRPARYILIRITSPHLISPTNGMLKYLRTIMTTSDRVGNCKNTSNDDYFAAIHHISNIV 276
            M +    +++R + P L+   + + K L T            ++S D+YFA IHH+SNIV
Sbjct: 1    MLQTCSKLILRHSKPRLLLNLHSITKTLTTA-----------SSSRDEYFAVIHHVSNIV 49

Query: 277  RRDIYMERTLNKMCISNIVNSELVYRVLRSCCRCGIESFRFFNWARTNHPNYDPXXXXXX 456
            RRD Y+ERTLNK+ I+  V  ELV+RVLR+C     ES RFFNWART HP+Y P      
Sbjct: 50   RRDFYLERTLNKLRIT--VTPELVFRVLRACSNNPTESLRFFNWART-HPSYSPTSLEFE 106

Query: 457  XXXXXXSRTGHWETMWKVAHQMKVQNXXXXXXXXXXXX-EHYGKHGLIDQAVELFNRLKN 633
                  +R   +++MW +  Q+ + +             E YG +  +DQ+V++FN+   
Sbjct: 107  QIVTTLARANTYQSMWALIRQVTLHHRLSLSPSAVASVIEAYGDNRHVDQSVQVFNKSPL 166

Query: 634  F-NCPQTTEVYNSLLFALCEVKNFQGAYALIRRMLRKDAVPDKRTYAILVNGWCSAGKMR 810
              NCPQT  +YN+LL +LC  K F GAYAL+RRMLRK   PDK TYA+LVN WCS GK+R
Sbjct: 167  LLNCPQTLPLYNALLRSLCHNKLFHGAYALVRRMLRKGLRPDKTTYAVLVNAWCSNGKLR 226

Query: 811  EAQNFLEEMSRKGFNPPVRGRDLLIDGLLNAGYLEAAKGLVRKMTKEGFVPDVGTFNSLA 990
            EA+ FLEEMS KGFNPPVRGRDLL++GLLNAGY+E+AKG+VR M K+G VPDVGTFN++ 
Sbjct: 227  EAKLFLEEMSEKGFNPPVRGRDLLVEGLLNAGYVESAKGMVRNMIKQGSVPDVGTFNAVV 286

Query: 991  EAICKSGEIDFCIDLFNDVCRLGLSPDVD*YKIMVTAASK 1110
            E + K  ++ FC+ L+++VC LG++PDV+ YKI+V A SK
Sbjct: 287  ETVSKE-DVQFCVGLYHEVCALGMAPDVNTYKILVPAVSK 325



 Score = 58.2 bits (139), Expect = 6e-06
 Identities = 35/114 (30%), Positives = 53/114 (46%)
 Frame = +1

Query: 583 KHGLIDQAVELFNRLKNFNCPQTTEVYNSLLFALCEVKNFQGAYALIRRMLRKDAVPDKR 762
           K G++D+A  L N            +Y  ++ ALC    F  A+     M  K   P++ 
Sbjct: 325 KSGMVDEAFRLLNNFIEDGHKPFPSLYAPVIKALCRRGQFDDAFCFFGDMKAKAHPPNRP 384

Query: 763 TYAILVNGWCSAGKMREAQNFLEEMSRKGFNPPVRGRDLLIDGLLNAGYLEAAK 924
            Y +L+     AGK  EA N++ EM+  G  P  R  D++ DGL N G  + A+
Sbjct: 385 LYTMLITMCGRAGKFVEAANYIFEMTEMGLVPISRCFDMVTDGLKNCGKHDLAR 438


>emb|CAN80394.1| hypothetical protein VITISV_001596 [Vitis vinifera]
          Length = 356

 Score =  308 bits (788), Expect = 3e-81
 Identities = 161/303 (53%), Positives = 205/303 (67%), Gaps = 14/303 (4%)
 Frame = +1

Query: 244  FAAIHHISNIVRRDIYMERTLNKMCISNIVNSELVYRVLRSCCRCGIESFRFFNWARTNH 423
            FA +HHIS IVRRD Y+ERTLNK+ IS  V S+LVYRVLRSC   G ES RFFNWAR+ H
Sbjct: 4    FAVVHHISAIVRRDFYLERTLNKLPIS--VTSDLVYRVLRSCPNSGTESLRFFNWARS-H 60

Query: 424  PNYDPXXXXXXXXXXXXSRTGHWETMWKVAHQMKVQNXXXXXXXXXXXXEHYGKHGLIDQ 603
             +Y P            +RT  ++ MWK+AHQM+  +            E +GKHGL+DQ
Sbjct: 61   XSYQPTTLEYEELLKTLARTKQFQPMWKIAHQMQTLSPTVVSSII----EEFGKHGLVDQ 116

Query: 604  AVELFNRLKN-FNCPQTTEVYNSLLFALCEVKNFQGAYALIRRMLRKDAVPDKRTYAILV 780
            AVE+FN+ K+  NCPQT EVYNSLLFALCEVK F GAYALIRRM+RK   P+K+TY++LV
Sbjct: 117  AVEVFNKAKSALNCPQTIEVYNSLLFALCEVKYFHGAYALIRRMIRKGVTPNKQTYSVLV 176

Query: 781  NGWCSAGKMREAQNFLEEMSRKGFNPPVRGRDLLIDGLLNAGYLEAAKGLVRKMT----- 945
            NGWC+AGKM+EAQ+FLEEMSRKGFNPPVRGRDLL+DGLLNAGYLEAAK ++ K       
Sbjct: 177  NGWCAAGKMKEAQDFLEEMSRKGFNPPVRGRDLLVDGLLNAGYLEAAKEMLAKRVELMRP 236

Query: 946  --------KEGFVPDVGTFNSLAEAICKSGEIDFCIDLFNDVCRLGLSPDVD*YKIMVTA 1101
                    ++G  P    +  + +A+C++G+ D     F+D+   G  P+   Y +++T 
Sbjct: 237  FRILHRSIEDGHRPFPSLYAPIIKALCRNGQFDDAFCFFSDMKVKGHPPNRPVYTMLITM 296

Query: 1102 ASK 1110
              +
Sbjct: 297  CGR 299

