BLASTX nr result

ID: Mentha27_contig00016723 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha27_contig00016723
         (887 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU24286.1| hypothetical protein MIMGU_mgv1a025107mg [Mimulus...   445   e-122
gb|EYU18955.1| hypothetical protein MIMGU_mgv1a022111mg [Mimulus...   437   e-120
ref|XP_002284744.1| PREDICTED: pentatricopeptide repeat-containi...   410   e-112
ref|XP_007017888.1| Pentatricopeptide repeat (PPR) superfamily p...   409   e-111
gb|EXB68664.1| hypothetical protein L484_024678 [Morus notabilis]     406   e-111
ref|XP_004292932.1| PREDICTED: pentatricopeptide repeat-containi...   395   e-108
ref|XP_007227203.1| hypothetical protein PRUPE_ppa019251mg [Prun...   395   e-108
ref|XP_007152607.1| hypothetical protein PHAVU_004G144300g [Phas...   393   e-107
ref|XP_002301973.2| pentatricopeptide repeat-containing family p...   390   e-106
ref|XP_004250454.1| PREDICTED: pentatricopeptide repeat-containi...   385   e-105
ref|XP_003549191.1| PREDICTED: pentatricopeptide repeat-containi...   384   e-104
gb|EPS74339.1| hypothetical protein M569_00411 [Genlisea aurea]       383   e-104
ref|XP_006416418.1| hypothetical protein EUTSA_v10009574mg [Eutr...   375   e-101
ref|XP_006587447.1| PREDICTED: pentatricopeptide repeat-containi...   374   e-101
ref|NP_173449.1| pentatricopeptide repeat-containing protein [Ar...   371   e-100
ref|XP_006306841.1| hypothetical protein CARUB_v10008385mg [Caps...   371   e-100
ref|XP_002890375.1| pentatricopeptide repeat-containing protein ...   367   3e-99
ref|XP_006655248.1| PREDICTED: pentatricopeptide repeat-containi...   359   9e-97
ref|XP_004963823.1| PREDICTED: pentatricopeptide repeat-containi...   358   2e-96
ref|NP_001055349.1| Os05g0370000 [Oryza sativa Japonica Group] g...   358   2e-96

>gb|EYU24286.1| hypothetical protein MIMGU_mgv1a025107mg [Mimulus guttatus]
          Length = 654

 Score =  445 bits (1145), Expect = e-122
 Identities = 203/257 (78%), Positives = 238/257 (92%)
 Frame = -2

Query: 886  LTEEGYHYFESMSKAYGVEPRVEHYACMASLLGRAGKLEEAYSLITEMPHVPDACVWGAL 707
            LTEEG+ YF+ M+  +G++PRVEHYAC+ SLLGRAGKLEEAYS+I +MP  PDACVWGAL
Sbjct: 398  LTEEGHRYFDRMTTDHGIKPRVEHYACVVSLLGRAGKLEEAYSMIEKMPFEPDACVWGAL 457

Query: 706  LSSC*LHHNVSLGELAAEKLFELEPDNPGNYILLSNIYASKGKWEEVDKVRDMMREKGLK 527
            LSSC +HHN+SLGE+AA KLFELEP NPGNYIL+SNIYASKG+++EVDK+RD+MR+KGL+
Sbjct: 458  LSSCRVHHNMSLGEVAARKLFELEPMNPGNYILMSNIYASKGRYKEVDKIRDIMRDKGLR 517

Query: 526  KNPGCSWIELKTKIHMIVAGDKSLPQMAQILDRLAKVSSEMKKAGYSPKTDWVLQDVEEQ 347
            KNPGCSWIE+K K+HM++AGDKSLPQMAQI+D+L ++S EMKKAGYSP TD+VLQDVEEQ
Sbjct: 518  KNPGCSWIEVKNKVHMLLAGDKSLPQMAQIMDKLNRLSIEMKKAGYSPNTDYVLQDVEEQ 577

Query: 346  EKEHILCGHSEKLAVIFGIINTTEGSPLRITKNLRICGDCHAVMKFISRSERREIFVRDA 167
            EKEHILCGHSEKLAV+FGI+NT+ GSPLR+TKNLRICGDCHAV+KFISR ERREIFVRD 
Sbjct: 578  EKEHILCGHSEKLAVVFGILNTSPGSPLRVTKNLRICGDCHAVIKFISRFERREIFVRDT 637

Query: 166  NRYHHFKDGECSCQDYW 116
            NRYHHFKDG+CSC DYW
Sbjct: 638  NRYHHFKDGDCSCGDYW 654


>gb|EYU18955.1| hypothetical protein MIMGU_mgv1a022111mg [Mimulus guttatus]
          Length = 654

 Score =  437 bits (1125), Expect = e-120
 Identities = 200/257 (77%), Positives = 235/257 (91%)
 Frame = -2

Query: 886  LTEEGYHYFESMSKAYGVEPRVEHYACMASLLGRAGKLEEAYSLITEMPHVPDACVWGAL 707
            LTEEG+ YF+ M+  +G++PRVEHYAC+ SLLGRAGKLEEAYS+I +MP  PDACVWGAL
Sbjct: 398  LTEEGHRYFDRMTTDHGIKPRVEHYACVVSLLGRAGKLEEAYSMIEKMPFEPDACVWGAL 457

Query: 706  LSSC*LHHNVSLGELAAEKLFELEPDNPGNYILLSNIYASKGKWEEVDKVRDMMREKGLK 527
            LSSC +HHN+SLG +AA KLFELEP NPGNYILLSNIYASKG+++EVDK+RD+M +KGL+
Sbjct: 458  LSSCRVHHNMSLGGVAARKLFELEPKNPGNYILLSNIYASKGRYKEVDKIRDIMGDKGLR 517

Query: 526  KNPGCSWIELKTKIHMIVAGDKSLPQMAQILDRLAKVSSEMKKAGYSPKTDWVLQDVEEQ 347
            KNPGCSWIE+K K+HM++AGDKSLPQMAQI+++L ++S EMKKAGYSP TD+VLQDVEEQ
Sbjct: 518  KNPGCSWIEVKNKVHMLLAGDKSLPQMAQIMEKLNRLSIEMKKAGYSPNTDYVLQDVEEQ 577

Query: 346  EKEHILCGHSEKLAVIFGIINTTEGSPLRITKNLRICGDCHAVMKFISRSERREIFVRDA 167
            EKEHILCGHSEKLAV+FGI+N + GSPLR+TKNLRICGDCHAV+KFISR ERREIFVRD 
Sbjct: 578  EKEHILCGHSEKLAVVFGILNMSPGSPLRVTKNLRICGDCHAVIKFISRFERREIFVRDT 637

Query: 166  NRYHHFKDGECSCQDYW 116
            NRYHHFKDG+CSC DYW
Sbjct: 638  NRYHHFKDGDCSCGDYW 654


>ref|XP_002284744.1| PREDICTED: pentatricopeptide repeat-containing protein At1g20230
            [Vitis vinifera]
          Length = 758

 Score =  410 bits (1055), Expect = e-112
 Identities = 189/257 (73%), Positives = 222/257 (86%)
 Frame = -2

Query: 886  LTEEGYHYFESMSKAYGVEPRVEHYACMASLLGRAGKLEEAYSLITEMPHVPDACVWGAL 707
            LTEEG +YF SMS  YG+E RVEHYACM +LL RAGKLE+AY++I  MP  PDACVWGAL
Sbjct: 502  LTEEGSYYFNSMSSKYGIEARVEHYACMVTLLSRAGKLEQAYAMIRRMPVNPDACVWGAL 561

Query: 706  LSSC*LHHNVSLGELAAEKLFELEPDNPGNYILLSNIYASKGKWEEVDKVRDMMREKGLK 527
            LSSC +H+NVSLGE+AAEKLFELEP NPGNYILLSNIYASKG W EV++VRDMM+ KGL+
Sbjct: 562  LSSCRVHNNVSLGEVAAEKLFELEPSNPGNYILLSNIYASKGMWNEVNRVRDMMKNKGLR 621

Query: 526  KNPGCSWIELKTKIHMIVAGDKSLPQMAQILDRLAKVSSEMKKAGYSPKTDWVLQDVEEQ 347
            KNPGCSWIE+K K+HM++AGDKS PQM QI+++L K+S EMKK GY P+ ++VLQDVEEQ
Sbjct: 622  KNPGCSWIEVKNKVHMLLAGDKSHPQMTQIIEKLDKLSMEMKKLGYFPEINFVLQDVEEQ 681

Query: 346  EKEHILCGHSEKLAVIFGIINTTEGSPLRITKNLRICGDCHAVMKFISRSERREIFVRDA 167
            +KE ILCGHSEKLAV+FG++NT  G PL++ KNLRICGDCH V+KFIS  ERREIFVRD 
Sbjct: 682  DKEQILCGHSEKLAVVFGLLNTPPGYPLQVIKNLRICGDCHVVIKFISSFERREIFVRDT 741

Query: 166  NRYHHFKDGECSCQDYW 116
            NR+HHFK+G CSC DYW
Sbjct: 742  NRFHHFKEGACSCGDYW 758


>ref|XP_007017888.1| Pentatricopeptide repeat (PPR) superfamily protein [Theobroma cacao]
            gi|508723216|gb|EOY15113.1| Pentatricopeptide repeat
            (PPR) superfamily protein [Theobroma cacao]
          Length = 758

 Score =  409 bits (1050), Expect = e-111
 Identities = 183/257 (71%), Positives = 227/257 (88%)
 Frame = -2

Query: 886  LTEEGYHYFESMSKAYGVEPRVEHYACMASLLGRAGKLEEAYSLITEMPHVPDACVWGAL 707
            LTEEG+H+F SMS+ +GV+ ++EHY+CM +LLGR+GKLE+AY+LI +MP  PDACVWGAL
Sbjct: 502  LTEEGWHFFNSMSRDHGVKAKMEHYSCMVNLLGRSGKLEQAYALIQQMPFEPDACVWGAL 561

Query: 706  LSSC*LHHNVSLGELAAEKLFELEPDNPGNYILLSNIYASKGKWEEVDKVRDMMREKGLK 527
            LSSC LH+N+SLGE+AA+ LF+LEP NPGNYILLSNIYASKG W+EVD VRD+MR +G+K
Sbjct: 562  LSSCRLHNNISLGEIAAQNLFKLEPSNPGNYILLSNIYASKGMWDEVDAVRDVMRSRGMK 621

Query: 526  KNPGCSWIELKTKIHMIVAGDKSLPQMAQILDRLAKVSSEMKKAGYSPKTDWVLQDVEEQ 347
            KNPGCSWIE+K ++HM++AGDKS PQM +I++++ K+S +MKKAGY P TD+VLQDV+EQ
Sbjct: 622  KNPGCSWIEIKNQVHMLLAGDKSHPQMTEIIEKIYKLSMDMKKAGYLPNTDFVLQDVDEQ 681

Query: 346  EKEHILCGHSEKLAVIFGIINTTEGSPLRITKNLRICGDCHAVMKFISRSERREIFVRDA 167
            +KE ILCGHSEKLAV FG++NT  GSPL+I KNLRICGDCHAV+KFIS  E REI+VRD 
Sbjct: 682  DKEQILCGHSEKLAVAFGLLNTPPGSPLQIIKNLRICGDCHAVIKFISGFEGREIYVRDT 741

Query: 166  NRYHHFKDGECSCQDYW 116
            NR+HHFKDG CSC+DYW
Sbjct: 742  NRFHHFKDGVCSCRDYW 758


>gb|EXB68664.1| hypothetical protein L484_024678 [Morus notabilis]
          Length = 728

 Score =  406 bits (1043), Expect = e-111
 Identities = 185/257 (71%), Positives = 220/257 (85%)
 Frame = -2

Query: 886  LTEEGYHYFESMSKAYGVEPRVEHYACMASLLGRAGKLEEAYSLITEMPHVPDACVWGAL 707
            LT+EG+HYF SMSK +G+E R+EHYACM +LLGR+GKLEEAYSLI +MP  PDACVWG+L
Sbjct: 472  LTDEGWHYFSSMSKEHGIEARLEHYACMVTLLGRSGKLEEAYSLINKMPMEPDACVWGSL 531

Query: 706  LSSC*LHHNVSLGELAAEKLFELEPDNPGNYILLSNIYASKGKWEEVDKVRDMMREKGLK 527
            LSSC +H+NVSLGE+AAEKLFELEP NPGNY++LSNIY SKG W +VD+VRDMM +KGL+
Sbjct: 532  LSSCRVHNNVSLGEVAAEKLFELEPRNPGNYVILSNIYGSKGMWSQVDRVRDMMNQKGLR 591

Query: 526  KNPGCSWIELKTKIHMIVAGDKSLPQMAQILDRLAKVSSEMKKAGYSPKTDWVLQDVEEQ 347
            KNPGCSWIE+K ++HM++AGDKS PQ  QI+ +L K+S EMK +GY P   +VLQDVEEQ
Sbjct: 592  KNPGCSWIEVKNEVHMLLAGDKSHPQRIQIIGKLNKLSMEMKNSGYFPNFTFVLQDVEEQ 651

Query: 346  EKEHILCGHSEKLAVIFGIINTTEGSPLRITKNLRICGDCHAVMKFISRSERREIFVRDA 167
            +K HILCGHSEKLAV FG++NT  GS LR+ KNLRICGDCH V+KFIS  E+REIFVRD 
Sbjct: 652  DKVHILCGHSEKLAVAFGLLNTPPGSSLRVIKNLRICGDCHVVIKFISSFEQREIFVRDT 711

Query: 166  NRYHHFKDGECSCQDYW 116
            NR+HHFKDG CSC DYW
Sbjct: 712  NRFHHFKDGHCSCGDYW 728


>ref|XP_004292932.1| PREDICTED: pentatricopeptide repeat-containing protein At1g20230-like
            [Fragaria vesca subsp. vesca]
          Length = 755

 Score =  395 bits (1016), Expect = e-108
 Identities = 183/257 (71%), Positives = 216/257 (84%)
 Frame = -2

Query: 886  LTEEGYHYFESMSKAYGVEPRVEHYACMASLLGRAGKLEEAYSLITEMPHVPDACVWGAL 707
            LTEEG++YF SMSK +G+E R+EHYACM +LLGRAGKL+EAYS+I +MP  PDACVWGAL
Sbjct: 499  LTEEGWYYFNSMSKEHGIEARIEHYACMVTLLGRAGKLDEAYSMIKKMPFEPDACVWGAL 558

Query: 706  LSSC*LHHNVSLGELAAEKLFELEPDNPGNYILLSNIYASKGKWEEVDKVRDMMREKGLK 527
            LSSC +H+NV+LGE  A+KLF LEP NPGNYILLSNIYASKG W EVD+VRD M+  GL+
Sbjct: 559  LSSCRVHNNVTLGESTAKKLFNLEPGNPGNYILLSNIYASKGMWTEVDRVRDTMKSLGLR 618

Query: 526  KNPGCSWIELKTKIHMIVAGDKSLPQMAQILDRLAKVSSEMKKAGYSPKTDWVLQDVEEQ 347
            KNPGCSWIE K  +HM++AGDK+ PQM +I ++L  +SSEMKK+GY P T +VLQDVEEQ
Sbjct: 619  KNPGCSWIEFKNNVHMLLAGDKTHPQMNKITEKLNTLSSEMKKSGYLPSTHFVLQDVEEQ 678

Query: 346  EKEHILCGHSEKLAVIFGIINTTEGSPLRITKNLRICGDCHAVMKFISRSERREIFVRDA 167
            EKE ILCGHSEKLAV+ G++NT  GS LR+ KNLRICGDCH+V+KFIS  E REI VRD 
Sbjct: 679  EKEQILCGHSEKLAVVLGLLNTPPGSSLRVIKNLRICGDCHSVIKFISSLEGREISVRDT 738

Query: 166  NRYHHFKDGECSCQDYW 116
            NR+HHFKDG CSC DYW
Sbjct: 739  NRFHHFKDGVCSCGDYW 755


>ref|XP_007227203.1| hypothetical protein PRUPE_ppa019251mg [Prunus persica]
            gi|462424139|gb|EMJ28402.1| hypothetical protein
            PRUPE_ppa019251mg [Prunus persica]
          Length = 654

 Score =  395 bits (1016), Expect = e-108
 Identities = 183/257 (71%), Positives = 217/257 (84%)
 Frame = -2

Query: 886  LTEEGYHYFESMSKAYGVEPRVEHYACMASLLGRAGKLEEAYSLITEMPHVPDACVWGAL 707
            LT+EG++YF SMSK +G+E RVEHYACM +LL R+GKLEEAYS+I +MP  PDACVWGAL
Sbjct: 398  LTDEGWYYFNSMSKEHGLEARVEHYACMVTLLSRSGKLEEAYSMIKQMPFEPDACVWGAL 457

Query: 706  LSSC*LHHNVSLGELAAEKLFELEPDNPGNYILLSNIYASKGKWEEVDKVRDMMREKGLK 527
            LSSC +H NV+LG+  A+KLF LEP NPGNYILLSNIYASKG W EVDKVRD M+  GL+
Sbjct: 458  LSSCRVHSNVTLGKYVAKKLFNLEPKNPGNYILLSNIYASKGMWSEVDKVRDKMKSLGLR 517

Query: 526  KNPGCSWIELKTKIHMIVAGDKSLPQMAQILDRLAKVSSEMKKAGYSPKTDWVLQDVEEQ 347
            KNPGCSWIE+K K+HM++AGDK+ PQM QI+++L K+SSEMKK GY P T +VLQDVEEQ
Sbjct: 518  KNPGCSWIEVKNKVHMLLAGDKAHPQMNQIIEKLNKLSSEMKKLGYFPNTHFVLQDVEEQ 577

Query: 346  EKEHILCGHSEKLAVIFGIINTTEGSPLRITKNLRICGDCHAVMKFISRSERREIFVRDA 167
            +KE ILCGHSEKLAV+ G++N+  GS LR+ KNLRICGDCHAV+KFIS  E REI VRD 
Sbjct: 578  DKEQILCGHSEKLAVVLGLLNSPPGSSLRVIKNLRICGDCHAVIKFISSFEGREISVRDT 637

Query: 166  NRYHHFKDGECSCQDYW 116
            N +HHFKDG CSC+DYW
Sbjct: 638  NLFHHFKDGVCSCEDYW 654


>ref|XP_007152607.1| hypothetical protein PHAVU_004G144300g [Phaseolus vulgaris]
            gi|561025916|gb|ESW24601.1| hypothetical protein
            PHAVU_004G144300g [Phaseolus vulgaris]
          Length = 601

 Score =  393 bits (1009), Expect = e-107
 Identities = 177/257 (68%), Positives = 221/257 (85%)
 Frame = -2

Query: 886  LTEEGYHYFESMSKAYGVEPRVEHYACMASLLGRAGKLEEAYSLITEMPHVPDACVWGAL 707
            LTEEG+HY+ SMSK +G+EP++EHYACM +LL R GKLEEAYS+I EMP  PDACVWGAL
Sbjct: 345  LTEEGWHYYNSMSKEHGIEPKMEHYACMVTLLSRVGKLEEAYSIIKEMPFEPDACVWGAL 404

Query: 706  LSSC*LHHNVSLGELAAEKLFELEPDNPGNYILLSNIYASKGKWEEVDKVRDMMREKGLK 527
            LSSC +H+N+SLGE+AAEKLF LEP NPGNY+LLSNIYASKG W+E +++R+MM+ KGL+
Sbjct: 405  LSSCRVHNNLSLGEIAAEKLFPLEPANPGNYVLLSNIYASKGLWDEENRIREMMKSKGLR 464

Query: 526  KNPGCSWIELKTKIHMIVAGDKSLPQMAQILDRLAKVSSEMKKAGYSPKTDWVLQDVEEQ 347
            KNPG SWIE+  K+HM++AGD+S PQM  IL++L K++ EMKK+GY PKT++VLQDVEEQ
Sbjct: 465  KNPGYSWIEVGHKVHMLLAGDQSHPQMKDILEKLDKLNMEMKKSGYLPKTNFVLQDVEEQ 524

Query: 346  EKEHILCGHSEKLAVIFGIINTTEGSPLRITKNLRICGDCHAVMKFISRSERREIFVRDA 167
            +KE ILCGHSEKLAV+ G++NT+ G PL++ KNLRIC DCHAV+K ISR E REI++RD 
Sbjct: 525  DKEQILCGHSEKLAVVLGLLNTSPGQPLQVIKNLRICDDCHAVIKAISRLEGREIYIRDT 584

Query: 166  NRYHHFKDGECSCQDYW 116
            NR+HH KDG CSC D+W
Sbjct: 585  NRFHHIKDGVCSCGDFW 601


>ref|XP_002301973.2| pentatricopeptide repeat-containing family protein [Populus
            trichocarpa] gi|550344115|gb|EEE81246.2|
            pentatricopeptide repeat-containing family protein
            [Populus trichocarpa]
          Length = 724

 Score =  390 bits (1001), Expect = e-106
 Identities = 179/257 (69%), Positives = 219/257 (85%)
 Frame = -2

Query: 886  LTEEGYHYFESMSKAYGVEPRVEHYACMASLLGRAGKLEEAYSLITEMPHVPDACVWGAL 707
            LTEEG+ YF+SMS+ +GVE R+EHY+CM +LLGR+G+LEEAY++I +MP  PD+CVWGAL
Sbjct: 468  LTEEGWFYFDSMSRNHGVEARMEHYSCMVTLLGRSGRLEEAYAMIKQMPFEPDSCVWGAL 527

Query: 706  LSSC*LHHNVSLGELAAEKLFELEPDNPGNYILLSNIYASKGKWEEVDKVRDMMREKGLK 527
            LSSC +H+ V LGE+AA+++FELEP NPGNYILLSNIYASK  W EVD VRDMMR +GLK
Sbjct: 528  LSSCRVHNRVDLGEIAAKRVFELEPRNPGNYILLSNIYASKAMWVEVDMVRDMMRSRGLK 587

Query: 526  KNPGCSWIELKTKIHMIVAGDKSLPQMAQILDRLAKVSSEMKKAGYSPKTDWVLQDVEEQ 347
            KNPG SWIE+K K+HM++AGD S PQM QI+++LAK++ EMKK+GY P TD+VLQDVEEQ
Sbjct: 588  KNPGYSWIEIKNKVHMLLAGDSSHPQMPQIIEKLAKLTVEMKKSGYVPHTDFVLQDVEEQ 647

Query: 346  EKEHILCGHSEKLAVIFGIINTTEGSPLRITKNLRICGDCHAVMKFISRSERREIFVRDA 167
            +KE ILCGHSEKLAV+ G++NT  G PL++ KNLRIC DCHAV+KFIS  E+REIFVRD 
Sbjct: 648  DKEQILCGHSEKLAVVLGLLNTKPGFPLQVIKNLRICRDCHAVIKFISDFEKREIFVRDT 707

Query: 166  NRYHHFKDGECSCQDYW 116
            NR+H FK G CSC DYW
Sbjct: 708  NRFHQFKGGVCSCGDYW 724


>ref|XP_004250454.1| PREDICTED: pentatricopeptide repeat-containing protein At1g20230-like
            [Solanum lycopersicum]
          Length = 828

 Score =  385 bits (990), Expect = e-105
 Identities = 179/257 (69%), Positives = 210/257 (81%)
 Frame = -2

Query: 886  LTEEGYHYFESMSKAYGVEPRVEHYACMASLLGRAGKLEEAYSLITEMPHVPDACVWGAL 707
            LTE+G HYF+ MS+ +G+E RVEHYACM SLLGR GKL+EAY +I+ MP  PDACVWGAL
Sbjct: 572  LTEQGQHYFDCMSRIHGLEARVEHYACMVSLLGRTGKLKEAYDMISTMPIEPDACVWGAL 631

Query: 706  LSSC*LHHNVSLGELAAEKLFELEPDNPGNYILLSNIYASKGKWEEVDKVRDMMREKGLK 527
            LSSC  H N+SLGE+AA+KLFELEP NPGNYILLSNIYAS  +W EVDKVRDMM+  GL 
Sbjct: 632  LSSCRTHRNMSLGEIAADKLFELEPKNPGNYILLSNIYASNNRWNEVDKVRDMMKHVGLS 691

Query: 526  KNPGCSWIELKTKIHMIVAGDKSLPQMAQILDRLAKVSSEMKKAGYSPKTDWVLQDVEEQ 347
            KNPGCSWIE+K K+HM++AGD   PQM QI+++L K+S +MK  G S  T+ VLQDVEEQ
Sbjct: 692  KNPGCSWIEIKNKVHMLLAGDDLHPQMPQIMEKLRKLSMDMKNTGVSHDTELVLQDVEEQ 751

Query: 346  EKEHILCGHSEKLAVIFGIINTTEGSPLRITKNLRICGDCHAVMKFISRSERREIFVRDA 167
            +KE ILCGHSEKLAV+ GI+NT  G+ LR+ KNLRICGDCH  +KFIS  E REI+VRDA
Sbjct: 752  DKELILCGHSEKLAVVLGILNTNPGTSLRVIKNLRICGDCHTFIKFISSFEGREIYVRDA 811

Query: 166  NRYHHFKDGECSCQDYW 116
            NRYHHF +G CSC DYW
Sbjct: 812  NRYHHFNEGICSCGDYW 828


>ref|XP_003549191.1| PREDICTED: pentatricopeptide repeat-containing protein At1g20230-like
            isoform X1 [Glycine max]
          Length = 748

 Score =  384 bits (986), Expect = e-104
 Identities = 174/257 (67%), Positives = 220/257 (85%)
 Frame = -2

Query: 886  LTEEGYHYFESMSKAYGVEPRVEHYACMASLLGRAGKLEEAYSLITEMPHVPDACVWGAL 707
            LTEEG+  + SMS+ +G+EP++EHYAC+ +LL R GKLEEAYS+I EMP  PDACVWGAL
Sbjct: 492  LTEEGWRCYNSMSEEHGIEPKMEHYACLVTLLSRVGKLEEAYSIIKEMPFEPDACVWGAL 551

Query: 706  LSSC*LHHNVSLGELAAEKLFELEPDNPGNYILLSNIYASKGKWEEVDKVRDMMREKGLK 527
            LSSC +H+N+SLGE+AAEKLF LEP NPGNYILLSNIYASKG W+E +++R++M+ KGL+
Sbjct: 552  LSSCRVHNNLSLGEIAAEKLFFLEPTNPGNYILLSNIYASKGLWDEENRIREVMKSKGLR 611

Query: 526  KNPGCSWIELKTKIHMIVAGDKSLPQMAQILDRLAKVSSEMKKAGYSPKTDWVLQDVEEQ 347
            KNPG SWIE+  K+HM++AGD+S PQM  IL++L K++ +MKK+GY PKT++VLQDVEEQ
Sbjct: 612  KNPGYSWIEVGHKVHMLLAGDQSHPQMKDILEKLDKLNMQMKKSGYLPKTNFVLQDVEEQ 671

Query: 346  EKEHILCGHSEKLAVIFGIINTTEGSPLRITKNLRICGDCHAVMKFISRSERREIFVRDA 167
            +KE ILCGHSEKLAV+ G++NT+ G PL++ KNLRIC DCHAV+K ISR E REI+VRD 
Sbjct: 672  DKEQILCGHSEKLAVVLGLLNTSPGQPLQVIKNLRICDDCHAVIKVISRLEGREIYVRDT 731

Query: 166  NRYHHFKDGECSCQDYW 116
            NR+HHFKDG CSC D+W
Sbjct: 732  NRFHHFKDGVCSCGDFW 748


>gb|EPS74339.1| hypothetical protein M569_00411 [Genlisea aurea]
          Length = 1063

 Score =  383 bits (983), Expect = e-104
 Identities = 184/259 (71%), Positives = 218/259 (84%), Gaps = 2/259 (0%)
 Frame = -2

Query: 886  LTEEGYHYFESMSKAYGVEPRVEHYACMASLLGRAGKLEEAYSLITEMPHVPDACVWGAL 707
            L EEG  YFESM + +G+EPR+EHYAC+  LLGRAGKL+EAY+ I  MP   DACVWGAL
Sbjct: 805  LAEEGRRYFESMFEDHGLEPRLEHYACIVGLLGRAGKLDEAYAKIKRMPFEADACVWGAL 864

Query: 706  LSSC*LHHNVSLGELAAEKLFELEPDNPGNYILLSNIYASKGKWEEVDKVRDMMREKGLK 527
            LSSC LH+N  LGE+AAEKLFELE  N GNYILLSNIYAS  KW+EV ++RDMM  KG+K
Sbjct: 865  LSSCALHNNEFLGEVAAEKLFELELGNSGNYILLSNIYASSRKWKEVRRIRDMMSLKGMK 924

Query: 526  KNPGCSWIELKTKIHMIVAGDKSLPQMAQILDRLAKVSSEMKKA-GYSPKTDWVLQDVEE 350
            KNPGCSWIE+K K+HMI+AGDK+LPQ+++I++RL +++ EMK A GY P T++VLQDVEE
Sbjct: 925  KNPGCSWIEVKNKVHMILAGDKALPQVSKIMERLKRLNQEMKGAGGYFPNTNYVLQDVEE 984

Query: 349  Q-EKEHILCGHSEKLAVIFGIINTTEGSPLRITKNLRICGDCHAVMKFISRSERREIFVR 173
            Q E+E ILCGHSEKLAV+FGI+NT+ GSP+R+TKNLRICGDCHAV+KFIS  E REI VR
Sbjct: 985  QEEREGILCGHSEKLAVVFGILNTSRGSPIRVTKNLRICGDCHAVIKFISGFEGREISVR 1044

Query: 172  DANRYHHFKDGECSCQDYW 116
            D NRYHHFKDG CSC DYW
Sbjct: 1045 DTNRYHHFKDGICSCGDYW 1063


>ref|XP_006416418.1| hypothetical protein EUTSA_v10009574mg [Eutrema salsugineum]
            gi|557094189|gb|ESQ34771.1| hypothetical protein
            EUTSA_v10009574mg [Eutrema salsugineum]
          Length = 760

 Score =  375 bits (963), Expect = e-101
 Identities = 168/257 (65%), Positives = 214/257 (83%)
 Frame = -2

Query: 886  LTEEGYHYFESMSKAYGVEPRVEHYACMASLLGRAGKLEEAYSLITEMPHVPDACVWGAL 707
            LT+EG+ YF  M++ YG++PR+EHY+CM SLLGRAGKL+EAY LI E+P  PD+CVWGAL
Sbjct: 504  LTDEGWKYFGMMTEEYGIKPRLEHYSCMVSLLGRAGKLQEAYDLIKEIPFEPDSCVWGAL 563

Query: 706  LSSC*LHHNVSLGELAAEKLFELEPDNPGNYILLSNIYASKGKWEEVDKVRDMMREKGLK 527
            L+SC L +NV L E+AAEKLF+LEP+NPG Y+LLSNIYA+KG W EVD VR+ M   GLK
Sbjct: 564  LNSCRLQNNVDLAEIAAEKLFDLEPENPGTYVLLSNIYAAKGMWAEVDSVRNKMESLGLK 623

Query: 526  KNPGCSWIELKTKIHMIVAGDKSLPQMAQILDRLAKVSSEMKKAGYSPKTDWVLQDVEEQ 347
            KNPGCSWI++K K++ ++AGDKS PQ+ QI +++ ++S EM+K+G+ P  D+ LQDVEEQ
Sbjct: 624  KNPGCSWIQVKNKVYTLLAGDKSHPQIEQITEKMDEISKEMRKSGHRPNLDFALQDVEEQ 683

Query: 346  EKEHILCGHSEKLAVIFGIINTTEGSPLRITKNLRICGDCHAVMKFISRSERREIFVRDA 167
            EKE IL GHSEKLAV+FG++NT +G+PL++ KNLRICGDCH+V+KFIS    REIFVRD 
Sbjct: 684  EKEQILLGHSEKLAVVFGLLNTPDGTPLQVIKNLRICGDCHSVIKFISGYAGREIFVRDT 743

Query: 166  NRYHHFKDGECSCQDYW 116
            NR+HHFKDG CSC D+W
Sbjct: 744  NRFHHFKDGICSCGDFW 760


>ref|XP_006587447.1| PREDICTED: pentatricopeptide repeat-containing protein At1g20230-like
            [Glycine max]
          Length = 601

 Score =  374 bits (959), Expect = e-101
 Identities = 172/257 (66%), Positives = 215/257 (83%)
 Frame = -2

Query: 886  LTEEGYHYFESMSKAYGVEPRVEHYACMASLLGRAGKLEEAYSLITEMPHVPDACVWGAL 707
            LTEEG+ Y+ SMS+ +G EP++EHYACM +LL R GKLEEAYS+I EMP  PDACV GAL
Sbjct: 345  LTEEGWRYYNSMSEEHGFEPKMEHYACMVTLLSRVGKLEEAYSIIKEMPFEPDACVRGAL 404

Query: 706  LSSC*LHHNVSLGELAAEKLFELEPDNPGNYILLSNIYASKGKWEEVDKVRDMMREKGLK 527
            LSSC +H+N+SLGE+ AEKLF LEP NPGNYI+LSNIYASKG W+E +++R++M+ KGL+
Sbjct: 405  LSSCRVHNNLSLGEITAEKLFLLEPTNPGNYIILSNIYASKGLWDEENRIREVMKSKGLR 464

Query: 526  KNPGCSWIELKTKIHMIVAGDKSLPQMAQILDRLAKVSSEMKKAGYSPKTDWVLQDVEEQ 347
            KNPG SWIE+  KIHM++AGD+S PQM  IL++L K++ EMKK+GY PK+++V QDVEE 
Sbjct: 465  KNPGYSWIEVGHKIHMLLAGDQSHPQMKDILEKLDKLNMEMKKSGYLPKSNFVWQDVEEH 524

Query: 346  EKEHILCGHSEKLAVIFGIINTTEGSPLRITKNLRICGDCHAVMKFISRSERREIFVRDA 167
            +KE ILCGHSEKLAV+ G++NT+ G PL++ KNLRIC DCHAV+K ISR E REI+VRD 
Sbjct: 525  DKEQILCGHSEKLAVVLGLLNTSPGQPLQVIKNLRICDDCHAVIKVISRLEGREIYVRDT 584

Query: 166  NRYHHFKDGECSCQDYW 116
            NR HHFKDG CSC D+W
Sbjct: 585  NRLHHFKDGVCSCGDFW 601


>ref|NP_173449.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|193806503|sp|Q9LNU6.2|PPR53_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At1g20230 gi|332191832|gb|AEE29953.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 760

 Score =  371 bits (953), Expect = e-100
 Identities = 164/257 (63%), Positives = 213/257 (82%)
 Frame = -2

Query: 886  LTEEGYHYFESMSKAYGVEPRVEHYACMASLLGRAGKLEEAYSLITEMPHVPDACVWGAL 707
            LT+EG+ YF+ MS+ YG++PR+EHY+CM +LLGRAGKL+EAY LI EMP  PD+CVWGAL
Sbjct: 504  LTDEGWKYFKMMSEEYGIKPRLEHYSCMVNLLGRAGKLQEAYDLIKEMPFEPDSCVWGAL 563

Query: 706  LSSC*LHHNVSLGELAAEKLFELEPDNPGNYILLSNIYASKGKWEEVDKVRDMMREKGLK 527
            L+SC L +NV L E+AAEKLF LEP+NPG Y+LLSNIYA+KG W EVD +R+ M   GLK
Sbjct: 564  LNSCRLQNNVDLAEIAAEKLFHLEPENPGTYVLLSNIYAAKGMWTEVDSIRNKMESLGLK 623

Query: 526  KNPGCSWIELKTKIHMIVAGDKSLPQMAQILDRLAKVSSEMKKAGYSPKTDWVLQDVEEQ 347
            KNPGCSWI++K +++ ++AGDKS PQ+ QI +++ ++S EM+K+G+ P  D+ L DVEEQ
Sbjct: 624  KNPGCSWIQVKNRVYTLLAGDKSHPQIDQITEKMDEISKEMRKSGHRPNLDFALHDVEEQ 683

Query: 346  EKEHILCGHSEKLAVIFGIINTTEGSPLRITKNLRICGDCHAVMKFISRSERREIFVRDA 167
            E+E +L GHSEKLAV+FG++NT +G+PL++ KNLRICGDCHAV+KFIS    REIF+RD 
Sbjct: 684  EQEQMLWGHSEKLAVVFGLLNTPDGTPLQVIKNLRICGDCHAVIKFISSYAGREIFIRDT 743

Query: 166  NRYHHFKDGECSCQDYW 116
            NR+HHFKDG CSC D+W
Sbjct: 744  NRFHHFKDGICSCGDFW 760


>ref|XP_006306841.1| hypothetical protein CARUB_v10008385mg [Capsella rubella]
            gi|482575552|gb|EOA39739.1| hypothetical protein
            CARUB_v10008385mg [Capsella rubella]
          Length = 760

 Score =  371 bits (952), Expect = e-100
 Identities = 164/257 (63%), Positives = 213/257 (82%)
 Frame = -2

Query: 886  LTEEGYHYFESMSKAYGVEPRVEHYACMASLLGRAGKLEEAYSLITEMPHVPDACVWGAL 707
            LT+EG+ YF  MS+ YG++PR+EHY+CM +LLGRAGKL+EAY LI EMP  PD+CVWGAL
Sbjct: 504  LTDEGWKYFSMMSEEYGIKPRLEHYSCMVNLLGRAGKLQEAYELIKEMPFEPDSCVWGAL 563

Query: 706  LSSC*LHHNVSLGELAAEKLFELEPDNPGNYILLSNIYASKGKWEEVDKVRDMMREKGLK 527
            L+SC L  NV L E+AA+KLF+LEP+NPG Y+LLSNIYA+KG W EVD +R+ M   GLK
Sbjct: 564  LNSCRLQSNVDLAEIAADKLFDLEPENPGTYVLLSNIYAAKGMWTEVDSIRNKMESLGLK 623

Query: 526  KNPGCSWIELKTKIHMIVAGDKSLPQMAQILDRLAKVSSEMKKAGYSPKTDWVLQDVEEQ 347
            KNPGCSWI++K +++ ++AGDKS PQ+ QI +++ ++S EM+K+G+ P  D+ LQDVEEQ
Sbjct: 624  KNPGCSWIQVKNRVYTLLAGDKSHPQIDQITEKMDEISEEMRKSGHRPNLDFALQDVEEQ 683

Query: 346  EKEHILCGHSEKLAVIFGIINTTEGSPLRITKNLRICGDCHAVMKFISRSERREIFVRDA 167
            E+E +L GHSEKLAV+FG++NT +G+PL++ KNLRICGDCH+V+KFIS    REIFVRD 
Sbjct: 684  EQEQMLWGHSEKLAVVFGLLNTPDGTPLQVIKNLRICGDCHSVIKFISSYAGREIFVRDT 743

Query: 166  NRYHHFKDGECSCQDYW 116
            NR+HHFKDG CSC D+W
Sbjct: 744  NRFHHFKDGICSCGDFW 760


>ref|XP_002890375.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297336217|gb|EFH66634.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 760

 Score =  367 bits (942), Expect = 3e-99
 Identities = 162/257 (63%), Positives = 212/257 (82%)
 Frame = -2

Query: 886  LTEEGYHYFESMSKAYGVEPRVEHYACMASLLGRAGKLEEAYSLITEMPHVPDACVWGAL 707
            LT+EG+ YF  MS+ YG++PR+EHY+CM +LLGRAGKL+EAY LI E+P  PD+CVWGAL
Sbjct: 504  LTDEGWKYFNMMSEEYGIKPRLEHYSCMVNLLGRAGKLQEAYDLIKEIPFEPDSCVWGAL 563

Query: 706  LSSC*LHHNVSLGELAAEKLFELEPDNPGNYILLSNIYASKGKWEEVDKVRDMMREKGLK 527
            L+SC L +NV L E+AA+KLF LEP+NPG Y+L+SNIYA+KG W EVD +R+ M   GLK
Sbjct: 564  LNSCRLQNNVDLAEIAAQKLFHLEPENPGTYVLMSNIYAAKGMWTEVDSIRNKMESLGLK 623

Query: 526  KNPGCSWIELKTKIHMIVAGDKSLPQMAQILDRLAKVSSEMKKAGYSPKTDWVLQDVEEQ 347
            KNPGCSWI++K K++ ++A DKS PQ+ QI +++ ++S EM+K+G+ P  D+ LQDVEEQ
Sbjct: 624  KNPGCSWIQVKNKVYTLLACDKSHPQIDQITEKMDEISEEMRKSGHRPNLDFALQDVEEQ 683

Query: 346  EKEHILCGHSEKLAVIFGIINTTEGSPLRITKNLRICGDCHAVMKFISRSERREIFVRDA 167
            E+E +L GHSEKLAV+FG++NT +G+PL++ KNLRICGDCHAV+KFIS    REIF+RD 
Sbjct: 684  EQEQMLWGHSEKLAVVFGLLNTPDGTPLQVIKNLRICGDCHAVIKFISSYAGREIFIRDT 743

Query: 166  NRYHHFKDGECSCQDYW 116
            NR+HHFKDG CSC D+W
Sbjct: 744  NRFHHFKDGICSCGDFW 760


>ref|XP_006655248.1| PREDICTED: pentatricopeptide repeat-containing protein At1g20230-like
            [Oryza brachyantha]
          Length = 584

 Score =  359 bits (921), Expect = 9e-97
 Identities = 159/256 (62%), Positives = 205/256 (80%)
 Frame = -2

Query: 883  TEEGYHYFESMSKAYGVEPRVEHYACMASLLGRAGKLEEAYSLITEMPHVPDACVWGALL 704
            TEEG HYF  M   +G+ PR+EHYACM +LLGRAGKL++AY +I +MP  PD+C+WG+LL
Sbjct: 329  TEEGRHYFNEMQDKHGISPRMEHYACMVTLLGRAGKLDDAYDVINQMPFEPDSCIWGSLL 388

Query: 703  SSC*LHHNVSLGELAAEKLFELEPDNPGNYILLSNIYASKGKWEEVDKVRDMMREKGLKK 524
             SC +H NV L E+AAE LF+LEP+N GNY+LLSNIYASK  W+ V++VRDMM+  GLKK
Sbjct: 389  GSCRVHGNVVLAEIAAENLFQLEPENAGNYVLLSNIYASKKMWDGVNRVRDMMKNVGLKK 448

Query: 523  NPGCSWIELKTKIHMIVAGDKSLPQMAQILDRLAKVSSEMKKAGYSPKTDWVLQDVEEQE 344
              GCSWI++K K+HM++AGD S P +A I ++L  +S EM++ G++P TD+VL DVEEQE
Sbjct: 449  EKGCSWIQIKDKVHMLLAGDSSHPMIAAITEKLKHLSIEMRRLGFAPSTDYVLHDVEEQE 508

Query: 343  KEHILCGHSEKLAVIFGIINTTEGSPLRITKNLRICGDCHAVMKFISRSERREIFVRDAN 164
            K+ IL  HSEKLAV  G+I+T++G+P+R+ KNLRICGDCH  +KFIS  E REI+VRD N
Sbjct: 509  KDDILSVHSEKLAVALGLISTSQGTPIRVIKNLRICGDCHEAIKFISSFEEREIYVRDTN 568

Query: 163  RYHHFKDGECSCQDYW 116
            R+HHFKDG+CSC DYW
Sbjct: 569  RFHHFKDGKCSCADYW 584


>ref|XP_004963823.1| PREDICTED: pentatricopeptide repeat-containing protein At1g20230-like
            [Setaria italica]
          Length = 669

 Score =  358 bits (918), Expect = 2e-96
 Identities = 163/257 (63%), Positives = 202/257 (78%)
 Frame = -2

Query: 886  LTEEGYHYFESMSKAYGVEPRVEHYACMASLLGRAGKLEEAYSLITEMPHVPDACVWGAL 707
            LTE G HYF  M   YG+ PR+EHYACM +LLGRAGKL+EAY +IT+MP  PD C+WG+L
Sbjct: 413  LTEVGRHYFNKMQHGYGISPRMEHYACMVTLLGRAGKLDEAYDVITDMPFEPDGCIWGSL 472

Query: 706  LSSC*LHHNVSLGELAAEKLFELEPDNPGNYILLSNIYASKGKWEEVDKVRDMMREKGLK 527
            L SC +H +V L E+AAEKLF LEPDN GNY+LLSNIYASK  W  V++VR+MM++ GLK
Sbjct: 473  LGSCRVHGSVDLAEVAAEKLFHLEPDNAGNYVLLSNIYASKKMWGGVNRVREMMKDMGLK 532

Query: 526  KNPGCSWIELKTKIHMIVAGDKSLPQMAQILDRLAKVSSEMKKAGYSPKTDWVLQDVEEQ 347
            K  GCSWIE+K K+HM++AGD S P M  I D+L +++ EM++ G++P TD+VL DVEEQ
Sbjct: 533  KEKGCSWIEIKNKVHMLLAGDDSHPMMTAITDKLKQLNIEMRRLGFAPSTDFVLHDVEEQ 592

Query: 346  EKEHILCGHSEKLAVIFGIINTTEGSPLRITKNLRICGDCHAVMKFISRSERREIFVRDA 167
            EK+ IL  HSEKLAV  G+I+T+ G+PLR+ KNLRIC DCH  MKFIS  E REI VRD 
Sbjct: 593  EKDDILAVHSEKLAVALGLISTSPGTPLRVIKNLRICDDCHEAMKFISCFEGREISVRDT 652

Query: 166  NRYHHFKDGECSCQDYW 116
            NR+HHF+DG+CSC DYW
Sbjct: 653  NRFHHFRDGKCSCGDYW 669


>ref|NP_001055349.1| Os05g0370000 [Oryza sativa Japonica Group] gi|54287484|gb|AAV31228.1|
            unknown protein [Oryza sativa Japonica Group]
            gi|113578900|dbj|BAF17263.1| Os05g0370000 [Oryza sativa
            Japonica Group]
          Length = 664

 Score =  358 bits (918), Expect = 2e-96
 Identities = 160/256 (62%), Positives = 203/256 (79%)
 Frame = -2

Query: 883  TEEGYHYFESMSKAYGVEPRVEHYACMASLLGRAGKLEEAYSLITEMPHVPDACVWGALL 704
            TEEG  YF  M   +G+ PR+EHYACM +LLGRAGKL++AY +I +MP  PD C+WG+LL
Sbjct: 409  TEEGRSYFNEMQHKHGISPRMEHYACMVTLLGRAGKLDDAYDIINQMPFEPDGCIWGSLL 468

Query: 703  SSC*LHHNVSLGELAAEKLFELEPDNPGNYILLSNIYASKGKWEEVDKVRDMMREKGLKK 524
             SC +H NV L E+AAE LF+LEP+N GNY+LLSNIYASK  W+ V+++RDMM+  GLKK
Sbjct: 469  GSCRVHGNVVLAEVAAENLFQLEPENAGNYVLLSNIYASKKMWDGVNRLRDMMKTVGLKK 528

Query: 523  NPGCSWIELKTKIHMIVAGDKSLPQMAQILDRLAKVSSEMKKAGYSPKTDWVLQDVEEQE 344
              GCSWIE+K K+HM++AGD S P MA I ++L  ++ EM++ G++P TD+VL DVEEQE
Sbjct: 529  EKGCSWIEIKNKVHMLLAGDSSHPMMAAITEKLKHLTMEMRRLGFAPSTDYVLHDVEEQE 588

Query: 343  KEHILCGHSEKLAVIFGIINTTEGSPLRITKNLRICGDCHAVMKFISRSERREIFVRDAN 164
            K+ IL  HSEKLAV  G+I+T+ G+PL++ KNLRICGDCH  MKFIS  ERREI+VRD N
Sbjct: 589  KDDILSVHSEKLAVALGLISTSHGTPLQVIKNLRICGDCHEAMKFISSFERREIYVRDTN 648

Query: 163  RYHHFKDGECSCQDYW 116
            R+HHFKDG+CSC DYW
Sbjct: 649  RFHHFKDGKCSCADYW 664


Top