BLASTX nr result

ID: Paeonia22_contig00013368 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Paeonia22_contig00013368
         (1610 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002284744.1| PREDICTED: pentatricopeptide repeat-containi...   542   e-151
ref|XP_007227203.1| hypothetical protein PRUPE_ppa019251mg [Prun...   541   e-151
ref|XP_004292932.1| PREDICTED: pentatricopeptide repeat-containi...   539   e-150
ref|XP_007017888.1| Pentatricopeptide repeat (PPR) superfamily p...   533   e-149
gb|EXB68664.1| hypothetical protein L484_024678 [Morus notabilis]     524   e-146
gb|EYU24286.1| hypothetical protein MIMGU_mgv1a025107mg [Mimulus...   520   e-145
gb|EYU18955.1| hypothetical protein MIMGU_mgv1a022111mg [Mimulus...   516   e-144
ref|XP_007152607.1| hypothetical protein PHAVU_004G144300g [Phas...   509   e-141
ref|XP_002301973.2| pentatricopeptide repeat-containing family p...   503   e-139
ref|XP_003549191.1| PREDICTED: pentatricopeptide repeat-containi...   503   e-139
ref|XP_004250454.1| PREDICTED: pentatricopeptide repeat-containi...   500   e-139
ref|XP_006587447.1| PREDICTED: pentatricopeptide repeat-containi...   492   e-136
ref|XP_006416418.1| hypothetical protein EUTSA_v10009574mg [Eutr...   476   e-131
ref|XP_002890375.1| pentatricopeptide repeat-containing protein ...   474   e-131
ref|NP_173449.1| pentatricopeptide repeat-containing protein [Ar...   473   e-130
ref|XP_006306841.1| hypothetical protein CARUB_v10008385mg [Caps...   471   e-130
ref|XP_004515286.1| PREDICTED: pentatricopeptide repeat-containi...   462   e-127
gb|EPS74339.1| hypothetical protein M569_00411 [Genlisea aurea]       460   e-127
ref|XP_006655248.1| PREDICTED: pentatricopeptide repeat-containi...   457   e-126
emb|CAN63659.1| hypothetical protein VITISV_008415 [Vitis vinifera]   449   e-123

>ref|XP_002284744.1| PREDICTED: pentatricopeptide repeat-containing protein At1g20230
            [Vitis vinifera]
          Length = 758

 Score =  542 bits (1397), Expect = e-151
 Identities = 253/316 (80%), Positives = 285/316 (90%)
 Frame = -3

Query: 1602 CSDGMPARNLVCWNAIIGGYAMHGKVKEALEIFHMMQRSGQKPDLITFTSVLSACSQSGL 1423
            C DG+P +NLVCWNA+I GYAMHGK KEA+EIF +MQRSGQKPD+I+FT VLSACSQSGL
Sbjct: 443  CFDGIPTKNLVCWNAVIAGYAMHGKAKEAMEIFDLMQRSGQKPDIISFTCVLSACSQSGL 502

Query: 1422 TEEGWYHFNSMSVEHGIKARVEHYACMVNLLGRAGKLKEAYSMIQKMPFEPDACVWGALL 1243
            TEEG Y+FNSMS ++GI+ARVEHYACMV LL RAGKL++AY+MI++MP  PDACVWGALL
Sbjct: 503  TEEGSYYFNSMSSKYGIEARVEHYACMVTLLSRAGKLEQAYAMIRRMPVNPDACVWGALL 562

Query: 1242 SSCRVHNNLRLGEIAAKELFELEPKNPGNYILLSNIYASNGMWNDVDIVRGMMKSMGLRK 1063
            SSCRVHNN+ LGE+AA++LFELEP NPGNYILLSNIYAS GMWN+V+ VR MMK+ GLRK
Sbjct: 563  SSCRVHNNVSLGEVAAEKLFELEPSNPGNYILLSNIYASKGMWNEVNRVRDMMKNKGLRK 622

Query: 1062 NPGCSWIEIKNKVHMLLAGDKSHPHMAQIIEKLNKLSLEMKKSGFFPVTNFVLQDVEEQD 883
            NPGCSWIE+KNKVHMLLAGDKSHP M QIIEKL+KLS+EMKK G+FP  NFVLQDVEEQD
Sbjct: 623  NPGCSWIEVKNKVHMLLAGDKSHPQMTQIIEKLDKLSMEMKKLGYFPEINFVLQDVEEQD 682

Query: 882  KEQFLCGHSEKLAVVFGLLNTHQGSPLRVIKNLRICGDCHAFIKFLSSYEGREIFVRDTN 703
            KEQ LCGHSEKLAVVFGLLNT  G PL+VIKNLRICGDCH  IKF+SS+E REIFVRDTN
Sbjct: 683  KEQILCGHSEKLAVVFGLLNTPPGYPLQVIKNLRICGDCHVVIKFISSFERREIFVRDTN 742

Query: 702  LFHHFKDGACSCGDFW 655
             FHHFK+GACSCGD+W
Sbjct: 743  RFHHFKEGACSCGDYW 758


>ref|XP_007227203.1| hypothetical protein PRUPE_ppa019251mg [Prunus persica]
            gi|462424139|gb|EMJ28402.1| hypothetical protein
            PRUPE_ppa019251mg [Prunus persica]
          Length = 654

 Score =  541 bits (1394), Expect = e-151
 Identities = 252/316 (79%), Positives = 281/316 (88%)
 Frame = -3

Query: 1602 CSDGMPARNLVCWNAIIGGYAMHGKVKEALEIFHMMQRSGQKPDLITFTSVLSACSQSGL 1423
            C D MP RNLVCWNA++GGYAMHGK  E +E+F +MQRSGQKPD I+FT VLSACSQ GL
Sbjct: 339  CFDEMPTRNLVCWNAVMGGYAMHGKANETMEVFRLMQRSGQKPDFISFTCVLSACSQKGL 398

Query: 1422 TEEGWYHFNSMSVEHGIKARVEHYACMVNLLGRAGKLKEAYSMIQKMPFEPDACVWGALL 1243
            T+EGWY+FNSMS EHG++ARVEHYACMV LL R+GKL+EAYSMI++MPFEPDACVWGALL
Sbjct: 399  TDEGWYYFNSMSKEHGLEARVEHYACMVTLLSRSGKLEEAYSMIKQMPFEPDACVWGALL 458

Query: 1242 SSCRVHNNLRLGEIAAKELFELEPKNPGNYILLSNIYASNGMWNDVDIVRGMMKSMGLRK 1063
            SSCRVH+N+ LG+  AK+LF LEPKNPGNYILLSNIYAS GMW++VD VR  MKS+GLRK
Sbjct: 459  SSCRVHSNVTLGKYVAKKLFNLEPKNPGNYILLSNIYASKGMWSEVDKVRDKMKSLGLRK 518

Query: 1062 NPGCSWIEIKNKVHMLLAGDKSHPHMAQIIEKLNKLSLEMKKSGFFPVTNFVLQDVEEQD 883
            NPGCSWIE+KNKVHMLLAGDK+HP M QIIEKLNKLS EMKK G+FP T+FVLQDVEEQD
Sbjct: 519  NPGCSWIEVKNKVHMLLAGDKAHPQMNQIIEKLNKLSSEMKKLGYFPNTHFVLQDVEEQD 578

Query: 882  KEQFLCGHSEKLAVVFGLLNTHQGSPLRVIKNLRICGDCHAFIKFLSSYEGREIFVRDTN 703
            KEQ LCGHSEKLAVV GLLN+  GS LRVIKNLRICGDCHA IKF+SS+EGREI VRDTN
Sbjct: 579  KEQILCGHSEKLAVVLGLLNSPPGSSLRVIKNLRICGDCHAVIKFISSFEGREISVRDTN 638

Query: 702  LFHHFKDGACSCGDFW 655
            LFHHFKDG CSC D+W
Sbjct: 639  LFHHFKDGVCSCEDYW 654


>ref|XP_004292932.1| PREDICTED: pentatricopeptide repeat-containing protein At1g20230-like
            [Fragaria vesca subsp. vesca]
          Length = 755

 Score =  539 bits (1389), Expect = e-150
 Identities = 254/316 (80%), Positives = 276/316 (87%)
 Frame = -3

Query: 1602 CSDGMPARNLVCWNAIIGGYAMHGKVKEALEIFHMMQRSGQKPDLITFTSVLSACSQSGL 1423
            C D MP RNLVCWNA++ GYAMHGK KE +EIFHMMQRSG KPD+I+FT VLSACSQ+GL
Sbjct: 440  CFDKMPTRNLVCWNAVMSGYAMHGKAKETMEIFHMMQRSGLKPDIISFTCVLSACSQNGL 499

Query: 1422 TEEGWYHFNSMSVEHGIKARVEHYACMVNLLGRAGKLKEAYSMIQKMPFEPDACVWGALL 1243
            TEEGWY+FNSMS EHGI+AR+EHYACMV LLGRAGKL EAYSMI+KMPFEPDACVWGALL
Sbjct: 500  TEEGWYYFNSMSKEHGIEARIEHYACMVTLLGRAGKLDEAYSMIKKMPFEPDACVWGALL 559

Query: 1242 SSCRVHNNLRLGEIAAKELFELEPKNPGNYILLSNIYASNGMWNDVDIVRGMMKSMGLRK 1063
            SSCRVHNN+ LGE  AK+LF LEP NPGNYILLSNIYAS GMW +VD VR  MKS+GLRK
Sbjct: 560  SSCRVHNNVTLGESTAKKLFNLEPGNPGNYILLSNIYASKGMWTEVDRVRDTMKSLGLRK 619

Query: 1062 NPGCSWIEIKNKVHMLLAGDKSHPHMAQIIEKLNKLSLEMKKSGFFPVTNFVLQDVEEQD 883
            NPGCSWIE KN VHMLLAGDK+HP M +I EKLN LS EMKKSG+ P T+FVLQDVEEQ+
Sbjct: 620  NPGCSWIEFKNNVHMLLAGDKTHPQMNKITEKLNTLSSEMKKSGYLPSTHFVLQDVEEQE 679

Query: 882  KEQFLCGHSEKLAVVFGLLNTHQGSPLRVIKNLRICGDCHAFIKFLSSYEGREIFVRDTN 703
            KEQ LCGHSEKLAVV GLLNT  GS LRVIKNLRICGDCH+ IKF+SS EGREI VRDTN
Sbjct: 680  KEQILCGHSEKLAVVLGLLNTPPGSSLRVIKNLRICGDCHSVIKFISSLEGREISVRDTN 739

Query: 702  LFHHFKDGACSCGDFW 655
             FHHFKDG CSCGD+W
Sbjct: 740  RFHHFKDGVCSCGDYW 755


>ref|XP_007017888.1| Pentatricopeptide repeat (PPR) superfamily protein [Theobroma cacao]
            gi|508723216|gb|EOY15113.1| Pentatricopeptide repeat
            (PPR) superfamily protein [Theobroma cacao]
          Length = 758

 Score =  533 bits (1374), Expect = e-149
 Identities = 240/316 (75%), Positives = 285/316 (90%)
 Frame = -3

Query: 1602 CSDGMPARNLVCWNAIIGGYAMHGKVKEALEIFHMMQRSGQKPDLITFTSVLSACSQSGL 1423
            C D +P++N VCWNAI+GGYAMHGK KEA++IFHMMQR GQKPD I+F+ VLSACSQ GL
Sbjct: 443  CFDRIPSKNSVCWNAIMGGYAMHGKAKEAIDIFHMMQRRGQKPDFISFSCVLSACSQGGL 502

Query: 1422 TEEGWYHFNSMSVEHGIKARVEHYACMVNLLGRAGKLKEAYSMIQKMPFEPDACVWGALL 1243
            TEEGW+ FNSMS +HG+KA++EHY+CMVNLLGR+GKL++AY++IQ+MPFEPDACVWGALL
Sbjct: 503  TEEGWHFFNSMSRDHGVKAKMEHYSCMVNLLGRSGKLEQAYALIQQMPFEPDACVWGALL 562

Query: 1242 SSCRVHNNLRLGEIAAKELFELEPKNPGNYILLSNIYASNGMWNDVDIVRGMMKSMGLRK 1063
            SSCR+HNN+ LGEIAA+ LF+LEP NPGNYILLSNIYAS GMW++VD VR +M+S G++K
Sbjct: 563  SSCRLHNNISLGEIAAQNLFKLEPSNPGNYILLSNIYASKGMWDEVDAVRDVMRSRGMKK 622

Query: 1062 NPGCSWIEIKNKVHMLLAGDKSHPHMAQIIEKLNKLSLEMKKSGFFPVTNFVLQDVEEQD 883
            NPGCSWIEIKN+VHMLLAGDKSHP M +IIEK+ KLS++MKK+G+ P T+FVLQDV+EQD
Sbjct: 623  NPGCSWIEIKNQVHMLLAGDKSHPQMTEIIEKIYKLSMDMKKAGYLPNTDFVLQDVDEQD 682

Query: 882  KEQFLCGHSEKLAVVFGLLNTHQGSPLRVIKNLRICGDCHAFIKFLSSYEGREIFVRDTN 703
            KEQ LCGHSEKLAV FGLLNT  GSPL++IKNLRICGDCHA IKF+S +EGREI+VRDTN
Sbjct: 683  KEQILCGHSEKLAVAFGLLNTPPGSPLQIIKNLRICGDCHAVIKFISGFEGREIYVRDTN 742

Query: 702  LFHHFKDGACSCGDFW 655
             FHHFKDG CSC D+W
Sbjct: 743  RFHHFKDGVCSCRDYW 758


>gb|EXB68664.1| hypothetical protein L484_024678 [Morus notabilis]
          Length = 728

 Score =  524 bits (1350), Expect = e-146
 Identities = 242/316 (76%), Positives = 274/316 (86%)
 Frame = -3

Query: 1602 CSDGMPARNLVCWNAIIGGYAMHGKVKEALEIFHMMQRSGQKPDLITFTSVLSACSQSGL 1423
            C D +P RNLVCWNAI+ GYAMHGK +E +EIF MMQ+SGQKPD I+FT VLSACSQ+GL
Sbjct: 413  CFDQLPVRNLVCWNAIMSGYAMHGKARETIEIFQMMQKSGQKPDFISFTCVLSACSQNGL 472

Query: 1422 TEEGWYHFNSMSVEHGIKARVEHYACMVNLLGRAGKLKEAYSMIQKMPFEPDACVWGALL 1243
            T+EGW++F+SMS EHGI+AR+EHYACMV LLGR+GKL+EAYS+I KMP EPDACVWG+LL
Sbjct: 473  TDEGWHYFSSMSKEHGIEARLEHYACMVTLLGRSGKLEEAYSLINKMPMEPDACVWGSLL 532

Query: 1242 SSCRVHNNLRLGEIAAKELFELEPKNPGNYILLSNIYASNGMWNDVDIVRGMMKSMGLRK 1063
            SSCRVHNN+ LGE+AA++LFELEP+NPGNY++LSNIY S GMW+ VD VR MM   GLRK
Sbjct: 533  SSCRVHNNVSLGEVAAEKLFELEPRNPGNYVILSNIYGSKGMWSQVDRVRDMMNQKGLRK 592

Query: 1062 NPGCSWIEIKNKVHMLLAGDKSHPHMAQIIEKLNKLSLEMKKSGFFPVTNFVLQDVEEQD 883
            NPGCSWIE+KN+VHMLLAGDKSHP   QII KLNKLS+EMK SG+FP   FVLQDVEEQD
Sbjct: 593  NPGCSWIEVKNEVHMLLAGDKSHPQRIQIIGKLNKLSMEMKNSGYFPNFTFVLQDVEEQD 652

Query: 882  KEQFLCGHSEKLAVVFGLLNTHQGSPLRVIKNLRICGDCHAFIKFLSSYEGREIFVRDTN 703
            K   LCGHSEKLAV FGLLNT  GS LRVIKNLRICGDCH  IKF+SS+E REIFVRDTN
Sbjct: 653  KVHILCGHSEKLAVAFGLLNTPPGSSLRVIKNLRICGDCHVVIKFISSFEQREIFVRDTN 712

Query: 702  LFHHFKDGACSCGDFW 655
             FHHFKDG CSCGD+W
Sbjct: 713  RFHHFKDGHCSCGDYW 728



 Score = 65.5 bits (158), Expect = 7e-08
 Identities = 39/121 (32%), Positives = 63/121 (52%), Gaps = 3/121 (2%)
 Frame = -3

Query: 1596 DGMPARNLVCWNAIIGGYAMHGKVKEALEIFHMMQRSGQKPDLITFTSVLSACSQSGLTE 1417
            DGMP R+LV W+A+I GY+  G V+EA  +F+ M   G +P+++T+  ++S  S+SG   
Sbjct: 166  DGMPQRDLVAWSALISGYSSRGLVEEAKGLFYDMGMGGLEPNVVTWNGMISGFSRSGSCS 225

Query: 1416 EGWYHFNSMSVEHGIKARVEHYACMVNLLGRAGKLK---EAYSMIQKMPFEPDACVWGAL 1246
            E    F  M  E G+       + ++  +G    L    + +  + K  F  D CV  AL
Sbjct: 226  EAVDMFRRMHSE-GVPPDGSSVSSVLPAIGDLEDLNVGIQVHGYVVKRGFGSDKCVTSAL 284

Query: 1245 L 1243
            +
Sbjct: 285  I 285


>gb|EYU24286.1| hypothetical protein MIMGU_mgv1a025107mg [Mimulus guttatus]
          Length = 654

 Score =  520 bits (1339), Expect = e-145
 Identities = 239/318 (75%), Positives = 281/318 (88%)
 Frame = -3

Query: 1608 QCCSDGMPARNLVCWNAIIGGYAMHGKVKEALEIFHMMQRSGQKPDLITFTSVLSACSQS 1429
            +CC D MP RNLVCWNA++GGYAMHGK  EA+E F +MQRSGQKPD ++ TS+LSACSQS
Sbjct: 337  RCCFDRMPVRNLVCWNAMLGGYAMHGKANEAIEFFLLMQRSGQKPDSVSLTSLLSACSQS 396

Query: 1428 GLTEEGWYHFNSMSVEHGIKARVEHYACMVNLLGRAGKLKEAYSMIQKMPFEPDACVWGA 1249
            GLTEEG  +F+ M+ +HGIK RVEHYAC+V+LLGRAGKL+EAYSMI+KMPFEPDACVWGA
Sbjct: 397  GLTEEGHRYFDRMTTDHGIKPRVEHYACVVSLLGRAGKLEEAYSMIEKMPFEPDACVWGA 456

Query: 1248 LLSSCRVHNNLRLGEIAAKELFELEPKNPGNYILLSNIYASNGMWNDVDIVRGMMKSMGL 1069
            LLSSCRVH+N+ LGE+AA++LFELEP NPGNYIL+SNIYAS G + +VD +R +M+  GL
Sbjct: 457  LLSSCRVHHNMSLGEVAARKLFELEPMNPGNYILMSNIYASKGRYKEVDKIRDIMRDKGL 516

Query: 1068 RKNPGCSWIEIKNKVHMLLAGDKSHPHMAQIIEKLNKLSLEMKKSGFFPVTNFVLQDVEE 889
            RKNPGCSWIE+KNKVHMLLAGDKS P MAQI++KLN+LS+EMKK+G+ P T++VLQDVEE
Sbjct: 517  RKNPGCSWIEVKNKVHMLLAGDKSLPQMAQIMDKLNRLSIEMKKAGYSPNTDYVLQDVEE 576

Query: 888  QDKEQFLCGHSEKLAVVFGLLNTHQGSPLRVIKNLRICGDCHAFIKFLSSYEGREIFVRD 709
            Q+KE  LCGHSEKLAVVFG+LNT  GSPLRV KNLRICGDCHA IKF+S +E REIFVRD
Sbjct: 577  QEKEHILCGHSEKLAVVFGILNTSPGSPLRVTKNLRICGDCHAVIKFISRFERREIFVRD 636

Query: 708  TNLFHHFKDGACSCGDFW 655
            TN +HHFKDG CSCGD+W
Sbjct: 637  TNRYHHFKDGDCSCGDYW 654



 Score = 63.2 bits (152), Expect = 3e-07
 Identities = 36/118 (30%), Positives = 57/118 (48%)
 Frame = -3

Query: 1578 NLVCWNAIIGGYAMHGKVKEALEIFHMMQRSGQKPDLITFTSVLSACSQSGLTEEGWYHF 1399
            N+V W ++I   + HGK  EALE+F  MQ +G KP+ +T   +L AC        G    
Sbjct: 246  NVVSWTSVIACCSQHGKDIEALELFREMQSAGVKPNAVTIPCLLPACGNIAALMHG-KAA 304

Query: 1398 NSMSVEHGIKARVEHYACMVNLLGRAGKLKEAYSMIQKMPFEPDACVWGALLSSCRVH 1225
            +  S+  GI   V   + ++++    GK++ A     +MP     C W A+L    +H
Sbjct: 305  HCFSLRRGISGDVYVGSALIDMYANCGKIQLARCCFDRMPVRNLVC-WNAMLGGYAMH 361


>gb|EYU18955.1| hypothetical protein MIMGU_mgv1a022111mg [Mimulus guttatus]
          Length = 654

 Score =  516 bits (1330), Expect = e-144
 Identities = 240/318 (75%), Positives = 279/318 (87%)
 Frame = -3

Query: 1608 QCCSDGMPARNLVCWNAIIGGYAMHGKVKEALEIFHMMQRSGQKPDLITFTSVLSACSQS 1429
            +CC D M  RNLVCWNA++GGYAMHGK KEA+E F +MQRSGQKPD ++ TS+LSACSQS
Sbjct: 337  RCCFDRMSVRNLVCWNAMLGGYAMHGKAKEAIEFFLLMQRSGQKPDSVSLTSLLSACSQS 396

Query: 1428 GLTEEGWYHFNSMSVEHGIKARVEHYACMVNLLGRAGKLKEAYSMIQKMPFEPDACVWGA 1249
            GLTEEG  +F+ M+ +HGIK RVEHYAC+V+LLGRAGKL+EAYSMI+KMPFEPDACVWGA
Sbjct: 397  GLTEEGHRYFDRMTTDHGIKPRVEHYACVVSLLGRAGKLEEAYSMIEKMPFEPDACVWGA 456

Query: 1248 LLSSCRVHNNLRLGEIAAKELFELEPKNPGNYILLSNIYASNGMWNDVDIVRGMMKSMGL 1069
            LLSSCRVH+N+ LG +AA++LFELEPKNPGNYILLSNIYAS G + +VD +R +M   GL
Sbjct: 457  LLSSCRVHHNMSLGGVAARKLFELEPKNPGNYILLSNIYASKGRYKEVDKIRDIMGDKGL 516

Query: 1068 RKNPGCSWIEIKNKVHMLLAGDKSHPHMAQIIEKLNKLSLEMKKSGFFPVTNFVLQDVEE 889
            RKNPGCSWIE+KNKVHMLLAGDKS P MAQI+EKLN+LS+EMKK+G+ P T++VLQDVEE
Sbjct: 517  RKNPGCSWIEVKNKVHMLLAGDKSLPQMAQIMEKLNRLSIEMKKAGYSPNTDYVLQDVEE 576

Query: 888  QDKEQFLCGHSEKLAVVFGLLNTHQGSPLRVIKNLRICGDCHAFIKFLSSYEGREIFVRD 709
            Q+KE  LCGHSEKLAVVFG+LN   GSPLRV KNLRICGDCHA IKF+S +E REIFVRD
Sbjct: 577  QEKEHILCGHSEKLAVVFGILNMSPGSPLRVTKNLRICGDCHAVIKFISRFERREIFVRD 636

Query: 708  TNLFHHFKDGACSCGDFW 655
            TN +HHFKDG CSCGD+W
Sbjct: 637  TNRYHHFKDGDCSCGDYW 654



 Score = 61.6 bits (148), Expect = 1e-06
 Identities = 36/122 (29%), Positives = 57/122 (46%)
 Frame = -3

Query: 1578 NLVCWNAIIGGYAMHGKVKEALEIFHMMQRSGQKPDLITFTSVLSACSQSGLTEEGWYHF 1399
            N+V W ++I   + HGK  EALE+F  MQ SG KP+ +T   +L AC        G    
Sbjct: 246  NVVSWTSVIACCSQHGKDIEALELFREMQASGVKPNAVTIPCLLPACGNIAALMHG-KAA 304

Query: 1398 NSMSVEHGIKARVEHYACMVNLLGRAGKLKEAYSMIQKMPFEPDACVWGALLSSCRVHNN 1219
            +  S+  GI   V   + ++++    GK++ A     +M      C W A+L    +H  
Sbjct: 305  HCFSLRRGISGDVYVGSALIDMYANCGKIQLARCCFDRMSVRNLVC-WNAMLGGYAMHGK 363

Query: 1218 LR 1213
             +
Sbjct: 364  AK 365


>ref|XP_007152607.1| hypothetical protein PHAVU_004G144300g [Phaseolus vulgaris]
            gi|561025916|gb|ESW24601.1| hypothetical protein
            PHAVU_004G144300g [Phaseolus vulgaris]
          Length = 601

 Score =  509 bits (1312), Expect = e-141
 Identities = 236/316 (74%), Positives = 273/316 (86%)
 Frame = -3

Query: 1602 CSDGMPARNLVCWNAIIGGYAMHGKVKEALEIFHMMQRSGQKPDLITFTSVLSACSQSGL 1423
            C D M A NLV WNA+I GYAMHGK KE +E+FHMMQ+SGQKPD ITFT +LSAC+Q+GL
Sbjct: 286  CFDNMLAPNLVSWNAVISGYAMHGKAKETMEMFHMMQQSGQKPDSITFTCILSACAQNGL 345

Query: 1422 TEEGWYHFNSMSVEHGIKARVEHYACMVNLLGRAGKLKEAYSMIQKMPFEPDACVWGALL 1243
            TEEGW+++NSMS EHGI+ ++EHYACMV LL R GKL+EAYS+I++MPFEPDACVWGALL
Sbjct: 346  TEEGWHYYNSMSKEHGIEPKMEHYACMVTLLSRVGKLEEAYSIIKEMPFEPDACVWGALL 405

Query: 1242 SSCRVHNNLRLGEIAAKELFELEPKNPGNYILLSNIYASNGMWNDVDIVRGMMKSMGLRK 1063
            SSCRVHNNL LGEIAA++LF LEP NPGNY+LLSNIYAS G+W++ + +R MMKS GLRK
Sbjct: 406  SSCRVHNNLSLGEIAAEKLFPLEPANPGNYVLLSNIYASKGLWDEENRIREMMKSKGLRK 465

Query: 1062 NPGCSWIEIKNKVHMLLAGDKSHPHMAQIIEKLNKLSLEMKKSGFFPVTNFVLQDVEEQD 883
            NPG SWIE+ +KVHMLLAGD+SHP M  I+EKL+KL++EMKKSG+ P TNFVLQDVEEQD
Sbjct: 466  NPGYSWIEVGHKVHMLLAGDQSHPQMKDILEKLDKLNMEMKKSGYLPKTNFVLQDVEEQD 525

Query: 882  KEQFLCGHSEKLAVVFGLLNTHQGSPLRVIKNLRICGDCHAFIKFLSSYEGREIFVRDTN 703
            KEQ LCGHSEKLAVV GLLNT  G PL+VIKNLRIC DCHA IK +S  EGREI++RDTN
Sbjct: 526  KEQILCGHSEKLAVVLGLLNTSPGQPLQVIKNLRICDDCHAVIKAISRLEGREIYIRDTN 585

Query: 702  LFHHFKDGACSCGDFW 655
             FHH KDG CSCGDFW
Sbjct: 586  RFHHIKDGVCSCGDFW 601


>ref|XP_002301973.2| pentatricopeptide repeat-containing family protein [Populus
            trichocarpa] gi|550344115|gb|EEE81246.2|
            pentatricopeptide repeat-containing family protein
            [Populus trichocarpa]
          Length = 724

 Score =  503 bits (1294), Expect = e-139
 Identities = 235/316 (74%), Positives = 272/316 (86%)
 Frame = -3

Query: 1602 CSDGMPARNLVCWNAIIGGYAMHGKVKEALEIFHMMQRSGQKPDLITFTSVLSACSQSGL 1423
            C D MP RNLV WN+++ GYAMHGK  EA+ IF +MQR GQKPD ++FT VLSAC+Q GL
Sbjct: 409  CFDMMPNRNLVSWNSLMAGYAMHGKTFEAINIFELMQRCGQKPDHVSFTCVLSACTQGGL 468

Query: 1422 TEEGWYHFNSMSVEHGIKARVEHYACMVNLLGRAGKLKEAYSMIQKMPFEPDACVWGALL 1243
            TEEGW++F+SMS  HG++AR+EHY+CMV LLGR+G+L+EAY+MI++MPFEPD+CVWGALL
Sbjct: 469  TEEGWFYFDSMSRNHGVEARMEHYSCMVTLLGRSGRLEEAYAMIKQMPFEPDSCVWGALL 528

Query: 1242 SSCRVHNNLRLGEIAAKELFELEPKNPGNYILLSNIYASNGMWNDVDIVRGMMKSMGLRK 1063
            SSCRVHN + LGEIAAK +FELEP+NPGNYILLSNIYAS  MW +VD+VR MM+S GL+K
Sbjct: 529  SSCRVHNRVDLGEIAAKRVFELEPRNPGNYILLSNIYASKAMWVEVDMVRDMMRSRGLKK 588

Query: 1062 NPGCSWIEIKNKVHMLLAGDKSHPHMAQIIEKLNKLSLEMKKSGFFPVTNFVLQDVEEQD 883
            NPG SWIEIKNKVHMLLAGD SHP M QIIEKL KL++EMKKSG+ P T+FVLQDVEEQD
Sbjct: 589  NPGYSWIEIKNKVHMLLAGDSSHPQMPQIIEKLAKLTVEMKKSGYVPHTDFVLQDVEEQD 648

Query: 882  KEQFLCGHSEKLAVVFGLLNTHQGSPLRVIKNLRICGDCHAFIKFLSSYEGREIFVRDTN 703
            KEQ LCGHSEKLAVV GLLNT  G PL+VIKNLRIC DCHA IKF+S +E REIFVRDTN
Sbjct: 649  KEQILCGHSEKLAVVLGLLNTKPGFPLQVIKNLRICRDCHAVIKFISDFEKREIFVRDTN 708

Query: 702  LFHHFKDGACSCGDFW 655
             FH FK G CSCGD+W
Sbjct: 709  RFHQFKGGVCSCGDYW 724


>ref|XP_003549191.1| PREDICTED: pentatricopeptide repeat-containing protein At1g20230-like
            isoform X1 [Glycine max]
          Length = 748

 Score =  503 bits (1294), Expect = e-139
 Identities = 235/316 (74%), Positives = 272/316 (86%)
 Frame = -3

Query: 1602 CSDGMPARNLVCWNAIIGGYAMHGKVKEALEIFHMMQRSGQKPDLITFTSVLSACSQSGL 1423
            C D M A NLV WNA++ GYAMHGK KE +E+FHMM +SGQKPDL+TFT VLSAC+Q+GL
Sbjct: 433  CFDKMSALNLVSWNAVMKGYAMHGKAKETMEMFHMMLQSGQKPDLVTFTCVLSACAQNGL 492

Query: 1422 TEEGWYHFNSMSVEHGIKARVEHYACMVNLLGRAGKLKEAYSMIQKMPFEPDACVWGALL 1243
            TEEGW  +NSMS EHGI+ ++EHYAC+V LL R GKL+EAYS+I++MPFEPDACVWGALL
Sbjct: 493  TEEGWRCYNSMSEEHGIEPKMEHYACLVTLLSRVGKLEEAYSIIKEMPFEPDACVWGALL 552

Query: 1242 SSCRVHNNLRLGEIAAKELFELEPKNPGNYILLSNIYASNGMWNDVDIVRGMMKSMGLRK 1063
            SSCRVHNNL LGEIAA++LF LEP NPGNYILLSNIYAS G+W++ + +R +MKS GLRK
Sbjct: 553  SSCRVHNNLSLGEIAAEKLFFLEPTNPGNYILLSNIYASKGLWDEENRIREVMKSKGLRK 612

Query: 1062 NPGCSWIEIKNKVHMLLAGDKSHPHMAQIIEKLNKLSLEMKKSGFFPVTNFVLQDVEEQD 883
            NPG SWIE+ +KVHMLLAGD+SHP M  I+EKL+KL+++MKKSG+ P TNFVLQDVEEQD
Sbjct: 613  NPGYSWIEVGHKVHMLLAGDQSHPQMKDILEKLDKLNMQMKKSGYLPKTNFVLQDVEEQD 672

Query: 882  KEQFLCGHSEKLAVVFGLLNTHQGSPLRVIKNLRICGDCHAFIKFLSSYEGREIFVRDTN 703
            KEQ LCGHSEKLAVV GLLNT  G PL+VIKNLRIC DCHA IK +S  EGREI+VRDTN
Sbjct: 673  KEQILCGHSEKLAVVLGLLNTSPGQPLQVIKNLRICDDCHAVIKVISRLEGREIYVRDTN 732

Query: 702  LFHHFKDGACSCGDFW 655
             FHHFKDG CSCGDFW
Sbjct: 733  RFHHFKDGVCSCGDFW 748


>ref|XP_004250454.1| PREDICTED: pentatricopeptide repeat-containing protein At1g20230-like
            [Solanum lycopersicum]
          Length = 828

 Score =  500 bits (1288), Expect = e-139
 Identities = 236/314 (75%), Positives = 265/314 (84%)
 Frame = -3

Query: 1596 DGMPARNLVCWNAIIGGYAMHGKVKEALEIFHMMQRSGQKPDLITFTSVLSACSQSGLTE 1417
            D MP RNLVCWNA+  GYAMHGK KEA+EIF  M+RSGQKPD I+FTSVLSACSQ+GLTE
Sbjct: 515  DRMPVRNLVCWNAMTSGYAMHGKAKEAIEIFDSMRRSGQKPDFISFTSVLSACSQAGLTE 574

Query: 1416 EGWYHFNSMSVEHGIKARVEHYACMVNLLGRAGKLKEAYSMIQKMPFEPDACVWGALLSS 1237
            +G ++F+ MS  HG++ARVEHYACMV+LLGR GKLKEAY MI  MP EPDACVWGALLSS
Sbjct: 575  QGQHYFDCMSRIHGLEARVEHYACMVSLLGRTGKLKEAYDMISTMPIEPDACVWGALLSS 634

Query: 1236 CRVHNNLRLGEIAAKELFELEPKNPGNYILLSNIYASNGMWNDVDIVRGMMKSMGLRKNP 1057
            CR H N+ LGEIAA +LFELEPKNPGNYILLSNIYASN  WN+VD VR MMK +GL KNP
Sbjct: 635  CRTHRNMSLGEIAADKLFELEPKNPGNYILLSNIYASNNRWNEVDKVRDMMKHVGLSKNP 694

Query: 1056 GCSWIEIKNKVHMLLAGDKSHPHMAQIIEKLNKLSLEMKKSGFFPVTNFVLQDVEEQDKE 877
            GCSWIEIKNKVHMLLAGD  HP M QI+EKL KLS++MK +G    T  VLQDVEEQDKE
Sbjct: 695  GCSWIEIKNKVHMLLAGDDLHPQMPQIMEKLRKLSMDMKNTGVSHDTELVLQDVEEQDKE 754

Query: 876  QFLCGHSEKLAVVFGLLNTHQGSPLRVIKNLRICGDCHAFIKFLSSYEGREIFVRDTNLF 697
              LCGHSEKLAVV G+LNT+ G+ LRVIKNLRICGDCH FIKF+SS+EGREI+VRD N +
Sbjct: 755  LILCGHSEKLAVVLGILNTNPGTSLRVIKNLRICGDCHTFIKFISSFEGREIYVRDANRY 814

Query: 696  HHFKDGACSCGDFW 655
            HHF +G CSCGD+W
Sbjct: 815  HHFNEGICSCGDYW 828


>ref|XP_006587447.1| PREDICTED: pentatricopeptide repeat-containing protein At1g20230-like
            [Glycine max]
          Length = 601

 Score =  492 bits (1266), Expect = e-136
 Identities = 228/318 (71%), Positives = 269/318 (84%)
 Frame = -3

Query: 1608 QCCSDGMPARNLVCWNAIIGGYAMHGKVKEALEIFHMMQRSGQKPDLITFTSVLSACSQS 1429
            +CC D M A NLV WNA++ GYAMHGK KE +E+FHMM +SGQKP+L+TFT VLSAC+Q+
Sbjct: 284  RCCFDKMSAPNLVSWNAVMSGYAMHGKAKETMEMFHMMLQSGQKPNLVTFTCVLSACAQN 343

Query: 1428 GLTEEGWYHFNSMSVEHGIKARVEHYACMVNLLGRAGKLKEAYSMIQKMPFEPDACVWGA 1249
            GLTEEGW ++NSMS EHG + ++EHYACMV LL R GKL+EAYS+I++MPFEPDACV GA
Sbjct: 344  GLTEEGWRYYNSMSEEHGFEPKMEHYACMVTLLSRVGKLEEAYSIIKEMPFEPDACVRGA 403

Query: 1248 LLSSCRVHNNLRLGEIAAKELFELEPKNPGNYILLSNIYASNGMWNDVDIVRGMMKSMGL 1069
            LLSSCRVHNNL LGEI A++LF LEP NPGNYI+LSNIYAS G+W++ + +R +MKS GL
Sbjct: 404  LLSSCRVHNNLSLGEITAEKLFLLEPTNPGNYIILSNIYASKGLWDEENRIREVMKSKGL 463

Query: 1068 RKNPGCSWIEIKNKVHMLLAGDKSHPHMAQIIEKLNKLSLEMKKSGFFPVTNFVLQDVEE 889
            RKNPG SWIE+ +K+HMLLAGD+SHP M  I+EKL+KL++EMKKSG+ P +NFV QDVEE
Sbjct: 464  RKNPGYSWIEVGHKIHMLLAGDQSHPQMKDILEKLDKLNMEMKKSGYLPKSNFVWQDVEE 523

Query: 888  QDKEQFLCGHSEKLAVVFGLLNTHQGSPLRVIKNLRICGDCHAFIKFLSSYEGREIFVRD 709
             DKEQ LCGHSEKLAVV GLLNT  G PL+VIKNLRIC DCHA IK +S  EGREI+VRD
Sbjct: 524  HDKEQILCGHSEKLAVVLGLLNTSPGQPLQVIKNLRICDDCHAVIKVISRLEGREIYVRD 583

Query: 708  TNLFHHFKDGACSCGDFW 655
            TN  HHFKDG CSCGDFW
Sbjct: 584  TNRLHHFKDGVCSCGDFW 601



 Score = 58.9 bits (141), Expect = 6e-06
 Identities = 37/134 (27%), Positives = 64/134 (47%)
 Frame = -3

Query: 1578 NLVCWNAIIGGYAMHGKVKEALEIFHMMQRSGQKPDLITFTSVLSACSQSGLTEEGWYHF 1399
            N+V W +II   + +GK  EALE+F  MQ  G +P+ +T  S++ AC        G    
Sbjct: 193  NVVTWTSIIASCSQNGKDLEALELFRDMQADGVEPNAVTIPSLIPACGNISALMHG-KEI 251

Query: 1398 NSMSVEHGIKARVEHYACMVNLLGRAGKLKEAYSMIQKMPFEPDACVWGALLSSCRVHNN 1219
            +  S+  GI   V   + ++++  + G+++ +     KM   P+   W A++S   +H  
Sbjct: 252  HCFSLRRGIFDDVYVGSALIDMYAKCGRIQLSRCCFDKMS-APNLVSWNAVMSGYAMHGK 310

Query: 1218 LRLGEIAAKELFEL 1177
                   AKE  E+
Sbjct: 311  -------AKETMEM 317


>ref|XP_006416418.1| hypothetical protein EUTSA_v10009574mg [Eutrema salsugineum]
            gi|557094189|gb|ESQ34771.1| hypothetical protein
            EUTSA_v10009574mg [Eutrema salsugineum]
          Length = 760

 Score =  476 bits (1226), Expect = e-131
 Identities = 218/318 (68%), Positives = 268/318 (84%)
 Frame = -3

Query: 1608 QCCSDGMPARNLVCWNAIIGGYAMHGKVKEALEIFHMMQRSGQKPDLITFTSVLSACSQS 1429
            Q   D MP RNLVCWN+++ GY+MHGK KE + IF  + R+  KPD I+FTS+LSACSQ 
Sbjct: 443  QMVFDMMPTRNLVCWNSLMSGYSMHGKAKEVMSIFDSLVRTRLKPDFISFTSLLSACSQV 502

Query: 1428 GLTEEGWYHFNSMSVEHGIKARVEHYACMVNLLGRAGKLKEAYSMIQKMPFEPDACVWGA 1249
            GLT+EGW +F  M+ E+GIK R+EHY+CMV+LLGRAGKL+EAY +I+++PFEPD+CVWGA
Sbjct: 503  GLTDEGWKYFGMMTEEYGIKPRLEHYSCMVSLLGRAGKLQEAYDLIKEIPFEPDSCVWGA 562

Query: 1248 LLSSCRVHNNLRLGEIAAKELFELEPKNPGNYILLSNIYASNGMWNDVDIVRGMMKSMGL 1069
            LL+SCR+ NN+ L EIAA++LF+LEP+NPG Y+LLSNIYA+ GMW +VD VR  M+S+GL
Sbjct: 563  LLNSCRLQNNVDLAEIAAEKLFDLEPENPGTYVLLSNIYAAKGMWAEVDSVRNKMESLGL 622

Query: 1068 RKNPGCSWIEIKNKVHMLLAGDKSHPHMAQIIEKLNKLSLEMKKSGFFPVTNFVLQDVEE 889
            +KNPGCSWI++KNKV+ LLAGDKSHP + QI EK++++S EM+KSG  P  +F LQDVEE
Sbjct: 623  KKNPGCSWIQVKNKVYTLLAGDKSHPQIEQITEKMDEISKEMRKSGHRPNLDFALQDVEE 682

Query: 888  QDKEQFLCGHSEKLAVVFGLLNTHQGSPLRVIKNLRICGDCHAFIKFLSSYEGREIFVRD 709
            Q+KEQ L GHSEKLAVVFGLLNT  G+PL+VIKNLRICGDCH+ IKF+S Y GREIFVRD
Sbjct: 683  QEKEQILLGHSEKLAVVFGLLNTPDGTPLQVIKNLRICGDCHSVIKFISGYAGREIFVRD 742

Query: 708  TNLFHHFKDGACSCGDFW 655
            TN FHHFKDG CSCGDFW
Sbjct: 743  TNRFHHFKDGICSCGDFW 760


>ref|XP_002890375.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297336217|gb|EFH66634.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 760

 Score =  474 bits (1219), Expect = e-131
 Identities = 214/312 (68%), Positives = 265/312 (84%)
 Frame = -3

Query: 1590 MPARNLVCWNAIIGGYAMHGKVKEALEIFHMMQRSGQKPDLITFTSVLSACSQSGLTEEG 1411
            MP +NLVCWN+++ GY+MHGK KE + IF  + R+  KPD I+FTS+LSAC Q GLT+EG
Sbjct: 449  MPTKNLVCWNSLMNGYSMHGKAKEVMSIFESLMRTRLKPDFISFTSLLSACGQVGLTDEG 508

Query: 1410 WYHFNSMSVEHGIKARVEHYACMVNLLGRAGKLKEAYSMIQKMPFEPDACVWGALLSSCR 1231
            W +FN MS E+GIK R+EHY+CMVNLLGRAGKL+EAY +I+++PFEPD+CVWGALL+SCR
Sbjct: 509  WKYFNMMSEEYGIKPRLEHYSCMVNLLGRAGKLQEAYDLIKEIPFEPDSCVWGALLNSCR 568

Query: 1230 VHNNLRLGEIAAKELFELEPKNPGNYILLSNIYASNGMWNDVDIVRGMMKSMGLRKNPGC 1051
            + NN+ L EIAA++LF LEP+NPG Y+L+SNIYA+ GMW +VD +R  M+S+GL+KNPGC
Sbjct: 569  LQNNVDLAEIAAQKLFHLEPENPGTYVLMSNIYAAKGMWTEVDSIRNKMESLGLKKNPGC 628

Query: 1050 SWIEIKNKVHMLLAGDKSHPHMAQIIEKLNKLSLEMKKSGFFPVTNFVLQDVEEQDKEQF 871
            SWI++KNKV+ LLA DKSHP + QI EK++++S EM+KSG  P  +F LQDVEEQ++EQ 
Sbjct: 629  SWIQVKNKVYTLLACDKSHPQIDQITEKMDEISEEMRKSGHRPNLDFALQDVEEQEQEQM 688

Query: 870  LCGHSEKLAVVFGLLNTHQGSPLRVIKNLRICGDCHAFIKFLSSYEGREIFVRDTNLFHH 691
            L GHSEKLAVVFGLLNT  G+PL+VIKNLRICGDCHA IKF+SSY GREIF+RDTN FHH
Sbjct: 689  LWGHSEKLAVVFGLLNTPDGTPLQVIKNLRICGDCHAVIKFISSYAGREIFIRDTNRFHH 748

Query: 690  FKDGACSCGDFW 655
            FKDG CSCGDFW
Sbjct: 749  FKDGICSCGDFW 760



 Score = 61.2 bits (147), Expect = 1e-06
 Identities = 37/128 (28%), Positives = 61/128 (47%), Gaps = 6/128 (4%)
 Frame = -3

Query: 1578 NLVCWNAIIGGYAMHGKVKEALEIFHMMQRSGQKPDLITFTSVLSACSQSGLTEEGWYHF 1399
            N+V W +II G A +GK  EALE+F  MQ +G KP+ +T  S+L AC        G    
Sbjct: 352  NVVSWTSIIAGCAQNGKDIEALELFREMQVAGVKPNRVTIPSMLPACGNIAALGHG---- 407

Query: 1398 NSMSVEHGIKARVEHY------ACMVNLLGRAGKLKEAYSMIQKMPFEPDACVWGALLSS 1237
                  HG   RV         + ++++  + G++K +  +   MP +   C W +L++ 
Sbjct: 408  ---RSTHGFAVRVHLLDDVHVGSALIDMYAKCGRIKMSQIVFNMMPTKNLVC-WNSLMNG 463

Query: 1236 CRVHNNLR 1213
              +H   +
Sbjct: 464  YSMHGKAK 471


>ref|NP_173449.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|193806503|sp|Q9LNU6.2|PPR53_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At1g20230 gi|332191832|gb|AEE29953.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 760

 Score =  473 bits (1217), Expect = e-130
 Identities = 213/312 (68%), Positives = 264/312 (84%)
 Frame = -3

Query: 1590 MPARNLVCWNAIIGGYAMHGKVKEALEIFHMMQRSGQKPDLITFTSVLSACSQSGLTEEG 1411
            MP +NLVCWN+++ G++MHGK KE + IF  + R+  KPD I+FTS+LSAC Q GLT+EG
Sbjct: 449  MPTKNLVCWNSLMNGFSMHGKAKEVMSIFESLMRTRLKPDFISFTSLLSACGQVGLTDEG 508

Query: 1410 WYHFNSMSVEHGIKARVEHYACMVNLLGRAGKLKEAYSMIQKMPFEPDACVWGALLSSCR 1231
            W +F  MS E+GIK R+EHY+CMVNLLGRAGKL+EAY +I++MPFEPD+CVWGALL+SCR
Sbjct: 509  WKYFKMMSEEYGIKPRLEHYSCMVNLLGRAGKLQEAYDLIKEMPFEPDSCVWGALLNSCR 568

Query: 1230 VHNNLRLGEIAAKELFELEPKNPGNYILLSNIYASNGMWNDVDIVRGMMKSMGLRKNPGC 1051
            + NN+ L EIAA++LF LEP+NPG Y+LLSNIYA+ GMW +VD +R  M+S+GL+KNPGC
Sbjct: 569  LQNNVDLAEIAAEKLFHLEPENPGTYVLLSNIYAAKGMWTEVDSIRNKMESLGLKKNPGC 628

Query: 1050 SWIEIKNKVHMLLAGDKSHPHMAQIIEKLNKLSLEMKKSGFFPVTNFVLQDVEEQDKEQF 871
            SWI++KN+V+ LLAGDKSHP + QI EK++++S EM+KSG  P  +F L DVEEQ++EQ 
Sbjct: 629  SWIQVKNRVYTLLAGDKSHPQIDQITEKMDEISKEMRKSGHRPNLDFALHDVEEQEQEQM 688

Query: 870  LCGHSEKLAVVFGLLNTHQGSPLRVIKNLRICGDCHAFIKFLSSYEGREIFVRDTNLFHH 691
            L GHSEKLAVVFGLLNT  G+PL+VIKNLRICGDCHA IKF+SSY GREIF+RDTN FHH
Sbjct: 689  LWGHSEKLAVVFGLLNTPDGTPLQVIKNLRICGDCHAVIKFISSYAGREIFIRDTNRFHH 748

Query: 690  FKDGACSCGDFW 655
            FKDG CSCGDFW
Sbjct: 749  FKDGICSCGDFW 760



 Score = 58.9 bits (141), Expect = 6e-06
 Identities = 34/122 (27%), Positives = 61/122 (50%)
 Frame = -3

Query: 1578 NLVCWNAIIGGYAMHGKVKEALEIFHMMQRSGQKPDLITFTSVLSACSQSGLTEEGWYHF 1399
            N+V W +II G A +GK  EALE+F  MQ +G KP+ +T  S+L AC        G    
Sbjct: 352  NVVSWTSIIAGCAQNGKDIEALELFREMQVAGVKPNHVTIPSMLPACGNIAALGHG-RST 410

Query: 1398 NSMSVEHGIKARVEHYACMVNLLGRAGKLKEAYSMIQKMPFEPDACVWGALLSSCRVHNN 1219
            +  +V   +   V   + ++++  + G++  +  +   MP +   C W +L++   +H  
Sbjct: 411  HGFAVRVHLLDNVHVGSALIDMYAKCGRINLSQIVFNMMPTKNLVC-WNSLMNGFSMHGK 469

Query: 1218 LR 1213
             +
Sbjct: 470  AK 471


>ref|XP_006306841.1| hypothetical protein CARUB_v10008385mg [Capsella rubella]
            gi|482575552|gb|EOA39739.1| hypothetical protein
            CARUB_v10008385mg [Capsella rubella]
          Length = 760

 Score =  471 bits (1211), Expect = e-130
 Identities = 212/312 (67%), Positives = 266/312 (85%)
 Frame = -3

Query: 1590 MPARNLVCWNAIIGGYAMHGKVKEALEIFHMMQRSGQKPDLITFTSVLSACSQSGLTEEG 1411
            MP +NLVCWN+++ GY+MHGK KE + IF  + R+  KPD I+FTS+L++C Q GLT+EG
Sbjct: 449  MPTKNLVCWNSLMNGYSMHGKAKEVMSIFESLLRTRLKPDFISFTSLLASCGQVGLTDEG 508

Query: 1410 WYHFNSMSVEHGIKARVEHYACMVNLLGRAGKLKEAYSMIQKMPFEPDACVWGALLSSCR 1231
            W +F+ MS E+GIK R+EHY+CMVNLLGRAGKL+EAY +I++MPFEPD+CVWGALL+SCR
Sbjct: 509  WKYFSMMSEEYGIKPRLEHYSCMVNLLGRAGKLQEAYELIKEMPFEPDSCVWGALLNSCR 568

Query: 1230 VHNNLRLGEIAAKELFELEPKNPGNYILLSNIYASNGMWNDVDIVRGMMKSMGLRKNPGC 1051
            + +N+ L EIAA +LF+LEP+NPG Y+LLSNIYA+ GMW +VD +R  M+S+GL+KNPGC
Sbjct: 569  LQSNVDLAEIAADKLFDLEPENPGTYVLLSNIYAAKGMWTEVDSIRNKMESLGLKKNPGC 628

Query: 1050 SWIEIKNKVHMLLAGDKSHPHMAQIIEKLNKLSLEMKKSGFFPVTNFVLQDVEEQDKEQF 871
            SWI++KN+V+ LLAGDKSHP + QI EK++++S EM+KSG  P  +F LQDVEEQ++EQ 
Sbjct: 629  SWIQVKNRVYTLLAGDKSHPQIDQITEKMDEISEEMRKSGHRPNLDFALQDVEEQEQEQM 688

Query: 870  LCGHSEKLAVVFGLLNTHQGSPLRVIKNLRICGDCHAFIKFLSSYEGREIFVRDTNLFHH 691
            L GHSEKLAVVFGLLNT  G+PL+VIKNLRICGDCH+ IKF+SSY GREIFVRDTN FHH
Sbjct: 689  LWGHSEKLAVVFGLLNTPDGTPLQVIKNLRICGDCHSVIKFISSYAGREIFVRDTNRFHH 748

Query: 690  FKDGACSCGDFW 655
            FKDG CSCGDFW
Sbjct: 749  FKDGICSCGDFW 760



 Score = 61.2 bits (147), Expect = 1e-06
 Identities = 43/161 (26%), Positives = 75/161 (46%), Gaps = 8/161 (4%)
 Frame = -3

Query: 1578 NLVCWNAIIGGYAMHGKVKEALEIFHMMQRSGQKPDLITFTSVLSACSQSGLTEEGWYHF 1399
            N+V W +II G A +GK  EALE+F  MQ +G KP+ +T  S+L AC        G    
Sbjct: 352  NVVSWTSIIAGCAQNGKDIEALELFREMQVAGVKPNRVTIPSMLPACGNIAALGHG---- 407

Query: 1398 NSMSVEHGIKARVEHY------ACMVNLLGRAGKLKEAYSMIQKMPFEPDACVWGALLSS 1237
                  HG   RV  +      + ++++  + G++  +  +   MP +   C W +L++ 
Sbjct: 408  ---RSTHGFAVRVHLWDDVHVGSALIDMYAKCGRINMSQFVFNMMPTKNLVC-WNSLMNG 463

Query: 1236 CRVHNNLRLGEIAAKELFE--LEPKNPGNYILLSNIYASNG 1120
              +H   +        +FE  L  +   ++I  +++ AS G
Sbjct: 464  YSMHGKAK----EVMSIFESLLRTRLKPDFISFTSLLASCG 500


>ref|XP_004515286.1| PREDICTED: pentatricopeptide repeat-containing protein At1g20230-like
            [Cicer arietinum]
          Length = 730

 Score =  462 bits (1189), Expect = e-127
 Identities = 217/316 (68%), Positives = 256/316 (81%)
 Frame = -3

Query: 1602 CSDGMPARNLVCWNAIIGGYAMHGKVKEALEIFHMMQRSGQKPDLITFTSVLSACSQSGL 1423
            C D MPA+NLV WN+++ GYAMHGK +E +E+F+MM +SGQKPDLITFT VLSAC+Q+GL
Sbjct: 430  CFDIMPAKNLVSWNSVMSGYAMHGKARETIEMFNMMLQSGQKPDLITFTCVLSACTQNGL 489

Query: 1422 TEEGWYHFNSMSVEHGIKARVEHYACMVNLLGRAGKLKEAYSMIQKMPFEPDACVWGALL 1243
             EEGW +FNSMS EH ++ R+EHYA               YS++++MPFEPDACVWG+LL
Sbjct: 490  IEEGWNYFNSMSKEHDVEPRMEHYA---------------YSIVKEMPFEPDACVWGSLL 534

Query: 1242 SSCRVHNNLRLGEIAAKELFELEPKNPGNYILLSNIYASNGMWNDVDIVRGMMKSMGLRK 1063
            SSCRVH NL LGEIAA++LF LEP NPGNY+LLSNIYAS GMW + + +R MMK+ GLRK
Sbjct: 535  SSCRVHKNLSLGEIAAEKLFVLEPDNPGNYVLLSNIYASKGMWGEENRIRNMMKNKGLRK 594

Query: 1062 NPGCSWIEIKNKVHMLLAGDKSHPHMAQIIEKLNKLSLEMKKSGFFPVTNFVLQDVEEQD 883
            NPGCSWIEI  +VH LL+GDKSHP M +I+EK +KLS+E+KKSG+ P+TN VLQDVEEQD
Sbjct: 595  NPGCSWIEIGRRVHTLLSGDKSHPQMKEILEKSDKLSIEIKKSGYLPMTNTVLQDVEEQD 654

Query: 882  KEQFLCGHSEKLAVVFGLLNTHQGSPLRVIKNLRICGDCHAFIKFLSSYEGREIFVRDTN 703
            KEQ LCGHSEKLAVV GLLNT  G PL+VIKNLRIC DCHA IK +S  E REI+VRDTN
Sbjct: 655  KEQILCGHSEKLAVVLGLLNTSPGQPLQVIKNLRICDDCHAVIKVISRLEAREIYVRDTN 714

Query: 702  LFHHFKDGACSCGDFW 655
             FHHFKDG CSC DFW
Sbjct: 715  RFHHFKDGVCSCEDFW 730


>gb|EPS74339.1| hypothetical protein M569_00411 [Genlisea aurea]
          Length = 1063

 Score =  460 bits (1183), Expect = e-127
 Identities = 215/320 (67%), Positives = 265/320 (82%), Gaps = 2/320 (0%)
 Frame = -3

Query: 1608 QCCSDGMPARNLVCWNAIIGGYAMHGKVKEALEIFHMMQRSGQKPDLITFTSVLSACSQS 1429
            +C  + MP RNLVCWNA++G Y+MHG+ KEA+ +F  MQR GQKPD ++FTS+LSACSQS
Sbjct: 744  RCLFERMPVRNLVCWNAMLGAYSMHGEAKEAIGLFQSMQRCGQKPDSVSFTSLLSACSQS 803

Query: 1428 GLTEEGWYHFNSMSVEHGIKARVEHYACMVNLLGRAGKLKEAYSMIQKMPFEPDACVWGA 1249
            GL EEG  +F SM  +HG++ R+EHYAC+V LLGRAGKL EAY+ I++MPFE DACVWGA
Sbjct: 804  GLAEEGRRYFESMFEDHGLEPRLEHYACIVGLLGRAGKLDEAYAKIKRMPFEADACVWGA 863

Query: 1248 LLSSCRVHNNLRLGEIAAKELFELEPKNPGNYILLSNIYASNGMWNDVDIVRGMMKSMGL 1069
            LLSSC +HNN  LGE+AA++LFELE  N GNYILLSNIYAS+  W +V  +R MM   G+
Sbjct: 864  LLSSCALHNNEFLGEVAAEKLFELELGNSGNYILLSNIYASSRKWKEVRRIRDMMSLKGM 923

Query: 1068 RKNPGCSWIEIKNKVHMLLAGDKSHPHMAQIIEKLNKLSLEMK-KSGFFPVTNFVLQDVE 892
            +KNPGCSWIE+KNKVHM+LAGDK+ P +++I+E+L +L+ EMK   G+FP TN+VLQDVE
Sbjct: 924  KKNPGCSWIEVKNKVHMILAGDKALPQVSKIMERLKRLNQEMKGAGGYFPNTNYVLQDVE 983

Query: 891  EQ-DKEQFLCGHSEKLAVVFGLLNTHQGSPLRVIKNLRICGDCHAFIKFLSSYEGREIFV 715
            EQ ++E  LCGHSEKLAVVFG+LNT +GSP+RV KNLRICGDCHA IKF+S +EGREI V
Sbjct: 984  EQEEREGILCGHSEKLAVVFGILNTSRGSPIRVTKNLRICGDCHAVIKFISGFEGREISV 1043

Query: 714  RDTNLFHHFKDGACSCGDFW 655
            RDTN +HHFKDG CSCGD+W
Sbjct: 1044 RDTNRYHHFKDGICSCGDYW 1063



 Score = 63.5 bits (153), Expect = 3e-07
 Identities = 37/122 (30%), Positives = 60/122 (49%)
 Frame = -3

Query: 1578 NLVCWNAIIGGYAMHGKVKEALEIFHMMQRSGQKPDLITFTSVLSACSQSGLTEEGWYHF 1399
            NLV W + I   + HG+  EAL +F  MQ SG KP+ +T  S+L AC        G    
Sbjct: 653  NLVSWTSAISCCSQHGRDMEALGLFREMQFSGVKPNAVTIPSLLPACGNIAALSYG-KAV 711

Query: 1398 NSMSVEHGIKARVEHYACMVNLLGRAGKLKEAYSMIQKMPFEPDACVWGALLSSCRVHNN 1219
            +  S+ + I   V   + ++++    GK+K A  + ++MP     C W A+L +  +H  
Sbjct: 712  HCFSLRNNICNDVYVGSALIDMYANCGKIKAARCLFERMPVRNLVC-WNAMLGAYSMHGE 770

Query: 1218 LR 1213
             +
Sbjct: 771  AK 772


>ref|XP_006655248.1| PREDICTED: pentatricopeptide repeat-containing protein At1g20230-like
            [Oryza brachyantha]
          Length = 584

 Score =  457 bits (1176), Expect = e-126
 Identities = 211/314 (67%), Positives = 257/314 (81%)
 Frame = -3

Query: 1596 DGMPARNLVCWNAIIGGYAMHGKVKEALEIFHMMQRSGQKPDLITFTSVLSACSQSGLTE 1417
            D MP+RN+V WNA+IGGYAMHG+   ALE+FH MQ S +KPDL+TFT VL ACSQ+G TE
Sbjct: 271  DAMPSRNVVSWNAMIGGYAMHGEATNALELFHSMQSSKEKPDLVTFTCVLGACSQAGRTE 330

Query: 1416 EGWYHFNSMSVEHGIKARVEHYACMVNLLGRAGKLKEAYSMIQKMPFEPDACVWGALLSS 1237
            EG ++FN M  +HGI  R+EHYACMV LLGRAGKL +AY +I +MPFEPD+C+WG+LL S
Sbjct: 331  EGRHYFNEMQDKHGISPRMEHYACMVTLLGRAGKLDDAYDVINQMPFEPDSCIWGSLLGS 390

Query: 1236 CRVHNNLRLGEIAAKELFELEPKNPGNYILLSNIYASNGMWNDVDIVRGMMKSMGLRKNP 1057
            CRVH N+ L EIAA+ LF+LEP+N GNY+LLSNIYAS  MW+ V+ VR MMK++GL+K  
Sbjct: 391  CRVHGNVVLAEIAAENLFQLEPENAGNYVLLSNIYASKKMWDGVNRVRDMMKNVGLKKEK 450

Query: 1056 GCSWIEIKNKVHMLLAGDKSHPHMAQIIEKLNKLSLEMKKSGFFPVTNFVLQDVEEQDKE 877
            GCSWI+IK+KVHMLLAGD SHP +A I EKL  LS+EM++ GF P T++VL DVEEQ+K+
Sbjct: 451  GCSWIQIKDKVHMLLAGDSSHPMIAAITEKLKHLSIEMRRLGFAPSTDYVLHDVEEQEKD 510

Query: 876  QFLCGHSEKLAVVFGLLNTHQGSPLRVIKNLRICGDCHAFIKFLSSYEGREIFVRDTNLF 697
              L  HSEKLAV  GL++T QG+P+RVIKNLRICGDCH  IKF+SS+E REI+VRDTN F
Sbjct: 511  DILSVHSEKLAVALGLISTSQGTPIRVIKNLRICGDCHEAIKFISSFEEREIYVRDTNRF 570

Query: 696  HHFKDGACSCGDFW 655
            HHFKDG CSC D+W
Sbjct: 571  HHFKDGKCSCADYW 584


>emb|CAN63659.1| hypothetical protein VITISV_008415 [Vitis vinifera]
          Length = 760

 Score =  449 bits (1155), Expect = e-123
 Identities = 212/268 (79%), Positives = 240/268 (89%)
 Frame = -3

Query: 1602 CSDGMPARNLVCWNAIIGGYAMHGKVKEALEIFHMMQRSGQKPDLITFTSVLSACSQSGL 1423
            C DG+P +NLVCWNA+I GYAMHGK KEA+EIF +MQRSGQKPD+I+FT VLSACSQSGL
Sbjct: 443  CFDGIPTKNLVCWNAVIAGYAMHGKAKEAMEIFDLMQRSGQKPDIISFTCVLSACSQSGL 502

Query: 1422 TEEGWYHFNSMSVEHGIKARVEHYACMVNLLGRAGKLKEAYSMIQKMPFEPDACVWGALL 1243
            TEEG Y+FNSMS ++GI+ARVEHYACMV LL RAGKL++AY+MI++MP  PDACVWGALL
Sbjct: 503  TEEGSYYFNSMSSKYGIEARVEHYACMVTLLSRAGKLEQAYAMIRRMPVNPDACVWGALL 562

Query: 1242 SSCRVHNNLRLGEIAAKELFELEPKNPGNYILLSNIYASNGMWNDVDIVRGMMKSMGLRK 1063
            SSCRVHNN+ LGE+AA++LFELEP NPGNYILLSNIYAS GMWN+V+ VR MMK+ GLRK
Sbjct: 563  SSCRVHNNVSLGEVAAEKLFELEPSNPGNYILLSNIYASKGMWNEVNRVRDMMKNKGLRK 622

Query: 1062 NPGCSWIEIKNKVHMLLAGDKSHPHMAQIIEKLNKLSLEMKKSGFFPVTNFVLQDVEEQD 883
            NPGCSWIE+KNKVHMLLAGDKSHP M QIIE L+KLS+EMKK G+FP  NFVLQDVEEQD
Sbjct: 623  NPGCSWIEVKNKVHMLLAGDKSHPQMTQIIENLDKLSMEMKKLGYFPEINFVLQDVEEQD 682

Query: 882  KEQFLCGHSEKLAVVFGLLNTHQGSPLR 799
            KEQ LCGHSEKLAVVFGLLNT  G PL+
Sbjct: 683  KEQILCGHSEKLAVVFGLLNTPPGYPLQ 710

