BLASTX nr result

ID: Ziziphus21_contig00016372 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ziziphus21_contig00016372
         (1662 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007204496.1| hypothetical protein PRUPE_ppa019323mg [Prun...   541   e-151
ref|XP_010110548.1| hypothetical protein L484_023382 [Morus nota...   539   e-150
ref|XP_009345148.1| PREDICTED: pentatricopeptide repeat-containi...   538   e-150
ref|XP_008240720.1| PREDICTED: pentatricopeptide repeat-containi...   537   e-149
ref|XP_008392809.1| PREDICTED: pentatricopeptide repeat-containi...   536   e-149
ref|XP_010662700.1| PREDICTED: pentatricopeptide repeat-containi...   501   e-139
emb|CAN69520.1| hypothetical protein VITISV_018331 [Vitis vinifera]   498   e-138
ref|XP_011031992.1| PREDICTED: pentatricopeptide repeat-containi...   494   e-136
ref|XP_004301723.1| PREDICTED: pentatricopeptide repeat-containi...   494   e-136
ref|XP_006480449.1| PREDICTED: pentatricopeptide repeat-containi...   489   e-135
ref|XP_006428630.1| hypothetical protein CICLE_v10011185mg [Citr...   489   e-135
ref|XP_006385578.1| hypothetical protein POPTR_0003s08270g [Popu...   488   e-135
gb|KDO39066.1| hypothetical protein CISIN_1g048743mg, partial [C...   486   e-134
ref|XP_007048805.1| Pentatricopeptide repeat superfamily protein...   480   e-132
gb|KRH04247.1| hypothetical protein GLYMA_17G149100 [Glycine max]     460   e-126
ref|XP_003550925.1| PREDICTED: pentatricopeptide repeat-containi...   460   e-126
ref|XP_012075523.1| PREDICTED: pentatricopeptide repeat-containi...   456   e-125
gb|KDP34852.1| hypothetical protein JCGZ_09140 [Jatropha curcas]      456   e-125
gb|KJB08223.1| hypothetical protein B456_001G071600 [Gossypium r...   452   e-124
ref|XP_012469457.1| PREDICTED: pentatricopeptide repeat-containi...   452   e-124

>ref|XP_007204496.1| hypothetical protein PRUPE_ppa019323mg [Prunus persica]
            gi|462400027|gb|EMJ05695.1| hypothetical protein
            PRUPE_ppa019323mg [Prunus persica]
          Length = 659

 Score =  541 bits (1393), Expect = e-151
 Identities = 274/395 (69%), Positives = 330/395 (83%)
 Frame = -2

Query: 1661 HFKFNDIGAAAELVLDFCRYQESVPVQRDRKELEKAYLVPIGSHNLRTPLKMQIRPALLH 1482
            HFKFNDI AA ELVL  C Y ES+P+QRDRK  +++YLVPIGSHNL++ L MQI P LL 
Sbjct: 267  HFKFNDIEAATELVLQMCDYHESLPIQRDRKISQRSYLVPIGSHNLKSGLNMQILPELLL 326

Query: 1481 KDSVLKLEGRQELVMFRNGQLVLSNKALAKLICGYKKVGNISEMSKILLQIQKELCSIRG 1302
             DSVLK+EG+QELV+  NG+LVLSN+ALAKLI GYKK G+  ++S+ILL+IQKELCS+RG
Sbjct: 327  CDSVLKIEGKQELVLCWNGKLVLSNRALAKLINGYKKGGDTCKLSEILLKIQKELCSLRG 386

Query: 1301 SSLCSDVIGACIHLGWLEIAHDILDDMEAAGAPMDYTNYMSLLIAYSKGKMFREAKALLK 1122
            S LCSDVI ACI+LGWLE AHD+LDDM+AAGAPM  T +MSLL AY +GKMFREAKAL+K
Sbjct: 387  SRLCSDVIDACINLGWLETAHDLLDDMDAAGAPMGLTAFMSLLEAYYRGKMFREAKALIK 446

Query: 1121 QMRKAGIVMNLSDEVVLTAYQLEVVDNSAPYMNASSLTGKSNLAESFVQGIREEEKAIPP 942
            QMRKAG + +LSDE+V++  Q  ++D S+   N SS T KS+LA + VQ +R+E+ A   
Sbjct: 447  QMRKAGFLSSLSDEMVVSKCQ-PILDTSSTCTNVSSSTSKSDLANALVQEMRDEKDA--S 503

Query: 941  MVYEINSSIYFFCKAKMIGDALRTYRKMQDMKIQPTLQTFTNLVCGYSSLGMYRDITILW 762
            +VY+ NSSI FFCKAKM+ DAL+TYR+MQ+MKIQPT QTFT L+ GYSSLGM R ITILW
Sbjct: 504  VVYQFNSSINFFCKAKMMDDALKTYRRMQEMKIQPTEQTFTYLLYGYSSLGMIRTITILW 563

Query: 761  GDIKRNMDGGTLVVSRDLYEYLLMNFLQGGYFERVMEVIGLMEESGMYTDKWMYRSEFLK 582
            GDIKRNM+ G LVV+RDLYEYLL+NFL+GGYFERVMEV  LM+E GMYTDKW+YRSEF+K
Sbjct: 564  GDIKRNMESGNLVVNRDLYEYLLLNFLRGGYFERVMEVTDLMKEHGMYTDKWLYRSEFVK 623

Query: 581  LHKSLYRNLKASEARTEAQKNRLRYVLAFRKWAGI 477
            LHK+LYRNLKASEARTE Q+ R++YV  FRKWAG+
Sbjct: 624  LHKNLYRNLKASEARTETQRKRIKYVERFRKWAGV 658


>ref|XP_010110548.1| hypothetical protein L484_023382 [Morus notabilis]
            gi|587940145|gb|EXC26766.1| hypothetical protein
            L484_023382 [Morus notabilis]
          Length = 718

 Score =  539 bits (1388), Expect = e-150
 Identities = 267/395 (67%), Positives = 327/395 (82%)
 Frame = -2

Query: 1661 HFKFNDIGAAAELVLDFCRYQESVPVQRDRKELEKAYLVPIGSHNLRTPLKMQIRPALLH 1482
            HFKFNDI AAA LV + CRY+ES+P++ ++K  +K + +PIGSHNL+  LK+QI+P LL 
Sbjct: 324  HFKFNDIDAAAGLVWNMCRYRESLPIKSEKKNPQKIFHIPIGSHNLKAGLKLQIQPELLQ 383

Query: 1481 KDSVLKLEGRQELVMFRNGQLVLSNKALAKLICGYKKVGNISEMSKILLQIQKELCSIRG 1302
            KD+VLK+E +QELV+FRNG+LVLSN+ALAK I G+K+ GNIS++SK+LL IQKE CS+RG
Sbjct: 384  KDTVLKVESKQELVIFRNGKLVLSNRALAKFIKGFKRDGNISQLSKLLLGIQKESCSLRG 443

Query: 1301 SSLCSDVIGACIHLGWLEIAHDILDDMEAAGAPMDYTNYMSLLIAYSKGKMFREAKALLK 1122
            S LCSDVI ACI LGWLE AHDILDDMEA+  P+    YMSLL AY K KM REAKALLK
Sbjct: 444  SDLCSDVIEACIRLGWLEYAHDILDDMEASQTPVGCATYMSLLTAYFKRKMLREAKALLK 503

Query: 1121 QMRKAGIVMNLSDEVVLTAYQLEVVDNSAPYMNASSLTGKSNLAESFVQGIREEEKAIPP 942
            +MRKAGI  +L D++V+ A   E+ ++++   N S+LT K +L ESF+Q +R EE A+P 
Sbjct: 504  KMRKAGITTHLPDKMVVIACLSEIANDNSLSFNVSTLTDKLDLVESFIQEMRNEE-AVPS 562

Query: 941  MVYEINSSIYFFCKAKMIGDALRTYRKMQDMKIQPTLQTFTNLVCGYSSLGMYRDITILW 762
            ++YE NSSIYFFCKAKMI DA+RTYR+MQ+ KIQ T++TFTNLVCGYSSLGMYRDITILW
Sbjct: 563  LLYEFNSSIYFFCKAKMIEDAVRTYRRMQETKIQLTVETFTNLVCGYSSLGMYRDITILW 622

Query: 761  GDIKRNMDGGTLVVSRDLYEYLLMNFLQGGYFERVMEVIGLMEESGMYTDKWMYRSEFLK 582
            GD+KRNM+ G+L V+RDLYEYLL++FLQGGYFER MEV   M +  M+ DKWMY++EFLK
Sbjct: 623  GDMKRNMECGSLSVNRDLYEYLLISFLQGGYFERAMEVSEYMNKYNMFADKWMYKTEFLK 682

Query: 581  LHKSLYRNLKASEARTEAQKNRLRYVLAFRKWAGI 477
            LHK LYRNLKASEARTEAQ+NRLRYVLAFRKW GI
Sbjct: 683  LHKKLYRNLKASEARTEAQRNRLRYVLAFRKWVGI 717


>ref|XP_009345148.1| PREDICTED: pentatricopeptide repeat-containing protein At4g17616
            [Pyrus x bretschneideri]
          Length = 740

 Score =  538 bits (1385), Expect = e-150
 Identities = 268/395 (67%), Positives = 328/395 (83%)
 Frame = -2

Query: 1661 HFKFNDIGAAAELVLDFCRYQESVPVQRDRKELEKAYLVPIGSHNLRTPLKMQIRPALLH 1482
            HFKFNDI AA ELVL  C Y  S+PVQRDRK   K+Y VPIGSHNL++ L+MQI P LL 
Sbjct: 346  HFKFNDIEAATELVLQMCDYHVSLPVQRDRKNSHKSYNVPIGSHNLKSGLQMQILPELLQ 405

Query: 1481 KDSVLKLEGRQELVMFRNGQLVLSNKALAKLICGYKKVGNISEMSKILLQIQKELCSIRG 1302
            KDSVLK+EG+ ELV++ NG+LVLSN+ALAKL+ GY+K G+  ++SKILL++QKELCS RG
Sbjct: 406  KDSVLKVEGKHELVIYWNGKLVLSNRALAKLVNGYRKGGDTCKLSKILLKMQKELCSSRG 465

Query: 1301 SSLCSDVIGACIHLGWLEIAHDILDDMEAAGAPMDYTNYMSLLIAYSKGKMFREAKALLK 1122
            S LC+DVI ACIHLGWLE AHD+LDD++AAG+P+  T +MSLL AY   KMF EAKAL+K
Sbjct: 466  SGLCTDVIDACIHLGWLETAHDLLDDLDAAGSPLGLTPFMSLLTAYYNEKMFLEAKALIK 525

Query: 1121 QMRKAGIVMNLSDEVVLTAYQLEVVDNSAPYMNASSLTGKSNLAESFVQGIREEEKAIPP 942
            QMR AG++ NLSDE+V++  +  +VD+SA + NASS T KS+LA + VQ +R+E+K IP 
Sbjct: 526  QMRNAGLLENLSDEMVVSKCR-SIVDSSAMFTNASSSTSKSDLANALVQEMRDEKKEIPS 584

Query: 941  MVYEINSSIYFFCKAKMIGDALRTYRKMQDMKIQPTLQTFTNLVCGYSSLGMYRDITILW 762
             VY+ NSSI FFCKAKMI DAL+TYR+M +MKIQPT QTFT L+ GY SLGM+R +TILW
Sbjct: 585  TVYQFNSSINFFCKAKMIDDALKTYRRMHEMKIQPTEQTFTYLLYGYYSLGMFRAMTILW 644

Query: 761  GDIKRNMDGGTLVVSRDLYEYLLMNFLQGGYFERVMEVIGLMEESGMYTDKWMYRSEFLK 582
            GDIKRN++ G LVVSRDLYEYLL+NF++GGYFERVMEVI  M++ GMY DKW+YRSEF+K
Sbjct: 645  GDIKRNIESGNLVVSRDLYEYLLLNFIRGGYFERVMEVIDYMKKRGMYIDKWLYRSEFVK 704

Query: 581  LHKSLYRNLKASEARTEAQKNRLRYVLAFRKWAGI 477
            LHK+LYRNLKASEA+TEAQ+ RL+YV AFRKWA I
Sbjct: 705  LHKNLYRNLKASEAKTEAQRKRLKYVEAFRKWADI 739


>ref|XP_008240720.1| PREDICTED: pentatricopeptide repeat-containing protein At4g17616
            [Prunus mume]
          Length = 718

 Score =  537 bits (1383), Expect = e-149
 Identities = 270/395 (68%), Positives = 329/395 (83%)
 Frame = -2

Query: 1661 HFKFNDIGAAAELVLDFCRYQESVPVQRDRKELEKAYLVPIGSHNLRTPLKMQIRPALLH 1482
            HFKFNDI AA ELVL  C Y ES+P+QRDRK  +++YLVPIGSHNL++ L MQI P LL 
Sbjct: 326  HFKFNDIEAAIELVLQMCNYHESLPIQRDRKISQRSYLVPIGSHNLKSGLNMQILPELLL 385

Query: 1481 KDSVLKLEGRQELVMFRNGQLVLSNKALAKLICGYKKVGNISEMSKILLQIQKELCSIRG 1302
             DSVLK+EG+QELV++ NG+L LSN+ALAKLI GY++  +  ++S+ILL++QKELCS+RG
Sbjct: 386  CDSVLKIEGKQELVLYWNGKLALSNRALAKLINGYRRGRDTCKLSEILLKMQKELCSLRG 445

Query: 1301 SSLCSDVIGACIHLGWLEIAHDILDDMEAAGAPMDYTNYMSLLIAYSKGKMFREAKALLK 1122
            S LCSDVI ACI+LGWLE AHD+LDDM+AAGAPM  T +MSLL AY +GKMFREAKALLK
Sbjct: 446  SRLCSDVIDACINLGWLETAHDLLDDMDAAGAPMGLTAFMSLLEAYYRGKMFREAKALLK 505

Query: 1121 QMRKAGIVMNLSDEVVLTAYQLEVVDNSAPYMNASSLTGKSNLAESFVQGIREEEKAIPP 942
            QMRKAG + ++ DE++++  Q  ++D S+   N SS T KS+LA + VQ +R+E+ A   
Sbjct: 506  QMRKAGFLSSIPDEMIVSKCQ-PILDTSSTCTNVSSATSKSDLANALVQEMRDEKDA--S 562

Query: 941  MVYEINSSIYFFCKAKMIGDALRTYRKMQDMKIQPTLQTFTNLVCGYSSLGMYRDITILW 762
            +VY+ NSSI FFCKAKM+ DAL+TYR+MQ+MKIQPT QTFT L+ GYSSLGM R ITILW
Sbjct: 563  VVYQFNSSINFFCKAKMMDDALKTYRRMQEMKIQPTEQTFTYLLYGYSSLGMIRTITILW 622

Query: 761  GDIKRNMDGGTLVVSRDLYEYLLMNFLQGGYFERVMEVIGLMEESGMYTDKWMYRSEFLK 582
            GDIKRNM+ G LVVSRDLYEYLL+NFL+GGYFERVMEVI  M+E GMYTDKW+YRSEF+K
Sbjct: 623  GDIKRNMESGNLVVSRDLYEYLLLNFLRGGYFERVMEVIDFMKEHGMYTDKWLYRSEFVK 682

Query: 581  LHKSLYRNLKASEARTEAQKNRLRYVLAFRKWAGI 477
            LHK+LYRNLKASEARTEAQ+ RL+YV  FRKWAG+
Sbjct: 683  LHKNLYRNLKASEARTEAQRKRLKYVEKFRKWAGV 717


>ref|XP_008392809.1| PREDICTED: pentatricopeptide repeat-containing protein At4g17616
            [Malus domestica] gi|658000706|ref|XP_008392811.1|
            PREDICTED: pentatricopeptide repeat-containing protein
            At4g17616 [Malus domestica]
          Length = 714

 Score =  536 bits (1380), Expect = e-149
 Identities = 268/395 (67%), Positives = 327/395 (82%)
 Frame = -2

Query: 1661 HFKFNDIGAAAELVLDFCRYQESVPVQRDRKELEKAYLVPIGSHNLRTPLKMQIRPALLH 1482
            HFKFNDI AA ELVL  C Y ES+PVQRDRK   K+Y VPIGSHNL++ L+MQI P LL 
Sbjct: 320  HFKFNDIEAATELVLQMCDYHESLPVQRDRKNSHKSYNVPIGSHNLKSGLQMQILPELLQ 379

Query: 1481 KDSVLKLEGRQELVMFRNGQLVLSNKALAKLICGYKKVGNISEMSKILLQIQKELCSIRG 1302
            KDSVLK+EG+ ELV++ NG+LVLSN+ALAKL+ GY+K G+   +SKILL++QKELCS RG
Sbjct: 380  KDSVLKVEGKHELVIYWNGKLVLSNRALAKLVNGYRKGGDTCNLSKILLKMQKELCSSRG 439

Query: 1301 SSLCSDVIGACIHLGWLEIAHDILDDMEAAGAPMDYTNYMSLLIAYSKGKMFREAKALLK 1122
            S LCSDVI ACIHL WLE AHD+LDD++AAG+P+  T +MSLL AY K KMF EAKAL+K
Sbjct: 440  SGLCSDVIDACIHLXWLETAHDLLDDLDAAGSPLGLTPFMSLLTAYYKEKMFLEAKALIK 499

Query: 1121 QMRKAGIVMNLSDEVVLTAYQLEVVDNSAPYMNASSLTGKSNLAESFVQGIREEEKAIPP 942
            QMR AG++ NLSDE+V++     +VD+SA + NASS T KS+LA + +Q +R+E+K IP 
Sbjct: 500  QMRNAGLLENLSDEMVVSKCG-SIVDSSAMFTNASSSTVKSDLANALLQEMRDEKKEIPS 558

Query: 941  MVYEINSSIYFFCKAKMIGDALRTYRKMQDMKIQPTLQTFTNLVCGYSSLGMYRDITILW 762
             VY+ NSSI FFCKAKMI DAL+TYR++ +MKIQPT QTFT L+ GY SLGM+R +TILW
Sbjct: 559  TVYQFNSSINFFCKAKMIDDALKTYRRLHEMKIQPTEQTFTYLLYGYYSLGMFRAMTILW 618

Query: 761  GDIKRNMDGGTLVVSRDLYEYLLMNFLQGGYFERVMEVIGLMEESGMYTDKWMYRSEFLK 582
            GDIKRN++   LVVSRDLYEYLL+NF++GGYFERVMEVI  M++ GMYTDKW+YRSEF+K
Sbjct: 619  GDIKRNIESSNLVVSRDLYEYLLLNFIRGGYFERVMEVIDYMKKRGMYTDKWLYRSEFVK 678

Query: 581  LHKSLYRNLKASEARTEAQKNRLRYVLAFRKWAGI 477
            LHK+LYRNLKASEA+TEAQ+ RL+YV AFRKWA I
Sbjct: 679  LHKNLYRNLKASEAKTEAQRKRLKYVEAFRKWADI 713


>ref|XP_010662700.1| PREDICTED: pentatricopeptide repeat-containing protein At4g17616
            [Vitis vinifera]
          Length = 716

 Score =  501 bits (1291), Expect = e-139
 Identities = 255/395 (64%), Positives = 306/395 (77%)
 Frame = -2

Query: 1661 HFKFNDIGAAAELVLDFCRYQESVPVQRDRKELEKAYLVPIGSHNLRTPLKMQIRPALLH 1482
            HFKFNDI  AA LVLD CR  +S+ +Q+DR +  K  LVPIGS+ L+  LK+QI P LL 
Sbjct: 322  HFKFNDIDGAAGLVLDMCRCWDSLSIQKDRNDPHKTCLVPIGSYYLKEGLKLQIVPELLQ 381

Query: 1481 KDSVLKLEGRQELVMFRNGQLVLSNKALAKLICGYKKVGNISEMSKILLQIQKELCSIRG 1302
            KDSV K++ +QEL++FRNG+ VLSNKALAKLI  YK+ G I E+S+++L +QKEL ++ G
Sbjct: 382  KDSVFKMDSKQELLLFRNGKYVLSNKALAKLIIAYKRDGRIGELSRLMLSLQKELGTLEG 441

Query: 1301 SSLCSDVIGACIHLGWLEIAHDILDDMEAAGAPMDYTNYMSLLIAYSKGKMFREAKALLK 1122
              L SDVI ACI LGWLE AHDILDDME AGAP     YMSLL AY KGKM REAKALLK
Sbjct: 442  G-LISDVIDACIQLGWLETAHDILDDMELAGAPASSITYMSLLTAYYKGKMVREAKALLK 500

Query: 1121 QMRKAGIVMNLSDEVVLTAYQLEVVDNSAPYMNASSLTGKSNLAESFVQGIREEEKAIPP 942
            QMRKAG++++LSDE+V+T     VVD +  +   S+   KS LAES V+ ++++EKAI P
Sbjct: 501  QMRKAGLIVDLSDEMVMTTCLSGVVDKNRMHTRTSTSIWKSGLAESLVREMKKQEKAILP 560

Query: 941  MVYEINSSIYFFCKAKMIGDALRTYRKMQDMKIQPTLQTFTNLVCGYSSLGMYRDITILW 762
            +VY+ NSSIYFFCKAKMI DALR Y +MQ+MKI+PT+QTF NLV GYS L MYR+ITILW
Sbjct: 561  VVYKFNSSIYFFCKAKMIDDALRIYGRMQEMKIEPTVQTFINLVYGYSCLNMYREITILW 620

Query: 761  GDIKRNMDGGTLVVSRDLYEYLLMNFLQGGYFERVMEVIGLMEESGMYTDKWMYRSEFLK 582
            GDIK +   G+LVV RDLYE+L++NFL+GGYFERVMEVIG M+E  MY DKWMY+ EFLK
Sbjct: 621  GDIKSSRKSGSLVVCRDLYEFLVLNFLRGGYFERVMEVIGCMKEQNMYCDKWMYKREFLK 680

Query: 581  LHKSLYRNLKASEARTEAQKNRLRYVLAFRKWAGI 477
             HK LYRNLKAS  RTEAQ  RL YV AFR WAGI
Sbjct: 681  FHKDLYRNLKASNTRTEAQSKRLEYVEAFRTWAGI 715


>emb|CAN69520.1| hypothetical protein VITISV_018331 [Vitis vinifera]
          Length = 444

 Score =  498 bits (1283), Expect = e-138
 Identities = 254/395 (64%), Positives = 305/395 (77%)
 Frame = -2

Query: 1661 HFKFNDIGAAAELVLDFCRYQESVPVQRDRKELEKAYLVPIGSHNLRTPLKMQIRPALLH 1482
            HFKFNDI  AA LVLD CR  +S+ +Q+DR +  K  LVPI S+ L+  LK+QI P LL 
Sbjct: 50   HFKFNDIDGAAGLVLDMCRCWDSLSIQKDRNDPHKTCLVPIESYYLKEGLKLQIVPELLQ 109

Query: 1481 KDSVLKLEGRQELVMFRNGQLVLSNKALAKLICGYKKVGNISEMSKILLQIQKELCSIRG 1302
            KDSV K++ +QEL++FRNG+ VLSNKALAKLI  YK+ G I E+S+++L +QKEL ++ G
Sbjct: 110  KDSVFKMDSKQELLLFRNGKYVLSNKALAKLIIAYKRDGRIGELSRLMLSLQKELGTLEG 169

Query: 1301 SSLCSDVIGACIHLGWLEIAHDILDDMEAAGAPMDYTNYMSLLIAYSKGKMFREAKALLK 1122
              L SDVI ACI LGWLE AHDILDDME AGAP     YMSLL AY KGKM REAKALLK
Sbjct: 170  G-LISDVIDACIQLGWLETAHDILDDMELAGAPASSITYMSLLTAYYKGKMVREAKALLK 228

Query: 1121 QMRKAGIVMNLSDEVVLTAYQLEVVDNSAPYMNASSLTGKSNLAESFVQGIREEEKAIPP 942
            QMRKAG++++LSDE+V+T     VVD +  +   S+   KS LAES V+ ++++EKAI P
Sbjct: 229  QMRKAGLIVDLSDEMVMTTCLSGVVDKNRMHTRTSTSIWKSGLAESLVREMKKQEKAILP 288

Query: 941  MVYEINSSIYFFCKAKMIGDALRTYRKMQDMKIQPTLQTFTNLVCGYSSLGMYRDITILW 762
            +VY+ NSSIYFFCKAKMI DALR Y +MQ+MKI+PT+QTF NLV GYS L MYR+ITILW
Sbjct: 289  VVYKFNSSIYFFCKAKMIDDALRIYGRMQEMKIEPTVQTFINLVYGYSCLNMYREITILW 348

Query: 761  GDIKRNMDGGTLVVSRDLYEYLLMNFLQGGYFERVMEVIGLMEESGMYTDKWMYRSEFLK 582
            GDIK +   G+LVV RDLYE+L++NFL+GGYFERVMEVIG M+E  MY DKWMY+ EFLK
Sbjct: 349  GDIKSSRKSGSLVVCRDLYEFLVLNFLRGGYFERVMEVIGCMKEQNMYCDKWMYKREFLK 408

Query: 581  LHKSLYRNLKASEARTEAQKNRLRYVLAFRKWAGI 477
             HK LYRNLKAS  RTEAQ  RL YV AFR WAGI
Sbjct: 409  FHKDLYRNLKASNTRTEAQSKRLEYVEAFRTWAGI 443


>ref|XP_011031992.1| PREDICTED: pentatricopeptide repeat-containing protein At4g17616-like
            [Populus euphratica]
          Length = 701

 Score =  494 bits (1271), Expect = e-136
 Identities = 250/395 (63%), Positives = 318/395 (80%)
 Frame = -2

Query: 1661 HFKFNDIGAAAELVLDFCRYQESVPVQRDRKELEKAYLVPIGSHNLRTPLKMQIRPALLH 1482
            HFKF+DI +AA+L+LD  ++QE VP ++ R + EK  LVPIGS+NL+T LK+Q+ P LL 
Sbjct: 312  HFKFDDIDSAAQLLLDMHKFQEPVPNKKLRMDQEKRLLVPIGSNNLKTGLKIQVMPELLQ 371

Query: 1481 KDSVLKLEGRQELVMFRNGQLVLSNKALAKLICGYKKVGNISEMSKILLQIQKELCSIRG 1302
            KDS+L+++ +QELVMFR+G+L+LSN+ALAKL+ GY++ G  ++ SK+LL +Q++   +  
Sbjct: 372  KDSILRVKHKQELVMFRSGKLLLSNRALAKLVNGYRRHGRTTDFSKLLLCMQQDFHVLGQ 431

Query: 1301 SSLCSDVIGACIHLGWLEIAHDILDDMEAAGAPMDYTNYMSLLIAYSKGKMFREAKALLK 1122
            SS CSDVI ACI LGWLE+AHDILDDM+A+GAP+  T +M+LL AY   +MF+EAKALL+
Sbjct: 432  SSFCSDVIDACIRLGWLEMAHDILDDMDASGAPIGSTLHMALLTAYYCREMFKEAKALLR 491

Query: 1121 QMRKAGIVMNLSDEVVLTAYQLEVVDNSAPYMNASSLTGKSNLAESFVQGIREEEKAIPP 942
            +MRKAG V+NLSDE+V TA   E  +N      ASS + KS+L +S V+ +REEEKAIP 
Sbjct: 492  KMRKAGFVVNLSDEMVATACLSEAANN------ASSSSSKSDLIDSLVREMREEEKAIPS 545

Query: 941  MVYEINSSIYFFCKAKMIGDALRTYRKMQDMKIQPTLQTFTNLVCGYSSLGMYRDITILW 762
            +VYE+NSSIY+FCKAKM+ DAL+TY++MQ MKIQPT+QTF+ L+ G+SSLGMYRDITILW
Sbjct: 546  VVYELNSSIYYFCKAKMMEDALKTYKRMQHMKIQPTVQTFSYLIDGFSSLGMYRDITILW 605

Query: 761  GDIKRNMDGGTLVVSRDLYEYLLMNFLQGGYFERVMEVIGLMEESGMYTDKWMYRSEFLK 582
            GDIKRNM    L VSRDLYE L +NFL+GGYFER MEVIG M+E  MY DKWMY+ EFLK
Sbjct: 606  GDIKRNMGSKDLEVSRDLYEVLHLNFLRGGYFERAMEVIGYMKERNMYCDKWMYKDEFLK 665

Query: 581  LHKSLYRNLKASEARTEAQKNRLRYVLAFRKWAGI 477
            LHK+LYR+LKASEARTEAQ  RL +V AFRKW GI
Sbjct: 666  LHKNLYRSLKASEARTEAQSKRLEHVKAFRKWVGI 700


>ref|XP_004301723.1| PREDICTED: pentatricopeptide repeat-containing protein At4g17616
            [Fragaria vesca subsp. vesca]
            gi|764591024|ref|XP_011465204.1| PREDICTED:
            pentatricopeptide repeat-containing protein At4g17616
            [Fragaria vesca subsp. vesca]
          Length = 741

 Score =  494 bits (1271), Expect = e-136
 Identities = 259/413 (62%), Positives = 317/413 (76%), Gaps = 21/413 (5%)
 Frame = -2

Query: 1661 HFKFNDIGAAAELVLDFCRYQESVPVQRDRKELEKAYLVPIGSHNLRTPLKMQIRPALLH 1482
            HFKFND+ AA+EL+L  C  ++S+ +QRD+K  +++YLVPIGSHN ++ L MQI P LL 
Sbjct: 326  HFKFNDVVAASELILQMCDDRKSLLIQRDKKNSQRSYLVPIGSHNQKSGLNMQIVPELLQ 385

Query: 1481 KDSVLKLEGRQELVMFRNGQLVLSNKALAKLICGYKKVGNISEMSKILLQIQKELCSIRG 1302
            KDSVLKLEG+QELVM+ NG+LVLSN+ALAKLI  YK  G+ SE+SK+L +IQKELCS RG
Sbjct: 386  KDSVLKLEGKQELVMYLNGKLVLSNRALAKLITRYKIDGDTSELSKLLHKIQKELCSFRG 445

Query: 1301 SSLCSDVIGACIHLGWLEIAHDILDDMEAAGAPMDYTNYMSLLIAYSKGKMFREAKALLK 1122
            S L +DVI ACI LGWLE AHDILDDMEAA  PM Y+ +MSLL AY KGK+  EAKALLK
Sbjct: 446  SRLGNDVIDACIQLGWLETAHDILDDMEAAETPMGYSTFMSLLTAYYKGKLVPEAKALLK 505

Query: 1121 QMRKAGIVMNLSDEVVLTAYQLEVVDNSAPYMNASSLTGKSNLAESFVQGIREEE----- 957
            QMRKAG++++LSDE+V +   L VVD SA   +ASS T KS+LA + VQ  R+EE     
Sbjct: 506  QMRKAGLLVSLSDEMVASTC-LSVVDTSACCTSASSSTSKSDLANALVQESRDEEETPSR 564

Query: 956  ----------------KAIPPMVYEINSSIYFFCKAKMIGDALRTYRKMQDMKIQPTLQT 825
                            + I   VY+ NSSI FFCKAKMI DAL+TY++MQ++KI PT  T
Sbjct: 565  VSDLVNALVQETRDEKEGISSRVYQFNSSINFFCKAKMIDDALKTYKRMQELKIYPTELT 624

Query: 824  FTNLVCGYSSLGMYRDITILWGDIKRNMDGGTLVVSRDLYEYLLMNFLQGGYFERVMEVI 645
            FT ++  YSSLGM+R+IT LWGD+KRNM+ G LVVSRDLYEYLL++FL GGYFERVMEVI
Sbjct: 625  FTYMIKAYSSLGMFRNITFLWGDMKRNMENGNLVVSRDLYEYLLLDFLGGGYFERVMEVI 684

Query: 644  GLMEESGMYTDKWMYRSEFLKLHKSLYRNLKASEARTEAQKNRLRYVLAFRKW 486
              M++ GM+ DKWMYRSEF KLHK+LYRNLKASEART+AQ+ RL +V AFRK+
Sbjct: 685  SYMKKHGMFADKWMYRSEFEKLHKNLYRNLKASEARTDAQRKRLEFVQAFRKY 737


>ref|XP_006480449.1| PREDICTED: pentatricopeptide repeat-containing protein At4g17616-like
            isoform X1 [Citrus sinensis]
            gi|568853626|ref|XP_006480450.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At4g17616-like isoform X2 [Citrus sinensis]
          Length = 712

 Score =  489 bits (1260), Expect = e-135
 Identities = 258/395 (65%), Positives = 308/395 (77%)
 Frame = -2

Query: 1661 HFKFNDIGAAAELVLDFCRYQESVPVQRDRKELEKAYLVPIGSHNLRTPLKMQIRPALLH 1482
            HFKF+DI AA EL+LD  RY+E +P  + R++ +K YL+ IGS NLR  LK+QI P LL 
Sbjct: 319  HFKFDDIDAAGELILDMNRYREPLPNPKLRQDAQKPYLISIGSPNLRCGLKLQIMPELLE 378

Query: 1481 KDSVLKLEGRQELVMFRNGQLVLSNKALAKLICGYKKVGNISEMSKILLQIQKELCSIRG 1302
            KDS+LK+EG+QELV+FRNG+L+ SN+A+AKLI GYKK G  SE+S +LL I+KE  S   
Sbjct: 379  KDSILKMEGKQELVLFRNGKLLHSNRAMAKLINGYKKHGKNSELSGLLLSIKKEHHSFGE 438

Query: 1301 SSLCSDVIGACIHLGWLEIAHDILDDMEAAGAPMDYTNYMSLLIAYSKGKMFREAKALLK 1122
            S+LCSDVI A I LG+LE AHDILDDME AG PMD T Y SLL AY K KMFREA+ALLK
Sbjct: 439  STLCSDVIDALIQLGFLEAAHDILDDMEFAGHPMDSTTYKSLLTAYYKVKMFREAEALLK 498

Query: 1121 QMRKAGIVMNLSDEVVLTAYQLEVVDNSAPYMNASSLTGKSNLAESFVQGIREEEKAIPP 942
            QMRK+ +V NLS E++++    EV D SA + + SSL  KS+LAES +Q +REE  A   
Sbjct: 499  QMRKSCLVQNLSCEMIVSERFSEVADKSASFTDTSSLMDKSDLAESLIQEMREE--AALS 556

Query: 941  MVYEINSSIYFFCKAKMIGDALRTYRKMQDMKIQPTLQTFTNLVCGYSSLGMYRDITILW 762
            M+Y++NSSIYFFCK KMIGDAL+ YR+MQ+MKI+PT++TF  LV GYSSL MYRDITILW
Sbjct: 557  MIYKLNSSIYFFCKGKMIGDALKIYRRMQEMKIRPTVETFYYLVYGYSSLEMYRDITILW 616

Query: 761  GDIKRNMDGGTLVVSRDLYEYLLMNFLQGGYFERVMEVIGLMEESGMYTDKWMYRSEFLK 582
            GDIKRN++ G L VSRDLYE LL+NFLQGGYFERVMEVIG M++  MY DK MY+SEFLK
Sbjct: 617  GDIKRNIESGVLAVSRDLYETLLLNFLQGGYFERVMEVIGYMKKQNMYVDKLMYKSEFLK 676

Query: 581  LHKSLYRNLKASEARTEAQKNRLRYVLAFRKWAGI 477
             HK LYR LK S ARTEAQ  RL  V AFRKWAGI
Sbjct: 677  HHKHLYRRLKVSNARTEAQSKRLVNVQAFRKWAGI 711


>ref|XP_006428630.1| hypothetical protein CICLE_v10011185mg [Citrus clementina]
            gi|557530687|gb|ESR41870.1| hypothetical protein
            CICLE_v10011185mg [Citrus clementina]
          Length = 712

 Score =  489 bits (1258), Expect = e-135
 Identities = 258/395 (65%), Positives = 308/395 (77%)
 Frame = -2

Query: 1661 HFKFNDIGAAAELVLDFCRYQESVPVQRDRKELEKAYLVPIGSHNLRTPLKMQIRPALLH 1482
            HFKF+DI AA EL+LD  RY+E +P  + R++ +K YL+ IGS NLR  LK+QI P LL 
Sbjct: 319  HFKFDDIDAAGELILDMNRYREPLPNPKLRQDAQKPYLISIGSPNLRCGLKLQIMPELLE 378

Query: 1481 KDSVLKLEGRQELVMFRNGQLVLSNKALAKLICGYKKVGNISEMSKILLQIQKELCSIRG 1302
            KDS+LK+EG+QELV+FRNG+L+ SN+A+AKLI GYKK G  SE+S +LL I+KE  S   
Sbjct: 379  KDSILKMEGKQELVLFRNGKLLHSNRAMAKLINGYKKHGKNSELSGLLLSIKKEHHSFGE 438

Query: 1301 SSLCSDVIGACIHLGWLEIAHDILDDMEAAGAPMDYTNYMSLLIAYSKGKMFREAKALLK 1122
            S+LCSDVI A I LG+LE AHDILDDME AG PMD T Y SLL AY K KMFREA+ALLK
Sbjct: 439  STLCSDVIDALIQLGFLEAAHDILDDMEFAGHPMDSTTYKSLLTAYYKVKMFREAEALLK 498

Query: 1121 QMRKAGIVMNLSDEVVLTAYQLEVVDNSAPYMNASSLTGKSNLAESFVQGIREEEKAIPP 942
            QMRK+ +V NLS E++++    EV D SA + + SSL  KS+LAES +Q +REE  A   
Sbjct: 499  QMRKSCLVQNLSCEMIVSERFSEVEDKSASFTDTSSLMDKSDLAESLIQEMREE--AALS 556

Query: 941  MVYEINSSIYFFCKAKMIGDALRTYRKMQDMKIQPTLQTFTNLVCGYSSLGMYRDITILW 762
            M+Y++NSSIYFFCK KMIGDAL+ YR+MQ+MKI+PT++TF  LV GYSSL MYRDITILW
Sbjct: 557  MIYKLNSSIYFFCKGKMIGDALKIYRRMQEMKIRPTVETFYYLVYGYSSLEMYRDITILW 616

Query: 761  GDIKRNMDGGTLVVSRDLYEYLLMNFLQGGYFERVMEVIGLMEESGMYTDKWMYRSEFLK 582
            GDIKRN++ G L VSRDLYE LL+NFLQGGYFERVMEVIG M++  MY DK MY+SEFLK
Sbjct: 617  GDIKRNIESGVLAVSRDLYETLLLNFLQGGYFERVMEVIGYMKKQNMYVDKLMYKSEFLK 676

Query: 581  LHKSLYRNLKASEARTEAQKNRLRYVLAFRKWAGI 477
             HK LYR LK S ARTEAQ  RL  V AFRKWAGI
Sbjct: 677  HHKHLYRRLKVSNARTEAQSKRLVNVQAFRKWAGI 711


>ref|XP_006385578.1| hypothetical protein POPTR_0003s08270g [Populus trichocarpa]
            gi|550342705|gb|ERP63375.1| hypothetical protein
            POPTR_0003s08270g [Populus trichocarpa]
          Length = 701

 Score =  488 bits (1257), Expect = e-135
 Identities = 248/395 (62%), Positives = 316/395 (80%)
 Frame = -2

Query: 1661 HFKFNDIGAAAELVLDFCRYQESVPVQRDRKELEKAYLVPIGSHNLRTPLKMQIRPALLH 1482
            HFKF+DI +AA+L+LD  ++QESVP ++ R + EK  LVPIGS+NL+T LK+Q+ P LL 
Sbjct: 312  HFKFDDIDSAAQLLLDMHKFQESVPNKKLRMDQEKRLLVPIGSNNLKTGLKIQVMPELLQ 371

Query: 1481 KDSVLKLEGRQELVMFRNGQLVLSNKALAKLICGYKKVGNISEMSKILLQIQKELCSIRG 1302
            KDS+L ++ +QELVMFR+G+L+LSN+ALAKL+ GY++ G  +++SK+LL +Q++   +  
Sbjct: 372  KDSILTVKHKQELVMFRSGKLLLSNRALAKLVNGYRRHGRTTDLSKLLLCMQQDFHVLGQ 431

Query: 1301 SSLCSDVIGACIHLGWLEIAHDILDDMEAAGAPMDYTNYMSLLIAYSKGKMFREAKALLK 1122
            SS CSDVI ACI LGWLE+AHDILDDM+AAGAP+  T +M+LL AY   +MF+EAKALL+
Sbjct: 432  SSFCSDVIDACIRLGWLEMAHDILDDMDAAGAPIGSTLHMALLTAYYCREMFKEAKALLR 491

Query: 1121 QMRKAGIVMNLSDEVVLTAYQLEVVDNSAPYMNASSLTGKSNLAESFVQGIREEEKAIPP 942
            +MRKAG V+NLSDE+V TA   E  +N      ASS + KS+L +  V+ +REEEKAIP 
Sbjct: 492  KMRKAGFVVNLSDEMVATACLSEAANN------ASSSSSKSDLIDFLVREMREEEKAIPS 545

Query: 941  MVYEINSSIYFFCKAKMIGDALRTYRKMQDMKIQPTLQTFTNLVCGYSSLGMYRDITILW 762
            + YE+NSSIY+FCKAKM+ DAL+TY++MQ MKIQPT+QTF+ L+ G+SSLGMYRDITILW
Sbjct: 546  VGYELNSSIYYFCKAKMMEDALKTYKRMQHMKIQPTVQTFSYLIDGFSSLGMYRDITILW 605

Query: 761  GDIKRNMDGGTLVVSRDLYEYLLMNFLQGGYFERVMEVIGLMEESGMYTDKWMYRSEFLK 582
            GDIKRN+    L VSRDLYE L +NFL+GGYFER MEVIG M+E  MY DKWMY+ EFLK
Sbjct: 606  GDIKRNVGSKDLEVSRDLYEVLHLNFLRGGYFERAMEVIGYMKERNMYCDKWMYKDEFLK 665

Query: 581  LHKSLYRNLKASEARTEAQKNRLRYVLAFRKWAGI 477
             HK+LYR+LKASEARTEAQ  RL +V AFRKW GI
Sbjct: 666  FHKNLYRSLKASEARTEAQSKRLEHVKAFRKWVGI 700


>gb|KDO39066.1| hypothetical protein CISIN_1g048743mg, partial [Citrus sinensis]
          Length = 653

 Score =  486 bits (1250), Expect = e-134
 Identities = 257/395 (65%), Positives = 307/395 (77%)
 Frame = -2

Query: 1661 HFKFNDIGAAAELVLDFCRYQESVPVQRDRKELEKAYLVPIGSHNLRTPLKMQIRPALLH 1482
            HFKF+DI AA EL+LD  RY+E +P  + R++ +K YL+ IGS NLR  LK+QI P LL 
Sbjct: 260  HFKFDDIDAAGELILDMNRYREPLPNPKLRQDAQKPYLISIGSPNLRCGLKLQIMPELLE 319

Query: 1481 KDSVLKLEGRQELVMFRNGQLVLSNKALAKLICGYKKVGNISEMSKILLQIQKELCSIRG 1302
            KDS+LK+EG+QELV+FRNG+L+ SN+A+AKLI GYKK G  SE+S +LL I+KE  S   
Sbjct: 320  KDSILKMEGKQELVLFRNGKLLHSNRAMAKLINGYKKHGKNSELSWLLLSIKKEHHSFGE 379

Query: 1301 SSLCSDVIGACIHLGWLEIAHDILDDMEAAGAPMDYTNYMSLLIAYSKGKMFREAKALLK 1122
            S+LCSDVI A I LG+LE AHDILDDME AG PMD T Y SLL AY K KMFREA+ALLK
Sbjct: 380  STLCSDVIDALIQLGFLEAAHDILDDMELAGHPMDSTTYKSLLTAYYKVKMFREAEALLK 439

Query: 1121 QMRKAGIVMNLSDEVVLTAYQLEVVDNSAPYMNASSLTGKSNLAESFVQGIREEEKAIPP 942
            QMRK+ +V NLS E+V++    EV D SA + + SSL  KS+LAES +Q +REE  A   
Sbjct: 440  QMRKSCLVQNLSCEMVVSERFSEVADKSASFTDTSSLMDKSDLAESLIQEMREE--AALS 497

Query: 941  MVYEINSSIYFFCKAKMIGDALRTYRKMQDMKIQPTLQTFTNLVCGYSSLGMYRDITILW 762
             +Y++NSSIYFFCK KMIGDAL+ YR+MQ+MKI+PT++TF  LV G+SSL MYRDITILW
Sbjct: 498  TIYKLNSSIYFFCKGKMIGDALKIYRRMQEMKIRPTVETFYYLVYGHSSLEMYRDITILW 557

Query: 761  GDIKRNMDGGTLVVSRDLYEYLLMNFLQGGYFERVMEVIGLMEESGMYTDKWMYRSEFLK 582
            GDIKRN++ G L VSRDLYE LL+NFLQGGYFERVMEVIG M++  MY DK MY+SEFLK
Sbjct: 558  GDIKRNIESGVLAVSRDLYETLLLNFLQGGYFERVMEVIGYMKKQNMYVDKLMYKSEFLK 617

Query: 581  LHKSLYRNLKASEARTEAQKNRLRYVLAFRKWAGI 477
             HK LYR LK S ARTEAQ  RL  V AFRKWAGI
Sbjct: 618  HHKHLYRRLKVSNARTEAQSKRLVNVQAFRKWAGI 652


>ref|XP_007048805.1| Pentatricopeptide repeat superfamily protein, putative isoform 1
            [Theobroma cacao] gi|590710359|ref|XP_007048806.1|
            Pentatricopeptide repeat superfamily protein, putative
            isoform 1 [Theobroma cacao] gi|508701066|gb|EOX92962.1|
            Pentatricopeptide repeat superfamily protein, putative
            isoform 1 [Theobroma cacao] gi|508701067|gb|EOX92963.1|
            Pentatricopeptide repeat superfamily protein, putative
            isoform 1 [Theobroma cacao]
          Length = 708

 Score =  480 bits (1235), Expect = e-132
 Identities = 247/395 (62%), Positives = 305/395 (77%)
 Frame = -2

Query: 1661 HFKFNDIGAAAELVLDFCRYQESVPVQRDRKELEKAYLVPIGSHNLRTPLKMQIRPALLH 1482
            HFKF+DI AAAELVL+  R +ES P+   RK+ +K   VPIGS NLR  LK+QI P LL 
Sbjct: 313  HFKFDDIDAAAELVLEMNRSRESHPIGELRKDYQKPRFVPIGSQNLRNGLKIQIVPELLQ 372

Query: 1481 KDSVLKLEGRQELVMFRNGQLVLSNKALAKLICGYKKVGNISEMSKILLQIQKELCSIRG 1302
            KDS L  EG+ +L+M+R+ +L  SN+ALAKLI GYKK G I+E+SK LL +++ELCS  G
Sbjct: 373  KDSALIAEGKSDLIMYRDKKLCPSNRALAKLINGYKKHGKINELSKFLLSLKRELCSSGG 432

Query: 1301 SSLCSDVIGACIHLGWLEIAHDILDDMEAAGAPMDYTNYMSLLIAYSKGKMFREAKALLK 1122
            SSL SDVI ACI LGWLEIAHDIL+DME++G P+  + YM+LL AY K  M RE   LLK
Sbjct: 433  SSLFSDVIDACITLGWLEIAHDILEDMESSGDPLGLSTYMALLTAYYKRNMSREGNILLK 492

Query: 1121 QMRKAGIVMNLSDEVVLTAYQLEVVDNSAPYMNASSLTGKSNLAESFVQGIREEEKAIPP 942
            QMRK G+V+NLSDE+V++    E V  S+  +N SS   + +L ES V+ I E EKAI P
Sbjct: 493  QMRKVGLVLNLSDEIVISKNAPENVGRSSLCINESSSICQPSLMESLVREISEAEKAISP 552

Query: 941  MVYEINSSIYFFCKAKMIGDALRTYRKMQDMKIQPTLQTFTNLVCGYSSLGMYRDITILW 762
            ++YE+NSSIYFF KAKM+GDAL+ YR+MQ+MKIQPT+ TF  LVCGYSSL +YRDITILW
Sbjct: 553  ILYELNSSIYFFSKAKMMGDALKIYRRMQEMKIQPTVHTFAYLVCGYSSLKLYRDITILW 612

Query: 761  GDIKRNMDGGTLVVSRDLYEYLLMNFLQGGYFERVMEVIGLMEESGMYTDKWMYRSEFLK 582
            GDIK+ M+   L +S DLY  LL+NFLQGGYFERVMEVIG M++  MY DKWMY+SE+LK
Sbjct: 613  GDIKKAMESRNLSMSSDLYALLLLNFLQGGYFERVMEVIGYMKKGSMYIDKWMYKSEYLK 672

Query: 581  LHKSLYRNLKASEARTEAQKNRLRYVLAFRKWAGI 477
            +HK+LYR+LKAS+ARTEAQ  RL +V AF+KWAGI
Sbjct: 673  IHKNLYRSLKASQARTEAQGKRLDHVKAFKKWAGI 707


>gb|KRH04247.1| hypothetical protein GLYMA_17G149100 [Glycine max]
          Length = 574

 Score =  460 bits (1184), Expect = e-126
 Identities = 240/395 (60%), Positives = 287/395 (72%)
 Frame = -2

Query: 1661 HFKFNDIGAAAELVLDFCRYQESVPVQRDRKELEKAYLVPIGSHNLRTPLKMQIRPALLH 1482
            HFKFNDI AAA+LVLD          +   K L+K   + IGS  LRT LK+ I P LLH
Sbjct: 188  HFKFNDIDAAAKLVLDMTSSHNYDVKKECEKHLQKPCFIAIGSPFLRTVLKIHIEPELLH 247

Query: 1481 KDSVLKLEGRQELVMFRNGQLVLSNKALAKLICGYKKVGNISEMSKILLQIQKELCSIRG 1302
            KDSVLK+E RQ+L+ ++ G+LVLSN ALAK I GYKK G I E+SK+LL IQ EL S+ G
Sbjct: 248  KDSVLKVESRQDLIFYKGGKLVLSNSALAKFISGYKKYGRIGELSKLLLSIQGELNSVAG 307

Query: 1301 SSLCSDVIGACIHLGWLEIAHDILDDMEAAGAPMDYTNYMSLLIAYSKGKMFREAKALLK 1122
            SSLCSDVIGACI LGWLE AHDILDD+EA G+PM    YM L+ AY KG M RE KALLK
Sbjct: 308  SSLCSDVIGACIQLGWLECAHDILDDVEATGSPMGRDTYMLLVSAYQKGGMQRETKALLK 367

Query: 1121 QMRKAGIVMNLSDEVVLTAYQLEVVDNSAPYMNASSLTGKSNLAESFVQGIREEEKAIPP 942
            QM+K G+   LSD+ +      E   NS          GK++LA + VQ +++E++ + P
Sbjct: 368  QMKKVGLDKGLSDDAIDEHNLCEETLNSL---------GKADLAIALVQILKDEDQTVFP 418

Query: 941  MVYEINSSIYFFCKAKMIGDALRTYRKMQDMKIQPTLQTFTNLVCGYSSLGMYRDITILW 762
            +VY +NSSI+FFCKA MI DALR YR+M DMKIQPT QTF  L+CGYSSLGMYR+ITILW
Sbjct: 419  LVYNLNSSIFFFCKAGMIEDALRAYRRMVDMKIQPTSQTFAFLMCGYSSLGMYREITILW 478

Query: 761  GDIKRNMDGGTLVVSRDLYEYLLMNFLQGGYFERVMEVIGLMEESGMYTDKWMYRSEFLK 582
            GDIKR M  G LV +RDLYE LL+NFL+GGYFERV+EVI  M +  MY DKWMY++EFL+
Sbjct: 479  GDIKRFMRSGNLVGNRDLYELLLLNFLRGGYFERVLEVISHMRDHNMYPDKWMYKNEFLR 538

Query: 581  LHKSLYRNLKASEARTEAQKNRLRYVLAFRKWAGI 477
            LHK+LYR+LKAS  RTEAQ  RL +V  FRKW GI
Sbjct: 539  LHKNLYRSLKASNTRTEAQSKRLEHVQEFRKWVGI 573


>ref|XP_003550925.1| PREDICTED: pentatricopeptide repeat-containing protein At4g17616-like
            [Glycine max] gi|734314236|gb|KHN01639.1|
            Pentatricopeptide repeat-containing protein [Glycine
            soja] gi|947054793|gb|KRH04246.1| hypothetical protein
            GLYMA_17G149100 [Glycine max]
          Length = 684

 Score =  460 bits (1184), Expect = e-126
 Identities = 240/395 (60%), Positives = 287/395 (72%)
 Frame = -2

Query: 1661 HFKFNDIGAAAELVLDFCRYQESVPVQRDRKELEKAYLVPIGSHNLRTPLKMQIRPALLH 1482
            HFKFNDI AAA+LVLD          +   K L+K   + IGS  LRT LK+ I P LLH
Sbjct: 298  HFKFNDIDAAAKLVLDMTSSHNYDVKKECEKHLQKPCFIAIGSPFLRTVLKIHIEPELLH 357

Query: 1481 KDSVLKLEGRQELVMFRNGQLVLSNKALAKLICGYKKVGNISEMSKILLQIQKELCSIRG 1302
            KDSVLK+E RQ+L+ ++ G+LVLSN ALAK I GYKK G I E+SK+LL IQ EL S+ G
Sbjct: 358  KDSVLKVESRQDLIFYKGGKLVLSNSALAKFISGYKKYGRIGELSKLLLSIQGELNSVAG 417

Query: 1301 SSLCSDVIGACIHLGWLEIAHDILDDMEAAGAPMDYTNYMSLLIAYSKGKMFREAKALLK 1122
            SSLCSDVIGACI LGWLE AHDILDD+EA G+PM    YM L+ AY KG M RE KALLK
Sbjct: 418  SSLCSDVIGACIQLGWLECAHDILDDVEATGSPMGRDTYMLLVSAYQKGGMQRETKALLK 477

Query: 1121 QMRKAGIVMNLSDEVVLTAYQLEVVDNSAPYMNASSLTGKSNLAESFVQGIREEEKAIPP 942
            QM+K G+   LSD+ +      E   NS          GK++LA + VQ +++E++ + P
Sbjct: 478  QMKKVGLDKGLSDDAIDEHNLCEETLNSL---------GKADLAIALVQILKDEDQTVFP 528

Query: 941  MVYEINSSIYFFCKAKMIGDALRTYRKMQDMKIQPTLQTFTNLVCGYSSLGMYRDITILW 762
            +VY +NSSI+FFCKA MI DALR YR+M DMKIQPT QTF  L+CGYSSLGMYR+ITILW
Sbjct: 529  LVYNLNSSIFFFCKAGMIEDALRAYRRMVDMKIQPTSQTFAFLMCGYSSLGMYREITILW 588

Query: 761  GDIKRNMDGGTLVVSRDLYEYLLMNFLQGGYFERVMEVIGLMEESGMYTDKWMYRSEFLK 582
            GDIKR M  G LV +RDLYE LL+NFL+GGYFERV+EVI  M +  MY DKWMY++EFL+
Sbjct: 589  GDIKRFMRSGNLVGNRDLYELLLLNFLRGGYFERVLEVISHMRDHNMYPDKWMYKNEFLR 648

Query: 581  LHKSLYRNLKASEARTEAQKNRLRYVLAFRKWAGI 477
            LHK+LYR+LKAS  RTEAQ  RL +V  FRKW GI
Sbjct: 649  LHKNLYRSLKASNTRTEAQSKRLEHVQEFRKWVGI 683


>ref|XP_012075523.1| PREDICTED: pentatricopeptide repeat-containing protein At4g17616
            [Jatropha curcas] gi|802619714|ref|XP_012075524.1|
            PREDICTED: pentatricopeptide repeat-containing protein
            At4g17616 [Jatropha curcas]
          Length = 715

 Score =  456 bits (1174), Expect = e-125
 Identities = 232/395 (58%), Positives = 293/395 (74%)
 Frame = -2

Query: 1661 HFKFNDIGAAAELVLDFCRYQESVPVQRDRKELEKAYLVPIGSHNLRTPLKMQIRPALLH 1482
            HFKF+D+ +AAEL+LD  +++ S P +   K+++K YLV IGS NLR  LK+QI P LL 
Sbjct: 326  HFKFDDLDSAAELLLDMNKFRVSTPNKNSTKDIQKPYLVSIGSQNLRAGLKIQIMPELLQ 385

Query: 1481 KDSVLKLEGRQELVMFRNGQLVLSNKALAKLICGYKKVGNISEMSKILLQIQKELCSIRG 1302
            KDSV+KLE ++ELV+F NG+L+LSN+AL KLI GYK+ G ++E+ K+L+ +QK+   + G
Sbjct: 386  KDSVIKLEDKKELVIFENGKLLLSNRALTKLILGYKRHGRMAELPKVLVSMQKDFQKLGG 445

Query: 1301 SSLCSDVIGACIHLGWLEIAHDILDDMEAAGAPMDYTNYMSLLIAYSKGKMFREAKALLK 1122
            S+LC DVI ACI LGWLE AHDI+DDME +G P+    YM LL AY    M REA+ L +
Sbjct: 446  SNLCFDVIDACIRLGWLETAHDIVDDMETSGVPVGLNAYMVLLRAYYSRDMSREAEGLQR 505

Query: 1121 QMRKAGIVMNLSDEVVLTAYQLEVVDNSAPYMNASSLTGKSNLAESFVQGIREEEKAIPP 942
            QMRKAGIV N+  E+V +    +  DN+      SS   KS+LA+  VQ +R+E+  IPP
Sbjct: 506  QMRKAGIVTNIPGEIVASNDLSKTADNT------SSSVSKSDLADFLVQEMRKEDVVIPP 559

Query: 941  MVYEINSSIYFFCKAKMIGDALRTYRKMQDMKIQPTLQTFTNLVCGYSSLGMYRDITILW 762
            +VYE+NSSIYFFCKAKM+ DAL+TYR+MQ + IQPT QTF  L  GYSSLG YRDIT+LW
Sbjct: 560  VVYELNSSIYFFCKAKMMVDALKTYRRMQVVGIQPTEQTFAYLAYGYSSLGRYRDITVLW 619

Query: 761  GDIKRNMDGGTLVVSRDLYEYLLMNFLQGGYFERVMEVIGLMEESGMYTDKWMYRSEFLK 582
            GDIKRNM   ++ VSRDLYE LLMNFLQGGYFERVMEVI  M+E  M+ DK MY+SEFLK
Sbjct: 620  GDIKRNMKNKSMAVSRDLYETLLMNFLQGGYFERVMEVISYMKEHNMHMDKEMYKSEFLK 679

Query: 581  LHKSLYRNLKASEARTEAQKNRLRYVLAFRKWAGI 477
            LHK+LY+ +KAS  R E Q+ RL +V  FRKW GI
Sbjct: 680  LHKNLYKGVKASTVRNEVQRKRLEFVQTFRKWVGI 714


>gb|KDP34852.1| hypothetical protein JCGZ_09140 [Jatropha curcas]
          Length = 691

 Score =  456 bits (1174), Expect = e-125
 Identities = 232/395 (58%), Positives = 293/395 (74%)
 Frame = -2

Query: 1661 HFKFNDIGAAAELVLDFCRYQESVPVQRDRKELEKAYLVPIGSHNLRTPLKMQIRPALLH 1482
            HFKF+D+ +AAEL+LD  +++ S P +   K+++K YLV IGS NLR  LK+QI P LL 
Sbjct: 302  HFKFDDLDSAAELLLDMNKFRVSTPNKNSTKDIQKPYLVSIGSQNLRAGLKIQIMPELLQ 361

Query: 1481 KDSVLKLEGRQELVMFRNGQLVLSNKALAKLICGYKKVGNISEMSKILLQIQKELCSIRG 1302
            KDSV+KLE ++ELV+F NG+L+LSN+AL KLI GYK+ G ++E+ K+L+ +QK+   + G
Sbjct: 362  KDSVIKLEDKKELVIFENGKLLLSNRALTKLILGYKRHGRMAELPKVLVSMQKDFQKLGG 421

Query: 1301 SSLCSDVIGACIHLGWLEIAHDILDDMEAAGAPMDYTNYMSLLIAYSKGKMFREAKALLK 1122
            S+LC DVI ACI LGWLE AHDI+DDME +G P+    YM LL AY    M REA+ L +
Sbjct: 422  SNLCFDVIDACIRLGWLETAHDIVDDMETSGVPVGLNAYMVLLRAYYSRDMSREAEGLQR 481

Query: 1121 QMRKAGIVMNLSDEVVLTAYQLEVVDNSAPYMNASSLTGKSNLAESFVQGIREEEKAIPP 942
            QMRKAGIV N+  E+V +    +  DN+      SS   KS+LA+  VQ +R+E+  IPP
Sbjct: 482  QMRKAGIVTNIPGEIVASNDLSKTADNT------SSSVSKSDLADFLVQEMRKEDVVIPP 535

Query: 941  MVYEINSSIYFFCKAKMIGDALRTYRKMQDMKIQPTLQTFTNLVCGYSSLGMYRDITILW 762
            +VYE+NSSIYFFCKAKM+ DAL+TYR+MQ + IQPT QTF  L  GYSSLG YRDIT+LW
Sbjct: 536  VVYELNSSIYFFCKAKMMVDALKTYRRMQVVGIQPTEQTFAYLAYGYSSLGRYRDITVLW 595

Query: 761  GDIKRNMDGGTLVVSRDLYEYLLMNFLQGGYFERVMEVIGLMEESGMYTDKWMYRSEFLK 582
            GDIKRNM   ++ VSRDLYE LLMNFLQGGYFERVMEVI  M+E  M+ DK MY+SEFLK
Sbjct: 596  GDIKRNMKNKSMAVSRDLYETLLMNFLQGGYFERVMEVISYMKEHNMHMDKEMYKSEFLK 655

Query: 581  LHKSLYRNLKASEARTEAQKNRLRYVLAFRKWAGI 477
            LHK+LY+ +KAS  R E Q+ RL +V  FRKW GI
Sbjct: 656  LHKNLYKGVKASTVRNEVQRKRLEFVQTFRKWVGI 690


>gb|KJB08223.1| hypothetical protein B456_001G071600 [Gossypium raimondii]
          Length = 575

 Score =  452 bits (1162), Expect = e-124
 Identities = 231/395 (58%), Positives = 298/395 (75%)
 Frame = -2

Query: 1661 HFKFNDIGAAAELVLDFCRYQESVPVQRDRKELEKAYLVPIGSHNLRTPLKMQIRPALLH 1482
            HFKF+DI AAAEL+LD  R + S P+    K+ +K   VPIGS NLR  LK+QI P L+H
Sbjct: 180  HFKFDDIDAAAELLLDMNRSRGSHPMDDPGKDSQKPRFVPIGSQNLRNGLKIQIMPELIH 239

Query: 1481 KDSVLKLEGRQELVMFRNGQLVLSNKALAKLICGYKKVGNISEMSKILLQIQKELCSIRG 1302
            KDS LK EG+ +LV+FR+ +L+ SN+AL+KLI GYK+ G + E+SK LL ++KEL S   
Sbjct: 240  KDSALKEEGKSDLVLFRDKKLLPSNRALSKLINGYKRHGKMDELSKFLLGLKKELYSSGE 299

Query: 1301 SSLCSDVIGACIHLGWLEIAHDILDDMEAAGAPMDYTNYMSLLIAYSKGKMFREAKALLK 1122
            SS+  DVI ACI LGW+EIAHDILDDME++G  +D + YM+LL AY K  M REA  LLK
Sbjct: 300  SSVICDVIDACISLGWVEIAHDILDDMESSGDSLDSSAYMALLTAYYKRNMSREANVLLK 359

Query: 1121 QMRKAGIVMNLSDEVVLTAYQLEVVDNSAPYMNASSLTGKSNLAESFVQGIREEEKAIPP 942
            Q+RKAG+V+NL++ +VL+      V  S   +  +S   + +L++  V+ + + EKA+  
Sbjct: 360  QVRKAGLVINLANNIVLSKNVPSNVGRSPLSIKEASSIYQPSLSKCLVEEVSDAEKAVSH 419

Query: 941  MVYEINSSIYFFCKAKMIGDALRTYRKMQDMKIQPTLQTFTNLVCGYSSLGMYRDITILW 762
            ++YE+NSSIYFF KAKM+GDAL  YR+MQ+MKIQPT  TF  LVCGYSSL MYRDITILW
Sbjct: 420  IIYELNSSIYFFSKAKMMGDALNIYRRMQEMKIQPTEHTFMYLVCGYSSLEMYRDITILW 479

Query: 761  GDIKRNMDGGTLVVSRDLYEYLLMNFLQGGYFERVMEVIGLMEESGMYTDKWMYRSEFLK 582
            GD+KR M+ G+L +S DLYE+ L+NFL+GGYFERVME IG M +  MY DKWMY+SE+LK
Sbjct: 480  GDMKRIMETGSLTLSSDLYEFFLLNFLRGGYFERVMEAIGYMNKCNMYVDKWMYKSEYLK 539

Query: 581  LHKSLYRNLKASEARTEAQKNRLRYVLAFRKWAGI 477
            +HK+LYR+LKAS+ARTEAQ  RL +V AF+KWAGI
Sbjct: 540  IHKNLYRSLKASKARTEAQGKRLEHVKAFKKWAGI 574


>ref|XP_012469457.1| PREDICTED: pentatricopeptide repeat-containing protein At4g17616
            [Gossypium raimondii] gi|823122110|ref|XP_012469466.1|
            PREDICTED: pentatricopeptide repeat-containing protein
            At4g17616 [Gossypium raimondii]
            gi|823122112|ref|XP_012469475.1| PREDICTED:
            pentatricopeptide repeat-containing protein At4g17616
            [Gossypium raimondii] gi|823122114|ref|XP_012469482.1|
            PREDICTED: pentatricopeptide repeat-containing protein
            At4g17616 [Gossypium raimondii]
            gi|763740722|gb|KJB08221.1| hypothetical protein
            B456_001G071600 [Gossypium raimondii]
            gi|763740723|gb|KJB08222.1| hypothetical protein
            B456_001G071600 [Gossypium raimondii]
          Length = 708

 Score =  452 bits (1162), Expect = e-124
 Identities = 231/395 (58%), Positives = 298/395 (75%)
 Frame = -2

Query: 1661 HFKFNDIGAAAELVLDFCRYQESVPVQRDRKELEKAYLVPIGSHNLRTPLKMQIRPALLH 1482
            HFKF+DI AAAEL+LD  R + S P+    K+ +K   VPIGS NLR  LK+QI P L+H
Sbjct: 313  HFKFDDIDAAAELLLDMNRSRGSHPMDDPGKDSQKPRFVPIGSQNLRNGLKIQIMPELIH 372

Query: 1481 KDSVLKLEGRQELVMFRNGQLVLSNKALAKLICGYKKVGNISEMSKILLQIQKELCSIRG 1302
            KDS LK EG+ +LV+FR+ +L+ SN+AL+KLI GYK+ G + E+SK LL ++KEL S   
Sbjct: 373  KDSALKEEGKSDLVLFRDKKLLPSNRALSKLINGYKRHGKMDELSKFLLGLKKELYSSGE 432

Query: 1301 SSLCSDVIGACIHLGWLEIAHDILDDMEAAGAPMDYTNYMSLLIAYSKGKMFREAKALLK 1122
            SS+  DVI ACI LGW+EIAHDILDDME++G  +D + YM+LL AY K  M REA  LLK
Sbjct: 433  SSVICDVIDACISLGWVEIAHDILDDMESSGDSLDSSAYMALLTAYYKRNMSREANVLLK 492

Query: 1121 QMRKAGIVMNLSDEVVLTAYQLEVVDNSAPYMNASSLTGKSNLAESFVQGIREEEKAIPP 942
            Q+RKAG+V+NL++ +VL+      V  S   +  +S   + +L++  V+ + + EKA+  
Sbjct: 493  QVRKAGLVINLANNIVLSKNVPSNVGRSPLSIKEASSIYQPSLSKCLVEEVSDAEKAVSH 552

Query: 941  MVYEINSSIYFFCKAKMIGDALRTYRKMQDMKIQPTLQTFTNLVCGYSSLGMYRDITILW 762
            ++YE+NSSIYFF KAKM+GDAL  YR+MQ+MKIQPT  TF  LVCGYSSL MYRDITILW
Sbjct: 553  IIYELNSSIYFFSKAKMMGDALNIYRRMQEMKIQPTEHTFMYLVCGYSSLEMYRDITILW 612

Query: 761  GDIKRNMDGGTLVVSRDLYEYLLMNFLQGGYFERVMEVIGLMEESGMYTDKWMYRSEFLK 582
            GD+KR M+ G+L +S DLYE+ L+NFL+GGYFERVME IG M +  MY DKWMY+SE+LK
Sbjct: 613  GDMKRIMETGSLTLSSDLYEFFLLNFLRGGYFERVMEAIGYMNKCNMYVDKWMYKSEYLK 672

Query: 581  LHKSLYRNLKASEARTEAQKNRLRYVLAFRKWAGI 477
            +HK+LYR+LKAS+ARTEAQ  RL +V AF+KWAGI
Sbjct: 673  IHKNLYRSLKASKARTEAQGKRLEHVKAFKKWAGI 707