BLASTX nr result

ID: Paeonia23_contig00014729 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Paeonia23_contig00014729
         (1628 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002278434.1| PREDICTED: pentatricopeptide repeat-containi...   733   0.0  
ref|XP_007205048.1| hypothetical protein PRUPE_ppa004609mg [Prun...   723   0.0  
ref|XP_007016943.1| Pentatricopeptide repeat-containing protein ...   702   0.0  
ref|XP_006425116.1| hypothetical protein CICLE_v10028251mg [Citr...   696   0.0  
ref|XP_002313976.1| ubiquitin family protein [Populus trichocarp...   693   0.0  
ref|XP_006488563.1| PREDICTED: pentatricopeptide repeat-containi...   693   0.0  
gb|EXB37964.1| hypothetical protein L484_011688 [Morus notabilis]     690   0.0  
ref|XP_002521193.1| conserved hypothetical protein [Ricinus comm...   688   0.0  
ref|XP_007146808.1| hypothetical protein PHAVU_006G071400g [Phas...   681   0.0  
ref|XP_004296059.1| PREDICTED: uncharacterized protein LOC101292...   678   0.0  
ref|XP_003551233.1| PREDICTED: pentatricopeptide repeat-containi...   673   0.0  
ref|XP_003538312.1| PREDICTED: pentatricopeptide repeat-containi...   673   0.0  
ref|XP_004168796.1| PREDICTED: LOW QUALITY PROTEIN: pentatricope...   669   0.0  
ref|XP_004143220.1| PREDICTED: uncharacterized protein LOC101207...   644   0.0  
ref|XP_004239038.1| PREDICTED: pentatricopeptide repeat-containi...   642   0.0  
ref|XP_006348674.1| PREDICTED: pentatricopeptide repeat-containi...   638   e-180
ref|XP_004500294.1| PREDICTED: pentatricopeptide repeat-containi...   618   e-174
gb|EYU45707.1| hypothetical protein MIMGU_mgv1a020921mg [Mimulus...   582   e-163
ref|XP_006293980.1| hypothetical protein CARUB_v10022972mg [Caps...   563   e-158
ref|XP_006410063.1| hypothetical protein EUTSA_v10016546mg [Eutr...   563   e-157

>ref|XP_002278434.1| PREDICTED: pentatricopeptide repeat-containing protein At2g30100,
            chloroplastic [Vitis vinifera]
          Length = 511

 Score =  733 bits (1892), Expect = 0.0
 Identities = 381/512 (74%), Positives = 420/512 (82%), Gaps = 16/512 (3%)
 Frame = +2

Query: 62   MAAALGFAFL----TNLGFQYR------------PQFYRHLTTESCYKISMKISNHQNPS 193
            MA+A GFA      T LGF               P+F R    E C + +  I NHQNP 
Sbjct: 1    MASAHGFASSLMSPTELGFTLSSSFSIQRPRLIVPKFSRSFLGEYCSRATT-ICNHQNPR 59

Query: 194  FVVSKRCRKLGFGLFKAVELDQFLTSDDKDEMGEGFFEAIEELERMTREPSDVLEEMNDR 373
            FVV KR +   F LFK+VELDQFLTSDD+DEM EGFFEAIEELERMTREPSDVLEEMNDR
Sbjct: 60   FVVPKRDKIREFRLFKSVELDQFLTSDDEDEMSEGFFEAIEELERMTREPSDVLEEMNDR 119

Query: 374  LSDRELQLVLVYFSQEGRDSWCALEVFEWLRKENRVDKETMELMVSIMCGWVKKLIEEEH 553
            LS RELQLVLVYFSQEGRDSWCALEVFEWLRKENRVDKETMELMVSIMC WVKKLIE EH
Sbjct: 120  LSARELQLVLVYFSQEGRDSWCALEVFEWLRKENRVDKETMELMVSIMCSWVKKLIEGEH 179

Query: 554  AXXXXXXXXXXXXXXXXKPSFSMIEKAISLYWETGEKERAILFVKEVLKREIAYAEDDGE 733
                             KP FSMIEK ISLYWE  EKE+A+LFVKEVL+REIAY+EDDG+
Sbjct: 180  DVGDVVDLLVDMDCVGLKPGFSMIEKVISLYWEMEEKEKAVLFVKEVLRREIAYSEDDGD 239

Query: 734  GQKGGPAGYLAWKMMAEGNYMDAVRLIIHLGESELKPEVYSYLIAMTAVVKELNEFAKAL 913
            G KGGP GYLAWKMMAEGNY  AV+L+IHL ES LKPEVYSYLIAMTAVVKELNEFAKAL
Sbjct: 240  GHKGGPTGYLAWKMMAEGNYRGAVKLVIHLRESGLKPEVYSYLIAMTAVVKELNEFAKAL 299

Query: 914  RKLKGYKKTGYIAELDVETLALIEKYQSDLLADGVRLSNWVIEQGSSSLSGAVHERLLAM 1093
            RKLKG+ K+G IAELD E + LIEKYQSDLLADGVRLS+WVI++G S L G V+ERLLAM
Sbjct: 300  RKLKGFTKSGLIAELDAENVELIEKYQSDLLADGVRLSSWVIQEGRSPLHGVVYERLLAM 359

Query: 1094 YICAGRGVEAETQQWKMKLVGKQADRDLYDIVLAICASQKATSSIARLLTRMEVSSSLQK 1273
            YICAGRG+EAE Q W+MKLVGK+ADR+LYDIVLAICAS+K  S+I+RLLT MEV+SS+++
Sbjct: 360  YICAGRGLEAERQLWEMKLVGKEADRELYDIVLAICASKKEASAISRLLTGMEVTSSIRR 419

Query: 1274 KKTLSWLLRGYIKGGYFDDALETIIKMLDLGLIPEYLDRAAVLQGLRKKIQQSGNVENYL 1453
            KKTLSWLLRGYIKG +FDDA ETIIKMLDLGL PEYLDRAAVLQGLR +IQQ+GNVE YL
Sbjct: 420  KKTLSWLLRGYIKGSHFDDASETIIKMLDLGLCPEYLDRAAVLQGLRNRIQQTGNVETYL 479

Query: 1454 KLCKYLSDANLIGPCLVYMHIKKYKLWIIKMI 1549
            KLCK+LSDANLIGPCLVY++IKKYKLWI+K I
Sbjct: 480  KLCKHLSDANLIGPCLVYLYIKKYKLWILKTI 511


>ref|XP_007205048.1| hypothetical protein PRUPE_ppa004609mg [Prunus persica]
            gi|462400690|gb|EMJ06247.1| hypothetical protein
            PRUPE_ppa004609mg [Prunus persica]
          Length = 500

 Score =  723 bits (1865), Expect = 0.0
 Identities = 366/500 (73%), Positives = 415/500 (83%), Gaps = 4/500 (0%)
 Frame = +2

Query: 62   MAAALGFAFLTNLGFQYRPQFY---RHLTTESCYKISMKISNHQNPSFVVSKRCRKLGFG 232
            MA+A G A LT+  F  + Q +   R  + +SC ++  +I  HQ P+F+V+K  +   F 
Sbjct: 1    MASAQGLASLTHSLFAVKRQRFMGLRGFSAQSCGRVFPRICKHQKPNFIVAKSSKVRDFR 60

Query: 233  LFKAVELDQFLTSDDKDEMGEGFFEAIEELERMTREPSDVLEEMNDRLSDRELQLVLVYF 412
            LFK+VELDQFLTSDD+DEMGEGFFEAIEELERMTREPSDVLEEMNDRLS RELQLVLVYF
Sbjct: 61   LFKSVELDQFLTSDDEDEMGEGFFEAIEELERMTREPSDVLEEMNDRLSARELQLVLVYF 120

Query: 413  SQEGRDSWCALEVFEWLRKENRVDKETMELMVSIMCGWVKKLIEEEHAXXXXXXXXXXXX 592
            SQEGRDSWCALEVFEWLRKENRVDKETM+LMVSIMC WVKKLI+ EH             
Sbjct: 121  SQEGRDSWCALEVFEWLRKENRVDKETMDLMVSIMCSWVKKLIQREHDIGDVVDLLVDMD 180

Query: 593  XXXXKPSFSMIEKAISLYWETGEKERAILFVKEVLKREIAYAE-DDGEGQKGGPAGYLAW 769
                KPSFSM+EK ISLYWE GEKE+A+LFVKEVLKR I Y+E DD +G KGGP GYLAW
Sbjct: 181  CVGLKPSFSMMEKVISLYWEMGEKEKAVLFVKEVLKRGIVYSEEDDTDGHKGGPTGYLAW 240

Query: 770  KMMAEGNYMDAVRLIIHLGESELKPEVYSYLIAMTAVVKELNEFAKALRKLKGYKKTGYI 949
            KMM EGNY D+V+L+IHL ES LKPEVYSYLIAMTAVVKELNE AKALRKLKG+ + G I
Sbjct: 241  KMMVEGNYRDSVKLVIHLRESGLKPEVYSYLIAMTAVVKELNELAKALRKLKGFTRAGLI 300

Query: 950  AELDVETLALIEKYQSDLLADGVRLSNWVIEQGSSSLSGAVHERLLAMYICAGRGVEAET 1129
            AE D E + LIEKYQSDLL+DGV+LSNWVI++GSSSL G VHERLLAMYIC+G G+EAE 
Sbjct: 301  AEFDTENVGLIEKYQSDLLSDGVQLSNWVIQEGSSSLHGVVHERLLAMYICSGHGLEAER 360

Query: 1130 QQWKMKLVGKQADRDLYDIVLAICASQKATSSIARLLTRMEVSSSLQKKKTLSWLLRGYI 1309
            Q W+MKLVGK+AD DLYDIVLAICASQK  S+I RLLTR EV+SSL+KKK+LSWLLRGYI
Sbjct: 361  QLWEMKLVGKEADADLYDIVLAICASQKEASAIGRLLTRTEVTSSLRKKKSLSWLLRGYI 420

Query: 1310 KGGYFDDALETIIKMLDLGLIPEYLDRAAVLQGLRKKIQQSGNVENYLKLCKYLSDANLI 1489
            KGG+FDDA ET+IKMLDLGL PE+LDRAAVLQGLRK IQ+SG V+ YLKLCK LSDA+LI
Sbjct: 421  KGGHFDDAAETVIKMLDLGLCPEFLDRAAVLQGLRKSIQESGGVDTYLKLCKRLSDASLI 480

Query: 1490 GPCLVYMHIKKYKLWIIKMI 1549
            GPCLVY+ I+KYKLWI KM+
Sbjct: 481  GPCLVYLFIRKYKLWITKML 500


>ref|XP_007016943.1| Pentatricopeptide repeat-containing protein [Theobroma cacao]
            gi|508787306|gb|EOY34562.1| Pentatricopeptide
            repeat-containing protein [Theobroma cacao]
          Length = 504

 Score =  702 bits (1811), Expect = 0.0
 Identities = 357/509 (70%), Positives = 412/509 (80%), Gaps = 13/509 (2%)
 Frame = +2

Query: 62   MAAALGFAFLTNLGFQYR------PQFYR-HLTTESCYKISMKISNHQNPSFVVSK---- 208
            MA A  F  L  L   ++      PQFY+      S  +IS +I NHQNPSFV+ K    
Sbjct: 1    MAFARRFTSLAELSLPFQSHRFLAPQFYQTFFWRHSLRRISTRICNHQNPSFVLRKIQPK 60

Query: 209  --RCRKLGFGLFKAVELDQFLTSDDKDEMGEGFFEAIEELERMTREPSDVLEEMNDRLSD 382
               CR     LFK+VELDQFLTSDD+DEM EGFFEAIEELERMTREPSD+LEEMNDRLS 
Sbjct: 61   TRECR-----LFKSVELDQFLTSDDEDEMSEGFFEAIEELERMTREPSDILEEMNDRLSS 115

Query: 383  RELQLVLVYFSQEGRDSWCALEVFEWLRKENRVDKETMELMVSIMCGWVKKLIEEEHAXX 562
            RELQLVLVYFSQEGRDSWCALEVFEWL+KEN+VD ETMELMVSIMC WVKKLIE E    
Sbjct: 116  RELQLVLVYFSQEGRDSWCALEVFEWLKKENKVDNETMELMVSIMCSWVKKLIEGEGDVG 175

Query: 563  XXXXXXXXXXXXXXKPSFSMIEKAISLYWETGEKERAILFVKEVLKREIAYAEDDGEGQK 742
                          KP FSMIEK IS+YWE  +K+RA++FVKEVL+R I+Y ++DGEGQK
Sbjct: 176  DVVDLLVDMDCVGLKPGFSMIEKVISMYWEMEKKDRAVVFVKEVLRRGISYEDEDGEGQK 235

Query: 743  GGPAGYLAWKMMAEGNYMDAVRLIIHLGESELKPEVYSYLIAMTAVVKELNEFAKALRKL 922
            GGP GYLAWKMM EGNY DA++L+I L ES LKPE+YSYLIAMTA+VKELNEFAKALRKL
Sbjct: 236  GGPTGYLAWKMMVEGNYRDAIKLVIELRESGLKPEIYSYLIAMTAIVKELNEFAKALRKL 295

Query: 923  KGYKKTGYIAELDVETLALIEKYQSDLLADGVRLSNWVIEQGSSSLSGAVHERLLAMYIC 1102
            KG+ ++G +AELD+E + LI+KYQSDLLADG+RLSNW I++G+SSL G VHERLLAMYIC
Sbjct: 296  KGFARSGLVAELDMENVELIKKYQSDLLADGLRLSNWAIQEGTSSLFGLVHERLLAMYIC 355

Query: 1103 AGRGVEAETQQWKMKLVGKQADRDLYDIVLAICASQKATSSIARLLTRMEVSSSLQKKKT 1282
            AGRG+EAE Q W+MKL GK+AD DL+DIVLAICASQK  S+I+RLLTRMEVSSSL++KKT
Sbjct: 356  AGRGLEAERQLWEMKLAGKEADGDLHDIVLAICASQKEASAISRLLTRMEVSSSLRRKKT 415

Query: 1283 LSWLLRGYIKGGYFDDALETIIKMLDLGLIPEYLDRAAVLQGLRKKIQQSGNVENYLKLC 1462
            LSWLLRGYIKGG+  DA ET+IKMLDLGL PEYLDRAAVLQ LRK+IQQ GN+E Y+ LC
Sbjct: 416  LSWLLRGYIKGGHISDAAETVIKMLDLGLHPEYLDRAAVLQELRKRIQQPGNIETYVNLC 475

Query: 1463 KYLSDANLIGPCLVYMHIKKYKLWIIKMI 1549
            K L DA+LIGPCL+Y++IKKYKLW+IKM+
Sbjct: 476  KRLYDASLIGPCLIYLYIKKYKLWVIKML 504


>ref|XP_006425116.1| hypothetical protein CICLE_v10028251mg [Citrus clementina]
            gi|557527050|gb|ESR38356.1| hypothetical protein
            CICLE_v10028251mg [Citrus clementina]
          Length = 502

 Score =  696 bits (1797), Expect = 0.0
 Identities = 356/496 (71%), Positives = 398/496 (80%), Gaps = 4/496 (0%)
 Frame = +2

Query: 74   LGFAFLTNLGFQYR----PQFYRHLTTESCYKISMKISNHQNPSFVVSKRCRKLGFGLFK 241
            LGFA   N   Q      PQ +    T    +IS +I N+QNPSF+ +K  +   F   K
Sbjct: 7    LGFALSPNFLLQRHKLLVPQLHGSCLTRPPPRISTRIKNYQNPSFIATKVSKIREFRFLK 66

Query: 242  AVELDQFLTSDDKDEMGEGFFEAIEELERMTREPSDVLEEMNDRLSDRELQLVLVYFSQE 421
            +VELDQF+TSDD+DEM E FFEAIEELERMTREPSD+LEEMNDRLS RELQLVLVYFSQE
Sbjct: 67   SVELDQFVTSDDEDEMSEEFFEAIEELERMTREPSDILEEMNDRLSARELQLVLVYFSQE 126

Query: 422  GRDSWCALEVFEWLRKENRVDKETMELMVSIMCGWVKKLIEEEHAXXXXXXXXXXXXXXX 601
            GRDSWCALEVFEWL+KENRVD ETMELMVSIMC WVKK IEEE                 
Sbjct: 127  GRDSWCALEVFEWLKKENRVDNETMELMVSIMCSWVKKYIEEERDVGDVIDLLVDMDCVG 186

Query: 602  XKPSFSMIEKAISLYWETGEKERAILFVKEVLKREIAYAEDDGEGQKGGPAGYLAWKMMA 781
             KP FSMIEK ISLYWE  +KERA+LFVK VL R IAYAE DGEGQKGGP GYLAWKMM 
Sbjct: 187  LKPGFSMIEKVISLYWEMEKKERAVLFVKAVLSRGIAYAEGDGEGQKGGPTGYLAWKMMV 246

Query: 782  EGNYMDAVRLIIHLGESELKPEVYSYLIAMTAVVKELNEFAKALRKLKGYKKTGYIAELD 961
            EG Y+DA++L+IHL ES LKPEVYSYLIA+TAVVKELNEF KALRKLKGY + G IAELD
Sbjct: 247  EGKYVDAIKLVIHLRESGLKPEVYSYLIALTAVVKELNEFGKALRKLKGYVRAGSIAELD 306

Query: 962  VETLALIEKYQSDLLADGVRLSNWVIEQGSSSLSGAVHERLLAMYICAGRGVEAETQQWK 1141
             + L LIEKYQSDLLADG RLS+W I++G SSL G VHERLLAMYICAGRG+EAE Q W+
Sbjct: 307  GKNLGLIEKYQSDLLADGSRLSSWAIQEGGSSLYGVVHERLLAMYICAGRGLEAERQLWE 366

Query: 1142 MKLVGKQADRDLYDIVLAICASQKATSSIARLLTRMEVSSSLQKKKTLSWLLRGYIKGGY 1321
            MKLVGK+AD DLYDIVLAICASQ   S+++RLL+R+EV +SL KKKTLSWLLRGYIKGG+
Sbjct: 367  MKLVGKEADGDLYDIVLAICASQNEGSAVSRLLSRIEVMNSLCKKKTLSWLLRGYIKGGH 426

Query: 1322 FDDALETIIKMLDLGLIPEYLDRAAVLQGLRKKIQQSGNVENYLKLCKYLSDANLIGPCL 1501
             +DA ET+ KMLDLGL PEY+DR AVLQGLRK+IQQSGNVE YL LCK LSD +LIGPCL
Sbjct: 427  INDAAETLTKMLDLGLYPEYMDRVAVLQGLRKRIQQSGNVEAYLNLCKRLSDTSLIGPCL 486

Query: 1502 VYMHIKKYKLWIIKMI 1549
            VY++IKKYKLWIIKM+
Sbjct: 487  VYLYIKKYKLWIIKML 502


>ref|XP_002313976.1| ubiquitin family protein [Populus trichocarpa]
            gi|222850384|gb|EEE87931.1| ubiquitin family protein
            [Populus trichocarpa]
          Length = 500

 Score =  693 bits (1789), Expect = 0.0
 Identities = 349/501 (69%), Positives = 414/501 (82%), Gaps = 5/501 (0%)
 Frame = +2

Query: 62   MAAALGFAFLTNLG--FQYRPQFYRHLTTESCYKISMKISNHQNP---SFVVSKRCRKLG 226
            MA+A  F+ L+ +   F  + +++     + C  +S  I N+Q P   +FVV+K  +   
Sbjct: 1    MASAHAFSSLSKVSPVFSLKKRYWNSCM-KPCCMVSTIICNYQTPKRPNFVVAKTTKVRE 59

Query: 227  FGLFKAVELDQFLTSDDKDEMGEGFFEAIEELERMTREPSDVLEEMNDRLSDRELQLVLV 406
            F LFK+VELDQ++TSDD++EMGEGFFEAIEELERMTREPSD+LEEMNDRLS RELQLVLV
Sbjct: 60   FRLFKSVELDQYVTSDDEEEMGEGFFEAIEELERMTREPSDILEEMNDRLSARELQLVLV 119

Query: 407  YFSQEGRDSWCALEVFEWLRKENRVDKETMELMVSIMCGWVKKLIEEEHAXXXXXXXXXX 586
            YFSQEGRDSWCALEVFEWLRKENRVDKETMELMVSIMC WVKKLIE E            
Sbjct: 120  YFSQEGRDSWCALEVFEWLRKENRVDKETMELMVSIMCSWVKKLIEGEQDVGDVVDLLVD 179

Query: 587  XXXXXXKPSFSMIEKAISLYWETGEKERAILFVKEVLKREIAYAEDDGEGQKGGPAGYLA 766
                  KPSFSMIEK ISLYW+ G+KE A+ FVKEVL+R IAY+ DDGEGQKGGP GYL 
Sbjct: 180  MDCVGLKPSFSMIEKVISLYWDMGKKEGAVSFVKEVLRRGIAYSGDDGEGQKGGPTGYLT 239

Query: 767  WKMMAEGNYMDAVRLIIHLGESELKPEVYSYLIAMTAVVKELNEFAKALRKLKGYKKTGY 946
            WKMM +GNY +AV+L+IHL ES LKPE+Y+YLIAMTAVVKELNEF+KALRKLKGY ++G 
Sbjct: 240  WKMMVDGNYRNAVKLVIHLRESGLKPEIYAYLIAMTAVVKELNEFSKALRKLKGYSRSGM 299

Query: 947  IAELDVETLALIEKYQSDLLADGVRLSNWVIEQGSSSLSGAVHERLLAMYICAGRGVEAE 1126
            + ELD E + L+EKYQSDLLADGV LS+WVI++GS +L G VHERLLAMYICAGRG++AE
Sbjct: 300  VTELDAENVELVEKYQSDLLADGVCLSSWVIQEGSPALYGVVHERLLAMYICAGRGLDAE 359

Query: 1127 TQQWKMKLVGKQADRDLYDIVLAICASQKATSSIARLLTRMEVSSSLQKKKTLSWLLRGY 1306
             Q W+MKLVGK+AD DLYDIVLAICASQK  S++ARLLTR+EV+SS++KKK+LSWLLRGY
Sbjct: 360  RQLWEMKLVGKEADGDLYDIVLAICASQKEASAVARLLTRIEVASSMRKKKSLSWLLRGY 419

Query: 1307 IKGGYFDDALETIIKMLDLGLIPEYLDRAAVLQGLRKKIQQSGNVENYLKLCKYLSDANL 1486
            IKGG++ +A ET+IKMLDLGL P+YLDR AV+QGLRK+IQQ GNVE+YLKLCK LSD NL
Sbjct: 420  IKGGHYGEAAETLIKMLDLGLSPDYLDRVAVMQGLRKRIQQWGNVESYLKLCKRLSDVNL 479

Query: 1487 IGPCLVYMHIKKYKLWIIKMI 1549
            IGP LVY++IKKYKLWI+K++
Sbjct: 480  IGPSLVYLYIKKYKLWIMKLL 500


>ref|XP_006488563.1| PREDICTED: pentatricopeptide repeat-containing protein At2g30100,
            chloroplastic-like [Citrus sinensis]
          Length = 502

 Score =  693 bits (1788), Expect = 0.0
 Identities = 354/496 (71%), Positives = 397/496 (80%), Gaps = 4/496 (0%)
 Frame = +2

Query: 74   LGFAFLTNLGFQYR----PQFYRHLTTESCYKISMKISNHQNPSFVVSKRCRKLGFGLFK 241
            LGFA   N   Q      PQ      T    +IS +I N+QNP+F+ +K  +   F   K
Sbjct: 7    LGFALSPNFLLQRHKLLVPQLRGSCLTRPPPRISTRIKNYQNPNFIATKVSKIREFRFLK 66

Query: 242  AVELDQFLTSDDKDEMGEGFFEAIEELERMTREPSDVLEEMNDRLSDRELQLVLVYFSQE 421
            +VELDQF+TSDD+DEM E FFEAIEELERMTREPSD+LEEMNDRLS RELQLVLVYFSQE
Sbjct: 67   SVELDQFVTSDDEDEMSEEFFEAIEELERMTREPSDILEEMNDRLSARELQLVLVYFSQE 126

Query: 422  GRDSWCALEVFEWLRKENRVDKETMELMVSIMCGWVKKLIEEEHAXXXXXXXXXXXXXXX 601
            GRDSWCALEVFEWL+KENRVD ETMELMVSIMC WVKK IEEE                 
Sbjct: 127  GRDSWCALEVFEWLKKENRVDNETMELMVSIMCSWVKKYIEEERGVGDVVDLLVDMDCVG 186

Query: 602  XKPSFSMIEKAISLYWETGEKERAILFVKEVLKREIAYAEDDGEGQKGGPAGYLAWKMMA 781
             KP FSMIEK ISLYWE  +KERA+LFVK VL R IAYAE DGEGQ+GGP GYLAWKMM 
Sbjct: 187  LKPGFSMIEKVISLYWEMEKKERAVLFVKAVLSRGIAYAEGDGEGQQGGPTGYLAWKMMV 246

Query: 782  EGNYMDAVRLIIHLGESELKPEVYSYLIAMTAVVKELNEFAKALRKLKGYKKTGYIAELD 961
            EG Y+DA++L+IHL ES LKPEVYSYLIA+TAVVKELNEF KALRKLKGY + G IAELD
Sbjct: 247  EGKYVDAIKLVIHLRESGLKPEVYSYLIALTAVVKELNEFGKALRKLKGYVRAGSIAELD 306

Query: 962  VETLALIEKYQSDLLADGVRLSNWVIEQGSSSLSGAVHERLLAMYICAGRGVEAETQQWK 1141
             + L LIEKYQSDLLADG RLS+W I++G SSL G VHERLLAMYICAGRG+EAE Q W+
Sbjct: 307  GKNLGLIEKYQSDLLADGSRLSSWAIQEGGSSLYGVVHERLLAMYICAGRGLEAERQLWE 366

Query: 1142 MKLVGKQADRDLYDIVLAICASQKATSSIARLLTRMEVSSSLQKKKTLSWLLRGYIKGGY 1321
            MKLVGK+AD DLYDIVLAICASQ   S+++RLL+R+EV +SL KKKTLSWLLRGYIKGG+
Sbjct: 367  MKLVGKEADGDLYDIVLAICASQNEGSAVSRLLSRIEVMNSLCKKKTLSWLLRGYIKGGH 426

Query: 1322 FDDALETIIKMLDLGLIPEYLDRAAVLQGLRKKIQQSGNVENYLKLCKYLSDANLIGPCL 1501
             +DA ET+ KMLDLGL PEY+DR AVLQGLRK+IQQSGNVE YL LCK LSD +LIGPCL
Sbjct: 427  INDAAETLTKMLDLGLYPEYMDRVAVLQGLRKRIQQSGNVEAYLNLCKRLSDTSLIGPCL 486

Query: 1502 VYMHIKKYKLWIIKMI 1549
            VY++IKKYKLWIIKM+
Sbjct: 487  VYLYIKKYKLWIIKML 502


>gb|EXB37964.1| hypothetical protein L484_011688 [Morus notabilis]
          Length = 516

 Score =  690 bits (1780), Expect = 0.0
 Identities = 356/516 (68%), Positives = 408/516 (79%), Gaps = 20/516 (3%)
 Frame = +2

Query: 62   MAAALGFAFLTNLGFQ--------------YRPQFYRHLTTESCY-KISMKIS-----NH 181
            MA+A GF  LT LGF               +R + +     E+ + + S K         
Sbjct: 1    MASAQGFTPLTELGFPSSSSSSSSSSSNSLHRNRIFLCRMDENLWGRTSAKFCPVICCKQ 60

Query: 182  QNPSFVVSKRCRKLGFGLFKAVELDQFLTSDDKDEMGEGFFEAIEELERMTREPSDVLEE 361
            QNP+F+  K  +   F LF +VELDQFLTSDD++EMGEGFFEAIEELERMTREPSDVLEE
Sbjct: 61   QNPNFIAPKPSKLREFRLFTSVELDQFLTSDDEEEMGEGFFEAIEELERMTREPSDVLEE 120

Query: 362  MNDRLSDRELQLVLVYFSQEGRDSWCALEVFEWLRKENRVDKETMELMVSIMCGWVKKLI 541
            MNDRLS RELQLVLVYFSQEGRDSWCALEVFEWLRKENRVDKETMELMV++MC WVKKLI
Sbjct: 121  MNDRLSARELQLVLVYFSQEGRDSWCALEVFEWLRKENRVDKETMELMVTLMCSWVKKLI 180

Query: 542  EEEHAXXXXXXXXXXXXXXXXKPSFSMIEKAISLYWETGEKERAILFVKEVLKREIAYAE 721
            E EH                 +P FSM+E  I LYWE GEK RA+ FVKEVL+R IA  E
Sbjct: 181  EGEHDVGDVVDLLVDMACVGLRPGFSMMENVILLYWEMGEKGRAVSFVKEVLRRGIACLE 240

Query: 722  DDGEGQKGGPAGYLAWKMMAEGNYMDAVRLIIHLGESELKPEVYSYLIAMTAVVKELNEF 901
            DDGEG KGGP GYLAWKMM EGNYM+AV+L++ + ES LKPEVYSYLIAMTAVVKELNEF
Sbjct: 241  DDGEGPKGGPTGYLAWKMMVEGNYMEAVKLVVDIRESGLKPEVYSYLIAMTAVVKELNEF 300

Query: 902  AKALRKLKGYKKTGYIAELDVETLALIEKYQSDLLADGVRLSNWVIEQGSSSLSGAVHER 1081
            AKALRKLKG+++ G  AELD E++ LIEKYQSDLL DGVRLSNWVIE+G +SL+G VHER
Sbjct: 301  AKALRKLKGFERAGLTAELDEESVELIEKYQSDLLDDGVRLSNWVIEEGITSLNGVVHER 360

Query: 1082 LLAMYICAGRGVEAETQQWKMKLVGKQADRDLYDIVLAICASQKATSSIARLLTRMEVSS 1261
            LLAMYICAGRG+EAE Q WKMKLVGK+AD DLYDIVLAICASQK   +IARLLTR+  SS
Sbjct: 361  LLAMYICAGRGIEAERQLWKMKLVGKEADGDLYDIVLAICASQKEGRAIARLLTRVNFSS 420

Query: 1262 SLQKKKTLSWLLRGYIKGGYFDDALETIIKMLDLGLIPEYLDRAAVLQGLRKKIQQSGNV 1441
            +L+K+K+LSWLLRGYIKGG+FD+A ET++KMLDLGL PEYLDRAAVLQGLRK+I+    V
Sbjct: 421  TLRKRKSLSWLLRGYIKGGHFDNAAETVVKMLDLGLCPEYLDRAAVLQGLRKRIKGPDTV 480

Query: 1442 ENYLKLCKYLSDANLIGPCLVYMHIKKYKLWIIKMI 1549
            E YLKLCK+LSD NLIGPCL+Y++IKKYKLWI+KM+
Sbjct: 481  ETYLKLCKHLSDYNLIGPCLIYLYIKKYKLWIMKML 516


>ref|XP_002521193.1| conserved hypothetical protein [Ricinus communis]
            gi|223539607|gb|EEF41193.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 499

 Score =  688 bits (1776), Expect = 0.0
 Identities = 342/458 (74%), Positives = 391/458 (85%), Gaps = 2/458 (0%)
 Frame = +2

Query: 182  QNPSFVVSK--RCRKLGFGLFKAVELDQFLTSDDKDEMGEGFFEAIEELERMTREPSDVL 355
            ++ +FVV++  + R   F + K+VELDQ++ SDD++EM EGFFEAIEELERMTREPSDVL
Sbjct: 42   KSSNFVVAQQSKSRNREFRVLKSVELDQYIASDDEEEMSEGFFEAIEELERMTREPSDVL 101

Query: 356  EEMNDRLSDRELQLVLVYFSQEGRDSWCALEVFEWLRKENRVDKETMELMVSIMCGWVKK 535
            EEMND+LS RELQLVLVYFSQEGRDSWCALEVFEWLRKENRVDKETMELMVSIMC W+KK
Sbjct: 102  EEMNDKLSARELQLVLVYFSQEGRDSWCALEVFEWLRKENRVDKETMELMVSIMCSWIKK 161

Query: 536  LIEEEHAXXXXXXXXXXXXXXXXKPSFSMIEKAISLYWETGEKERAILFVKEVLKREIAY 715
            LIE EH                 KPSFSMIEK ISLYWE GEKE+++ FVKEVL+RE+AY
Sbjct: 162  LIEGEHEIGDVVDLLVDMDCVGLKPSFSMIEKVISLYWEIGEKEKSVSFVKEVLRREVAY 221

Query: 716  AEDDGEGQKGGPAGYLAWKMMAEGNYMDAVRLIIHLGESELKPEVYSYLIAMTAVVKELN 895
             EDDGEGQKGGP GYLAWKMM +GNY DAV+L+IH  ES LKPEVYSYLIAMTAVVKELN
Sbjct: 222  FEDDGEGQKGGPTGYLAWKMMVDGNYRDAVKLVIHFRESGLKPEVYSYLIAMTAVVKELN 281

Query: 896  EFAKALRKLKGYKKTGYIAELDVETLALIEKYQSDLLADGVRLSNWVIEQGSSSLSGAVH 1075
            EFAKALRKLKG+ K+G IAELD E   LIEKYQSDL+ADGV LS+WVI++GS SL G VH
Sbjct: 282  EFAKALRKLKGFAKSGLIAELDAENTRLIEKYQSDLIADGVCLSSWVIQEGSPSLYGVVH 341

Query: 1076 ERLLAMYICAGRGVEAETQQWKMKLVGKQADRDLYDIVLAICASQKATSSIARLLTRMEV 1255
            ERLLAMYICAGRG++AE Q W+MKLVGK AD DLYDIVLAICASQK  S+++RLLTR+EV
Sbjct: 342  ERLLAMYICAGRGLDAERQLWEMKLVGKHADGDLYDIVLAICASQKEASAVSRLLTRVEV 401

Query: 1256 SSSLQKKKTLSWLLRGYIKGGYFDDALETIIKMLDLGLIPEYLDRAAVLQGLRKKIQQSG 1435
            +SSLQKKKTLSWLLRGY+KGG +D+A E ++KMLD+GL P+YLDR AVLQGLRK+IQQ G
Sbjct: 402  TSSLQKKKTLSWLLRGYLKGGQYDEAAEALVKMLDMGLCPDYLDRVAVLQGLRKRIQQWG 461

Query: 1436 NVENYLKLCKYLSDANLIGPCLVYMHIKKYKLWIIKMI 1549
            NVE+YL LCK LSD NLIGP LVY++IKKYKLWI+KM+
Sbjct: 462  NVESYLNLCKRLSDENLIGPSLVYLYIKKYKLWIMKML 499


>ref|XP_007146808.1| hypothetical protein PHAVU_006G071400g [Phaseolus vulgaris]
            gi|561020031|gb|ESW18802.1| hypothetical protein
            PHAVU_006G071400g [Phaseolus vulgaris]
          Length = 510

 Score =  681 bits (1757), Expect = 0.0
 Identities = 347/510 (68%), Positives = 404/510 (79%), Gaps = 14/510 (2%)
 Frame = +2

Query: 62   MAAALGFAFLTNLGFQYRP-----QFYRHLTTESCYKISMKISN--------HQNPSFVV 202
            MA A GFA    LGF +       Q Y  +   +    S+K  N         QNPS V 
Sbjct: 1    MAYAQGFAPNFKLGFVFSSGSPSQQRYPLMFPAAHCGFSLKFYNGVCARSFKFQNPSIVA 60

Query: 203  SKRCRKLGFGLFKAVELDQFLTSDDK-DEMGEGFFEAIEELERMTREPSDVLEEMNDRLS 379
            +K C   GF + K+VELDQF+TSDD+ DEMG+GFFEAIEELERMTREPSD+LEEMNDRLS
Sbjct: 61   AKHCSVRGFRVLKSVELDQFVTSDDEEDEMGDGFFEAIEELERMTREPSDILEEMNDRLS 120

Query: 380  DRELQLVLVYFSQEGRDSWCALEVFEWLRKENRVDKETMELMVSIMCGWVKKLIEEEHAX 559
             RELQLVLVYFSQ+GRDSWCALEVF+WLRKENRVDKETMELMVSIMCGWVKKLI+E+H  
Sbjct: 121  ARELQLVLVYFSQDGRDSWCALEVFDWLRKENRVDKETMELMVSIMCGWVKKLIQEQHGV 180

Query: 560  XXXXXXXXXXXXXXXKPSFSMIEKAISLYWETGEKERAILFVKEVLKREIAYAEDDGEGQ 739
                           +P FSMIEK ISLYWE GEKE A+LFV+EVL+R I YA +D EG 
Sbjct: 181  GDVIDLLVDMDCVGLRPGFSMIEKVISLYWEMGEKEGAVLFVEEVLRRGIPYASEDKEGH 240

Query: 740  KGGPAGYLAWKMMAEGNYMDAVRLIIHLGESELKPEVYSYLIAMTAVVKELNEFAKALRK 919
            KGGP GYLAWKMMAEG+Y  AVRL+I   ES LKPEVYSYL+AMTAVVKELNEFAKALRK
Sbjct: 241  KGGPTGYLAWKMMAEGDYRSAVRLVIRFRESGLKPEVYSYLVAMTAVVKELNEFAKALRK 300

Query: 920  LKGYKKTGYIAELDVETLALIEKYQSDLLADGVRLSNWVIEQGSSSLSGAVHERLLAMYI 1099
            LK + + G + ELD+E + L EKYQ+DLLADGVRLSNWVI+ G  SL G VHERLLAMYI
Sbjct: 301  LKSFTRAGLVTELDLEDVELAEKYQTDLLADGVRLSNWVIQDGRPSLYGVVHERLLAMYI 360

Query: 1100 CAGRGVEAETQQWKMKLVGKQADRDLYDIVLAICASQKATSSIARLLTRMEVSSSLQKKK 1279
            CAG G+EAE Q W+MKLVGK+AD DLYDIVLAICASQK  ++ ARLLTR+E+++S QKKK
Sbjct: 361  CAGHGIEAERQLWEMKLVGKEADGDLYDIVLAICASQKEVNATARLLTRLELANSPQKKK 420

Query: 1280 TLSWLLRGYIKGGYFDDALETIIKMLDLGLIPEYLDRAAVLQGLRKKIQQSGNVENYLKL 1459
            +LSWLLRGYIKGG+F +A ET++KML+LG  PEYLDRAAVLQGLRK+IQQ GN++ Y++L
Sbjct: 421  SLSWLLRGYIKGGHFTEAAETVMKMLELGFYPEYLDRAAVLQGLRKRIQQYGNLDTYVRL 480

Query: 1460 CKYLSDANLIGPCLVYMHIKKYKLWIIKMI 1549
            CK LSDANLIGPCLV+++I+KYKLW++KM+
Sbjct: 481  CKSLSDANLIGPCLVHLYIRKYKLWVVKML 510


>ref|XP_004296059.1| PREDICTED: uncharacterized protein LOC101292395 [Fragaria vesca
            subsp. vesca]
          Length = 1304

 Score =  678 bits (1750), Expect = 0.0
 Identities = 343/478 (71%), Positives = 393/478 (82%), Gaps = 1/478 (0%)
 Frame = +2

Query: 113  RPQFYRHLTTESCYKISMKISNHQNPSFVVSKRCRKLGFGLFKAVELDQFLTSDDKDEMG 292
            R +F    + + C ++   I   +NPSFVV+K  +   F LF +V+LDQF+TSDD+DEMG
Sbjct: 15   RYRFSGGFSGKRCSRVCNVIYKEKNPSFVVAKSGKVRDFRLFNSVQLDQFVTSDDEDEMG 74

Query: 293  EGFFEAIEELERMTREPSDVLEEMNDRLSDRELQLVLVYFSQEGRDSWCALEVFEWLRKE 472
            E FFEAIEELERM REPSDVLEEMNDRLS RELQLVLVYFSQEGRDSWCALEVFEWLR+E
Sbjct: 75   ESFFEAIEELERMRREPSDVLEEMNDRLSARELQLVLVYFSQEGRDSWCALEVFEWLRRE 134

Query: 473  NRVDKETMELMVSIMCGWVKKLIEEEHAXXXXXXXXXXXXXXXXKPSFSMIEKAISLYWE 652
            NRVDKETMELMVSIMCGW+K+LIEE +                 KPSFSM+EK ISLYWE
Sbjct: 135  NRVDKETMELMVSIMCGWLKRLIEEGNDVADVIDLLVDVDCVGLKPSFSMMEKVISLYWE 194

Query: 653  TGEKERAILFVKEVLKREIAYAE-DDGEGQKGGPAGYLAWKMMAEGNYMDAVRLIIHLGE 829
             GEKE A+LFVKEVLKR I Y+E DD +G KGGP GYLAWKM  +GNY D+V+ +I L E
Sbjct: 195  MGEKENAVLFVKEVLKRGIVYSEEDDRDGHKGGPTGYLAWKMTVDGNYRDSVKFVIQLRE 254

Query: 830  SELKPEVYSYLIAMTAVVKELNEFAKALRKLKGYKKTGYIAELDVETLALIEKYQSDLLA 1009
            S LKPEVYSYLIAMTAVVKELNE  KALRKLK + + G +AE D E + LIEKYQSDLLA
Sbjct: 255  SGLKPEVYSYLIAMTAVVKELNELGKALRKLKAFTRAGLVAEFDSEDVGLIEKYQSDLLA 314

Query: 1010 DGVRLSNWVIEQGSSSLSGAVHERLLAMYICAGRGVEAETQQWKMKLVGKQADRDLYDIV 1189
            DGV+LSNWVI++GSS+L G VHERLLAMYIC+GRG+EAE Q W+MKLVGK+ D DLYDIV
Sbjct: 315  DGVQLSNWVIQEGSSTLCGVVHERLLAMYICSGRGLEAERQLWEMKLVGKEPDGDLYDIV 374

Query: 1190 LAICASQKATSSIARLLTRMEVSSSLQKKKTLSWLLRGYIKGGYFDDALETIIKMLDLGL 1369
            LAICAS+K TS+IARLLTR EVSSSL KKK+LSWLLRGYIKGG+F+DA ET+IKMLDLGL
Sbjct: 375  LAICASRKETSAIARLLTRTEVSSSLSKKKSLSWLLRGYIKGGHFNDAAETVIKMLDLGL 434

Query: 1370 IPEYLDRAAVLQGLRKKIQQSGNVENYLKLCKYLSDANLIGPCLVYMHIKKYKLWIIK 1543
             P+YLDRAAVL GLRK+IQQSG V+ YLKLCK LSDANLI  CL+Y++IKK+KLWII+
Sbjct: 435  FPDYLDRAAVLHGLRKRIQQSGTVDTYLKLCKRLSDANLIESCLLYLYIKKHKLWIIR 492


>ref|XP_003551233.1| PREDICTED: pentatricopeptide repeat-containing protein At2g30100,
            chloroplastic-like [Glycine max]
          Length = 508

 Score =  673 bits (1737), Expect = 0.0
 Identities = 336/477 (70%), Positives = 393/477 (82%)
 Frame = +2

Query: 119  QFYRHLTTESCYKISMKISNHQNPSFVVSKRCRKLGFGLFKAVELDQFLTSDDKDEMGEG 298
            +FY  L+  SC          +NPSFV +K     GF   K+VE+DQ++TS+D  EM +G
Sbjct: 42   KFYGGLSARSC--------KFKNPSFVSAKHGSLRGFRALKSVEMDQYVTSND--EMSDG 91

Query: 299  FFEAIEELERMTREPSDVLEEMNDRLSDRELQLVLVYFSQEGRDSWCALEVFEWLRKENR 478
            FFEAIEELERMTREPSDVLEEMNDRLS RELQLVLVYFSQ+GRDSWCALEVF+WLRKENR
Sbjct: 92   FFEAIEELERMTREPSDVLEEMNDRLSARELQLVLVYFSQDGRDSWCALEVFDWLRKENR 151

Query: 479  VDKETMELMVSIMCGWVKKLIEEEHAXXXXXXXXXXXXXXXXKPSFSMIEKAISLYWETG 658
            VDKETMELMV+IMCGWVKKLI+++H                 +P FSMIEK ISLYWE G
Sbjct: 152  VDKETMELMVAIMCGWVKKLIQQQHGVGDVVDLLVDMDCVGLRPGFSMIEKVISLYWEMG 211

Query: 659  EKERAILFVKEVLKREIAYAEDDGEGQKGGPAGYLAWKMMAEGNYMDAVRLIIHLGESEL 838
            EKE A+LFV+EVL+R I Y E+D EG KGGP GYLAWKMMAEG+Y +AVRL+I   ES L
Sbjct: 212  EKEGAVLFVEEVLRRGIPYVEEDEEGHKGGPTGYLAWKMMAEGDYRNAVRLVIRFRESGL 271

Query: 839  KPEVYSYLIAMTAVVKELNEFAKALRKLKGYKKTGYIAELDVETLALIEKYQSDLLADGV 1018
            KPE+YSYL+AMTAVVKELNEFAKALRKLKG+ + G +AELD+E + L EKYQSD LADGV
Sbjct: 272  KPEIYSYLVAMTAVVKELNEFAKALRKLKGFTRAGLVAELDLEDVELTEKYQSDTLADGV 331

Query: 1019 RLSNWVIEQGSSSLSGAVHERLLAMYICAGRGVEAETQQWKMKLVGKQADRDLYDIVLAI 1198
            RLSNWVI+ GS SL G VHERLLAMYICAG G+EAE Q W+MKLVGK+AD DLYDIVLAI
Sbjct: 332  RLSNWVIQDGSPSLHGIVHERLLAMYICAGHGIEAERQLWEMKLVGKEADGDLYDIVLAI 391

Query: 1199 CASQKATSSIARLLTRMEVSSSLQKKKTLSWLLRGYIKGGYFDDALETIIKMLDLGLIPE 1378
            CASQK +++ ARLLTR+EV SS QKKK+LSWLLRGYIKGG+F++A ETI+KML+LG  PE
Sbjct: 392  CASQKESNATARLLTRLEVVSSPQKKKSLSWLLRGYIKGGHFNEAAETIMKMLELGFYPE 451

Query: 1379 YLDRAAVLQGLRKKIQQSGNVENYLKLCKYLSDANLIGPCLVYMHIKKYKLWIIKMI 1549
            YLDRAAVLQGLRK+IQQ GN++ Y++LCK LSDANLIGPCLV+++I+KYKLW++KM+
Sbjct: 452  YLDRAAVLQGLRKRIQQYGNLDTYVRLCKSLSDANLIGPCLVHLYIRKYKLWVVKML 508


>ref|XP_003538312.1| PREDICTED: pentatricopeptide repeat-containing protein At2g30100,
            chloroplastic-like [Glycine max]
          Length = 510

 Score =  673 bits (1736), Expect = 0.0
 Identities = 350/512 (68%), Positives = 404/512 (78%), Gaps = 16/512 (3%)
 Frame = +2

Query: 62   MAAALGFAFLTNLGFQYR---PQFYRH---LTTESC-YKI-------SMKISNHQNPSFV 199
            MA A GFA +  LGF +    P   RH        C Y +       S +    +NPSFV
Sbjct: 1    MAYAHGFAPIFKLGFVFSSVSPSQKRHPLVFPASHCGYSLKFYDGVLSARSCKFKNPSFV 60

Query: 200  VSKRCRKLGFGLFKAVELDQFLTSDDK-DEMGEGFFEAIEELERMTREPSDVLEEMNDRL 376
                 R  GF   K+VELDQ++TSDD+ DEM +GFFEAIEELERMTREPSDVLEEMNDRL
Sbjct: 61   KQGSIR--GFRALKSVELDQYVTSDDEEDEMSDGFFEAIEELERMTREPSDVLEEMNDRL 118

Query: 377  SDRELQLVLVYFSQEGRDSWCALEVFEWLRKENRVDKETMELMVSIMCGWVKKLIEEEHA 556
            S RELQLVLVYFSQ+GRDSWCALEVF+WLRKENRVDKETMELMV+IMCGWVKKLI+E H 
Sbjct: 119  SARELQLVLVYFSQDGRDSWCALEVFDWLRKENRVDKETMELMVAIMCGWVKKLIQEHHG 178

Query: 557  XXXXXXXXXXXXXXXX-KPSFSMIEKAISLYWETGEKERAILFVKEVLKREIAYAEDDGE 733
                             +P FSMIEK ISLYWE GEKE A+LFV+EVL+R I Y E+D E
Sbjct: 179  VVGDVVDLLVDMDCVGLRPGFSMIEKVISLYWEMGEKEGAVLFVEEVLRRGIPYLEEDEE 238

Query: 734  GQKGGPAGYLAWKMMAEGNYMDAVRLIIHLGESELKPEVYSYLIAMTAVVKELNEFAKAL 913
            G KGGP GYLAWKMMAEG+Y  AVRL+IH  ES LKPEVYSYL+AMTAVVKELNE AKAL
Sbjct: 239  GHKGGPTGYLAWKMMAEGDYTSAVRLVIHFTESGLKPEVYSYLVAMTAVVKELNELAKAL 298

Query: 914  RKLKGYKKTGYIAELDVETLALIEKYQSDLLADGVRLSNWVIEQGSSSLSGAVHERLLAM 1093
            RKLK + +TG +AELD+E + L EKYQSDLL DGVRLSNW I+ GS SL G +HERLLAM
Sbjct: 299  RKLKSFARTGLVAELDLEDVELTEKYQSDLLGDGVRLSNWAIQDGSPSLHGIIHERLLAM 358

Query: 1094 YICAGRGVEAETQQWKMKLVGKQADRDLYDIVLAICASQKATSSIARLLTRMEVSSSLQK 1273
            YICAG G+EAE Q W+MKLVGK+AD DLYDIVLAICASQK +++ ARLLTR+EV+SS QK
Sbjct: 359  YICAGHGIEAEKQLWEMKLVGKEADGDLYDIVLAICASQKESNATARLLTRLEVASSPQK 418

Query: 1274 KKTLSWLLRGYIKGGYFDDALETIIKMLDLGLIPEYLDRAAVLQGLRKKIQQSGNVENYL 1453
            KK+LSWLLRGYIKGG+F++A ETI+KMLDLG  PEYLDRAAVLQGLRK+IQQ GN++ Y+
Sbjct: 419  KKSLSWLLRGYIKGGHFNEAAETIMKMLDLGFYPEYLDRAAVLQGLRKRIQQYGNLDTYV 478

Query: 1454 KLCKYLSDANLIGPCLVYMHIKKYKLWIIKMI 1549
            +LCK LSDANLIGPCLV+++I+KYKLW++KM+
Sbjct: 479  RLCKSLSDANLIGPCLVHLYIRKYKLWVVKML 510


>ref|XP_004168796.1| PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing
            protein At2g30100, chloroplastic-like [Cucumis sativus]
          Length = 501

 Score =  669 bits (1725), Expect = 0.0
 Identities = 333/503 (66%), Positives = 400/503 (79%), Gaps = 7/503 (1%)
 Frame = +2

Query: 62   MAAALGFAFLTNLGFQYRPQFYRHLTTESC-------YKISMKISNHQNPSFVVSKRCRK 220
            M  A GF  LT  GF +       L ++ C       Y +S    N+Q+ +F VS+  + 
Sbjct: 1    MICAQGFTPLTQFGFSF--SLSSPLESQRCGFSTPRLYMVSPISCNYQDSTFSVSRAAKF 58

Query: 221  LGFGLFKAVELDQFLTSDDKDEMGEGFFEAIEELERMTREPSDVLEEMNDRLSDRELQLV 400
                LFK+VELDQF+TSDD+DEMG+GFFEAIEELERMTREPSDVLEEMNDRLS RE+QLV
Sbjct: 59   RDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMTREPSDVLEEMNDRLSAREIQLV 118

Query: 401  LVYFSQEGRDSWCALEVFEWLRKENRVDKETMELMVSIMCGWVKKLIEEEHAXXXXXXXX 580
            LVYFSQEGRDSWCALEVFEWL+KENRVDKETMELMVSIMC W+KKL+E  H         
Sbjct: 119  LVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGRHNVGDVVDLL 178

Query: 581  XXXXXXXXKPSFSMIEKAISLYWETGEKERAILFVKEVLKREIAYAEDDGEGQKGGPAGY 760
                    KP FSMIEK ISLYWE GEKE+A+ FVKEVL R +A+ +DD EG KGGP+GY
Sbjct: 179  VDMDCVGLKPHFSMIEKVISLYWEMGEKEKAVFFVKEVLGRNLAFMKDDWEGHKGGPSGY 238

Query: 761  LAWKMMAEGNYMDAVRLIIHLGESELKPEVYSYLIAMTAVVKELNEFAKALRKLKGYKKT 940
            LAWKMM +G+Y  AV++++HL ES L+PEVYSYLIAMTAVVKELNEFAKALRKLKGY + 
Sbjct: 239  LAWKMMVDGDYRGAVKMVLHLRESGLRPEVYSYLIAMTAVVKELNEFAKALRKLKGYARD 298

Query: 941  GYIAELDVETLALIEKYQSDLLADGVRLSNWVIEQGSSSLSGAVHERLLAMYICAGRGVE 1120
            G++AELD   + L+ KYQ++LLADGV+LSNWV+E+GSSS+ G VHERLLAMYICAG+GVE
Sbjct: 299  GFVAELDKNNVELVAKYQTELLADGVQLSNWVLEEGSSSIRGVVHERLLAMYICAGQGVE 358

Query: 1121 AETQQWKMKLVGKQADRDLYDIVLAICASQKATSSIARLLTRMEVSSSLQKKKTLSWLLR 1300
            AE Q W+MKLVGK+AD DLYDIVLAICASQK T ++ RLLTR+E++S + KKK+L+WLLR
Sbjct: 359  AERQLWEMKLVGKEADADLYDIVLAICASQKETKAMKRLLTRIEITSPMIKKKSLTWLLR 418

Query: 1301 GYIKGGYFDDALETIIKMLDLGLIPEYLDRAAVLQGLRKKIQQSGNVENYLKLCKYLSDA 1480
            GYIKGG+F DA  T++KM++LG +PEYLDR AVLQGL K+I++  +V  YL LCK LSDA
Sbjct: 419  GYIKGGHFRDAAGTLVKMINLGFLPEYLDRVAVLQGLXKEIREPESVHTYLDLCKCLSDA 478

Query: 1481 NLIGPCLVYMHIKKYKLWIIKMI 1549
            NLIGP LVY+H++K+KLWIIKM+
Sbjct: 479  NLIGPSLVYLHLQKHKLWIIKML 501


>ref|XP_004143220.1| PREDICTED: uncharacterized protein LOC101207176 [Cucumis sativus]
          Length = 1290

 Score =  644 bits (1660), Expect = 0.0
 Identities = 323/487 (66%), Positives = 385/487 (79%), Gaps = 7/487 (1%)
 Frame = +2

Query: 62   MAAALGFAFLTNLGFQYRPQFYRHLTTESC-------YKISMKISNHQNPSFVVSKRCRK 220
            M  A GF  LT  GF +       L ++ C       Y +S    N+Q+ +F VS+  + 
Sbjct: 1    MICAQGFTPLTQFGFSF--SLSSPLESQRCGFSTPRLYMVSPISCNYQDSTFSVSRAAKF 58

Query: 221  LGFGLFKAVELDQFLTSDDKDEMGEGFFEAIEELERMTREPSDVLEEMNDRLSDRELQLV 400
                LFK+VELDQF+TSDD+DEMG+GFFEAIEELERMTREPSDVLEEMNDRLS RE+QLV
Sbjct: 59   RDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMTREPSDVLEEMNDRLSAREIQLV 118

Query: 401  LVYFSQEGRDSWCALEVFEWLRKENRVDKETMELMVSIMCGWVKKLIEEEHAXXXXXXXX 580
            LVYFSQEGRDSWCALEVFEWL+KENRVDKETMELMVSIMC W+KKL+E  H         
Sbjct: 119  LVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGRHNVGDVVDLL 178

Query: 581  XXXXXXXXKPSFSMIEKAISLYWETGEKERAILFVKEVLKREIAYAEDDGEGQKGGPAGY 760
                    KP FSMIEK ISLYWE GEKE+A+ FVKEVL R +A+ +DD EG KGGP+GY
Sbjct: 179  VDMDCVGLKPHFSMIEKVISLYWEMGEKEKAVFFVKEVLGRNLAFMKDDWEGHKGGPSGY 238

Query: 761  LAWKMMAEGNYMDAVRLIIHLGESELKPEVYSYLIAMTAVVKELNEFAKALRKLKGYKKT 940
            LAWKMM +G+Y  AV++++HL ES L+PEVYSYLIAMTAVVKELNEFAKALRKLKGY + 
Sbjct: 239  LAWKMMVDGDYRGAVKMVLHLRESGLRPEVYSYLIAMTAVVKELNEFAKALRKLKGYARD 298

Query: 941  GYIAELDVETLALIEKYQSDLLADGVRLSNWVIEQGSSSLSGAVHERLLAMYICAGRGVE 1120
            G++AELD   + L+ KYQ++LLADGV+LSNWV+E+GSSS+ G VHERLLAMYICAG+GVE
Sbjct: 299  GFVAELDKNNVELVAKYQTELLADGVQLSNWVLEEGSSSIRGVVHERLLAMYICAGQGVE 358

Query: 1121 AETQQWKMKLVGKQADRDLYDIVLAICASQKATSSIARLLTRMEVSSSLQKKKTLSWLLR 1300
            AE Q W+MKLVGK+AD DLYDIVLAICASQK T ++ RLLTR+E++S + KKK+L+WLLR
Sbjct: 359  AERQLWEMKLVGKEADADLYDIVLAICASQKETKAMKRLLTRIEITSPMIKKKSLTWLLR 418

Query: 1301 GYIKGGYFDDALETIIKMLDLGLIPEYLDRAAVLQGLRKKIQQSGNVENYLKLCKYLSDA 1480
            GYIKGG+F DA  T++KM++LG +PEYLDR AVLQGLRK+I++  +V  YL LCK LSDA
Sbjct: 419  GYIKGGHFRDAAGTLVKMINLGFLPEYLDRVAVLQGLRKEIREPESVHTYLDLCKCLSDA 478

Query: 1481 NLIGPCL 1501
            NLIGP L
Sbjct: 479  NLIGPSL 485


>ref|XP_004239038.1| PREDICTED: pentatricopeptide repeat-containing protein At2g30100,
            chloroplastic-like [Solanum lycopersicum]
          Length = 503

 Score =  642 bits (1657), Expect = 0.0
 Identities = 320/458 (69%), Positives = 381/458 (83%), Gaps = 2/458 (0%)
 Frame = +2

Query: 182  QNPSFVVSKRCRKLGFGLFKAVELDQFLTSD--DKDEMGEGFFEAIEELERMTREPSDVL 355
            +NPSFV  +R    GF LF +VEL  F+TSD  +K+EM + FFEAIEELERMTREPSDVL
Sbjct: 49   RNPSFVSPRRN---GFKLFSSVELGSFVTSDGEEKNEMSDCFFEAIEELERMTREPSDVL 105

Query: 356  EEMNDRLSDRELQLVLVYFSQEGRDSWCALEVFEWLRKENRVDKETMELMVSIMCGWVKK 535
            EEMN+RLSDRELQLVLVYF+QEGRDSWCALEVFEWLRKENRVDKETMELMVSIMCGWV+K
Sbjct: 106  EEMNERLSDRELQLVLVYFAQEGRDSWCALEVFEWLRKENRVDKETMELMVSIMCGWVQK 165

Query: 536  LIEEEHAXXXXXXXXXXXXXXXXKPSFSMIEKAISLYWETGEKERAILFVKEVLKREIAY 715
            LI  +                   PSFSM+EK ISLYW+ GE+E A+ FVKEVL+R+IAY
Sbjct: 166  LIGSKSEAGDVVDLLVDMDCVGLNPSFSMVEKVISLYWDAGEREGAVSFVKEVLRRQIAY 225

Query: 716  AEDDGEGQKGGPAGYLAWKMMAEGNYMDAVRLIIHLGESELKPEVYSYLIAMTAVVKELN 895
            ++ + +G K GPAGYLAWKMM EGNY DAV+L+I + +S LKPE+YSYLIAMTAVVKELN
Sbjct: 226  SDGNVDGHKAGPAGYLAWKMMEEGNYKDAVKLVIDIRDSGLKPELYSYLIAMTAVVKELN 285

Query: 896  EFAKALRKLKGYKKTGYIAELDVETLALIEKYQSDLLADGVRLSNWVIEQGSSSLSGAVH 1075
            EF KALRKLKG+ +TG +AELD+E L LIE+YQ+DLLA+GV+LS+W+I++G  SL G VH
Sbjct: 286  EFGKALRKLKGFARTGLVAELDLENLRLIEEYQADLLAEGVQLSDWLIQEGGPSLFGVVH 345

Query: 1076 ERLLAMYICAGRGVEAETQQWKMKLVGKQADRDLYDIVLAICASQKATSSIARLLTRMEV 1255
            ERLLAMY+CAGRG+EAE   W+MK+ GK+   DL+DIVLAICASQK    I+RLLT ME 
Sbjct: 346  ERLLAMYVCAGRGIEAERHLWQMKISGKEVSGDLHDIVLAICASQKELGPISRLLTGMEA 405

Query: 1256 SSSLQKKKTLSWLLRGYIKGGYFDDALETIIKMLDLGLIPEYLDRAAVLQGLRKKIQQSG 1435
            SSSLQKKKTLSWLLRGYIKGG+ ++A ET+IKMLDLGL P++LDRAAVLQ LR++IQQSG
Sbjct: 406  SSSLQKKKTLSWLLRGYIKGGHLENAAETVIKMLDLGLYPDFLDRAAVLQRLRRRIQQSG 465

Query: 1436 NVENYLKLCKYLSDANLIGPCLVYMHIKKYKLWIIKMI 1549
            N+E YL LCK+LSDA+LIGPCLVY++IKKY+LWII+ +
Sbjct: 466  NLETYLNLCKHLSDASLIGPCLVYLYIKKYRLWIIRTL 503


>ref|XP_006348674.1| PREDICTED: pentatricopeptide repeat-containing protein At2g30100,
            chloroplastic-like [Solanum tuberosum]
          Length = 503

 Score =  638 bits (1646), Expect = e-180
 Identities = 320/458 (69%), Positives = 379/458 (82%), Gaps = 2/458 (0%)
 Frame = +2

Query: 182  QNPSFVVSKRCRKLGFGLFKAVELDQFLTSDD--KDEMGEGFFEAIEELERMTREPSDVL 355
            +NPSFV     R+ GF LF +VEL  F+TSDD  K+EM + FFEAIEELERMTREPSDVL
Sbjct: 49   RNPSFV---NPRRNGFKLFNSVELGSFVTSDDEEKNEMSDCFFEAIEELERMTREPSDVL 105

Query: 356  EEMNDRLSDRELQLVLVYFSQEGRDSWCALEVFEWLRKENRVDKETMELMVSIMCGWVKK 535
            EEMN+RLSDRELQLVLVYF+QEGRDSWCALEVFEWLRKENRVDKETMELMVSIMCGWV+K
Sbjct: 106  EEMNERLSDRELQLVLVYFAQEGRDSWCALEVFEWLRKENRVDKETMELMVSIMCGWVQK 165

Query: 536  LIEEEHAXXXXXXXXXXXXXXXXKPSFSMIEKAISLYWETGEKERAILFVKEVLKREIAY 715
            LI  +                   PSFSM+EK ISLYW+ GE+E A+ FVKEVL+R+IAY
Sbjct: 166  LIGSKSEAGDVVDLLVDMDCVGLNPSFSMVEKVISLYWDAGEREGAVSFVKEVLRRQIAY 225

Query: 716  AEDDGEGQKGGPAGYLAWKMMAEGNYMDAVRLIIHLGESELKPEVYSYLIAMTAVVKELN 895
            ++ + +G K GPAGYLAWKMM  GNY DAV+L+I + +S LKPE+YSYLIAMTAVVKELN
Sbjct: 226  SDGNVDGHKAGPAGYLAWKMMEVGNYKDAVKLVIDIRDSGLKPELYSYLIAMTAVVKELN 285

Query: 896  EFAKALRKLKGYKKTGYIAELDVETLALIEKYQSDLLADGVRLSNWVIEQGSSSLSGAVH 1075
            EF KALRKLKG+ +TG +AELD+E L LIE+YQ+DLLA+GV+LS+W+I++G  SL G VH
Sbjct: 286  EFGKALRKLKGFARTGLVAELDLENLRLIEEYQADLLAEGVQLSDWLIQEGGPSLFGVVH 345

Query: 1076 ERLLAMYICAGRGVEAETQQWKMKLVGKQADRDLYDIVLAICASQKATSSIARLLTRMEV 1255
            ERLLAMY+CAGRG+EAE   W+MKL GK+   DL DIVLAICASQK    I+RLLT ME 
Sbjct: 346  ERLLAMYVCAGRGIEAERHLWQMKLSGKKVTGDLQDIVLAICASQKELGPISRLLTGMEA 405

Query: 1256 SSSLQKKKTLSWLLRGYIKGGYFDDALETIIKMLDLGLIPEYLDRAAVLQGLRKKIQQSG 1435
            SSSLQKKKTLSWLLRGYIKGG+ ++A ET+IKMLDLGL P++LDRAAVLQ LR++IQQSG
Sbjct: 406  SSSLQKKKTLSWLLRGYIKGGHLENAAETVIKMLDLGLYPDFLDRAAVLQRLRRRIQQSG 465

Query: 1436 NVENYLKLCKYLSDANLIGPCLVYMHIKKYKLWIIKMI 1549
            ++E YL LCK+LSDA+LIGPCLVY++IKKY+LWII+ +
Sbjct: 466  SLETYLNLCKHLSDASLIGPCLVYLYIKKYRLWIIRTL 503


>ref|XP_004500294.1| PREDICTED: pentatricopeptide repeat-containing protein At2g30100,
            chloroplastic-like [Cicer arietinum]
          Length = 508

 Score =  618 bits (1594), Expect = e-174
 Identities = 326/513 (63%), Positives = 389/513 (75%), Gaps = 17/513 (3%)
 Frame = +2

Query: 62   MAAALGFAFLTNLGFQYRPQF---YRH--LTTESCYKISMKISN----HQNPSFVVSKRC 214
            MA+  GFA    LGF +   F    +H  +   S    S+K  +     QNPSF  +K  
Sbjct: 1    MASLHGFAPTLKLGFAFSSLFSPKQKHPLVFPSSKRGFSLKFCDGSFKFQNPSFPPTK-- 58

Query: 215  RKLGFGLFKAVELDQFLTSDDKDE-------MGEGFFEAIEELERMTREPSDVLEEMNDR 373
                +   K+VELDQF+TSDD++E       MG+GF EAIEELERMTREPSDVLEEMNDR
Sbjct: 59   -PNSYMRKKSVELDQFVTSDDEEEEEEEEEEMGDGFLEAIEELERMTREPSDVLEEMNDR 117

Query: 374  LSDRELQLVLVYFSQEGRDSWCALEVFEWLRKENRVDKETMELMVSIMCGWVKKLIEEEH 553
            LS RELQLVLVYFSQEGRDSWCALEVF+WLRKENRVDKETMELMV+IMCGWVKKLI E+H
Sbjct: 118  LSARELQLVLVYFSQEGRDSWCALEVFDWLRKENRVDKETMELMVAIMCGWVKKLIMEKH 177

Query: 554  AXXXXXXXXXXXXXXXXKPSFSMIEKAISLYWETGEKERAILFVKEVLKREIAYAEDDGE 733
                             +P FSMIEK ISLYWE GEK+ A+LFV+EVL+R I+  EDD E
Sbjct: 178  GVDDVIDLLVNMNCVGLRPGFSMIEKVISLYWEMGEKDDAVLFVEEVLRRGISSNEDDPE 237

Query: 734  GQKGGPAGYLAWKMMAEGNYMDAVRLIIHLGESELKPEVYSYLIAMTAVVKELNEFAKAL 913
              KGGP GYLAWKMM EG+Y  AVRL+    E+ LKP++YSYL+AMTAVVKELNE AKAL
Sbjct: 238  --KGGPTGYLAWKMMVEGDYRGAVRLVTRFREAGLKPDIYSYLVAMTAVVKELNELAKAL 295

Query: 914  RKLKGYKKTGYIAELDVETLALIEKYQSDLLADGVRLSNWVIEQGS-SSLSGAVHERLLA 1090
            RKLK + + G I E D E + L EKYQSDLLADG RLS WVI+ GS SS+ G +HERLLA
Sbjct: 296  RKLKSFSRAGLITEFDREDVELAEKYQSDLLADGARLSKWVIQDGSPSSIHGIIHERLLA 355

Query: 1091 MYICAGRGVEAETQQWKMKLVGKQADRDLYDIVLAICASQKATSSIARLLTRMEVSSSLQ 1270
            MYICAGRG+EAE Q W+MKL+GK+A   LYD+VLAICASQK  ++ ARL+ RMEV+SS Q
Sbjct: 356  MYICAGRGIEAERQLWEMKLLGKEAVGGLYDMVLAICASQKEAAATARLMIRMEVASSPQ 415

Query: 1271 KKKTLSWLLRGYIKGGYFDDALETIIKMLDLGLIPEYLDRAAVLQGLRKKIQQSGNVENY 1450
            KKK+LSWLLRGYIKGG+F++A ET++KML+LG  P+YLDR AV+QGLRK+IQQ GN++ Y
Sbjct: 416  KKKSLSWLLRGYIKGGHFNEAAETVMKMLELGFYPDYLDRVAVMQGLRKRIQQYGNLDTY 475

Query: 1451 LKLCKYLSDANLIGPCLVYMHIKKYKLWIIKMI 1549
            +KLCK L +ANLIG C+ Y++I+KYKLW++KMI
Sbjct: 476  IKLCKSLYEANLIGACVCYLYIRKYKLWVVKMI 508


>gb|EYU45707.1| hypothetical protein MIMGU_mgv1a020921mg [Mimulus guttatus]
          Length = 421

 Score =  582 bits (1501), Expect = e-163
 Identities = 287/421 (68%), Positives = 337/421 (80%)
 Frame = +2

Query: 287  MGEGFFEAIEELERMTREPSDVLEEMNDRLSDRELQLVLVYFSQEGRDSWCALEVFEWLR 466
            MGEGFFEAIEELERM REPSDVLEEMND+LS RELQLVLVYF+QEGRDSWCALEVFEWL+
Sbjct: 1    MGEGFFEAIEELERMAREPSDVLEEMNDKLSARELQLVLVYFAQEGRDSWCALEVFEWLK 60

Query: 467  KENRVDKETMELMVSIMCGWVKKLIEEEHAXXXXXXXXXXXXXXXXKPSFSMIEKAISLY 646
            KENRVDKETMELMVSIMC WVKKLIE ++                 K SFSM+EK ISLY
Sbjct: 61   KENRVDKETMELMVSIMCTWVKKLIEGKNEVEDVVDLLVDMDCVGLKTSFSMVEKVISLY 120

Query: 647  WETGEKERAILFVKEVLKREIAYAEDDGEGQKGGPAGYLAWKMMAEGNYMDAVRLIIHLG 826
            WE GE++  +LFVKEVL+R I+   D  EG+KGGPAGYLAWKMM EG Y DA +L+IHL 
Sbjct: 121  WEAGERDGTVLFVKEVLRRGISMRLDGDEGKKGGPAGYLAWKMMEEGKYRDAAKLVIHLK 180

Query: 827  ESELKPEVYSYLIAMTAVVKELNEFAKALRKLKGYKKTGYIAELDVETLALIEKYQSDLL 1006
            E  LKP+VYSYLIAMTAVVKELNEFAK+LRKLK + K   +A LD E+L L++ YQ+DLL
Sbjct: 181  ECGLKPDVYSYLIAMTAVVKELNEFAKSLRKLKSFTKANLVAHLDPESLHLVQDYQNDLL 240

Query: 1007 ADGVRLSNWVIEQGSSSLSGAVHERLLAMYICAGRGVEAETQQWKMKLVGKQADRDLYDI 1186
            +DG+ LSN ++++      G VHERLLAMYICAGRG EAE Q W+MKLVGK+AD DLYDI
Sbjct: 241  SDGLHLSNCILQEWGPQFHGMVHERLLAMYICAGRGSEAERQLWEMKLVGKEADADLYDI 300

Query: 1187 VLAICASQKATSSIARLLTRMEVSSSLQKKKTLSWLLRGYIKGGYFDDALETIIKMLDLG 1366
            VLAICASQ  T SI RL+ R++   +L++KKTLSWLLRGY+KGG+F  A ET++KMLDLG
Sbjct: 301  VLAICASQGETGSIGRLMARVDCVGALRRKKTLSWLLRGYVKGGHFKKAAETLVKMLDLG 360

Query: 1367 LIPEYLDRAAVLQGLRKKIQQSGNVENYLKLCKYLSDANLIGPCLVYMHIKKYKLWIIKM 1546
              PE LDR AV+QGL ++IQ  GNV+ YL LCK LSDA LIGP LVY+H++K+KLW+IKM
Sbjct: 361  FFPELLDRVAVMQGLSRRIQLQGNVDTYLTLCKRLSDAGLIGPALVYVHMRKHKLWVIKM 420

Query: 1547 I 1549
            +
Sbjct: 421  L 421


>ref|XP_006293980.1| hypothetical protein CARUB_v10022972mg [Capsella rubella]
            gi|482562688|gb|EOA26878.1| hypothetical protein
            CARUB_v10022972mg [Capsella rubella]
          Length = 532

 Score =  563 bits (1451), Expect = e-158
 Identities = 294/519 (56%), Positives = 370/519 (71%), Gaps = 17/519 (3%)
 Frame = +2

Query: 44   SNRSEEMAAALGFAFLTNLGFQYRPQF-----YRHLTTESCYKISMKIS-NHQNPSFVVS 205
            S  S  MA A GFA LT L   + P       YR    +S  +IS  +  N+    F   
Sbjct: 22   SFHSHVMAYARGFASLTQLNLIFSPSISLRRVYRTPGVKSVSRISCNLKLNYSAGKFRDL 81

Query: 206  KRCRKLGFGLFKAVELDQFLTSDDK------DEMGEGFFEAIEELERMTREPSDVLEEMN 367
            K        L ++VELDQF+TS+++      DE+GEGFFEAIEELERMTREPSDVLEEMN
Sbjct: 82   K--------LSRSVELDQFITSEEEGGEEAEDEIGEGFFEAIEELERMTREPSDVLEEMN 133

Query: 368  DRLSDRELQLVLVYFSQEGRDSWCALEVFEWLRKENRVDKETMELMVSIMCGWVKKLIEE 547
             RLS RELQL+LVYF+QEGRDSWC LEVFEWL+KENRVD++ +ELMVSIMCGWVKKLI+E
Sbjct: 134  HRLSSRELQLMLVYFAQEGRDSWCTLEVFEWLKKENRVDEQMVELMVSIMCGWVKKLIQE 193

Query: 548  EHAXXXXXXXXXXXXXXXXKPSFSMIEKAISLYWETGEKERAILFVKEVLKREIAYAED- 724
            E                  KP FSM+EK I+LY E G+KE A+LFVKEVL+R   +    
Sbjct: 194  ECGADQVFDLLIEMDCVGLKPGFSMMEKVIALYCEMGKKESAVLFVKEVLRRRDGFGYSV 253

Query: 725  --DGEGQKGGPAGYLAWKMMAEGNYMDAVRLIIHLGESELKPEVYSYLIAMTAVVKELNE 898
                EG+KGGP GYLAWK+M +G+Y  AV L++ L  S L PE YSYLIAMTA+VKELN 
Sbjct: 254  VGGSEGRKGGPVGYLAWKLMVDGDYKKAVDLVVELRLSGLMPEAYSYLIAMTAIVKELNS 313

Query: 899  FAKALRKLKGYKKTGYIAELDVETLALIEKYQSDLLADGVRLSNWVIEQGSS--SLSGAV 1072
              K LR+LK + + GY+ E+D     LIEKYQS+ L+ G++L+ W +E+G    S+ G V
Sbjct: 314  LGKTLRELKRFTRAGYVTEIDDHDRVLIEKYQSETLSRGLQLATWAVEEGQQEDSIIGVV 373

Query: 1073 HERLLAMYICAGRGVEAETQQWKMKLVGKQADRDLYDIVLAICASQKATSSIARLLTRME 1252
            HERLLAMYICAGRG EAE Q WKMKL G++ + +L+DIV+AICASQK  ++++RLLTR+E
Sbjct: 374  HERLLAMYICAGRGPEAEKQLWKMKLAGREPEAELHDIVMAICASQKEVNAVSRLLTRVE 433

Query: 1253 VSSSLQKKKTLSWLLRGYIKGGYFDDALETIIKMLDLGLIPEYLDRAAVLQGLRKKIQQS 1432
               S +KKKTLSWLLRGY+KGG+F++A ET+I M+D GL PEY+DR AV+QG+ +KIQ+ 
Sbjct: 434  FMESKRKKKTLSWLLRGYVKGGHFEEAAETLITMIDSGLHPEYIDRVAVMQGMTRKIQRP 493

Query: 1433 GNVENYLKLCKYLSDANLIGPCLVYMHIKKYKLWIIKMI 1549
             ++E Y+ LCK L DA L+GPCLVYM++ KYKLWI+KM+
Sbjct: 494  RDIEAYMGLCKRLFDAGLVGPCLVYMYMDKYKLWIVKMM 532


>ref|XP_006410063.1| hypothetical protein EUTSA_v10016546mg [Eutrema salsugineum]
            gi|557111232|gb|ESQ51516.1| hypothetical protein
            EUTSA_v10016546mg [Eutrema salsugineum]
          Length = 503

 Score =  563 bits (1450), Expect = e-157
 Identities = 295/511 (57%), Positives = 375/511 (73%), Gaps = 15/511 (2%)
 Frame = +2

Query: 62   MAAALGFAFLT--------NLGFQYRPQFYRHLTTESCYKISMKISNHQNPSFVVSKRCR 217
            MA A GFA LT         L F +RP+ +R+   +   +IS  +  +       + + R
Sbjct: 1    MAYARGFASLTFSPPISLRRLRF-FRPRLHRNYRVKPDSRISCNLKFN-----FAAGKFR 54

Query: 218  KLGFGLFKAVELDQFLTSDDK---DEMGEGFFEAIEELERMTREPSDVLEEMNDRLSDRE 388
            +LG  L ++VELDQF+TS+++   DE+G+GFFEAIEELERMTREPSD+LEEMN RLS RE
Sbjct: 55   ELG--LSRSVELDQFITSEEENQADEIGQGFFEAIEELERMTREPSDILEEMNHRLSSRE 112

Query: 389  LQLVLVYFSQEGRDSWCALEVFEWLRKENRVDKETMELMVSIMCGWVKKLIEEEHAXXXX 568
            LQL+LVYF+QEGRDSWCALEVFEWL+KENRVD+E MELMVSIMCGWVKKLI+EE      
Sbjct: 113  LQLMLVYFAQEGRDSWCALEVFEWLKKENRVDEEMMELMVSIMCGWVKKLIQEECDAAQV 172

Query: 569  XXXXXXXXXXXXKPSFSMIEKAISLYWETGEKERAILFVKEVLKRE--IAYAEDDGEGQK 742
                        KP FSM+EK I+LY E  +KE A+LFVKEVL+R     Y+    EG+K
Sbjct: 173  FDLLIEMDCVGLKPGFSMMEKVIALYCEMEKKESAVLFVKEVLRRRDTSGYSVVVSEGRK 232

Query: 743  GGPAGYLAWKMMAEGNYMDAVRLIIHLGESELKPEVYSYLIAMTAVVKELNEFAKALRKL 922
            GGP GYLAWKMM +G+Y  AV L++ L  S LKPE YSYLIAMTA+VKELN   K LR+L
Sbjct: 233  GGPTGYLAWKMMVDGDYKKAVDLVVELRFSGLKPEAYSYLIAMTAIVKELNSLGKTLREL 292

Query: 923  KGYKKTGYIAELDVETLALIEKYQSDLLADGVRLSNWVIEQG--SSSLSGAVHERLLAMY 1096
            K + + G +AE+D     LIEKYQS+L++ G+ L+ W +++G  + S+ GAVHERLL MY
Sbjct: 293  KRFTRAGLVAEIDDHDRLLIEKYQSELISRGLELAAWAVQEGQQNDSIIGAVHERLLGMY 352

Query: 1097 ICAGRGVEAETQQWKMKLVGKQADRDLYDIVLAICASQKATSSIARLLTRMEVSSSLQKK 1276
            ICAGRG EAE Q W MKL G++ + DL+DIV+AICASQK  ++++RLLTR+E   S  KK
Sbjct: 353  ICAGRGPEAEKQLWNMKLTGREPEADLHDIVMAICASQKEVNAVSRLLTRVEFMESKGKK 412

Query: 1277 KTLSWLLRGYIKGGYFDDALETIIKMLDLGLIPEYLDRAAVLQGLRKKIQQSGNVENYLK 1456
            K+LSWLLRGY+KGG+F++A ET+I M+D GL PEY+DR AV+QG+ KKIQ+  +VE Y+ 
Sbjct: 413  KSLSWLLRGYVKGGHFEEAAETLITMMDSGLYPEYIDRVAVMQGMTKKIQRPRDVEAYMG 472

Query: 1457 LCKYLSDANLIGPCLVYMHIKKYKLWIIKMI 1549
            LCK L DA L+GPCLVYM++ KYKLWI+KM+
Sbjct: 473  LCKRLFDAGLVGPCLVYMYMDKYKLWIVKMM 503


Top