BLASTX nr result

ID: Rehmannia23_contig00006362 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia23_contig00006362
         (1892 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EMJ06247.1| hypothetical protein PRUPE_ppa004609mg [Prunus pe...   678   0.0  
ref|XP_002313976.1| ubiquitin family protein [Populus trichocarp...   674   0.0  
ref|XP_002521193.1| conserved hypothetical protein [Ricinus comm...   672   0.0  
ref|XP_002278434.1| PREDICTED: pentatricopeptide repeat-containi...   672   0.0  
gb|EOY34562.1| Pentatricopeptide repeat-containing protein [Theo...   669   0.0  
ref|XP_003551233.1| PREDICTED: pentatricopeptide repeat-containi...   663   0.0  
gb|ESW18802.1| hypothetical protein PHAVU_006G071400g [Phaseolus...   660   0.0  
ref|XP_006425116.1| hypothetical protein CICLE_v10028251mg [Citr...   658   0.0  
ref|XP_006488563.1| PREDICTED: pentatricopeptide repeat-containi...   655   0.0  
ref|XP_003538312.1| PREDICTED: pentatricopeptide repeat-containi...   652   0.0  
ref|XP_004296059.1| PREDICTED: uncharacterized protein LOC101292...   651   0.0  
gb|EXB37964.1| hypothetical protein L484_011688 [Morus notabilis]     649   0.0  
ref|XP_004168796.1| PREDICTED: LOW QUALITY PROTEIN: pentatricope...   643   0.0  
ref|XP_004239038.1| PREDICTED: pentatricopeptide repeat-containi...   634   e-179
ref|XP_006348674.1| PREDICTED: pentatricopeptide repeat-containi...   633   e-179
ref|XP_004143220.1| PREDICTED: uncharacterized protein LOC101207...   612   e-172
ref|XP_004500294.1| PREDICTED: pentatricopeptide repeat-containi...   604   e-170
gb|EPS70238.1| hypothetical protein M569_04522 [Genlisea aurea]       589   e-165
ref|XP_006293981.1| hypothetical protein CARUB_v10022972mg [Caps...   561   e-157
ref|XP_006293980.1| hypothetical protein CARUB_v10022972mg [Caps...   561   e-157

>gb|EMJ06247.1| hypothetical protein PRUPE_ppa004609mg [Prunus persica]
          Length = 500

 Score =  678 bits (1750), Expect = 0.0
 Identities = 344/464 (74%), Positives = 385/464 (82%), Gaps = 1/464 (0%)
 Frame = +1

Query: 472  YARISVGESPFSLQKKRSKVQGFGMLKSVQLDVFITSDDEDEMSEGFFAAIEELERMARE 651
            + RI   + P  +  K SKV+ F + KSV+LD F+TSDDEDEM EGFF AIEELERM RE
Sbjct: 37   FPRICKHQKPNFIVAKSSKVRDFRLFKSVELDQFLTSDDEDEMGEGFFEAIEELERMTRE 96

Query: 652  PSDVLEEMNDKLSARELQLVLVYFSQEGRDSWCALEVFEWLKKENRVDKETMELMVSIMC 831
            PSDVLEEMND+LSARELQLVLVYFSQEGRDSWCALEVFEWL+KENRVDKETM+LMVSIMC
Sbjct: 97   PSDVLEEMNDRLSARELQLVLVYFSQEGRDSWCALEVFEWLRKENRVDKETMDLMVSIMC 156

Query: 832  TWVKKLIEGKNKXXXXXXXXXXXXXXXXKTSFSMIEKVISLYWEAGEKEGTILFVKEVLR 1011
            +WVKKLI+ ++                 K SFSM+EKVISLYWE GEKE  +LFVKEVL+
Sbjct: 157  SWVKKLIQREHDIGDVVDLLVDMDCVGLKPSFSMMEKVISLYWEMGEKEKAVLFVKEVLK 216

Query: 1012 RGISVLD-GDRDGNKGGPAGYLAWKMMEEGNYRDAAKLVIHLRECGLKPEVYSYLIAMTA 1188
            RGI   +  D DG+KGGP GYLAWKMM EGNYRD+ KLVIHLRE GLKPEVYSYLIAMTA
Sbjct: 217  RGIVYSEEDDTDGHKGGPTGYLAWKMMVEGNYRDSVKLVIHLRESGLKPEVYSYLIAMTA 276

Query: 1189 VVKELNEFAKALRKLKGFTKAGLVAELDMDNVGLIENYQQDLLTYGLHLSNWVIEEGGPS 1368
            VVKELNE AKALRKLKGFT+AGL+AE D +NVGLIE YQ DLL+ G+ LSNWVI+EG  S
Sbjct: 277  VVKELNELAKALRKLKGFTRAGLIAEFDTENVGLIEKYQSDLLSDGVQLSNWVIQEGSSS 336

Query: 1369 LLGVVHERLLAMYVCAGRGSAAERQLWEMKLVGKEADGDLYDIVLAICASQKESSSVGRL 1548
            L GVVHERLLAMY+C+G G  AERQLWEMKLVGKEAD DLYDIVLAICASQKE+S++GRL
Sbjct: 337  LHGVVHERLLAMYICSGHGLEAERQLWEMKLVGKEADADLYDIVLAICASQKEASAIGRL 396

Query: 1549 LTRMDVASPFRRKKTLSWLLRGYIKGGHFKNAAETVIKMLDSGLYPEFLDRVAVVQGLSR 1728
            LTR +V S  R+KK+LSWLLRGYIKGGHF +AAETVIKMLD GL PEFLDR AV+QGL +
Sbjct: 397  LTRTEVTSSLRKKKSLSWLLRGYIKGGHFDDAAETVIKMLDLGLCPEFLDRAAVLQGLRK 456

Query: 1729 RIQQQGNVDTYLTLCKRLSDANLIGPSLVYLHMRKHKLWIIKML 1860
             IQ+ G VDTYL LCKRLSDA+LIGP LVYL +RK+KLWI KML
Sbjct: 457  SIQESGGVDTYLKLCKRLSDASLIGPCLVYLFIRKYKLWITKML 500


>ref|XP_002313976.1| ubiquitin family protein [Populus trichocarpa]
            gi|222850384|gb|EEE87931.1| ubiquitin family protein
            [Populus trichocarpa]
          Length = 500

 Score =  674 bits (1738), Expect = 0.0
 Identities = 340/506 (67%), Positives = 407/506 (80%), Gaps = 7/506 (1%)
 Frame = +1

Query: 364  MASVGGIAAISNLGLTYSSSSKHY-------IFLSTHLPNIKSYARISVGESPFSLQKKR 522
            MAS    +++S +   +S   +++         +ST + N ++  R      P  +  K 
Sbjct: 1    MASAHAFSSLSKVSPVFSLKKRYWNSCMKPCCMVSTIICNYQTPKR------PNFVVAKT 54

Query: 523  SKVQGFGMLKSVQLDVFITSDDEDEMSEGFFAAIEELERMAREPSDVLEEMNDKLSAREL 702
            +KV+ F + KSV+LD ++TSDDE+EM EGFF AIEELERM REPSD+LEEMND+LSAREL
Sbjct: 55   TKVREFRLFKSVELDQYVTSDDEEEMGEGFFEAIEELERMTREPSDILEEMNDRLSAREL 114

Query: 703  QLVLVYFSQEGRDSWCALEVFEWLKKENRVDKETMELMVSIMCTWVKKLIEGKNKXXXXX 882
            QLVLVYFSQEGRDSWCALEVFEWL+KENRVDKETMELMVSIMC+WVKKLIEG+       
Sbjct: 115  QLVLVYFSQEGRDSWCALEVFEWLRKENRVDKETMELMVSIMCSWVKKLIEGEQDVGDVV 174

Query: 883  XXXXXXXXXXXKTSFSMIEKVISLYWEAGEKEGTILFVKEVLRRGISVLDGDRDGNKGGP 1062
                       K SFSMIEKVISLYW+ G+KEG + FVKEVLRRGI+    D +G KGGP
Sbjct: 175  DLLVDMDCVGLKPSFSMIEKVISLYWDMGKKEGAVSFVKEVLRRGIAYSGDDGEGQKGGP 234

Query: 1063 AGYLAWKMMEEGNYRDAAKLVIHLRECGLKPEVYSYLIAMTAVVKELNEFAKALRKLKGF 1242
             GYL WKMM +GNYR+A KLVIHLRE GLKPE+Y+YLIAMTAVVKELNEF+KALRKLKG+
Sbjct: 235  TGYLTWKMMVDGNYRNAVKLVIHLRESGLKPEIYAYLIAMTAVVKELNEFSKALRKLKGY 294

Query: 1243 TKAGLVAELDMDNVGLIENYQQDLLTYGLHLSNWVIEEGGPSLLGVVHERLLAMYVCAGR 1422
            +++G+V ELD +NV L+E YQ DLL  G+ LS+WVI+EG P+L GVVHERLLAMY+CAGR
Sbjct: 295  SRSGMVTELDAENVELVEKYQSDLLADGVCLSSWVIQEGSPALYGVVHERLLAMYICAGR 354

Query: 1423 GSAAERQLWEMKLVGKEADGDLYDIVLAICASQKESSSVGRLLTRMDVASPFRRKKTLSW 1602
            G  AERQLWEMKLVGKEADGDLYDIVLAICASQKE+S+V RLLTR++VAS  R+KK+LSW
Sbjct: 355  GLDAERQLWEMKLVGKEADGDLYDIVLAICASQKEASAVARLLTRIEVASSMRKKKSLSW 414

Query: 1603 LLRGYIKGGHFKNAAETVIKMLDSGLYPEFLDRVAVVQGLSRRIQQQGNVDTYLTLCKRL 1782
            LLRGYIKGGH+  AAET+IKMLD GL P++LDRVAV+QGL +RIQQ GNV++YL LCKRL
Sbjct: 415  LLRGYIKGGHYGEAAETLIKMLDLGLSPDYLDRVAVMQGLRKRIQQWGNVESYLKLCKRL 474

Query: 1783 SDANLIGPSLVYLHMRKHKLWIIKML 1860
            SD NLIGPSLVYL+++K+KLWI+K+L
Sbjct: 475  SDVNLIGPSLVYLYIKKYKLWIMKLL 500


>ref|XP_002521193.1| conserved hypothetical protein [Ricinus communis]
            gi|223539607|gb|EEF41193.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 499

 Score =  672 bits (1734), Expect = 0.0
 Identities = 333/450 (74%), Positives = 383/450 (85%)
 Frame = +1

Query: 511  QKKRSKVQGFGMLKSVQLDVFITSDDEDEMSEGFFAAIEELERMAREPSDVLEEMNDKLS 690
            Q+ +S+ + F +LKSV+LD +I SDDE+EMSEGFF AIEELERM REPSDVLEEMNDKLS
Sbjct: 50   QQSKSRNREFRVLKSVELDQYIASDDEEEMSEGFFEAIEELERMTREPSDVLEEMNDKLS 109

Query: 691  ARELQLVLVYFSQEGRDSWCALEVFEWLKKENRVDKETMELMVSIMCTWVKKLIEGKNKX 870
            ARELQLVLVYFSQEGRDSWCALEVFEWL+KENRVDKETMELMVSIMC+W+KKLIEG+++ 
Sbjct: 110  ARELQLVLVYFSQEGRDSWCALEVFEWLRKENRVDKETMELMVSIMCSWIKKLIEGEHEI 169

Query: 871  XXXXXXXXXXXXXXXKTSFSMIEKVISLYWEAGEKEGTILFVKEVLRRGISVLDGDRDGN 1050
                           K SFSMIEKVISLYWE GEKE ++ FVKEVLRR ++  + D +G 
Sbjct: 170  GDVVDLLVDMDCVGLKPSFSMIEKVISLYWEIGEKEKSVSFVKEVLRREVAYFEDDGEGQ 229

Query: 1051 KGGPAGYLAWKMMEEGNYRDAAKLVIHLRECGLKPEVYSYLIAMTAVVKELNEFAKALRK 1230
            KGGP GYLAWKMM +GNYRDA KLVIH RE GLKPEVYSYLIAMTAVVKELNEFAKALRK
Sbjct: 230  KGGPTGYLAWKMMVDGNYRDAVKLVIHFRESGLKPEVYSYLIAMTAVVKELNEFAKALRK 289

Query: 1231 LKGFTKAGLVAELDMDNVGLIENYQQDLLTYGLHLSNWVIEEGGPSLLGVVHERLLAMYV 1410
            LKGF K+GL+AELD +N  LIE YQ DL+  G+ LS+WVI+EG PSL GVVHERLLAMY+
Sbjct: 290  LKGFAKSGLIAELDAENTRLIEKYQSDLIADGVCLSSWVIQEGSPSLYGVVHERLLAMYI 349

Query: 1411 CAGRGSAAERQLWEMKLVGKEADGDLYDIVLAICASQKESSSVGRLLTRMDVASPFRRKK 1590
            CAGRG  AERQLWEMKLVGK ADGDLYDIVLAICASQKE+S+V RLLTR++V S  ++KK
Sbjct: 350  CAGRGLDAERQLWEMKLVGKHADGDLYDIVLAICASQKEASAVSRLLTRVEVTSSLQKKK 409

Query: 1591 TLSWLLRGYIKGGHFKNAAETVIKMLDSGLYPEFLDRVAVVQGLSRRIQQQGNVDTYLTL 1770
            TLSWLLRGY+KGG +  AAE ++KMLD GL P++LDRVAV+QGL +RIQQ GNV++YL L
Sbjct: 410  TLSWLLRGYLKGGQYDEAAEALVKMLDMGLCPDYLDRVAVLQGLRKRIQQWGNVESYLNL 469

Query: 1771 CKRLSDANLIGPSLVYLHMRKHKLWIIKML 1860
            CKRLSD NLIGPSLVYL+++K+KLWI+KML
Sbjct: 470  CKRLSDENLIGPSLVYLYIKKYKLWIMKML 499


>ref|XP_002278434.1| PREDICTED: pentatricopeptide repeat-containing protein At2g30100,
            chloroplastic [Vitis vinifera]
          Length = 511

 Score =  672 bits (1734), Expect = 0.0
 Identities = 354/517 (68%), Positives = 402/517 (77%), Gaps = 18/517 (3%)
 Frame = +1

Query: 364  MASVGGIAAI----SNLGLTYSSSSKHYIFLSTHLPN--IKSYARISVGE---------- 495
            MAS  G A+     + LG T SSS       S   P   +  ++R  +GE          
Sbjct: 1    MASAHGFASSLMSPTELGFTLSSS------FSIQRPRLIVPKFSRSFLGEYCSRATTICN 54

Query: 496  --SPFSLQKKRSKVQGFGMLKSVQLDVFITSDDEDEMSEGFFAAIEELERMAREPSDVLE 669
              +P  +  KR K++ F + KSV+LD F+TSDDEDEMSEGFF AIEELERM REPSDVLE
Sbjct: 55   HQNPRFVVPKRDKIREFRLFKSVELDQFLTSDDEDEMSEGFFEAIEELERMTREPSDVLE 114

Query: 670  EMNDKLSARELQLVLVYFSQEGRDSWCALEVFEWLKKENRVDKETMELMVSIMCTWVKKL 849
            EMND+LSARELQLVLVYFSQEGRDSWCALEVFEWL+KENRVDKETMELMVSIMC+WVKKL
Sbjct: 115  EMNDRLSARELQLVLVYFSQEGRDSWCALEVFEWLRKENRVDKETMELMVSIMCSWVKKL 174

Query: 850  IEGKNKXXXXXXXXXXXXXXXXKTSFSMIEKVISLYWEAGEKEGTILFVKEVLRRGISVL 1029
            IEG++                 K  FSMIEKVISLYWE  EKE  +LFVKEVLRR I+  
Sbjct: 175  IEGEHDVGDVVDLLVDMDCVGLKPGFSMIEKVISLYWEMEEKEKAVLFVKEVLRREIAYS 234

Query: 1030 DGDRDGNKGGPAGYLAWKMMEEGNYRDAAKLVIHLRECGLKPEVYSYLIAMTAVVKELNE 1209
            + D DG+KGGP GYLAWKMM EGNYR A KLVIHLRE GLKPEVYSYLIAMTAVVKELNE
Sbjct: 235  EDDGDGHKGGPTGYLAWKMMAEGNYRGAVKLVIHLRESGLKPEVYSYLIAMTAVVKELNE 294

Query: 1210 FAKALRKLKGFTKAGLVAELDMDNVGLIENYQQDLLTYGLHLSNWVIEEGGPSLLGVVHE 1389
            FAKALRKLKGFTK+GL+AELD +NV LIE YQ DLL  G+ LS+WVI+EG   L GVV+E
Sbjct: 295  FAKALRKLKGFTKSGLIAELDAENVELIEKYQSDLLADGVRLSSWVIQEGRSPLHGVVYE 354

Query: 1390 RLLAMYVCAGRGSAAERQLWEMKLVGKEADGDLYDIVLAICASQKESSSVGRLLTRMDVA 1569
            RLLAMY+CAGRG  AERQLWEMKLVGKEAD +LYDIVLAICAS+KE+S++ RLLT M+V 
Sbjct: 355  RLLAMYICAGRGLEAERQLWEMKLVGKEADRELYDIVLAICASKKEASAISRLLTGMEVT 414

Query: 1570 SPFRRKKTLSWLLRGYIKGGHFKNAAETVIKMLDSGLYPEFLDRVAVVQGLSRRIQQQGN 1749
            S  RRKKTLSWLLRGYIKG HF +A+ET+IKMLD GL PE+LDR AV+QGL  RIQQ GN
Sbjct: 415  SSIRRKKTLSWLLRGYIKGSHFDDASETIIKMLDLGLCPEYLDRAAVLQGLRNRIQQTGN 474

Query: 1750 VDTYLTLCKRLSDANLIGPSLVYLHMRKHKLWIIKML 1860
            V+TYL LCK LSDANLIGP LVYL+++K+KLWI+K +
Sbjct: 475  VETYLKLCKHLSDANLIGPCLVYLYIKKYKLWILKTI 511


>gb|EOY34562.1| Pentatricopeptide repeat-containing protein [Theobroma cacao]
          Length = 504

 Score =  669 bits (1726), Expect = 0.0
 Identities = 335/462 (72%), Positives = 385/462 (83%), Gaps = 1/462 (0%)
 Frame = +1

Query: 478  RISVGESP-FSLQKKRSKVQGFGMLKSVQLDVFITSDDEDEMSEGFFAAIEELERMAREP 654
            RI   ++P F L+K + K +   + KSV+LD F+TSDDEDEMSEGFF AIEELERM REP
Sbjct: 43   RICNHQNPSFVLRKIQPKTRECRLFKSVELDQFLTSDDEDEMSEGFFEAIEELERMTREP 102

Query: 655  SDVLEEMNDKLSARELQLVLVYFSQEGRDSWCALEVFEWLKKENRVDKETMELMVSIMCT 834
            SD+LEEMND+LS+RELQLVLVYFSQEGRDSWCALEVFEWLKKEN+VD ETMELMVSIMC+
Sbjct: 103  SDILEEMNDRLSSRELQLVLVYFSQEGRDSWCALEVFEWLKKENKVDNETMELMVSIMCS 162

Query: 835  WVKKLIEGKNKXXXXXXXXXXXXXXXXKTSFSMIEKVISLYWEAGEKEGTILFVKEVLRR 1014
            WVKKLIEG+                  K  FSMIEKVIS+YWE  +K+  ++FVKEVLRR
Sbjct: 163  WVKKLIEGEGDVGDVVDLLVDMDCVGLKPGFSMIEKVISMYWEMEKKDRAVVFVKEVLRR 222

Query: 1015 GISVLDGDRDGNKGGPAGYLAWKMMEEGNYRDAAKLVIHLRECGLKPEVYSYLIAMTAVV 1194
            GIS  D D +G KGGP GYLAWKMM EGNYRDA KLVI LRE GLKPE+YSYLIAMTA+V
Sbjct: 223  GISYEDEDGEGQKGGPTGYLAWKMMVEGNYRDAIKLVIELRESGLKPEIYSYLIAMTAIV 282

Query: 1195 KELNEFAKALRKLKGFTKAGLVAELDMDNVGLIENYQQDLLTYGLHLSNWVIEEGGPSLL 1374
            KELNEFAKALRKLKGF ++GLVAELDM+NV LI+ YQ DLL  GL LSNW I+EG  SL 
Sbjct: 283  KELNEFAKALRKLKGFARSGLVAELDMENVELIKKYQSDLLADGLRLSNWAIQEGTSSLF 342

Query: 1375 GVVHERLLAMYVCAGRGSAAERQLWEMKLVGKEADGDLYDIVLAICASQKESSSVGRLLT 1554
            G+VHERLLAMY+CAGRG  AERQLWEMKL GKEADGDL+DIVLAICASQKE+S++ RLLT
Sbjct: 343  GLVHERLLAMYICAGRGLEAERQLWEMKLAGKEADGDLHDIVLAICASQKEASAISRLLT 402

Query: 1555 RMDVASPFRRKKTLSWLLRGYIKGGHFKNAAETVIKMLDSGLYPEFLDRVAVVQGLSRRI 1734
            RM+V+S  RRKKTLSWLLRGYIKGGH  +AAETVIKMLD GL+PE+LDR AV+Q L +RI
Sbjct: 403  RMEVSSSLRRKKTLSWLLRGYIKGGHISDAAETVIKMLDLGLHPEYLDRAAVLQELRKRI 462

Query: 1735 QQQGNVDTYLTLCKRLSDANLIGPSLVYLHMRKHKLWIIKML 1860
            QQ GN++TY+ LCKRL DA+LIGP L+YL+++K+KLW+IKML
Sbjct: 463  QQPGNIETYVNLCKRLYDASLIGPCLIYLYIKKYKLWVIKML 504


>ref|XP_003551233.1| PREDICTED: pentatricopeptide repeat-containing protein At2g30100,
            chloroplastic-like [Glycine max]
          Length = 508

 Score =  663 bits (1711), Expect = 0.0
 Identities = 337/510 (66%), Positives = 407/510 (79%), Gaps = 11/510 (2%)
 Frame = +1

Query: 364  MASVGGIAAISNLGLTYSSSS-----KH-YIFLSTHLP-NIKSYARISVG----ESPFSL 510
            MAS  G+A I  LG  +SS S     +H  +F ++H   ++K Y  +S      ++P  +
Sbjct: 1    MASAHGLAPIFKLGFVFSSVSPSQRKRHPLMFPASHCGFSLKFYGGLSARSCKFKNPSFV 60

Query: 511  QKKRSKVQGFGMLKSVQLDVFITSDDEDEMSEGFFAAIEELERMAREPSDVLEEMNDKLS 690
              K   ++GF  LKSV++D ++TS+DE  MS+GFF AIEELERM REPSDVLEEMND+LS
Sbjct: 61   SAKHGSLRGFRALKSVEMDQYVTSNDE--MSDGFFEAIEELERMTREPSDVLEEMNDRLS 118

Query: 691  ARELQLVLVYFSQEGRDSWCALEVFEWLKKENRVDKETMELMVSIMCTWVKKLIEGKNKX 870
            ARELQLVLVYFSQ+GRDSWCALEVF+WL+KENRVDKETMELMV+IMC WVKKLI+ ++  
Sbjct: 119  ARELQLVLVYFSQDGRDSWCALEVFDWLRKENRVDKETMELMVAIMCGWVKKLIQQQHGV 178

Query: 871  XXXXXXXXXXXXXXXKTSFSMIEKVISLYWEAGEKEGTILFVKEVLRRGISVLDGDRDGN 1050
                           +  FSMIEKVISLYWE GEKEG +LFV+EVLRRGI  ++ D +G+
Sbjct: 179  GDVVDLLVDMDCVGLRPGFSMIEKVISLYWEMGEKEGAVLFVEEVLRRGIPYVEEDEEGH 238

Query: 1051 KGGPAGYLAWKMMEEGNYRDAAKLVIHLRECGLKPEVYSYLIAMTAVVKELNEFAKALRK 1230
            KGGP GYLAWKMM EG+YR+A +LVI  RE GLKPE+YSYL+AMTAVVKELNEFAKALRK
Sbjct: 239  KGGPTGYLAWKMMAEGDYRNAVRLVIRFRESGLKPEIYSYLVAMTAVVKELNEFAKALRK 298

Query: 1231 LKGFTKAGLVAELDMDNVGLIENYQQDLLTYGLHLSNWVIEEGGPSLLGVVHERLLAMYV 1410
            LKGFT+AGLVAELD+++V L E YQ D L  G+ LSNWVI++G PSL G+VHERLLAMY+
Sbjct: 299  LKGFTRAGLVAELDLEDVELTEKYQSDTLADGVRLSNWVIQDGSPSLHGIVHERLLAMYI 358

Query: 1411 CAGRGSAAERQLWEMKLVGKEADGDLYDIVLAICASQKESSSVGRLLTRMDVASPFRRKK 1590
            CAG G  AERQLWEMKLVGKEADGDLYDIVLAICASQKES++  RLLTR++V S  ++KK
Sbjct: 359  CAGHGIEAERQLWEMKLVGKEADGDLYDIVLAICASQKESNATARLLTRLEVVSSPQKKK 418

Query: 1591 TLSWLLRGYIKGGHFKNAAETVIKMLDSGLYPEFLDRVAVVQGLSRRIQQQGNVDTYLTL 1770
            +LSWLLRGYIKGGHF  AAET++KML+ G YPE+LDR AV+QGL +RIQQ GN+DTY+ L
Sbjct: 419  SLSWLLRGYIKGGHFNEAAETIMKMLELGFYPEYLDRAAVLQGLRKRIQQYGNLDTYVRL 478

Query: 1771 CKRLSDANLIGPSLVYLHMRKHKLWIIKML 1860
            CK LSDANLIGP LV+L++RK+KLW++KML
Sbjct: 479  CKSLSDANLIGPCLVHLYIRKYKLWVVKML 508


>gb|ESW18802.1| hypothetical protein PHAVU_006G071400g [Phaseolus vulgaris]
          Length = 510

 Score =  660 bits (1704), Expect = 0.0
 Identities = 340/510 (66%), Positives = 400/510 (78%), Gaps = 11/510 (2%)
 Frame = +1

Query: 364  MASVGGIAAISNLGLTYSSSSKHY-----IFLSTHLP-NIKSY----ARISVGESPFSLQ 513
            MA   G A    LG  +SS S        +F + H   ++K Y    AR    ++P  + 
Sbjct: 1    MAYAQGFAPNFKLGFVFSSGSPSQQRYPLMFPAAHCGFSLKFYNGVCARSFKFQNPSIVA 60

Query: 514  KKRSKVQGFGMLKSVQLDVFITSDDE-DEMSEGFFAAIEELERMAREPSDVLEEMNDKLS 690
             K   V+GF +LKSV+LD F+TSDDE DEM +GFF AIEELERM REPSD+LEEMND+LS
Sbjct: 61   AKHCSVRGFRVLKSVELDQFVTSDDEEDEMGDGFFEAIEELERMTREPSDILEEMNDRLS 120

Query: 691  ARELQLVLVYFSQEGRDSWCALEVFEWLKKENRVDKETMELMVSIMCTWVKKLIEGKNKX 870
            ARELQLVLVYFSQ+GRDSWCALEVF+WL+KENRVDKETMELMVSIMC WVKKLI+ ++  
Sbjct: 121  ARELQLVLVYFSQDGRDSWCALEVFDWLRKENRVDKETMELMVSIMCGWVKKLIQEQHGV 180

Query: 871  XXXXXXXXXXXXXXXKTSFSMIEKVISLYWEAGEKEGTILFVKEVLRRGISVLDGDRDGN 1050
                           +  FSMIEKVISLYWE GEKEG +LFV+EVLRRGI     D++G+
Sbjct: 181  GDVIDLLVDMDCVGLRPGFSMIEKVISLYWEMGEKEGAVLFVEEVLRRGIPYASEDKEGH 240

Query: 1051 KGGPAGYLAWKMMEEGNYRDAAKLVIHLRECGLKPEVYSYLIAMTAVVKELNEFAKALRK 1230
            KGGP GYLAWKMM EG+YR A +LVI  RE GLKPEVYSYL+AMTAVVKELNEFAKALRK
Sbjct: 241  KGGPTGYLAWKMMAEGDYRSAVRLVIRFRESGLKPEVYSYLVAMTAVVKELNEFAKALRK 300

Query: 1231 LKGFTKAGLVAELDMDNVGLIENYQQDLLTYGLHLSNWVIEEGGPSLLGVVHERLLAMYV 1410
            LK FT+AGLV ELD+++V L E YQ DLL  G+ LSNWVI++G PSL GVVHERLLAMY+
Sbjct: 301  LKSFTRAGLVTELDLEDVELAEKYQTDLLADGVRLSNWVIQDGRPSLYGVVHERLLAMYI 360

Query: 1411 CAGRGSAAERQLWEMKLVGKEADGDLYDIVLAICASQKESSSVGRLLTRMDVASPFRRKK 1590
            CAG G  AERQLWEMKLVGKEADGDLYDIVLAICASQKE ++  RLLTR+++A+  ++KK
Sbjct: 361  CAGHGIEAERQLWEMKLVGKEADGDLYDIVLAICASQKEVNATARLLTRLELANSPQKKK 420

Query: 1591 TLSWLLRGYIKGGHFKNAAETVIKMLDSGLYPEFLDRVAVVQGLSRRIQQQGNVDTYLTL 1770
            +LSWLLRGYIKGGHF  AAETV+KML+ G YPE+LDR AV+QGL +RIQQ GN+DTY+ L
Sbjct: 421  SLSWLLRGYIKGGHFTEAAETVMKMLELGFYPEYLDRAAVLQGLRKRIQQYGNLDTYVRL 480

Query: 1771 CKRLSDANLIGPSLVYLHMRKHKLWIIKML 1860
            CK LSDANLIGP LV+L++RK+KLW++KML
Sbjct: 481  CKSLSDANLIGPCLVHLYIRKYKLWVVKML 510


>ref|XP_006425116.1| hypothetical protein CICLE_v10028251mg [Citrus clementina]
            gi|557527050|gb|ESR38356.1| hypothetical protein
            CICLE_v10028251mg [Citrus clementina]
          Length = 502

 Score =  658 bits (1697), Expect = 0.0
 Identities = 332/461 (72%), Positives = 374/461 (81%)
 Frame = +1

Query: 478  RISVGESPFSLQKKRSKVQGFGMLKSVQLDVFITSDDEDEMSEGFFAAIEELERMAREPS 657
            RI   ++P  +  K SK++ F  LKSV+LD F+TSDDEDEMSE FF AIEELERM REPS
Sbjct: 42   RIKNYQNPSFIATKVSKIREFRFLKSVELDQFVTSDDEDEMSEEFFEAIEELERMTREPS 101

Query: 658  DVLEEMNDKLSARELQLVLVYFSQEGRDSWCALEVFEWLKKENRVDKETMELMVSIMCTW 837
            D+LEEMND+LSARELQLVLVYFSQEGRDSWCALEVFEWLKKENRVD ETMELMVSIMC+W
Sbjct: 102  DILEEMNDRLSARELQLVLVYFSQEGRDSWCALEVFEWLKKENRVDNETMELMVSIMCSW 161

Query: 838  VKKLIEGKNKXXXXXXXXXXXXXXXXKTSFSMIEKVISLYWEAGEKEGTILFVKEVLRRG 1017
            VKK IE +                  K  FSMIEKVISLYWE  +KE  +LFVK VL RG
Sbjct: 162  VKKYIEEERDVGDVIDLLVDMDCVGLKPGFSMIEKVISLYWEMEKKERAVLFVKAVLSRG 221

Query: 1018 ISVLDGDRDGNKGGPAGYLAWKMMEEGNYRDAAKLVIHLRECGLKPEVYSYLIAMTAVVK 1197
            I+  +GD +G KGGP GYLAWKMM EG Y DA KLVIHLRE GLKPEVYSYLIA+TAVVK
Sbjct: 222  IAYAEGDGEGQKGGPTGYLAWKMMVEGKYVDAIKLVIHLRESGLKPEVYSYLIALTAVVK 281

Query: 1198 ELNEFAKALRKLKGFTKAGLVAELDMDNVGLIENYQQDLLTYGLHLSNWVIEEGGPSLLG 1377
            ELNEF KALRKLKG+ +AG +AELD  N+GLIE YQ DLL  G  LS+W I+EGG SL G
Sbjct: 282  ELNEFGKALRKLKGYVRAGSIAELDGKNLGLIEKYQSDLLADGSRLSSWAIQEGGSSLYG 341

Query: 1378 VVHERLLAMYVCAGRGSAAERQLWEMKLVGKEADGDLYDIVLAICASQKESSSVGRLLTR 1557
            VVHERLLAMY+CAGRG  AERQLWEMKLVGKEADGDLYDIVLAICASQ E S+V RLL+R
Sbjct: 342  VVHERLLAMYICAGRGLEAERQLWEMKLVGKEADGDLYDIVLAICASQNEGSAVSRLLSR 401

Query: 1558 MDVASPFRRKKTLSWLLRGYIKGGHFKNAAETVIKMLDSGLYPEFLDRVAVVQGLSRRIQ 1737
            ++V +   +KKTLSWLLRGYIKGGH  +AAET+ KMLD GLYPE++DRVAV+QGL +RIQ
Sbjct: 402  IEVMNSLCKKKTLSWLLRGYIKGGHINDAAETLTKMLDLGLYPEYMDRVAVLQGLRKRIQ 461

Query: 1738 QQGNVDTYLTLCKRLSDANLIGPSLVYLHMRKHKLWIIKML 1860
            Q GNV+ YL LCKRLSD +LIGP LVYL+++K+KLWIIKML
Sbjct: 462  QSGNVEAYLNLCKRLSDTSLIGPCLVYLYIKKYKLWIIKML 502


>ref|XP_006488563.1| PREDICTED: pentatricopeptide repeat-containing protein At2g30100,
            chloroplastic-like [Citrus sinensis]
          Length = 502

 Score =  655 bits (1691), Expect = 0.0
 Identities = 331/461 (71%), Positives = 374/461 (81%)
 Frame = +1

Query: 478  RISVGESPFSLQKKRSKVQGFGMLKSVQLDVFITSDDEDEMSEGFFAAIEELERMAREPS 657
            RI   ++P  +  K SK++ F  LKSV+LD F+TSDDEDEMSE FF AIEELERM REPS
Sbjct: 42   RIKNYQNPNFIATKVSKIREFRFLKSVELDQFVTSDDEDEMSEEFFEAIEELERMTREPS 101

Query: 658  DVLEEMNDKLSARELQLVLVYFSQEGRDSWCALEVFEWLKKENRVDKETMELMVSIMCTW 837
            D+LEEMND+LSARELQLVLVYFSQEGRDSWCALEVFEWLKKENRVD ETMELMVSIMC+W
Sbjct: 102  DILEEMNDRLSARELQLVLVYFSQEGRDSWCALEVFEWLKKENRVDNETMELMVSIMCSW 161

Query: 838  VKKLIEGKNKXXXXXXXXXXXXXXXXKTSFSMIEKVISLYWEAGEKEGTILFVKEVLRRG 1017
            VKK IE +                  K  FSMIEKVISLYWE  +KE  +LFVK VL RG
Sbjct: 162  VKKYIEEERGVGDVVDLLVDMDCVGLKPGFSMIEKVISLYWEMEKKERAVLFVKAVLSRG 221

Query: 1018 ISVLDGDRDGNKGGPAGYLAWKMMEEGNYRDAAKLVIHLRECGLKPEVYSYLIAMTAVVK 1197
            I+  +GD +G +GGP GYLAWKMM EG Y DA KLVIHLRE GLKPEVYSYLIA+TAVVK
Sbjct: 222  IAYAEGDGEGQQGGPTGYLAWKMMVEGKYVDAIKLVIHLRESGLKPEVYSYLIALTAVVK 281

Query: 1198 ELNEFAKALRKLKGFTKAGLVAELDMDNVGLIENYQQDLLTYGLHLSNWVIEEGGPSLLG 1377
            ELNEF KALRKLKG+ +AG +AELD  N+GLIE YQ DLL  G  LS+W I+EGG SL G
Sbjct: 282  ELNEFGKALRKLKGYVRAGSIAELDGKNLGLIEKYQSDLLADGSRLSSWAIQEGGSSLYG 341

Query: 1378 VVHERLLAMYVCAGRGSAAERQLWEMKLVGKEADGDLYDIVLAICASQKESSSVGRLLTR 1557
            VVHERLLAMY+CAGRG  AERQLWEMKLVGKEADGDLYDIVLAICASQ E S+V RLL+R
Sbjct: 342  VVHERLLAMYICAGRGLEAERQLWEMKLVGKEADGDLYDIVLAICASQNEGSAVSRLLSR 401

Query: 1558 MDVASPFRRKKTLSWLLRGYIKGGHFKNAAETVIKMLDSGLYPEFLDRVAVVQGLSRRIQ 1737
            ++V +   +KKTLSWLLRGYIKGGH  +AAET+ KMLD GLYPE++DRVAV+QGL +RIQ
Sbjct: 402  IEVMNSLCKKKTLSWLLRGYIKGGHINDAAETLTKMLDLGLYPEYMDRVAVLQGLRKRIQ 461

Query: 1738 QQGNVDTYLTLCKRLSDANLIGPSLVYLHMRKHKLWIIKML 1860
            Q GNV+ YL LCKRLSD +LIGP LVYL+++K+KLWIIKML
Sbjct: 462  QSGNVEAYLNLCKRLSDTSLIGPCLVYLYIKKYKLWIIKML 502


>ref|XP_003538312.1| PREDICTED: pentatricopeptide repeat-containing protein At2g30100,
            chloroplastic-like [Glycine max]
          Length = 510

 Score =  652 bits (1683), Expect = 0.0
 Identities = 339/512 (66%), Positives = 401/512 (78%), Gaps = 13/512 (2%)
 Frame = +1

Query: 364  MASVGGIAAISNLGLTYSSSS----KH-YIFLSTHLP-NIKSY-----ARISVGESPFSL 510
            MA   G A I  LG  +SS S    +H  +F ++H   ++K Y     AR    ++P  +
Sbjct: 1    MAYAHGFAPIFKLGFVFSSVSPSQKRHPLVFPASHCGYSLKFYDGVLSARSCKFKNPSFV 60

Query: 511  QKKRSKVQGFGMLKSVQLDVFITSDDE-DEMSEGFFAAIEELERMAREPSDVLEEMNDKL 687
              K+  ++GF  LKSV+LD ++TSDDE DEMS+GFF AIEELERM REPSDVLEEMND+L
Sbjct: 61   --KQGSIRGFRALKSVELDQYVTSDDEEDEMSDGFFEAIEELERMTREPSDVLEEMNDRL 118

Query: 688  SARELQLVLVYFSQEGRDSWCALEVFEWLKKENRVDKETMELMVSIMCTWVKKLI-EGKN 864
            SARELQLVLVYFSQ+GRDSWCALEVF+WL+KENRVDKETMELMV+IMC WVKKLI E   
Sbjct: 119  SARELQLVLVYFSQDGRDSWCALEVFDWLRKENRVDKETMELMVAIMCGWVKKLIQEHHG 178

Query: 865  KXXXXXXXXXXXXXXXXKTSFSMIEKVISLYWEAGEKEGTILFVKEVLRRGISVLDGDRD 1044
                             +  FSMIEKVISLYWE GEKEG +LFV+EVLRRGI  L+ D +
Sbjct: 179  VVGDVVDLLVDMDCVGLRPGFSMIEKVISLYWEMGEKEGAVLFVEEVLRRGIPYLEEDEE 238

Query: 1045 GNKGGPAGYLAWKMMEEGNYRDAAKLVIHLRECGLKPEVYSYLIAMTAVVKELNEFAKAL 1224
            G+KGGP GYLAWKMM EG+Y  A +LVIH  E GLKPEVYSYL+AMTAVVKELNE AKAL
Sbjct: 239  GHKGGPTGYLAWKMMAEGDYTSAVRLVIHFTESGLKPEVYSYLVAMTAVVKELNELAKAL 298

Query: 1225 RKLKGFTKAGLVAELDMDNVGLIENYQQDLLTYGLHLSNWVIEEGGPSLLGVVHERLLAM 1404
            RKLK F + GLVAELD+++V L E YQ DLL  G+ LSNW I++G PSL G++HERLLAM
Sbjct: 299  RKLKSFARTGLVAELDLEDVELTEKYQSDLLGDGVRLSNWAIQDGSPSLHGIIHERLLAM 358

Query: 1405 YVCAGRGSAAERQLWEMKLVGKEADGDLYDIVLAICASQKESSSVGRLLTRMDVASPFRR 1584
            Y+CAG G  AE+QLWEMKLVGKEADGDLYDIVLAICASQKES++  RLLTR++VAS  ++
Sbjct: 359  YICAGHGIEAEKQLWEMKLVGKEADGDLYDIVLAICASQKESNATARLLTRLEVASSPQK 418

Query: 1585 KKTLSWLLRGYIKGGHFKNAAETVIKMLDSGLYPEFLDRVAVVQGLSRRIQQQGNVDTYL 1764
            KK+LSWLLRGYIKGGHF  AAET++KMLD G YPE+LDR AV+QGL +RIQQ GN+DTY+
Sbjct: 419  KKSLSWLLRGYIKGGHFNEAAETIMKMLDLGFYPEYLDRAAVLQGLRKRIQQYGNLDTYV 478

Query: 1765 TLCKRLSDANLIGPSLVYLHMRKHKLWIIKML 1860
             LCK LSDANLIGP LV+L++RK+KLW++KML
Sbjct: 479  RLCKSLSDANLIGPCLVHLYIRKYKLWVVKML 510


>ref|XP_004296059.1| PREDICTED: uncharacterized protein LOC101292395 [Fragaria vesca
            subsp. vesca]
          Length = 1304

 Score =  651 bits (1679), Expect = 0.0
 Identities = 327/455 (71%), Positives = 372/455 (81%), Gaps = 1/455 (0%)
 Frame = +1

Query: 493  ESPFSLQKKRSKVQGFGMLKSVQLDVFITSDDEDEMSEGFFAAIEELERMAREPSDVLEE 672
            ++P  +  K  KV+ F +  SVQLD F+TSDDEDEM E FF AIEELERM REPSDVLEE
Sbjct: 38   KNPSFVVAKSGKVRDFRLFNSVQLDQFVTSDDEDEMGESFFEAIEELERMRREPSDVLEE 97

Query: 673  MNDKLSARELQLVLVYFSQEGRDSWCALEVFEWLKKENRVDKETMELMVSIMCTWVKKLI 852
            MND+LSARELQLVLVYFSQEGRDSWCALEVFEWL++ENRVDKETMELMVSIMC W+K+LI
Sbjct: 98   MNDRLSARELQLVLVYFSQEGRDSWCALEVFEWLRRENRVDKETMELMVSIMCGWLKRLI 157

Query: 853  EGKNKXXXXXXXXXXXXXXXXKTSFSMIEKVISLYWEAGEKEGTILFVKEVLRRGISVLD 1032
            E  N                 K SFSM+EKVISLYWE GEKE  +LFVKEVL+RGI   +
Sbjct: 158  EEGNDVADVIDLLVDVDCVGLKPSFSMMEKVISLYWEMGEKENAVLFVKEVLKRGIVYSE 217

Query: 1033 -GDRDGNKGGPAGYLAWKMMEEGNYRDAAKLVIHLRECGLKPEVYSYLIAMTAVVKELNE 1209
              DRDG+KGGP GYLAWKM  +GNYRD+ K VI LRE GLKPEVYSYLIAMTAVVKELNE
Sbjct: 218  EDDRDGHKGGPTGYLAWKMTVDGNYRDSVKFVIQLRESGLKPEVYSYLIAMTAVVKELNE 277

Query: 1210 FAKALRKLKGFTKAGLVAELDMDNVGLIENYQQDLLTYGLHLSNWVIEEGGPSLLGVVHE 1389
              KALRKLK FT+AGLVAE D ++VGLIE YQ DLL  G+ LSNWVI+EG  +L GVVHE
Sbjct: 278  LGKALRKLKAFTRAGLVAEFDSEDVGLIEKYQSDLLADGVQLSNWVIQEGSSTLCGVVHE 337

Query: 1390 RLLAMYVCAGRGSAAERQLWEMKLVGKEADGDLYDIVLAICASQKESSSVGRLLTRMDVA 1569
            RLLAMY+C+GRG  AERQLWEMKLVGKE DGDLYDIVLAICAS+KE+S++ RLLTR +V+
Sbjct: 338  RLLAMYICSGRGLEAERQLWEMKLVGKEPDGDLYDIVLAICASRKETSAIARLLTRTEVS 397

Query: 1570 SPFRRKKTLSWLLRGYIKGGHFKNAAETVIKMLDSGLYPEFLDRVAVVQGLSRRIQQQGN 1749
            S   +KK+LSWLLRGYIKGGHF +AAETVIKMLD GL+P++LDR AV+ GL +RIQQ G 
Sbjct: 398  SSLSKKKSLSWLLRGYIKGGHFNDAAETVIKMLDLGLFPDYLDRAAVLHGLRKRIQQSGT 457

Query: 1750 VDTYLTLCKRLSDANLIGPSLVYLHMRKHKLWIIK 1854
            VDTYL LCKRLSDANLI   L+YL+++KHKLWII+
Sbjct: 458  VDTYLKLCKRLSDANLIESCLLYLYIKKHKLWIIR 492


>gb|EXB37964.1| hypothetical protein L484_011688 [Morus notabilis]
          Length = 516

 Score =  649 bits (1674), Expect = 0.0
 Identities = 339/518 (65%), Positives = 396/518 (76%), Gaps = 19/518 (3%)
 Frame = +1

Query: 364  MASVGGIAAISNLGLTYSSSS----------KHYIFLSTHLPNIKSYARISVG------- 492
            MAS  G   ++ LG   SSSS          ++ IFL     N+  + R S         
Sbjct: 1    MASAQGFTPLTELGFPSSSSSSSSSSSNSLHRNRIFLCRMDENL--WGRTSAKFCPVICC 58

Query: 493  --ESPFSLQKKRSKVQGFGMLKSVQLDVFITSDDEDEMSEGFFAAIEELERMAREPSDVL 666
              ++P  +  K SK++ F +  SV+LD F+TSDDE+EM EGFF AIEELERM REPSDVL
Sbjct: 59   KQQNPNFIAPKPSKLREFRLFTSVELDQFLTSDDEEEMGEGFFEAIEELERMTREPSDVL 118

Query: 667  EEMNDKLSARELQLVLVYFSQEGRDSWCALEVFEWLKKENRVDKETMELMVSIMCTWVKK 846
            EEMND+LSARELQLVLVYFSQEGRDSWCALEVFEWL+KENRVDKETMELMV++MC+WVKK
Sbjct: 119  EEMNDRLSARELQLVLVYFSQEGRDSWCALEVFEWLRKENRVDKETMELMVTLMCSWVKK 178

Query: 847  LIEGKNKXXXXXXXXXXXXXXXXKTSFSMIEKVISLYWEAGEKEGTILFVKEVLRRGISV 1026
            LIEG++                 +  FSM+E VI LYWE GEK   + FVKEVLRRGI+ 
Sbjct: 179  LIEGEHDVGDVVDLLVDMACVGLRPGFSMMENVILLYWEMGEKGRAVSFVKEVLRRGIAC 238

Query: 1027 LDGDRDGNKGGPAGYLAWKMMEEGNYRDAAKLVIHLRECGLKPEVYSYLIAMTAVVKELN 1206
            L+ D +G KGGP GYLAWKMM EGNY +A KLV+ +RE GLKPEVYSYLIAMTAVVKELN
Sbjct: 239  LEDDGEGPKGGPTGYLAWKMMVEGNYMEAVKLVVDIRESGLKPEVYSYLIAMTAVVKELN 298

Query: 1207 EFAKALRKLKGFTKAGLVAELDMDNVGLIENYQQDLLTYGLHLSNWVIEEGGPSLLGVVH 1386
            EFAKALRKLKGF +AGL AELD ++V LIE YQ DLL  G+ LSNWVIEEG  SL GVVH
Sbjct: 299  EFAKALRKLKGFERAGLTAELDEESVELIEKYQSDLLDDGVRLSNWVIEEGITSLNGVVH 358

Query: 1387 ERLLAMYVCAGRGSAAERQLWEMKLVGKEADGDLYDIVLAICASQKESSSVGRLLTRMDV 1566
            ERLLAMY+CAGRG  AERQLW+MKLVGKEADGDLYDIVLAICASQKE  ++ RLLTR++ 
Sbjct: 359  ERLLAMYICAGRGIEAERQLWKMKLVGKEADGDLYDIVLAICASQKEGRAIARLLTRVNF 418

Query: 1567 ASPFRRKKTLSWLLRGYIKGGHFKNAAETVIKMLDSGLYPEFLDRVAVVQGLSRRIQQQG 1746
            +S  R++K+LSWLLRGYIKGGHF NAAETV+KMLD GL PE+LDR AV+QGL +RI+   
Sbjct: 419  SSTLRKRKSLSWLLRGYIKGGHFDNAAETVVKMLDLGLCPEYLDRAAVLQGLRKRIKGPD 478

Query: 1747 NVDTYLTLCKRLSDANLIGPSLVYLHMRKHKLWIIKML 1860
             V+TYL LCK LSD NLIGP L+YL+++K+KLWI+KML
Sbjct: 479  TVETYLKLCKHLSDYNLIGPCLIYLYIKKYKLWIMKML 516


>ref|XP_004168796.1| PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing
            protein At2g30100, chloroplastic-like [Cucumis sativus]
          Length = 501

 Score =  643 bits (1659), Expect = 0.0
 Identities = 325/502 (64%), Positives = 392/502 (78%), Gaps = 3/502 (0%)
 Frame = +1

Query: 364  MASVGGIAAISNLGLTYSSSS---KHYIFLSTHLPNIKSYARISVGESPFSLQKKRSKVQ 534
            M    G   ++  G ++S SS         ST    + S    +  +S FS+ +  +K +
Sbjct: 1    MICAQGFTPLTQFGFSFSLSSPLESQRCGFSTPRLYMVSPISCNYQDSTFSVSRA-AKFR 59

Query: 535  GFGMLKSVQLDVFITSDDEDEMSEGFFAAIEELERMAREPSDVLEEMNDKLSARELQLVL 714
               + KSV+LD FITSDDEDEM +GFF AIEELERM REPSDVLEEMND+LSARE+QLVL
Sbjct: 60   DLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMTREPSDVLEEMNDRLSAREIQLVL 119

Query: 715  VYFSQEGRDSWCALEVFEWLKKENRVDKETMELMVSIMCTWVKKLIEGKNKXXXXXXXXX 894
            VYFSQEGRDSWCALEVFEWL+KENRVDKETMELMVSIMC+W+KKL+EG++          
Sbjct: 120  VYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGRHNVGDVVDLLV 179

Query: 895  XXXXXXXKTSFSMIEKVISLYWEAGEKEGTILFVKEVLRRGISVLDGDRDGNKGGPAGYL 1074
                   K  FSMIEKVISLYWE GEKE  + FVKEVL R ++ +  D +G+KGGP+GYL
Sbjct: 180  DMDCVGLKPHFSMIEKVISLYWEMGEKEKAVFFVKEVLGRNLAFMKDDWEGHKGGPSGYL 239

Query: 1075 AWKMMEEGNYRDAAKLVIHLRECGLKPEVYSYLIAMTAVVKELNEFAKALRKLKGFTKAG 1254
            AWKMM +G+YR A K+V+HLRE GL+PEVYSYLIAMTAVVKELNEFAKALRKLKG+ + G
Sbjct: 240  AWKMMVDGDYRGAVKMVLHLRESGLRPEVYSYLIAMTAVVKELNEFAKALRKLKGYARDG 299

Query: 1255 LVAELDMDNVGLIENYQQDLLTYGLHLSNWVIEEGGPSLLGVVHERLLAMYVCAGRGSAA 1434
             VAELD +NV L+  YQ +LL  G+ LSNWV+EEG  S+ GVVHERLLAMY+CAG+G  A
Sbjct: 300  FVAELDKNNVELVAKYQTELLADGVQLSNWVLEEGSSSIRGVVHERLLAMYICAGQGVEA 359

Query: 1435 ERQLWEMKLVGKEADGDLYDIVLAICASQKESSSVGRLLTRMDVASPFRRKKTLSWLLRG 1614
            ERQLWEMKLVGKEAD DLYDIVLAICASQKE+ ++ RLLTR+++ SP  +KK+L+WLLRG
Sbjct: 360  ERQLWEMKLVGKEADADLYDIVLAICASQKETKAMKRLLTRIEITSPMIKKKSLTWLLRG 419

Query: 1615 YIKGGHFKNAAETVIKMLDSGLYPEFLDRVAVVQGLSRRIQQQGNVDTYLTLCKRLSDAN 1794
            YIKGGHF++AA T++KM++ G  PE+LDRVAV+QGL + I++  +V TYL LCK LSDAN
Sbjct: 420  YIKGGHFRDAAGTLVKMINLGFLPEYLDRVAVLQGLXKEIREPESVHTYLDLCKCLSDAN 479

Query: 1795 LIGPSLVYLHMRKHKLWIIKML 1860
            LIGPSLVYLH++KHKLWIIKML
Sbjct: 480  LIGPSLVYLHLQKHKLWIIKML 501


>ref|XP_004239038.1| PREDICTED: pentatricopeptide repeat-containing protein At2g30100,
            chloroplastic-like [Solanum lycopersicum]
          Length = 503

 Score =  634 bits (1636), Expect = e-179
 Identities = 316/444 (71%), Positives = 367/444 (82%), Gaps = 2/444 (0%)
 Frame = +1

Query: 535  GFGMLKSVQLDVFITSDDED--EMSEGFFAAIEELERMAREPSDVLEEMNDKLSARELQL 708
            GF +  SV+L  F+TSD E+  EMS+ FF AIEELERM REPSDVLEEMN++LS RELQL
Sbjct: 60   GFKLFSSVELGSFVTSDGEEKNEMSDCFFEAIEELERMTREPSDVLEEMNERLSDRELQL 119

Query: 709  VLVYFSQEGRDSWCALEVFEWLKKENRVDKETMELMVSIMCTWVKKLIEGKNKXXXXXXX 888
            VLVYF+QEGRDSWCALEVFEWL+KENRVDKETMELMVSIMC WV+KLI  K++       
Sbjct: 120  VLVYFAQEGRDSWCALEVFEWLRKENRVDKETMELMVSIMCGWVQKLIGSKSEAGDVVDL 179

Query: 889  XXXXXXXXXKTSFSMIEKVISLYWEAGEKEGTILFVKEVLRRGISVLDGDRDGNKGGPAG 1068
                       SFSM+EKVISLYW+AGE+EG + FVKEVLRR I+  DG+ DG+K GPAG
Sbjct: 180  LVDMDCVGLNPSFSMVEKVISLYWDAGEREGAVSFVKEVLRRQIAYSDGNVDGHKAGPAG 239

Query: 1069 YLAWKMMEEGNYRDAAKLVIHLRECGLKPEVYSYLIAMTAVVKELNEFAKALRKLKGFTK 1248
            YLAWKMMEEGNY+DA KLVI +R+ GLKPE+YSYLIAMTAVVKELNEF KALRKLKGF +
Sbjct: 240  YLAWKMMEEGNYKDAVKLVIDIRDSGLKPELYSYLIAMTAVVKELNEFGKALRKLKGFAR 299

Query: 1249 AGLVAELDMDNVGLIENYQQDLLTYGLHLSNWVIEEGGPSLLGVVHERLLAMYVCAGRGS 1428
             GLVAELD++N+ LIE YQ DLL  G+ LS+W+I+EGGPSL GVVHERLLAMYVCAGRG 
Sbjct: 300  TGLVAELDLENLRLIEEYQADLLAEGVQLSDWLIQEGGPSLFGVVHERLLAMYVCAGRGI 359

Query: 1429 AAERQLWEMKLVGKEADGDLYDIVLAICASQKESSSVGRLLTRMDVASPFRRKKTLSWLL 1608
             AER LW+MK+ GKE  GDL+DIVLAICASQKE   + RLLT M+ +S  ++KKTLSWLL
Sbjct: 360  EAERHLWQMKISGKEVSGDLHDIVLAICASQKELGPISRLLTGMEASSSLQKKKTLSWLL 419

Query: 1609 RGYIKGGHFKNAAETVIKMLDSGLYPEFLDRVAVVQGLSRRIQQQGNVDTYLTLCKRLSD 1788
            RGYIKGGH +NAAETVIKMLD GLYP+FLDR AV+Q L RRIQQ GN++TYL LCK LSD
Sbjct: 420  RGYIKGGHLENAAETVIKMLDLGLYPDFLDRAAVLQRLRRRIQQSGNLETYLNLCKHLSD 479

Query: 1789 ANLIGPSLVYLHMRKHKLWIIKML 1860
            A+LIGP LVYL+++K++LWII+ L
Sbjct: 480  ASLIGPCLVYLYIKKYRLWIIRTL 503


>ref|XP_006348674.1| PREDICTED: pentatricopeptide repeat-containing protein At2g30100,
            chloroplastic-like [Solanum tuberosum]
          Length = 503

 Score =  633 bits (1633), Expect = e-179
 Identities = 330/503 (65%), Positives = 389/503 (77%), Gaps = 4/503 (0%)
 Frame = +1

Query: 364  MASVGGIAAISNLGLTYSSSSKHYIF--LSTHLPNIKSYARISVGESPFSLQKKRSKVQG 537
            MA+V  IA+++ LGL+     K        T L    S+    VG S  +      +  G
Sbjct: 1    MATVNEIASLTYLGLSKVVFPKRCRLGIPQTWLKWRSSWVLGGVGCSSRNPSFVNPRRNG 60

Query: 538  FGMLKSVQLDVFITSDDED--EMSEGFFAAIEELERMAREPSDVLEEMNDKLSARELQLV 711
            F +  SV+L  F+TSDDE+  EMS+ FF AIEELERM REPSDVLEEMN++LS RELQLV
Sbjct: 61   FKLFNSVELGSFVTSDDEEKNEMSDCFFEAIEELERMTREPSDVLEEMNERLSDRELQLV 120

Query: 712  LVYFSQEGRDSWCALEVFEWLKKENRVDKETMELMVSIMCTWVKKLIEGKNKXXXXXXXX 891
            LVYF+QEGRDSWCALEVFEWL+KENRVDKETMELMVSIMC WV+KLI  K++        
Sbjct: 121  LVYFAQEGRDSWCALEVFEWLRKENRVDKETMELMVSIMCGWVQKLIGSKSEAGDVVDLL 180

Query: 892  XXXXXXXXKTSFSMIEKVISLYWEAGEKEGTILFVKEVLRRGISVLDGDRDGNKGGPAGY 1071
                      SFSM+EKVISLYW+AGE+EG + FVKEVLRR I+  DG+ DG+K GPAGY
Sbjct: 181  VDMDCVGLNPSFSMVEKVISLYWDAGEREGAVSFVKEVLRRQIAYSDGNVDGHKAGPAGY 240

Query: 1072 LAWKMMEEGNYRDAAKLVIHLRECGLKPEVYSYLIAMTAVVKELNEFAKALRKLKGFTKA 1251
            LAWKMME GNY+DA KLVI +R+ GLKPE+YSYLIAMTAVVKELNEF KALRKLKGF + 
Sbjct: 241  LAWKMMEVGNYKDAVKLVIDIRDSGLKPELYSYLIAMTAVVKELNEFGKALRKLKGFART 300

Query: 1252 GLVAELDMDNVGLIENYQQDLLTYGLHLSNWVIEEGGPSLLGVVHERLLAMYVCAGRGSA 1431
            GLVAELD++N+ LIE YQ DLL  G+ LS+W+I+EGGPSL GVVHERLLAMYVCAGRG  
Sbjct: 301  GLVAELDLENLRLIEEYQADLLAEGVQLSDWLIQEGGPSLFGVVHERLLAMYVCAGRGIE 360

Query: 1432 AERQLWEMKLVGKEADGDLYDIVLAICASQKESSSVGRLLTRMDVASPFRRKKTLSWLLR 1611
            AER LW+MKL GK+  GDL DIVLAICASQKE   + RLLT M+ +S  ++KKTLSWLLR
Sbjct: 361  AERHLWQMKLSGKKVTGDLQDIVLAICASQKELGPISRLLTGMEASSSLQKKKTLSWLLR 420

Query: 1612 GYIKGGHFKNAAETVIKMLDSGLYPEFLDRVAVVQGLSRRIQQQGNVDTYLTLCKRLSDA 1791
            GYIKGGH +NAAETVIKMLD GLYP+FLDR AV+Q L RRIQQ G+++TYL LCK LSDA
Sbjct: 421  GYIKGGHLENAAETVIKMLDLGLYPDFLDRAAVLQRLRRRIQQSGSLETYLNLCKHLSDA 480

Query: 1792 NLIGPSLVYLHMRKHKLWIIKML 1860
            +LIGP LVYL+++K++LWII+ L
Sbjct: 481  SLIGPCLVYLYIKKYRLWIIRTL 503


>ref|XP_004143220.1| PREDICTED: uncharacterized protein LOC101207176 [Cucumis sativus]
          Length = 1290

 Score =  612 bits (1578), Expect = e-172
 Identities = 311/486 (63%), Positives = 376/486 (77%), Gaps = 3/486 (0%)
 Frame = +1

Query: 364  MASVGGIAAISNLGLTYSSSS---KHYIFLSTHLPNIKSYARISVGESPFSLQKKRSKVQ 534
            M    G   ++  G ++S SS         ST    + S    +  +S FS+ +  +K +
Sbjct: 1    MICAQGFTPLTQFGFSFSLSSPLESQRCGFSTPRLYMVSPISCNYQDSTFSVSRA-AKFR 59

Query: 535  GFGMLKSVQLDVFITSDDEDEMSEGFFAAIEELERMAREPSDVLEEMNDKLSARELQLVL 714
               + KSV+LD FITSDDEDEM +GFF AIEELERM REPSDVLEEMND+LSARE+QLVL
Sbjct: 60   DLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMTREPSDVLEEMNDRLSAREIQLVL 119

Query: 715  VYFSQEGRDSWCALEVFEWLKKENRVDKETMELMVSIMCTWVKKLIEGKNKXXXXXXXXX 894
            VYFSQEGRDSWCALEVFEWL+KENRVDKETMELMVSIMC+W+KKL+EG++          
Sbjct: 120  VYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGRHNVGDVVDLLV 179

Query: 895  XXXXXXXKTSFSMIEKVISLYWEAGEKEGTILFVKEVLRRGISVLDGDRDGNKGGPAGYL 1074
                   K  FSMIEKVISLYWE GEKE  + FVKEVL R ++ +  D +G+KGGP+GYL
Sbjct: 180  DMDCVGLKPHFSMIEKVISLYWEMGEKEKAVFFVKEVLGRNLAFMKDDWEGHKGGPSGYL 239

Query: 1075 AWKMMEEGNYRDAAKLVIHLRECGLKPEVYSYLIAMTAVVKELNEFAKALRKLKGFTKAG 1254
            AWKMM +G+YR A K+V+HLRE GL+PEVYSYLIAMTAVVKELNEFAKALRKLKG+ + G
Sbjct: 240  AWKMMVDGDYRGAVKMVLHLRESGLRPEVYSYLIAMTAVVKELNEFAKALRKLKGYARDG 299

Query: 1255 LVAELDMDNVGLIENYQQDLLTYGLHLSNWVIEEGGPSLLGVVHERLLAMYVCAGRGSAA 1434
             VAELD +NV L+  YQ +LL  G+ LSNWV+EEG  S+ GVVHERLLAMY+CAG+G  A
Sbjct: 300  FVAELDKNNVELVAKYQTELLADGVQLSNWVLEEGSSSIRGVVHERLLAMYICAGQGVEA 359

Query: 1435 ERQLWEMKLVGKEADGDLYDIVLAICASQKESSSVGRLLTRMDVASPFRRKKTLSWLLRG 1614
            ERQLWEMKLVGKEAD DLYDIVLAICASQKE+ ++ RLLTR+++ SP  +KK+L+WLLRG
Sbjct: 360  ERQLWEMKLVGKEADADLYDIVLAICASQKETKAMKRLLTRIEITSPMIKKKSLTWLLRG 419

Query: 1615 YIKGGHFKNAAETVIKMLDSGLYPEFLDRVAVVQGLSRRIQQQGNVDTYLTLCKRLSDAN 1794
            YIKGGHF++AA T++KM++ G  PE+LDRVAV+QGL + I++  +V TYL LCK LSDAN
Sbjct: 420  YIKGGHFRDAAGTLVKMINLGFLPEYLDRVAVLQGLRKEIREPESVHTYLDLCKCLSDAN 479

Query: 1795 LIGPSL 1812
            LIGPSL
Sbjct: 480  LIGPSL 485


>ref|XP_004500294.1| PREDICTED: pentatricopeptide repeat-containing protein At2g30100,
            chloroplastic-like [Cicer arietinum]
          Length = 508

 Score =  604 bits (1558), Expect = e-170
 Identities = 315/514 (61%), Positives = 379/514 (73%), Gaps = 15/514 (2%)
 Frame = +1

Query: 364  MASVGGIAAISNLGLTYSS----SSKHYIFLSTHLPNIKSYARISVGESPFSLQKKR--- 522
            MAS+ G A    LG  +SS      KH +      P+ K    +   +  F  Q      
Sbjct: 1    MASLHGFAPTLKLGFAFSSLFSPKQKHPLVF----PSSKRGFSLKFCDGSFKFQNPSFPP 56

Query: 523  SKVQGFGMLKSVQLDVFITSDDEDE-------MSEGFFAAIEELERMAREPSDVLEEMND 681
            +K   +   KSV+LD F+TSDDE+E       M +GF  AIEELERM REPSDVLEEMND
Sbjct: 57   TKPNSYMRKKSVELDQFVTSDDEEEEEEEEEEMGDGFLEAIEELERMTREPSDVLEEMND 116

Query: 682  KLSARELQLVLVYFSQEGRDSWCALEVFEWLKKENRVDKETMELMVSIMCTWVKKLIEGK 861
            +LSARELQLVLVYFSQEGRDSWCALEVF+WL+KENRVDKETMELMV+IMC WVKKLI  K
Sbjct: 117  RLSARELQLVLVYFSQEGRDSWCALEVFDWLRKENRVDKETMELMVAIMCGWVKKLIMEK 176

Query: 862  NKXXXXXXXXXXXXXXXXKTSFSMIEKVISLYWEAGEKEGTILFVKEVLRRGISVLDGDR 1041
            +                 +  FSMIEKVISLYWE GEK+  +LFV+EVLRRGIS    + 
Sbjct: 177  HGVDDVIDLLVNMNCVGLRPGFSMIEKVISLYWEMGEKDDAVLFVEEVLRRGIS--SNED 234

Query: 1042 DGNKGGPAGYLAWKMMEEGNYRDAAKLVIHLRECGLKPEVYSYLIAMTAVVKELNEFAKA 1221
            D  KGGP GYLAWKMM EG+YR A +LV   RE GLKP++YSYL+AMTAVVKELNE AKA
Sbjct: 235  DPEKGGPTGYLAWKMMVEGDYRGAVRLVTRFREAGLKPDIYSYLVAMTAVVKELNELAKA 294

Query: 1222 LRKLKGFTKAGLVAELDMDNVGLIENYQQDLLTYGLHLSNWVIEEGGPSLL-GVVHERLL 1398
            LRKLK F++AGL+ E D ++V L E YQ DLL  G  LS WVI++G PS + G++HERLL
Sbjct: 295  LRKLKSFSRAGLITEFDREDVELAEKYQSDLLADGARLSKWVIQDGSPSSIHGIIHERLL 354

Query: 1399 AMYVCAGRGSAAERQLWEMKLVGKEADGDLYDIVLAICASQKESSSVGRLLTRMDVASPF 1578
            AMY+CAGRG  AERQLWEMKL+GKEA G LYD+VLAICASQKE+++  RL+ RM+VAS  
Sbjct: 355  AMYICAGRGIEAERQLWEMKLLGKEAVGGLYDMVLAICASQKEAAATARLMIRMEVASSP 414

Query: 1579 RRKKTLSWLLRGYIKGGHFKNAAETVIKMLDSGLYPEFLDRVAVVQGLSRRIQQQGNVDT 1758
            ++KK+LSWLLRGYIKGGHF  AAETV+KML+ G YP++LDRVAV+QGL +RIQQ GN+DT
Sbjct: 415  QKKKSLSWLLRGYIKGGHFNEAAETVMKMLELGFYPDYLDRVAVMQGLRKRIQQYGNLDT 474

Query: 1759 YLTLCKRLSDANLIGPSLVYLHMRKHKLWIIKML 1860
            Y+ LCK L +ANLIG  + YL++RK+KLW++KM+
Sbjct: 475  YIKLCKSLYEANLIGACVCYLYIRKYKLWVVKMI 508


>gb|EPS70238.1| hypothetical protein M569_04522 [Genlisea aurea]
          Length = 504

 Score =  589 bits (1518), Expect = e-165
 Identities = 317/508 (62%), Positives = 382/508 (75%), Gaps = 9/508 (1%)
 Frame = +1

Query: 364  MASVGGIAAISNLGLTYSSSSKHYIFLSTHLPNIKSYARISVGESP---FSLQKKRSKVQ 534
            MA VGG +AI++L   Y S S      S     I++  R S  +S    F + K+R    
Sbjct: 1    MAVVGGFSAINDLSSRYYSPSPCIFLESRRKLVIRTSIRDSDRKSKPPGFRIGKRRP--- 57

Query: 535  GFGMLKSVQLDVFITSDDEDEMSEGFFAAIEELERMAREPSDVLEEMNDKLSARELQLVL 714
            G   L+SV L   ITSDDEDEMSEGFF AIEELERMAREPSDVLEEMNDKLS RELQLVL
Sbjct: 58   GVWSLESVHLGTIITSDDEDEMSEGFFEAIEELERMAREPSDVLEEMNDKLSNRELQLVL 117

Query: 715  VYFSQEGRDSWCALEVFEWLKKENRVDKETMELMVSIMCTWVKKLIEGKNKXXXXXXXXX 894
            VYFSQEGRDSW  LEVFEWLKKEN+VD+ETMELMVSIMC W+KKLIE KNK         
Sbjct: 118  VYFSQEGRDSWFTLEVFEWLKKENKVDQETMELMVSIMCNWMKKLIEAKNKVQDVVDLLV 177

Query: 895  XXXXXXXKTSFSMIEKVISLYWEAGEKEGTILFVKEVLRRGISVLDGDRDGNKGGPAGYL 1074
                   + +FSMIEKVISLYWEAGEK+ TI FVKEVLRRGIS    D +G+K GP GYL
Sbjct: 178  DMDCVGLEANFSMIEKVISLYWEAGEKQETIAFVKEVLRRGISSCS-DEEGDKTGPVGYL 236

Query: 1075 AWKMMEEGNYRDAAKLVIHLRECGLKPEVYSYLIAMTAVVKELNEFAKALRKLKGFTKAG 1254
            AWKMMEEG+ RDAAKLV+H R+CGLKP++YSYLIAMT +++ELNEFAK++R L  +TK+G
Sbjct: 237  AWKMMEEGSCRDAAKLVLHFRDCGLKPDIYSYLIAMTGILRELNEFAKSMRSLNRYTKSG 296

Query: 1255 LVAELDMDNVGLIENYQQDLLTYGLHLSNWVIEEGG-----PS-LLGVVHERLLAMYVCA 1416
            LV+ELD ++V L E YQQ+LL  GL LS + +EE G     PS L  VV +RLLAM+V A
Sbjct: 297  LVSELDANSVWLAEKYQQELLDDGLLLSAFALEERGGGGDPPSPLREVVRKRLLAMHVVA 356

Query: 1417 GRGSAAERQLWEMKLVGKEADGDLYDIVLAICASQKESSSVGRLLTRMDVASPFRRKKTL 1596
            GRG+ AER L E      + D DL ++VLAICASQKES SV RLLTRM+ +SP  R+KTL
Sbjct: 357  GRGTEAERLLSEKNTGSDDGDDDLCNVVLAICASQKESVSVSRLLTRMEGSSPDSRRKTL 416

Query: 1597 SWLLRGYIKGGHFKNAAETVIKMLDSGLYPEFLDRVAVVQGLSRRIQQQGNVDTYLTLCK 1776
            +WLLRGY+KGGHF+NA ET+++M+++G+ PEF DRVAV+QGL RRI++ G VDTYL +C+
Sbjct: 417  TWLLRGYVKGGHFRNAGETLVRMVEAGVLPEFTDRVAVLQGLGRRIRRSGYVDTYLDVCR 476

Query: 1777 RLSDANLIGPSLVYLHMRKHKLWIIKML 1860
             L+D +LI PSLVY+H+RKHKLWII ++
Sbjct: 477  CLADVDLISPSLVYVHLRKHKLWIISLV 504


>ref|XP_006293981.1| hypothetical protein CARUB_v10022972mg [Capsella rubella]
            gi|482562689|gb|EOA26879.1| hypothetical protein
            CARUB_v10022972mg [Capsella rubella]
          Length = 505

 Score =  561 bits (1447), Expect = e-157
 Identities = 293/511 (57%), Positives = 372/511 (72%), Gaps = 12/511 (2%)
 Frame = +1

Query: 364  MASVGGIAAISNLGLTYSSSSKHYIFLSTHLPNIKSYARISVGESPFSLQKKRSKVQGFG 543
            MA   G A+++ L L +S S        T  P +KS +RIS       L     K +   
Sbjct: 1    MAYARGFASLTQLNLIFSPSISLRRVYRT--PGVKSVSRISCN---LKLNYSAGKFRDLK 55

Query: 544  MLKSVQLDVFITSDDE------DEMSEGFFAAIEELERMAREPSDVLEEMNDKLSARELQ 705
            + +SV+LD FITS++E      DE+ EGFF AIEELERM REPSDVLEEMN +LS+RELQ
Sbjct: 56   LSRSVELDQFITSEEEGGEEAEDEIGEGFFEAIEELERMTREPSDVLEEMNHRLSSRELQ 115

Query: 706  LVLVYFSQEGRDSWCALEVFEWLKKENRVDKETMELMVSIMCTWVKKLIEGKNKXXXXXX 885
            L+LVYF+QEGRDSWC LEVFEWLKKENRVD++ +ELMVSIMC WVKKLI+ +        
Sbjct: 116  LMLVYFAQEGRDSWCTLEVFEWLKKENRVDEQMVELMVSIMCGWVKKLIQEECGADQVFD 175

Query: 886  XXXXXXXXXXKTSFSMIEKVISLYWEAGEKEGTILFVKEVLRR----GISVLDGDRDGNK 1053
                      K  FSM+EKVI+LY E G+KE  +LFVKEVLRR    G SV+ G  +G K
Sbjct: 176  LLIEMDCVGLKPGFSMMEKVIALYCEMGKKESAVLFVKEVLRRRDGFGYSVVGGS-EGRK 234

Query: 1054 GGPAGYLAWKMMEEGNYRDAAKLVIHLRECGLKPEVYSYLIAMTAVVKELNEFAKALRKL 1233
            GGP GYLAWK+M +G+Y+ A  LV+ LR  GL PE YSYLIAMTA+VKELN   K LR+L
Sbjct: 235  GGPVGYLAWKLMVDGDYKKAVDLVVELRLSGLMPEAYSYLIAMTAIVKELNSLGKTLREL 294

Query: 1234 KGFTKAGLVAELDMDNVGLIENYQQDLLTYGLHLSNWVIEEGGP--SLLGVVHERLLAMY 1407
            K FT+AG V E+D  +  LIE YQ + L+ GL L+ W +EEG    S++GVVHERLLAMY
Sbjct: 295  KRFTRAGYVTEIDDHDRVLIEKYQSETLSRGLQLATWAVEEGQQEDSIIGVVHERLLAMY 354

Query: 1408 VCAGRGSAAERQLWEMKLVGKEADGDLYDIVLAICASQKESSSVGRLLTRMDVASPFRRK 1587
            +CAGRG  AE+QLW+MKL G+E + +L+DIV+AICASQKE ++V RLLTR++     R+K
Sbjct: 355  ICAGRGPEAEKQLWKMKLAGREPEAELHDIVMAICASQKEVNAVSRLLTRVEFMESKRKK 414

Query: 1588 KTLSWLLRGYIKGGHFKNAAETVIKMLDSGLYPEFLDRVAVVQGLSRRIQQQGNVDTYLT 1767
            KTLSWLLRGY+KGGHF+ AAET+I M+DSGL+PE++DRVAV+QG++R+IQ+  +++ Y+ 
Sbjct: 415  KTLSWLLRGYVKGGHFEEAAETLITMIDSGLHPEYIDRVAVMQGMTRKIQRPRDIEAYMG 474

Query: 1768 LCKRLSDANLIGPSLVYLHMRKHKLWIIKML 1860
            LCKRL DA L+GP LVY++M K+KLWI+KM+
Sbjct: 475  LCKRLFDAGLVGPCLVYMYMDKYKLWIVKMM 505


>ref|XP_006293980.1| hypothetical protein CARUB_v10022972mg [Capsella rubella]
            gi|482562688|gb|EOA26878.1| hypothetical protein
            CARUB_v10022972mg [Capsella rubella]
          Length = 532

 Score =  561 bits (1447), Expect = e-157
 Identities = 293/511 (57%), Positives = 372/511 (72%), Gaps = 12/511 (2%)
 Frame = +1

Query: 364  MASVGGIAAISNLGLTYSSSSKHYIFLSTHLPNIKSYARISVGESPFSLQKKRSKVQGFG 543
            MA   G A+++ L L +S S        T  P +KS +RIS       L     K +   
Sbjct: 28   MAYARGFASLTQLNLIFSPSISLRRVYRT--PGVKSVSRISCN---LKLNYSAGKFRDLK 82

Query: 544  MLKSVQLDVFITSDDE------DEMSEGFFAAIEELERMAREPSDVLEEMNDKLSARELQ 705
            + +SV+LD FITS++E      DE+ EGFF AIEELERM REPSDVLEEMN +LS+RELQ
Sbjct: 83   LSRSVELDQFITSEEEGGEEAEDEIGEGFFEAIEELERMTREPSDVLEEMNHRLSSRELQ 142

Query: 706  LVLVYFSQEGRDSWCALEVFEWLKKENRVDKETMELMVSIMCTWVKKLIEGKNKXXXXXX 885
            L+LVYF+QEGRDSWC LEVFEWLKKENRVD++ +ELMVSIMC WVKKLI+ +        
Sbjct: 143  LMLVYFAQEGRDSWCTLEVFEWLKKENRVDEQMVELMVSIMCGWVKKLIQEECGADQVFD 202

Query: 886  XXXXXXXXXXKTSFSMIEKVISLYWEAGEKEGTILFVKEVLRR----GISVLDGDRDGNK 1053
                      K  FSM+EKVI+LY E G+KE  +LFVKEVLRR    G SV+ G  +G K
Sbjct: 203  LLIEMDCVGLKPGFSMMEKVIALYCEMGKKESAVLFVKEVLRRRDGFGYSVVGGS-EGRK 261

Query: 1054 GGPAGYLAWKMMEEGNYRDAAKLVIHLRECGLKPEVYSYLIAMTAVVKELNEFAKALRKL 1233
            GGP GYLAWK+M +G+Y+ A  LV+ LR  GL PE YSYLIAMTA+VKELN   K LR+L
Sbjct: 262  GGPVGYLAWKLMVDGDYKKAVDLVVELRLSGLMPEAYSYLIAMTAIVKELNSLGKTLREL 321

Query: 1234 KGFTKAGLVAELDMDNVGLIENYQQDLLTYGLHLSNWVIEEGGP--SLLGVVHERLLAMY 1407
            K FT+AG V E+D  +  LIE YQ + L+ GL L+ W +EEG    S++GVVHERLLAMY
Sbjct: 322  KRFTRAGYVTEIDDHDRVLIEKYQSETLSRGLQLATWAVEEGQQEDSIIGVVHERLLAMY 381

Query: 1408 VCAGRGSAAERQLWEMKLVGKEADGDLYDIVLAICASQKESSSVGRLLTRMDVASPFRRK 1587
            +CAGRG  AE+QLW+MKL G+E + +L+DIV+AICASQKE ++V RLLTR++     R+K
Sbjct: 382  ICAGRGPEAEKQLWKMKLAGREPEAELHDIVMAICASQKEVNAVSRLLTRVEFMESKRKK 441

Query: 1588 KTLSWLLRGYIKGGHFKNAAETVIKMLDSGLYPEFLDRVAVVQGLSRRIQQQGNVDTYLT 1767
            KTLSWLLRGY+KGGHF+ AAET+I M+DSGL+PE++DRVAV+QG++R+IQ+  +++ Y+ 
Sbjct: 442  KTLSWLLRGYVKGGHFEEAAETLITMIDSGLHPEYIDRVAVMQGMTRKIQRPRDIEAYMG 501

Query: 1768 LCKRLSDANLIGPSLVYLHMRKHKLWIIKML 1860
            LCKRL DA L+GP LVY++M K+KLWI+KM+
Sbjct: 502  LCKRLFDAGLVGPCLVYMYMDKYKLWIVKMM 532

