BLASTX nr result

ID: Rehmannia22_contig00009052 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia22_contig00009052
         (2042 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EMJ06247.1| hypothetical protein PRUPE_ppa004609mg [Prunus pe...   679   0.0  
ref|XP_002313976.1| ubiquitin family protein [Populus trichocarp...   674   0.0  
ref|XP_002521193.1| conserved hypothetical protein [Ricinus comm...   673   0.0  
ref|XP_002278434.1| PREDICTED: pentatricopeptide repeat-containi...   673   0.0  
gb|EOY34562.1| Pentatricopeptide repeat-containing protein [Theo...   670   0.0  
ref|XP_003551233.1| PREDICTED: pentatricopeptide repeat-containi...   664   0.0  
gb|ESW18802.1| hypothetical protein PHAVU_006G071400g [Phaseolus...   662   0.0  
ref|XP_006425116.1| hypothetical protein CICLE_v10028251mg [Citr...   659   0.0  
ref|XP_006488563.1| PREDICTED: pentatricopeptide repeat-containi...   657   0.0  
ref|XP_003538312.1| PREDICTED: pentatricopeptide repeat-containi...   654   0.0  
ref|XP_004296059.1| PREDICTED: uncharacterized protein LOC101292...   652   0.0  
gb|EXB37964.1| hypothetical protein L484_011688 [Morus notabilis]     650   0.0  
ref|XP_004168796.1| PREDICTED: LOW QUALITY PROTEIN: pentatricope...   644   0.0  
ref|XP_004239038.1| PREDICTED: pentatricopeptide repeat-containi...   635   e-179
ref|XP_006348674.1| PREDICTED: pentatricopeptide repeat-containi...   634   e-179
ref|XP_004143220.1| PREDICTED: uncharacterized protein LOC101207...   613   e-173
ref|XP_004500294.1| PREDICTED: pentatricopeptide repeat-containi...   605   e-170
gb|EPS70238.1| hypothetical protein M569_04522 [Genlisea aurea]       590   e-166
ref|XP_006293981.1| hypothetical protein CARUB_v10022972mg [Caps...   562   e-157
ref|XP_006293980.1| hypothetical protein CARUB_v10022972mg [Caps...   562   e-157

>gb|EMJ06247.1| hypothetical protein PRUPE_ppa004609mg [Prunus persica]
          Length = 500

 Score =  679 bits (1753), Expect = 0.0
 Identities = 345/464 (74%), Positives = 385/464 (82%), Gaps = 1/464 (0%)
 Frame = +2

Query: 509  YARISVGESPFSLQKKRSKVQGFGMLKSVQLDVFITSDDEDEMSEGFFAAIEELERMARE 688
            + RI   + P  +  K SKV+ F + KSV+LD F+TSDDEDEM EGFF AIEELERM RE
Sbjct: 37   FPRICKHQKPNFIVAKSSKVRDFRLFKSVELDQFLTSDDEDEMGEGFFEAIEELERMTRE 96

Query: 689  PSDVLEEMNDKLSARELQLVLVYFSQEGRDSWCALEVFEWLKKENRVDKETMELMVSIMC 868
            PSDVLEEMND+LSARELQLVLVYFSQEGRDSWCALEVFEWL+KENRVDKETM+LMVSIMC
Sbjct: 97   PSDVLEEMNDRLSARELQLVLVYFSQEGRDSWCALEVFEWLRKENRVDKETMDLMVSIMC 156

Query: 869  TWVKKLIEGKNKXXXXXXXXXXXXXXXXKTSFSMIEKVISLYWEAGEKEGTILFVKEVLR 1048
            +WVKKLI+ ++                 K SFSM+EKVISLYWE GEKE  +LFVKEVL+
Sbjct: 157  SWVKKLIQREHDIGDVVDLLVDMDCVGLKPSFSMMEKVISLYWEMGEKEKAVLFVKEVLK 216

Query: 1049 RGISVLD-GDRDGNKGGPAGYLAWKMMEEGNYRDAAKLVIHLRECGLKPEVYSYLIAMTA 1225
            RGI   +  D DG+KGGP GYLAWKMM EGNYRD+ KLVIHLRE GLKPEVYSYLIAMTA
Sbjct: 217  RGIVYSEEDDTDGHKGGPTGYLAWKMMVEGNYRDSVKLVIHLRESGLKPEVYSYLIAMTA 276

Query: 1226 VVKELNEFAKALRKLKGFTKAGLVAELDMDNVGLIENYQQDLLTYGLHLSNWVIEEGGPS 1405
            VVKELNE AKALRKLKGFT+AGL+AE D +NVGLIE YQ DLL+ G+ LSNWVI+EG  S
Sbjct: 277  VVKELNELAKALRKLKGFTRAGLIAEFDTENVGLIEKYQSDLLSDGVQLSNWVIQEGSSS 336

Query: 1406 LLGVVHERLLAMYVCAGRGSAAERQLWEMKLVGKEADGDLYDIVLAICASQKESSSVGRL 1585
            L GVVHERLLAMY+C+G G  AERQLWEMKLVGKEAD DLYDIVLAICASQKE+S++GRL
Sbjct: 337  LHGVVHERLLAMYICSGHGLEAERQLWEMKLVGKEADADLYDIVLAICASQKEASAIGRL 396

Query: 1586 LTRMDVASPFRRKKTLSWLLRGYIKGGHFKNAAETVIKMLDSGLYPEFLDRVAVLQGLSR 1765
            LTR +V S  R+KK+LSWLLRGYIKGGHF +AAETVIKMLD GL PEFLDR AVLQGL +
Sbjct: 397  LTRTEVTSSLRKKKSLSWLLRGYIKGGHFDDAAETVIKMLDLGLCPEFLDRAAVLQGLRK 456

Query: 1766 RIQQQGNVDTYLTLCKRLSDANLIGPSLVYLHMRKHKLWIIKML 1897
             IQ+ G VDTYL LCKRLSDA+LIGP LVYL +RK+KLWI KML
Sbjct: 457  SIQESGGVDTYLKLCKRLSDASLIGPCLVYLFIRKYKLWITKML 500


>ref|XP_002313976.1| ubiquitin family protein [Populus trichocarpa]
            gi|222850384|gb|EEE87931.1| ubiquitin family protein
            [Populus trichocarpa]
          Length = 500

 Score =  674 bits (1739), Expect = 0.0
 Identities = 340/506 (67%), Positives = 407/506 (80%), Gaps = 7/506 (1%)
 Frame = +2

Query: 401  MASVGGIAAISNLGLTYSSSSKHY-------IFLSTHLPNIKSYARISVGESPFSLQKKR 559
            MAS    +++S +   +S   +++         +ST + N ++  R      P  +  K 
Sbjct: 1    MASAHAFSSLSKVSPVFSLKKRYWNSCMKPCCMVSTIICNYQTPKR------PNFVVAKT 54

Query: 560  SKVQGFGMLKSVQLDVFITSDDEDEMSEGFFAAIEELERMAREPSDVLEEMNDKLSAREL 739
            +KV+ F + KSV+LD ++TSDDE+EM EGFF AIEELERM REPSD+LEEMND+LSAREL
Sbjct: 55   TKVREFRLFKSVELDQYVTSDDEEEMGEGFFEAIEELERMTREPSDILEEMNDRLSAREL 114

Query: 740  QLVLVYFSQEGRDSWCALEVFEWLKKENRVDKETMELMVSIMCTWVKKLIEGKNKXXXXX 919
            QLVLVYFSQEGRDSWCALEVFEWL+KENRVDKETMELMVSIMC+WVKKLIEG+       
Sbjct: 115  QLVLVYFSQEGRDSWCALEVFEWLRKENRVDKETMELMVSIMCSWVKKLIEGEQDVGDVV 174

Query: 920  XXXXXXXXXXXKTSFSMIEKVISLYWEAGEKEGTILFVKEVLRRGISVLDGDRDGNKGGP 1099
                       K SFSMIEKVISLYW+ G+KEG + FVKEVLRRGI+    D +G KGGP
Sbjct: 175  DLLVDMDCVGLKPSFSMIEKVISLYWDMGKKEGAVSFVKEVLRRGIAYSGDDGEGQKGGP 234

Query: 1100 AGYLAWKMMEEGNYRDAAKLVIHLRECGLKPEVYSYLIAMTAVVKELNEFAKALRKLKGF 1279
             GYL WKMM +GNYR+A KLVIHLRE GLKPE+Y+YLIAMTAVVKELNEF+KALRKLKG+
Sbjct: 235  TGYLTWKMMVDGNYRNAVKLVIHLRESGLKPEIYAYLIAMTAVVKELNEFSKALRKLKGY 294

Query: 1280 TKAGLVAELDMDNVGLIENYQQDLLTYGLHLSNWVIEEGGPSLLGVVHERLLAMYVCAGR 1459
            +++G+V ELD +NV L+E YQ DLL  G+ LS+WVI+EG P+L GVVHERLLAMY+CAGR
Sbjct: 295  SRSGMVTELDAENVELVEKYQSDLLADGVCLSSWVIQEGSPALYGVVHERLLAMYICAGR 354

Query: 1460 GSAAERQLWEMKLVGKEADGDLYDIVLAICASQKESSSVGRLLTRMDVASPFRRKKTLSW 1639
            G  AERQLWEMKLVGKEADGDLYDIVLAICASQKE+S+V RLLTR++VAS  R+KK+LSW
Sbjct: 355  GLDAERQLWEMKLVGKEADGDLYDIVLAICASQKEASAVARLLTRIEVASSMRKKKSLSW 414

Query: 1640 LLRGYIKGGHFKNAAETVIKMLDSGLYPEFLDRVAVLQGLSRRIQQQGNVDTYLTLCKRL 1819
            LLRGYIKGGH+  AAET+IKMLD GL P++LDRVAV+QGL +RIQQ GNV++YL LCKRL
Sbjct: 415  LLRGYIKGGHYGEAAETLIKMLDLGLSPDYLDRVAVMQGLRKRIQQWGNVESYLKLCKRL 474

Query: 1820 SDANLIGPSLVYLHMRKHKLWIIKML 1897
            SD NLIGPSLVYL+++K+KLWI+K+L
Sbjct: 475  SDVNLIGPSLVYLYIKKYKLWIMKLL 500


>ref|XP_002521193.1| conserved hypothetical protein [Ricinus communis]
            gi|223539607|gb|EEF41193.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 499

 Score =  673 bits (1737), Expect = 0.0
 Identities = 334/450 (74%), Positives = 383/450 (85%)
 Frame = +2

Query: 548  QKKRSKVQGFGMLKSVQLDVFITSDDEDEMSEGFFAAIEELERMAREPSDVLEEMNDKLS 727
            Q+ +S+ + F +LKSV+LD +I SDDE+EMSEGFF AIEELERM REPSDVLEEMNDKLS
Sbjct: 50   QQSKSRNREFRVLKSVELDQYIASDDEEEMSEGFFEAIEELERMTREPSDVLEEMNDKLS 109

Query: 728  ARELQLVLVYFSQEGRDSWCALEVFEWLKKENRVDKETMELMVSIMCTWVKKLIEGKNKX 907
            ARELQLVLVYFSQEGRDSWCALEVFEWL+KENRVDKETMELMVSIMC+W+KKLIEG+++ 
Sbjct: 110  ARELQLVLVYFSQEGRDSWCALEVFEWLRKENRVDKETMELMVSIMCSWIKKLIEGEHEI 169

Query: 908  XXXXXXXXXXXXXXXKTSFSMIEKVISLYWEAGEKEGTILFVKEVLRRGISVLDGDRDGN 1087
                           K SFSMIEKVISLYWE GEKE ++ FVKEVLRR ++  + D +G 
Sbjct: 170  GDVVDLLVDMDCVGLKPSFSMIEKVISLYWEIGEKEKSVSFVKEVLRREVAYFEDDGEGQ 229

Query: 1088 KGGPAGYLAWKMMEEGNYRDAAKLVIHLRECGLKPEVYSYLIAMTAVVKELNEFAKALRK 1267
            KGGP GYLAWKMM +GNYRDA KLVIH RE GLKPEVYSYLIAMTAVVKELNEFAKALRK
Sbjct: 230  KGGPTGYLAWKMMVDGNYRDAVKLVIHFRESGLKPEVYSYLIAMTAVVKELNEFAKALRK 289

Query: 1268 LKGFTKAGLVAELDMDNVGLIENYQQDLLTYGLHLSNWVIEEGGPSLLGVVHERLLAMYV 1447
            LKGF K+GL+AELD +N  LIE YQ DL+  G+ LS+WVI+EG PSL GVVHERLLAMY+
Sbjct: 290  LKGFAKSGLIAELDAENTRLIEKYQSDLIADGVCLSSWVIQEGSPSLYGVVHERLLAMYI 349

Query: 1448 CAGRGSAAERQLWEMKLVGKEADGDLYDIVLAICASQKESSSVGRLLTRMDVASPFRRKK 1627
            CAGRG  AERQLWEMKLVGK ADGDLYDIVLAICASQKE+S+V RLLTR++V S  ++KK
Sbjct: 350  CAGRGLDAERQLWEMKLVGKHADGDLYDIVLAICASQKEASAVSRLLTRVEVTSSLQKKK 409

Query: 1628 TLSWLLRGYIKGGHFKNAAETVIKMLDSGLYPEFLDRVAVLQGLSRRIQQQGNVDTYLTL 1807
            TLSWLLRGY+KGG +  AAE ++KMLD GL P++LDRVAVLQGL +RIQQ GNV++YL L
Sbjct: 410  TLSWLLRGYLKGGQYDEAAEALVKMLDMGLCPDYLDRVAVLQGLRKRIQQWGNVESYLNL 469

Query: 1808 CKRLSDANLIGPSLVYLHMRKHKLWIIKML 1897
            CKRLSD NLIGPSLVYL+++K+KLWI+KML
Sbjct: 470  CKRLSDENLIGPSLVYLYIKKYKLWIMKML 499


>ref|XP_002278434.1| PREDICTED: pentatricopeptide repeat-containing protein At2g30100,
            chloroplastic [Vitis vinifera]
          Length = 511

 Score =  673 bits (1737), Expect = 0.0
 Identities = 355/517 (68%), Positives = 402/517 (77%), Gaps = 18/517 (3%)
 Frame = +2

Query: 401  MASVGGIAAI----SNLGLTYSSSSKHYIFLSTHLPN--IKSYARISVGE---------- 532
            MAS  G A+     + LG T SSS       S   P   +  ++R  +GE          
Sbjct: 1    MASAHGFASSLMSPTELGFTLSSS------FSIQRPRLIVPKFSRSFLGEYCSRATTICN 54

Query: 533  --SPFSLQKKRSKVQGFGMLKSVQLDVFITSDDEDEMSEGFFAAIEELERMAREPSDVLE 706
              +P  +  KR K++ F + KSV+LD F+TSDDEDEMSEGFF AIEELERM REPSDVLE
Sbjct: 55   HQNPRFVVPKRDKIREFRLFKSVELDQFLTSDDEDEMSEGFFEAIEELERMTREPSDVLE 114

Query: 707  EMNDKLSARELQLVLVYFSQEGRDSWCALEVFEWLKKENRVDKETMELMVSIMCTWVKKL 886
            EMND+LSARELQLVLVYFSQEGRDSWCALEVFEWL+KENRVDKETMELMVSIMC+WVKKL
Sbjct: 115  EMNDRLSARELQLVLVYFSQEGRDSWCALEVFEWLRKENRVDKETMELMVSIMCSWVKKL 174

Query: 887  IEGKNKXXXXXXXXXXXXXXXXKTSFSMIEKVISLYWEAGEKEGTILFVKEVLRRGISVL 1066
            IEG++                 K  FSMIEKVISLYWE  EKE  +LFVKEVLRR I+  
Sbjct: 175  IEGEHDVGDVVDLLVDMDCVGLKPGFSMIEKVISLYWEMEEKEKAVLFVKEVLRREIAYS 234

Query: 1067 DGDRDGNKGGPAGYLAWKMMEEGNYRDAAKLVIHLRECGLKPEVYSYLIAMTAVVKELNE 1246
            + D DG+KGGP GYLAWKMM EGNYR A KLVIHLRE GLKPEVYSYLIAMTAVVKELNE
Sbjct: 235  EDDGDGHKGGPTGYLAWKMMAEGNYRGAVKLVIHLRESGLKPEVYSYLIAMTAVVKELNE 294

Query: 1247 FAKALRKLKGFTKAGLVAELDMDNVGLIENYQQDLLTYGLHLSNWVIEEGGPSLLGVVHE 1426
            FAKALRKLKGFTK+GL+AELD +NV LIE YQ DLL  G+ LS+WVI+EG   L GVV+E
Sbjct: 295  FAKALRKLKGFTKSGLIAELDAENVELIEKYQSDLLADGVRLSSWVIQEGRSPLHGVVYE 354

Query: 1427 RLLAMYVCAGRGSAAERQLWEMKLVGKEADGDLYDIVLAICASQKESSSVGRLLTRMDVA 1606
            RLLAMY+CAGRG  AERQLWEMKLVGKEAD +LYDIVLAICAS+KE+S++ RLLT M+V 
Sbjct: 355  RLLAMYICAGRGLEAERQLWEMKLVGKEADRELYDIVLAICASKKEASAISRLLTGMEVT 414

Query: 1607 SPFRRKKTLSWLLRGYIKGGHFKNAAETVIKMLDSGLYPEFLDRVAVLQGLSRRIQQQGN 1786
            S  RRKKTLSWLLRGYIKG HF +A+ET+IKMLD GL PE+LDR AVLQGL  RIQQ GN
Sbjct: 415  SSIRRKKTLSWLLRGYIKGSHFDDASETIIKMLDLGLCPEYLDRAAVLQGLRNRIQQTGN 474

Query: 1787 VDTYLTLCKRLSDANLIGPSLVYLHMRKHKLWIIKML 1897
            V+TYL LCK LSDANLIGP LVYL+++K+KLWI+K +
Sbjct: 475  VETYLKLCKHLSDANLIGPCLVYLYIKKYKLWILKTI 511


>gb|EOY34562.1| Pentatricopeptide repeat-containing protein [Theobroma cacao]
          Length = 504

 Score =  670 bits (1729), Expect = 0.0
 Identities = 336/462 (72%), Positives = 385/462 (83%), Gaps = 1/462 (0%)
 Frame = +2

Query: 515  RISVGESP-FSLQKKRSKVQGFGMLKSVQLDVFITSDDEDEMSEGFFAAIEELERMAREP 691
            RI   ++P F L+K + K +   + KSV+LD F+TSDDEDEMSEGFF AIEELERM REP
Sbjct: 43   RICNHQNPSFVLRKIQPKTRECRLFKSVELDQFLTSDDEDEMSEGFFEAIEELERMTREP 102

Query: 692  SDVLEEMNDKLSARELQLVLVYFSQEGRDSWCALEVFEWLKKENRVDKETMELMVSIMCT 871
            SD+LEEMND+LS+RELQLVLVYFSQEGRDSWCALEVFEWLKKEN+VD ETMELMVSIMC+
Sbjct: 103  SDILEEMNDRLSSRELQLVLVYFSQEGRDSWCALEVFEWLKKENKVDNETMELMVSIMCS 162

Query: 872  WVKKLIEGKNKXXXXXXXXXXXXXXXXKTSFSMIEKVISLYWEAGEKEGTILFVKEVLRR 1051
            WVKKLIEG+                  K  FSMIEKVIS+YWE  +K+  ++FVKEVLRR
Sbjct: 163  WVKKLIEGEGDVGDVVDLLVDMDCVGLKPGFSMIEKVISMYWEMEKKDRAVVFVKEVLRR 222

Query: 1052 GISVLDGDRDGNKGGPAGYLAWKMMEEGNYRDAAKLVIHLRECGLKPEVYSYLIAMTAVV 1231
            GIS  D D +G KGGP GYLAWKMM EGNYRDA KLVI LRE GLKPE+YSYLIAMTA+V
Sbjct: 223  GISYEDEDGEGQKGGPTGYLAWKMMVEGNYRDAIKLVIELRESGLKPEIYSYLIAMTAIV 282

Query: 1232 KELNEFAKALRKLKGFTKAGLVAELDMDNVGLIENYQQDLLTYGLHLSNWVIEEGGPSLL 1411
            KELNEFAKALRKLKGF ++GLVAELDM+NV LI+ YQ DLL  GL LSNW I+EG  SL 
Sbjct: 283  KELNEFAKALRKLKGFARSGLVAELDMENVELIKKYQSDLLADGLRLSNWAIQEGTSSLF 342

Query: 1412 GVVHERLLAMYVCAGRGSAAERQLWEMKLVGKEADGDLYDIVLAICASQKESSSVGRLLT 1591
            G+VHERLLAMY+CAGRG  AERQLWEMKL GKEADGDL+DIVLAICASQKE+S++ RLLT
Sbjct: 343  GLVHERLLAMYICAGRGLEAERQLWEMKLAGKEADGDLHDIVLAICASQKEASAISRLLT 402

Query: 1592 RMDVASPFRRKKTLSWLLRGYIKGGHFKNAAETVIKMLDSGLYPEFLDRVAVLQGLSRRI 1771
            RM+V+S  RRKKTLSWLLRGYIKGGH  +AAETVIKMLD GL+PE+LDR AVLQ L +RI
Sbjct: 403  RMEVSSSLRRKKTLSWLLRGYIKGGHISDAAETVIKMLDLGLHPEYLDRAAVLQELRKRI 462

Query: 1772 QQQGNVDTYLTLCKRLSDANLIGPSLVYLHMRKHKLWIIKML 1897
            QQ GN++TY+ LCKRL DA+LIGP L+YL+++K+KLW+IKML
Sbjct: 463  QQPGNIETYVNLCKRLYDASLIGPCLIYLYIKKYKLWVIKML 504


>ref|XP_003551233.1| PREDICTED: pentatricopeptide repeat-containing protein At2g30100,
            chloroplastic-like [Glycine max]
          Length = 508

 Score =  664 bits (1714), Expect = 0.0
 Identities = 338/510 (66%), Positives = 407/510 (79%), Gaps = 11/510 (2%)
 Frame = +2

Query: 401  MASVGGIAAISNLGLTYSSSS-----KH-YIFLSTHLP-NIKSYARISVG----ESPFSL 547
            MAS  G+A I  LG  +SS S     +H  +F ++H   ++K Y  +S      ++P  +
Sbjct: 1    MASAHGLAPIFKLGFVFSSVSPSQRKRHPLMFPASHCGFSLKFYGGLSARSCKFKNPSFV 60

Query: 548  QKKRSKVQGFGMLKSVQLDVFITSDDEDEMSEGFFAAIEELERMAREPSDVLEEMNDKLS 727
              K   ++GF  LKSV++D ++TS+DE  MS+GFF AIEELERM REPSDVLEEMND+LS
Sbjct: 61   SAKHGSLRGFRALKSVEMDQYVTSNDE--MSDGFFEAIEELERMTREPSDVLEEMNDRLS 118

Query: 728  ARELQLVLVYFSQEGRDSWCALEVFEWLKKENRVDKETMELMVSIMCTWVKKLIEGKNKX 907
            ARELQLVLVYFSQ+GRDSWCALEVF+WL+KENRVDKETMELMV+IMC WVKKLI+ ++  
Sbjct: 119  ARELQLVLVYFSQDGRDSWCALEVFDWLRKENRVDKETMELMVAIMCGWVKKLIQQQHGV 178

Query: 908  XXXXXXXXXXXXXXXKTSFSMIEKVISLYWEAGEKEGTILFVKEVLRRGISVLDGDRDGN 1087
                           +  FSMIEKVISLYWE GEKEG +LFV+EVLRRGI  ++ D +G+
Sbjct: 179  GDVVDLLVDMDCVGLRPGFSMIEKVISLYWEMGEKEGAVLFVEEVLRRGIPYVEEDEEGH 238

Query: 1088 KGGPAGYLAWKMMEEGNYRDAAKLVIHLRECGLKPEVYSYLIAMTAVVKELNEFAKALRK 1267
            KGGP GYLAWKMM EG+YR+A +LVI  RE GLKPE+YSYL+AMTAVVKELNEFAKALRK
Sbjct: 239  KGGPTGYLAWKMMAEGDYRNAVRLVIRFRESGLKPEIYSYLVAMTAVVKELNEFAKALRK 298

Query: 1268 LKGFTKAGLVAELDMDNVGLIENYQQDLLTYGLHLSNWVIEEGGPSLLGVVHERLLAMYV 1447
            LKGFT+AGLVAELD+++V L E YQ D L  G+ LSNWVI++G PSL G+VHERLLAMY+
Sbjct: 299  LKGFTRAGLVAELDLEDVELTEKYQSDTLADGVRLSNWVIQDGSPSLHGIVHERLLAMYI 358

Query: 1448 CAGRGSAAERQLWEMKLVGKEADGDLYDIVLAICASQKESSSVGRLLTRMDVASPFRRKK 1627
            CAG G  AERQLWEMKLVGKEADGDLYDIVLAICASQKES++  RLLTR++V S  ++KK
Sbjct: 359  CAGHGIEAERQLWEMKLVGKEADGDLYDIVLAICASQKESNATARLLTRLEVVSSPQKKK 418

Query: 1628 TLSWLLRGYIKGGHFKNAAETVIKMLDSGLYPEFLDRVAVLQGLSRRIQQQGNVDTYLTL 1807
            +LSWLLRGYIKGGHF  AAET++KML+ G YPE+LDR AVLQGL +RIQQ GN+DTY+ L
Sbjct: 419  SLSWLLRGYIKGGHFNEAAETIMKMLELGFYPEYLDRAAVLQGLRKRIQQYGNLDTYVRL 478

Query: 1808 CKRLSDANLIGPSLVYLHMRKHKLWIIKML 1897
            CK LSDANLIGP LV+L++RK+KLW++KML
Sbjct: 479  CKSLSDANLIGPCLVHLYIRKYKLWVVKML 508


>gb|ESW18802.1| hypothetical protein PHAVU_006G071400g [Phaseolus vulgaris]
          Length = 510

 Score =  662 bits (1707), Expect = 0.0
 Identities = 341/510 (66%), Positives = 400/510 (78%), Gaps = 11/510 (2%)
 Frame = +2

Query: 401  MASVGGIAAISNLGLTYSSSSKHY-----IFLSTHLP-NIKSY----ARISVGESPFSLQ 550
            MA   G A    LG  +SS S        +F + H   ++K Y    AR    ++P  + 
Sbjct: 1    MAYAQGFAPNFKLGFVFSSGSPSQQRYPLMFPAAHCGFSLKFYNGVCARSFKFQNPSIVA 60

Query: 551  KKRSKVQGFGMLKSVQLDVFITSDDE-DEMSEGFFAAIEELERMAREPSDVLEEMNDKLS 727
             K   V+GF +LKSV+LD F+TSDDE DEM +GFF AIEELERM REPSD+LEEMND+LS
Sbjct: 61   AKHCSVRGFRVLKSVELDQFVTSDDEEDEMGDGFFEAIEELERMTREPSDILEEMNDRLS 120

Query: 728  ARELQLVLVYFSQEGRDSWCALEVFEWLKKENRVDKETMELMVSIMCTWVKKLIEGKNKX 907
            ARELQLVLVYFSQ+GRDSWCALEVF+WL+KENRVDKETMELMVSIMC WVKKLI+ ++  
Sbjct: 121  ARELQLVLVYFSQDGRDSWCALEVFDWLRKENRVDKETMELMVSIMCGWVKKLIQEQHGV 180

Query: 908  XXXXXXXXXXXXXXXKTSFSMIEKVISLYWEAGEKEGTILFVKEVLRRGISVLDGDRDGN 1087
                           +  FSMIEKVISLYWE GEKEG +LFV+EVLRRGI     D++G+
Sbjct: 181  GDVIDLLVDMDCVGLRPGFSMIEKVISLYWEMGEKEGAVLFVEEVLRRGIPYASEDKEGH 240

Query: 1088 KGGPAGYLAWKMMEEGNYRDAAKLVIHLRECGLKPEVYSYLIAMTAVVKELNEFAKALRK 1267
            KGGP GYLAWKMM EG+YR A +LVI  RE GLKPEVYSYL+AMTAVVKELNEFAKALRK
Sbjct: 241  KGGPTGYLAWKMMAEGDYRSAVRLVIRFRESGLKPEVYSYLVAMTAVVKELNEFAKALRK 300

Query: 1268 LKGFTKAGLVAELDMDNVGLIENYQQDLLTYGLHLSNWVIEEGGPSLLGVVHERLLAMYV 1447
            LK FT+AGLV ELD+++V L E YQ DLL  G+ LSNWVI++G PSL GVVHERLLAMY+
Sbjct: 301  LKSFTRAGLVTELDLEDVELAEKYQTDLLADGVRLSNWVIQDGRPSLYGVVHERLLAMYI 360

Query: 1448 CAGRGSAAERQLWEMKLVGKEADGDLYDIVLAICASQKESSSVGRLLTRMDVASPFRRKK 1627
            CAG G  AERQLWEMKLVGKEADGDLYDIVLAICASQKE ++  RLLTR+++A+  ++KK
Sbjct: 361  CAGHGIEAERQLWEMKLVGKEADGDLYDIVLAICASQKEVNATARLLTRLELANSPQKKK 420

Query: 1628 TLSWLLRGYIKGGHFKNAAETVIKMLDSGLYPEFLDRVAVLQGLSRRIQQQGNVDTYLTL 1807
            +LSWLLRGYIKGGHF  AAETV+KML+ G YPE+LDR AVLQGL +RIQQ GN+DTY+ L
Sbjct: 421  SLSWLLRGYIKGGHFTEAAETVMKMLELGFYPEYLDRAAVLQGLRKRIQQYGNLDTYVRL 480

Query: 1808 CKRLSDANLIGPSLVYLHMRKHKLWIIKML 1897
            CK LSDANLIGP LV+L++RK+KLW++KML
Sbjct: 481  CKSLSDANLIGPCLVHLYIRKYKLWVVKML 510


>ref|XP_006425116.1| hypothetical protein CICLE_v10028251mg [Citrus clementina]
            gi|557527050|gb|ESR38356.1| hypothetical protein
            CICLE_v10028251mg [Citrus clementina]
          Length = 502

 Score =  659 bits (1700), Expect = 0.0
 Identities = 333/461 (72%), Positives = 374/461 (81%)
 Frame = +2

Query: 515  RISVGESPFSLQKKRSKVQGFGMLKSVQLDVFITSDDEDEMSEGFFAAIEELERMAREPS 694
            RI   ++P  +  K SK++ F  LKSV+LD F+TSDDEDEMSE FF AIEELERM REPS
Sbjct: 42   RIKNYQNPSFIATKVSKIREFRFLKSVELDQFVTSDDEDEMSEEFFEAIEELERMTREPS 101

Query: 695  DVLEEMNDKLSARELQLVLVYFSQEGRDSWCALEVFEWLKKENRVDKETMELMVSIMCTW 874
            D+LEEMND+LSARELQLVLVYFSQEGRDSWCALEVFEWLKKENRVD ETMELMVSIMC+W
Sbjct: 102  DILEEMNDRLSARELQLVLVYFSQEGRDSWCALEVFEWLKKENRVDNETMELMVSIMCSW 161

Query: 875  VKKLIEGKNKXXXXXXXXXXXXXXXXKTSFSMIEKVISLYWEAGEKEGTILFVKEVLRRG 1054
            VKK IE +                  K  FSMIEKVISLYWE  +KE  +LFVK VL RG
Sbjct: 162  VKKYIEEERDVGDVIDLLVDMDCVGLKPGFSMIEKVISLYWEMEKKERAVLFVKAVLSRG 221

Query: 1055 ISVLDGDRDGNKGGPAGYLAWKMMEEGNYRDAAKLVIHLRECGLKPEVYSYLIAMTAVVK 1234
            I+  +GD +G KGGP GYLAWKMM EG Y DA KLVIHLRE GLKPEVYSYLIA+TAVVK
Sbjct: 222  IAYAEGDGEGQKGGPTGYLAWKMMVEGKYVDAIKLVIHLRESGLKPEVYSYLIALTAVVK 281

Query: 1235 ELNEFAKALRKLKGFTKAGLVAELDMDNVGLIENYQQDLLTYGLHLSNWVIEEGGPSLLG 1414
            ELNEF KALRKLKG+ +AG +AELD  N+GLIE YQ DLL  G  LS+W I+EGG SL G
Sbjct: 282  ELNEFGKALRKLKGYVRAGSIAELDGKNLGLIEKYQSDLLADGSRLSSWAIQEGGSSLYG 341

Query: 1415 VVHERLLAMYVCAGRGSAAERQLWEMKLVGKEADGDLYDIVLAICASQKESSSVGRLLTR 1594
            VVHERLLAMY+CAGRG  AERQLWEMKLVGKEADGDLYDIVLAICASQ E S+V RLL+R
Sbjct: 342  VVHERLLAMYICAGRGLEAERQLWEMKLVGKEADGDLYDIVLAICASQNEGSAVSRLLSR 401

Query: 1595 MDVASPFRRKKTLSWLLRGYIKGGHFKNAAETVIKMLDSGLYPEFLDRVAVLQGLSRRIQ 1774
            ++V +   +KKTLSWLLRGYIKGGH  +AAET+ KMLD GLYPE++DRVAVLQGL +RIQ
Sbjct: 402  IEVMNSLCKKKTLSWLLRGYIKGGHINDAAETLTKMLDLGLYPEYMDRVAVLQGLRKRIQ 461

Query: 1775 QQGNVDTYLTLCKRLSDANLIGPSLVYLHMRKHKLWIIKML 1897
            Q GNV+ YL LCKRLSD +LIGP LVYL+++K+KLWIIKML
Sbjct: 462  QSGNVEAYLNLCKRLSDTSLIGPCLVYLYIKKYKLWIIKML 502


>ref|XP_006488563.1| PREDICTED: pentatricopeptide repeat-containing protein At2g30100,
            chloroplastic-like [Citrus sinensis]
          Length = 502

 Score =  657 bits (1694), Expect = 0.0
 Identities = 332/461 (72%), Positives = 374/461 (81%)
 Frame = +2

Query: 515  RISVGESPFSLQKKRSKVQGFGMLKSVQLDVFITSDDEDEMSEGFFAAIEELERMAREPS 694
            RI   ++P  +  K SK++ F  LKSV+LD F+TSDDEDEMSE FF AIEELERM REPS
Sbjct: 42   RIKNYQNPNFIATKVSKIREFRFLKSVELDQFVTSDDEDEMSEEFFEAIEELERMTREPS 101

Query: 695  DVLEEMNDKLSARELQLVLVYFSQEGRDSWCALEVFEWLKKENRVDKETMELMVSIMCTW 874
            D+LEEMND+LSARELQLVLVYFSQEGRDSWCALEVFEWLKKENRVD ETMELMVSIMC+W
Sbjct: 102  DILEEMNDRLSARELQLVLVYFSQEGRDSWCALEVFEWLKKENRVDNETMELMVSIMCSW 161

Query: 875  VKKLIEGKNKXXXXXXXXXXXXXXXXKTSFSMIEKVISLYWEAGEKEGTILFVKEVLRRG 1054
            VKK IE +                  K  FSMIEKVISLYWE  +KE  +LFVK VL RG
Sbjct: 162  VKKYIEEERGVGDVVDLLVDMDCVGLKPGFSMIEKVISLYWEMEKKERAVLFVKAVLSRG 221

Query: 1055 ISVLDGDRDGNKGGPAGYLAWKMMEEGNYRDAAKLVIHLRECGLKPEVYSYLIAMTAVVK 1234
            I+  +GD +G +GGP GYLAWKMM EG Y DA KLVIHLRE GLKPEVYSYLIA+TAVVK
Sbjct: 222  IAYAEGDGEGQQGGPTGYLAWKMMVEGKYVDAIKLVIHLRESGLKPEVYSYLIALTAVVK 281

Query: 1235 ELNEFAKALRKLKGFTKAGLVAELDMDNVGLIENYQQDLLTYGLHLSNWVIEEGGPSLLG 1414
            ELNEF KALRKLKG+ +AG +AELD  N+GLIE YQ DLL  G  LS+W I+EGG SL G
Sbjct: 282  ELNEFGKALRKLKGYVRAGSIAELDGKNLGLIEKYQSDLLADGSRLSSWAIQEGGSSLYG 341

Query: 1415 VVHERLLAMYVCAGRGSAAERQLWEMKLVGKEADGDLYDIVLAICASQKESSSVGRLLTR 1594
            VVHERLLAMY+CAGRG  AERQLWEMKLVGKEADGDLYDIVLAICASQ E S+V RLL+R
Sbjct: 342  VVHERLLAMYICAGRGLEAERQLWEMKLVGKEADGDLYDIVLAICASQNEGSAVSRLLSR 401

Query: 1595 MDVASPFRRKKTLSWLLRGYIKGGHFKNAAETVIKMLDSGLYPEFLDRVAVLQGLSRRIQ 1774
            ++V +   +KKTLSWLLRGYIKGGH  +AAET+ KMLD GLYPE++DRVAVLQGL +RIQ
Sbjct: 402  IEVMNSLCKKKTLSWLLRGYIKGGHINDAAETLTKMLDLGLYPEYMDRVAVLQGLRKRIQ 461

Query: 1775 QQGNVDTYLTLCKRLSDANLIGPSLVYLHMRKHKLWIIKML 1897
            Q GNV+ YL LCKRLSD +LIGP LVYL+++K+KLWIIKML
Sbjct: 462  QSGNVEAYLNLCKRLSDTSLIGPCLVYLYIKKYKLWIIKML 502


>ref|XP_003538312.1| PREDICTED: pentatricopeptide repeat-containing protein At2g30100,
            chloroplastic-like [Glycine max]
          Length = 510

 Score =  654 bits (1686), Expect = 0.0
 Identities = 340/512 (66%), Positives = 401/512 (78%), Gaps = 13/512 (2%)
 Frame = +2

Query: 401  MASVGGIAAISNLGLTYSSSS----KH-YIFLSTHLP-NIKSY-----ARISVGESPFSL 547
            MA   G A I  LG  +SS S    +H  +F ++H   ++K Y     AR    ++P  +
Sbjct: 1    MAYAHGFAPIFKLGFVFSSVSPSQKRHPLVFPASHCGYSLKFYDGVLSARSCKFKNPSFV 60

Query: 548  QKKRSKVQGFGMLKSVQLDVFITSDDE-DEMSEGFFAAIEELERMAREPSDVLEEMNDKL 724
              K+  ++GF  LKSV+LD ++TSDDE DEMS+GFF AIEELERM REPSDVLEEMND+L
Sbjct: 61   --KQGSIRGFRALKSVELDQYVTSDDEEDEMSDGFFEAIEELERMTREPSDVLEEMNDRL 118

Query: 725  SARELQLVLVYFSQEGRDSWCALEVFEWLKKENRVDKETMELMVSIMCTWVKKLI-EGKN 901
            SARELQLVLVYFSQ+GRDSWCALEVF+WL+KENRVDKETMELMV+IMC WVKKLI E   
Sbjct: 119  SARELQLVLVYFSQDGRDSWCALEVFDWLRKENRVDKETMELMVAIMCGWVKKLIQEHHG 178

Query: 902  KXXXXXXXXXXXXXXXXKTSFSMIEKVISLYWEAGEKEGTILFVKEVLRRGISVLDGDRD 1081
                             +  FSMIEKVISLYWE GEKEG +LFV+EVLRRGI  L+ D +
Sbjct: 179  VVGDVVDLLVDMDCVGLRPGFSMIEKVISLYWEMGEKEGAVLFVEEVLRRGIPYLEEDEE 238

Query: 1082 GNKGGPAGYLAWKMMEEGNYRDAAKLVIHLRECGLKPEVYSYLIAMTAVVKELNEFAKAL 1261
            G+KGGP GYLAWKMM EG+Y  A +LVIH  E GLKPEVYSYL+AMTAVVKELNE AKAL
Sbjct: 239  GHKGGPTGYLAWKMMAEGDYTSAVRLVIHFTESGLKPEVYSYLVAMTAVVKELNELAKAL 298

Query: 1262 RKLKGFTKAGLVAELDMDNVGLIENYQQDLLTYGLHLSNWVIEEGGPSLLGVVHERLLAM 1441
            RKLK F + GLVAELD+++V L E YQ DLL  G+ LSNW I++G PSL G++HERLLAM
Sbjct: 299  RKLKSFARTGLVAELDLEDVELTEKYQSDLLGDGVRLSNWAIQDGSPSLHGIIHERLLAM 358

Query: 1442 YVCAGRGSAAERQLWEMKLVGKEADGDLYDIVLAICASQKESSSVGRLLTRMDVASPFRR 1621
            Y+CAG G  AE+QLWEMKLVGKEADGDLYDIVLAICASQKES++  RLLTR++VAS  ++
Sbjct: 359  YICAGHGIEAEKQLWEMKLVGKEADGDLYDIVLAICASQKESNATARLLTRLEVASSPQK 418

Query: 1622 KKTLSWLLRGYIKGGHFKNAAETVIKMLDSGLYPEFLDRVAVLQGLSRRIQQQGNVDTYL 1801
            KK+LSWLLRGYIKGGHF  AAET++KMLD G YPE+LDR AVLQGL +RIQQ GN+DTY+
Sbjct: 419  KKSLSWLLRGYIKGGHFNEAAETIMKMLDLGFYPEYLDRAAVLQGLRKRIQQYGNLDTYV 478

Query: 1802 TLCKRLSDANLIGPSLVYLHMRKHKLWIIKML 1897
             LCK LSDANLIGP LV+L++RK+KLW++KML
Sbjct: 479  RLCKSLSDANLIGPCLVHLYIRKYKLWVVKML 510


>ref|XP_004296059.1| PREDICTED: uncharacterized protein LOC101292395 [Fragaria vesca
            subsp. vesca]
          Length = 1304

 Score =  652 bits (1682), Expect = 0.0
 Identities = 328/455 (72%), Positives = 372/455 (81%), Gaps = 1/455 (0%)
 Frame = +2

Query: 530  ESPFSLQKKRSKVQGFGMLKSVQLDVFITSDDEDEMSEGFFAAIEELERMAREPSDVLEE 709
            ++P  +  K  KV+ F +  SVQLD F+TSDDEDEM E FF AIEELERM REPSDVLEE
Sbjct: 38   KNPSFVVAKSGKVRDFRLFNSVQLDQFVTSDDEDEMGESFFEAIEELERMRREPSDVLEE 97

Query: 710  MNDKLSARELQLVLVYFSQEGRDSWCALEVFEWLKKENRVDKETMELMVSIMCTWVKKLI 889
            MND+LSARELQLVLVYFSQEGRDSWCALEVFEWL++ENRVDKETMELMVSIMC W+K+LI
Sbjct: 98   MNDRLSARELQLVLVYFSQEGRDSWCALEVFEWLRRENRVDKETMELMVSIMCGWLKRLI 157

Query: 890  EGKNKXXXXXXXXXXXXXXXXKTSFSMIEKVISLYWEAGEKEGTILFVKEVLRRGISVLD 1069
            E  N                 K SFSM+EKVISLYWE GEKE  +LFVKEVL+RGI   +
Sbjct: 158  EEGNDVADVIDLLVDVDCVGLKPSFSMMEKVISLYWEMGEKENAVLFVKEVLKRGIVYSE 217

Query: 1070 -GDRDGNKGGPAGYLAWKMMEEGNYRDAAKLVIHLRECGLKPEVYSYLIAMTAVVKELNE 1246
              DRDG+KGGP GYLAWKM  +GNYRD+ K VI LRE GLKPEVYSYLIAMTAVVKELNE
Sbjct: 218  EDDRDGHKGGPTGYLAWKMTVDGNYRDSVKFVIQLRESGLKPEVYSYLIAMTAVVKELNE 277

Query: 1247 FAKALRKLKGFTKAGLVAELDMDNVGLIENYQQDLLTYGLHLSNWVIEEGGPSLLGVVHE 1426
              KALRKLK FT+AGLVAE D ++VGLIE YQ DLL  G+ LSNWVI+EG  +L GVVHE
Sbjct: 278  LGKALRKLKAFTRAGLVAEFDSEDVGLIEKYQSDLLADGVQLSNWVIQEGSSTLCGVVHE 337

Query: 1427 RLLAMYVCAGRGSAAERQLWEMKLVGKEADGDLYDIVLAICASQKESSSVGRLLTRMDVA 1606
            RLLAMY+C+GRG  AERQLWEMKLVGKE DGDLYDIVLAICAS+KE+S++ RLLTR +V+
Sbjct: 338  RLLAMYICSGRGLEAERQLWEMKLVGKEPDGDLYDIVLAICASRKETSAIARLLTRTEVS 397

Query: 1607 SPFRRKKTLSWLLRGYIKGGHFKNAAETVIKMLDSGLYPEFLDRVAVLQGLSRRIQQQGN 1786
            S   +KK+LSWLLRGYIKGGHF +AAETVIKMLD GL+P++LDR AVL GL +RIQQ G 
Sbjct: 398  SSLSKKKSLSWLLRGYIKGGHFNDAAETVIKMLDLGLFPDYLDRAAVLHGLRKRIQQSGT 457

Query: 1787 VDTYLTLCKRLSDANLIGPSLVYLHMRKHKLWIIK 1891
            VDTYL LCKRLSDANLI   L+YL+++KHKLWII+
Sbjct: 458  VDTYLKLCKRLSDANLIESCLLYLYIKKHKLWIIR 492


>gb|EXB37964.1| hypothetical protein L484_011688 [Morus notabilis]
          Length = 516

 Score =  650 bits (1677), Expect = 0.0
 Identities = 340/518 (65%), Positives = 396/518 (76%), Gaps = 19/518 (3%)
 Frame = +2

Query: 401  MASVGGIAAISNLGLTYSSSS----------KHYIFLSTHLPNIKSYARISVG------- 529
            MAS  G   ++ LG   SSSS          ++ IFL     N+  + R S         
Sbjct: 1    MASAQGFTPLTELGFPSSSSSSSSSSSNSLHRNRIFLCRMDENL--WGRTSAKFCPVICC 58

Query: 530  --ESPFSLQKKRSKVQGFGMLKSVQLDVFITSDDEDEMSEGFFAAIEELERMAREPSDVL 703
              ++P  +  K SK++ F +  SV+LD F+TSDDE+EM EGFF AIEELERM REPSDVL
Sbjct: 59   KQQNPNFIAPKPSKLREFRLFTSVELDQFLTSDDEEEMGEGFFEAIEELERMTREPSDVL 118

Query: 704  EEMNDKLSARELQLVLVYFSQEGRDSWCALEVFEWLKKENRVDKETMELMVSIMCTWVKK 883
            EEMND+LSARELQLVLVYFSQEGRDSWCALEVFEWL+KENRVDKETMELMV++MC+WVKK
Sbjct: 119  EEMNDRLSARELQLVLVYFSQEGRDSWCALEVFEWLRKENRVDKETMELMVTLMCSWVKK 178

Query: 884  LIEGKNKXXXXXXXXXXXXXXXXKTSFSMIEKVISLYWEAGEKEGTILFVKEVLRRGISV 1063
            LIEG++                 +  FSM+E VI LYWE GEK   + FVKEVLRRGI+ 
Sbjct: 179  LIEGEHDVGDVVDLLVDMACVGLRPGFSMMENVILLYWEMGEKGRAVSFVKEVLRRGIAC 238

Query: 1064 LDGDRDGNKGGPAGYLAWKMMEEGNYRDAAKLVIHLRECGLKPEVYSYLIAMTAVVKELN 1243
            L+ D +G KGGP GYLAWKMM EGNY +A KLV+ +RE GLKPEVYSYLIAMTAVVKELN
Sbjct: 239  LEDDGEGPKGGPTGYLAWKMMVEGNYMEAVKLVVDIRESGLKPEVYSYLIAMTAVVKELN 298

Query: 1244 EFAKALRKLKGFTKAGLVAELDMDNVGLIENYQQDLLTYGLHLSNWVIEEGGPSLLGVVH 1423
            EFAKALRKLKGF +AGL AELD ++V LIE YQ DLL  G+ LSNWVIEEG  SL GVVH
Sbjct: 299  EFAKALRKLKGFERAGLTAELDEESVELIEKYQSDLLDDGVRLSNWVIEEGITSLNGVVH 358

Query: 1424 ERLLAMYVCAGRGSAAERQLWEMKLVGKEADGDLYDIVLAICASQKESSSVGRLLTRMDV 1603
            ERLLAMY+CAGRG  AERQLW+MKLVGKEADGDLYDIVLAICASQKE  ++ RLLTR++ 
Sbjct: 359  ERLLAMYICAGRGIEAERQLWKMKLVGKEADGDLYDIVLAICASQKEGRAIARLLTRVNF 418

Query: 1604 ASPFRRKKTLSWLLRGYIKGGHFKNAAETVIKMLDSGLYPEFLDRVAVLQGLSRRIQQQG 1783
            +S  R++K+LSWLLRGYIKGGHF NAAETV+KMLD GL PE+LDR AVLQGL +RI+   
Sbjct: 419  SSTLRKRKSLSWLLRGYIKGGHFDNAAETVVKMLDLGLCPEYLDRAAVLQGLRKRIKGPD 478

Query: 1784 NVDTYLTLCKRLSDANLIGPSLVYLHMRKHKLWIIKML 1897
             V+TYL LCK LSD NLIGP L+YL+++K+KLWI+KML
Sbjct: 479  TVETYLKLCKHLSDYNLIGPCLIYLYIKKYKLWIMKML 516


>ref|XP_004168796.1| PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing
            protein At2g30100, chloroplastic-like [Cucumis sativus]
          Length = 501

 Score =  644 bits (1662), Expect = 0.0
 Identities = 326/502 (64%), Positives = 392/502 (78%), Gaps = 3/502 (0%)
 Frame = +2

Query: 401  MASVGGIAAISNLGLTYSSSS---KHYIFLSTHLPNIKSYARISVGESPFSLQKKRSKVQ 571
            M    G   ++  G ++S SS         ST    + S    +  +S FS+ +  +K +
Sbjct: 1    MICAQGFTPLTQFGFSFSLSSPLESQRCGFSTPRLYMVSPISCNYQDSTFSVSRA-AKFR 59

Query: 572  GFGMLKSVQLDVFITSDDEDEMSEGFFAAIEELERMAREPSDVLEEMNDKLSARELQLVL 751
               + KSV+LD FITSDDEDEM +GFF AIEELERM REPSDVLEEMND+LSARE+QLVL
Sbjct: 60   DLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMTREPSDVLEEMNDRLSAREIQLVL 119

Query: 752  VYFSQEGRDSWCALEVFEWLKKENRVDKETMELMVSIMCTWVKKLIEGKNKXXXXXXXXX 931
            VYFSQEGRDSWCALEVFEWL+KENRVDKETMELMVSIMC+W+KKL+EG++          
Sbjct: 120  VYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGRHNVGDVVDLLV 179

Query: 932  XXXXXXXKTSFSMIEKVISLYWEAGEKEGTILFVKEVLRRGISVLDGDRDGNKGGPAGYL 1111
                   K  FSMIEKVISLYWE GEKE  + FVKEVL R ++ +  D +G+KGGP+GYL
Sbjct: 180  DMDCVGLKPHFSMIEKVISLYWEMGEKEKAVFFVKEVLGRNLAFMKDDWEGHKGGPSGYL 239

Query: 1112 AWKMMEEGNYRDAAKLVIHLRECGLKPEVYSYLIAMTAVVKELNEFAKALRKLKGFTKAG 1291
            AWKMM +G+YR A K+V+HLRE GL+PEVYSYLIAMTAVVKELNEFAKALRKLKG+ + G
Sbjct: 240  AWKMMVDGDYRGAVKMVLHLRESGLRPEVYSYLIAMTAVVKELNEFAKALRKLKGYARDG 299

Query: 1292 LVAELDMDNVGLIENYQQDLLTYGLHLSNWVIEEGGPSLLGVVHERLLAMYVCAGRGSAA 1471
             VAELD +NV L+  YQ +LL  G+ LSNWV+EEG  S+ GVVHERLLAMY+CAG+G  A
Sbjct: 300  FVAELDKNNVELVAKYQTELLADGVQLSNWVLEEGSSSIRGVVHERLLAMYICAGQGVEA 359

Query: 1472 ERQLWEMKLVGKEADGDLYDIVLAICASQKESSSVGRLLTRMDVASPFRRKKTLSWLLRG 1651
            ERQLWEMKLVGKEAD DLYDIVLAICASQKE+ ++ RLLTR+++ SP  +KK+L+WLLRG
Sbjct: 360  ERQLWEMKLVGKEADADLYDIVLAICASQKETKAMKRLLTRIEITSPMIKKKSLTWLLRG 419

Query: 1652 YIKGGHFKNAAETVIKMLDSGLYPEFLDRVAVLQGLSRRIQQQGNVDTYLTLCKRLSDAN 1831
            YIKGGHF++AA T++KM++ G  PE+LDRVAVLQGL + I++  +V TYL LCK LSDAN
Sbjct: 420  YIKGGHFRDAAGTLVKMINLGFLPEYLDRVAVLQGLXKEIREPESVHTYLDLCKCLSDAN 479

Query: 1832 LIGPSLVYLHMRKHKLWIIKML 1897
            LIGPSLVYLH++KHKLWIIKML
Sbjct: 480  LIGPSLVYLHLQKHKLWIIKML 501


>ref|XP_004239038.1| PREDICTED: pentatricopeptide repeat-containing protein At2g30100,
            chloroplastic-like [Solanum lycopersicum]
          Length = 503

 Score =  635 bits (1639), Expect = e-179
 Identities = 317/444 (71%), Positives = 367/444 (82%), Gaps = 2/444 (0%)
 Frame = +2

Query: 572  GFGMLKSVQLDVFITSDDED--EMSEGFFAAIEELERMAREPSDVLEEMNDKLSARELQL 745
            GF +  SV+L  F+TSD E+  EMS+ FF AIEELERM REPSDVLEEMN++LS RELQL
Sbjct: 60   GFKLFSSVELGSFVTSDGEEKNEMSDCFFEAIEELERMTREPSDVLEEMNERLSDRELQL 119

Query: 746  VLVYFSQEGRDSWCALEVFEWLKKENRVDKETMELMVSIMCTWVKKLIEGKNKXXXXXXX 925
            VLVYF+QEGRDSWCALEVFEWL+KENRVDKETMELMVSIMC WV+KLI  K++       
Sbjct: 120  VLVYFAQEGRDSWCALEVFEWLRKENRVDKETMELMVSIMCGWVQKLIGSKSEAGDVVDL 179

Query: 926  XXXXXXXXXKTSFSMIEKVISLYWEAGEKEGTILFVKEVLRRGISVLDGDRDGNKGGPAG 1105
                       SFSM+EKVISLYW+AGE+EG + FVKEVLRR I+  DG+ DG+K GPAG
Sbjct: 180  LVDMDCVGLNPSFSMVEKVISLYWDAGEREGAVSFVKEVLRRQIAYSDGNVDGHKAGPAG 239

Query: 1106 YLAWKMMEEGNYRDAAKLVIHLRECGLKPEVYSYLIAMTAVVKELNEFAKALRKLKGFTK 1285
            YLAWKMMEEGNY+DA KLVI +R+ GLKPE+YSYLIAMTAVVKELNEF KALRKLKGF +
Sbjct: 240  YLAWKMMEEGNYKDAVKLVIDIRDSGLKPELYSYLIAMTAVVKELNEFGKALRKLKGFAR 299

Query: 1286 AGLVAELDMDNVGLIENYQQDLLTYGLHLSNWVIEEGGPSLLGVVHERLLAMYVCAGRGS 1465
             GLVAELD++N+ LIE YQ DLL  G+ LS+W+I+EGGPSL GVVHERLLAMYVCAGRG 
Sbjct: 300  TGLVAELDLENLRLIEEYQADLLAEGVQLSDWLIQEGGPSLFGVVHERLLAMYVCAGRGI 359

Query: 1466 AAERQLWEMKLVGKEADGDLYDIVLAICASQKESSSVGRLLTRMDVASPFRRKKTLSWLL 1645
             AER LW+MK+ GKE  GDL+DIVLAICASQKE   + RLLT M+ +S  ++KKTLSWLL
Sbjct: 360  EAERHLWQMKISGKEVSGDLHDIVLAICASQKELGPISRLLTGMEASSSLQKKKTLSWLL 419

Query: 1646 RGYIKGGHFKNAAETVIKMLDSGLYPEFLDRVAVLQGLSRRIQQQGNVDTYLTLCKRLSD 1825
            RGYIKGGH +NAAETVIKMLD GLYP+FLDR AVLQ L RRIQQ GN++TYL LCK LSD
Sbjct: 420  RGYIKGGHLENAAETVIKMLDLGLYPDFLDRAAVLQRLRRRIQQSGNLETYLNLCKHLSD 479

Query: 1826 ANLIGPSLVYLHMRKHKLWIIKML 1897
            A+LIGP LVYL+++K++LWII+ L
Sbjct: 480  ASLIGPCLVYLYIKKYRLWIIRTL 503


>ref|XP_006348674.1| PREDICTED: pentatricopeptide repeat-containing protein At2g30100,
            chloroplastic-like [Solanum tuberosum]
          Length = 503

 Score =  634 bits (1636), Expect = e-179
 Identities = 331/503 (65%), Positives = 389/503 (77%), Gaps = 4/503 (0%)
 Frame = +2

Query: 401  MASVGGIAAISNLGLTYSSSSKHYIF--LSTHLPNIKSYARISVGESPFSLQKKRSKVQG 574
            MA+V  IA+++ LGL+     K        T L    S+    VG S  +      +  G
Sbjct: 1    MATVNEIASLTYLGLSKVVFPKRCRLGIPQTWLKWRSSWVLGGVGCSSRNPSFVNPRRNG 60

Query: 575  FGMLKSVQLDVFITSDDED--EMSEGFFAAIEELERMAREPSDVLEEMNDKLSARELQLV 748
            F +  SV+L  F+TSDDE+  EMS+ FF AIEELERM REPSDVLEEMN++LS RELQLV
Sbjct: 61   FKLFNSVELGSFVTSDDEEKNEMSDCFFEAIEELERMTREPSDVLEEMNERLSDRELQLV 120

Query: 749  LVYFSQEGRDSWCALEVFEWLKKENRVDKETMELMVSIMCTWVKKLIEGKNKXXXXXXXX 928
            LVYF+QEGRDSWCALEVFEWL+KENRVDKETMELMVSIMC WV+KLI  K++        
Sbjct: 121  LVYFAQEGRDSWCALEVFEWLRKENRVDKETMELMVSIMCGWVQKLIGSKSEAGDVVDLL 180

Query: 929  XXXXXXXXKTSFSMIEKVISLYWEAGEKEGTILFVKEVLRRGISVLDGDRDGNKGGPAGY 1108
                      SFSM+EKVISLYW+AGE+EG + FVKEVLRR I+  DG+ DG+K GPAGY
Sbjct: 181  VDMDCVGLNPSFSMVEKVISLYWDAGEREGAVSFVKEVLRRQIAYSDGNVDGHKAGPAGY 240

Query: 1109 LAWKMMEEGNYRDAAKLVIHLRECGLKPEVYSYLIAMTAVVKELNEFAKALRKLKGFTKA 1288
            LAWKMME GNY+DA KLVI +R+ GLKPE+YSYLIAMTAVVKELNEF KALRKLKGF + 
Sbjct: 241  LAWKMMEVGNYKDAVKLVIDIRDSGLKPELYSYLIAMTAVVKELNEFGKALRKLKGFART 300

Query: 1289 GLVAELDMDNVGLIENYQQDLLTYGLHLSNWVIEEGGPSLLGVVHERLLAMYVCAGRGSA 1468
            GLVAELD++N+ LIE YQ DLL  G+ LS+W+I+EGGPSL GVVHERLLAMYVCAGRG  
Sbjct: 301  GLVAELDLENLRLIEEYQADLLAEGVQLSDWLIQEGGPSLFGVVHERLLAMYVCAGRGIE 360

Query: 1469 AERQLWEMKLVGKEADGDLYDIVLAICASQKESSSVGRLLTRMDVASPFRRKKTLSWLLR 1648
            AER LW+MKL GK+  GDL DIVLAICASQKE   + RLLT M+ +S  ++KKTLSWLLR
Sbjct: 361  AERHLWQMKLSGKKVTGDLQDIVLAICASQKELGPISRLLTGMEASSSLQKKKTLSWLLR 420

Query: 1649 GYIKGGHFKNAAETVIKMLDSGLYPEFLDRVAVLQGLSRRIQQQGNVDTYLTLCKRLSDA 1828
            GYIKGGH +NAAETVIKMLD GLYP+FLDR AVLQ L RRIQQ G+++TYL LCK LSDA
Sbjct: 421  GYIKGGHLENAAETVIKMLDLGLYPDFLDRAAVLQRLRRRIQQSGSLETYLNLCKHLSDA 480

Query: 1829 NLIGPSLVYLHMRKHKLWIIKML 1897
            +LIGP LVYL+++K++LWII+ L
Sbjct: 481  SLIGPCLVYLYIKKYRLWIIRTL 503


>ref|XP_004143220.1| PREDICTED: uncharacterized protein LOC101207176 [Cucumis sativus]
          Length = 1290

 Score =  613 bits (1581), Expect = e-173
 Identities = 312/486 (64%), Positives = 376/486 (77%), Gaps = 3/486 (0%)
 Frame = +2

Query: 401  MASVGGIAAISNLGLTYSSSS---KHYIFLSTHLPNIKSYARISVGESPFSLQKKRSKVQ 571
            M    G   ++  G ++S SS         ST    + S    +  +S FS+ +  +K +
Sbjct: 1    MICAQGFTPLTQFGFSFSLSSPLESQRCGFSTPRLYMVSPISCNYQDSTFSVSRA-AKFR 59

Query: 572  GFGMLKSVQLDVFITSDDEDEMSEGFFAAIEELERMAREPSDVLEEMNDKLSARELQLVL 751
               + KSV+LD FITSDDEDEM +GFF AIEELERM REPSDVLEEMND+LSARE+QLVL
Sbjct: 60   DLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMTREPSDVLEEMNDRLSAREIQLVL 119

Query: 752  VYFSQEGRDSWCALEVFEWLKKENRVDKETMELMVSIMCTWVKKLIEGKNKXXXXXXXXX 931
            VYFSQEGRDSWCALEVFEWL+KENRVDKETMELMVSIMC+W+KKL+EG++          
Sbjct: 120  VYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGRHNVGDVVDLLV 179

Query: 932  XXXXXXXKTSFSMIEKVISLYWEAGEKEGTILFVKEVLRRGISVLDGDRDGNKGGPAGYL 1111
                   K  FSMIEKVISLYWE GEKE  + FVKEVL R ++ +  D +G+KGGP+GYL
Sbjct: 180  DMDCVGLKPHFSMIEKVISLYWEMGEKEKAVFFVKEVLGRNLAFMKDDWEGHKGGPSGYL 239

Query: 1112 AWKMMEEGNYRDAAKLVIHLRECGLKPEVYSYLIAMTAVVKELNEFAKALRKLKGFTKAG 1291
            AWKMM +G+YR A K+V+HLRE GL+PEVYSYLIAMTAVVKELNEFAKALRKLKG+ + G
Sbjct: 240  AWKMMVDGDYRGAVKMVLHLRESGLRPEVYSYLIAMTAVVKELNEFAKALRKLKGYARDG 299

Query: 1292 LVAELDMDNVGLIENYQQDLLTYGLHLSNWVIEEGGPSLLGVVHERLLAMYVCAGRGSAA 1471
             VAELD +NV L+  YQ +LL  G+ LSNWV+EEG  S+ GVVHERLLAMY+CAG+G  A
Sbjct: 300  FVAELDKNNVELVAKYQTELLADGVQLSNWVLEEGSSSIRGVVHERLLAMYICAGQGVEA 359

Query: 1472 ERQLWEMKLVGKEADGDLYDIVLAICASQKESSSVGRLLTRMDVASPFRRKKTLSWLLRG 1651
            ERQLWEMKLVGKEAD DLYDIVLAICASQKE+ ++ RLLTR+++ SP  +KK+L+WLLRG
Sbjct: 360  ERQLWEMKLVGKEADADLYDIVLAICASQKETKAMKRLLTRIEITSPMIKKKSLTWLLRG 419

Query: 1652 YIKGGHFKNAAETVIKMLDSGLYPEFLDRVAVLQGLSRRIQQQGNVDTYLTLCKRLSDAN 1831
            YIKGGHF++AA T++KM++ G  PE+LDRVAVLQGL + I++  +V TYL LCK LSDAN
Sbjct: 420  YIKGGHFRDAAGTLVKMINLGFLPEYLDRVAVLQGLRKEIREPESVHTYLDLCKCLSDAN 479

Query: 1832 LIGPSL 1849
            LIGPSL
Sbjct: 480  LIGPSL 485


>ref|XP_004500294.1| PREDICTED: pentatricopeptide repeat-containing protein At2g30100,
            chloroplastic-like [Cicer arietinum]
          Length = 508

 Score =  605 bits (1559), Expect = e-170
 Identities = 315/514 (61%), Positives = 379/514 (73%), Gaps = 15/514 (2%)
 Frame = +2

Query: 401  MASVGGIAAISNLGLTYSS----SSKHYIFLSTHLPNIKSYARISVGESPFSLQKKR--- 559
            MAS+ G A    LG  +SS      KH +      P+ K    +   +  F  Q      
Sbjct: 1    MASLHGFAPTLKLGFAFSSLFSPKQKHPLVF----PSSKRGFSLKFCDGSFKFQNPSFPP 56

Query: 560  SKVQGFGMLKSVQLDVFITSDDEDE-------MSEGFFAAIEELERMAREPSDVLEEMND 718
            +K   +   KSV+LD F+TSDDE+E       M +GF  AIEELERM REPSDVLEEMND
Sbjct: 57   TKPNSYMRKKSVELDQFVTSDDEEEEEEEEEEMGDGFLEAIEELERMTREPSDVLEEMND 116

Query: 719  KLSARELQLVLVYFSQEGRDSWCALEVFEWLKKENRVDKETMELMVSIMCTWVKKLIEGK 898
            +LSARELQLVLVYFSQEGRDSWCALEVF+WL+KENRVDKETMELMV+IMC WVKKLI  K
Sbjct: 117  RLSARELQLVLVYFSQEGRDSWCALEVFDWLRKENRVDKETMELMVAIMCGWVKKLIMEK 176

Query: 899  NKXXXXXXXXXXXXXXXXKTSFSMIEKVISLYWEAGEKEGTILFVKEVLRRGISVLDGDR 1078
            +                 +  FSMIEKVISLYWE GEK+  +LFV+EVLRRGIS    + 
Sbjct: 177  HGVDDVIDLLVNMNCVGLRPGFSMIEKVISLYWEMGEKDDAVLFVEEVLRRGIS--SNED 234

Query: 1079 DGNKGGPAGYLAWKMMEEGNYRDAAKLVIHLRECGLKPEVYSYLIAMTAVVKELNEFAKA 1258
            D  KGGP GYLAWKMM EG+YR A +LV   RE GLKP++YSYL+AMTAVVKELNE AKA
Sbjct: 235  DPEKGGPTGYLAWKMMVEGDYRGAVRLVTRFREAGLKPDIYSYLVAMTAVVKELNELAKA 294

Query: 1259 LRKLKGFTKAGLVAELDMDNVGLIENYQQDLLTYGLHLSNWVIEEGGPSLL-GVVHERLL 1435
            LRKLK F++AGL+ E D ++V L E YQ DLL  G  LS WVI++G PS + G++HERLL
Sbjct: 295  LRKLKSFSRAGLITEFDREDVELAEKYQSDLLADGARLSKWVIQDGSPSSIHGIIHERLL 354

Query: 1436 AMYVCAGRGSAAERQLWEMKLVGKEADGDLYDIVLAICASQKESSSVGRLLTRMDVASPF 1615
            AMY+CAGRG  AERQLWEMKL+GKEA G LYD+VLAICASQKE+++  RL+ RM+VAS  
Sbjct: 355  AMYICAGRGIEAERQLWEMKLLGKEAVGGLYDMVLAICASQKEAAATARLMIRMEVASSP 414

Query: 1616 RRKKTLSWLLRGYIKGGHFKNAAETVIKMLDSGLYPEFLDRVAVLQGLSRRIQQQGNVDT 1795
            ++KK+LSWLLRGYIKGGHF  AAETV+KML+ G YP++LDRVAV+QGL +RIQQ GN+DT
Sbjct: 415  QKKKSLSWLLRGYIKGGHFNEAAETVMKMLELGFYPDYLDRVAVMQGLRKRIQQYGNLDT 474

Query: 1796 YLTLCKRLSDANLIGPSLVYLHMRKHKLWIIKML 1897
            Y+ LCK L +ANLIG  + YL++RK+KLW++KM+
Sbjct: 475  YIKLCKSLYEANLIGACVCYLYIRKYKLWVVKMI 508


>gb|EPS70238.1| hypothetical protein M569_04522 [Genlisea aurea]
          Length = 504

 Score =  590 bits (1521), Expect = e-166
 Identities = 318/508 (62%), Positives = 382/508 (75%), Gaps = 9/508 (1%)
 Frame = +2

Query: 401  MASVGGIAAISNLGLTYSSSSKHYIFLSTHLPNIKSYARISVGESP---FSLQKKRSKVQ 571
            MA VGG +AI++L   Y S S      S     I++  R S  +S    F + K+R    
Sbjct: 1    MAVVGGFSAINDLSSRYYSPSPCIFLESRRKLVIRTSIRDSDRKSKPPGFRIGKRRP--- 57

Query: 572  GFGMLKSVQLDVFITSDDEDEMSEGFFAAIEELERMAREPSDVLEEMNDKLSARELQLVL 751
            G   L+SV L   ITSDDEDEMSEGFF AIEELERMAREPSDVLEEMNDKLS RELQLVL
Sbjct: 58   GVWSLESVHLGTIITSDDEDEMSEGFFEAIEELERMAREPSDVLEEMNDKLSNRELQLVL 117

Query: 752  VYFSQEGRDSWCALEVFEWLKKENRVDKETMELMVSIMCTWVKKLIEGKNKXXXXXXXXX 931
            VYFSQEGRDSW  LEVFEWLKKEN+VD+ETMELMVSIMC W+KKLIE KNK         
Sbjct: 118  VYFSQEGRDSWFTLEVFEWLKKENKVDQETMELMVSIMCNWMKKLIEAKNKVQDVVDLLV 177

Query: 932  XXXXXXXKTSFSMIEKVISLYWEAGEKEGTILFVKEVLRRGISVLDGDRDGNKGGPAGYL 1111
                   + +FSMIEKVISLYWEAGEK+ TI FVKEVLRRGIS    D +G+K GP GYL
Sbjct: 178  DMDCVGLEANFSMIEKVISLYWEAGEKQETIAFVKEVLRRGISSCS-DEEGDKTGPVGYL 236

Query: 1112 AWKMMEEGNYRDAAKLVIHLRECGLKPEVYSYLIAMTAVVKELNEFAKALRKLKGFTKAG 1291
            AWKMMEEG+ RDAAKLV+H R+CGLKP++YSYLIAMT +++ELNEFAK++R L  +TK+G
Sbjct: 237  AWKMMEEGSCRDAAKLVLHFRDCGLKPDIYSYLIAMTGILRELNEFAKSMRSLNRYTKSG 296

Query: 1292 LVAELDMDNVGLIENYQQDLLTYGLHLSNWVIEEGG-----PS-LLGVVHERLLAMYVCA 1453
            LV+ELD ++V L E YQQ+LL  GL LS + +EE G     PS L  VV +RLLAM+V A
Sbjct: 297  LVSELDANSVWLAEKYQQELLDDGLLLSAFALEERGGGGDPPSPLREVVRKRLLAMHVVA 356

Query: 1454 GRGSAAERQLWEMKLVGKEADGDLYDIVLAICASQKESSSVGRLLTRMDVASPFRRKKTL 1633
            GRG+ AER L E      + D DL ++VLAICASQKES SV RLLTRM+ +SP  R+KTL
Sbjct: 357  GRGTEAERLLSEKNTGSDDGDDDLCNVVLAICASQKESVSVSRLLTRMEGSSPDSRRKTL 416

Query: 1634 SWLLRGYIKGGHFKNAAETVIKMLDSGLYPEFLDRVAVLQGLSRRIQQQGNVDTYLTLCK 1813
            +WLLRGY+KGGHF+NA ET+++M+++G+ PEF DRVAVLQGL RRI++ G VDTYL +C+
Sbjct: 417  TWLLRGYVKGGHFRNAGETLVRMVEAGVLPEFTDRVAVLQGLGRRIRRSGYVDTYLDVCR 476

Query: 1814 RLSDANLIGPSLVYLHMRKHKLWIIKML 1897
             L+D +LI PSLVY+H+RKHKLWII ++
Sbjct: 477  CLADVDLISPSLVYVHLRKHKLWIISLV 504


>ref|XP_006293981.1| hypothetical protein CARUB_v10022972mg [Capsella rubella]
            gi|482562689|gb|EOA26879.1| hypothetical protein
            CARUB_v10022972mg [Capsella rubella]
          Length = 505

 Score =  562 bits (1448), Expect = e-157
 Identities = 293/511 (57%), Positives = 372/511 (72%), Gaps = 12/511 (2%)
 Frame = +2

Query: 401  MASVGGIAAISNLGLTYSSSSKHYIFLSTHLPNIKSYARISVGESPFSLQKKRSKVQGFG 580
            MA   G A+++ L L +S S        T  P +KS +RIS       L     K +   
Sbjct: 1    MAYARGFASLTQLNLIFSPSISLRRVYRT--PGVKSVSRISCN---LKLNYSAGKFRDLK 55

Query: 581  MLKSVQLDVFITSDDE------DEMSEGFFAAIEELERMAREPSDVLEEMNDKLSARELQ 742
            + +SV+LD FITS++E      DE+ EGFF AIEELERM REPSDVLEEMN +LS+RELQ
Sbjct: 56   LSRSVELDQFITSEEEGGEEAEDEIGEGFFEAIEELERMTREPSDVLEEMNHRLSSRELQ 115

Query: 743  LVLVYFSQEGRDSWCALEVFEWLKKENRVDKETMELMVSIMCTWVKKLIEGKNKXXXXXX 922
            L+LVYF+QEGRDSWC LEVFEWLKKENRVD++ +ELMVSIMC WVKKLI+ +        
Sbjct: 116  LMLVYFAQEGRDSWCTLEVFEWLKKENRVDEQMVELMVSIMCGWVKKLIQEECGADQVFD 175

Query: 923  XXXXXXXXXXKTSFSMIEKVISLYWEAGEKEGTILFVKEVLRR----GISVLDGDRDGNK 1090
                      K  FSM+EKVI+LY E G+KE  +LFVKEVLRR    G SV+ G  +G K
Sbjct: 176  LLIEMDCVGLKPGFSMMEKVIALYCEMGKKESAVLFVKEVLRRRDGFGYSVVGGS-EGRK 234

Query: 1091 GGPAGYLAWKMMEEGNYRDAAKLVIHLRECGLKPEVYSYLIAMTAVVKELNEFAKALRKL 1270
            GGP GYLAWK+M +G+Y+ A  LV+ LR  GL PE YSYLIAMTA+VKELN   K LR+L
Sbjct: 235  GGPVGYLAWKLMVDGDYKKAVDLVVELRLSGLMPEAYSYLIAMTAIVKELNSLGKTLREL 294

Query: 1271 KGFTKAGLVAELDMDNVGLIENYQQDLLTYGLHLSNWVIEEGGP--SLLGVVHERLLAMY 1444
            K FT+AG V E+D  +  LIE YQ + L+ GL L+ W +EEG    S++GVVHERLLAMY
Sbjct: 295  KRFTRAGYVTEIDDHDRVLIEKYQSETLSRGLQLATWAVEEGQQEDSIIGVVHERLLAMY 354

Query: 1445 VCAGRGSAAERQLWEMKLVGKEADGDLYDIVLAICASQKESSSVGRLLTRMDVASPFRRK 1624
            +CAGRG  AE+QLW+MKL G+E + +L+DIV+AICASQKE ++V RLLTR++     R+K
Sbjct: 355  ICAGRGPEAEKQLWKMKLAGREPEAELHDIVMAICASQKEVNAVSRLLTRVEFMESKRKK 414

Query: 1625 KTLSWLLRGYIKGGHFKNAAETVIKMLDSGLYPEFLDRVAVLQGLSRRIQQQGNVDTYLT 1804
            KTLSWLLRGY+KGGHF+ AAET+I M+DSGL+PE++DRVAV+QG++R+IQ+  +++ Y+ 
Sbjct: 415  KTLSWLLRGYVKGGHFEEAAETLITMIDSGLHPEYIDRVAVMQGMTRKIQRPRDIEAYMG 474

Query: 1805 LCKRLSDANLIGPSLVYLHMRKHKLWIIKML 1897
            LCKRL DA L+GP LVY++M K+KLWI+KM+
Sbjct: 475  LCKRLFDAGLVGPCLVYMYMDKYKLWIVKMM 505


>ref|XP_006293980.1| hypothetical protein CARUB_v10022972mg [Capsella rubella]
            gi|482562688|gb|EOA26878.1| hypothetical protein
            CARUB_v10022972mg [Capsella rubella]
          Length = 532

 Score =  562 bits (1448), Expect = e-157
 Identities = 293/511 (57%), Positives = 372/511 (72%), Gaps = 12/511 (2%)
 Frame = +2

Query: 401  MASVGGIAAISNLGLTYSSSSKHYIFLSTHLPNIKSYARISVGESPFSLQKKRSKVQGFG 580
            MA   G A+++ L L +S S        T  P +KS +RIS       L     K +   
Sbjct: 28   MAYARGFASLTQLNLIFSPSISLRRVYRT--PGVKSVSRISCN---LKLNYSAGKFRDLK 82

Query: 581  MLKSVQLDVFITSDDE------DEMSEGFFAAIEELERMAREPSDVLEEMNDKLSARELQ 742
            + +SV+LD FITS++E      DE+ EGFF AIEELERM REPSDVLEEMN +LS+RELQ
Sbjct: 83   LSRSVELDQFITSEEEGGEEAEDEIGEGFFEAIEELERMTREPSDVLEEMNHRLSSRELQ 142

Query: 743  LVLVYFSQEGRDSWCALEVFEWLKKENRVDKETMELMVSIMCTWVKKLIEGKNKXXXXXX 922
            L+LVYF+QEGRDSWC LEVFEWLKKENRVD++ +ELMVSIMC WVKKLI+ +        
Sbjct: 143  LMLVYFAQEGRDSWCTLEVFEWLKKENRVDEQMVELMVSIMCGWVKKLIQEECGADQVFD 202

Query: 923  XXXXXXXXXXKTSFSMIEKVISLYWEAGEKEGTILFVKEVLRR----GISVLDGDRDGNK 1090
                      K  FSM+EKVI+LY E G+KE  +LFVKEVLRR    G SV+ G  +G K
Sbjct: 203  LLIEMDCVGLKPGFSMMEKVIALYCEMGKKESAVLFVKEVLRRRDGFGYSVVGGS-EGRK 261

Query: 1091 GGPAGYLAWKMMEEGNYRDAAKLVIHLRECGLKPEVYSYLIAMTAVVKELNEFAKALRKL 1270
            GGP GYLAWK+M +G+Y+ A  LV+ LR  GL PE YSYLIAMTA+VKELN   K LR+L
Sbjct: 262  GGPVGYLAWKLMVDGDYKKAVDLVVELRLSGLMPEAYSYLIAMTAIVKELNSLGKTLREL 321

Query: 1271 KGFTKAGLVAELDMDNVGLIENYQQDLLTYGLHLSNWVIEEGGP--SLLGVVHERLLAMY 1444
            K FT+AG V E+D  +  LIE YQ + L+ GL L+ W +EEG    S++GVVHERLLAMY
Sbjct: 322  KRFTRAGYVTEIDDHDRVLIEKYQSETLSRGLQLATWAVEEGQQEDSIIGVVHERLLAMY 381

Query: 1445 VCAGRGSAAERQLWEMKLVGKEADGDLYDIVLAICASQKESSSVGRLLTRMDVASPFRRK 1624
            +CAGRG  AE+QLW+MKL G+E + +L+DIV+AICASQKE ++V RLLTR++     R+K
Sbjct: 382  ICAGRGPEAEKQLWKMKLAGREPEAELHDIVMAICASQKEVNAVSRLLTRVEFMESKRKK 441

Query: 1625 KTLSWLLRGYIKGGHFKNAAETVIKMLDSGLYPEFLDRVAVLQGLSRRIQQQGNVDTYLT 1804
            KTLSWLLRGY+KGGHF+ AAET+I M+DSGL+PE++DRVAV+QG++R+IQ+  +++ Y+ 
Sbjct: 442  KTLSWLLRGYVKGGHFEEAAETLITMIDSGLHPEYIDRVAVMQGMTRKIQRPRDIEAYMG 501

Query: 1805 LCKRLSDANLIGPSLVYLHMRKHKLWIIKML 1897
            LCKRL DA L+GP LVY++M K+KLWI+KM+
Sbjct: 502  LCKRLFDAGLVGPCLVYMYMDKYKLWIVKMM 532

