BLASTX nr result

ID: Catharanthus22_contig00002524 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00002524
         (2384 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EMJ06247.1| hypothetical protein PRUPE_ppa004609mg [Prunus pe...   729   0.0  
ref|XP_002278434.1| PREDICTED: pentatricopeptide repeat-containi...   711   0.0  
ref|XP_002313976.1| ubiquitin family protein [Populus trichocarp...   705   0.0  
ref|XP_002521193.1| conserved hypothetical protein [Ricinus comm...   705   0.0  
gb|EOY34562.1| Pentatricopeptide repeat-containing protein [Theo...   692   0.0  
ref|XP_004296059.1| PREDICTED: uncharacterized protein LOC101292...   691   0.0  
gb|EXB37964.1| hypothetical protein L484_011688 [Morus notabilis]     687   0.0  
ref|XP_004239038.1| PREDICTED: pentatricopeptide repeat-containi...   684   0.0  
ref|XP_006348674.1| PREDICTED: pentatricopeptide repeat-containi...   679   0.0  
gb|ESW18802.1| hypothetical protein PHAVU_006G071400g [Phaseolus...   677   0.0  
ref|XP_003551233.1| PREDICTED: pentatricopeptide repeat-containi...   674   0.0  
ref|XP_006425116.1| hypothetical protein CICLE_v10028251mg [Citr...   668   0.0  
ref|XP_006488563.1| PREDICTED: pentatricopeptide repeat-containi...   667   0.0  
ref|XP_003538312.1| PREDICTED: pentatricopeptide repeat-containi...   666   0.0  
ref|XP_004168796.1| PREDICTED: LOW QUALITY PROTEIN: pentatricope...   647   0.0  
ref|XP_004143220.1| PREDICTED: uncharacterized protein LOC101207...   645   0.0  
ref|XP_004500294.1| PREDICTED: pentatricopeptide repeat-containi...   615   e-173
ref|NP_180571.3| pentatricopeptide repeat-containing protein [Ar...   582   e-163
ref|XP_006410063.1| hypothetical protein EUTSA_v10016546mg [Eutr...   575   e-161
ref|XP_006293981.1| hypothetical protein CARUB_v10022972mg [Caps...   569   e-159

>gb|EMJ06247.1| hypothetical protein PRUPE_ppa004609mg [Prunus persica]
          Length = 500

 Score =  729 bits (1881), Expect = 0.0
 Identities = 375/490 (76%), Positives = 421/490 (85%), Gaps = 2/490 (0%)
 Frame = -2

Query: 1867 MVALDGIASFAHLGFSQTCSSIRRSFLLSKEFRIQSCPRVLSIICA-LKPDFVVERRSKS 1691
            M +  G+AS  H  F+      R+ F+  + F  QSC RV   IC   KP+F+V + SK 
Sbjct: 1    MASAQGLASLTHSLFAVK----RQRFMGLRGFSAQSCGRVFPRICKHQKPNFIVAKSSKV 56

Query: 1690 REFRLFSSVELDTFVTSDDEGEMSEVFFEAIEELERMTREPSDVLEEMNEKLTARELQLV 1511
            R+FRLF SVELD F+TSDDE EM E FFEAIEELERMTREPSDVLEEMN++L+ARELQLV
Sbjct: 57   RDFRLFKSVELDQFLTSDDEDEMGEGFFEAIEELERMTREPSDVLEEMNDRLSARELQLV 116

Query: 1510 LVYFSQEGRDSWCALEVFEWLRKENRVDKETMELMVSLMCGWVKKLIEGKSEIGEVVDLL 1331
            LVYFSQEGRDSWCALEVFEWLRKENRVDKETM+LMVS+MC WVKKLI+ + +IG+VVDLL
Sbjct: 117  LVYFSQEGRDSWCALEVFEWLRKENRVDKETMDLMVSIMCSWVKKLIQREHDIGDVVDLL 176

Query: 1330 VDMDCVGLRPSFSMIEKVISLYWEAGEKEGAVAFVKEVLRRGIAYSN-DDRGQNKGGPTG 1154
            VDMDCVGL+PSFSM+EKVISLYWE GEKE AV FVKEVL+RGI YS  DD   +KGGPTG
Sbjct: 177  VDMDCVGLKPSFSMMEKVISLYWEMGEKEKAVLFVKEVLKRGIVYSEEDDTDGHKGGPTG 236

Query: 1153 YLAWKMMVEGNYKDAVKLVIQIRECGLKPEVYSYLIAMTAVVKELNEVAKALRKLKGFAK 974
            YLAWKMMVEGNY+D+VKLVI +RE GLKPEVYSYLIAMTAVVKELNE+AKALRKLKGF +
Sbjct: 237  YLAWKMMVEGNYRDSVKLVIHLRESGLKPEVYSYLIAMTAVVKELNELAKALRKLKGFTR 296

Query: 973  AGLVAELDLENTLLIEKYQAELLDYGVSLSNCVIKEESPSLYGVVHERLLAMYICAGRGV 794
            AGL+AE D EN  LIEKYQ++LL  GV LSN VI+E S SL+GVVHERLLAMYIC+G G+
Sbjct: 297  AGLIAEFDTENVGLIEKYQSDLLSDGVQLSNWVIQEGSSSLHGVVHERLLAMYICSGHGL 356

Query: 793  EAERQLWEMKLAGKEADGDLYDIVLAICASQKEAGAIGRLLTRMEVTNSLRRKKTLSWLL 614
            EAERQLWEMKL GKEAD DLYDIVLAICASQKEA AIGRLLTR EVT+SLR+KK+LSWLL
Sbjct: 357  EAERQLWEMKLVGKEADADLYDIVLAICASQKEASAIGRLLTRTEVTSSLRKKKSLSWLL 416

Query: 613  RGYIKGGHFDNAAETVMKMLDSGLFPEFLDRAAVLQGLRRRIQQSGILDIYLELCKRLSD 434
            RGYIKGGHFD+AAETV+KMLD GL PEFLDRAAVLQGLR+ IQ+SG +D YL+LCKRLSD
Sbjct: 417  RGYIKGGHFDDAAETVIKMLDLGLCPEFLDRAAVLQGLRKSIQESGGVDTYLKLCKRLSD 476

Query: 433  ANLIGPCLVY 404
            A+LIGPCLVY
Sbjct: 477  ASLIGPCLVY 486


>ref|XP_002278434.1| PREDICTED: pentatricopeptide repeat-containing protein At2g30100,
            chloroplastic [Vitis vinifera]
          Length = 511

 Score =  711 bits (1834), Expect = 0.0
 Identities = 369/487 (75%), Positives = 413/487 (84%), Gaps = 5/487 (1%)
 Frame = -2

Query: 1849 IASFAHLGFSQTCS-SIRRSFLL----SKEFRIQSCPRVLSIICALKPDFVVERRSKSRE 1685
            + S   LGF+ + S SI+R  L+    S+ F  + C R  +I     P FVV +R K RE
Sbjct: 11   LMSPTELGFTLSSSFSIQRPRLIVPKFSRSFLGEYCSRATTICNHQNPRFVVPKRDKIRE 70

Query: 1684 FRLFSSVELDTFVTSDDEGEMSEVFFEAIEELERMTREPSDVLEEMNEKLTARELQLVLV 1505
            FRLF SVELD F+TSDDE EMSE FFEAIEELERMTREPSDVLEEMN++L+ARELQLVLV
Sbjct: 71   FRLFKSVELDQFLTSDDEDEMSEGFFEAIEELERMTREPSDVLEEMNDRLSARELQLVLV 130

Query: 1504 YFSQEGRDSWCALEVFEWLRKENRVDKETMELMVSLMCGWVKKLIEGKSEIGEVVDLLVD 1325
            YFSQEGRDSWCALEVFEWLRKENRVDKETMELMVS+MC WVKKLIEG+ ++G+VVDLLVD
Sbjct: 131  YFSQEGRDSWCALEVFEWLRKENRVDKETMELMVSIMCSWVKKLIEGEHDVGDVVDLLVD 190

Query: 1324 MDCVGLRPSFSMIEKVISLYWEAGEKEGAVAFVKEVLRRGIAYSNDDRGQNKGGPTGYLA 1145
            MDCVGL+P FSMIEKVISLYWE  EKE AV FVKEVLRR IAYS DD   +KGGPTGYLA
Sbjct: 191  MDCVGLKPGFSMIEKVISLYWEMEEKEKAVLFVKEVLRREIAYSEDDGDGHKGGPTGYLA 250

Query: 1144 WKMMVEGNYKDAVKLVIQIRECGLKPEVYSYLIAMTAVVKELNEVAKALRKLKGFAKAGL 965
            WKMM EGNY+ AVKLVI +RE GLKPEVYSYLIAMTAVVKELNE AKALRKLKGF K+GL
Sbjct: 251  WKMMAEGNYRGAVKLVIHLRESGLKPEVYSYLIAMTAVVKELNEFAKALRKLKGFTKSGL 310

Query: 964  VAELDLENTLLIEKYQAELLDYGVSLSNCVIKEESPSLYGVVHERLLAMYICAGRGVEAE 785
            +AELD EN  LIEKYQ++LL  GV LS+ VI+E    L+GVV+ERLLAMYICAGRG+EAE
Sbjct: 311  IAELDAENVELIEKYQSDLLADGVRLSSWVIQEGRSPLHGVVYERLLAMYICAGRGLEAE 370

Query: 784  RQLWEMKLAGKEADGDLYDIVLAICASQKEAGAIGRLLTRMEVTNSLRRKKTLSWLLRGY 605
            RQLWEMKL GKEAD +LYDIVLAICAS+KEA AI RLLT MEVT+S+RRKKTLSWLLRGY
Sbjct: 371  RQLWEMKLVGKEADRELYDIVLAICASKKEASAISRLLTGMEVTSSIRRKKTLSWLLRGY 430

Query: 604  IKGGHFDNAAETVMKMLDSGLFPEFLDRAAVLQGLRRRIQQSGILDIYLELCKRLSDANL 425
            IKG HFD+A+ET++KMLD GL PE+LDRAAVLQGLR RIQQ+G ++ YL+LCK LSDANL
Sbjct: 431  IKGSHFDDASETIIKMLDLGLCPEYLDRAAVLQGLRNRIQQTGNVETYLKLCKHLSDANL 490

Query: 424  IGPCLVY 404
            IGPCLVY
Sbjct: 491  IGPCLVY 497


>ref|XP_002313976.1| ubiquitin family protein [Populus trichocarpa]
            gi|222850384|gb|EEE87931.1| ubiquitin family protein
            [Populus trichocarpa]
          Length = 500

 Score =  705 bits (1820), Expect = 0.0
 Identities = 353/476 (74%), Positives = 412/476 (86%), Gaps = 7/476 (1%)
 Frame = -2

Query: 1810 SSIRRSFLLSKEFR---IQSCPRVLSIICAL----KPDFVVERRSKSREFRLFSSVELDT 1652
            S +   F L K +    ++ C  V +IIC      +P+FVV + +K REFRLF SVELD 
Sbjct: 11   SKVSPVFSLKKRYWNSCMKPCCMVSTIICNYQTPKRPNFVVAKTTKVREFRLFKSVELDQ 70

Query: 1651 FVTSDDEGEMSEVFFEAIEELERMTREPSDVLEEMNEKLTARELQLVLVYFSQEGRDSWC 1472
            +VTSDDE EM E FFEAIEELERMTREPSD+LEEMN++L+ARELQLVLVYFSQEGRDSWC
Sbjct: 71   YVTSDDEEEMGEGFFEAIEELERMTREPSDILEEMNDRLSARELQLVLVYFSQEGRDSWC 130

Query: 1471 ALEVFEWLRKENRVDKETMELMVSLMCGWVKKLIEGKSEIGEVVDLLVDMDCVGLRPSFS 1292
            ALEVFEWLRKENRVDKETMELMVS+MC WVKKLIEG+ ++G+VVDLLVDMDCVGL+PSFS
Sbjct: 131  ALEVFEWLRKENRVDKETMELMVSIMCSWVKKLIEGEQDVGDVVDLLVDMDCVGLKPSFS 190

Query: 1291 MIEKVISLYWEAGEKEGAVAFVKEVLRRGIAYSNDDRGQNKGGPTGYLAWKMMVEGNYKD 1112
            MIEKVISLYW+ G+KEGAV+FVKEVLRRGIAYS DD    KGGPTGYL WKMMV+GNY++
Sbjct: 191  MIEKVISLYWDMGKKEGAVSFVKEVLRRGIAYSGDDGEGQKGGPTGYLTWKMMVDGNYRN 250

Query: 1111 AVKLVIQIRECGLKPEVYSYLIAMTAVVKELNEVAKALRKLKGFAKAGLVAELDLENTLL 932
            AVKLVI +RE GLKPE+Y+YLIAMTAVVKELNE +KALRKLKG++++G+V ELD EN  L
Sbjct: 251  AVKLVIHLRESGLKPEIYAYLIAMTAVVKELNEFSKALRKLKGYSRSGMVTELDAENVEL 310

Query: 931  IEKYQAELLDYGVSLSNCVIKEESPSLYGVVHERLLAMYICAGRGVEAERQLWEMKLAGK 752
            +EKYQ++LL  GV LS+ VI+E SP+LYGVVHERLLAMYICAGRG++AERQLWEMKL GK
Sbjct: 311  VEKYQSDLLADGVCLSSWVIQEGSPALYGVVHERLLAMYICAGRGLDAERQLWEMKLVGK 370

Query: 751  EADGDLYDIVLAICASQKEAGAIGRLLTRMEVTNSLRRKKTLSWLLRGYIKGGHFDNAAE 572
            EADGDLYDIVLAICASQKEA A+ RLLTR+EV +S+R+KK+LSWLLRGYIKGGH+  AAE
Sbjct: 371  EADGDLYDIVLAICASQKEASAVARLLTRIEVASSMRKKKSLSWLLRGYIKGGHYGEAAE 430

Query: 571  TVMKMLDSGLFPEFLDRAAVLQGLRRRIQQSGILDIYLELCKRLSDANLIGPCLVY 404
            T++KMLD GL P++LDR AV+QGLR+RIQQ G ++ YL+LCKRLSD NLIGP LVY
Sbjct: 431  TLIKMLDLGLSPDYLDRVAVMQGLRKRIQQWGNVESYLKLCKRLSDVNLIGPSLVY 486


>ref|XP_002521193.1| conserved hypothetical protein [Ricinus communis]
            gi|223539607|gb|EEF41193.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 499

 Score =  705 bits (1819), Expect = 0.0
 Identities = 360/473 (76%), Positives = 409/473 (86%), Gaps = 5/473 (1%)
 Frame = -2

Query: 1807 SIRRSFLLSKEFRIQSCPRVLSIICALKP---DFVVERRSKSR--EFRLFSSVELDTFVT 1643
            S+  +FLL + +++ + PR   +     P   +FVV ++SKSR  EFR+  SVELD ++ 
Sbjct: 14   SVSSTFLLQRRYKLLN-PRFFQLSSIKFPKSSNFVVAQQSKSRNREFRVLKSVELDQYIA 72

Query: 1642 SDDEGEMSEVFFEAIEELERMTREPSDVLEEMNEKLTARELQLVLVYFSQEGRDSWCALE 1463
            SDDE EMSE FFEAIEELERMTREPSDVLEEMN+KL+ARELQLVLVYFSQEGRDSWCALE
Sbjct: 73   SDDEEEMSEGFFEAIEELERMTREPSDVLEEMNDKLSARELQLVLVYFSQEGRDSWCALE 132

Query: 1462 VFEWLRKENRVDKETMELMVSLMCGWVKKLIEGKSEIGEVVDLLVDMDCVGLRPSFSMIE 1283
            VFEWLRKENRVDKETMELMVS+MC W+KKLIEG+ EIG+VVDLLVDMDCVGL+PSFSMIE
Sbjct: 133  VFEWLRKENRVDKETMELMVSIMCSWIKKLIEGEHEIGDVVDLLVDMDCVGLKPSFSMIE 192

Query: 1282 KVISLYWEAGEKEGAVAFVKEVLRRGIAYSNDDRGQNKGGPTGYLAWKMMVEGNYKDAVK 1103
            KVISLYWE GEKE +V+FVKEVLRR +AY  DD    KGGPTGYLAWKMMV+GNY+DAVK
Sbjct: 193  KVISLYWEIGEKEKSVSFVKEVLRREVAYFEDDGEGQKGGPTGYLAWKMMVDGNYRDAVK 252

Query: 1102 LVIQIRECGLKPEVYSYLIAMTAVVKELNEVAKALRKLKGFAKAGLVAELDLENTLLIEK 923
            LVI  RE GLKPEVYSYLIAMTAVVKELNE AKALRKLKGFAK+GL+AELD ENT LIEK
Sbjct: 253  LVIHFRESGLKPEVYSYLIAMTAVVKELNEFAKALRKLKGFAKSGLIAELDAENTRLIEK 312

Query: 922  YQAELLDYGVSLSNCVIKEESPSLYGVVHERLLAMYICAGRGVEAERQLWEMKLAGKEAD 743
            YQ++L+  GV LS+ VI+E SPSLYGVVHERLLAMYICAGRG++AERQLWEMKL GK AD
Sbjct: 313  YQSDLIADGVCLSSWVIQEGSPSLYGVVHERLLAMYICAGRGLDAERQLWEMKLVGKHAD 372

Query: 742  GDLYDIVLAICASQKEAGAIGRLLTRMEVTNSLRRKKTLSWLLRGYIKGGHFDNAAETVM 563
            GDLYDIVLAICASQKEA A+ RLLTR+EVT+SL++KKTLSWLLRGY+KGG +D AAE ++
Sbjct: 373  GDLYDIVLAICASQKEASAVSRLLTRVEVTSSLQKKKTLSWLLRGYLKGGQYDEAAEALV 432

Query: 562  KMLDSGLFPEFLDRAAVLQGLRRRIQQSGILDIYLELCKRLSDANLIGPCLVY 404
            KMLD GL P++LDR AVLQGLR+RIQQ G ++ YL LCKRLSD NLIGP LVY
Sbjct: 433  KMLDMGLCPDYLDRVAVLQGLRKRIQQWGNVESYLNLCKRLSDENLIGPSLVY 485


>gb|EOY34562.1| Pentatricopeptide repeat-containing protein [Theobroma cacao]
          Length = 504

 Score =  692 bits (1785), Expect = 0.0
 Identities = 346/455 (76%), Positives = 403/455 (88%), Gaps = 2/455 (0%)
 Frame = -2

Query: 1762 SCPRVLSIICALK-PDFVVER-RSKSREFRLFSSVELDTFVTSDDEGEMSEVFFEAIEEL 1589
            S  R+ + IC  + P FV+ + + K+RE RLF SVELD F+TSDDE EMSE FFEAIEEL
Sbjct: 36   SLRRISTRICNHQNPSFVLRKIQPKTRECRLFKSVELDQFLTSDDEDEMSEGFFEAIEEL 95

Query: 1588 ERMTREPSDVLEEMNEKLTARELQLVLVYFSQEGRDSWCALEVFEWLRKENRVDKETMEL 1409
            ERMTREPSD+LEEMN++L++RELQLVLVYFSQEGRDSWCALEVFEWL+KEN+VD ETMEL
Sbjct: 96   ERMTREPSDILEEMNDRLSSRELQLVLVYFSQEGRDSWCALEVFEWLKKENKVDNETMEL 155

Query: 1408 MVSLMCGWVKKLIEGKSEIGEVVDLLVDMDCVGLRPSFSMIEKVISLYWEAGEKEGAVAF 1229
            MVS+MC WVKKLIEG+ ++G+VVDLLVDMDCVGL+P FSMIEKVIS+YWE  +K+ AV F
Sbjct: 156  MVSIMCSWVKKLIEGEGDVGDVVDLLVDMDCVGLKPGFSMIEKVISMYWEMEKKDRAVVF 215

Query: 1228 VKEVLRRGIAYSNDDRGQNKGGPTGYLAWKMMVEGNYKDAVKLVIQIRECGLKPEVYSYL 1049
            VKEVLRRGI+Y ++D    KGGPTGYLAWKMMVEGNY+DA+KLVI++RE GLKPE+YSYL
Sbjct: 216  VKEVLRRGISYEDEDGEGQKGGPTGYLAWKMMVEGNYRDAIKLVIELRESGLKPEIYSYL 275

Query: 1048 IAMTAVVKELNEVAKALRKLKGFAKAGLVAELDLENTLLIEKYQAELLDYGVSLSNCVIK 869
            IAMTA+VKELNE AKALRKLKGFA++GLVAELD+EN  LI+KYQ++LL  G+ LSN  I+
Sbjct: 276  IAMTAIVKELNEFAKALRKLKGFARSGLVAELDMENVELIKKYQSDLLADGLRLSNWAIQ 335

Query: 868  EESPSLYGVVHERLLAMYICAGRGVEAERQLWEMKLAGKEADGDLYDIVLAICASQKEAG 689
            E + SL+G+VHERLLAMYICAGRG+EAERQLWEMKLAGKEADGDL+DIVLAICASQKEA 
Sbjct: 336  EGTSSLFGLVHERLLAMYICAGRGLEAERQLWEMKLAGKEADGDLHDIVLAICASQKEAS 395

Query: 688  AIGRLLTRMEVTNSLRRKKTLSWLLRGYIKGGHFDNAAETVMKMLDSGLFPEFLDRAAVL 509
            AI RLLTRMEV++SLRRKKTLSWLLRGYIKGGH  +AAETV+KMLD GL PE+LDRAAVL
Sbjct: 396  AISRLLTRMEVSSSLRRKKTLSWLLRGYIKGGHISDAAETVIKMLDLGLHPEYLDRAAVL 455

Query: 508  QGLRRRIQQSGILDIYLELCKRLSDANLIGPCLVY 404
            Q LR+RIQQ G ++ Y+ LCKRL DA+LIGPCL+Y
Sbjct: 456  QELRKRIQQPGNIETYVNLCKRLYDASLIGPCLIY 490


>ref|XP_004296059.1| PREDICTED: uncharacterized protein LOC101292395 [Fragaria vesca
            subsp. vesca]
          Length = 1304

 Score =  691 bits (1784), Expect = 0.0
 Identities = 353/481 (73%), Positives = 411/481 (85%), Gaps = 5/481 (1%)
 Frame = -2

Query: 1831 LGFSQT---CSSIRRSFLLSKEFRIQSCPRVLSIICALK-PDFVVERRSKSREFRLFSSV 1664
            + F+Q+   C +++R +  S  F  + C RV ++I   K P FVV +  K R+FRLF+SV
Sbjct: 1    MAFAQSLTQCFAVKR-YRFSGGFSGKRCSRVCNVIYKEKNPSFVVAKSGKVRDFRLFNSV 59

Query: 1663 ELDTFVTSDDEGEMSEVFFEAIEELERMTREPSDVLEEMNEKLTARELQLVLVYFSQEGR 1484
            +LD FVTSDDE EM E FFEAIEELERM REPSDVLEEMN++L+ARELQLVLVYFSQEGR
Sbjct: 60   QLDQFVTSDDEDEMGESFFEAIEELERMRREPSDVLEEMNDRLSARELQLVLVYFSQEGR 119

Query: 1483 DSWCALEVFEWLRKENRVDKETMELMVSLMCGWVKKLIEGKSEIGEVVDLLVDMDCVGLR 1304
            DSWCALEVFEWLR+ENRVDKETMELMVS+MCGW+K+LIE  +++ +V+DLLVD+DCVGL+
Sbjct: 120  DSWCALEVFEWLRRENRVDKETMELMVSIMCGWLKRLIEEGNDVADVIDLLVDVDCVGLK 179

Query: 1303 PSFSMIEKVISLYWEAGEKEGAVAFVKEVLRRGIAYSN-DDRGQNKGGPTGYLAWKMMVE 1127
            PSFSM+EKVISLYWE GEKE AV FVKEVL+RGI YS  DDR  +KGGPTGYLAWKM V+
Sbjct: 180  PSFSMMEKVISLYWEMGEKENAVLFVKEVLKRGIVYSEEDDRDGHKGGPTGYLAWKMTVD 239

Query: 1126 GNYKDAVKLVIQIRECGLKPEVYSYLIAMTAVVKELNEVAKALRKLKGFAKAGLVAELDL 947
            GNY+D+VK VIQ+RE GLKPEVYSYLIAMTAVVKELNE+ KALRKLK F +AGLVAE D 
Sbjct: 240  GNYRDSVKFVIQLRESGLKPEVYSYLIAMTAVVKELNELGKALRKLKAFTRAGLVAEFDS 299

Query: 946  ENTLLIEKYQAELLDYGVSLSNCVIKEESPSLYGVVHERLLAMYICAGRGVEAERQLWEM 767
            E+  LIEKYQ++LL  GV LSN VI+E S +L GVVHERLLAMYIC+GRG+EAERQLWEM
Sbjct: 300  EDVGLIEKYQSDLLADGVQLSNWVIQEGSSTLCGVVHERLLAMYICSGRGLEAERQLWEM 359

Query: 766  KLAGKEADGDLYDIVLAICASQKEAGAIGRLLTRMEVTNSLRRKKTLSWLLRGYIKGGHF 587
            KL GKE DGDLYDIVLAICAS+KE  AI RLLTR EV++SL +KK+LSWLLRGYIKGGHF
Sbjct: 360  KLVGKEPDGDLYDIVLAICASRKETSAIARLLTRTEVSSSLSKKKSLSWLLRGYIKGGHF 419

Query: 586  DNAAETVMKMLDSGLFPEFLDRAAVLQGLRRRIQQSGILDIYLELCKRLSDANLIGPCLV 407
            ++AAETV+KMLD GLFP++LDRAAVL GLR+RIQQSG +D YL+LCKRLSDANLI  CL+
Sbjct: 420  NDAAETVIKMLDLGLFPDYLDRAAVLHGLRKRIQQSGTVDTYLKLCKRLSDANLIESCLL 479

Query: 406  Y 404
            Y
Sbjct: 480  Y 480


>gb|EXB37964.1| hypothetical protein L484_011688 [Morus notabilis]
          Length = 516

 Score =  687 bits (1773), Expect = 0.0
 Identities = 358/502 (71%), Positives = 406/502 (80%), Gaps = 14/502 (2%)
 Frame = -2

Query: 1867 MVALDGIASFAHLGFSQTCSSI----------RRSFLLSKEFRI--QSCPRVLSIICALK 1724
            M +  G      LGF  + SS            R FL   +  +  ++  +   +IC  +
Sbjct: 1    MASAQGFTPLTELGFPSSSSSSSSSSSNSLHRNRIFLCRMDENLWGRTSAKFCPVICCKQ 60

Query: 1723 --PDFVVERRSKSREFRLFSSVELDTFVTSDDEGEMSEVFFEAIEELERMTREPSDVLEE 1550
              P+F+  + SK REFRLF+SVELD F+TSDDE EM E FFEAIEELERMTREPSDVLEE
Sbjct: 61   QNPNFIAPKPSKLREFRLFTSVELDQFLTSDDEEEMGEGFFEAIEELERMTREPSDVLEE 120

Query: 1549 MNEKLTARELQLVLVYFSQEGRDSWCALEVFEWLRKENRVDKETMELMVSLMCGWVKKLI 1370
            MN++L+ARELQLVLVYFSQEGRDSWCALEVFEWLRKENRVDKETMELMV+LMC WVKKLI
Sbjct: 121  MNDRLSARELQLVLVYFSQEGRDSWCALEVFEWLRKENRVDKETMELMVTLMCSWVKKLI 180

Query: 1369 EGKSEIGEVVDLLVDMDCVGLRPSFSMIEKVISLYWEAGEKEGAVAFVKEVLRRGIAYSN 1190
            EG+ ++G+VVDLLVDM CVGLRP FSM+E VI LYWE GEK  AV+FVKEVLRRGIA   
Sbjct: 181  EGEHDVGDVVDLLVDMACVGLRPGFSMMENVILLYWEMGEKGRAVSFVKEVLRRGIACLE 240

Query: 1189 DDRGQNKGGPTGYLAWKMMVEGNYKDAVKLVIQIRECGLKPEVYSYLIAMTAVVKELNEV 1010
            DD    KGGPTGYLAWKMMVEGNY +AVKLV+ IRE GLKPEVYSYLIAMTAVVKELNE 
Sbjct: 241  DDGEGPKGGPTGYLAWKMMVEGNYMEAVKLVVDIRESGLKPEVYSYLIAMTAVVKELNEF 300

Query: 1009 AKALRKLKGFAKAGLVAELDLENTLLIEKYQAELLDYGVSLSNCVIKEESPSLYGVVHER 830
            AKALRKLKGF +AGL AELD E+  LIEKYQ++LLD GV LSN VI+E   SL GVVHER
Sbjct: 301  AKALRKLKGFERAGLTAELDEESVELIEKYQSDLLDDGVRLSNWVIEEGITSLNGVVHER 360

Query: 829  LLAMYICAGRGVEAERQLWEMKLAGKEADGDLYDIVLAICASQKEAGAIGRLLTRMEVTN 650
            LLAMYICAGRG+EAERQLW+MKL GKEADGDLYDIVLAICASQKE  AI RLLTR+  ++
Sbjct: 361  LLAMYICAGRGIEAERQLWKMKLVGKEADGDLYDIVLAICASQKEGRAIARLLTRVNFSS 420

Query: 649  SLRRKKTLSWLLRGYIKGGHFDNAAETVMKMLDSGLFPEFLDRAAVLQGLRRRIQQSGIL 470
            +LR++K+LSWLLRGYIKGGHFDNAAETV+KMLD GL PE+LDRAAVLQGLR+RI+    +
Sbjct: 421  TLRKRKSLSWLLRGYIKGGHFDNAAETVVKMLDLGLCPEYLDRAAVLQGLRKRIKGPDTV 480

Query: 469  DIYLELCKRLSDANLIGPCLVY 404
            + YL+LCK LSD NLIGPCL+Y
Sbjct: 481  ETYLKLCKHLSDYNLIGPCLIY 502


>ref|XP_004239038.1| PREDICTED: pentatricopeptide repeat-containing protein At2g30100,
            chloroplastic-like [Solanum lycopersicum]
          Length = 503

 Score =  684 bits (1764), Expect = 0.0
 Identities = 357/486 (73%), Positives = 407/486 (83%), Gaps = 4/486 (0%)
 Frame = -2

Query: 1849 IASFAHLGFSQTCSSIRRSFLLSKEFRIQSCPRVLSII-CALK-PDFVVERRSKSREFRL 1676
            I SF +LG S+  S  R    + + +       VL  + C+ + P FV  RR+    F+L
Sbjct: 7    IVSFTYLGLSKAVSPKRCRLGIPQTWLKWRSSLVLGGVGCSSRNPSFVSPRRNG---FKL 63

Query: 1675 FSSVELDTFVTSDDE--GEMSEVFFEAIEELERMTREPSDVLEEMNEKLTARELQLVLVY 1502
            FSSVEL +FVTSD E   EMS+ FFEAIEELERMTREPSDVLEEMNE+L+ RELQLVLVY
Sbjct: 64   FSSVELGSFVTSDGEEKNEMSDCFFEAIEELERMTREPSDVLEEMNERLSDRELQLVLVY 123

Query: 1501 FSQEGRDSWCALEVFEWLRKENRVDKETMELMVSLMCGWVKKLIEGKSEIGEVVDLLVDM 1322
            F+QEGRDSWCALEVFEWLRKENRVDKETMELMVS+MCGWV+KLI  KSE G+VVDLLVDM
Sbjct: 124  FAQEGRDSWCALEVFEWLRKENRVDKETMELMVSIMCGWVQKLIGSKSEAGDVVDLLVDM 183

Query: 1321 DCVGLRPSFSMIEKVISLYWEAGEKEGAVAFVKEVLRRGIAYSNDDRGQNKGGPTGYLAW 1142
            DCVGL PSFSM+EKVISLYW+AGE+EGAV+FVKEVLRR IAYS+ +   +K GP GYLAW
Sbjct: 184  DCVGLNPSFSMVEKVISLYWDAGEREGAVSFVKEVLRRQIAYSDGNVDGHKAGPAGYLAW 243

Query: 1141 KMMVEGNYKDAVKLVIQIRECGLKPEVYSYLIAMTAVVKELNEVAKALRKLKGFAKAGLV 962
            KMM EGNYKDAVKLVI IR+ GLKPE+YSYLIAMTAVVKELNE  KALRKLKGFA+ GLV
Sbjct: 244  KMMEEGNYKDAVKLVIDIRDSGLKPELYSYLIAMTAVVKELNEFGKALRKLKGFARTGLV 303

Query: 961  AELDLENTLLIEKYQAELLDYGVSLSNCVIKEESPSLYGVVHERLLAMYICAGRGVEAER 782
            AELDLEN  LIE+YQA+LL  GV LS+ +I+E  PSL+GVVHERLLAMY+CAGRG+EAER
Sbjct: 304  AELDLENLRLIEEYQADLLAEGVQLSDWLIQEGGPSLFGVVHERLLAMYVCAGRGIEAER 363

Query: 781  QLWEMKLAGKEADGDLYDIVLAICASQKEAGAIGRLLTRMEVTNSLRRKKTLSWLLRGYI 602
             LW+MK++GKE  GDL+DIVLAICASQKE G I RLLT ME ++SL++KKTLSWLLRGYI
Sbjct: 364  HLWQMKISGKEVSGDLHDIVLAICASQKELGPISRLLTGMEASSSLQKKKTLSWLLRGYI 423

Query: 601  KGGHFDNAAETVMKMLDSGLFPEFLDRAAVLQGLRRRIQQSGILDIYLELCKRLSDANLI 422
            KGGH +NAAETV+KMLD GL+P+FLDRAAVLQ LRRRIQQSG L+ YL LCK LSDA+LI
Sbjct: 424  KGGHLENAAETVIKMLDLGLYPDFLDRAAVLQRLRRRIQQSGNLETYLNLCKHLSDASLI 483

Query: 421  GPCLVY 404
            GPCLVY
Sbjct: 484  GPCLVY 489


>ref|XP_006348674.1| PREDICTED: pentatricopeptide repeat-containing protein At2g30100,
            chloroplastic-like [Solanum tuberosum]
          Length = 503

 Score =  679 bits (1751), Expect = 0.0
 Identities = 356/492 (72%), Positives = 408/492 (82%), Gaps = 4/492 (0%)
 Frame = -2

Query: 1867 MVALDGIASFAHLGFSQTCSSIRRSFLLSKEFRIQSCPRVLSII-CALK-PDFVVERRSK 1694
            M  ++ IAS  +LG S+     R    + + +       VL  + C+ + P FV  RR+ 
Sbjct: 1    MATVNEIASLTYLGLSKVVFPKRCRLGIPQTWLKWRSSWVLGGVGCSSRNPSFVNPRRNG 60

Query: 1693 SREFRLFSSVELDTFVTSDDE--GEMSEVFFEAIEELERMTREPSDVLEEMNEKLTAREL 1520
               F+LF+SVEL +FVTSDDE   EMS+ FFEAIEELERMTREPSDVLEEMNE+L+ REL
Sbjct: 61   ---FKLFNSVELGSFVTSDDEEKNEMSDCFFEAIEELERMTREPSDVLEEMNERLSDREL 117

Query: 1519 QLVLVYFSQEGRDSWCALEVFEWLRKENRVDKETMELMVSLMCGWVKKLIEGKSEIGEVV 1340
            QLVLVYF+QEGRDSWCALEVFEWLRKENRVDKETMELMVS+MCGWV+KLI  KSE G+VV
Sbjct: 118  QLVLVYFAQEGRDSWCALEVFEWLRKENRVDKETMELMVSIMCGWVQKLIGSKSEAGDVV 177

Query: 1339 DLLVDMDCVGLRPSFSMIEKVISLYWEAGEKEGAVAFVKEVLRRGIAYSNDDRGQNKGGP 1160
            DLLVDMDCVGL PSFSM+EKVISLYW+AGE+EGAV+FVKEVLRR IAYS+ +   +K GP
Sbjct: 178  DLLVDMDCVGLNPSFSMVEKVISLYWDAGEREGAVSFVKEVLRRQIAYSDGNVDGHKAGP 237

Query: 1159 TGYLAWKMMVEGNYKDAVKLVIQIRECGLKPEVYSYLIAMTAVVKELNEVAKALRKLKGF 980
             GYLAWKMM  GNYKDAVKLVI IR+ GLKPE+YSYLIAMTAVVKELNE  KALRKLKGF
Sbjct: 238  AGYLAWKMMEVGNYKDAVKLVIDIRDSGLKPELYSYLIAMTAVVKELNEFGKALRKLKGF 297

Query: 979  AKAGLVAELDLENTLLIEKYQAELLDYGVSLSNCVIKEESPSLYGVVHERLLAMYICAGR 800
            A+ GLVAELDLEN  LIE+YQA+LL  GV LS+ +I+E  PSL+GVVHERLLAMY+CAGR
Sbjct: 298  ARTGLVAELDLENLRLIEEYQADLLAEGVQLSDWLIQEGGPSLFGVVHERLLAMYVCAGR 357

Query: 799  GVEAERQLWEMKLAGKEADGDLYDIVLAICASQKEAGAIGRLLTRMEVTNSLRRKKTLSW 620
            G+EAER LW+MKL+GK+  GDL DIVLAICASQKE G I RLLT ME ++SL++KKTLSW
Sbjct: 358  GIEAERHLWQMKLSGKKVTGDLQDIVLAICASQKELGPISRLLTGMEASSSLQKKKTLSW 417

Query: 619  LLRGYIKGGHFDNAAETVMKMLDSGLFPEFLDRAAVLQGLRRRIQQSGILDIYLELCKRL 440
            LLRGYIKGGH +NAAETV+KMLD GL+P+FLDRAAVLQ LRRRIQQSG L+ YL LCK L
Sbjct: 418  LLRGYIKGGHLENAAETVIKMLDLGLYPDFLDRAAVLQRLRRRIQQSGSLETYLNLCKHL 477

Query: 439  SDANLIGPCLVY 404
            SDA+LIGPCLVY
Sbjct: 478  SDASLIGPCLVY 489


>gb|ESW18802.1| hypothetical protein PHAVU_006G071400g [Phaseolus vulgaris]
          Length = 510

 Score =  677 bits (1747), Expect = 0.0
 Identities = 338/441 (76%), Positives = 381/441 (86%), Gaps = 1/441 (0%)
 Frame = -2

Query: 1723 PDFVVERRSKSREFRLFSSVELDTFVTSDDE-GEMSEVFFEAIEELERMTREPSDVLEEM 1547
            P  V  +    R FR+  SVELD FVTSDDE  EM + FFEAIEELERMTREPSD+LEEM
Sbjct: 56   PSIVAAKHCSVRGFRVLKSVELDQFVTSDDEEDEMGDGFFEAIEELERMTREPSDILEEM 115

Query: 1546 NEKLTARELQLVLVYFSQEGRDSWCALEVFEWLRKENRVDKETMELMVSLMCGWVKKLIE 1367
            N++L+ARELQLVLVYFSQ+GRDSWCALEVF+WLRKENRVDKETMELMVS+MCGWVKKLI+
Sbjct: 116  NDRLSARELQLVLVYFSQDGRDSWCALEVFDWLRKENRVDKETMELMVSIMCGWVKKLIQ 175

Query: 1366 GKSEIGEVVDLLVDMDCVGLRPSFSMIEKVISLYWEAGEKEGAVAFVKEVLRRGIAYSND 1187
             +  +G+V+DLLVDMDCVGLRP FSMIEKVISLYWE GEKEGAV FV+EVLRRGI Y+++
Sbjct: 176  EQHGVGDVIDLLVDMDCVGLRPGFSMIEKVISLYWEMGEKEGAVLFVEEVLRRGIPYASE 235

Query: 1186 DRGQNKGGPTGYLAWKMMVEGNYKDAVKLVIQIRECGLKPEVYSYLIAMTAVVKELNEVA 1007
            D+  +KGGPTGYLAWKMM EG+Y+ AV+LVI+ RE GLKPEVYSYL+AMTAVVKELNE A
Sbjct: 236  DKEGHKGGPTGYLAWKMMAEGDYRSAVRLVIRFRESGLKPEVYSYLVAMTAVVKELNEFA 295

Query: 1006 KALRKLKGFAKAGLVAELDLENTLLIEKYQAELLDYGVSLSNCVIKEESPSLYGVVHERL 827
            KALRKLK F +AGLV ELDLE+  L EKYQ +LL  GV LSN VI++  PSLYGVVHERL
Sbjct: 296  KALRKLKSFTRAGLVTELDLEDVELAEKYQTDLLADGVRLSNWVIQDGRPSLYGVVHERL 355

Query: 826  LAMYICAGRGVEAERQLWEMKLAGKEADGDLYDIVLAICASQKEAGAIGRLLTRMEVTNS 647
            LAMYICAG G+EAERQLWEMKL GKEADGDLYDIVLAICASQKE  A  RLLTR+E+ NS
Sbjct: 356  LAMYICAGHGIEAERQLWEMKLVGKEADGDLYDIVLAICASQKEVNATARLLTRLELANS 415

Query: 646  LRRKKTLSWLLRGYIKGGHFDNAAETVMKMLDSGLFPEFLDRAAVLQGLRRRIQQSGILD 467
             ++KK+LSWLLRGYIKGGHF  AAETVMKML+ G +PE+LDRAAVLQGLR+RIQQ G LD
Sbjct: 416  PQKKKSLSWLLRGYIKGGHFTEAAETVMKMLELGFYPEYLDRAAVLQGLRKRIQQYGNLD 475

Query: 466  IYLELCKRLSDANLIGPCLVY 404
             Y+ LCK LSDANLIGPCLV+
Sbjct: 476  TYVRLCKSLSDANLIGPCLVH 496


>ref|XP_003551233.1| PREDICTED: pentatricopeptide repeat-containing protein At2g30100,
            chloroplastic-like [Glycine max]
          Length = 508

 Score =  674 bits (1738), Expect = 0.0
 Identities = 337/445 (75%), Positives = 386/445 (86%), Gaps = 1/445 (0%)
 Frame = -2

Query: 1735 CALK-PDFVVERRSKSREFRLFSSVELDTFVTSDDEGEMSEVFFEAIEELERMTREPSDV 1559
            C  K P FV  +    R FR   SVE+D +VTS+DE  MS+ FFEAIEELERMTREPSDV
Sbjct: 52   CKFKNPSFVSAKHGSLRGFRALKSVEMDQYVTSNDE--MSDGFFEAIEELERMTREPSDV 109

Query: 1558 LEEMNEKLTARELQLVLVYFSQEGRDSWCALEVFEWLRKENRVDKETMELMVSLMCGWVK 1379
            LEEMN++L+ARELQLVLVYFSQ+GRDSWCALEVF+WLRKENRVDKETMELMV++MCGWVK
Sbjct: 110  LEEMNDRLSARELQLVLVYFSQDGRDSWCALEVFDWLRKENRVDKETMELMVAIMCGWVK 169

Query: 1378 KLIEGKSEIGEVVDLLVDMDCVGLRPSFSMIEKVISLYWEAGEKEGAVAFVKEVLRRGIA 1199
            KLI+ +  +G+VVDLLVDMDCVGLRP FSMIEKVISLYWE GEKEGAV FV+EVLRRGI 
Sbjct: 170  KLIQQQHGVGDVVDLLVDMDCVGLRPGFSMIEKVISLYWEMGEKEGAVLFVEEVLRRGIP 229

Query: 1198 YSNDDRGQNKGGPTGYLAWKMMVEGNYKDAVKLVIQIRECGLKPEVYSYLIAMTAVVKEL 1019
            Y  +D   +KGGPTGYLAWKMM EG+Y++AV+LVI+ RE GLKPE+YSYL+AMTAVVKEL
Sbjct: 230  YVEEDEEGHKGGPTGYLAWKMMAEGDYRNAVRLVIRFRESGLKPEIYSYLVAMTAVVKEL 289

Query: 1018 NEVAKALRKLKGFAKAGLVAELDLENTLLIEKYQAELLDYGVSLSNCVIKEESPSLYGVV 839
            NE AKALRKLKGF +AGLVAELDLE+  L EKYQ++ L  GV LSN VI++ SPSL+G+V
Sbjct: 290  NEFAKALRKLKGFTRAGLVAELDLEDVELTEKYQSDTLADGVRLSNWVIQDGSPSLHGIV 349

Query: 838  HERLLAMYICAGRGVEAERQLWEMKLAGKEADGDLYDIVLAICASQKEAGAIGRLLTRME 659
            HERLLAMYICAG G+EAERQLWEMKL GKEADGDLYDIVLAICASQKE+ A  RLLTR+E
Sbjct: 350  HERLLAMYICAGHGIEAERQLWEMKLVGKEADGDLYDIVLAICASQKESNATARLLTRLE 409

Query: 658  VTNSLRRKKTLSWLLRGYIKGGHFDNAAETVMKMLDSGLFPEFLDRAAVLQGLRRRIQQS 479
            V +S ++KK+LSWLLRGYIKGGHF+ AAET+MKML+ G +PE+LDRAAVLQGLR+RIQQ 
Sbjct: 410  VVSSPQKKKSLSWLLRGYIKGGHFNEAAETIMKMLELGFYPEYLDRAAVLQGLRKRIQQY 469

Query: 478  GILDIYLELCKRLSDANLIGPCLVY 404
            G LD Y+ LCK LSDANLIGPCLV+
Sbjct: 470  GNLDTYVRLCKSLSDANLIGPCLVH 494


>ref|XP_006425116.1| hypothetical protein CICLE_v10028251mg [Citrus clementina]
            gi|557527050|gb|ESR38356.1| hypothetical protein
            CICLE_v10028251mg [Citrus clementina]
          Length = 502

 Score =  668 bits (1724), Expect = 0.0
 Identities = 334/440 (75%), Positives = 376/440 (85%)
 Frame = -2

Query: 1723 PDFVVERRSKSREFRLFSSVELDTFVTSDDEGEMSEVFFEAIEELERMTREPSDVLEEMN 1544
            P F+  + SK REFR   SVELD FVTSDDE EMSE FFEAIEELERMTREPSD+LEEMN
Sbjct: 49   PSFIATKVSKIREFRFLKSVELDQFVTSDDEDEMSEEFFEAIEELERMTREPSDILEEMN 108

Query: 1543 EKLTARELQLVLVYFSQEGRDSWCALEVFEWLRKENRVDKETMELMVSLMCGWVKKLIEG 1364
            ++L+ARELQLVLVYFSQEGRDSWCALEVFEWL+KENRVD ETMELMVS+MC WVKK IE 
Sbjct: 109  DRLSARELQLVLVYFSQEGRDSWCALEVFEWLKKENRVDNETMELMVSIMCSWVKKYIEE 168

Query: 1363 KSEIGEVVDLLVDMDCVGLRPSFSMIEKVISLYWEAGEKEGAVAFVKEVLRRGIAYSNDD 1184
            + ++G+V+DLLVDMDCVGL+P FSMIEKVISLYWE  +KE AV FVK VL RGIAY+  D
Sbjct: 169  ERDVGDVIDLLVDMDCVGLKPGFSMIEKVISLYWEMEKKERAVLFVKAVLSRGIAYAEGD 228

Query: 1183 RGQNKGGPTGYLAWKMMVEGNYKDAVKLVIQIRECGLKPEVYSYLIAMTAVVKELNEVAK 1004
                KGGPTGYLAWKMMVEG Y DA+KLVI +RE GLKPEVYSYLIA+TAVVKELNE  K
Sbjct: 229  GEGQKGGPTGYLAWKMMVEGKYVDAIKLVIHLRESGLKPEVYSYLIALTAVVKELNEFGK 288

Query: 1003 ALRKLKGFAKAGLVAELDLENTLLIEKYQAELLDYGVSLSNCVIKEESPSLYGVVHERLL 824
            ALRKLKG+ +AG +AELD +N  LIEKYQ++LL  G  LS+  I+E   SLYGVVHERLL
Sbjct: 289  ALRKLKGYVRAGSIAELDGKNLGLIEKYQSDLLADGSRLSSWAIQEGGSSLYGVVHERLL 348

Query: 823  AMYICAGRGVEAERQLWEMKLAGKEADGDLYDIVLAICASQKEAGAIGRLLTRMEVTNSL 644
            AMYICAGRG+EAERQLWEMKL GKEADGDLYDIVLAICASQ E  A+ RLL+R+EV NSL
Sbjct: 349  AMYICAGRGLEAERQLWEMKLVGKEADGDLYDIVLAICASQNEGSAVSRLLSRIEVMNSL 408

Query: 643  RRKKTLSWLLRGYIKGGHFDNAAETVMKMLDSGLFPEFLDRAAVLQGLRRRIQQSGILDI 464
             +KKTLSWLLRGYIKGGH ++AAET+ KMLD GL+PE++DR AVLQGLR+RIQQSG ++ 
Sbjct: 409  CKKKTLSWLLRGYIKGGHINDAAETLTKMLDLGLYPEYMDRVAVLQGLRKRIQQSGNVEA 468

Query: 463  YLELCKRLSDANLIGPCLVY 404
            YL LCKRLSD +LIGPCLVY
Sbjct: 469  YLNLCKRLSDTSLIGPCLVY 488


>ref|XP_006488563.1| PREDICTED: pentatricopeptide repeat-containing protein At2g30100,
            chloroplastic-like [Citrus sinensis]
          Length = 502

 Score =  667 bits (1722), Expect = 0.0
 Identities = 348/489 (71%), Positives = 400/489 (81%), Gaps = 7/489 (1%)
 Frame = -2

Query: 1849 IASFAHLGFSQTCSSI-RRSFLLSKEFRIQSC-----PRVLSIICALK-PDFVVERRSKS 1691
            +AS   LGF+ + + + +R  LL  + R  SC     PR+ + I   + P+F+  + SK 
Sbjct: 1    MASVPELGFALSPNFLLQRHKLLVPQLR-GSCLTRPPPRISTRIKNYQNPNFIATKVSKI 59

Query: 1690 REFRLFSSVELDTFVTSDDEGEMSEVFFEAIEELERMTREPSDVLEEMNEKLTARELQLV 1511
            REFR   SVELD FVTSDDE EMSE FFEAIEELERMTREPSD+LEEMN++L+ARELQLV
Sbjct: 60   REFRFLKSVELDQFVTSDDEDEMSEEFFEAIEELERMTREPSDILEEMNDRLSARELQLV 119

Query: 1510 LVYFSQEGRDSWCALEVFEWLRKENRVDKETMELMVSLMCGWVKKLIEGKSEIGEVVDLL 1331
            LVYFSQEGRDSWCALEVFEWL+KENRVD ETMELMVS+MC WVKK IE +  +G+VVDLL
Sbjct: 120  LVYFSQEGRDSWCALEVFEWLKKENRVDNETMELMVSIMCSWVKKYIEEERGVGDVVDLL 179

Query: 1330 VDMDCVGLRPSFSMIEKVISLYWEAGEKEGAVAFVKEVLRRGIAYSNDDRGQNKGGPTGY 1151
            VDMDCVGL+P FSMIEKVISLYWE  +KE AV FVK VL RGIAY+  D    +GGPTGY
Sbjct: 180  VDMDCVGLKPGFSMIEKVISLYWEMEKKERAVLFVKAVLSRGIAYAEGDGEGQQGGPTGY 239

Query: 1150 LAWKMMVEGNYKDAVKLVIQIRECGLKPEVYSYLIAMTAVVKELNEVAKALRKLKGFAKA 971
            LAWKMMVEG Y DA+KLVI +RE GLKPEVYSYLIA+TAVVKELNE  KALRKLKG+ +A
Sbjct: 240  LAWKMMVEGKYVDAIKLVIHLRESGLKPEVYSYLIALTAVVKELNEFGKALRKLKGYVRA 299

Query: 970  GLVAELDLENTLLIEKYQAELLDYGVSLSNCVIKEESPSLYGVVHERLLAMYICAGRGVE 791
            G +AELD +N  LIEKYQ++LL  G  LS+  I+E   SLYGVVHERLLAMYICAGRG+E
Sbjct: 300  GSIAELDGKNLGLIEKYQSDLLADGSRLSSWAIQEGGSSLYGVVHERLLAMYICAGRGLE 359

Query: 790  AERQLWEMKLAGKEADGDLYDIVLAICASQKEAGAIGRLLTRMEVTNSLRRKKTLSWLLR 611
            AERQLWEMKL GKEADGDLYDIVLAICASQ E  A+ RLL+R+EV NSL +KKTLSWLLR
Sbjct: 360  AERQLWEMKLVGKEADGDLYDIVLAICASQNEGSAVSRLLSRIEVMNSLCKKKTLSWLLR 419

Query: 610  GYIKGGHFDNAAETVMKMLDSGLFPEFLDRAAVLQGLRRRIQQSGILDIYLELCKRLSDA 431
            GYIKGGH ++AAET+ KMLD GL+PE++DR AVLQGLR+RIQQSG ++ YL LCKRLSD 
Sbjct: 420  GYIKGGHINDAAETLTKMLDLGLYPEYMDRVAVLQGLRKRIQQSGNVEAYLNLCKRLSDT 479

Query: 430  NLIGPCLVY 404
            +LIGPCLVY
Sbjct: 480  SLIGPCLVY 488


>ref|XP_003538312.1| PREDICTED: pentatricopeptide repeat-containing protein At2g30100,
            chloroplastic-like [Glycine max]
          Length = 510

 Score =  666 bits (1719), Expect = 0.0
 Identities = 339/447 (75%), Positives = 383/447 (85%), Gaps = 3/447 (0%)
 Frame = -2

Query: 1735 CALK-PDFVVERRSKSREFRLFSSVELDTFVTSDDE-GEMSEVFFEAIEELERMTREPSD 1562
            C  K P FV  ++   R FR   SVELD +VTSDDE  EMS+ FFEAIEELERMTREPSD
Sbjct: 52   CKFKNPSFV--KQGSIRGFRALKSVELDQYVTSDDEEDEMSDGFFEAIEELERMTREPSD 109

Query: 1561 VLEEMNEKLTARELQLVLVYFSQEGRDSWCALEVFEWLRKENRVDKETMELMVSLMCGWV 1382
            VLEEMN++L+ARELQLVLVYFSQ+GRDSWCALEVF+WLRKENRVDKETMELMV++MCGWV
Sbjct: 110  VLEEMNDRLSARELQLVLVYFSQDGRDSWCALEVFDWLRKENRVDKETMELMVAIMCGWV 169

Query: 1381 KKLI-EGKSEIGEVVDLLVDMDCVGLRPSFSMIEKVISLYWEAGEKEGAVAFVKEVLRRG 1205
            KKLI E    +G+VVDLLVDMDCVGLRP FSMIEKVISLYWE GEKEGAV FV+EVLRRG
Sbjct: 170  KKLIQEHHGVVGDVVDLLVDMDCVGLRPGFSMIEKVISLYWEMGEKEGAVLFVEEVLRRG 229

Query: 1204 IAYSNDDRGQNKGGPTGYLAWKMMVEGNYKDAVKLVIQIRECGLKPEVYSYLIAMTAVVK 1025
            I Y  +D   +KGGPTGYLAWKMM EG+Y  AV+LVI   E GLKPEVYSYL+AMTAVVK
Sbjct: 230  IPYLEEDEEGHKGGPTGYLAWKMMAEGDYTSAVRLVIHFTESGLKPEVYSYLVAMTAVVK 289

Query: 1024 ELNEVAKALRKLKGFAKAGLVAELDLENTLLIEKYQAELLDYGVSLSNCVIKEESPSLYG 845
            ELNE+AKALRKLK FA+ GLVAELDLE+  L EKYQ++LL  GV LSN  I++ SPSL+G
Sbjct: 290  ELNELAKALRKLKSFARTGLVAELDLEDVELTEKYQSDLLGDGVRLSNWAIQDGSPSLHG 349

Query: 844  VVHERLLAMYICAGRGVEAERQLWEMKLAGKEADGDLYDIVLAICASQKEAGAIGRLLTR 665
            ++HERLLAMYICAG G+EAE+QLWEMKL GKEADGDLYDIVLAICASQKE+ A  RLLTR
Sbjct: 350  IIHERLLAMYICAGHGIEAEKQLWEMKLVGKEADGDLYDIVLAICASQKESNATARLLTR 409

Query: 664  MEVTNSLRRKKTLSWLLRGYIKGGHFDNAAETVMKMLDSGLFPEFLDRAAVLQGLRRRIQ 485
            +EV +S ++KK+LSWLLRGYIKGGHF+ AAET+MKMLD G +PE+LDRAAVLQGLR+RIQ
Sbjct: 410  LEVASSPQKKKSLSWLLRGYIKGGHFNEAAETIMKMLDLGFYPEYLDRAAVLQGLRKRIQ 469

Query: 484  QSGILDIYLELCKRLSDANLIGPCLVY 404
            Q G LD Y+ LCK LSDANLIGPCLV+
Sbjct: 470  QYGNLDTYVRLCKSLSDANLIGPCLVH 496


>ref|XP_004168796.1| PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing
            protein At2g30100, chloroplastic-like [Cucumis sativus]
          Length = 501

 Score =  647 bits (1669), Expect = 0.0
 Identities = 335/496 (67%), Positives = 390/496 (78%), Gaps = 8/496 (1%)
 Frame = -2

Query: 1867 MVALDGIASFAHLGFSQTCSSIRRSFLLSKEFRIQSC----PR---VLSIICALKPD-FV 1712
            M+   G       GFS         F LS     Q C    PR   V  I C  +   F 
Sbjct: 1    MICAQGFTPLTQFGFS---------FSLSSPLESQRCGFSTPRLYMVSPISCNYQDSTFS 51

Query: 1711 VERRSKSREFRLFSSVELDTFVTSDDEGEMSEVFFEAIEELERMTREPSDVLEEMNEKLT 1532
            V R +K R+ RLF SVELD F+TSDDE EM + FFEAIEELERMTREPSDVLEEMN++L+
Sbjct: 52   VSRAAKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMTREPSDVLEEMNDRLS 111

Query: 1531 ARELQLVLVYFSQEGRDSWCALEVFEWLRKENRVDKETMELMVSLMCGWVKKLIEGKSEI 1352
            ARE+QLVLVYFSQEGRDSWCALEVFEWL+KENRVDKETMELMVS+MC W+KKL+EG+  +
Sbjct: 112  AREIQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGRHNV 171

Query: 1351 GEVVDLLVDMDCVGLRPSFSMIEKVISLYWEAGEKEGAVAFVKEVLRRGIAYSNDDRGQN 1172
            G+VVDLLVDMDCVGL+P FSMIEKVISLYWE GEKE AV FVKEVL R +A+  DD   +
Sbjct: 172  GDVVDLLVDMDCVGLKPHFSMIEKVISLYWEMGEKEKAVFFVKEVLGRNLAFMKDDWEGH 231

Query: 1171 KGGPTGYLAWKMMVEGNYKDAVKLVIQIRECGLKPEVYSYLIAMTAVVKELNEVAKALRK 992
            KGGP+GYLAWKMMV+G+Y+ AVK+V+ +RE GL+PEVYSYLIAMTAVVKELNE AKALRK
Sbjct: 232  KGGPSGYLAWKMMVDGDYRGAVKMVLHLRESGLRPEVYSYLIAMTAVVKELNEFAKALRK 291

Query: 991  LKGFAKAGLVAELDLENTLLIEKYQAELLDYGVSLSNCVIKEESPSLYGVVHERLLAMYI 812
            LKG+A+ G VAELD  N  L+ KYQ ELL  GV LSN V++E S S+ GVVHERLLAMYI
Sbjct: 292  LKGYARDGFVAELDKNNVELVAKYQTELLADGVQLSNWVLEEGSSSIRGVVHERLLAMYI 351

Query: 811  CAGRGVEAERQLWEMKLAGKEADGDLYDIVLAICASQKEAGAIGRLLTRMEVTNSLRRKK 632
            CAG+GVEAERQLWEMKL GKEAD DLYDIVLAICASQKE  A+ RLLTR+E+T+ + +KK
Sbjct: 352  CAGQGVEAERQLWEMKLVGKEADADLYDIVLAICASQKETKAMKRLLTRIEITSPMIKKK 411

Query: 631  TLSWLLRGYIKGGHFDNAAETVMKMLDSGLFPEFLDRAAVLQGLRRRIQQSGILDIYLEL 452
            +L+WLLRGYIKGGHF +AA T++KM++ G  PE+LDR AVLQGL + I++   +  YL+L
Sbjct: 412  SLTWLLRGYIKGGHFRDAAGTLVKMINLGFLPEYLDRVAVLQGLXKEIREPESVHTYLDL 471

Query: 451  CKRLSDANLIGPCLVY 404
            CK LSDANLIGP LVY
Sbjct: 472  CKCLSDANLIGPSLVY 487


>ref|XP_004143220.1| PREDICTED: uncharacterized protein LOC101207176 [Cucumis sativus]
          Length = 1290

 Score =  645 bits (1664), Expect = 0.0
 Identities = 334/494 (67%), Positives = 389/494 (78%), Gaps = 8/494 (1%)
 Frame = -2

Query: 1867 MVALDGIASFAHLGFSQTCSSIRRSFLLSKEFRIQSC----PR---VLSIICALKPD-FV 1712
            M+   G       GFS         F LS     Q C    PR   V  I C  +   F 
Sbjct: 1    MICAQGFTPLTQFGFS---------FSLSSPLESQRCGFSTPRLYMVSPISCNYQDSTFS 51

Query: 1711 VERRSKSREFRLFSSVELDTFVTSDDEGEMSEVFFEAIEELERMTREPSDVLEEMNEKLT 1532
            V R +K R+ RLF SVELD F+TSDDE EM + FFEAIEELERMTREPSDVLEEMN++L+
Sbjct: 52   VSRAAKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMTREPSDVLEEMNDRLS 111

Query: 1531 ARELQLVLVYFSQEGRDSWCALEVFEWLRKENRVDKETMELMVSLMCGWVKKLIEGKSEI 1352
            ARE+QLVLVYFSQEGRDSWCALEVFEWL+KENRVDKETMELMVS+MC W+KKL+EG+  +
Sbjct: 112  AREIQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGRHNV 171

Query: 1351 GEVVDLLVDMDCVGLRPSFSMIEKVISLYWEAGEKEGAVAFVKEVLRRGIAYSNDDRGQN 1172
            G+VVDLLVDMDCVGL+P FSMIEKVISLYWE GEKE AV FVKEVL R +A+  DD   +
Sbjct: 172  GDVVDLLVDMDCVGLKPHFSMIEKVISLYWEMGEKEKAVFFVKEVLGRNLAFMKDDWEGH 231

Query: 1171 KGGPTGYLAWKMMVEGNYKDAVKLVIQIRECGLKPEVYSYLIAMTAVVKELNEVAKALRK 992
            KGGP+GYLAWKMMV+G+Y+ AVK+V+ +RE GL+PEVYSYLIAMTAVVKELNE AKALRK
Sbjct: 232  KGGPSGYLAWKMMVDGDYRGAVKMVLHLRESGLRPEVYSYLIAMTAVVKELNEFAKALRK 291

Query: 991  LKGFAKAGLVAELDLENTLLIEKYQAELLDYGVSLSNCVIKEESPSLYGVVHERLLAMYI 812
            LKG+A+ G VAELD  N  L+ KYQ ELL  GV LSN V++E S S+ GVVHERLLAMYI
Sbjct: 292  LKGYARDGFVAELDKNNVELVAKYQTELLADGVQLSNWVLEEGSSSIRGVVHERLLAMYI 351

Query: 811  CAGRGVEAERQLWEMKLAGKEADGDLYDIVLAICASQKEAGAIGRLLTRMEVTNSLRRKK 632
            CAG+GVEAERQLWEMKL GKEAD DLYDIVLAICASQKE  A+ RLLTR+E+T+ + +KK
Sbjct: 352  CAGQGVEAERQLWEMKLVGKEADADLYDIVLAICASQKETKAMKRLLTRIEITSPMIKKK 411

Query: 631  TLSWLLRGYIKGGHFDNAAETVMKMLDSGLFPEFLDRAAVLQGLRRRIQQSGILDIYLEL 452
            +L+WLLRGYIKGGHF +AA T++KM++ G  PE+LDR AVLQGLR+ I++   +  YL+L
Sbjct: 412  SLTWLLRGYIKGGHFRDAAGTLVKMINLGFLPEYLDRVAVLQGLRKEIREPESVHTYLDL 471

Query: 451  CKRLSDANLIGPCL 410
            CK LSDANLIGP L
Sbjct: 472  CKCLSDANLIGPSL 485


>ref|XP_004500294.1| PREDICTED: pentatricopeptide repeat-containing protein At2g30100,
            chloroplastic-like [Cicer arietinum]
          Length = 508

 Score =  615 bits (1587), Expect = e-173
 Identities = 311/440 (70%), Positives = 367/440 (83%), Gaps = 8/440 (1%)
 Frame = -2

Query: 1699 SKSREFRLFSSVELDTFVTSDDEGE-------MSEVFFEAIEELERMTREPSDVLEEMNE 1541
            +K   +    SVELD FVTSDDE E       M + F EAIEELERMTREPSDVLEEMN+
Sbjct: 57   TKPNSYMRKKSVELDQFVTSDDEEEEEEEEEEMGDGFLEAIEELERMTREPSDVLEEMND 116

Query: 1540 KLTARELQLVLVYFSQEGRDSWCALEVFEWLRKENRVDKETMELMVSLMCGWVKKLIEGK 1361
            +L+ARELQLVLVYFSQEGRDSWCALEVF+WLRKENRVDKETMELMV++MCGWVKKLI  K
Sbjct: 117  RLSARELQLVLVYFSQEGRDSWCALEVFDWLRKENRVDKETMELMVAIMCGWVKKLIMEK 176

Query: 1360 SEIGEVVDLLVDMDCVGLRPSFSMIEKVISLYWEAGEKEGAVAFVKEVLRRGIAYSNDDR 1181
              + +V+DLLV+M+CVGLRP FSMIEKVISLYWE GEK+ AV FV+EVLRRGI+ + DD 
Sbjct: 177  HGVDDVIDLLVNMNCVGLRPGFSMIEKVISLYWEMGEKDDAVLFVEEVLRRGISSNEDD- 235

Query: 1180 GQNKGGPTGYLAWKMMVEGNYKDAVKLVIQIRECGLKPEVYSYLIAMTAVVKELNEVAKA 1001
               KGGPTGYLAWKMMVEG+Y+ AV+LV + RE GLKP++YSYL+AMTAVVKELNE+AKA
Sbjct: 236  -PEKGGPTGYLAWKMMVEGDYRGAVRLVTRFREAGLKPDIYSYLVAMTAVVKELNELAKA 294

Query: 1000 LRKLKGFAKAGLVAELDLENTLLIEKYQAELLDYGVSLSNCVIKEESP-SLYGVVHERLL 824
            LRKLK F++AGL+ E D E+  L EKYQ++LL  G  LS  VI++ SP S++G++HERLL
Sbjct: 295  LRKLKSFSRAGLITEFDREDVELAEKYQSDLLADGARLSKWVIQDGSPSSIHGIIHERLL 354

Query: 823  AMYICAGRGVEAERQLWEMKLAGKEADGDLYDIVLAICASQKEAGAIGRLLTRMEVTNSL 644
            AMYICAGRG+EAERQLWEMKL GKEA G LYD+VLAICASQKEA A  RL+ RMEV +S 
Sbjct: 355  AMYICAGRGIEAERQLWEMKLLGKEAVGGLYDMVLAICASQKEAAATARLMIRMEVASSP 414

Query: 643  RRKKTLSWLLRGYIKGGHFDNAAETVMKMLDSGLFPEFLDRAAVLQGLRRRIQQSGILDI 464
            ++KK+LSWLLRGYIKGGHF+ AAETVMKML+ G +P++LDR AV+QGLR+RIQQ G LD 
Sbjct: 415  QKKKSLSWLLRGYIKGGHFNEAAETVMKMLELGFYPDYLDRVAVMQGLRKRIQQYGNLDT 474

Query: 463  YLELCKRLSDANLIGPCLVY 404
            Y++LCK L +ANLIG C+ Y
Sbjct: 475  YIKLCKSLYEANLIGACVCY 494


>ref|NP_180571.3| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|218546771|sp|Q0WNN7.2|PP176_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At2g30100, chloroplastic; Flags: Precursor
            gi|330253250|gb|AEC08344.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 503

 Score =  582 bits (1499), Expect = e-163
 Identities = 302/491 (61%), Positives = 377/491 (76%), Gaps = 11/491 (2%)
 Frame = -2

Query: 1843 SFAHLGFSQTCSSIRRSFLLSKEFRIQSCPRVLSIICALKPDFVVERRSKSREFRLFSSV 1664
            ++AH+  S T S+I     L +  R  S      IIC LK ++      K RE  L  SV
Sbjct: 2    AYAHVFASLTISTISLRRFLPRLHRNHSVKPNSRIICNLKLNYSA---GKFREMGLSRSV 58

Query: 1663 ELDTFVTSDDE-GEMSEV---FFEAIEELERMTREPSDVLEEMNEKLTARELQLVLVYFS 1496
            ELD F+TS++E GE  E+   FFEAIEELERMTREPSD+LEEMN +L++RELQL+LVYF+
Sbjct: 59   ELDQFITSEEEEGEAEEIGEGFFEAIEELERMTREPSDILEEMNHRLSSRELQLMLVYFA 118

Query: 1495 QEGRDSWCALEVFEWLRKENRVDKETMELMVSLMCGWVKKLIEGKSEIGEVVDLLVDMDC 1316
            QEGRDSWC LEVFEWL+KENRVD+E MELMVS+MCGWVKKLIE +    +V DLL++MDC
Sbjct: 119  QEGRDSWCTLEVFEWLKKENRVDEEIMELMVSIMCGWVKKLIEDECNAHQVFDLLIEMDC 178

Query: 1315 VGLRPSFSMIEKVISLYWEAGEKEGAVAFVKEVLRR--GIAYSNDDRGQN---KGGPTGY 1151
            VGL+P FSM++KVI+LY E G+KE AV FVKEVLRR  G  YS    G +   KGGP GY
Sbjct: 179  VGLKPGFSMMDKVIALYCEMGKKESAVLFVKEVLRRRDGFGYSVVGGGGSEGRKGGPVGY 238

Query: 1150 LAWKMMVEGNYKDAVKLVIQIRECGLKPEVYSYLIAMTAVVKELNEVAKALRKLKGFAKA 971
            LAWK MV+G+Y+ AV +V+++R  GLKPE YSYLIAMTA+VKELN + K LR+LK FA+A
Sbjct: 239  LAWKFMVDGDYRKAVDMVMELRLSGLKPEAYSYLIAMTAIVKELNSLGKTLRELKRFARA 298

Query: 970  GLVAELDLENTLLIEKYQAELLDYGVSLSNCVIKE--ESPSLYGVVHERLLAMYICAGRG 797
            G VAE+D  + +LIEKYQ+E L  G+ L+   ++E  E+ S+ GVVHERLLAMYICAGRG
Sbjct: 299  GFVAEIDDHDRVLIEKYQSETLSRGLQLATWAVEEGQENDSIIGVVHERLLAMYICAGRG 358

Query: 796  VEAERQLWEMKLAGKEADGDLYDIVLAICASQKEAGAIGRLLTRMEVTNSLRRKKTLSWL 617
             EAE+QLW+MKLAG+E + DL+DIV+AICASQKE  A+ RLLTR+E   S R+KKTLSWL
Sbjct: 359  PEAEKQLWKMKLAGREPEADLHDIVMAICASQKEVNAVSRLLTRVEFMGSQRKKKTLSWL 418

Query: 616  LRGYIKGGHFDNAAETVMKMLDSGLFPEFLDRAAVLQGLRRRIQQSGILDIYLELCKRLS 437
            LRGY+KGGHF+ AAET++ M+DSGL PE++DR AV+QG+ R+IQ+   ++ Y+ LCKRL 
Sbjct: 419  LRGYVKGGHFEEAAETLVSMIDSGLHPEYIDRVAVMQGMTRKIQRPRDVEAYMSLCKRLF 478

Query: 436  DANLIGPCLVY 404
            DA L+GPCLVY
Sbjct: 479  DAGLVGPCLVY 489


>ref|XP_006410063.1| hypothetical protein EUTSA_v10016546mg [Eutrema salsugineum]
            gi|557111232|gb|ESQ51516.1| hypothetical protein
            EUTSA_v10016546mg [Eutrema salsugineum]
          Length = 503

 Score =  575 bits (1482), Expect = e-161
 Identities = 298/489 (60%), Positives = 373/489 (76%), Gaps = 10/489 (2%)
 Frame = -2

Query: 1840 FAHLGFSQTCSSIRRSFL---LSKEFRIQSCPRVLSIICALKPDFVVERRSKSREFRLFS 1670
            FA L FS   S  R  F    L + +R++   R   I C LK +F      K RE  L  
Sbjct: 7    FASLTFSPPISLRRLRFFRPRLHRNYRVKPDSR---ISCNLKFNFAA---GKFRELGLSR 60

Query: 1669 SVELDTFVTSDDEGEMSEV---FFEAIEELERMTREPSDVLEEMNEKLTARELQLVLVYF 1499
            SVELD F+TS++E +  E+   FFEAIEELERMTREPSD+LEEMN +L++RELQL+LVYF
Sbjct: 61   SVELDQFITSEEENQADEIGQGFFEAIEELERMTREPSDILEEMNHRLSSRELQLMLVYF 120

Query: 1498 SQEGRDSWCALEVFEWLRKENRVDKETMELMVSLMCGWVKKLIEGKSEIGEVVDLLVDMD 1319
            +QEGRDSWCALEVFEWL+KENRVD+E MELMVS+MCGWVKKLI+ + +  +V DLL++MD
Sbjct: 121  AQEGRDSWCALEVFEWLKKENRVDEEMMELMVSIMCGWVKKLIQEECDAAQVFDLLIEMD 180

Query: 1318 CVGLRPSFSMIEKVISLYWEAGEKEGAVAFVKEVLRR--GIAYSNDDRGQNKGGPTGYLA 1145
            CVGL+P FSM+EKVI+LY E  +KE AV FVKEVLRR     YS       KGGPTGYLA
Sbjct: 181  CVGLKPGFSMMEKVIALYCEMEKKESAVLFVKEVLRRRDTSGYSVVVSEGRKGGPTGYLA 240

Query: 1144 WKMMVEGNYKDAVKLVIQIRECGLKPEVYSYLIAMTAVVKELNEVAKALRKLKGFAKAGL 965
            WKMMV+G+YK AV LV+++R  GLKPE YSYLIAMTA+VKELN + K LR+LK F +AGL
Sbjct: 241  WKMMVDGDYKKAVDLVVELRFSGLKPEAYSYLIAMTAIVKELNSLGKTLRELKRFTRAGL 300

Query: 964  VAELDLENTLLIEKYQAELLDYGVSLSNCVIKE--ESPSLYGVVHERLLAMYICAGRGVE 791
            VAE+D  + LLIEKYQ+EL+  G+ L+   ++E  ++ S+ G VHERLL MYICAGRG E
Sbjct: 301  VAEIDDHDRLLIEKYQSELISRGLELAAWAVQEGQQNDSIIGAVHERLLGMYICAGRGPE 360

Query: 790  AERQLWEMKLAGKEADGDLYDIVLAICASQKEAGAIGRLLTRMEVTNSLRRKKTLSWLLR 611
            AE+QLW MKL G+E + DL+DIV+AICASQKE  A+ RLLTR+E   S  +KK+LSWLLR
Sbjct: 361  AEKQLWNMKLTGREPEADLHDIVMAICASQKEVNAVSRLLTRVEFMESKGKKKSLSWLLR 420

Query: 610  GYIKGGHFDNAAETVMKMLDSGLFPEFLDRAAVLQGLRRRIQQSGILDIYLELCKRLSDA 431
            GY+KGGHF+ AAET++ M+DSGL+PE++DR AV+QG+ ++IQ+   ++ Y+ LCKRL DA
Sbjct: 421  GYVKGGHFEEAAETLITMMDSGLYPEYIDRVAVMQGMTKKIQRPRDVEAYMGLCKRLFDA 480

Query: 430  NLIGPCLVY 404
             L+GPCLVY
Sbjct: 481  GLVGPCLVY 489


>ref|XP_006293981.1| hypothetical protein CARUB_v10022972mg [Capsella rubella]
            gi|482562689|gb|EOA26879.1| hypothetical protein
            CARUB_v10022972mg [Capsella rubella]
          Length = 505

 Score =  569 bits (1466), Expect = e-159
 Identities = 290/475 (61%), Positives = 365/475 (76%), Gaps = 11/475 (2%)
 Frame = -2

Query: 1795 SFLLSKEFRIQSCPRVLSIICALKPDFVVERRSKSREFRLFSSVELDTFVTSDDEG---- 1628
            S  L + +R      V  I C LK ++      K R+ +L  SVELD F+TS++EG    
Sbjct: 20   SISLRRVYRTPGVKSVSRISCNLKLNYSA---GKFRDLKLSRSVELDQFITSEEEGGEEA 76

Query: 1627 --EMSEVFFEAIEELERMTREPSDVLEEMNEKLTARELQLVLVYFSQEGRDSWCALEVFE 1454
              E+ E FFEAIEELERMTREPSDVLEEMN +L++RELQL+LVYF+QEGRDSWC LEVFE
Sbjct: 77   EDEIGEGFFEAIEELERMTREPSDVLEEMNHRLSSRELQLMLVYFAQEGRDSWCTLEVFE 136

Query: 1453 WLRKENRVDKETMELMVSLMCGWVKKLIEGKSEIGEVVDLLVDMDCVGLRPSFSMIEKVI 1274
            WL+KENRVD++ +ELMVS+MCGWVKKLI+ +    +V DLL++MDCVGL+P FSM+EKVI
Sbjct: 137  WLKKENRVDEQMVELMVSIMCGWVKKLIQEECGADQVFDLLIEMDCVGLKPGFSMMEKVI 196

Query: 1273 SLYWEAGEKEGAVAFVKEVLRR--GIAYSNDDRGQN-KGGPTGYLAWKMMVEGNYKDAVK 1103
            +LY E G+KE AV FVKEVLRR  G  YS     +  KGGP GYLAWK+MV+G+YK AV 
Sbjct: 197  ALYCEMGKKESAVLFVKEVLRRRDGFGYSVVGGSEGRKGGPVGYLAWKLMVDGDYKKAVD 256

Query: 1102 LVIQIRECGLKPEVYSYLIAMTAVVKELNEVAKALRKLKGFAKAGLVAELDLENTLLIEK 923
            LV+++R  GL PE YSYLIAMTA+VKELN + K LR+LK F +AG V E+D  + +LIEK
Sbjct: 257  LVVELRLSGLMPEAYSYLIAMTAIVKELNSLGKTLRELKRFTRAGYVTEIDDHDRVLIEK 316

Query: 922  YQAELLDYGVSLSNCVIKE--ESPSLYGVVHERLLAMYICAGRGVEAERQLWEMKLAGKE 749
            YQ+E L  G+ L+   ++E  +  S+ GVVHERLLAMYICAGRG EAE+QLW+MKLAG+E
Sbjct: 317  YQSETLSRGLQLATWAVEEGQQEDSIIGVVHERLLAMYICAGRGPEAEKQLWKMKLAGRE 376

Query: 748  ADGDLYDIVLAICASQKEAGAIGRLLTRMEVTNSLRRKKTLSWLLRGYIKGGHFDNAAET 569
             + +L+DIV+AICASQKE  A+ RLLTR+E   S R+KKTLSWLLRGY+KGGHF+ AAET
Sbjct: 377  PEAELHDIVMAICASQKEVNAVSRLLTRVEFMESKRKKKTLSWLLRGYVKGGHFEEAAET 436

Query: 568  VMKMLDSGLFPEFLDRAAVLQGLRRRIQQSGILDIYLELCKRLSDANLIGPCLVY 404
            ++ M+DSGL PE++DR AV+QG+ R+IQ+   ++ Y+ LCKRL DA L+GPCLVY
Sbjct: 437  LITMIDSGLHPEYIDRVAVMQGMTRKIQRPRDIEAYMGLCKRLFDAGLVGPCLVY 491


Top