BLASTX nr result

ID: Catharanthus23_contig00005845 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00005845
         (2323 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EMJ06247.1| hypothetical protein PRUPE_ppa004609mg [Prunus pe...   729   0.0  
ref|XP_002278434.1| PREDICTED: pentatricopeptide repeat-containi...   711   0.0  
ref|XP_002313976.1| ubiquitin family protein [Populus trichocarp...   705   0.0  
ref|XP_002521193.1| conserved hypothetical protein [Ricinus comm...   705   0.0  
gb|EOY34562.1| Pentatricopeptide repeat-containing protein [Theo...   692   0.0  
ref|XP_004296059.1| PREDICTED: uncharacterized protein LOC101292...   691   0.0  
gb|EXB37964.1| hypothetical protein L484_011688 [Morus notabilis]     687   0.0  
ref|XP_004239038.1| PREDICTED: pentatricopeptide repeat-containi...   684   0.0  
ref|XP_006348674.1| PREDICTED: pentatricopeptide repeat-containi...   679   0.0  
gb|ESW18802.1| hypothetical protein PHAVU_006G071400g [Phaseolus...   677   0.0  
ref|XP_003551233.1| PREDICTED: pentatricopeptide repeat-containi...   674   0.0  
ref|XP_006425116.1| hypothetical protein CICLE_v10028251mg [Citr...   668   0.0  
ref|XP_006488563.1| PREDICTED: pentatricopeptide repeat-containi...   667   0.0  
ref|XP_003538312.1| PREDICTED: pentatricopeptide repeat-containi...   666   0.0  
ref|XP_004168796.1| PREDICTED: LOW QUALITY PROTEIN: pentatricope...   647   0.0  
ref|XP_004143220.1| PREDICTED: uncharacterized protein LOC101207...   645   0.0  
ref|XP_004500294.1| PREDICTED: pentatricopeptide repeat-containi...   615   e-173
ref|NP_180571.3| pentatricopeptide repeat-containing protein [Ar...   582   e-163
ref|XP_006410063.1| hypothetical protein EUTSA_v10016546mg [Eutr...   575   e-161
ref|XP_006293981.1| hypothetical protein CARUB_v10022972mg [Caps...   569   e-159

>gb|EMJ06247.1| hypothetical protein PRUPE_ppa004609mg [Prunus persica]
          Length = 500

 Score =  729 bits (1881), Expect = 0.0
 Identities = 375/490 (76%), Positives = 421/490 (85%), Gaps = 2/490 (0%)
 Frame = +1

Query: 454  MVALDGIASFAHLGFSQTCSSIRRSFLLSKEFRIQSCPRVLSIICA-LKPDFVVERRSKS 630
            M +  G+AS  H  F+      R+ F+  + F  QSC RV   IC   KP+F+V + SK 
Sbjct: 1    MASAQGLASLTHSLFAVK----RQRFMGLRGFSAQSCGRVFPRICKHQKPNFIVAKSSKV 56

Query: 631  REFRLFSSVELDTFVTSDDEGEMSEVFFEAIEELERMTREPSDVLEEMNEKLTARELQLV 810
            R+FRLF SVELD F+TSDDE EM E FFEAIEELERMTREPSDVLEEMN++L+ARELQLV
Sbjct: 57   RDFRLFKSVELDQFLTSDDEDEMGEGFFEAIEELERMTREPSDVLEEMNDRLSARELQLV 116

Query: 811  LVYFSQEGRDSWCALEVFEWLRKENRVDKETMELMVSLMCGWVKKLIEGKSEIGEVVDLL 990
            LVYFSQEGRDSWCALEVFEWLRKENRVDKETM+LMVS+MC WVKKLI+ + +IG+VVDLL
Sbjct: 117  LVYFSQEGRDSWCALEVFEWLRKENRVDKETMDLMVSIMCSWVKKLIQREHDIGDVVDLL 176

Query: 991  VDMDCVGLRPSFSMIEKVISLYWEAGEKEGAVAFVKEVLRRGIAYSN-DDRGQNKGGPTG 1167
            VDMDCVGL+PSFSM+EKVISLYWE GEKE AV FVKEVL+RGI YS  DD   +KGGPTG
Sbjct: 177  VDMDCVGLKPSFSMMEKVISLYWEMGEKEKAVLFVKEVLKRGIVYSEEDDTDGHKGGPTG 236

Query: 1168 YLAWKMMVEGNYKDAVKLVIQIRECGLKPEVYSYLIAMTAVVKELNEVAKALRKLKGFAK 1347
            YLAWKMMVEGNY+D+VKLVI +RE GLKPEVYSYLIAMTAVVKELNE+AKALRKLKGF +
Sbjct: 237  YLAWKMMVEGNYRDSVKLVIHLRESGLKPEVYSYLIAMTAVVKELNELAKALRKLKGFTR 296

Query: 1348 AGLVAELDLENTLLIEKYQAELLDYGVSLSNCVIKEESPSLYGVVHERLLAMYICAGRGV 1527
            AGL+AE D EN  LIEKYQ++LL  GV LSN VI+E S SL+GVVHERLLAMYIC+G G+
Sbjct: 297  AGLIAEFDTENVGLIEKYQSDLLSDGVQLSNWVIQEGSSSLHGVVHERLLAMYICSGHGL 356

Query: 1528 EAERQLWEMKLAGKEADGDLYDIVLAICASQKEAGAIGRLLTRMEVTNSLRRKKTLSWLL 1707
            EAERQLWEMKL GKEAD DLYDIVLAICASQKEA AIGRLLTR EVT+SLR+KK+LSWLL
Sbjct: 357  EAERQLWEMKLVGKEADADLYDIVLAICASQKEASAIGRLLTRTEVTSSLRKKKSLSWLL 416

Query: 1708 RGYIKGGHFDNAAETVMKMLDSGLFPEFLDRAAVLQGLRRRIQQSGILDIYLELCKRLSD 1887
            RGYIKGGHFD+AAETV+KMLD GL PEFLDRAAVLQGLR+ IQ+SG +D YL+LCKRLSD
Sbjct: 417  RGYIKGGHFDDAAETVIKMLDLGLCPEFLDRAAVLQGLRKSIQESGGVDTYLKLCKRLSD 476

Query: 1888 ANLIGPCLVY 1917
            A+LIGPCLVY
Sbjct: 477  ASLIGPCLVY 486


>ref|XP_002278434.1| PREDICTED: pentatricopeptide repeat-containing protein At2g30100,
            chloroplastic [Vitis vinifera]
          Length = 511

 Score =  711 bits (1834), Expect = 0.0
 Identities = 369/487 (75%), Positives = 413/487 (84%), Gaps = 5/487 (1%)
 Frame = +1

Query: 472  IASFAHLGFSQTCS-SIRRSFLL----SKEFRIQSCPRVLSIICALKPDFVVERRSKSRE 636
            + S   LGF+ + S SI+R  L+    S+ F  + C R  +I     P FVV +R K RE
Sbjct: 11   LMSPTELGFTLSSSFSIQRPRLIVPKFSRSFLGEYCSRATTICNHQNPRFVVPKRDKIRE 70

Query: 637  FRLFSSVELDTFVTSDDEGEMSEVFFEAIEELERMTREPSDVLEEMNEKLTARELQLVLV 816
            FRLF SVELD F+TSDDE EMSE FFEAIEELERMTREPSDVLEEMN++L+ARELQLVLV
Sbjct: 71   FRLFKSVELDQFLTSDDEDEMSEGFFEAIEELERMTREPSDVLEEMNDRLSARELQLVLV 130

Query: 817  YFSQEGRDSWCALEVFEWLRKENRVDKETMELMVSLMCGWVKKLIEGKSEIGEVVDLLVD 996
            YFSQEGRDSWCALEVFEWLRKENRVDKETMELMVS+MC WVKKLIEG+ ++G+VVDLLVD
Sbjct: 131  YFSQEGRDSWCALEVFEWLRKENRVDKETMELMVSIMCSWVKKLIEGEHDVGDVVDLLVD 190

Query: 997  MDCVGLRPSFSMIEKVISLYWEAGEKEGAVAFVKEVLRRGIAYSNDDRGQNKGGPTGYLA 1176
            MDCVGL+P FSMIEKVISLYWE  EKE AV FVKEVLRR IAYS DD   +KGGPTGYLA
Sbjct: 191  MDCVGLKPGFSMIEKVISLYWEMEEKEKAVLFVKEVLRREIAYSEDDGDGHKGGPTGYLA 250

Query: 1177 WKMMVEGNYKDAVKLVIQIRECGLKPEVYSYLIAMTAVVKELNEVAKALRKLKGFAKAGL 1356
            WKMM EGNY+ AVKLVI +RE GLKPEVYSYLIAMTAVVKELNE AKALRKLKGF K+GL
Sbjct: 251  WKMMAEGNYRGAVKLVIHLRESGLKPEVYSYLIAMTAVVKELNEFAKALRKLKGFTKSGL 310

Query: 1357 VAELDLENTLLIEKYQAELLDYGVSLSNCVIKEESPSLYGVVHERLLAMYICAGRGVEAE 1536
            +AELD EN  LIEKYQ++LL  GV LS+ VI+E    L+GVV+ERLLAMYICAGRG+EAE
Sbjct: 311  IAELDAENVELIEKYQSDLLADGVRLSSWVIQEGRSPLHGVVYERLLAMYICAGRGLEAE 370

Query: 1537 RQLWEMKLAGKEADGDLYDIVLAICASQKEAGAIGRLLTRMEVTNSLRRKKTLSWLLRGY 1716
            RQLWEMKL GKEAD +LYDIVLAICAS+KEA AI RLLT MEVT+S+RRKKTLSWLLRGY
Sbjct: 371  RQLWEMKLVGKEADRELYDIVLAICASKKEASAISRLLTGMEVTSSIRRKKTLSWLLRGY 430

Query: 1717 IKGGHFDNAAETVMKMLDSGLFPEFLDRAAVLQGLRRRIQQSGILDIYLELCKRLSDANL 1896
            IKG HFD+A+ET++KMLD GL PE+LDRAAVLQGLR RIQQ+G ++ YL+LCK LSDANL
Sbjct: 431  IKGSHFDDASETIIKMLDLGLCPEYLDRAAVLQGLRNRIQQTGNVETYLKLCKHLSDANL 490

Query: 1897 IGPCLVY 1917
            IGPCLVY
Sbjct: 491  IGPCLVY 497


>ref|XP_002313976.1| ubiquitin family protein [Populus trichocarpa]
            gi|222850384|gb|EEE87931.1| ubiquitin family protein
            [Populus trichocarpa]
          Length = 500

 Score =  705 bits (1820), Expect = 0.0
 Identities = 353/476 (74%), Positives = 412/476 (86%), Gaps = 7/476 (1%)
 Frame = +1

Query: 511  SSIRRSFLLSKEFR---IQSCPRVLSIICAL----KPDFVVERRSKSREFRLFSSVELDT 669
            S +   F L K +    ++ C  V +IIC      +P+FVV + +K REFRLF SVELD 
Sbjct: 11   SKVSPVFSLKKRYWNSCMKPCCMVSTIICNYQTPKRPNFVVAKTTKVREFRLFKSVELDQ 70

Query: 670  FVTSDDEGEMSEVFFEAIEELERMTREPSDVLEEMNEKLTARELQLVLVYFSQEGRDSWC 849
            +VTSDDE EM E FFEAIEELERMTREPSD+LEEMN++L+ARELQLVLVYFSQEGRDSWC
Sbjct: 71   YVTSDDEEEMGEGFFEAIEELERMTREPSDILEEMNDRLSARELQLVLVYFSQEGRDSWC 130

Query: 850  ALEVFEWLRKENRVDKETMELMVSLMCGWVKKLIEGKSEIGEVVDLLVDMDCVGLRPSFS 1029
            ALEVFEWLRKENRVDKETMELMVS+MC WVKKLIEG+ ++G+VVDLLVDMDCVGL+PSFS
Sbjct: 131  ALEVFEWLRKENRVDKETMELMVSIMCSWVKKLIEGEQDVGDVVDLLVDMDCVGLKPSFS 190

Query: 1030 MIEKVISLYWEAGEKEGAVAFVKEVLRRGIAYSNDDRGQNKGGPTGYLAWKMMVEGNYKD 1209
            MIEKVISLYW+ G+KEGAV+FVKEVLRRGIAYS DD    KGGPTGYL WKMMV+GNY++
Sbjct: 191  MIEKVISLYWDMGKKEGAVSFVKEVLRRGIAYSGDDGEGQKGGPTGYLTWKMMVDGNYRN 250

Query: 1210 AVKLVIQIRECGLKPEVYSYLIAMTAVVKELNEVAKALRKLKGFAKAGLVAELDLENTLL 1389
            AVKLVI +RE GLKPE+Y+YLIAMTAVVKELNE +KALRKLKG++++G+V ELD EN  L
Sbjct: 251  AVKLVIHLRESGLKPEIYAYLIAMTAVVKELNEFSKALRKLKGYSRSGMVTELDAENVEL 310

Query: 1390 IEKYQAELLDYGVSLSNCVIKEESPSLYGVVHERLLAMYICAGRGVEAERQLWEMKLAGK 1569
            +EKYQ++LL  GV LS+ VI+E SP+LYGVVHERLLAMYICAGRG++AERQLWEMKL GK
Sbjct: 311  VEKYQSDLLADGVCLSSWVIQEGSPALYGVVHERLLAMYICAGRGLDAERQLWEMKLVGK 370

Query: 1570 EADGDLYDIVLAICASQKEAGAIGRLLTRMEVTNSLRRKKTLSWLLRGYIKGGHFDNAAE 1749
            EADGDLYDIVLAICASQKEA A+ RLLTR+EV +S+R+KK+LSWLLRGYIKGGH+  AAE
Sbjct: 371  EADGDLYDIVLAICASQKEASAVARLLTRIEVASSMRKKKSLSWLLRGYIKGGHYGEAAE 430

Query: 1750 TVMKMLDSGLFPEFLDRAAVLQGLRRRIQQSGILDIYLELCKRLSDANLIGPCLVY 1917
            T++KMLD GL P++LDR AV+QGLR+RIQQ G ++ YL+LCKRLSD NLIGP LVY
Sbjct: 431  TLIKMLDLGLSPDYLDRVAVMQGLRKRIQQWGNVESYLKLCKRLSDVNLIGPSLVY 486


>ref|XP_002521193.1| conserved hypothetical protein [Ricinus communis]
            gi|223539607|gb|EEF41193.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 499

 Score =  705 bits (1819), Expect = 0.0
 Identities = 360/473 (76%), Positives = 409/473 (86%), Gaps = 5/473 (1%)
 Frame = +1

Query: 514  SIRRSFLLSKEFRIQSCPRVLSIICALKP---DFVVERRSKSR--EFRLFSSVELDTFVT 678
            S+  +FLL + +++ + PR   +     P   +FVV ++SKSR  EFR+  SVELD ++ 
Sbjct: 14   SVSSTFLLQRRYKLLN-PRFFQLSSIKFPKSSNFVVAQQSKSRNREFRVLKSVELDQYIA 72

Query: 679  SDDEGEMSEVFFEAIEELERMTREPSDVLEEMNEKLTARELQLVLVYFSQEGRDSWCALE 858
            SDDE EMSE FFEAIEELERMTREPSDVLEEMN+KL+ARELQLVLVYFSQEGRDSWCALE
Sbjct: 73   SDDEEEMSEGFFEAIEELERMTREPSDVLEEMNDKLSARELQLVLVYFSQEGRDSWCALE 132

Query: 859  VFEWLRKENRVDKETMELMVSLMCGWVKKLIEGKSEIGEVVDLLVDMDCVGLRPSFSMIE 1038
            VFEWLRKENRVDKETMELMVS+MC W+KKLIEG+ EIG+VVDLLVDMDCVGL+PSFSMIE
Sbjct: 133  VFEWLRKENRVDKETMELMVSIMCSWIKKLIEGEHEIGDVVDLLVDMDCVGLKPSFSMIE 192

Query: 1039 KVISLYWEAGEKEGAVAFVKEVLRRGIAYSNDDRGQNKGGPTGYLAWKMMVEGNYKDAVK 1218
            KVISLYWE GEKE +V+FVKEVLRR +AY  DD    KGGPTGYLAWKMMV+GNY+DAVK
Sbjct: 193  KVISLYWEIGEKEKSVSFVKEVLRREVAYFEDDGEGQKGGPTGYLAWKMMVDGNYRDAVK 252

Query: 1219 LVIQIRECGLKPEVYSYLIAMTAVVKELNEVAKALRKLKGFAKAGLVAELDLENTLLIEK 1398
            LVI  RE GLKPEVYSYLIAMTAVVKELNE AKALRKLKGFAK+GL+AELD ENT LIEK
Sbjct: 253  LVIHFRESGLKPEVYSYLIAMTAVVKELNEFAKALRKLKGFAKSGLIAELDAENTRLIEK 312

Query: 1399 YQAELLDYGVSLSNCVIKEESPSLYGVVHERLLAMYICAGRGVEAERQLWEMKLAGKEAD 1578
            YQ++L+  GV LS+ VI+E SPSLYGVVHERLLAMYICAGRG++AERQLWEMKL GK AD
Sbjct: 313  YQSDLIADGVCLSSWVIQEGSPSLYGVVHERLLAMYICAGRGLDAERQLWEMKLVGKHAD 372

Query: 1579 GDLYDIVLAICASQKEAGAIGRLLTRMEVTNSLRRKKTLSWLLRGYIKGGHFDNAAETVM 1758
            GDLYDIVLAICASQKEA A+ RLLTR+EVT+SL++KKTLSWLLRGY+KGG +D AAE ++
Sbjct: 373  GDLYDIVLAICASQKEASAVSRLLTRVEVTSSLQKKKTLSWLLRGYLKGGQYDEAAEALV 432

Query: 1759 KMLDSGLFPEFLDRAAVLQGLRRRIQQSGILDIYLELCKRLSDANLIGPCLVY 1917
            KMLD GL P++LDR AVLQGLR+RIQQ G ++ YL LCKRLSD NLIGP LVY
Sbjct: 433  KMLDMGLCPDYLDRVAVLQGLRKRIQQWGNVESYLNLCKRLSDENLIGPSLVY 485


>gb|EOY34562.1| Pentatricopeptide repeat-containing protein [Theobroma cacao]
          Length = 504

 Score =  692 bits (1785), Expect = 0.0
 Identities = 346/455 (76%), Positives = 403/455 (88%), Gaps = 2/455 (0%)
 Frame = +1

Query: 559  SCPRVLSIICALK-PDFVVER-RSKSREFRLFSSVELDTFVTSDDEGEMSEVFFEAIEEL 732
            S  R+ + IC  + P FV+ + + K+RE RLF SVELD F+TSDDE EMSE FFEAIEEL
Sbjct: 36   SLRRISTRICNHQNPSFVLRKIQPKTRECRLFKSVELDQFLTSDDEDEMSEGFFEAIEEL 95

Query: 733  ERMTREPSDVLEEMNEKLTARELQLVLVYFSQEGRDSWCALEVFEWLRKENRVDKETMEL 912
            ERMTREPSD+LEEMN++L++RELQLVLVYFSQEGRDSWCALEVFEWL+KEN+VD ETMEL
Sbjct: 96   ERMTREPSDILEEMNDRLSSRELQLVLVYFSQEGRDSWCALEVFEWLKKENKVDNETMEL 155

Query: 913  MVSLMCGWVKKLIEGKSEIGEVVDLLVDMDCVGLRPSFSMIEKVISLYWEAGEKEGAVAF 1092
            MVS+MC WVKKLIEG+ ++G+VVDLLVDMDCVGL+P FSMIEKVIS+YWE  +K+ AV F
Sbjct: 156  MVSIMCSWVKKLIEGEGDVGDVVDLLVDMDCVGLKPGFSMIEKVISMYWEMEKKDRAVVF 215

Query: 1093 VKEVLRRGIAYSNDDRGQNKGGPTGYLAWKMMVEGNYKDAVKLVIQIRECGLKPEVYSYL 1272
            VKEVLRRGI+Y ++D    KGGPTGYLAWKMMVEGNY+DA+KLVI++RE GLKPE+YSYL
Sbjct: 216  VKEVLRRGISYEDEDGEGQKGGPTGYLAWKMMVEGNYRDAIKLVIELRESGLKPEIYSYL 275

Query: 1273 IAMTAVVKELNEVAKALRKLKGFAKAGLVAELDLENTLLIEKYQAELLDYGVSLSNCVIK 1452
            IAMTA+VKELNE AKALRKLKGFA++GLVAELD+EN  LI+KYQ++LL  G+ LSN  I+
Sbjct: 276  IAMTAIVKELNEFAKALRKLKGFARSGLVAELDMENVELIKKYQSDLLADGLRLSNWAIQ 335

Query: 1453 EESPSLYGVVHERLLAMYICAGRGVEAERQLWEMKLAGKEADGDLYDIVLAICASQKEAG 1632
            E + SL+G+VHERLLAMYICAGRG+EAERQLWEMKLAGKEADGDL+DIVLAICASQKEA 
Sbjct: 336  EGTSSLFGLVHERLLAMYICAGRGLEAERQLWEMKLAGKEADGDLHDIVLAICASQKEAS 395

Query: 1633 AIGRLLTRMEVTNSLRRKKTLSWLLRGYIKGGHFDNAAETVMKMLDSGLFPEFLDRAAVL 1812
            AI RLLTRMEV++SLRRKKTLSWLLRGYIKGGH  +AAETV+KMLD GL PE+LDRAAVL
Sbjct: 396  AISRLLTRMEVSSSLRRKKTLSWLLRGYIKGGHISDAAETVIKMLDLGLHPEYLDRAAVL 455

Query: 1813 QGLRRRIQQSGILDIYLELCKRLSDANLIGPCLVY 1917
            Q LR+RIQQ G ++ Y+ LCKRL DA+LIGPCL+Y
Sbjct: 456  QELRKRIQQPGNIETYVNLCKRLYDASLIGPCLIY 490


>ref|XP_004296059.1| PREDICTED: uncharacterized protein LOC101292395 [Fragaria vesca
            subsp. vesca]
          Length = 1304

 Score =  691 bits (1784), Expect = 0.0
 Identities = 353/481 (73%), Positives = 411/481 (85%), Gaps = 5/481 (1%)
 Frame = +1

Query: 490  LGFSQT---CSSIRRSFLLSKEFRIQSCPRVLSIICALK-PDFVVERRSKSREFRLFSSV 657
            + F+Q+   C +++R +  S  F  + C RV ++I   K P FVV +  K R+FRLF+SV
Sbjct: 1    MAFAQSLTQCFAVKR-YRFSGGFSGKRCSRVCNVIYKEKNPSFVVAKSGKVRDFRLFNSV 59

Query: 658  ELDTFVTSDDEGEMSEVFFEAIEELERMTREPSDVLEEMNEKLTARELQLVLVYFSQEGR 837
            +LD FVTSDDE EM E FFEAIEELERM REPSDVLEEMN++L+ARELQLVLVYFSQEGR
Sbjct: 60   QLDQFVTSDDEDEMGESFFEAIEELERMRREPSDVLEEMNDRLSARELQLVLVYFSQEGR 119

Query: 838  DSWCALEVFEWLRKENRVDKETMELMVSLMCGWVKKLIEGKSEIGEVVDLLVDMDCVGLR 1017
            DSWCALEVFEWLR+ENRVDKETMELMVS+MCGW+K+LIE  +++ +V+DLLVD+DCVGL+
Sbjct: 120  DSWCALEVFEWLRRENRVDKETMELMVSIMCGWLKRLIEEGNDVADVIDLLVDVDCVGLK 179

Query: 1018 PSFSMIEKVISLYWEAGEKEGAVAFVKEVLRRGIAYSN-DDRGQNKGGPTGYLAWKMMVE 1194
            PSFSM+EKVISLYWE GEKE AV FVKEVL+RGI YS  DDR  +KGGPTGYLAWKM V+
Sbjct: 180  PSFSMMEKVISLYWEMGEKENAVLFVKEVLKRGIVYSEEDDRDGHKGGPTGYLAWKMTVD 239

Query: 1195 GNYKDAVKLVIQIRECGLKPEVYSYLIAMTAVVKELNEVAKALRKLKGFAKAGLVAELDL 1374
            GNY+D+VK VIQ+RE GLKPEVYSYLIAMTAVVKELNE+ KALRKLK F +AGLVAE D 
Sbjct: 240  GNYRDSVKFVIQLRESGLKPEVYSYLIAMTAVVKELNELGKALRKLKAFTRAGLVAEFDS 299

Query: 1375 ENTLLIEKYQAELLDYGVSLSNCVIKEESPSLYGVVHERLLAMYICAGRGVEAERQLWEM 1554
            E+  LIEKYQ++LL  GV LSN VI+E S +L GVVHERLLAMYIC+GRG+EAERQLWEM
Sbjct: 300  EDVGLIEKYQSDLLADGVQLSNWVIQEGSSTLCGVVHERLLAMYICSGRGLEAERQLWEM 359

Query: 1555 KLAGKEADGDLYDIVLAICASQKEAGAIGRLLTRMEVTNSLRRKKTLSWLLRGYIKGGHF 1734
            KL GKE DGDLYDIVLAICAS+KE  AI RLLTR EV++SL +KK+LSWLLRGYIKGGHF
Sbjct: 360  KLVGKEPDGDLYDIVLAICASRKETSAIARLLTRTEVSSSLSKKKSLSWLLRGYIKGGHF 419

Query: 1735 DNAAETVMKMLDSGLFPEFLDRAAVLQGLRRRIQQSGILDIYLELCKRLSDANLIGPCLV 1914
            ++AAETV+KMLD GLFP++LDRAAVL GLR+RIQQSG +D YL+LCKRLSDANLI  CL+
Sbjct: 420  NDAAETVIKMLDLGLFPDYLDRAAVLHGLRKRIQQSGTVDTYLKLCKRLSDANLIESCLL 479

Query: 1915 Y 1917
            Y
Sbjct: 480  Y 480


>gb|EXB37964.1| hypothetical protein L484_011688 [Morus notabilis]
          Length = 516

 Score =  687 bits (1773), Expect = 0.0
 Identities = 358/502 (71%), Positives = 406/502 (80%), Gaps = 14/502 (2%)
 Frame = +1

Query: 454  MVALDGIASFAHLGFSQTCSSI----------RRSFLLSKEFRI--QSCPRVLSIICALK 597
            M +  G      LGF  + SS            R FL   +  +  ++  +   +IC  +
Sbjct: 1    MASAQGFTPLTELGFPSSSSSSSSSSSNSLHRNRIFLCRMDENLWGRTSAKFCPVICCKQ 60

Query: 598  --PDFVVERRSKSREFRLFSSVELDTFVTSDDEGEMSEVFFEAIEELERMTREPSDVLEE 771
              P+F+  + SK REFRLF+SVELD F+TSDDE EM E FFEAIEELERMTREPSDVLEE
Sbjct: 61   QNPNFIAPKPSKLREFRLFTSVELDQFLTSDDEEEMGEGFFEAIEELERMTREPSDVLEE 120

Query: 772  MNEKLTARELQLVLVYFSQEGRDSWCALEVFEWLRKENRVDKETMELMVSLMCGWVKKLI 951
            MN++L+ARELQLVLVYFSQEGRDSWCALEVFEWLRKENRVDKETMELMV+LMC WVKKLI
Sbjct: 121  MNDRLSARELQLVLVYFSQEGRDSWCALEVFEWLRKENRVDKETMELMVTLMCSWVKKLI 180

Query: 952  EGKSEIGEVVDLLVDMDCVGLRPSFSMIEKVISLYWEAGEKEGAVAFVKEVLRRGIAYSN 1131
            EG+ ++G+VVDLLVDM CVGLRP FSM+E VI LYWE GEK  AV+FVKEVLRRGIA   
Sbjct: 181  EGEHDVGDVVDLLVDMACVGLRPGFSMMENVILLYWEMGEKGRAVSFVKEVLRRGIACLE 240

Query: 1132 DDRGQNKGGPTGYLAWKMMVEGNYKDAVKLVIQIRECGLKPEVYSYLIAMTAVVKELNEV 1311
            DD    KGGPTGYLAWKMMVEGNY +AVKLV+ IRE GLKPEVYSYLIAMTAVVKELNE 
Sbjct: 241  DDGEGPKGGPTGYLAWKMMVEGNYMEAVKLVVDIRESGLKPEVYSYLIAMTAVVKELNEF 300

Query: 1312 AKALRKLKGFAKAGLVAELDLENTLLIEKYQAELLDYGVSLSNCVIKEESPSLYGVVHER 1491
            AKALRKLKGF +AGL AELD E+  LIEKYQ++LLD GV LSN VI+E   SL GVVHER
Sbjct: 301  AKALRKLKGFERAGLTAELDEESVELIEKYQSDLLDDGVRLSNWVIEEGITSLNGVVHER 360

Query: 1492 LLAMYICAGRGVEAERQLWEMKLAGKEADGDLYDIVLAICASQKEAGAIGRLLTRMEVTN 1671
            LLAMYICAGRG+EAERQLW+MKL GKEADGDLYDIVLAICASQKE  AI RLLTR+  ++
Sbjct: 361  LLAMYICAGRGIEAERQLWKMKLVGKEADGDLYDIVLAICASQKEGRAIARLLTRVNFSS 420

Query: 1672 SLRRKKTLSWLLRGYIKGGHFDNAAETVMKMLDSGLFPEFLDRAAVLQGLRRRIQQSGIL 1851
            +LR++K+LSWLLRGYIKGGHFDNAAETV+KMLD GL PE+LDRAAVLQGLR+RI+    +
Sbjct: 421  TLRKRKSLSWLLRGYIKGGHFDNAAETVVKMLDLGLCPEYLDRAAVLQGLRKRIKGPDTV 480

Query: 1852 DIYLELCKRLSDANLIGPCLVY 1917
            + YL+LCK LSD NLIGPCL+Y
Sbjct: 481  ETYLKLCKHLSDYNLIGPCLIY 502


>ref|XP_004239038.1| PREDICTED: pentatricopeptide repeat-containing protein At2g30100,
            chloroplastic-like [Solanum lycopersicum]
          Length = 503

 Score =  684 bits (1764), Expect = 0.0
 Identities = 357/486 (73%), Positives = 407/486 (83%), Gaps = 4/486 (0%)
 Frame = +1

Query: 472  IASFAHLGFSQTCSSIRRSFLLSKEFRIQSCPRVLSII-CALK-PDFVVERRSKSREFRL 645
            I SF +LG S+  S  R    + + +       VL  + C+ + P FV  RR+    F+L
Sbjct: 7    IVSFTYLGLSKAVSPKRCRLGIPQTWLKWRSSLVLGGVGCSSRNPSFVSPRRNG---FKL 63

Query: 646  FSSVELDTFVTSDDE--GEMSEVFFEAIEELERMTREPSDVLEEMNEKLTARELQLVLVY 819
            FSSVEL +FVTSD E   EMS+ FFEAIEELERMTREPSDVLEEMNE+L+ RELQLVLVY
Sbjct: 64   FSSVELGSFVTSDGEEKNEMSDCFFEAIEELERMTREPSDVLEEMNERLSDRELQLVLVY 123

Query: 820  FSQEGRDSWCALEVFEWLRKENRVDKETMELMVSLMCGWVKKLIEGKSEIGEVVDLLVDM 999
            F+QEGRDSWCALEVFEWLRKENRVDKETMELMVS+MCGWV+KLI  KSE G+VVDLLVDM
Sbjct: 124  FAQEGRDSWCALEVFEWLRKENRVDKETMELMVSIMCGWVQKLIGSKSEAGDVVDLLVDM 183

Query: 1000 DCVGLRPSFSMIEKVISLYWEAGEKEGAVAFVKEVLRRGIAYSNDDRGQNKGGPTGYLAW 1179
            DCVGL PSFSM+EKVISLYW+AGE+EGAV+FVKEVLRR IAYS+ +   +K GP GYLAW
Sbjct: 184  DCVGLNPSFSMVEKVISLYWDAGEREGAVSFVKEVLRRQIAYSDGNVDGHKAGPAGYLAW 243

Query: 1180 KMMVEGNYKDAVKLVIQIRECGLKPEVYSYLIAMTAVVKELNEVAKALRKLKGFAKAGLV 1359
            KMM EGNYKDAVKLVI IR+ GLKPE+YSYLIAMTAVVKELNE  KALRKLKGFA+ GLV
Sbjct: 244  KMMEEGNYKDAVKLVIDIRDSGLKPELYSYLIAMTAVVKELNEFGKALRKLKGFARTGLV 303

Query: 1360 AELDLENTLLIEKYQAELLDYGVSLSNCVIKEESPSLYGVVHERLLAMYICAGRGVEAER 1539
            AELDLEN  LIE+YQA+LL  GV LS+ +I+E  PSL+GVVHERLLAMY+CAGRG+EAER
Sbjct: 304  AELDLENLRLIEEYQADLLAEGVQLSDWLIQEGGPSLFGVVHERLLAMYVCAGRGIEAER 363

Query: 1540 QLWEMKLAGKEADGDLYDIVLAICASQKEAGAIGRLLTRMEVTNSLRRKKTLSWLLRGYI 1719
             LW+MK++GKE  GDL+DIVLAICASQKE G I RLLT ME ++SL++KKTLSWLLRGYI
Sbjct: 364  HLWQMKISGKEVSGDLHDIVLAICASQKELGPISRLLTGMEASSSLQKKKTLSWLLRGYI 423

Query: 1720 KGGHFDNAAETVMKMLDSGLFPEFLDRAAVLQGLRRRIQQSGILDIYLELCKRLSDANLI 1899
            KGGH +NAAETV+KMLD GL+P+FLDRAAVLQ LRRRIQQSG L+ YL LCK LSDA+LI
Sbjct: 424  KGGHLENAAETVIKMLDLGLYPDFLDRAAVLQRLRRRIQQSGNLETYLNLCKHLSDASLI 483

Query: 1900 GPCLVY 1917
            GPCLVY
Sbjct: 484  GPCLVY 489


>ref|XP_006348674.1| PREDICTED: pentatricopeptide repeat-containing protein At2g30100,
            chloroplastic-like [Solanum tuberosum]
          Length = 503

 Score =  679 bits (1751), Expect = 0.0
 Identities = 356/492 (72%), Positives = 408/492 (82%), Gaps = 4/492 (0%)
 Frame = +1

Query: 454  MVALDGIASFAHLGFSQTCSSIRRSFLLSKEFRIQSCPRVLSII-CALK-PDFVVERRSK 627
            M  ++ IAS  +LG S+     R    + + +       VL  + C+ + P FV  RR+ 
Sbjct: 1    MATVNEIASLTYLGLSKVVFPKRCRLGIPQTWLKWRSSWVLGGVGCSSRNPSFVNPRRNG 60

Query: 628  SREFRLFSSVELDTFVTSDDE--GEMSEVFFEAIEELERMTREPSDVLEEMNEKLTAREL 801
               F+LF+SVEL +FVTSDDE   EMS+ FFEAIEELERMTREPSDVLEEMNE+L+ REL
Sbjct: 61   ---FKLFNSVELGSFVTSDDEEKNEMSDCFFEAIEELERMTREPSDVLEEMNERLSDREL 117

Query: 802  QLVLVYFSQEGRDSWCALEVFEWLRKENRVDKETMELMVSLMCGWVKKLIEGKSEIGEVV 981
            QLVLVYF+QEGRDSWCALEVFEWLRKENRVDKETMELMVS+MCGWV+KLI  KSE G+VV
Sbjct: 118  QLVLVYFAQEGRDSWCALEVFEWLRKENRVDKETMELMVSIMCGWVQKLIGSKSEAGDVV 177

Query: 982  DLLVDMDCVGLRPSFSMIEKVISLYWEAGEKEGAVAFVKEVLRRGIAYSNDDRGQNKGGP 1161
            DLLVDMDCVGL PSFSM+EKVISLYW+AGE+EGAV+FVKEVLRR IAYS+ +   +K GP
Sbjct: 178  DLLVDMDCVGLNPSFSMVEKVISLYWDAGEREGAVSFVKEVLRRQIAYSDGNVDGHKAGP 237

Query: 1162 TGYLAWKMMVEGNYKDAVKLVIQIRECGLKPEVYSYLIAMTAVVKELNEVAKALRKLKGF 1341
             GYLAWKMM  GNYKDAVKLVI IR+ GLKPE+YSYLIAMTAVVKELNE  KALRKLKGF
Sbjct: 238  AGYLAWKMMEVGNYKDAVKLVIDIRDSGLKPELYSYLIAMTAVVKELNEFGKALRKLKGF 297

Query: 1342 AKAGLVAELDLENTLLIEKYQAELLDYGVSLSNCVIKEESPSLYGVVHERLLAMYICAGR 1521
            A+ GLVAELDLEN  LIE+YQA+LL  GV LS+ +I+E  PSL+GVVHERLLAMY+CAGR
Sbjct: 298  ARTGLVAELDLENLRLIEEYQADLLAEGVQLSDWLIQEGGPSLFGVVHERLLAMYVCAGR 357

Query: 1522 GVEAERQLWEMKLAGKEADGDLYDIVLAICASQKEAGAIGRLLTRMEVTNSLRRKKTLSW 1701
            G+EAER LW+MKL+GK+  GDL DIVLAICASQKE G I RLLT ME ++SL++KKTLSW
Sbjct: 358  GIEAERHLWQMKLSGKKVTGDLQDIVLAICASQKELGPISRLLTGMEASSSLQKKKTLSW 417

Query: 1702 LLRGYIKGGHFDNAAETVMKMLDSGLFPEFLDRAAVLQGLRRRIQQSGILDIYLELCKRL 1881
            LLRGYIKGGH +NAAETV+KMLD GL+P+FLDRAAVLQ LRRRIQQSG L+ YL LCK L
Sbjct: 418  LLRGYIKGGHLENAAETVIKMLDLGLYPDFLDRAAVLQRLRRRIQQSGSLETYLNLCKHL 477

Query: 1882 SDANLIGPCLVY 1917
            SDA+LIGPCLVY
Sbjct: 478  SDASLIGPCLVY 489


>gb|ESW18802.1| hypothetical protein PHAVU_006G071400g [Phaseolus vulgaris]
          Length = 510

 Score =  677 bits (1747), Expect = 0.0
 Identities = 338/441 (76%), Positives = 381/441 (86%), Gaps = 1/441 (0%)
 Frame = +1

Query: 598  PDFVVERRSKSREFRLFSSVELDTFVTSDDE-GEMSEVFFEAIEELERMTREPSDVLEEM 774
            P  V  +    R FR+  SVELD FVTSDDE  EM + FFEAIEELERMTREPSD+LEEM
Sbjct: 56   PSIVAAKHCSVRGFRVLKSVELDQFVTSDDEEDEMGDGFFEAIEELERMTREPSDILEEM 115

Query: 775  NEKLTARELQLVLVYFSQEGRDSWCALEVFEWLRKENRVDKETMELMVSLMCGWVKKLIE 954
            N++L+ARELQLVLVYFSQ+GRDSWCALEVF+WLRKENRVDKETMELMVS+MCGWVKKLI+
Sbjct: 116  NDRLSARELQLVLVYFSQDGRDSWCALEVFDWLRKENRVDKETMELMVSIMCGWVKKLIQ 175

Query: 955  GKSEIGEVVDLLVDMDCVGLRPSFSMIEKVISLYWEAGEKEGAVAFVKEVLRRGIAYSND 1134
             +  +G+V+DLLVDMDCVGLRP FSMIEKVISLYWE GEKEGAV FV+EVLRRGI Y+++
Sbjct: 176  EQHGVGDVIDLLVDMDCVGLRPGFSMIEKVISLYWEMGEKEGAVLFVEEVLRRGIPYASE 235

Query: 1135 DRGQNKGGPTGYLAWKMMVEGNYKDAVKLVIQIRECGLKPEVYSYLIAMTAVVKELNEVA 1314
            D+  +KGGPTGYLAWKMM EG+Y+ AV+LVI+ RE GLKPEVYSYL+AMTAVVKELNE A
Sbjct: 236  DKEGHKGGPTGYLAWKMMAEGDYRSAVRLVIRFRESGLKPEVYSYLVAMTAVVKELNEFA 295

Query: 1315 KALRKLKGFAKAGLVAELDLENTLLIEKYQAELLDYGVSLSNCVIKEESPSLYGVVHERL 1494
            KALRKLK F +AGLV ELDLE+  L EKYQ +LL  GV LSN VI++  PSLYGVVHERL
Sbjct: 296  KALRKLKSFTRAGLVTELDLEDVELAEKYQTDLLADGVRLSNWVIQDGRPSLYGVVHERL 355

Query: 1495 LAMYICAGRGVEAERQLWEMKLAGKEADGDLYDIVLAICASQKEAGAIGRLLTRMEVTNS 1674
            LAMYICAG G+EAERQLWEMKL GKEADGDLYDIVLAICASQKE  A  RLLTR+E+ NS
Sbjct: 356  LAMYICAGHGIEAERQLWEMKLVGKEADGDLYDIVLAICASQKEVNATARLLTRLELANS 415

Query: 1675 LRRKKTLSWLLRGYIKGGHFDNAAETVMKMLDSGLFPEFLDRAAVLQGLRRRIQQSGILD 1854
             ++KK+LSWLLRGYIKGGHF  AAETVMKML+ G +PE+LDRAAVLQGLR+RIQQ G LD
Sbjct: 416  PQKKKSLSWLLRGYIKGGHFTEAAETVMKMLELGFYPEYLDRAAVLQGLRKRIQQYGNLD 475

Query: 1855 IYLELCKRLSDANLIGPCLVY 1917
             Y+ LCK LSDANLIGPCLV+
Sbjct: 476  TYVRLCKSLSDANLIGPCLVH 496


>ref|XP_003551233.1| PREDICTED: pentatricopeptide repeat-containing protein At2g30100,
            chloroplastic-like [Glycine max]
          Length = 508

 Score =  674 bits (1738), Expect = 0.0
 Identities = 337/445 (75%), Positives = 386/445 (86%), Gaps = 1/445 (0%)
 Frame = +1

Query: 586  CALK-PDFVVERRSKSREFRLFSSVELDTFVTSDDEGEMSEVFFEAIEELERMTREPSDV 762
            C  K P FV  +    R FR   SVE+D +VTS+DE  MS+ FFEAIEELERMTREPSDV
Sbjct: 52   CKFKNPSFVSAKHGSLRGFRALKSVEMDQYVTSNDE--MSDGFFEAIEELERMTREPSDV 109

Query: 763  LEEMNEKLTARELQLVLVYFSQEGRDSWCALEVFEWLRKENRVDKETMELMVSLMCGWVK 942
            LEEMN++L+ARELQLVLVYFSQ+GRDSWCALEVF+WLRKENRVDKETMELMV++MCGWVK
Sbjct: 110  LEEMNDRLSARELQLVLVYFSQDGRDSWCALEVFDWLRKENRVDKETMELMVAIMCGWVK 169

Query: 943  KLIEGKSEIGEVVDLLVDMDCVGLRPSFSMIEKVISLYWEAGEKEGAVAFVKEVLRRGIA 1122
            KLI+ +  +G+VVDLLVDMDCVGLRP FSMIEKVISLYWE GEKEGAV FV+EVLRRGI 
Sbjct: 170  KLIQQQHGVGDVVDLLVDMDCVGLRPGFSMIEKVISLYWEMGEKEGAVLFVEEVLRRGIP 229

Query: 1123 YSNDDRGQNKGGPTGYLAWKMMVEGNYKDAVKLVIQIRECGLKPEVYSYLIAMTAVVKEL 1302
            Y  +D   +KGGPTGYLAWKMM EG+Y++AV+LVI+ RE GLKPE+YSYL+AMTAVVKEL
Sbjct: 230  YVEEDEEGHKGGPTGYLAWKMMAEGDYRNAVRLVIRFRESGLKPEIYSYLVAMTAVVKEL 289

Query: 1303 NEVAKALRKLKGFAKAGLVAELDLENTLLIEKYQAELLDYGVSLSNCVIKEESPSLYGVV 1482
            NE AKALRKLKGF +AGLVAELDLE+  L EKYQ++ L  GV LSN VI++ SPSL+G+V
Sbjct: 290  NEFAKALRKLKGFTRAGLVAELDLEDVELTEKYQSDTLADGVRLSNWVIQDGSPSLHGIV 349

Query: 1483 HERLLAMYICAGRGVEAERQLWEMKLAGKEADGDLYDIVLAICASQKEAGAIGRLLTRME 1662
            HERLLAMYICAG G+EAERQLWEMKL GKEADGDLYDIVLAICASQKE+ A  RLLTR+E
Sbjct: 350  HERLLAMYICAGHGIEAERQLWEMKLVGKEADGDLYDIVLAICASQKESNATARLLTRLE 409

Query: 1663 VTNSLRRKKTLSWLLRGYIKGGHFDNAAETVMKMLDSGLFPEFLDRAAVLQGLRRRIQQS 1842
            V +S ++KK+LSWLLRGYIKGGHF+ AAET+MKML+ G +PE+LDRAAVLQGLR+RIQQ 
Sbjct: 410  VVSSPQKKKSLSWLLRGYIKGGHFNEAAETIMKMLELGFYPEYLDRAAVLQGLRKRIQQY 469

Query: 1843 GILDIYLELCKRLSDANLIGPCLVY 1917
            G LD Y+ LCK LSDANLIGPCLV+
Sbjct: 470  GNLDTYVRLCKSLSDANLIGPCLVH 494


>ref|XP_006425116.1| hypothetical protein CICLE_v10028251mg [Citrus clementina]
            gi|557527050|gb|ESR38356.1| hypothetical protein
            CICLE_v10028251mg [Citrus clementina]
          Length = 502

 Score =  668 bits (1724), Expect = 0.0
 Identities = 334/440 (75%), Positives = 376/440 (85%)
 Frame = +1

Query: 598  PDFVVERRSKSREFRLFSSVELDTFVTSDDEGEMSEVFFEAIEELERMTREPSDVLEEMN 777
            P F+  + SK REFR   SVELD FVTSDDE EMSE FFEAIEELERMTREPSD+LEEMN
Sbjct: 49   PSFIATKVSKIREFRFLKSVELDQFVTSDDEDEMSEEFFEAIEELERMTREPSDILEEMN 108

Query: 778  EKLTARELQLVLVYFSQEGRDSWCALEVFEWLRKENRVDKETMELMVSLMCGWVKKLIEG 957
            ++L+ARELQLVLVYFSQEGRDSWCALEVFEWL+KENRVD ETMELMVS+MC WVKK IE 
Sbjct: 109  DRLSARELQLVLVYFSQEGRDSWCALEVFEWLKKENRVDNETMELMVSIMCSWVKKYIEE 168

Query: 958  KSEIGEVVDLLVDMDCVGLRPSFSMIEKVISLYWEAGEKEGAVAFVKEVLRRGIAYSNDD 1137
            + ++G+V+DLLVDMDCVGL+P FSMIEKVISLYWE  +KE AV FVK VL RGIAY+  D
Sbjct: 169  ERDVGDVIDLLVDMDCVGLKPGFSMIEKVISLYWEMEKKERAVLFVKAVLSRGIAYAEGD 228

Query: 1138 RGQNKGGPTGYLAWKMMVEGNYKDAVKLVIQIRECGLKPEVYSYLIAMTAVVKELNEVAK 1317
                KGGPTGYLAWKMMVEG Y DA+KLVI +RE GLKPEVYSYLIA+TAVVKELNE  K
Sbjct: 229  GEGQKGGPTGYLAWKMMVEGKYVDAIKLVIHLRESGLKPEVYSYLIALTAVVKELNEFGK 288

Query: 1318 ALRKLKGFAKAGLVAELDLENTLLIEKYQAELLDYGVSLSNCVIKEESPSLYGVVHERLL 1497
            ALRKLKG+ +AG +AELD +N  LIEKYQ++LL  G  LS+  I+E   SLYGVVHERLL
Sbjct: 289  ALRKLKGYVRAGSIAELDGKNLGLIEKYQSDLLADGSRLSSWAIQEGGSSLYGVVHERLL 348

Query: 1498 AMYICAGRGVEAERQLWEMKLAGKEADGDLYDIVLAICASQKEAGAIGRLLTRMEVTNSL 1677
            AMYICAGRG+EAERQLWEMKL GKEADGDLYDIVLAICASQ E  A+ RLL+R+EV NSL
Sbjct: 349  AMYICAGRGLEAERQLWEMKLVGKEADGDLYDIVLAICASQNEGSAVSRLLSRIEVMNSL 408

Query: 1678 RRKKTLSWLLRGYIKGGHFDNAAETVMKMLDSGLFPEFLDRAAVLQGLRRRIQQSGILDI 1857
             +KKTLSWLLRGYIKGGH ++AAET+ KMLD GL+PE++DR AVLQGLR+RIQQSG ++ 
Sbjct: 409  CKKKTLSWLLRGYIKGGHINDAAETLTKMLDLGLYPEYMDRVAVLQGLRKRIQQSGNVEA 468

Query: 1858 YLELCKRLSDANLIGPCLVY 1917
            YL LCKRLSD +LIGPCLVY
Sbjct: 469  YLNLCKRLSDTSLIGPCLVY 488


>ref|XP_006488563.1| PREDICTED: pentatricopeptide repeat-containing protein At2g30100,
            chloroplastic-like [Citrus sinensis]
          Length = 502

 Score =  667 bits (1722), Expect = 0.0
 Identities = 348/489 (71%), Positives = 400/489 (81%), Gaps = 7/489 (1%)
 Frame = +1

Query: 472  IASFAHLGFSQTCSSI-RRSFLLSKEFRIQSC-----PRVLSIICALK-PDFVVERRSKS 630
            +AS   LGF+ + + + +R  LL  + R  SC     PR+ + I   + P+F+  + SK 
Sbjct: 1    MASVPELGFALSPNFLLQRHKLLVPQLR-GSCLTRPPPRISTRIKNYQNPNFIATKVSKI 59

Query: 631  REFRLFSSVELDTFVTSDDEGEMSEVFFEAIEELERMTREPSDVLEEMNEKLTARELQLV 810
            REFR   SVELD FVTSDDE EMSE FFEAIEELERMTREPSD+LEEMN++L+ARELQLV
Sbjct: 60   REFRFLKSVELDQFVTSDDEDEMSEEFFEAIEELERMTREPSDILEEMNDRLSARELQLV 119

Query: 811  LVYFSQEGRDSWCALEVFEWLRKENRVDKETMELMVSLMCGWVKKLIEGKSEIGEVVDLL 990
            LVYFSQEGRDSWCALEVFEWL+KENRVD ETMELMVS+MC WVKK IE +  +G+VVDLL
Sbjct: 120  LVYFSQEGRDSWCALEVFEWLKKENRVDNETMELMVSIMCSWVKKYIEEERGVGDVVDLL 179

Query: 991  VDMDCVGLRPSFSMIEKVISLYWEAGEKEGAVAFVKEVLRRGIAYSNDDRGQNKGGPTGY 1170
            VDMDCVGL+P FSMIEKVISLYWE  +KE AV FVK VL RGIAY+  D    +GGPTGY
Sbjct: 180  VDMDCVGLKPGFSMIEKVISLYWEMEKKERAVLFVKAVLSRGIAYAEGDGEGQQGGPTGY 239

Query: 1171 LAWKMMVEGNYKDAVKLVIQIRECGLKPEVYSYLIAMTAVVKELNEVAKALRKLKGFAKA 1350
            LAWKMMVEG Y DA+KLVI +RE GLKPEVYSYLIA+TAVVKELNE  KALRKLKG+ +A
Sbjct: 240  LAWKMMVEGKYVDAIKLVIHLRESGLKPEVYSYLIALTAVVKELNEFGKALRKLKGYVRA 299

Query: 1351 GLVAELDLENTLLIEKYQAELLDYGVSLSNCVIKEESPSLYGVVHERLLAMYICAGRGVE 1530
            G +AELD +N  LIEKYQ++LL  G  LS+  I+E   SLYGVVHERLLAMYICAGRG+E
Sbjct: 300  GSIAELDGKNLGLIEKYQSDLLADGSRLSSWAIQEGGSSLYGVVHERLLAMYICAGRGLE 359

Query: 1531 AERQLWEMKLAGKEADGDLYDIVLAICASQKEAGAIGRLLTRMEVTNSLRRKKTLSWLLR 1710
            AERQLWEMKL GKEADGDLYDIVLAICASQ E  A+ RLL+R+EV NSL +KKTLSWLLR
Sbjct: 360  AERQLWEMKLVGKEADGDLYDIVLAICASQNEGSAVSRLLSRIEVMNSLCKKKTLSWLLR 419

Query: 1711 GYIKGGHFDNAAETVMKMLDSGLFPEFLDRAAVLQGLRRRIQQSGILDIYLELCKRLSDA 1890
            GYIKGGH ++AAET+ KMLD GL+PE++DR AVLQGLR+RIQQSG ++ YL LCKRLSD 
Sbjct: 420  GYIKGGHINDAAETLTKMLDLGLYPEYMDRVAVLQGLRKRIQQSGNVEAYLNLCKRLSDT 479

Query: 1891 NLIGPCLVY 1917
            +LIGPCLVY
Sbjct: 480  SLIGPCLVY 488


>ref|XP_003538312.1| PREDICTED: pentatricopeptide repeat-containing protein At2g30100,
            chloroplastic-like [Glycine max]
          Length = 510

 Score =  666 bits (1719), Expect = 0.0
 Identities = 339/447 (75%), Positives = 383/447 (85%), Gaps = 3/447 (0%)
 Frame = +1

Query: 586  CALK-PDFVVERRSKSREFRLFSSVELDTFVTSDDE-GEMSEVFFEAIEELERMTREPSD 759
            C  K P FV  ++   R FR   SVELD +VTSDDE  EMS+ FFEAIEELERMTREPSD
Sbjct: 52   CKFKNPSFV--KQGSIRGFRALKSVELDQYVTSDDEEDEMSDGFFEAIEELERMTREPSD 109

Query: 760  VLEEMNEKLTARELQLVLVYFSQEGRDSWCALEVFEWLRKENRVDKETMELMVSLMCGWV 939
            VLEEMN++L+ARELQLVLVYFSQ+GRDSWCALEVF+WLRKENRVDKETMELMV++MCGWV
Sbjct: 110  VLEEMNDRLSARELQLVLVYFSQDGRDSWCALEVFDWLRKENRVDKETMELMVAIMCGWV 169

Query: 940  KKLI-EGKSEIGEVVDLLVDMDCVGLRPSFSMIEKVISLYWEAGEKEGAVAFVKEVLRRG 1116
            KKLI E    +G+VVDLLVDMDCVGLRP FSMIEKVISLYWE GEKEGAV FV+EVLRRG
Sbjct: 170  KKLIQEHHGVVGDVVDLLVDMDCVGLRPGFSMIEKVISLYWEMGEKEGAVLFVEEVLRRG 229

Query: 1117 IAYSNDDRGQNKGGPTGYLAWKMMVEGNYKDAVKLVIQIRECGLKPEVYSYLIAMTAVVK 1296
            I Y  +D   +KGGPTGYLAWKMM EG+Y  AV+LVI   E GLKPEVYSYL+AMTAVVK
Sbjct: 230  IPYLEEDEEGHKGGPTGYLAWKMMAEGDYTSAVRLVIHFTESGLKPEVYSYLVAMTAVVK 289

Query: 1297 ELNEVAKALRKLKGFAKAGLVAELDLENTLLIEKYQAELLDYGVSLSNCVIKEESPSLYG 1476
            ELNE+AKALRKLK FA+ GLVAELDLE+  L EKYQ++LL  GV LSN  I++ SPSL+G
Sbjct: 290  ELNELAKALRKLKSFARTGLVAELDLEDVELTEKYQSDLLGDGVRLSNWAIQDGSPSLHG 349

Query: 1477 VVHERLLAMYICAGRGVEAERQLWEMKLAGKEADGDLYDIVLAICASQKEAGAIGRLLTR 1656
            ++HERLLAMYICAG G+EAE+QLWEMKL GKEADGDLYDIVLAICASQKE+ A  RLLTR
Sbjct: 350  IIHERLLAMYICAGHGIEAEKQLWEMKLVGKEADGDLYDIVLAICASQKESNATARLLTR 409

Query: 1657 MEVTNSLRRKKTLSWLLRGYIKGGHFDNAAETVMKMLDSGLFPEFLDRAAVLQGLRRRIQ 1836
            +EV +S ++KK+LSWLLRGYIKGGHF+ AAET+MKMLD G +PE+LDRAAVLQGLR+RIQ
Sbjct: 410  LEVASSPQKKKSLSWLLRGYIKGGHFNEAAETIMKMLDLGFYPEYLDRAAVLQGLRKRIQ 469

Query: 1837 QSGILDIYLELCKRLSDANLIGPCLVY 1917
            Q G LD Y+ LCK LSDANLIGPCLV+
Sbjct: 470  QYGNLDTYVRLCKSLSDANLIGPCLVH 496


>ref|XP_004168796.1| PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing
            protein At2g30100, chloroplastic-like [Cucumis sativus]
          Length = 501

 Score =  647 bits (1669), Expect = 0.0
 Identities = 335/496 (67%), Positives = 390/496 (78%), Gaps = 8/496 (1%)
 Frame = +1

Query: 454  MVALDGIASFAHLGFSQTCSSIRRSFLLSKEFRIQSC----PR---VLSIICALKPD-FV 609
            M+   G       GFS         F LS     Q C    PR   V  I C  +   F 
Sbjct: 1    MICAQGFTPLTQFGFS---------FSLSSPLESQRCGFSTPRLYMVSPISCNYQDSTFS 51

Query: 610  VERRSKSREFRLFSSVELDTFVTSDDEGEMSEVFFEAIEELERMTREPSDVLEEMNEKLT 789
            V R +K R+ RLF SVELD F+TSDDE EM + FFEAIEELERMTREPSDVLEEMN++L+
Sbjct: 52   VSRAAKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMTREPSDVLEEMNDRLS 111

Query: 790  ARELQLVLVYFSQEGRDSWCALEVFEWLRKENRVDKETMELMVSLMCGWVKKLIEGKSEI 969
            ARE+QLVLVYFSQEGRDSWCALEVFEWL+KENRVDKETMELMVS+MC W+KKL+EG+  +
Sbjct: 112  AREIQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGRHNV 171

Query: 970  GEVVDLLVDMDCVGLRPSFSMIEKVISLYWEAGEKEGAVAFVKEVLRRGIAYSNDDRGQN 1149
            G+VVDLLVDMDCVGL+P FSMIEKVISLYWE GEKE AV FVKEVL R +A+  DD   +
Sbjct: 172  GDVVDLLVDMDCVGLKPHFSMIEKVISLYWEMGEKEKAVFFVKEVLGRNLAFMKDDWEGH 231

Query: 1150 KGGPTGYLAWKMMVEGNYKDAVKLVIQIRECGLKPEVYSYLIAMTAVVKELNEVAKALRK 1329
            KGGP+GYLAWKMMV+G+Y+ AVK+V+ +RE GL+PEVYSYLIAMTAVVKELNE AKALRK
Sbjct: 232  KGGPSGYLAWKMMVDGDYRGAVKMVLHLRESGLRPEVYSYLIAMTAVVKELNEFAKALRK 291

Query: 1330 LKGFAKAGLVAELDLENTLLIEKYQAELLDYGVSLSNCVIKEESPSLYGVVHERLLAMYI 1509
            LKG+A+ G VAELD  N  L+ KYQ ELL  GV LSN V++E S S+ GVVHERLLAMYI
Sbjct: 292  LKGYARDGFVAELDKNNVELVAKYQTELLADGVQLSNWVLEEGSSSIRGVVHERLLAMYI 351

Query: 1510 CAGRGVEAERQLWEMKLAGKEADGDLYDIVLAICASQKEAGAIGRLLTRMEVTNSLRRKK 1689
            CAG+GVEAERQLWEMKL GKEAD DLYDIVLAICASQKE  A+ RLLTR+E+T+ + +KK
Sbjct: 352  CAGQGVEAERQLWEMKLVGKEADADLYDIVLAICASQKETKAMKRLLTRIEITSPMIKKK 411

Query: 1690 TLSWLLRGYIKGGHFDNAAETVMKMLDSGLFPEFLDRAAVLQGLRRRIQQSGILDIYLEL 1869
            +L+WLLRGYIKGGHF +AA T++KM++ G  PE+LDR AVLQGL + I++   +  YL+L
Sbjct: 412  SLTWLLRGYIKGGHFRDAAGTLVKMINLGFLPEYLDRVAVLQGLXKEIREPESVHTYLDL 471

Query: 1870 CKRLSDANLIGPCLVY 1917
            CK LSDANLIGP LVY
Sbjct: 472  CKCLSDANLIGPSLVY 487


>ref|XP_004143220.1| PREDICTED: uncharacterized protein LOC101207176 [Cucumis sativus]
          Length = 1290

 Score =  645 bits (1664), Expect = 0.0
 Identities = 334/494 (67%), Positives = 389/494 (78%), Gaps = 8/494 (1%)
 Frame = +1

Query: 454  MVALDGIASFAHLGFSQTCSSIRRSFLLSKEFRIQSC----PR---VLSIICALKPD-FV 609
            M+   G       GFS         F LS     Q C    PR   V  I C  +   F 
Sbjct: 1    MICAQGFTPLTQFGFS---------FSLSSPLESQRCGFSTPRLYMVSPISCNYQDSTFS 51

Query: 610  VERRSKSREFRLFSSVELDTFVTSDDEGEMSEVFFEAIEELERMTREPSDVLEEMNEKLT 789
            V R +K R+ RLF SVELD F+TSDDE EM + FFEAIEELERMTREPSDVLEEMN++L+
Sbjct: 52   VSRAAKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMTREPSDVLEEMNDRLS 111

Query: 790  ARELQLVLVYFSQEGRDSWCALEVFEWLRKENRVDKETMELMVSLMCGWVKKLIEGKSEI 969
            ARE+QLVLVYFSQEGRDSWCALEVFEWL+KENRVDKETMELMVS+MC W+KKL+EG+  +
Sbjct: 112  AREIQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGRHNV 171

Query: 970  GEVVDLLVDMDCVGLRPSFSMIEKVISLYWEAGEKEGAVAFVKEVLRRGIAYSNDDRGQN 1149
            G+VVDLLVDMDCVGL+P FSMIEKVISLYWE GEKE AV FVKEVL R +A+  DD   +
Sbjct: 172  GDVVDLLVDMDCVGLKPHFSMIEKVISLYWEMGEKEKAVFFVKEVLGRNLAFMKDDWEGH 231

Query: 1150 KGGPTGYLAWKMMVEGNYKDAVKLVIQIRECGLKPEVYSYLIAMTAVVKELNEVAKALRK 1329
            KGGP+GYLAWKMMV+G+Y+ AVK+V+ +RE GL+PEVYSYLIAMTAVVKELNE AKALRK
Sbjct: 232  KGGPSGYLAWKMMVDGDYRGAVKMVLHLRESGLRPEVYSYLIAMTAVVKELNEFAKALRK 291

Query: 1330 LKGFAKAGLVAELDLENTLLIEKYQAELLDYGVSLSNCVIKEESPSLYGVVHERLLAMYI 1509
            LKG+A+ G VAELD  N  L+ KYQ ELL  GV LSN V++E S S+ GVVHERLLAMYI
Sbjct: 292  LKGYARDGFVAELDKNNVELVAKYQTELLADGVQLSNWVLEEGSSSIRGVVHERLLAMYI 351

Query: 1510 CAGRGVEAERQLWEMKLAGKEADGDLYDIVLAICASQKEAGAIGRLLTRMEVTNSLRRKK 1689
            CAG+GVEAERQLWEMKL GKEAD DLYDIVLAICASQKE  A+ RLLTR+E+T+ + +KK
Sbjct: 352  CAGQGVEAERQLWEMKLVGKEADADLYDIVLAICASQKETKAMKRLLTRIEITSPMIKKK 411

Query: 1690 TLSWLLRGYIKGGHFDNAAETVMKMLDSGLFPEFLDRAAVLQGLRRRIQQSGILDIYLEL 1869
            +L+WLLRGYIKGGHF +AA T++KM++ G  PE+LDR AVLQGLR+ I++   +  YL+L
Sbjct: 412  SLTWLLRGYIKGGHFRDAAGTLVKMINLGFLPEYLDRVAVLQGLRKEIREPESVHTYLDL 471

Query: 1870 CKRLSDANLIGPCL 1911
            CK LSDANLIGP L
Sbjct: 472  CKCLSDANLIGPSL 485


>ref|XP_004500294.1| PREDICTED: pentatricopeptide repeat-containing protein At2g30100,
            chloroplastic-like [Cicer arietinum]
          Length = 508

 Score =  615 bits (1587), Expect = e-173
 Identities = 311/440 (70%), Positives = 367/440 (83%), Gaps = 8/440 (1%)
 Frame = +1

Query: 622  SKSREFRLFSSVELDTFVTSDDEGE-------MSEVFFEAIEELERMTREPSDVLEEMNE 780
            +K   +    SVELD FVTSDDE E       M + F EAIEELERMTREPSDVLEEMN+
Sbjct: 57   TKPNSYMRKKSVELDQFVTSDDEEEEEEEEEEMGDGFLEAIEELERMTREPSDVLEEMND 116

Query: 781  KLTARELQLVLVYFSQEGRDSWCALEVFEWLRKENRVDKETMELMVSLMCGWVKKLIEGK 960
            +L+ARELQLVLVYFSQEGRDSWCALEVF+WLRKENRVDKETMELMV++MCGWVKKLI  K
Sbjct: 117  RLSARELQLVLVYFSQEGRDSWCALEVFDWLRKENRVDKETMELMVAIMCGWVKKLIMEK 176

Query: 961  SEIGEVVDLLVDMDCVGLRPSFSMIEKVISLYWEAGEKEGAVAFVKEVLRRGIAYSNDDR 1140
              + +V+DLLV+M+CVGLRP FSMIEKVISLYWE GEK+ AV FV+EVLRRGI+ + DD 
Sbjct: 177  HGVDDVIDLLVNMNCVGLRPGFSMIEKVISLYWEMGEKDDAVLFVEEVLRRGISSNEDD- 235

Query: 1141 GQNKGGPTGYLAWKMMVEGNYKDAVKLVIQIRECGLKPEVYSYLIAMTAVVKELNEVAKA 1320
               KGGPTGYLAWKMMVEG+Y+ AV+LV + RE GLKP++YSYL+AMTAVVKELNE+AKA
Sbjct: 236  -PEKGGPTGYLAWKMMVEGDYRGAVRLVTRFREAGLKPDIYSYLVAMTAVVKELNELAKA 294

Query: 1321 LRKLKGFAKAGLVAELDLENTLLIEKYQAELLDYGVSLSNCVIKEESP-SLYGVVHERLL 1497
            LRKLK F++AGL+ E D E+  L EKYQ++LL  G  LS  VI++ SP S++G++HERLL
Sbjct: 295  LRKLKSFSRAGLITEFDREDVELAEKYQSDLLADGARLSKWVIQDGSPSSIHGIIHERLL 354

Query: 1498 AMYICAGRGVEAERQLWEMKLAGKEADGDLYDIVLAICASQKEAGAIGRLLTRMEVTNSL 1677
            AMYICAGRG+EAERQLWEMKL GKEA G LYD+VLAICASQKEA A  RL+ RMEV +S 
Sbjct: 355  AMYICAGRGIEAERQLWEMKLLGKEAVGGLYDMVLAICASQKEAAATARLMIRMEVASSP 414

Query: 1678 RRKKTLSWLLRGYIKGGHFDNAAETVMKMLDSGLFPEFLDRAAVLQGLRRRIQQSGILDI 1857
            ++KK+LSWLLRGYIKGGHF+ AAETVMKML+ G +P++LDR AV+QGLR+RIQQ G LD 
Sbjct: 415  QKKKSLSWLLRGYIKGGHFNEAAETVMKMLELGFYPDYLDRVAVMQGLRKRIQQYGNLDT 474

Query: 1858 YLELCKRLSDANLIGPCLVY 1917
            Y++LCK L +ANLIG C+ Y
Sbjct: 475  YIKLCKSLYEANLIGACVCY 494


>ref|NP_180571.3| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|218546771|sp|Q0WNN7.2|PP176_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At2g30100, chloroplastic; Flags: Precursor
            gi|330253250|gb|AEC08344.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 503

 Score =  582 bits (1499), Expect = e-163
 Identities = 302/491 (61%), Positives = 377/491 (76%), Gaps = 11/491 (2%)
 Frame = +1

Query: 478  SFAHLGFSQTCSSIRRSFLLSKEFRIQSCPRVLSIICALKPDFVVERRSKSREFRLFSSV 657
            ++AH+  S T S+I     L +  R  S      IIC LK ++      K RE  L  SV
Sbjct: 2    AYAHVFASLTISTISLRRFLPRLHRNHSVKPNSRIICNLKLNYSA---GKFREMGLSRSV 58

Query: 658  ELDTFVTSDDE-GEMSEV---FFEAIEELERMTREPSDVLEEMNEKLTARELQLVLVYFS 825
            ELD F+TS++E GE  E+   FFEAIEELERMTREPSD+LEEMN +L++RELQL+LVYF+
Sbjct: 59   ELDQFITSEEEEGEAEEIGEGFFEAIEELERMTREPSDILEEMNHRLSSRELQLMLVYFA 118

Query: 826  QEGRDSWCALEVFEWLRKENRVDKETMELMVSLMCGWVKKLIEGKSEIGEVVDLLVDMDC 1005
            QEGRDSWC LEVFEWL+KENRVD+E MELMVS+MCGWVKKLIE +    +V DLL++MDC
Sbjct: 119  QEGRDSWCTLEVFEWLKKENRVDEEIMELMVSIMCGWVKKLIEDECNAHQVFDLLIEMDC 178

Query: 1006 VGLRPSFSMIEKVISLYWEAGEKEGAVAFVKEVLRR--GIAYSNDDRGQN---KGGPTGY 1170
            VGL+P FSM++KVI+LY E G+KE AV FVKEVLRR  G  YS    G +   KGGP GY
Sbjct: 179  VGLKPGFSMMDKVIALYCEMGKKESAVLFVKEVLRRRDGFGYSVVGGGGSEGRKGGPVGY 238

Query: 1171 LAWKMMVEGNYKDAVKLVIQIRECGLKPEVYSYLIAMTAVVKELNEVAKALRKLKGFAKA 1350
            LAWK MV+G+Y+ AV +V+++R  GLKPE YSYLIAMTA+VKELN + K LR+LK FA+A
Sbjct: 239  LAWKFMVDGDYRKAVDMVMELRLSGLKPEAYSYLIAMTAIVKELNSLGKTLRELKRFARA 298

Query: 1351 GLVAELDLENTLLIEKYQAELLDYGVSLSNCVIKE--ESPSLYGVVHERLLAMYICAGRG 1524
            G VAE+D  + +LIEKYQ+E L  G+ L+   ++E  E+ S+ GVVHERLLAMYICAGRG
Sbjct: 299  GFVAEIDDHDRVLIEKYQSETLSRGLQLATWAVEEGQENDSIIGVVHERLLAMYICAGRG 358

Query: 1525 VEAERQLWEMKLAGKEADGDLYDIVLAICASQKEAGAIGRLLTRMEVTNSLRRKKTLSWL 1704
             EAE+QLW+MKLAG+E + DL+DIV+AICASQKE  A+ RLLTR+E   S R+KKTLSWL
Sbjct: 359  PEAEKQLWKMKLAGREPEADLHDIVMAICASQKEVNAVSRLLTRVEFMGSQRKKKTLSWL 418

Query: 1705 LRGYIKGGHFDNAAETVMKMLDSGLFPEFLDRAAVLQGLRRRIQQSGILDIYLELCKRLS 1884
            LRGY+KGGHF+ AAET++ M+DSGL PE++DR AV+QG+ R+IQ+   ++ Y+ LCKRL 
Sbjct: 419  LRGYVKGGHFEEAAETLVSMIDSGLHPEYIDRVAVMQGMTRKIQRPRDVEAYMSLCKRLF 478

Query: 1885 DANLIGPCLVY 1917
            DA L+GPCLVY
Sbjct: 479  DAGLVGPCLVY 489


>ref|XP_006410063.1| hypothetical protein EUTSA_v10016546mg [Eutrema salsugineum]
            gi|557111232|gb|ESQ51516.1| hypothetical protein
            EUTSA_v10016546mg [Eutrema salsugineum]
          Length = 503

 Score =  575 bits (1482), Expect = e-161
 Identities = 298/489 (60%), Positives = 373/489 (76%), Gaps = 10/489 (2%)
 Frame = +1

Query: 481  FAHLGFSQTCSSIRRSFL---LSKEFRIQSCPRVLSIICALKPDFVVERRSKSREFRLFS 651
            FA L FS   S  R  F    L + +R++   R   I C LK +F      K RE  L  
Sbjct: 7    FASLTFSPPISLRRLRFFRPRLHRNYRVKPDSR---ISCNLKFNFAA---GKFRELGLSR 60

Query: 652  SVELDTFVTSDDEGEMSEV---FFEAIEELERMTREPSDVLEEMNEKLTARELQLVLVYF 822
            SVELD F+TS++E +  E+   FFEAIEELERMTREPSD+LEEMN +L++RELQL+LVYF
Sbjct: 61   SVELDQFITSEEENQADEIGQGFFEAIEELERMTREPSDILEEMNHRLSSRELQLMLVYF 120

Query: 823  SQEGRDSWCALEVFEWLRKENRVDKETMELMVSLMCGWVKKLIEGKSEIGEVVDLLVDMD 1002
            +QEGRDSWCALEVFEWL+KENRVD+E MELMVS+MCGWVKKLI+ + +  +V DLL++MD
Sbjct: 121  AQEGRDSWCALEVFEWLKKENRVDEEMMELMVSIMCGWVKKLIQEECDAAQVFDLLIEMD 180

Query: 1003 CVGLRPSFSMIEKVISLYWEAGEKEGAVAFVKEVLRR--GIAYSNDDRGQNKGGPTGYLA 1176
            CVGL+P FSM+EKVI+LY E  +KE AV FVKEVLRR     YS       KGGPTGYLA
Sbjct: 181  CVGLKPGFSMMEKVIALYCEMEKKESAVLFVKEVLRRRDTSGYSVVVSEGRKGGPTGYLA 240

Query: 1177 WKMMVEGNYKDAVKLVIQIRECGLKPEVYSYLIAMTAVVKELNEVAKALRKLKGFAKAGL 1356
            WKMMV+G+YK AV LV+++R  GLKPE YSYLIAMTA+VKELN + K LR+LK F +AGL
Sbjct: 241  WKMMVDGDYKKAVDLVVELRFSGLKPEAYSYLIAMTAIVKELNSLGKTLRELKRFTRAGL 300

Query: 1357 VAELDLENTLLIEKYQAELLDYGVSLSNCVIKE--ESPSLYGVVHERLLAMYICAGRGVE 1530
            VAE+D  + LLIEKYQ+EL+  G+ L+   ++E  ++ S+ G VHERLL MYICAGRG E
Sbjct: 301  VAEIDDHDRLLIEKYQSELISRGLELAAWAVQEGQQNDSIIGAVHERLLGMYICAGRGPE 360

Query: 1531 AERQLWEMKLAGKEADGDLYDIVLAICASQKEAGAIGRLLTRMEVTNSLRRKKTLSWLLR 1710
            AE+QLW MKL G+E + DL+DIV+AICASQKE  A+ RLLTR+E   S  +KK+LSWLLR
Sbjct: 361  AEKQLWNMKLTGREPEADLHDIVMAICASQKEVNAVSRLLTRVEFMESKGKKKSLSWLLR 420

Query: 1711 GYIKGGHFDNAAETVMKMLDSGLFPEFLDRAAVLQGLRRRIQQSGILDIYLELCKRLSDA 1890
            GY+KGGHF+ AAET++ M+DSGL+PE++DR AV+QG+ ++IQ+   ++ Y+ LCKRL DA
Sbjct: 421  GYVKGGHFEEAAETLITMMDSGLYPEYIDRVAVMQGMTKKIQRPRDVEAYMGLCKRLFDA 480

Query: 1891 NLIGPCLVY 1917
             L+GPCLVY
Sbjct: 481  GLVGPCLVY 489


>ref|XP_006293981.1| hypothetical protein CARUB_v10022972mg [Capsella rubella]
            gi|482562689|gb|EOA26879.1| hypothetical protein
            CARUB_v10022972mg [Capsella rubella]
          Length = 505

 Score =  569 bits (1466), Expect = e-159
 Identities = 290/475 (61%), Positives = 365/475 (76%), Gaps = 11/475 (2%)
 Frame = +1

Query: 526  SFLLSKEFRIQSCPRVLSIICALKPDFVVERRSKSREFRLFSSVELDTFVTSDDEG---- 693
            S  L + +R      V  I C LK ++      K R+ +L  SVELD F+TS++EG    
Sbjct: 20   SISLRRVYRTPGVKSVSRISCNLKLNYSA---GKFRDLKLSRSVELDQFITSEEEGGEEA 76

Query: 694  --EMSEVFFEAIEELERMTREPSDVLEEMNEKLTARELQLVLVYFSQEGRDSWCALEVFE 867
              E+ E FFEAIEELERMTREPSDVLEEMN +L++RELQL+LVYF+QEGRDSWC LEVFE
Sbjct: 77   EDEIGEGFFEAIEELERMTREPSDVLEEMNHRLSSRELQLMLVYFAQEGRDSWCTLEVFE 136

Query: 868  WLRKENRVDKETMELMVSLMCGWVKKLIEGKSEIGEVVDLLVDMDCVGLRPSFSMIEKVI 1047
            WL+KENRVD++ +ELMVS+MCGWVKKLI+ +    +V DLL++MDCVGL+P FSM+EKVI
Sbjct: 137  WLKKENRVDEQMVELMVSIMCGWVKKLIQEECGADQVFDLLIEMDCVGLKPGFSMMEKVI 196

Query: 1048 SLYWEAGEKEGAVAFVKEVLRR--GIAYSNDDRGQN-KGGPTGYLAWKMMVEGNYKDAVK 1218
            +LY E G+KE AV FVKEVLRR  G  YS     +  KGGP GYLAWK+MV+G+YK AV 
Sbjct: 197  ALYCEMGKKESAVLFVKEVLRRRDGFGYSVVGGSEGRKGGPVGYLAWKLMVDGDYKKAVD 256

Query: 1219 LVIQIRECGLKPEVYSYLIAMTAVVKELNEVAKALRKLKGFAKAGLVAELDLENTLLIEK 1398
            LV+++R  GL PE YSYLIAMTA+VKELN + K LR+LK F +AG V E+D  + +LIEK
Sbjct: 257  LVVELRLSGLMPEAYSYLIAMTAIVKELNSLGKTLRELKRFTRAGYVTEIDDHDRVLIEK 316

Query: 1399 YQAELLDYGVSLSNCVIKE--ESPSLYGVVHERLLAMYICAGRGVEAERQLWEMKLAGKE 1572
            YQ+E L  G+ L+   ++E  +  S+ GVVHERLLAMYICAGRG EAE+QLW+MKLAG+E
Sbjct: 317  YQSETLSRGLQLATWAVEEGQQEDSIIGVVHERLLAMYICAGRGPEAEKQLWKMKLAGRE 376

Query: 1573 ADGDLYDIVLAICASQKEAGAIGRLLTRMEVTNSLRRKKTLSWLLRGYIKGGHFDNAAET 1752
             + +L+DIV+AICASQKE  A+ RLLTR+E   S R+KKTLSWLLRGY+KGGHF+ AAET
Sbjct: 377  PEAELHDIVMAICASQKEVNAVSRLLTRVEFMESKRKKKTLSWLLRGYVKGGHFEEAAET 436

Query: 1753 VMKMLDSGLFPEFLDRAAVLQGLRRRIQQSGILDIYLELCKRLSDANLIGPCLVY 1917
            ++ M+DSGL PE++DR AV+QG+ R+IQ+   ++ Y+ LCKRL DA L+GPCLVY
Sbjct: 437  LITMIDSGLHPEYIDRVAVMQGMTRKIQRPRDIEAYMGLCKRLFDAGLVGPCLVY 491

