BLASTX nr result

ID: Catharanthus23_contig00005025 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00005025
         (1998 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004246209.1| PREDICTED: pentatricopeptide repeat-containi...   329   2e-87
gb|EOY32970.1| Pentatricopeptide repeat-containing protein, puta...   315   6e-83
ref|XP_003631528.1| PREDICTED: pentatricopeptide repeat-containi...   314   8e-83
ref|XP_006437483.1| hypothetical protein CICLE_v10033975mg [Citr...   311   9e-82
ref|XP_003612457.1| Pentatricopeptide repeat-containing protein ...   308   4e-81
ref|XP_004512307.1| PREDICTED: pentatricopeptide repeat-containi...   304   8e-80
ref|XP_003516509.1| PREDICTED: pentatricopeptide repeat-containi...   296   2e-77
gb|ESW30051.1| hypothetical protein PHAVU_002G120500g [Phaseolus...   296   3e-77
gb|AFK33630.1| unknown [Lotus japonicus]                              295   6e-77
ref|XP_002519945.1| pentatricopeptide repeat-containing protein,...   285   4e-74
emb|CAN69066.1| hypothetical protein VITISV_016070 [Vitis vinifera]   283   2e-73
gb|EXB70628.1| hypothetical protein L484_023813 [Morus notabilis]     270   1e-69
ref|NP_174459.1| pentatricopeptide repeat-containing protein [Ar...   261   1e-66
ref|XP_002893686.1| pentatricopeptide repeat-containing protein ...   256   3e-65
ref|XP_006303657.1| hypothetical protein CARUB_v10011695mg [Caps...   249   4e-63
ref|XP_004292904.1| PREDICTED: pentatricopeptide repeat-containi...   230   2e-57
ref|XP_006415303.1| hypothetical protein EUTSA_v10009456mg [Eutr...   224   1e-55
ref|XP_006841553.1| hypothetical protein AMTR_s00003p00175270 [A...   208   8e-51
ref|NP_001131386.1| hypothetical protein [Zea mays] gi|194691388...   181   1e-42
ref|XP_002461747.1| hypothetical protein SORBIDRAFT_02g007340 [S...   177   2e-41

>ref|XP_004246209.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31790-like
            [Solanum lycopersicum]
          Length = 465

 Score =  329 bits (844), Expect = 2e-87
 Identities = 171/347 (49%), Positives = 227/347 (65%)
 Frame = +3

Query: 306  MDSIEVPLTLDLYVSLIKECTKKGDPLLAIELHNHVRSSGVXXXXXXXXXXXXXXXXXXX 485
            MDS+   + +D+YVSLIKECT+  DPL A+E++ HV  S V                   
Sbjct: 1    MDSLGFNIPVDVYVSLIKECTESRDPLNAVEVYEHVCKSDVIPSLPLLNRLLLMLVLCGC 60

Query: 486  XRNARELFDKSFDRNAYSWAVLIAGYLESGDYGEVIDLFLEMKSWESAEGGFYDLSNSAV 665
               AR+LFDK   RN+ SWA +IAG +E+G+    + LF+EM   +S  G      +   
Sbjct: 61   FEQARQLFDKMRVRNSQSWAAMIAGCVENGECVGALRLFMEM---QSEAGNLCKCGDLID 117

Query: 666  SGIVVCILKSCVKTMNFELGKQVHTLLIKAGCLRSAVLMSSLMWFYGEFGSLVNSEGAFN 845
             GI+VC+LK+CV+ MN E G+Q+H  L+K G   S VL S L+ FYGEFG L +++  F+
Sbjct: 118  DGILVCVLKACVELMNLEFGRQIHGWLLKLGNCESMVLNSFLIKFYGEFGYLESADNVFD 177

Query: 846  QVYNENMVVWTARIVNCCKEERFDKAVHVFREMGQQGVKRNGYTFSSILKACGKMGDDGC 1025
             V + N VVWTARI N CKEE+F+ A+ +FREM  +GVK+N +TFSSILKACGK+ D GC
Sbjct: 178  HVPHCNTVVWTARIGNLCKEEQFEGAIRIFREMVSEGVKKNSFTFSSILKACGKLRDAGC 237

Query: 1026 CGRQVHAGAVKLGLEFDSYVQCGLIDMYGKYGAVNDAKSVFNMCAHEGNAACWNALINGY 1205
            CG+Q+HA +VK+GL+ DSYV C LIDMYGKYG + DA+ VFN    + N ACWNA++ G 
Sbjct: 238  CGQQIHATSVKVGLDTDSYVLCSLIDMYGKYGLLKDARRVFNAREDKSNIACWNAMLMGC 297

Query: 1206 MQKGLCVEAIKILYEMKAAGLQPQESLLHELRSLCGSYEI*EAGSVS 1346
            +Q G  VEA+K+LYEMK AGLQP ESL++E+       E+  A S S
Sbjct: 298  IQHGFGVEAMKVLYEMKEAGLQPHESLINEVLLASTGTELAGASSSS 344


>gb|EOY32970.1| Pentatricopeptide repeat-containing protein, putative [Theobroma
            cacao]
          Length = 413

 Score =  315 bits (806), Expect = 6e-83
 Identities = 161/363 (44%), Positives = 224/363 (61%)
 Frame = +3

Query: 228  PIKQSPKKTHSNDSRTCSTTSDVLRLMDSIEVPLTLDLYVSLIKECTKKGDPLLAIELHN 407
            P   S  K  S++  +  TTSD+LRLMDS+ +P+  D+Y SL+KECT       A+ELH+
Sbjct: 58   PTPISTSKPISSNPCSSHTTSDILRLMDSLSLPIPPDIYASLVKECTVTRHSRRALELHS 117

Query: 408  HVRSSGVXXXXXXXXXXXXXXXXXXXXRNARELFDKSFDRNAYSWAVLIAGYLESGDYGE 587
            H+R+S +                      AR LFD+   R+  SWA++I   L +GD  +
Sbjct: 118  HIRNSRIKPSLPLLNRLLLMHVSCGHLDIARHLFDQMLLRDFNSWAIMIVACLHAGDSEQ 177

Query: 588  VIDLFLEMKSWESAEGGFYDLSNSAVSGIVVCILKSCVKTMNFELGKQVHTLLIKAGCLR 767
             I  F+ M+         ++L     S I+VC+LKSCV T N  LGKQVH  L+K G   
Sbjct: 178  AIAYFVRMER--------HNLLFKCPSWIIVCLLKSCVVTKNMGLGKQVHGQLLKLGASN 229

Query: 768  SAVLMSSLMWFYGEFGSLVNSEGAFNQVYNENMVVWTARIVNCCKEERFDKAVHVFREMG 947
             + L  SL+ FYG+F  L +++  FNQ+   N V WTARIVN C+E++F K +  F EMG
Sbjct: 230  DSSLSGSLINFYGKFRCLDDADFVFNQLSRRNTVTWTARIVNSCREDQFGKVIDDFNEMG 289

Query: 948  QQGVKRNGYTFSSILKACGKMGDDGCCGRQVHAGAVKLGLEFDSYVQCGLIDMYGKYGAV 1127
            +QG+K+N +TFS + KAC +M DDG  GRQVHA A+KLGLE D +VQCGLI +YGK G+V
Sbjct: 290  RQGIKKNNFTFSGVFKACARMDDDGMSGRQVHANALKLGLESDVFVQCGLIHLYGKCGSV 349

Query: 1128 NDAKSVFNMCAHEGNAACWNALINGYMQKGLCVEAIKILYEMKAAGLQPQESLLHELRSL 1307
             DA+  F +   + N ACWNA++ GY+   LC+ AIK+LY MK AG++ QESL++++R  
Sbjct: 350  RDAEKAFEIVGDKRNIACWNAMLMGYVHNELCLRAIKLLYRMKEAGIKVQESLINDVRIA 409

Query: 1308 CGS 1316
            C +
Sbjct: 410  CAT 412


>ref|XP_003631528.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31790-like
            [Vitis vinifera]
          Length = 414

 Score =  314 bits (805), Expect = 8e-83
 Identities = 162/361 (44%), Positives = 229/361 (63%), Gaps = 4/361 (1%)
 Frame = +3

Query: 246  KKTHSNDSRTCSTTSDVLRLMDSIEVPLTLDLYVSLIKECTKKGDPLLAIELHNHVRSSG 425
            KK++SN + T ST +D+LRLMD + +P+  D+Y SLIKE +  GD   A +L  H+  SG
Sbjct: 48   KKSNSNATPTTSTPTDILRLMDGLGLPIPPDIYASLIKESSTTGDATQATQLLAHINRSG 107

Query: 426  VXXXXXXXXXXXXXXXXXXXXRNARELFDKS--FDRNAYSWAVLIAGYLESGDYGEVIDL 599
            +                      AR +FDK    ++N+ SWA+++A Y+++G Y E I L
Sbjct: 108  LPLSSALLNRILLMYVSCGLIHTARHMFDKMNVLNKNSISWAIMLAAYMDNGFYEEAIFL 167

Query: 600  FLEMKSWESAEGGFYDLSNSAVSGIVVCILKSCVKTMNFELGKQVHTLLIKAGCLRSAVL 779
            F++M    S       +     + I +C+LK+CV TMN  LGKQVH  L+K G   +  L
Sbjct: 168  FVQMMELHST------IMLELPAWIFICVLKACVHTMNLTLGKQVHGWLLKVGYATNLFL 221

Query: 780  MSSLMWFYGEFGSLVNSEGAFNQVYNENMVVWTARIVNCCKEERFDKAVHVFREMGQQGV 959
               L+ FYG+F  L +++  F+Q    N V+WTA++VN C+ E   +A+  F EMG+ GV
Sbjct: 222  SCYLISFYGKFRCLDDADFVFDQTSERNTVIWTAKMVNKCQGEYMHEALVAFTEMGRAGV 281

Query: 960  KRNGYTFSSILKACGKMGDDGCCGRQVHAGAVKLGLEFDSYVQCGLIDMYGKYGAVNDAK 1139
            KRN +T+SS+L+ACG+M D G CGR +HA  +KLGLE D YVQCGL+DMYGK G + +A+
Sbjct: 282  KRNEFTYSSVLRACGRMKDHGRCGRLIHASTIKLGLESDIYVQCGLVDMYGKCGLLVEAR 341

Query: 1140 SVFNMCA--HEGNAACWNALINGYMQKGLCVEAIKILYEMKAAGLQPQESLLHELRSLCG 1313
             VF   +  ++ N  CWNA++ GY++ GL +EAIK LY+MKAAG+QPQESLL+ELR  CG
Sbjct: 342  RVFETVSDTNKTNIVCWNAMLTGYIRHGLYIEAIKFLYQMKAAGIQPQESLLNELRIACG 401

Query: 1314 S 1316
            S
Sbjct: 402  S 402


>ref|XP_006437483.1| hypothetical protein CICLE_v10033975mg [Citrus clementina]
            gi|557539679|gb|ESR50723.1| hypothetical protein
            CICLE_v10033975mg [Citrus clementina]
          Length = 425

 Score =  311 bits (796), Expect = 9e-82
 Identities = 168/366 (45%), Positives = 228/366 (62%), Gaps = 2/366 (0%)
 Frame = +3

Query: 225  QPIKQSPKKTHSNDSRTCSTTS-DVLRLMDSIEVPLTLDLYVSLIKECTKKGDPLLAIEL 401
            +P+K S     +  S   +T+S ++L LMD++ +P+T D+Y  LIKECT + D   A EL
Sbjct: 48   KPLKTSSNWRETTQSIPANTSSANILHLMDNLCLPITTDMYTCLIKECTFQKDSAGAFEL 107

Query: 402  HNHVRSS-GVXXXXXXXXXXXXXXXXXXXXRNARELFDKSFDRNAYSWAVLIAGYLESGD 578
             NH+R    +                      AR+LFD+   R+  SWAV+I GY++  D
Sbjct: 108  LNHIRKRVNIKPTLLFLNRLLLMHVSCGQLDTARQLFDEMPLRDFNSWAVMIVGYVDVAD 167

Query: 579  YGEVIDLFLEMKSWESAEGGFYDLSNSAVSGIVVCILKSCVKTMNFELGKQVHTLLIKAG 758
            Y E I LF EM   +        +     + I+VC+LK+CV TMN ELGKQVH LL K G
Sbjct: 168  YQECITLFAEMMKRKKGH-----MLLVFPAWIIVCVLKACVCTMNMELGKQVHGLLFKLG 222

Query: 759  CLRSAVLMSSLMWFYGEFGSLVNSEGAFNQVYNENMVVWTARIVNCCKEERFDKAVHVFR 938
              R+  L  SL+ FYG+F  L +++  F+Q+   N VVWTA+IVN C+E  F +  + F+
Sbjct: 223  SSRNISLTGSLINFYGKFRCLEDADFVFSQLKRHNTVVWTAKIVNNCREGHFHQVFNDFK 282

Query: 939  EMGQQGVKRNGYTFSSILKACGKMGDDGCCGRQVHAGAVKLGLEFDSYVQCGLIDMYGKY 1118
            EMG++ +K+N YTFSS+LKACG + DDG CGRQVHA  VK+GLE D YVQCGL+DMYGK 
Sbjct: 283  EMGRERIKKNSYTFSSVLKACGGVDDDGNCGRQVHANIVKIGLESDEYVQCGLVDMYGKC 342

Query: 1119 GAVNDAKSVFNMCAHEGNAACWNALINGYMQKGLCVEAIKILYEMKAAGLQPQESLLHEL 1298
              + DAK VF +   + N A WNA++ GY++ GL VEA K LY MKA+G+Q QESL+++L
Sbjct: 343  RLLRDAKRVFELIVDKKNIASWNAMLMGYIRNGLYVEATKFLYLMKASGIQIQESLINDL 402

Query: 1299 RSLCGS 1316
            R  C S
Sbjct: 403  RIACSS 408


>ref|XP_003612457.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
            gi|355513792|gb|AES95415.1| Pentatricopeptide
            repeat-containing protein [Medicago truncatula]
          Length = 418

 Score =  308 bits (790), Expect = 4e-81
 Identities = 163/364 (44%), Positives = 220/364 (60%)
 Frame = +3

Query: 225  QPIKQSPKKTHSNDSRTCSTTSDVLRLMDSIEVPLTLDLYVSLIKECTKKGDPLLAIELH 404
            QPI   PKK  S   R C TTS +L LMD++  P+T+D+Y SL+KECT   DP  AIELH
Sbjct: 57   QPITP-PKK--SKRRRKCDTTSHILPLMDALHFPITIDIYTSLVKECTLSTDPETAIELH 113

Query: 405  NHVRSSGVXXXXXXXXXXXXXXXXXXXXRNARELFDKSFDRNAYSWAVLIAGYLESGDYG 584
              + + G+                     NAR +FD    R+ +SWA L   Y E+G+Y 
Sbjct: 114  TQIITRGIELPLTLLNRILIMFVSCGLLENARRVFDVMSVRDFHSWATLFVSYYENGEYE 173

Query: 585  EVIDLFLEMKSWESAEGGFYDLSNSAVSGIVVCILKSCVKTMNFELGKQVHTLLIKAGCL 764
              ID+F+ M       G  +         I  C+LK+C  TMN  LG QVH  L+K G  
Sbjct: 174  NAIDVFVSMLCQLDVMGFSFP------PWIWSCLLKACACTMNVPLGMQVHGCLLKLGAC 227

Query: 765  RSAVLMSSLMWFYGEFGSLVNSEGAFNQVYNENMVVWTARIVNCCKEERFDKAVHVFREM 944
               ++ SSL+ FYG F  L ++   FN+V   N + WTA+IV+ C+E  F +A+  F++M
Sbjct: 228  DHVLISSSLIRFYGRFKCLEDANMVFNRVSRHNTLTWTAKIVSSCRERHFSEALGDFKKM 287

Query: 945  GQQGVKRNGYTFSSILKACGKMGDDGCCGRQVHAGAVKLGLEFDSYVQCGLIDMYGKYGA 1124
            G+ GVK++ +TFSS+LKACG+M + G CG QVHA A+KLGL+ DSYVQC LI MYG+ G 
Sbjct: 288  GRVGVKKDSFTFSSVLKACGRMQNRGSCGEQVHADAIKLGLDSDSYVQCSLIAMYGRSGL 347

Query: 1125 VNDAKSVFNMCAHEGNAACWNALINGYMQKGLCVEAIKILYEMKAAGLQPQESLLHELRS 1304
            + DA+ VF M  +E N    NA++ GY+Q GL +EA+K +Y+MKAAG+QP E LL +LR 
Sbjct: 348  LRDAELVFEMTRNERNVDSLNAMLMGYIQNGLYIEAVKFVYQMKAAGVQPHEPLLEKLRI 407

Query: 1305 LCGS 1316
             CGS
Sbjct: 408  ACGS 411


>ref|XP_004512307.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31790-like
            [Cicer arietinum]
          Length = 418

 Score =  304 bits (779), Expect = 8e-80
 Identities = 155/364 (42%), Positives = 221/364 (60%), Gaps = 1/364 (0%)
 Frame = +3

Query: 228  PIKQSPKKTHSNDSRTCSTTSDVLRLMDSIEVPLTLDLYVSLIKECTKKGDPLLAIELHN 407
            P  ++  K  +N+ R  +TTS +L LMD++  P+ +D+Y SL+KECT  GDP  A ELH+
Sbjct: 55   PRNKNNTKNKNNNKRKSATTSHILPLMDALHFPIPIDIYTSLVKECTLSGDPETATELHS 114

Query: 408  HVRSSGVXXXXXXXXXXXXXXXXXXXXRNARELFDKSFDRNAYSWAVLIAGYLESGDYGE 587
            H+  SG+                    ++AR +FD+   RN +SWA+L   Y E+ DY  
Sbjct: 115  HITRSGIGPPLTLLNRILIMFVSCGLLQSARHVFDEMPVRNFHSWAILFVAYYENSDYEN 174

Query: 588  VIDLFLEM-KSWESAEGGFYDLSNSAVSGIVVCILKSCVKTMNFELGKQVHTLLIKAGCL 764
             ID+F+ M +     E  F     S       C+L +C  T+N  LG QVH  L K G  
Sbjct: 175  AIDVFMRMLRQLGVMEFPFLPWFWS-------CLLTACACTVNVPLGMQVHGSLTKLGAC 227

Query: 765  RSAVLMSSLMWFYGEFGSLVNSEGAFNQVYNENMVVWTARIVNCCKEERFDKAVHVFREM 944
               ++ SSL+ FYG F  L ++   FN+V   N + WTA+IV+ C+E  F + +  F+EM
Sbjct: 228  DHVLISSSLIRFYGRFKCLEDANVVFNRVSRHNTLTWTAKIVSGCRERHFTQVLGDFKEM 287

Query: 945  GQQGVKRNGYTFSSILKACGKMGDDGCCGRQVHAGAVKLGLEFDSYVQCGLIDMYGKYGA 1124
            G+ G+K++ +TFSS+LKACG+M + G CG QVHA ++KLGL+ D+YVQC LI MYG+ G 
Sbjct: 288  GRVGIKKDSFTFSSVLKACGRMQNYGSCGEQVHADSIKLGLDSDNYVQCSLIAMYGRSGL 347

Query: 1125 VNDAKSVFNMCAHEGNAACWNALINGYMQKGLCVEAIKILYEMKAAGLQPQESLLHELRS 1304
            + DAK VF    +E N   WNA++ GY+Q GL ++A+K +Y+MKAAG+ P ESLL +LR 
Sbjct: 348  LRDAKLVFETTLNERNVDSWNAMLMGYIQNGLYIKAVKFVYQMKAAGVHPHESLLEKLRI 407

Query: 1305 LCGS 1316
             CGS
Sbjct: 408  ACGS 411


>ref|XP_003516509.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31790-like
            [Glycine max]
          Length = 423

 Score =  296 bits (759), Expect = 2e-77
 Identities = 157/366 (42%), Positives = 216/366 (59%), Gaps = 2/366 (0%)
 Frame = +3

Query: 225  QPIKQSPK--KTHSNDSRTCSTTSDVLRLMDSIEVPLTLDLYVSLIKECTKKGDPLLAIE 398
            QP+ Q+    K      R  +TTSD+L LM+++  P+ +D+Y SLIKECT  GDP  AIE
Sbjct: 59   QPLTQTTTFTKKKKKKKRKGATTSDILHLMEALPFPVPIDIYTSLIKECTVSGDPETAIE 118

Query: 399  LHNHVRSSGVXXXXXXXXXXXXXXXXXXXXRNARELFDKSFDRNAYSWAVLIAGYLESGD 578
            L  H+  SG+                     NAR +FDK   R+  +WA L   Y ++ D
Sbjct: 119  LATHISKSGIKPPLPFLNRILVMFVSCGLLENARHMFDKMRVRDFNTWATLFVAYYDNTD 178

Query: 579  YGEVIDLFLEMKSWESAEGGFYDLSNSAVSGIVVCILKSCVKTMNFELGKQVHTLLIKAG 758
            Y E  ++F+ M +    + G  +        I  C+L++C  T+N  LG QVH  L+K G
Sbjct: 179  YEEATNVFVNMLT----QLGMMEFP----PWIWACLLRACACTVNVPLGMQVHGWLLKLG 230

Query: 759  CLRSAVLMSSLMWFYGEFGSLVNSEGAFNQVYNENMVVWTARIVNCCKEERFDKAVHVFR 938
                 +L SSL+ FYG F  L ++   F+ V   N + WTA+IV+ C+E  F +    F+
Sbjct: 231  TCDHVLLSSSLINFYGRFTCLEDASVVFDGVSRHNTLTWTAKIVSGCRERHFSEVFDDFK 290

Query: 939  EMGQQGVKRNGYTFSSILKACGKMGDDGCCGRQVHAGAVKLGLEFDSYVQCGLIDMYGKY 1118
            EMG +GVK++ +TFSS+LKACG+M +   CG QVH  A+KLGL  D YVQC LI MYG+ 
Sbjct: 291  EMGMRGVKKDCFTFSSVLKACGRMLNQERCGEQVHVDAIKLGLVSDHYVQCSLIAMYGRC 350

Query: 1119 GAVNDAKSVFNMCAHEGNAACWNALINGYMQKGLCVEAIKILYEMKAAGLQPQESLLHEL 1298
            G + DAK VF M   E    CWNA++ GY+Q GL +EA+K LY+M+AAG+QP+ESLL +L
Sbjct: 351  GLLEDAKRVFEMSQEERKVDCWNAMLMGYIQNGLYIEAVKFLYQMQAAGMQPRESLLKKL 410

Query: 1299 RSLCGS 1316
            R  CGS
Sbjct: 411  RMACGS 416


>gb|ESW30051.1| hypothetical protein PHAVU_002G120500g [Phaseolus vulgaris]
          Length = 420

 Score =  296 bits (757), Expect = 3e-77
 Identities = 157/357 (43%), Positives = 207/357 (57%)
 Frame = +3

Query: 246  KKTHSNDSRTCSTTSDVLRLMDSIEVPLTLDLYVSLIKECTKKGDPLLAIELHNHVRSSG 425
            KK      R  +TT D+L LMD++  P+T+D+Y SLIKECT  GDP  AIEL+ H+  S 
Sbjct: 65   KKEIKKKKRKEATTLDILHLMDALPFPITIDIYTSLIKECTVSGDPETAIELYTHISKSD 124

Query: 426  VXXXXXXXXXXXXXXXXXXXXRNARELFDKSFDRNAYSWAVLIAGYLESGDYGEVIDLFL 605
            +                     NAR +F+K   R+  SWA L   Y ++ +Y E   +F+
Sbjct: 125  IKPPLPFLNRILIMFVSCGMLENARHMFEKMRVRDFNSWATLFVAYYDNAEYEEATAVFV 184

Query: 606  EMKSWESAEGGFYDLSNSAVSGIVVCILKSCVKTMNFELGKQVHTLLIKAGCLRSAVLMS 785
             M      + G           I  C+L++C  T+N  LG QVH  L+K G     +L S
Sbjct: 185  NMLG----QLGMLQFP----PWIWACLLRACACTLNVPLGLQVHGWLLKLGACDHVLLSS 236

Query: 786  SLMWFYGEFGSLVNSEGAFNQVYNENMVVWTARIVNCCKEERFDKAVHVFREMGQQGVKR 965
            SL+ FYG F  L ++   FN V   N + WTA+IV+ C+E  F +    FREMG +GVK+
Sbjct: 237  SLINFYGRFTCLEDASAVFNGVSRHNTLTWTAKIVSGCRERHFSEVFGDFREMGMRGVKK 296

Query: 966  NGYTFSSILKACGKMGDDGCCGRQVHAGAVKLGLEFDSYVQCGLIDMYGKYGAVNDAKSV 1145
            + +TFSS+LKACGKM +   CG QVHA A+KLGL  D YVQC LI MYG+ G + DAK V
Sbjct: 297  DCFTFSSVLKACGKMLNQERCGEQVHADAIKLGLISDHYVQCSLIAMYGRCGLLTDAKDV 356

Query: 1146 FNMCAHEGNAACWNALINGYMQKGLCVEAIKILYEMKAAGLQPQESLLHELRSLCGS 1316
            F M   E    CWNA++ GY Q G  +EA+K LY+M+AAG+QP ESLL +LR  CGS
Sbjct: 357  FEMTREERKVDCWNAMLMGYTQNGFHIEAVKFLYQMQAAGMQPWESLLKKLRIACGS 413


>gb|AFK33630.1| unknown [Lotus japonicus]
          Length = 356

 Score =  295 bits (754), Expect = 6e-77
 Identities = 158/357 (44%), Positives = 207/357 (57%), Gaps = 1/357 (0%)
 Frame = +3

Query: 249  KTHSNDSRTCSTTSDVLRLMDSIEVPLTLDLYVSLIKECTKKGDPLLAIELHNHVRSSGV 428
            K      R  +TTS +L LMD +  P+ +D+Y SLIKECT   DP  AIELH H+  SG+
Sbjct: 2    KKKKKRKRKGATTSHILHLMDVLPFPIPIDIYTSLIKECTLSPDPQTAIELHTHIAHSGI 61

Query: 429  XXXXXXXXXXXXXXXXXXXXRNARELFDKSFDRNAYSWAVLIAGYLESGDYGEVIDLFLE 608
                                  A +LFD    ++  SWA L   Y ++ DY E ID+FL 
Sbjct: 62   KPPLSFINRILVMFVSCGLLDYACQLFDAMPVKDFNSWATLFIAYYDNADYEEAIDVFLA 121

Query: 609  MKSWESAEGGFYDLSNSAVSG-IVVCILKSCVKTMNFELGKQVHTLLIKAGCLRSAVLMS 785
            M          + L  S     I  C LK+C    N  LG QVH  L+K G     +L S
Sbjct: 122  M---------LHQLGMSEFPPWICACFLKACACIENIPLGMQVHGWLLKLGTCDHVLLSS 172

Query: 786  SLMWFYGEFGSLVNSEGAFNQVYNENMVVWTARIVNCCKEERFDKAVHVFREMGQQGVKR 965
            SL+ FYG F  + ++   FN++   N   WTA+IV+ C+E  F +  + F+EMG+QG+K+
Sbjct: 173  SLIRFYGRFTCVKDANAVFNKLSRHNTSTWTAKIVSGCREMDFPEVFNDFKEMGRQGIKK 232

Query: 966  NGYTFSSILKACGKMGDDGCCGRQVHAGAVKLGLEFDSYVQCGLIDMYGKYGAVNDAKSV 1145
            + YTFSS+LKACGKM D G CG QVHA A+KLGL  D+YVQC LI MYG+ G + DAK V
Sbjct: 233  DTYTFSSVLKACGKMMDHGRCGEQVHADAMKLGLASDNYVQCSLIAMYGRSGLLRDAKQV 292

Query: 1146 FNMCAHEGNAACWNALINGYMQKGLCVEAIKILYEMKAAGLQPQESLLHELRSLCGS 1316
            F     E N   WNA++ GY++ GL +EA+K LY+MKAAGL+P ESLL ++R  CGS
Sbjct: 293  FETSRSERNVDSWNAMLMGYLENGLYIEAVKFLYQMKAAGLKPHESLLDKVRIACGS 349


>ref|XP_002519945.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223540991|gb|EEF42549.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 403

 Score =  285 bits (730), Expect = 4e-74
 Identities = 154/367 (41%), Positives = 220/367 (59%), Gaps = 3/367 (0%)
 Frame = +3

Query: 225  QPIKQSPKKTHSNDSRTCSTTSDVLRLMDSIEVPLTLDLYVSLIKECTKKGDPLLAIELH 404
            +PI   P K      ++CS+ SD++RLMDS+  P+  D+Y SLIKECT   D   A+ LH
Sbjct: 44   KPINHLPAK------KSCSS-SDIMRLMDSLCHPIPPDIYTSLIKECTLTSDSTEALCLH 96

Query: 405  NHVRS-SGVXXXXXXXXXXXXXXXXXXXXRNARELFDKS-FDRNAYSWAVLIAGYLESGD 578
            +H+ S + +                      AR LFDK    ++  SW ++I G   +  
Sbjct: 97   SHLISQTNLKLTPPLVHRLLLMHVSCGQLDIARNLFDKMPLKKDFISWVIVIVGCFSNSK 156

Query: 579  YGEVIDLFLEMKSWESA-EGGFYDLSNSAVSGIVVCILKSCVKTMNFELGKQVHTLLIKA 755
            Y   I+LF++M    S  +G  +DL+   +  I++CI+K C+ +MN  LGKQVH +L K 
Sbjct: 157  YEAGINLFIDMLLQHSVYDGLMFDLNTWNI--IILCIIKCCIYSMNISLGKQVHGILFKV 214

Query: 756  GCLRSAVLMSSLMWFYGEFGSLVNSEGAFNQVYNENMVVWTARIVNCCKEERFDKAVHVF 935
            G         SLM FYG+ G L +    FN++ N N   WTA+IVN C+ +RF + +  F
Sbjct: 215  GLTSEISFNVSLMDFYGKLGCLEDVNSVFNKLDNHNTATWTAKIVNSCRNQRFYEVIEDF 274

Query: 936  REMGQQGVKRNGYTFSSILKACGKMGDDGCCGRQVHAGAVKLGLEFDSYVQCGLIDMYGK 1115
            +EMG+ G+KRN +T SS+L+AC +MGD G CG+QVH   +KLGLE D++VQCGLI MYGK
Sbjct: 275  KEMGEAGIKRNSFTVSSVLRACARMGDGGNCGKQVHVIVIKLGLESDAFVQCGLIAMYGK 334

Query: 1116 YGAVNDAKSVFNMCAHEGNAACWNALINGYMQKGLCVEAIKILYEMKAAGLQPQESLLHE 1295
             G +  AK VF +   + N ACWNAL+  Y++  L +EA+K+LY+M+AA +Q  ESLL  
Sbjct: 335  CGMIRKAKKVFELVIDKTNTACWNALLMAYVRNELFIEAMKLLYQMEAAKIQVNESLLDH 394

Query: 1296 LRSLCGS 1316
            +R  CG+
Sbjct: 395  VRIACGT 401


>emb|CAN69066.1| hypothetical protein VITISV_016070 [Vitis vinifera]
          Length = 543

 Score =  283 bits (724), Expect = 2e-73
 Identities = 153/365 (41%), Positives = 213/365 (58%), Gaps = 4/365 (1%)
 Frame = +3

Query: 234  KQSPKKTHSNDSRTCSTTSDVLRLMDSIEVPLTLDLYVSLIKECTKKGDPLLAIELHNHV 413
            K+  KK++SN + T ST +D+LRLMD + +P+  D+Y SLIKE +  GD   A +L  H+
Sbjct: 207  KKEKKKSNSNATPTTSTPTDILRLMDGLGLPIPPDIYASLIKESSTTGDATQATQLLAHI 266

Query: 414  RSSGVXXXXXXXXXXXXXXXXXXXXRNARELFDKS--FDRNAYSWAVLIAGYLESGDYGE 587
              SG+                      AR +FDK    ++N+ SWA+++A Y+++G Y E
Sbjct: 267  NRSGLPLSSALLNRILLMYVSCGLIHTARHMFDKMNVLNKNSISWAIMLAAYMDNGFYEE 326

Query: 588  VIDLFLEMKSWESAEGGFYDLSNSAVSGIVVCILKSCVKTMNFELGKQVHTLLIKAGCLR 767
             I LF++M    S       +     + I +C+LK+CV TMN  LGKQVH  L K     
Sbjct: 327  AIFLFVQMMELHST------IMLELPAWIFICVLKACVHTMNLTLGKQVHGWLTK----- 375

Query: 768  SAVLMSSLMWFYGEFGSLVNSEGAFNQVYNENMVVWTARIVNCCKEERFDKAVHVFREMG 947
                                           N V+WTA++VN C+ E   +A+  F EMG
Sbjct: 376  -----------------------------ERNTVIWTAKMVNKCQGEYMHEALVAFTEMG 406

Query: 948  QQGVKRNGYTFSSILKACGKMGDDGCCGRQVHAGAVKLGLEFDSYVQCGLIDMYGKYGAV 1127
            + GVKRN +T+SS+L+ACG+M D G CGR +HA  +KLGLE D YVQCGL+DMYGK G +
Sbjct: 407  RAGVKRNEFTYSSVLRACGRMKDHGRCGRLIHASTIKLGLESDIYVQCGLVDMYGKCGLL 466

Query: 1128 NDAKSVFNMCA--HEGNAACWNALINGYMQKGLCVEAIKILYEMKAAGLQPQESLLHELR 1301
             +A+ VF   +  ++ N  CWNA++ GY++ GL +EAIK LY+MKAAG+QPQESLL+ELR
Sbjct: 467  VEARRVFETVSDTNKTNIVCWNAMLTGYIRHGLYIEAIKFLYQMKAAGIQPQESLLNELR 526

Query: 1302 SLCGS 1316
              CGS
Sbjct: 527  IACGS 531


>gb|EXB70628.1| hypothetical protein L484_023813 [Morus notabilis]
          Length = 453

 Score =  270 bits (691), Expect = 1e-69
 Identities = 148/363 (40%), Positives = 215/363 (59%), Gaps = 1/363 (0%)
 Frame = +3

Query: 231  IKQSPKKTHSNDSRTCSTTSDVLRLMDSIEVPLTLDLYVSLIKECTKKGDPLLAIELHNH 410
            +++  +K ++  +    +TSDVLRLMD++ +P++ D+Y+S +KECT   D   A +LHNH
Sbjct: 88   VEKKMRKKNALIAPPACSTSDVLRLMDALCLPISPDMYISFMKECTISADFCGAEDLHNH 147

Query: 411  VRSSGVXXXXXXXXXXXXXXXXXXXXRN-ARELFDKSFDRNAYSWAVLIAGYLESGDYGE 587
            +  + +                     + A +LF +   ++  SWA +I   + + DY E
Sbjct: 148  ISRNSLQHLALPLLNRLLFMNVSCGRLDLACDLFYRMPFKDFKSWATMIVANVNNSDYEE 207

Query: 588  VIDLFLEMKSWESAEGGFYDLSNSAVSGIVVCILKSCVKTMNFELGKQVHTLLIKAGCLR 767
               LFL+M    +             S I+VC+LK+CV T N ELGKQVH   +K G   
Sbjct: 208  ATSLFLKMLHHINML--------EFPSWIIVCLLKTCVCTRNMELGKQVHACALKLGHAN 259

Query: 768  SAVLMSSLMWFYGEFGSLVNSEGAFNQVYNENMVVWTARIVNCCKEERFDKAVHVFREMG 947
            S  L S L+ FYG++G L ++   FNQ+   + + W  R++N  KEE F + +  F E+G
Sbjct: 260  SLYLASCLINFYGKYGCLESANLVFNQLPRHDTLTWMTRLINNSKEELFFEVLRDFNEVG 319

Query: 948  QQGVKRNGYTFSSILKACGKMGDDGCCGRQVHAGAVKLGLEFDSYVQCGLIDMYGKYGAV 1127
            + G+K+N   FSS+LKACG++ D    G+QVHA A+KLG E D YVQCGLIDMYG+ G +
Sbjct: 320  KAGIKKNVLMFSSVLKACGRIHDRRKSGQQVHANAIKLGFESDLYVQCGLIDMYGRSGLL 379

Query: 1128 NDAKSVFNMCAHEGNAACWNALINGYMQKGLCVEAIKILYEMKAAGLQPQESLLHELRSL 1307
             DA+ VF   +   N ACWNA++ GY++  L VEAIK +Y+MKA GLQ Q+S+L ELR  
Sbjct: 380  RDAQRVFEKSSDRRNNACWNAMLGGYIRNELYVEAIKFVYQMKAVGLQLQQSMLDELRIA 439

Query: 1308 CGS 1316
            CGS
Sbjct: 440  CGS 442


>ref|NP_174459.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|75169166|sp|Q9C6R9.1|PPR66_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At1g31790 gi|12321298|gb|AAG50719.1|AC079041_12
            hypothetical protein [Arabidopsis thaliana]
            gi|111074348|gb|ABH04547.1| At1g31790 [Arabidopsis
            thaliana] gi|332193272|gb|AEE31393.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 409

 Score =  261 bits (666), Expect = 1e-66
 Identities = 142/355 (40%), Positives = 202/355 (56%), Gaps = 2/355 (0%)
 Frame = +3

Query: 237  QSPKKTHSNDSRTCSTTSDVLRLMDSIEVPLTLDLYVSLIKECTKKGDPLLAIELHNHVR 416
            Q P+    N S  CST SD+LRLMDS+ +P   D+Y  L KE  ++ D   A EL  H+ 
Sbjct: 57   QQPQIQPQNPSSRCST-SDILRLMDSLSLPGNEDIYSCLAKESARENDQRGAHELQVHIM 115

Query: 417  SSGVXXXXXXXXXXXXXXXXXXXXRNARELFDKSFDRNAYSWAVLIAGYLESGDYGEVID 596
             S +                       R++FD+   R+ +SWA++  G +E GDY +   
Sbjct: 116  KSSIRPTITFINRLLLMHVSCGRLDITRQMFDRMPHRDFHSWAIVFLGCIEMGDYEDAAF 175

Query: 597  LFLEMKSWESAEGGFYDLSNSAVSGIVVCILKSCVKTMNFELGKQVHTLLIKAGCL--RS 770
            LF+ M    S +G F        S I+ C+LK+C    +FELGKQVH L  K G +    
Sbjct: 176  LFVSMLK-HSQKGAF-----KIPSWILGCVLKACAMIRDFELGKQVHALCHKLGFIDEED 229

Query: 771  AVLMSSLMWFYGEFGSLVNSEGAFNQVYNENMVVWTARIVNCCKEERFDKAVHVFREMGQ 950
            + L  SL+ FYGEF  L ++    +Q+ N N V W A++ N  +E  F + +  F EMG 
Sbjct: 230  SYLSGSLIRFYGEFRCLEDANLVLHQLSNANTVAWAAKVTNDYREGEFQEVIRDFIEMGN 289

Query: 951  QGVKRNGYTFSSILKACGKMGDDGCCGRQVHAGAVKLGLEFDSYVQCGLIDMYGKYGAVN 1130
             G+K+N   FS++LKAC  + D G  G+QVHA A+KLG E D  ++C LI+MYGKYG V 
Sbjct: 290  HGIKKNVSVFSNVLKACSWVSDGGRSGQQVHANAIKLGFESDCLIRCRLIEMYGKYGKVK 349

Query: 1131 DAKSVFNMCAHEGNAACWNALINGYMQKGLCVEAIKILYEMKAAGLQPQESLLHE 1295
            DA+ VF     E + +CWNA++  YMQ G+ +EAIK+LY+MKA G++  ++LL+E
Sbjct: 350  DAEKVFKSSKDETSVSCWNAMVASYMQNGIYIEAIKLLYQMKATGIKAHDTLLNE 404


>ref|XP_002893686.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297339528|gb|EFH69945.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 410

 Score =  256 bits (654), Expect = 3e-65
 Identities = 145/356 (40%), Positives = 200/356 (56%), Gaps = 3/356 (0%)
 Frame = +3

Query: 237  QSPKKTHSNDSRTCSTTSDVLRLMDSIEVPLTLDLYVSLIKECTKKGDPLLAIELHNHVR 416
            Q P+      S  CST SD+LRLMDS+ +P   DLY  L KE  ++ D   A EL  H+ 
Sbjct: 57   QQPQIQPQKPSPRCST-SDILRLMDSLSLPGNEDLYSCLAKESARENDRRGAYELQVHIM 115

Query: 417  SSGVXXXXXXXXXXXXXXXXXXXXRN-ARELFDKSFDRNAYSWAVLIAGYLESGDYGEVI 593
             S +                     +  R +FDK   R+ +SWA++  G +E GDY +  
Sbjct: 116  KSSIRRPTTTFVNRLLLMHVSCGRLDITRHMFDKMPHRDFHSWAIVFLGCIEMGDYEDAA 175

Query: 594  DLFLEMKSWESAEGGFYDLSNSAVSGIVVCILKSCVKTMNFELGKQVHTLLIKAGCL--R 767
             LF+ M    S  G F        S I+ C+LK+C    +FELGKQVH L  K GC+   
Sbjct: 176  LLFVSMLK-HSQNGAF-----KIPSWIMGCVLKACAMIRDFELGKQVHALCHKLGCIDEE 229

Query: 768  SAVLMSSLMWFYGEFGSLVNSEGAFNQVYNENMVVWTARIVNCCKEERFDKAVHVFREMG 947
             + L  SL+ FYGEF  L ++    +Q+ N N V W A++ N  +E  F + +  F EMG
Sbjct: 230  DSYLSGSLIRFYGEFRCLEDANLVLHQLSNANTVAWAAKVTNDYREGEFQEVIRDFIEMG 289

Query: 948  QQGVKRNGYTFSSILKACGKMGDDGCCGRQVHAGAVKLGLEFDSYVQCGLIDMYGKYGAV 1127
               +++N   FS++LKAC  + D G  G+QVHA A+KLG E D  ++C LI+MYGKYG V
Sbjct: 290  NHRIRKNVSVFSNVLKACTWVSDGGRSGKQVHAVAIKLGFESDCLIRCRLIEMYGKYGKV 349

Query: 1128 NDAKSVFNMCAHEGNAACWNALINGYMQKGLCVEAIKILYEMKAAGLQPQESLLHE 1295
             DA+ VF     E N  CWNA++ GYMQ G+ VEAIK+L +MKA G++ Q++LL+E
Sbjct: 350  KDAEKVFKSSKDETNVNCWNAMVAGYMQNGIYVEAIKLLCQMKATGIKAQDTLLNE 405


>ref|XP_006303657.1| hypothetical protein CARUB_v10011695mg [Capsella rubella]
            gi|482572368|gb|EOA36555.1| hypothetical protein
            CARUB_v10011695mg [Capsella rubella]
          Length = 411

 Score =  249 bits (635), Expect = 4e-63
 Identities = 141/357 (39%), Positives = 200/357 (56%), Gaps = 2/357 (0%)
 Frame = +3

Query: 231  IKQSPKKTHSNDSRTCSTTSDVLRLMDSIEVPLTLDLYVSLIKECTKKGDPLLAIELHNH 410
            I+Q   +T    S  CS  SD+LRLMD++ +P   DLY  L KE  ++ D   A EL  H
Sbjct: 56   IQQPQIQTTQKSSPRCSI-SDILRLMDTLSLPGNEDLYSCLAKESARENDRRGAYELQVH 114

Query: 411  VRSSGVXXXXXXXXXXXXXXXXXXXXRNARELFDKSFDRNAYSWAVLIAGYLESGDYGEV 590
            +  S +                       R +FDK   R+ +SWA++  G +E GDY + 
Sbjct: 115  IMKSSIRPSTTFVNRLLLMHVSCGRLDITRNMFDKMPHRDFHSWAIVFLGCIEMGDYEDA 174

Query: 591  IDLFLEMKSWESAEGGFYDLSNSAVSGIVVCILKSCVKTMNFELGKQVHTLLIKAGCL-- 764
              LF+ M    S  GG + +     S I+ C+LK+C    +  LGKQVH L  K G +  
Sbjct: 175  ALLFVAMLK-HSKNGGAFKIP----SWIMGCVLKACAMIRDLALGKQVHGLCQKLGFIGE 229

Query: 765  RSAVLMSSLMWFYGEFGSLVNSEGAFNQVYNENMVVWTARIVNCCKEERFDKAVHVFREM 944
              + L+ SL+ FYGEF  L ++    +Q+ N N VVW A++ N  +E  F + +  F EM
Sbjct: 230  EDSYLLGSLIRFYGEFRCLEDANLVLHQLSNANTVVWAAKVTNDYREGEFQEVIRDFIEM 289

Query: 945  GQQGVKRNGYTFSSILKACGKMGDDGCCGRQVHAGAVKLGLEFDSYVQCGLIDMYGKYGA 1124
            G+ GVK+N    S++LKAC  + D G  G+QVHA A+KLG E D  ++C LI+MYGKY  
Sbjct: 290  GKLGVKKNVSVVSNVLKACTWVSDGGRSGQQVHANAIKLGFESDCLIRCQLIEMYGKYEK 349

Query: 1125 VNDAKSVFNMCAHEGNAACWNALINGYMQKGLCVEAIKILYEMKAAGLQPQESLLHE 1295
            V DA+ VF     E + +CWNA++ GYMQ G  +EAIK+LY+MKA G++  + LL+E
Sbjct: 350  VKDAEKVFKSRKDETSVSCWNAMVAGYMQNGFYIEAIKLLYQMKATGIKADDMLLNE 406


>ref|XP_004292904.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31790-like
            [Fragaria vesca subsp. vesca]
          Length = 421

 Score =  230 bits (586), Expect = 2e-57
 Identities = 134/364 (36%), Positives = 200/364 (54%), Gaps = 7/364 (1%)
 Frame = +3

Query: 246  KKTHSNDSRTCSTTSDVLRLMDSIEVPLTLD------LYVSLIKECTKKGDPLLAIELHN 407
            KK   N++ +  +TSD+LRLMD ++VP+T        +Y SLI +C+  G    A+ L  
Sbjct: 65   KKRKKNENGSRCSTSDILRLMDGLQVPVTSTTLSDNHMYASLINDCSDSG---AALHLQA 121

Query: 408  HVRSSGVXXXXXXXXXXXXXXXXXXXXRNARELFDKSFDRNAYSWAVLIAGYLESGDYGE 587
            H+                          NA +LFD+   ++  SWA LI  Y ++ DY E
Sbjct: 122  HLTRKSPPPPLHLLNRLLLRHVCNGRLDNAHQLFDEMPLKDFNSWATLIVAYAQNADYAE 181

Query: 588  VIDLFLEMKSWESAEGGFYDLSNSAVSGIVVCILKSCVKTMNFELGKQVHTLLIKAG-CL 764
             + LFL M   +       D+S    + I+ C+L +   TM+  LG+Q+H   +K G   
Sbjct: 182  ALRLFLSMLHLQDCH---VDISEFP-AWIMACVLDA---TMDVGLGEQLHGCCLKLGHAN 234

Query: 765  RSAVLMSSLMWFYGEFGSLVNSEGAFNQVYNENMVVWTARIVNCCKEERFDKAVHVFREM 944
            R   + +SL+  YG       ++ A   +   N + WTAR++N  + ERF + +  F+E+
Sbjct: 235  RDMFVATSLINLYGRLRCHEAAQRASLGLSQPNALTWTARMINNSRGERFFEVISDFKEI 294

Query: 945  GQQGVKRNGYTFSSILKACGKMGDDGCCGRQVHAGAVKLGLEFDSYVQCGLIDMYGKYGA 1124
            G+ G+ +N    S +L+AC +M D G  GRQVHA A+KLG++  S+V CGLIDMYG+ G 
Sbjct: 295  GRAGISKNTSMISCVLRACARMHDSGFRGRQVHANAIKLGVDSHSFVHCGLIDMYGRNGL 354

Query: 1125 VNDAKSVFNMCAHEGNAACWNALINGYMQKGLCVEAIKILYEMKAAGLQPQESLLHELRS 1304
            + DAK VF       + ACWNA++  Y++ GL +EA+K LYEM+A GLQPQE LL ++R 
Sbjct: 355  LRDAKLVFQTFNDTTSTACWNAMLTNYLRNGLHIEALKFLYEMQADGLQPQEYLLDQVRI 414

Query: 1305 LCGS 1316
             C S
Sbjct: 415  ACAS 418


>ref|XP_006415303.1| hypothetical protein EUTSA_v10009456mg [Eutrema salsugineum]
            gi|557093074|gb|ESQ33656.1| hypothetical protein
            EUTSA_v10009456mg [Eutrema salsugineum]
          Length = 400

 Score =  224 bits (571), Expect = 1e-55
 Identities = 140/360 (38%), Positives = 196/360 (54%), Gaps = 2/360 (0%)
 Frame = +3

Query: 225  QPIKQSPKKTHSNDSRTCSTTSDVLRLMDSIEVPLTLDLYVSLIKECTKKGDPLLAIELH 404
            QP  Q  +   SN    CST SD+LRLMDS+ +P   DLY  L KE T + D   A +L 
Sbjct: 56   QPQIQIDRAPKSNPR--CST-SDILRLMDSLSLPGNEDLYSCLAKESTTECDQRGAYDLQ 112

Query: 405  NHVRSSGVXXXXXXXXXXXXXXXXXXXXRNARELFDKSFDRNAYSWAVLIAGYLESGDYG 584
             H+ +S V                       R++FDK   R+ +SWA++I G +E GDY 
Sbjct: 113  VHIMNSSVRPRTTFLNRLLLMHVSCGRLDITRQMFDKMPQRDFHSWAIVILGCIEMGDYQ 172

Query: 585  EVIDLFLEMKSWESAEGGFYDLSNSAVSGIVVCILKSCVKTMNFELGKQVHTLLIKAGCL 764
            + + LF+ M   ++         +     I+ C+LK+C    + +LGKQVH L  K G +
Sbjct: 173  DAVFLFVSMLKNQNRV-------SKIPPWIMGCVLKACGMIRDLDLGKQVHGLCQKLGFI 225

Query: 765  R--SAVLMSSLMWFYGEFGSLVNSEGAFNQVYNENMVVWTARIVNCCKEERFDKAVHVFR 938
                + L   L+ FYGEF  L ++    NQ+ N N VVW A++ N  +E RF + +  F 
Sbjct: 226  EVEDSYLSGCLVRFYGEFRCLEDANLVLNQLSNANTVVWAAKVTNDYREGRFQEVILDFI 285

Query: 939  EMGQQGVKRNGYTFSSILKACGKMGDDGCCGRQVHAGAVKLGLEFDSYVQCGLIDMYGKY 1118
            EMG+ G+K+N   FS++LKAC  + D G  GR VHA A+KLG E D  ++C LI+MYGKY
Sbjct: 286  EMGKHGIKKNVSVFSNVLKACTWVSDGGRSGRGVHASAIKLGFESDCMIRCRLIEMYGKY 345

Query: 1119 GAVNDAKSVFNMCAHEGNAACWNALINGYMQKGLCVEAIKILYEMKAAGLQPQESLLHEL 1298
            G V DA+ VF            N   NG+      VEAIK+LY+MKA GLQ +++LL+E+
Sbjct: 346  GKVKDAEKVFK-----------NERSNGFY-----VEAIKLLYQMKATGLQVEDTLLNEV 389


>ref|XP_006841553.1| hypothetical protein AMTR_s00003p00175270 [Amborella trichopoda]
            gi|548843574|gb|ERN03228.1| hypothetical protein
            AMTR_s00003p00175270 [Amborella trichopoda]
          Length = 327

 Score =  208 bits (529), Expect = 8e-51
 Identities = 122/337 (36%), Positives = 182/337 (54%)
 Frame = +3

Query: 306  MDSIEVPLTLDLYVSLIKECTKKGDPLLAIELHNHVRSSGVXXXXXXXXXXXXXXXXXXX 485
            M S+++PLT   Y SL+KECT     +   E+H H+  + +                   
Sbjct: 1    MYSLQIPLTPIAYSSLLKECTSSKSLVEGSEIHAHINKTSLYPGIHIENQIILMYMACRC 60

Query: 486  XRNARELFDKSFDRNAYSWAVLIAGYLESGDYGEVIDLFLEMKSWESAEGGFYDLSNSAV 665
               A ++FDK   RN  +W  +I G ++ G   E +DL++ M      +       N+A+
Sbjct: 61   PTLAYQVFDKMSHRNTDTWQFMITGLMDLGMNEETLDLYIRMH-----QEMVRMKPNTAI 115

Query: 666  SGIVVCILKSCVKTMNFELGKQVHTLLIKAGCLRSAVLMSSLMWFYGEFGSLVNSEGAFN 845
             G V   L++C    +  LGKQ+H   IK+G  +   L   L+ FY E   LV++  AF+
Sbjct: 116  QGGV---LRACAFIEDVGLGKQIHAKAIKSGSSKDTYLGCCLVDFYVEMKCLVSARKAFD 172

Query: 846  QVYNENMVVWTARIVNCCKEERFDKAVHVFREMGQQGVKRNGYTFSSILKACGKMGDDGC 1025
            ++   N+V WTA IV C +E  F   + VFREM + G + N YT+S +L A GKMG    
Sbjct: 173  EICKPNVVAWTAMIVGCAREGEFHGVLEVFREMERVGKRGNCYTYSCLLGASGKMG-HVW 231

Query: 1026 CGRQVHAGAVKLGLEFDSYVQCGLIDMYGKYGAVNDAKSVFNMCAHEGNAACWNALINGY 1205
             G+QV A  +K+G+E D YV   ++ MYGK G V DA+ VF+    E NA  WNA++ GY
Sbjct: 232  MGKQVQARVIKVGVEKDVYVGSSIVGMYGKCGFVEDARLVFD-GMREKNAVSWNAMLCGY 290

Query: 1206 MQKGLCVEAIKILYEMKAAGLQPQESLLHELRSLCGS 1316
             + G C EAIK+LYEM+  GL+P + +++E+   CG+
Sbjct: 291  AKNGCCDEAIKLLYEMRCKGLEPPQVMVNEVAIACGA 327


>ref|NP_001131386.1| hypothetical protein [Zea mays] gi|194691388|gb|ACF79778.1| unknown
            [Zea mays] gi|414884126|tpg|DAA60140.1| TPA: hypothetical
            protein ZEAMMB73_895402 [Zea mays]
          Length = 438

 Score =  181 bits (459), Expect = 1e-42
 Identities = 117/367 (31%), Positives = 179/367 (48%), Gaps = 8/367 (2%)
 Frame = +3

Query: 234  KQSPKKTHSNDSRTCSTTSDVLRLMDSIEVPLTLDLYVSLIKECTKKGDPLLAIELHNHV 413
            K  P+ T S+     S   DVLRLMD++ +P   D+Y+SL++EC    + + ++  H   
Sbjct: 80   KPPPEATDSHPPS--SGAGDVLRLMDALGIPPDEDIYISLLRECADAAE-VASVHAHITA 136

Query: 414  RSSGVXXXXXXXXXXXXXXXXXXXXRNARELFDKSFDRNAYSWAVLIAGYLESGDYGEVI 593
            R +                        AR +FD     N  +WA +++ Y +   + E +
Sbjct: 137  RRASDGLPSPVANRLLLSYAACGDIEAARRVFDGMPTTNGMAWATMVSAYSDGCLHHEAM 196

Query: 594  DLFLEMKSWESAEGGFYDLSNSAVSGIVVCILKSCVKTMNFELGKQVHTLLIKAGCLRSA 773
             LF  M        G   L     S  +V +L+SC +     LG+QVH L++K G +   
Sbjct: 197  RLFAHMCH------GTPVLDGDCYSHAIVAVLRSCTRAGELRLGEQVHALVVKKGRIHGD 250

Query: 774  VLMSSLMWFYGEFGSLVNSE----GAFNQVYNENMV---VWTARIVNCCKEERFDKAVHV 932
            +  SSL+  Y + G    S         Q + +  V    WT+ I +C +E    +AV V
Sbjct: 251  I-GSSLVQLYCDGGGFHRSARRVLATTMQHHCQEPVPEAAWTSLITSCHRESLLSEAVDV 309

Query: 933  FREMGQQGVKRNGYTFSSILKACGKMGDDGCC-GRQVHAGAVKLGLEFDSYVQCGLIDMY 1109
            FR+M   GV R+ ++ SSIL    +  D GCC G+QVHA A+K G++ + +V  GLI MY
Sbjct: 310  FRDMASSGVPRSSFSLSSILAVFAESQDPGCCCGQQVHADAIKRGVDTNQFVGSGLIHMY 369

Query: 1110 GKYGAVNDAKSVFNMCAHEGNAACWNALINGYMQKGLCVEAIKILYEMKAAGLQPQESLL 1289
             K G + DA   F     + +AACW+AL   Y + G   EA +I+Y+MKAAG+ P + + 
Sbjct: 370  AKQGQLADATRAFETIGGKPDAACWSALAMAYARGGRYREATRIMYQMKAAGMNPSKEMA 429

Query: 1290 HELRSLC 1310
              +R  C
Sbjct: 430  DAVRLAC 436


>ref|XP_002461747.1| hypothetical protein SORBIDRAFT_02g007340 [Sorghum bicolor]
            gi|241925124|gb|EER98268.1| hypothetical protein
            SORBIDRAFT_02g007340 [Sorghum bicolor]
          Length = 442

 Score =  177 bits (449), Expect = 2e-41
 Identities = 109/360 (30%), Positives = 177/360 (49%), Gaps = 9/360 (2%)
 Frame = +3

Query: 258  SNDSRTCST-TSDVLRLMDSIEVPLTLDLYVSLIKECTKKGDPLLAIELHNHVRSSGVXX 434
            + DS  CS+   DVLRLMD++ +P   D+Y+SL++EC    + + ++  H     +    
Sbjct: 89   ATDSHPCSSGAGDVLRLMDALGIPPDEDIYISLLRECADAAE-VASVHAHMTACCASDAL 147

Query: 435  XXXXXXXXXXXXXXXXXXRNARELFDKSFDRNAYSWAVLIAGYLESGDYGEVIDLFLEMK 614
                                AR +FD   DRN  +WA +++ Y +   + E + LF  M 
Sbjct: 148  PSPVANRVLLSYAACGDIEAARRVFDGMPDRNGMAWATMVSAYSDGCFHHEAMRLFAHMC 207

Query: 615  SWESAEGGFYDLSNSAVSGIVVCILKSCVKTMNFELGKQVHTLLIKAGCLRSAVLMSSLM 794
                       L     S  ++ +L+SC++     LG+QVH L+IK G +   +  SSL+
Sbjct: 208  HRTLV------LDGDCCSHAILAVLRSCIRAGELRLGEQVHALVIKKGRILGDI-GSSLV 260

Query: 795  WFYGEFGSLVNSEGAFNQVYNENM-------VVWTARIVNCCKEERFDKAVHVFREMGQQ 953
              Y E   L  S      +  ++          WT+ I  C ++ +  +A+ VFR+M   
Sbjct: 261  QLYCESSGLHRSARRVLVMMMQHHCQEPVPEAAWTSLITCCHRDGQLSEAIDVFRDMASS 320

Query: 954  GVKRNGYTFSSILKACGKMGDDGCC-GRQVHAGAVKLGLEFDSYVQCGLIDMYGKYGAVN 1130
            GV R+ ++ SSIL    +  + GCC G+QVHA A+K G++ + +V  GL+ MY K G + 
Sbjct: 321  GVPRSSFSLSSILAVFAESQNQGCCCGQQVHADAIKRGVDTNQFVGSGLVHMYAKQGWLA 380

Query: 1131 DAKSVFNMCAHEGNAACWNALINGYMQKGLCVEAIKILYEMKAAGLQPQESLLHELRSLC 1310
            DA   F     + + ACW+AL   Y + G   EA +++Y+MKAAG+ P + +   +R  C
Sbjct: 381  DAVRAFGAIGGKPDTACWSALALAYARGGRYREATRVMYQMKAAGMTPSQEMADAVRLAC 440


Top