BLASTX nr result

ID: Rheum21_contig00022200 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rheum21_contig00022200
         (1566 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003631528.1| PREDICTED: pentatricopeptide repeat-containi...   328   4e-87
ref|XP_003516509.1| PREDICTED: pentatricopeptide repeat-containi...   295   3e-77
ref|XP_006437483.1| hypothetical protein CICLE_v10033975mg [Citr...   295   5e-77
gb|ESW30051.1| hypothetical protein PHAVU_002G120500g [Phaseolus...   293   1e-76
ref|XP_004512307.1| PREDICTED: pentatricopeptide repeat-containi...   284   6e-74
emb|CAN69066.1| hypothetical protein VITISV_016070 [Vitis vinifera]   283   1e-73
gb|AFK33630.1| unknown [Lotus japonicus]                              282   2e-73
ref|XP_003612457.1| Pentatricopeptide repeat-containing protein ...   279   3e-72
ref|XP_002519945.1| pentatricopeptide repeat-containing protein,...   273   2e-70
gb|EXB70628.1| hypothetical protein L484_023813 [Morus notabilis]     272   3e-70
gb|EOY32970.1| Pentatricopeptide repeat-containing protein, puta...   271   6e-70
ref|XP_004246209.1| PREDICTED: pentatricopeptide repeat-containi...   262   3e-67
ref|XP_004292904.1| PREDICTED: pentatricopeptide repeat-containi...   237   1e-59
ref|NP_174459.1| pentatricopeptide repeat-containing protein [Ar...   237   1e-59
ref|XP_006303657.1| hypothetical protein CARUB_v10011695mg [Caps...   236   2e-59
ref|XP_002893686.1| pentatricopeptide repeat-containing protein ...   227   1e-56
ref|XP_006415303.1| hypothetical protein EUTSA_v10009456mg [Eutr...   220   1e-54
ref|XP_006841553.1| hypothetical protein AMTR_s00003p00175270 [A...   194   1e-46
ref|XP_002461747.1| hypothetical protein SORBIDRAFT_02g007340 [S...   184   7e-44
ref|NP_001131386.1| hypothetical protein [Zea mays] gi|194691388...   184   7e-44

>ref|XP_003631528.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31790-like
            [Vitis vinifera]
          Length = 414

 Score =  328 bits (841), Expect = 4e-87
 Identities = 165/388 (42%), Positives = 245/388 (63%), Gaps = 5/388 (1%)
 Frame = -3

Query: 1285 DQEIKTARLKTAHEKNTSSDVLCLMGSLGFTVTPDVYACLVEECTHKGDAYRASELWTHM 1106
            ++E K +        +T +D+L LM  LG  + PD+YA L++E +  GDA +A++L  H+
Sbjct: 44   NKEKKKSNSNATPTTSTPTDILRLMDGLGLPIPPDIYASLIKESSTTGDATQATQLLAHI 103

Query: 1105 KRNKPMMRYLSKPKRLFFLNRILIMFMSCGSVDAAQQLFDKMS--RRDTAAWCLMIAGLV 932
             R+         P     LNRIL+M++SCG +  A+ +FDKM+   +++ +W +M+A  +
Sbjct: 104  NRS-------GLPLSSALLNRILLMYVSCGLIHTARHMFDKMNVLNKNSISWAIMLAAYM 156

Query: 931  DNSLHCEALDLFSQMMLHPMNLQNDALLQI---LFACVLKACIFDVDMDLGKTLHGWAMK 761
            DN  + EA+ LF QMM     L +  +L++   +F CVLKAC+  +++ LGK +HGW +K
Sbjct: 157  DNGFYEEAIFLFVQMM----ELHSTIMLELPAWIFICVLKACVHTMNLTLGKQVHGWLLK 212

Query: 760  LGYTRNLHVCTAFLDFYGKFRCPEETNLIFNNHIQEDNPHAGDTVFWTGALVTNCHENRF 581
            +GY  NL +    + FYGKFRC ++ + +F      D     +TV WT  +V  C     
Sbjct: 213  VGYATNLFLSCYLISFYGKFRCLDDADFVF------DQTSERNTVIWTAKMVNKCQGEYM 266

Query: 580  QETMYMFRDMGRAGVRKNEFTLSSVLKACGRLKDGGCCGQQAHADAIKHGLVESHVFVQC 401
             E +  F +MGRAGV++NEFT SSVL+ACGR+KD G CG+  HA  IK GL ES ++VQC
Sbjct: 267  HEALVAFTEMGRAGVKRNEFTYSSVLRACGRMKDHGRCGRLIHASTIKLGL-ESDIYVQC 325

Query: 400  SLVDMYSRCGMLMEARKAFDDINCETRSRSTPCWNAMINGYLENGAYVEAIKTLYQMKAA 221
             LVDMY +CG+L+EAR+ F+ ++ +T   +  CWNAM+ GY+ +G Y+EAIK LYQMKAA
Sbjct: 326  GLVDMYGKCGLLVEARRVFETVS-DTNKTNIVCWNAMLTGYIRHGLYIEAIKFLYQMKAA 384

Query: 220  GVEPQESLLGRVRLVCGTNRYKLPIDYM 137
            G++PQESLL  +R+ CG+   +   D M
Sbjct: 385  GIQPQESLLNELRIACGSTTLENKTDGM 412


>ref|XP_003516509.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31790-like
            [Glycine max]
          Length = 423

 Score =  295 bits (756), Expect = 3e-77
 Identities = 161/396 (40%), Positives = 237/396 (59%), Gaps = 6/396 (1%)
 Frame = -3

Query: 1327 PITKP--RFPGHTDSGDQEIKTARLKTAHEKN----TSSDVLCLMGSLGFTVTPDVYACL 1166
            P+  P   FP HT        T   K   +K     T+SD+L LM +L F V  D+Y  L
Sbjct: 44   PLRHPIHNFPNHTSPQPLTQTTTFTKKKKKKKRKGATTSDILHLMEALPFPVPIDIYTSL 103

Query: 1165 VEECTHKGDAYRASELWTHMKRNKPMMRYLSKPKRLFFLNRILIMFMSCGSVDAAQQLFD 986
            ++ECT  GD   A EL TH+ ++        KP  L FLNRIL+MF+SCG ++ A+ +FD
Sbjct: 104  IKECTVSGDPETAIELATHISKSG------IKPP-LPFLNRILVMFVSCGLLENARHMFD 156

Query: 985  KMSRRDTAAWCLMIAGLVDNSLHCEALDLFSQMMLHPMNLQNDALLQILFACVLKACIFD 806
            KM  RD   W  +     DN+ + EA ++F  M+     ++    +   +AC+L+AC   
Sbjct: 157  KMRVRDFNTWATLFVAYYDNTDYEEATNVFVNMLTQLGMMEFPPWI---WACLLRACACT 213

Query: 805  VDMDLGKTLHGWAMKLGYTRNLHVCTAFLDFYGKFRCPEETNLIFNNHIQEDNPHAGDTV 626
            V++ LG  +HGW +KLG   ++ + ++ ++FYG+F C E+ +++F      D     +T+
Sbjct: 214  VNVPLGMQVHGWLLKLGTCDHVLLSSSLINFYGRFTCLEDASVVF------DGVSRHNTL 267

Query: 625  FWTGALVTNCHENRFQETMYMFRDMGRAGVRKNEFTLSSVLKACGRLKDGGCCGQQAHAD 446
             WT  +V+ C E  F E    F++MG  GV+K+ FT SSVLKACGR+ +   CG+Q H D
Sbjct: 268  TWTAKIVSGCRERHFSEVFDDFKEMGMRGVKKDCFTFSSVLKACGRMLNQERCGEQVHVD 327

Query: 445  AIKHGLVESHVFVQCSLVDMYSRCGMLMEARKAFDDINCETRSRSTPCWNAMINGYLENG 266
            AIK GLV  H +VQCSL+ MY RCG+L +A++ F+        R   CWNAM+ GY++NG
Sbjct: 328  AIKLGLVSDH-YVQCSLIAMYGRCGLLEDAKRVFE---MSQEERKVDCWNAMLMGYIQNG 383

Query: 265  AYVEAIKTLYQMKAAGVEPQESLLGRVRLVCGTNRY 158
             Y+EA+K LYQM+AAG++P+ESLL ++R+ CG+  Y
Sbjct: 384  LYIEAVKFLYQMQAAGMQPRESLLKKLRMACGSISY 419


>ref|XP_006437483.1| hypothetical protein CICLE_v10033975mg [Citrus clementina]
            gi|557539679|gb|ESR50723.1| hypothetical protein
            CICLE_v10033975mg [Citrus clementina]
          Length = 425

 Score =  295 bits (754), Expect = 5e-77
 Identities = 158/385 (41%), Positives = 236/385 (61%)
 Frame = -3

Query: 1318 KPRFPGHTDSGDQEIKTARLKTAHEKNTSSDVLCLMGSLGFTVTPDVYACLVEECTHKGD 1139
            KP  P  T S  +E      ++     +S+++L LM +L   +T D+Y CL++ECT + D
Sbjct: 45   KPTKPLKTSSNWRETT----QSIPANTSSANILHLMDNLCLPITTDMYTCLIKECTFQKD 100

Query: 1138 AYRASELWTHMKRNKPMMRYLSKPKRLFFLNRILIMFMSCGSVDAAQQLFDKMSRRDTAA 959
            +  A EL  H+++     R   KP  L FLNR+L+M +SCG +D A+QLFD+M  RD  +
Sbjct: 101  SAGAFELLNHIRK-----RVNIKPT-LLFLNRLLLMHVSCGQLDTARQLFDEMPLRDFNS 154

Query: 958  WCLMIAGLVDNSLHCEALDLFSQMMLHPMNLQNDALLQILFACVLKACIFDVDMDLGKTL 779
            W +MI G VD + + E + LF++MM              +  CVLKAC+  ++M+LGK +
Sbjct: 155  WAVMIVGYVDVADYQECITLFAEMMKRKKGHMLLVFPAWIIVCVLKACVCTMNMELGKQV 214

Query: 778  HGWAMKLGYTRNLHVCTAFLDFYGKFRCPEETNLIFNNHIQEDNPHAGDTVFWTGALVTN 599
            HG   KLG +RN+ +  + ++FYGKFRC E+ + +F+  ++  N     TV WT  +V N
Sbjct: 215  HGLLFKLGSSRNISLTGSLINFYGKFRCLEDADFVFSQ-LKRHN-----TVVWTAKIVNN 268

Query: 598  CHENRFQETMYMFRDMGRAGVRKNEFTLSSVLKACGRLKDGGCCGQQAHADAIKHGLVES 419
            C E  F +    F++MGR  ++KN +T SSVLKACG + D G CG+Q HA+ +K GL ES
Sbjct: 269  CREGHFHQVFNDFKEMGRERIKKNSYTFSSVLKACGGVDDDGNCGRQVHANIVKIGL-ES 327

Query: 418  HVFVQCSLVDMYSRCGMLMEARKAFDDINCETRSRSTPCWNAMINGYLENGAYVEAIKTL 239
              +VQC LVDMY +C +L +A++ F+ I      ++   WNAM+ GY+ NG YVEA K L
Sbjct: 328  DEYVQCGLVDMYGKCRLLRDAKRVFELI---VDKKNIASWNAMLMGYIRNGLYVEATKFL 384

Query: 238  YQMKAAGVEPQESLLGRVRLVCGTN 164
            Y MKA+G++ QESL+  +R+ C ++
Sbjct: 385  YLMKASGIQIQESLINDLRIACSSS 409


>gb|ESW30051.1| hypothetical protein PHAVU_002G120500g [Phaseolus vulgaris]
          Length = 420

 Score =  293 bits (750), Expect = 1e-76
 Identities = 159/387 (41%), Positives = 236/387 (60%)
 Frame = -3

Query: 1318 KPRFPGHTDSGDQEIKTARLKTAHEKNTSSDVLCLMGSLGFTVTPDVYACLVEECTHKGD 1139
            +P  P  T    +EIK  + K A    T+ D+L LM +L F +T D+Y  L++ECT  GD
Sbjct: 54   RPLIPQTTTFTKKEIKKKKRKEA----TTLDILHLMDALPFPITIDIYTSLIKECTVSGD 109

Query: 1138 AYRASELWTHMKRNKPMMRYLSKPKRLFFLNRILIMFMSCGSVDAAQQLFDKMSRRDTAA 959
               A EL+TH+ ++        KP  L FLNRILIMF+SCG ++ A+ +F+KM  RD  +
Sbjct: 110  PETAIELYTHISKSD------IKPP-LPFLNRILIMFVSCGMLENARHMFEKMRVRDFNS 162

Query: 958  WCLMIAGLVDNSLHCEALDLFSQMMLHPMNLQNDALLQILFACVLKACIFDVDMDLGKTL 779
            W  +     DN+ + EA  +F  M+     LQ    +   +AC+L+AC   +++ LG  +
Sbjct: 163  WATLFVAYYDNAEYEEATAVFVNMLGQLGMLQFPPWI---WACLLRACACTLNVPLGLQV 219

Query: 778  HGWAMKLGYTRNLHVCTAFLDFYGKFRCPEETNLIFNNHIQEDNPHAGDTVFWTGALVTN 599
            HGW +KLG   ++ + ++ ++FYG+F C E+ + +FN   + +      T+ WT  +V+ 
Sbjct: 220  HGWLLKLGACDHVLLSSSLINFYGRFTCLEDASAVFNGVSRHN------TLTWTAKIVSG 273

Query: 598  CHENRFQETMYMFRDMGRAGVRKNEFTLSSVLKACGRLKDGGCCGQQAHADAIKHGLVES 419
            C E  F E    FR+MG  GV+K+ FT SSVLKACG++ +   CG+Q HADAIK GL+  
Sbjct: 274  CRERHFSEVFGDFREMGMRGVKKDCFTFSSVLKACGKMLNQERCGEQVHADAIKLGLISD 333

Query: 418  HVFVQCSLVDMYSRCGMLMEARKAFDDINCETRSRSTPCWNAMINGYLENGAYVEAIKTL 239
            H +VQCSL+ MY RCG+L +A+  F+        R   CWNAM+ GY +NG ++EA+K L
Sbjct: 334  H-YVQCSLIAMYGRCGLLTDAKDVFE---MTREERKVDCWNAMLMGYTQNGFHIEAVKFL 389

Query: 238  YQMKAAGVEPQESLLGRVRLVCGTNRY 158
            YQM+AAG++P ESLL ++R+ CG+  Y
Sbjct: 390  YQMQAAGMQPWESLLKKLRIACGSITY 416


>ref|XP_004512307.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31790-like
            [Cicer arietinum]
          Length = 418

 Score =  284 bits (727), Expect = 6e-74
 Identities = 155/401 (38%), Positives = 242/401 (60%), Gaps = 11/401 (2%)
 Frame = -3

Query: 1327 PITKPR---FPGHTDSGDQEIKTARLK--TAHEKN------TSSDVLCLMGSLGFTVTPD 1181
            P   P+   FP H  S    +   R K  T ++ N      T+S +L LM +L F +  D
Sbjct: 32   PFRNPKLINFPHHPSSQPLTVTPPRNKNNTKNKNNNKRKSATTSHILPLMDALHFPIPID 91

Query: 1180 VYACLVEECTHKGDAYRASELWTHMKRNKPMMRYLSKPKRLFFLNRILIMFMSCGSVDAA 1001
            +Y  LV+ECT  GD   A+EL +H+ R+      +  P  L  LNRILIMF+SCG + +A
Sbjct: 92   IYTSLVKECTLSGDPETATELHSHITRSG-----IGPP--LTLLNRILIMFVSCGLLQSA 144

Query: 1000 QQLFDKMSRRDTAAWCLMIAGLVDNSLHCEALDLFSQMMLHPMNLQNDALLQILFACVLK 821
            + +FD+M  R+  +W ++     +NS +  A+D+F +M L  + +     L   ++C+L 
Sbjct: 145  RHVFDEMPVRNFHSWAILFVAYYENSDYENAIDVFMRM-LRQLGVMEFPFLPWFWSCLLT 203

Query: 820  ACIFDVDMDLGKTLHGWAMKLGYTRNLHVCTAFLDFYGKFRCPEETNLIFNNHIQEDNPH 641
            AC   V++ LG  +HG   KLG   ++ + ++ + FYG+F+C E+ N++FN   + +   
Sbjct: 204  ACACTVNVPLGMQVHGSLTKLGACDHVLISSSLIRFYGRFKCLEDANVVFNRVSRHN--- 260

Query: 640  AGDTVFWTGALVTNCHENRFQETMYMFRDMGRAGVRKNEFTLSSVLKACGRLKDGGCCGQ 461
               T+ WT  +V+ C E  F + +  F++MGR G++K+ FT SSVLKACGR+++ G CG+
Sbjct: 261  ---TLTWTAKIVSGCRERHFTQVLGDFKEMGRVGIKKDSFTFSSVLKACGRMQNYGSCGE 317

Query: 460  QAHADAIKHGLVESHVFVQCSLVDMYSRCGMLMEARKAFDDINCETRSRSTPCWNAMING 281
            Q HAD+IK GL +S  +VQCSL+ MY R G+L +A+  F+        R+   WNAM+ G
Sbjct: 318  QVHADSIKLGL-DSDNYVQCSLIAMYGRSGLLRDAKLVFE---TTLNERNVDSWNAMLMG 373

Query: 280  YLENGAYVEAIKTLYQMKAAGVEPQESLLGRVRLVCGTNRY 158
            Y++NG Y++A+K +YQMKAAGV P ESLL ++R+ CG++ +
Sbjct: 374  YIQNGLYIKAVKFVYQMKAAGVHPHESLLEKLRIACGSSNF 414


>emb|CAN69066.1| hypothetical protein VITISV_016070 [Vitis vinifera]
          Length = 543

 Score =  283 bits (724), Expect = 1e-73
 Identities = 154/387 (39%), Positives = 224/387 (57%), Gaps = 5/387 (1%)
 Frame = -3

Query: 1282 QEIKTARLKTAHEKNTSSDVLCLMGSLGFTVTPDVYACLVEECTHKGDAYRASELWTHMK 1103
            +E K +        +T +D+L LM  LG  + PD+YA L++E +  GDA +A++L  H+ 
Sbjct: 208  KEKKKSNSNATPTTSTPTDILRLMDGLGLPIPPDIYASLIKESSTTGDATQATQLLAHIN 267

Query: 1102 RNKPMMRYLSKPKRLFFLNRILIMFMSCGSVDAAQQLFDKMS--RRDTAAWCLMIAGLVD 929
            R+         P     LNRIL+M++SCG +  A+ +FDKM+   +++ +W +M+A  +D
Sbjct: 268  RS-------GLPLSSALLNRILLMYVSCGLIHTARHMFDKMNVLNKNSISWAIMLAAYMD 320

Query: 928  NSLHCEALDLFSQMMLHPMNLQNDALLQI---LFACVLKACIFDVDMDLGKTLHGWAMKL 758
            N  + EA+ LF QMM     L +  +L++   +F CVLKAC+  +++ LGK +HGW  K 
Sbjct: 321  NGFYEEAIFLFVQMM----ELHSTIMLELPAWIFICVLKACVHTMNLTLGKQVHGWLTK- 375

Query: 757  GYTRNLHVCTAFLDFYGKFRCPEETNLIFNNHIQEDNPHAGDTVFWTGALVTNCHENRFQ 578
                                   E N                TV WT  +V  C      
Sbjct: 376  -----------------------ERN----------------TVIWTAKMVNKCQGEYMH 396

Query: 577  ETMYMFRDMGRAGVRKNEFTLSSVLKACGRLKDGGCCGQQAHADAIKHGLVESHVFVQCS 398
            E +  F +MGRAGV++NEFT SSVL+ACGR+KD G CG+  HA  IK GL ES ++VQC 
Sbjct: 397  EALVAFTEMGRAGVKRNEFTYSSVLRACGRMKDHGRCGRLIHASTIKLGL-ESDIYVQCG 455

Query: 397  LVDMYSRCGMLMEARKAFDDINCETRSRSTPCWNAMINGYLENGAYVEAIKTLYQMKAAG 218
            LVDMY +CG+L+EAR+ F+ ++ +T   +  CWNAM+ GY+ +G Y+EAIK LYQMKAAG
Sbjct: 456  LVDMYGKCGLLVEARRVFETVS-DTNKTNIVCWNAMLTGYIRHGLYIEAIKFLYQMKAAG 514

Query: 217  VEPQESLLGRVRLVCGTNRYKLPIDYM 137
            ++PQESLL  +R+ CG+   +   D M
Sbjct: 515  IQPQESLLNELRIACGSTTLENKTDGM 541


>gb|AFK33630.1| unknown [Lotus japonicus]
          Length = 356

 Score =  282 bits (722), Expect = 2e-73
 Identities = 153/360 (42%), Positives = 220/360 (61%)
 Frame = -3

Query: 1237 TSSDVLCLMGSLGFTVTPDVYACLVEECTHKGDAYRASELWTHMKRNKPMMRYLSKPKRL 1058
            T+S +L LM  L F +  D+Y  L++ECT   D   A EL TH+  +        KP  L
Sbjct: 13   TTSHILHLMDVLPFPIPIDIYTSLIKECTLSPDPQTAIELHTHIAHSG------IKPP-L 65

Query: 1057 FFLNRILIMFMSCGSVDAAQQLFDKMSRRDTAAWCLMIAGLVDNSLHCEALDLFSQMMLH 878
             F+NRIL+MF+SCG +D A QLFD M  +D  +W  +     DN+ + EA+D+F  M LH
Sbjct: 66   SFINRILVMFVSCGLLDYACQLFDAMPVKDFNSWATLFIAYYDNADYEEAIDVFLAM-LH 124

Query: 877  PMNLQNDALLQILFACVLKACIFDVDMDLGKTLHGWAMKLGYTRNLHVCTAFLDFYGKFR 698
             + +        + AC LKAC    ++ LG  +HGW +KLG   ++ + ++ + FYG+F 
Sbjct: 125  QLGMSE--FPPWICACFLKACACIENIPLGMQVHGWLLKLGTCDHVLLSSSLIRFYGRFT 182

Query: 697  CPEETNLIFNNHIQEDNPHAGDTVFWTGALVTNCHENRFQETMYMFRDMGRAGVRKNEFT 518
            C ++ N +FN   + +      T  WT  +V+ C E  F E    F++MGR G++K+ +T
Sbjct: 183  CVKDANAVFNKLSRHN------TSTWTAKIVSGCREMDFPEVFNDFKEMGRQGIKKDTYT 236

Query: 517  LSSVLKACGRLKDGGCCGQQAHADAIKHGLVESHVFVQCSLVDMYSRCGMLMEARKAFDD 338
             SSVLKACG++ D G CG+Q HADA+K GL   + +VQCSL+ MY R G+L +A++ F+ 
Sbjct: 237  FSSVLKACGKMMDHGRCGEQVHADAMKLGLASDN-YVQCSLIAMYGRSGLLRDAKQVFET 295

Query: 337  INCETRSRSTPCWNAMINGYLENGAYVEAIKTLYQMKAAGVEPQESLLGRVRLVCGTNRY 158
               E   R+   WNAM+ GYLENG Y+EA+K LYQMKAAG++P ESLL +VR+ CG+  Y
Sbjct: 296  SRSE---RNVDSWNAMLMGYLENGLYIEAVKFLYQMKAAGLKPHESLLDKVRIACGSVTY 352


>ref|XP_003612457.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
            gi|355513792|gb|AES95415.1| Pentatricopeptide
            repeat-containing protein [Medicago truncatula]
          Length = 418

 Score =  279 bits (713), Expect = 3e-72
 Identities = 156/407 (38%), Positives = 242/407 (59%)
 Frame = -3

Query: 1378 RRLTTGVRVSQIGTTDAPITKPRFPGHTDSGDQEIKTARLKTAHEKNTSSDVLCLMGSLG 1199
            RR      +S I  +  PIT P+               + K   + +T+S +L LM +L 
Sbjct: 41   RRNPKPKNLSLIHPSSQPITPPK---------------KSKRRRKCDTTSHILPLMDALH 85

Query: 1198 FTVTPDVYACLVEECTHKGDAYRASELWTHMKRNKPMMRYLSKPKRLFFLNRILIMFMSC 1019
            F +T D+Y  LV+ECT   D   A EL T +     + R +  P  L  LNRILIMF+SC
Sbjct: 86   FPITIDIYTSLVKECTLSTDPETAIELHTQI-----ITRGIELP--LTLLNRILIMFVSC 138

Query: 1018 GSVDAAQQLFDKMSRRDTAAWCLMIAGLVDNSLHCEALDLFSQMMLHPMNLQNDALLQIL 839
            G ++ A+++FD MS RD  +W  +     +N  +  A+D+F  M+   +++   +    +
Sbjct: 139  GLLENARRVFDVMSVRDFHSWATLFVSYYENGEYENAIDVFVSMLCQ-LDVMGFSFPPWI 197

Query: 838  FACVLKACIFDVDMDLGKTLHGWAMKLGYTRNLHVCTAFLDFYGKFRCPEETNLIFNNHI 659
            ++C+LKAC   +++ LG  +HG  +KLG   ++ + ++ + FYG+F+C E+ N++FN   
Sbjct: 198  WSCLLKACACTMNVPLGMQVHGCLLKLGACDHVLISSSLIRFYGRFKCLEDANMVFNRVS 257

Query: 658  QEDNPHAGDTVFWTGALVTNCHENRFQETMYMFRDMGRAGVRKNEFTLSSVLKACGRLKD 479
            + +      T+ WT  +V++C E  F E +  F+ MGR GV+K+ FT SSVLKACGR+++
Sbjct: 258  RHN------TLTWTAKIVSSCRERHFSEALGDFKKMGRVGVKKDSFTFSSVLKACGRMQN 311

Query: 478  GGCCGQQAHADAIKHGLVESHVFVQCSLVDMYSRCGMLMEARKAFDDINCETRSRSTPCW 299
             G CG+Q HADAIK GL +S  +VQCSL+ MY R G+L +A   F+    E   R+    
Sbjct: 312  RGSCGEQVHADAIKLGL-DSDSYVQCSLIAMYGRSGLLRDAELVFEMTRNE---RNVDSL 367

Query: 298  NAMINGYLENGAYVEAIKTLYQMKAAGVEPQESLLGRVRLVCGTNRY 158
            NAM+ GY++NG Y+EA+K +YQMKAAGV+P E LL ++R+ CG++ +
Sbjct: 368  NAMLMGYIQNGLYIEAVKFVYQMKAAGVQPHEPLLEKLRIACGSSNF 414


>ref|XP_002519945.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223540991|gb|EEF42549.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 403

 Score =  273 bits (697), Expect = 2e-70
 Identities = 152/398 (38%), Positives = 231/398 (58%), Gaps = 7/398 (1%)
 Frame = -3

Query: 1339 TTDAPITKPRFPGHTDSGDQEIKTARLKTAHEKNTSSDVLCLMGSLGFTVTPDVYACLVE 1160
            +T   IT P            IK      A +  +SSD++ LM SL   + PD+Y  L++
Sbjct: 22   STKKNITAPNHTKLPPLRTPNIKPINHLPAKKSCSSSDIMRLMDSLCHPIPPDIYTSLIK 81

Query: 1159 ECTHKGDAYRASELWTHMKRNKPMMRYLSKPKRLFFLNRILIMFMSCGSVDAAQQLFDKM 980
            ECT   D+  A  L +H+     +   L+ P     ++R+L+M +SCG +D A+ LFDKM
Sbjct: 82   ECTLTSDSTEALCLHSHLISQTNLK--LTPP----LVHRLLLMHVSCGQLDIARNLFDKM 135

Query: 979  S-RRDTAAWCLMIAGLVDNSLHCEALDLFSQMMLHPMNLQNDALL------QILFACVLK 821
              ++D  +W ++I G   NS +   ++LF  M+L   +   D L+       I+  C++K
Sbjct: 136  PLKKDFISWVIVIVGCFSNSKYEAGINLFIDMLLQ--HSVYDGLMFDLNTWNIIILCIIK 193

Query: 820  ACIFDVDMDLGKTLHGWAMKLGYTRNLHVCTAFLDFYGKFRCPEETNLIFNNHIQEDNPH 641
             CI+ +++ LGK +HG   K+G T  +    + +DFYGK  C E+ N +FN   + DN +
Sbjct: 194  CCIYSMNISLGKQVHGILFKVGLTSEISFNVSLMDFYGKLGCLEDVNSVFN---KLDNHN 250

Query: 640  AGDTVFWTGALVTNCHENRFQETMYMFRDMGRAGVRKNEFTLSSVLKACGRLKDGGCCGQ 461
               T  WT  +V +C   RF E +  F++MG AG+++N FT+SSVL+AC R+ DGG CG+
Sbjct: 251  ---TATWTAKIVNSCRNQRFYEVIEDFKEMGEAGIKRNSFTVSSVLRACARMGDGGNCGK 307

Query: 460  QAHADAIKHGLVESHVFVQCSLVDMYSRCGMLMEARKAFDDINCETRSRSTPCWNAMING 281
            Q H   IK GL ES  FVQC L+ MY +CGM+ +A+K F+ +  +T   +T CWNA++  
Sbjct: 308  QVHVIVIKLGL-ESDAFVQCGLIAMYGKCGMIRKAKKVFELVIDKT---NTACWNALLMA 363

Query: 280  YLENGAYVEAIKTLYQMKAAGVEPQESLLGRVRLVCGT 167
            Y+ N  ++EA+K LYQM+AA ++  ESLL  VR+ CGT
Sbjct: 364  YVRNELFIEAMKLLYQMEAAKIQVNESLLDHVRIACGT 401


>gb|EXB70628.1| hypothetical protein L484_023813 [Morus notabilis]
          Length = 453

 Score =  272 bits (695), Expect = 3e-70
 Identities = 146/358 (40%), Positives = 221/358 (61%)
 Frame = -3

Query: 1237 TSSDVLCLMGSLGFTVTPDVYACLVEECTHKGDAYRASELWTHMKRNKPMMRYLSKPKRL 1058
            ++SDVL LM +L   ++PD+Y   ++ECT   D   A +L  H+ RN   +++L+ P   
Sbjct: 105  STSDVLRLMDALCLPISPDMYISFMKECTISADFCGAEDLHNHISRNS--LQHLALP--- 159

Query: 1057 FFLNRILIMFMSCGSVDAAQQLFDKMSRRDTAAWCLMIAGLVDNSLHCEALDLFSQMMLH 878
              LNR+L M +SCG +D A  LF +M  +D  +W  MI   V+NS + EA  LF +M+ H
Sbjct: 160  -LLNRLLFMNVSCGRLDLACDLFYRMPFKDFKSWATMIVANVNNSDYEEATSLFLKMLHH 218

Query: 877  PMNLQNDALLQILFACVLKACIFDVDMDLGKTLHGWAMKLGYTRNLHVCTAFLDFYGKFR 698
               L+  + + +   C+LK C+   +M+LGK +H  A+KLG+  +L++ +  ++FYGK+ 
Sbjct: 219  INMLEFPSWIIV---CLLKTCVCTRNMELGKQVHACALKLGHANSLYLASCLINFYGKYG 275

Query: 697  CPEETNLIFNNHIQEDNPHAGDTVFWTGALVTNCHENRFQETMYMFRDMGRAGVRKNEFT 518
            C E  NL+FN   + D      T+ W   L+ N  E  F E +  F ++G+AG++KN   
Sbjct: 276  CLESANLVFNQLPRHD------TLTWMTRLINNSKEELFFEVLRDFNEVGKAGIKKNVLM 329

Query: 517  LSSVLKACGRLKDGGCCGQQAHADAIKHGLVESHVFVQCSLVDMYSRCGMLMEARKAFDD 338
             SSVLKACGR+ D    GQQ HA+AIK G  ES ++VQC L+DMY R G+L +A++ F+ 
Sbjct: 330  FSSVLKACGRIHDRRKSGQQVHANAIKLGF-ESDLYVQCGLIDMYGRSGLLRDAQRVFEK 388

Query: 337  INCETRSRSTPCWNAMINGYLENGAYVEAIKTLYQMKAAGVEPQESLLGRVRLVCGTN 164
                +  R+  CWNAM+ GY+ N  YVEAIK +YQMKA G++ Q+S+L  +R+ CG++
Sbjct: 389  ---SSDRRNNACWNAMLGGYIRNELYVEAIKFVYQMKAVGLQLQQSMLDELRIACGSD 443


>gb|EOY32970.1| Pentatricopeptide repeat-containing protein, putative [Theobroma
            cacao]
          Length = 413

 Score =  271 bits (693), Expect = 6e-70
 Identities = 144/358 (40%), Positives = 220/358 (61%)
 Frame = -3

Query: 1240 NTSSDVLCLMGSLGFTVTPDVYACLVEECTHKGDAYRASELWTHMKRNKPMMRYLSKPKR 1061
            +T+SD+L LM SL   + PD+YA LV+ECT    + RA EL +H++ ++       KP  
Sbjct: 75   HTTSDILRLMDSLSLPIPPDIYASLVKECTVTRHSRRALELHSHIRNSR------IKPS- 127

Query: 1060 LFFLNRILIMFMSCGSVDAAQQLFDKMSRRDTAAWCLMIAGLVDNSLHCEALDLFSQMML 881
            L  LNR+L+M +SCG +D A+ LFD+M  RD  +W +MI   +      +A+  F +M  
Sbjct: 128  LPLLNRLLLMHVSCGHLDIARHLFDQMLLRDFNSWAIMIVACLHAGDSEQAIAYFVRMER 187

Query: 880  HPMNLQNDALLQILFACVLKACIFDVDMDLGKTLHGWAMKLGYTRNLHVCTAFLDFYGKF 701
            H +  +  + + +   C+LK+C+   +M LGK +HG  +KLG + +  +  + ++FYGKF
Sbjct: 188  HNLLFKCPSWIIV---CLLKSCVVTKNMGLGKQVHGQLLKLGASNDSSLSGSLINFYGKF 244

Query: 700  RCPEETNLIFNNHIQEDNPHAGDTVFWTGALVTNCHENRFQETMYMFRDMGRAGVRKNEF 521
            RC ++ + +FN   + +      TV WT  +V +C E++F + +  F +MGR G++KN F
Sbjct: 245  RCLDDADFVFNQLSRRN------TVTWTARIVNSCREDQFGKVIDDFNEMGRQGIKKNNF 298

Query: 520  TLSSVLKACGRLKDGGCCGQQAHADAIKHGLVESHVFVQCSLVDMYSRCGMLMEARKAFD 341
            T S V KAC R+ D G  G+Q HA+A+K GL ES VFVQC L+ +Y +CG + +A KAF+
Sbjct: 299  TFSGVFKACARMDDDGMSGRQVHANALKLGL-ESDVFVQCGLIHLYGKCGSVRDAEKAFE 357

Query: 340  DINCETRSRSTPCWNAMINGYLENGAYVEAIKTLYQMKAAGVEPQESLLGRVRLVCGT 167
             +      R+  CWNAM+ GY+ N   + AIK LY+MK AG++ QESL+  VR+ C T
Sbjct: 358  IVG---DKRNIACWNAMLMGYVHNELCLRAIKLLYRMKEAGIKVQESLINDVRIACAT 412


>ref|XP_004246209.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31790-like
            [Solanum lycopersicum]
          Length = 465

 Score =  262 bits (669), Expect = 3e-67
 Identities = 141/347 (40%), Positives = 211/347 (60%), Gaps = 2/347 (0%)
 Frame = -3

Query: 1213 MGSLGFTVTPDVYACLVEECTHKGDAYRASELWTHMKRNKPMMRYLSKPKRLFFLNRILI 1034
            M SLGF +  DVY  L++ECT   D   A E++ H+ ++  +         L  LNR+L+
Sbjct: 1    MDSLGFNIPVDVYVSLIKECTESRDPLNAVEVYEHVCKSDVI-------PSLPLLNRLLL 53

Query: 1033 MFMSCGSVDAAQQLFDKMSRRDTAAWCLMIAGLVDNSLHCEALDLFSQMMLHPMNLQN-- 860
            M + CG  + A+QLFDKM  R++ +W  MIAG V+N     AL LF +M     NL    
Sbjct: 54   MLVLCGCFEQARQLFDKMRVRNSQSWAAMIAGCVENGECVGALRLFMEMQSEAGNLCKCG 113

Query: 859  DALLQILFACVLKACIFDVDMDLGKTLHGWAMKLGYTRNLHVCTAFLDFYGKFRCPEETN 680
            D +   +  CVLKAC+  ++++ G+ +HGW +KLG   ++ + +  + FYG+F   E  +
Sbjct: 114  DLIDDGILVCVLKACVELMNLEFGRQIHGWLLKLGNCESMVLNSFLIKFYGEFGYLESAD 173

Query: 679  LIFNNHIQEDNPHAGDTVFWTGALVTNCHENRFQETMYMFRDMGRAGVRKNEFTLSSVLK 500
             +F+ H+    PH  +TV WT  +   C E +F+  + +FR+M   GV+KN FT SS+LK
Sbjct: 174  NVFD-HV----PHC-NTVVWTARIGNLCKEEQFEGAIRIFREMVSEGVKKNSFTFSSILK 227

Query: 499  ACGRLKDGGCCGQQAHADAIKHGLVESHVFVQCSLVDMYSRCGMLMEARKAFDDINCETR 320
            ACG+L+D GCCGQQ HA ++K GL ++  +V CSL+DMY + G+L +AR+ F   N    
Sbjct: 228  ACGKLRDAGCCGQQIHATSVKVGL-DTDSYVLCSLIDMYGKYGLLKDARRVF---NARED 283

Query: 319  SRSTPCWNAMINGYLENGAYVEAIKTLYQMKAAGVEPQESLLGRVRL 179
              +  CWNAM+ G +++G  VEA+K LY+MK AG++P ESL+  V L
Sbjct: 284  KSNIACWNAMLMGCIQHGFGVEAMKVLYEMKEAGLQPHESLINEVLL 330


>ref|XP_004292904.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31790-like
            [Fragaria vesca subsp. vesca]
          Length = 421

 Score =  237 bits (604), Expect = 1e-59
 Identities = 142/367 (38%), Positives = 206/367 (56%), Gaps = 9/367 (2%)
 Frame = -3

Query: 1237 TSSDVLCLMGSLGFTVTPD------VYACLVEECTHKGDAYRASELWTHMKRNKPMMRYL 1076
            ++SD+L LM  L   VT        +YA L+ +C+  G A     L  H+ R  P     
Sbjct: 77   STSDILRLMDGLQVPVTSTTLSDNHMYASLINDCSDSGAALH---LQAHLTRKSP----- 128

Query: 1075 SKPKRLFFLNRILIMFMSCGSVDAAQQLFDKMSRRDTAAWCLMIAGLVDNSLHCEALDLF 896
              P  L  LNR+L+  +  G +D A QLFD+M  +D  +W  +I     N+ + EAL LF
Sbjct: 129  --PPPLHLLNRLLLRHVCNGRLDNAHQLFDEMPLKDFNSWATLIVAYAQNADYAEALRLF 186

Query: 895  SQMMLHPMNLQNDALLQILFACVLKACIFDVDMD--LGKTLHGWAMKLGYT-RNLHVCTA 725
              M+    +LQ+  +    F   + AC+ D  MD  LG+ LHG  +KLG+  R++ V T+
Sbjct: 187  LSML----HLQDCHVDISEFPAWIMACVLDATMDVGLGEQLHGCCLKLGHANRDMFVATS 242

Query: 724  FLDFYGKFRCPEETNLIFNNHIQEDNPHAGDTVFWTGALVTNCHENRFQETMYMFRDMGR 545
             ++ YG+ RC E         +    P+A   + WT  ++ N    RF E +  F+++GR
Sbjct: 243  LINLYGRLRCHEAAQ---RASLGLSQPNA---LTWTARMINNSRGERFFEVISDFKEIGR 296

Query: 544  AGVRKNEFTLSSVLKACGRLKDGGCCGQQAHADAIKHGLVESHVFVQCSLVDMYSRCGML 365
            AG+ KN   +S VL+AC R+ D G  G+Q HA+AIK G V+SH FV C L+DMY R G+L
Sbjct: 297  AGISKNTSMISCVLRACARMHDSGFRGRQVHANAIKLG-VDSHSFVHCGLIDMYGRNGLL 355

Query: 364  MEARKAFDDINCETRSRSTPCWNAMINGYLENGAYVEAIKTLYQMKAAGVEPQESLLGRV 185
             +A+  F   N  T   ST CWNAM+  YL NG ++EA+K LY+M+A G++PQE LL +V
Sbjct: 356  RDAKLVFQTFNDTT---STACWNAMLTNYLRNGLHIEALKFLYEMQADGLQPQEYLLDQV 412

Query: 184  RLVCGTN 164
            R+ C +N
Sbjct: 413  RIACASN 419


>ref|NP_174459.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|75169166|sp|Q9C6R9.1|PPR66_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At1g31790 gi|12321298|gb|AAG50719.1|AC079041_12
            hypothetical protein [Arabidopsis thaliana]
            gi|111074348|gb|ABH04547.1| At1g31790 [Arabidopsis
            thaliana] gi|332193272|gb|AEE31393.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 409

 Score =  237 bits (604), Expect = 1e-59
 Identities = 134/355 (37%), Positives = 198/355 (55%), Gaps = 2/355 (0%)
 Frame = -3

Query: 1237 TSSDVLCLMGSLGFTVTPDVYACLVEECTHKGDAYRASELWTHMKRNKPMMRYLSKPKRL 1058
            ++SD+L LM SL      D+Y+CL +E   + D   A EL  H+ ++       S    +
Sbjct: 71   STSDILRLMDSLSLPGNEDIYSCLAKESARENDQRGAHELQVHIMKS-------SIRPTI 123

Query: 1057 FFLNRILIMFMSCGSVDAAQQLFDKMSRRDTAAWCLMIAGLVDNSLHCEALDLFSQMMLH 878
             F+NR+L+M +SCG +D  +Q+FD+M  RD  +W ++  G ++   + +A  LF  M+ H
Sbjct: 124  TFINRLLLMHVSCGRLDITRQMFDRMPHRDFHSWAIVFLGCIEMGDYEDAAFLFVSMLKH 183

Query: 877  PMNLQNDALLQILFACVLKACIFDVDMDLGKTLHGWAMKLGYT--RNLHVCTAFLDFYGK 704
                       IL  CVLKAC    D +LGK +H    KLG+    + ++  + + FYG+
Sbjct: 184  SQKGAFKIPSWIL-GCVLKACAMIRDFELGKQVHALCHKLGFIDEEDSYLSGSLIRFYGE 242

Query: 703  FRCPEETNLIFNNHIQEDNPHAGDTVFWTGALVTNCHENRFQETMYMFRDMGRAGVRKNE 524
            FRC E+ NL+ +   Q  N    +TV W   +  +  E  FQE +  F +MG  G++KN 
Sbjct: 243  FRCLEDANLVLH---QLSN---ANTVAWAAKVTNDYREGEFQEVIRDFIEMGNHGIKKNV 296

Query: 523  FTLSSVLKACGRLKDGGCCGQQAHADAIKHGLVESHVFVQCSLVDMYSRCGMLMEARKAF 344
               S+VLKAC  + DGG  GQQ HA+AIK G  ES   ++C L++MY + G + +A K F
Sbjct: 297  SVFSNVLKACSWVSDGGRSGQQVHANAIKLGF-ESDCLIRCRLIEMYGKYGKVKDAEKVF 355

Query: 343  DDINCETRSRSTPCWNAMINGYLENGAYVEAIKTLYQMKAAGVEPQESLLGRVRL 179
                 ET   S  CWNAM+  Y++NG Y+EAIK LYQMKA G++  ++LL    L
Sbjct: 356  KSSKDET---SVSCWNAMVASYMQNGIYIEAIKLLYQMKATGIKAHDTLLNEAHL 407


>ref|XP_006303657.1| hypothetical protein CARUB_v10011695mg [Capsella rubella]
            gi|482572368|gb|EOA36555.1| hypothetical protein
            CARUB_v10011695mg [Capsella rubella]
          Length = 411

 Score =  236 bits (602), Expect = 2e-59
 Identities = 134/348 (38%), Positives = 197/348 (56%), Gaps = 2/348 (0%)
 Frame = -3

Query: 1231 SDVLCLMGSLGFTVTPDVYACLVEECTHKGDAYRASELWTHMKRNKPMMRYLSKPKRLFF 1052
            SD+L LM +L      D+Y+CL +E   + D   A EL  H+      M+   +P   F 
Sbjct: 74   SDILRLMDTLSLPGNEDLYSCLAKESARENDRRGAYELQVHI------MKSSIRPSTTF- 126

Query: 1051 LNRILIMFMSCGSVDAAQQLFDKMSRRDTAAWCLMIAGLVDNSLHCEALDLFSQMMLHPM 872
            +NR+L+M +SCG +D  + +FDKM  RD  +W ++  G ++   + +A  LF  M+ H  
Sbjct: 127  VNRLLLMHVSCGRLDITRNMFDKMPHRDFHSWAIVFLGCIEMGDYEDAALLFVAMLKHSK 186

Query: 871  NLQNDALLQILFACVLKACIFDVDMDLGKTLHGWAMKLGYT--RNLHVCTAFLDFYGKFR 698
            N     +   +  CVLKAC    D+ LGK +HG   KLG+    + ++  + + FYG+FR
Sbjct: 187  NGGAFKIPSWIMGCVLKACAMIRDLALGKQVHGLCQKLGFIGEEDSYLLGSLIRFYGEFR 246

Query: 697  CPEETNLIFNNHIQEDNPHAGDTVFWTGALVTNCHENRFQETMYMFRDMGRAGVRKNEFT 518
            C E+ NL+ +   Q  N    +TV W   +  +  E  FQE +  F +MG+ GV+KN   
Sbjct: 247  CLEDANLVLH---QLSN---ANTVVWAAKVTNDYREGEFQEVIRDFIEMGKLGVKKNVSV 300

Query: 517  LSSVLKACGRLKDGGCCGQQAHADAIKHGLVESHVFVQCSLVDMYSRCGMLMEARKAFDD 338
            +S+VLKAC  + DGG  GQQ HA+AIK G  ES   ++C L++MY +   + +A K F  
Sbjct: 301  VSNVLKACTWVSDGGRSGQQVHANAIKLGF-ESDCLIRCQLIEMYGKYEKVKDAEKVFKS 359

Query: 337  INCETRSRSTPCWNAMINGYLENGAYVEAIKTLYQMKAAGVEPQESLL 194
               ET   S  CWNAM+ GY++NG Y+EAIK LYQMKA G++  + LL
Sbjct: 360  RKDET---SVSCWNAMVAGYMQNGFYIEAIKLLYQMKATGIKADDMLL 404


>ref|XP_002893686.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297339528|gb|EFH69945.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 410

 Score =  227 bits (578), Expect = 1e-56
 Identities = 134/352 (38%), Positives = 198/352 (56%), Gaps = 4/352 (1%)
 Frame = -3

Query: 1237 TSSDVLCLMGSLGFTVTPDVYACLVEECTHKGDAYRASELWTHMKRNKPMMRYLSKPKRL 1058
            ++SD+L LM SL      D+Y+CL +E   + D   A EL  H+ ++      + +P   
Sbjct: 71   STSDILRLMDSLSLPGNEDLYSCLAKESARENDRRGAYELQVHIMKSS-----IRRPTTT 125

Query: 1057 FFLNRILIMFMSCGSVDAAQQLFDKMSRRDTAAWCLMIAGLVDNSLHCEALDLFSQMMLH 878
            F +NR+L+M +SCG +D  + +FDKM  RD  +W ++  G ++   + +A  LF  M+ H
Sbjct: 126  F-VNRLLLMHVSCGRLDITRHMFDKMPHRDFHSWAIVFLGCIEMGDYEDAALLFVSMLKH 184

Query: 877  PMNLQNDA--LLQILFACVLKACIFDVDMDLGKTLHGWAMKLGYT--RNLHVCTAFLDFY 710
                QN A  +   +  CVLKAC    D +LGK +H    KLG     + ++  + + FY
Sbjct: 185  S---QNGAFKIPSWIMGCVLKACAMIRDFELGKQVHALCHKLGCIDEEDSYLSGSLIRFY 241

Query: 709  GKFRCPEETNLIFNNHIQEDNPHAGDTVFWTGALVTNCHENRFQETMYMFRDMGRAGVRK 530
            G+FRC E+ NL+ +   Q  N    +TV W   +  +  E  FQE +  F +MG   +RK
Sbjct: 242  GEFRCLEDANLVLH---QLSN---ANTVAWAAKVTNDYREGEFQEVIRDFIEMGNHRIRK 295

Query: 529  NEFTLSSVLKACGRLKDGGCCGQQAHADAIKHGLVESHVFVQCSLVDMYSRCGMLMEARK 350
            N    S+VLKAC  + DGG  G+Q HA AIK G  ES   ++C L++MY + G + +A K
Sbjct: 296  NVSVFSNVLKACTWVSDGGRSGKQVHAVAIKLGF-ESDCLIRCRLIEMYGKYGKVKDAEK 354

Query: 349  AFDDINCETRSRSTPCWNAMINGYLENGAYVEAIKTLYQMKAAGVEPQESLL 194
             F     ET   +  CWNAM+ GY++NG YVEAIK L QMKA G++ Q++LL
Sbjct: 355  VFKSSKDET---NVNCWNAMVAGYMQNGIYVEAIKLLCQMKATGIKAQDTLL 403


>ref|XP_006415303.1| hypothetical protein EUTSA_v10009456mg [Eutrema salsugineum]
            gi|557093074|gb|ESQ33656.1| hypothetical protein
            EUTSA_v10009456mg [Eutrema salsugineum]
          Length = 400

 Score =  220 bits (560), Expect = 1e-54
 Identities = 137/376 (36%), Positives = 207/376 (55%), Gaps = 2/376 (0%)
 Frame = -3

Query: 1279 EIKTARLKTAHEKNTSSDVLCLMGSLGFTVTPDVYACLVEECTHKGDAYRASELWTHMKR 1100
            +I+  R   ++ + ++SD+L LM SL      D+Y+CL +E T + D   A +L  H+  
Sbjct: 58   QIQIDRAPKSNPRCSTSDILRLMDSLSLPGNEDLYSCLAKESTTECDQRGAYDLQVHIMN 117

Query: 1099 NKPMMRYLSKPKRLFFLNRILIMFMSCGSVDAAQQLFDKMSRRDTAAWCLMIAGLVDNSL 920
            +       S   R  FLNR+L+M +SCG +D  +Q+FDKM +RD  +W ++I G ++   
Sbjct: 118  S-------SVRPRTTFLNRLLLMHVSCGRLDITRQMFDKMPQRDFHSWAIVILGCIEMGD 170

Query: 919  HCEALDLFSQMMLHPMNLQNDALLQILFACVLKACIFDVDMDLGKTLHGWAMKLGY--TR 746
            + +A+ LF  M+ +   +    +   +  CVLKAC    D+DLGK +HG   KLG+    
Sbjct: 171  YQDAVFLFVSMLKNQNRVSK--IPPWIMGCVLKACGMIRDLDLGKQVHGLCQKLGFIEVE 228

Query: 745  NLHVCTAFLDFYGKFRCPEETNLIFNNHIQEDNPHAGDTVFWTGALVTNCHENRFQETMY 566
            + ++    + FYG+FRC E+ NL+ N   Q  N    +TV W   +  +  E RFQE + 
Sbjct: 229  DSYLSGCLVRFYGEFRCLEDANLVLN---QLSN---ANTVVWAAKVTNDYREGRFQEVIL 282

Query: 565  MFRDMGRAGVRKNEFTLSSVLKACGRLKDGGCCGQQAHADAIKHGLVESHVFVQCSLVDM 386
             F +MG+ G++KN    S+VLKAC  + DGG  G+  HA AIK G  ES   ++C L++M
Sbjct: 283  DFIEMGKHGIKKNVSVFSNVLKACTWVSDGGRSGRGVHASAIKLGF-ESDCMIRCRLIEM 341

Query: 385  YSRCGMLMEARKAFDDINCETRSRSTPCWNAMINGYLENGAYVEAIKTLYQMKAAGVEPQ 206
            Y + G + +A K F +       RS             NG YVEAIK LYQMKA G++ +
Sbjct: 342  YGKYGKVKDAEKVFKN------ERS-------------NGFYVEAIKLLYQMKATGLQVE 382

Query: 205  ESLLGRVRLVCGTNRY 158
            ++LL  V L   T+RY
Sbjct: 383  DTLLNEVNLK-PTSRY 397


>ref|XP_006841553.1| hypothetical protein AMTR_s00003p00175270 [Amborella trichopoda]
            gi|548843574|gb|ERN03228.1| hypothetical protein
            AMTR_s00003p00175270 [Amborella trichopoda]
          Length = 327

 Score =  194 bits (492), Expect = 1e-46
 Identities = 119/348 (34%), Positives = 183/348 (52%)
 Frame = -3

Query: 1213 MGSLGFTVTPDVYACLVEECTHKGDAYRASELWTHMKRNKPMMRYLSKPKRLFFLNRILI 1034
            M SL   +TP  Y+ L++ECT        SE+  H+ +        S    +   N+I++
Sbjct: 1    MYSLQIPLTPIAYSSLLKECTSSKSLVEGSEIHAHINKT-------SLYPGIHIENQIIL 53

Query: 1033 MFMSCGSVDAAQQLFDKMSRRDTAAWCLMIAGLVDNSLHCEALDLFSQMMLHPMNLQNDA 854
            M+M+C     A Q+FDKMS R+T  W  MI GL+D  ++ E LDL+ +M    + ++ + 
Sbjct: 54   MYMACRCPTLAYQVFDKMSHRNTDTWQFMITGLMDLGMNEETLDLYIRMHQEMVRMKPNT 113

Query: 853  LLQILFACVLKACIFDVDMDLGKTLHGWAMKLGYTRNLHVCTAFLDFYGKFRCPEETNLI 674
             +Q     VL+AC F  D+ LGK +H  A+K G +++ ++    +DFY + +C       
Sbjct: 114  AIQ---GGVLRACAFIEDVGLGKQIHAKAIKSGSSKDTYLGCCLVDFYVEMKCLVSARKA 170

Query: 673  FNNHIQEDNPHAGDTVFWTGALVTNCHENRFQETMYMFRDMGRAGVRKNEFTLSSVLKAC 494
            F      D     + V WT  +V    E  F   + +FR+M R G R N +T S +L A 
Sbjct: 171  F------DEICKPNVVAWTAMIVGCAREGEFHGVLEVFREMERVGKRGNCYTYSCLLGAS 224

Query: 493  GRLKDGGCCGQQAHADAIKHGLVESHVFVQCSLVDMYSRCGMLMEARKAFDDINCETRSR 314
            G++      G+Q  A  IK G VE  V+V  S+V MY +CG + +AR  FD +    R +
Sbjct: 225  GKM-GHVWMGKQVQARVIKVG-VEKDVYVGSSIVGMYGKCGFVEDARLVFDGM----REK 278

Query: 313  STPCWNAMINGYLENGAYVEAIKTLYQMKAAGVEPQESLLGRVRLVCG 170
            +   WNAM+ GY +NG   EAIK LY+M+  G+EP + ++  V + CG
Sbjct: 279  NAVSWNAMLCGYAKNGCCDEAIKLLYEMRCKGLEPPQVMVNEVAIACG 326


>ref|XP_002461747.1| hypothetical protein SORBIDRAFT_02g007340 [Sorghum bicolor]
            gi|241925124|gb|EER98268.1| hypothetical protein
            SORBIDRAFT_02g007340 [Sorghum bicolor]
          Length = 442

 Score =  184 bits (468), Expect = 7e-44
 Identities = 123/361 (34%), Positives = 187/361 (51%), Gaps = 7/361 (1%)
 Frame = -3

Query: 1234 SSDVLCLMGSLGFTVTPDVYACLVEECTHKGDAYRASELWTHMKRNKPMMRYLSKPKRLF 1055
            + DVL LM +LG     D+Y  L+ EC    DA   + +  HM          S      
Sbjct: 99   AGDVLRLMDALGIPPDEDIYISLLRECA---DAAEVASVHAHMTACCASDALPSP----- 150

Query: 1054 FLNRILIMFMSCGSVDAAQQLFDKMSRRDTAAWCLMIAGLVDNSLHCEALDLFSQMMLHP 875
              NR+L+ + +CG ++AA+++FD M  R+  AW  M++   D   H EA+ LF+ M    
Sbjct: 151  VANRVLLSYAACGDIEAARRVFDGMPDRNGMAWATMVSAYSDGCFHHEAMRLFAHMCHRT 210

Query: 874  MNLQNDALLQILFACVLKACIFDVDMDLGKTLHGWAMKLGYTRNLHVCTAFLDFY----G 707
            + L  D     + A VL++CI   ++ LG+ +H   +K G      + ++ +  Y    G
Sbjct: 211  LVLDGDCCSHAILA-VLRSCIRAGELRLGEQVHALVIKKGRILG-DIGSSLVQLYCESSG 268

Query: 706  KFRCPEET-NLIFNNHIQEDNPHAGDTVFWTGALVTNCH-ENRFQETMYMFRDMGRAGVR 533
              R       ++  +H QE  P A     WT +L+T CH + +  E + +FRDM  +GV 
Sbjct: 269  LHRSARRVLVMMMQHHCQEPVPEAA----WT-SLITCCHRDGQLSEAIDVFRDMASSGVP 323

Query: 532  KNEFTLSSVLKACGRLKDGG-CCGQQAHADAIKHGLVESHVFVQCSLVDMYSRCGMLMEA 356
            ++ F+LSS+L      ++ G CCGQQ HADAIK G V+++ FV   LV MY++ G L +A
Sbjct: 324  RSSFSLSSILAVFAESQNQGCCCGQQVHADAIKRG-VDTNQFVGSGLVHMYAKQGWLADA 382

Query: 355  RKAFDDINCETRSRSTPCWNAMINGYLENGAYVEAIKTLYQMKAAGVEPQESLLGRVRLV 176
             +AF  I  +     T CW+A+   Y   G Y EA + +YQMKAAG+ P + +   VRL 
Sbjct: 383  VRAFGAIGGKP---DTACWSALALAYARGGRYREATRVMYQMKAAGMTPSQEMADAVRLA 439

Query: 175  C 173
            C
Sbjct: 440  C 440


>ref|NP_001131386.1| hypothetical protein [Zea mays] gi|194691388|gb|ACF79778.1| unknown
            [Zea mays] gi|414884126|tpg|DAA60140.1| TPA: hypothetical
            protein ZEAMMB73_895402 [Zea mays]
          Length = 438

 Score =  184 bits (468), Expect = 7e-44
 Identities = 122/361 (33%), Positives = 186/361 (51%), Gaps = 7/361 (1%)
 Frame = -3

Query: 1234 SSDVLCLMGSLGFTVTPDVYACLVEECTHKGDAYRASELWTHMKRNKPMMRYLSKPKRLF 1055
            + DVL LM +LG     D+Y  L+ EC    DA   + +  H+   +      S      
Sbjct: 95   AGDVLRLMDALGIPPDEDIYISLLRECA---DAAEVASVHAHITARRASDGLPSP----- 146

Query: 1054 FLNRILIMFMSCGSVDAAQQLFDKMSRRDTAAWCLMIAGLVDNSLHCEALDLFSQMMLHP 875
              NR+L+ + +CG ++AA+++FD M   +  AW  M++   D  LH EA+ LF+ M    
Sbjct: 147  VANRLLLSYAACGDIEAARRVFDGMPTTNGMAWATMVSAYSDGCLHHEAMRLFAHMCHGT 206

Query: 874  MNLQNDALLQILFACVLKACIFDVDMDLGKTLHGWAMKLGYTRNLHVCTAFLDFY---GK 704
              L  D     + A VL++C    ++ LG+ +H   +K G      + ++ +  Y   G 
Sbjct: 207  PVLDGDCYSHAIVA-VLRSCTRAGELRLGEQVHALVVKKGRIHG-DIGSSLVQLYCDGGG 264

Query: 703  FRCPEETNL--IFNNHIQEDNPHAGDTVFWTGALVTNCH-ENRFQETMYMFRDMGRAGVR 533
            F       L     +H QE  P A     WT +L+T+CH E+   E + +FRDM  +GV 
Sbjct: 265  FHRSARRVLATTMQHHCQEPVPEAA----WT-SLITSCHRESLLSEAVDVFRDMASSGVP 319

Query: 532  KNEFTLSSVLKACGRLKDGG-CCGQQAHADAIKHGLVESHVFVQCSLVDMYSRCGMLMEA 356
            ++ F+LSS+L      +D G CCGQQ HADAIK G V+++ FV   L+ MY++ G L +A
Sbjct: 320  RSSFSLSSILAVFAESQDPGCCCGQQVHADAIKRG-VDTNQFVGSGLIHMYAKQGQLADA 378

Query: 355  RKAFDDINCETRSRSTPCWNAMINGYLENGAYVEAIKTLYQMKAAGVEPQESLLGRVRLV 176
             +AF+ I  +       CW+A+   Y   G Y EA + +YQMKAAG+ P + +   VRL 
Sbjct: 379  TRAFETIGGKP---DAACWSALAMAYARGGRYREATRIMYQMKAAGMNPSKEMADAVRLA 435

Query: 175  C 173
            C
Sbjct: 436  C 436


Top