BLASTX nr result

ID: Papaver27_contig00041294 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Papaver27_contig00041294
         (1732 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006419209.1| hypothetical protein CICLE_v10004653mg [Citr...   375   e-101
ref|XP_006488716.1| PREDICTED: protein CHUP1, chloroplastic-like...   374   e-101
ref|XP_007223070.1| hypothetical protein PRUPE_ppa003741mg [Prun...   366   2e-98
ref|XP_002314334.2| hypothetical protein POPTR_0010s00550g [Popu...   363   1e-97
emb|CBI26022.3| unnamed protein product [Vitis vinifera]              355   4e-95
ref|XP_002275219.2| PREDICTED: uncharacterized protein LOC100256...   354   6e-95
ref|XP_004297066.1| PREDICTED: uncharacterized protein LOC101306...   348   4e-93
ref|XP_002519338.1| conserved hypothetical protein [Ricinus comm...   347   1e-92
ref|XP_004508251.1| PREDICTED: uncharacterized protein LOC101511...   343   2e-91
ref|XP_006597906.1| PREDICTED: uncharacterized protein LOC100820...   335   3e-89
ref|XP_006597905.1| PREDICTED: uncharacterized protein LOC100820...   335   3e-89
ref|XP_007138573.1| hypothetical protein PHAVU_009G220500g [Phas...   334   6e-89
ref|XP_006587085.1| PREDICTED: protein CHUP1, chloroplastic-like...   334   8e-89
ref|NP_564524.1| hydroxyproline-rich glycoprotein-like protein [...   322   3e-85
ref|XP_003609889.1| Protein CHUP1 [Medicago truncatula] gi|35551...   320   1e-84
gb|ADN34042.1| hydroxyproline-rich glycoprotein family protein [...   308   6e-81
ref|XP_007036013.1| Hydroxyproline-rich glycoprotein family prot...   307   1e-80
ref|XP_007036016.1| Hydroxyproline-rich glycoprotein family prot...   305   5e-80
ref|XP_006393445.1| hypothetical protein EUTSA_v10012201mg [Eutr...   300   2e-78
ref|XP_006840355.1| hypothetical protein AMTR_s00045p00113980 [A...   300   2e-78

>ref|XP_006419209.1| hypothetical protein CICLE_v10004653mg [Citrus clementina]
            gi|557521082|gb|ESR32449.1| hypothetical protein
            CICLE_v10004653mg [Citrus clementina]
          Length = 561

 Score =  375 bits (963), Expect = e-101
 Identities = 247/540 (45%), Positives = 319/540 (59%), Gaps = 14/540 (2%)
 Frame = -3

Query: 1580 MKQEIPAKPSENKVMQSTHLQTTPSRFRSTATKVVKESPRSEL-VNGVS--PGLKSRPVI 1410
            MKQ      + N  M  +   TT SR R  A    +ESP+ E  +NGVS  P LK+R   
Sbjct: 1    MKQHQELSKTNN--MSHSTAATTTSRLR--ANSKTRESPKQEAGINGVSLSPELKARAKS 56

Query: 1409 MPIPTESSPSINSQQKVRRSLLGLNSKPKLAT-------NEDVKTVSRFGNRSLNEQFLR 1251
            +P   ++    N+  K RR+L+ LN KPK A        +++VK   R  NR + EQF R
Sbjct: 57   VPPDVKT----NNISKSRRALV-LN-KPKSAEGAVGSHKDDEVKVFGRSLNRPVVEQFAR 110

Query: 1250 PRRN--VDSVVCDKNGDESEVKVKELQEKLAVSENLVQNLQDEVLDLRNQLVKXXXXXXX 1077
            PRR   VD+          + K KE +EKL +SENLV++LQ EV  L+ + VK       
Sbjct: 111  PRRQRIVDANPGKIEDGLMDKKKKEFEEKLRLSENLVKDLQSEVFALKAEFVKAQSLNAE 170

Query: 1076 XXXXNRVYEEKL--SQAKNSARSNHDQMEASTTEGLQRPRLEDVQEFTGYKLGILQGRKD 903
                N+   E L  ++AK ++ S+ +Q EA      Q P+ +DVQ+    KL       D
Sbjct: 171  LEKQNKKLVEDLVAAEAKIASLSSREQREA--VGEYQSPKFKDVQKLIANKLEHSIVMTD 228

Query: 902  VAIDRXXXXXXXXXXXXXXXXXSAADTQXXXXXXXXXXXXXXXXXXXXXXXKAVTPRKTS 723
               +                   AA  +                       +A   +KT 
Sbjct: 229  AISETSINTPPSEPKIPIRN---AAGVERKPQAYPSMPAPLPPPPPPRPPARAAATQKTP 285

Query: 722  SIVQFYHSLTRQEGKNETSRGTNRNYPVPANAHNSIVGEIQNRSSHLLAIRRDIETKGEL 543
            S  Q YHSLT+Q  K +     N+  P  + AH+SIVGEIQNRS+HLLAI+ DIETKG  
Sbjct: 286  SFAQLYHSLTKQVEKKDLPSPVNQKRPAVSIAHSSIVGEIQNRSAHLLAIKADIETKGGF 345

Query: 542  VNSLIDKVQTSAYSSIDDVLKFVVWLDGELASLADERAVLKHFNWPERKADAMREAAIEY 363
            +NSLI KV  +AY++I+D+L+FV WLD EL+SLADERAVLKHF WPE+KADAM+EAA+EY
Sbjct: 346  INSLIQKVLAAAYTNIEDLLEFVDWLDKELSSLADERAVLKHFKWPEKKADAMQEAAVEY 405

Query: 362  RDLKRLESDVSSYKDEPFMPYETALKKMANLLDKSERSIQRLITLRTSTMVSYKDCKIPT 183
            RDLK+LE+++SSY+D+  +P+  ALKKMA+LLDKSERSIQRL+ LR S M SYKDCKIP 
Sbjct: 406  RDLKQLENEISSYRDDTNVPFGAALKKMASLLDKSERSIQRLVKLRNSVMHSYKDCKIPV 465

Query: 182  DWMLDSGMVSKIKQASMKLARMYMKRVCIXXXXXXXXXXXSTQEALVLQGVRFAFRAHQF 3
            DWMLDSG++SKIKQASMKLA+MYMKRV             STQEAL+LQG+ FA+RAHQF
Sbjct: 466  DWMLDSGIISKIKQASMKLAQMYMKRVTRELELVHNSDRESTQEALLLQGLHFAYRAHQF 525


>ref|XP_006488716.1| PREDICTED: protein CHUP1, chloroplastic-like [Citrus sinensis]
          Length = 561

 Score =  374 bits (961), Expect = e-101
 Identities = 244/531 (45%), Positives = 315/531 (59%), Gaps = 14/531 (2%)
 Frame = -3

Query: 1553 SENKVMQSTHLQTTPSRFRSTATKVVKESPRSEL-VNGVS--PGLKSRPVIMPIPTESSP 1383
            S+   M  +   TT SR R  A    +ESP+ E  +NGVS  P LK+R   +P   ++  
Sbjct: 8    SKTNSMSHSTAATTTSRLR--ANSKTRESPKQEAGINGVSLSPELKARAKSVPPDVKT-- 63

Query: 1382 SINSQQKVRRSLLGLNSKPKLAT-------NEDVKTVSRFGNRSLNEQFLRPRRN--VDS 1230
              N+  K R +L+ LN KPK A        +++VK   R  NR + EQF RPRR   VD+
Sbjct: 64   --NNISKSRMALV-LN-KPKSAEGAVGSHKDDEVKVFGRSLNRPVVEQFARPRRQRIVDA 119

Query: 1229 VVCDKNGDESEVKVKELQEKLAVSENLVQNLQDEVLDLRNQLVKXXXXXXXXXXXNRVYE 1050
                      + K KE +EKL +SENLV++LQ EV  L+ + VK           N+   
Sbjct: 120  NPGKIEDGLMDKKKKEFEEKLMLSENLVKDLQSEVFALKAEFVKAQSLNAELEKQNKKLV 179

Query: 1049 EKL--SQAKNSARSNHDQMEASTTEGLQRPRLEDVQEFTGYKLGILQGRKDVAIDRXXXX 876
            E L  ++AK ++ S+ +Q EA      Q P+ +DVQ+    KL       D   +     
Sbjct: 180  EDLVAAEAKIASLSSREQREA--VGEYQSPKFKDVQKLIANKLEHSIVMTDAISETSINT 237

Query: 875  XXXXXXXXXXXXXSAADTQXXXXXXXXXXXXXXXXXXXXXXXKAVTPRKTSSIVQFYHSL 696
                          AA  +                       +A   +KT S  Q YHSL
Sbjct: 238  PPSEPKIPIRN---AAGVERKPQAYPSMPAPLPPPPPPRPPARAAATQKTPSFAQLYHSL 294

Query: 695  TRQEGKNETSRGTNRNYPVPANAHNSIVGEIQNRSSHLLAIRRDIETKGELVNSLIDKVQ 516
            T+Q  K +     N+  P  + AH+SIVGEIQNRS+HLLAI+ DIETKG  +NSLI KV 
Sbjct: 295  TKQVEKKDLPSPVNQKRPAVSIAHSSIVGEIQNRSAHLLAIKADIETKGGFINSLIQKVL 354

Query: 515  TSAYSSIDDVLKFVVWLDGELASLADERAVLKHFNWPERKADAMREAAIEYRDLKRLESD 336
             +AY++I+D+L+FV WLD EL+SLADERAVLKHF WPE+KADAMREAA+EYRDLK+LE++
Sbjct: 355  AAAYTNIEDLLEFVDWLDKELSSLADERAVLKHFKWPEKKADAMREAAVEYRDLKQLENE 414

Query: 335  VSSYKDEPFMPYETALKKMANLLDKSERSIQRLITLRTSTMVSYKDCKIPTDWMLDSGMV 156
            +SSY+D+  +P+  ALKKMA+LLDKSERSIQRL+ LR S M SYKDCKIP DWMLDSG++
Sbjct: 415  ISSYRDDTNVPFGAALKKMASLLDKSERSIQRLVKLRNSVMHSYKDCKIPVDWMLDSGII 474

Query: 155  SKIKQASMKLARMYMKRVCIXXXXXXXXXXXSTQEALVLQGVRFAFRAHQF 3
            SKIKQASMKLA+MYMKRV             STQEAL+LQG+ FA+RAHQF
Sbjct: 475  SKIKQASMKLAQMYMKRVTRELELVHNSDRESTQEALLLQGLHFAYRAHQF 525


>ref|XP_007223070.1| hypothetical protein PRUPE_ppa003741mg [Prunus persica]
            gi|462420006|gb|EMJ24269.1| hypothetical protein
            PRUPE_ppa003741mg [Prunus persica]
          Length = 552

 Score =  366 bits (940), Expect = 2e-98
 Identities = 239/545 (43%), Positives = 308/545 (56%), Gaps = 19/545 (3%)
 Frame = -3

Query: 1580 MKQEIP--AKPSENKVMQSTHLQTTPSRFRSTATKVVKESPRSELVNGVSPGLKSRPVIM 1407
            MKQ  P  +  SE+KV  +    T PS  R++A+   KESP                   
Sbjct: 1    MKQGTPPSSTKSESKVSGNMSQPTPPSYLRASASSKAKESP------------------- 41

Query: 1406 PIPTESSPSINSQQKVRRSLLGLNSKPKLATN----------EDVKTVSRFGNRSLNEQF 1257
                  SP  +  + +RRSLL LN KPK              E+ K V R GNR + EQF
Sbjct: 42   ------SPRPSRAKSIRRSLL-LN-KPKSGELVLGSQKSKELEETKAVGRPGNRQVAEQF 93

Query: 1256 LRPRRNVDSVVCDKNGDES-EVKVKELQEKLAVSENLVQNLQDEVLDLRNQLVKXXXXXX 1080
             RPR    +    K  +E   VK +ELQE+L +SE+L  N Q EVL L+ +L K      
Sbjct: 94   ARPRPQRPADPNSKRNEEDPHVKNRELQERLDMSESLTMNFQAEVLALKAELDKAQGLNV 153

Query: 1079 XXXXXNRVYEEKLS--QAKNSARSNHDQMEASTTEGLQRPRLEDVQEFTGYKLGILQGRK 906
                 N+   EKL+  +AK +A +  +Q E  T    Q P+ +D+Q+    KL     +K
Sbjct: 154  ELQSQNKNLTEKLAAAEAKIAAFTTREQRE--TNGEYQSPKFKDLQKLIANKLERPVVKK 211

Query: 905  DVAIDRXXXXXXXXXXXXXXXXXSAADTQXXXXXXXXXXXXXXXXXXXXXXXKAVTP--- 735
            +   ++                 +A  +                           T    
Sbjct: 212  EAVKEKSANKTPAPAPTGAIPRVAATQSGPPPPPPPPPSVRSPTPPPPPPQPSVRTTTSA 271

Query: 734  -RKTSSIVQFYHSLTRQEGKNETSRGTNRNYPVPANAHNSIVGEIQNRSSHLLAIRRDIE 558
             +K  S+V+F+HSL +QE K ++    N + P   +AHNSIVGEIQNRS+HLLAI+ D++
Sbjct: 272  TQKAPSLVEFFHSLRKQEVKRDSPESRNHHKPSAISAHNSIVGEIQNRSAHLLAIKADVQ 331

Query: 557  TKGELVNSLIDKVQTSAYSSIDDVLKFVVWLDGELASLADERAVLKHFNWPERKADAMRE 378
            TKGE +N LI KV  +AY+ I+DVLKFV WLDGEL+SLADERAVLKHF WPERKADAMRE
Sbjct: 332  TKGEFINDLIQKVLVAAYTDIEDVLKFVDWLDGELSSLADERAVLKHFKWPERKADAMRE 391

Query: 377  AAIEYRDLKRLESDVSSYKDEPFMPYETALKKMANLLDKSERSIQRLITLRTSTMVSYKD 198
            AAIEYRDLK L+S++SSYKD+  +P   ALKKMA LLDKSERSIQRLI LR S M SY++
Sbjct: 392  AAIEYRDLKLLQSEISSYKDDTDIPCAAALKKMAGLLDKSERSIQRLIKLRNSVMRSYQE 451

Query: 197  CKIPTDWMLDSGMVSKIKQASMKLARMYMKRVCIXXXXXXXXXXXSTQEALVLQGVRFAF 18
             KIP DWMLDSG+VSKIK+ASM LA +YMKRV +           ++QE+L+LQGV F +
Sbjct: 452  LKIPIDWMLDSGIVSKIKKASMNLANVYMKRVTMELESIRNSDRETSQESLLLQGVHFVY 511

Query: 17   RAHQF 3
            RAHQF
Sbjct: 512  RAHQF 516


>ref|XP_002314334.2| hypothetical protein POPTR_0010s00550g [Populus trichocarpa]
            gi|550328806|gb|EEF00505.2| hypothetical protein
            POPTR_0010s00550g [Populus trichocarpa]
          Length = 547

 Score =  363 bits (933), Expect = 1e-97
 Identities = 238/523 (45%), Positives = 305/523 (58%), Gaps = 18/523 (3%)
 Frame = -3

Query: 1517 TTPSRFRSTATKVVKESPRSELVNG----VSPGLKSRPVIMPIPTESSPSINSQQKVRRS 1350
            TTPSR R       K    +E+ N      SP  K+R   +P      P +    KVR+S
Sbjct: 6    TTPSRHRVN----FKTPKPAEVANNGSPVPSPANKTRAKSVP------PDVKKDTKVRKS 55

Query: 1349 LLGLNSKPK----LATNEDVKTVSRFGNRSLNEQFLRPRRN---VDSVVCDKNGDESEVK 1191
            L+G N+KPK    +  ++DV  V R  NR  +EQF RPRR    +D +   +  +E   K
Sbjct: 56   LVG-NNKPKSGELVVGSQDVTVVGRSVNRPGSEQFARPRRQRPVLDPINASRRNEEESYK 114

Query: 1190 VKELQEKLAVSENLVQNLQDEVLDLRNQLVKXXXXXXXXXXXNRVYEEKLS--QAKNSAR 1017
             K L EKL +SE L+ +LQ EVL L+ +L K           N+   E L+  +AK SA 
Sbjct: 115  -KGLHEKLELSETLINDLQSEVLALKVELDKANGLNQELELQNKKLTEDLAAAEAKVSAL 173

Query: 1016 SNHDQMEASTTEGLQRPRLEDVQEFTGYKLGILQGRKDVAID-----RXXXXXXXXXXXX 852
            +   Q         QRPR +D+Q+    KL     +K+ AI+     +            
Sbjct: 174  NTRHQSVGEH----QRPRFKDIQKLIAIKLENSPVKKE-AINGPSKVKTPQSPPPPPVPR 228

Query: 851  XXXXXSAADTQXXXXXXXXXXXXXXXXXXXXXXXKAVTPRKTSSIVQFYHSLTRQEGKNE 672
                   A+ +                       +A T  KT +IV+FY+S+ +QEGK +
Sbjct: 229  FISKADVAERKAPTCPSLMPPPPPPPLPPMRPLARATTAPKTPAIVEFYNSIRKQEGKRD 288

Query: 671  TSRGTNRNYPVPANAHNSIVGEIQNRSSHLLAIRRDIETKGELVNSLIDKVQTSAYSSID 492
            +    ++  P   +AH+SIVGEIQNRS+HLLAI+ DIETKG+ +N LI KV  +AY+ I+
Sbjct: 289  SPGLRSQYKPEKTSAHSSIVGEIQNRSTHLLAIKADIETKGDFINGLIQKVLAAAYTDIE 348

Query: 491  DVLKFVVWLDGELASLADERAVLKHFNWPERKADAMREAAIEYRDLKRLESDVSSYKDEP 312
            DVLKFV WLDGEL+SLADERAVLKHF WPE+KADA+REAAIEYR LK LES++SS+KDE 
Sbjct: 349  DVLKFVDWLDGELSSLADERAVLKHFKWPEKKADAIREAAIEYRGLKLLESEISSFKDES 408

Query: 311  FMPYETALKKMANLLDKSERSIQRLITLRTSTMVSYKDCKIPTDWMLDSGMVSKIKQASM 132
              P  TALKKMA L DKSERSIQ+LI LR S M SY+  KIPTDWMLDSG+VSKIKQASM
Sbjct: 409  NNPCGTALKKMAVLHDKSERSIQKLIKLRNSVMNSYQAWKIPTDWMLDSGIVSKIKQASM 468

Query: 131  KLARMYMKRVCIXXXXXXXXXXXSTQEALVLQGVRFAFRAHQF 3
            +LA+MYMKRV               QEAL+LQG+ FA+RAHQF
Sbjct: 469  RLAKMYMKRVITELELARNSERECNQEALLLQGLHFAYRAHQF 511


>emb|CBI26022.3| unnamed protein product [Vitis vinifera]
          Length = 572

 Score =  355 bits (911), Expect = 4e-95
 Identities = 240/540 (44%), Positives = 314/540 (58%), Gaps = 13/540 (2%)
 Frame = -3

Query: 1583 AMKQEIPAKPSENKVMQSTHLQTTPSRFRSTATKVVKESPRSELVNGVS-PGLKSRPVIM 1407
            AMKQ     P+  K    +HL+  PS   S+++    +     ++NGVS P    RP   
Sbjct: 21   AMKQN---PPTPCKTTTPSHLRR-PSSSSSSSSSSSSKVKAVGVLNGVSSPSPAPRPRAR 76

Query: 1406 PIPTESSPSINSQQKVRRSLLGLNSKPKLATN----------EDVKTVSRFGNRSLNEQF 1257
              P E    +N+  K RRSLL LN KPK   +          E+VK + R  NR + +Q 
Sbjct: 77   SGPLE----MNNSHKARRSLL-LN-KPKSGDHALGSQKPRDAEEVKVMGRSRNRPVVDQ- 129

Query: 1256 LRPRRNVDSVVCDKNGDESEVKVKELQEKLAVSENLVQNLQDEVLDLRNQLVKXXXXXXX 1077
            L PRR          G E + K KELQEKL + +NL+ NLQ EVL L+ +L K       
Sbjct: 130  LAPRR-------PSEGPEPDDKTKELQEKLDLRQNLINNLQSEVLGLKAELDKAQSFNLE 182

Query: 1076 XXXXNRVYEEKLSQA--KNSARSNHDQMEASTTEGLQRPRLEDVQEFTGYKLGILQGRKD 903
                N    E L+ A  K +A ++  Q E S TE  Q P+ +D+Q+    KL   + +++
Sbjct: 183  LQSLNAKLTEDLAAALAKITALTSRQQ-EESVTE-YQSPKFKDIQKLIANKLEHPKIKQE 240

Query: 902  VAIDRXXXXXXXXXXXXXXXXXSAADTQXXXXXXXXXXXXXXXXXXXXXXXKAVTPRKTS 723
             + +                   A D+Q                        A   RK  
Sbjct: 241  ASNEASTVQAPSAASVPRVPR--AMDSQRKVPPCPAPPPPPLPPPQPPAR--AAATRKAP 296

Query: 722  SIVQFYHSLTRQEGKNETSRGTNRNYPVPANAHNSIVGEIQNRSSHLLAIRRDIETKGEL 543
            ++V+FYHSLT+  GK + ++  N N  V ++AH+SIVGEIQNRS+H LAI+ DIETKG+ 
Sbjct: 297  TLVEFYHSLTKGVGKRDFAQSGNHNKLVVSSAHSSIVGEIQNRSAHQLAIKADIETKGDF 356

Query: 542  VNSLIDKVQTSAYSSIDDVLKFVVWLDGELASLADERAVLKHFNWPERKADAMREAAIEY 363
            +N LI +V  ++YS ++D++KFV WLD EL++LADERAVLKHF WPE+KADAMREAAIEY
Sbjct: 357  INGLIQRVLAASYSDMEDIVKFVDWLDNELSTLADERAVLKHFKWPEKKADAMREAAIEY 416

Query: 362  RDLKRLESDVSSYKDEPFMPYETALKKMANLLDKSERSIQRLITLRTSTMVSYKDCKIPT 183
            RDLK LES+VS YKD   +P   ALKKMA LLDKSERSIQRLI LR S + SY++C IPT
Sbjct: 417  RDLKLLESEVSCYKDNANVPCGVALKKMAGLLDKSERSIQRLIKLRNSVVRSYQECGIPT 476

Query: 182  DWMLDSGMVSKIKQASMKLARMYMKRVCIXXXXXXXXXXXSTQEALVLQGVRFAFRAHQF 3
             WMLDSG+VSKIKQAS+ LA+MYM+RV +           S+QEAL+LQGV FA+RAHQF
Sbjct: 477  GWMLDSGIVSKIKQASINLAKMYMQRVAMELESVRNSERESSQEALLLQGVHFAYRAHQF 536


>ref|XP_002275219.2| PREDICTED: uncharacterized protein LOC100256278 [Vitis vinifera]
          Length = 551

 Score =  354 bits (909), Expect = 6e-95
 Identities = 234/520 (45%), Positives = 302/520 (58%), Gaps = 15/520 (2%)
 Frame = -3

Query: 1517 TTPSRFRSTATKVVKESPRSELVN--GVSPGLKS-RPVIMPIPTESSPSINSQQKVRRSL 1347
            TTPS  R  ++     S  S  V   GV  G+ S  P   P        +N+  K RRSL
Sbjct: 12   TTPSHLRRPSSSSSSSSSSSSKVKAVGVLNGVSSPSPAPRPRARSGPLEMNNSHKARRSL 71

Query: 1346 LGLNSKPKLATN----------EDVKTVSRFGNRSLNEQFLRPRRNVDSVVCDKNGDESE 1197
            L LN KPK   +          E+VK + R  NR + +Q L PRR          G E +
Sbjct: 72   L-LN-KPKSGDHALGSQKPRDAEEVKVMGRSRNRPVVDQ-LAPRR-------PSEGPEPD 121

Query: 1196 VKVKELQEKLAVSENLVQNLQDEVLDLRNQLVKXXXXXXXXXXXNRVYEEKLSQA--KNS 1023
             K KELQEKL + +NL+ NLQ EVL L+ +L K           N    E L+ A  K +
Sbjct: 122  DKTKELQEKLDLRQNLINNLQSEVLGLKAELDKAQSFNLELQSLNAKLTEDLAAALAKIT 181

Query: 1022 ARSNHDQMEASTTEGLQRPRLEDVQEFTGYKLGILQGRKDVAIDRXXXXXXXXXXXXXXX 843
            A ++  Q E S TE  Q P+ +D+Q+    KL   + +++ + +                
Sbjct: 182  ALTSRQQ-EESVTE-YQSPKFKDIQKLIANKLEHPKIKQEASNEASTVQAPSAASVPRVP 239

Query: 842  XXSAADTQXXXXXXXXXXXXXXXXXXXXXXXKAVTPRKTSSIVQFYHSLTRQEGKNETSR 663
               A D+Q                        A   RK  ++V+FYHSLT+  GK + ++
Sbjct: 240  R--AMDSQRKVPPCPAPPPPPLPPPQPPAR--AAATRKAPTLVEFYHSLTKGVGKRDFAQ 295

Query: 662  GTNRNYPVPANAHNSIVGEIQNRSSHLLAIRRDIETKGELVNSLIDKVQTSAYSSIDDVL 483
              N N  V ++AH+SIVGEIQNRS+H LAI+ DIETKG+ +N LI +V  ++YS ++D++
Sbjct: 296  SGNHNKLVVSSAHSSIVGEIQNRSAHQLAIKADIETKGDFINGLIQRVLAASYSDMEDIV 355

Query: 482  KFVVWLDGELASLADERAVLKHFNWPERKADAMREAAIEYRDLKRLESDVSSYKDEPFMP 303
            KFV WLD EL++LADERAVLKHF WPE+KADAMREAAIEYRDLK LES+VS YKD   +P
Sbjct: 356  KFVDWLDNELSTLADERAVLKHFKWPEKKADAMREAAIEYRDLKLLESEVSCYKDNANVP 415

Query: 302  YETALKKMANLLDKSERSIQRLITLRTSTMVSYKDCKIPTDWMLDSGMVSKIKQASMKLA 123
               ALKKMA LLDKSERSIQRLI LR S + SY++C IPT WMLDSG+VSKIKQAS+ LA
Sbjct: 416  CGVALKKMAGLLDKSERSIQRLIKLRNSVVRSYQECGIPTGWMLDSGIVSKIKQASINLA 475

Query: 122  RMYMKRVCIXXXXXXXXXXXSTQEALVLQGVRFAFRAHQF 3
            +MYM+RV +           S+QEAL+LQGV FA+RAHQF
Sbjct: 476  KMYMQRVAMELESVRNSERESSQEALLLQGVHFAYRAHQF 515


>ref|XP_004297066.1| PREDICTED: uncharacterized protein LOC101306034 [Fragaria vesca
            subsp. vesca]
          Length = 560

 Score =  348 bits (893), Expect = 4e-93
 Identities = 226/539 (41%), Positives = 298/539 (55%), Gaps = 20/539 (3%)
 Frame = -3

Query: 1559 KPSENKVMQSTHLQTTPSRFRSTATKVVKESPRSELVNGVSPGLKSRPVIMPIPTESSPS 1380
            K   +  +QS H  T  S+ R+++     +SP        S        + P    SS S
Sbjct: 4    KQGASTKIQSKH-STNMSQLRASSKAKESQSPTQRPSRAKS--------VTPDVNHSSDS 54

Query: 1379 INSQQKVRRSLLGLNSKPKLATN----------EDVKTVSRFGNRSLNEQFLRPRRN--V 1236
                + VRR+LL   +KPK              E+ K V       + EQF +PRR   V
Sbjct: 55   ----RSVRRALL--QNKPKSGELVLGSQKSKDFEEFKVVGSSRKPQVVEQFAKPRRQRPV 108

Query: 1235 DSVVCDKNGDESEVKVKELQEKLAVSENLVQNLQDEVLDLRNQLVKXXXXXXXXXXXNRV 1056
                C +N D+    +KE+QEK+ +SE+++  LQ EVL L+ +L K           N+ 
Sbjct: 109  VEANCKRNEDDPHRNMKEMQEKIEMSESMIMKLQAEVLGLKVELDKEHGLNLELQAKNKK 168

Query: 1055 YEEKLS--QAKNSARSNHDQMEASTTEGLQRPRLEDVQEFTGYKLGILQGRKDVAID--- 891
              E L+  +AK +A +   Q E++   G Q P+ +D+Q+    KL     +K+   +   
Sbjct: 169  LSENLTAAEAKIAALTTPQQRESN---GYQSPKFKDLQKLIANKLECSVVKKEALNEPSP 225

Query: 890  ---RXXXXXXXXXXXXXXXXXSAADTQXXXXXXXXXXXXXXXXXXXXXXXKAVTPRKTSS 720
                                   A T                        +  T +K   
Sbjct: 226  IKAASPPPPPPPPPPPPPVIPRVAATFSPPPPPPPPSLLPPPPPPPQPSVRVSTTQKAPE 285

Query: 719  IVQFYHSLTRQEGKNETSRGTNRNYPVPANAHNSIVGEIQNRSSHLLAIRRDIETKGELV 540
            +VQ YHSL ++E K ++    +   P   +AHNSIVGEIQNRS+HL+AI+ D+ETKGE +
Sbjct: 286  LVQIYHSLRKREVKRDSPESRSHQKPGAISAHNSIVGEIQNRSAHLIAIKADVETKGEFI 345

Query: 539  NSLIDKVQTSAYSSIDDVLKFVVWLDGELASLADERAVLKHFNWPERKADAMREAAIEYR 360
            N LI KV  +AY  I+DVLKFV WLDGELASLADERAVLKHF WPERKADAMREAAIEYR
Sbjct: 346  NGLIQKVLAAAYKDIEDVLKFVDWLDGELASLADERAVLKHFKWPERKADAMREAAIEYR 405

Query: 359  DLKRLESDVSSYKDEPFMPYETALKKMANLLDKSERSIQRLITLRTSTMVSYKDCKIPTD 180
            DLK LES++SSYKD+  +    ALKKMA LLDKSERSIQRL+ +R S M SY++CKIPTD
Sbjct: 406  DLKLLESEISSYKDDTTIQCAAALKKMAGLLDKSERSIQRLVKMRNSVMRSYQECKIPTD 465

Query: 179  WMLDSGMVSKIKQASMKLARMYMKRVCIXXXXXXXXXXXSTQEALVLQGVRFAFRAHQF 3
            WMLDSG+ SKIKQAS+ LA++YMKRV             S+QE+L++QGV FA+RAHQF
Sbjct: 466  WMLDSGIGSKIKQASINLAKIYMKRVTSELESVRYSDRESSQESLLVQGVNFAYRAHQF 524


>ref|XP_002519338.1| conserved hypothetical protein [Ricinus communis]
            gi|223541653|gb|EEF43202.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 532

 Score =  347 bits (890), Expect = 1e-92
 Identities = 224/515 (43%), Positives = 303/515 (58%), Gaps = 10/515 (1%)
 Frame = -3

Query: 1517 TTPSRFRSTATKVVKESPRSELVNGVSPGLKSRPVIMPIPTESSPSINSQQKVRRSLLGL 1338
            TTPSRFR  +     ++P+ E      P  K R   +P      P      K+RRS+L +
Sbjct: 5    TTPSRFRLNS-----KAPKPE-----PPAKKERAQSVP------PDFKKDTKLRRSVL-V 47

Query: 1337 NSKPK-----LATNEDVKTV---SRFGNRSLNEQFLRPRRNVDSVVCDKNGDESEVKVKE 1182
            N+KPK     L +  +V  V   S   NR ++EQF +PR    +        E + K KE
Sbjct: 48   NTKPKSRDELLGSQMEVARVVSPSLSVNRPVHEQFSKPRTQRSA-----RKIEEDTK-KE 101

Query: 1181 LQEKLAVSENLVQNLQDEVLDLRNQLVKXXXXXXXXXXXNRVYEEKLS--QAKNSARSNH 1008
            L E++ +++NL+Q+L+ +VL L+ +L K           N+  ++ L+  +AK +A  N+
Sbjct: 102  LLERIELNDNLIQDLKSQVLSLKAELDKAQSLNEELESQNKKLQQDLASAEAKVAAALNN 161

Query: 1007 DQMEASTTEGLQRPRLEDVQEFTGYKLGILQGRKDVAIDRXXXXXXXXXXXXXXXXXSAA 828
              +  S   G Q P+ +D+Q+    KL     +KD A++                    +
Sbjct: 162  TPLPESIG-GYQSPKFKDIQKLIANKLENSTVKKD-AMNGPTSVKTPSPPPPSRPIHLLS 219

Query: 827  DTQXXXXXXXXXXXXXXXXXXXXXXXKAVTPRKTSSIVQFYHSLTRQEGKNETSRGTNRN 648
              +                       +A T  KT +IV+FY SL +   K       N+ 
Sbjct: 220  KAETKAPSCPSLPPPPPPPPPLRPLARAATAPKTPAIVEFYQSLRKHGEKRHVQGHENQY 279

Query: 647  YPVPANAHNSIVGEIQNRSSHLLAIRRDIETKGELVNSLIDKVQTSAYSSIDDVLKFVVW 468
             PV  +AH+S+VGEIQNRS+HLLAI+ DIETKG+ +N LI KV   AY+ I+DVLKFV W
Sbjct: 280  KPVVTSAHSSVVGEIQNRSAHLLAIKSDIETKGDFINGLIKKVLAVAYTDIEDVLKFVDW 339

Query: 467  LDGELASLADERAVLKHFNWPERKADAMREAAIEYRDLKRLESDVSSYKDEPFMPYETAL 288
            LDGEL++LADERAVLKHFNWPERKADA+REAAIEYR LK+LE+++SS+KD+P +P  +AL
Sbjct: 340  LDGELSTLADERAVLKHFNWPERKADAIREAAIEYRSLKQLENEISSFKDDPSIPCGSAL 399

Query: 287  KKMANLLDKSERSIQRLITLRTSTMVSYKDCKIPTDWMLDSGMVSKIKQASMKLARMYMK 108
            KKMA LLDKSER I RL+ LR S + SY++ KIP++WMLDSGM+SKIKQASMKLA+MYM+
Sbjct: 400  KKMAILLDKSERGIGRLVKLRNSVLRSYQEWKIPSNWMLDSGMMSKIKQASMKLAKMYMR 459

Query: 107  RVCIXXXXXXXXXXXSTQEALVLQGVRFAFRAHQF 3
            RV             S QEALVLQGV FA+RAHQF
Sbjct: 460  RVIEELEVGRNTDRESNQEALVLQGVNFAYRAHQF 494


>ref|XP_004508251.1| PREDICTED: uncharacterized protein LOC101511271 [Cicer arietinum]
          Length = 933

 Score =  343 bits (879), Expect = 2e-91
 Identities = 216/529 (40%), Positives = 304/529 (57%), Gaps = 12/529 (2%)
 Frame = -3

Query: 1553 SENKVMQSTHLQTTPSRFRST-ATKVVKESPRS--ELVNGVSPGLKS-RPVIMPIPTESS 1386
            S+NK   S    TT +R R    +  +KESP++  E+VN     + S R   +P      
Sbjct: 60   SDNKSQHSIPQTTTTTRIRVVKGSSKIKESPKTPPEIVNNNRASISSTRAKSVP------ 113

Query: 1385 PSINSQQKVRRSLLGLNSKPKLATNEDVKTVSRFGNRSLNEQ---FLRPRRNVDSVVCDK 1215
            P + +  K +R ++ +N   K  +NE+V+  S+ G +   E     +RPRR         
Sbjct: 114  PDLKNNSKAKRGIVVMNKLVK--SNEEVECSSQKGTKEAEEAKIVVVRPRRR------RT 165

Query: 1214 NGDESEVKVKELQEKLAVSENLVQNLQDEVLDLRNQLVKXXXXXXXXXXXNRVYEEKLS- 1038
            N D  E + KE+ EKL +S+NL++NL+ EV  L+ +L K           N    + L+ 
Sbjct: 166  NDDPDEKEKKEMVEKLEMSDNLIKNLESEVKALKAELDKVKNLNVELESQNVKLTQNLAA 225

Query: 1037 -QAKNSARSNHDQMEASTTEGLQRPRLEDVQEFTGYKLGILQGRKDV---AIDRXXXXXX 870
             +AK +A  +++  +       Q P+ +D+Q+    KL + + +K+     I        
Sbjct: 226  AEAKIAAVGSNNSRKKELIGEHQSPKFKDIQKLIADKLEMSKVKKEANHEVIFVKASIPA 285

Query: 869  XXXXXXXXXXXSAADTQXXXXXXXXXXXXXXXXXXXXXXXKAVTPRKTSSIVQFYHSLTR 690
                       ++   +                       K    +K  ++VQ +HSL  
Sbjct: 286  PTQNHAIPETTTSLGRKFPPNLCVMPPPPPPPPIPSRPLAKLANTQKAPAVVQLFHSLKN 345

Query: 689  QEGKNETSRGTNRNYPVPANAHNSIVGEIQNRSSHLLAIRRDIETKGELVNSLIDKVQTS 510
            Q+GK ++    N + P+  +AH+SIVGEIQNRS+HLLAIR DI+TKGE +N LI KV  +
Sbjct: 346  QDGKKDSKGSINHHKPIAISAHSSIVGEIQNRSAHLLAIRADIQTKGEFINDLIKKVVDA 405

Query: 509  AYSSIDDVLKFVVWLDGELASLADERAVLKHFNWPERKADAMREAAIEYRDLKRLESDVS 330
            AY  I+DVLKFV WLDGEL++LADERAVLKHF WPE+KADAMREAA+EYR+LK LE ++S
Sbjct: 406  AYVEIEDVLKFVDWLDGELSTLADERAVLKHFKWPEKKADAMREAAVEYRELKMLEQEIS 465

Query: 329  SYKDEPFMPYETALKKMANLLDKSERSIQRLITLRTSTMVSYKDCKIPTDWMLDSGMVSK 150
            SYKD+P +P   +LKKMA+LLDKSERSIQ+LITLR S   SY+   IPT WMLDSG+ SK
Sbjct: 466  SYKDDPDIPCAASLKKMASLLDKSERSIQKLITLRNSVTRSYQMYNIPTAWMLDSGITSK 525

Query: 149  IKQASMKLARMYMKRVCIXXXXXXXXXXXSTQEALVLQGVRFAFRAHQF 3
            IK+ASM L +MYMKR+ +           S+Q++L+LQGV FA+RAHQF
Sbjct: 526  IKKASMTLVKMYMKRLTMELESIRNSDRESSQDSLLLQGVHFAYRAHQF 574


>ref|XP_006597906.1| PREDICTED: uncharacterized protein LOC100820086 isoform X2 [Glycine
            max] gi|571519858|ref|XP_006597907.1| PREDICTED:
            uncharacterized protein LOC100820086 isoform X3 [Glycine
            max] gi|571519862|ref|XP_006597908.1| PREDICTED:
            uncharacterized protein LOC100820086 isoform X4 [Glycine
            max] gi|571519866|ref|XP_006597909.1| PREDICTED:
            uncharacterized protein LOC100820086 isoform X5 [Glycine
            max] gi|571519870|ref|XP_006597910.1| PREDICTED:
            uncharacterized protein LOC100820086 isoform X6 [Glycine
            max] gi|571519874|ref|XP_006597911.1| PREDICTED:
            uncharacterized protein LOC100820086 isoform X7 [Glycine
            max]
          Length = 577

 Score =  335 bits (860), Expect = 3e-89
 Identities = 226/550 (41%), Positives = 300/550 (54%), Gaps = 24/550 (4%)
 Frame = -3

Query: 1580 MKQEIPAKPSENKVMQST---------HLQTTPSRFRSTATKVVKESPRS--ELVNGVSP 1434
            MKQE  A P+ + + Q T             TPSR R  +    +E P++  E+VN    
Sbjct: 1    MKQEPLASPTTSSLPQPTITTKNVIKIQNSLTPSRLRLPSK--YREPPKTPPEVVNN--- 55

Query: 1433 GLKSRPVIMPIPTESSPSINSQQKVRRSLLGLNSKPK---LATN------EDVKTVSRFG 1281
            G+ S P+        +P +    ++++ L+   +KP    L T       E+ K VSRF 
Sbjct: 56   GMVSTPLRRA--KSVTPELKHNSRIKKGLVLNKAKPNEEVLGTTQRGREVEEAKVVSRFV 113

Query: 1280 NRSLNEQFLRPRRNVDSVVCDKNGDESEVKVK-ELQEKLAVSENLVQNLQDEVLDLRNQL 1104
                 EQF RPR  V      ++ ++ + K K EL EKL  SE+L++NLQ EVL L+ +L
Sbjct: 114  RPHAVEQFSRPRSGVGDFAFKRDKEDPDGKSKKELMEKLEASESLIKNLQSEVLALKAEL 173

Query: 1103 VKXXXXXXXXXXXNRVYEEKLSQAKNSARSNHDQMEASTTEGLQRPRLEDVQEFTGYKLG 924
             K           NR   E L+ A+    S     +    E  Q P+ + +Q+    KL 
Sbjct: 174  EKVKGLNVELESNNRKLTEDLAAAEAKVVSLSGNEKEPNGEH-QSPKFKLIQKLIADKLE 232

Query: 923  ILQGRKDVAIDRXXXXXXXXXXXXXXXXXSAADTQXXXXXXXXXXXXXXXXXXXXXXXKA 744
                +K+ +I                   +   T                          
Sbjct: 233  RSIVKKE-SITNGGFVKASIPAQTAIPEVTTTRTGRKPTCNSCLPPPPPPMPPSIPSRPI 291

Query: 743  VTPRKTSSIVQF---YHSLTRQEGKNETSRGTNRNYPVPANAHNSIVGEIQNRSSHLLAI 573
                 T     F   +H+L  QEG   T+    +  PV  N H+SIVGEIQNRS+HLLAI
Sbjct: 292  AKANNTQRAPAFVKLFHTLKNQEGMKSTTGSGKQQRPVAVNVHSSIVGEIQNRSAHLLAI 351

Query: 572  RRDIETKGELVNSLIDKVQTSAYSSIDDVLKFVVWLDGELASLADERAVLKHFNWPERKA 393
            R DIETKGE +N LI KV  +AY+ I+DVL FV WLDGEL+SLADERAVLKHFNWPERKA
Sbjct: 352  RADIETKGEFINDLIKKVVEAAYTDIEDVLNFVNWLDGELSSLADERAVLKHFNWPERKA 411

Query: 392  DAMREAAIEYRDLKRLESDVSSYKDEPFMPYETALKKMANLLDKSERSIQRLITLRTSTM 213
            DA+REAA+EYR+LK LE ++SS+KD+P +P   +L+KMA+LLDKSE SIQRLI LR S M
Sbjct: 412  DAIREAAVEYRELKSLEQEISSFKDDPEIPCGASLRKMASLLDKSESSIQRLIKLRNSAM 471

Query: 212  VSYKDCKIPTDWMLDSGMVSKIKQASMKLARMYMKRVCIXXXXXXXXXXXSTQEALVLQG 33
             SY++ KIPT WMLDSG+++KIKQASM L +MYMKRV +           S+QE+L+LQG
Sbjct: 472  RSYQEYKIPTAWMLDSGIMTKIKQASMTLVKMYMKRVTMELGSARNSDRQSSQESLLLQG 531

Query: 32   VRFAFRAHQF 3
            + FA+RAHQF
Sbjct: 532  MHFAYRAHQF 541


>ref|XP_006597905.1| PREDICTED: uncharacterized protein LOC100820086 isoform X1 [Glycine
            max]
          Length = 596

 Score =  335 bits (860), Expect = 3e-89
 Identities = 226/550 (41%), Positives = 300/550 (54%), Gaps = 24/550 (4%)
 Frame = -3

Query: 1580 MKQEIPAKPSENKVMQST---------HLQTTPSRFRSTATKVVKESPRS--ELVNGVSP 1434
            MKQE  A P+ + + Q T             TPSR R  +    +E P++  E+VN    
Sbjct: 20   MKQEPLASPTTSSLPQPTITTKNVIKIQNSLTPSRLRLPSK--YREPPKTPPEVVNN--- 74

Query: 1433 GLKSRPVIMPIPTESSPSINSQQKVRRSLLGLNSKPK---LATN------EDVKTVSRFG 1281
            G+ S P+        +P +    ++++ L+   +KP    L T       E+ K VSRF 
Sbjct: 75   GMVSTPLRRA--KSVTPELKHNSRIKKGLVLNKAKPNEEVLGTTQRGREVEEAKVVSRFV 132

Query: 1280 NRSLNEQFLRPRRNVDSVVCDKNGDESEVKVK-ELQEKLAVSENLVQNLQDEVLDLRNQL 1104
                 EQF RPR  V      ++ ++ + K K EL EKL  SE+L++NLQ EVL L+ +L
Sbjct: 133  RPHAVEQFSRPRSGVGDFAFKRDKEDPDGKSKKELMEKLEASESLIKNLQSEVLALKAEL 192

Query: 1103 VKXXXXXXXXXXXNRVYEEKLSQAKNSARSNHDQMEASTTEGLQRPRLEDVQEFTGYKLG 924
             K           NR   E L+ A+    S     +    E  Q P+ + +Q+    KL 
Sbjct: 193  EKVKGLNVELESNNRKLTEDLAAAEAKVVSLSGNEKEPNGEH-QSPKFKLIQKLIADKLE 251

Query: 923  ILQGRKDVAIDRXXXXXXXXXXXXXXXXXSAADTQXXXXXXXXXXXXXXXXXXXXXXXKA 744
                +K+ +I                   +   T                          
Sbjct: 252  RSIVKKE-SITNGGFVKASIPAQTAIPEVTTTRTGRKPTCNSCLPPPPPPMPPSIPSRPI 310

Query: 743  VTPRKTSSIVQF---YHSLTRQEGKNETSRGTNRNYPVPANAHNSIVGEIQNRSSHLLAI 573
                 T     F   +H+L  QEG   T+    +  PV  N H+SIVGEIQNRS+HLLAI
Sbjct: 311  AKANNTQRAPAFVKLFHTLKNQEGMKSTTGSGKQQRPVAVNVHSSIVGEIQNRSAHLLAI 370

Query: 572  RRDIETKGELVNSLIDKVQTSAYSSIDDVLKFVVWLDGELASLADERAVLKHFNWPERKA 393
            R DIETKGE +N LI KV  +AY+ I+DVL FV WLDGEL+SLADERAVLKHFNWPERKA
Sbjct: 371  RADIETKGEFINDLIKKVVEAAYTDIEDVLNFVNWLDGELSSLADERAVLKHFNWPERKA 430

Query: 392  DAMREAAIEYRDLKRLESDVSSYKDEPFMPYETALKKMANLLDKSERSIQRLITLRTSTM 213
            DA+REAA+EYR+LK LE ++SS+KD+P +P   +L+KMA+LLDKSE SIQRLI LR S M
Sbjct: 431  DAIREAAVEYRELKSLEQEISSFKDDPEIPCGASLRKMASLLDKSESSIQRLIKLRNSAM 490

Query: 212  VSYKDCKIPTDWMLDSGMVSKIKQASMKLARMYMKRVCIXXXXXXXXXXXSTQEALVLQG 33
             SY++ KIPT WMLDSG+++KIKQASM L +MYMKRV +           S+QE+L+LQG
Sbjct: 491  RSYQEYKIPTAWMLDSGIMTKIKQASMTLVKMYMKRVTMELGSARNSDRQSSQESLLLQG 550

Query: 32   VRFAFRAHQF 3
            + FA+RAHQF
Sbjct: 551  MHFAYRAHQF 560


>ref|XP_007138573.1| hypothetical protein PHAVU_009G220500g [Phaseolus vulgaris]
            gi|593330295|ref|XP_007138574.1| hypothetical protein
            PHAVU_009G220500g [Phaseolus vulgaris]
            gi|561011660|gb|ESW10567.1| hypothetical protein
            PHAVU_009G220500g [Phaseolus vulgaris]
            gi|561011661|gb|ESW10568.1| hypothetical protein
            PHAVU_009G220500g [Phaseolus vulgaris]
          Length = 584

 Score =  334 bits (857), Expect = 6e-89
 Identities = 222/548 (40%), Positives = 297/548 (54%), Gaps = 27/548 (4%)
 Frame = -3

Query: 1565 PAKPSENKVMQSTHLQTTPSRFRSTATKVVKESPRS--ELVNGV--SPGLKSRPVIMPIP 1398
            P  P   K +       TPSR R  +    +E PR+  E+VNGV  +P  +++ V     
Sbjct: 16   PQPPPTTKNVIRLQSSLTPSRLRLPSK--YREPPRTPPEVVNGVVSTPTRRAKSV----- 68

Query: 1397 TESSPSINSQQKVRRSLLGLNSKPKLATNEDV------------KTVSRFGNRSLNEQFL 1254
               +P +    +++R L+   +KP    NE+V            K V RF      EQF 
Sbjct: 69   ---TPELKHASRIKRGLVLNKAKP----NEEVVGTHRGREAVEPKAVPRFMRPHAVEQFA 121

Query: 1253 RPRRNVDSVVCDKNGDESEVKVK-ELQEKLAVSENLVQNLQDEVLDLRNQLVKXXXXXXX 1077
             PR  V      ++ +E + K K EL EKL VSE+L++NLQ EVL L+ +L K       
Sbjct: 122  SPRSAVGDFAMKRDKEEPDGKSKKELMEKLEVSESLIRNLQSEVLALKAELEKVKGLNVE 181

Query: 1076 XXXXNRVYEEKLSQAKNSARS--NHDQMEASTTEGLQRPRLEDVQEFTGYKLGILQGRKD 903
                NR   + ++ A++   S    ++M+    E  Q P+ + +Q+    KL   + +K+
Sbjct: 182  LESHNRKLTKDIAAAESKVMSLGGSEKMKEPIGEH-QSPKFKHIQKLIADKLERSRVKKE 240

Query: 902  VAIDRXXXXXXXXXXXXXXXXXSAADTQXXXXXXXXXXXXXXXXXXXXXXXKAVT----- 738
               D                   A   +                          +     
Sbjct: 241  ALTDGCFVKASTSAPTAIPTIPEATTIRIGRKPALKACLPPPPPPPPPMPPSIPSRPVAK 300

Query: 737  ---PRKTSSIVQFYHSLTRQEGKNETSRGTNRNYPVPANAHNSIVGEIQNRSSHLLAIRR 567
                ++  + V+ +HSL  QE    T+    +  P   N H+SIVGEIQNRS+HLLAIR 
Sbjct: 301  VSNTQRAPAFVKLFHSLKNQEEMKNTTGPVKQQKPDAVNVHSSIVGEIQNRSAHLLAIRA 360

Query: 566  DIETKGELVNSLIDKVQTSAYSSIDDVLKFVVWLDGELASLADERAVLKHFNWPERKADA 387
            DIETKG+ +N LI KV  +AY  I+DVL FV WLDGEL+SLADERAVLKHFNWPERKADA
Sbjct: 361  DIETKGDFINDLIKKVVEAAYMDIEDVLNFVNWLDGELSSLADERAVLKHFNWPERKADA 420

Query: 386  MREAAIEYRDLKRLESDVSSYKDEPFMPYETALKKMANLLDKSERSIQRLITLRTSTMVS 207
            MREAA+EYRDLK LE ++ S+KD+P +P   +L+KMA LLDKSE SIQRLI LR S M S
Sbjct: 421  MREAAVEYRDLKLLEQEIFSFKDDPEIPCGASLRKMATLLDKSECSIQRLIKLRNSVMRS 480

Query: 206  YKDCKIPTDWMLDSGMVSKIKQASMKLARMYMKRVCIXXXXXXXXXXXSTQEALVLQGVR 27
            Y+D KIPT WMLDSG+ +KIKQASM L +MYMKRV +           S+QE+L+LQG+ 
Sbjct: 481  YQDYKIPTAWMLDSGITAKIKQASMTLVKMYMKRVTMELGSAKNSDRQSSQESLLLQGMH 540

Query: 26   FAFRAHQF 3
            FA+RAHQF
Sbjct: 541  FAYRAHQF 548


>ref|XP_006587085.1| PREDICTED: protein CHUP1, chloroplastic-like isoform X1 [Glycine max]
            gi|571476832|ref|XP_006587086.1| PREDICTED: protein
            CHUP1, chloroplastic-like isoform X2 [Glycine max]
          Length = 583

 Score =  334 bits (856), Expect = 8e-89
 Identities = 221/554 (39%), Positives = 299/554 (53%), Gaps = 28/554 (5%)
 Frame = -3

Query: 1580 MKQEIPAKPSENKVMQSTHLQTT------------PSRFRSTAT-KVVKESPRSELVNGV 1440
            MKQE PA P  + + Q +   TT            PSR R  +  +   ++P   +VN V
Sbjct: 1    MKQESPASPLPSSLPQPSPTMTTTKNVIKLQNSLTPSRLRLPSKYREPPKTPPEVVVNNV 60

Query: 1439 SPGLKSRPVIMPIPTESSPSINSQQKVRRSLLGLNSKPK---LATN------EDVKTVSR 1287
                 SR          +P +    +++R L+   +KP    + T       E+ K V+R
Sbjct: 61   VVSTPSRRA-----KSVTPELKHNSRIKRGLVLNKAKPNEEVVGTTQRGREAEETKVVAR 115

Query: 1286 FGNRSLNEQFLRPRRNVDSVVCDKNGDESEVKVK-ELQEKLAVSENLVQNLQDEVLDLRN 1110
            F    + EQF RPR         ++ ++S+ K K EL EKL  SE+L++NLQ EV  L+ 
Sbjct: 116  FVRPHVVEQFARPRNGAGDFAFKRDKEDSDEKSKKELMEKLEASESLIKNLQSEVQALKA 175

Query: 1109 QLVKXXXXXXXXXXXNRVYEEKLSQAKNSARSNHDQMEASTTEGLQRPRLEDVQEFTGYK 930
            +L K           NR   E L+ A+    S     +    E  Q P+ + +Q+    K
Sbjct: 176  ELEKVKGLKVELESHNRKLTEDLAAAEVKVVSLGGNEKEPNGEH-QSPKFKHIQKLIADK 234

Query: 929  LGILQGRKDVAIDRXXXXXXXXXXXXXXXXXSAADTQXXXXXXXXXXXXXXXXXXXXXXX 750
            L     +K+   +                   A   +                       
Sbjct: 235  LERSIVKKEAIANGGFVEASIPPPTAIPAIPDAPTARKGRKPTPNSCLPPPPPPMPPSIP 294

Query: 749  K-----AVTPRKTSSIVQFYHSLTRQEGKNETSRGTNRNYPVPANAHNSIVGEIQNRSSH 585
                  A   ++  + V+ +H+L  QEG   T+    +  PV  N H+SIVGEIQNRS+H
Sbjct: 295  SRPIAKASNTQRVPAFVKLFHTLKNQEGMKSTTGTVKQQKPVSVNVHSSIVGEIQNRSAH 354

Query: 584  LLAIRRDIETKGELVNSLIDKVQTSAYSSIDDVLKFVVWLDGELASLADERAVLKHFNWP 405
            LLAIR DIETKG  +N LI KV  +AY+ I+DVL FV WLDGEL+SLADERAVLKHFNWP
Sbjct: 355  LLAIRADIETKGAFINDLIKKVVEAAYTDIEDVLNFVNWLDGELSSLADERAVLKHFNWP 414

Query: 404  ERKADAMREAAIEYRDLKRLESDVSSYKDEPFMPYETALKKMANLLDKSERSIQRLITLR 225
            ERKADAMREAA+EYR+LK LE ++SS+KD+P +P   +L+KMA+LLDKSE SIQRLI L+
Sbjct: 415  ERKADAMREAAVEYRELKLLEQEISSFKDDPEIPCGASLRKMASLLDKSESSIQRLIKLQ 474

Query: 224  TSTMVSYKDCKIPTDWMLDSGMVSKIKQASMKLARMYMKRVCIXXXXXXXXXXXSTQEAL 45
             S M SY++ KIPT WMLDSG+++KIKQASM L +MYMKRV +           S+QE+L
Sbjct: 475  NSAMRSYQEYKIPTAWMLDSGIMTKIKQASMILVKMYMKRVTM-ELGSARNSDRSSQESL 533

Query: 44   VLQGVRFAFRAHQF 3
            +LQG+ FA+RAHQF
Sbjct: 534  LLQGMHFAYRAHQF 547


>ref|NP_564524.1| hydroxyproline-rich glycoprotein-like protein [Arabidopsis thaliana]
            gi|8778962|gb|AAD49768.2|AC007932_16 F11A17.16
            [Arabidopsis thaliana] gi|332194150|gb|AEE32271.1|
            hydroxyproline-rich glycoprotein-like protein
            [Arabidopsis thaliana]
          Length = 558

 Score =  322 bits (825), Expect = 3e-85
 Identities = 209/531 (39%), Positives = 300/531 (56%), Gaps = 26/531 (4%)
 Frame = -3

Query: 1517 TTPSRFRSTATKV-VKESPRSELVNGVSPGLKSRPVIMPIPTESSPSINSQQKVRRSLLG 1341
            TTPSR R+  +   V   PR++  NG++ G          P  S   + +    RRS+L 
Sbjct: 9    TTPSRVRAANSHYSVISKPRAQDDNGLTGGK---------PKSSGYDVKNDPAKRRSILL 59

Query: 1340 LNSKPKLATNEDVKTVSRFGNRSLN-----EQFLRPRRNV-----DSVVCDKNG-DESEV 1194
              +K   +  E++  ++    RS+N     EQF  PRR +     ++V+      DE   
Sbjct: 60   KRAK---SAEEEMAVLAPQRARSVNRPAVVEQFGCPRRPISRKSEETVMATAAAEDEKRK 116

Query: 1193 KVKELQEKLAVSENLVQNLQDEVLDLRNQLVKXXXXXXXXXXXNRVYEEKL--SQAKNSA 1020
            +++EL+EKL V+E+L+++LQ +VL+L+ +L +           NR   + L  ++AK S+
Sbjct: 117  RMEELEEKLVVNESLIKDLQLQVLNLKTELEEARNSNVELELNNRKLSQDLVSAEAKISS 176

Query: 1019 RSNHDQMEASTTEGLQRPRLEDVQEFTGYKLGILQGRKDVAID------------RXXXX 876
             S++D+      +  Q  R +D+Q     KL   + +K+VA++            R    
Sbjct: 177  LSSNDK----PAKEHQNSRFKDIQRLIASKLEQPKVKKEVAVESSRLSPPSPSPSRLPPT 232

Query: 875  XXXXXXXXXXXXXSAADTQXXXXXXXXXXXXXXXXXXXXXXXKAVTPRKTSSIVQFYHSL 696
                              +                       KA   +K+  + Q +  L
Sbjct: 233  PPLPKFLVSPASSLGKRDENSSPFAPPTPPPPPPPPPPRPLAKAARAQKSPPVSQLFQLL 292

Query: 695  TRQEGKNETSRGTNRNYPVPANAHNSIVGEIQNRSSHLLAIRRDIETKGELVNSLIDKVQ 516
             +Q+     S+  N N     +AHNSIVGEIQNRS+HL+AI+ DIETKGE +N LI KV 
Sbjct: 293  NKQDNSRNLSQSVNGNKSQVNSAHNSIVGEIQNRSAHLIAIKADIETKGEFINDLIQKVL 352

Query: 515  TSAYSSIDDVLKFVVWLDGELASLADERAVLKHFNWPERKADAMREAAIEYRDLKRLESD 336
            T+ +S ++DV+KFV WLD ELA+LADERAVLKHF WPE+KAD ++EAA+EYR+LK+LE +
Sbjct: 353  TTCFSDMEDVMKFVDWLDKELATLADERAVLKHFKWPEKKADTLQEAAVEYRELKKLEKE 412

Query: 335  VSSYKDEPFMPYETALKKMANLLDKSERSIQRLITLRTSTMVSYKDCKIPTDWMLDSGMV 156
            +SSY D+P + Y  ALKKMANLLDKSE+ I+RL+ LR S+M SY+D KIP +WMLDSGM+
Sbjct: 413  LSSYSDDPNIHYGVALKKMANLLDKSEQRIRRLVRLRGSSMRSYQDFKIPVEWMLDSGMI 472

Query: 155  SKIKQASMKLARMYMKRVCIXXXXXXXXXXXSTQEALVLQGVRFAFRAHQF 3
             KIK+AS+KLA+ YM RV             ST+EAL+LQGVRFA+R HQF
Sbjct: 473  CKIKRASIKLAKTYMNRVANELQSARNLDRESTKEALLLQGVRFAYRTHQF 523


>ref|XP_003609889.1| Protein CHUP1 [Medicago truncatula] gi|355510944|gb|AES92086.1|
            Protein CHUP1 [Medicago truncatula]
          Length = 574

 Score =  320 bits (821), Expect = 1e-84
 Identities = 215/540 (39%), Positives = 298/540 (55%), Gaps = 14/540 (2%)
 Frame = -3

Query: 1580 MKQEIPAKPSENKVMQSTHLQTTP-SRFRSTATKVVKESPRS--ELVNGVSPGLKSRPVI 1410
            +K     + S+NK      LQT P +R R  A+   KESP++  E+VN VS    +R   
Sbjct: 18   LKHHQQQQHSDNK-----SLQTVPQTRLRVRASSKAKESPKTPPEIVNRVSTISSTRAKS 72

Query: 1409 MPIPTESSPSINSQQKVRRSLLGLNSKPKLATNEDVKTV---SRFGNRSLNEQFLRPRRN 1239
            +P      P + +  K +RS+    +K   +  E+V++    S+ G  +       PRR 
Sbjct: 73   VP------PDMKNNSKAKRSIF--MNKVVKSIEEEVESSHKGSKEGEVAKVVVVAPPRRR 124

Query: 1238 VDSVVCDKNGDESEVKVK-ELQEKLAVSENLVQNLQDEVLDLRNQLVKXXXXXXXXXXXN 1062
                      D+ +VK K EL EKL VSENL+++LQ E+  L+++L +           N
Sbjct: 125  ------RIEEDDPDVKEKKELLEKLEVSENLIKSLQSEIKALKDELNQVKGLNIDLESQN 178

Query: 1061 RVYEEKLSQAKNSARSNHDQMEASTTEGL---QRPRLEDVQEFTGYKLGILQGRKD---- 903
                + L+ A+    +          E +   Q P+ +D+Q+    KL + + +K+    
Sbjct: 179  IKLNQNLASAEAKIVAFGTSSSTRKKEPIGERQSPKFKDIQKIIADKLEMSKVKKEANPE 238

Query: 902  VAIDRXXXXXXXXXXXXXXXXXSAADTQXXXXXXXXXXXXXXXXXXXXXXXKAVTPRKTS 723
            V   +                 S                            K    +K  
Sbjct: 239  VIFVKSSIPAPIPNHAAIREITSLGRKSPPNHCLMPPPPPPPPPIPSRPLAKLANTQKAP 298

Query: 722  SIVQFYHSLTRQEGKNETSRGTNRNYPVPANAHNSIVGEIQNRSSHLLAIRRDIETKGEL 543
            ++VQ +HSL  Q+ K +     N   P+  +AHNSIVGEIQNRS+HLLAIR DI+TKGE 
Sbjct: 299  AVVQLFHSLKNQDTKKDLKGSINHQKPITNSAHNSIVGEIQNRSAHLLAIREDIQTKGEF 358

Query: 542  VNSLIDKVQTSAYSSIDDVLKFVVWLDGELASLADERAVLKHFNWPERKADAMREAAIEY 363
            +N LI+KV  ++Y  I+DVLKFV WLDGEL++LADERAVLKHF WPERKAD MREAA+EY
Sbjct: 359  INGLINKVVDASYVDIEDVLKFVDWLDGELSTLADERAVLKHFKWPERKADTMREAAVEY 418

Query: 362  RDLKRLESDVSSYKDEPFMPYETALKKMANLLDKSERSIQRLITLRTSTMVSYKDCKIPT 183
            R+LK LE ++SSYKD+P +P   +LKK+A+LLDKSERSIQ+LI LR S + SY+   IPT
Sbjct: 419  RELKMLEQEISSYKDDPDIPCVASLKKIASLLDKSERSIQKLIVLRNSVIRSYQMYNIPT 478

Query: 182  DWMLDSGMVSKIKQASMKLARMYMKRVCIXXXXXXXXXXXSTQEALVLQGVRFAFRAHQF 3
             WMLDSG+ SKIKQ+SM L +MYMKR+ +           S Q++L+LQGV FA+RAHQF
Sbjct: 479  AWMLDSGISSKIKQSSMTLVKMYMKRLTMELESIRNSDRESNQDSLLLQGVHFAYRAHQF 538


>gb|ADN34042.1| hydroxyproline-rich glycoprotein family protein [Cucumis melo subsp.
            melo]
          Length = 486

 Score =  308 bits (788), Expect = 6e-81
 Identities = 202/468 (43%), Positives = 271/468 (57%), Gaps = 5/468 (1%)
 Frame = -3

Query: 1394 ESSPSINSQQ---KVRRSLLGLNSKPKLATNEDVKTVSRFGNRSLNEQFLRPRRNVDSVV 1224
            ES+P    ++   +V RSL   N+  K    E+V   +R  NR   +Q    R    +  
Sbjct: 41   ESTPQSGVKKQSSRVSRSLTP-NAPKKGRDGENVGVSARTVNRGGLKQVSHRRSLSVAGS 99

Query: 1223 CDKNGDESEVKVKELQEKLAVSENLVQNLQDEVLDLRNQLVKXXXXXXXXXXXNRVYEEK 1044
            C    D + VK   LQEKL  +E+L+++LQ ++++L+ +L K           N +    
Sbjct: 100  CVNVEDCNGVK-SGLQEKLYFAEDLIKDLQSQLVELKEELRKSQSLNLELQSQNDLLVRD 158

Query: 1043 LS--QAKNSARSNHDQMEASTTEGLQRPRLEDVQEFTGYKLGILQGRKDVAIDRXXXXXX 870
            L+  +AK ++ SN+D+ + S +E  QR R ED Q+    KL                   
Sbjct: 159  LAAAEAKFASASNNDKRK-SVSEESQR-RTEDNQKLENGKLETQPSSS------------ 204

Query: 869  XXXXXXXXXXXSAADTQXXXXXXXXXXXXXXXXXXXXXXXKAVTPRKTSSIVQFYHSLTR 690
                       +  D                         +A   +K+  +V+ +HSL +
Sbjct: 205  ---------CRNVRDLDCKAPPPRAAPPPPPPPLPVQSMPRAAATQKSPDLVRLFHSLRK 255

Query: 689  QEGKNETSRGTNRNYPVPANAHNSIVGEIQNRSSHLLAIRRDIETKGELVNSLIDKVQTS 510
            +EGK +         P   NAHNSIVGEIQNRS+HLLAI+ DIETKGE +N LIDKV  +
Sbjct: 256  KEGKRDPPL---LGKPAAINAHNSIVGEIQNRSAHLLAIKADIETKGEFINGLIDKVLVA 312

Query: 509  AYSSIDDVLKFVVWLDGELASLADERAVLKHFNWPERKADAMREAAIEYRDLKRLESDVS 330
            A++ I+D+LKFV WLD +L+SLADERAVLKHF WPE+KADAMREAAIEYR LK LE+++S
Sbjct: 313  AHTDIEDILKFVDWLDSQLSSLADERAVLKHFKWPEKKADAMREAAIEYRALKLLENEIS 372

Query: 329  SYKDEPFMPYETALKKMANLLDKSERSIQRLITLRTSTMVSYKDCKIPTDWMLDSGMVSK 150
             YKD+   P E ALKKMA+LLDKSER IQRLITLR++ M SY+D K+PT+WMLDSG++SK
Sbjct: 373  FYKDDTNSPCEAALKKMASLLDKSERGIQRLITLRSTVMHSYQDLKLPTNWMLDSGIMSK 432

Query: 149  IKQASMKLARMYMKRVCIXXXXXXXXXXXSTQEALVLQGVRFAFRAHQ 6
            IKQASM LA+MYMKRV             S  E+L+LQG+ FA+R HQ
Sbjct: 433  IKQASMNLAKMYMKRVKTELDSVRSSDKESNHESLLLQGIHFAYRTHQ 480


>ref|XP_007036013.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 1
            [Theobroma cacao] gi|508715042|gb|EOY06939.1|
            Hydroxyproline-rich glycoprotein family protein, putative
            isoform 1 [Theobroma cacao]
          Length = 564

 Score =  307 bits (786), Expect = 1e-80
 Identities = 218/550 (39%), Positives = 298/550 (54%), Gaps = 17/550 (3%)
 Frame = -3

Query: 1601 EPKLV*AMKQEIPA--KPSENKVMQSTHLQ-TTPSRFRSTATKVVKESPRSELVNGVSPG 1431
            +P L    + E P   KP+  K+   +HLQ TTPSR R  + K +  S ++E        
Sbjct: 3    DPSLTSMKQHETPTTLKPAACKLTPMSHLQSTTPSRCRVNS-KPINHSAKAEAR------ 55

Query: 1430 LKSRPVIMPIPTESSPSINSQQKVRRSLLGLNSKPKLATNEDVKTVSRFGNRSLNEQFLR 1251
                      P  ++P +    K     L LN KPK  + +  + V       + +QF R
Sbjct: 56   ----------PETATPHVKDSTKNSSKSLLLN-KPK--SGDQPQVVGSHHKGRVVDQFAR 102

Query: 1250 PRRNVDSVVCDKNGDESEVK---VKELQEKLAVSENLVQNLQDEVLDLRNQLVKXXXXXX 1080
            PRR   ++        S ++   + EL+EKL+ SE LV++L+ +VL L+ +L        
Sbjct: 103  PRRLNANLTKKSEESRSAIEKNNIDELREKLSCSEALVKDLRTQVLGLKAELDGARSLNM 162

Query: 1079 XXXXXNRVYEEKL--SQAKNSARSNHD--QMEASTTEGLQRPRLEDVQEFTGYKLGILQG 912
                 NR   E L  ++AK +A ++ D  Q++  +    Q  + +D+QEF   KL   + 
Sbjct: 163  ELESLNRKLNEDLVAAEAKIAALASRDKVQLQRESNGDDQSFKFKDIQEFIANKLEHPKI 222

Query: 911  RKDVAIDRXXXXXXXXXXXXXXXXXSAADTQXXXXXXXXXXXXXXXXXXXXXXXKAVTPR 732
             ++   +                  + A+                            TP+
Sbjct: 223  TREAIKEIRTVQTPLPQPASLTTKLAGAEP----CAKAVSSPPPPPPPPRPPAKTTTTPK 278

Query: 731  KTSSIV---QFYHSL--TRQEGKN--ETSRGTNRNYPVPANAHNSIVGEIQNRSSHLLAI 573
              SS+V   Q Y+SL  TRQE K     +   N N P   +AH+SIVGEIQNRS+HLLAI
Sbjct: 279  ADSSVVVPLQCYNSLSLTRQERKKYPPAAAAWNHNKPTVISAHSSIVGEIQNRSAHLLAI 338

Query: 572  RRDIETKGELVNSLIDKVQTSAYSSIDDVLKFVVWLDGELASLADERAVLKHFNWPERKA 393
            + D+ETKGE +NSLI KV  +A++ I+DVLKFV WLD EL+SLADERAVLKHF WPERKA
Sbjct: 339  KADVETKGEFINSLIHKVLAAAHTDIEDVLKFVDWLDSELSSLADERAVLKHFKWPERKA 398

Query: 392  DAMREAAIEYRDLKRLESDVSSYKDEPFMPYETALKKMANLLDKSERSIQRLITLRTSTM 213
            DAMREAAIEYRDLK LE+++SSY+D+  +P   ALK++A LLDKSE+S+QRLI LR   M
Sbjct: 399  DAMREAAIEYRDLKLLENEISSYEDDTSIPCGAALKRLAGLLDKSEKSMQRLIKLRNLVM 458

Query: 212  VSYKDCKIPTDWMLDSGMVSKIKQASMKLARMYMKRVCIXXXXXXXXXXXSTQEALVLQG 33
             SY++ KIP DWMLDSG+  KIKQ SMKLA +Y+KRV             S Q AL+LQ 
Sbjct: 459  HSYQEYKIPIDWMLDSGITCKIKQGSMKLATLYIKRVATELQLVRSLDKESAQGALLLQV 518

Query: 32   VRFAFRAHQF 3
            + FA +  QF
Sbjct: 519  MHFAHKVQQF 528


>ref|XP_007036016.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 4
            [Theobroma cacao] gi|508715045|gb|EOY06942.1|
            Hydroxyproline-rich glycoprotein family protein, putative
            isoform 4 [Theobroma cacao]
          Length = 565

 Score =  305 bits (780), Expect = 5e-80
 Identities = 217/549 (39%), Positives = 297/549 (54%), Gaps = 17/549 (3%)
 Frame = -3

Query: 1601 EPKLV*AMKQEIPA--KPSENKVMQSTHLQ-TTPSRFRSTATKVVKESPRSELVNGVSPG 1431
            +P L    + E P   KP+  K+   +HLQ TTPSR R  + K +  S ++E        
Sbjct: 3    DPSLTSMKQHETPTTLKPAACKLTPMSHLQSTTPSRCRVNS-KPINHSAKAEAR------ 55

Query: 1430 LKSRPVIMPIPTESSPSINSQQKVRRSLLGLNSKPKLATNEDVKTVSRFGNRSLNEQFLR 1251
                      P  ++P +    K     L LN KPK  + +  + V       + +QF R
Sbjct: 56   ----------PETATPHVKDSTKNSSKSLLLN-KPK--SGDQPQVVGSHHKGRVVDQFAR 102

Query: 1250 PRRNVDSVVCDKNGDESEVK---VKELQEKLAVSENLVQNLQDEVLDLRNQLVKXXXXXX 1080
            PRR   ++        S ++   + EL+EKL+ SE LV++L+ +VL L+ +L        
Sbjct: 103  PRRLNANLTKKSEESRSAIEKNNIDELREKLSCSEALVKDLRTQVLGLKAELDGARSLNM 162

Query: 1079 XXXXXNRVYEEKL--SQAKNSARSNHD--QMEASTTEGLQRPRLEDVQEFTGYKLGILQG 912
                 NR   E L  ++AK +A ++ D  Q++  +    Q  + +D+QEF   KL   + 
Sbjct: 163  ELESLNRKLNEDLVAAEAKIAALASRDKVQLQRESNGDDQSFKFKDIQEFIANKLEHPKI 222

Query: 911  RKDVAIDRXXXXXXXXXXXXXXXXXSAADTQXXXXXXXXXXXXXXXXXXXXXXXKAVTPR 732
             ++   +                  + A+                            TP+
Sbjct: 223  TREAIKEIRTVQTPLPQPASLTTKLAGAEP----CAKAVSSPPPPPPPPRPPAKTTTTPK 278

Query: 731  KTSSIV---QFYHSL--TRQEGKN--ETSRGTNRNYPVPANAHNSIVGEIQNRSSHLLAI 573
              SS+V   Q Y+SL  TRQE K     +   N N P   +AH+SIVGEIQNRS+HLLAI
Sbjct: 279  ADSSVVVPLQCYNSLSLTRQERKKYPPAAAAWNHNKPTVISAHSSIVGEIQNRSAHLLAI 338

Query: 572  RRDIETKGELVNSLIDKVQTSAYSSIDDVLKFVVWLDGELASLADERAVLKHFNWPERKA 393
            + D+ETKGE +NSLI KV  +A++ I+DVLKFV WLD EL+SLADERAVLKHF WPERKA
Sbjct: 339  KADVETKGEFINSLIHKVLAAAHTDIEDVLKFVDWLDSELSSLADERAVLKHFKWPERKA 398

Query: 392  DAMREAAIEYRDLKRLESDVSSYKDEPFMPYETALKKMANLLDKSERSIQRLITLRTSTM 213
            DAMREAAIEYRDLK LE+++SSY+D+  +P   ALK++A LLDKSE+S+QRLI LR   M
Sbjct: 399  DAMREAAIEYRDLKLLENEISSYEDDTSIPCGAALKRLAGLLDKSEKSMQRLIKLRNLVM 458

Query: 212  VSYKDCKIPTDWMLDSGMVSKIKQASMKLARMYMKRVCIXXXXXXXXXXXSTQEALVLQG 33
             SY++ KIP DWMLDSG+  KIKQ SMKLA +Y+KRV             S Q AL+LQ 
Sbjct: 459  HSYQEYKIPIDWMLDSGITCKIKQGSMKLATLYIKRVATELQLVRSLDKESAQGALLLQV 518

Query: 32   VRFAFRAHQ 6
            + FA +  Q
Sbjct: 519  MHFAHKVQQ 527


>ref|XP_006393445.1| hypothetical protein EUTSA_v10012201mg [Eutrema salsugineum]
            gi|557090023|gb|ESQ30731.1| hypothetical protein
            EUTSA_v10012201mg [Eutrema salsugineum]
          Length = 554

 Score =  300 bits (767), Expect = 2e-78
 Identities = 148/248 (59%), Positives = 189/248 (76%)
 Frame = -3

Query: 746  AVTPRKTSSIVQFYHSLTRQEGKNETSRGTNRNYPVPANAHNSIVGEIQNRSSHLLAIRR 567
            A   +K+  + Q YH L +Q+   + S   N N P   +AHNSIVGEIQNRS+HL+AI+ 
Sbjct: 272  AARAQKSPPVSQLYHLLKKQDNSRDLSPSVNGNKPQVNSAHNSIVGEIQNRSAHLIAIKA 331

Query: 566  DIETKGELVNSLIDKVQTSAYSSIDDVLKFVVWLDGELASLADERAVLKHFNWPERKADA 387
            DIETKG+ +N LI KV T+ +S ++DV++FV WLD ELA+LADERAVLKHF WPERKADA
Sbjct: 332  DIETKGDFINDLIQKVLTTCFSDMEDVMRFVDWLDSELATLADERAVLKHFKWPERKADA 391

Query: 386  MREAAIEYRDLKRLESDVSSYKDEPFMPYETALKKMANLLDKSERSIQRLITLRTSTMVS 207
            ++EAA+EYR+LK+LE ++SSY D+P + Y  ALKKM NLLDKSE+ I+RL+ LR S+M S
Sbjct: 392  LQEAAVEYRELKKLEKELSSYSDDPSIHYGVALKKMVNLLDKSEQRIRRLVRLRASSMRS 451

Query: 206  YKDCKIPTDWMLDSGMVSKIKQASMKLARMYMKRVCIXXXXXXXXXXXSTQEALVLQGVR 27
            Y+D KIP +WMLDSGM+SKIK+AS+KLA++YM RV             STQEAL+LQGVR
Sbjct: 452  YQDFKIPVEWMLDSGMISKIKRASIKLAKLYMNRVANELESVRNLDRESTQEALLLQGVR 511

Query: 26   FAFRAHQF 3
            FA+R HQF
Sbjct: 512  FAYRTHQF 519


>ref|XP_006840355.1| hypothetical protein AMTR_s00045p00113980 [Amborella trichopoda]
            gi|548842073|gb|ERN02030.1| hypothetical protein
            AMTR_s00045p00113980 [Amborella trichopoda]
          Length = 568

 Score =  300 bits (767), Expect = 2e-78
 Identities = 209/558 (37%), Positives = 287/558 (51%), Gaps = 32/558 (5%)
 Frame = -3

Query: 1580 MKQEIPAKPSENKVMQSTHLQTTPSRFRSTATKVVKESPRSELVNGVSPGLKSRPVIMPI 1401
            MKQEI A  S   +  S     T +R R   T+  ++S     +NGVS G K RP     
Sbjct: 1    MKQEI-ATESRASLFHSL----TGNRTRVQKTRDTQKSGAP--INGVSSGQKLRPK---- 49

Query: 1400 PTESSPSINSQQKVRRSLLGLNSKPKLATNEDVKTVSRFGNRSLNE--QFLRPRRNVDSV 1227
            P  S P  +++ +  +  L   S  ++  ++  + + R   + +    +  RPR +    
Sbjct: 50   PVVSEPDSSAKTRKNQPKLKPFSGEEIEAHK-AREMGRMRQQPVESYARLRRPRGHELKK 108

Query: 1226 VCDKNGDESEVKVK-ELQEKLAVSENLVQNLQDEVLDLRNQLVKXXXXXXXXXXXNRVYE 1050
            V D + ++  +  K ELQ KL +SE LV +L  EV +LR Q+             NR   
Sbjct: 109  VVDSDEEKKGLDEKGELQRKLDLSEGLVNDLHSEVAELRAQVESLQSLNQKLELQNRKVA 168

Query: 1049 EKLSQAKNSARSN-----------------------------HDQMEASTTEGLQRPRLE 957
             +L+ A+    S                               DQ   S +E +QR   +
Sbjct: 169  VELAAAEAKLNSRILSANQSLDRENGFKKKSMIESVVGEIKASDQEMESPSEEVQRAEFK 228

Query: 956  DVQEFTGYKLGILQGRKDVAIDRXXXXXXXXXXXXXXXXXSAADTQXXXXXXXXXXXXXX 777
            D+++    K+    G K +AI +                      Q              
Sbjct: 229  DIRKLIASKMEQQLGPKPMAIKQVPTPK-------------TVQIQPKPPPICPPPPPPP 275

Query: 776  XXXXXXXXXKAVTPRKTSSIVQFYHSLTRQEGKNETSRGTNRNYPVPANAHNSIVGEIQN 597
                     K    +K   +V+FYH LT++EGK +       + P   +AH+SI+GEIQN
Sbjct: 276  PPPTQALSKKGAAMKKAPDLVEFYHLLTKREGKKDGLGSGTSSSPGVMSAHSSIIGEIQN 335

Query: 596  RSSHLLAIRRDIETKGELVNSLIDKVQTSAYSSIDDVLKFVVWLDGELASLADERAVLKH 417
            RSSH+LA+R D+E KGE +  +I K++  A++ +++VL FV WLD EL+SL+DERAVLKH
Sbjct: 336  RSSHMLAVRADVEKKGEFIKFVIKKIREMAFADMEEVLAFVDWLDTELSSLSDERAVLKH 395

Query: 416  FNWPERKADAMREAAIEYRDLKRLESDVSSYKDEPFMPYETALKKMANLLDKSERSIQRL 237
            F+WPERKADAMREAA EYRDLKRLE +VSSY+D+  +P ETALKKMA LLDKSE+ I RL
Sbjct: 396  FDWPERKADAMREAAFEYRDLKRLELEVSSYEDDLCLPCETALKKMATLLDKSEQRIPRL 455

Query: 236  ITLRTSTMVSYKDCKIPTDWMLDSGMVSKIKQASMKLARMYMKRVCIXXXXXXXXXXXST 57
              LR   M  Y+DCKIPT WM DSGMV KIK AS+KLA+  M R+ +           S 
Sbjct: 456  AKLRDLVMPCYRDCKIPTAWMCDSGMVDKIKLASVKLAKKCMNRLSMELELVKHSERESA 515

Query: 56   QEALVLQGVRFAFRAHQF 3
             E L+LQGVRFA+RAHQF
Sbjct: 516  HEGLLLQGVRFAYRAHQF 533


Top