BLASTX nr result

ID: Rheum21_contig00008109 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rheum21_contig00008109
         (1639 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI21267.3| unnamed protein product [Vitis vinifera]              366   2e-98
gb|EMJ16124.1| hypothetical protein PRUPE_ppa001243mg [Prunus pe...   347   7e-93
gb|EOY02964.1| Nuclear factor kappa-B-binding protein, putative ...   344   7e-92
ref|XP_002512826.1| hypothetical protein RCOM_1445020 [Ricinus c...   340   1e-90
gb|EXB41410.1| hypothetical protein L484_007560 [Morus notabilis]     326   2e-86
ref|XP_006386860.1| hypothetical protein POPTR_0002s23880g [Popu...   323   1e-85
ref|XP_002320420.1| hypothetical protein POPTR_0014s14110g [Popu...   311   4e-82
ref|XP_006468901.1| PREDICTED: uncharacterized protein LOC102625...   307   8e-81
dbj|BAB11123.1| unnamed protein product [Arabidopsis thaliana]        305   3e-80
ref|XP_006287010.1| hypothetical protein CARUB_v10000158mg [Caps...   305   5e-80
ref|NP_196899.2| uncharacterized protein [Arabidopsis thaliana] ...   305   5e-80
dbj|BAC41867.1| unknown protein [Arabidopsis thaliana] gi|290290...   303   1e-79
ref|XP_004516003.1| PREDICTED: uncharacterized protein LOC101502...   302   3e-79
ref|XP_002873641.1| hypothetical protein ARALYDRAFT_488220 [Arab...   302   3e-79
ref|NP_001190306.1| uncharacterized protein [Arabidopsis thalian...   300   2e-78
ref|XP_004306320.1| PREDICTED: uncharacterized protein LOC101311...   298   3e-78
gb|ESW23708.1| hypothetical protein PHAVU_004G069600g [Phaseolus...   297   8e-78
ref|XP_006574737.1| PREDICTED: intracellular protein transport p...   297   1e-77
ref|XP_006574736.1| PREDICTED: intracellular protein transport p...   297   1e-77
ref|XP_006574734.1| PREDICTED: intracellular protein transport p...   297   1e-77

>emb|CBI21267.3| unnamed protein product [Vitis vinifera]
          Length = 716

 Score =  366 bits (939), Expect = 2e-98
 Identities = 199/441 (45%), Positives = 283/441 (64%), Gaps = 12/441 (2%)
 Frame = -3

Query: 1502 QRKKRLSSADVFGYSSREQYRAKKKHLSSSQFDLNWKLNSHTPFKWDDNEKRAVAKGEQV 1323
            Q+KKRLS+A + G SS +  RAK+K L S+Q  LN  + SH    WDDN+KR VAK EQ+
Sbjct: 3    QQKKRLSAASIVGCSSHQPSRAKRKSLGSTQCGLN--MRSHISLNWDDNKKRVVAKREQI 60

Query: 1322 GLSKRELRPFKSTAPHHNNNVADIIAIPNXXXXXXXXXXXLSHEVWLSILTEKERSQLIQ 1143
             +S R+L PF ++ PH  N +ADI AIP            LS EVW + L+EKER  L Q
Sbjct: 61   AISWRDLSPFINSVPHCPNILADIWAIPPEIFELKGLTEVLSFEVWQTHLSEKERDLLTQ 120

Query: 1142 FLPSGVNPDEVVQDLLAGENLHFGNPFLKWSSALCSGEHHPESVCHYDETFRANKNAYYS 963
            FLPSG++  +VVQ LLAG+N HFGNPFLKW ++LCSG+ HP++V   ++  + NK AYY 
Sbjct: 121  FLPSGLDGQQVVQALLAGDNFHFGNPFLKWGASLCSGDLHPDAVLSKEQCLKTNKKAYYL 180

Query: 962  ELQKYHDNMIRHLQKLKDRCANANVPEAEIEQMVWRIREQPG--------EEVRYLNREL 807
            ELQKYH++ I +LQK K+R A    PE EI Q +WR ++           E +   +   
Sbjct: 181  ELQKYHNDNIANLQKWKERWAICKDPEKEIVQNIWRSKKHADESGFHDSEENLAATSESC 240

Query: 806  SVAADEKAWSSDNQSPLKAMGSEMPRSKDTRKDE----DGKTDVMXXXXXXXXXXXXXXA 639
            S AADEKA SSDNQ+  +  G E+ + KD  KD+       ++ +               
Sbjct: 241  SWAADEKACSSDNQNSSRKDG-ELQKGKDLMKDKCKSPVAASNGLKVVTRTRKRVKFSKL 299

Query: 638  NVNFGDGSKYMSYVKISKSQHELVRSMKQSGNSIQSKALNRVIGNLNSFNVKPYVMFEEE 459
            N+++GDG+KYMSY+KISK QH+LV+SMKQSGNSIQ ++LNRV+G+L+SF+++PY +FEEE
Sbjct: 300  NIHYGDGAKYMSYIKISKKQHQLVKSMKQSGNSIQPRSLNRVLGDLDSFHIRPYEVFEEE 359

Query: 458  EQKRIEEYWSNLSRKDIPSAFSKWGNIRSERGEMMKSLCLEIEEKLKHLMEEEGMDHSNG 279
            E+++  E+WS L+ +D+P+AF+  G  + +R +M +SL LE+EE+LK L+E++  +  + 
Sbjct: 360  EKRKFHEHWSQLATRDLPAAFANRGKKQLQRRQMTQSLALEMEERLKPLVEDDEKEGPDS 419

Query: 278  AHFYEGGSISSDDTEQTQEDE 216
                E     + D E T +D+
Sbjct: 420  I-LQEQEDNGATDHEPTMDDD 439


>gb|EMJ16124.1| hypothetical protein PRUPE_ppa001243mg [Prunus persica]
          Length = 873

 Score =  347 bits (891), Expect = 7e-93
 Identities = 189/457 (41%), Positives = 279/457 (61%), Gaps = 24/457 (5%)
 Frame = -3

Query: 1514 MSADQRKKRLSSADVFGYSSREQYRAKKKHLSSSQFDLNWKLNSHTPFKWDDNEKRAVAK 1335
            M+ADQR+KRL+ A + G +SREQ++AKKK++   + D +  +NSH   +WD N+K  VAK
Sbjct: 1    MAADQRRKRLNGASIIGCNSREQHKAKKKNMGLLKDDSD--INSHISLEWDGNQKMVVAK 58

Query: 1334 GEQVGLSKRELRPFKSTAPHHNNNVADIIAIPNXXXXXXXXXXXLSHEVWLSILTEKERS 1155
             +Q+G+S R+LRPF  +  + +N +AD+ A+P            LS+EVW + L+E ER 
Sbjct: 59   SDQIGISWRDLRPFIDSTFNSHNILADVFAVPEGIYDLEDLEDVLSYEVWQTHLSENERK 118

Query: 1154 QLIQFLPSGVNPDEVVQDLLAGENLHFGNPFLKWSSALCSGEHHPESVCHYDETFRANKN 975
             LIQFLP G   ++VVQ LL+G+   FGNPFLKW ++LCSG+ HP+++   ++    +K 
Sbjct: 119  HLIQFLPRGPEAEQVVQALLSGDYFDFGNPFLKWGASLCSGDFHPDAILRREQCLNTDKK 178

Query: 974  AYYSELQKYHDNMIRHLQKLKDRCANANVPEAEIEQMVWRIRE-------QPGEEVRYLN 816
            AYY ELQKYH++MI +L KLK+RCA+   PE EI Q +WR R            E R+ +
Sbjct: 179  AYYKELQKYHNDMIAYLLKLKERCASCKDPEKEIVQKIWRSRNDMEKKIYSHANESRFRD 238

Query: 815  RE---------LSVAADEKAWSSDNQSPLKAMGSEMPRS---KDTRKDEDGKTDV----- 687
             E          S  ADEKA SSDNQ      G ++      K   KD+     V     
Sbjct: 239  LEENATVTSESCSWVADEKACSSDNQISSVVKGGKLQNRIYVKGFVKDKGRNVLVTADRA 298

Query: 686  MXXXXXXXXXXXXXXANVNFGDGSKYMSYVKISKSQHELVRSMKQSGNSIQSKALNRVIG 507
            +               N    DG+KYMSYVKISK Q+E+V+SMKQSG SIQS++LNRV+G
Sbjct: 299  VNVGARSKTGDRLHKRNFYSSDGAKYMSYVKISKKQYEIVKSMKQSGKSIQSRSLNRVLG 358

Query: 506  NLNSFNVKPYVMFEEEEQKRIEEYWSNLSRKDIPSAFSKWGNIRSERGEMMKSLCLEIEE 327
            NL+SF+V+PY +F EEEQK++ ++W  L+ KD+P+A++ W  +  +R +M KSL  +++ 
Sbjct: 359  NLDSFDVQPYEVFVEEEQKKLHQHWLQLANKDLPAAYANWKEMHLQRRQMTKSLEKDMKR 418

Query: 326  KLKHLMEEEGMDHSNGAHFYEGGSISSDDTEQTQEDE 216
            +L+ L+E++G D ++ +       I ++D +   ED+
Sbjct: 419  RLESLVEDDGGDENHESLLQGEIDIGAEDHDSPLEDD 455


>gb|EOY02964.1| Nuclear factor kappa-B-binding protein, putative [Theobroma cacao]
          Length = 878

 Score =  344 bits (882), Expect = 7e-92
 Identities = 189/469 (40%), Positives = 282/469 (60%), Gaps = 32/469 (6%)
 Frame = -3

Query: 1514 MSADQRKKRLSSADVFGYSSREQYRAKKKHLSSSQFDLNWKLNSHTPFKWDDNEKRAVAK 1335
            M+ADQR+KRL+ A + G +SR+QYR KK+ L S Q DLN K       +WD N+KR VAK
Sbjct: 1    MAADQRRKRLNGASIAGCNSRDQYRTKKRKLESLQNDLNTKCC--ISLEWDGNKKRVVAK 58

Query: 1334 GEQVGLSKRELRPFKSTAPHHNNNVADIIAIPNXXXXXXXXXXXLSHEVWLSILTEKERS 1155
             EQ+GLS+R LRPF  +APH++  +AD++ +P+           LS+EVW + L+E ER+
Sbjct: 59   REQIGLSRRHLRPFIDSAPHYHRVLADVLTLPHETFDLENLTEVLSYEVWQNHLSENERN 118

Query: 1154 QLIQFLPSGVNPDEVVQDLLAGENLHFGNPFLKWSSALCSGEHHPESVCHYDETFRANKN 975
             L+QFLP+G + ++V+Q LLA EN HFGNPFLKW ++LC G  HP++V   ++  +A K 
Sbjct: 119  LLMQFLPTGTDKEQVLQALLAEENFHFGNPFLKWGASLCLGHLHPDAVIQGEQRLKAEKK 178

Query: 974  AYYSELQKYHDNMIRHLQKLKDRCANANVPEAEIEQMVWRIRE----------------Q 843
            AYYSELQ YHD++I  LQKLK++  +   PE EI Q  WR R                  
Sbjct: 179  AYYSELQDYHDDIIECLQKLKEKWESCQDPEQEIVQKFWRSRRVGEKRVFSNSNESRLGS 238

Query: 842  PGEEVRYLNRELSVAADEKAWSSDNQSPLKAMGSEMPR---SKDTRKDE-----DGKTDV 687
              ++V   +   S  ADEKA SSDNQ+     G E  R    K   K++      G  D 
Sbjct: 239  VEQDVTATSESCSWVADEKACSSDNQNSSVMKGGEQQRRMYEKGFIKEKCRILLTGSGDA 298

Query: 686  MXXXXXXXXXXXXXXANVNFGDGSKYMSYVKISKSQHELVRSMKQSGNSIQSKALNRVIG 507
            +               N+   DG+KYMS  KISK QHEL+++MKQSG SIQ+++LNRV+G
Sbjct: 299  LTAEERPKKGDKLHKRNIQQSDGAKYMSCFKISKKQHELIKNMKQSGRSIQARSLNRVLG 358

Query: 506  NLNSFNVKPYVMFEEEEQKRIEEYWSNLSRKDIPSAFSKWGNIRSERGEMMKSLCLEIEE 327
            +++S +V+PY +F EEEQ+++ E+W  L+++D+P+A++ W  ++ ++ E+ K L  +++E
Sbjct: 359  DIDSLHVQPYEVFMEEEQRKLHEHWLRLAQEDLPAAYANWREVQLQKWEITKLLKHDMKE 418

Query: 326  KLKHLMEEEGMDHSNGAHFYE--GG------SISSDDTEQTQEDEVPAD 204
            KL  ++E++  + +      E  GG       +  +D E+  ED+  A+
Sbjct: 419  KLNPVLEDDEEEDTGKVQDQEDYGGPNLAVLDVEKEDPEEFLEDQKDAE 467


>ref|XP_002512826.1| hypothetical protein RCOM_1445020 [Ricinus communis]
            gi|223547837|gb|EEF49329.1| hypothetical protein
            RCOM_1445020 [Ricinus communis]
          Length = 858

 Score =  340 bits (871), Expect = 1e-90
 Identities = 184/436 (42%), Positives = 265/436 (60%), Gaps = 31/436 (7%)
 Frame = -3

Query: 1514 MSADQRKKRLSSADVFGYSSREQYRAKKKHLSSSQFDLNWKLNSHTPFKWDDNEKRAVAK 1335
            M AD R+KRL+   + G SS EQY+ KKK L S + +LN K  SH   +WD N++R VAK
Sbjct: 3    MVADHRRKRLNGVSIAGCSSWEQYKTKKKKLESPKNELNTK--SHISLEWDGNKRRVVAK 60

Query: 1334 GEQVGLSKRELRPFKSTAPHHNNNVADIIAIPNXXXXXXXXXXXLSHEVWLSILTEKERS 1155
             EQ+GL +++LR F   +P  ++ +AD++AIP            LS+EVW + L+E ER 
Sbjct: 61   REQIGLRQKDLREFVDPSPQCHSFLADVLAIPQEIFEVDNLTEILSYEVWKTHLSESERK 120

Query: 1154 QLIQFLPSGVNPDEVVQDLLAGENLHFGNPFLKWS------------SALCSGEHHPESV 1011
             L+QFLP G + D+VVQ LL G+N HFGNP+LKW             +++CSG+ HP++V
Sbjct: 121  YLMQFLPRGSDGDKVVQALLTGDNFHFGNPYLKWQVLKYDDSITLEGASVCSGKLHPDAV 180

Query: 1010 CHYDETFRANKNAYYSELQKYHDNMIRHLQKLKDRCANANVPEAEIEQMVWRIREQ---- 843
             H ++  +A+K AYYSE+Q YH++MIR+LQKLK+   ++  PE E+ Q +WR R      
Sbjct: 181  VHQEQCIKADKKAYYSEIQNYHNDMIRYLQKLKETWESSKDPEKEVLQKLWRSRRDVDKQ 240

Query: 842  ------------PGEEVRYLNRELSVAADEKAWSSDNQSPLKAMGSEMPR---SKDTRKD 708
                        P E     +   S+ A+EKA SSDNQ+     G E+ R    K   ++
Sbjct: 241  NFSHANESRFHDPEETSAATSESCSLVAEEKACSSDNQNSSITKGGEVQRRIYEKRFIEE 300

Query: 707  EDGKTDVMXXXXXXXXXXXXXXANVNFGDGSKYMSYVKISKSQHELVRSMKQSGNSIQSK 528
            +  K  V                N++  DG KYMSY+KISK QHELV+SMKQSG SIQSK
Sbjct: 301  KRRKPSVSSDDARFKRGEKLQKHNIHHTDGVKYMSYLKISKKQHELVKSMKQSGKSIQSK 360

Query: 527  ALNRVIGNLNSFNVKPYVMFEEEEQKRIEEYWSNLSRKDIPSAFSKWGNIRSERGEMMKS 348
             LNRV+GN ++  V+PY  F +EEQK++ E+W  L+ KD+P+A+  W N + +R E+ KS
Sbjct: 361  CLNRVLGNFDTLQVQPYEKFVKEEQKKLREHWLQLANKDLPAAYENWQNRQFQRCEIAKS 420

Query: 347  LCLEIEEKLKHLMEEE 300
            L  +++++L+ L+E+E
Sbjct: 421  LECDMKDRLESLLEDE 436


>gb|EXB41410.1| hypothetical protein L484_007560 [Morus notabilis]
          Length = 874

 Score =  326 bits (836), Expect = 2e-86
 Identities = 179/443 (40%), Positives = 257/443 (58%), Gaps = 40/443 (9%)
 Frame = -3

Query: 1514 MSADQRKKRLSSADVFGYSSREQYRAKKKHLSSSQFDLNWKLNSHTPFKWDDNEKRAVAK 1335
            M+ADQ +KRL+SA V G+  REQYRAK+K+    Q+D N K  SH   +WD N+KR VA+
Sbjct: 1    MAADQWRKRLNSAGVVGFHGREQYRAKRKNTGLPQYDPNMK--SHISLEWDGNQKRVVAR 58

Query: 1334 GEQVGLSKRELRPFKSTAPHHNNNVADIIAIPNXXXXXXXXXXXLSHEVWLSILTEKERS 1155
             +Q+ +S+R++ PF  ++P  NN +AD+ ++P            LS+EVW + L+E ER+
Sbjct: 59   RDQISISRRDMWPFMRSSPSVNNPIADVFSVPQEIYTLENLNDVLSYEVWETYLSESERN 118

Query: 1154 QLIQFLPSGVNPDEVVQDLLAGENLHFGNPFLKWSSAL----CSGEHHPESVCHYDETFR 987
             L+QFLP G   +EV++ LLAG+N HFG+PFL W   L      G+ HP+++   ++  +
Sbjct: 119  HLMQFLPRGPEAEEVLEALLAGDNFHFGSPFLNWQVLLHDSYTVGDLHPDAIFQKEQCLK 178

Query: 986  ANKNAYYSELQKYHDNMIRHLQKLKDRCANANVPEAEIEQMVWRIREQ------------ 843
              K AY +EL KYH+NMI +L KLK+R  N   PE EI Q +WR R              
Sbjct: 179  TEKKAYNAELHKYHNNMIGYLLKLKERFENCKDPEKEIVQKIWRSRNDTDKRISSSANDS 238

Query: 842  ----PGEEVRYLNRELSVAADEKAWSSDNQSPLKAMGSEMPRSKD--------TRKDEDG 699
                P + +   +   S  ADEKA SSDNQ+     G E+  S +         RK E G
Sbjct: 239  RFCVPEDNIAASSESCSWVADEKACSSDNQNSSMLKGGELQNSGEVILLVATGVRKREKG 298

Query: 698  ------------KTDVMXXXXXXXXXXXXXXANVNFGDGSKYMSYVKISKSQHELVRSMK 555
                          DV+               N+   DG+KYMSY K+SK QH++V++MK
Sbjct: 299  SLKGKSGNPSVVSDDVLNVGLKSRKGDKRHLQNITCSDGAKYMSYFKVSKKQHDIVKNMK 358

Query: 554  QSGNSIQSKALNRVIGNLNSFNVKPYVMFEEEEQKRIEEYWSNLSRKDIPSAFSKWGNIR 375
              G SIQSK+LNRV+GN+ S NV+PY +F +EEQK++ EYW +L+ K +P+A++ W ++ 
Sbjct: 359  --GKSIQSKSLNRVLGNIESINVQPYELFIKEEQKKLREYWIHLANKALPAAYANWRDLH 416

Query: 374  SERGEMMKSLCLEIEEKLKHLME 306
            S+R +M +SL  E+ EKLK   E
Sbjct: 417  SQRQQMRESLEQELNEKLKMTTE 439


>ref|XP_006386860.1| hypothetical protein POPTR_0002s23880g [Populus trichocarpa]
            gi|550345700|gb|ERP64657.1| hypothetical protein
            POPTR_0002s23880g [Populus trichocarpa]
          Length = 890

 Score =  323 bits (829), Expect = 1e-85
 Identities = 196/506 (38%), Positives = 285/506 (56%), Gaps = 29/506 (5%)
 Frame = -3

Query: 1514 MSADQRKKRLSSADVFGYSSREQYRAKKKHLSSSQFDLNWKLNSHTPFKWDDNEKRAVAK 1335
            M+ADQR+ RL+ A + G SS E YR KKK    S+ DLN K  S    +WD N K+ +AK
Sbjct: 1    MAADQRRNRLNGASLEGCSSWEPYRTKKK--KKSKHDLNAK--SLISLEWDGNRKKVIAK 56

Query: 1334 GEQVGLSKRELRPFKSTAPHHNNNVADIIAIPNXXXXXXXXXXXLSHEVWLSILTEKERS 1155
             EQ+G+S+R+LRPF  + P ++N +AD   +P            LS+EVW + L+E ER+
Sbjct: 57   REQIGISQRDLRPFIDSVPQYHNLLADAFPVPREIFELKNLTEVLSNEVWQTHLSENERN 116

Query: 1154 QLIQFLPSGVNPDEVVQDLLAGENLHFGNPFLKWSSALCSGEHHPESVCHYDETFRANKN 975
             L+QFLP+G+   EVV+ LL+G+N  FGNP L+W ++LCSG HHP++V   ++  +A+K 
Sbjct: 117  FLMQFLPTGLGTVEVVEALLSGDNFRFGNPLLRWGASLCSGNHHPDAVLCQEQHLKADKK 176

Query: 974  AYYSELQKYHDNMIRHLQKLKDRCANANVPEAEIEQMVWRIREQP--------------- 840
            AYYS LQ YH++MI +LQKLKD   ++  PE E+ Q +WR                    
Sbjct: 177  AYYSNLQDYHNDMITYLQKLKDAWESSKDPEKEVLQKMWRRSRSDADKRISPCDNESKFH 236

Query: 839  --GEEVRYLNRELSVAADEKAWSSDNQSPLKAMGSEMPR---SKDTRKDEDGKTDVMXXX 675
              GE +   +   S+ A+EKA SSDNQS     G E  +    K + K++  K  V    
Sbjct: 237  DLGENLVVTSESSSLVAEEKASSSDNQSSPATKGGEFQKRIFEKGSMKEKRRKPLVASDH 296

Query: 674  XXXXXXXXXXXANVNFGDGSKYMSYVKISKSQHELVRSMKQSGNSIQSKALNRVIGNLNS 495
                        N+   DG+KYMSY+KISK QH+LV+SMKQSG SIQSK+LN V+G+L++
Sbjct: 297  ATPGKEDKIHKRNIYRSDGAKYMSYLKISKKQHQLVKSMKQSGKSIQSKSLNCVLGDLDT 356

Query: 494  FNVKPYVMFEEEEQKRIEEYWSNLSRKDIPSAFSKWGNIRSERGEMMKSLCLEIEEKLKH 315
             +V+PY  F +EE K++ E+W  L+ KD+P+A++ W   + +R E+ KS+  E++ KLK+
Sbjct: 357  LHVQPYEEFVKEEHKKLLEHWMQLAHKDLPAAYAIWRQRQFQRQEITKSMEQEMKGKLKY 416

Query: 314  ---LMEEEG-----MDHSN-GAHFYEGGSISSDDTEQTQEDEVPADEXXXXXXXXXXXXX 162
                +E++G      D S+ GA+ +E    +S +  Q Q  E+                 
Sbjct: 417  PVEYLEKDGHETVLQDQSDQGANKHE----TSLEDMQEQNHEIMLQGQNDHGTRYQESDN 472

Query: 161  XXXXXXXXXXXDEQSPLQIPSLDVNQ 84
                        +QSP  I SL V Q
Sbjct: 473  SEDGISGSISPQDQSPQHISSLSVGQ 498


>ref|XP_002320420.1| hypothetical protein POPTR_0014s14110g [Populus trichocarpa]
            gi|222861193|gb|EEE98735.1| hypothetical protein
            POPTR_0014s14110g [Populus trichocarpa]
          Length = 912

 Score =  311 bits (798), Expect = 4e-82
 Identities = 175/423 (41%), Positives = 257/423 (60%), Gaps = 20/423 (4%)
 Frame = -3

Query: 1514 MSADQRKKRLSSADVFGYSSREQYRAKKKHLSSSQFDLNWKLNSHTPFKWDDNEKRAVAK 1335
            M+ADQR+KRL+ A + G SSRE YR K+   + S+  LN K  S    +WD N K+ VAK
Sbjct: 1    MAADQRRKRLNGASLAGCSSREPYRMKR---NKSKNGLNAK--SLISLEWDGNRKKVVAK 55

Query: 1334 GEQVGLSKRELRPFKSTAPHHNNNVADIIAIPNXXXXXXXXXXXLSHEVWLSILTEKERS 1155
             EQ+G+S+R+L PF  +  H++N +AD+ A+P            LS+E W + L+E ER+
Sbjct: 56   KEQIGISQRDLMPFVDSVLHYHNPLADVFAVPREIFELQNLAEVLSYETWQNHLSEDERN 115

Query: 1154 QLIQFLPSGVNPDEVVQDLLAGENLHFGNPFLKWSSALCSGEHHPESVCHYDETFRANKN 975
             L QFLP+G+  +EVV+ LLAG+N HFGNP L+W ++LCSG  HP+ V   ++  +A+K 
Sbjct: 116  FLKQFLPTGLGTEEVVEALLAGDNFHFGNPLLRWGASLCSGNLHPDVVLCQEQHLKADKK 175

Query: 974  AYYSELQKYHDNMIRHLQKLKDRCANANVPEAEIEQMVWRIREQ---------------- 843
            A+YS+LQ YH +MI +LQKLKD   ++  PE EI Q +WR                    
Sbjct: 176  AFYSKLQDYHIDMITYLQKLKDTWESSKDPEKEILQKIWRRSRSDADKRISPCDTESKFH 235

Query: 842  -PGEEVRYLNRELSVAADEKAWSSDNQSPLKAMGSEMPR---SKDTRKDEDGKTDVMXXX 675
              GE     +   S+ A+EK  SSD Q+       E+ +    K + K++  K+ +    
Sbjct: 236  GTGENESATSGSCSLVAEEKTSSSDTQNSHVTKSGEVQKRICEKGSMKEKLRKSLLASDD 295

Query: 674  XXXXXXXXXXXANVNFGDGSKYMSYVKISKSQHELVRSMKQSGNSIQSKALNRVIGNLNS 495
                        N++  DG+KYMSY+KISK QH+LV++MKQSG SIQSK+LN V+G+L++
Sbjct: 296  ARPGKGDKLRKRNIHRSDGAKYMSYLKISKKQHQLVKNMKQSGKSIQSKSLNCVLGDLDT 355

Query: 494  FNVKPYVMFEEEEQKRIEEYWSNLSRKDIPSAFSKWGNIRSERGEMMKSLCLEIEEKLKH 315
             +V+PY  F +EEQK+++E+W  L+ KD+P A + W   + +R E+ KSL  EIE +LK+
Sbjct: 356  LHVQPYEEFVKEEQKKLQEHWMQLANKDLPVAHAIWRERQFQRQEITKSLEEEIEGQLKY 415

Query: 314  LME 306
             +E
Sbjct: 416  PVE 418


>ref|XP_006468901.1| PREDICTED: uncharacterized protein LOC102625405 isoform X1 [Citrus
            sinensis] gi|568829168|ref|XP_006468902.1| PREDICTED:
            uncharacterized protein LOC102625405 isoform X2 [Citrus
            sinensis] gi|568829170|ref|XP_006468903.1| PREDICTED:
            uncharacterized protein LOC102625405 isoform X3 [Citrus
            sinensis]
          Length = 940

 Score =  307 bits (787), Expect = 8e-81
 Identities = 172/438 (39%), Positives = 263/438 (60%), Gaps = 28/438 (6%)
 Frame = -3

Query: 1514 MSADQRKKRLSSADVFGYSSREQYRAKKKHLSSSQFDLNWKLNSHTPFKWDDNEKRAVAK 1335
            M+ADQ +KRL+   V G S  E Y+ KK+ L S Q  LN K  S+   KWD+++K+ +AK
Sbjct: 1    MAADQWRKRLNGVSVGGCSPLEDYKMKKRKLGSLQNGLNSK--SNISLKWDESKKKVIAK 58

Query: 1334 GEQVGLSKRELRPFKSTAPHHNN---NVADIIAIPNXXXXXXXXXXXLSHEVWLSILTEK 1164
             EQ+G+S+R  +PF  +         ++AD  ++P            LS+EVW + L+E+
Sbjct: 59   QEQIGISRRISKPFTDSVSGSKTVLGHLADAFSVPQEIFELENLTEVLSYEVWQTQLSEE 118

Query: 1163 ERSQLIQFLPSGVNPDEVVQDLLAGENLHFGNPFLKWSSALCSGEHHPESVCHYDETFRA 984
            ER+ L QFLPS  N ++VV+ LLAGEN HFG+PFLKW ++LCSG  HP++V H + + +A
Sbjct: 119  ERNYLKQFLPSAQNAEQVVEALLAGENFHFGSPFLKWGASLCSGNFHPDAVLHKERSLKA 178

Query: 983  NKNAYYSELQKYHDNMIRHLQKLKDRCANANVPEAEIEQMVWRI-------------REQ 843
            +K AY+ ELQKYH++++ +LQKLK R  +   PE EI   +WR+               +
Sbjct: 179  DKKAYFLELQKYHNDILEYLQKLKQRWESCKDPENEILPKIWRLGRDVEKRISSNAYESR 238

Query: 842  P---GEEVRYLNRELSVAADEKAWSSDNQSPLKAMGSEMPR---SKDTRKDED-----GK 696
            P    ++V   +   S  ADEKA SSDNQ+     G E+ +    K  +K++        
Sbjct: 239  PHDLEQDVTATSESCSWVADEKACSSDNQNSSVMKGGELHKRNYDKGFKKNKSTNSLIAS 298

Query: 695  TDVMXXXXXXXXXXXXXXANVNFGDGSKYMSYVKISKSQHELVRSMKQSGNSIQSKALNR 516
             +V+               N++  DG++YMSYVKIS+ QHELV+SMKQSG SIQ +++NR
Sbjct: 299  ENVLNVGTKLKKGYKLNKHNIHHNDGAQYMSYVKISRKQHELVKSMKQSGKSIQCRSMNR 358

Query: 515  VIGNLNSFNVKPYVMF-EEEEQKRIEEYWSNLSRKDIPSAFSKWGNIRSERGEMMKSLCL 339
            V+GNL S +V+PY +F EEE++K++ E+W  L+ +D+P+ +  W   + +  E+  SL  
Sbjct: 359  VLGNLESLHVQPYEVFLEEEQKKKLHEHWLKLATEDLPAFYVNWKERKKQLWEVTLSLRQ 418

Query: 338  EIEEKLKHLMEEEGMDHS 285
            E+ +KL+  +E+E  ++S
Sbjct: 419  EMMDKLECQIEDEEKENS 436


>dbj|BAB11123.1| unnamed protein product [Arabidopsis thaliana]
          Length = 978

 Score =  305 bits (782), Expect = 3e-80
 Identities = 166/421 (39%), Positives = 254/421 (60%), Gaps = 9/421 (2%)
 Frame = -3

Query: 1520 LGMSADQRKKRLSSADVFGYSSREQYRAKKKHLSSSQFDLNWKLNSHTPFKWDDNEKRAV 1341
            L M+ADQR+KR++SA+V G SSRE YRAK+K  +S    L  +   H   +WD N  + V
Sbjct: 38   LRMAADQRRKRMNSANVIGTSSREHYRAKRKKNASPDGAL--RSGDHITLEWDRNRSKVV 95

Query: 1340 AKGEQVGLSKRELRPFKSTAPHHNNNVADIIAIPNXXXXXXXXXXXLSHEVWLSILTEKE 1161
            +K EQVGLS R LR F    P   N +A +  +P+           LS+EVW S L++ E
Sbjct: 96   SKKEQVGLSFRHLREFVDVVPPRRNVLAQVCPVPHETFQLENLSEVLSNEVWRSCLSDGE 155

Query: 1160 RSQLIQFLPSGVNPDEVVQDLLAGENLHFGNPFLKWSSALCSGEHHPESVCHYDETFRAN 981
            R+ L QFLP GV+ ++VVQ LL GEN HFGNP L W +A+CSG+ HP+ +   +E  RA+
Sbjct: 156  RNYLRQFLPEGVDVEQVVQALLDGENFHFGNPSLDWGTAVCSGKAHPDQIVSREECLRAD 215

Query: 980  KNAYYSELQKYHDNMIRHLQKLKDRCANANVPEAEIEQMVWRIREQPGEEVRYLNRELSV 801
            K  YYS L+KYH ++I +LQ LK++  +   PE +I +M+W        +V    + L+ 
Sbjct: 216  KRRYYSNLEKYHQDIIDYLQTLKEKWESCKDPEKDIVKMMWGRSRGGNAQVNGSCQGLTA 275

Query: 800  AADEKAWSSDNQ--------SPLKAMGSEMPRSKDT-RKDEDGKTDVMXXXXXXXXXXXX 648
            A+   +W+ D++        SP+   G    RSK +  + E  + + +            
Sbjct: 276  ASGSSSWNEDDKPDSSDNMISPVVRCGEVQRRSKRSGLEKEKTQNNGVNVGGKVRKKNVL 335

Query: 647  XXANVNFGDGSKYMSYVKISKSQHELVRSMKQSGNSIQSKALNRVIGNLNSFNVKPYVMF 468
               ++   DG+KYMSY+KISK QH++V SMKQSG SIQS+ALNR+ GN++S +V+PY +F
Sbjct: 336  PKDSIQQTDGAKYMSYLKISKKQHQIVTSMKQSGKSIQSRALNRIFGNIDSLDVQPYGVF 395

Query: 467  EEEEQKRIEEYWSNLSRKDIPSAFSKWGNIRSERGEMMKSLCLEIEEKLKHLMEEEGMDH 288
             EEEQK++  +W +L  KD+P+A++ W  ++ ++ +++ S+  E++EK    ME++   +
Sbjct: 396  VEEEQKKLNAHWLHLV-KDLPAAYAIWKRLQLQKRDIISSMGRELKEKCNLWMEDKQQQY 454

Query: 287  S 285
            +
Sbjct: 455  A 455


>ref|XP_006287010.1| hypothetical protein CARUB_v10000158mg [Capsella rubella]
            gi|482555716|gb|EOA19908.1| hypothetical protein
            CARUB_v10000158mg [Capsella rubella]
          Length = 946

 Score =  305 bits (780), Expect = 5e-80
 Identities = 165/420 (39%), Positives = 253/420 (60%), Gaps = 15/420 (3%)
 Frame = -3

Query: 1514 MSADQRKKRLSSADVFGYSSREQYRAKKKHLSSSQFDLNWKLNSHTPFKWDDNEKRAVAK 1335
            M+ADQR+KR++SA+V G+SSR+ YR+K+K + S    L  +   H   +WD N  + V+K
Sbjct: 1    MAADQRRKRMNSANVIGFSSRDHYRSKRKKIGSPDGAL--RSGDHISLEWDRNRSKVVSK 58

Query: 1334 GEQVGLSKRELRPFKSTAPHHNNNVADIIAIPNXXXXXXXXXXXLSHEVWLSILTEKERS 1155
             EQVGLS R LR F    P   + +A +  +P+           LS+EVW S L++ ER+
Sbjct: 59   KEQVGLSFRHLREFVDVLPPRRHILAQVCPVPHDTFQLENLSEVLSNEVWRSSLSDGERN 118

Query: 1154 QLIQFLPSGVNPDEVVQDLLAGENLHFGNPFLKWSSALCSGEHHPESVCHYDETFRANKN 975
             L QFLP GV+ ++VVQ LL GEN HFGNPFL W  A+CSG+ HP+ +   +E  RA K 
Sbjct: 119  YLRQFLPDGVDVEQVVQSLLDGENFHFGNPFLDWGEAVCSGKAHPDQIVSREEDLRAGKR 178

Query: 974  AYYSELQKYHDNMIRHLQKLKDRCANANVPEAEIEQMVWRIREQPGEEVRYLNRELSVA- 798
             YYS+L+KYH ++I +LQ LK++      PE +  +++W         V    ++L+ A 
Sbjct: 179  RYYSDLEKYHHDIIDYLQTLKEKWEICKDPEKDAVKIIWGRSRGASAHVNGSCQDLTAAS 238

Query: 797  ------ADEKAWSSDNQSPLKAMGSEM---PRSKDTRKDE-----DGKTDVMXXXXXXXX 660
                  ADEK  SSDN++P      E+   P S    K++       +  V+        
Sbjct: 239  ESSSWTADEKPCSSDNKNPSVLRSGEVQKRPNSSAVEKEKYQSLLIARDHVVNAGVKARK 298

Query: 659  XXXXXXANVNFGDGSKYMSYVKISKSQHELVRSMKQSGNSIQSKALNRVIGNLNSFNVKP 480
                   ++   DG+KYMSY+KISK QH++V SMKQSG SIQS+ALNR++G++N+ +V+P
Sbjct: 299  KDMLPKHSIQQTDGAKYMSYLKISKKQHQIVTSMKQSGKSIQSRALNRILGSINNLDVQP 358

Query: 479  YVMFEEEEQKRIEEYWSNLSRKDIPSAFSKWGNIRSERGEMMKSLCLEIEEKLKHLMEEE 300
            Y +F EEEQK+++ +W  L  KD+P+A++ W  ++ ++ +++ S+  E+++KL   ME++
Sbjct: 359  YGVFVEEEQKKLKAHWLQLV-KDLPAAYAIWKKLQLQKRDIISSVGRELKDKLDPWMEDK 417


>ref|NP_196899.2| uncharacterized protein [Arabidopsis thaliana]
            gi|145334397|ref|NP_001078580.1| uncharacterized protein
            [Arabidopsis thaliana] gi|332004580|gb|AED91963.1|
            uncharacterized protein AT5G13950 [Arabidopsis thaliana]
            gi|332004581|gb|AED91964.1| uncharacterized protein
            AT5G13950 [Arabidopsis thaliana]
          Length = 939

 Score =  305 bits (780), Expect = 5e-80
 Identities = 165/419 (39%), Positives = 253/419 (60%), Gaps = 9/419 (2%)
 Frame = -3

Query: 1514 MSADQRKKRLSSADVFGYSSREQYRAKKKHLSSSQFDLNWKLNSHTPFKWDDNEKRAVAK 1335
            M+ADQR+KR++SA+V G SSRE YRAK+K  +S    L  +   H   +WD N  + V+K
Sbjct: 1    MAADQRRKRMNSANVIGTSSREHYRAKRKKNASPDGAL--RSGDHITLEWDRNRSKVVSK 58

Query: 1334 GEQVGLSKRELRPFKSTAPHHNNNVADIIAIPNXXXXXXXXXXXLSHEVWLSILTEKERS 1155
             EQVGLS R LR F    P   N +A +  +P+           LS+EVW S L++ ER+
Sbjct: 59   KEQVGLSFRHLREFVDVVPPRRNVLAQVCPVPHETFQLENLSEVLSNEVWRSCLSDGERN 118

Query: 1154 QLIQFLPSGVNPDEVVQDLLAGENLHFGNPFLKWSSALCSGEHHPESVCHYDETFRANKN 975
             L QFLP GV+ ++VVQ LL GEN HFGNP L W +A+CSG+ HP+ +   +E  RA+K 
Sbjct: 119  YLRQFLPEGVDVEQVVQALLDGENFHFGNPSLDWGTAVCSGKAHPDQIVSREECLRADKR 178

Query: 974  AYYSELQKYHDNMIRHLQKLKDRCANANVPEAEIEQMVWRIREQPGEEVRYLNRELSVAA 795
             YYS L+KYH ++I +LQ LK++  +   PE +I +M+W        +V    + L+ A+
Sbjct: 179  RYYSNLEKYHQDIIDYLQTLKEKWESCKDPEKDIVKMMWGRSRGGNAQVNGSCQGLTAAS 238

Query: 794  DEKAWSSDNQ--------SPLKAMGSEMPRSKDT-RKDEDGKTDVMXXXXXXXXXXXXXX 642
               +W+ D++        SP+   G    RSK +  + E  + + +              
Sbjct: 239  GSSSWNEDDKPDSSDNMISPVVRCGEVQRRSKRSGLEKEKTQNNGVNVGGKVRKKNVLPK 298

Query: 641  ANVNFGDGSKYMSYVKISKSQHELVRSMKQSGNSIQSKALNRVIGNLNSFNVKPYVMFEE 462
             ++   DG+KYMSY+KISK QH++V SMKQSG SIQS+ALNR+ GN++S +V+PY +F E
Sbjct: 299  DSIQQTDGAKYMSYLKISKKQHQIVTSMKQSGKSIQSRALNRIFGNIDSLDVQPYGVFVE 358

Query: 461  EEQKRIEEYWSNLSRKDIPSAFSKWGNIRSERGEMMKSLCLEIEEKLKHLMEEEGMDHS 285
            EEQK++  +W +L  KD+P+A++ W  ++ ++ +++ S+  E++EK    ME++   ++
Sbjct: 359  EEQKKLNAHWLHLV-KDLPAAYAIWKRLQLQKRDIISSMGRELKEKCNLWMEDKQQQYA 416


>dbj|BAC41867.1| unknown protein [Arabidopsis thaliana] gi|29029052|gb|AAO64905.1|
            At5g13950 [Arabidopsis thaliana]
          Length = 939

 Score =  303 bits (776), Expect = 1e-79
 Identities = 164/419 (39%), Positives = 253/419 (60%), Gaps = 9/419 (2%)
 Frame = -3

Query: 1514 MSADQRKKRLSSADVFGYSSREQYRAKKKHLSSSQFDLNWKLNSHTPFKWDDNEKRAVAK 1335
            M+ADQR+KR++SA+V G SSRE YRAK+K  +S    L  +   H   +WD N  + V+K
Sbjct: 1    MAADQRRKRMNSANVIGTSSREHYRAKRKKNASPDGAL--RSGDHITLEWDRNRSKVVSK 58

Query: 1334 GEQVGLSKRELRPFKSTAPHHNNNVADIIAIPNXXXXXXXXXXXLSHEVWLSILTEKERS 1155
             EQVGLS R LR F    P   N +A +  +P+           LS+EVW S L++ ER+
Sbjct: 59   KEQVGLSFRHLREFVDVVPPRRNVLAQVCPVPHETFQLENLSEVLSNEVWRSCLSDGERN 118

Query: 1154 QLIQFLPSGVNPDEVVQDLLAGENLHFGNPFLKWSSALCSGEHHPESVCHYDETFRANKN 975
             L QFLP GV+ ++VVQ LL GEN HFGNP L W +A+CSG+ HP+ +   +E  RA+K 
Sbjct: 119  YLRQFLPEGVDVEQVVQALLDGENFHFGNPSLDWGTAVCSGKAHPDQIVSREECLRADKR 178

Query: 974  AYYSELQKYHDNMIRHLQKLKDRCANANVPEAEIEQMVWRIREQPGEEVRYLNRELSVAA 795
             YYS L+KYH ++I +LQ L+++  +   PE +I +M+W        +V    + L+ A+
Sbjct: 179  RYYSNLEKYHQDIIDYLQTLEEKWESCKDPEKDIVKMMWGRSRGGNAQVNGSCQGLTAAS 238

Query: 794  DEKAWSSDNQ--------SPLKAMGSEMPRSKDT-RKDEDGKTDVMXXXXXXXXXXXXXX 642
               +W+ D++        SP+   G    RSK +  + E  + + +              
Sbjct: 239  GSSSWNEDDKPDSSDNMISPVVRCGEVQRRSKRSGLEKEKTQNNGVNVGGKVRKKNVLPK 298

Query: 641  ANVNFGDGSKYMSYVKISKSQHELVRSMKQSGNSIQSKALNRVIGNLNSFNVKPYVMFEE 462
             ++   DG+KYMSY+KISK QH++V SMKQSG SIQS+ALNR+ GN++S +V+PY +F E
Sbjct: 299  DSIQQTDGAKYMSYLKISKKQHQIVTSMKQSGKSIQSRALNRIFGNIDSLDVQPYGVFVE 358

Query: 461  EEQKRIEEYWSNLSRKDIPSAFSKWGNIRSERGEMMKSLCLEIEEKLKHLMEEEGMDHS 285
            EEQK++  +W +L  KD+P+A++ W  ++ ++ +++ S+  E++EK    ME++   ++
Sbjct: 359  EEQKKLNAHWLHLV-KDLPAAYAIWKRLQLQKRDIISSMGRELKEKCNLWMEDKQQQYA 416


>ref|XP_004516003.1| PREDICTED: uncharacterized protein LOC101502546 isoform X1 [Cicer
            arietinum] gi|502177085|ref|XP_004516004.1| PREDICTED:
            uncharacterized protein LOC101502546 isoform X2 [Cicer
            arietinum]
          Length = 940

 Score =  302 bits (773), Expect = 3e-79
 Identities = 175/465 (37%), Positives = 260/465 (55%), Gaps = 27/465 (5%)
 Frame = -3

Query: 1514 MSADQRKKRLSSADVFGYSSREQYRAKKKHLSSSQFDLNWKLNSHTPFKWDDNEKRAVAK 1335
            M+ADQR+KR++ A   GY SREQ R K+K+L   Q D+     SH   +WD N+KR VAK
Sbjct: 1    MAADQRRKRVNGASSIGYGSREQQRTKRKNLGLVQNDMR----SHVSVEWDGNQKRVVAK 56

Query: 1334 GEQVGLSKRELRPFKSTAPHHNNNVADIIAIPNXXXXXXXXXXXLSHEVWLSILTEKERS 1155
             EQ+G+S R+++PF S   + +  +AD   +P+           LS+EVW + L+E ER+
Sbjct: 57   REQIGISWRQMKPFVSYVSNDHKVLADAFTVPHEIFELDNLSEVLSYEVWKTHLSENERN 116

Query: 1154 QLIQFLPSGVNPDEVVQDLLAGENLHFGNPFLKWSSALCSGEHHPESVCHYDETFRANKN 975
             L+QFLP G+ P + V+DLLAG +  FG PFL W +++CSG+ HP+ +   ++  ++ K 
Sbjct: 117  HLMQFLPRGIEPHQTVEDLLAGIDFDFGKPFLNWGASVCSGDLHPDIIVDREQHVKSEKR 176

Query: 974  AYYSELQKYHDNMIRHLQKLKDRCANANVPEAEIEQMV----------------WRIREQ 843
            AYY++L  YH+NMI  L KLK+R  +   PE EI Q +                 R+ + 
Sbjct: 177  AYYTQLHNYHNNMIGFLSKLKERWQSCRDPEKEIVQKMRRPKHVQKRMPSNVNESRVNDH 236

Query: 842  PGEEVRYLNRELSVAADEKAWSSDNQSPLKAMGSEMPRSKDTRKDEDGKT--------DV 687
             G  V   +   S  A+E+A SSD          ++ R    + +  GK+        D+
Sbjct: 237  DG-NVAVTSESCSWDAEERACSSDYLISSMRKDDKLQRKVLEKVNVKGKSRNLMLSSDDM 295

Query: 686  MXXXXXXXXXXXXXXANVNFGDGSKYMSYVKISKSQHELVRSMKQSGNSIQSKALNRVIG 507
                            N++F D  +YMS +KIS+ QHELV++MKQSG SIQSK+LNRV+G
Sbjct: 296  HIKEEKPKKGDKVLNRNIHFIDSDQYMSCIKISRQQHELVKNMKQSGKSIQSKSLNRVLG 355

Query: 506  NLNSFNVKPYVMFEEEEQKRIEEYWSNLSRKDIPSAFSKWGNIRSERGEMMKSLCLEIEE 327
            NLN+ +V+PY +F +EEQK++ E+W  L  KD+P A++ W   + +R  M  SL  E+E+
Sbjct: 356  NLNNIHVQPYKVFVKEEQKKLHEHWLQLVIKDLPVAYANWMQRQKQRHAMRNSLMEEMED 415

Query: 326  KLKHLMEEEGMDHSNGAHFY---EGGSISSDDTEQTQEDEVPADE 201
            K   + EEE  + S G       +  S  S+   Q ++D  P DE
Sbjct: 416  KSNPIFEEED-NVSIGRELQDQDDAMSSGSNPRGQNEDDISPVDE 459


>ref|XP_002873641.1| hypothetical protein ARALYDRAFT_488220 [Arabidopsis lyrata subsp.
            lyrata] gi|297319478|gb|EFH49900.1| hypothetical protein
            ARALYDRAFT_488220 [Arabidopsis lyrata subsp. lyrata]
          Length = 943

 Score =  302 bits (773), Expect = 3e-79
 Identities = 160/419 (38%), Positives = 256/419 (61%), Gaps = 9/419 (2%)
 Frame = -3

Query: 1514 MSADQRKKRLSSADVFGYSSREQYRAKKKHLSSSQFDLNWKLNSHTPFKWDDNEKRAVAK 1335
            M+ADQR+KR++SA+V G+SSRE YRAK+K + S    L  +   H   +WD N  + V+K
Sbjct: 1    MAADQRRKRMNSANVIGFSSREHYRAKRKKIGSPDGAL--RSGDHISLEWDRNRSKVVSK 58

Query: 1334 GEQVGLSKRELRPFKSTAPHHNNNVADIIAIPNXXXXXXXXXXXLSHEVWLSILTEKERS 1155
             EQVGLS R LR F    P   N +A +  +P+           LS+EVW S L++ ER+
Sbjct: 59   REQVGLSWRHLREFFDVVPPRQNVLAQVCPVPHETFQLENLSQVLSNEVWHSCLSDGERN 118

Query: 1154 QLIQFLPSGVNPDEVVQDLLAGENLHFGNPFLKWSSALCSGEHHPESVCHYDETFRANKN 975
             L QFLP GV+ ++VVQ LL GEN HFGNP L W +A+CS + HP+ +   +E  RA+K 
Sbjct: 119  YLRQFLPEGVDVEQVVQALLDGENFHFGNPSLDWGTAVCSSKAHPDQIVSREERLRADKK 178

Query: 974  AYYSELQKYHDNMIRHLQKLKDRCANANVPEAEIEQMVWRIREQPGEEVRYLNRELSVAA 795
             YYS+L+KYH ++I +LQ LK++  +   PE +I +M+W        +V    ++L+ A+
Sbjct: 179  RYYSDLEKYHHDIIDYLQTLKEKWESCKDPEKDIVKMMWGRSRGGNAQVNGSCQDLTAAS 238

Query: 794  DEKAWSSDNQ--------SPLKAMGSEMPRSKDT-RKDEDGKTDVMXXXXXXXXXXXXXX 642
               +W++D++        S +   G    R K++  + E  + + +              
Sbjct: 239  GSSSWNADDKPDSSDNKISSVVRSGDVQRRPKNSGLEKEKSQNNGVNVGGKVRKKDVFPK 298

Query: 641  ANVNFGDGSKYMSYVKISKSQHELVRSMKQSGNSIQSKALNRVIGNLNSFNVKPYVMFEE 462
             ++   DG+KYMSY+KISK QH++V SMKQSG SIQS+ALNR++G+++S +V+PY +F E
Sbjct: 299  DSIQQTDGAKYMSYLKISKKQHQIVTSMKQSGKSIQSRALNRILGSIDSLDVQPYGVFVE 358

Query: 461  EEQKRIEEYWSNLSRKDIPSAFSKWGNIRSERGEMMKSLCLEIEEKLKHLMEEEGMDHS 285
            EEQK++  +W +L  KD+P+A++ W  ++ ++ +++ S+  E+++K    ME++   ++
Sbjct: 359  EEQKKLNAHWLHLV-KDLPAAYAIWKKLQLQKRDIISSMGRELKDKRNPWMEDKQQQYA 416


>ref|NP_001190306.1| uncharacterized protein [Arabidopsis thaliana]
            gi|332004582|gb|AED91965.1| uncharacterized protein
            AT5G13950 [Arabidopsis thaliana]
          Length = 954

 Score =  300 bits (767), Expect = 2e-78
 Identities = 167/434 (38%), Positives = 257/434 (59%), Gaps = 24/434 (5%)
 Frame = -3

Query: 1514 MSADQRKKRLSSADVFGYSSREQYRAKKKHLSSSQFDLNWKLNSHTPFKWDDNEKRAVAK 1335
            M+ADQR+KR++SA+V G SSRE YRAK+K  +S    L  +   H   +WD N  + V+K
Sbjct: 1    MAADQRRKRMNSANVIGTSSREHYRAKRKKNASPDGAL--RSGDHITLEWDRNRSKVVSK 58

Query: 1334 GEQVGLSKRELRPFKSTAPHHNNNVADIIAIPNXXXXXXXXXXXLSHEVWLSILTEKERS 1155
             EQVGLS R LR F    P   N +A +  +P+           LS+EVW S L++ ER+
Sbjct: 59   KEQVGLSFRHLREFVDVVPPRRNVLAQVCPVPHETFQLENLSEVLSNEVWRSCLSDGERN 118

Query: 1154 QLIQFLPSGVNPDEVVQDLLAGENLHFGNPFLKWSSALCSGEHHPESVCHYDETFRANKN 975
             L QFLP GV+ ++VVQ LL GEN HFGNP L W +A+CSG+ HP+ +   +E  RA+K 
Sbjct: 119  YLRQFLPEGVDVEQVVQALLDGENFHFGNPSLDWGTAVCSGKAHPDQIVSREECLRADKR 178

Query: 974  AYYSELQKYHDNMIRHLQKLKDRCANANVPEAEIEQMVW--------------RIREQPG 837
             YYS L+KYH ++I +LQ LK++  +   PE +I +M+W               +R + G
Sbjct: 179  RYYSNLEKYHQDIIDYLQTLKEKWESCKDPEKDIVKMMWGSVLYNFLFKRRTVEVRSRGG 238

Query: 836  E-EVRYLNRELSVAADEKAWSSDNQ--------SPLKAMGSEMPRSKDT-RKDEDGKTDV 687
              +V    + L+ A+   +W+ D++        SP+   G    RSK +  + E  + + 
Sbjct: 239  NAQVNGSCQGLTAASGSSSWNEDDKPDSSDNMISPVVRCGEVQRRSKRSGLEKEKTQNNG 298

Query: 686  MXXXXXXXXXXXXXXANVNFGDGSKYMSYVKISKSQHELVRSMKQSGNSIQSKALNRVIG 507
            +               ++   DG+KYMSY+KISK QH++V SMKQSG SIQS+ALNR+ G
Sbjct: 299  VNVGGKVRKKNVLPKDSIQQTDGAKYMSYLKISKKQHQIVTSMKQSGKSIQSRALNRIFG 358

Query: 506  NLNSFNVKPYVMFEEEEQKRIEEYWSNLSRKDIPSAFSKWGNIRSERGEMMKSLCLEIEE 327
            N++S +V+PY +F EEEQK++  +W +L  KD+P+A++ W  ++ ++ +++ S+  E++E
Sbjct: 359  NIDSLDVQPYGVFVEEEQKKLNAHWLHLV-KDLPAAYAIWKRLQLQKRDIISSMGRELKE 417

Query: 326  KLKHLMEEEGMDHS 285
            K    ME++   ++
Sbjct: 418  KCNLWMEDKQQQYA 431


>ref|XP_004306320.1| PREDICTED: uncharacterized protein LOC101311025 [Fragaria vesca
            subsp. vesca]
          Length = 861

 Score =  298 bits (764), Expect = 3e-78
 Identities = 174/467 (37%), Positives = 259/467 (55%), Gaps = 29/467 (6%)
 Frame = -3

Query: 1514 MSADQRKKRLSSADVFGYSSREQYRAKKKHLSSSQFDLNWKLNSHTPFKWDDNEKRAVAK 1335
            M+ADQR+KR + A + G +SR+Q+RAKKK++  ++ D    +N H   +WD ++K  VAK
Sbjct: 1    MAADQRRKRSNGATLVGCNSRDQHRAKKKNMGVNEDDST--INPHISLEWDGSQKMVVAK 58

Query: 1334 GEQVGLSKRELRPFKSTAPHHNNNVADIIAIPNXXXXXXXXXXXLSHEVWLSILTEKERS 1155
             +Q+G+S  ++RP+  +     +  AD+  +P+           LS+EVW + L+E ERS
Sbjct: 59   RDQIGISWNDMRPYIDSTFTSYDIPADVFVVPHGIYELKNLEDVLSYEVWQTHLSENERS 118

Query: 1154 QLIQFLPSGVNPDEVVQDLLAGENLHFGNPFLKWSSALCSGEHHPESVCHYDETFRANKN 975
             L+QFLP G    +VV+ LLAG+  HFGNPF+KW ++LCSG  HP+ +   +     +K 
Sbjct: 119  YLMQFLPRGSEAQQVVKALLAGDYFHFGNPFVKWGTSLCSGSFHPDVILRREHCLMTDKK 178

Query: 974  AYYSELQKYHDNMIRHLQKLKDRCANANVPEAEIEQMVWR-----------IREQPGEEV 828
             YY ELQKYH+++I +L KLK+R A    PE E  Q +WR                  E 
Sbjct: 179  VYYKELQKYHNDLIAYLLKLKERYARCEDPEEEFLQKIWRSVGLSRKDMEKYSSSHANES 238

Query: 827  RY---------LNRELSVAADEKAWSSDNQSPLKAMGSEMP---------RSKDTRKDED 702
            R+          +   S  ADEKA  SDNQ      G E+          + K+ +K   
Sbjct: 239  RFCELDENATPTSESGSWVADEKACCSDNQISSVNKGGELQSRFNEKGFLKDKNRQKLLT 298

Query: 701  GKTDVMXXXXXXXXXXXXXXANVNFGDGSKYMSYVKISKSQHELVRSMKQSGNSIQSKAL 522
                +                N+   DGSKYMSYVKISK Q+E+V+SMKQSG SIQ ++L
Sbjct: 299  EDDALHVGATPKKGDKLHKRNNIYNNDGSKYMSYVKISKKQYEIVKSMKQSGRSIQFRSL 358

Query: 521  NRVIGNLNSFNVKPYVMFEEEEQKRIEEYWSNLSRKDIPSAFSKWGNIRSERGEMMKSLC 342
            NRV+G  N  N++PY +F EEEQK + ++W  L+RKD+P+A++ W  +R +R +M+ SL 
Sbjct: 359  NRVLG--NDCNIQPYHVFVEEEQKNLHKHWLQLARKDLPAAYAIWMEMRLQRRKMIMSLE 416

Query: 341  LEIEEKLKHLMEEEGMDHSNGAHFYEGGSISSDDTEQTQEDEVPADE 201
              ++E+L+ +ME+   D  N     E GS+  D+ E    +    D+
Sbjct: 417  TNMKERLESVMEQ---DEGN-----ENGSMRQDELELENHESTMHDD 455


>gb|ESW23708.1| hypothetical protein PHAVU_004G069600g [Phaseolus vulgaris]
            gi|561025024|gb|ESW23709.1| hypothetical protein
            PHAVU_004G069600g [Phaseolus vulgaris]
          Length = 978

 Score =  297 bits (761), Expect = 8e-78
 Identities = 172/469 (36%), Positives = 263/469 (56%), Gaps = 36/469 (7%)
 Frame = -3

Query: 1514 MSADQRKKRLSSADVFGYSSREQYRAKKKHLSSSQFDLNWKLNSHTPFKWDDNEKRAVAK 1335
            M+ADQR+KR++ A++ GY SREQ R K+K+L   Q DLN  + SH   +WD N+K+ VAK
Sbjct: 1    MAADQRRKRVNGANIAGYGSREQQRIKRKNLGLVQNDLN--MRSHISVEWDGNQKKVVAK 58

Query: 1334 GEQVGLSKRELRPFKSTAPHHNNNVADIIAIPNXXXXXXXXXXXLSHEVWLSILTEKERS 1155
             EQVG+S R+ +PF ++  + +  VAD++ +P            LS+EVW + L+E ER+
Sbjct: 59   REQVGISWRQTKPFINSVANGHKLVADVLTVPQEIFDLDNLSDVLSYEVWKTHLSENERN 118

Query: 1154 QLIQFLPSGVNPDEVVQDLLAGENLHFGNPFLKWSSALCSGEHHPESVCHYDETFRANKN 975
             L+ FLP    P ++V+DLL+G N +FGNPF KW ++LC G+ HP+ + + ++  ++ K 
Sbjct: 119  LLMNFLPRDFEPHQLVEDLLSGINFNFGNPFSKWGASLCLGDLHPDMIVYREQHLKSEKK 178

Query: 974  AYYSELQKYHDNMIRHLQKLKDRCANANVPEAEIEQMVWRI----REQPGE--EVRYLNR 813
             YYS +  YH++MI  L  LK    +   PE EI Q +WR     +  P +  E R  + 
Sbjct: 179  EYYSHIHNYHNDMIGFLSNLKKSWQSCKDPEKEIVQKIWRSKHVEKRMPSKVIESRVYDH 238

Query: 812  ELSVA---------ADEKAWSSDNQSPLKAMGSEMPR---SKDTRKDE-----DGKTDVM 684
            + +V          A++K  SSDNQ        ++ R    KD  K +     D    V 
Sbjct: 239  DGNVTGTSESCSWDAEDKPCSSDNQISSLRKDDKLQRRVLEKDIVKGKSRNLMDSMDRVP 298

Query: 683  XXXXXXXXXXXXXXANVNFGDGSKYMSYVKISKSQHELVRSMKQSGNSIQSKALNRVIGN 504
                           N +  DG KYMSY+KIS+ QHELV++MKQSG SIQS++LNRV+GN
Sbjct: 299  NLGEKPKTGDKLPKLNSHSSDGDKYMSYIKISRQQHELVKNMKQSGKSIQSRSLNRVLGN 358

Query: 503  LNSFNVKPYVMFEEEEQKRIEEYWSNLSRKDIPSAFSKWGNIRSERGEMMKSLCLEIEEK 324
            L    V+PY +F +EE+K+++E+W  L   D+P A+  W   ++ R  +  SL  E+++K
Sbjct: 359  LEKIQVQPYNIFVKEEKKKLQEHWLLLVNNDLPEAYVNWKERQTRRHAVRNSLVAEMKDK 418

Query: 323  LKHLMEEEG-----------MDHSNGAHFYE--GGSISSDDTEQTQEDE 216
                +EEE             D ++G+  ++   G+I+S    Q Q+D+
Sbjct: 419  SNSFIEEEDDVNSGSELKDQDDVNSGSELHDQVKGNINSGSELQDQDDD 467


>ref|XP_006574737.1| PREDICTED: intracellular protein transport protein USO1-like isoform
            X4 [Glycine max]
          Length = 936

 Score =  297 bits (760), Expect = 1e-77
 Identities = 171/458 (37%), Positives = 256/458 (55%), Gaps = 24/458 (5%)
 Frame = -3

Query: 1514 MSADQRKKRLSSADVFGYSSREQYRAKKKHLSSSQFDLNWKLNSHTPFKWDDNEKRAVAK 1335
            M+ADQR+KR++ A++ GY SREQ+R K+K+L   Q DLN  +  H   +WD N K+ VAK
Sbjct: 1    MAADQRRKRVNGANIAGYGSREQHRIKRKNLGLVQNDLN--MRPHISVEWDGNHKKVVAK 58

Query: 1334 GEQVGLSKRELRPFKSTAPHHNNNVADIIAIPNXXXXXXXXXXXLSHEVWLSILTEKERS 1155
             EQ+G+S R+++PF +   + +  +AD+ A+P            LS+EVW + L+E ER+
Sbjct: 59   WEQIGISWRQMKPFINLVSNDHKILADVFAVPQEIFELDNLSEVLSYEVWKTHLSENERN 118

Query: 1154 QLIQFLPSGVNPDEVVQDLLAGENLHFGNPFLKWSSALCSGEHHPESVCHYDETFRANKN 975
             L+ FLPSG    +VV++LL G N +FGNPF KW ++LC G  HP+ +   ++  +  + 
Sbjct: 119  LLMNFLPSGFESHQVVEELLGGINFNFGNPFSKWGASLCLGSLHPDMIVDQEQHLKTERR 178

Query: 974  AYYSELQKYHDNMIRHLQKLKDRCANANVPEAEIEQMVWRIR-----------EQPGEE- 831
             YYS +  YH++MI  L KLK    +   PE EI Q +WR +           E  G + 
Sbjct: 179  EYYSHIHNYHNDMIGFLSKLKKSWQSCKDPEKEIVQKIWRTKHVEKRMLSKVIESRGYDH 238

Query: 830  ---VRYLNRELSVAADEKAWSSDNQSPLKAMGSEMPR--------SKDTRKDEDGKTDVM 684
               V   +   S  A+EKA SSDNQ        ++ R           +R   D   ++ 
Sbjct: 239  NGNVTGTSESCSWDAEEKACSSDNQISSLRKDDKLQRRVLEKCIVKGKSRNLMDSLDNMP 298

Query: 683  XXXXXXXXXXXXXXANVNFGDGSKYMSYVKISKSQHELVRSMKQSGNSIQSKALNRVIGN 504
                           +++  D  KYMS +KISK QHELV++MKQ+G SIQS++LNRV+GN
Sbjct: 299  NVGEKPKTGDKLPKHSIHSSDSDKYMSCIKISKQQHELVKNMKQAGKSIQSRSLNRVLGN 358

Query: 503  LNSFNVKPYVMFEEEEQKRIEEYWSNLSRKDIPSAFSKWGNIRSERGEMMKSLCLEIEEK 324
            L   +V+PY  F +EEQK+++E+W  L  KD+P+A+  W   R +R  +  SL  E+++K
Sbjct: 359  LEKIHVQPYNTFVKEEQKKLQEHWLLLVNKDLPAAYLNWTERRIQRHAVRNSLVAEMKDK 418

Query: 323  LKHLMEEE-GMDHSNGAHFYEGGSISSDDTEQTQEDEV 213
                MEEE G+D   G+   +   ++S  +E    DEV
Sbjct: 419  SNPFMEEEDGVD--TGSELKDQDGVNS-GSELQDHDEV 453


>ref|XP_006574736.1| PREDICTED: intracellular protein transport protein USO1-like isoform
            X3 [Glycine max]
          Length = 938

 Score =  297 bits (760), Expect = 1e-77
 Identities = 171/458 (37%), Positives = 256/458 (55%), Gaps = 24/458 (5%)
 Frame = -3

Query: 1514 MSADQRKKRLSSADVFGYSSREQYRAKKKHLSSSQFDLNWKLNSHTPFKWDDNEKRAVAK 1335
            M+ADQR+KR++ A++ GY SREQ+R K+K+L   Q DLN  +  H   +WD N K+ VAK
Sbjct: 1    MAADQRRKRVNGANIAGYGSREQHRIKRKNLGLVQNDLN--MRPHISVEWDGNHKKVVAK 58

Query: 1334 GEQVGLSKRELRPFKSTAPHHNNNVADIIAIPNXXXXXXXXXXXLSHEVWLSILTEKERS 1155
             EQ+G+S R+++PF +   + +  +AD+ A+P            LS+EVW + L+E ER+
Sbjct: 59   WEQIGISWRQMKPFINLVSNDHKILADVFAVPQEIFELDNLSEVLSYEVWKTHLSENERN 118

Query: 1154 QLIQFLPSGVNPDEVVQDLLAGENLHFGNPFLKWSSALCSGEHHPESVCHYDETFRANKN 975
             L+ FLPSG    +VV++LL G N +FGNPF KW ++LC G  HP+ +   ++  +  + 
Sbjct: 119  LLMNFLPSGFESHQVVEELLGGINFNFGNPFSKWGASLCLGSLHPDMIVDQEQHLKTERR 178

Query: 974  AYYSELQKYHDNMIRHLQKLKDRCANANVPEAEIEQMVWRIR-----------EQPGEE- 831
             YYS +  YH++MI  L KLK    +   PE EI Q +WR +           E  G + 
Sbjct: 179  EYYSHIHNYHNDMIGFLSKLKKSWQSCKDPEKEIVQKIWRTKHVEKRMLSKVIESRGYDH 238

Query: 830  ---VRYLNRELSVAADEKAWSSDNQSPLKAMGSEMPR--------SKDTRKDEDGKTDVM 684
               V   +   S  A+EKA SSDNQ        ++ R           +R   D   ++ 
Sbjct: 239  NGNVTGTSESCSWDAEEKACSSDNQISSLRKDDKLQRRVLEKCIVKGKSRNLMDSLDNMP 298

Query: 683  XXXXXXXXXXXXXXANVNFGDGSKYMSYVKISKSQHELVRSMKQSGNSIQSKALNRVIGN 504
                           +++  D  KYMS +KISK QHELV++MKQ+G SIQS++LNRV+GN
Sbjct: 299  NVGEKPKTGDKLPKHSIHSSDSDKYMSCIKISKQQHELVKNMKQAGKSIQSRSLNRVLGN 358

Query: 503  LNSFNVKPYVMFEEEEQKRIEEYWSNLSRKDIPSAFSKWGNIRSERGEMMKSLCLEIEEK 324
            L   +V+PY  F +EEQK+++E+W  L  KD+P+A+  W   R +R  +  SL  E+++K
Sbjct: 359  LEKIHVQPYNTFVKEEQKKLQEHWLLLVNKDLPAAYLNWTERRIQRHAVRNSLVAEMKDK 418

Query: 323  LKHLMEEE-GMDHSNGAHFYEGGSISSDDTEQTQEDEV 213
                MEEE G+D   G+   +   ++S  +E    DEV
Sbjct: 419  SNPFMEEEDGVD--TGSELKDQDGVNS-GSELQDHDEV 453


>ref|XP_006574734.1| PREDICTED: intracellular protein transport protein USO1-like isoform
            X1 [Glycine max] gi|571439016|ref|XP_006574735.1|
            PREDICTED: intracellular protein transport protein
            USO1-like isoform X2 [Glycine max]
          Length = 960

 Score =  297 bits (760), Expect = 1e-77
 Identities = 171/458 (37%), Positives = 256/458 (55%), Gaps = 24/458 (5%)
 Frame = -3

Query: 1514 MSADQRKKRLSSADVFGYSSREQYRAKKKHLSSSQFDLNWKLNSHTPFKWDDNEKRAVAK 1335
            M+ADQR+KR++ A++ GY SREQ+R K+K+L   Q DLN  +  H   +WD N K+ VAK
Sbjct: 1    MAADQRRKRVNGANIAGYGSREQHRIKRKNLGLVQNDLN--MRPHISVEWDGNHKKVVAK 58

Query: 1334 GEQVGLSKRELRPFKSTAPHHNNNVADIIAIPNXXXXXXXXXXXLSHEVWLSILTEKERS 1155
             EQ+G+S R+++PF +   + +  +AD+ A+P            LS+EVW + L+E ER+
Sbjct: 59   WEQIGISWRQMKPFINLVSNDHKILADVFAVPQEIFELDNLSEVLSYEVWKTHLSENERN 118

Query: 1154 QLIQFLPSGVNPDEVVQDLLAGENLHFGNPFLKWSSALCSGEHHPESVCHYDETFRANKN 975
             L+ FLPSG    +VV++LL G N +FGNPF KW ++LC G  HP+ +   ++  +  + 
Sbjct: 119  LLMNFLPSGFESHQVVEELLGGINFNFGNPFSKWGASLCLGSLHPDMIVDQEQHLKTERR 178

Query: 974  AYYSELQKYHDNMIRHLQKLKDRCANANVPEAEIEQMVWRIR-----------EQPGEE- 831
             YYS +  YH++MI  L KLK    +   PE EI Q +WR +           E  G + 
Sbjct: 179  EYYSHIHNYHNDMIGFLSKLKKSWQSCKDPEKEIVQKIWRTKHVEKRMLSKVIESRGYDH 238

Query: 830  ---VRYLNRELSVAADEKAWSSDNQSPLKAMGSEMPR--------SKDTRKDEDGKTDVM 684
               V   +   S  A+EKA SSDNQ        ++ R           +R   D   ++ 
Sbjct: 239  NGNVTGTSESCSWDAEEKACSSDNQISSLRKDDKLQRRVLEKCIVKGKSRNLMDSLDNMP 298

Query: 683  XXXXXXXXXXXXXXANVNFGDGSKYMSYVKISKSQHELVRSMKQSGNSIQSKALNRVIGN 504
                           +++  D  KYMS +KISK QHELV++MKQ+G SIQS++LNRV+GN
Sbjct: 299  NVGEKPKTGDKLPKHSIHSSDSDKYMSCIKISKQQHELVKNMKQAGKSIQSRSLNRVLGN 358

Query: 503  LNSFNVKPYVMFEEEEQKRIEEYWSNLSRKDIPSAFSKWGNIRSERGEMMKSLCLEIEEK 324
            L   +V+PY  F +EEQK+++E+W  L  KD+P+A+  W   R +R  +  SL  E+++K
Sbjct: 359  LEKIHVQPYNTFVKEEQKKLQEHWLLLVNKDLPAAYLNWTERRIQRHAVRNSLVAEMKDK 418

Query: 323  LKHLMEEE-GMDHSNGAHFYEGGSISSDDTEQTQEDEV 213
                MEEE G+D   G+   +   ++S  +E    DEV
Sbjct: 419  SNPFMEEEDGVD--TGSELKDQDGVNS-GSELQDHDEV 453


Top