BLASTX nr result

ID: Perilla23_contig00013896 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Perilla23_contig00013896
         (1172 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011086145.1| PREDICTED: uncharacterized protein LOC105167...   228   6e-57
ref|XP_012841687.1| PREDICTED: uncharacterized protein LOC105961...   204   9e-50
ref|XP_011081285.1| PREDICTED: uncharacterized protein LOC105164...   198   8e-48
ref|XP_004245789.1| PREDICTED: uncharacterized protein LOC101263...   149   4e-33
ref|XP_006359254.1| PREDICTED: uncharacterized protein LOC102604...   147   1e-32
ref|XP_012475864.1| PREDICTED: digestive organ expansion factor ...   146   3e-32
ref|XP_009589991.1| PREDICTED: uncharacterized protein LOC104087...   145   6e-32
gb|KHF98922.1| hypothetical protein F383_19218 [Gossypium arboreum]   143   2e-31
ref|XP_007039766.1| Uncharacterized protein isoform 4 [Theobroma...   142   4e-31
ref|XP_007039763.1| Uncharacterized protein isoform 1 [Theobroma...   142   4e-31
ref|XP_009792874.1| PREDICTED: uncharacterized protein LOC104239...   140   2e-30
emb|CDP10003.1| unnamed protein product [Coffea canephora]            139   5e-30
ref|XP_006440252.1| hypothetical protein CICLE_v10022000mg [Citr...   139   5e-30
gb|KDO61351.1| hypothetical protein CISIN_1g027940mg [Citrus sin...   137   1e-29
ref|XP_007155691.1| hypothetical protein PHAVU_003G223000g, part...   137   2e-29
ref|XP_014506116.1| PREDICTED: uncharacterized protein LOC106765...   135   9e-29
gb|KRH57463.1| hypothetical protein GLYMA_05G062500 [Glycine max]     135   9e-29
ref|XP_008355575.1| PREDICTED: uncharacterized protein LOC103419...   135   9e-29
ref|XP_008355574.1| PREDICTED: uncharacterized protein LOC103419...   135   9e-29
ref|XP_003525577.1| PREDICTED: histone-lysine N-methyltransferas...   135   9e-29

>ref|XP_011086145.1| PREDICTED: uncharacterized protein LOC105167946 [Sesamum indicum]
          Length = 213

 Score =  228 bits (582), Expect = 6e-57
 Identities = 127/215 (59%), Positives = 139/215 (64%), Gaps = 3/215 (1%)
 Frame = -1

Query: 992 MAAPPVKSQPLHNFSLPHLKWAHKNXXXXXXXXXSNHHRFRRRDSGDQHRRGQXXXXXXX 813
           MAAPPVKSQPLHNFSLPHLKWAHKN           HHRFRRRDS DQH+          
Sbjct: 1   MAAPPVKSQPLHNFSLPHLKWAHKNASSAPGSSS-QHHRFRRRDSPDQHQHRFQDPDSDP 59

Query: 812 XXXXXXPKQQQRPAD---DAVSPAVEEENGEAKPWKLRPRKETIXXXXXXXXXXXXXXXX 642
                 PKQQ        D+V P  EE+N + KPW LRPR+E I                
Sbjct: 60  GTRLTNPKQQPPVGSSRKDSVLPMGEEDNEDDKPWNLRPRREIIRASSTSNKETGAENND 119

Query: 641 XXXXXXXXNGLKSQRLRGMAEGGPQNGSVERKAAKRKVWISLSREEIEEDVYALTGGKPA 462
                    G+KSQRLRG+ EG  QNG VER+  KRKVWISLSREEIEEDVYALTGG+PA
Sbjct: 120 NKGYSKSN-GVKSQRLRGLVEGRQQNGGVERRG-KRKVWISLSREEIEEDVYALTGGRPA 177

Query: 461 RRPKKWPKNVQKQLDNVFPGLYLVGVAADSYRVTM 357
           RRP++WPKNVQKQLD+VFPGLYLVGV ADSYRV M
Sbjct: 178 RRPRRWPKNVQKQLDSVFPGLYLVGVTADSYRVMM 212


>ref|XP_012841687.1| PREDICTED: uncharacterized protein LOC105961974 [Erythranthe
           guttatus]
          Length = 216

 Score =  204 bits (520), Expect = 9e-50
 Identities = 116/222 (52%), Positives = 135/222 (60%), Gaps = 10/222 (4%)
 Frame = -1

Query: 992 MAAPPVKSQPLHNFSLPHLKWAHKNXXXXXXXXXS-----NHHRFRRRDSGDQHRRGQXX 828
           MAAPP+KSQPLHNFSLP L+WAHKN         +     +    RRRDS DQ       
Sbjct: 1   MAAPPMKSQPLHNFSLPTLRWAHKNSSSSSPAAAAAAGGGSSRCHRRRDSPDQLHETDSD 60

Query: 827 XXXXXXXXXXXPKQQQRPADDAVSPAVEEENGEAKPWKLRPRKE--TIXXXXXXXXXXXX 654
                         Q+ P  D V  A +E NGEAKPW LRPRKE  T+            
Sbjct: 61  NETQPPNP-----NQRTP--DPVPAAADEGNGEAKPWNLRPRKEAPTVKAASKSRKEARG 113

Query: 653 XXXXXXXXXXXXNGLKSQRLRGMA---EGGPQNGSVERKAAKRKVWISLSREEIEEDVYA 483
                        G+KSQR+RGMA   EGG +NG  ER  + RK+WISLS+EEIEED+YA
Sbjct: 114 ECNESKINSINNGGVKSQRIRGMAAAVEGGQRNGGGERNGSSRKIWISLSKEEIEEDIYA 173

Query: 482 LTGGKPARRPKKWPKNVQKQLDNVFPGLYLVGVAADSYRVTM 357
           +TGGKP+RRP+KWP+NVQKQLDNVFPGLYLVGVA DSYRV+M
Sbjct: 174 MTGGKPSRRPRKWPRNVQKQLDNVFPGLYLVGVAPDSYRVSM 215


>ref|XP_011081285.1| PREDICTED: uncharacterized protein LOC105164351 [Sesamum indicum]
          Length = 205

 Score =  198 bits (503), Expect = 8e-48
 Identities = 111/209 (53%), Positives = 124/209 (59%)
 Frame = -1

Query: 992 MAAPPVKSQPLHNFSLPHLKWAHKNXXXXXXXXXSNHHRFRRRDSGDQHRRGQXXXXXXX 813
           MA PPVKSQPLHNFSLPHLKW HKN         S  HR R  DS DQ            
Sbjct: 1   MAVPPVKSQPLHNFSLPHLKWPHKNSSSSASGSTSLQHRLRHHDSPDQRHHCYPDAGCNL 60

Query: 812 XXXXXXPKQQQRPADDAVSPAVEEENGEAKPWKLRPRKETIXXXXXXXXXXXXXXXXXXX 633
                   Q      +++    +++N  A+PW LRPRKE I                   
Sbjct: 61  ESGPTTASQS-----NSLVWEEDDDNEAARPWNLRPRKEVIKAAWNLDKGTRDGNNYNKT 115

Query: 632 XXXXXNGLKSQRLRGMAEGGPQNGSVERKAAKRKVWISLSREEIEEDVYALTGGKPARRP 453
                 G+KSQRL G+ EGG  +G VERK  KRKVWISLSREEIEEDVYALTGGKPARRP
Sbjct: 116 IRSSN-GVKSQRLSGLLEGGKLSGGVERKE-KRKVWISLSREEIEEDVYALTGGKPARRP 173

Query: 452 KKWPKNVQKQLDNVFPGLYLVGVAADSYR 366
           ++WPKNVQKQLD+VFPGLYLVGV  DSYR
Sbjct: 174 RRWPKNVQKQLDSVFPGLYLVGVGPDSYR 202


>ref|XP_004245789.1| PREDICTED: uncharacterized protein LOC101263341 [Solanum
           lycopersicum]
          Length = 219

 Score =  149 bits (376), Expect = 4e-33
 Identities = 97/230 (42%), Positives = 115/230 (50%), Gaps = 20/230 (8%)
 Frame = -1

Query: 992 MAAPPVKSQPLHNFSLPHLKWAHKNXXXXXXXXXSNHHRFRRRDS--------------- 858
           MA  PVKSQPLH FSLP LKW +K+           +HRFRRRDS               
Sbjct: 1   MATAPVKSQPLHYFSLPQLKWGNKSNTNA-------NHRFRRRDSPPSNGDNPTQTADVD 53

Query: 857 -GDQHRRGQXXXXXXXXXXXXXPKQQQRPADDAVSPAVEEE----NGEAKPWKLRPRKET 693
            G    + Q               Q +   ++ V    EEE     GE K W LRPR+  
Sbjct: 54  GGSDSEKVQPRSEAEADPNGVSSLQGREEHEEKVKEEEEEEVGCEEGEVKLWNLRPRRGV 113

Query: 692 IXXXXXXXXXXXXXXXXXXXXXXXXNGLKSQRLRGMAEGGPQNGSVERKAAKRKVWISLS 513
                                       +SQRL+  A+G   NG    K  K+K+WISLS
Sbjct: 114 TKVETTSLKNVEMRVESSNHMQ------RSQRLKDNADG---NGVGSGKKGKKKLWISLS 164

Query: 512 REEIEEDVYALTGGKPARRPKKWPKNVQKQLDNVFPGLYLVGVAADSYRV 363
           REEIEEDVY++TG +PARRPKK  K +QKQLDNVFPGLYLVGV ADS+RV
Sbjct: 165 REEIEEDVYSMTGSRPARRPKKRSKTIQKQLDNVFPGLYLVGVTADSFRV 214


>ref|XP_006359254.1| PREDICTED: uncharacterized protein LOC102604791 [Solanum tuberosum]
          Length = 220

 Score =  147 bits (372), Expect = 1e-32
 Identities = 97/231 (41%), Positives = 115/231 (49%), Gaps = 21/231 (9%)
 Frame = -1

Query: 992 MAAPPVKSQPLHNFSLPHLKWAHKNXXXXXXXXXSNHHRFRRRDS--------------- 858
           MAA PVKSQPLH FSLP LKW +K+           +HRFRRRDS               
Sbjct: 1   MAAAPVKSQPLHYFSLPQLKWGNKSHTNA-------NHRFRRRDSPPSNGDNPPQTADVD 53

Query: 857 -GDQHRRGQXXXXXXXXXXXXXPKQQQRPADDAVSPAVEEEN-----GEAKPWKLRPRKE 696
            G    + Q               Q +   +  V    EEE      GE K W LRPR+ 
Sbjct: 54  GGSDSEKVQPRSEAEADPNGVSSLQGEDEHEKEVKEEEEEEEVGCEEGEVKLWNLRPRRG 113

Query: 695 TIXXXXXXXXXXXXXXXXXXXXXXXXNGLKSQRLRGMAEGGPQNGSVERKAAKRKVWISL 516
                                        +SQRL+  A+G   NG    K  K+K+WISL
Sbjct: 114 VTKVETASLKNVEMRVESSNHMQ------RSQRLKDNADG---NGVGSGKKGKKKLWISL 164

Query: 515 SREEIEEDVYALTGGKPARRPKKWPKNVQKQLDNVFPGLYLVGVAADSYRV 363
           SREEIEEDVY++TG +PARRPKK  K +QKQLDNVFPGLYLVG+ ADS+RV
Sbjct: 165 SREEIEEDVYSMTGSRPARRPKKRSKTIQKQLDNVFPGLYLVGLTADSFRV 215


>ref|XP_012475864.1| PREDICTED: digestive organ expansion factor homolog [Gossypium
           raimondii] gi|763758192|gb|KJB25523.1| hypothetical
           protein B456_004G196000 [Gossypium raimondii]
           gi|763758193|gb|KJB25524.1| hypothetical protein
           B456_004G196000 [Gossypium raimondii]
           gi|763758194|gb|KJB25525.1| hypothetical protein
           B456_004G196000 [Gossypium raimondii]
          Length = 224

 Score =  146 bits (369), Expect = 3e-32
 Identities = 92/228 (40%), Positives = 114/228 (50%), Gaps = 18/228 (7%)
 Frame = -1

Query: 992 MAAPPVKSQPLHNFSLPHLKW---------AHKNXXXXXXXXXSNHHRFRRRDSGDQ--- 849
           MA  P+KSQPLHNF+ P LKW         A            S+H R R    G +   
Sbjct: 1   MATAPIKSQPLHNFNFPFLKWGAHGGGSSSAATADHSRSPESDSDHDRLRPTRVGSRSTR 60

Query: 848 -HRRGQXXXXXXXXXXXXXPKQQQRPADDAVSPAV-----EEENGEAKPWKLRPRKETIX 687
            HR                 +QQQ+  +++  P       +EE    +PW LRPRK  + 
Sbjct: 61  IHRSSFPPPPKPIKQSHREEEQQQQREEESSKPRENEAEEDEEETVQRPWNLRPRKVVME 120

Query: 686 XXXXXXXXXXXXXXXXXXXXXXXNGLKSQRLRGMAEGGPQNGSVERKAAKRKVWISLSRE 507
                                   G KS RLRG AE    NG V  K  KRK WI+LS+E
Sbjct: 121 TSAAVVTSAAEKTSETV-------GPKSMRLRGFAE----NGGVAEKKEKRKFWIALSKE 169

Query: 506 EIEEDVYALTGGKPARRPKKWPKNVQKQLDNVFPGLYLVGVAADSYRV 363
           EIEED++ +TG +PARRPKK PKN+QKQLDNVFPGL+LVG  AD+YR+
Sbjct: 170 EIEEDIFVITGSRPARRPKKRPKNIQKQLDNVFPGLWLVGTTADAYRI 217


>ref|XP_009589991.1| PREDICTED: uncharacterized protein LOC104087283 [Nicotiana
           tomentosiformis]
          Length = 211

 Score =  145 bits (366), Expect = 6e-32
 Identities = 90/215 (41%), Positives = 111/215 (51%), Gaps = 5/215 (2%)
 Frame = -1

Query: 992 MAAPPVKSQPLHNFSLPHLKWAHKNXXXXXXXXXSNHHRFRRRDS----GDQH-RRGQXX 828
           MA  PVKSQPLH FSLP LKW  K+           +HRFRRR+S     D H    Q  
Sbjct: 1   MATAPVKSQPLHYFSLPQLKWGQKSHTN-------TNHRFRRRESPSSTADNHLNTPQTT 53

Query: 827 XXXXXXXXXXXPKQQQRPADDAVSPAVEEENGEAKPWKLRPRKETIXXXXXXXXXXXXXX 648
                      P ++QR  +  V    E +  E   W LRPRK  +              
Sbjct: 54  DLNGGSDSDKLPVEEQRQEEHVVEEEEEGQKEEKVLWNLRPRKSVMKVGLEAETAPLKKN 113

Query: 647 XXXXXXXXXXNGLKSQRLRGMAEGGPQNGSVERKAAKRKVWISLSREEIEEDVYALTGGK 468
                       ++SQR+R             +K  K+K+WISLSREEIEEDVY++TG +
Sbjct: 114 VEMEVESSNH--IRSQRVRDNNVDNGHGFGSGKKEKKKKLWISLSREEIEEDVYSMTGSR 171

Query: 467 PARRPKKWPKNVQKQLDNVFPGLYLVGVAADSYRV 363
           PARRPKK  K +QKQLDNVFPG+YLVG+ ADS+RV
Sbjct: 172 PARRPKKRSKTIQKQLDNVFPGMYLVGLTADSFRV 206


>gb|KHF98922.1| hypothetical protein F383_19218 [Gossypium arboreum]
          Length = 255

 Score =  143 bits (361), Expect = 2e-31
 Identities = 90/228 (39%), Positives = 114/228 (50%), Gaps = 18/228 (7%)
 Frame = -1

Query: 992 MAAPPVKSQPLHNFSLPHLKW---------AHKNXXXXXXXXXSNHHRFRRRDSGDQ--- 849
           MA  P+KSQPLHNF+ P LKW         A            S+H R R    G +   
Sbjct: 1   MATAPIKSQPLHNFNFPFLKWGTHGGGSSSAATADHSRSPESDSDHDRLRPTRVGSRSTR 60

Query: 848 -HRRGQXXXXXXXXXXXXXPKQQQRPADDAVSPAV-----EEENGEAKPWKLRPRKETIX 687
            HR                 +Q+++  +++  P       +EE    +PW LRPRK  + 
Sbjct: 61  IHRSSFPPPPKAIKQSHREEEQRKQREEESSKPRENEAEEDEEETVQRPWNLRPRKVVME 120

Query: 686 XXXXXXXXXXXXXXXXXXXXXXXNGLKSQRLRGMAEGGPQNGSVERKAAKRKVWISLSRE 507
                                   G KS RLRG AE    NG V  K  KRK WI+LS+E
Sbjct: 121 TSAAVVTSAAEKTSETV-------GPKSMRLRGFAE----NGGVAEKKEKRKFWIALSKE 169

Query: 506 EIEEDVYALTGGKPARRPKKWPKNVQKQLDNVFPGLYLVGVAADSYRV 363
           EIEED++ +TG +PARRPKK PKN+QKQLDNVFPGL+LVG  AD+YR+
Sbjct: 170 EIEEDIFVITGSRPARRPKKRPKNIQKQLDNVFPGLWLVGTTADAYRI 217


>ref|XP_007039766.1| Uncharacterized protein isoform 4 [Theobroma cacao]
           gi|508777011|gb|EOY24267.1| Uncharacterized protein
           isoform 4 [Theobroma cacao]
          Length = 227

 Score =  142 bits (359), Expect = 4e-31
 Identities = 91/228 (39%), Positives = 111/228 (48%), Gaps = 18/228 (7%)
 Frame = -1

Query: 992 MAAPPVKSQPLHNFSLPHLKWAHKNXXXXXXXXXSN---------HHRFRRRDSGDQHRR 840
           MA  PVKSQPLHNF+ P LKW              +         H R R    G +  R
Sbjct: 1   MATAPVKSQPLHNFNFPFLKWGTHGGGGSSTSSADHRRSPESDSDHDRLRPTRVGSRSTR 60

Query: 839 GQXXXXXXXXXXXXXP-------KQQQRPADDAVSPAVEEENGEA--KPWKLRPRKETIX 687
            Q                     +Q+++P     + A EEE  E   +PW LRPRK  + 
Sbjct: 61  IQRLSFLPPPKPIKQSHGEDEEQQQEEQPLKPHKNEAEEEEEEETVQRPWNLRPRKVVVE 120

Query: 686 XXXXXXXXXXXXXXXXXXXXXXXNGLKSQRLRGMAEGGPQNGSVERKAAKRKVWISLSRE 507
                                     KS RLRG+AE    NG +  K  KRK WI+LSRE
Sbjct: 121 TTAVVTTAMEKVSETAAP--------KSMRLRGLAE----NGGIVEKKEKRKFWIALSRE 168

Query: 506 EIEEDVYALTGGKPARRPKKWPKNVQKQLDNVFPGLYLVGVAADSYRV 363
           EIEED++ +TG +PARRPKK PKN+QKQLD VFPGL+LVG  AD+YRV
Sbjct: 169 EIEEDIFVMTGSRPARRPKKRPKNIQKQLDAVFPGLWLVGTTADAYRV 216


>ref|XP_007039763.1| Uncharacterized protein isoform 1 [Theobroma cacao]
           gi|590676536|ref|XP_007039764.1| Uncharacterized protein
           isoform 1 [Theobroma cacao]
           gi|590676539|ref|XP_007039765.1| Uncharacterized protein
           isoform 1 [Theobroma cacao]
           gi|590676547|ref|XP_007039767.1| Uncharacterized protein
           isoform 1 [Theobroma cacao] gi|508777008|gb|EOY24264.1|
           Uncharacterized protein isoform 1 [Theobroma cacao]
           gi|508777009|gb|EOY24265.1| Uncharacterized protein
           isoform 1 [Theobroma cacao] gi|508777010|gb|EOY24266.1|
           Uncharacterized protein isoform 1 [Theobroma cacao]
           gi|508777012|gb|EOY24268.1| Uncharacterized protein
           isoform 1 [Theobroma cacao]
          Length = 223

 Score =  142 bits (359), Expect = 4e-31
 Identities = 91/228 (39%), Positives = 111/228 (48%), Gaps = 18/228 (7%)
 Frame = -1

Query: 992 MAAPPVKSQPLHNFSLPHLKWAHKNXXXXXXXXXSN---------HHRFRRRDSGDQHRR 840
           MA  PVKSQPLHNF+ P LKW              +         H R R    G +  R
Sbjct: 1   MATAPVKSQPLHNFNFPFLKWGTHGGGGSSTSSADHRRSPESDSDHDRLRPTRVGSRSTR 60

Query: 839 GQXXXXXXXXXXXXXP-------KQQQRPADDAVSPAVEEENGEA--KPWKLRPRKETIX 687
            Q                     +Q+++P     + A EEE  E   +PW LRPRK  + 
Sbjct: 61  IQRLSFLPPPKPIKQSHGEDEEQQQEEQPLKPHKNEAEEEEEEETVQRPWNLRPRKVVVE 120

Query: 686 XXXXXXXXXXXXXXXXXXXXXXXNGLKSQRLRGMAEGGPQNGSVERKAAKRKVWISLSRE 507
                                     KS RLRG+AE    NG +  K  KRK WI+LSRE
Sbjct: 121 TTAVVTTAMEKVSETAAP--------KSMRLRGLAE----NGGIVEKKEKRKFWIALSRE 168

Query: 506 EIEEDVYALTGGKPARRPKKWPKNVQKQLDNVFPGLYLVGVAADSYRV 363
           EIEED++ +TG +PARRPKK PKN+QKQLD VFPGL+LVG  AD+YRV
Sbjct: 169 EIEEDIFVMTGSRPARRPKKRPKNIQKQLDAVFPGLWLVGTTADAYRV 216


>ref|XP_009792874.1| PREDICTED: uncharacterized protein LOC104239849 [Nicotiana
           sylvestris] gi|698493167|ref|XP_009792875.1| PREDICTED:
           uncharacterized protein LOC104239849 [Nicotiana
           sylvestris]
          Length = 209

 Score =  140 bits (354), Expect = 2e-30
 Identities = 86/213 (40%), Positives = 106/213 (49%), Gaps = 3/213 (1%)
 Frame = -1

Query: 992 MAAPPVKSQPLHNFSLPHLKWAHKNXXXXXXXXXSNHHRFRRRDSGDQHRRGQXXXXXXX 813
           MA  PVKSQPLH FSLP LKW  K+           +HRFRRR+S               
Sbjct: 1   MATAPVKSQPLHYFSLPQLKWGQKSHTN-------TNHRFRRRESPSSTADNHLSPLQPA 53

Query: 812 XXXXXXPKQQQRPADDAVSPAVEEENG---EAKPWKLRPRKETIXXXXXXXXXXXXXXXX 642
                    +    +      VEEE G   E   W LRPRK  +                
Sbjct: 54  DLNGGSDSDKLPVEEQRQEKHVEEEEGLKEEKVLWNLRPRKSVMKVGLEAETAPLKKNVE 113

Query: 641 XXXXXXXXNGLKSQRLRGMAEGGPQNGSVERKAAKRKVWISLSREEIEEDVYALTGGKPA 462
                     ++SQR+R             +K  K+K+WISLSREEIEEDVY++TG +PA
Sbjct: 114 MEVESSNH--IRSQRVRDNNVDNGHGFGSGKKEKKKKLWISLSREEIEEDVYSMTGSRPA 171

Query: 461 RRPKKWPKNVQKQLDNVFPGLYLVGVAADSYRV 363
           RRPK+  K +QKQLDNVFPG+YLVG+ ADS+RV
Sbjct: 172 RRPKRRSKTIQKQLDNVFPGMYLVGLTADSFRV 204


>emb|CDP10003.1| unnamed protein product [Coffea canephora]
          Length = 233

 Score =  139 bits (350), Expect = 5e-30
 Identities = 91/230 (39%), Positives = 109/230 (47%), Gaps = 20/230 (8%)
 Frame = -1

Query: 992 MAAPPVKSQPLHNFSLPHLKWAHKNXXXXXXXXXSNHHRFRRRDSGDQHRRGQXXXXXXX 813
           MA  P+KSQPLHNFSLPHL+W HKN         S      RRDS D    G        
Sbjct: 1   MATAPIKSQPLHNFSLPHLRWVHKNSPHQQSPPHSTLQH--RRDSPDFDPPGNDNNTTAA 58

Query: 812 XXXXXXPKQQQRPADDAV-------SPAVEEENGEA-----------KPWKLRPRKETIX 687
                  +  ++P   +        S +   +N +A           KPW LRPRK    
Sbjct: 59  ASPKPASRTPRKPQPFSSPCLASFPSASSTHQNQKAEQGDDVVEEGHKPWNLRPRKVVTY 118

Query: 686 XXXXXXXXXXXXXXXXXXXXXXXNGLKSQRLRGMAEGGPQNGSVERKAA--KRKVWISLS 513
                                         LR    G       +RK    KRK+WISLS
Sbjct: 119 PTSTATFTTPSSFRKNDKEKEKSQEETGSSLRNTCPGFAGTERQQRKVVEEKRKLWISLS 178

Query: 512 REEIEEDVYALTGGKPARRPKKWPKNVQKQLDNVFPGLYLVGVAADSYRV 363
           +EEIEEDVY+LTG +P+RRPKK P+ VQKQLDNVFPGLYLVG++ DSYRV
Sbjct: 179 KEEIEEDVYSLTGSRPSRRPKKRPRTVQKQLDNVFPGLYLVGLSIDSYRV 228


>ref|XP_006440252.1| hypothetical protein CICLE_v10022000mg [Citrus clementina]
           gi|557542514|gb|ESR53492.1| hypothetical protein
           CICLE_v10022000mg [Citrus clementina]
          Length = 216

 Score =  139 bits (350), Expect = 5e-30
 Identities = 94/230 (40%), Positives = 112/230 (48%), Gaps = 19/230 (8%)
 Frame = -1

Query: 992 MAAPPVKSQPLHNFSLPHLKWA--HKNXXXXXXXXXSNHHRFRR--------RDSGDQHR 843
           M   P+KSQPLHNFSL  LKW   H N          NH+R R          D   +H 
Sbjct: 1   MTTAPMKSQPLHNFSLSFLKWGTHHPNP---------NHNRTRTPPPTEPDTTDDSTRHH 51

Query: 842 R---GQXXXXXXXXXXXXXPKQQQ----RPADDAVSPAVEEENGEAKPWKLRPRK--ETI 690
           R    +              K QQ    RP         EEE+   +PW LRPRK  ET+
Sbjct: 52  RVVGSRSSRAQRLSFPSSTSKPQQDAVERPQRQTADTEEEEEDEVGRPWNLRPRKVQETL 111

Query: 689 XXXXXXXXXXXXXXXXXXXXXXXXNGLKSQRLRGMAEGGPQNGSVERKAAKRKVWISLSR 510
                                      KS RLR M E    NG    K  K K W++LSR
Sbjct: 112 VDVAVFQNRGDNNANTKAP--------KSTRLREMVESRGSNGD---KKEKNKFWVTLSR 160

Query: 509 EEIEEDVYALTGGKPARRPKKWPKNVQKQLDNVFPGLYLVGVAADSYRVT 360
           EEIEED++ +TG +PARRP+K PKNVQKQLDNVFPGL+LVG+ AD+YRV+
Sbjct: 161 EEIEEDIFIMTGSRPARRPRKRPKNVQKQLDNVFPGLWLVGLTADAYRVS 210


>gb|KDO61351.1| hypothetical protein CISIN_1g027940mg [Citrus sinensis]
           gi|641842447|gb|KDO61352.1| hypothetical protein
           CISIN_1g027940mg [Citrus sinensis]
          Length = 216

 Score =  137 bits (346), Expect = 1e-29
 Identities = 93/230 (40%), Positives = 111/230 (48%), Gaps = 19/230 (8%)
 Frame = -1

Query: 992 MAAPPVKSQPLHNFSLPHLKWA--HKNXXXXXXXXXSNHHRFRR--------RDSGDQHR 843
           M   P+KSQPLHNFSL  LKW   H N          NH+R R          D   +H 
Sbjct: 1   MTTAPMKSQPLHNFSLSFLKWGTHHPNP---------NHNRTRTPPPTEPDTTDDSTRHH 51

Query: 842 R---GQXXXXXXXXXXXXXPKQQQ----RPADDAVSPAVEEENGEAKPWKLRPRK--ETI 690
           R    +              K QQ    RP         EEE+   +PW LRPRK  ET+
Sbjct: 52  RVVGSRSSRAQRLSFPSSTSKPQQDAVERPQRQTADTEEEEEDEVGRPWNLRPRKVQETL 111

Query: 689 XXXXXXXXXXXXXXXXXXXXXXXXNGLKSQRLRGMAEGGPQNGSVERKAAKRKVWISLSR 510
                                      KS RLR M E    NG    K  K K W++LSR
Sbjct: 112 VDVAVFQNRGDNNANTKAP--------KSTRLREMVESRGSNGD---KKEKNKFWVTLSR 160

Query: 509 EEIEEDVYALTGGKPARRPKKWPKNVQKQLDNVFPGLYLVGVAADSYRVT 360
           EEIEED++ +TG +PARRP+K PKNVQKQLDNVFPGL+LVG+  D+YRV+
Sbjct: 161 EEIEEDIFIMTGSRPARRPRKRPKNVQKQLDNVFPGLWLVGLTVDAYRVS 210


>ref|XP_007155691.1| hypothetical protein PHAVU_003G223000g, partial [Phaseolus
           vulgaris] gi|593785303|ref|XP_007155692.1| hypothetical
           protein PHAVU_003G223000g, partial [Phaseolus vulgaris]
           gi|561029045|gb|ESW27685.1| hypothetical protein
           PHAVU_003G223000g, partial [Phaseolus vulgaris]
           gi|561029046|gb|ESW27686.1| hypothetical protein
           PHAVU_003G223000g, partial [Phaseolus vulgaris]
          Length = 306

 Score =  137 bits (344), Expect = 2e-29
 Identities = 90/246 (36%), Positives = 112/246 (45%), Gaps = 37/246 (15%)
 Frame = -1

Query: 989 AAPPVKSQPLHNFSLPHLKWAHKNXXXXXXXXXSNHHRFRRRDS--------------GD 852
           A PPVKSQPLHNF+LP LKW             ++HHR RR  S                
Sbjct: 62  AQPPVKSQPLHNFALPFLKWGASG---KNHTNAAHHHRCRRPSSLSSDHASEPDSDPDSR 118

Query: 851 QHRRGQXXXXXXXXXXXXXPKQQQRPADDAVSPAVEEENGE--------------AKPWK 714
            HR G               K    P +    P+  +E  +               KPW 
Sbjct: 119 PHRVGSRTTRNRFALPTCSLKPLPPPPEPPQPPSCNDETDDEAAKRDIEDAEEAVQKPWN 178

Query: 713 LRPRKETIXXXXXXXXXXXXXXXXXXXXXXXXNGL---------KSQRLRGMAEGGPQNG 561
           LRPRK  +                        +G+         KS RLRG A+      
Sbjct: 179 LRPRKPALPKSALEIGTGPSRNHANNGVGEFHDGVSHHGENPAPKSLRLRGFAD-----T 233

Query: 560 SVERKAAKRKVWISLSREEIEEDVYALTGGKPARRPKKWPKNVQKQLDNVFPGLYLVGVA 381
               K  KRK WI+LSREEIEED++ +TG +PARRP+K PKNVQKQ+D+VFPGL+LVG+ 
Sbjct: 234 QCAEKKEKRKFWIALSREEIEEDIFVMTGSRPARRPRKRPKNVQKQMDSVFPGLWLVGIT 293

Query: 380 ADSYRV 363
           AD+YRV
Sbjct: 294 ADAYRV 299


>ref|XP_014506116.1| PREDICTED: uncharacterized protein LOC106765862 [Vigna radiata var.
           radiata]
          Length = 250

 Score =  135 bits (339), Expect = 9e-29
 Identities = 89/246 (36%), Positives = 110/246 (44%), Gaps = 37/246 (15%)
 Frame = -1

Query: 989 AAPPVKSQPLHNFSLPHLKWAHKNXXXXXXXXXSNHHRFRRRDSGDQ------------- 849
           A PPVKSQPLHNF+LP LKW              +HHR RR  S                
Sbjct: 6   AQPPVKSQPLHNFALPFLKWGASGKNHTNAA---HHHRCRRPSSHPSDHASEPDSDPDSR 62

Query: 848 -HRRGQXXXXXXXXXXXXXPKQQQRPADDAVSPAVEEENGEA--------------KPWK 714
            HR G               K    P     +P+  +E  +               KPW 
Sbjct: 63  PHRLGSRTARNRFALPTCSLKPLAPPPQPLQAPSCNDETDDEAAKRDIEDAEEAVQKPWN 122

Query: 713 LRPRKETIXXXXXXXXXXXXXXXXXXXXXXXXNGL---------KSQRLRGMAEGGPQNG 561
           LRPRK  +                        + +         KS RLRG A+      
Sbjct: 123 LRPRKPALPKSALEIGTGPSRNHGNNGAGEFHDAVSHHSENPAPKSLRLRGFADT----- 177

Query: 560 SVERKAAKRKVWISLSREEIEEDVYALTGGKPARRPKKWPKNVQKQLDNVFPGLYLVGVA 381
               K  KRK WI+LSREEIEED++ +TG +PARRP+K PKNVQKQ+D+VFPGL+LVG+ 
Sbjct: 178 QCAEKKEKRKFWIALSREEIEEDIFVMTGSRPARRPRKRPKNVQKQMDSVFPGLWLVGIT 237

Query: 380 ADSYRV 363
           AD+YRV
Sbjct: 238 ADAYRV 243


>gb|KRH57463.1| hypothetical protein GLYMA_05G062500 [Glycine max]
          Length = 277

 Score =  135 bits (339), Expect = 9e-29
 Identities = 93/236 (39%), Positives = 111/236 (47%), Gaps = 27/236 (11%)
 Frame = -1

Query: 989 AAPPVKSQPLHNFSLPHLKWAHKNXXXXXXXXXSNHHRFRRR-------DSGDQ----HR 843
           A  PVKSQPLHNF+LP LKW              +HHRFRR        DS D     HR
Sbjct: 48  APAPVKSQPLHNFALPFLKWGASGKNNTTNAA--HHHRFRRPSDHASEPDSSDPDSRPHR 105

Query: 842 RGQXXXXXXXXXXXXXPKQQ---QRPADDAVSPAVEEENGEAKPWKLRPRKETIXXXXXX 672
            G              P      Q P DD    +V+      KPWKLRPRK  +      
Sbjct: 106 LGSRTARNRFSLPLKPPPPPPPPQPPHDDDADDSVQ------KPWKLRPRKPALLPNKTA 159

Query: 671 XXXXXXXXXXXXXXXXXXNGL-------------KSQRLRGMAEGGPQNGSVERKAAKRK 531
                                             KS RLRG ++          K  KRK
Sbjct: 160 LEIGTGPSRNHHHHHHHATNNGEFLDGGDNNPAPKSLRLRGFSDT-----QCSEKKEKRK 214

Query: 530 VWISLSREEIEEDVYALTGGKPARRPKKWPKNVQKQLDNVFPGLYLVGVAADSYRV 363
            WI+LSREEIEED++ +TG +PARRP+K PKNVQKQ+D+VFPGL+LVG+ AD+YRV
Sbjct: 215 FWIALSREEIEEDIFVMTGSRPARRPRKRPKNVQKQMDSVFPGLWLVGITADAYRV 270


>ref|XP_008355575.1| PREDICTED: uncharacterized protein LOC103419236 isoform X2 [Malus
           domestica]
          Length = 228

 Score =  135 bits (339), Expect = 9e-29
 Identities = 91/234 (38%), Positives = 112/234 (47%), Gaps = 25/234 (10%)
 Frame = -1

Query: 992 MAAPPVKSQPLHNFSLPHLKWAHKNXXXXXXXXXSNHHRFRRRDSG----------DQHR 843
           MA  PVK  PLHNF L  LKW  KN          NH+R+RR  S           D+H 
Sbjct: 1   MATAPVKP-PLHNFPLTFLKWGTKNATAN------NHNRYRRPASAEPASEPDSESDRHN 53

Query: 842 RGQXXXXXXXXXXXXXPKQQQRPADDAVSPAVEEENGEA---KPWKLRPRKETIXXXXXX 672
           R                 + +R   +      EEE  E    KPW LRPR+         
Sbjct: 54  RVGSSRADRRRLSLISCSENKRRRSEERESDQEEEEAEVLLQKPWNLRPRRPPATASFQK 113

Query: 671 XXXXXXXXXXXXXXXXXXNGLKSQ------------RLRGMAEGGPQNGSVERKAAKRKV 528
                                +SQ            RLRG+AEG     SVE+K  K K 
Sbjct: 114 ASGPNAAVGANREGQEPEGPNRSQSEMMQQQQPKSMRLRGLAEGQ----SVEKKKEKSKF 169

Query: 527 WISLSREEIEEDVYALTGGKPARRPKKWPKNVQKQLDNVFPGLYLVGVAADSYR 366
           WI+LS+EEIEEDV+ +TG +P+RRPKK PKNVQKQLD++FPGL+LVGV AD+Y+
Sbjct: 170 WIALSKEEIEEDVFVMTGSRPSRRPKKRPKNVQKQLDSIFPGLWLVGVTADAYK 223


>ref|XP_008355574.1| PREDICTED: uncharacterized protein LOC103419236 isoform X1 [Malus
           domestica]
          Length = 231

 Score =  135 bits (339), Expect = 9e-29
 Identities = 91/234 (38%), Positives = 112/234 (47%), Gaps = 25/234 (10%)
 Frame = -1

Query: 992 MAAPPVKSQPLHNFSLPHLKWAHKNXXXXXXXXXSNHHRFRRRDSG----------DQHR 843
           MA  PVK  PLHNF L  LKW  KN          NH+R+RR  S           D+H 
Sbjct: 1   MATAPVKP-PLHNFPLTFLKWGTKNATAN------NHNRYRRPASAEPASEPDSESDRHN 53

Query: 842 RGQXXXXXXXXXXXXXPKQQQRPADDAVSPAVEEENGEA---KPWKLRPRKETIXXXXXX 672
           R                 + +R   +      EEE  E    KPW LRPR+         
Sbjct: 54  RVGSSRADRRRLSLISCSENKRRRSEERESDQEEEEAEVLLQKPWNLRPRRPPATASFQK 113

Query: 671 XXXXXXXXXXXXXXXXXXNGLKSQ------------RLRGMAEGGPQNGSVERKAAKRKV 528
                                +SQ            RLRG+AEG     SVE+K  K K 
Sbjct: 114 ASGPNAAVGANREGQEPEGPNRSQSEMMQQQQPKSMRLRGLAEGQ----SVEKKKEKSKF 169

Query: 527 WISLSREEIEEDVYALTGGKPARRPKKWPKNVQKQLDNVFPGLYLVGVAADSYR 366
           WI+LS+EEIEEDV+ +TG +P+RRPKK PKNVQKQLD++FPGL+LVGV AD+Y+
Sbjct: 170 WIALSKEEIEEDVFVMTGSRPSRRPKKRPKNVQKQLDSIFPGLWLVGVTADAYK 223


>ref|XP_003525577.1| PREDICTED: histone-lysine N-methyltransferase 2E-like [Glycine max]
           gi|947109138|gb|KRH57464.1| hypothetical protein
           GLYMA_05G062500 [Glycine max]
          Length = 241

 Score =  135 bits (339), Expect = 9e-29
 Identities = 93/236 (39%), Positives = 111/236 (47%), Gaps = 27/236 (11%)
 Frame = -1

Query: 989 AAPPVKSQPLHNFSLPHLKWAHKNXXXXXXXXXSNHHRFRRR-------DSGDQ----HR 843
           A  PVKSQPLHNF+LP LKW              +HHRFRR        DS D     HR
Sbjct: 12  APAPVKSQPLHNFALPFLKWGASGKNNTTNAA--HHHRFRRPSDHASEPDSSDPDSRPHR 69

Query: 842 RGQXXXXXXXXXXXXXPKQQ---QRPADDAVSPAVEEENGEAKPWKLRPRKETIXXXXXX 672
            G              P      Q P DD    +V+      KPWKLRPRK  +      
Sbjct: 70  LGSRTARNRFSLPLKPPPPPPPPQPPHDDDADDSVQ------KPWKLRPRKPALLPNKTA 123

Query: 671 XXXXXXXXXXXXXXXXXXNGL-------------KSQRLRGMAEGGPQNGSVERKAAKRK 531
                                             KS RLRG ++          K  KRK
Sbjct: 124 LEIGTGPSRNHHHHHHHATNNGEFLDGGDNNPAPKSLRLRGFSDT-----QCSEKKEKRK 178

Query: 530 VWISLSREEIEEDVYALTGGKPARRPKKWPKNVQKQLDNVFPGLYLVGVAADSYRV 363
            WI+LSREEIEED++ +TG +PARRP+K PKNVQKQ+D+VFPGL+LVG+ AD+YRV
Sbjct: 179 FWIALSREEIEEDIFVMTGSRPARRPRKRPKNVQKQMDSVFPGLWLVGITADAYRV 234

