BLASTX nr result

ID: Papaver25_contig00029124

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Papaver25_contig00029124
         (1023 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EXB58479.1| hypothetical protein L484_005213 [Morus notabilis]     182   2e-43
ref|XP_004250018.1| PREDICTED: uncharacterized protein LOC101258...   173   9e-41
gb|EYU38051.1| hypothetical protein MIMGU_mgv1a000603mg [Mimulus...   172   2e-40
ref|XP_006360510.1| PREDICTED: uncharacterized protein LOC102588...   169   1e-39
ref|XP_007010094.1| UDP-Glycosyltransferase superfamily protein ...   167   6e-39
ref|XP_007010093.1| UDP-Glycosyltransferase superfamily protein ...   167   6e-39
ref|XP_007010092.1| UDP-Glycosyltransferase superfamily protein ...   167   6e-39
ref|XP_007010091.1| UDP-Glycosyltransferase superfamily protein ...   167   6e-39
ref|XP_007010090.1| UDP-Glycosyltransferase superfamily protein ...   167   6e-39
ref|XP_004496154.1| PREDICTED: uncharacterized protein LOC101505...   160   1e-36
ref|XP_006589360.1| PREDICTED: uncharacterized protein LOC100779...   155   3e-35
ref|XP_007144256.1| hypothetical protein PHAVU_007G141200g [Phas...   155   3e-35
ref|XP_003535489.1| PREDICTED: uncharacterized protein LOC100779...   155   3e-35
ref|XP_006379502.1| hypothetical protein POPTR_0008s02940g [Popu...   154   4e-35
ref|XP_006485287.1| PREDICTED: uncharacterized protein LOC102618...   151   5e-34
ref|XP_006436561.1| hypothetical protein CICLE_v10030581mg [Citr...   151   5e-34
ref|XP_006378794.1| hypothetical protein POPTR_0010s23830g [Popu...   150   1e-33
ref|XP_002873152.1| hypothetical protein ARALYDRAFT_487229 [Arab...   146   2e-32
ref|XP_006606300.1| PREDICTED: uncharacterized protein LOC100790...   143   1e-31
ref|XP_006606298.1| PREDICTED: uncharacterized protein LOC100790...   143   1e-31

>gb|EXB58479.1| hypothetical protein L484_005213 [Morus notabilis]
          Length = 1043

 Score =  182 bits (462), Expect = 2e-43
 Identities = 116/251 (46%), Positives = 159/251 (63%), Gaps = 8/251 (3%)
 Frame = -3

Query: 730 MGRNSTSPPDPENNSGDQTGV-----VYSIRDRFRLKRNPINHSDNINNVEXXXXXXDRK 566
           MGRNS+  PD   ++    G       +SIRDR R KRNP N S + +  +         
Sbjct: 1   MGRNSSPSPDNTFDANGNAGGGNDLGFHSIRDRLRFKRNP-NPSHDRDRTKVFA------ 53

Query: 565 MDRQ-WRNRSHHN-RIVRKGFS-FKITYFLYGAAILAFLVFVVGSISLQTSISSVFTSGS 395
            DR   R RSH+N R  RKGF  FK    LY   I A  +F + S+ LQ+SI SVF  GS
Sbjct: 54  -DRAPVRGRSHYNSRFNRKGFLWFKGKSTLYLVIIFAVFLFGMASMVLQSSIMSVFKQGS 112

Query: 394 DRISIFTRKNGLKYGDNLKFLPTTKLLQRFDTQASLLDRLRTESRVAIRPPRLALILGNM 215
           +R  +   + GLK+G  L+F+P    + R    A+ LDRLR E R+A+R PRLAL+LGNM
Sbjct: 113 ERGRLL--REGLKFGTTLRFVPGR--ISRRLADANGLDRLRNEPRIAVRKPRLALVLGNM 168

Query: 214 DKDPVTLMLYTVVKSLKELGYSFKMFAVDDGDARPLWESIGGRVSILGANSSLYIDWSIY 35
            K+  +LML T+VK++++LGY+ K+FAV++G+AR +WE +GG++SILG  S  ++DWSI+
Sbjct: 169 KKNSESLMLITIVKNIQKLGYALKIFAVENGNARTMWEQLGGQISILGFESYGHMDWSIF 228

Query: 34  EGVIVNSLEAK 2
           EGVIV+SL AK
Sbjct: 229 EGVIVDSLGAK 239


>ref|XP_004250018.1| PREDICTED: uncharacterized protein LOC101258810 [Solanum
           lycopersicum]
          Length = 1050

 Score =  173 bits (439), Expect = 9e-41
 Identities = 105/253 (41%), Positives = 149/253 (58%), Gaps = 10/253 (3%)
 Frame = -3

Query: 730 MGRNS--TSPPDPENNSGDQTGVVYSIRDRFRLKRNPINHSDNINNVEXXXXXXDRKMDR 557
           MGR+S   +    +NN+    G  + IRDRFR KRN    ++ +              DR
Sbjct: 3   MGRSSGDDNKDKNDNNAISSGGGFHLIRDRFRFKRNSQRPTEAVT-----LPSSSSPSDR 57

Query: 556 QWRN--RSHHNRIVRKGFSFKITYF------LYGAAILAFLVFVVGSISLQTSISSVFTS 401
           QW+   RSHH+    + FS K+ +F      LY    L   VF + S+ LQ+SI SVF  
Sbjct: 58  QWKTPARSHHHHHHNRSFSRKLIFFCFRGKWLYLCIFLVIFVFALASMVLQSSIMSVFRQ 117

Query: 400 GSDRISIFTRKNGLKYGDNLKFLPTTKLLQRFDTQASLLDRLRTESRVAIRPPRLALILG 221
                S ++ ++ LK G +L+F+P  +         + LD +R + R+ +RPPR+AL+LG
Sbjct: 118 NERARSRWSVRDDLKLGSSLEFVPPPRF-----QLGNGLDLVRNQPRIGVRPPRIALVLG 172

Query: 220 NMDKDPVTLMLYTVVKSLKELGYSFKMFAVDDGDARPLWESIGGRVSILGANSSLYIDWS 41
           NM KDP++LML TVVK+L+ LGY  K++AV+DG AR +WE IGG+VSIL A+    IDWS
Sbjct: 173 NMRKDPLSLMLSTVVKNLRGLGYMIKIYAVEDGIARSVWEEIGGKVSILTADRYDLIDWS 232

Query: 40  IYEGVIVNSLEAK 2
           I++GVI +SLE K
Sbjct: 233 IFDGVIADSLEDK 245


>gb|EYU38051.1| hypothetical protein MIMGU_mgv1a000603mg [Mimulus guttatus]
          Length = 1048

 Score =  172 bits (437), Expect = 2e-40
 Identities = 108/249 (43%), Positives = 151/249 (60%), Gaps = 6/249 (2%)
 Frame = -3

Query: 730 MGRNSTSPPDPENNSGDQT-GVVYSIRDRFRLKRNPINHSDNINNVEXXXXXXDRK-MDR 557
           MGR+S S    E+ S D T G   SIRDRF  KRN  N S N ++         +  +  
Sbjct: 1   MGRHSVSAA--ESASDDATAGPFRSIRDRFPFKRN--NSSSNYSSTNTLTRSSSKTTLSS 56

Query: 556 QWRNRSHHNRIVRKGFS-FKITYFLYGAAILAFLVFVVGSISLQTSISSVFTSG--SDRI 386
              +RSHH+   +   S F+     Y         F + S+ LQ+SI+SV   G   DR+
Sbjct: 57  HKASRSHHHHKRKLSLSPFRGKSCFYLCIFTVIFTFALASMVLQSSITSVLRQGVGGDRM 116

Query: 385 SI-FTRKNGLKYGDNLKFLPTTKLLQRFDTQASLLDRLRTESRVAIRPPRLALILGNMDK 209
              ++ K+GLK G +L+F+P     +RF+   S +D LR++ R+ IRPPR+ LILGNM+K
Sbjct: 117 RWRWSVKDGLKEGSSLEFVPR----RRFELNGSRVDWLRSQPRIGIRPPRIGLILGNMEK 172

Query: 208 DPVTLMLYTVVKSLKELGYSFKMFAVDDGDARPLWESIGGRVSILGANSSLYIDWSIYEG 29
           DP  L+LY+V+K+LK LGY  K++A+ DG ARP+W+ IGG+VSIL      YIDWSI+EG
Sbjct: 173 DPSALLLYSVMKNLKGLGYLLKLYALGDGRARPIWQEIGGQVSILSPERYGYIDWSIFEG 232

Query: 28  VIVNSLEAK 2
           ++V+SLEAK
Sbjct: 233 IVVDSLEAK 241


>ref|XP_006360510.1| PREDICTED: uncharacterized protein LOC102588632 [Solanum tuberosum]
          Length = 1048

 Score =  169 bits (429), Expect = 1e-39
 Identities = 104/253 (41%), Positives = 149/253 (58%), Gaps = 10/253 (3%)
 Frame = -3

Query: 730 MGRNS--TSPPDPENNSGDQTGVVYSIRDRFRLKRNPINHSDNINNVEXXXXXXDRKMDR 557
           MGR+S   +    +NN+    G  +SIRDRFR KRN    ++ +              DR
Sbjct: 3   MGRSSGDDNKDKNDNNAISSGGGFHSIRDRFRFKRNSQRPTETVT-----LPSSSSSPDR 57

Query: 556 QWRN--RSHHNRIVRKGFSFKITYF------LYGAAILAFLVFVVGSISLQTSISSVFTS 401
           QW+   RSHH+    + FS K+ +F      LY    +   VF + S+ LQ+SI SVF  
Sbjct: 58  QWKTLARSHHHHHHNRSFSRKLIFFCFRGKWLYLCIFMVIFVFALASMVLQSSIMSVFRQ 117

Query: 400 GSDRISIFTRKNGLKYGDNLKFLPTTKLLQRFDTQASLLDRLRTESRVAIRPPRLALILG 221
                  ++ ++ LK G +L+F+      +RF    + LD +R + R+ +RPPR+AL+LG
Sbjct: 118 NERARWRWSVRDDLKLGSSLEFVQP----RRFQL-GNGLDLVRNQPRIGVRPPRIALVLG 172

Query: 220 NMDKDPVTLMLYTVVKSLKELGYSFKMFAVDDGDARPLWESIGGRVSILGANSSLYIDWS 41
           NM KDP++LML TVVK+L+ LGY  K++ V+DG AR +WE IGG+VSIL A+    IDWS
Sbjct: 173 NMRKDPLSLMLSTVVKNLRGLGYMIKIYTVEDGIARSIWEEIGGKVSILTADRYDLIDWS 232

Query: 40  IYEGVIVNSLEAK 2
           I++GVI +SLE K
Sbjct: 233 IFDGVIADSLEDK 245


>ref|XP_007010094.1| UDP-Glycosyltransferase superfamily protein isoform 5 [Theobroma
           cacao] gi|508727007|gb|EOY18904.1|
           UDP-Glycosyltransferase superfamily protein isoform 5
           [Theobroma cacao]
          Length = 782

 Score =  167 bits (423), Expect = 6e-39
 Identities = 104/257 (40%), Positives = 145/257 (56%), Gaps = 14/257 (5%)
 Frame = -3

Query: 730 MGRNSTSP------------PDPENNSGDQTGVVYSIRDRFRLKRNPINHSDNINNVEXX 587
           MGRNS+ P             + +NN+ D  G  YSIRDR   KRNPI+  D        
Sbjct: 1   MGRNSSPPILDGNGNENGKNKNSDNNNDDDQGF-YSIRDRLPFKRNPIHTRDRTKQSSL- 58

Query: 586 XXXXDRKMDRQW-RNRSHHNRIVRKGFSFKITYFLYGAAILAFLVFVVGSISLQTSISSV 410
                  +DR   RNR   NR     F  +  +  Y     +   F + S+ +Q+SI++V
Sbjct: 59  -------LDRPLVRNRPRFNRKGFLLFPLRGIHLFYFLIFFSVFAFAMASMLMQSSIAAV 111

Query: 409 -FTSGSDRISIFTRKNGLKYGDNLKFLPTTKLLQRFDTQASLLDRLRTESRVAIRPPRLA 233
            F  G +R    + + GL+ G  LKF+P    + R+  +   LDR+R+ +R+ +R PRLA
Sbjct: 112 VFRQGGERGWRKSVREGLRLGSTLKFMPAG--MSRWVAEGGGLDRMRSTARIGVRGPRLA 169

Query: 232 LILGNMDKDPVTLMLYTVVKSLKELGYSFKMFAVDDGDARPLWESIGGRVSILGANSSLY 53
           LILGNM KDP +LM+ TVVKSL+ LGY  K++AV +G A  +WE I G++S LG    ++
Sbjct: 170 LILGNMKKDPQSLMMLTVVKSLQRLGYVIKIYAVANGKAHAMWEHISGQISFLGPEQFVH 229

Query: 52  IDWSIYEGVIVNSLEAK 2
           IDWSI+EGVI +SLEAK
Sbjct: 230 IDWSIFEGVIADSLEAK 246


>ref|XP_007010093.1| UDP-Glycosyltransferase superfamily protein isoform 4 [Theobroma
           cacao] gi|508727006|gb|EOY18903.1|
           UDP-Glycosyltransferase superfamily protein isoform 4
           [Theobroma cacao]
          Length = 969

 Score =  167 bits (423), Expect = 6e-39
 Identities = 104/257 (40%), Positives = 145/257 (56%), Gaps = 14/257 (5%)
 Frame = -3

Query: 730 MGRNSTSP------------PDPENNSGDQTGVVYSIRDRFRLKRNPINHSDNINNVEXX 587
           MGRNS+ P             + +NN+ D  G  YSIRDR   KRNPI+  D        
Sbjct: 1   MGRNSSPPILDGNGNENGKNKNSDNNNDDDQGF-YSIRDRLPFKRNPIHTRDRTKQSSL- 58

Query: 586 XXXXDRKMDRQW-RNRSHHNRIVRKGFSFKITYFLYGAAILAFLVFVVGSISLQTSISSV 410
                  +DR   RNR   NR     F  +  +  Y     +   F + S+ +Q+SI++V
Sbjct: 59  -------LDRPLVRNRPRFNRKGFLLFPLRGIHLFYFLIFFSVFAFAMASMLMQSSIAAV 111

Query: 409 -FTSGSDRISIFTRKNGLKYGDNLKFLPTTKLLQRFDTQASLLDRLRTESRVAIRPPRLA 233
            F  G +R    + + GL+ G  LKF+P    + R+  +   LDR+R+ +R+ +R PRLA
Sbjct: 112 VFRQGGERGWRKSVREGLRLGSTLKFMPAG--MSRWVAEGGGLDRMRSTARIGVRGPRLA 169

Query: 232 LILGNMDKDPVTLMLYTVVKSLKELGYSFKMFAVDDGDARPLWESIGGRVSILGANSSLY 53
           LILGNM KDP +LM+ TVVKSL+ LGY  K++AV +G A  +WE I G++S LG    ++
Sbjct: 170 LILGNMKKDPQSLMMLTVVKSLQRLGYVIKIYAVANGKAHAMWEHISGQISFLGPEQFVH 229

Query: 52  IDWSIYEGVIVNSLEAK 2
           IDWSI+EGVI +SLEAK
Sbjct: 230 IDWSIFEGVIADSLEAK 246


>ref|XP_007010092.1| UDP-Glycosyltransferase superfamily protein isoform 3 [Theobroma
           cacao] gi|508727005|gb|EOY18902.1|
           UDP-Glycosyltransferase superfamily protein isoform 3
           [Theobroma cacao]
          Length = 1034

 Score =  167 bits (423), Expect = 6e-39
 Identities = 104/257 (40%), Positives = 145/257 (56%), Gaps = 14/257 (5%)
 Frame = -3

Query: 730 MGRNSTSP------------PDPENNSGDQTGVVYSIRDRFRLKRNPINHSDNINNVEXX 587
           MGRNS+ P             + +NN+ D  G  YSIRDR   KRNPI+  D        
Sbjct: 1   MGRNSSPPILDGNGNENGKNKNSDNNNDDDQGF-YSIRDRLPFKRNPIHTRDRTKQSSL- 58

Query: 586 XXXXDRKMDRQW-RNRSHHNRIVRKGFSFKITYFLYGAAILAFLVFVVGSISLQTSISSV 410
                  +DR   RNR   NR     F  +  +  Y     +   F + S+ +Q+SI++V
Sbjct: 59  -------LDRPLVRNRPRFNRKGFLLFPLRGIHLFYFLIFFSVFAFAMASMLMQSSIAAV 111

Query: 409 -FTSGSDRISIFTRKNGLKYGDNLKFLPTTKLLQRFDTQASLLDRLRTESRVAIRPPRLA 233
            F  G +R    + + GL+ G  LKF+P    + R+  +   LDR+R+ +R+ +R PRLA
Sbjct: 112 VFRQGGERGWRKSVREGLRLGSTLKFMPAG--MSRWVAEGGGLDRMRSTARIGVRGPRLA 169

Query: 232 LILGNMDKDPVTLMLYTVVKSLKELGYSFKMFAVDDGDARPLWESIGGRVSILGANSSLY 53
           LILGNM KDP +LM+ TVVKSL+ LGY  K++AV +G A  +WE I G++S LG    ++
Sbjct: 170 LILGNMKKDPQSLMMLTVVKSLQRLGYVIKIYAVANGKAHAMWEHISGQISFLGPEQFVH 229

Query: 52  IDWSIYEGVIVNSLEAK 2
           IDWSI+EGVI +SLEAK
Sbjct: 230 IDWSIFEGVIADSLEAK 246


>ref|XP_007010091.1| UDP-Glycosyltransferase superfamily protein isoform 2 [Theobroma
           cacao] gi|508727004|gb|EOY18901.1|
           UDP-Glycosyltransferase superfamily protein isoform 2
           [Theobroma cacao]
          Length = 735

 Score =  167 bits (423), Expect = 6e-39
 Identities = 104/257 (40%), Positives = 145/257 (56%), Gaps = 14/257 (5%)
 Frame = -3

Query: 730 MGRNSTSP------------PDPENNSGDQTGVVYSIRDRFRLKRNPINHSDNINNVEXX 587
           MGRNS+ P             + +NN+ D  G  YSIRDR   KRNPI+  D        
Sbjct: 1   MGRNSSPPILDGNGNENGKNKNSDNNNDDDQGF-YSIRDRLPFKRNPIHTRDRTKQSSL- 58

Query: 586 XXXXDRKMDRQW-RNRSHHNRIVRKGFSFKITYFLYGAAILAFLVFVVGSISLQTSISSV 410
                  +DR   RNR   NR     F  +  +  Y     +   F + S+ +Q+SI++V
Sbjct: 59  -------LDRPLVRNRPRFNRKGFLLFPLRGIHLFYFLIFFSVFAFAMASMLMQSSIAAV 111

Query: 409 -FTSGSDRISIFTRKNGLKYGDNLKFLPTTKLLQRFDTQASLLDRLRTESRVAIRPPRLA 233
            F  G +R    + + GL+ G  LKF+P    + R+  +   LDR+R+ +R+ +R PRLA
Sbjct: 112 VFRQGGERGWRKSVREGLRLGSTLKFMPAG--MSRWVAEGGGLDRMRSTARIGVRGPRLA 169

Query: 232 LILGNMDKDPVTLMLYTVVKSLKELGYSFKMFAVDDGDARPLWESIGGRVSILGANSSLY 53
           LILGNM KDP +LM+ TVVKSL+ LGY  K++AV +G A  +WE I G++S LG    ++
Sbjct: 170 LILGNMKKDPQSLMMLTVVKSLQRLGYVIKIYAVANGKAHAMWEHISGQISFLGPEQFVH 229

Query: 52  IDWSIYEGVIVNSLEAK 2
           IDWSI+EGVI +SLEAK
Sbjct: 230 IDWSIFEGVIADSLEAK 246


>ref|XP_007010090.1| UDP-Glycosyltransferase superfamily protein isoform 1 [Theobroma
           cacao] gi|508727003|gb|EOY18900.1|
           UDP-Glycosyltransferase superfamily protein isoform 1
           [Theobroma cacao]
          Length = 1041

 Score =  167 bits (423), Expect = 6e-39
 Identities = 104/257 (40%), Positives = 145/257 (56%), Gaps = 14/257 (5%)
 Frame = -3

Query: 730 MGRNSTSP------------PDPENNSGDQTGVVYSIRDRFRLKRNPINHSDNINNVEXX 587
           MGRNS+ P             + +NN+ D  G  YSIRDR   KRNPI+  D        
Sbjct: 1   MGRNSSPPILDGNGNENGKNKNSDNNNDDDQGF-YSIRDRLPFKRNPIHTRDRTKQSSL- 58

Query: 586 XXXXDRKMDRQW-RNRSHHNRIVRKGFSFKITYFLYGAAILAFLVFVVGSISLQTSISSV 410
                  +DR   RNR   NR     F  +  +  Y     +   F + S+ +Q+SI++V
Sbjct: 59  -------LDRPLVRNRPRFNRKGFLLFPLRGIHLFYFLIFFSVFAFAMASMLMQSSIAAV 111

Query: 409 -FTSGSDRISIFTRKNGLKYGDNLKFLPTTKLLQRFDTQASLLDRLRTESRVAIRPPRLA 233
            F  G +R    + + GL+ G  LKF+P    + R+  +   LDR+R+ +R+ +R PRLA
Sbjct: 112 VFRQGGERGWRKSVREGLRLGSTLKFMPAG--MSRWVAEGGGLDRMRSTARIGVRGPRLA 169

Query: 232 LILGNMDKDPVTLMLYTVVKSLKELGYSFKMFAVDDGDARPLWESIGGRVSILGANSSLY 53
           LILGNM KDP +LM+ TVVKSL+ LGY  K++AV +G A  +WE I G++S LG    ++
Sbjct: 170 LILGNMKKDPQSLMMLTVVKSLQRLGYVIKIYAVANGKAHAMWEHISGQISFLGPEQFVH 229

Query: 52  IDWSIYEGVIVNSLEAK 2
           IDWSI+EGVI +SLEAK
Sbjct: 230 IDWSIFEGVIADSLEAK 246


>ref|XP_004496154.1| PREDICTED: uncharacterized protein LOC101505326 [Cicer arietinum]
          Length = 1042

 Score =  160 bits (404), Expect = 1e-36
 Identities = 101/250 (40%), Positives = 145/250 (58%), Gaps = 7/250 (2%)
 Frame = -3

Query: 730 MGRNSTSPPDPENNSGDQTGVVYSIRDRFRLKRNPINHSDNINNVEXXXXXXDRKMDRQW 551
           + RNS+S P+ ++  G       SIR RF  KRNP     N+N  +      DR++ R  
Sbjct: 3   LSRNSSSQPEIDDAGGGSDVGFSSIRGRFPFKRNP-----NLNR-DRHRSSSDRQLPRSA 56

Query: 550 RN-RSH-HNRIVRKGFSFKITYF-----LYGAAILAFLVFVVGSISLQTSISSVFTSGSD 392
            + RSH HNR  RKGF     +F     LY    +   +F + S+ +Q SI+SVF   ++
Sbjct: 57  NSSRSHLHNRFTRKGFLSLFPFFKGKSGLYALIFVVVFLFALASMVMQNSITSVFRQRNE 116

Query: 391 RISIFTRKNGLKYGDNLKFLPTTKLLQRFDTQASLLDRLRTESRVAIRPPRLALILGNMD 212
                  + GLK+G  +KF+P  K+ Q+F      LDRLR++ R+ +R PR+ALILG+M 
Sbjct: 117 GSRYL--REGLKFGSTIKFVPG-KVSQKF-LSGDGLDRLRSQPRIGVRSPRIALILGHMS 172

Query: 211 KDPVTLMLYTVVKSLKELGYSFKMFAVDDGDARPLWESIGGRVSILGANSSLYIDWSIYE 32
            DP +LML TV+++L++LGY FK+F V    AR +WE++GG +S L       IDWS Y 
Sbjct: 173 VDPQSLMLVTVIQNLQKLGYVFKIFVVGHRKARSIWENVGGGLSSLSTEQQGQIDWSTYX 232

Query: 31  GVIVNSLEAK 2
            +IV+SLEAK
Sbjct: 233 XIIVDSLEAK 242


>ref|XP_006589360.1| PREDICTED: uncharacterized protein LOC100779157 isoform X2 [Glycine
           max]
          Length = 1043

 Score =  155 bits (392), Expect = 3e-35
 Identities = 96/249 (38%), Positives = 143/249 (57%), Gaps = 6/249 (2%)
 Frame = -3

Query: 730 MGRNSTSPPDPENNSGDQTGVVYSIRDRFRLKRNPINHSDNINNVEXXXXXXDRKMDRQW 551
           + RN+ S P+ ++  G       +IR  F  KRNP +H    +         +       
Sbjct: 3   LSRNAASQPEIDDGGGGGDIGFGAIRGGFPFKRNPSHHRHRGSFDRQLPRSNNNSNSNNN 62

Query: 550 RNRSHHNRIVRKGFSFKITYF------LYGAAILAFLVFVVGSISLQTSISSVFTSGSDR 389
            NRSH ++  RKG    +  F       Y   I    +F + S+ +Q+SI+SVF   ++R
Sbjct: 63  INRSHLHK--RKGLLLWLFPFPKSKSGFYAFIIAVVFLFALASLVMQSSITSVFRQRAER 120

Query: 388 ISIFTRKNGLKYGDNLKFLPTTKLLQRFDTQASLLDRLRTESRVAIRPPRLALILGNMDK 209
            S    + G+++G  L+F+P  K+ QRF      LD +R++ R+ +R PR+ALILG+M  
Sbjct: 121 ASYI--RGGIRFGSALRFVPG-KISQRF-LSGDGLDPVRSQPRIGVRAPRIALILGHMTI 176

Query: 208 DPVTLMLYTVVKSLKELGYSFKMFAVDDGDARPLWESIGGRVSILGANSSLYIDWSIYEG 29
           DP +LML TV+++L++LGY FK+FAV  G AR +WE+IGG +S L A     IDWSI+EG
Sbjct: 177 DPQSLMLVTVIRNLQKLGYVFKIFAVGHGKARSIWENIGGGISPLSAKHQGLIDWSIFEG 236

Query: 28  VIVNSLEAK 2
           +IV+SLEAK
Sbjct: 237 IIVDSLEAK 245


>ref|XP_007144256.1| hypothetical protein PHAVU_007G141200g [Phaseolus vulgaris]
           gi|561017446|gb|ESW16250.1| hypothetical protein
           PHAVU_007G141200g [Phaseolus vulgaris]
          Length = 1049

 Score =  155 bits (392), Expect = 3e-35
 Identities = 96/250 (38%), Positives = 145/250 (58%), Gaps = 7/250 (2%)
 Frame = -3

Query: 730 MGRNSTSPPDPENNSGDQTGVVYSIRDRFRLKRNPINHSDNINNVEXXXXXXDRKMDRQW 551
           + RN+ S P+ ++  GD     ++IR  F  KRNP +H  +  + +              
Sbjct: 3   LSRNAASQPEIDDAGGDIG--FHAIRGGFPFKRNP-SHYRHRGSFDRQLPRSSNSSSSNS 59

Query: 550 RNRSH-HNRIVRKGFSFKITYF------LYGAAILAFLVFVVGSISLQTSISSVFTSGSD 392
            +RSH H+R+ RKG    +  F       Y   I+   +F   S+ +Q SI+SVF   ++
Sbjct: 60  SSRSHLHSRLTRKGLLLWLFPFSKCKSGFYALIIVVVFLFAFSSMVMQNSITSVFRQRTE 119

Query: 391 RISIFTRKNGLKYGDNLKFLPTTKLLQRFDTQASLLDRLRTESRVAIRPPRLALILGNMD 212
           R      + GL++G  L+F+P  ++ Q F      LDR+R++ R+ +RPPR+ALILG+M 
Sbjct: 120 RGRY--HREGLRFGTALRFVPG-RVSQGF-LSGDGLDRVRSQPRLGVRPPRIALILGHMT 175

Query: 211 KDPVTLMLYTVVKSLKELGYSFKMFAVDDGDARPLWESIGGRVSILGANSSLYIDWSIYE 32
            DP +LML TV+++L++LGY FK+FAV +G A  +WE+IGG +S L       IDWSI+E
Sbjct: 176 IDPQSLMLVTVIRNLQKLGYVFKIFAVGNGKAHSIWENIGGGISHLNTERQGLIDWSIFE 235

Query: 31  GVIVNSLEAK 2
           G+IV SLEAK
Sbjct: 236 GIIVGSLEAK 245


>ref|XP_003535489.1| PREDICTED: uncharacterized protein LOC100779157 isoform X1 [Glycine
           max]
          Length = 1044

 Score =  155 bits (392), Expect = 3e-35
 Identities = 96/249 (38%), Positives = 143/249 (57%), Gaps = 6/249 (2%)
 Frame = -3

Query: 730 MGRNSTSPPDPENNSGDQTGVVYSIRDRFRLKRNPINHSDNINNVEXXXXXXDRKMDRQW 551
           + RN+ S P+ ++  G       +IR  F  KRNP +H    +         +       
Sbjct: 3   LSRNAASQPEIDDGGGGGDIGFGAIRGGFPFKRNPSHHRHRGSFDRQLPRSNNNSNSNNN 62

Query: 550 RNRSHHNRIVRKGFSFKITYF------LYGAAILAFLVFVVGSISLQTSISSVFTSGSDR 389
            NRSH ++  RKG    +  F       Y   I    +F + S+ +Q+SI+SVF   ++R
Sbjct: 63  INRSHLHK--RKGLLLWLFPFPKSKSGFYAFIIAVVFLFALASLVMQSSITSVFRQRAER 120

Query: 388 ISIFTRKNGLKYGDNLKFLPTTKLLQRFDTQASLLDRLRTESRVAIRPPRLALILGNMDK 209
            S    + G+++G  L+F+P  K+ QRF      LD +R++ R+ +R PR+ALILG+M  
Sbjct: 121 ASYI--RGGIRFGSALRFVPG-KISQRF-LSGDGLDPVRSQPRIGVRAPRIALILGHMTI 176

Query: 208 DPVTLMLYTVVKSLKELGYSFKMFAVDDGDARPLWESIGGRVSILGANSSLYIDWSIYEG 29
           DP +LML TV+++L++LGY FK+FAV  G AR +WE+IGG +S L A     IDWSI+EG
Sbjct: 177 DPQSLMLVTVIRNLQKLGYVFKIFAVGHGKARSIWENIGGGISPLSAKHQGLIDWSIFEG 236

Query: 28  VIVNSLEAK 2
           +IV+SLEAK
Sbjct: 237 IIVDSLEAK 245


>ref|XP_006379502.1| hypothetical protein POPTR_0008s02940g [Populus trichocarpa]
           gi|550332296|gb|ERP57299.1| hypothetical protein
           POPTR_0008s02940g [Populus trichocarpa]
          Length = 1061

 Score =  154 bits (390), Expect = 4e-35
 Identities = 109/259 (42%), Positives = 143/259 (55%), Gaps = 16/259 (6%)
 Frame = -3

Query: 730 MGRNSTSPPD--------PENNSGDQTGVVYSIRDRFRLKRNPINHSDNINNVEXXXXXX 575
           M RN  SPP+          +   DQ+    SIRDR   KRNP   + N N  +      
Sbjct: 1   MIRNHHSPPELPVSAINGGSDGGSDQSS--NSIRDRSLFKRNP---NYNTNTPDKSSKSP 55

Query: 574 DRKMDRQWRNRSHHNRIV-RKG-----FSFKITYFLYGAAILAFLVFVVGSISLQTSISS 413
             + DR+ R   + NR   RKG     F F+  Y  Y     A L FV+ SI LQ+SI+ 
Sbjct: 56  LDRSDRRSRWHPYTNRSYNRKGWLLPCFPFRGVYLFYCLIFFAVLAFVLASILLQSSITG 115

Query: 412 VFTSGSDRISIFTR-KNGLKYGDNLKFLPTTKLLQRFDTQASLLDRLRT-ESRVAIRPPR 239
           +       I  +   K  LK G  LKF+P  K   R   +   LD +R   +RV +RPPR
Sbjct: 116 MAVFRRGWIDHWRPIKEDLKSGAMLKFVPVLK--SRLPLEGHGLDHVRLLANRVGLRPPR 173

Query: 238 LALILGNMDKDPVTLMLYTVVKSLKELGYSFKMFAVDDGDARPLWESIGGRVSILGANSS 59
           LA+ILGNM K P +LML +VV +L++LGY+ K++AVD+G  R +WE IGGR+SILG    
Sbjct: 174 LAVILGNMKKGPQSLMLISVVMNLRKLGYALKIYAVDNGVTRSVWEEIGGRISILGPEQY 233

Query: 58  LYIDWSIYEGVIVNSLEAK 2
            +IDWSI+E VIV+SLEAK
Sbjct: 234 DHIDWSIFEAVIVDSLEAK 252


>ref|XP_006485287.1| PREDICTED: uncharacterized protein LOC102618162 isoform X2 [Citrus
           sinensis]
          Length = 962

 Score =  151 bits (381), Expect = 5e-34
 Identities = 98/250 (39%), Positives = 136/250 (54%), Gaps = 10/250 (4%)
 Frame = -3

Query: 721 NSTSPPDPENNSGDQTGVVYSIRDRFRLKRNPINHSDNINNVEXXXXXXDRKMDRQWRNR 542
           N+ +     + + DQ    +SIRDRFR KR+P NH+ +    +        +        
Sbjct: 14  NAAAAATTTSGNNDQQQHPHSIRDRFRFKRSP-NHTQDKTQTKPSLHRYLLRHRHVNSTP 72

Query: 541 SHHN------RIVRKGFS----FKITYFLYGAAILAFLVFVVGSISLQTSISSVFTSGSD 392
           S  N      R  RKGFS    F+  Y LY    LA   F + S+ LQ SI+SVF +   
Sbjct: 73  SAANAATSGPRFNRKGFSSLFPFRGAYLLYFMIFLAVFAFAMASMVLQNSIASVFGAERG 132

Query: 391 RISIFTRKNGLKYGDNLKFLPTTKLLQRFDTQASLLDRLRTESRVAIRPPRLALILGNMD 212
           R      +  L++G  LKF+P            + LD LR+  R  +RPPR+ LILGNM 
Sbjct: 133 R----PIREELRFGSRLKFVPDQVGF------GNGLDGLRSTPRFGVRPPRIGLILGNMA 182

Query: 211 KDPVTLMLYTVVKSLKELGYSFKMFAVDDGDARPLWESIGGRVSILGANSSLYIDWSIYE 32
           KD  +L+L TVVK+L++LGY FK++AV  G++  LWE I G++SILG      IDWSI++
Sbjct: 183 KDSRSLLLITVVKNLQKLGYVFKIYAVRSGNSHSLWEQIAGQISILGQEQYSLIDWSIFD 242

Query: 31  GVIVNSLEAK 2
           G+I +SLEAK
Sbjct: 243 GIIADSLEAK 252


>ref|XP_006436561.1| hypothetical protein CICLE_v10030581mg [Citrus clementina]
           gi|568863734|ref|XP_006485286.1| PREDICTED:
           uncharacterized protein LOC102618162 isoform X1 [Citrus
           sinensis] gi|557538757|gb|ESR49801.1| hypothetical
           protein CICLE_v10030581mg [Citrus clementina]
          Length = 1055

 Score =  151 bits (381), Expect = 5e-34
 Identities = 98/250 (39%), Positives = 136/250 (54%), Gaps = 10/250 (4%)
 Frame = -3

Query: 721 NSTSPPDPENNSGDQTGVVYSIRDRFRLKRNPINHSDNINNVEXXXXXXDRKMDRQWRNR 542
           N+ +     + + DQ    +SIRDRFR KR+P NH+ +    +        +        
Sbjct: 14  NAAAAATTTSGNNDQQQHPHSIRDRFRFKRSP-NHTQDKTQTKPSLHRYLLRHRHVNSTP 72

Query: 541 SHHN------RIVRKGFS----FKITYFLYGAAILAFLVFVVGSISLQTSISSVFTSGSD 392
           S  N      R  RKGFS    F+  Y LY    LA   F + S+ LQ SI+SVF +   
Sbjct: 73  SAANAATSGPRFNRKGFSSLFPFRGAYLLYFMIFLAVFAFAMASMVLQNSIASVFGAERG 132

Query: 391 RISIFTRKNGLKYGDNLKFLPTTKLLQRFDTQASLLDRLRTESRVAIRPPRLALILGNMD 212
           R      +  L++G  LKF+P            + LD LR+  R  +RPPR+ LILGNM 
Sbjct: 133 R----PIREELRFGSRLKFVPDQVGF------GNGLDGLRSTPRFGVRPPRIGLILGNMA 182

Query: 211 KDPVTLMLYTVVKSLKELGYSFKMFAVDDGDARPLWESIGGRVSILGANSSLYIDWSIYE 32
           KD  +L+L TVVK+L++LGY FK++AV  G++  LWE I G++SILG      IDWSI++
Sbjct: 183 KDSRSLLLITVVKNLQKLGYVFKIYAVRSGNSHSLWEQIAGQISILGQEQYSLIDWSIFD 242

Query: 31  GVIVNSLEAK 2
           G+I +SLEAK
Sbjct: 243 GIIADSLEAK 252


>ref|XP_006378794.1| hypothetical protein POPTR_0010s23830g [Populus trichocarpa]
           gi|550330474|gb|ERP56591.1| hypothetical protein
           POPTR_0010s23830g [Populus trichocarpa]
          Length = 1053

 Score =  150 bits (378), Expect = 1e-33
 Identities = 110/261 (42%), Positives = 149/261 (57%), Gaps = 18/261 (6%)
 Frame = -3

Query: 730 MGRNSTSP---PD-PENNSGDQTGV----VYSIRDRFRLKRNPINHSDNINNVEXXXXXX 575
           M RN  +P   PD P  N+G + GV     +SI DRF  KRNP N S N  +        
Sbjct: 1   MNRNHHNPSELPDSPATNTGSE-GVSDQNFHSISDRFLFKRNP-NPSTNSPHKSSKSPPD 58

Query: 574 DRKMDRQWRNRSHHNRIVRKGFSFKITYF-----LYGAAILAFLVFVVGSISLQTSISS- 413
             +    + N+S++    RKG  F    F      Y    LA   FV+ SI LQ+SI+  
Sbjct: 59  RLRRWHHYTNKSNN----RKGGWFSCIPFRGICLFYFVIFLAVFAFVLASILLQSSITGM 114

Query: 412 -VFTSG--SDRISIFTRKNGLKYGDNLKFLPTTKLLQRFDTQASLLDRLRT-ESRVAIRP 245
            VF+ G    R SI   + GLK G  LKF+P   L  R   +   LD  R   +RV +RP
Sbjct: 115 VVFSKGWIDHRRSI---REGLKSGTTLKFVPG--LRSRLLLEGHGLDHARVLANRVGLRP 169

Query: 244 PRLALILGNMDKDPVTLMLYTVVKSLKELGYSFKMFAVDDGDARPLWESIGGRVSILGAN 65
           PRLA+ILGNM KDP +LML +V+K+L++LGY+ K++A+ +G+ R +WE IGG++S+L   
Sbjct: 170 PRLAVILGNMKKDPQSLMLLSVMKNLRKLGYALKIYALGNGETRTMWEDIGGQISVLRPK 229

Query: 64  SSLYIDWSIYEGVIVNSLEAK 2
               IDWSI+EGV+V+SLEAK
Sbjct: 230 QYDLIDWSIFEGVMVDSLEAK 250


>ref|XP_002873152.1| hypothetical protein ARALYDRAFT_487229 [Arabidopsis lyrata subsp.
           lyrata] gi|297318989|gb|EFH49411.1| hypothetical protein
           ARALYDRAFT_487229 [Arabidopsis lyrata subsp. lyrata]
          Length = 1051

 Score =  146 bits (368), Expect = 2e-32
 Identities = 103/264 (39%), Positives = 141/264 (53%), Gaps = 21/264 (7%)
 Frame = -3

Query: 730 MGRNSTSPPDPENNSGDQTG--------------VVYSIRDRFRLKRNPINHSDNINNVE 593
           MGRNS S    +N    + G                +SIRDR RLKRN  +  D  ++  
Sbjct: 1   MGRNSLSLEIDDNGGAGRDGNHNNANNVAGNGDTSFHSIRDRLRLKRNSSDRRDRSHS-- 58

Query: 592 XXXXXXDRKMDR-QWRNRSHH--NRIVRKGFSFKI----TYFLYGAAILAFLVFVVGSIS 434
                    +DR   RNR HH    + RKG    +    T  LY         FV+ S+ 
Sbjct: 59  --------GLDRPSLRNRPHHIARSLNRKGLISLLKPRGTCLLYFLVAFTVCAFVMSSLL 110

Query: 433 LQTSISSVFTSGSDRISIFTRKNGLKYGDNLKFLPTTKLLQRFDTQASLLDRLRTESRVA 254
           LQ SI+     G+ +      + GL  G  LK++P    + R   +   LD LR+  R+ 
Sbjct: 111 LQNSIT---WQGNVKRGQVRSQIGL--GSTLKYVPGG--IARTLIEGEGLDPLRSTVRIG 163

Query: 253 IRPPRLALILGNMDKDPVTLMLYTVVKSLKELGYSFKMFAVDDGDARPLWESIGGRVSIL 74
           +RPPRLAL+LGNM KDP TLML TV+K+L++LGY FK+FAV++G+AR LWE + G V +L
Sbjct: 164 VRPPRLALVLGNMKKDPRTLMLVTVMKNLQKLGYVFKVFAVENGEARSLWEHLAGHVKVL 223

Query: 73  GANSSLYIDWSIYEGVIVNSLEAK 2
            +    + DW+I+EGVI +SLEAK
Sbjct: 224 VSEQLGHADWTIFEGVIADSLEAK 247


>ref|XP_006606300.1| PREDICTED: uncharacterized protein LOC100790929 isoform X5 [Glycine
           max]
          Length = 796

 Score =  143 bits (361), Expect = 1e-31
 Identities = 95/254 (37%), Positives = 140/254 (55%), Gaps = 11/254 (4%)
 Frame = -3

Query: 730 MGRNSTSPPDPENNSGDQTGVVYSIRDRFRLKRNPINHSDNINNVEXXXXXXDRKMDRQW 551
           + RN  S P+ ++  GD      +IR  F  KRNP +H    +         +       
Sbjct: 3   LSRNVASQPEIDDAGGDIG--FGAIRGGFPFKRNPGHHRHRASFDRQLPRSNNSSSSSSS 60

Query: 550 RN-----RSHHNRIVRKGFSFKITYF------LYGAAILAFLVFVVGSISLQTSISSVFT 404
            N     RSH ++  RKG    +  F       Y   I+   +F + S+ LQ+SI+SVF 
Sbjct: 61  NNNNISIRSHLHK--RKGLLLWLFPFPKSKSGFYAFIIVVVFLFALASMVLQSSITSVFR 118

Query: 403 SGSDRISIFTRKNGLKYGDNLKFLPTTKLLQRFDTQASLLDRLRTESRVAIRPPRLALIL 224
             +D     +   G+++G  L+F+P  ++ QRF      LD +R++ R+ +R PR+ALIL
Sbjct: 119 QSADSARYIS--GGIRFGSALRFVPG-RISQRF-LSGDGLDPVRSQPRIGVRAPRIALIL 174

Query: 223 GNMDKDPVTLMLYTVVKSLKELGYSFKMFAVDDGDARPLWESIGGRVSILGANSSLYIDW 44
           G+M  DP +LML TV+ +L++LGY FK+FAV  G AR +WE+IGGR+  L       IDW
Sbjct: 175 GHMTIDPQSLMLVTVIWNLQKLGYVFKIFAVGHGKARSIWENIGGRICPLSTEHQGLIDW 234

Query: 43  SIYEGVIVNSLEAK 2
           SI+EG+IV+SLEAK
Sbjct: 235 SIFEGIIVDSLEAK 248


>ref|XP_006606298.1| PREDICTED: uncharacterized protein LOC100790929 isoform X3 [Glycine
           max]
          Length = 1015

 Score =  143 bits (361), Expect = 1e-31
 Identities = 95/254 (37%), Positives = 140/254 (55%), Gaps = 11/254 (4%)
 Frame = -3

Query: 730 MGRNSTSPPDPENNSGDQTGVVYSIRDRFRLKRNPINHSDNINNVEXXXXXXDRKMDRQW 551
           + RN  S P+ ++  GD      +IR  F  KRNP +H    +         +       
Sbjct: 3   LSRNVASQPEIDDAGGDIG--FGAIRGGFPFKRNPGHHRHRASFDRQLPRSNNSSSSSSS 60

Query: 550 RN-----RSHHNRIVRKGFSFKITYF------LYGAAILAFLVFVVGSISLQTSISSVFT 404
            N     RSH ++  RKG    +  F       Y   I+   +F + S+ LQ+SI+SVF 
Sbjct: 61  NNNNISIRSHLHK--RKGLLLWLFPFPKSKSGFYAFIIVVVFLFALASMVLQSSITSVFR 118

Query: 403 SGSDRISIFTRKNGLKYGDNLKFLPTTKLLQRFDTQASLLDRLRTESRVAIRPPRLALIL 224
             +D     +   G+++G  L+F+P  ++ QRF      LD +R++ R+ +R PR+ALIL
Sbjct: 119 QSADSARYIS--GGIRFGSALRFVPG-RISQRF-LSGDGLDPVRSQPRIGVRAPRIALIL 174

Query: 223 GNMDKDPVTLMLYTVVKSLKELGYSFKMFAVDDGDARPLWESIGGRVSILGANSSLYIDW 44
           G+M  DP +LML TV+ +L++LGY FK+FAV  G AR +WE+IGGR+  L       IDW
Sbjct: 175 GHMTIDPQSLMLVTVIWNLQKLGYVFKIFAVGHGKARSIWENIGGRICPLSTEHQGLIDW 234

Query: 43  SIYEGVIVNSLEAK 2
           SI+EG+IV+SLEAK
Sbjct: 235 SIFEGIIVDSLEAK 248

