BLASTX nr result

ID: Forsythia22_contig00015344

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Forsythia22_contig00015344
         (763 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002520074.1| protein with unknown function [Ricinus commu...   325   1e-86
ref|XP_006484602.1| PREDICTED: uncharacterized protein LOC102629...   322   1e-85
ref|XP_006437428.1| hypothetical protein CICLE_v10032089mg [Citr...   322   1e-85
ref|XP_007029676.1| PLATZ transcription factor family protein is...   322   1e-85
ref|XP_011084632.1| PREDICTED: uncharacterized protein LOC105166...   322   2e-85
ref|XP_002271206.1| PREDICTED: uncharacterized protein LOC100261...   321   3e-85
gb|KHG03012.1| hypothetical protein F383_25875 [Gossypium arboreum]   320   6e-85
ref|XP_012070242.1| PREDICTED: uncharacterized protein LOC105632...   319   1e-84
ref|XP_012470056.1| PREDICTED: uncharacterized protein LOC105787...   318   2e-84
ref|XP_003517556.1| PREDICTED: uncharacterized protein LOC100796...   313   9e-83
ref|XP_011014381.1| PREDICTED: uncharacterized protein LOC105118...   311   2e-82
ref|XP_006590432.1| PREDICTED: uncharacterized protein LOC100792...   311   2e-82
ref|XP_003537704.1| PREDICTED: uncharacterized protein LOC100792...   311   2e-82
ref|XP_007157082.1| hypothetical protein PHAVU_002G041400g [Phas...   311   4e-82
ref|XP_003540084.1| PREDICTED: uncharacterized protein LOC100810...   311   4e-82
ref|XP_003537821.1| PREDICTED: uncharacterized protein LOC100810...   311   4e-82
gb|KHN46470.1| hypothetical protein glysoja_035146 [Glycine soja]     310   5e-82
ref|XP_002325391.1| hypothetical protein POPTR_0019s07790g [Popu...   310   6e-82
ref|XP_010088252.1| hypothetical protein L484_003212 [Morus nota...   309   1e-81
ref|XP_010240867.1| PREDICTED: uncharacterized protein LOC104585...   308   2e-81

>ref|XP_002520074.1| protein with unknown function [Ricinus communis]
           gi|223540838|gb|EEF42398.1| protein with unknown
           function [Ricinus communis]
          Length = 251

 Score =  325 bits (834), Expect = 1e-86
 Identities = 162/212 (76%), Positives = 179/212 (84%), Gaps = 7/212 (3%)
 Frame = -3

Query: 686 GEDEDDYSKWPAWLRPLLQTSFFGQCKLHASAHKSECNMYCLDCMNGALCSLCLNFHKEH 507
           G D+DD  KWP WLRPLLQTSFF QCKLHA AHKSECNMYCLDCMNGALCSLCL++HKEH
Sbjct: 40  GGDKDDV-KWPPWLRPLLQTSFFVQCKLHADAHKSECNMYCLDCMNGALCSLCLSYHKEH 98

Query: 506 RAIQIRRSSYHDVIRVSEIQKYLDITGVQTYIINSAKIVFLNERPQPRPGKGVTNTCRVC 327
           RAIQIRRSSYHDVIRVSEIQKYLDITGVQTYIINSA+IVFLNERPQPRPGKGVTNTC+VC
Sbjct: 99  RAIQIRRSSYHDVIRVSEIQKYLDITGVQTYIINSARIVFLNERPQPRPGKGVTNTCQVC 158

Query: 326 ERSLLDSFNFCSLGCKIVGTSNNFEXXXXXKRNICVEVGGSDSEESINGKV----IKKKI 159
           ERSLLDSF+FCSLGCKIVGTS NF      K+ +  E+ GSD+EES+N  +     K KI
Sbjct: 159 ERSLLDSFSFCSLGCKIVGTSKNFR-----KKKMRKEIEGSDTEESMNNGIGKGSPKSKI 213

Query: 158 RSFSPSTPPQTCA---SGKRRKGIPHRAPTGG 72
           +SF+PSTPP T     + KRRKG+PHR+P GG
Sbjct: 214 QSFTPSTPPPTAVNYRTAKRRKGVPHRSPMGG 245


>ref|XP_006484602.1| PREDICTED: uncharacterized protein LOC102629881 [Citrus sinensis]
           gi|641830104|gb|KDO49202.1| hypothetical protein
           CISIN_1g026096mg [Citrus sinensis]
          Length = 243

 Score =  322 bits (826), Expect = 1e-85
 Identities = 163/219 (74%), Positives = 178/219 (81%), Gaps = 10/219 (4%)
 Frame = -3

Query: 698 QKIMG------EDEDDYSKWPAWLRPLLQTSFFGQCKLHASAHKSECNMYCLDCMNGALC 537
           ++IMG      EDE+  +KWP WLRPLLQTSFF QCKLHA AHKSECNMYCLDCMNGALC
Sbjct: 25  RRIMGGGGPEEEDEEMSNKWPPWLRPLLQTSFFVQCKLHADAHKSECNMYCLDCMNGALC 84

Query: 536 SLCLNFHKEHRAIQIRRSSYHDVIRVSEIQKYLDITGVQTYIINSAKIVFLNERPQPRPG 357
           SLCL+ H++HRAIQIRRSSYHDVIRVSEIQKYLDITGVQTYIINSA+IVFLNERPQPRPG
Sbjct: 85  SLCLSLHRDHRAIQIRRSSYHDVIRVSEIQKYLDITGVQTYIINSARIVFLNERPQPRPG 144

Query: 356 KGVTNTCRVCERSLLDSFNFCSLGCKIVGTSNNFEXXXXXKRNICVEVG-GSDSEESING 180
           KGVTNTC VCERSLLDSF FCSLGCKI GTS NF+     KR +C E+  GSD+E S N 
Sbjct: 145 KGVTNTCLVCERSLLDSFTFCSLGCKIAGTSKNFK-----KRKMCKEMDQGSDAEISSNN 199

Query: 179 KVIKKKIRSFSPSTPPQTCAS---GKRRKGIPHRAPTGG 72
              K + +SFSPSTPP T  S    KRRKG+PHRAP GG
Sbjct: 200 GSSKSRTQSFSPSTPPPTAVSFRTAKRRKGVPHRAPMGG 238


>ref|XP_006437428.1| hypothetical protein CICLE_v10032089mg [Citrus clementina]
           gi|557539624|gb|ESR50668.1| hypothetical protein
           CICLE_v10032089mg [Citrus clementina]
          Length = 329

 Score =  322 bits (826), Expect = 1e-85
 Identities = 163/219 (74%), Positives = 178/219 (81%), Gaps = 10/219 (4%)
 Frame = -3

Query: 698 QKIMG------EDEDDYSKWPAWLRPLLQTSFFGQCKLHASAHKSECNMYCLDCMNGALC 537
           ++IMG      EDE+  +KWP WLRPLLQTSFF QCKLHA AHKSECNMYCLDCMNGALC
Sbjct: 111 RRIMGGGGPEEEDEEMSNKWPPWLRPLLQTSFFVQCKLHADAHKSECNMYCLDCMNGALC 170

Query: 536 SLCLNFHKEHRAIQIRRSSYHDVIRVSEIQKYLDITGVQTYIINSAKIVFLNERPQPRPG 357
           SLCL+ H++HRAIQIRRSSYHDVIRVSEIQKYLDITGVQTYIINSA+IVFLNERPQPRPG
Sbjct: 171 SLCLSLHRDHRAIQIRRSSYHDVIRVSEIQKYLDITGVQTYIINSARIVFLNERPQPRPG 230

Query: 356 KGVTNTCRVCERSLLDSFNFCSLGCKIVGTSNNFEXXXXXKRNICVEVG-GSDSEESING 180
           KGVTNTC VCERSLLDSF FCSLGCKI GTS NF+     KR +C E+  GSD+E S N 
Sbjct: 231 KGVTNTCLVCERSLLDSFTFCSLGCKIAGTSKNFK-----KRKMCKEMDQGSDAEISSNN 285

Query: 179 KVIKKKIRSFSPSTPPQTCAS---GKRRKGIPHRAPTGG 72
              K + +SFSPSTPP T  S    KRRKG+PHRAP GG
Sbjct: 286 GSSKSRTQSFSPSTPPPTAVSFRTAKRRKGVPHRAPMGG 324


>ref|XP_007029676.1| PLATZ transcription factor family protein isoform 1 [Theobroma
           cacao] gi|508718281|gb|EOY10178.1| PLATZ transcription
           factor family protein isoform 1 [Theobroma cacao]
          Length = 238

 Score =  322 bits (826), Expect = 1e-85
 Identities = 162/220 (73%), Positives = 179/220 (81%), Gaps = 11/220 (5%)
 Frame = -3

Query: 698 QKIMG----EDEDDYS--KWPAWLRPLLQTSFFGQCKLHASAHKSECNMYCLDCMNGALC 537
           ++IMG     D+DD    KWP WLRPLLQTSFF QCKLHA AHKSECNMYCLDCMNGALC
Sbjct: 18  RRIMGGGGPNDDDDKEDVKWPPWLRPLLQTSFFVQCKLHADAHKSECNMYCLDCMNGALC 77

Query: 536 SLCLNFHKEHRAIQIRRSSYHDVIRVSEIQKYLDITGVQTYIINSAKIVFLNERPQPRPG 357
           SLCL +HKEHRAIQIRRSSYHDVIRVSEIQK+LDITG+QTYIINSA+IVFLNERPQPRPG
Sbjct: 78  SLCLAYHKEHRAIQIRRSSYHDVIRVSEIQKFLDITGIQTYIINSARIVFLNERPQPRPG 137

Query: 356 KGVTNTCRVCERSLLDSFNFCSLGCKIVGTSNNFEXXXXXKRNICVEVGGSDSE--ESIN 183
           KGVTNTC+VCERSLLDSF+FCSLGCKIVGTS NF      K+ +C E  GSD+E    +N
Sbjct: 138 KGVTNTCQVCERSLLDSFSFCSLGCKIVGTSKNF----IRKKKMCKETDGSDAESLSGVN 193

Query: 182 GKVIKKKIRSFSPSTPPQTCA---SGKRRKGIPHRAPTGG 72
               K KI+SF+PSTPP T     + KRRKG+PHRAP GG
Sbjct: 194 SGSRKSKIQSFTPSTPPPTAVNYRTAKRRKGVPHRAPMGG 233


>ref|XP_011084632.1| PREDICTED: uncharacterized protein LOC105166840 [Sesamum indicum]
          Length = 222

 Score =  322 bits (825), Expect = 2e-85
 Identities = 159/216 (73%), Positives = 180/216 (83%), Gaps = 11/216 (5%)
 Frame = -3

Query: 686 GEDEDDYSKWPAWLRPLLQTSFFGQCKLHASAHKSECNMYCLDCMNGALCSLCLNFHKEH 507
           GE++D  SKWPAW+ PLL+TSFFGQCK+HA+AH+SECNM+CLDC+NG LCS+CLN HK+H
Sbjct: 7   GEEDDGSSKWPAWVGPLLRTSFFGQCKVHAAAHRSECNMFCLDCVNGPLCSVCLNLHKDH 66

Query: 506 RAIQIRRSSYHDVIRVSEIQKYLDITGVQTYIINSAKIVFLNERPQPRPGKGVTNTCRVC 327
           R IQIRRSSYHDVIRV EIQKYLDITGVQTYIINSAKIVFLN RPQPRP KGVTNTCRVC
Sbjct: 67  RPIQIRRSSYHDVIRVCEIQKYLDITGVQTYIINSAKIVFLNHRPQPRPAKGVTNTCRVC 126

Query: 326 ERSLLDSFNFCSLGCKIVGTSNNFEXXXXXKRNICVEVGGSDSEE--SING-----KVIK 168
           +R LLDSF FCSLGCKIVGTS NF+     K+ I  E GGSDSEE  S+NG     +VI+
Sbjct: 127 DRGLLDSFTFCSLGCKIVGTSKNFQ-----KKKIGKEDGGSDSEESMSMNGSTKHKQVIR 181

Query: 167 KKIRSFSPSTPPQTCA----SGKRRKGIPHRAPTGG 72
            KI+SF+PSTPP+T      S KRRKG+PHRAPTGG
Sbjct: 182 NKIQSFTPSTPPRTATLNYRSAKRRKGVPHRAPTGG 217


>ref|XP_002271206.1| PREDICTED: uncharacterized protein LOC100261275 [Vitis vinifera]
          Length = 231

 Score =  321 bits (823), Expect = 3e-85
 Identities = 161/213 (75%), Positives = 177/213 (83%), Gaps = 9/213 (4%)
 Frame = -3

Query: 686 GEDEDDYSKWPAWLRPLLQTSFFGQCKLHASAHKSECNMYCLDCMNGALCSLCLNFHKEH 507
           GE+E + SKWP WLRPLLQTSFF QCKLHA +H+SECNMYCLDCMNGALCSLCLNFHK+H
Sbjct: 19  GEEEVE-SKWPPWLRPLLQTSFFVQCKLHADSHRSECNMYCLDCMNGALCSLCLNFHKDH 77

Query: 506 RAIQIRRSSYHDVIRVSEIQKYLDITGVQTYIINSAKIVFLNERPQPRPGKGVTNTCRVC 327
           RAIQIRRSSYHDVIRVSEIQK+LDITGVQTYIINSA+IVFLNERPQPRPGKGVTNTC+VC
Sbjct: 78  RAIQIRRSSYHDVIRVSEIQKFLDITGVQTYIINSARIVFLNERPQPRPGKGVTNTCQVC 137

Query: 326 ERSLLDSFNFCSLGCKIVGTSNNFEXXXXXKRNICVEVGGSDSE-----ESINGKVIKKK 162
           ERSLLDSF FCSLGCKIVGTS +F      K+ +C+E  GSDSE      S +G   K K
Sbjct: 138 ERSLLDSFTFCSLGCKIVGTSKSFR-----KKKLCMETEGSDSESLNGASSGSGSSSKSK 192

Query: 161 IRSFSPSTPPQTCA----SGKRRKGIPHRAPTG 75
           I SF+PSTPP T A    + KRRKG+PHRAP G
Sbjct: 193 IPSFTPSTPPPTAAATYRTAKRRKGVPHRAPLG 225


>gb|KHG03012.1| hypothetical protein F383_25875 [Gossypium arboreum]
          Length = 238

 Score =  320 bits (820), Expect = 6e-85
 Identities = 161/221 (72%), Positives = 178/221 (80%), Gaps = 12/221 (5%)
 Frame = -3

Query: 698 QKIMGEDEDDYS------KWPAWLRPLLQTSFFGQCKLHASAHKSECNMYCLDCMNGALC 537
           ++IMG    DY       KWP WLRPLLQTSFF QCKLHA AHKSECNMYCLDCMNGALC
Sbjct: 18  RRIMGGGGPDYDDDKEEVKWPPWLRPLLQTSFFVQCKLHADAHKSECNMYCLDCMNGALC 77

Query: 536 SLCLNFHKEHRAIQIRRSSYHDVIRVSEIQKYLDITGVQTYIINSAKIVFLNERPQPRPG 357
           SLCL +HK+HRAIQIRRSSYHDVIRVSEIQK++DITG+QTYIINSA+IVFLNERPQPRPG
Sbjct: 78  SLCLAYHKDHRAIQIRRSSYHDVIRVSEIQKFVDITGIQTYIINSARIVFLNERPQPRPG 137

Query: 356 KGVTNTCRVCERSLLDSFNFCSLGCKIVGTSNNFEXXXXXKRNICVEVGGSDSEESING- 180
           KGVTNTC+VCERSLLDSFNFCSLGCKIVGTS NF      K+ +C E  GSD+ ES+NG 
Sbjct: 138 KGVTNTCQVCERSLLDSFNFCSLGCKIVGTSKNF----IRKKKMCKETDGSDA-ESLNGV 192

Query: 179 --KVIKKKIRSFSPSTPPQTCA---SGKRRKGIPHRAPTGG 72
                K K++SF PSTPP T     + KRRKG+PHRAP GG
Sbjct: 193 SNGSTKSKVQSFRPSTPPPTAVNYRTAKRRKGVPHRAPMGG 233


>ref|XP_012070242.1| PREDICTED: uncharacterized protein LOC105632469 [Jatropha curcas]
           gi|643732443|gb|KDP39539.1| hypothetical protein
           JCGZ_02559 [Jatropha curcas]
          Length = 245

 Score =  319 bits (817), Expect = 1e-84
 Identities = 163/228 (71%), Positives = 182/228 (79%), Gaps = 19/228 (8%)
 Frame = -3

Query: 698 QKIMG----EDEDDYS---------KWPAWLRPLLQTSFFGQCKLHASAHKSECNMYCLD 558
           ++IMG    +D+DD S         KWP WLRPLLQTSFF  CKLH  AHKSECNMYCLD
Sbjct: 18  RRIMGGGGPDDDDDNSNGGRDIEDIKWPPWLRPLLQTSFFVHCKLHIDAHKSECNMYCLD 77

Query: 557 CMNGALCSLCLNFHKEHRAIQIRRSSYHDVIRVSEIQKYLDITGVQTYIINSAKIVFLNE 378
           CMNGALCSLCL++HK+HRAIQIRRSSYHDVIRVSEIQKYLDITGVQTYIINSA+IVFLNE
Sbjct: 78  CMNGALCSLCLSYHKDHRAIQIRRSSYHDVIRVSEIQKYLDITGVQTYIINSARIVFLNE 137

Query: 377 RPQPRPGKGVTNTCRVCERSLLDSFNFCSLGCKIVGTSNNFEXXXXXKRNICVEVGGSDS 198
           RPQPRPGKGVTNTC+VCERSLLDSF+FCSLGCKIVGTS NF      K+ +  E+ GSD+
Sbjct: 138 RPQPRPGKGVTNTCQVCERSLLDSFSFCSLGCKIVGTSKNFR-----KKKMRKEIEGSDT 192

Query: 197 EESIN---GKVIKKKIRSFSPSTPPQTC---ASGKRRKGIPHRAPTGG 72
           EES+N   G   K K +SFSPSTPP T     + KRRKG+PHR+P GG
Sbjct: 193 EESMNSNRGNGNKSKTQSFSPSTPPPTSLNYRTAKRRKGVPHRSPMGG 240


>ref|XP_012470056.1| PREDICTED: uncharacterized protein LOC105787970 isoform X1
           [Gossypium raimondii] gi|763753862|gb|KJB21250.1|
           hypothetical protein B456_003G056500 [Gossypium
           raimondii]
          Length = 238

 Score =  318 bits (815), Expect = 2e-84
 Identities = 160/221 (72%), Positives = 178/221 (80%), Gaps = 12/221 (5%)
 Frame = -3

Query: 698 QKIMGEDEDDYS------KWPAWLRPLLQTSFFGQCKLHASAHKSECNMYCLDCMNGALC 537
           ++IMG    DY       KWP WLRPLLQTSFF QCKLHA AHKSECNMYCLDCMNGALC
Sbjct: 18  RRIMGGGGPDYDDDKEEVKWPPWLRPLLQTSFFVQCKLHADAHKSECNMYCLDCMNGALC 77

Query: 536 SLCLNFHKEHRAIQIRRSSYHDVIRVSEIQKYLDITGVQTYIINSAKIVFLNERPQPRPG 357
           SLCL +HK+HRAIQIRRSSYHDVIRVSEIQK++DITG+QTYIINSA+IVFLNERPQPRPG
Sbjct: 78  SLCLAYHKDHRAIQIRRSSYHDVIRVSEIQKFVDITGIQTYIINSARIVFLNERPQPRPG 137

Query: 356 KGVTNTCRVCERSLLDSFNFCSLGCKIVGTSNNFEXXXXXKRNICVEVGGSDSEESING- 180
           KGVTNTC+VCERSLLDSF+FCSLGCKIVGTS NF      K+ +C E  GSD+ ES+NG 
Sbjct: 138 KGVTNTCQVCERSLLDSFSFCSLGCKIVGTSKNF----IRKKKMCKETDGSDA-ESLNGV 192

Query: 179 --KVIKKKIRSFSPSTPPQTCA---SGKRRKGIPHRAPTGG 72
                K K++SF PSTPP T     + KRRKG+PHRAP GG
Sbjct: 193 SNGSTKSKVQSFRPSTPPPTAVNYRTAKRRKGVPHRAPMGG 233


>ref|XP_003517556.1| PREDICTED: uncharacterized protein LOC100796834 [Glycine max]
           gi|734406903|gb|KHN34132.1| hypothetical protein
           glysoja_038621 [Glycine soja]
          Length = 234

 Score =  313 bits (801), Expect = 9e-83
 Identities = 156/212 (73%), Positives = 175/212 (82%), Gaps = 6/212 (2%)
 Frame = -3

Query: 689 MGEDEDDYSKWPAWLRPLLQTSFFGQCKLHASAHKSECNMYCLDCMNGALCSLCLNFHKE 510
           +G++E++ +KWP WL PLLQTSFF QCK+HA +HKSECNMYCLDCMNGALCS CL  H+E
Sbjct: 21  VGKNEEE-NKWPPWLGPLLQTSFFVQCKVHADSHKSECNMYCLDCMNGALCSTCLASHRE 79

Query: 509 HRAIQIRRSSYHDVIRVSEIQKYLDITGVQTYIINSAKIVFLNERPQPRPGKGVTNTCRV 330
           HRAIQIRRSSYHDVIRVSEIQK+LDITGVQTYIINSAKIVFLNERPQPRPGKGVTNTC+V
Sbjct: 80  HRAIQIRRSSYHDVIRVSEIQKFLDITGVQTYIINSAKIVFLNERPQPRPGKGVTNTCQV 139

Query: 329 CERSLLDSFNFCSLGCKIVGTSNNFEXXXXXKRNICVEVGGSDSEESING---KVIKKKI 159
           CERSLLDSF+FCSLGCKIVGTS  F      K+ +  E  GSD EESING   +  + KI
Sbjct: 140 CERSLLDSFSFCSLGCKIVGTSKKFR-----KKKMLAETDGSDGEESINGISNESGRNKI 194

Query: 158 RSFSPSTPPQTCA---SGKRRKGIPHRAPTGG 72
            SF+PSTPP T     + KRRKG+PHRAP GG
Sbjct: 195 HSFTPSTPPPTVVNYRTAKRRKGVPHRAPMGG 226


>ref|XP_011014381.1| PREDICTED: uncharacterized protein LOC105118189 [Populus
           euphratica]
          Length = 240

 Score =  311 bits (798), Expect = 2e-82
 Identities = 160/233 (68%), Positives = 182/233 (78%), Gaps = 14/233 (6%)
 Frame = -3

Query: 728 ISPNIEGHQFQKIMG----EDEDDYS---KWPAWLRPLLQTSFFGQCKLHASAHKSECNM 570
           I PNI     ++IMG    +D DD+    KWP WL PLL+TSFF QCKLHA AHKSECNM
Sbjct: 13  IKPNI-----RRIMGGGGPDDVDDHKEDIKWPPWLHPLLETSFFVQCKLHADAHKSECNM 67

Query: 569 YCLDCMNGALCSLCLNFHKEHRAIQIRRSSYHDVIRVSEIQKYLDITGVQTYIINSAKIV 390
           YCLDCMNGALCS+CL+ H +HRAIQIRRSSYHDVIRVSEIQKYLDITGVQTYIINSA+IV
Sbjct: 68  YCLDCMNGALCSVCLSLHSDHRAIQIRRSSYHDVIRVSEIQKYLDITGVQTYIINSARIV 127

Query: 389 FLNERPQPRPGKGVTNTCRVCERSLLDSFNFCSLGCKIVGTSNNFEXXXXXKRNICVEVG 210
           FLNERPQPRPGKGVTNTC VCERSLLDSF+FCSLGCKIVGTS NF      K+    E+ 
Sbjct: 128 FLNERPQPRPGKGVTNTCHVCERSLLDSFSFCSLGCKIVGTSKNFR-----KKKRYKEMD 182

Query: 209 GSDSEESING---KVIKKKIRSFSPSTPPQTC----ASGKRRKGIPHRAPTGG 72
           GSD+EES+ G      + K++SF+PSTPP +      + KRRKG+PHR+P GG
Sbjct: 183 GSDTEESMKGIGNGGARSKVQSFTPSTPPPSAMNNYRTAKRRKGVPHRSPMGG 235


>ref|XP_006590432.1| PREDICTED: uncharacterized protein LOC100792668 isoform X2 [Glycine
           max]
          Length = 233

 Score =  311 bits (798), Expect = 2e-82
 Identities = 156/218 (71%), Positives = 175/218 (80%), Gaps = 12/218 (5%)
 Frame = -3

Query: 689 MGEDEDDY------SKWPAWLRPLLQTSFFGQCKLHASAHKSECNMYCLDCMNGALCSLC 528
           + ED+DD       +KWP WLRPLLQTSFF QCK+HA +HKSECNMYCLDCMNGALCS C
Sbjct: 15  LNEDDDDIGKIEEENKWPPWLRPLLQTSFFVQCKVHADSHKSECNMYCLDCMNGALCSAC 74

Query: 527 LNFHKEHRAIQIRRSSYHDVIRVSEIQKYLDITGVQTYIINSAKIVFLNERPQPRPGKGV 348
           L  H+EHRAIQIRRSSYHDVIRVSEIQK+LDI GVQTYIINSAKIVFLNERPQPRPGKGV
Sbjct: 75  LASHREHRAIQIRRSSYHDVIRVSEIQKFLDIAGVQTYIINSAKIVFLNERPQPRPGKGV 134

Query: 347 TNTCRVCERSLLDSFNFCSLGCKIVGTSNNFEXXXXXKRNICVEVGGSDSEESING---K 177
           TNTC+VCER+LLDSF+FCSLGCKIVGTS  F      K+ +  E  GS+ EESING   +
Sbjct: 135 TNTCQVCERNLLDSFSFCSLGCKIVGTSKKFR-----KKKMLAETEGSNGEESINGISNE 189

Query: 176 VIKKKIRSFSPSTPPQTCA---SGKRRKGIPHRAPTGG 72
             + KI+SF+PSTPP T     + KRRKG+PHRAP GG
Sbjct: 190 SGRNKIQSFTPSTPPPTVVNYRTAKRRKGVPHRAPMGG 227


>ref|XP_003537704.1| PREDICTED: uncharacterized protein LOC100792668 isoform X1 [Glycine
           max] gi|734409047|gb|KHN35154.1| hypothetical protein
           glysoja_004920 [Glycine soja]
          Length = 231

 Score =  311 bits (798), Expect = 2e-82
 Identities = 156/218 (71%), Positives = 175/218 (80%), Gaps = 12/218 (5%)
 Frame = -3

Query: 689 MGEDEDDY------SKWPAWLRPLLQTSFFGQCKLHASAHKSECNMYCLDCMNGALCSLC 528
           + ED+DD       +KWP WLRPLLQTSFF QCK+HA +HKSECNMYCLDCMNGALCS C
Sbjct: 13  LNEDDDDIGKIEEENKWPPWLRPLLQTSFFVQCKVHADSHKSECNMYCLDCMNGALCSAC 72

Query: 527 LNFHKEHRAIQIRRSSYHDVIRVSEIQKYLDITGVQTYIINSAKIVFLNERPQPRPGKGV 348
           L  H+EHRAIQIRRSSYHDVIRVSEIQK+LDI GVQTYIINSAKIVFLNERPQPRPGKGV
Sbjct: 73  LASHREHRAIQIRRSSYHDVIRVSEIQKFLDIAGVQTYIINSAKIVFLNERPQPRPGKGV 132

Query: 347 TNTCRVCERSLLDSFNFCSLGCKIVGTSNNFEXXXXXKRNICVEVGGSDSEESING---K 177
           TNTC+VCER+LLDSF+FCSLGCKIVGTS  F      K+ +  E  GS+ EESING   +
Sbjct: 133 TNTCQVCERNLLDSFSFCSLGCKIVGTSKKFR-----KKKMLAETEGSNGEESINGISNE 187

Query: 176 VIKKKIRSFSPSTPPQTCA---SGKRRKGIPHRAPTGG 72
             + KI+SF+PSTPP T     + KRRKG+PHRAP GG
Sbjct: 188 SGRNKIQSFTPSTPPPTVVNYRTAKRRKGVPHRAPMGG 225


>ref|XP_007157082.1| hypothetical protein PHAVU_002G041400g [Phaseolus vulgaris]
           gi|561030497|gb|ESW29076.1| hypothetical protein
           PHAVU_002G041400g [Phaseolus vulgaris]
          Length = 232

 Score =  311 bits (796), Expect = 4e-82
 Identities = 153/212 (72%), Positives = 175/212 (82%), Gaps = 6/212 (2%)
 Frame = -3

Query: 689 MGEDEDDYSKWPAWLRPLLQTSFFGQCKLHASAHKSECNMYCLDCMNGALCSLCLNFHKE 510
           +G+DE++  KWP WLRPLLQTSFF QCK+H+ +HKSECNM+CLDC+NGALCS CL  H+E
Sbjct: 21  IGDDEEE--KWPPWLRPLLQTSFFVQCKVHSDSHKSECNMFCLDCVNGALCSACLASHRE 78

Query: 509 HRAIQIRRSSYHDVIRVSEIQKYLDITGVQTYIINSAKIVFLNERPQPRPGKGVTNTCRV 330
           HRAIQIRRSSYHDVIRVSEIQK++DI GVQTYIINSAKIVFLNERPQPRPGKGVTNTC+V
Sbjct: 79  HRAIQIRRSSYHDVIRVSEIQKFVDIAGVQTYIINSAKIVFLNERPQPRPGKGVTNTCQV 138

Query: 329 CERSLLDSFNFCSLGCKIVGTSNNFEXXXXXKRNICVEVGGSDSEESING---KVIKKKI 159
           CERSLLDSF+FCSLGCKIVGTS  F      K+ +  E  GSD EESING   +  + KI
Sbjct: 139 CERSLLDSFSFCSLGCKIVGTSKKFR-----KKKLFAETDGSDCEESINGISNESARNKI 193

Query: 158 RSFSPSTPPQTCA---SGKRRKGIPHRAPTGG 72
           +SF+PSTPP T     S KRRKG+PHR+P GG
Sbjct: 194 QSFTPSTPPPTVVNYRSAKRRKGVPHRSPMGG 225


>ref|XP_003540084.1| PREDICTED: uncharacterized protein LOC100810757 [Glycine max]
          Length = 223

 Score =  311 bits (796), Expect = 4e-82
 Identities = 152/210 (72%), Positives = 172/210 (81%), Gaps = 7/210 (3%)
 Frame = -3

Query: 680 DEDDYSKWPAWLRPLLQTSFFGQCKLHASAHKSECNMYCLDCMNGALCSLCLNFHKEHRA 501
           ++++ +KWP+WL+PLL+T FF QCK+HA +HKSECNMYCLDC+NGALCS CL  HKEHR 
Sbjct: 13  EKEEENKWPSWLQPLLKTRFFVQCKVHADSHKSECNMYCLDCVNGALCSACLASHKEHRI 72

Query: 500 IQIRRSSYHDVIRVSEIQKYLDITGVQTYIINSAKIVFLNERPQPRPGKGVTNTCRVCER 321
           IQIRRSSYHDVIRVSEIQK+LDITGVQTYIINSAKIVFLNERPQPRPGKGVTNTC+VCER
Sbjct: 73  IQIRRSSYHDVIRVSEIQKFLDITGVQTYIINSAKIVFLNERPQPRPGKGVTNTCQVCER 132

Query: 320 SLLDSFNFCSLGCKIVGTSNNFEXXXXXKRNICVEVGGSDSEESINGKV----IKKKIRS 153
           SLLDSFNFCSLGCKIVGTS  F      K+ +  E  GSD EES+NG +     + KI S
Sbjct: 133 SLLDSFNFCSLGCKIVGTSKKFR-----KKKMLSEADGSDGEESVNGIISNASARNKIHS 187

Query: 152 FSPSTPPQTCA---SGKRRKGIPHRAPTGG 72
           F+PSTPP T     + KRRKGIPHRAP GG
Sbjct: 188 FTPSTPPPTVVNYRTAKRRKGIPHRAPMGG 217


>ref|XP_003537821.1| PREDICTED: uncharacterized protein LOC100810888 [Glycine max]
           gi|734418945|gb|KHN39862.1| hypothetical protein
           glysoja_020602 [Glycine soja]
          Length = 223

 Score =  311 bits (796), Expect = 4e-82
 Identities = 152/211 (72%), Positives = 171/211 (81%), Gaps = 7/211 (3%)
 Frame = -3

Query: 683 EDEDDYSKWPAWLRPLLQTSFFGQCKLHASAHKSECNMYCLDCMNGALCSLCLNFHKEHR 504
           E+E++ +KWP+WL+PLL+T FF QCK+HA +HKSECNMYCLDC+NGALCS CL+ HKEHR
Sbjct: 15  EEEEENNKWPSWLQPLLKTRFFVQCKVHADSHKSECNMYCLDCVNGALCSACLSSHKEHR 74

Query: 503 AIQIRRSSYHDVIRVSEIQKYLDITGVQTYIINSAKIVFLNERPQPRPGKGVTNTCRVCE 324
            IQIRRSSYHDVIRVSEIQK+LDITGVQTYIINSAKIVFLNERPQPRPGKGVTNTC+VCE
Sbjct: 75  IIQIRRSSYHDVIRVSEIQKFLDITGVQTYIINSAKIVFLNERPQPRPGKGVTNTCQVCE 134

Query: 323 RSLLDSFNFCSLGCKIVGTSNNFEXXXXXKRNICVEVGGSDSEESING----KVIKKKIR 156
           RSLLDSFNFCSLGCKIVGTS  F             +G +D EES+NG       +KKI 
Sbjct: 135 RSLLDSFNFCSLGCKIVGTSKKFRNKKM--------LGEADGEESVNGIRNNASARKKIH 186

Query: 155 SFSPSTPPQTCA---SGKRRKGIPHRAPTGG 72
           SF+PSTPP T     + KRRKGIPHRAP GG
Sbjct: 187 SFTPSTPPPTVVNYRTAKRRKGIPHRAPMGG 217


>gb|KHN46470.1| hypothetical protein glysoja_035146 [Glycine soja]
          Length = 223

 Score =  310 bits (795), Expect = 5e-82
 Identities = 152/210 (72%), Positives = 172/210 (81%), Gaps = 7/210 (3%)
 Frame = -3

Query: 680 DEDDYSKWPAWLRPLLQTSFFGQCKLHASAHKSECNMYCLDCMNGALCSLCLNFHKEHRA 501
           ++++ +KWP+WL+PLL+T FF QCK+HA +HKSECNMYCLDC+NGALCS CL  HKEHR 
Sbjct: 13  EKEEENKWPSWLQPLLKTRFFVQCKVHADSHKSECNMYCLDCVNGALCSACLASHKEHRI 72

Query: 500 IQIRRSSYHDVIRVSEIQKYLDITGVQTYIINSAKIVFLNERPQPRPGKGVTNTCRVCER 321
           IQIRRSSYHDVIRVSEIQK+LDITGVQTYIINSAKIVFLNERPQPRPGKGVTNTC+VCER
Sbjct: 73  IQIRRSSYHDVIRVSEIQKFLDITGVQTYIINSAKIVFLNERPQPRPGKGVTNTCQVCER 132

Query: 320 SLLDSFNFCSLGCKIVGTSNNFEXXXXXKRNICVEVGGSDSEESINGKV----IKKKIRS 153
           SLLDSFNFCSLGCKIVGTS  F      K+ +  E  GSD EES+NG +     + KI S
Sbjct: 133 SLLDSFNFCSLGCKIVGTSKKFR-----KKKMLGEADGSDGEESVNGIISNASARNKIHS 187

Query: 152 FSPSTPPQTCA---SGKRRKGIPHRAPTGG 72
           F+PSTPP T     + KRRKGIPHRAP GG
Sbjct: 188 FTPSTPPPTVVNYRTAKRRKGIPHRAPMGG 217


>ref|XP_002325391.1| hypothetical protein POPTR_0019s07790g [Populus trichocarpa]
           gi|222862266|gb|EEE99772.1| hypothetical protein
           POPTR_0019s07790g [Populus trichocarpa]
          Length = 241

 Score =  310 bits (794), Expect = 6e-82
 Identities = 159/234 (67%), Positives = 182/234 (77%), Gaps = 15/234 (6%)
 Frame = -3

Query: 728 ISPNIEGHQFQKIMG-----EDEDDYS---KWPAWLRPLLQTSFFGQCKLHASAHKSECN 573
           I PNI     ++IMG     +D DD+    KWP WL PLL+TSFF QCKLHA AHKSECN
Sbjct: 13  IKPNI-----RRIMGGGGPDDDVDDHKEDIKWPPWLHPLLETSFFVQCKLHADAHKSECN 67

Query: 572 MYCLDCMNGALCSLCLNFHKEHRAIQIRRSSYHDVIRVSEIQKYLDITGVQTYIINSAKI 393
           MYCLDCMNGALCS+CL+ H +HRAIQIRRSSYHDVIRVSEIQKYLDITGVQTYIINSA+I
Sbjct: 68  MYCLDCMNGALCSVCLSLHSDHRAIQIRRSSYHDVIRVSEIQKYLDITGVQTYIINSARI 127

Query: 392 VFLNERPQPRPGKGVTNTCRVCERSLLDSFNFCSLGCKIVGTSNNFEXXXXXKRNICVEV 213
           VFLNERPQPRPGKGVTNTC VCERSLLDSF+FCSLGCKIVGTS NF      K+    E+
Sbjct: 128 VFLNERPQPRPGKGVTNTCHVCERSLLDSFSFCSLGCKIVGTSKNFR-----KKKRYKEM 182

Query: 212 GGSDSEESING---KVIKKKIRSFSPSTPPQTC----ASGKRRKGIPHRAPTGG 72
            GSD++ES+ G      + K++SF+PSTPP +      + KRRKG+PHR+P GG
Sbjct: 183 DGSDTDESMKGIGNGGARSKVQSFTPSTPPPSAMNNYRTAKRRKGVPHRSPMGG 236


>ref|XP_010088252.1| hypothetical protein L484_003212 [Morus notabilis]
           gi|587842416|gb|EXB32999.1| hypothetical protein
           L484_003212 [Morus notabilis]
          Length = 213

 Score =  309 bits (791), Expect = 1e-81
 Identities = 149/209 (71%), Positives = 169/209 (80%), Gaps = 3/209 (1%)
 Frame = -3

Query: 689 MGEDEDDYSKWPAWLRPLLQTSFFGQCKLHASAHKSECNMYCLDCMNGALCSLCLNFHKE 510
           MG +E++  +WP WL+PLL+ SFF QCKLHA +HKSECNMYCLDCMNGALCSLCLNFHK+
Sbjct: 1   MGPEEEENHRWPPWLKPLLKESFFVQCKLHADSHKSECNMYCLDCMNGALCSLCLNFHKD 60

Query: 509 HRAIQIRRSSYHDVIRVSEIQKYLDITGVQTYIINSAKIVFLNERPQPRPGKGVTNTCRV 330
           HRAIQIRRSSYHDVIRVSEIQK LDI+GVQTYIINSA++VFLNERPQPRPGKGVTNTC V
Sbjct: 61  HRAIQIRRSSYHDVIRVSEIQKVLDISGVQTYIINSARVVFLNERPQPRPGKGVTNTCEV 120

Query: 329 CERSLLDSFNFCSLGCKIVGTSNNFEXXXXXKRNICVEVGGSDSEESINGKVIKKKIRSF 150
           CERSLLDSF FCSLGCKIVGTS NF+        +  +   S S  S +G+    K++SF
Sbjct: 121 CERSLLDSFRFCSLGCKIVGTSKNFQKKRKQMAVMASDSEDSYSSSSSHGR-SSNKLQSF 179

Query: 149 SPSTPPQTCA---SGKRRKGIPHRAPTGG 72
           +PSTPP T     + KRRKGIPHRAP GG
Sbjct: 180 TPSTPPPTSVNYRTAKRRKGIPHRAPMGG 208


>ref|XP_010240867.1| PREDICTED: uncharacterized protein LOC104585621 [Nelumbo nucifera]
          Length = 213

 Score =  308 bits (790), Expect = 2e-81
 Identities = 151/208 (72%), Positives = 167/208 (80%), Gaps = 3/208 (1%)
 Frame = -3

Query: 686 GEDEDDYSKWPAWLRPLLQTSFFGQCKLHASAHKSECNMYCLDCMNGALCSLCLNFHKEH 507
           G  +DD ++WP WLRPLL+T FF QCK HA +HKSECNMYCLDCMNGALCSLCL +HK+H
Sbjct: 4   GGPDDDDNRWPPWLRPLLRTPFFVQCKFHADSHKSECNMYCLDCMNGALCSLCLAYHKDH 63

Query: 506 RAIQIRRSSYHDVIRVSEIQKYLDITGVQTYIINSAKIVFLNERPQPRPGKGVTNTCRVC 327
           RAIQIRRSSYHDVIRVSEIQK LDI+GVQTYIINSA++VFLNERPQPRPGKGVTNTC VC
Sbjct: 64  RAIQIRRSSYHDVIRVSEIQKVLDISGVQTYIINSARVVFLNERPQPRPGKGVTNTCEVC 123

Query: 326 ERSLLDSFNFCSLGCKIVGTSNNFEXXXXXKRNICVEVGGSDSEESINGKVIKKKIRSFS 147
           ERSLLDSF FCSLGCKIVGTS NF+     KR+        DS  S N    KKK++SF+
Sbjct: 124 ERSLLDSFRFCSLGCKIVGTSKNFQ---KRKRSSATASDSEDSYSSSNHDHEKKKVQSFT 180

Query: 146 PSTPPQTCA---SGKRRKGIPHRAPTGG 72
           PSTPP T     + KRRKGIPHRAP GG
Sbjct: 181 PSTPPPTLVNYRTAKRRKGIPHRAPLGG 208

