BLASTX nr result

ID: Akebia22_contig00018221 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia22_contig00018221
         (1471 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007033958.1| Uncharacterized protein isoform 5, partial [...    88   1e-14
ref|XP_007033956.1| Uncharacterized protein isoform 3 [Theobroma...    88   1e-14
ref|XP_007033954.1| Uncharacterized protein isoform 1 [Theobroma...    88   1e-14
ref|XP_007131948.1| hypothetical protein PHAVU_011G0542000g [Pha...    82   6e-13
ref|XP_007131947.1| hypothetical protein PHAVU_011G0542000g [Pha...    82   6e-13
ref|XP_006595065.1| PREDICTED: uncharacterized protein LOC102661...    80   3e-12
ref|XP_006595064.1| PREDICTED: uncharacterized protein LOC102661...    80   3e-12
ref|XP_006590935.1| PREDICTED: uncharacterized protein LOC102668...    79   5e-12
ref|XP_002264971.2| PREDICTED: uncharacterized protein LOC100261...    79   5e-12
emb|CBI36663.3| unnamed protein product [Vitis vinifera]               79   5e-12
emb|CAN70075.1| hypothetical protein VITISV_038385 [Vitis vinifera]    79   5e-12
ref|XP_006478697.1| PREDICTED: uncharacterized protein LOC102618...    78   1e-11
ref|XP_006443033.1| hypothetical protein CICLE_v10019428mg [Citr...    78   1e-11
ref|XP_002533446.1| hypothetical protein RCOM_0656430 [Ricinus c...    77   3e-11
ref|XP_006372874.1| hypothetical protein POPTR_0017s05870g [Popu...    74   2e-10
ref|XP_006372873.1| hypothetical protein POPTR_0017s05870g [Popu...    74   2e-10
ref|XP_006372872.1| hypothetical protein POPTR_0017s05870g [Popu...    74   2e-10
ref|XP_006592148.1| PREDICTED: uncharacterized protein LOC100779...    72   5e-10
gb|EXB68728.1| hypothetical protein L484_024748 [Morus notabilis]      72   6e-10
ref|XP_002309806.1| hypothetical protein POPTR_0007s01970g [Popu...    70   3e-09

>ref|XP_007033958.1| Uncharacterized protein isoform 5, partial [Theobroma cacao]
            gi|508712987|gb|EOY04884.1| Uncharacterized protein
            isoform 5, partial [Theobroma cacao]
          Length = 475

 Score = 87.8 bits (216), Expect = 1e-14
 Identities = 98/370 (26%), Positives = 156/370 (42%), Gaps = 55/370 (14%)
 Frame = +2

Query: 140  TGHSQFCSQYDDSGLKFKRKEPSNSNEVRNSDKNFPNPTRNSKQGN-IPRIPCQNINSG- 313
            T  SQF SQYDD+G K      +N       ++N  +  R+ +  N +P+I  ++ N+G 
Sbjct: 101  TPQSQFVSQYDDNGFK------NNEFPTICFNRNCLSLMRDGRMNNMLPQIEPRDSNAGA 154

Query: 314  ------FVSTNGICSEVPSTSFEFYVNSEEGIDLFVDLSSSPSDWIRSLKNKVCISSEFH 475
                  F S+    + V   +F+F+V+SEEGI+L+VDL+S+PS+W+  +K++V I     
Sbjct: 155  CSNEIAFPSSIKTPTTVFPATFQFHVSSEEGINLYVDLNSNPSEWVEKMKSEVSICQNMS 214

Query: 476  DSKSGARPFDLKGLGDSDGQMKSSLLGNTGFG-FQTGGEPVSIKSSLGSIVGE------- 631
              KS     +L   G+S  QMKSS   N   G  + G E   +  SL  I+ E       
Sbjct: 215  HGKSRTFHRELGRFGESSKQMKSSFQLNVDAGKIKDGHEHTGLSPSL--IIKENNQLQLD 272

Query: 632  -------LLGPSAVIPTGTQIKLSGHVE-------------ENVQMVSS------LCETN 733
                    LG + + P+G  + +S H+E                Q++S       L   +
Sbjct: 273  HPDGDDGSLGSTVMTPSGRAVDVSEHLEGDQGLTLIKAHPDSQDQIISGGAKDGCLITPD 332

Query: 734  SNLQSNGSSPRHGEAVTLG---LEFPNTCQINPSRISSVSHERGLPLPCETQEAIAY--- 895
            SN+ S+         + +    L    T Q N    + +     L   C           
Sbjct: 333  SNINSHREKLASDAVLNISDSPLNLLTTEQQNSKLENKICENSSLQNGCNLVSPSGIIPR 392

Query: 896  ---SGSMEMQLSGVVSHCEDTSNFLCLNG---GKSDLMHDLQAE-GGESEKCEFDQNTGM 1054
                GS+++ +   V H  D  +    NG   G  +L H++ AE GG +   E D  T  
Sbjct: 393  CLADGSLQIPMPQDVVHHNDALHSPSENGEFVGMVNLEHNIYAEQGGLAGSTELDPKTYR 452

Query: 1055 DPQSALPEER 1084
            +    L EE+
Sbjct: 453  NRLPTLVEEQ 462


>ref|XP_007033956.1| Uncharacterized protein isoform 3 [Theobroma cacao]
            gi|590655332|ref|XP_007033957.1| Uncharacterized protein
            isoform 3 [Theobroma cacao] gi|508712985|gb|EOY04882.1|
            Uncharacterized protein isoform 3 [Theobroma cacao]
            gi|508712986|gb|EOY04883.1| Uncharacterized protein
            isoform 3 [Theobroma cacao]
          Length = 503

 Score = 87.8 bits (216), Expect = 1e-14
 Identities = 98/370 (26%), Positives = 156/370 (42%), Gaps = 55/370 (14%)
 Frame = +2

Query: 140  TGHSQFCSQYDDSGLKFKRKEPSNSNEVRNSDKNFPNPTRNSKQGN-IPRIPCQNINSG- 313
            T  SQF SQYDD+G K      +N       ++N  +  R+ +  N +P+I  ++ N+G 
Sbjct: 138  TPQSQFVSQYDDNGFK------NNEFPTICFNRNCLSLMRDGRMNNMLPQIEPRDSNAGA 191

Query: 314  ------FVSTNGICSEVPSTSFEFYVNSEEGIDLFVDLSSSPSDWIRSLKNKVCISSEFH 475
                  F S+    + V   +F+F+V+SEEGI+L+VDL+S+PS+W+  +K++V I     
Sbjct: 192  CSNEIAFPSSIKTPTTVFPATFQFHVSSEEGINLYVDLNSNPSEWVEKMKSEVSICQNMS 251

Query: 476  DSKSGARPFDLKGLGDSDGQMKSSLLGNTGFG-FQTGGEPVSIKSSLGSIVGE------- 631
              KS     +L   G+S  QMKSS   N   G  + G E   +  SL  I+ E       
Sbjct: 252  HGKSRTFHRELGRFGESSKQMKSSFQLNVDAGKIKDGHEHTGLSPSL--IIKENNQLQLD 309

Query: 632  -------LLGPSAVIPTGTQIKLSGHVE-------------ENVQMVSS------LCETN 733
                    LG + + P+G  + +S H+E                Q++S       L   +
Sbjct: 310  HPDGDDGSLGSTVMTPSGRAVDVSEHLEGDQGLTLIKAHPDSQDQIISGGAKDGCLITPD 369

Query: 734  SNLQSNGSSPRHGEAVTLG---LEFPNTCQINPSRISSVSHERGLPLPCETQEAIAY--- 895
            SN+ S+         + +    L    T Q N    + +     L   C           
Sbjct: 370  SNINSHREKLASDAVLNISDSPLNLLTTEQQNSKLENKICENSSLQNGCNLVSPSGIIPR 429

Query: 896  ---SGSMEMQLSGVVSHCEDTSNFLCLNG---GKSDLMHDLQAE-GGESEKCEFDQNTGM 1054
                GS+++ +   V H  D  +    NG   G  +L H++ AE GG +   E D  T  
Sbjct: 430  CLADGSLQIPMPQDVVHHNDALHSPSENGEFVGMVNLEHNIYAEQGGLAGSTELDPKTYR 489

Query: 1055 DPQSALPEER 1084
            +    L EE+
Sbjct: 490  NRLPTLVEEQ 499


>ref|XP_007033954.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|590655326|ref|XP_007033955.1| Uncharacterized protein
            isoform 1 [Theobroma cacao] gi|508712983|gb|EOY04880.1|
            Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508712984|gb|EOY04881.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 577

 Score = 87.8 bits (216), Expect = 1e-14
 Identities = 98/370 (26%), Positives = 156/370 (42%), Gaps = 55/370 (14%)
 Frame = +2

Query: 140  TGHSQFCSQYDDSGLKFKRKEPSNSNEVRNSDKNFPNPTRNSKQGN-IPRIPCQNINSG- 313
            T  SQF SQYDD+G K      +N       ++N  +  R+ +  N +P+I  ++ N+G 
Sbjct: 138  TPQSQFVSQYDDNGFK------NNEFPTICFNRNCLSLMRDGRMNNMLPQIEPRDSNAGA 191

Query: 314  ------FVSTNGICSEVPSTSFEFYVNSEEGIDLFVDLSSSPSDWIRSLKNKVCISSEFH 475
                  F S+    + V   +F+F+V+SEEGI+L+VDL+S+PS+W+  +K++V I     
Sbjct: 192  CSNEIAFPSSIKTPTTVFPATFQFHVSSEEGINLYVDLNSNPSEWVEKMKSEVSICQNMS 251

Query: 476  DSKSGARPFDLKGLGDSDGQMKSSLLGNTGFG-FQTGGEPVSIKSSLGSIVGE------- 631
              KS     +L   G+S  QMKSS   N   G  + G E   +  SL  I+ E       
Sbjct: 252  HGKSRTFHRELGRFGESSKQMKSSFQLNVDAGKIKDGHEHTGLSPSL--IIKENNQLQLD 309

Query: 632  -------LLGPSAVIPTGTQIKLSGHVE-------------ENVQMVSS------LCETN 733
                    LG + + P+G  + +S H+E                Q++S       L   +
Sbjct: 310  HPDGDDGSLGSTVMTPSGRAVDVSEHLEGDQGLTLIKAHPDSQDQIISGGAKDGCLITPD 369

Query: 734  SNLQSNGSSPRHGEAVTLG---LEFPNTCQINPSRISSVSHERGLPLPCETQEAIAY--- 895
            SN+ S+         + +    L    T Q N    + +     L   C           
Sbjct: 370  SNINSHREKLASDAVLNISDSPLNLLTTEQQNSKLENKICENSSLQNGCNLVSPSGIIPR 429

Query: 896  ---SGSMEMQLSGVVSHCEDTSNFLCLNG---GKSDLMHDLQAE-GGESEKCEFDQNTGM 1054
                GS+++ +   V H  D  +    NG   G  +L H++ AE GG +   E D  T  
Sbjct: 430  CLADGSLQIPMPQDVVHHNDALHSPSENGEFVGMVNLEHNIYAEQGGLAGSTELDPKTYR 489

Query: 1055 DPQSALPEER 1084
            +    L EE+
Sbjct: 490  NRLPTLVEEQ 499


>ref|XP_007131948.1| hypothetical protein PHAVU_011G0542000g [Phaseolus vulgaris]
           gi|593155462|ref|XP_007131949.1| hypothetical protein
           PHAVU_011G0542000g [Phaseolus vulgaris]
           gi|561004948|gb|ESW03942.1| hypothetical protein
           PHAVU_011G0542000g [Phaseolus vulgaris]
           gi|561004949|gb|ESW03943.1| hypothetical protein
           PHAVU_011G0542000g [Phaseolus vulgaris]
          Length = 430

 Score = 82.0 bits (201), Expect = 6e-13
 Identities = 40/91 (43%), Positives = 57/91 (62%)
 Frame = +2

Query: 299 NINSGFVSTNGICSEVPSTSFEFYVNSEEGIDLFVDLSSSPSDWIRSLKNKVCISSEFHD 478
           ++   F S+    +E P +SFEFYV S+ G+ L VDL+ SP+DWI   +N+VCIS   H+
Sbjct: 45  SVQKDFASSVKAATEAPPSSFEFYVWSDVGVSLHVDLNLSPTDWINRFRNEVCISENIHE 104

Query: 479 SKSGARPFDLKGLGDSDGQMKSSLLGNTGFG 571
           +KSG+   DL  L ++  Q KSS L +T  G
Sbjct: 105 NKSGSLWQDLSDLAENSAQGKSSFLWSTNSG 135


>ref|XP_007131947.1| hypothetical protein PHAVU_011G0542000g [Phaseolus vulgaris]
           gi|561004947|gb|ESW03941.1| hypothetical protein
           PHAVU_011G0542000g [Phaseolus vulgaris]
          Length = 443

 Score = 82.0 bits (201), Expect = 6e-13
 Identities = 40/91 (43%), Positives = 57/91 (62%)
 Frame = +2

Query: 299 NINSGFVSTNGICSEVPSTSFEFYVNSEEGIDLFVDLSSSPSDWIRSLKNKVCISSEFHD 478
           ++   F S+    +E P +SFEFYV S+ G+ L VDL+ SP+DWI   +N+VCIS   H+
Sbjct: 58  SVQKDFASSVKAATEAPPSSFEFYVWSDVGVSLHVDLNLSPTDWINRFRNEVCISENIHE 117

Query: 479 SKSGARPFDLKGLGDSDGQMKSSLLGNTGFG 571
           +KSG+   DL  L ++  Q KSS L +T  G
Sbjct: 118 NKSGSLWQDLSDLAENSAQGKSSFLWSTNSG 148


>ref|XP_006595065.1| PREDICTED: uncharacterized protein LOC102661248 isoform X2 [Glycine
           max]
          Length = 373

 Score = 79.7 bits (195), Expect = 3e-12
 Identities = 67/200 (33%), Positives = 94/200 (47%), Gaps = 20/200 (10%)
 Frame = +2

Query: 179 GLKFKRKEPSNSNEVRNSDKNFPNPTRNSKQGNIPRIPCQNINSGFVS-----TNGI-CS 340
           G   KRK P     VRNSD +  N TR+    N  +  C  + +G  S     T+ +  +
Sbjct: 40  GASKKRKLP-----VRNSDVHLRN-TRSKTVKNFHQNNCGAVETGVSSEEKDFTSSLKAT 93

Query: 341 EVPSTSFEFYVNSEEGIDLFVDLSSSPSDWIRSLKNKVCISSEFHDSKSGARPF-DLKGL 517
             P   FEFYV S+EGI+LF+DL+SSPSDW    +N+VC+S +    K     + DL  L
Sbjct: 94  NQPPRCFEFYVRSDEGINLFIDLNSSPSDWTNRYRNEVCVSEKVCRKKEFRSLWQDLSSL 153

Query: 518 GDSDGQMKSSLLGNTGFG-FQTGGEPVSIKSSLGSIVGELLG---------PS---AVIP 658
           G S  Q KSS + NT  G F           SL  +  ++ G         PS   ++ P
Sbjct: 154 GGSSTQGKSSFIWNTNSGHFDDCNGQTKYAPSLKLVKEDVTGLDQQNIGCCPSIYDSLTP 213

Query: 659 TGTQIKLSGHVEENVQMVSS 718
               + +  +V EN   VS+
Sbjct: 214 CAMTVNVEKNVNENQSTVST 233


>ref|XP_006595064.1| PREDICTED: uncharacterized protein LOC102661248 isoform X1 [Glycine
           max]
          Length = 382

 Score = 79.7 bits (195), Expect = 3e-12
 Identities = 67/200 (33%), Positives = 94/200 (47%), Gaps = 20/200 (10%)
 Frame = +2

Query: 179 GLKFKRKEPSNSNEVRNSDKNFPNPTRNSKQGNIPRIPCQNINSGFVS-----TNGI-CS 340
           G   KRK P     VRNSD +  N TR+    N  +  C  + +G  S     T+ +  +
Sbjct: 40  GASKKRKLP-----VRNSDVHLRN-TRSKTVKNFHQNNCGAVETGVSSEEKDFTSSLKAT 93

Query: 341 EVPSTSFEFYVNSEEGIDLFVDLSSSPSDWIRSLKNKVCISSEFHDSKSGARPF-DLKGL 517
             P   FEFYV S+EGI+LF+DL+SSPSDW    +N+VC+S +    K     + DL  L
Sbjct: 94  NQPPRCFEFYVRSDEGINLFIDLNSSPSDWTNRYRNEVCVSEKVCRKKEFRSLWQDLSSL 153

Query: 518 GDSDGQMKSSLLGNTGFG-FQTGGEPVSIKSSLGSIVGELLG---------PS---AVIP 658
           G S  Q KSS + NT  G F           SL  +  ++ G         PS   ++ P
Sbjct: 154 GGSSTQGKSSFIWNTNSGHFDDCNGQTKYAPSLKLVKEDVTGLDQQNIGCCPSIYDSLTP 213

Query: 659 TGTQIKLSGHVEENVQMVSS 718
               + +  +V EN   VS+
Sbjct: 214 CAMTVNVEKNVNENQSTVST 233


>ref|XP_006590935.1| PREDICTED: uncharacterized protein LOC102668780 isoform X1 [Glycine
           max] gi|571488438|ref|XP_006590936.1| PREDICTED:
           uncharacterized protein LOC102668780 isoform X2 [Glycine
           max]
          Length = 418

 Score = 79.0 bits (193), Expect = 5e-12
 Identities = 59/178 (33%), Positives = 91/178 (51%), Gaps = 14/178 (7%)
 Frame = +2

Query: 260 NSKQGNIPRIP-CQNINSGFVSTNGICSEVPSTSFEFYVNSEEGIDLFVDLSSSPSDWIR 436
           N  Q N+  +    ++   F S+    +EVP + F+FYV S+ GI+L VDL+ S SDWI 
Sbjct: 8   NCHQNNVRAVAGAPSVEKDFASSIKAATEVPPSYFQFYVWSDVGINLHVDLNLSSSDWIN 67

Query: 437 SLKNKVCISSEFHDSKSGARPFDLKGLGDSDGQMKSSLLGNTGFG--FQTGGE-----PV 595
             +N+VCIS   H +KS +   DL GLG++  Q KSS L +T  G     GG+      +
Sbjct: 68  RFRNEVCISENMHRNKSRSLWQDLSGLGENYTQGKSSFLLSTNSGQIEDHGGQARSSSSL 127

Query: 596 SIKSSLGSIVGEL------LGPSAVIPTGTQIKLSGHVEENVQMVSSLCETNSNLQSN 751
            +K    + +G+       L   ++ P    I++   ++EN   VS+  E N N+  N
Sbjct: 128 KLKKDGATELGQQNKDDIPLICDSLTPCSMTIEVKDDLQENHSTVSA--ELNVNVVDN 183


>ref|XP_002264971.2| PREDICTED: uncharacterized protein LOC100261223 [Vitis vinifera]
          Length = 616

 Score = 79.0 bits (193), Expect = 5e-12
 Identities = 99/345 (28%), Positives = 139/345 (40%), Gaps = 53/345 (15%)
 Frame = +2

Query: 140  TGHSQFCSQYDDSGLKFKRKEPSNSNEVRNSDKNFPNPTRNSKQGNIPRIPCQNINSGFV 319
            T HSQ  +  D+SG   K+  P        S KN  +  R    G+ P I  ++IN G  
Sbjct: 149  TSHSQIVTLRDESGFNSKKDSPK-----MRSGKNCFDHAREGGAGDFPPIQHRDINIGAS 203

Query: 320  S---TNGICSEVPSTSFEFYVNSEEGIDLFVDLSSSPSDWIRSLKNKV------------ 454
            S    +   ++ PS+ FE++V S+EGI+L+VDL+S  S W   LKN+V            
Sbjct: 204  SGESASASSTKAPSSLFEYHVRSDEGINLYVDLNSGSSYWTNRLKNEVYVCRHESNQRFQ 263

Query: 455  ---------CISSEFHDSKSGARPFDLKGLGDSDGQMK-----SSLLGNTGFGFQTGG-E 589
                      ++S  HD K+ +    L G G +DG ++     SS L   G    T G +
Sbjct: 264  GIHQDLGQRLVASARHD-KNLSLWNKLSGCGANDGHVETGSLPSSNLRENGLMEDTPGVD 322

Query: 590  PVSIKS---SLGSIVGELLGPSAVIPTGTQ-IKLSGHVEENVQ--MVS---SLCETNSNL 742
              S +S      SI  E LG     P   Q I LS     +VQ  M+S   +  E     
Sbjct: 323  DGSFRSGEVQACSIAVETLGS----PEEDQAILLSSRPSSDVQNHMISGTKTCSEDGETT 378

Query: 743  QSNGSSPRHGEAVTLGLEFPNTCQINPSRISSVSHERGLPLP--CETQ---------EAI 889
              N S     +  +      N+    P   ++  H+        CE              
Sbjct: 379  TLNSSVCSFSKVKSTSNSVANSTSDGPKSFNAGEHQNSKLCTEICENSTLQNTSNIVNPS 438

Query: 890  AYSGSMEMQLSGVVSHCEDTSNFLCLNGGKSDL---MHDLQAEGG 1015
              SGS+EM+LS  V+HC   S   C NGG   L   MH  + E G
Sbjct: 439  VASGSVEMRLSEDVNHCTSASFSPCGNGGVLHLVNPMHKAETEHG 483


>emb|CBI36663.3| unnamed protein product [Vitis vinifera]
          Length = 581

 Score = 79.0 bits (193), Expect = 5e-12
 Identities = 99/345 (28%), Positives = 139/345 (40%), Gaps = 53/345 (15%)
 Frame = +2

Query: 140  TGHSQFCSQYDDSGLKFKRKEPSNSNEVRNSDKNFPNPTRNSKQGNIPRIPCQNINSGFV 319
            T HSQ  +  D+SG   K+  P        S KN  +  R    G+ P I  ++IN G  
Sbjct: 149  TSHSQIVTLRDESGFNSKKDSPK-----MRSGKNCFDHAREGGAGDFPPIQHRDINIGAS 203

Query: 320  S---TNGICSEVPSTSFEFYVNSEEGIDLFVDLSSSPSDWIRSLKNKV------------ 454
            S    +   ++ PS+ FE++V S+EGI+L+VDL+S  S W   LKN+V            
Sbjct: 204  SGESASASSTKAPSSLFEYHVRSDEGINLYVDLNSGSSYWTNRLKNEVYVCRHESNQRFQ 263

Query: 455  ---------CISSEFHDSKSGARPFDLKGLGDSDGQMK-----SSLLGNTGFGFQTGG-E 589
                      ++S  HD K+ +    L G G +DG ++     SS L   G    T G +
Sbjct: 264  GIHQDLGQRLVASARHD-KNLSLWNKLSGCGANDGHVETGSLPSSNLRENGLMEDTPGVD 322

Query: 590  PVSIKS---SLGSIVGELLGPSAVIPTGTQ-IKLSGHVEENVQ--MVS---SLCETNSNL 742
              S +S      SI  E LG     P   Q I LS     +VQ  M+S   +  E     
Sbjct: 323  DGSFRSGEVQACSIAVETLGS----PEEDQAILLSSRPSSDVQNHMISGTKTCSEDGETT 378

Query: 743  QSNGSSPRHGEAVTLGLEFPNTCQINPSRISSVSHERGLPLP--CETQ---------EAI 889
              N S     +  +      N+    P   ++  H+        CE              
Sbjct: 379  TLNSSVCSFSKVKSTSNSVANSTSDGPKSFNAGEHQNSKLCTEICENSTLQNTSNIVNPS 438

Query: 890  AYSGSMEMQLSGVVSHCEDTSNFLCLNGGKSDL---MHDLQAEGG 1015
              SGS+EM+LS  V+HC   S   C NGG   L   MH  + E G
Sbjct: 439  VASGSVEMRLSEDVNHCTSASFSPCGNGGVLHLVNPMHKAETEHG 483


>emb|CAN70075.1| hypothetical protein VITISV_038385 [Vitis vinifera]
          Length = 531

 Score = 79.0 bits (193), Expect = 5e-12
 Identities = 99/345 (28%), Positives = 139/345 (40%), Gaps = 53/345 (15%)
 Frame = +2

Query: 140  TGHSQFCSQYDDSGLKFKRKEPSNSNEVRNSDKNFPNPTRNSKQGNIPRIPCQNINSGFV 319
            T HSQ  +  D+SG   K+  P        S KN  +  R    G+ P I  ++IN G  
Sbjct: 150  TSHSQIVTLRDESGFNSKKDSPK-----MRSGKNCFDHAREGGAGDFPPIQHRDINIGAS 204

Query: 320  S---TNGICSEVPSTSFEFYVNSEEGIDLFVDLSSSPSDWIRSLKNKV------------ 454
            S    +   ++ PS+ FE++V S+EGI+L+VDL+S  S W   LKN+V            
Sbjct: 205  SGESASASSTKAPSSLFEYHVRSDEGINLYVDLNSGSSYWTNRLKNEVYVCRHESNQRFQ 264

Query: 455  ---------CISSEFHDSKSGARPFDLKGLGDSDGQMK-----SSLLGNTGFGFQTGG-E 589
                      ++S  HD K+ +    L G G +DG ++     SS L   G    T G +
Sbjct: 265  GIHQDLGQRLVASARHD-KNLSLWNKLSGCGANDGHVETGSLPSSNLRENGLMEDTPGVD 323

Query: 590  PVSIKS---SLGSIVGELLGPSAVIPTGTQ-IKLSGHVEENVQ--MVS---SLCETNSNL 742
              S +S      SI  E LG     P   Q I LS     +VQ  M+S   +  E     
Sbjct: 324  DGSFRSGEVQACSIAVETLGS----PEEDQAILLSSRPSSDVQNHMISGTKTCSEDGETT 379

Query: 743  QSNGSSPRHGEAVTLGLEFPNTCQINPSRISSVSHERGLPLP--CETQ---------EAI 889
              N S     +  +      N+    P   ++  H+        CE              
Sbjct: 380  TLNSSVCSFSKVKSTSNSVANSTSDGPKSFNAGEHQNSKLCTEICENSTLQNTSNXVNPS 439

Query: 890  AYSGSMEMQLSGVVSHCEDTSNFLCLNGGKSDL---MHDLQAEGG 1015
              SGS+EM+LS  V+HC   S   C NGG   L   MH  + E G
Sbjct: 440  VASGSVEMRLSEDVNHCTSASFSPCGNGGVLHLVNPMHKAETEHG 484


>ref|XP_006478697.1| PREDICTED: uncharacterized protein LOC102618334 isoform X1 [Citrus
           sinensis] gi|568849950|ref|XP_006478698.1| PREDICTED:
           uncharacterized protein LOC102618334 isoform X2 [Citrus
           sinensis]
          Length = 599

 Score = 77.8 bits (190), Expect = 1e-11
 Identities = 58/148 (39%), Positives = 78/148 (52%), Gaps = 14/148 (9%)
 Frame = +2

Query: 338 SEVPSTSFEFYVNSEEGIDLFVDLSSSPSDWIRSLKNKVCISSEFHDSKSGARPFDLKGL 517
           + + S+S EFYV SEEGI L VDLSS+PSDWI  LKN+V I      +K+ +   +L  L
Sbjct: 182 NSLSSSSLEFYVRSEEGIKLCVDLSSNPSDWINKLKNEVNICENTSHNKAPSFHQELGRL 241

Query: 518 GDSDGQMKSSLLGNTGFGFQTGGEPVSIKSSLGSIVGE--------------LLGPSAVI 655
           G+S+ Q KSS L N        G  V  +SS   +  E               L   A+ 
Sbjct: 242 GESNNQNKSSFLRNVDARQSKDGN-VQSESSPSILTKENKDVVLNHPEGGDGSLTSIAIK 300

Query: 656 PTGTQIKLSGHVEENVQMVSSLCETNSN 739
           P+G  + LS HV+E+  +VSS  E NS+
Sbjct: 301 PSGLAVVLSEHVQEDQGVVSS--EPNSD 326


>ref|XP_006443033.1| hypothetical protein CICLE_v10019428mg [Citrus clementina]
           gi|557545295|gb|ESR56273.1| hypothetical protein
           CICLE_v10019428mg [Citrus clementina]
          Length = 587

 Score = 77.8 bits (190), Expect = 1e-11
 Identities = 58/148 (39%), Positives = 78/148 (52%), Gaps = 14/148 (9%)
 Frame = +2

Query: 338 SEVPSTSFEFYVNSEEGIDLFVDLSSSPSDWIRSLKNKVCISSEFHDSKSGARPFDLKGL 517
           + + S+S EFYV SEEGI L VDLSS+PSDWI  LKN+V I      +K+ +   +L  L
Sbjct: 217 NSLSSSSLEFYVRSEEGIKLCVDLSSNPSDWINKLKNEVNICENTSHNKAPSFHQELGRL 276

Query: 518 GDSDGQMKSSLLGNTGFGFQTGGEPVSIKSSLGSIVGE--------------LLGPSAVI 655
           G+S+ Q KSS L N        G  V  +SS   +  E               L   A+ 
Sbjct: 277 GESNNQNKSSFLRNVDARQSKDGN-VQSESSPSILTKENKDVVLNHPEGGDGSLTSIAIK 335

Query: 656 PTGTQIKLSGHVEENVQMVSSLCETNSN 739
           P+G  + LS HV+E+  +VSS  E NS+
Sbjct: 336 PSGLAVVLSEHVQEDQGVVSS--EPNSD 361


>ref|XP_002533446.1| hypothetical protein RCOM_0656430 [Ricinus communis]
            gi|223526708|gb|EEF28942.1| hypothetical protein
            RCOM_0656430 [Ricinus communis]
          Length = 404

 Score = 76.6 bits (187), Expect = 3e-11
 Identities = 88/335 (26%), Positives = 135/335 (40%), Gaps = 80/335 (23%)
 Frame = +2

Query: 260  NSKQGNIPRIPCQNINSGFVSTNGICSEVPS-----TSFEFYVNSEEGIDLFVDLSSSPS 424
            N +  + P+  C+NIN G      I S + +      SFEFYVNSEEGI L VDL+SSPS
Sbjct: 37   NGRVEDTPQCCCRNINIGVCPKENISSAIRTYTKVPASFEFYVNSEEGIKLCVDLNSSPS 96

Query: 425  DWIRSLKNKVCISSEFHDSKSGARPFDLKGLGDSDGQMKSSLL----------------- 553
            DWI+   N++ + +   ++KS +   +L  + +S+ QM+SS+                  
Sbjct: 97   DWIKKYNNQISLCNNVGNAKSQSLHQELGRIEESNKQMRSSITASVDPGQINDDHIQAEL 156

Query: 554  --------GNTGFGFQTGGEPVSIKS--SLGSIV-----GELLGPSAVIPTGTQIKLSGH 688
                     N G     GG   S+ S   L S+V     G +     +I       +   
Sbjct: 157  SPSFNLEKNNIGIDLPNGGNKSSVPSPARLCSVVHAEGSGCIEEDEGLIAPKPSSGMQKQ 216

Query: 689  VEENVQMVSSLCET-NSNLQ-----SNGSSPRHGEAVTLGLEFPNT------CQINPSRI 832
            +  N +  S +  T +S+LQ     +  S  ++G + T+  +  +T      C    S I
Sbjct: 217  IISNTESCSKIGSTASSDLQKQIHFNTDSCTKNGSSATIDSDVMDTPTEKTACNFVVSSI 276

Query: 833  SSVS------------HE------------RGLPLPCETQEAIAYSGSMEMQLSGVVSHC 940
            S  S            H+              L   C        S S EMQLS   ++C
Sbjct: 277  SDGSVNLNAIERQNSKHDDEVCKNSKRQNCSNLENNCVMLPGCIASCSAEMQLSEAGNYC 336

Query: 941  EDTS-------NFLCLNGGKSDLMHDLQAEGGESE 1024
            +DTS        FL L+  K+++  +  A    SE
Sbjct: 337  KDTSCSPNKNGEFLDLDDSKNNIGTEQAALATSSE 371


>ref|XP_006372874.1| hypothetical protein POPTR_0017s05870g [Populus trichocarpa]
           gi|550319522|gb|ERP50671.1| hypothetical protein
           POPTR_0017s05870g [Populus trichocarpa]
          Length = 574

 Score = 73.9 bits (180), Expect = 2e-10
 Identities = 70/217 (32%), Positives = 107/217 (49%), Gaps = 13/217 (5%)
 Frame = +2

Query: 149 SQFCSQYDDSGLKFKRKEPSNSNEVRNSDKNFPNPTRNSKQGNIPRIPCQNINSGFVSTN 328
           SQF SQY  S +   + + S    V +S      P  +    N      +N      +T 
Sbjct: 139 SQFFSQYAGSHVNHNKPQLSLGGRVEDS------PPFHGTDVNTIASSEENAQPSMKTT- 191

Query: 329 GICSEVPSTSFEFYVNSEEGIDLFVDLSSSPSDWIRSLKNKVCISSEFHDSKSGARPFDL 508
              + VP+ SFEF+V+SEEGI L VDL+SSPS+WI+  KN+V +     ++KS +   +L
Sbjct: 192 ---ANVPA-SFEFHVSSEEGIKLCVDLNSSPSEWIKKYKNQVSLCDNVVNTKSRSLYQEL 247

Query: 509 KGLGDSDGQMKSSLLGNTGFG------FQTGGEPVS-----IKSSLGSIVG--ELLGPSA 649
             +G+S+ +MKSS+L N           QT   P S     I  S G  VG    L  S 
Sbjct: 248 GCIGESNKKMKSSVLQNMDSDQIRDDFVQTDPSPSSVAGKNINVSNGHPVGGNNSLISSP 307

Query: 650 VIPTGTQIKLSGHVEENVQMVSSLCETNSNLQSNGSS 760
           +IP G  + ++  +E +  + S+  E +S+ Q+  +S
Sbjct: 308 IIPCGVVVDVTQSLEADPGLASA--EPSSDGQNQKTS 342


>ref|XP_006372873.1| hypothetical protein POPTR_0017s05870g [Populus trichocarpa]
           gi|550319521|gb|ERP50670.1| hypothetical protein
           POPTR_0017s05870g [Populus trichocarpa]
          Length = 564

 Score = 73.9 bits (180), Expect = 2e-10
 Identities = 70/217 (32%), Positives = 107/217 (49%), Gaps = 13/217 (5%)
 Frame = +2

Query: 149 SQFCSQYDDSGLKFKRKEPSNSNEVRNSDKNFPNPTRNSKQGNIPRIPCQNINSGFVSTN 328
           SQF SQY  S +   + + S    V +S      P  +    N      +N      +T 
Sbjct: 139 SQFFSQYAGSHVNHNKPQLSLGGRVEDS------PPFHGTDVNTIASSEENAQPSMKTT- 191

Query: 329 GICSEVPSTSFEFYVNSEEGIDLFVDLSSSPSDWIRSLKNKVCISSEFHDSKSGARPFDL 508
              + VP+ SFEF+V+SEEGI L VDL+SSPS+WI+  KN+V +     ++KS +   +L
Sbjct: 192 ---ANVPA-SFEFHVSSEEGIKLCVDLNSSPSEWIKKYKNQVSLCDNVVNTKSRSLYQEL 247

Query: 509 KGLGDSDGQMKSSLLGNTGFG------FQTGGEPVS-----IKSSLGSIVG--ELLGPSA 649
             +G+S+ +MKSS+L N           QT   P S     I  S G  VG    L  S 
Sbjct: 248 GCIGESNKKMKSSVLQNMDSDQIRDDFVQTDPSPSSVAGKNINVSNGHPVGGNNSLISSP 307

Query: 650 VIPTGTQIKLSGHVEENVQMVSSLCETNSNLQSNGSS 760
           +IP G  + ++  +E +  + S+  E +S+ Q+  +S
Sbjct: 308 IIPCGVVVDVTQSLEADPGLASA--EPSSDGQNQKTS 342


>ref|XP_006372872.1| hypothetical protein POPTR_0017s05870g [Populus trichocarpa]
           gi|550319520|gb|ERP50669.1| hypothetical protein
           POPTR_0017s05870g [Populus trichocarpa]
          Length = 424

 Score = 73.9 bits (180), Expect = 2e-10
 Identities = 70/217 (32%), Positives = 107/217 (49%), Gaps = 13/217 (5%)
 Frame = +2

Query: 149 SQFCSQYDDSGLKFKRKEPSNSNEVRNSDKNFPNPTRNSKQGNIPRIPCQNINSGFVSTN 328
           SQF SQY  S +   + + S    V +S      P  +    N      +N      +T 
Sbjct: 66  SQFFSQYAGSHVNHNKPQLSLGGRVEDS------PPFHGTDVNTIASSEENAQPSMKTT- 118

Query: 329 GICSEVPSTSFEFYVNSEEGIDLFVDLSSSPSDWIRSLKNKVCISSEFHDSKSGARPFDL 508
              + VP+ SFEF+V+SEEGI L VDL+SSPS+WI+  KN+V +     ++KS +   +L
Sbjct: 119 ---ANVPA-SFEFHVSSEEGIKLCVDLNSSPSEWIKKYKNQVSLCDNVVNTKSRSLYQEL 174

Query: 509 KGLGDSDGQMKSSLLGNTGFG------FQTGGEPVS-----IKSSLGSIVG--ELLGPSA 649
             +G+S+ +MKSS+L N           QT   P S     I  S G  VG    L  S 
Sbjct: 175 GCIGESNKKMKSSVLQNMDSDQIRDDFVQTDPSPSSVAGKNINVSNGHPVGGNNSLISSP 234

Query: 650 VIPTGTQIKLSGHVEENVQMVSSLCETNSNLQSNGSS 760
           +IP G  + ++  +E +  + S+  E +S+ Q+  +S
Sbjct: 235 IIPCGVVVDVTQSLEADPGLASA--EPSSDGQNQKTS 269


>ref|XP_006592148.1| PREDICTED: uncharacterized protein LOC100779750 isoform X2 [Glycine
           max] gi|571492164|ref|XP_003541167.2| PREDICTED:
           uncharacterized protein LOC100779750 isoform X1 [Glycine
           max]
          Length = 417

 Score = 72.4 bits (176), Expect = 5e-10
 Identities = 68/234 (29%), Positives = 106/234 (45%), Gaps = 18/234 (7%)
 Frame = +2

Query: 299 NINSGFVSTNGICSEVPSTSFEFYVNSEEGIDLFVDLSSSPSDWIRSLKNKVCISSEFHD 478
           ++     S+    +EVP +SF+FYV S+ GI+L VDL+ S SDWI   +N+VCIS   H 
Sbjct: 23  SVEKDLASSVKAATEVPPSSFQFYVWSDVGINLHVDLNLSSSDWINRFRNEVCISENMHR 82

Query: 479 SKSGARPFDLKGLGDSDGQMKSSLLGNTGFGFQTGGEPVSIKSSLGSIV----------- 625
           +KS +   DL  LG++  Q KSS L +     Q        +SS  S +           
Sbjct: 83  NKSRSLWQDLSSLGENYMQGKSSFLWSKN-SCQIEDHDGQARSSSSSKLTKDGATESGQQ 141

Query: 626 ---GELLGPSAVIPTGTQIKLSGHVEENVQMVSSLCETNSNLQSNGSSPRHGEAVTLGLE 796
              G  L   +  P    I++  ++ EN   VS+  E N N+  N       E  T+  E
Sbjct: 142 NKDGIPLRCDSFTPCNMTIEVKDNILENHSTVSA--ELNVNVVDNLMQ----EQSTVSAE 195

Query: 797 FPNTCQINPSRISSVSHERGLPLPCETQEAIAYS----GSMEMQLSGVVSHCED 946
              +C I+ S+    S    +P      +++  S    G++E+Q S   + C +
Sbjct: 196 V--SCAIDTSKKIIDSDATNMPFIKSLCDSVVNSLSDPGTLELQNSKPDNECSE 247


>gb|EXB68728.1| hypothetical protein L484_024748 [Morus notabilis]
          Length = 545

 Score = 72.0 bits (175), Expect = 6e-10
 Identities = 52/156 (33%), Positives = 82/156 (52%), Gaps = 4/156 (2%)
 Frame = +2

Query: 155 FCSQYDDSGLKFKRKEPSNSNE----VRNSDKNFPNPTRNSKQGNIPRIPCQNINSGFVS 322
           F S+  ++G   ++ +  +S E    + N D     P  N +  +  + P +N  +  + 
Sbjct: 59  FHSKCYENGANRRKAQRKSSGEKFLSLLNDDLVESMPPTNCRGVDDNKCPAENTFASSIE 118

Query: 323 TNGICSEVPSTSFEFYVNSEEGIDLFVDLSSSPSDWIRSLKNKVCISSEFHDSKSGARPF 502
           T+   S+V S+ FEFYV SEEGIDL+VDL+SSPS+W +  KN+V       ++KS +   
Sbjct: 119 TS---SKVGSSPFEFYVWSEEGIDLYVDLNSSPSEWTQKFKNEVHKFENVQNNKSRSLHE 175

Query: 503 DLKGLGDSDGQMKSSLLGNTGFGFQTGGEPVSIKSS 610
           DL  L + D +M+SS         +   EPV  +SS
Sbjct: 176 DLGYLKEGDKEMRSSFWNI--HAREIRDEPVDTRSS 209


>ref|XP_002309806.1| hypothetical protein POPTR_0007s01970g [Populus trichocarpa]
           gi|222852709|gb|EEE90256.1| hypothetical protein
           POPTR_0007s01970g [Populus trichocarpa]
          Length = 587

 Score = 69.7 bits (169), Expect = 3e-09
 Identities = 68/208 (32%), Positives = 105/208 (50%), Gaps = 18/208 (8%)
 Frame = +2

Query: 149 SQFCSQYDDSGLKFKRKEPSNSNEVRNSDKNFP---NPTRNSKQGNIPRIPCQNINSGFV 319
           SQF SQY  S + FK K  S    V +S +      N    SK+  +P I         +
Sbjct: 139 SQFFSQYAGSHVNFK-KPLSLGGRVEDSPQFHGRDINTVACSKEIGLPSI---------I 188

Query: 320 STNGICSEVPSTSFEFYVNSEEGIDLFVDLSSSPSDWIRSLKNKVCISSEFHDSKSGARP 499
           +T    + VP+ SFEF+V+SEEGI L VDL+SSP +WI+  KN+V +     ++KS +  
Sbjct: 189 TT----ANVPA-SFEFHVSSEEGIKLCVDLNSSPLEWIKKYKNQVSLCDNVVNTKSRSLY 243

Query: 500 FDLKGLGDSDGQMKSSLLGNTGFGFQTGGEPVSIKSSLGSIVGE---------------L 634
            +L  +G+S+ ++KSS+L N   G +   + V  + S  S VGE                
Sbjct: 244 EELGCIGESNKKLKSSVLQNIDSG-KNRDDSVQAEPSPSS-VGEKNSHVRNGHPDGGDNS 301

Query: 635 LGPSAVIPTGTQIKLSGHVEENVQMVSS 718
           L  S VIP    + +S +++E+  + S+
Sbjct: 302 LISSPVIPCSVAVDVSLYLKEDPGLASA 329


Top