BLASTX nr result

ID: Atropa21_contig00005416

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Atropa21_contig00005416
         (1623 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006347514.1| PREDICTED: uncharacterized protein LOC102588...   384   e-104
ref|XP_004235021.1| PREDICTED: uncharacterized protein LOC101249...   369   3e-99
gb|EOY23701.1| Uncharacterized protein isoform 1 [Theobroma cacao]    162   4e-37
gb|EOY23702.1| Uncharacterized protein isoform 2 [Theobroma cacao]    161   9e-37
ref|XP_006493204.1| PREDICTED: uncharacterized protein LOC102614...   153   2e-34
ref|XP_006374085.1| hypothetical protein POPTR_0015s00740g [Popu...   152   3e-34
ref|XP_006441238.1| hypothetical protein CICLE_v10020653mg [Citr...   151   7e-34
ref|XP_002321364.2| hypothetical protein POPTR_0015s00740g [Popu...   150   1e-33
ref|XP_003634173.1| PREDICTED: uncharacterized protein LOC100853...   149   3e-33
ref|XP_002318455.2| hypothetical protein POPTR_0012s02820g [Popu...   141   9e-31
ref|XP_006376666.1| hypothetical protein POPTR_0012s02820g [Popu...   139   3e-30
ref|XP_004307917.1| PREDICTED: uncharacterized protein LOC101313...   136   3e-29
gb|EMJ22370.1| hypothetical protein PRUPE_ppa021823mg [Prunus pe...   135   7e-29
ref|XP_002524424.1| conserved hypothetical protein [Ricinus comm...   131   7e-28
gb|EXB78390.1| hypothetical protein L484_003252 [Morus notabilis]     125   4e-26
ref|XP_002515870.1| conserved hypothetical protein [Ricinus comm...   125   4e-26
emb|CAN80175.1| hypothetical protein VITISV_018394 [Vitis vinifera]   113   5e-24
ref|XP_006394704.1| hypothetical protein EUTSA_v10005511mg [Eutr...   115   7e-23
ref|XP_006581687.1| PREDICTED: uncharacterized protein LOC100776...   114   9e-23
ref|XP_003527999.1| PREDICTED: uncharacterized protein LOC100776...   114   9e-23

>ref|XP_006347514.1| PREDICTED: uncharacterized protein LOC102588139 [Solanum tuberosum]
          Length = 348

 Score =  384 bits (986), Expect = e-104
 Identities = 225/353 (63%), Positives = 242/353 (68%), Gaps = 38/353 (10%)
 Frame = +3

Query: 159  MLCSISTQKS-GSNWLDRLHSSKGFSFADNSN-----------GS----PNTEXXXXXXX 290
            MLCSISTQKS GSNWLDRL SSKGFSFADN N           GS    P+TE       
Sbjct: 1    MLCSISTQKSAGSNWLDRLRSSKGFSFADNRNLEQFITHQTPNGSDSLPPSTETEIRDSN 60

Query: 291  XXXXXTGSESSCDPIGPVTEPVLHPDQTPVAPDNSRDNKELCNVVTNVLSELFCMGGEST 470
                  GSESS DPI PV EPVLH DQ P AP NS DN+ELC+VVTNVLSELFCMG EST
Sbjct: 61   NNI---GSESSSDPIRPVNEPVLHRDQAPAAPHNSGDNEELCSVVTNVLSELFCMG-EST 116

Query: 471  KFPKFNVKRGSRKQTNPRFCASSKINN-----------ADEQSDKCRVDIKDSQVKLLEQ 617
             FPKF+VKRGSRKQTNPRFCASS+IN+             E  DKCRV+IKDSQVKLLEQ
Sbjct: 117  SFPKFSVKRGSRKQTNPRFCASSEINSDAVVEGGQRKEETESLDKCRVEIKDSQVKLLEQ 176

Query: 618  SHNLNLAEEDEEKSHANLMGYSRTEVTVIDTSCAPWKFEKLLFRKKNVWKVRDKRSKTMN 797
             HNLNLAEE E+KS+ANLMG+SRTEV VIDTSCAPWKFEKLLFRKKNVWKVRDK+SKT+N
Sbjct: 177  GHNLNLAEE-EDKSNANLMGFSRTEVMVIDTSCAPWKFEKLLFRKKNVWKVRDKKSKTLN 235

Query: 798  LG-KKRKTDMTNEDVGGEKKQKVISRHNGYL---------VNEKLQLNDKLEETCKRT-X 944
             G KKRK D+T+ED  GEKKQK IS H+GY          V+EKLQL+DK E TCKRT  
Sbjct: 236  WGKKKRKADVTSEDARGEKKQKFISGHDGYAAKGRECKSSVSEKLQLDDKSEGTCKRTSD 295

Query: 945  XXXXXXXXXXXXXXXXXXXXXXXXXXXXPTSKKNGTGIAKNFLKPSHRQYQAQ 1103
                                        PTSKKNGTG AKN LKPSHRQ QAQ
Sbjct: 296  SVGQASKKKQGSLKLKKSSPSVVLIKSIPTSKKNGTGFAKNSLKPSHRQCQAQ 348


>ref|XP_004235021.1| PREDICTED: uncharacterized protein LOC101249438 [Solanum
            lycopersicum]
          Length = 345

 Score =  369 bits (946), Expect = 3e-99
 Identities = 215/347 (61%), Positives = 234/347 (67%), Gaps = 37/347 (10%)
 Frame = +3

Query: 159  MLCSISTQKS-GSNWLDRLHSSKGFSFADNSN-----------GS---PNTEXXXXXXXX 293
            MLCSISTQKS GSNWLDRL SSKGFSFADN N           GS   P++         
Sbjct: 1    MLCSISTQKSAGSNWLDRLRSSKGFSFADNRNLEQFLTHQTPNGSDSLPSSTETEIRDSN 60

Query: 294  XXXXTGSESSCDPIGPVTEPVLHPDQTPVAPDNSRDNKELCNVVTNVLSELFCMGGESTK 473
                TGSESS DPI PV E VL  DQ P A  NS DN+ELC+VVTNVLS+LFCMG EST 
Sbjct: 61   NKDNTGSESSSDPIRPVNESVLPRDQAPAASHNSGDNEELCSVVTNVLSDLFCMG-ESTS 119

Query: 474  FPKFNVKRGSRKQTNPRFCASSKINN-----------ADEQSDKCRVDIKDSQVKLLEQS 620
            FPK +VKRGSRKQTNPRFCASS+IN              E  DKCRV+IKDSQVKLLE+ 
Sbjct: 120  FPKLSVKRGSRKQTNPRFCASSEINGDAVVEGGQRKEETESLDKCRVEIKDSQVKLLEEG 179

Query: 621  HNLNLAEEDEEKSHANLMGYSRTEVTVIDTSCAPWKFEKLLFRKKNVWKVRDKRSKTMNL 800
            HNLNLAEE E+KS+ANLMG+SRTEV VIDTSCAPWKFEKLLFRKKNVWKVRDK+SKT+NL
Sbjct: 180  HNLNLAEE-EDKSNANLMGFSRTEVMVIDTSCAPWKFEKLLFRKKNVWKVRDKKSKTLNL 238

Query: 801  G-KKRKTDMTNEDVGGEKKQKVISRHNGYL---------VNEKLQLNDKLEETCKRT-XX 947
            G KKRK D+T+ED  GEKK+K IS HNGY          V+EKLQL+DKLE TCKRT   
Sbjct: 239  GKKKRKVDVTSEDARGEKKRKFISGHNGYAEKGRECKSSVSEKLQLDDKLEGTCKRTSDS 298

Query: 948  XXXXXXXXXXXXXXXXXXXXXXXXXXXPTSKKNGTGIAKNFLKPSHR 1088
                                       PTSKKNG G AKN LKPSHR
Sbjct: 299  FGQASKKKQRYLKLKKASSSVVLIKSIPTSKKNGVGFAKNSLKPSHR 345


>gb|EOY23701.1| Uncharacterized protein isoform 1 [Theobroma cacao]
          Length = 353

 Score =  162 bits (410), Expect = 4e-37
 Identities = 120/356 (33%), Positives = 174/356 (48%), Gaps = 46/356 (12%)
 Frame = +3

Query: 159  MLCSISTQKSGSNWLDRLHSSKGFSFADNSN-----GSPNTEXXXXXXXXXXXXTGSESS 323
            MLCSIST KSGSNWLDRL SSKGF   DN +      +PN              + SES+
Sbjct: 1    MLCSISTGKSGSNWLDRLRSSKGFPTGDNLDLDHFLTNPNPSDSPITDASNSPNSNSEST 60

Query: 324  CDPIGPVTEPVLHPDQTPVAPDNSRDNKELCNVVTNVLSELFCMGGESTKFPKFNVKRGS 503
                  +      P +  V       +KE   +++NVLSELF MG ++ +  +F+ K+ S
Sbjct: 61   HSNDKELQNRKAPPPE--VVSSEPAGDKEWFGIMSNVLSELFNMGDQA-QTSRFSRKKTS 117

Query: 504  RKQTNPRFCA--SSKINNADEQ---SDKCRVDI------------KDSQVKLLEQSHNLN 632
            RKQTNP+ C   +S +N ++EQ   SD  R D             ++++ +  E+  + N
Sbjct: 118  RKQTNPKICIIKTSNVNTSEEQKSSSDSVRKDENIPASTTSLNSKEEAKREWKEEGDDYN 177

Query: 633  LAEEDEE----KSHANLMGYSRTEVTVIDTSCAPWKFEKLLFRKKNVWKVRDKRSKTMNL 800
            + EE++E    K    L+GYSR+EVTVIDTSC  WK +KL+FR+KN+WKV+DK+ K+  +
Sbjct: 178  VEEEEQEEENGKGERELLGYSRSEVTVIDTSCEVWKVDKLIFRRKNIWKVKDKKGKSRIV 237

Query: 801  GKKRK-------TDMTNEDVGG--EKKQKVIS-----------RHNGYLVNEKLQLNDKL 920
            G+K++           +++ GG   KK+K+ S           + +G   N      +K 
Sbjct: 238  GRKKRKAPPPPPPPSYDDNNGGVWNKKRKISSSELRSLKDTSGKESGSPTNHNAP-GEKG 296

Query: 921  EETCKRTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPTSKKNGTGIAKNFLKPSHR 1088
            E  C  T                             PT KKNG  +AKN LK + R
Sbjct: 297  ELVCNETPDDLTQVLRKRLPRKSGKGSTSVILIKSIPTGKKNGAKLAKNRLKDTQR 352


>gb|EOY23702.1| Uncharacterized protein isoform 2 [Theobroma cacao]
          Length = 355

 Score =  161 bits (407), Expect = 9e-37
 Identities = 120/357 (33%), Positives = 174/357 (48%), Gaps = 47/357 (13%)
 Frame = +3

Query: 159  MLCSISTQKSGSNWLDRLHSSKGFSFADNSN-----GSPNTEXXXXXXXXXXXXTGSESS 323
            MLCSIST KSGSNWLDRL SSKGF   DN +      +PN              + SES+
Sbjct: 1    MLCSISTGKSGSNWLDRLRSSKGFPTGDNLDLDHFLTNPNPSDSPITDASNSPNSNSEST 60

Query: 324  CDPIGPVTEPVLHPDQTPVAPDNSRDNKELCNVVTNVLSELFCMGGESTKFPKFNVKRGS 503
                  +      P +  V       +KE   +++NVLSELF MG ++ +  +F+ K+ S
Sbjct: 61   HSNDKELQNRKAPPPE--VVSSEPAGDKEWFGIMSNVLSELFNMGDQA-QTSRFSRKKTS 117

Query: 504  RKQTNPRFCA--SSKINNADEQ---SDKCRVDI------------KDSQVKLLEQSHNLN 632
            RKQTNP+ C   +S +N ++EQ   SD  R D             ++++ +  E+  + N
Sbjct: 118  RKQTNPKICIIKTSNVNTSEEQKSSSDSVRKDENIPASTTSLNSKEEAKREWKEEGDDYN 177

Query: 633  LAEEDEE----KSHANLMGYSRTEVTVIDTSCAPWKFEKLLFRKKNVWKVRDKRSKTMNL 800
            + EE++E    K    L+GYSR+EVTVIDTSC  WK +KL+FR+KN+WKV+DK+ K+  +
Sbjct: 178  VEEEEQEEENGKGERELLGYSRSEVTVIDTSCEVWKVDKLIFRRKNIWKVKDKKGKSRIV 237

Query: 801  GKKRK-------TDMTNEDVGG--EKKQKVIS-----------RHNGYLVNEKLQL-NDK 917
            G+K++           +++ GG   KK+K+ S           + +G   N       +K
Sbjct: 238  GRKKRKAPPPPPPPSYDDNNGGVWNKKRKISSSELRSLKDTSGKESGSPTNHGQNAPGEK 297

Query: 918  LEETCKRTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPTSKKNGTGIAKNFLKPSHR 1088
             E  C  T                             PT KKNG  +AKN LK + R
Sbjct: 298  GELVCNETPDDLTQVLRKRLPRKSGKGSTSVILIKSIPTGKKNGAKLAKNRLKDTQR 354


>ref|XP_006493204.1| PREDICTED: uncharacterized protein LOC102614232 [Citrus sinensis]
          Length = 376

 Score =  153 bits (386), Expect = 2e-34
 Identities = 99/255 (38%), Positives = 144/255 (56%), Gaps = 13/255 (5%)
 Frame = +3

Query: 153 STMLCSISTQKSGSNWLDRLHSSKGFSFADNSNGSPNTEXXXXXXXXXXXXTGSESSCDP 332
           S M+CS+ST KS SNWLDRL S+KGF   D+       E            + S  S   
Sbjct: 29  SAMICSMSTGKSCSNWLDRLRSNKGFPVGDDLELDHFLENKDSNLKSK---SNSSESTQN 85

Query: 333 IGPVTEPVLHPDQTPVAPDNSRDNKELCNVVTNVLSELFCMG-GESTKFPKFNVKRGSRK 509
               TE +   ++      N  D  E   ++ NVLS+LF MG     +  KF+ K+ SRK
Sbjct: 86  RKAATEEICGENE------NGDDKGEWFGIMNNVLSDLFIMGESNDDQSCKFSRKKISRK 139

Query: 510 QTNPRFCASSKINNADEQSDK----CRVDIKDSQV--KLLEQ---SHNLNLAEEDEEKSH 662
           QTNP+FC  S++ +++ + ++    C    +++Q+  KL E+     N+N A E E+   
Sbjct: 140 QTNPKFCLVSRMTSSNVEEEQSCGGCERKDENAQIENKLKEEVDGEENVNNAVEMEDGER 199

Query: 663 ANLMGYSRTEVTVIDTSCAPWKFEKLLFRKKNVWKVRDKRSKTMNLG---KKRKTDMTNE 833
             L+GYSR EVTVIDTSC  WKFEKL++RK+NVWKVR+K+ K+  +G   KKRK +  + 
Sbjct: 200 DELLGYSRNEVTVIDTSCTEWKFEKLVYRKRNVWKVREKKGKSRMIGLGRKKRKANGADA 259

Query: 834 DVGGEKKQKVISRHN 878
           +V  +KK K+ S+ +
Sbjct: 260 NVDTKKKFKLNSQED 274


>ref|XP_006374085.1| hypothetical protein POPTR_0015s00740g [Populus trichocarpa]
           gi|550321689|gb|ERP51882.1| hypothetical protein
           POPTR_0015s00740g [Populus trichocarpa]
          Length = 383

 Score =  152 bits (385), Expect = 3e-34
 Identities = 112/292 (38%), Positives = 149/292 (51%), Gaps = 36/292 (12%)
 Frame = +3

Query: 159 MLCSISTQKSGSNWLDRLHSSKGFSFAD-------NSNGSPNTEXXXXXXXXXXXXTGSE 317
           MLCS+ T KSGSNWLDRL S+KGFS  D       N + SP T+            T SE
Sbjct: 44  MLCSVKTSKSGSNWLDRLWSNKGFSNNDDDDPSVPNPSSSPITDASNSVINSNSESTHSE 103

Query: 318 SSCDPIGPVTEPVLHPDQTPVAPDNSRDNKELCNVVTNVLSELFCMGG-----ESTKFPK 482
           S  + +   T   +          +S DNK+L  ++ NVLS+LF MGG     E +    
Sbjct: 104 SDQNKVTTTTTREI----------SSSDNKDLFFLMNNVLSDLFNMGGCSDPIEGSSRHS 153

Query: 483 FNVKRGSRKQTNPRFCASSKINNADEQSDKCRVD----IKDSQVKLLEQSHNLNLAEED- 647
              +R  RKQT P+FC  S  N++++  D  R D    +    +   + S+N++   +D 
Sbjct: 154 RKKERIPRKQTKPKFCFVSGNNSSNDSLDCVRKDENVLVATGSLNSDKNSNNVDCGVDDD 213

Query: 648 ----------EEKSHA-------NLMGYSRTEVTVIDTSCAPWKFEKLLFRKKNVWKVRD 776
                     EEK  A        L GYSR+EVTVIDTSC  WKF+KL+FRKKNVWKVRD
Sbjct: 214 DEEEEEEDVEEEKGKAFGVSGDKELKGYSRSEVTVIDTSCLVWKFDKLVFRKKNVWKVRD 273

Query: 777 KRSKTMNLG-KKRKT-DMTNEDVGGEKKQKVISRHNGYLVNEKLQLNDKLEE 926
           K+ K+   G KKRK  D+ + +  G KK+  +S      V      NDK E+
Sbjct: 274 KKGKSWVSGSKKRKVIDLESANGNGAKKKAKVSNLE---VGSSKDANDKPED 322


>ref|XP_006441238.1| hypothetical protein CICLE_v10020653mg [Citrus clementina]
           gi|557543500|gb|ESR54478.1| hypothetical protein
           CICLE_v10020653mg [Citrus clementina]
          Length = 374

 Score =  151 bits (382), Expect = 7e-34
 Identities = 98/255 (38%), Positives = 143/255 (56%), Gaps = 13/255 (5%)
 Frame = +3

Query: 153 STMLCSISTQKSGSNWLDRLHSSKGFSFADNSNGSPNTEXXXXXXXXXXXXTGSESSCDP 332
           S M+CS+ST KS SNWLDRL S+KGF   D+       E            + S  S   
Sbjct: 29  SAMICSMSTGKSCSNWLDRLRSNKGFPVGDDLELDHFLE---NKDSNLKPKSNSSESTQN 85

Query: 333 IGPVTEPVLHPDQTPVAPDNSRDNKELCNVVTNVLSELFCMG-GESTKFPKFNVKRGSRK 509
               TE +   +      +N  D  E   ++ NVLS+LF MG     +  KF+ K+ SRK
Sbjct: 86  RKVATEEICGEN------ENGDDKGEWFGIMNNVLSDLFIMGESNDDQSCKFSRKKISRK 139

Query: 510 QTNPRFCASSKINNADEQSDK----CRVDIKDSQV--KLLEQ---SHNLNLAEEDEEKSH 662
           QTNP+FC  S++ +++ + ++    C    +++Q+  KL E+     N+N   E E+   
Sbjct: 140 QTNPKFCLVSRMTSSNVEEEQSCGGCERKDENAQIENKLKEEVDGEENVNNVVEMEDGER 199

Query: 663 ANLMGYSRTEVTVIDTSCAPWKFEKLLFRKKNVWKVRDKRSKTMNLG---KKRKTDMTNE 833
             L+GYSR EVTVIDTSC  WKFEKL++RK+NVWKVR+K+ K+  +G   KKRK +  + 
Sbjct: 200 EELLGYSRNEVTVIDTSCTEWKFEKLVYRKRNVWKVREKKGKSRMIGLGRKKRKANGADA 259

Query: 834 DVGGEKKQKVISRHN 878
           +V  +KK K+ S+ +
Sbjct: 260 NVDTKKKFKLNSQED 274


>ref|XP_002321364.2| hypothetical protein POPTR_0015s00740g [Populus trichocarpa]
           gi|550321690|gb|EEF05491.2| hypothetical protein
           POPTR_0015s00740g [Populus trichocarpa]
          Length = 385

 Score =  150 bits (380), Expect = 1e-33
 Identities = 107/273 (39%), Positives = 143/273 (52%), Gaps = 36/273 (13%)
 Frame = +3

Query: 159 MLCSISTQKSGSNWLDRLHSSKGFSFAD-------NSNGSPNTEXXXXXXXXXXXXTGSE 317
           MLCS+ T KSGSNWLDRL S+KGFS  D       N + SP T+            T SE
Sbjct: 44  MLCSVKTSKSGSNWLDRLWSNKGFSNNDDDDPSVPNPSSSPITDASNSVINSNSESTHSE 103

Query: 318 SSCDPIGPVTEPVLHPDQTPVAPDNSRDNKELCNVVTNVLSELFCMGG-----ESTKFPK 482
           S  + +   T   +          +S DNK+L  ++ NVLS+LF MGG     E +    
Sbjct: 104 SDQNKVTTTTTREI----------SSSDNKDLFFLMNNVLSDLFNMGGCSDPIEGSSRHS 153

Query: 483 FNVKRGSRKQTNPRFCASSKINNADEQSDKCRVD----IKDSQVKLLEQSHNLNLAEED- 647
              +R  RKQT P+FC  S  N++++  D  R D    +    +   + S+N++   +D 
Sbjct: 154 RKKERIPRKQTKPKFCFVSGNNSSNDSLDCVRKDENVLVATGSLNSDKNSNNVDCGVDDD 213

Query: 648 ----------EEKSHA-------NLMGYSRTEVTVIDTSCAPWKFEKLLFRKKNVWKVRD 776
                     EEK  A        L GYSR+EVTVIDTSC  WKF+KL+FRKKNVWKVRD
Sbjct: 214 DEEEEEEDVEEEKGKAFGVSGDKELKGYSRSEVTVIDTSCLVWKFDKLVFRKKNVWKVRD 273

Query: 777 KRSKTMNLG-KKRKT-DMTNEDVGGEKKQKVIS 869
           K+ K+   G KKRK  D+ + +  G KK+  +S
Sbjct: 274 KKGKSWVSGSKKRKVIDLESANGNGAKKKAKVS 306


>ref|XP_003634173.1| PREDICTED: uncharacterized protein LOC100853133 [Vitis vinifera]
          Length = 985

 Score =  149 bits (377), Expect = 3e-33
 Identities = 95/239 (39%), Positives = 132/239 (55%), Gaps = 17/239 (7%)
 Frame = +3

Query: 198 WLDRLHSSKGFSFADN-------SNGSPNTEXXXXXXXXXXXXTGSESSCDPIGPVTEPV 356
           WLDRL S+KGF   ++       ++  PN                S+S+C    PV +  
Sbjct: 166 WLDRLRSAKGFPTGNDDDLEHFLTHRDPNLSNSPITKPSDPKSI-SDSTCSDEKPVQDRS 224

Query: 357 LHPDQTPVAPDNSRDNKELCNVVTNVLSELFCMGGESTKFPKFNVKRGSRKQTNPRFCAS 536
             P+            KE   +++NVL+ELF MG +S + PK + K+ SRKQTNP+ C  
Sbjct: 225 QPPET---------GEKEWFGIMSNVLAELFNMG-DSNQIPKLSGKKSSRKQTNPKICLL 274

Query: 537 SKINNADE-------QSDKCRVDIKDS--QVKLLEQSHNLNLAEEDEEKSHANLMGYSRT 689
           S +   DE         D    ++KDS  +VK + Q   ++  + +EEK + +L  YSR+
Sbjct: 275 SSVRQEDEVPATAPSSGDNSLTEMKDSNGEVKTVNQG-KVDCLDAEEEKCNQDLSAYSRS 333

Query: 690 EVTVIDTSCAPWKFEKLLFRKKNVWKVRDKRSKTMNLG-KKRKTDMTNEDVGGEKKQKV 863
           EVTVIDTSCA WKFEKLLFRKKNVWKVRDK+ K+ ++G KKRK    +E +   KK K+
Sbjct: 334 EVTVIDTSCAVWKFEKLLFRKKNVWKVRDKKGKSRSIGRKKRKASECDEQLEARKKMKL 392


>ref|XP_002318455.2| hypothetical protein POPTR_0012s02820g [Populus trichocarpa]
            gi|550326249|gb|EEE96675.2| hypothetical protein
            POPTR_0012s02820g [Populus trichocarpa]
          Length = 355

 Score =  141 bits (355), Expect = 9e-31
 Identities = 115/359 (32%), Positives = 166/359 (46%), Gaps = 48/359 (13%)
 Frame = +3

Query: 159  MLCSISTQKSGSNWLDRLHSSKGFSFADNSN-------GSPNTEXXXXXXXXXXXXTGSE 317
            MLCS+ T KS SNWLDRL S++GF+  +++N        SP T             T S+
Sbjct: 1    MLCSVQTSKSSSNWLDRLWSNRGFNNNNDNNPSVPNPSSSPTTNASNSVINSNSESTHSD 60

Query: 318  SSCDPIGPVTEPVLHPDQTPVAPDNSRDNKELCNVVTNVLSELFCMGG------ESTKFP 479
            S    +   T      + +      S DNK+L  ++ NVLS+LF MGG      ES++  
Sbjct: 61   SDQIKVTATTATATTREIS------SSDNKDLFFIMNNVLSDLFNMGGVSDPVEESSRLS 114

Query: 480  KFNVKRGSRKQTNPRFCASSKINNADEQSDKCRVDIK-----------------DSQVKL 608
            +   ++  RKQT P+FC  S  N+ ++  D  R D                   D  V +
Sbjct: 115  R-KKEKVPRKQTKPKFCFISGNNSGNDSLDCVRKDRNVLAATGSLNSDKNSNNVDCGVVV 173

Query: 609  LEQSHNLNLAEEDEEKSHA-------NLMGYSRTEVTVIDTSCAPWKFEKLLFRKKNVWK 767
             +   +    EED E+           L GYSR+EVTVIDTSC  WKF+KL+FRKKNVWK
Sbjct: 174  DDDDDDEEDVEEDVEEEKGFGVGGDKELKGYSRSEVTVIDTSCQVWKFDKLVFRKKNVWK 233

Query: 768  VRDKRSKTMNLGKKRK--TDMTNEDVGGEKKQKVISR---HNGYLVNE-KLQLNDKLEET 929
            VRDK+ K+   G K++   D+ + +  G KK+  +S     +   VN+ + Q +++ EE 
Sbjct: 234  VRDKKGKSWVFGSKKRKGNDLESANGNGAKKKAKVSNLEVGSSKDVNDVQKQEDERREEE 293

Query: 930  CKR-----TXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPTSKKNGTGIAKNFLKPSHRQ 1091
             K+     +                             PTS K+G  I KN LK + R+
Sbjct: 294  HKQMPEDLSQVPKKRFHFSRSPEKSIKSGSSVILIKTIPTSNKSGKNITKNRLKDNQRK 352


>ref|XP_006376666.1| hypothetical protein POPTR_0012s02820g [Populus trichocarpa]
           gi|550326248|gb|ERP54463.1| hypothetical protein
           POPTR_0012s02820g [Populus trichocarpa]
          Length = 310

 Score =  139 bits (350), Expect = 3e-30
 Identities = 98/276 (35%), Positives = 138/276 (50%), Gaps = 39/276 (14%)
 Frame = +3

Query: 159 MLCSISTQKSGSNWLDRLHSSKGFSFADNSN-------GSPNTEXXXXXXXXXXXXTGSE 317
           MLCS+ T KS SNWLDRL S++GF+  +++N        SP T             T S+
Sbjct: 1   MLCSVQTSKSSSNWLDRLWSNRGFNNNNDNNPSVPNPSSSPTTNASNSVINSNSESTHSD 60

Query: 318 SSCDPIGPVTEPVLHPDQTPVAPDNSRDNKELCNVVTNVLSELFCMGG------ESTKFP 479
           S    +   T      + +      S DNK+L  ++ NVLS+LF MGG      ES++  
Sbjct: 61  SDQIKVTATTATATTREIS------SSDNKDLFFIMNNVLSDLFNMGGVSDPVEESSRLS 114

Query: 480 KFNVKRGSRKQTNPRFCASSKINNADEQSDKCRVDIK-----------------DSQVKL 608
           +   ++  RKQT P+FC  S  N+ ++  D  R D                   D  V +
Sbjct: 115 R-KKEKVPRKQTKPKFCFISGNNSGNDSLDCVRKDRNVLAATGSLNSDKNSNNVDCGVVV 173

Query: 609 LEQSHNLNLAEEDEEKSHA-------NLMGYSRTEVTVIDTSCAPWKFEKLLFRKKNVWK 767
            +   +    EED E+           L GYSR+EVTVIDTSC  WKF+KL+FRKKNVWK
Sbjct: 174 DDDDDDEEDVEEDVEEEKGFGVGGDKELKGYSRSEVTVIDTSCQVWKFDKLVFRKKNVWK 233

Query: 768 VRDKRSKTMNLGKKRK--TDMTNEDVGGEKKQKVIS 869
           VRDK+ K+   G K++   D+ + +  G KK+  +S
Sbjct: 234 VRDKKGKSWVFGSKKRKGNDLESANGNGAKKKAKVS 269


>ref|XP_004307917.1| PREDICTED: uncharacterized protein LOC101313650 [Fragaria vesca
           subsp. vesca]
          Length = 323

 Score =  136 bits (342), Expect = 3e-29
 Identities = 89/268 (33%), Positives = 140/268 (52%), Gaps = 9/268 (3%)
 Frame = +3

Query: 159 MLCSISTQKSGSNWLDRLHSSKGFSFADNSNGSPNTEXXXXXXXXXXXXTGSESSCDPIG 338
           MLCS+   KSG NWLDRL S+KGF   DN +                  T S  S +P  
Sbjct: 1   MLCSVRATKSGPNWLDRLRSNKGFPACDNLD---------LDHFLKHNPTSSSESPNPNA 51

Query: 339 PVTEPVLHPDQTPVAPDNSRDNKELCNVVTNVLSELFCMGGESTKFPKFNVKRGSRKQTN 518
             T  V +  ++     +++  + L  +++  +SELF + G S +  + + K+  RKQT+
Sbjct: 52  DSTPLVSNRPESSGPTRDAKKGEALLGLMSTAISELFFIDG-SEESSRLSGKKVPRKQTH 110

Query: 519 PRFCASSKINNADEQSDKCRVDIKDSQ-VKLLEQSHNLNLAEEDEEKSHANLMGYSRTEV 695
           PR C +SK+ ++    +    D+ D + V  L   + + L    EE+    L GYS++EV
Sbjct: 111 PRLCVTSKLKSSGSIGN----DVNDLRTVPSLNSKNEVEL----EERGERELKGYSKSEV 162

Query: 696 TVIDTSCAPWKFEKLLFRKKNVWKVRDKRSKTMNLGKKRKTDMTNEDVGGE--------K 851
           TVIDTSC  WK EKL+FR+K+VWKVR+K+SK  + G+ ++  ++ ++ G +        K
Sbjct: 163 TVIDTSCEVWKTEKLVFRRKSVWKVREKKSKVRSFGRNKRKVVSGDEEGDDGIEEKRKKK 222

Query: 852 KQKVISRHNGYLVNEKLQLNDKLEETCK 935
           K+  +S     L   +   ND  +E CK
Sbjct: 223 KEAEVSDQCISLNPIENSRNDARKEVCK 250


>gb|EMJ22370.1| hypothetical protein PRUPE_ppa021823mg [Prunus persica]
          Length = 723

 Score =  135 bits (339), Expect = 7e-29
 Identities = 100/263 (38%), Positives = 131/263 (49%), Gaps = 33/263 (12%)
 Frame = +3

Query: 159 MLCSISTQKSGSNWLDRLHSSKGFSFADNSNG----SPNTEXXXXXXXXXXXXTGSESSC 326
           MLCS+   KSGSNWLDRL S+KG    DN +     S NT                 SS 
Sbjct: 1   MLCSVPASKSGSNWLDRLRSNKGLPTGDNLDLDHFLSRNTNSSSEVPTPNV-----SSST 55

Query: 327 DPIGPVTEPVLHPDQTPVAPDNSRDNKELCNVVTNVLSELFCMGGESTKFPKFNVKRGSR 506
           +   P ++ V++   T   P+     +    +V NVLSELF MGG   +  K   K+  R
Sbjct: 56  ESTRPGSDRVVNQSTTS-CPNRDNQGEAFIGLVNNVLSELFFMGGSDER-SKLLGKKIRR 113

Query: 507 KQTNPRFCASSKIN--------NADEQ--SDKCRVD--------IKDSQVKLL------- 611
           KQ NPR C +S  N        NA E+  SD  R D          DSQ   L       
Sbjct: 114 KQANPRVCVTSTANYDSNAATANATEEKSSDWGRNDEHVLDKAACLDSQNGSLMKNKDLG 173

Query: 612 ----EQSHNLNLAEEDEEKSHANLMGYSRTEVTVIDTSCAPWKFEKLLFRKKNVWKVRDK 779
               E+   +   EE+E++    L GYS +EVTVIDTSC  WK EK++FR+KNVWKVR+K
Sbjct: 174 NVGGEEGEEVEEEEEEEKEELRELKGYSISEVTVIDTSCGVWKTEKVVFRRKNVWKVREK 233

Query: 780 RSKTMNLGKKRKTDMTNEDVGGE 848
           ++K    G +RK  + +E+VG E
Sbjct: 234 KAKVRKFG-RRKRKVVDEEVGVE 255


>ref|XP_002524424.1| conserved hypothetical protein [Ricinus communis]
           gi|223536308|gb|EEF37959.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 272

 Score =  131 bits (330), Expect = 7e-28
 Identities = 98/259 (37%), Positives = 133/259 (51%), Gaps = 22/259 (8%)
 Frame = +3

Query: 159 MLCSIST-QKSGSNWLDRLHSSKGFSFADN---SNGSPNTEXXXXXXXXXXXXTGSESSC 326
           MLCS+S   KSGSNWLDRL S+KGF   +N    N   N+               SES+ 
Sbjct: 1   MLCSVSAGTKSGSNWLDRLRSTKGFPATENLDLDNFLSNSSLLNPSI--------SESTL 52

Query: 327 DPIGPVTEPVLHPDQTPVAPDNSRDN--KELCNVVTNVLSELFCMGGESTKFPKFNVKRG 500
                VT      DQT   PD S +N  KE   +VTNVL +LF MG    K  + +  + 
Sbjct: 53  SHNKRVTS-----DQTQF-PDTSSENGEKEWFGLVTNVLCDLFNMGDSQDKNSRLSGTKS 106

Query: 501 SRKQTNPRFCASSKINN---------ADEQSDKCRVDIKDSQVKLLEQSHNLNLAEEDEE 653
           SRKQTNP+F     +           A  +SD    ++            + N+ EE E+
Sbjct: 107 SRKQTNPKFFDIESVRKEECVQVATPASFRSDN-NSNVVGMNADCFSNDDDNNVDEEKEK 165

Query: 654 -KSHANLMGYSRTEVTVIDTSCAPWKFEKLLFRKKNVWKVRDKRSKTMNLGKKRK----- 815
             S   L GYS++EVTVIDTS   WKF+KL+FR+KN+WKVRDK+ K+ +   K++     
Sbjct: 166 CSSDKELKGYSKSEVTVIDTSFEMWKFDKLVFRRKNIWKVRDKKGKSWSFSSKKRKGNQL 225

Query: 816 -TDMTNEDVGGEKKQKVIS 869
            + + N +VG +KK K+ S
Sbjct: 226 ESAIGNGNVGCKKKAKMSS 244


>gb|EXB78390.1| hypothetical protein L484_003252 [Morus notabilis]
          Length = 353

 Score =  125 bits (315), Expect = 4e-26
 Identities = 104/296 (35%), Positives = 138/296 (46%), Gaps = 35/296 (11%)
 Frame = +3

Query: 159 MLCSISTQKS--GSNWLDRLHSSKGFSFADNSNGSPNTEXXXXXXXXXXXXTGSESS-CD 329
           MLCS+   KS  GSNWL R+ S KGF   D+ +                  + SES+  D
Sbjct: 1   MLCSVPAGKSAGGSNWLSRIRSIKGFPAGDDDD-------LGHFITQNLNSSASESTRLD 53

Query: 330 PIGPVTEPVLHPDQTPVAPDNSRDN--KELCNVVTNVLSELFCMGGEST-KFPKFNVKRG 500
           P     + +  P+ +P AP   R     E    +  VLSELF MGG       + + KR 
Sbjct: 54  P-----QRIAVPN-SPEAPGRIRGRVEPEWVGAMDTVLSELFFMGGAGEISSSRHSGKRI 107

Query: 501 SRKQTNPRFCASSKINNADEQSD-------------KCRVDIKDSQVKLLEQSHNLNLAE 641
            RKQTNP+ CA+S  NN +  ++             K   D       L   S N +  E
Sbjct: 108 PRKQTNPKICAASASNNNNNNNNSGNSNSSGVVEQKKKGSDFAPKTASLSSDSGNNSTRE 167

Query: 642 -----------EDEEKSHANLMGYSRTEVTVIDTSCAPWKFEKLLFRKKNVWKVRDKRSK 788
                      +DE++    L GYSR+EVTVIDTSC  WK EKL+FR+K+VW+VR+K+ K
Sbjct: 168 GHGNVDVDFDVDDEDEDEKELKGYSRSEVTVIDTSCGSWKSEKLVFRRKSVWRVREKKGK 227

Query: 789 TMNLG-KKRKTDMTNEDV----GGEKKQKVISRHNGYLVNEKLQLNDKLEETCKRT 941
             N G KKRK  + +  V      +  Q +I   +    N K   ND  EE CK T
Sbjct: 228 LRNFGRKKRKLAIDDHHVMSLASSDHHQSLIMPSSDEGQNLK---NDSREEKCKGT 280


>ref|XP_002515870.1| conserved hypothetical protein [Ricinus communis]
           gi|223545025|gb|EEF46539.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 268

 Score =  125 bits (315), Expect = 4e-26
 Identities = 90/254 (35%), Positives = 128/254 (50%), Gaps = 27/254 (10%)
 Frame = +3

Query: 183 KSGSNWLDRLHSSKGFSFADNSNGSPNTEXXXXXXXXXXXXTGSESSCDPIGPVTEPVLH 362
           KSGSNWLDRL S+KGF   +N +                    SES+      VT     
Sbjct: 10  KSGSNWLDRLRSTKGFPATENLD--------LDNFLSDPSLPNSESTQSLNRRVTS---- 57

Query: 363 PDQTPVAPDNSRDN--KELCNVVTNVLSELFCMGGESTKFPKFNVKRGSRKQTNPRFCAS 536
            DQT + PD  R+N  +E   VVTNVL +LF MG    K  + + K+ SRKQTNP+F  +
Sbjct: 58  -DQTEI-PDTLRENGEREWFGVVTNVLCDLFNMGDSQDKNSRISGKKSSRKQTNPKFFDA 115

Query: 537 SKIN-------------NADEQSD------KCRVDIKDSQVKLLEQSHNLNLAEEDEEKS 659
             +              ++D  S+       C VD  D     L++       ++++  S
Sbjct: 116 DSVRKEEYVQAATTASFHSDNNSNVVGMNADCFVDDDDEYNGKLDE-------KKEKSSS 168

Query: 660 HANLMGYSRTEVTVIDTSCAPWKFEKLLFRKKNVWKVRDKRSKTMNLGKKRK------TD 821
              L GYS++EVTVIDTS   WKF+KL+FR+K++WKVRDK+ K+ N   K++      + 
Sbjct: 169 DKELKGYSKSEVTVIDTSFEVWKFDKLVFRRKSIWKVRDKKGKSWNFASKKRKGNHLESA 228

Query: 822 MTNEDVGGEKKQKV 863
             N +V  +KK K+
Sbjct: 229 TNNGNVSSKKKAKM 242


>emb|CAN80175.1| hypothetical protein VITISV_018394 [Vitis vinifera]
          Length = 420

 Score =  113 bits (283), Expect(2) = 5e-24
 Identities = 80/235 (34%), Positives = 118/235 (50%), Gaps = 28/235 (11%)
 Frame = +3

Query: 321 SCDPIGPVTEPVLHPDQTPVAPDNSR----------DNKELCNVVTNVLSELFCMGGEST 470
           +C  +     P+ +P   P+AP  SR            KE   +++NVL+ELF MG +S 
Sbjct: 142 TCPILQSPNPPIPNPYPIPLAPMKSRFKIGASRRKTGEKEWFGIMSNVLAELFNMG-DSN 200

Query: 471 KFPKFNVKRGSRKQTNPRFCASSKINNADE-------QSDKCRVDIKDS--QVKLLEQSH 623
           + PK + K+ SRKQTNP+ C  S +   DE         D    ++KDS  +VK + Q  
Sbjct: 201 QIPKLSGKKSSRKQTNPKICLLSSVRQEDEVPATAPSSGDNSLTEMKDSNGEVKTVNQG- 259

Query: 624 NLNLAEEDEEKSHANLMGYSRTEVTVIDTSCAPWKFEKLLFRKKNVWKVRDKRSKTMNLG 803
            ++  + +EEK + +L  YSR+             FEKLLFRKKNVWKVRDK+ K+ ++G
Sbjct: 260 KVDCLDAEEEKCNQDLSAYSRS-------------FEKLLFRKKNVWKVRDKKGKSRSIG 306

Query: 804 -KKRKTDMTNEDVGGEKKQKVI--------SRHNGYLVNEKLQLNDKLEETCKRT 941
            KKRK    +E +   KK K+            +    NE+   ++  +E CK T
Sbjct: 307 RKKRKASECDEQLEARKKMKLSVESFKERNEEESAMPSNEEQNPHNAKKEECKET 361



 Score = 26.2 bits (56), Expect(2) = 5e-24
 Identities = 17/40 (42%), Positives = 20/40 (50%), Gaps = 1/40 (2%)
 Frame = +1

Query: 97  SNGVSVNPRPISYYN-QLIYPQCSVPFLPRNPVQIGSTGS 213
           +NG+   PR     + Q    QCSV   P NPV  GST S
Sbjct: 79  ANGLRGRPRVSDLEDEQQQSEQCSVRSPPENPVPSGSTAS 118


>ref|XP_006394704.1| hypothetical protein EUTSA_v10005511mg [Eutrema salsugineum]
           gi|557091343|gb|ESQ31990.1| hypothetical protein
           EUTSA_v10005511mg [Eutrema salsugineum]
          Length = 332

 Score =  115 bits (287), Expect = 7e-23
 Identities = 85/250 (34%), Positives = 129/250 (51%), Gaps = 20/250 (8%)
 Frame = +3

Query: 171 ISTQKSGSNWLDRLHSSKGFSFADN--SNGSPNTEXXXXXXXXXXXXTGSESSCDPIGPV 344
           I  +   S WLDRL  S+G S  D+  ++G+P +             TG  +S  P    
Sbjct: 7   IDDKPVASTWLDRLRLSRGLSTTDDDDASGNPLSLDDFLRRNYHNEITGDPASDSP---P 63

Query: 345 TEPVLHPDQTPVAPDNSRDNKELCNVVTNVLSELFCMGGESTKFPKFNVKRGSRKQTNPR 524
           + P+L   + P  P +    +E   V+++VLSELF  GG S++      K+  RKQ+NPR
Sbjct: 64  SAPILSALELPEIPLDPNPGEEWYGVMSDVLSELFNFGG-SSRSSTIPGKKLPRKQSNPR 122

Query: 525 FCA---------------SSKINNADEQSDKCRVDI-KDSQVKLLEQSHNLNLAE--EDE 650
            C+               S+ +  A E +   R    K    +  E+  ++  A+  E+E
Sbjct: 123 HCSVETLADVPLLNQKRDSNCLPGAREFATSSRSSYNKKPAPEKRERRRSVAEADGVEEE 182

Query: 651 EKSHANLMGYSRTEVTVIDTSCAPWKFEKLLFRKKNVWKVRDKRSKTMNLGKKRKTDMTN 830
           E+   +L+G+SR+EVTVIDTS   WK EKL+FR++NVWKVRDKR K+  +  K+KT    
Sbjct: 183 ERGEKDLVGFSRSEVTVIDTSFKIWKSEKLVFRRRNVWKVRDKRGKSRVVSSKKKTMKKL 242

Query: 831 EDVGGEKKQK 860
           +    +KK+K
Sbjct: 243 KKKKKKKKRK 252


>ref|XP_006581687.1| PREDICTED: uncharacterized protein LOC100776590 isoform X2 [Glycine
           max]
          Length = 233

 Score =  114 bits (286), Expect = 9e-23
 Identities = 76/226 (33%), Positives = 109/226 (48%), Gaps = 8/226 (3%)
 Frame = +3

Query: 159 MLCSISTQKSGSNWLDRLHSSKGFSFADNSNGSPNTEXXXXXXXXXXXXTGSESSCDPI- 335
           MLCS  T KSG NWLDRL S+KG                          TG E   D   
Sbjct: 1   MLCSPQTGKSGLNWLDRLRSNKGIP------------------------TGDEPDLDSFL 36

Query: 336 --GPVTEPVLHPDQTPVAPDNSRDNKELCNVVTNVLSELFCMGGESTKFPKFNVKRGSRK 509
              P   P   P+  P+ P +   ++ +   ++ +L+ELFCMG   +K      K+  RK
Sbjct: 37  LSAPPQSPQARPNDPPLNPPSVARDEPM--PMSTILAELFCMGATLSK----TNKKCPRK 90

Query: 510 QTNPRFCASSKINNADEQSDKCRVDIK----DSQVKLLEQSHNLNLAEEDEEKSHAN-LM 674
           QTNP+   +S        S K    +      S   L+ +  +   A+ DE++   N L 
Sbjct: 91  QTNPKIFLASSAAATTTTSSKSSAPVPAPAAPSSDALVPEVEDEPAADRDEDEEEGNELK 150

Query: 675 GYSRTEVTVIDTSCAPWKFEKLLFRKKNVWKVRDKRSKTMNLGKKR 812
           G++++EVTVIDTSC  WK +K +FRK NVWKVR+++ K   L K++
Sbjct: 151 GFTKSEVTVIDTSCPGWKVDKFVFRKNNVWKVRERKPKNRFLAKRK 196


>ref|XP_003527999.1| PREDICTED: uncharacterized protein LOC100776590 isoform X1 [Glycine
           max]
          Length = 238

 Score =  114 bits (286), Expect = 9e-23
 Identities = 76/226 (33%), Positives = 109/226 (48%), Gaps = 8/226 (3%)
 Frame = +3

Query: 159 MLCSISTQKSGSNWLDRLHSSKGFSFADNSNGSPNTEXXXXXXXXXXXXTGSESSCDPI- 335
           MLCS  T KSG NWLDRL S+KG                          TG E   D   
Sbjct: 1   MLCSPQTGKSGLNWLDRLRSNKGIP------------------------TGDEPDLDSFL 36

Query: 336 --GPVTEPVLHPDQTPVAPDNSRDNKELCNVVTNVLSELFCMGGESTKFPKFNVKRGSRK 509
              P   P   P+  P+ P +   ++ +   ++ +L+ELFCMG   +K      K+  RK
Sbjct: 37  LSAPPQSPQARPNDPPLNPPSVARDEPM--PMSTILAELFCMGATLSK----TNKKCPRK 90

Query: 510 QTNPRFCASSKINNADEQSDKCRVDIK----DSQVKLLEQSHNLNLAEEDEEKSHAN-LM 674
           QTNP+   +S        S K    +      S   L+ +  +   A+ DE++   N L 
Sbjct: 91  QTNPKIFLASSAAATTTTSSKSSAPVPAPAAPSSDALVPEVEDEPAADRDEDEEEGNELK 150

Query: 675 GYSRTEVTVIDTSCAPWKFEKLLFRKKNVWKVRDKRSKTMNLGKKR 812
           G++++EVTVIDTSC  WK +K +FRK NVWKVR+++ K   L K++
Sbjct: 151 GFTKSEVTVIDTSCPGWKVDKFVFRKNNVWKVRERKPKNRFLAKRK 196

