BLASTX nr result

ID: Catharanthus22_contig00018865 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00018865
         (1105 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006347514.1| PREDICTED: uncharacterized protein LOC102588...   165   3e-38
ref|XP_004235021.1| PREDICTED: uncharacterized protein LOC101249...   159   2e-36
gb|EOY23702.1| Uncharacterized protein isoform 2 [Theobroma cacao]    149   1e-33
gb|EOY23701.1| Uncharacterized protein isoform 1 [Theobroma cacao]    149   3e-33
ref|XP_003634173.1| PREDICTED: uncharacterized protein LOC100853...   147   1e-32
ref|XP_002524424.1| conserved hypothetical protein [Ricinus comm...   144   5e-32
ref|XP_002515870.1| conserved hypothetical protein [Ricinus comm...   137   8e-30
ref|XP_006441238.1| hypothetical protein CICLE_v10020653mg [Citr...   135   2e-29
ref|XP_006493204.1| PREDICTED: uncharacterized protein LOC102614...   134   5e-29
gb|EMJ22370.1| hypothetical protein PRUPE_ppa021823mg [Prunus pe...   133   1e-28
ref|XP_004307917.1| PREDICTED: uncharacterized protein LOC101313...   115   4e-23
emb|CAN80175.1| hypothetical protein VITISV_018394 [Vitis vinifera]    94   6e-22
gb|EXB78390.1| hypothetical protein L484_003252 [Morus notabilis]     109   2e-21
ref|XP_002874202.1| hypothetical protein ARALYDRAFT_326742 [Arab...   103   2e-19
ref|XP_006286735.1| hypothetical protein CARUB_v10002983mg [Caps...   102   2e-19
ref|NP_197838.2| uncharacterized protein [Arabidopsis thaliana] ...   102   3e-19
dbj|BAB11202.1| unnamed protein product [Arabidopsis thaliana]        101   5e-19
ref|XP_006394704.1| hypothetical protein EUTSA_v10005511mg [Eutr...   100   1e-18
ref|XP_006374085.1| hypothetical protein POPTR_0015s00740g [Popu...   100   1e-18
ref|XP_002321364.2| hypothetical protein POPTR_0015s00740g [Popu...   100   1e-18

>ref|XP_006347514.1| PREDICTED: uncharacterized protein LOC102588139 [Solanum tuberosum]
          Length = 348

 Score =  165 bits (418), Expect = 3e-38
 Identities = 123/325 (37%), Positives = 152/325 (46%), Gaps = 22/325 (6%)
 Frame = +1

Query: 187  MLCSISATGKSSSKWLDRLRSSKGFPSGADLDLEQFLSNN----------QNAXXXXXXX 336
            MLCSIS    + S WLDRLRSSKGF    + +LEQF+++                     
Sbjct: 1    MLCSISTQKSAGSNWLDRLRSSKGFSFADNRNLEQFITHQTPNGSDSLPPSTETEIRDSN 60

Query: 337  XXXXXXXXXXXXXXXNDPAVHDNQTFEAQNPDGDT-GFFSVVSNVLAELFVMGSSNGLPK 513
                           N+P +H +Q   A +  GD     SVV+NVL+ELF MG S   PK
Sbjct: 61   NNIGSESSSDPIRPVNEPVLHRDQAPAAPHNSGDNEELCSVVTNVLSELFCMGESTSFPK 120

Query: 514  VRGKKSSRKQPNPKFCASLDCPESNTNLENNSTRTESAARKNESASPMSDDNSGVGMAKE 693
               K+ SRKQ NP+FCAS +         N+    E   RK E+ S    D   V +   
Sbjct: 121  FSVKRGSRKQTNPRFCASSEI--------NSDAVVEGGQRKEETESL---DKCRVEIK-- 167

Query: 694  CSCDKQLKLVEY-VNCGDEEEKEKGYMNFSGFSRTEVTVIDSSFATWKFEKMLFRKKNVW 870
               D Q+KL+E   N    EE++K   N  GFSRTEV VID+S A WKFEK+LFRKKNVW
Sbjct: 168  ---DSQVKLLEQGHNLNLAEEEDKSNANLMGFSRTEVMVIDTSCAPWKFEKLLFRKKNVW 224

Query: 871  KVRDXXXXXXXXXXXXRKASASFDDQISDGTEEKKQKFLHGRCSLSKKG--CGGAPHEDY 1044
            KVRD            RKA  + +    D   EKKQKF+ G    + KG  C  +  E  
Sbjct: 225  KVRDKKSKTLNWGKKKRKADVTSE----DARGEKKQKFISGHDGYAAKGRECKSSVSEKL 280

Query: 1045 HQPGK--------SDMYSNLSKKNQ 1095
                K        SD     SKK Q
Sbjct: 281  QLDDKSEGTCKRTSDSVGQASKKKQ 305


>ref|XP_004235021.1| PREDICTED: uncharacterized protein LOC101249438 [Solanum
            lycopersicum]
          Length = 345

 Score =  159 bits (401), Expect = 2e-36
 Identities = 120/327 (36%), Positives = 152/327 (46%), Gaps = 24/327 (7%)
 Frame = +1

Query: 187  MLCSISATGKSSSKWLDRLRSSKGFPSGADLDLEQFL------------SNNQNAXXXXX 330
            MLCSIS    + S WLDRLRSSKGF    + +LEQFL            S+ +       
Sbjct: 1    MLCSISTQKSAGSNWLDRLRSSKGFSFADNRNLEQFLTHQTPNGSDSLPSSTETEIRDSN 60

Query: 331  XXXXXXXXXXXXXXXXXNDPAVHDNQTFEAQNPDGDTG-FFSVVSNVLAELFVMGSSNGL 507
                             N+  +  +Q   A +  GD     SVV+NVL++LF MG S   
Sbjct: 61   NKDNTGSESSSDPIRPVNESVLPRDQAPAASHNSGDNEELCSVVTNVLSDLFCMGESTSF 120

Query: 508  PKVRGKKSSRKQPNPKFCASLDCPESNTNLENNSTRTESAARKNESASPMSDDNSGVGMA 687
            PK+  K+ SRKQ NP+FCAS +         N     E   RK E+ S    D   V + 
Sbjct: 121  PKLSVKRGSRKQTNPRFCASSEI--------NGDAVVEGGQRKEETESL---DKCRVEIK 169

Query: 688  KECSCDKQLKLVEYV-NCGDEEEKEKGYMNFSGFSRTEVTVIDSSFATWKFEKMLFRKKN 864
                 D Q+KL+E   N    EE++K   N  GFSRTEV VID+S A WKFEK+LFRKKN
Sbjct: 170  -----DSQVKLLEEGHNLNLAEEEDKSNANLMGFSRTEVMVIDTSCAPWKFEKLLFRKKN 224

Query: 865  VWKVRDXXXXXXXXXXXXRKASASFDDQISDGTEEKKQKFLHGRCSLSKKG--CGGAPHE 1038
            VWKVRD            RK   + +    D   EKK+KF+ G    ++KG  C  +  E
Sbjct: 225  VWKVRDKKSKTLNLGKKKRKVDVTSE----DARGEKKRKFISGHNGYAEKGRECKSSVSE 280

Query: 1039 DYHQPGK--------SDMYSNLSKKNQ 1095
                  K        SD +   SKK Q
Sbjct: 281  KLQLDDKLEGTCKRTSDSFGQASKKKQ 307


>gb|EOY23702.1| Uncharacterized protein isoform 2 [Theobroma cacao]
          Length = 355

 Score =  149 bits (377), Expect = 1e-33
 Identities = 112/316 (35%), Positives = 147/316 (46%), Gaps = 25/316 (7%)
 Frame = +1

Query: 187  MLCSISATGKSSSKWLDRLRSSKGFPSGADLDLEQFLSNNQNAXXXXXXXXXXXXXXXXX 366
            MLCSIS TGKS S WLDRLRSSKGFP+G +LDL+ FL+N   +                 
Sbjct: 1    MLCSIS-TGKSGSNWLDRLRSSKGFPTGDNLDLDHFLTNPNPSDSPITDASNSPNS---- 55

Query: 367  XXXXXNDPAVHDNQTFEAQN------------PDGDTGFFSVVSNVLAELFVMGSSNGLP 510
                 N  + H N   E QN            P GD  +F ++SNVL+ELF MG      
Sbjct: 56   -----NSESTHSNDK-ELQNRKAPPPEVVSSEPAGDKEWFGIMSNVLSELFNMGDQAQTS 109

Query: 511  KVRGKKSSRKQPNPKFCASLDCPESNTNLENNSTRTESAARKNE----SASPMSDDNSGV 678
            +   KK+SRKQ NPK C       SN N       +  + RK+E    S + ++      
Sbjct: 110  RFSRKKTSRKQTNPKICI---IKTSNVNTSEEQKSSSDSVRKDENIPASTTSLNSKEEAK 166

Query: 679  GMAKECSCDKQLKLVEYVNCGDEEEKEKGYMNFSGFSRTEVTVIDSSFATWKFEKMLFRK 858
               KE   D  ++  E      EEE  KG     G+SR+EVTVID+S   WK +K++FR+
Sbjct: 167  REWKEEGDDYNVEEEE-----QEEENGKGERELLGYSRSEVTVIDTSCEVWKVDKLIFRR 221

Query: 859  KNVWKVRDXXXXXXXXXXXXRKA-----SASFDDQISDGTEEKKQKF----LHGRCSLSK 1011
            KN+WKV+D            RKA       S+DD  + G   KK+K     L      S 
Sbjct: 222  KNIWKVKDKKGKSRIVGRKKRKAPPPPPPPSYDDN-NGGVWNKKRKISSSELRSLKDTSG 280

Query: 1012 KGCGGAPHEDYHQPGK 1059
            K  G   +   + PG+
Sbjct: 281  KESGSPTNHGQNAPGE 296


>gb|EOY23701.1| Uncharacterized protein isoform 1 [Theobroma cacao]
          Length = 353

 Score =  149 bits (375), Expect = 3e-33
 Identities = 112/315 (35%), Positives = 149/315 (47%), Gaps = 24/315 (7%)
 Frame = +1

Query: 187  MLCSISATGKSSSKWLDRLRSSKGFPSGADLDLEQFLSNNQNAXXXXXXXXXXXXXXXXX 366
            MLCSIS TGKS S WLDRLRSSKGFP+G +LDL+ FL+N   +                 
Sbjct: 1    MLCSIS-TGKSGSNWLDRLRSSKGFPTGDNLDLDHFLTNPNPSDSPITDASNSPNS---- 55

Query: 367  XXXXXNDPAVHDNQTFEAQN------------PDGDTGFFSVVSNVLAELFVMGSSNGLP 510
                 N  + H N   E QN            P GD  +F ++SNVL+ELF MG      
Sbjct: 56   -----NSESTHSNDK-ELQNRKAPPPEVVSSEPAGDKEWFGIMSNVLSELFNMGDQAQTS 109

Query: 511  KVRGKKSSRKQPNPKFCASLDCPESNTNLENNSTRTESAARKNE----SASPMSDDNSGV 678
            +   KK+SRKQ NPK C       SN N       +  + RK+E    S + ++      
Sbjct: 110  RFSRKKTSRKQTNPKICI---IKTSNVNTSEEQKSSSDSVRKDENIPASTTSLNSKEEAK 166

Query: 679  GMAKECSCDKQLKLVEYVNCGDEEEKEKGYMNFSGFSRTEVTVIDSSFATWKFEKMLFRK 858
               KE   D  ++  E      EEE  KG     G+SR+EVTVID+S   WK +K++FR+
Sbjct: 167  REWKEEGDDYNVEEEE-----QEEENGKGERELLGYSRSEVTVIDTSCEVWKVDKLIFRR 221

Query: 859  KNVWKVRDXXXXXXXXXXXXRKA-----SASFDDQISDGTEEKKQKFLHGRCSLSKKGCG 1023
            KN+WKV+D            RKA       S+DD  + G   KK+K         K   G
Sbjct: 222  KNIWKVKDKKGKSRIVGRKKRKAPPPPPPPSYDDN-NGGVWNKKRKISSSELRSLKDTSG 280

Query: 1024 ---GAPHEDYHQPGK 1059
               G+P  +++ PG+
Sbjct: 281  KESGSP-TNHNAPGE 294


>ref|XP_003634173.1| PREDICTED: uncharacterized protein LOC100853133 [Vitis vinifera]
          Length = 985

 Score =  147 bits (370), Expect = 1e-32
 Identities = 100/242 (41%), Positives = 127/242 (52%)
 Frame = +1

Query: 226 KWLDRLRSSKGFPSGADLDLEQFLSNNQNAXXXXXXXXXXXXXXXXXXXXXXNDPAVHDN 405
           +WLDRLRS+KGFP+G D DLE FL++                            P    +
Sbjct: 165 EWLDRLRSAKGFPTGNDDDLEHFLTHRDPNLSNSPITKPSDPKSISDSTCSDEKPVQDRS 224

Query: 406 QTFEAQNPDGDTGFFSVVSNVLAELFVMGSSNGLPKVRGKKSSRKQPNPKFCASLDCPES 585
           Q  E     G+  +F ++SNVLAELF MG SN +PK+ GKKSSRKQ NPK C        
Sbjct: 225 QPPET----GEKEWFGIMSNVLAELFNMGDSNQIPKLSGKKSSRKQTNPKICLL------ 274

Query: 586 NTNLENNSTRTESAARKNESASPMSDDNSGVGMAKECSCDKQLKLVEYVNCGDEEEKEKG 765
                 +S R E       + +P S DNS   M K+ + + +      V+C D EE EK 
Sbjct: 275 ------SSVRQEDEV---PATAPSSGDNSLTEM-KDSNGEVKTVNQGKVDCLDAEE-EKC 323

Query: 766 YMNFSGFSRTEVTVIDSSFATWKFEKMLFRKKNVWKVRDXXXXXXXXXXXXRKASASFDD 945
             + S +SR+EVTVID+S A WKFEK+LFRKKNVWKVRD            RKAS   D+
Sbjct: 324 NQDLSAYSRSEVTVIDTSCAVWKFEKLLFRKKNVWKVRDKKGKSRSIGRKKRKAS-ECDE 382

Query: 946 QI 951
           Q+
Sbjct: 383 QL 384


>ref|XP_002524424.1| conserved hypothetical protein [Ricinus communis]
           gi|223536308|gb|EEF37959.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 272

 Score =  144 bits (364), Expect = 5e-32
 Identities = 98/268 (36%), Positives = 134/268 (50%), Gaps = 3/268 (1%)
 Frame = +1

Query: 187 MLCSISATGKSSSKWLDRLRSSKGFPSGADLDLEQFLSNNQNAXXXXXXXXXXXXXXXXX 366
           MLCS+SA  KS S WLDRLRS+KGFP+  +LDL+ FLSN+                    
Sbjct: 1   MLCSVSAGTKSGSNWLDRLRSTKGFPATENLDLDNFLSNSS-----------LLNPSISE 49

Query: 367 XXXXXNDPAVHDNQTF-EAQNPDGDTGFFSVVSNVLAELFVMGSSNGL-PKVRGKKSSRK 540
                N     D   F +  + +G+  +F +V+NVL +LF MG S     ++ G KSSRK
Sbjct: 50  STLSHNKRVTSDQTQFPDTSSENGEKEWFGLVTNVLCDLFNMGDSQDKNSRLSGTKSSRK 109

Query: 541 QPNPKFCASLDCPESNTNLENNSTRTESAARKNESASPMSDDNSGV-GMAKECSCDKQLK 717
           Q NPKF             +  S R E   +    AS  SD+NS V GM  +C  +    
Sbjct: 110 QTNPKF------------FDIESVRKEECVQVATPASFRSDNNSNVVGMNADCFSNDDDN 157

Query: 718 LVEYVNCGDEEEKEKGYMNFSGFSRTEVTVIDSSFATWKFEKMLFRKKNVWKVRDXXXXX 897
                N  +E+EK        G+S++EVTVID+SF  WKF+K++FR+KN+WKVRD     
Sbjct: 158 -----NVDEEKEKCSSDKELKGYSKSEVTVIDTSFEMWKFDKLVFRRKNIWKVRDKKGKS 212

Query: 898 XXXXXXXRKASASFDDQISDGTEEKKQK 981
                  RK +   +  I +G    K+K
Sbjct: 213 WSFSSKKRKGN-QLESAIGNGNVGCKKK 239


>ref|XP_002515870.1| conserved hypothetical protein [Ricinus communis]
            gi|223545025|gb|EEF46539.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 268

 Score =  137 bits (345), Expect = 8e-30
 Identities = 103/297 (34%), Positives = 147/297 (49%), Gaps = 20/297 (6%)
 Frame = +1

Query: 196  SISATGKSSSKWLDRLRSSKGFPSGADLDLEQFLSNNQNAXXXXXXXXXXXXXXXXXXXX 375
            S+ A  KS S WLDRLRS+KGFP+  +LDL+ FLS                         
Sbjct: 4    SVFAGNKSGSNWLDRLRSTKGFPATENLDLDNFLS------------------------- 38

Query: 376  XXNDPAVHDNQTFEAQN----------PD-----GDTGFFSVVSNVLAELFVMGSSNGL- 507
               DP++ ++++ ++ N          PD     G+  +F VV+NVL +LF MG S    
Sbjct: 39   ---DPSLPNSESTQSLNRRVTSDQTEIPDTLRENGEREWFGVVTNVLCDLFNMGDSQDKN 95

Query: 508  PKVRGKKSSRKQPNPKFCASLDCPESNTNLENNSTRTESAARKNESASPMSDDNSGV-GM 684
             ++ GKKSSRKQ NPKF             + +S R E   +   +AS  SD+NS V GM
Sbjct: 96   SRISGKKSSRKQTNPKF------------FDADSVRKEEYVQAATTASFHSDNNSNVVGM 143

Query: 685  AKECSCDKQLKLVEYVNCGDEE-EKEKGYMNFSGFSRTEVTVIDSSFATWKFEKMLFRKK 861
              +C  D      EY    DE+ EK        G+S++EVTVID+SF  WKF+K++FR+K
Sbjct: 144  NADCFVDDD---DEYNGKLDEKKEKSSSDKELKGYSKSEVTVIDTSFEVWKFDKLVFRRK 200

Query: 862  NVWKVRDXXXXXXXXXXXXRKASASFDDQISDG--TEEKKQKFLHGRCSLSKKGCGG 1026
            ++WKVRD            RK +   +   ++G  + +KK K      + SK+  GG
Sbjct: 201  SIWKVRDKKGKSWNFASKKRKGN-HLESATNNGNVSSKKKAKMSDSEFASSKESNGG 256


>ref|XP_006441238.1| hypothetical protein CICLE_v10020653mg [Citrus clementina]
           gi|557543500|gb|ESR54478.1| hypothetical protein
           CICLE_v10020653mg [Citrus clementina]
          Length = 374

 Score =  135 bits (341), Expect = 2e-29
 Identities = 99/288 (34%), Positives = 145/288 (50%), Gaps = 7/288 (2%)
 Frame = +1

Query: 142 KNSIEFQ---IYPRHFTT--MLCSISATGKSSSKWLDRLRSSKGFPSGADLDLEQFLSNN 306
           K++++F    I  ++++T  M+CS+S TGKS S WLDRLRS+KGFP G DL+L+ FL N 
Sbjct: 11  KSTVQFPEQTILGKYWSTSAMICSMS-TGKSCSNWLDRLRSNKGFPVGDDLELDHFLENK 69

Query: 307 QNAXXXXXXXXXXXXXXXXXXXXXXNDPAVHDNQTFEAQNPDGDTGFFSVVSNVLAELFV 486
            +                       N     +    E +N D    +F +++NVL++LF+
Sbjct: 70  DS----------NLKPKSNSSESTQNRKVATEEICGENENGDDKGEWFGIMNNVLSDLFI 119

Query: 487 MGSSNGLP--KVRGKKSSRKQPNPKFCASLDCPESNTNLENNSTRTESAARKNESASPMS 660
           MG SN     K   KK SRKQ NPKFC       SN   E +    E   RK+E+A   +
Sbjct: 120 MGESNDDQSCKFSRKKISRKQTNPKFCLVSRMTSSNVEEEQSCGGCE---RKDENAQIEN 176

Query: 661 DDNSGVGMAKECSCDKQLKLVEYVNCGDEEEKEKGYMNFSGFSRTEVTVIDSSFATWKFE 840
                  + +E   ++ +  V  +  G+ EE         G+SR EVTVID+S   WKFE
Sbjct: 177 K------LKEEVDGEENVNNVVEMEDGEREE-------LLGYSRNEVTVIDTSCTEWKFE 223

Query: 841 KMLFRKKNVWKVRDXXXXXXXXXXXXRKASASFDDQISDGTEEKKQKF 984
           K+++RK+NVWKVR+            +K  A+     +D   + K+KF
Sbjct: 224 KLVYRKRNVWKVREKKGKSRMIGLGRKKRKANG----ADANVDTKKKF 267


>ref|XP_006493204.1| PREDICTED: uncharacterized protein LOC102614232 [Citrus sinensis]
          Length = 376

 Score =  134 bits (338), Expect = 5e-29
 Identities = 98/288 (34%), Positives = 145/288 (50%), Gaps = 7/288 (2%)
 Frame = +1

Query: 142 KNSIEFQ---IYPRHFTT--MLCSISATGKSSSKWLDRLRSSKGFPSGADLDLEQFLSNN 306
           K++++F    I  ++++T  M+CS+S TGKS S WLDRLRS+KGFP G DL+L+ FL N 
Sbjct: 11  KSTVQFPEQTILGKYWSTSAMICSMS-TGKSCSNWLDRLRSNKGFPVGDDLELDHFLENK 69

Query: 307 QNAXXXXXXXXXXXXXXXXXXXXXXNDPAVHDNQTFEAQNPDGDTGFFSVVSNVLAELFV 486
            +                       N  A  +    E +N D    +F +++NVL++LF+
Sbjct: 70  DS----------NLKSKSNSSESTQNRKAATEEICGENENGDDKGEWFGIMNNVLSDLFI 119

Query: 487 MGSSNGLP--KVRGKKSSRKQPNPKFCASLDCPESNTNLENNSTRTESAARKNESASPMS 660
           MG SN     K   KK SRKQ NPKFC       SN   E +    E   RK+E+A   +
Sbjct: 120 MGESNDDQSCKFSRKKISRKQTNPKFCLVSRMTSSNVEEEQSCGGCE---RKDENAQIEN 176

Query: 661 DDNSGVGMAKECSCDKQLKLVEYVNCGDEEEKEKGYMNFSGFSRTEVTVIDSSFATWKFE 840
                  + +E   ++ +     +  G+ +E         G+SR EVTVID+S   WKFE
Sbjct: 177 K------LKEEVDGEENVNNAVEMEDGERDE-------LLGYSRNEVTVIDTSCTEWKFE 223

Query: 841 KMLFRKKNVWKVRDXXXXXXXXXXXXRKASASFDDQISDGTEEKKQKF 984
           K+++RK+NVWKVR+            +K  A+     +D   + K+KF
Sbjct: 224 KLVYRKRNVWKVREKKGKSRMIGLGRKKRKANG----ADANVDTKKKF 267


>gb|EMJ22370.1| hypothetical protein PRUPE_ppa021823mg [Prunus persica]
          Length = 723

 Score =  133 bits (335), Expect = 1e-28
 Identities = 96/282 (34%), Positives = 141/282 (50%), Gaps = 17/282 (6%)
 Frame = +1

Query: 187 MLCSISATGKSSSKWLDRLRSSKGFPSGADLDLEQFLSNNQNAXXXXXXXXXXXXXXXXX 366
           MLCS+ A+ KS S WLDRLRS+KG P+G +LDL+ FLS N N+                 
Sbjct: 1   MLCSVPAS-KSGSNWLDRLRSNKGLPTGDNLDLDHFLSRNTNSSSEVPTPNVSSSTESTR 59

Query: 367 XXXXXNDPAVHDNQTFEAQNPDGDTGFFSVVSNVLAELFVMGSSNGLPKVRGKKSSRKQP 546
                +D  V+ + T      +    F  +V+NVL+ELF MG S+   K+ GKK  RKQ 
Sbjct: 60  PG---SDRVVNQSTTSCPNRDNQGEAFIGLVNNVLSELFFMGGSDERSKLLGKKIRRKQA 116

Query: 547 NPKFCASLDCPESNTNLENNSTRTESAARKNESASPMSDDNSGVGMAKECSCDKQ----L 714
           NP+ C +     S  N ++N+  T +A  +  S    +D++    + K    D Q    +
Sbjct: 117 NPRVCVT-----STANYDSNAA-TANATEEKSSDWGRNDEHV---LDKAACLDSQNGSLM 167

Query: 715 KLVEYVNCG---------DEEEKEKGYMNFSGFSRTEVTVIDSSFATWKFEKMLFRKKNV 867
           K  +  N G         +EEE+++      G+S +EVTVID+S   WK EK++FR+KNV
Sbjct: 168 KNKDLGNVGGEEGEEVEEEEEEEKEELRELKGYSISEVTVIDTSCGVWKTEKVVFRRKNV 227

Query: 868 WKVRDXXXXXXXXXXXXRKASASFDDQI----SDGTEEKKQK 981
           WKVR+            RK     D+++     D  ++KK K
Sbjct: 228 WKVREKKAKVRKFGRRKRKV---VDEEVGVEGGDDIDKKKAK 266


>ref|XP_004307917.1| PREDICTED: uncharacterized protein LOC101313650 [Fragaria vesca
           subsp. vesca]
          Length = 323

 Score =  115 bits (287), Expect = 4e-23
 Identities = 84/268 (31%), Positives = 124/268 (46%), Gaps = 3/268 (1%)
 Frame = +1

Query: 187 MLCSISATGKSSSKWLDRLRSSKGFPSGADLDLEQFLSNNQNAXXXXXXXXXXXXXXXXX 366
           MLCS+ AT KS   WLDRLRS+KGFP+  +LDL+ FL +N  +                 
Sbjct: 1   MLCSVRAT-KSGPNWLDRLRSNKGFPACDNLDLDHFLKHNPTS------------SSESP 47

Query: 367 XXXXXNDPAVHDNQTFEAQNPDGDTG--FFSVVSNVLAELFVMGSSNGLPKVRGKKSSRK 540
                + P V +         D   G     ++S  ++ELF +  S    ++ GKK  RK
Sbjct: 48  NPNADSTPLVSNRPESSGPTRDAKKGEALLGLMSTAISELFFIDGSEESSRLSGKKVPRK 107

Query: 541 QPNPKFCASLDCPESNTNLENNSTRTESAARKNESASPMSDDNSGVGMAKECSCDKQLKL 720
           Q +P+ C +                    ++   S S  +D N              L+ 
Sbjct: 108 QTHPRLCVT--------------------SKLKSSGSIGNDVN-------------DLRT 134

Query: 721 VEYVNCGDEEE-KEKGYMNFSGFSRTEVTVIDSSFATWKFEKMLFRKKNVWKVRDXXXXX 897
           V  +N  +E E +E+G     G+S++EVTVID+S   WK EK++FR+K+VWKVR+     
Sbjct: 135 VPSLNSKNEVELEERGERELKGYSKSEVTVIDTSCEVWKTEKLVFRRKSVWKVREKKSKV 194

Query: 898 XXXXXXXRKASASFDDQISDGTEEKKQK 981
                  RK   S D++  DG EEK++K
Sbjct: 195 RSFGRNKRKV-VSGDEEGDDGIEEKRKK 221


>emb|CAN80175.1| hypothetical protein VITISV_018394 [Vitis vinifera]
          Length = 420

 Score = 93.6 bits (231), Expect(2) = 6e-22
 Identities = 81/226 (35%), Positives = 107/226 (47%), Gaps = 2/226 (0%)
 Frame = +1

Query: 433  GDTGFFSVVSNVLAELFVMGSSNGLPKVRGKKSSRKQPNPKFCASLDCPESNTNLENNST 612
            G+  +F ++SNVLAELF MG SN +PK+ GKKSSRKQ NPK C              +S 
Sbjct: 178  GEKEWFGIMSNVLAELFNMGDSNQIPKLSGKKSSRKQTNPKICLL------------SSV 225

Query: 613  RTESAARKNESASPMSDDNSGVGMAKECSCDKQLKLVEYVNCGDEEEKEKGYMNFSGFSR 792
            R E       + +P S DNS   M K+ + + +      V+C D EE EK   + S +SR
Sbjct: 226  RQEDEV---PATAPSSGDNSLTEM-KDSNGEVKTVNQGKVDCLDAEE-EKCNQDLSAYSR 280

Query: 793  TEVTVIDSSFATWKFEKMLFRKKNVWKVRDXXXXXXXXXXXXRKASASFDDQISDGTEEK 972
            +             FEK+LFRKKNVWKVRD            RKAS   D+Q+      K
Sbjct: 281  S-------------FEKLLFRKKNVWKVRDKKGKSRSIGRKKRKAS-ECDEQLE---ARK 323

Query: 973  KQKFLHGRCSLSKKGCGGAPHEDYHQPGKSDMYSNLSKKNQ--ETS 1104
            K K       LS +       E+   P   +   + +KK +  ETS
Sbjct: 324  KMK-------LSVESFKERNEEESAMPSNEEQNPHNAKKEECKETS 362



 Score = 38.5 bits (88), Expect(2) = 6e-22
 Identities = 34/90 (37%), Positives = 43/90 (47%), Gaps = 7/90 (7%)
 Frame = +2

Query: 179 SLQCSVQSLPPVNPARNGLTGFVHRKVSHPALTSISSNFLAT-------IRTPDLPIPTK 337
           S QCSV+S PP NP  +G T     KV  PA T ISS    T       +++P+ PIP  
Sbjct: 98  SEQCSVRS-PPENPVPSGSTASGRPKVFRPATTMISSTSSPTETLTCPILQSPNPPIP-- 154

Query: 338 SKTAQHQLPNRSAPQMTRQFMTIKHSKRKT 427
                +  P   AP  +R    I  S+RKT
Sbjct: 155 -----NPYPIPLAPMKSR--FKIGASRRKT 177


>gb|EXB78390.1| hypothetical protein L484_003252 [Morus notabilis]
          Length = 353

 Score =  109 bits (272), Expect = 2e-21
 Identities = 84/253 (33%), Positives = 122/253 (48%), Gaps = 21/253 (8%)
 Frame = +1

Query: 187 MLCSISATGKSS--SKWLDRLRSSKGFPSGADLDLEQFLSNNQNAXXXXXXXXXXXXXXX 360
           MLCS+ A GKS+  S WL R+RS KGFP+G D DL  F++ N N+               
Sbjct: 1   MLCSVPA-GKSAGGSNWLSRIRSIKGFPAGDDDDLGHFITQNLNSSA------------- 46

Query: 361 XXXXXXXNDPAVHDNQTFEAQNPDGDTG---------FFSVVSNVLAELFVMGSSNGLPK 513
                  ++    D Q     N     G         +   +  VL+ELF MG +  +  
Sbjct: 47  -------SESTRLDPQRIAVPNSPEAPGRIRGRVEPEWVGAMDTVLSELFFMGGAGEISS 99

Query: 514 VR--GKKSSRKQPNPKFCASLDCPESNTNLENNSTRTESAA-----RKNESASPMS---D 663
            R  GK+  RKQ NPK CA+     +N N  NNS  + S+      +K    +P +    
Sbjct: 100 SRHSGKRIPRKQTNPKICAA--SASNNNNNNNNSGNSNSSGVVEQKKKGSDFAPKTASLS 157

Query: 664 DNSGVGMAKECSCDKQLKLVEYVNCGDEEEKEKGYMNFSGFSRTEVTVIDSSFATWKFEK 843
            +SG    +E   +  +      +  DE+E EK      G+SR+EVTVID+S  +WK EK
Sbjct: 158 SDSGNNSTREGHGNVDVDF----DVDDEDEDEK---ELKGYSRSEVTVIDTSCGSWKSEK 210

Query: 844 MLFRKKNVWKVRD 882
           ++FR+K+VW+VR+
Sbjct: 211 LVFRRKSVWRVRE 223


>ref|XP_002874202.1| hypothetical protein ARALYDRAFT_326742 [Arabidopsis lyrata subsp.
           lyrata] gi|297320039|gb|EFH50461.1| hypothetical protein
           ARALYDRAFT_326742 [Arabidopsis lyrata subsp. lyrata]
          Length = 305

 Score =  103 bits (256), Expect = 2e-19
 Identities = 82/244 (33%), Positives = 118/244 (48%), Gaps = 12/244 (4%)
 Frame = +1

Query: 187 MLCSISATGKSSSKWLDRLRSSKGFP-------SGADLDLEQFLSNNQNAXXXXXXXXXX 345
           ML SI      +S WL+RLR ++G         SG  L L+ FL  N +           
Sbjct: 1   MLSSIIDDKPVASTWLNRLRLNRGLSTTEDDDASGNPLTLDDFLRRNHHTEITATSSASD 60

Query: 346 XXXXXXXXXXXXNDPAVHDNQTFEAQNPDGDTG-FFSVVSNVLAELFVMGSSNGLPKVRG 522
                       + P   D +  E+ + +   G ++ V+S+VL+ELF  G S+    + G
Sbjct: 61  SPP---------SAPVPSDPELAESPSEEPVPGEWYGVMSDVLSELFNFGGSSKSSTIPG 111

Query: 523 KKS-SRKQPNPKFCASLDCPESNTNLEN---NSTRTESAARKNESASPMSDDNSGVGMAK 690
           KK   RKQ NP+ C SLD P     L N   N      + R+  ++S  S  N      +
Sbjct: 112 KKKLPRKQSNPRHC-SLDTPNDVVPLVNQKSNDANCVPSVREFATSSSRSSYNKKTPAPE 170

Query: 691 ECSCDKQLKLVEYVNCGDEEEKEKGYMNFSGFSRTEVTVIDSSFATWKFEKMLFRKKNVW 870
                + +   E V    +EE+EKG  +  GFSR+EVTVID+SF  WK EK++FR++NVW
Sbjct: 171 IRGRRRSVAEDEDV----DEEEEKGEKDLVGFSRSEVTVIDTSFKIWKSEKLVFRRRNVW 226

Query: 871 KVRD 882
           KVR+
Sbjct: 227 KVRE 230


>ref|XP_006286735.1| hypothetical protein CARUB_v10002983mg [Capsella rubella]
            gi|482555441|gb|EOA19633.1| hypothetical protein
            CARUB_v10002983mg [Capsella rubella]
          Length = 339

 Score =  102 bits (255), Expect = 2e-19
 Identities = 98/323 (30%), Positives = 141/323 (43%), Gaps = 22/323 (6%)
 Frame = +1

Query: 187  MLCSISATGKSSSKWLDRLRSSKGFP-------SGADLDLEQFLSNNQNAXXXXXXXXXX 345
            ML SI       S WL+RLR ++G         SG  L L+ FL  N +           
Sbjct: 1    MLSSIIDDKPVGSSWLNRLRLNRGLTTTEYDDASGNPLTLDDFLRRNHHTEITGDSASDS 60

Query: 346  XXXXXXXXXXXXNDPAVHDNQTFEAQNPDGDTGFFSVVSNVLAELFVM---GSSNGLPKV 516
                        +DP + ++   E  NP     ++ V+S+VL+ELF     GS++    +
Sbjct: 61   PPSAPIP-----SDPELAESP-LEEPNPGE---WYGVMSDVLSELFNFDGGGSASKSSTI 111

Query: 517  RGKKS-SRKQPNPKFCASLDCPESNTNLENNSTRTES---AARKNESASPMSDDNSGVGM 684
             GKK   RKQ NP+ C SL+ P+    L N      +   + R+  ++S  S  N     
Sbjct: 112  PGKKKLPRKQSNPRHC-SLETPQDVAPLVNTKISDANCVPSVREFATSSSRSSYNKKPP- 169

Query: 685  AKECSCDKQLKLVEYVNCGDEEEKEKGYMNFSGFSRTEVTVIDSSFATWKFEKMLFRKKN 864
            A E    ++  + E    G +EE+EKG  +  GFSR+EVTVID+SF  WK EK++FR++N
Sbjct: 170  APEIRERRRSVVAEEGEEGVDEEEEKGEKDLVGFSRSEVTVIDTSFKVWKSEKLVFRRRN 229

Query: 865  VWKVRD--------XXXXXXXXXXXXRKASASFDDQISDGTEEKKQKFLHGRCSLSKKGC 1020
            VWKVRD                    +K     DD   DG   KK K +    S+     
Sbjct: 230  VWKVRDKKGKSKIVSKTKKMMMKKKMKKKRKCDDDDDGDGEIAKKSKKMKSSISVPDNVS 289

Query: 1021 GGAPHEDYHQPGKSDMYSNLSKK 1089
                 E   +P  S++   L  K
Sbjct: 290  INYVEEINDEPESSNVSRRLPSK 312


>ref|NP_197838.2| uncharacterized protein [Arabidopsis thaliana]
            gi|28973694|gb|AAO64164.1| unknown protein [Arabidopsis
            thaliana] gi|29824259|gb|AAP04090.1| unknown protein
            [Arabidopsis thaliana] gi|110736861|dbj|BAF00388.1|
            hypothetical protein [Arabidopsis thaliana]
            gi|332005934|gb|AED93317.1| uncharacterized protein
            AT5G24500 [Arabidopsis thaliana]
          Length = 334

 Score =  102 bits (254), Expect = 3e-19
 Identities = 101/327 (30%), Positives = 149/327 (45%), Gaps = 21/327 (6%)
 Frame = +1

Query: 187  MLCSISATGKSSSKWLDRLRSSKGFP------SGADLDLEQFLSNNQNAXXXXXXXXXXX 348
            ML SI     +SS WL+RLR ++G        SG  L L+ FL  N +            
Sbjct: 1    MLSSIIDDKPASSTWLNRLRLNRGLTTDDDDASGNPLTLDDFLRRNHHTEIAATSSASDS 60

Query: 349  XXXXXXXXXXXNDPAVHDNQTFEAQNPDGDTG-FFSVVSNVLAELFVMGSSNGLPKVRGK 525
                       + P   D +  E+ + +   G ++ V+S+VL ELF    S+    + GK
Sbjct: 61   PP---------SAPIPSDPELAESPSEEPVPGEWYGVMSDVLFELFNFSGSSKSSTIPGK 111

Query: 526  KS-SRKQPNPKFCASLDCPESNT----NLENNSTRTESAARKNESASPMSDDNSGVGMAK 690
            K   RKQ NP+ C SL+ PE       N +++      + R+  ++S  S  N     A 
Sbjct: 112  KKLPRKQSNPRHC-SLETPEDVVVPLVNQKSDDANCLPSVREFATSSSRSSYNKKPP-AP 169

Query: 691  ECSCDKQLKLVEYVNCGDEEEKEKGYMNFSGFSRTEVTVIDSSFATWKFEKMLFRKKNVW 870
            E   +++  +VE    G +EE+EKG  +  GFSR+EVTVID+SF  WK EK++FR++NVW
Sbjct: 170  EIR-ERRRSVVE--GDGVDEEEEKGEKDLVGFSRSEVTVIDTSFKIWKSEKLVFRRRNVW 226

Query: 871  KVRD------XXXXXXXXXXXXRKASASFDDQISD--GTEEKKQKFLHGRCSLSKKGCGG 1026
            KVR+                  +K     DD   D  G   KK K +    S+S      
Sbjct: 227  KVREKKGKSRVVSKLKKLMKKKKKKKRKCDDVDDDDGGIARKKSKKMKISTSVSDNNPRY 286

Query: 1027 APHEDYHQPGKSDMYSN-LSKKNQETS 1104
               E + +P  S++    LSK  +E S
Sbjct: 287  NVEEIHDEPESSNVSRRLLSKPRKEGS 313


>dbj|BAB11202.1| unnamed protein product [Arabidopsis thaliana]
          Length = 306

 Score =  101 bits (252), Expect = 5e-19
 Identities = 83/244 (34%), Positives = 123/244 (50%), Gaps = 12/244 (4%)
 Frame = +1

Query: 187 MLCSISATGKSSSKWLDRLRSSKGFP------SGADLDLEQFLSNNQNAXXXXXXXXXXX 348
           ML SI     +SS WL+RLR ++G        SG  L L+ FL  N +            
Sbjct: 1   MLSSIIDDKPASSTWLNRLRLNRGLTTDDDDASGNPLTLDDFLRRNHHTEIAATSSASDS 60

Query: 349 XXXXXXXXXXXNDPAVHDNQTFEAQNPDGDTG-FFSVVSNVLAELFVMGSSNGLPKVRGK 525
                      + P   D +  E+ + +   G ++ V+S+VL ELF    S+    + GK
Sbjct: 61  PP---------SAPIPSDPELAESPSEEPVPGEWYGVMSDVLFELFNFSGSSKSSTIPGK 111

Query: 526 KS-SRKQPNPKFCASLDCPESNT----NLENNSTRTESAARKNESASPMSDDNSGVGMAK 690
           K   RKQ NP+ C SL+ PE       N +++      + R+  ++S  S  N     A 
Sbjct: 112 KKLPRKQSNPRHC-SLETPEDVVVPLVNQKSDDANCLPSVREFATSSSRSSYNKKPP-AP 169

Query: 691 ECSCDKQLKLVEYVNCGDEEEKEKGYMNFSGFSRTEVTVIDSSFATWKFEKMLFRKKNVW 870
           E   +++  +VE    G +EE+EKG  +  GFSR+EVTVID+SF  WK EK++FR++NVW
Sbjct: 170 EIR-ERRRSVVE--GDGVDEEEEKGEKDLVGFSRSEVTVIDTSFKIWKSEKLVFRRRNVW 226

Query: 871 KVRD 882
           KVR+
Sbjct: 227 KVRE 230


>ref|XP_006394704.1| hypothetical protein EUTSA_v10005511mg [Eutrema salsugineum]
           gi|557091343|gb|ESQ31990.1| hypothetical protein
           EUTSA_v10005511mg [Eutrema salsugineum]
          Length = 332

 Score =  100 bits (249), Expect = 1e-18
 Identities = 76/235 (32%), Positives = 113/235 (48%), Gaps = 14/235 (5%)
 Frame = +1

Query: 220 SSKWLDRLRSSKGFP-------SGADLDLEQFLSNNQNAXXXXXXXXXXXXXXXXXXXXX 378
           +S WLDRLR S+G         SG  L L+ FL  N +                      
Sbjct: 13  ASTWLDRLRLSRGLSTTDDDDASGNPLSLDDFLRRNYH-------------NEITGDPAS 59

Query: 379 XNDPAVHDNQTFEAQ----NPDGDTGFFSVVSNVLAELFVMGSSNGLPKVRGKKSSRKQP 546
            + P+       E      +P+    ++ V+S+VL+ELF  G S+    + GKK  RKQ 
Sbjct: 60  DSPPSAPILSALELPEIPLDPNPGEEWYGVMSDVLSELFNFGGSSRSSTIPGKKLPRKQS 119

Query: 547 NPKFCAS---LDCPESNTNLENNSTRTESAARKNESASPMSDDNSGVGMAKECSCDKQLK 717
           NP+ C+     D P  N   ++N       AR+  ++S  S +       K+ + +K+ +
Sbjct: 120 NPRHCSVETLADVPLLNQKRDSNCL---PGAREFATSSRSSYN-------KKPAPEKRER 169

Query: 718 LVEYVNCGDEEEKEKGYMNFSGFSRTEVTVIDSSFATWKFEKMLFRKKNVWKVRD 882
                     EE+E+G  +  GFSR+EVTVID+SF  WK EK++FR++NVWKVRD
Sbjct: 170 RRSVAEADGVEEEERGEKDLVGFSRSEVTVIDTSFKIWKSEKLVFRRRNVWKVRD 224


>ref|XP_006374085.1| hypothetical protein POPTR_0015s00740g [Populus trichocarpa]
            gi|550321689|gb|ERP51882.1| hypothetical protein
            POPTR_0015s00740g [Populus trichocarpa]
          Length = 383

 Score =  100 bits (249), Expect = 1e-18
 Identities = 94/299 (31%), Positives = 130/299 (43%), Gaps = 14/299 (4%)
 Frame = +1

Query: 187  MLCSISATGKSSSKWLDRLRSSKGFPSGADLDLEQFLSNNQNAXXXXXXXXXXXXXXXXX 366
            MLCS+  T KS S WLDRL S+KGF +  D D       N ++                 
Sbjct: 44   MLCSVK-TSKSGSNWLDRLWSNKGFSNNDDDDPSV---PNPSSSPITDASNSVINSNSES 99

Query: 367  XXXXXNDPAVHDNQTFEAQNPDGDTGFFSVVSNVLAELFVMGSSN----GLPKVRGKKSS 534
                 +   V    T E  + D    FF +++NVL++LF MG  +    G  +   KK  
Sbjct: 100  THSESDQNKVTTTTTREISSSDNKDLFF-LMNNVLSDLFNMGGCSDPIEGSSRHSRKKER 158

Query: 535  --RKQPNPKFCASLDCPESNTNLENNSTRTESAARKNESA-----SPMSDDNSGVGMAKE 693
              RKQ  PKFC       SN +L+          RK+E+      S  SD NS      +
Sbjct: 159  IPRKQTKPKFCFVSGNNSSNDSLD--------CVRKDENVLVATGSLNSDKNSN---NVD 207

Query: 694  CSCDKQLKLVEYVNCGDEEEKE---KGYMNFSGFSRTEVTVIDSSFATWKFEKMLFRKKN 864
            C  D   +  E  +  +E+ K     G     G+SR+EVTVID+S   WKF+K++FRKKN
Sbjct: 208  CGVDDDDEEEEEEDVEEEKGKAFGVSGDKELKGYSRSEVTVIDTSCLVWKFDKLVFRKKN 267

Query: 865  VWKVRDXXXXXXXXXXXXRKASASFDDQISDGTEEKKQKFLHGRCSLSKKGCGGAPHED 1041
            VWKVRD            RK     D + ++G   KK+  +      S K     P ++
Sbjct: 268  VWKVRDKKGKSWVSGSKKRKV---IDLESANGNGAKKKAKVSNLEVGSSKDANDKPEDE 323


>ref|XP_002321364.2| hypothetical protein POPTR_0015s00740g [Populus trichocarpa]
           gi|550321690|gb|EEF05491.2| hypothetical protein
           POPTR_0015s00740g [Populus trichocarpa]
          Length = 385

 Score =  100 bits (248), Expect = 1e-18
 Identities = 91/278 (32%), Positives = 124/278 (44%), Gaps = 14/278 (5%)
 Frame = +1

Query: 187 MLCSISATGKSSSKWLDRLRSSKGFPSGADLDLEQFLSNNQNAXXXXXXXXXXXXXXXXX 366
           MLCS+  T KS S WLDRL S+KGF +  D D       N ++                 
Sbjct: 44  MLCSVK-TSKSGSNWLDRLWSNKGFSNNDDDDPSV---PNPSSSPITDASNSVINSNSES 99

Query: 367 XXXXXNDPAVHDNQTFEAQNPDGDTGFFSVVSNVLAELFVMGSSN----GLPKVRGKKSS 534
                +   V    T E  + D    FF +++NVL++LF MG  +    G  +   KK  
Sbjct: 100 THSESDQNKVTTTTTREISSSDNKDLFF-LMNNVLSDLFNMGGCSDPIEGSSRHSRKKER 158

Query: 535 --RKQPNPKFCASLDCPESNTNLENNSTRTESAARKNESA-----SPMSDDNSGVGMAKE 693
             RKQ  PKFC       SN +L+          RK+E+      S  SD NS      +
Sbjct: 159 IPRKQTKPKFCFVSGNNSSNDSLD--------CVRKDENVLVATGSLNSDKNSN---NVD 207

Query: 694 CSCDKQLKLVEYVNCGDEEEKE---KGYMNFSGFSRTEVTVIDSSFATWKFEKMLFRKKN 864
           C  D   +  E  +  +E+ K     G     G+SR+EVTVID+S   WKF+K++FRKKN
Sbjct: 208 CGVDDDDEEEEEEDVEEEKGKAFGVSGDKELKGYSRSEVTVIDTSCLVWKFDKLVFRKKN 267

Query: 865 VWKVRDXXXXXXXXXXXXRKASASFDDQISDGTEEKKQ 978
           VWKVRD            RK     D + ++G   KK+
Sbjct: 268 VWKVRDKKGKSWVSGSKKRKV---IDLESANGNGAKKK 302

