BLASTX nr result

ID: Catharanthus23_contig00018938 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00018938
         (1387 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004245789.1| PREDICTED: uncharacterized protein LOC101263...   110   1e-21
ref|XP_006359254.1| PREDICTED: uncharacterized protein LOC102604...   107   9e-21
ref|XP_004152391.1| PREDICTED: uncharacterized protein LOC101222...   102   5e-19
ref|XP_003549926.1| PREDICTED: uncharacterized protein LOC100812...   100   2e-18
gb|EXC02099.1| hypothetical protein L484_024064 [Morus notabilis]      99   4e-18
ref|XP_006477140.1| PREDICTED: uncharacterized protein LOC102618...    99   5e-18
ref|XP_003608674.1| hypothetical protein MTR_4g100570 [Medicago ...    97   1e-17
gb|ESW27685.1| hypothetical protein PHAVU_003G223000g, partial [...    97   2e-17
ref|XP_006440252.1| hypothetical protein CICLE_v10022000mg [Citr...    96   3e-17
ref|XP_004299406.1| PREDICTED: uncharacterized protein LOC101293...    96   5e-17
ref|XP_004508940.1| PREDICTED: uncharacterized protein LOC101492...    94   1e-16
gb|EOY24267.1| Uncharacterized protein isoform 4 [Theobroma cacao]     94   2e-16
gb|EOY24264.1| Uncharacterized protein isoform 1 [Theobroma caca...    94   2e-16
gb|EMJ10735.1| hypothetical protein PRUPE_ppa010718mg [Prunus pe...    93   2e-16
ref|XP_003525577.1| PREDICTED: histone-lysine N-methyltransferas...    93   3e-16
ref|XP_003635203.1| PREDICTED: uncharacterized protein LOC100853...    91   1e-15
ref|XP_002531462.1| conserved hypothetical protein [Ricinus comm...    84   1e-13
gb|ABD65177.1| hypothetical protein 40.t00065 [Brassica oleracea]      82   5e-13
ref|XP_006440253.1| hypothetical protein CICLE_v10022000mg [Citr...    75   9e-11
ref|XP_006285401.1| hypothetical protein CARUB_v10006806mg, part...    74   2e-10

>ref|XP_004245789.1| PREDICTED: uncharacterized protein LOC101263341 isoform 1 [Solanum
            lycopersicum] gi|460400536|ref|XP_004245790.1| PREDICTED:
            uncharacterized protein LOC101263341 isoform 2 [Solanum
            lycopersicum]
          Length = 219

 Score =  110 bits (276), Expect = 1e-21
 Identities = 85/272 (31%), Positives = 113/272 (41%), Gaps = 4/272 (1%)
 Frame = +3

Query: 210  MGTAPVKSQPLHNFDLPHLRWXXXXXXXXXXXXXXRFRRRGSPPVLHYLVNNXXXXXXXX 389
            M TAPVKSQPLH F LP L+W              RFRRR SPP                
Sbjct: 1    MATAPVKSQPLHYFSLPQLKWGNKSNTNANH----RFRRRDSPP---------------S 41

Query: 390  XXRAATAVAESDPDNDDTSNAPQVVKTSIRTPRKPFSFAACSSQRKEASKXXXXXXXXXX 569
                 T  A+ D  +D     P+           P   ++   + +   K          
Sbjct: 42   NGDNPTQTADVDGGSDSEKVQPRS-----EAEADPNGVSSLQGREEHEEKVKEEEEEEVG 96

Query: 570  XXXXXXKPWNLRPRRFVTLPAASFKKGEKM----SDEIVLYQRNDNSSSAGGCGPSSKPI 737
                  K WNLRPRR VT    +  K  +M    S+ +   QR  +++   G G   K  
Sbjct: 97   CEEGEVKLWNLRPRRGVTKVETTSLKNVEMRVESSNHMQRSQRLKDNADGNGVGSGKK-- 154

Query: 738  RFKVPAGVEVNGGPNLGSERQXXXXXXXXXXLWISLSREEIEEDIYSLTGSXXXXXXXXX 917
                            G ++           LWISLSREEIEED+YS+TGS         
Sbjct: 155  ----------------GKKK-----------LWISLSREEIEEDVYSMTGSRPARRPKKR 187

Query: 918  XXTVQKQMDTVFPGLFLVGMNVDSYRVHESLR 1013
              T+QKQ+D VFPGL+LVG+  DS+RV+++ +
Sbjct: 188  SKTIQKQLDNVFPGLYLVGVTADSFRVNDTTK 219


>ref|XP_006359254.1| PREDICTED: uncharacterized protein LOC102604791 [Solanum tuberosum]
          Length = 220

 Score =  107 bits (268), Expect = 9e-21
 Identities = 87/272 (31%), Positives = 111/272 (40%), Gaps = 4/272 (1%)
 Frame = +3

Query: 210  MGTAPVKSQPLHNFDLPHLRWXXXXXXXXXXXXXXRFRRRGSPPVLHYLVNNXXXXXXXX 389
            M  APVKSQPLH F LP L+W              RFRRR SPP                
Sbjct: 1    MAAAPVKSQPLHYFSLPQLKWGNKSHTNANH----RFRRRDSPP---------------- 40

Query: 390  XXRAATAVAESDPDNDDTSNAPQVVKTSIRTPRKPFSFAACSSQRKEASKXXXXXXXXXX 569
                      +D D    S   Q    +   P    S        KE  +          
Sbjct: 41   -SNGDNPPQTADVDGGSDSEKVQPRSEAEADPNGVSSLQGEDEHEKEVKEEEEEEEVGCE 99

Query: 570  XXXXXXKPWNLRPRRFVT-LPAASFKKGE---KMSDEIVLYQRNDNSSSAGGCGPSSKPI 737
                  K WNLRPRR VT +  AS K  E   + S+ +   QR  +++   G G   K  
Sbjct: 100  EGEV--KLWNLRPRRGVTKVETASLKNVEMRVESSNHMQRSQRLKDNADGNGVGSGKK-- 155

Query: 738  RFKVPAGVEVNGGPNLGSERQXXXXXXXXXXLWISLSREEIEEDIYSLTGSXXXXXXXXX 917
                            G ++           LWISLSREEIEED+YS+TGS         
Sbjct: 156  ----------------GKKK-----------LWISLSREEIEEDVYSMTGSRPARRPKKR 188

Query: 918  XXTVQKQMDTVFPGLFLVGMNVDSYRVHESLR 1013
              T+QKQ+D VFPGL+LVG+  DS+RV+++ +
Sbjct: 189  SKTIQKQLDNVFPGLYLVGLTADSFRVNDTTK 220


>ref|XP_004152391.1| PREDICTED: uncharacterized protein LOC101222282 [Cucumis sativus]
            gi|449488652|ref|XP_004158130.1| PREDICTED:
            uncharacterized LOC101222282 [Cucumis sativus]
          Length = 246

 Score =  102 bits (253), Expect = 5e-19
 Identities = 85/277 (30%), Positives = 112/277 (40%), Gaps = 11/277 (3%)
 Frame = +3

Query: 210  MGTAPVKSQPLHNFDLPHLRWXXXXXXXXXXXXXXRFRRRGSPPVLHYLVNNXXXXXXXX 389
            M T PVKSQPLHNF LP L+W              R RR                     
Sbjct: 1    MATGPVKSQPLHNFALPFLKWGGKNQTNSNH----RIRRA----------------IGGG 40

Query: 390  XXRAATAVAESDPDNDDTSNAPQVVKTSIRTPRKPFSFAACSSQRK-----------EAS 536
               ++ AV  S+P+++  S  PQ+ +   RT R   +F+ CS   K           E  
Sbjct: 41   GGDSSPAVDHSEPESEADSK-PQL-RVGSRTVRNRLAFSPCSLGDKFAKHSEGEVGDEVV 98

Query: 537  KXXXXXXXXXXXXXXXXKPWNLRPRRFVTLPAASFKKGEKMSDEIVLYQRNDNSSSAGGC 716
            K                KPWNLRPR+  +L      K      E+     +   +S  G 
Sbjct: 99   KEQKREGEEVEGEEIVQKPWNLRPRKGTSLRGYGDLKNGGDLQEMDGAVSSAAGASQQGE 158

Query: 717  GPSSKPIRFKVPAGVEVNGGPNLGSERQXXXXXXXXXXLWISLSREEIEEDIYSLTGSXX 896
             P  K +R +             G               WI+LSR+EIEEDI+ +TGS  
Sbjct: 159  NPQPKSLRLR-------------GFTESHRIEKKDKRKFWIALSRDEIEEDIFIMTGSRP 205

Query: 897  XXXXXXXXXTVQKQMDTVFPGLFLVGMNVDSYRVHES 1007
                      VQKQ+DTVFPGL+LVG+  DSYR+ +S
Sbjct: 206  SRRPKKRPKNVQKQLDTVFPGLWLVGVTADSYRLADS 242


>ref|XP_003549926.1| PREDICTED: uncharacterized protein LOC100812835 isoform X1 [Glycine
            max] gi|571536516|ref|XP_006600845.1| PREDICTED:
            uncharacterized protein LOC100812835 isoform X2 [Glycine
            max]
          Length = 237

 Score =  100 bits (248), Expect = 2e-18
 Identities = 85/270 (31%), Positives = 106/270 (39%), Gaps = 7/270 (2%)
 Frame = +3

Query: 219  APVKSQPLHNFDLPHLRWXXXXXXXXXXXXXX--RFRRRGSPPVLHYLVNNXXXXXXXXX 392
            APVKSQPLHNF LP L+W                RFRR    P  H              
Sbjct: 8    APVKSQPLHNFALPFLKWGASGKNNTTTTAAHHHRFRR----PSDH-------------- 49

Query: 393  XRAATAVAESDPDNDDTSNAPQVVKTSIRTPRKPFSFAACSSQRKEASKXXXXXXXXXXX 572
                     S+PD+ D  + P   +   RT R  FS      +                 
Sbjct: 50   --------ASEPDSSDPDSRPH--RLGSRTARNRFSLPL---KPPPPPPPQLHEAEHDDA 96

Query: 573  XXXXXKPWNLRPRRFVTLPAASFKKGEKMSDEIVLYQRNDNSSSAGGCG-----PSSKPI 737
                 KPWNLRPR+   LP A+ + G   S        N      GG G     P+ K +
Sbjct: 97   DDAVQKPWNLRPRKPALLPKAALEIGTGPSRNHHHATNNGEFHDGGGGGGDNNNPAPKSL 156

Query: 738  RFKVPAGVEVNGGPNLGSERQXXXXXXXXXXLWISLSREEIEEDIYSLTGSXXXXXXXXX 917
            R +             G               WI+LSREEIEEDI+ +TGS         
Sbjct: 157  RLR-------------GFSDTPCSVKKEKRKFWIALSREEIEEDIFVMTGSRPARRPRKR 203

Query: 918  XXTVQKQMDTVFPGLFLVGMNVDSYRVHES 1007
               VQKQMD+VFPGL+LVG+  D+YRV ++
Sbjct: 204  PKNVQKQMDSVFPGLWLVGITADAYRVADT 233


>gb|EXC02099.1| hypothetical protein L484_024064 [Morus notabilis]
          Length = 268

 Score = 99.0 bits (245), Expect = 4e-18
 Identities = 85/279 (30%), Positives = 117/279 (41%), Gaps = 16/279 (5%)
 Frame = +3

Query: 210 MGTAPVKSQPLHNFDLPHLRWXXXXXXXXXXXXXXRFRRRGSPPVLHYLVNNXXXXXXXX 389
           M TAPVKS PLHNF LP L+W              R     S PV  +            
Sbjct: 1   MATAPVKS-PLHNFPLPFLKWGGGKNHASGSHRCRRTISADSSPVADHC----------- 48

Query: 390 XXRAATAVAESDPDNDDTSNAPQVVKTSIRTPRKPFS--FAACS--SQRKEASKXXXXXX 557
                   AE + +    +   +  +   RT R  F+  FA+CS  S++KE+ +      
Sbjct: 49  ------DAAEQERNESSEAEPNRFHRVGSRTVRNRFAAPFASCSLVSEKKESDEVAAGEG 102

Query: 558 XXXXXXXXXX----------KPWNLRPRRFVTLPAAS--FKKGEKMSDEIVLYQRNDNSS 701
                               KPWNLRPR+ +   AA+   K GE    E          +
Sbjct: 103 KEGDDREVEAAAGEEEMMVQKPWNLRPRKALFSKAATNGAKSGELPEQE----------N 152

Query: 702 SAGGCGPSSKPIRFKVPAGVEVNGGPNLGSERQXXXXXXXXXXLWISLSREEIEEDIYSL 881
           +  G G  S+ +  + P  + + G   L   +Q           WI+LSREEIEEDI+ +
Sbjct: 153 AVAGGGHQSENLNQQPPKSMRLRG---LSESQQSSEKEKRK--FWIALSREEIEEDIFVM 207

Query: 882 TGSXXXXXXXXXXXTVQKQMDTVFPGLFLVGMNVDSYRV 998
           TGS            VQKQ+D VFPGL+LVG+  D+YR+
Sbjct: 208 TGSRPARRPRKRPKNVQKQLDAVFPGLWLVGITADAYRI 246


>ref|XP_006477140.1| PREDICTED: uncharacterized protein LOC102618144 isoform X1 [Citrus
            sinensis]
          Length = 216

 Score = 98.6 bits (244), Expect = 5e-18
 Identities = 87/274 (31%), Positives = 109/274 (39%), Gaps = 8/274 (2%)
 Frame = +3

Query: 210  MGTAPVKSQPLHNFDLPHLRWXXXXXXXXXXXXXXRFRRRGSPPVLHYLVNNXXXXXXXX 389
            M TAP+KSQPLHNF L  L+W                R R  PP                
Sbjct: 1    MTTAPMKSQPLHNFSLSFLKWGTHHPNPNHN------RTRTPPPT--------------- 39

Query: 390  XXRAATAVAESDPDNDDTSNAPQVVKTSIRTPR--------KPFSFAACSSQRKEASKXX 545
                     E D  +D T +   V   S R  R        KP   A   SQR+ A    
Sbjct: 40   ---------EPDTTDDSTRHHRVVGSRSSRAQRLSFPCSTSKPHQDAGDRSQRQTADTEE 90

Query: 546  XXXXXXXXXXXXXXKPWNLRPRRFVTLPAASFKKGEKMSDEIVLYQRNDNSSSAGGCGPS 725
                          +PWNLRPR          K  E + D  V   R DN+++     P 
Sbjct: 91   EEEDEVG-------RPWNLRPR----------KVQETLVDVAVFQNRGDNNANTKA--PK 131

Query: 726  SKPIRFKVPAGVEVNGGPNLGSERQXXXXXXXXXXLWISLSREEIEEDIYSLTGSXXXXX 905
            S  +R  V    E  G      E+            W++LSREEIEEDI+ +TGS     
Sbjct: 132  STRLREMV----ESRGSNGDKKEKNK---------FWVTLSREEIEEDIFIMTGSRPARR 178

Query: 906  XXXXXXTVQKQMDTVFPGLFLVGMNVDSYRVHES 1007
                   VQKQ+D VFPGL+LVG+ VD+YRV ++
Sbjct: 179  PRKRPKNVQKQLDNVFPGLWLVGLTVDAYRVSDA 212


>ref|XP_003608674.1| hypothetical protein MTR_4g100570 [Medicago truncatula]
            gi|355509729|gb|AES90871.1| hypothetical protein
            MTR_4g100570 [Medicago truncatula]
          Length = 243

 Score = 97.4 bits (241), Expect = 1e-17
 Identities = 88/289 (30%), Positives = 117/289 (40%), Gaps = 23/289 (7%)
 Frame = +3

Query: 210  MGTAP--VKSQPLHNFDLPHLRWXXXXXXXXXXXXXXRFRRRGSPPVLHYLVNNXXXXXX 383
            M T P  VKSQPLHNF LP L+W              R RR    P  H           
Sbjct: 1    MATTPASVKSQPLHNFSLPFLKWGGTGKNNTNATNHHRSRR----PPDH----------- 45

Query: 384  XXXXRAATAVAESDPDNDDTSNAPQVVKTSIRTPRKPFSFAACSSQRK------------ 527
                        S+PD++  S   ++     RT R  F FA+ SSQR+            
Sbjct: 46   -----------ASEPDSEPDSRPHRL---GSRTARNRFGFASSSSQRQAPPTPSSNNETD 91

Query: 528  -----EASKXXXXXXXXXXXXXXXXKPWNLRPRRFVTLPAASFKKGEKMSDEIVLYQRND 692
                                     KPWNLRPR+ + +P   F+ G   S       RN+
Sbjct: 92   DNAGDRKRDAEDDAEAGGGAEEIVQKPWNLRPRKPM-IPRGGFEIGAGGS-------RNN 143

Query: 693  NSSS----AGGCGPSSKPIRFKVPAGVEVNGGPNLGSERQXXXXXXXXXXLWISLSREEI 860
            N         G  P+ K +R +  A        N G +++           WI+LS++EI
Sbjct: 144  NGGELQEGVNGENPAPKSLRLRGFADT------NCGEKKEKRK-------FWIALSKDEI 190

Query: 861  EEDIYSLTGSXXXXXXXXXXXTVQKQMDTVFPGLFLVGMNVDSYRVHES 1007
            EEDI+ +TGS            VQKQMD VFPGL+LVG+  D+YRV ++
Sbjct: 191  EEDIFVMTGSRPNRRPRKRAKNVQKQMDNVFPGLWLVGITADAYRVADT 239


>gb|ESW27685.1| hypothetical protein PHAVU_003G223000g, partial [Phaseolus vulgaris]
            gi|561029046|gb|ESW27686.1| hypothetical protein
            PHAVU_003G223000g, partial [Phaseolus vulgaris]
          Length = 306

 Score = 97.1 bits (240), Expect = 2e-17
 Identities = 89/294 (30%), Positives = 119/294 (40%), Gaps = 26/294 (8%)
 Frame = +3

Query: 204  FLMGTAP----VKSQPLHNFDLPHLRWXXXXXXXXXXXXXXRFRRRGSPPVLHYLVNNXX 371
            F M TAP    VKSQPLHNF LP L+W              R RR  S    H       
Sbjct: 55   FSMATAPAQPPVKSQPLHNFALPFLKWGASGKNHTNAAHHHRCRRPSSLSSDH------- 107

Query: 372  XXXXXXXXRAATAVAESDPDNDDTSNAPQVVKTSIRTPRKPFSFAACSSQR--------- 524
                            S+PD+D  S   +V     RT R  F+   CS +          
Sbjct: 108  ---------------ASEPDSDPDSRPHRV---GSRTTRNRFALPTCSLKPLPPPPEPPQ 149

Query: 525  ----KEASKXXXXXXXXXXXXXXXXKPWNLRPRRFVTLPAASFKKGEKMSDEIVLYQRND 692
                 + +                 KPWNLRPR+   LP ++ + G   S       RN 
Sbjct: 150  PPSCNDETDDEAAKRDIEDAEEAVQKPWNLRPRK-PALPKSALEIGTGPS-------RNH 201

Query: 693  NSSSAG---------GCGPSSKPIRFKVPAGVEVNGGPNLGSERQXXXXXXXXXXLWISL 845
             ++  G         G  P+ K +R +  A  +        +E++           WI+L
Sbjct: 202  ANNGVGEFHDGVSHHGENPAPKSLRLRGFADTQC-------AEKKEKRK------FWIAL 248

Query: 846  SREEIEEDIYSLTGSXXXXXXXXXXXTVQKQMDTVFPGLFLVGMNVDSYRVHES 1007
            SREEIEEDI+ +TGS            VQKQMD+VFPGL+LVG+  D+YRV ++
Sbjct: 249  SREEIEEDIFVMTGSRPARRPRKRPKNVQKQMDSVFPGLWLVGITADAYRVPDT 302


>ref|XP_006440252.1| hypothetical protein CICLE_v10022000mg [Citrus clementina]
            gi|557542514|gb|ESR53492.1| hypothetical protein
            CICLE_v10022000mg [Citrus clementina]
          Length = 216

 Score = 96.3 bits (238), Expect = 3e-17
 Identities = 81/267 (30%), Positives = 105/267 (39%), Gaps = 1/267 (0%)
 Frame = +3

Query: 210  MGTAPVKSQPLHNFDLPHLRWXXXXXXXXXXXXXXRFRRRGSPPVLHYLVNNXXXXXXXX 389
            M TAP+KSQPLHNF L  L+W                R R  PP                
Sbjct: 1    MTTAPMKSQPLHNFSLSFLKWGTHHPNPNHN------RTRTPPPT--------------- 39

Query: 390  XXRAATAVAESDPDNDDTSNAPQVVKTSIRTPRKPFSFAACSSQRKEASKXXXXXXXXXX 569
                     E D  +D T +   V   S R  R  F  +    Q+    +          
Sbjct: 40   ---------EPDTTDDSTRHHRVVGSRSSRAQRLSFPSSTSKPQQDAVERPQRQTADTEE 90

Query: 570  XXXXXX-KPWNLRPRRFVTLPAASFKKGEKMSDEIVLYQRNDNSSSAGGCGPSSKPIRFK 746
                   +PWNLRPR          K  E + D  V   R DN+++     P S  +R  
Sbjct: 91   EEEDEVGRPWNLRPR----------KVQETLVDVAVFQNRGDNNANTKA--PKSTRLREM 138

Query: 747  VPAGVEVNGGPNLGSERQXXXXXXXXXXLWISLSREEIEEDIYSLTGSXXXXXXXXXXXT 926
            V    E  G      E+            W++LSREEIEEDI+ +TGS            
Sbjct: 139  V----ESRGSNGDKKEKNK---------FWVTLSREEIEEDIFIMTGSRPARRPRKRPKN 185

Query: 927  VQKQMDTVFPGLFLVGMNVDSYRVHES 1007
            VQKQ+D VFPGL+LVG+  D+YRV ++
Sbjct: 186  VQKQLDNVFPGLWLVGLTADAYRVSDA 212


>ref|XP_004299406.1| PREDICTED: uncharacterized protein LOC101293977 [Fragaria vesca
            subsp. vesca]
          Length = 239

 Score = 95.5 bits (236), Expect = 5e-17
 Identities = 86/282 (30%), Positives = 118/282 (41%), Gaps = 16/282 (5%)
 Frame = +3

Query: 210  MGTAPVKSQPLHNFDLPHLRWXXXXXXXXXXXXXXRFRRRGSPPVLHYLVNNXXXXXXXX 389
            M TAPVK  PLHNF L  L+W              R+RR    PV               
Sbjct: 1    MATAPVKP-PLHNFPLSFLKWGSKNHTNTNH----RYRR----PV--------------- 36

Query: 390  XXRAATAVAESDPDNDDTSNAPQVVKTSIRTPRKPFSFAACSS---QRKEAS-------- 536
               +A     +D D +D+ + PQ  +   RT R  FS A+CS    QR E +        
Sbjct: 37   ---SAEPEPSADDDRNDSESPPQHHRVGSRTARHRFSLASCSEKLPQRNEKASEESDDDV 93

Query: 537  ----KXXXXXXXXXXXXXXXXKPWNLRPRRFVTLPAASFKKGEKMSDEIVLYQRNDNSSS 704
                K                KPWNLRPRR     A +   GE       +++      S
Sbjct: 94   DDDAKAAAVAAVAAAEEAEVQKPWNLRPRRAPVTKANNNTGGE-------VHEAEGTKQS 146

Query: 705  AGGCGPSSKPIRFK-VPAGVEVNGGPNLGSERQXXXXXXXXXXLWISLSREEIEEDIYSL 881
                 P+ K +R + + A  E   GP++  +++           WI+LS++EIEEDI+ +
Sbjct: 147  EQ---PAPKSMRLRGLAAAAE---GPSMEKKKEKRK-------FWIALSKDEIEEDIFIM 193

Query: 882  TGSXXXXXXXXXXXTVQKQMDTVFPGLFLVGMNVDSYRVHES 1007
            TGS            VQKQ+D  FPGL+LVG   D+YR  +S
Sbjct: 194  TGSRPARRPKKRPKNVQKQLDNCFPGLWLVGFTADAYRGSDS 235


>ref|XP_004508940.1| PREDICTED: uncharacterized protein LOC101492028 [Cicer arietinum]
          Length = 242

 Score = 94.0 bits (232), Expect = 1e-16
 Identities = 82/282 (29%), Positives = 112/282 (39%), Gaps = 19/282 (6%)
 Frame = +3

Query: 219  APVKSQPLHNFDLPHLRWXXXXXXXXXXXXXXRFRRRGSPPVLHYLVNNXXXXXXXXXXR 398
            APVKSQPLHNF LP L+W              R RR    P  H                
Sbjct: 6    APVKSQPLHNFSLPFLKWGGTGKNHTNSNNHQRSRR----PPDH---------------- 45

Query: 399  AATAVAESDPDNDDTSNAPQVVKTSIRTPRKPFSFAACSSQRK------------EASKX 542
                 A  +PD++  S   ++     RT R  F   + SS  +            +A   
Sbjct: 46   -----ASPEPDSEPDSRPHRL---GSRTARNRFGLPSSSSSHRHATVSSNHETDDDAGDR 97

Query: 543  XXXXXXXXXXXXXXXKPWNLRPRRFVTLPAASFKKGEKMSDEIVLYQRNDNSSSA----- 707
                           KPWNLRPR+ + +P  +F+ G   S       RN+++        
Sbjct: 98   KREGEDEAGAEEIVQKPWNLRPRKPM-IPRGAFEIGAGGS-------RNNHNGGELVEAV 149

Query: 708  --GGCGPSSKPIRFKVPAGVEVNGGPNLGSERQXXXXXXXXXXLWISLSREEIEEDIYSL 881
               G  P+ K +R +             G               WI+LS+EEIEEDI+ +
Sbjct: 150  NNNGDNPTPKSLRLR-------------GFADTSCTEKKEKRKFWIALSKEEIEEDIFVM 196

Query: 882  TGSXXXXXXXXXXXTVQKQMDTVFPGLFLVGMNVDSYRVHES 1007
            TGS            VQKQMD+VFPGL+LVG+  D+YRV ++
Sbjct: 197  TGSRPNRRPRKRPKNVQKQMDSVFPGLWLVGITADAYRVADT 238


>gb|EOY24267.1| Uncharacterized protein isoform 4 [Theobroma cacao]
          Length = 227

 Score = 93.6 bits (231), Expect = 2e-16
 Identities = 84/277 (30%), Positives = 110/277 (39%), Gaps = 11/277 (3%)
 Frame = +3

Query: 210  MGTAPVKSQPLHNFDLPHLRWXXXXXXXXXXXXXXRFRRRGSPPVLHYLVNNXXXXXXXX 389
            M TAPVKSQPLHNF+ P L+W                  R SP                 
Sbjct: 1    MATAPVKSQPLHNFNFPFLKWGTHGGGGSSTSSADH---RRSP----------------- 40

Query: 390  XXRAATAVAESDPDNDDT------SNAPQVVKTSIRTPRKPF--SFAACSSQRKEAS--K 539
                     ESD D+D        S + ++ + S   P KP   S      Q++E    K
Sbjct: 41   ---------ESDSDHDRLRPTRVGSRSTRIQRLSFLPPPKPIKQSHGEDEEQQQEEQPLK 91

Query: 540  XXXXXXXXXXXXXXXXKPWNLRPRRFVTLPAASFKKG-EKMSDEIVLYQRNDNSSSAGGC 716
                            +PWNLRPR+ V    A      EK+S+                 
Sbjct: 92   PHKNEAEEEEEEETVQRPWNLRPRKVVVETTAVVTTAMEKVSET---------------A 136

Query: 717  GPSSKPIRFKVPAGVEVNGGPNLGSERQXXXXXXXXXXLWISLSREEIEEDIYSLTGSXX 896
             P S  +R     G+  NGG     E++           WI+LSREEIEEDI+ +TGS  
Sbjct: 137  APKSMRLR-----GLAENGGIVEKKEKRK---------FWIALSREEIEEDIFVMTGSRP 182

Query: 897  XXXXXXXXXTVQKQMDTVFPGLFLVGMNVDSYRVHES 1007
                      +QKQ+D VFPGL+LVG   D+YRV ++
Sbjct: 183  ARRPKKRPKNIQKQLDAVFPGLWLVGTTADAYRVADA 219


>gb|EOY24264.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508777009|gb|EOY24265.1| Uncharacterized protein
            isoform 1 [Theobroma cacao] gi|508777010|gb|EOY24266.1|
            Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508777012|gb|EOY24268.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 223

 Score = 93.6 bits (231), Expect = 2e-16
 Identities = 84/277 (30%), Positives = 110/277 (39%), Gaps = 11/277 (3%)
 Frame = +3

Query: 210  MGTAPVKSQPLHNFDLPHLRWXXXXXXXXXXXXXXRFRRRGSPPVLHYLVNNXXXXXXXX 389
            M TAPVKSQPLHNF+ P L+W                  R SP                 
Sbjct: 1    MATAPVKSQPLHNFNFPFLKWGTHGGGGSSTSSADH---RRSP----------------- 40

Query: 390  XXRAATAVAESDPDNDDT------SNAPQVVKTSIRTPRKPF--SFAACSSQRKEAS--K 539
                     ESD D+D        S + ++ + S   P KP   S      Q++E    K
Sbjct: 41   ---------ESDSDHDRLRPTRVGSRSTRIQRLSFLPPPKPIKQSHGEDEEQQQEEQPLK 91

Query: 540  XXXXXXXXXXXXXXXXKPWNLRPRRFVTLPAASFKKG-EKMSDEIVLYQRNDNSSSAGGC 716
                            +PWNLRPR+ V    A      EK+S+                 
Sbjct: 92   PHKNEAEEEEEEETVQRPWNLRPRKVVVETTAVVTTAMEKVSET---------------A 136

Query: 717  GPSSKPIRFKVPAGVEVNGGPNLGSERQXXXXXXXXXXLWISLSREEIEEDIYSLTGSXX 896
             P S  +R     G+  NGG     E++           WI+LSREEIEEDI+ +TGS  
Sbjct: 137  APKSMRLR-----GLAENGGIVEKKEKRK---------FWIALSREEIEEDIFVMTGSRP 182

Query: 897  XXXXXXXXXTVQKQMDTVFPGLFLVGMNVDSYRVHES 1007
                      +QKQ+D VFPGL+LVG   D+YRV ++
Sbjct: 183  ARRPKKRPKNIQKQLDAVFPGLWLVGTTADAYRVADA 219


>gb|EMJ10735.1| hypothetical protein PRUPE_ppa010718mg [Prunus persica]
          Length = 238

 Score = 93.2 bits (230), Expect = 2e-16
 Identities = 83/271 (30%), Positives = 105/271 (38%), Gaps = 5/271 (1%)
 Frame = +3

Query: 210  MGTAPVKSQPLHNFDLPHLRWXXXXXXXXXXXXXXRFRRRGSPPVLHYLVNNXXXXXXXX 389
            M TAPVK  PLHNF L  L+W                             NN        
Sbjct: 1    MATAPVKP-PLHNFPLAFLKWGAK--------------------------NNSTTNNNHR 33

Query: 390  XXRAATAVAESDPDNDDTSNAPQVVKT-SIRTPRKPFSFAACSS---QRKEASKXXXXXX 557
              R  +A   S+PD++         +  S R  R  +S   C+    +R E  +      
Sbjct: 34   YRRPVSAEPASEPDSESERTHYNNSRVGSSRASRHRYSLIPCAGDKRRRSEERESDQEEG 93

Query: 558  XXXXXXXXXXKPWNLRPRRFVTLPAA-SFKKGEKMSDEIVLYQRNDNSSSAGGCGPSSKP 734
                      KPWNLRPRR    PA  SF KG    +   L   N N S      P S  
Sbjct: 94   EEADKAEVVHKPWNLRPRR---APATTSFSKGGANGEPHELESPNPNQSELQQ--PKSMR 148

Query: 735  IRFKVPAGVEVNGGPNLGSERQXXXXXXXXXXLWISLSREEIEEDIYSLTGSXXXXXXXX 914
            +R     G  V    N                 WI+LS+EEIEEDI+ +TGS        
Sbjct: 149  LRGLAAEGQNVEKKEN--------------RKFWIALSKEEIEEDIFVMTGSRPARRPKK 194

Query: 915  XXXTVQKQMDTVFPGLFLVGMNVDSYRVHES 1007
                VQKQ+D  FPGL+LVG+  D+Y+V +S
Sbjct: 195  RPKNVQKQLDITFPGLWLVGVTADAYKVADS 225


>ref|XP_003525577.1| PREDICTED: histone-lysine N-methyltransferase 2E-like [Glycine max]
          Length = 241

 Score = 92.8 bits (229), Expect = 3e-16
 Identities = 82/269 (30%), Positives = 108/269 (40%), Gaps = 6/269 (2%)
 Frame = +3

Query: 219  APVKSQPLHNFDLPHLRWXXXXXXXXXXXXXX-RFRRRGSPPVLHYLVNNXXXXXXXXXX 395
            APVKSQPLHNF LP L+W               RFRR    P  H               
Sbjct: 14   APVKSQPLHNFALPFLKWGASGKNNTTNAAHHHRFRR----PSDH--------------- 54

Query: 396  RAATAVAESDPDNDDTSNAPQVVKTSIRTPRKPFSFAACSSQRKEASKXXXXXXXXXXXX 575
                    S+PD+ D  + P   +   RT R  FS       +                 
Sbjct: 55   -------ASEPDSSDPDSRPH--RLGSRTARNRFSLPL----KPPPPPPPPQPPHDDDAD 101

Query: 576  XXXXKPWNLRPRRFVTLP---AASFKKGEKMSDEIVLYQRNDNSS--SAGGCGPSSKPIR 740
                KPW LRPR+   LP   A     G   +     +   +N      G   P+ K +R
Sbjct: 102  DSVQKPWKLRPRKPALLPNKTALEIGTGPSRNHHHHHHHATNNGEFLDGGDNNPAPKSLR 161

Query: 741  FKVPAGVEVNGGPNLGSERQXXXXXXXXXXLWISLSREEIEEDIYSLTGSXXXXXXXXXX 920
             +  +  +        SE++           WI+LSREEIEEDI+ +TGS          
Sbjct: 162  LRGFSDTQC-------SEKKEKRK------FWIALSREEIEEDIFVMTGSRPARRPRKRP 208

Query: 921  XTVQKQMDTVFPGLFLVGMNVDSYRVHES 1007
              VQKQMD+VFPGL+LVG+  D+YRV ++
Sbjct: 209  KNVQKQMDSVFPGLWLVGITADAYRVADT 237


>ref|XP_003635203.1| PREDICTED: uncharacterized protein LOC100853295 [Vitis vinifera]
            gi|296085701|emb|CBI29500.3| unnamed protein product
            [Vitis vinifera]
          Length = 240

 Score = 90.5 bits (223), Expect = 1e-15
 Identities = 82/276 (29%), Positives = 112/276 (40%), Gaps = 10/276 (3%)
 Frame = +3

Query: 210  MGTAPVKSQPLHNFDLPHLRWXXXXXXXXXXXXXXRFRRRGSPPVLHYLVNNXXXXXXXX 389
            M TAPVKSQPLHNF L  L+W                 R   P                 
Sbjct: 1    MATAPVKSQPLHNFPLSFLKWGKNQMNNHRCRKPVDALRESPPD---------------- 44

Query: 390  XXRAATAVAESDPDND-----DTSNAPQVVKTSIRTPRKPFSFAACS----SQRKEAS-K 539
                     ES+PD+D     ++ +  + +    RT R   + A+ S    +Q+ +A  +
Sbjct: 45   -----GRKNESEPDSDGGSKNESDSENRKLPLGSRTARSRHAVASPSPVEKAQKNQALVE 99

Query: 540  XXXXXXXXXXXXXXXXKPWNLRPRRFVTLPAASFKKGEKMSDEIVLYQRNDNSSSAGGCG 719
                            KPWNLRPR+ V+          K   EI +  +N     A    
Sbjct: 100  REGGEVDEGEGEESVQKPWNLRPRKAVS----------KSPIEIGVAPKNGELQEAVPGV 149

Query: 720  PSSKPIRFKVPAGVEVNGGPNLGSERQXXXXXXXXXXLWISLSREEIEEDIYSLTGSXXX 899
            P S+      P  + + G     S  +           WISLSREEIEEDI+ +TGS   
Sbjct: 150  PHSE----NQPKSLRLRGFAESHSSEKKEKRK-----FWISLSREEIEEDIFVMTGSKPA 200

Query: 900  XXXXXXXXTVQKQMDTVFPGLFLVGMNVDSYRVHES 1007
                     VQKQ+D VFPGL+LVG+  DSYR+ ++
Sbjct: 201  RRPKKRAKNVQKQLDNVFPGLWLVGVTPDSYRLPDA 236


>ref|XP_002531462.1| conserved hypothetical protein [Ricinus communis]
           gi|223528916|gb|EEF30912.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 265

 Score = 84.0 bits (206), Expect = 1e-13
 Identities = 87/290 (30%), Positives = 117/290 (40%), Gaps = 27/290 (9%)
 Frame = +3

Query: 210 MGTAPVKSQPLHNFDLPHLRWXXXXXXXXXXXXXXRFRRRGSPPVLHYLVNNXXXXXXXX 389
           M TAPVK Q LHNF +  L+W                    S        NN        
Sbjct: 1   MATAPVKPQQLHNFPIS-LKWGQTTTTTTISANHQHHHHNRSSSS-----NNQ------- 47

Query: 390 XXRAATAV----AESDPD-NDDTSNAPQVVKTSIRTPRKPFSFAACSS----------QR 524
             R AT V     ESDPD +  T   P+V   S R  R  +SFA+CS+          Q+
Sbjct: 48  --RLATPVHESETESDPDQSQSTIRHPRVGSRSARVHR--YSFASCSTLLPKAKTEIPQK 103

Query: 525 KEASKXXXXXXXXXXXXXXXX------------KPWNLRPRRFVTLPAASFKKGEKMSDE 668
            EA++                            +PW LRPR+ + L  +S +    + +E
Sbjct: 104 PEATEKPQQKNLAVLENNNKNEAEEIEEEDSSSRPWKLRPRKGI-LTGSSKETATLLGNE 162

Query: 669 IVLYQRNDNSSSAGGCGPSSKPIRFKVPAGVEVNGGPNLGSERQXXXXXXXXXXLWISLS 848
               QR+  +       P S  +R  V +    + G  +G               W++LS
Sbjct: 163 ----QRDSTT-------PKSMRLRGLVDS---TSSGLGVGLGNGVSLEKKEKRKFWVALS 208

Query: 849 REEIEEDIYSLTGSXXXXXXXXXXXTVQKQMDTVFPGLFLVGMNVDSYRV 998
           REEIEED++ LTGS            VQK +D+VFPGL+LVG   DSYRV
Sbjct: 209 REEIEEDVFVLTGSRPARRPKKRPKNVQKILDSVFPGLWLVGTTADSYRV 258


>gb|ABD65177.1| hypothetical protein 40.t00065 [Brassica oleracea]
          Length = 237

 Score = 82.0 bits (201), Expect = 5e-13
 Identities = 63/213 (29%), Positives = 91/213 (42%), Gaps = 12/213 (5%)
 Frame = +3

Query: 405  TAVAESDPDNDDTSNAPQVVKTSI-----RTPRKPFSFAACSSQR-------KEASKXXX 548
            +AV + DP +D +   P V   ++     R PR  FS  A SS+R        E +    
Sbjct: 26   SAVTDVDPKSDPSPETPPVSNRTVASRSSRQPRLSFSSLAPSSERDHQKKVKSEENPPRR 85

Query: 549  XXXXXXXXXXXXXKPWNLRPRRFVTLPAASFKKGEKMSDEIVLYQRNDNSSSAGGCGPSS 728
                         + WNLRPR+      AS  K +K    +   + N          P+ 
Sbjct: 86   EEVPVSAEEDEEKRKWNLRPRKACGGGGASEAKNQKPVAAVAEAKSNRQRGI-----PAE 140

Query: 729  KPIRFKVPAGVEVNGGPNLGSERQXXXXXXXXXXLWISLSREEIEEDIYSLTGSXXXXXX 908
             P       G+   GG    +E            LW++LSR+EIEED++S++G+      
Sbjct: 141  SP-------GLGGGGGVEAKNENHR---------LWVALSRDEIEEDVFSMSGNRPSRRP 184

Query: 909  XXXXXTVQKQMDTVFPGLFLVGMNVDSYRVHES 1007
                 T+QK +D +FPGL LVGMN D +RV  S
Sbjct: 185  RKRTKTLQKHLDVIFPGLCLVGMNADCFRVSTS 217


>ref|XP_006440253.1| hypothetical protein CICLE_v10022000mg [Citrus clementina]
           gi|557542515|gb|ESR53493.1| hypothetical protein
           CICLE_v10022000mg [Citrus clementina]
          Length = 236

 Score = 74.7 bits (182), Expect = 9e-11
 Identities = 74/258 (28%), Positives = 94/258 (36%), Gaps = 4/258 (1%)
 Frame = +3

Query: 210 MGTAPVKSQPLHNFDLPHLRWXXXXXXXXXXXXXXRFRRRGSPPVLHYLVNNXXXXXXXX 389
           M TAP+KSQPLHNF L  L+W                R R  PP                
Sbjct: 1   MTTAPMKSQPLHNFSLSFLKWGTHHPNPNHN------RTRTPPPT--------------- 39

Query: 390 XXRAATAVAESDPDNDDTSNAPQVVKTSIRTPRKPFSFAACSSQRKEASKXXXXXXXXXX 569
                    E D  +D T +   V   S R  R  F  +    Q+    +          
Sbjct: 40  ---------EPDTTDDSTRHHRVVGSRSSRAQRLSFPSSTSKPQQDAVERPQRQTADTEE 90

Query: 570 XXXXXX-KPWNLRPRRFVTLPAASFKKGEKMSDEIVLYQRNDNSSSAGGCGPSSKPIRFK 746
                  +PWNLRPR          K  E + D  V   R DN+++     P S  +R  
Sbjct: 91  EEEDEVGRPWNLRPR----------KVQETLVDVAVFQNRGDNNANTKA--PKSTRLREM 138

Query: 747 VPAGVEVNGGPNLGSERQXXXXXXXXXXLWISLSREEIEEDIYSLTGSXXXXXXXXXXXT 926
           V    E  G      E+            W++LSREEIEEDI+ +TGS            
Sbjct: 139 V----ESRGSNGDKKEKNK---------FWVTLSREEIEEDIFIMTGSRPARRPRKRPKN 185

Query: 927 VQKQMDTVF---PGLFLV 971
           VQKQ+D  +   PG FLV
Sbjct: 186 VQKQLDVRYFCSPGFFLV 203


>ref|XP_006285401.1| hypothetical protein CARUB_v10006806mg, partial [Capsella rubella]
            gi|482554106|gb|EOA18299.1| hypothetical protein
            CARUB_v10006806mg, partial [Capsella rubella]
          Length = 256

 Score = 73.6 bits (179), Expect = 2e-10
 Identities = 50/148 (33%), Positives = 67/148 (45%), Gaps = 8/148 (5%)
 Frame = +3

Query: 588  KPWNLRPRRFVTLPAASFKKGEKMSDEIVLY--------QRNDNSSSAGGCGPSSKPIRF 743
            + WNLRPR+         KKG  +                   N  S GG  P S   R 
Sbjct: 117  RTWNLRPRKAY---GGGLKKGNGVFTAEACVGVGGGGGASEVKNQKSGGGMEPKSNRQR- 172

Query: 744  KVPAGVEVNGGPNLGSERQXXXXXXXXXXLWISLSREEIEEDIYSLTGSXXXXXXXXXXX 923
             +PA     GG  + +E            LW++LSR+EIEED++S+ GS           
Sbjct: 173  GIPAESPGLGGGEVANENHR---------LWVALSRDEIEEDLFSMCGSRPSRRPRKRTK 223

Query: 924  TVQKQMDTVFPGLFLVGMNVDSYRVHES 1007
            T+QK +D +FPGL LVGMN D ++V  S
Sbjct: 224  TLQKYLDVIFPGLCLVGMNADCFKVSNS 251

