BLASTX nr result

ID: Catharanthus22_contig00031847 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00031847
         (1373 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006352068.1| PREDICTED: intermediate filament protein ifa...    97   2e-17
ref|XP_004250760.1| PREDICTED: uncharacterized protein LOC101255...    92   5e-16
ref|XP_002319864.2| hypothetical protein POPTR_0013s09040g [Popu...    84   2e-13
ref|XP_002521745.1| hypothetical protein RCOM_1329260 [Ricinus c...    82   4e-13
ref|XP_004290260.1| PREDICTED: uncharacterized protein LOC101310...    81   1e-12
gb|EEE68465.1| hypothetical protein OsJ_26861 [Oryza sativa Japo...    78   8e-12
gb|EEC83360.1| hypothetical protein OsI_28766 [Oryza sativa Indi...    78   8e-12
ref|XP_004513401.1| PREDICTED: myosin-10-like [Cicer arietinum]        76   4e-11
ref|XP_006441781.1| hypothetical protein CICLE_v10023084mg [Citr...    75   5e-11
ref|XP_006837914.1| hypothetical protein AMTR_s03150p00001340 [A...    75   6e-11
ref|XP_002265655.2| PREDICTED: uncharacterized protein LOC100248...    75   8e-11
ref|XP_006478342.1| PREDICTED: filamin A-interacting protein 1-l...    74   1e-10
gb|ESW05862.1| hypothetical protein PHAVU_011G215700g [Phaseolus...    73   3e-10
ref|XP_006592981.1| PREDICTED: flagellar attachment zone protein...    71   1e-09
gb|EXC32476.1| hypothetical protein L484_012643 [Morus notabilis]      70   2e-09
ref|XP_006415080.1| hypothetical protein EUTSA_v10008295mg [Eutr...    67   1e-08
gb|AAG51223.1|AC051630_20 hypothetical protein; 76532-78443 [Ara...    67   2e-08
ref|XP_003573710.1| PREDICTED: uncharacterized protein LOC100834...    67   2e-08
gb|EOY16977.1| Uncharacterized protein TCM_036062 [Theobroma cacao]    64   2e-07

>ref|XP_006352068.1| PREDICTED: intermediate filament protein ifa-3-like isoform X1
            [Solanum tuberosum]
          Length = 273

 Score = 97.1 bits (240), Expect = 2e-17
 Identities = 63/161 (39%), Positives = 88/161 (54%), Gaps = 14/161 (8%)
 Frame = -1

Query: 1364 YYSKGVGEITRQLNEQKEWIIADKQYTWLQN----------LAGCLPVSQGN*MKSLRMK 1215
            YY+K V E+T QL EQ+ WI   KQ  W+ +            G +  +Q N  + L+M 
Sbjct: 112  YYAKTVEEVTAQLGEQQGWIKDCKQNLWVGDNGQVMDKVGEKTGEIEENQDNLAEILKMN 171

Query: 1214 LDTSTTESRPY--FKEYHVEAVIGAREGVYLLI--FSSPVPELKEMDSKMLEEKLQALFS 1047
            L  + T+       K   V      R  + L+    +    +L +MDSK L+E+ QAL S
Sbjct: 172  LADAKTKLNQMSELKSKLVTENSQVRRSIELVKSKMNDFKAQLGDMDSKSLQEEYQALLS 231

Query: 1046 DKA*EIEYLQSLQLQIIKMKEISHTVKCSCGEEYKVELGLC 924
            DKA E EYL SLQLQI K+  ISH++KCSCG E+K+++ LC
Sbjct: 232  DKAGEAEYLHSLQLQIAKLMIISHSIKCSCGNEFKIDMDLC 272


>ref|XP_004250760.1| PREDICTED: uncharacterized protein LOC101255855 [Solanum
            lycopersicum]
          Length = 268

 Score = 92.0 bits (227), Expect = 5e-16
 Identities = 60/159 (37%), Positives = 83/159 (52%), Gaps = 12/159 (7%)
 Frame = -1

Query: 1364 YYSKGVGEITRQLNEQKEWIIADKQYTWL----------QNLAGCLPVSQGN*MKSLRMK 1215
            YY+K V E+T QL EQ+ WI   KQ  W+              G +  +Q   ++ L  K
Sbjct: 112  YYAKTVEEVTAQLGEQQGWIKDCKQNLWVGDNGQVMDKVSEKTGEIEENQDKLVEILNAK 171

Query: 1214 LDTSTTESRPYFKEYHVEAVIGAREGVYLLIFSSP--VPELKEMDSKMLEEKLQALFSDK 1041
               +        K   V      R  + L+   +     +L +MDSK L+E+ QAL SDK
Sbjct: 172  TKLNQMSE---LKSKLVTENSQVRRSIELVKSKTNDFKAQLGDMDSKSLQEEYQALLSDK 228

Query: 1040 A*EIEYLQSLQLQIIKMKEISHTVKCSCGEEYKVELGLC 924
            A E EYL SLQLQI K+  ISH++KCSCG E+K+++ LC
Sbjct: 229  AGEAEYLHSLQLQIAKLMIISHSIKCSCGNEFKIDMNLC 267


>ref|XP_002319864.2| hypothetical protein POPTR_0013s09040g [Populus trichocarpa]
            gi|550325325|gb|EEE95787.2| hypothetical protein
            POPTR_0013s09040g [Populus trichocarpa]
          Length = 258

 Score = 83.6 bits (205), Expect = 2e-13
 Identities = 57/159 (35%), Positives = 87/159 (54%), Gaps = 10/159 (6%)
 Frame = -1

Query: 1367 AYYSKGVGEITRQLNEQKEWIIADKQYTWL----------QNLAGCLPVSQGN*MKSLRM 1218
            AYYSK   ++  +L +Q++W+   +    +          +NL   L  ++   ++  +M
Sbjct: 111  AYYSKVADDMNSKLQQQQDWVHTHRISGEMGEHGSGNDAEKNLIAKLGSAKSKLVEIAQM 170

Query: 1217 KLDTSTTESRPYFKEYHVEAVIGAREGVYLLIFSSPVPELKEMDSKMLEEKLQALFSDKA 1038
            K    T  ++   K+   +    A++      F +   E  EMD K LEE+ +AL SD+A
Sbjct: 171  KSKLVTENNK--MKQSIEQLKCSAKD------FKT---EFLEMDIKTLEEEYKALLSDRA 219

Query: 1037 *EIEYLQSLQLQIIKMKEISHTVKCSCGEEYKVELGLCA 921
             EIEYLQSLQ QI ++K+ISH VKC+CG EYKV + LCA
Sbjct: 220  GEIEYLQSLQKQIKQLKDISHMVKCACGVEYKVAMELCA 258


>ref|XP_002521745.1| hypothetical protein RCOM_1329260 [Ricinus communis]
            gi|223538958|gb|EEF40555.1| hypothetical protein
            RCOM_1329260 [Ricinus communis]
          Length = 293

 Score = 82.4 bits (202), Expect = 4e-13
 Identities = 41/62 (66%), Positives = 48/62 (77%)
 Frame = -1

Query: 1106 PELKEMDSKMLEEKLQALFSDKA*EIEYLQSLQLQIIKMKEISHTVKCSCGEEYKVELGL 927
            PEL  MD+  LEE+ +AL SDKA E EYLQSLQ QI K+K ISH +KC+CG EYKVE+ L
Sbjct: 232  PELLAMDTTTLEEEYKALLSDKAGEFEYLQSLQDQIDKLKGISHMIKCACGMEYKVEMDL 291

Query: 926  CA 921
            CA
Sbjct: 292  CA 293


>ref|XP_004290260.1| PREDICTED: uncharacterized protein LOC101310565 [Fragaria vesca
            subsp. vesca]
          Length = 285

 Score = 80.9 bits (198), Expect = 1e-12
 Identities = 67/181 (37%), Positives = 87/181 (48%), Gaps = 34/181 (18%)
 Frame = -1

Query: 1364 YYSKGVGEITRQLNEQKEWIIA-----------------DKQYTWLQNLAG-----CLPV 1251
            YY K   +I  +L +QK+WII                  D+Q    Q  A      C+  
Sbjct: 112  YYLKVSEDIAAKLQQQKDWIICHQTTTELGEPGMVKDTIDEQRVATQGKASIGDHLCI-T 170

Query: 1250 SQGN*MK--------SLRMKLDTSTTESRPYFKE-YHVEAVI---GAREGVYLLIFSSPV 1107
            +QGN  +        S ++KLD          KE Y ++  I     RE       S+  
Sbjct: 171  NQGNDARKNLMAMVDSAKVKLDEILQMKSELVKENYKMKQAIDQVNCRE-------SNFK 223

Query: 1106 PELKEMDSKMLEEKLQALFSDKA*EIEYLQSLQLQIIKMKEISHTVKCSCGEEYKVELGL 927
            PEL+ +D K LE++  AL SD A E EYL+SLQ QI K+K ISH +KCSCG EYKVEL  
Sbjct: 224  PELRALDIKTLEDEYNALVSDNAGEAEYLKSLQDQIEKLKGISHVLKCSCGVEYKVELDS 283

Query: 926  C 924
            C
Sbjct: 284  C 284


>gb|EEE68465.1| hypothetical protein OsJ_26861 [Oryza sativa Japonica Group]
          Length = 290

 Score = 78.2 bits (191), Expect = 8e-12
 Identities = 62/175 (35%), Positives = 90/175 (51%), Gaps = 28/175 (16%)
 Frame = -1

Query: 1370 KAYYSKGVGEITRQLNEQKEWIIADK-----------QYTWLQNLAG--------CLPVS 1248
            + +Y+K +  +T +L EQ+EW+ A K           +    QNL G        C  + 
Sbjct: 110  RLFYTKTIESLTVKLQEQQEWLGAFKLKVITIEPSVEESQSKQNLQGQSHGILNSCGSLD 169

Query: 1247 QGN*MKS----LRMKLDTSTTESRPYFKEYHVEAVIGAREGVYLL-----IFSSPVPELK 1095
            +GN + S    LR++L+ ST       KE     +    E   ++       S  +  L+
Sbjct: 170  KGNDIGSKQGELRIQLE-STKHKIDEIKEKQSALLTEISESKQVIEQEKNAISGFLAPLQ 228

Query: 1094 EMDSKMLEEKLQALFSDKA*EIEYLQSLQLQIIKMKEISHTVKCSCGEEYKVELG 930
            +MD K LEE+ +AL +DKA EIEY QSL+ +I +MK +S  VKC CG EYKVELG
Sbjct: 229  QMDMKSLEEEHKALQADKAGEIEYFQSLEERINEMKGVSDAVKCRCGLEYKVELG 283


>gb|EEC83360.1| hypothetical protein OsI_28766 [Oryza sativa Indica Group]
          Length = 290

 Score = 78.2 bits (191), Expect = 8e-12
 Identities = 62/175 (35%), Positives = 90/175 (51%), Gaps = 28/175 (16%)
 Frame = -1

Query: 1370 KAYYSKGVGEITRQLNEQKEWIIADK-----------QYTWLQNLAG--------CLPVS 1248
            + +Y+K +  +T +L EQ+EW+ A K           +    QNL G        C  + 
Sbjct: 110  RLFYTKTIESLTVKLQEQQEWLGAFKLKVITIEPSVEESQSKQNLQGQSHGILNSCGSLD 169

Query: 1247 QGN*MKS----LRMKLDTSTTESRPYFKEYHVEAVIGAREGVYLL-----IFSSPVPELK 1095
            +GN + S    LR++L+ ST       KE     +    E   ++       S  +  L+
Sbjct: 170  KGNDIGSKQGELRIQLE-STKHKIDEIKEKQSALLTEISESKQVIEQEKNAISGFLAPLQ 228

Query: 1094 EMDSKMLEEKLQALFSDKA*EIEYLQSLQLQIIKMKEISHTVKCSCGEEYKVELG 930
            +MD K LEE+ +AL +DKA EIEY QSL+ +I +MK +S  VKC CG EYKVELG
Sbjct: 229  QMDMKSLEEEHKALQADKAGEIEYFQSLEERINEMKGVSDAVKCRCGLEYKVELG 283


>ref|XP_004513401.1| PREDICTED: myosin-10-like [Cicer arietinum]
          Length = 283

 Score = 75.9 bits (185), Expect = 4e-11
 Identities = 36/58 (62%), Positives = 43/58 (74%)
 Frame = -1

Query: 1106 PELKEMDSKMLEEKLQALFSDKA*EIEYLQSLQLQIIKMKEISHTVKCSCGEEYKVEL 933
            PELK  D   LEE+  AL SDKA E EYLQS++ Q+ K+KEI H +KC+CGEEY VEL
Sbjct: 223  PELKAADISALEEEYNALLSDKAGETEYLQSIEKQVEKLKEICHVIKCACGEEYTVEL 280


>ref|XP_006441781.1| hypothetical protein CICLE_v10023084mg [Citrus clementina]
            gi|557544043|gb|ESR55021.1| hypothetical protein
            CICLE_v10023084mg [Citrus clementina]
          Length = 89

 Score = 75.5 bits (184), Expect = 5e-11
 Identities = 39/69 (56%), Positives = 49/69 (71%)
 Frame = -1

Query: 1127 LIFSSPVPELKEMDSKMLEEKLQALFSDKA*EIEYLQSLQLQIIKMKEISHTVKCSCGEE 948
            L+  +  PEL EMD K LEE+   L SD A E EYLQSLQ QI K++ ISH +KC+CG+E
Sbjct: 21   LVSDNNKPELMEMDIKTLEEEHGTLLSDIAGEAEYLQSLQHQIEKLEGISHVIKCACGQE 80

Query: 947  YKVELGLCA 921
            YKV++ L A
Sbjct: 81   YKVKVSLSA 89


>ref|XP_006837914.1| hypothetical protein AMTR_s03150p00001340 [Amborella trichopoda]
            gi|548840297|gb|ERN00483.1| hypothetical protein
            AMTR_s03150p00001340 [Amborella trichopoda]
          Length = 193

 Score = 75.1 bits (183), Expect = 6e-11
 Identities = 38/58 (65%), Positives = 46/58 (79%)
 Frame = -1

Query: 1106 PELKEMDSKMLEEKLQALFSDKA*EIEYLQSLQLQIIKMKEISHTVKCSCGEEYKVEL 933
            PELK MD   LEE+ +AL SDKA EI YLQSLQ +I ++K IS +VKC+CGEEYKVE+
Sbjct: 133  PELKAMDISALEEEHKALISDKAGEISYLQSLQERIEQLKGISQSVKCACGEEYKVEI 190


>ref|XP_002265655.2| PREDICTED: uncharacterized protein LOC100248648 [Vitis vinifera]
          Length = 334

 Score = 74.7 bits (182), Expect = 8e-11
 Identities = 37/60 (61%), Positives = 44/60 (73%)
 Frame = -1

Query: 1103 ELKEMDSKMLEEKLQALFSDKA*EIEYLQSLQLQIIKMKEISHTVKCSCGEEYKVELGLC 924
            EL+ MD K +EE+  AL SDKA E EYL SLQ QI K+K +SH +KC+CG EYKV L LC
Sbjct: 274  ELRAMDMKNMEEEYNALLSDKAGEAEYLHSLQGQIEKLKGLSHKIKCACGTEYKVGLELC 333


>ref|XP_006478342.1| PREDICTED: filamin A-interacting protein 1-like [Citrus sinensis]
          Length = 283

 Score = 73.9 bits (180), Expect = 1e-10
 Identities = 39/62 (62%), Positives = 45/62 (72%)
 Frame = -1

Query: 1106 PELKEMDSKMLEEKLQALFSDKA*EIEYLQSLQLQIIKMKEISHTVKCSCGEEYKVELGL 927
            PEL EMD K LEE+   L SD A E EYLQSLQ QI K++ ISH +KC CG+EYKVE+ L
Sbjct: 222  PELMEMDIKTLEEEHGTLQSDIAGEAEYLQSLQHQIEKLEGISHVIKCVCGQEYKVEVSL 281

Query: 926  CA 921
             A
Sbjct: 282  SA 283


>gb|ESW05862.1| hypothetical protein PHAVU_011G215700g [Phaseolus vulgaris]
          Length = 284

 Score = 72.8 bits (177), Expect = 3e-10
 Identities = 58/180 (32%), Positives = 84/180 (46%), Gaps = 34/180 (18%)
 Frame = -1

Query: 1370 KAYYSKGVGEITRQLNEQKEWIIADKQY-TWLQN-------LAGCLPVSQG--------- 1242
            +AYYSK   E+  +L +Q+EW+ + ++  + LQ        +AG +  ++G         
Sbjct: 110  RAYYSKVAEEMNAKLQKQQEWVSSTRKIRSELQKHDLVTGKVAGQISKAEGETGAICNLV 169

Query: 1241 -------------N*MKSLRMKLDTSTTESRPYFKEYHVEAV----IGAREGVYLLIFSS 1113
                         N + S +  LD   T       E     +    +  RE  +      
Sbjct: 170  MDNLGSVARNNLINELDSAKATLDEILTLKAKVLTENSKIKLAIEEVKCRENEFK----- 224

Query: 1112 PVPELKEMDSKMLEEKLQALFSDKA*EIEYLQSLQLQIIKMKEISHTVKCSCGEEYKVEL 933
              PELK  D   LEE+ +AL SDK  E EYLQSL+ Q+ ++KEI H VKC+CGEEY V L
Sbjct: 225  --PELKAADLTALEEEYKALLSDKDGETEYLQSLEKQVERLKEIRHVVKCACGEEYTVAL 282


>ref|XP_006592981.1| PREDICTED: flagellar attachment zone protein 1-like [Glycine max]
          Length = 283

 Score = 70.9 bits (172), Expect = 1e-09
 Identities = 34/60 (56%), Positives = 43/60 (71%)
 Frame = -1

Query: 1106 PELKEMDSKMLEEKLQALFSDKA*EIEYLQSLQLQIIKMKEISHTVKCSCGEEYKVELGL 927
            PELK  D   LEE+  AL SDKA E EYLQSL+ Q+ K+++I H VKC+CGEEY V + +
Sbjct: 224  PELKAADITALEEECTALISDKAGEAEYLQSLEKQVEKLEQIRHVVKCACGEEYTVAVNM 283


>gb|EXC32476.1| hypothetical protein L484_012643 [Morus notabilis]
          Length = 281

 Score = 70.1 bits (170), Expect = 2e-09
 Identities = 35/62 (56%), Positives = 43/62 (69%)
 Frame = -1

Query: 1106 PELKEMDSKMLEEKLQALFSDKA*EIEYLQSLQLQIIKMKEISHTVKCSCGEEYKVELGL 927
            P L+ +D K LE++   L SDKA   EYLQSLQ Q+  +K ISH VKC+CGEEY+V   L
Sbjct: 220  PLLRAVDLKTLEKEYNTLLSDKAGVTEYLQSLQAQVDILKGISHVVKCACGEEYRVGTDL 279

Query: 926  CA 921
            CA
Sbjct: 280  CA 281


>ref|XP_006415080.1| hypothetical protein EUTSA_v10008295mg [Eutrema salsugineum]
            gi|557092851|gb|ESQ33433.1| hypothetical protein
            EUTSA_v10008295mg [Eutrema salsugineum]
          Length = 305

 Score = 67.4 bits (163), Expect = 1e-08
 Identities = 34/58 (58%), Positives = 41/58 (70%)
 Frame = -1

Query: 1106 PELKEMDSKMLEEKLQALFSDKA*EIEYLQSLQLQIIKMKEISHTVKCSCGEEYKVEL 933
            PEL  +D K+LEE+  AL SD++ E EYLQSLQ Q  K+K IS+  KC CGEEY V L
Sbjct: 247  PELMSVDIKVLEEEYTALLSDESGEAEYLQSLQSQAEKLKGISYIAKCGCGEEYSVGL 304


>gb|AAG51223.1|AC051630_20 hypothetical protein; 76532-78443 [Arabidopsis thaliana]
          Length = 254

 Score = 67.0 bits (162), Expect = 2e-08
 Identities = 55/149 (36%), Positives = 74/149 (49%), Gaps = 3/149 (2%)
 Frame = -1

Query: 1370 KAYYSKGVGEITRQLNEQKEWIIADKQYTWLQNLAGCLPVSQGN*MK---SLRMKLDTST 1200
            ++ Y K   E   +L EQK W I+       Q   G    ++ N M+   S R KLD + 
Sbjct: 110  RSNYLKTAEEARTKLEEQKGWFISHMSNETGQQ--GHKKETRNNLMELSDSARAKLDQAK 167

Query: 1199 TESRPYFKEYHVEAVIGAREGVYLLIFSSPVPELKEMDSKMLEEKLQALFSDKA*EIEYL 1020
                   +E     +  + E V   I +   PEL  +D K+LEE+  AL SD++ E EYL
Sbjct: 168  LMRSNLLQEN--SKIKLSIENVKHKI-NEFKPELMSVDIKILEEEYTALLSDESGEAEYL 224

Query: 1019 QSLQLQIIKMKEISHTVKCSCGEEYKVEL 933
             SLQ Q  K+K IS+  KC CGEEY V L
Sbjct: 225  SSLQSQAEKLKGISYIAKCGCGEEYSVGL 253


>ref|XP_003573710.1| PREDICTED: uncharacterized protein LOC100834418 [Brachypodium
            distachyon]
          Length = 290

 Score = 67.0 bits (162), Expect = 2e-08
 Identities = 49/175 (28%), Positives = 88/175 (50%), Gaps = 28/175 (16%)
 Frame = -1

Query: 1373 SKAYYSKGVGEITRQLNEQKEWIIADKQYTW---------------------LQNLAGCL 1257
            ++ +YSK    +T +L E++EW+ + K+                        + N  GC+
Sbjct: 109  NRLFYSKTTEVLTSKLRERQEWLDSFKKKMVAIPLVGVSESIQNCVEGKRCEMLNSEGCI 168

Query: 1256 P--VSQGN*MKSLRMKLDTSTTESRPYFKEYHVEAVIGAREGVYLL-----IFSSPVPEL 1098
                  G+    LR++L+++  ++    K    + ++   +   +L     I +S    L
Sbjct: 169  DKETDMGSKQGELRIQLESAQLKTED-IKAKRSQILLEISKSKQILEQEKNIIASFPAAL 227

Query: 1097 KEMDSKMLEEKLQALFSDKA*EIEYLQSLQLQIIKMKEISHTVKCSCGEEYKVEL 933
            +EM+ K LEE+ +AL  DKA E+E+ Q+L+ +  +MK +S  +KC+CG EYKVEL
Sbjct: 228  QEMNMKSLEEEYKALQGDKAGEVEFFQTLEERTNEMKGVSDPIKCNCGLEYKVEL 282


>gb|EOY16977.1| Uncharacterized protein TCM_036062 [Theobroma cacao]
          Length = 283

 Score = 63.5 bits (153), Expect = 2e-07
 Identities = 31/58 (53%), Positives = 42/58 (72%)
 Frame = -1

Query: 1106 PELKEMDSKMLEEKLQALFSDKA*EIEYLQSLQLQIIKMKEISHTVKCSCGEEYKVEL 933
            PE+ EM +  LEE+ + L S+K  E EYL SLQ Q+ +MK IS+ +KC+CGEEY V+L
Sbjct: 224  PEVLEMSTDALEEEYKVLLSEKDGETEYLCSLQNQVERMKGISNVIKCACGEEYTVKL 281

