BLASTX nr result
ID: Catharanthus22_contig00031847
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00031847
         (1373 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_006352068.1| PREDICTED: intermediate filament protein ifa...    97   2e-17
ref|XP_004250760.1| PREDICTED: uncharacterized protein LOC101255...    92   5e-16
ref|XP_002319864.2| hypothetical protein POPTR_0013s09040g [Popu...    84   2e-13
ref|XP_002521745.1| hypothetical protein RCOM_1329260 [Ricinus c...    82   4e-13
ref|XP_004290260.1| PREDICTED: uncharacterized protein LOC101310...    81   1e-12
gb|EEE68465.1| hypothetical protein OsJ_26861 [Oryza sativa Japo...    78   8e-12
gb|EEC83360.1| hypothetical protein OsI_28766 [Oryza sativa Indi...    78   8e-12
ref|XP_004513401.1| PREDICTED: myosin-10-like [Cicer arietinum]        76   4e-11
ref|XP_006441781.1| hypothetical protein CICLE_v10023084mg [Citr...    75   5e-11
ref|XP_006837914.1| hypothetical protein AMTR_s03150p00001340 [A...    75   6e-11
ref|XP_002265655.2| PREDICTED: uncharacterized protein LOC100248...    75   8e-11
ref|XP_006478342.1| PREDICTED: filamin A-interacting protein 1-l...    74   1e-10
gb|ESW05862.1| hypothetical protein PHAVU_011G215700g [Phaseolus...    73   3e-10
ref|XP_006592981.1| PREDICTED: flagellar attachment zone protein...    71   1e-09
gb|EXC32476.1| hypothetical protein L484_012643 [Morus notabilis]      70   2e-09
ref|XP_006415080.1| hypothetical protein EUTSA_v10008295mg [Eutr...    67   1e-08
gb|AAG51223.1|AC051630_20 hypothetical protein; 76532-78443 [Ara...    67   2e-08
ref|XP_003573710.1| PREDICTED: uncharacterized protein LOC100834...    67   2e-08
gb|EOY16977.1| Uncharacterized protein TCM_036062 [Theobroma cacao]    64   2e-07

>ref|XP_006352068.1| PREDICTED: intermediate filament protein ifa-3-like isoform X1 [Solanum tuberosum]
          Length = 273

 Score = 97.1 bits (240), Expect = 2e-17
 Identities = 63/161 (39%), Positives = 88/161 (54%), Gaps = 14/161 (8%)
 Frame = -1

Query: 1364 YYSKGVGEITRQLNEQKEWIIADKQYTWLQN----------LAGCLPVSQGN*MKSLRMK 1215
            YY+K V E+T QL EQ+ WI KQ W+ + G + +Q N + L+M
Sbjct: 112  YYAKTVEEVTAQLGEQQGWIKDCKQNLWVGDNGQVMDKVGEKTGEIEENQDNLAEILKMN 171

Query: 1214 LDTSTTESRPY--FKEYHVEAVIGAREGVYLLI--FSSPVPELKEMDSKMLEEKLQALFS 1047
            L + T+ K V R + L+ + +L +MDSK L+E+ QAL S
Sbjct: 172  LADAKTKLNQMSELKSKLVTENSQVRRSIELVKSKMNDFKAQLGDMDSKSLQEEYQALLS 231

Query: 1046 DKA*EIEYLQSLQLQIIKMKEISHTVKCSCGEEYKVELGLC 924
            DKA E EYL SLQLQI K+ ISH++KCSCG E+K+++ LC
Sbjct: 232  DKAGEAEYLHSLQLQIAKLMIISHSIKCSCGNEFKIDMDLC 272

>ref|XP_004250760.1| PREDICTED: uncharacterized protein LOC101255855 [Solanum lycopersicum]
          Length = 268

 Score = 92.0 bits (227), Expect = 5e-16
 Identities = 60/159 (37%), Positives = 83/159 (52%), Gaps = 12/159 (7%)
 Frame = -1

Query: 1364 YYSKGVGEITRQLNEQKEWIIADKQYTWL----------QNLAGCLPVSQGN*MKSLRMK 1215
            YY+K V E+T QL EQ+ WI KQ W+ G + +Q ++ L K
Sbjct: 112  YYAKTVEEVTAQLGEQQGWIKDCKQNLWVGDNGQVMDKVSEKTGEIEENQDKLVEILNAK 171

Query: 1214 LDTSTTESRPYFKEYHVEAVIGAREGVYLLIFSSP--VPELKEMDSKMLEEKLQALFSDK 1041
            + K V R + L+ + +L +MDSK L+E+ QAL SDK
Sbjct: 172  TKLNQMSE---LKSKLVTENSQVRRSIELVKSKTNDFKAQLGDMDSKSLQEEYQALLSDK 228

Query: 1040 A*EIEYLQSLQLQIIKMKEISHTVKCSCGEEYKVELGLC 924
            A E EYL SLQLQI K+ ISH++KCSCG E+K+++ LC
Sbjct: 229  AGEAEYLHSLQLQIAKLMIISHSIKCSCGNEFKIDMNLC 267

>ref|XP_002319864.2| hypothetical protein POPTR_0013s09040g [Populus trichocarpa]
 gi|550325325|gb|EEE95787.2| hypothetical protein POPTR_0013s09040g [Populus trichocarpa]
          Length = 258

 Score = 83.6 bits (205), Expect = 2e-13
 Identities = 57/159 (35%), Positives = 87/159 (54%), Gaps = 10/159 (6%)
 Frame = -1

Query: 1367 AYYSKGVGEITRQLNEQKEWIIADKQYTWL----------QNLAGCLPVSQGN*MKSLRM 1218
            AYYSK ++ +L +Q++W+ + + +NL L ++ ++ +M
Sbjct: 111  AYYSKVADDMNSKLQQQQDWVHTHRISGEMGEHGSGNDAEKNLIAKLGSAKSKLVEIAQM 170

Query: 1217 KLDTSTTESRPYFKEYHVEAVIGAREGVYLLIFSSPVPELKEMDSKMLEEKLQALFSDKA 1038
            K T ++ K+ + A++ F + E EMD K LEE+ +AL SD+A
Sbjct: 171  KSKLVTENNK--MKQSIEQLKCSAKD------FKT---EFLEMDIKTLEEEYKALLSDRA 219

Query: 1037 *EIEYLQSLQLQIIKMKEISHTVKCSCGEEYKVELGLCA 921
            EIEYLQSLQ QI ++K+ISH VKC+CG EYKV + LCA
Sbjct: 220  GEIEYLQSLQKQIKQLKDISHMVKCACGVEYKVAMELCA 258

>ref|XP_002521745.1| hypothetical protein RCOM_1329260 [Ricinus communis]
 gi|223538958|gb|EEF40555.1| hypothetical protein RCOM_1329260 [Ricinus communis]
          Length = 293

 Score = 82.4 bits (202), Expect = 4e-13
 Identities = 41/62 (66%), Positives = 48/62 (77%)
 Frame = -1

Query: 1106 PELKEMDSKMLEEKLQALFSDKA*EIEYLQSLQLQIIKMKEISHTVKCSCGEEYKVELGL 927
            PEL MD+ LEE+ +AL SDKA E EYLQSLQ QI K+K ISH +KC+CG EYKVE+ L
Sbjct: 232  PELLAMDTTTLEEEYKALLSDKAGEFEYLQSLQDQIDKLKGISHMIKCACGMEYKVEMDL 291

Query: 926  CA 921
            CA
Sbjct: 292  CA 293

>ref|XP_004290260.1| PREDICTED: uncharacterized protein LOC101310565 [Fragaria vesca subsp. vesca]
          Length = 285

 Score = 80.9 bits (198), Expect = 1e-12
 Identities = 67/181 (37%), Positives = 87/181 (48%), Gaps = 34/181 (18%)
 Frame = -1

Query: 1364 YYSKGVGEITRQLNEQKEWIIA-----------------DKQYTWLQNLAG-----CLPV 1251
            YY K +I +L +QK+WII D+Q Q A C+
Sbjct: 112  YYLKVSEDIAAKLQQQKDWIICHQTTTELGEPGMVKDTIDEQRVATQGKASIGDHLCI-T 170

Query: 1250 SQGN*MK--------SLRMKLDTSTTESRPYFKE-YHVEAVI---GAREGVYLLIFSSPV 1107
            +QGN + S ++KLD KE Y ++ I RE S+
Sbjct: 171  NQGNDARKNLMAMVDSAKVKLDEILQMKSELVKENYKMKQAIDQVNCRE-------SNFK 223

Query: 1106 PELKEMDSKMLEEKLQALFSDKA*EIEYLQSLQLQIIKMKEISHTVKCSCGEEYKVELGL 927
            PEL+ +D K LE++ AL SD A E EYL+SLQ QI K+K ISH +KCSCG EYKVEL
Sbjct: 224  PELRALDIKTLEDEYNALVSDNAGEAEYLKSLQDQIEKLKGISHVLKCSCGVEYKVELDS 283

Query: 926  C 924
            C
Sbjct: 284  C 284

>gb|EEE68465.1| hypothetical protein OsJ_26861 [Oryza sativa Japonica Group]
          Length = 290

 Score = 78.2 bits (191), Expect = 8e-12
 Identities = 62/175 (35%), Positives = 90/175 (51%), Gaps = 28/175 (16%)
 Frame = -1

Query: 1370 KAYYSKGVGEITRQLNEQKEWIIADK-----------QYTWLQNLAG--------CLPVS 1248
            + +Y+K + +T +L EQ+EW+ A K + QNL G C +
Sbjct: 110  RLFYTKTIESLTVKLQEQQEWLGAFKLKVITIEPSVEESQSKQNLQGQSHGILNSCGSLD 169

Query: 1247 QGN*MKS----LRMKLDTSTTESRPYFKEYHVEAVIGAREGVYLL-----IFSSPVPELK 1095
            +GN + S LR++L+ ST KE + E ++ S + L+
Sbjct: 170  KGNDIGSKQGELRIQLE-STKHKIDEIKEKQSALLTEISESKQVIEQEKNAISGFLAPLQ 228

Query: 1094 EMDSKMLEEKLQALFSDKA*EIEYLQSLQLQIIKMKEISHTVKCSCGEEYKVELG 930
            +MD K LEE+ +AL +DKA EIEY QSL+ +I +MK +S VKC CG EYKVELG
Sbjct: 229  QMDMKSLEEEHKALQADKAGEIEYFQSLEERINEMKGVSDAVKCRCGLEYKVELG 283

>gb|EEC83360.1| hypothetical protein OsI_28766 [Oryza sativa Indica Group]
          Length = 290

 Score = 78.2 bits (191), Expect = 8e-12
 Identities = 62/175 (35%), Positives = 90/175 (51%), Gaps = 28/175 (16%)
 Frame = -1

Query: 1370 KAYYSKGVGEITRQLNEQKEWIIADK-----------QYTWLQNLAG--------CLPVS 1248
            + +Y+K + +T +L EQ+EW+ A K + QNL G C +
Sbjct: 110  RLFYTKTIESLTVKLQEQQEWLGAFKLKVITIEPSVEESQSKQNLQGQSHGILNSCGSLD 169

Query: 1247 QGN*MKS----LRMKLDTSTTESRPYFKEYHVEAVIGAREGVYLL-----IFSSPVPELK 1095
            +GN + S LR++L+ ST KE + E ++ S + L+
Sbjct: 170  KGNDIGSKQGELRIQLE-STKHKIDEIKEKQSALLTEISESKQVIEQEKNAISGFLAPLQ 228

Query: 1094 EMDSKMLEEKLQALFSDKA*EIEYLQSLQLQIIKMKEISHTVKCSCGEEYKVELG 930
            +MD K LEE+ +AL +DKA EIEY QSL+ +I +MK +S VKC CG EYKVELG
Sbjct: 229  QMDMKSLEEEHKALQADKAGEIEYFQSLEERINEMKGVSDAVKCRCGLEYKVELG 283

>ref|XP_004513401.1| PREDICTED: myosin-10-like [Cicer arietinum]
          Length = 283

 Score = 75.9 bits (185), Expect = 4e-11
 Identities = 36/58 (62%), Positives = 43/58 (74%)
 Frame = -1

Query: 1106 PELKEMDSKMLEEKLQALFSDKA*EIEYLQSLQLQIIKMKEISHTVKCSCGEEYKVEL 933
            PELK D LEE+ AL SDKA E EYLQS++ Q+ K+KEI H +KC+CGEEY VEL
Sbjct: 223  PELKAADISALEEEYNALLSDKAGETEYLQSIEKQVEKLKEICHVIKCACGEEYTVEL 280

>ref|XP_006441781.1| hypothetical protein CICLE_v10023084mg [Citrus clementina]
 gi|557544043|gb|ESR55021.1| hypothetical protein CICLE_v10023084mg [Citrus clementina]
          Length = 89

 Score = 75.5 bits (184), Expect = 5e-11
 Identities = 39/69 (56%), Positives = 49/69 (71%)
 Frame = -1

Query: 1127 LIFSSPVPELKEMDSKMLEEKLQALFSDKA*EIEYLQSLQLQIIKMKEISHTVKCSCGEE 948
            L+ + PEL EMD K LEE+ L SD A E EYLQSLQ QI K++ ISH +KC+CG+E
Sbjct: 21   LVSDNNKPELMEMDIKTLEEEHGTLLSDIAGEAEYLQSLQHQIEKLEGISHVIKCACGQE 80

Query: 947  YKVELGLCA 921
            YKV++ L A
Sbjct: 81   YKVKVSLSA 89

>ref|XP_006837914.1| hypothetical protein AMTR_s03150p00001340 [Amborella trichopoda]
 gi|548840297|gb|ERN00483.1| hypothetical protein AMTR_s03150p00001340 [Amborella trichopoda]
          Length = 193

 Score = 75.1 bits (183), Expect = 6e-11
 Identities = 38/58 (65%), Positives = 46/58 (79%)
 Frame = -1

Query: 1106 PELKEMDSKMLEEKLQALFSDKA*EIEYLQSLQLQIIKMKEISHTVKCSCGEEYKVEL 933
            PELK MD LEE+ +AL SDKA EI YLQSLQ +I ++K IS +VKC+CGEEYKVE+
Sbjct: 133  PELKAMDISALEEEHKALISDKAGEISYLQSLQERIEQLKGISQSVKCACGEEYKVEI 190

>ref|XP_002265655.2| PREDICTED: uncharacterized protein LOC100248648 [Vitis vinifera]
          Length = 334

 Score = 74.7 bits (182), Expect = 8e-11
 Identities = 37/60 (61%), Positives = 44/60 (73%)
 Frame = -1

Query: 1103 ELKEMDSKMLEEKLQALFSDKA*EIEYLQSLQLQIIKMKEISHTVKCSCGEEYKVELGLC 924
            EL+ MD K +EE+ AL SDKA E EYL SLQ QI K+K +SH +KC+CG EYKV L LC
Sbjct: 274  ELRAMDMKNMEEEYNALLSDKAGEAEYLHSLQGQIEKLKGLSHKIKCACGTEYKVGLELC 333

>ref|XP_006478342.1| PREDICTED: filamin A-interacting protein 1-like [Citrus sinensis]
          Length = 283

 Score = 73.9 bits (180), Expect = 1e-10
 Identities = 39/62 (62%), Positives = 45/62 (72%)
 Frame = -1

Query: 1106 PELKEMDSKMLEEKLQALFSDKA*EIEYLQSLQLQIIKMKEISHTVKCSCGEEYKVELGL 927
            PEL EMD K LEE+ L SD A E EYLQSLQ QI K++ ISH +KC CG+EYKVE+ L
Sbjct: 222  PELMEMDIKTLEEEHGTLQSDIAGEAEYLQSLQHQIEKLEGISHVIKCVCGQEYKVEVSL 281

Query: 926  CA 921
            A
Sbjct: 282  SA 283

>gb|ESW05862.1| hypothetical protein PHAVU_011G215700g [Phaseolus vulgaris]
          Length = 284

 Score = 72.8 bits (177), Expect = 3e-10
 Identities = 58/180 (32%), Positives = 84/180 (46%), Gaps = 34/180 (18%)
 Frame = -1

Query: 1370 KAYYSKGVGEITRQLNEQKEWIIADKQY-TWLQN-------LAGCLPVSQG--------- 1242
            +AYYSK E+ +L +Q+EW+ + ++ + LQ +AG + ++G
Sbjct: 110  RAYYSKVAEEMNAKLQKQQEWVSSTRKIRSELQKHDLVTGKVAGQISKAEGETGAICNLV 169

Query: 1241 -------------N*MKSLRMKLDTSTTESRPYFKEYHVEAV----IGAREGVYLLIFSS 1113
            N + S + LD T E + + RE +
Sbjct: 170  MDNLGSVARNNLINELDSAKATLDEILTLKAKVLTENSKIKLAIEEVKCRENEFK----- 224

Query: 1112 PVPELKEMDSKMLEEKLQALFSDKA*EIEYLQSLQLQIIKMKEISHTVKCSCGEEYKVEL 933
            PELK D LEE+ +AL SDK E EYLQSL+ Q+ ++KEI H VKC+CGEEY V L
Sbjct: 225  --PELKAADLTALEEEYKALLSDKDGETEYLQSLEKQVERLKEIRHVVKCACGEEYTVAL 282

>ref|XP_006592981.1| PREDICTED: flagellar attachment zone protein 1-like [Glycine max]
          Length = 283

 Score = 70.9 bits (172), Expect = 1e-09
 Identities = 34/60 (56%), Positives = 43/60 (71%)
 Frame = -1

Query: 1106 PELKEMDSKMLEEKLQALFSDKA*EIEYLQSLQLQIIKMKEISHTVKCSCGEEYKVELGL 927
            PELK D LEE+ AL SDKA E EYLQSL+ Q+ K+++I H VKC+CGEEY V + +
Sbjct: 224  PELKAADITALEEECTALISDKAGEAEYLQSLEKQVEKLEQIRHVVKCACGEEYTVAVNM 283

>gb|EXC32476.1| hypothetical protein L484_012643 [Morus notabilis]
          Length = 281

 Score = 70.1 bits (170), Expect = 2e-09
 Identities = 35/62 (56%), Positives = 43/62 (69%)
 Frame = -1

Query: 1106 PELKEMDSKMLEEKLQALFSDKA*EIEYLQSLQLQIIKMKEISHTVKCSCGEEYKVELGL 927
            P L+ +D K LE++ L SDKA EYLQSLQ Q+ +K ISH VKC+CGEEY+V L
Sbjct: 220  PLLRAVDLKTLEKEYNTLLSDKAGVTEYLQSLQAQVDILKGISHVVKCACGEEYRVGTDL 279

Query: 926  CA 921
            CA
Sbjct: 280  CA 281

>ref|XP_006415080.1| hypothetical protein EUTSA_v10008295mg [Eutrema salsugineum]
 gi|557092851|gb|ESQ33433.1| hypothetical protein EUTSA_v10008295mg [Eutrema salsugineum]
          Length = 305

 Score = 67.4 bits (163), Expect = 1e-08
 Identities = 34/58 (58%), Positives = 41/58 (70%)
 Frame = -1

Query: 1106 PELKEMDSKMLEEKLQALFSDKA*EIEYLQSLQLQIIKMKEISHTVKCSCGEEYKVEL 933
            PEL +D K+LEE+ AL SD++ E EYLQSLQ Q K+K IS+ KC CGEEY V L
Sbjct: 247  PELMSVDIKVLEEEYTALLSDESGEAEYLQSLQSQAEKLKGISYIAKCGCGEEYSVGL 304

>gb|AAG51223.1|AC051630_20 hypothetical protein; 76532-78443 [Arabidopsis thaliana]
          Length = 254

 Score = 67.0 bits (162), Expect = 2e-08
 Identities = 55/149 (36%), Positives = 74/149 (49%), Gaps = 3/149 (2%)
 Frame = -1

Query: 1370 KAYYSKGVGEITRQLNEQKEWIIADKQYTWLQNLAGCLPVSQGN*MK---SLRMKLDTST 1200
            ++ Y K E +L EQK W I+ Q G ++ N M+ S R KLD +
Sbjct: 110  RSNYLKTAEEARTKLEEQKGWFISHMSNETGQQ--GHKKETRNNLMELSDSARAKLDQAK 167

Query: 1199 TESRPYFKEYHVEAVIGAREGVYLLIFSSPVPELKEMDSKMLEEKLQALFSDKA*EIEYL 1020
            +E + + E V I + PEL +D K+LEE+ AL SD++ E EYL
Sbjct: 168  LMRSNLLQEN--SKIKLSIENVKHKI-NEFKPELMSVDIKILEEEYTALLSDESGEAEYL 224

Query: 1019 QSLQLQIIKMKEISHTVKCSCGEEYKVEL 933
            SLQ Q K+K IS+ KC CGEEY V L
Sbjct: 225  SSLQSQAEKLKGISYIAKCGCGEEYSVGL 253

>ref|XP_003573710.1| PREDICTED: uncharacterized protein LOC100834418 [Brachypodium distachyon]
          Length = 290

 Score = 67.0 bits (162), Expect = 2e-08
 Identities = 49/175 (28%), Positives = 88/175 (50%), Gaps = 28/175 (16%)
 Frame = -1

Query: 1373 SKAYYSKGVGEITRQLNEQKEWIIADKQYTW---------------------LQNLAGCL 1257
            ++ +YSK +T +L E++EW+ + K+ + N GC+
Sbjct: 109  NRLFYSKTTEVLTSKLRERQEWLDSFKKKMVAIPLVGVSESIQNCVEGKRCEMLNSEGCI 168

Query: 1256 P--VSQGN*MKSLRMKLDTSTTESRPYFKEYHVEAVIGAREGVYLL-----IFSSPVPEL 1098
            G+ LR++L+++ ++ K + ++ + +L I +S L
Sbjct: 169  DKETDMGSKQGELRIQLESAQLKTED-IKAKRSQILLEISKSKQILEQEKNIIASFPAAL 227

Query: 1097 KEMDSKMLEEKLQALFSDKA*EIEYLQSLQLQIIKMKEISHTVKCSCGEEYKVEL 933
            +EM+ K LEE+ +AL DKA E+E+ Q+L+ + +MK +S +KC+CG EYKVEL
Sbjct: 228  QEMNMKSLEEEYKALQGDKAGEVEFFQTLEERTNEMKGVSDPIKCNCGLEYKVEL 282

>gb|EOY16977.1| Uncharacterized protein TCM_036062 [Theobroma cacao]
          Length = 283

 Score = 63.5 bits (153), Expect = 2e-07
 Identities = 31/58 (53%), Positives = 42/58 (72%)
 Frame = -1

Query: 1106 PELKEMDSKMLEEKLQALFSDKA*EIEYLQSLQLQIIKMKEISHTVKCSCGEEYKVEL 933
            PE+ EM + LEE+ + L S+K E EYL SLQ Q+ +MK IS+ +KC+CGEEY V+L
Sbjct: 224  PEVLEMSTDALEEEYKVLLSEKDGETEYLCSLQNQVERMKGISNVIKCACGEEYTVKL 281