BLASTX nr result
ID: Cephaelis21_contig00002787
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Cephaelis21_contig00002787 (2444 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value emb|CBI29694.3| unnamed protein product [Vitis vinifera] 528 e-147 ref|XP_002328787.1| predicted protein [Populus trichocarpa] gi|2... 488 e-135 ref|XP_002526435.1| conserved hypothetical protein [Ricinus comm... 480 e-133 ref|XP_003548348.1| PREDICTED: uncharacterized protein LOC100799... 411 e-112 ref|XP_003540141.1| PREDICTED: uncharacterized protein LOC100798... 410 e-112 >emb|CBI29694.3| unnamed protein product [Vitis vinifera] Length = 519 Score = 528 bits (1361), Expect = e-147 Identities = 299/543 (55%), Positives = 360/543 (66%), Gaps = 11/543 (2%) Frame = -3 Query: 2217 MKGPLDRTKIVLRHLPPSISQSSLMEQVDSRFSGRYNWFSFFPGKTSQNRQSYSRAYIDF 2038 MKGPLDRTK+V+RHLPP+IS+++ +EQ+D+ F GRY F PGK SQ RQSYSRAY+DF Sbjct: 1 MKGPLDRTKVVVRHLPPTISEAAFLEQIDTVFKGRYTLVKFRPGKNSQKRQSYSRAYLDF 60 Query: 2037 KRPDDVIEFAEFFDGHVFVNEKGTQFKTIVEYAPSQRVPKQWSKKDGREGTILKDPEYVE 1858 KRP+DVIEFAEFFDGHVFVNEKGTQFKTIVEYAPSQR+PK W KKDGREGTI KDPEY+E Sbjct: 61 KRPEDVIEFAEFFDGHVFVNEKGTQFKTIVEYAPSQRIPKHWPKKDGREGTIFKDPEYME 120 Query: 1857 FLEFLSKPVENLPSAEIQLERKEAERAGTMKDAPIVTPLMDYVRQKRAAKSGTRRSVSNG 1678 F+E L+KPVENLPSAEIQLER+EAERAG +KD PIVTPLMD+VRQKRAAK +RRS+SNG Sbjct: 121 FVELLAKPVENLPSAEIQLERREAERAGAVKDTPIVTPLMDFVRQKRAAKGVSRRSLSNG 180 Query: 1677 KSTXXXXXXXXXXXXXXXXXXXXXXXXXSTTMYVLRXXXXXXXXXXXSAYVLVPKREDQL 1498 K + STTMYVLR S ++LVPKR+DQL Sbjct: 181 KLSRRASGSSSGNPSLGSSKRGSEKRRLSTTMYVLRDTAKSTSAKDKSTFILVPKRDDQL 240 Query: 1497 LSDKSVALPATPGSDALEEESG-----DPXXXXXXXXXXXXXXIPHVSGGSLMQPSTNAP 1333 LSDKSV L A G++ALEEESG D I H L+Q + +P Sbjct: 241 LSDKSVNLAAGGGAEALEEESGVSGAVDAGKKKVLLLKGKEREISH----HLLQQNVTSP 296 Query: 1332 IKN-SPSSTMKQNQRRETGGKFIRSILL-KEKPQNHSSVVRPEQQSQTSSLEKDKRPPRP 1159 +KN ++ KQNQRRE G+ IRSILL K+ Q+ SS+ + EQQSQ S+LEK+KRPPRP Sbjct: 297 VKNILGANAPKQNQRREGSGRIIRSILLNKDARQSQSSMFQTEQQSQASNLEKEKRPPRP 356 Query: 1158 PSMHTLQKDASVVSEDKVPG-DFHGVHNEKQERRSRNKDRPDRVVWTPLRRSDG-XXXXX 985 P + K+ + +DKV G D H +EKQ++R+RNKDRPDR VWTPLRRSDG Sbjct: 357 PHIQLASKETNGAQDDKVVGNDVHSFVSEKQDKRTRNKDRPDRGVWTPLRRSDGSHASDE 416 Query: 984 XXXXXXXXXXXXXXXEGSQLEMKNDMPSARGGESKHGGSARGSYSSVDNGSYKHGGRRGS 805 EGS EM++DM +AR GE K GS RG +S++DNGS+KH GRRG Sbjct: 417 SLSSSASQPTSSDFPEGSHGEMRSDMSNARSGEVKALGSGRGGHSALDNGSHKHSGRRGP 476 Query: 804 AH-MKDSDDSSFL-EGKPLRRGGSSGFGHYEXXXXXXXXXXXXXXXXXXXXXKQVWVQKS 631 H +KD+D SS + EGK +RG + G+G +E KQVWVQKS Sbjct: 477 THSVKDADGSSIVSEGKHSKRGSAPGYGSHE---------------------KQVWVQKS 515 Query: 630 SSG 622 SSG Sbjct: 516 SSG 518 >ref|XP_002328787.1| predicted protein [Populus trichocarpa] gi|222839085|gb|EEE77436.1| predicted protein [Populus trichocarpa] Length = 527 Score = 488 bits (1256), Expect = e-135 Identities = 285/538 (52%), Positives = 339/538 (63%), Gaps = 10/538 (1%) Frame = -3 Query: 2211 GPLDRTKIVLRHLPPSISQSSLMEQVDSRFSGRYNWFSFFPGKTSQNRQSYSRAYIDFKR 2032 G D+TK+V+RHLPP ISQ +EQ+D FSGRYNW S+ PG SQ QSYSRAYIDFKR Sbjct: 5 GQSDKTKVVVRHLPPGISQPMFVEQIDVAFSGRYNWLSYRPGNNSQKHQSYSRAYIDFKR 64 Query: 2031 PDDVIEFAEFFDGHVFVNEKGTQFKTIVEYAPSQRVPKQWSKKDGREGTILKDPEYVEFL 1852 P+DVI+FAEFF+GH+FVNEKGTQFK IVEY+PSQRVPKQWSKKDGREGTI KDPEY+EFL Sbjct: 65 PEDVIDFAEFFNGHIFVNEKGTQFKAIVEYSPSQRVPKQWSKKDGREGTISKDPEYLEFL 124 Query: 1851 EFLSKPVENLPSAEIQLERKEAERAGTMKDAPIVTPLMDYVRQKRAAKSGTRRSVSNGKS 1672 E ++KPVENLPSAEIQLER+EAERAG KDAPIVTPLMD+VRQKR AK+G RR +SNGK Sbjct: 125 ELIAKPVENLPSAEIQLERREAERAGAAKDAPIVTPLMDFVRQKRVAKNGPRRILSNGKL 184 Query: 1671 TXXXXXXXXXXXXXXXXXXXXXXXXXSTTMYVLRXXXXXXXXXXXSAYVLVPKREDQLLS 1492 + STTMYVLR S YV VPKR+DQ LS Sbjct: 185 S--RRAGGSGSPSSSSLKRGSEKKRISTTMYVLRDTAKSTSGKDKSTYVHVPKRDDQQLS 242 Query: 1491 DKSVALPATPGSDALEEES-----GDPXXXXXXXXXXXXXXIPHVSGGSLMQPSTNAPIK 1327 + +V L + G+ LE+ES D I V+G Q S ++ + Sbjct: 243 N-AVTLGSGSGTAVLEDESVVSGITDSGKKKILLLKGKEKEISLVTGTMSQQQSISSSDR 301 Query: 1326 NSPSSTMKQNQRRETGGKFIRSILL-KEKPQNHSSVVRPEQQSQTSSLEKDKRPPRPPSM 1150 N SST ++QRRET G+ IRSILL K+ SS V E Q QTS+LEK+KRPPRPP Sbjct: 302 NIISSTALKSQRRETSGRMIRSILLNKDSRHIRSSGVHSEPQMQTSNLEKEKRPPRPPHA 361 Query: 1149 HTLQKDASVVSEDKVPG-DFHGVHNEKQERRSRNKDRPDRVVWTPLRRSDGXXXXXXXXX 973 KDA+ +DKV G D HG NEKQE+R+RNKDRPDR VWTPLRRSDG Sbjct: 362 QLGLKDANGTPDDKVVGNDLHGFPNEKQEKRTRNKDRPDRGVWTPLRRSDGSYASDESLL 421 Query: 972 XXXXXXXXXXXEGSQ---LEMKNDMPSARGGESKHGGSARGSYSSVDNGSYKHGGRRGSA 802 + SQ ++K D + R GE K GS RG++SS+DNGS+KH GRRG + Sbjct: 422 SSASQSTQSVFDSSQGNHGDVKVDSLNLRSGEVKVLGSGRGNHSSLDNGSHKHFGRRGPS 481 Query: 801 HMKDSDDSSFLEGKPLRRGGSSGFGHYEXXXXXXXXXXXXXXXXXXXXXKQVWVQKSS 628 H+ D S +E K +RGGSSG+G +E KQVWVQKS+ Sbjct: 482 HIVRDADGSTVEAKTPKRGGSSGYGSHE--------------VCSLDSQKQVWVQKST 525 >ref|XP_002526435.1| conserved hypothetical protein [Ricinus communis] gi|223534215|gb|EEF35930.1| conserved hypothetical protein [Ricinus communis] Length = 472 Score = 480 bits (1235), Expect = e-133 Identities = 280/538 (52%), Positives = 338/538 (62%), Gaps = 7/538 (1%) Frame = -3 Query: 2211 GPLDRTKIVLRHLPPSISQSSLMEQVDSRFSGRYNWFSFFPGKTSQNRQSYSRAYIDFKR 2032 G ++TK+V+RHLPP+ISQ S +EQ+D FSGRYNW SF PGK+SQ QSYSRAYIDFKR Sbjct: 4 GQAEKTKVVVRHLPPTISQGSFLEQIDVVFSGRYNWVSFRPGKSSQKHQSYSRAYIDFKR 63 Query: 2031 PDDVIEFAEFFDGHVFVNEKGTQFKTIVEYAPSQRVPKQWSKKDGREGTILKDPEYVEFL 1852 P+DVIEFAEFF+GH+FVNEKGTQF+ IVEYAPSQ VPKQWSKKDGREGTI+KDP Y+EFL Sbjct: 64 PEDVIEFAEFFNGHLFVNEKGTQFRAIVEYAPSQHVPKQWSKKDGREGTIVKDPAYLEFL 123 Query: 1851 EFLSKPVENLPSAEIQLERKEAERAGT-MKDAPIVTPLMDYVRQKRAAKSGTRRSVSNGK 1675 E +SKP ENLPSAEIQLER+EAERA + KDAPIVTPLMD+VRQKRAAK+G+R Sbjct: 124 ELISKPAENLPSAEIQLERREAERAASAAKDAPIVTPLMDFVRQKRAAKTGSR------- 176 Query: 1674 STXXXXXXXXXXXXXXXXXXXXXXXXXSTTMYVLRXXXXXXXXXXXSAYVLVPKREDQLL 1495 YVLR S Y+LVPKR+DQ Sbjct: 177 -------------------------------YVLRDSAKSTSGKDKSTYLLVPKRDDQQF 205 Query: 1494 SDKSVALPATPGSDALEEESGDPXXXXXXXXXXXXXXIPHVSGGSLMQPSTNAPIKNSPS 1315 SDKS + G++ LE+ES I +SGG Q + + KN S Sbjct: 206 SDKSTPFASASGTEVLEDES---------ELYHLCLLIVQLSGGMSKQNAASFD-KNVTS 255 Query: 1314 STMKQNQRRETGGKFIRSILL-KEKPQNHSSVVRPEQQSQTSSLEKDKRPPRPPSMHTLQ 1138 S +KQ+QRRE+ G+ IRSILL K+ QN SS + EQQ Q+S+LEK+KR PRP + + Sbjct: 256 SAIKQSQRRESSGRIIRSILLNKDSRQNQSSGFQSEQQIQSSNLEKEKRLPRPAHVQLVL 315 Query: 1137 KDASVVSEDKVPG-DFHGVHNEKQERRSRNKDRPDRVVWTPLRRSDGXXXXXXXXXXXXX 961 KD + S+DK G D HG EKQE+R+RNKDRPDRVVWTPLRRSDG Sbjct: 316 KDVNGSSDDKFVGNDLHGFSGEKQEKRTRNKDRPDRVVWTPLRRSDGSYASDESLSSSAS 375 Query: 960 XXXXXXXEGSQ---LEMKNDMPSARGGESKHGGSARGSYSSVDNGSYKHGGRRGSAHMKD 790 + SQ ++K D ++R G+ K GS R S+SS+DNGS+KH GRRG +H Sbjct: 376 QSTHTGQDSSQGNLGDIKVDSSNSRSGDVKTLGSGRSSHSSLDNGSHKHFGRRGPSHTVR 435 Query: 789 SDDSSFLEGKPLRR-GGSSGFGHYEXXXXXXXXXXXXXXXXXXXXXKQVWVQKSSSGS 619 D S LEGKP +R GG+SG+G +E KQVWVQKSSSGS Sbjct: 436 DADGSSLEGKPSKRGGGASGYGSHE---------------------KQVWVQKSSSGS 472 >ref|XP_003548348.1| PREDICTED: uncharacterized protein LOC100799816 [Glycine max] Length = 508 Score = 411 bits (1056), Expect = e-112 Identities = 261/543 (48%), Positives = 326/543 (60%), Gaps = 10/543 (1%) Frame = -3 Query: 2217 MKGPLDRTKIVLRHLPPSISQSSLMEQVDSRFSGRYNWFSFFPGKTSQNRQSYSRAYIDF 2038 MKG LDRTK+VLRHLPPSIS+++L+ Q+D+ F+GRYNW SF PGK SQ SYSRAYIDF Sbjct: 1 MKGALDRTKVVLRHLPPSISEAALLAQIDAAFAGRYNWLSFRPGKISQKHISYSRAYIDF 60 Query: 2037 KRPDDVIEFAEFFDGHVFVNEKGTQFKTIVEYAPSQRVPKQWSKKDGREGTILKDPEYVE 1858 KRP+DVI FAEFF+GHVFVNEKG+QFK IVEYAPSQRVP+QWSKKDGR+GTI KD EY+E Sbjct: 61 KRPEDVILFAEFFNGHVFVNEKGSQFKVIVEYAPSQRVPRQWSKKDGRDGTIYKDSEYLE 120 Query: 1857 FLEFLSKPVENLPSAEIQLERKEAERAGTMKDAPIVTPLMDYVRQKRAAKSGTRRSVSNG 1678 FLE L+KPVENLPSAEIQLE++EAER+ D PI+TPLMD+VRQKRAAK G RR +SNG Sbjct: 121 FLELLAKPVENLPSAEIQLEKREAERS----DIPIITPLMDFVRQKRAAK-GPRRLLSNG 175 Query: 1677 KSTXXXXXXXXXXXXXXXXXXXXXXXXXSTTMYVLRXXXXXXXXXXXSAYVLVPKREDQL 1498 K + S TMYV R S LVPK+ DQ Sbjct: 176 KVSQRAGTSSNGSPSSVTSRRGSGKKRVSATMYVARDPGKNSTIKDKS--TLVPKQGDQH 233 Query: 1497 LSDKSVALPATPGSDALEEE--SGDPXXXXXXXXXXXXXXIPHVSGGSLMQPSTNAPIKN 1324 LSDK+ + ++ + L+E SG+ ++ L S + + + Sbjct: 234 LSDKASNMASSDANLTLDENGVSGNHDAGKKKVLLLKGKEREIITVSDLDSMSQHHNVTS 293 Query: 1323 SP-----SSTMKQNQRRETGGKFIRSIL-LKEKPQNHSSVVRPEQQSQTSSLEKDKRPPR 1162 S S+ +KQ+QR E G+ IRSIL KE Q+ S EQQ QTS+LEK+K+PPR Sbjct: 294 SAKMIVGSTVLKQSQRHEGSGRIIRSILSKKELRQSQYSRALSEQQIQTSNLEKEKQPPR 353 Query: 1161 PPSMHTLQKDASVVSEDKVPGDFHGVHNEKQERRSRNKDRPDRVVWTPLRRSDGXXXXXX 982 P + + K ++ E+K+ V +E+QER R+KDRPDR VWT RS+G Sbjct: 354 PLHVQLILKGSNGTPENKIGVHDSHVSSERQERHVRHKDRPDRGVWT--SRSNG----AD 407 Query: 981 XXXXXXXXXXXXXXEGSQLEMKNDMPSARGGESKHGGSARGSYSSVDNGSYKHGGRRGSA 802 EGS ++K+D P+AR GE K GS R S+SS +NG KH GRRG + Sbjct: 408 DSFSSSASSQVDPLEGSHADLKHDTPNARSGEVKSLGSVRTSHSS-ENGFNKHFGRRGPS 466 Query: 801 H-MKDSDDSSF-LEGKPLRRGGSSGFGHYEXXXXXXXXXXXXXXXXXXXXXKQVWVQKSS 628 H +KD D S EGK RR +S +G E KQVWVQK+S Sbjct: 467 HGVKDVDGYSVSSEGKHPRRSSTSAYGSNE---------------------KQVWVQKAS 505 Query: 627 SGS 619 SG+ Sbjct: 506 SGT 508 >ref|XP_003540141.1| PREDICTED: uncharacterized protein LOC100798866 [Glycine max] Length = 510 Score = 410 bits (1054), Expect = e-112 Identities = 257/543 (47%), Positives = 327/543 (60%), Gaps = 10/543 (1%) Frame = -3 Query: 2217 MKGPLDRTKIVLRHLPPSISQSSLMEQVDSRFSGRYNWFSFFPGKTSQNRQSYSRAYIDF 2038 MKG LDRTK+VLRHLPPSIS+++L+ Q+D+ F+GRYNW SF PGK SQ S+SRAYIDF Sbjct: 1 MKGALDRTKVVLRHLPPSISEAALLSQIDAAFAGRYNWLSFRPGKISQKHMSFSRAYIDF 60 Query: 2037 KRPDDVIEFAEFFDGHVFVNEKGTQFKTIVEYAPSQRVPKQWSKKDGREGTILKDPEYVE 1858 KRP+DVI FAEFF+GHVFVN KG+QFK IVEYAPSQRVP+QWSKKD R+GTI KD EY+E Sbjct: 61 KRPEDVILFAEFFNGHVFVNVKGSQFKVIVEYAPSQRVPRQWSKKDLRDGTIYKDSEYLE 120 Query: 1857 FLEFLSKPVENLPSAEIQLERKEAERAGTMKDAPIVTPLMDYVRQKRAAKSGTRRSVSNG 1678 FLE L+KPVENLPSAEIQLE++EAER+ D PI+TPLMD+VRQKRAAK G RR +SNG Sbjct: 121 FLELLAKPVENLPSAEIQLEKREAERS----DIPIITPLMDFVRQKRAAK-GPRRPLSNG 175 Query: 1677 KSTXXXXXXXXXXXXXXXXXXXXXXXXXSTTMYVLRXXXXXXXXXXXSAYVLVPKREDQL 1498 K + S TMYV R S+Y LVPK++DQ Sbjct: 176 KVSRRAGTSSNGGPSSATSRRGSGKKRVSATMYVARDPGKSSTIKDKSSYTLVPKQDDQH 235 Query: 1497 LSDKSVALPATPGSDALEEE--SGDPXXXXXXXXXXXXXXIPHVSGGSLMQPSTNAPIKN 1324 L +K+ + ++ G+ L+E SG+ ++ L S + + + Sbjct: 236 LPNKASNMASSDGNQTLDENGVSGNHDAGKKKVLLLKGKEREIITVSDLDSMSQHHNVTS 295 Query: 1323 SP-----SSTMKQNQRRETGGKFIRSIL-LKEKPQNHSSVVRPEQQSQTSSLEKDKRPPR 1162 S S+ +KQ+QR E G+ IRSIL KE Q+ SS EQ+ TS+LEK+K+PPR Sbjct: 296 SAKTVVGSTVLKQSQRHEGSGRIIRSILSKKELHQSQSSRALSEQKILTSNLEKEKQPPR 355 Query: 1161 PPSMHTLQKDASVVSEDKVPGDFHGVHNEKQERRSRNKDRPDRVVWTPLRRSDGXXXXXX 982 P + + K ++ E+K+ V +E+QER R+KDRPDR VWT R +G Sbjct: 356 PLHVQLILKGSNGTPENKIGVHDSHVSSERQERHVRHKDRPDRGVWT--SRFNG----AD 409 Query: 981 XXXXXXXXXXXXXXEGSQLEMKNDMPSARGGESKHGGSARGSYSSVDNGSYKHGGRRGSA 802 EGSQ ++K+DMP+AR E K GS R S+SS +NG KH GRRG + Sbjct: 410 VSFSSPASSQVDPLEGSQADLKHDMPNARSVEVKSFGSVRTSHSS-ENGFNKHFGRRGPS 468 Query: 801 H-MKDSDDSSF-LEGKPLRRGGSSGFGHYEXXXXXXXXXXXXXXXXXXXXXKQVWVQKSS 628 + +KD D S EGK RR +S +G E KQVWVQK+S Sbjct: 469 YGVKDVDGYSVSSEGKHPRRSSTSAYGSNE---------------------KQVWVQKAS 507 Query: 627 SGS 619 SGS Sbjct: 508 SGS 510