BLASTX nr result
ID: Cephaelis21_contig00037651
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00037651
         (2189 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI15020.3| unnamed protein product [Vitis vinifera]           152   4e-34
ref|XP_002511393.1| hypothetical protein RCOM_1510520 [Ricinus c...  129   3e-27
ref|XP_004152555.1| PREDICTED: uncharacterized protein LOC101223...   90   2e-15
ref|XP_003544001.1| PREDICTED: uncharacterized protein LOC100820...   72   6e-10
ref|XP_002318649.1| predicted protein [Populus trichocarpa] gi|2...   67   3e-08

>emb|CBI15020.3| unnamed protein product [Vitis vinifera]
          Length = 1185

 Score = 152 bits (384), Expect = 4e-34
 Identities = 193/725 (26%), Positives = 266/725 (36%), Gaps = 161/725 (22%)
 Frame = -2

Query: 2113 CTDVKNSRKLKSSTTEAKVELLIPPKPIENDFL-IGENDWKVDGKSGKGFESRERRRSKY 1937
            C + ++ KL SS+ EA + P +EN + + D KS KG ESRER++SKY
Sbjct: 473  CHEAEDVGKLASSSEEAGLSR--SPTTMENKASNVRDGDSGTGTKSEKGVESRERKKSKY 530

Query: 1936 LSFPFIGPKKG-----LENASTSGENVPGQGHGRVDVNSKSSQ---------CNGXXXXX 1799
            LS P+I G LE++ T VP V +N S Q C+G
Sbjct: 531  LSPPYINLNWGRKGPVLEDSETEDPKVPKVSCAGVGMNEASEQLGAPPPIVKCSG-KAQK 589

Query: 1798 XXXXXXSCCFDMFGKGKDIRSSSAEILTELHLAALDCLHTKGWKHSGSVGSFLYGFR--- 1628
            + G I +SSA +L+EL AALDCL+ K+ S+ F + FR
Sbjct: 590  KRSRKSVSEGNTSGDVDSINASSAVMLSELRFAALDCLYPSERKNFVSIERFFHRFRCSM 649

Query: 1627 -----RFAFINSEIAGE--------------------------------------HIGRP 1577
            + + I+GE H+
Sbjct: 650  YSEASQCKMYENNISGEKEALAAEPSSLEKGPLEIKLPIKPEPKKRKKKEKVTLKHLAEL 709

Query: 1576 KEGTTMSSNYGLEES-----------NGLDAVRHTL---SCGMSKKRQKSKEDKTSAGT- 1442
            G +S ++ S G++ H + SC SK + K K GT
Sbjct: 710  TAGIPDASGNHVKSSLLGKDSAGDELRGVNGHSHNMQEMSCQSSKGKPGRKMMKKKEGTN 769

Query: 1441 -----------VFSCVVENATNGSEVTKCQEAGSDTPKNEKVPRKRKKAGVNLGMPEKKP 1295
            + V T+ S + E P + P KRKK G K
Sbjct: 770  SKRSKTKPTPGLLDVNVGIVTSSSLINDSGEVKPLAPNGKPEPNKRKKEGATSERLHMKF 829

Query: 1294 APGLPDLNGNIPASSDNQVTESTAFCHVLEFGTESSHTQTAGLLDPGRNNIKFVQLLKDV 1115
            G+PDLN N P S + E + LD NN K +KD+
Sbjct: 830  TAGIPDLNRNSPVPSPS---------------VEDLQVMSTVALDVNGNNAKPSPSMKDL 874

Query: 1114 QSMGPNFLHSMSQQSQHNYMMGETPFTLECKERQSEANLEFKERQLEANGNNTLSGSLMK 935
            MG LHS+ + N G + + +N EF E NGN +++
Sbjct: 875  PGMG---LHSLGVIPELNGREG---------KEGASSNGEFTVSLPEVNGNIAKFSLMVE 922

Query: 934  NMQSISF---STKCEPKKSKRKVK-MSNPTDTQVASSIPDLNGNVVDCVSSGNKTPDITS 767
            + Q S K +P+K KRK K M + A+SIPDLNGN + S+ +I
Sbjct: 923  DSQVTSLLAPGGKLKPRKRKRKEKAMMECPEINCAASIPDLNGNSAEPSSTEKHLLEINC 982

Query: 766  ASSKGKVPKKRSRSK---STAIVS---DMNANHNKANNTVRVSALAST------------ 641
            SSK K +K+ R K IV D+N N+NK NT T
Sbjct: 983  LSSKVKPERKKRRRKGEVGNKIVGGMLDINMNYNKVANTAEALGTTLTLTFAQGSPMPSK 1042

Query: 640  -RLVQPLLPMDGLKT------------------------------------PAMPPLPGS 572
            LV+ LK PA+P S
Sbjct: 1043 EALVEAFFKFGPLKESETEVLKDSPGAQVVFIRYSDAREAFQSLEKCSPFGPALPAALAS 1102

Query: 571  TPQN---------------GDPPDLLFMKRNLEMMTSTLEKAGSSLSPEMRTKLESEIKG 437
            P G+ L F+++NLEMMTS LEK+G +LSPEMR KLE EIKG
Sbjct: 1103 QPVESLKTPARSSGSKPPIGEARPLFFIRQNLEMMTSMLEKSGDNLSPEMRAKLEGEIKG 1162

Query: 436  FMKKI 422
            +KK+
Sbjct: 1163 LLKKV 1167

>ref|XP_002511393.1| hypothetical protein RCOM_1510520 [Ricinus communis]
 gi|223550508|gb|EEF51995.1| hypothetical protein RCOM_1510520 [Ricinus communis]
          Length = 1097

 Score = 129 bits (324), Expect = 3e-27
 Identities = 159/634 (25%), Positives = 262/634 (41%), Gaps = 69/634 (10%)
 Frame = -2

Query: 2119 NYCTDVKNSRKLKSSTTEAKVE---LLIPPKPIENDFLIGENDWKVDGKSGKGFESRERR 1949
            N D+ ++ + + A+VE + +P P + + I + ++ KG + RER+
Sbjct: 439  NVLNDLASNSRKRKRKKYAEVEGYDVSLPDSPPQVEASIFGSATMIE----KGSDLRERK 494

Query: 1948 RSKYLSFPFIGPK-KGL---------ENASTSGENVPGQGHGRVDVNSKSSQCNGXXXXX 1799
            +SKYLS+P++ + KGL + S E+ H + +S S +G
Sbjct: 495  KSKYLSYPYVNLEHKGLPSEIEDPKSQKVSQGAEHEKAVSHQFIGSHSVSKS-SGKRFQK 553

Query: 1798 XXXXXXSCCFDMFGKGKDIRSSSAEILTELHLAALDCLHTKGWKHSGSVGSFLYGFRRFA 1619
            D I +S A++L+EL L A+DCL++ K+ + F FR A
Sbjct: 554  KWFRKFIHNNDASNNPDLINASVADLLSELCLTAMDCLYSNESKNFDLIEWFFARFRISA 613

Query: 1618 FINSEIAGEHIGRPKEGTTMSSNYGLEESNGLDAVRHTLSCGMSKKRQKSKEDKTSAGTV 1439
            F + I H + SSN L+ + L+ + L +K QK K++ SA T
Sbjct: 614  FHDESIYEMHC----KNMIGSSNEALQGKDTLEPTQTLLDVKAEQKMQKKKKNGNSAPTK 669

Query: 1438 FSCV-------VENATNGSEVTKCQEAGSDTPKNEKVPRKRKKAGVNLGMPEKKPAPGLP 1280
            + + A +G+ V + G TP P+K+KK + GLP
Sbjct: 670  IKSLRGLSDVNINIAADGTLVKDFCDMGPPTPNGRPGPKKKKKK-------QGTSPAGLP 722

Query: 1279 DLNGNIPASSDNQVTESTAFCHVLEFGTESSHTQTAGLLDPGRNNIKFVQLLKDVQSMGP 1100
            DLN + A+S V + HV + + AG + ++ + LL D+Q GP
Sbjct: 723  DLNSS-GATSSLLVESFESVSHVEH--EPNQREKKAGSENVNLSDAEPGSLLLDLQVTGP 779

Query: 1099 NFLHSMSQQSQHNYMMGETPFT--------LECKERQSEANLEF--------KERQLEAN 968
            ++++ ++ P + L KE S + L ++R+ ++
Sbjct: 780  FSVNTIPKEIMGEGSAPSIPTSDGNCAIPGLLAKEPPSISPLSAEGLPEPKKRKRKDKST 839

Query: 967  GNNT----LSGSLMKNMQSISFSTKCEPKKSKRKVKMSNPTDTQVASSIPDLN--GNVVD 806
            T + L + S K E K++++K + A +PD+N N++D
Sbjct: 840  AEQTTVAAIEAGLEGTLAESSMLVKPEKKRARKKEVKPRRPRRKSAVRLPDININYNIMD 899

Query: 805  CVSSGNKTPDITSASSKGKVPKKRSRSKSTAIVSDMNANH---NKANNTVRVSALAST-- 641
            G T I + + +P K + + + K +NT +V L ST
Sbjct: 900  TNGEGLGTALILTFAQGVSLPSKEVLVATFCRFGPLKESEIHLMKDSNTAQVVFLKSTDA 959

Query: 640  ----------------------RLVQPLLPMDGLKTPAMPPLPGSTPQNGDPPDLLFMKR 527
            L+ +G PAM GS P + P + F+++
Sbjct: 960  AEAARSLENCSPFGATLVNYRLHLLSAAGSKEGTTAPAMSY--GSMPSPAEAPPIDFIRQ 1017

Query: 526  NLEMMTSTLEKAGSSLSPEMRTKLESEIKGFMKK 425
            NLEMMTS LEKAG +LSPEMR KLE+EIKG +KK
Sbjct: 1018 NLEMMTSMLEKAGDNLSPEMRAKLETEIKGLLKK 1051

>ref|XP_004152555.1| PREDICTED: uncharacterized protein LOC101223078 [Cucumis sativus]
          Length = 723

 Score = 90.1 bits (222), Expect = 2e-15
 Identities = 132/528 (25%), Positives = 218/528 (41%), Gaps = 81/528 (15%)
 Frame = -2

Query: 1762 FGKGKDIRSSS-AEILTELHLAALDCLHTKGWKHSGSVGSFLYGFRRFAFINSEIAGEH- 1589
            F +D+ S S AE L+ELH A+DCL+ + G+V F FR F+ +++ +
Sbjct: 202  FVDNQDLMSGSPAEFLSELHFTAVDCLYPNVNNNFGTVAQFFSIFRILMFLGEKVSEDKQ 261

Query: 1588 ---------IGRPKEGTTMSSNYGLEESNG--------LDAVRHTLSCGMSKKRQ----- 1475
            G K SS +EE L G ++K+
Sbjct: 262  QQQPSSAAKSGIRKRKGQSSSIKKMEEMKSKPVSGDVDLTGNAEISPAGDAQKKTPSTSK 321

Query: 1474 -KSKEDKTSAG-------TVFSCVVENATNGSEVTK-CQEAGSDTPKNEKVPRKRKKAGV 1322
            KSK+DK S G + S V ++ S + K EAG +P RKR+ GV
Sbjct: 322  VKSKKDKESLGRLKTKSLSALSDVNITLSSCSLLAKDSPEAGPLSPNGLPKRRKRRNNGV 381

Query: 1321 NLGMPEKKPAPGLPDLNGNIPASSDNQVTESTAFCHVL-EFGTESSHTQTAGLL-DPGRN 1148
            + P+ KP +PDLNG+ A + V + A HV + E + G+ + +
Sbjct: 382  H---PQSKPTTEIPDLNGS-GAVAGLLVEDQQAVSHVAAQLKREPKRRRKRGVSKENSKA 437

Query: 1147 NIKFVQL-LKDVQSMG-PNFLHSMSQQSQHNYMMGETPFTLECKERQSEANLEFKERQLE 974
            + +F+ + + D G PN QS ++ +G+ K+R+ + +
Sbjct: 438  STEFINVNVNDSNKPGAPN-------QSVNDQTIGQDQSKSGGKKRKRKEKPPLADPDAV 490

Query: 973  ---ANGNNTLSGSLMKNMQSISFSTKCEPKKSKRK-----VKMSNPTDTQ------VASS 836
            +NG T + + + + + +PK+ +R+ + NP+D++ V +
Sbjct: 491  LSYSNGVGTDTSQGKDSQLTNNLPPQPKPKRRRRRKGQASLNHPNPSDSRSYIYNRVETD 550

Query: 835  IPDLNGNVVDCVSSGNKTPD----ITSASSKGKVPKKRSRSKSTAI-------------V 707
            L ++ SS P IT+ S G + + + K + + V
Sbjct: 551  GEGLGSLLLLTFSSEAPLPPREQVITTFSQFGSLKESEIQLKDSTVEIVFLRSADAMEAV 610

Query: 706  SDMNAN-------------HNKANNTVRVSALASTRLVQPLLPMDGLKTPAMPPLPGSTP 566
            + N H A S A T L P +G P+ G+
Sbjct: 611  RSLKKNNIFGPTLLKYQLYHLSAPPKTSDSDRACTALAYPA--SEGTLNPSKSAESGN-- 666

Query: 565  QNGDPPDLLFMKRNLEMMTSTLEKAGSSLSPEMRTKLESEIKGFMKKI 422
            Q GD P + F+++NL+MMTS LEK+G +LSP+MR KLE +I+G +KK+
Sbjct: 667  QAGDAPPIEFIRKNLQMMTSMLEKSGDNLSPDMRAKLECDIEGLLKKV 714

>ref|XP_003544001.1| PREDICTED: uncharacterized protein LOC100820046 [Glycine max]
          Length = 935

 Score = 72.0 bits (175), Expect = 6e-10
 Identities = 38/73 (52%), Positives = 48/73 (65%), Gaps = 3/73 (4%)
 Frame = -2

Query: 634  VQPLLPMDGLKTPAMPPLP--GSTPQNGD-PPDLLFMKRNLEMMTSTLEKAGSSLSPEMR 464
            V P P + P + P P GS G+ PP L F+K+NL+MMTSTLE +GSSLSP MR
Sbjct: 682  VMPTQPTGSMAVPGVTPTPPTGSMAMPGETPPSLQFIKQNLQMMTSTLENSGSSLSPRMR 741

Query: 463  TKLESEIKGFMKK 425
            KL+SEIK ++K
Sbjct: 742  AKLDSEIKNLLRK 754

>ref|XP_002318649.1| predicted protein [Populus trichocarpa]
 gi|222859322|gb|EEE96869.1| predicted protein [Populus trichocarpa]
          Length = 171

 Score = 66.6 bits (161), Expect = 3e-08
 Identities = 31/52 (59%), Positives = 41/52 (78%)
 Frame = -2

Query: 577  GSTPQNGDPPDLLFMKRNLEMMTSTLEKAGSSLSPEMRTKLESEIKGFMKKI 422
            GS P+ + P + F+++NLEMMTS LEK+G +LSPEMR KLE EIKG +KK+
Sbjct: 112  GSMPKLAEAPPIDFIRQNLEMMTSMLEKSGDNLSPEMRAKLEIEIKGLLKKV 163