BLASTX nr result
ID: Cephaelis21_contig00003689
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00003689
         (2229 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_002327534.1| predicted protein [Populus trichocarpa] gi|2...   311   6e-82
ref|XP_002279868.1| PREDICTED: uncharacterized protein LOC100263...   303   1e-79
ref|NP_001118622.1| RNA recognition motif-containing protein [Ar...   289   2e-75
ref|XP_002884983.1| hypothetical protein ARALYDRAFT_478769 [Arab...   286   1e-74
ref|XP_004141677.1| PREDICTED: U2 small nuclear ribonucleoprotei...   280   1e-72

>ref|XP_002327534.1| predicted protein [Populus trichocarpa]
 gi|222836088|gb|EEE74509.1| predicted protein [Populus trichocarpa]
          Length = 281

 Score = 311 bits (796), Expect = 6e-82
 Identities = 162/250 (64%), Positives = 183/250 (73%), Gaps = 3/250 (1%)
 Frame = +3

Query: 1488 HPPPYDPYYLAQPIHENYIYSTIPASVGTSDRNGGVNTLFVSGLPDDVKAREIHNLFRRR 1667
            H PYDPYY P A+ + N G+NTLFVSGLPDDVKAREIHN+FRRR
Sbjct: 5    HHQPYDPYYQLPPA----------AAAPGGEWNSGINTLFVSGLPDDVKAREIHNIFRRR 54

Query: 1668 PGFESCQLKYTGRGNQVVAFATFVNHQSAVAALHSLNGVKFDPQTGSNLHIELARSNSRR 1847
            PGF+SCQLKYTGRGNQVVAFATF NHQSA+AALHSLNGVKFDPQ+GS LHIELARSNSRR
Sbjct: 55   PGFDSCQLKYTGRGNQVVAFATFFNHQSAIAALHSLNGVKFDPQSGSTLHIELARSNSRR 114

Query: 1848 KNKPGSGPYVVIDNRTKSSTDAHETSSDD--GEXXXXXXXXXXXXGEKDDSVVEKSEATA 2021
            K KPGSG YVVID RTK +DAHETSSDD + + DS KSEA +
Sbjct: 115  KRKPGSGAYVVIDKRTKKPSDAHETSSDDVESDPEEDPEMNNVDTAYQGDSENAKSEAAS 174

Query: 2022 DAENSVAPESEQTEKVVDGS-QACSTLFIANLGPLCTEDELKQALSQCPGFNSLKLRARG 2198
            D +N+ +E E+ +G + CSTLFIANLGP CTEDELKQ LSQ PGF+ LK+RA+G
Sbjct: 175  DPDNAAVAVNEIGERTAEGGVRPCSTLFIANLGPNCTEDELKQVLSQYPGFHVLKIRAKG 234

Query: 2199 GMPVAFADFQ 2228
            GMPVAFADF+
Sbjct: 235  GMPVAFADFE 244

>ref|XP_002279868.1| PREDICTED: uncharacterized protein LOC100263499 [Vitis vinifera]
 gi|297743102|emb|CBI35969.3| unnamed protein product [Vitis vinifera]
          Length = 272

 Score = 303 bits (776), Expect = 1e-79
 Identities = 165/247 (66%), Positives = 181/247 (73%), Gaps = 4/247 (1%)
 Frame = +3

Query: 1500 YDPYYLAQPIHENYIYSTIPASVGTSDRNGGVNTLFVSGLPDDVKAREIHNLFRRRPGFE 1679
            YDPYY H DR+G +NTLFVSGLPDDVK REIHNLFRRRPGF+
Sbjct: 6    YDPYY-----HH-----------AQGDRSG-INTLFVSGLPDDVKPREIHNLFRRRPGFD 48

Query: 1680 SCQLKYTGRGNQVVAFATFVNHQSAVAALHSLNGVKFDPQTGSNLHIELARSNSRRKNKP 1859
            SCQLKYTGRGNQVVAFATF NHQ+AVAALH+LNGVKFDPQTGS LHIELARSNSRRK P
Sbjct: 49   SCQLKYTGRGNQVVAFATFFNHQTAVAALHALNGVKFDPQTGSILHIELARSNSRRKRVP 108

Query: 1860 GSGPYVVIDNRTKSSTDAHETSSDDG--EXXXXXXXXXXXXGEKDDSVVEKS-EATADAE 2030
            GSG YVVID R+K+ST+AHETSSDDG E G DD V KS E D +
Sbjct: 109  GSGAYVVIDKRSKTSTNAHETSSDDGDSESDEPAKTSNPDSGNNDDLVTAKSGEMAVDPD 168

Query: 2031 NSVAPESEQTEKVVD-GSQACSTLFIANLGPLCTEDELKQALSQCPGFNSLKLRARGGMP 2207
            +++ +EQ EK D G CSTLFIANLGP CTEDELKQ LSQ PGFN LK+RA+GGMP
Sbjct: 169  STLTAVNEQPEKTTDAGLPPCSTLFIANLGPTCTEDELKQVLSQYPGFNVLKMRAKGGMP 228

Query: 2208 VAFADFQ 2228
            VAFADF+
Sbjct: 229  VAFADFE 235

>ref|NP_001118622.1| RNA recognition motif-containing protein [Arabidopsis thaliana]
 gi|26449895|dbj|BAC42069.1| unknown protein [Arabidopsis thaliana]
 gi|332641880|gb|AEE75401.1| RNA recognition motif-containing protein [Arabidopsis thaliana]
          Length = 287

 Score = 289 bits (740), Expect = 2e-75
 Identities = 149/253 (58%), Positives = 185/253 (73%), Gaps = 4/253 (1%)
 Frame = +3

Query: 1482 MAHPPPYDPYYLAQ-PIHENYIYSTIPASVGTSDRNGGVNTLFVSGLPDDVKAREIHNLF 1658
            MA+ PYDP+Y+ Q H +++ +P +D G +NTLFVSGLP+DVKAREIHNLF
Sbjct: 1    MAYHQPYDPFYVYQLHSHPHHLPPQLPL---LADEPGAINTLFVSGLPNDVKAREIHNLF 57

Query: 1659 RRRPGFESCQLKYTGRGNQVVAFATFVNHQSAVAALHSLNGVKFDPQTGSNLHIELARSN 1838
            RRR GFESCQLKYTGRG+QVVAFATF +H+ A+AA++ LNGVKFDPQTGSNLHIELARSN
Sbjct: 58   RRRHGFESCQLKYTGRGDQVVAFATFTSHRFALAAMNELNGVKFDPQTGSNLHIELARSN 117

Query: 1839 SRRKNKPGSGPYVVIDNRTKSSTDAHETSSDDGEXXXXXXXXXXXXGEKDDSVVEKSEAT 2018
            SRRK +PGSGPYVVIDNR K + + + SD+G+ ++ KSEA
Sbjct: 118  SRRKERPGSGPYVVIDNRNKEISKSQDDQSDEGDSDPDEVQEPGNSDSPKENDTTKSEAD 177

Query: 2019 ADAENSVAPESEQTEKVVD---GSQACSTLFIANLGPLCTEDELKQALSQCPGFNSLKLR 2189
            ++ ++ + EK + G++ACSTLFIANLGP CTEDELKQ LS+ PGF+ LK+R
Sbjct: 178  SEPDSKAPSANGHLEKASEGGSGARACSTLFIANLGPNCTEDELKQLLSRYPGFHILKIR 237

Query: 2190 ARGGMPVAFADFQ 2228
            ARGGMPVAFADF+
Sbjct: 238  ARGGMPVAFADFE 250

>ref|XP_002884983.1| hypothetical protein ARALYDRAFT_478769 [Arabidopsis lyrata subsp. lyrata]
 gi|297330823|gb|EFH61242.1| hypothetical protein ARALYDRAFT_478769 [Arabidopsis lyrata subsp. lyrata]
          Length = 287

 Score = 286 bits (733), Expect = 1e-74
 Identities = 148/253 (58%), Positives = 184/253 (72%), Gaps = 4/253 (1%)
 Frame = +3

Query: 1482 MAHPPPYDPYYLAQP-IHENYIYSTIPASVGTSDRNGGVNTLFVSGLPDDVKAREIHNLF 1658
            MA+ PYDP+Y+ QP H +++ +P +D G +NTLFVSGLP+DVKAREIHNLF
Sbjct: 1    MAYHQPYDPFYVYQPHYHPHHLPPQLPQF---ADEPGAINTLFVSGLPNDVKAREIHNLF 57

Query: 1659 RRRPGFESCQLKYTGRGNQVVAFATFVNHQSAVAALHSLNGVKFDPQTGSNLHIELARSN 1838
            RRR GFESCQLKYTGRG+QVVAFATF +H+ A+AA++ LNGVKFDPQTGS LHIELARSN
Sbjct: 58   RRRYGFESCQLKYTGRGDQVVAFATFTSHRFAMAAMNELNGVKFDPQTGSTLHIELARSN 117

Query: 1839 SRRKNKPGSGPYVVIDNRTKSSTDAHETSSDDGEXXXXXXXXXXXXGEKDDSVVEKSEAT 2018
            SRRK +PGSGPYVVIDNR K + + + SD+G+ ++ KSEA
Sbjct: 118  SRRKERPGSGPYVVIDNRNKELSKSQDDQSDEGDSDPDEVQEPRNSESPKENDNAKSEAD 177

Query: 2019 ADAENSVAPESEQTEKVVD---GSQACSTLFIANLGPLCTEDELKQALSQCPGFNSLKLR 2189
            ++ ++ + EK + G++ACSTLFIANLGP CTEDEL+Q LS+ GFN LK+R
Sbjct: 178  SEPDSKAPSANGHLEKAYEGGSGARACSTLFIANLGPNCTEDELRQLLSRYSGFNILKIR 237

Query: 2190 ARGGMPVAFADFQ 2228
            ARGGMPVAFADF+
Sbjct: 238  ARGGMPVAFADFE 250

>ref|XP_004141677.1| PREDICTED: U2 small nuclear ribonucleoprotein B''-like [Cucumis sativus]
          Length = 262

 Score = 280 bits (716), Expect = 1e-72
 Identities = 153/249 (61%), Positives = 173/249 (69%)
 Frame = +3

Query: 1482 MAHPPPYDPYYLAQPIHENYIYSTIPASVGTSDRNGGVNTLFVSGLPDDVKAREIHNLFR 1661
            MAH P YDPYYL Y S+R+ +NTLF+SGLPDDVKAREIHNLFR
Sbjct: 1    MAHLP-YDPYYLYNQPDPTY-----------SERSN-INTLFISGLPDDVKAREIHNLFR 47

Query: 1662 RRPGFESCQLKYTGRGNQVVAFATFVNHQSAVAALHSLNGVKFDPQTGSNLHIELARSNS 1841
            RRPGF+SCQLKYTGRGNQVVAFATF NHQSAV ALH+LNGVKFDPQ+GS LHIELARSNS
Sbjct: 48   RRPGFDSCQLKYTGRGNQVVAFATFYNHQSAVTALHALNGVKFDPQSGSVLHIELARSNS 107

Query: 1842 RRKNKPGSGPYVVIDNRTKSSTDAHETSSDDGEXXXXXXXXXXXXGEKDDSVVEKSEATA 2021
            RRK+KPG G YVVID R K+ ++ ETSSDDG E + +EA
Sbjct: 108  RRKHKPGGGAYVVIDKRKKTDANSQETSSDDG---------GSEPDEPSKKAQQSNEAVV 158

Query: 2022 DAENSVAPESEQTEKVVDGSQACSTLFIANLGPLCTEDELKQALSQCPGFNSLKLRARGG 2201
            N+++ E EK G CSTLFIANLGP C EDELK+ L + PGFN LKLRA+ G
Sbjct: 159  TPANAISAPYEHHEKNDGG--PCSTLFIANLGPNCNEDELKEVLCKYPGFNVLKLRAKSG 216

Query: 2202 MPVAFADFQ 2228
            MPVAFADF+
Sbjct: 217  MPVAFADFE 225