BLASTX nr result
ID: Stemona21_contig00028653
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Stemona21_contig00028653 (462 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002329042.1| predicted protein [Populus trichocarpa] 74 2e-11 gb|EOX92171.1| Gag-pol polyprotein-like protein [Theobroma cacao] 65 7e-09 ref|XP_006402169.1| hypothetical protein EUTSA_v10015409mg, part... 65 1e-08 gb|EOY10285.1| Uncharacterized protein TCM_025656 [Theobroma cacao] 64 2e-08 gb|EMJ11865.1| hypothetical protein PRUPE_ppa022462mg [Prunus pe... 64 3e-08 gb|EOX96724.1| Gag-pol polyprotein, putative [Theobroma cacao] 63 4e-08 ref|XP_006410106.1| hypothetical protein EUTSA_v10017580mg, part... 61 1e-07 ref|XP_006392773.1| hypothetical protein EUTSA_v10012229mg [Eutr... 59 9e-07 gb|EOY18660.1| Uncharacterized protein TCM_043155 [Theobroma cacao] 57 2e-06 ref|XP_006293129.1| hypothetical protein CARUB_v10019435mg [Caps... 57 2e-06 gb|EOY19305.1| Uncharacterized protein TCM_044370 [Theobroma cacao] 55 1e-05 gb|AAM15062.1| putative retroelement integrase [Arabidopsis thal... 55 1e-05 >ref|XP_002329042.1| predicted protein [Populus trichocarpa] Length = 442 Score = 73.9 bits (180), Expect = 2e-11 Identities = 44/134 (32%), Positives = 65/134 (48%) Frame = +3 Query: 60 TRRSLAPATTTATTCSFDAKTTKATPSGGTNLRCFSCGEIGHTRNECKKATSRSGKQLLI 239 TR ++ P+ T T T TP+ T RCF+CGE+GH ECKK R + Sbjct: 295 TRTAIPPSPITKPTMP----THVTTPN--TGFRCFNCGELGHRFAECKKGQRRGLFSDVE 348 Query: 240 EXXXXXXXXXXXXXXXXXXCLEDESDDRIFGDQEEALVAKKTLLAPRNEDEDWRRTNIFH 419 E + ++R+ GD L+ +++ LAP ++DW RTN+F Sbjct: 349 EINREQEGDVEAEPVY-------DEEERLEGDAGPMLMIRRSCLAPHVVEDDWLRTNVFQ 401 Query: 420 TSCTIHGKVCHIIV 461 ++CTI GK+C IV Sbjct: 402 STCTISGKICRFIV 415 >gb|EOX92171.1| Gag-pol polyprotein-like protein [Theobroma cacao] Length = 399 Score = 65.5 bits (158), Expect = 7e-09 Identities = 37/129 (28%), Positives = 57/129 (44%), Gaps = 1/129 (0%) Frame = +3 Query: 78 PATTTATTCSFDAKTTKATPSGGTNLRCFSCGEIGHTRNECKKATSRSGKQLLIEXXXXX 257 P ++ T S + K T T + N +CF C GH ++C + +I Sbjct: 247 PKVNSSKTASSNDKKTTFTRASNVNKKCFKCQGFGHIASDCSN-------RRIISLVEEE 299 Query: 258 XXXXXXXXXXXXXCLEDESDDRIFGDQEEALVAKKTL-LAPRNEDEDWRRTNIFHTSCTI 434 +DE + + D EAL+ ++ L A +DE W R NIF+T CT Sbjct: 300 DYANWEKLKPVYDEYDDEEIEEVSADHGEALIVRRNLNTAMMTKDESWFRHNIFYTRCTS 359 Query: 435 HGKVCHIIV 461 GKVC++I+ Sbjct: 360 QGKVCNVII 368 >ref|XP_006402169.1| hypothetical protein EUTSA_v10015409mg, partial [Eutrema salsugineum] gi|557103259|gb|ESQ43622.1| hypothetical protein EUTSA_v10015409mg, partial [Eutrema salsugineum] Length = 367 Score = 64.7 bits (156), Expect = 1e-08 Identities = 43/136 (31%), Positives = 58/136 (42%), Gaps = 7/136 (5%) Frame = +3 Query: 75 APATTTATTCSFDAKTTKATPSGGTNLRCFSCGEIGHTRNECKKATSRS--GKQLLIEXX 248 A + T + + T S LRCF+CGE GH + C K T R G + + Sbjct: 114 ATTSNTLPVANSGTEPTLRRSSQPNALRCFACGEPGHLQTACPKQTRRGLFGDETKWDKD 173 Query: 249 XXXXXXXXXXXXXXXXCLEDESDDRI-----FGDQEEALVAKKTLLAPRNEDEDWRRTNI 413 EDE D + GD +L+ + LAP +E W RTNI Sbjct: 174 DAADDN------------EDEFDSEVPEDHHHGDTSPSLMLRHVCLAPVVLEEPWLRTNI 221 Query: 414 FHTSCTIHGKVCHIIV 461 F ++CTI GKVC +V Sbjct: 222 FQSTCTIKGKVCRFVV 237 >gb|EOY10285.1| Uncharacterized protein TCM_025656 [Theobroma cacao] Length = 505 Score = 63.9 bits (154), Expect = 2e-08 Identities = 37/129 (28%), Positives = 56/129 (43%), Gaps = 1/129 (0%) Frame = +3 Query: 78 PATTTATTCSFDAKTTKATPSGGTNLRCFSCGEIGHTRNECKKATSRSGKQLLIEXXXXX 257 P ++ T S + K T T + N +CF C GH +C + +I Sbjct: 122 PKVNSSKTASSNDKETTFTRASNVNKKCFKCQGFGHIAFDCPN-------RRIISLVEEE 174 Query: 258 XXXXXXXXXXXXXCLEDESDDRIFGDQEEALVAKKTL-LAPRNEDEDWRRTNIFHTSCTI 434 +DE + + D EAL+ ++ L A +DE W R NIF+T CT Sbjct: 175 DYANWEKLEPVYDEYDDEEIEEVSADHGEALIVRRNLNTAMMTKDESWLRHNIFYTRCTS 234 Query: 435 HGKVCHIIV 461 GKVC++I+ Sbjct: 235 QGKVCNVII 243 >gb|EMJ11865.1| hypothetical protein PRUPE_ppa022462mg [Prunus persica] Length = 606 Score = 63.5 bits (153), Expect = 3e-08 Identities = 38/109 (34%), Positives = 51/109 (46%), Gaps = 4/109 (3%) Frame = +3 Query: 147 TNLRCFSCGEIGHTRNECKKATSRSGKQLLIEXXXXXXXXXXXXXXXXXXCLE--DESDD 320 T RCF CGE GH ECKK + R GK L IE E D ++ Sbjct: 234 TAFRCFKCGETGHCMAECKK-SDRVGKGLFIEHDENQLQEYHDFEHGPVYDNEPNDVVEE 292 Query: 321 RIFGDQEEALVAKKTLLAPRNED--EDWRRTNIFHTSCTIHGKVCHIIV 461 + D L+ +KT PR + + W R N+F + CTI GKVC +++ Sbjct: 293 YMTEDDGPLLMVRKTCFTPRETEGSDGWLRNNVFQSICTIGGKVCKLVI 341 >gb|EOX96724.1| Gag-pol polyprotein, putative [Theobroma cacao] Length = 794 Score = 63.2 bits (152), Expect = 4e-08 Identities = 38/131 (29%), Positives = 61/131 (46%), Gaps = 3/131 (2%) Frame = +3 Query: 78 PATTTATTCSFDAKTTKATPSGGTNLRCFSCGEIGHTRNEC--KKATSRSGKQLLIEXXX 251 P ++ + + K T +T + N +CF C GH ++C ++ S ++++ E Sbjct: 271 PPPKVNSSKTINHKETTSTRAPNVNKKCFKCQGFGHIASDCPNRRIISLIEEEVMEEPSL 330 Query: 252 XXXXXXXXXXXXXXXCLEDESDDRIFGDQEEALVAKKTL-LAPRNEDEDWRRTNIFHTSC 428 +E + + D EALV ++ L A EDE W R NIFHT C Sbjct: 331 EEVDDELEI-------FNNEEIEEVSADHGEALVVRRNLNTAMLTEDESWLRHNIFHTRC 383 Query: 429 TIHGKVCHIIV 461 T GKVC++I+ Sbjct: 384 TSQGKVCNVII 394 >ref|XP_006410106.1| hypothetical protein EUTSA_v10017580mg, partial [Eutrema salsugineum] gi|557111275|gb|ESQ51559.1| hypothetical protein EUTSA_v10017580mg, partial [Eutrema salsugineum] Length = 282 Score = 61.2 bits (147), Expect = 1e-07 Identities = 31/101 (30%), Positives = 52/101 (51%) Frame = +3 Query: 159 CFSCGEIGHTRNECKKATSRSGKQLLIEXXXXXXXXXXXXXXXXXXCLEDESDDRIFGDQ 338 CF+CGE GH ++ C K +++ + E+++R+ GD Sbjct: 9 CFNCGETGHRQSACPKRVLFGDDEIVFDPEVEDQQ-------------HTETEERVTGDA 55 Query: 339 EEALVAKKTLLAPRNEDEDWRRTNIFHTSCTIHGKVCHIIV 461 + LV +++ L P+ E E W ++NIF ++ TI GKVC +IV Sbjct: 56 DNLLVTRRSFLTPQIE-ESWLQSNIFRSTYTIRGKVCRLIV 95 >ref|XP_006392773.1| hypothetical protein EUTSA_v10012229mg [Eutrema salsugineum] gi|557089351|gb|ESQ30059.1| hypothetical protein EUTSA_v10012229mg [Eutrema salsugineum] Length = 382 Score = 58.5 bits (140), Expect = 9e-07 Identities = 36/110 (32%), Positives = 52/110 (47%), Gaps = 7/110 (6%) Frame = +3 Query: 153 LRCFSCGEIGHTRNECKKATSRSGKQLLIEXXXXXXXXXXXXXXXXXXCLEDESDDRIF- 329 L+C+SCGE GH + C R LL+E DE D I+ Sbjct: 134 LKCYSCGEPGHRQTACPNQQRRG---LLLEDTEGVYNSA------------DEEDTGIYE 178 Query: 330 -----GDQEE-ALVAKKTLLAPRNEDEDWRRTNIFHTSCTIHGKVCHIIV 461 GD L+ ++ LAP +E W RTNIF ++CTI GK+C++++ Sbjct: 179 ETLTSGDSNAPVLMLRRICLAPVGYEEPWLRTNIFRSTCTIKGKLCNLVI 228 >gb|EOY18660.1| Uncharacterized protein TCM_043155 [Theobroma cacao] Length = 625 Score = 57.4 bits (137), Expect = 2e-06 Identities = 34/129 (26%), Positives = 54/129 (41%), Gaps = 1/129 (0%) Frame = +3 Query: 78 PATTTATTCSFDAKTTKATPSGGTNLRCFSCGEIGHTRNECKKATSRSGKQLLIEXXXXX 257 P ++ T S + K T T + N +CF C GH ++C + +I Sbjct: 242 PKVNSSKTASSNDKETTFTRASNVNKKCFKCQRFGHIASDCPS-------RRIISLVEEE 294 Query: 258 XXXXXXXXXXXXXCLEDESDDRIFGDQEEALVAKKTL-LAPRNEDEDWRRTNIFHTSCTI 434 +DE + + D EA + ++ L A +DE R NIF+T CT Sbjct: 295 DYVNWEKLEPVYDEYDDEEIEEVSADHGEAFIVRRNLNTALMTKDESCLRHNIFYTRCTS 354 Query: 435 HGKVCHIIV 461 G VC++I+ Sbjct: 355 QGNVCNVII 363 >ref|XP_006293129.1| hypothetical protein CARUB_v10019435mg [Capsella rubella] gi|482561836|gb|EOA26027.1| hypothetical protein CARUB_v10019435mg [Capsella rubella] Length = 595 Score = 57.4 bits (137), Expect = 2e-06 Identities = 35/106 (33%), Positives = 49/106 (46%), Gaps = 3/106 (2%) Frame = +3 Query: 153 LRCFSCGEIGHTRNECKKATSRSGKQLLIEXXXXXXXXXXXXXXXXXXCLEDESDDRIFG 332 LRCFSCGE GH + C T R LL + ++ D I G Sbjct: 458 LRCFSCGENGHRQTACPNQTRRG---LLAQETEFTDEPRFDEYLSDSN--QEHDTDCIGG 512 Query: 333 DQ---EEALVAKKTLLAPRNEDEDWRRTNIFHTSCTIHGKVCHIIV 461 D + LV ++ L PR+ E W RT++F + TI GK+C +I+ Sbjct: 513 DTGHGSQILVLRRNCLLPRSTKESWLRTSLFRSISTIKGKICKLII 558 >gb|EOY19305.1| Uncharacterized protein TCM_044370 [Theobroma cacao] Length = 1306 Score = 55.1 bits (131), Expect = 1e-05 Identities = 44/154 (28%), Positives = 66/154 (42%), Gaps = 3/154 (1%) Frame = +3 Query: 9 RPDLGTRFVRASEVRDGTRRSLAPATTTATTCSFDAKTTKATPSGGTN--LRCFSCGEIG 182 +P GT + SE R G T+ A T T GG+N +RCF+CGE G Sbjct: 210 KPLYGTHWQNNSEARRGY--------PTSQQNYQGAATINKTNRGGSNSHIRCFTCGENG 261 Query: 183 HTRNECKKATSRSGKQLLIEXXXXXXXXXXXXXXXXXXCLEDESDDRIFGDQEEALVAKK 362 HT S +G Q + E+ + ++ Q E+LV ++ Sbjct: 262 HT--------SFAGPQRRVNLAELREELEPVYDEY-----EEIEEIDVYPAQGESLVVRR 308 Query: 363 TLLAPRNED-EDWRRTNIFHTSCTIHGKVCHIIV 461 + NE+ EDW+R +IF T GKVC +++ Sbjct: 309 VMTTTVNEEAEDWKRRSIFRTRVVCEGKVCDLVI 342 >gb|AAM15062.1| putative retroelement integrase [Arabidopsis thaliana] Length = 1215 Score = 55.1 bits (131), Expect = 1e-05 Identities = 33/104 (31%), Positives = 47/104 (45%) Frame = +3 Query: 150 NLRCFSCGEIGHTRNECKKATSRSGKQLLIEXXXXXXXXXXXXXXXXXXCLEDESDDRIF 329 ++RC+ C GH NEC K+++I E+E Sbjct: 156 DVRCYKCQGKGHYANECPN------KRVMILLDNGEIEPEEEIPDSPSSLKENEE----L 205 Query: 330 GDQEEALVAKKTLLAPRNEDEDWRRTNIFHTSCTIHGKVCHIIV 461 Q E LVA++TL DE +R N+FHT C +HGKVC +I+ Sbjct: 206 PAQGELLVARRTLSVQTKTDEQEQRKNLFHTRCHVHGKVCSLII 249