BLASTX nr result
ID: Stemona21_contig00031981
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Stemona21_contig00031981
         (580 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_004305134.1| PREDICTED: uncharacterized protein LOC101315...    71   2e-10
ref|XP_006289152.1| hypothetical protein CARUB_v10002590mg, part...    70   3e-10
ref|XP_002865522.1| hypothetical protein ARALYDRAFT_917523 [Arab...    62   1e-07
gb|EOY34748.1| Uncharacterized protein TCM_042328 [Theobroma cacao]    61   2e-07
ref|XP_006491472.1| PREDICTED: uncharacterized protein LOC102626...    59   1e-06
ref|XP_006483194.1| PREDICTED: putative ribonuclease H protein A...    59   1e-06
ref|NP_194638.1| ribonuclease H-like protein [Arabidopsis thalia...    59   1e-06
gb|EOY17514.1| Uncharacterized protein TCM_042330 [Theobroma cacao]    58   2e-06
ref|XP_004308214.1| PREDICTED: putative ribonuclease H protein A...    57   3e-06
gb|ABK28206.1| unknown [Arabidopsis thaliana]                          57   3e-06
ref|NP_180979.1| polynucleotidyl transferase, ribonuclease H-lik...    57   3e-06
gb|EOY02238.1| Uncharacterized protein TCM_016762 [Theobroma cacao]    57   4e-06
ref|XP_004308308.1| PREDICTED: uncharacterized protein LOC101295...    57   4e-06
ref|XP_004292002.1| PREDICTED: uncharacterized protein LOC101312...    55   1e-05
emb|CCA66222.1| hypothetical protein [Beta vulgaris subsp. vulga...    55   1e-05

>ref|XP_004305134.1| PREDICTED: uncharacterized protein LOC101315365 [Fragaria vesca subsp.
vesca] Length = 272 Score = 71.2 bits (173), Expect = 2e-10 Identities = 53/189 (28%), Positives = 89/189 (47%), Gaps = 10/189 (5%) Frame = +1 Query: 28 KSVTKAFRASVIAYVCWSLWKNRNSYLFQGNLKSSSTIFWQSIRMAKEFM-CTNHLYSVT 204 K++ + F V A++ W +WK RN+++++ S I ++ A EF+ C NH Sbjct: 5 KTIKEEFLTKV-AFLLWEIWKARNAFVYEAATIDPSLIARKANLAAAEFITCKNHKVISN 63 Query: 205 SVEDGELAARWGLSVHRKQAKKTIQTA---------WRPSSRCGLVLNTDAAFHHSTSQA 357 S+ L +R S H + +I + W P +N DAA+ S+ Sbjct: 64 SMPVSHLHSRQSQSNHPNHGRTSISLSPTVSNVTEIWSPPPPGHFKINCDAAWIESSKLT 123 Query: 358 GLGAILRDCNGVPLFLAMGQGKVRNVLEAECLALLKFLTVARQRYPGMHLTVQTDSAVLA 537 G+ A+ RDC+G A + +VLEAE A + + VA+ R P + +++DS L Sbjct: 124 GISALARDCHGTLFDGATLLCRAGSVLEAEAAAAVVAIEVAK-RLPPSPIILESDSKALV 182 Query: 538 SFINSNCEP 564 IN++ P Sbjct: 183 DQINNSKFP 191 >ref|XP_006289152.1| hypothetical protein CARUB_v10002590mg, partial [Capsella rubella] gi|482557858|gb|EOA22050.1| hypothetical protein CARUB_v10002590mg, partial [Capsella rubella] Length = 254 Score = 70.5 bits (171), Expect = 3e-10 Identities = 43/167 (25%), Positives = 86/167 (51%), Gaps = 1/167 (0%) Frame = +1 Query: 58 VIAYVCWSLWKNRNSYLFQGNLKSSSTIFWQSIRMAKEFMCTNHLYSVTSVEDGELAARW 237 V+ ++ W LWK+RN ++F+G + +++ A+E+ N +++D + R Sbjct: 30 VVPWLLWRLWKSRNEFIFKGKDFDAQDTVRKALEDAEEW---NQRKGQDALKDRQPTLR- 85 Query: 238 GLSVHRKQAKKTIQTAWRPSSRCGLVLNTDAAFHHSTSQAGLGAILRDCNGVPLFL-AMG 414 + + WRP RC + NTD A++ +Q+G+G +LR+C+G L++ A Sbjct: 86 ----------ASNEEKWRPPPRCWVKCNTDVAWNGEDTQSGMGWVLRNCSGEVLWMGAQA 135 Query: 415 QGKVRNVLEAECLALLKFLTVARQRYPGMHLTVQTDSAVLASFINSN 555 + R+ LE E L +++ + R + ++DS VL + +N++ Sbjct: 136 LRRTRSALEVE-LEAIRWAVTSMLRLDFQKIIFESDSLVLVNLLNND 181 >ref|XP_002865522.1| hypothetical protein ARALYDRAFT_917523 [Arabidopsis lyrata subsp. lyrata] gi|297311357|gb|EFH41781.1| hypothetical protein ARALYDRAFT_917523 [Arabidopsis lyrata subsp. 
lyrata] Length = 352 Score = 62.0 bits (149), Expect = 1e-07 Identities = 49/185 (26%), Positives = 83/185 (44%), Gaps = 2/185 (1%) Frame = +1 Query: 1 QILLDRNEKKSVTKAFRASVIAYVCWSLWKNRNSYLFQGNLKSSSTIFWQSIRMAKEFM- 177 + LLD ++ K + K R ++ W +WK RN +F + I Q++ KE++ Sbjct: 108 RFLLDLHDNKHIDKVCRYLPF-WLLWRIWKTRNDLIFNHKVTKGEDIVGQALIDTKEWLD 166 Query: 178 CTNHLYSVTSVEDGELAARWGLSVHRKQAKKTIQTAWRPSSRCGLVLNTDAAFHHSTSQA 357 C + + DG+L + + + W R + N DA+ + + Sbjct: 167 CQDRTHGPQ--HDGKLQG----------VRSSRSSKWCKPERGYVKCNFDASHYEGNQSS 214 Query: 358 GLGAILRDCNGVPLFLAMGQGKVRNVL-EAECLALLKFLTVARQRYPGMHLTVQTDSAVL 534 GLG I+RD NG L MG+ + R + EAEC AL+ + A H+ + D+A + Sbjct: 215 GLGRIIRDSNGTCLDCGMGKFQGRQTIEEAECSALI-WAIQASWALGYRHVEFEGDNANI 273 Query: 535 ASFIN 549 + IN Sbjct: 274 VNLIN 278 >gb|EOY34748.1| Uncharacterized protein TCM_042328 [Theobroma cacao] Length = 910 Score = 60.8 bits (146), Expect = 2e-07 Identities = 44/177 (24%), Positives = 76/177 (42%) Frame = +1 Query: 49 RASVIAYVCWSLWKNRNSYLFQGNLKSSSTIFWQSIRMAKEFMCTNHLYSVTSVEDGELA 228 R V ++ W LW RN + + + W+ +++ ++ L D ++A Sbjct: 668 RTLVPLFILWFLWVERNDAKHRNLGMYPNRVVWRVLKLIQQLSLGQQLLKWQWKGDKQIA 727 Query: 229 ARWGLSVHRKQAKKTIQTAWRPSSRCGLVLNTDAAFHHSTSQAGLGAILRDCNGVPLFLA 408 WG+ + + +W + LN D + HS + AG G ILRD GV +F Sbjct: 728 QEWGIILQAESLAPPKVFSWHKPTTGEFKLNVDGSAKHSHNAAG-GGILRDHAGVMVFGF 786 Query: 409 MGQGKVRNVLEAECLALLKFLTVARQRYPGMHLTVQTDSAVLASFINSNCEPPWYIQ 579 ++N L+AE LAL + L + R Y L ++ D+ + + N P I+ Sbjct: 787 SENLGIQNSLQAELLALYRGLILCRD-YNIRRLWIEMDAISVIRLLQGNHRGPHAIR 842 >ref|XP_006491472.1| PREDICTED: uncharacterized protein LOC102626455 [Citrus sinensis] Length = 1452 Score = 58.5 bits (140), Expect = 1e-06 Identities = 48/178 (26%), Positives = 79/178 (44%), Gaps = 1/178 (0%) Frame = +1 Query: 22 EKKSVTKAFRASVIAYVCWSLWKNRNSYLFQGNLKSSSTIFWQSIRMAKEFMCTNHLYSV 201 E S + A ++ CW +W RN ++F+G S + ++ + K + Sbjct: 1221 EMWSRSSTAEAELMIVYCWVIWSARNKFIFEGKKSDSRFLAAKADSVLKAY--------- 1271 Query: 202 
TSVEDGELAARWGLSVHRKQAKKTIQTAWRPSSRCGLVLNTDAAFHHSTSQAGLGAILRD 381 + ++ G +VH + + Q W+P S+ L LN DAA + GLGAI+RD Sbjct: 1272 ------QRVSKPG-NVHGAKDRGIDQQKWKPPSQNVLKLNVDAAVSTKDQKVGLGAIVRD 1324 Query: 382 CNGVPLFLAMGQGKVR-NVLEAECLALLKFLTVARQRYPGMHLTVQTDSAVLASFINS 552 G L + + Q + R V AE A+ L VA Q L V++D + +N+ Sbjct: 1325 AEGKILAVGIKQAQFRERVSLAEAEAIHWGLQVANQ-ISSSSLIVESDCKEVVELLNN 1381 >ref|XP_006483194.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Citrus sinensis] Length = 765 Score = 58.5 bits (140), Expect = 1e-06 Identities = 48/178 (26%), Positives = 79/178 (44%), Gaps = 1/178 (0%) Frame = +1 Query: 22 EKKSVTKAFRASVIAYVCWSLWKNRNSYLFQGNLKSSSTIFWQSIRMAKEFMCTNHLYSV 201 E S + A ++ CW +W RN ++F+G S + ++ + K + Sbjct: 534 EMWSRSSTAEAELMIVYCWVIWSARNKFIFEGKKSDSRFLAAKADSVLKAY--------- 584 Query: 202 TSVEDGELAARWGLSVHRKQAKKTIQTAWRPSSRCGLVLNTDAAFHHSTSQAGLGAILRD 381 + ++ G +VH + + Q W+P S+ L LN DAA + GLGAI+RD Sbjct: 585 ------QRVSKPG-NVHGAKDRGIDQQKWKPPSQNVLKLNVDAAVSTKXQKVGLGAIVRD 637 Query: 382 CNGVPLFLAMGQGKVR-NVLEAECLALLKFLTVARQRYPGMHLTVQTDSAVLASFINS 552 G L + + Q + R V AE A+ L VA Q L V++D + +N+ Sbjct: 638 AEGKILAVGIKQAQFRERVSLAEAEAIHWGLQVANQ-ISSSSLIVESDCKEVVELLNN 694 >ref|NP_194638.1| ribonuclease H-like protein [Arabidopsis thaliana] gi|4972055|emb|CAB43923.1| putative protein [Arabidopsis thaliana] gi|7269807|emb|CAB79667.1| putative protein [Arabidopsis thaliana] gi|67633766|gb|AAY78807.1| putative reverse transcriptase/RNA-dependent DNA polymerase [Arabidopsis thaliana] gi|332660185|gb|AEE85585.1| ribonuclease H-like protein [Arabidopsis thaliana] Length = 575 Score = 58.5 bits (140), Expect = 1e-06 Identities = 40/169 (23%), Positives = 82/169 (48%), Gaps = 1/169 (0%) Frame = +1 Query: 52 ASVIAYVCWSLWKNRNSYLFQGNLKSSSTIFWQSIRMAKEFMCTNHLYSVTSVEDGELAA 231 + ++ ++ W LWKNRN +F+G ++ + +R A++ +E+ + Sbjct: 357 SQLVPWLLWRLWKNRNELVFRGREFNAQEV----LRRAED-----------DLEEWRIRT 401 Query: 232 RWGLSVHRKQAKKTIQTAWRPSSRCGLVLNTDAAFHHSTSQAGLGAILRDCNG-VPLFLA 408 + Q ++ WRP + NTDA 
++ + G+G +LR+ G V A Sbjct: 402 EAESCGTKPQVNRSSCGRWRPPPHQWVKCNTDATWNRDNERCGIGWVLRNEKGEVKWMGA 461 Query: 409 MGQGKVRNVLEAECLALLKFLTVARQRYPGMHLTVQTDSAVLASFINSN 555 K+++VLEAE L +++ ++ R+ ++ ++DS VL +N++ Sbjct: 462 RALPKLKSVLEAE-LEAMRWAVLSLSRFQYNYVIFESDSQVLIEILNND 509 >gb|EOY17514.1| Uncharacterized protein TCM_042330 [Theobroma cacao] Length = 2249 Score = 57.8 bits (138), Expect = 2e-06 Identities = 45/177 (25%), Positives = 74/177 (41%) Frame = +1 Query: 49 RASVIAYVCWSLWKNRNSYLFQGNLKSSSTIFWQSIRMAKEFMCTNHLYSVTSVEDGELA 228 R V + W LW RN + + I W+ +++ ++ L D ++A Sbjct: 2007 RTLVPIFTLWFLWVERNDAKHRNLGMYPNRIVWRILKLIQQLSLGQQLLKWQWKGDKQIA 2066 Query: 229 ARWGLSVHRKQAKKTIQTAWRPSSRCGLVLNTDAAFHHSTSQAGLGAILRDCNGVPLFLA 408 WG++ + W S LN D + S + AG G +LRD GV +F Sbjct: 2067 QEWGITFQAESLPPPKVFPWHKPSIGEFKLNVDGSAKLSQNAAG-GGVLRDHAGVMVFGF 2125 Query: 409 MGQGKVRNVLEAECLALLKFLTVARQRYPGMHLTVQTDSAVLASFINSNCEPPWYIQ 579 ++N L+AE LAL + L + R Y L ++ D+A + + N P I+ Sbjct: 2126 SENLGIQNSLQAELLALYRGLILCRD-YNIRRLWIEMDAASVIRLLQGNQRGPHAIR 2181 >ref|XP_004308214.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Fragaria vesca subsp. 
vesca] Length = 409 Score = 57.4 bits (137), Expect = 3e-06 Identities = 32/116 (27%), Positives = 63/116 (54%) Frame = +1 Query: 232 RWGLSVHRKQAKKTIQTAWRPSSRCGLVLNTDAAFHHSTSQAGLGAILRDCNGVPLFLAM 411 ++GL H QA + + W P + +NTD A+ +T ++G G I RD +G L Sbjct: 227 KFGLLCHPCQALRITKVNWHPPLFGWIKVNTDGAWQKTTGKSGYGGIFRDFHGSFLGAFA 286 Query: 412 GQGKVRNVLEAECLALLKFLTVARQRYPGMHLTVQTDSAVLASFINSNCEPPWYIQ 579 ++ N ++AE +A+++ + +A R H+ ++ DSA++ +F++ PW ++ Sbjct: 287 SNLEIPNSVDAEVMAVIQAIELAWVR-DWKHILLEVDSAIVLNFLHDPHLVPWRLR 341 >gb|ABK28206.1| unknown [Arabidopsis thaliana] Length = 293 Score = 57.0 bits (136), Expect = 3e-06 Identities = 38/168 (22%), Positives = 78/168 (46%), Gaps = 1/168 (0%) Frame = +1 Query: 55 SVIAYVCWSLWKNRNSYLFQGNLKSSSTIFWQSIRMAKEFMCTNHLYSVTSVEDGELAAR 234 +++ ++ W LWK+RN +F+G + + +++ +E+ L S Sbjct: 75 NLVPWLLWRLWKSRNELMFKGKEYDAPEVLRRAMEDFEEWSTRRELEGKAS--------- 125 Query: 235 WGLSVHRKQAKKTIQTAWRPSSRCGLVLNTDAAFHHSTSQAGLGAILRDCNGVPLFL-AM 411 Q ++ + W+ + NTDA + + G+G ILR+ +G L++ A Sbjct: 126 ------GPQVERNLSVQWKAPPYQWVKCNTDATWQLENPRCGIGWILRNESGGVLWMGAR 179 Query: 412 GQGKVRNVLEAECLALLKFLTVARQRYPGMHLTVQTDSAVLASFINSN 555 + +NVLEAE L L++ + R+ + ++D+ L + +NS+ Sbjct: 180 ALPRTKNVLEAE-LEALRWAVLTMSRFNYKRIIFESDAQALVNLLNSD 226 >ref|NP_180979.1| polynucleotidyl transferase, ribonuclease H-like superfamily protein [Arabidopsis thaliana] gi|3337363|gb|AAC27408.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] gi|91805481|gb|ABE65469.1| hypothetical protein At2g34320 [Arabidopsis thaliana] gi|330253864|gb|AEC08958.1| polynucleotidyl transferase, ribonuclease H-like superfamily protein [Arabidopsis thaliana] Length = 292 Score = 57.0 bits (136), Expect = 3e-06 Identities = 38/168 (22%), Positives = 78/168 (46%), Gaps = 1/168 (0%) Frame = +1 Query: 55 SVIAYVCWSLWKNRNSYLFQGNLKSSSTIFWQSIRMAKEFMCTNHLYSVTSVEDGELAAR 234 +++ ++ W LWK+RN +F+G + + +++ +E+ L S Sbjct: 75 NLVPWLLWRLWKSRNELMFKGKEYDAPEVLRRAMEDFEEWSTRRELEGKAS--------- 125 Query: 235 
WGLSVHRKQAKKTIQTAWRPSSRCGLVLNTDAAFHHSTSQAGLGAILRDCNGVPLFL-AM 411 Q ++ + W+ + NTDA + + G+G ILR+ +G L++ A Sbjct: 126 ------GPQVERNLSVQWKAPPYQWVKCNTDATWQLENPRCGIGWILRNESGGVLWMGAR 179 Query: 412 GQGKVRNVLEAECLALLKFLTVARQRYPGMHLTVQTDSAVLASFINSN 555 + +NVLEAE L L++ + R+ + ++D+ L + +NS+ Sbjct: 180 ALPRTKNVLEAE-LEALRWAVLTMSRFNYKRIIFESDAQALVNLLNSD 226 >gb|EOY02238.1| Uncharacterized protein TCM_016762 [Theobroma cacao] Length = 2214 Score = 56.6 bits (135), Expect = 4e-06 Identities = 37/142 (26%), Positives = 63/142 (44%) Frame = +1 Query: 67 YVCWSLWKNRNSYLFQGNLKSSSTIFWQSIRMAKEFMCTNHLYSVTSVEDGELAARWGLS 246 ++CW LW RN ++ + ++ I W+ +++ ++ + L D ++AA W + Sbjct: 1979 FICWFLWLERNDAKYRHSGLNTDRIVWRIMKLLRQLKDGSLLQQWQWKGDTDIAAMWQYN 2038 Query: 247 VHRKQAKKTIQTAWRPSSRCGLVLNTDAAFHHSTSQAGLGAILRDCNGVPLFLAMGQGKV 426 K WR S LN D + H A G +LRD G +F Sbjct: 2039 FQLKLRAPPQIVYWRKPSTGEYKLNVDGSSRHG-QHAASGGVLRDHTGKLIFGFSENIGT 2097 Query: 427 RNVLEAECLALLKFLTVARQRY 492 N L+AE ALL+ L + ++R+ Sbjct: 2098 CNSLQAELRALLRGLLLCKERH 2119 >ref|XP_004308308.1| PREDICTED: uncharacterized protein LOC101295087 [Fragaria vesca subsp. vesca] Length = 227 Score = 56.6 bits (135), Expect = 4e-06 Identities = 48/181 (26%), Positives = 85/181 (46%), Gaps = 5/181 (2%) Frame = +1 Query: 52 ASVIAYVCWSLWKNRNSYLFQG---NL-KSSSTIFWQSIRMAK-EFMCTNHLYSVTSVED 216 A V+ C +WK RN F G N+ K+ + IF Q + K C N+ S+ Sbjct: 34 AFVVTLYC--IWKFRNQARFDGVQPNVSKACNLIFGQVVASNKISSSCMNNGVFYLSI-- 89 Query: 217 GELAARWGLSVHRKQAKKTIQTAWRPSSRCGLVLNTDAAFHHSTSQAGLGAILRDCNGVP 396 + G+ +A ++ W P + +NTD A+ S+ QAG G I RD G Sbjct: 90 ---LKKVGVPCKPSKAPCIVEVNWHPPLFGWVKVNTDGAWRSSSGQAGYGGIFRDFRGGV 146 Query: 397 LFLAMGQGKVRNVLEAECLALLKFLTVARQRYPGMHLTVQTDSAVLASFINSNCEPPWYI 576 L + + + + AE +A++K + ++ R H+ ++ DS+++ +F+ S PW + Sbjct: 147 LGVFCSNFNMASSVAAEVMAVIKAIELSWVR-DWKHVWLEVDSSLVITFLRSPHLVPWKL 205 Query: 577 Q 579 + Sbjct: 206 R 206 >ref|XP_004292002.1| PREDICTED: uncharacterized protein LOC101312896 [Fragaria vesca subsp. 
vesca] Length = 2277 Score = 55.5 bits (132), Expect = 1e-05 Identities = 43/169 (25%), Positives = 77/169 (45%), Gaps = 3/169 (1%) Frame = +1 Query: 55 SVIAYVCWSLWKNRNSYLFQGNLKSSSTIFWQSIRMAKEFMCTNHLYSVTSVEDGELAAR 234 + + + WS+W+N+N +++ N K ++ + ++R +EF Sbjct: 1209 ATLMMIIWSVWRNQNDHVWNNNAKQATEVVPLTLRWWEEFK------------------- 1249 Query: 235 WGLSVHRKQAKKTIQTAWRPSSRCGLVLNTDAAFHHSTSQAGLGAILRDCNGVPLFLAM- 411 V R+ + + W S + LN DAA+++++ A LG + RD G L + M Sbjct: 1250 -SAFVKRRGRNVSPRDRWITPSSGYIKLNVDAAYNNASCMASLGGVFRDHTGSCLGVFMQ 1308 Query: 412 GQGKVRNVLEAECLALLKFLTVA--RQRYPGMHLTVQTDSAVLASFINS 552 G + +E +ALL + VA YP L+++TD VL S + S Sbjct: 1309 GLFSAHSAHHSELMALLVGVQVAIHHNLYP---LSIETDCLVLVSALTS 1354 >emb|CCA66222.1| hypothetical protein [Beta vulgaris subsp. vulgaris] Length = 1383 Score = 55.5 bits (132), Expect = 1e-05 Identities = 46/189 (24%), Positives = 75/189 (39%), Gaps = 17/189 (8%) Frame = +1 Query: 55 SVIAYVCWSLWKNRNSYLFQGNLKSSSTIFWQSIRMAKEFMCTNHLYSVTSVEDG----- 219 ++ + WSLWK RNS +F + S I ++ + T + V + +DG Sbjct: 1153 AIFFIIIWSLWKERNSRIFNNSNSSLEEI--------QDLILTRLCWWVKAWDDGFPFAC 1204 Query: 220 ------ELAARW----GLSVHRKQAKKTIQTAWRPSSRCGLVLNTDAAFHHSTSQAGLGA 369 +W G + ++ AW P L N DA+F A +G Sbjct: 1205 SEVIRNPACLKWTQSKGCNFGTIGPTNLLKAAWSPPPSNHLQWNVDASFKPGLEHAAVGG 1264 Query: 370 ILRDCNGVPLFLAMGQGKVRNVLEAECLALLKFL--TVARQRYPGMHLTVQTDSAVLASF 543 +LRD NG + L + AE A+ + L +++ R HL + +DSA + Sbjct: 1265 VLRDENGCFVCLFSSPIPRLEINSAEIYAIFRALKISLSSDRIKAQHLIIVSDSANAVRW 1324 Query: 544 INSNCEPPW 570 N + PW Sbjct: 1325 CNQDEGGPW 1333