BLASTX nr result
ID: Rheum21_contig00018016
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Rheum21_contig00018016 (1731 letters)

Database: ./nr
37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

dbj|BAB09815.1| non-LTR retroelement reverse transcriptase-like ...    98   1e-17
gb|AAD22368.1| putative non-LTR retroelement reverse transcripta...    94   1e-16
gb|AAC63844.1| putative non-LTR retroelement reverse transcripta...    88   1e-14
gb|EOY02864.1| Ribonuclease H protein [Theobroma cacao]                79   6e-12
gb|ABE80156.1| Ribonuclease H [Medicago truncatula]                    78   1e-11
gb|EOY24207.1| Non-LTR retroelement reverse transcriptase [Theob...    76   5e-11
gb|EOX98014.1| Ribonuclease H protein [Theobroma cacao]                72   1e-09
ref|XP_003548282.1| PREDICTED: TMV resistance protein N [Glycine...    67   2e-08
gb|EOY24339.1| RNA-directed DNA polymerase (Reverse transcriptas...    67   3e-08
gb|EOY25852.1| Non-LTR retroelement reverse transcriptase [Theob...    66   4e-08
gb|EOY25817.1| Non-LTR retroelement reverse transcriptase [Theob...    66   4e-08
gb|EOY19161.1| Polynucleotidyl transferase, putative [Theobroma ...    65   7e-08
ref|XP_006468253.1| PREDICTED: uncharacterized protein LOC102614...
59 7e-06 >dbj|BAB09815.1| non-LTR retroelement reverse transcriptase-like [Arabidopsis thaliana] Length = 676 Score = 97.8 bits (242), Expect = 1e-17 Identities = 73/294 (24%), Positives = 124/294 (42%), Gaps = 25/294 (8%) Frame = +3 Query: 99 RTKTKSKHSPACPRC-GLDETCIHAIRDCAFIQNTWQHIGGRALHPLFFASNMGTWQEDN 275 R + S CP C G E+ IH +RDC + W + FF +++ W N Sbjct: 374 RVRRHMADSDVCPLCKGASESLIHVLRDCPAMMGIWMRVVPVMEQRRFFETSLLEWMYGN 433 Query: 276 LKRKN-----GWSTLFGNMIWHSWKNRNRLVFENETEPADRL------ICKIKMAALNFV 422 LK ++ W TLF +W WK R VF ++ DR+ + +++ A L Sbjct: 434 LKERSDSERRSWPTLFALTVWWGWKWRCGYVFGEDSRCRDRVKFLKSAVAEVEAAHL-AA 492 Query: 423 AEDYKRKTKLQGILAYQ------------XXXXXXXXXXXXGWVVRSTSGECILLYSSFF 566 D + ++ ++A++ G V+R G ++ ++ Sbjct: 493 NGDAREDVLVERMIAWRKPAEGWVTMNTDGASHGNPGQATAGGVIRDEHGSWLVGFALNI 552 Query: 567 GICXXXXXXXXXXXXXXXXXMNRCFPKVLVEMDSLEVVQLLTQDMPSTHSDFYLIKICRE 746 G+C R + +V +E+DS VV L + +H +L+++C Sbjct: 553 GVCSAPLAELWGVYYGLVVAWERGWRRVRLEVDSALVVGFLQSGIGDSHPLAFLVRLCHG 612 Query: 747 LVCNPGWDVQFTHVPRSHNNVADYLANFSFSHQLGMVIHDH-PPHVMQLLEADV 905 + + W V+ THV R N +AD LAN++F+ G ++ D P HV +L DV Sbjct: 613 FI-SKDWIVRITHVYREANRLADGLANYAFTLPFGFLLLDSCPEHVSSILLEDV 665 >gb|AAD22368.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 321 Score = 94.4 bits (233), Expect = 1e-16 Identities = 71/278 (25%), Positives = 123/278 (44%), Gaps = 19/278 (6%) Frame = +3 Query: 132 CPRC-GLDETCIHAIRDCAFIQNTWQHIGGRALHPLFFASNMGTWQEDNLKRKNGWSTLF 308 C C G +ET +H +RDC + W + R FF +++ W NL+ + W T+F Sbjct: 35 CQICQGGEETILHVLRDCPAMAGIWSRLVPRDQIRQFFTASLLEWIYKNLRERGSWPTVF 94 Query: 309 GNMIWHSWKNRNRLVFENETEPADRL-----ICKIKMAALNFVAEDYKRKTKLQGILAY- 470 +W WK R +F + DR+ + + A FV + R ++++ ++++ Sbjct: 95 VMAVWWGWKWRCGNIFGGNGKCRDRVKFIKDLAEEVAIANAFVKGNEVRVSRVERLVSWV 154 Query: 471 -----------QXXXXXXXXXXXXGWVVRSTSGECILLYSSFFGICXXXXXXXXXXXXXX 617 G V+R +G I ++ G+C Sbjct: 155 SPEDGWVKLNTDGASRGNPGFATAGGVLRDHNGAWIGGFAVNIGVCSAPLAELWGVYYGL 214 Query: 618 
XXXMNRCFPKVLVEMDSLEVVQLLTQDMPSTHSDFYLIKICRELVCNPGWDVQFTHVPRS 797 R +V +E+DS VV LT + +H +L+++C + + + GW V+ +HV R Sbjct: 215 FIAWGRGARRVELEVDSKMVVGFLTTGIADSHPLSFLLRLCYDFL-SKGWIVRISHVYRE 273 Query: 798 HNNVADYLANFSFSHQLGM-VIHDHPPHVMQLLEADVA 908 N +AD LAN++FS LG+ ++ P V +L DVA Sbjct: 274 ANRLADGLANYAFSLSLGLHLLESRPDVVSSILLDDVA 311 >gb|AAC63844.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 1231 Score = 87.8 bits (216), Expect = 1e-14 Identities = 73/295 (24%), Positives = 127/295 (43%), Gaps = 26/295 (8%) Frame = +3 Query: 99 RTKTKSKHSPACPRC-GLDETCIHAIRDCAFIQNTWQHIGGRALHPLFFASNMGTWQEDN 275 R + + C C G +ET +H +RDC ++ W+ + H FF+ ++ W N Sbjct: 933 RVRRHLSENAICSVCNGAEETILHVLRDCPAMEPIWRRLLPLRRHHEFFSQSLLEWLFTN 992 Query: 276 LKRKNG-WSTLFGNMIWHSWKNRNRLVFENETEPADRLICKIKMAALNFVAEDYKR---- 440 + G W TLFG IW +WK R VF +R IC+ ++ + +AE+ +R Sbjct: 993 MDPVKGIWPTLFGMGIWWAWKWRCCDVF------GERKICRDRLKFIKDMAEEVRRVHVG 1046 Query: 441 -------KTKLQGILAYQ------------XXXXXXXXXXXXGWVVRSTSGECILLYSSF 563 +++ ++ +Q G +R+ GE + ++ Sbjct: 1047 AVGNRPNGVRVERMIRWQVPSDGWVKITTDGASRGNHGLAAAGGAIRNGQGEWLGGFALN 1106 Query: 564 FGICXXXXXXXXXXXXXXXXXMNRCFPKVLVEMDSLEVVQLLTQDMPSTHSDFYLIKICR 743 G C ++ F +V +++D VV L+ + + H +L+++C+ Sbjct: 1107 IGSCAAPLAELWGAYYGLLIAWDKGFRRVELDLDCKLVVGFLSTGVSNAHPLSFLVRLCQ 1166 Query: 744 ELVCNPGWDVQFTHVPRSHNNVADYLANFSFSHQLGMVIHDH-PPHVMQLLEADV 905 W V+ +HV R N +AD LAN++F+ LG+ D P V LL ADV Sbjct: 1167 GFFTR-DWLVRVSHVYREANRLADGLANYAFTLPLGLHCFDACPEGVRLLLLADV 1220 >gb|EOY02864.1| Ribonuclease H protein [Theobroma cacao] Length = 660 Score = 79.0 bits (193), Expect = 6e-12 Identities = 67/275 (24%), Positives = 113/275 (41%), Gaps = 27/275 (9%) Frame = +3 Query: 132 CPRCGL-DETCIHAIRDCAFIQNTWQHIGGRALHPLFFASNMGTWQEDNLKRK-----NG 293 CP+C + DET H +RDC + W I + FF + W NL+++ N Sbjct: 366 CPQCRMEDETVTHVLRDCMVATSLWVKIIPQHEQNDFFTFPLREWLVSNLQKQQLILGNP 425 Query: 294 WSTLFGNMIWHSWKNRNRLVFENETEPADRLICKIKMAALNFVAED-------YKRKTKL 452 WS +FG W WK RN +VF P + I IK A +A 
+R+ K Sbjct: 426 WSVVFGLACWCLWKWRNGVVFYAAFNPTRKRISMIKSMATATIATSADFDGVQVERRKKE 485 Query: 453 QGILAYQXXXXXXXXXXXX------------GWVVRSTSGECILLYSSFFGICXXXXXXX 596 + ++ ++ G V+R+ G+ + + G C Sbjct: 486 EVLIGWRTPQVGWVCLNTDEAYKRSIEEASTGGVIRNAEGDWQAEFLAKLGKCSAYRAEL 545 Query: 597 XXXXXXXXXXMNRCFPKVLVEMDSLEVVQLLTQD--MPSTHSDFYLIKICRELVCNPGWD 770 + F KV V++D+ VV ++ + +P ++D LI+ ++ V W+ Sbjct: 546 WGVLHGLRLAWDSGFKKVQVQVDNKMVVPAVSTNKLIPGANTD--LIRAIKD-VLQKEWE 602 Query: 771 VQFTHVPRSHNNVADYLANFSFSHQLGMVIHDHPP 875 V F H N V DYLA+++F + ++ + P Sbjct: 603 VSFMHTYCEGNMVTDYLASYAFVLEKSYIVLEQAP 637 >gb|ABE80156.1| Ribonuclease H [Medicago truncatula] Length = 438 Score = 78.2 bits (191), Expect = 1e-11 Identities = 71/280 (25%), Positives = 114/280 (40%), Gaps = 20/280 (7%) Frame = +3 Query: 123 SPACPRCGL-DETCIHAIRDCAFIQNTWQHIGGRALHPLFFASNMGTWQEDNLKRKNGWS 299 SP C RCG DET +H +RDC F ++ WQ IG + F A++ W + + + Sbjct: 158 SPTCARCGEEDETFLHCVRDCHFSRSIWQKIGFTG-NDFFTATSAHDWFK--IGMSSSLP 214 Query: 300 TLFGNMIWHSWKNRNRLVFENETEPADRLICKIKMAAL----NFVAED----------YK 437 +F +W +W++RN + NET RL I AA F +E+ + Sbjct: 215 DIFFGGLWWAWRHRNLMCLNNETMSLFRLCNNIVSAATYIKSAFDSEENVNHSDRFVKWN 274 Query: 438 RKTKLQGILAYQXXXXXXXXXXXXGWVVRSTSGECILLYSSFFGICXXXXXXXXXXXXXX 617 + IL G ++R+++G L S F G Sbjct: 275 NRNHHDHILNVDGSCLGTPSRTGYGGILRNSAG---LFISGFSGFIPNSTDILQAELTAI 331 Query: 618 XXXMNRCFPK----VLVEMDSLEVVQLLTQDMPSTHSDFYLIKICRELVCNPGWDVQFTH 785 ++ V+ DSL V L+ D P H+ LI+ ++L+ ++ H Sbjct: 332 HQSLHMVIDSNMNDVMCYSDSLLAVNLIMNDTPRYHTYAVLIQNIKDLLSVR--NITLHH 389 Query: 786 VPRSHNNVADYLANFSFSHQLGMVIHDHPP-HVMQLLEAD 902 R N AD+ A + + +V+H PP ++ LL AD Sbjct: 390 TLREGNQCADFFAKLGANSDVHLVVHQSPPADLLPLLRAD 429 >gb|EOY24207.1| Non-LTR retroelement reverse transcriptase [Theobroma cacao] Length = 391 Score = 75.9 bits (185), Expect = 5e-11 Identities = 61/269 (22%), Positives = 104/269 (38%), Gaps = 27/269 (10%) Frame = +3 Query: 111 KSKHSPACPRC-GLDETCIHAIRDCAFIQNTWQHIGGRALHPLFFASNMGTWQEDNLKRK 287 K +CP G ETC+H +RDC + W++I ++ FF + + W NL K 
Sbjct: 121 KMSSDASCPHYYGAKETCLHVLRDCPASKTLWRNILPQSGINQFFQTPLIDWLSSNLNLK 180 Query: 288 N------GWSTLFGNMIWHSWKNRNRLVFENETEPADRLICKIKMAALNFVAEDYKRKTK 449 N W+ +FG W++WK RN +FE + + IK A+N + + Sbjct: 181 NLYVFDVPWNIVFGIACWYTWKWRNLFIFEGRELSVEGRLSIIKSMAVN-SHNTWSTPSI 239 Query: 450 LQGILAYQXXXXXXXXXXXXGW--------------------VVRSTSGECILLYSSFFG 569 + G + +Q W V+R G I+ Y+ Sbjct: 240 ISGGMRHQEEILVGWSPPPKDWIAVNSDGVFKSAARTAAAGGVLRDAHGTWIVGYACKLE 299 Query: 570 ICXXXXXXXXXXXXXXXXXMNRCFPKVLVEMDSLEVVQLLTQDMPSTHSDFYLIKICREL 749 R F KV ++ D+ +VQ ++ S+ LI+ + + Sbjct: 300 TSSGLRVELWGFYKGLQLAWERGFRKVKLQSDNKAMVQAISFSSVHPCSNLDLIRAIKGM 359 Query: 750 VCNPGWDVQFTHVPRSHNNVADYLANFSF 836 + W+V +H+ R N AD+++N F Sbjct: 360 L-GRHWEVNISHIYREANTTADFMSNLGF 387 >gb|EOX98014.1| Ribonuclease H protein [Theobroma cacao] Length = 528 Score = 71.6 bits (174), Expect = 1e-09 Identities = 66/270 (24%), Positives = 105/270 (38%), Gaps = 22/270 (8%) Frame = +3 Query: 132 CPRCGL-DETCIHAIRDCAFIQNTWQHIGGRALHPLFFASNMGTWQEDNLKRKNGWSTLF 308 CP+C + DET H +RDC + W+ L N WS +F Sbjct: 150 CPQCRMEDETVTHVLRDCMVATSLWKQ---------------------QLILGNPWSIVF 188 Query: 309 GNMIWHSWKNRNRLVFENETEPADRLICKIKMAALNFVAED-------YKRKTKLQGILA 467 W+ WK RN +VF+ P + I IK A +A +R+ K + ++ Sbjct: 189 RLACWYLWKWRNGVVFDVAFNPTRKRISMIKSMATATIAPSADFDGVQVERRKKEEVLIE 248 Query: 468 YQXXXXXXXXXXXXG------------WVVRSTSGECILLYSSFFGICXXXXXXXXXXXX 611 ++ G V R+ G+ + + G C Sbjct: 249 WRAPQVGWVCLNTDGAYKRSIEEASAGGVKRNAEGDWQAGFVAKLGKCSAYRAELWGILH 308 Query: 612 XXXXXMNRCFPKVLVEMDSLEVVQLLTQD--MPSTHSDFYLIKICRELVCNPGWDVQFTH 785 + F KV V++D+ VVQ ++ D +P ++D LI + V W+V F H Sbjct: 309 GLRLAWDSGFKKVQVQVDNKMVVQAISTDKLIPGANTD--LISAIKN-VLQKEWEVSFMH 365 Query: 786 VPRSHNNVADYLANFSFSHQLGMVIHDHPP 875 R N VADYLA+++F + V+ + P Sbjct: 366 TYREGNMVADYLASYAFVLEESYVVLEQAP 395 >ref|XP_003548282.1| PREDICTED: TMV resistance protein N [Glycine max] Length = 1420 Score = 67.0 bits (162), Expect = 2e-08 Identities = 34/91 (37%), Positives = 48/91 (52%), Gaps = 4/91 (4%) Frame = +3 
Query: 96 LRTKTKSKHSPACPRCG-LDETCIHAIRDCAFIQNTWQHIGGRALHPLFFASNMGTWQED 272 LR + +CPRC L+ETC+HA+RDC + W+ + + L P FF ++ W E Sbjct: 1101 LRRYRHTSMDSSCPRCPELEETCLHALRDCPKVAAFWRSVLPKKLAPKFFNGDVAVWLET 1160 Query: 273 NLKRKNG---WSTLFGNMIWHSWKNRNRLVF 356 NL W T FG + W++RN LVF Sbjct: 1161 NLSFSEAAFFWPTFFGIAVELLWESRNDLVF 1191 >gb|EOY24339.1| RNA-directed DNA polymerase (Reverse transcriptase), Polynucleotidyl transferase, Ribonuclease H fold-like protein [Theobroma cacao] Length = 616 Score = 66.6 bits (161), Expect = 3e-08 Identities = 46/147 (31%), Positives = 63/147 (42%), Gaps = 9/147 (6%) Frame = +3 Query: 123 SPACPRCGL-DETCIHAIRDCAFIQNTWQHIGGRALHPLFFASNMGTWQEDNLKRKN--- 290 S C C + DE+ +H +RDC + W +G R + FF + W NLK N Sbjct: 378 SATCALCSVSDESVLHLLRDCPHSKEVWLKLGSRMGYGNFFDLLLSDWLLTNLKNYNVCV 437 Query: 291 ---GWSTLFGNMIWHSWKNRNRLVFENETEPADRLICKIK--MAALNFVAEDYKRKTKLQ 455 W LFG W+ WK RN VFE + P DR + IK +AA + ++L Sbjct: 438 DGIPWVILFGFTCWYIWKWRNVKVFEGKLIPMDRKLSMIKGLVAASYHAVQIPCTHSRLN 497 Query: 456 GILAYQXXXXXXXXXXXXGWVVRSTSG 536 G Y+ GWV +T G Sbjct: 498 G---YKREMLVGWQNPPQGWVAVNTDG 521 >gb|EOY25852.1| Non-LTR retroelement reverse transcriptase [Theobroma cacao] Length = 1011 Score = 66.2 bits (160), Expect = 4e-08 Identities = 33/113 (29%), Positives = 55/113 (48%), Gaps = 7/113 (6%) Frame = +3 Query: 99 RTKTKSKHSPACPRC-GLDETCIHAIRDCAFIQNTWQHIGGRALHPLFFASNMGTWQEDN 275 R + K +CP C G++ETC+H +RDC ++ W+ I ++ FF + W N Sbjct: 743 RVRRKMSSDASCPHCYGVEETCLHVLRDCPALETLWRRILPQSGINQFFQIPLIDWLSSN 802 Query: 276 LKRKN------GWSTLFGNMIWHSWKNRNRLVFENETEPADRLICKIKMAALN 416 L KN W+ + G W++WK RN +FE + + I+ A++ Sbjct: 803 LNLKNLYVFDVPWNIVLGITCWYTWKWRNLFIFEGRELSVEGRLSIIRSVAVD 855 >gb|EOY25817.1| Non-LTR retroelement reverse transcriptase [Theobroma cacao] Length = 874 Score = 66.2 bits (160), Expect = 4e-08 Identities = 31/94 (32%), Positives = 46/94 (48%), Gaps = 7/94 (7%) Frame = +3 Query: 99 RTKTKSKHSPACPRC-GLDETCIHAIRDCAFIQNTWQHIGGRALHPLFFASNMGTWQEDN 275 R + K +CP C G++ETC+H +RDC + W+ I + FF + 
W N Sbjct: 615 RVRRKMSSDASCPHCYGVEETCLHVLRDCLASETLWRRILPESGINQFFQIPLIDWLSSN 674 Query: 276 LKRKN------GWSTLFGNMIWHSWKNRNRLVFE 359 L KN W+ +FG W++WK N +FE Sbjct: 675 LNLKNLYVFDVPWNIVFGTTCWYTWKRSNLFIFE 708 >gb|EOY19161.1| Polynucleotidyl transferase, putative [Theobroma cacao] Length = 419 Score = 65.5 bits (158), Expect = 7e-08 Identities = 59/263 (22%), Positives = 102/263 (38%), Gaps = 21/263 (7%) Frame = +3 Query: 153 ETCIHAIRDCAFIQNTWQHIGGRALHPLFFASNMGTWQEDNLKRKN-----GWSTLFGNM 317 ET +HA+RDC + W + FF+ + W NL K+ W+ +F + Sbjct: 141 ETLVHALRDCGKSKLLWLQLRPNIHSSDFFSEELKPWVLKNLACKDPVEGIPWAIIFIHA 200 Query: 318 IWHSWKNRNRLVFENE----TEPADRLICKIKMAALNFVAEDYKRKTKLQGILAYQ---- 473 IW W RN +F+ ++ K K A E+++ K ++ ++A++ Sbjct: 201 IWLLWFWRNMNLFDKSFIWPANATKQVWTKAKEAWDTLGKENHRLKQEV--LIAWEKPKN 258 Query: 474 --------XXXXXXXXXXXXGWVVRSTSGECILLYSSFFGICXXXXXXXXXXXXXXXXXM 629 G V+R G I + GI Sbjct: 259 GYVKLNVDGSAKGQPGLAASGGVIRDEYGNWIAGFCQKIGITFSLTAEPWGIYQGLTLCW 318 Query: 630 NRCFPKVLVEMDSLEVVQLLTQDMPSTHSDFYLIKICRELVCNPGWDVQFTHVPRSHNNV 809 NR K VE+DS+ +Q + + L++ +EL+ WDV +HV R + Sbjct: 319 NRGLRKFCVEIDSMLALQKIYSQSSMLDPNAQLLRRIKELL-QQSWDVTISHVHREADQC 377 Query: 810 ADYLANFSFSHQLGMVIHDHPPH 878 D++ + +LG+ I ++PPH Sbjct: 378 TDWMTTHIENLKLGLHIFEYPPH 400 >ref|XP_006468253.1| PREDICTED: uncharacterized protein LOC102614777 [Citrus sinensis] Length = 320 Score = 58.9 bits (141), Expect = 7e-06 Identities = 38/133 (28%), Positives = 66/133 (49%), Gaps = 1/133 (0%) Frame = +3 Query: 510 GWVVRSTSGECILLYSSFFGICXXXXXXXXXXXXXXXXXMNRCFPKVLVEMDSLEVVQLL 689 G ++R G + YS+ G+C F +V VE+DS+ V++L+ Sbjct: 178 GGLIRDFRGVWQVGYSANLGVCSVTSTELWGLFHGLSIAWQYGFRRVYVEVDSMCVMRLI 237 Query: 690 TQDMPSTHSDFYLIKICRELVCNPGWDVQFTHVPRSHNNVADYLANFSFSHQLGM-VIHD 866 + P + F LI+ + L+ W + H+ R N VAD+LA+++FS LG+ Sbjct: 238 SNPNPPINEHFTLIQEIQALL-RRDWLTKVEHIYREANEVADFLASYNFSFSLGLHCFQF 296 Query: 867 HPPHVMQLLEADV 905 +PP+++ +L DV Sbjct: 297 NPPNMLSILTNDV 309