BLASTX nr result
ID: Rehmannia22_contig00005714
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia22_contig00005714
         (900 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_006579213.1| PREDICTED: uncharacterized protein LOC102670...   211   4e-52
ref|XP_006584238.1| PREDICTED: uncharacterized protein LOC102664...   200   5e-49
ref|XP_006582615.1| PREDICTED: uncharacterized protein LOC102665...   198   3e-48
emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulga...   183   7e-44
gb|AAF87143.1|AC002423_8 T23E23.16 [Arabidopsis thaliana]             178   3e-42
emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulga...   168   3e-39
ref|XP_006605183.1| PREDICTED: uncharacterized protein LOC102663...   163   7e-38
ref|XP_006590026.1| PREDICTED: uncharacterized protein LOC102660...   163   9e-38
ref|XP_006584390.1| PREDICTED: putative ribonuclease H protein A...   162   2e-37
ref|XP_006579104.1| PREDICTED: uncharacterized protein LOC102661...   155   2e-35
ref|XP_006584200.1| PREDICTED: putative ribonuclease H protein A...   148   3e-33
gb|AAC95175.1| putative non-LTR retroelement reverse transcripta...   143   8e-32
gb|AAD08951.1| putative reverse transcriptase [Arabidopsis thali...   142   2e-31
ref|XP_004253224.1| PREDICTED: uncharacterized protein LOC101268...   132   2e-28
ref|XP_004293181.1| PREDICTED: uncharacterized protein LOC101298...   131   3e-28
ref|XP_006586426.1| PREDICTED: putative ribonuclease H protein A...   129   1e-27
emb|CAB45965.1| putative reverse transcriptase [Arabidopsis thal...   127   6e-27
ref|XP_004173733.1| PREDICTED: uncharacterized protein LOC101232...   126   1e-26
gb|AAC63678.1| putative non-LTR retroelement reverse transcripta...
125 3e-26 gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana] 122 2e-25 >ref|XP_006579213.1| PREDICTED: uncharacterized protein LOC102670237 [Glycine max] Length = 383 Score = 211 bits (536), Expect = 4e-52 Identities = 106/273 (38%), Positives = 158/273 (57%), Gaps = 12/273 (4%) Frame = -3 Query: 865 SYINKWTANSLSYAGRLLLIKSVLQGVECFWLQIFPLPSTVIDRINRLCRVFLWGKNTSP 686 S ++W+ SLSYAG++ LI++V+QG+ FW+ IFPLP +V+D I CR FLWGK Sbjct: 106 SISSRWSRKSLSYAGKVELIRAVIQGIANFWMSIFPLPQSVLDTIIATCRNFLWGKADGG 165 Query: 685 -----IKWSKVCLPFDEGGLGLRDVHSWNKALLAKILWNIHSKADSLWVRWIHSFYLKNQ 521 + WS+VC P EGGLGL ++ WN ALL+ ILW++HSK DSLWVR +H +Y K Sbjct: 166 KIKPLVAWSEVCTPKKEGGLGLFNLKDWNIALLSCILWDLHSKKDSLWVRLVHHYYFKGG 225 Query: 520 SIWTWDPKKSDSCLLKRICDIRNEILAKFGSPDLAISNLSLFANSKGLCSSKMYDLFREA 341 ++W + SDS + IR+ I++K + ++A L+ + ++ + KMYD R Sbjct: 226 NVWDFISSSSDSVFI----HIRDIIISKEENIEVAKLMLNSWGCNEQTLAGKMYDYIRGT 281 Query: 340 GPKTFWHSAVWKNFIPPKYSFCVWLAFNDRLATINNLKYLDVDPTCKLCGNYLENASHLF 161 P W S +W IP K SF +WLA +RL ++ +L+ C LC N E+ +HLF Sbjct: 282 RPVVHWSSIIWNPVIPSKMSFILWLATKNRLLALDRAAFLNKGFLCPLCTNEAESHAHLF 341 Query: 160 FDCIVTRLLWDRVKKW-------LKFSHSMSTI 83 F C + +W ++ W + HS+S + Sbjct: 342 FSCRTSLRVWAHIRDWIPLKRQSISLQHSISAL 374 >ref|XP_006584238.1| PREDICTED: uncharacterized protein LOC102664824 [Glycine max] Length = 939 Score = 200 bits (509), Expect = 5e-49 Identities = 108/288 (37%), Positives = 161/288 (55%), Gaps = 7/288 (2%) Frame = -3 Query: 895 HYAPLYDRIASYINKWTANSLSYAGRLLLIKSVLQGVECFWLQIFPLPSTVIDRINRLCR 716 HY L D+I I W+A LSYAGR+ LI+SV+ FW+Q PLP VI RIN +CR Sbjct: 598 HYQVLIDKIVGRITHWSAGLLSYAGRVQLIQSVIFATINFWMQCLPLPKFVIMRINAICR 657 Query: 715 VFLWGKNT-----SPIKWSKVCLPFDEGGLGLRDVHSWNKALLAKILWNIHSKADSLWVR 551 FLW N+ SPI W KVC P GGL + ++ WNK + K+LWN+ +K+D+LW++ Sbjct: 658 SFLWIGNSNISRKSPIAWEKVCSPKINGGLNIINLAIWNKISILKLLWNVCNKSDNLWIK 717 Query: 550 WIHSFYLKNQSIWTWDPKKSDSCLLKRICDIRNEILAKFGSPDLAISNLSLFANSKGLCS 371 W+H++Y++ QSIW+ KKS S ++ + +R +L ++ S + + Sbjct: 718 
WLHTYYIRGQSIWSMVLKKSHSWIMSSMMKLR-PLLLQYQSRMQDVFKM----------- 765 Query: 370 SKMYDLFREAGPKTFWHSAVWKNFIPPKYSFCVWLAFNDRLATINNL-KY-LDVDPTCKL 197 K+Y E K W + + N P+ FC+W A + RLA+ + L K+ L+VD C Sbjct: 766 KKIYLALFEESEKMSWRTLMCNNLARPRALFCLWQACHFRLASKDRLIKFGLNVDANCAF 825 Query: 196 CGNYLENASHLFFDCIVTRLLWDRVKKWLKFSHSMSTIAGALKWTKKE 53 C + +E+ HLFF CI + +W V WL+ H ST + L W ++ Sbjct: 826 CSS-MESHEHLFFGCIELKTIWTAVLNWLQIIHMPSTWSEELNWITRK 872 >ref|XP_006582615.1| PREDICTED: uncharacterized protein LOC102665746 [Glycine max] Length = 506 Score = 198 bits (503), Expect = 3e-48 Identities = 102/284 (35%), Positives = 156/284 (54%), Gaps = 7/284 (2%) Frame = -3 Query: 895 HYAPLYDRIASYINKWTANSLSYAGRLLLIKSVLQGVECFWLQIFPLPSTVIDRINRLCR 716 HY+PL D+I I WTA LSYAGRL L+ SV+ + +WL FP P +V+ +I +CR Sbjct: 156 HYSPLIDKIVGKIKHWTARLLSYAGRLQLVNSVMFALTNYWLNCFPFPKSVLQKIEAICR 215 Query: 715 VFLW-----GKNTSPIKWSKVCLPFDEGGLGLRDVHSWNKALLAKILWNIHSKADSLWVR 551 +FLW G SP+ W ++C P GGL + D+ WNKA L K+LWN+ SK DSLWV+ Sbjct: 216 IFLWTGGFEGSRKSPVAWKQICSPRSCGGLNIIDIDIWNKANLMKLLWNLSSKEDSLWVK 275 Query: 550 WIHSFYLKNQSIWTWDPKKSDSCLLKRICDIRNEILAKFGSPDLAISNLSLFANSKGLCS 371 WI ++Y+K + + K +DS ++K I R ++ I N+ + Sbjct: 276 WIQAYYVKRSELMHIEMKNTDSWIMKAILKQREDL--------EKIDNMEELMIRGSINM 327 Query: 370 SKMYDLFREAGPKTFWHSAVWKNFIPPKYSFCVWLAFNDRLATINNL-KY-LDVDPTCKL 197 K+Y ++ G + W + ++ N P+ +F +WLA + RL+T + L KY + D +C Sbjct: 328 GKLYRKLQDCGQRKEWKNLLYGNTARPRANFILWLACHGRLSTKDRLCKYGMIDDKSCCF 387 Query: 196 CGNYLENASHLFFDCIVTRLLWDRVKKWLKFSHSMSTIAGALKW 65 C E+ +HLFF C ++ +W V +W++ H S L W Sbjct: 388 CSEE-ESMNHLFFVCDNSKRVWMEVLQWVQIRHDPSDWPNELHW 430 >emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. 
vulgaris] Length = 1114 Score = 183 bits (465), Expect = 7e-44 Identities = 99/266 (37%), Positives = 147/266 (55%), Gaps = 7/266 (2%) Frame = -3 Query: 886 PLYDRIASYINKWTANSLSYAGRLLLIKSVLQGVECFWLQIFPLPSTVIDRINRLCRVFL 707 PL D+I + W A+ LSYAGRL L+K++L ++ +W QIFPLP +I + CR FL Sbjct: 776 PLIDKITTRAQGWVAHLLSYAGRLQLVKTILYSMQNYWGQIFPLPKKLIKAVETTCRKFL 835 Query: 706 WGKNT-----SPIKWSKVCLPFDEGGLGLRDVHSWNKALLAKILWNIHSKADSLWVRWIH 542 W +P+ W + P GGL + ++ WNKA + K+LW I K D LWVRW++ Sbjct: 836 WTGTVDTSYKAPVAWDFLQQPKSTGGLNVTNMVLWNKAAILKLLWAITFKQDKLWVRWVN 895 Query: 541 SFYLKNQSIWTWDPKKSDSCLLKRICDIRNEILAKFGSPDLAISNLSLFANSKGLCSSKM 362 ++Y+K Q+I + S +L++I + R E+L + G + A+SN F+ K Sbjct: 896 AYYIKRQNIENVTVSSNTSWILRKIFESR-ELLTRTGGWE-AVSNHMNFS------IKKT 947 Query: 361 YDLFREAGPKTFWHSAVWKNFIPPKYSFCVWLAFNDRLATINNLK--YLDVDPTCKLCGN 188 Y L +E W + N PK F +WLA +RLAT + DV P CK+CGN Sbjct: 948 YKLLQEDYENVVWKRLICNNKATPKSQFILWLAMLNRLATAERVSRWNRDVSPLCKMCGN 1007 Query: 187 YLENASHLFFDCIVTRLLWDRVKKWL 110 +E HLFF+CI ++ +W +V +L Sbjct: 1008 EIETIQHLFFNCIYSKEIWGKVLLYL 1033 >gb|AAF87143.1|AC002423_8 T23E23.16 [Arabidopsis thaliana] Length = 653 Score = 178 bits (451), Expect = 3e-42 Identities = 103/287 (35%), Positives = 144/287 (50%), Gaps = 10/287 (3%) Frame = -3 Query: 892 YAPLYDRIASYINKWTANSLSYAGRLLLIKSVLQGVECFWLQIFPLPSTVIDRINRLCRV 713 Y+PL + I I WT LSYAGRL LI SVL + FWL F LP I I+++C Sbjct: 313 YSPLLEHIKKKIGTWTTRYLSYAGRLNLITSVLWSICNFWLAAFRLPRECIREIDKICSA 372 Query: 712 FLW-GKNTSPIK----WSKVCLPFDEGGLGLRDVHSWNKALLAKILWNIHSKADSLWVRW 548 FLW G + +P K W VC P EGGLGLR + N+ K++W I S +SLWVRW Sbjct: 373 FLWSGPDLNPRKTRVCWGDVCKPKQEGGLGLRSLKEMNEVSCLKLIWRIVSHTNSLWVRW 432 Query: 547 IHSFYLKNQSIWTWDPKKSDSCLLKRICDIRNEILAKFGSPDLAISNLSLFANSKGLCSS 368 I + LK+ + W+ + +L R +E + KF + D Sbjct: 433 IEQYLLKHDTFWSVQTTTNMDSVLWR--GRNDEYMPKFSTRD------------------ 472 Query: 367 KMYDLFREAGPKTFWHSAVWKNFIPPKYSFCVWLAFNDRLATINNLKYLD--VDPTCKLC 194 ++ R WH +W PK+SFC WLA +RL+T + + + + PTC LC Sbjct: 473 
-TWNQTRNTSTPVTWHMGIWFAHATPKFSFCAWLAVQNRLSTGDKMLQWNRRLSPTCVLC 531 Query: 193 GNYLENASHLFFDCIVTRLLWDRVKKWL---KFSHSMSTIAGALKWT 62 N +E +HLFF C T +W+ + K + KFS + STI ++ T Sbjct: 532 NNNIETRNHLFFSCCYTAEIWENLAKNIYKAKFSTNWSTILTSVSTT 578 >emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulgaris] Length = 1110 Score = 168 bits (425), Expect = 3e-39 Identities = 91/271 (33%), Positives = 138/271 (50%), Gaps = 7/271 (2%) Frame = -3 Query: 886 PLYDRIASYINKWTANSLSYAGRLLLIKSVLQGVECFWLQIFPLPSTVIDRINRLCRVFL 707 PL + I + W A LSYAGRL LIKS+L ++ +W IFPL VI + ++CR FL Sbjct: 773 PLVEMITNRAQTWMAKLLSYAGRLQLIKSILSSMQNYWAHIFPLSKKVIQAVEKVCRKFL 832 Query: 706 WGKNT-----SPIKWSKVCLPFDEGGLGLRDVHSWNKALLAKILWNIHSKADSLWVRWIH 542 W T +P+ W+ + P GG + ++ WN+A + K+LW I K D LWVRWIH Sbjct: 833 WTGKTEETKKAPVAWATIQRPKSRGGWNVINMKYWNRAAMLKLLWAIEFKRDKLWVRWIH 892 Query: 541 SFYLKNQSIWTWDPKKSDSCLLKRICDIRNEILAKFGSPDLAISNLSLFANSKGLCSSKM 362 S+Y+K Q I T + + +L++I R+ L+ G D + K K Sbjct: 893 SYYIKRQDILTVNISNQTTWILRKIVKARDH-LSNIGDWD------EICIGDK-FSMKKA 944 Query: 361 YDLFREAGPKTFWHSAVWKNFIPPKYSFCVWLAFNDRLATINNLKYLDV--DPTCKLCGN 188 Y E G + W + N+ PK F +W+ ++RL T++ + V D +LC N Sbjct: 945 YKKISENGERVRWRRLICNNYATPKSKFILWMMLHERLPTVDRISRWGVQCDLNYRLCRN 1004 Query: 187 YLENASHLFFDCIVTRLLWDRVKKWLKFSHS 95 E HLFF C + +W ++ ++F +S Sbjct: 1005 DGETIQHLFFSCSYSAGVWSKICYIMRFPNS 1035 >ref|XP_006605183.1| PREDICTED: uncharacterized protein LOC102663533 [Glycine max] Length = 514 Score = 163 bits (413), Expect = 7e-38 Identities = 86/267 (32%), Positives = 139/267 (52%), Gaps = 11/267 (4%) Frame = -3 Query: 898 NHYAPLYDRIASYINKWTANSLSYAGRLLLIKSVLQGVECFWLQIFPLPSTVIDRINRLC 719 +HY PL ++I I W++ LS AGR+ L++S++ + +W+ +FP+P VI +I+ +C Sbjct: 258 HHYLPLVEKIVGKIRHWSSKLLSIAGRIQLVRSIITAIAQYWMSVFPMPKKVIQKIDSIC 317 Query: 718 RVFLWG-----KNTSPIKWSKVCLPFDEGGLGLRDVHSWNKALLAKILWNIHSKADSLWV 554 R F+W K S + W +VC P GGL L ++ WN + K LWNI SK D+LWV Sbjct: 318 RSFIWSGSAEVKRKSLVAWKQVCKPARCGGLNLINLELWNVTAMLKCLWNICSKEDNLWV 377 Query: 553 
RWIHSFYLKNQSIWTWDPKKSDSCLLKRICDIRNEILAKFGSPDLAISNLSL----FANS 386 +WIH+++LK ++ + K + + +LK + R + ++NL L Sbjct: 378 KWIHAYFLKGDNVMSATIKSNSTWILKSVMKQRPQ-----------VNNLQLVWIEMLRK 426 Query: 385 KGLCSSKMYDLFREAGPKTFWHSAVWKNFIPPKYSFCVWLAFNDRLATINNLKYLDV--D 212 + ++Y E K W + N P+ + +WLA +RLAT LK +++ Sbjct: 427 RKFSMKQVYMELVEDHNKIDWFRLLRYNRARPRANVTLWLACQNRLATKTRLKNMNMIQC 486 Query: 211 PTCKLCGNYLENASHLFFDCIVTRLLW 131 C LC E+ HL F C VT+ +W Sbjct: 487 SLCSLCKEQDEDLDHLMFSCRVTKAIW 513 >ref|XP_006590026.1| PREDICTED: uncharacterized protein LOC102660482 [Glycine max] Length = 303 Score = 163 bits (412), Expect = 9e-38 Identities = 83/188 (44%), Positives = 114/188 (60%), Gaps = 5/188 (2%) Frame = -3 Query: 895 HYAPLYDRIASYINKWTANSLSYAGRLLLIKSVLQGVECFWLQIFPLPSTVIDRINRLCR 716 HYAPL +I I W+ SLSYAG+L LI++V+QG+ FW+ IFPLP +V+DRIN CR Sbjct: 108 HYAPLLSKITGLIQGWSRKSLSYAGKLELIRAVIQGIVNFWIGIFPLPQSVLDRINASCR 167 Query: 715 VFLW-----GKNTSPIKWSKVCLPFDEGGLGLRDVHSWNKALLAKILWNIHSKADSLWVR 551 FLW GK + WS VC P EGGLGL ++ WN ALL+ ILW+ H K DSLWV Sbjct: 168 NFLWGKADIGKKKPLVAWSVVCSPKREGGLGLFNLKDWNLALLSCILWDFHCKKDSLWV- 226 Query: 550 WIHSFYLKNQSIWTWDPKKSDSCLLKRICDIRNEILAKFGSPDLAISNLSLFANSKGLCS 371 H +Y + +W ++ S S L+K+I IR+ I++K S + A + + + L Sbjct: 227 --HHYYFRRSDVWNYNTSSSYSVLIKKIIQIRDFIISKELSTEEAKKRIQSWRTNGQLLV 284 Query: 370 SKMYDLFR 347 K+Y+ R Sbjct: 285 GKVYEYIR 292 >ref|XP_006584390.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Glycine max] Length = 316 Score = 162 bits (409), Expect = 2e-37 Identities = 77/158 (48%), Positives = 102/158 (64%), Gaps = 5/158 (3%) Frame = -3 Query: 895 HYAPLYDRIASYINKWTANSLSYAGRLLLIKSVLQGVECFWLQIFPLPSTVIDRINRLCR 716 HYAPL +I I W SLSY G+L LIK+V+QG+ FW++IFPLP +V+DRIN C Sbjct: 141 HYAPLLYKIVGLIQGWNKKSLSYVGKLELIKAVIQGIMNFWMRIFPLPQSVLDRINASCC 200 Query: 715 VFLW-----GKNTSPIKWSKVCLPFDEGGLGLRDVHSWNKALLAKILWNIHSKADSLWVR 551 FLW GKN + W VC P EGGLGL ++ WN ALL+ ILW+ H K DSL VR Sbjct: 201 NFLWSKADIGKNKPLVAWPVVCSPKQEGGLGLFNLKDWNLALLSHILWDFHCKKDSLRVR 260 Query: 
550 WIHSFYLKNQSIWTWDPKKSDSCLLKRICDIRNEILAK 437 W+H +Y + W ++ S+S L+K+I IR+ I++K Sbjct: 261 WVHHYYFRRSDEWNYNISSSNSVLIKKIIQIRDFIISK 298 >ref|XP_006579104.1| PREDICTED: uncharacterized protein LOC102661523 [Glycine max] Length = 947 Score = 155 bits (392), Expect = 2e-35 Identities = 86/273 (31%), Positives = 132/273 (48%), Gaps = 7/273 (2%) Frame = -3 Query: 895 HYAPLYDRIASYINKWTANSLSYAGRLLLIKSVLQGVECFWLQIFPLPSTVIDRINRLCR 716 +Y PL D+I + I WT+ L+ GR+ ++ + + FW+Q P+P +VI +I+ +CR Sbjct: 598 YYLPLIDKITTRIRHWTSKLLNMTGRVQMVNCTITAIVQFWMQCLPIPMSVIKKIDSMCR 657 Query: 715 VFLWGKNT-----SPIKWSKVCLPFDEGGLGLRDVHSWNKALLAKILWNIHSKADSLWVR 551 F+W ++T SPI W+ VC P +GGL + ++ WN + LWN+ K D+LWV+ Sbjct: 658 SFVWSRSTEITRKSPIAWNSVCRPKGQGGLNIFNLKVWNHITVLNCLWNLCKKVDNLWVK 717 Query: 550 WIHSFYLKNQSIWTWDPKKSDSCLLKRICDIRNEILAKFGSPDLAISNLSLFANSKGLCS 371 WIH+ Y+KN S+ + S +LK + R I D + NS+ Sbjct: 718 WIHAHYIKNSSVMNTMVTNNFSWVLKNVLSQREYIHTLQPVWDELL-------NSERFKM 770 Query: 370 SKMYDLFREAGPKTFWHSAVWKNFIPPKYSFCVWLAFNDRLATINNLKYLDV--DPTCKL 197 K YD EA + W + KN P+ WLA + RL T + L + D L Sbjct: 771 KKAYDKMMEA-DRVHWSGLMRKNCARPRAIHTTWLACHGRLGTKDRLVRFGMITDKIWSL 829 Query: 196 CGNYLENASHLFFDCIVTRLLWDRVKKWLKFSH 98 C E +H+ F C V +W V + H Sbjct: 830 CKEVEETQNHILFSCKVATDIWSNVLNRIGIDH 862 >ref|XP_006584200.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Glycine max] Length = 239 Score = 148 bits (373), Expect = 3e-33 Identities = 68/132 (51%), Positives = 88/132 (66%), Gaps = 5/132 (3%) Frame = -3 Query: 895 HYAPLYDRIASYINKWTANSLSYAGRLLLIKSVLQGVECFWLQIFPLPSTVIDRINRLCR 716 HYAPL +I I W+ SLSYAG+L LI++V+QG+ FW++IFPL +V+DRIN C Sbjct: 108 HYAPLLSKITGLIQGWSRKSLSYAGKLELIRAVIQGIVNFWMKIFPLSQSVLDRINASCC 167 Query: 715 VFLW-----GKNTSPIKWSKVCLPFDEGGLGLRDVHSWNKALLAKILWNIHSKADSLWVR 551 FLW GKN S I WS VC P EGGLGL ++ WN LL++ILW+ H K D LWVR Sbjct: 168 NFLWGKADIGKNKSLIAWSVVCSPKKEGGLGLFNLKDWNLTLLSRILWDFHCKKDFLWVR 227 Query: 550 WIHSFYLKNQSI 515 W+H +Y + + Sbjct: 228 WVHHYYFRASDV 239 >gb|AAC95175.1| putative non-LTR 
retroelement reverse transcriptase [Arabidopsis thaliana] Length = 1352 Score = 143 bits (361), Expect = 8e-32 Identities = 99/344 (28%), Positives = 140/344 (40%), Gaps = 83/344 (24%) Frame = -3 Query: 892 YAPLYDRIASYINKWTANSLSYAGRLLLIKSVLQGVECFWLQIFPLPSTVIDRINRLCRV 713 Y PL ++I + I WT LS+AGRL LIKSVL + FWL +F LP + I ++ Sbjct: 933 YLPLVEKIRARITSWTNRFLSFAGRLQLIKSVLSSITNFWLSVFRLPKACLQEIEKMFSA 992 Query: 712 FLWGK---NTSPIK--WSKVCLPFDEGGLGLRDVHSWNKALLAKILWNIHSKADSLWVRW 548 FLW NT K WS+VC +EGGLGL+ + N+ L K++W I S DSLWV+W Sbjct: 993 FLWSGPDLNTKKAKIAWSEVCKLKEEGGLGLKPLKEANEVSLLKLIWRILSARDSLWVKW 1052 Query: 547 IHSFYLKNQSI-----------WTWDP--KKSDSCLLKRICDIRNEILAKF--------- 434 ++ ++ ++ W W K+ D L ++R+ F Sbjct: 1053 VNKHLIRKETFWSVKENTGLGSWLWRKILKQRDKARLFHRMEVRSGTFTSFWHDHWCPLG 1112 Query: 433 ---------GSPDLAISNLSLFAN------------------------------------ 389 G+ DL I N + A Sbjct: 1113 RLHQHMGSRGTIDLGIPNNATVAEVMNTHRRKRHRADFLNQIKSQIELARQDRSTDGDRS 1172 Query: 388 ---------SKGLCSSKMYDLFREAGPKTFWHSAVWKNFIPPKYSFCVWLAFNDRLATIN 236 SSK + R + W+ VW + PKYSF WLAF++RL T + Sbjct: 1173 LWKQKEDTFKSSFSSSKTWQQIRSISLRCDWYRGVWFSASTPKYSFVTWLAFHNRLTTSD 1232 Query: 235 NLKYLDVDP--TCKLCGNYLENASHLFFDCIVTRLLWDRVKKWL 110 + + C CG LE HLFF C + +W + K L Sbjct: 1233 KICKWNSGARYDCVFCGEELETRDHLFFSCPYSSHVWFSLTKGL 1276 >gb|AAD08951.1| putative reverse transcriptase [Arabidopsis thaliana] gi|20197043|gb|AAM14892.1| putative reverse transcriptase [Arabidopsis thaliana] Length = 1412 Score = 142 bits (358), Expect = 2e-31 Identities = 93/275 (33%), Positives = 129/275 (46%), Gaps = 16/275 (5%) Frame = -3 Query: 886 PLYDRIASYINKWTANSLSYAGRLLLIKSVLQGVECFWLQIFPLPSTVIDRINRLCRVFL 707 PL ++I S I+ W LSYAGRL L+ SV+ + FW+ F LP I I ++ FL Sbjct: 1043 PLLEKIRSRISSWKNRFLSYAGRLQLLNSVISSLTKFWISAFRLPRACIREIEQISAAFL 1102 Query: 706 W-GKNTSP----IKWSKVCLPFDEGGLGLRDVHSWNKALLAKILWNIHSKADSLWVRWIH 542 W G + +P + W VC P EGGLGLR + NK K++W + S SLWV WI Sbjct: 1103 WSGTDLNPHKAKVAWHDVCKPKSEGGLGLRSLVDANKICCFKLIWRLVSAKHSLWVNWI- 1161 Query: 541 
SFYLKNQSIWTWDPKKSDSCLLKRICDIRNEI---LAKFGSPDLAISNLSLFANSKG--- 380 +N I T S DI N+I L K + S G Sbjct: 1162 ----QNNLIRTVAEALSSHRRRSHRDDILNDIEEELEKLLCRGICTEQDRSLCRSIGGQF 1217 Query: 379 ---LCSSKMYDLFREAGPKTFWHSAVWKNFIPPKYSFCVWLAFNDRLATINNLKYLD--V 215 S +++ RE G WH A+W + PK++F WLA +DRL T + + + + Sbjct: 1218 KAKFFSPEIWHQIREQGLVKQWHKAIWFSGATPKFTFISWLAAHDRLTTGDKMASWNRGI 1277 Query: 214 DPTCKLCGNYLENASHLFFDCIVTRLLWDRVKKWL 110 C LC E+ HLFF C + +WDR+ + L Sbjct: 1278 SSVCVLCNISAESRDHLFFSCNFSSHIWDRLTRRL 1312 >ref|XP_004253224.1| PREDICTED: uncharacterized protein LOC101268376 [Solanum lycopersicum] Length = 717 Score = 132 bits (332), Expect = 2e-28 Identities = 60/139 (43%), Positives = 89/139 (64%), Gaps = 5/139 (3%) Frame = -3 Query: 886 PLYDRIASYINKWTANSLSYAGRLLLIKSVLQGVECFWLQIFPLPSTVIDRINRLCRVFL 707 PL +++ + IN WTA LSYAGR L+K+VL GV+ W Q+F +P+ +I I LCR +L Sbjct: 581 PLIEKVMARINSWTAKKLSYAGRAQLVKTVLFGVQALWAQLFIIPAKIIKLIEGLCRSYL 640 Query: 706 WG-----KNTSPIKWSKVCLPFDEGGLGLRDVHSWNKALLAKILWNIHSKADSLWVRWIH 542 W + I W KVC P EGGLGL ++ WN++ + K+ W++ +K D LW++WIH Sbjct: 641 WSGVGYVTKKALIAWDKVCSPKYEGGLGLINLKIWNRSAVTKLCWDLANKEDKLWIKWIH 700 Query: 541 SFYLKNQSIWTWDPKKSDS 485 ++Y+K Q W KKS++ Sbjct: 701 AYYIKGQREW----KKSNT 715 >ref|XP_004293181.1| PREDICTED: uncharacterized protein LOC101298394 [Fragaria vesca subsp. 
vesca] Length = 958 Score = 131 bits (330), Expect = 3e-28 Identities = 85/265 (32%), Positives = 123/265 (46%), Gaps = 9/265 (3%) Frame = -3 Query: 889 APLYDRIASYINKWTANSLSYAGRLLLIKSVLQGVECFWLQIFPLPSTVIDRINRLCRVF 710 +PL DRI + I W LS+AGRL LI+SVL ++ +W LP V+ I + R F Sbjct: 609 SPLLDRIETRIKSWENKVLSFAGRLQLIQSVLSSIQVYWASHLILPKKVLKDIEKRLRCF 668 Query: 709 LWGKNTS-----PIKWSKVCLPFDEGGLGLRDVHSWNKALLAKILWNIHSKADSLWVRWI 545 LW N S + WS++CLP EGGLG++D+H WNKAL+ +WN+ S + + W W+ Sbjct: 669 LWAGNCSGRAATKVAWSEICLPKCEGGLGIKDLHCWNKALMISHIWNLVSSSSNFWTDWV 728 Query: 544 HSFYLKNQSIWTWDPKKSDSCLLKRICDIRNEILAKFGSPDLAISNLSLFAN--SKGLCS 371 + LK S W S +++ IR E+ F F N G + Sbjct: 729 KVYLLKGNSFWNAPLPSICSWNWRKLLKIR-ELCCSF------------FVNIIGDGRAT 775 Query: 370 SKMYDLFREAGPKTF-WHSAVWKNFIPPKYSFCVWLAFNDRLATINNLK-YLDVDPTCKL 197 S +D + GP T W S + K + F + N L+ + P +L Sbjct: 776 SLWFDNWHPLGPLTLRWSSNIIGESGLSKSAMLTPNGFYSTSSAWNTLRPSRFIVPWYRL 835 Query: 196 CGNYLENASHLFFDCIVTRLLWDRV 122 E +HLFFDC + +W V Sbjct: 836 VWFVAETHNHLFFDCAYSFGIWTHV 860 >ref|XP_006586426.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Glycine max] Length = 192 Score = 129 bits (325), Expect = 1e-27 Identities = 62/118 (52%), Positives = 79/118 (66%), Gaps = 5/118 (4%) Frame = -3 Query: 895 HYAPLYDRIASYINKWTANSLSYAGRLLLIKSVLQGVECFWLQIFPLPSTVIDRINRLCR 716 HYA L +I I W+ SLSYAG+L LI++V+QG+ FW++IF LP +V+D IN CR Sbjct: 65 HYALLLSKITGLIQGWSKKSLSYAGKLELIRAVIQGIVNFWMEIFSLPQSVMDWINASCR 124 Query: 715 VFLW-----GKNTSPIKWSKVCLPFDEGGLGLRDVHSWNKALLAKILWNIHSKADSLW 557 FLW GKN + WS VC P EGGLGL ++ WN ALL++ILW+ H K DSLW Sbjct: 125 NFLWGKADIGKNKPLVAWSVVCSPKKEGGLGLLNLKDWNLALLSRILWDFHCKKDSLW 182 >emb|CAB45965.1| putative reverse transcriptase [Arabidopsis thaliana] gi|7267919|emb|CAB78261.1| putative reverse transcriptase [Arabidopsis thaliana] Length = 662 Score = 127 bits (319), Expect = 6e-27 Identities = 68/190 (35%), Positives = 97/190 (51%), Gaps = 5/190 (2%) Frame = -3 Query: 892 
YAPLYDRIASYINKWTANSLSYAGRLLLIKSVLQGVECFWLQIFPLPSTVIDRINRLCRV 713 Y+PL ++I I WTA LSYAGRL L+ SVL + FWL F LP + I++LC Sbjct: 304 YSPLLEQIKRRIGTWTARFLSYAGRLNLVSSVLWSICNFWLSAFRLPRECVREIDKLCSA 363 Query: 712 FLWG-----KNTSPIKWSKVCLPFDEGGLGLRDVHSWNKALLAKILWNIHSKADSLWVRW 548 FLW N + I W VC P EGGLGL+ + N K++W I S+ DSLWV+W Sbjct: 364 FLWSGPELSTNKAKIAWETVCRPKREGGLGLQSIKEANDVCCLKLIWRIVSQGDSLWVQW 423 Query: 547 IHSFYLKNQSIWTWDPKKSDSCLLKRICDIRNEILAKFGSPDLAISNLSLFANSKGLCSS 368 I ++ LK + W++ S + K++ R+ A F D+ + F Sbjct: 424 IRTYLLKRNTFWSFRSASQGSWMWKKLLKYRDTAKA-FSKVDIRNGETASFWYDDWSSKG 482 Query: 367 KMYDLFREAG 338 ++ D+ E G Sbjct: 483 RLIDVLGERG 492 >ref|XP_004173733.1| PREDICTED: uncharacterized protein LOC101232446, partial [Cucumis sativus] Length = 382 Score = 126 bits (317), Expect = 1e-26 Identities = 58/132 (43%), Positives = 84/132 (63%), Gaps = 5/132 (3%) Frame = -3 Query: 886 PLYDRIASYINKWTANSLSYAGRLLLIKSVLQGVECFWLQIFPLPSTVIDRINRLCRVFL 707 PL RI S I W+A LS+AGRL L++SVL+ ++ +W +F LP V ++++ R +L Sbjct: 55 PLIQRITSRIRSWSARVLSFAGRLQLVRSVLRSLQVYWASVFMLPMKVHRDVDKILRSYL 114 Query: 706 W-----GKNTSPIKWSKVCLPFDEGGLGLRDVHSWNKALLAKILWNIHSKADSLWVRWIH 542 W G+ + + W +VCLPFDEGGL +RD SWN A KILW + K+ SLWV W+ Sbjct: 115 WRGKEEGRGGAKVAWDEVCLPFDEGGLAIRDGSSWNIASTLKILWLLLVKSGSLWVAWVE 174 Query: 541 SFYLKNQSIWTW 506 ++ LK +S+ W Sbjct: 175 AYILKGRSMLGW 186 >gb|AAC63678.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 1216 Score = 125 bits (313), Expect = 3e-26 Identities = 87/343 (25%), Positives = 137/343 (39%), Gaps = 84/343 (24%) Frame = -3 Query: 892 YAPLYDRIASYINKWTANSLSYAGRLLLIKSVLQGVECFWLQIFPLPSTVIDRINRLCRV 713 Y+PL D+I I WT+ LS+AGRL LI SVL + FW+ F LP I+ INR+ Sbjct: 507 YSPLIDQIRRRIGMWTSRYLSFAGRLSLINSVLWSITNFWMNAFRLPRECINEINRISSA 566 Query: 712 FLW-GKNTSP----IKWSKVCLPFDEGGLGLRDVHSWNKALLAKILWNIHS--------- 575 LW G +P + W ++C P EGGLGL+ + NK K++W + S Sbjct: 567 LLWSGPELNPKKAKVSWDEICKPKKEGGLGLQSLREANKVSSLKLIWRLLSCQDSLWVKW 626 Query: 574 
------KADSLWV--------RWIHSFYLK-----------------NQSIW-------- 512 K +S W WI LK N S W Sbjct: 627 TRMNLLKKESFWSIGTHSTLGSWIWRRLLKHREVAKSFCKIEVNNGVNTSFWFDNWSEKG 686 Query: 511 ------------------------TWDPKKSDSCLLKRICDIRNEILAKFGSPDLAISNL 404 W ++ ++ + + +L K+ ++ + + Sbjct: 687 PLINLTGARGAIDMGISRHMTLAEAWSRRRRKRHRVEILNEFEEILLQKYQHRNIELEDA 746 Query: 403 SLFANSKGLCSSKM-----YDLFREAGPKTFWHSAVWKNFIPPKYSFCVWLAFNDRLATI 239 L+ + + ++ ++ R + + WH VW PK+SFC WLA +RL+T Sbjct: 747 ILWRGKEDVFKARFSTKDTWNHIRTSSNQRAWHKGVWFAHATPKFSFCAWLAIRNRLSTG 806 Query: 238 NNLKYLD--VDPTCKLCGNYLENASHLFFDCIVTRLLWDRVKK 116 + + + TC C + +E HLFF C + +W + K Sbjct: 807 DRMMTWNNGTPTTCVFCSSPMETRDHLFFQCCYSSEIWTSIAK 849 >gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana] Length = 1213 Score = 122 bits (306), Expect = 2e-25 Identities = 61/151 (40%), Positives = 80/151 (52%), Gaps = 5/151 (3%) Frame = -3 Query: 892 YAPLYDRIASYINKWTANSLSYAGRLLLIKSVLQGVECFWLQIFPLPSTVIDRINRLCRV 713 Y PL ++I + W LS+AGR+ LI SV+ G FW+ F LP I RI LC Sbjct: 779 YEPLLEKITARFRSWVNKCLSFAGRIQLISSVIFGSINFWMSTFLLPKGCIKRIESLCSR 838 Query: 712 FLWGKNTSPIK-----WSKVCLPFDEGGLGLRDVHSWNKALLAKILWNIHSKADSLWVRW 548 FLW N K W+ +CLP EGGLGLR + WNK L +++W + DSLW W Sbjct: 839 FLWSGNIEQAKGIKVSWAALCLPKSEGGLGLRRLLEWNKTLSMRLIWRLFVAKDSLWADW 898 Query: 547 IHSFYLKNQSIWTWDPKKSDSCLLKRICDIR 455 H +L S W + +SDS KR+ +R Sbjct: 899 QHLHHLSRGSFWAVEGGQSDSWTWKRLLSLR 929