BLASTX nr result
ID: Rehmannia25_contig00011741
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia25_contig00011741
         (1065 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                   Score   E
Sequences producing significant alignments:                        (bits) Value

ref|XP_006582615.1| PREDICTED: uncharacterized protein LOC102665...  212   2e-52
ref|XP_006579213.1| PREDICTED: uncharacterized protein LOC102670...  211   4e-52
ref|XP_006584238.1| PREDICTED: uncharacterized protein LOC102664...  204   4e-50
emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulga...  197   8e-48
emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulga...  185   3e-44
gb|AAF87143.1|AC002423_8 T23E23.16 [Arabidopsis thaliana]            184   4e-44
ref|XP_006579104.1| PREDICTED: uncharacterized protein LOC102661...  174   4e-41
ref|XP_006595311.1| PREDICTED: uncharacterized protein LOC102668...  166   2e-38
ref|XP_006590026.1| PREDICTED: uncharacterized protein LOC102660...  160   6e-37
ref|XP_006584390.1| PREDICTED: putative ribonuclease H protein A...  160   8e-37
ref|XP_006605183.1| PREDICTED: uncharacterized protein LOC102663...  160   1e-36
gb|AAD08951.1| putative reverse transcriptase [Arabidopsis thali...  157   5e-36
ref|XP_006584200.1| PREDICTED: putative ribonuclease H protein A...  141   5e-31
emb|CAB72467.1| putative protein [Arabidopsis thaliana]              137   9e-30
ref|XP_004253224.1| PREDICTED: uncharacterized protein LOC101268...  134   5e-29
gb|AAC33226.1| putative non-LTR retroelement reverse transcripta...  134   6e-29
ref|XP_004293181.1| PREDICTED: uncharacterized protein LOC101298...  130   7e-28
gb|AAD15471.1| putative non-LTR retroelement reverse transcripta...  126   1e-26
gb|AAF99785.1|AC012463_2 T2E6.4 [Arabidopsis thaliana]               125   2e-26
dbj|BAD95408.1| hypothetical protein [Arabidopsis thaliana]          125   2e-26

>ref|XP_006582615.1| PREDICTED: uncharacterized protein LOC102665746 [Glycine max] Length = 506 Score = 212 bits (540), Expect = 2e-52 Identities = 114/333 (34%), Positives = 173/333 (51%), Gaps = 13/333 (3%) Frame = +3 Query: 3 LAAQKLNCVHYAPLHDRIAAYIDKWTANSLSYAGRLLLIKSVLQGIECFWLQIFPLPKSV 182 + ++KL+ +HY+PL D+I I WTA LSYAGRL L+ SV+ + +WL FP PKSV Sbjct: 147 VTSKKLSTIHYSPLIDKIVGKIKHWTARLLSYAGRLQLVNSVMFALTNYWLNCFPFPKSV 206 Query: 183 IKRIYKLCRAFLW-----NKKRHPIAWHDVCLPIEEGGLGIRDVYAWNKALLSKILWKFH 347 +++I +CR FLW ++ P+AW +C P GGL I D+ WNKA L K+LW Sbjct: 207 LQKIEAICRIFLWTGGFEGSRKSPVAWKQICSPRSCGGLNIIDIDIWNKANLMKLLWNLS 266 Query: 348 QGTETLWVRWVHGFYLRNQSIWTWNPRKDDSTLLKRICDIRDELIQKFGSQQSAIDALTP 527 ++LWV+W+ +Y++ + + DS ++K I R++L ID + Sbjct: 267 SKEDSLWVKWIQAYYVKRSELMHIEMKNTDSWIMKAILKQREDL--------EKIDNMEE 318 Query: 528 LVNDNGLNSSKMYDIFRNEGPRHFWHNAIWKQFIPPKFSFCTWLACKDRLSTLDNL---S 698 L+ +N K+Y ++ G R W N ++ P+ +F WLAC RLST D L Sbjct: 319 LMIRGSINMGKLYRKLQDCGQRKEWKNLLYGNTARPRANFILWLACHGRLSTKDRLCKYG 378 Query: 699 YIDTDPMCKLCKNELESAPHLFFMCTVTNSLWCRIKNWLRITRSMSTLASAIKWIK---- 866 ID D C C E ES HLFF+C + +W + W++I S + + W+ Sbjct: 379 MID-DKSCCFCSEE-ESMNHLFFVCDNSKRVWMEVLQWVQIRHDPSDWPNELHWLTHHTK 436 Query: 867 -KDRVEPILKKARSIALCSFVFHVWKARNAVIF 962 K +LK +A+ ++ +W RN IF Sbjct: 437 GKGTRAAVLK----MAIAETIYEIWNIRNNKIF 465 >ref|XP_006579213.1| PREDICTED: uncharacterized protein LOC102670237 [Glycine max] Length = 383 Score = 211 bits (537), Expect = 4e-52 Identities = 111/277 (40%), Positives = 158/277 (57%), Gaps = 9/277 (3%) Frame = +3 Query: 72 KWTANSLSYAGRLLLIKSVLQGIECFWLQIFPLPKSVIKRIYKLCRAFLWNK----KRHP 239 +W+ SLSYAG++ LI++V+QGI FW+ IFPLP+SV+ I CR FLW K K P Sbjct: 110 RWSRKSLSYAGKVELIRAVIQGIANFWMSIFPLPQSVLDTIIATCRNFLWGKADGGKIKP 169 Query: 240 -IAWHDVCLPIEEGGLGIRDVYAWNKALLSKILWKFHQGTETLWVRWVHGFYLRNQSIWT 416 +AW +VC P +EGGLG+ ++ WN ALLS ILW H
++LWVR VH +Y + ++W Sbjct: 170 LVAWSEVCTPKKEGGLGLFNLKDWNIALLSCILWDLHSKKDSLWVRLVHHYYFKGGNVWD 229 Query: 417 WNPRKDDSTLLKRICDIRDELIQKFGSQQSAIDALTPLVNDNGLN----SSKMYDIFRNE 584 + DS + IRD +I K + I+ ++N G N + KMYD R Sbjct: 230 FISSSSDSVFIH----IRDIIISK----EENIEVAKLMLNSWGCNEQTLAGKMYDYIRGT 281 Query: 585 GPRHFWHNAIWKQFIPPKFSFCTWLACKDRLSTLDNLSYIDTDPMCKLCKNELESAPHLF 764 P W + IW IP K SF WLA K+RL LD ++++ +C LC NE ES HLF Sbjct: 282 RPVVHWSSIIWNPVIPSKMSFILWLATKNRLLALDRAAFLNKGFLCPLCTNEAESHAHLF 341 Query: 765 FMCTVTNSLWCRIKNWLRITRSMSTLASAIKWIKKDR 875 F C + +W I++W+ + R +L +I + + R Sbjct: 342 FSCRTSLRVWAHIRDWIPLKRQSISLQHSISALIRRR 378 >ref|XP_006584238.1| PREDICTED: uncharacterized protein LOC102664824 [Glycine max] Length = 939 Score = 204 bits (520), Expect = 4e-50 Identities = 112/330 (33%), Positives = 166/330 (50%), Gaps = 8/330 (2%) Frame = +3 Query: 3 LAAQKLNCVHYAPLHDRIAAYIDKWTANSLSYAGRLLLIKSVLQGIECFWLQIFPLPKSV 182 L+++KLN HY L D+I I W+A LSYAGR+ LI+SV+ FW+Q PLPK V Sbjct: 589 LSSKKLNIKHYQVLIDKIVGRITHWSAGLLSYAGRVQLIQSVIFATINFWMQCLPLPKFV 648 Query: 183 IKRIYKLCRAFLW-----NKKRHPIAWHDVCLPIEEGGLGIRDVYAWNKALLSKILWKFH 347 I RI +CR+FLW ++ PIAW VC P GGL I ++ WNK + K+LW Sbjct: 649 IMRINAICRSFLWIGNSNISRKSPIAWEKVCSPKINGGLNIINLAIWNKISILKLLWNVC 708 Query: 348 QGTETLWVRWVHGFYLRNQSIWTWNPRKDDSTLLKRICDIRDELIQKFGSQQSAIDALTP 527 ++ LW++W+H +Y+R QSIW+ +K S ++ + +R L+Q Q Sbjct: 709 NKSDNLWIKWLHTYYIRGQSIWSMVLKKSHSWIMSSMMKLRPLLLQYQSRMQDV------ 762 Query: 528 LVNDNGLNSSKMYDIFRNEGPRHFWHNAIWKQFIPPKFSFCTWLACKDRLSTLDNLSY-- 701 K+Y E + W + P+ FC W AC RL++ D L Sbjct: 763 ------FKMKKIYLALFEESEKMSWRTLMCNNLARPRALFCLWQACHFRLASKDRLIKFG 816 Query: 702 IDTDPMCKLCKNELESAPHLFFMCTVTNSLWCRIKNWLRITRSMSTLASAIKWI-KKDRV 878 ++ D C C + +ES HLFF C ++W + NWL+I ST + + WI +K + Sbjct: 817 LNVDANCAFC-SSMESHEHLFFGCIELKTIWTAVLNWLQIIHMPSTWSEELNWITRKCKG 875 Query: 879 EPILKKARSIALCSFVFHVWKARNAVIFDG 968 + A ++H+W RN +F G Sbjct: 876 KGWRAMLLKCAFTETIYHIWAYRNHRVFGG 905 >emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. 
vulgaris] Length = 1114 Score = 197 bits (500), Expect = 8e-48 Identities = 116/360 (32%), Positives = 179/360 (49%), Gaps = 12/360 (3%) Frame = +3 Query: 3 LAAQKLNCVHYAPLHDRIAAYIDKWTANSLSYAGRLLLIKSVLQGIECFWLQIFPLPKSV 182 LA++KLN PL D+I W A+ LSYAGRL L+K++L ++ +W QIFPLPK + Sbjct: 764 LASKKLNFSQCKPLIDKITTRAQGWVAHLLSYAGRLQLVKTILYSMQNYWGQIFPLPKKL 823 Query: 183 IKRIYKLCRAFLW-----NKKRHPIAWHDVCLPIEEGGLGIRDVYAWNKALLSKILWKFH 347 IK + CR FLW + P+AW + P GGL + ++ WNKA + K+LW Sbjct: 824 IKAVETTCRKFLWTGTVDTSYKAPVAWDFLQQPKSTGGLNVTNMVLWNKAAILKLLWAIT 883 Query: 348 QGTETLWVRWVHGFYLRNQSIWTWNPRKDDSTLLKRICDIRDELIQKFGSQQSAIDALTP 527 + LWVRWV+ +Y++ Q+I + S +L++I + R EL+ + G ++ Sbjct: 884 FKQDKLWVRWVNAYYIKRQNIENVTVSSNTSWILRKIFESR-ELLTRTGGWEA------- 935 Query: 528 LVNDNGLNSSKMYDIFRNEGPRHFWHNAIWKQFI-----PPKFSFCTWLACKDRLSTLDN 692 + N + K Y + + + + N +WK+ I PK F WLA +RL+T + Sbjct: 936 VSNHMNFSIKKTYKLLQED-----YENVVWKRLICNNKATPKSQFILWLAMLNRLATAER 990 Query: 693 LS--YIDTDPMCKLCKNELESAPHLFFMCTVTNSLWCRIKNWLRITRSMSTLASAIKWIK 866 +S D P+CK+C NE+E+ HLFF C + +W ++ +L + A IK Sbjct: 991 VSRWNRDVSPLCKMCGNEIETIQHLFFNCIYSKEIWGKVLLYLNLQPQADAQAKKELAIK 1050 Query: 867 KDRVEPILKKARSIALCSFVFHVWKARNAVIFDGTPFTEEAVFHKIQKHVYKALYFRFPV 1046 K R K + V+ +W RNA +F G Q K++ FR V Sbjct: 1051 KARSTKDRNKLYVMMFTESVYAIWLLRNAKVFRGIEIN--------QNQAVKSIIFRIAV 1102 >emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. 
vulgaris] Length = 1110 Score = 185 bits (469), Expect = 3e-44 Identities = 111/347 (31%), Positives = 164/347 (47%), Gaps = 13/347 (3%) Frame = +3 Query: 3 LAAQKLNCVHYAPLHDRIAAYIDKWTANSLSYAGRLLLIKSVLQGIECFWLQIFPLPKSV 182 L ++KL PL + I W A LSYAGRL LIKS+L ++ +W IFPL K V Sbjct: 761 LTSKKLTYAQCKPLVEMITNRAQTWMAKLLSYAGRLQLIKSILSSMQNYWAHIFPLSKKV 820 Query: 183 IKRIYKLCRAFLW-----NKKRHPIAWHDVCLPIEEGGLGIRDVYAWNKALLSKILWKFH 347 I+ + K+CR FLW K+ P+AW + P GG + ++ WN+A + K+LW Sbjct: 821 IQAVEKVCRKFLWTGKTEETKKAPVAWATIQRPKSRGGWNVINMKYWNRAAMLKLLWAIE 880 Query: 348 QGTETLWVRWVHGFYLRNQSIWTWNPRKDDSTLLKRICDIRDELIQKFGSQQSAIDALTP 527 + LWVRW+H +Y++ Q I T N + +L++I RD L S I Sbjct: 881 FKRDKLWVRWIHSYYIKRQDILTVNISNQTTWILRKIVKARDHL--------SNIGDWDE 932 Query: 528 LVNDNGLNSSKMYDIFRNEGPRHFWHNAIWKQFIPPKFSFCTWLACKDRLSTLDNLSY-- 701 + + + K Y G R W I + PK F W+ +RL T+D +S Sbjct: 933 ICIGDKFSMKKAYKKISENGERVRWRRLICNNYATPKSKFILWMMLHERLPTVDRISRWG 992 Query: 702 IDTDPMCKLCKNELESAPHLFFMCTVTNSLWCRIKNWLRITRS----MSTLASAIKWIKK 869 + D +LC+N+ E+ HLFF C+ + +W +I +R S ++S +K Sbjct: 993 VQCDLNYRLCRNDGETIQHLFFSCSYSAGVWSKICYIMRFPNSGVSHQEIISSVCGQARK 1052 Query: 870 DRVEPILKKARSIAL--CSFVFHVWKARNAVIFDGTPFTEEAVFHKI 1004 KK + I + FV+ +WK RN F G E V KI Sbjct: 1053 -------KKGKLIVMLYTEFVYAIWKQRNKRTFTGENKDENEVLRKI 1092 >gb|AAF87143.1|AC002423_8 T23E23.16 [Arabidopsis thaliana] Length = 653 Score = 184 bits (468), Expect = 4e-44 Identities = 109/326 (33%), Positives = 163/326 (50%), Gaps = 10/326 (3%) Frame = +3 Query: 3 LAAQKLNCVHYAPLHDRIAAYIDKWTANSLSYAGRLLLIKSVLQGIECFWLQIFPLPKSV 182 L ++L Y+PL + I I WT LSYAGRL LI SVL I FWL F LP+ Sbjct: 303 LVTKRLTATDYSPLLEHIKKKIGTWTTRYLSYAGRLNLITSVLWSICNFWLAAFRLPREC 362 Query: 183 IKRIYKLCRAFLW-----NKKRHPIAWHDVCLPIEEGGLGIRDVYAWNKALLSKILWKFH 347 I+ I K+C AFLW N ++ + W DVC P +EGGLG+R + N+ K++W+ Sbjct: 363 IREIDKICSAFLWSGPDLNPRKTRVCWGDVCKPKQEGGLGLRSLKEMNEVSCLKLIWRIV 422 Query: 348 QGTETLWVRWVHGFYLRNQSIWTWNPRKDDSTLLKRICDIRDELIQKFGSQQSAIDALTP 527 T +LWVRW+ + L++ + W+ + ++L R DE + KF ++ + 
Sbjct: 423 SHTNSLWVRWIEQYLLKHDTFWSVQTTTNMDSVLWR--GRNDEYMPKFSTRDT------- 473 Query: 528 LVNDNGLNSSKMYDIFRNEGPRHFWHNAIWKQFIPPKFSFCTWLACKDRLSTLDNLSYID 707 ++ RN WH IW PKFSFC WLA ++RLST D + + Sbjct: 474 ------------WNQTRNTSTPVTWHMGIWFAHATPKFSFCAWLAVQNRLSTGDKMLQWN 521 Query: 708 --TDPMCKLCKNELESAPHLFFMCTVTNSLWCRIKNWL---RITRSMSTLASAIKWIKKD 872 P C LC N +E+ HLFF C T +W + + + + + ST+ +++ ++ Sbjct: 522 RRLSPTCVLCNNNIETRNHLFFSCCYTAEIWENLAKNIYKAKFSTNWSTILTSVSTTWRN 581 Query: 873 RVEPILKKARSIALCSFVFHVWKARN 950 R E L AR I + + +W RN Sbjct: 582 RTESFL--ARYIFQAT-IHTIWHERN 604 >ref|XP_006579104.1| PREDICTED: uncharacterized protein LOC102661523 [Glycine max] Length = 947 Score = 174 bits (442), Expect = 4e-41 Identities = 103/336 (30%), Positives = 159/336 (47%), Gaps = 16/336 (4%) Frame = +3 Query: 3 LAAQKLNCVHYAPLHDRIAAYIDKWTANSLSYAGRLLLIKSVLQGIECFWLQIFPLPKSV 182 L ++KLN +Y PL D+I I WT+ L+ GR+ ++ + I FW+Q P+P SV Sbjct: 589 LTSKKLNIKYYLPLIDKITTRIRHWTSKLLNMTGRVQMVNCTITAIVQFWMQCLPIPMSV 648 Query: 183 IKRIYKLCRAFLWNK-----KRHPIAWHDVCLPIEEGGLGIRDVYAWNKALLSKILWKFH 347 IK+I +CR+F+W++ ++ PIAW+ VC P +GGL I ++ WN + LW Sbjct: 649 IKKIDSMCRSFVWSRSTEITRKSPIAWNSVCRPKGQGGLNIFNLKVWNHITVLNCLWNLC 708 Query: 348 QGTETLWVRWVHGFYLRNQSIWTWNPRKDDSTLLKRICDIRDELIQKFGSQQSAIDALTP 527 + + LWV+W+H Y++N S+ + S +LK + SQ+ I L P Sbjct: 709 KKVDNLWVKWIHAHYIKNSSVMNTMVTNNFSWVLKNVL-----------SQREYIHTLQP 757 Query: 528 ----LVNDNGLNSSKMYDIFRNEGPRHFWHNAIWKQFIPPKFSFCTWLACKDRLSTLDNL 695 L+N K YD E R W + K P+ TWLAC RL T D L Sbjct: 758 VWDELLNSERFKMKKAYDKMM-EADRVHWSGLMRKNCARPRAIHTTWLACHGRLGTKDRL 816 Query: 696 SYID--TDPMCKLCKNELESAPHLFFMCTVTNSLWCRIKNWLRITRSMSTLASAIKWI-- 863 TD + LCK E+ H+ F C V +W + N + I + W+ Sbjct: 817 VRFGMITDKIWSLCKEVEETQNHILFSCKVATDIWSNVLNRIGIDHVPQEWPLELDWLLN 876 Query: 864 ---KKDRVEPILKKARSIALCSFVFHVWKARNAVIF 962 +K +LK +++ ++ +W RN+ IF Sbjct: 877 LTNRKGWRAYLLK----LSVTETIYGIWINRNSKIF 908 >ref|XP_006595311.1| PREDICTED: uncharacterized protein LOC102668530 [Glycine max] Length = 477 Score = 166 bits 
(419), Expect = 2e-38 Identities = 100/335 (29%), Positives = 157/335 (46%), Gaps = 1/335 (0%) Frame = +3 Query: 54 IAAYIDKWTANSLSYAGRLLLIKSVLQGIECFWLQIFPLPKSVIKRIYKLCRAFLWNKKR 233 I + I W++ +LSYAG++ LI++V+QGI FW IFPLP+ V+ RI R FLW K Sbjct: 182 ITSLIQGWSSKTLSYAGKVELIRAVIQGIANFWTDIFPLPQFVLDRINVSYRNFLWGKAE 241 Query: 234 HPIAWHDVCLPIEEGGLGIRDVYAWNKALLSKILWKFHQGTETLWVRWVHGFYLRNQSIW 413 VH Y + ++W Sbjct: 242 ------------------------------------------------VHHNYFKGGNVW 253 Query: 414 TWNPRKDDSTLLKRICDIRDELIQKFGSQQSAIDALTPLVNDNGLNSSKMYDIFRNEGPR 593 + DS L+K+I IRD + K + ++A L ++ L + K YD R P Sbjct: 254 DFISSASDSVLIKKIIHIRDIITIKEDNVEAAKQTLNSWNSNEQLLAGKAYDYIRGVKPA 313 Query: 594 HFWHNAIWKQFIPPKFSFCTWLACKDRLSTLDNLSYIDTDPMCKLCKNELESAPHLFFMC 773 W++ +W IP K SF WLA K+ L TLD ++++ +C LC+ + +S HLFF C Sbjct: 314 VNWNSVVWNPAIPSKMSFILWLATKNHLLTLDRAAFLNKGLLCPLCRTKAKSHAHLFFSC 373 Query: 774 TVTNSLWCRIKNWLRITRSMSTLASAI-KWIKKDRVEPILKKARSIALCSFVFHVWKARN 950 ++ +W I++W+ + R +L I I K R +AL V+ W +RN Sbjct: 374 RISLQVWANIRDWIPLHRQTISLQCTINSRICGRATSGTWGKFRCLALAIAVYCTWISRN 433 Query: 951 AVIFDGTPFTEEAVFHKIQKHVYKALYFRFPVDLV 1055 ++F+ +PF+ + +KI+ VYK R P+ L+ Sbjct: 434 LLLFENSPFSVINIINKIKFLVYKHSRVRVPIVLL 468 >ref|XP_006590026.1| PREDICTED: uncharacterized protein LOC102660482 [Glycine max] Length = 303 Score = 160 bits (406), Expect = 6e-37 Identities = 86/197 (43%), Positives = 115/197 (58%), Gaps = 5/197 (2%) Frame = +3 Query: 3 LAAQKLNCVHYAPLHDRIAAYIDKWTANSLSYAGRLLLIKSVLQGIECFWLQIFPLPKSV 182 L + +LN HYAPL +I I W+ SLSYAG+L LI++V+QGI FW+ IFPLP+SV Sbjct: 99 LLSSRLNVCHYAPLLSKITGLIQGWSRKSLSYAGKLELIRAVIQGIVNFWIGIFPLPQSV 158 Query: 183 IKRIYKLCRAFLW-----NKKRHPIAWHDVCLPIEEGGLGIRDVYAWNKALLSKILWKFH 347 + RI CR FLW KK+ +AW VC P EGGLG+ ++ WN ALLS ILW FH Sbjct: 159 LDRINASCRNFLWGKADIGKKKPLVAWSVVCSPKREGGLGLFNLKDWNLALLSCILWDFH 218 Query: 348 QGTETLWVRWVHGFYLRNQSIWTWNPRKDDSTLLKRICDIRDELIQKFGSQQSAIDALTP 527 ++L WVH +Y R +W +N S L+K+I IRD +I K S + A + Sbjct: 219 
CKKDSL---WVHHYYFRRSDVWNYNTSSSYSVLIKKIIQIRDFIISKELSTEEAKKRIQS 275 Query: 528 LVNDNGLNSSKMYDIFR 578 + L K+Y+ R Sbjct: 276 WRTNGQLLVGKVYEYIR 292 >ref|XP_006584390.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Glycine max] Length = 316 Score = 160 bits (405), Expect = 8e-37 Identities = 82/167 (49%), Positives = 106/167 (63%), Gaps = 5/167 (2%) Frame = +3 Query: 3 LAAQKLNCVHYAPLHDRIAAYIDKWTANSLSYAGRLLLIKSVLQGIECFWLQIFPLPKSV 182 L + +LN HYAPL +I I W SLSY G+L LIK+V+QGI FW++IFPLP+SV Sbjct: 132 LLSSRLNVCHYAPLLYKIVGLIQGWNKKSLSYVGKLELIKAVIQGIMNFWMRIFPLPQSV 191 Query: 183 IKRIYKLCRAFLWNK----KRHP-IAWHDVCLPIEEGGLGIRDVYAWNKALLSKILWKFH 347 + RI C FLW+K K P +AW VC P +EGGLG+ ++ WN ALLS ILW FH Sbjct: 192 LDRINASCCNFLWSKADIGKNKPLVAWPVVCSPKQEGGLGLFNLKDWNLALLSHILWDFH 251 Query: 348 QGTETLWVRWVHGFYLRNQSIWTWNPRKDDSTLLKRICDIRDELIQK 488 ++L VRWVH +Y R W +N +S L+K+I IRD +I K Sbjct: 252 CKKDSLRVRWVHHYYFRRSDEWNYNISSSNSVLIKKIIQIRDFIISK 298 >ref|XP_006605183.1| PREDICTED: uncharacterized protein LOC102663533 [Glycine max] Length = 514 Score = 160 bits (404), Expect = 1e-36 Identities = 79/272 (29%), Positives = 143/272 (52%), Gaps = 8/272 (2%) Frame = +3 Query: 3 LAAQKLNCVHYAPLHDRIAAYIDKWTANSLSYAGRLLLIKSVLQGIECFWLQIFPLPKSV 182 L+ +KLN HY PL ++I I W++ LS AGR+ L++S++ I +W+ +FP+PK V Sbjct: 250 LSCKKLNVHHYLPLVEKIVGKIRHWSSKLLSIAGRIQLVRSIITAIAQYWMSVFPMPKKV 309 Query: 183 IKRIYKLCRAFLWN-----KKRHPIAWHDVCLPIEEGGLGIRDVYAWNKALLSKILWKFH 347 I++I +CR+F+W+ K++ +AW VC P GGL + ++ WN + K LW Sbjct: 310 IQKIDSICRSFIWSGSAEVKRKSLVAWKQVCKPARCGGLNLINLELWNVTAMLKCLWNIC 369 Query: 348 QGTETLWVRWVHGFYLRNQSIWTWNPRKDDSTLLKRICDIRDELIQKFGSQQSAIDALTP 527 + LWV+W+H ++L+ ++ + + + + +LK + R ++ + Sbjct: 370 SKEDNLWVKWIHAYFLKGDNVMSATIKSNSTWILKSVMKQRPQV-------NNLQLVWIE 422 Query: 528 LVNDNGLNSSKMYDIFRNEGPRHFWHNAIWKQFIPPKFSFCTWLACKDRLST---LDNLS 698 ++ + ++Y + + W + P+ + WLAC++RL+T L N++ Sbjct: 423 MLRKRKFSMKQVYMELVEDHNKIDWFRLLRYNRARPRANVTLWLACQNRLATKTRLKNMN 482 Query: 699 YIDTDPMCKLCKNELESAPHLFFMCTVTNSLW 794 I +C LCK 
+ E HL F C VT ++W Sbjct: 483 MIQCS-LCSLCKEQDEDLDHLMFSCRVTKAIW 513 >gb|AAD08951.1| putative reverse transcriptase [Arabidopsis thaliana] gi|20197043|gb|AAM14892.1| putative reverse transcriptase [Arabidopsis thaliana] Length = 1412 Score = 157 bits (398), Expect = 5e-36 Identities = 99/357 (27%), Positives = 161/357 (45%), Gaps = 15/357 (4%) Frame = +3 Query: 3 LAAQKLNCVHYAPLHDRIAAYIDKWTANSLSYAGRLLLIKSVLQGIECFWLQIFPLPKSV 182 L +++ PL ++I + I W LSYAGRL L+ SV+ + FW+ F LP++ Sbjct: 1031 LMTKRMTLADCLPLLEKIRSRISSWKNRFLSYAGRLQLLNSVISSLTKFWISAFRLPRAC 1090 Query: 183 IKRIYKLCRAFLW-----NKKRHPIAWHDVCLPIEEGGLGIRDVYAWNKALLSKILWKFH 347 I+ I ++ AFLW N + +AWHDVC P EGGLG+R + NK K++W+ Sbjct: 1091 IREIEQISAAFLWSGTDLNPHKAKVAWHDVCKPKSEGGLGLRSLVDANKICCFKLIWRLV 1150 Query: 348 QGTETLWVRWVHGFYLRN--QSIWTWNPRKDDSTLLKRICDIRDELIQKFGSQQSAIDAL 521 +LWV W+ +R +++ + R +L DI +EL +K + + Sbjct: 1151 SAKHSLWVNWIQNNLIRTVAEALSSHRRRSHRDDILN---DIEEEL-EKLLCRGICTEQD 1206 Query: 522 TPLVNDNG------LNSSKMYDIFRNEGPRHFWHNAIWKQFIPPKFSFCTWLACKDRLST 683 L G S +++ R +G WH AIW PKF+F +WLA DRL+T Sbjct: 1207 RSLCRSIGGQFKAKFFSPEIWHQIREQGLVKQWHKAIWFSGATPKFTFISWLAAHDRLTT 1266 Query: 684 LDNLSYID--TDPMCKLCKNELESAPHLFFMCTVTNSLWCRIKNWLRITRSMSTLASAIK 857 D ++ + +C LC ES HLFF C ++ +W R+ L + R + + + Sbjct: 1267 GDKMASWNRGISSVCVLCNISAESRDHLFFSCNFSSHIWDRLTRRLLLCRYTTNFPALLL 1326 Query: 858 WIKKDRVEPILKKARSIALCSFVFHVWKARNAVIFDGTPFTEEAVFHKIQKHVYKAL 1028 + + + + +W+ RN P + + I + L Sbjct: 1327 LLSGQDFSGTKRFLLRYVFQATIHTLWRERNKRRHGDLPIPSDHIIKFIDRQTRNRL 1383 >ref|XP_006584200.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Glycine max] Length = 239 Score = 141 bits (355), Expect = 5e-31 Identities = 69/141 (48%), Positives = 90/141 (63%), Gaps = 5/141 (3%) Frame = +3 Query: 3 LAAQKLNCVHYAPLHDRIAAYIDKWTANSLSYAGRLLLIKSVLQGIECFWLQIFPLPKSV 182 L + +LN HYAPL +I I W+ SLSYAG+L LI++V+QGI FW++IFPL +SV Sbjct: 99 LLSSRLNVCHYAPLLSKITGLIQGWSRKSLSYAGKLELIRAVIQGIVNFWMKIFPLSQSV 158 Query: 183 
IKRIYKLCRAFLW-----NKKRHPIAWHDVCLPIEEGGLGIRDVYAWNKALLSKILWKFH 347 + RI C FLW K + IAW VC P +EGGLG+ ++ WN LLS+ILW FH Sbjct: 159 LDRINASCCNFLWGKADIGKNKSLIAWSVVCSPKKEGGLGLFNLKDWNLTLLSRILWDFH 218 Query: 348 QGTETLWVRWVHGFYLRNQSI 410 + LWVRWVH +Y R + Sbjct: 219 CKKDFLWVRWVHHYYFRASDV 239 >emb|CAB72467.1| putative protein [Arabidopsis thaliana] Length = 762 Score = 137 bits (344), Expect = 9e-30 Identities = 96/341 (28%), Positives = 143/341 (41%), Gaps = 85/341 (24%) Frame = +3 Query: 3 LAAQKLNCVHYAPLHDRIAAYIDKWTANSLSYAGRLLLIKSVLQGIECFWLQIFPLPKSV 182 L ++L+ V YAPL ++I I W++ LS+AGR LI S++ FWL F LP++ Sbjct: 329 LVTKRLSSVDYAPLIEQIRKRIGSWSSRFLSFAGRFNLISSIIWSSCNFWLSAFQLPRAC 388 Query: 183 IKRIYKLCRAFLW-----NKKRHPIAWHDVCLPIEEGGLGIRDVYAWNKALLSKILWKFH 347 I+ I KLC +FLW N K+ I+W+ VC P EGGLG+R + N K++W+ Sbjct: 389 IQEIEKLCSSFLWSGTNLNSKKAKISWNQVCKPKSEGGLGLRSLKEANDVCCLKLVWRII 448 Query: 348 QGTETLWVRWVHGFYLRNQSIW---------TWNPRK----------------------- 431 ++LWV+WV L+ + W +W +K Sbjct: 449 SHGDSLWVKWVEHNLLKREIFWIVKENANLGSWIWKKILKYRGVAKRFCKAEVGNGESTS 508 Query: 432 ---DDSTLLKRICDI-----------------------------RDELI----------- 482 DD +LL R+ D+ R E++ Sbjct: 509 FWFDDWSLLGRLIDVAGIRGTIDMGISRTMSVADAWTSRRRRHHRQEILNTIEEVLSTQH 568 Query: 483 QKFGSQQSAIDALTPLVND---NGLNSSKMYDIFRNEGPRHFWHNAIWKQFIPPKFSFCT 653 QK QQ L ND + ++ ++ R WH +W PK+SFC Sbjct: 569 QKRTQQQQQGRVLWKGKNDIYKDKFSTKNTWNYLRTTSNEVAWHKGVWFPHATPKYSFCL 628 Query: 654 WLACKDRLSTLDNLSYIDTDPM--CKLCKNELESAPHLFFM 770 WLA DRL+T + + C C+ +E+ HLFFM Sbjct: 629 WLAAHDRLATGARMIKWNRGETGDCTFCRQGIETRDHLFFM 669 >ref|XP_004253224.1| PREDICTED: uncharacterized protein LOC101268376 [Solanum lycopersicum] Length = 717 Score = 134 bits (338), Expect = 5e-29 Identities = 57/142 (40%), Positives = 92/142 (64%), Gaps = 5/142 (3%) Frame = +3 Query: 3 LAAQKLNCVHYAPLHDRIAAYIDKWTANSLSYAGRLLLIKSVLQGIECFWLQIFPLPKSV 182 L+++KLN + + PL +++ A I+ WTA LSYAGR L+K+VL G++ W Q+F +P + Sbjct: 569 LSSKKLNTIQWYPLIEKVMARINSWTAKKLSYAGRAQLVKTVLFGVQALWAQLFIIPAKI 628 Query: 183 
IKRIYKLCRAFLWN-----KKRHPIAWHDVCLPIEEGGLGIRDVYAWNKALLSKILWKFH 347 IK I LCR++LW+ K+ IAW VC P EGGLG+ ++ WN++ ++K+ W Sbjct: 629 IKLIEGLCRSYLWSGVGYVTKKALIAWDKVCSPKYEGGLGLINLKIWNRSAVTKLCWDLA 688 Query: 348 QGTETLWVRWVHGFYLRNQSIW 413 + LW++W+H +Y++ Q W Sbjct: 689 NKEDKLWIKWIHAYYIKGQREW 710 >gb|AAC33226.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 1529 Score = 134 bits (337), Expect = 6e-29 Identities = 109/404 (26%), Positives = 163/404 (40%), Gaps = 87/404 (21%) Frame = +3 Query: 3 LAAQKLNCVHYAPLHDRIAAYIDKWTANSLSYAGRLLLIKSVLQGIECFWLQIFPLPKSV 182 L +++ Y+PL + + I WTA SLSYAGRL L+ SV+ I FW+ + LP Sbjct: 1073 LLTKQMTTADYSPLIEAVKTKISSWTARSLSYAGRLALLNSVIVSIANFWMSAYRLPAGC 1132 Query: 183 IKRIYKLCRAFLW-----NKKRHPIAWHDVCLPIEEGGLGIRDVYAWNKALLSKI----- 332 I+ I KLC AFLW N K+ IAW +C P +EGGLGI+ + NK K+ Sbjct: 1133 IREIEKLCSAFLWSGPVLNPKKAKIAWSSICQPKKEGGLGIKSLAEANKVSCLKLIWRLL 1192 Query: 333 ----------LWKFHQGTETLW----------------------VRWVHGFYLRNQSI-- 410 +W F T W + +H +RN S Sbjct: 1193 STQPSLWVTWIWTFIIRKGTFWSANERSSLGSWMWKKLLKYRELAKSMHKVEVRNGSSTS 1252 Query: 411 -----WTWNPRKDDSTLLKRICDIRDEL---------------------------IQKFG 494 W+ R D T +R+ D+ L IQ+ Sbjct: 1253 FWYDHWSHLGRLLDITGTRRVIDLGIPLETNLETVLRTHQHRQHRAAIYNRINAEIQRLQ 1312 Query: 495 SQQSA----IDALTPLVNDNGLN--SSKMYDIFRNEGPRHFWHNAIWKQFIPPKFSFCTW 656 Q+ I L ND + ++ R P+ W+ +W + PK+SF W Sbjct: 1313 QQEREAGPDISLWRSLKNDFNKRFITKVTWNNVRTHQPQQNWYKGVWFPYSTPKYSFLLW 1372 Query: 657 LACKDRLSTLDNLSYIDTDPM--CKLCKNELESAPHLFFMCTVTNSLWCRIKNWLRIT-- 824 L ++RLST D + ++ + C LC N E+ HLFF C T+ +W + L T Sbjct: 1373 LTVQNRLSTGDRIKAWNSGQLVTCTLCNNAEETRDHLFFSCQYTSYVWEALTQRLLSTNY 1432 Query: 825 -RSMSTLASAIKWIKKDRVEPILKKARSIALCSFVFHVWKARNA 953 R + L + + R L + + ++H+W+ RNA Sbjct: 1433 SRDWNRLFTLLCTSNLPRDHLFLFR---YVFQASIYHIWRERNA 1473 >ref|XP_004293181.1| PREDICTED: uncharacterized protein LOC101298394 [Fragaria vesca subsp. 
vesca] Length = 958 Score = 130 bits (328), Expect = 7e-28 Identities = 102/370 (27%), Positives = 154/370 (41%), Gaps = 32/370 (8%) Frame = +3 Query: 3 LAAQKLNCVHYAPLHDRIAAYIDKWTANSLSYAGRLLLIKSVLQGIECFWLQIFPLPKSV 182 L KL +PL DRI I W LS+AGRL LI+SVL I+ +W LPK V Sbjct: 598 LITSKLRMQDCSPLLDRIETRIKSWENKVLSFAGRLQLIQSVLSSIQVYWASHLILPKKV 657 Query: 183 IKRIYKLCRAFLW-----NKKRHPIAWHDVCLPIEEGGLGIRDVYAWNKALLSKILWKFH 347 +K I K R FLW + +AW ++CLP EGGLGI+D++ WNKAL+ +W Sbjct: 658 LKDIEKRLRCFLWAGNCSGRAATKVAWSEICLPKCEGGLGIKDLHCWNKALMISHIWNLV 717 Query: 348 QGTETLWVRWVHGFYLRNQSIW--------TWNPRKDDSTLLK---RICDIRDELIQKFG 494 + W WV + L+ S W +WN RK LLK C +I Sbjct: 718 SSSSNFWTDWVKVYLLKGNSFWNAPLPSICSWNWRK----LLKIRELCCSFFVNIIGDGR 773 Query: 495 SQQSAIDALTPL-----------VNDNGLNSSKMYDIFRNEGPRHFWH-NAIWKQFIPPK 638 + D PL + ++GL+ S M P F+ ++ W P + Sbjct: 774 ATSLWFDNWHPLGPLTLRWSSNIIGESGLSKSAMLT------PNGFYSTSSAWNTLRPSR 827 Query: 639 FSFCTWLACKDRLSTLDNLSYIDTDPMCKLCKNELESAPHLFFMCTVTNSLWCRIKNWLR 818 F P +L E+ HLFF C + +W + + Sbjct: 828 FIV----------------------PWYRLVWFVAETHNHLFFDCAYSFGIWTHVLSKCD 865 Query: 819 ITRSMSTLASAIKWI----KKDRVEPILKKARSIALCSFVFHVWKARNAVIFDGTPFTEE 986 +++ + + I W+ K + + ++ K +AL + V+ +W+ RN F Sbjct: 866 VSKPLLPWSDFIFWVATNWKGNSLPVVILK---LALQAVVYAIWRERNNRRFRNESLPPA 922 Query: 987 AVFHKIQKHV 1016 VF I + + Sbjct: 923 VVFKGIVESI 932 >gb|AAD15471.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 1277 Score = 126 bits (317), Expect = 1e-26 Identities = 64/163 (39%), Positives = 91/163 (55%), Gaps = 6/163 (3%) Frame = +3 Query: 3 LAAQKLNCVHYAPLHDRIAAYIDKWTANSLSYAGRLLLIKSVLQGIECFWLQIFPLPKSV 182 L +++ Y+PL D++ + I WTA SLSYAGRL LI SV+ + FW+ + LP Sbjct: 918 LLTKQMTTADYSPLLDKVRSKISSWTARSLSYAGRLALINSVIVSLSNFWMSAYRLPAGC 977 Query: 183 IKRIYKLCRAFLW-----NKKRHPIAWHDVCLPIEEGGLGIRDVYAWNKALLSKILWKFH 347 IK I KLC AFLW N K+ I W +C +EGGLGI+ + NK K++W+ Sbjct: 978 IKEIEKLCSAFLWSGPELNPKKAKITWTSLCKLKQEGGLGIKSLLEANKVSCLKLIWRLV 1037 Query: 348 
QGTETLWVRWVHGFYLRNQSIWTWNPRKD-DSTLLKRICDIRD 473 +LWV WV + +R S W+ N R S + K++ + RD Sbjct: 1038 SRQSSLWVNWVWTYIIRKGSFWSANDRSSLGSWMWKKLLNYRD 1080 >gb|AAF99785.1|AC012463_2 T2E6.4 [Arabidopsis thaliana] Length = 740 Score = 125 bits (315), Expect = 2e-26 Identities = 64/163 (39%), Positives = 90/163 (55%), Gaps = 6/163 (3%) Frame = +3 Query: 3 LAAQKLNCVHYAPLHDRIAAYIDKWTANSLSYAGRLLLIKSVLQGIECFWLQIFPLPKSV 182 L +++ Y+PL D++ + I WTA SLSYAGRL LI SV+ + FW+ + LP Sbjct: 349 LLTKQMTTADYSPLLDKVRSKISSWTARSLSYAGRLALINSVIVSLSNFWMSAYRLPAGC 408 Query: 183 IKRIYKLCRAFLW-----NKKRHPIAWHDVCLPIEEGGLGIRDVYAWNKALLSKILWKFH 347 IK I KLC AFLW N K+ I W +C +EGGLGI+ + NK K++W+ Sbjct: 409 IKEIEKLCSAFLWSGPELNPKKAKITWTSLCKLKQEGGLGIKSLLEANKVSCLKLIWRLV 468 Query: 348 QGTETLWVRWVHGFYLRNQSIWTWNPRKD-DSTLLKRICDIRD 473 +LWV WV + +R S W+ N R S + K++ RD Sbjct: 469 SRQSSLWVNWVWTYIIRKGSFWSANDRSSLGSWMWKKLLKYRD 511 >dbj|BAD95408.1| hypothetical protein [Arabidopsis thaliana] Length = 478 Score = 125 bits (315), Expect = 2e-26 Identities = 62/143 (43%), Positives = 85/143 (59%), Gaps = 5/143 (3%) Frame = +3 Query: 3 LAAQKLNCVHYAPLHDRIAAYIDKWTANSLSYAGRLLLIKSVLQGIECFWLQIFPLPKSV 182 L +K+ Y PL ++I I KWTA LS+AGRL LI SV+ + FW+ F LP + Sbjct: 32 LLTKKMTTSDYGPLVEKIRVRIGKWTARHLSFAGRLQLISSVIHSLTNFWMSAFRLPSAC 91 Query: 183 IKRIYKLCRAFLW-----NKKRHPIAWHDVCLPIEEGGLGIRDVYAWNKALLSKILWKFH 347 IK I +C +FLW N K+ +AW DVC P +EGGLGIR + NK L K++W+ Sbjct: 92 IKEIDSICSSFLWSGPELNTKKAKVAWSDVCTPKDEGGLGIRSLKEANKVSLLKLIWRML 151 Query: 348 QGTETLWVRWVHGFYLRNQSIWT 416 T +LWV+W+ + LR S W+ Sbjct: 152 SST-SLWVQWLRLYLLRKGSFWS 173