BLASTX nr result
ID: Rehmannia25_contig00001415
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia25_contig00001415
         (1010 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                        (bits) Value

emb|CAN84102.1| hypothetical protein VITISV_041248 [Vitis vinifera]   148   3e-33
emb|CAB40039.1| putative retrotransposon [Arabidopsis thaliana] ...   134   4e-29
emb|CAJ09951.2| putative gag-pol polyprotein [Citrus sinensis]        134   6e-29
emb|CAN61630.1| hypothetical protein VITISV_003191 [Vitis vinifera]   134   7e-29
gb|AAC62132.1| copia-like retroelement pol polyprotein [Arabidop...   133   1e-28
emb|CAN80304.1| hypothetical protein VITISV_017821 [Vitis vinifera]   130   1e-27
gb|AAK29467.1| polyprotein-like [Solanum chilense]                    129   2e-27
emb|CAN72567.1| hypothetical protein VITISV_044177 [Vitis vinifera]   129   2e-27
sp|P10978.1|POLX_TOBAC RecName: Full=Retrovirus-related Pol poly...   127   5e-27
gb|EOY22705.1| Transducin/WD40 repeat-like superfamily protein [...   126   1e-26
gb|AAD32759.1| putative retroelement pol polyprotein [Arabidopsi...   126   1e-26
gb|AAD23679.1| putative retroelement pol polyprotein [Arabidopsi...   125   2e-26
gb|EOY09126.1| Uncharacterized protein TCM_024518 [Theobroma cacao]   125   3e-26
gb|AAP53866.2| retrotransposon protein, putative, Ty1-copia subc...   123   1e-25
emb|CAB75481.1| copia-like polyprotein [Arabidopsis thaliana]         120   6e-25
ref|XP_003625577.1| Ubiquitin-protein ligase [Medicago truncatul...   120   8e-25
gb|ABA98861.1| retrotransposon protein, putative, Ty1-copia subc...   119   2e-24
ref|XP_003613324.1| Cc-nbs-lrr resistance protein [Medicago trun...   119   2e-24
emb|CAN74741.1| hypothetical protein VITISV_025583 [Vitis vinifera]   117   7e-24
gb|ABD96963.1| hypothetical protein [Cleome spinosa]                  117   9e-24

>emb|CAN84102.1| hypothetical protein VITISV_041248 [Vitis vinifera]
          Length = 894

 Score = 148 bits (374), Expect = 3e-33
 Identities = 100/324 (30%), Positives = 171/324 (52%), Gaps = 4/324 (1%)
 Frame = +3

Query: 48 MAVPSYNLEPFNGKTDFSIWQQKMKGILIQQKVYKAVSGVYGEKDTAEK--KVEIDEYAY 221 M +++E F GK DF +W+ KM+ +L+QQ + A+ G T ++ K+E+ E A+ Sbjct: 1 MGTVKFDVEKFTGKNDFGLWRLKMRALLVQQGLQDALLGEKNLPXTMQEKHKIELLEKAH 60 Query: 222 SSIILNLSDSVLRKVGKLASAKALWEKLEELYTETSLPSXXXXXXXXXXXXXDLNKEIDE 401 +IIL+L D+ LR+V K SA L KLE LY SL + + I+E Sbjct: 61 GAIILSLGDTXLREVAKAKSAAKLLLKLESLYMTKSLANRLHKXIKLYTFKMTPSMSIEE 120 Query: 402 NLDVFTKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIKYGRDTITLDTVVNGLK 581 +LD F K+I D+K ++ I+LL ++ SY+++K AI YGRD +T D V + L Sbjct: 121 HLDHFNKIILDLKNIDIAVSNEDKAILLLTSLDASYTNMKEAIMYGRDILTFDEVQSILH 180 Query: 582 SKELDLKVNKGGRQNSGEVMHVRGRSQHRFGNQQKSDNHQNQNFHXXXXXXXXXXXXXXX 761 ++EL + + ++ GE +++RG+S+ R ++K +N ++++ Sbjct: 181 ARELHKQ--EESKEELGEGLNIRGKSKKR--EKKKGNNSKSRS------------KSKTK 224 Query: 762 XXXCYKCGEVGHYIRECPNKKGNQNFKNDQANMASSSENVGEIFMVSDLCEMHAVNSVKS 941 C+ C + GH+ ++CP+ + N K + N G+ M+ D + V +V Sbjct: 225 KFKCFICHKEGHFKKDCPDMRQNTXKK---------TMNEGDATMILDGYDNAGVLNV-- 273 Query: 942 NSVIDN--EWLIDSACTFHMSPFK 1007 + +D+ EW++DS C+FHM P K Sbjct: 274 -AEVDSGKEWILDSGCSFHMCPIK 296

>emb|CAB40039.1| putative retrotransposon [Arabidopsis thaliana]
 gi|7267743|emb|CAB78169.1| putative retrotransposon [Arabidopsis thaliana]
          Length = 1230

 Score = 134 bits (338), Expect = 4e-29
 Identities = 94/341 (27%), Positives = 165/341 (48%), Gaps = 24/341 (7%)
 Frame = +3

Query: 48 MAVPSYNLEPFNGKTDFSIWQQKMK------GILIQQKVYKAVS--------GVYGEKDT 185 M+ +E F+G D+++W++K+ G+ + + ++VS G EK Sbjct: 1 MSSARVEMEKFDGHGDYTLWKEKLMAHMDLLGLTVALRETQSVSDPLESEEEGKESEKGD 60 Query: 186 AEKKVEID-EYAYSSIILNLSDSVLRKVGKLASAKALWEKLEELYTETSLPSXXXXXXXX 362 E +E + A S+I+L++SD VLRK
K +A ++ E L++LY +LP+ Sbjct: 61 KEALMEEKRQKARSTIVLSVSDQVLRKSKKEKTAPSMLEALDKLYMSKALPNRIYLKQKL 120 Query: 363 XXXXXDLNKEIDENLDVFTKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIKYG- 539 N ++ N+D F +LI D++ T D+ I+LL ++P+ + +K +KYG Sbjct: 121 YSYKMQENLSVEGNIDEFLRLIADLENTNVLVSDEDQAILLLMSLPKQFDQLKDTLKYGS 180 Query: 540 -RDTITLDTVVNGLKSKELDLKVNKGGRQNSGEVMHVRGRSQHRFGNQQKSDNHQNQNFH 716 R T+++D VV + SKEL+L NK + E ++V+ + + R ++QK ++ ++ Sbjct: 181 GRTTLSVDEVVAAIYSKELELGSNKKSIRGQAEGLYVKDKPETRGMSEQKEKGNKGRS-- 238 Query: 717 XXXXXXXXXXXXXXXXXXCYKCGEVGHYIRECPNKKGNQNFKNDQAN-------MASSSE 875 C+ CGE GH+ CPNK QN DQA+ + Sbjct: 239 ---------RSRSKGWKGCWICGEEGHFKTSCPNKGKQQNKGKDQASGSKGEAATIKGNT 289 Query: 876 NVGEIFMVSDLCEMHAVNSVKSNSVIDNEWLIDSACTFHMS 998 + G + VS+ VN + NEW++D+ C +HM+ Sbjct: 290 SEGSGYYVSEALHSTDVN-------LGNEWVMDTGCNYHMT 323 >emb|CAJ09951.2| putative gag-pol polyprotein [Citrus sinensis] Length = 1334 Score = 134 bits (337), Expect = 6e-29 Identities = 92/325 (28%), Positives = 158/325 (48%), Gaps = 5/325 (1%) Frame = +3 Query: 48 MAVPSYNLEPFNGKTDFSIWQQKMKGILIQQKVYKAVSGVYGEKDTA----EKKVEIDEY 215 M++P + +E F DFS+W+ KMK +L+ Q + A+ E T +K+ +I Sbjct: 1 MSLPRHEIEKFTIGGDFSLWKLKMKALLVHQGLESALDEEDLEASTGSGIDDKRRQIQNR 60 Query: 216 AYSSIILNLSDSVLRKVGKLASAKALWEKLEELYTETSLPSXXXXXXXXXXXXXDLNKEI 395 A+S++IL+L DS+LR++ + +A +W K+E L + SL I Sbjct: 61 AHSTLILSLGDSILREISEEKTALGIWNKVETLCMKKSLAHRLFLKKRLYTFSMREGVTI 120 Query: 396 DENLDVFTKLIQDIK-LTGDKNIDDYTPIVLLNAIPESYSDVKSAIKYGRDTITLDTVVN 572 +++D F K+I D++ + K D+ LL+++P+SY + YGR T+TL+ V Sbjct: 121 QDHIDTFNKIILDLEGVENVKICDEDKAFFLLSSLPKSYEGFVDTMLYGRTTLTLEDVKA 180 Query: 573 GLKSKELDLKVNKGGRQNSGEVMHVRGRSQHRFGNQQKSDNHQNQNFHXXXXXXXXXXXX 752 L SKE ++ N ++GE + R +K + +N+N Sbjct: 181 SLSSKE--IQKNCELETSNGEGLMAR---------TEKKKDQKNKNQGKGHGKNQETADK 229 Query: 753 XXXXXXCYKCGEVGHYIRECPNKKGNQNFKNDQANMASSSENVGEIFMVSDLCEMHAVNS 932 C+ C + GHYIR+C KK ++ + + A +S++ + + +DL Sbjct: 230 KKKKRKCFYCRKEGHYIRDCFEKKKKES-QEKSGDAAVASDDGSDGYQSADLL------- 281 Query: 933 
VKSNSVIDNEWLIDSACTFHMSPFK 1007 V SNS +W+IDS C+FH+ P K Sbjct: 282 VASNSNTKGQWVIDSGCSFHLCPEK 306 >emb|CAN61630.1| hypothetical protein VITISV_003191 [Vitis vinifera] Length = 1208 Score = 134 bits (336), Expect = 7e-29 Identities = 82/263 (31%), Positives = 142/263 (53%), Gaps = 2/263 (0%) Frame = +3 Query: 48 MAVPSYNLEPFNGKTDFSIWQQKMKGILIQQKVYKAVSGVYGEKDTAEKK--VEIDEYAY 221 M +++E F GK DF + + KM+ +L+QQ + A+ G T ++K +E+ E A+ Sbjct: 1 MGTAKFDVEKFTGKNDFGLXRLKMRALLVQQGLQDALLGEKNLPSTMQEKQKIELLEKAH 60 Query: 222 SSIILNLSDSVLRKVGKLASAKALWEKLEELYTETSLPSXXXXXXXXXXXXXDLNKEIDE 401 S+IIL+L D+VLR+ K SA +W KLE LY SL + I+ Sbjct: 61 SAIILSLGDTVLREXAKAKSAAEVWLKLESLYMTKSLANRLHKKIKLYTFKMTPGMSIEX 120 Query: 402 NLDVFTKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIKYGRDTITLDTVVNGLK 581 +LD F K+I D++ D+ I+LL ++ SY+++K AI YGRD++T D V + L Sbjct: 121 HLDHFNKIILDLENIDITISDEDKAILLLTSLDASYTNMKDAIMYGRDSLTFDEVQSILH 180 Query: 582 SKELDLKVNKGGRQNSGEVMHVRGRSQHRFGNQQKSDNHQNQNFHXXXXXXXXXXXXXXX 761 ++E L+ + ++ SGE +++RGRS+ R ++K N ++++ Sbjct: 181 ARE--LQKQEESKEESGEGLNIRGRSEKR---EKKGKNSKSRS------------KSKTK 223 Query: 762 XXXCYKCGEVGHYIRECPNKKGN 830 C+ C + GH+ ++CP+++ N Sbjct: 224 KFKCFICHKEGHFKKDCPDRRQN 246 >gb|AAC62132.1| copia-like retroelement pol polyprotein [Arabidopsis thaliana] Length = 1137 Score = 133 bits (334), Expect = 1e-28 Identities = 90/288 (31%), Positives = 141/288 (48%), Gaps = 10/288 (3%) Frame = +3 Query: 174 EKDTAEKKVEI---------DEYAYSSIILNLSDSVLRKVGKLASAKALWEKLEELYTET 326 E D A+KK I DE A I +N+ D VLR + +A W L++LY Sbjct: 21 ESDPAKKKQRIEEEKARIDQDEKAMDMIFINVGDKVLRNIENSKTAAEAWATLDKLYLVK 80 Query: 327 SLPSXXXXXXXXXXXXXDLNKEIDENLDVFTKLIQDIKLTGDKNIDDYTPIVLLNAIPES 506 SLP+ +K ++EN+D F K+I D+ + D+ I++L+A+P+S Sbjct: 81 SLPNRVYLQLKVYNYRMQDSKTLEENVDEFQKMISDLNNLQIQVPDEVQAILILSALPDS 140 Query: 507 YSDVKSAIKYGRDTITLDTVVNGLKSKELDLKVNKGGRQNSGEVMHVRGRSQHRFGNQQK 686 Y +K +KYGR+ I LD V++ KSKEL+L+ + GG + GE ++VRG+SQ R + K Sbjct: 141 YDMLKETLKYGREGIKLDDVISAAKSKELELRDSSGGSRPVGEGLYVRGKSQARGSDGPK 200 Query: 687 
SDNHQNQNFHXXXXXXXXXXXXXXXXXXCYKCGEVGHYIRECPNKKGNQNFKNDQANMAS 866 S + C+ CG+ GH+ R+C + + ++AN A Sbjct: 201 STEGKK---------------------VCWICGKEGHFKRQC-----YKWLEKNKANGAG 234 Query: 867 SSENVGEIFMVSDLCEMHAVNSVKSNSVID-NEWLIDSACTFHMSPFK 1007 + V + DL + A S D EW++D+ C+FHM+P K Sbjct: 235 ETALVKD--DAQDLVGLVASEVNMSEGKDDQEEWIMDTGCSFHMTPRK 280 >emb|CAN80304.1| hypothetical protein VITISV_017821 [Vitis vinifera] Length = 939 Score = 130 bits (326), Expect = 1e-27 Identities = 77/203 (37%), Positives = 119/203 (58%), Gaps = 5/203 (2%) Frame = +3 Query: 69 LEPFNGKTDFSIWQQKMKGILIQQKVYKAVSGVYGEKD-----TAEKKVEIDEYAYSSII 233 +E F GK DF++W+ +MK IL QQ V A+ E + TA++K +IDE AY II Sbjct: 8 IERFIGKNDFNVWRMRMKAILFQQGVKDALKD---ESELPVTMTAKEKSDIDEKAYHLII 64 Query: 234 LNLSDSVLRKVGKLASAKALWEKLEELYTETSLPSXXXXXXXXXXXXXDLNKEIDENLDV 413 L L D LR+ + +AK +W KLE+LY + SL + ++ I +NLD Sbjct: 65 LALGDKALREFSEETTAKGVWNKLEQLYMQNSLSNRLYLKERLYGFKMQEDRSIADNLDD 124 Query: 414 FTKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIKYGRDTITLDTVVNGLKSKEL 593 F K++ ++ G K D+ +++L ++P YS+ K +KYGR T+TL+ V + L+SKEL Sbjct: 125 FAKIVLEMSNIGIKVDDEDKAVLVLKSLPGLYSNFKETMKYGRKTLTLEEVQSALRSKEL 184 Query: 594 DLKVNKGGRQNSGEVMHVRGRSQ 662 +LK KG ++GE + +RGR + Sbjct: 185 ELK-KKG---SNGEGLSIRGRKK 203 >gb|AAK29467.1| polyprotein-like [Solanum chilense] Length = 1328 Score = 129 bits (324), Expect = 2e-27 Identities = 97/331 (29%), Positives = 156/331 (47%), Gaps = 10/331 (3%) Frame = +3 Query: 48 MAVPSYNLEPFNG-KTDFSIWQQKMKGILIQQKVYKAVSGVYGEKDTA--EKKVEIDEYA 218 M+ Y + FNG K FS+WQ++MK +LIQQ ++KA+ G + ++ E E+DE A Sbjct: 1 MSGVKYEVAKFNGDKPVFSMWQRRMKDLLIQQGLHKALGGKSKKPESMKLEDWEELDEKA 60 Query: 219 YSSIILNLSDSVLRKVGKLASAKALWEKLEELYTETSLPSXXXXXXXXXXXXXDLNKEID 398 S+I L+L+D V+ + SA +W KLE LY +L + D Sbjct: 61 ASAIRLHLTDDVVNNIVDEESACGIWTKLENLYMSKTLTNKLYLKKQLYTLHMDEGTNFL 120 Query: 399 ENLDVFTKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIKYGRDTITLDTVVNGL 578 +L+V LI + G K ++ IVLLN++P SY + + I +G+D+I L V + L Sbjct: 121 
SHLNVLNGLITQLANLGVKIEEEDKRIVLLNSLPSSYDTLSTTILHGKDSIQLKDVTSAL 180 Query: 579 KSKELDLKVNKGGRQNSGEVM--HVRGRSQHRFGNQQKSDNHQNQNFHXXXXXXXXXXXX 752 E K+ K +N G+V RGRS R + N+ Sbjct: 181 LLNE---KMRK-KPENHGQVFITESRGRSYQR----------SSSNYGRSGARGKSKVRS 226 Query: 753 XXXXXXCYKCGEVGHYIRECPNKK----GNQNFKNDQANMASSSENVGEIFMVSDLCE-M 917 CY C + GH+ R+CPN K + KND A N + ++++ E M Sbjct: 227 KSKARNCYNCDQPGHFKRDCPNPKRGKGESSGQKNDDNTAAMVQNNDDVVLLINEEEECM 286 Query: 918 HAVNSVKSNSVIDNEWLIDSACTFHMSPFKN 1010 H + ++EW++D+A ++H +P ++ Sbjct: 287 HLAGT-------ESEWVVDTAASYHATPVRD 310 >emb|CAN72567.1| hypothetical protein VITISV_044177 [Vitis vinifera] Length = 950 Score = 129 bits (323), Expect = 2e-27 Identities = 91/322 (28%), Positives = 160/322 (49%), Gaps = 5/322 (1%) Frame = +3 Query: 48 MAVPSYNLEPFNGKTDFSIWQQKMKGILIQQKVYKAVSG--VYGEKDTAEKKVEIDEYAY 221 M+ + +E FNG DF++W+ KMK +L+QQK +A+ G TA +K E+ A+ Sbjct: 1 MSSQKFEVEKFNGSNDFTLWKLKMKALLVQQKCAQAIEGEETLPVGLTAVEKEEVVSRAH 60 Query: 222 SSIILNLSDSVLRKVGKLASAKALWEKLEELYTETSLPSXXXXXXXXXXXXXDLNKEIDE 401 S+I+L+L+D VLR+V +A LW K E Y + SL + ++ + Sbjct: 61 SAILLSLADEVLREVADETTAVGLWRKFESKYQKKSLTNRLYQKRQLHTLKMSEGMQVRD 120 Query: 402 NLDVFTKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIKYGRDTITLDTVVNGLK 581 +L+ F ++I D+ G K ++ ++LL ++P SY + + YGR++I+ + V + L+ Sbjct: 121 HLNNFNRIILDLNGVGVKVEEEDQAMILLCSLPSSYENFVDTMMYGRBSISXNDVKDALQ 180 Query: 582 SKELDLKVNKGGRQNSGE--VMHVRGRSQHRFGNQQKSDNHQNQNFHXXXXXXXXXXXXX 755 SKEL K+ G + S E + RGRS R G + +++ Sbjct: 181 SKELQ-KLVSGSEEGSVETGLTVSRGRSMERNGGGRSKSXSKSK---------------- 223 Query: 756 XXXXXCYKCGEVGHYIRECPNK-KGNQNFKNDQANMASSSENVGEIFMVSDLCEMHAVNS 932 C+ E GH+ + CP + KG N A + + ++ E SD E V + Sbjct: 224 -AAMRCFHXKEKGHFRKNCPQRQKGIGXGSNGNAQVVVAQKD-SEKQDSSDEGEGGDVLT 281 Query: 933 VKSNSVIDNEWLIDSACTFHMS 998 V ++S ++ W++D+ ++HM+ Sbjct: 282 VSTSSSAES-WILDTGASYHMA 302 >sp|P10978.1|POLX_TOBAC RecName: Full=Retrovirus-related Pol polyprotein from transposon TNT 1-94; Includes: RecName: Full=Protease; Includes: RecName: Full=Reverse 
transcriptase; Includes: RecName: Full=Endonuclease gi|20045|emb|CAA32025.1| unnamed protein product [Nicotiana tabacum] Length = 1328 Score = 127 bits (320), Expect = 5e-27 Identities = 96/329 (29%), Positives = 157/329 (47%), Gaps = 8/329 (2%) Frame = +3 Query: 48 MAVPSYNLEPFNGKTDFSIWQQKMKGILIQQKVYKAVSGVYGEKDT--AEKKVEIDEYAY 221 M+ Y + FNG FS WQ++M+ +LIQQ ++K + + DT AE ++DE A Sbjct: 1 MSGVKYEVAKFNGDNGFSTWQRRMRDLLIQQGLHKVLDVDSKKPDTMKAEDWADLDERAA 60 Query: 222 SSIILNLSDSVLRKVGKLASAKALWEKLEELYTETSLPSXXXXXXXXXXXXXDLNKEIDE 401 S+I L+LSD V+ + +A+ +W +LE LY +L + Sbjct: 61 SAIRLHLSDDVVNNIIDEDTARGIWTRLESLYMSKTLTNKLYLKKQLYALHMSEGTNFLS 120 Query: 402 NLDVFTKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIKYGRDTITLDTVVNGLK 581 +L+VF LI + G K ++ I+LLN++P SY ++ + I +G+ TI L V + L Sbjct: 121 HLNVFNGLITQLANLGVKIEEEDKAILLLNSLPSSYDNLATTILHGKTTIELKDVTSALL 180 Query: 582 SKELDLKVNKGGRQNSGEVMHVRGRSQHRFGNQQKSDNHQNQNFHXXXXXXXXXXXXXXX 761 E K+ K +N G+ + GR + + Q+S N N+ Sbjct: 181 LNE---KMRK-KPENQGQALITEGRGR----SYQRSSN----NYGRSGARGKSKNRSKSR 228 Query: 762 XXXCYKCGEVGHYIRECPN-KKG---NQNFKNDQ--ANMASSSENVGEIFMVSDLCEMHA 923 CY C + GH+ R+CPN +KG KND A M +++NV + C MH Sbjct: 229 VRNCYNCNQPGHFKRDCPNPRKGKGETSGQKNDDNTAAMVQNNDNVVLFINEEEEC-MHL 287 Query: 924 VNSVKSNSVIDNEWLIDSACTFHMSPFKN 1010 S ++EW++D+A + H +P ++ Sbjct: 288 -------SGPESEWVVDTAASHHATPVRD 309 >gb|EOY22705.1| Transducin/WD40 repeat-like superfamily protein [Theobroma cacao] Length = 1029 Score = 126 bits (317), Expect = 1e-26 Identities = 98/332 (29%), Positives = 153/332 (46%), Gaps = 25/332 (7%) Frame = +3 Query: 42 LKMAVPS--YNLEPFNGKTDFSIWQQKMKGILIQQKVYKAVSGVY--------GEKDTAE 191 L MA S Y +E FNG+ DFS+W+ KM+ +L+QQ + KA+ G GEKD Sbjct: 121 LAMATSSTKYEIEKFNGRNDFSLWRVKMRALLVQQGLLKALKGKEHLPSNLSDGEKDDLM 180 Query: 192 KKVEIDEYAYSSIILNLSDSVLRKVGKLASAKALWEKLEELYTETSLPSXXXXXXXXXXX 371 KK A+S I+L LSD VLR+V SA A+W KLE +Y SL + Sbjct: 181 KK------AHSVILLALSDEVLREVTDEESAAAVWFKLESIYMTKSLTNRLYMKQRLYTL 234 Query: 372 
XXDLNKEIDENLDVFTKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIKYGRDTI 551 ++ ++D F ++I D+K K D+ ++LL +P SY + + YGRDT+ Sbjct: 235 KMSEGTSVNTHIDEFNRVILDLKNIDVKIEDEDLALILLCYLPPSYENFVDTMLYGRDTL 294 Query: 552 TLDTVVNGLKSKELDLKVNKGGRQNSGEVMHV-RGRSQHRFGNQQKSDNHQNQNFHXXXX 728 T + V L KEL KV +N E + V RGR + + G +K + Sbjct: 295 TFEDVRASLNFKELKKKVGGIRNENQAEGLVVNRGRGKEK-GLDRKGKSRAK-------- 345 Query: 729 XXXXXXXXXXXXXXCYKCGEVGHYIRECPNKKGNQNFKNDQ--ANMASSS----ENVGEI 890 C+ CG+ GH+ ++C K ++ F + AN+ E + Sbjct: 346 -----------GKTCWNCGQKGHFRQDCTKFKDDEKFNKSENTANVVGDDFDTFEETDNV 394 Query: 891 FMVSDL--------CEMHAVNSVKSNSVIDNE 962 +++ E++A+ +V+ +S I E Sbjct: 395 LAITNYQEVGKQVELEINALVTVRDDSEIQKE 426 >gb|AAD32759.1| putative retroelement pol polyprotein [Arabidopsis thaliana] Length = 1356 Score = 126 bits (317), Expect = 1e-26 Identities = 85/339 (25%), Positives = 161/339 (47%), Gaps = 22/339 (6%) Frame = +3 Query: 48 MAVPSYNLEPFNGKTDFSIWQQKMKGILIQQKVYKAVSGVYGEKDTAEKKVEIDEY---- 215 M+ +E F+G+ D+++W++K+ L + + + + T EKK +DE Sbjct: 1 MSTARIEVEKFDGRGDYTMWKEKL---LAHMDILGLNTALKESESTGEKKSVLDESDEDY 57 Query: 216 ----------------AYSSIILNLSDSVLRKVGKLASAKALWEKLEELYTETSLPSXXX 347 A S+I+L+++D VLRK+ K ++A A+ L++LY +LP+ Sbjct: 58 EEKLEKFEALEEKKKKARSAIVLSVTDRVLRKIKKESTAAAMLLALDKLYMSKALPNRIY 117 Query: 348 XXXXXXXXXXDLNKEIDENLDVFTKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSA 527 N ++ N+D F ++I D++ D+ I+LL A+P+++ +K Sbjct: 118 PKQKLYSFKMSENLSVEGNIDEFLQIITDLENMNVIISDEDQAILLLTALPKAFDQLKDT 177 Query: 528 IKY--GRDTITLDTVVNGLKSKELDLKVNKGGRQNSGEVMHVRGRSQHRFGNQQKSDNHQ 701 +KY G+ +TLD V + SKEL+L K + E ++V+ +++++ +QK Sbjct: 178 LKYSSGKSILTLDEVAAAIYSKELELGSVKKSIKVQAEGLYVKDKNENKGKGEQKGKGKG 237 Query: 702 NQNFHXXXXXXXXXXXXXXXXXXCYKCGEVGHYIRECPNKKGNQNFKNDQANMASSSENV 881 + C+ CGE GH+ CPN+ Q FK Q SS Sbjct: 238 KKG-------------KSKKKPGCWTCGEEGHFRSSCPNQNKPQ-FKQSQVVKGESSGGK 283 Query: 882 GEIFMVSDLCEMHAVNSVKSNSVIDNEWLIDSACTFHMS 998 G + + A++S + + +++EW++D+ C++HM+ Sbjct: 284 GNLAEAAGYYVSEALSSTEVH--LEDEWILDTGCSYHMT 320 >gb|AAD23679.1| 
putative retroelement pol polyprotein [Arabidopsis thaliana] Length = 838 Score = 125 bits (315), Expect = 2e-26 Identities = 90/332 (27%), Positives = 163/332 (49%), Gaps = 21/332 (6%) Frame = +3 Query: 69 LEPFNGKTDFSIWQQKMK------GILIQQKVYKAVSGVYGEKDT------AEKKV--EI 206 +E +G+ D+ +W++K+ G+L + +A+ +T E KV E Sbjct: 8 VEKLDGEGDYVLWKEKLLAHIELLGLLEGLEEDEAIEEEESTAETDSLLTKTEDKVLKEK 67 Query: 207 DEYAYSSIILNLSDSVLRKVGKLASAKALWEKLEELYTETSLPSXXXXXXXXXXXXXDLN 386 A S++IL+L + VLRKV K +A + L++L+ SLP+ + Sbjct: 68 RGKARSTVILSLGNHVLRKVIKEKTAAGMIRVLDKLFMAKSLPNRIYLKQRLYGYKMSDS 127 Query: 387 KEIDENLDVFTKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIKYGRDTITLDTV 566 I+EN++ F KLI D++ D+ IVLL ++P+ + +K +KYG+ T+ LD + Sbjct: 128 MTIEENVNDFFKLISDLENVKVSVPDEDQAIVLLMSLPKQFDQLKDTLKYGKTTLALDEI 187 Query: 567 VNGLKSKELDLKVNKGGRQNSGEVMHV--RGRSQHRFGNQQKSDNHQNQNFHXXXXXXXX 740 ++SK L+L + +NS + + V RGRS+ R + S+ +++Q+ Sbjct: 188 TGAIRSKVLELGASGKMLKNSSDALFVQDRGRSEKR---DKSSERNKSQS---------- 234 Query: 741 XXXXXXXXXXCYKCGEVGHYIREC-----PNKKGNQNFKNDQANMASSSENVGEIFMVSD 905 C+ CG+ GH+ ++C NKKGN + K + +N+ + + + + + Sbjct: 235 -RSKSREKKVCWVCGKEGHFKKQCYVWKEKNKKGNNSEKGESSNVIGQAADAAALAVREE 293 Query: 906 LCEMHAVNSVKSNSVIDNEWLIDSACTFHMSP 1001 S N +DNEW++D+ C+FHM+P Sbjct: 294 --------SNADNQEVDNEWIMDTGCSFHMTP 317 >gb|EOY09126.1| Uncharacterized protein TCM_024518 [Theobroma cacao] Length = 277 Score = 125 bits (314), Expect = 3e-26 Identities = 82/266 (30%), Positives = 130/266 (48%), Gaps = 3/266 (1%) Frame = +3 Query: 63 YNLEPFNGKTDFSIWQQKMKGILIQQKVYKAVSGV--YGEKDTAEKKVEIDEYAYSSIIL 236 Y +E FNG+ DFS+W+ KM +L+QQ + KA+ G + +K ++ E A+S+I+L Sbjct: 8 YEIEKFNGRNDFSLWRVKMCALLVQQGLLKALKGKEHLPSNLSDSEKDDLMEKAHSAILL 67 Query: 237 NLSDSVLRKVGKLASAKALWEKLEELYTETSLPSXXXXXXXXXXXXXDLNKEIDENLDVF 416 LSD VLR+V SA A+W KLE +Y SL + ++ ++D F Sbjct: 68 TLSDEVLREVTDEESAAAMWFKLESIYITKSLTNRLYMKQRLYTLKMSEGTSVNTHIDEF 127 Query: 417 TKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIKYGRDTITLDTVVNGLKSKELD 596 ++I D+K K D+ ++LL +P SY + + YGRDT+T + V L SKEL Sbjct: 
128 NRVILDLKNIDVKIEDEDLALILLCYLPPSYENFVDTMLYGRDTLTFEDVRAYLNSKELK 187 Query: 597 LKVNKGGRQNSGEVMHV-RGRSQHRFGNQQKSDNHQNQNFHXXXXXXXXXXXXXXXXXXC 773 KV +N E + V RGR + + G +K + C Sbjct: 188 KKVGGIRNENQAEGLVVNRGRGKEK-GLDKKGKSRAK-------------------GKTC 227 Query: 774 YKCGEVGHYIRECPNKKGNQNFKNDQ 851 + CG+ GH+ ++C K ++ F + Sbjct: 228 WNCGQKGHFRQDCTKFKDDEKFNKSE 253 >gb|AAP53866.2| retrotransposon protein, putative, Ty1-copia subclass [Oryza sativa Japonica Group] Length = 415 Score = 123 bits (309), Expect = 1e-25 Identities = 85/318 (26%), Positives = 146/318 (45%), Gaps = 2/318 (0%) Frame = +3 Query: 51 AVPSYNLEPFNGKTDFSIWQQKMKGILIQQKVYKAVSGVYGEKDTAEKKVEIDEYAYSSI 230 AV + + F+G +F +WQ ++K +L QQ + KA+ EK A+K E+ A ++I Sbjct: 5 AVSKFEVVKFDGTGNFVLWQMRLKDLLAQQGISKALQETMPEKIDADKWNEMKAQAAATI 64 Query: 231 ILNLSDSVLRKVGKLASAKALWEKLEELYTETSLPSXXXXXXXXXXXXXDLNKEIDENLD 410 L+LSDSV+ +V S K +W+KL L+ SL S ++ +++D Sbjct: 65 RLSLSDSVMYQVMDEKSPKEIWDKLASLHMSKSLTSKLYLKQQLYGLQVQEESDLRKHVD 124 Query: 411 VFTKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIKYGRDTITLDTVVNGLKSKE 590 VF +L+ D+ K D+ I+LL ++P SY V + + +G+DT+ + +++ L +++ Sbjct: 125 VFNQLVVDLSKLDVKLDDEDKAIILLCSLPLSYEHVVTTLTHGKDTVKTEEIISSLLARD 184 Query: 591 L-DLKVNKGGRQNSGEVMHVRGRSQHRFGNQQKSDNHQNQNFHXXXXXXXXXXXXXXXXX 767 L K N+ + + G+ + V+ + H G + + Sbjct: 185 LRRSKKNEATKASQGKSLLVKDKHDHEAGVSKSKEK----------------------GA 222 Query: 768 XCYKCGEVGHYIRECP-NKKGNQNFKNDQANMASSSENVGEIFMVSDLCEMHAVNSVKSN 944 CYKC E GH R CP KK + A S EI VSD Sbjct: 223 RCYKCHEFGHIRRNCPLLKKRKGGIASLAARGDDSDSGSHEILTVSD------------- 269 Query: 945 SVIDNEWLIDSACTFHMS 998 + W++DSA ++H++ Sbjct: 270 EMSGEAWMLDSASSYHVT 287 >emb|CAB75481.1| copia-like polyprotein [Arabidopsis thaliana] Length = 1363 Score = 120 bits (302), Expect = 6e-25 Identities = 84/334 (25%), Positives = 161/334 (48%), Gaps = 24/334 (7%) Frame = +3 Query: 69 LEPFNGKTDFSIWQQKM---------KGILIQQK--VYKAVSGVYGEKDTAEKKVEIDEY 215 +E F+G+ D+++W++K+ +L + + + K ++D E++ +++ + Sbjct: 8 
VEKFDGRGDYTMWKEKLLAHIDMLGLSAVLRESETPMGKERDSEKSDEDEKEEREKMEAF 67 Query: 216 ------AYSSIILNLSDSVLRKVGKLASAKALWEKLEELYTETSLPSXXXXXXXXXXXXX 377 A S+I+L++SD VLRK+ K SA A+ E L+ LY +LP+ Sbjct: 68 EEKKRKARSTIVLSVSDRVLRKIKKETSAAAMLEALDRLYMSKALPNRIYLKQKLYSFKM 127 Query: 378 DLNKEIDENLDVFTKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIKY--GRDTI 551 N I+ N+D F ++ D++ D+ I+LL ++P+ + +K +KY G+ + Sbjct: 128 SENLSIEGNIDEFLHIVADLENLNVLVSDEDQAILLLMSLPKPFDQLKDTLKYSSGKTVL 187 Query: 552 TLDTVVNGLKSKELDLKVNKGGRQNSGEVMHVRGRSQHRFGNQQKSDNHQNQNFHXXXXX 731 +LD V + S+EL+ K + E ++V+ ++++R ++QK ++ Sbjct: 188 SLDEVAAAIYSRELEFGSVKKSIKGQAEGLYVKDKAENRGRSEQKDKGKGKRS------- 240 Query: 732 XXXXXXXXXXXXXCYKCGEVGHYIRECPNKKGNQNFKNDQANMASSSENVGEIFMVS--- 902 C+ CGE GH CPNK Q FKN +N SS G + S Sbjct: 241 ------KSKSKRGCWICGEDGHLKSTCPNKNKPQ-FKNQGSNKGESSGGKGNLVEGSVNF 293 Query: 903 -DLCEMHAVNSVKSNSV-IDNEWLIDSACTFHMS 998 + M ++ S + +++EW++D+ C +HM+ Sbjct: 294 VESAGMFVSEALSSTDIHLEDEWIMDTGCIYHMT 327 >ref|XP_003625577.1| Ubiquitin-protein ligase [Medicago truncatula] gi|355500592|gb|AES81795.1| Ubiquitin-protein ligase [Medicago truncatula] Length = 1405 Score = 120 bits (301), Expect = 8e-25 Identities = 84/260 (32%), Positives = 130/260 (50%), Gaps = 4/260 (1%) Frame = +3 Query: 63 YNLEPFNGKTDFSIWQQKMKGILIQQKVYKAVSGVYGEKDTAEK--KVEIDEYAYSSIIL 236 +++E F G DF +W+ KM+ +LIQQK KA+ G T + K E+ + A S+++L Sbjct: 5 WDIEKFTGDNDFGLWKVKMEAVLIQQKCEKALKGEGSLPVTMSQAEKTEMVDKARSAVVL 64 Query: 237 NLSDSVLRKVGKLASAKALWEKLEELYTETSLPSXXXXXXXXXXXXXDLNKEIDENLDVF 416 L D VLR+V K A+A ++W KLE LY SL +K I E L F Sbjct: 65 CLGDKVLREVAKEATAASIWAKLESLYMTKSLAHRQFLKQQLYSFRMVESKAIMEQLTEF 124 Query: 417 TKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIKYGRD-TITLDTVVNGLKSKEL 593 K++ D++ + D+ I+LL A+P+S+ K + YG++ T+TL+ V L++KE Sbjct: 125 NKILDDLENIEVQLEDEDKAILLLCALPKSFESFKDTMLYGKEGTVTLEEVQAALRTKE- 183 Query: 594 DLKVNKGGRQNSGEVMHV-RGRSQHRFGNQQKSDNHQNQNFHXXXXXXXXXXXXXXXXXX 770 L +K GE + V RG R GN++KS N Sbjct: 184 -LTKSKDLTHEHGEGLSVTRGNGGGR-GNRRKSGNKSR--------------------FE 
221 Query: 771 CYKCGEVGHYIRECPNKKGN 830 C+ C ++GH+ ++CP GN Sbjct: 222 CFNCHKMGHFKKDCPEINGN 241 >gb|ABA98861.1| retrotransposon protein, putative, Ty1-copia subclass [Oryza sativa Japonica Group] Length = 425 Score = 119 bits (297), Expect = 2e-24 Identities = 86/318 (27%), Positives = 145/318 (45%), Gaps = 1/318 (0%) Frame = +3 Query: 48 MAVPSYNLEPFNGKTDFSIWQQKMKGILIQQKVYKAVSGVYGEKDTAEKKVEIDEYAYSS 227 M V + + F+G +F +WQ ++K +L QQ + KA+ EK A K E+ A ++ Sbjct: 4 MVVSKFEVVKFDGTGNFILWQMRLKDLLAQQGISKALEETMPEKMDAGKWEEMKAQAAAT 63 Query: 228 IILNLSDSVLRKVGKLASAKALWEKLEELYTETSLPSXXXXXXXXXXXXXDLNKEIDENL 407 I L+LSDSV+ V + K +W+KL LY SL S ++ +++ Sbjct: 64 IRLSLSDSVMYPVMDEKTPKEIWDKLASLYMSKSLTSKLYLKQQLYGLQMQEESDLRKHV 123 Query: 408 DVFTKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIKYGRDTITLDTVVNGLKSK 587 DVF +L+ D+ K D+ I+LL ++P SY V + + +G+DT+ + ++ L ++ Sbjct: 124 DVFNQLVVDLSKLDVKLDDEDMAIILLCSLPPSYEHVVTTLMHGKDTVKTEEKISSLLAR 183 Query: 588 EL-DLKVNKGGRQNSGEVMHVRGRSQHRFGNQQKSDNHQNQNFHXXXXXXXXXXXXXXXX 764 +L N+ + E + V+ + H G + D Sbjct: 184 DLRRSNKNEAMEASQAESLLVKAKHDHEAGVSKSKDK----------------------G 221 Query: 765 XXCYKCGEVGHYIRECPNKKGNQNFKNDQANMASSSENVGEIFMVSDLCEMHAVNSVKSN 944 CYKC E GH R CP K K A++A+ ++ SD H +V SN Sbjct: 222 ARCYKCHEFGHIRRNCPLLKKR---KGGIASLAARGDD-------SD-SSSHETLTV-SN 269 Query: 945 SVIDNEWLIDSACTFHMS 998 W++DSA ++H++ Sbjct: 270 EKSGEAWMLDSASSYHVT 287 >ref|XP_003613324.1| Cc-nbs-lrr resistance protein [Medicago truncatula] gi|355514659|gb|AES96282.1| Cc-nbs-lrr resistance protein [Medicago truncatula] Length = 1104 Score = 119 bits (297), Expect = 2e-24 Identities = 81/259 (31%), Positives = 127/259 (49%), Gaps = 3/259 (1%) Frame = +3 Query: 63 YNLEPFNGKTDFSIWQQKMKGILIQQKVYKAVSGVYGEKDTAEK--KVEIDEYAYSSIIL 236 +++E F G DF +W+ KM+ +LIQQK KA+ G T + K E+ + A S+++L Sbjct: 5 WDIEKFTGDNDFGLWKVKMEAVLIQQKCEKALKGEGSLPVTMSQAEKTEMVDKARSAVVL 64 Query: 237 NLSDSVLRKVGKLASAKALWEKLEELYTETSLPSXXXXXXXXXXXXXDLNKEIDENLDVF 416 L D VLR+V K A+A ++W KLE LY SL +K I E L F Sbjct: 65 
CLGDKVLREVAKEATAASIWAKLESLYMTKSLAHRQFLKQQLYSFRMVESKAIMEQLTEF 124 Query: 417 TKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIKYGRD-TITLDTVVNGLKSKEL 593 K++ D++ + D+ I+LL A+P+S+ K + YG++ T+TL+ V L++KEL Sbjct: 125 NKILDDLENIEVQLEDEDKAILLLCALPKSFESFKDTMLYGKEGTVTLEEVQAALRTKEL 184 Query: 594 DLKVNKGGRQNSGEVMHVRGRSQHRFGNQQKSDNHQNQNFHXXXXXXXXXXXXXXXXXXC 773 + G + RG R GN++KS N C Sbjct: 185 TKSKDLTHEYGDG-LSVTRGNGGGR-GNRRKSGNKSR--------------------FEC 222 Query: 774 YKCGEVGHYIRECPNKKGN 830 + C ++GH+ ++CP GN Sbjct: 223 FNCHKMGHFKKDCPEINGN 241 >emb|CAN74741.1| hypothetical protein VITISV_025583 [Vitis vinifera] Length = 253 Score = 117 bits (293), Expect = 7e-24 Identities = 77/264 (29%), Positives = 133/264 (50%), Gaps = 2/264 (0%) Frame = +3 Query: 48 MAVPSYNLEPFNGKTDFSIWQQKMKGILIQQKVYKAVSGVYGEKDTAEKK--VEIDEYAY 221 M +++E F K DF +W+ KM+ L+QQ + A+ T ++K +E+ E A+ Sbjct: 1 MGTAKFDVEEFTSKNDFRLWRLKMRAFLVQQGLQDALLREKNLLSTMQEKHKIELLEKAH 60 Query: 222 SSIILNLSDSVLRKVGKLASAKALWEKLEELYTETSLPSXXXXXXXXXXXXXDLNKEIDE 401 S+I+L+L D++LR+V K SA LW KLE LY SL + I+E Sbjct: 61 SAIVLSLGDTLLREVAKAKSAAELWLKLESLYMTKSLGNRLHKKIKLYTFKMIPGMSIEE 120 Query: 402 NLDVFTKLIQDIKLTGDKNIDDYTPIVLLNAIPESYSDVKSAIKYGRDTITLDTVVNGLK 581 +LD F K+I D++ +D+ I+L ++ SY+++K AI Y RD +T D V + L Sbjct: 121 HLDHFNKIILDLENIDIVVLDEDKAIMLPTSLDASYTNMKEAIMYERDNMTFDEVQSILH 180 Query: 582 SKELDLKVNKGGRQNSGEVMHVRGRSQHRFGNQQKSDNHQNQNFHXXXXXXXXXXXXXXX 761 ++E L+ + ++ SGE +++RGR + R +K N+ N Sbjct: 181 ARE--LQKQEESKEKSGEGLNIRGRYEKR----EKKGNNSNSR-----------SKFQDQ 223 Query: 762 XXXCYKCGEVGHYIRECPNKKGNQ 833 + + H+ ++CP+++ Q Sbjct: 224 EVQVFYLSQGRHFKKDCPDRRQTQ 247 >gb|ABD96963.1| hypothetical protein [Cleome spinosa] Length = 408 Score = 117 bits (292), Expect = 9e-24 Identities = 86/325 (26%), Positives = 143/325 (44%), Gaps = 41/325 (12%) Frame = +3 Query: 48 MAVPSY---NLEPFNGKTDFSIWQQKMK-------------------------------- 122 M+ P Y ++E F+GK DFS+W++KM Sbjct: 1 MSAPLYRGLDVEKFDGKGDFSLWKEKMSISLEILGLGDTLEDDPSTWGVADEGSPQTPAR 60 Query: 123 
-----GILIQQKVYKAVSGVYGEKDTAEK-KVEIDEYAYSSIILNLSDSVLRKVGKLASA 284 G L G G A K + E A + I+L L+D VLRKV +A Sbjct: 61 ETEDSGGLTSLSTGSLGPGAKGSGTPALKERQERSRRARNLIVLALADQVLRKVISERTA 120 Query: 285 KALWEKLEELYTETSLPSXXXXXXXXXXXXXDLNKEIDENLDVFTKLIQDIKLTGDKNID 464 +W KLE L+ E SLP+ D ++ I+ENLD+F KL+ D+ K + Sbjct: 121 FGIWRKLERLHIEQSLPNRMYLMQRVSGFRMDSSRTIEENLDIFQKLLSDLHSLNVKVEE 180 Query: 465 DYTPIVLLNAIPESYSDVKSAIKYGRDTITLDTVVNGLKSKELDLKVNKGGRQNSGEVMH 644 +Y + LLN++P +Y ++ +KY R TI+++ V + KEL+L + +GE + Sbjct: 181 EYQAVYLLNSLPPAYEQLREVLKYSRATISVEEVKAAARMKELELLAQGTLTRGTGEGLV 240 Query: 645 VRGRSQHRFGNQQKSDNHQNQNFHXXXXXXXXXXXXXXXXXXCYKCGEVGHYIRECPNKK 824 V+G+ + G ++K+ + C+ CG+ GHY +EC +++ Sbjct: 241 VKGKPEKSGGGKKKAKDQ----------------------VECWYCGKKGHYKKECRSRR 278 Query: 825 GNQNFKNDQANMASSSENVGEIFMV 899 + + + +AS E E+ +V Sbjct: 279 AKEETEG-KGVVASVQEYDSEVLLV 302