BLASTX nr result
ID: Catharanthus22_contig00000133
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00000133 (1362 letters)

Database: ./nr
37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

emb|CAN80304.1| hypothetical protein VITISV_017821 [Vitis vinifera]  177   7e-42
emb|CAN61630.1| hypothetical protein VITISV_003191 [Vitis vinifera]  163   1e-37
gb|EOY22705.1| Transducin/WD40 repeat-like superfamily protein [...  155   5e-35
gb|EOY09126.1| Uncharacterized protein TCM_024518 [Theobroma cacao]  154   6e-35
emb|CAN84102.1| hypothetical protein VITISV_041248 [Vitis vinifera]  152   2e-34
emb|CAN72567.1| hypothetical protein VITISV_044177 [Vitis vinifera]  152   4e-34
emb|CAN74741.1| hypothetical protein VITISV_025583 [Vitis vinifera]  150   9e-34
gb|EOY34202.1| Uncharacterized protein TCM_041944 [Theobroma cacao]  144   9e-32
gb|AAK29467.1| polyprotein-like [Solanum chilense]                   141   6e-31
ref|XP_003625577.1| Ubiquitin-protein ligase [Medicago truncatul...  140   2e-30
sp|P10978.1|POLX_TOBAC RecName: Full=Retrovirus-related Pol poly...  139   3e-30
ref|XP_003613324.1| Cc-nbs-lrr resistance protein [Medicago trun...  137   8e-30
gb|AAD23690.1| putative retroelement pol polyprotein [Arabidopsi...  137   1e-29
gb|ABO36622.1| copia LTR rider [Solanum lycopersicum] gi|1337118...  136   2e-29
gb|AAD32759.1| putative retroelement pol polyprotein [Arabidopsi...  135   4e-29
emb|CAJ09951.2| putative gag-pol polyprotein [Citrus sinensis]       135   5e-29
gb|AAD23679.1| putative retroelement pol polyprotein [Arabidopsi...  135   5e-29
dbj|BAB09923.1| copia-like retrotransposable element [Arabidopsi...
134 9e-29 emb|CAB75481.1| copia-like polyprotein [Arabidopsis thaliana] 134 1e-28 ref|XP_004165166.1| PREDICTED: uncharacterized protein LOC101225... 130 1e-27 >emb|CAN80304.1| hypothetical protein VITISV_017821 [Vitis vinifera] Length = 939 Score = 177 bits (450), Expect = 7e-42 Identities = 89/198 (44%), Positives = 135/198 (68%) Frame = -2 Query: 671 KTETDKFDGKSDFVMWRRKMKAVLVQNKTAPAICSPDKYPESWKGEVLSEKLGDAHSCLT 492 K+E ++F GK+DF +WR +MKA+L Q A+ + P + + S+ A+ + Sbjct: 5 KSEIERFIGKNDFNVWRMRMKAILFQQGVKDALKDESELPVTMTAKEKSDIDEKAYHLII 64 Query: 491 LHLTDNVLREIDEKDNAFEIWTKLEKLYLGKSLSNKIYLKEQLFGFKMSGSKSLSENIDD 312 L L D LRE E+ A +W KLE+LY+ SLSN++YLKE+L+GFKM +S+++N+DD Sbjct: 65 LALGDKALREFSEETTAKGVWNKLEQLYMQNSLSNRLYLKERLYGFKMQEDRSIADNLDD 124 Query: 311 LNKIVLELQNVGEKISDENVAVILLNSLPDSFNDVKSAIKYGRDDLTLADVKNALKSKDL 132 KIVLE+ N+G K+ DE+ AV++L SLP +++ K +KYGR LTL +V++AL+SK+L Sbjct: 125 FAKIVLEMSNIGIKVDDEDKAVLVLKSLPGLYSNFKETMKYGRKTLTLEEVQSALRSKEL 184 Query: 131 DLRKENKSNGENLYVRGR 78 +L+K+ SNGE L +RGR Sbjct: 185 ELKKKG-SNGEGLSIRGR 201 >emb|CAN61630.1| hypothetical protein VITISV_003191 [Vitis vinifera] Length = 1208 Score = 163 bits (413), Expect = 1e-37 Identities = 86/208 (41%), Positives = 134/208 (64%), Gaps = 1/208 (0%) Frame = -2 Query: 683 MSIQKTETDKFDGKSDFVMWRRKMKAVLVQNKTAPAICSPDKYPESWKGEVLSEKLGDAH 504 M K + +KF GK+DF + R KM+A+LVQ A+ P + + + E L AH Sbjct: 1 MGTAKFDVEKFTGKNDFGLXRLKMRALLVQQGLQDALLGEKNLPSTMQEKQKIELLEKAH 60 Query: 503 SCLTLHLTDNVLREIDEKDNAFEIWTKLEKLYLGKSLSNKIYLKEQLFGFKMSGSKSLSE 324 S + L L D VLRE + +A E+W KLE LY+ KSL+N+++ K +L+ FKM+ S+ Sbjct: 61 SAIILSLGDTVLREXAKAKSAAEVWLKLESLYMTKSLANRLHKKIKLYTFKMTPGMSIEX 120 Query: 323 NIDDLNKIVLELQNVGEKISDENVAVILLNSLPDSFNDVKSAIKYGRDDLTLADVKNALK 144 ++D NKI+L+L+N+ ISDE+ A++LL SL S+ ++K AI YGRD LT +V++ L Sbjct: 121 HLDHFNKIILDLENIDITISDEDKAILLLTSLDASYTNMKDAIMYGRDSLTFDEVQSILH 180 Query: 143 SKDLDLRKENK-SNGENLYVRGRVDRRE 63 +++L ++E+K +GE L +RGR ++RE Sbjct: 181 ARELQKQEESKEESGEGLNIRGRSEKRE 208 >gb|EOY22705.1| Transducin/WD40 
repeat-like superfamily protein [Theobroma cacao] Length = 1029 Score = 155 bits (391), Expect = 5e-35 Identities = 77/201 (38%), Positives = 129/201 (64%), Gaps = 5/201 (2%) Frame = -2 Query: 689 LTMSIQKTETDKFDGKSDFVMWRRKMKAVLVQNKTAPAICSPDKYPESWKGEVLSEKLGD 510 + S K E +KF+G++DF +WR KM+A+LVQ A+ + P + + + Sbjct: 123 MATSSTKYEIEKFNGRNDFSLWRVKMRALLVQQGLLKALKGKEHLPSNLSDGEKDDLMKK 182 Query: 509 AHSCLTLHLTDNVLREIDEKDNAFEIWTKLEKLYLGKSLSNKIYLKEQLFGFKMSGSKSL 330 AHS + L L+D VLRE+ ++++A +W KLE +Y+ KSL+N++Y+K++L+ KMS S+ Sbjct: 183 AHSVILLALSDEVLREVTDEESAAAVWFKLESIYMTKSLTNRLYMKQRLYTLKMSEGTSV 242 Query: 329 SENIDDLNKIVLELQNVGEKISDENVAVILLNSLPDSFNDVKSAIKYGRDDLTLADVKNA 150 + +ID+ N+++L+L+N+ KI DE++A+ILL LP S+ + + YGRD LT DV+ + Sbjct: 243 NTHIDEFNRVILDLKNIDVKIEDEDLALILLCYLPPSYENFVDTMLYGRDTLTFEDVRAS 302 Query: 149 LKSKDL-----DLRKENKSNG 102 L K+L +R EN++ G Sbjct: 303 LNFKELKKKVGGIRNENQAEG 323 >gb|EOY09126.1| Uncharacterized protein TCM_024518 [Theobroma cacao] Length = 277 Score = 154 bits (390), Expect = 6e-35 Identities = 78/201 (38%), Positives = 128/201 (63%), Gaps = 5/201 (2%) Frame = -2 Query: 689 LTMSIQKTETDKFDGKSDFVMWRRKMKAVLVQNKTAPAICSPDKYPESWKGEVLSEKLGD 510 + S K E +KF+G++DF +WR KM A+LVQ A+ + P + + + Sbjct: 1 MATSSTKYEIEKFNGRNDFSLWRVKMCALLVQQGLLKALKGKEHLPSNLSDSEKDDLMEK 60 Query: 509 AHSCLTLHLTDNVLREIDEKDNAFEIWTKLEKLYLGKSLSNKIYLKEQLFGFKMSGSKSL 330 AHS + L L+D VLRE+ ++++A +W KLE +Y+ KSL+N++Y+K++L+ KMS S+ Sbjct: 61 AHSAILLTLSDEVLREVTDEESAAAMWFKLESIYITKSLTNRLYMKQRLYTLKMSEGTSV 120 Query: 329 SENIDDLNKIVLELQNVGEKISDENVAVILLNSLPDSFNDVKSAIKYGRDDLTLADVKNA 150 + +ID+ N+++L+L+N+ KI DE++A+ILL LP S+ + + YGRD LT DV+ Sbjct: 121 NTHIDEFNRVILDLKNIDVKIEDEDLALILLCYLPPSYENFVDTMLYGRDTLTFEDVRAY 180 Query: 149 LKSKDL-----DLRKENKSNG 102 L SK+L +R EN++ G Sbjct: 181 LNSKELKKKVGGIRNENQAEG 201 >emb|CAN84102.1| hypothetical protein VITISV_041248 [Vitis vinifera] Length = 894 Score = 152 bits (385), Expect = 2e-34 Identities = 81/208 (38%), Positives = 132/208 (63%), Gaps = 1/208 (0%) Frame = -2 Query: 
683 MSIQKTETDKFDGKSDFVMWRRKMKAVLVQNKTAPAICSPDKYPESWKGEVLSEKLGDAH 504 M K + +KF GK+DF +WR KM+A+LVQ A+ P + + + E L AH Sbjct: 1 MGTVKFDVEKFTGKNDFGLWRLKMRALLVQQGLQDALLGEKNLPXTMQEKHKIELLEKAH 60 Query: 503 SCLTLHLTDNVLREIDEKDNAFEIWTKLEKLYLGKSLSNKIYLKEQLFGFKMSGSKSLSE 324 + L L D LRE+ + +A ++ KLE LY+ KSL+N+++ +L+ FKM+ S S+ E Sbjct: 61 GAIILSLGDTXLREVAKAKSAAKLLLKLESLYMTKSLANRLHKXIKLYTFKMTPSMSIEE 120 Query: 323 NIDDLNKIVLELQNVGEKISDENVAVILLNSLPDSFNDVKSAIKYGRDDLTLADVKNALK 144 ++D NKI+L+L+N+ +S+E+ A++LL SL S+ ++K AI YGRD LT +V++ L Sbjct: 121 HLDHFNKIILDLKNIDIAVSNEDKAILLLTSLDASYTNMKEAIMYGRDILTFDEVQSILH 180 Query: 143 SKDLDLRKENKSN-GENLYVRGRVDRRE 63 +++L ++E+K GE L +RG+ +RE Sbjct: 181 ARELHKQEESKEELGEGLNIRGKSKKRE 208 >emb|CAN72567.1| hypothetical protein VITISV_044177 [Vitis vinifera] Length = 950 Score = 152 bits (383), Expect = 4e-34 Identities = 77/184 (41%), Positives = 115/184 (62%) Frame = -2 Query: 683 MSIQKTETDKFDGKSDFVMWRRKMKAVLVQNKTAPAICSPDKYPESWKGEVLSEKLGDAH 504 MS QK E +KF+G +DF +W+ KMKA+LVQ K A AI + P E + AH Sbjct: 1 MSSQKFEVEKFNGSNDFTLWKLKMKALLVQQKCAQAIEGEETLPVGLTAVEKEEVVSRAH 60 Query: 503 SCLTLHLTDNVLREIDEKDNAFEIWTKLEKLYLGKSLSNKIYLKEQLFGFKMSGSKSLSE 324 S + L L D VLRE+ ++ A +W K E Y KSL+N++Y K QL KMS + + Sbjct: 61 SAILLSLADEVLREVADETTAVGLWRKFESKYQKKSLTNRLYQKRQLHTLKMSEGMQVRD 120 Query: 323 NIDDLNKIVLELQNVGEKISDENVAVILLNSLPDSFNDVKSAIKYGRDDLTLADVKNALK 144 ++++ N+I+L+L VG K+ +E+ A+ILL SLP S+ + + YGR+ ++ DVK+AL+ Sbjct: 121 HLNNFNRIILDLNGVGVKVEEEDQAMILLCSLPSSYENFVDTMMYGRBSISXNDVKDALQ 180 Query: 143 SKDL 132 SK+L Sbjct: 181 SKEL 184 >emb|CAN74741.1| hypothetical protein VITISV_025583 [Vitis vinifera] Length = 253 Score = 150 bits (380), Expect = 9e-34 Identities = 79/208 (37%), Positives = 130/208 (62%), Gaps = 1/208 (0%) Frame = -2 Query: 683 MSIQKTETDKFDGKSDFVMWRRKMKAVLVQNKTAPAICSPDKYPESWKGEVLSEKLGDAH 504 M K + ++F K+DF +WR KM+A LVQ A+ + + + E L AH Sbjct: 1 MGTAKFDVEEFTSKNDFRLWRLKMRAFLVQQGLQDALLREKNLLSTMQEKHKIELLEKAH 60 Query: 503 
SCLTLHLTDNVLREIDEKDNAFEIWTKLEKLYLGKSLSNKIYLKEQLFGFKMSGSKSLSE 324 S + L L D +LRE+ + +A E+W KLE LY+ KSL N+++ K +L+ FKM S+ E Sbjct: 61 SAIVLSLGDTLLREVAKAKSAAELWLKLESLYMTKSLGNRLHKKIKLYTFKMIPGMSIEE 120 Query: 323 NIDDLNKIVLELQNVGEKISDENVAVILLNSLPDSFNDVKSAIKYGRDDLTLADVKNALK 144 ++D NKI+L+L+N+ + DE+ A++L SL S+ ++K AI Y RD++T +V++ L Sbjct: 121 HLDHFNKIILDLENIDIVVLDEDKAIMLPTSLDASYTNMKEAIMYERDNMTFDEVQSILH 180 Query: 143 SKDLDLRKENK-SNGENLYVRGRVDRRE 63 +++L ++E+K +GE L +RGR ++RE Sbjct: 181 ARELQKQEESKEKSGEGLNIRGRYEKRE 208 >gb|EOY34202.1| Uncharacterized protein TCM_041944 [Theobroma cacao] Length = 698 Score = 144 bits (363), Expect = 9e-32 Identities = 70/186 (37%), Positives = 118/186 (63%) Frame = -2 Query: 689 LTMSIQKTETDKFDGKSDFVMWRRKMKAVLVQNKTAPAICSPDKYPESWKGEVLSEKLGD 510 + S K E +KF+G++DF +W KM A+LVQ A+ + + + Sbjct: 1 MVTSSTKYEIEKFNGRNDFSLWCVKMCALLVQQGLLKALKEKEHLLSNLSNGEKDNLMEK 60 Query: 509 AHSCLTLHLTDNVLREIDEKDNAFEIWTKLEKLYLGKSLSNKIYLKEQLFGFKMSGSKSL 330 AHS + L L+D V+RE+ ++++A +W KL+ +Y+ KSL N++Y+K++L+ KMS S+ Sbjct: 61 AHSAILLALSDEVIREVTDEESAIAVWLKLKSIYMTKSLMNRLYIKQRLYTLKMSEGTSV 120 Query: 329 SENIDDLNKIVLELQNVGEKISDENVAVILLNSLPDSFNDVKSAIKYGRDDLTLADVKNA 150 + +ID+ N+++L+L+N+ KI DE++A+ILL SLP S+ + + YGRD T DV+ + Sbjct: 121 NTHIDEFNRVILDLKNIDVKIEDEDLALILLCSLPPSYENFMDTMLYGRDTFTFEDVRAS 180 Query: 149 LKSKDL 132 L SK+L Sbjct: 181 LNSKEL 186 >gb|AAK29467.1| polyprotein-like [Solanum chilense] Length = 1328 Score = 141 bits (356), Expect = 6e-31 Identities = 79/203 (38%), Positives = 125/203 (61%), Gaps = 1/203 (0%) Frame = -2 Query: 683 MSIQKTETDKFDG-KSDFVMWRRKMKAVLVQNKTAPAICSPDKYPESWKGEVLSEKLGDA 507 MS K E KF+G K F MW+R+MK +L+Q A+ K PES K E E A Sbjct: 1 MSGVKYEVAKFNGDKPVFSMWQRRMKDLLIQQGLHKALGGKSKKPESMKLEDWEELDEKA 60 Query: 506 HSCLTLHLTDNVLREIDEKDNAFEIWTKLEKLYLGKSLSNKIYLKEQLFGFKMSGSKSLS 327 S + LHLTD+V+ I ++++A IWTKLE LY+ K+L+NK+YLK+QL+ M + Sbjct: 61 ASAIRLHLTDDVVNNIVDEESACGIWTKLENLYMSKTLTNKLYLKKQLYTLHMDEGTNFL 120 Query: 326 
ENIDDLNKIVLELQNVGEKISDENVAVILLNSLPDSFNDVKSAIKYGRDDLTLADVKNAL 147 +++ LN ++ +L N+G KI +E+ ++LLNSLP S++ + + I +G+D + L DV +AL Sbjct: 121 SHLNVLNGLITQLANLGVKIEEEDKRIVLLNSLPSSYDTLSTTILHGKDSIQLKDVTSAL 180 Query: 146 KSKDLDLRKENKSNGENLYVRGR 78 + +RK+ +++G+ R Sbjct: 181 LLNE-KMRKKPENHGQVFITESR 202 >ref|XP_003625577.1| Ubiquitin-protein ligase [Medicago truncatula] gi|355500592|gb|AES81795.1| Ubiquitin-protein ligase [Medicago truncatula] Length = 1405 Score = 140 bits (352), Expect = 2e-30 Identities = 79/210 (37%), Positives = 129/210 (61%), Gaps = 7/210 (3%) Frame = -2 Query: 671 KTETDKFDGKSDFVMWRRKMKAVLVQNKTAPAICSPDKYPESWKGEVLSEKLGDAHSCLT 492 K + +KF G +DF +W+ KM+AVL+Q K A+ P + +E + A S + Sbjct: 4 KWDIEKFTGDNDFGLWKVKMEAVLIQQKCEKALKGEGSLPVTMSQAEKTEMVDKARSAVV 63 Query: 491 LHLTDNVLREIDEKDNAFEIWTKLEKLYLGKSLSNKIYLKEQLFGFKMSGSKSLSENIDD 312 L L D VLRE+ ++ A IW KLE LY+ KSL+++ +LK+QL+ F+M SK++ E + + Sbjct: 64 LCLGDKVLREVAKEATAASIWAKLESLYMTKSLAHRQFLKQQLYSFRMVESKAIMEQLTE 123 Query: 311 LNKIVLELQNVGEKISDENVAVILLNSLPDSFNDVKSAIKYGRD-DLTLADVKNALKSKD 135 NKI+ +L+N+ ++ DE+ A++LL +LP SF K + YG++ +TL +V+ AL++K+ Sbjct: 124 FNKILDDLENIEVQLEDEDKAILLLCALPKSFESFKDTMLYGKEGTVTLEEVQAALRTKE 183 Query: 134 LDLRKE-NKSNGENLYVR-----GRVDRRE 63 L K+ +GE L V GR +RR+ Sbjct: 184 LTKSKDLTHEHGEGLSVTRGNGGGRGNRRK 213 >sp|P10978.1|POLX_TOBAC RecName: Full=Retrovirus-related Pol polyprotein from transposon TNT 1-94; Includes: RecName: Full=Protease; Includes: RecName: Full=Reverse transcriptase; Includes: RecName: Full=Endonuclease gi|20045|emb|CAA32025.1| unnamed protein product [Nicotiana tabacum] Length = 1328 Score = 139 bits (350), Expect = 3e-30 Identities = 73/202 (36%), Positives = 125/202 (61%) Frame = -2 Query: 683 MSIQKTETDKFDGKSDFVMWRRKMKAVLVQNKTAPAICSPDKYPESWKGEVLSEKLGDAH 504 MS K E KF+G + F W+R+M+ +L+Q + K P++ K E ++ A Sbjct: 1 MSGVKYEVAKFNGDNGFSTWQRRMRDLLIQQGLHKVLDVDSKKPDTMKAEDWADLDERAA 60 Query: 503 SCLTLHLTDNVLREIDEKDNAFEIWTKLEKLYLGKSLSNKIYLKEQLFGFKMSGSKSLSE 324 S + LHL+D+V+ I ++D A IWT+LE LY+ 
K+L+NK+YLK+QL+ MS + Sbjct: 61 SAIRLHLSDDVVNNIIDEDTARGIWTRLESLYMSKTLTNKLYLKKQLYALHMSEGTNFLS 120 Query: 323 NIDDLNKIVLELQNVGEKISDENVAVILLNSLPDSFNDVKSAIKYGRDDLTLADVKNALK 144 +++ N ++ +L N+G KI +E+ A++LLNSLP S++++ + I +G+ + L DV +AL Sbjct: 121 HLNVFNGLITQLANLGVKIEEEDKAILLLNSLPSSYDNLATTILHGKTTIELKDVTSALL 180 Query: 143 SKDLDLRKENKSNGENLYVRGR 78 + +RK+ ++ G+ L GR Sbjct: 181 LNE-KMRKKPENQGQALITEGR 201 >ref|XP_003613324.1| Cc-nbs-lrr resistance protein [Medicago truncatula] gi|355514659|gb|AES96282.1| Cc-nbs-lrr resistance protein [Medicago truncatula] Length = 1104 Score = 137 bits (346), Expect = 8e-30 Identities = 78/210 (37%), Positives = 128/210 (60%), Gaps = 7/210 (3%) Frame = -2 Query: 671 KTETDKFDGKSDFVMWRRKMKAVLVQNKTAPAICSPDKYPESWKGEVLSEKLGDAHSCLT 492 K + +KF G +DF +W+ KM+AVL+Q K A+ P + +E + A S + Sbjct: 4 KWDIEKFTGDNDFGLWKVKMEAVLIQQKCEKALKGEGSLPVTMSQAEKTEMVDKARSAVV 63 Query: 491 LHLTDNVLREIDEKDNAFEIWTKLEKLYLGKSLSNKIYLKEQLFGFKMSGSKSLSENIDD 312 L L D VLRE+ ++ A IW KLE LY+ KSL+++ +LK+QL+ F+M SK++ E + + Sbjct: 64 LCLGDKVLREVAKEATAASIWAKLESLYMTKSLAHRQFLKQQLYSFRMVESKAIMEQLTE 123 Query: 311 LNKIVLELQNVGEKISDENVAVILLNSLPDSFNDVKSAIKYGRD-DLTLADVKNALKSKD 135 NKI+ +L+N+ ++ DE+ A++LL +LP SF K + YG++ +TL +V+ AL++K+ Sbjct: 124 FNKILDDLENIEVQLEDEDKAILLLCALPKSFESFKDTMLYGKEGTVTLEEVQAALRTKE 183 Query: 134 LDLRKE-NKSNGENLYVR-----GRVDRRE 63 L K+ G+ L V GR +RR+ Sbjct: 184 LTKSKDLTHEYGDGLSVTRGNGGGRGNRRK 213 >gb|AAD23690.1| putative retroelement pol polyprotein [Arabidopsis thaliana] Length = 1333 Score = 137 bits (344), Expect = 1e-29 Identities = 79/226 (34%), Positives = 130/226 (57%), Gaps = 20/226 (8%) Frame = -2 Query: 683 MSIQKTETDKFDGKSDFVMWRRKMKAVL---------------VQNKTAPAICSPDKYPE 549 MS + E +KFDG+ D+ MW+ K+ A L V+ + ++ E Sbjct: 1 MSAARIEVEKFDGRGDYTMWKEKLMAHLDILGLSVALKEEDDLVEKVAEMQLTEEEEKEE 60 Query: 548 SWKGEVLSEKLGDAHSCLTLHLTDNVLREIDEKDNAFEIWTKLEKLYLGKSLSNKIYLKE 369 + E+L EK A S + L +TD VLR+I ++ +A + L+KLY+ K+L N+IY K+ Sbjct: 61 
VLRRELLEEKRRKARSAIVLSVTDRVLRKIKKEQSAAAMLGVLDKLYMSKALPNRIYQKQ 120 Query: 368 QLFGFKMSGSKSLSENIDDLNKIVLELQNVGEKISDENVAVILLNSLPDSFNDVKSAIKY 189 +L+ FKMS + S+ NID+ +I+ +L+N +SDE+ A++LL SLP F+ ++ +KY Sbjct: 121 KLYSFKMSENLSIEGNIDEFLRIIADLENTNVLVSDEDQAILLLMSLPKPFDQLRDTLKY 180 Query: 188 --GRDDLTLADVKNALKSKDLDL---RKENKSNGENLYVRGRVDRR 66 GR L+L +V A+ SK+L+L +K K E L+V+ + + R Sbjct: 181 GLGRVTLSLDEVVAAIYSKELELGSNKKSIKGQAEGLFVKEKTETR 226 >gb|ABO36622.1| copia LTR rider [Solanum lycopersicum] gi|133711819|gb|ABO36636.1| copia LTR rider [Solanum lycopersicum] Length = 1307 Score = 136 bits (342), Expect = 2e-29 Identities = 77/201 (38%), Positives = 125/201 (62%), Gaps = 7/201 (3%) Frame = -2 Query: 683 MSIQKTETDKFDGKSDFVMWRRKMKAVLVQ-------NKTAPAICSPDKYPESWKGEVLS 525 MS + DKF G++ F +W+ KM+A+L Q +K A+ +P+ +L Sbjct: 1 MSALNVKIDKFTGRNSFSLWQIKMRALLKQQGFWAPLSKDKNAVVTPEM-------AILE 53 Query: 524 EKLGDAHSCLTLHLTDNVLREIDEKDNAFEIWTKLEKLYLGKSLSNKIYLKEQLFGFKMS 345 EK AHS + L L D+V+ E+ +++ A +W KLE LY+ KSL+NK+ LK++LFG +M+ Sbjct: 54 EK---AHSTIMLCLADDVITEVSDEETAAGLWLKLESLYMTKSLTNKLLLKQRLFGLRMA 110 Query: 344 GSKSLSENIDDLNKIVLELQNVGEKISDENVAVILLNSLPDSFNDVKSAIKYGRDDLTLA 165 L E+++ LN ++LEL+N+ KI DE+ A+ILL SLP SF + + G+D ++L Sbjct: 111 EGTQLREHLEQLNTLLLELRNIDVKIEDEDAALILLVSLPMSFENFVQSFIVGKDTVSLE 170 Query: 164 DVKNALKSKDLDLRKENKSNG 102 +V++AL S++L +K+NG Sbjct: 171 EVRSALHSREL----RHKANG 187 >gb|AAD32759.1| putative retroelement pol polyprotein [Arabidopsis thaliana] Length = 1356 Score = 135 bits (340), Expect = 4e-29 Identities = 82/226 (36%), Positives = 130/226 (57%), Gaps = 20/226 (8%) Frame = -2 Query: 683 MSIQKTETDKFDGKSDFVMWRRKMKAVL--------------VQNKTAPAICSPDKYPES 546 MS + E +KFDG+ D+ MW+ K+ A + K + S + Y E Sbjct: 1 MSTARIEVEKFDGRGDYTMWKEKLLAHMDILGLNTALKESESTGEKKSVLDESDEDYEEK 60 Query: 545 W-KGEVLSEKLGDAHSCLTLHLTDNVLREIDEKDNAFEIWTKLEKLYLGKSLSNKIYLKE 369 K E L EK A S + L +TD VLR+I ++ A + L+KLY+ K+L N+IY K+ Sbjct: 61 LEKFEALEEKKKKARSAIVLSVTDRVLRKIKKESTAAAMLLALDKLYMSKALPNRIYPKQ 120 Query: 
368 QLFGFKMSGSKSLSENIDDLNKIVLELQNVGEKISDENVAVILLNSLPDSFNDVKSAIKY 189 +L+ FKMS + S+ NID+ +I+ +L+N+ ISDE+ A++LL +LP +F+ +K +KY Sbjct: 121 KLYSFKMSENLSVEGNIDEFLQIITDLENMNVIISDEDQAILLLTALPKAFDQLKDTLKY 180 Query: 188 --GRDDLTLADVKNALKSKDLDL---RKENKSNGENLYVRGRVDRR 66 G+ LTL +V A+ SK+L+L +K K E LYV+ + + + Sbjct: 181 SSGKSILTLDEVAAAIYSKELELGSVKKSIKVQAEGLYVKDKNENK 226 >emb|CAJ09951.2| putative gag-pol polyprotein [Citrus sinensis] Length = 1334 Score = 135 bits (339), Expect = 5e-29 Identities = 83/206 (40%), Positives = 120/206 (58%), Gaps = 6/206 (2%) Frame = -2 Query: 683 MSIQKTETDKFDGKSDFVMWRRKMKAVLVQNKTAPAICSPDKYPESWKGEVLSEKLGD-- 510 MS+ + E +KF DF +W+ KMKA+LV A+ D E+ G + +K Sbjct: 1 MSLPRHEIEKFTIGGDFSLWKLKMKALLVHQGLESALDEEDL--EASTGSGIDDKRRQIQ 58 Query: 509 --AHSCLTLHLTDNVLREIDEKDNAFEIWTKLEKLYLGKSLSNKIYLKEQLFGFKMSGSK 336 AHS L L L D++LREI E+ A IW K+E L + KSL+++++LK++L+ F M Sbjct: 59 NRAHSTLILSLGDSILREISEEKTALGIWNKVETLCMKKSLAHRLFLKKRLYTFSMREGV 118 Query: 335 SLSENIDDLNKIVLELQNV-GEKISDENVAVILLNSLPDSFNDVKSAIKYGRDDLTLADV 159 ++ ++ID NKI+L+L+ V KI DE+ A LL+SLP S+ + YGR LTL DV Sbjct: 119 TIQDHIDTFNKIILDLEGVENVKICDEDKAFFLLSSLPKSYEGFVDTMLYGRTTLTLEDV 178 Query: 158 KNALKSKDLDLRKE-NKSNGENLYVR 84 K +L SK++ E SNGE L R Sbjct: 179 KASLSSKEIQKNCELETSNGEGLMAR 204 >gb|AAD23679.1| putative retroelement pol polyprotein [Arabidopsis thaliana] Length = 838 Score = 135 bits (339), Expect = 5e-29 Identities = 77/226 (34%), Positives = 136/226 (60%), Gaps = 19/226 (8%) Frame = -2 Query: 683 MSIQKTETDKFDGKSDFVMWRRKMKAVL--------------VQNKTAPAICSPDKYPES 546 M+ +E +K DG+ D+V+W+ K+ A + ++ + + A D Sbjct: 1 MTSGHSEVEKLDGEGDYVLWKEKLLAHIELLGLLEGLEEDEAIEEEESTA--ETDSLLTK 58 Query: 545 WKGEVLSEKLGDAHSCLTLHLTDNVLREIDEKDNAFEIWTKLEKLYLGKSLSNKIYLKEQ 366 + +VL EK G A S + L L ++VLR++ ++ A + L+KL++ KSL N+IYLK++ Sbjct: 59 TEDKVLKEKRGKARSTVILSLGNHVLRKVIKEKTAAGMIRVLDKLFMAKSLPNRIYLKQR 118 Query: 365 LFGFKMSGSKSLSENIDDLNKIVLELQNVGEKISDENVAVILLNSLPDSFNDVKSAIKYG 186 L+G+KMS S ++ EN++D K++ +L+NV + DE+ A++LL SLP F+ +K 
+KYG Sbjct: 119 LYGYKMSDSMTIEENVNDFFKLISDLENVKVSVPDEDQAIVLLMSLPKQFDQLKDTLKYG 178 Query: 185 RDDLTLADVKNALKSKDLDL---RKENKSNGENLYV--RGRVDRRE 63 + L L ++ A++SK L+L K K++ + L+V RGR ++R+ Sbjct: 179 KTTLALDEITGAIRSKVLELGASGKMLKNSSDALFVQDRGRSEKRD 224 >dbj|BAB09923.1| copia-like retrotransposable element [Arabidopsis thaliana] Length = 1342 Score = 134 bits (337), Expect = 9e-29 Identities = 78/231 (33%), Positives = 137/231 (59%), Gaps = 25/231 (10%) Frame = -2 Query: 683 MSIQKTETDKFDGKSDFVMWRRKMKA----------------VLVQNKTAPAICSPDKYP 552 MS + E +KFDG D+++W+ K+ A +V++ T ++ P Sbjct: 1 MSSGRAEVEKFDGDGDYILWKEKLLAHMEMLGLLEGLGEEEEAVVEDSTTEISDGGNQDP 60 Query: 551 ES----WKGEVLSEKLGDAHSCLTLHLTDNVLREIDEKDNAFEIWTKLEKLYLGKSLSNK 384 E+ + ++L EK G A S + L L +NVLR++ ++ A + L++L++ KSL N+ Sbjct: 61 ETATSKLEDKILKEKRGKARSTIILSLGNNVLRKVIKQKTAAGMIKVLDQLFMAKSLPNR 120 Query: 383 IYLKEQLFGFKMSGSKSLSENIDDLNKIVLELQNVGEKISDENVAVILLNSLPDSFNDVK 204 IYLK++L+G+KMS + ++ EN++D K++ +L+NV + DE+ A++LL SLP F+ +K Sbjct: 121 IYLKQRLYGYKMSENMTMEENVNDFFKLISDLENVKVVVPDEDQAIVLLMSLPRQFDQLK 180 Query: 203 SAIKYGRDDLTLADVKNALKSKDLDL---RKENKSNGENLYV--RGRVDRR 66 +KY + L L ++ +A++SK L+L K K+N + L+V RGR + R Sbjct: 181 ETLKYCKTTLHLEEITSAIRSKILELGASGKLLKNNSDGLFVQDRGRSETR 231 >emb|CAB75481.1| copia-like polyprotein [Arabidopsis thaliana] Length = 1363 Score = 134 bits (336), Expect = 1e-28 Identities = 80/226 (35%), Positives = 131/226 (57%), Gaps = 20/226 (8%) Frame = -2 Query: 683 MSIQKTETDKFDGKSDFVMWRRKM---------KAVLVQNKTAPAI------CSPDKYPE 549 MS + E +KFDG+ D+ MW+ K+ AVL +++T D+ E Sbjct: 1 MSGARIEVEKFDGRGDYTMWKEKLLAHIDMLGLSAVLRESETPMGKERDSEKSDEDEKEE 60 Query: 548 SWKGEVLSEKLGDAHSCLTLHLTDNVLREIDEKDNAFEIWTKLEKLYLGKSLSNKIYLKE 369 K E EK A S + L ++D VLR+I ++ +A + L++LY+ K+L N+IYLK+ Sbjct: 61 REKMEAFEEKKRKARSTIVLSVSDRVLRKIKKETSAAAMLEALDRLYMSKALPNRIYLKQ 120 Query: 368 QLFGFKMSGSKSLSENIDDLNKIVLELQNVGEKISDENVAVILLNSLPDSFNDVKSAIKY 189 +L+ FKMS + S+ NID+ IV +L+N+ +SDE+ A++LL SLP F+ +K +KY Sbjct: 121 
KLYSFKMSENLSIEGNIDEFLHIVADLENLNVLVSDEDQAILLLMSLPKPFDQLKDTLKY 180 Query: 188 --GRDDLTLADVKNALKSKDLD---LRKENKSNGENLYVRGRVDRR 66 G+ L+L +V A+ S++L+ ++K K E LYV+ + + R Sbjct: 181 SSGKTVLSLDEVAAAIYSRELEFGSVKKSIKGQAEGLYVKDKAENR 226 >ref|XP_004165166.1| PREDICTED: uncharacterized protein LOC101225243 [Cucumis sativus] Length = 158 Score = 130 bits (328), Expect = 1e-27 Identities = 64/153 (41%), Positives = 98/153 (64%) Frame = -2 Query: 683 MSIQKTETDKFDGKSDFVMWRRKMKAVLVQNKTAPAICSPDKYPESWKGEVLSEKLGDAH 504 M+I + E KFDGK DF +W+ K+KA+L Q K A+ P + P E A+ Sbjct: 1 MAIARVEIKKFDGKGDFALWKAKIKALLGQQKAHKALLDPLELPTILTATQKEEIKLIAY 60 Query: 503 SCLTLHLTDNVLREIDEKDNAFEIWTKLEKLYLGKSLSNKIYLKEQLFGFKMSGSKSLSE 324 L L+++DN++R++ E++ A ++W KLE LY K L NKI L+E++F +KM SK+L+E Sbjct: 61 GTLILNISDNIIRQVLEEETAHKVWKKLESLYATKDLPNKICLREKIFTYKMDSSKTLTE 120 Query: 323 NIDDLNKIVLELQNVGEKISDENVAVILLNSLP 225 N+D+ KIV +++ +K+ DEN A +LLN LP Sbjct: 121 NLDEFKKIVSNFKSLEDKLDDENEAFVLLNFLP 153