BLASTX nr result
ID: Catharanthus22_contig00039421
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00039421 (940 letters)

Database: ./nr
37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

ref|XP_006586558.1| PREDICTED: uncharacterized protein LOC102661...  135   2e-29
emb|CAN65229.1| hypothetical protein VITISV_011708 [Vitis vinifera]  123   9e-26
emb|CAN82073.1| hypothetical protein VITISV_036538 [Vitis vinifera]  107   5e-21
emb|CAN83378.1| hypothetical protein VITISV_011333 [Vitis vinifera]   99   3e-18
gb|AAD19784.1| putative retroelement pol polyprotein [Arabidopsi...   98   5e-18
emb|CAN67762.1| hypothetical protein VITISV_040650 [Vitis vinifera]   96   2e-17
gb|AAG50751.1|AC079733_19 polyprotein, putative [Arabidopsis tha...   95   3e-17
dbj|BAA97287.1| retroelement pol polyprotein-like [Arabidopsis t...   94   6e-17
ref|XP_006299524.1| hypothetical protein CARUB_v10015696mg, part...   94   7e-17
dbj|BAB08885.1| retroelement pol polyprotein-like [Arabidopsis t...   93   2e-16
dbj|BAB10837.1| retroelement pol polyprotein-like [Arabidopsis t...   93   2e-16
gb|AAT71979.1| At5g39185 [Arabidopsis thaliana]                       93   2e-16
gb|AAC67205.1| putative retroelement pol polyprotein [Arabidopsi...   93   2e-16
ref|XP_006480040.1| PREDICTED: uncharacterized protein LOC102624...   92   4e-16
ref|XP_006419099.1| hypothetical protein EUTSA_v10003107mg [Eutr...   92   4e-16
gb|AAG09097.1|AC009323_8 Putative retroelement polyprotein [Arab...   91   5e-16
dbj|BAA97099.1| retroelement pol polyprotein-like [Arabidopsis t...   91   5e-16
gb|AAG51258.1|AC025782_3 Ty1/copia-element polyprotein [Arabidop...   90   1e-15
ref|XP_006392205.1| hypothetical protein EUTSA_v10023972mg, part...
86 3e-14 ref|XP_006397294.1| hypothetical protein EUTSA_v10029485mg [Eutr... 84 8e-14 >ref|XP_006586558.1| PREDICTED: uncharacterized protein LOC102661920 [Glycine max] Length = 516 Score = 135 bits (341), Expect = 2e-29 Identities = 79/206 (38%), Positives = 112/206 (54%), Gaps = 18/206 (8%) Frame = -1 Query: 565 NDD*VNEYLQKDA--YSLASSDSPHLIIT*VQLREDNYDEWVKAMXXXXXXXXXXXFVEG 392 N D +L+K Y L SSD+P IIT VQL+ +NYDEW +A+ FV+G Sbjct: 16 NKDESGSHLKKQISPYDLYSSDNPGNIITQVQLKGENYDEWARAVRGSLRARRKFRFVDG 75 Query: 391 QIPKPESGTTEEEDWWTINTMDTEHY*AQSENYYVLHRTMR*TLV-----------*KSI 245 I KP+ E +DWWT+N+M S + + +R T+ K Sbjct: 76 SIKKPDDAAPEIDDWWTVNSM------IVSWIFNTIEPKLRSTITYRENAQELWDDIKQR 129 Query: 244 FWLETAPRKHELKMARVNCKESGTSVSAYFAKLKKIWDGLSNYQQLPNCT-----CGMIN 80 F + PR +LK NCK++G S+ YF +LKK+WD L+++ Q+P CT CG+ Sbjct: 130 FSISNGPRIQQLKSELANCKQNGDSIVTYFGRLKKLWDELNDFDQIPMCTCNGCKCGISA 189 Query: 79 ETIKQREEDKVHQFLIGLDDTVYGTV 2 K+REE+K+HQFL+GLDDT + TV Sbjct: 190 ALNKKREEEKLHQFLMGLDDTQFRTV 215 >emb|CAN65229.1| hypothetical protein VITISV_011708 [Vitis vinifera] Length = 1149 Score = 123 bits (309), Expect = 9e-26 Identities = 71/180 (39%), Positives = 96/180 (53%), Gaps = 5/180 (2%) Frame = -1 Query: 526 YSLASSDSPHLIIT*VQLREDNYDEWVKAMXXXXXXXXXXXFVEGQIPKPESGTTEEEDW 347 YSL S+D+ IIT VQLR +NYDEW +AM F +G I +P E E+W Sbjct: 15 YSLNSNDNSGNIITQVQLRGENYDEWARAMWTALRAKKKYGFXDGXIKQPVENAQEIENW 74 Query: 346 WTINTMDTEHY*AQSENYYVLHRTMR*TLV*KSIFWLETAPRKHELKMARVNCKESGTSV 167 WTIN+M+ F + PR +L++ NCK++G + Sbjct: 75 WTINSMER--------------------------FSIGNGPRVQQLRLDLANCKQNGQVI 108 Query: 166 SAYFAKLKKIWDGLSNYQQLPNCTC--GMINETI---KQREEDKVHQFLIGLDDTVYGTV 2 Y+ KLK IWD L+NY ++P C C N TI K+REE++VHQFL+GLD+ YGTV Sbjct: 109 VTYYGKLKMIWDELNNYDKMPVCNCVGCKCNLTIVLEKKREEERVHQFLMGLDEEGYGTV 168 >emb|CAN82073.1| hypothetical protein VITISV_036538 [Vitis vinifera] Length = 1157 Score = 107 bits (268), Expect = 5e-21 Identities = 66/180 (36%), Positives = 93/180 (51%), Gaps = 5/180 (2%) Frame = -1 Query: 526 
YSLASSDSPHLIIT*VQLREDNYDEWVKAMXXXXXXXXXXXFVEGQIPKPESGTTEEEDW 347 Y+L S+D+P IIT VQL+ FV+G I +P++ + E EDW Sbjct: 5 YALTSNDNPGNIITQVQLKA-------------LRAKKKYGFVDGSIKQPDNDSPELEDW 51 Query: 346 WTINTMDTEHY*AQSENYYVLHRTMR*TLV*KSIFWLETAPRKHELKMARVNCKESGTSV 167 WTIN+M + L + K F + PR +LK VNCK+ G + Sbjct: 52 WTINSMLVS---------WELWEEI------KQQFSIGNGPRVQQLKSYLVNCKQEGQGI 96 Query: 166 SAYFAKLKKIWDGLSNYQQLPNCT-----CGMINETIKQREEDKVHQFLIGLDDTVYGTV 2 Y+ KLK +WD L+NY +P CT C + + K+REE++VHQFL+GLD+ YGTV Sbjct: 97 IVYYGKLKSLWDELNNYDSIPVCTCTRCKCKITTQLEKKREEERVHQFLMGLDEDGYGTV 156 >emb|CAN83378.1| hypothetical protein VITISV_011333 [Vitis vinifera] Length = 758 Score = 98.6 bits (244), Expect = 3e-18 Identities = 57/178 (32%), Positives = 89/178 (50%), Gaps = 5/178 (2%) Frame = -1 Query: 526 YSLASSDSPHLIIT*VQLREDNYDEWVKAMXXXXXXXXXXXFVEGQIPKPESGTTEEEDW 347 Y+L S+++P IIT VQL+ DNYDEW +A+ FV+G I + ++ +++ EDW Sbjct: 5 YALTSNNNPANIITQVQLKCDNYDEWARAVHTILLAEKIYGFVDGSIKQLDNDSSKLEDW 64 Query: 346 WTINTMDTEHY*AQSENYYVLHRTMR*TLV*KSIFWLETAPRKHELKMARVNCKESGTSV 167 WT+N+M + WL L+ K G + Sbjct: 65 WTVNSM--------------------------LVSWLFNTIEPI-LRSTISYMKNEGQGI 97 Query: 166 SAYFAKLKKIWDGLSNYQQLPNCT-----CGMINETIKQREEDKVHQFLIGLDDTVYG 8 Y+ +L+ +WD L+NY +P CT C + + K+ EE++VHQFL+GLD+ YG Sbjct: 98 VVYYGRLESLWDKLNNYDSIPVCTCTGCKCNITTQLEKKGEEERVHQFLMGLDEDGYG 155 >gb|AAD19784.1| putative retroelement pol polyprotein [Arabidopsis thaliana] Length = 1501 Score = 97.8 bits (242), Expect = 5e-18 Identities = 55/194 (28%), Positives = 97/194 (50%), Gaps = 16/194 (8%) Frame = -1 Query: 541 LQKDAYSLASSDSPHLIIT*VQLREDNYDEWVKAMXXXXXXXXXXXFVEGQIPKPESGTT 362 L Y+LASSD+P +I+ V+L DNY++W M F+ G IP+P Sbjct: 27 LMVSPYTLASSDNPGAVISSVELNGDNYNQWATEMLNALQAKRKTGFINGTIPRPPPNDP 86 Query: 361 EEEDWWTINTMDTEHY*AQSE-----------NYYVLHRTMR*TLV*KSIFWLETAPRKH 215 E+W +N+M E + ++L + + K F + R H Sbjct: 87 NYENWTAVNSMIVGWIRTSIEPKVKATVTFISDAHLLWKDL------KQRFSVGNKVRIH 140 Query: 214 ELKMARVNCKESGTSVSAYFAKLKKIWDGLSNYQQLPNCTCGMI-----NETIKQREEDK 50 +++ +C++ 
G +V Y+ +L +W+ + Y+ + CTCG+ +E K+REE+K Sbjct: 141 QIRAQLSSCRQDGQAVIEYYGRLSNLWEEYNIYKPVTVCTCGLCRCGATSEPTKEREEEK 200 Query: 49 VHQFLIGLDDTVYG 8 +HQF++GLD++ +G Sbjct: 201 IHQFVLGLDESRFG 214 >emb|CAN67762.1| hypothetical protein VITISV_040650 [Vitis vinifera] Length = 1316 Score = 96.3 bits (238), Expect = 2e-17 Identities = 41/84 (48%), Positives = 59/84 (70%) Frame = -1 Query: 253 KSIFWLETAPRKHELKMARVNCKESGTSVSAYFAKLKKIWDGLSNYQQLPNCTCGMINET 74 K + + APR H+L+ VN K+ G +V+AY+AK+K +WD L+ Y ++P CTCG Sbjct: 6 KERYAVGNAPRVHQLRSEIVNLKQEGMTVAAYYAKIKGMWDELNQYIEIPECTCGAAQAI 65 Query: 73 IKQREEDKVHQFLIGLDDTVYGTV 2 +K RE++K HQFL+GLDDT +GTV Sbjct: 66 VKSREDEKAHQFLMGLDDTTFGTV 89 >gb|AAG50751.1|AC079733_19 polyprotein, putative [Arabidopsis thaliana] Length = 1468 Score = 95.1 bits (235), Expect = 3e-17 Identities = 55/185 (29%), Positives = 93/185 (50%), Gaps = 10/185 (5%) Frame = -1 Query: 526 YSLASSDSPHLIIT*VQLREDNYDEWVKAMXXXXXXXXXXXFVEGQIPKPESGTTEEEDW 347 Y L ++D+ +I+ L+ +NY+EW F++G IP+P G+ + EDW Sbjct: 21 YDLTAADNSGAVISHPILKTNNYEEWACGFKTALRSRKKFGFLDGTIPQPLDGSPDLEDW 80 Query: 346 WTINTMDTEHY*AQSENYY---VLHRTMR*TL--V*KSIFWLETAPRKHELKMARVNCKE 182 TIN + ++ + HR + L + F + P+ ++K CK+ Sbjct: 81 LTINALLVSWMKMTIDSELLTNISHRDVARDLWEQIRKRFSVSNGPKNQKMKADLATCKQ 140 Query: 181 SGTSVSAYFAKLKKIWDGLSNYQQLPNCTCG-----MINETIKQREEDKVHQFLIGLDDT 17 G +V Y+ KL KIWD +++Y+ L C CG + + K RE+D VHQ+L GL++T Sbjct: 141 EGMTVEGYYGKLNKIWDNINSYRPLRICKCGRCICNLGTDQEKYREDDMVHQYLYGLNET 200 Query: 16 VYGTV 2 + T+ Sbjct: 201 KFHTI 205 >dbj|BAA97287.1| retroelement pol polyprotein-like [Arabidopsis thaliana] Length = 1491 Score = 94.4 bits (233), Expect = 6e-17 Identities = 57/183 (31%), Positives = 88/183 (48%), Gaps = 10/183 (5%) Frame = -1 Query: 526 YSLASSDSPHLIIT*VQLREDNYDEWVKAMXXXXXXXXXXXFVEGQIPKPESGTTEEEDW 347 Y+LASSD+P +I+ V L DNY+EW M F+ G I KP + E+W Sbjct: 27 YTLASSDNPGAMISSVMLTGDNYNEWSTEMLNALQAKRKTGFINGSISKPPLDNPDYENW 86 Query: 346 WTINTMDTEHY*AQSE-----NYYVLHRTMR*TLV*KSIFWLETAPRKHELKMARVNCKE 182 +N+M A E + + K 
F + R H++K C++ Sbjct: 87 QAVNSMIVGWIRASIEPKVKSTVTFISDAHQLWSELKQRFSVGNKVRVHQIKAQLAACRQ 146 Query: 181 SGTSVSAYFAKLKKIWDGLSNYQQLPNCTCGMIN-----ETIKQREEDKVHQFLIGLDDT 17 G V Y+ +L K+W+ Y+ + C CG+ E K+REE+K+HQF++GLDD+ Sbjct: 147 DGQPVIDYYGRLCKLWEEFQIYKPITVCKCGLCTCGATLEPSKEREEEKIHQFVLGLDDS 206 Query: 16 VYG 8 +G Sbjct: 207 RFG 209 >ref|XP_006299524.1| hypothetical protein CARUB_v10015696mg, partial [Capsella rubella] gi|482568233|gb|EOA32422.1| hypothetical protein CARUB_v10015696mg, partial [Capsella rubella] Length = 322 Score = 94.0 bits (232), Expect = 7e-17 Identities = 56/179 (31%), Positives = 88/179 (49%), Gaps = 5/179 (2%) Frame = -1 Query: 529 AYSLASSDSPHLIIT*VQLREDNYDEWVKAMXXXXXXXXXXXFVEGQIPKPESGTTEEED 350 AY LA++++P II V DN+DEW + + FV+G + +P E ED Sbjct: 18 AYQLAANENPGAIIAHVHFNGDNFDEWAQTVRTALRVKKKFGFVDGSVTEPNKEEAEYED 77 Query: 349 WWTINTMDTEHY*AQSENYYVLHRTMR*TLV*KSIFWLETAPRKHELKMARVNCKESGTS 170 W + +M + +E + + K F PR E+K C++ Sbjct: 78 WVSAKSMTLGNKEDPAELWKEI----------KDRFCEGNGPRIQEIKAELALCRQGYMR 127 Query: 169 VSAYFAKLKKIWDGLSNYQQLPNCTCG----MINETI-KQREEDKVHQFLIGLDDTVYG 8 V Y+ KL+ +W+ LSNY+ C CG IN + K++EED++H FL+GLD+ V+G Sbjct: 128 VIDYYGKLQVLWEDLSNYETPVVCNCGGCTCEINAKLEKKKEEDRIHHFLLGLDEAVFG 186 >dbj|BAB08885.1| retroelement pol polyprotein-like [Arabidopsis thaliana] Length = 370 Score = 92.8 bits (229), Expect = 2e-16 Identities = 55/180 (30%), Positives = 93/180 (51%), Gaps = 12/180 (6%) Frame = -1 Query: 526 YSLASSDSPHLIIT*VQLREDNYDEWVKAMXXXXXXXXXXXFVEGQIPKPESGTTEEEDW 347 Y+L++SD+P +IT V L DNY+EW + M F++G I KP S + + E+W Sbjct: 30 YTLSNSDNPGTLITSVVLNGDNYNEWSEEMLNALQAKRKTGFIDGTIQKPASDSPDFENW 89 Query: 346 WTINTMDTEHY*AQSE-----------NYYVLHRTMR*TLV*KSIFWLETAPRKHELKMA 200 T+N+M E + ++L +R F + R H++K Sbjct: 90 KTVNSMIVGWIRVSIEPKVKSTVTFISDAHLLWDELR------QRFSVTNNVRVHQIKAQ 143 Query: 199 RVNCKESGTSVSAYFAKLKKIWDGLSNYQQLPNCTCG-MINETIKQREEDKVHQFLIGLD 23 +C++ G +V Y+ +L +WD L NYQ C G ++ +K+R+++K+HQF++GLD Sbjct: 144 
LASCRQEGQTVIDYYGRLCNLWDELKNYQASAVCPHGSVLTAIVKERDDEKLHQFVLGLD 203 >dbj|BAB10837.1| retroelement pol polyprotein-like [Arabidopsis thaliana] Length = 1462 Score = 92.8 bits (229), Expect = 2e-16 Identities = 57/181 (31%), Positives = 88/181 (48%), Gaps = 6/181 (3%) Frame = -1 Query: 526 YSLASSDSPHLIIT*VQLREDNYDEWVKAMXXXXXXXXXXXFVEGQIPKPESGTTEEEDW 347 Y L S D+P +I+ LR NYDEW + F +G IP+P+ + +DW Sbjct: 24 YDLTSGDNPGTLISKPLLRGPNYDEWATNLRLALKARKKFGFADGTIPQPDETNPDFDDW 83 Query: 346 WTINTMD------TEHY*AQSENYYVLHRTMR*TLV*KSIFWLETAPRKHELKMARVNCK 185 N + T H + ++ T + K F ++ R LK C+ Sbjct: 84 IANNALVVSWMKLTIHESLATSMSHLDDSHDMWTHIQKR-FGVKNGQRIQRLKTELATCR 142 Query: 184 ESGTSVSAYFAKLKKIWDGLSNYQQLPNCTCGMINETIKQREEDKVHQFLIGLDDTVYGT 5 + GT + Y+ KL ++W L++YQQ + E K+REEDK+HQFL+GLD+++YG Sbjct: 143 QKGTPIETYYGKLSQLWRSLADYQQAKT-----MEEVRKEREEDKLHQFLMGLDESMYGA 197 Query: 4 V 2 V Sbjct: 198 V 198 >gb|AAT71979.1| At5g39185 [Arabidopsis thaliana] Length = 348 Score = 92.8 bits (229), Expect = 2e-16 Identities = 57/181 (31%), Positives = 88/181 (48%), Gaps = 6/181 (3%) Frame = -1 Query: 526 YSLASSDSPHLIIT*VQLREDNYDEWVKAMXXXXXXXXXXXFVEGQIPKPESGTTEEEDW 347 Y L S D+P +I+ LR NYDEW + F +G IP+P+ + +DW Sbjct: 24 YDLTSGDNPGTLISKPLLRGPNYDEWATNLRLALKARKKFGFADGTIPQPDETNPDFDDW 83 Query: 346 WTINTMD------TEHY*AQSENYYVLHRTMR*TLV*KSIFWLETAPRKHELKMARVNCK 185 N + T H + ++ T + K F ++ R LK C+ Sbjct: 84 IANNALVVSWMKLTIHESLATSMSHLDDSHDMWTHIQKR-FGVKNGQRIQRLKTELATCR 142 Query: 184 ESGTSVSAYFAKLKKIWDGLSNYQQLPNCTCGMINETIKQREEDKVHQFLIGLDDTVYGT 5 + GT + Y+ KL ++W L++YQQ + E K+REEDK+HQFL+GLD+++YG Sbjct: 143 QKGTPIETYYGKLSQLWRSLADYQQAKT-----MEEVRKEREEDKLHQFLMGLDESMYGA 197 Query: 4 V 2 V Sbjct: 198 V 198 >gb|AAC67205.1| putative retroelement pol polyprotein [Arabidopsis thaliana] Length = 1413 Score = 92.8 bits (229), Expect = 2e-16 Identities = 58/186 (31%), Positives = 87/186 (46%), Gaps = 13/186 (6%) Frame = -1 Query: 526 YSLASSDSPHLIIT*VQLREDNYDEWVKAMXXXXXXXXXXXFVEGQIPKPESGTTEEEDW 347 Y+LASSD+P +I+ V L DNY+EW M F+ G I KP 
+ E+W Sbjct: 27 YTLASSDNPGAMISSVMLTGDNYNEWSTKMLNALQAKRKTGFINGSISKPPLDNPDYENW 86 Query: 346 WTINTMDTEHY*AQSENYYVLHRTMR*TLV*KSIFWLETAPR--------KHELKMARVN 191 +N+M A E T + W E R H++K Sbjct: 87 QAVNSMIVGWIRASIEPKVKSTVTF---ICDAHQLWSELKQRFSVGNKVHVHQIKTQLAA 143 Query: 190 CKESGTSVSAYFAKLKKIWDGLSNYQQLPNCTCGMIN-----ETIKQREEDKVHQFLIGL 26 C++ G V Y+ +L K+W+ Y+ + C CG+ E K+REE+K+HQF++GL Sbjct: 144 CRQDGQPVIDYYGRLCKLWEEFQIYKPITVCKCGLCTCGATLEPSKEREEEKIHQFVLGL 203 Query: 25 DDTVYG 8 DD+ +G Sbjct: 204 DDSRFG 209 >ref|XP_006480040.1| PREDICTED: uncharacterized protein LOC102624694 isoform X1 [Citrus sinensis] gi|568852764|ref|XP_006480041.1| PREDICTED: uncharacterized protein LOC102624694 isoform X2 [Citrus sinensis] gi|568852766|ref|XP_006480042.1| PREDICTED: uncharacterized protein LOC102624694 isoform X3 [Citrus sinensis] Length = 320 Score = 91.7 bits (226), Expect = 4e-16 Identities = 49/136 (36%), Positives = 76/136 (55%), Gaps = 16/136 (11%) Frame = -1 Query: 361 EEEDWWTINTMDTEHY*AQSENYYVLHRTMR*TLV*KSI-----------FWLETAPRKH 215 E +DWWT+N+M S + T+R T+ + F + PR H Sbjct: 9 ELDDWWTVNSMIV------SWILNTIEPTLRSTITHMEVAKKLWDDIKERFSVGNGPRVH 62 Query: 214 ELKMARVNCKESGTSVSAYFAKLKKIWDGLSNYQQLPNCTCG-----MINETIKQREEDK 50 +LK CK+ G ++ +Y+ KLK IW+ L+NY+Q P C+CG + + K+ EE++ Sbjct: 63 QLKSELAECKQRGMTILSYYGKLKLIWEELANYEQYPICSCGGCTCELEAKLNKKCEEER 122 Query: 49 VHQFLIGLDDTVYGTV 2 +HQFL+GLDDT+YG+V Sbjct: 123 LHQFLMGLDDTIYGSV 138 >ref|XP_006419099.1| hypothetical protein EUTSA_v10003107mg [Eutrema salsugineum] gi|557097027|gb|ESQ37535.1| hypothetical protein EUTSA_v10003107mg [Eutrema salsugineum] Length = 189 Score = 91.7 bits (226), Expect = 4e-16 Identities = 57/174 (32%), Positives = 90/174 (51%), Gaps = 1/174 (0%) Frame = -1 Query: 520 LASSDSPHLIIT*VQLREDNYDEWVKAMXXXXXXXXXXXFVEGQIPKPESGTTEEEDWWT 341 L SD P +IT VQL+ +NY++W K + F++G + KP + E E W Sbjct: 3 LHPSDRPGDLITTVQLKGENYEDWAKHVRNALRTKRKLGFIDGTLMKPTTAK-ELEQWEV 61 Query: 340 INTMDTEHY*AQSENYYVLHRTMR*TLV*KSIFWLETAPRKHELKMARVNCKESGTSVSA 161 
+N+++ +SE K F PR EL+ NC+++G SV Sbjct: 62 VNSIEGAM--GRSEL--------------KLTFSAGNVPRISELRADIANCRQNGDSVMV 105 Query: 160 YFAKLKKIWDGLSNYQQLPNCTCGMINETIKQ-REEDKVHQFLIGLDDTVYGTV 2 YF KLKK+WD L+ Y+ + C+CG + +++ +EE++ + FL GLD +GTV Sbjct: 106 YFGKLKKMWDELAIYKPIRTCSCGELKAQLEEDQEEERTNTFLTGLDAERFGTV 159 >gb|AAG09097.1|AC009323_8 Putative retroelement polyprotein [Arabidopsis thaliana] Length = 1486 Score = 91.3 bits (225), Expect = 5e-16 Identities = 56/188 (29%), Positives = 85/188 (45%), Gaps = 13/188 (6%) Frame = -1 Query: 526 YSLASSDSPHLIIT*VQLREDNYDEWVKAMXXXXXXXXXXXFVEGQIPKPESGTTEEEDW 347 Y L S D+P +I+ LR NYDEW + F +G IP+P + EDW Sbjct: 25 YDLTSGDNPGTLISKPLLRGPNYDEWATNLRLALKARKKFGFADGSIPQPVETDPDFEDW 84 Query: 346 WTIN-------------TMDTEHY*AQSENYYVLHRTMR*TLV*KSIFWLETAPRKHELK 206 N T+ T + H R F ++ R LK Sbjct: 85 TANNALVVSWMKLTIDETVSTSMSHLDDSHELWTHIQKR--------FGVKNGQRVQRLK 136 Query: 205 MARVNCKESGTSVSAYFAKLKKIWDGLSNYQQLPNCTCGMINETIKQREEDKVHQFLIGL 26 C++ G ++ Y+ +L ++W L++YQQ +++ K+REEDK+HQFL+GL Sbjct: 137 TELATCRQKGVAIETYYGRLSQLWRSLADYQQAKT-----MDDVRKEREEDKLHQFLMGL 191 Query: 25 DDTVYGTV 2 D++VYG V Sbjct: 192 DESVYGAV 199 >dbj|BAA97099.1| retroelement pol polyprotein-like [Arabidopsis thaliana] Length = 1098 Score = 91.3 bits (225), Expect = 5e-16 Identities = 61/182 (33%), Positives = 87/182 (47%), Gaps = 13/182 (7%) Frame = -1 Query: 526 YSLASSDSPHLIIT*VQLREDNYDEWVKAMXXXXXXXXXXXFVEGQIPKPESGTTEE--E 353 Y + +SD+P +I+ V L+EDNY EW + + F++G IPKP TTE Sbjct: 14 YGITASDNPGALISSVILKEDNYSEWAEELMNSLQAKQKLGFLDGTIPKP---TTEPALS 70 Query: 352 DWWTINTMDTEHY*AQSENYYVLHRTMR*TLV*-----------KSIFWLETAPRKHELK 206 W N+M + T+R T+ K F RK LK Sbjct: 71 SWKAANSMIIGWIRTS------IDPTIRSTVAFVSDAKDLWDSLKQRFSNGNGVRKQLLK 124 Query: 205 MARVNCKESGTSVSAYFAKLKKIWDGLSNYQQLPNCTCGMINETIKQREEDKVHQFLIGL 26 + CK+ G SV Y+ +L K+W+ L NY+ CTC + K+RE+DKVHQFL+ L Sbjct: 125 DEILACKQDGQSVLVYYGRLTKLWEELQNYKTSRTCTCEAAPDIAKEREDDKVHQFLLNL 184 Query: 25 DD 20 D+ Sbjct: 185 DE 186 >gb|AAG51258.1|AC025782_3 Ty1/copia-element 
polyprotein [Arabidopsis thaliana] Length = 1152 Score = 89.7 bits (221), Expect = 1e-15 Identities = 57/210 (27%), Positives = 97/210 (46%), Gaps = 16/210 (7%) Frame = -1 Query: 586 VHTSTSDNDD*VNEYLQKDAYSLASSDSPHLIIT*VQLREDNYDEWVKAMXXXXXXXXXX 407 V ST+ + + Y L SD PH ++T + L +NY+ W K Sbjct: 5 VDGSTATTASSEKDAISASPYYLHPSDHPHHVLTPMLLNGENYERWAKLTRNNLQAKQKL 64 Query: 406 XFVEGQIPKPESGTTEEEDWWTINTM-----------DTEHY*AQSENYYVLHRTMR*TL 260 F++G + KP S + + W N+M + + +N V+ ++R Sbjct: 65 GFIDGTLTKPSSDSPDYPRWLQTNSMLVGWLYASLDPQVQKSISVVDNARVMWESLR--- 121 Query: 259 V*KSIFWLETAPRKHELKMARVNCKESGTSVSAYFAKLKKIWDGLSNYQQLPNCTCGMIN 80 + + + A R H+LK V C++ G + + YF KLK +WD L +Y+ L C C + Sbjct: 122 ---TRYSVGNASRVHQLKYDIVACRQDGQTAANYFGKLKVMWDDLDDYEPLLTCCCNRPS 178 Query: 79 ET-----IKQREEDKVHQFLIGLDDTVYGT 5 T ++R+ +++HQFL+GLD +GT Sbjct: 179 CTHRVRQSQRRDHERIHQFLMGLDAAKFGT 208 >ref|XP_006392205.1| hypothetical protein EUTSA_v10023972mg, partial [Eutrema salsugineum] gi|557088711|gb|ESQ29491.1| hypothetical protein EUTSA_v10023972mg, partial [Eutrema salsugineum] Length = 198 Score = 85.5 bits (210), Expect = 3e-14 Identities = 52/191 (27%), Positives = 92/191 (48%), Gaps = 16/191 (8%) Frame = -1 Query: 547 EYLQKDAYSLASSDSPHLIIT*VQLREDNYDEWVKAMXXXXXXXXXXXFVEGQIPKPESG 368 E + Y L+SSD PH ++T + L DNY+ W K F++G + KP + Sbjct: 12 ETISSSPYYLSSSDHPHHVLTPMLLNGDNYEMWAKLARNNLVAKHKLGFIDGSLSKPSAE 71 Query: 367 TTEEEDWWTINTM-----------DTEHY*AQSENYYVLHRTMR*TLV*KSIFWLETAPR 221 + + + W N+M + + +N L + K+ + + A R Sbjct: 72 SNDYQRWIQTNSMLVGWLYASLDPKVQKVISFVDNAKALWDNL------KTRYSIGNASR 125 Query: 220 KHELKMARVNCKESGTSVSAYFAKLKKIWDGLSNYQQLPNC-----TCGMINETIKQREE 56 H++K A + C + G V+ YF KLK +WD L +++ L +C TC + +++R+ Sbjct: 126 VHQIKAAILACMQDGQEVADYFGKLKVMWDDLDDFEPLIDCCCSNATCPQRVKQVQRRDL 185 Query: 55 DKVHQFLIGLD 23 +++HQFL+ LD Sbjct: 186 ERIHQFLMRLD 196 >ref|XP_006397294.1| hypothetical protein EUTSA_v10029485mg [Eutrema salsugineum] gi|557098311|gb|ESQ38747.1| hypothetical protein EUTSA_v10029485mg [Eutrema 
salsugineum] Length = 196 Score = 84.0 bits (206), Expect = 8e-14 Identities = 57/178 (32%), Positives = 91/178 (51%), Gaps = 8/178 (4%) Frame = -1 Query: 511 SDSPHLIIT*VQLREDNYDEWVKAMXXXXXXXXXXXFVEGQIPKPESGTTEEEDWWTINT 332 SD P +IT +QLR +NY++W K + F+EG +PKP + E E W +N+ Sbjct: 6 SDRPGDLITTMQLRGENYEDWAKHVRNALRTKRKLGFIEGTLPKP-TAPKELEQWEVVNS 64 Query: 331 MDTEHY*AQSENYYVLHRTMR*TLV*KSI-------FWLETAPRKHELKMARVNCKESGT 173 M E+ L T+ K + F + P+ EL+ NC+++G Sbjct: 65 MLVAWIMNTIESN--LKTTISMVDEAKELWDDLKLQFLVGNGPQISELRADIANCRQNGD 122 Query: 172 SVSAYFAKLKKIWDGLSNYQQLPNCTCGMINETIKQ-REEDKVHQFLIGLDDTVYGTV 2 S+ YF KL K+WD L+ Y+ + C+CG + +++ EE++ + FL GLD +GTV Sbjct: 123 SIMVYFEKL-KMWDELAVYKPIRTCSCGELRAQLEEDLEEERTNTFLTGLDAERFGTV 179