BLASTX nr result

ID: Catharanthus22_contig00039421 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00039421
         (940 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006586558.1| PREDICTED: uncharacterized protein LOC102661...   135   2e-29
emb|CAN65229.1| hypothetical protein VITISV_011708 [Vitis vinifera]   123   9e-26
emb|CAN82073.1| hypothetical protein VITISV_036538 [Vitis vinifera]   107   5e-21
emb|CAN83378.1| hypothetical protein VITISV_011333 [Vitis vinifera]    99   3e-18
gb|AAD19784.1| putative retroelement pol polyprotein [Arabidopsi...    98   5e-18
emb|CAN67762.1| hypothetical protein VITISV_040650 [Vitis vinifera]    96   2e-17
gb|AAG50751.1|AC079733_19 polyprotein, putative [Arabidopsis tha...    95   3e-17
dbj|BAA97287.1| retroelement pol polyprotein-like [Arabidopsis t...    94   6e-17
ref|XP_006299524.1| hypothetical protein CARUB_v10015696mg, part...    94   7e-17
dbj|BAB08885.1| retroelement pol polyprotein-like [Arabidopsis t...    93   2e-16
dbj|BAB10837.1| retroelement pol polyprotein-like [Arabidopsis t...    93   2e-16
gb|AAT71979.1| At5g39185 [Arabidopsis thaliana]                        93   2e-16
gb|AAC67205.1| putative retroelement pol polyprotein [Arabidopsi...    93   2e-16
ref|XP_006480040.1| PREDICTED: uncharacterized protein LOC102624...    92   4e-16
ref|XP_006419099.1| hypothetical protein EUTSA_v10003107mg [Eutr...    92   4e-16
gb|AAG09097.1|AC009323_8 Putative retroelement polyprotein [Arab...    91   5e-16
dbj|BAA97099.1| retroelement pol polyprotein-like [Arabidopsis t...    91   5e-16
gb|AAG51258.1|AC025782_3 Ty1/copia-element polyprotein [Arabidop...    90   1e-15
ref|XP_006392205.1| hypothetical protein EUTSA_v10023972mg, part...    86   3e-14
ref|XP_006397294.1| hypothetical protein EUTSA_v10029485mg [Eutr...    84   8e-14

>ref|XP_006586558.1| PREDICTED: uncharacterized protein LOC102661920 [Glycine max]
          Length = 516

 Score =  135 bits (341), Expect = 2e-29
 Identities = 79/206 (38%), Positives = 112/206 (54%), Gaps = 18/206 (8%)
 Frame = -1

Query: 565 NDD*VNEYLQKDA--YSLASSDSPHLIIT*VQLREDNYDEWVKAMXXXXXXXXXXXFVEG 392
           N D    +L+K    Y L SSD+P  IIT VQL+ +NYDEW +A+           FV+G
Sbjct: 16  NKDESGSHLKKQISPYDLYSSDNPGNIITQVQLKGENYDEWARAVRGSLRARRKFRFVDG 75

Query: 391 QIPKPESGTTEEEDWWTINTMDTEHY*AQSENYYVLHRTMR*TLV-----------*KSI 245
            I KP+    E +DWWT+N+M        S  +  +   +R T+             K  
Sbjct: 76  SIKKPDDAAPEIDDWWTVNSM------IVSWIFNTIEPKLRSTITYRENAQELWDDIKQR 129

Query: 244 FWLETAPRKHELKMARVNCKESGTSVSAYFAKLKKIWDGLSNYQQLPNCT-----CGMIN 80
           F +   PR  +LK    NCK++G S+  YF +LKK+WD L+++ Q+P CT     CG+  
Sbjct: 130 FSISNGPRIQQLKSELANCKQNGDSIVTYFGRLKKLWDELNDFDQIPMCTCNGCKCGISA 189

Query: 79  ETIKQREEDKVHQFLIGLDDTVYGTV 2
              K+REE+K+HQFL+GLDDT + TV
Sbjct: 190 ALNKKREEEKLHQFLMGLDDTQFRTV 215


>emb|CAN65229.1| hypothetical protein VITISV_011708 [Vitis vinifera]
          Length = 1149

 Score =  123 bits (309), Expect = 9e-26
 Identities = 71/180 (39%), Positives = 96/180 (53%), Gaps = 5/180 (2%)
 Frame = -1

Query: 526 YSLASSDSPHLIIT*VQLREDNYDEWVKAMXXXXXXXXXXXFVEGQIPKPESGTTEEEDW 347
           YSL S+D+   IIT VQLR +NYDEW +AM           F +G I +P     E E+W
Sbjct: 15  YSLNSNDNSGNIITQVQLRGENYDEWARAMWTALRAKKKYGFXDGXIKQPVENAQEIENW 74

Query: 346 WTINTMDTEHY*AQSENYYVLHRTMR*TLV*KSIFWLETAPRKHELKMARVNCKESGTSV 167
           WTIN+M+                           F +   PR  +L++   NCK++G  +
Sbjct: 75  WTINSMER--------------------------FSIGNGPRVQQLRLDLANCKQNGQVI 108

Query: 166 SAYFAKLKKIWDGLSNYQQLPNCTC--GMINETI---KQREEDKVHQFLIGLDDTVYGTV 2
             Y+ KLK IWD L+NY ++P C C     N TI   K+REE++VHQFL+GLD+  YGTV
Sbjct: 109 VTYYGKLKMIWDELNNYDKMPVCNCVGCKCNLTIVLEKKREEERVHQFLMGLDEEGYGTV 168


>emb|CAN82073.1| hypothetical protein VITISV_036538 [Vitis vinifera]
          Length = 1157

 Score =  107 bits (268), Expect = 5e-21
 Identities = 66/180 (36%), Positives = 93/180 (51%), Gaps = 5/180 (2%)
 Frame = -1

Query: 526 YSLASSDSPHLIIT*VQLREDNYDEWVKAMXXXXXXXXXXXFVEGQIPKPESGTTEEEDW 347
           Y+L S+D+P  IIT VQL+                      FV+G I +P++ + E EDW
Sbjct: 5   YALTSNDNPGNIITQVQLKA-------------LRAKKKYGFVDGSIKQPDNDSPELEDW 51

Query: 346 WTINTMDTEHY*AQSENYYVLHRTMR*TLV*KSIFWLETAPRKHELKMARVNCKESGTSV 167
           WTIN+M            + L   +      K  F +   PR  +LK   VNCK+ G  +
Sbjct: 52  WTINSMLVS---------WELWEEI------KQQFSIGNGPRVQQLKSYLVNCKQEGQGI 96

Query: 166 SAYFAKLKKIWDGLSNYQQLPNCT-----CGMINETIKQREEDKVHQFLIGLDDTVYGTV 2
             Y+ KLK +WD L+NY  +P CT     C +  +  K+REE++VHQFL+GLD+  YGTV
Sbjct: 97  IVYYGKLKSLWDELNNYDSIPVCTCTRCKCKITTQLEKKREEERVHQFLMGLDEDGYGTV 156


>emb|CAN83378.1| hypothetical protein VITISV_011333 [Vitis vinifera]
          Length = 758

 Score = 98.6 bits (244), Expect = 3e-18
 Identities = 57/178 (32%), Positives = 89/178 (50%), Gaps = 5/178 (2%)
 Frame = -1

Query: 526 YSLASSDSPHLIIT*VQLREDNYDEWVKAMXXXXXXXXXXXFVEGQIPKPESGTTEEEDW 347
           Y+L S+++P  IIT VQL+ DNYDEW +A+           FV+G I + ++ +++ EDW
Sbjct: 5   YALTSNNNPANIITQVQLKCDNYDEWARAVHTILLAEKIYGFVDGSIKQLDNDSSKLEDW 64

Query: 346 WTINTMDTEHY*AQSENYYVLHRTMR*TLV*KSIFWLETAPRKHELKMARVNCKESGTSV 167
           WT+N+M                           + WL        L+      K  G  +
Sbjct: 65  WTVNSM--------------------------LVSWLFNTIEPI-LRSTISYMKNEGQGI 97

Query: 166 SAYFAKLKKIWDGLSNYQQLPNCT-----CGMINETIKQREEDKVHQFLIGLDDTVYG 8
             Y+ +L+ +WD L+NY  +P CT     C +  +  K+ EE++VHQFL+GLD+  YG
Sbjct: 98  VVYYGRLESLWDKLNNYDSIPVCTCTGCKCNITTQLEKKGEEERVHQFLMGLDEDGYG 155


>gb|AAD19784.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
          Length = 1501

 Score = 97.8 bits (242), Expect = 5e-18
 Identities = 55/194 (28%), Positives = 97/194 (50%), Gaps = 16/194 (8%)
 Frame = -1

Query: 541 LQKDAYSLASSDSPHLIIT*VQLREDNYDEWVKAMXXXXXXXXXXXFVEGQIPKPESGTT 362
           L    Y+LASSD+P  +I+ V+L  DNY++W   M           F+ G IP+P     
Sbjct: 27  LMVSPYTLASSDNPGAVISSVELNGDNYNQWATEMLNALQAKRKTGFINGTIPRPPPNDP 86

Query: 361 EEEDWWTINTMDTEHY*AQSE-----------NYYVLHRTMR*TLV*KSIFWLETAPRKH 215
             E+W  +N+M         E           + ++L + +      K  F +    R H
Sbjct: 87  NYENWTAVNSMIVGWIRTSIEPKVKATVTFISDAHLLWKDL------KQRFSVGNKVRIH 140

Query: 214 ELKMARVNCKESGTSVSAYFAKLKKIWDGLSNYQQLPNCTCGMI-----NETIKQREEDK 50
           +++    +C++ G +V  Y+ +L  +W+  + Y+ +  CTCG+      +E  K+REE+K
Sbjct: 141 QIRAQLSSCRQDGQAVIEYYGRLSNLWEEYNIYKPVTVCTCGLCRCGATSEPTKEREEEK 200

Query: 49  VHQFLIGLDDTVYG 8
           +HQF++GLD++ +G
Sbjct: 201 IHQFVLGLDESRFG 214


>emb|CAN67762.1| hypothetical protein VITISV_040650 [Vitis vinifera]
          Length = 1316

 Score = 96.3 bits (238), Expect = 2e-17
 Identities = 41/84 (48%), Positives = 59/84 (70%)
 Frame = -1

Query: 253 KSIFWLETAPRKHELKMARVNCKESGTSVSAYFAKLKKIWDGLSNYQQLPNCTCGMINET 74
           K  + +  APR H+L+   VN K+ G +V+AY+AK+K +WD L+ Y ++P CTCG     
Sbjct: 6   KERYAVGNAPRVHQLRSEIVNLKQEGMTVAAYYAKIKGMWDELNQYIEIPECTCGAAQAI 65

Query: 73  IKQREEDKVHQFLIGLDDTVYGTV 2
           +K RE++K HQFL+GLDDT +GTV
Sbjct: 66  VKSREDEKAHQFLMGLDDTTFGTV 89


>gb|AAG50751.1|AC079733_19 polyprotein, putative [Arabidopsis thaliana]
          Length = 1468

 Score = 95.1 bits (235), Expect = 3e-17
 Identities = 55/185 (29%), Positives = 93/185 (50%), Gaps = 10/185 (5%)
 Frame = -1

Query: 526 YSLASSDSPHLIIT*VQLREDNYDEWVKAMXXXXXXXXXXXFVEGQIPKPESGTTEEEDW 347
           Y L ++D+   +I+   L+ +NY+EW               F++G IP+P  G+ + EDW
Sbjct: 21  YDLTAADNSGAVISHPILKTNNYEEWACGFKTALRSRKKFGFLDGTIPQPLDGSPDLEDW 80

Query: 346 WTINTMDTEHY*AQSENYY---VLHRTMR*TL--V*KSIFWLETAPRKHELKMARVNCKE 182
            TIN +         ++     + HR +   L    +  F +   P+  ++K     CK+
Sbjct: 81  LTINALLVSWMKMTIDSELLTNISHRDVARDLWEQIRKRFSVSNGPKNQKMKADLATCKQ 140

Query: 181 SGTSVSAYFAKLKKIWDGLSNYQQLPNCTCG-----MINETIKQREEDKVHQFLIGLDDT 17
            G +V  Y+ KL KIWD +++Y+ L  C CG     +  +  K RE+D VHQ+L GL++T
Sbjct: 141 EGMTVEGYYGKLNKIWDNINSYRPLRICKCGRCICNLGTDQEKYREDDMVHQYLYGLNET 200

Query: 16  VYGTV 2
            + T+
Sbjct: 201 KFHTI 205


>dbj|BAA97287.1| retroelement pol polyprotein-like [Arabidopsis thaliana]
          Length = 1491

 Score = 94.4 bits (233), Expect = 6e-17
 Identities = 57/183 (31%), Positives = 88/183 (48%), Gaps = 10/183 (5%)
 Frame = -1

Query: 526 YSLASSDSPHLIIT*VQLREDNYDEWVKAMXXXXXXXXXXXFVEGQIPKPESGTTEEEDW 347
           Y+LASSD+P  +I+ V L  DNY+EW   M           F+ G I KP     + E+W
Sbjct: 27  YTLASSDNPGAMISSVMLTGDNYNEWSTEMLNALQAKRKTGFINGSISKPPLDNPDYENW 86

Query: 346 WTINTMDTEHY*AQSE-----NYYVLHRTMR*TLV*KSIFWLETAPRKHELKMARVNCKE 182
             +N+M      A  E         +    +     K  F +    R H++K     C++
Sbjct: 87  QAVNSMIVGWIRASIEPKVKSTVTFISDAHQLWSELKQRFSVGNKVRVHQIKAQLAACRQ 146

Query: 181 SGTSVSAYFAKLKKIWDGLSNYQQLPNCTCGMIN-----ETIKQREEDKVHQFLIGLDDT 17
            G  V  Y+ +L K+W+    Y+ +  C CG+       E  K+REE+K+HQF++GLDD+
Sbjct: 147 DGQPVIDYYGRLCKLWEEFQIYKPITVCKCGLCTCGATLEPSKEREEEKIHQFVLGLDDS 206

Query: 16  VYG 8
            +G
Sbjct: 207 RFG 209


>ref|XP_006299524.1| hypothetical protein CARUB_v10015696mg, partial [Capsella rubella]
           gi|482568233|gb|EOA32422.1| hypothetical protein
           CARUB_v10015696mg, partial [Capsella rubella]
          Length = 322

 Score = 94.0 bits (232), Expect = 7e-17
 Identities = 56/179 (31%), Positives = 88/179 (49%), Gaps = 5/179 (2%)
 Frame = -1

Query: 529 AYSLASSDSPHLIIT*VQLREDNYDEWVKAMXXXXXXXXXXXFVEGQIPKPESGTTEEED 350
           AY LA++++P  II  V    DN+DEW + +           FV+G + +P     E ED
Sbjct: 18  AYQLAANENPGAIIAHVHFNGDNFDEWAQTVRTALRVKKKFGFVDGSVTEPNKEEAEYED 77

Query: 349 WWTINTMDTEHY*AQSENYYVLHRTMR*TLV*KSIFWLETAPRKHELKMARVNCKESGTS 170
           W +  +M   +    +E +  +          K  F     PR  E+K     C++    
Sbjct: 78  WVSAKSMTLGNKEDPAELWKEI----------KDRFCEGNGPRIQEIKAELALCRQGYMR 127

Query: 169 VSAYFAKLKKIWDGLSNYQQLPNCTCG----MINETI-KQREEDKVHQFLIGLDDTVYG 8
           V  Y+ KL+ +W+ LSNY+    C CG     IN  + K++EED++H FL+GLD+ V+G
Sbjct: 128 VIDYYGKLQVLWEDLSNYETPVVCNCGGCTCEINAKLEKKKEEDRIHHFLLGLDEAVFG 186


>dbj|BAB08885.1| retroelement pol polyprotein-like [Arabidopsis thaliana]
          Length = 370

 Score = 92.8 bits (229), Expect = 2e-16
 Identities = 55/180 (30%), Positives = 93/180 (51%), Gaps = 12/180 (6%)
 Frame = -1

Query: 526 YSLASSDSPHLIIT*VQLREDNYDEWVKAMXXXXXXXXXXXFVEGQIPKPESGTTEEEDW 347
           Y+L++SD+P  +IT V L  DNY+EW + M           F++G I KP S + + E+W
Sbjct: 30  YTLSNSDNPGTLITSVVLNGDNYNEWSEEMLNALQAKRKTGFIDGTIQKPASDSPDFENW 89

Query: 346 WTINTMDTEHY*AQSE-----------NYYVLHRTMR*TLV*KSIFWLETAPRKHELKMA 200
            T+N+M         E           + ++L   +R        F +    R H++K  
Sbjct: 90  KTVNSMIVGWIRVSIEPKVKSTVTFISDAHLLWDELR------QRFSVTNNVRVHQIKAQ 143

Query: 199 RVNCKESGTSVSAYFAKLKKIWDGLSNYQQLPNCTCG-MINETIKQREEDKVHQFLIGLD 23
             +C++ G +V  Y+ +L  +WD L NYQ    C  G ++   +K+R+++K+HQF++GLD
Sbjct: 144 LASCRQEGQTVIDYYGRLCNLWDELKNYQASAVCPHGSVLTAIVKERDDEKLHQFVLGLD 203


>dbj|BAB10837.1| retroelement pol polyprotein-like [Arabidopsis thaliana]
          Length = 1462

 Score = 92.8 bits (229), Expect = 2e-16
 Identities = 57/181 (31%), Positives = 88/181 (48%), Gaps = 6/181 (3%)
 Frame = -1

Query: 526 YSLASSDSPHLIIT*VQLREDNYDEWVKAMXXXXXXXXXXXFVEGQIPKPESGTTEEEDW 347
           Y L S D+P  +I+   LR  NYDEW   +           F +G IP+P+    + +DW
Sbjct: 24  YDLTSGDNPGTLISKPLLRGPNYDEWATNLRLALKARKKFGFADGTIPQPDETNPDFDDW 83

Query: 346 WTINTMD------TEHY*AQSENYYVLHRTMR*TLV*KSIFWLETAPRKHELKMARVNCK 185
              N +       T H    +   ++       T + K  F ++   R   LK     C+
Sbjct: 84  IANNALVVSWMKLTIHESLATSMSHLDDSHDMWTHIQKR-FGVKNGQRIQRLKTELATCR 142

Query: 184 ESGTSVSAYFAKLKKIWDGLSNYQQLPNCTCGMINETIKQREEDKVHQFLIGLDDTVYGT 5
           + GT +  Y+ KL ++W  L++YQQ        + E  K+REEDK+HQFL+GLD+++YG 
Sbjct: 143 QKGTPIETYYGKLSQLWRSLADYQQAKT-----MEEVRKEREEDKLHQFLMGLDESMYGA 197

Query: 4   V 2
           V
Sbjct: 198 V 198


>gb|AAT71979.1| At5g39185 [Arabidopsis thaliana]
          Length = 348

 Score = 92.8 bits (229), Expect = 2e-16
 Identities = 57/181 (31%), Positives = 88/181 (48%), Gaps = 6/181 (3%)
 Frame = -1

Query: 526 YSLASSDSPHLIIT*VQLREDNYDEWVKAMXXXXXXXXXXXFVEGQIPKPESGTTEEEDW 347
           Y L S D+P  +I+   LR  NYDEW   +           F +G IP+P+    + +DW
Sbjct: 24  YDLTSGDNPGTLISKPLLRGPNYDEWATNLRLALKARKKFGFADGTIPQPDETNPDFDDW 83

Query: 346 WTINTMD------TEHY*AQSENYYVLHRTMR*TLV*KSIFWLETAPRKHELKMARVNCK 185
              N +       T H    +   ++       T + K  F ++   R   LK     C+
Sbjct: 84  IANNALVVSWMKLTIHESLATSMSHLDDSHDMWTHIQKR-FGVKNGQRIQRLKTELATCR 142

Query: 184 ESGTSVSAYFAKLKKIWDGLSNYQQLPNCTCGMINETIKQREEDKVHQFLIGLDDTVYGT 5
           + GT +  Y+ KL ++W  L++YQQ        + E  K+REEDK+HQFL+GLD+++YG 
Sbjct: 143 QKGTPIETYYGKLSQLWRSLADYQQAKT-----MEEVRKEREEDKLHQFLMGLDESMYGA 197

Query: 4   V 2
           V
Sbjct: 198 V 198


>gb|AAC67205.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
          Length = 1413

 Score = 92.8 bits (229), Expect = 2e-16
 Identities = 58/186 (31%), Positives = 87/186 (46%), Gaps = 13/186 (6%)
 Frame = -1

Query: 526 YSLASSDSPHLIIT*VQLREDNYDEWVKAMXXXXXXXXXXXFVEGQIPKPESGTTEEEDW 347
           Y+LASSD+P  +I+ V L  DNY+EW   M           F+ G I KP     + E+W
Sbjct: 27  YTLASSDNPGAMISSVMLTGDNYNEWSTKMLNALQAKRKTGFINGSISKPPLDNPDYENW 86

Query: 346 WTINTMDTEHY*AQSENYYVLHRTMR*TLV*KSIFWLETAPR--------KHELKMARVN 191
             +N+M      A  E       T    +      W E   R         H++K     
Sbjct: 87  QAVNSMIVGWIRASIEPKVKSTVTF---ICDAHQLWSELKQRFSVGNKVHVHQIKTQLAA 143

Query: 190 CKESGTSVSAYFAKLKKIWDGLSNYQQLPNCTCGMIN-----ETIKQREEDKVHQFLIGL 26
           C++ G  V  Y+ +L K+W+    Y+ +  C CG+       E  K+REE+K+HQF++GL
Sbjct: 144 CRQDGQPVIDYYGRLCKLWEEFQIYKPITVCKCGLCTCGATLEPSKEREEEKIHQFVLGL 203

Query: 25  DDTVYG 8
           DD+ +G
Sbjct: 204 DDSRFG 209


>ref|XP_006480040.1| PREDICTED: uncharacterized protein LOC102624694 isoform X1 [Citrus
           sinensis] gi|568852764|ref|XP_006480041.1| PREDICTED:
           uncharacterized protein LOC102624694 isoform X2 [Citrus
           sinensis] gi|568852766|ref|XP_006480042.1| PREDICTED:
           uncharacterized protein LOC102624694 isoform X3 [Citrus
           sinensis]
          Length = 320

 Score = 91.7 bits (226), Expect = 4e-16
 Identities = 49/136 (36%), Positives = 76/136 (55%), Gaps = 16/136 (11%)
 Frame = -1

Query: 361 EEEDWWTINTMDTEHY*AQSENYYVLHRTMR*TLV*KSI-----------FWLETAPRKH 215
           E +DWWT+N+M        S     +  T+R T+    +           F +   PR H
Sbjct: 9   ELDDWWTVNSMIV------SWILNTIEPTLRSTITHMEVAKKLWDDIKERFSVGNGPRVH 62

Query: 214 ELKMARVNCKESGTSVSAYFAKLKKIWDGLSNYQQLPNCTCG-----MINETIKQREEDK 50
           +LK     CK+ G ++ +Y+ KLK IW+ L+NY+Q P C+CG     +  +  K+ EE++
Sbjct: 63  QLKSELAECKQRGMTILSYYGKLKLIWEELANYEQYPICSCGGCTCELEAKLNKKCEEER 122

Query: 49  VHQFLIGLDDTVYGTV 2
           +HQFL+GLDDT+YG+V
Sbjct: 123 LHQFLMGLDDTIYGSV 138


>ref|XP_006419099.1| hypothetical protein EUTSA_v10003107mg [Eutrema salsugineum]
           gi|557097027|gb|ESQ37535.1| hypothetical protein
           EUTSA_v10003107mg [Eutrema salsugineum]
          Length = 189

 Score = 91.7 bits (226), Expect = 4e-16
 Identities = 57/174 (32%), Positives = 90/174 (51%), Gaps = 1/174 (0%)
 Frame = -1

Query: 520 LASSDSPHLIIT*VQLREDNYDEWVKAMXXXXXXXXXXXFVEGQIPKPESGTTEEEDWWT 341
           L  SD P  +IT VQL+ +NY++W K +           F++G + KP +   E E W  
Sbjct: 3   LHPSDRPGDLITTVQLKGENYEDWAKHVRNALRTKRKLGFIDGTLMKPTTAK-ELEQWEV 61

Query: 340 INTMDTEHY*AQSENYYVLHRTMR*TLV*KSIFWLETAPRKHELKMARVNCKESGTSVSA 161
           +N+++      +SE               K  F     PR  EL+    NC+++G SV  
Sbjct: 62  VNSIEGAM--GRSEL--------------KLTFSAGNVPRISELRADIANCRQNGDSVMV 105

Query: 160 YFAKLKKIWDGLSNYQQLPNCTCGMINETIKQ-REEDKVHQFLIGLDDTVYGTV 2
           YF KLKK+WD L+ Y+ +  C+CG +   +++ +EE++ + FL GLD   +GTV
Sbjct: 106 YFGKLKKMWDELAIYKPIRTCSCGELKAQLEEDQEEERTNTFLTGLDAERFGTV 159


>gb|AAG09097.1|AC009323_8 Putative retroelement polyprotein [Arabidopsis thaliana]
          Length = 1486

 Score = 91.3 bits (225), Expect = 5e-16
 Identities = 56/188 (29%), Positives = 85/188 (45%), Gaps = 13/188 (6%)
 Frame = -1

Query: 526 YSLASSDSPHLIIT*VQLREDNYDEWVKAMXXXXXXXXXXXFVEGQIPKPESGTTEEEDW 347
           Y L S D+P  +I+   LR  NYDEW   +           F +G IP+P     + EDW
Sbjct: 25  YDLTSGDNPGTLISKPLLRGPNYDEWATNLRLALKARKKFGFADGSIPQPVETDPDFEDW 84

Query: 346 WTIN-------------TMDTEHY*AQSENYYVLHRTMR*TLV*KSIFWLETAPRKHELK 206
              N             T+ T        +    H   R        F ++   R   LK
Sbjct: 85  TANNALVVSWMKLTIDETVSTSMSHLDDSHELWTHIQKR--------FGVKNGQRVQRLK 136

Query: 205 MARVNCKESGTSVSAYFAKLKKIWDGLSNYQQLPNCTCGMINETIKQREEDKVHQFLIGL 26
                C++ G ++  Y+ +L ++W  L++YQQ        +++  K+REEDK+HQFL+GL
Sbjct: 137 TELATCRQKGVAIETYYGRLSQLWRSLADYQQAKT-----MDDVRKEREEDKLHQFLMGL 191

Query: 25  DDTVYGTV 2
           D++VYG V
Sbjct: 192 DESVYGAV 199


>dbj|BAA97099.1| retroelement pol polyprotein-like [Arabidopsis thaliana]
          Length = 1098

 Score = 91.3 bits (225), Expect = 5e-16
 Identities = 61/182 (33%), Positives = 87/182 (47%), Gaps = 13/182 (7%)
 Frame = -1

Query: 526 YSLASSDSPHLIIT*VQLREDNYDEWVKAMXXXXXXXXXXXFVEGQIPKPESGTTEE--E 353
           Y + +SD+P  +I+ V L+EDNY EW + +           F++G IPKP   TTE    
Sbjct: 14  YGITASDNPGALISSVILKEDNYSEWAEELMNSLQAKQKLGFLDGTIPKP---TTEPALS 70

Query: 352 DWWTINTMDTEHY*AQSENYYVLHRTMR*TLV*-----------KSIFWLETAPRKHELK 206
            W   N+M              +  T+R T+             K  F      RK  LK
Sbjct: 71  SWKAANSMIIGWIRTS------IDPTIRSTVAFVSDAKDLWDSLKQRFSNGNGVRKQLLK 124

Query: 205 MARVNCKESGTSVSAYFAKLKKIWDGLSNYQQLPNCTCGMINETIKQREEDKVHQFLIGL 26
              + CK+ G SV  Y+ +L K+W+ L NY+    CTC    +  K+RE+DKVHQFL+ L
Sbjct: 125 DEILACKQDGQSVLVYYGRLTKLWEELQNYKTSRTCTCEAAPDIAKEREDDKVHQFLLNL 184

Query: 25  DD 20
           D+
Sbjct: 185 DE 186


>gb|AAG51258.1|AC025782_3 Ty1/copia-element polyprotein [Arabidopsis thaliana]
          Length = 1152

 Score = 89.7 bits (221), Expect = 1e-15
 Identities = 57/210 (27%), Positives = 97/210 (46%), Gaps = 16/210 (7%)
 Frame = -1

Query: 586 VHTSTSDNDD*VNEYLQKDAYSLASSDSPHLIIT*VQLREDNYDEWVKAMXXXXXXXXXX 407
           V  ST+       + +    Y L  SD PH ++T + L  +NY+ W K            
Sbjct: 5   VDGSTATTASSEKDAISASPYYLHPSDHPHHVLTPMLLNGENYERWAKLTRNNLQAKQKL 64

Query: 406 XFVEGQIPKPESGTTEEEDWWTINTM-----------DTEHY*AQSENYYVLHRTMR*TL 260
            F++G + KP S + +   W   N+M             +   +  +N  V+  ++R   
Sbjct: 65  GFIDGTLTKPSSDSPDYPRWLQTNSMLVGWLYASLDPQVQKSISVVDNARVMWESLR--- 121

Query: 259 V*KSIFWLETAPRKHELKMARVNCKESGTSVSAYFAKLKKIWDGLSNYQQLPNCTCGMIN 80
              + + +  A R H+LK   V C++ G + + YF KLK +WD L +Y+ L  C C   +
Sbjct: 122 ---TRYSVGNASRVHQLKYDIVACRQDGQTAANYFGKLKVMWDDLDDYEPLLTCCCNRPS 178

Query: 79  ET-----IKQREEDKVHQFLIGLDDTVYGT 5
            T      ++R+ +++HQFL+GLD   +GT
Sbjct: 179 CTHRVRQSQRRDHERIHQFLMGLDAAKFGT 208


>ref|XP_006392205.1| hypothetical protein EUTSA_v10023972mg, partial [Eutrema
           salsugineum] gi|557088711|gb|ESQ29491.1| hypothetical
           protein EUTSA_v10023972mg, partial [Eutrema salsugineum]
          Length = 198

 Score = 85.5 bits (210), Expect = 3e-14
 Identities = 52/191 (27%), Positives = 92/191 (48%), Gaps = 16/191 (8%)
 Frame = -1

Query: 547 EYLQKDAYSLASSDSPHLIIT*VQLREDNYDEWVKAMXXXXXXXXXXXFVEGQIPKPESG 368
           E +    Y L+SSD PH ++T + L  DNY+ W K             F++G + KP + 
Sbjct: 12  ETISSSPYYLSSSDHPHHVLTPMLLNGDNYEMWAKLARNNLVAKHKLGFIDGSLSKPSAE 71

Query: 367 TTEEEDWWTINTM-----------DTEHY*AQSENYYVLHRTMR*TLV*KSIFWLETAPR 221
           + + + W   N+M             +   +  +N   L   +      K+ + +  A R
Sbjct: 72  SNDYQRWIQTNSMLVGWLYASLDPKVQKVISFVDNAKALWDNL------KTRYSIGNASR 125

Query: 220 KHELKMARVNCKESGTSVSAYFAKLKKIWDGLSNYQQLPNC-----TCGMINETIKQREE 56
            H++K A + C + G  V+ YF KLK +WD L +++ L +C     TC    + +++R+ 
Sbjct: 126 VHQIKAAILACMQDGQEVADYFGKLKVMWDDLDDFEPLIDCCCSNATCPQRVKQVQRRDL 185

Query: 55  DKVHQFLIGLD 23
           +++HQFL+ LD
Sbjct: 186 ERIHQFLMRLD 196


>ref|XP_006397294.1| hypothetical protein EUTSA_v10029485mg [Eutrema salsugineum]
           gi|557098311|gb|ESQ38747.1| hypothetical protein
           EUTSA_v10029485mg [Eutrema salsugineum]
          Length = 196

 Score = 84.0 bits (206), Expect = 8e-14
 Identities = 57/178 (32%), Positives = 91/178 (51%), Gaps = 8/178 (4%)
 Frame = -1

Query: 511 SDSPHLIIT*VQLREDNYDEWVKAMXXXXXXXXXXXFVEGQIPKPESGTTEEEDWWTINT 332
           SD P  +IT +QLR +NY++W K +           F+EG +PKP +   E E W  +N+
Sbjct: 6   SDRPGDLITTMQLRGENYEDWAKHVRNALRTKRKLGFIEGTLPKP-TAPKELEQWEVVNS 64

Query: 331 MDTEHY*AQSENYYVLHRTMR*TLV*KSI-------FWLETAPRKHELKMARVNCKESGT 173
           M         E+   L  T+      K +       F +   P+  EL+    NC+++G 
Sbjct: 65  MLVAWIMNTIESN--LKTTISMVDEAKELWDDLKLQFLVGNGPQISELRADIANCRQNGD 122

Query: 172 SVSAYFAKLKKIWDGLSNYQQLPNCTCGMINETIKQ-REEDKVHQFLIGLDDTVYGTV 2
           S+  YF KL K+WD L+ Y+ +  C+CG +   +++  EE++ + FL GLD   +GTV
Sbjct: 123 SIMVYFEKL-KMWDELAVYKPIRTCSCGELRAQLEEDLEEERTNTFLTGLDAERFGTV 179


Top