BLASTX nr result

ID: Catharanthus22_contig00006775 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00006775
         (966 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006480040.1| PREDICTED: uncharacterized protein LOC102624...   137   6e-30
ref|XP_006586558.1| PREDICTED: uncharacterized protein LOC102661...   124   5e-26
emb|CAN82073.1| hypothetical protein VITISV_036538 [Vitis vinifera]   109   1e-21
emb|CAN65229.1| hypothetical protein VITISV_011708 [Vitis vinifera]   104   6e-20
dbj|BAA97099.1| retroelement pol polyprotein-like [Arabidopsis t...   103   8e-20
dbj|BAB08885.1| retroelement pol polyprotein-like [Arabidopsis t...   102   3e-19
emb|CAN76645.1| hypothetical protein VITISV_004685 [Vitis vinifera]   101   5e-19
gb|AAG50751.1|AC079733_19 polyprotein, putative [Arabidopsis tha...   100   1e-18
dbj|BAA97287.1| retroelement pol polyprotein-like [Arabidopsis t...    99   2e-18
emb|CAN74847.1| hypothetical protein VITISV_028741 [Vitis vinifera]    99   3e-18
emb|CAN67762.1| hypothetical protein VITISV_040650 [Vitis vinifera]    98   5e-18
emb|CAN80919.1| hypothetical protein VITISV_002640 [Vitis vinifera]    97   7e-18
gb|AAC67205.1| putative retroelement pol polyprotein [Arabidopsi...    96   2e-17
gb|AAB87099.1| putative retroelement pol polyprotein [Arabidopsi...    96   2e-17
ref|XP_006299226.1| hypothetical protein CARUB_v10015375mg, part...    89   3e-15
gb|AAT71979.1| At5g39185 [Arabidopsis thaliana]                        88   4e-15
gb|AAG09097.1|AC009323_8 Putative retroelement polyprotein [Arab...    86   2e-14
ref|XP_006418743.1| hypothetical protein EUTSA_v10002805mg, part...    78   6e-12
gb|AAD15368.1| putative retroelement pol polyprotein [Arabidopsi...    77   1e-11
emb|CAN64225.1| hypothetical protein VITISV_016222 [Vitis vinifera]    76   2e-11

>ref|XP_006480040.1| PREDICTED: uncharacterized protein LOC102624694 isoform X1 [Citrus
           sinensis] gi|568852764|ref|XP_006480041.1| PREDICTED:
           uncharacterized protein LOC102624694 isoform X2 [Citrus
           sinensis] gi|568852766|ref|XP_006480042.1| PREDICTED:
           uncharacterized protein LOC102624694 isoform X3 [Citrus
           sinensis]
          Length = 320

 Score =  137 bits (345), Expect = 6e-30
 Identities = 79/215 (36%), Positives = 119/215 (55%), Gaps = 9/215 (4%)
 Frame = -2

Query: 965 IES*ILNTIELSLQS*ITKGKIVKVTWDKLLQHFQTENGPRYYELKSAVMGCKQVNTT-- 792
           I S ILNTIE +L+S IT  ++ K  WD + + F   NGPR ++LKS +  CKQ   T  
Sbjct: 20  IVSWILNTIEPTLRSTITHMEVAKKLWDDIKERFSVGNGPRVHQLKSELAECKQRGMTIL 79

Query: 791 TYFAKLNKLWDELANYQRLSICDCAKPVLEL----TKQREKENLLQFCSDRITSSLVPCD 624
           +Y+ KL  +W+ELANY++  IC C     EL     K+ E+E L QF      +      
Sbjct: 80  SYYGKLKLIWEELANYEQYPICSCGGCTCELEAKLNKKCEEERLHQFLMGLDDTIYGSVR 139

Query: 623 QIFFGEDALPNMNQAYSKIIQQEQVKNMXXXXXXXXXXAFIFQPLLFG--GGFESEKKTN 450
                 D LP +N+AYS ++Q+E+V+ +              +P+ F   GG + + +  
Sbjct: 140 SNILSTDPLPPLNRAYSLVVQEERVQTITRGKEGRG------EPVAFAVQGGVKGQIEIR 193

Query: 449 -KSGLICTVCNKMGHEARSCFQVVGFPDWWLEKTR 348
            KS +IC  C K GH+A SCFQ++G+P+WW +++R
Sbjct: 194 EKSSVICKHCRKTGHDADSCFQLIGYPEWWGDRSR 228


>ref|XP_006586558.1| PREDICTED: uncharacterized protein LOC102661920 [Glycine max]
          Length = 516

 Score =  124 bits (311), Expect = 5e-26
 Identities = 79/217 (36%), Positives = 109/217 (50%), Gaps = 11/217 (5%)
 Frame = -2

Query: 965 IES*ILNTIELSLQS*ITKGKIVKVTWDKLLQHFQTENGPRYYELKSAVMGCKQ--VNTT 792
           I S I NTIE  L+S IT  +  +  WD + Q F   NGPR  +LKS +  CKQ   +  
Sbjct: 97  IVSWIFNTIEPKLRSTITYRENAQELWDDIKQRFSISNGPRIQQLKSELANCKQNGDSIV 156

Query: 791 TYFAKLNKLWDELANYQRLSICDC----AKPVLELTKQREKENLLQFCSDRITSSLVPCD 624
           TYF +L KLWDEL ++ ++ +C C          L K+RE+E L QF      +      
Sbjct: 157 TYFGRLKKLWDELNDFDQIPMCTCNGCKCGISAALNKKREEEKLHQFLMGLDDTQFRTVR 216

Query: 623 QIFFGEDALPNMNQAYSKIIQQEQVKNMXXXXXXXXXXAFIFQPLLF----GGGFESEKK 456
                 D LPN+N+AY  ++Q+E+V  M               P+ F    G     EKK
Sbjct: 217 SNVLSLDPLPNLNRAYQMVVQEERVGVMTRGKEERG------DPIAFAVKSGRTSSWEKK 270

Query: 455 TNK-SGLICTVCNKMGHEARSCFQVVGFPDWWLEKTR 348
            N  S   C+ C + GH+  SCFQ+VG+PDWW ++ R
Sbjct: 271 PNTGSEKPCSHCKRDGHDIDSCFQLVGYPDWWGDRPR 307


>emb|CAN82073.1| hypothetical protein VITISV_036538 [Vitis vinifera]
          Length = 1157

 Score =  109 bits (273), Expect = 1e-21
 Identities = 61/189 (32%), Positives = 90/189 (47%), Gaps = 9/189 (4%)
 Frame = -2

Query: 887 WDKLLQHFQTENGPRYYELKSAVMGCKQVNT--TTYFAKLNKLWDELANYQRLSICDCA- 717
           W+++ Q F   NGPR  +LKS ++ CKQ       Y+ KL  LWDEL NY  + +C C  
Sbjct: 64  WEEIKQQFSIGNGPRVQQLKSYLVNCKQEGQGIIVYYGKLKSLWDELNNYDSIPVCTCTR 123

Query: 716 ---KPVLELTKQREKENLLQFCSDRITSSLVPCDQIFFGEDALPNMNQAYSKIIQQEQVK 546
              K   +L K+RE+E + QF                   + L N+N+ Y+ I+QQE+V+
Sbjct: 124 CKCKITTQLEKKREEERVHQFLMGLDEDGYGTVRSNILSIEPLSNLNRVYAMIVQQERVR 183

Query: 545 NMXXXXXXXXXXAFIFQPLLFG---GGFESEKKTNKSGLICTVCNKMGHEARSCFQVVGF 375
            M               P+ F    GG           +IC+ C + GHE  SCFQ + +
Sbjct: 184 TMTRTKEERG------SPMSFAVQVGGRNPGGDGKDKTVICSNCKRKGHEVDSCFQRIAY 237

Query: 374 PDWWLEKTR 348
           P+WW ++ R
Sbjct: 238 PEWWGDRPR 246


>emb|CAN65229.1| hypothetical protein VITISV_011708 [Vitis vinifera]
          Length = 1149

 Score =  104 bits (259), Expect = 6e-20
 Identities = 53/182 (29%), Positives = 88/182 (48%), Gaps = 6/182 (3%)
 Frame = -2

Query: 875 LQHFQTENGPRYYELKSAVMGCKQVNTT--TYFAKLNKLWDELANYQRLSICDCAKPVLE 702
           ++ F   NGPR  +L+  +  CKQ      TY+ KL  +WDEL NY ++ +C+C      
Sbjct: 80  MERFSIGNGPRVQQLRLDLANCKQNGQVIVTYYGKLKMIWDELNNYDKMPVCNCVGCKCN 139

Query: 701 LT----KQREKENLLQFCSDRITSSLVPCDQIFFGEDALPNMNQAYSKIIQQEQVKNMXX 534
           LT    K+RE+E + QF                   + LPN+N+ Y+ ++QQE+++ M  
Sbjct: 140 LTIVLEKKREEERVHQFLMGLDEEGYGTVSSNILSTEPLPNLNRVYAMVVQQERMRTMTR 199

Query: 533 XXXXXXXXAFIFQPLLFGGGFESEKKTNKSGLICTVCNKMGHEARSCFQVVGFPDWWLEK 354
                         +   GG  S  +     ++CT C + GH+  +CFQ++G+ +WW  +
Sbjct: 200 TKEERGNLMSFAMKV---GGQNSRGEXKDRNVVCTNCKREGHDVDTCFQLIGYLEWWGNR 256

Query: 353 TR 348
            R
Sbjct: 257 XR 258


>dbj|BAA97099.1| retroelement pol polyprotein-like [Arabidopsis thaliana]
          Length = 1098

 Score =  103 bits (258), Expect = 8e-20
 Identities = 64/209 (30%), Positives = 99/209 (47%), Gaps = 9/209 (4%)
 Frame = -2

Query: 953 ILNTIELSLQS*ITKGKIVKVTWDKLLQHFQTENGPRYYELKSAVMGCKQ--VNTTTYFA 780
           I  +I+ +++S +      K  WD L Q F   NG R   LK  ++ CKQ   +   Y+ 
Sbjct: 83  IRTSIDPTIRSTVAFVSDAKDLWDSLKQRFSNGNGVRKQLLKDEILACKQDGQSVLVYYG 142

Query: 779 KLNKLWDELANYQRLSICDCAKPVLELTKQREKENLLQFCSDRITSSLVPCDQIFFGEDA 600
           +L KLW+EL NY+    C C +   ++ K+RE + + QF  + +     P       +D 
Sbjct: 143 RLTKLWEELQNYKTSRTCTC-EAAPDIAKEREDDKVHQFLLN-LDERFRPIRSTITVQDP 200

Query: 599 LPNMNQAYSKIIQQEQVKNMXXXXXXXXXXAFIFQ-------PLLFGGGFESEKKTNKSG 441
           LP +NQ YS++I +EQ  N           A  F        P        + +  ++S 
Sbjct: 201 LPALNQVYSRVIHEEQNLNASRIKDDIKTEAVGFTVQATPLPPTPQVAAVSAPRFRDRSS 260

Query: 440 LICTVCNKMGHEARSCFQVVGFPDWWLEK 354
           L CT  ++ GH+   CF V G+PDWWLE+
Sbjct: 261 LTCTHYHRQGHDITECFLVHGYPDWWLEQ 289


>dbj|BAB08885.1| retroelement pol polyprotein-like [Arabidopsis thaliana]
          Length = 370

 Score =  102 bits (253), Expect = 3e-19
 Identities = 57/206 (27%), Positives = 99/206 (48%), Gaps = 9/206 (4%)
 Frame = -2

Query: 944 TIELSLQS*ITKGKIVKVTWDKLLQHFQTENGPRYYELKSAVMGCKQVNTTT--YFAKLN 771
           +IE  ++S +T      + WD+L Q F   N  R +++K+ +  C+Q   T   Y+ +L 
Sbjct: 103 SIEPKVKSTVTFISDAHLLWDELRQRFSVTNNVRVHQIKAQLASCRQEGQTVIDYYGRLC 162

Query: 770 KLWDELANYQRLSICDCAKPVLELTKQREKENLLQFCSDRITSSLVPCDQIFFGEDALPN 591
            LWDEL NYQ  ++C     +  + K+R+ E L QF     ++            D LP+
Sbjct: 163 NLWDELKNYQASAVCPHGSVLTAIVKERDDEKLHQFVLGLDSARFSGLCTNLINMDPLPS 222

Query: 590 MNQAYSKIIQQEQ-VKNMXXXXXXXXXXAFIFQPLLFGGGFESEKKTNKSGLI------C 432
           +  AYS++I++EQ +              F+ +           + + +S ++      C
Sbjct: 223 LGVAYSQVIREEQRIHASRTQEQRQEVVGFVARHEQSSAMSSPAQSSIESSIVKSRPVLC 282

Query: 431 TVCNKMGHEARSCFQVVGFPDWWLEK 354
           + C + GHE + C+ +VGFPDWW E+
Sbjct: 283 SHCGRTGHEKKDCWSIVGFPDWWTER 308


>emb|CAN76645.1| hypothetical protein VITISV_004685 [Vitis vinifera]
          Length = 1196

 Score =  101 bits (251), Expect = 5e-19
 Identities = 67/237 (28%), Positives = 114/237 (48%), Gaps = 30/237 (12%)
 Frame = -2

Query: 953 ILNTIELSLQS*ITKGKIVKVTWDKLLQHFQTENGPRYYELKSAVMGCKQ---VNTTTYF 783
           I NTI+  ++S ++K +  K  W+ L Q +   NGPR  +LK+++  C+Q   ++ TTY+
Sbjct: 6   ITNTIDPEVKSTLSKFRDAKRLWEHLKQRYAMVNGPRIQQLKTSIAKCEQSKSMSVTTYY 65

Query: 782 AKLNKLWDELANYQRLSICDCAKPVLELT---KQREKENLLQFCSDRITSSLVPCDQIFF 612
            KLN LW+EL  ++ L  C C       +    +RE+  L  F     T           
Sbjct: 66  GKLNVLWEELFKHEPLISCTCCSSCTAASLHQARREQGKLHDFLMGLNTDLYAQLXTNIL 125

Query: 611 GEDALPNMNQAYSKIIQQEQVK--NMXXXXXXXXXXAFIFQPLLFGGGFESE-------K 459
            +D LP++++AY  +IQ E+V+               F  +  +  G  ++E       K
Sbjct: 126 SQDPLPSLDRAYQLVIQDERVRLAKAVTEDKPAEVLGFAVRTGVGXGRGKTERLVCXHXK 185

Query: 458 KTNK------SGLICTVCNKMGHEARSCFQVVGFPDWWLE---------KTRQQSNR 333
           KT        S + C  C+K GH+  +C+++VG+P+ WL+         ++RQQ+ R
Sbjct: 186 KTGHETSTCWSXVACPHCHKHGHDKNNCYEIVGYPEGWLDQNKADGGAGRSRQQAGR 242


>gb|AAG50751.1|AC079733_19 polyprotein, putative [Arabidopsis thaliana]
          Length = 1468

 Score =  100 bits (248), Expect = 1e-18
 Identities = 61/215 (28%), Positives = 103/215 (47%), Gaps = 12/215 (5%)
 Frame = -2

Query: 944 TIELSLQS*ITKGKIVKVTWDKLLQHFQTENGPRYYELKSAVMGCKQVNTTT--YFAKLN 771
           TI+  L + I+   + +  W+++ + F   NGP+  ++K+ +  CKQ   T   Y+ KLN
Sbjct: 94  TIDSELLTNISHRDVARDLWEQIRKRFSVSNGPKNQKMKADLATCKQEGMTVEGYYGKLN 153

Query: 770 KLWDELANYQRLSICDCAKPVLEL----TKQREKENLLQFCSDRITSSLVPCDQIFFGED 603
           K+WD + +Y+ L IC C + +  L     K RE + + Q+      +             
Sbjct: 154 KIWDNINSYRPLRICKCGRCICNLGTDQEKYREDDMVHQYLYGLNETKFHTIRSSLTSRV 213

Query: 602 ALPNMNQAYSKIIQQEQ-VKNMXXXXXXXXXXAFIFQ-----PLLFGGGFESEKKTNKSG 441
            LP + + Y+ + Q+E  V N           AF  Q      ++      SEK  NK  
Sbjct: 214 PLPGLEEVYNIVRQEEDMVNNRSSNEERTDVTAFAVQMRPRSEVISEKFANSEKLQNKK- 272

Query: 440 LICTVCNKMGHEARSCFQVVGFPDWWLEKTRQQSN 336
            +CT CN+ GH   +CF ++G+P+WW ++ R +SN
Sbjct: 273 -LCTHCNRGGHSPENCFVLIGYPEWWGDRPRGKSN 306


>dbj|BAA97287.1| retroelement pol polyprotein-like [Arabidopsis thaliana]
          Length = 1491

 Score = 99.4 bits (246), Expect = 2e-18
 Identities = 56/208 (26%), Positives = 101/208 (48%), Gaps = 10/208 (4%)
 Frame = -2

Query: 944 TIELSLQS*ITKGKIVKVTWDKLLQHFQTENGPRYYELKSAVMGCKQVNTTT--YFAKLN 771
           +IE  ++S +T        W +L Q F   N  R +++K+ +  C+Q       Y+ +L 
Sbjct: 100 SIEPKVKSTVTFISDAHQLWSELKQRFSVGNKVRVHQIKAQLAACRQDGQPVIDYYGRLC 159

Query: 770 KLWDELANYQRLSICDCAK----PVLELTKQREKENLLQFCSDRITSSLVPCDQIFFGED 603
           KLW+E   Y+ +++C C        LE +K+RE+E + QF      S            D
Sbjct: 160 KLWEEFQIYKPITVCKCGLCTCGATLEPSKEREEEKIHQFVLGLDDSRFGGLSATLIAMD 219

Query: 602 ALPNMNQAYSKIIQQEQ-VKNMXXXXXXXXXXAFIFQPLLFGGGFESEKKTNKS---GLI 435
             P++ + YS+++++EQ + ++           F+ +         ++    KS    ++
Sbjct: 220 PFPSLGEIYSRVVREEQRLASVQIREQQQSAIGFLTRQSEVTADGRTDSSIIKSRDRSVL 279

Query: 434 CTVCNKMGHEARSCFQVVGFPDWWLEKT 351
           C+ C + GHE + C+Q+VGFPDWW E+T
Sbjct: 280 CSHCGRSGHEKKDCWQIVGFPDWWTERT 307


>emb|CAN74847.1| hypothetical protein VITISV_028741 [Vitis vinifera]
          Length = 1262

 Score = 98.6 bits (244), Expect = 3e-18
 Identities = 66/237 (27%), Positives = 109/237 (45%), Gaps = 30/237 (12%)
 Frame = -2

Query: 953 ILNTIELSLQS*ITKGKIVKVTWDKLLQHFQTENGPRYYELKSAVMGCKQ---VNTTTYF 783
           I NTI+  ++S ++K +  K  W+ L Q +   NGPR  +LK+++  C+Q   ++ TTY+
Sbjct: 90  ITNTIDPEVKSTLSKFRDAKRLWEHLKQRYAMVNGPRIQQLKTSIAKCEQPKSMSVTTYY 149

Query: 782 AKLNKLWDELANYQRLSICDCAKPVLELT---KQREKENLLQFCSDRITSSLVPCDQIFF 612
            KLN LW+EL   + L  C C       +    +RE+  L  F     T           
Sbjct: 150 GKLNVLWEELFKNEPLISCTCCSSCTAASLHQARREQGKLHDFLMGLNTDLYAQLRTNIL 209

Query: 611 GEDALPNMNQAYSKIIQQEQVKNMXXXXXXXXXXAFIFQPLLFGGGFESE---------K 459
            +D LP++++AY  +IQ E+V+               F      G    +         K
Sbjct: 210 SQDPLPSLDRAYQLVIQDERVRLAKAVTEDKPAEVLGFXVRTGAGRGRGKTERPVCSHXK 269

Query: 458 KTNK------SGLICTVCNKMGHEARSCFQVVGFPDWWLE---------KTRQQSNR 333
           KT        S + C  C+K GH+  +C+++VG+P+ WL+         ++RQQ+ R
Sbjct: 270 KTGHETSTCWSXVACPHCHKHGHDKNNCYEIVGYPEGWLDQNKADGGAGRSRQQAGR 326


>emb|CAN67762.1| hypothetical protein VITISV_040650 [Vitis vinifera]
          Length = 1316

 Score = 97.8 bits (242), Expect = 5e-18
 Identities = 58/183 (31%), Positives = 85/183 (46%), Gaps = 2/183 (1%)
 Frame = -2

Query: 887 WDKLLQHFQTENGPRYYELKSAVMGCKQVNTTT--YFAKLNKLWDELANYQRLSICDCAK 714
           W+ L + +   N PR ++L+S ++  KQ   T   Y+AK+  +WDEL  Y  +  C C  
Sbjct: 2   WEDLKERYAVGNAPRVHQLRSEIVNLKQEGMTVAAYYAKIKGMWDELNQYIEIPECTCGA 61

Query: 713 PVLELTKQREKENLLQFCSDRITSSLVPCDQIFFGEDALPNMNQAYSKIIQQEQVKNMXX 534
               + K RE E   QF      ++           D LP + + Y+ + Q+E+ ++M  
Sbjct: 62  -AQAIVKSREDEKAHQFLMGLDDTTFGTVRSSILALDPLPTLGKIYAMVTQEERHRSMAR 120

Query: 533 XXXXXXXXAFIFQPLLFGGGFESEKKTNKSGLICTVCNKMGHEARSCFQVVGFPDWWLEK 354
                    F       GG      +TNKSG  CT C K GH+   CFQ+ G+PDWW   
Sbjct: 121 GADRAEITVFAAXTEKPGG------QTNKSGS-CTHCGKTGHDVADCFQLKGYPDWW--P 171

Query: 353 TRQ 345
           TRQ
Sbjct: 172 TRQ 174


>emb|CAN80919.1| hypothetical protein VITISV_002640 [Vitis vinifera]
          Length = 1450

 Score = 97.4 bits (241), Expect = 7e-18
 Identities = 65/237 (27%), Positives = 110/237 (46%), Gaps = 30/237 (12%)
 Frame = -2

Query: 953 ILNTIELSLQS*ITKGKIVKVTWDKLLQHFQTENGPRYYELKSAVMGCKQ---VNTTTYF 783
           I NTI+  ++S ++K +  K  W+ L Q +   NGPR  +LK+++  C+Q   ++ TTY+
Sbjct: 91  ITNTIDPEVKSTLSKFRDAKRLWEHLKQRYAMVNGPRIQQLKTSIAKCEQSKSMSVTTYY 150

Query: 782 AKLNKLWDELANYQRLSICDCAKPVLELT---KQREKENLLQFCSDRITSSLVPCDQIFF 612
            KLN LW+EL  ++ L  C C       +    +RE+  L  F     T           
Sbjct: 151 GKLNVLWEELFKHEPLISCTCCSSCTAASLHQARREQGKLHDFLMGLNTDLYAQLRTNIL 210

Query: 611 GEDALPNMNQAYSKIIQQEQVKNMXXXXXXXXXXAFIFQPLLFGGGFESE---------K 459
            +D LP++++AY  +IQ ++V+               F      G    +         K
Sbjct: 211 SQDPLPSLDRAYQLVIQDKRVRLAKAVTEDKPAEVLGFAVRTGAGRGRGKTERPVCSHCK 270

Query: 458 KTNK------SGLICTVCNKMGHEARSCFQVVGFPDWWLE---------KTRQQSNR 333
           KT        S + C  C+K GH+  +C+++VG+P+ WL+         ++RQQ+ R
Sbjct: 271 KTGHETSTCWSLVACPHCHKHGHDKNNCYEIVGYPEGWLDQNKADGGAGRSRQQAGR 327


>gb|AAC67205.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
          Length = 1413

 Score = 96.3 bits (238), Expect = 2e-17
 Identities = 51/189 (26%), Positives = 92/189 (48%), Gaps = 10/189 (5%)
 Frame = -2

Query: 887 WDKLLQHFQTENGPRYYELKSAVMGCKQVNTTT--YFAKLNKLWDELANYQRLSICDCAK 714
           W +L Q F   N    +++K+ +  C+Q       Y+ +L KLW+E   Y+ +++C C  
Sbjct: 119 WSELKQRFSVGNKVHVHQIKTQLAACRQDGQPVIDYYGRLCKLWEEFQIYKPITVCKCGL 178

Query: 713 ----PVLELTKQREKENLLQFCSDRITSSLVPCDQIFFGEDALPNMNQAYSKIIQQEQ-V 549
                 LE +K+RE+E + QF      S            D  P++ + YS+++++EQ +
Sbjct: 179 CTCGATLEPSKEREEEKIHQFVLGLDDSRFGGLSATLIAMDPFPSLGEIYSRVVREEQRL 238

Query: 548 KNMXXXXXXXXXXAFIFQPLLFGGGFESEKKTNKS---GLICTVCNKMGHEARSCFQVVG 378
            ++           F+ +         ++    KS    ++C+ C + GHE + C+Q+VG
Sbjct: 239 ASVQIREQQQSAIGFLTRQSEVTADGRTDSSIIKSRDRSVLCSHCGRSGHEKKDCWQIVG 298

Query: 377 FPDWWLEKT 351
           FPDWW E+T
Sbjct: 299 FPDWWTERT 307


>gb|AAB87099.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
          Length = 1496

 Score = 95.9 bits (237), Expect = 2e-17
 Identities = 61/193 (31%), Positives = 91/193 (47%), Gaps = 10/193 (5%)
 Frame = -2

Query: 887 WDKLLQHFQTENGPRYYELKSAVMGCKQVN--TTTYFAKLNKLWDELANYQRLSICDCAK 714
           W+ L + F   NG R   LK  +  C Q       Y+ +L KLW+EL NY+    C C +
Sbjct: 113 WENLRRRFSVGNGVRKTLLKDEIAACTQDGQPVLAYYGRLIKLWEELQNYKSGRECKC-E 171

Query: 713 PVLELTKQREKENLLQFC---SDRITSSLVPCDQIFFGEDALPNMNQAYSKIIQQEQVKN 543
              ++ K+RE + + +F      R +S       I    + LP++ Q YS+++++EQ  N
Sbjct: 172 AASDIEKEREDDRVHKFLLGLDSRFSSIRSSITDI----EPLPDLYQVYSRVVREEQNLN 227

Query: 542 MXXXXXXXXXXAFIFQPLLFGGGFESEKKT-----NKSGLICTVCNKMGHEARSCFQVVG 378
                      A          GF  +  T     +KS L CT CN+ GHE   CF V G
Sbjct: 228 ASRTKDVVKTEAI---------GFSVQSSTTPRFRDKSTLFCTHCNRKGHEVTQCFLVHG 278

Query: 377 FPDWWLEKTRQQS 339
           +PDWWLE+  Q++
Sbjct: 279 YPDWWLEQNPQEN 291


>ref|XP_006299226.1| hypothetical protein CARUB_v10015375mg, partial [Capsella rubella]
           gi|482567935|gb|EOA32124.1| hypothetical protein
           CARUB_v10015375mg, partial [Capsella rubella]
          Length = 361

 Score = 88.6 bits (218), Expect = 3e-15
 Identities = 62/205 (30%), Positives = 95/205 (46%), Gaps = 2/205 (0%)
 Frame = -2

Query: 953 ILNTIELSLQS*ITKGKIVKVTWDKLLQHFQTENGPRYYELKSAVMGCKQ--VNTTTYFA 780
           ILNT++  ++  +   +     W ++   F   NGPR  E+K+ +M C Q  +    YF 
Sbjct: 84  ILNTVDPKVRRTLAIKEDPMELWKEIKDCFSEGNGPRIQEIKAELMLCCQGTMAVIEYFG 143

Query: 779 KLNKLWDELANYQRLSICDCAKPVLELTKQREKENLLQFCSDRITSSLVPCDQIFFGEDA 600
           KL  LW+ + N +    C C      L  + EK+       DRI   L+  D   +G   
Sbjct: 144 KLQVLWENMTNNETPLTCTCDGCSCNLKVKLEKKRE----DDRIHHFLLGLDVTIYG--G 197

Query: 599 LPNMNQAYSKIIQQEQVKNMXXXXXXXXXXAFIFQPLLFGGGFESEKKTNKSGLICTVCN 420
           L      YSK+   E+V N+             F  L       S   TNKS L+C+ C 
Sbjct: 198 LRTTIIVYSKVKLVERV-NIVMRGREQQASQVAFLALRSD---VSVGNTNKSKLVCSSCT 253

Query: 419 KMGHEARSCFQVVGFPDWWLEKTRQ 345
           + GH A +CFQV+G+P+WW +++R+
Sbjct: 254 RTGHTAETCFQVIGYPEWWGDRSRR 278


>gb|AAT71979.1| At5g39185 [Arabidopsis thaliana]
          Length = 348

 Score = 88.2 bits (217), Expect = 4e-15
 Identities = 54/181 (29%), Positives = 87/181 (48%), Gaps = 3/181 (1%)
 Frame = -2

Query: 887 WDKLLQHFQTENGPRYYELKSAVMGCKQVNTT--TYFAKLNKLWDELANYQRLSICDCAK 714
           W  + + F  +NG R   LK+ +  C+Q  T   TY+ KL++LW  LA+YQ+      AK
Sbjct: 116 WTHIQKRFGVKNGQRIQRLKTELATCRQKGTPIETYYGKLSQLWRSLADYQQ------AK 169

Query: 713 PVLELTKQREKENLLQFCSDRITSSLVPCDQIFFGEDALPNMNQAYSKIIQQEQVKNMXX 534
            + E+ K+RE++ L QF      S              LP++ +AY+ + Q E+ K++  
Sbjct: 170 TMEEVRKEREEDKLHQFLMGLDESMYGAVKSALLSRVPLPSLEEAYNTLTQDEESKSLSR 229

Query: 533 XXXXXXXXA-FIFQPLLFGGGFESEKKTNKSGLICTVCNKMGHEARSCFQVVGFPDWWLE 357
                     F  Q         S  K   S ++C+ C ++GH A +CF++VG+P W   
Sbjct: 230 LHDERNDGVSFAVQTT---PRTRSLTKNKDSAIVCSHCGRLGHLAENCFKLVGYPPWLKR 286

Query: 356 K 354
           K
Sbjct: 287 K 287


>gb|AAG09097.1|AC009323_8 Putative retroelement polyprotein [Arabidopsis thaliana]
          Length = 1486

 Score = 85.9 bits (211), Expect = 2e-14
 Identities = 50/182 (27%), Positives = 87/182 (47%), Gaps = 2/182 (1%)
 Frame = -2

Query: 887 WDKLLQHFQTENGPRYYELKSAVMGCKQ--VNTTTYFAKLNKLWDELANYQRLSICDCAK 714
           W  + + F  +NG R   LK+ +  C+Q  V   TY+ +L++LW  LA+YQ+      AK
Sbjct: 117 WTHIQKRFGVKNGQRVQRLKTELATCRQKGVAIETYYGRLSQLWRSLADYQQ------AK 170

Query: 713 PVLELTKQREKENLLQFCSDRITSSLVPCDQIFFGEDALPNMNQAYSKIIQQEQVKNMXX 534
            + ++ K+RE++ L QF      S              LP++ +AY+ + Q E+ K++  
Sbjct: 171 TMDDVRKEREEDKLHQFLMGLDESVYGAVKSALLSRVPLPSLEEAYNALTQDEESKSLSR 230

Query: 533 XXXXXXXXAFIFQPLLFGGGFESEKKTNKSGLICTVCNKMGHEARSCFQVVGFPDWWLEK 354
                         + F     S  + +    +C+ C ++GH A  CF+++G+P W  EK
Sbjct: 231 LHNERVDG------VSFAVQTTSRPRDSSENRVCSNCGRVGHLAEQCFKLIGYPPWLEEK 284

Query: 353 TR 348
            R
Sbjct: 285 LR 286


>ref|XP_006418743.1| hypothetical protein EUTSA_v10002805mg, partial [Eutrema
           salsugineum] gi|557096671|gb|ESQ37179.1| hypothetical
           protein EUTSA_v10002805mg, partial [Eutrema salsugineum]
          Length = 253

 Score = 77.8 bits (190), Expect = 6e-12
 Identities = 55/210 (26%), Positives = 89/210 (42%), Gaps = 10/210 (4%)
 Frame = -2

Query: 953 ILNTIELSLQS*ITKGKIVKVTWDKLLQHFQTENGPRYYELKSAVMGCKQVNTTT--YFA 780
           I +++E  L+  I+     K  W  L + F   N  R  +L + +  CKQ   T   +F 
Sbjct: 42  IYSSVEPKLRPSISLVDSAKAMWASLQRRFSVNNDTRVLQLLADINNCKQDGDTVEIFFG 101

Query: 779 KLNKLWDELANYQRLSICDCAKPVLELTKQREKENLLQFCSDRITSSLVPCDQIFFGEDA 600
           +L  +WD+LA+  +   C C           EK    QF      S           + +
Sbjct: 102 RLKVMWDDLADLDKGFTCCCGT---------EKILFHQFLMGFDNSRFGTTHSNILSQQS 152

Query: 599 LPNMNQAYSKIIQQEQVKNMXXXXXXXXXXAFIF-----QPLLFGGGFESEK---KTNKS 444
             N++  YS+I+Q+E+  N+            +      QPL        +    K ++ 
Sbjct: 153 EINLDMVYSQIVQEERYLNVMRGAEERIPVMGLSATTQPQPLQHSAPKTEQAAAAKFSRP 212

Query: 443 GLICTVCNKMGHEARSCFQVVGFPDWWLEK 354
             +CT C K GHEA SCF ++GFP+W+ +K
Sbjct: 213 TTMCTHCGKTGHEATSCFYLIGFPEWYNDK 242


>gb|AAD15368.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
           gi|17065314|gb|AAL32811.1| putative retroelement pol
           polyprotein [Arabidopsis thaliana]
           gi|21387147|gb|AAM47977.1| putative retroelement pol
           polyprotein [Arabidopsis thaliana]
          Length = 411

 Score = 77.0 bits (188), Expect = 1e-11
 Identities = 62/223 (27%), Positives = 97/223 (43%), Gaps = 14/223 (6%)
 Frame = -2

Query: 965 IES*ILNTIELSLQS*ITKGKIVKVTWDKLLQHFQTENGPRYYELKSAVMGCKQ--VNTT 792
           ++S ILN +   +   I   +     W  L   F+  N PR Y+L+ AVM  KQ  +N +
Sbjct: 133 VKSWILNVVNKEIYDSILYYEDAVEMWTDLFTRFRVNNLPRKYQLEQAVMTLKQGSLNLS 192

Query: 791 TYFAKLNKLWDELANYQRLSI--CDCAKPVLELTKQREKENLLQFCSDRITSSLVPCDQI 618
           TYF K   LW++L N +  S+  CDC + V EL +  E   ++QF             QI
Sbjct: 193 TYFTKKKTLWEQLLNTKTRSVKKCDCDQ-VKELLEDAETSRVIQFLMGLNDDFNTIMSQI 251

Query: 617 FFGEDALPNMNQAYSKIIQQEQVKNMXXXXXXXXXXAFIFQ---------PLLFG-GGFE 468
                  P +N+ Y+ ++ Q++ + +             FQ         P+L   G F+
Sbjct: 252 -LNMKPRPGLNEIYN-MLDQDESQRLVGHASKPTPSPAAFQTQGLLTEQNPILMAQGNFK 309

Query: 467 SEKKTNKSGLICTVCNKMGHEARSCFQVVGFPDWWLEKTRQQS 339
             K        CT CN++GH    C++V G+P       +Q S
Sbjct: 310 KPK--------CTHCNRIGHTVDKCYKVHGYPPGHPRANQQSS 344


>emb|CAN64225.1| hypothetical protein VITISV_016222 [Vitis vinifera]
          Length = 987

 Score = 75.9 bits (185), Expect = 2e-11
 Identities = 41/176 (23%), Positives = 82/176 (46%), Gaps = 4/176 (2%)
 Frame = -2

Query: 887 WDKLLQHFQTENGPRYYELKSAVMGCKQ--VNTTTYFAKLNKLWDELANYQRLSICDCAK 714
           W+ L + F   +GPR +E+K  ++   Q      TY+ +   LWDEL  ++ + +C+C  
Sbjct: 116 WNDLYERFHQGSGPRIFEIKQKILAHTQGLAYVNTYYTRQKSLWDELREFKAIPVCNCGG 175

Query: 713 PVLELTKQREKENLLQFCSDRITSSLVPCDQIFFGEDALPNMNQAYSKIIQQEQVKNMXX 534
             + +  Q ++E+++QF    +  S  P           P +N+ +S ++Q+E+ +++  
Sbjct: 176 MRVYMEDQ-QRESVMQFLLG-LNESFAPIRAQILLMKPTPPLNKVFSLVVQEERQRSLTI 233

Query: 533 XXXXXXXXAFI--FQPLLFGGGFESEKKTNKSGLICTVCNKMGHEARSCFQVVGFP 372
                        FQ         +  ++ K   +CT CN +GH    C+++ G+P
Sbjct: 234 SNSPAFTAPVSSRFQAASRASSPTNSSRSRKDRPLCTHCNILGHTVDRCYKIHGYP 289

