BLASTX nr result

ID: Ephedra25_contig00023129 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ephedra25_contig00023129
         (1138 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002263384.1| PREDICTED: uncharacterized protein LOC100245...    99   4e-18
ref|XP_006493066.1| PREDICTED: uncharacterized protein LOC102620...    95   6e-17
ref|XP_006493067.1| PREDICTED: uncharacterized protein LOC102620...    93   2e-16
gb|EOY05199.1| Uncharacterized protein isoform 7, partial [Theob...    93   2e-16
gb|EOY05198.1| Uncharacterized protein isoform 6 [Theobroma cacao]     93   2e-16
gb|EOY05197.1| Uncharacterized protein isoform 5 [Theobroma cacao]     93   2e-16
gb|EOY05196.1| Uncharacterized protein isoform 4, partial [Theob...    93   2e-16
gb|EOY05194.1| Uncharacterized protein isoform 2 [Theobroma caca...    93   2e-16
gb|EOY05193.1| Uncharacterized protein isoform 1 [Theobroma cacao]     93   2e-16
ref|XP_002518043.1| conserved hypothetical protein [Ricinus comm...    85   5e-14
ref|NP_189033.1| uncharacterized protein [Arabidopsis thaliana] ...    81   7e-13
ref|NP_189068.2| RNA-directed DNA polymerase (reverse transcript...    80   2e-12
dbj|BAB02924.1| unnamed protein product [Arabidopsis thaliana]         79   3e-12
ref|NP_001154643.1| RNA-directed DNA polymerase (reverse transcr...    79   3e-12
ref|XP_004133985.1| PREDICTED: uncharacterized protein LOC101211...    78   8e-12
ref|XP_002885604.1| hypothetical protein ARALYDRAFT_342541 [Arab...    77   1e-11
ref|XP_006418827.1| hypothetical protein EUTSA_v10002763mg, part...    77   1e-11
ref|XP_006297761.1| hypothetical protein CARUB_v10013795mg [Caps...    76   2e-11
gb|EXB89377.1| hypothetical protein L484_017343 [Morus notabilis]      75   6e-11
ref|XP_002300157.1| hypothetical protein POPTR_0001s32530g [Popu...    72   3e-10

>ref|XP_002263384.1| PREDICTED: uncharacterized protein LOC100245254 [Vitis vinifera]
           gi|298205214|emb|CBI17273.3| unnamed protein product
           [Vitis vinifera]
          Length = 425

 Score = 98.6 bits (244), Expect = 4e-18
 Identities = 89/318 (27%), Positives = 140/318 (44%)
 Frame = -1

Query: 958 SKLDLGNRLHTKLVKLQDDLHGLEEEATSVSNFDTFNDSHLLRLQETLADMPLEYFKFSG 779
           S++   NR+HT    + D          S S F  F+     R+ + L+       ++S 
Sbjct: 17  SRMSELNRIHTNYSHISDS-----NPLDSRSLFQEFSHHLQSRVNQILS-------QYSD 64

Query: 778 SDDNVNDDFDMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKDTNNLTGDIELLTTYIE 599
            +    DD D  +  +K EL +VE E AK  NEIE L     +D+N L  D+E+L   ++
Sbjct: 65  VESLEADDLDAYLGHLKKELNLVESENAKISNEIEALTRTYVEDSNQLESDLEVLKHSVD 124

Query: 598 FWQNDDNVGSGTEKNAQIKELENEMEQQKSEESKYDDISRFEICLLSGENFMFEALELED 419
           F      V S   K A+   L   ++   S E + D  +        G+N  FE L+L  
Sbjct: 125 F------VASQGLKRAEAGAL---VDYSSSVEDQLDSRTAH------GDN-NFEILDLNY 168

Query: 418 QLIRKRQKYGELRMLDDMSKRVDALSMIEESLSDIKEIGLLENCIKLTLKTFIPAKGGSI 239
           Q  + +     L+ LD   KR +A+  IE++L+ +K I    NCI+L+L TFIP   G +
Sbjct: 169 QTQKNKITLKSLQDLDYTFKRFEAIEKIEDALTGLKVIDFEGNCIRLSLSTFIPNLEGLL 228

Query: 238 CQFEINNIGESFMVEHVLKVKVEKDSTTFLDAELSXXXXXXXXXIHMSKYSSDASDKNIS 59
           C+ +I  + E   + H L ++V   S    + E+          I  +K S         
Sbjct: 229 CEEKIEAVNEPSELNHELLIEVMDQSMELKNVEIFPNDVYLGEIIDAAKSSRKLFSHMSI 288

Query: 58  LNVNGQLNSLVREIQYRI 5
           L     L   VR++Q +I
Sbjct: 289 LETRSSLEWFVRKVQDKI 306


>ref|XP_006493066.1| PREDICTED: uncharacterized protein LOC102620884 isoform X1 [Citrus
           sinensis]
          Length = 447

 Score = 94.7 bits (234), Expect = 6e-17
 Identities = 88/334 (26%), Positives = 157/334 (47%), Gaps = 6/334 (1%)
 Frame = -1

Query: 988 VEEMAHSNAGSKLDLGNRLHTKLVKLQDDLH--GLEEEATSVSNFDTFNDSHLLRLQETL 815
           VE  A  ++ S LDL + L +++ +L + +H  G+E+E  +VS+     DS  L L+E  
Sbjct: 11  VEATATPSSSSPLDL-HSLRSEVKELME-IHRSGIEDEPNTVSS-----DSENL-LKEYA 62

Query: 814 ADMPLEYFKFSGSDDNVN----DDFDMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKD 647
            D   +  +      +V+    +D D  ++ +K EL  VE E +K  NEIE L     +D
Sbjct: 63  HDFESKVKEIITEYADVSFLGIEDLDAYLEHLKEELKTVEAESSKISNEIETLTRTQVED 122

Query: 646 TNNLTGDIELLTTYIEFWQNDDNVGSGTEKNAQIKELENEMEQQKSEESKYDDISRFEIC 467
           ++ L  D+E L   I+   ++ +  +  ++ A +     E +   +      D+ +    
Sbjct: 123 SDRLESDLEELNCAIDLIVSEGSQNAKEDRQA-VCPARGEDQVCPTHTEDQSDLIKIH-- 179

Query: 466 LLSGENFMFEALELEDQLIRKRQKYGELRMLDDMSKRVDALSMIEESLSDIKEIGLLENC 287
               E+  FE LELE Q+ + +     L+ LD + KR DA+  IE+SL+ +K I     C
Sbjct: 180 ----EDHRFEILELESQIEKNKIILNSLQDLDFVLKRFDAVEQIEDSLTGLKVIDFDGKC 235

Query: 286 IKLTLKTFIPAKGGSICQFEINNIGESFMVEHVLKVKVEKDSTTFLDAELSXXXXXXXXX 107
            +L+++T+IP    S  Q +I ++ E   V H L ++V   +    + E+          
Sbjct: 236 FRLSMQTYIPTLEESSFQHKIEDVIEPSEVNHELLIEVIDGTMEIKNVEMFPNDVHISDL 295

Query: 106 IHMSKYSSDASDKNISLNVNGQLNSLVREIQYRI 5
           +  +K    +  +  SL  +  L   +R +Q RI
Sbjct: 296 VDAAKSFRQSGTQLDSLETSSSLQWFIRNVQDRI 329


>ref|XP_006493067.1| PREDICTED: uncharacterized protein LOC102620884 isoform X2 [Citrus
           sinensis]
          Length = 444

 Score = 92.8 bits (229), Expect = 2e-16
 Identities = 88/334 (26%), Positives = 154/334 (46%), Gaps = 6/334 (1%)
 Frame = -1

Query: 988 VEEMAHSNAGSKLDLGNRLHTKLVKLQDDLH--GLEEEATSVSNFDTFNDSHLLRLQETL 815
           VE  A  ++ S LDL + L +++ +L + +H  G+E+E  +VS+     DS  L L+E  
Sbjct: 11  VEATATPSSSSPLDL-HSLRSEVKELME-IHRSGIEDEPNTVSS-----DSENL-LKEYA 62

Query: 814 ADMPLEYFKFSGSDDNVN----DDFDMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKD 647
            D   +  +      +V+    +D D  ++ +K EL  VE E +K  NEIE L     +D
Sbjct: 63  HDFESKVKEIITEYADVSFLGIEDLDAYLEHLKEELKTVEAESSKISNEIETLTRTQVED 122

Query: 646 TNNLTGDIELLTTYIEFWQNDDNVGSGTEKNAQIKELENEMEQQKSEESKYDDISRFEIC 467
           ++ L  D+E L   I+   +++      E    +     E +   +      D+ +    
Sbjct: 123 SDRLESDLEELNCAIDLIVSEN----AKEDRQAVCPARGEDQVCPTHTEDQSDLIKIH-- 176

Query: 466 LLSGENFMFEALELEDQLIRKRQKYGELRMLDDMSKRVDALSMIEESLSDIKEIGLLENC 287
               E+  FE LELE Q+ + +     L+ LD + KR DA+  IE+SL+ +K I     C
Sbjct: 177 ----EDHRFEILELESQIEKNKIILNSLQDLDFVLKRFDAVEQIEDSLTGLKVIDFDGKC 232

Query: 286 IKLTLKTFIPAKGGSICQFEINNIGESFMVEHVLKVKVEKDSTTFLDAELSXXXXXXXXX 107
            +L+++T+IP    S  Q +I ++ E   V H L ++V   +    + E+          
Sbjct: 233 FRLSMQTYIPTLEESSFQHKIEDVIEPSEVNHELLIEVIDGTMEIKNVEMFPNDVHISDL 292

Query: 106 IHMSKYSSDASDKNISLNVNGQLNSLVREIQYRI 5
           +  +K    +  +  SL  +  L   +R +Q RI
Sbjct: 293 VDAAKSFRQSGTQLDSLETSSSLQWFIRNVQDRI 326


>gb|EOY05199.1| Uncharacterized protein isoform 7, partial [Theobroma cacao]
          Length = 343

 Score = 92.8 bits (229), Expect = 2e-16
 Identities = 65/208 (31%), Positives = 102/208 (49%)
 Frame = -1

Query: 760 DDFDMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKDTNNLTGDIELLTTYIEFWQNDD 581
           +D D  +  +K EL  VE E AK  NEIE+L+    +++N L G++E L   +     D 
Sbjct: 51  EDLDEYLAHLKEELNQVEAESAKISNEIEDLSRNHIEESNILEGNLEGLKYAL-----DS 105

Query: 580 NVGSGTEKNAQIKELENEMEQQKSEESKYDDISRFEICLLSGENFMFEALELEDQLIRKR 401
               G E   +   L++ M  +      +           S E   FE +ELE Q+ +  
Sbjct: 106 IASQGMEGVEEDPCLDSSMNDEDQSNLMH-----------SNEEQKFEIMELESQIEKNN 154

Query: 400 QKYGELRMLDDMSKRVDALSMIEESLSDIKEIGLLENCIKLTLKTFIPAKGGSICQFEIN 221
                L+ LD M KR+D L  IE++L+ +K IG   NCI+L+L+T+IP   G +CQ  I 
Sbjct: 155 IILKSLQDLDSMFKRLDTLEQIEDALTGLKVIGFDGNCIRLSLQTYIPKLEGLLCQKTIE 214

Query: 220 NIGESFMVEHVLKVKVEKDSTTFLDAEL 137
           +I E   + H L V++   +    + E+
Sbjct: 215 DISEPSEMNHELLVEIVDGTMEIKNVEM 242


>gb|EOY05198.1| Uncharacterized protein isoform 6 [Theobroma cacao]
          Length = 432

 Score = 92.8 bits (229), Expect = 2e-16
 Identities = 65/208 (31%), Positives = 102/208 (49%)
 Frame = -1

Query: 760 DDFDMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKDTNNLTGDIELLTTYIEFWQNDD 581
           +D D  +  +K EL  VE E AK  NEIE+L+    +++N L G++E L   +     D 
Sbjct: 77  EDLDEYLAHLKEELNQVEAESAKISNEIEDLSRNHIEESNILEGNLEGLKYAL-----DS 131

Query: 580 NVGSGTEKNAQIKELENEMEQQKSEESKYDDISRFEICLLSGENFMFEALELEDQLIRKR 401
               G E   +   L++ M  +      +           S E   FE +ELE Q+ +  
Sbjct: 132 IASQGMEGVEEDPCLDSSMNDEDQSNLMH-----------SNEEQKFEIMELESQIEKNN 180

Query: 400 QKYGELRMLDDMSKRVDALSMIEESLSDIKEIGLLENCIKLTLKTFIPAKGGSICQFEIN 221
                L+ LD M KR+D L  IE++L+ +K IG   NCI+L+L+T+IP   G +CQ  I 
Sbjct: 181 IILKSLQDLDSMFKRLDTLEQIEDALTGLKVIGFDGNCIRLSLQTYIPKLEGLLCQKTIE 240

Query: 220 NIGESFMVEHVLKVKVEKDSTTFLDAEL 137
           +I E   + H L V++   +    + E+
Sbjct: 241 DISEPSEMNHELLVEIVDGTMEIKNVEM 268


>gb|EOY05197.1| Uncharacterized protein isoform 5 [Theobroma cacao]
          Length = 392

 Score = 92.8 bits (229), Expect = 2e-16
 Identities = 65/208 (31%), Positives = 102/208 (49%)
 Frame = -1

Query: 760 DDFDMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKDTNNLTGDIELLTTYIEFWQNDD 581
           +D D  +  +K EL  VE E AK  NEIE+L+    +++N L G++E L   +     D 
Sbjct: 77  EDLDEYLAHLKEELNQVEAESAKISNEIEDLSRNHIEESNILEGNLEGLKYAL-----DS 131

Query: 580 NVGSGTEKNAQIKELENEMEQQKSEESKYDDISRFEICLLSGENFMFEALELEDQLIRKR 401
               G E   +   L++ M  +      +           S E   FE +ELE Q+ +  
Sbjct: 132 IASQGMEGVEEDPCLDSSMNDEDQSNLMH-----------SNEEQKFEIMELESQIEKNN 180

Query: 400 QKYGELRMLDDMSKRVDALSMIEESLSDIKEIGLLENCIKLTLKTFIPAKGGSICQFEIN 221
                L+ LD M KR+D L  IE++L+ +K IG   NCI+L+L+T+IP   G +CQ  I 
Sbjct: 181 IILKSLQDLDSMFKRLDTLEQIEDALTGLKVIGFDGNCIRLSLQTYIPKLEGLLCQKTIE 240

Query: 220 NIGESFMVEHVLKVKVEKDSTTFLDAEL 137
           +I E   + H L V++   +    + E+
Sbjct: 241 DISEPSEMNHELLVEIVDGTMEIKNVEM 268


>gb|EOY05196.1| Uncharacterized protein isoform 4, partial [Theobroma cacao]
          Length = 372

 Score = 92.8 bits (229), Expect = 2e-16
 Identities = 65/208 (31%), Positives = 102/208 (49%)
 Frame = -1

Query: 760 DDFDMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKDTNNLTGDIELLTTYIEFWQNDD 581
           +D D  +  +K EL  VE E AK  NEIE+L+    +++N L G++E L   +     D 
Sbjct: 19  EDLDEYLAHLKEELNQVEAESAKISNEIEDLSRNHIEESNILEGNLEGLKYAL-----DS 73

Query: 580 NVGSGTEKNAQIKELENEMEQQKSEESKYDDISRFEICLLSGENFMFEALELEDQLIRKR 401
               G E   +   L++ M  +      +           S E   FE +ELE Q+ +  
Sbjct: 74  IASQGMEGVEEDPCLDSSMNDEDQSNLMH-----------SNEEQKFEIMELESQIEKNN 122

Query: 400 QKYGELRMLDDMSKRVDALSMIEESLSDIKEIGLLENCIKLTLKTFIPAKGGSICQFEIN 221
                L+ LD M KR+D L  IE++L+ +K IG   NCI+L+L+T+IP   G +CQ  I 
Sbjct: 123 IILKSLQDLDSMFKRLDTLEQIEDALTGLKVIGFDGNCIRLSLQTYIPKLEGLLCQKTIE 182

Query: 220 NIGESFMVEHVLKVKVEKDSTTFLDAEL 137
           +I E   + H L V++   +    + E+
Sbjct: 183 DISEPSEMNHELLVEIVDGTMEIKNVEM 210


>gb|EOY05194.1| Uncharacterized protein isoform 2 [Theobroma cacao]
           gi|508713298|gb|EOY05195.1| Uncharacterized protein
           isoform 2 [Theobroma cacao]
          Length = 369

 Score = 92.8 bits (229), Expect = 2e-16
 Identities = 65/208 (31%), Positives = 102/208 (49%)
 Frame = -1

Query: 760 DDFDMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKDTNNLTGDIELLTTYIEFWQNDD 581
           +D D  +  +K EL  VE E AK  NEIE+L+    +++N L G++E L   +     D 
Sbjct: 77  EDLDEYLAHLKEELNQVEAESAKISNEIEDLSRNHIEESNILEGNLEGLKYAL-----DS 131

Query: 580 NVGSGTEKNAQIKELENEMEQQKSEESKYDDISRFEICLLSGENFMFEALELEDQLIRKR 401
               G E   +   L++ M  +      +           S E   FE +ELE Q+ +  
Sbjct: 132 IASQGMEGVEEDPCLDSSMNDEDQSNLMH-----------SNEEQKFEIMELESQIEKNN 180

Query: 400 QKYGELRMLDDMSKRVDALSMIEESLSDIKEIGLLENCIKLTLKTFIPAKGGSICQFEIN 221
                L+ LD M KR+D L  IE++L+ +K IG   NCI+L+L+T+IP   G +CQ  I 
Sbjct: 181 IILKSLQDLDSMFKRLDTLEQIEDALTGLKVIGFDGNCIRLSLQTYIPKLEGLLCQKTIE 240

Query: 220 NIGESFMVEHVLKVKVEKDSTTFLDAEL 137
           +I E   + H L V++   +    + E+
Sbjct: 241 DISEPSEMNHELLVEIVDGTMEIKNVEM 268


>gb|EOY05193.1| Uncharacterized protein isoform 1 [Theobroma cacao]
          Length = 430

 Score = 92.8 bits (229), Expect = 2e-16
 Identities = 65/208 (31%), Positives = 102/208 (49%)
 Frame = -1

Query: 760 DDFDMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKDTNNLTGDIELLTTYIEFWQNDD 581
           +D D  +  +K EL  VE E AK  NEIE+L+    +++N L G++E L   +     D 
Sbjct: 77  EDLDEYLAHLKEELNQVEAESAKISNEIEDLSRNHIEESNILEGNLEGLKYAL-----DS 131

Query: 580 NVGSGTEKNAQIKELENEMEQQKSEESKYDDISRFEICLLSGENFMFEALELEDQLIRKR 401
               G E   +   L++ M  +      +           S E   FE +ELE Q+ +  
Sbjct: 132 IASQGMEGVEEDPCLDSSMNDEDQSNLMH-----------SNEEQKFEIMELESQIEKNN 180

Query: 400 QKYGELRMLDDMSKRVDALSMIEESLSDIKEIGLLENCIKLTLKTFIPAKGGSICQFEIN 221
                L+ LD M KR+D L  IE++L+ +K IG   NCI+L+L+T+IP   G +CQ  I 
Sbjct: 181 IILKSLQDLDSMFKRLDTLEQIEDALTGLKVIGFDGNCIRLSLQTYIPKLEGLLCQKTIE 240

Query: 220 NIGESFMVEHVLKVKVEKDSTTFLDAEL 137
           +I E   + H L V++   +    + E+
Sbjct: 241 DISEPSEMNHELLVEIVDGTMEIKNVEM 268


>ref|XP_002518043.1| conserved hypothetical protein [Ricinus communis]
           gi|223542639|gb|EEF44176.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 415

 Score = 85.1 bits (209), Expect = 5e-14
 Identities = 59/208 (28%), Positives = 98/208 (47%)
 Frame = -1

Query: 760 DDFDMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKDTNNLTGDIELLTTYIEFWQNDD 581
           +D D  ++ +K EL+    E AK   EIE L     +D   L  DIE+L   ++F  +  
Sbjct: 66  EDLDAFVEHLKEELSTTMSETAKISTEIEALNRNHMEDFTRLESDIEMLKCSLDFISS-- 123

Query: 580 NVGSGTEKNAQIKELENEMEQQKSEESKYDDISRFEICLLSGENFMFEALELEDQLIRKR 401
                       K++E E E    E+    D  R         ++ FE  +L+DQ+ + +
Sbjct: 124 ------------KDVEKEKEVACREDLYSTDAHR---------DYEFEISKLDDQIAKSK 162

Query: 400 QKYGELRMLDDMSKRVDALSMIEESLSDIKEIGLLENCIKLTLKTFIPAKGGSICQFEIN 221
                L+  D + KRVDA+  IEE+LS +K I    +CI+L+L+T++P     +CQ +  
Sbjct: 163 MILKSLQDFDSVFKRVDAVEQIEEALSGLKVIEFDGSCIRLSLRTYLPKLDDVMCQHKTE 222

Query: 220 NIGESFMVEHVLKVKVEKDSTTFLDAEL 137
           +  E   V H L ++V   +    + E+
Sbjct: 223 DTAEPSEVNHELLIEVVSGTMELKNVEI 250


>ref|NP_189033.1| uncharacterized protein [Arabidopsis thaliana]
           gi|1742965|emb|CAA70756.1| HAPp48,5 protein [Arabidopsis
           thaliana] gi|9294659|dbj|BAB03008.1| HAPp48,5 protein
           [Arabidopsis thaliana] gi|20259510|gb|AAM13875.1|
           putative HAPp48,5 protein [Arabidopsis thaliana]
           gi|21436469|gb|AAM51435.1| putative HAPp48,5 protein
           [Arabidopsis thaliana] gi|332643310|gb|AEE76831.1|
           uncharacterized protein AT3G23910 [Arabidopsis thaliana]
          Length = 421

 Score = 81.3 bits (199), Expect = 7e-13
 Identities = 61/202 (30%), Positives = 100/202 (49%)
 Frame = -1

Query: 760 DDFDMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKDTNNLTGDIELLTTYIEFWQNDD 581
           +D D  ++ ++NEL  VE E AK   EIE L+ + A+D++ L  D+E L   ++   + D
Sbjct: 73  EDSDAYLEYLRNELQSVEAESAKVSEEIERLSQSHAQDSSRLQRDLEGLLLSLDSMSSQD 132

Query: 580 NVGSGTEKNAQIKELENEMEQQKSEESKYDDISRFEICLLSGENFMFEALELEDQLIRKR 401
                         +E   E Q S  S        E+C +  ++  F+  ELE+Q+  KR
Sbjct: 133 --------------VEKSKENQPSSSS-------MEVCEVIDDD-KFKMFELENQMEEKR 170

Query: 400 QKYGELRMLDDMSKRVDALSMIEESLSDIKEIGLLENCIKLTLKTFIPAKGGSICQFEIN 221
                L  LD + KR DA   +E++L+ +K +    N I+L L+T+I    G + Q + +
Sbjct: 171 MILKSLEDLDSLRKRFDAAEQVEDALTGLKVLEFDGNFIRLQLRTYIQKLDGFLGQHKFD 230

Query: 220 NIGESFMVEHVLKVKVEKDSTT 155
           +I E   + H L + + KD TT
Sbjct: 231 HITEPSELIHELLIYL-KDKTT 251


>ref|NP_189068.2| RNA-directed DNA polymerase (reverse transcriptase)-related protein
           [Arabidopsis thaliana] gi|332643359|gb|AEE76880.1|
           RNA-directed DNA polymerase (reverse
           transcriptase)-related protein [Arabidopsis thaliana]
          Length = 746

 Score = 80.1 bits (196), Expect = 2e-12
 Identities = 63/208 (30%), Positives = 102/208 (49%)
 Frame = -1

Query: 778 SDDNVNDDFDMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKDTNNLTGDIELLTTYIE 599
           SD N+ D +   ++ ++NEL  VE E AK   EIE L+ + A D++ L  D+E L   ++
Sbjct: 395 SDGNLTDAY---LEYLRNELQSVEAESAKVSEEIERLSQSHALDSSRLQRDLEGLLLSLD 451

Query: 598 FWQNDDNVGSGTEKNAQIKELENEMEQQKSEESKYDDISRFEICLLSGENFMFEALELED 419
              + D              +E   E Q S  S        E+C +  ++  F+  ELE+
Sbjct: 452 SMSSQD--------------VEKSKENQPSSSS-------MEVCEVIDDD-KFKMFELEN 489

Query: 418 QLIRKRQKYGELRMLDDMSKRVDALSMIEESLSDIKEIGLLENCIKLTLKTFIPAKGGSI 239
           Q+  KR     L  LD + KR DA   +E++L+ +K +    N I+L L+T+I    G +
Sbjct: 490 QMEEKRMILKSLEDLDSLRKRFDAAEQVEDALTGLKVLEFDGNFIRLQLRTYIQKLDGFL 549

Query: 238 CQFEINNIGESFMVEHVLKVKVEKDSTT 155
            Q + ++I E   + H L + + KD TT
Sbjct: 550 GQHKFDHITEPSELIHELLIYL-KDKTT 576


>dbj|BAB02924.1| unnamed protein product [Arabidopsis thaliana]
          Length = 421

 Score = 79.3 bits (194), Expect = 3e-12
 Identities = 61/202 (30%), Positives = 98/202 (48%)
 Frame = -1

Query: 760 DDFDMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKDTNNLTGDIELLTTYIEFWQNDD 581
           D  D  ++ ++NEL  VE E AK   EIE L+ + A D++ L  D+E L   ++   + D
Sbjct: 73  DQTDAYLEYLRNELQSVEAESAKVSEEIERLSQSHALDSSRLQRDLEGLLLSLDSMSSQD 132

Query: 580 NVGSGTEKNAQIKELENEMEQQKSEESKYDDISRFEICLLSGENFMFEALELEDQLIRKR 401
                         +E   E Q S  S        E+C +  ++  F+  ELE+Q+  KR
Sbjct: 133 --------------VEKSKENQPSSSS-------MEVCEVIDDD-KFKMFELENQMEEKR 170

Query: 400 QKYGELRMLDDMSKRVDALSMIEESLSDIKEIGLLENCIKLTLKTFIPAKGGSICQFEIN 221
                L  LD + KR DA   +E++L+ +K +    N I+L L+T+I    G + Q + +
Sbjct: 171 MILKSLEDLDSLRKRFDAAEQVEDALTGLKVLEFDGNFIRLQLRTYIQKLDGFLGQHKFD 230

Query: 220 NIGESFMVEHVLKVKVEKDSTT 155
           +I E   + H L + + KD TT
Sbjct: 231 HITEPSELIHELLIYL-KDKTT 251


>ref|NP_001154643.1| RNA-directed DNA polymerase (reverse transcriptase)-related protein
           [Arabidopsis thaliana] gi|332643360|gb|AEE76881.1|
           RNA-directed DNA polymerase (reverse
           transcriptase)-related protein [Arabidopsis thaliana]
          Length = 428

 Score = 79.0 bits (193), Expect = 3e-12
 Identities = 68/243 (27%), Positives = 113/243 (46%)
 Frame = -1

Query: 883 EATSVSNFDTFNDSHLLRLQETLADMPLEYFKFSGSDDNVNDDFDMEIQSVKNELAIVER 704
           E   V +F    +  +  + E   D+ L     +  D N+ D +   ++ ++NEL  VE 
Sbjct: 42  ETLVVQDFVLQFEPKVKEIVEDYGDVDLLDVDHTLVDGNLTDAY---LEYLRNELQSVEA 98

Query: 703 EMAKSVNEIENLAAAIAKDTNNLTGDIELLTTYIEFWQNDDNVGSGTEKNAQIKELENEM 524
           E AK   EIE L+ + A D++ L  D+E L   ++   + D              +E   
Sbjct: 99  ESAKVSEEIERLSQSHALDSSRLQRDLEGLLLSLDSMSSQD--------------VEKSK 144

Query: 523 EQQKSEESKYDDISRFEICLLSGENFMFEALELEDQLIRKRQKYGELRMLDDMSKRVDAL 344
           E Q S  S        E+C +  ++  F+  ELE+Q+  KR     L  LD + KR DA 
Sbjct: 145 ENQPSSSS-------MEVCEVIDDD-KFKMFELENQMEEKRMILKSLEDLDSLRKRFDAA 196

Query: 343 SMIEESLSDIKEIGLLENCIKLTLKTFIPAKGGSICQFEINNIGESFMVEHVLKVKVEKD 164
             +E++L+ +K +    N I+L L+T+I    G + Q + ++I E   + H L + + KD
Sbjct: 197 EQVEDALTGLKVLEFDGNFIRLQLRTYIQKLDGFLGQHKFDHITEPSELIHELLIYL-KD 255

Query: 163 STT 155
            TT
Sbjct: 256 KTT 258


>ref|XP_004133985.1| PREDICTED: uncharacterized protein LOC101211137 [Cucumis sativus]
           gi|449527675|ref|XP_004170835.1| PREDICTED:
           uncharacterized protein LOC101229419 [Cucumis sativus]
          Length = 414

 Score = 77.8 bits (190), Expect = 8e-12
 Identities = 80/317 (25%), Positives = 135/317 (42%), Gaps = 1/317 (0%)
 Frame = -1

Query: 952 LDLGNRLHTKLVKLQDDLHGLEEEATSVSNFDTFNDSHLLRLQETLADMPLEYFKFSGSD 773
           LDL   + ++L +LQ  L   EE  T     +       L L+  +  +  EY   S  D
Sbjct: 16  LDL-QAVRSELEELQRSLEENEESTTDSLGSEKLLRECALHLESRIQQVLSEY---SNVD 71

Query: 772 DNVN-DDFDMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKDTNNLTGDIELLTTYIEF 596
             +  DD D  ++ +K EL  VE E +K  NEIE L     +D+N L  D+E+L   ++ 
Sbjct: 72  SFLGIDDLDAYVEHMKEELVAVEAESSKISNEIEVLKRTNIEDSNKLKMDLEVLKLSLDR 131

Query: 595 WQNDDNVGSGTEKNAQIKELENEMEQQKSEESKYDDISRFEICLLSGENFMFEALELEDQ 416
           + + D            +E          E+     ++R        E   FE LELE Q
Sbjct: 132 FPSQDP-----------EEATFNCSSMNGEDPMNVIVNR--------ECNAFEVLELESQ 172

Query: 415 LIRKRQKYGELRMLDDMSKRVDALSMIEESLSDIKEIGLLENCIKLTLKTFIPAKGGSIC 236
           + + ++    L+ +D++ K +D +  +E ++  +K I + +N I+L+L T IP       
Sbjct: 173 IEKNKKILKSLQEVDEIFKSLDVIEQVEGTIGGMKVIDVADNSIRLSLHTHIPNVEDFST 232

Query: 235 QFEINNIGESFMVEHVLKVKVEKDSTTFLDAELSXXXXXXXXXIHMSKYSSDASDKNISL 56
              +  + E   ++H L ++V   +    +AE+          I+ SK  S         
Sbjct: 233 LQRLEGLIEKSELDHELIIEVLDGTMELKNAEIFPADVHLHDIINASKSIS--------- 283

Query: 55  NVNGQLNSLVREIQYRI 5
             N  L   VR++Q RI
Sbjct: 284 --NSSLEWFVRKVQDRI 298


>ref|XP_002885604.1| hypothetical protein ARALYDRAFT_342541 [Arabidopsis lyrata subsp.
           lyrata] gi|297331444|gb|EFH61863.1| hypothetical protein
           ARALYDRAFT_342541 [Arabidopsis lyrata subsp. lyrata]
          Length = 421

 Score = 77.4 bits (189), Expect = 1e-11
 Identities = 59/202 (29%), Positives = 98/202 (48%)
 Frame = -1

Query: 760 DDFDMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKDTNNLTGDIELLTTYIEFWQNDD 581
           +D D  ++ ++ EL  VE E AK   EIE L+ + A+D++ L  D+E L   ++   + D
Sbjct: 72  EDSDAYLEYLRKELQSVEAESAKVSEEIERLSKSHAQDSSRLERDLEGLLLSLDSMSSQD 131

Query: 580 NVGSGTEKNAQIKELENEMEQQKSEESKYDDISRFEICLLSGENFMFEALELEDQLIRKR 401
                         +E   E Q S  S        E+C ++ ++  F+  ELE+Q+  KR
Sbjct: 132 --------------VEKSKENQPSSSS-------MEVCEVNDDD-KFKMFELENQMEEKR 169

Query: 400 QKYGELRMLDDMSKRVDALSMIEESLSDIKEIGLLENCIKLTLKTFIPAKGGSICQFEIN 221
                L  LD + KR DA   +E++L+ +K +    N I+L L+T+IP     + Q +  
Sbjct: 170 SILKSLEDLDSLRKRFDAAEQVEDALTGLKVLEFDGNFIRLQLQTYIPKLDSLLGQQKFE 229

Query: 220 NIGESFMVEHVLKVKVEKDSTT 155
           +  E   + H L + + KD TT
Sbjct: 230 HTTEPSELIHELLIYL-KDKTT 250


>ref|XP_006418827.1| hypothetical protein EUTSA_v10002763mg, partial [Eutrema
           salsugineum] gi|557096755|gb|ESQ37263.1| hypothetical
           protein EUTSA_v10002763mg, partial [Eutrema salsugineum]
          Length = 355

 Score = 77.0 bits (188), Expect = 1e-11
 Identities = 62/249 (24%), Positives = 117/249 (46%)
 Frame = -1

Query: 751 DMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKDTNNLTGDIELLTTYIEFWQNDDNVG 572
           D  ++ ++ EL  VE E AK   EIE L+++ A+D++ L  D+E L   ++F  + +   
Sbjct: 5   DAYLEYLRKELHSVEAESAKVSEEIERLSSSHAEDSSRLDRDLEGLLLSLDFLSSQE--- 61

Query: 571 SGTEKNAQIKELENEMEQQKSEESKYDDISRFEICLLSGENFMFEALELEDQLIRKRQKY 392
              +K+ +     + ME  + + S + D++         ++  F+  ELE+Q+  KR+  
Sbjct: 62  --VQKSKENPPSTSSME--RCDASTWIDVN---------DDEKFKMFELENQIEEKRRIL 108

Query: 391 GELRMLDDMSKRVDALSMIEESLSDIKEIGLLENCIKLTLKTFIPAKGGSICQFEINNIG 212
             L  LD + KR DA   +E++L+ +K +    N I+L L+T+IP   G + Q ++ +  
Sbjct: 109 KSLENLDSVCKRFDAAEQVEDALTGLKVLEFDGNFIRLQLRTYIPKLDGLLGQHKLLHNT 168

Query: 211 ESFMVEHVLKVKVEKDSTTFLDAELSXXXXXXXXXIHMSKYSSDASDKNISLNVNGQLNS 32
           E   + H L + ++  +T     E+             +         +  L+    L  
Sbjct: 169 EPSELIHELLIDLKDKTTEITKVEMLPNDVYIGDITDAADSFRQIRLHSALLDTRSSLQW 228

Query: 31  LVREIQYRI 5
           LV ++Q RI
Sbjct: 229 LVAKVQERI 237


>ref|XP_006297761.1| hypothetical protein CARUB_v10013795mg [Capsella rubella]
           gi|482566470|gb|EOA30659.1| hypothetical protein
           CARUB_v10013795mg [Capsella rubella]
          Length = 420

 Score = 76.3 bits (186), Expect = 2e-11
 Identities = 59/202 (29%), Positives = 98/202 (48%)
 Frame = -1

Query: 760 DDFDMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKDTNNLTGDIELLTTYIEFWQNDD 581
           +D D  ++ ++ EL  VE E AK   EIE L+ + A+D++ L  D+E L   ++   + D
Sbjct: 72  EDSDAYLEYLRKELHSVEAESAKVSEEIERLSRSHAEDSSRLERDLEGLLLSLDSMSSQD 131

Query: 580 NVGSGTEKNAQIKELENEMEQQKSEESKYDDISRFEICLLSGENFMFEALELEDQLIRKR 401
                                 KS+ES     S  E+C ++ ++  F+  ELE+Q+  KR
Sbjct: 132 --------------------VNKSKESP-PSCSSMEVCEVNDDD-KFKMFELENQMEEKR 169

Query: 400 QKYGELRMLDDMSKRVDALSMIEESLSDIKEIGLLENCIKLTLKTFIPAKGGSICQFEIN 221
                L  LD + KR DA   +E++L+ +K +    N I+L L+T+IP   G   Q +  
Sbjct: 170 MILKSLEDLDSLRKRFDAAEQVEDALTGLKVLEFDGNFIRLQLRTYIPELDGLPAQHKFE 229

Query: 220 NIGESFMVEHVLKVKVEKDSTT 155
           +  +   + H L + + KD TT
Sbjct: 230 HTTKPSELIHELLIYL-KDKTT 250


>gb|EXB89377.1| hypothetical protein L484_017343 [Morus notabilis]
          Length = 550

 Score = 74.7 bits (182), Expect = 6e-11
 Identities = 66/262 (25%), Positives = 126/262 (48%), Gaps = 2/262 (0%)
 Frame = -1

Query: 952 LDLGNRLHTKLVKLQDDLHGLEEEATSV--SNFDTFNDSHLLRLQETLADMPLEYFKFSG 779
           LDL + + ++  +L++ L  LE+  + +  S+ +       L+ Q  + ++  E+   S 
Sbjct: 150 LDL-DTIRSRAKELEEMLSSLEDNDSELFHSDLEKLVKDCALKFQSRMEEIGSEWSDVSF 208

Query: 778 SDDNVNDDFDMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKDTNNLTGDIELLTTYIE 599
            +D    DFD  ++ +  EL +VE E ++   EIE L    A+D+N L  ++E L + ++
Sbjct: 209 LEDK---DFDACLEHLGEELNLVEAENSRMSEEIEILTRTYAEDSNQLEIELEGLKSAMD 265

Query: 598 FWQNDDNVGSGTEKNAQIKELENEMEQQKSEESKYDDISRFEICLLSGENFMFEALELED 419
                D       +NA++   +   +  ++ E K              ++ +   LELE+
Sbjct: 266 LTALQDL------ENAKLGACD---DYPRNTEDK--------------QHLVLHLLELEN 302

Query: 418 QLIRKRQKYGELRMLDDMSKRVDALSMIEESLSDIKEIGLLENCIKLTLKTFIPAKGGSI 239
           ++ +K      L  LD + K  DA+  IE+ L+ +K I L ENCI+ +L+T+IP     +
Sbjct: 303 EIKKKNIILKSLEDLDGICKWFDAIEQIEDILTSVKVIALEENCIRFSLQTYIPNLESIL 362

Query: 238 CQFEINNIGESFMVEHVLKVKV 173
            Q  I  +   F V+  L +++
Sbjct: 363 SQQTIEAVNVPFEVKLELLIEL 384


>ref|XP_002300157.1| hypothetical protein POPTR_0001s32530g [Populus trichocarpa]
           gi|222847415|gb|EEE84962.1| hypothetical protein
           POPTR_0001s32530g [Populus trichocarpa]
          Length = 429

 Score = 72.4 bits (176), Expect = 3e-10
 Identities = 63/282 (22%), Positives = 129/282 (45%)
 Frame = -1

Query: 982 EMAHSNAGSKLDLGNRLHTKLVKLQDDLHGLEEEATSVSNFDTFNDSHLLRLQETLADMP 803
           E++ S     L+L N + +++ +L++       ++ S  N    ++      Q+ ++ + 
Sbjct: 2   EISPSTTQESLNL-NTIRSRINELEEIYRDCNADSFSEINSSDSDELMKDSAQQLVSKVS 60

Query: 802 LEYFKFSGSDDNVNDDFDMEIQSVKNELAIVEREMAKSVNEIENLAAAIAKDTNNLTGDI 623
               ++S       +D D  +  +K EL   E E AK  NEIE L     +D++ L  D+
Sbjct: 61  QTVTEYSDFSFLGIEDLDAYLAHLKEELDAAEAESAKISNEIELLNRTCMEDSSELENDL 120

Query: 622 ELLTTYIEFWQNDDNVGSGTEKNAQIKELENEMEQQKSEESKYDDISRFEICLLSGENFM 443
           E +   ++           ++++ + ++ + +ME   S E++ + I+       + E   
Sbjct: 121 EWMKCSLDL--------ISSQRDREKEKGDEQMEHFSSGENQSNLIN-------TNEENK 165

Query: 442 FEALELEDQLIRKRQKYGELRMLDDMSKRVDALSMIEESLSDIKEIGLLENCIKLTLKTF 263
           FE L+L++Q+    +    ++ LD + K  DA+  IE+ LS +K I     CI+L+L+T+
Sbjct: 166 FEILKLDNQIEESTRILKSMQDLDSVCKWYDAIEQIEDVLSGLKVIEFDGTCIRLSLRTY 225

Query: 262 IPAKGGSICQFEINNIGESFMVEHVLKVKVEKDSTTFLDAEL 137
           IP +     Q +I      + + H   ++V   S      E+
Sbjct: 226 IPKQDVLFLQ-KIEETNVPYEINHEFLIEVTNGSMEIKKVEM 266

