BLASTX nr result

ID: Catharanthus23_contig00021278 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00021278
         (1008 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002269161.1| PREDICTED: uncharacterized protein LOC100264...   255   2e-65
ref|XP_004246748.1| PREDICTED: uncharacterized protein LOC101247...   236   8e-60
ref|XP_004246747.1| PREDICTED: uncharacterized protein LOC101247...   236   8e-60
ref|XP_006366948.1| PREDICTED: uncharacterized protein LOC102589...   231   3e-58
ref|XP_002269962.2| PREDICTED: uncharacterized protein LOC100251...   227   5e-57
gb|EOY26199.1| HAT transposon superfamily protein, putative [The...   225   2e-56
ref|XP_004246933.1| PREDICTED: uncharacterized protein LOC101250...   221   5e-55
ref|XP_004246932.1| PREDICTED: uncharacterized protein LOC101250...   215   2e-53
ref|XP_002310902.1| predicted protein [Populus trichocarpa]           213   1e-52
ref|XP_002530377.1| protein dimerization, putative [Ricinus comm...   211   5e-52
ref|XP_006345717.1| PREDICTED: uncharacterized protein LOC102580...   201   3e-49
ref|XP_006299218.1| hypothetical protein CARUB_v10015366mg [Caps...   162   1e-37
ref|XP_006486394.1| PREDICTED: uncharacterized protein LOC102626...   155   2e-35
ref|XP_002311919.1| predicted protein [Populus trichocarpa]           153   9e-35
ref|XP_002512206.1| DNA binding protein, putative [Ricinus commu...   152   2e-34
ref|NP_001154234.1| hAT transposon superfamily [Arabidopsis thal...   152   2e-34
gb|AAM98154.1| putative protein [Arabidopsis thaliana]                152   2e-34
ref|XP_002521049.1| DNA binding protein, putative [Ricinus commu...   151   4e-34
ref|NP_187908.1| hAT transposon superfamily protein [Arabidopsis...   150   8e-34
ref|XP_002323178.1| predicted protein [Populus trichocarpa]           150   8e-34

>ref|XP_002269161.1| PREDICTED: uncharacterized protein LOC100264734 [Vitis vinifera]
          Length = 714

 Score =  255 bits (651), Expect = 2e-65
 Identities = 118/218 (54%), Positives = 161/218 (73%), Gaps = 2/218 (0%)
 Frame = -2

Query: 650 KSNEGLSPREFHKCIGRFLYETGTEFEAIQSDSFQLML--GLKCGQTAYSIPSYDDLRGW 477
           K  E  S R+  KCIGRF YE GT+  A  S SFQ M+   L CGQ  Y +PS  +L+GW
Sbjct: 150 KEREDSSSRQAKKCIGRFFYELGTDLSAATSPSFQRMITAALGCGQIGYKLPSCQELKGW 209

Query: 476 ILRDLLKEMQLYVNNIKSSWESTGCSMLLDGWEDSKGQNLVNVLVTCPKGTVYIRSADIA 297
           IL++ +KEMQ YV ++++SW +TGCS+LLDGW D KG+NL+NVL  CPKGT+YIRS DI+
Sbjct: 210 ILKEEVKEMQQYVKDVRNSWANTGCSILLDGWMDEKGRNLINVLADCPKGTIYIRSCDIS 269

Query: 296 NFHPDSDTVEVFFEEVLADVGVKNVIQILSHSASPTMEAVGKQLMDKYKTFFWTVNGPYC 117
            F  D D ++ F E+++ +VGV+NV+QI+++S S  M A G++LM+K++T FWTV+  YC
Sbjct: 270 AFIADVDALQFFIEQIIEEVGVENVVQIITYSISDCMAAAGQRLMEKFRTVFWTVSASYC 329

Query: 116 IELVLEKLGMSGFIREIFKKAKTITRFVHGHASVLRLL 3
           IEL+LEK+GM   IR I  KAK IT+F+H HA+VL+L+
Sbjct: 330 IELMLEKIGMMDPIRGILDKAKAITKFIHSHATVLKLM 367


>ref|XP_004246748.1| PREDICTED: uncharacterized protein LOC101247551 isoform 2 [Solanum
           lycopersicum]
          Length = 682

 Score =  236 bits (603), Expect = 8e-60
 Identities = 124/255 (48%), Positives = 172/255 (67%), Gaps = 15/255 (5%)
 Frame = -2

Query: 722 YPVLPSRRN-CSPDRNTTKSHEMNLKSNEGL------------SPREFHKCIGRFLYETG 582
           +P LP +RN C  D    K+ E   K + G+            S +E  K IGRF YE G
Sbjct: 83  HPSLPLKRNWCPRDGEPNKTSESVNKKHNGVNSNVAGTSVVDSSSQEISKSIGRFFYEAG 142

Query: 581 TEFEAIQSDSFQLMLG--LKCGQTAYSIPSYDDLRGWILRDLLKEMQLYVNNIKSSWEST 408
            +F+AI+  SFQ ML   L  G+T    PS  +L+GWIL+D +KEMQ YV  I+ SW ST
Sbjct: 143 IDFDAIRLPSFQRMLKATLSPGKTI-KFPSCQELKGWILQDAVKEMQQYVTEIRKSWAST 201

Query: 407 GCSMLLDGWEDSKGQNLVNVLVTCPKGTVYIRSADIANFHPDSDTVEVFFEEVLADVGVK 228
           GCS+LLDGW DSKG+NL+N+LV CP+GT+Y+RS+DI++F+ + D + VFFEEVL +VGV+
Sbjct: 202 GCSILLDGWIDSKGRNLINILVYCPRGTIYLRSSDISSFNGNVDAMLVFFEEVLEEVGVE 261

Query: 227 NVIQILSHSASPTMEAVGKQLMDKYKTFFWTVNGPYCIELVLEKLGMSGFIREIFKKAKT 48
            V+QI+ +S S  M   GK+LM+K KT FWTV+  +C+EL+L+K      I+E  +KAKT
Sbjct: 262 TVVQIVGYSTSACMMEAGKRLMEKCKTVFWTVDVSHCMELMLQKFTKMNPIQEALEKAKT 321

Query: 47  ITRFVHGHASVLRLL 3
           +T+F++ HA+ L+LL
Sbjct: 322 LTQFIYNHATALKLL 336


>ref|XP_004246747.1| PREDICTED: uncharacterized protein LOC101247551 isoform 1 [Solanum
           lycopersicum]
          Length = 692

 Score =  236 bits (603), Expect = 8e-60
 Identities = 124/255 (48%), Positives = 172/255 (67%), Gaps = 15/255 (5%)
 Frame = -2

Query: 722 YPVLPSRRN-CSPDRNTTKSHEMNLKSNEGL------------SPREFHKCIGRFLYETG 582
           +P LP +RN C  D    K+ E   K + G+            S +E  K IGRF YE G
Sbjct: 93  HPSLPLKRNWCPRDGEPNKTSESVNKKHNGVNSNVAGTSVVDSSSQEISKSIGRFFYEAG 152

Query: 581 TEFEAIQSDSFQLMLG--LKCGQTAYSIPSYDDLRGWILRDLLKEMQLYVNNIKSSWEST 408
            +F+AI+  SFQ ML   L  G+T    PS  +L+GWIL+D +KEMQ YV  I+ SW ST
Sbjct: 153 IDFDAIRLPSFQRMLKATLSPGKTI-KFPSCQELKGWILQDAVKEMQQYVTEIRKSWAST 211

Query: 407 GCSMLLDGWEDSKGQNLVNVLVTCPKGTVYIRSADIANFHPDSDTVEVFFEEVLADVGVK 228
           GCS+LLDGW DSKG+NL+N+LV CP+GT+Y+RS+DI++F+ + D + VFFEEVL +VGV+
Sbjct: 212 GCSILLDGWIDSKGRNLINILVYCPRGTIYLRSSDISSFNGNVDAMLVFFEEVLEEVGVE 271

Query: 227 NVIQILSHSASPTMEAVGKQLMDKYKTFFWTVNGPYCIELVLEKLGMSGFIREIFKKAKT 48
            V+QI+ +S S  M   GK+LM+K KT FWTV+  +C+EL+L+K      I+E  +KAKT
Sbjct: 272 TVVQIVGYSTSACMMEAGKRLMEKCKTVFWTVDVSHCMELMLQKFTKMNPIQEALEKAKT 331

Query: 47  ITRFVHGHASVLRLL 3
           +T+F++ HA+ L+LL
Sbjct: 332 LTQFIYNHATALKLL 346


>ref|XP_006366948.1| PREDICTED: uncharacterized protein LOC102589543 isoform X1 [Solanum
           tuberosum] gi|565402986|ref|XP_006366949.1| PREDICTED:
           uncharacterized protein LOC102589543 isoform X2 [Solanum
           tuberosum] gi|565402988|ref|XP_006366950.1| PREDICTED:
           uncharacterized protein LOC102589543 isoform X3 [Solanum
           tuberosum]
          Length = 686

 Score =  231 bits (589), Expect = 3e-58
 Identities = 121/255 (47%), Positives = 172/255 (67%), Gaps = 15/255 (5%)
 Frame = -2

Query: 722 YPVLPSRRN-CSPDRNTTKSHEMNLKSNEGL------------SPREFHKCIGRFLYETG 582
           +P LP +RN C  D    K+ E   K + G+            S +E  K IGRF YE G
Sbjct: 83  HPNLPLKRNWCPRDGEPNKTSESVNKKHNGVNSKVAGTSVVDSSSQEISKSIGRFFYEAG 142

Query: 581 TEFEAIQSDSFQLMLG--LKCGQTAYSIPSYDDLRGWILRDLLKEMQLYVNNIKSSWEST 408
            + +AI+  SFQ M+   L  G+T    PS  +LRGWIL+D +KEMQ YV  I++SW ST
Sbjct: 143 IDLDAIRLPSFQRMVKATLSPGKTV-KFPSCQELRGWILQDAVKEMQQYVMEIRNSWAST 201

Query: 407 GCSMLLDGWEDSKGQNLVNVLVTCPKGTVYIRSADIANFHPDSDTVEVFFEEVLADVGVK 228
           GCS+LLDGW DS G+NL+N+LV CP+GT+Y+RS+DI++F+ + D + +FFEEVL +VGV+
Sbjct: 202 GCSILLDGWIDSNGRNLINILVYCPRGTIYLRSSDISSFNGNVDAMLLFFEEVLEEVGVE 261

Query: 227 NVIQILSHSASPTMEAVGKQLMDKYKTFFWTVNGPYCIELVLEKLGMSGFIREIFKKAKT 48
            V+QI+++S S  M  VGK+LM+K KT FWTV+  +C+EL+L+       I+E  +KAKT
Sbjct: 262 TVVQIVAYSTSACMMEVGKKLMEKCKTVFWTVDASHCMELMLQNFTKIDPIQEALEKAKT 321

Query: 47  ITRFVHGHASVLRLL 3
           +T+F++ HA+ L+LL
Sbjct: 322 LTQFIYSHATALKLL 336


>ref|XP_002269962.2| PREDICTED: uncharacterized protein LOC100251332 [Vitis vinifera]
          Length = 709

 Score =  227 bits (579), Expect = 5e-57
 Identities = 110/218 (50%), Positives = 151/218 (69%), Gaps = 2/218 (0%)
 Frame = -2

Query: 650 KSNEGLSPREFHKCIGRFLYETGTEFEAIQSDSFQLMLG--LKCGQTAYSIPSYDDLRGW 477
           K  E +   +  KCIGRFLYE GT+F A    S + M+     C Q  Y  PS+ +L+G 
Sbjct: 148 KEGEDIPVSQAKKCIGRFLYEMGTDFSAATPTSLRRMINGIHSCHQVEYEFPSHQELKGC 207

Query: 476 ILRDLLKEMQLYVNNIKSSWESTGCSMLLDGWEDSKGQNLVNVLVTCPKGTVYIRSADIA 297
           IL+D +KEM  +V+ I+ +W +TGCS+++DGW+D KG+NL+N LV CP G + +R  DI+
Sbjct: 208 ILQDEVKEMLHHVHGIRDTWATTGCSIVVDGWKDEKGRNLMNFLVDCPWGPICLRLCDIS 267

Query: 296 NFHPDSDTVEVFFEEVLADVGVKNVIQILSHSASPTMEAVGKQLMDKYKTFFWTVNGPYC 117
               D  ++ + FE+V+A+VGV+NV+QI+SHSAS  M AVG  LMDKY T FWTV+  +C
Sbjct: 268 TLSDDVHSLVLLFEQVIAEVGVENVVQIVSHSASECMAAVGNTLMDKYPTLFWTVSASHC 327

Query: 116 IELVLEKLGMSGFIREIFKKAKTITRFVHGHASVLRLL 3
           IE++LEK+GM G  REI  KAKTITRF++ HA VL L+
Sbjct: 328 IEMMLEKIGMMGTTREILDKAKTITRFIYCHAMVLNLM 365


>gb|EOY26199.1| HAT transposon superfamily protein, putative [Theobroma cacao]
          Length = 709

 Score =  225 bits (574), Expect = 2e-56
 Identities = 115/241 (47%), Positives = 159/241 (65%), Gaps = 3/241 (1%)
 Frame = -2

Query: 716 VLPSRRNCSPDRNT-TKSHEMNLKSNEGLSPREFHKCIGRFLYETGTEFEAIQSDSFQLM 540
           +LPS R  S    T     E + K N+        +CIGRF YETG +   + S SFQ M
Sbjct: 134 ILPSARIVSQSAVTGDPEEEPSCKQNK--------RCIGRFFYETGIDLTLVNSPSFQRM 185

Query: 539 LG-LKC-GQTAYSIPSYDDLRGWILRDLLKEMQLYVNNIKSSWESTGCSMLLDGWEDSKG 366
           +    C GQT Y IPS  +L+GWIL+D +KEMQ YV  I+ SW S+GCS+LLDGW D KG
Sbjct: 186 INDTHCPGQTNYKIPSCQELKGWILKDEVKEMQEYVEKIRQSWASSGCSILLDGWIDEKG 245

Query: 365 QNLVNVLVTCPKGTVYIRSADIANFHPDSDTVEVFFEEVLADVGVKNVIQILSHSASPTM 186
           +NLV+ +V CP+G +Y+ S+D++    D D +++ F+ V+ DVGV+NV+QI++ S    +
Sbjct: 246 RNLVSFIVDCPQGPIYLHSSDVSASVDDVDALQLLFDRVIDDVGVENVVQIIAFSTEGWV 305

Query: 185 EAVGKQLMDKYKTFFWTVNGPYCIELVLEKLGMSGFIREIFKKAKTITRFVHGHASVLRL 6
            AVGKQ M + KT FWTVN  +CIEL+L+K+ M G IR   + A+TI++F+HGH +VL L
Sbjct: 306 GAVGKQFMGRSKTVFWTVNASHCIELMLDKIAMMGEIRGTLENARTISKFIHGHLTVLNL 365

Query: 5   L 3
           L
Sbjct: 366 L 366


>ref|XP_004246933.1| PREDICTED: uncharacterized protein LOC101250835 [Solanum
           lycopersicum]
          Length = 640

 Score =  221 bits (562), Expect = 5e-55
 Identities = 114/248 (45%), Positives = 165/248 (66%), Gaps = 14/248 (5%)
 Frame = -2

Query: 704 RRNCSPDRNTTKSHEM-NLKSNE-----------GLSPREFHKCIGRFLYETGTEFEAIQ 561
           R  C  D + T+S E  N K N              S +E  K IGRF YE G +F+AI+
Sbjct: 20  RNLCPRDGDVTQSSESANKKHNRTNSKVAGTCVVDSSSQEISKSIGRFFYEAGIDFDAIR 79

Query: 560 SDSFQLML--GLKCGQTAYSIPSYDDLRGWILRDLLKEMQLYVNNIKSSWESTGCSMLLD 387
           S SFQ M+   L  GQT    PS  +L+GWIL+D +KEMQ YV  I+ SW STGCS+LLD
Sbjct: 80  SPSFQRMVIATLSLGQTI-KFPSCQELKGWILQDAVKEMQQYVTEIRDSWTSTGCSILLD 138

Query: 386 GWEDSKGQNLVNVLVTCPKGTVYIRSADIANFHPDSDTVEVFFEEVLADVGVKNVIQILS 207
           GW D   +NL+N+LV CP+GT+Y+RS+DI++F+ +   + +F EE+L +VGV+ V+QI++
Sbjct: 139 GWIDLNNRNLINILVYCPRGTIYLRSSDISSFNGNVGAMLLFLEEILEEVGVETVVQIVT 198

Query: 206 HSASPTMEAVGKQLMDKYKTFFWTVNGPYCIELVLEKLGMSGFIREIFKKAKTITRFVHG 27
           +S +  M   GK+LM+K++T FW V+  +C+EL+L+K      I E+ +KAKT+T+F++ 
Sbjct: 199 YSTAACMMEAGKKLMEKHRTVFWAVDAYHCMELMLQKFTKIDPIHEVMEKAKTLTQFIYS 258

Query: 26  HASVLRLL 3
           HA+VL+LL
Sbjct: 259 HATVLKLL 266


>ref|XP_004246932.1| PREDICTED: uncharacterized protein LOC101250543 [Solanum
           lycopersicum]
          Length = 618

 Score =  215 bits (547), Expect = 2e-53
 Identities = 103/212 (48%), Positives = 151/212 (71%), Gaps = 2/212 (0%)
 Frame = -2

Query: 632 SPREFHKCIGRFLYETGTEFEAIQSDSFQLMLG--LKCGQTAYSIPSYDDLRGWILRDLL 459
           S +E  K IGRF YE+G +F+AI+  SFQ+M    L  GQT    PS  DL+GWIL+D +
Sbjct: 76  SSQEISKSIGRFFYESGLDFDAIRLPSFQMMFKATLSPGQTV-KFPSCQDLKGWILQDAV 134

Query: 458 KEMQLYVNNIKSSWESTGCSMLLDGWEDSKGQNLVNVLVTCPKGTVYIRSADIANFHPDS 279
            EMQLYV  I+SSW  TGCS+LLDGW DS G+NL+N+LV CP+GT+Y+RS+DI +F+ + 
Sbjct: 135 HEMQLYVTEIRSSWPRTGCSILLDGWIDSNGRNLINILVYCPRGTIYLRSSDITSFYENP 194

Query: 278 DTVEVFFEEVLADVGVKNVIQILSHSASPTMEAVGKQLMDKYKTFFWTVNGPYCIELVLE 99
           D + VF EE+L +VGV+NV+QI++HS S  M A G++LMD  KT F++++   C+ L+L+
Sbjct: 195 DAMLVFLEEILEEVGVENVVQIIAHSTSHWMIAAGEKLMDSCKTVFFSIDASRCMGLMLQ 254

Query: 98  KLGMSGFIREIFKKAKTITRFVHGHASVLRLL 3
            +    +I +  +KAK + +F++ H + ++LL
Sbjct: 255 NVTQIDWIGQALQKAKMLIQFIYSHTTTMKLL 286


>ref|XP_002310902.1| predicted protein [Populus trichocarpa]
          Length = 705

 Score =  213 bits (541), Expect = 1e-52
 Identities = 112/278 (40%), Positives = 157/278 (56%), Gaps = 39/278 (14%)
 Frame = -2

Query: 719 PVLPSRRNCSPDRNTTKSHE-------------------------------------MNL 651
           P LP +R CSPD N  K  +                                     M+ 
Sbjct: 86  PDLPWKRYCSPDLNAAKRKKRDANQTTGCGSGMHAEMHSVVEDDMTEHVSVNNRRRAMSS 145

Query: 650 KSNEGLSPREFHKCIGRFLYETGTEFEAIQSDSFQLMLG--LKCGQTAYSIPSYDDLRGW 477
              E +  R+  +CIGRF YETG +F A    SFQ M+   L  G + Y +PS  DL+GW
Sbjct: 146 GPKENVMSRQAQRCIGRFFYETGFDFSASTLPSFQRMINATLDDGHSEYKVPSLQDLKGW 205

Query: 476 ILRDLLKEMQLYVNNIKSSWESTGCSMLLDGWEDSKGQNLVNVLVTCPKGTVYIRSADIA 297
           IL D ++E++ YVN I  SW STGCS+LLDGW D KG+NLV+ +V CP G  Y+RSAD++
Sbjct: 206 ILHDEVEEIKTYVNEISHSWASTGCSVLLDGWVDEKGRNLVSFVVECPGGPTYLRSADVS 265

Query: 296 NFHPDSDTVEVFFEEVLADVGVKNVIQILSHSASPTMEAVGKQLMDKYKTFFWTVNGPYC 117
               D + +++  E V+ +VG+ NV+QI++ S    + AVG+Q M +Y   FW V+  +C
Sbjct: 266 AIIDDVNALQLLLEGVIEEVGIDNVVQIVAFSTVGWVGAVGEQFMQRYWCVFWCVSASHC 325

Query: 116 IELVLEKLGMSGFIREIFKKAKTITRFVHGHASVLRLL 3
           IEL+LEK+G    IR   +KAK IT+F++GH  VL+L+
Sbjct: 326 IELMLEKIGAMDSIRRTLEKAKIITKFIYGHKKVLKLM 363


>ref|XP_002530377.1| protein dimerization, putative [Ricinus communis]
           gi|223530094|gb|EEF32010.1| protein dimerization,
           putative [Ricinus communis]
          Length = 698

 Score =  211 bits (536), Expect = 5e-52
 Identities = 100/228 (43%), Positives = 151/228 (66%)
 Frame = -2

Query: 686 DRNTTKSHEMNLKSNEGLSPREFHKCIGRFLYETGTEFEAIQSDSFQLMLGLKCGQTAYS 507
           +R       +N ++ E  S R+  +CIGRF YETG +F    S SF+ ML    G     
Sbjct: 136 NRRVDPEFAINGEAKEDASSRQAKRCIGRFFYETGIDFSNANSPSFKRMLNTTLGDGQVK 195

Query: 506 IPSYDDLRGWILRDLLKEMQLYVNNIKSSWESTGCSMLLDGWEDSKGQNLVNVLVTCPKG 327
           IP+  + +GWIL D LKE Q YV  I++SW STGCS+LLDGW + KGQNLV+ +V  P+G
Sbjct: 196 IPTIHEFKGWILWDELKETQEYVKKIRNSWASTGCSLLLDGWMNEKGQNLVSFVVEGPEG 255

Query: 326 TVYIRSADIANFHPDSDTVEVFFEEVLADVGVKNVIQILSHSASPTMEAVGKQLMDKYKT 147
            +Y+RSA++++   D D +++  + V+ +VGV NV+QI++ S +  M  +GKQ MD+ +T
Sbjct: 256 LIYLRSANVSDIINDLDALQLLLDRVMEEVGVDNVVQIIACSTTGWMGTIGKQFMDRRRT 315

Query: 146 FFWTVNGPYCIELVLEKLGMSGFIREIFKKAKTITRFVHGHASVLRLL 3
            FW+V+  +CI+L+LEK+G    I+ I +KAK IT+F++G+  VL+L+
Sbjct: 316 VFWSVSASHCIKLMLEKIGAMDCIKWIIEKAKIITKFIYGNGEVLKLM 363


>ref|XP_006345717.1| PREDICTED: uncharacterized protein LOC102580052 [Solanum tuberosum]
          Length = 586

 Score =  201 bits (512), Expect = 3e-49
 Identities = 95/194 (48%), Positives = 143/194 (73%), Gaps = 2/194 (1%)
 Frame = -2

Query: 578 EFEAIQSDSFQLMLG--LKCGQTAYSIPSYDDLRGWILRDLLKEMQLYVNNIKSSWESTG 405
           +F+AI+S SF+ M+   L  GQT    PS  +L GWIL D ++EMQ YV  I+ SW STG
Sbjct: 78  DFDAIRSPSFRRMVKATLSPGQTI-KFPSCQELNGWILEDAVQEMQQYVTEIRKSWASTG 136

Query: 404 CSMLLDGWEDSKGQNLVNVLVTCPKGTVYIRSADIANFHPDSDTVEVFFEEVLADVGVKN 225
           CS+LLDGW D   +NL+N+LV CP+GT+Y+RS+DI++F  + D + +F EE+L +VGV+N
Sbjct: 137 CSILLDGWIDLNNRNLINILVYCPRGTIYLRSSDISSFSRNFDAMLLFLEEILEEVGVEN 196

Query: 224 VIQILSHSASPTMEAVGKQLMDKYKTFFWTVNGPYCIELVLEKLGMSGFIREIFKKAKTI 45
           V+QI++++ S  M   GK+LMDK KT FW+++  YC+EL+L+++   G+I+E  +KAK +
Sbjct: 197 VVQIVAYTTSDWMMEAGKKLMDKCKTVFWSIDASYCMELMLQEVTKIGWIKEALEKAKML 256

Query: 44  TRFVHGHASVLRLL 3
            +F++ HA+VL+LL
Sbjct: 257 VQFIYSHATVLKLL 270


>ref|XP_006299218.1| hypothetical protein CARUB_v10015366mg [Capsella rubella]
           gi|482567927|gb|EOA32116.1| hypothetical protein
           CARUB_v10015366mg [Capsella rubella]
          Length = 596

 Score =  162 bits (411), Expect = 1e-37
 Identities = 82/208 (39%), Positives = 125/208 (60%), Gaps = 1/208 (0%)
 Frame = -2

Query: 626 REFHKCIGRFLYETGTEFEAIQSDSFQ-LMLGLKCGQTAYSIPSYDDLRGWILRDLLKEM 450
           ++  +C+ RFLYE G +F A+ S SFQ LM  +  G+ A  IP   DL GW+L++ LKE+
Sbjct: 109 KQSQRCLARFLYEHGVDFSALDSTSFQELMTTVTGGKLALKIPDSRDLNGWMLQEALKEV 168

Query: 449 QLYVNNIKSSWESTGCSMLLDGWEDSKGQNLVNVLVTCPKGTVYIRSADIANFHPDSDTV 270
           Q  V  IK SWE TGCS+LLD W D KG++LV+ +  CP G VY++S+D++    D   +
Sbjct: 169 QDRVKEIKDSWEITGCSILLDAWIDQKGRDLVSFVADCPAGAVYLKSSDVSGIKTDVTAL 228

Query: 269 EVFFEEVLADVGVKNVIQILSHSASPTMEAVGKQLMDKYKTFFWTVNGPYCIELVLEKLG 90
           +     ++ + GV NVIQI++ S S  +  +GK L       FW+V+  +CIEL+L ++G
Sbjct: 229 KSLVNGIVEEAGVHNVIQIVACSTSGWVGELGKLLAGHDMKVFWSVSISHCIELMLVEIG 288

Query: 89  MSGFIREIFKKAKTITRFVHGHASVLRL 6
                 +I  K   I   +H + S+L++
Sbjct: 289 KMHSFGDILNKVNIIQESIHNNPSLLKI 316


>ref|XP_006486394.1| PREDICTED: uncharacterized protein LOC102626522 [Citrus sinensis]
          Length = 745

 Score =  155 bits (392), Expect = 2e-35
 Identities = 78/234 (33%), Positives = 138/234 (58%), Gaps = 1/234 (0%)
 Frame = -2

Query: 701 RNCSPDRNTTKSHEMNLKSNEGLSPREFHKCIGRFLYETGTEFEAIQSDSFQLML-GLKC 525
           +N S +  T      +L +  G +P      +GRFLY+ G   +A+ S+ FQ M+  +  
Sbjct: 155 KNSSVNAYTGAMISASLDATRGNNP--IFMAVGRFLYDIGAPLDAVNSEYFQPMVDAIAS 212

Query: 524 GQTAYSIPSYDDLRGWILRDLLKEMQLYVNNIKSSWESTGCSMLLDGWEDSKGQNLVNVL 345
           G    ++PSY D+RGWIL++ ++E++  V+   ++W  TGCS+L+D W    G+ L+  L
Sbjct: 213 GGPEAAMPSYHDIRGWILKNSVEEVKNDVDRYTTTWGKTGCSILVDQWNTEAGRTLLCFL 272

Query: 344 VTCPKGTVYIRSADIANFHPDSDTVEVFFEEVLADVGVKNVIQILSHSASPTMEAVGKQL 165
             CP+GTV+++S D +     SD +    ++V+ +VGV++V+Q+++ S+     A G++L
Sbjct: 273 AYCPEGTVFLKSVDASGIMNSSDALYELLKQVVEEVGVRHVLQVIT-SSEEQFIAAGRRL 331

Query: 164 MDKYKTFFWTVNGPYCIELVLEKLGMSGFIREIFKKAKTITRFVHGHASVLRLL 3
            D + T +WT     C++L+LE      +I  I ++A+ +TRFV+ H+ VL +L
Sbjct: 332 TDTFPTLYWTPCAARCLDLILEDFAKLEWINAIIEQARAVTRFVYNHSVVLNML 385


>ref|XP_002311919.1| predicted protein [Populus trichocarpa]
          Length = 617

 Score =  153 bits (387), Expect = 9e-35
 Identities = 78/219 (35%), Positives = 127/219 (57%), Gaps = 1/219 (0%)
 Frame = -2

Query: 656 NLKSNEGLSPREFHKCIGRFLYETGTEFEAIQSDSFQLML-GLKCGQTAYSIPSYDDLRG 480
           NL S E +   +    + RF+YE G    A  S +FQ M   +      Y +PSY+ LRG
Sbjct: 40  NLASQESIDQADI--AVARFMYEAGVPLSAANSCTFQQMADSIAAVGPGYKMPSYNALRG 97

Query: 479 WILRDLLKEMQLYVNNIKSSWESTGCSMLLDGWEDSKGQNLVNVLVTCPKGTVYIRSADI 300
            +L   +++   Y   ++ SWE TGCS+L+D W D   + ++N  V CPKGT++++S D 
Sbjct: 98  RLLNKSVQDAGEYCTELRKSWEVTGCSVLVDRWMDRINRTVINFFVYCPKGTMFLKSVDA 157

Query: 299 ANFHPDSDTVEVFFEEVLADVGVKNVIQILSHSASPTMEAVGKQLMDKYKTFFWTVNGPY 120
            +    +  +   F+ V+ +VG K ++  ++   SP+ +A GK L DKYKTFF +  G  
Sbjct: 158 TDITKSAAGLYNLFDSVVQEVGPKIIVNFVT-DTSPSYKAAGKLLADKYKTFFCSTCGVQ 216

Query: 119 CIELVLEKLGMSGFIREIFKKAKTITRFVHGHASVLRLL 3
           CI+L+LE++     ++E+ +KAK +TRF++ +A VL L+
Sbjct: 217 CIDLMLEEISKKDEVKEVLEKAKRVTRFIYNNARVLNLM 255


>ref|XP_002512206.1| DNA binding protein, putative [Ricinus communis]
           gi|223548750|gb|EEF50240.1| DNA binding protein,
           putative [Ricinus communis]
          Length = 739

 Score =  152 bits (385), Expect = 2e-34
 Identities = 75/203 (36%), Positives = 121/203 (59%), Gaps = 1/203 (0%)
 Frame = -2

Query: 608 IGRFLYETGTEFEAIQSDSFQLML-GLKCGQTAYSIPSYDDLRGWILRDLLKEMQLYVNN 432
           + RF YE G  F A  S  FQ M   +      Y +PSY  LRG +L   +++ + Y + 
Sbjct: 172 VARFFYEAGIPFTAANSYFFQQMADNIIAAGPGYKMPSYTSLRGKLLNRCIQDAEEYCSE 231

Query: 431 IKSSWESTGCSMLLDGWEDSKGQNLVNVLVTCPKGTVYIRSADIANFHPDSDTVEVFFEE 252
           ++ SWE TGC++L+D W   + + ++N  V CPKGT+++RS D +      + +   F+ 
Sbjct: 232 LRKSWEVTGCTVLVDRWMHGRDRTVINFFVYCPKGTMFLRSVDASGITKSVEALLNLFDS 291

Query: 251 VLADVGVKNVIQILSHSASPTMEAVGKQLMDKYKTFFWTVNGPYCIELVLEKLGMSGFIR 72
           V+  VG+KN++  ++ S  PT +  GK L +KYKTFF +  G  CI L+LE++G S  I+
Sbjct: 292 VVQQVGLKNIVNFVTDSV-PTYKNAGKLLAEKYKTFFCSTCGAECINLMLEEIGESDGIK 350

Query: 71  EIFKKAKTITRFVHGHASVLRLL 3
           E+  KAK +T+F++ ++ VL L+
Sbjct: 351 EVLAKAKRLTQFIYNNSWVLNLM 373


>ref|NP_001154234.1| hAT transposon superfamily [Arabidopsis thaliana]
           gi|240255844|ref|NP_193238.5| hAT transposon superfamily
           [Arabidopsis thaliana] gi|332658140|gb|AEE83540.1| hAT
           transposon superfamily [Arabidopsis thaliana]
           gi|332658141|gb|AEE83541.1| hAT transposon superfamily
           [Arabidopsis thaliana]
          Length = 768

 Score =  152 bits (384), Expect = 2e-34
 Identities = 75/206 (36%), Positives = 127/206 (61%), Gaps = 1/206 (0%)
 Frame = -2

Query: 617 HKCIGRFLYETGTEFEAIQSDSFQLML-GLKCGQTAYSIPSYDDLRGWILRDLLKEMQLY 441
           H  IGRFL+  G +F+A+ S +FQ M+  +  G    S P++DDLRGWIL++ ++EM   
Sbjct: 195 HMAIGRFLFGIGADFDAVNSVNFQPMIDAIASGGFGVSAPTHDDLRGWILKNCVEEMAKE 254

Query: 440 VNNIKSSWESTGCSMLLDGWEDSKGQNLVNVLVTCPKGTVYIRSADIANFHPDSDTVEVF 261
           ++  K+ W+ TGCS+L++     KG  ++N LV CP+  V+++S D +     +D +   
Sbjct: 255 IDECKAMWKRTGCSILVEELNSDKGFKVLNFLVYCPEKVVFLKSVDASEVLSSADKLFEL 314

Query: 260 FEEVLADVGVKNVIQILSHSASPTMEAVGKQLMDKYKTFFWTVNGPYCIELVLEKLGMSG 81
             E++ +VG  NV+Q+++      ++A GK+LM  Y + +W     +CI+ +LE+ G  G
Sbjct: 315 LSELVEEVGSTNVVQVITKCDDYYVDA-GKRLMLVYPSLYWVPCAAHCIDQMLEEFGKLG 373

Query: 80  FIREIFKKAKTITRFVHGHASVLRLL 3
           +I E  ++A+ ITRFV+ H+ VL L+
Sbjct: 374 WISETIEQAQAITRFVYNHSGVLNLM 399


>gb|AAM98154.1| putative protein [Arabidopsis thaliana]
          Length = 768

 Score =  152 bits (384), Expect = 2e-34
 Identities = 75/206 (36%), Positives = 127/206 (61%), Gaps = 1/206 (0%)
 Frame = -2

Query: 617 HKCIGRFLYETGTEFEAIQSDSFQLML-GLKCGQTAYSIPSYDDLRGWILRDLLKEMQLY 441
           H  IGRFL+  G +F+A+ S +FQ M+  +  G    S P++DDLRGWIL++ ++EM   
Sbjct: 195 HMAIGRFLFGIGADFDAVNSVNFQPMIDAIASGGFGVSAPTHDDLRGWILKNCVEEMAKE 254

Query: 440 VNNIKSSWESTGCSMLLDGWEDSKGQNLVNVLVTCPKGTVYIRSADIANFHPDSDTVEVF 261
           ++  K+ W+ TGCS+L++     KG  ++N LV CP+  V+++S D +     +D +   
Sbjct: 255 IDECKAMWKRTGCSILVEELNSDKGFKVLNFLVYCPEKVVFLKSVDASEVLSSADKLFEL 314

Query: 260 FEEVLADVGVKNVIQILSHSASPTMEAVGKQLMDKYKTFFWTVNGPYCIELVLEKLGMSG 81
             E++ +VG  NV+Q+++      ++A GK+LM  Y + +W     +CI+ +LE+ G  G
Sbjct: 315 LSELVEEVGSTNVVQVITKCDDYYVDA-GKRLMLVYPSLYWVPCAAHCIDQMLEEFGKLG 373

Query: 80  FIREIFKKAKTITRFVHGHASVLRLL 3
           +I E  ++A+ ITRFV+ H+ VL L+
Sbjct: 374 WISETIEQAQAITRFVYNHSGVLNLM 399


>ref|XP_002521049.1| DNA binding protein, putative [Ricinus communis]
           gi|223539752|gb|EEF41333.1| DNA binding protein,
           putative [Ricinus communis]
          Length = 854

 Score =  151 bits (381), Expect = 4e-34
 Identities = 70/206 (33%), Positives = 128/206 (62%), Gaps = 1/206 (0%)
 Frame = -2

Query: 617 HKCIGRFLYETGTEFEAIQSDSFQLMLG-LKCGQTAYSIPSYDDLRGWILRDLLKEMQLY 441
           H  +GRFLY+ G  F+A+ S  F+ ++  L  G +    PS  DLRGWIL+ L++E++  
Sbjct: 292 HTTVGRFLYDIGANFDALDSIYFRSLIDMLSSGASGAVAPSNHDLRGWILKKLVEEIKND 351

Query: 440 VNNIKSSWESTGCSMLLDGWEDSKGQNLVNVLVTCPKGTVYIRSADIANFHPDSDTVEVF 261
           ++  +++W  TGCS+L++ W    G  L+N LV C +GTV+++S + ++     D + V 
Sbjct: 352 IDQSRTTWARTGCSVLVEEWNSESGITLLNFLVNCSQGTVFLKSVEASHIIYSPDGLYVL 411

Query: 260 FEEVLADVGVKNVIQILSHSASPTMEAVGKQLMDKYKTFFWTVNGPYCIELVLEKLGMSG 81
            ++V+ +VG  NV+Q+++ + +      GK+LM+ + + FW     +C++L+LE      
Sbjct: 412 LKQVVEEVGASNVLQVIT-NGNEHYTVAGKRLMEAFPSLFWAPCAVHCLDLILEDFAKLE 470

Query: 80  FIREIFKKAKTITRFVHGHASVLRLL 3
           +I  + ++AK++TRFV+ H++VL L+
Sbjct: 471 WIDAVIEQAKSVTRFVYNHSAVLNLM 496


>ref|NP_187908.1| hAT transposon superfamily protein [Arabidopsis thaliana]
           gi|15795134|dbj|BAB02512.1| transposase-like protein
           [Arabidopsis thaliana] gi|332641756|gb|AEE75277.1| hAT
           transposon superfamily protein [Arabidopsis thaliana]
          Length = 605

 Score =  150 bits (379), Expect = 8e-34
 Identities = 81/214 (37%), Positives = 122/214 (57%), Gaps = 1/214 (0%)
 Frame = -2

Query: 644 NEGLSPREFHKCIGRFLYETGTEFEAIQSDSF-QLMLGLKCGQTAYSIPSYDDLRGWILR 468
           N+ L   +  KCIGRF YE   +  A+ S  F ++M+ L  GQ    IP   DL G +L+
Sbjct: 116 NQDLLSSKAQKCIGRFFYEHCVDLSAVDSPCFKEMMMALGVGQ---KIPDSHDLNGRLLQ 172

Query: 467 DLLKEMQLYVNNIKSSWESTGCSMLLDGWEDSKGQNLVNVLVTCPKGTVYIRSADIANFH 288
           + +KE+Q YV NIK SW+ TGCS+LLD W D KG +LV+ +  CP G VY++S D++   
Sbjct: 173 EAMKEVQDYVKNIKDSWKITGCSILLDAWIDPKGHDLVSFVADCPAGPVYLKSIDVSVVK 232

Query: 287 PDSDTVEVFFEEVLADVGVKNVIQILSHSASPTMEAVGKQLMDKYKTFFWTVNGPYCIEL 108
            D   +      ++ +VGV NV QI++ S S  +  +GK      +  FW+V+  +C EL
Sbjct: 233 NDVTALLSLVNGLVEEVGVHNVTQIIACSTSGWVGELGKLFSGHDREVFWSVSLSHCFEL 292

Query: 107 VLEKLGMSGFIREIFKKAKTITRFVHGHASVLRL 6
           +L K+G      +I  K  TI  F++ + S L++
Sbjct: 293 MLVKIGKMRSFGDILDKVNTIWEFINNNPSALKI 326


>ref|XP_002323178.1| predicted protein [Populus trichocarpa]
          Length = 530

 Score =  150 bits (379), Expect = 8e-34
 Identities = 74/225 (32%), Positives = 135/225 (60%), Gaps = 3/225 (1%)
 Frame = -2

Query: 668 SHEMNLKSNEGLSPRE--FHKCIGRFLYETGTEFEAIQSDSFQLML-GLKCGQTAYSIPS 498
           +H+ +     GL   +   H  +GRFLY+ G   +A+ S  FQ ++  +  GQ+  + PS
Sbjct: 151 AHDADALMGLGLEKADNAIHVTMGRFLYDIGASLDALDSSFFQPLIDAVFSGQSGIAAPS 210

Query: 497 YDDLRGWILRDLLKEMQLYVNNIKSSWESTGCSMLLDGWEDSKGQNLVNVLVTCPKGTVY 318
           + D RG IL+ L++E++  +   K+ W  TGCS+L++ W+   G  L+N LV C KGTV+
Sbjct: 211 HQDFRGRILKSLVEEVKSDIEQHKTRWAKTGCSLLVEEWDSGSGLTLLNFLVYCSKGTVF 270

Query: 317 IRSADIANFHPDSDTVEVFFEEVLADVGVKNVIQILSHSASPTMEAVGKQLMDKYKTFFW 138
           ++S D +N    +D +    ++++ +VG  NV+Q++++     + A GK++MD + + +W
Sbjct: 271 LKSVDASNLIYSTDGLYELLKQMVEEVGAGNVLQVITNGEEHYVTA-GKKIMDTFPSLYW 329

Query: 137 TVNGPYCIELVLEKLGMSGFIREIFKKAKTITRFVHGHASVLRLL 3
                 CI+ +LE LG   +I  + ++AK++TRFV+ +++VL L+
Sbjct: 330 APCAARCIDQILEDLGKLEWINAVLEQAKSVTRFVYNNSAVLNLM 374

