BLASTX nr result

ID: Cephaelis21_contig00003185 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00003185
         (1149 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003540767.1| PREDICTED: uncharacterized protein LOC100797...   197   4e-48
ref|XP_002866395.1| hypothetical protein ARALYDRAFT_496228 [Arab...   192   1e-46
ref|NP_200888.2| heavy metal transport/detoxification domain-con...   187   6e-45
ref|NP_001190582.1| heavy metal transport/detoxification domain-...   185   2e-44
ref|XP_002276537.1| PREDICTED: uncharacterized protein LOC100261...   172   1e-40

>ref|XP_003540767.1| PREDICTED: uncharacterized protein LOC100797817 [Glycine max]
          Length = 639

 Score =  197 bits (501), Expect = 4e-48
 Identities = 117/281 (41%), Positives = 150/281 (53%), Gaps = 11/281 (3%)
 Frame = -2

Query: 821  EGCAGKVVKFVRAFQGVEAVKVELSSGKITANGKVDPVTLREKLEEKTHKKVELLSPVPK 642
            +GCA K++K +RAFQGVE VK E  +GK+T  GKVDP  +R+ L EK  KKVEL+SP PK
Sbjct: 373  DGCASKIIKHLRAFQGVETVKAESDAGKVTVTGKVDPTKVRDNLAEKIRKKVELVSPQPK 432

Query: 641  KDKENVKEXXXXXXXXXXXXXXXXXXXXXXXXXDERTPKEPPVTMAVLKVPLHCEGCIQK 462
            K+KEN K+                          ++T  +  VT AVLKV LHC+GC+ +
Sbjct: 433  KEKENEKDPKPNNKSENKTQD-------------KKTKDKEVVTTAVLKVALHCQGCLDR 479

Query: 461  IHRIVSKTKGFEEIKIDKQKDLVTVKGAIDMKGLAENLKKHLKRNVEIVPPKR------- 303
            I + V KTKG +E+ IDK+K++VTVKG +D+K LAENL + LKR VE+VPP++       
Sbjct: 480  IGKTVLKTKGVQEMAIDKEKEMVTVKGTMDVKALAENLMEKLKRKVEVVPPQKDKEGDNK 539

Query: 302  --XXXXXXXXXXXXXXXXXXXXXXXGDGVEKMEGNKMXXXXXXXXXXGYGLPPVAXXXXX 129
                                      DG+EK+E N+M          GYG P        
Sbjct: 540  EGGGGEKGSGKKKNKGGGGDKNENIEDGIEKIEHNRMEYLAPPAFGFGYG-PYGGYGHGH 598

Query: 128  XXXXXXXXXXYHVDPYQFHYHAH--APQMFSDENPNACIVM 12
                        V P Q H+H H  APQMFSDENPNAC VM
Sbjct: 599  GHGNIGGYSCVPVYPEQMHFHLHAPAPQMFSDENPNACSVM 639


>ref|XP_002866395.1| hypothetical protein ARALYDRAFT_496228 [Arabidopsis lyrata subsp.
           lyrata] gi|297312230|gb|EFH42654.1| hypothetical protein
           ARALYDRAFT_496228 [Arabidopsis lyrata subsp. lyrata]
          Length = 284

 Score =  192 bits (489), Expect = 1e-46
 Identities = 115/271 (42%), Positives = 145/271 (53%), Gaps = 1/271 (0%)
 Frame = -2

Query: 821 EGCAGKVVKFVRAFQGVEAVKVELSSGKITANGKVDPVTLREKLEEKTHKKVELLSPVPK 642
           EGCA ++VK VR+FQGVE VK E ++GK+T  G +DPV LREKLEEKT KKV+L+SP PK
Sbjct: 37  EGCASRIVKCVRSFQGVETVKSESATGKLTVTGALDPVKLREKLEEKTKKKVDLVSPQPK 96

Query: 641 KDKENVKEXXXXXXXXXXXXXXXXXXXXXXXXXDERTPKEPPVTMAVLKVPLHCEGCIQK 462
           K+KE                              E+ PKE PVT AVLK+  HC+GCI K
Sbjct: 97  KEKEKENNKDKNKNDEDKKKSEEKKKPDNN----EKKPKETPVTTAVLKLNFHCQGCIGK 152

Query: 461 IHRIVSKTKGFEEIKIDKQKDLVTVKGAIDMKGLAENLKKHLKRNVEIVPPKRXXXXXXX 282
           I + ++KTKG + + +DK+K+LVTVKG +D+K L E+L + LKR VEIVPPK+       
Sbjct: 153 IQKTITKTKGVDGLTMDKEKNLVTVKGTMDVKKLVESLSEKLKRQVEIVPPKKEKENGNE 212

Query: 281 XXXXXXXXXXXXXXXXGDGVE-KMEGNKMXXXXXXXXXXGYGLPPVAXXXXXXXXXXXXX 105
                             G +   EG  M          GYG  P               
Sbjct: 213 TGEKKKGGGGDGGGKEKSGNKGGGEGVNMMEYMAAQPAYGYGYYPGG------------- 259

Query: 104 XXYHVDPYQFHYHAHAPQMFSDENPNACIVM 12
                 PY +   AHAPQ+FSDENPNAC+VM
Sbjct: 260 ------PYGYPIQAHAPQIFSDENPNACVVM 284



 Score = 57.8 bits (138), Expect = 5e-06
 Identities = 28/73 (38%), Positives = 42/73 (57%)
 Frame = -2

Query: 524 EPPVTMAVLKVPLHCEGCIQKIHRIVSKTKGFEEIKIDKQKDLVTVKGAIDMKGLAENLK 345
           E P    VLKV +HCEGC  +I + V   +G E +K +     +TV GA+D   L E L+
Sbjct: 22  ETPSITVVLKVDMHCEGCASRIVKCVRSFQGVETVKSESATGKLTVTGALDPVKLREKLE 81

Query: 344 KHLKRNVEIVPPK 306
           +  K+ V++V P+
Sbjct: 82  EKTKKKVDLVSPQ 94


>ref|NP_200888.2| heavy metal transport/detoxification domain-containing protein
           [Arabidopsis thaliana] gi|10176908|dbj|BAB10101.1|
           unnamed protein product [Arabidopsis thaliana]
           gi|28416657|gb|AAO42859.1| At5g60800 [Arabidopsis
           thaliana] gi|110735953|dbj|BAE99951.1| hypothetical
           protein [Arabidopsis thaliana]
           gi|332009995|gb|AED97378.1| heavy metal
           transport/detoxification domain-containing protein
           [Arabidopsis thaliana]
          Length = 283

 Score =  187 bits (474), Expect = 6e-45
 Identities = 117/276 (42%), Positives = 146/276 (52%), Gaps = 6/276 (2%)
 Frame = -2

Query: 821 EGCAGKVVKFVRAFQGVEAVKVELSSGKITANGKVDPVTLREKLEEKTHKKVELLSPVPK 642
           EGCA ++VK VR+FQGVE VK E ++GK+T  G +DPV LREKLEEKT KKV+L+SP PK
Sbjct: 37  EGCASRIVKCVRSFQGVETVKSESATGKLTVTGALDPVKLREKLEEKTKKKVDLVSPQPK 96

Query: 641 KDKENVKEXXXXXXXXXXXXXXXXXXXXXXXXXDERTPKEPPVTMAVLKVPLHCEGCIQK 462
           K+KE   +                          ++ PKE PVT AVLK+  HC+GCI K
Sbjct: 97  KEKEKENKNKNDEDKKKSEEKKKPDNN-------DKKPKETPVTTAVLKLNFHCQGCIGK 149

Query: 461 IHRIVSKTKGFEEIKIDKQKDLVTVKGAIDMKGLAENLKKHLKRNVEIVPPKRXXXXXXX 282
           I + V+KTKG   + +DK+K+L+TVKG +D+K L E L + LKR VEIVPPK+       
Sbjct: 150 IQKTVTKTKGVNGLTMDKEKNLLTVKGTMDVKKLVEILSEKLKRAVEIVPPKK---EKDK 206

Query: 281 XXXXXXXXXXXXXXXXGDGVEKM------EGNKMXXXXXXXXXXGYGLPPVAXXXXXXXX 120
                           G G EK       EG  M          GYG  P          
Sbjct: 207 ENGNENGEKKKGGGGDGGGKEKTGNKGGGEGVNMMEYMAAQPAYGYGYYPGG-------- 258

Query: 119 XXXXXXXYHVDPYQFHYHAHAPQMFSDENPNACIVM 12
                      PY +   AHAPQ+FSDENPNAC+VM
Sbjct: 259 -----------PYGYPIQAHAPQIFSDENPNACVVM 283



 Score = 57.8 bits (138), Expect = 5e-06
 Identities = 28/73 (38%), Positives = 42/73 (57%)
 Frame = -2

Query: 524 EPPVTMAVLKVPLHCEGCIQKIHRIVSKTKGFEEIKIDKQKDLVTVKGAIDMKGLAENLK 345
           E P    VLKV +HCEGC  +I + V   +G E +K +     +TV GA+D   L E L+
Sbjct: 22  ETPSITVVLKVDMHCEGCASRIVKCVRSFQGVETVKSESATGKLTVTGALDPVKLREKLE 81

Query: 344 KHLKRNVEIVPPK 306
           +  K+ V++V P+
Sbjct: 82  EKTKKKVDLVSPQ 94


>ref|NP_001190582.1| heavy metal transport/detoxification domain-containing protein
           [Arabidopsis thaliana] gi|332009996|gb|AED97379.1| heavy
           metal transport/detoxification domain-containing protein
           [Arabidopsis thaliana]
          Length = 302

 Score =  185 bits (470), Expect = 2e-44
 Identities = 116/276 (42%), Positives = 146/276 (52%), Gaps = 6/276 (2%)
 Frame = -2

Query: 821 EGCAGKVVKFVRAFQGVEAVKVELSSGKITANGKVDPVTLREKLEEKTHKKVELLSPVPK 642
           EGCA ++VK VR+FQGVE VK E ++GK+T  G +DPV LREKLEEKT KKV+L+SP PK
Sbjct: 37  EGCASRIVKCVRSFQGVETVKSESATGKLTVTGALDPVKLREKLEEKTKKKVDLVSPQPK 96

Query: 641 KDKENVKEXXXXXXXXXXXXXXXXXXXXXXXXXDERTPKEPPVTMAVLKVPLHCEGCIQK 462
           K+KE   +                          ++ PKE PVT AVLK+  HC+GCI K
Sbjct: 97  KEKEKENKNKNDEDKKKSEEKKKPDNN-------DKKPKETPVTTAVLKLNFHCQGCIGK 149

Query: 461 IHRIVSKTKGFEEIKIDKQKDLVTVKGAIDMKGLAENLKKHLKRNVEIVPPKRXXXXXXX 282
           I + V+KTKG   + +DK+K+L+TVKG +D+K L E L + LKR VEIVPPK+       
Sbjct: 150 IQKTVTKTKGVNGLTMDKEKNLLTVKGTMDVKKLVEILSEKLKRAVEIVPPKK---EKDK 206

Query: 281 XXXXXXXXXXXXXXXXGDGVEKM------EGNKMXXXXXXXXXXGYGLPPVAXXXXXXXX 120
                           G G EK       EG  M          GYG  P          
Sbjct: 207 ENGNENGEKKKGGGGDGGGKEKTGNKGGGEGVNMMEYMAAQPAYGYGYYPGG-------- 258

Query: 119 XXXXXXXYHVDPYQFHYHAHAPQMFSDENPNACIVM 12
                      PY +   AHAPQ+FSDENPNAC+V+
Sbjct: 259 -----------PYGYPIQAHAPQIFSDENPNACVVI 283



 Score = 57.8 bits (138), Expect = 5e-06
 Identities = 28/73 (38%), Positives = 42/73 (57%)
 Frame = -2

Query: 524 EPPVTMAVLKVPLHCEGCIQKIHRIVSKTKGFEEIKIDKQKDLVTVKGAIDMKGLAENLK 345
           E P    VLKV +HCEGC  +I + V   +G E +K +     +TV GA+D   L E L+
Sbjct: 22  ETPSITVVLKVDMHCEGCASRIVKCVRSFQGVETVKSESATGKLTVTGALDPVKLREKLE 81

Query: 344 KHLKRNVEIVPPK 306
           +  K+ V++V P+
Sbjct: 82  EKTKKKVDLVSPQ 94


>ref|XP_002276537.1| PREDICTED: uncharacterized protein LOC100261829 [Vitis vinifera]
           gi|297734927|emb|CBI17161.3| unnamed protein product
           [Vitis vinifera]
          Length = 313

 Score =  172 bits (436), Expect = 1e-40
 Identities = 92/173 (53%), Positives = 109/173 (63%)
 Frame = -2

Query: 821 EGCAGKVVKFVRAFQGVEAVKVELSSGKITANGKVDPVTLREKLEEKTHKKVELLSPVPK 642
           EGC  KVVK+++   GV   K +  + K+T  GKVDP  LREKLE+KT KKVELLSP PK
Sbjct: 41  EGCGSKVVKYLKGLDGVANAKADSDTNKVTVIGKVDPSMLREKLEQKTKKKVELLSPAPK 100

Query: 641 KDKENVKEXXXXXXXXXXXXXXXXXXXXXXXXXDERTPKEPPVTMAVLKVPLHCEGCIQK 462
           KDK+N                            +++ PKEPPVT AVLK+ LHC GCI K
Sbjct: 101 KDKKN----------DDGGGGDKKAEKKPEKKAEDKKPKEPPVTTAVLKIDLHCAGCIDK 150

Query: 461 IHRIVSKTKGFEEIKIDKQKDLVTVKGAIDMKGLAENLKKHLKRNVEIVPPKR 303
           I R VSKTKG E   IDKQK+LVTV G +D+K L E+LK  LKR VEIVPPK+
Sbjct: 151 IQRTVSKTKGVESKSIDKQKNLVTVTGTMDVKALVESLKDRLKRPVEIVPPKK 203