BLASTX nr result

ID: Atropa21_contig00034777 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Atropa21_contig00034777
         (865 letters)

Database: nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006358739.1| PREDICTED: kinesin-like protein cut7-like [S...   103   6e-20
ref|XP_006582615.1| PREDICTED: uncharacterized protein LOC102665...    84   9e-14
emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulga...    80   1e-12
emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulga...    70   8e-10
ref|XP_006579104.1| PREDICTED: uncharacterized protein LOC102661...    69   2e-09
gb|AAF87143.1|AC002423_8 T23E23.16 [Arabidopsis thaliana]              69   2e-09
ref|XP_006579213.1| PREDICTED: uncharacterized protein LOC102670...    67   6e-09
ref|NP_175039.1| RNA-directed DNA polymerase (reverse transcript...    67   8e-09
ref|XP_006605183.1| PREDICTED: uncharacterized protein LOC102663...    66   2e-08
dbj|BAB01431.1| non-LTR retroelement reverse transcriptase-like ...    62   2e-07
ref|NP_567266.1| RNA-directed DNA polymerase (reverse transcript...    62   2e-07
gb|ABK28243.1| unknown [Arabidopsis thaliana]                          62   2e-07
dbj|BAE98403.1| putative non-LTR reverse transcriptase [Arabidop...    62   2e-07
gb|ABE65512.1| hypothetical protein At4g04650 [Arabidopsis thali...    62   2e-07
gb|AAC78274.1| putative reverse transcriptase [Arabidopsis thali...    62   2e-07
gb|AAD32866.1|AC005489_4 F14N23.4 [Arabidopsis thaliana]               62   3e-07
ref|XP_002865536.1| hypothetical protein ARALYDRAFT_917542 [Arab...    61   6e-07
ref|XP_003566528.1| PREDICTED: uncharacterized protein LOC100844...    60   8e-07
ref|XP_002448781.1| hypothetical protein SORBIDRAFT_06g033056 [S...    60   8e-07
dbj|BAD66732.1| orf147a [Beta vulgaris subsp. vulgaris] gi|54606...    60   8e-07

>ref|XP_006358739.1| PREDICTED: kinesin-like protein cut7-like [Solanum tuberosum]
          Length = 904

 Score =  103 bits (258), Expect = 6e-20
 Identities = 44/78 (56%), Positives = 54/78 (69%)
 Frame = +2

Query: 20  EATVTKYVWNIEEKTDNLWVRWVNHFY*KDQDWWQYSPSIDCSWY*RKICIIKERYKQGY 199
           EA V KYVWNI +K DNLWVRWV+H Y K  DWWQY+   D  WY +KIC I++++  GY
Sbjct: 26  EAAVEKYVWNIAQKADNLWVRWVDHVYIKGTDWWQYAIPHDSCWYWKKICRIRDKFAPGY 85

Query: 200 VGDGWMKPNGKYTIHSGC 253
             +G +KP GKY I S C
Sbjct: 86  NQNGSLKPEGKYMIASSC 103


>ref|XP_006582615.1| PREDICTED: uncharacterized protein LOC102665746 [Glycine max]
          Length = 506

 Score = 83.6 bits (205), Expect = 9e-14
 Identities = 54/159 (33%), Positives = 81/159 (50%), Gaps = 4/159 (2%)
 Frame = +2

Query: 2   DCII*NEATVTKYVWNIEEKTDNLWVRWVNHFY*KDQDWWQYSPSIDCSWY*RKICIIKE 181
           D  I N+A + K +WN+  K D+LWV+W+  +Y K  +          SW  +   I+K+
Sbjct: 249 DIDIWNKANLMKLLWNLSSKEDSLWVKWIQAYYVKRSELMHIEMKNTDSWIMK--AILKQ 306

Query: 182 RYKQGYVGDGWMKPNGKYTIHSGCLWRK----GEMKVWNASRWVWSVMNVPKYCFISWLA 349
           R     + D   +   + +I+ G L+RK    G+ K W     ++     P+  FI WLA
Sbjct: 307 REDLEKI-DNMEELMIRGSINMGKLYRKLQDCGQRKEW--KNLLYGNTARPRANFILWLA 363

Query: 350 ANNRLLTKQRLQKIGLVDNDLCALCGDKEESRAHLFFEC 466
            + RL TK RL K G++D+  C  C + EES  HLFF C
Sbjct: 364 CHGRLSTKDRLCKYGMIDDKSCCFCSE-EESMNHLFFVC 401


>emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1114

 Score = 80.1 bits (196), Expect = 1e-12
 Identities = 54/157 (34%), Positives = 79/157 (50%), Gaps = 4/157 (2%)
 Frame = +2

Query: 8    II*NEATVTKYVWNIEEKTDNLWVRWVNHFY*KDQDWWQYSPSIDCSWY*RKICIIKERY 187
            ++ N+A + K +W I  K D LWVRWVN +Y K Q+    + S + SW  RKI   +E  
Sbjct: 868  VLWNKAAILKLLWAITFKQDKLWVRWVNAYYIKRQNIENVTVSSNTSWILRKIFESRELL 927

Query: 188  KQGYVGDGW--MKPNGKYTIHS--GCLWRKGEMKVWNASRWVWSVMNVPKYCFISWLAAN 355
             +     GW  +  +  ++I      L    E  VW   R + +    PK  FI WLA  
Sbjct: 928  TR---TGGWEAVSNHMNFSIKKTYKLLQEDYENVVW--KRLICNNKATPKSQFILWLAML 982

Query: 356  NRLLTKQRLQKIGLVDNDLCALCGDKEESRAHLFFEC 466
            NRL T +R+ +     + LC +CG++ E+  HLFF C
Sbjct: 983  NRLATAERVSRWNRDVSPLCKMCGNEIETIQHLFFNC 1019


>emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1110

 Score = 70.5 bits (171), Expect = 8e-10
 Identities = 52/166 (31%), Positives = 78/166 (46%), Gaps = 7/166 (4%)
 Frame = +2

Query: 17   NEATVTKYVWNIEEKTDNLWVRWVNHFY*KDQDWWQYSPSIDCSWY*RKICIIKERYKQG 196
            N A + K +W IE K D LWVRW++ +Y K QD    + S   +W  RK  I+K R    
Sbjct: 868  NRAAMLKLLWAIEFKRDKLWVRWIHSYYIKRQDILTVNISNQTTWILRK--IVKARDHLS 925

Query: 197  YVGDGW--MKPNGKYTIHSGC--LWRKGEMKVWNASRWVWSVMNVPKYCFISWLAANNRL 364
             +GD W  +    K+++      +   GE   W   R + +    PK  FI W+  + RL
Sbjct: 926  NIGD-WDEICIGDKFSMKKAYKKISENGERVRWR--RLICNNYATPKSKFILWMMLHERL 982

Query: 365  LTKQRLQKIGLVDNDLCALCGDKEESRAHLFFECPKGVFI---LCY 493
             T  R+ + G+  +    LC +  E+  HLFF C     +   +CY
Sbjct: 983  PTVDRISRWGVQCDLNYRLCRNDGETIQHLFFSCSYSAGVWSKICY 1028


>ref|XP_006579104.1| PREDICTED: uncharacterized protein LOC102661523 [Glycine max]
          Length = 947

 Score = 69.3 bits (168), Expect = 2e-09
 Identities = 44/156 (28%), Positives = 77/156 (49%), Gaps = 6/156 (3%)
 Frame = +2

Query: 17   NEATVTKYVWNIEEKTDNLWVRWVNHFY*KDQDWWQYSPSIDCSWY*RKICIIKERYKQG 196
            N  TV   +WN+ +K DNLWV+W++  Y K+        + + SW  + +   +E     
Sbjct: 696  NHITVLNCLWNLCKKVDNLWVKWIHAHYIKNSSVMNTMVTNNFSWVLKNVLSQRE----- 750

Query: 197  YVGDGWMKPNGKYTIHSGCL-WRKGEMKVWNASRWVWSVMNV-----PKYCFISWLAANN 358
            Y+    ++P     ++S     +K   K+  A R  WS +       P+    +WLA + 
Sbjct: 751  YIHT--LQPVWDELLNSERFKMKKAYDKMMEADRVHWSGLMRKNCARPRAIHTTWLACHG 808

Query: 359  RLLTKQRLQKIGLVDNDLCALCGDKEESRAHLFFEC 466
            RL TK RL + G++ + + +LC + EE++ H+ F C
Sbjct: 809  RLGTKDRLVRFGMITDKIWSLCKEVEETQNHILFSC 844


>gb|AAF87143.1|AC002423_8 T23E23.16 [Arabidopsis thaliana]
          Length = 653

 Score = 69.3 bits (168), Expect = 2e-09
 Identities = 46/168 (27%), Positives = 66/168 (39%), Gaps = 3/168 (1%)
 Frame = +2

Query: 17  NEATVTKYVWNIEEKTDNLWVRWVNHFY*KDQDWWQYSPSIDCS---WY*RKICIIKERY 187
           NE +  K +W I   T++LWVRW+  +  K   +W    + +     W  R      E  
Sbjct: 410 NEVSCLKLIWRIVSHTNSLWVRWIEQYLLKHDTFWSVQTTTNMDSVLWRGRN----DEYM 465

Query: 188 KQGYVGDGWMKPNGKYTIHSGCLWRKGEMKVWNASRWVWSVMNVPKYCFISWLAANNRLL 367
            +    D W +     T      W  G          +W     PK+ F +WLA  NRL 
Sbjct: 466 PKFSTRDTWNQTRNTST---PVTWHMG----------IWFAHATPKFSFCAWLAVQNRLS 512

Query: 368 TKQRLQKIGLVDNDLCALCGDKEESRAHLFFECPKGVFILCYNGCILE 511
           T  ++ +     +  C LC +  E+R HLFF C       CY   I E
Sbjct: 513 TGDKMLQWNRRLSPTCVLCNNNIETRNHLFFSC-------CYTAEIWE 553


>ref|XP_006579213.1| PREDICTED: uncharacterized protein LOC102670237 [Glycine max]
          Length = 383

 Score = 67.4 bits (163), Expect = 6e-09
 Identities = 50/154 (32%), Positives = 74/154 (48%), Gaps = 4/154 (2%)
 Frame = +2

Query: 17  NEATVTKYVWNIEEKTDNLWVRWVNHFY*KDQDWWQY--SPSIDCSWY*RKICIIKERYK 190
           N A ++  +W++  K D+LWVR V+H+Y K  + W +  S S     + R I I KE   
Sbjct: 194 NIALLSCILWDLHSKKDSLWVRLVHHYYFKGGNVWDFISSSSDSVFIHIRDIIISKEENI 253

Query: 191 Q--GYVGDGWMKPNGKYTIHSGCLWRKGEMKVWNASRWVWSVMNVPKYCFISWLAANNRL 364
           +    + + W   N +        + +G   V + S  +W+ +   K  FI WLA  NRL
Sbjct: 254 EVAKLMLNSW-GCNEQTLAGKMYDYIRGTRPVVHWSSIIWNPVIPSKMSFILWLATKNRL 312

Query: 365 LTKQRLQKIGLVDNDLCALCGDKEESRAHLFFEC 466
           L   R   +      LC LC ++ ES AHLFF C
Sbjct: 313 LALDRAAFLN--KGFLCPLCTNEAESHAHLFFSC 344


>ref|NP_175039.1| RNA-directed DNA polymerase (reverse transcriptase)-related family
           protein [Arabidopsis thaliana]
           gi|332193871|gb|AEE31992.1| RNA-directed DNA polymerase
           (reverse transcriptase)-related family protein
           [Arabidopsis thaliana]
          Length = 320

 Score = 67.0 bits (162), Expect = 8e-09
 Identities = 31/57 (54%), Positives = 38/57 (66%)
 Frame = +2

Query: 299 VWSVMNVPKYCFISWLAANNRLLTKQRLQKIGLVDNDLCALCGDKEESRAHLFFECP 469
           VW   +VPK+ FI W+ A NRL T+ RL+  GL    +C LC   +ESRAHLFFECP
Sbjct: 155 VWFKNHVPKHAFICWVVAWNRLHTRDRLRSWGLSIPAVCLLCNSHDESRAHLFFECP 211


>ref|XP_006605183.1| PREDICTED: uncharacterized protein LOC102663533 [Glycine max]
          Length = 514

 Score = 65.9 bits (159), Expect = 2e-08
 Identities = 43/162 (26%), Positives = 77/162 (47%), Gaps = 12/162 (7%)
 Frame = +2

Query: 17  NEATVTKYVWNIEEKTDNLWVRWVNHFY*KDQDWWQYSPSIDCSWY*RKICIIKERYKQG 196
           N   + K +WNI  K DNLWV+W++ ++ K  +    +   + +W  + +  +K+R +  
Sbjct: 357 NVTAMLKCLWNICSKEDNLWVKWIHAYFLKGDNVMSATIKSNSTWILKSV--MKQRPQVN 414

Query: 197 YVGDGW--MKPNGKYTI----------HSGCLWRKGEMKVWNASRWVWSVMNVPKYCFIS 340
            +   W  M    K+++          H+   W +  +  +N +R        P+     
Sbjct: 415 NLQLVWIEMLRKRKFSMKQVYMELVEDHNKIDWFR--LLRYNRAR--------PRANVTL 464

Query: 341 WLAANNRLLTKQRLQKIGLVDNDLCALCGDKEESRAHLFFEC 466
           WLA  NRL TK RL+ + ++   LC+LC +++E   HL F C
Sbjct: 465 WLACQNRLATKTRLKNMNMIQCSLCSLCKEQDEDLDHLMFSC 506


>dbj|BAB01431.1| non-LTR retroelement reverse transcriptase-like protein
           [Arabidopsis thaliana]
          Length = 637

 Score = 62.4 bits (150), Expect = 2e-07
 Identities = 31/78 (39%), Positives = 40/78 (51%)
 Frame = +2

Query: 236 TIHSGCLWRKGEMKVWNASRWVWSVMNVPKYCFISWLAANNRLLTKQRLQKIGLVDNDLC 415
           TI + C W +G          VW   + PKY F++WLA +NRL T  RL K        C
Sbjct: 272 TISNECEWYRG----------VWFPSSTPKYSFVTWLAFHNRLATGDRLYKWNSEARATC 321

Query: 416 ALCGDKEESRAHLFFECP 469
             C ++ E+R HLFF CP
Sbjct: 322 VFCDEELETRDHLFFSCP 339


>ref|NP_567266.1| RNA-directed DNA polymerase (reverse transcriptase)-related family
           protein [Arabidopsis thaliana]
           gi|5732057|gb|AAD48956.1|AF149414_5 contains similarity
           to a family of Arabidopsis thaliana predicted proteins,
           which have similarity to reverse transcriptases; see
           T14P8.10 (GB:AF069298) [Arabidopsis thaliana]
           gi|7267223|emb|CAB80830.1| AT4g04650 [Arabidopsis
           thaliana] gi|332657009|gb|AEE82409.1| RNA-directed DNA
           polymerase (reverse transcriptase)-related family
           protein [Arabidopsis thaliana]
          Length = 332

 Score = 62.4 bits (150), Expect = 2e-07
 Identities = 30/56 (53%), Positives = 36/56 (64%)
 Frame = +2

Query: 299 VWSVMNVPKYCFISWLAANNRLLTKQRLQKIGLVDNDLCALCGDKEESRAHLFFEC 466
           VW   +VPK+ FI W+ A NRL T+ RLQ  GL     C LC   ++SRAHLFFEC
Sbjct: 133 VWFKNHVPKHAFICWVVAWNRLHTRDRLQNWGLSIPAECLLCNAHDDSRAHLFFEC 188


>gb|ABK28243.1| unknown [Arabidopsis thaliana]
          Length = 297

 Score = 62.4 bits (150), Expect = 2e-07
 Identities = 30/56 (53%), Positives = 36/56 (64%)
 Frame = +2

Query: 299 VWSVMNVPKYCFISWLAANNRLLTKQRLQKIGLVDNDLCALCGDKEESRAHLFFEC 466
           VW   +VPK+ FI W+ A NRL T+ RLQ  GL     C LC   ++SRAHLFFEC
Sbjct: 133 VWFKNHVPKHAFICWVVAWNRLHTRDRLQNWGLSIPAECLLCNAHDDSRAHLFFEC 188


>dbj|BAE98403.1| putative non-LTR reverse transcriptase [Arabidopsis thaliana]
          Length = 278

 Score = 62.4 bits (150), Expect = 2e-07
 Identities = 31/78 (39%), Positives = 40/78 (51%)
 Frame = +2

Query: 236 TIHSGCLWRKGEMKVWNASRWVWSVMNVPKYCFISWLAANNRLLTKQRLQKIGLVDNDLC 415
           TI + C W +G          VW   + PKY F++WLA +NRL T  RL K        C
Sbjct: 104 TISNECEWYRG----------VWFPSSTPKYSFVTWLAFHNRLATGDRLYKWNSEARATC 153

Query: 416 ALCGDKEESRAHLFFECP 469
             C ++ E+R HLFF CP
Sbjct: 154 VFCDEELETRDHLFFSCP 171


>gb|ABE65512.1| hypothetical protein At4g04650 [Arabidopsis thaliana]
          Length = 296

 Score = 62.4 bits (150), Expect = 2e-07
 Identities = 30/56 (53%), Positives = 36/56 (64%)
 Frame = +2

Query: 299 VWSVMNVPKYCFISWLAANNRLLTKQRLQKIGLVDNDLCALCGDKEESRAHLFFEC 466
           VW   +VPK+ FI W+ A NRL T+ RLQ  GL     C LC   ++SRAHLFFEC
Sbjct: 133 VWFKNHVPKHAFICWVVAWNRLHTRDRLQNWGLSIPAECLLCNAHDDSRAHLFFEC 188


>gb|AAC78274.1| putative reverse transcriptase [Arabidopsis thaliana]
          Length = 543

 Score = 62.4 bits (150), Expect = 2e-07
 Identities = 30/59 (50%), Positives = 37/59 (62%)
 Frame = +2

Query: 299 VWSVMNVPKYCFISWLAANNRLLTKQRLQKIGLVDNDLCALCGDKEESRAHLFFECPKG 475
           VW    VPK+ FISW+ A NRL T+ RL+  GL+    C LC   +E+R HLFF C KG
Sbjct: 458 VWFTNQVPKHAFISWVTAWNRLHTRDRLRSWGLIVPAECVLCNLVDETRDHLFFACFKG 516


>gb|AAD32866.1|AC005489_4 F14N23.4 [Arabidopsis thaliana]
          Length = 1161

 Score = 62.0 bits (149), Expect = 3e-07
 Identities = 28/56 (50%), Positives = 35/56 (62%)
 Frame = +2

Query: 299  VWSVMNVPKYCFISWLAANNRLLTKQRLQKIGLVDNDLCALCGDKEESRAHLFFEC 466
            VW   +VPK  FI W+ A+NRL T+ RL++ G      C LC D +ESR HLFF C
Sbjct: 990  VWFKDHVPKQAFICWVVAHNRLHTRDRLRRWGFSIPPTCVLCNDLDESREHLFFRC 1045


>ref|XP_002865536.1| hypothetical protein ARALYDRAFT_917542 [Arabidopsis lyrata subsp.
           lyrata] gi|297311371|gb|EFH41795.1| hypothetical protein
           ARALYDRAFT_917542 [Arabidopsis lyrata subsp. lyrata]
          Length = 227

 Score = 60.8 bits (146), Expect = 6e-07
 Identities = 37/95 (38%), Positives = 51/95 (53%), Gaps = 11/95 (11%)
 Frame = +2

Query: 218 KPNGKYTI----------HSGCLWRKGEMKV-WNASRWVWSVMNVPKYCFISWLAANNRL 364
           KP  +YT+          H+  L RK + KV W+ S  VW    VP+Y FI WLA  ++L
Sbjct: 49  KPLWRYTLDDYDSSFTSRHTWNLLRKAKHKVLWHNS--VWFPQRVPRYSFIVWLAVKDQL 106

Query: 365 LTKQRLQKIGLVDNDLCALCGDKEESRAHLFFECP 469
            T  R++  G+     C  C +++ESR HLFF CP
Sbjct: 107 STGTRMRAWGV--EQPCVFCRERDESRDHLFFACP 139


>ref|XP_003566528.1| PREDICTED: uncharacterized protein LOC100844973 [Brachypodium
           distachyon]
          Length = 448

 Score = 60.5 bits (145), Expect = 8e-07
 Identities = 33/84 (39%), Positives = 41/84 (48%)
 Frame = +2

Query: 215 MKPNGKYTIHSGCLWRKGEMKVWNASRWVWSVMNVPKYCFISWLAANNRLLTKQRLQKIG 394
           + PNG Y+  S    +       N  R +WS   +PK  F +WLA  NRL T  RL   G
Sbjct: 263 LTPNGIYSTSSAYKVQFFGATACNFHRLIWSPWALPKCRFFAWLAIQNRLWTNDRLAARG 322

Query: 395 LVDNDLCALCGDKEESRAHLFFEC 466
                LC LCG   E+ AHL F+C
Sbjct: 323 WPHQPLCKLCGVCPETAAHLLFDC 346


>ref|XP_002448781.1| hypothetical protein SORBIDRAFT_06g033056 [Sorghum bicolor]
           gi|241939964|gb|EES13109.1| hypothetical protein
           SORBIDRAFT_06g033056 [Sorghum bicolor]
          Length = 206

 Score = 60.5 bits (145), Expect = 8e-07
 Identities = 33/85 (38%), Positives = 42/85 (49%)
 Frame = +2

Query: 215 MKPNGKYTIHSGCLWRKGEMKVWNASRWVWSVMNVPKYCFISWLAANNRLLTKQRLQKIG 394
           ++ +G+YT  S    +     V N    +W V   PK  +  WL   NRL T  RLQ  G
Sbjct: 21  LESSGEYTAKSAYAAQFAGNIVSNHPALIWRVWATPKCKYFIWLLIQNRLWTAARLQLRG 80

Query: 395 LVDNDLCALCGDKEESRAHLFFECP 469
             +N  CALC    E+  HLFFECP
Sbjct: 81  WTNNYFCALCERNLETAHHLFFECP 105


>dbj|BAD66732.1| orf147a [Beta vulgaris subsp. vulgaris] gi|54606753|dbj|BAD66776.1|
           orf147a [Beta vulgaris subsp. vulgaris]
          Length = 147

 Score = 60.5 bits (145), Expect = 8e-07
 Identities = 35/84 (41%), Positives = 44/84 (52%), Gaps = 4/84 (4%)
 Frame = +2

Query: 320 PKYCFISWLAANNRLLTKQRLQKIGLVDNDLCALCGDKEESRAHLFFECPKGVFI----L 487
           PK  FI+WL   +RL T  RLQK G+V + LC LCG+ +E+R HLFF C     I    L
Sbjct: 12  PKCTFITWLTILDRLATCDRLQKFGIVCDQLCVLCGNVDETRDHLFFVCEFSYEIWSSLL 71

Query: 488 CYNGCILEFRIQSCRAYGGDWSKV 559
           C+ G          +   GDW  V
Sbjct: 72  CWLG---------IQRTAGDWQGV 86


Top