BLASTX nr result

ID: Angelica23_contig00032417 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica23_contig00032417
         (1048 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003635424.1| PREDICTED: LOW QUALITY PROTEIN: putative rib...   110   7e-30
ref|XP_002518871.1| conserved hypothetical protein [Ricinus comm...   103   7e-20
ref|XP_002522452.1| nucleic acid binding protein, putative [Rici...    86   2e-18
ref|XP_002525961.1| hypothetical protein RCOM_0596960 [Ricinus c...    98   3e-18
gb|AAG13524.1|AC068924_29 putative non-LTR retroelement reverse ...    77   6e-18

>ref|XP_003635424.1| PREDICTED: LOW QUALITY PROTEIN: putative ribonuclease H protein
            At1g65750-like [Vitis vinifera]
          Length = 820

 Score =  110 bits (274), Expect(2) = 7e-30
 Identities = 72/235 (30%), Positives = 105/235 (44%), Gaps = 11/235 (4%)
 Frame = +1

Query: 121  APPKAKDTIRRAGTHCLPTKMNL*GKRVPINSVYPLCNIYNETTSHCLVSCEFSWNCWVV 300
            APPK  +   RA  +CLPT+  L  + V      P+C    ETT H LV C  + + W  
Sbjct: 516  APPKILNFAWRAARNCLPTRFALTIRHVDTPMCCPICRSELETTLHALVECVAARDVWDE 575

Query: 301  SGLNVPGRESISFYQWMGEVLEQGDAETTAKVVMICWSIWKARNDIVWNQR-WRSVDEVV 477
            SGL +      SF  W+  +    D    AK + +CW +W  RND+VWN R W S   V 
Sbjct: 576  SGLAMLQGNFGSFVDWLATMFAYCDFVVFAKYLAVCWGLWWRRNDVVWNGRIWHSQQVVN 635

Query: 478  AFAMLSLNQYVAAQNMG---SIPSLSPLLEGDGAERWARPVANTIKVNVDASIFEKEKGY 648
                +  + + A + +    ++PS S         +W +P    IK+NVD ++F  +   
Sbjct: 636  GCFTMLESWFHANETLATAVTVPSYS--------SKWQKPDYGWIKINVDGAVFPDKGAI 687

Query: 649  GYAFXXXXXXXXXXXXSARFLSGVVS-------PSLAEAIGIKEALSWVKEP*RS 792
            G  F              RF+ G          P + EA+G++E LSW+ E  RS
Sbjct: 688  GAVF---------RDHQGRFMGGFAKPFPHQTLPKVVEALGVREVLSWIHERSRS 733



 Score = 47.8 bits (112), Expect(2) = 7e-30
 Identities = 23/69 (33%), Positives = 41/69 (59%)
 Frame = +3

Query: 783 LKEHQGRRVEIESDSLVSIQAIRSSNSMYSGFGLVIQECRCLVESLSNIALNFVKRSANR 962
           + E    R+ +E+D L  +QAI+  +   + FG +I +C  +++ L ++ + + +RSAN 
Sbjct: 727 IHERSRSRIVVETDCLRVVQAIQHKSCPNTSFGFIIVDCLDVLQHLVDVQVVYARRSANS 786

Query: 963 AAHVLARHA 989
           AAH LA  A
Sbjct: 787 AAHCLANGA 795


>ref|XP_002518871.1| conserved hypothetical protein [Ricinus communis]
            gi|223541858|gb|EEF43404.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 668

 Score =  103 bits (257), Expect = 7e-20
 Identities = 74/277 (26%), Positives = 121/277 (43%), Gaps = 19/277 (6%)
 Frame = +1

Query: 7    IILGIQINQSNLIDSWYW-----------SAYRWLQS------TSSENQGSELSKAPPKA 135
            +I  I +   N+ DSWYW           + YR LQ       TS  N+  +L K P K 
Sbjct: 382  LIESIPLTNRNVEDSWYWLFDDKGNYLVKNCYRLLQGDLGRPETSFWNKFWKL-KIPTKV 440

Query: 136  KDTIRRAGTHCLPTKMNL*GKRVPINSVYPLCNIYNETTSHCLVSCEFSWNCWVVSGLNV 315
            ++ + +  ++C+ T MNL  + V ++ +   CN   ET  H L  C  +  CW + G   
Sbjct: 441  RNLMWKICSNCIRTAMNLRMRYVDLDPLCKWCNREPETLFHVLFGCSMARECWDLLGFRF 500

Query: 316  PGRESISFYQWMGEVLEQGDAETTAKVVMICWSIWKARNDIVWNQRWRSVDEVVAFAMLS 495
                      W+         +T AK+ ++C S+W  RN  VWN++  +   V++ A   
Sbjct: 501  SFPTESPICSWIQNFFIDNHEDTCAKMALVCGSLWNQRNHWVWNKQSNTTYGVISPANQL 560

Query: 496  LNQYVAAQNMGSIPSLSPLL--EGDGAERWARPVANTIKVNVDASIFEKEKGYGYAFXXX 669
            L Q   AQ+      ++P L    D   +W +P    +K+N D ++F K+   G      
Sbjct: 561  LEQRTRAQSADE--EIAPPLGHNTDVTRKWTKPELGWLKINTDTALFLKQGRIGLGCVVR 618

Query: 670  XXXXXXXXXSARFLSGVVSPSLAEAIGIKEALSWVKE 780
                      +    G  +   AEA+ +KEALS++K+
Sbjct: 619  DSNGRMIMARSERRVGRFTAREAEALSMKEALSYMKD 655


>ref|XP_002522452.1| nucleic acid binding protein, putative [Ricinus communis]
           gi|223538337|gb|EEF39944.1| nucleic acid binding
           protein, putative [Ricinus communis]
          Length = 483

 Score = 85.5 bits (210), Expect(2) = 2e-18
 Identities = 63/215 (29%), Positives = 92/215 (42%)
 Frame = +1

Query: 130 KAKDTIRRAGTHCLPTKMNL*GKRVPINSVYPLCNIYNETTSHCLVSCEFSWNCWVVSGL 309
           K K  + RA T+ LP + NL  ++V  +S  P C    ET  H LV C+ +   W    L
Sbjct: 195 KCKVFMWRALTNRLPVRTNLVMRKVTEDSSCPCCVSQPETIMHILVLCDVTTQSWKYVNL 254

Query: 310 NVPGRESISFYQWMGEVLEQGDAETTAKVVMICWSIWKARNDIVWNQRWRSVDEVVAFAM 489
                +  +  +    V E  D    A  V++ WS+W   ND+VWN +  S   V + A 
Sbjct: 255 YQFLSQVSNLLEGARSVFEHFDDSIVASFVVLWWSLWTNMNDVVWNGKKLSWRAVASRAS 314

Query: 490 LSLNQYVAAQNMGSIPSLSPLLEGDGAERWARPVANTIKVNVDASIFEKEKGYGYAFXXX 669
             L Q+  A+ +     L     G G   W +P     K+NVDAS   +    G +F   
Sbjct: 315 SFLFQWGKARKLTDYHQLGRHFAG-GDCLWQKPATGKYKLNVDASSSAERGKSGASFVLR 373

Query: 670 XXXXXXXXXSARFLSGVVSPSLAEAIGIKEALSWV 774
                           + +P +AEA  +KEALSW+
Sbjct: 374 DNAGIWITGVLIIRPYIANPDVAEAWALKEALSWI 408



 Score = 33.9 bits (76), Expect(2) = 2e-18
 Identities = 21/65 (32%), Positives = 37/65 (56%)
 Frame = +3

Query: 807  VEIESDSLVSIQAIRSSNSMYSGFGLVIQECRCLVESLSNIALNFVKRSANRAAHVLARH 986
            V+IE+D L +I+ +       S    ++++C+ L+  L+   L FV  SAN  AH++++ 
Sbjct: 416  VQIETDCLRNIELLEEELHPNSYLLCLLKDCQDLLRVLNRCNLVFVYGSANTVAHMISK- 474

Query: 987  ARCLA 1001
            A C A
Sbjct: 475  ATCSA 479


>ref|XP_002525961.1| hypothetical protein RCOM_0596960 [Ricinus communis]
           gi|223534693|gb|EEF36385.1| hypothetical protein
           RCOM_0596960 [Ricinus communis]
          Length = 270

 Score = 98.2 bits (243), Expect = 3e-18
 Identities = 55/195 (28%), Positives = 95/195 (48%), Gaps = 2/195 (1%)
 Frame = +1

Query: 199 RVPINSVYPLCNIYNETTSHCLVSCEFSWNCWVVSGLNVPGRESISFYQWMGEVLEQGDA 378
           R+   ++ PLC+   E+  H L+ C F    WV+S L   G  + S   W+ E+  + D 
Sbjct: 76  RIDNQNLCPLCSNPGESIYHALIDCSFVKEVWVISVLVSRGNRNGSICDWLQEIFTKFDK 135

Query: 379 ETTAKVVMICWSIWKARNDIVWNQRWRSVDEVVAFAMLSLNQYVAAQNMGSIPSLSPLLE 558
            T   +  I W++W  RN++VWN + +   ++V  A+  L +++ AQ        +  + 
Sbjct: 136 STWHCIAAIVWNLWIHRNEVVWNSKRKQPRQIVDGAVTYLQRWLIAQQTNPPSPDNNEVF 195

Query: 559 GDGAERWARPVANTIKVNVDASIFEKEK--GYGYAFXXXXXXXXXXXXSARFLSGVVSPS 732
                +W +P + ++K NVDAS F ++   G GY              ++ F+   ++  
Sbjct: 196 NHNLAKWKKPKSGSLKCNVDASTFNQQGMIGAGYVLRNTEGALLGARITS-FVQSDLNLK 254

Query: 733 LAEAIGIKEALSWVK 777
           LAEA+  +EALSW K
Sbjct: 255 LAEALSFREALSWTK 269


>gb|AAG13524.1|AC068924_29 putative non-LTR retroelement reverse transcriptase [Oryza sativa
            Japonica Group]
          Length = 1382

 Score = 77.4 bits (189), Expect(2) = 6e-18
 Identities = 62/227 (27%), Positives = 93/227 (40%), Gaps = 7/227 (3%)
 Frame = +1

Query: 121  APPKAKDTIRRAGTHCLPTKMNL*GKRVPINSVYPLCNIYNETTSHCLVSCEFSWNCWV- 297
            AP K K T+ RA   CL T   L  + +P       CN  ++T  H  + C F+   W  
Sbjct: 1071 APGKMKITLWRAAHECLATGFQLRRRHIPSTDGCVFCN-RDDTVEHVFLFCPFAAQIWEE 1129

Query: 298  VSGLNVP--GRESIS-FYQWMGEVLEQGDAETTAKVVMICWSIWKARNDIVWNQRWRSVD 468
            + G      GR   S   QW+ + L++G +     + +  W IW+ARN+   N       
Sbjct: 1130 IKGKCAVKLGRNGFSTMRQWIFDFLKRGSSHANTLLAVTFWHIWEARNN-TKNNNGTVHP 1188

Query: 469  EVVAFAMLSLNQYVAAQNMGSIPSLSPLLEGDGAE---RWARPVANTIKVNVDASIFEKE 639
            + V   +LS    +   N  ++        G   +   RW  P A+   +N DA+IF   
Sbjct: 1189 QRVVIKILSYVDMILKHNTKTVDG----QRGGNTQAIPRWQPPPASVWMINSDAAIFSSS 1244

Query: 640  KGYGYAFXXXXXXXXXXXXSARFLSGVVSPSLAEAIGIKEALSWVKE 780
            +  G                +  +S VV P LAEA+ I+ AL   KE
Sbjct: 1245 RTMGVGALIRDNTGKCLVACSEMISDVVLPELAEALAIRRALGLAKE 1291



 Score = 40.4 bits (93), Expect(2) = 6e-18
 Identities = 25/71 (35%), Positives = 37/71 (52%)
 Frame = +3

Query: 777  GALKEHQGRRVEIESDSLVSIQAIRSSNSMYSGFGLVIQECRCLVESLSNIALNFVKRSA 956
            G  KE     + + SD L  I+ I++S    SG G VI++ + L  +    +   V R +
Sbjct: 1287 GLAKEEGLEHIVMASDCLTVIRRIQTSGRDRSGVGCVIEDIKKLASTFVLCSFMHVNRLS 1346

Query: 957  NRAAHVLARHA 989
            N AAH LAR+A
Sbjct: 1347 NLAAHSLARNA 1357


Top