BLASTX nr result

ID: Angelica23_contig00004242 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica23_contig00004242
         (1586 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CAN68960.1| hypothetical protein VITISV_019275 [Vitis vinifera]   226   2e-56
ref|XP_002324096.1| predicted protein [Populus trichocarpa] gi|2...   193   9e-47
ref|XP_002526084.1| hypothetical protein RCOM_0524510 [Ricinus c...   191   5e-46
ref|XP_003530425.1| PREDICTED: uncharacterized protein LOC100787...   177   7e-42
ref|XP_003551752.1| PREDICTED: uncharacterized protein LOC100798...   172   3e-40

>emb|CAN68960.1| hypothetical protein VITISV_019275 [Vitis vinifera]
          Length = 477

 Score =  226 bits (575), Expect = 2e-56
 Identities = 164/415 (39%), Positives = 225/415 (54%), Gaps = 20/415 (4%)
 Frame = -2

Query: 1348 QSHISQQPHHSYYPQSP-LLQNPKPHFQSLSRQDHFQENNEINHTVISSLLRRISALESS 1172
            +SHIS   H++Y+PQ P  LQ               Q+  +  HT++SSLLRRI ALESS
Sbjct: 69   KSHIS---HNNYHPQRPSFLQE--------------QQQQQQTHTLLSSLLRRIDALESS 111

Query: 1171 VR--CTSSFTLRDAAARTIQTHFRAFLVHRSRTLRQLKDLAVIKSALNTLKLNHSNNTRS 998
            +    T S++LRDAAARTIQTHFRAFLV RSRTL  LK+LA+IKSA N+L+ + S  T  
Sbjct: 112  LLHFSTPSYSLRDAAARTIQTHFRAFLVRRSRTLAHLKELALIKSAFNSLRSSLSQQTHF 171

Query: 997  DXXXXXXXXXXXXXXLEFIQGSDPMIRDAKRSINKELIRFMDYLDELSVKRHQLSTRVVK 818
            D              L+ IQ SD MIRD KRS+ +EL+RF+D++D +S +RHQL  + ++
Sbjct: 172  DFEALSHKAMDLLLKLDSIQDSDSMIRDGKRSVTRELVRFLDFIDGVSARRHQLRNKPIR 231

Query: 817  NLRIGVSGTKSR----GFGSDHRGSGSETRGPREDGERELLEKLRKRVEKMEGYSR---- 662
            N+R   + +KSR      GS+ R SG++        +REL+EKLR RVEK+ G+SR    
Sbjct: 232  NMRTAQNASKSRVLTGRVGSNCRDSGAD--------QRELMEKLRDRVEKIRGFSRVLEK 283

Query: 661  ----ASXXXXXXXXXXXXDSRLYVNGKPVVERVRSGVLMKRQAGAPSKAKKSVRFAENGN 494
                              +  + +  K  V + R+G+L+KR    P + KKSV FAEN N
Sbjct: 284  GEEDVELEGFQHLSDDEENPSISIIEKGRVSKARNGILVKRHELEP-RVKKSVSFAENXN 342

Query: 493  VYMVYKGSNGPVSFVECHSNDGSDSV-----DAEEMGRXXXXXXXXXXXXXXXXXXXXXX 329
            +  V   ++ PVS  +  S   ++ V     + E+MG                       
Sbjct: 343  LSRVLGSTHEPVSGEDGPSMGQTELVENLSGEVEDMG-GLSRETEDDDGGDIESGGSSQT 401

Query: 328  XXDGRDPRRNVVAESDNDSSGHEPIENEDGTFVFSAPLPVKMEPRADLMNKKSLK 164
              D R+P RN+  E   D    E  + +D +FVFSAPLP+KME RADLM +K  K
Sbjct: 402  SDDERNPIRNLGTE---DGHEVEHYQYQDXSFVFSAPLPLKMESRADLMKRKGGK 453


>ref|XP_002324096.1| predicted protein [Populus trichocarpa] gi|222867098|gb|EEF04229.1|
            predicted protein [Populus trichocarpa]
          Length = 423

 Score =  193 bits (491), Expect = 9e-47
 Identities = 127/326 (38%), Positives = 185/326 (56%), Gaps = 19/326 (5%)
 Frame = -2

Query: 1333 QQPHHSYYPQSPLLQNPKPHFQSLSRQD--HFQENNEIN--HTVISSLLRRISALESSVR 1166
            Q  HH  +  SP L  P  H ++  ++   HFQ+ ++    H V++SLL+RI+ LESS+ 
Sbjct: 66   QLQHHQSHIFSPCLDRPNSHTKNHHQKPKLHFQQEDDPQQTHFVLASLLQRINTLESSLH 125

Query: 1165 -----CTS-----SFTLRDAAARTIQTHFRAFLVHRSRTLRQLKDLAVIKSALNTLKLNH 1016
                 CT+     S +LRD AAR IQTHFRAFLVHRSRTLRQLK+LA IKS+ N+LK + 
Sbjct: 126  QFSASCTNNHNYPSHSLRDTAARVIQTHFRAFLVHRSRTLRQLKELAFIKSSFNSLKSSI 185

Query: 1015 SNNTRSDXXXXXXXXXXXXXXLEFIQGSDPMIRDAKRSINKELIRFMDYLDELSVKRHQL 836
            S  +  D              ++ IQG D MIRD KRS+ ++L+RF++++D  ++KRH+L
Sbjct: 186  STESHFDFKVASHKAMGLLLKIDSIQGGDTMIRDGKRSVTRDLVRFLEFVDGFAIKRHEL 245

Query: 835  STRVVKNLRIGVSGTKSRGFGSDHRGSGSETRGPREDGERELLEKLRKRVEKMEGYSRAS 656
            S +  +N+R   +  K+R   + +   G        + +RE+++KLRKRVEK+ G+SRA 
Sbjct: 246  SYKSARNVRALGNTNKARALNAKNGYGGCR---DLTESQREIVDKLRKRVEKISGFSRAC 302

Query: 655  XXXXXXXXXXXXDSRL-----YVNGKPVVERVRSGVLMKRQAGAPSKAKKSVRFAENGNV 491
                           +      +N K  V+  R GV +K++ G P + KK+V FAENGN 
Sbjct: 303  ENDQEDVELEGFQQFVDDGDGELNRKVSVDGKR-GVSLKKRVGHP-RVKKTVSFAENGNS 360

Query: 490  YMVYKGSNGPVSFVECHSNDGSDSVD 413
            Y V   ++  V   +   NDGSD  D
Sbjct: 361  YRVISDTDESVLNGDDSFNDGSDYSD 386


>ref|XP_002526084.1| hypothetical protein RCOM_0524510 [Ricinus communis]
            gi|223534581|gb|EEF36278.1| hypothetical protein
            RCOM_0524510 [Ricinus communis]
          Length = 465

 Score =  191 bits (485), Expect = 5e-46
 Identities = 146/412 (35%), Positives = 210/412 (50%), Gaps = 19/412 (4%)
 Frame = -2

Query: 1336 SQQPH-HSYYPQSPLLQNPKPHFQSLSRQDHFQENNEINHTVISSLLRRISALESSV--- 1169
            + +PH H  + Q    Q  K +FQ LS +    ++ + N  ++SSLL+RI  LESS+   
Sbjct: 77   NNKPHLHKNHNQRLHFQPQKFNFQHLSEE----QDQDTNSFILSSLLQRIKILESSLQQF 132

Query: 1168 -----RCTSSFTLRDAAARTIQTHFRAFLVHRSRTLRQLKDLAVIKSALNTLKLNHSNNT 1004
                 RC  S++LR+ AAR IQTHFRAFLV RSRTL QL+DLA IKS+ N +K +  NNT
Sbjct: 133  SVSNRRCHHSYSLRETAARVIQTHFRAFLVRRSRTLSQLQDLASIKSSFNAMKSSVLNNT 192

Query: 1003 RSDXXXXXXXXXXXXXXLEFIQGSDPMIRDAKRSINKELIRFMDYLDELSVKRHQLST-R 827
                             ++ IQG DP+IRD K+SI+++++RF+D++D L  K    S  +
Sbjct: 193  HLSHAVVSHRAMGLLLKIDSIQGGDPIIRDGKKSISRDIVRFLDFIDGLPGKGQGSSLYK 252

Query: 826  VVKNLRIGVSGTKSRGFGSDHRGSGSETRGPREDGERELLEKLRKRVEKMEGYSRASXXX 647
             VKN+R     +KSR   S+  G    +R      ++E++E L +RVEK+ G+SR     
Sbjct: 253  PVKNVRFIRKMSKSRASNSNV-GYEDLSRN-----QKEIVENLSERVEKIRGFSRVYEND 306

Query: 646  XXXXXXXXXDSRLYVNGKPV---VERVRSGVLMKRQAGAPSKAKKSVRFAENGNVYMVYK 476
                        +  +       V ++R+G+L+K     P + KKSV F E+GNVY ++ 
Sbjct: 307  EEDVELEGFQELIDDDEDEENIKVSKIRNGILVKSNGSKP-RVKKSVSFDEDGNVYRIFS 365

Query: 475  GSNGPVSFVECHSNDGSDSVDAEEMGRXXXXXXXXXXXXXXXXXXXXXXXXDGRDPRRNV 296
             ++  V   +    DGSDS D                                 D  RN 
Sbjct: 366  DTHESVLNGDGSFTDGSDSSD----------DHGETLQKDEVVQEENPASSQSSDAERNH 415

Query: 295  VAESDNDSSGH-----EPIENEDGTFVFSAPLPVKMEPRADLMNK-KSLKII 158
            V    N  SG         +++DG  VFSAP+PVKME RADLM K K++KI+
Sbjct: 416  VR---NSRSGEYYEISRYCQDQDGNLVFSAPMPVKMESRADLMKKRKAVKIV 464


>ref|XP_003530425.1| PREDICTED: uncharacterized protein LOC100787996 [Glycine max]
          Length = 452

 Score =  177 bits (449), Expect = 7e-42
 Identities = 142/425 (33%), Positives = 205/425 (48%), Gaps = 27/425 (6%)
 Frame = -2

Query: 1351 CQSHISQQPHHSYYPQSPLLQNPKPHFQSLS--RQDHFQENNEIN------------HTV 1214
            C +    Q HH     + LL  P+PH   +    Q H+   N +N            H+ 
Sbjct: 45   CCTPSPPQHHHLLQVIASLLSQPQPHPIPIPIPSQQHYTHQN-LNLPTQNHHPQRQPHST 103

Query: 1213 ISSLLRRISALESSVRCTSSFTLRDAAARTIQTHFRAFLVHRSRTLRQLKDLAVIKSALN 1034
            +SSLL RI  LESS+   +  +LR AAAR IQTHFR+ L  RSRTL QLK LA IKS  N
Sbjct: 104  MSSLLHRIETLESSLNHYTHHSLRHAAARLIQTHFRSLLARRSRTLSQLKHLASIKSTFN 163

Query: 1033 TLKLNHSNNTRSDXXXXXXXXXXXXXXLEFIQGSDPMIRDAKRSINKELIRFMDYLDELS 854
             LK + S++T  D              L+ IQG DPMI D KRSI+++L++F+D ++E++
Sbjct: 164  ALKSSFSSHTHVDFAAISLKAMNLLLELDSIQGCDPMIVDGKRSISRDLVQFLDSVEEVA 223

Query: 853  VKRHQLSTRVVKNLRIGVSGTKSRGFGSDHRGSGSETRGPREDGERELLEKLRKRVEKME 674
            +K+H L  +  K +R G  G K +     HR S        +D  R+LL+ LR RVEK+ 
Sbjct: 224  LKKHVLYVKAAKPVRSG--GKKVQ----KHRNSD-------DDERRKLLQNLRGRVEKLS 270

Query: 673  GYSRAS-----XXXXXXXXXXXXDSRLYVNGKPVVERVRSGVLMKRQAGAPSKAKKSVRF 509
               + S                  + + + G+  V + ++GV + RQ GA    KKSVRF
Sbjct: 271  KLCKVSANDEEDSESEEGIHDNGVTNVLIGGRNEVPQNKNGVFLPRQ-GAQPGVKKSVRF 329

Query: 508  AENGNVYMVYKGSNGPVSFVECHSNDG--------SDSVDAEEMGRXXXXXXXXXXXXXX 353
            A+N N+  VY G +   S   C S+D         S +V+ + +G               
Sbjct: 330  AKNRNICEVYSG-DVACSDGSCSSSDELGDVLENVSGAVEDDSVGSSQGAEDDEEVLVVE 388

Query: 352  XXXXXXXXXXDGRDPRRNVVAESDNDSSGHEPIENEDGTFVFSAPLPVKMEPRADLMNKK 173
                        R+ RR +V  +D  +   E ++      +FSAPLP+KME R+   N K
Sbjct: 389  SGGSPRSSDDGERNTRRVLV--NDGRNVVKEQLQAHREKLLFSAPLPLKMENRSGSRNSK 446

Query: 172  SLKII 158
             +KI+
Sbjct: 447  GVKIL 451


>ref|XP_003551752.1| PREDICTED: uncharacterized protein LOC100798286 [Glycine max]
          Length = 452

 Score =  172 bits (435), Expect = 3e-40
 Identities = 139/422 (32%), Positives = 195/422 (46%), Gaps = 31/422 (7%)
 Frame = -2

Query: 1330 QPHHSYYPQSPLLQNPKPHFQSLSRQDHFQ------------ENNEINH-------TVIS 1208
            Q HH     S LL  P+PH   +  Q H+             +N+ + H       + IS
Sbjct: 48   QHHHLLQAISSLLSQPQPHLIPIPSQQHYTKSYTHQNLKLPTQNHHLQHPHQHQTHSTIS 107

Query: 1207 SLLRRISALESSVRCTSSFTLRDAAARTIQTHFRAFLVHRSRTLRQLKDLAVIKSALNTL 1028
            SLL RI +LESS    +  +LR AAAR IQTHFR+FL  RSRTL QLK LA IKS  N L
Sbjct: 108  SLLDRIESLESSFNHYTHHSLRHAAARVIQTHFRSFLARRSRTLAQLKHLASIKSTFNAL 167

Query: 1027 KLNHSNNTRSDXXXXXXXXXXXXXXLEFIQGSDPMIRDAKRSINKELIRFMDYLDELSVK 848
            K + SN+T  D              L+ IQG DPMI D KRSI+++L++F+D ++E+++K
Sbjct: 168  KSSFSNHTHVDFAAISLKAMNLLLELDSIQGCDPMIVDGKRSISRDLVQFLDSIEEVALK 227

Query: 847  RHQLSTRVVKNLRIGVSGTKSRGFGSDHRGSGSETRGPREDGERELLEKLRKRVEKMEGY 668
            +H L  +  K +       K+R    D R              R+LL+ LR RVEK+   
Sbjct: 228  KHVLHVKAGKTVGSVKKVQKNRNSDDDER--------------RKLLQNLRCRVEKLSRL 273

Query: 667  SRAS-----XXXXXXXXXXXXDSRLYVNGKPVVERVRSGVLMKRQAGAPSKAKKSVRFAE 503
             + S                  + + + G+  V   ++GV + R+ G P   KKSVRFAE
Sbjct: 274  CKVSANDEEDSESGESIHDDGVTNVLIGGRNEVSPNKNGVCLLRR-GEPG-VKKSVRFAE 331

Query: 502  NGNVYMVYKG----SNGPVSFVECH---SNDGSDSVDAEEMGRXXXXXXXXXXXXXXXXX 344
            N N+  VY G    S+G  S  +     S + S +V+   +                   
Sbjct: 332  NRNICEVYSGDVACSDGSCSSSDEQGEVSENVSGAVEDNGVDSSQGAEDHEEVLVFDSGG 391

Query: 343  XXXXXXXDGRDPRRNVVAESDNDSSGHEPIENEDGTFVFSAPLPVKMEPRADLMNKKSLK 164
                     R+ RR  V    N     E ++      +FSAPLP+KME R+   N K +K
Sbjct: 392  LPHSSDDGERNTRRLFVKVGRNVVK--EQLQAHQEKLLFSAPLPLKMENRSGSKNSKGVK 449

Query: 163  II 158
            I+
Sbjct: 450  IL 451


Top