BLASTX nr result

ID: Angelica22_contig00014313 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica22_contig00014313
         (1691 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CAN68960.1| hypothetical protein VITISV_019275 [Vitis vinifera]   228   5e-57
ref|XP_002324096.1| predicted protein [Populus trichocarpa] gi|2...   193   1e-46
ref|XP_002526084.1| hypothetical protein RCOM_0524510 [Ricinus c...   193   1e-46
ref|XP_003530425.1| PREDICTED: uncharacterized protein LOC100787...   179   2e-42
ref|XP_003551752.1| PREDICTED: uncharacterized protein LOC100798...   174   8e-41

>emb|CAN68960.1| hypothetical protein VITISV_019275 [Vitis vinifera]
          Length = 477

 Score =  228 bits (580), Expect = 5e-57
 Identities = 163/415 (39%), Positives = 222/415 (53%), Gaps = 20/415 (4%)
 Frame = +1

Query: 286  QSHISQQPHHSYYPQSP-LLQNPKPHFQSLSRQDHFQENNEINHTVISSLLRRISALESS 462
            +SHIS   H++Y+PQ P  LQ               Q+  +  HT++SSLLRRI ALESS
Sbjct: 69   KSHIS---HNNYHPQRPSFLQE--------------QQQQQQTHTLLSSLLRRIDALESS 111

Query: 463  VR--CTSSFTLRDAAARTIQTHFRAFLVHRSRTLRQLKDLAVIKSALNTLKLNHSNNTRS 636
            +    T S++LRDAAARTIQTHFRAFLV RSRTL  LK+LA+IKSA N+L+ + S  T  
Sbjct: 112  LLHFSTPSYSLRDAAARTIQTHFRAFLVRRSRTLAHLKELALIKSAFNSLRSSLSQQTHF 171

Query: 637  DXXXXXXXXXXXXXXXEFIQGSDPMIRDAKRSINKELIRFMDYLDELSVKRHQLSTRVVK 816
            D               + IQ SD MIRD KRS+ +EL+RF+D++D +S +RHQL  + ++
Sbjct: 172  DFEALSHKAMDLLLKLDSIQDSDSMIRDGKRSVTRELVRFLDFIDGVSARRHQLRNKPIR 231

Query: 817  NLRIGVSGTKSR----GFGSDHRGSGSETRGPREDGERELLEKLRKRVEKMEGYSR---- 972
            N+R   + +KSR      GS+ R SG++        +REL+EKLR RVEK+ G+SR    
Sbjct: 232  NMRTAQNASKSRVLTGRVGSNCRDSGAD--------QRELMEKLRDRVEKIRGFSRVLEK 283

Query: 973  ----ASXXXXXXXXXXXXXSRLYVNGKPVVERVRNGVLMKRQAGAPSKAKKSVRFAENGN 1140
                                 + +  K  V + RNG+L+KR    P + KKSV FAEN N
Sbjct: 284  GEEDVELEGFQHLSDDEENPSISIIEKGRVSKARNGILVKRHELEP-RVKKSVSFAENXN 342

Query: 1141 VYMVYKGSNGPVSFVECHSNDGSDSV-----DAEEMGRXXXXXXXXXXXXXXXXXXXXXX 1305
            +  V   ++ PVS  +  S   ++ V     + E+MG                       
Sbjct: 343  LSRVLGSTHEPVSGEDGPSMGQTELVENLSGEVEDMG-GLSRETEDDDGGDIESGGSSQT 401

Query: 1306 XXXGRDPRRNVVAESDNDSSGHEPIENEDGTFVFSAPLPVKMEPRADLMNKKSLK 1470
                R+P RN+  E   D    E  + +D +FVFSAPLP+KME RADLM +K  K
Sbjct: 402  SDDERNPIRNLGTE---DGHEVEHYQYQDXSFVFSAPLPLKMESRADLMKRKGGK 453


>ref|XP_002324096.1| predicted protein [Populus trichocarpa] gi|222867098|gb|EEF04229.1|
            predicted protein [Populus trichocarpa]
          Length = 423

 Score =  193 bits (491), Expect = 1e-46
 Identities = 127/326 (38%), Positives = 184/326 (56%), Gaps = 19/326 (5%)
 Frame = +1

Query: 301  QQPHHSYYPQSPLLQNPKPHFQSLSRQD--HFQENNEIN--HTVISSLLRRISALESSVR 468
            Q  HH  +  SP L  P  H ++  ++   HFQ+ ++    H V++SLL+RI+ LESS+ 
Sbjct: 66   QLQHHQSHIFSPCLDRPNSHTKNHHQKPKLHFQQEDDPQQTHFVLASLLQRINTLESSLH 125

Query: 469  -----CTS-----SFTLRDAAARTIQTHFRAFLVHRSRTLRQLKDLAVIKSALNTLKLNH 618
                 CT+     S +LRD AAR IQTHFRAFLVHRSRTLRQLK+LA IKS+ N+LK + 
Sbjct: 126  QFSASCTNNHNYPSHSLRDTAARVIQTHFRAFLVHRSRTLRQLKELAFIKSSFNSLKSSI 185

Query: 619  SNNTRSDXXXXXXXXXXXXXXXEFIQGSDPMIRDAKRSINKELIRFMDYLDELSVKRHQL 798
            S  +  D               + IQG D MIRD KRS+ ++L+RF++++D  ++KRH+L
Sbjct: 186  STESHFDFKVASHKAMGLLLKIDSIQGGDTMIRDGKRSVTRDLVRFLEFVDGFAIKRHEL 245

Query: 799  STRVVKNLRIGVSGTKSRGFGSDHRGSGSETRGPREDGERELLEKLRKRVEKMEGYSRAS 978
            S +  +N+R   +  K+R   + +   G        + +RE+++KLRKRVEK+ G+SRA 
Sbjct: 246  SYKSARNVRALGNTNKARALNAKNGYGGCR---DLTESQREIVDKLRKRVEKISGFSRAC 302

Query: 979  XXXXXXXXXXXXXSRL-----YVNGKPVVERVRNGVLMKRQAGAPSKAKKSVRFAENGNV 1143
                           +      +N K  V+  R GV +K++ G P + KK+V FAENGN 
Sbjct: 303  ENDQEDVELEGFQQFVDDGDGELNRKVSVDGKR-GVSLKKRVGHP-RVKKTVSFAENGNS 360

Query: 1144 YMVYKGSNGPVSFVECHSNDGSDSVD 1221
            Y V   ++  V   +   NDGSD  D
Sbjct: 361  YRVISDTDESVLNGDDSFNDGSDYSD 386


>ref|XP_002526084.1| hypothetical protein RCOM_0524510 [Ricinus communis]
            gi|223534581|gb|EEF36278.1| hypothetical protein
            RCOM_0524510 [Ricinus communis]
          Length = 465

 Score =  193 bits (490), Expect = 1e-46
 Identities = 147/412 (35%), Positives = 209/412 (50%), Gaps = 19/412 (4%)
 Frame = +1

Query: 298  SQQPH-HSYYPQSPLLQNPKPHFQSLSRQDHFQENNEINHTVISSLLRRISALESSV--- 465
            + +PH H  + Q    Q  K +FQ LS +    ++ + N  ++SSLL+RI  LESS+   
Sbjct: 77   NNKPHLHKNHNQRLHFQPQKFNFQHLSEE----QDQDTNSFILSSLLQRIKILESSLQQF 132

Query: 466  -----RCTSSFTLRDAAARTIQTHFRAFLVHRSRTLRQLKDLAVIKSALNTLKLNHSNNT 630
                 RC  S++LR+ AAR IQTHFRAFLV RSRTL QL+DLA IKS+ N +K +  NNT
Sbjct: 133  SVSNRRCHHSYSLRETAARVIQTHFRAFLVRRSRTLSQLQDLASIKSSFNAMKSSVLNNT 192

Query: 631  RSDXXXXXXXXXXXXXXXEFIQGSDPMIRDAKRSINKELIRFMDYLDELSVKRHQLST-R 807
                              + IQG DP+IRD K+SI+++++RF+D++D L  K    S  +
Sbjct: 193  HLSHAVVSHRAMGLLLKIDSIQGGDPIIRDGKKSISRDIVRFLDFIDGLPGKGQGSSLYK 252

Query: 808  VVKNLRIGVSGTKSRGFGSDHRGSGSETRGPREDGERELLEKLRKRVEKMEGYSRASXXX 987
             VKN+R     +KSR   S+  G    +R      ++E++E L +RVEK+ G+SR     
Sbjct: 253  PVKNVRFIRKMSKSRASNSNV-GYEDLSRN-----QKEIVENLSERVEKIRGFSRVYEND 306

Query: 988  XXXXXXXXXXSRLYVNGKPV---VERVRNGVLMKRQAGAPSKAKKSVRFAENGNVYMVYK 1158
                        +  +       V ++RNG+L+K     P + KKSV F E+GNVY ++ 
Sbjct: 307  EEDVELEGFQELIDDDEDEENIKVSKIRNGILVKSNGSKP-RVKKSVSFDEDGNVYRIFS 365

Query: 1159 GSNGPVSFVECHSNDGSDSVDAEEMGRXXXXXXXXXXXXXXXXXXXXXXXXXGRDPRRNV 1338
             ++  V   +    DGSDS D                                 D  RN 
Sbjct: 366  DTHESVLNGDGSFTDGSDSSD----------DHGETLQKDEVVQEENPASSQSSDAERNH 415

Query: 1339 VAESDNDSSGH-----EPIENEDGTFVFSAPLPVKMEPRADLMNK-KSLKII 1476
            V    N  SG         +++DG  VFSAP+PVKME RADLM K K++KI+
Sbjct: 416  VR---NSRSGEYYEISRYCQDQDGNLVFSAPMPVKMESRADLMKKRKAVKIV 464


>ref|XP_003530425.1| PREDICTED: uncharacterized protein LOC100787996 [Glycine max]
          Length = 452

 Score =  179 bits (454), Expect = 2e-42
 Identities = 142/425 (33%), Positives = 204/425 (48%), Gaps = 27/425 (6%)
 Frame = +1

Query: 283  CQSHISQQPHHSYYPQSPLLQNPKPHFQSLS--RQDHFQENNEIN------------HTV 420
            C +    Q HH     + LL  P+PH   +    Q H+   N +N            H+ 
Sbjct: 45   CCTPSPPQHHHLLQVIASLLSQPQPHPIPIPIPSQQHYTHQN-LNLPTQNHHPQRQPHST 103

Query: 421  ISSLLRRISALESSVRCTSSFTLRDAAARTIQTHFRAFLVHRSRTLRQLKDLAVIKSALN 600
            +SSLL RI  LESS+   +  +LR AAAR IQTHFR+ L  RSRTL QLK LA IKS  N
Sbjct: 104  MSSLLHRIETLESSLNHYTHHSLRHAAARLIQTHFRSLLARRSRTLSQLKHLASIKSTFN 163

Query: 601  TLKLNHSNNTRSDXXXXXXXXXXXXXXXEFIQGSDPMIRDAKRSINKELIRFMDYLDELS 780
             LK + S++T  D               + IQG DPMI D KRSI+++L++F+D ++E++
Sbjct: 164  ALKSSFSSHTHVDFAAISLKAMNLLLELDSIQGCDPMIVDGKRSISRDLVQFLDSVEEVA 223

Query: 781  VKRHQLSTRVVKNLRIGVSGTKSRGFGSDHRGSGSETRGPREDGERELLEKLRKRVEKME 960
            +K+H L  +  K +R G  G K +     HR S        +D  R+LL+ LR RVEK+ 
Sbjct: 224  LKKHVLYVKAAKPVRSG--GKKVQ----KHRNSD-------DDERRKLLQNLRGRVEKLS 270

Query: 961  GYSRAS-----XXXXXXXXXXXXXSRLYVNGKPVVERVRNGVLMKRQAGAPSKAKKSVRF 1125
               + S                  + + + G+  V + +NGV + RQ GA    KKSVRF
Sbjct: 271  KLCKVSANDEEDSESEEGIHDNGVTNVLIGGRNEVPQNKNGVFLPRQ-GAQPGVKKSVRF 329

Query: 1126 AENGNVYMVYKGSNGPVSFVECHSNDG--------SDSVDAEEMGRXXXXXXXXXXXXXX 1281
            A+N N+  VY G +   S   C S+D         S +V+ + +G               
Sbjct: 330  AKNRNICEVYSG-DVACSDGSCSSSDELGDVLENVSGAVEDDSVGSSQGAEDDEEVLVVE 388

Query: 1282 XXXXXXXXXXXGRDPRRNVVAESDNDSSGHEPIENEDGTFVFSAPLPVKMEPRADLMNKK 1461
                        R+ RR +V  +D  +   E ++      +FSAPLP+KME R+   N K
Sbjct: 389  SGGSPRSSDDGERNTRRVLV--NDGRNVVKEQLQAHREKLLFSAPLPLKMENRSGSRNSK 446

Query: 1462 SLKII 1476
             +KI+
Sbjct: 447  GVKIL 451


>ref|XP_003551752.1| PREDICTED: uncharacterized protein LOC100798286 [Glycine max]
          Length = 452

 Score =  174 bits (440), Expect = 8e-41
 Identities = 139/422 (32%), Positives = 194/422 (45%), Gaps = 31/422 (7%)
 Frame = +1

Query: 304  QPHHSYYPQSPLLQNPKPHFQSLSRQDHFQ------------ENNEINH-------TVIS 426
            Q HH     S LL  P+PH   +  Q H+             +N+ + H       + IS
Sbjct: 48   QHHHLLQAISSLLSQPQPHLIPIPSQQHYTKSYTHQNLKLPTQNHHLQHPHQHQTHSTIS 107

Query: 427  SLLRRISALESSVRCTSSFTLRDAAARTIQTHFRAFLVHRSRTLRQLKDLAVIKSALNTL 606
            SLL RI +LESS    +  +LR AAAR IQTHFR+FL  RSRTL QLK LA IKS  N L
Sbjct: 108  SLLDRIESLESSFNHYTHHSLRHAAARVIQTHFRSFLARRSRTLAQLKHLASIKSTFNAL 167

Query: 607  KLNHSNNTRSDXXXXXXXXXXXXXXXEFIQGSDPMIRDAKRSINKELIRFMDYLDELSVK 786
            K + SN+T  D               + IQG DPMI D KRSI+++L++F+D ++E+++K
Sbjct: 168  KSSFSNHTHVDFAAISLKAMNLLLELDSIQGCDPMIVDGKRSISRDLVQFLDSIEEVALK 227

Query: 787  RHQLSTRVVKNLRIGVSGTKSRGFGSDHRGSGSETRGPREDGERELLEKLRKRVEKMEGY 966
            +H L  +  K +       K+R    D R              R+LL+ LR RVEK+   
Sbjct: 228  KHVLHVKAGKTVGSVKKVQKNRNSDDDER--------------RKLLQNLRCRVEKLSRL 273

Query: 967  SRAS-----XXXXXXXXXXXXXSRLYVNGKPVVERVRNGVLMKRQAGAPSKAKKSVRFAE 1131
             + S                  + + + G+  V   +NGV + R+ G P   KKSVRFAE
Sbjct: 274  CKVSANDEEDSESGESIHDDGVTNVLIGGRNEVSPNKNGVCLLRR-GEPG-VKKSVRFAE 331

Query: 1132 NGNVYMVYKG----SNGPVSFVECH---SNDGSDSVDAEEMGRXXXXXXXXXXXXXXXXX 1290
            N N+  VY G    S+G  S  +     S + S +V+   +                   
Sbjct: 332  NRNICEVYSGDVACSDGSCSSSDEQGEVSENVSGAVEDNGVDSSQGAEDHEEVLVFDSGG 391

Query: 1291 XXXXXXXXGRDPRRNVVAESDNDSSGHEPIENEDGTFVFSAPLPVKMEPRADLMNKKSLK 1470
                     R+ RR  V    N     E ++      +FSAPLP+KME R+   N K +K
Sbjct: 392  LPHSSDDGERNTRRLFVKVGRNVVK--EQLQAHQEKLLFSAPLPLKMENRSGSKNSKGVK 449

Query: 1471 II 1476
            I+
Sbjct: 450  IL 451

