BLASTX nr result

ID: Angelica23_contig00016020 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica23_contig00016020
         (1696 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003556463.1| PREDICTED: trihelix transcription factor GTL...   404   e-110
ref|XP_003591003.1| Trihelix transcription factor [Medicago trun...   389   e-105
ref|NP_001236643.1| trihelix transcription factor [Glycine max] ...   385   e-104
ref|XP_003554701.1| PREDICTED: uncharacterized protein LOC100801...   373   e-101
ref|XP_002328348.1| predicted protein [Populus trichocarpa] gi|2...   368   2e-99

>ref|XP_003556463.1| PREDICTED: trihelix transcription factor GTL2-like [Glycine max]
          Length = 590

 Score =  404 bits (1038), Expect = e-110
 Identities = 243/548 (44%), Positives = 314/548 (57%), Gaps = 52/548 (9%)
 Frame = -3

Query: 1649 GVPQEQFHQLIASSRTLSASLPINPLPFSTSSPH--PFPLNFDAYXXXXXXXXXXXXXXX 1476
            GVP +QFHQ I    +L   LP  PL  S  +P+   FP NFD Y               
Sbjct: 4    GVP-DQFHQFITPRTSLPLHLPF-PLHTSGGTPNTTTFPSNFDPYNHPHQLPLQPNNLLH 61

Query: 1475 XXXXPTKEDAHNKVEETSIGVE--------HIMDPWSNDEVLSLLRIRSSMDNWFPDFTW 1320
                  ++   N     ++ ++         ++DPW+NDEVL+LLRIRSSM++WFP+ TW
Sbjct: 62   PLHHKDEDKEENTTVPMNLEIQRDQRQQLPELIDPWNNDEVLALLRIRSSMESWFPELTW 121

Query: 1319 EHVSRKMAEFGFKRSADKCKQKFEAETRDFNRLCFNKP----------RFDSELEELYH- 1173
            EHVSRK+AE G+KRSA+KCK+KFE E+R FN + + K           RF SELE+LYH 
Sbjct: 122  EHVSRKLAELGYKRSAEKCKEKFEEESRYFNNINYGKNNNNNNNSSNYRFLSELEQLYHQ 181

Query: 1172 GXXXXXXXXXXXXXXXXXXXLAGNVAINNLPEQPTHEELKRSTAWKQNHQI--------- 1020
            G                     G+ A+  L  +     +  +   KQN Q          
Sbjct: 182  GGSGDHHLENTTQPPLQKQDKMGHHAL-ELEVEGDSRNVVDALVTKQNEQSDEALAVEKI 240

Query: 1019 ---KKRKREQKFENFKEFCVDIVNKMMLQQEEMHNKLLQDFLSRDDENIAREEAWKKQEM 849
               +KRKR  +FE FK FC  IV+K+M QQEEMHNKLL+D + RDDE   REEAWKKQE+
Sbjct: 241  TKDRKRKRPDRFEMFKCFCESIVHKIMAQQEEMHNKLLEDMMKRDDEKFTREEAWKKQEI 300

Query: 848  ENFIKEAEVRTREQAIAGDRQATITEIMNKFTSQEAFDKIAVSYEELLKVSNTLSSLTSS 669
            E   KE E+  REQAIAGDRQA I +I+NKF++       + +   L KV+N  +  T  
Sbjct: 301  EKMNKELEMMAREQAIAGDRQANIIQILNKFSATS-----SPASHTLKKVNNDSNINTHI 355

Query: 668  YQN-------------------ILPQNNSSSTETLSHSNPSNSTSNLPQNHDYKTPSLSA 546
             QN                   ++P  +S+ST  L   NPS  + N+  N++    ++  
Sbjct: 356  TQNPNPSQTENPTLSVAQDTLQVIPSTSSTSTPALP-QNPSTYSLNIQNNNN----NIPV 410

Query: 545  LSFPLVKPSNERDDVGKRWPKDEVLALINMRCKLYANSSNKNEEAGAASGSKGSLWERIS 366
             +  ++   NE+DDVG+RWPKDEVLALIN+RC    N++N+ +E      +K  LWERIS
Sbjct: 411  ETNSVLNKGNEKDDVGRRWPKDEVLALINLRCTSVNNNNNEEKE----GNNKVPLWERIS 466

Query: 365  QGMMELGYKRSAKRCKEKWENINKYFRKTKDVNKKRSVDSRTCPYFQQLSSLYDQGTLVA 186
            QGM+ELGYKRSAKRCKEKWENINKYFRKTKDVNKKRS+DSRTCPYF QLSSLY+QG  V 
Sbjct: 467  QGMLELGYKRSAKRCKEKWENINKYFRKTKDVNKKRSLDSRTCPYFHQLSSLYNQGKPVL 526

Query: 185  PDTDGITS 162
                 + S
Sbjct: 527  QSESHLNS 534


>ref|XP_003591003.1| Trihelix transcription factor [Medicago truncatula]
            gi|355480051|gb|AES61254.1| Trihelix transcription factor
            [Medicago truncatula]
          Length = 557

 Score =  389 bits (1000), Expect = e-105
 Identities = 237/547 (43%), Positives = 312/547 (57%), Gaps = 49/547 (8%)
 Frame = -3

Query: 1649 GVPQEQFHQLIASSRTLSASLPINPLPFSTSSPH----PF-PLNFDAYXXXXXXXXXXXX 1485
            GVP +QFHQ I + RT S+SLP++ LPF  S+P+    PF P N   +            
Sbjct: 4    GVP-DQFHQFI-TPRTSSSSLPLH-LPFPLSTPNNTFPPFDPYNQQNHPSQHHQLPLQVQ 60

Query: 1484 XXXXXXXPTKEDAHNKVEETSIGVEH-------------IMDPWSNDEVLSLLRIRSSMD 1344
                      +D  +K + ++  + +             ++DPW+NDEVL+LL+IRSSM+
Sbjct: 61   PNLLHPLHPHKDDEDKEQNSTPSMNNFQIDRDQRQILPQLIDPWTNDEVLALLKIRSSME 120

Query: 1343 NWFPDFTWEHVSRKMAEFGFKRSADKCKQKFEAETRDFNRLCFNKP------RFDSELEE 1182
            +WFPDFTWEHVSRK+AE G+KRSA+KCK+KFE E+R FN +  N+       RF +ELEE
Sbjct: 121  SWFPDFTWEHVSRKLAEVGYKRSAEKCKEKFEEESRFFNNINHNQNSFGKNFRFVTELEE 180

Query: 1181 LYHGXXXXXXXXXXXXXXXXXXXLA----------GNVAINNLPEQPTHEELKRSTAWKQ 1032
            +Y G                                +V ++   E+   E +++ T    
Sbjct: 181  VYQGGGGENNKNLVEAEKQNEVQDKMDPHEEDSRMDDVLVSKKSEE---EVVEKGTT--N 235

Query: 1031 NHQIKKRKREQKFENFKEFCVDIVNKMMLQQEEMHNKLLQDFLSRDDENIAREEAWKKQE 852
            + + +KR  + +FE FK FC  +V KMM QQEEMHNKL++D + RD+E  +REEAWKKQE
Sbjct: 236  DEKKRKRSGDDRFEVFKGFCESVVKKMMDQQEEMHNKLIEDMVKRDEEKFSREEAWKKQE 295

Query: 851  MENFIKEAEVRTREQAIAGDRQATITEIMNKFTSQEAFDKIAVSYEELLKVSNTLSSLTS 672
            ME   KE E+   EQAIAGDRQA I + +NKF++      +            ++S+   
Sbjct: 296  MEKMNKELELMAHEQAIAGDRQAHIIQFLNKFSTSANSSSLT-----------SMSTQLQ 344

Query: 671  SYQNILPQNNSSSTETLSHSNPSNSTSNL---PQNHDYKTPSLSAL------------SF 537
            +Y   L  N+SSST    + NP      L   P+N     PS S              S+
Sbjct: 345  AYLATLTSNSSSSTLHSQNPNPETLKKTLQPIPENPSSTLPSSSTTLVAQPRNNNPISSY 404

Query: 536  PLVKPSNERDDVGKRWPKDEVLALINMRCKLYANSSNKNEEAGAASGSKGSLWERISQGM 357
             L+  S ERDD+G+RWPKDEVLALIN+RC      +N NEE    S +K  LWERISQGM
Sbjct: 405  SLIS-SGERDDIGRRWPKDEVLALINLRC------NNNNEEKEGNSNNKAPLWERISQGM 457

Query: 356  MELGYKRSAKRCKEKWENINKYFRKTKDVNKKRSVDSRTCPYFQQLSSLYDQGTLVAPDT 177
            +ELGYKRSAKRCKEKWENINKYFRKTKD N+KRS+DSRTCPYF  L++LY+QG LV    
Sbjct: 458  LELGYKRSAKRCKEKWENINKYFRKTKDANRKRSLDSRTCPYFHLLTNLYNQGKLVLQSD 517

Query: 176  DGITSEN 156
                S N
Sbjct: 518  QKQESNN 524


>ref|NP_001236643.1| trihelix transcription factor [Glycine max]
            gi|146674837|gb|ABQ42350.1| trihelix transcription factor
            [Glycine max]
          Length = 581

 Score =  385 bits (990), Expect = e-104
 Identities = 233/555 (41%), Positives = 306/555 (55%), Gaps = 45/555 (8%)
 Frame = -3

Query: 1649 GVPQEQFHQLIASSRTLSASLPINPLPFSTSSPHPFPLNFDAYXXXXXXXXXXXXXXXXX 1470
            GVP +QFHQ I    +    LP  PL  S +    FP NFD Y                 
Sbjct: 4    GVP-DQFHQFITPRTSQPLHLPF-PLHASGTPNTTFPSNFDPYNNPSHQLPLQPNNLLHP 61

Query: 1469 XXPTKEDAHNKVEETSIGVE----------HIMDPWSNDEVLSLLRIRSSMDNWFPDFTW 1320
                 E+         +  E           ++DPW+ DEVL+LLRIRSSM++WFP+ TW
Sbjct: 62   LHHKDEEKEENTTTVPMNFEIQRDQRQQLPELIDPWTTDEVLTLLRIRSSMESWFPELTW 121

Query: 1319 EHVSRKMAEFGFKRSADKCKQKFEAETRDFNR---------LCFNKPRFDSELEELYHGX 1167
            EHVSR++AE G+KRSA+KCK+KFE E+R FN             +  RF SELE+LYH  
Sbjct: 122  EHVSRRLAELGYKRSAEKCKEKFEEESRYFNNDINYAKNNNNSTSNYRFLSELEQLYHQQ 181

Query: 1166 XXXXXXXXXXXXXXXXXXLAGNVAINNLPEQP------------THEELKRSTAWKQNHQ 1023
                                 +     L E+             T  +   + A ++  +
Sbjct: 182  GSSGDHLEKMTQPPLQKQGRMDHHALELEEEEGDSRNVIVDASVTKIQSDEALAVEKITK 241

Query: 1022 IKKRKREQKFENFKEFCVDIVNKMMLQQEEMHNKLLQDFLSRDDENIAREEAWKKQEMEN 843
             +KRKR  +FE FK FC  IV+KMM QQEEMHNKLL+D + RD+E   REEAWKKQEME 
Sbjct: 242  DRKRKRSDRFEMFKGFCESIVHKMMTQQEEMHNKLLEDMMKRDEEKFTREEAWKKQEMEK 301

Query: 842  FIKEAEVRTREQAIAGDRQATITEIMNKFTSQEAFDKIAVSYEELLKVSNTLS------- 684
              KE E+  REQA+AGDRQA I +I+NKF++  +    + +   L KV+  +S       
Sbjct: 302  MNKELEMMAREQAVAGDRQAKIIQILNKFSATTS----SPASHTLKKVNTHISQNPNPSQ 357

Query: 683  ----SLTSSYQNILPQNNSSSTETLSHSNPSNSTSNLPQNHDYKTPSLSALSFPLVKP-- 522
                +L+ +   ++P  +S+ST   +     +S S   QN+++   ++      ++    
Sbjct: 358  TENPTLSVAQDTLIPSTSSTSTPAPAPPQNPSSCSLNSQNNNHINNNIPVEKNSILNKGS 417

Query: 521  -SNERDDVGKRWPKDEVLALINMRCKLYANSSNKNEEAGAASGSKGSLWERISQGMMELG 345
             SNE+DDVG+RWPKDEVLALIN+RC    N++N  E+ G    +K  LWERISQGM EL 
Sbjct: 418  SSNEKDDVGRRWPKDEVLALINLRCTSVNNNNNNEEKEG---NNKVPLWERISQGMSELR 474

Query: 344  YKRSAKRCKEKWENINKYFRKTKDVNKKRSVDSRTCPYFQQLSSLYDQGTLVAPDTDGIT 165
            YKRSAKRCKEKWENINKYFRKTKD+ KKRS+DSRTCPYF QLSSLY+QG LV      + 
Sbjct: 475  YKRSAKRCKEKWENINKYFRKTKDITKKRSLDSRTCPYFHQLSSLYNQGKLVLQSESHLN 534

Query: 164  SENRSQTSS*LAPRQ 120
            +    Q    + P Q
Sbjct: 535  NTPPDQNPEQVKPDQ 549


>ref|XP_003554701.1| PREDICTED: uncharacterized protein LOC100801868 [Glycine max]
          Length = 564

 Score =  373 bits (957), Expect = e-101
 Identities = 233/528 (44%), Positives = 294/528 (55%), Gaps = 39/528 (7%)
 Frame = -3

Query: 1655 MFGVPQEQFHQLIASSRTLSA--SLPINPLPFSTSSPHPFPLNFDAYXXXXXXXXXXXXX 1482
            MF    +QFHQ IA   TL    S P++    ST  P+ F L FD Y             
Sbjct: 1    MFDGAPDQFHQFIAPRTTLPLHLSFPLHHHASSTPPPNTF-LPFDPYNPSSHHLLPSLQT 59

Query: 1481 XXXXXXPTKEDAHNKVEETSIGV----------------EHIMDPWSNDEVLSLLRIRSS 1350
                  PT    H   ++  I                  + + D W+NDEVL+L RIRSS
Sbjct: 60   NHLLHPPTTSPTHKHEQDKVIAPIVNNNEEIQRDQRQLPDQLTDSWTNDEVLALFRIRSS 119

Query: 1349 MDNWFPDFTWEHVSRKMAEFGFKRSADKCKQKFEAETRDFNRLC-FNKPRFD---SELEE 1182
            M+NW P+ TW+HVSR++AE GFK+SA+KCK+KFE E+R F+ +  + K  F    SELEE
Sbjct: 120  MENWLPELTWDHVSRRLAEVGFKKSAEKCKEKFEDESRYFDNINNYGKNNFRFLISELEE 179

Query: 1181 LYHGXXXXXXXXXXXXXXXXXXXLAGNVAIN-NLPEQPTHEELKR------STAWKQNHQ 1023
            L                        G  A+  N  +  T    KR      +   K N +
Sbjct: 180  LCQNSDPGAHDHNGVVVRSEKTHHLGGHALEENSRDIETTTATKRCDIGSDTVVEKSNSK 239

Query: 1022 IKKRKREQKFENFKEFCVDIVNKMMLQQEEMHNKLLQDFLSRDDENIAREEAWKKQEMEN 843
            ++KRKR  +FE FK FC  +VNKMM QQEE HNKLL+D + RD E  AREEAWKKQE++ 
Sbjct: 240  VRKRKRRDRFEMFKGFCESVVNKMMAQQEETHNKLLEDMVKRDQEKFAREEAWKKQELDR 299

Query: 842  FIKEAEVRTREQAIAGDRQATITEIMNK-----FTSQEAFDKIAVSYEELLKVSNTLSSL 678
              KE E+  +EQAIAGDRQATI E + K      TS     + A  Y  +   SN  +S 
Sbjct: 300  MKKELEIMAQEQAIAGDRQATIIEFLKKCATTTITSLSPPSQNAKYY--ITNDSNLPNSE 357

Query: 677  TSSYQNILPQNNSSSTETLSHSNPSNSTSN----LPQNHDYKTPSLSALSFPLVKPSNER 510
              S  + L Q  SSS  + +  NPS+S ++    +P   +  +      + P+    N +
Sbjct: 358  NPSTSDTLLQVPSSSNSSPTTHNPSSSLNSHNNIIPLESNSVSTYKPTSTTPMASSENSK 417

Query: 509  DDVGKRWPKDEVLALINMRCKLYANSSNKNEEAGAASGSKGSLWERISQGMMELGYKRSA 330
            DD+G+RWP+DEVLALIN+RC     S + NEE     G+KG LWERISQGM  LGYKRSA
Sbjct: 418  DDIGRRWPRDEVLALINLRC----TSLSSNEE---KEGNKGPLWERISQGMSALGYKRSA 470

Query: 329  KRCKEKWENINKYFRKTKD-VNKKRSVDSRTCPYFQQLSSLYDQGTLV 189
            KRCKEKWENINKYFRKTKD VNKKRS++SRTCPYF QLS LY QG +V
Sbjct: 471  KRCKEKWENINKYFRKTKDNVNKKRSLNSRTCPYFHQLSCLYGQGKIV 518


>ref|XP_002328348.1| predicted protein [Populus trichocarpa] gi|222838063|gb|EEE76428.1|
            predicted protein [Populus trichocarpa]
          Length = 475

 Score =  368 bits (945), Expect = 2e-99
 Identities = 218/459 (47%), Positives = 273/459 (59%), Gaps = 23/459 (5%)
 Frame = -3

Query: 1448 AHNKVEETSIGVEHIMDPWSNDEVLSLLRIRSSMDNWFPDFTWEHVS-RKMAEFGFKRSA 1272
            A N   E    +  +++PWSNDEVL LLRIRSSMDNWFP+FTWEH S R +AE GFKRS 
Sbjct: 2    AMNLKFERERSIPELVNPWSNDEVLPLLRIRSSMDNWFPEFTWEHASSRNLAEVGFKRST 61

Query: 1271 DKCKQKFEAETRDFNR----LCFNKPRFDSELEELYHGXXXXXXXXXXXXXXXXXXXLAG 1104
            +K K+KFE E+  FN        N     SE EE+YHG                      
Sbjct: 62   EKWKEKFEEESGYFNSNIDIYSKNYRASFSEFEEIYHGDQNPDQQEATAGEKKIRKPSED 121

Query: 1103 NVAIN---NLPEQPTHEELKRSTAWKQNH---------QIKKRKREQKFENFKEFCVDIV 960
                    NL E+   ++   + + + N          + KKRKRE+KFE FK  C DIV
Sbjct: 122  EQQDKMGQNLEEETRIDQTVGNQSVEDNDGKLEQFEKSKRKKRKREKKFEMFKGICEDIV 181

Query: 959  NKMMLQQEEMHNKLLQDFLSRDDENIAREEAWKKQEMENFIKEAEVRTREQAIAGDRQAT 780
            NKMM QQEE HNKLL+D + RD+E  AREEAWKK EM+   KE E+R  EQA+AGDR  T
Sbjct: 182  NKMMAQQEEKHNKLLEDIVKRDEEKFAREEAWKKLEMDRINKELELRAHEQALAGDRLDT 241

Query: 779  ITEIMNKFTSQEAFDKIAVSYEELLKVSNTLSS------LTSSYQNILPQNNSSSTETLS 618
            + + + K TS +  +    S  +    ++TL+        TSS   + PQN +S     S
Sbjct: 242  LIKFLKKITSAQ--NPNPASQTKPQNPNSTLAPNIPQAPTTSSTLALAPQNPNSLN---S 296

Query: 617  HSNPSNSTSNLPQNHDYKTPSLSALSFPLVKPSNERDDVGKRWPKDEVLALINMRCKLYA 438
            H++PS  +S LP    YK  + S         SN+ DD+GKRWP+DEVLALIN+RC LY 
Sbjct: 297  HNSPSGPSSILPM---YKVQAKST--------SNDEDDIGKRWPRDEVLALINLRCSLYN 345

Query: 437  NSSNKNEEAGAASGSKGSLWERISQGMMELGYKRSAKRCKEKWENINKYFRKTKDVNKKR 258
            N+ +K   A      K  +WERISQGM+ELGYKRSAKRCK+KWENINKYFRKTKD +KKR
Sbjct: 346  NNEDKEGSA------KAPVWERISQGMLELGYKRSAKRCKQKWENINKYFRKTKDASKKR 399

Query: 257  SVDSRTCPYFQQLSSLYDQGTLVAPDTDGITSENRSQTS 141
             ++SRT PYF QLS+LY+ GTLVAP     + EN+S  S
Sbjct: 400  YINSRTSPYFHQLSTLYNHGTLVAPKNRSASPENQSNLS 438

