BLASTX nr result

ID: Mentha29_contig00015940

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha29_contig00015940
         (1417 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU31017.1| hypothetical protein MIMGU_mgv1a026083mg [Mimulus...    92   6e-16
emb|CAN62228.1| hypothetical protein VITISV_008028 [Vitis vinifera]    90   2e-15
gb|EYU24001.1| hypothetical protein MIMGU_mgv1a021604mg [Mimulus...    89   5e-15
ref|XP_007047984.1| VQ motif-containing protein [Theobroma cacao...    87   2e-14
ref|XP_007159640.1| hypothetical protein PHAVU_002G254700g [Phas...    70   2e-09
ref|XP_004502434.1| PREDICTED: homeobox protein 2-like [Cicer ar...    64   2e-07
ref|XP_002310570.2| VQ motif-containing family protein [Populus ...    61   1e-06
ref|XP_007018802.1| VQ motif-containing protein [Theobroma cacao...    60   2e-06

>gb|EYU31017.1| hypothetical protein MIMGU_mgv1a026083mg [Mimulus guttatus]
          Length = 419

 Score = 92.0 bits (227), Expect = 6e-16
 Identities = 132/442 (29%), Positives = 153/442 (34%), Gaps = 68/442 (15%)
 Frame = -2

Query: 1341 DEEYDSRAA--DSVSAFMSXXXXXXXXXXXXGSISHAPPPFFDPLSGY--XXXXXXXXXX 1174
            DEEYDSRAA  DSVSAFMS                   PPFFDP+S Y            
Sbjct: 16   DEEYDSRAAAADSVSAFMSGGGGGGGAVNL--------PPFFDPVSNYLQLHQNPNFSFL 67

Query: 1173 XXXXSWPR--TAAPPLRSDPNP---------IGHDGV-----HSVFPASNSYMPCFQPGA 1042
                +WPR  TAA P RSDPNP           HD       + + P S  +MP F+P  
Sbjct: 68   NPSLAWPRNSTAAAP-RSDPNPNPNPNPNPITHHDATTTTNNNPMLPNSPPFMPSFRPPP 126

Query: 1041 A--DGGAS---------QLXXXXXXXXXXXXXXXXXXXXXXPTTVLTTDTTNFRAMVQEF 895
            A   GGA          Q                       PTTVLTTDTTNFRAMV EF
Sbjct: 127  ARVAGGADLALPPPQQRQNQNQNQPAAARNPKKRSRASRRAPTTVLTTDTTNFRAMVHEF 186

Query: 894  TGIXXXXXXXXXXXXXXXXXXXXXXXXAVDAAQPPPYLRRP--------------FAQKV 757
            TGI                               PPYLRRP               A   
Sbjct: 187  TGIPAPPFNNNNNSSFPRSRFDLFGTTTTH-LDAPPYLRRPPKSPLTSAAAAAVAAAAAA 245

Query: 756  QPPHPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSPSMNYQMQMTQNPNLF-XXXXXX 580
              P                                 + + NYQ     +PNLF       
Sbjct: 246  SSPTNNNNNTPHNIITSVNSPTAINNEASSSSASNTNNNNNYQ----HSPNLFNNIIPTP 301

Query: 579  XXXXXXXXXXNPKFPFSTASIAAAKPQNSYEIPHNENHFTHQPSQTLAALIPSD--HNGG 406
                      NPKFPFS                   NH  H  +  +  L+     H GG
Sbjct: 302  LLTSFLQSNPNPKFPFS-------------------NHH-HLINTKMGGLLDDQFGHGGG 341

Query: 405  G------------DV-----TNAALRSVNGGGYDFSAMDGKMSFSASSTSGFQGKSPENN 277
            G            DV      NAA+   N    D    D +++ + + +  F    P+NN
Sbjct: 342  GGHGGDHVIISATDVINSNNNNAAIWHSNNPDAD---KDDRINLNGAGSYNFPA-GPDNN 397

Query: 276  AAAAV---RGEGMVESWICSSE 220
            AAAA    RGEGMVESWICSS+
Sbjct: 398  AAAAAAASRGEGMVESWICSSD 419


>emb|CAN62228.1| hypothetical protein VITISV_008028 [Vitis vinifera]
          Length = 422

 Score = 90.1 bits (222), Expect = 2e-15
 Identities = 93/290 (32%), Positives = 117/290 (40%), Gaps = 47/290 (16%)
 Frame = -2

Query: 948 TTVLTTDTTNFRAMVQEFTGI---XXXXXXXXXXXXXXXXXXXXXXXXAVDAAQPPPYLR 778
           TTVLTTDTTNFRAMVQEFTGI                            +D A PP YL 
Sbjct: 139 TTVLTTDTTNFRAMVQEFTGIPAQPFTSSPFPRSRLDLFGTASTMRSGHLDHA-PPSYLL 197

Query: 777 RPFAQKVQPPHPF-------XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSPSMNYQMQ- 622
           RPFAQK+QPP PF                                     S S+NYQ+  
Sbjct: 198 RPFAQKLQPP-PFASPPPSSSSSFSSSSMVDAIASTTNITSGSASNTSSNSTSINYQLPS 256

Query: 621 ----MTQNPNLFXXXXXXXXXXXXXXXXNP-KFPFSTASIAAAKPQNSYEIPHNENH--- 466
               + Q  NL                  P K+P   ++I  +KPQ S EIP  ++H   
Sbjct: 257 DLGLVKQPQNLLNMNVQNPILSIQSFLQTPLKYPHPNSAIMGSKPQGSLEIPSTDSHIKM 316

Query: 465 -------FTHQPSQTLAALIP-----------SDHN--------GGGDVTNAALRSVNGG 364
                   +H    T  + +P           SD+N        G     +  L  +N G
Sbjct: 317 GGLEDFGLSHGHVNTHLSGLPNLVSSDRTASRSDNNPPSWNDGLGSSGGNHGQLGPLN-G 375

Query: 363 GYDFS--AMDGKMSFSASSTSGFQGKSPENNAAAAVRGEGMVESWICSSE 220
            Y+ S    +GKM++SASS+     K PEN    + R EGMVESWICSS+
Sbjct: 376 NYNNSQRVTNGKMNYSASSSDFHGDKVPEN---VSTRSEGMVESWICSSD 422


>gb|EYU24001.1| hypothetical protein MIMGU_mgv1a021604mg [Mimulus guttatus]
          Length = 304

 Score = 89.0 bits (219), Expect = 5e-15
 Identities = 74/207 (35%), Positives = 86/207 (41%), Gaps = 6/207 (2%)
 Frame = -2

Query: 1341 DEEYDSRAADSVSAFMSXXXXXXXXXXXXGSISHAP-PPFFDPLSGYXXXXXXXXXXXXX 1165
            DEEY+SRAA S+    S            G IS+ P PPFFDP S Y             
Sbjct: 17   DEEYESRAAGSIFMNSSTTHHPAPPTHVIGPISNPPQPPFFDPFSNYMQLLHSNMT---- 72

Query: 1164 XSWPRTAAPPLRSDPN---PIGHDGVHSVFPASNSYMPCFQPGAADGGASQ--LXXXXXX 1000
              WPR  A   RSDPN   P+     H++ P      P    G+ + G S   +      
Sbjct: 73   --WPRPPATSFRSDPNTINPMLSASSHALRPP-----PAGGEGSNNSGVSAPTVTNTNQA 125

Query: 999  XXXXXXXXXXXXXXXXPTTVLTTDTTNFRAMVQEFTGIXXXXXXXXXXXXXXXXXXXXXX 820
                            PTTVLTTDTTNFRAMVQEFTG+                      
Sbjct: 126  AAARNPRKRSRASRRAPTTVLTTDTTNFRAMVQEFTGV-----PAPPFIPRGGLDLFGPR 180

Query: 819  XXAVDAAQPPPYLRRPFAQKVQPPHPF 739
              A +   PPPYLRRPF+QK  PP  F
Sbjct: 181  STAFETPPPPPYLRRPFSQKDNPPPQF 207


>ref|XP_007047984.1| VQ motif-containing protein [Theobroma cacao]
            gi|508700245|gb|EOX92141.1| VQ motif-containing protein
            [Theobroma cacao]
          Length = 472

 Score = 86.7 bits (213), Expect = 2e-14
 Identities = 90/294 (30%), Positives = 112/294 (38%), Gaps = 51/294 (17%)
 Frame = -2

Query: 948  TTVLTTDTTNFRAMVQEFTGIXXXXXXXXXXXXXXXXXXXXXXXXAVDAAQPPP--YLRR 775
            TTVLTTDTTNFRAMVQEFTGI                              P P  YL R
Sbjct: 182  TTVLTTDTTNFRAMVQEFTGIPAPPFTSSPFPRTRLDLFGTPSTMRSTPLDPSPPHYLLR 241

Query: 774  PFAQKVQPP---------HPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSPSMNYQMQ 622
            PFAQK+ PP           F                              S S+NYQ+ 
Sbjct: 242  PFAQKIHPPPFVSSSTASSSFPSSSMVDAIASTPSTNITSASASNNNTTSSSTSINYQLS 301

Query: 621  -----MTQNPNLFXXXXXXXXXXXXXXXXNP-KFPFSTASIAAAKPQNSYEIPHNENHF- 463
                 + Q  NL                  P K+P   ++I   K Q S +IP N++   
Sbjct: 302  SELGLLKQPQNLLNINMQNPILNFQSLLQAPPKYPLPNSTILGTKLQGSLDIPSNDSSLK 361

Query: 462  ---------THQPSQT----LAALIPSDH-----------------NGGGDVTNAALRSV 373
                     +H    T    L  ++ SD                   G  +   + LRS+
Sbjct: 362  MGVLEEFGLSHGHVNTNLSGLQNMVSSDGALPRNDSSTNPPSWGEGTGSQEHDQSLLRSI 421

Query: 372  NGGGYDFS--AMDGKMSFSASSTSGFQG-KSPENNAAAAVRGEGMVESWICSSE 220
            NGG    S    +GK+S  ++S+S F G K PEN AA   R EGMVESWICSS+
Sbjct: 422  NGGYNSNSQRVSNGKVSNFSASSSDFHGDKGPENVAA---RSEGMVESWICSSD 472


>ref|XP_007159640.1| hypothetical protein PHAVU_002G254700g [Phaseolus vulgaris]
            gi|561033055|gb|ESW31634.1| hypothetical protein
            PHAVU_002G254700g [Phaseolus vulgaris]
          Length = 493

 Score = 70.1 bits (170), Expect = 2e-09
 Identities = 79/310 (25%), Positives = 110/310 (35%), Gaps = 67/310 (21%)
 Frame = -2

Query: 948  TTVLTTDTTNFRAMVQEFTGI-----------XXXXXXXXXXXXXXXXXXXXXXXXAVDA 802
            TTVLTTDTTNFRAMVQEFTGI                                    +D 
Sbjct: 192  TTVLTTDTTNFRAMVQEFTGIPAPPFTSSPFPRTRLDLFASAATPTLRSNLNVNVNPLDP 251

Query: 801  AQPPPYLRRPFAQKVQ--PPHPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSPSMNYQ 628
              PPPYL RPFAQK+Q    HPF                                +++  
Sbjct: 252  PTPPPYLLRPFAQKLQFRSLHPFPPSLSNTLSPSTNSTTNSTSINYHQQQQQQQQNLSEH 311

Query: 627  MQMTQNPNLFXXXXXXXXXXXXXXXXNPKFPFSTASIAAAKP--QNSYEIPHN------- 475
              + + P+ F                +PK+P   +S+  ++P  Q+S++IP +       
Sbjct: 312  FGLMKQPHNF------NNTPSLEAYHHPKYPLGNSSVLVSRPQQQSSFDIPPSLKMGVFE 365

Query: 474  -----------------ENHFTHQPSQTLAALIPSDHNGG----------------GDVT 394
                               +     S  + AL   ++N                  G +T
Sbjct: 366  ELGLRPDGHVNTDLRCLHQNMVSSTSVGVGALSSGNNNNNNNLSNANPSTEWVQRTGTIT 425

Query: 393  NAALRSVNGGGYDFS-----------AMDGKMSFSASSTSGFQGKSPENNAAAAVRGEGM 247
            N       GGG   S             +GK+ +SASS+     K P+ +  A  R +GM
Sbjct: 426  NDDCDHGGGGGGGLSGTVSYSDIAERVSNGKVHYSASSSDFHGEKVPDFSVTA--RSQGM 483

Query: 246  VESWI-CSSE 220
            VESWI CSS+
Sbjct: 484  VESWINCSSD 493


>ref|XP_004502434.1| PREDICTED: homeobox protein 2-like [Cicer arietinum]
          Length = 426

 Score = 63.9 bits (154), Expect = 2e-07
 Identities = 74/263 (28%), Positives = 98/263 (37%), Gaps = 21/263 (7%)
 Frame = -2

Query: 948 TTVLTTDTTNFRAMVQEFTGIXXXXXXXXXXXXXXXXXXXXXXXXAVD-AAQPPPYLRRP 772
           TTVLTTDTTNFRAMVQEFTGI                              QPPPYL RP
Sbjct: 185 TTVLTTDTTNFRAMVQEFTGIPAPPFSSPFPRTRLDLFGSSRSVSMESHQQQPPPYLLRP 244

Query: 771 FAQKVQPPHPFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSPSMNYQ-MQMTQNPNLFX 595
           FAQK+QP H                                SPS+NY  +Q  QNP    
Sbjct: 245 FAQKIQPLH------------HSFSSFQPSSSMVENSTCTNSPSINYHLLQQQQNPLNMH 292

Query: 594 XXXXXXXXXXXXXXXNPKFPFSTASIAAAKPQNSYEI-PHNENHFTHQPSQTLA-----A 433
                          +PK+   + S      + + EI P+ ++H      + L      A
Sbjct: 293 NQILGFQSNNINSQTHPKYQLGSLS-----NKTTLEITPNVDSHMKMSVFEELGLSHTHA 347

Query: 432 LIPSDHNGG------------GDVTNAALRSVNGGGYDFSAMDGKMSFSASSTSGFQGKS 289
            + +++N G             D  N  +R++     D   ++ +       ++G     
Sbjct: 348 HVSNNNNIGVVHQQNMIPASSSDGVNNNMRNI-PNSEDRGGINYRNDIEERESNG----K 402

Query: 288 PENNAAAAVRGEGMVESWI-CSS 223
                  A RGEG VESWI CSS
Sbjct: 403 GSECVGVATRGEGTVESWINCSS 425


>ref|XP_002310570.2| VQ motif-containing family protein [Populus trichocarpa]
           gi|550334197|gb|EEE91020.2| VQ motif-containing family
           protein [Populus trichocarpa]
          Length = 527

 Score = 60.8 bits (146), Expect = 1e-06
 Identities = 36/75 (48%), Positives = 37/75 (49%), Gaps = 6/75 (8%)
 Frame = -2

Query: 948 TTVLTTDTTNFRAMVQEFTGIXXXXXXXXXXXXXXXXXXXXXXXXAVDAA------QPPP 787
           TTVLTTDTTNFRAMVQEFTGI                           A        PPP
Sbjct: 186 TTVLTTDTTNFRAMVQEFTGIPAPPFTSSPFPRSRLDLFGTAASTLRSAVSHHLDPSPPP 245

Query: 786 YLRRPFAQKVQPPHP 742
           YL RPFAQ+ QPP P
Sbjct: 246 YLLRPFAQRFQPPPP 260


>ref|XP_007018802.1| VQ motif-containing protein [Theobroma cacao]
            gi|508724130|gb|EOY16027.1| VQ motif-containing protein
            [Theobroma cacao]
          Length = 551

 Score = 60.5 bits (145), Expect = 2e-06
 Identities = 61/174 (35%), Positives = 74/174 (42%), Gaps = 22/174 (12%)
 Frame = -2

Query: 1341 DEEYDSRAADSVSAFMSXXXXXXXXXXXXGS-ISHA---PPPFFDPLSGYXXXXXXXXXX 1174
            DEEYDSR  +S+ AF++             S +SH    PP FFDP S Y          
Sbjct: 77   DEEYDSRP-ESLPAFLNASGHFSPLSNPHPSLVSHHQDHPPTFFDPSSNYLNPFSQSQPN 135

Query: 1173 XXXXSWPRTAAPP-LRSDPN----------------PIGHDGVHS-VFPASNSYMPCFQP 1048
                +      P  LRS+PN                 +G  G++   FP+S+S     +P
Sbjct: 136  NSLLNLDGGVRPRGLRSEPNCTDLGNLPGSSSSSQSMLGAQGLNQGSFPSSSSMQS--RP 193

Query: 1047 GAADGGASQLXXXXXXXXXXXXXXXXXXXXXXPTTVLTTDTTNFRAMVQEFTGI 886
             A D GA  L                      PTTVLTTDTTNFRAMVQEFTGI
Sbjct: 194  -AHDNGARSLAQSDQTSVVKNPKKRTRASRRAPTTVLTTDTTNFRAMVQEFTGI 246

