BLASTX nr result

ID: Mentha22_contig00050909 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha22_contig00050909
         (374 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|AGV40509.1| hypothetical protein [Phaseolus vulgaris]               74   2e-11
ref|XP_007018320.1| Uncharacterized protein TCM_034564 [Theobrom...    74   3e-11
gb|ABN08132.1| Putative non-LTR retroelement reverse transcripta...    73   5e-11
ref|XP_006386067.1| hypothetical protein POPTR_0003s21595g [Popu...    72   6e-11
gb|AER13156.1| putative retrotransposon [Phaseolus vulgaris]           72   6e-11
ref|XP_007029778.1| Uncharacterized protein TCM_025650 [Theobrom...    72   8e-11
ref|XP_007032608.1| Uncharacterized protein TCM_018647 [Theobrom...    72   8e-11
ref|XP_007029782.1| Uncharacterized protein TCM_025654 [Theobrom...    71   2e-10
ref|XP_007014905.1| Uncharacterized protein TCM_040499 [Theobrom...    69   9e-10
ref|XP_007021221.1| Uncharacterized protein TCM_031281 [Theobrom...    69   9e-10
ref|XP_007041344.1| Uncharacterized protein TCM_006266 [Theobrom...    68   1e-09
ref|XP_007019766.1| Uncharacterized protein TCM_036081 [Theobrom...    68   1e-09
emb|CCA66180.1| hypothetical protein [Beta vulgaris subsp. vulga...    67   2e-09
ref|XP_007014193.1| Uncharacterized protein TCM_038999 [Theobrom...    66   4e-09
ref|XP_002272748.2| PREDICTED: uncharacterized protein LOC100256...    66   4e-09
emb|CAN76026.1| hypothetical protein VITISV_027817 [Vitis vinifera]    66   4e-09
ref|XP_006589908.1| PREDICTED: putative ribonuclease H protein A...    65   1e-08
gb|AER13161.1| putative non-LTR retroelement [Phaseolus vulgaris]      65   1e-08
emb|CCA66235.1| hypothetical protein [Beta vulgaris subsp. vulga...    65   1e-08
emb|CAN82456.1| hypothetical protein VITISV_010028 [Vitis vinifera]    65   1e-08

>gb|AGV40509.1| hypothetical protein [Phaseolus vulgaris]
          Length = 1366

 Score = 74.3 bits (181), Expect = 2e-11
 Identities = 37/95 (38%), Positives = 55/95 (57%)
 Frame = +3

Query: 42   RSGWWKRVVDGVEGTEGQ*FWDGMEEVLGDGSKANFWEGFW*GEKTLRDLFPRLFQLSVN 221
            +S WW+ +++     EG  ++   + V+ +     FWE  W  E  L+D++PRLF LSVN
Sbjct: 1031 QSWWWRDLINLCGEGEGVGWFQ--QAVVWN---VRFWEDSWIVENKLKDIYPRLFSLSVN 1085

Query: 222  KGGKVGEMGAWEEGTWRWKVNWRRELREREKVWEE 326
            +G  VGE G W++  W W +NWRR   + E V EE
Sbjct: 1086 QGMTVGESGFWDDYGWHWNLNWRRRRFQWESVMEE 1120


>ref|XP_007018320.1| Uncharacterized protein TCM_034564 [Theobroma cacao]
           gi|508723648|gb|EOY15545.1| Uncharacterized protein
           TCM_034564 [Theobroma cacao]
          Length = 175

 Score = 73.6 bits (179), Expect = 3e-11
 Identities = 32/70 (45%), Positives = 44/70 (62%), Gaps = 1/70 (1%)
 Frame = +3

Query: 120 VLGDGSKANFWEGFW*GEKTLRDLFPRLFQLSVNKGGKVGEMGAWEEGTWRWKVNWRREL 299
           ++G+G   NFW+  W     L D FPR+F L+V K GKV E G WE+G W W V +RR+L
Sbjct: 69  IVGNGENLNFWQDEWIEGVVLADAFPRMFALAVKKSGKVTEFGIWEDGRWAWNVQFRRQL 128

Query: 300 RERE-KVWEE 326
            + E + WE+
Sbjct: 129 FDWEVEQWEQ 138


>gb|ABN08132.1| Putative non-LTR retroelement reverse transcriptase, related
           [Medicago truncatula]
          Length = 532

 Score = 72.8 bits (177), Expect = 5e-11
 Identities = 50/120 (41%), Positives = 59/120 (49%), Gaps = 8/120 (6%)
 Frame = +3

Query: 3   SRNGEIKNR---DSRRRSGWWK---RVVDGVEGTEGQ*FWDGMEEVLGDGSKANFWEGFW 164
           +R GEI  R     R+ S WWK    + DGV    G  F + +  V+GDG+   FW   W
Sbjct: 196 ARYGEIGGRLQEGGRQGSSWWKMLCHIRDGVGEGVGNWFDENIRRVVGDGNNTFFWYDSW 255

Query: 165 *GEKTLRDLFPRLFQLSVNKGGKVGEMG--AWEEGTWRWKVNWRRELREREKVWEEGTWR 338
            GE  L   FPRLF L+VNK   VGEM    W EG   W   WRR L      WEE + R
Sbjct: 256 VGEMPLCTKFPRLFDLAVNKECSVGEMVTLGWAEGGRAWV--WRRGL----LAWEEDSVR 309


>ref|XP_006386067.1| hypothetical protein POPTR_0003s21595g [Populus trichocarpa]
           gi|550343715|gb|ERP63864.1| hypothetical protein
           POPTR_0003s21595g [Populus trichocarpa]
          Length = 139

 Score = 72.4 bits (176), Expect = 6e-11
 Identities = 34/92 (36%), Positives = 50/92 (54%), Gaps = 2/92 (2%)
 Frame = +3

Query: 45  SGWWKRVVDGVEGTE--GQ*FWDGMEEVLGDGSKANFWEGFW*GEKTLRDLFPRLFQLSV 218
           S +WK V+  ++     G    + ++  +GDGS   FW   W G   LR +FP+L+Q+S 
Sbjct: 41  SPFWKDVLSILDPNSVIGLVLRENIKFKIGDGSSILFWSDVWIGSNALRSIFPKLYQIST 100

Query: 219 NKGGKVGEMGAWEEGTWRWKVNWRRELREREK 314
            + G V EMG W    WRW + WRR+L   E+
Sbjct: 101 FRNGLVNEMGQWVNDQWRWNLVWRRKLLSYEE 132


>gb|AER13156.1| putative retrotransposon [Phaseolus vulgaris]
          Length = 1759

 Score = 72.4 bits (176), Expect = 6e-11
 Identities = 36/97 (37%), Positives = 52/97 (53%), Gaps = 2/97 (2%)
 Frame = +3

Query: 39   RRSGWWKRVVDGV--EGTEGQ*FWDGMEEVLGDGSKANFWEGFW*GEKTLRDLFPRLFQL 212
            +   WW R +  +  EG     F + +   +G G K  F E  W G   L+ LFPRL+ L
Sbjct: 1367 KSQSWWWRDLSKMCKEGGGKGWFQEELGWEIGYGDKVKFSEEVWVGSVDLKSLFPRLYSL 1426

Query: 213  SVNKGGKVGEMGAWEEGTWRWKVNWRRELREREKVWE 323
            S+N+G  VGE+G W +  WRW++ W+R+  E E   E
Sbjct: 1427 SLNQGQTVGELGEWIDSEWRWRMRWKRDRFEWESSLE 1463


>ref|XP_007029778.1| Uncharacterized protein TCM_025650 [Theobroma cacao]
           gi|508718383|gb|EOY10280.1| Uncharacterized protein
           TCM_025650 [Theobroma cacao]
          Length = 455

 Score = 72.0 bits (175), Expect = 8e-11
 Identities = 34/76 (44%), Positives = 42/76 (55%)
 Frame = +3

Query: 108 GMEEVLGDGSKANFWEGFW*GEKTLRDLFPRLFQLSVNKGGKVGEMGAWEEGTWRWKVNW 287
           GM  V+G+G    FW+  W     L+D FPR+F L+ NK G V E GAW  G WRWK+N 
Sbjct: 29  GMCMVVGNGHNVLFWQDEWIEGVILKDKFPRMFALASNKTGCVNEFGAWVNGDWRWKINL 88

Query: 288 RRELREREKVWEEGTW 335
           RR +      WE   W
Sbjct: 89  RRSI----FYWERAQW 100


>ref|XP_007032608.1| Uncharacterized protein TCM_018647 [Theobroma cacao]
           gi|508711637|gb|EOY03534.1| Uncharacterized protein
           TCM_018647 [Theobroma cacao]
          Length = 814

 Score = 72.0 bits (175), Expect = 8e-11
 Identities = 32/80 (40%), Positives = 50/80 (62%)
 Frame = +3

Query: 99  FWDGMEEVLGDGSKANFWEGFW*GEKTLRDLFPRLFQLSVNKGGKVGEMGAWEEGTWRWK 278
           F++ ME V+G+G+  +FW+  W    TLR  F   F L++NK GKV E+G+W +  W+W+
Sbjct: 561 FFNRMEYVVGNGANISFWDDEWIEGITLRIAFLWFFALAINKSGKVCELGSWVKSVWQWE 620

Query: 279 VNWRRELREREKVWEEGTWR 338
           VN RR + +    WE  +W+
Sbjct: 621 VNLRRRIFD----WETNSWK 636


>ref|XP_007029782.1| Uncharacterized protein TCM_025654 [Theobroma cacao]
           gi|508718387|gb|EOY10284.1| Uncharacterized protein
           TCM_025654 [Theobroma cacao]
          Length = 129

 Score = 70.9 bits (172), Expect = 2e-10
 Identities = 31/72 (43%), Positives = 41/72 (56%)
 Frame = +3

Query: 120 VLGDGSKANFWEGFW*GEKTLRDLFPRLFQLSVNKGGKVGEMGAWEEGTWRWKVNWRREL 299
           V+G+G    FW+  W     L+D FPR+F L+ NK G V E GAW  G WRWK+N R+ +
Sbjct: 4   VMGNGHNVLFWQDEWIEGVILKDKFPRMFALASNKTGSVNEFGAWINGDWRWKINLRQSI 63

Query: 300 REREKVWEEGTW 335
            +    WE   W
Sbjct: 64  FD----WESAQW 71


>ref|XP_007014905.1| Uncharacterized protein TCM_040499 [Theobroma cacao]
           gi|508785268|gb|EOY32524.1| Uncharacterized protein
           TCM_040499 [Theobroma cacao]
          Length = 837

 Score = 68.6 bits (166), Expect = 9e-10
 Identities = 34/87 (39%), Positives = 48/87 (55%)
 Frame = +3

Query: 111 MEEVLGDGSKANFWEGFW*GEKTLRDLFPRLFQLSVNKGGKVGEMGAWEEGTWRWKVNWR 290
           M+ V+GDGS+  FW   W     L+DL+PR+F L+ NK G + E G WEE  W WKV  R
Sbjct: 620 MQLVVGDGSRILFWADRWTDGGILKDLYPRIFALARNKDGYIQEFGRWEEEVWVWKVQLR 679

Query: 291 RELREREKVWEEGTWRWKVNWRRELRE 371
           R            T+ W+ + + +L+E
Sbjct: 680 RP-----------TFGWEEDQQNQLKE 695


>ref|XP_007021221.1| Uncharacterized protein TCM_031281 [Theobroma cacao]
            gi|508720849|gb|EOY12746.1| Uncharacterized protein
            TCM_031281 [Theobroma cacao]
          Length = 1408

 Score = 68.6 bits (166), Expect = 9e-10
 Identities = 29/77 (37%), Positives = 42/77 (54%)
 Frame = +3

Query: 108  GMEEVLGDGSKANFWEGFW*GEKTLRDLFPRLFQLSVNKGGKVGEMGAWEEGTWRWKVNW 287
            G    L  G+   FWE FW  +  L   FPR++ L+++K   + E+G+W EG WRW V  
Sbjct: 987  GFAHSLDKGTTIRFWEDFWVDDSILATKFPRIYVLAISKKATIAELGSWVEGNWRWDV-- 1044

Query: 288  RRELREREKVWEEGTWR 338
              +LR +   WE+  WR
Sbjct: 1045 --KLRRQPFSWEQNQWR 1059


>ref|XP_007041344.1| Uncharacterized protein TCM_006266 [Theobroma cacao]
            gi|508705279|gb|EOX97175.1| Uncharacterized protein
            TCM_006266 [Theobroma cacao]
          Length = 1129

 Score = 68.2 bits (165), Expect = 1e-09
 Identities = 37/119 (31%), Positives = 60/119 (50%), Gaps = 10/119 (8%)
 Frame = +3

Query: 27   RDSRRRSGWWKRVVDGVEGTEG----Q*FWDGMEEVLGDGSKANFWEGFW*GEKTLRDLF 194
            R    +   WKRV+   EG +     +   +G+  +LG G    FW   W   + +++ F
Sbjct: 799  RSGNEKDNLWKRVLVEKEGDDHDEQHRTLREGIGFILGKGKNVRFWTEEWIKRRIVKEDF 858

Query: 195  PRLFQLSVNKGGKVGEMGAWEEGTWRWKVNWRREL----REREKVWEEGTWR--WKVNW 353
             R+F ++ NK G+V E G W +G+W+W++  RR L     E+  V++E T    WK  W
Sbjct: 859  SRIFAVATNKEGRVKEFGVWVDGSWQWRIELRRMLFGWENEQTMVYKERTPDDIWKKVW 917


>ref|XP_007019766.1| Uncharacterized protein TCM_036081 [Theobroma cacao]
           gi|508725094|gb|EOY16991.1| Uncharacterized protein
           TCM_036081 [Theobroma cacao]
          Length = 348

 Score = 67.8 bits (164), Expect = 1e-09
 Identities = 29/81 (35%), Positives = 47/81 (58%)
 Frame = +3

Query: 99  FWDGMEEVLGDGSKANFWEGFW*GEKTLRDLFPRLFQLSVNKGGKVGEMGAWEEGTWRWK 278
           ++  ++ V+ DGS+  FWE  W   + L+  FPR++ L++NK G + + G W+E  W W+
Sbjct: 235 YYSNVQLVVKDGSRILFWEDNWMERQPLKVRFPRIYALAINKEGYIQDYGKWDEELWVWE 294

Query: 279 VNWRRELREREKVWEEGTWRW 341
           V    +LR +   WEE  W W
Sbjct: 295 V----QLRRQPFGWEEEQWSW 311


>emb|CCA66180.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1383

 Score = 67.4 bits (163), Expect = 2e-09
 Identities = 32/92 (34%), Positives = 49/92 (53%), Gaps = 2/92 (2%)
 Frame = +3

Query: 48   GWWKRVVDGVEGTEGQ*FW--DGMEEVLGDGSKANFWEGFW*GEKTLRDLFPRLFQLSVN 221
            G WK +   V G EG      +GM + +G+G  + FW   W  E+ L+ + PRLF +++N
Sbjct: 904  GPWKSICAAVLGHEGARLIAVNGMRKNVGNGISSLFWHDTWLCEQPLKRIAPRLFSIAIN 963

Query: 222  KGGKVGEMGAWEEGTWRWKVNWRRELREREKV 317
            K   +   G WE   W W  +W+R LR ++ V
Sbjct: 964  KNSSIASYGVWEGFNWVWVFSWKRVLRPQDLV 995


>ref|XP_007014193.1| Uncharacterized protein TCM_038999 [Theobroma cacao]
           gi|508784556|gb|EOY31812.1| Uncharacterized protein
           TCM_038999 [Theobroma cacao]
          Length = 243

 Score = 66.2 bits (160), Expect = 4e-09
 Identities = 30/70 (42%), Positives = 38/70 (54%)
 Frame = +3

Query: 126 GDGSKANFWEGFW*GEKTLRDLFPRLFQLSVNKGGKVGEMGAWEEGTWRWKVNWRRELRE 305
           G+G    FW   W  +  L+D FPRLF L+VNK GK+ E G W E  W+W++  RR L  
Sbjct: 70  GNGCLIYFWTKPWLNDMILKDEFPRLFALAVNKNGKLNEFGVWTEVVWQWRIELRRNLFG 129

Query: 306 REKVWEEGTW 335
               WE   W
Sbjct: 130 ----WEPNQW 135


>ref|XP_002272748.2| PREDICTED: uncharacterized protein LOC100256388 [Vitis vinifera]
          Length = 2667

 Score = 66.2 bits (160), Expect = 4e-09
 Identities = 38/104 (36%), Positives = 54/104 (51%), Gaps = 2/104 (1%)
 Frame = +3

Query: 24   NRDSRRRSGW--WKRVVDGVEGTEGQ*FWDGMEEVLGDGSKANFWEGFW*GEKTLRDLFP 197
            ++D+R R G   WK +  G E      F      ++GDG+K  FW+  W G ++L++ FP
Sbjct: 906  SKDARNRYGVGVWKAIRKGWEN-----FRSHSRFIIGDGTKVKFWKDLWCGNQSLKETFP 960

Query: 198  RLFQLSVNKGGKVGEMGAWEEGTWRWKVNWRRELREREKVWEEG 329
             LF LSVNK G V E    +EG   W + + R L +    WE G
Sbjct: 961  ILFNLSVNKEGWVAEAWEEDEGGGSWGLRFNRHLND----WEVG 1000


>emb|CAN76026.1| hypothetical protein VITISV_027817 [Vitis vinifera]
          Length = 1728

 Score = 66.2 bits (160), Expect = 4e-09
 Identities = 38/104 (36%), Positives = 54/104 (51%), Gaps = 2/104 (1%)
 Frame = +3

Query: 24   NRDSRRRSGW--WKRVVDGVEGTEGQ*FWDGMEEVLGDGSKANFWEGFW*GEKTLRDLFP 197
            ++D+R R G   WK +  G E      F      ++GDG+K  FW+  W G ++L++ FP
Sbjct: 987  SKDARNRYGVGVWKAIRKGWEN-----FRSHSRFIIGDGTKVKFWKDLWCGNQSLKETFP 1041

Query: 198  RLFQLSVNKGGKVGEMGAWEEGTWRWKVNWRRELREREKVWEEG 329
             LF LSVNK G V E    +EG   W + + R L +    WE G
Sbjct: 1042 ILFNLSVNKEGWVAEAWEEDEGGGSWGLRFNRHLND----WEVG 1081


>ref|XP_006589908.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Glycine
           max]
          Length = 433

 Score = 65.1 bits (157), Expect = 1e-08
 Identities = 34/100 (34%), Positives = 50/100 (50%), Gaps = 1/100 (1%)
 Frame = +3

Query: 18  IKNRDSRRRSGWWKRVVDGVEGTEGQ*FWDGMEEVLGDGSKANFWEGFW*G-EKTLRDLF 194
           I  RD    S WWK +   V   +       ME  +GDG+  NFW+  W G +  L   F
Sbjct: 111 INGRDRPWHSQWWKDLRKLVNQPDFSSIIQQMEWKVGDGTLINFWKDKWIGTDSNLEQQF 170

Query: 195 PRLFQLSVNKGGKVGEMGAWEEGTWRWKVNWRRELREREK 314
            +LF +S  +   +  MG++  G+W W +NWRR L + E+
Sbjct: 171 NQLFLISRQQNCNIRSMGSFSHGSWCWDLNWRRNLFDHEQ 210


>gb|AER13161.1| putative non-LTR retroelement [Phaseolus vulgaris]
          Length = 685

 Score = 64.7 bits (156), Expect = 1e-08
 Identities = 28/67 (41%), Positives = 37/67 (55%)
 Frame = +3

Query: 123 LGDGSKANFWEGFW*GEKTLRDLFPRLFQLSVNKGGKVGEMGAWEEGTWRWKVNWRRELR 302
           LG   KA F E  W G   L  +FPR++ LS+N+G  +G++G W    WRW + WRR   
Sbjct: 517 LGCEDKAKFGEEVWVGSNDLESMFPRMYSLSLNQGQTMGKVGTWSNSEWRWTMRWRRAKF 576

Query: 303 EREKVWE 323
           E E   E
Sbjct: 577 EWESPME 583


>emb|CCA66235.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1380

 Score = 64.7 bits (156), Expect = 1e-08
 Identities = 35/109 (32%), Positives = 55/109 (50%), Gaps = 7/109 (6%)
 Frame = +3

Query: 18   IKNRDSRRRSGWWKRVVDGV--EGTEGQ*FWDGMEEVLGDGSKANFWEGFW*GEKTLRDL 191
            I++ D  R+ G W+++V  +    T      +G+  ++GDG+   FW   W G K L+  
Sbjct: 893  IRDLDPPRQGGPWQKIVSAIIKSPTAKAIAINGVRSLVGDGALTLFWHDQWLGPKPLKAQ 952

Query: 192  FPRLFQLSVNKGGKVGEMGAWEEGTWRWKVNW-----RRELREREKVWE 323
            FPRL+ L+ NK   V     W+   W W  +W      R+L E+EK+ E
Sbjct: 953  FPRLYLLATNKMAPVASHCFWDGLAWAWSFSWARHHRARDLDEKEKLLE 1001


>emb|CAN82456.1| hypothetical protein VITISV_010028 [Vitis vinifera]
          Length = 4128

 Score = 64.7 bits (156), Expect = 1e-08
 Identities = 35/94 (37%), Positives = 48/94 (51%)
 Frame = +3

Query: 48   GWWKRVVDGVEGTEGQ*FWDGMEEVLGDGSKANFWEGFW*GEKTLRDLFPRLFQLSVNKG 227
            G WK +  G E      F      ++GDG+K  FW+  W G ++L++ FP LF LSVNK 
Sbjct: 3317 GVWKAIRKGWEN-----FRSHSRFIIGDGTKVKFWKDLWCGNQSLKETFPILFNLSVNKE 3371

Query: 228  GKVGEMGAWEEGTWRWKVNWRRELREREKVWEEG 329
            G V E    +EG   W + + R L +    WE G
Sbjct: 3372 GWVAEAWEEDEGGXSWGLRFNRHLND----WEVG 3401

