BLASTX nr result

ID: Rheum21_contig00018108 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rheum21_contig00018108
         (804 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004303312.1| PREDICTED: uncharacterized protein LOC101299...    76   2e-11
gb|EOY31080.1| Uncharacterized protein isoform 2 [Theobroma cacao]     75   4e-11
gb|EOY31079.1| Uncharacterized protein isoform 1 [Theobroma cacao]     74   5e-11
gb|EOY25799.1| Uncharacterized protein TCM_027159 [Theobroma cacao]    70   9e-10
ref|XP_002268235.1| PREDICTED: uncharacterized protein LOC100261...    68   4e-09
gb|EXB39152.1| hypothetical protein L484_003791 [Morus notabilis]      66   2e-08
ref|XP_002325224.2| hypothetical protein POPTR_0018s13100g [Popu...    64   6e-08
ref|XP_002308986.2| hypothetical protein POPTR_0006s06840g [Popu...    62   2e-07
gb|EMJ16348.1| hypothetical protein PRUPE_ppa013340mg [Prunus pe...    59   3e-06
ref|XP_003543678.1| PREDICTED: uncharacterized protein LOC100784...    57   1e-05

>ref|XP_004303312.1| PREDICTED: uncharacterized protein LOC101299686 [Fragaria vesca
           subsp. vesca]
          Length = 118

 Score = 75.9 bits (185), Expect = 2e-11
 Identities = 49/120 (40%), Positives = 68/120 (56%), Gaps = 11/120 (9%)
 Frame = -2

Query: 548 MSSVLCSQSVILGAAMAVSGAVILLSLGRQRIFPATQIPTPQNQFLRSCLSTGG-XXXXX 372
           MSSVL SQ ++L  AMA+S  ++ L+  R++  P  Q  T Q+  LRSCLS+G       
Sbjct: 1   MSSVLSSQGLVLATAMAISSTLVFLAFSRKQTLPEPQNTTKQSPTLRSCLSSGDKKRDRK 60

Query: 371 XXXXRFADSV----------KPAERRIPRPVDRRTPDNRMALYRGILRDRVHVQRMASSY 222
               RFA++V          + + +R+ R    + P+N+ ALY GILRDR  VQRMA SY
Sbjct: 61  KKKVRFAENVEEEKMVERVTESSSKRVERSCRNQIPENQAALYSGILRDR--VQRMACSY 118


>gb|EOY31080.1| Uncharacterized protein isoform 2 [Theobroma cacao]
          Length = 142

 Score = 74.7 bits (182), Expect = 4e-11
 Identities = 56/143 (39%), Positives = 69/143 (48%), Gaps = 34/143 (23%)
 Frame = -2

Query: 548 MSSVLCSQSVILGAAMAVSGAVILLSLGRQRIFPATQIPTPQNQFLRSCLSTG-GXXXXX 372
           M+S+L SQ V+L  AMAVSG VILL+   Q+  P  QIP P  Q LRSC+S+G       
Sbjct: 1   MASILSSQGVVLATAMAVSGTVILLAFRLQKSLPLDQIPQPSQQVLRSCISSGKKREKKK 60

Query: 371 XXXXRFADSV--------------------------KPAE-------RRIPRPVDRRTPD 291
                FA+ V                           PA        ++I    DR  P 
Sbjct: 61  KKKVHFAEDVMDPRGDGEEFRRQLMQNPVRIGSNNNSPAALNSSTKFKKIGGGKDRGMPA 120

Query: 290 NRMALYRGILRDRVHVQRMASSY 222
           NR+ALY GIL+DRV VQR+A SY
Sbjct: 121 NRVALYNGILKDRV-VQRLAYSY 142


>gb|EOY31079.1| Uncharacterized protein isoform 1 [Theobroma cacao]
          Length = 143

 Score = 74.3 bits (181), Expect = 5e-11
 Identities = 56/144 (38%), Positives = 69/144 (47%), Gaps = 35/144 (24%)
 Frame = -2

Query: 548 MSSVLCSQSVILGAAMAVSGAVILLSLGRQRIFPATQIPTPQNQFLRSCLSTGG--XXXX 375
           M+S+L SQ V+L  AMAVSG VILL+   Q+  P  QIP P  Q LRSC+S+ G      
Sbjct: 1   MASILSSQGVVLATAMAVSGTVILLAFRLQKSLPLDQIPQPSQQVLRSCISSEGKKREKK 60

Query: 374 XXXXXRFADSV--------------------------KPAE-------RRIPRPVDRRTP 294
                 FA+ V                           PA        ++I    DR  P
Sbjct: 61  KKKKVHFAEDVMDPRGDGEEFRRQLMQNPVRIGSNNNSPAALNSSTKFKKIGGGKDRGMP 120

Query: 293 DNRMALYRGILRDRVHVQRMASSY 222
            NR+ALY GIL+DRV VQR+A SY
Sbjct: 121 ANRVALYNGILKDRV-VQRLAYSY 143


>gb|EOY25799.1| Uncharacterized protein TCM_027159 [Theobroma cacao]
          Length = 124

 Score = 70.1 bits (170), Expect = 9e-10
 Identities = 50/132 (37%), Positives = 63/132 (47%), Gaps = 23/132 (17%)
 Frame = -2

Query: 548 MSSVLCSQSVILGAAMAVSGAVILLSLGRQRIFPATQIPTPQNQFLRSCLST-GGXXXXX 372
           MSS +CSQ ++L  AM VS  VI L+  RQ+  P      P  Q LRSCLS+ G      
Sbjct: 1   MSSGICSQGLVLATAMVVSSTVIFLTFSRQKTLP------PSKQTLRSCLSSEGKRRGRK 54

Query: 371 XXXXRFADSVKPA----------------------ERRIPRPVDRRTPDNRMALYRGILR 258
               +FA++VK                         R++ R      P+NR+ALY GILR
Sbjct: 55  KKKVQFAENVKDTSGNGEEYRKEQNKKLIAATAGRSRKVDRFCRNEMPENRIALYNGILR 114

Query: 257 DRVHVQRMASSY 222
           DRVH  RM  SY
Sbjct: 115 DRVH--RMECSY 124


>ref|XP_002268235.1| PREDICTED: uncharacterized protein LOC100261053 [Vitis vinifera]
           gi|297743449|emb|CBI36316.3| unnamed protein product
           [Vitis vinifera]
          Length = 132

 Score = 67.8 bits (164), Expect = 4e-09
 Identities = 51/137 (37%), Positives = 67/137 (48%), Gaps = 28/137 (20%)
 Frame = -2

Query: 548 MSSVLCSQSVILGAAMAVSGAVILLSLGRQRIFPATQI---PTPQ--NQFLRSCL----- 399
           MSS+LCSQ V+L  AMAVSG ++ L+  RQ+  P+ +    P  Q   Q LRSCL     
Sbjct: 1   MSSILCSQGVVLATAMAVSG-ILFLAFSRQKSLPSPEFSGNPNSQAPKQILRSCLTNDEK 59

Query: 398 ------------------STGGXXXXXXXXXRFADSVKPAERRIPRPVDRRTPDNRMALY 273
                             S  G         +F+   +P    IP+   ++ P+N+ ALY
Sbjct: 60  KRERKKKRVQFAENVKEPSGNGKEFRQEQRKKFSRVQRPCRNEIPQ--IQKVPENQAALY 117

Query: 272 RGILRDRVHVQRMASSY 222
            GILRDRVH  RMA SY
Sbjct: 118 NGILRDRVH--RMACSY 132


>gb|EXB39152.1| hypothetical protein L484_003791 [Morus notabilis]
          Length = 146

 Score = 65.9 bits (159), Expect = 2e-08
 Identities = 52/146 (35%), Positives = 68/146 (46%), Gaps = 38/146 (26%)
 Frame = -2

Query: 548 MSSVLCSQSVILGAAMAVSGAVILLSLGRQRIFPATQIPT---PQNQFLRSCLSTGG--- 387
           MSS+L SQ V+L  AMAVSG VILL+   Q+  P TQ P    P+ Q LRSC+S+ G   
Sbjct: 1   MSSILSSQGVVLATAMAVSGTVILLAFRLQKSSPNTQFPVTRIPRTQILRSCISSEGSKK 60

Query: 386 -XXXXXXXXXRFAD-------------------------------SVKPAERRIPRPVDR 303
                      FA+                               SV  +  ++ +  ++
Sbjct: 61  DKKKNKQKRVHFAEDAVDPSGDGKEFRRQRSEISNSLNSKSSSSSSVSDSSLKLKKMSNK 120

Query: 302 RTPDNRMALYRGILRDRVHVQRMASS 225
             P NR+ALY GILRDRV V R+A S
Sbjct: 121 GMPANRVALYNGILRDRV-VHRLAYS 145


>ref|XP_002325224.2| hypothetical protein POPTR_0018s13100g [Populus trichocarpa]
           gi|550318640|gb|EEF03789.2| hypothetical protein
           POPTR_0018s13100g [Populus trichocarpa]
          Length = 143

 Score = 63.9 bits (154), Expect = 6e-08
 Identities = 56/143 (39%), Positives = 63/143 (44%), Gaps = 35/143 (24%)
 Frame = -2

Query: 548 MSSVLCSQSVILGAAMAVSGAVILLSLGRQRI------FPAT--QIPTPQNQFLRSCLST 393
           MSS LCSQ V+L  AMAVSG VI+L+   Q+       FP    QIP    Q LRSC+S 
Sbjct: 1   MSSSLCSQGVVLATAMAVSGTVIVLAFRLQKSHLPSGQFPGDHHQIPQSSQQALRSCISP 60

Query: 392 GGXXXXXXXXXRFADSVKPAE------RRIPRPV---------------------DRRTP 294
            G          FA+ V          RR    V                      RR P
Sbjct: 61  EGKKKGKKKRVHFAEDVVDPRGDGEEFRRQHEAVFLSQNSCSSSSTSTEFKKNGQQRRMP 120

Query: 293 DNRMALYRGILRDRVHVQRMASS 225
            NR ALY GILRDR  VQR+A S
Sbjct: 121 ANRAALYNGILRDR-GVQRLAYS 142


>ref|XP_002308986.2| hypothetical protein POPTR_0006s06840g [Populus trichocarpa]
           gi|550335661|gb|EEE92509.2| hypothetical protein
           POPTR_0006s06840g [Populus trichocarpa]
          Length = 145

 Score = 62.4 bits (150), Expect = 2e-07
 Identities = 53/143 (37%), Positives = 65/143 (45%), Gaps = 37/143 (25%)
 Frame = -2

Query: 548 MSSVLCSQSVILGAAMAVSGAVILLSLGRQR-IFPATQIPTPQN--------QFLRSCLS 396
           MSS+LCSQ V+L  AMAVSG VILL+   Q+ + P+ Q P   +        Q LRSC+S
Sbjct: 1   MSSMLCSQGVVLATAMAVSGTVILLAFRLQKSLLPSGQFPIDLHHRIAQSSPQALRSCIS 60

Query: 395 TGGXXXXXXXXXRFADSVKPAE------RRIPRPV----------------------DRR 300
           + G          FA+ V          RR    +                       RR
Sbjct: 61  SEGKKKGKKKRVHFAEDVVDPRGDGQEFRRQHEAIFLSQNSCSSSSSTSTEFKKNGQQRR 120

Query: 299 TPDNRMALYRGILRDRVHVQRMA 231
            P NR ALY GILRDR  VQR+A
Sbjct: 121 MPANRAALYNGILRDR-GVQRLA 142


>gb|EMJ16348.1| hypothetical protein PRUPE_ppa013340mg [Prunus persica]
          Length = 128

 Score = 58.5 bits (140), Expect = 3e-06
 Identities = 46/130 (35%), Positives = 66/130 (50%), Gaps = 21/130 (16%)
 Frame = -2

Query: 548 MSSVLCSQSVILGAAMAVSGAVILLSLGRQRIFPATQI-------PTPQNQFLRSCLSTG 390
           MSS+L SQ ++L  AMAVS  ++ L+  RQ+ F  TQ+         P+   LRSCL +G
Sbjct: 1   MSSMLSSQGLVLATAMAVSSTLVFLAFSRQKTFLPTQLSDSYNSQQNPKKTALRSCLCSG 60

Query: 389 G-XXXXXXXXXRFADSV--------KPAERRIPRPVDRRT-----PDNRMALYRGILRDR 252
                       FA +V        +    R    V+RR+     P+NR+ALY GIL++R
Sbjct: 61  DKKRERKKKKVHFAKNVVKEPTGGGEEMVMRKQSKVERRSCRNEIPENRIALYNGILKNR 120

Query: 251 VHVQRMASSY 222
             V+RM  S+
Sbjct: 121 --VERMQCSH 128


>ref|XP_003543678.1| PREDICTED: uncharacterized protein LOC100784964 [Glycine max]
          Length = 112

 Score = 56.6 bits (135), Expect = 1e-05
 Identities = 41/102 (40%), Positives = 55/102 (53%), Gaps = 7/102 (6%)
 Frame = -2

Query: 530 SQSVILGAAMAVSGAVILLSLGRQRIFPATQIPTPQNQFLRSCL-STGGXXXXXXXXXRF 354
           SQ ++L +AM +S  ++ ++  RQ+  P+ QI       LRSCL S            +F
Sbjct: 5   SQGLVLTSAMLLSTTLLYVAFSRQKTTPSFQIHHSNKPTLRSCLYSEEKKRERKKKKVKF 64

Query: 353 ADSVKPA-ERRIPRPVDRRT-----PDNRMALYRGILRDRVH 246
           ADSVK   ER   R   R++     P NRMALY GILR+RVH
Sbjct: 65  ADSVKEGRERNEQRSNSRQSGGMPMPANRMALYYGILRNRVH 106

