BLASTX nr result

ID: Catharanthus22_contig00001012 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00001012
         (3550 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CAN77900.1| hypothetical protein VITISV_037350 [Vitis vinifera]   114   3e-22
emb|CAN68499.1| hypothetical protein VITISV_041099 [Vitis vinifera]   112   2e-21
ref|XP_006848921.1| hypothetical protein AMTR_s02100p00009220, p...   111   3e-21
emb|CAN74973.1| hypothetical protein VITISV_001042 [Vitis vinifera]   110   3e-21
ref|XP_006490837.1| PREDICTED: uncharacterized protein LOC102624...   110   6e-21
emb|CAN77247.1| hypothetical protein VITISV_021658 [Vitis vinifera]   109   8e-21
emb|CAN76474.1| hypothetical protein VITISV_016008 [Vitis vinifera]   108   2e-20
ref|XP_006606762.1| PREDICTED: uncharacterized protein LOC102662...   106   8e-20
ref|XP_006380063.1| hypothetical protein POPTR_0008s20820g [Popu...   106   8e-20
gb|AAW28576.2| Gag-pol polyprotein, putative [Solanum demissum]       105   2e-19
gb|AAW28578.1| Putative gag-pol polyprotein, identical [Solanum ...   105   2e-19
gb|AAW28577.1| Putative gag-pol polyprotein, identical [Solanum ...   105   2e-19
emb|CAN67751.1| hypothetical protein VITISV_030106 [Vitis vinifera]   103   5e-19
ref|XP_004515184.1| PREDICTED: uncharacterized protein LOC101510...   102   9e-19
ref|XP_002302802.2| hypothetical protein POPTR_0002s18940g [Popu...    96   1e-16
ref|XP_006402401.1| hypothetical protein EUTSA_v10006305mg [Eutr...    95   2e-16
ref|XP_006574103.1| PREDICTED: uncharacterized protein LOC102664...    95   3e-16
gb|AAV88076.1| putative retrotransposon polyprotein [Ipomoea bat...    94   4e-16
ref|XP_006574160.1| PREDICTED: uncharacterized protein LOC102669...    92   1e-15
ref|XP_002876666.1| DNAJ heat shock N-terminal domain-containing...    92   2e-15

>emb|CAN77900.1| hypothetical protein VITISV_037350 [Vitis vinifera]
          Length = 1173

 Score =  114 bits (285), Expect = 3e-22
 Identities = 61/106 (57%), Positives = 74/106 (69%)
 Frame = +2

Query: 1745 DEANDSDLVEEFSKIRQRGTVEDYIDFESGDWVWVQLRKERFYILRKTKLDARGDGPFQV 1924
            DE  +   V + +K R++      + FESGDWVWV +RKERF   R++KL  RGDGPFQV
Sbjct: 1030 DEKKNEQYVTKANKGRRQ------VLFESGDWVWVHMRKERFPTRRQSKLHPRGDGPFQV 1083

Query: 1925 LKRINDNAYKIDLHGEQNDSATFNVSDLSLFDMDVDSRKNPNRIRG 2062
            L+RINDNAYK+DL GE N SATF VSDLS F++  DSR NP   RG
Sbjct: 1084 LERINDNAYKLDLLGEYNISATFKVSDLSPFNVGDDSRTNPFEERG 1129


>emb|CAN68499.1| hypothetical protein VITISV_041099 [Vitis vinifera]
          Length = 1115

 Score =  112 bits (279), Expect = 2e-21
 Identities = 56/80 (70%), Positives = 62/80 (77%)
 Frame = +2

Query: 1823 FESGDWVWVQLRKERFYILRKTKLDARGDGPFQVLKRINDNAYKIDLHGEQNDSATFNVS 2002
            FE GDWV V +RKERF   R++KL  RGDGPFQVL+RINDNAYK+DL GE N SATFNVS
Sbjct: 1002 FEPGDWVXVHMRKERFPTCRQSKLHPRGDGPFQVLERINDNAYKLDLPGEYNISATFNVS 1061

Query: 2003 DLSLFDMDVDSRKNPNRIRG 2062
            DLS FD+  DSR NP   RG
Sbjct: 1062 DLSPFDVSDDSRTNPFEKRG 1081


>ref|XP_006848921.1| hypothetical protein AMTR_s02100p00009220, partial [Amborella
            trichopoda] gi|548852379|gb|ERN10502.1| hypothetical
            protein AMTR_s02100p00009220, partial [Amborella
            trichopoda]
          Length = 183

 Score =  111 bits (277), Expect = 3e-21
 Identities = 61/112 (54%), Positives = 74/112 (66%), Gaps = 3/112 (2%)
 Frame = +2

Query: 1736 KRFDEA---NDSDLVEEFSKIRQRGTVEDYIDFESGDWVWVQLRKERFYILRKTKLDARG 1906
            K+F E    N     E++ K   +G  +    FE GDWVW+ +RKERF   R++KL  RG
Sbjct: 14   KKFHERARLNIERRTEQYLKQANKGRHKQV--FEPGDWVWLHMRKERFPTQRRSKLLPRG 71

Query: 1907 DGPFQVLKRINDNAYKIDLHGEQNDSATFNVSDLSLFDMDVDSRKNPNRIRG 2062
            DGPFQVL+RINDNAYK+DL GE N SATFNVSDLS FD   D R NP++  G
Sbjct: 72   DGPFQVLERINDNAYKLDLPGEYNVSATFNVSDLSPFDTGEDLRTNPSKEGG 123


>emb|CAN74973.1| hypothetical protein VITISV_001042 [Vitis vinifera]
          Length = 1281

 Score =  110 bits (276), Expect = 3e-21
 Identities = 54/80 (67%), Positives = 61/80 (76%)
 Frame = +2

Query: 1823 FESGDWVWVQLRKERFYILRKTKLDARGDGPFQVLKRINDNAYKIDLHGEQNDSATFNVS 2002
            FE GDWVWV +RKERF   R +KL  RGDGPFQVL+RINDNAYK+D+ GE N SATFNVS
Sbjct: 1157 FEPGDWVWVHMRKERFPTRRXSKLHPRGDGPFQVLERINDNAYKLDIPGEYNISATFNVS 1216

Query: 2003 DLSLFDMDVDSRKNPNRIRG 2062
            DLS FD+  DS  NP   +G
Sbjct: 1217 DLSPFDVGDDSXTNPFEEKG 1236


>ref|XP_006490837.1| PREDICTED: uncharacterized protein LOC102624555 [Citrus sinensis]
          Length = 821

 Score =  110 bits (274), Expect = 6e-21
 Identities = 54/80 (67%), Positives = 61/80 (76%)
 Frame = +2

Query: 1823 FESGDWVWVQLRKERFYILRKTKLDARGDGPFQVLKRINDNAYKIDLHGEQNDSATFNVS 2002
            F+ GDWVWV +RKERF   R++KL  RGDGPFQV+ RINDNAYK+DL GE N  ATFNVS
Sbjct: 403  FQPGDWVWVHMRKERFPAQRRSKLLPRGDGPFQVVARINDNAYKLDLPGEYNVIATFNVS 462

Query: 2003 DLSLFDMDVDSRKNPNRIRG 2062
            DLS FD+  DSR NP   RG
Sbjct: 463  DLSPFDVGEDSRTNPFEERG 482


>emb|CAN77247.1| hypothetical protein VITISV_021658 [Vitis vinifera]
          Length = 1323

 Score =  109 bits (273), Expect = 8e-21
 Identities = 53/75 (70%), Positives = 59/75 (78%)
 Frame = +2

Query: 1823 FESGDWVWVQLRKERFYILRKTKLDARGDGPFQVLKRINDNAYKIDLHGEQNDSATFNVS 2002
            FE GDWVWV +RKERF   R++KL  RGDGPFQVL+RINDNAYK+DL  E N SATFNV 
Sbjct: 1178 FEPGDWVWVHMRKERFPTRRRSKLHPRGDGPFQVLERINDNAYKLDLPXEYNISATFNVX 1237

Query: 2003 DLSLFDMDVDSRKNP 2047
            DLS FD+  DSR NP
Sbjct: 1238 DLSPFDVGDDSRTNP 1252


>emb|CAN76474.1| hypothetical protein VITISV_016008 [Vitis vinifera]
          Length = 519

 Score =  108 bits (270), Expect = 2e-20
 Identities = 54/80 (67%), Positives = 60/80 (75%)
 Frame = +2

Query: 1823 FESGDWVWVQLRKERFYILRKTKLDARGDGPFQVLKRINDNAYKIDLHGEQNDSATFNVS 2002
            FE GDWVWV +RK RF    ++KL  RGDGPFQVL+RINDNAYK+DL GE N SAT NVS
Sbjct: 374  FEPGDWVWVHIRKGRFPTCXQSKLHPRGDGPFQVLERINDNAYKLDLPGEYNISATVNVS 433

Query: 2003 DLSLFDMDVDSRKNPNRIRG 2062
            DLS FD+  DSR NP   RG
Sbjct: 434  DLSPFDVGDDSRTNPFEERG 453


>ref|XP_006606762.1| PREDICTED: uncharacterized protein LOC102662828 [Glycine max]
          Length = 612

 Score =  106 bits (264), Expect = 8e-20
 Identities = 54/82 (65%), Positives = 59/82 (71%)
 Frame = +2

Query: 1781 SKIRQRGTVEDYIDFESGDWVWVQLRKERFYILRKTKLDARGDGPFQVLKRINDNAYKID 1960
            S  RQ       +  E GDWVWV L+KERF   RK+KL  RGDGPFQVL++INDNAYKID
Sbjct: 520  SYARQANKSRKKVVLEPGDWVWVHLKKERFPEHRKSKLQPRGDGPFQVLEKINDNAYKID 579

Query: 1961 LHGEQNDSATFNVSDLSLFDMD 2026
            L  E N SATFNVSDLSLFD D
Sbjct: 580  LPNEYNVSATFNVSDLSLFDAD 601


>ref|XP_006380063.1| hypothetical protein POPTR_0008s20820g [Populus trichocarpa]
            gi|550333562|gb|ERP57860.1| hypothetical protein
            POPTR_0008s20820g [Populus trichocarpa]
          Length = 826

 Score =  106 bits (264), Expect = 8e-20
 Identities = 50/73 (68%), Positives = 59/73 (80%)
 Frame = +2

Query: 1823 FESGDWVWVQLRKERFYILRKTKLDARGDGPFQVLKRINDNAYKIDLHGEQNDSATFNVS 2002
            F+ GDWVWV +RKERF   RK+KL   GDGPFQVL+RINDNAYKID+ GE   SA FNV+
Sbjct: 33   FQPGDWVWVYMRKERFSNQRKSKLQPCGDGPFQVLERINDNAYKIDIPGEYGVSAIFNVA 92

Query: 2003 DLSLFDMDVDSRK 2041
            DL+LFD+D DSR+
Sbjct: 93   DLTLFDIDFDSRR 105


>gb|AAW28576.2| Gag-pol polyprotein, putative [Solanum demissum]
          Length = 1096

 Score =  105 bits (261), Expect = 2e-19
 Identities = 52/84 (61%), Positives = 62/84 (73%)
 Frame = +2

Query: 1793 QRGTVEDYIDFESGDWVWVQLRKERFYILRKTKLDARGDGPFQVLKRINDNAYKIDLHGE 1972
            +R     Y+ F+ GD VWV +RKERF   RKTKLD RG GP++VL+RI DNAYK+DL GE
Sbjct: 928  RRNKGRKYVIFKPGDLVWVHMRKERFPSKRKTKLDPRGSGPYKVLERIGDNAYKLDLPGE 987

Query: 1973 QNDSATFNVSDLSLFDMDVDSRKN 2044
               SATFNVSDLS +D D+DSR N
Sbjct: 988  FQVSATFNVSDLSHYDADLDSRTN 1011


>gb|AAW28578.1| Putative gag-pol polyprotein, identical [Solanum demissum]
          Length = 1588

 Score =  105 bits (261), Expect = 2e-19
 Identities = 52/84 (61%), Positives = 62/84 (73%)
 Frame = +2

Query: 1793 QRGTVEDYIDFESGDWVWVQLRKERFYILRKTKLDARGDGPFQVLKRINDNAYKIDLHGE 1972
            +R     Y+ F+ GD VWV +RKERF   RKTKLD RG GP++VL+RI DNAYK+DL GE
Sbjct: 1420 RRNKGRKYVIFKPGDLVWVHMRKERFPSKRKTKLDPRGSGPYKVLERIGDNAYKLDLPGE 1479

Query: 1973 QNDSATFNVSDLSLFDMDVDSRKN 2044
               SATFNVSDLS +D D+DSR N
Sbjct: 1480 FQVSATFNVSDLSHYDADLDSRTN 1503


>gb|AAW28577.1| Putative gag-pol polyprotein, identical [Solanum demissum]
          Length = 1588

 Score =  105 bits (261), Expect = 2e-19
 Identities = 52/84 (61%), Positives = 62/84 (73%)
 Frame = +2

Query: 1793 QRGTVEDYIDFESGDWVWVQLRKERFYILRKTKLDARGDGPFQVLKRINDNAYKIDLHGE 1972
            +R     Y+ F+ GD VWV +RKERF   RKTKLD RG GP++VL+RI DNAYK+DL GE
Sbjct: 1420 RRNKGRKYVIFKPGDLVWVHMRKERFPSKRKTKLDPRGSGPYKVLERIGDNAYKLDLPGE 1479

Query: 1973 QNDSATFNVSDLSLFDMDVDSRKN 2044
               SATFNVSDLS +D D+DSR N
Sbjct: 1480 FQVSATFNVSDLSHYDADLDSRTN 1503


>emb|CAN67751.1| hypothetical protein VITISV_030106 [Vitis vinifera]
          Length = 376

 Score =  103 bits (257), Expect = 5e-19
 Identities = 55/105 (52%), Positives = 69/105 (65%)
 Frame = +2

Query: 1787 IRQRGTVEDYIDFESGDWVWVQLRKERFYILRKTKLDARGDGPFQVLKRINDNAYKIDLH 1966
            I+Q       + FE  DWVW+ +RKERF   R++KL  RG+ PFQVL+RINDNAYK+DL 
Sbjct: 261  IKQANKGHQQLIFEPEDWVWLHMRKERFPAQRQSKLLLRGNEPFQVLERINDNAYKLDLL 320

Query: 1967 GEQNDSATFNVSDLSLFDMDVDSRKNPNRIRGR*YDPGMLKKQNS 2101
            GE N  ATFNV+DLS FD+  D R+NP +  G   D G   K N+
Sbjct: 321  GEYNVKATFNVTDLSPFDVGDDLRRNPFQDEGN--DEGTTNKWNT 363


>ref|XP_004515184.1| PREDICTED: uncharacterized protein LOC101510243 [Cicer arietinum]
          Length = 593

 Score =  102 bits (255), Expect = 9e-19
 Identities = 59/110 (53%), Positives = 67/110 (60%), Gaps = 3/110 (2%)
 Frame = +2

Query: 1781 SKIRQRGTVEDYIDFESGDWVWVQLRKERFYILRKTKLDARGDGPFQVLKRINDNAYKID 1960
            S  RQ       I FE GDWVWV LRKERF    K+KL  RGDGPFQVL +INDNAYKID
Sbjct: 428  SYARQANKRRKKIVFEPGDWVWVHLRKERFPSQMKSKLQPRGDGPFQVLSKINDNAYKID 487

Query: 1961 LHGEQNDSATFNVSDLSLFDM---DVDSRKNPNRIRGR*YDPGMLKKQNS 2101
            L GE   S TFNV+DLSL+D    D + R N  +  G   D   + K+ S
Sbjct: 488  LPGEYGVSLTFNVADLSLYDAAEEDCNLRANSVQEGGNDEDIECITKEES 537


>ref|XP_002302802.2| hypothetical protein POPTR_0002s18940g [Populus trichocarpa]
            gi|550345342|gb|EEE82075.2| hypothetical protein
            POPTR_0002s18940g [Populus trichocarpa]
          Length = 126

 Score = 95.5 bits (236), Expect = 1e-16
 Identities = 49/94 (52%), Positives = 68/94 (72%), Gaps = 2/94 (2%)
 Frame = -3

Query: 3512 IAGRREDRLSATPPGYVHIIQ--MHIEEAKSLLGFPPDSHPSVSQVKAAYKTKVWDTHPD 3339
            +AG+   + +   P    ++Q  M  +EAK LLGFPP+S P++SQVKAAY+ KVW++HPD
Sbjct: 1    MAGKEAAKTNNNNPSKRRLLQKEMQGDEAKVLLGFPPNSRPTLSQVKAAYRKKVWESHPD 60

Query: 3338 RFPPHQRSGAEHQFKLISEAYSCLRYGNIFSLMF 3237
             FP H++ GAE +FKLISEAY+ L+ GN  SL+F
Sbjct: 61   LFPLHEKPGAESKFKLISEAYTYLQTGN-SSLLF 93


>ref|XP_006402401.1| hypothetical protein EUTSA_v10006305mg [Eutrema salsugineum]
            gi|557103500|gb|ESQ43854.1| hypothetical protein
            EUTSA_v10006305mg [Eutrema salsugineum]
          Length = 138

 Score = 95.1 bits (235), Expect = 2e-16
 Identities = 44/66 (66%), Positives = 52/66 (78%)
 Frame = -3

Query: 3449 MHIEEAKSLLGFPPDSHPSVSQVKAAYKTKVWDTHPDRFPPHQRSGAEHQFKLISEAYSC 3270
            M +EEAK LLGFPP+S P  SQVKAAY+ KVW++HPD FP  Q+ GAE +FK ISEAYSC
Sbjct: 1    MQVEEAKILLGFPPNSRPDPSQVKAAYRKKVWESHPDLFPDDQKLGAESKFKSISEAYSC 60

Query: 3269 LRYGNI 3252
            L  G+I
Sbjct: 61   LESGDI 66


>ref|XP_006574103.1| PREDICTED: uncharacterized protein LOC102664315 [Glycine max]
          Length = 198

 Score = 94.7 bits (234), Expect = 3e-16
 Identities = 55/102 (53%), Positives = 67/102 (65%), Gaps = 3/102 (2%)
 Frame = +2

Query: 1751 ANDSDLVEEFSKIRQRGTVEDYIDFESGDWVWVQLRKERFYILRKTKLDARGDGPFQVLK 1930
            AN     E++ K   +G V   + FE GDWVWV +RKERF    K+KL  RG+ PFQV++
Sbjct: 41   ANIEKKNEQYEKQANKGHVR--VIFEPGDWVWVHMRKERFPAQIKSKLQPRGNRPFQVVE 98

Query: 1931 RINDNAYKIDLHGEQ-NDSATFNVSDLSLFDMD--VDSRKNP 2047
             IN NAYK+DL  E  N SATFNV+DLSLFD+    DSR NP
Sbjct: 99   NINGNAYKLDLPREYGNISATFNVADLSLFDVGNRSDSRMNP 140


>gb|AAV88076.1| putative retrotransposon polyprotein [Ipomoea batatas]
          Length = 1358

 Score = 94.0 bits (232), Expect = 4e-16
 Identities = 47/76 (61%), Positives = 56/76 (73%)
 Frame = +2

Query: 1817 IDFESGDWVWVQLRKERFYILRKTKLDARGDGPFQVLKRINDNAYKIDLHGEQNDSATFN 1996
            I F+ GDWVW+   K RF   RK+KL  RGDGPFQVL+RINDNAYK+DL GE + S+TFN
Sbjct: 1198 IIFKPGDWVWIHYSKNRFPNQRKSKLMPRGDGPFQVLERINDNAYKLDLRGEHSVSSTFN 1257

Query: 1997 VSDLSLFDMDVDSRKN 2044
            V+DL+ FD   DS  N
Sbjct: 1258 VADLAPFDFS-DSGTN 1272


>ref|XP_006574160.1| PREDICTED: uncharacterized protein LOC102669103 [Glycine max]
          Length = 220

 Score = 92.4 bits (228), Expect = 1e-15
 Identities = 51/95 (53%), Positives = 64/95 (67%), Gaps = 3/95 (3%)
 Frame = +2

Query: 1772 EEFSKIRQRGTVEDYIDFESGDWVWVQLRKERFYILRKTKLDARGDGPFQVLKRINDNAY 1951
            E+++K   +  V+  + FE  DWVWV  RKERF   RK KL  RGDGPFQVL++INDNA 
Sbjct: 55   EQYAKQANKDRVK--VIFEPRDWVWVHTRKERFPTQRKPKLQPRGDGPFQVLEKINDNAN 112

Query: 1952 KIDLHGEQND-SATFNVSDLSLFDM--DVDSRKNP 2047
            K+DL GE    S TFNV+DLS F +  + +SR NP
Sbjct: 113  KLDLPGEYGSISITFNVADLSFFYVGNEANSRSNP 147


>ref|XP_002876666.1| DNAJ heat shock N-terminal domain-containing protein [Arabidopsis
            lyrata subsp. lyrata] gi|297322504|gb|EFH52925.1| DNAJ
            heat shock N-terminal domain-containing protein
            [Arabidopsis lyrata subsp. lyrata]
          Length = 138

 Score = 92.0 bits (227), Expect = 2e-15
 Identities = 42/66 (63%), Positives = 51/66 (77%)
 Frame = -3

Query: 3449 MHIEEAKSLLGFPPDSHPSVSQVKAAYKTKVWDTHPDRFPPHQRSGAEHQFKLISEAYSC 3270
            M +EEAK LLGFPP+S P  SQVKAAY+ KVW++HPD FP  Q+  AE +FK ISEAYSC
Sbjct: 1    MEVEEAKILLGFPPNSRPDPSQVKAAYRKKVWESHPDLFPDDQKQVAESKFKSISEAYSC 60

Query: 3269 LRYGNI 3252
            L  G++
Sbjct: 61   LESGDV 66


Top