BLASTX nr result

ID: Catharanthus22_contig00039859 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00039859
         (351 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006595311.1| PREDICTED: uncharacterized protein LOC102668...    95   1e-17
ref|XP_006579213.1| PREDICTED: uncharacterized protein LOC102670...    92   9e-17
ref|XP_006378155.1| hypothetical protein POPTR_0010s04250g, part...    79   8e-13
ref|NP_197389.1| RNA-directed DNA polymerase (reverse transcript...    68   1e-09
ref|XP_006605006.1| PREDICTED: uncharacterized protein LOC102669...    67   3e-09
gb|AAC33226.1| putative non-LTR retroelement reverse transcripta...    67   3e-09
ref|XP_004309472.1| PREDICTED: putative ribonuclease H protein A...    65   9e-09
dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like ...    64   2e-08
ref|XP_006588078.1| PREDICTED: uncharacterized protein LOC102665...    64   2e-08
ref|XP_004252416.1| PREDICTED: uncharacterized protein LOC101244...    64   2e-08
ref|XP_006595463.1| PREDICTED: uncharacterized protein LOC102660...    64   2e-08
ref|XP_006588848.1| PREDICTED: uncharacterized protein LOC102662...    64   2e-08
gb|EMS58832.1| Alpha-galactosidase [Triticum urartu]                   64   2e-08
ref|XP_004253503.1| PREDICTED: uncharacterized protein LOC101243...    64   2e-08
dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana]            63   3e-08
gb|AAC67331.1| putative non-LTR retroelement reverse transcripta...    63   5e-08
ref|XP_006584325.1| PREDICTED: uncharacterized protein LOC100811...    62   6e-08
ref|XP_004229147.1| PREDICTED: uncharacterized protein LOC101247...    62   8e-08
gb|AAD21699.1| Contains reverse transcriptase domain (rvt) PF|00...    62   8e-08
gb|ABD96948.1| hypothetical protein [Cleome spinosa]                   62   8e-08

>ref|XP_006595311.1| PREDICTED: uncharacterized protein LOC102668530 [Glycine max]
          Length = 477

 Score = 94.7 bits (234), Expect = 1e-17
 Identities = 44/112 (39%), Positives = 58/112 (51%)
 Frame = +3

Query: 9   LESWRKNQGIDTAKAYEYLAPAVPNKVWTKIVWNPIIPPKFSVHMWCTILGRLPTADRLV 188
           L SW  N+ +   KAY+Y+    P   W  +VWNP IP K S  +W      L T DR  
Sbjct: 289 LNSWNSNEQLLAGKAYDYIRGVKPAVNWNSVVWNPAIPSKMSFILWLATKNHLLTLDRAA 348

Query: 189 FLNIEGQCKLCKGPEESLAHLFFQCNFTRGIWESIREWAGLRRAMTTIQSCI 344
           FLN    C LC+   +S AHLFF C  +  +W +IR+W  L R   ++Q  I
Sbjct: 349 FLNKGLLCPLCRTKAKSHAHLFFSCRISLQVWANIRDWIPLHRQTISLQCTI 400


>ref|XP_006579213.1| PREDICTED: uncharacterized protein LOC102670237 [Glycine max]
          Length = 383

 Score = 91.7 bits (226), Expect = 9e-17
 Identities = 44/112 (39%), Positives = 57/112 (50%)
 Frame = +3

Query: 9   LESWRKNQGIDTAKAYEYLAPAVPNKVWTKIVWNPIIPPKFSVHMWCTILGRLPTADRLV 188
           L SW  N+     K Y+Y+    P   W+ I+WNP+IP K S  +W     RL   DR  
Sbjct: 260 LNSWGCNEQTLAGKMYDYIRGTRPVVHWSSIIWNPVIPSKMSFILWLATKNRLLALDRAA 319

Query: 189 FLNIEGQCKLCKGPEESLAHLFFQCNFTRGIWESIREWAGLRRAMTTIQSCI 344
           FLN    C LC    ES AHLFF C  +  +W  IR+W  L+R   ++Q  I
Sbjct: 320 FLNKGFLCPLCTNEAESHAHLFFSCRTSLRVWAHIRDWIPLKRQSISLQHSI 371


>ref|XP_006378155.1| hypothetical protein POPTR_0010s04250g, partial [Populus
           trichocarpa] gi|550329025|gb|ERP55952.1| hypothetical
           protein POPTR_0010s04250g, partial [Populus trichocarpa]
          Length = 112

 Score = 78.6 bits (192), Expect = 8e-13
 Identities = 38/113 (33%), Positives = 55/113 (48%)
 Frame = +3

Query: 9   LESWRKNQGIDTAKAYEYLAPAVPNKVWTKIVWNPIIPPKFSVHMWCTILGRLPTADRLV 188
           L SW    G  TA AY +    V +  W  +VW     P+ +  +W  +LG+L T DRL 
Sbjct: 2   LSSWHSRPGSFTANAYHFFTYKVDHVQWASVVWEQWFLPRHNFSLW--LLGKLRTRDRLQ 59

Query: 189 FLNIEGQCKLCKGPEESLAHLFFQCNFTRGIWESIREWAGLRRAMTTIQSCIK 347
           F++ +    LC    ES AHLFF C ++  +W   R W     +M T+   I+
Sbjct: 60  FISTDPLYPLCHNSSESHAHLFFSCAWSSSLWGKARYWLEFHSSMPTLNRVIR 112


>ref|NP_197389.1| RNA-directed DNA polymerase (reverse transcriptase)-related family
           protein [Arabidopsis thaliana]
           gi|332005241|gb|AED92624.1| RNA-directed DNA polymerase
           (reverse transcriptase)-related family protein
           [Arabidopsis thaliana]
          Length = 295

 Score = 67.8 bits (164), Expect = 1e-09
 Identities = 36/85 (42%), Positives = 48/85 (56%), Gaps = 2/85 (2%)
 Frame = +3

Query: 39  DTAKAYEYLAPAVPNKVWTKIVWNPIIPPKFSVHMWCTILGRLPTADRL--VFLNIEGQC 212
           DT +     +P VP   W K+VW     P+FS+  W + L RLPT DRL    +NI    
Sbjct: 113 DTWEQIRVHSPTVP---WAKVVWFKEYIPRFSLITWMSFLERLPTRDRLRGWGMNIPSSW 169

Query: 213 KLCKGPEESLAHLFFQCNFTRGIWE 287
            LC   +E+ AHLFF+C+F+  IWE
Sbjct: 170 VLCSNGDETHAHLFFECSFSLAIWE 194


>ref|XP_006605006.1| PREDICTED: uncharacterized protein LOC102669369 [Glycine max]
          Length = 1096

 Score = 66.6 bits (161), Expect = 3e-09
 Identities = 28/73 (38%), Positives = 42/73 (57%), Gaps = 3/73 (4%)
 Frame = +3

Query: 99   IVWNPIIPPKFSVHMWCTILGRLPTADRLVFLNI---EGQCKLCKGPEESLAHLFFQCNF 269
            IVW   +P K ++  W  +L RLPT D L+  N+     +C LC   +E++ HLFF C+F
Sbjct: 927  IVWKVPVPSKVALFCWRLLLDRLPTKDNLIRRNVVINNSRCSLCDSCDENVVHLFFHCDF 986

Query: 270  TRGIWESIREWAG 308
            +  IW+ +  W G
Sbjct: 987  SNCIWKEVLSWIG 999


>gb|AAC33226.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1529

 Score = 66.6 bits (161), Expect = 3e-09
 Identities = 32/77 (41%), Positives = 43/77 (55%), Gaps = 3/77 (3%)
 Frame = +3

Query: 78   PNKVWTKIVWNPIIPPKFSVHMWCTILGRLPTADRLVFLNIEGQ---CKLCKGPEESLAH 248
            P + W K VW P   PK+S  +W T+  RL T DR+   N  GQ   C LC   EE+  H
Sbjct: 1350 PQQNWYKGVWFPYSTPKYSFLLWLTVQNRLSTGDRIKAWN-SGQLVTCTLCNNAEETRDH 1408

Query: 249  LFFQCNFTRGIWESIRE 299
            LFF C +T  +WE++ +
Sbjct: 1409 LFFSCQYTSYVWEALTQ 1425


>ref|XP_004309472.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Fragaria
           vesca subsp. vesca]
          Length = 364

 Score = 65.1 bits (157), Expect = 9e-09
 Identities = 27/87 (31%), Positives = 49/87 (56%), Gaps = 2/87 (2%)
 Frame = +3

Query: 36  IDTAKAYEYLAPAVPNKVWTKIVWNPIIPPKFSVHMWCTILGRLPTADRLV--FLNIEGQ 209
           +   +A+++L P +P+  W K++W+  I P+ S+H W  + GR+ + D L    + +  +
Sbjct: 15  LSAKEAFQFLRPRLPSLDWGKLIWSKFIIPRISLHSWKVLRGRVLSEDLLQRRGIALASR 74

Query: 210 CKLCKGPEESLAHLFFQCNFTRGIWES 290
           C LC    ESL H+F  C+F   +W +
Sbjct: 75  CVLCGRDGESLPHIFLTCSFAASLWNN 101


>dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like [Arabidopsis
            thaliana]
          Length = 1072

 Score = 64.3 bits (155), Expect = 2e-08
 Identities = 33/90 (36%), Positives = 43/90 (47%), Gaps = 2/90 (2%)
 Frame = +3

Query: 30   QGIDTAKAYEYLAPAVPNKVWTKIVWNPIIPPKFSVHMWCTILGRLPTADRLVFLNI--E 203
            QG   AK +E L P  P K W K VW     PK + + W   L RLPT  RLV   +   
Sbjct: 890  QGFSAAKTWEVLRPRRPVKRWAKSVWFKGAVPKHAFNFWTAQLNRLPTRQRLVSWGLVSS 949

Query: 204  GQCKLCKGPEESLAHLFFQCNFTRGIWESI 293
             +C LC    E+  HL   C+F+  +W  +
Sbjct: 950  AECCLCSFDTETRDHLLLLCDFSSQVWRMV 979


>ref|XP_006588078.1| PREDICTED: uncharacterized protein LOC102665107 [Glycine max]
          Length = 189

 Score = 64.3 bits (155), Expect = 2e-08
 Identities = 35/96 (36%), Positives = 48/96 (50%), Gaps = 7/96 (7%)
 Frame = +3

Query: 18  WRKNQG--IDTAKAYEYLAPAVP--NKVWTKIVWNPIIPPKFSVHMWCTILGRLPTADRL 185
           W+ N+     T  AY+ L    P       KIVWN  +PP+ ++  W  IL RLPT   L
Sbjct: 94  WKANRSGIYSTKSAYKLLKTTTPIMEANILKIVWNLNVPPRAAIFSWRLILDRLPTRGNL 153

Query: 186 VFLNIEGQ---CKLCKGPEESLAHLFFQCNFTRGIW 284
           +  N++ Q   C LC   +E + HL F C  T G+W
Sbjct: 154 LRRNVQMQDTSCPLCGNAQEEVDHLVFNCEMTLGLW 189


>ref|XP_004252416.1| PREDICTED: uncharacterized protein LOC101244351 [Solanum
           lycopersicum]
          Length = 169

 Score = 64.3 bits (155), Expect = 2e-08
 Identities = 29/76 (38%), Positives = 42/76 (55%), Gaps = 2/76 (2%)
 Frame = +3

Query: 87  VWTKIVWNPIIPPKFSVHMWCTILGRLPTADRLVF--LNIEGQCKLCKGPEESLAHLFFQ 260
           +WT +++N    PK    MW  +  RL T DRL    + +E  C LC+  EE+  H+F Q
Sbjct: 8   IWTCLMFNNAARPKAYFTMWIMMNQRLVTVDRLAKWGVEVEKTCVLCENEEETAEHVFIQ 67

Query: 261 CNFTRGIWESIREWAG 308
           C+F RG+W  +  W G
Sbjct: 68  CSFARGLWGRLLNWTG 83


>ref|XP_006595463.1| PREDICTED: uncharacterized protein LOC102660851 [Glycine max]
          Length = 199

 Score = 63.9 bits (154), Expect = 2e-08
 Identities = 31/89 (34%), Positives = 43/89 (48%), Gaps = 2/89 (2%)
 Frame = +3

Query: 90  WTKIVWNPIIPPKFSVHMWCTILGRLPTADRLVFLNI--EGQCKLCKGPEESLAHLFFQC 263
           W  ++ N    P   + +W    GRLPT DRL    +  E  CKLCK   ESL HLFF+C
Sbjct: 41  WRFLMHNNHARPLAKLTLWMVCHGRLPTMDRLHRFGMIQETICKLCKEKNESLTHLFFEC 100

Query: 264 NFTRGIWESIREWAGLRRAMTTIQSCIKW 350
             T+ +W+ +  W  L   +      + W
Sbjct: 101 GMTKTVWDQVLHWLNLNHRIKGWNEELDW 129


>ref|XP_006588848.1| PREDICTED: uncharacterized protein LOC102662740 [Glycine max]
          Length = 292

 Score = 63.9 bits (154), Expect = 2e-08
 Identities = 35/93 (37%), Positives = 47/93 (50%), Gaps = 6/93 (6%)
 Frame = +3

Query: 42  TAKAYEYLAPAV---PNKVWTKIVWNPIIPPKFSVHMWCTILGRLPTADRLVFLNIEGQ- 209
           T  AY  L P++   P++   +I+W+  IPP+ +V  W   L RLPT   L   +I  Q 
Sbjct: 104 TKSAYRLLMPSISPAPSRRNFQILWHLKIPPRAAVFSWRLFLDRLPTRGNLSRRSIPIQD 163

Query: 210 --CKLCKGPEESLAHLFFQCNFTRGIWESIREW 302
             C LC    E   HLFF CN T+G+W     W
Sbjct: 164 IMCPLCGCQHEEAGHLFFHCNMTKGLWWESMRW 196


>gb|EMS58832.1| Alpha-galactosidase [Triticum urartu]
          Length = 561

 Score = 63.9 bits (154), Expect = 2e-08
 Identities = 38/103 (36%), Positives = 48/103 (46%), Gaps = 4/103 (3%)
 Frame = +3

Query: 15  SWR--KNQGIDTAKAYEYLAPAVPNKVWTKIVWNPIIPPKFSVHMWCTILGRLPTADRLV 188
           SW+  K+     A AY      + +    +IVW    PPK     W  I  R+ TADRL 
Sbjct: 435 SWKFSKDGQYSAAMAYSAQFLGLMDTDMNQIVWKNWAPPKCKFFTWLVINNRIWTADRLQ 494

Query: 189 FLNIEG--QCKLCKGPEESLAHLFFQCNFTRGIWESIREWAGL 311
                    C LCK  +ES AHL FQC FT  +W  ++ W GL
Sbjct: 495 RRGWPNCHLCPLCKQVQESAAHLLFQCRFTVRVWGMLKSWLGL 537


>ref|XP_004253503.1| PREDICTED: uncharacterized protein LOC101243694 [Solanum
           lycopersicum]
          Length = 177

 Score = 63.9 bits (154), Expect = 2e-08
 Identities = 34/98 (34%), Positives = 48/98 (48%), Gaps = 4/98 (4%)
 Frame = +3

Query: 54  YEYLAPAVPNKVWTKIVWNPIIPPKFSVHMWCTILGRLPTADRLVFLNI--EGQCKLCKG 227
           Y+YL        W  +++     PK    +W  +  +L T DRL    +  +  C LCKG
Sbjct: 6   YDYLRGDQAKPEWKGLMFKNAARPKAIFTLWILLNRKLATIDRLAKWGVVHDPTCVLCKG 65

Query: 228 PEESLAHLFFQCNFTRGIWESIREWAGL--RRAMTTIQ 335
            +ESL HLF QC++   +WE +  WAG    R  T IQ
Sbjct: 66  ADESLDHLFLQCHYAEEVWERVLTWAGFYNNRPRTWIQ 103


>dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana]
          Length = 1072

 Score = 63.2 bits (152), Expect = 3e-08
 Identities = 32/90 (35%), Positives = 43/90 (47%), Gaps = 2/90 (2%)
 Frame = +3

Query: 30   QGIDTAKAYEYLAPAVPNKVWTKIVWNPIIPPKFSVHMWCTILGRLPTADRLVFLNI--E 203
            QG   AK +E L P  P K W + VW     PK + + W   L RLPT  RLV   +   
Sbjct: 890  QGFSAAKTWEVLRPRRPVKRWARSVWFKGAVPKHAFNFWTAQLNRLPTRQRLVSWGLVSS 949

Query: 204  GQCKLCKGPEESLAHLFFQCNFTRGIWESI 293
             +C LC    E+  HL   C+F+  +W  +
Sbjct: 950  AECCLCSFDTETRDHLLLLCDFSSQVWRMV 979


>gb|AAC67331.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1449

 Score = 62.8 bits (151), Expect = 5e-08
 Identities = 28/68 (41%), Positives = 39/68 (57%), Gaps = 2/68 (2%)
 Frame = +3

Query: 90   WTKIVWNPIIPPKFSVHMWCTILGRLPTADRLVFLN--IEGQCKLCKGPEESLAHLFFQC 263
            W K VW     PKFS  +W  +  RL T D+++  N  ++G C LC+   ES  HLFF C
Sbjct: 1282 WHKGVWFTHSTPKFSFCVWLAVYDRLSTGDKMLLWNRGLQGTCLLCRNATESRDHLFFSC 1341

Query: 264  NFTRGIWE 287
            +F+  +WE
Sbjct: 1342 SFSSEVWE 1349


>ref|XP_006584325.1| PREDICTED: uncharacterized protein LOC100811880 [Glycine max]
          Length = 621

 Score = 62.4 bits (150), Expect = 6e-08
 Identities = 35/93 (37%), Positives = 46/93 (49%), Gaps = 6/93 (6%)
 Frame = +3

Query: 42  TAKAYEYLAPA---VPNKVWTKIVWNPIIPPKFSVHMWCTILGRLPTADRLVFLNIEGQ- 209
           T  AY  L P    +P++   +I+W+  IPP+ +V  W     RLPT   L   NI  Q 
Sbjct: 483 TKSAYSLLMPPSNPLPSRRNFQILWHLKIPPRAAVFSWRLFWDRLPTRGNLSRRNIPIQD 542

Query: 210 --CKLCKGPEESLAHLFFQCNFTRGIWESIREW 302
             C LC   +E   HLFF C+ TRG+W     W
Sbjct: 543 TMCPLCGSQQEEAGHLFFHCSMTRGLWWESMVW 575


>ref|XP_004229147.1| PREDICTED: uncharacterized protein LOC101247059 [Solanum
           lycopersicum]
          Length = 133

 Score = 62.0 bits (149), Expect = 8e-08
 Identities = 34/98 (34%), Positives = 48/98 (48%), Gaps = 4/98 (4%)
 Frame = +3

Query: 54  YEYLAPAVPNKVWTKIVWNPIIPPKFSVHMWCTILGRLPTADRLVF--LNIEGQCKLCKG 227
           Y+YL    P  VW  +++     PK    +W  +  +L T DRL    L  +  C LC  
Sbjct: 6   YDYLRGEKPKPVWKCLMFKNTERPKAIFTLWILMYRKLATVDRLAKWGLTHDTACVLCTN 65

Query: 228 PEESLAHLFFQCNFTRGIWESIREWAGL--RRAMTTIQ 335
            +ESL H+F QC++   +WE +  W GL   RA T  Q
Sbjct: 66  MDESLDHMFLQCHYVGEVWERVLAWDGLHNNRAKTWTQ 103


>gb|AAD21699.1| Contains reverse transcriptase domain (rvt) PF|00078 [Arabidopsis
            thaliana]
          Length = 1253

 Score = 62.0 bits (149), Expect = 8e-08
 Identities = 33/89 (37%), Positives = 43/89 (48%), Gaps = 2/89 (2%)
 Frame = +3

Query: 33   GIDTAKAYEYLAPAVPNKVWTKIVWNPIIPPKFSVHMWCTILGRLPTADRLVFLNI--EG 206
            G   A+ +E + P  P K WTK VW     PK + +MW + L RLPT  RL    +    
Sbjct: 903  GFSAARTWEAMRPKKPVKDWTKSVWFKGSVPKHAFNMWVSHLNRLPTRQRLAAWGVTTTT 962

Query: 207  QCKLCKGPEESLAHLFFQCNFTRGIWESI 293
             C LC    ES  HL   C F+  IW+ +
Sbjct: 963  DCCLCSSRPESRDHLLLYCVFSAVIWKLV 991


>gb|ABD96948.1| hypothetical protein [Cleome spinosa]
          Length = 539

 Score = 62.0 bits (149), Expect = 8e-08
 Identities = 31/76 (40%), Positives = 41/76 (53%), Gaps = 4/76 (5%)
 Frame = +3

Query: 90  WTKIVWNPIIPPKFSVHMWCTILGRLPTADRLVFLNI--EGQCKLCKGPEESLAHLFFQC 263
           W+ IVW P+  P+ +   W  +L RLPT DRL    I  +  C+LC G +ES  HLFF C
Sbjct: 428 WSSIVWFPLAIPRHAFLHWQVMLFRLPTKDRLQQWGITSDATCRLCDGEDESHQHLFFGC 487

Query: 264 NFTRGIWESIRE--WA 305
            +   +W    E  WA
Sbjct: 488 TYASHLWRHFGEVCWA 503


Top