BLASTX nr result

ID: Catharanthus22_contig00017083 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00017083
         (684 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CCA66009.1| hypothetical protein [Beta vulgaris subsp. vulga...    94   3e-17
dbj|BAE79384.1| unnamed protein product [Ipomoea batatas]              93   7e-17
dbj|BAE79385.1| unnamed protein product [Ipomoea batatas]              93   7e-17
dbj|BAE79382.1| unnamed protein product [Ipomoea batatas]              93   7e-17
ref|XP_004289445.1| PREDICTED: putative ribonuclease H protein A...    85   2e-14
emb|CAB10337.1| reverse transcriptase like protein [Arabidopsis ...    85   2e-14
ref|XP_004306074.1| PREDICTED: putative ribonuclease H protein A...    80   6e-13
gb|AAC63844.1| putative non-LTR retroelement reverse transcripta...    79   1e-12
sp|P0C2F6.1|RNHX1_ARATH RecName: Full=Putative ribonuclease H pr...    78   3e-12
gb|EMT28796.1| Ferredoxin-dependent glutamate synthase, chloropl...    77   5e-12
gb|AAD37021.1| putative non-LTR retrolelement reverse transcript...    77   6e-12
gb|EOY24339.1| RNA-directed DNA polymerase (Reverse transcriptas...    75   2e-11
emb|CAB78008.1| putative protein [Arabidopsis thaliana] gi|73210...    75   2e-11
ref|XP_004233672.1| PREDICTED: uncharacterized protein LOC101266...    74   4e-11
emb|CAN78577.1| hypothetical protein VITISV_020585 [Vitis vinifera]    74   5e-11
emb|CAN70399.1| hypothetical protein VITISV_023214 [Vitis vinifera]    74   5e-11
gb|EMS67416.1| Protein argonaute 1D [Triticum urartu]                  73   9e-11
emb|CAN77369.1| hypothetical protein VITISV_033118 [Vitis vinifera]    73   9e-11
gb|EMT20940.1| ABC transporter C family member 10 [Aegilops taus...    72   1e-10
emb|CAN69899.1| hypothetical protein VITISV_029782 [Vitis vinifera]    72   1e-10

>emb|CCA66009.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1378

 Score = 94.4 bits (233), Expect = 3e-17
 Identities = 42/100 (42%), Positives = 66/100 (66%)
 Frame = -2

Query: 302  RSILIQTTKSTVPFYSRQLIKLPASTIKELENMNRNFLWGDEKDKRELHTIDWHKIRKPN 123
            R+ LIQ+  S++P+Y+ Q  KLP ST  +++  +R+FLWG+++ KR +H + W  I K  
Sbjct: 795  RATLIQSAFSSIPYYTMQSTKLPRSTCDDIDRKSRSFLWGEQEGKRRVHLVAWENISKSK 854

Query: 122  AKGGLGIHLLKETNQSFLIKLA*R*LKEKYALWVSLLRDK 3
             +GGLGI  +++ N +FL+KL  R L E  +LW  +LR K
Sbjct: 855  KEGGLGIRSMRQANSAFLVKLGWRLLAEPSSLWSRILRAK 894


>dbj|BAE79384.1| unnamed protein product [Ipomoea batatas]
          Length = 1898

 Score = 93.2 bits (230), Expect = 7e-17
 Identities = 41/100 (41%), Positives = 68/100 (68%)
 Frame = -2

Query: 302  RSILIQTTKSTVPFYSRQLIKLPASTIKELENMNRNFLWGDEKDKRELHTIDWHKIRKPN 123
            R +L+Q + +TVP Y+ Q++ LP ST  E++   RNFLWG + + R+LH+++W +I KP 
Sbjct: 1325 RRVLVQASLATVPTYTMQVMALPVSTCNEIDKTCRNFLWGHDTNTRKLHSVNWAEICKPR 1384

Query: 122  AKGGLGIHLLKETNQSFLIKLA*R*LKEKYALWVSLLRDK 3
             +GGLG+ + ++ N++FL K+A +       LWV +LR+K
Sbjct: 1385 NEGGLGLRMARDFNRAFLTKMAWQIFSNIDKLWVKVLREK 1424


>dbj|BAE79385.1| unnamed protein product [Ipomoea batatas]
          Length = 1366

 Score = 93.2 bits (230), Expect = 7e-17
 Identities = 41/100 (41%), Positives = 68/100 (68%)
 Frame = -2

Query: 302  RSILIQTTKSTVPFYSRQLIKLPASTIKELENMNRNFLWGDEKDKRELHTIDWHKIRKPN 123
            R +L+Q + +TVP Y+ Q++ LP ST  E++   RNFLWG + + R+LH+++W +I KP 
Sbjct: 793  RRVLVQASLATVPTYTMQVMALPVSTCNEIDKTCRNFLWGHDTNTRKLHSVNWAEICKPR 852

Query: 122  AKGGLGIHLLKETNQSFLIKLA*R*LKEKYALWVSLLRDK 3
             +GGLG+ + ++ N++FL K+A +       LWV +LR+K
Sbjct: 853  NEGGLGLRMARDFNRAFLTKMAWQIFSNIDKLWVKVLREK 892


>dbj|BAE79382.1| unnamed protein product [Ipomoea batatas]
          Length = 1366

 Score = 93.2 bits (230), Expect = 7e-17
 Identities = 41/100 (41%), Positives = 68/100 (68%)
 Frame = -2

Query: 302  RSILIQTTKSTVPFYSRQLIKLPASTIKELENMNRNFLWGDEKDKRELHTIDWHKIRKPN 123
            R +L+Q + +TVP Y+ Q++ LP ST  E++   RNFLWG + + R+LH+++W +I KP 
Sbjct: 793  RRVLVQASLATVPTYTMQVMALPVSTCNEIDKTCRNFLWGHDTNTRKLHSVNWAEICKPR 852

Query: 122  AKGGLGIHLLKETNQSFLIKLA*R*LKEKYALWVSLLRDK 3
             +GGLG+ + ++ N++FL K+A +       LWV +LR+K
Sbjct: 853  NEGGLGLRMARDFNRAFLTKMAWQIFSNIDKLWVKVLREK 892


>ref|XP_004289445.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Fragaria
           vesca subsp. vesca]
          Length = 719

 Score = 85.1 bits (209), Expect = 2e-14
 Identities = 35/100 (35%), Positives = 59/100 (59%)
 Frame = -2

Query: 302 RSILIQTTKSTVPFYSRQLIKLPASTIKELENMNRNFLWGDEKDKRELHTIDWHKIRKPN 123
           R  L+Q+  +++P Y+ Q  KLP S  + ++  N NFLWGD  +K+++H ++W  + KP 
Sbjct: 340 RLTLVQSVTASIPIYAMQTAKLPLSLCESIDKANINFLWGDSNEKKKIHLVNWESVCKPK 399

Query: 122 AKGGLGIHLLKETNQSFLIKLA*R*LKEKYALWVSLLRDK 3
            +GGLG+    + NQ+ L K + R  +  + LW  + R K
Sbjct: 400 HRGGLGLKKTADMNQAMLAKASWRIFQNDHGLWADIYRKK 439


>emb|CAB10337.1| reverse transcriptase like protein [Arabidopsis thaliana]
           gi|7268307|emb|CAB78601.1| reverse transcriptase like
           protein [Arabidopsis thaliana]
          Length = 929

 Score = 84.7 bits (208), Expect = 2e-14
 Identities = 38/100 (38%), Positives = 62/100 (62%)
 Frame = -2

Query: 302 RSILIQTTKSTVPFYSRQLIKLPASTIKELENMNRNFLWGDEKDKRELHTIDWHKIRKPN 123
           R  L +    ++P ++   I LPAS +++L+ ++RNFLWG   +KR+ H + W K+ +P 
Sbjct: 587 RITLTKAVLMSIPIHTMSSILLPASLLEQLDKVSRNFLWGSTVEKRKQHLLSWKKVCRPK 646

Query: 122 AKGGLGIHLLKETNQSFLIKLA*R*LKEKYALWVSLLRDK 3
           A GGLG+   K+ N++ L K+  R L +K +LW  +LR K
Sbjct: 647 AAGGLGLRASKDMNRALLAKVGWRLLNDKVSLWARVLRRK 686


>ref|XP_004306074.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Fragaria
           vesca subsp. vesca]
          Length = 407

 Score = 80.1 bits (196), Expect = 6e-13
 Identities = 36/100 (36%), Positives = 59/100 (59%)
 Frame = -2

Query: 302 RSILIQTTKSTVPFYSRQLIKLPASTIKELENMNRNFLWGDEKDKRELHTIDWHKIRKPN 123
           R  LIQ+  S +P Y+ Q  K P S  + L+ +NRNFLWGD + K+++H ++W  + +P 
Sbjct: 52  RLTLIQSVSSAIPNYAMQTAKFPVSLCENLDKLNRNFLWGDTEIKKKVHLVNWDVVCQPK 111

Query: 122 AKGGLGIHLLKETNQSFLIKLA*R*LKEKYALWVSLLRDK 3
             G +GI   ++ NQ+ L K++ R  +    LW S+  +K
Sbjct: 112 QLGDIGIKKTEDMNQAMLAKISWRMFQCDKGLWASMFAEK 151


>gb|AAC63844.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
           thaliana]
          Length = 1231

 Score = 79.3 bits (194), Expect = 1e-12
 Identities = 35/100 (35%), Positives = 60/100 (60%)
 Frame = -2

Query: 302 RSILIQTTKSTVPFYSRQLIKLPASTIKELENMNRNFLWGDEKDKRELHTIDWHKIRKPN 123
           R  L +   S++P +    I LP ST+  L+  +R FLWG   +K++ H + W KI KP 
Sbjct: 652 RITLTKAVLSSIPVHVMSAILLPVSTLDTLDRYSRTFLWGSTMEKKKQHLLSWRKICKPK 711

Query: 122 AKGGLGIHLLKETNQSFLIKLA*R*LKEKYALWVSLLRDK 3
           A+GG+G+   ++ N++ + K+  R L++K +LW  ++R K
Sbjct: 712 AEGGIGLRSARDMNKALVAKVGWRLLQDKESLWARVVRKK 751


>sp|P0C2F6.1|RNHX1_ARATH RecName: Full=Putative ribonuclease H protein At1g65750
          Length = 620

 Score = 77.8 bits (190), Expect = 3e-12
 Identities = 35/100 (35%), Positives = 58/100 (58%)
 Frame = -2

Query: 302 RSILIQTTKSTVPFYSRQLIKLPASTIKELENMNRNFLWGDEKDKRELHTIDWHKIRKPN 123
           R  L +   S++P +S   I LP S +  L+ ++R FLWG   +K++ H + W K+  P 
Sbjct: 37  RLTLTKAVLSSMPVHSMSTILLPQSILNRLDQLSRTFLWGSTAEKKKQHLVKWSKVCSPK 96

Query: 122 AKGGLGIHLLKETNQSFLIKLA*R*LKEKYALWVSLLRDK 3
            +GGLG+   K  N++ + K+  R L+EK +LW  +L+ K
Sbjct: 97  KEGGLGVRAAKSMNRALISKVGWRLLQEKNSLWTLVLQKK 136


>gb|EMT28796.1| Ferredoxin-dependent glutamate synthase, chloroplastic [Aegilops
           tauschii]
          Length = 1896

 Score = 77.0 bits (188), Expect = 5e-12
 Identities = 40/110 (36%), Positives = 62/110 (56%)
 Frame = -2

Query: 332 KNYCKSGKPARSILIQTTKSTVPFYSRQLIKLPASTIKELENMNRNFLWGDEKDKRELHT 153
           +N+   G   +  LI+T    +P Y+  + K P S  +EL  + RNF WGDE+D+R+ H 
Sbjct: 153 ENFASRG--VKEDLIKTVIEALPVYAMGIFKFPVSLCEELSQIIRNFWWGDEEDRRKTHW 210

Query: 152 IDWHKIRKPNAKGGLGIHLLKETNQSFLIKLA*R*LKEKYALWVSLLRDK 3
           + W+K+ +P  KGG+G   L+  NQ+ L K A R L    +L   L++ K
Sbjct: 211 LAWNKLTRPKGKGGMGFQDLRLFNQALLAKHAWRLLVYPDSLCAKLMKAK 260


>gb|AAD37021.1| putative non-LTR retrolelement reverse transcriptase [Arabidopsis
           thaliana]
          Length = 732

 Score = 76.6 bits (187), Expect = 6e-12
 Identities = 33/98 (33%), Positives = 61/98 (62%)
 Frame = -2

Query: 302 RSILIQTTKSTVPFYSRQLIKLPASTIKELENMNRNFLWGDEKDKRELHTIDWHKIRKPN 123
           R  L +   S++P ++   I LP ST+  L+ ++R+FLWG    +R+ H I W ++ KP 
Sbjct: 314 RVTLTKAVLSSIPVHTMSTIALPKSTLDGLDKVSRSFLWGSSVTQRKQHLISWKRVCKPR 373

Query: 122 AKGGLGIHLLKETNQSFLIKLA*R*LKEKYALWVSLLR 9
           ++GGLGI   ++ N++ L K+  R +++ ++LW  ++R
Sbjct: 374 SEGGLGIRKAQDMNKALLSKVGWRLIQDYHSLWARIMR 411


>gb|EOY24339.1| RNA-directed DNA polymerase (Reverse transcriptase),
           Polynucleotidyl transferase, Ribonuclease H fold-like
           protein [Theobroma cacao]
          Length = 616

 Score = 74.7 bits (182), Expect = 2e-11
 Identities = 33/97 (34%), Positives = 59/97 (60%)
 Frame = -2

Query: 293 LIQTTKSTVPFYSRQLIKLPASTIKELENMNRNFLWGDEKDKRELHTIDWHKIRKPNAKG 114
           L+++  ST+P+Y  Q++ +P  + K +E   +NFLWG + D + +H I  ++I +P  + 
Sbjct: 93  LVKSVLSTIPYYVMQIVSIPLDSCKRMERYCQNFLWGGDADHKRIHLIRCNQICRPKEER 152

Query: 113 GLGIHLLKETNQSFLIKLA*R*LKEKYALWVSLLRDK 3
            LG+  L   N +FL+KL  + +    +LWVS++R K
Sbjct: 153 SLGVKRLHVMNNAFLMKLLWQLVTRPKSLWVSIIRGK 189


>emb|CAB78008.1| putative protein [Arabidopsis thaliana] gi|7321072|emb|CAB82119.1|
           putative protein [Arabidopsis thaliana]
          Length = 947

 Score = 74.7 bits (182), Expect = 2e-11
 Identities = 34/100 (34%), Positives = 60/100 (60%)
 Frame = -2

Query: 302 RSILIQTTKSTVPFYSRQLIKLPASTIKELENMNRNFLWGDEKDKRELHTIDWHKIRKPN 123
           R  L ++  S +P ++   I LP ST++ L+ + R FL G   +K++LH + W ++  P 
Sbjct: 466 RLTLTKSVLSLIPIHTMSTISLPQSTLEGLDKLARVFLLGSSAEKKKLHLVAWDRVCLPK 525

Query: 122 AKGGLGIHLLKETNQSFLIKLA*R*LKEKYALWVSLLRDK 3
           ++GGLGI   K  N++ + K+  R + ++Y+LW  +LR K
Sbjct: 526 SEGGLGIRTSKCMNKALVSKVGWRLINDRYSLWARILRSK 565


>ref|XP_004233672.1| PREDICTED: uncharacterized protein LOC101266093 [Solanum
           lycopersicum]
          Length = 486

 Score = 73.9 bits (180), Expect = 4e-11
 Identities = 35/96 (36%), Positives = 56/96 (58%)
 Frame = -2

Query: 290 IQTTKSTVPFYSRQLIKLPASTIKELENMNRNFLWGDEKDKRELHTIDWHKIRKPNAKGG 111
           I++T +++P +  Q++KLP+S ++ +E   RNFLWG    K+ LH + W  +  P  +GG
Sbjct: 299 IRSTLNSLPNHIMQVVKLPSSVVQAMECYERNFLWGTTPIKKRLHLVCWSTVTNPKDQGG 358

Query: 110 LGIHLLKETNQSFLIKLA*R*LKEKYALWVSLLRDK 3
           LGI  L+  N + L   A R    +  LW  +LR+K
Sbjct: 359 LGIQDLRTKNNALLASTAWRLHNSQKKLWAMILRNK 394


>emb|CAN78577.1| hypothetical protein VITISV_020585 [Vitis vinifera]
          Length = 1848

 Score = 73.6 bits (179), Expect = 5e-11
 Identities = 38/103 (36%), Positives = 57/103 (55%)
 Frame = -2

Query: 311  KPARSILIQTTKSTVPFYSRQLIKLPASTIKELENMNRNFLWGDEKDKRELHTIDWHKIR 132
            K  R+ LI++T S +P Y   +++LP+S    LE + R+FLWG    +R+ H + W  + 
Sbjct: 1424 KGGRATLIRSTLSNLPIYYMSVLRLPSSVRSRLEQIQRDFLWGGGSLERKPHLVRWKVVC 1483

Query: 131  KPNAKGGLGIHLLKETNQSFLIKLA*R*LKEKYALWVSLLRDK 3
                KGGLGI  L   N++ L K   R   E+ ALW  ++R K
Sbjct: 1484 LSKKKGGLGIKCLSNLNKALLSKWNWRYANEREALWNQVIRGK 1526


>emb|CAN70399.1| hypothetical protein VITISV_023214 [Vitis vinifera]
          Length = 844

 Score = 73.6 bits (179), Expect = 5e-11
 Identities = 38/103 (36%), Positives = 58/103 (56%)
 Frame = -2

Query: 311 KPARSILIQTTKSTVPFYSRQLIKLPASTIKELENMNRNFLWGDEKDKRELHTIDWHKIR 132
           K  R+ LI++T S +P Y   L+ LP+S  + LE + R+FLWG    +R+ H + W  + 
Sbjct: 662 KGGRATLIRSTLSNLPIYLMSLLCLPSSVRRRLEKIQRDFLWGGGNLERKPHLVRWELVC 721

Query: 131 KPNAKGGLGIHLLKETNQSFLIKLA*R*LKEKYALWVSLLRDK 3
              +KGGLG+  L   N++ L K   R   E+ ALW  ++R K
Sbjct: 722 LSKSKGGLGVKCLSLLNKALLAKWNWRFANERKALWNQVIRGK 764


>gb|EMS67416.1| Protein argonaute 1D [Triticum urartu]
          Length = 1573

 Score = 72.8 bits (177), Expect = 9e-11
 Identities = 36/101 (35%), Positives = 59/101 (58%)
 Frame = -2

Query: 305  ARSILIQTTKSTVPFYSRQLIKLPASTIKELENMNRNFLWGDEKDKRELHTIDWHKIRKP 126
            A+  LI++    +P Y+  + K PAS  +EL  + RNF WGDE+++R+ H + W K+ KP
Sbjct: 1026 AKETLIKSVLQALPVYAMGIFKFPASLCEELAQIIRNFWWGDEENRRKTHWLAWDKMTKP 1085

Query: 125  NAKGGLGIHLLKETNQSFLIKLA*R*LKEKYALWVSLLRDK 3
             A+ G+G   L+  NQ+ L K A R +    +L   +++ K
Sbjct: 1086 KAEDGMGFRDLRLFNQALLAKQAWRLVVNPESLCARVIKAK 1126


>emb|CAN77369.1| hypothetical protein VITISV_033118 [Vitis vinifera]
          Length = 861

 Score = 72.8 bits (177), Expect = 9e-11
 Identities = 38/103 (36%), Positives = 56/103 (54%)
 Frame = -2

Query: 311 KPARSILIQTTKSTVPFYSRQLIKLPASTIKELENMNRNFLWGDEKDKRELHTIDWHKIR 132
           K  R+ LI++T S +P Y   L++LP+S  + LE + R+FLWG    +R+ H + W  + 
Sbjct: 605 KGGRATLIRSTLSNLPIYFMSLLRLPSSVRRRLEQIQRDFLWGGGNLERKPHLVRWEVVC 664

Query: 131 KPNAKGGLGIHLLKETNQSFLIKLA*R*LKEKYALWVSLLRDK 3
               KGGL +  L   N + L K   R   EK ALW  ++R K
Sbjct: 665 LSKKKGGLXVKCLSILNXALLFKWNWRYANEKEALWNQVIRGK 707


>gb|EMT20940.1| ABC transporter C family member 10 [Aegilops tauschii]
          Length = 2212

 Score = 72.4 bits (176), Expect = 1e-10
 Identities = 37/101 (36%), Positives = 57/101 (56%)
 Frame = -2

Query: 305 ARSILIQTTKSTVPFYSRQLIKLPASTIKELENMNRNFLWGDEKDKRELHTIDWHKIRKP 126
           A+  LI++    +  Y+  + K  AS  +EL  + RNF WGDE D+R++H + W K+  P
Sbjct: 59  AKESLIKSVLQALSTYAMSVFKFSASPCEELSQITRNFWWGDENDRRKIHWLAWDKLLMP 118

Query: 125 NAKGGLGIHLLKETNQSFLIKLA*R*LKEKYALWVSLLRDK 3
             KGG+G   L+  NQ+ L K A R ++   +L   LL+ K
Sbjct: 119 KEKGGMGFRDLRLFNQALLAKQAWRLIQFPDSLCARLLKAK 159


>emb|CAN69899.1| hypothetical protein VITISV_029782 [Vitis vinifera]
          Length = 760

 Score = 72.4 bits (176), Expect = 1e-10
 Identities = 39/111 (35%), Positives = 61/111 (54%)
 Frame = -2

Query: 335 RKNYCKSGKPARSILIQTTKSTVPFYSRQLIKLPASTIKELENMNRNFLWGDEKDKRELH 156
           ++ Y   G+  R+ LI +T S +P Y   L+ LP+S  + LE + R+FLWG    +R+ H
Sbjct: 330 KRQYLSKGR--RATLICSTLSNLPIYLMSLLCLPSSVRRRLEKIQRDFLWGGGNLERKPH 387

Query: 155 TIDWHKIRKPNAKGGLGIHLLKETNQSFLIKLA*R*LKEKYALWVSLLRDK 3
            + W  +    +KGGLG+  L   N++ L K   R   E+ ALW  ++R K
Sbjct: 388 LVRWELVCLSKSKGGLGVKSLSLLNKTLLAKWNWRFANEREALWNQVIRGK 438


Top