BLASTX nr result
ID: Catharanthus22_contig00039859
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Catharanthus22_contig00039859 (351 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_006595311.1| PREDICTED: uncharacterized protein LOC102668... 95 1e-17 ref|XP_006579213.1| PREDICTED: uncharacterized protein LOC102670... 92 9e-17 ref|XP_006378155.1| hypothetical protein POPTR_0010s04250g, part... 79 8e-13 ref|NP_197389.1| RNA-directed DNA polymerase (reverse transcript... 68 1e-09 ref|XP_006605006.1| PREDICTED: uncharacterized protein LOC102669... 67 3e-09 gb|AAC33226.1| putative non-LTR retroelement reverse transcripta... 67 3e-09 ref|XP_004309472.1| PREDICTED: putative ribonuclease H protein A... 65 9e-09 dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like ... 64 2e-08 ref|XP_006588078.1| PREDICTED: uncharacterized protein LOC102665... 64 2e-08 ref|XP_004252416.1| PREDICTED: uncharacterized protein LOC101244... 64 2e-08 ref|XP_006595463.1| PREDICTED: uncharacterized protein LOC102660... 64 2e-08 ref|XP_006588848.1| PREDICTED: uncharacterized protein LOC102662... 64 2e-08 gb|EMS58832.1| Alpha-galactosidase [Triticum urartu] 64 2e-08 ref|XP_004253503.1| PREDICTED: uncharacterized protein LOC101243... 64 2e-08 dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana] 63 3e-08 gb|AAC67331.1| putative non-LTR retroelement reverse transcripta... 63 5e-08 ref|XP_006584325.1| PREDICTED: uncharacterized protein LOC100811... 62 6e-08 ref|XP_004229147.1| PREDICTED: uncharacterized protein LOC101247... 62 8e-08 gb|AAD21699.1| Contains reverse transcriptase domain (rvt) PF|00... 62 8e-08 gb|ABD96948.1| hypothetical protein [Cleome spinosa] 62 8e-08 >ref|XP_006595311.1| PREDICTED: uncharacterized protein LOC102668530 [Glycine max] Length = 477 Score = 94.7 bits (234), Expect = 1e-17 Identities = 44/112 (39%), Positives = 58/112 (51%) Frame = +3 Query: 9 LESWRKNQGIDTAKAYEYLAPAVPNKVWTKIVWNPIIPPKFSVHMWCTILGRLPTADRLV 188 L SW N+ + KAY+Y+ P W +VWNP IP K S +W L T DR Sbjct: 289 LNSWNSNEQLLAGKAYDYIRGVKPAVNWNSVVWNPAIPSKMSFILWLATKNHLLTLDRAA 348 Query: 189 FLNIEGQCKLCKGPEESLAHLFFQCNFTRGIWESIREWAGLRRAMTTIQSCI 344 FLN C LC+ +S AHLFF C + +W +IR+W L R ++Q I Sbjct: 349 FLNKGLLCPLCRTKAKSHAHLFFSCRISLQVWANIRDWIPLHRQTISLQCTI 400 >ref|XP_006579213.1| PREDICTED: uncharacterized protein LOC102670237 [Glycine max] Length = 383 Score = 91.7 bits (226), Expect = 9e-17 Identities = 44/112 (39%), Positives = 57/112 (50%) Frame = +3 Query: 9 LESWRKNQGIDTAKAYEYLAPAVPNKVWTKIVWNPIIPPKFSVHMWCTILGRLPTADRLV 188 L SW N+ K Y+Y+ P W+ I+WNP+IP K S +W RL DR Sbjct: 260 LNSWGCNEQTLAGKMYDYIRGTRPVVHWSSIIWNPVIPSKMSFILWLATKNRLLALDRAA 319 Query: 189 FLNIEGQCKLCKGPEESLAHLFFQCNFTRGIWESIREWAGLRRAMTTIQSCI 344 FLN C LC ES AHLFF C + +W IR+W L+R ++Q I Sbjct: 320 FLNKGFLCPLCTNEAESHAHLFFSCRTSLRVWAHIRDWIPLKRQSISLQHSI 371 >ref|XP_006378155.1| hypothetical protein POPTR_0010s04250g, partial [Populus trichocarpa] gi|550329025|gb|ERP55952.1| hypothetical protein POPTR_0010s04250g, partial [Populus trichocarpa] Length = 112 Score = 78.6 bits (192), Expect = 8e-13 Identities = 38/113 (33%), Positives = 55/113 (48%) Frame = +3 Query: 9 LESWRKNQGIDTAKAYEYLAPAVPNKVWTKIVWNPIIPPKFSVHMWCTILGRLPTADRLV 188 L SW G TA AY + V + W +VW P+ + +W +LG+L T DRL Sbjct: 2 LSSWHSRPGSFTANAYHFFTYKVDHVQWASVVWEQWFLPRHNFSLW--LLGKLRTRDRLQ 59 Query: 189 FLNIEGQCKLCKGPEESLAHLFFQCNFTRGIWESIREWAGLRRAMTTIQSCIK 347 F++ + LC ES AHLFF C ++ +W R W +M T+ I+ Sbjct: 60 FISTDPLYPLCHNSSESHAHLFFSCAWSSSLWGKARYWLEFHSSMPTLNRVIR 112 >ref|NP_197389.1| RNA-directed DNA polymerase (reverse transcriptase)-related family protein [Arabidopsis thaliana] gi|332005241|gb|AED92624.1| RNA-directed DNA polymerase (reverse transcriptase)-related family protein [Arabidopsis thaliana] Length = 295 Score = 67.8 bits (164), Expect = 1e-09 Identities = 36/85 (42%), Positives = 48/85 (56%), Gaps = 2/85 (2%) Frame = +3 Query: 39 DTAKAYEYLAPAVPNKVWTKIVWNPIIPPKFSVHMWCTILGRLPTADRL--VFLNIEGQC 212 DT + +P VP W K+VW P+FS+ W + L RLPT DRL +NI Sbjct: 113 DTWEQIRVHSPTVP---WAKVVWFKEYIPRFSLITWMSFLERLPTRDRLRGWGMNIPSSW 169 Query: 213 KLCKGPEESLAHLFFQCNFTRGIWE 287 LC +E+ AHLFF+C+F+ IWE Sbjct: 170 VLCSNGDETHAHLFFECSFSLAIWE 194 >ref|XP_006605006.1| PREDICTED: uncharacterized protein LOC102669369 [Glycine max] Length = 1096 Score = 66.6 bits (161), Expect = 3e-09 Identities = 28/73 (38%), Positives = 42/73 (57%), Gaps = 3/73 (4%) Frame = +3 Query: 99 IVWNPIIPPKFSVHMWCTILGRLPTADRLVFLNI---EGQCKLCKGPEESLAHLFFQCNF 269 IVW +P K ++ W +L RLPT D L+ N+ +C LC +E++ HLFF C+F Sbjct: 927 IVWKVPVPSKVALFCWRLLLDRLPTKDNLIRRNVVINNSRCSLCDSCDENVVHLFFHCDF 986 Query: 270 TRGIWESIREWAG 308 + IW+ + W G Sbjct: 987 SNCIWKEVLSWIG 999 >gb|AAC33226.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 1529 Score = 66.6 bits (161), Expect = 3e-09 Identities = 32/77 (41%), Positives = 43/77 (55%), Gaps = 3/77 (3%) Frame = +3 Query: 78 PNKVWTKIVWNPIIPPKFSVHMWCTILGRLPTADRLVFLNIEGQ---CKLCKGPEESLAH 248 P + W K VW P PK+S +W T+ RL T DR+ N GQ C LC EE+ H Sbjct: 1350 PQQNWYKGVWFPYSTPKYSFLLWLTVQNRLSTGDRIKAWN-SGQLVTCTLCNNAEETRDH 1408 Query: 249 LFFQCNFTRGIWESIRE 299 LFF C +T +WE++ + Sbjct: 1409 LFFSCQYTSYVWEALTQ 1425 >ref|XP_004309472.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Fragaria vesca subsp. vesca] Length = 364 Score = 65.1 bits (157), Expect = 9e-09 Identities = 27/87 (31%), Positives = 49/87 (56%), Gaps = 2/87 (2%) Frame = +3 Query: 36 IDTAKAYEYLAPAVPNKVWTKIVWNPIIPPKFSVHMWCTILGRLPTADRLV--FLNIEGQ 209 + +A+++L P +P+ W K++W+ I P+ S+H W + GR+ + D L + + + Sbjct: 15 LSAKEAFQFLRPRLPSLDWGKLIWSKFIIPRISLHSWKVLRGRVLSEDLLQRRGIALASR 74 Query: 210 CKLCKGPEESLAHLFFQCNFTRGIWES 290 C LC ESL H+F C+F +W + Sbjct: 75 CVLCGRDGESLPHIFLTCSFAASLWNN 101 >dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like [Arabidopsis thaliana] Length = 1072 Score = 64.3 bits (155), Expect = 2e-08 Identities = 33/90 (36%), Positives = 43/90 (47%), Gaps = 2/90 (2%) Frame = +3 Query: 30 QGIDTAKAYEYLAPAVPNKVWTKIVWNPIIPPKFSVHMWCTILGRLPTADRLVFLNI--E 203 QG AK +E L P P K W K VW PK + + W L RLPT RLV + Sbjct: 890 QGFSAAKTWEVLRPRRPVKRWAKSVWFKGAVPKHAFNFWTAQLNRLPTRQRLVSWGLVSS 949 Query: 204 GQCKLCKGPEESLAHLFFQCNFTRGIWESI 293 +C LC E+ HL C+F+ +W + Sbjct: 950 AECCLCSFDTETRDHLLLLCDFSSQVWRMV 979 >ref|XP_006588078.1| PREDICTED: uncharacterized protein LOC102665107 [Glycine max] Length = 189 Score = 64.3 bits (155), Expect = 2e-08 Identities = 35/96 (36%), Positives = 48/96 (50%), Gaps = 7/96 (7%) Frame = +3 Query: 18 WRKNQG--IDTAKAYEYLAPAVP--NKVWTKIVWNPIIPPKFSVHMWCTILGRLPTADRL 185 W+ N+ T AY+ L P KIVWN +PP+ ++ W IL RLPT L Sbjct: 94 WKANRSGIYSTKSAYKLLKTTTPIMEANILKIVWNLNVPPRAAIFSWRLILDRLPTRGNL 153 Query: 186 VFLNIEGQ---CKLCKGPEESLAHLFFQCNFTRGIW 284 + N++ Q C LC +E + HL F C T G+W Sbjct: 154 LRRNVQMQDTSCPLCGNAQEEVDHLVFNCEMTLGLW 189 >ref|XP_004252416.1| PREDICTED: uncharacterized protein LOC101244351 [Solanum lycopersicum] Length = 169 Score = 64.3 bits (155), Expect = 2e-08 Identities = 29/76 (38%), Positives = 42/76 (55%), Gaps = 2/76 (2%) Frame = +3 Query: 87 VWTKIVWNPIIPPKFSVHMWCTILGRLPTADRLVF--LNIEGQCKLCKGPEESLAHLFFQ 260 +WT +++N PK MW + RL T DRL + +E C LC+ EE+ H+F Q Sbjct: 8 IWTCLMFNNAARPKAYFTMWIMMNQRLVTVDRLAKWGVEVEKTCVLCENEEETAEHVFIQ 67 Query: 261 CNFTRGIWESIREWAG 308 C+F RG+W + W G Sbjct: 68 CSFARGLWGRLLNWTG 83 >ref|XP_006595463.1| PREDICTED: uncharacterized protein LOC102660851 [Glycine max] Length = 199 Score = 63.9 bits (154), Expect = 2e-08 Identities = 31/89 (34%), Positives = 43/89 (48%), Gaps = 2/89 (2%) Frame = +3 Query: 90 WTKIVWNPIIPPKFSVHMWCTILGRLPTADRLVFLNI--EGQCKLCKGPEESLAHLFFQC 263 W ++ N P + +W GRLPT DRL + E CKLCK ESL HLFF+C Sbjct: 41 WRFLMHNNHARPLAKLTLWMVCHGRLPTMDRLHRFGMIQETICKLCKEKNESLTHLFFEC 100 Query: 264 NFTRGIWESIREWAGLRRAMTTIQSCIKW 350 T+ +W+ + W L + + W Sbjct: 101 GMTKTVWDQVLHWLNLNHRIKGWNEELDW 129 >ref|XP_006588848.1| PREDICTED: uncharacterized protein LOC102662740 [Glycine max] Length = 292 Score = 63.9 bits (154), Expect = 2e-08 Identities = 35/93 (37%), Positives = 47/93 (50%), Gaps = 6/93 (6%) Frame = +3 Query: 42 TAKAYEYLAPAV---PNKVWTKIVWNPIIPPKFSVHMWCTILGRLPTADRLVFLNIEGQ- 209 T AY L P++ P++ +I+W+ IPP+ +V W L RLPT L +I Q Sbjct: 104 TKSAYRLLMPSISPAPSRRNFQILWHLKIPPRAAVFSWRLFLDRLPTRGNLSRRSIPIQD 163 Query: 210 --CKLCKGPEESLAHLFFQCNFTRGIWESIREW 302 C LC E HLFF CN T+G+W W Sbjct: 164 IMCPLCGCQHEEAGHLFFHCNMTKGLWWESMRW 196 >gb|EMS58832.1| Alpha-galactosidase [Triticum urartu] Length = 561 Score = 63.9 bits (154), Expect = 2e-08 Identities = 38/103 (36%), Positives = 48/103 (46%), Gaps = 4/103 (3%) Frame = +3 Query: 15 SWR--KNQGIDTAKAYEYLAPAVPNKVWTKIVWNPIIPPKFSVHMWCTILGRLPTADRLV 188 SW+ K+ A AY + + +IVW PPK W I R+ TADRL Sbjct: 435 SWKFSKDGQYSAAMAYSAQFLGLMDTDMNQIVWKNWAPPKCKFFTWLVINNRIWTADRLQ 494 Query: 189 FLNIEG--QCKLCKGPEESLAHLFFQCNFTRGIWESIREWAGL 311 C LCK +ES AHL FQC FT +W ++ W GL Sbjct: 495 RRGWPNCHLCPLCKQVQESAAHLLFQCRFTVRVWGMLKSWLGL 537 >ref|XP_004253503.1| PREDICTED: uncharacterized protein LOC101243694 [Solanum lycopersicum] Length = 177 Score = 63.9 bits (154), Expect = 2e-08 Identities = 34/98 (34%), Positives = 48/98 (48%), Gaps = 4/98 (4%) Frame = +3 Query: 54 YEYLAPAVPNKVWTKIVWNPIIPPKFSVHMWCTILGRLPTADRLVFLNI--EGQCKLCKG 227 Y+YL W +++ PK +W + +L T DRL + + C LCKG Sbjct: 6 YDYLRGDQAKPEWKGLMFKNAARPKAIFTLWILLNRKLATIDRLAKWGVVHDPTCVLCKG 65 Query: 228 PEESLAHLFFQCNFTRGIWESIREWAGL--RRAMTTIQ 335 +ESL HLF QC++ +WE + WAG R T IQ Sbjct: 66 ADESLDHLFLQCHYAEEVWERVLTWAGFYNNRPRTWIQ 103 >dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana] Length = 1072 Score = 63.2 bits (152), Expect = 3e-08 Identities = 32/90 (35%), Positives = 43/90 (47%), Gaps = 2/90 (2%) Frame = +3 Query: 30 QGIDTAKAYEYLAPAVPNKVWTKIVWNPIIPPKFSVHMWCTILGRLPTADRLVFLNI--E 203 QG AK +E L P P K W + VW PK + + W L RLPT RLV + Sbjct: 890 QGFSAAKTWEVLRPRRPVKRWARSVWFKGAVPKHAFNFWTAQLNRLPTRQRLVSWGLVSS 949 Query: 204 GQCKLCKGPEESLAHLFFQCNFTRGIWESI 293 +C LC E+ HL C+F+ +W + Sbjct: 950 AECCLCSFDTETRDHLLLLCDFSSQVWRMV 979 >gb|AAC67331.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 1449 Score = 62.8 bits (151), Expect = 5e-08 Identities = 28/68 (41%), Positives = 39/68 (57%), Gaps = 2/68 (2%) Frame = +3 Query: 90 WTKIVWNPIIPPKFSVHMWCTILGRLPTADRLVFLN--IEGQCKLCKGPEESLAHLFFQC 263 W K VW PKFS +W + RL T D+++ N ++G C LC+ ES HLFF C Sbjct: 1282 WHKGVWFTHSTPKFSFCVWLAVYDRLSTGDKMLLWNRGLQGTCLLCRNATESRDHLFFSC 1341 Query: 264 NFTRGIWE 287 +F+ +WE Sbjct: 1342 SFSSEVWE 1349 >ref|XP_006584325.1| PREDICTED: uncharacterized protein LOC100811880 [Glycine max] Length = 621 Score = 62.4 bits (150), Expect = 6e-08 Identities = 35/93 (37%), Positives = 46/93 (49%), Gaps = 6/93 (6%) Frame = +3 Query: 42 TAKAYEYLAPA---VPNKVWTKIVWNPIIPPKFSVHMWCTILGRLPTADRLVFLNIEGQ- 209 T AY L P +P++ +I+W+ IPP+ +V W RLPT L NI Q Sbjct: 483 TKSAYSLLMPPSNPLPSRRNFQILWHLKIPPRAAVFSWRLFWDRLPTRGNLSRRNIPIQD 542 Query: 210 --CKLCKGPEESLAHLFFQCNFTRGIWESIREW 302 C LC +E HLFF C+ TRG+W W Sbjct: 543 TMCPLCGSQQEEAGHLFFHCSMTRGLWWESMVW 575 >ref|XP_004229147.1| PREDICTED: uncharacterized protein LOC101247059 [Solanum lycopersicum] Length = 133 Score = 62.0 bits (149), Expect = 8e-08 Identities = 34/98 (34%), Positives = 48/98 (48%), Gaps = 4/98 (4%) Frame = +3 Query: 54 YEYLAPAVPNKVWTKIVWNPIIPPKFSVHMWCTILGRLPTADRLVF--LNIEGQCKLCKG 227 Y+YL P VW +++ PK +W + +L T DRL L + C LC Sbjct: 6 YDYLRGEKPKPVWKCLMFKNTERPKAIFTLWILMYRKLATVDRLAKWGLTHDTACVLCTN 65 Query: 228 PEESLAHLFFQCNFTRGIWESIREWAGL--RRAMTTIQ 335 +ESL H+F QC++ +WE + W GL RA T Q Sbjct: 66 MDESLDHMFLQCHYVGEVWERVLAWDGLHNNRAKTWTQ 103 >gb|AAD21699.1| Contains reverse transcriptase domain (rvt) PF|00078 [Arabidopsis thaliana] Length = 1253 Score = 62.0 bits (149), Expect = 8e-08 Identities = 33/89 (37%), Positives = 43/89 (48%), Gaps = 2/89 (2%) Frame = +3 Query: 33 GIDTAKAYEYLAPAVPNKVWTKIVWNPIIPPKFSVHMWCTILGRLPTADRLVFLNI--EG 206 G A+ +E + P P K WTK VW PK + +MW + L RLPT RL + Sbjct: 903 GFSAARTWEAMRPKKPVKDWTKSVWFKGSVPKHAFNMWVSHLNRLPTRQRLAAWGVTTTT 962 Query: 207 QCKLCKGPEESLAHLFFQCNFTRGIWESI 293 C LC ES HL C F+ IW+ + Sbjct: 963 DCCLCSSRPESRDHLLLYCVFSAVIWKLV 991 >gb|ABD96948.1| hypothetical protein [Cleome spinosa] Length = 539 Score = 62.0 bits (149), Expect = 8e-08 Identities = 31/76 (40%), Positives = 41/76 (53%), Gaps = 4/76 (5%) Frame = +3 Query: 90 WTKIVWNPIIPPKFSVHMWCTILGRLPTADRLVFLNI--EGQCKLCKGPEESLAHLFFQC 263 W+ IVW P+ P+ + W +L RLPT DRL I + C+LC G +ES HLFF C Sbjct: 428 WSSIVWFPLAIPRHAFLHWQVMLFRLPTKDRLQQWGITSDATCRLCDGEDESHQHLFFGC 487 Query: 264 NFTRGIWESIRE--WA 305 + +W E WA Sbjct: 488 TYASHLWRHFGEVCWA 503