BLASTX nr result
ID: Cephaelis21_contig00020932
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00020932 (1046 letters)

Database: ./nr
23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits)  Value

gb|AAB82639.1| putative non-LTR retroelement reverse transcripta...   162   1e-37
gb|AAD20714.1| putative non-LTR retroelement reverse transcripta...   154   5e-35
emb|CAB40051.1| putative protein [Arabidopsis thaliana] gi|72677...   153   8e-35
gb|AAC33961.1| contains similarity to reverse trancriptase (Pfam...   152   1e-34
pir||T00833 RNA-directed DNA polymerase homolog T13L16.7 - Arabi...   152   2e-34

>gb|AAB82639.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana]
Length = 1374

Score = 162 bits (411), Expect = 1e-37
Identities = 109/351 (31%), Positives = 179/351 (50%), Gaps = 3/351 (0%)
Frame = +1

Query: 1 CRGLGGPSTISQLREELRFHLPNIVFLCETKKK-SFVHSVCTKLKMLSMWRVVEPRGLSG 177
C+G+G T+ LRE + P ++FLCETKK+ +++ +V L + VEP G SG
Sbjct: 8 CQGVGNTPTVRHLREIRGLYFPEVIFLCETKKRRNYLENVVGHLGFFDL-HTVEPIGKSG 66

Query: 178 GLMIGWSDRIVVKQVVLNEFCIQ--IEFEDSEIQQVCWGIFVYASIDKAVRRSQWVFLQQ 351
GL + W D + +K + ++ I + ++D E C +Y +A R W L +
Sbjct: 67 GLALMWKDSVQIKVLQSDKRLIDALLIWQDKEFYLTC----IYGEPVQAERGELWERLTR 122

Query: 352 QRHFWGKFWFLGGDLNDIRDRSEKQGGTARSEGSFRGFRSFIQDMEMAEIQFNGALWTWA 531
W L GD N++ D SEK GG AR E S FR + + E+ +G ++W
Sbjct: 123 LGLSRSGPWMLTGDFNELVDPSEKIGGPARKESSCLEFRQMLNSCGLWEVNHSGYQFSWY 182

Query: 532 NNRDGEGYVEERLDRFFASPEWLLQSPRAVVHHILKQTSDHAMLILDSQPPSKPRARRFL 711
NR+ E V+ RLDR A+ W+ P+A ++ K SDH+ LI + + + F
Sbjct: 183 GNRNDE-LVQCRLDRTVANQAWMELFPQAKATYLQKICSDHSPLINNLVGDNWRKWAGFK 241

Query: 712 FDKRLLNLPQCEETVAVAWNKAQIGTPMFQVVSRIKACRVALLKLKGNAQLNSGKAIQAI 891
+DKR + ++ + W++ T ++ +I +CR + K K ++ +S IQ +
Sbjct: 242 YDKRWVQREGFKDLLCNFWSQQSTKTNAL-MMEKIASCRREISKWKRVSKPSSAVRIQEL 300

Query: 892 KSKMMAMQEKGGQRDWQEWANLKLQLGEEYRKEEVYWCQKSRVQWLKEGDQ 1044
+ K+ A K D +E A LK +L +EY EE +W +KSR+ W++ GD+
Sbjct: 301 QFKLDA-ATKQIPFDRRELARLKKELSQEYNNEEQFWQEKSRIMWMRNGDR 350

>gb|AAD20714.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana]
Length = 1750

Score = 154 bits (388), Expect = 5e-35
Identities = 106/352 (30%), Positives = 167/352 (47%), Gaps = 6/352 (1%)
Frame = +1

Query: 7 GLGGPSTISQLREELRFHLPNIVFLCETKKKSFVHSVCTKLKMLSMWRVVE--PRGLSGG 180
G+G P T S+L R + +I+FL ET + VC L V+ P G SGG
Sbjct: 391 GIGMPLTQSRLFRLFRMYNYDILFLVETLNQC--DKVCKLAYDLGFPNVITQPPNGRSGG 448

Query: 181 LMIGWSDRIVVKQVVLNEFCIQ--IEFEDSEIQQVCWGIFVYASIDKAVRRSQWVFLQQQ 354
L + W + + + + +E I + F + C VY ++ R W L+
Sbjct: 449 LALMWKNNVSLSLISQDERLIDSHVTFNNKSFYLSC----VYGHPTQSERHQLWQTLEHI 504

Query: 355 RHFWGKFWFLGGDLNDIRDRSEKQGGTARSEGSFRGFRSFIQDMEMAEIQFNGALWTWAN 534
W L GD N+I +EK GG R E +FR FR+ + ++ +++ G ++W
Sbjct: 505 SDNRNAEWLLVGDFNEILSNAEKIGGPMREEWTFRNFRNMVSHCDIEDMRSKGDRFSWVG 564

Query: 535 NRDGEGYVEERLDRFFASPEWLLQSPRAVVHHILKQTSDHAMLILDSQPPSKPRARRFLF 714
R V+ LDR F + W P A + + SDH +++ R++ F F
Sbjct: 565 ERHTHT-VKCCLDRVFINSAWTATFPYAEIEFLDFTGSDHKPVLVHFNESFPRRSKLFRF 623

Query: 715 DKRLLNLPQCEETVAVAW--NKAQIGTPMFQVVSRIKACRVALLKLKGNAQLNSGKAIQA 888
D RL+++P + V +W N+ TP + RI +CR A+ +LK + LNS + I+
Sbjct: 624 DNRLIDIPTFKRIVQTSWRTNRNSRSTP---ITERISSCRQAMARLKHASNLNSEQRIKK 680

Query: 889 IKSKMMAMQEKGGQRDWQEWANLKLQLGEEYRKEEVYWCQKSRVQWLKEGDQ 1044
++S + E + D Q L+ L + + EE+YW QKSR QW+KEGDQ
Sbjct: 681 LQSSLNRAMESTRRVDRQLIPQLQESLAKAFSDEEIYWKQKSRNQWMKEGDQ 732

>emb|CAB40051.1| putative protein [Arabidopsis thaliana] gi|7267781|emb|CAB81184.1| putative protein [Arabidopsis thaliana]
Length = 1294

Score = 153 bits (386), Expect = 8e-35
Identities = 105/347 (30%), Positives = 164/347 (47%)
Frame = +1

Query: 4 RGLGGPSTISQLREELRFHLPNIVFLCETKKKSFVHSVCTKLKMLSMWRVVEPRGLSGGL 183
+G+G P T SQL + +++FL ET K V S + P+G SGGL
Sbjct: 370 KGIGVPLTQSQLSNLCKVFKFDVLFLIETLNKCEVISNLASVLGFPNVITQPPQGHSGGL 429

Query: 184 MIGWSDRIVVKQVVLNEFCIQIEFEDSEIQQVCWGIFVYASIDKAVRRSQWVFLQQQRHF 363
+ W D + + + ++ I + + I + VY ++ R S W +
Sbjct: 430 ALLWKDSVRLSNLYQDDRHIDVHISINNIN--FYLSRVYGHPCQSERHSLWTHFENLSKT 487

Query: 364 WGKFWFLGGDLNDIRDRSEKQGGTARSEGSFRGFRSFIQDMEMAEIQFNGALWTWANNRD 543
W L GD N+I +EK GG R E +FRGFR+ + ++ +I+ G ++W R
Sbjct: 488 RNDPWILIGDFNEILSNNEKIGGPQRDEWTFRGFRNMVSTCDLKDIRSIGDRFSWVGERH 547

Query: 544 GEGYVEERLDRFFASPEWLLQSPRAVVHHILKQTSDHAMLILDSQPPSKPRARRFLFDKR 723
V+ LDR F + E P A + + SDH L L + + R F FDKR
Sbjct: 548 SHT-VKCCLDRAFINSEGAFLFPFAELEFLEFTGSDHKPLFLSLEKTETRKMRPFRFDKR 606

Query: 724 LLNLPQCEETVAVAWNKAQIGTPMFQVVSRIKACRVALLKLKGNAQLNSGKAIQAIKSKM 903
LL +P + V WNKA I + +++ CR A+ KLK + LNS I +++ +
Sbjct: 607 LLEVPHFKTYVKAGWNKA-INGQRKHLPDQVRTCRQAMAKLKHKSNLNSRIRINQLQAAL 665

Query: 904 MAMQEKGGQRDWQEWANLKLQLGEEYRKEEVYWCQKSRVQWLKEGDQ 1044
+ + + ++++ +L YR EE YW QKSR QW+KEGD+
Sbjct: 666 DKAMSSVNRTERRTISHIQRELTVAYRDEERYWQQKSRNQWMKEGDR 712

>gb|AAC33961.1| contains similarity to reverse trancriptase (Pfam: rvt.hmm, score: 42.57) [Arabidopsis thaliana]
Length = 1662

Score = 152 bits (384), Expect = 1e-34
Identities = 105/346 (30%), Positives = 163/346 (47%)
Frame = +1

Query: 7 GLGGPSTISQLREELRFHLPNIVFLCETKKKSFVHSVCTKLKMLSMWRVVEPRGLSGGLM 186
G+G P T SQL + +++FL ET K V S + P+G SGGL
Sbjct: 391 GIGVPLTQSQLSNLCKVFKFDVLFLIETLNKCEVISNLASVLGFPNVITQPPQGHSGGLA 450

Query: 187 IGWSDRIVVKQVVLNEFCIQIEFEDSEIQQVCWGIFVYASIDKAVRRSQWVFLQQQRHFW 366
+ W D + + + ++ I + + I + VY ++ R S W +
Sbjct: 451 LLWKDSVRLSNLYQDDRHIDVHISINNIN--FYLSRVYGHPCQSERHSLWTHFENLSKTR 508

Query: 367 GKFWFLGGDLNDIRDRSEKQGGTARSEGSFRGFRSFIQDMEMAEIQFNGALWTWANNRDG 546
W L GD N+I +EK GG R E +FRGFR+ + ++ +I+ G ++W R
Sbjct: 509 NDPWILIGDFNEILSNNEKIGGPQRDEWTFRGFRNMVSTCDLKDIRSIGDRFSWVGERHS 568

Query: 547 EGYVEERLDRFFASPEWLLQSPRAVVHHILKQTSDHAMLILDSQPPSKPRARRFLFDKRL 726
V+ LDR F + E P A + + SDH L L + + R F FDKRL
Sbjct: 569 HT-VKCCLDRAFINSEGAFLFPFAELEFLEFTGSDHKPLFLSLEKTETRKMRPFRFDKRL 627

Query: 727 LNLPQCEETVAVAWNKAQIGTPMFQVVSRIKACRVALLKLKGNAQLNSGKAIQAIKSKMM 906
L +P + V WNKA I + +++ CR A+ KLK + LNS I +++ +
Sbjct: 628 LEVPHFKTYVKAGWNKA-INGQRKHLPDQVRTCRQAMAKLKHKSNLNSRIRINQLQAALD 686

Query: 907 AMQEKGGQRDWQEWANLKLQLGEEYRKEEVYWCQKSRVQWLKEGDQ 1044
+ + + ++++ +L YR EE YW QKSR QW+KEGD+
Sbjct: 687 KAMSSVNRTERRTISHIQRELTVAYRDEERYWQQKSRNQWMKEGDR 732

>pir||T00833 RNA-directed DNA polymerase homolog T13L16.7 - Arabidopsis thaliana (fragment)
Length = 1365

Score = 152 bits (383), Expect = 2e-34
Identities = 113/350 (32%), Positives = 169/350 (48%), Gaps = 2/350 (0%)
Frame = +1

Query: 1 CRGLGGPSTISQLREELRFHLPNIVFLCETKK-KSFVHSVCTKLKMLSMWRVVEPRGLSG 177
C+GL P TI L+E + H P+I+FL ETK + FV+ V L VEP G SG
Sbjct: 7 CQGLRNPWTIRYLKEMKKDHFPDILFLMETKNSQDFVYKVFCWLGY-DFIHTVEPEGRSG 65

Query: 178 GLMIGWSDRIVVKQVVLNEFCIQIEFEDSEIQQVCWGIFVYASIDKAVRRSQWVFLQQQR 357
GL I W + ++ + ++ + ++ S +V + VY +R W L
Sbjct: 66 GLAIFWKSHLEIEFLYADKNLMDLQV--SSRNKVWFISCVYGLPVTHMRPKLWEHLNSIG 123

Query: 358 HFWGKFWFLGGDLNDIRDRSEKQGGTARSEGSFRGFRSFIQDMEMAEIQFNGALWTWANN 537
+ W L GD NDIR EK GG RS SF+ F + + M E+ G +TW N
Sbjct: 124 LKRAEAWCLIGDFNDIRSNDEKLGGPRRSPSSFQCFEHMLLNCSMHELGSTGNSFTWGGN 183

Query: 538 RDGEGYVEERLDRFFASPEWLLQSPRAVVHHILKQTSDHAMLILDSQPPSKPRARRFLFD 717
R+ + +V+ +LDR F +P W P A + K SDH +++ ++ +F +D
Sbjct: 184 RNDQ-WVQCKLDRCFGNPAWFSIFPNAHQWFLEKFGSDHRPVLVKFTNDNELFRGQFRYD 242

Query: 718 KRLLNLPQCEETVAVAWNKAQIGTPMFQVVSRIKACRVALLKLKGNAQLNSGKAIQAIKS 897
KRL + P C E + +WN A S I+ CR A+ K ++ N+ I+ ++
Sbjct: 243 KRLDDDPYCIEVIHRSWNSAMSQGTHSSFFSLIE-CRRAISVWKHSSDTNAQSRIKRLRK 301

Query: 898 KMMAMQEKGGQRDWQEWANLKLQLGEEYRKEEVYWCQKSRVQWLKEGDQ 1044
+ A EK Q W +K QL Y EE++W QKSR +WL GD+
Sbjct: 302 DLDA--EKSIQIPCWPRIEYIKDQLSLAYGDEELFWRQKSRQKWLAGGDK 349