BLASTX nr result
ID: Cocculus22_contig00013024
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Cocculus22_contig00013024
         (1772 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|NP_197389.1| RNA-directed DNA polymerase (reverse transcript...    72   2e-18
gb|AAD32866.1|AC005489_4 F14N23.4 [Arabidopsis thaliana]               70   3e-18
ref|XP_004293076.1| PREDICTED: putative ribonuclease H protein A...    58   3e-16
gb|AAC63678.1| putative non-LTR retroelement reverse transcripta...    65   1e-14
gb|AAD26953.1| putative non-LTR retrolelement reverse transcript...    67   1e-14
ref|XP_004296004.1| PREDICTED: putative ribonuclease H protein A...    74   1e-13
gb|AAG50886.1|AC025294_24 hypothetical protein [Arabidopsis thal...    60   3e-13
ref|NP_567266.1| RNA-directed DNA polymerase (reverse transcript...    56   3e-13
gb|ABK28243.1| unknown [Arabidopsis thaliana]                          56   3e-13
gb|ABE65512.1| hypothetical protein At4g04650 [Arabidopsis thali...    56   3e-13
gb|AAC33226.1| putative non-LTR retroelement reverse transcripta...    63   5e-13
gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transc...    63   4e-12
dbj|BAA77394.1| SAE1-S9-protein [Brassica rapa]                        69   5e-12
gb|ABV21212.1| Ty1_Copia-element protein [Arabidopsis thaliana]        70   3e-11
gb|AAF98181.1|AC000107_4 F17F8.5 [Arabidopsis thaliana]                75   7e-11
dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like ...    58   1e-10
ref|XP_004305958.1| PREDICTED: uncharacterized protein LOC101308...    74   2e-10
gb|ABA99600.2| retrotransposon protein, putative, unclassified [...    48   2e-10
gb|EEE53448.1| hypothetical protein OsJ_36550 [Oryza sativa Japo...    48   2e-10
gb|ABW81051.1| tn7 reverse transcriptase [Arabidopsis lyrata sub...
61 3e-10 >ref|NP_197389.1| RNA-directed DNA polymerase (reverse transcriptase)-related family protein [Arabidopsis thaliana] gi|332005241|gb|AED92624.1| RNA-directed DNA polymerase (reverse transcriptase)-related family protein [Arabidopsis thaliana] Length = 295 Score = 72.4 bits (176), Expect(2) = 2e-18 Identities = 45/157 (28%), Positives = 67/157 (42%), Gaps = 12/157 (7%) Frame = +1 Query: 418 IGDGKDTKFWLDKWLPQDVLLNIVGADRRIFSRNPVNSSVQQVTRAGYWGNPPSSVSN-- 591 +G+G+ FW D W LL +GA R ++ V + +R G W P + N Sbjct: 16 MGNGESASFWYDAWTDFGQLLTFLGAAGPRQLRIRQDARVVEASRNGDWFLPAARSDNSQ 75 Query: 592 -------VRAVWHQFHGLARLGNDDQDDIIWTTTSSG---AFTLNSAWELIRAKESNFRW 741 + V H+ G QD +W + +F+ WE IR W Sbjct: 76 LFLAALTMAPVPHESRG--------QDSFLWRNAAGSYLPSFSSRDTWEQIRVHSPTVPW 127 Query: 742 FDLVWFPDRVPKFSLTTWKMLKVKLPTKVNLRRLGVN 852 +VWF + +P+FSL TW +LPT+ LR G+N Sbjct: 128 AKVVWFKEYIPRFSLITWMSFLERLPTRDRLRGWGMN 164 Score = 48.9 bits (115), Expect(2) = 2e-18 Identities = 34/107 (31%), Positives = 50/107 (46%), Gaps = 8/107 (7%) Frame = +3 Query: 867 LCWSGLEDISDLYFECSFSQAV*ASIKRLCWEFV--------PAALPRLREADWLLNACR 1022 LC +G E + L+FECSFS A+ WEF P LP + W+L Sbjct: 171 LCSNGDETHAHLFFECSFSLAI--------WEFFASKFRPSPPFGLPAA--SSWILQLPL 220 Query: 1023 GKHMESLIRKSCFTTTVHHIWLERNSRIFRNQVRSTHQLVKTINEAV 1163 H +++ K + V+H+W ERN+RIF + S L I+ + Sbjct: 221 RSHSTTIL-KLLLQSAVYHVWKERNARIFTSISSSASSLRLAIDRTM 266 >gb|AAD32866.1|AC005489_4 F14N23.4 [Arabidopsis thaliana] Length = 1161 Score = 69.7 bits (169), Expect(2) = 3e-18 Identities = 44/170 (25%), Positives = 70/170 (41%), Gaps = 6/170 (3%) Frame = +1 Query: 355 SWVLRGILKIRDLVKEKCLHIIGDGKDTKFWLDKWLPQDVLLNIVGADRRIFSRNPVNSS 534 +W+ R + K+R + + +G G FW D W LL++ G + + P+NS Sbjct: 852 NWIWRKLCKLRPFARPFIICEVGSGVTASFWHDNWTDHGPLLHLTGPAGPLLAGLPLNSV 911 Query: 535 VQQVTRAGYWGNPPSSVSN--VRAVWHQFHGLARLGNDDQDD-IIWTT---TSSGAFTLN 696 V+ R W S N + + A L + DD +W S F+ Sbjct: 912 VRDALRDDTWRISSSRSRNPVITLLQRVLPSAASLIDCPHDDTYLWKIGHHAPSNRFSTA 971 Query: 697 
SAWELIRAKESNFRWFDLVWFPDRVPKFSLTTWKMLKVKLPTKVNLRRLG 846 W ++ ++ W VWF D VPK + W + +L T+ LRR G Sbjct: 972 DTWSYLQPSSTSVLWHKAVWFKDHVPKQAFICWVVAHNRLHTRDRLRRWG 1021 Score = 50.8 bits (120), Expect(2) = 3e-18 Identities = 36/103 (34%), Positives = 47/103 (45%), Gaps = 6/103 (5%) Frame = +3 Query: 861 CSLCWSGLEDISDLYFECSFSQAV*ASIKRLCWEFVPAAL-----PRLREAD-WLLNACR 1022 C LC E L+F C FS + W F AL P+ W L A R Sbjct: 1028 CVLCNDLDESREHLFFRCQFSSEI--------WSFFMRALNLNPPPQFMHCLLWTLTASR 1079 Query: 1023 GKHMESLIRKSCFTTTVHHIWLERNSRIFRNQVRSTHQLVKTI 1151 +++ +LI K F +V+ IW ERN RI N VR H ++K I Sbjct: 1080 DRNI-TLITKLLFHASVYFIWRERNLRIHSNSVRPAHLIIKEI 1121 >ref|XP_004293076.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Fragaria vesca subsp. vesca] Length = 487 Score = 58.2 bits (139), Expect(3) = 3e-16 Identities = 46/165 (27%), Positives = 74/165 (44%), Gaps = 5/165 (3%) Frame = +1 Query: 367 RGILKIRDLVKEKCLHIIGDGKDTKFWLDKWLPQDVLLNIVGADRRIFSRNPV--NSSVQ 540 +GIL R+L+ + I+G+G++ KFW W + LLN++ +I RN + N +V Sbjct: 36 KGILDARNLILKGMRWIVGNGENIKFWTFNWAYEFPLLNLI----QINDRNAIDLNETVA 91 Query: 541 QVTRAGYWGNPPSSVSNVRAVWHQFHGLARLGNDDQDDIIWTTTSSGAFTLNSAWELIRA 720 G W + Q G+ L ++ D+ IW + G F++ SA L Sbjct: 92 DYIFNGCWNIQKLLQVLDQETVKQITGIPILVSNQCDECIWAPPTDGRFSVKSATWLQYQ 151 Query: 721 KESNFRWFDL---VWFPDRVPKFSLTTWKMLKVKLPTKVNLRRLG 846 + DL VW D K L W +L+ +L T+ L + G Sbjct: 152 NLEKHQQSDLINKVWKLDVPLKVKLFGWLLLRGRLKTRDRLSKFG 196 Score = 52.0 bits (123), Expect(3) = 3e-16 Identities = 34/133 (25%), Positives = 67/133 (50%), Gaps = 3/133 (2%) Frame = +2 Query: 1235 GCLNQRRSICVNGVLPLRVFT-LHSGGSMSARVCGYGGVIRDEMGEIIIGFSGSVAQGSV 1411 G ++Q S + + P F ++ GS+ R G V R+ G +I+ + + ++ Sbjct: 309 GGISQTTSSTIRWLPPHNNFIKINFDGSVQGRSAAGGFVFRNSDGNVILAAAKGLGSTTI 368 Query: 1412 -LLEAIGLCNGIKLAKEQNFTKLKAVSDSKILIMIVNKQCPSPWYIKHIVHDIW*LCSNM 1588 EA L + + A+++ + ++ DSK++I +N + PW ++ IV DI + ++ Sbjct: 369 PTAEATALRDSLVKARDRGYMNVQVEGDSKLVIDAINGKLSPPWRLQKIVQDIRTIATSF 428 Query: 1589 -EVSFSHCWREVN 1624 V F+H +RE N Sbjct: 
429 SSVCFNHVYREAN 441 Score = 23.5 bits (49), Expect(3) = 3e-16 Identities = 26/102 (25%), Positives = 35/102 (34%), Gaps = 6/102 (5%) Frame = +3 Query: 861 CSLCWSGLEDISDLYFECSFSQAV*ASIKRLCWEFVPAALPRLREADW---LLNACRGKH 1031 C LC S E L+ C F+ V F A + L DW L R Sbjct: 203 CPLCDSDNETADHLFGHCDFTTEV----------FRLAGISALM--DWHEGYLKVLREMF 250 Query: 1032 MESLIRKSCFTTTV---HHIWLERNSRIFRNQVRSTHQLVKT 1148 + K F + IW RN IFR+ + + + T Sbjct: 251 INQPYDKFLFAKVLIIYWQIWKARNDTIFRDVITTATNVAAT 292 >gb|AAC63678.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 1216 Score = 65.5 bits (158), Expect(2) = 1e-14 Identities = 42/165 (25%), Positives = 69/165 (41%), Gaps = 9/165 (5%) Frame = +1 Query: 355 SWVLRGILKIRDLVKEKCLHIIGDGKDTKFWLDKWLPQDVLLNIVGADRRIFSRNPVNSS 534 SW+ R +LK R++ K C + +G +T FW D W + L+N+ GA R ++ Sbjct: 648 SWIWRRLLKHREVAKSFCKIEVNNGVNTSFWFDNWSEKGPLINLTGA------RGAIDMG 701 Query: 535 V-QQVTRAGYWGNPPSSVSNVRAVWHQFHGL-----ARLGNDDQDDIIW---TTTSSGAF 687 + + +T A W V + ++F + + +D I+W F Sbjct: 702 ISRHMTLAEAWSRRRRKRHRVE-ILNEFEEILLQKYQHRNIELEDAILWRGKEDVFKARF 760 Query: 688 TLNSAWELIRAKESNFRWFDLVWFPDRVPKFSLTTWKMLKVKLPT 822 + W IR + W VWF PKFS W ++ +L T Sbjct: 761 STKDTWNHIRTSSNQRAWHKGVWFAHATPKFSFCAWLAIRNRLST 805 Score = 43.1 bits (100), Expect(2) = 1e-14 Identities = 24/101 (23%), Positives = 48/101 (47%) Frame = +3 Query: 861 CSLCWSGLEDISDLYFECSFSQAV*ASIKRLCWEFVPAALPRLREADWLLNACRGKHMES 1040 C C S +E L+F+C +S + SI + ++ + ++ + ++S Sbjct: 820 CVFCSSPMETRDHLFFQCCYSSEIWTSIAKNVYK--DRFSTKWSAVVNYISDSQPDRIQS 877 Query: 1041 LIRKSCFTTTVHHIWLERNSRIFRNQVRSTHQLVKTINEAV 1163 + + F ++H IW ERNSR + RS L++ I++ + Sbjct: 878 FLSRYTFQVSIHSIWRERNSRRHGEKSRSASNLIRQIDKTI 918 >gb|AAD26953.1| putative non-LTR retrolelement reverse transcriptase [Arabidopsis thaliana] Length = 323 Score = 67.0 bits (162), Expect(2) = 1e-14 Identities = 44/174 (25%), Positives = 73/174 (41%), Gaps = 8/174 (4%) Frame = +1 Query: 355 SWVLRGILKIRDLVKEKCLHIIGDGKDTKFWLDKWLPQDVLLNIVGADRRIFSRNPVNSS 534 
SW+ R + K+R L + + IG G+ FW D W Q L+++ G + P+N+ Sbjct: 20 SWIWRKLCKLRPLARPFLVCEIGSGETASFWQDNWTGQGPLIDLTGTNGPRSVGMPLNAV 79 Query: 535 VQQVTRAGYWGNPPS-----SVSNVRAVWHQFHGLARLGNDDQDDIIWTT---TSSGAFT 690 V+ R W S S++ +++V + +DD W S F+ Sbjct: 80 VRDALRGDNWWLSSSRSRNPSIALLKSVLPSSESMIECQHDDV--YKWKPDHHAPSNIFS 137 Query: 691 LNSAWELIRAKESNFRWFDLVWFPDRVPKFSLTTWKMLKVKLPTKVNLRRLGVN 852 + W + W VWF DR+PK + W +L T+ L + G+N Sbjct: 138 ASKTWTALNPDGVLVPWQKSVWFKDRIPKHAFICWVAAWKRLHTRDRLTQWGLN 191 Score = 41.2 bits (95), Expect(2) = 1e-14 Identities = 33/108 (30%), Positives = 50/108 (46%), Gaps = 6/108 (5%) Frame = +3 Query: 858 LCSLCWSGLEDISDLYFECSFSQAV*ASIKRLCWEF--VPAAL--PRLREAD--WLLNAC 1019 +C LC E L+F+C FS + W F + A + P L WL +A Sbjct: 195 VCVLCNVVDETHDHLFFQCQFSNEI--------WSFFMIRAGMTPPHLFGPILLWLKSAS 246 Query: 1020 RGKHMESLIRKSCFTTTVHHIWLERNSRIFRNQVRSTHQLVKTINEAV 1163 K++ SLI K F +V+ IW ERN RI R+ ++K + + + Sbjct: 247 SSKNL-SLIIKLLFQASVYLIWRERNCRIHTTHSRTPPTIIKEVQQLI 293 >ref|XP_004296004.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Fragaria vesca subsp. 
vesca] Length = 751 Score = 73.9 bits (180), Expect(2) = 1e-13 Identities = 46/167 (27%), Positives = 75/167 (44%), Gaps = 1/167 (0%) Frame = +1 Query: 352 SSWVLRGILKIRDLVKEKCLHIIGDGKDTKFWLDKWLPQDVLLNI-VGADRRIFSRNPVN 528 +S V G+ ++ L+ E IIGDG FW DKWL ++ + +G+ + +N Sbjct: 345 TSSVWHGLKRVLPLLFEHSRWIIGDGNSILFWSDKWLHSSIIQQLNMGSLSHL-----LN 399 Query: 529 SSVQQVTRAGYWGNPPSSVSNVRAVWHQFHGLARLGNDDQDDIIWTTTSSGAFTLNSAWE 708 S V W P + Q + + D +IW +SSG F+ + +E Sbjct: 400 SRVADFIWDQQWALPSHFSNLFPDCAKQILEIPLPNTPESDILIWEHSSSGIFSFSDGYE 459 Query: 709 LIRAKESNFRWFDLVWFPDRVPKFSLTTWKMLKVKLPTKVNLRRLGV 849 L+R W VW P++S+ W++ +KLPT L+R G+ Sbjct: 460 LVRPYFEKLDWASSVWHSFIPPRYSVLAWRIFHLKLPTDDQLQRRGI 506 Score = 31.2 bits (69), Expect(2) = 1e-13 Identities = 25/108 (23%), Positives = 48/108 (44%), Gaps = 8/108 (7%) Frame = +3 Query: 852 LKLCSLC-WSGLEDISDLYFECSFSQAV*ASIKRLCWEFVP----AALPRLREADWLLNA 1016 + +C LC +S EDI L+ CSF+Q + W+++ +LP + L ++ Sbjct: 509 VSVCQLCSFSHTEDIPHLFVNCSFAQHI--------WQWLAYYFGTSLPSSGSLNDLWSS 560 Query: 1017 CRGKHMESLIRKSCFTT---TVHHIWLERNSRIFRNQVRSTHQLVKTI 1151 GK ++ F + + IW N F N+ S ++ +++ Sbjct: 561 VTGKAFSPQLKNIWFASCLFALMAIWKSHNKLRFDNKQPSLMRVFRSV 608 >gb|AAG50886.1|AC025294_24 hypothetical protein [Arabidopsis thaliana] Length = 629 Score = 60.1 bits (144), Expect(2) = 3e-13 Identities = 42/168 (25%), Positives = 68/168 (40%), Gaps = 12/168 (7%) Frame = +1 Query: 355 SWVLRGILKIRDLVKEKCLHIIGDGKDTKFWLDKWLPQDVLLNIVGADRRI---FSRNPV 525 SW+ + +LK R+ K + +G T FW D W L+++ G +I SRN Sbjct: 332 SWMWKKMLKYRETAKPFSRVEVNNGARTSFWFDNWSGMGHLMDVTGQRGQIDLGISRNK- 390 Query: 526 NSSVQQVTRAGYWGNP------PSSVSNVRAVWHQFHGLARLGNDDQDDIIWTTTSS--- 678 T A W N ++++ A +Q + L +D +W Sbjct: 391 -------TVAEAWSNRRRRKHRTEQLNDIEAALNQKYQTRNLLREDAT--LWRGKGDVFK 441 Query: 679 GAFTLNSAWELIRAKESNFRWFDLVWFPDRVPKFSLTTWKMLKVKLPT 822 +F+ W +R K + W+ VWF PK+ TW L+ +L T Sbjct: 442 TSFSTKDTWNQVRKKSNEVAWYKGVWFSHSTPKYQFCTWLALRNRLST 489 Score = 43.5 bits (101), Expect(2) = 3e-13 Identities = 24/106 (22%), Positives = 
48/106 (45%), Gaps = 5/106 (4%) Frame = +3 Query: 861 CSLCWSGLEDISDLYFECSFSQAV*ASIKRLCWEFVPAALPRLREADW-----LLNACRG 1025 C+ C + +E L+F CS++ A+ +I + L DW ++ + Sbjct: 504 CTFCSTSIETRDHLFFSCSYASAIWTAIAK-------NVLQHRFSTDWQTIVNYISETQT 556 Query: 1026 KHMESLIRKSCFTTTVHHIWLERNSRIFRNQVRSTHQLVKTINEAV 1163 + S + + F TVH +W ERN R + R++ L+ +++ + Sbjct: 557 DRIRSFLSRYIFQLTVHTVWKERNDRRHGEEPRTSANLISWMDKQI 602 >ref|NP_567266.1| RNA-directed DNA polymerase (reverse transcriptase)-related family protein [Arabidopsis thaliana] gi|5732057|gb|AAD48956.1|AF149414_5 contains similarity to a family of Arabidopsis thaliana predicted proteins, which have similarity to reverse transcriptases; see T14P8.10 (GB:AF069298) [Arabidopsis thaliana] gi|7267223|emb|CAB80830.1| AT4g04650 [Arabidopsis thaliana] gi|332657009|gb|AEE82409.1| RNA-directed DNA polymerase (reverse transcriptase)-related family protein [Arabidopsis thaliana] Length = 332 Score = 56.2 bits (134), Expect(2) = 3e-13 Identities = 38/166 (22%), Positives = 68/166 (40%), Gaps = 8/166 (4%) Frame = +1 Query: 379 KIRDLVKEKCLHIIGDGKDTKFWLDKWLPQDVLLNIVGADRRIFSRNPVNSSVQQVTRAG 558 K+R + + + +G G KFW D W+ L+ ++G P+++ V+ R Sbjct: 3 KLRVVARPFIVCEVGSGVTAKFWHDNWIGLGPLIEVIGPLGPRTVGLPIDAVVRDALRGT 62 Query: 559 YWGNPPSSVSN-----VRAVWHQFHGLARLGNDDQDDIIWTT---TSSGAFTLNSAWELI 714 W S N ++ + + GL +DD +W T S F+ W + Sbjct: 63 SWWIASSRSRNPIIVQLKNLLPEAQGLLDCQHDDS--FLWKTDLHAPSNRFSAPRTWSAL 120 Query: 715 RAKESNFRWFDLVWFPDRVPKFSLTTWKMLKVKLPTKVNLRRLGVN 852 + W VWF + VPK + W + +L T+ L+ G++ Sbjct: 121 HPQSHTVPWHKAVWFKNHVPKHAFICWVVAWNRLHTRDRLQNWGLS 166 Score = 47.4 bits (111), Expect(2) = 3e-13 Identities = 35/103 (33%), Positives = 51/103 (49%), Gaps = 6/103 (5%) Frame = +3 Query: 861 CSLCWSGLEDISDLYFECSFSQAV*ASIKRLCWEFVPAAL---PRLREAD---WLLNACR 1022 C LC + + + L+FEC FS V W F A+ P + D WLL+ R Sbjct: 171 CLLCNAHDDSRAHLFFECQFSGVV--------WRFFTASTNLNPPAQLMDCLNWLLSPSR 222 Query: 1023 GKHMESLIRKSCFTTTVHHIWLERNSRIFRNQVRSTHQLVKTI 1151 K++ +IR + F + V+ IW ERN R+ RST ++K I 
Sbjct: 223 EKNICLIIRLA-FHSCVYAIWRERNQRLHSGVSRSTESILKDI 264 >gb|ABK28243.1| unknown [Arabidopsis thaliana] Length = 297 Score = 56.2 bits (134), Expect(2) = 3e-13 Identities = 38/166 (22%), Positives = 68/166 (40%), Gaps = 8/166 (4%) Frame = +1 Query: 379 KIRDLVKEKCLHIIGDGKDTKFWLDKWLPQDVLLNIVGADRRIFSRNPVNSSVQQVTRAG 558 K+R + + + +G G KFW D W+ L+ ++G P+++ V+ R Sbjct: 3 KLRVVARPFIVCEVGSGVTAKFWHDNWIGLGPLIEVIGPLGPRTVGLPIDAVVRDALRGT 62 Query: 559 YWGNPPSSVSN-----VRAVWHQFHGLARLGNDDQDDIIWTT---TSSGAFTLNSAWELI 714 W S N ++ + + GL +DD +W T S F+ W + Sbjct: 63 SWWIASSRSRNPIIVQLKNLLPEAQGLLDCQHDDS--FLWKTDLHAPSNRFSAPRTWSAL 120 Query: 715 RAKESNFRWFDLVWFPDRVPKFSLTTWKMLKVKLPTKVNLRRLGVN 852 + W VWF + VPK + W + +L T+ L+ G++ Sbjct: 121 HPQSHTVPWHKAVWFKNHVPKHAFICWVVAWNRLHTRDRLQNWGLS 166 Score = 47.4 bits (111), Expect(2) = 3e-13 Identities = 35/103 (33%), Positives = 51/103 (49%), Gaps = 6/103 (5%) Frame = +3 Query: 861 CSLCWSGLEDISDLYFECSFSQAV*ASIKRLCWEFVPAAL---PRLREAD---WLLNACR 1022 C LC + + + L+FEC FS V W F A+ P + D WLL+ R Sbjct: 171 CLLCNAHDDSRAHLFFECQFSGVV--------WRFFTASTNLNPPAQLMDCLNWLLSPSR 222 Query: 1023 GKHMESLIRKSCFTTTVHHIWLERNSRIFRNQVRSTHQLVKTI 1151 K++ +IR + F + V+ IW ERN R+ RST ++K I Sbjct: 223 EKNICLIIRLA-FHSCVYAIWRERNQRLHSGVSRSTESILKDI 264 >gb|ABE65512.1| hypothetical protein At4g04650 [Arabidopsis thaliana] Length = 296 Score = 56.2 bits (134), Expect(2) = 3e-13 Identities = 38/166 (22%), Positives = 68/166 (40%), Gaps = 8/166 (4%) Frame = +1 Query: 379 KIRDLVKEKCLHIIGDGKDTKFWLDKWLPQDVLLNIVGADRRIFSRNPVNSSVQQVTRAG 558 K+R + + + +G G KFW D W+ L+ ++G P+++ V+ R Sbjct: 3 KLRVVARPFIVCEVGSGVTAKFWHDNWIGLGPLIEVIGPLGPRTVGLPIDAVVRDALRGT 62 Query: 559 YWGNPPSSVSN-----VRAVWHQFHGLARLGNDDQDDIIWTT---TSSGAFTLNSAWELI 714 W S N ++ + + GL +DD +W T S F+ W + Sbjct: 63 SWWIASSRSRNPIIVQLKNLLPEAQGLLDCQHDDS--FLWKTDLHAPSNRFSAPRTWSAL 120 Query: 715 RAKESNFRWFDLVWFPDRVPKFSLTTWKMLKVKLPTKVNLRRLGVN 852 + W VWF + VPK + W + +L T+ L+ G++ Sbjct: 121 HPQSHTVPWHKAVWFKNHVPKHAFICWVVAWNRLHTRDRLQNWGLS 166 Score 
= 47.4 bits (111), Expect(2) = 3e-13 Identities = 35/103 (33%), Positives = 51/103 (49%), Gaps = 6/103 (5%) Frame = +3 Query: 861 CSLCWSGLEDISDLYFECSFSQAV*ASIKRLCWEFVPAAL---PRLREAD---WLLNACR 1022 C LC + + + L+FEC FS V W F A+ P + D WLL+ R Sbjct: 171 CLLCNAHDDSRAHLFFECQFSGVV--------WRFFTASTNLNPPAQLMDCLNWLLSPSR 222 Query: 1023 GKHMESLIRKSCFTTTVHHIWLERNSRIFRNQVRSTHQLVKTI 1151 K++ +IR + F + V+ IW ERN R+ RST ++K I Sbjct: 223 EKNICLIIRLA-FHSCVYAIWRERNQRLHSGVSRSTESILKDI 264 >gb|AAC33226.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 1529 Score = 62.8 bits (151), Expect(2) = 5e-13 Identities = 41/160 (25%), Positives = 70/160 (43%), Gaps = 4/160 (2%) Frame = +1 Query: 355 SWVLRGILKIRDLVKEKCLHIIGDGKDTKFWLDKWLPQDVLLNIVGADRRIFSRNPVNSS 534 SW+ + +LK R+L K + +G T FW D W LL+I G R I P+ ++ Sbjct: 1224 SWMWKKLLKYRELAKSMHKVEVRNGSSTSFWYDHWSHLGRLLDITGTRRVIDLGIPLETN 1283 Query: 535 VQQVTRAGYWGNPPSSVSNVRAVWHQFHGLARLGNDDQDDI-IWTTTSSG---AFTLNSA 702 ++ V R +++ N + + L + + DI +W + + F Sbjct: 1284 LETVLRTHQHRQHRAAIYN--RINAEIQRLQQQEREAGPDISLWRSLKNDFNKRFITKVT 1341 Query: 703 WELIRAKESNFRWFDLVWFPDRVPKFSLTTWKMLKVKLPT 822 W +R + W+ VWFP PK+S W ++ +L T Sbjct: 1342 WNNVRTHQPQQNWYKGVWFPYSTPKYSFLLWLTVQNRLST 1381 Score = 40.0 bits (92), Expect(2) = 5e-13 Identities = 30/111 (27%), Positives = 48/111 (43%), Gaps = 6/111 (5%) Frame = +3 Query: 849 QLKLCSLCWSGLEDISDLYFECSFSQAV*ASIKRLCWEFVPAALPRLREA-DW-----LL 1010 QL C+LC + E L+F C ++ V WE + L + DW LL Sbjct: 1392 QLVTCTLCNNAEETRDHLFFSCQYTSYV--------WEALTQRLLSTNYSRDWNRLFTLL 1443 Query: 1011 NACRGKHMESLIRKSCFTTTVHHIWLERNSRIFRNQVRSTHQLVKTINEAV 1163 + + F +++HIW ERN+R T++L+K I++ V Sbjct: 1444 CTSNLPRDHLFLFRYVFQASIYHIWRERNARRHGEISSPTNRLIKLIDKTV 1494 >gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transcriptase [Brassica napus] Length = 1214 Score = 63.2 bits (152), Expect(2) = 4e-12 Identities = 40/173 (23%), Positives = 76/173 (43%), Gaps = 7/173 (4%) Frame = +1 Query: 355 
SWVLRGILKIRDLVKEKCLHIIGDGKDTKFWLDKWLPQDVLLNIVGADRRIFSRNPVNSS 534 SW+ + IL +R L K +G+G+ +W D W L+ +GA + ++ Sbjct: 918 SWIWKAILGLRPLAKRFLRGAVGNGQLLSYWYDHWSNLGPLIEAIGASGPQLTGIHESAV 977 Query: 535 VQQVTRAGYWGNPPS-----SVSNVRAVWHQFHGLARLGNDDQDDIIWTT--TSSGAFTL 693 V + + + W P + S++N+R+ + A G+ +D W +SS +F+ Sbjct: 978 VTEASSSTGWILPSARTRNASLANLRSTL--LNSPAPSGDRGEDTYTWYIEGSSSTSFSS 1035 Query: 694 NSAWELIRAKESNFRWFDLVWFPDRVPKFSLTTWKMLKVKLPTKVNLRRLGVN 852 WE +R +++ W VW+ +PK++ W +LP + N Sbjct: 1036 KLTWECLRQRDTTKLWAAAVWYKGCIPKYAFNFWVAHLNRLPVRARTTHWSTN 1088 Score = 36.6 bits (83), Expect(2) = 4e-12 Identities = 31/122 (25%), Positives = 48/122 (39%), Gaps = 7/122 (5%) Frame = +3 Query: 840 SWSQLKLCSLCWSGLEDISDLYFECSFSQAV*ASIKRLCWEFVPAALPR---LRE----A 998 S ++ LC +C E L+ C+ L W+ V A R RE Sbjct: 1086 STNRPSLCCVCQRETETRDHLFIHCTLGS--------LIWQQVLARFGRSQMFREWKDII 1137 Query: 999 DWLLNACRGKHMESLIRKSCFTTTVHHIWLERNSRIFRNQVRSTHQLVKTINEAVVSCI* 1178 +W+L+ ++K T + HIW ERNSR+ S + K I+ ++ I Sbjct: 1138 EWMLS--NQGSFSGTLKKLAVQTAIFHIWKERNSRLHSAMSASHTAIFKQIDRSIRDSIL 1195 Query: 1179 VR 1184 R Sbjct: 1196 AR 1197 >dbj|BAA77394.1| SAE1-S9-protein [Brassica rapa] Length = 255 Score = 68.9 bits (167), Expect(2) = 5e-12 Identities = 43/164 (26%), Positives = 70/164 (42%), Gaps = 5/164 (3%) Frame = +1 Query: 373 ILKIRDLVKEKCLHIIGDGKDTKFWLDKWLPQDVLLNIVGADRRIFSRNPVNSSVQQVTR 552 +L+++D + + IGDG+ FW D W L ++ G+ R P+N+SV + Sbjct: 1 MLQLKDSLSDYLRCGIGDGRTASFWFDYWTELGPLFSLFGSTGPRQLRIPLNASVADAVQ 60 Query: 553 AGYWGNPPSSVSNVRAVWHQFHGLA-RLGNDDQDDIIWTTTSSGAFT----LNSAWELIR 717 G+W PP+ + + N+D D W +G FT WEL+ Sbjct: 61 NGHWYLPPARSEFAETLQIILSTITPPTDNNDIDIFYWRGGPTGGFTNKFSSKVTWELLC 120 Query: 718 AKESNFRWFDLVWFPDRVPKFSLTTWKMLKVKLPTKVNLRRLGV 849 W V F + VP+ + TW + +LPT+ L G+ Sbjct: 121 VPSPEVTWHSTVGFKEEVPRCTFITWLAMLERLPTRDRLISWGL 164 Score = 30.8 bits (68), Expect(2) = 5e-12 Identities = 13/24 (54%), Positives = 16/24 (66%) Frame = +3 Query: 861 CSLCWSGLEDISDLYFECSFSQAV 932 C LC G+E S L+FECSF+ V Sbjct: 170 
CVLCNGGVESHSHLFFECSFAVGV 193 >gb|ABV21212.1| Ty1_Copia-element protein [Arabidopsis thaliana] Length = 438 Score = 70.1 bits (170), Expect(2) = 3e-11 Identities = 52/177 (29%), Positives = 73/177 (41%), Gaps = 6/177 (3%) Frame = +1 Query: 355 SWVLRGILKIRDLVKEKCLHIIGDGKDTKFWLDKWLPQDVLLNIVGADRRIFSRNPVNSS 534 SW R +L++R L + IG+G FW D W P LL +G+D R P+ S Sbjct: 192 SWNWRCLLRLRPLASKFLFCSIGNGLTASFWADSWTPFGPLLTFIGSDGPRNQRIPLCSK 251 Query: 535 VQQVTRAGYWGNPPSSVSNVRAVWHQFHGLARLGNDD--QDDIIWTTTSSGAFTLNSA-- 702 V V W P SN + H F + +D +W + +SA Sbjct: 252 VADVVNGNRWLLPSPRSSNALNL-HAFLTTLSIPLQPLVEDSYLWKVENCSDIGFSSAHT 310 Query: 703 WELIRAKESNFRWFDLVWFPDRVPKFSLTTWKMLKVKLPTKVNLRRLG--VN*NCAL 867 W +R KE W VWF PK + W + +L TK+ + G V+ CAL Sbjct: 311 WNALRHKEVEKPWVSSVWFKGVTPKNAFNMWITHQDRLRTKLRMIAWGFLVSPVCAL 367 Score = 26.9 bits (58), Expect(2) = 3e-11 Identities = 20/76 (26%), Positives = 31/76 (40%) Frame = +3 Query: 858 LCSLCWSGLEDISDLYFECSFSQAV*ASIKRLCWEFVPAALPRLREADWLLNACRGKHME 1037 +C+LC G E L C FS +V A +++ P + + L R K Sbjct: 364 VCALCQVGFETRDHLMLSCDFSVSVWALVRQRIG--TPLTIFQNWSELILWTQNRSKAAP 421 Query: 1038 SLIRKSCFTTTVHHIW 1085 S +RK V+ +W Sbjct: 422 STLRKLVAQAVVYALW 437 >gb|AAF98181.1|AC000107_4 F17F8.5 [Arabidopsis thaliana] Length = 872 Score = 75.5 bits (184), Expect = 7e-11 Identities = 55/190 (28%), Positives = 85/190 (44%), Gaps = 10/190 (5%) Frame = +1 Query: 355 SWVLRGILKIRDLVKEKCLHIIGDGKDTKFWLDKWLPQDVLLNIVGADRRIFSRNPVNSS 534 SW+ R ILKIRD+ K +G+G+ FW D W L++ VG I P +S Sbjct: 574 SWIWRKILKIRDVAKSFSRVEVGNGESASFWYDHWSAHGRLIDTVGDKGTIDLGIPREAS 633 Query: 535 VQQV-TRAGYWGNPPSSVSNVRAV--WHQFHGLARLGNDDQDDIIWTTTSS---GAFTLN 696 V TR + S ++ + + + + H +D +D ++W + F+ Sbjct: 634 VADAWTRRSRRRHRTSLLNEIEEMMAYQRIHH-----SDAEDTVLWRGKNDVFKPHFSTR 688 Query: 697 SAWELIRAKESNFRWFDLVWFPDRVPKFSLTTWKMLKVKLPTKVNLRRL----GVN*NCA 864 W LI+A S W VWF PK++L TW + +LPT + + V+ NC Sbjct: 689 DTWHLIKATSSTVSWHKGVWFRHATPKYALCTWLAIHNRLPTGDRMLKWNSSGSVSGNCV 748 Query: 865 LFVGQAWKTL 894 L + KTL Sbjct: 749 LCTNNS-KTL 757 
>dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis thaliana] Length = 1223 Score = 58.2 bits (139), Expect(2) = 1e-10 Identities = 38/164 (23%), Positives = 66/164 (40%), Gaps = 8/164 (4%) Frame = +1 Query: 355 SWVLRGILKIRDLVKEKCLHIIGDGKDTKFWLDKWLPQDVLLNIVG----ADRRIFSRNP 522 SW+ + +LK R++ K +G+GK T FW D W LL G D I R Sbjct: 927 SWIWKKLLKYREVAKTLSKVEVGNGKQTSFWYDNWSDLGQLLERTGDRGLIDLGISRRMT 986 Query: 523 VNSSVQQVTRAGYWGNPPSSVSN-VRAVWHQFHGLARLGNDDQDDIIWTTTSS---GAFT 690 V + + + + + + + ++ W + +D ++W S F+ Sbjct: 987 VEEAWTNRRQRRHRNDVYNVIEDALKKSWD-------TRTETEDKVLWRGKSDVFRTTFS 1039 Query: 691 LNSAWELIRAKESNFRWFDLVWFPDRVPKFSLTTWKMLKVKLPT 822 W R+ + W ++WF PK+S +W +LPT Sbjct: 1040 TRDTWHHTRSTSARVPWHKVIWFSHATPKYSFCSWLAAHGRLPT 1083 Score = 36.6 bits (83), Expect(2) = 1e-10 Identities = 24/101 (23%), Positives = 41/101 (40%) Frame = +3 Query: 861 CSLCWSGLEDISDLYFECSFSQAV*ASIKRLCWEFVPAALPRLREADWLLNACRGKHMES 1040 C C LE L+F CSF+ + + R F + + + +E Sbjct: 1098 CIFCQGTLETRDHLFFTCSFTSVIWVDLARGI--FKTQYTSHWQSIIEAITNSQHHRVEW 1155 Query: 1041 LIRKSCFTTTVHHIWLERNSRIFRNQVRSTHQLVKTINEAV 1163 +R+ F T++ +W ERN R + QLV I++ + Sbjct: 1156 FLRRYVFQATIYIVWRERNGRRHGEPPNTASQLVGWIDKQI 1196 >ref|XP_004305958.1| PREDICTED: uncharacterized protein LOC101308407 [Fragaria vesca subsp. 
vesca] Length = 177 Score = 74.3 bits (181), Expect = 2e-10 Identities = 47/137 (34%), Positives = 64/137 (46%) Frame = +1 Query: 343 PRDSSWVLRGILKIRDLVKEKCLHIIGDGKDTKFWLDKWLPQDVLLNIVGADRRIFSRNP 522 P+ SW R +LKIRD ++ HIIGDGK T FW D W P LL +G I S P Sbjct: 36 PQICSWNWRKLLKIRDFIRPSIKHIIGDGKSTYFWHDYWHPFGPLLPRLGPGAMINSGIP 95 Query: 523 VNSSVQQVTRAGYWGNPPSSVSNVRAVWHQFHGLARLGNDDQDDIIWTTTSSGAFTLNSA 702 N+ V + + W P S+ S + V GL + +D IW ++SG F+ S Sbjct: 96 SNALVSSIVKGESWCWPLSTNSAILRVASNVEGLIP-NSSCKDSCIWLPSTSGIFSTAST 154 Query: 703 WELIRAKESNFRWFDLV 753 + I W +V Sbjct: 155 MDQIWIHHPVVDWAKIV 171 >gb|ABA99600.2| retrotransposon protein, putative, unclassified [Oryza sativa Japonica Group] Length = 1432 Score = 47.8 bits (112), Expect(2) = 2e-10 Identities = 36/114 (31%), Positives = 50/114 (43%), Gaps = 7/114 (6%) Frame = +3 Query: 843 WSQLKLCSLCWSGLEDISDLYFECSFSQAV*ASIKRLCWEFVPAALPRLREAD----WLL 1010 W C LC ED L+ C FS+ V ++ W V LP W++ Sbjct: 1312 WPHEDHCVLCQREQEDCLHLFITCDFSRRVWQLLR--AWVNVDFPLPGQAGEGLIGWWMV 1369 Query: 1011 NACRGKHMESLIRK---SCFTTTVHHIWLERNSRIFRNQVRSTHQLVKTINEAV 1163 C H S R S F T IW ERN+RIF+++ ++ QL++ I E V Sbjct: 1370 ARC---HFRSSYRSNFDSVFALTCWLIWKERNARIFQHKQQTVEQLLEDIKEEV 1420 Score = 46.2 bits (108), Expect(2) = 2e-10 Identities = 38/156 (24%), Positives = 63/156 (40%), Gaps = 13/156 (8%) Frame = +1 Query: 418 IGDGKDTKFWLDKWL-------PQDVLLNIVGADRRIFSRNPVNSSVQQVTRAGYW-GNP 573 +G+G+DTKFW D WL L+ +G N +V+Q W + Sbjct: 1167 VGNGRDTKFWSDNWLGGGSIAWRWPTLVTFIGR---------TNLTVEQGLLGHRWVRDL 1217 Query: 574 PSSVSNVR-----AVWHQFHGLARLGNDDQDDIIWTTTSSGAFTLNSAWELIRAKESNFR 738 S+S++ +W + + + ++D I W T G+F+++SA++L Sbjct: 1218 QGSLSDIAMMQYFQLWDEIQQINL--SQEEDTICWKLTPDGSFSVSSAYDLFYMAREISP 1275 Query: 739 WFDLVWFPDRVPKFSLTTWKMLKVKLPTKVNLRRLG 846 L+W K W K K T NL + G Sbjct: 1276 CGQLIWQTKAPSKVRFFLWLATKGKCLTADNLAKRG 1311 >gb|EEE53448.1| hypothetical protein OsJ_36550 [Oryza sativa Japonica Group] Length = 394 Score = 47.8 bits (112), Expect(2) = 2e-10 Identities = 36/114 (31%), 
Positives = 50/114 (43%), Gaps = 7/114 (6%) Frame = +3 Query: 843 WSQLKLCSLCWSGLEDISDLYFECSFSQAV*ASIKRLCWEFVPAALPRLREAD----WLL 1010 W C LC ED L+ C FS+ V ++ W V LP W++ Sbjct: 274 WPHEDHCVLCQREQEDCLHLFITCDFSRRVWQLLR--AWVNVDFPLPGQAGEGLIGWWMV 331 Query: 1011 NACRGKHMESLIRK---SCFTTTVHHIWLERNSRIFRNQVRSTHQLVKTINEAV 1163 C H S R S F T IW ERN+RIF+++ ++ QL++ I E V Sbjct: 332 ARC---HFRSSYRSNFDSVFALTCWLIWKERNARIFQHKQQTVEQLLEDIKEEV 382 Score = 46.2 bits (108), Expect(2) = 2e-10 Identities = 38/156 (24%), Positives = 63/156 (40%), Gaps = 13/156 (8%) Frame = +1 Query: 418 IGDGKDTKFWLDKWL-------PQDVLLNIVGADRRIFSRNPVNSSVQQVTRAGYW-GNP 573 +G+G+DTKFW D WL L+ +G N +V+Q W + Sbjct: 129 VGNGRDTKFWSDNWLGGGSIAWRWPTLVTFIGR---------TNLTVEQGLLGHRWVRDL 179 Query: 574 PSSVSNVR-----AVWHQFHGLARLGNDDQDDIIWTTTSSGAFTLNSAWELIRAKESNFR 738 S+S++ +W + + + ++D I W T G+F+++SA++L Sbjct: 180 QGSLSDIAMMQYFQLWDEIQQINL--SQEEDTICWKLTPDGSFSVSSAYDLFYMAREISP 237 Query: 739 WFDLVWFPDRVPKFSLTTWKMLKVKLPTKVNLRRLG 846 L+W K W K K T NL + G Sbjct: 238 CGQLIWQTKAPSKVRFFLWLATKGKCLTADNLAKRG 273 >gb|ABW81051.1| tn7 reverse transcriptase [Arabidopsis lyrata subsp. 
lyrata] Length = 441 Score = 60.8 bits (146), Expect(2) = 3e-10 Identities = 44/175 (25%), Positives = 73/175 (41%), Gaps = 6/175 (3%) Frame = +1 Query: 355 SWVLRGILKIRDLVKEKCLHIIGDGKDTKFWLDKWLPQDVLLNIVGADRRIFSRNPVNSS 534 SW+ R +LK R L + I +GK FW D W P L+ I G I ++++ Sbjct: 146 SWMWRKLLKYRHLASGFTQYEIRNGKGVSFWHDNWSPLGPLIAISGTRGCIDMGIDIHAT 205 Query: 535 VQQVTRAGYWGNPPSSVSNVRAVWHQFHGLARLGNDDQDDIIWTTTSSG----AFTLNSA 702 V + + ++ + A Q L G + +D++ G +F+ Sbjct: 206 VAEALTHRRRRHRADHLNQMEA---QLEELRTKGLVETEDVVLWKGKGGRFKPSFSTKET 262 Query: 703 WELIRAKESNFRWFDLVWFPDRVPKFSLTTWKMLKVKLPT--KVNLRRLGVN*NC 861 W R ++ W+ +WF PK+S TW K +L T ++ GVN +C Sbjct: 263 WADTREQKPRNEWYQGIWFSHATPKYSFITWLATKNRLSTGDRMMSWNAGVNLSC 317 Score = 32.7 bits (73), Expect(2) = 3e-10 Identities = 28/107 (26%), Positives = 45/107 (42%), Gaps = 6/107 (5%) Frame = +3 Query: 861 CSLCWSGLEDISDLYFECSFSQAV*ASIKRLCWEFVPAALPRLREADW------LLNACR 1022 C C E + L+F C +S+ V + + L R DW L + Sbjct: 317 CVFCQEQTETRNHLFFTCRYSREVWSGL-------TSKLLTRHYSTDWTTILKLLTDKTL 369 Query: 1023 GKHMESLIRKSCFTTTVHHIWLERNSRIFRNQVRSTHQLVKTINEAV 1163 G + L+R + F V+ IW ERNSR + + L+K +++ V Sbjct: 370 GNNRLFLLRYA-FQILVYSIWKERNSRRHGEEPLPSALLLKRLDKEV 415