BLASTX nr result
ID: Cocculus23_contig00002058
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Cocculus23_contig00002058
         (1659 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value

ref|NP_197389.1| RNA-directed DNA polymerase (reverse transcript...  72   1e-20
gb|AAD32866.1|AC005489_4 F14N23.4 [Arabidopsis thaliana]             67   2e-19
gb|AAC63678.1| putative non-LTR retroelement reverse transcripta...  63   3e-16
gb|AAD26953.1| putative non-LTR retrolelement reverse transcript...  65   3e-16
ref|XP_004296004.1| PREDICTED: putative ribonuclease H protein A...  74   7e-16
ref|XP_004293076.1| PREDICTED: putative ribonuclease H protein A...  57   1e-15
gb|AAG50886.1|AC025294_24 hypothetical protein [Arabidopsis thal...  59   3e-15
ref|NP_567266.1| RNA-directed DNA polymerase (reverse transcript...  56   3e-15
gb|ABK28243.1| unknown [Arabidopsis thaliana]                        56   4e-15
gb|ABE65512.1| hypothetical protein At4g04650 [Arabidopsis thali...  56   4e-15
gb|AAC33226.1| putative non-LTR retroelement reverse transcripta...  62   3e-14
dbj|BAA77394.1| SAE1-S9-protein [Brassica rapa]                      69   1e-13
gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transc...  62   2e-13
gb|EPS72636.1| hypothetical protein M569_02121, partial [Genlise...  71   3e-13
gb|ABV21212.1| Ty1_Copia-element protein [Arabidopsis thaliana]      68   7e-13
ref|XP_004148188.1| PREDICTED: uncharacterized protein LOC101204...  54   9e-13
dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like ...  57   4e-12
gb|EAZ38029.1| hypothetical protein OsJ_22373 [Oryza sativa Japo...  52   7e-12
gb|ABA99600.2| retrotransposon protein, putative, unclassified [...
52 1e-11 gb|EEE53448.1| hypothetical protein OsJ_36550 [Oryza sativa Japo... 52 2e-11 >ref|NP_197389.1| RNA-directed DNA polymerase (reverse transcriptase)-related family protein [Arabidopsis thaliana] gi|332005241|gb|AED92624.1| RNA-directed DNA polymerase (reverse transcriptase)-related family protein [Arabidopsis thaliana] Length = 295 Score = 72.4 bits (176), Expect(2) = 1e-20 Identities = 45/157 (28%), Positives = 67/157 (42%), Gaps = 12/157 (7%) Frame = +3 Query: 258 IGDGKDTKFWLDKWLPQDVLLNIVGADRRIFSRNPVNSSVQQVTRAGYWGNPPSSVSN-- 431 +G+G+ FW D W LL +GA R ++ V + +R G W P + N Sbjct: 16 MGNGESASFWYDAWTDFGQLLTFLGAAGPRQLRIRQDARVVEASRNGDWFLPAARSDNSQ 75 Query: 432 -------VRAVWHQFHGLARLGNDDQDDIIWTTTSSG---AFTLNSAWELIRAKESNFRW 581 + V H+ G QD +W + +F+ WE IR W Sbjct: 76 LFLAALTMAPVPHESRG--------QDSFLWRNAAGSYLPSFSSRDTWEQIRVHSPTVPW 127 Query: 582 FDLVWFPDRVPKFSLTTWKMLKVKLPTKVNLRRLGVN 692 +VWF + +P+FSL TW +LPT+ LR G+N Sbjct: 128 AKVVWFKEYIPRFSLITWMSFLERLPTRDRLRGWGMN 164 Score = 56.2 bits (134), Expect(2) = 1e-20 Identities = 33/99 (33%), Positives = 50/99 (50%) Frame = +2 Query: 707 LCWSGLEDISNLYFECSFSQAVWASIKRLCWEFVPAALPRLREADWLLNACRGKHMESLI 886 LC +G E ++L+FECSFS A+W P LP + W+L H +++ Sbjct: 171 LCSNGDETHAHLFFECSFSLAIWEFFASKFRPSPPFGLP--AASSWILQLPLRSHSTTIL 228 Query: 887 KKLCFTTTVHHIWLERNSRIFRNQVRSTHQLVKTINEAV 1003 KL + V+H+W ERN+RIF + S L I+ + Sbjct: 229 -KLLLQSAVYHVWKERNARIFTSISSSASSLRLAIDRTM 266 >gb|AAD32866.1|AC005489_4 F14N23.4 [Arabidopsis thaliana] Length = 1161 Score = 67.4 bits (163), Expect(2) = 2e-19 Identities = 43/170 (25%), Positives = 69/170 (40%), Gaps = 6/170 (3%) Frame = +3 Query: 195 SWVLSGILKIRDLVKEKCLHIIGDGKDTKFWLDKWLPQDVLLNIVGADRRIFSRNPVNSS 374 +W+ + K+R + + +G G FW D W LL++ G + + P+NS Sbjct: 852 NWIWRKLCKLRPFARPFIICEVGSGVTASFWHDNWTDHGPLLHLTGPAGPLLAGLPLNSV 911 Query: 375 VQQVTRAGYWGNPPSSVSN--VRAVWHQFHGLARLGNDDQDDI-IWTT---TSSGAFTLN 536 V+ R W S N + + A L + DD +W S F+ Sbjct: 912 VRDALRDDTWRISSSRSRNPVITLLQRVLPSAASLIDCPHDDTYLWKIGHHAPSNRFSTA 971 Query: 537 
SAWELIRAKESNFRWFDLVWFPDRVPKFSLTTWKMLKVKLPTKVNLRRLG 686 W ++ ++ W VWF D VPK + W + +L T+ LRR G Sbjct: 972 DTWSYLQPSSTSVLWHKAVWFKDHVPKQAFICWVVAHNRLHTRDRLRRWG 1021 Score = 57.0 bits (136), Expect(2) = 2e-19 Identities = 35/97 (36%), Positives = 48/97 (49%) Frame = +2 Query: 701 CSLCWSGLEDISNLYFECSFSQAVWASIKRLCWEFVPAALPRLREADWLLNACRGKHMES 880 C LC E +L+F C FS +W+ R P + W L A R +++ + Sbjct: 1028 CVLCNDLDESREHLFFRCQFSSEIWSFFMRALNLNPPPQF--MHCLLWTLTASRDRNI-T 1084 Query: 881 LIKKLCFTTTVHHIWLERNSRIFRNQVRSTHQLVKTI 991 LI KL F +V+ IW ERN RI N VR H ++K I Sbjct: 1085 LITKLLFHASVYFIWRERNLRIHSNSVRPAHLIIKEI 1121 >gb|AAC63678.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 1216 Score = 63.2 bits (152), Expect(2) = 3e-16 Identities = 41/165 (24%), Positives = 68/165 (41%), Gaps = 9/165 (5%) Frame = +3 Query: 195 SWVLSGILKIRDLVKEKCLHIIGDGKDTKFWLDKWLPQDVLLNIVGADRRIFSRNPVNSS 374 SW+ +LK R++ K C + +G +T FW D W + L+N+ GA R ++ Sbjct: 648 SWIWRRLLKHREVAKSFCKIEVNNGVNTSFWFDNWSEKGPLINLTGA------RGAIDMG 701 Query: 375 V-QQVTRAGYWGNPPSSVSNVRAVWHQFHGL-----ARLGNDDQDDIIW---TTTSSGAF 527 + + +T A W V + ++F + + +D I+W F Sbjct: 702 ISRHMTLAEAWSRRRRKRHRVE-ILNEFEEILLQKYQHRNIELEDAILWRGKEDVFKARF 760 Query: 528 TLNSAWELIRAKESNFRWFDLVWFPDRVPKFSLTTWKMLKVKLPT 662 + W IR + W VWF PKFS W ++ +L T Sbjct: 761 STKDTWNHIRTSSNQRAWHKGVWFAHATPKFSFCAWLAIRNRLST 805 Score = 50.4 bits (119), Expect(2) = 3e-16 Identities = 25/101 (24%), Positives = 50/101 (49%) Frame = +2 Query: 701 CSLCWSGLEDISNLYFECSFSQAVWASIKRLCWEFVPAALPRLREADWLLNACRGKHMES 880 C C S +E +L+F+C +S +W SI + ++ + ++ + ++S Sbjct: 820 CVFCSSPMETRDHLFFQCCYSSEIWTSIAKNVYK--DRFSTKWSAVVNYISDSQPDRIQS 877 Query: 881 LIKKLCFTTTVHHIWLERNSRIFRNQVRSTHQLVKTINEAV 1003 + + F ++H IW ERNSR + RS L++ I++ + Sbjct: 878 FLSRYTFQVSIHSIWRERNSRRHGEKSRSASNLIRQIDKTI 918 >gb|AAD26953.1| putative non-LTR retrolelement reverse transcriptase [Arabidopsis thaliana] Length = 323 Score = 64.7 bits (156), Expect(2) = 3e-16 Identities = 43/174 (24%), Positives 
= 72/174 (41%), Gaps = 8/174 (4%) Frame = +3 Query: 195 SWVLSGILKIRDLVKEKCLHIIGDGKDTKFWLDKWLPQDVLLNIVGADRRIFSRNPVNSS 374 SW+ + K+R L + + IG G+ FW D W Q L+++ G + P+N+ Sbjct: 20 SWIWRKLCKLRPLARPFLVCEIGSGETASFWQDNWTGQGPLIDLTGTNGPRSVGMPLNAV 79 Query: 375 VQQVTRAGYWGNPPS-----SVSNVRAVWHQFHGLARLGNDDQDDIIWTT---TSSGAFT 530 V+ R W S S++ +++V + +DD W S F+ Sbjct: 80 VRDALRGDNWWLSSSRSRNPSIALLKSVLPSSESMIECQHDDV--YKWKPDHHAPSNIFS 137 Query: 531 LNSAWELIRAKESNFRWFDLVWFPDRVPKFSLTTWKMLKVKLPTKVNLRRLGVN 692 + W + W VWF DR+PK + W +L T+ L + G+N Sbjct: 138 ASKTWTALNPDGVLVPWQKSVWFKDRIPKHAFICWVAAWKRLHTRDRLTQWGLN 191 Score = 48.9 bits (115), Expect(2) = 3e-16 Identities = 31/102 (30%), Positives = 49/102 (48%) Frame = +2 Query: 698 LCSLCWSGLEDISNLYFECSFSQAVWASIKRLCWEFVPAALPRLREADWLLNACRGKHME 877 +C LC E +L+F+C FS +W+ P + WL +A K++ Sbjct: 195 VCVLCNVVDETHDHLFFQCQFSNEIWSFFMIRAGMTPPHLFGPILL--WLKSASSSKNL- 251 Query: 878 SLIKKLCFTTTVHHIWLERNSRIFRNQVRSTHQLVKTINEAV 1003 SLI KL F +V+ IW ERN RI R+ ++K + + + Sbjct: 252 SLIIKLLFQASVYLIWRERNCRIHTTHSRTPPTIIKEVQQLI 293 >ref|XP_004296004.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Fragaria vesca subsp. 
vesca] Length = 751 Score = 73.6 bits (179), Expect(2) = 7e-16 Identities = 46/167 (27%), Positives = 75/167 (44%), Gaps = 1/167 (0%) Frame = +3 Query: 192 SSWVLSGILKIRDLVKEKCLHIIGDGKDTKFWLDKWLPQDVLLNI-VGADRRIFSRNPVN 368 +S V G+ ++ L+ E IIGDG FW DKWL ++ + +G+ + +N Sbjct: 345 TSSVWHGLKRVLPLLFEHSRWIIGDGNSILFWSDKWLHSSIIQQLNMGSLSHL-----LN 399 Query: 369 SSVQQVTRAGYWGNPPSSVSNVRAVWHQFHGLARLGNDDQDDIIWTTTSSGAFTLNSAWE 548 S V W P + Q + + D +IW +SSG F+ + +E Sbjct: 400 SRVADFIWDQQWALPSHFSNLFPDCAKQILEIPLPNTPESDILIWEHSSSGIFSFSDGYE 459 Query: 549 LIRAKESNFRWFDLVWFPDRVPKFSLTTWKMLKVKLPTKVNLRRLGV 689 L+R W VW P++S+ W++ +KLPT L+R G+ Sbjct: 460 LVRPYFEKLDWASSVWHSFIPPRYSVLAWRIFHLKLPTDDQLQRRGI 506 Score = 38.9 bits (89), Expect(2) = 7e-16 Identities = 26/104 (25%), Positives = 49/104 (47%), Gaps = 4/104 (3%) Frame = +2 Query: 692 LKLCSLC-WSGLEDISNLYFECSFSQAVWASIKRLCWEFVPAALPRLREADWLLNACRGK 868 + +C LC +S EDI +L+ CSF+Q +W + + +LP + L ++ GK Sbjct: 509 VSVCQLCSFSHTEDIPHLFVNCSFAQHIWQWLA----YYFGTSLPSSGSLNDLWSSVTGK 564 Query: 869 HMESLIKKLCFTT---TVHHIWLERNSRIFRNQVRSTHQLVKTI 991 +K + F + + IW N F N+ S ++ +++ Sbjct: 565 AFSPQLKNIWFASCLFALMAIWKSHNKLRFDNKQPSLMRVFRSV 608 >ref|XP_004293076.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Fragaria vesca subsp. 
vesca] Length = 487 Score = 57.4 bits (137), Expect(3) = 1e-15 Identities = 46/164 (28%), Positives = 73/164 (44%), Gaps = 5/164 (3%) Frame = +3 Query: 210 GILKIRDLVKEKCLHIIGDGKDTKFWLDKWLPQDVLLNIVGADRRIFSRNPV--NSSVQQ 383 GIL R+L+ + I+G+G++ KFW W + LLN++ +I RN + N +V Sbjct: 37 GILDARNLILKGMRWIVGNGENIKFWTFNWAYEFPLLNLI----QINDRNAIDLNETVAD 92 Query: 384 VTRAGYWGNPPSSVSNVRAVWHQFHGLARLGNDDQDDIIWTTTSSGAFTLNSAWELIRAK 563 G W + Q G+ L ++ D+ IW + G F++ SA L Sbjct: 93 YIFNGCWNIQKLLQVLDQETVKQITGIPILVSNQCDECIWAPPTDGRFSVKSATWLQYQN 152 Query: 564 ESNFRWFDL---VWFPDRVPKFSLTTWKMLKVKLPTKVNLRRLG 686 + DL VW D K L W +L+ +L T+ L + G Sbjct: 153 LEKHQQSDLINKVWKLDVPLKVKLFGWLLLRGRLKTRDRLSKFG 196 Score = 48.9 bits (115), Expect(3) = 1e-15 Identities = 31/127 (24%), Positives = 63/127 (49%), Gaps = 2/127 (1%) Frame = +1 Query: 1075 GCLNQRRSICVNGVLPLRVFT-LHSGGSMSARVCGYGGVIRDEMGEIIIGFSGSVAQGSV 1251 G ++Q S + + P F ++ GS+ R G V R+ G +I+ + + ++ Sbjct: 309 GGISQTTSSTIRWLPPHNNFIKINFDGSVQGRSAAGGFVFRNSDGNVILAAAKGLGSTTI 368 Query: 1252 LLLEAIGLCNGIKLAKEQNFTKLKAVSDSKILIMIVNKQCPSPWYIKHIVHDIW*LCSNM 1431 EA L + + A+++ + ++ DSK++I +N + PW ++ IV DI + ++ Sbjct: 369 PTAEATALRDSLVKARDRGYMNVQVEGDSKLVIDAINGKLSPPWRLQKIVQDIRTIATSF 428 Query: 1432 -EVSFSH 1449 V F+H Sbjct: 429 SSVCFNH 435 Score = 25.4 bits (54), Expect(3) = 1e-15 Identities = 26/104 (25%), Positives = 36/104 (34%), Gaps = 8/104 (7%) Frame = +2 Query: 701 CSLCWSGLEDISNLYFECSFSQAVW--ASIKRLCWEFVPAALPRLREADW---LLNACRG 865 C LC S E +L+ C F+ V+ A I L DW L R Sbjct: 203 CPLCDSDNETADHLFGHCDFTTEVFRLAGISAL--------------MDWHEGYLKVLRE 248 Query: 866 KHMESLIKKLCFTTTV---HHIWLERNSRIFRNQVRSTHQLVKT 988 + K F + IW RN IFR+ + + + T Sbjct: 249 MFINQPYDKFLFAKVLIIYWQIWKARNDTIFRDVITTATNVAAT 292 >gb|AAG50886.1|AC025294_24 hypothetical protein [Arabidopsis thaliana] Length = 629 Score = 59.3 bits (142), Expect(2) = 3e-15 Identities = 42/168 (25%), Positives = 67/168 (39%), Gaps = 12/168 (7%) Frame = +3 Query: 195 SWVLSGILKIRDLVKEKCLHIIGDGKDTKFWLDKWLPQDVLLNIVGADRRI---FSRNPV 
365 SW+ +LK R+ K + +G T FW D W L+++ G +I SRN Sbjct: 332 SWMWKKMLKYRETAKPFSRVEVNNGARTSFWFDNWSGMGHLMDVTGQRGQIDLGISRNK- 390 Query: 366 NSSVQQVTRAGYWGNP------PSSVSNVRAVWHQFHGLARLGNDDQDDIIWTTTSS--- 518 T A W N ++++ A +Q + L +D +W Sbjct: 391 -------TVAEAWSNRRRRKHRTEQLNDIEAALNQKYQTRNLLREDAT--LWRGKGDVFK 441 Query: 519 GAFTLNSAWELIRAKESNFRWFDLVWFPDRVPKFSLTTWKMLKVKLPT 662 +F+ W +R K + W+ VWF PK+ TW L+ +L T Sbjct: 442 TSFSTKDTWNQVRKKSNEVAWYKGVWFSHSTPKYQFCTWLALRNRLST 489 Score = 50.8 bits (120), Expect(2) = 3e-15 Identities = 25/106 (23%), Positives = 50/106 (47%), Gaps = 5/106 (4%) Frame = +2 Query: 701 CSLCWSGLEDISNLYFECSFSQAVWASIKRLCWEFVPAALPRLREADW-----LLNACRG 865 C+ C + +E +L+F CS++ A+W +I + L DW ++ + Sbjct: 504 CTFCSTSIETRDHLFFSCSYASAIWTAIAK-------NVLQHRFSTDWQTIVNYISETQT 556 Query: 866 KHMESLIKKLCFTTTVHHIWLERNSRIFRNQVRSTHQLVKTINEAV 1003 + S + + F TVH +W ERN R + R++ L+ +++ + Sbjct: 557 DRIRSFLSRYIFQLTVHTVWKERNDRRHGEEPRTSANLISWMDKQI 602 >ref|NP_567266.1| RNA-directed DNA polymerase (reverse transcriptase)-related family protein [Arabidopsis thaliana] gi|5732057|gb|AAD48956.1|AF149414_5 contains similarity to a family of Arabidopsis thaliana predicted proteins, which have similarity to reverse transcriptases; see T14P8.10 (GB:AF069298) [Arabidopsis thaliana] gi|7267223|emb|CAB80830.1| AT4g04650 [Arabidopsis thaliana] gi|332657009|gb|AEE82409.1| RNA-directed DNA polymerase (reverse transcriptase)-related family protein [Arabidopsis thaliana] Length = 332 Score = 56.2 bits (134), Expect(2) = 3e-15 Identities = 38/166 (22%), Positives = 68/166 (40%), Gaps = 8/166 (4%) Frame = +3 Query: 219 KIRDLVKEKCLHIIGDGKDTKFWLDKWLPQDVLLNIVGADRRIFSRNPVNSSVQQVTRAG 398 K+R + + + +G G KFW D W+ L+ ++G P+++ V+ R Sbjct: 3 KLRVVARPFIVCEVGSGVTAKFWHDNWIGLGPLIEVIGPLGPRTVGLPIDAVVRDALRGT 62 Query: 399 YWGNPPSSVSN-----VRAVWHQFHGLARLGNDDQDDIIWTT---TSSGAFTLNSAWELI 554 W S N ++ + + GL +DD +W T S F+ W + Sbjct: 63 SWWIASSRSRNPIIVQLKNLLPEAQGLLDCQHDDS--FLWKTDLHAPSNRFSAPRTWSAL 120 Query: 555 
RAKESNFRWFDLVWFPDRVPKFSLTTWKMLKVKLPTKVNLRRLGVN 692 + W VWF + VPK + W + +L T+ L+ G++ Sbjct: 121 HPQSHTVPWHKAVWFKNHVPKHAFICWVVAWNRLHTRDRLQNWGLS 166 Score = 53.9 bits (128), Expect(2) = 3e-15 Identities = 35/97 (36%), Positives = 51/97 (52%) Frame = +2 Query: 701 CSLCWSGLEDISNLYFECSFSQAVWASIKRLCWEFVPAALPRLREADWLLNACRGKHMES 880 C LC + + ++L+FEC FS VW PA L + +WLL+ R K++ Sbjct: 171 CLLCNAHDDSRAHLFFECQFSGVVWRFFTASTNLNPPAQL--MDCLNWLLSPSREKNI-C 227 Query: 881 LIKKLCFTTTVHHIWLERNSRIFRNQVRSTHQLVKTI 991 LI +L F + V+ IW ERN R+ RST ++K I Sbjct: 228 LIIRLAFHSCVYAIWRERNQRLHSGVSRSTESILKDI 264 >gb|ABK28243.1| unknown [Arabidopsis thaliana] Length = 297 Score = 56.2 bits (134), Expect(2) = 4e-15 Identities = 38/166 (22%), Positives = 68/166 (40%), Gaps = 8/166 (4%) Frame = +3 Query: 219 KIRDLVKEKCLHIIGDGKDTKFWLDKWLPQDVLLNIVGADRRIFSRNPVNSSVQQVTRAG 398 K+R + + + +G G KFW D W+ L+ ++G P+++ V+ R Sbjct: 3 KLRVVARPFIVCEVGSGVTAKFWHDNWIGLGPLIEVIGPLGPRTVGLPIDAVVRDALRGT 62 Query: 399 YWGNPPSSVSN-----VRAVWHQFHGLARLGNDDQDDIIWTT---TSSGAFTLNSAWELI 554 W S N ++ + + GL +DD +W T S F+ W + Sbjct: 63 SWWIASSRSRNPIIVQLKNLLPEAQGLLDCQHDDS--FLWKTDLHAPSNRFSAPRTWSAL 120 Query: 555 RAKESNFRWFDLVWFPDRVPKFSLTTWKMLKVKLPTKVNLRRLGVN 692 + W VWF + VPK + W + +L T+ L+ G++ Sbjct: 121 HPQSHTVPWHKAVWFKNHVPKHAFICWVVAWNRLHTRDRLQNWGLS 166 Score = 53.9 bits (128), Expect(2) = 4e-15 Identities = 35/97 (36%), Positives = 51/97 (52%) Frame = +2 Query: 701 CSLCWSGLEDISNLYFECSFSQAVWASIKRLCWEFVPAALPRLREADWLLNACRGKHMES 880 C LC + + ++L+FEC FS VW PA L + +WLL+ R K++ Sbjct: 171 CLLCNAHDDSRAHLFFECQFSGVVWRFFTASTNLNPPAQL--MDCLNWLLSPSREKNI-C 227 Query: 881 LIKKLCFTTTVHHIWLERNSRIFRNQVRSTHQLVKTI 991 LI +L F + V+ IW ERN R+ RST ++K I Sbjct: 228 LIIRLAFHSCVYAIWRERNQRLHSGVSRSTESILKDI 264 >gb|ABE65512.1| hypothetical protein At4g04650 [Arabidopsis thaliana] Length = 296 Score = 56.2 bits (134), Expect(2) = 4e-15 Identities = 38/166 (22%), Positives = 68/166 (40%), Gaps = 8/166 (4%) Frame = +3 Query: 219 
KIRDLVKEKCLHIIGDGKDTKFWLDKWLPQDVLLNIVGADRRIFSRNPVNSSVQQVTRAG 398 K+R + + + +G G KFW D W+ L+ ++G P+++ V+ R Sbjct: 3 KLRVVARPFIVCEVGSGVTAKFWHDNWIGLGPLIEVIGPLGPRTVGLPIDAVVRDALRGT 62 Query: 399 YWGNPPSSVSN-----VRAVWHQFHGLARLGNDDQDDIIWTT---TSSGAFTLNSAWELI 554 W S N ++ + + GL +DD +W T S F+ W + Sbjct: 63 SWWIASSRSRNPIIVQLKNLLPEAQGLLDCQHDDS--FLWKTDLHAPSNRFSAPRTWSAL 120 Query: 555 RAKESNFRWFDLVWFPDRVPKFSLTTWKMLKVKLPTKVNLRRLGVN 692 + W VWF + VPK + W + +L T+ L+ G++ Sbjct: 121 HPQSHTVPWHKAVWFKNHVPKHAFICWVVAWNRLHTRDRLQNWGLS 166 Score = 53.9 bits (128), Expect(2) = 4e-15 Identities = 35/97 (36%), Positives = 51/97 (52%) Frame = +2 Query: 701 CSLCWSGLEDISNLYFECSFSQAVWASIKRLCWEFVPAALPRLREADWLLNACRGKHMES 880 C LC + + ++L+FEC FS VW PA L + +WLL+ R K++ Sbjct: 171 CLLCNAHDDSRAHLFFECQFSGVVWRFFTASTNLNPPAQL--MDCLNWLLSPSREKNI-C 227 Query: 881 LIKKLCFTTTVHHIWLERNSRIFRNQVRSTHQLVKTI 991 LI +L F + V+ IW ERN R+ RST ++K I Sbjct: 228 LIIRLAFHSCVYAIWRERNQRLHSGVSRSTESILKDI 264 >gb|AAC33226.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 1529 Score = 62.0 bits (149), Expect(2) = 3e-14 Identities = 41/160 (25%), Positives = 69/160 (43%), Gaps = 4/160 (2%) Frame = +3 Query: 195 SWVLSGILKIRDLVKEKCLHIIGDGKDTKFWLDKWLPQDVLLNIVGADRRIFSRNPVNSS 374 SW+ +LK R+L K + +G T FW D W LL+I G R I P+ ++ Sbjct: 1224 SWMWKKLLKYRELAKSMHKVEVRNGSSTSFWYDHWSHLGRLLDITGTRRVIDLGIPLETN 1283 Query: 375 VQQVTRAGYWGNPPSSVSNVRAVWHQFHGLARLGNDDQDDI-IWTTTSSG---AFTLNSA 542 ++ V R +++ N + + L + + DI +W + + F Sbjct: 1284 LETVLRTHQHRQHRAAIYN--RINAEIQRLQQQEREAGPDISLWRSLKNDFNKRFITKVT 1341 Query: 543 WELIRAKESNFRWFDLVWFPDRVPKFSLTTWKMLKVKLPT 662 W +R + W+ VWFP PK+S W ++ +L T Sbjct: 1342 WNNVRTHQPQQNWYKGVWFPYSTPKYSFLLWLTVQNRLST 1381 Score = 45.1 bits (105), Expect(2) = 3e-14 Identities = 29/110 (26%), Positives = 49/110 (44%), Gaps = 5/110 (4%) Frame = +2 Query: 689 QLKLCSLCWSGLEDISNLYFECSFSQAVWASIKRLCWEFVPAALPRLREADW-----LLN 853 QL C+LC + E +L+F C ++ VW ++ + L DW LL Sbjct: 1392 
QLVTCTLCNNAEETRDHLFFSCQYTSYVWEALTQ-------RLLSTNYSRDWNRLFTLLC 1444 Query: 854 ACRGKHMESLIKKLCFTTTVHHIWLERNSRIFRNQVRSTHQLVKTINEAV 1003 + + F +++HIW ERN+R T++L+K I++ V Sbjct: 1445 TSNLPRDHLFLFRYVFQASIYHIWRERNARRHGEISSPTNRLIKLIDKTV 1494 >dbj|BAA77394.1| SAE1-S9-protein [Brassica rapa] Length = 255 Score = 68.9 bits (167), Expect(2) = 1e-13 Identities = 43/164 (26%), Positives = 70/164 (42%), Gaps = 5/164 (3%) Frame = +3 Query: 213 ILKIRDLVKEKCLHIIGDGKDTKFWLDKWLPQDVLLNIVGADRRIFSRNPVNSSVQQVTR 392 +L+++D + + IGDG+ FW D W L ++ G+ R P+N+SV + Sbjct: 1 MLQLKDSLSDYLRCGIGDGRTASFWFDYWTELGPLFSLFGSTGPRQLRIPLNASVADAVQ 60 Query: 393 AGYWGNPPSSVSNVRAVWHQFHGLA-RLGNDDQDDIIWTTTSSGAFT----LNSAWELIR 557 G+W PP+ + + N+D D W +G FT WEL+ Sbjct: 61 NGHWYLPPARSEFAETLQIILSTITPPTDNNDIDIFYWRGGPTGGFTNKFSSKVTWELLC 120 Query: 558 AKESNFRWFDLVWFPDRVPKFSLTTWKMLKVKLPTKVNLRRLGV 689 W V F + VP+ + TW + +LPT+ L G+ Sbjct: 121 VPSPEVTWHSTVGFKEEVPRCTFITWLAMLERLPTRDRLISWGL 164 Score = 35.8 bits (81), Expect(2) = 1e-13 Identities = 14/25 (56%), Positives = 18/25 (72%) Frame = +2 Query: 701 CSLCWSGLEDISNLYFECSFSQAVW 775 C LC G+E S+L+FECSF+ VW Sbjct: 170 CVLCNGGVESHSHLFFECSFAVGVW 194 >gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transcriptase [Brassica napus] Length = 1214 Score = 62.4 bits (150), Expect(2) = 2e-13 Identities = 40/173 (23%), Positives = 75/173 (43%), Gaps = 7/173 (4%) Frame = +3 Query: 195 SWVLSGILKIRDLVKEKCLHIIGDGKDTKFWLDKWLPQDVLLNIVGADRRIFSRNPVNSS 374 SW+ IL +R L K +G+G+ +W D W L+ +GA + ++ Sbjct: 918 SWIWKAILGLRPLAKRFLRGAVGNGQLLSYWYDHWSNLGPLIEAIGASGPQLTGIHESAV 977 Query: 375 VQQVTRAGYWGNPPS-----SVSNVRAVWHQFHGLARLGNDDQDDIIWTT--TSSGAFTL 533 V + + + W P + S++N+R+ + A G+ +D W +SS +F+ Sbjct: 978 VTEASSSTGWILPSARTRNASLANLRSTL--LNSPAPSGDRGEDTYTWYIEGSSSTSFSS 1035 Query: 534 NSAWELIRAKESNFRWFDLVWFPDRVPKFSLTTWKMLKVKLPTKVNLRRLGVN 692 WE +R +++ W VW+ +PK++ W +LP + N Sbjct: 1036 KLTWECLRQRDTTKLWAAAVWYKGCIPKYAFNFWVAHLNRLPVRARTTHWSTN 1088 Score = 41.6 bits (96), Expect(2) = 2e-13 Identities = 
29/117 (24%), Positives = 50/117 (42%), Gaps = 2/117 (1%) Frame = +2 Query: 680 SWSQLKLCSLCWSGLEDISNLYFECSFSQAVWASIKRLCWEFVPAALPRLRE--ADWLLN 853 S ++ LC +C E +L+ C+ +W + F + + R + +W+L+ Sbjct: 1086 STNRPSLCCVCQRETETRDHLFIHCTLGSLIWQQVLA---RFGRSQMFREWKDIIEWMLS 1142 Query: 854 ACRGKHMESLIKKLCFTTTVHHIWLERNSRIFRNQVRSTHQLVKTINEAVVSCI*VR 1024 +KKL T + HIW ERNSR+ S + K I+ ++ I R Sbjct: 1143 --NQGSFSGTLKKLAVQTAIFHIWKERNSRLHSAMSASHTAIFKQIDRSIRDSILAR 1197 >gb|EPS72636.1| hypothetical protein M569_02121, partial [Genlisea aurea] Length = 1503 Score = 71.2 bits (173), Expect(2) = 3e-13 Identities = 52/189 (27%), Positives = 85/189 (44%), Gaps = 18/189 (9%) Frame = +3 Query: 195 SWVLSGILKIRDLVKEKCLHIIGDGKDTKFWLDKWLPQDVLL---NIVGADRRIFSRNPV 365 S+V +GI+K RDLV + H+IGDG W D W+P+ N++G RR + Sbjct: 1243 SYVWNGIMKSRDLVSKGIRHLIGDGSSVDIWHDPWIPKPPTFKPTNLLGERRRASVATLI 1302 Query: 366 NSSVQQVTRAGYW--GNPPSSVSNVRAVWHQFHGLARLGNDDQDDIIWTTTSSGAFTLNS 539 +S R +W G V A + + + +D I+W + SG +T+ S Sbjct: 1303 DS------RTKWWDVGRIREKFDPVDA--NHIISIPLSESPSEDKILWHYSKSGTYTVRS 1354 Query: 540 AWELIRAKESNF-----------RWFDLVWFPDRVPKFSLTTWKMLKVKLPTKVNL--RR 680 A+ L+R+ + +DL+W PK L W++ LPT L RR Sbjct: 1355 AYHLVRSLRVEVSSSSSDSRVTPKVWDLIWKHACCPKIGLFMWRLAHGCLPTNETLWRRR 1414 Query: 681 LGVN*NCAL 707 + ++ C++ Sbjct: 1415 IPIDKECSI 1423 Score = 32.3 bits (72), Expect(2) = 3e-13 Identities = 23/87 (26%), Positives = 37/87 (42%), Gaps = 3/87 (3%) Frame = +2 Query: 695 KLCSLCWSGLEDISNLYFECSFSQAVWASIKRLCWEFVPAALPRLREADWLLNACRGKHM 874 K CS+C + E ++ EC + VWA + L W + DW+ + Sbjct: 1419 KECSICLNRTESDRHILLECPPAIQVWA-LSDLPWGAINTWRDGASAIDWI------SSV 1471 Query: 875 ESLIKKLCFT---TTVHHIWLERNSRI 946 + +K F+ T +W +RNSRI Sbjct: 1472 SATLKPAAFSRLMTIAWFLWWKRNSRI 1498 >gb|ABV21212.1| Ty1_Copia-element protein [Arabidopsis thaliana] Length = 438 Score = 67.8 bits (164), Expect(2) = 7e-13 Identities = 51/177 (28%), Positives = 72/177 (40%), Gaps = 6/177 (3%) Frame = +3 Query: 195 SWVLSGILKIRDLVKEKCLHIIGDGKDTKFWLDKWLPQDVLLNIVGADRRIFSRNPVNSS 374 
SW +L++R L + IG+G FW D W P LL +G+D R P+ S Sbjct: 192 SWNWRCLLRLRPLASKFLFCSIGNGLTASFWADSWTPFGPLLTFIGSDGPRNQRIPLCSK 251 Query: 375 VQQVTRAGYWGNPPSSVSNVRAVWHQFHGLARLGNDD--QDDIIWTTTSSGAFTLNSA-- 542 V V W P SN + H F + +D +W + +SA Sbjct: 252 VADVVNGNRWLLPSPRSSNALNL-HAFLTTLSIPLQPLVEDSYLWKVENCSDIGFSSAHT 310 Query: 543 WELIRAKESNFRWFDLVWFPDRVPKFSLTTWKMLKVKLPTKVNLRRLG--VN*NCAL 707 W +R KE W VWF PK + W + +L TK+ + G V+ CAL Sbjct: 311 WNALRHKEVEKPWVSSVWFKGVTPKNAFNMWITHQDRLRTKLRMIAWGFLVSPVCAL 367 Score = 34.7 bits (78), Expect(2) = 7e-13 Identities = 21/76 (27%), Positives = 34/76 (44%) Frame = +2 Query: 698 LCSLCWSGLEDISNLYFECSFSQAVWASIKRLCWEFVPAALPRLREADWLLNACRGKHME 877 +C+LC G E +L C FS +VWA +++ P + + L R K Sbjct: 364 VCALCQVGFETRDHLMLSCDFSVSVWALVRQRIG--TPLTIFQNWSELILWTQNRSKAAP 421 Query: 878 SLIKKLCFTTTVHHIW 925 S ++KL V+ +W Sbjct: 422 STLRKLVAQAVVYALW 437 >ref|XP_004148188.1| PREDICTED: uncharacterized protein LOC101204314 [Cucumis sativus] Length = 282 Score = 54.3 bits (129), Expect(2) = 9e-13 Identities = 26/80 (32%), Positives = 40/80 (50%) Frame = +3 Query: 441 VWHQFHGLARLGNDDQDDIIWTTTSSGAFTLNSAWELIRAKESNFRWFDLVWFPDRVPKF 620 +W + G+ RL +D +W S +F++ SAWE IR S W L+W +PK Sbjct: 81 IWDRVQGV-RLSPSVEDRWVWVPGSHDSFSITSAWETIRPHSSRVGWSGLLWGGGNIPKH 139 Query: 621 SLTTWKMLKVKLPTKVNLRR 680 S W ++ +L T+ L R Sbjct: 140 SFCAWLAIRDRLGTRGRLSR 159 Score = 47.8 bits (112), Expect(2) = 9e-13 Identities = 30/108 (27%), Positives = 47/108 (43%) Frame = +2 Query: 701 CSLCWSGLEDISNLYFECSFSQAVWASIKRLCWEFVPAALPRLREADWLLNACRGKHMES 880 C LC E +L+F C F +W+ I L + E W+ N GK + Sbjct: 168 CLLCGGNYESRDHLFFSCHFGWEIWSRILLLKSSSHRTGYWGV-ELSWIYNQGIGKSVRR 226 Query: 881 LIKKLCFTTTVHHIWLERNSRIFRNQVRSTHQLVKTINEAVVSCI*VR 1024 + +L + T++ IW ERN R+ +R + + + SCI VR Sbjct: 227 KLWRLLWCATIYFIWQERNHRLHGGSIREP----LVVFQLIRSCIKVR 270 >dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis thaliana] Length = 1223 Score = 57.4 bits (137), Expect(2) = 4e-12 Identities = 38/164 (23%), Positives 
= 65/164 (39%), Gaps = 8/164 (4%) Frame = +3 Query: 195 SWVLSGILKIRDLVKEKCLHIIGDGKDTKFWLDKWLPQDVLLNIVG----ADRRIFSRNP 362 SW+ +LK R++ K +G+GK T FW D W LL G D I R Sbjct: 927 SWIWKKLLKYREVAKTLSKVEVGNGKQTSFWYDNWSDLGQLLERTGDRGLIDLGISRRMT 986 Query: 363 VNSSVQQVTRAGYWGNPPSSVSN-VRAVWHQFHGLARLGNDDQDDIIWTTTSS---GAFT 530 V + + + + + + + ++ W + +D ++W S F+ Sbjct: 987 VEEAWTNRRQRRHRNDVYNVIEDALKKSWD-------TRTETEDKVLWRGKSDVFRTTFS 1039 Query: 531 LNSAWELIRAKESNFRWFDLVWFPDRVPKFSLTTWKMLKVKLPT 662 W R+ + W ++WF PK+S +W +LPT Sbjct: 1040 TRDTWHHTRSTSARVPWHKVIWFSHATPKYSFCSWLAAHGRLPT 1083 Score = 42.4 bits (98), Expect(2) = 4e-12 Identities = 24/101 (23%), Positives = 43/101 (42%) Frame = +2 Query: 701 CSLCWSGLEDISNLYFECSFSQAVWASIKRLCWEFVPAALPRLREADWLLNACRGKHMES 880 C C LE +L+F CSF+ +W + R F + + + +E Sbjct: 1098 CIFCQGTLETRDHLFFTCSFTSVIWVDLAR--GIFKTQYTSHWQSIIEAITNSQHHRVEW 1155 Query: 881 LIKKLCFTTTVHHIWLERNSRIFRNQVRSTHQLVKTINEAV 1003 +++ F T++ +W ERN R + QLV I++ + Sbjct: 1156 FLRRYVFQATIYIVWRERNGRRHGEPPNTASQLVGWIDKQI 1196 >gb|EAZ38029.1| hypothetical protein OsJ_22373 [Oryza sativa Japonica Group] Length = 273 Score = 52.4 bits (124), Expect(2) = 7e-12 Identities = 37/149 (24%), Positives = 63/149 (42%), Gaps = 6/149 (4%) Frame = +3 Query: 258 IGDGKDTKFWLDKWLPQDVLLNIVGADRRIFSRNPVNSSVQQVTRAGYWGNP------PS 419 +GDGKDT FW + WLP+ + V + +V + R W + Sbjct: 6 VGDGKDTDFWRENWLPRGCIS--VSSPTLFTYLGRSKLTVVEALRQHRWVRDIRGSLSAA 63 Query: 420 SVSNVRAVWHQFHGLARLGNDDQDDIIWTTTSSGAFTLNSAWELIRAKESNFRWFDLVWF 599 ++S +W + + +L +D D I W TS+G F S +EL A + + +L+W Sbjct: 64 ALSEYLNLWDEIQEV-QLQDDVDDSIRWRLTSNGTFCTASVYELFFAAQVKSQSAELIWQ 122 Query: 600 PDRVPKFSLTTWKMLKVKLPTKVNLRRLG 686 + W + K + NL++ G Sbjct: 123 TRGPSRIKFFMWLITKGRCLIADNLQKRG 151 Score = 46.6 bits (109), Expect(2) = 7e-12 Identities = 33/113 (29%), Positives = 44/113 (38%), Gaps = 5/113 (4%) Frame = +2 Query: 683 WSQLKLCSLCWSGLEDISNLYFECSFSQAVWASIKRLCWEFVPAALP---RLREADWLLN 853 W C LC E +L +C FS VW ++ W LP ADW + Sbjct: 152 
WLHEDGCVLCNGDQESCDHLLLQCPFSNRVWGLVRT--WIGTSFPLPGEDNWEFADWWMK 209 Query: 854 A--CRGKHMESLIKKLCFTTTVHHIWLERNSRIFRNQVRSTHQLVKTINEAVV 1006 A C LC +W ERN RIF + R+ +L + I + VV Sbjct: 210 ARSCFQTRSRGAFDSLCLLIC-WFVWRERNFRIFEQKTRTALELFRDIKDEVV 261 >gb|ABA99600.2| retrotransposon protein, putative, unclassified [Oryza sativa Japonica Group] Length = 1432 Score = 51.6 bits (122), Expect(2) = 1e-11 Identities = 33/111 (29%), Positives = 49/111 (44%), Gaps = 4/111 (3%) Frame = +2 Query: 683 WSQLKLCSLCWSGLEDISNLYFECSFSQAVWASIKRLCWEFVPAALPRLREAD----WLL 850 W C LC ED +L+ C FS+ VW ++ W V LP W++ Sbjct: 1312 WPHEDHCVLCQREQEDCLHLFITCDFSRRVWQLLR--AWVNVDFPLPGQAGEGLIGWWMV 1369 Query: 851 NACRGKHMESLIKKLCFTTTVHHIWLERNSRIFRNQVRSTHQLVKTINEAV 1003 C + F T IW ERN+RIF+++ ++ QL++ I E V Sbjct: 1370 ARCHFRSSYRSNFDSVFALTCWLIWKERNARIFQHKQQTVEQLLEDIKEEV 1420 Score = 46.2 bits (108), Expect(2) = 1e-11 Identities = 38/156 (24%), Positives = 63/156 (40%), Gaps = 13/156 (8%) Frame = +3 Query: 258 IGDGKDTKFWLDKWL-------PQDVLLNIVGADRRIFSRNPVNSSVQQVTRAGYW-GNP 413 +G+G+DTKFW D WL L+ +G N +V+Q W + Sbjct: 1167 VGNGRDTKFWSDNWLGGGSIAWRWPTLVTFIGR---------TNLTVEQGLLGHRWVRDL 1217 Query: 414 PSSVSNVR-----AVWHQFHGLARLGNDDQDDIIWTTTSSGAFTLNSAWELIRAKESNFR 578 S+S++ +W + + + ++D I W T G+F+++SA++L Sbjct: 1218 QGSLSDIAMMQYFQLWDEIQQINL--SQEEDTICWKLTPDGSFSVSSAYDLFYMAREISP 1275 Query: 579 WFDLVWFPDRVPKFSLTTWKMLKVKLPTKVNLRRLG 686 L+W K W K K T NL + G Sbjct: 1276 CGQLIWQTKAPSKVRFFLWLATKGKCLTADNLAKRG 1311 >gb|EEE53448.1| hypothetical protein OsJ_36550 [Oryza sativa Japonica Group] Length = 394 Score = 51.6 bits (122), Expect(2) = 2e-11 Identities = 33/111 (29%), Positives = 49/111 (44%), Gaps = 4/111 (3%) Frame = +2 Query: 683 WSQLKLCSLCWSGLEDISNLYFECSFSQAVWASIKRLCWEFVPAALPRLREAD----WLL 850 W C LC ED +L+ C FS+ VW ++ W V LP W++ Sbjct: 274 WPHEDHCVLCQREQEDCLHLFITCDFSRRVWQLLR--AWVNVDFPLPGQAGEGLIGWWMV 331 Query: 851 NACRGKHMESLIKKLCFTTTVHHIWLERNSRIFRNQVRSTHQLVKTINEAV 1003 C + F T IW ERN+RIF+++ ++ QL++ I E V Sbjct: 332 
ARCHFRSSYRSNFDSVFALTCWLIWKERNARIFQHKQQTVEQLLEDIKEEV 382 Score = 46.2 bits (108), Expect(2) = 2e-11 Identities = 38/156 (24%), Positives = 63/156 (40%), Gaps = 13/156 (8%) Frame = +3 Query: 258 IGDGKDTKFWLDKWL-------PQDVLLNIVGADRRIFSRNPVNSSVQQVTRAGYW-GNP 413 +G+G+DTKFW D WL L+ +G N +V+Q W + Sbjct: 129 VGNGRDTKFWSDNWLGGGSIAWRWPTLVTFIGR---------TNLTVEQGLLGHRWVRDL 179 Query: 414 PSSVSNVR-----AVWHQFHGLARLGNDDQDDIIWTTTSSGAFTLNSAWELIRAKESNFR 578 S+S++ +W + + + ++D I W T G+F+++SA++L Sbjct: 180 QGSLSDIAMMQYFQLWDEIQQINL--SQEEDTICWKLTPDGSFSVSSAYDLFYMAREISP 237 Query: 579 WFDLVWFPDRVPKFSLTTWKMLKVKLPTKVNLRRLG 686 L+W K W K K T NL + G Sbjct: 238 CGQLIWQTKAPSKVRFFLWLATKGKCLTADNLAKRG 273