BLASTX nr result
ID: Catharanthus23_contig00013643
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Catharanthus23_contig00013643 (914 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|AAD25830.1| putative retroelement pol polyprotein [Arabidopsi... 78 8e-20 gb|EOY00074.1| Cysteine-rich RLK (RECEPTOR-like protein kinase) ... 74 2e-17 gb|AAG10817.1|AC011808_5 Putative retroelement polyprotein [Arab... 63 6e-15 ref|XP_006367551.1| PREDICTED: uncharacterized protein LOC102604... 69 2e-14 emb|CAN73437.1| hypothetical protein VITISV_031733 [Vitis vinifera] 85 4e-14 ref|NP_194047.2| cysteine-rich receptor-like protein kinase 8 [A... 58 7e-14 emb|CAA18463.1| putative protein [Arabidopsis thaliana] gi|72691... 58 7e-14 ref|XP_004234454.1| PREDICTED: uncharacterized protein LOC101244... 82 2e-13 emb|CAN60829.1| hypothetical protein VITISV_012059 [Vitis vinifera] 79 2e-12 ref|XP_004149623.1| PREDICTED: uncharacterized protein LOC101211... 62 6e-12 emb|CAN65820.1| hypothetical protein VITISV_042324 [Vitis vinifera] 77 1e-11 gb|AAC33963.1| contains similarity to reverse transcriptases (Pf... 75 4e-11 emb|CAB40067.1| putative retrotransposon polyprotein [Arabidopsi... 75 4e-11 gb|AAG09097.1|AC009323_8 Putative retroelement polyprotein [Arab... 73 2e-10 gb|AAC67205.1| putative retroelement pol polyprotein [Arabidopsi... 73 2e-10 gb|ABD32333.1| polyprotein-like, putative [Medicago truncatula] 72 2e-10 ref|XP_004509218.1| PREDICTED: uncharacterized protein LOC101501... 72 3e-10 gb|AAD23883.1| putative retroelement pol polyprotein [Arabidopsi... 72 3e-10 gb|AAG51258.1|AC025782_3 Ty1/copia-element polyprotein [Arabidop... 70 8e-10 emb|CAN83990.1| hypothetical protein VITISV_018454 [Vitis vinifera] 69 2e-09 >gb|AAD25830.1| putative retroelement pol polyprotein [Arabidopsis thaliana] Length = 1015 Score = 78.2 bits (191), Expect(2) = 8e-20 Identities = 51/166 (30%), Positives = 78/166 (46%), Gaps = 10/166 (6%) Frame = +3 Query: 222 PQLYASVLVLRSQRKQHVPVRFKDYVCNVNPLELSARSSPLGMTYPL*SYLAYDNFSDSQ 401 P+ + RS+R+ P KDY CN+ S G+ YPL Y++YD S Sbjct: 567 PKSVPTTSTSRSKRESKQPAHLKDYFCNL---------SRKGVQYPLSDYMSYDQLSTPY 617 Query: 402 KAYLVVLSSQKKLQTFIVVATSTHWCRAVQKELTALKNNYTWKYQKL---SQAINLS--- 563 +AY+ ++ + +F S W +A+ EL AL+ TW+ L +AI Sbjct: 618 RAYICSVTKFSEPSSFFQAKKSDDWIKAMNAELQALEGTATWEICSLPSNKKAIGCKWVY 677 Query: 564 ----NANGFIA*NTRLMV*KHYKAHLVAEGCSQVEGEDFTETFTPI 689 N +G + + YKA LVA+G +Q EG DF +TF+P+ Sbjct: 678 KVKLNVDGTL---------ERYKARLVAKGYTQQEGVDFEDTFSPV 714 Score = 46.2 bits (108), Expect(2) = 8e-20 Identities = 20/39 (51%), Positives = 27/39 (69%) Frame = +1 Query: 691 ISFGACKRMFLH*IDVNNVFLDGDLREEIYKELPPGYTP 807 ++ A K+ LH +D++N FL+ DL EEIY L PGYTP Sbjct: 724 LAVAAAKKWSLHQLDISNAFLNRDLYEEIYMNLAPGYTP 762 >gb|EOY00074.1| Cysteine-rich RLK (RECEPTOR-like protein kinase) 8 [Theobroma cacao] Length = 1494 Score = 73.6 bits (179), Expect(2) = 2e-17 Identities = 51/168 (30%), Positives = 78/168 (46%), Gaps = 16/168 (9%) Frame = +3 Query: 234 ASVLVLRSQRKQHVPVRFKDYVCNVNPLELSARSSPL------GMTYPL*SYLAYDNFSD 395 A V+ +R + +P + DY + P S+ S+ YPL +++Y FS Sbjct: 812 ADTSVMTGKRARQIPRKLADYDFVLPPSLTSSSSTHTPTPKANSTVYPLSQFISYSRFSR 871 Query: 396 SQKAYLVVLSSQKKLQTFIVVATSTHWCRAVQKELTALKNNYTWKYQKL---SQAINLS- 563 A+L + S + F HW A+ KE++AL+ N TW KL +AI+ Sbjct: 872 DHNAFLAAILSTDEPTNFHQAIKYAHWQDAMAKEISALEENKTWVLSKLPPGKRAIDSKW 931 Query: 564 ------NANGFIA*NTRLMV*KHYKAHLVAEGCSQVEGEDFTETFTPI 689 N +G + + YKA LVA+G +Q+EG DF ETF P+ Sbjct: 932 VYKIKYNLDGSV---------ERYKARLVAKGYTQIEGVDFHETFAPV 970 Score = 42.7 bits (99), Expect(2) = 2e-17 Identities = 17/28 (60%), Positives = 22/28 (78%) Frame = +1 Query: 721 LH*IDVNNVFLDGDLREEIYKELPPGYT 804 LH +DVNN FL GDL EE+Y ++P G+T Sbjct: 990 LHQLDVNNAFLHGDLNEEVYMKIPQGFT 1017 >gb|AAG10817.1|AC011808_5 Putative retroelement polyprotein [Arabidopsis thaliana] Length = 1413 Score = 62.8 bits (151), Expect(2) = 6e-15 Identities = 45/156 (28%), Positives = 72/156 (46%), Gaps = 10/156 (6%) Frame = +3 Query: 252 RSQRKQHVPVRFKDYVCNVNPLELSARSSPLGMTYPL*SYLAYDNFSDSQKAYLVVLSSQ 431 ++ R P KDY CN S SS +P+ L+Y + SD ++ ++ Sbjct: 862 QNSRVSRPPAYLKDYHCN------SVTSST---DHPISEVLSYSSLSDPYMIFINAVNKI 912 Query: 432 KKLQTFIVVATSTHWCRAVQKELTALKNNYTWKYQKL---SQAINLS-------NANGFI 581 + T+ WC A+ E+TAL++N TW L +A+ NA+G + Sbjct: 913 PEPHTYAQARQIKEWCDAMGMEITALEDNGTWVVCSLPVGKKAVGCKWVYKIKLNADGSL 972 Query: 582 A*NTRLMV*KHYKAHLVAEGCSQVEGEDFTETFTPI 689 + YKA LVA+G +Q EG D+ +TF+P+ Sbjct: 973 ---------ERYKARLVAKGYTQTEGLDYVDTFSPV 999 Score = 45.1 bits (105), Expect(2) = 6e-15 Identities = 21/41 (51%), Positives = 27/41 (65%) Frame = +1 Query: 685 LYISFGACKRMFLH*IDVNNVFLDGDLREEIYKELPPGYTP 807 L I+ A K L +D++N FL+G L EEIY LPPGY+P Sbjct: 1007 LLIAVAAAKGWSLSQLDISNAFLNGSLDEEIYMTLPPGYSP 1047 >ref|XP_006367551.1| PREDICTED: uncharacterized protein LOC102604059 [Solanum tuberosum] Length = 1014 Score = 68.9 bits (167), Expect(2) = 2e-14 Identities = 50/156 (32%), Positives = 74/156 (47%), Gaps = 10/156 (6%) Frame = +3 Query: 252 RSQRKQHVPVRFKDYVCNVNPLELSARSSPLGMTYPL*SYLAYDNFSDSQKAYLVVLSSQ 431 RS R P+ KDYV +V + A + PL Y + YL YDN S + +AY+ + Sbjct: 655 RSLRTSSTPLWMKDYVASVQGIP--AHAKPL---YSIDQYLGYDNLSANYQAYMSSFGTD 709 Query: 432 KKLQTFIVVATSTHWCRAVQKELTALKNNYTWKYQKLSQAINL----------SNANGFI 581 + +F W A+Q E++AL+ N TW+ L + ANG I Sbjct: 710 IEPSSFEEACKDPRWVDAMQAEISALECNNTWQVVPLPCGKTVIGCKWIFKIKYKANGQI 769 Query: 582 A*NTRLMV*KHYKAHLVAEGCSQVEGEDFTETFTPI 689 + +KA LVA+G +Q EG D+ ETF+P+ Sbjct: 770 ---------ERFKARLVAKGYNQREGLDYHETFSPV 796 Score = 37.0 bits (84), Expect(2) = 2e-14 Identities = 16/37 (43%), Positives = 22/37 (59%) Frame = +1 Query: 691 ISFGACKRMFLH*IDVNNVFLDGDLREEIYKELPPGY 801 ++ A +H +DV N FL GDL EE+Y LP G+ Sbjct: 806 LALAAAGNWHVHQMDVYNAFLQGDLYEEVYMTLPQGF 842 >emb|CAN73437.1| hypothetical protein VITISV_031733 [Vitis vinifera] Length = 1322 Score = 84.7 bits (208), Expect = 4e-14 Identities = 65/231 (28%), Positives = 108/231 (46%), Gaps = 21/231 (9%) Frame = +3 Query: 243 LVLRSQRKQHVPVRFKDYVCNV----NPLELSARSSPLGMTYPL*SYLAYDNFSDSQKAY 410 ++ RSQR H P+ +DYVCN N L + S G YPL ++++Y +S +++ Sbjct: 766 ILRRSQRPHHPPMALRDYVCNQVTFPNHLPPLSSSPQKGTRYPLCNFVSYHRYSPQHRSF 825 Query: 411 LVVLSSQKKLQTFIVVATSTHWCRAVQKELTALKNNYTWKYQKLSQAIN----------L 560 +S + ++ A+ +HW A+Q EL AL+ N+TW L Sbjct: 826 TAAVSQDIEPTSYAEAASHSHWQEAMQSELAALEANHTWSLTSLPLGKKPIGCRWVYKIK 885 Query: 561 SNANGFIA*NTRLMV*KHYKAHLVAEGCSQVEGEDFTETFTPIY*LWRLQKDVFTLDRCK 740 +++G I + +KA LVA+G +Q+EG D+ +TF+P + ++ Sbjct: 886 RHSDGTI---------ERFKARLVAKGYTQLEGIDYHDTFSPTAKMITVR---------- 926 Query: 741 *CVL------RW*LEGRNIQGTSTWLH-SLKARILLSSLYGLRQAGRNFFS 872 C+L W L ++ + +LH L I +S GLR+ G N FS Sbjct: 927 -CLLALAAAQNWSLHQLDV--NNAFLHGDLHEEIYMSPPPGLRRQGENLFS 974 >ref|NP_194047.2| cysteine-rich receptor-like protein kinase 8 [Arabidopsis thaliana] gi|332659317|gb|AEE84717.1| cysteine-rich receptor-like protein kinase 8 [Arabidopsis thaliana] Length = 1262 Score = 58.2 bits (139), Expect(2) = 7e-14 Identities = 43/156 (27%), Positives = 72/156 (46%), Gaps = 11/156 (7%) Frame = +3 Query: 255 SQRKQHVPVRFKDYVCNVNPLELSARSSPLGMT-YPL*SYLAYDNFSDSQKAYLVVLSSQ 431 S R+ P +DY C+ S +T + + +L+Y+ S ++LV ++ Sbjct: 34 SHRRTRKPAYLQDYYCH----------SVASLTIHDISQFLSYEKVSPLYHSFLVCIAKA 83 Query: 432 KKLQTFIVVATSTHWCRAVQKELTALKNNYTWKYQKL---SQAINLS-------NANGFI 581 K+ T+ WC A+ E+ A++ +TW+ L + I N++G I Sbjct: 84 KEPSTYNEAKEFLVWCGAMDDEIGAMETTHTWEICTLPPNKKPIGCKWVYKIKYNSDGTI 143 Query: 582 A*NTRLMV*KHYKAHLVAEGCSQVEGEDFTETFTPI 689 + YKA LVA+G +Q EG DF ETF+P+ Sbjct: 144 ---------ERYKARLVAKGYTQQEGIDFIETFSPV 170 Score = 46.2 bits (108), Expect(2) = 7e-14 Identities = 20/39 (51%), Positives = 27/39 (69%) Frame = +1 Query: 685 LYISFGACKRMFLH*IDVNNVFLDGDLREEIYKELPPGY 801 L ++ A LH +D++N FL+GDL EEIY +LPPGY Sbjct: 178 LILAISAIYNFTLHQLDISNAFLNGDLDEEIYMKLPPGY 216 >emb|CAA18463.1| putative protein [Arabidopsis thaliana] gi|7269163|emb|CAB79271.1| putative protein [Arabidopsis thaliana] Length = 1240 Score = 58.2 bits (139), Expect(2) = 7e-14 Identities = 43/156 (27%), Positives = 72/156 (46%), Gaps = 11/156 (7%) Frame = +3 Query: 255 SQRKQHVPVRFKDYVCNVNPLELSARSSPLGMT-YPL*SYLAYDNFSDSQKAYLVVLSSQ 431 S R+ P +DY C+ S +T + + +L+Y+ S ++LV ++ Sbjct: 34 SHRRTRKPAYLQDYYCH----------SVASLTIHDISQFLSYEKVSPLYHSFLVCIAKA 83 Query: 432 KKLQTFIVVATSTHWCRAVQKELTALKNNYTWKYQKL---SQAINLS-------NANGFI 581 K+ T+ WC A+ E+ A++ +TW+ L + I N++G I Sbjct: 84 KEPSTYNEAKEFLVWCGAMDDEIGAMETTHTWEICTLPPNKKPIGCKWVYKIKYNSDGTI 143 Query: 582 A*NTRLMV*KHYKAHLVAEGCSQVEGEDFTETFTPI 689 + YKA LVA+G +Q EG DF ETF+P+ Sbjct: 144 ---------ERYKARLVAKGYTQQEGIDFIETFSPV 170 Score = 46.2 bits (108), Expect(2) = 7e-14 Identities = 20/39 (51%), Positives = 27/39 (69%) Frame = +1 Query: 685 LYISFGACKRMFLH*IDVNNVFLDGDLREEIYKELPPGY 801 L ++ A LH +D++N FL+GDL EEIY +LPPGY Sbjct: 178 LILAISAIYNFTLHQLDISNAFLNGDLDEEIYMKLPPGY 216 >ref|XP_004234454.1| PREDICTED: uncharacterized protein LOC101244259 [Solanum lycopersicum] Length = 1812 Score = 82.4 bits (202), Expect = 2e-13 Identities = 76/252 (30%), Positives = 110/252 (43%), Gaps = 29/252 (11%) Frame = +3 Query: 222 PQLYASVLVLRSQRKQHVPVRFKDYVCNVNPLELSARSSPLGMTYPL*SYLAYDNFSDSQ 401 P + S + RS R H P+ DYV R +P YPL +Y++Y N S S Sbjct: 845 PVSHTSAPIRRSNRHSHPPLWLADYV---------TRPAPTSTLYPLSNYVSYTNLSSSH 895 Query: 402 KAYLVVLSSQKKLQTFIVVATSTHWCRAVQKELTALKNNYTWKYQKLSQA---------- 551 + YL V S+ + T+ + W A+Q E+ AL +N+TW+ L Sbjct: 896 QHYLGVFSAIIEPSTYQEAIKDSRWIDAMQSEIQALHDNHTWELVPLPPGKVPIGCRWVY 955 Query: 552 -INLSNANGFIA*NTRLMV*KHYKAHLVAEGCSQVEGEDFTETFTPIY*L---------- 698 + L +NG I + +K LVA+G +Q EG DF ETF+P+ + Sbjct: 956 KVKL-KSNGDI---------ERFKTRLVAKGYTQKEGLDFHETFSPVVKMTTVRTVLSLA 1005 Query: 699 ----WRLQK----DVFTLDRCK*CVLRW*LEGRNIQGTSTWLHSLKARILLSSLYGLRQA 854 W + + +VF V EG + QG S + L R L+ SLYGL+QA Sbjct: 1006 AQFNWHIHQLDVYNVFLHSDLHDEVYMQLPEGFSSQGES---NGLVCR-LVKSLYGLKQA 1061 Query: 855 GRNFFSKLSSIL 890 R + KL L Sbjct: 1062 SRQWNLKLCEAL 1073 >emb|CAN60829.1| hypothetical protein VITISV_012059 [Vitis vinifera] Length = 1128 Score = 79.0 bits (193), Expect = 2e-12 Identities = 73/253 (28%), Positives = 120/253 (47%), Gaps = 35/253 (13%) Frame = +3 Query: 252 RSQRKQHVPVRFKDYVC-NVNPLELSARS----SPLGMTYPL*SYLAYDNFSDSQKAYLV 416 RS+R +H+P ++Y C N+ ++ + ++ S G Y + S+L+ S KA++ Sbjct: 549 RSERTKHLPKYLQNYYCGNMTKIDSATQAPSSCSSSGKPYCIFSFLSDSRLSSKHKAFIY 608 Query: 417 VLSSQKKLQTFIVVATSTHWCRAVQKELTALKNNYTWKYQKL---SQAINLS-------N 566 V+SS + +T+ + HW A+ E+ ALK+N TW L AI Sbjct: 609 VISSTFEPKTYKQXVSIPHWQTAMTDEIKALKHNKTWDLAILPPNKTAIGCKWVYRVKFK 668 Query: 567 ANGFIA*NTRLMV*KHYKAHLVAEGCSQVEGEDFTETFTPIY*LWRLQKDVFTLDRCK*C 746 A+G + + YKA LVA+G +Q EG DF +T++P+ + + + + + K Sbjct: 669 ADGSV---------ERYKARLVAKGYTQQEGLDFFDTYSPVAKMTTV-RVLLAIAAAK-- 716 Query: 747 VLRW*LEGRNIQGTSTWLH-SLKARI------------------LLSSLYGLRQAGRNFF 869 +W L ++ + +LH L + L SLYGLRQA R ++ Sbjct: 717 --QWYLHQLDV--NNAFLHGDLNEEVYMQLPLGFSTPNDPRVCKLKKSLYGLRQASRQWY 772 Query: 870 SKL-SSILSFDLS 905 SKL SS+L F S Sbjct: 773 SKLSSSLLKFGFS 785 >ref|XP_004149623.1| PREDICTED: uncharacterized protein LOC101211618 [Cucumis sativus] Length = 2085 Score = 62.4 bits (150), Expect(2) = 6e-12 Identities = 44/152 (28%), Positives = 75/152 (49%), Gaps = 2/152 (1%) Frame = +3 Query: 240 VLVLRSQRKQHVPVRFKDYVCNVNPLELSARSSPLGMTYPL*SYLAYDNFSDSQKAYLVV 419 ++ +S R H P KD+ CN+ S S+P +PL YL+Y+ +S K Y+ Sbjct: 727 IMTRKSSRPHHPPSYLKDFYCNLT----SQNSTP----FPLNQYLSYNAYSQHHKNYMFN 778 Query: 420 LSSQKKLQTFIVVATSTH-WCRAVQKELTALKNNYTWKYQKLSQAINLSNANGFIA*NTR 596 ++S + T+ A H W +A+ +E+ A++ TW + + + + + Sbjct: 779 VTSIYE-PTYYHQAVKHHTWRKAMAEEIEAMERTNTWTIVSIPKDHHTVGSKWVYKVKCK 837 Query: 597 L-MV*KHYKAHLVAEGCSQVEGEDFTETFTPI 689 YKA LVA+G +Q EG DF +TF+P+ Sbjct: 838 PDGTIDRYKARLVAKGYNQQEGIDFLDTFSPV 869 Score = 35.4 bits (80), Expect(2) = 6e-12 Identities = 14/24 (58%), Positives = 19/24 (79%) Frame = +1 Query: 730 IDVNNVFLDGDLREEIYKELPPGY 801 +D+NN FL+GDL EE++ LP GY Sbjct: 892 MDINNAFLNGDLFEEVHMTLPLGY 915 >emb|CAN65820.1| hypothetical protein VITISV_042324 [Vitis vinifera] Length = 1262 Score = 76.6 bits (187), Expect = 1e-11 Identities = 70/253 (27%), Positives = 119/253 (47%), Gaps = 35/253 (13%) Frame = +3 Query: 252 RSQRKQHVPVRFKDYVC-NVNPLELSARS----SPLGMTYPL*SYLAYDNFSDSQKAYLV 416 RS+R +H+P ++Y C N+ ++L+ ++ S G Y + S+L+ S KA++ Sbjct: 855 RSERTKHLPKYLQNYYCGNMTKIDLATQAPSSCSSSGKPYYIFSFLSDSKLSSKHKAFIS 914 Query: 417 VLSSQKKLQTFIVVATSTHWCRAVQKELTALKNNYTWKYQKL---SQAINLS-------N 566 ++SS + +T+ + HW A+ E+ AL++N TW L I Sbjct: 915 IISSTFEPKTYKQAVSIPHWKTAMTDEIKALEHNKTWDLAILPPNKTTIGCKWVYQVKFK 974 Query: 567 ANGFIA*NTRLMV*KHYKAHLVAEGCSQVEGEDFTETFTPIY*LWRLQKDVFTLDRCK*C 746 A+G + + YKA LVA+G +Q EG DF +T++P+ + + + + + K Sbjct: 975 ADGSV---------ERYKARLVAKGYTQQEGLDFFDTYSPVAKMTTV-RVLLAIAATK-- 1022 Query: 747 VLRW*LEGRNIQGTSTWLH-------------------SLKARILLSSLYGLRQAGRNFF 869 +W L ++ + +LH + L SLYGLRQA R ++ Sbjct: 1023 --QWYLHQLDV--NNAFLHEDLNEDVYMQLPPGFSTPNDPRVCKLKKSLYGLRQASRQWY 1078 Query: 870 SKL-SSILSFDLS 905 SKL SS+L F S Sbjct: 1079 SKLSSSLLKFGFS 1091 >gb|AAC33963.1| contains similarity to reverse transcriptases (Pfam; rvt.hmm, score: 11.19) [Arabidopsis thaliana] Length = 1633 Score = 74.7 bits (182), Expect = 4e-11 Identities = 73/263 (27%), Positives = 109/263 (41%), Gaps = 46/263 (17%) Frame = +3 Query: 237 SVLVLRSQRKQHVPVRFKDYVCNVNP------------LELSARSSP---LGMTYPL*SY 371 SV + R +R P +Y CN P +E + S P + YP+ + Sbjct: 857 SVPIARPKRNAKAPAYLSEYHCNSVPFLSSLSPTTSTSIETPSSSIPPKKITTPYPMSTA 916 Query: 372 LAYDNFSDSQKAYLVVLSSQKKLQTFIVVATSTHWCRAVQKELTALKNNYTWKYQKLSQA 551 ++YD + +Y+ + + + + F S W RA +EL AL+ N TW + L++ Sbjct: 917 ISYDKLTPLFHSYICAYNVETEPKAFTQAMKSEKWTRAANEELHALEQNKTWIVESLTEG 976 Query: 552 INL----------SNANGFIA*NTRLMV*KHYKAHLVAEGCSQVEGEDFTETFTPIY*L- 698 N+ N +G I + YKA LVA+G +Q EG D+ ETF+P+ Sbjct: 977 KNVVGCKWVFTIKYNPDGSI---------ERYKARLVAQGFTQQEGIDYMETFSPVAKFG 1027 Query: 699 -------------WRL-QKDVFT------LDRCK*CVLRW*LEGRNIQGTSTWLHSLKAR 818 W L Q DV LD + L T L S Sbjct: 1028 SVKLLLGLAAATGWSLTQMDVSNAFLHGELDE----EIYMSLPQGYTPPTGISLPSKPVC 1083 Query: 819 ILLSSLYGLRQAGRNFFSKLSSI 887 LL SLYGL+QA R ++ +LSS+ Sbjct: 1084 RLLKSLYGLKQASRQWYKRLSSV 1106 >emb|CAB40067.1| putative retrotransposon polyprotein [Arabidopsis thaliana] gi|7267797|emb|CAB81200.1| putative retrotransposon polyprotein [Arabidopsis thaliana] Length = 1203 Score = 74.7 bits (182), Expect = 4e-11 Identities = 73/263 (27%), Positives = 109/263 (41%), Gaps = 46/263 (17%) Frame = +3 Query: 237 SVLVLRSQRKQHVPVRFKDYVCNVNP------------LELSARSSP---LGMTYPL*SY 371 SV + R +R P +Y CN P +E + S P + YP+ + Sbjct: 443 SVPIARPKRNAKAPAYLSEYHCNSVPFLSSLSPTTSTSIETPSSSIPPKKITTPYPMSTA 502 Query: 372 LAYDNFSDSQKAYLVVLSSQKKLQTFIVVATSTHWCRAVQKELTALKNNYTWKYQKLSQA 551 ++YD + +Y+ + + + + F S W RA +EL AL+ N TW + L++ Sbjct: 503 ISYDKLTPLFHSYICAYNVETEPKAFTQAMKSEKWTRAANEELHALEQNKTWIVESLTEG 562 Query: 552 INL----------SNANGFIA*NTRLMV*KHYKAHLVAEGCSQVEGEDFTETFTPIY*L- 698 N+ N +G I + YKA LVA+G +Q EG D+ ETF+P+ Sbjct: 563 KNVVGCKWVFTIKYNPDGSI---------ERYKARLVAQGFTQQEGIDYMETFSPVAKFG 613 Query: 699 -------------WRL-QKDVFT------LDRCK*CVLRW*LEGRNIQGTSTWLHSLKAR 818 W L Q DV LD + L T L S Sbjct: 614 SVKLLLGLAAATGWSLTQMDVSNAFLHGELDE----EIYMSLPQGYTPPTGISLPSKPVC 669 Query: 819 ILLSSLYGLRQAGRNFFSKLSSI 887 LL SLYGL+QA R ++ +LSS+ Sbjct: 670 RLLKSLYGLKQASRQWYKRLSSV 692 >gb|AAG09097.1|AC009323_8 Putative retroelement polyprotein [Arabidopsis thaliana] Length = 1486 Score = 72.8 bits (177), Expect = 2e-10 Identities = 65/239 (27%), Positives = 104/239 (43%), Gaps = 23/239 (9%) Frame = +3 Query: 243 LVLRSQRKQHVPVRFKDYVCNVNPLELSARSSPLGMTYPL*SYLAYDNFSDSQKAYLVVL 422 L+ + R + PV+ DYV L + P YPL +Y++ FSD+ +AY++ + Sbjct: 917 LLGKGHRPKRPPVKLADYVTT-----LLHQPFPSATPYPLDNYISSSRFSDNYQAYILAI 971 Query: 423 SSQKKLQTFIVVATSTHWCRAVQKELTALKNNYTWKYQKLSQAINLSNANGFIA*NTRL- 599 +S + + + HW AV E+ +L+N TW + L + Sbjct: 972 TSGNEPRNYNEAMLDDHWKGAVSHEIGSLENLGTWTVEDLPPGKKALGCKWVFRLKYKSD 1031 Query: 600 MV*KHYKAHLVAEGCSQVEGEDFTETFTPIY*LWRLQ---KDVFTLDRCK*CVLRW*LEG 770 + +KA LV G +Q EG D+TETF P+ + ++ + V +LD W E Sbjct: 1032 GTLERHKARLVVLGNNQTEGLDYTETFAPVAKMVTVRAFLQQVVSLD--------W--EV 1081 Query: 771 RNIQGTSTWLH-------------------SLKARILLSSLYGLRQAGRNFFSKLSSIL 890 + + +LH K L SLYGL+QA R +F+KL+S L Sbjct: 1082 HQMDVHNAFLHGDLDEEVYMQFPPGFRTGDKTKVCRLRKSLYGLKQAPRCWFAKLTSAL 1140 >gb|AAC67205.1| putative retroelement pol polyprotein [Arabidopsis thaliana] Length = 1413 Score = 72.8 bits (177), Expect = 2e-10 Identities = 74/248 (29%), Positives = 110/248 (44%), Gaps = 35/248 (14%) Frame = +3 Query: 252 RSQRKQHVPVRFKDYV-----CNVN------PLELSARSSPLG-MTYPL*SYLAYDNFSD 395 + +R+ P R KDY+ C N P + SS G + YPL Y++ + FS Sbjct: 908 QGKRQVQQPARLKDYILYNASCTPNTPHVLSPSTSQSSSSIQGNLQYPLTDYISDECFSA 967 Query: 396 SQKAYLVVLSSQKKLQTFIVVATSTHWCRAVQKELTALKNNYTWKYQKL--------SQA 551 K +L +++ + + F W A+ KE+ AL+ N TW L SQ Sbjct: 968 GHKVFLAAITANDEPKHFKEDVKVKVWNDAMYKEVDALEVNKTWDIVDLPTGKVAIGSQW 1027 Query: 552 INLS--NANGFIA*NTRLMV*KHYKAHLVAEGCSQVEGEDFTETFTPIY*LWRL------ 707 + + NA+G + + YKA LV +G +Q+EGED+TETF P+ + + Sbjct: 1028 VYKTKFNADGTV---------ERYKARLVVQGNNQIEGEDYTETFAPVVKMTTVRTLLRL 1078 Query: 708 ----QKDVFTLDRCK*CVLRW*LEGR---NIQGTSTWLHSLKARILLSSLYGLRQAGRNF 866 Q +V+ +D L LE + H K L SLYGL+QA R + Sbjct: 1079 VAANQWEVYQMD-VHNAFLHGDLEEEVYMKLPPGFRHSHPDKVCRLRKSLYGLKQAPRCW 1137 Query: 867 FSKLSSIL 890 F KLS L Sbjct: 1138 FKKLSDAL 1145 >gb|ABD32333.1| polyprotein-like, putative [Medicago truncatula] Length = 635 Score = 72.4 bits (176), Expect = 2e-10 Identities = 61/240 (25%), Positives = 109/240 (45%), Gaps = 28/240 (11%) Frame = +3 Query: 255 SQRKQHVPVRFKDYVCNVNPLELSARSSPLGMTYPL*SYLAYDNFSDSQKAYLVVLSSQK 434 S R + P +DY+CN + +S+ + + YPL +++++ + S+SQ + + L S Sbjct: 40 SSRTKKSPSYLQDYICNPSTNSVSSANKSC-ILYPLSNFISHKHLSNSQHTFALSLVSHI 98 Query: 435 KLQTFIVVATSTHWCRAVQKELTALKNNYTWKYQKLSQAIN----------LSNANGFIA 584 + +++ S W +A+Q EL AL TW + + N +G I Sbjct: 99 EPKSYAEAIKSDCWKQAMQLELNALDQTGTWTVVDIPSQVKPIGCKWVCRIKYNDDGSI- 157 Query: 585 *NTRLMV*KHYKAHLVAEGCSQVEGEDFTETFTPIY*LWRLQKDVFTLDRCK*CVLRW*L 764 + YKA LVA+G +Q+EG D+ +TF+P+ + + L + W L Sbjct: 158 --------ERYKARLVAKGYNQIEGLDYFDTFSPV-----AKITIVRLVIALASINHWFL 204 Query: 765 EGRNIQ------------------GTSTWLHSLKARILLSSLYGLRQAGRNFFSKLSSIL 890 ++ G ST+ + + L SLYGL+QA R ++ KL+++L Sbjct: 205 HQLDVNNAFLHGDLQENVYKKIPPGLSTFKPNQVCK-LSKSLYGLKQASRKWYEKLTTLL 263 >ref|XP_004509218.1| PREDICTED: uncharacterized protein LOC101501009 [Cicer arietinum] Length = 751 Score = 72.0 bits (175), Expect = 3e-10 Identities = 68/248 (27%), Positives = 111/248 (44%), Gaps = 33/248 (13%) Frame = +3 Query: 252 RSQRKQHVPVRFKDYVCNVNPLELSARSSPL------GMTYPL*SYLAYDNFSDSQKAYL 413 RS R P D+ C++ L+ S ++ L G YPL ++++YDN S + K + Sbjct: 301 RSGRTIKPPSYLTDFHCSL--LQGSINNNILVPNQFKGTPYPLSTFISYDNLSSAHKFFT 358 Query: 414 VVLSSQKKLQTFIVVATSTHWCRAVQKELTALKNNYTWKYQKLSQAINL----------S 563 + +S+ K+ ++ ++W A+ EL +L NN TW+ L + Sbjct: 359 INVSTLKEPSSYSEAIKDSNWRLAIDSELRSLLNNNTWELTTLPSDKKVIGCKWVFKLKF 418 Query: 564 NANGFIA*NTRLMV*KHYKAHLVAEGCSQVEGEDFTETFTPIY*LWRLQK---------- 713 +ANG I + YKA LVA+G +Q EG D+ +TF+P+ + ++ Sbjct: 419 HANGTI---------ERYKARLVAKGFNQTEGLDYLDTFSPVVKMTTIRLLLSIAAIKNW 469 Query: 714 DVFTLDRCK*CVLRW*LEGRNIQGTSTWL-------HSLKARILLSSLYGLRQAGRNFFS 872 +F LD + L G I+ + H + L SLYGL+QA R + Sbjct: 470 FLFQLD-----INTAFLHGDLIEDVYMKIPPGLHVQHKSQVCKLKRSLYGLKQASRQWNM 524 Query: 873 KLSSILSF 896 KL S+ F Sbjct: 525 KLCSLKQF 532 >gb|AAD23883.1| putative retroelement pol polyprotein [Arabidopsis thaliana] Length = 1156 Score = 72.0 bits (175), Expect = 3e-10 Identities = 77/249 (30%), Positives = 113/249 (45%), Gaps = 38/249 (15%) Frame = +3 Query: 258 QRKQHV--PVRFKDYV---CNVNPL------ELSARSSPL----GMTYPL*SYLAYDNFS 392 QRK+ + VR +DYV V+P+ + S++SS + YPL Y++ D FS Sbjct: 564 QRKRQIRQSVRLQDYVLYNATVSPINPHALPDSSSQSSSMVQGTSSLYPLSDYVSDDCFS 623 Query: 393 DSQKAYLVVLSSQKKLQTFIVVATSTHWCRAVQKELTALKNNYTWKYQKL--------SQ 548 KA+L +++ + + F W A+ KE+ AL+ N TW L SQ Sbjct: 624 AGHKAFLAAITANDEPKHFKEAVRIKVWNDAMFKEVDALEINKTWDIVDLPPGKVAIGSQ 683 Query: 549 AINLS--NANGFIA*NTRLMV*KHYKAHLVAEGCSQVEGEDFTETFTPIY*LWRL----- 707 + + NA+G I + YKA LV +G QVEGED+ ETF P+ + + Sbjct: 684 WVYKTKYNADGSI---------ERYKARLVVQGNKQVEGEDYNETFAPVVKMTTVRTLLR 734 Query: 708 -----QKDVFTLDRCK*CVLRW*LEGR---NIQGTSTWLHSLKARILLSSLYGLRQAGRN 863 Q +V+ +D L L+ + H K L SLYGL+QA R Sbjct: 735 LVAANQWEVYQMD-VNNAFLHGDLDEEVYMKLPPGFRHSHPDKVCRLRKSLYGLKQAPRC 793 Query: 864 FFSKLSSIL 890 +F KLS L Sbjct: 794 WFKKLSDAL 802 >gb|AAG51258.1|AC025782_3 Ty1/copia-element polyprotein [Arabidopsis thaliana] Length = 1152 Score = 70.5 bits (171), Expect = 8e-10 Identities = 75/247 (30%), Positives = 107/247 (43%), Gaps = 32/247 (12%) Frame = +3 Query: 252 RSQRKQHVPVRFKDYVCNVNPLELSAR-SSPLGMT-YPL*SYLAYDNFSDSQKAYLVVLS 425 R R++ VR KDY E + S +G YP+ +Y++ + FS S + +L +S Sbjct: 871 RGLRQRQENVRLKDYQTYSAQCESTQTLSDNIGTCIYPMANYVSGEIFSPSNQHFLAAIS 930 Query: 426 SQKKLQTFIVVATSTHWCRAVQKELTALKNNYTWKYQKLSQAINL----------SNANG 575 QT+ W AV E+ AL++ TW KL Q + N+NG Sbjct: 931 MVDPPQTYNQAIREKEWRNAVFFEVDALEDQGTWDITKLPQGVKAIGSKWVFRIKYNSNG 990 Query: 576 FIA*NTRLMV*KHYKAHLVAEGCSQVEGEDFTETFTPIY*LWRLQKDVFTLDRCK*CVLR 755 + + YKA LVA G Q EG DFT+TF P+ ++Q LD Sbjct: 991 TV---------ERYKARLVALGNHQKEGIDFTKTFAPVV---KMQTVRLLLDVA--AAKD 1036 Query: 756 W*LEGRNIQGTSTWLH-SLKARI------------------LLSSLYGLRQAGRNFFSKL 878 W L ++ + +LH LK I L S+YGL+QA R +F KL Sbjct: 1037 WELHQMDVH--NAFLHGDLKEDIYMKPPPGFKTTDPSLVCKLKKSIYGLKQAPRCWFEKL 1094 Query: 879 S-SILSF 896 S S+L F Sbjct: 1095 STSLLKF 1101 >emb|CAN83990.1| hypothetical protein VITISV_018454 [Vitis vinifera] Length = 1243 Score = 69.3 bits (168), Expect = 2e-09 Identities = 69/241 (28%), Positives = 101/241 (41%), Gaps = 28/241 (11%) Frame = +3 Query: 252 RSQRKQHVPVRFKDYVCNVNPLELSARSSPLGMTYPL*SYLAYDNFSDSQKAYLVVLSSQ 431 R R P KDY C++ + A ++P+ +L+YD S S K + + +S Sbjct: 690 RXTRVSKQPSYLKDYHCSL--INSVAHVETHSTSHPIQHFLSYDKLSPSYKLFSLSVSII 747 Query: 432 KKLQTFIVVATSTHWCRAVQKELTALKNNYTWKYQKLSQAIN----------LSNANGFI 581 + +F A W A+ EL AL+ N TW L + A+G I Sbjct: 748 SEPSSFAKAAEIPEWRAAMDCELEALEENKTWSIVSLXVGKHPVGCKWVYKIKHKADGTI 807 Query: 582 A*NTRLMV*KHYKAHLVAEGCSQVEGEDFTETFTPIY*L--------------WRLQK-- 713 + YKA LVA+G +Q EG D+ +TF+P+ L W L + Sbjct: 808 ---------ERYKARLVAKGYTQREGIDYVDTFSPVAKLVTVKLLLAIAAVKGWHLSQLD 858 Query: 714 --DVFTLDRCK*CVLRW*LEGRNIQGTSTWLHSLKARILLSSLYGLRQAGRNFFSKLSSI 887 + F V G N +G S L S +L SLYGL+QA R +FSK S+ Sbjct: 859 VNNAFLHGDLNEEVYMKLPPGYNRKGES--LPSNAVCLLHKSLYGLKQASRQWFSKFSTA 916 Query: 888 L 890 + Sbjct: 917 I 917