BLASTX nr result

ID: Catharanthus23_contig00013643 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00013643
         (914 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|AAD25830.1| putative retroelement pol polyprotein [Arabidopsi...    78   8e-20
gb|EOY00074.1| Cysteine-rich RLK (RECEPTOR-like protein kinase) ...    74   2e-17
gb|AAG10817.1|AC011808_5 Putative retroelement polyprotein [Arab...    63   6e-15
ref|XP_006367551.1| PREDICTED: uncharacterized protein LOC102604...    69   2e-14
emb|CAN73437.1| hypothetical protein VITISV_031733 [Vitis vinifera]    85   4e-14
ref|NP_194047.2| cysteine-rich receptor-like protein kinase 8 [A...    58   7e-14
emb|CAA18463.1| putative protein [Arabidopsis thaliana] gi|72691...    58   7e-14
ref|XP_004234454.1| PREDICTED: uncharacterized protein LOC101244...    82   2e-13
emb|CAN60829.1| hypothetical protein VITISV_012059 [Vitis vinifera]    79   2e-12
ref|XP_004149623.1| PREDICTED: uncharacterized protein LOC101211...    62   6e-12
emb|CAN65820.1| hypothetical protein VITISV_042324 [Vitis vinifera]    77   1e-11
gb|AAC33963.1| contains similarity to reverse transcriptases (Pf...    75   4e-11
emb|CAB40067.1| putative retrotransposon polyprotein [Arabidopsi...    75   4e-11
gb|AAG09097.1|AC009323_8 Putative retroelement polyprotein [Arab...    73   2e-10
gb|AAC67205.1| putative retroelement pol polyprotein [Arabidopsi...    73   2e-10
gb|ABD32333.1| polyprotein-like, putative [Medicago truncatula]        72   2e-10
ref|XP_004509218.1| PREDICTED: uncharacterized protein LOC101501...    72   3e-10
gb|AAD23883.1| putative retroelement pol polyprotein [Arabidopsi...    72   3e-10
gb|AAG51258.1|AC025782_3 Ty1/copia-element polyprotein [Arabidop...    70   8e-10
emb|CAN83990.1| hypothetical protein VITISV_018454 [Vitis vinifera]    69   2e-09

>gb|AAD25830.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
          Length = 1015

 Score = 78.2 bits (191), Expect(2) = 8e-20
 Identities = 51/166 (30%), Positives = 78/166 (46%), Gaps = 10/166 (6%)
 Frame = +3

Query: 222  PQLYASVLVLRSQRKQHVPVRFKDYVCNVNPLELSARSSPLGMTYPL*SYLAYDNFSDSQ 401
            P+   +    RS+R+   P   KDY CN+         S  G+ YPL  Y++YD  S   
Sbjct: 567  PKSVPTTSTSRSKRESKQPAHLKDYFCNL---------SRKGVQYPLSDYMSYDQLSTPY 617

Query: 402  KAYLVVLSSQKKLQTFIVVATSTHWCRAVQKELTALKNNYTWKYQKL---SQAINLS--- 563
            +AY+  ++   +  +F     S  W +A+  EL AL+   TW+   L    +AI      
Sbjct: 618  RAYICSVTKFSEPSSFFQAKKSDDWIKAMNAELQALEGTATWEICSLPSNKKAIGCKWVY 677

Query: 564  ----NANGFIA*NTRLMV*KHYKAHLVAEGCSQVEGEDFTETFTPI 689
                N +G +         + YKA LVA+G +Q EG DF +TF+P+
Sbjct: 678  KVKLNVDGTL---------ERYKARLVAKGYTQQEGVDFEDTFSPV 714



 Score = 46.2 bits (108), Expect(2) = 8e-20
 Identities = 20/39 (51%), Positives = 27/39 (69%)
 Frame = +1

Query: 691 ISFGACKRMFLH*IDVNNVFLDGDLREEIYKELPPGYTP 807
           ++  A K+  LH +D++N FL+ DL EEIY  L PGYTP
Sbjct: 724 LAVAAAKKWSLHQLDISNAFLNRDLYEEIYMNLAPGYTP 762


>gb|EOY00074.1| Cysteine-rich RLK (RECEPTOR-like protein kinase) 8 [Theobroma cacao]
          Length = 1494

 Score = 73.6 bits (179), Expect(2) = 2e-17
 Identities = 51/168 (30%), Positives = 78/168 (46%), Gaps = 16/168 (9%)
 Frame = +3

Query: 234  ASVLVLRSQRKQHVPVRFKDYVCNVNPLELSARSSPL------GMTYPL*SYLAYDNFSD 395
            A   V+  +R + +P +  DY   + P   S+ S+           YPL  +++Y  FS 
Sbjct: 812  ADTSVMTGKRARQIPRKLADYDFVLPPSLTSSSSTHTPTPKANSTVYPLSQFISYSRFSR 871

Query: 396  SQKAYLVVLSSQKKLQTFIVVATSTHWCRAVQKELTALKNNYTWKYQKL---SQAINLS- 563
               A+L  + S  +   F       HW  A+ KE++AL+ N TW   KL    +AI+   
Sbjct: 872  DHNAFLAAILSTDEPTNFHQAIKYAHWQDAMAKEISALEENKTWVLSKLPPGKRAIDSKW 931

Query: 564  ------NANGFIA*NTRLMV*KHYKAHLVAEGCSQVEGEDFTETFTPI 689
                  N +G +         + YKA LVA+G +Q+EG DF ETF P+
Sbjct: 932  VYKIKYNLDGSV---------ERYKARLVAKGYTQIEGVDFHETFAPV 970



 Score = 42.7 bits (99), Expect(2) = 2e-17
 Identities = 17/28 (60%), Positives = 22/28 (78%)
 Frame = +1

Query: 721  LH*IDVNNVFLDGDLREEIYKELPPGYT 804
            LH +DVNN FL GDL EE+Y ++P G+T
Sbjct: 990  LHQLDVNNAFLHGDLNEEVYMKIPQGFT 1017


>gb|AAG10817.1|AC011808_5 Putative retroelement polyprotein [Arabidopsis thaliana]
          Length = 1413

 Score = 62.8 bits (151), Expect(2) = 6e-15
 Identities = 45/156 (28%), Positives = 72/156 (46%), Gaps = 10/156 (6%)
 Frame = +3

Query: 252  RSQRKQHVPVRFKDYVCNVNPLELSARSSPLGMTYPL*SYLAYDNFSDSQKAYLVVLSSQ 431
            ++ R    P   KDY CN      S  SS     +P+   L+Y + SD    ++  ++  
Sbjct: 862  QNSRVSRPPAYLKDYHCN------SVTSST---DHPISEVLSYSSLSDPYMIFINAVNKI 912

Query: 432  KKLQTFIVVATSTHWCRAVQKELTALKNNYTWKYQKL---SQAINLS-------NANGFI 581
             +  T+        WC A+  E+TAL++N TW    L    +A+          NA+G +
Sbjct: 913  PEPHTYAQARQIKEWCDAMGMEITALEDNGTWVVCSLPVGKKAVGCKWVYKIKLNADGSL 972

Query: 582  A*NTRLMV*KHYKAHLVAEGCSQVEGEDFTETFTPI 689
                     + YKA LVA+G +Q EG D+ +TF+P+
Sbjct: 973  ---------ERYKARLVAKGYTQTEGLDYVDTFSPV 999



 Score = 45.1 bits (105), Expect(2) = 6e-15
 Identities = 21/41 (51%), Positives = 27/41 (65%)
 Frame = +1

Query: 685  LYISFGACKRMFLH*IDVNNVFLDGDLREEIYKELPPGYTP 807
            L I+  A K   L  +D++N FL+G L EEIY  LPPGY+P
Sbjct: 1007 LLIAVAAAKGWSLSQLDISNAFLNGSLDEEIYMTLPPGYSP 1047


>ref|XP_006367551.1| PREDICTED: uncharacterized protein LOC102604059 [Solanum tuberosum]
          Length = 1014

 Score = 68.9 bits (167), Expect(2) = 2e-14
 Identities = 50/156 (32%), Positives = 74/156 (47%), Gaps = 10/156 (6%)
 Frame = +3

Query: 252  RSQRKQHVPVRFKDYVCNVNPLELSARSSPLGMTYPL*SYLAYDNFSDSQKAYLVVLSSQ 431
            RS R    P+  KDYV +V  +   A + PL   Y +  YL YDN S + +AY+    + 
Sbjct: 655  RSLRTSSTPLWMKDYVASVQGIP--AHAKPL---YSIDQYLGYDNLSANYQAYMSSFGTD 709

Query: 432  KKLQTFIVVATSTHWCRAVQKELTALKNNYTWKYQKLSQAINL----------SNANGFI 581
             +  +F        W  A+Q E++AL+ N TW+   L     +            ANG I
Sbjct: 710  IEPSSFEEACKDPRWVDAMQAEISALECNNTWQVVPLPCGKTVIGCKWIFKIKYKANGQI 769

Query: 582  A*NTRLMV*KHYKAHLVAEGCSQVEGEDFTETFTPI 689
                     + +KA LVA+G +Q EG D+ ETF+P+
Sbjct: 770  ---------ERFKARLVAKGYNQREGLDYHETFSPV 796



 Score = 37.0 bits (84), Expect(2) = 2e-14
 Identities = 16/37 (43%), Positives = 22/37 (59%)
 Frame = +1

Query: 691 ISFGACKRMFLH*IDVNNVFLDGDLREEIYKELPPGY 801
           ++  A     +H +DV N FL GDL EE+Y  LP G+
Sbjct: 806 LALAAAGNWHVHQMDVYNAFLQGDLYEEVYMTLPQGF 842


>emb|CAN73437.1| hypothetical protein VITISV_031733 [Vitis vinifera]
          Length = 1322

 Score = 84.7 bits (208), Expect = 4e-14
 Identities = 65/231 (28%), Positives = 108/231 (46%), Gaps = 21/231 (9%)
 Frame = +3

Query: 243  LVLRSQRKQHVPVRFKDYVCNV----NPLELSARSSPLGMTYPL*SYLAYDNFSDSQKAY 410
            ++ RSQR  H P+  +DYVCN     N L   + S   G  YPL ++++Y  +S   +++
Sbjct: 766  ILRRSQRPHHPPMALRDYVCNQVTFPNHLPPLSSSPQKGTRYPLCNFVSYHRYSPQHRSF 825

Query: 411  LVVLSSQKKLQTFIVVATSTHWCRAVQKELTALKNNYTWKYQKLSQAIN----------L 560
               +S   +  ++   A+ +HW  A+Q EL AL+ N+TW    L                
Sbjct: 826  TAAVSQDIEPTSYAEAASHSHWQEAMQSELAALEANHTWSLTSLPLGKKPIGCRWVYKIK 885

Query: 561  SNANGFIA*NTRLMV*KHYKAHLVAEGCSQVEGEDFTETFTPIY*LWRLQKDVFTLDRCK 740
             +++G I         + +KA LVA+G +Q+EG D+ +TF+P   +  ++          
Sbjct: 886  RHSDGTI---------ERFKARLVAKGYTQLEGIDYHDTFSPTAKMITVR---------- 926

Query: 741  *CVL------RW*LEGRNIQGTSTWLH-SLKARILLSSLYGLRQAGRNFFS 872
             C+L       W L   ++   + +LH  L   I +S   GLR+ G N FS
Sbjct: 927  -CLLALAAAQNWSLHQLDV--NNAFLHGDLHEEIYMSPPPGLRRQGENLFS 974


>ref|NP_194047.2| cysteine-rich receptor-like protein kinase 8 [Arabidopsis thaliana]
           gi|332659317|gb|AEE84717.1| cysteine-rich receptor-like
           protein kinase 8 [Arabidopsis thaliana]
          Length = 1262

 Score = 58.2 bits (139), Expect(2) = 7e-14
 Identities = 43/156 (27%), Positives = 72/156 (46%), Gaps = 11/156 (7%)
 Frame = +3

Query: 255 SQRKQHVPVRFKDYVCNVNPLELSARSSPLGMT-YPL*SYLAYDNFSDSQKAYLVVLSSQ 431
           S R+   P   +DY C+          S   +T + +  +L+Y+  S    ++LV ++  
Sbjct: 34  SHRRTRKPAYLQDYYCH----------SVASLTIHDISQFLSYEKVSPLYHSFLVCIAKA 83

Query: 432 KKLQTFIVVATSTHWCRAVQKELTALKNNYTWKYQKL---SQAINLS-------NANGFI 581
           K+  T+        WC A+  E+ A++  +TW+   L    + I          N++G I
Sbjct: 84  KEPSTYNEAKEFLVWCGAMDDEIGAMETTHTWEICTLPPNKKPIGCKWVYKIKYNSDGTI 143

Query: 582 A*NTRLMV*KHYKAHLVAEGCSQVEGEDFTETFTPI 689
                    + YKA LVA+G +Q EG DF ETF+P+
Sbjct: 144 ---------ERYKARLVAKGYTQQEGIDFIETFSPV 170



 Score = 46.2 bits (108), Expect(2) = 7e-14
 Identities = 20/39 (51%), Positives = 27/39 (69%)
 Frame = +1

Query: 685 LYISFGACKRMFLH*IDVNNVFLDGDLREEIYKELPPGY 801
           L ++  A     LH +D++N FL+GDL EEIY +LPPGY
Sbjct: 178 LILAISAIYNFTLHQLDISNAFLNGDLDEEIYMKLPPGY 216


>emb|CAA18463.1| putative protein [Arabidopsis thaliana] gi|7269163|emb|CAB79271.1|
           putative protein [Arabidopsis thaliana]
          Length = 1240

 Score = 58.2 bits (139), Expect(2) = 7e-14
 Identities = 43/156 (27%), Positives = 72/156 (46%), Gaps = 11/156 (7%)
 Frame = +3

Query: 255 SQRKQHVPVRFKDYVCNVNPLELSARSSPLGMT-YPL*SYLAYDNFSDSQKAYLVVLSSQ 431
           S R+   P   +DY C+          S   +T + +  +L+Y+  S    ++LV ++  
Sbjct: 34  SHRRTRKPAYLQDYYCH----------SVASLTIHDISQFLSYEKVSPLYHSFLVCIAKA 83

Query: 432 KKLQTFIVVATSTHWCRAVQKELTALKNNYTWKYQKL---SQAINLS-------NANGFI 581
           K+  T+        WC A+  E+ A++  +TW+   L    + I          N++G I
Sbjct: 84  KEPSTYNEAKEFLVWCGAMDDEIGAMETTHTWEICTLPPNKKPIGCKWVYKIKYNSDGTI 143

Query: 582 A*NTRLMV*KHYKAHLVAEGCSQVEGEDFTETFTPI 689
                    + YKA LVA+G +Q EG DF ETF+P+
Sbjct: 144 ---------ERYKARLVAKGYTQQEGIDFIETFSPV 170



 Score = 46.2 bits (108), Expect(2) = 7e-14
 Identities = 20/39 (51%), Positives = 27/39 (69%)
 Frame = +1

Query: 685 LYISFGACKRMFLH*IDVNNVFLDGDLREEIYKELPPGY 801
           L ++  A     LH +D++N FL+GDL EEIY +LPPGY
Sbjct: 178 LILAISAIYNFTLHQLDISNAFLNGDLDEEIYMKLPPGY 216


>ref|XP_004234454.1| PREDICTED: uncharacterized protein LOC101244259 [Solanum
            lycopersicum]
          Length = 1812

 Score = 82.4 bits (202), Expect = 2e-13
 Identities = 76/252 (30%), Positives = 110/252 (43%), Gaps = 29/252 (11%)
 Frame = +3

Query: 222  PQLYASVLVLRSQRKQHVPVRFKDYVCNVNPLELSARSSPLGMTYPL*SYLAYDNFSDSQ 401
            P  + S  + RS R  H P+   DYV          R +P    YPL +Y++Y N S S 
Sbjct: 845  PVSHTSAPIRRSNRHSHPPLWLADYV---------TRPAPTSTLYPLSNYVSYTNLSSSH 895

Query: 402  KAYLVVLSSQKKLQTFIVVATSTHWCRAVQKELTALKNNYTWKYQKLSQA---------- 551
            + YL V S+  +  T+      + W  A+Q E+ AL +N+TW+   L             
Sbjct: 896  QHYLGVFSAIIEPSTYQEAIKDSRWIDAMQSEIQALHDNHTWELVPLPPGKVPIGCRWVY 955

Query: 552  -INLSNANGFIA*NTRLMV*KHYKAHLVAEGCSQVEGEDFTETFTPIY*L---------- 698
             + L  +NG I         + +K  LVA+G +Q EG DF ETF+P+  +          
Sbjct: 956  KVKL-KSNGDI---------ERFKTRLVAKGYTQKEGLDFHETFSPVVKMTTVRTVLSLA 1005

Query: 699  ----WRLQK----DVFTLDRCK*CVLRW*LEGRNIQGTSTWLHSLKARILLSSLYGLRQA 854
                W + +    +VF        V     EG + QG S   + L  R L+ SLYGL+QA
Sbjct: 1006 AQFNWHIHQLDVYNVFLHSDLHDEVYMQLPEGFSSQGES---NGLVCR-LVKSLYGLKQA 1061

Query: 855  GRNFFSKLSSIL 890
             R +  KL   L
Sbjct: 1062 SRQWNLKLCEAL 1073


>emb|CAN60829.1| hypothetical protein VITISV_012059 [Vitis vinifera]
          Length = 1128

 Score = 79.0 bits (193), Expect = 2e-12
 Identities = 73/253 (28%), Positives = 120/253 (47%), Gaps = 35/253 (13%)
 Frame = +3

Query: 252  RSQRKQHVPVRFKDYVC-NVNPLELSARS----SPLGMTYPL*SYLAYDNFSDSQKAYLV 416
            RS+R +H+P   ++Y C N+  ++ + ++    S  G  Y + S+L+    S   KA++ 
Sbjct: 549  RSERTKHLPKYLQNYYCGNMTKIDSATQAPSSCSSSGKPYCIFSFLSDSRLSSKHKAFIY 608

Query: 417  VLSSQKKLQTFIVVATSTHWCRAVQKELTALKNNYTWKYQKL---SQAINLS-------N 566
            V+SS  + +T+    +  HW  A+  E+ ALK+N TW    L     AI           
Sbjct: 609  VISSTFEPKTYKQXVSIPHWQTAMTDEIKALKHNKTWDLAILPPNKTAIGCKWVYRVKFK 668

Query: 567  ANGFIA*NTRLMV*KHYKAHLVAEGCSQVEGEDFTETFTPIY*LWRLQKDVFTLDRCK*C 746
            A+G +         + YKA LVA+G +Q EG DF +T++P+  +  + + +  +   K  
Sbjct: 669  ADGSV---------ERYKARLVAKGYTQQEGLDFFDTYSPVAKMTTV-RVLLAIAAAK-- 716

Query: 747  VLRW*LEGRNIQGTSTWLH-SLKARI------------------LLSSLYGLRQAGRNFF 869
              +W L   ++   + +LH  L   +                  L  SLYGLRQA R ++
Sbjct: 717  --QWYLHQLDV--NNAFLHGDLNEEVYMQLPLGFSTPNDPRVCKLKKSLYGLRQASRQWY 772

Query: 870  SKL-SSILSFDLS 905
            SKL SS+L F  S
Sbjct: 773  SKLSSSLLKFGFS 785


>ref|XP_004149623.1| PREDICTED: uncharacterized protein LOC101211618 [Cucumis sativus]
          Length = 2085

 Score = 62.4 bits (150), Expect(2) = 6e-12
 Identities = 44/152 (28%), Positives = 75/152 (49%), Gaps = 2/152 (1%)
 Frame = +3

Query: 240  VLVLRSQRKQHVPVRFKDYVCNVNPLELSARSSPLGMTYPL*SYLAYDNFSDSQKAYLVV 419
            ++  +S R  H P   KD+ CN+     S  S+P    +PL  YL+Y+ +S   K Y+  
Sbjct: 727  IMTRKSSRPHHPPSYLKDFYCNLT----SQNSTP----FPLNQYLSYNAYSQHHKNYMFN 778

Query: 420  LSSQKKLQTFIVVATSTH-WCRAVQKELTALKNNYTWKYQKLSQAINLSNANGFIA*NTR 596
            ++S  +  T+   A   H W +A+ +E+ A++   TW    + +  +   +        +
Sbjct: 779  VTSIYE-PTYYHQAVKHHTWRKAMAEEIEAMERTNTWTIVSIPKDHHTVGSKWVYKVKCK 837

Query: 597  L-MV*KHYKAHLVAEGCSQVEGEDFTETFTPI 689
                   YKA LVA+G +Q EG DF +TF+P+
Sbjct: 838  PDGTIDRYKARLVAKGYNQQEGIDFLDTFSPV 869



 Score = 35.4 bits (80), Expect(2) = 6e-12
 Identities = 14/24 (58%), Positives = 19/24 (79%)
 Frame = +1

Query: 730 IDVNNVFLDGDLREEIYKELPPGY 801
           +D+NN FL+GDL EE++  LP GY
Sbjct: 892 MDINNAFLNGDLFEEVHMTLPLGY 915


>emb|CAN65820.1| hypothetical protein VITISV_042324 [Vitis vinifera]
          Length = 1262

 Score = 76.6 bits (187), Expect = 1e-11
 Identities = 70/253 (27%), Positives = 119/253 (47%), Gaps = 35/253 (13%)
 Frame = +3

Query: 252  RSQRKQHVPVRFKDYVC-NVNPLELSARS----SPLGMTYPL*SYLAYDNFSDSQKAYLV 416
            RS+R +H+P   ++Y C N+  ++L+ ++    S  G  Y + S+L+    S   KA++ 
Sbjct: 855  RSERTKHLPKYLQNYYCGNMTKIDLATQAPSSCSSSGKPYYIFSFLSDSKLSSKHKAFIS 914

Query: 417  VLSSQKKLQTFIVVATSTHWCRAVQKELTALKNNYTWKYQKL---SQAINLS-------N 566
            ++SS  + +T+    +  HW  A+  E+ AL++N TW    L      I           
Sbjct: 915  IISSTFEPKTYKQAVSIPHWKTAMTDEIKALEHNKTWDLAILPPNKTTIGCKWVYQVKFK 974

Query: 567  ANGFIA*NTRLMV*KHYKAHLVAEGCSQVEGEDFTETFTPIY*LWRLQKDVFTLDRCK*C 746
            A+G +         + YKA LVA+G +Q EG DF +T++P+  +  + + +  +   K  
Sbjct: 975  ADGSV---------ERYKARLVAKGYTQQEGLDFFDTYSPVAKMTTV-RVLLAIAATK-- 1022

Query: 747  VLRW*LEGRNIQGTSTWLH-------------------SLKARILLSSLYGLRQAGRNFF 869
              +W L   ++   + +LH                     +   L  SLYGLRQA R ++
Sbjct: 1023 --QWYLHQLDV--NNAFLHEDLNEDVYMQLPPGFSTPNDPRVCKLKKSLYGLRQASRQWY 1078

Query: 870  SKL-SSILSFDLS 905
            SKL SS+L F  S
Sbjct: 1079 SKLSSSLLKFGFS 1091


>gb|AAC33963.1| contains similarity to reverse transcriptases (Pfam; rvt.hmm, score:
            11.19) [Arabidopsis thaliana]
          Length = 1633

 Score = 74.7 bits (182), Expect = 4e-11
 Identities = 73/263 (27%), Positives = 109/263 (41%), Gaps = 46/263 (17%)
 Frame = +3

Query: 237  SVLVLRSQRKQHVPVRFKDYVCNVNP------------LELSARSSP---LGMTYPL*SY 371
            SV + R +R    P    +Y CN  P            +E  + S P   +   YP+ + 
Sbjct: 857  SVPIARPKRNAKAPAYLSEYHCNSVPFLSSLSPTTSTSIETPSSSIPPKKITTPYPMSTA 916

Query: 372  LAYDNFSDSQKAYLVVLSSQKKLQTFIVVATSTHWCRAVQKELTALKNNYTWKYQKLSQA 551
            ++YD  +    +Y+   + + + + F     S  W RA  +EL AL+ N TW  + L++ 
Sbjct: 917  ISYDKLTPLFHSYICAYNVETEPKAFTQAMKSEKWTRAANEELHALEQNKTWIVESLTEG 976

Query: 552  INL----------SNANGFIA*NTRLMV*KHYKAHLVAEGCSQVEGEDFTETFTPIY*L- 698
             N+           N +G I         + YKA LVA+G +Q EG D+ ETF+P+    
Sbjct: 977  KNVVGCKWVFTIKYNPDGSI---------ERYKARLVAQGFTQQEGIDYMETFSPVAKFG 1027

Query: 699  -------------WRL-QKDVFT------LDRCK*CVLRW*LEGRNIQGTSTWLHSLKAR 818
                         W L Q DV        LD      +   L       T   L S    
Sbjct: 1028 SVKLLLGLAAATGWSLTQMDVSNAFLHGELDE----EIYMSLPQGYTPPTGISLPSKPVC 1083

Query: 819  ILLSSLYGLRQAGRNFFSKLSSI 887
             LL SLYGL+QA R ++ +LSS+
Sbjct: 1084 RLLKSLYGLKQASRQWYKRLSSV 1106


>emb|CAB40067.1| putative retrotransposon polyprotein [Arabidopsis thaliana]
            gi|7267797|emb|CAB81200.1| putative retrotransposon
            polyprotein [Arabidopsis thaliana]
          Length = 1203

 Score = 74.7 bits (182), Expect = 4e-11
 Identities = 73/263 (27%), Positives = 109/263 (41%), Gaps = 46/263 (17%)
 Frame = +3

Query: 237  SVLVLRSQRKQHVPVRFKDYVCNVNP------------LELSARSSP---LGMTYPL*SY 371
            SV + R +R    P    +Y CN  P            +E  + S P   +   YP+ + 
Sbjct: 443  SVPIARPKRNAKAPAYLSEYHCNSVPFLSSLSPTTSTSIETPSSSIPPKKITTPYPMSTA 502

Query: 372  LAYDNFSDSQKAYLVVLSSQKKLQTFIVVATSTHWCRAVQKELTALKNNYTWKYQKLSQA 551
            ++YD  +    +Y+   + + + + F     S  W RA  +EL AL+ N TW  + L++ 
Sbjct: 503  ISYDKLTPLFHSYICAYNVETEPKAFTQAMKSEKWTRAANEELHALEQNKTWIVESLTEG 562

Query: 552  INL----------SNANGFIA*NTRLMV*KHYKAHLVAEGCSQVEGEDFTETFTPIY*L- 698
             N+           N +G I         + YKA LVA+G +Q EG D+ ETF+P+    
Sbjct: 563  KNVVGCKWVFTIKYNPDGSI---------ERYKARLVAQGFTQQEGIDYMETFSPVAKFG 613

Query: 699  -------------WRL-QKDVFT------LDRCK*CVLRW*LEGRNIQGTSTWLHSLKAR 818
                         W L Q DV        LD      +   L       T   L S    
Sbjct: 614  SVKLLLGLAAATGWSLTQMDVSNAFLHGELDE----EIYMSLPQGYTPPTGISLPSKPVC 669

Query: 819  ILLSSLYGLRQAGRNFFSKLSSI 887
             LL SLYGL+QA R ++ +LSS+
Sbjct: 670  RLLKSLYGLKQASRQWYKRLSSV 692


>gb|AAG09097.1|AC009323_8 Putative retroelement polyprotein [Arabidopsis thaliana]
          Length = 1486

 Score = 72.8 bits (177), Expect = 2e-10
 Identities = 65/239 (27%), Positives = 104/239 (43%), Gaps = 23/239 (9%)
 Frame = +3

Query: 243  LVLRSQRKQHVPVRFKDYVCNVNPLELSARSSPLGMTYPL*SYLAYDNFSDSQKAYLVVL 422
            L+ +  R +  PV+  DYV       L  +  P    YPL +Y++   FSD+ +AY++ +
Sbjct: 917  LLGKGHRPKRPPVKLADYVTT-----LLHQPFPSATPYPLDNYISSSRFSDNYQAYILAI 971

Query: 423  SSQKKLQTFIVVATSTHWCRAVQKELTALKNNYTWKYQKLSQAINLSNANGFIA*NTRL- 599
            +S  + + +       HW  AV  E+ +L+N  TW  + L                 +  
Sbjct: 972  TSGNEPRNYNEAMLDDHWKGAVSHEIGSLENLGTWTVEDLPPGKKALGCKWVFRLKYKSD 1031

Query: 600  MV*KHYKAHLVAEGCSQVEGEDFTETFTPIY*LWRLQ---KDVFTLDRCK*CVLRW*LEG 770
               + +KA LV  G +Q EG D+TETF P+  +  ++   + V +LD        W  E 
Sbjct: 1032 GTLERHKARLVVLGNNQTEGLDYTETFAPVAKMVTVRAFLQQVVSLD--------W--EV 1081

Query: 771  RNIQGTSTWLH-------------------SLKARILLSSLYGLRQAGRNFFSKLSSIL 890
              +   + +LH                     K   L  SLYGL+QA R +F+KL+S L
Sbjct: 1082 HQMDVHNAFLHGDLDEEVYMQFPPGFRTGDKTKVCRLRKSLYGLKQAPRCWFAKLTSAL 1140


>gb|AAC67205.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
          Length = 1413

 Score = 72.8 bits (177), Expect = 2e-10
 Identities = 74/248 (29%), Positives = 110/248 (44%), Gaps = 35/248 (14%)
 Frame = +3

Query: 252  RSQRKQHVPVRFKDYV-----CNVN------PLELSARSSPLG-MTYPL*SYLAYDNFSD 395
            + +R+   P R KDY+     C  N      P    + SS  G + YPL  Y++ + FS 
Sbjct: 908  QGKRQVQQPARLKDYILYNASCTPNTPHVLSPSTSQSSSSIQGNLQYPLTDYISDECFSA 967

Query: 396  SQKAYLVVLSSQKKLQTFIVVATSTHWCRAVQKELTALKNNYTWKYQKL--------SQA 551
              K +L  +++  + + F        W  A+ KE+ AL+ N TW    L        SQ 
Sbjct: 968  GHKVFLAAITANDEPKHFKEDVKVKVWNDAMYKEVDALEVNKTWDIVDLPTGKVAIGSQW 1027

Query: 552  INLS--NANGFIA*NTRLMV*KHYKAHLVAEGCSQVEGEDFTETFTPIY*LWRL------ 707
            +  +  NA+G +         + YKA LV +G +Q+EGED+TETF P+  +  +      
Sbjct: 1028 VYKTKFNADGTV---------ERYKARLVVQGNNQIEGEDYTETFAPVVKMTTVRTLLRL 1078

Query: 708  ----QKDVFTLDRCK*CVLRW*LEGR---NIQGTSTWLHSLKARILLSSLYGLRQAGRNF 866
                Q +V+ +D      L   LE      +       H  K   L  SLYGL+QA R +
Sbjct: 1079 VAANQWEVYQMD-VHNAFLHGDLEEEVYMKLPPGFRHSHPDKVCRLRKSLYGLKQAPRCW 1137

Query: 867  FSKLSSIL 890
            F KLS  L
Sbjct: 1138 FKKLSDAL 1145


>gb|ABD32333.1| polyprotein-like, putative [Medicago truncatula]
          Length = 635

 Score = 72.4 bits (176), Expect = 2e-10
 Identities = 61/240 (25%), Positives = 109/240 (45%), Gaps = 28/240 (11%)
 Frame = +3

Query: 255 SQRKQHVPVRFKDYVCNVNPLELSARSSPLGMTYPL*SYLAYDNFSDSQKAYLVVLSSQK 434
           S R +  P   +DY+CN +   +S+ +    + YPL +++++ + S+SQ  + + L S  
Sbjct: 40  SSRTKKSPSYLQDYICNPSTNSVSSANKSC-ILYPLSNFISHKHLSNSQHTFALSLVSHI 98

Query: 435 KLQTFIVVATSTHWCRAVQKELTALKNNYTWKYQKLSQAIN----------LSNANGFIA 584
           + +++     S  W +A+Q EL AL    TW    +   +             N +G I 
Sbjct: 99  EPKSYAEAIKSDCWKQAMQLELNALDQTGTWTVVDIPSQVKPIGCKWVCRIKYNDDGSI- 157

Query: 585 *NTRLMV*KHYKAHLVAEGCSQVEGEDFTETFTPIY*LWRLQKDVFTLDRCK*CVLRW*L 764
                   + YKA LVA+G +Q+EG D+ +TF+P+      +  +  L      +  W L
Sbjct: 158 --------ERYKARLVAKGYNQIEGLDYFDTFSPV-----AKITIVRLVIALASINHWFL 204

Query: 765 EGRNIQ------------------GTSTWLHSLKARILLSSLYGLRQAGRNFFSKLSSIL 890
              ++                   G ST+  +   + L  SLYGL+QA R ++ KL+++L
Sbjct: 205 HQLDVNNAFLHGDLQENVYKKIPPGLSTFKPNQVCK-LSKSLYGLKQASRKWYEKLTTLL 263


>ref|XP_004509218.1| PREDICTED: uncharacterized protein LOC101501009 [Cicer arietinum]
          Length = 751

 Score = 72.0 bits (175), Expect = 3e-10
 Identities = 68/248 (27%), Positives = 111/248 (44%), Gaps = 33/248 (13%)
 Frame = +3

Query: 252 RSQRKQHVPVRFKDYVCNVNPLELSARSSPL------GMTYPL*SYLAYDNFSDSQKAYL 413
           RS R    P    D+ C++  L+ S  ++ L      G  YPL ++++YDN S + K + 
Sbjct: 301 RSGRTIKPPSYLTDFHCSL--LQGSINNNILVPNQFKGTPYPLSTFISYDNLSSAHKFFT 358

Query: 414 VVLSSQKKLQTFIVVATSTHWCRAVQKELTALKNNYTWKYQKLSQAINL----------S 563
           + +S+ K+  ++      ++W  A+  EL +L NN TW+   L     +           
Sbjct: 359 INVSTLKEPSSYSEAIKDSNWRLAIDSELRSLLNNNTWELTTLPSDKKVIGCKWVFKLKF 418

Query: 564 NANGFIA*NTRLMV*KHYKAHLVAEGCSQVEGEDFTETFTPIY*LWRLQK---------- 713
           +ANG I         + YKA LVA+G +Q EG D+ +TF+P+  +  ++           
Sbjct: 419 HANGTI---------ERYKARLVAKGFNQTEGLDYLDTFSPVVKMTTIRLLLSIAAIKNW 469

Query: 714 DVFTLDRCK*CVLRW*LEGRNIQGTSTWL-------HSLKARILLSSLYGLRQAGRNFFS 872
            +F LD     +    L G  I+     +       H  +   L  SLYGL+QA R +  
Sbjct: 470 FLFQLD-----INTAFLHGDLIEDVYMKIPPGLHVQHKSQVCKLKRSLYGLKQASRQWNM 524

Query: 873 KLSSILSF 896
           KL S+  F
Sbjct: 525 KLCSLKQF 532


>gb|AAD23883.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
          Length = 1156

 Score = 72.0 bits (175), Expect = 3e-10
 Identities = 77/249 (30%), Positives = 113/249 (45%), Gaps = 38/249 (15%)
 Frame = +3

Query: 258  QRKQHV--PVRFKDYV---CNVNPL------ELSARSSPL----GMTYPL*SYLAYDNFS 392
            QRK+ +   VR +DYV     V+P+      + S++SS +       YPL  Y++ D FS
Sbjct: 564  QRKRQIRQSVRLQDYVLYNATVSPINPHALPDSSSQSSSMVQGTSSLYPLSDYVSDDCFS 623

Query: 393  DSQKAYLVVLSSQKKLQTFIVVATSTHWCRAVQKELTALKNNYTWKYQKL--------SQ 548
               KA+L  +++  + + F        W  A+ KE+ AL+ N TW    L        SQ
Sbjct: 624  AGHKAFLAAITANDEPKHFKEAVRIKVWNDAMFKEVDALEINKTWDIVDLPPGKVAIGSQ 683

Query: 549  AINLS--NANGFIA*NTRLMV*KHYKAHLVAEGCSQVEGEDFTETFTPIY*LWRL----- 707
             +  +  NA+G I         + YKA LV +G  QVEGED+ ETF P+  +  +     
Sbjct: 684  WVYKTKYNADGSI---------ERYKARLVVQGNKQVEGEDYNETFAPVVKMTTVRTLLR 734

Query: 708  -----QKDVFTLDRCK*CVLRW*LEGR---NIQGTSTWLHSLKARILLSSLYGLRQAGRN 863
                 Q +V+ +D      L   L+      +       H  K   L  SLYGL+QA R 
Sbjct: 735  LVAANQWEVYQMD-VNNAFLHGDLDEEVYMKLPPGFRHSHPDKVCRLRKSLYGLKQAPRC 793

Query: 864  FFSKLSSIL 890
            +F KLS  L
Sbjct: 794  WFKKLSDAL 802


>gb|AAG51258.1|AC025782_3 Ty1/copia-element polyprotein [Arabidopsis thaliana]
          Length = 1152

 Score = 70.5 bits (171), Expect = 8e-10
 Identities = 75/247 (30%), Positives = 107/247 (43%), Gaps = 32/247 (12%)
 Frame = +3

Query: 252  RSQRKQHVPVRFKDYVCNVNPLELSAR-SSPLGMT-YPL*SYLAYDNFSDSQKAYLVVLS 425
            R  R++   VR KDY       E +   S  +G   YP+ +Y++ + FS S + +L  +S
Sbjct: 871  RGLRQRQENVRLKDYQTYSAQCESTQTLSDNIGTCIYPMANYVSGEIFSPSNQHFLAAIS 930

Query: 426  SQKKLQTFIVVATSTHWCRAVQKELTALKNNYTWKYQKLSQAINL----------SNANG 575
                 QT+        W  AV  E+ AL++  TW   KL Q +             N+NG
Sbjct: 931  MVDPPQTYNQAIREKEWRNAVFFEVDALEDQGTWDITKLPQGVKAIGSKWVFRIKYNSNG 990

Query: 576  FIA*NTRLMV*KHYKAHLVAEGCSQVEGEDFTETFTPIY*LWRLQKDVFTLDRCK*CVLR 755
             +         + YKA LVA G  Q EG DFT+TF P+    ++Q     LD        
Sbjct: 991  TV---------ERYKARLVALGNHQKEGIDFTKTFAPVV---KMQTVRLLLDVA--AAKD 1036

Query: 756  W*LEGRNIQGTSTWLH-SLKARI------------------LLSSLYGLRQAGRNFFSKL 878
            W L   ++   + +LH  LK  I                  L  S+YGL+QA R +F KL
Sbjct: 1037 WELHQMDVH--NAFLHGDLKEDIYMKPPPGFKTTDPSLVCKLKKSIYGLKQAPRCWFEKL 1094

Query: 879  S-SILSF 896
            S S+L F
Sbjct: 1095 STSLLKF 1101


>emb|CAN83990.1| hypothetical protein VITISV_018454 [Vitis vinifera]
          Length = 1243

 Score = 69.3 bits (168), Expect = 2e-09
 Identities = 69/241 (28%), Positives = 101/241 (41%), Gaps = 28/241 (11%)
 Frame = +3

Query: 252  RSQRKQHVPVRFKDYVCNVNPLELSARSSPLGMTYPL*SYLAYDNFSDSQKAYLVVLSSQ 431
            R  R    P   KDY C++  +   A       ++P+  +L+YD  S S K + + +S  
Sbjct: 690  RXTRVSKQPSYLKDYHCSL--INSVAHVETHSTSHPIQHFLSYDKLSPSYKLFSLSVSII 747

Query: 432  KKLQTFIVVATSTHWCRAVQKELTALKNNYTWKYQKLSQAIN----------LSNANGFI 581
             +  +F   A    W  A+  EL AL+ N TW    L    +             A+G I
Sbjct: 748  SEPSSFAKAAEIPEWRAAMDCELEALEENKTWSIVSLXVGKHPVGCKWVYKIKHKADGTI 807

Query: 582  A*NTRLMV*KHYKAHLVAEGCSQVEGEDFTETFTPIY*L--------------WRLQK-- 713
                     + YKA LVA+G +Q EG D+ +TF+P+  L              W L +  
Sbjct: 808  ---------ERYKARLVAKGYTQREGIDYVDTFSPVAKLVTVKLLLAIAAVKGWHLSQLD 858

Query: 714  --DVFTLDRCK*CVLRW*LEGRNIQGTSTWLHSLKARILLSSLYGLRQAGRNFFSKLSSI 887
              + F        V      G N +G S  L S    +L  SLYGL+QA R +FSK S+ 
Sbjct: 859  VNNAFLHGDLNEEVYMKLPPGYNRKGES--LPSNAVCLLHKSLYGLKQASRQWFSKFSTA 916

Query: 888  L 890
            +
Sbjct: 917  I 917


Top