BLASTX nr result

ID: Catharanthus23_contig00034228 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00034228
         (771 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004233645.1| PREDICTED: uncharacterized protein LOC101257...   320   4e-85
ref|XP_006338249.1| PREDICTED: uncharacterized protein LOC102601...   317   3e-84
ref|XP_006338248.1| PREDICTED: uncharacterized protein LOC102601...   317   3e-84
emb|CBI40980.3| unnamed protein product [Vitis vinifera]              290   3e-76
ref|XP_006466676.1| PREDICTED: uncharacterized protein LOC102617...   283   3e-74
ref|XP_006425795.1| hypothetical protein CICLE_v10024678mg [Citr...   283   3e-74
gb|EPS71292.1| hypothetical protein M569_03462, partial [Genlise...   270   5e-70
gb|EOX91261.1| Vacuolar protein sorting-associated protein 13C, ...   266   6e-69
ref|XP_004301869.1| PREDICTED: uncharacterized protein LOC101304...   266   6e-69
ref|XP_002877744.1| hypothetical protein ARALYDRAFT_485391 [Arab...   249   9e-64
ref|XP_006405272.1| hypothetical protein EUTSA_v10027614mg [Eutr...   248   1e-63
ref|XP_006293179.1| hypothetical protein CARUB_v10019496mg [Caps...   245   1e-62
ref|NP_190607.2| uncharacterized protein [Arabidopsis thaliana] ...   242   1e-61
emb|CAB62317.1| putative protein [Arabidopsis thaliana]               242   1e-61
gb|ESW27979.1| hypothetical protein PHAVU_003G249100g [Phaseolus...   240   3e-61
ref|XP_006598717.1| PREDICTED: uncharacterized protein LOC100527...   237   3e-60
gb|EXB37240.1| hypothetical protein L484_020299 [Morus notabilis]     226   9e-57
gb|EEE53527.1| hypothetical protein OsJ_36721 [Oryza sativa Japo...   214   3e-53
gb|EEC69601.1| hypothetical protein OsI_38957 [Oryza sativa Indi...   214   3e-53
gb|EMS50104.1| Retrovirus-related Pol polyprotein LINE-1 [Tritic...   201   3e-49

>ref|XP_004233645.1| PREDICTED: uncharacterized protein LOC101257436 [Solanum
            lycopersicum]
          Length = 3178

 Score =  320 bits (819), Expect = 4e-85
 Identities = 162/265 (61%), Positives = 202/265 (76%), Gaps = 12/265 (4%)
 Frame = +3

Query: 3    ELVHPLDICIFYRYRFLVQDLENAFPGVRGHLHARIKELGISLTELSLDILLFVIGKLKL 182
            +L+HPL+I +FYRY FL Q  EN+   V GH +ARIKEL +++TELSLDI+LF+IGKL L
Sbjct: 1731 DLMHPLEIDVFYRYTFLNQGPENSILWVPGHFYARIKELSMTITELSLDIILFIIGKLNL 1790

Query: 183  AGPFAVRNSAILANCCKIENRLGLTLHCQFFDNQYASVAGQQSATIFLRHLALANQPPEA 362
            AGP+AV++S ILANCCK+EN+ GLTL CQF+DNQ  SVAG+Q+ TIFLRH+ALAN+PPEA
Sbjct: 1791 AGPYAVKDSTILANCCKVENQSGLTLVCQFYDNQDVSVAGRQATTIFLRHMALANRPPEA 1850

Query: 363  SVLSIRLADQGALMTSPLHFPLLEARAFAWRTRVVSAQESKTSPGPFIVVEVARRTQEGL 542
            S  SI+L ++G L TS LH  LLE ++FAWR R+VS QESKT PGPF+V EV+  T++ L
Sbjct: 1851 SFFSIQLIERGLLSTSLLHLSLLETQSFAWRPRIVSLQESKTYPGPFLVAEVSPGTEDYL 1910

Query: 543  SITVSPLLRIHNETDFSVELRFQRPQQRESDFASLLVEPGECVDDSVAAFGANNLS---- 710
            SI VSPLLRIHN T F +ELRFQRPQ +E D+AS+ +E G+ +DDS+ AF A NLS    
Sbjct: 1911 SIGVSPLLRIHNNTKFPMELRFQRPQHKEIDYASVRLEAGDTIDDSMTAFSAINLSGGRK 1970

Query: 711  --------GNFLFSFRPRITDEQLN 761
                    GNFL SFRP +TD   N
Sbjct: 1971 KTLNSLSVGNFLLSFRPEVTDVLTN 1995


>ref|XP_006338249.1| PREDICTED: uncharacterized protein LOC102601421 isoform X2 [Solanum
            tuberosum]
          Length = 2549

 Score =  317 bits (812), Expect = 3e-84
 Identities = 160/265 (60%), Positives = 200/265 (75%), Gaps = 12/265 (4%)
 Frame = +3

Query: 3    ELVHPLDICIFYRYRFLVQDLENAFPGVRGHLHARIKELGISLTELSLDILLFVIGKLKL 182
            +L+HPL+I +FYRY FL Q  EN    V GH +ARIKEL +++TELSLDI+LF+IGKL  
Sbjct: 1737 DLMHPLEIDVFYRYTFLNQGPENIILWVPGHFYARIKELSMTITELSLDIILFIIGKLNF 1796

Query: 183  AGPFAVRNSAILANCCKIENRLGLTLHCQFFDNQYASVAGQQSATIFLRHLALANQPPEA 362
            AGP+AV++S ILANCCK+EN+ GLTL CQF+DNQ  SVAG+ + TIFLRH+ALAN+PPEA
Sbjct: 1797 AGPYAVKDSTILANCCKVENQSGLTLVCQFYDNQDVSVAGRHATTIFLRHMALANRPPEA 1856

Query: 363  SVLSIRLADQGALMTSPLHFPLLEARAFAWRTRVVSAQESKTSPGPFIVVEVARRTQEGL 542
            S  SI+L ++G L TS LH  LLE ++FAWR R+VS QESKT PGPF+V EV+  T++ L
Sbjct: 1857 SFFSIQLIERGLLSTSLLHLSLLETQSFAWRPRIVSLQESKTYPGPFLVAEVSPGTEDYL 1916

Query: 543  SITVSPLLRIHNETDFSVELRFQRPQQRESDFASLLVEPGECVDDSVAAFGANNLS---- 710
            SI VSPLLRIHN+T F +ELRFQRPQ +E D+AS+ +E G+ +DDS+ AF A NLS    
Sbjct: 1917 SIVVSPLLRIHNDTKFPMELRFQRPQHKEIDYASVRLEAGDTIDDSMTAFSAINLSGGRK 1976

Query: 711  --------GNFLFSFRPRITDEQLN 761
                    GNFL SFRP +TD   N
Sbjct: 1977 KTLNSLSVGNFLLSFRPEVTDVLTN 2001


>ref|XP_006338248.1| PREDICTED: uncharacterized protein LOC102601421 isoform X1 [Solanum
            tuberosum]
          Length = 3185

 Score =  317 bits (812), Expect = 3e-84
 Identities = 160/265 (60%), Positives = 200/265 (75%), Gaps = 12/265 (4%)
 Frame = +3

Query: 3    ELVHPLDICIFYRYRFLVQDLENAFPGVRGHLHARIKELGISLTELSLDILLFVIGKLKL 182
            +L+HPL+I +FYRY FL Q  EN    V GH +ARIKEL +++TELSLDI+LF+IGKL  
Sbjct: 1737 DLMHPLEIDVFYRYTFLNQGPENIILWVPGHFYARIKELSMTITELSLDIILFIIGKLNF 1796

Query: 183  AGPFAVRNSAILANCCKIENRLGLTLHCQFFDNQYASVAGQQSATIFLRHLALANQPPEA 362
            AGP+AV++S ILANCCK+EN+ GLTL CQF+DNQ  SVAG+ + TIFLRH+ALAN+PPEA
Sbjct: 1797 AGPYAVKDSTILANCCKVENQSGLTLVCQFYDNQDVSVAGRHATTIFLRHMALANRPPEA 1856

Query: 363  SVLSIRLADQGALMTSPLHFPLLEARAFAWRTRVVSAQESKTSPGPFIVVEVARRTQEGL 542
            S  SI+L ++G L TS LH  LLE ++FAWR R+VS QESKT PGPF+V EV+  T++ L
Sbjct: 1857 SFFSIQLIERGLLSTSLLHLSLLETQSFAWRPRIVSLQESKTYPGPFLVAEVSPGTEDYL 1916

Query: 543  SITVSPLLRIHNETDFSVELRFQRPQQRESDFASLLVEPGECVDDSVAAFGANNLS---- 710
            SI VSPLLRIHN+T F +ELRFQRPQ +E D+AS+ +E G+ +DDS+ AF A NLS    
Sbjct: 1917 SIVVSPLLRIHNDTKFPMELRFQRPQHKEIDYASVRLEAGDTIDDSMTAFSAINLSGGRK 1976

Query: 711  --------GNFLFSFRPRITDEQLN 761
                    GNFL SFRP +TD   N
Sbjct: 1977 KTLNSLSVGNFLLSFRPEVTDVLTN 2001


>emb|CBI40980.3| unnamed protein product [Vitis vinifera]
          Length = 2083

 Score =  290 bits (743), Expect = 3e-76
 Identities = 152/268 (56%), Positives = 201/268 (75%), Gaps = 13/268 (4%)
 Frame = +3

Query: 3    ELVHPLDICIFYRYRFLVQDLENAFPGVRGHLHARIKELGISLTELSLDILLFVIGKLKL 182
            ELVHP++ICIFYR  F ++  E     V  H + R KE+ ISLTE+SLDILLFVIGKL L
Sbjct: 619  ELVHPVEICIFYRSSFQIEGSEIVSQSVPMHFYFRCKEVEISLTEVSLDILLFVIGKLNL 678

Query: 183  AGPFAVRNSAILANCCKIENRLGLTLHCQFFDNQYASVAGQQSATIFLRHLALANQPPE- 359
            AGPF+V+ S ILA+CCK+EN+ GL L  ++ D+Q  S+A +QSA+IFLRHLA A+Q PE 
Sbjct: 679  AGPFSVKTSMILAHCCKVENQSGLNLLFRYQDDQGLSIARKQSASIFLRHLASADQSPEN 738

Query: 360  ASVLSIRLADQGALMTSPLHFPLLEARAFAWRTRVVSAQESKTSPGPFIVVEVARRTQEG 539
            AS  SI+L+  G+  TSP+H  L + +  AWRTR+VS Q+SKT PGPFIVV+++R++++G
Sbjct: 739  ASFASIQLSWFGSFSTSPIHLSLSKTQVLAWRTRIVSLQDSKTYPGPFIVVDISRKSEDG 798

Query: 540  LSITVSPLLRIHNETDFSVELRFQRPQQRESDFASLLVEPGECVDDSVAAFGANNLS--- 710
            LS+ VSPL+RIHNET FS+ LRFQRPQQ E++FAS+L++ G+ +DDS+AAF + N+S   
Sbjct: 799  LSVVVSPLIRIHNETTFSMALRFQRPQQVETEFASVLLKTGDTIDDSMAAFDSINVSGGL 858

Query: 711  ---------GNFLFSFRPRITDEQLNSK 767
                     GNFLFSFRP ITD+  +SK
Sbjct: 859  KKALLSLSVGNFLFSFRPEITDDLGSSK 886


>ref|XP_006466676.1| PREDICTED: uncharacterized protein LOC102617616 [Citrus sinensis]
          Length = 3197

 Score =  283 bits (725), Expect = 3e-74
 Identities = 146/268 (54%), Positives = 199/268 (74%), Gaps = 13/268 (4%)
 Frame = +3

Query: 3    ELVHPLDICIFYRYRFLVQDLENAFPGVRGHLHARIKELGISLTELSLDILLFVIGKLKL 182
            ELV P++ICI+YR  F +Q  E  +  V   ++ RIKE  I LTELSLDILLFV+GKL L
Sbjct: 1755 ELVQPVEICIYYRSSFQIQGSEALWHRVPLRIYCRIKEFQIFLTELSLDILLFVVGKLDL 1814

Query: 183  AGPFAVRNSAILANCCKIENRLGLTLHCQFFDNQYASVAGQQSATIFLRHLALANQPPE- 359
            AGP+ +R+S ILANCCK+EN+ GL LHC F + Q  +V  +QSA+IFLR+  L NQ P+ 
Sbjct: 1815 AGPYLIRSSRILANCCKVENQSGLNLHCHFDEQQSVTVGRKQSASIFLRNSTLVNQAPDS 1874

Query: 360  ASVLSIRLADQGALMTSPLHFPLLEARAFAWRTRVVSAQESKTSPGPFIVVEVARRTQEG 539
            +SV+SI+L+  G+  TSP++  LLE+R+  WRTR+VSAQ+S+T PGPFIVV+++R +++G
Sbjct: 1875 SSVVSIQLS-LGSFTTSPIYLSLLESRSLTWRTRIVSAQDSRTFPGPFIVVDISRTSEDG 1933

Query: 540  LSITVSPLLRIHNETDFSVELRFQRPQQRESDFASLLVEPGECVDDSVAAFGANNLS--- 710
            LSI VSPL+R+HNET+FS+ELRF+R Q++E DFAS+L++PG  +DDS+A F A + S   
Sbjct: 1934 LSIVVSPLIRVHNETEFSMELRFRRVQEQEDDFASILLKPGHTIDDSMAMFDAVSFSGGL 1993

Query: 711  ---------GNFLFSFRPRITDEQLNSK 767
                     GNFLFSFRP  +D  ++SK
Sbjct: 1994 KKALMSLSVGNFLFSFRPGSSDGLISSK 2021


>ref|XP_006425795.1| hypothetical protein CICLE_v10024678mg [Citrus clementina]
            gi|557527785|gb|ESR39035.1| hypothetical protein
            CICLE_v10024678mg [Citrus clementina]
          Length = 3169

 Score =  283 bits (725), Expect = 3e-74
 Identities = 146/268 (54%), Positives = 199/268 (74%), Gaps = 13/268 (4%)
 Frame = +3

Query: 3    ELVHPLDICIFYRYRFLVQDLENAFPGVRGHLHARIKELGISLTELSLDILLFVIGKLKL 182
            ELV P++ICI+YR  F +Q  E  +  V   ++ RIKE  I LTELSLDILLFV+GKL L
Sbjct: 1755 ELVQPVEICIYYRSSFQIQGSEALWHRVPLRIYCRIKEFQIFLTELSLDILLFVVGKLDL 1814

Query: 183  AGPFAVRNSAILANCCKIENRLGLTLHCQFFDNQYASVAGQQSATIFLRHLALANQPPE- 359
            AGP+ +R+S ILANCCK+EN+ GL LHC F + Q  +V  +QSA+IFLR+  L NQ P+ 
Sbjct: 1815 AGPYLIRSSRILANCCKVENQSGLNLHCHFDEQQSVTVGRKQSASIFLRNSTLVNQAPDS 1874

Query: 360  ASVLSIRLADQGALMTSPLHFPLLEARAFAWRTRVVSAQESKTSPGPFIVVEVARRTQEG 539
            +SV+SI+L+  G+  TSP++  LLE+R+  WRTR+VSAQ+S+T PGPFIVV+++R +++G
Sbjct: 1875 SSVVSIQLS-LGSFTTSPIYLSLLESRSLTWRTRIVSAQDSRTFPGPFIVVDISRTSEDG 1933

Query: 540  LSITVSPLLRIHNETDFSVELRFQRPQQRESDFASLLVEPGECVDDSVAAFGANNLS--- 710
            LSI VSPL+R+HNET+FS+ELRF+R Q++E DFAS+L++PG  +DDS+A F A + S   
Sbjct: 1934 LSIVVSPLIRVHNETEFSMELRFRRVQEQEDDFASILLKPGHTIDDSMAMFDAVSFSGGL 1993

Query: 711  ---------GNFLFSFRPRITDEQLNSK 767
                     GNFLFSFRP  +D  ++SK
Sbjct: 1994 KKALMSLSVGNFLFSFRPGSSDGLISSK 2021


>gb|EPS71292.1| hypothetical protein M569_03462, partial [Genlisea aurea]
          Length = 730

 Score =  270 bits (689), Expect = 5e-70
 Identities = 132/264 (50%), Positives = 183/264 (69%), Gaps = 12/264 (4%)
 Frame = +3

Query: 9    VHPLDICIFYRYRFLVQDLENAFPGVRGHLHARIKELGISLTELSLDILLFVIGKLKLAG 188
            + PL++CIFY     +   E++  G   H + R  E+ + LT+LSLDILLFVIGKL LAG
Sbjct: 347  IQPLELCIFYSQNIFIHGAESSSHGFSKHFYIRTGEVSVFLTQLSLDILLFVIGKLDLAG 406

Query: 189  PFAVRNSAILANCCKIENRLGLTLHCQFFDNQYASVAGQQSATIFLRHLALANQPPEASV 368
            P+AV++SA+L NCCK+EN+ GLTL CQF+ NQ  ++   QS TIFLRH+ALANQP E S 
Sbjct: 407  PYAVKSSAVLGNCCKVENKSGLTLVCQFYGNQEVAIHASQSNTIFLRHMALANQPLEGSF 466

Query: 369  LSIRLADQGALMTSPLHFPLLEARAFAWRTRVVSAQESKTSPGPFIVVEVARRTQEGLSI 548
            +S +L  +G   +SP+   LLEAR  +WR+RVVS Q+SK+ PGPFI++E+++  ++GLSI
Sbjct: 467  ISAQLVKEGFFSSSPIRLSLLEARKISWRSRVVSLQDSKSFPGPFIIIEISKGVEDGLSI 526

Query: 549  TVSPLLRIHNETDFSVELRFQRPQQRESDFASLLVEPGECVDDSVAAFGANNLS------ 710
            +VSPLL+IHN+TDF +ELRFQRP   E + AS ++  G+ +DD+V AF   ++       
Sbjct: 527  SVSPLLKIHNDTDFQMELRFQRPSGEEPESASFMLGAGDVIDDAVMAFTGIDIPGNLRKV 586

Query: 711  ------GNFLFSFRPRITDEQLNS 764
                  GN+LFSFRP I +   +S
Sbjct: 587  LASLSVGNYLFSFRPVIAEGTTDS 610


>gb|EOX91261.1| Vacuolar protein sorting-associated protein 13C, putative [Theobroma
            cacao]
          Length = 3155

 Score =  266 bits (680), Expect = 6e-69
 Identities = 144/267 (53%), Positives = 192/267 (71%), Gaps = 13/267 (4%)
 Frame = +3

Query: 3    ELVHPLDICIFYRYRFLVQDLENAFPGVRGHLHARIKELGISLTELSLDILLFVIGKLKL 182
            + + P++ICIFYR  F     +N   GV  H++ R KEL ISLTELSLDILLFVIGKL L
Sbjct: 1726 DFLRPVEICIFYRSCF-----QNPH-GVPVHVYCRTKELEISLTELSLDILLFVIGKLNL 1779

Query: 183  AGPFAVRNSAILANCCKIENRLGLTLHCQFFDNQYASVAGQQSATIFLRHLALANQPPEA 362
            AGPF+VR+S ILANC K+EN+ GL L C F+  Q  +V  +QSA+  LR  A  NQPPEA
Sbjct: 1780 AGPFSVRSSMILANCGKVENQTGLNLLCHFYGKQSVTVGRKQSASFSLRVSAFENQPPEA 1839

Query: 363  SV-LSIRLADQGALMTSPLHFPLLEARAFAWRTRVVSAQESKTSPGPFIVVEVARRTQEG 539
            +  LSI+L+  G+  TSP+H  LL A+  AWRTR+VS ++SK+ PGPF+VV+V+R++++G
Sbjct: 1840 AAALSIQLSLPGSFTTSPIHLSLLGAQTLAWRTRLVSLKDSKSYPGPFVVVDVSRKSEDG 1899

Query: 540  LSITVSPLLRIHNETDFSVELRFQRPQQRESDFASLLVEPGECVDDSVAAFGANNLS--- 710
            LSI+VSPL+RIHNET FSVEL+  RP+  E +FAS+L++ G+  DDS+A+F A N S   
Sbjct: 1900 LSISVSPLIRIHNETKFSVELQISRPEPMEDEFASVLLKAGDTFDDSMASFDAINFSGGF 1959

Query: 711  ---------GNFLFSFRPRITDEQLNS 764
                     GNFLFSFRP I+++ ++S
Sbjct: 1960 RKAVMSLNVGNFLFSFRPEISNDLMHS 1986


>ref|XP_004301869.1| PREDICTED: uncharacterized protein LOC101304881 [Fragaria vesca
            subsp. vesca]
          Length = 3178

 Score =  266 bits (680), Expect = 6e-69
 Identities = 143/272 (52%), Positives = 192/272 (70%), Gaps = 17/272 (6%)
 Frame = +3

Query: 3    ELVHPLDICIFYRYRFLVQDLENAFPGVRGHLHARIKELGISLTELSLDILLFVIGKLKL 182
            EL+HP++ C FYR        E    GV  H+H R KEL ISL+ELSLDILLF +GKL L
Sbjct: 1788 ELIHPVETCFFYRS---THSSEGVSHGVPVHIHCRTKELNISLSELSLDILLFTVGKLNL 1844

Query: 183  AGPFAVRNSAILANCCKIENRLGLTLHCQFFDNQYASVAGQQSATIFLRHLALANQPPE- 359
            AGPF+VR++ I ANCCK+EN+ GL L CQ+ D +   V+ +QS +I LR   L NQPPE 
Sbjct: 1845 AGPFSVRSTKIWANCCKVENQSGLNLLCQY-DEESVKVSRRQSTSIILRCSDLENQPPEI 1903

Query: 360  ASVLSIRLADQ-GALMTSPLHFPLLEARAFAWRTRVVSAQESKTSPGPFIVVEVARRTQE 536
            ASV+S++L+    +L TSP+H   LEA+AFAWRT+++S Q+S+T PGPF++V+V+R++++
Sbjct: 1904 ASVVSVQLSGPISSLTTSPIHISRLEAQAFAWRTQIMSLQDSQTYPGPFVIVDVSRKSED 1963

Query: 537  GLSITVSPLLRIHNETDFSVELRFQRPQQRESDFASLLVEPGECVDDSVAAFGANNLS-- 710
            GLSI +SPL+RIHNET  S++LRF+RPQQ+E  FAS+++  G+  DDS+A F A NL+  
Sbjct: 1964 GLSIRISPLIRIHNETGLSIKLRFRRPQQKEDVFASVVLNAGDTYDDSMAMFDAINLAGE 2023

Query: 711  ----------GNFLFSFR---PRITDEQLNSK 767
                      GNFLFSFR   P I D  +NSK
Sbjct: 2024 EKKALRSLSLGNFLFSFRPEIPEIPDGLMNSK 2055


>ref|XP_002877744.1| hypothetical protein ARALYDRAFT_485391 [Arabidopsis lyrata subsp.
            lyrata] gi|297323582|gb|EFH54003.1| hypothetical protein
            ARALYDRAFT_485391 [Arabidopsis lyrata subsp. lyrata]
          Length = 3074

 Score =  249 bits (635), Expect = 9e-64
 Identities = 127/257 (49%), Positives = 176/257 (68%), Gaps = 12/257 (4%)
 Frame = +3

Query: 3    ELVHPLDICIFYRYRFLVQDLENAFPGVRGHLHARIKELGISLTELSLDILLFVIGKLKL 182
            E +HP+++  FYR  F   DL N    V  H++ RI +L + LTELS+D+LLFV+GKL+ 
Sbjct: 1685 EFIHPVEVSAFYRSTFQTPDLNNTMQKVPTHIYCRIGKLDVFLTELSMDMLLFVLGKLEF 1744

Query: 183  AGPFAVRNSAILANCCKIENRLGLTLHCQFFDNQYASVAGQQSATIFLRHLALANQPPEA 362
            AGPF+V+ SAIL+NCCKI+N  GL L C+F + Q A+V  +Q+A+IFLRH    N  PEA
Sbjct: 1745 AGPFSVKTSAILSNCCKIKNLSGLDLICRFNEKQTATVGRKQTASIFLRHSM--NHQPEA 1802

Query: 363  SVLSIRLADQGALMTSPLHFPLLEARAFAWRTRVVSAQESKTSPGPFIVVEVARRTQEGL 542
            S ++      G  +TS ++  LLEAR  AWRTR++S Q++++ PGPF+VV++ +  ++GL
Sbjct: 1803 SPVAAVQLSSGKFITSSINVSLLEARTLAWRTRIISLQDARSHPGPFVVVDIKKGLEDGL 1862

Query: 543  SITVSPLLRIHNETDFSVELRFQRPQQRESDFASLLVEPGECVDDSVAAFGANNLS---- 710
            SI+VSPL RIHNET   +E+RFQR +Q+  DFAS+ ++PG  +DDSVAAF A +LS    
Sbjct: 1863 SISVSPLTRIHNETSLPMEIRFQRSKQKRDDFASVPLKPGGSIDDSVAAFNAISLSGDMK 1922

Query: 711  --------GNFLFSFRP 737
                    GNF  SFRP
Sbjct: 1923 KALTSLAVGNFSLSFRP 1939


>ref|XP_006405272.1| hypothetical protein EUTSA_v10027614mg [Eutrema salsugineum]
            gi|557106410|gb|ESQ46725.1| hypothetical protein
            EUTSA_v10027614mg [Eutrema salsugineum]
          Length = 3132

 Score =  248 bits (634), Expect = 1e-63
 Identities = 128/267 (47%), Positives = 184/267 (68%), Gaps = 12/267 (4%)
 Frame = +3

Query: 3    ELVHPLDICIFYRYRFLVQDLENAFPGVRGHLHARIKELGISLTELSLDILLFVIGKLKL 182
            E +HP+++  FYR  F  QDL+N    V  H++ RI +L + LTELSLD+LLFV+ +L+ 
Sbjct: 1688 EFIHPVEVSAFYRSTFQTQDLKNTMHKVPSHIYCRIGKLEVYLTELSLDMLLFVLEELEF 1747

Query: 183  AGPFAVRNSAILANCCKIENRLGLTLHCQFFDNQYASVAGQQSATIFLRHLALANQPPEA 362
            AGPF+V+ S IL NCCKIEN  GL L C+F + Q  +V+ +Q+A+IFLRH ++ +QP   
Sbjct: 1748 AGPFSVKTSVILPNCCKIENLSGLDLTCRFNEKQTTTVSRKQTASIFLRH-SMNHQPEAF 1806

Query: 363  SVLSIRLADQGALMTSPLHFPLLEARAFAWRTRVVSAQESKTSPGPFIVVEVARRTQEGL 542
             V++++L+  G  +TS L+  LLEAR  AWRTR+VS Q+S++ PGPF+VV++ + +++GL
Sbjct: 1807 PVVAVQLSS-GNFITSSLNVSLLEARTLAWRTRIVSLQDSRSHPGPFVVVDIKKGSEDGL 1865

Query: 543  SITVSPLLRIHNETDFSVELRFQRPQQRESDFASLLVEPGECVDDSVAAFGANNLS---- 710
            SI+VSPL RIHNET F +E+RFQR +Q+  DFAS+ ++PG  +DDSV AF A +LS    
Sbjct: 1866 SISVSPLTRIHNETSFPMEIRFQRSKQKRDDFASVPLKPGASIDDSVGAFNAISLSGDQK 1925

Query: 711  --------GNFLFSFRPRITDEQLNSK 767
                    GN+  SFRP   +    S+
Sbjct: 1926 KALTSLAVGNYSLSFRPESLETLFESE 1952


>ref|XP_006293179.1| hypothetical protein CARUB_v10019496mg [Capsella rubella]
            gi|482561886|gb|EOA26077.1| hypothetical protein
            CARUB_v10019496mg [Capsella rubella]
          Length = 3074

 Score =  245 bits (626), Expect = 1e-62
 Identities = 125/257 (48%), Positives = 177/257 (68%), Gaps = 12/257 (4%)
 Frame = +3

Query: 3    ELVHPLDICIFYRYRFLVQDLENAFPGVRGHLHARIKELGISLTELSLDILLFVIGKLKL 182
            E +HP+++  FYR  F  Q+L+N    V  H++ R+ +L + +TELSLD+LLFV+GKL+ 
Sbjct: 1685 EFIHPVEVSAFYRSTFQTQELQNTMHKVPTHIYCRVGKLEVFVTELSLDMLLFVLGKLEF 1744

Query: 183  AGPFAVRNSAILANCCKIENRLGLTLHCQFFDNQYASVAGQQSATIFLRHLALANQPPEA 362
            AGPF+V+ S+IL+NCCK+EN  GL L C F + Q +++  +Q+A+IFLRH    N  PEA
Sbjct: 1745 AGPFSVKTSSILSNCCKVENLSGLDLICCFNEKQTSTIGRKQTASIFLRHSM--NHQPEA 1802

Query: 363  SVLSIRLADQGALMTSPLHFPLLEARAFAWRTRVVSAQESKTSPGPFIVVEVARRTQEGL 542
            S ++      G  +TS +   LLEAR  AWRTR+VS  +S++ PGPF+VV++ +  ++GL
Sbjct: 1803 SPVAAVQLSSGKFVTSSISVSLLEARTLAWRTRIVSLLDSRSHPGPFVVVDIKKGFEDGL 1862

Query: 543  SITVSPLLRIHNETDFSVELRFQRPQQRESDFASLLVEPGECVDDSVAAFGANNLS---- 710
            SI+VSPL+RIHNET   +E+RFQR +Q++ DFAS+ ++PG  VDDSVAAF A +LS    
Sbjct: 1863 SISVSPLIRIHNETSLPMEIRFQRSKQKKDDFASVPLKPGGSVDDSVAAFNAISLSGDLK 1922

Query: 711  --------GNFLFSFRP 737
                    GNF  SFRP
Sbjct: 1923 KALTSLAVGNFSLSFRP 1939


>ref|NP_190607.2| uncharacterized protein [Arabidopsis thaliana]
            gi|332645140|gb|AEE78661.1| uncharacterized protein
            AT3G50380 [Arabidopsis thaliana]
          Length = 3072

 Score =  242 bits (617), Expect = 1e-61
 Identities = 125/257 (48%), Positives = 173/257 (67%), Gaps = 12/257 (4%)
 Frame = +3

Query: 3    ELVHPLDICIFYRYRFLVQDLENAFPGVRGHLHARIKELGISLTELSLDILLFVIGKLKL 182
            E +HP+++  FYR  F  +DL N    V  H++ RI +L + LTELSLD+LLF++GKL+ 
Sbjct: 1683 EFIHPVEVSAFYRSTFQTRDLNNTMHKVPTHIYCRIGKLEVFLTELSLDMLLFLLGKLEF 1742

Query: 183  AGPFAVRNSAILANCCKIENRLGLTLHCQFFDNQYASVAGQQSATIFLRHLALANQPPEA 362
            AGPF+V+ SAIL+NCCKIEN  GL L C+F + Q A+V  +Q+A IFLRH    N   EA
Sbjct: 1743 AGPFSVKTSAILSNCCKIENLSGLDLICRFNEKQTATVGRKQTAAIFLRHSM--NHQQEA 1800

Query: 363  SVLSIRLADQGALMTSPLHFPLLEARAFAWRTRVVSAQESKTSPGPFIVVEVARRTQEGL 542
            S ++      G  +TS ++  LLEAR  AWRTR++S  +S++ PGPF+VV++ +  ++GL
Sbjct: 1801 SPVAAVQLSSGKFITSSINVSLLEARTLAWRTRIISLLDSRSHPGPFVVVDIKKGLEDGL 1860

Query: 543  SITVSPLLRIHNETDFSVELRFQRPQQRESDFASLLVEPGECVDDSVAAFGANNLS---- 710
            SI+VSPL RIHNET   +E+RFQR +Q+  +FAS+ ++PG  +DDSVAAF A + S    
Sbjct: 1861 SISVSPLTRIHNETSLPIEIRFQRSKQKRDEFASVPLKPGGSIDDSVAAFNAISSSGDMK 1920

Query: 711  --------GNFLFSFRP 737
                    GNF  SFRP
Sbjct: 1921 KALTSLAVGNFSLSFRP 1937


>emb|CAB62317.1| putative protein [Arabidopsis thaliana]
          Length = 3071

 Score =  242 bits (617), Expect = 1e-61
 Identities = 125/257 (48%), Positives = 173/257 (67%), Gaps = 12/257 (4%)
 Frame = +3

Query: 3    ELVHPLDICIFYRYRFLVQDLENAFPGVRGHLHARIKELGISLTELSLDILLFVIGKLKL 182
            E +HP+++  FYR  F  +DL N    V  H++ RI +L + LTELSLD+LLF++GKL+ 
Sbjct: 1682 EFIHPVEVSAFYRSTFQTRDLNNTMHKVPTHIYCRIGKLEVFLTELSLDMLLFLLGKLEF 1741

Query: 183  AGPFAVRNSAILANCCKIENRLGLTLHCQFFDNQYASVAGQQSATIFLRHLALANQPPEA 362
            AGPF+V+ SAIL+NCCKIEN  GL L C+F + Q A+V  +Q+A IFLRH    N   EA
Sbjct: 1742 AGPFSVKTSAILSNCCKIENLSGLDLICRFNEKQTATVGRKQTAAIFLRHSM--NHQQEA 1799

Query: 363  SVLSIRLADQGALMTSPLHFPLLEARAFAWRTRVVSAQESKTSPGPFIVVEVARRTQEGL 542
            S ++      G  +TS ++  LLEAR  AWRTR++S  +S++ PGPF+VV++ +  ++GL
Sbjct: 1800 SPVAAVQLSSGKFITSSINVSLLEARTLAWRTRIISLLDSRSHPGPFVVVDIKKGLEDGL 1859

Query: 543  SITVSPLLRIHNETDFSVELRFQRPQQRESDFASLLVEPGECVDDSVAAFGANNLS---- 710
            SI+VSPL RIHNET   +E+RFQR +Q+  +FAS+ ++PG  +DDSVAAF A + S    
Sbjct: 1860 SISVSPLTRIHNETSLPIEIRFQRSKQKRDEFASVPLKPGGSIDDSVAAFNAISSSGDMK 1919

Query: 711  --------GNFLFSFRP 737
                    GNF  SFRP
Sbjct: 1920 KALTSLAVGNFSLSFRP 1936


>gb|ESW27979.1| hypothetical protein PHAVU_003G249100g [Phaseolus vulgaris]
          Length = 3168

 Score =  240 bits (613), Expect = 3e-61
 Identities = 130/268 (48%), Positives = 176/268 (65%), Gaps = 13/268 (4%)
 Frame = +3

Query: 3    ELVHPLDICIFYRYRFLVQDLENAFPGVRGHLHARIKELGISLTELSLDILLFVIGKLKL 182
            EL+HP++IC+FYR     Q  E     V  +   R+KEL + L E SLD+LLFVIGKL L
Sbjct: 1730 ELLHPVEICLFYRSNIEAQLSEYRSDAVPVNYFCRMKELDVFLNENSLDMLLFVIGKLNL 1789

Query: 183  AGPFAVRNSAILANCCKIENRLGLTLHCQFFDNQYASVAGQQSATIFLRHLA-LANQPPE 359
            +GP+++RNS I ANCCK+EN+ GL LH  F D Q   +  +QSA+I LR ++   NQ  E
Sbjct: 1790 SGPYSMRNSIIEANCCKVENQSGLNLHVHF-DQQSIIIPRKQSASILLRGISDFKNQDSE 1848

Query: 360  ASVLSIRLADQGALMTSPLHFPLLEARAFAWRTRVVSAQESKTSPGPFIVVEVARRTQEG 539
            A+ +SI+L D G+  TS     L   +  +WRTR++SA+ S T PGP  VV + R ++ G
Sbjct: 1849 ATSISIQLTDLGSFATSSNKVSLSRTQTLSWRTRIMSAEGSTTFPGPIFVVNITRNSEVG 1908

Query: 540  LSITVSPLLRIHNETDFSVELRFQRPQQRESDFASLLVEPGECVDDSVAAFGANNLS--- 710
            LS+ VSPL+RIHN T FS+EL+FQR + +E +FASLL+ PG+ +DDS+A F A N S   
Sbjct: 1909 LSVVVSPLIRIHNGTGFSMELQFQRLEPKEDEFASLLLRPGDSIDDSMAMFDAINFSGGV 1968

Query: 711  ---------GNFLFSFRPRITDEQLNSK 767
                     GNFLFSFRP+I +E +NS+
Sbjct: 1969 KRALISLSVGNFLFSFRPKIAEELVNSE 1996


>ref|XP_006598717.1| PREDICTED: uncharacterized protein LOC100527166 isoform X1 [Glycine
            max]
          Length = 3165

 Score =  237 bits (605), Expect = 3e-60
 Identities = 128/268 (47%), Positives = 177/268 (66%), Gaps = 13/268 (4%)
 Frame = +3

Query: 3    ELVHPLDICIFYRYRFLVQDLENAFPGVRGHLHARIKELGISLTELSLDILLFVIGKLKL 182
            EL+HP++ICIFYR     Q  E     V  +   R+KE+ + L E SLD+LLFVIG L L
Sbjct: 1727 ELLHPVEICIFYRSNIQAQLSEYRSHAVPVNFFCRMKEMDVYLNENSLDVLLFVIGILNL 1786

Query: 183  AGPFAVRNSAILANCCKIENRLGLTLHCQFFDNQYASVAGQQSATIFLRHLA-LANQPPE 359
            +GP+++R+S I ANCCK+EN+ GL L    FD Q  ++  +QSA+I LR ++   +Q  E
Sbjct: 1787 SGPYSLRSSIIQANCCKVENQSGLNL-VVHFDQQSITIPRKQSASILLRRISDFKHQASE 1845

Query: 360  ASVLSIRLADQGALMTSPLHFPLLEARAFAWRTRVVSAQESKTSPGPFIVVEVARRTQEG 539
            A+ +SI+L D G+  TS  H  L   +  AWRTR++S + S T PGP  VV ++R ++ G
Sbjct: 1846 ATSISIQLTDFGSFATSSNHLLLSRTQTLAWRTRIMSTEGSTTFPGPMFVVNISRNSEVG 1905

Query: 540  LSITVSPLLRIHNETDFSVELRFQRPQQRESDFASLLVEPGECVDDSVAAFGANNLS--- 710
            LS+ VSPL+RIHN T FS+EL+FQR + +E +FASLL+ PG+ +DDS+A F A N S   
Sbjct: 1906 LSVEVSPLIRIHNGTGFSMELQFQRLEPKEDEFASLLLRPGDSIDDSMAMFDAINFSGGV 1965

Query: 711  ---------GNFLFSFRPRITDEQLNSK 767
                     GNFLFSFRP+IT+E +NS+
Sbjct: 1966 KRALISLSVGNFLFSFRPKITEELINSE 1993


>gb|EXB37240.1| hypothetical protein L484_020299 [Morus notabilis]
          Length = 1451

 Score =  226 bits (575), Expect = 9e-57
 Identities = 124/271 (45%), Positives = 174/271 (64%), Gaps = 16/271 (5%)
 Frame = +3

Query: 3    ELVHPLDICIFYRYRFLVQDLENAFPGVRGHLHARIKELGISLTELSLDILLFVIGKLKL 182
            EL+HP++I +FYR  F +Q  E  F GV  H+H R KEL +SL+ELSLDILLFVIGKL L
Sbjct: 655  ELLHPVEIFLFYRSNFHIQGSEANFHGVPVHIHCRTKELNMSLSELSLDILLFVIGKLNL 714

Query: 183  AGPFAVRNSAILANCCKIENRLGLTLHCQFFDNQYASVAGQQSATIFLRHLALANQPPEA 362
            AGP+++++S IL N CK+EN+ G+ L C FF+ Q   +A +QS +I  R         + 
Sbjct: 715  AGPYSLKSSRILVNRCKVENQTGVNLLCHFFNKQSMKIARKQSTSIVFRIFY------DF 768

Query: 363  SVLSIRLADQGALMTSPLHF---PLLEARAFA-WRTRVVSAQESKTSPGPFIVVEVARRT 530
               S+ LA     + S  H       + R    W +      +S+T PGPF+VV+++R +
Sbjct: 769  PNFSLSLASSKTCLESTYHVISSIQSDGRILGIWMSIFHPYTDSRTYPGPFVVVDISRES 828

Query: 531  QEGLSITVSPLLRIHNETDFSVELRFQRPQQRESDFASLLVEPGECVDDSVAAFGANNLS 710
            ++GLS+ VSPL+RIHNET FS+EL+F+RP Q+E +FASL+++PG+ +DDS+A FGA +LS
Sbjct: 829  EDGLSVIVSPLIRIHNETKFSMELQFRRPHQKEDEFASLVLKPGDTIDDSMAMFGALHLS 888

Query: 711  ------------GNFLFSFRPRITDEQLNSK 767
                        GNFL SFRP  T+  +NSK
Sbjct: 889  GGMKKALTSLSLGNFLLSFRPDTTEGLMNSK 919


>gb|EEE53527.1| hypothetical protein OsJ_36721 [Oryza sativa Japonica Group]
          Length = 4290

 Score =  214 bits (545), Expect = 3e-53
 Identities = 112/243 (46%), Positives = 164/243 (67%), Gaps = 6/243 (2%)
 Frame = +3

Query: 3    ELVHPLDICIFYRYRFLVQDLENAFPGVRGH-----LHARIKELGISLTELSLDILLFVI 167
            ELV P+   +F+RYRF      N  P  R           +K++ I + ELS+DILL+V 
Sbjct: 1766 ELVSPITAYMFFRYRFF-----NLVPVTRCRRMPLRFFVHLKQVDIFVNELSIDILLYVA 1820

Query: 168  GKLKLAGPFAVRNSAILANCCKIENRLGLTLHCQFFDNQYASVAGQQSATIFLRHLALA- 344
            GKL + GP+AV++SA+  NCCKIEN   LTL C F +N+ A V+GQQSA++FLRHL    
Sbjct: 1821 GKLNVMGPYAVKSSAVFPNCCKIENNSRLTLVCHFQNNEDAIVSGQQSASVFLRHLTFED 1880

Query: 345  NQPPEASVLSIRLADQGALMTSPLHFPLLEARAFAWRTRVVSAQESKTSPGPFIVVEVAR 524
            N PP+ S++SI L  +G   T+P++  L ++  FA RTRV+S ++S++  GPF+VV+V++
Sbjct: 1881 NHPPDQSIVSISLFKEGLFSTAPINVSLQDSGVFASRTRVLSLKDSRSFSGPFVVVKVSQ 1940

Query: 525  RTQEGLSITVSPLLRIHNETDFSVELRFQRPQQRESDFASLLVEPGECVDDSVAAFGANN 704
             ++EGLS++V PLLRI+N++DF +ELRFQRPQ+   + A + V  G+ VD+S   F + +
Sbjct: 1941 NSEEGLSLSVQPLLRIYNKSDFPLELRFQRPQKSSEEAAFVTVRSGDMVDESTGVFDSMD 2000

Query: 705  LSG 713
            LSG
Sbjct: 2001 LSG 2003


>gb|EEC69601.1| hypothetical protein OsI_38957 [Oryza sativa Indica Group]
          Length = 4261

 Score =  214 bits (545), Expect = 3e-53
 Identities = 112/243 (46%), Positives = 164/243 (67%), Gaps = 6/243 (2%)
 Frame = +3

Query: 3    ELVHPLDICIFYRYRFLVQDLENAFPGVRGH-----LHARIKELGISLTELSLDILLFVI 167
            ELV P+   +F+RYRF      N  P  R           +K++ I + ELS+DILL+V 
Sbjct: 1737 ELVSPITAYMFFRYRFF-----NLVPVTRCRRMPLRFFVHLKQVDIFVNELSIDILLYVA 1791

Query: 168  GKLKLAGPFAVRNSAILANCCKIENRLGLTLHCQFFDNQYASVAGQQSATIFLRHLALA- 344
            GKL + GP+AV++SA+  NCCKIEN   LTL C F +N+ A V+GQQSA++FLRHL    
Sbjct: 1792 GKLNVMGPYAVKSSAVFPNCCKIENNSRLTLVCHFQNNEDAIVSGQQSASVFLRHLTFED 1851

Query: 345  NQPPEASVLSIRLADQGALMTSPLHFPLLEARAFAWRTRVVSAQESKTSPGPFIVVEVAR 524
            N PP+ S++SI L  +G   T+P++  L ++  FA RTRV+S ++S++  GPF+VV+V++
Sbjct: 1852 NHPPDQSIVSISLFKEGLFSTAPINVSLQDSGVFASRTRVLSLKDSRSFSGPFVVVKVSQ 1911

Query: 525  RTQEGLSITVSPLLRIHNETDFSVELRFQRPQQRESDFASLLVEPGECVDDSVAAFGANN 704
             ++EGLS++V PLLRI+N++DF +ELRFQRPQ+   + A + V  G+ VD+S   F + +
Sbjct: 1912 NSEEGLSLSVQPLLRIYNKSDFPLELRFQRPQKSSEEAAFVTVRSGDMVDESTGVFDSMD 1971

Query: 705  LSG 713
            LSG
Sbjct: 1972 LSG 1974


>gb|EMS50104.1| Retrovirus-related Pol polyprotein LINE-1 [Triticum urartu]
          Length = 3154

 Score =  201 bits (510), Expect = 3e-49
 Identities = 109/244 (44%), Positives = 155/244 (63%), Gaps = 7/244 (2%)
 Frame = +3

Query: 3    ELVHPLDICIFYRYRFLVQDLENAFPGVRGHLHARIKELGISLTELSLDILLFVIGKLKL 182
            +L+ P+   +F R+RF  QD               +K++ I + ELS+D+LL+++GKL L
Sbjct: 1596 DLISPITSYVFLRFRFFNQDSVTRRSRTPLRFFFHLKQVDIFINELSVDMLLYLVGKLGL 1655

Query: 183  AGPFAVRNSAILANCCKIENRLGLTLHCQFFDNQYASVAGQQSATIFLRHLAL-----AN 347
             GP+AVRNSAI  NCCKIEN   L L C F +N  A V GQQS ++FL  LA       N
Sbjct: 1656 MGPYAVRNSAIFPNCCKIENNSRLALVCHFQNNGDAIVPGQQSTSVFLSILARNFVFDDN 1715

Query: 348  QPPEASVLSIRLADQGALMTSPLHFPLLEARAFAWRTRVVSAQESKTSPGPFIVVEVARR 527
            +P + S++SI L  +GA  T+P++ PL E+  +AWRT   S ++S+   GPF+VV+V++ 
Sbjct: 1716 RPHDQSLVSISLFKEGAFSTAPINIPLHESGIYAWRTLASSLKDSRRFSGPFVVVKVSQN 1775

Query: 528  T--QEGLSITVSPLLRIHNETDFSVELRFQRPQQRESDFASLLVEPGECVDDSVAAFGAN 701
            +  QEGLS++V PLLRI+N++DF +ELRFQRPQ    + A + V  G+ VD+S     A 
Sbjct: 1776 SVLQEGLSLSVQPLLRIYNKSDFPLELRFQRPQNENEEAALVTVRSGDMVDESTGVLDAM 1835

Query: 702  NLSG 713
            NLSG
Sbjct: 1836 NLSG 1839

