BLASTX nr result
ID: Catharanthus23_contig00034228
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00034228
         (771 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_004233645.1| PREDICTED: uncharacterized protein LOC101257...   320   4e-85
ref|XP_006338249.1| PREDICTED: uncharacterized protein LOC102601...   317   3e-84
ref|XP_006338248.1| PREDICTED: uncharacterized protein LOC102601...   317   3e-84
emb|CBI40980.3| unnamed protein product [Vitis vinifera]              290   3e-76
ref|XP_006466676.1| PREDICTED: uncharacterized protein LOC102617...   283   3e-74
ref|XP_006425795.1| hypothetical protein CICLE_v10024678mg [Citr...   283   3e-74
gb|EPS71292.1| hypothetical protein M569_03462, partial [Genlise...   270   5e-70
gb|EOX91261.1| Vacuolar protein sorting-associated protein 13C, ...   266   6e-69
ref|XP_004301869.1| PREDICTED: uncharacterized protein LOC101304...   266   6e-69
ref|XP_002877744.1| hypothetical protein ARALYDRAFT_485391 [Arab...   249   9e-64
ref|XP_006405272.1| hypothetical protein EUTSA_v10027614mg [Eutr...   248   1e-63
ref|XP_006293179.1| hypothetical protein CARUB_v10019496mg [Caps...   245   1e-62
ref|NP_190607.2| uncharacterized protein [Arabidopsis thaliana] ...   242   1e-61
emb|CAB62317.1| putative protein [Arabidopsis thaliana]               242   1e-61
gb|ESW27979.1| hypothetical protein PHAVU_003G249100g [Phaseolus...   240   3e-61
ref|XP_006598717.1| PREDICTED: uncharacterized protein LOC100527...   237   3e-60
gb|EXB37240.1| hypothetical protein L484_020299 [Morus notabilis]     226   9e-57
gb|EEE53527.1| hypothetical protein OsJ_36721 [Oryza sativa Japo...   214   3e-53
gb|EEC69601.1| hypothetical protein OsI_38957 [Oryza sativa Indi...
214 3e-53 gb|EMS50104.1| Retrovirus-related Pol polyprotein LINE-1 [Tritic... 201 3e-49 >ref|XP_004233645.1| PREDICTED: uncharacterized protein LOC101257436 [Solanum lycopersicum] Length = 3178 Score = 320 bits (819), Expect = 4e-85 Identities = 162/265 (61%), Positives = 202/265 (76%), Gaps = 12/265 (4%) Frame = +3 Query: 3 ELVHPLDICIFYRYRFLVQDLENAFPGVRGHLHARIKELGISLTELSLDILLFVIGKLKL 182 +L+HPL+I +FYRY FL Q EN+ V GH +ARIKEL +++TELSLDI+LF+IGKL L Sbjct: 1731 DLMHPLEIDVFYRYTFLNQGPENSILWVPGHFYARIKELSMTITELSLDIILFIIGKLNL 1790 Query: 183 AGPFAVRNSAILANCCKIENRLGLTLHCQFFDNQYASVAGQQSATIFLRHLALANQPPEA 362 AGP+AV++S ILANCCK+EN+ GLTL CQF+DNQ SVAG+Q+ TIFLRH+ALAN+PPEA Sbjct: 1791 AGPYAVKDSTILANCCKVENQSGLTLVCQFYDNQDVSVAGRQATTIFLRHMALANRPPEA 1850 Query: 363 SVLSIRLADQGALMTSPLHFPLLEARAFAWRTRVVSAQESKTSPGPFIVVEVARRTQEGL 542 S SI+L ++G L TS LH LLE ++FAWR R+VS QESKT PGPF+V EV+ T++ L Sbjct: 1851 SFFSIQLIERGLLSTSLLHLSLLETQSFAWRPRIVSLQESKTYPGPFLVAEVSPGTEDYL 1910 Query: 543 SITVSPLLRIHNETDFSVELRFQRPQQRESDFASLLVEPGECVDDSVAAFGANNLS---- 710 SI VSPLLRIHN T F +ELRFQRPQ +E D+AS+ +E G+ +DDS+ AF A NLS Sbjct: 1911 SIGVSPLLRIHNNTKFPMELRFQRPQHKEIDYASVRLEAGDTIDDSMTAFSAINLSGGRK 1970 Query: 711 --------GNFLFSFRPRITDEQLN 761 GNFL SFRP +TD N Sbjct: 1971 KTLNSLSVGNFLLSFRPEVTDVLTN 1995 >ref|XP_006338249.1| PREDICTED: uncharacterized protein LOC102601421 isoform X2 [Solanum tuberosum] Length = 2549 Score = 317 bits (812), Expect = 3e-84 Identities = 160/265 (60%), Positives = 200/265 (75%), Gaps = 12/265 (4%) Frame = +3 Query: 3 ELVHPLDICIFYRYRFLVQDLENAFPGVRGHLHARIKELGISLTELSLDILLFVIGKLKL 182 +L+HPL+I +FYRY FL Q EN V GH +ARIKEL +++TELSLDI+LF+IGKL Sbjct: 1737 DLMHPLEIDVFYRYTFLNQGPENIILWVPGHFYARIKELSMTITELSLDIILFIIGKLNF 1796 Query: 183 AGPFAVRNSAILANCCKIENRLGLTLHCQFFDNQYASVAGQQSATIFLRHLALANQPPEA 362 AGP+AV++S ILANCCK+EN+ GLTL CQF+DNQ SVAG+ + TIFLRH+ALAN+PPEA Sbjct: 1797 AGPYAVKDSTILANCCKVENQSGLTLVCQFYDNQDVSVAGRHATTIFLRHMALANRPPEA 1856 Query: 363 SVLSIRLADQGALMTSPLHFPLLEARAFAWRTRVVSAQESKTSPGPFIVVEVARRTQEGL 542 
S SI+L ++G L TS LH LLE ++FAWR R+VS QESKT PGPF+V EV+ T++ L Sbjct: 1857 SFFSIQLIERGLLSTSLLHLSLLETQSFAWRPRIVSLQESKTYPGPFLVAEVSPGTEDYL 1916 Query: 543 SITVSPLLRIHNETDFSVELRFQRPQQRESDFASLLVEPGECVDDSVAAFGANNLS---- 710 SI VSPLLRIHN+T F +ELRFQRPQ +E D+AS+ +E G+ +DDS+ AF A NLS Sbjct: 1917 SIVVSPLLRIHNDTKFPMELRFQRPQHKEIDYASVRLEAGDTIDDSMTAFSAINLSGGRK 1976 Query: 711 --------GNFLFSFRPRITDEQLN 761 GNFL SFRP +TD N Sbjct: 1977 KTLNSLSVGNFLLSFRPEVTDVLTN 2001 >ref|XP_006338248.1| PREDICTED: uncharacterized protein LOC102601421 isoform X1 [Solanum tuberosum] Length = 3185 Score = 317 bits (812), Expect = 3e-84 Identities = 160/265 (60%), Positives = 200/265 (75%), Gaps = 12/265 (4%) Frame = +3 Query: 3 ELVHPLDICIFYRYRFLVQDLENAFPGVRGHLHARIKELGISLTELSLDILLFVIGKLKL 182 +L+HPL+I +FYRY FL Q EN V GH +ARIKEL +++TELSLDI+LF+IGKL Sbjct: 1737 DLMHPLEIDVFYRYTFLNQGPENIILWVPGHFYARIKELSMTITELSLDIILFIIGKLNF 1796 Query: 183 AGPFAVRNSAILANCCKIENRLGLTLHCQFFDNQYASVAGQQSATIFLRHLALANQPPEA 362 AGP+AV++S ILANCCK+EN+ GLTL CQF+DNQ SVAG+ + TIFLRH+ALAN+PPEA Sbjct: 1797 AGPYAVKDSTILANCCKVENQSGLTLVCQFYDNQDVSVAGRHATTIFLRHMALANRPPEA 1856 Query: 363 SVLSIRLADQGALMTSPLHFPLLEARAFAWRTRVVSAQESKTSPGPFIVVEVARRTQEGL 542 S SI+L ++G L TS LH LLE ++FAWR R+VS QESKT PGPF+V EV+ T++ L Sbjct: 1857 SFFSIQLIERGLLSTSLLHLSLLETQSFAWRPRIVSLQESKTYPGPFLVAEVSPGTEDYL 1916 Query: 543 SITVSPLLRIHNETDFSVELRFQRPQQRESDFASLLVEPGECVDDSVAAFGANNLS---- 710 SI VSPLLRIHN+T F +ELRFQRPQ +E D+AS+ +E G+ +DDS+ AF A NLS Sbjct: 1917 SIVVSPLLRIHNDTKFPMELRFQRPQHKEIDYASVRLEAGDTIDDSMTAFSAINLSGGRK 1976 Query: 711 --------GNFLFSFRPRITDEQLN 761 GNFL SFRP +TD N Sbjct: 1977 KTLNSLSVGNFLLSFRPEVTDVLTN 2001 >emb|CBI40980.3| unnamed protein product [Vitis vinifera] Length = 2083 Score = 290 bits (743), Expect = 3e-76 Identities = 152/268 (56%), Positives = 201/268 (75%), Gaps = 13/268 (4%) Frame = +3 Query: 3 ELVHPLDICIFYRYRFLVQDLENAFPGVRGHLHARIKELGISLTELSLDILLFVIGKLKL 182 ELVHP++ICIFYR F ++ E V H + R KE+ ISLTE+SLDILLFVIGKL L Sbjct: 619 
ELVHPVEICIFYRSSFQIEGSEIVSQSVPMHFYFRCKEVEISLTEVSLDILLFVIGKLNL 678 Query: 183 AGPFAVRNSAILANCCKIENRLGLTLHCQFFDNQYASVAGQQSATIFLRHLALANQPPE- 359 AGPF+V+ S ILA+CCK+EN+ GL L ++ D+Q S+A +QSA+IFLRHLA A+Q PE Sbjct: 679 AGPFSVKTSMILAHCCKVENQSGLNLLFRYQDDQGLSIARKQSASIFLRHLASADQSPEN 738 Query: 360 ASVLSIRLADQGALMTSPLHFPLLEARAFAWRTRVVSAQESKTSPGPFIVVEVARRTQEG 539 AS SI+L+ G+ TSP+H L + + AWRTR+VS Q+SKT PGPFIVV+++R++++G Sbjct: 739 ASFASIQLSWFGSFSTSPIHLSLSKTQVLAWRTRIVSLQDSKTYPGPFIVVDISRKSEDG 798 Query: 540 LSITVSPLLRIHNETDFSVELRFQRPQQRESDFASLLVEPGECVDDSVAAFGANNLS--- 710 LS+ VSPL+RIHNET FS+ LRFQRPQQ E++FAS+L++ G+ +DDS+AAF + N+S Sbjct: 799 LSVVVSPLIRIHNETTFSMALRFQRPQQVETEFASVLLKTGDTIDDSMAAFDSINVSGGL 858 Query: 711 ---------GNFLFSFRPRITDEQLNSK 767 GNFLFSFRP ITD+ +SK Sbjct: 859 KKALLSLSVGNFLFSFRPEITDDLGSSK 886 >ref|XP_006466676.1| PREDICTED: uncharacterized protein LOC102617616 [Citrus sinensis] Length = 3197 Score = 283 bits (725), Expect = 3e-74 Identities = 146/268 (54%), Positives = 199/268 (74%), Gaps = 13/268 (4%) Frame = +3 Query: 3 ELVHPLDICIFYRYRFLVQDLENAFPGVRGHLHARIKELGISLTELSLDILLFVIGKLKL 182 ELV P++ICI+YR F +Q E + V ++ RIKE I LTELSLDILLFV+GKL L Sbjct: 1755 ELVQPVEICIYYRSSFQIQGSEALWHRVPLRIYCRIKEFQIFLTELSLDILLFVVGKLDL 1814 Query: 183 AGPFAVRNSAILANCCKIENRLGLTLHCQFFDNQYASVAGQQSATIFLRHLALANQPPE- 359 AGP+ +R+S ILANCCK+EN+ GL LHC F + Q +V +QSA+IFLR+ L NQ P+ Sbjct: 1815 AGPYLIRSSRILANCCKVENQSGLNLHCHFDEQQSVTVGRKQSASIFLRNSTLVNQAPDS 1874 Query: 360 ASVLSIRLADQGALMTSPLHFPLLEARAFAWRTRVVSAQESKTSPGPFIVVEVARRTQEG 539 +SV+SI+L+ G+ TSP++ LLE+R+ WRTR+VSAQ+S+T PGPFIVV+++R +++G Sbjct: 1875 SSVVSIQLS-LGSFTTSPIYLSLLESRSLTWRTRIVSAQDSRTFPGPFIVVDISRTSEDG 1933 Query: 540 LSITVSPLLRIHNETDFSVELRFQRPQQRESDFASLLVEPGECVDDSVAAFGANNLS--- 710 LSI VSPL+R+HNET+FS+ELRF+R Q++E DFAS+L++PG +DDS+A F A + S Sbjct: 1934 LSIVVSPLIRVHNETEFSMELRFRRVQEQEDDFASILLKPGHTIDDSMAMFDAVSFSGGL 1993 Query: 711 ---------GNFLFSFRPRITDEQLNSK 767 GNFLFSFRP +D ++SK Sbjct: 1994 KKALMSLSVGNFLFSFRPGSSDGLISSK 2021 >ref|XP_006425795.1| 
hypothetical protein CICLE_v10024678mg [Citrus clementina] gi|557527785|gb|ESR39035.1| hypothetical protein CICLE_v10024678mg [Citrus clementina] Length = 3169 Score = 283 bits (725), Expect = 3e-74 Identities = 146/268 (54%), Positives = 199/268 (74%), Gaps = 13/268 (4%) Frame = +3 Query: 3 ELVHPLDICIFYRYRFLVQDLENAFPGVRGHLHARIKELGISLTELSLDILLFVIGKLKL 182 ELV P++ICI+YR F +Q E + V ++ RIKE I LTELSLDILLFV+GKL L Sbjct: 1755 ELVQPVEICIYYRSSFQIQGSEALWHRVPLRIYCRIKEFQIFLTELSLDILLFVVGKLDL 1814 Query: 183 AGPFAVRNSAILANCCKIENRLGLTLHCQFFDNQYASVAGQQSATIFLRHLALANQPPE- 359 AGP+ +R+S ILANCCK+EN+ GL LHC F + Q +V +QSA+IFLR+ L NQ P+ Sbjct: 1815 AGPYLIRSSRILANCCKVENQSGLNLHCHFDEQQSVTVGRKQSASIFLRNSTLVNQAPDS 1874 Query: 360 ASVLSIRLADQGALMTSPLHFPLLEARAFAWRTRVVSAQESKTSPGPFIVVEVARRTQEG 539 +SV+SI+L+ G+ TSP++ LLE+R+ WRTR+VSAQ+S+T PGPFIVV+++R +++G Sbjct: 1875 SSVVSIQLS-LGSFTTSPIYLSLLESRSLTWRTRIVSAQDSRTFPGPFIVVDISRTSEDG 1933 Query: 540 LSITVSPLLRIHNETDFSVELRFQRPQQRESDFASLLVEPGECVDDSVAAFGANNLS--- 710 LSI VSPL+R+HNET+FS+ELRF+R Q++E DFAS+L++PG +DDS+A F A + S Sbjct: 1934 LSIVVSPLIRVHNETEFSMELRFRRVQEQEDDFASILLKPGHTIDDSMAMFDAVSFSGGL 1993 Query: 711 ---------GNFLFSFRPRITDEQLNSK 767 GNFLFSFRP +D ++SK Sbjct: 1994 KKALMSLSVGNFLFSFRPGSSDGLISSK 2021 >gb|EPS71292.1| hypothetical protein M569_03462, partial [Genlisea aurea] Length = 730 Score = 270 bits (689), Expect = 5e-70 Identities = 132/264 (50%), Positives = 183/264 (69%), Gaps = 12/264 (4%) Frame = +3 Query: 9 VHPLDICIFYRYRFLVQDLENAFPGVRGHLHARIKELGISLTELSLDILLFVIGKLKLAG 188 + PL++CIFY + E++ G H + R E+ + LT+LSLDILLFVIGKL LAG Sbjct: 347 IQPLELCIFYSQNIFIHGAESSSHGFSKHFYIRTGEVSVFLTQLSLDILLFVIGKLDLAG 406 Query: 189 PFAVRNSAILANCCKIENRLGLTLHCQFFDNQYASVAGQQSATIFLRHLALANQPPEASV 368 P+AV++SA+L NCCK+EN+ GLTL CQF+ NQ ++ QS TIFLRH+ALANQP E S Sbjct: 407 PYAVKSSAVLGNCCKVENKSGLTLVCQFYGNQEVAIHASQSNTIFLRHMALANQPLEGSF 466 Query: 369 LSIRLADQGALMTSPLHFPLLEARAFAWRTRVVSAQESKTSPGPFIVVEVARRTQEGLSI 548 +S +L +G +SP+ LLEAR +WR+RVVS Q+SK+ PGPFI++E+++ ++GLSI Sbjct: 467 
ISAQLVKEGFFSSSPIRLSLLEARKISWRSRVVSLQDSKSFPGPFIIIEISKGVEDGLSI 526 Query: 549 TVSPLLRIHNETDFSVELRFQRPQQRESDFASLLVEPGECVDDSVAAFGANNLS------ 710 +VSPLL+IHN+TDF +ELRFQRP E + AS ++ G+ +DD+V AF ++ Sbjct: 527 SVSPLLKIHNDTDFQMELRFQRPSGEEPESASFMLGAGDVIDDAVMAFTGIDIPGNLRKV 586 Query: 711 ------GNFLFSFRPRITDEQLNS 764 GN+LFSFRP I + +S Sbjct: 587 LASLSVGNYLFSFRPVIAEGTTDS 610 >gb|EOX91261.1| Vacuolar protein sorting-associated protein 13C, putative [Theobroma cacao] Length = 3155 Score = 266 bits (680), Expect = 6e-69 Identities = 144/267 (53%), Positives = 192/267 (71%), Gaps = 13/267 (4%) Frame = +3 Query: 3 ELVHPLDICIFYRYRFLVQDLENAFPGVRGHLHARIKELGISLTELSLDILLFVIGKLKL 182 + + P++ICIFYR F +N GV H++ R KEL ISLTELSLDILLFVIGKL L Sbjct: 1726 DFLRPVEICIFYRSCF-----QNPH-GVPVHVYCRTKELEISLTELSLDILLFVIGKLNL 1779 Query: 183 AGPFAVRNSAILANCCKIENRLGLTLHCQFFDNQYASVAGQQSATIFLRHLALANQPPEA 362 AGPF+VR+S ILANC K+EN+ GL L C F+ Q +V +QSA+ LR A NQPPEA Sbjct: 1780 AGPFSVRSSMILANCGKVENQTGLNLLCHFYGKQSVTVGRKQSASFSLRVSAFENQPPEA 1839 Query: 363 SV-LSIRLADQGALMTSPLHFPLLEARAFAWRTRVVSAQESKTSPGPFIVVEVARRTQEG 539 + LSI+L+ G+ TSP+H LL A+ AWRTR+VS ++SK+ PGPF+VV+V+R++++G Sbjct: 1840 AAALSIQLSLPGSFTTSPIHLSLLGAQTLAWRTRLVSLKDSKSYPGPFVVVDVSRKSEDG 1899 Query: 540 LSITVSPLLRIHNETDFSVELRFQRPQQRESDFASLLVEPGECVDDSVAAFGANNLS--- 710 LSI+VSPL+RIHNET FSVEL+ RP+ E +FAS+L++ G+ DDS+A+F A N S Sbjct: 1900 LSISVSPLIRIHNETKFSVELQISRPEPMEDEFASVLLKAGDTFDDSMASFDAINFSGGF 1959 Query: 711 ---------GNFLFSFRPRITDEQLNS 764 GNFLFSFRP I+++ ++S Sbjct: 1960 RKAVMSLNVGNFLFSFRPEISNDLMHS 1986 >ref|XP_004301869.1| PREDICTED: uncharacterized protein LOC101304881 [Fragaria vesca subsp. 
vesca] Length = 3178 Score = 266 bits (680), Expect = 6e-69 Identities = 143/272 (52%), Positives = 192/272 (70%), Gaps = 17/272 (6%) Frame = +3 Query: 3 ELVHPLDICIFYRYRFLVQDLENAFPGVRGHLHARIKELGISLTELSLDILLFVIGKLKL 182 EL+HP++ C FYR E GV H+H R KEL ISL+ELSLDILLF +GKL L Sbjct: 1788 ELIHPVETCFFYRS---THSSEGVSHGVPVHIHCRTKELNISLSELSLDILLFTVGKLNL 1844 Query: 183 AGPFAVRNSAILANCCKIENRLGLTLHCQFFDNQYASVAGQQSATIFLRHLALANQPPE- 359 AGPF+VR++ I ANCCK+EN+ GL L CQ+ D + V+ +QS +I LR L NQPPE Sbjct: 1845 AGPFSVRSTKIWANCCKVENQSGLNLLCQY-DEESVKVSRRQSTSIILRCSDLENQPPEI 1903 Query: 360 ASVLSIRLADQ-GALMTSPLHFPLLEARAFAWRTRVVSAQESKTSPGPFIVVEVARRTQE 536 ASV+S++L+ +L TSP+H LEA+AFAWRT+++S Q+S+T PGPF++V+V+R++++ Sbjct: 1904 ASVVSVQLSGPISSLTTSPIHISRLEAQAFAWRTQIMSLQDSQTYPGPFVIVDVSRKSED 1963 Query: 537 GLSITVSPLLRIHNETDFSVELRFQRPQQRESDFASLLVEPGECVDDSVAAFGANNLS-- 710 GLSI +SPL+RIHNET S++LRF+RPQQ+E FAS+++ G+ DDS+A F A NL+ Sbjct: 1964 GLSIRISPLIRIHNETGLSIKLRFRRPQQKEDVFASVVLNAGDTYDDSMAMFDAINLAGE 2023 Query: 711 ----------GNFLFSFR---PRITDEQLNSK 767 GNFLFSFR P I D +NSK Sbjct: 2024 EKKALRSLSLGNFLFSFRPEIPEIPDGLMNSK 2055 >ref|XP_002877744.1| hypothetical protein ARALYDRAFT_485391 [Arabidopsis lyrata subsp. lyrata] gi|297323582|gb|EFH54003.1| hypothetical protein ARALYDRAFT_485391 [Arabidopsis lyrata subsp. 
lyrata] Length = 3074 Score = 249 bits (635), Expect = 9e-64 Identities = 127/257 (49%), Positives = 176/257 (68%), Gaps = 12/257 (4%) Frame = +3 Query: 3 ELVHPLDICIFYRYRFLVQDLENAFPGVRGHLHARIKELGISLTELSLDILLFVIGKLKL 182 E +HP+++ FYR F DL N V H++ RI +L + LTELS+D+LLFV+GKL+ Sbjct: 1685 EFIHPVEVSAFYRSTFQTPDLNNTMQKVPTHIYCRIGKLDVFLTELSMDMLLFVLGKLEF 1744 Query: 183 AGPFAVRNSAILANCCKIENRLGLTLHCQFFDNQYASVAGQQSATIFLRHLALANQPPEA 362 AGPF+V+ SAIL+NCCKI+N GL L C+F + Q A+V +Q+A+IFLRH N PEA Sbjct: 1745 AGPFSVKTSAILSNCCKIKNLSGLDLICRFNEKQTATVGRKQTASIFLRHSM--NHQPEA 1802 Query: 363 SVLSIRLADQGALMTSPLHFPLLEARAFAWRTRVVSAQESKTSPGPFIVVEVARRTQEGL 542 S ++ G +TS ++ LLEAR AWRTR++S Q++++ PGPF+VV++ + ++GL Sbjct: 1803 SPVAAVQLSSGKFITSSINVSLLEARTLAWRTRIISLQDARSHPGPFVVVDIKKGLEDGL 1862 Query: 543 SITVSPLLRIHNETDFSVELRFQRPQQRESDFASLLVEPGECVDDSVAAFGANNLS---- 710 SI+VSPL RIHNET +E+RFQR +Q+ DFAS+ ++PG +DDSVAAF A +LS Sbjct: 1863 SISVSPLTRIHNETSLPMEIRFQRSKQKRDDFASVPLKPGGSIDDSVAAFNAISLSGDMK 1922 Query: 711 --------GNFLFSFRP 737 GNF SFRP Sbjct: 1923 KALTSLAVGNFSLSFRP 1939 >ref|XP_006405272.1| hypothetical protein EUTSA_v10027614mg [Eutrema salsugineum] gi|557106410|gb|ESQ46725.1| hypothetical protein EUTSA_v10027614mg [Eutrema salsugineum] Length = 3132 Score = 248 bits (634), Expect = 1e-63 Identities = 128/267 (47%), Positives = 184/267 (68%), Gaps = 12/267 (4%) Frame = +3 Query: 3 ELVHPLDICIFYRYRFLVQDLENAFPGVRGHLHARIKELGISLTELSLDILLFVIGKLKL 182 E +HP+++ FYR F QDL+N V H++ RI +L + LTELSLD+LLFV+ +L+ Sbjct: 1688 EFIHPVEVSAFYRSTFQTQDLKNTMHKVPSHIYCRIGKLEVYLTELSLDMLLFVLEELEF 1747 Query: 183 AGPFAVRNSAILANCCKIENRLGLTLHCQFFDNQYASVAGQQSATIFLRHLALANQPPEA 362 AGPF+V+ S IL NCCKIEN GL L C+F + Q +V+ +Q+A+IFLRH ++ +QP Sbjct: 1748 AGPFSVKTSVILPNCCKIENLSGLDLTCRFNEKQTTTVSRKQTASIFLRH-SMNHQPEAF 1806 Query: 363 SVLSIRLADQGALMTSPLHFPLLEARAFAWRTRVVSAQESKTSPGPFIVVEVARRTQEGL 542 V++++L+ G +TS L+ LLEAR AWRTR+VS Q+S++ PGPF+VV++ + +++GL Sbjct: 1807 PVVAVQLSS-GNFITSSLNVSLLEARTLAWRTRIVSLQDSRSHPGPFVVVDIKKGSEDGL 1865 Query: 543 
SITVSPLLRIHNETDFSVELRFQRPQQRESDFASLLVEPGECVDDSVAAFGANNLS---- 710 SI+VSPL RIHNET F +E+RFQR +Q+ DFAS+ ++PG +DDSV AF A +LS Sbjct: 1866 SISVSPLTRIHNETSFPMEIRFQRSKQKRDDFASVPLKPGASIDDSVGAFNAISLSGDQK 1925 Query: 711 --------GNFLFSFRPRITDEQLNSK 767 GN+ SFRP + S+ Sbjct: 1926 KALTSLAVGNYSLSFRPESLETLFESE 1952 >ref|XP_006293179.1| hypothetical protein CARUB_v10019496mg [Capsella rubella] gi|482561886|gb|EOA26077.1| hypothetical protein CARUB_v10019496mg [Capsella rubella] Length = 3074 Score = 245 bits (626), Expect = 1e-62 Identities = 125/257 (48%), Positives = 177/257 (68%), Gaps = 12/257 (4%) Frame = +3 Query: 3 ELVHPLDICIFYRYRFLVQDLENAFPGVRGHLHARIKELGISLTELSLDILLFVIGKLKL 182 E +HP+++ FYR F Q+L+N V H++ R+ +L + +TELSLD+LLFV+GKL+ Sbjct: 1685 EFIHPVEVSAFYRSTFQTQELQNTMHKVPTHIYCRVGKLEVFVTELSLDMLLFVLGKLEF 1744 Query: 183 AGPFAVRNSAILANCCKIENRLGLTLHCQFFDNQYASVAGQQSATIFLRHLALANQPPEA 362 AGPF+V+ S+IL+NCCK+EN GL L C F + Q +++ +Q+A+IFLRH N PEA Sbjct: 1745 AGPFSVKTSSILSNCCKVENLSGLDLICCFNEKQTSTIGRKQTASIFLRHSM--NHQPEA 1802 Query: 363 SVLSIRLADQGALMTSPLHFPLLEARAFAWRTRVVSAQESKTSPGPFIVVEVARRTQEGL 542 S ++ G +TS + LLEAR AWRTR+VS +S++ PGPF+VV++ + ++GL Sbjct: 1803 SPVAAVQLSSGKFVTSSISVSLLEARTLAWRTRIVSLLDSRSHPGPFVVVDIKKGFEDGL 1862 Query: 543 SITVSPLLRIHNETDFSVELRFQRPQQRESDFASLLVEPGECVDDSVAAFGANNLS---- 710 SI+VSPL+RIHNET +E+RFQR +Q++ DFAS+ ++PG VDDSVAAF A +LS Sbjct: 1863 SISVSPLIRIHNETSLPMEIRFQRSKQKKDDFASVPLKPGGSVDDSVAAFNAISLSGDLK 1922 Query: 711 --------GNFLFSFRP 737 GNF SFRP Sbjct: 1923 KALTSLAVGNFSLSFRP 1939 >ref|NP_190607.2| uncharacterized protein [Arabidopsis thaliana] gi|332645140|gb|AEE78661.1| uncharacterized protein AT3G50380 [Arabidopsis thaliana] Length = 3072 Score = 242 bits (617), Expect = 1e-61 Identities = 125/257 (48%), Positives = 173/257 (67%), Gaps = 12/257 (4%) Frame = +3 Query: 3 ELVHPLDICIFYRYRFLVQDLENAFPGVRGHLHARIKELGISLTELSLDILLFVIGKLKL 182 E +HP+++ FYR F +DL N V H++ RI +L + LTELSLD+LLF++GKL+ Sbjct: 1683 EFIHPVEVSAFYRSTFQTRDLNNTMHKVPTHIYCRIGKLEVFLTELSLDMLLFLLGKLEF 1742 
Query: 183 AGPFAVRNSAILANCCKIENRLGLTLHCQFFDNQYASVAGQQSATIFLRHLALANQPPEA 362 AGPF+V+ SAIL+NCCKIEN GL L C+F + Q A+V +Q+A IFLRH N EA Sbjct: 1743 AGPFSVKTSAILSNCCKIENLSGLDLICRFNEKQTATVGRKQTAAIFLRHSM--NHQQEA 1800 Query: 363 SVLSIRLADQGALMTSPLHFPLLEARAFAWRTRVVSAQESKTSPGPFIVVEVARRTQEGL 542 S ++ G +TS ++ LLEAR AWRTR++S +S++ PGPF+VV++ + ++GL Sbjct: 1801 SPVAAVQLSSGKFITSSINVSLLEARTLAWRTRIISLLDSRSHPGPFVVVDIKKGLEDGL 1860 Query: 543 SITVSPLLRIHNETDFSVELRFQRPQQRESDFASLLVEPGECVDDSVAAFGANNLS---- 710 SI+VSPL RIHNET +E+RFQR +Q+ +FAS+ ++PG +DDSVAAF A + S Sbjct: 1861 SISVSPLTRIHNETSLPIEIRFQRSKQKRDEFASVPLKPGGSIDDSVAAFNAISSSGDMK 1920 Query: 711 --------GNFLFSFRP 737 GNF SFRP Sbjct: 1921 KALTSLAVGNFSLSFRP 1937 >emb|CAB62317.1| putative protein [Arabidopsis thaliana] Length = 3071 Score = 242 bits (617), Expect = 1e-61 Identities = 125/257 (48%), Positives = 173/257 (67%), Gaps = 12/257 (4%) Frame = +3 Query: 3 ELVHPLDICIFYRYRFLVQDLENAFPGVRGHLHARIKELGISLTELSLDILLFVIGKLKL 182 E +HP+++ FYR F +DL N V H++ RI +L + LTELSLD+LLF++GKL+ Sbjct: 1682 EFIHPVEVSAFYRSTFQTRDLNNTMHKVPTHIYCRIGKLEVFLTELSLDMLLFLLGKLEF 1741 Query: 183 AGPFAVRNSAILANCCKIENRLGLTLHCQFFDNQYASVAGQQSATIFLRHLALANQPPEA 362 AGPF+V+ SAIL+NCCKIEN GL L C+F + Q A+V +Q+A IFLRH N EA Sbjct: 1742 AGPFSVKTSAILSNCCKIENLSGLDLICRFNEKQTATVGRKQTAAIFLRHSM--NHQQEA 1799 Query: 363 SVLSIRLADQGALMTSPLHFPLLEARAFAWRTRVVSAQESKTSPGPFIVVEVARRTQEGL 542 S ++ G +TS ++ LLEAR AWRTR++S +S++ PGPF+VV++ + ++GL Sbjct: 1800 SPVAAVQLSSGKFITSSINVSLLEARTLAWRTRIISLLDSRSHPGPFVVVDIKKGLEDGL 1859 Query: 543 SITVSPLLRIHNETDFSVELRFQRPQQRESDFASLLVEPGECVDDSVAAFGANNLS---- 710 SI+VSPL RIHNET +E+RFQR +Q+ +FAS+ ++PG +DDSVAAF A + S Sbjct: 1860 SISVSPLTRIHNETSLPIEIRFQRSKQKRDEFASVPLKPGGSIDDSVAAFNAISSSGDMK 1919 Query: 711 --------GNFLFSFRP 737 GNF SFRP Sbjct: 1920 KALTSLAVGNFSLSFRP 1936 >gb|ESW27979.1| hypothetical protein PHAVU_003G249100g [Phaseolus vulgaris] Length = 3168 Score = 240 bits (613), Expect = 3e-61 Identities = 130/268 (48%), Positives = 176/268 (65%), Gaps = 13/268 (4%) 
Frame = +3 Query: 3 ELVHPLDICIFYRYRFLVQDLENAFPGVRGHLHARIKELGISLTELSLDILLFVIGKLKL 182 EL+HP++IC+FYR Q E V + R+KEL + L E SLD+LLFVIGKL L Sbjct: 1730 ELLHPVEICLFYRSNIEAQLSEYRSDAVPVNYFCRMKELDVFLNENSLDMLLFVIGKLNL 1789 Query: 183 AGPFAVRNSAILANCCKIENRLGLTLHCQFFDNQYASVAGQQSATIFLRHLA-LANQPPE 359 +GP+++RNS I ANCCK+EN+ GL LH F D Q + +QSA+I LR ++ NQ E Sbjct: 1790 SGPYSMRNSIIEANCCKVENQSGLNLHVHF-DQQSIIIPRKQSASILLRGISDFKNQDSE 1848 Query: 360 ASVLSIRLADQGALMTSPLHFPLLEARAFAWRTRVVSAQESKTSPGPFIVVEVARRTQEG 539 A+ +SI+L D G+ TS L + +WRTR++SA+ S T PGP VV + R ++ G Sbjct: 1849 ATSISIQLTDLGSFATSSNKVSLSRTQTLSWRTRIMSAEGSTTFPGPIFVVNITRNSEVG 1908 Query: 540 LSITVSPLLRIHNETDFSVELRFQRPQQRESDFASLLVEPGECVDDSVAAFGANNLS--- 710 LS+ VSPL+RIHN T FS+EL+FQR + +E +FASLL+ PG+ +DDS+A F A N S Sbjct: 1909 LSVVVSPLIRIHNGTGFSMELQFQRLEPKEDEFASLLLRPGDSIDDSMAMFDAINFSGGV 1968 Query: 711 ---------GNFLFSFRPRITDEQLNSK 767 GNFLFSFRP+I +E +NS+ Sbjct: 1969 KRALISLSVGNFLFSFRPKIAEELVNSE 1996 >ref|XP_006598717.1| PREDICTED: uncharacterized protein LOC100527166 isoform X1 [Glycine max] Length = 3165 Score = 237 bits (605), Expect = 3e-60 Identities = 128/268 (47%), Positives = 177/268 (66%), Gaps = 13/268 (4%) Frame = +3 Query: 3 ELVHPLDICIFYRYRFLVQDLENAFPGVRGHLHARIKELGISLTELSLDILLFVIGKLKL 182 EL+HP++ICIFYR Q E V + R+KE+ + L E SLD+LLFVIG L L Sbjct: 1727 ELLHPVEICIFYRSNIQAQLSEYRSHAVPVNFFCRMKEMDVYLNENSLDVLLFVIGILNL 1786 Query: 183 AGPFAVRNSAILANCCKIENRLGLTLHCQFFDNQYASVAGQQSATIFLRHLA-LANQPPE 359 +GP+++R+S I ANCCK+EN+ GL L FD Q ++ +QSA+I LR ++ +Q E Sbjct: 1787 SGPYSLRSSIIQANCCKVENQSGLNL-VVHFDQQSITIPRKQSASILLRRISDFKHQASE 1845 Query: 360 ASVLSIRLADQGALMTSPLHFPLLEARAFAWRTRVVSAQESKTSPGPFIVVEVARRTQEG 539 A+ +SI+L D G+ TS H L + AWRTR++S + S T PGP VV ++R ++ G Sbjct: 1846 ATSISIQLTDFGSFATSSNHLLLSRTQTLAWRTRIMSTEGSTTFPGPMFVVNISRNSEVG 1905 Query: 540 LSITVSPLLRIHNETDFSVELRFQRPQQRESDFASLLVEPGECVDDSVAAFGANNLS--- 710 LS+ VSPL+RIHN T FS+EL+FQR + +E +FASLL+ PG+ +DDS+A F A N S Sbjct: 1906 
LSVEVSPLIRIHNGTGFSMELQFQRLEPKEDEFASLLLRPGDSIDDSMAMFDAINFSGGV 1965 Query: 711 ---------GNFLFSFRPRITDEQLNSK 767 GNFLFSFRP+IT+E +NS+ Sbjct: 1966 KRALISLSVGNFLFSFRPKITEELINSE 1993 >gb|EXB37240.1| hypothetical protein L484_020299 [Morus notabilis] Length = 1451 Score = 226 bits (575), Expect = 9e-57 Identities = 124/271 (45%), Positives = 174/271 (64%), Gaps = 16/271 (5%) Frame = +3 Query: 3 ELVHPLDICIFYRYRFLVQDLENAFPGVRGHLHARIKELGISLTELSLDILLFVIGKLKL 182 EL+HP++I +FYR F +Q E F GV H+H R KEL +SL+ELSLDILLFVIGKL L Sbjct: 655 ELLHPVEIFLFYRSNFHIQGSEANFHGVPVHIHCRTKELNMSLSELSLDILLFVIGKLNL 714 Query: 183 AGPFAVRNSAILANCCKIENRLGLTLHCQFFDNQYASVAGQQSATIFLRHLALANQPPEA 362 AGP+++++S IL N CK+EN+ G+ L C FF+ Q +A +QS +I R + Sbjct: 715 AGPYSLKSSRILVNRCKVENQTGVNLLCHFFNKQSMKIARKQSTSIVFRIFY------DF 768 Query: 363 SVLSIRLADQGALMTSPLHF---PLLEARAFA-WRTRVVSAQESKTSPGPFIVVEVARRT 530 S+ LA + S H + R W + +S+T PGPF+VV+++R + Sbjct: 769 PNFSLSLASSKTCLESTYHVISSIQSDGRILGIWMSIFHPYTDSRTYPGPFVVVDISRES 828 Query: 531 QEGLSITVSPLLRIHNETDFSVELRFQRPQQRESDFASLLVEPGECVDDSVAAFGANNLS 710 ++GLS+ VSPL+RIHNET FS+EL+F+RP Q+E +FASL+++PG+ +DDS+A FGA +LS Sbjct: 829 EDGLSVIVSPLIRIHNETKFSMELQFRRPHQKEDEFASLVLKPGDTIDDSMAMFGALHLS 888 Query: 711 ------------GNFLFSFRPRITDEQLNSK 767 GNFL SFRP T+ +NSK Sbjct: 889 GGMKKALTSLSLGNFLLSFRPDTTEGLMNSK 919 >gb|EEE53527.1| hypothetical protein OsJ_36721 [Oryza sativa Japonica Group] Length = 4290 Score = 214 bits (545), Expect = 3e-53 Identities = 112/243 (46%), Positives = 164/243 (67%), Gaps = 6/243 (2%) Frame = +3 Query: 3 ELVHPLDICIFYRYRFLVQDLENAFPGVRGH-----LHARIKELGISLTELSLDILLFVI 167 ELV P+ +F+RYRF N P R +K++ I + ELS+DILL+V Sbjct: 1766 ELVSPITAYMFFRYRFF-----NLVPVTRCRRMPLRFFVHLKQVDIFVNELSIDILLYVA 1820 Query: 168 GKLKLAGPFAVRNSAILANCCKIENRLGLTLHCQFFDNQYASVAGQQSATIFLRHLALA- 344 GKL + GP+AV++SA+ NCCKIEN LTL C F +N+ A V+GQQSA++FLRHL Sbjct: 1821 GKLNVMGPYAVKSSAVFPNCCKIENNSRLTLVCHFQNNEDAIVSGQQSASVFLRHLTFED 1880 Query: 345 NQPPEASVLSIRLADQGALMTSPLHFPLLEARAFAWRTRVVSAQESKTSPGPFIVVEVAR 524 N 
PP+ S++SI L +G T+P++ L ++ FA RTRV+S ++S++ GPF+VV+V++ Sbjct: 1881 NHPPDQSIVSISLFKEGLFSTAPINVSLQDSGVFASRTRVLSLKDSRSFSGPFVVVKVSQ 1940 Query: 525 RTQEGLSITVSPLLRIHNETDFSVELRFQRPQQRESDFASLLVEPGECVDDSVAAFGANN 704 ++EGLS++V PLLRI+N++DF +ELRFQRPQ+ + A + V G+ VD+S F + + Sbjct: 1941 NSEEGLSLSVQPLLRIYNKSDFPLELRFQRPQKSSEEAAFVTVRSGDMVDESTGVFDSMD 2000 Query: 705 LSG 713 LSG Sbjct: 2001 LSG 2003 >gb|EEC69601.1| hypothetical protein OsI_38957 [Oryza sativa Indica Group] Length = 4261 Score = 214 bits (545), Expect = 3e-53 Identities = 112/243 (46%), Positives = 164/243 (67%), Gaps = 6/243 (2%) Frame = +3 Query: 3 ELVHPLDICIFYRYRFLVQDLENAFPGVRGH-----LHARIKELGISLTELSLDILLFVI 167 ELV P+ +F+RYRF N P R +K++ I + ELS+DILL+V Sbjct: 1737 ELVSPITAYMFFRYRFF-----NLVPVTRCRRMPLRFFVHLKQVDIFVNELSIDILLYVA 1791 Query: 168 GKLKLAGPFAVRNSAILANCCKIENRLGLTLHCQFFDNQYASVAGQQSATIFLRHLALA- 344 GKL + GP+AV++SA+ NCCKIEN LTL C F +N+ A V+GQQSA++FLRHL Sbjct: 1792 GKLNVMGPYAVKSSAVFPNCCKIENNSRLTLVCHFQNNEDAIVSGQQSASVFLRHLTFED 1851 Query: 345 NQPPEASVLSIRLADQGALMTSPLHFPLLEARAFAWRTRVVSAQESKTSPGPFIVVEVAR 524 N PP+ S++SI L +G T+P++ L ++ FA RTRV+S ++S++ GPF+VV+V++ Sbjct: 1852 NHPPDQSIVSISLFKEGLFSTAPINVSLQDSGVFASRTRVLSLKDSRSFSGPFVVVKVSQ 1911 Query: 525 RTQEGLSITVSPLLRIHNETDFSVELRFQRPQQRESDFASLLVEPGECVDDSVAAFGANN 704 ++EGLS++V PLLRI+N++DF +ELRFQRPQ+ + A + V G+ VD+S F + + Sbjct: 1912 NSEEGLSLSVQPLLRIYNKSDFPLELRFQRPQKSSEEAAFVTVRSGDMVDESTGVFDSMD 1971 Query: 705 LSG 713 LSG Sbjct: 1972 LSG 1974 >gb|EMS50104.1| Retrovirus-related Pol polyprotein LINE-1 [Triticum urartu] Length = 3154 Score = 201 bits (510), Expect = 3e-49 Identities = 109/244 (44%), Positives = 155/244 (63%), Gaps = 7/244 (2%) Frame = +3 Query: 3 ELVHPLDICIFYRYRFLVQDLENAFPGVRGHLHARIKELGISLTELSLDILLFVIGKLKL 182 +L+ P+ +F R+RF QD +K++ I + ELS+D+LL+++GKL L Sbjct: 1596 DLISPITSYVFLRFRFFNQDSVTRRSRTPLRFFFHLKQVDIFINELSVDMLLYLVGKLGL 1655 Query: 183 AGPFAVRNSAILANCCKIENRLGLTLHCQFFDNQYASVAGQQSATIFLRHLAL-----AN 347 GP+AVRNSAI NCCKIEN L L C F +N A V GQQS ++FL LA N Sbjct: 
1656 MGPYAVRNSAIFPNCCKIENNSRLALVCHFQNNGDAIVPGQQSTSVFLSILARNFVFDDN 1715 Query: 348 QPPEASVLSIRLADQGALMTSPLHFPLLEARAFAWRTRVVSAQESKTSPGPFIVVEVARR 527 +P + S++SI L +GA T+P++ PL E+ +AWRT S ++S+ GPF+VV+V++ Sbjct: 1716 RPHDQSLVSISLFKEGAFSTAPINIPLHESGIYAWRTLASSLKDSRRFSGPFVVVKVSQN 1775 Query: 528 T--QEGLSITVSPLLRIHNETDFSVELRFQRPQQRESDFASLLVEPGECVDDSVAAFGAN 701 + QEGLS++V PLLRI+N++DF +ELRFQRPQ + A + V G+ VD+S A Sbjct: 1776 SVLQEGLSLSVQPLLRIYNKSDFPLELRFQRPQNENEEAALVTVRSGDMVDESTGVLDAM 1835 Query: 702 NLSG 713 NLSG Sbjct: 1836 NLSG 1839