BLASTX nr result
ID: Catharanthus22_contig00012391
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Catharanthus22_contig00012391 (1773 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_006359164.1| PREDICTED: mucin-2-like isoform X2 [Solanum ... 245 5e-62 ref|XP_004229349.1| PREDICTED: uncharacterized protein LOC101251... 239 3e-60 ref|XP_006359163.1| PREDICTED: mucin-2-like isoform X1 [Solanum ... 236 2e-59 emb|CBI32170.3| unnamed protein product [Vitis vinifera] 190 1e-45 gb|EPS68426.1| hypothetical protein M569_06343, partial [Genlise... 164 1e-37 ref|XP_002330893.1| predicted protein [Populus trichocarpa] gi|5... 162 3e-37 ref|XP_006439026.1| hypothetical protein CICLE_v10032226mg [Citr... 150 2e-33 gb|EOX96773.1| Hydroxyproline-rich glycoprotein family protein, ... 149 3e-33 gb|EOX96772.1| Hydroxyproline-rich glycoprotein family protein, ... 145 5e-32 ref|XP_006439027.1| hypothetical protein CICLE_v10032226mg [Citr... 144 9e-32 ref|XP_004149622.1| PREDICTED: uncharacterized protein LOC101211... 144 9e-32 gb|EXC42165.1| hypothetical protein L484_002415 [Morus notabilis] 133 3e-28 ref|XP_003546862.1| PREDICTED: uncharacterized protein LOC100779... 129 4e-27 ref|XP_004306495.1| PREDICTED: uncharacterized protein LOC101308... 128 9e-27 ref|NP_180843.2| proline-rich uncharacterized protein [Arabidops... 125 4e-26 ref|XP_006294535.1| hypothetical protein CARUB_v10023571mg [Caps... 125 6e-26 ref|XP_002881247.1| hypothetical protein ARALYDRAFT_482225 [Arab... 125 8e-26 ref|XP_006598246.1| PREDICTED: uncharacterized protein LOC100779... 122 5e-25 gb|EOX96774.1| Hydroxyproline-rich glycoprotein family protein, ... 122 6e-25 ref|XP_006585335.1| PREDICTED: uncharacterized protein LOC100808... 121 8e-25 >ref|XP_006359164.1| PREDICTED: mucin-2-like isoform X2 [Solanum tuberosum] Length = 344 Score = 245 bits (625), Expect = 5e-62 Identities = 156/318 (49%), Positives = 188/318 (59%), Gaps = 11/318 (3%) Frame = -1 Query: 1497 PFYSNHHPLPVNSRLPTNPNPNYAHLVPSPHPHDVGXXXXXXXXXAILYPVASSGRGFLT 1318 PF P RLP + NP+Y+ LV P D +ILYPVASSGRGFL+ Sbjct: 53 PFSLQSSHFPSTQRLPPSSNPSYSQLVLKPPNPD-----SQPHLHSILYPVASSGRGFLS 107 Query: 1317 KQTTPVTLANPGPNSPLFQSRPGMSY--PRPPFGYSHSDPSLVQGMGYVTGRPTHVHHGA 1144 K + + +RP +S+ RP FG + DP L Q G RP+H+ H A Sbjct: 108 KPSN-------------YPNRPVVSHLGSRPTFGLNQMDPGLGQSTGV---RPSHLQH-A 150 Query: 1143 TMGNS--------AGAAGVMPGVIKGVPVSVASSHTQHKITISPNSNSDCNGHKDLRDRT 988 +G+S A +AGV+PG +KG PV V+SSH HKI + S SDCNG ++ RDR+ Sbjct: 151 LLGSSPTVNSAGPAASAGVLPGAVKGFPV-VSSSH--HKIASTQPSLSDCNGFREKRDRS 207 Query: 987 KDESAFATIRDRKVRISDNVSLYALCRSWLRNGLPEESQLQYTDAVRSLPRPSPLSAQDI 808 KD++ FA IRDRKVRISDN SLY LCRSWLRNGLP+++Q QY D VRSLPRP L+ QD Sbjct: 208 KDDT-FAIIRDRKVRISDNASLYTLCRSWLRNGLPDDTQSQYMDGVRSLPRPLALAPQDA 266 Query: 807 TSPV-XXXXXXXXXXXXETVEHLTPKELLQRHVKHAKXXXXXXXXXXXXXXXRYKTRLAL 631 SPV E+VEHL+PKELLQRHVK AK RYKTRLAL Sbjct: 267 ESPVKKEGDKEEEEEAGESVEHLSPKELLQRHVKRAKRIRSRLREERLRRIARYKTRLAL 326 Query: 630 LLPPMVDPQLRNDLAPGN 577 LLPPMV+ Q RND A GN Sbjct: 327 LLPPMVEQQFRNDPASGN 344 >ref|XP_004229349.1| PREDICTED: uncharacterized protein LOC101251026 [Solanum lycopersicum] Length = 342 Score = 239 bits (610), Expect = 3e-60 Identities = 157/318 (49%), Positives = 185/318 (58%), Gaps = 11/318 (3%) Frame = -1 Query: 1497 PFYSNHHPLPVNSRLPTNPNPNYAHLVPSPHPHDVGXXXXXXXXXAILYPVASSGRGFLT 1318 PF P RLP + NP Y+ LV P D +ILYPVASSGRGFL+ Sbjct: 51 PFSLQSSHFPSTQRLPPSSNPGYSQLVLKPPNPD-----SQPHLHSILYPVASSGRGFLS 105 Query: 1317 KQTTPVTLANPGPNSPLFQSRPGMSY--PRPPFGYSHSDPSLVQGMGYVTGRPTHVHHGA 1144 K + + +RP +S+ RP FG + DP Q G RP+H+ H A Sbjct: 106 KPSN-------------YPNRPVVSHLGSRPVFGVNQMDPGSGQSAGV---RPSHLQH-A 148 Query: 1143 TMG-----NSAGAA---GVMPGVIKGVPVSVASSHTQHKITISPNSNSDCNGHKDLRDRT 988 +G NSAG A GV+PG +KG PV V+SSH +KI + S SDCNG +D RDR+ Sbjct: 149 LLGSSPTVNSAGPAASSGVLPGAVKGFPV-VSSSH--NKIASTQPSLSDCNGFRDKRDRS 205 Query: 987 KDESAFATIRDRKVRISDNVSLYALCRSWLRNGLPEESQLQYTDAVRSLPRPSPLSAQDI 808 KDE+ FA IRDRKVRI DN SLY LCRSWLRNGLP+++Q QY D VRSLPRP L+ QD Sbjct: 206 KDET-FAIIRDRKVRICDNASLYTLCRSWLRNGLPDDTQSQYMDGVRSLPRPLALAPQDA 264 Query: 807 TSPV-XXXXXXXXXXXXETVEHLTPKELLQRHVKHAKXXXXXXXXXXXXXXXRYKTRLAL 631 SPV E+VEHL+PKELLQRHVK AK RYKTRLAL Sbjct: 265 ESPVKKEGDKEEEEEAGESVEHLSPKELLQRHVKRAKRIRSRLREERLRRIARYKTRLAL 324 Query: 630 LLPPMVDPQLRNDLAPGN 577 LLPPMV+ Q RND A GN Sbjct: 325 LLPPMVEQQFRNDPASGN 342 >ref|XP_006359163.1| PREDICTED: mucin-2-like isoform X1 [Solanum tuberosum] Length = 366 Score = 236 bits (603), Expect = 2e-59 Identities = 156/340 (45%), Positives = 188/340 (55%), Gaps = 33/340 (9%) Frame = -1 Query: 1497 PFYSNHHPLPVNSRLPTNPNPNYAHLVPSPHPHDVGXXXXXXXXXAILYPVASSGRGFLT 1318 PF P RLP + NP+Y+ LV P D +ILYPVASSGRGFL+ Sbjct: 53 PFSLQSSHFPSTQRLPPSSNPSYSQLVLKPPNPD-----SQPHLHSILYPVASSGRGFLS 107 Query: 1317 KQTTPVTLANPGPNSPLFQSRPGMSY--PRPPFGYSHSDPSLVQGMGYVTGRPTHVHHGA 1144 K + + +RP +S+ RP FG + DP L Q G RP+H+ H A Sbjct: 108 KPSN-------------YPNRPVVSHLGSRPTFGLNQMDPGLGQSTGV---RPSHLQH-A 150 Query: 1143 TMGNS--------AGAAGVMPGVIKGVPVSVASSHTQHKITISPNSNSDCNGHKDLRDRT 988 +G+S A +AGV+PG +KG PV V+SSH HKI + S SDCNG ++ RDR+ Sbjct: 151 LLGSSPTVNSAGPAASAGVLPGAVKGFPV-VSSSH--HKIASTQPSLSDCNGFREKRDRS 207 Query: 987 KDESAFATIRDRKVRISDNVSLYALCRSWLRNGLPEESQLQYTDAVRSLPRPSPLSAQDI 808 KD++ FA IRDRKVRISDN SLY LCRSWLRNGLP+++Q QY D VRSLPRP L+ QD Sbjct: 208 KDDT-FAIIRDRKVRISDNASLYTLCRSWLRNGLPDDTQSQYMDGVRSLPRPLALAPQDA 266 Query: 807 TSPV-----------------------XXXXXXXXXXXXETVEHLTPKELLQRHVKHAKX 697 SPV E+VEHL+PKELLQRHVK AK Sbjct: 267 ESPVKKEGDKEEEEEDCSFSSMLILKRVNFPIPINFKAGESVEHLSPKELLQRHVKRAKR 326 Query: 696 XXXXXXXXXXXXXXRYKTRLALLLPPMVDPQLRNDLAPGN 577 RYKTRLALLLPPMV+ Q RND A GN Sbjct: 327 IRSRLREERLRRIARYKTRLALLLPPMVEQQFRNDPASGN 366 >emb|CBI32170.3| unnamed protein product [Vitis vinifera] Length = 342 Score = 190 bits (483), Expect = 1e-45 Identities = 130/306 (42%), Positives = 162/306 (52%), Gaps = 14/306 (4%) Frame = -1 Query: 1452 PTNPNPNYAHLVPSPH---PHDVGXXXXXXXXXAILYPVASSGRGFLTKQTTP------- 1303 P +P P VP+P PHD ILYPVASSGRGF+ K P Sbjct: 62 PPHPLPYSTIRVPNPQLAKPHD--------PPQGILYPVASSGRGFIPKPLRPQSSDHNT 113 Query: 1302 VTLANPG---PNSPLFQSRPGMSYPRPPFGYSHSDPSL-VQGMGYVTGRPTHVHHGATMG 1135 VT+ANPG P + S+ PFG+ SD + V M P+HV A G Sbjct: 114 VTVANPGAAFPPRSAATAAAAFSHQARPFGFPQSDLNYPVHSMRMPHLLPSHVGVTAVPG 173 Query: 1134 NSAGAAGVMPGVIKGVPVSVASSHTQHKITISPNSNSDCNGHKDLRDRTKDESAFATIRD 955 ++ IKG+PVS K+ SP S SDCNG+KD RDR +D++ F T+RD Sbjct: 174 SAP---------IKGIPVSA-----HPKVAPSPPSVSDCNGYKDSRDRNRDDT-FVTVRD 218 Query: 954 RKVRISDNVSLYALCRSWLRNGLPEESQLQYTDAVRSLPRPSPLSAQDITSPVXXXXXXX 775 RKVRISD S+YALCRSWLRNG EE+Q Q+ D+++SLPRP P+ D P Sbjct: 219 RKVRISDGASIYALCRSWLRNGFSEETQPQHYDSMKSLPRPLPIPVTDPNLP-KKKEDDE 277 Query: 774 XXXXXETVEHLTPKELLQRHVKHAKXXXXXXXXXXXXXXXRYKTRLALLLPPMVDPQLRN 595 +VE+L P++LLQRH+K AK RYKTRLALLLPP V+ + RN Sbjct: 278 EEEDEGSVENLLPQDLLQRHIKRAKKVRARLREQRLKRIARYKTRLALLLPPPVE-RFRN 336 Query: 594 DLAPGN 577 D GN Sbjct: 337 DTGAGN 342 >gb|EPS68426.1| hypothetical protein M569_06343, partial [Genlisea aurea] Length = 302 Score = 164 bits (415), Expect = 1e-37 Identities = 123/301 (40%), Positives = 152/301 (50%), Gaps = 8/301 (2%) Frame = -1 Query: 1503 PAPFYSNHHPLPVNSRLPTNPNPNYAHLVP-SPHPHDVGXXXXXXXXXAILYPVASSGRG 1327 P PFYS SRLP+NPNPNY L P +PH D + SSG G Sbjct: 45 PLPFYSQSP-----SRLPSNPNPNYPQLAPRTPHSQDPSQ-------------IGSSGGG 86 Query: 1326 FLTKQTTPVTLANPGPNSPLFQSRPGMSYPRPPFGYSHSDPSLVQGMGYVTGRPTHVHH- 1150 +++ PL RP RPP+G S L QG+ RP +++H Sbjct: 87 IVSR--------------PLSAGRPTQ---RPPYG---SPCLLDQGLA----RPNNLNHV 122 Query: 1149 --GATMGNSA--GAAGVMPGVIKGVPVSVASSHTQHKITISPNSNSDCNGHK-DLRDRTK 985 G G+SA AG MPGV +G+P +S H +I D NGH DLR R + Sbjct: 123 ILGPMRGSSADTSGAGAMPGVAQGIPFPTSSHSKVHPHSILVG---DSNGHTTDLRGRHR 179 Query: 984 DESAFATIRDRKVRISDNVSLYALCRSWLRNGLPEESQLQYTDAVRSLPRPSPLSAQDIT 805 D+ A IRDRKVR+S+N SLYALCRSWLRNG+P + Q QY D V+SLPRPS +S Q Sbjct: 180 DD-VVALIRDRKVRLSENASLYALCRSWLRNGVPADMQPQYVDVVKSLPRPSHVSGQTAD 238 Query: 804 SP-VXXXXXXXXXXXXETVEHLTPKELLQRHVKHAKXXXXXXXXXXXXXXXRYKTRLALL 628 SP ++V L+ KELLQRH+K AK RYK+RLALL Sbjct: 239 SPEKNEASSEVETEDEDSVGQLSEKELLQRHIKRAKKIRSKLNERRFKRIDRYKSRLALL 298 Query: 627 L 625 L Sbjct: 299 L 299 >ref|XP_002330893.1| predicted protein [Populus trichocarpa] gi|566150610|ref|XP_006369465.1| hydroxyproline-rich glycoprotein [Populus trichocarpa] gi|550348014|gb|ERP66034.1| hydroxyproline-rich glycoprotein [Populus trichocarpa] Length = 340 Score = 162 bits (411), Expect = 3e-37 Identities = 120/337 (35%), Positives = 164/337 (48%), Gaps = 21/337 (6%) Frame = -1 Query: 1524 TAHPYTCPAPFYSNH-HPLPVNSRLPTNPNPNYAHLVPSPHPHDVGXXXXXXXXXAILYP 1348 T P +PF +H H PV PNP ++P H +LYP Sbjct: 38 TTTPPRPQSPFQIHHQHIYPVIRPQTQTPNP----IIPPSHQ-------------GVLYP 80 Query: 1347 VASSGRGFLTKQTTPVTLANPGPNSPLFQSRPGMSYPRP---------PFGYSHSDPSLV 1195 VASSGRGF+ + P P G++Y RP P SH +P + Sbjct: 81 VASSGRGFIPRPVRPHQDQTPANQGAYHPRGAGVAY-RPHTPTTVVGSPSSRSHPNPQQL 139 Query: 1194 QGMGYVTG-----------RPTHVHHGATMGNSAGAAGVMPGVIKGVPVSVASSHTQHKI 1048 + ++ PTH+ H +G G G + IKG+PV+ ++ Sbjct: 140 GDLHHLHNVQQQHLMMSRQHPTHLQHHNYVGFGLGV-GSVAAPIKGIPVT-------GQL 191 Query: 1047 TISPNSNSDCNGHKDLRDRTKDESAFATIRDRKVRISDNVSLYALCRSWLRNGLPEESQL 868 ++P+ SD NG+K+LRDR++D++ +RDRKVRISD LYALCRSWLRNG PEES++ Sbjct: 192 KVAPSPVSDSNGYKNLRDRSRDDNLMV-VRDRKVRISDGAPLYALCRSWLRNGFPEESEV 250 Query: 867 QYTDAVRSLPRPSPLSAQDITSPVXXXXXXXXXXXXETVEHLTPKELLQRHVKHAKXXXX 688 Y D+V+ LPRP L ++ V E V++L+ ELL+RH+KHAK Sbjct: 251 HYGDSVKPLPRPL-LPKEESEEEV-----EKEKKDEEPVDNLSAAELLKRHIKHAKKVRA 304 Query: 687 XXXXXXXXXXXRYKTRLALLLPPMVDPQLRNDLAPGN 577 RYK+RLALLLPP V+ Q RND N Sbjct: 305 RLREERLKRIARYKSRLALLLPPQVE-QFRNDTPAEN 340 >ref|XP_006439026.1| hypothetical protein CICLE_v10032226mg [Citrus clementina] gi|557541222|gb|ESR52266.1| hypothetical protein CICLE_v10032226mg [Citrus clementina] Length = 303 Score = 150 bits (378), Expect = 2e-33 Identities = 112/309 (36%), Positives = 152/309 (49%), Gaps = 11/309 (3%) Frame = -1 Query: 1470 PVNSRLPTNPNPNYAHLVPSPHPHDVGXXXXXXXXXAILYPVASSGRGFLTKQTTP---- 1303 P + + P H+ + P + ++YPVASSGRGF+ K P Sbjct: 6 PFTTATTPSQAPPQQHMYQNQRPANPSHSQGQAQGQGVVYPVASSGRGFIPKPMRPSDQT 65 Query: 1302 VTLANPGPNSPLFQSRPGMSYPRPPFGYSHSDPSLVQGMGYVTGRPTHVHHGATMGNSAG 1123 VT+AN G P P YPRP +H P L + RP +++ Sbjct: 66 VTVANHGGYPPRPNQLP--PYPRPHLD-NHHHPVLHHHQHHHMIRPPPLNNQQHQHPQIS 122 Query: 1122 AAGVMPGVIKGVPVSVASSHTQ----HKITISPNSNSDCNGH-KDLRDRTKDESAFATIR 958 + P I+GVPVS S H + ++SP D NG+ K LRD + + F +R Sbjct: 123 SN---PSPIRGVPVS--SGHLKVAPSSSASLSPVIPPDSNGYNKHLRDNS--DETFTIVR 175 Query: 957 DRKVRISDNVSLYALCRSWLRNGLPEESQLQYTDAVRSLPRPSPLSAQD--ITSPVXXXX 784 DRKVRI++ SLYALCRSWLRNG PEE+Q Q+ D V+SLPRP P+ D I Sbjct: 176 DRKVRITEGASLYALCRSWLRNGSPEETQPQHADGVKSLPRPLPMPRADANIAKEKESEE 235 Query: 783 XXXXXXXXETVEHLTPKELLQRHVKHAKXXXXXXXXXXXXXXXRYKTRLALLLPPMVDPQ 604 E V+ L+ ++LL+RHV+ AK RYKTRL+LLLPP+V+ Q Sbjct: 236 DEDETDEDENVDRLSEEDLLRRHVQRAKQIRARLSNERAKRIERYKTRLSLLLPPLVE-Q 294 Query: 603 LRNDLAPGN 577 +ND G+ Sbjct: 295 SQNDAHAGS 303 >gb|EOX96773.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 2 [Theobroma cacao] Length = 276 Score = 149 bits (377), Expect = 3e-33 Identities = 113/316 (35%), Positives = 149/316 (47%), Gaps = 12/316 (3%) Frame = -1 Query: 1488 SNHHPLPVNSRLPTNPNPNYAHLVP----SPHPHDVGXXXXXXXXXAILYPVASSGRGFL 1321 SN P + P++ + N A V P P ++YPVASSGRGFL Sbjct: 2 SNTSSTPTTTIRPSSSSTNAAAAVTMSMRGPCPTTSYQEQQCPTTAGVMYPVASSGRGFL 61 Query: 1320 TKQTTPVTLA----NPGPNSPLFQSRPGMSYPRPPFGYSHSDPSLVQGMGYVTGRPTHVH 1153 L +P P+ F + PRPP PSL PTH H Sbjct: 62 PTNHPCRPLLPYHHHPHPHPHHFAN------PRPP------SPSLS------LPHPTHFH 103 Query: 1152 HGATMGNSAGAAGVMPGVIKGVPVSVASSHTQHKITISPNSNSDCNGHKDLRDRTKDESA 973 P+ S K+ SP+S S+ NG+K++RDRTKD+S Sbjct: 104 P---------------------PLKALSLSLHPKVAPSPSSLSETNGYKNVRDRTKDDS- 141 Query: 972 FATIRDRKVRISDNVSLYALCRSWLRNGLPEESQLQYTDAVRSLPRPSPLSAQD----IT 805 +RDRKVRI+D S+YALCRSWLRNG P+E+Q QY D +SLP+P P+ D T Sbjct: 142 LVNVRDRKVRITDGASVYALCRSWLRNGFPDETQPQYGDVSKSLPQPLPIPVTDNLLKDT 201 Query: 804 SPVXXXXXXXXXXXXETVEHLTPKELLQRHVKHAKXXXXXXXXXXXXXXXRYKTRLALLL 625 ++VE+L+ ++LL+RH+ AK RYKTRLALLL Sbjct: 202 EDEEEQEQEDKKEDEQSVENLSAQDLLKRHIDRAKKVRSRLRQERLKRIARYKTRLALLL 261 Query: 624 PPMVDPQLRNDLAPGN 577 PP+V+ Q R+D A GN Sbjct: 262 PPLVE-QFRSDAAAGN 276 >gb|EOX96772.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 1 [Theobroma cacao] Length = 277 Score = 145 bits (366), Expect = 5e-32 Identities = 113/317 (35%), Positives = 149/317 (47%), Gaps = 13/317 (4%) Frame = -1 Query: 1488 SNHHPLPVNSRLPTNPNPNYAHLVP----SPHPHDVGXXXXXXXXXAILYPVASSGRGFL 1321 SN P + P++ + N A V P P ++YPVASSGRGFL Sbjct: 2 SNTSSTPTTTIRPSSSSTNAAAAVTMSMRGPCPTTSYQEQQCPTTAGVMYPVASSGRGFL 61 Query: 1320 TKQTTPVTLA----NPGPNSPLFQSRPGMSYPRPPFGYSHSDPSLVQGMGYVTGRPTHVH 1153 L +P P+ F + PRPP PSL PTH H Sbjct: 62 PTNHPCRPLLPYHHHPHPHPHHFAN------PRPP------SPSLS------LPHPTHFH 103 Query: 1152 HGATMGNSAGAAGVMPGVIKGVPVSVASSHTQHKITISPNSNSDCNGHKDLRDRTKDESA 973 P+ S K+ SP+S S+ NG+K++RDRTKD+S Sbjct: 104 P---------------------PLKALSLSLHPKVAPSPSSLSETNGYKNVRDRTKDDS- 141 Query: 972 FATIRDRKVRISDNVSLYALCRSWLRNGLPEESQL-QYTDAVRSLPRPSPLSAQD----I 808 +RDRKVRI+D S+YALCRSWLRNG P+E+Q QY D +SLP+P P+ D Sbjct: 142 LVNVRDRKVRITDGASVYALCRSWLRNGFPDETQQPQYGDVSKSLPQPLPIPVTDNLLKD 201 Query: 807 TSPVXXXXXXXXXXXXETVEHLTPKELLQRHVKHAKXXXXXXXXXXXXXXXRYKTRLALL 628 T ++VE+L+ ++LL+RH+ AK RYKTRLALL Sbjct: 202 TEDEEEQEQEDKKEDEQSVENLSAQDLLKRHIDRAKKVRSRLRQERLKRIARYKTRLALL 261 Query: 627 LPPMVDPQLRNDLAPGN 577 LPP+V+ Q R+D A GN Sbjct: 262 LPPLVE-QFRSDAAAGN 277 >ref|XP_006439027.1| hypothetical protein CICLE_v10032226mg [Citrus clementina] gi|568858552|ref|XP_006482814.1| PREDICTED: RNA-binding protein 33-like [Citrus sinensis] gi|557541223|gb|ESR52267.1| hypothetical protein CICLE_v10032226mg [Citrus clementina] Length = 297 Score = 144 bits (364), Expect = 9e-32 Identities = 108/310 (34%), Positives = 144/310 (46%), Gaps = 12/310 (3%) Frame = -1 Query: 1470 PVNSRLPTNPNPNYAHLVPSPHPHDVGXXXXXXXXXAILYPVASSGRGFLTKQTTP---- 1303 P + + P H+ + P + ++YPVASSGRGF+ K P Sbjct: 6 PFTTATTPSQAPPQQHMYQNQRPANPSHSQGQAQGQGVVYPVASSGRGFIPKPMRPSDQT 65 Query: 1302 VTLANPGPNSPLFQSRPGMSYPRPPFGYSHSDPSLVQGMGYVTGRPTHVHHGATMGNSAG 1123 VT+AN G P P YPRP +H P L + RP +++ Sbjct: 66 VTVANHGGYPPRPNQLP--PYPRPHLD-NHHHPVLHHHQHHHMIRPPPLNNQQHQHPQIS 122 Query: 1122 AAGVMPGVIKGVPVS------VASSHTQHKITISPNSNSDCNGHKDLRDRTKDESAFATI 961 + P I+GVPVS SS I P+SN D + F + Sbjct: 123 SN---PSPIRGVPVSSGHLKVAPSSSASLSPVIPPDSNGD-----------NSDETFTIV 168 Query: 960 RDRKVRISDNVSLYALCRSWLRNGLPEESQLQYTDAVRSLPRPSPLSAQD--ITSPVXXX 787 RDRKVRI++ SLYALCRSWLRNG PEE+Q Q+ D V+SLPRP P+ D I Sbjct: 169 RDRKVRITEGASLYALCRSWLRNGSPEETQPQHADGVKSLPRPLPMPRADANIAKEKESE 228 Query: 786 XXXXXXXXXETVEHLTPKELLQRHVKHAKXXXXXXXXXXXXXXXRYKTRLALLLPPMVDP 607 E V+ L+ ++LL+RHV+ AK RYKTRL+LLLPP+V+ Sbjct: 229 EDEDETDEDENVDRLSEEDLLRRHVQRAKQIRARLSNERAKRIERYKTRLSLLLPPLVE- 287 Query: 606 QLRNDLAPGN 577 Q +ND G+ Sbjct: 288 QSQNDAHAGS 297 >ref|XP_004149622.1| PREDICTED: uncharacterized protein LOC101211370 [Cucumis sativus] Length = 376 Score = 144 bits (364), Expect = 9e-32 Identities = 105/271 (38%), Positives = 136/271 (50%), Gaps = 10/271 (3%) Frame = -1 Query: 1359 ILYPVASSGRGFLTKQTTP------VTLANPGPNSPLFQSRPGMSYPRPPFGYSHSDPSL 1198 ILYPVASSGRGF+ + P VTLANPG + RP +++P P G H D S+ Sbjct: 127 ILYPVASSGRGFVPRTIRPLPADQAVTLANPGG----YPHRPVVTFPHRPIGSPHLD-SM 181 Query: 1197 VQGMGYVTGRPTHVHHGATMGNSAGAAGVMPGVIKGVPVSVASSHTQHKITISPNSNSDC 1018 M ++T P S+ + G IK P ++ P + + Sbjct: 182 SHPM-HMTRPPNLQQQLIPFSGSS-----ISGSIKCAP------NSSDPKAFPPQTICES 229 Query: 1017 NGHKDLRDRTKDESAFATIRDRKVRISDNVSLYALCRSWLRNGLPEESQLQYTDAVRSLP 838 NG K++R R + +RDRKVRI+D SLYALCRSWLRNG EESQ QY RSLP Sbjct: 230 NGCKEMRVR---DDTLCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGSFFRSLP 286 Query: 837 RPSPLSAQDIT----SPVXXXXXXXXXXXXETVEHLTPKELLQRHVKHAKXXXXXXXXXX 670 RP P++ V ++EHL+ +ELL+RHV+ AK Sbjct: 287 RPLPIAVAGAAPLQKKEVVKEEVDEKDKDEGSIEHLSTQELLKRHVRRAKKVRSRLREER 346 Query: 669 XXXXXRYKTRLALLLPPMVDPQLRNDLAPGN 577 RYKTRLALLLPP ++ QLR D G+ Sbjct: 347 LQRIERYKTRLALLLPPPIE-QLRTDNVTGS 376 >gb|EXC42165.1| hypothetical protein L484_002415 [Morus notabilis] Length = 454 Score = 133 bits (334), Expect = 3e-28 Identities = 101/286 (35%), Positives = 141/286 (49%), Gaps = 14/286 (4%) Frame = -1 Query: 1515 PYTCPAPFYSNHHPLPVNSRLPTNPNPNYAHLVPSPHPHDVGXXXXXXXXXAILYPVASS 1336 P P F +N PLP P+ NY S G I YPV SS Sbjct: 46 PRLPPELFAANIRPLP--------PHRNYIPASASVSAPPQG----------IPYPVVSS 87 Query: 1335 GRGFLT----KQTTP-------VTLANPGPNSPLFQSRPGMSYPRPPFGYSHSDPSLVQG 1189 GRGF++ ++P VT+A+P P+ ++ RP +Y P + H Sbjct: 88 GRGFISLPKSSSSSPAAGADQTVTVASPNPSG--YRPRPAANYVVRPIQHIH-------- 137 Query: 1188 MGYVTGRPTHVHHGATMGNSAGAAGVMPGVIKGVPVSVASSHTQHKITISPNSNSDCNGH 1009 H HH + ++ G +KGVPVS+ Q K+ SP S DCNG+ Sbjct: 138 ---------HYHHHQQQPH------LVAGPVKGVPVSI---QLQPKVPPSP-SVPDCNGY 178 Query: 1008 KDLRDRTKDESAFATIRDRKVRISDNVSLYALCRSWLRNGLPEESQLQYTDAVRSLPRPS 829 KD+RD+ +D+S +RDRKVRI+++ SLYALC+SWLRNG EESQ QY DAV SLPRP Sbjct: 179 KDMRDKVRDDS-LTIVRDRKVRITEDASLYALCQSWLRNGFSEESQKQYGDAVMSLPRPL 237 Query: 828 PL---SAQDITSPVXXXXXXXXXXXXETVEHLTPKELLQRHVKHAK 700 P+ + + E+V++L+ ++L +RH+K AK Sbjct: 238 PIPMATNNEQKKEGEEDDNDGDEEDEESVKNLSAEDLFKRHLKRAK 283 >ref|XP_003546862.1| PREDICTED: uncharacterized protein LOC100779268 isoform X1 [Glycine max] Length = 274 Score = 129 bits (324), Expect = 4e-27 Identities = 92/264 (34%), Positives = 127/264 (48%), Gaps = 36/264 (13%) Frame = -1 Query: 1260 SRPGMSYPRPPFGYSHSDPSLVQGMGYVTGRPTHVHHGATMG----NSAGAAGVMP---- 1105 SRP P+P + H PS Q + + P V+ A G + AG + P Sbjct: 13 SRPISPLPQPQQQHHHHYPSQQQTLPILAPNPHFVYPFAPKGVRAADHAGVSAAFPPPSM 72 Query: 1104 ---GVIKGVPVSVASSHTQH---------------------KITISPNSNSDCNGHKDLR 997 G ++GVP+ S H H K + ++ +D NG KD Sbjct: 73 MYSGGVRGVPLDYFS-HALHVGRPPTHVPFPHAAPAASPPVKKAAARSAVADVNGGKDTN 131 Query: 996 DRTKD-ESAFATIRDRKVRISDNVSLYALCRSWLRNGLPEESQLQYTDAVRSLPRPSPLS 820 R K E F +RDRKVR++D+ SLYALCRSWLRNG+ EESQ Q D +++LP+P P S Sbjct: 132 TREKSSEDTFIVVRDRKVRVTDDASLYALCRSWLRNGINEESQPQQKDVIKALPKPLPAS 191 Query: 819 ---AQDITSPVXXXXXXXXXXXXETVEHLTPKELLQRHVKHAKXXXXXXXXXXXXXXXRY 649 + ++VEHL+P++LL+RH+K AK RY Sbjct: 192 MVASYLSNKKEDEKDEDEKEENEQSVEHLSPQDLLKRHIKRAKNVRARLREERLQRITRY 251 Query: 648 KTRLALLLPPMVDPQLRNDLAPGN 577 ++RL LLLPP ++ Q RND A GN Sbjct: 252 RSRLRLLLPPAIE-QCRNDTAAGN 274 >ref|XP_004306495.1| PREDICTED: uncharacterized protein LOC101308794 [Fragaria vesca subsp. vesca] Length = 254 Score = 128 bits (321), Expect = 9e-27 Identities = 92/235 (39%), Positives = 123/235 (52%), Gaps = 6/235 (2%) Frame = -1 Query: 1263 QSRPG-MSYP--RPPFGYSHSDPSLVQGMGYVTGRPTHVHHGATMGNSAGAAGVMPGVIK 1093 QS PG + YP RPP+ P + + P H+H ++P IK Sbjct: 42 QSSPGALVYPSARPPY------PPPLNFHPHPHPYPPHLHPSPP---PPAYQSLLPPPIK 92 Query: 1092 GVPVSVASSHTQHKITISPNSNSDCNGHKDLRDRTKDESAFATIRDRKVRISDNVSLYAL 913 + S + P+S D NG +RD+ +D++ F I+DRKVRI+D SLY L Sbjct: 93 DLRFS--------GLVAPPSSVPDSNG---IRDKGRDDTQFL-IQDRKVRITDGASLYVL 140 Query: 912 CRSWLRNGLPEESQLQYTDAVRSLPRPSPLSAQDITSP---VXXXXXXXXXXXXETVEHL 742 CRSWLRNG EESQ +Y DA RSLP+PSP+ P E+VEH+ Sbjct: 141 CRSWLRNGTSEESQPRYGDATRSLPKPSPIPMASAIPPNKDEGDKKEDNEDKVEESVEHV 200 Query: 741 TPKELLQRHVKHAKXXXXXXXXXXXXXXXRYKTRLALLLPPMVDPQLRNDLAPGN 577 +P++LL+RH+K A+ RYK+RLALLLPP+V+ Q RNDLA GN Sbjct: 201 SPEDLLKRHIKRARKVRARLREERLRRIARYKSRLALLLPPLVE-QFRNDLAAGN 254 >ref|NP_180843.2| proline-rich uncharacterized protein [Arabidopsis thaliana] gi|26450185|dbj|BAC42211.1| unknown protein [Arabidopsis thaliana] gi|28827576|gb|AAO50632.1| unknown protein [Arabidopsis thaliana] gi|330253655|gb|AEC08749.1| proline-rich uncharacterized protein [Arabidopsis thaliana] Length = 337 Score = 125 bits (315), Expect = 4e-26 Identities = 107/322 (33%), Positives = 141/322 (43%), Gaps = 12/322 (3%) Frame = -1 Query: 1521 AHPYTCPAPFYSNHHPLPVNSRLPTNPNP--------NYAHLVPSPHPHDVGXXXXXXXX 1366 A Y AP + HHP + + TNP P N H P P P Sbjct: 51 ASSYRAIAPLH-RHHP---HQNIYTNPLPIRRSNSVTNSPHQPPHPDPSS---------- 96 Query: 1365 XAILYPVASSGRGFLTKQTTPVTLANPGPNSPLFQSRPGMSYPRPP-FGYSHSDPSLVQG 1189 ++YP SSGRGF T+ PV + P+ PG PR P +GY H V Sbjct: 97 --LIYPFGSSGRGFPTR---PVRQNSNSVADPVGSPSPGGYTPRGPVYGYHHGQ--FVSN 149 Query: 1188 MGYVTGRPTHVHHGATMGNSAGAAGVMPGVIKGVPVSVASSHTQHKITISPNSNSDCNGH 1009 + + H G+ G +KGVP Q + T SP S D +GH Sbjct: 150 LDPMNQFMRAAHPQNQQSPQLGS-----GHMKGVP-----HFLQPRATPSPTSILDNSGH 199 Query: 1008 KDLRDRTKDESAFATIRDRKVRISDNVSLYALCRSWLRNGLPEESQLQYTDAVRSLPRPS 829 K R R + A +R RKVRI++ SLY+LCRSWLRNG E + Q D + LP+P Sbjct: 200 KKARSR---DDALVLVRKRKVRITEGASLYSLCRSWLRNGAHEGIKPQRIDMMTCLPKPL 256 Query: 828 PLSAQDITSP---VXXXXXXXXXXXXETVEHLTPKELLQRHVKHAKXXXXXXXXXXXXXX 658 P+ + + P V E+V+HL+ +LL+RH+ AK Sbjct: 257 PVDKTETSLPKDLVEEAICEEDKEDEESVKHLSESDLLKRHIDRAKKVRARLREERLKRI 316 Query: 657 XRYKTRLALLLPPMVDPQLRND 592 RYK RLALLLPP + Q RN+ Sbjct: 317 ARYKARLALLLPPFGE-QCRNE 337 >ref|XP_006294535.1| hypothetical protein CARUB_v10023571mg [Capsella rubella] gi|482563243|gb|EOA27433.1| hypothetical protein CARUB_v10023571mg [Capsella rubella] Length = 339 Score = 125 bits (314), Expect = 6e-26 Identities = 107/316 (33%), Positives = 143/316 (45%), Gaps = 6/316 (1%) Frame = -1 Query: 1521 AHPYTCPAPFYSNHHPLPVNSRLPTNPNP--NYAHLVPSPH-PHDVGXXXXXXXXXAILY 1351 A Y APF+ + HP P + T+P+P + SPH PH ++Y Sbjct: 51 APSYRAIAPFHRHPHPHPHQNHY-THPSPIRRSNSVAGSPHQPHP-----PQPDPSTLIY 104 Query: 1350 PVASSGRGFLTKQTTPVTLANPGPNSPLFQSRPGMSYPRPPFGYSHSDPSLVQGMGYVTG 1171 P SSGRGF T+ P + P+ + PG PRP + Y H G Sbjct: 105 PFGSSGRGFPTR---PARQNSNSVADPV--ASPGGHPPRPVYAYHH-------GQFGSNL 152 Query: 1170 RPTHVHHGATMGNSAGAAGVMPGVIKGVPVSVASSHTQHKITISPNSNSDCNGHKDLRDR 991 P A + + + PG +KGVP Q + T SP S D GHK R R Sbjct: 153 DPMFQFMRAAHPQNQQSPQLGPGHMKGVP-----HFLQPRATPSPTSILDNVGHKKARSR 207 Query: 990 TKDESAFATIRDRKVRISDNVSLYALCRSWLRNGLPEESQLQYTDAVRSLPRPSPLSAQD 811 + A +R RKVRI++ SLY+LCRSWLRNG E Q Q +D + LP+P P+ + Sbjct: 208 ---DDALVLVRKRKVRITEGASLYSLCRSWLRNGAHEGIQPQRSDTLTCLPKPLPVDMTE 264 Query: 810 ITSP---VXXXXXXXXXXXXETVEHLTPKELLQRHVKHAKXXXXXXXXXXXXXXXRYKTR 640 + P V E+V+ L+ +LL+RHV AK RYK R Sbjct: 265 TSLPKDSVEEPNPEEDKEDEESVKELSTSDLLKRHVDRAKKVRSRLREDRLKRIARYKAR 324 Query: 639 LALLLPPMVDPQLRND 592 LALLLPP + Q RN+ Sbjct: 325 LALLLPPFGE-QCRNE 339 >ref|XP_002881247.1| hypothetical protein ARALYDRAFT_482225 [Arabidopsis lyrata subsp. lyrata] gi|297327086|gb|EFH57506.1| hypothetical protein ARALYDRAFT_482225 [Arabidopsis lyrata subsp. lyrata] Length = 334 Score = 125 bits (313), Expect = 8e-26 Identities = 102/308 (33%), Positives = 142/308 (46%), Gaps = 12/308 (3%) Frame = -1 Query: 1479 HPLPVNSRLPTNPNPNYAHLVPSPHPHDVGXXXXXXXXXAILYPVASSGRGFLTK--QTT 1306 HPLP+ +P+ H PS ++YP SSGRGF T+ + Sbjct: 71 HPLPIRRSNSVTNSPHQPHPDPSS----------------LIYPFGSSGRGFPTRPGRQN 114 Query: 1305 PVTLANPGPNSPLFQSRPGMSYPRPPFGY-------SHSDPSLVQGMGYVTGRPTHVHHG 1147 ++A+P PG PRP +GY S+ DP L Q M R H+ + Sbjct: 115 SNSVADP-------VGSPGGYPPRPVYGYHQHGQFGSNLDPVLQQLM-----RAAHLQNQ 162 Query: 1146 ATMGNSAGAAGVMPGVIKGVPVSVASSHTQHKITISPNSNSDCNGHKDLRDRTKDESAFA 967 + +G +KGVP Q ++T SP S D +GHK R R + A Sbjct: 163 QSPQLGSGH-------MKGVP-----HFLQPRVTPSPTSILDNSGHKKARSR---DDALV 207 Query: 966 TIRDRKVRISDNVSLYALCRSWLRNGLPEESQLQYTDAVRSLPRPSPLSAQDITSP---V 796 +R RKVRI++ SLY+LCRSWLRNG E + Q +D + LP+P P+ + + P V Sbjct: 208 LVRKRKVRITEGASLYSLCRSWLRNGAHEGIKPQRSDTMTCLPKPLPVDMTETSLPKEVV 267 Query: 795 XXXXXXXXXXXXETVEHLTPKELLQRHVKHAKXXXXXXXXXXXXXXXRYKTRLALLLPPM 616 E+V+HL+ +LL+RH+ AK RYK RLALLLPP Sbjct: 268 EEPNREEDKEDEESVKHLSESDLLKRHIDRAKKVRSRLREERLKRIARYKARLALLLPPF 327 Query: 615 VDPQLRND 592 + Q RN+ Sbjct: 328 GE-QCRNE 334 >ref|XP_006598246.1| PREDICTED: uncharacterized protein LOC100779268 isoform X2 [Glycine max] Length = 288 Score = 122 bits (306), Expect = 5e-25 Identities = 93/278 (33%), Positives = 128/278 (46%), Gaps = 50/278 (17%) Frame = -1 Query: 1260 SRPGMSYPRPPFGYSHSDPSLVQGMGYVTGRPTHVHHGATMG----NSAGAAGVMP---- 1105 SRP P+P + H PS Q + + P V+ A G + AG + P Sbjct: 13 SRPISPLPQPQQQHHHHYPSQQQTLPILAPNPHFVYPFAPKGVRAADHAGVSAAFPPPSM 72 Query: 1104 ---GVIKGVPVSVASSHTQH---------------------KITISPNSNSDCNGHKDLR 997 G ++GVP+ S H H K + ++ +D NG KD Sbjct: 73 MYSGGVRGVPLDYFS-HALHVGRPPTHVPFPHAAPAASPPVKKAAARSAVADVNGGKDTN 131 Query: 996 DRTKD-ESAFATIRDRKVRISDNVSLYALCRSWLRNGLPEESQL--------------QY 862 R K E F +RDRKVR++D+ SLYALCRSWLRNG+ EESQL Q Sbjct: 132 TREKSSEDTFIVVRDRKVRVTDDASLYALCRSWLRNGINEESQLLSFAFYSLAALTEPQQ 191 Query: 861 TDAVRSLPRPSPLS---AQDITSPVXXXXXXXXXXXXETVEHLTPKELLQRHVKHAKXXX 691 D +++LP+P P S + ++VEHL+P++LL+RH+K AK Sbjct: 192 KDVIKALPKPLPASMVASYLSNKKEDEKDEDEKEENEQSVEHLSPQDLLKRHIKRAKNVR 251 Query: 690 XXXXXXXXXXXXRYKTRLALLLPPMVDPQLRNDLAPGN 577 RY++RL LLLPP ++ Q RND A GN Sbjct: 252 ARLREERLQRITRYRSRLRLLLPPAIE-QCRNDTAAGN 288 >gb|EOX96774.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 3 [Theobroma cacao] Length = 202 Score = 122 bits (305), Expect = 6e-25 Identities = 85/227 (37%), Positives = 113/227 (49%), Gaps = 8/227 (3%) Frame = -1 Query: 1356 LYPVASSGRGFLTKQTTPVTLA----NPGPNSPLFQSRPGMSYPRPPFGYSHSDPSLVQG 1189 +YPVASSGRGFL L +P P+ F + PRPP PSL Sbjct: 1 MYPVASSGRGFLPTNHPCRPLLPYHHHPHPHPHHFAN------PRPP------SPSLS-- 46 Query: 1188 MGYVTGRPTHVHHGATMGNSAGAAGVMPGVIKGVPVSVASSHTQHKITISPNSNSDCNGH 1009 PTH H P+ S K+ SP+S S+ NG+ Sbjct: 47 ----LPHPTHFHP---------------------PLKALSLSLHPKVAPSPSSLSETNGY 81 Query: 1008 KDLRDRTKDESAFATIRDRKVRISDNVSLYALCRSWLRNGLPEESQLQYTDAVRSLPRPS 829 K++RDRTKD+S +RDRKVRI+D S+YALCRSWLRNG P+E+Q QY D +SLP+P Sbjct: 82 KNVRDRTKDDS-LVNVRDRKVRITDGASVYALCRSWLRNGFPDETQPQYGDVSKSLPQPL 140 Query: 828 PLSAQD----ITSPVXXXXXXXXXXXXETVEHLTPKELLQRHVKHAK 700 P+ D T ++VE+L+ ++LL+RH+ AK Sbjct: 141 PIPVTDNLLKDTEDEEEQEQEDKKEDEQSVENLSAQDLLKRHIDRAK 187 >ref|XP_006585335.1| PREDICTED: uncharacterized protein LOC100808873 isoform X1 [Glycine max] Length = 271 Score = 121 bits (304), Expect = 8e-25 Identities = 88/245 (35%), Positives = 117/245 (47%), Gaps = 13/245 (5%) Frame = -1 Query: 1272 PLFQSRPGMSYPRPPFGYSHSD------PSLVQG---MGYVTGRPTHVHHGATMGNSAGA 1120 P+ P YP P G +D PS++ G + Y + HV T + A Sbjct: 40 PIRAPNPHFVYPFAPKGVRAADQGPFPPPSMMHGGVPLDYFS-HALHVARPPTHVPFSHA 98 Query: 1119 AGVMPGVIKGVPVSVASSHTQHKITISPNSNSDCNGHKDLRDRTKD-ESAFATIRDRKVR 943 A P V S A S H NG KD R K E + +RDRKVR Sbjct: 99 AAAAPAASPPVKKSAARSAVAH-----------VNGGKDTNTREKSREDTYIVVRDRKVR 147 Query: 942 ISDNVSLYALCRSWLRNGLPEESQLQYTDAVRSLPRPSPLS---AQDITSPVXXXXXXXX 772 I+++ SLYALCRSWLRNG+ EESQ Q D +++LP+P P S + Sbjct: 148 ITEDASLYALCRSWLRNGINEESQSQQKDVMKALPKPLPASMVASYLSNKKEDEKDEDEK 207 Query: 771 XXXXETVEHLTPKELLQRHVKHAKXXXXXXXXXXXXXXXRYKTRLALLLPPMVDPQLRND 592 ++VEHL+P++LL+RH+K AK RY++RL LLLPP ++ Q RND Sbjct: 208 EENEQSVEHLSPQDLLKRHIKRAKKVRACLREERLQRITRYRSRLRLLLPPAIE-QCRND 266 Query: 591 LAPGN 577 A GN Sbjct: 267 TAAGN 271