BLASTX nr result
ID: Catharanthus22_contig00012529
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00012529
         (2171 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                        (bits) Value

gb|EMJ10328.1| hypothetical protein PRUPE_ppa005552mg [Prunus pe...   506  e-140
ref|XP_006476541.1| PREDICTED: uncharacterized protein LOC102626...   504  e-140
ref|XP_006439523.1| hypothetical protein CICLE_v10020073mg [Citr...   501  e-139
ref|XP_002270742.2| PREDICTED: uncharacterized protein LOC100241...   493  e-136
ref|XP_006343965.1| PREDICTED: uncharacterized protein At1g76660...   486  e-134
ref|XP_004245591.1| PREDICTED: uncharacterized protein LOC101254...   485  e-134
ref|XP_002298027.2| hypothetical protein POPTR_0001s09590g [Popu...   479  e-132
ref|XP_002509822.1| conserved hypothetical protein [Ricinus comm...   476  e-131
ref|XP_006368761.1| hypothetical protein POPTR_0001s09590g [Popu...   473  e-130
gb|EXB37330.1| hypothetical protein L484_024256 [Morus notabilis]     470  e-129
gb|EOY24784.1| Hydroxyproline-rich glycoprotein family protein [...   470  e-129
ref|XP_004298812.1| PREDICTED: uncharacterized protein LOC101309...   447  e-123
emb|CBI34651.3| unnamed protein product [Vitis vinifera]              433  e-118
ref|XP_004298813.1| PREDICTED: uncharacterized protein LOC101309...   430  e-117
ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264...   424  e-116
gb|EOY23266.1| Hydroxyproline-rich glycoprotein family protein i...   408  e-111
ref|XP_002304504.1| hypothetical protein POPTR_0003s12950g [Popu...   403  e-109
gb|EOY23267.1| Hydroxyproline-rich glycoprotein family protein i...   403  e-109
ref|XP_002513675.1| conserved hypothetical protein [Ricinus comm...
394 e-107 ref|XP_003525388.1| PREDICTED: uncharacterized protein LOC100791... 389 e-105 >gb|EMJ10328.1| hypothetical protein PRUPE_ppa005552mg [Prunus persica] Length = 455 Score = 506 bits (1304), Expect = e-140 Identities = 270/463 (58%), Positives = 315/463 (68%) Frame = -2 Query: 1828 RKVREEMRGGGGVNNPLDXXXXXXXXXXXAESRPPHAAVQKKRWRSCWSLYWCFGSPKHR 1649 R+V E R G NN L+ AE+R P A VQK+RW S WS+YWCFG +H+ Sbjct: 2 RRVNGESRTG---NNALETINAAASAIAAAENRVPQATVQKRRWGSWWSMYWCFGFQRHK 58 Query: 1648 KRIGRAVLVPEPTSSRVDAPAAENPPQAASVVLPFIXXXXXXXSFLQSEPPSAMQSPAGG 1469 KRIG AVLVPE T DAP AENP Q S+VLPF+ SFLQSEPPSA QSPAG Sbjct: 59 KRIGHAVLVPETTDRGGDAPRAENPIQTPSIVLPFVAPPSSPASFLQSEPPSATQSPAGF 118 Query: 1468 LSLTSISASMYSPGGPASMFAIGPYAHETQLVTPPAFSTFTTEPSTAPFTPPPESVQITT 1289 SLT ASMYSP GP S+FAIGPYAHETQLV+PP FSTFTTEPSTAPFTPPPESV +TT Sbjct: 119 FSLT---ASMYSPSGPTSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTT 175 Query: 1288 PSSPEVPFARLLDPIHQNGEDGPRFPFSQYEFQSYQLQPGSPVSHLXXXXXXXXXXXXXX 1109 PSSPEVPFA+LLDP +NGE G RFP S YEFQSYQL PGSPV L Sbjct: 176 PSSPEVPFAQLLDPHFRNGEGGQRFPLSHYEFQSYQLYPGSPVGQLISPSSGISGSGTSS 235 Query: 1108 PFPDGEFVPGRPHFLQFRSGDPPHLLNLEKMASHKWGSHQGSGTLTPDAGVPRSQNNFLL 929 PFPD EF HFL+FR+GDPP LLNL+ +++ WGS GSG++TPD S + FLL Sbjct: 236 PFPDLEFAARGHHFLEFRTGDPPKLLNLDILSTRDWGSRLGSGSVTPDGAKSTSSDGFLL 295 Query: 928 DHQHSDISPPSNSYNFWRNDETVVDHRVSFEITEEEVVRCVEKKPVALPKTVLSSLENAE 749 Q ++ S N RN++ ++HRVSFE++ EEV+RCVEKKPVAL + V +SLE+ E Sbjct: 296 KPQTPEVVLNPRSNNRGRNNDISINHRVSFELSSEEVIRCVEKKPVALAEAVSTSLEDTE 355 Query: 748 SVVKKEDNPKETANEPASCVGETSNSTSERASIDGEEGQRHQKQRSTTLGSTKEFNFDSV 569 KED P + + VGETSN +E+A DGEE Q H KQRS TLGS KEFNFD+ Sbjct: 356 KAQSKED-PSKVVSSSICPVGETSNDAAEKAVADGEEAQLHPKQRSITLGSVKEFNFDNP 414 Query: 568 DGVNSDKPNIGTAWWANEKVVGKEIETTGKNWTFFPVMQPGVS 440 DG +S +IG+ WWANEKV KE T KNW+FFP+MQPGVS Sbjct: 415 DGGDSGN-SIGSDWWANEKVDAKENGPT-KNWSFFPMMQPGVS 455 >ref|XP_006476541.1| PREDICTED: uncharacterized protein LOC102626793 [Citrus sinensis] Length = 460 Score = 504 bits (1298), Expect = e-140 
Identities = 268/464 (57%), Positives = 317/464 (68%), Gaps = 7/464 (1%) Frame = -2 Query: 1810 MRGGGG-----VNNPLDXXXXXXXXXXXAESRPPHAAVQKKRWRSCWSLYWCFGSPKHRK 1646 MRG G +NN L+ AE+R A QK+RW CWS+ WCFG KHRK Sbjct: 1 MRGVNGGDSRALNNSLETINAAATAIASAENRVHQATSQKRRWGGCWSISWCFGFQKHRK 60 Query: 1645 RIGRAVLVPEPTSSRVDAPAAENPPQAASVVLPFIXXXXXXXSFLQSEPPSAMQSPAGGL 1466 RIG AVLVPEPT+SR +A A N QAA++ LPF+ SFLQSEPPSA QSPAG + Sbjct: 61 RIGHAVLVPEPTASRSNASEAVNSTQAAAISLPFVAPPSSPASFLQSEPPSATQSPAGLV 120 Query: 1465 SLTSISASMYSPGGPASMFAIGPYAHETQLVTPPAFSTFTTEPSTAPFTPPPESVQITTP 1286 SL SIS +MYSPGGP+S+FAIGPYAHETQLV+PP FSTFTTEPSTAPFTPPPESV +TTP Sbjct: 121 SLNSISGNMYSPGGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTP 180 Query: 1285 SSPEVPFARLLDPIHQNGEDGPRFPFSQYEFQSYQLQPGSPVSHLXXXXXXXXXXXXXXP 1106 SSPEVPFA+LLDP + GE G +FPFS YEFQSY L PGSPV +L P Sbjct: 181 SSPEVPFAQLLDPSLRFGEQGQKFPFSYYEFQSYHLHPGSPVGNLISPSSGISGSGTSSP 240 Query: 1105 FPDGEFVPGRPHFLQFRSGDPPHLLNLEKMASHKWGSHQGSGTLTPDAGVPRSQNNFLLD 926 FPDGEF P F F GDPP LLNL+K++ +WGS QGSGTLTPDA +N F + Sbjct: 241 FPDGEFATAGPQFPDFHRGDPPKLLNLDKLSIREWGSRQGSGTLTPDAVGSTPRNGFFQN 300 Query: 925 HQHSDISPPSNSYNFWRNDETVVDHRVSFEITEEEVVRCVEKKPVALPKTVLSSLENAES 746 Q S+++ +S N R D+ +VDHRVSFE+T E+VVRCVEKKP L + V SL+N + Sbjct: 301 RQISEVALRPHSENGLRKDQ-IVDHRVSFELTTEDVVRCVEKKPTTLAEAVSESLQNG-T 358 Query: 745 VVKKEDNPKETANEPASCVGETSNSTSERASIDGEEGQRHQKQRSTTLGSTKEFNFDSVD 566 V+KE++ E N SC GE +N + +D EE RHQKQ+S TLGSTKEFNFDS D Sbjct: 359 TVEKEESSGEAENVHHSCAGEAANDEPLKTPVDVEEAPRHQKQQSITLGSTKEFNFDSAD 418 Query: 565 GVNSDKPNIGTAWWANEKVVGKEIETTGKNWTFFPVMQ--PGVS 440 G +S +P I + WWANEKVVGK+ KNW FFPV+Q PGVS Sbjct: 419 G-DSHEPTIASDWWANEKVVGKDSGAI-KNWAFFPVIQPAPGVS 460 >ref|XP_006439523.1| hypothetical protein CICLE_v10020073mg [Citrus clementina] gi|557541785|gb|ESR52763.1| hypothetical protein CICLE_v10020073mg [Citrus clementina] Length = 460 Score = 501 bits (1291), Expect = e-139 Identities = 266/464 (57%), Positives = 316/464 (68%), Gaps = 7/464 (1%) Frame = -2 Query: 1810 
MRGGGG-----VNNPLDXXXXXXXXXXXAESRPPHAAVQKKRWRSCWSLYWCFGSPKHRK 1646 MRG G +NN L+ AE+R A QK+RW CW++ WCFG KHRK Sbjct: 1 MRGVNGGDSRALNNSLETISAAATAIASAENRVHQATSQKRRWGGCWNISWCFGFQKHRK 60 Query: 1645 RIGRAVLVPEPTSSRVDAPAAENPPQAASVVLPFIXXXXXXXSFLQSEPPSAMQSPAGGL 1466 RIG AVLVPEPT+SR +A A N QA ++ LPF+ SFLQSEPPSA QSPAG + Sbjct: 61 RIGHAVLVPEPTASRSNASEAVNSTQATAISLPFVAPPSSPASFLQSEPPSATQSPAGLV 120 Query: 1465 SLTSISASMYSPGGPASMFAIGPYAHETQLVTPPAFSTFTTEPSTAPFTPPPESVQITTP 1286 SL SIS +MYSPGGP+S+FAIGPYAHETQLV+PP FSTFTTEPSTAPFTPPPESV +TTP Sbjct: 121 SLNSISGNMYSPGGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTP 180 Query: 1285 SSPEVPFARLLDPIHQNGEDGPRFPFSQYEFQSYQLQPGSPVSHLXXXXXXXXXXXXXXP 1106 SSPEVPFA+LLDP + GE G +FPFS YEFQSY L PGSPV +L P Sbjct: 181 SSPEVPFAQLLDPSLRFGEQGQKFPFSYYEFQSYHLHPGSPVGNLISPSSGISGSGTSSP 240 Query: 1105 FPDGEFVPGRPHFLQFRSGDPPHLLNLEKMASHKWGSHQGSGTLTPDAGVPRSQNNFLLD 926 FPDGEF P F F GDPP LLNL+K++ +WGS QGSGTLTPDA +N F + Sbjct: 241 FPDGEFATAGPQFPDFHRGDPPKLLNLDKLSIREWGSRQGSGTLTPDAVRSTPRNGFFQN 300 Query: 925 HQHSDISPPSNSYNFWRNDETVVDHRVSFEITEEEVVRCVEKKPVALPKTVLSSLENAES 746 Q S+++ +S N R D+ +VDHRVSFE+T E+VVRCVEKKP L + V SL+N + Sbjct: 301 RQISEVALRPHSENGLRKDQ-IVDHRVSFELTTEDVVRCVEKKPTTLAEAVSESLQNG-T 358 Query: 745 VVKKEDNPKETANEPASCVGETSNSTSERASIDGEEGQRHQKQRSTTLGSTKEFNFDSVD 566 V+KE++ E N SC GE +N + +D EE RHQKQ+S TLGSTKEFNFDS D Sbjct: 359 TVEKEESSGEAENVHHSCAGEAANDEPLKTPVDVEEAPRHQKQQSITLGSTKEFNFDSAD 418 Query: 565 GVNSDKPNIGTAWWANEKVVGKEIETTGKNWTFFPVMQ--PGVS 440 G +S +P I + WWANEKVVGK+ KNW FFPV+Q PGVS Sbjct: 419 G-DSHEPTIASDWWANEKVVGKDSGAI-KNWAFFPVIQPAPGVS 460 >ref|XP_002270742.2| PREDICTED: uncharacterized protein LOC100241023 [Vitis vinifera] Length = 479 Score = 493 bits (1268), Expect = e-136 Identities = 268/472 (56%), Positives = 323/472 (68%), Gaps = 21/472 (4%) Frame = -2 Query: 1792 VNNPLDXXXXXXXXXXXAESRPPHAAVQKKRWRSCWSLYWCFGSPKHRKRIGRAVLVPEP 1613 +N+ L+ AE+R P VQK+RW SCW YWCF SPK KRIG AVL PE Sbjct: 11 MNSALETINAAATAIASAENRVPQPTVQKRRWGSCWGEYWCFRSPKD-KRIGHAVLAPES 69 
Query: 1612 TSSRVDAPAAENPPQAASVVLPFIXXXXXXXSFLQSEPPSAMQSPAGGLSLTSISASMYS 1433 + PAAEN QA ++VLPF+ SFLQSEPPSA QSP+G LSLTSI+A++YS Sbjct: 70 RAPGSGVPAAENLTQAPTIVLPFVAPPSSPASFLQSEPPSATQSPSGLLSLTSINANIYS 129 Query: 1432 PGGPASMFAIGPYAHETQLVTPPAFSTFTTEPSTAPFTPPPESVQITTPSSPEVPFARLL 1253 PGGPAS+FAIGPYAHETQLV+PP FSTFTTEPSTAPFTPPPESV +TTPSSPEVPFA+L Sbjct: 130 PGGPASIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLF 189 Query: 1252 DPIHQNGEDGPRFPFSQYEFQSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDGEFV-PGR 1076 DP ++NGE G RF SQYEFQSYQL PGSPV HL PFPD +FV G Sbjct: 190 DPNNRNGEAGHRFLLSQYEFQSYQLYPGSPVGHLISPSSGISGSGTSSPFPDRDFVCSGS 249 Query: 1075 PHFLQFRSGDPPHLLNLEKMASHKWGSHQGSGTLTPDAGVPRSQNNFLLDHQHSD-ISPP 899 FL+FR+G PP LL L+K+++H+WGS GSG++TPDA P S++ +LD Q SD I PP Sbjct: 250 SQFLEFRAGGPPKLLTLDKLSNHEWGSRIGSGSITPDALGPPSRDGSVLDRQVSDVIHPP 309 Query: 898 SN-----------------SYNFWRNDETVVDHRVSFEITEEEVVRCVEKKPVALPKTVL 770 S S + N+E +VDHRVSFE+T E+VVRCVEK AL K V Sbjct: 310 SGDDSVLDRQISDVASHSLSDSGCPNNEIMVDHRVSFELTAEDVVRCVEKDSAALVKAVS 369 Query: 769 SSLENAESVVKKEDNPKETANEPASCVGETSNSTSERASID--GEEGQRHQKQRSTTLGS 596 +SL+N + V+ ++N +E + VGET+N+ E+A D GEEGQ H KQRS TLGS Sbjct: 370 ASLQN-PATVEIDENSREVVVDSEGRVGETANNPPEKAPEDANGEEGQPHHKQRSITLGS 428 Query: 595 TKEFNFDSVDGVNSDKPNIGTAWWANEKVVGKEIETTGKNWTFFPVMQPGVS 440 KEFNFD+ DG +SDKPNI + WWANEKVVGKE+ + KNW+ F +MQP VS Sbjct: 429 AKEFNFDNADGGHSDKPNISSDWWANEKVVGKEVGAS-KNWSIFHMMQPSVS 479 >ref|XP_006343965.1| PREDICTED: uncharacterized protein At1g76660-like [Solanum tuberosum] Length = 443 Score = 486 bits (1252), Expect = e-134 Identities = 255/452 (56%), Positives = 304/452 (67%) Frame = -2 Query: 1795 GVNNPLDXXXXXXXXXXXAESRPPHAAVQKKRWRSCWSLYWCFGSPKHRKRIGRAVLVPE 1616 GV++ L+ E+R P A++QK+RW CWS+YWCFGS K KRIG AV +PE Sbjct: 10 GVDSTLETISAAATAIASVENRVPQASIQKRRWGGCWSMYWCFGSQKQTKRIGHAVFIPE 69 Query: 1615 PTSSRVDAPAAENPPQAASVVLPFIXXXXXXXSFLQSEPPSAMQSPAGGLSLTSISASMY 1436 T+S D P++ QA S+VLPFI SFL SEPPSA SP G L S S Y Sbjct: 70 
TTASGADRPSSNTSSQAPSIVLPFIAPPSSPASFLPSEPPSATHSPVGSKCL---SMSTY 126 Query: 1435 SPGGPASMFAIGPYAHETQLVTPPAFSTFTTEPSTAPFTPPPESVQITTPSSPEVPFARL 1256 SP GPAS+FAIGPYAHETQLV+PP FS FTTEPSTAPFTPPPESV +TTPSSPEVPFA+L Sbjct: 127 SPSGPASIFAIGPYAHETQLVSPPVFSAFTTEPSTAPFTPPPESVHLTTPSSPEVPFAKL 186 Query: 1255 LDPIHQNGEDGPRFPFSQYEFQSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDGEFVPGR 1076 LDP +QN G R+PF+QYEFQSYQLQPGSPVS+L PF D E+ PGR Sbjct: 187 LDPNYQNVAAGHRYPFAQYEFQSYQLQPGSPVSNLISPGSAISVSGTSSPFLDREYTPGR 246 Query: 1075 PHFLQFRSGDPPHLLNLEKMASHKWGSHQGSGTLTPDAGVPRSQNNFLLDHQHSDISPPS 896 P FL NLEK+A H+WGS QGSGTLTP+A P+ +NFLL++Q+S + Sbjct: 247 PQFL-----------NLEKIAPHEWGSRQGSGTLTPEAVNPKYHDNFLLNYQNSGVHRLP 295 Query: 895 NSYNFWRNDETVVDHRVSFEITEEEVVRCVEKKPVALPKTVLSSLENAESVVKKEDNPKE 716 +N W+ND TVVDHRVSFEIT E+VVRCVEKKP + +T SL++ E K+++N E Sbjct: 296 KPFNGWKNDLTVVDHRVSFEITAEDVVRCVEKKPTMMMRTGSVSLQDTERSTKRQENLAE 355 Query: 715 TANEPASCVGETSNSTSERASIDGEEGQRHQKQRSTTLGSTKEFNFDSVDGVNSDKPNIG 536 +N E S E +S DGE+GQR QK RS TLGS+KEFNFD+VDG DK IG Sbjct: 356 MSNGHDHGGHEPSREIHEGSSTDGEDGQRQQKHRSITLGSSKEFNFDNVDGGYPDKATIG 415 Query: 535 TAWWANEKVVGKEIETTGKNWTFFPVMQPGVS 440 + WWANEKV+GKE NW FP+MQPGVS Sbjct: 416 SDWWANEKVLGKE---PCNNW-IFPMMQPGVS 443 >ref|XP_004245591.1| PREDICTED: uncharacterized protein LOC101254118 [Solanum lycopersicum] Length = 443 Score = 485 bits (1248), Expect = e-134 Identities = 253/452 (55%), Positives = 305/452 (67%) Frame = -2 Query: 1795 GVNNPLDXXXXXXXXXXXAESRPPHAAVQKKRWRSCWSLYWCFGSPKHRKRIGRAVLVPE 1616 GV++ L+ E+R P A++QK+RW SCWS+YWCFGS K KRIG AV +PE Sbjct: 10 GVDSTLETINAAATAIASVENRVPQASIQKRRWGSCWSMYWCFGSQKQTKRIGHAVFIPE 69 Query: 1615 PTSSRVDAPAAENPPQAASVVLPFIXXXXXXXSFLQSEPPSAMQSPAGGLSLTSISASMY 1436 T+S D P++ QA S+VLPFI SFL SEPPSA SP G L S S Y Sbjct: 70 TTASAADRPSSNTSSQAPSIVLPFIAPPSSPASFLPSEPPSATHSPVGSKCL---SMSTY 126 Query: 1435 SPGGPASMFAIGPYAHETQLVTPPAFSTFTTEPSTAPFTPPPESVQITTPSSPEVPFARL 1256 SP GPAS+FAIGPYAHETQLV+PP FS FTTEPSTAPFTPPPESV +TTPSSPEVPFA+L Sbjct: 127 
SPSGPASIFAIGPYAHETQLVSPPVFSAFTTEPSTAPFTPPPESVHLTTPSSPEVPFAKL 186 Query: 1255 LDPIHQNGEDGPRFPFSQYEFQSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDGEFVPGR 1076 LDP +QN G R+PF+QYEFQSYQLQPGSPVS+L PF + E+ PGR Sbjct: 187 LDPNYQNVAAGHRYPFAQYEFQSYQLQPGSPVSNLISPGSAISVSGTSSPFLEREYTPGR 246 Query: 1075 PHFLQFRSGDPPHLLNLEKMASHKWGSHQGSGTLTPDAGVPRSQNNFLLDHQHSDISPPS 896 P FL NLEK+A H+WGS QGSGTLTP+A P+ ++FLL++Q++ + Sbjct: 247 PQFL-----------NLEKIAPHEWGSRQGSGTLTPEAVNPKYHDSFLLNYQNTGVHRLP 295 Query: 895 NSYNFWRNDETVVDHRVSFEITEEEVVRCVEKKPVALPKTVLSSLENAESVVKKEDNPKE 716 +N W+ND TVVDHRVSFEIT E+VVRCVEKKP + +T SL++ E K+++N E Sbjct: 296 KPFNGWKNDLTVVDHRVSFEITAEDVVRCVEKKPTMMMRTGSVSLQDTERSTKRQENLAE 355 Query: 715 TANEPASCVGETSNSTSERASIDGEEGQRHQKQRSTTLGSTKEFNFDSVDGVNSDKPNIG 536 +N E S E +S DGE+GQR QK RS TLGS+KEFNFD+VDG DK IG Sbjct: 356 MSNAHDHSGHEPSREIHEGSSTDGEDGQRQQKHRSITLGSSKEFNFDNVDGGYPDKATIG 415 Query: 535 TAWWANEKVVGKEIETTGKNWTFFPVMQPGVS 440 + WWANEKV+GKE NW FP+MQPGVS Sbjct: 416 SDWWANEKVLGKE---PCNNW-IFPMMQPGVS 443 >ref|XP_002298027.2| hypothetical protein POPTR_0001s09590g [Populus trichocarpa] gi|550346901|gb|EEE82832.2| hypothetical protein POPTR_0001s09590g [Populus trichocarpa] Length = 453 Score = 479 bits (1234), Expect = e-132 Identities = 261/461 (56%), Positives = 312/461 (67%), Gaps = 4/461 (0%) Frame = -2 Query: 1810 MRGGGG----VNNPLDXXXXXXXXXXXAESRPPHAAVQKKRWRSCWSLYWCFGSPKHRKR 1643 MRG G NN L+ AE+R P A VQK+RW SCWS+Y CFG KH+K+ Sbjct: 1 MRGFNGESRAANNTLETINAAATAIASAENRVPQATVQKRRWGSCWSIYLCFGYQKHKKQ 60 Query: 1642 IGRAVLVPEPTSSRVDAPAAENPPQAASVVLPFIXXXXXXXSFLQSEPPSAMQSPAGGLS 1463 IG AVL PEP++ APA+ENP QA +V LPF SF QSEPPS QSPAG +S Sbjct: 61 IGHAVLFPEPSAPGNGAPASENPTQAPAVTLPFAAPPSSPASFFQSEPPSVTQSPAGLVS 120 Query: 1462 LTSISASMYSPGGPASMFAIGPYAHETQLVTPPAFSTFTTEPSTAPFTPPPESVQITTPS 1283 LTSISASMYSP GPAS+FAIGPYAHETQLV+PP FSTFTTEPSTAPFTPPPESV +TTPS Sbjct: 121 LTSISASMYSPSGPASIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPS 180 Query: 1282 SPEVPFARLLDPIHQNGEDGPRFPFSQYEFQSYQLQPGSPVSHLXXXXXXXXXXXXXXPF 1103 SPEVPFA+ 
LDP +NG+ G RFPF +FQSYQ PGSPV L PF Sbjct: 181 SPEVPFAQFLDPSLRNGDTGLRFPF---DFQSYQFHPGSPVGQLISPSSGISGSGTSSPF 237 Query: 1102 PDGEFVPGRPHFLQFRSGDPPHLLNLEKMASHKWGSHQGSGTLTPDAGVPRSQNNFLLDH 923 PDGEF G HF +FR G+PP LLNL+K+++ +WGS+QGSG LTP++ V R NFLL Sbjct: 238 PDGEFAVGGAHFPEFRIGEPPKLLNLDKLSTCEWGSYQGSGALTPES-VRRGSPNFLLHR 296 Query: 922 QHSDISPPSNSYNFWRNDETVVDHRVSFEITEEEVVRCVEKKPVALPKTVLSSLENAESV 743 Q SD+ S N +N + VV+HRVSFE+T E+ RCVE+KP KTV +EN + Sbjct: 297 QFSDVPSRPRSGNGHKNGQ-VVNHRVSFELTAEDASRCVEEKPAFSIKTVPEYVENG-TQ 354 Query: 742 VKKEDNPKETANEPASCVGETSNSTSERASIDGEEGQRHQKQRSTTLGSTKEFNFDSVDG 563 K+E N E+ VG TSN + E AS DGE +H+KQ+S TLGS KEFNFD+ D Sbjct: 355 AKEEKNSGESIQSFECRVGVTSNDSPEMASTDGEAAPQHRKQQSITLGSVKEFNFDNADE 414 Query: 562 VNSDKPNIGTAWWANEKVVGKEIETTGKNWTFFPVMQPGVS 440 +S KP+ + WWAN V+GKE ETT KNW+FFP++Q GVS Sbjct: 415 GDSRKPS-SSNWWANGSVIGKEGETT-KNWSFFPMVQSGVS 453 >ref|XP_002509822.1| conserved hypothetical protein [Ricinus communis] gi|223549721|gb|EEF51209.1| conserved hypothetical protein [Ricinus communis] Length = 459 Score = 476 bits (1226), Expect = e-131 Identities = 257/450 (57%), Positives = 307/450 (68%), Gaps = 1/450 (0%) Frame = -2 Query: 1789 NNPLDXXXXXXXXXXXAESRPPHAAVQKKRWRSCWSLYWCFGSPKHRKRIGRAVLVPEPT 1610 NN LD AE+R P A +QK+RW SCWS+YWCFG +HRKRIG AVLVPE + Sbjct: 15 NNALDTINAAASVIASAENRVPQATIQKRRWGSCWSVYWCFGYHRHRKRIGHAVLVPENS 74 Query: 1609 SSRVDAPAAENPP-QAASVVLPFIXXXXXXXSFLQSEPPSAMQSPAGGLSLTSISASMYS 1433 + D+ AAENP QA ++ LPF+ SFLQSEPPSA QSPAG LSLTS+SASMYS Sbjct: 75 APGNDSSAAENPTTQAPTITLPFVAPPSSPASFLQSEPPSASQSPAGILSLTSVSASMYS 134 Query: 1432 PGGPASMFAIGPYAHETQLVTPPAFSTFTTEPSTAPFTPPPESVQITTPSSPEVPFARLL 1253 P GPAS+FAIGPYAHETQLV+PPAFSTFTTEPSTAPFTPPPESVQ+TTPSSPEVPFA+LL Sbjct: 135 PSGPASIFAIGPYAHETQLVSPPAFSTFTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLL 194 Query: 1252 DPIHQNGEDGPRFPFSQYEFQSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDGEFVPGRP 1073 +P ++NGE G RFPFS YEFQSYQ PGSPV L PFPDGEF P Sbjct: 195 EPSNRNGEAGLRFPFSNYEFQSYQFYPGSPVGQLISPSSGISGSGTSSPFPDGEFAAAGP 254 Query: 1072 
HFLQFRSGDPPHLLNLEKMASHKWGSHQGSGTLTPDAGVPRSQNNFLLDHQHSDISPPSN 893 FL+F+ PP LLNL+K++ H+ GS QGSGTLTPDA V + +F LD Q SDI+ + Sbjct: 255 RFLEFQMAVPPKLLNLDKLSVHECGSRQGSGTLTPDA-VRATSCSFPLDRQCSDIASNRH 313 Query: 892 SYNFWRNDETVVDHRVSFEITEEEVVRCVEKKPVALPKTVLSSLENAESVVKKEDNPKET 713 S N D+ V D RVSF+++ E+ +R E KP + K + S++N E +K E Sbjct: 314 SDN-ENKDDQVADLRVSFDLSAEDALRYAEPKPASPVKIMPESMKN-EIAAEKVQKSSEI 371 Query: 712 ANEPASCVGETSNSTSERASIDGEEGQRHQKQRSTTLGSTKEFNFDSVDGVNSDKPNIGT 533 + VGETSN E+AS GE+ RHQK R+ TLG+ KEFNFD+ DGV KP+ G Sbjct: 372 RHNFECRVGETSNGILEQASTGGEKTPRHQKHRTLTLGTFKEFNFDNADGV--PKPSAGP 429 Query: 532 AWWANEKVVGKEIETTGKNWTFFPVMQPGV 443 WW N VGKE + T KNW+FFPVMQP + Sbjct: 430 DWWDNGSDVGKE-DFTAKNWSFFPVMQPSI 458 >ref|XP_006368761.1| hypothetical protein POPTR_0001s09590g [Populus trichocarpa] gi|550346902|gb|ERP65330.1| hypothetical protein POPTR_0001s09590g [Populus trichocarpa] Length = 452 Score = 473 bits (1217), Expect = e-130 Identities = 260/461 (56%), Positives = 311/461 (67%), Gaps = 4/461 (0%) Frame = -2 Query: 1810 MRGGGG----VNNPLDXXXXXXXXXXXAESRPPHAAVQKKRWRSCWSLYWCFGSPKHRKR 1643 MRG G NN L+ AE+R P A VQ+ RW SCWS+Y CFG KH+K+ Sbjct: 1 MRGFNGESRAANNTLETINAAATAIASAENRVPQATVQR-RWGSCWSIYLCFGYQKHKKQ 59 Query: 1642 IGRAVLVPEPTSSRVDAPAAENPPQAASVVLPFIXXXXXXXSFLQSEPPSAMQSPAGGLS 1463 IG AVL PEP++ APA+ENP QA +V LPF SF QSEPPS QSPAG +S Sbjct: 60 IGHAVLFPEPSAPGNGAPASENPTQAPAVTLPFAAPPSSPASFFQSEPPSVTQSPAGLVS 119 Query: 1462 LTSISASMYSPGGPASMFAIGPYAHETQLVTPPAFSTFTTEPSTAPFTPPPESVQITTPS 1283 LTSISASMYSP GPAS+FAIGPYAHETQLV+PP FSTFTTEPSTAPFTPPPESV +TTPS Sbjct: 120 LTSISASMYSPSGPASIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPS 179 Query: 1282 SPEVPFARLLDPIHQNGEDGPRFPFSQYEFQSYQLQPGSPVSHLXXXXXXXXXXXXXXPF 1103 SPEVPFA+ LDP +NG+ G RFPF +FQSYQ PGSPV L PF Sbjct: 180 SPEVPFAQFLDPSLRNGDTGLRFPF---DFQSYQFHPGSPVGQLISPSSGISGSGTSSPF 236 Query: 1102 PDGEFVPGRPHFLQFRSGDPPHLLNLEKMASHKWGSHQGSGTLTPDAGVPRSQNNFLLDH 923 PDGEF G HF +FR G+PP LLNL+K+++ +WGS+QGSG LTP++ V R NFLL Sbjct: 237 
PDGEFAVGGAHFPEFRIGEPPKLLNLDKLSTCEWGSYQGSGALTPES-VRRGSPNFLLHR 295 Query: 922 QHSDISPPSNSYNFWRNDETVVDHRVSFEITEEEVVRCVEKKPVALPKTVLSSLENAESV 743 Q SD+ S N +N + VV+HRVSFE+T E+ RCVE+KP KTV +EN + Sbjct: 296 QFSDVPSRPRSGNGHKNGQ-VVNHRVSFELTAEDASRCVEEKPAFSIKTVPEYVENG-TQ 353 Query: 742 VKKEDNPKETANEPASCVGETSNSTSERASIDGEEGQRHQKQRSTTLGSTKEFNFDSVDG 563 K+E N E+ VG TSN + E AS DGE +H+KQ+S TLGS KEFNFD+ D Sbjct: 354 AKEEKNSGESIQSFECRVGVTSNDSPEMASTDGEAAPQHRKQQSITLGSVKEFNFDNADE 413 Query: 562 VNSDKPNIGTAWWANEKVVGKEIETTGKNWTFFPVMQPGVS 440 +S KP+ + WWAN V+GKE ETT KNW+FFP++Q GVS Sbjct: 414 GDSRKPS-SSNWWANGSVIGKEGETT-KNWSFFPMVQSGVS 452 >gb|EXB37330.1| hypothetical protein L484_024256 [Morus notabilis] Length = 455 Score = 470 bits (1209), Expect = e-129 Identities = 253/466 (54%), Positives = 307/466 (65%), Gaps = 9/466 (1%) Frame = -2 Query: 1810 MRG--GGG----VNNPLDXXXXXXXXXXXAESRPPHAAVQKKRWRSCWSLYWCFGSPKHR 1649 MRG GGG +NN L+ AE+R P A V+K+RW C S+YWCFG+PK+R Sbjct: 1 MRGASGGGDSRTMNNALETINAAATAIAMAENRVPQATVRKRRWGGCLSIYWCFGTPKNR 60 Query: 1648 KRIGRAVLVPEPTSSRVDAPAAENPPQAASVVLPFIXXXXXXXSFLQSEPPSAMQSPAGG 1469 RIG VLVPE AP AEN Q +V+LPFI SFLQSEPPSA QSPAG Sbjct: 61 TRIGHGVLVPETAQPGNSAPRAENSTQTHAVILPFIAPPSSPASFLQSEPPSATQSPAGL 120 Query: 1468 LSLTSISASMYSPGGPASMFAIGPYAHETQLVTPPAFSTFTTEPSTAPFTPPPESVQITT 1289 LSLTS+SASMYSPGGPAS+FAIGPYAHETQLV+PP FSTFTTEPSTAPFTPPPESV +TT Sbjct: 121 LSLTSVSASMYSPGGPASIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTT 180 Query: 1288 PSSPEVPFARLLDPIHQNGEDGPRFPFSQYEFQSYQLQPGSPVSHLXXXXXXXXXXXXXX 1109 PSSPEVPFA+LLDP NGE G RFP EFQSY QPGSP+ L Sbjct: 181 PSSPEVPFAQLLDPNIHNGEPGQRFPIFHNEFQSYYFQPGSPIGQLISPSSGISGSGTSS 240 Query: 1108 PFPDGEFVPGRPHFLQFRSGDPPHLLNLEKMASHKWGSHQGSGTLTPDAGVPRSQNNFLL 929 PFPD EF PHFL+FR+GDPP LLNL+K++ WGS QGSG+LTPD+ P S Sbjct: 241 PFPDPEFAARGPHFLEFRTGDPPKLLNLDKLSKFDWGSRQGSGSLTPDSVKPIST----- 295 Query: 928 DHQHSDISPPSNSYNFWRNDETVVDHRVSFEITEEEVVRCVEKKPVALPKTVLSSLENAE 749 +++P RN E V D RVSF+++ E+V+R VEKK V L + +L+SL++ Sbjct: 296 
----FEVAPHLKPNGRCRNAENVADRRVSFDVSTEDVIRYVEKKTVPLAEAMLTSLKDT- 350 Query: 748 SVVKKEDNPKETANEPASC---VGETSNSTSERASIDGEEGQRHQKQRSTTLGSTKEFNF 578 ++ ++E+N E C VGETSN ++A GEE +HQK RS TLGS+KEFNF Sbjct: 351 TMGQREENSDSNKVEEIGCENRVGETSNEEPDKAPTSGEEVLQHQKHRSITLGSSKEFNF 410 Query: 577 DSVDGVNSDKPNIGTAWWANEKVVGKEIETTGKNWTFFPVMQPGVS 440 D+ D + K + + WWAN+KV GKE +NW+FFP++QPGVS Sbjct: 411 DNADAGDLHKSDSVSDWWANQKVAGKE-GAPSQNWSFFPMIQPGVS 455 >gb|EOY24784.1| Hydroxyproline-rich glycoprotein family protein [Theobroma cacao] Length = 458 Score = 470 bits (1209), Expect = e-129 Identities = 257/461 (55%), Positives = 305/461 (66%), Gaps = 5/461 (1%) Frame = -2 Query: 1810 MRGGGG----VNNPLDXXXXXXXXXXXAESRPPHAAVQKKRWRSCWSLYWCFGSPKHRKR 1643 MRG G +NN L+ AE+R P A VQK+RW CWS+YWCFGS K +KR Sbjct: 1 MRGANGESIAMNNTLETIHAAANAIASAENRVPQATVQKRRWGGCWSIYWCFGSYKQKKR 60 Query: 1642 IGRAVLVPEPTSSRVDAPAAENPPQAASVVLPFIXXXXXXXSFLQSEPPSAMQSPAGGLS 1463 IG AVL E + S + PAAENP QA ++ LPF+ SFL SEPPSA QSPAG +S Sbjct: 61 IGPAVLTSETSFSGANVPAAENPTQAPAIALPFVAPPSSPASFLPSEPPSATQSPAGLVS 120 Query: 1462 LTSISASMYSPGGPASMFAIGPYAHETQLVTPPAFSTFTTEPSTAPFTPPPESVQITTPS 1283 LTSISASMYSPG PAS+FAIGPYAHETQLV+PP FSTFTTEPSTAPFTPPPESV +TTPS Sbjct: 121 LTSISASMYSPG-PASIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPS 179 Query: 1282 SPEVPFARLLDPIHQNGEDGPRFPFSQYEFQSYQLQPGSPVSHLXXXXXXXXXXXXXXPF 1103 SPEVPFA+LL P Q GE RFP S YEFQSYQL PGSPV L PF Sbjct: 180 SPEVPFAQLLGPNLQYGEGVQRFPISHYEFQSYQLHPGSPVGQLISPSSGISGSGTSSPF 239 Query: 1102 PDGEFVPGRPHFLQFRSGDPPHLLNLEKMASHKWGSHQGSGTLTPDAGVPRSQNNFLLDH 923 DGEF HF +FR GDPP LLNL+K +S +WGSH GSGTLTPDA +N FLLDH Sbjct: 240 RDGEFAASL-HFPEFRMGDPPKLLNLDKHSSCEWGSHHGSGTLTPDATRSTPRNGFLLDH 298 Query: 922 QHSDI-SPPSNSYNFWRNDETVVDHRVSFEITEEEVVRCVEKKPVALPKTVLSSLENAES 746 Q S+I S P +ND+ +HRVSFE+T EEVVR +E + A P +S E+ Sbjct: 299 QISEITSHPHLKNKEVQNDQVAHNHRVSFELTTEEVVRSLEME-TATPSEAVSGSLQIEA 357 Query: 745 VVKKEDNPKETANEPASCVGETSNSTSERASIDGEEGQRHQKQRSTTLGSTKEFNFDSVD 566 + E++ + ++ VGETSN E+A D E +H K +S TLGS KEFNFD+VD Sbjct: 
358 TRESEEHDTKVVDDYECRVGETSNERPEKALADREGKPQHHKHQSITLGSAKEFNFDNVD 417 Query: 565 GVNSDKPNIGTAWWANEKVVGKEIETTGKNWTFFPVMQPGV 443 G ++ KP + + WWAN+KV GK +NW+FFP+MQPGV Sbjct: 418 GGDAHKPILTSDWWANDKVAGKG-GGVPRNWSFFPMMQPGV 457 >ref|XP_004298812.1| PREDICTED: uncharacterized protein LOC101309729 isoform 1 [Fragaria vesca subsp. vesca] Length = 459 Score = 447 bits (1150), Expect = e-123 Identities = 249/463 (53%), Positives = 302/463 (65%), Gaps = 2/463 (0%) Frame = -2 Query: 1822 VREEMRGGGGVNNPLDXXXXXXXXXXXAESRPPHAAVQKKRWRSCWSLYWCFGSPKHRKR 1643 +R + GG G NN LD AESR P A VQK+RW W +YWCFG +HRKR Sbjct: 3 MRRGVNGGDG-NNALDTINAAASAIAAAESRVPQATVQKRRWAKGWGVYWCFGFQRHRKR 61 Query: 1642 IGRAVLVPEPTSSRVDAPAAENPPQAASVVLPFIXXXXXXXSFLQSEPPSAMQSPAGGLS 1463 IG AV++PE TS + P AEN QA+S+VLPF SFLQSEPPSAMQSP S Sbjct: 62 IGHAVILPETTSPGHNDPRAENLTQASSIVLPFAAPPSSPASFLQSEPPSAMQSPGFNFS 121 Query: 1462 LTSISASMYSPGGPASMFAIGPYAHETQLVTPPAFSTFTTEPSTAPFTPPPESVQITTPS 1283 L SASMYSPG P+S+FAIGPYAHETQLV+PP FSTFTTEPSTAPFTPP ESV +T PS Sbjct: 122 L---SASMYSPG-PSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPAESVHLTRPS 177 Query: 1282 SPEVPFARLLDPIHQNGEDGPRFPFSQYEFQSYQLQPGSPVSHLXXXXXXXXXXXXXXPF 1103 SPEVPFA+LLD + GE G R+P S YEFQSYQ PGSPV L PF Sbjct: 178 SPEVPFAQLLDSNFRFGEGGQRYPLSHYEFQSYQWYPGSPVGQLISPSSGISGSGTSSPF 237 Query: 1102 PDGEFVPGRPHFLQFRSGDPPHLLNLEKMASHKWGSHQGSGTLTPDAGVPRSQNNFLLDH 923 D EF G HFL+FR+G+ P +LNL+ + + WGS SG++TPDA S F L Sbjct: 238 LDSEFASGGHHFLEFRTGEAPKVLNLDILFTRDWGSRLCSGSVTPDAAKSTSSEGFTLKP 297 Query: 922 QHSDISPPSNSYNFWRNDETVVDHRVSFEITEEEVVRCVEKKPVALPKTVLSSLENAESV 743 + + S + RND + HRVSFE++ EEVVRCVEKKPVAL + V +SL++AE Sbjct: 298 YTPEGVLNARSNSRRRNDGASIGHRVSFELSAEEVVRCVEKKPVALAEAVSTSLQSAEKA 357 Query: 742 VKKEDNPKETANEPASCVGETSNSTSERA-SIDGEE-GQRHQKQRSTTLGSTKEFNFDSV 569 ++E +E ++ V +TSN +SE+A D EE R+QK+RS TLGS KEFNFD+ Sbjct: 358 EREEGPNQEVSSSHECPVVDTSNDSSEKAVGGDAEELSYRYQKERSITLGSAKEFNFDNA 417 Query: 568 DGVNSDKPNIGTAWWANEKVVGKEIETTGKNWTFFPVMQPGVS 440 DG +S +I T WWANEKVV KE KNW+FFP++QPG+S Sbjct: 418 
DGGDSGTSSISTDWWANEKVVLKE-NGESKNWSFFPMIQPGMS 459 >emb|CBI34651.3| unnamed protein product [Vitis vinifera] Length = 412 Score = 433 bits (1113), Expect = e-118 Identities = 242/453 (53%), Positives = 288/453 (63%), Gaps = 2/453 (0%) Frame = -2 Query: 1792 VNNPLDXXXXXXXXXXXAESRPPHAAVQKKRWRSCWSLYWCFGSPKHRKRIGRAVLVPEP 1613 +N+ L+ AE+R P VQK+RW SCW YWCF SPK KRIG AVL PE Sbjct: 11 MNSALETINAAATAIASAENRVPQPTVQKRRWGSCWGEYWCFRSPKD-KRIGHAVLAPES 69 Query: 1612 TSSRVDAPAAENPPQAASVVLPFIXXXXXXXSFLQSEPPSAMQSPAGGLSLTSISASMYS 1433 + PAAEN QA ++VLPF+ SFLQSEPPSA QSP+G LSLTSI+A++YS Sbjct: 70 RAPGSGVPAAENLTQAPTIVLPFVAPPSSPASFLQSEPPSATQSPSGLLSLTSINANIYS 129 Query: 1432 PGGPASMFAIGPYAHETQLVTPPAFSTFTTEPSTAPFTPPPESVQITTPSSPEVPFARLL 1253 PGGPAS+FAIGPYAHETQLV+PP FSTFTTEPSTAPFTPPPESV +TTPSSPEVPFA+L Sbjct: 130 PGGPASIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLF 189 Query: 1252 DPIHQNGEDGPRFPFSQYEFQSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDGEFVPGRP 1073 DP ++NGE G RF SQYEFQSYQL PGSPV HL PFPD Sbjct: 190 DPNNRNGEAGHRFLLSQYEFQSYQLYPGSPVGHLISPSSGISGSGTSSPFPD-------- 241 Query: 1072 HFLQFRSGDPPHLLNLEKMASHKWGSHQGSGTLTPDAGVPRSQNNFLLDHQHSDISPPSN 893 SG++TPDA P S++ +LDH Sbjct: 242 ----------------------------RSGSITPDALGPPSRDGSVLDHSGCP------ 267 Query: 892 SYNFWRNDETVVDHRVSFEITEEEVVRCVEKKPVALPKTVLSSLENAESVVKKEDNPKET 713 N+E +VDHRVSFE+T E+VVRCVEK AL K V +SL+N + V+ ++N +E Sbjct: 268 ------NNEIMVDHRVSFELTAEDVVRCVEKDSAALVKAVSASLQN-PATVEIDENSREV 320 Query: 712 ANEPASCVGETSNSTSERASID--GEEGQRHQKQRSTTLGSTKEFNFDSVDGVNSDKPNI 539 + VGET+N+ E+A D GEEGQ H KQRS TLGS KEFNFD+ DG +SDKPNI Sbjct: 321 VVDSEGRVGETANNPPEKAPEDANGEEGQPHHKQRSITLGSAKEFNFDNADGGHSDKPNI 380 Query: 538 GTAWWANEKVVGKEIETTGKNWTFFPVMQPGVS 440 + WWANEKVVGKE+ + KNW+ F +MQP VS Sbjct: 381 SSDWWANEKVVGKEVGAS-KNWSIFHMMQPSVS 412 >ref|XP_004298813.1| PREDICTED: uncharacterized protein LOC101309729 isoform 2 [Fragaria vesca subsp. 
vesca] Length = 422 Score = 430 bits (1105), Expect = e-117 Identities = 234/427 (54%), Positives = 286/427 (66%), Gaps = 2/427 (0%) Frame = -2 Query: 1714 VQKKRWRSCWSLYWCFGSPKHRKRIGRAVLVPEPTSSRVDAPAAENPPQAASVVLPFIXX 1535 +QK+RW W +YWCFG +HRKRIG AV++PE TS + P AEN QA+S+VLPF Sbjct: 1 MQKRRWAKGWGVYWCFGFQRHRKRIGHAVILPETTSPGHNDPRAENLTQASSIVLPFAAP 60 Query: 1534 XXXXXSFLQSEPPSAMQSPAGGLSLTSISASMYSPGGPASMFAIGPYAHETQLVTPPAFS 1355 SFLQSEPPSAMQSP SL SASMYSPG P+S+FAIGPYAHETQLV+PP FS Sbjct: 61 PSSPASFLQSEPPSAMQSPGFNFSL---SASMYSPG-PSSIFAIGPYAHETQLVSPPVFS 116 Query: 1354 TFTTEPSTAPFTPPPESVQITTPSSPEVPFARLLDPIHQNGEDGPRFPFSQYEFQSYQLQ 1175 TFTTEPSTAPFTPP ESV +T PSSPEVPFA+LLD + GE G R+P S YEFQSYQ Sbjct: 117 TFTTEPSTAPFTPPAESVHLTRPSSPEVPFAQLLDSNFRFGEGGQRYPLSHYEFQSYQWY 176 Query: 1174 PGSPVSHLXXXXXXXXXXXXXXPFPDGEFVPGRPHFLQFRSGDPPHLLNLEKMASHKWGS 995 PGSPV L PF D EF G HFL+FR+G+ P +LNL+ + + WGS Sbjct: 177 PGSPVGQLISPSSGISGSGTSSPFLDSEFASGGHHFLEFRTGEAPKVLNLDILFTRDWGS 236 Query: 994 HQGSGTLTPDAGVPRSQNNFLLDHQHSDISPPSNSYNFWRNDETVVDHRVSFEITEEEVV 815 SG++TPDA S F L + + S + RND + HRVSFE++ EEVV Sbjct: 237 RLCSGSVTPDAAKSTSSEGFTLKPYTPEGVLNARSNSRRRNDGASIGHRVSFELSAEEVV 296 Query: 814 RCVEKKPVALPKTVLSSLENAESVVKKEDNPKETANEPASCVGETSNSTSERA-SIDGEE 638 RCVEKKPVAL + V +SL++AE ++E +E ++ V +TSN +SE+A D EE Sbjct: 297 RCVEKKPVALAEAVSTSLQSAEKAEREEGPNQEVSSSHECPVVDTSNDSSEKAVGGDAEE 356 Query: 637 -GQRHQKQRSTTLGSTKEFNFDSVDGVNSDKPNIGTAWWANEKVVGKEIETTGKNWTFFP 461 R+QK+RS TLGS KEFNFD+ DG +S +I T WWANEKVV KE KNW+FFP Sbjct: 357 LSYRYQKERSITLGSAKEFNFDNADGGDSGTSSISTDWWANEKVVLKE-NGESKNWSFFP 415 Query: 460 VMQPGVS 440 ++QPG+S Sbjct: 416 MIQPGMS 422 >ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264629 [Vitis vinifera] Length = 448 Score = 424 bits (1091), Expect = e-116 Identities = 240/462 (51%), Positives = 293/462 (63%), Gaps = 11/462 (2%) Frame = -2 Query: 1792 VNNPLDXXXXXXXXXXXAESRPPHAAVQKKRWRSCWSLYWCFGSPKHRKRIGRAVLVPEP 1613 VNN ++ AESR VQK+RW SC SLYWCFGS +H KRIG AVLVPEP Sbjct: 4 
VNNSVETINAAATAIVSAESRVQPTTVQKRRWGSCLSLYWCFGSHRHSKRIGHAVLVPEP 63 Query: 1612 TSSRVDAPAAENPPQAASVVLPFIXXXXXXXSFLQSEPPSAMQSPAGGLSLTSISASMYS 1433 APA+EN + S+VLPFI SFLQS+PPS+ QSPAG LSLT++S + YS Sbjct: 64 MVPGAVAPASENLNLSTSIVLPFIAPPSSPASFLQSDPPSSTQSPAGFLSLTALSVNAYS 123 Query: 1432 PGGPASMFAIGPYAHETQLVTPPAFSTFTTEPSTAPFTPPPESVQITTPSSPEVPFARL- 1256 P GPASMFAIGPYAHETQLV+PP FSTF TEPSTAPFTPPPESVQ+TTPSSPEVPFA+L Sbjct: 124 PSGPASMFAIGPYAHETQLVSPPVFSTFPTEPSTAPFTPPPESVQLTTPSSPEVPFAQLL 183 Query: 1255 ---LDPIHQNGEDGPRFPFSQYEFQSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDGEFV 1085 LD +N + S YEFQ YQL P SPV HL PFPD + Sbjct: 184 TSSLDRSRRNSGTNQKLSLSNYEFQPYQLYPESPVGHL---ISPISNSGTSSPFPDRRPI 240 Query: 1084 PGRPHFLQFRSGDPPHLLNLEKMASHKWGSHQGSGTLTPDAGVPRSQNNFLLDHQHSDIS 905 + P LL E ++ +WGS GSG+LTPD P S+++FLL++Q S+++ Sbjct: 241 V-----------EAPKLLGFEHFSTRRWGSRLGSGSLTPDGAGPASRDSFLLENQISEVA 289 Query: 904 PPSNSYNFWRNDETVVDHRVSFEITEEEVVRCVEKKPVALPKTVLSSL----ENAESVVK 737 +NS + +N ETV+DHRVSFE+ E+V CVEKKPVA +TV ++L E E + Sbjct: 290 SLANSESGSQNGETVIDHRVSFELAGEDVAVCVEKKPVASAETVQNTLQDIVEEGEIERE 349 Query: 736 KEDNPKETANEPASCVGETSNSTSERASIDGEEGQRHQKQRSTTLGSTKEFNFDSVDGVN 557 ++ + T N CVGE + SE+AS +GEE Q H+K GS KEFNFD+ G Sbjct: 350 RDGISESTENCCEFCVGEALKAASEKASAEGEEEQCHKKHPPIRHGSIKEFNFDNTKGEV 409 Query: 556 SDKPN-IGTAWWANEKVVGKEIETTG--KNWTFFPVMQPGVS 440 S KPN IG+ WW NEKVVGK TG NWTFFP++QPG+S Sbjct: 410 SAKPNIIGSEWWVNEKVVGK---GTGPQTNWTFFPLLQPGIS 448 >gb|EOY23266.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma cacao] Length = 485 Score = 408 bits (1048), Expect = e-111 Identities = 235/490 (47%), Positives = 286/490 (58%), Gaps = 39/490 (7%) Frame = -2 Query: 1792 VNNPLDXXXXXXXXXXXAESRPPHAAVQKKRWRSCWSLYWCFGSPKHRKRIGRAVLVPEP 1613 VN+ ++ A+SR VQKKRW SCW LYWCFGS K+ KRIG AVLVPEP Sbjct: 4 VNDSVETVNAAATAIVSADSRVQPTTVQKKRWGSCWGLYWCFGSQKNSKRIGHAVLVPEP 63 Query: 1612 TSSRVDAPAAENPPQAASVVLPFIXXXXXXXSFLQSEPPSAMQSPAGGLSLTSISASMYS 1433 AEN ++LPFI SFLQS+PPSA QSPAG LSLTS+S + YS Sbjct: 64 
VVPGASVSTAENVSNPTGIILPFIAPPSSPASFLQSDPPSATQSPAGLLSLTSLSVNAYS 123 Query: 1432 PGGPASMFAIGPYAHETQLVTPPAFSTFTTEPSTAPFTPPPESVQITTPSSPEVPFARLL 1253 P GPAS+FAIGPYAHETQLVTPP FS TTEPSTAPFTPPPESVQ+TTPSSPEVPFA+LL Sbjct: 124 PRGPASIFAIGPYAHETQLVTPPVFSALTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLL 183 Query: 1252 ----DPIHQNGEDGPRFPFSQYEFQSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDGEFV 1085 + +N +F S YEFQSYQ+ PGSP +L PFPD Sbjct: 184 TSSLERARRNSGINQKFGLSHYEFQSYQIYPGSPGGNLISPGSAISNSGTSSPFPD---- 239 Query: 1084 PGRPHFLQFRSGDPPHLLNLEKMASHKWGSHQGS-------------------------- 983 R L+FR G+ P LL E + KWGS GS Sbjct: 240 --RRPILEFRMGEAPKLLGFENFTTRKWGSRLGSGSLTPDGLGQGSRLGSGSVTPDGMGL 297 Query: 982 ------GTLTPDAGVPRSQNNFLLDHQHSDISPPSNSYNFWRNDETVVDHRVSFEITEEE 821 G+LTPD P S++ FL+ Q S+++ +N N +NDET+VDHRVSFE++ E+ Sbjct: 298 GSRLGSGSLTPDGLGPASRDGFLVGSQISEVALLANPANGPKNDETIVDHRVSFELSGED 357 Query: 820 VVRCVEKKPVALPKTVLSSLEN---AESVVKKEDNPKETANEPASCVGETSNSTSERASI 650 V C+E K + LP +S AE +++ K+ + + ETSN T E+AS Sbjct: 358 VAPCLESKSL-LPSRAVSEYPKDLVAEGRKERDGIKKDLESSCELFIRETSNETVEKASG 416 Query: 649 DGEEGQRHQKQRSTTLGSTKEFNFDSVDGVNSDKPNIGTAWWANEKVVGKEIETTGKNWT 470 + EE +QK RS TLGS KEFNFD+ G SDKP I + WWANEKV GKE G +WT Sbjct: 417 EAEEEHSYQKHRSVTLGSIKEFNFDNTKGEASDKPTIRSEWWANEKVAGKEAR-PGNSWT 475 Query: 469 FFPVMQPGVS 440 FFP++QP VS Sbjct: 476 FFPMLQPEVS 485 >ref|XP_002304504.1| hypothetical protein POPTR_0003s12950g [Populus trichocarpa] gi|222841936|gb|EEE79483.1| hypothetical protein POPTR_0003s12950g [Populus trichocarpa] Length = 441 Score = 403 bits (1036), Expect = e-109 Identities = 227/429 (52%), Positives = 273/429 (63%) Frame = -2 Query: 1828 RKVREEMRGGGGVNNPLDXXXXXXXXXXXAESRPPHAAVQKKRWRSCWSLYWCFGSPKHR 1649 R V E R NN L+ AE+R P A VQK+RWRS WS+YWCFG K + Sbjct: 2 RDVNGESRAA---NNTLETINAAATAIASAENRVPQAMVQKQRWRSHWSIYWCFGYQKSK 58 Query: 1648 KRIGRAVLVPEPTSSRVDAPAAENPPQAASVVLPFIXXXXXXXSFLQSEPPSAMQSPAGG 1469 ++IG AVL PE ++ APAAEN QA V PF+ SF QSEPPS QSPAG Sbjct: 59 RQIGHAVLFPESSAPGSGAPAAENSAQAPEVTFPFVAPPSSPASFFQSEPPSVTQSPAGL 118 
Query: 1468 LSLTSISASMYSPGGPASMFAIGPYAHETQLVTPPAFSTFTTEPSTAPFTPPPESVQITT 1289 +S TSISASMYSP GPAS+FAIGPYAHETQLV+PP FSTFTTEPSTAPFTPPPESV +TT Sbjct: 119 VSRTSISASMYSPSGPASIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTT 178 Query: 1288 PSSPEVPFARLLDPIHQNGEDGPRFPFSQYEFQSYQLQPGSPVSHLXXXXXXXXXXXXXX 1109 PSSPEVPFA+L+DP +NG G RFPF +FQSYQ PGS V L Sbjct: 179 PSSPEVPFAQLIDPTLRNGVTGLRFPF---DFQSYQFHPGSSVGQLISPSSGISGSGTSS 235 Query: 1108 PFPDGEFVPGRPHFLQFRSGDPPHLLNLEKMASHKWGSHQGSGTLTPDAGVPRSQNNFLL 929 PFPDGEF G PH +FR G P LLNL+K+++ +WGS+Q SG LTPD+ V NFLL Sbjct: 236 PFPDGEFAVGGPHSPEFRMG--PKLLNLDKLSTREWGSYQDSGALTPDS-VRHGSPNFLL 292 Query: 928 DHQHSDISPPSNSYNFWRNDETVVDHRVSFEITEEEVVRCVEKKPVALPKTVLSSLENAE 749 Q SD++ S N +D+ VV+HR SFE++ ++ RCVE+KP KTV +EN Sbjct: 293 HRQFSDVASHPRSEN-GHDDDQVVNHRFSFELSVKDASRCVEEKPACSIKTVPEYVENG- 350 Query: 748 SVVKKEDNPKETANEPASCVGETSNSTSERASIDGEEGQRHQKQRSTTLGSTKEFNFDSV 569 + K+E+N E G+TSN T E S DGE Q H+KQ+ TLGS EFNFD+ Sbjct: 351 TKAKEEENYGELIQSFERRSGDTSNDTPETPSTDGEAPQ-HRKQQPITLGSVNEFNFDNA 409 Query: 568 DGVNSDKPN 542 D +S P+ Sbjct: 410 DEGDSHNPS 418 >gb|EOY23267.1| Hydroxyproline-rich glycoprotein family protein isoform 2 [Theobroma cacao] Length = 489 Score = 403 bits (1035), Expect = e-109 Identities = 229/464 (49%), Positives = 276/464 (59%), Gaps = 39/464 (8%) Frame = -2 Query: 1714 VQKKRWRSCWSLYWCFGSPKHRKRIGRAVLVPEPTSSRVDAPAAENPPQAASVVLPFIXX 1535 V KKRW SCW LYWCFGS K+ KRIG AVLVPEP AEN ++LPFI Sbjct: 34 VYKKRWGSCWGLYWCFGSQKNSKRIGHAVLVPEPVVPGASVSTAENVSNPTGIILPFIAP 93 Query: 1534 XXXXXSFLQSEPPSAMQSPAGGLSLTSISASMYSPGGPASMFAIGPYAHETQLVTPPAFS 1355 SFLQS+PPSA QSPAG LSLTS+S + YSP GPAS+FAIGPYAHETQLVTPP FS Sbjct: 94 PSSPASFLQSDPPSATQSPAGLLSLTSLSVNAYSPRGPASIFAIGPYAHETQLVTPPVFS 153 Query: 1354 TFTTEPSTAPFTPPPESVQITTPSSPEVPFARLL----DPIHQNGEDGPRFPFSQYEFQS 1187 TTEPSTAPFTPPPESVQ+TTPSSPEVPFA+LL + +N +F S YEFQS Sbjct: 154 ALTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGINQKFGLSHYEFQS 213 Query: 1186 YQLQPGSPVSHLXXXXXXXXXXXXXXPFPDGEFVPGRPHFLQFRSGDPPHLLNLEKMASH 1007 YQ+ PGSP 
+L PFPD R L+FR G+ P LL E + Sbjct: 214 YQIYPGSPGGNLISPGSAISNSGTSSPFPD------RRPILEFRMGEAPKLLGFENFTTR 267 Query: 1006 KWGSHQGS--------------------------------GTLTPDAGVPRSQNNFLLDH 923 KWGS GS G+LTPD P S++ FL+ Sbjct: 268 KWGSRLGSGSLTPDGLGQGSRLGSGSVTPDGMGLGSRLGSGSLTPDGLGPASRDGFLVGS 327 Query: 922 QHSDISPPSNSYNFWRNDETVVDHRVSFEITEEEVVRCVEKKPVALPKTVLSSLEN---A 752 Q S+++ +N N +NDET+VDHRVSFE++ E+V C+E K + LP +S A Sbjct: 328 QISEVALLANPANGPKNDETIVDHRVSFELSGEDVAPCLESKSL-LPSRAVSEYPKDLVA 386 Query: 751 ESVVKKEDNPKETANEPASCVGETSNSTSERASIDGEEGQRHQKQRSTTLGSTKEFNFDS 572 E +++ K+ + + ETSN T E+AS + EE +QK RS TLGS KEFNFD+ Sbjct: 387 EGRKERDGIKKDLESSCELFIRETSNETVEKASGEAEEEHSYQKHRSVTLGSIKEFNFDN 446 Query: 571 VDGVNSDKPNIGTAWWANEKVVGKEIETTGKNWTFFPVMQPGVS 440 G SDKP I + WWANEKV GKE G +WTFFP++QP VS Sbjct: 447 TKGEASDKPTIRSEWWANEKVAGKEAR-PGNSWTFFPMLQPEVS 489 >ref|XP_002513675.1| conserved hypothetical protein [Ricinus communis] gi|223547583|gb|EEF49078.1| conserved hypothetical protein [Ricinus communis] Length = 510 Score = 394 bits (1012), Expect = e-107 Identities = 235/504 (46%), Positives = 282/504 (55%), Gaps = 54/504 (10%) Frame = -2 Query: 1789 NNPLDXXXXXXXXXXXAESRPPHAAVQKKRWRSCWSLYWCFGSPKHRKRIGRAVLVPEPT 1610 N+ +D AESR VQK+RW CWSLYWCFGS K KRIG AVL PEP Sbjct: 19 NSSVDTINAAATAIVSAESRVQPTTVQKRRWGGCWSLYWCFGSHK-TKRIGHAVLAPEPE 77 Query: 1609 SSRVDAPAAENPPQAASVVLPFIXXXXXXXSFLQSEPPSAMQSPAGGLSLTSISASMYSP 1430 +AEN Q+ ++ +PFI SFLQS+PPSA QSPAG LSLTS+S + YSP Sbjct: 78 VQGAVVTSAENQSQSTAITVPFIAPPSSPASFLQSDPPSATQSPAGLLSLTSLSVNAYSP 137 Query: 1429 GGPASMFAIGPYAHETQLVTPPAFSTFTTEPSTAPFTPPPESVQITTPSSPEVPFARL-- 1256 GGPAS+FAIGPYAHETQLVTPPAFS FTTEPSTAPFTPPPESVQ+TTPSSPEVPFA+L Sbjct: 138 GGPASIFAIGPYAHETQLVTPPAFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLT 197 Query: 1255 --LDPIHQNGEDGPRFPFSQYEFQSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDGEFVP 1082 L+ +N +F S YEFQSY L PGSP L PFPD Sbjct: 198 SSLERARRNSGTNQKFALSHYEFQSYPLYPGSPGGQLISPGSVISNSGTSSPFPD----- 252 Query: 1081 GRPHFLQFRSGDPPHLLNLEKMASHKWGSHQGSGT------------------------- 
977 R L+FR G+ P LL E + KWGS GSGT Sbjct: 253 -RYPILEFRMGEAPKLLGFEHFTTRKWGSRLGSGTVTPDGVGLGSRLGSGTVTPDGVGQG 311 Query: 976 -----------------------LTPDAGVPRSQNNFLLDHQHSDISPPSNSYNFWRNDE 866 LTPDA P S++ F L++Q S+++ +NS N + DE Sbjct: 312 SRLGSGTVTPDGVGLRSMLGSGSLTPDAVGPASRDGFFLENQISEVASLANSENGSKTDE 371 Query: 865 TVVDHRVSFEITEEEVVRCVEKKPVALPKTVLSSLEN--AESVVKKEDNPKETANEPASC 692 +VDHRVSFE++ EEV RC+E K +A + + AE +K N P Sbjct: 372 NIVDHRVSFELSGEEVARCLESKSLASCRAFSECPPDSMAEDQIKSGKMLMTDENLP--- 428 Query: 691 VGETSNSTSERASIDGEEGQRHQKQRSTTLGSTKEFNFDSVDGVNSDKPNIGTAWWANEK 512 GETS T E+ S + EE ++K RS TLGS KEFNFD+ V DKP+I + WWANE Sbjct: 429 TGETSGETPEKPSGEMEEEHCYRKHRSITLGSIKEFNFDNSKEV-PDKPSINSEWWANET 487 Query: 511 VVGKEIETTGKNWTFFPVMQPGVS 440 + GKE NWTFFP++QP VS Sbjct: 488 IAGKEAR-PANNWTFFPLLQPEVS 510 >ref|XP_003525388.1| PREDICTED: uncharacterized protein LOC100791666 isoform X1 [Glycine max] Length = 461 Score = 389 bits (1000), Expect = e-105 Identities = 225/457 (49%), Positives = 280/457 (61%), Gaps = 7/457 (1%) Frame = -2 Query: 1789 NNPLDXXXXXXXXXXXAESRPPHAAVQKKRWRSCWSLYWCFGSPKHRKRIGRAVLVPEPT 1610 NN LD A++R + QKKRW S CFG K RKRIG AVLVPEPT Sbjct: 14 NNTLDTINAAAFAIASAQNRVSQPSTQKKRWGSWLGKIGCFGYKKTRKRIGHAVLVPEPT 73 Query: 1609 SSRVDAPAAENPPQAASVVLPFIXXXXXXXSFLQSEPPSAMQSPAGGLSLTSISASMYSP 1430 ++ D AA + QA S+ LPF+ SF QSEPPS QSP G +S T +SAS+YSP Sbjct: 74 TNGADPAAAASSIQAPSITLPFVAPPSSPASFFQSEPPSTAQSPIGKVSHTCVSASIYSP 133 Query: 1429 GGPASMFAIGPYAHETQLVTPPAFSTFTTEPSTAPFTPPPESVQITTPSSPEVPFARLLD 1250 GGPAS+FAIGPYAHETQLV+PP FS STAPFTPPPESV +TTPSSPEVPFA+LLD Sbjct: 134 GGPASIFAIGPYAHETQLVSPPVFSA----SSTAPFTPPPESVHMTTPSSPEVPFAQLLD 189 Query: 1249 PIHQNGEDGPRFPFSQYEFQSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDGEFVPGRPH 1070 P ++N E RF S Y+FQSYQ PGSPV L P PD EF H Sbjct: 190 PNNKNSETFQRFQISHYDFQSYQFHPGSPVGQLISPRSAISVSGTSSPLPDSEFNATFAH 249 Query: 1069 FLQFRSGDPPHLLNLEKMAS--HKWGSHQGSGTLTPDAGVPRSQNNFLLDHQHSDI--SP 902 L F+ DPP LLNL+ S S+ GSG+LTPDA +Q+ FL +H S+I SP Sbjct: 250 
ILDFQRADPPKLLNLDNKLSSCENQKSNHGSGSLTPDAARSTTQSGFLSNHWVSEIKMSP 309 Query: 901 -PSNSYNFWRNDETVVDHRVSFEITEEEVVRCVEKKPVALPKT-VLSSLENAESVVKKED 728 PSN+ R +E ++HRVSFE++ ++V++ +E KP A T VL L+N KE+ Sbjct: 310 HPSNN----RLNEISINHRVSFELSAQKVLKSLENKPAASAWTNVLPKLKNDAPTTDKEE 365 Query: 727 NPKETANEPASCVGETSNSTSERASIDGEEGQR-HQKQRSTTLGSTKEFNFDSVDGVNSD 551 +E+A + V E N ++ G++ H+K +S TL S KEFNFD+ DG +S Sbjct: 366 KSEESALDDKQVVSEAHNDQPLETTLGGDKATTVHEKDQSLTLSSAKEFNFDNADGGDSL 425 Query: 550 KPNIGTAWWANEKVVGKEIETTGKNWTFFPVMQPGVS 440 PNI WWANEKV GKE E + K+W+FFP++QPGVS Sbjct: 426 APNIVADWWANEKVAGKEREAS-KDWSFFPMIQPGVS 461