BLASTX nr result
ID: Catharanthus22_contig00023676
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00023676
         (1764 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_006348129.1| PREDICTED: ataxin-10-like [Solanum tuberosum]     477   e-132
ref|XP_006366476.1| PREDICTED: ataxin-10-like isoform X1 [Solanu...   472   e-130
ref|XP_004232703.1| PREDICTED: ataxin-10-like isoform 1 [Solanum...   472   e-130
ref|XP_002274705.1| PREDICTED: uncharacterized protein LOC100264...   429   e-117
gb|EOY14176.1| ARM repeat superfamily protein, putative isoform ...   428   e-117
gb|EOY14175.1| ARM repeat superfamily protein, putative isoform ...   428   e-117
gb|EOY14173.1| ARM repeat superfamily protein, putative isoform ...   428   e-117
gb|EOY14172.1| ARM repeat superfamily protein, putative isoform ...   428   e-117
ref|XP_006421838.1| hypothetical protein CICLE_v10004825mg [Citr...   428   e-117
gb|EMJ20253.1| hypothetical protein PRUPE_ppa004765mg [Prunus pe...   424   e-116
ref|XP_002320751.1| ataxin-related family protein [Populus trich...   424   e-116
ref|XP_002511774.1| conserved hypothetical protein [Ricinus comm...   417   e-113
ref|XP_004492673.1| PREDICTED: ataxin-10-like [Cicer arietinum]       416   e-113
ref|XP_003551615.1| PREDICTED: ataxin-10-like [Glycine max]           414   e-113
gb|ESW20728.1| hypothetical protein PHAVU_005G009900g [Phaseolus...   407   e-111
ref|XP_004308721.1| PREDICTED: uncharacterized protein LOC101297...   405   e-110
ref|XP_003623813.1| Ataxin-10 [Medicago truncatula] gi|355498828...   392   e-106
ref|XP_004306868.1| PREDICTED: ataxin-10-like [Fragaria vesca su...   384   e-104
ref|XP_002875041.1| hypothetical protein ARALYDRAFT_490543 [Arab... 
381 e-103 ref|NP_567156.1| protein MATERNAL EFFECT EMBRYO ARREST 50 [Arabi... 379 e-102 >ref|XP_006348129.1| PREDICTED: ataxin-10-like [Solanum tuberosum] Length = 501 Score = 477 bits (1228), Expect = e-132 Identities = 250/498 (50%), Positives = 322/498 (64%), Gaps = 6/498 (1%) Frame = +3 Query: 222 KCEKMDETFPTELPVPETITQWXXXXXXXXXXXXXXXXXIGIAKTDDGRADLAGKNIVTV 401 K +D+ EL +PE + + I +AK + GR DL+ KN+VT Sbjct: 4 KVVTVDDQIVAELTIPENVAKELLLVSNSSSLETALEKLIELAKEEGGRLDLSSKNVVTT 63 Query: 402 TLQLCRAXXXXXXXXXXXXXFKLLRNLCAGELVNQNTFLEQGGVGIVSNIIASIKCLSDL 581 L LC++ K+LRNLCAGE++NQN FL+Q GV IV ++I S+ D Sbjct: 64 VLHLCQSLSSISYRYLLLLSLKVLRNLCAGEIINQNEFLQQRGVEIVVDVIMSVGLTPDP 123 Query: 582 DNQVIHFGLQLLGNFALGGGEHRYAVWKELFPNKLIKIAGIHCMEIIDPLCMVIYTCTEG 761 D +I GLQLLGN+++GGGE + VW +LFP+K +KIA + EI DPLCMVIYTC +G Sbjct: 124 DCMIIRVGLQLLGNYSVGGGERQCDVWYQLFPHKFLKIARVRNQEICDPLCMVIYTCCDG 183 Query: 762 TDGLVAGLLTDQGFVIILEVIKTMSTVGFKEHWIKLLLSKICLEGSHFESIFSKLYPVSS 941 TDGL+ L +++G I++E+++T S VG KE W+KLLLSK+C+EGS+ SIF KL+ S Sbjct: 184 TDGLLTDLCSEKGLPILIEILRTASAVGLKEVWLKLLLSKLCIEGSYISSIFFKLHSYPS 243 Query: 942 NANCG------DYFADPQAFLLSILSEILNERIEDIAVDSKFPLHVLEILRTAAKAIDSV 1103 N G D F Q++LLS LSEILNER+E I V F + IL++A+ D Sbjct: 244 VENNGVVTHVVDQFVIEQSYLLSTLSEILNERVEHIVVSHDFARSIFGILKSASGVADFS 303 Query: 1104 PRRESGLPTGHATIDVLGYSLIILRDICACDHGFKKVQENSVDAMXXXXXXXXXXXXXXX 1283 R +S LP G A IDVLGYSL ILRDICA DH +E+S D + Sbjct: 304 IRGKSDLPVGSAPIDVLGYSLTILRDICASDHMTSSKEESSKDVVDVLVSSGLIEFLLNL 363 Query: 1284 XXXXXPPSTIKKGMKQSKTRDEGSCCSNSSKQCPYKGFRRDIVGILGNCAYRRKSVQDKI 1463 PP+TI+K MKQ + ++ S+S + CPY+GFRRDIV ILGNCAYRR+ VQD+I Sbjct: 364 LRDLEPPTTIRKAMKQDQIKE--GTISSSFRCCPYQGFRRDIVAILGNCAYRRRHVQDEI 421 Query: 1464 REENGILLLLQQCVSDEDNPYLREWGIWSIRNLLEGNAENQRLVSDLEMQETVDVPELTG 1643 R++NGILLLLQQCV DEDNP+LREWGIW +RNLLEGNAENQ ++DLE+Q TVDVPEL Sbjct: 422 RDKNGILLLLQQCVIDEDNPFLREWGIWCVRNLLEGNAENQGAITDLELQGTVDVPELVR 481 Query: 1644 LGLKVEIDPQTRRAKLVN 1697 LGL+VE+DP TR KLVN Sbjct: 482 LGLRVEVDPVTRHTKLVN 499 
>ref|XP_006366476.1| PREDICTED: ataxin-10-like isoform X1 [Solanum tuberosum] gi|565401994|ref|XP_006366477.1| PREDICTED: ataxin-10-like isoform X2 [Solanum tuberosum] gi|565401996|ref|XP_006366478.1| PREDICTED: ataxin-10-like isoform X3 [Solanum tuberosum] gi|565401998|ref|XP_006366479.1| PREDICTED: ataxin-10-like isoform X4 [Solanum tuberosum] gi|565402000|ref|XP_006366480.1| PREDICTED: ataxin-10-like isoform X5 [Solanum tuberosum] Length = 504 Score = 472 bits (1214), Expect = e-130 Identities = 248/494 (50%), Positives = 320/494 (64%), Gaps = 6/494 (1%) Frame = +3 Query: 234 MDETFPTELPVPETITQWXXXXXXXXXXXXXXXXXIGIAKTDDGRADLAGKNIVTVTLQL 413 +D+ E+ +PE + + I +AK + GR DL+ KN+VT L L Sbjct: 11 VDDKIVAEVTIPENVAKELLLVSNSSSLETALEKLIELAKEEGGRLDLSSKNVVTTVLHL 70 Query: 414 CRAXXXXXXXXXXXXXFKLLRNLCAGELVNQNTFLEQGGVGIVSNIIASIKCLSDLDNQV 593 C++ K+LRNLCAGE+ NQN FL+Q GV IV ++I S+ D D + Sbjct: 71 CQSLSSISYRQLLLSSLKVLRNLCAGEIRNQNEFLQQRGVEIVVDVITSVGLTPDPDCMI 130 Query: 594 IHFGLQLLGNFALGGGEHRYAVWKELFPNKLIKIAGIHCMEIIDPLCMVIYTCTEGTDGL 773 I GLQLLGN+++GGGE + VW +LFP+K +KIA + EI DPLCMVIYTC +GTDGL Sbjct: 131 IRVGLQLLGNYSVGGGERQCDVWYQLFPHKFLKIARVRSWEICDPLCMVIYTCCDGTDGL 190 Query: 774 VAGLLTDQGFVIILEVIKTMSTVGFKEHWIKLLLSKICLEGSHFESIFSKLYPVSSNANC 953 + L ++QG I++E+++T S V KE W+KLLLSK+C+EGS+ SIF KL+ S N Sbjct: 191 LTDLCSEQGLPILIEILRTASAVDRKEVWLKLLLSKLCIEGSYISSIFFKLHSFPSIQNN 250 Query: 954 G------DYFADPQAFLLSILSEILNERIEDIAVDSKFPLHVLEILRTAAKAIDSVPRRE 1115 G D F Q +LLSILSEI+N++IE I V F L + IL++A +D R + Sbjct: 251 GVVTHATDQFVIEQPYLLSILSEIVNDQIEHIVVSHDFALSIFGILKSAFVVVDFSIRGK 310 Query: 1116 SGLPTGHATIDVLGYSLIILRDICACDHGFKKVQENSVDAMXXXXXXXXXXXXXXXXXXX 1295 S LP G A IDVLGYSL ILRDICA DH +E+S D + Sbjct: 311 SDLPVGFAPIDVLGYSLTILRDICASDHMTSSKEESSKDVVDVLVSSGLIEFLLNLLRDL 370 Query: 1296 XPPSTIKKGMKQSKTRDEGSCCSNSSKQCPYKGFRRDIVGILGNCAYRRKSVQDKIREEN 1475 PP+TI+K MKQ + + S+S + CPY+GFRRDIV I+GNCAYRR+ VQD+IR++N Sbjct: 371 EPPTTIRKAMKQDQITE--GIISSSFRCCPYQGFRRDIVSIIGNCAYRRRYVQDEIRDKN 428 Query: 1476 
GILLLLQQCVSDEDNPYLREWGIWSIRNLLEGNAENQRLVSDLEMQETVDVPELTGLGLK 1655 GILLLLQQCV DEDNP+LREWGIW +RNLLEGNAENQ ++DLE+Q TVDVPEL LGL+ Sbjct: 429 GILLLLQQCVIDEDNPFLREWGIWCVRNLLEGNAENQGAITDLELQGTVDVPELVRLGLR 488 Query: 1656 VEIDPQTRRAKLVN 1697 VE+DP TRR KLVN Sbjct: 489 VEVDPVTRRTKLVN 502 >ref|XP_004232703.1| PREDICTED: ataxin-10-like isoform 1 [Solanum lycopersicum] gi|460373805|ref|XP_004232704.1| PREDICTED: ataxin-10-like isoform 2 [Solanum lycopersicum] Length = 501 Score = 472 bits (1214), Expect = e-130 Identities = 251/498 (50%), Positives = 321/498 (64%), Gaps = 6/498 (1%) Frame = +3 Query: 222 KCEKMDETFPTELPVPETITQWXXXXXXXXXXXXXXXXXIGIAKTDDGRADLAGKNIVTV 401 K MD+ +EL +PE + + I ++K GR DL+ KN+VT Sbjct: 4 KVVTMDDQIVSELTIPENVAKELLLVSNSSSLETALDKLIQLSKEGGGRLDLSSKNVVTT 63 Query: 402 TLQLCRAXXXXXXXXXXXXXFKLLRNLCAGELVNQNTFLEQGGVGIVSNIIASIKCLSDL 581 L LC++ K+LRNLCAGE+ NQN FL+Q GV IV ++I S+ D Sbjct: 64 VLHLCQSLSSISYRNLLLLSLKVLRNLCAGEIRNQNGFLQQRGVEIVLDVIMSVGLSPDP 123 Query: 582 DNQVIHFGLQLLGNFALGGGEHRYAVWKELFPNKLIKIAGIHCMEIIDPLCMVIYTCTEG 761 D +I GLQLLGN+++GGGE + VW +LFP+K +KIA + EI DPLCMVIYTC +G Sbjct: 124 DCMIIRVGLQLLGNYSVGGGERQCDVWYQLFPHKFLKIARVRNQEICDPLCMVIYTCCDG 183 Query: 762 TDGLVAGLLTDQGFVIILEVIKTMSTVGFKEHWIKLLLSKICLEGSHFESIFSKLYPVSS 941 TDGL+ L ++QG I+ E+++T S VG KE W+KLLLSK+C+EGSH SIF KL+ S Sbjct: 184 TDGLLTDLCSEQGLPILFEILRTASAVGLKEVWLKLLLSKLCIEGSHISSIFFKLHSYPS 243 Query: 942 NANCG------DYFADPQAFLLSILSEILNERIEDIAVDSKFPLHVLEILRTAAKAIDSV 1103 + G D F Q +LLSILSEILNER+E I V F + IL++A+ +D Sbjct: 244 VEDNGVVTHVADQFVIEQPYLLSILSEILNERVEHIVVSHDFARSIFGILKSASGVVDFS 303 Query: 1104 PRRESGLPTGHATIDVLGYSLIILRDICACDHGFKKVQENSVDAMXXXXXXXXXXXXXXX 1283 R +S LP G A IDVLGYSL ++RDICA DH +E+S D + Sbjct: 304 IRGKSDLPVGSAPIDVLGYSLTLMRDICASDHLSSSKEESSKDVVDVLVSSGLIEFLLNL 363 Query: 1284 XXXXXPPSTIKKGMKQSKTRDEGSCCSNSSKQCPYKGFRRDIVGILGNCAYRRKSVQDKI 1463 PP+TI+ MK + + EG+ S S + CPY+GFRRDIV ILGNCAYRR+ VQD+I Sbjct: 364 LRDLEPPTTIRNAMKPDQIK-EGTIPS-SFRCCPYQGFRRDIVAILGNCAYRRRHVQDEI 421 Query: 1464 
REENGILLLLQQCVSDEDNPYLREWGIWSIRNLLEGNAENQRLVSDLEMQETVDVPELTG 1643 R++NGILLLLQQCV DEDNP+LREWGIW +RNLLEGNAENQ ++DLE+Q TVDVPEL Sbjct: 422 RDKNGILLLLQQCVIDEDNPFLREWGIWCVRNLLEGNAENQGAITDLELQGTVDVPELVR 481 Query: 1644 LGLKVEIDPQTRRAKLVN 1697 LGL+VE+DP TRR KLVN Sbjct: 482 LGLRVEVDPVTRRTKLVN 499 >ref|XP_002274705.1| PREDICTED: uncharacterized protein LOC100264428 [Vitis vinifera] Length = 494 Score = 429 bits (1102), Expect = e-117 Identities = 230/461 (49%), Positives = 297/461 (64%), Gaps = 7/461 (1%) Frame = +3 Query: 339 IGIAKTDDGRADLAGKNIVTVTLQLCRAXXXXXXXXXXXXXFKLLRNLCAGELVNQNTFL 518 I +KT GR DL KNI+ V LQL ++ KLLRNLCAGE+ NQN F+ Sbjct: 35 IEASKTPGGRLDLGSKNILPVVLQLSQSLSYPSGHDILLLSLKLLRNLCAGEMTNQNLFI 94 Query: 519 EQGGVGIVSNIIASIKCL-SDLDNQVIHFGLQLLGNFALGGGEHRYAVWKELFPNKLIKI 695 EQ GV VS I+ S L SD D +I GLQLLGN +L G H+ AVW FP ++I Sbjct: 95 EQNGVKAVSTILLSFVGLDSDSDYGIIRMGLQLLGNVSLAGERHQRAVWHHFFPAGFLEI 154 Query: 696 AGIHCMEIIDPLCMVIYTCTEGTDGLVAGLLTDQGFVIILEVIKTMSTVGFKEHWIKLLL 875 A + +E DPLCMVIYTC + + + + DQG I+ E+++T STVGF+E W+KLLL Sbjct: 155 ARVRTLETSDPLCMVIYTCFDQSHEFITEICGDQGLPILAEIVRTASTVGFEEDWLKLLL 214 Query: 876 SKICLEGSHFESIFSKLYPVSSNANCG------DYFADPQAFLLSILSEILNERIEDIAV 1037 S+ICLE SHF +FSKL PV ++ N D FA QAFL+ I++EILNE+I + V Sbjct: 215 SRICLEESHFPMLFSKLCPVGTSGNYESIEFKVDVFASEQAFLMDIVAEILNEQINKMTV 274 Query: 1038 DSKFPLHVLEILRTAAKAIDSVPRRESGLPTGHATIDVLGYSLIILRDICACDHGFKKVQ 1217 S L VL IL+ +A +DSV +SG G I+VL YSL IL++ICA D + Sbjct: 275 SSDVALCVLGILKKSAGVLDSVSTCKSGFSAGSNAINVLKYSLTILKEICARDAQKSSNE 334 Query: 1218 ENSVDAMXXXXXXXXXXXXXXXXXXXXPPSTIKKGMKQSKTRDEGSCCSNSSKQCPYKGF 1397 SVD + PP+ I+K +KQ + +D S S K PY+GF Sbjct: 335 HGSVDVVDLLVSSGLLELLLCLLRDLEPPAIIRKAIKQGENQD--GAASYSPKHYPYRGF 392 Query: 1398 RRDIVGILGNCAYRRKSVQDKIREENGILLLLQQCVSDEDNPYLREWGIWSIRNLLEGNA 1577 RRD+V ++GNCAYRRK VQ++IRE NGILLLLQQCV+DE+N +LREWGIW +RNLLEGN Sbjct: 393 RRDLVAVIGNCAYRRKHVQNEIRERNGILLLLQQCVTDEENQFLREWGIWCVRNLLEGNV 452 Query: 1578 ENQRLVSDLEMQETVDVPELTGLGLKVEIDPQTRRAKLVNM 1700 ENQR+V++LE+Q 
+VDVPE+ GLGL+VE+D +T RAKLVN+ Sbjct: 453 ENQRVVAELELQGSVDVPEIAGLGLRVEVDQKTGRAKLVNV 493 >gb|EOY14176.1| ARM repeat superfamily protein, putative isoform 5 [Theobroma cacao] Length = 519 Score = 428 bits (1101), Expect = e-117 Identities = 224/456 (49%), Positives = 303/456 (66%), Gaps = 6/456 (1%) Frame = +3 Query: 339 IGIAKTDDGRADLAGKNIVTVTLQLCRAXXXXXXXXXXXXXFKLLRNLCAGELVNQNTFL 518 I +++T RA+LA +NI+ L+L + KLLRNLCAGE+ NQN F Sbjct: 36 IKVSRTAAARAELALRNILPTVLKLVESFHQTSSREYLVNSLKLLRNLCAGEVANQNAFF 95 Query: 519 EQGGVGIVSNIIASIKCLSDLDNQVIHFGLQLLGNFALGGGEHRYAVWKELFPNKLIKIA 698 EQ GV +V +++ S LS+ D+ VI LQ+L N +L G +H+ A+W + FPN+ +A Sbjct: 96 EQNGVEVVLSVLRSAALLSNPDSGVIRVSLQVLANVSLAGEDHQQAIWLKFFPNEFSVLA 155 Query: 699 GIHCMEIIDPLCMVIYTCTEGTDGLVAGLLTDQGFVIILEVIKTMSTVGFKEHWIKLLLS 878 + E DPLCM++YTC + GLVA L D G I++ +I+T+++VGF E W KLLLS Sbjct: 156 RVRSQETNDPLCMILYTCCDRRPGLVAELCRDMGLPIVVGIIRTVASVGFGEDWFKLLLS 215 Query: 879 KICLEGSHFESIFSKLYPVSSNANCG------DYFADPQAFLLSILSEILNERIEDIAVD 1040 ++CLE HF +FSK SS+ N G D F QAFLL I+SEILNERIE+I V Sbjct: 216 RLCLEDIHFPLVFSKSCEGSSSENSGNTDSGDDLFLSEQAFLLRIISEILNERIEEIQVS 275 Query: 1041 SKFPLHVLEILRTAAKAIDSVPRRESGLPTGHATIDVLGYSLIILRDICACDHGFKKVQE 1220 S+F L VL I + + + +D R S LPTG +IDV+GYSLIILRDICA + G ++ Sbjct: 276 SEFALCVLGIFKRSVRVVDFASRGMSSLPTGCTSIDVMGYSLIILRDICARE-GVGDLKN 334 Query: 1221 NSVDAMXXXXXXXXXXXXXXXXXXXXPPSTIKKGMKQSKTRDEGSCCSNSSKQCPYKGFR 1400 +S+D + PP+ I+K +K+ D ++SK CPYKGFR Sbjct: 335 DSLDVVDMLLSHELIDILLSLLRDLDPPAIIRKVLKEG---DNQGLNLSASKLCPYKGFR 391 Query: 1401 RDIVGILGNCAYRRKSVQDKIREENGILLLLQQCVSDEDNPYLREWGIWSIRNLLEGNAE 1580 RD++ ++GNCAYRRK VQD+IR++NGILLLLQQCV+D+DNPYLREWGIWS+RNLLEG+AE Sbjct: 392 RDMIAVIGNCAYRRKHVQDEIRQKNGILLLLQQCVTDDDNPYLREWGIWSLRNLLEGHAE 451 Query: 1581 NQRLVSDLEMQETVDVPELTGLGLKVEIDPQTRRAK 1688 NQ+ V+DLE+Q +VD+PEL+ LGL+VE+D +TRRAK Sbjct: 452 NQQAVADLELQGSVDMPELSRLGLRVEVDQKTRRAK 487 >gb|EOY14175.1| ARM repeat superfamily protein, putative isoform 4 [Theobroma cacao] Length = 500 Score = 428 bits (1101), Expect = 
e-117 Identities = 224/456 (49%), Positives = 303/456 (66%), Gaps = 6/456 (1%) Frame = +3 Query: 339 IGIAKTDDGRADLAGKNIVTVTLQLCRAXXXXXXXXXXXXXFKLLRNLCAGELVNQNTFL 518 I +++T RA+LA +NI+ L+L + KLLRNLCAGE+ NQN F Sbjct: 48 IKVSRTAAARAELALRNILPTVLKLVESFHQTSSREYLVNSLKLLRNLCAGEVANQNAFF 107 Query: 519 EQGGVGIVSNIIASIKCLSDLDNQVIHFGLQLLGNFALGGGEHRYAVWKELFPNKLIKIA 698 EQ GV +V +++ S LS+ D+ VI LQ+L N +L G +H+ A+W + FPN+ +A Sbjct: 108 EQNGVEVVLSVLRSAALLSNPDSGVIRVSLQVLANVSLAGEDHQQAIWLKFFPNEFSVLA 167 Query: 699 GIHCMEIIDPLCMVIYTCTEGTDGLVAGLLTDQGFVIILEVIKTMSTVGFKEHWIKLLLS 878 + E DPLCM++YTC + GLVA L D G I++ +I+T+++VGF E W KLLLS Sbjct: 168 RVRSQETNDPLCMILYTCCDRRPGLVAELCRDMGLPIVVGIIRTVASVGFGEDWFKLLLS 227 Query: 879 KICLEGSHFESIFSKLYPVSSNANCG------DYFADPQAFLLSILSEILNERIEDIAVD 1040 ++CLE HF +FSK SS+ N G D F QAFLL I+SEILNERIE+I V Sbjct: 228 RLCLEDIHFPLVFSKSCEGSSSENSGNTDSGDDLFLSEQAFLLRIISEILNERIEEIQVS 287 Query: 1041 SKFPLHVLEILRTAAKAIDSVPRRESGLPTGHATIDVLGYSLIILRDICACDHGFKKVQE 1220 S+F L VL I + + + +D R S LPTG +IDV+GYSLIILRDICA + G ++ Sbjct: 288 SEFALCVLGIFKRSVRVVDFASRGMSSLPTGCTSIDVMGYSLIILRDICARE-GVGDLKN 346 Query: 1221 NSVDAMXXXXXXXXXXXXXXXXXXXXPPSTIKKGMKQSKTRDEGSCCSNSSKQCPYKGFR 1400 +S+D + PP+ I+K +K+ D ++SK CPYKGFR Sbjct: 347 DSLDVVDMLLSHELIDILLSLLRDLDPPAIIRKVLKEG---DNQGLNLSASKLCPYKGFR 403 Query: 1401 RDIVGILGNCAYRRKSVQDKIREENGILLLLQQCVSDEDNPYLREWGIWSIRNLLEGNAE 1580 RD++ ++GNCAYRRK VQD+IR++NGILLLLQQCV+D+DNPYLREWGIWS+RNLLEG+AE Sbjct: 404 RDMIAVIGNCAYRRKHVQDEIRQKNGILLLLQQCVTDDDNPYLREWGIWSLRNLLEGHAE 463 Query: 1581 NQRLVSDLEMQETVDVPELTGLGLKVEIDPQTRRAK 1688 NQ+ V+DLE+Q +VD+PEL+ LGL+VE+D +TRRAK Sbjct: 464 NQQAVADLELQGSVDMPELSRLGLRVEVDQKTRRAK 499 >gb|EOY14173.1| ARM repeat superfamily protein, putative isoform 2 [Theobroma cacao] gi|508722277|gb|EOY14174.1| ARM repeat superfamily protein, putative isoform 2 [Theobroma cacao] gi|508722280|gb|EOY14177.1| ARM repeat superfamily protein, putative isoform 2 [Theobroma cacao] Length = 488 Score = 428 bits (1101), Expect = e-117 Identities = 
224/456 (49%), Positives = 303/456 (66%), Gaps = 6/456 (1%) Frame = +3 Query: 339 IGIAKTDDGRADLAGKNIVTVTLQLCRAXXXXXXXXXXXXXFKLLRNLCAGELVNQNTFL 518 I +++T RA+LA +NI+ L+L + KLLRNLCAGE+ NQN F Sbjct: 36 IKVSRTAAARAELALRNILPTVLKLVESFHQTSSREYLVNSLKLLRNLCAGEVANQNAFF 95 Query: 519 EQGGVGIVSNIIASIKCLSDLDNQVIHFGLQLLGNFALGGGEHRYAVWKELFPNKLIKIA 698 EQ GV +V +++ S LS+ D+ VI LQ+L N +L G +H+ A+W + FPN+ +A Sbjct: 96 EQNGVEVVLSVLRSAALLSNPDSGVIRVSLQVLANVSLAGEDHQQAIWLKFFPNEFSVLA 155 Query: 699 GIHCMEIIDPLCMVIYTCTEGTDGLVAGLLTDQGFVIILEVIKTMSTVGFKEHWIKLLLS 878 + E DPLCM++YTC + GLVA L D G I++ +I+T+++VGF E W KLLLS Sbjct: 156 RVRSQETNDPLCMILYTCCDRRPGLVAELCRDMGLPIVVGIIRTVASVGFGEDWFKLLLS 215 Query: 879 KICLEGSHFESIFSKLYPVSSNANCG------DYFADPQAFLLSILSEILNERIEDIAVD 1040 ++CLE HF +FSK SS+ N G D F QAFLL I+SEILNERIE+I V Sbjct: 216 RLCLEDIHFPLVFSKSCEGSSSENSGNTDSGDDLFLSEQAFLLRIISEILNERIEEIQVS 275 Query: 1041 SKFPLHVLEILRTAAKAIDSVPRRESGLPTGHATIDVLGYSLIILRDICACDHGFKKVQE 1220 S+F L VL I + + + +D R S LPTG +IDV+GYSLIILRDICA + G ++ Sbjct: 276 SEFALCVLGIFKRSVRVVDFASRGMSSLPTGCTSIDVMGYSLIILRDICARE-GVGDLKN 334 Query: 1221 NSVDAMXXXXXXXXXXXXXXXXXXXXPPSTIKKGMKQSKTRDEGSCCSNSSKQCPYKGFR 1400 +S+D + PP+ I+K +K+ D ++SK CPYKGFR Sbjct: 335 DSLDVVDMLLSHELIDILLSLLRDLDPPAIIRKVLKEG---DNQGLNLSASKLCPYKGFR 391 Query: 1401 RDIVGILGNCAYRRKSVQDKIREENGILLLLQQCVSDEDNPYLREWGIWSIRNLLEGNAE 1580 RD++ ++GNCAYRRK VQD+IR++NGILLLLQQCV+D+DNPYLREWGIWS+RNLLEG+AE Sbjct: 392 RDMIAVIGNCAYRRKHVQDEIRQKNGILLLLQQCVTDDDNPYLREWGIWSLRNLLEGHAE 451 Query: 1581 NQRLVSDLEMQETVDVPELTGLGLKVEIDPQTRRAK 1688 NQ+ V+DLE+Q +VD+PEL+ LGL+VE+D +TRRAK Sbjct: 452 NQQAVADLELQGSVDMPELSRLGLRVEVDQKTRRAK 487 >gb|EOY14172.1| ARM repeat superfamily protein, putative isoform 1 [Theobroma cacao] Length = 531 Score = 428 bits (1101), Expect = e-117 Identities = 224/456 (49%), Positives = 303/456 (66%), Gaps = 6/456 (1%) Frame = +3 Query: 339 IGIAKTDDGRADLAGKNIVTVTLQLCRAXXXXXXXXXXXXXFKLLRNLCAGELVNQNTFL 518 I +++T RA+LA +NI+ L+L + KLLRNLCAGE+ NQN F Sbjct: 48 
IKVSRTAAARAELALRNILPTVLKLVESFHQTSSREYLVNSLKLLRNLCAGEVANQNAFF 107 Query: 519 EQGGVGIVSNIIASIKCLSDLDNQVIHFGLQLLGNFALGGGEHRYAVWKELFPNKLIKIA 698 EQ GV +V +++ S LS+ D+ VI LQ+L N +L G +H+ A+W + FPN+ +A Sbjct: 108 EQNGVEVVLSVLRSAALLSNPDSGVIRVSLQVLANVSLAGEDHQQAIWLKFFPNEFSVLA 167 Query: 699 GIHCMEIIDPLCMVIYTCTEGTDGLVAGLLTDQGFVIILEVIKTMSTVGFKEHWIKLLLS 878 + E DPLCM++YTC + GLVA L D G I++ +I+T+++VGF E W KLLLS Sbjct: 168 RVRSQETNDPLCMILYTCCDRRPGLVAELCRDMGLPIVVGIIRTVASVGFGEDWFKLLLS 227 Query: 879 KICLEGSHFESIFSKLYPVSSNANCG------DYFADPQAFLLSILSEILNERIEDIAVD 1040 ++CLE HF +FSK SS+ N G D F QAFLL I+SEILNERIE+I V Sbjct: 228 RLCLEDIHFPLVFSKSCEGSSSENSGNTDSGDDLFLSEQAFLLRIISEILNERIEEIQVS 287 Query: 1041 SKFPLHVLEILRTAAKAIDSVPRRESGLPTGHATIDVLGYSLIILRDICACDHGFKKVQE 1220 S+F L VL I + + + +D R S LPTG +IDV+GYSLIILRDICA + G ++ Sbjct: 288 SEFALCVLGIFKRSVRVVDFASRGMSSLPTGCTSIDVMGYSLIILRDICARE-GVGDLKN 346 Query: 1221 NSVDAMXXXXXXXXXXXXXXXXXXXXPPSTIKKGMKQSKTRDEGSCCSNSSKQCPYKGFR 1400 +S+D + PP+ I+K +K+ D ++SK CPYKGFR Sbjct: 347 DSLDVVDMLLSHELIDILLSLLRDLDPPAIIRKVLKEG---DNQGLNLSASKLCPYKGFR 403 Query: 1401 RDIVGILGNCAYRRKSVQDKIREENGILLLLQQCVSDEDNPYLREWGIWSIRNLLEGNAE 1580 RD++ ++GNCAYRRK VQD+IR++NGILLLLQQCV+D+DNPYLREWGIWS+RNLLEG+AE Sbjct: 404 RDMIAVIGNCAYRRKHVQDEIRQKNGILLLLQQCVTDDDNPYLREWGIWSLRNLLEGHAE 463 Query: 1581 NQRLVSDLEMQETVDVPELTGLGLKVEIDPQTRRAK 1688 NQ+ V+DLE+Q +VD+PEL+ LGL+VE+D +TRRAK Sbjct: 464 NQQAVADLELQGSVDMPELSRLGLRVEVDQKTRRAK 499 >ref|XP_006421838.1| hypothetical protein CICLE_v10004825mg [Citrus clementina] gi|567858312|ref|XP_006421839.1| hypothetical protein CICLE_v10004825mg [Citrus clementina] gi|567858314|ref|XP_006421840.1| hypothetical protein CICLE_v10004825mg [Citrus clementina] gi|567858316|ref|XP_006421841.1| hypothetical protein CICLE_v10004825mg [Citrus clementina] gi|568874427|ref|XP_006490317.1| PREDICTED: ataxin-10-like isoform X1 [Citrus sinensis] gi|568874429|ref|XP_006490318.1| PREDICTED: ataxin-10-like isoform X2 [Citrus sinensis] gi|557523711|gb|ESR35078.1| 
hypothetical protein CICLE_v10004825mg [Citrus clementina] gi|557523712|gb|ESR35079.1| hypothetical protein CICLE_v10004825mg [Citrus clementina] gi|557523713|gb|ESR35080.1| hypothetical protein CICLE_v10004825mg [Citrus clementina] gi|557523714|gb|ESR35081.1| hypothetical protein CICLE_v10004825mg [Citrus clementina] Length = 497 Score = 428 bits (1100), Expect = e-117 Identities = 226/495 (45%), Positives = 304/495 (61%), Gaps = 6/495 (1%) Frame = +3 Query: 234 MDETFPTELPVPETITQWXXXXXXXXXXXXXXXXXIGIAKTDDGRADLAGKNIVTVTLQL 413 MD+ ++ + E + Q I +KT GR+DLA KNI+ LQL Sbjct: 1 MDDASSLDISLSEDVLQPLLTTSNSSSLKDALEILIESSKTTVGRSDLASKNILPEVLQL 60 Query: 414 CRAXXXXXXXXXXXXXFKLLRNLCAGELVNQNTFLEQGGVGIVSNIIASIKCLSDLDNQV 593 ++ KLLRNLCAGE+ NQ +F+EQ GVGIV ++ S D D + Sbjct: 61 TQSIPHSSGCHYLLLSLKLLRNLCAGEITNQKSFIEQTGVGIVLRVLRSPGVNLDKDYGI 120 Query: 594 IHFGLQLLGNFALGGGEHRYAVWKELFPNKLIKIAGIHCMEIIDPLCMVIYTCTEGTDGL 773 I LQ+L N +L G H++A+W + FP++ +AG+ C E DPLCMVIYTC +G+ GL Sbjct: 121 IRIALQVLANVSLAGETHQHAIWCQFFPDEFATLAGVRCQETCDPLCMVIYTCCDGSSGL 180 Query: 774 VAGLLTDQGFVIILEVIKTMSTVGFKEHWIKLLLSKICLEGSHFESIFSKLYPVSSNANC 953 L D+G I+ E++ T ++VGFKE W K L+S+ C+E HF +F KL V ++ NC Sbjct: 181 FKELCGDKGLAIMAEIVCTAASVGFKEDWFKFLVSRTCVEEIHFPQLFFKLSQVGASRNC 240 Query: 954 GD------YFADPQAFLLSILSEILNERIEDIAVDSKFPLHVLEILRTAAKAIDSVPRRE 1115 D F+ QAFLL I+SEI+NERIE+I V + F L VL I + +D R Sbjct: 241 EDSNSREGTFSSEQAFLLEIVSEIVNERIEEIIVPNDFALSVLGIFTKSIGLVDFYARGT 300 Query: 1116 SGLPTGHATIDVLGYSLIILRDICACDHGFKKVQENSVDAMXXXXXXXXXXXXXXXXXXX 1295 LPT + I+VLGYSL ILR+ICA + N D + Sbjct: 301 PSLPTSSSAINVLGYSLSILRNICAREDPAGSSSVNRADLVDSLQSHGLIEMFLSLLRDL 360 Query: 1296 XPPSTIKKGMKQSKTRDEGSCCSNSSKQCPYKGFRRDIVGILGNCAYRRKSVQDKIREEN 1475 PP+ I+K M+Q + ++ S + S+K CPY GFRRD+V ++GNCAYRRK +QD+IRE + Sbjct: 361 EPPAIIRKAMRQGENQEGTS--AKSAKTCPYIGFRRDLVAVIGNCAYRRKHIQDEIRERD 418 Query: 1476 GILLLLQQCVSDEDNPYLREWGIWSIRNLLEGNAENQRLVSDLEMQETVDVPELTGLGLK 1655 GILLLLQQCV+DEDNP+ REWGIW +RNLLEGNAENQ++V+DLE+Q +++VPELT LGLK Sbjct: 419 
GILLLLQQCVTDEDNPFSREWGIWCVRNLLEGNAENQKVVADLELQGSINVPELTDLGLK 478 Query: 1656 VEIDPQTRRAKLVNM 1700 VE+D TRRAKLVN+ Sbjct: 479 VEVDKNTRRAKLVNV 493 >gb|EMJ20253.1| hypothetical protein PRUPE_ppa004765mg [Prunus persica] Length = 492 Score = 424 bits (1091), Expect = e-116 Identities = 225/494 (45%), Positives = 308/494 (62%), Gaps = 5/494 (1%) Frame = +3 Query: 234 MDETFPTELPVPETITQWXXXXXXXXXXXXXXXXXIGIAKTDDGRADLAGKNIVTVTLQL 413 MD+T E VPE + Q I + + DGRADLA K+I+ +QL Sbjct: 1 MDKTALQEFFVPEDVLQILLSASNSSTLIDSLETLIQVCRAADGRADLASKSILPSVVQL 60 Query: 414 CRAXXXXXXXXXXXXXFKLLRNLCAGELVNQNTFLEQGGVGIVSNIIASIKCLSDLDNQV 593 ++ KLLRNLCAGE+ NQ +FLEQ GV I+SN++ S + D+ V Sbjct: 61 IQSLPYPSGRHLLTLSLKLLRNLCAGEVSNQKSFLEQSGVAIISNVLNSANISLEPDSGV 120 Query: 594 IHFGLQLLGNFALGGGEHRYAVWKELFPNKLIKIAGIHCMEIIDPLCMVIYTCTEGTDGL 773 I GLQ+L N +L G H++ +W++LFP + + +A + E DPLCMVI+ C +G+ L Sbjct: 121 IRMGLQVLANVSLAGERHQHEIWQQLFPKEFLALARVQSRETCDPLCMVIFACCDGSPEL 180 Query: 774 VAGLLTDQGFVIILEVIKTMSTVGFKEHWIKLLLSKICLEGSHFESIFSKL-YPVSSNAN 950 L D G I+ E+++T + VGF E W+KLLLS+ICLEG +F S+FS L + S N Sbjct: 181 FEKLCGDGGITIMKEIVRTTAAVGFGEDWVKLLLSRICLEGPYFSSLFSNLGFATSENVE 240 Query: 951 ----CGDYFADPQAFLLSILSEILNERIEDIAVDSKFPLHVLEILRTAAKAIDSVPRRES 1118 D F+ QAF L I+S+ILNER+ +I V F L V I + + A++ V R +S Sbjct: 241 DTEFREDLFSSDQAFFLRIISDILNERLREITVPRDFALCVFGIFKKSVGALNCVTRGQS 300 Query: 1119 GLPTGHATIDVLGYSLIILRDICACDHGFKKVQENSVDAMXXXXXXXXXXXXXXXXXXXX 1298 GLPTG + IDVLGYSL ILRD+CA + QE+ DA+ Sbjct: 301 GLPTGTSMIDVLGYSLTILRDVCA-QKTLRGFQEDLGDAVDVLLSHGLIELILCLLRDLE 359 Query: 1299 PPSTIKKGMKQSKTRDEGSCCSNSSKQCPYKGFRRDIVGILGNCAYRRKSVQDKIREENG 1478 PP+ I+K +KQ + +D + S SSK CPYKGFRRDIV ++GNC Y+RK VQD+IR+ +G Sbjct: 360 PPAIIRKAIKQGEGQDGTN--SGSSKPCPYKGFRRDIVAVIGNCTYQRKPVQDEIRQRDG 417 Query: 1479 ILLLLQQCVSDEDNPYLREWGIWSIRNLLEGNAENQRLVSDLEMQETVDVPELTGLGLKV 1658 ILLLLQQC DEDNP+L+EWGIW +RNLLEGN +N+R+V++LE+Q +VD PE+ GLG +V Sbjct: 418 ILLLLQQCGLDEDNPFLKEWGIWCVRNLLEGNEDNKRVVTELELQGSVDAPEIAGLGFRV 477 Query: 1659 EIDPQTRRAKLVNM 1700 
E++P+T R KLVN+ Sbjct: 478 EVNPETGRPKLVNV 491 >ref|XP_002320751.1| ataxin-related family protein [Populus trichocarpa] gi|222861524|gb|EEE99066.1| ataxin-related family protein [Populus trichocarpa] Length = 496 Score = 424 bits (1090), Expect = e-116 Identities = 228/463 (49%), Positives = 303/463 (65%), Gaps = 9/463 (1%) Frame = +3 Query: 339 IGIAKTDDGRADLAGKNIVTVTLQLC-RAXXXXXXXXXXXXXFKLLRNLCAGELVNQNTF 515 I IAKTDDGRADLA KNI+ V LQL +L+RNLCAGE+ NQ +F Sbjct: 37 IAIAKTDDGRADLASKNILPVVLQLITHLLNDPFDHEYLSLSLRLMRNLCAGEVANQKSF 96 Query: 516 LEQGGVGIVSNIIASIKCLS-DLDNQVIHFGLQLLGNFALGGGEHRYAVWKELFPNKLIK 692 ++ GVGI ++ S K S + D+ +I GLQ+L N +L G EH+ A+W LF ++L Sbjct: 97 IQLNGVGIFLTVLRSKKVASSEPDHGIIRMGLQVLANVSLAGKEHQQAIWGGLFHDELYM 156 Query: 693 IAGIHCMEIIDPLCMVIYTCTEGTDGLVAGLLTDQGFVIILEVIKTMSTVGFKEHWIKLL 872 +A + DPLCM+IY C +G+ LV L +QG I++E+I+T S VGF E W+KLL Sbjct: 157 LAKVRSQGTCDPLCMIIYACCDGSPELVLQLCGNQGLPIVVEIIRTASLVGFGEEWLKLL 216 Query: 873 LSKICLEGSHFESIFSKLYPVSSNANCGDY-------FADPQAFLLSILSEILNERIEDI 1031 LS+ICLE +F +FS++Y V S G+ F QA+LL+I+SEILNER+++I Sbjct: 217 LSRICLEDIYFPQLFSRIYSVCSYCENGEEISLSSNPFFTEQAYLLNIVSEILNERLKEI 276 Query: 1032 AVDSKFPLHVLEILRTAAKAIDSVPRRESGLPTGHATIDVLGYSLIILRDICACDHGFKK 1211 + + F L + I + + +A + R ES LPTG A IDVLGYSL ILRDICA + G K Sbjct: 277 TILNDFALCIFGIFKKSVEAFEFGSRAESRLPTGFAVIDVLGYSLTILRDICANNGGVGK 336 Query: 1212 VQENSVDAMXXXXXXXXXXXXXXXXXXXXPPSTIKKGMKQSKTRDEGSCCSNSSKQCPYK 1391 E+ VD + PP I+K M Q+ ++ + S K CPYK Sbjct: 337 --EDLVDVVDSLLSSGLLDLLLCLLRDLEPPKIIRKAMNQAGNQE--ATTSYFPKVCPYK 392 Query: 1392 GFRRDIVGILGNCAYRRKSVQDKIREENGILLLLQQCVSDEDNPYLREWGIWSIRNLLEG 1571 GFRRD+V ++GNCAYRRK VQD IR++NG+LL+LQQCV+DEDNP+LREWGIWS+RNLLEG Sbjct: 393 GFRRDLVAVIGNCAYRRKHVQDDIRQKNGMLLMLQQCVTDEDNPFLREWGIWSMRNLLEG 452 Query: 1572 NAENQRLVSDLEMQETVDVPELTGLGLKVEIDPQTRRAKLVNM 1700 N+ENQ+ V++LE+Q +VD+PEL GLGLKVE+D TR AKLVN+ Sbjct: 453 NSENQQAVAELELQGSVDMPELAGLGLKVEVDQNTRSAKLVNI 495 >ref|XP_002511774.1| conserved hypothetical protein [Ricinus communis] 
gi|223548954|gb|EEF50443.1| conserved hypothetical protein [Ricinus communis] Length = 497 Score = 417 bits (1071), Expect = e-113 Identities = 222/491 (45%), Positives = 302/491 (61%), Gaps = 6/491 (1%) Frame = +3 Query: 255 ELPVPETITQWXXXXXXXXXXXXXXXXXIGIAKTDDGRADLAGKNIVTVTLQLCRAXXXX 434 EL +PE + Q I ++ DDGRA+LA K+++ + L+L ++ Sbjct: 2 ELFLPEDLLQLLFRASKSYDLKEALEILIETSRIDDGRANLAAKDVLPLVLKLFKSISYP 61 Query: 435 XXXXXXXXXFKLLRNLCAGELVNQNTFLEQGGVGIVSNIIASIKCLSDLDNQVIHFGLQL 614 KLLRNLCAGE+ NQN F+ G +VS ++ S + + D +I GLQ+ Sbjct: 62 SGDQFLTLSLKLLRNLCAGEITNQNCFVALNGPEMVSTLLRSAGLVYEPDYGIIRLGLQV 121 Query: 615 LGNFALGGGEHRYAVWKELFPNKLIKIAGIHCMEIIDPLCMVIYTCTEGTDGLVAGLLTD 794 L N +L G +H+ A+W FP++ + +A DPLCM+IYTC +G G V L D Sbjct: 122 LANVSLAGEKHQQAIWHWFFPDEFVVLAKNRSQSTCDPLCMIIYTCCDGNPGFVLELCGD 181 Query: 795 QGFVIILEVIKTMSTVGFKEHWIKLLLSKICLEGSHFESIFSKLYPVSSNANC------G 956 +G ++ E+++T S VG+ E W KLLLS+ICLE +F +FS Y + N Sbjct: 182 RGLAVVAEIVRTASVVGYGEDWFKLLLSRICLEEEYFYKLFSCFYCAGDSENSEGISSSS 241 Query: 957 DYFADPQAFLLSILSEILNERIEDIAVDSKFPLHVLEILRTAAKAIDSVPRRESGLPTGH 1136 D F+ QA+LLS +SEILNER+EDI+V F +V I + + +D V R SGLPTG Sbjct: 242 DLFSTEQAYLLSTVSEILNERLEDISVSIDFAFYVFGIFKRSVGVVDFVSRGNSGLPTGS 301 Query: 1137 ATIDVLGYSLIILRDICACDHGFKKVQENSVDAMXXXXXXXXXXXXXXXXXXXXPPSTIK 1316 A +DVLGYSL ILRD CA HG K +SVD + PP IK Sbjct: 302 AAVDVLGYSLTILRDTCAL-HG-KGGLYHSVDVVDTLLSNGLLELLLFVLHDLEPPPMIK 359 Query: 1317 KGMKQSKTRDEGSCCSNSSKQCPYKGFRRDIVGILGNCAYRRKSVQDKIREENGILLLLQ 1496 K MKQ++ + S S S K CPYKGFRRDIV ++GNCA++R +VQD+IR+++ I LLLQ Sbjct: 360 KAMKQNENHEPAS--SRSYKPCPYKGFRRDIVAVIGNCAFQRNNVQDEIRQKDMIPLLLQ 417 Query: 1497 QCVSDEDNPYLREWGIWSIRNLLEGNAENQRLVSDLEMQETVDVPELTGLGLKVEIDPQT 1676 QCV+DEDNP+LREWG+W +RNLLEGN ENQ+ V++LE+Q TV VPEL+GLGL+VE+D T Sbjct: 418 QCVTDEDNPFLREWGLWCVRNLLEGNVENQKAVAELELQGTVQVPELSGLGLRVEVDSNT 477 Query: 1677 RRAKLVNM*SS 1709 RRA+LVN+ S+ Sbjct: 478 RRARLVNVSST 488 >ref|XP_004492673.1| PREDICTED: ataxin-10-like [Cicer arietinum] Length = 468 Score = 416 bits (1070), Expect = 
e-113 Identities = 216/459 (47%), Positives = 293/459 (63%), Gaps = 5/459 (1%) Frame = +3 Query: 339 IGIAKTDDGRADLAGKNIVTVTLQLCRAXXXXXXXXXXXXXFKLLRNLCAGELVNQNTFL 518 I +K+D GR++LA K ++ L + + FKLLRNLCAGE NQN FL Sbjct: 13 IHTSKSDSGRSNLASKRVLPAVLNILNSQTLPLDHNLLSLCFKLLRNLCAGEFENQNLFL 72 Query: 519 EQGGVGIVSNIIASIKCLSDLDNQVIHFGLQLLGNFALGGGEHRYAVWKELFPNKLIKIA 698 E GV +VS+I+ S D+ ++ +GLQ+L N L G +H+ A+W+E+FP + +A Sbjct: 73 EFDGVVVVSSILMSEAGSLRPDHMLVRWGLQVLANVCLAGKQHQKAIWEEIFPLGFVSLA 132 Query: 699 GIHCMEIIDPLCMVIYTCTEGTDGLVAGLLTDQGFVIILEVIKTMSTVGFKEHWIKLLLS 878 + EI DPLCMVIYTC +G L +D G ++ E++KT S+ F E WIKLLLS Sbjct: 133 RLGTKEICDPLCMVIYTCCDGNHECFGELCSDSGLPVVAEIVKTASSASFGEDWIKLLLS 192 Query: 879 KICLEGSHFESIFSKL----YPVSSNANCGDY-FADPQAFLLSILSEILNERIEDIAVDS 1043 +ICLE S +F KL P + + DY F+ QAFLL ILSEILNER+ D+ V Sbjct: 193 RICLEESQLPMLFPKLRFMDIPEGEDIDSKDYQFSFEQAFLLQILSEILNERLRDVVVSK 252 Query: 1044 KFPLHVLEILRTAAKAIDSVPRRESGLPTGHATIDVLGYSLIILRDICACDHGFKKVQEN 1223 L V + + + ++ R +SGLP+G +D LGYSL ILRDICA D + E+ Sbjct: 253 DVALFVYGVFKKSVGVLEHAVRGKSGLPSGSVAVDALGYSLTILRDICAHD-SVRGNPED 311 Query: 1224 SVDAMXXXXXXXXXXXXXXXXXXXXPPSTIKKGMKQSKTRDEGSCCSNSSKQCPYKGFRR 1403 + D + PP+ I+KG+KQS+ ++ SC SSK CPYKGFRR Sbjct: 312 TNDVVDVLLSQDIIELLLILLGDLEPPAIIRKGIKQSENQEGASC---SSKPCPYKGFRR 368 Query: 1404 DIVGILGNCAYRRKSVQDKIREENGILLLLQQCVSDEDNPYLREWGIWSIRNLLEGNAEN 1583 DIV ++GNC YRRK QD+IR NGILLLLQQCV+DEDNP+LREWGIWS+RN+LEGN EN Sbjct: 369 DIVSLIGNCVYRRKHAQDEIRGRNGILLLLQQCVTDEDNPFLREWGIWSVRNMLEGNEEN 428 Query: 1584 QRLVSDLEMQETVDVPELTGLGLKVEIDPQTRRAKLVNM 1700 Q++VS+L++Q + DVP+++ LGL++E+D +TRRAKLVN+ Sbjct: 429 QKVVSELQLQGSADVPQISALGLRIEVDQKTRRAKLVNV 467 >ref|XP_003551615.1| PREDICTED: ataxin-10-like [Glycine max] Length = 498 Score = 414 bits (1064), Expect = e-113 Identities = 218/462 (47%), Positives = 294/462 (63%), Gaps = 11/462 (2%) Frame = +3 Query: 348 AKTDDGRADLAGKNIVTVTLQLCRAXXXXXXXXXXXXX------FKLLRNLCAGELVNQN 509 AK+D GR +LA K I+ L + + FKLLRNLCAGE NQ+ Sbjct: 40 
AKSDSGRLELASKRILPAVLNIVHSLTHASHHHHHQHNHILCLSFKLLRNLCAGEAANQD 99 Query: 510 TFLEQGGVGIVSNIIASIKCLSDLDNQVIHFGLQLLGNFALGGGEHRYAVWKELFPNKLI 689 +FLE GV +V +++ S S D+ ++ +GLQ+L N +L G +H+ A+WKEL+ + + Sbjct: 100 SFLELDGVAVVCSVLRSEAACSGPDHGLVRWGLQVLANVSLAGKQHQCAIWKELYLDGFV 159 Query: 690 KIAGIHCMEIIDPLCMVIYTCTEGTDGLVAGLLTDQGFVIILEVIKTMSTVGFKEHWIKL 869 +A +H E DPLCMVIYTC +G L ++ G+ ++ E+++T S+ F E W+KL Sbjct: 160 SLARLHTKETCDPLCMVIYTCCDGNPEWFKRLSSEDGWFVMAEIVRTASSASFGEDWLKL 219 Query: 870 LLSKICLEGSHFESIFSKLY-----PVSSNANCGDYFADPQAFLLSILSEILNERIEDIA 1034 LLS+ICLE S +FSKL V + D+F+ QAFLL ILSEILNER +D+ Sbjct: 220 LLSRICLEESQLPVLFSKLQFADVPKVEVAESKDDHFSFEQAFLLRILSEILNERHKDVT 279 Query: 1035 VDSKFPLHVLEILRTAAKAIDSVPRRESGLPTGHATIDVLGYSLIILRDICACDHGFKKV 1214 V L V I + + ++ R +SGLP+G +DVLGYSL ILRDICA D G + Sbjct: 280 VSKDVALFVFGIFKNSIGVLEHATRGKSGLPSGFVGVDVLGYSLTILRDICAQD-GVRGN 338 Query: 1215 QENSVDAMXXXXXXXXXXXXXXXXXXXXPPSTIKKGMKQSKTRDEGSCCSNSSKQCPYKG 1394 E+S D + PP+ I+KG+KQ + +D SC S K CPYKG Sbjct: 339 TEDSNDVVDALLSYGLIELLLYLLEALEPPAIIRKGLKQCENQDGASC---SFKPCPYKG 395 Query: 1395 FRRDIVGILGNCAYRRKSVQDKIREENGILLLLQQCVSDEDNPYLREWGIWSIRNLLEGN 1574 FRRDIV ++GNC YRRK QD+IR NGILLLLQQCV+DEDNP+LREWGIWS+RN+LEGN Sbjct: 396 FRRDIVALIGNCVYRRKHAQDEIRHRNGILLLLQQCVTDEDNPFLREWGIWSVRNMLEGN 455 Query: 1575 AENQRLVSDLEMQETVDVPELTGLGLKVEIDPQTRRAKLVNM 1700 ENQ++V++LE+Q + DVPE+T LGL+VE+D +TRRAKLVN+ Sbjct: 456 DENQKVVAELEIQGSADVPEITSLGLRVEVDQRTRRAKLVNI 497 >gb|ESW20728.1| hypothetical protein PHAVU_005G009900g [Phaseolus vulgaris] Length = 498 Score = 407 bits (1046), Expect = e-111 Identities = 224/498 (44%), Positives = 300/498 (60%), Gaps = 9/498 (1%) Frame = +3 Query: 234 MDETFPTELPVPETITQWXXXXXXXXXXXXXXXXXIGIAKTDDGRADLAGKNIVTVTLQ- 410 +D TF E P+ E Q I AK+D GR +LA K I+ L Sbjct: 2 IDTTF-LEHPISEDTLQLLFQASNSSNLEKSLEILIQNAKSDSGRLELASKRILPAVLNI 60 Query: 411 ---LCRAXXXXXXXXXXXXXFKLLRNLCAGELVNQNTFLEQGGVGIVSNIIASIKCLSDL 581 L +A FKLLRNLCAGE NQ +F+E GV +V +++ S Sbjct: 61 
VQSLAQASHHHHHNQTFSLCFKLLRNLCAGEAANQVSFIELNGVAVVWSVLRSEAGSLGP 120 Query: 582 DNQVIHFGLQLLGNFALGGGEHRYAVWKELFPNKLIKIAGIHCMEIIDPLCMVIYTCTEG 761 D++++ +GLQ+L N +LGG +H+ A+W+EL+P +A + EI DPLCMVIYTC +G Sbjct: 121 DHRLVRWGLQVLANVSLGGKQHQRAIWEELYPIGFASLARVGTKEICDPLCMVIYTCCDG 180 Query: 762 TDGLVAGLLTDQGFVIILEVIKTMSTVGFKEHWIKLLLSKICLEGSHFESIFSKLYPVSS 941 L +D G+ ++ E+++T S+ F E W+KLLLS+I LE S +FSKL V Sbjct: 181 NPEWFKKLSSDDGWPVVAEIVRTASSASFDEDWLKLLLSRIFLEESQLPVLFSKLQSVDV 240 Query: 942 NA-----NCGDYFADPQAFLLSILSEILNERIEDIAVDSKFPLHVLEILRTAAKAIDSVP 1106 + F+ QAFLL ILSEILNER+ D+ V L V I + + ++ Sbjct: 241 PEGEVIESKNGQFSFEQAFLLQILSEILNERLGDVTVSEDVALFVFGIFKKSIGVLEHAM 300 Query: 1107 RRESGLPTGHATIDVLGYSLIILRDICACDHGFKKVQENSVDAMXXXXXXXXXXXXXXXX 1286 R +SGLP+G +DVLGYSL ILRDICA D ++ N+ D + Sbjct: 301 RGKSGLPSGFTGVDVLGYSLTILRDICAQDG----MRGNTKDVVDVLLSYGLIEFLLSLL 356 Query: 1287 XXXXPPSTIKKGMKQSKTRDEGSCCSNSSKQCPYKGFRRDIVGILGNCAYRRKSVQDKIR 1466 PP+ I+KG+KQ + +D SCCS K CPYKGFRRDIV ++GNC YRRK QD+IR Sbjct: 357 GALEPPAIIRKGLKQIENQDNASCCS---KPCPYKGFRRDIVALIGNCVYRRKHAQDEIR 413 Query: 1467 EENGILLLLQQCVSDEDNPYLREWGIWSIRNLLEGNAENQRLVSDLEMQETVDVPELTGL 1646 + NGILLLLQQCV+DEDNP+LREWGIWS+RN+LEGN ENQ+LV++LE+Q + DVPE+ L Sbjct: 414 DRNGILLLLQQCVTDEDNPFLREWGIWSVRNMLEGNDENQKLVAELEIQGSADVPEINAL 473 Query: 1647 GLKVEIDPQTRRAKLVNM 1700 GL+VE+D +TRR KLVN+ Sbjct: 474 GLQVEVDQRTRRPKLVNI 491 >ref|XP_004308721.1| PREDICTED: uncharacterized protein LOC101297970 [Fragaria vesca subsp. 
vesca] Length = 492 Score = 405 bits (1042), Expect = e-110 Identities = 224/495 (45%), Positives = 297/495 (60%), Gaps = 6/495 (1%) Frame = +3 Query: 234 MDETFPTELPVPETITQWXXXXXXXXXXXXXXXXXIGIAKTDDGRADLAGKNIVTVTLQL 413 MD T E VPE + Q + + KT DGR DL+ KN++ +QL Sbjct: 1 MDNTTLPECSVPEHVLQALLSVSNSSKLVDSLEDLVQVCKTADGREDLSAKNVLPTVIQL 60 Query: 414 CRAXXXXXXXXXXXXXFKLLRNLCAGELVNQNTFLEQGGVGIVSNIIASIKCLSDLDNQV 593 ++ +LLRNLCAGE+ NQN+F+EQ GV I+SNI++S L D + Sbjct: 61 VQSLSYPSDHYLLTLSLRLLRNLCAGEVANQNSFVEQNGVAIISNILSSASSLEP-DFGI 119 Query: 594 IHFGLQLLGNFALGGGEHRYAVWKELFPNKLIKIAGIHCMEIIDPLCMVIYTCTEGTDGL 773 I GLQ+L N AL G ++A+W++LF + +A + + PLCM+IY C +GT L Sbjct: 120 ICVGLQVLANVALAGERQQHAIWQQLFLENFVALARVRSQKTCGPLCMIIYACCDGTPEL 179 Query: 774 VAGLLTDQGFVIILEVIKTMSTVGFKEHWIKLLLSKICLEGSHFESIFSKLYPVSSNANC 953 VA L D G I+ E++KT + GF E W KLLLS+ICLE +F +F L V N N Sbjct: 180 VAQLCGDCGVTIVKEIVKTAAADGFGEDWYKLLLSRICLEEPYFRPLFFSLQHVGGNENG 239 Query: 954 GDY------FADPQAFLLSILSEILNERIEDIAVDSKFPLHVLEILRTAAKAIDSVPRRE 1115 D F + Q FLL +SEILNER+ +I V F L V I + + K + R Sbjct: 240 DDTEGGQESFLEEQEFLLKNVSEILNERLNEITVPDDFALCVFGIFKNSIKVLSYATRGR 299 Query: 1116 SGLPTGHATIDVLGYSLIILRDICACDHGFKKVQENSVDAMXXXXXXXXXXXXXXXXXXX 1295 SGLPTG IDVLGYSL ILRDICA + +++D + Sbjct: 300 SGLPTGSIDIDVLGYSLTILRDICA-QGTLRGCTVDTMDVVDALISYGLIELLLCLLRDL 358 Query: 1296 XPPSTIKKGMKQSKTRDEGSCCSNSSKQCPYKGFRRDIVGILGNCAYRRKSVQDKIREEN 1475 PP+ IKK + Q+K + EGS S +SK CPYKGFRRDIVG++GNC Y R+ VQD+IR ++ Sbjct: 359 EPPAIIKKSVNQAKDQ-EGSNYS-ASKPCPYKGFRRDIVGVIGNCLYGRQIVQDEIRRKD 416 Query: 1476 GILLLLQQCVSDEDNPYLREWGIWSIRNLLEGNAENQRLVSDLEMQETVDVPELTGLGLK 1655 G+LLLLQQCV+D+DNPYLREWGIW +RNLLE N ENQ+ V++LE+Q +VDVP+L LGL+ Sbjct: 417 GLLLLLQQCVTDDDNPYLREWGIWCVRNLLERNQENQQAVAELELQGSVDVPDLARLGLR 476 Query: 1656 VEIDPQTRRAKLVNM 1700 VE++P T R KLVN+ Sbjct: 477 VEMNPATGRPKLVNI 491 >ref|XP_003623813.1| Ataxin-10 [Medicago truncatula] gi|355498828|gb|AES80031.1| Ataxin-10 [Medicago truncatula] Length = 491 Score = 392 bits (1006), Expect = e-106 Identities 
= 209/420 (49%), Positives = 276/420 (65%), Gaps = 7/420 (1%) Frame = +3 Query: 462 FKLLRNLCAGELVNQNTFLEQGGVGIVSNIIASIKCLSDLDNQVIHFGLQLLGNFALGGG 641 FKLLRNLCAGE++NQN FLE GV IV + I + + D ++ +GLQ+L N L G Sbjct: 77 FKLLRNLCAGEILNQNMFLENDGVFIVVSSILRSEVVGS-DYMLVRWGLQVLANVCLAGK 135 Query: 642 EHRYAVWKELFPNKLIKIAGIHCMEIIDPLCMVIYTCTEGTDGLVAGLLTDQGFVIILEV 821 EH+ AVW E+FP + +A I E+ DPLCMVIYTC +G D + + +D G+ +++E+ Sbjct: 136 EHQKAVWDEMFPVGFLSVARIGKKEVNDPLCMVIYTCCDGNDQWFSEVCSDGGWNVLVEI 195 Query: 822 IKTMSTVGFKEHWIKLLLSKICLEGSHFESIFSKL----YPVSSNANC-GDYFADPQAFL 986 ++T S+ F E WIKLLLS+ICLE S +FSKL P + D F+ QAFL Sbjct: 196 VRTASSASFGEDWIKLLLSRICLEDSQLRVLFSKLRFMDIPDGEDTKTKDDQFSSEQAFL 255 Query: 987 LSILSEILNERIEDIAVDSKFPLHVLEILRTAAKAIDSVPRRESGLPTGHATIDVLGYSL 1166 L I+S+ILNERI D+ + + V I + + ++ R +SGLP+G +DVLGYSL Sbjct: 256 LQIISDILNERIGDVTISLEVASFVYGIFKKSIGVLEHAVRGKSGLPSGITDVDVLGYSL 315 Query: 1167 IILRDICACDHGFKKVQENSVDA--MXXXXXXXXXXXXXXXXXXXXPPSTIKKGMKQSKT 1340 +LRDICA D V+ NS D + PP+ I+KGMK S+ Sbjct: 316 TMLRDICAHD----SVRGNSEDTEVVDMLLSYGLIELVFILLGDLEPPTIIRKGMKHSEN 371 Query: 1341 RDEGSCCSNSSKQCPYKGFRRDIVGILGNCAYRRKSVQDKIREENGILLLLQQCVSDEDN 1520 D S +SSK CPYKGFRRDIV ++GNC YRRK VQD+IR NGILLLLQQCV+DEDN Sbjct: 372 PDGAS---SSSKPCPYKGFRRDIVALIGNCVYRRKHVQDEIRSRNGILLLLQQCVTDEDN 428 Query: 1521 PYLREWGIWSIRNLLEGNAENQRLVSDLEMQETVDVPELTGLGLKVEIDPQTRRAKLVNM 1700 PYLREWGIW +RN+LEGN ENQ+ +S+L++Q + DVPE++ LGL+VE+D +TRRAKLVN+ Sbjct: 429 PYLREWGIWCVRNMLEGNEENQKEISELQLQGSADVPEISALGLRVEVDQKTRRAKLVNV 488 >ref|XP_004306868.1| PREDICTED: ataxin-10-like [Fragaria vesca subsp. 
vesca] Length = 490 Score = 384 bits (986), Expect = e-104 Identities = 217/493 (44%), Positives = 290/493 (58%), Gaps = 4/493 (0%) Frame = +3 Query: 234 MDETFPTELPVPETITQWXXXXXXXXXXXXXXXXXIGIAKTDDGRADLAGKNIVTVTLQL 413 MD T E VPE + Q I + KT DGR DLA KN++ +QL Sbjct: 1 MDNTALPECSVPEDVIQALLSVSNSSNLVESMEDLIQVCKTADGREDLAAKNVLPTVIQL 60 Query: 414 CRAXXXXXXXXXXXXXFKLLRNLCAGELVNQNTFLEQGGVGIVSNIIASIKCLSDLDNQV 593 ++ +LLRNLCAGE+ NQN+F+EQ GV IVSNI++S L D + Sbjct: 61 VQSLLYPSDHYLLTLSLRLLRNLCAGEVANQNSFVEQNGVAIVSNILSSAISLEP-DFWI 119 Query: 594 IHFGLQLLGNFALGGGEHRYAVWKELFPNKLIKIAGIHCMEIIDPLCMVIYTCTEGTDGL 773 I GLQ+L N AL G ++A+W++LF K + +A + + PLCM+I TC +GT L Sbjct: 120 ICVGLQVLANAALAGERQQHAIWQQLFSEKFVALARVRSKKTCGPLCMIISTCCDGTPEL 179 Query: 774 VAGLLTDQGFVIILEVIKTMSTVGFKEHWIKLLLSKICLEGSHFESIFSKLYPVSSNANC 953 VA L D G I+ E++KT + V F E W KLLLS+ICL +F +F L V NA Sbjct: 180 VAQLCGDCGVTILKEIVKTAAAVDFGEDWYKLLLSRICLVEPYFRPLFFSLEHVGENAED 239 Query: 954 GD----YFADPQAFLLSILSEILNERIEDIAVDSKFPLHVLEILRTAAKAIDSVPRRESG 1121 + F+ Q FLL +SEILNE + +I V + F L V I + + K + R SG Sbjct: 240 TEGGRESFSKEQEFLLKNVSEILNECLSEITVPNDFALCVFGIFKNSIKVLSYATRGRSG 299 Query: 1122 LPTGHATIDVLGYSLIILRDICACDHGFKKVQENSVDAMXXXXXXXXXXXXXXXXXXXXP 1301 LPTG IDVLGYSL ILRD CA + ++++D + P Sbjct: 300 LPTGSIDIDVLGYSLTILRDTCA-QGTLRGSTKDTMDVVDALISYGLIELLLSLLRDLEP 358 Query: 1302 PSTIKKGMKQSKTRDEGSCCSNSSKQCPYKGFRRDIVGILGNCAYRRKSVQDKIREENGI 1481 P+ IKK + Q++ ++ S S++ K CPYKGFRRDIV ++GNC Y RK VQD+IR ++G+ Sbjct: 359 PAIIKKSINQAENQEGSS--SSTLKPCPYKGFRRDIVAVIGNCLYGRKIVQDEIRRKDGL 416 Query: 1482 LLLLQQCVSDEDNPYLREWGIWSIRNLLEGNAENQRLVSDLEMQETVDVPELTGLGLKVE 1661 LLLLQQCV D+DNPY REWGIW RNLL+ N ENQR V++LE++ +VDVP L LGL+VE Sbjct: 417 LLLLQQCVIDDDNPYSREWGIWCQRNLLDRNQENQRAVAELELKGSVDVPALARLGLRVE 476 Query: 1662 IDPQTRRAKLVNM 1700 ++ T R KLVN+ Sbjct: 477 MNLATGRPKLVNI 489 >ref|XP_002875041.1| hypothetical protein ARALYDRAFT_490543 [Arabidopsis lyrata subsp. 
lyrata] gi|297320878|gb|EFH51300.1| hypothetical protein ARALYDRAFT_490543 [Arabidopsis lyrata subsp. lyrata] Length = 474 Score = 381 bits (978), Expect = e-103 Identities = 200/453 (44%), Positives = 284/453 (62%), Gaps = 3/453 (0%) Frame = +3 Query: 348 AKTDDGRADLAGKNIVTVTLQLCRAXXXXXXXXXXXXXFKLLRNLCAGELVNQNTFLEQG 527 +KTD GR+DLA K I+ L+L + K+LRNLCAGE+ NQN+F++ Sbjct: 34 SKTDSGRSDLASKCILPSILRLLQLLPYPSSRHYLNLSLKVLRNLCAGEVSNQNSFVDHD 93 Query: 528 GVGIVSNIIASIKCLSDLDNQVIHFGLQLLGNFALGGGEHRYAVWKELFPNKLIKIAGIH 707 G IVS ++ S + D + + FGLQ+L N L G + + VW FP + + IA I Sbjct: 94 GSVIVSELLDS----AIADFETVRFGLQVLANVVLFGEKRQRDVWLRFFPERFLSIAKIR 149 Query: 708 CMEIIDPLCMVIYTCTEGTDGLVAGLLTDQGFVIILEVIKTMSTVG-FKEHWIKLLLSKI 884 E DPLCM++YTC +G+ + + L + +G II E ++T S+VG +++W+KLL+S+I Sbjct: 150 RRETCDPLCMILYTCFDGSSEIASELCSSEGLTIIAETLRTSSSVGSVEDYWLKLLVSRI 209 Query: 885 CLEGSHFESIFSKLYPVSSNANCGDYFADPQAFLLSILSEILNERIEDIAVDSKFPLHVL 1064 C+E +F +FSKLY V+ N + F QAFLL I+S+I NERI +A+ +L Sbjct: 210 CVEDDYFPKLFSKLYKVAEN----EKFTSEQAFLLRIVSDIANERIGKVAIPKDTASSIL 265 Query: 1065 EILRTAAKAIDSVPRRESGLPTGHATIDVLGYSLIILRDICACD--HGFKKVQENSVDAM 1238 + + + D V S LPTG +DV+GYSL+I+RD CA K ++S D + Sbjct: 266 GLFKQSVDVFDFVSGERSELPTGSTIVDVMGYSLVIIRDACAGGSLEELNKDNKDSGDTV 325 Query: 1239 XXXXXXXXXXXXXXXXXXXXPPSTIKKGMKQSKTRDEGSCCSNSSKQCPYKGFRRDIVGI 1418 PP+TIKK + QS T S+S K CPY+GFRRDIV + Sbjct: 326 ELLLSSGLIELLLDLLRKLDPPTTIKKALNQSPTS------SSSFKPCPYRGFRRDIVSV 379 Query: 1419 LGNCAYRRKSVQDKIREENGILLLLQQCVSDEDNPYLREWGIWSIRNLLEGNAENQRLVS 1598 +GNCAYRRK VQD+IRE +G++L+LQQCV+D++NP+LREWG+W +RNLLEGN ENQ +V+ Sbjct: 380 IGNCAYRRKEVQDEIRERDGLVLMLQQCVTDDENPFLREWGLWCVRNLLEGNPENQEVVA 439 Query: 1599 DLEMQETVDVPELTGLGLKVEIDPQTRRAKLVN 1697 +LE++ +VDVP+L +GL+VEIDP+T R KLVN Sbjct: 440 ELEIKGSVDVPQLREIGLRVEIDPKTARPKLVN 472 >ref|NP_567156.1| protein MATERNAL EFFECT EMBRYO ARREST 50 [Arabidopsis thaliana] gi|3193319|gb|AAC19301.1| contains similarity to mouse brain protein E46 (GB:X61506) [Arabidopsis thaliana] 
gi|26451586|dbj|BAC42890.1| unknown protein [Arabidopsis thaliana] gi|28973257|gb|AAO63953.1| unknown protein [Arabidopsis thaliana] gi|332656441|gb|AEE81841.1| maternal effect embryo arrest 50 protein [Arabidopsis thaliana] Length = 475 Score = 379 bits (974), Expect = e-102 Identities = 198/453 (43%), Positives = 286/453 (63%), Gaps = 3/453 (0%) Frame = +3 Query: 348 AKTDDGRADLAGKNIVTVTLQLCRAXXXXXXXXXXXXXFKLLRNLCAGELVNQNTFLEQG 527 +KTD GR+DLA K+I+ L+L + K+LRNLCAGE+ NQN+F++ Sbjct: 34 SKTDSGRSDLASKSILPSILRLLQLLPYPSSRHYLNLSLKVLRNLCAGEVSNQNSFVDHD 93 Query: 528 GVGIVSNIIASIKCLSDLDNQVIHFGLQLLGNFALGGGEHRYAVWKELFPNKLIKIAGIH 707 G IVS+++ S + D + + FGLQ+L N L G + + VW +P + + IA I Sbjct: 94 GSAIVSDLLDS----AIADFETVRFGLQVLANVVLFGEKRQRDVWLRFYPERFLSIAKIR 149 Query: 708 CMEIIDPLCMVIYTCTEGTDGLVAGLLTDQGFVIILEVIKTMSTVG-FKEHWIKLLLSKI 884 E DPLCM++YTC +G+ + + L + QG II E ++T S+VG +++W+KLL+S+I Sbjct: 150 KRETFDPLCMILYTCVDGSSEIASELCSCQGLTIIAETLRTSSSVGSVEDYWLKLLVSRI 209 Query: 885 CLEGSHFESIFSKLYPVSSNANCGDYFADPQAFLLSILSEILNERIEDIAVDSKFPLHVL 1064 C+E +F +FSKLY + N + F+ QAFL+ ++S+I NERI +++ +L Sbjct: 210 CVEDGYFLKLFSKLYEDAEN----EIFSSEQAFLVRMVSDIANERIGKVSIPKDTACSIL 265 Query: 1065 EILRTAAKAIDSVPRRESGLPTGHATIDVLGYSLIILRDICACDH--GFKKVQENSVDAM 1238 + R + D V S LPTG +DV+GYSL+I+RD CA K+ ++S D + Sbjct: 266 GLFRQSVDVFDFVSGERSELPTGSTIVDVMGYSLVIIRDACAGGRLEELKEDNKDSGDTV 325 Query: 1239 XXXXXXXXXXXXXXXXXXXXPPSTIKKGMKQSKTRDEGSCCSNSSKQCPYKGFRRDIVGI 1418 PP+TIKK + QS S S+S K CPY+GFRRDIV + Sbjct: 326 ELLLSSGLIELLLDLLSKLDPPTTIKKALNQSP-----SSSSSSLKPCPYRGFRRDIVSV 380 Query: 1419 LGNCAYRRKSVQDKIREENGILLLLQQCVSDEDNPYLREWGIWSIRNLLEGNAENQRLVS 1598 +GNCAYRRK VQD+IRE +G+ L+LQQCV+D++NP+LREWG+W IRNLLEGN ENQ +V+ Sbjct: 381 IGNCAYRRKEVQDEIRERDGLFLMLQQCVTDDENPFLREWGLWCIRNLLEGNPENQEVVA 440 Query: 1599 DLEMQETVDVPELTGLGLKVEIDPQTRRAKLVN 1697 +LE++ +VDVP+L +GL+VEIDP+T R KLVN Sbjct: 441 ELEIKGSVDVPQLREIGLRVEIDPKTARPKLVN 473