BLASTX nr result
ID: Papaver32_contig00022377
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Papaver32_contig00022377
         (1969 letters)

Database: ./nr
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

XP_002305483.2 hypothetical protein POPTR_0004s17490g [Populus t...  384   e-113
XP_011037260.1 PREDICTED: uncharacterized protein LOC105134514 i...  383   e-113
XP_011037259.1 PREDICTED: uncharacterized protein LOC105134514 i...  382   e-112
XP_010268229.1 PREDICTED: uncharacterized protein LOC104605241 [...  371   e-109
EEF33131.1 conserved hypothetical protein [Ricinus communis]         368   e-107
GAV60305.1 Urb2 domain-containing protein, partial [Cephalotus f...  359   e-104
XP_016707646.1 PREDICTED: uncharacterized protein LOC107922238 [...  350   e-101
XP_007041935.2 PREDICTED: uncharacterized protein LOC18607616 is...  349   e-101
XP_017971647.1 PREDICTED: uncharacterized protein LOC18607616 is...  348   e-101
KJB15703.1 hypothetical protein B456_002G192700 [Gossypium raimo...  346   e-100
XP_012467517.1 PREDICTED: uncharacterized protein LOC105785869 [...  346   e-100
EOX97769.1 Urb2/Npa2, putative isoform 5 [Theobroma cacao]           342   1e-99
XP_017622784.1 PREDICTED: uncharacterized protein LOC108466921 [...  345   1e-99
EOX97768.1 Urb2/Npa2, putative isoform 4 [Theobroma cacao]           343   1e-99
EOX97766.1 Urb2/Npa2, putative isoform 2 [Theobroma cacao]           343   5e-99
EOX97765.1 Urb2/Npa2, putative isoform 1 [Theobroma cacao]           343   5e-99
EOX97767.1 Urb2/Npa2, putative isoform 3 [Theobroma cacao]           342   9e-99
XP_016701544.1 PREDICTED: uncharacterized protein LOC107916705 [...  338   2e-97
XP_015160859.1 PREDICTED: uncharacterized protein LOC102601821 i...
319 1e-90 XP_006367335.1 PREDICTED: uncharacterized protein LOC102601821 i... 319 1e-90 >XP_002305483.2 hypothetical protein POPTR_0004s17490g [Populus trichocarpa] EEE85994.2 hypothetical protein POPTR_0004s17490g [Populus trichocarpa] Length = 2070 Score = 384 bits (985), Expect = e-113 Identities = 254/715 (35%), Positives = 382/715 (53%), Gaps = 67/715 (9%) Frame = +3 Query: 12 VIKHKTGKSHDVGKISLHEITIELLHDSLFYEQTFLCRHLPSCFSDVLKQFLVSIFGDTL 191 V +H T ++H + KIS+H+I+ ELL DS+ YE F+ RHL S F ++L++ ++ +FGD Sbjct: 1078 VERHHTNEAHFLDKISVHQISAELLADSVLYEHKFVRRHLASRFCNLLEKSILPLFGDVK 1137 Query: 192 AGQTDINEWEVILNKLEKTSLALNRRHVADDAVLPMTPDS---------LCPDSPFIKCT 344 + +W+ L+ LE + + L+R+ D + P S + +S +K T Sbjct: 1138 LNMSP--KWKEGLSALENSYVVLSRKSSTCDELTGGKPASHLLSEMAADISRESTAVKFT 1195 Query: 345 KEXXXXXXXXXXXSWMPKGYLNSESFSDCATNLLNLERVLVVMLLQDQGELKGDRYYELF 524 WMPKGY+NS+SFS T+ LNLER+++ LL+ + YEL Sbjct: 1196 A----CQSLLRLLCWMPKGYINSKSFSLYVTSTLNLERLVIGHLLECGDSFFSHKQYELL 1251 Query: 525 RLFVSCRKALRCLLMSFHDASGEAERCSIIPIFSGNSYAVLWLWKSVTALTELLSVFPEE 704 RL V+CR+AL+CL+M++ + ++IP+ + ++VLWL +SV+ + L E+ Sbjct: 1252 RLLVACRRALKCLIMAYCEEKVRTTHSALIPVLFEDVHSVLWLSRSVSVVFRLQETLSED 1311 Query: 705 HATEVRHLSLSLMDDTSYIQSTLSKEQFLFSPS-----------STDGFSKNGSSGESNP 851 A EV + SLMD TSY+ TLSK Q + S ++D + S ES P Sbjct: 1312 KACEVADMIFSLMDHTSYVFLTLSKYQCPSAVSIIAEKPYTEQLNSDVTQEQSSVNESLP 1371 Query: 852 SLGTW------------MESLKTHAEVLLLKIADLNIVE------------VSSLVTSIH 959 L T ESLK A+ L++ + D + E +SS+V+ Sbjct: 1372 CLDTSNDVESCKSVILIAESLKEQAQDLIISLKDAHCNEKSSDEIDVDWNKLSSMVSCFS 1431 Query: 960 GLLWGITSVLRSGYEKGTDGPEDSFK-----DSKLKQFISASERFINSCLHALFVMDYQH 1124 G +WG+ S L +D + SK+ I+A FI H LFV D Sbjct: 1432 GFMWGLASALDHSNATDSDYKAKLLRWKCEVISKISHCINAFADFICFSFHMLFVKDDLQ 1491 Query: 1125 PNY---SKNLVDKD-------------LSIKKKRSSSDNADFTIDVL--IDSYERQXXXX 1250 PN+ + N V D +++ K S S+N +L +DSYE Sbjct: 1492 PNHLSATGNFVKSDDRDSSLVSGDSWKVTVNKHGSQSENVTSIAGILSKLDSYECLPLNK 1551 Query: 1251 
XXXXXXXKGDNPELAVCVRQLLIGSSAILSLKMQIYHRDFYSSSMPVSIAISQYMLTEFA 1430 +GD+P+ AV +RQLLI +SAI+ L ++ SS +P ISQ +L + A Sbjct: 1552 EWLQSFLEGDHPKAAVLIRQLLIAASAIVKLNLETKCTPLLSSLVPSFTGISQVLLLKLA 1611 Query: 1431 NMVQEPHTFSYICLDGAVKYLEVVGSYVSPKDSKKLQNVYARLIGIHLPAIGKCISLCGK 1610 + + P FS++ LDG +KYL+ +GS+ + +NV+++L+ +HL A+GKCISL GK Sbjct: 1612 DGTEVPKPFSFVWLDGVLKYLQELGSHFPITNPTSTRNVFSKLLELHLKALGKCISLQGK 1671 Query: 1611 EATLASHDTESSTKTLMSQTGSSELSIGHGWNSLNEFRARLRLSFKVLLKEPSKLHRMTA 1790 EATL SHD E ST TL S GS+ LS + L+EF+ARLR+SFK L+++PS+LH ++A Sbjct: 1672 EATLTSHDKELSTNTLHSHIGSASLSHPY---YLDEFKARLRMSFKSLIRKPSELHLLSA 1728 Query: 1791 IEVVKKALVGVNNRCSMVYEIYTGGPDEGKVSAAVAAGVDCFDSCIESVSDRDSL 1955 I+ +++ALVGV C ++YEI TG D GKVS+ VAAG+DC D +E VS R L Sbjct: 1729 IQAIERALVGVYEGCPIIYEITTGNVDGGKVSSTVAAGIDCLDLVLEYVSGRKRL 1783 >XP_011037260.1 PREDICTED: uncharacterized protein LOC105134514 isoform X2 [Populus euphratica] Length = 2047 Score = 383 bits (983), Expect = e-113 Identities = 253/706 (35%), Positives = 375/706 (53%), Gaps = 58/706 (8%) Frame = +3 Query: 12 VIKHKTGKSHDVGKISLHEITIELLHDSLFYEQTFLCRHLPSCFSDVLKQFLVSIFGDTL 191 V +H T ++H + KIS+H+I+ ELL DS+ YE F+ RHL S F ++L++ ++ +FGD Sbjct: 1068 VERHHTNEAHFLDKISVHQISAELLADSVLYEHKFVRRHLASRFCNLLEKSILPLFGDVK 1127 Query: 192 AGQTDINEWEVILNKLEKTSLALNRRHVADDAVLPMTPDSLCPDSPFIKCTKEXXXXXXX 371 + +W+ L+ LE + L R+ T D L D + Sbjct: 1128 LNMSP--KWKEGLSALENSYFVLGRKS--------STCDELTADISRESTAVKFAACQSL 1177 Query: 372 XXXXSWMPKGYLNSESFSDCATNLLNLERVLVVMLLQDQGELKGDRYYELFRLFVSCRKA 551 WMPKGY+NS+SFS AT+ LNLER+++ LL+ + YEL RL V+CR+A Sbjct: 1178 LRLLCWMPKGYINSKSFSLYATSTLNLERLVIGHLLECGDSFFSHKQYELLRLLVACRRA 1237 Query: 552 LRCLLMSFHDASGEAERCSIIPIFSGNSYAVLWLWKSVTALTELLSVFPEEHATEVRHLS 731 L+CLLM++ + ++IP+ + ++VLWL +SV+ + L E+ A EV + Sbjct: 1238 LKCLLMAYCEEKVRTTHSALIPVLFEDVHSVLWLSRSVSVVFRLQETLSEDKACEVADMI 1297 Query: 732 LSLMDDTSYIQSTLSKEQFLFSPS-----------STDGFSKNGSSGESNPSLGTW---- 866 SLMD TSY+ TLSK Q + S ++D + S ES P L T Sbjct: 1298 
FSLMDHTSYVFLTLSKYQCPSAVSIIAEKPHTEQLNSDATQEQSSVNESPPCLDTSNDVE 1357 Query: 867 --------MESLKTHAEVLLLKIADLNIVE------------VSSLVTSIHGLLWGITSV 986 ESLK A+ L++ + D + E +SS+V+ G +WG+ S Sbjct: 1358 SCKSILLIAESLKEQAQDLIISLKDAHCNEKSSDEIDVDWNKLSSMVSCFSGFMWGLASA 1417 Query: 987 LRSGYEKGTDGPEDSFK-----DSKLKQFISASERFINSCLHALFVMDYQHPNY---SKN 1142 L G D + SK+ I+A FI H LFV D PN+ + N Sbjct: 1418 LDHSNATGGDYKVKLLRWKCEVISKISHCINAFADFICFSFHMLFVKDDLQPNHLSATGN 1477 Query: 1143 LVDKD-------------LSIKKKRSSSDNADFTIDVL--IDSYERQXXXXXXXXXXXKG 1277 V D +++ K S S+N +L +DSYE +G Sbjct: 1478 FVKSDDRDSSLVSGDAWKVTVNKHCSWSENVTSIAGILSKLDSYECLPLNKEWLQSFLEG 1537 Query: 1278 DNPELAVCVRQLLIGSSAILSLKMQIYHRDFYSSSMPVSIAISQYMLTEFANMVQEPHTF 1457 D+P+ AV +RQLLI +SAI+ L ++ SS +P ISQ +L + A+ + P F Sbjct: 1538 DHPKAAVLIRQLLIAASAIVKLNLETKCTPLLSSLVPSFTGISQVLLLKLADGTEVPKPF 1597 Query: 1458 SYICLDGAVKYLEVVGSYVSPKDSKKLQNVYARLIGIHLPAIGKCISLCGKEATLASHDT 1637 S++ LDG +KYL+ +GS+ + +NV+++L+ +HL A+GKCISL GKEATL SHD Sbjct: 1598 SFVWLDGVLKYLQELGSHFPITNPTSTRNVFSKLLELHLKALGKCISLQGKEATLTSHDK 1657 Query: 1638 ESSTKTLMSQTGSSELSIGHGWNSLNEFRARLRLSFKVLLKEPSKLHRMTAIEVVKKALV 1817 E ST TL S GS+ LS + L+EF+ARLR+SF+ L+++PS+LH ++AI+ +++ALV Sbjct: 1658 ELSTNTLHSHIGSASLSHPY---YLDEFKARLRMSFRSLIRKPSELHLLSAIQAIERALV 1714 Query: 1818 GVNNRCSMVYEIYTGGPDEGKVSAAVAAGVDCFDSCIESVSDRDSL 1955 GV C ++YEI TG D KVS+ VAAG+DC D +E VS R L Sbjct: 1715 GVYEGCPIIYEITTGNVDGRKVSSTVAAGIDCLDLVLEYVSGRKRL 1760 >XP_011037259.1 PREDICTED: uncharacterized protein LOC105134514 isoform X1 [Populus euphratica] Length = 2060 Score = 382 bits (980), Expect = e-112 Identities = 253/711 (35%), Positives = 377/711 (53%), Gaps = 63/711 (8%) Frame = +3 Query: 12 VIKHKTGKSHDVGKISLHEITIELLHDSLFYEQTFLCRHLPSCFSDVLKQFLVSIFGDTL 191 V +H T ++H + KIS+H+I+ ELL DS+ YE F+ RHL S F ++L++ ++ +FGD Sbjct: 1068 VERHHTNEAHFLDKISVHQISAELLADSVLYEHKFVRRHLASRFCNLLEKSILPLFGDVK 1127 Query: 192 AGQTDINEWEVILNKLEKTSLALNRRHVADDAVLPMTPDS-----LCPDSPFIKCTKEXX 356 + +W+ L+ LE + L R+ D + P S + D + Sbjct: 1128 
LNMSP--KWKEGLSALENSYFVLGRKSSTCDELTGDKPASHLLSEMTADISRESTAVKFA 1185 Query: 357 XXXXXXXXXSWMPKGYLNSESFSDCATNLLNLERVLVVMLLQDQGELKGDRYYELFRLFV 536 WMPKGY+NS+SFS AT+ LNLER+++ LL+ + YEL RL V Sbjct: 1186 ACQSLLRLLCWMPKGYINSKSFSLYATSTLNLERLVIGHLLECGDSFFSHKQYELLRLLV 1245 Query: 537 SCRKALRCLLMSFHDASGEAERCSIIPIFSGNSYAVLWLWKSVTALTELLSVFPEEHATE 716 +CR+AL+CLLM++ + ++IP+ + ++VLWL +SV+ + L E+ A E Sbjct: 1246 ACRRALKCLLMAYCEEKVRTTHSALIPVLFEDVHSVLWLSRSVSVVFRLQETLSEDKACE 1305 Query: 717 VRHLSLSLMDDTSYIQSTLSKEQFLFSPS-----------STDGFSKNGSSGESNPSLGT 863 V + SLMD TSY+ TLSK Q + S ++D + S ES P L T Sbjct: 1306 VADMIFSLMDHTSYVFLTLSKYQCPSAVSIIAEKPHTEQLNSDATQEQSSVNESPPCLDT 1365 Query: 864 W------------MESLKTHAEVLLLKIADLNIVE------------VSSLVTSIHGLLW 971 ESLK A+ L++ + D + E +SS+V+ G +W Sbjct: 1366 SNDVESCKSILLIAESLKEQAQDLIISLKDAHCNEKSSDEIDVDWNKLSSMVSCFSGFMW 1425 Query: 972 GITSVLRSGYEKGTDGPEDSFK-----DSKLKQFISASERFINSCLHALFVMDYQHPNY- 1133 G+ S L G D + SK+ I+A FI H LFV D PN+ Sbjct: 1426 GLASALDHSNATGGDYKVKLLRWKCEVISKISHCINAFADFICFSFHMLFVKDDLQPNHL 1485 Query: 1134 --SKNLVDKD-------------LSIKKKRSSSDNADFTIDVL--IDSYERQXXXXXXXX 1262 + N V D +++ K S S+N +L +DSYE Sbjct: 1486 SATGNFVKSDDRDSSLVSGDAWKVTVNKHCSWSENVTSIAGILSKLDSYECLPLNKEWLQ 1545 Query: 1263 XXXKGDNPELAVCVRQLLIGSSAILSLKMQIYHRDFYSSSMPVSIAISQYMLTEFANMVQ 1442 +GD+P+ AV +RQLLI +SAI+ L ++ SS +P ISQ +L + A+ + Sbjct: 1546 SFLEGDHPKAAVLIRQLLIAASAIVKLNLETKCTPLLSSLVPSFTGISQVLLLKLADGTE 1605 Query: 1443 EPHTFSYICLDGAVKYLEVVGSYVSPKDSKKLQNVYARLIGIHLPAIGKCISLCGKEATL 1622 P FS++ LDG +KYL+ +GS+ + +NV+++L+ +HL A+GKCISL GKEATL Sbjct: 1606 VPKPFSFVWLDGVLKYLQELGSHFPITNPTSTRNVFSKLLELHLKALGKCISLQGKEATL 1665 Query: 1623 ASHDTESSTKTLMSQTGSSELSIGHGWNSLNEFRARLRLSFKVLLKEPSKLHRMTAIEVV 1802 SHD E ST TL S GS+ LS + L+EF+ARLR+SF+ L+++PS+LH ++AI+ + Sbjct: 1666 TSHDKELSTNTLHSHIGSASLSHPY---YLDEFKARLRMSFRSLIRKPSELHLLSAIQAI 1722 Query: 1803 KKALVGVNNRCSMVYEIYTGGPDEGKVSAAVAAGVDCFDSCIESVSDRDSL 1955 ++ALVGV C ++YEI TG D KVS+ VAAG+DC D +E VS R L Sbjct: 1723 
ERALVGVYEGCPIIYEITTGNVDGRKVSSTVAAGIDCLDLVLEYVSGRKRL 1773 >XP_010268229.1 PREDICTED: uncharacterized protein LOC104605241 [Nelumbo nucifera] Length = 2131 Score = 371 bits (953), Expect = e-109 Identities = 268/772 (34%), Positives = 387/772 (50%), Gaps = 119/772 (15%) Frame = +3 Query: 9 DVIKHKTGKSHDVGKISLHEITIELLHDSLFYEQTFLCRHLPSCFSDVLKQFLVSIFGDT 188 D+ K K+ +++ +I+++LL D++ YEQT LCRHL S F L++ + T Sbjct: 1095 DIEKQNMDKTLHPRTVTMQQISLKLLRDNVLYEQTILCRHLTSRFCRTLEKSISPFLICT 1154 Query: 189 LAGQTDIN---EWEVILNKLEKTSLALNRRHVADDAVLPMTPDSLCP---------DSPF 332 D N +W ++ LE LN H D PDS + Sbjct: 1155 SFKSFDFNLPPDWGTDVSMLEN----LNSTHGMHDGSSLSEPDSFQSCLSIEHHNGEKAS 1210 Query: 333 IKCTKEXXXXXXXXXXXSWMPKGYLNSESFSDCATNLLNLERVLVVMLLQDQGELKGDRY 512 + E WMPK + NS S AT +LNLE+ ++ LL QG+L + Sbjct: 1211 SLTSMELTACQNLLDLLCWMPKCHANSRSLLIYATYILNLEKFVICSLLNVQGKLFLNSC 1270 Query: 513 YELFRLFVSCRKALRCLLMSFHDASGEAERCSIIPIFSGNSYAVLWLWKSVTALTELLSV 692 YELFRLF+SCR+AL+ L+M + + A+ S++ I +S++V+WL KSV+A+ Sbjct: 1271 YELFRLFLSCRRALKYLVMVSCEETIGAQESSLVSILFDSSFSVIWLLKSVSAIGGFSYS 1330 Query: 693 FPEEHATEVRHLSLSLMDDTSYIQSTLSKEQFLFSPSST------------------DGF 818 E A++++ + SLMD TSY+ TL K Q + S + Sbjct: 1331 LLGEQASQMKDIFFSLMDHTSYVFLTLIKHQSGLAIGSLTYERPQLKLPNFVLLREQNNI 1390 Query: 819 SKNGSSGESNPSLGTWM------ESLKTHAEVLL---------------LKIADLNIVEV 935 + S + + TW ++LK + +L + +ADLN ++ Sbjct: 1391 IEAEPSDDFSKQFDTWKVVILVAKALKEQTKSVLDALKNNSCNTKLEAGVSVADLN--KL 1448 Query: 936 SSLVTSIHGLLWGITSVLRSGYEKGTDGPEDSFKD-----SKLKQFISASERFINSC--- 1091 SS V+ G LWG+ S L S EK S S++ IS E F+N C Sbjct: 1449 SSTVSCFQGFLWGLASSLNSIDEKCCPVKTKSLIQKLGHMSEISLCISVCEDFMNFCLRK 1508 Query: 1092 --------------LHALFVMDY------------------------QHPNY-------- 1133 LH L +D+ Q N+ Sbjct: 1509 LLFENGQQPQGLSDLHNLPKIDHLTGSLIFKESLNISGDEIMNSSGKQEENFPGRMDGSA 1568 Query: 1134 ---------SKNLVDKDLSIKKKRSSSDNADFTIDVL--IDSYERQXXXXXXXXXXXKGD 1280 +KN K S +++ D+A+ + +L +DS+E + KG+ Sbjct: 1569 SETDDDHESTKNSDVKSSSFQEEGLQIDHAECAVSILTAVDSFELEHLKKSLVCGLLKGE 1628 Query: 1281 
NPELAVCVRQLLIGSSAILSLKMQIYHRDFYSSSMPVSIAISQYMLTEFANMVQEPHTFS 1460 NPE+A VRQL I SSAIL LK+ I S+ P+ I ISQ++L EFA+MV+ PH+FS Sbjct: 1629 NPEVAFLVRQLFIASSAILGLKLLIDFNPLSSTLTPLFIGISQFVLLEFADMVEVPHSFS 1688 Query: 1461 YICLDGAVKYLEVVGSYVSPKDSKKLQNVYARLIGIHLPAIGKCISLCGKEATLASHDTE 1640 ++ LDG +KYLEV+G+ S + +NVYA+L+ IHL AIG+CISL GK ATLASHDTE Sbjct: 1689 FVWLDGILKYLEVLGNNFSITNPTSSRNVYAKLVDIHLRAIGRCISLQGKRATLASHDTE 1748 Query: 1641 SSTKTLMSQTGSSELSIGHGWNSLNEFRARLRLSFKVLLKEPSKLHRMTAIEVVKKALVG 1820 SSTKTL Q S+ HG +L+EF+ARLR+SFKVL+++P +LH ++A++ +++ALVG Sbjct: 1749 SSTKTLQGQMEPLGSSLCHGPYNLDEFKARLRMSFKVLIRKPLELHLLSAMQAIERALVG 1808 Query: 1821 VNNRCSMVYEIYTGGPDEGKVSAAVAAGVDCFDSCIESVSDRDSL---KNHI 1967 V C+M+YEI+TG D GKVS VAAGVDC DS +ESVS R L K HI Sbjct: 1809 VQEGCNMIYEIHTGSQDGGKVSPVVAAGVDCLDSILESVSGRKRLSVVKRHI 1860 >EEF33131.1 conserved hypothetical protein [Ricinus communis] Length = 2057 Score = 368 bits (944), Expect = e-107 Identities = 242/692 (34%), Positives = 368/692 (53%), Gaps = 52/692 (7%) Frame = +3 Query: 21 HKTGKSHDVGKISLHEITIELLHDSLFYEQTFLCRHLPSCFSDVLKQFLVSIFGDTLAGQ 200 +KTG++ + KI++H+I+ ELL +S+ YE F+ RHL S F +LK +++IF D Sbjct: 1087 NKTGEAGFLNKITVHQISSELLINSILYEHNFVRRHLASRFCHLLKNSVLAIFNDFSIMD 1146 Query: 201 TDINE---WEVILNKLEKTSLA-LNRRHVADDAVL---PMTPDS--LCPDSPFIKCTKEX 353 DIN W+ +L+ + +A L +HV D + P++P S + D+ + Sbjct: 1147 VDINSFPNWQEVLSTVGSLPMAILESKHVTFDELSEERPISPLSSKIAADNSMESPDMKF 1206 Query: 354 XXXXXXXXXXSWMPKGYLNSESFSDCATNLLNLERVLVVMLLQDQGELKGDRYYELFRLF 533 W+PKGY+NS SFS T LLNLER ++ + + G + +EL RL Sbjct: 1207 RACQSLLKLLCWLPKGYMNSRSFSIYVTYLLNLERYIISSISECTGAMSSYNLFELLRLL 1266 Query: 534 VSCRKALRCLLMSFHDASGEAERCSIIPIFSGNSYAVLWLWKSVTALTELLSVFPEEHAT 713 +SCR+AL+ L+M+ + S+ P+ S ++VLWL+KSV + L F ++ + Sbjct: 1267 ISCRRALKYLVMALSEEKTITSHSSVTPVLSEGLFSVLWLFKSVFMVVGLQETFSKDDSD 1326 Query: 714 EVRHLSLSLMDDTSY-------------IQSTLSKEQFLFSPSSTDGFSKNGSSGESNPS 854 E+ + SLMD TSY I+S +SKE ++ + +S ES+ Sbjct: 1327 EIGEMIFSLMDHTSYLFLELSKHSCTCAIRSIISKEPHK-EQTNVRSVQEVSTSNESDSR 1385 Query: 855 
LGTW------------MESLKTHAEVLLLKIAD-------------LNIVEVSSLVTSIH 959 + +W ESLK + LL+ + D +N+ +SS+V+ I Sbjct: 1386 VDSWGSDKGWKNILVMAESLKEQTQGLLIYLKDALCNEKLGNGVDLVNLNNLSSMVSWIS 1445 Query: 960 GLLWGITSVLRSGYEKGTDGPE----DSFKDSKLKQFISASERFINSCLHALFVMDYQHP 1127 G LWG++S L + +D E + S++ I+ FI+ LH FV D + Sbjct: 1446 GFLWGVSSALNHTNKIDSDKVEILKLNFEPSSQIGLCINVFTDFISFILHKYFVEDDRQR 1505 Query: 1128 NYSKNLVDKDLSIKKKRSSSDNADFTIDVLIDSYERQXXXXXXXXXXXKGDNPELAVCVR 1307 S ++ SD ++ + L D+Y+ + GD+PE A+ +R Sbjct: 1506 GSS-------FDVQNVEQPSDRSNCVLSQL-DNYKCESLNNYFLQSLLDGDHPEAAILIR 1557 Query: 1308 QLLIGSSAILSLKMQIYHRDFYSSSMPVSIAISQYMLTEFANMVQEPHTFSYICLDGAVK 1487 QLLI SSA+L L +Q SS +P IS +L + A++ + P FS I LDG +K Sbjct: 1558 QLLIASSALLKLNLQTNCTTSLSSLVPSFFGISHVLLLKLADVSEVPQPFSLIWLDGVLK 1617 Query: 1488 YLEVVGSYVSPK-DSKKLQNVYARLIGIHLPAIGKCISLCGKEATLASHDTESSTKTLMS 1664 YL+ +GS+ K DS +VY RL+ +HL A+GKCI+L GKEATLASH+ ESS+K L + Sbjct: 1618 YLQELGSHFPSKVDSTSTVSVYTRLVELHLNALGKCITLQGKEATLASHEMESSSKILSN 1677 Query: 1665 QTGSSELSIGHGWNSLNEFRARLRLSFKVLLKEPSKLHRMTAIEVVKKALVGVNNRCSMV 1844 GSSE S H L+EF+ARLR+S KVL+ + +LH AI+ +++ALVGV C+M+ Sbjct: 1678 NKGSSESSFSHTSFFLDEFKARLRMSLKVLISKSIELHMFPAIQAIERALVGVQEGCTMI 1737 Query: 1845 YEIYTGGPDEGKVSAAVAAGVDCFDSCIESVS 1940 YEI TG D GKVS+ VAAG+DC D +E +S Sbjct: 1738 YEIKTGTADGGKVSSTVAAGIDCLDLVLEYIS 1769 >GAV60305.1 Urb2 domain-containing protein, partial [Cephalotus follicularis] Length = 2053 Score = 359 bits (921), Expect = e-104 Identities = 247/727 (33%), Positives = 371/727 (51%), Gaps = 81/727 (11%) Frame = +3 Query: 18 KHKTGKSHDVGKISLHEITIELLHDSLFYEQTFLCRHLPSCFSDVLKQFLVSIFGDTLAG 197 +H ++ + K+++H I+ LL DS+ YEQ F+ RH S F +LK+ + +F G Sbjct: 1050 RHHFSEASQLNKVTVHNISWALLSDSILYEQKFVRRHFASRFCHILKKLALPVFSGFSVG 1109 Query: 198 QTDINE---WEVILNKLEKTSLALNR-RHVADDAVLPMTPDSLCPDS------------P 329 D W +L LE +S+ ++ + V + L T S D P Sbjct: 1110 AVDFKSLPNWAEVLRSLEDSSMLVSTGKLVTHNGFLKETLMSCSSDDLLREICWKQQAFP 1169 Query: 330 FIKCTKEXXXXXXXXXXXSWMPKGYLNSESFSDCATNLLNLERVLVVMLLQDQGELKGDR 509 
F C W+PKGYL S+S S AT +LNLER+LV LL L + Sbjct: 1170 FTACQS-------LLSLLGWIPKGYLKSKSISLYATYILNLERLLVGTLLDCGDVLSSHK 1222 Query: 510 YYELFRLFVSCRKALRCLLMSFHDASGEAERCSIIPIFSGNSYAVLWLWKSVTALTELLS 689 +Y+L RLF+SCRK+L+C++ + + EA S+ PI S + LWL+KSV+ + L Sbjct: 1223 HYQLLRLFLSCRKSLKCIIDKSCEETTEACLSSLGPILSEVPFFALWLFKSVSLVVGLRE 1282 Query: 690 VFPEEHATEVRHLSLSLMDDTSYIQSTLSKEQFLFS-------------PSSTD------ 812 EVR + SL+D TSY+ L+K Q ++ SS+D Sbjct: 1283 AMSGYSGHEVRDMIFSLLDLTSYVFLMLTKYQSTYAVLSCMISEKPEKEQSSSDVAYKQK 1342 Query: 813 GFSKNGSSGESNPSLGTWM------ESLKTHAEVLLLKIAD-------------LNIVEV 935 +K+ S +S+ + W +SLK ++L + + D L + ++ Sbjct: 1343 NLNKSDHSADSSKDIEAWKGVLLLADSLKEQTQILFVTLKDAICDEKKGLNINALKLNKL 1402 Query: 936 SSLVTSIHGLLWGITSVLRSGYEKGTDGPEDSFKD------SKLKQFISASERFINSCLH 1097 SS+++ G LWG+ S L + TD SKL+ I FI+S LH Sbjct: 1403 SSIISCFSGFLWGLASSLN--HTDATDSHRAKMLRRKREAVSKLEHCIYLFADFISSFLH 1460 Query: 1098 ALFVMDYQHP-----------------NYSKNLVDKDLSIKKKRSSS--DNADFTIDVL- 1217 L V D + P + ++D + + SS + A VL Sbjct: 1461 MLVVEDDKQPGKLCDAHNSHKLEMKWNSLGSAMIDDNSGNNVRGCSSQLNYASCAASVLS 1520 Query: 1218 -IDSYERQXXXXXXXXXXXKGDNPELAVCVRQLLIGSSAILSLKMQIYHRDFYSSSMPVS 1394 ++S+E+ +GD+PE+A +R+L SAILSL +QI + +S +P+ Sbjct: 1521 EVNSHEQNFLNMNILQSLLRGDHPEVAFALRELFFAYSAILSLNLQIGNTSLFSL-VPLF 1579 Query: 1395 IAISQYMLTEFANMVQEPHTFSYICLDGAVKYLEVVGSYVSPKDSKKLQNVYARLIGIHL 1574 +ISQ +L+E A + P +++++ LDG +KYLE +GS+ + +N+YA LI +HL Sbjct: 1580 TSISQVLLSELAETAEIPQSYTFVWLDGVLKYLEELGSHFPLTNPTLTRNLYANLIELHL 1639 Query: 1575 PAIGKCISLCGKEATLASHDTESSTKTLMSQTGSSELSIGHGWNSLNEFRARLRLSFKVL 1754 A+GKCISL GK+ATLASH+TESSTK L G SE S+ +G L EF+ RLR+SFKVL Sbjct: 1640 RALGKCISLQGKKATLASHETESSTKMLQGHIGISEASLSNGCYWLEEFKNRLRMSFKVL 1699 Query: 1755 LKEPSKLHRMTAIEVVKKALVGVNNRCSMVYEIYTGGPDEGKVSAAVAAGVDCFDSCIES 1934 +++PS+LH ++AI +++ALVGV C+ VYEI TG D GKVS+ VAAG+DC D +E Sbjct: 1700 IRKPSELHLLSAIHAIERALVGVQEHCTTVYEIQTGCVDGGKVSSIVAAGIDCLDLVLEY 1759 Query: 1935 VSDRDSL 1955 VS L Sbjct: 1760 VSGHKRL 1766 >XP_016707646.1 PREDICTED: uncharacterized 
protein LOC107922238 [Gossypium hirsutum] XP_016707647.1 PREDICTED: uncharacterized protein LOC107922238 [Gossypium hirsutum] XP_016707648.1 PREDICTED: uncharacterized protein LOC107922238 [Gossypium hirsutum] Length = 2042 Score = 350 bits (897), Expect = e-101 Identities = 256/707 (36%), Positives = 360/707 (50%), Gaps = 53/707 (7%) Frame = +3 Query: 6 KDVIKHKTGKSHDVGKISLHEITIELLHDSLFYEQTFLCRHLPSCFSDVLKQFLVSIFGD 185 + V KHK K KISL++I+ LL DS Y+ F+ R+L S F L+ + +FGD Sbjct: 1072 QQVEKHKIEKDGQPKKISLYQISQGLLKDSTLYDHKFVRRNLSSSFCHALENLALLLFGD 1131 Query: 186 TLAGQTDINE---WEVILNKLEKTSLALN-RRHVADDAVLPMTPDSLCPDS---PFIKCT 344 + + N W +L+ L+ + ++ RR+V D+ +S S P Sbjct: 1132 SSVSDRNFNSFPVWSEVLSTLDNSPAVVSGRRYVKHDSPTRSISNSCNEQSSMNPTALPF 1191 Query: 345 KEXXXXXXXXXXXSWMPKGYLNSESFSDCATNLLNLERVLVVMLLQDQGELKGDRYYELF 524 K WMPKG+L+S+SFS AT +++L++++V LL QG L ELF Sbjct: 1192 KTVKDCKSLLNLLCWMPKGFLSSKSFSKLATCVVHLDQLVVAELLHCQGTLSSYGC-ELF 1250 Query: 525 RLFVSCRKALRCLLMSFHDASGEAERCSIIPIFSGNSYAVLWLWKSVTALTELLSVFPEE 704 +LFV+CR+ L+ ++M+ + + EA S++ + G SY + WL+KSV+A+TELL E+ Sbjct: 1251 QLFVTCRRTLKNIIMALCEENIEASLSSLLSVAEG-SYFITWLFKSVSAVTELLDTMSED 1309 Query: 705 HATEVRHLSLSLMDDTSYIQSTLSKEQFLF-------SPSSTDGFS-------------- 821 +E + SLMD TSY+ +SK QF S FS Sbjct: 1310 CISEYKIKKFSLMDHTSYVFFAISKYQFSQAVDFIGNSEQPCKHFSGFVSDQNILNEPLL 1369 Query: 822 --KNGSSGESNPSLGTWMESLKTHAEVLLLKIADL-------------NIVEVSSLVTSI 956 N E+ SL ESLK AE LL + + NI ++S LV+ Sbjct: 1370 RFNNLKDSEALKSLSIIAESLKEQAESLLSSLKEALGIAQVGIEKEAENINKMSFLVSCF 1429 Query: 957 HGLLWGITSVLRSGYEKGTDGPEDSFKDSKLKQFISASERFINSCLHAL--FVMDYQH-- 1124 G LWG+ S L EK + +KL ++ S I C + + D H Sbjct: 1430 GGFLWGLASALNQLGEKCGE------LKTKLSRWKSEPLSKIKICTNVFVDLISDVLHMF 1483 Query: 1125 -PNYSKNLVDKDLSIKKKRSSSDNADFTIDVLI--DSYERQXXXXXXXXXXXKGDNPELA 1295 N + D D SSD D+ D L+ D KGD+P+ A Sbjct: 1484 LENGQQRRSDSD------SQSSDKFDYRRDSLVFNDLVVLPCLNKHLLLGLLKGDHPDRA 1537 Query: 1296 VCVRQLLIGSSAILSLKMQIYHRDFYSSSMPVSIAISQYMLTEFANMVQEPHTFSYICLD 1475 V +RQLLI SAIL L +++ S P+ I 
+S+++L E AN VQ P F+++ LD Sbjct: 1538 VLLRQLLITYSAILRLNLRVGGPLLSSGMAPLIIDMSKFLLLELANSVQSPPPFTFVWLD 1597 Query: 1476 GAVKYLEVVGSYVSPKDSKKLQNVYARLIGIHLPAIGKCISLCGKEATLASHDTESSTKT 1655 GAVKYLE VGS+ + DS +NVY +LI +HL IGKCISL GK ATL SH+ ESS+K Sbjct: 1598 GAVKYLEEVGSHFTFTDSALNENVYGKLIELHLRGIGKCISLQGKRATLESHERESSSKI 1657 Query: 1656 LMSQTGSSELSIGHGWNSLNEFRARLRLSFKVLLKEPSKLHRMTAIEVVKKALVGVNNRC 1835 L TG SE + HG + L+EF+ARLR SF V +K PS+L M+ IE ++KALVGV Sbjct: 1658 LRDDTGLSESFLSHGSHFLDEFKARLRRSFSVFVKNPSELQLMSTIEAIEKALVGVQGAH 1717 Query: 1836 SMVYEIYTGGPDEGKVSAAVAAGVDCFDSCIESVSDRDSL---KNHI 1967 + +YEI G + G VS+ VA G+DC D +E S R L K HI Sbjct: 1718 ARIYEITAGSANGGMVSSTVAGGIDCLDLLLEHGSGRKCLSVIKRHI 1764 >XP_007041935.2 PREDICTED: uncharacterized protein LOC18607616 isoform X1 [Theobroma cacao] Length = 2065 Score = 349 bits (895), Expect = e-101 Identities = 251/715 (35%), Positives = 359/715 (50%), Gaps = 65/715 (9%) Frame = +3 Query: 18 KHKTGKSHDVGKISLHEITIELLHDSLFYEQTFLCRHLPSCFSDVLKQFLVSIFGDTLAG 197 KHK GK + KI+L++I+ LL DS YE F+ R+L S F L+ ++S+F D+ Sbjct: 1087 KHKIGKDGQLKKITLYQISQGLLKDSTLYENKFVRRNLASSFCHALENSVLSLFSDSSVR 1146 Query: 198 QTDINE---WEVILNKLEKTSLAL-NRRHVADDAVLPMTPDSL--CPDSPFIKCTKEXXX 359 + W +L+KL+ +S + +RR V D+ +S P +K Sbjct: 1147 DINFKSLPVWPEVLSKLDNSSTVVCSRRDVKHDSAARSISNSSDRLPSEISMKQKAFPIE 1206 Query: 360 XXXXXXXXS------WMPKGYLNSESFSDCATNLLNLERVLVVMLLQDQGELKGDRYYEL 521 S WMPKGYLNS+SF +LNLER++V LL QG L + YEL Sbjct: 1207 NVKFKDCQSLLNLLCWMPKGYLNSKSFCQLTAYVLNLERIVVEDLLGCQGALSSNGCYEL 1266 Query: 522 FRLFVSCRKALRCLLMSFHDASGEAERCSIIPIFSGNSYAVLWLWKSVTALTELLSVFPE 701 F+LFV+CR+ L+ ++M+ + E S++ + G+S+ V+WL+KSV+ + +L E Sbjct: 1267 FQLFVACRRTLKNIIMASCEEKIEGSLSSLLSVAEGSSF-VIWLFKSVSTVIGVLDTMME 1325 Query: 702 EHATEVRHLSLSLMDDTSYIQSTLSKEQF----------------------------LFS 797 + E LMD TSY+ +SK QF L Sbjct: 1326 DCLPEFELKIFLLMDHTSYVFFAISKYQFGQAVHFIGNSEKPCKKQPYSGVVGDESILNQ 1385 Query: 798 PSSTDGFSKNGSSGESNPSLGTWMESLKTHAEVLL----------LKIAD----LNIVEV 935 P S + K+ E+ SL E+LK AE LL K+ D +N 
++ Sbjct: 1386 PGSCSNYLKD---SEALRSLSITAENLKEQAESLLDPLKGALDDNAKVGDGNKAVNTNKM 1442 Query: 936 SSLVTSIHGLLWGITSVLRSGYEKGTDGPEDSFK-----DSKLKQFISASERFINSCLHA 1100 S ++ G LWG+ S L G EK + + SKL I+ FI+ LH Sbjct: 1443 SFAISCFGGFLWGLASALNQGDEKSGEVNAKYLRWKCEPLSKLNICINVFLDFISEVLHM 1502 Query: 1101 LFVMDYQHPNYSKNLVDKDLSIKKKRSSSDNADFTIDVLI---DSYERQXXXXXXXXXXX 1271 D Q +Y SS D++ +L+ D E Sbjct: 1503 FLDNDQQSRSY------------YDAESSQKLDYSRHLLVFETDLVELHYLNKHFLQGLL 1550 Query: 1272 KGDNPELAVCVRQLLIGSSAILSLKMQIYHRDFYSSSMPVSIAISQYMLTEFANMVQEPH 1451 KGD+P+ A+ +R LLI SAI L ++I S +P++I ISQ +L E AN + P Sbjct: 1551 KGDHPDRAILLRHLLITHSAIPRLNLRIDDTSLSSGMVPLNIGISQVLLLELANSGEIPP 1610 Query: 1452 TFSYICLDGAVKYLEVVGSYVSPKDSKKLQNVYARLIGIHLPAIGKCISLCGKEATLASH 1631 F+++ LDGAVKYLE +GS+ D N YA+LI +HL AIGKCISL GK ATL SH Sbjct: 1611 PFTFVWLDGAVKYLEELGSHFPLNDPTLNGNAYAKLIELHLRAIGKCISLQGKRATLESH 1670 Query: 1632 DTESSTKTLMSQTGSSELSIGHGWNSLNEFRARLRLSFKVLLKEPSKLHRMTAIEVVKKA 1811 + ESSTK L TG SE + HG + L+EF+ARLR+SFK +K PS+L ++A++ +++A Sbjct: 1671 ERESSTKILHGGTGWSESFLSHGSHCLDEFKARLRMSFKAFIKNPSELQLLSAMQAIERA 1730 Query: 1812 LVGVNNRCSMVYEIYTGGPDEGKVSAAVAAGVDCFDSCIESVSDRDSL---KNHI 1967 LVGV +M+Y+I TG + G VS+ VAAG+DC D +E S R L K HI Sbjct: 1731 LVGVRGGHAMIYDINTGSANGGMVSSTVAAGIDCLDLILEYGSGRRCLRVVKRHI 1785 >XP_017971647.1 PREDICTED: uncharacterized protein LOC18607616 isoform X2 [Theobroma cacao] Length = 1777 Score = 348 bits (892), Expect = e-101 Identities = 246/701 (35%), Positives = 354/701 (50%), Gaps = 62/701 (8%) Frame = +3 Query: 18 KHKTGKSHDVGKISLHEITIELLHDSLFYEQTFLCRHLPSCFSDVLKQFLVSIFGDTLAG 197 KHK GK + KI+L++I+ LL DS YE F+ R+L S F L+ ++S+F D+ Sbjct: 1087 KHKIGKDGQLKKITLYQISQGLLKDSTLYENKFVRRNLASSFCHALENSVLSLFSDSSVR 1146 Query: 198 QTDINE---WEVILNKLEKTSLAL-NRRHVADDAVLPMTPDSL--CPDSPFIKCTKEXXX 359 + W +L+KL+ +S + +RR V D+ +S P +K Sbjct: 1147 DINFKSLPVWPEVLSKLDNSSTVVCSRRDVKHDSAARSISNSSDRLPSEISMKQKAFPIE 1206 Query: 360 XXXXXXXXS------WMPKGYLNSESFSDCATNLLNLERVLVVMLLQDQGELKGDRYYEL 521 S WMPKGYLNS+SF +LNLER++V LL 
QG L + YEL Sbjct: 1207 NVKFKDCQSLLNLLCWMPKGYLNSKSFCQLTAYVLNLERIVVEDLLGCQGALSSNGCYEL 1266 Query: 522 FRLFVSCRKALRCLLMSFHDASGEAERCSIIPIFSGNSYAVLWLWKSVTALTELLSVFPE 701 F+LFV+CR+ L+ ++M+ + E S++ + G+S+ V+WL+KSV+ + +L E Sbjct: 1267 FQLFVACRRTLKNIIMASCEEKIEGSLSSLLSVAEGSSF-VIWLFKSVSTVIGVLDTMME 1325 Query: 702 EHATEVRHLSLSLMDDTSYIQSTLSKEQF----------------------------LFS 797 + E LMD TSY+ +SK QF L Sbjct: 1326 DCLPEFELKIFLLMDHTSYVFFAISKYQFGQAVHFIGNSEKPCKKQPYSGVVGDESILNQ 1385 Query: 798 PSSTDGFSKNGSSGESNPSLGTWMESLKTHAEVLL----------LKIAD----LNIVEV 935 P S + K+ E+ SL E+LK AE LL K+ D +N ++ Sbjct: 1386 PGSCSNYLKD---SEALRSLSITAENLKEQAESLLDPLKGALDDNAKVGDGNKAVNTNKM 1442 Query: 936 SSLVTSIHGLLWGITSVLRSGYEKGTDGPEDSFK-----DSKLKQFISASERFINSCLHA 1100 S ++ G LWG+ S L G EK + + SKL I+ FI+ LH Sbjct: 1443 SFAISCFGGFLWGLASALNQGDEKSGEVNAKYLRWKCEPLSKLNICINVFLDFISEVLHM 1502 Query: 1101 LFVMDYQHPNYSKNLVDKDLSIKKKRSSSDNADFTIDVLI---DSYERQXXXXXXXXXXX 1271 D Q +Y SS D++ +L+ D E Sbjct: 1503 FLDNDQQSRSY------------YDAESSQKLDYSRHLLVFETDLVELHYLNKHFLQGLL 1550 Query: 1272 KGDNPELAVCVRQLLIGSSAILSLKMQIYHRDFYSSSMPVSIAISQYMLTEFANMVQEPH 1451 KGD+P+ A+ +R LLI SAI L ++I S +P++I ISQ +L E AN + P Sbjct: 1551 KGDHPDRAILLRHLLITHSAIPRLNLRIDDTSLSSGMVPLNIGISQVLLLELANSGEIPP 1610 Query: 1452 TFSYICLDGAVKYLEVVGSYVSPKDSKKLQNVYARLIGIHLPAIGKCISLCGKEATLASH 1631 F+++ LDGAVKYLE +GS+ D N YA+LI +HL AIGKCISL GK ATL SH Sbjct: 1611 PFTFVWLDGAVKYLEELGSHFPLNDPTLNGNAYAKLIELHLRAIGKCISLQGKRATLESH 1670 Query: 1632 DTESSTKTLMSQTGSSELSIGHGWNSLNEFRARLRLSFKVLLKEPSKLHRMTAIEVVKKA 1811 + ESSTK L TG SE + HG + L+EF+ARLR+SFK +K PS+L ++A++ +++A Sbjct: 1671 ERESSTKILHGGTGWSESFLSHGSHCLDEFKARLRMSFKAFIKNPSELQLLSAMQAIERA 1730 Query: 1812 LVGVNNRCSMVYEIYTGGPDEGKVSAAVAAGVDCFDSCIES 1934 LVGV +M+Y+I TG + G VS+ VAAG+DC D +ES Sbjct: 1731 LVGVRGGHAMIYDINTGSANGGMVSSTVAAGIDCLDLILES 1771 >KJB15703.1 hypothetical protein B456_002G192700 [Gossypium raimondii] Length = 1934 Score = 346 bits (888), Expect = e-100 Identities = 255/707 (36%), Positives = 361/707 (51%), Gaps 
= 53/707 (7%) Frame = +3 Query: 6 KDVIKHKTGKSHDVGKISLHEITIELLHDSLFYEQTFLCRHLPSCFSDVLKQFLVSIFGD 185 + V KHK K KISL++I+ LL DS Y+ F+ R+L S F L+ + +FGD Sbjct: 964 QQVEKHKIEKDGQPKKISLYQISQGLLKDSTLYDHKFVRRNLSSRFCHALENLALLLFGD 1023 Query: 186 TLAGQTDINE---WEVILNKLEKTSLALN-RRHVADDAVLPMTPDSLCPDS---PFIKCT 344 + + N W +L+ L+ + ++ RR+V D+ +S S P Sbjct: 1024 SSVSDRNFNSFPVWSEVLSTLDNSPAVVSGRRYVKHDSPTRSISNSCNEQSSMNPTALPF 1083 Query: 345 KEXXXXXXXXXXXSWMPKGYLNSESFSDCATNLLNLERVLVVMLLQDQGELKGDRYYELF 524 K WMPKG+L+S+SFS AT +++L++++V LL QG L ELF Sbjct: 1084 KTVKDCKSLLNLLCWMPKGFLSSKSFSKLATCVVHLDQLVVAELLHCQGTLSSYGC-ELF 1142 Query: 525 RLFVSCRKALRCLLMSFHDASGEAERCSIIPIFSGNSYAVLWLWKSVTALTELLSVFPEE 704 +LFV+CR+ L+ ++M+ + + EA S++ + G SY + WL+KSV+A+TELL E+ Sbjct: 1143 QLFVTCRRTLKNIIMALCEENIEASLSSLLSVAEG-SYFITWLFKSVSAVTELLDTMSED 1201 Query: 705 HATEVRHLSLSLMDDTSYIQSTLSKEQFL----FSPSSTD------GFSKNGS------- 833 +E ++ SLMD TSY+ +SK QF F +S GF + S Sbjct: 1202 CISEYKNKKFSLMDHTSYVFFAISKYQFSQAVDFIGNSEQPCKHFSGFVSDQSILNEPLL 1261 Query: 834 ------SGESNPSLGTWMESLKTHAEVLLLKIADL-------------NIVEVSSLVTSI 956 E+ SL ESLK AE LL + + NI ++S LV+ Sbjct: 1262 RFNNLKDSEALKSLSIIAESLKEQAESLLSSLKEALGIAQVGIEKEAENINKMSFLVSCF 1321 Query: 957 HGLLWGITSVLRSGYEKGTDGPEDSFKDSKLKQFISASERFINSCLHAL--FVMDYQH-- 1124 G LWG+ S L EK + +KL ++ S I C + + D H Sbjct: 1322 GGFLWGLASALNQLGEKCGE------LKTKLSRWKSEPLSKIKLCTNVFVDLISDVLHMF 1375 Query: 1125 -PNYSKNLVDKDLSIKKKRSSSDNADFTIDVLI--DSYERQXXXXXXXXXXXKGDNPELA 1295 N + D D SSD D+ D L+ D KGD+P+ A Sbjct: 1376 LENGQQQRSDSD------SQSSDKFDYRRDSLVFNDLVVLPCLNKHLLLGLLKGDHPDRA 1429 Query: 1296 VCVRQLLIGSSAILSLKMQIYHRDFYSSSMPVSIAISQYMLTEFANMVQEPHTFSYICLD 1475 V +RQLLI SAIL L +++ S P+ I +SQ++L E N VQ P F+++ LD Sbjct: 1430 VLLRQLLITYSAILRLNLRVGGPLLSSGMAPLIIDMSQFLLLELVNSVQSPPPFTFVWLD 1489 Query: 1476 GAVKYLEVVGSYVSPKDSKKLQNVYARLIGIHLPAIGKCISLCGKEATLASHDTESSTKT 1655 GAVKYLE VGS+ DS +NVY +LI +HL IGKCISL GK ATL SH+ ESS+K Sbjct: 1490 GAVKYLEEVGSHFPFTDSALNENVYGKLIELHLRGIGKCISLQGKRATLESHERESSSKI 1549 Query: 1656 
LMSQTGSSELSIGHGWNSLNEFRARLRLSFKVLLKEPSKLHRMTAIEVVKKALVGVNNRC 1835 L TG SE + HG + ++EF+ARLR SF V +K S+L M+ IE ++KALVGV Sbjct: 1550 LHDDTGLSESFLSHGSHCVDEFKARLRRSFSVFIKNSSELQLMSTIEAIEKALVGVQGAH 1609 Query: 1836 SMVYEIYTGGPDEGKVSAAVAAGVDCFDSCIESVSDRDSL---KNHI 1967 + +YEI G + G VS+ VA G+DC D +E S R L K HI Sbjct: 1610 ARIYEITAGSANGGMVSSTVAGGIDCLDLLLEHGSGRKCLSVIKRHI 1656 >XP_012467517.1 PREDICTED: uncharacterized protein LOC105785869 [Gossypium raimondii] XP_012467518.1 PREDICTED: uncharacterized protein LOC105785869 [Gossypium raimondii] KJB15702.1 hypothetical protein B456_002G192700 [Gossypium raimondii] Length = 2042 Score = 346 bits (888), Expect = e-100 Identities = 255/707 (36%), Positives = 361/707 (51%), Gaps = 53/707 (7%) Frame = +3 Query: 6 KDVIKHKTGKSHDVGKISLHEITIELLHDSLFYEQTFLCRHLPSCFSDVLKQFLVSIFGD 185 + V KHK K KISL++I+ LL DS Y+ F+ R+L S F L+ + +FGD Sbjct: 1072 QQVEKHKIEKDGQPKKISLYQISQGLLKDSTLYDHKFVRRNLSSRFCHALENLALLLFGD 1131 Query: 186 TLAGQTDINE---WEVILNKLEKTSLALN-RRHVADDAVLPMTPDSLCPDS---PFIKCT 344 + + N W +L+ L+ + ++ RR+V D+ +S S P Sbjct: 1132 SSVSDRNFNSFPVWSEVLSTLDNSPAVVSGRRYVKHDSPTRSISNSCNEQSSMNPTALPF 1191 Query: 345 KEXXXXXXXXXXXSWMPKGYLNSESFSDCATNLLNLERVLVVMLLQDQGELKGDRYYELF 524 K WMPKG+L+S+SFS AT +++L++++V LL QG L ELF Sbjct: 1192 KTVKDCKSLLNLLCWMPKGFLSSKSFSKLATCVVHLDQLVVAELLHCQGTLSSYGC-ELF 1250 Query: 525 RLFVSCRKALRCLLMSFHDASGEAERCSIIPIFSGNSYAVLWLWKSVTALTELLSVFPEE 704 +LFV+CR+ L+ ++M+ + + EA S++ + G SY + WL+KSV+A+TELL E+ Sbjct: 1251 QLFVTCRRTLKNIIMALCEENIEASLSSLLSVAEG-SYFITWLFKSVSAVTELLDTMSED 1309 Query: 705 HATEVRHLSLSLMDDTSYIQSTLSKEQFL----FSPSSTD------GFSKNGS------- 833 +E ++ SLMD TSY+ +SK QF F +S GF + S Sbjct: 1310 CISEYKNKKFSLMDHTSYVFFAISKYQFSQAVDFIGNSEQPCKHFSGFVSDQSILNEPLL 1369 Query: 834 ------SGESNPSLGTWMESLKTHAEVLLLKIADL-------------NIVEVSSLVTSI 956 E+ SL ESLK AE LL + + NI ++S LV+ Sbjct: 1370 RFNNLKDSEALKSLSIIAESLKEQAESLLSSLKEALGIAQVGIEKEAENINKMSFLVSCF 1429 Query: 957 HGLLWGITSVLRSGYEKGTDGPEDSFKDSKLKQFISASERFINSCLHAL--FVMDYQH-- 1124 G LWG+ S 
L EK + +KL ++ S I C + + D H Sbjct: 1430 GGFLWGLASALNQLGEKCGE------LKTKLSRWKSEPLSKIKLCTNVFVDLISDVLHMF 1483 Query: 1125 -PNYSKNLVDKDLSIKKKRSSSDNADFTIDVLI--DSYERQXXXXXXXXXXXKGDNPELA 1295 N + D D SSD D+ D L+ D KGD+P+ A Sbjct: 1484 LENGQQQRSDSD------SQSSDKFDYRRDSLVFNDLVVLPCLNKHLLLGLLKGDHPDRA 1537 Query: 1296 VCVRQLLIGSSAILSLKMQIYHRDFYSSSMPVSIAISQYMLTEFANMVQEPHTFSYICLD 1475 V +RQLLI SAIL L +++ S P+ I +SQ++L E N VQ P F+++ LD Sbjct: 1538 VLLRQLLITYSAILRLNLRVGGPLLSSGMAPLIIDMSQFLLLELVNSVQSPPPFTFVWLD 1597 Query: 1476 GAVKYLEVVGSYVSPKDSKKLQNVYARLIGIHLPAIGKCISLCGKEATLASHDTESSTKT 1655 GAVKYLE VGS+ DS +NVY +LI +HL IGKCISL GK ATL SH+ ESS+K Sbjct: 1598 GAVKYLEEVGSHFPFTDSALNENVYGKLIELHLRGIGKCISLQGKRATLESHERESSSKI 1657 Query: 1656 LMSQTGSSELSIGHGWNSLNEFRARLRLSFKVLLKEPSKLHRMTAIEVVKKALVGVNNRC 1835 L TG SE + HG + ++EF+ARLR SF V +K S+L M+ IE ++KALVGV Sbjct: 1658 LHDDTGLSESFLSHGSHCVDEFKARLRRSFSVFIKNSSELQLMSTIEAIEKALVGVQGAH 1717 Query: 1836 SMVYEIYTGGPDEGKVSAAVAAGVDCFDSCIESVSDRDSL---KNHI 1967 + +YEI G + G VS+ VA G+DC D +E S R L K HI Sbjct: 1718 ARIYEITAGSANGGMVSSTVAGGIDCLDLLLEHGSGRKCLSVIKRHI 1764 >EOX97769.1 Urb2/Npa2, putative isoform 5 [Theobroma cacao] Length = 1387 Score = 342 bits (877), Expect = 1e-99 Identities = 244/701 (34%), Positives = 352/701 (50%), Gaps = 62/701 (8%) Frame = +3 Query: 18 KHKTGKSHDVGKISLHEITIELLHDSLFYEQTFLCRHLPSCFSDVLKQFLVSIFGDTLAG 197 KHK GK + KI+L++I+ LL DS YE F+ R+L S F L+ ++S+F D+ Sbjct: 697 KHKIGKDGQLKKITLYQISQGLLKDSTLYENKFVRRNLASSFCHALENSVLSLFSDSSVR 756 Query: 198 QTDINE---WEVILNKLEKTSLAL-NRRHVADDAVLPMTPDSL--CPDSPFIKCTKEXXX 359 + W +L+KL+ +S + +RR V D+ +S P +K Sbjct: 757 DINFKSLPVWPEVLSKLDNSSTVVCSRRDVKHDSAARSISNSSDRLPSEISMKQKAFPIE 816 Query: 360 XXXXXXXXS------WMPKGYLNSESFSDCATNLLNLERVLVVMLLQDQGELKGDRYYEL 521 S WMPKGYLNS+SF +LNLER++V LL QG L + YEL Sbjct: 817 NVKFKDCQSLLNLLCWMPKGYLNSKSFCQLTAYVLNLERIVVEDLLGCQGALSSNGCYEL 876 Query: 522 FRLFVSCRKALRCLLMSFHDASGEAERCSIIPIFSGNSYAVLWLWKSVTALTELLSVFPE 701 F+LFV+CR+ L+ ++M+ + E S++ + G+S+ V+WL+KSV+ + +L E Sbjct: 
877 FQLFVACRRTLKNIIMASCEEKIEGSLSSLLSVAEGSSF-VIWLFKSVSTVIGVLDTMME 935 Query: 702 EHATEVRHLSLSLMDDTSYIQSTLSKEQF----------------------------LFS 797 + E LMD TSY+ +SK QF L Sbjct: 936 DCLPEFELKIFLLMDHTSYVFFAISKYQFGQAVHFIGNSEKPCKKQPYSGVVGDESILNQ 995 Query: 798 PSSTDGFSKNGSSGESNPSLGTWMESLKTHAEVLL----------LKIAD----LNIVEV 935 P S + K+ E+ SL E+LK AE LL K+ D +N ++ Sbjct: 996 PGSCSNYLKD---SEALRSLSITAENLKEQAESLLDPLKGALDDNAKVGDGNKAVNTNKM 1052 Query: 936 SSLVTSIHGLLWGITSVLRSGYEKGTDGPEDSFK-----DSKLKQFISASERFINSCLHA 1100 S ++ G LWG+ S L G EK + + SKL I+ FI+ H Sbjct: 1053 SFAISCFGGFLWGLASALNQGDEKSGEVNAKYLRWKCEPLSKLNICINVFLDFISEVFHM 1112 Query: 1101 LFVMDYQHPNYSKNLVDKDLSIKKKRSSSDNADFTIDVLI---DSYERQXXXXXXXXXXX 1271 D Q +Y SS D++ +L+ D E Sbjct: 1113 FLDNDQQSRSY------------YDAESSQKLDYSRHLLVFETDLVELHYLNKHFLQGLL 1160 Query: 1272 KGDNPELAVCVRQLLIGSSAILSLKMQIYHRDFYSSSMPVSIAISQYMLTEFANMVQEPH 1451 KGD+P+ A+ +R LLI SAI L ++I S +P++I ISQ +L E AN + P Sbjct: 1161 KGDHPDRAILLRHLLITHSAIPRLNLRIDDTSLSSGMVPLNIGISQVLLLELANSGEIPP 1220 Query: 1452 TFSYICLDGAVKYLEVVGSYVSPKDSKKLQNVYARLIGIHLPAIGKCISLCGKEATLASH 1631 F+++ LDGAVKYLE +GS+ D N YA+LI + L AIGKCISL GK ATL SH Sbjct: 1221 PFTFVWLDGAVKYLEELGSHFPLNDPTLNGNAYAKLIELLLRAIGKCISLQGKRATLESH 1280 Query: 1632 DTESSTKTLMSQTGSSELSIGHGWNSLNEFRARLRLSFKVLLKEPSKLHRMTAIEVVKKA 1811 + ESSTK L TG SE + HG + L+EF+ARLR+SFK +K PS+L ++A++ +++A Sbjct: 1281 ERESSTKILHGGTGWSESFLSHGSHCLDEFKARLRMSFKAFIKNPSELQLLSAMQAIERA 1340 Query: 1812 LVGVNNRCSMVYEIYTGGPDEGKVSAAVAAGVDCFDSCIES 1934 LVGV +M+Y+I TG + G VS+ VAAG+DC D +ES Sbjct: 1341 LVGVRGGHAMIYDINTGSANGGMVSSTVAAGIDCLDLILES 1381 >XP_017622784.1 PREDICTED: uncharacterized protein LOC108466921 [Gossypium arboreum] Length = 2044 Score = 345 bits (884), Expect = 1e-99 Identities = 254/708 (35%), Positives = 363/708 (51%), Gaps = 54/708 (7%) Frame = +3 Query: 6 KDVIKHKTGKSHDVGKISLHEITIELLHDSLFYEQTFLCRHLPSCFSDVLKQFLVSIFGD 185 + V KHK K + KISL++I+ LL DS Y+ F+ R+L S F L+ + +FG+ Sbjct: 1072 
QQVEKHKIEKDGQLKKISLYQISQGLLKDSTLYDHKFVRRNLSSRFCHALENLALLLFGN 1131 Query: 186 TLAGQTDINE---WEVILNKLEKTSLALN-RRHVADDAVLPMTPDSLCPDSPFIKCT--- 344 + + N W +L+ L+ + ++ RR+V D+ +S C + + T Sbjct: 1132 SSVSDRNFNSFPVWSEVLSTLDNSPAVVSGRRYVKHDSATRSISNS-CNEQSSMNPTSLP 1190 Query: 345 -KEXXXXXXXXXXXSWMPKGYLNSESFSDCATNLLNLERVLVVMLLQDQGELKGDRYYEL 521 K WMPKG+L+S+SFS AT +L+L++++V LL Q L EL Sbjct: 1191 FKTVKDCKSLLNLLCWMPKGFLSSKSFSKLATCVLHLDQLVVAELLLCQRALSSYGC-EL 1249 Query: 522 FRLFVSCRKALRCLLMSFHDASGEAERCSIIPIFSGNSYAVLWLWKSVTALTELLSVFPE 701 F+LFV+CR+ L+ ++M+ + + EA S++ + G SY + WL+KSV+A+TELL E Sbjct: 1250 FQLFVTCRRTLKNIIMALCEENIEASLSSLLSVAEG-SYFITWLFKSVSAVTELLDTMSE 1308 Query: 702 EHATEVRHLSLSLMDDTSYIQSTLSKEQFL----FSPSSTD------GFSKNGS------ 833 + ++ + SLMD TSY+ +SK QF F +S GF + S Sbjct: 1309 DCISDYKTKKFSLMDHTSYVFFAISKYQFSQAVDFIGNSEQPCKHFSGFVSDQSILNEPP 1368 Query: 834 -------SGESNPSLGTWMESLKTHAEVLLLKIADL-------------NIVEVSSLVTS 953 E+ SL T ESLK AE L + + NI ++S LV+ Sbjct: 1369 LCFNYLKDSEALKSLSTIAESLKEQAESFLSSLKEALGIAQVGIEEEAENINKMSFLVSC 1428 Query: 954 IHGLLWGITSVLRSGYEKGTDGPEDSFKD-----SKLKQFISASERFINSCLHALFVMDY 1118 G LWG+ S L EK + + SK+K + I++ LH Sbjct: 1429 FGGFLWGLASALNQLGEKCGELKTKLLRWKSEPLSKIKLCTNVFVDLISNVLHMFLEKGQ 1488 Query: 1119 QHPNYSKNLVDKDLSIKKKRSSSDNADFTIDVLI--DSYERQXXXXXXXXXXXKGDNPEL 1292 Q + D D SSD D+ D L+ D KGD+P+ Sbjct: 1489 QRRS------DPD------SQSSDKFDYRRDSLVFNDLVVLPCLNKHLLLGLLKGDHPDR 1536 Query: 1293 AVCVRQLLIGSSAILSLKMQIYHRDFYSSSMPVSIAISQYMLTEFANMVQEPHTFSYICL 1472 AV +RQLLI SAIL L +++ S + I +SQ++L E AN V+ P F+++ L Sbjct: 1537 AVLLRQLLITYSAILRLNLRVGGPLLSSGMASLIIDMSQFLLLELANSVESPPPFTFVWL 1596 Query: 1473 DGAVKYLEVVGSYVSPKDSKKLQNVYARLIGIHLPAIGKCISLCGKEATLASHDTESSTK 1652 DGAVKYLE VGS+ DS +NVY +LI +HL IGKCISL GK ATL SH+ ESS+K Sbjct: 1597 DGAVKYLEEVGSHFPFTDSALNENVYGKLIELHLRGIGKCISLQGKSATLESHERESSSK 1656 Query: 1653 TLMSQTGSSELSIGHGWNSLNEFRARLRLSFKVLLKEPSKLHRMTAIEVVKKALVGVNNR 1832 L TG SE + HG + L+EF+ARLR+SF V +K PS+L M+AIE ++KALVGV Sbjct: 1657 
ILHDDTGLSESFLSHGSHCLDEFKARLRMSFSVFIKNPSELQLMSAIEAIEKALVGVQGA 1716 Query: 1833 CSMVYEIYTGGPDEGKVSAAVAAGVDCFDSCIESVSDRDSL---KNHI 1967 +YEI G + G VS+ VA G+DC D +E S R L K HI Sbjct: 1717 HGRIYEITAGSANGGMVSSTVAGGIDCLDLLLEHGSGRKCLSVIKRHI 1764 >EOX97768.1 Urb2/Npa2, putative isoform 4 [Theobroma cacao] Length = 1533 Score = 343 bits (880), Expect = 1e-99 Identities = 249/715 (34%), Positives = 357/715 (49%), Gaps = 65/715 (9%) Frame = +3 Query: 18 KHKTGKSHDVGKISLHEITIELLHDSLFYEQTFLCRHLPSCFSDVLKQFLVSIFGDTLAG 197 KHK GK + KI+L++I+ LL DS YE F+ R+L S F L+ ++S+F D+ Sbjct: 697 KHKIGKDGQLKKITLYQISQGLLKDSTLYENKFVRRNLASSFCHALENSVLSLFSDSSVR 756 Query: 198 QTDINE---WEVILNKLEKTSLAL-NRRHVADDAVLPMTPDSL--CPDSPFIKCTKEXXX 359 + W +L+KL+ +S + +RR V D+ +S P +K Sbjct: 757 DINFKSLPVWPEVLSKLDNSSTVVCSRRDVKHDSAARSISNSSDRLPSEISMKQKAFPIE 816 Query: 360 XXXXXXXXS------WMPKGYLNSESFSDCATNLLNLERVLVVMLLQDQGELKGDRYYEL 521 S WMPKGYLNS+SF +LNLER++V LL QG L + YEL Sbjct: 817 NVKFKDCQSLLNLLCWMPKGYLNSKSFCQLTAYVLNLERIVVEDLLGCQGALSSNGCYEL 876 Query: 522 FRLFVSCRKALRCLLMSFHDASGEAERCSIIPIFSGNSYAVLWLWKSVTALTELLSVFPE 701 F+LFV+CR+ L+ ++M+ + E S++ + G+S+ V+WL+KSV+ + +L E Sbjct: 877 FQLFVACRRTLKNIIMASCEEKIEGSLSSLLSVAEGSSF-VIWLFKSVSTVIGVLDTMME 935 Query: 702 EHATEVRHLSLSLMDDTSYIQSTLSKEQF----------------------------LFS 797 + E LMD TSY+ +SK QF L Sbjct: 936 DCLPEFELKIFLLMDHTSYVFFAISKYQFGQAVHFIGNSEKPCKKQPYSGVVGDESILNQ 995 Query: 798 PSSTDGFSKNGSSGESNPSLGTWMESLKTHAEVLL----------LKIAD----LNIVEV 935 P S + K+ E+ SL E+LK AE LL K+ D +N ++ Sbjct: 996 PGSCSNYLKD---SEALRSLSITAENLKEQAESLLDPLKGALDDNAKVGDGNKAVNTNKM 1052 Query: 936 SSLVTSIHGLLWGITSVLRSGYEKGTDGPEDSFK-----DSKLKQFISASERFINSCLHA 1100 S ++ G LWG+ S L G EK + + SKL I+ FI+ H Sbjct: 1053 SFAISCFGGFLWGLASALNQGDEKSGEVNAKYLRWKCEPLSKLNICINVFLDFISEVFHM 1112 Query: 1101 LFVMDYQHPNYSKNLVDKDLSIKKKRSSSDNADFTIDVLI---DSYERQXXXXXXXXXXX 1271 D Q +Y SS D++ +L+ D E Sbjct: 1113 FLDNDQQSRSY------------YDAESSQKLDYSRHLLVFETDLVELHYLNKHFLQGLL 1160 Query: 1272 
KGDNPELAVCVRQLLIGSSAILSLKMQIYHRDFYSSSMPVSIAISQYMLTEFANMVQEPH 1451 KGD+P+ A+ +R LLI SAI L ++I S +P++I ISQ +L E AN + P Sbjct: 1161 KGDHPDRAILLRHLLITHSAIPRLNLRIDDTSLSSGMVPLNIGISQVLLLELANSGEIPP 1220 Query: 1452 TFSYICLDGAVKYLEVVGSYVSPKDSKKLQNVYARLIGIHLPAIGKCISLCGKEATLASH 1631 F+++ LDGAVKYLE +GS+ D N YA+LI + L AIGKCISL GK ATL SH Sbjct: 1221 PFTFVWLDGAVKYLEELGSHFPLNDPTLNGNAYAKLIELLLRAIGKCISLQGKRATLESH 1280 Query: 1632 DTESSTKTLMSQTGSSELSIGHGWNSLNEFRARLRLSFKVLLKEPSKLHRMTAIEVVKKA 1811 + ESSTK L TG SE + HG + L+EF+ARLR+SFK +K PS+L ++A++ +++A Sbjct: 1281 ERESSTKILHGGTGWSESFLSHGSHCLDEFKARLRMSFKAFIKNPSELQLLSAMQAIERA 1340 Query: 1812 LVGVNNRCSMVYEIYTGGPDEGKVSAAVAAGVDCFDSCIESVSDRDSL---KNHI 1967 LVGV +M+Y+I TG + G VS+ VAAG+DC D +E S R L K HI Sbjct: 1341 LVGVRGGHAMIYDINTGSANGGMVSSTVAAGIDCLDLILEYGSGRRCLRVVKRHI 1395 >EOX97766.1 Urb2/Npa2, putative isoform 2 [Theobroma cacao] Length = 2065 Score = 343 bits (880), Expect = 5e-99 Identities = 249/715 (34%), Positives = 357/715 (49%), Gaps = 65/715 (9%) Frame = +3 Query: 18 KHKTGKSHDVGKISLHEITIELLHDSLFYEQTFLCRHLPSCFSDVLKQFLVSIFGDTLAG 197 KHK GK + KI+L++I+ LL DS YE F+ R+L S F L+ ++S+F D+ Sbjct: 1087 KHKIGKDGQLKKITLYQISQGLLKDSTLYENKFVRRNLASSFCHALENSVLSLFSDSSVR 1146 Query: 198 QTDINE---WEVILNKLEKTSLAL-NRRHVADDAVLPMTPDSL--CPDSPFIKCTKEXXX 359 + W +L+KL+ +S + +RR V D+ +S P +K Sbjct: 1147 DINFKSLPVWPEVLSKLDNSSTVVCSRRDVKHDSAARSISNSSDRLPSEISMKQKAFPIE 1206 Query: 360 XXXXXXXXS------WMPKGYLNSESFSDCATNLLNLERVLVVMLLQDQGELKGDRYYEL 521 S WMPKGYLNS+SF +LNLER++V LL QG L + YEL Sbjct: 1207 NVKFKDCQSLLNLLCWMPKGYLNSKSFCQLTAYVLNLERIVVEDLLGCQGALSSNGCYEL 1266 Query: 522 FRLFVSCRKALRCLLMSFHDASGEAERCSIIPIFSGNSYAVLWLWKSVTALTELLSVFPE 701 F+LFV+CR+ L+ ++M+ + E S++ + G+S+ V+WL+KSV+ + +L E Sbjct: 1267 FQLFVACRRTLKNIIMASCEEKIEGSLSSLLSVAEGSSF-VIWLFKSVSTVIGVLDTMME 1325 Query: 702 EHATEVRHLSLSLMDDTSYIQSTLSKEQF----------------------------LFS 797 + E LMD TSY+ +SK QF L Sbjct: 1326 DCLPEFELKIFLLMDHTSYVFFAISKYQFGQAVHFIGNSEKPCKKQPYSGVVGDESILNQ 1385 Query: 798 
PSSTDGFSKNGSSGESNPSLGTWMESLKTHAEVLL----------LKIAD----LNIVEV 935 P S + K+ E+ SL E+LK AE LL K+ D +N ++ Sbjct: 1386 PGSCSNYLKD---SEALRSLSITAENLKEQAESLLDPLKGALDDNAKVGDGNKAVNTNKM 1442 Query: 936 SSLVTSIHGLLWGITSVLRSGYEKGTDGPEDSFK-----DSKLKQFISASERFINSCLHA 1100 S ++ G LWG+ S L G EK + + SKL I+ FI+ H Sbjct: 1443 SFAISCFGGFLWGLASALNQGDEKSGEVNAKYLRWKCEPLSKLNICINVFLDFISEVFHM 1502 Query: 1101 LFVMDYQHPNYSKNLVDKDLSIKKKRSSSDNADFTIDVLI---DSYERQXXXXXXXXXXX 1271 D Q +Y SS D++ +L+ D E Sbjct: 1503 FLDNDQQSRSY------------YDAESSQKLDYSRHLLVFETDLVELHYLNKHFLQGLL 1550 Query: 1272 KGDNPELAVCVRQLLIGSSAILSLKMQIYHRDFYSSSMPVSIAISQYMLTEFANMVQEPH 1451 KGD+P+ A+ +R LLI SAI L ++I S +P++I ISQ +L E AN + P Sbjct: 1551 KGDHPDRAILLRHLLITHSAIPRLNLRIDDTSLSSGMVPLNIGISQVLLLELANSGEIPP 1610 Query: 1452 TFSYICLDGAVKYLEVVGSYVSPKDSKKLQNVYARLIGIHLPAIGKCISLCGKEATLASH 1631 F+++ LDGAVKYLE +GS+ D N YA+LI + L AIGKCISL GK ATL SH Sbjct: 1611 PFTFVWLDGAVKYLEELGSHFPLNDPTLNGNAYAKLIELLLRAIGKCISLQGKRATLESH 1670 Query: 1632 DTESSTKTLMSQTGSSELSIGHGWNSLNEFRARLRLSFKVLLKEPSKLHRMTAIEVVKKA 1811 + ESSTK L TG SE + HG + L+EF+ARLR+SFK +K PS+L ++A++ +++A Sbjct: 1671 ERESSTKILHGGTGWSESFLSHGSHCLDEFKARLRMSFKAFIKNPSELQLLSAMQAIERA 1730 Query: 1812 LVGVNNRCSMVYEIYTGGPDEGKVSAAVAAGVDCFDSCIESVSDRDSL---KNHI 1967 LVGV +M+Y+I TG + G VS+ VAAG+DC D +E S R L K HI Sbjct: 1731 LVGVRGGHAMIYDINTGSANGGMVSSTVAAGIDCLDLILEYGSGRRCLRVVKRHI 1785 >EOX97765.1 Urb2/Npa2, putative isoform 1 [Theobroma cacao] Length = 2090 Score = 343 bits (880), Expect = 5e-99 Identities = 249/715 (34%), Positives = 357/715 (49%), Gaps = 65/715 (9%) Frame = +3 Query: 18 KHKTGKSHDVGKISLHEITIELLHDSLFYEQTFLCRHLPSCFSDVLKQFLVSIFGDTLAG 197 KHK GK + KI+L++I+ LL DS YE F+ R+L S F L+ ++S+F D+ Sbjct: 1111 KHKIGKDGQLKKITLYQISQGLLKDSTLYENKFVRRNLASSFCHALENSVLSLFSDSSVR 1170 Query: 198 QTDINE---WEVILNKLEKTSLAL-NRRHVADDAVLPMTPDSL--CPDSPFIKCTKEXXX 359 + W +L+KL+ +S + +RR V D+ +S P +K Sbjct: 1171 DINFKSLPVWPEVLSKLDNSSTVVCSRRDVKHDSAARSISNSSDRLPSEISMKQKAFPIE 1230 Query: 360 
XXXXXXXXS------WMPKGYLNSESFSDCATNLLNLERVLVVMLLQDQGELKGDRYYEL 521 S WMPKGYLNS+SF +LNLER++V LL QG L + YEL Sbjct: 1231 NVKFKDCQSLLNLLCWMPKGYLNSKSFCQLTAYVLNLERIVVEDLLGCQGALSSNGCYEL 1290 Query: 522 FRLFVSCRKALRCLLMSFHDASGEAERCSIIPIFSGNSYAVLWLWKSVTALTELLSVFPE 701 F+LFV+CR+ L+ ++M+ + E S++ + G+S+ V+WL+KSV+ + +L E Sbjct: 1291 FQLFVACRRTLKNIIMASCEEKIEGSLSSLLSVAEGSSF-VIWLFKSVSTVIGVLDTMME 1349 Query: 702 EHATEVRHLSLSLMDDTSYIQSTLSKEQF----------------------------LFS 797 + E LMD TSY+ +SK QF L Sbjct: 1350 DCLPEFELKIFLLMDHTSYVFFAISKYQFGQAVHFIGNSEKPCKKQPYSGVVGDESILNQ 1409 Query: 798 PSSTDGFSKNGSSGESNPSLGTWMESLKTHAEVLL----------LKIAD----LNIVEV 935 P S + K+ E+ SL E+LK AE LL K+ D +N ++ Sbjct: 1410 PGSCSNYLKD---SEALRSLSITAENLKEQAESLLDPLKGALDDNAKVGDGNKAVNTNKM 1466 Query: 936 SSLVTSIHGLLWGITSVLRSGYEKGTDGPEDSFK-----DSKLKQFISASERFINSCLHA 1100 S ++ G LWG+ S L G EK + + SKL I+ FI+ H Sbjct: 1467 SFAISCFGGFLWGLASALNQGDEKSGEVNAKYLRWKCEPLSKLNICINVFLDFISEVFHM 1526 Query: 1101 LFVMDYQHPNYSKNLVDKDLSIKKKRSSSDNADFTIDVLI---DSYERQXXXXXXXXXXX 1271 D Q +Y SS D++ +L+ D E Sbjct: 1527 FLDNDQQSRSY------------YDAESSQKLDYSRHLLVFETDLVELHYLNKHFLQGLL 1574 Query: 1272 KGDNPELAVCVRQLLIGSSAILSLKMQIYHRDFYSSSMPVSIAISQYMLTEFANMVQEPH 1451 KGD+P+ A+ +R LLI SAI L ++I S +P++I ISQ +L E AN + P Sbjct: 1575 KGDHPDRAILLRHLLITHSAIPRLNLRIDDTSLSSGMVPLNIGISQVLLLELANSGEIPP 1634 Query: 1452 TFSYICLDGAVKYLEVVGSYVSPKDSKKLQNVYARLIGIHLPAIGKCISLCGKEATLASH 1631 F+++ LDGAVKYLE +GS+ D N YA+LI + L AIGKCISL GK ATL SH Sbjct: 1635 PFTFVWLDGAVKYLEELGSHFPLNDPTLNGNAYAKLIELLLRAIGKCISLQGKRATLESH 1694 Query: 1632 DTESSTKTLMSQTGSSELSIGHGWNSLNEFRARLRLSFKVLLKEPSKLHRMTAIEVVKKA 1811 + ESSTK L TG SE + HG + L+EF+ARLR+SFK +K PS+L ++A++ +++A Sbjct: 1695 ERESSTKILHGGTGWSESFLSHGSHCLDEFKARLRMSFKAFIKNPSELQLLSAMQAIERA 1754 Query: 1812 LVGVNNRCSMVYEIYTGGPDEGKVSAAVAAGVDCFDSCIESVSDRDSL---KNHI 1967 LVGV +M+Y+I TG + G VS+ VAAG+DC D +E S R L K HI Sbjct: 1755 LVGVRGGHAMIYDINTGSANGGMVSSTVAAGIDCLDLILEYGSGRRCLRVVKRHI 1809 >EOX97767.1 Urb2/Npa2, putative isoform 3 [Theobroma 
cacao] Length = 1777 Score = 342 bits (877), Expect = 9e-99 Identities = 244/701 (34%), Positives = 352/701 (50%), Gaps = 62/701 (8%) Frame = +3 Query: 18 KHKTGKSHDVGKISLHEITIELLHDSLFYEQTFLCRHLPSCFSDVLKQFLVSIFGDTLAG 197 KHK GK + KI+L++I+ LL DS YE F+ R+L S F L+ ++S+F D+ Sbjct: 1087 KHKIGKDGQLKKITLYQISQGLLKDSTLYENKFVRRNLASSFCHALENSVLSLFSDSSVR 1146 Query: 198 QTDINE---WEVILNKLEKTSLAL-NRRHVADDAVLPMTPDSL--CPDSPFIKCTKEXXX 359 + W +L+KL+ +S + +RR V D+ +S P +K Sbjct: 1147 DINFKSLPVWPEVLSKLDNSSTVVCSRRDVKHDSAARSISNSSDRLPSEISMKQKAFPIE 1206 Query: 360 XXXXXXXXS------WMPKGYLNSESFSDCATNLLNLERVLVVMLLQDQGELKGDRYYEL 521 S WMPKGYLNS+SF +LNLER++V LL QG L + YEL Sbjct: 1207 NVKFKDCQSLLNLLCWMPKGYLNSKSFCQLTAYVLNLERIVVEDLLGCQGALSSNGCYEL 1266 Query: 522 FRLFVSCRKALRCLLMSFHDASGEAERCSIIPIFSGNSYAVLWLWKSVTALTELLSVFPE 701 F+LFV+CR+ L+ ++M+ + E S++ + G+S+ V+WL+KSV+ + +L E Sbjct: 1267 FQLFVACRRTLKNIIMASCEEKIEGSLSSLLSVAEGSSF-VIWLFKSVSTVIGVLDTMME 1325 Query: 702 EHATEVRHLSLSLMDDTSYIQSTLSKEQF----------------------------LFS 797 + E LMD TSY+ +SK QF L Sbjct: 1326 DCLPEFELKIFLLMDHTSYVFFAISKYQFGQAVHFIGNSEKPCKKQPYSGVVGDESILNQ 1385 Query: 798 PSSTDGFSKNGSSGESNPSLGTWMESLKTHAEVLL----------LKIAD----LNIVEV 935 P S + K+ E+ SL E+LK AE LL K+ D +N ++ Sbjct: 1386 PGSCSNYLKD---SEALRSLSITAENLKEQAESLLDPLKGALDDNAKVGDGNKAVNTNKM 1442 Query: 936 SSLVTSIHGLLWGITSVLRSGYEKGTDGPEDSFK-----DSKLKQFISASERFINSCLHA 1100 S ++ G LWG+ S L G EK + + SKL I+ FI+ H Sbjct: 1443 SFAISCFGGFLWGLASALNQGDEKSGEVNAKYLRWKCEPLSKLNICINVFLDFISEVFHM 1502 Query: 1101 LFVMDYQHPNYSKNLVDKDLSIKKKRSSSDNADFTIDVLI---DSYERQXXXXXXXXXXX 1271 D Q +Y SS D++ +L+ D E Sbjct: 1503 FLDNDQQSRSY------------YDAESSQKLDYSRHLLVFETDLVELHYLNKHFLQGLL 1550 Query: 1272 KGDNPELAVCVRQLLIGSSAILSLKMQIYHRDFYSSSMPVSIAISQYMLTEFANMVQEPH 1451 KGD+P+ A+ +R LLI SAI L ++I S +P++I ISQ +L E AN + P Sbjct: 1551 KGDHPDRAILLRHLLITHSAIPRLNLRIDDTSLSSGMVPLNIGISQVLLLELANSGEIPP 1610 Query: 1452 TFSYICLDGAVKYLEVVGSYVSPKDSKKLQNVYARLIGIHLPAIGKCISLCGKEATLASH 1631 F+++ LDGAVKYLE +GS+ D N YA+LI + L AIGKCISL 
GK ATL SH Sbjct: 1611 PFTFVWLDGAVKYLEELGSHFPLNDPTLNGNAYAKLIELLLRAIGKCISLQGKRATLESH 1670 Query: 1632 DTESSTKTLMSQTGSSELSIGHGWNSLNEFRARLRLSFKVLLKEPSKLHRMTAIEVVKKA 1811 + ESSTK L TG SE + HG + L+EF+ARLR+SFK +K PS+L ++A++ +++A Sbjct: 1671 ERESSTKILHGGTGWSESFLSHGSHCLDEFKARLRMSFKAFIKNPSELQLLSAMQAIERA 1730 Query: 1812 LVGVNNRCSMVYEIYTGGPDEGKVSAAVAAGVDCFDSCIES 1934 LVGV +M+Y+I TG + G VS+ VAAG+DC D +ES Sbjct: 1731 LVGVRGGHAMIYDINTGSANGGMVSSTVAAGIDCLDLILES 1771 >XP_016701544.1 PREDICTED: uncharacterized protein LOC107916705 [Gossypium hirsutum] Length = 2044 Score = 338 bits (868), Expect = 2e-97 Identities = 250/704 (35%), Positives = 357/704 (50%), Gaps = 50/704 (7%) Frame = +3 Query: 6 KDVIKHKTGKSHDVGKISLHEITIELLHDSLFYEQTFLCRHLPSCFSDVLKQFLVSIFGD 185 + V KHK K KISL++I+ LL DS Y+ F+ R+L S F L+ + +FG+ Sbjct: 1072 QQVEKHKIEKDGQPKKISLYQISQGLLKDSTLYDHKFVRRNLSSRFCHALENLALLLFGN 1131 Query: 186 TLAGQTDINE---WEVILNKLEKTSLALN-RRHVADDAVLPMTPDSLCPDS---PFIKCT 344 + + N W + + L+ + ++ RR+V D+ +S S P Sbjct: 1132 SSVSDRNFNSFPVWSEVFSTLDNSPAVVSGRRYVKHDSATRSISNSCNEQSSMNPTALPF 1191 Query: 345 KEXXXXXXXXXXXSWMPKGYLNSESFSDCATNLLNLERVLVVMLLQDQGELKGDRYYELF 524 K WMPKG+L+S+SFS AT +L+L++++V LL Q L ELF Sbjct: 1192 KTVKDCKSLLNLLCWMPKGFLSSKSFSKLATCVLHLDQLVVAELLLCQRALSSYGC-ELF 1250 Query: 525 RLFVSCRKALRCLLMSFHDASGEAERCSIIPIFSGNSYAVLWLWKSVTALTELLSVFPEE 704 +LFV+CR+ L+ ++M+ + + EA S++ + G SY + WL+KSV+A+TELL E+ Sbjct: 1251 QLFVTCRRTLKNIIMALCEENIEASLSSLLSVAEG-SYFITWLFKSVSAVTELLDTMSED 1309 Query: 705 HATEVRHLSLSLMDDTSYIQSTLSKEQFL----FSPSSTD------GFSKNGS------- 833 ++ + SLMD TSY+ +SK QF F +S GF + S Sbjct: 1310 CISDYKTKKFSLMDHTSYVFFAISKYQFSQAVDFIGNSEQPCKHFSGFVSDQSILNEPPL 1369 Query: 834 ------SGESNPSLGTWMESLKTHAEVLLLKIADL-------------NIVEVSSLVTSI 956 E+ SL ESL+ AE L + + NI ++S LV+ Sbjct: 1370 CFNYLKDSEALKSLSIIAESLQEQAESFLSSLKEALGIAQVGIEEEAENINKMSFLVSCF 1429 Query: 957 HGLLWGITSVLRSGYEKGTDGPEDSFKDSKLKQFISASERFINSCLHAL--FVMDYQHPN 1130 G LWG+ S L EK + +KL ++ S I C + + D H Sbjct: 1430 
GGFLWGLASALNQLGEKCGE------LKTKLLRWKSEPLSKIKLCTNVFVDLISDVLHMF 1483 Query: 1131 YSKNLVDKDLSIKKKRSSSDNADFTIDVLI--DSYERQXXXXXXXXXXXKGDNPELAVCV 1304 K + SSD D+ D L+ D KGD+P+ AV + Sbjct: 1484 LEKGQQRRS---DPDSQSSDKFDYRRDSLVFNDLVVLPCLNKHLLLGLLKGDHPDRAVLL 1540 Query: 1305 RQLLIGSSAILSLKMQIYHRDFYSSSMPVSIAISQYMLTEFANMVQEPHTFSYICLDGAV 1484 RQLLI SAIL L +++ S + I +SQ++L E AN V+ P F+++ LDGAV Sbjct: 1541 RQLLITYSAILRLNLRVGGPLLSSGMASLIIDMSQFLLLELANSVESPPPFTFVWLDGAV 1600 Query: 1485 KYLEVVGSYVSPKDSKKLQNVYARLIGIHLPAIGKCISLCGKEATLASHDTESSTKTLMS 1664 KYLE VGS+ DS +NVY +LI +HL IGKCISL GK ATL SH+ ESS+K L Sbjct: 1601 KYLEEVGSHFQFTDSALNENVYGKLIELHLRGIGKCISLQGKSATLESHERESSSKILHD 1660 Query: 1665 QTGSSELSIGHGWNSLNEFRARLRLSFKVLLKEPSKLHRMTAIEVVKKALVGVNNRCSMV 1844 TG SE + HG + L+EF+ARLR+SF V +K PS+L M+AIE ++KALVGV + Sbjct: 1661 DTGLSESFLSHGSHCLDEFKARLRMSFSVFIKNPSELQLMSAIEAIEKALVGVQGAHGRI 1720 Query: 1845 YEIYTGGPDEGKVSAAVAAGVDCFDSCIESVSDRDSL---KNHI 1967 YEI G + G VS+ VA G+DC D +E S R L K HI Sbjct: 1721 YEITAGSANGGMVSSTVAGGIDCLDLLLEHGSGRKCLSVIKRHI 1764 >XP_015160859.1 PREDICTED: uncharacterized protein LOC102601821 isoform X2 [Solanum tuberosum] Length = 2018 Score = 319 bits (817), Expect = 1e-90 Identities = 238/736 (32%), Positives = 366/736 (49%), Gaps = 91/736 (12%) Frame = +3 Query: 33 KSHDVGKISLHEITIELLHDSLFYEQTFLCRHLPSCFSDVLKQFLVSIFGDTLAGQTDIN 212 KS V ++ H +++ELL +++ YEQ +CRH+ S F +LK+ + SIF + G+ D+N Sbjct: 1010 KSGYVTGVNRHLVSVELLSNTILYEQKPICRHMASIFCQILKKSVSSIF--SYVGEVDLN 1067 Query: 213 ---EWEVILNKLEKTSLALNR-RHVADDAVLPMTP-DSLCPDSPFIKCTKEXXXXXXXXX 377 +WE ++ LEK+S R H D+ L + P L D P C KE Sbjct: 1068 GTPDWENAIHMLEKSSTTFFRSNHPQDNDSLLIEPIHHLLNDIPAELCEKELSPINAEIT 1127 Query: 378 XX-------SWMPKGYLNSESFSDCATNLLNLERVLVVMLLQDQGELKGDRYYELFRLFV 536 SW+PKG+L S+SFS AT++LN++R++V L G + YEL RL V Sbjct: 1128 RCREFLNLLSWIPKGHLRSKSFSRYATSILNIDRLVVGCLFDQHGSVALCSRYELLRLLV 1187 Query: 537 SCRKALRCLLMSFHDASGEAERCSIIPIFSGNSYAVLWLWKSVTALTELLSVFPEEHATE 716 +CR+ + LLM+ G+ S++ V WL KS++A+T LSV +E + + Sbjct: 1188 
TCRRTFKNLLMA--SCKGKKGHQSLLACLLSERSPVFWLLKSLSAVTGFLSVISQETSPQ 1245 Query: 717 VRHLSLSLMDDTSYIQSTLSKEQF--LFSP----------SSTDGFSK-----NGSSGES 845 ++H+ SLMD TS+I TL K+QF +F+ SS DG + NG + Sbjct: 1246 LKHMIFSLMDHTSFILLTLFKDQFEAIFALTAGKSYGGAISSVDGHKETVLRENGPRSDF 1305 Query: 846 NPSLGTWME------SLKTHAEVLL---------LKIADL----NIVEVSSLVTSIHGLL 968 + + W +L HA+ LL K+ DL + +VS LV+ G L Sbjct: 1306 SDNNNAWRSVSSVAGTLTRHAQELLDSLNLAVVNRKVDDLAGLQEMDKVSPLVSCFQGFL 1365 Query: 969 WGITSVLRS-GYEKGTDGPEDSFKDSKLKQFISASERFINSCLHALFVMDYQHPN----- 1130 G+ S + S ++ + E + + K+K I +NS LH LF+ Q P Sbjct: 1366 CGLVSAMDSLDIKRSSTLIESTSHNLKMKPCIETCADLLNSILHLLFLEGDQCPQGLSST 1425 Query: 1131 ------------------YSKNLVDKDLSIKKKRSSSDNADFT--------------IDV 1214 S++ D+ ++KK+ S +AD I+ Sbjct: 1426 HTAIETECCNELLAAGTYQSRDSADEPNNVKKEEHYSGSADSVQSNDCKNDLQKFGGIES 1485 Query: 1215 LIDS--YERQXXXXXXXXXXXKGDNPELAVCVRQLLIGSSAILSLKMQIYHRDFYSSSMP 1388 L+ + +E+Q KG+N E A C++ + SSAIL + + +P Sbjct: 1486 LLANVDFEQQYLRKSLLQGLSKGENLEAAFCLKHIFGASSAILKFSLHTKSTSLPKNLLP 1545 Query: 1389 VSIAISQYMLTEFANMVQEPHTFSYICLDGAVKYLEVVGSYVSPKDSKKLQNVYARLIGI 1568 + I +S +L++FAN FS+I LDG K++ +G + ++++ + I + Sbjct: 1546 ILIRVSHVLLSDFANHSGSLEQFSFIWLDGVAKFIGELGKIFPLLNPLSSRDLFVKQIEL 1605 Query: 1569 HLPAIGKCISLCGKEATLASHDTESSTKTLMSQTGSSELSIGHGWNSLNEFRARLRLSFK 1748 HL A+GKCISL GKEA LAS + ESSTK ++S +LS H N L+E ++RLR+SF Sbjct: 1606 HLRAMGKCISLQGKEAALASREIESSTK-MLSGLPEHDLSNSHWLNHLDELKSRLRMSFA 1664 Query: 1749 VLLKEPSKLHRMTAIEVVKKALVGVNNRCSMVYEIYTGGPDEGKVSAAVAAGVDCFDSCI 1928 + S+LH ++AI+ +++ALVGV C + YE+ TG KVSA VAAG+DC D + Sbjct: 1665 NFVSRASELHLLSAIQAIERALVGVQEHCIINYEVTTGSSHGAKVSAYVAAGIDCLDVIL 1724 Query: 1929 ESVSDRDSL---KNHI 1967 ESVS R L K HI Sbjct: 1725 ESVSGRKKLAVVKRHI 1740 >XP_006367335.1 PREDICTED: uncharacterized protein LOC102601821 isoform X1 [Solanum tuberosum] Length = 2086 Score = 319 bits (817), Expect = 1e-90 Identities = 238/736 (32%), Positives = 366/736 (49%), Gaps = 91/736 (12%) Frame = +3 Query: 33 
KSHDVGKISLHEITIELLHDSLFYEQTFLCRHLPSCFSDVLKQFLVSIFGDTLAGQTDIN 212 KS V ++ H +++ELL +++ YEQ +CRH+ S F +LK+ + SIF + G+ D+N Sbjct: 1078 KSGYVTGVNRHLVSVELLSNTILYEQKPICRHMASIFCQILKKSVSSIF--SYVGEVDLN 1135 Query: 213 ---EWEVILNKLEKTSLALNR-RHVADDAVLPMTP-DSLCPDSPFIKCTKEXXXXXXXXX 377 +WE ++ LEK+S R H D+ L + P L D P C KE Sbjct: 1136 GTPDWENAIHMLEKSSTTFFRSNHPQDNDSLLIEPIHHLLNDIPAELCEKELSPINAEIT 1195 Query: 378 XX-------SWMPKGYLNSESFSDCATNLLNLERVLVVMLLQDQGELKGDRYYELFRLFV 536 SW+PKG+L S+SFS AT++LN++R++V L G + YEL RL V Sbjct: 1196 RCREFLNLLSWIPKGHLRSKSFSRYATSILNIDRLVVGCLFDQHGSVALCSRYELLRLLV 1255 Query: 537 SCRKALRCLLMSFHDASGEAERCSIIPIFSGNSYAVLWLWKSVTALTELLSVFPEEHATE 716 +CR+ + LLM+ G+ S++ V WL KS++A+T LSV +E + + Sbjct: 1256 TCRRTFKNLLMA--SCKGKKGHQSLLACLLSERSPVFWLLKSLSAVTGFLSVISQETSPQ 1313 Query: 717 VRHLSLSLMDDTSYIQSTLSKEQF--LFSP----------SSTDGFSK-----NGSSGES 845 ++H+ SLMD TS+I TL K+QF +F+ SS DG + NG + Sbjct: 1314 LKHMIFSLMDHTSFILLTLFKDQFEAIFALTAGKSYGGAISSVDGHKETVLRENGPRSDF 1373 Query: 846 NPSLGTWME------SLKTHAEVLL---------LKIADL----NIVEVSSLVTSIHGLL 968 + + W +L HA+ LL K+ DL + +VS LV+ G L Sbjct: 1374 SDNNNAWRSVSSVAGTLTRHAQELLDSLNLAVVNRKVDDLAGLQEMDKVSPLVSCFQGFL 1433 Query: 969 WGITSVLRS-GYEKGTDGPEDSFKDSKLKQFISASERFINSCLHALFVMDYQHPN----- 1130 G+ S + S ++ + E + + K+K I +NS LH LF+ Q P Sbjct: 1434 CGLVSAMDSLDIKRSSTLIESTSHNLKMKPCIETCADLLNSILHLLFLEGDQCPQGLSST 1493 Query: 1131 ------------------YSKNLVDKDLSIKKKRSSSDNADFT--------------IDV 1214 S++ D+ ++KK+ S +AD I+ Sbjct: 1494 HTAIETECCNELLAAGTYQSRDSADEPNNVKKEEHYSGSADSVQSNDCKNDLQKFGGIES 1553 Query: 1215 LIDS--YERQXXXXXXXXXXXKGDNPELAVCVRQLLIGSSAILSLKMQIYHRDFYSSSMP 1388 L+ + +E+Q KG+N E A C++ + SSAIL + + +P Sbjct: 1554 LLANVDFEQQYLRKSLLQGLSKGENLEAAFCLKHIFGASSAILKFSLHTKSTSLPKNLLP 1613 Query: 1389 VSIAISQYMLTEFANMVQEPHTFSYICLDGAVKYLEVVGSYVSPKDSKKLQNVYARLIGI 1568 + I +S +L++FAN FS+I LDG K++ +G + ++++ + I + Sbjct: 1614 ILIRVSHVLLSDFANHSGSLEQFSFIWLDGVAKFIGELGKIFPLLNPLSSRDLFVKQIEL 1673 Query: 1569 
HLPAIGKCISLCGKEATLASHDTESSTKTLMSQTGSSELSIGHGWNSLNEFRARLRLSFK 1748 HL A+GKCISL GKEA LAS + ESSTK ++S +LS H N L+E ++RLR+SF Sbjct: 1674 HLRAMGKCISLQGKEAALASREIESSTK-MLSGLPEHDLSNSHWLNHLDELKSRLRMSFA 1732 Query: 1749 VLLKEPSKLHRMTAIEVVKKALVGVNNRCSMVYEIYTGGPDEGKVSAAVAAGVDCFDSCI 1928 + S+LH ++AI+ +++ALVGV C + YE+ TG KVSA VAAG+DC D + Sbjct: 1733 NFVSRASELHLLSAIQAIERALVGVQEHCIINYEVTTGSSHGAKVSAYVAAGIDCLDVIL 1792 Query: 1929 ESVSDRDSL---KNHI 1967 ESVS R L K HI Sbjct: 1793 ESVSGRKKLAVVKRHI 1808