BLASTX nr result
ID: Mentha29_contig00000943
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha29_contig00000943
         (2042 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                      Score     E
Sequences producing significant alignments:                           (bits)  Value

gb|EYU37499.1| hypothetical protein MIMGU_mgv1a003124mg [Mimulus...     772   0.0
ref|XP_004244433.1| PREDICTED: UPF0420 protein C16orf58 homolog ...     705   0.0
ref|XP_006361229.1| PREDICTED: UPF0420 protein C16orf58 homolog ...     702   0.0
ref|XP_007040833.1| Uncharacterized protein isoform 1 [Theobroma...     689   0.0
ref|XP_007040836.1| Uncharacterized protein isoform 4 [Theobroma...     659   0.0
ref|XP_002269838.1| PREDICTED: uncharacterized protein LOC100257...     656   0.0
ref|XP_006482412.1| PREDICTED: UPF0420 protein C16orf58-like [Ci...     654   0.0
ref|XP_007040837.1| Uncharacterized protein isoform 5 [Theobroma...     652   0.0
emb|CBI21809.3| unnamed protein product [Vitis vinifera]                652   0.0
ref|XP_002519954.1| conserved hypothetical protein [Ricinus comm...     640   0.0
ref|XP_004512305.1| PREDICTED: uncharacterized protein LOC101510...     637   e-180
ref|XP_006290708.1| hypothetical protein CARUB_v10016806mg [Caps...     635   e-179
ref|XP_006573502.1| PREDICTED: uncharacterized protein LOC100778...     632   e-178
ref|NP_190175.2| proteinROOT UVB SENSITIVE 1 [Arabidopsis thalia...     632   e-178
gb|EXB41003.1| hypothetical protein L484_020738 [Morus notabilis]       631   e-178
ref|XP_002875756.1| hypothetical protein ARALYDRAFT_905765 [Arab...     630   e-178
ref|XP_003612453.1| hypothetical protein MTR_5g025160 [Medicago ...     627   e-177
ref|XP_006418986.1| hypothetical protein EUTSA_v10002446mg [Eutr...     625   e-176
ref|XP_007158055.1| hypothetical protein PHAVU_002G120300g [Phas...
624 e-176 ref|XP_006878573.1| hypothetical protein AMTR_s00011p00244680 [A... 622 e-175 >gb|EYU37499.1| hypothetical protein MIMGU_mgv1a003124mg [Mimulus guttatus] Length = 606 Score = 772 bits (1993), Expect = 0.0 Identities = 392/530 (73%), Positives = 440/530 (83%) Frame = -3 Query: 1962 NDGSNNFFNSDRNYLFLLPSHLIFSSNEELRSVPYALLVSVAASLGCFILSSSPARAKTG 1783 +D SNNFFN RN FL PSH IFS E L I +S P Sbjct: 101 DDWSNNFFNFSRNPFFLFPSHFIFSREENL------------------ISTSLPKH---- 138 Query: 1782 ETDDPVYEIKGGKRIAVVPDYSKDEFVVPEKVWFWPWSSKDGNLTSSQMTMGDVWTKCRD 1603 + V+EI+ GKR+ +VPDYSKDEFVVPEK W W + N +S+ + DVW KCRD Sbjct: 139 ---EVVFEIRAGKRVELVPDYSKDEFVVPEKNWSWWLKAAKSNPSSN---LADVWMKCRD 192 Query: 1602 LTASLLLPEGFPESVTSDYLEYSLWRGVQGVAAQISGVLATQAMLYAIGLGKGAIPTAAA 1423 + SL+LPEGFPESVTSDYLEYSLWRGVQG+AAQ+SGVLATQA+LYA+GLGKGAIPTAAA Sbjct: 193 VAMSLMLPEGFPESVTSDYLEYSLWRGVQGIAAQVSGVLATQALLYAVGLGKGAIPTAAA 252 Query: 1422 VNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEILTPAFPHLFVPIX 1243 VNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRL AD LENAAFG+EILTPAFPHLFVPI Sbjct: 253 VNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLCADFLENAAFGLEILTPAFPHLFVPIG 312 Query: 1242 XXXXXXXXXXALIQAATRSCFFAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIVLANAV 1063 ALIQAATRSCF+AGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGI LAN V Sbjct: 313 AVAGAGRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIALANGV 372 Query: 1062 QSSTPLALASFGVITWVHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSGLVPSVREVND 883 QSS PLALASF VITW+HMFCNLKSYQSIQLRTLNPYRASLVFS+YLLSGLVPSV+EVND Sbjct: 373 QSSIPLALASFSVITWIHMFCNLKSYQSIQLRTLNPYRASLVFSQYLLSGLVPSVKEVND 432 Query: 882 EEPLFPAFPLLIVKRTSEEQSELLSSDAKDAAAYIDRRLQLGSKLSDVMKNREDAVALFD 703 EEPLFPAFPLLIVK TSEEQ E+LS DAK AA+ IDRRL+LGSKLSDV+K+RE+A+ALFD Sbjct: 433 EEPLFPAFPLLIVKPTSEEQVEVLSPDAKHAASNIDRRLKLGSKLSDVVKSREEAIALFD 492 Query: 702 LYQSEAYILTELEGRYCVALKESSSPQDMLRSLYQVCYLYWLERNAGIKSSSIVDDCRPG 523 LY+SE YILTE +GRYCV LKESS PQDML+SL+QV YLYWLERNAGIKS++ +DDCRPG Sbjct: 493 LYKSEGYILTEHQGRYCVVLKESSMPQDMLKSLFQVSYLYWLERNAGIKSTTTIDDCRPG 552 Query: 522 
GKLQISLEYVKREFNHVKNDSESAGWILDGLIARPLPNRIRLGNQSTSSA 373 G+LQIS+EYV+REF H+KNDS+ AGW++DGLIARPLP+RIR+G+++ S A Sbjct: 553 GRLQISMEYVQREFTHIKNDSQFAGWVVDGLIARPLPHRIRIGDETASPA 602 >ref|XP_004244433.1| PREDICTED: UPF0420 protein C16orf58 homolog [Solanum lycopersicum] Length = 606 Score = 705 bits (1819), Expect = 0.0 Identities = 372/536 (69%), Positives = 430/536 (80%), Gaps = 8/536 (1%) Frame = -3 Query: 1959 DGSNNFFNSDRNYLFLLPSHLIFSSNEE-----LRSVPYAL-LVSVAASLGCFILSSSPA 1798 D NNFFN D+ + LLP IF + L P L LVS ++S+ C +L +S Sbjct: 81 DWWNNFFNFDK--ILLLP---IFRDEDTFIDSVLSCKPLLLFLVSASSSITCCLLLASFV 135 Query: 1797 RAKTGETDDPVYEIKGGKRIAVVPDYSKDEFVVPEKVW--FWPWSSKDGNLTSSQMTMGD 1624 +AKT + VYEI+GGKR +VPDYSKDEFV+ + +W WP S G+ S+ Sbjct: 136 QAKTNN-GEIVYEIRGGKRFELVPDYSKDEFVLTKTMWSQLWP-DSTSGSFVSN------ 187 Query: 1623 VWTKCRDLTASLLLPEGFPESVTSDYLEYSLWRGVQGVAAQISGVLATQAMLYAIGLGKG 1444 +W +C++LT +L LPEGFPESVTSDYLEY+LWRGVQG+AAQISGVLATQA+LYA+GLGKG Sbjct: 188 LWMQCKELTTTLFLPEGFPESVTSDYLEYALWRGVQGIAAQISGVLATQALLYAVGLGKG 247 Query: 1443 AIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEILTPAFP 1264 AIPTAAA+NWVLKDGIGYLSKI+LS YGRHFDVNPK WRLFADLLENAA+G+EILTPAFP Sbjct: 248 AIPTAAAINWVLKDGIGYLSKILLSNYGRHFDVNPKSWRLFADLLENAAYGLEILTPAFP 307 Query: 1263 HLFVPIXXXXXXXXXXXALIQAATRSCFFAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLG 1084 HLFVPI +LIQAATRSCF+AGFAAQRNFAEVIAKGEAQGMVSK+IGIMLG Sbjct: 308 HLFVPIGAVAGAGRSAASLIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKAIGIMLG 367 Query: 1083 IVLANAVQSSTPLALASFGVITWVHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSGLVP 904 I LAN +SST LALASFGV+TW+HMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSGLVP Sbjct: 368 IALANYTRSSTSLALASFGVVTWIHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSGLVP 427 Query: 903 SVREVNDEEPLFPAFPLLIVKRTSEEQSELLSSDAKDAAAYIDRRLQLGSKLSDVMKNRE 724 SV+EVNDEEPLFPA +L +K E Q+E+LS AK AAA I RRLQLGSKLSDV ++E Sbjct: 428 SVKEVNDEEPLFPA-AILNLKAAYETQTEVLSVHAKQAAAGIVRRLQLGSKLSDVATSQE 486 Query: 723 DAVALFDLYQSEAYILTELEGRYCVALKESSSPQDMLRSLYQVCYLYWLERNAGIKSSSI 544 D +ALF+LY++E YILTE EGR+C+ LKESSSPQDML+SL+ V YLYWLE NAGIKSSS+ Sbjct: 487 
DVLALFELYKNEGYILTEHEGRFCIVLKESSSPQDMLKSLFHVNYLYWLETNAGIKSSSV 546 Query: 543 VDDCRPGGKLQISLEYVKREFNHVKNDSESAGWILDGLIARPLPNRIRLGNQSTSS 376 +DCRPGG+LQ+SLEYV+REFNHVK D E AGW+ D LIARPLP RIRL + SS Sbjct: 547 ANDCRPGGRLQMSLEYVEREFNHVKTDGEVAGWVTDSLIARPLPVRIRLDYAAESS 602 >ref|XP_006361229.1| PREDICTED: UPF0420 protein C16orf58 homolog [Solanum tuberosum] Length = 609 Score = 702 bits (1813), Expect = 0.0 Identities = 374/537 (69%), Positives = 430/537 (80%), Gaps = 9/537 (1%) Frame = -3 Query: 1959 DGSNNFFNSD-RNYLFLLPSHLIFSSNEE-----LRSVPYAL-LVSVAASLGCFILSSSP 1801 D +NFFN D R L LLP IF + + L P L LVS ++S+ C +L +S Sbjct: 81 DWWSNFFNFDKRRSLLLLP---IFRNEDTFIDSVLSCKPLLLFLVSASSSITCCLLLASF 137 Query: 1800 ARAKTGETDDPVYEIKGGKRIAVVPDYSKDEFVVPEKVW--FWPWSSKDGNLTSSQMTMG 1627 +AKT + V+EI+GGKR +VPDYSKDEFV+ + +W P SK G+ S+ Sbjct: 138 VQAKTNN-GEIVHEIRGGKRFELVPDYSKDEFVLTKTMWSRLLP-DSKSGSFVSN----- 190 Query: 1626 DVWTKCRDLTASLLLPEGFPESVTSDYLEYSLWRGVQGVAAQISGVLATQAMLYAIGLGK 1447 +W +C++LT +LLLPEGFP+SVTSDYLEY+LWRGVQGVAAQISGVLATQA+LYA+GLGK Sbjct: 191 -LWMQCKELTTTLLLPEGFPDSVTSDYLEYALWRGVQGVAAQISGVLATQALLYAVGLGK 249 Query: 1446 GAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEILTPAF 1267 GAIPTAAAVNWVLKDGIGYLSKI+LS YGRHFDVNPK WRLFADLLENAA+G+EILTPAF Sbjct: 250 GAIPTAAAVNWVLKDGIGYLSKILLSNYGRHFDVNPKSWRLFADLLENAAYGLEILTPAF 309 Query: 1266 PHLFVPIXXXXXXXXXXXALIQAATRSCFFAGFAAQRNFAEVIAKGEAQGMVSKSIGIML 1087 PHLFVPI +LIQAATRSCF+AGFAAQRNFAEVIAKGEAQGMVSK+IGIML Sbjct: 310 PHLFVPIGAVAGAGRSAASLIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKAIGIML 369 Query: 1086 GIVLANAVQSSTPLALASFGVITWVHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSGLV 907 GI LAN +SST LALASFGV+TW+HMFCNLKSY SIQLRTLNPYRASLVFSEYLLSGLV Sbjct: 370 GIALANCTRSSTSLALASFGVVTWIHMFCNLKSYHSIQLRTLNPYRASLVFSEYLLSGLV 429 Query: 906 PSVREVNDEEPLFPAFPLLIVKRTSEEQSELLSSDAKDAAAYIDRRLQLGSKLSDVMKNR 727 PSV+EVNDEEPLFPA +L +K E Q E+LS AK AAA I RRLQLGSKLSDV +R Sbjct: 430 PSVKEVNDEEPLFPA-AILNLKAAYETQMEVLSVHAKQAAAGIVRRLQLGSKLSDVATSR 488 Query: 726 
EDAVALFDLYQSEAYILTELEGRYCVALKESSSPQDMLRSLYQVCYLYWLERNAGIKSSS 547 ED +ALF+LY++E YILTE EGR+C+ LKESSSPQDML+SL+ V YLYWLE AGIKSSS Sbjct: 489 EDVLALFELYKNEGYILTEHEGRFCIVLKESSSPQDMLKSLFHVNYLYWLETKAGIKSSS 548 Query: 546 IVDDCRPGGKLQISLEYVKREFNHVKNDSESAGWILDGLIARPLPNRIRLGNQSTSS 376 + +DCRPGG+LQ+SLEYV+REFNHVK D E AGW+ D LIARPLPNRIRL + SS Sbjct: 549 VANDCRPGGRLQMSLEYVEREFNHVKTDGEVAGWVTDSLIARPLPNRIRLDYTAVSS 605 >ref|XP_007040833.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|590680339|ref|XP_007040835.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508778078|gb|EOY25334.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508778080|gb|EOY25336.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 591 Score = 689 bits (1779), Expect = 0.0 Identities = 354/509 (69%), Positives = 414/509 (81%), Gaps = 4/509 (0%) Frame = -3 Query: 1887 SNEELRSVPYALLVSVAASLGCFILSS-SPARAKTGET---DDPVYEIKGGKRIAVVPDY 1720 +++ S + L+ +++ + CF S S A A+T E DD V+E+KG K ++PD+ Sbjct: 92 NDDSSSSHSHPFLLFLSSFVACFCPSQLSSALARTNEDSQEDDVVWEVKGSKWTKLIPDF 151 Query: 1719 SKDEFVVPEKVWFWPWSSKDGNLTSSQMTMGDVWTKCRDLTASLLLPEGFPESVTSDYLE 1540 S+D FV + NLT S +++ VW +CRD+ LLLPEGFP+SVTSDYL+ Sbjct: 152 SEDAFVASNGIV---------NLTKS-LSLSTVWRQCRDIVMRLLLPEGFPDSVTSDYLD 201 Query: 1539 YSLWRGVQGVAAQISGVLATQAMLYAIGLGKGAIPTAAAVNWVLKDGIGYLSKIMLSKYG 1360 YSLWRGVQGVA+QISGVLATQA+LYA+GLGKGAIPTAAA+NWVLKDGIGYLSKIMLSKYG Sbjct: 202 YSLWRGVQGVASQISGVLATQALLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYG 261 Query: 1359 RHFDVNPKGWRLFADLLENAAFGMEILTPAFPHLFVPIXXXXXXXXXXXALIQAATRSCF 1180 RHFDVNPKGWRLFADLLENAAFG+E+LTPAFPHLFVPI ALIQAATRSCF Sbjct: 262 RHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHLFVPIGAAAGAGRSAAALIQAATRSCF 321 Query: 1179 FAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIVLANAVQSSTPLALASFGVITWVHMFC 1000 +AGFAAQRNFAEVIAKGEAQGMVSKSIGI+LGI LAN V SST LALASFGV+TWVHM+C Sbjct: 322 YAGFAAQRNFAEVIAKGEAQGMVSKSIGIVLGIALANCVGSSTSLALASFGVVTWVHMYC 381 Query: 999 NLKSYQSIQLRTLNPYRASLVFSEYLLSGLVPSVREVNDEEPLFPAFPLLIVKRTSEEQS 820 NLKSYQSIQLRTLN YRASLVFSEYLLSG 
PS++EVNDEEPLFPA P L + + E+S Sbjct: 382 NLKSYQSIQLRTLNSYRASLVFSEYLLSGQAPSIKEVNDEEPLFPAVPFLNLLSANRERS 441 Query: 819 ELLSSDAKDAAAYIDRRLQLGSKLSDVMKNREDAVALFDLYQSEAYILTELEGRYCVALK 640 +LSS+AK AAA I+RRLQLGSKLSD++ N+EDA+ALF LY+ E YILTE EG++CV LK Sbjct: 442 VVLSSEAKQAAADIERRLQLGSKLSDIVNNKEDALALFSLYKDEGYILTEHEGKFCVVLK 501 Query: 639 ESSSPQDMLRSLYQVCYLYWLERNAGIKSSSIVDDCRPGGKLQISLEYVKREFNHVKNDS 460 ESS PQDML+SL+QV YLYWLERNAGI++S DCRPGG+LQIS+EYV+REFNHVK DS Sbjct: 502 ESSLPQDMLKSLFQVNYLYWLERNAGIEASGASTDCRPGGRLQISVEYVQREFNHVKIDS 561 Query: 459 ESAGWILDGLIARPLPNRIRLGNQSTSSA 373 ES GW+ DGLIARPLPNRIR G++ S+A Sbjct: 562 ESVGWVTDGLIARPLPNRIRPGHRDASTA 590 >ref|XP_007040836.1| Uncharacterized protein isoform 4 [Theobroma cacao] gi|508778081|gb|EOY25337.1| Uncharacterized protein isoform 4 [Theobroma cacao] Length = 577 Score = 659 bits (1701), Expect = 0.0 Identities = 343/509 (67%), Positives = 402/509 (78%), Gaps = 4/509 (0%) Frame = -3 Query: 1887 SNEELRSVPYALLVSVAASLGCFILSS-SPARAKTGET---DDPVYEIKGGKRIAVVPDY 1720 +++ S + L+ +++ + CF S S A A+T E DD V+E+KG K ++PD+ Sbjct: 92 NDDSSSSHSHPFLLFLSSFVACFCPSQLSSALARTNEDSQEDDVVWEVKGSKWTKLIPDF 151 Query: 1719 SKDEFVVPEKVWFWPWSSKDGNLTSSQMTMGDVWTKCRDLTASLLLPEGFPESVTSDYLE 1540 S+D FV + NLT S +++ VW +CRD+ LLLPEGFP+SVTSDYL+ Sbjct: 152 SEDAFVASNGIV---------NLTKS-LSLSTVWRQCRDIVMRLLLPEGFPDSVTSDYLD 201 Query: 1539 YSLWRGVQGVAAQISGVLATQAMLYAIGLGKGAIPTAAAVNWVLKDGIGYLSKIMLSKYG 1360 YSLWRGVQGVA+QISGVLATQA+LYA+GLGKGAIPTAAA+NWVLKDGIGYLSKIMLSKYG Sbjct: 202 YSLWRGVQGVASQISGVLATQALLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYG 261 Query: 1359 RHFDVNPKGWRLFADLLENAAFGMEILTPAFPHLFVPIXXXXXXXXXXXALIQAATRSCF 1180 RHFDVNPKGWRLFADLLENAAFG+E+LTPAFPHLFVPI ALIQAATRSCF Sbjct: 262 RHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHLFVPIGAAAGAGRSAAALIQAATRSCF 321 Query: 1179 FAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIVLANAVQSSTPLALASFGVITWVHMFC 1000 +AGFAAQRNFAEVIAKGEAQGMVSKSIGI+LGI LAN V SST LALASFGV+TWVHM+C Sbjct: 322 YAGFAAQRNFAEVIAKGEAQGMVSKSIGIVLGIALANCVGSSTSLALASFGVVTWVHMYC 381 Query: 999 
NLKSYQSIQLRTLNPYRASLVFSEYLLSGLVPSVREVNDEEPLFPAFPLLIVKRTSEEQS 820 NLKSYQSIQLRTLN YRASLVFSEYLLSG PS++EVNDEEPLFPA P L + + E+S Sbjct: 382 NLKSYQSIQLRTLNSYRASLVFSEYLLSGQAPSIKEVNDEEPLFPAVPFLNLLSANRERS 441 Query: 819 ELLSSDAKDAAAYIDRRLQLGSKLSDVMKNREDAVALFDLYQSEAYILTELEGRYCVALK 640 +LSS+AK AAA I+RRLQLGSKLSD++ N+EDA+ALF LY+ E YILTE EG++C Sbjct: 442 VVLSSEAKQAAADIERRLQLGSKLSDIVNNKEDALALFSLYKDEGYILTEHEGKFC---- 497 Query: 639 ESSSPQDMLRSLYQVCYLYWLERNAGIKSSSIVDDCRPGGKLQISLEYVKREFNHVKNDS 460 SL+QV YLYWLERNAGI++S DCRPGG+LQIS+EYV+REFNHVK DS Sbjct: 498 ----------SLFQVNYLYWLERNAGIEASGASTDCRPGGRLQISVEYVQREFNHVKIDS 547 Query: 459 ESAGWILDGLIARPLPNRIRLGNQSTSSA 373 ES GW+ DGLIARPLPNRIR G++ S+A Sbjct: 548 ESVGWVTDGLIARPLPNRIRPGHRDASTA 576 >ref|XP_002269838.1| PREDICTED: uncharacterized protein LOC100257731 [Vitis vinifera] Length = 713 Score = 656 bits (1693), Expect = 0.0 Identities = 345/527 (65%), Positives = 412/527 (78%), Gaps = 4/527 (0%) Frame = -3 Query: 1956 GSN---NFFNSDRNYLFLL-PSHLIFSSNEELRSVPYALLVSVAASLGCFILSSSPARAK 1789 GSN ++ ++ N LF+ S ++ E + A+L+ V + L F Sbjct: 169 GSNWNWGWWGNEENALFIFFCSRVLHEHGSETAHMLRAVLLFVFSVLYSFFHFQLDTALS 228 Query: 1788 TGETDDPVYEIKGGKRIAVVPDYSKDEFVVPEKVWFWPWSSKDGNLTSSQMTMGDVWTKC 1609 + ++ V+E++GGK ++PD SKDEF+V P G SS T+ ++W +C Sbjct: 229 KEKEEEGVWEVRGGKWHKIIPDSSKDEFLVVT-----PGIGAVGAPKSS--TLPNLWLQC 281 Query: 1608 RDLTASLLLPEGFPESVTSDYLEYSLWRGVQGVAAQISGVLATQAMLYAIGLGKGAIPTA 1429 ++L L+LPEGFP SVTSDYL+Y+LWRGVQGVA+QISGVLATQA+LYA+GLGKGAIPTA Sbjct: 282 KELFLRLMLPEGFPHSVTSDYLDYTLWRGVQGVASQISGVLATQALLYAVGLGKGAIPTA 341 Query: 1428 AAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEILTPAFPHLFVP 1249 AAVNWVLKDGIGYLSKI+LSKYGRHFDV+PKGWRLFADLLENAA+G+EILTPAFPH F+ Sbjct: 342 AAVNWVLKDGIGYLSKILLSKYGRHFDVHPKGWRLFADLLENAAYGLEILTPAFPHQFLL 401 Query: 1248 IXXXXXXXXXXXALIQAATRSCFFAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIVLAN 1069 I ALIQA+TRSCF+AGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGI LAN Sbjct: 402 IGAVAGAGRSAAALIQASTRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIALAN 461 Query: 1068 
AVQSSTPLALASFGVITWVHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSGLVPSVREV 889 + SS PL+ ASF V+T VHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSG VPS++EV Sbjct: 462 CIGSSAPLSFASFTVVTAVHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQVPSIKEV 521 Query: 888 NDEEPLFPAFPLLIVKRTSEEQSELLSSDAKDAAAYIDRRLQLGSKLSDVMKNREDAVAL 709 N+EEPLFP PLL K T + QS +LS++AKDAAA I+RRLQLGSKLS+V+ ++ED +AL Sbjct: 522 NEEEPLFPVVPLLNAKPTYKAQSAVLSTEAKDAAAEIERRLQLGSKLSEVVSSKEDVLAL 581 Query: 708 FDLYQSEAYILTELEGRYCVALKESSSPQDMLRSLYQVCYLYWLERNAGIKSSSIVDDCR 529 FDLY++EAYILTE +GR+ V LKES SPQDML+S++ V YLYWLERNAGI S DDCR Sbjct: 582 FDLYRNEAYILTEHKGRFFVILKESCSPQDMLKSVFHVNYLYWLERNAGIISMGASDDCR 641 Query: 528 PGGKLQISLEYVKREFNHVKNDSESAGWILDGLIARPLPNRIRLGNQ 388 PGG+LQISLEYV+REFNH+KNDSE GW DGLIARPLPNRIR G++ Sbjct: 642 PGGRLQISLEYVQREFNHLKNDSEFVGWATDGLIARPLPNRIRPGHK 688 >ref|XP_006482412.1| PREDICTED: UPF0420 protein C16orf58-like [Citrus sinensis] Length = 586 Score = 654 bits (1687), Expect = 0.0 Identities = 342/539 (63%), Positives = 411/539 (76%), Gaps = 9/539 (1%) Frame = -3 Query: 1962 NDGSNNFFNSDRNYLFLLPSHLIFSSNEELRSVPYALLVSVAASLGCFI---LSSSPARA 1792 + G+NN N++ N H +++ Y+LL+ V + L CF ++++ AR Sbjct: 56 SSGNNNNNNNNNNPSGSWWWHGGNGGDDDSSGSFYSLLLFVPSLLYCFCHLQVATAIART 115 Query: 1791 KTGETDD------PVYEIKGGKRIAVVPDYSKDEFVVPEKVWFWPWSSKDGNLTSSQMTM 1630 T DD V+E+KG KR ++PD++KD FVV S+ + +L SS +++ Sbjct: 116 ATSSEDDGNKEYDAVWEVKGSKRTKLIPDFTKDAFVVA--------SASNASL-SSLLSV 166 Query: 1629 GDVWTKCRDLTASLLLPEGFPESVTSDYLEYSLWRGVQGVAAQISGVLATQAMLYAIGLG 1450 +W +CR+L +LPEGFP+SVTSDYL YSLWR VQGVA+QISGVLATQA+LYAIGLG Sbjct: 167 NKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG 226 Query: 1449 KGAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEILTPA 1270 KGAIPTAAA+NWVLKDGIGYLSKIMLS +GRHFDVNPKGWRLFADLLENAAFG+E+LTPA Sbjct: 227 KGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPA 286 Query: 1269 FPHLFVPIXXXXXXXXXXXALIQAATRSCFFAGFAAQRNFAEVIAKGEAQGMVSKSIGIM 1090 FPH FV I ALIQA+TRSCF+AGFAA+RNFAEVIAKGEAQGMVSK+IGIM Sbjct: 287 
FPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIM 346 Query: 1089 LGIVLANAVQSSTPLALASFGVITWVHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSGL 910 LGI LAN + SS P ALASF V+TW+HM+CNLKSYQSI+LRTLNPYRASLVFSEYLLSG Sbjct: 347 LGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQ 406 Query: 909 VPSVREVNDEEPLFPAFPLLIVKRTSEEQSELLSSDAKDAAAYIDRRLQLGSKLSDVMKN 730 P V+EVNDEEPLFPAF +K ++ Q +LSS+AKDAA I+ RLQLGSKLSDV+ N Sbjct: 407 APPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNN 466 Query: 729 REDAVALFDLYQSEAYILTELEGRYCVALKESSSPQDMLRSLYQVCYLYWLERNAGIKSS 550 +EDA ALF LY+ E YILTE G++CV LKES+ PQDML+SL+Q YLYWLERNAGI ++ Sbjct: 467 KEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASYLYWLERNAGIVAT 526 Query: 549 SIVDDCRPGGKLQISLEYVKREFNHVKNDSESAGWILDGLIARPLPNRIRLGNQSTSSA 373 S DC PGG+L+ISL+YV+REFNHVK+DS S GW+ DGLIARPLPNRIR G S A Sbjct: 527 STSADCAPGGRLEISLDYVQREFNHVKSDSASVGWVTDGLIARPLPNRIRPGYVEPSVA 585 >ref|XP_007040837.1| Uncharacterized protein isoform 5 [Theobroma cacao] gi|508778082|gb|EOY25338.1| Uncharacterized protein isoform 5 [Theobroma cacao] Length = 573 Score = 652 bits (1682), Expect = 0.0 Identities = 340/509 (66%), Positives = 398/509 (78%), Gaps = 4/509 (0%) Frame = -3 Query: 1887 SNEELRSVPYALLVSVAASLGCFILSS-SPARAKTGET---DDPVYEIKGGKRIAVVPDY 1720 +++ S + L+ +++ + CF S S A A+T E DD V+E+KG K ++PD+ Sbjct: 92 NDDSSSSHSHPFLLFLSSFVACFCPSQLSSALARTNEDSQEDDVVWEVKGSKWTKLIPDF 151 Query: 1719 SKDEFVVPEKVWFWPWSSKDGNLTSSQMTMGDVWTKCRDLTASLLLPEGFPESVTSDYLE 1540 S+D FV + NLT S +++ VW +CRD+ LLLPEGFP+SVTSDYL+ Sbjct: 152 SEDAFVASNGIV---------NLTKS-LSLSTVWRQCRDIVMRLLLPEGFPDSVTSDYLD 201 Query: 1539 YSLWRGVQGVAAQISGVLATQAMLYAIGLGKGAIPTAAAVNWVLKDGIGYLSKIMLSKYG 1360 YSLWRGVQGVA+QISGVLATQA+LYA+GLGKGAIPTAAA+NWVLKDGIGYLSKIMLSKYG Sbjct: 202 YSLWRGVQGVASQISGVLATQALLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYG 261 Query: 1359 RHFDVNPKGWRLFADLLENAAFGMEILTPAFPHLFVPIXXXXXXXXXXXALIQAATRSCF 1180 RHFDVNPKGWRLFADLLENAAFG+E+LTPAFPHLFVPI ALIQAATRSCF Sbjct: 262 
RHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHLFVPIGAAAGAGRSAAALIQAATRSCF 321 Query: 1179 FAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIVLANAVQSSTPLALASFGVITWVHMFC 1000 +AGFAAQRNFAEVIAKGEAQGMVSKSIGI+LGI LAN V SST LALASFGV+TWVHM+C Sbjct: 322 YAGFAAQRNFAEVIAKGEAQGMVSKSIGIVLGIALANCVGSSTSLALASFGVVTWVHMYC 381 Query: 999 NLKSYQSIQLRTLNPYRASLVFSEYLLSGLVPSVREVNDEEPLFPAFPLLIVKRTSEEQS 820 NLKSYQSIQLRTLN YRASLVFSEYLLSG PS++EVNDEEPLFPA P L + + E+S Sbjct: 382 NLKSYQSIQLRTLNSYRASLVFSEYLLSGQAPSIKEVNDEEPLFPAVPFLNLLSANRERS 441 Query: 819 ELLSSDAKDAAAYIDRRLQLGSKLSDVMKNREDAVALFDLYQSEAYILTELEGRYCVALK 640 +LSS+AK AAA I+RRLQLGSKLSD++ N+EDA+ALF LY+ E YILTE EG++CV Sbjct: 442 VVLSSEAKQAAADIERRLQLGSKLSDIVNNKEDALALFSLYKDEGYILTEHEGKFCVN-- 499 Query: 639 ESSSPQDMLRSLYQVCYLYWLERNAGIKSSSIVDDCRPGGKLQISLEYVKREFNHVKNDS 460 YLYWLERNAGI++S DCRPGG+LQIS+EYV+REFNHVK DS Sbjct: 500 ----------------YLYWLERNAGIEASGASTDCRPGGRLQISVEYVQREFNHVKIDS 543 Query: 459 ESAGWILDGLIARPLPNRIRLGNQSTSSA 373 ES GW+ DGLIARPLPNRIR G++ S+A Sbjct: 544 ESVGWVTDGLIARPLPNRIRPGHRDASTA 572 >emb|CBI21809.3| unnamed protein product [Vitis vinifera] Length = 537 Score = 652 bits (1681), Expect = 0.0 Identities = 337/489 (68%), Positives = 395/489 (80%) Frame = -3 Query: 1857 ALLVSVAASLGCFILSSSPARAKTGETDDPVYEIKGGKRIAVVPDYSKDEFVVPEKVWFW 1678 A+L+ V + L F + ++ V+E++GGK ++PD SKDEF+V Sbjct: 4 AVLLFVFSVLYSFFHFQLDTALSKEKEEEGVWEVRGGKWHKIIPDSSKDEFLVVT----- 58 Query: 1677 PWSSKDGNLTSSQMTMGDVWTKCRDLTASLLLPEGFPESVTSDYLEYSLWRGVQGVAAQI 1498 P G SS T+ ++W +C++L L+LPEGFP SVTSDYL+Y+LWRGVQGVA+QI Sbjct: 59 PGIGAVGAPKSS--TLPNLWLQCKELFLRLMLPEGFPHSVTSDYLDYTLWRGVQGVASQI 116 Query: 1497 SGVLATQAMLYAIGLGKGAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFA 1318 SGVLATQA+LYA+GLGKGAIPTAAAVNWVLKDGIGYLSKI+LSKYGRHFDV+PKGWRLFA Sbjct: 117 SGVLATQALLYAVGLGKGAIPTAAAVNWVLKDGIGYLSKILLSKYGRHFDVHPKGWRLFA 176 Query: 1317 DLLENAAFGMEILTPAFPHLFVPIXXXXXXXXXXXALIQAATRSCFFAGFAAQRNFAEVI 1138 DLLENAA+G+EILTPAFPH F+ I ALIQA+TRSCF+AGFAAQRNFAEVI Sbjct: 177 DLLENAAYGLEILTPAFPHQFLLIGAVAGAGRSAAALIQASTRSCFYAGFAAQRNFAEVI 
236 Query: 1137 AKGEAQGMVSKSIGIMLGIVLANAVQSSTPLALASFGVITWVHMFCNLKSYQSIQLRTLN 958 AKGEAQGMVSKSIGIMLGI LAN + SS PL+ ASF V+T VHMFCNLKSYQSIQLRTLN Sbjct: 237 AKGEAQGMVSKSIGIMLGIALANCIGSSAPLSFASFTVVTAVHMFCNLKSYQSIQLRTLN 296 Query: 957 PYRASLVFSEYLLSGLVPSVREVNDEEPLFPAFPLLIVKRTSEEQSELLSSDAKDAAAYI 778 PYRASLVFSEYLLSG VPS++EVN+EEPLFP PLL K T + QS +LS++AKDAAA I Sbjct: 297 PYRASLVFSEYLLSGQVPSIKEVNEEEPLFPVVPLLNAKPTYKAQSAVLSTEAKDAAAEI 356 Query: 777 DRRLQLGSKLSDVMKNREDAVALFDLYQSEAYILTELEGRYCVALKESSSPQDMLRSLYQ 598 +RRLQLGSKLS+V+ ++ED +ALFDLY++EAYILTE +GR+ V LKES SPQDML+S++ Sbjct: 357 ERRLQLGSKLSEVVSSKEDVLALFDLYRNEAYILTEHKGRFFVILKESCSPQDMLKSVFH 416 Query: 597 VCYLYWLERNAGIKSSSIVDDCRPGGKLQISLEYVKREFNHVKNDSESAGWILDGLIARP 418 V YLYWLERNAGI S DDCRPGG+LQISLEYV+REFNH+KNDSE GW DGLIARP Sbjct: 417 VNYLYWLERNAGIISMGASDDCRPGGRLQISLEYVQREFNHLKNDSEFVGWATDGLIARP 476 Query: 417 LPNRIRLGN 391 LPNRIR G+ Sbjct: 477 LPNRIRPGH 485 >ref|XP_002519954.1| conserved hypothetical protein [Ricinus communis] gi|223541000|gb|EEF42558.1| conserved hypothetical protein [Ricinus communis] Length = 541 Score = 640 bits (1650), Expect = 0.0 Identities = 344/540 (63%), Positives = 408/540 (75%), Gaps = 7/540 (1%) Frame = -3 Query: 1965 KNDGSNNFFNSDRNYLFLLPSHLIFSSNEELRSVPYALLVSVAASLGCFILSSSPARAKT 1786 + GSNN N++ N P ++ N + + L+ +L SS+ AR Sbjct: 9 RGSGSNNNNNNNNNNNPFDPWWW-WNENNKNNCDYFVWLLCCFVALWLQSASSAFARTTL 67 Query: 1785 GE-----TDDPVYEIKGGKRIAVVPDYSKDEFVVPEKVWFWPWSSKDGNLTSSQMTMG-D 1624 E +D V+ +KG KRI ++PD+ KDEF+V + SS D ++SS + G Sbjct: 68 KEKEEEGAEDSVWVVKGSKRIRLIPDFIKDEFLVNPSLP----SSYDDIISSSWLHFGRT 123 Query: 1623 VWTKCRDLTASLLLPEGFPESVTSDYLEYSLWRGVQGVAAQISGVLATQAMLYAIGLGKG 1444 +W +CR L L+LPEG+P SVTSDYL+YSLWRGVQGVA+QISGVLATQA+LYAIGLGKG Sbjct: 124 LWLQCRALFVRLMLPEGYPHSVTSDYLDYSLWRGVQGVASQISGVLATQALLYAIGLGKG 183 Query: 1443 AIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEILTPAFP 1264 AIPTAAA+NWVLKDGIGYLSKI+LSKYGRHFDVNPKGWRLFADLLENAAFG+EILTPAFP Sbjct: 184 
AIPTAAAINWVLKDGIGYLSKIVLSKYGRHFDVNPKGWRLFADLLENAAFGLEILTPAFP 243 Query: 1263 HLFVPIXXXXXXXXXXXALIQAATRSCFFAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLG 1084 HLFV I ALIQAATRSCF+AGFAAQRNFAEVIAKGEAQGMVSK IGIMLG Sbjct: 244 HLFVFIGAAAGAGRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKFIGIMLG 303 Query: 1083 IVLANAVQSSTPLALASFGVITWVHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSGLVP 904 I LAN + SS PLALASF V+TW+HMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSG P Sbjct: 304 IGLANCIGSSIPLALASFSVVTWIHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQAP 363 Query: 903 SVREVNDEEPLFPA-FPLLIVKRTSEEQSELLSSDAKDAAAYIDRRLQLGSKLSDVMKNR 727 +++VNDEEPLFPA FP K + +LS +A+DAA I+RRLQLGSKLSDV+ ++ Sbjct: 364 PIKDVNDEEPLFPAVFPHF--KSADKPSLVVLSLEARDAATEIERRLQLGSKLSDVVNSK 421 Query: 726 EDAVALFDLYQSEAYILTELEGRYCVALKESSSPQDMLRSLYQVCYLYWLERNAGIKSSS 547 ED +ALF+LY+ E YILTE +GR+CV LKES S QDML++L+QV YLYWLERNAG+ + Sbjct: 422 EDVLALFNLYKDEGYILTEYKGRFCVVLKESCSAQDMLKALFQVNYLYWLERNAGLDARG 481 Query: 546 IVDDCRPGGKLQISLEYVKREFNHVKNDSESAGWILDGLIARPLPNRIRLGNQSTSSAES 367 DCR GG+LQ+SLEY++REF+HV+NDS S GW+ DGLIARPLPNRI G+ SS S Sbjct: 482 TSADCRSGGRLQVSLEYMQREFSHVRNDSISVGWVADGLIARPLPNRIYPGDLVASSIVS 541 >ref|XP_004512305.1| PREDICTED: uncharacterized protein LOC101510665 [Cicer arietinum] Length = 590 Score = 637 bits (1642), Expect = e-180 Identities = 325/515 (63%), Positives = 394/515 (76%), Gaps = 14/515 (2%) Frame = -3 Query: 1893 FSSNEELRSVPYALLVSVAAS----------LGCFILSSSPARAKTGETDD----PVYEI 1756 F S++ + Y L +S+ S L F ++ +P+ + ++ P++E+ Sbjct: 79 FDSDDSSSNSRYTLFLSLLCSSVICYFFQLLLAKFAMARTPSSCSSSIENEILKQPIWEV 138 Query: 1755 KGGKRIAVVPDYSKDEFVVPEKVWFWPWSSKDGNLTSSQMTMGDVWTKCRDLTASLLLPE 1576 KGG I + PD+ KD F+ +F SS L SQ+ ++TKC++ T L+LPE Sbjct: 139 KGGNFIKLFPDHLKDIFIASNPTFFSELSS----LNVSQVP-SFLYTKCKEFTVRLMLPE 193 Query: 1575 GFPESVTSDYLEYSLWRGVQGVAAQISGVLATQAMLYAIGLGKGAIPTAAAVNWVLKDGI 1396 GFP SVTSDYLEYSLWRGVQGVA Q+SGVLATQA+LYA+GLGKGAIPTAAA+NWVLKDGI Sbjct: 194 GFPNSVTSDYLEYSLWRGVQGVACQVSGVLATQALLYAVGLGKGAIPTAAAINWVLKDGI 253 Query: 1395 
GYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEILTPAFPHLFVPIXXXXXXXXXX 1216 GYLSKI+LS +GRHFDVNPKGWRLFADLLENAAFG+E+ TPAFPHLFVPI Sbjct: 254 GYLSKILLSDFGRHFDVNPKGWRLFADLLENAAFGLEMCTPAFPHLFVPIGAVAGASRSA 313 Query: 1215 XALIQAATRSCFFAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIVLANAVQSSTPLALA 1036 +LIQA+TRSCFFAGFAAQRNFAEVIAKGE QGM S+ IGI LGI L N + SSTPL LA Sbjct: 314 ASLIQASTRSCFFAGFAAQRNFAEVIAKGEVQGMASRFIGIALGIGLGNCIGSSTPLVLA 373 Query: 1035 SFGVITWVHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSGLVPSVREVNDEEPLFPAFP 856 SF V+TWVHM+CNLKSYQSIQLRTLNPYRASLVFSEYLLSG P V+EVNDEEPLFPA P Sbjct: 374 SFCVVTWVHMYCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPALP 433 Query: 855 LLIVKRTSEEQSELLSSDAKDAAAYIDRRLQLGSKLSDVMKNREDAVALFDLYQSEAYIL 676 +L ++ QS +LSS+AKDAA I+ RLQLGSKLS+++ N+E+ +ALF LY++E YIL Sbjct: 434 ILNACFANKAQSIVLSSEAKDAAVEIESRLQLGSKLSEIIHNKEEVLALFSLYKNEGYIL 493 Query: 675 TELEGRYCVALKESSSPQDMLRSLYQVCYLYWLERNAGIKSSSIVDDCRPGGKLQISLEY 496 +E G++CV LKE+ S DML++L+QV YLYWLE+NAGI+ + DC+PGG+L+ISLEY Sbjct: 494 SEHTGKFCVVLKENCSQLDMLKALFQVNYLYWLEKNAGIEGRGALYDCKPGGRLRISLEY 553 Query: 495 VKREFNHVKNDSESAGWILDGLIARPLPNRIRLGN 391 +REFNH +ND ESAGWI DGLIARPLPNRIR GN Sbjct: 554 AEREFNHARNDGESAGWIADGLIARPLPNRIRPGN 588 >ref|XP_006290708.1| hypothetical protein CARUB_v10016806mg [Capsella rubella] gi|482559415|gb|EOA23606.1| hypothetical protein CARUB_v10016806mg [Capsella rubella] Length = 657 Score = 635 bits (1639), Expect = e-179 Identities = 326/492 (66%), Positives = 387/492 (78%), Gaps = 10/492 (2%) Frame = -3 Query: 1830 LGCFI-----LSSSPARAKTGETDDP-----VYEIKGGKRIAVVPDYSKDEFVVPEKVWF 1681 L CF +S+ A+A+ ++DD V+E++G KR +VPD+ KDEFV E + Sbjct: 167 LSCFFHFRLSAASAVAKAENSDSDDSTEKETVWEVRGSKRKRLVPDFVKDEFVSEEAAFE 226 Query: 1680 WPWSSKDGNLTSSQMTMGDVWTKCRDLTASLLLPEGFPESVTSDYLEYSLWRGVQGVAAQ 1501 SS +T ++ +CR L LLPEG+P SVTSDYL+YSLWRGVQG+A+Q Sbjct: 227 ----------LSSSLTPENLLAQCRSLLTQFLLPEGYPNSVTSDYLDYSLWRGVQGIASQ 276 Query: 1500 ISGVLATQAMLYAIGLGKGAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLF 1321 
ISGVLATQ++LYA+GLGKGAIPTAAA+NWVLKDGIGYLSKIMLSKYGRHFDV+PKGWRLF Sbjct: 277 ISGVLATQSLLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVHPKGWRLF 336 Query: 1320 ADLLENAAFGMEILTPAFPHLFVPIXXXXXXXXXXXALIQAATRSCFFAGFAAQRNFAEV 1141 ADLLENAAFGME+LTP FP FV I ALIQAATRSCF AGFA+QRNFAEV Sbjct: 337 ADLLENAAFGMEMLTPLFPQFFVMIGAGAGAGRSAAALIQAATRSCFNAGFASQRNFAEV 396 Query: 1140 IAKGEAQGMVSKSIGIMLGIVLANAVQSSTPLALASFGVITWVHMFCNLKSYQSIQLRTL 961 IAKGEAQGMVSKS+GI+LGIV+AN + +ST LALA+FGV+T +HM+ NLKSYQ IQLRTL Sbjct: 397 IAKGEAQGMVSKSMGILLGIVVANCIGTSTSLALAAFGVVTAIHMYTNLKSYQCIQLRTL 456 Query: 960 NPYRASLVFSEYLLSGLVPSVREVNDEEPLFPAFPLLIVKRTSEEQSELLSSDAKDAAAY 781 NPYRASLVFSEYL+SG P ++EVNDEEPLFPA L +K + Q +LSS+AK AAA Sbjct: 457 NPYRASLVFSEYLISGQAPLIKEVNDEEPLFPAVRFLNIKSPGKLQDFVLSSEAKSAAAD 516 Query: 780 IDRRLQLGSKLSDVMKNREDAVALFDLYQSEAYILTELEGRYCVALKESSSPQDMLRSLY 601 I+ RLQLGSKLSDV+ N+E+A+ALFDLY++E YILTE GR+CV LKESSSPQDMLRSL+ Sbjct: 517 IEERLQLGSKLSDVIHNKEEAIALFDLYRNEGYILTEHRGRFCVMLKESSSPQDMLRSLF 576 Query: 600 QVCYLYWLERNAGIKSSSIVDDCRPGGKLQISLEYVKREFNHVKNDSESAGWILDGLIAR 421 QV YLYWLE+NAGI+ +S DC+PGG+L ISL+YV+REF H K DSES GW+ +GLIAR Sbjct: 577 QVNYLYWLEKNAGIEPASTYSDCKPGGRLHISLDYVRREFEHAKEDSESVGWVTEGLIAR 636 Query: 420 PLPNRIRLGNQS 385 PLP RIRLG S Sbjct: 637 PLPTRIRLGYDS 648 >ref|XP_006573502.1| PREDICTED: uncharacterized protein LOC100778944 [Glycine max] Length = 593 Score = 632 bits (1630), Expect = e-178 Identities = 330/539 (61%), Positives = 404/539 (74%), Gaps = 9/539 (1%) Frame = -3 Query: 1953 SNNFFNSDRNYLFLLPSHLIFSSNEELRSVPYALLVSVAASLGCFILSSSPARAKT---- 1786 +NN N++ + P S++ ++ +LL S A C +L + A+AKT Sbjct: 59 NNNNNNNNNGGSWGNPFDSSDSNSNSHHTLFLSLLCSSALCFFCHLLHAKLAKAKTLSPS 118 Query: 1785 --GETD---DPVYEIKGGKRIAVVPDYSKDEFVVPEKVWFWPWSSKDGNLTSSQMTMGDV 1621 +T +PVYE+KGGK +VPD + D FV ++ + SS + S T V Sbjct: 119 TTADTSLFSEPVYEVKGGKWTKLVPDLTNDVFVSAQQGFLSELSSL--KVPSQLATF--V 174 Query: 1620 WTKCRDLTASLLLPEGFPESVTSDYLEYSLWRGVQGVAAQISGVLATQAMLYAIGLGKGA 1441 W KC D+ L+LPEGFPESVTSDYLEYSLWR VQGVA Q+SGVLATQ++LYA+GLGKGA 
Sbjct: 175 WLKCSDIFTRLMLPEGFPESVTSDYLEYSLWRAVQGVACQVSGVLATQSLLYAVGLGKGA 234 Query: 1440 IPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEILTPAFPH 1261 IPTAAA+NWVLKDGIGYLSKIMLS +GRHFDV+PKGWRLFADLLENAAFG+E+ TPAFP Sbjct: 235 IPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVDPKGWRLFADLLENAAFGLEMCTPAFPQ 294 Query: 1260 LFVPIXXXXXXXXXXXALIQAATRSCFFAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGI 1081 FV I +LIQA+TRSCFFAGFAAQRNFAEVIAKGE QGM S+ IGI LGI Sbjct: 295 FFVLIGAVAGASRSAASLIQASTRSCFFAGFAAQRNFAEVIAKGEVQGMASRFIGIGLGI 354 Query: 1080 VLANAVQSSTPLALASFGVITWVHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSGLVPS 901 L N + SSTPL LASF V+TW+HM+CNLKSYQSIQLRTLNPYRASLVFSEYLLSG P Sbjct: 355 GLGNCIGSSTPLVLASFTVLTWIHMYCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQAPP 414 Query: 900 VREVNDEEPLFPAFPLLIVKRTSEEQSELLSSDAKDAAAYIDRRLQLGSKLSDVMKNRED 721 V+EVNDEEPLFPA P+L ++ QS +LSS+AKDAAA I+ RLQLGSKLS+++ ++ED Sbjct: 415 VKEVNDEEPLFPAVPILNATFANKAQSIVLSSEAKDAAAEIEHRLQLGSKLSEIVNSKED 474 Query: 720 AVALFDLYQSEAYILTELEGRYCVALKESSSPQDMLRSLYQVCYLYWLERNAGIKSSSIV 541 +ALF LY++E YIL+E G++CV LKE+ S QDML++L+QV YLYWLE+NAGI + Sbjct: 475 VLALFGLYKNEGYILSEYMGKFCVVLKENCSQQDMLKALFQVNYLYWLEKNAGIGGRGTL 534 Query: 540 DDCRPGGKLQISLEYVKREFNHVKNDSESAGWILDGLIARPLPNRIRLGNQSTSSAEST 364 +D +PGG+L ISL+YV+REFNHVKND E GW+ DGLIARPLPNRIR+G+ S++ S+ Sbjct: 535 NDSKPGGRLHISLDYVEREFNHVKNDGELVGWVTDGLIARPLPNRIRIGDTPPSNSVSS 593 >ref|NP_190175.2| proteinROOT UVB SENSITIVE 1 [Arabidopsis thaliana] gi|30793915|gb|AAP40410.1| unknown protein [Arabidopsis thaliana] gi|30794095|gb|AAP40490.1| unknown protein [Arabidopsis thaliana] gi|110739240|dbj|BAF01534.1| hypothetical protein [Arabidopsis thaliana] gi|332644566|gb|AEE78087.1| protein root UVB sensitive 1 [Arabidopsis thaliana] Length = 608 Score = 632 bits (1630), Expect = e-178 Identities = 329/509 (64%), Positives = 395/509 (77%), Gaps = 10/509 (1%) Frame = -3 Query: 1887 SNEELRSVPYALLVSVAASLGCFI---LSSSPARAKTGETD-------DPVYEIKGGKRI 1738 S+ +LR + + LL L CF LS++ A AK +D + V+E++G KR Sbjct: 103 
SSFDLRYLCFLLL-----GLSCFFHFRLSAASAIAKDQNSDSNGDAVKETVWEVRGSKRK 157 Query: 1737 AVVPDYSKDEFVVPEKVWFWPWSSKDGNLTSSQMTMGDVWTKCRDLTASLLLPEGFPESV 1558 +VPD+ KDEFV E + SS +T ++ +CR+L LLPEGFP SV Sbjct: 158 RLVPDFVKDEFVSEESAFE----------LSSSLTPENLLAQCRNLLTQFLLPEGFPNSV 207 Query: 1557 TSDYLEYSLWRGVQGVAAQISGVLATQAMLYAIGLGKGAIPTAAAVNWVLKDGIGYLSKI 1378 TSDYL+YSLWRGVQG+A+QISGVLATQ++LYA+GLGKGAIPTAAA+NWVLKDGIGYLSKI Sbjct: 208 TSDYLDYSLWRGVQGIASQISGVLATQSLLYAVGLGKGAIPTAAAINWVLKDGIGYLSKI 267 Query: 1377 MLSKYGRHFDVNPKGWRLFADLLENAAFGMEILTPAFPHLFVPIXXXXXXXXXXXALIQA 1198 MLSKYGRHFDV+PKGWRLFADLLENAAFGME+LTP FP FV I ALIQA Sbjct: 268 MLSKYGRHFDVHPKGWRLFADLLENAAFGMEMLTPVFPQFFVMIGAAAGAGRSAAALIQA 327 Query: 1197 ATRSCFFAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIVLANAVQSSTPLALASFGVIT 1018 ATRSCF AGFA+QRNFAEVIAKGEAQGMVSKS+GI+LGIV+AN + +ST LALA+FGV+T Sbjct: 328 ATRSCFNAGFASQRNFAEVIAKGEAQGMVSKSVGILLGIVVANCIGTSTSLALAAFGVVT 387 Query: 1017 WVHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSGLVPSVREVNDEEPLFPAFPLLIVKR 838 +HM+ NLKSYQ IQLRTLNPYRASLVFSEYL+SG P ++EVNDEEPLFP +K Sbjct: 388 TIHMYTNLKSYQCIQLRTLNPYRASLVFSEYLISGQAPLIKEVNDEEPLFPTVRFSNMKS 447 Query: 837 TSEEQSELLSSDAKDAAAYIDRRLQLGSKLSDVMKNREDAVALFDLYQSEAYILTELEGR 658 + Q +LSS+AK AAA I+ RLQLGSKLSDV+ N+E+A+ALFDLY++E YILTE +GR Sbjct: 448 PEKLQDFVLSSEAKAAAADIEERLQLGSKLSDVIHNKEEAIALFDLYRNEGYILTEHKGR 507 Query: 657 YCVALKESSSPQDMLRSLYQVCYLYWLERNAGIKSSSIVDDCRPGGKLQISLEYVKREFN 478 +CV LKESS+PQDMLRSL+QV YLYWLE+NAGI+ +S DC+PGG+L ISL+YV+REF Sbjct: 508 FCVMLKESSTPQDMLRSLFQVNYLYWLEKNAGIEPASTYSDCKPGGRLHISLDYVRREFE 567 Query: 477 HVKNDSESAGWILDGLIARPLPNRIRLGN 391 H K DSES GW+ +GLIARPLP RIRLG+ Sbjct: 568 HAKEDSESVGWVTEGLIARPLPTRIRLGH 596 >gb|EXB41003.1| hypothetical protein L484_020738 [Morus notabilis] Length = 579 Score = 631 bits (1627), Expect = e-178 Identities = 322/467 (68%), Positives = 375/467 (80%) Frame = -3 Query: 1794 AKTGETDDPVYEIKGGKRIAVVPDYSKDEFVVPEKVWFWPWSSKDGNLTSSQMTMGDVWT 1615 A+ V+E+KGGK I +VP+ D FVV +P +S ++ + + Sbjct: 112 
ARAQSLSSSVWEVKGGKWILLVPNDLDDTFVVDS---LFPSTSSTRPVSPLNLWL----E 164 Query: 1614 KCRDLTASLLLPEGFPESVTSDYLEYSLWRGVQGVAAQISGVLATQAMLYAIGLGKGAIP 1435 KCR L L+LPEG+PESVTSDYL+YSLWR VQGVA+QIS VLATQ++LYA+GLGKGAIP Sbjct: 165 KCRQLVMRLMLPEGYPESVTSDYLDYSLWRAVQGVASQISAVLATQSLLYAVGLGKGAIP 224 Query: 1434 TAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEILTPAFPHLF 1255 TAAA+NWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFG E+LTPAFPHLF Sbjct: 225 TAAALNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGFEMLTPAFPHLF 284 Query: 1254 VPIXXXXXXXXXXXALIQAATRSCFFAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIVL 1075 VPI LIQAATRSCFFAGFAAQRNFAEVIAKGEAQGMVSKSIGI +GI L Sbjct: 285 VPIGAVAGAGRSAATLIQAATRSCFFAGFAAQRNFAEVIAKGEAQGMVSKSIGIAMGIGL 344 Query: 1074 ANAVQSSTPLALASFGVITWVHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSGLVPSVR 895 AN + +STPLALASF V+T++HM+CNLKSYQSIQLRTLNPYRASLVFSEYLLSG P ++ Sbjct: 345 ANCIGTSTPLALASFSVVTFIHMYCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQAPPIK 404 Query: 894 EVNDEEPLFPAFPLLIVKRTSEEQSELLSSDAKDAAAYIDRRLQLGSKLSDVMKNREDAV 715 EVNDE+PLFPA P+L VK ++EQ +LS++AK AAA ID RL LGSKLSDV+ N +D + Sbjct: 405 EVNDEDPLFPAVPVLNVKPVNKEQPAVLSAEAKVAAAEIDNRLLLGSKLSDVVNNHKDVL 464 Query: 714 ALFDLYQSEAYILTELEGRYCVALKESSSPQDMLRSLYQVCYLYWLERNAGIKSSSIVDD 535 ALFDLY++E YILTE GR+CV LKE+ SP DML++++ V YLYWLE+NAGI +S D Sbjct: 465 ALFDLYRNEGYILTEHNGRFCVVLKETCSPHDMLKAMFHVNYLYWLEKNAGIDGASPYLD 524 Query: 534 CRPGGKLQISLEYVKREFNHVKNDSESAGWILDGLIARPLPNRIRLG 394 +PGG+LQISL+YV+REFNHVK D ESAGW DGLIARPLPNRIR G Sbjct: 525 SKPGGRLQISLDYVEREFNHVKIDGESAGWATDGLIARPLPNRIRPG 571 >ref|XP_002875756.1| hypothetical protein ARALYDRAFT_905765 [Arabidopsis lyrata subsp. lyrata] gi|297321594|gb|EFH52015.1| hypothetical protein ARALYDRAFT_905765 [Arabidopsis lyrata subsp. 
lyrata] Length = 613 Score = 630 bits (1625), Expect = e-178 Identities = 322/490 (65%), Positives = 386/490 (78%), Gaps = 10/490 (2%) Frame = -3 Query: 1830 LGCFI---LSSSPARAKTGETD-------DPVYEIKGGKRIAVVPDYSKDEFVVPEKVWF 1681 L CF LS++ A AK ++D + V+E++G KR +VPD+ KDEFV E + Sbjct: 123 LSCFFHFRLSAASAIAKASDSDSSGDTDKETVWEVRGSKRKRLVPDFVKDEFVSEESAFE 182 Query: 1680 WPWSSKDGNLTSSQMTMGDVWTKCRDLTASLLLPEGFPESVTSDYLEYSLWRGVQGVAAQ 1501 SS +T ++ +CR+L LLPEGFP SVTSDYL+YSLWRGVQG+A+Q Sbjct: 183 ----------LSSSLTPENLLAQCRNLLTQFLLPEGFPNSVTSDYLDYSLWRGVQGIASQ 232 Query: 1500 ISGVLATQAMLYAIGLGKGAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLF 1321 +SGVLATQ++LYA+GLGKGAIPTAAA+NWVLKDGIGYLSKIMLSKYGRHFDV+PKGWRLF Sbjct: 233 VSGVLATQSLLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVHPKGWRLF 292 Query: 1320 ADLLENAAFGMEILTPAFPHLFVPIXXXXXXXXXXXALIQAATRSCFFAGFAAQRNFAEV 1141 ADLLENAAFGME+LTP FP FV I ALIQAATRSCF AGFA+QRNFAEV Sbjct: 293 ADLLENAAFGMEMLTPVFPQFFVMIGAAAGAGRSAAALIQAATRSCFNAGFASQRNFAEV 352 Query: 1140 IAKGEAQGMVSKSIGIMLGIVLANAVQSSTPLALASFGVITWVHMFCNLKSYQSIQLRTL 961 IAKGEAQGMVSKS+GI+LGIV+AN + +ST LALA+FGV+T +HM+ NLKSYQ IQLRTL Sbjct: 353 IAKGEAQGMVSKSMGILLGIVVANCIGTSTSLALAAFGVVTTIHMYTNLKSYQCIQLRTL 412 Query: 960 NPYRASLVFSEYLLSGLVPSVREVNDEEPLFPAFPLLIVKRTSEEQSELLSSDAKDAAAY 781 NPYRASLVFSEYL+SG P ++EVNDEEPLFP L +K + Q +LSS+AK AA Sbjct: 413 NPYRASLVFSEYLISGQAPLIKEVNDEEPLFPTVRFLNMKSPEKLQDFVLSSEAKAAAED 472 Query: 780 IDRRLQLGSKLSDVMKNREDAVALFDLYQSEAYILTELEGRYCVALKESSSPQDMLRSLY 601 I+ RLQLGSKLSDV+ N+E+A+ALFDLY++E YILTE GR+CV LKESS+PQDMLRSL+ Sbjct: 473 IEERLQLGSKLSDVIHNKEEAIALFDLYRNEGYILTEHRGRFCVMLKESSTPQDMLRSLF 532 Query: 600 QVCYLYWLERNAGIKSSSIVDDCRPGGKLQISLEYVKREFNHVKNDSESAGWILDGLIAR 421 QV YLYWLE+NAGI+ +S DC+PGG+L ISL+YV+REF H K DS+S GW+ +GLIAR Sbjct: 533 QVNYLYWLEKNAGIEPASTYTDCKPGGRLHISLDYVRREFEHAKEDSQSVGWVTEGLIAR 592 Query: 420 PLPNRIRLGN 391 PLP RIRLG+ Sbjct: 593 PLPTRIRLGH 602 >ref|XP_003612453.1| hypothetical protein MTR_5g025160 [Medicago truncatula] gi|355513788|gb|AES95411.1| hypothetical protein 
MTR_5g025160 [Medicago truncatula] Length = 630 Score = 627 bits (1618), Expect = e-177 Identities = 325/497 (65%), Positives = 385/497 (77%), Gaps = 7/497 (1%) Frame = -3 Query: 1860 YALLVSVAASLGCFIL---SSSPARAKTGETD---DPVYEIKGGKRIAVVPDYSKDEFVV 1699 Y LL ++ S F L + + R+ + E D P+YE+KGG I + PD KD F+ Sbjct: 87 YTLLFTLLFSSVTFCLCQLAMAKTRSLSSEDDILTQPIYEVKGGNLIKLFPDNLKDIFIA 146 Query: 1698 PEKVWFWPWSSKDGNLTSSQMTMGDVWTKCRDLTASLLLPEGFPESVTSDYLEYSLWRGV 1519 F SS L SSQ+ ++ KCR+ L+LPEGFP SVTSDYLEYSLWRGV Sbjct: 147 SNPGLFSELSS----LNSSQVPTF-LYNKCREFVVRLMLPEGFPNSVTSDYLEYSLWRGV 201 Query: 1518 QGVAAQISGVLATQAMLYAIGLGKGAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNP 1339 QGVA Q+SGVLATQA+LYA+GLGKGAIPTAAA+NWVLKDGIGYLSKI+LS +GRHFDVNP Sbjct: 202 QGVACQVSGVLATQALLYAVGLGKGAIPTAAAINWVLKDGIGYLSKILLSDFGRHFDVNP 261 Query: 1338 KGWRLFADLLENAAFGMEILTPAFPHLFVPIXXXXXXXXXXXALIQAATRSCFFAGFAAQ 1159 KGWRLFADLLENAAFG+E+ TPAFPHLFVPI +LIQA+TRSCFFAGFAAQ Sbjct: 262 KGWRLFADLLENAAFGLEMCTPAFPHLFVPIGAFAGASRSAASLIQASTRSCFFAGFAAQ 321 Query: 1158 RNFAEVIAKGEAQGMVSKSIGIMLGIVLANAVQSSTPLALASFGVITWVHMFCNLKSYQS 979 RNFAEVIAKGE QGMVS+ IGI +GI L N + SSTPL LASF V+TWVHM+CNLKSYQS Sbjct: 322 RNFAEVIAKGEVQGMVSRFIGIGIGIGLGNCIGSSTPLVLASFCVVTWVHMYCNLKSYQS 381 Query: 978 IQLRTLNPYRASLVFSEYLLSGLVPSVREVNDEEPLFPAFPLLIVKRTSEE-QSELLSSD 802 IQLRTLNP+RASLVFSEYLLSG P V+EVN EEPLFPA P+L ++E QS +LSS+ Sbjct: 382 IQLRTLNPHRASLVFSEYLLSGQAPPVKEVNAEEPLFPAVPILNAPFANKETQSIVLSSE 441 Query: 801 AKDAAAYIDRRLQLGSKLSDVMKNREDAVALFDLYQSEAYILTELEGRYCVALKESSSPQ 622 AKDAA I+ RLQLGSKLS+++ N+E+ +ALF LY++E YIL+E G++CV LKE+ S Sbjct: 442 AKDAAVEIESRLQLGSKLSEIINNKEEVLALFSLYKNEGYILSEHTGKFCVVLKETCSQL 501 Query: 621 DMLRSLYQVCYLYWLERNAGIKSSSIVDDCRPGGKLQISLEYVKREFNHVKNDSESAGWI 442 DML++L+QV YLYWLE+NAGI+ + DC+PGG+LQISLEY +REFNHV+ND ES GWI Sbjct: 502 DMLKALFQVNYLYWLEKNAGIEGRGTLYDCKPGGRLQISLEYAEREFNHVRNDGESVGWI 561 Query: 441 LDGLIARPLPNRIRLGN 391 DGLIARPLPNR R GN Sbjct: 562 TDGLIARPLPNRCRPGN 578 >ref|XP_006418986.1| hypothetical protein EUTSA_v10002446mg [Eutrema salsugineum] 
gi|557096914|gb|ESQ37422.1| hypothetical protein EUTSA_v10002446mg [Eutrema salsugineum] Length = 611 Score = 625 bits (1611), Expect = e-176 Identities = 332/519 (63%), Positives = 396/519 (76%), Gaps = 12/519 (2%) Frame = -3 Query: 1884 NEELRSVPYALLVSVAASLGCFI---LSSSPARAKTGETD-------DPVYEIKGGKRIA 1735 N + S P L + CF LS++ A AK E+D + V+E++G KR Sbjct: 104 NSDGSSSPLRFLCFLFLVYSCFFQLRLSAAIAIAKAPESDSNGDTEKETVWEVRGSKRKR 163 Query: 1734 VVPDYSKDEFVV-PEKVWFWPWSSKDGNLTSSQMTMGDVWTKCRDLTASLLLPEGFPESV 1558 +VPD+ +DEF V PE+ TSS +T ++ +CR+L LLPEGFP SV Sbjct: 164 LVPDFVRDEFFVSPEET------------TSSPLTPENLLAQCRNLLTQFLLPEGFPNSV 211 Query: 1557 TSDYLEYSLWRGVQGVAAQISGVLATQAMLYAIGLGKGAIPTAAAVNWVLKDGIGYLSKI 1378 TSDYL+YSLWRGVQG+A+QISGVLATQ++LYA+GLGKGAIPTAAA+NWVLKDGIGYLSKI Sbjct: 212 TSDYLDYSLWRGVQGIASQISGVLATQSLLYAVGLGKGAIPTAAAINWVLKDGIGYLSKI 271 Query: 1377 MLSKYGRHFDVNPKGWRLFADLLENAAFGMEILTPAFPHLFVPIXXXXXXXXXXXALIQA 1198 MLSKYGRHFDV+PKGWRLFADLLEN+AFGME+LTP FP FV I ALIQA Sbjct: 272 MLSKYGRHFDVHPKGWRLFADLLENSAFGMEMLTPLFPQFFVLIGAAAGAGRSAAALIQA 331 Query: 1197 ATRSCFFAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIVLANAVQSSTPLALASFGVIT 1018 ATRSCF AGFA+QRNFAEVIAKGEAQGMVSKSIGI+LGIV+AN + +ST LALASFGV+T Sbjct: 332 ATRSCFNAGFASQRNFAEVIAKGEAQGMVSKSIGILLGIVVANCIGTSTSLALASFGVVT 391 Query: 1017 WVHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSGLVPSVREVNDEEPLFPAFPLLIVKR 838 +HM+ NLKSYQ IQLRTLNPYRASLVFSEYL+SG P ++EVNDEEPLFP L +K Sbjct: 392 SIHMYTNLKSYQCIQLRTLNPYRASLVFSEYLISGQAPPIKEVNDEEPLFPTVRSLNIKS 451 Query: 837 TSEEQSELLSSDAKDAAAYIDRRLQLGSKLSDVMKNREDAVALFDLYQSEAYILTELEGR 658 + Q +LSS+AK AAA I+ RLQLGSKLSDV+ N+E+AVALFDLY+ E YILTE GR Sbjct: 452 AEKRQDFVLSSEAKAAAADIEERLQLGSKLSDVVHNKEEAVALFDLYRDEGYILTEHRGR 511 Query: 657 YCVALKESSSPQDMLRSLYQVCYLYWLERNAGIKSSSIVDDCRPGGKLQISLEYVKREFN 478 +CV LKESSSPQDMLRSL+QV YLYWLE+NAGI++S+ DC+PGG+L ISL+YV+REF Sbjct: 512 FCVMLKESSSPQDMLRSLFQVNYLYWLEKNAGIEASNTYLDCKPGGRLHISLDYVRREFE 571 Query: 477 HVKNDSESAGWILDGLIARPLPNRIRLG-NQSTSSAEST 364 K DSE GW+ +GLIARPL RIRL ++ SS+ S+ Sbjct: 572 
LAKEDSELVGWVTEGLIARPLSTRIRLDYDREPSSSPSS 610 >ref|XP_007158055.1| hypothetical protein PHAVU_002G120300g [Phaseolus vulgaris] gi|561031470|gb|ESW30049.1| hypothetical protein PHAVU_002G120300g [Phaseolus vulgaris] Length = 592 Score = 624 bits (1610), Expect = e-176 Identities = 325/521 (62%), Positives = 390/521 (74%), Gaps = 12/521 (2%) Frame = -3 Query: 1890 SSNEELRSVPYALLVSVAASLGCFILSSSPARAKTGETD-------DPVYEIKGGKRIAV 1732 S++ R + +LL S A +L A AKT + +PV+E+KGGK + Sbjct: 82 SNSNSHRILFLSLLCSSAVCFFGHLLLVKLANAKTWSSSSDNELLSEPVWEVKGGKWTRL 141 Query: 1731 VPDYSKDEFVVPEKVWFWPWSSKDGNLTSSQMTMGD-----VWTKCRDLTASLLLPEGFP 1567 VPD + D FV S+ G L Q VW KCRD+ L+LPEGFP Sbjct: 142 VPDPTNDVFV----------SAHPGLLAELQSLKPSQFATFVWLKCRDIFTRLMLPEGFP 191 Query: 1566 ESVTSDYLEYSLWRGVQGVAAQISGVLATQAMLYAIGLGKGAIPTAAAVNWVLKDGIGYL 1387 ESVTSDYLEYSLWR VQGVA Q+SGVLATQ++LYA+GLGKGAIPTAAA+NWVLKDGIGYL Sbjct: 192 ESVTSDYLEYSLWRAVQGVACQVSGVLATQSLLYAVGLGKGAIPTAAAINWVLKDGIGYL 251 Query: 1386 SKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEILTPAFPHLFVPIXXXXXXXXXXXAL 1207 SKIMLS +GRHFDVNPKGWRLFADLLENAAFG+E+ TPAFP FV I +L Sbjct: 252 SKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMCTPAFPQFFVLIGAVAGASRSAASL 311 Query: 1206 IQAATRSCFFAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIVLANAVQSSTPLALASFG 1027 IQA+TRSCFFAGFAAQRNFAEVIAKGE QGM S+ IGI LGI L N + SSTPL LASF Sbjct: 312 IQASTRSCFFAGFAAQRNFAEVIAKGEVQGMASRFIGIGLGIGLGNCIGSSTPLVLASFI 371 Query: 1026 VITWVHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSGLVPSVREVNDEEPLFPAFPLLI 847 V+TW+HM+CNLKSYQSIQLRTLNPYRASLVFSEYLLSG P V++VNDEEPLFPA P+L Sbjct: 372 VLTWIHMYCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQAPPVKDVNDEEPLFPAVPILN 431 Query: 846 VKRTSEEQSELLSSDAKDAAAYIDRRLQLGSKLSDVMKNREDAVALFDLYQSEAYILTEL 667 ++ +S LSS+AKDAAA I+RRLQLGSKLS+++ +ED +ALF LY+ E YIL+E Sbjct: 432 ATFANKARSIALSSEAKDAAAEIERRLQLGSKLSEIVNGKEDVLALFRLYKKEGYILSEH 491 Query: 666 EGRYCVALKESSSPQDMLRSLYQVCYLYWLERNAGIKSSSIVDDCRPGGKLQISLEYVKR 487 G++CV LKE+ S QDML++L+QV YLYWLE+NAGI ++D RPGG+L SL+YV+R Sbjct: 492 MGKFCVVLKENCSQQDMLKALFQVNYLYWLEKNAGIGGRGTLNDSRPGGRLHTSLDYVER 551 
Query: 486 EFNHVKNDSESAGWILDGLIARPLPNRIRLGNQSTSSAEST 364 EFNH+KND ES GW+ DGLIARPLPNRIR+G+ ++S++ S+ Sbjct: 552 EFNHLKNDGESVGWVTDGLIARPLPNRIRIGDTTSSNSVSS 592 >ref|XP_006878573.1| hypothetical protein AMTR_s00011p00244680 [Amborella trichopoda] gi|548831916|gb|ERM94718.1| hypothetical protein AMTR_s00011p00244680 [Amborella trichopoda] Length = 565 Score = 622 bits (1605), Expect = e-175 Identities = 329/523 (62%), Positives = 387/523 (73%), Gaps = 1/523 (0%) Frame = -3 Query: 1962 NDGSNNFFNSDRNYLFLLPSHLIFSSNEELRSVPYALLVSVAA-SLGCFILSSSPARAKT 1786 N+ +NN N++ NY N + + + LL+S + F L+S P Sbjct: 54 NNNNNNGSNNNNNY-----GDSWSDDNNGIPNTSFCLLLSFSLFPNNLFSLASKPGEVVA 108 Query: 1785 GETDDPVYEIKGGKRIAVVPDYSKDEFVVPEKVWFWPWSSKDGNLTSSQMTMGDVWTKCR 1606 +E+KGGK V D SKDE + G L ++ +G W CR Sbjct: 109 -------WEVKGGKWSPVYADSSKDELFADNALRLL----SSGVLDLGKI-LGSSWLWCR 156 Query: 1605 DLTASLLLPEGFPESVTSDYLEYSLWRGVQGVAAQISGVLATQAMLYAIGLGKGAIPTAA 1426 +L L+LPEG+P SV+SDYLEYSLWR VQGVA+QI+GVL TQA+LYA+GLGKGAIPTAA Sbjct: 157 ELAVRLMLPEGYPASVSSDYLEYSLWRAVQGVASQINGVLTTQALLYAVGLGKGAIPTAA 216 Query: 1425 AVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEILTPAFPHLFVPI 1246 AVNWVLKDG+GYLSKI LSKYGRHFDV+PKGWRLFADLLENAA+G+E+LTPA+P FV I Sbjct: 217 AVNWVLKDGLGYLSKIFLSKYGRHFDVHPKGWRLFADLLENAAYGLELLTPAYPQFFVLI 276 Query: 1245 XXXXXXXXXXXALIQAATRSCFFAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIVLANA 1066 ALIQAATRSCFFAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGI LAN Sbjct: 277 GAAAGAGRSAAALIQAATRSCFFAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIALANH 336 Query: 1065 VQSSTPLALASFGVITWVHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSGLVPSVREVN 886 + +S PLA ASFGV+T VHMFCNLKSYQSIQLRTLNPYR SLVFSEYLLSG VP V+EVN Sbjct: 337 IGASGPLAAASFGVVTAVHMFCNLKSYQSIQLRTLNPYRGSLVFSEYLLSGEVPPVKEVN 396 Query: 885 DEEPLFPAFPLLIVKRTSEEQSELLSSDAKDAAAYIDRRLQLGSKLSDVMKNREDAVALF 706 DEEPLF L V QS++LS++AK+AAA I+ RLQLG KLSDV+ +ED +ALF Sbjct: 397 DEEPLFSGSSFLKVVPVQHAQSQVLSAEAKEAAAQIESRLQLGCKLSDVVSKKEDVLALF 456 Query: 705 DLYQSEAYILTELEGRYCVALKESSSPQDMLRSLYQVCYLYWLERNAGIKSSSIVDDCRP 526 DL++ E YILTE +G+YCV LKE 
SPQDML+SL+QV YLYWLERNAGI S S DC+P Sbjct: 457 DLFEKEGYILTEQKGKYCVVLKEDYSPQDMLKSLFQVSYLYWLERNAGIDSRSASTDCKP 516 Query: 525 GGKLQISLEYVKREFNHVKNDSESAGWILDGLIARPLPNRIRL 397 GGK+Q+S +YV+REFNHVKNDS++AGWI DGLIARPLP R+R+ Sbjct: 517 GGKMQLSYDYVQREFNHVKNDSQAAGWITDGLIARPLPCRVRV 559