BLASTX nr result
ID: Ophiopogon21_contig00006111
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Ophiopogon21_contig00006111 (2689 letters)

Database: ./nr
77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done

                                                                      Score     E
Sequences producing significant alignments:                           (bits)  Value

ref|XP_008802020.1| PREDICTED: large proline-rich protein BAG6 i...   601   e-168
ref|XP_008802019.1| PREDICTED: large proline-rich protein BAG6 i...   598   e-168
ref|XP_010910594.1| PREDICTED: uncharacterized protein LOC105036...   594   e-166
ref|XP_008802021.1| PREDICTED: uncharacterized protein LOC103715...   567   e-158
ref|XP_009394532.1| PREDICTED: large proline-rich protein BAG6 i...   524   e-145
ref|XP_010244741.1| PREDICTED: large proline-rich protein BAG6-l...   459   e-126
ref|XP_010244742.1| PREDICTED: uncharacterized protein LOC104588...   453   e-124
ref|XP_010913366.1| PREDICTED: large proline-rich protein BAG6, ...   435   e-124
ref|XP_007013655.1| Ubiquitin-like superfamily protein, putative...   452   e-124
ref|XP_007013657.1| Ubiquitin-like superfamily protein, putative...   451   e-123
ref|XP_007013661.1| Ubiquitin-like superfamily protein, putative...   450   e-123
ref|XP_007013658.1| Ubiquitin-like superfamily protein, putative...   449   e-123
ref|XP_010244866.1| PREDICTED: large proline-rich protein BAG6-l...   448   e-122
ref|XP_002283083.2| PREDICTED: large proline-rich protein bag6-A...   447   e-122
ref|XP_007013659.1| Ubiquitin-like superfamily protein, putative...   447   e-122
gb|KJB64610.1| hypothetical protein B456_010G057100 [Gossypium r...   437   e-119
ref|XP_007013662.1| Ubiquitin-like superfamily protein, putative...   434   e-118
ref|XP_007013656.1| Ubiquitin-like superfamily protein, putative...
434 e-118 gb|KJB64613.1| hypothetical protein B456_010G057100 [Gossypium r... 431 e-117 gb|KJB64612.1| hypothetical protein B456_010G057100 [Gossypium r... 431 e-117 >ref|XP_008802020.1| PREDICTED: large proline-rich protein BAG6 isoform X2 [Phoenix dactylifera] Length = 745 Score = 601 bits (1550), Expect = e-168 Identities = 370/774 (47%), Positives = 468/774 (60%), Gaps = 69/774 (8%) Frame = -3 Query: 2528 MASNEAAKVTPIG-NVVEDSTTTVEIKIKTLDSQTYTLRVDKFVPVPALKEQIASVTGVL 2352 M +N A++VT +V EDS TTVEIKIKTLDS TYTLRV+K VPV LKEQIA+VTGV+ Sbjct: 1 METNGASEVTTSHVDVTEDSETTVEIKIKTLDSHTYTLRVNKCVPVLMLKEQIATVTGVV 60 Query: 2351 SEQQRLICRGRVLKDDQLLSAYHVEDGHTLHLVARXXXXXXXXXTN-QMESEAASGHPAT 2175 SEQQRLICRG+VLKDD+LLSAYHVEDGHTLHLV R + SE AS +PA Sbjct: 61 SEQQRLICRGKVLKDDELLSAYHVEDGHTLHLVVRQPHQSSSSPPTGHVGSEGASANPAA 120 Query: 2174 SSSGG-AHIR-SPIAHSFVFESVNVDQGDMDTSVIGRIISSMLDSIGATSSGTGNTSDLR 2001 +SS AH R S +A S VFE+VN+DQGD TS + R ISS+L SIG T++ N R Sbjct: 121 NSSSSIAHNRGSHVARSIVFEAVNIDQGDNRTSHLSRFISSILSSIGTTNTAFQNP---R 177 Query: 2000 EIISERLGRAASDGGLSDSTQPHPNPTNQQRE--RRQADVRFSSTVPLGPQNPTVIPDAL 1827 + E GR + D GL D Q +PNP++ + E +Q+ +RF S PLG Q P VIPD+L Sbjct: 178 NDLRETGGRTSGDTGLPDPVQSNPNPSSSRVEVDSQQSPLRFQSVFPLGSQQPIVIPDSL 237 Query: 1826 TTLSQYIGLMRDEFRREGFGSNASEQINNSDAADVRQ---GRSNASNSPLAQVGLPTPAS 1656 TT++QY+G++RD+FRREG +N EQ N++ AA + + + P Q GLP+PAS Sbjct: 238 TTMNQYLGVIRDDFRREGISTNGREQTNDAAAAGMNSIDVQNHDFLSPPPRQGGLPSPAS 297 Query: 1655 LAEILLSARQLLMEQIEDCLSQLAGQLGDHTSITDPLTRMNMQSSAMRSGLLMRNLGSMF 1476 LAEI+LS RQLLM+Q CLSQLAG L DH S+TDPLTRM++QSSA+RSG+++RNLGS+ Sbjct: 298 LAEIVLSTRQLLMDQAGGCLSQLAGHLEDHVSVTDPLTRMDLQSSAIRSGVILRNLGSLL 357 Query: 1475 LELGRATMTLRMGQSPAEAVVNAGPANFISANGPNPLMVQAVPFHPGSSPGVTNMGSVNS 1296 LELGR TMTLRMGQ+P EAVVNAGPA FISA+GPNPLMVQ VPF PGSS G T Sbjct: 358 LELGRTTMTLRMGQTPLEAVVNAGPAVFISASGPNPLMVQPVPFFPGSSFGGTPF----- 412 Query: 1295 GHGLEGEPPEAMFIPRNIEIRIRT-GRAAPVAATNASEQAPGQQEQEQMDPARNSTGVNV 1119 GEPP + FIPRN++IR+RT GRA PV N EQA Q QEQ DP RN + N Sbjct: 413 
-----GEPPASAFIPRNVDIRVRTGGRAVPVTNANLGEQAGAQPPQEQTDPTRNPSTANS 467 Query: 1118 VHQTFPG--------GDSGVRVMPLRTVVAXXXXXXXXXXXXXXXXVRLFYPLLARVRQA 963 V+Q F G G+SGVRV+P+RTVVA + YPLLAR++ Sbjct: 468 VNQAFSGISSTTSFAGESGVRVVPIRTVVAVPTGHSPSDSSGSAVG--VIYPLLARIQHV 525 Query: 962 STGTASDTRGSQASGQLNQGAPEVGRLLLHQSPVQRENLEPNGGGVAGDDSHFPANAVPF 783 ++ A++ RGS+AS + NQ P + + L +S +Q ++ PAN+ P Sbjct: 526 NSANANNARGSRASNEPNQSHPNINQPNL-ESAMQNQS---------------PANSAPV 569 Query: 782 VSGSSPPANDSTAYQGVSAVFDSSQQNPLNNDDESRTQANDD------------------ 657 VS +P AN+ +YQG S V +SQQ P + ES TQA+ + Sbjct: 570 VSWINPSANELPSYQGSSPVSIASQQAPPASSSESNTQAHVEPQVGQGPMSQFLNRVDEW 629 Query: 656 ---------------------------------AARGTESHDAARVDSEQGVLFSNVLRH 576 AR TE+ +A+RV ++ GV FS+++R Sbjct: 630 LRTALLSGEQVQGGGTSHQESVTGSAAVQNQTGTARNTETQEASRVGNDDGVFFSSLVRE 689 Query: 575 IMPLISQGREHGVSSTIPNSSTAXXXXXXXXXXXXXQHQRDPPEAPSPKRTRRD 414 +MP +SQ S + STA QH RDPPEAPSPKR RR+ Sbjct: 690 LMPFLSQRTTVPGGSASNDGSTAQTASHHLNDSSSSQHHRDPPEAPSPKRPRRN 743 >ref|XP_008802019.1| PREDICTED: large proline-rich protein BAG6 isoform X1 [Phoenix dactylifera] Length = 746 Score = 598 bits (1543), Expect = e-168 Identities = 371/775 (47%), Positives = 469/775 (60%), Gaps = 70/775 (9%) Frame = -3 Query: 2528 MASNEAAKVTPIG-NVVEDSTTTVEIKIKTLDSQTYTLRVDKFVPVPALKEQIASVTGVL 2352 M +N A++VT +V EDS TTVEIKIKTLDS TYTLRV+K VPV LKEQIA+VTGV+ Sbjct: 1 METNGASEVTTSHVDVTEDSETTVEIKIKTLDSHTYTLRVNKCVPVLMLKEQIATVTGVV 60 Query: 2351 SEQQRLICRGRVLKDDQLLSAYHVEDGHTLHLVARXXXXXXXXXTN-QMESEAASGHPAT 2175 SEQQRLICRG+VLKDD+LLSAYHVEDGHTLHLV R + SE AS +PA Sbjct: 61 SEQQRLICRGKVLKDDELLSAYHVEDGHTLHLVVRQPHQSSSSPPTGHVGSEGASANPAA 120 Query: 2174 SSSGG-AHIR-SPIAHSFVFESVNVDQGDMDTSVIGRIISSMLDSIGATSSGTGNTSDLR 2001 +SS AH R S +A S VFE+VN+DQGD TS + R ISS+L SIG T++ N R Sbjct: 121 NSSSSIAHNRGSHVARSIVFEAVNIDQGDNRTSHLSRFISSILSSIGTTNTAFQNP---R 177 Query: 2000 EIISERLGRAASDGGLSDSTQPHPNPTNQQRE--RRQADVRFSSTVPLGPQNPTVIPDAL 1827 + E GR + D GL D Q +PNP++ + E +Q+ +RF S PLG Q P VIPD+L Sbjct: 178 
NDLRETGGRTSGDTGLPDPVQSNPNPSSSRVEVDSQQSPLRFQSVFPLGSQQPIVIPDSL 237 Query: 1826 TTLSQYIGLMRDEFRREGFGSNAS-EQINNSDAADVRQ---GRSNASNSPLAQVGLPTPA 1659 TT++QY+G++RD+FRREG +NA EQ N++ AA + + + P Q GLP+PA Sbjct: 238 TTMNQYLGVIRDDFRREGISTNAGREQTNDAAAAGMNSIDVQNHDFLSPPPRQGGLPSPA 297 Query: 1658 SLAEILLSARQLLMEQIEDCLSQLAGQLGDHTSITDPLTRMNMQSSAMRSGLLMRNLGSM 1479 SLAEI+LS RQLLM+Q CLSQLAG L DH S+TDPLTRM++QSSA+RSG+++RNLGS+ Sbjct: 298 SLAEIVLSTRQLLMDQAGGCLSQLAGHLEDHVSVTDPLTRMDLQSSAIRSGVILRNLGSL 357 Query: 1478 FLELGRATMTLRMGQSPAEAVVNAGPANFISANGPNPLMVQAVPFHPGSSPGVTNMGSVN 1299 LELGR TMTLRMGQ+P EAVVNAGPA FISA+GPNPLMVQ VPF PGSS G T Sbjct: 358 LLELGRTTMTLRMGQTPLEAVVNAGPAVFISASGPNPLMVQPVPFFPGSSFGGTPF---- 413 Query: 1298 SGHGLEGEPPEAMFIPRNIEIRIRT-GRAAPVAATNASEQAPGQQEQEQMDPARNSTGVN 1122 GEPP + FIPRN++IR+RT GRA PV N EQA Q QEQ DP RN + N Sbjct: 414 ------GEPPASAFIPRNVDIRVRTGGRAVPVTNANLGEQAGAQPPQEQTDPTRNPSTAN 467 Query: 1121 VVHQTFPG--------GDSGVRVMPLRTVVAXXXXXXXXXXXXXXXXVRLFYPLLARVRQ 966 V+Q F G G+SGVRV+P+RTVVA + YPLLAR++ Sbjct: 468 SVNQAFSGISSTTSFAGESGVRVVPIRTVVAVPTGHSPSDSSGSAVG--VIYPLLARIQH 525 Query: 965 ASTGTASDTRGSQASGQLNQGAPEVGRLLLHQSPVQRENLEPNGGGVAGDDSHFPANAVP 786 ++ A++ RGS+AS + NQ P + + L +S +Q ++ PAN+ P Sbjct: 526 VNSANANNARGSRASNEPNQSHPNINQPNL-ESAMQNQS---------------PANSAP 569 Query: 785 FVSGSSPPANDSTAYQGVSAVFDSSQQNPLNNDDESRTQANDD----------------- 657 VS +P AN+ +YQG S V +SQQ P + ES TQA+ + Sbjct: 570 VVSWINPSANELPSYQGSSPVSIASQQAPPASSSESNTQAHVEPQVGQGPMSQFLNRVDE 629 Query: 656 ----------------------------------AARGTESHDAARVDSEQGVLFSNVLR 579 AR TE+ +A+RV ++ GV FS+++R Sbjct: 630 WLRTALLSGEQVQGGGTSHQESVTGSAAVQNQTGTARNTETQEASRVGNDDGVFFSSLVR 689 Query: 578 HIMPLISQGREHGVSSTIPNSSTAXXXXXXXXXXXXXQHQRDPPEAPSPKRTRRD 414 +MP +SQ S + STA QH RDPPEAPSPKR RR+ Sbjct: 690 ELMPFLSQRTTVPGGSASNDGSTAQTASHHLNDSSSSQHHRDPPEAPSPKRPRRN 744 >ref|XP_010910594.1| PREDICTED: uncharacterized protein LOC105036531 [Elaeis guineensis] Length = 745 Score = 594 bits (1531), Expect = e-166 Identities = 374/774 (48%), 
Positives = 464/774 (59%), Gaps = 71/774 (9%) Frame = -3 Query: 2528 MASNEAAKVTPI-GNVVEDSTTTVEIKIKTLDSQTYTLRVDKFVPVPALKEQIASVTGVL 2352 M +N A +T G+V EDS TTVEIKIKTLDSQTYTLRV+K VP+ LKEQIA+VTGV+ Sbjct: 1 MGTNGARDITTSHGDVTEDSETTVEIKIKTLDSQTYTLRVNKCVPILMLKEQIATVTGVV 60 Query: 2351 SEQQRLICRGRVLKDDQLLSAYHVEDGHTLHLVARXXXXXXXXXTN-QMESEAASGHPAT 2175 SEQQRLICRG+VLKDD+LLSAYHVEDGHTLHLV R + + SE AS +PA Sbjct: 61 SEQQRLICRGKVLKDDELLSAYHVEDGHTLHLVVRQPHQSSSSPSTGHVGSEGASANPAA 120 Query: 2174 -SSSGGAHIR-SPIAHSFVFESVNVDQGDMDTSVIGRIISSMLDSIGATSSGTGNT-SDL 2004 SSS AH R S +A S VFE+VN+DQGD TS +GRIISS+L SIG T++ N +DL Sbjct: 121 NSSSSTAHNRGSHVARSIVFEAVNIDQGDNRTSHLGRIISSLLSSIGTTNTAFQNPRNDL 180 Query: 2003 REIISERLGRAASDGGLSDSTQPHPNPTNQQRE--RRQADVRFSSTVPLGPQNPTVIPDA 1830 RE + GR + D GLSD+ Q +PNP + E +Q +RF S PLG Q P VIPD+ Sbjct: 181 RETV----GRTSGDTGLSDAMQSNPNPPTSRVELDSQQGPLRFQSVFPLGSQQPIVIPDS 236 Query: 1829 LTTLSQYIGLMRDEFRREGFGSNASEQINNSDAADVR----QGRSNASNSPLAQVGLPTP 1662 LTT++QY+G++RD+FRREG EQ N++ AA + Q S P Q GLP+P Sbjct: 237 LTTMNQYLGVIRDDFRREGLSIYGREQTNDAAAAGMNGNDVQNHDFLSPLPSRQGGLPSP 296 Query: 1661 ASLAEILLSARQLLMEQIEDCLSQLAGQLGDHTSITDPLTRMNMQSSAMRSGLLMRNLGS 1482 ASLAEI+LS RQLLM+Q CLSQLA +L DH S+TDPL RM++QSSA+RSG+L+RNLGS Sbjct: 297 ASLAEIVLSTRQLLMDQAGGCLSQLARRLDDHVSVTDPLMRMDLQSSAIRSGVLLRNLGS 356 Query: 1481 MFLELGRATMTLRMGQSPAEAVVNAGPANFISANGPNPLMVQAVPFHPGSSPGVTNMGSV 1302 + LELGR TMTL MGQ+P EAVVNAGPA FISA+GPNP+MVQ VPF PGSS G Sbjct: 357 LLLELGRTTMTLHMGQTPLEAVVNAGPAVFISASGPNPVMVQPVPFFPGSSFGGAQF--- 413 Query: 1301 NSGHGLEGEPPEAMFIPRNIEIRIRT-GRAAPVAATNASEQAPGQQEQEQMDPARNSTGV 1125 GEP + FIPRNI+IR+RT GR PV N EQA QQ EQ DP RN + Sbjct: 414 -------GEPLASAFIPRNIDIRVRTGGRTIPVTNANLGEQAGAQQPLEQTDPTRNPSAA 466 Query: 1124 NVVHQTFPG--------GDSGVRVMPLRTVVAXXXXXXXXXXXXXXXXVRLFYPLLARVR 969 N V+Q F G G+SGVRV+P+RTVVA + YPLLAR++ Sbjct: 467 NSVNQAFSGIPSSTSFAGESGVRVVPVRTVVAVPAGHSPSDSSGSAIG--VIYPLLARIQ 524 Query: 968 QASTGTASDTRGSQASGQLNQGAPEVGRLLLHQSPVQRENLEPNGGGVAGDDSHFPANAV 789 ++G A++ RGS+AS + NQ P + NL PN 
+ S PA++ Sbjct: 525 HVNSGNANNARGSRASNESNQSQPNI-------------NL-PNLESAMRNQS--PASSA 568 Query: 788 PFVSGSSPPANDSTAYQGVSAVFDSSQQNPLNNDDESRTQAN-----------------D 660 P VS +P AN+ YQG S V +SQQ P ++ ES TQA+ D Sbjct: 569 PVVSWMNPSANELPGYQGSSLVSITSQQAPPASNSESNTQAHVGQQVGQGSMSQLLSRVD 628 Query: 659 DAAR----------------------------------GTESHDAARVDSEQGVLFSNVL 582 + R TE+H+A+RV S+ GV FS+++ Sbjct: 629 EWIRTALFPGEQVQVGGTGHQESVTGSVAVQNQTGTTGNTETHEASRVGSDDGVFFSSLV 688 Query: 581 RHIMPLISQGREHGVSSTIPNSSTAXXXXXXXXXXXXXQHQRDPPEAPSPKRTR 420 R +MP +S+ S + STA QH RDPPEAPSPKR R Sbjct: 689 RQLMPFLSEHTTVPGGSASNDGSTAQTASNHLNDSSSSQHHRDPPEAPSPKRPR 742 >ref|XP_008802021.1| PREDICTED: uncharacterized protein LOC103715983 isoform X3 [Phoenix dactylifera] Length = 721 Score = 567 bits (1462), Expect = e-158 Identities = 358/774 (46%), Positives = 456/774 (58%), Gaps = 69/774 (8%) Frame = -3 Query: 2528 MASNEAAKVTPIG-NVVEDSTTTVEIKIKTLDSQTYTLRVDKFVPVPALKEQIASVTGVL 2352 M +N A++VT +V EDS TTVEIKIKTLDS TYTLRV+K VPV LKEQIA+VTGV+ Sbjct: 1 METNGASEVTTSHVDVTEDSETTVEIKIKTLDSHTYTLRVNKCVPVLMLKEQIATVTGVV 60 Query: 2351 SEQQRLICRGRVLKDDQLLSAYHVEDGHTLHLVARXXXXXXXXXTN-QMESEAASGHPAT 2175 SEQQRLICRG+VLKDD+LLSAYHVEDGHTLHLV R + SE AS +PA Sbjct: 61 SEQQRLICRGKVLKDDELLSAYHVEDGHTLHLVVRQPHQSSSSPPTGHVGSEGASANPAA 120 Query: 2174 SSSGG-AHIR-SPIAHSFVFESVNVDQGDMDTSVIGRIISSMLDSIGATSSGTGNTSDLR 2001 +SS AH R S +A S VFE+VN+DQGD TS + R ISS+L SIG T++ N R Sbjct: 121 NSSSSIAHNRGSHVARSIVFEAVNIDQGDNRTSHLSRFISSILSSIGTTNTAFQNP---R 177 Query: 2000 EIISERLGRAASDGGLSDSTQPHPNPTNQQRE--RRQADVRFSSTVPLGPQNPTVIPDAL 1827 + E GR + D GL D Q +PNP++ + E +Q+ +RF S PLG Q P VIPD+L Sbjct: 178 NDLRETGGRTSGDTGLPDPVQSNPNPSSSRVEVDSQQSPLRFQSVFPLGSQQPIVIPDSL 237 Query: 1826 TTLSQYIGLMRDEFRREGFGSNAS-EQINNSDAADVRQ---GRSNASNSPLAQVGLPTPA 1659 TT++QY+G++RD+FRREG +NA EQ N++ AA + + + P Q GLP+PA Sbjct: 238 TTMNQYLGVIRDDFRREGISTNAGREQTNDAAAAGMNSIDVQNHDFLSPPPRQGGLPSPA 297 Query: 1658 SLAEILLSARQLLMEQIEDCLSQLAGQLGDHTSITDPLTRMNMQSSAMRSGLLMRNLGSM 1479 SLAEI+LS RQLLM+Q 
CLSQLAG L DH S+TDPLTRM++QSSA+RSG+++RNLGS+ Sbjct: 298 SLAEIVLSTRQLLMDQAGGCLSQLAGHLEDHVSVTDPLTRMDLQSSAIRSGVILRNLGSL 357 Query: 1478 FLELGRATMTLRMGQSPAEAVVNAGPANFISANGPNPLMVQAVPFHPGSSPGVTNMGSVN 1299 LELGR TMTLRMGQ+P EAVVNAGPA FISA+GPNPLMVQ VPF PGSS G T Sbjct: 358 LLELGRTTMTLRMGQTPLEAVVNAGPAVFISASGPNPLMVQPVPFFPGSSFGGTPF---- 413 Query: 1298 SGHGLEGEPPEAMFIPRNIEIRIRTGRAAPVAATNASEQAPGQQEQEQMDPARNSTGVNV 1119 GEPP + FIPRN++IR+RT DP RN + N Sbjct: 414 ------GEPPASAFIPRNVDIRVRT------------------------DPTRNPSTANS 443 Query: 1118 VHQTFPG--------GDSGVRVMPLRTVVAXXXXXXXXXXXXXXXXVRLFYPLLARVRQA 963 V+Q F G G+SGVRV+P+RTVVA + YPLLAR++ Sbjct: 444 VNQAFSGISSTTSFAGESGVRVVPIRTVVAVPTGHSPSDSSGSAVG--VIYPLLARIQHV 501 Query: 962 STGTASDTRGSQASGQLNQGAPEVGRLLLHQSPVQRENLEPNGGGVAGDDSHFPANAVPF 783 ++ A++ RGS+AS + NQ P + + L +S +Q ++ PAN+ P Sbjct: 502 NSANANNARGSRASNEPNQSHPNINQPNL-ESAMQNQS---------------PANSAPV 545 Query: 782 VSGSSPPANDSTAYQGVSAVFDSSQQNPLNNDDESRTQANDD------------------ 657 VS +P AN+ +YQG S V +SQQ P + ES TQA+ + Sbjct: 546 VSWINPSANELPSYQGSSPVSIASQQAPPASSSESNTQAHVEPQVGQGPMSQFLNRVDEW 605 Query: 656 ---------------------------------AARGTESHDAARVDSEQGVLFSNVLRH 576 AR TE+ +A+RV ++ GV FS+++R Sbjct: 606 LRTALLSGEQVQGGGTSHQESVTGSAAVQNQTGTARNTETQEASRVGNDDGVFFSSLVRE 665 Query: 575 IMPLISQGREHGVSSTIPNSSTAXXXXXXXXXXXXXQHQRDPPEAPSPKRTRRD 414 +MP +SQ S + STA QH RDPPEAPSPKR RR+ Sbjct: 666 LMPFLSQRTTVPGGSASNDGSTAQTASHHLNDSSSSQHHRDPPEAPSPKRPRRN 719 >ref|XP_009394532.1| PREDICTED: large proline-rich protein BAG6 isoform X1 [Musa acuminata subsp. 
malaccensis] Length = 737 Score = 524 bits (1349), Expect = e-145 Identities = 328/761 (43%), Positives = 432/761 (56%), Gaps = 56/761 (7%) Frame = -3 Query: 2528 MASNEAAKVTPIG-NVVEDSTTTVEIKIKTLDSQTYTLRVDKFVPVPALKEQIASVTGVL 2352 M +N+ ++ T +V +DS +TVEIKIKTLDSQTYTLRVDK VP+P LKEQIA+VTGV+ Sbjct: 1 MGTNDPSEATTSCIDVAQDSESTVEIKIKTLDSQTYTLRVDKSVPIPKLKEQIATVTGVI 60 Query: 2351 SEQQRLICRGRVLKDDQLLSAYHVEDGHTLHLVARXXXXXXXXXTN-QMESEAASGHPAT 2175 SEQQRLICRG+VLKDD++LSAYHVEDGHTLHLV R + QM E ASG Sbjct: 61 SEQQRLICRGKVLKDDEILSAYHVEDGHTLHLVVRQPHQSTPSPSTGQMGHEGASGQSDA 120 Query: 2174 SSSGGAHIRSPIAHSFVFESVNVDQGDMDTSVIGRIISSMLDSIGATSSGTGNTS-DLRE 1998 S + G S + VFE+VN+ QGD T + +IISS+L+++ ++G+ + +LR Sbjct: 121 SRNHG----SQSTRTLVFETVNIGQGDHRTQ-LSQIISSILNAVATRNTGSQTSGPNLRN 175 Query: 1997 IISERLGRAASDGGLSDSTQPHPNPTNQQRERRQADVRFSSTVPLGPQNPTVIPDALTTL 1818 + + G + G+ + PN +F S VPL + PTVIPD+LTT+ Sbjct: 176 LSA---GASVDYPGIELGSGQVPN-------------QFHSAVPLVSEQPTVIPDSLTTI 219 Query: 1817 SQYIGLMRDEFRREGFGSNASEQINNSDAADVRQGRSNASNSPLAQVGLPTPASLAEILL 1638 QY+G MRDEF REG +N E N + AA + S + GLP+PASL E+LL Sbjct: 220 HQYLGFMRDEFTREGLSANGGEHRNEASAAYMNNDSLQFHQS-FSPGGLPSPASLVEVLL 278 Query: 1637 SARQLLMEQIEDCLSQLAGQLGDHTSITDPLTRMNMQSSAMRSGLLMRNLGSMFLELGRA 1458 S RQLLMEQ + +SQ A L D ++TDPL R+ +Q+S RSG+L++NLGS+ LELGR Sbjct: 279 STRQLLMEQADGYISQFARGLEDQVNLTDPLVRLRLQNSVFRSGVLLQNLGSLLLELGRT 338 Query: 1457 TMTLRMGQSPAEAVVNAGPANFISANGPNPLMVQAVPFHPGSSPGVTNMGSVNSGHGLEG 1278 TMTLR+GQ+P+EAV+NAGPA FISA+GPNP+MVQ VPF+PGSS +G+ +GHG +G Sbjct: 339 TMTLRLGQTPSEAVINAGPAVFISASGPNPVMVQPVPFYPGSSFS-PRVGATYAGHGSQG 397 Query: 1277 EPPEAMFIPRNIEIRIRTGRAAPVAATNASEQAPGQQEQEQMDPARNSTGVNVVHQTFPG 1098 EP +P NI IR R GR PV+ N +EQ GQQ+QE +P RNS+ N Q F G Sbjct: 398 EPLGPSLVPGNISIRFRAGRPVPVSPHNQTEQG-GQQQQETTNPTRNSSAANAAPQAFSG 456 Query: 1097 --------GDSGVRVMPLRTVVAXXXXXXXXXXXXXXXXV-RLFYPLLARVRQASTGTAS 945 +SGVRV+P+RTVVA L YPLLARV+ +TG+ Sbjct: 457 VSNNASLSEESGVRVLPIRTVVAVPGGVNRSTSDPSGSSAVGLIYPLLARVQHVATGSLD 516 Query: 944 
DTRGSQASGQLNQGAPEVGRLLLHQSPVQRENLEPNGGGVAGDDSHFPANAVPFVSGSSP 765 D RG+++S ++N S + +NLE G D PANAVP S +P Sbjct: 517 DARGTESSNEINHDGHNAEEQANIGSTMHAQNLESTIGNFINDIDSTPANAVPLFSEFNP 576 Query: 764 PANDSTAYQGVSAVFDSS-QQNPLNNDDESRTQA-------------------------- 666 N+S +YQG F S+ QQ P +++ S T+ Sbjct: 577 SVNESASYQGSLRDFISAGQQGPPSSNSTSNTEELGHISQLASRLDQWLQSIFPGEQVVV 636 Query: 665 ---------------NDDAARGTESHDAARVDSEQGVLFSNVLRHIMPLISQGREHGV-- 537 D R ++ + V ++GV FS ++R++MP ISQ G Sbjct: 637 GSSSHQEMTRSSVTDQTDIGRNSQPEEHTGVGEDEGVFFSRLVRNLMPFISQATSAGQDG 696 Query: 536 SSTIPNSSTAXXXXXXXXXXXXXQHQRDPPEAPSPKRTRRD 414 S T SSTA Q +RDPPEAPS KRTRRD Sbjct: 697 SPTSHGSSTAHVAGENLNDLSNSQSRRDPPEAPSSKRTRRD 737 >ref|XP_010244741.1| PREDICTED: large proline-rich protein BAG6-like isoform X1 [Nelumbo nucifera] Length = 684 Score = 459 bits (1182), Expect = e-126 Identities = 309/674 (45%), Positives = 391/674 (58%), Gaps = 16/674 (2%) Frame = -3 Query: 2528 MASNEAAKVTPIGNV-VEDSTTTVEIKIKTLDSQTYTLRVDKFVPVPALKEQIASVTGVL 2352 M +N+A +V G E S TVEIKIKTLDSQTYTLRV+K VPVPALKEQIA+VTGVL Sbjct: 1 MGTNDANEVLISGGAEAECSEATVEIKIKTLDSQTYTLRVNKCVPVPALKEQIATVTGVL 60 Query: 2351 SEQQRLICRGRVLKDDQLLSAYHVEDGHTLHLVARXXXXXXXXXT-NQMESEAASGH--- 2184 SEQQRLICRG+VLKDDQLLSAYHVEDGHTLHLV R T M SE H Sbjct: 61 SEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPVPPPSASTMGFMGSEGLHDHSAS 120 Query: 2183 -PATSSSGGAHIRSPIAHSFVFESVNV-DQGDMDTSVIGRIISSMLDSIGATSSGTGNTS 2010 P T++S G + +AHS V + N+ DQGD I RI+S++L SI T+ Sbjct: 121 EPTTNASHGQG--NQVAHSVVLGTFNIADQGDGTLPDINRIVSAVLGSI-LTNGSNSEGG 177 Query: 2009 DLREIISERLGRAASDGGLSDSTQPHPNP--TNQQRERRQADVRFSSTVPLGPQNPTVIP 1836 + RE SER+ R DS +P P+ Q + RF S V LGP P VIP Sbjct: 178 NRRESGSERIDRTIGASVPHDSMRPQPSQPAAGVQSDPLHGAFRFPSAVSLGPLQPPVIP 237 Query: 1835 DALTTLSQYIGLMRDEFRREGFGSNASEQINNSDAADVR----QGRSNASNSPLAQVGLP 1668 D+L TLSQY+ MR EF N NN+ A V Q +A +S Q GLP Sbjct: 238 DSLATLSQYLTRMRHEFHVIARSYN-----NNTQPAGVTGNEGQEHDSAPHSSAGQAGLP 292 Query: 1667 TPASLAEILLSARQLLMEQIEDCLSQLAGQLGDHTSITDPLTRMNMQSSAMRSGLLMRNL 1488 
TPASLAE++ S R +L+EQ +CL QL QL ++TDPL RM +QS+AMRSG++++NL Sbjct: 293 TPASLAEVIHSTRHMLIEQAGECLYQLTRQLEGQANMTDPLMRMTVQSNAMRSGVILQNL 352 Query: 1487 GSMFLELGRATMTLRMGQSPAEAVVNAGPANFISANGPNPLMVQAVPFHPGSSPGVTNMG 1308 G++ LELGRATMTLRMG++P+EAVVNAGP+ FIS +GPNP+MVQ +PF PG+S G MG Sbjct: 353 GALLLELGRATMTLRMGRTPSEAVVNAGPSVFISTSGPNPIMVQPLPFQPGTSFGAVPMG 412 Query: 1307 SVNSGHGLEGEPPEAMFIPRNIEIRIR--TGRAAPVAATNASEQAPGQQEQEQMDPARNS 1134 +V+SG L G + FIPRNI+IRIR TG + P A N EQA QQ Q +PAR Sbjct: 413 AVHSGSSLVGSTLASGFIPRNIDIRIRTVTGSSIPTANVNQGEQAGVQQPPGQTNPARPV 472 Query: 1133 TGVNVVHQTFPGGDSGVRVMPLRTVV-AXXXXXXXXXXXXXXXXVRLFYPLLARVRQAST 957 +F G+SGVRV+P+RTVV A + LFYPLLARV+ ++ Sbjct: 473 LAGAAGAHSFT-GESGVRVVPIRTVVAAVPAPVNRPPSDPSGSSLGLFYPLLARVQHVTS 531 Query: 956 GTASDTRGSQASGQLNQGAPEVGRLLLHQSPVQRENLEPNGGGVAGDDSHFPANAVPFVS 777 G S RGSQ + + PE + +S VQ +N+ N A D + S Sbjct: 532 GHFSSPRGSQVASERPPSVPETEQRPSPESAVQHQNIGLNHDASARDVN----------S 581 Query: 776 GSSPPANDSTAYQGVSAVFDSSQQNPLNNDDESRTQANDDAARGTESHDAARVDSEQGVL 597 ++ P ND QG++ N ++R DAA G V S+ G+ Sbjct: 582 TNTTPQND----QGIT-----------RNAADAR-----DAASG--------VGSDDGIF 613 Query: 596 FSNVLRHIMPLISQ 555 SN+LR ++P+ISQ Sbjct: 614 LSNLLRQVIPVISQ 627 >ref|XP_010244742.1| PREDICTED: uncharacterized protein LOC104588496 isoform X2 [Nelumbo nucifera] Length = 676 Score = 453 bits (1166), Expect = e-124 Identities = 307/672 (45%), Positives = 388/672 (57%), Gaps = 14/672 (2%) Frame = -3 Query: 2528 MASNEAAKVTPIGNV-VEDSTTTVEIKIKTLDSQTYTLRVDKFVPVPALKEQIASVTGVL 2352 M +N+A +V G E S TVEIKIKTLDSQTYTLRV+K VPVPALKEQIA+VTGVL Sbjct: 1 MGTNDANEVLISGGAEAECSEATVEIKIKTLDSQTYTLRVNKCVPVPALKEQIATVTGVL 60 Query: 2351 SEQQRLICRGRVLKDDQLLSAYHVEDGHTLHLVARXXXXXXXXXT-NQMESEAASGH--- 2184 SEQQRLICRG+VLKDDQLLSAYHVEDGHTLHLV R T M SE H Sbjct: 61 SEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPVPPPSASTMGFMGSEGLHDHSAS 120 Query: 2183 -PATSSSGGAHIRSPIAHSFVFESVNV-DQGDMDTSVIGRIISSMLDSIGATSSGTGNTS 2010 P T++S G + +AHS V + N+ DQGD I RI+S++L SI T+ Sbjct: 121 
EPTTNASHGQG--NQVAHSVVLGTFNIADQGDGTLPDINRIVSAVLGSI-LTNGSNSEGG 177 Query: 2009 DLREIISERLGRAASDGGLSDSTQPHPNP--TNQQRERRQADVRFSSTVPLGPQNPTVIP 1836 + RE SER+ R DS +P P+ Q + RF S V LGP P VIP Sbjct: 178 NRRESGSERIDRTIGASVPHDSMRPQPSQPAAGVQSDPLHGAFRFPSAVSLGPLQPPVIP 237 Query: 1835 DALTTLSQYIGLMRDEFRREGFGSNASEQINNSDAADVR----QGRSNASNSPLAQVGLP 1668 D+L TLSQY+ MR EF N NN+ A V Q +A +S Q GLP Sbjct: 238 DSLATLSQYLTRMRHEFHVIARSYN-----NNTQPAGVTGNEGQEHDSAPHSSAGQAGLP 292 Query: 1667 TPASLAEILLSARQLLMEQIEDCLSQLAGQLGDHTSITDPLTRMNMQSSAMRSGLLMRNL 1488 TPASLAE++ S R +L+EQ +CL QL QL ++TDPL RM +QS+AMRSG++++NL Sbjct: 293 TPASLAEVIHSTRHMLIEQAGECLYQLTRQLEGQANMTDPLMRMTVQSNAMRSGVILQNL 352 Query: 1487 GSMFLELGRATMTLRMGQSPAEAVVNAGPANFISANGPNPLMVQAVPFHPGSSPGVTNMG 1308 G++ LELGRATMTLRMG++P+EAVVNAGP+ FIS +GPNP+MVQ +PF PG+S G MG Sbjct: 353 GALLLELGRATMTLRMGRTPSEAVVNAGPSVFISTSGPNPIMVQPLPFQPGTSFGAVPMG 412 Query: 1307 SVNSGHGLEGEPPEAMFIPRNIEIRIRTGRAAPVAATNASEQAPGQQEQEQMDPARNSTG 1128 +V+SG L G + FIPRNI+IRIRT A N EQA QQ Q +PAR Sbjct: 413 AVHSGSSLVGSTLASGFIPRNIDIRIRT------ANVNQGEQAGVQQPPGQTNPARPVLA 466 Query: 1127 VNVVHQTFPGGDSGVRVMPLRTVV-AXXXXXXXXXXXXXXXXVRLFYPLLARVRQASTGT 951 +F G+SGVRV+P+RTVV A + LFYPLLARV+ ++G Sbjct: 467 GAAGAHSFT-GESGVRVVPIRTVVAAVPAPVNRPPSDPSGSSLGLFYPLLARVQHVTSGH 525 Query: 950 ASDTRGSQASGQLNQGAPEVGRLLLHQSPVQRENLEPNGGGVAGDDSHFPANAVPFVSGS 771 S RGSQ + + PE + +S VQ +N+ N A D + S + Sbjct: 526 FSSPRGSQVASERPPSVPETEQRPSPESAVQHQNIGLNHDASARDVN----------STN 575 Query: 770 SPPANDSTAYQGVSAVFDSSQQNPLNNDDESRTQANDDAARGTESHDAARVDSEQGVLFS 591 + P ND QG++ N ++R DAA G V S+ G+ S Sbjct: 576 TTPQND----QGIT-----------RNAADAR-----DAASG--------VGSDDGIFLS 607 Query: 590 NVLRHIMPLISQ 555 N+LR ++P+ISQ Sbjct: 608 NLLRQVIPVISQ 619 >ref|XP_010913366.1| PREDICTED: large proline-rich protein BAG6, partial [Elaeis guineensis] Length = 465 Score = 435 bits (1118), Expect(2) = e-124 Identities = 250/427 (58%), Positives = 304/427 (71%), Gaps = 11/427 (2%) Frame = -3 Query: 2528 
MASNEAAKVTPI-GNVVEDSTTTVEIKIKTLDSQTYTLRVDKFVPVPALKEQIASVTGVL 2352 M +N A +T G+V EDS TTVEIKIKTLDSQTYTLRV+K VP+ LKEQIA+VTGV+ Sbjct: 1 MGTNGARDITTSHGDVTEDSETTVEIKIKTLDSQTYTLRVNKCVPILMLKEQIATVTGVV 60 Query: 2351 SEQQRLICRGRVLKDDQLLSAYHVEDGHTLHLVARXXXXXXXXXTN-QMESEAASGHPAT 2175 SEQQRLICRG+VLKDD+LLSAYHVEDGHTLHLV R + + SE AS +PA Sbjct: 61 SEQQRLICRGKVLKDDELLSAYHVEDGHTLHLVVRQPHQSSSSPSTGHVGSEGASANPAA 120 Query: 2174 -SSSGGAHIR-SPIAHSFVFESVNVDQGDMDTSVIGRIISSMLDSIGATSSGTGNT-SDL 2004 SSS AH R S +A S VFE+VN+DQGD TS +GRIISS+L SIG T++ N +DL Sbjct: 121 NSSSSTAHNRGSHVARSIVFEAVNIDQGDNRTSHLGRIISSLLSSIGTTNTAFQNPRNDL 180 Query: 2003 REIISERLGRAASDGGLSDSTQPHPNPTNQQRE--RRQADVRFSSTVPLGPQNPTVIPDA 1830 RE + GR + D GLSD+ Q +PNP + E +Q +RF S PLG Q P VIPD+ Sbjct: 181 RETV----GRTSGDTGLSDAMQSNPNPPTSRVELDSQQGPLRFQSVFPLGSQQPIVIPDS 236 Query: 1829 LTTLSQYIGLMRDEFRREGFGSNASEQINNSDAADVR----QGRSNASNSPLAQVGLPTP 1662 LTT++QY+G++RD+FRREG EQ N++ AA + Q S P Q GLP+P Sbjct: 237 LTTMNQYLGVIRDDFRREGHSIYGREQTNDAAAAGMNGNDVQNHDFLSPLPSRQGGLPSP 296 Query: 1661 ASLAEILLSARQLLMEQIEDCLSQLAGQLGDHTSITDPLTRMNMQSSAMRSGLLMRNLGS 1482 ASLAEI+LS RQLLM+Q CLSQLA +L DH S+TDPL RM++QSSA+RSG+L+RNLGS Sbjct: 297 ASLAEIVLSTRQLLMDQAGGCLSQLARRLDDHVSVTDPLMRMDLQSSAIRSGVLLRNLGS 356 Query: 1481 MFLELGRATMTLRMGQSPAEAVVNAGPANFISANGPNPLMVQAVPFHPGSSPGVTNMGSV 1302 + LELGR TMTL MGQ+P EAVVNAGPA FISA+GPNP+MVQ VPF ++G Sbjct: 357 LLLELGRTTMTLHMGQTPLEAVVNAGPAVFISASGPNPVMVQPVPF---------SLGQA 407 Query: 1301 NSGHGLE 1281 +GH LE Sbjct: 408 LAGHSLE 414 Score = 40.4 bits (93), Expect(2) = e-124 Identities = 27/51 (52%), Positives = 30/51 (58%), Gaps = 1/51 (1%) Frame = -1 Query: 1276 NHLRLCLFLEILKYESVQAVQLLLLPP-TQANRLLDNKSKSKWIQQGIPLV 1127 NHL L LFL L Y S Q V LLP T NRL+ + KSK IQQ I L+ Sbjct: 415 NHLLLHLFLGTLTYVSAQVVVPYLLPMLTWVNRLVHSNRKSKQIQQEIHLL 465 >ref|XP_007013655.1| Ubiquitin-like superfamily protein, putative isoform 1 [Theobroma cacao] gi|590578981|ref|XP_007013660.1| Ubiquitin-like superfamily protein, putative isoform 1 [Theobroma cacao] 
gi|508784018|gb|EOY31274.1| Ubiquitin-like superfamily protein, putative isoform 1 [Theobroma cacao] gi|508784023|gb|EOY31279.1| Ubiquitin-like superfamily protein, putative isoform 1 [Theobroma cacao] Length = 724 Score = 452 bits (1162), Expect = e-124 Identities = 305/736 (41%), Positives = 406/736 (55%), Gaps = 31/736 (4%) Frame = -3 Query: 2528 MASNEAAKVTPIGNVVEDSTTTVEIKIKTLDSQTYTLRVDKFVPVPALKEQIASVTGVLS 2349 M S A KV P + E S TT+EIKIKTLDSQTYTLRVDK +PVPALKEQIASVTGVLS Sbjct: 1 MGSTGADKV-PRDSETEGSETTIEIKIKTLDSQTYTLRVDKQMPVPALKEQIASVTGVLS 59 Query: 2348 EQQRLICRGRVLKDDQLLSAYHVEDGHTLHLVARXXXXXXXXXTNQMESEAASGHPATSS 2169 EQQRLICRG+VLKDDQLLSAYHVEDGHTLH+V R + +++ASG TS Sbjct: 60 EQQRLICRGKVLKDDQLLSAYHVEDGHTLHMVVRQPVPPSSDGSPHSANDSASG---TSR 116 Query: 2168 SGGAHIRSPIAHSFVFESVNV-DQGDMDTSVIGRIISSMLDSIGATSSGTGNTS-DLREI 1995 H+ A S V E+ NV DQGD I RI+S++L S G + G+GN D+RE Sbjct: 117 GHSNHV----APSVVIETFNVPDQGDGVPPEISRIVSAVLGSFGFANVGSGNIGGDVREH 172 Query: 1994 ISERLGRAASDGGLSDSTQPHPNPTNQ--QRERRQADVRFSSTVPLGPQNPTVIPDALTT 1821 S+RL R + G+ DS+Q + Q +R + + V LGP P VIPD+L T Sbjct: 173 GSQRLERTSGASGMPDSSQAQTEQASMRGQSDRVHSAFGLPAAVSLGPLQPPVIPDSLAT 232 Query: 1820 LSQYIGLMRDEFRREGFGSNASEQINNSDAADVRQGRSN-ASNSPLAQVGLPTPASLAEI 1644 LSQY+ +R EF +G G E + + SN ASNS Q GLPTPASLAE+ Sbjct: 233 LSQYLSHLRREF--DGIGRAGGEDPQAASLSRTGDRDSNPASNSGTVQEGLPTPASLAEV 290 Query: 1643 LLSARQLLMEQIEDCLSQLAGQLGDHTSITDPLTRMNMQSSAMRSGLLMRNLGSMFLELG 1464 LL+ RQLL+EQ +CL QLA QL D ++TD R++ QS A R+G+L++NLGS+FLELG Sbjct: 291 LLATRQLLIEQAGECLQQLARQLEDQGNVTDSSARLSAQSIAWRTGVLLQNLGSLFLELG 350 Query: 1463 RATMTLRMGQSPAEAVVNAGPANFISANGPNPLMVQAVPFHPGSSPGVTNMGSVNSGHGL 1284 R TMT+R+GQ+P+EAVVNAGPA FIS +GPNPLMVQA+PF PG+S G MG+V G GL Sbjct: 351 RTTMTIRLGQTPSEAVVNAGPAVFISPSGPNPLMVQALPFQPGTSFGAFPMGTVQPGSGL 410 Query: 1283 EGEPPEAMFIPRNIEIRIRTGRAAPVAATNASEQAPGQQEQEQMDPARNSTGVNVVHQTF 1104 + +PR I+I+IR G + N E+ Q+ Q +P+ S N QT Sbjct: 411 VNGLGTGL-LPRRIDIQIRRGSSVATPNVNREERGDTAQQSGQRNPSMGSGSENRSTQTS 469 Query: 1103 
P--------GGDSGVRVMPLRTVV-AXXXXXXXXXXXXXXXXVRLFYPLLARVRQASTGT 951 G+SGVRV+P+RT+V A V L+YP L R + ++G Sbjct: 470 SRVSDTPSFAGESGVRVVPIRTMVAAVPTPFGRLPSDSSGNSVGLYYPFLGRFQHIASGH 529 Query: 950 ASDTRGSQASGQLNQGAPEVGRLLLHQSPVQRENLE--------PNGGGVAGDDSHFPAN 795 S RGSQ SG+ + + L+ +S Q+++ E PN + S+ + Sbjct: 530 VSGERGSQGSGENLSHGVQSEQHLIPESTAQQQSFEESTRDGSLPNPNSRQQERSNTRSV 589 Query: 794 AVPFVSGSSPPANDS------TAYQGVSAVFDSSQQNPLNNDDESRTQANDDAARGTESH 633 ++ ++ N ++ Q + A+F + N + + GT S Sbjct: 590 SINILAAGRTQNNQDSERQIPSSVQFLRAIFPGGEINVEEASVQGAATGSVQEQAGTSSG 649 Query: 632 DAARVDS--EQGVLFSNVLRHIMPLISQGREHGVSSTIPNSSTAXXXXXXXXXXXXXQHQ 459 A S +QGV SN+L IMP + Q + ST+P + Sbjct: 650 APAAEPSITDQGVFLSNLLHQIMPYVPQ-QASSQQSTVPTEEANTSTQAEHTSPGSSRRP 708 Query: 458 RD-PPEAPSPKRTRRD 414 D P +P+ KR +R+ Sbjct: 709 SDSEPNSPNSKRQKRE 724 >ref|XP_007013657.1| Ubiquitin-like superfamily protein, putative isoform 3 [Theobroma cacao] gi|508784020|gb|EOY31276.1| Ubiquitin-like superfamily protein, putative isoform 3 [Theobroma cacao] Length = 730 Score = 451 bits (1160), Expect = e-123 Identities = 305/735 (41%), Positives = 405/735 (55%), Gaps = 31/735 (4%) Frame = -3 Query: 2528 MASNEAAKVTPIGNVVEDSTTTVEIKIKTLDSQTYTLRVDKFVPVPALKEQIASVTGVLS 2349 M S A KV P + E S TT+EIKIKTLDSQTYTLRVDK +PVPALKEQIASVTGVLS Sbjct: 1 MGSTGADKV-PRDSETEGSETTIEIKIKTLDSQTYTLRVDKQMPVPALKEQIASVTGVLS 59 Query: 2348 EQQRLICRGRVLKDDQLLSAYHVEDGHTLHLVARXXXXXXXXXTNQMESEAASGHPATSS 2169 EQQRLICRG+VLKDDQLLSAYHVEDGHTLH+V R + +++ASG TS Sbjct: 60 EQQRLICRGKVLKDDQLLSAYHVEDGHTLHMVVRQPVPPSSDGSPHSANDSASG---TSR 116 Query: 2168 SGGAHIRSPIAHSFVFESVNV-DQGDMDTSVIGRIISSMLDSIGATSSGTGNTS-DLREI 1995 H+ A S V E+ NV DQGD I RI+S++L S G + G+GN D+RE Sbjct: 117 GHSNHV----APSVVIETFNVPDQGDGVPPEISRIVSAVLGSFGFANVGSGNIGGDVREH 172 Query: 1994 ISERLGRAASDGGLSDSTQPHPNPTNQ--QRERRQADVRFSSTVPLGPQNPTVIPDALTT 1821 S+RL R + G+ DS+Q + Q +R + + V LGP P VIPD+L T Sbjct: 173 GSQRLERTSGASGMPDSSQAQTEQASMRGQSDRVHSAFGLPAAVSLGPLQPPVIPDSLAT 232 Query: 1820 
LSQYIGLMRDEFRREGFGSNASEQINNSDAADVRQGRSN-ASNSPLAQVGLPTPASLAEI 1644 LSQY+ +R EF +G G E + + SN ASNS Q GLPTPASLAE+ Sbjct: 233 LSQYLSHLRREF--DGIGRAGGEDPQAASLSRTGDRDSNPASNSGTVQEGLPTPASLAEV 290 Query: 1643 LLSARQLLMEQIEDCLSQLAGQLGDHTSITDPLTRMNMQSSAMRSGLLMRNLGSMFLELG 1464 LL+ RQLL+EQ +CL QLA QL D ++TD R++ QS A R+G+L++NLGS+FLELG Sbjct: 291 LLATRQLLIEQAGECLQQLARQLEDQGNVTDSSARLSAQSIAWRTGVLLQNLGSLFLELG 350 Query: 1463 RATMTLRMGQSPAEAVVNAGPANFISANGPNPLMVQAVPFHPGSSPGVTNMGSVNSGHGL 1284 R TMT+R+GQ+P+EAVVNAGPA FIS +GPNPLMVQA+PF PG+S G MG+V G GL Sbjct: 351 RTTMTIRLGQTPSEAVVNAGPAVFISPSGPNPLMVQALPFQPGTSFGAFPMGTVQPGSGL 410 Query: 1283 EGEPPEAMFIPRNIEIRIRTGRAAPVAATNASEQAPGQQEQEQMDPARNSTGVNVVHQTF 1104 + +PR I+I+IR G + N E+ Q+ Q +P+ S N QT Sbjct: 411 VNGLGTGL-LPRRIDIQIRRGSSVATPNVNREERGDTAQQSGQRNPSMGSGSENRSTQTS 469 Query: 1103 P--------GGDSGVRVMPLRTVV-AXXXXXXXXXXXXXXXXVRLFYPLLARVRQASTGT 951 G+SGVRV+P+RT+V A V L+YP L R + ++G Sbjct: 470 SRVSDTPSFAGESGVRVVPIRTMVAAVPTPFGRLPSDSSGNSVGLYYPFLGRFQHIASGH 529 Query: 950 ASDTRGSQASGQLNQGAPEVGRLLLHQSPVQRENLE--------PNGGGVAGDDSHFPAN 795 S RGSQ SG+ + + L+ +S Q+++ E PN + S+ + Sbjct: 530 VSGERGSQGSGENLSHGVQSEQHLIPESTAQQQSFEESTRDGSLPNPNSRQQERSNTRSV 589 Query: 794 AVPFVSGSSPPANDS------TAYQGVSAVFDSSQQNPLNNDDESRTQANDDAARGTESH 633 ++ ++ N ++ Q + A+F + N + + GT S Sbjct: 590 SINILAAGRTQNNQDSERQIPSSVQFLRAIFPGGEINVEEASVQGAATGSVQEQAGTSSG 649 Query: 632 DAARVDS--EQGVLFSNVLRHIMPLISQGREHGVSSTIPNSSTAXXXXXXXXXXXXXQHQ 459 A S +QGV SN+L IMP + Q + ST+P + Sbjct: 650 APAAEPSITDQGVFLSNLLHQIMPYVPQ-QASSQQSTVPTEEANTSTQAEHTSPGSSRRP 708 Query: 458 RD-PPEAPSPKRTRR 417 D P +P+ KR +R Sbjct: 709 SDSEPNSPNSKRQKR 723 >ref|XP_007013661.1| Ubiquitin-like superfamily protein, putative isoform 7 [Theobroma cacao] gi|508784024|gb|EOY31280.1| Ubiquitin-like superfamily protein, putative isoform 7 [Theobroma cacao] Length = 725 Score = 450 bits (1157), Expect = e-123 Identities = 304/737 (41%), Positives = 408/737 (55%), Gaps = 32/737 (4%) Frame = -3 Query: 2528 
MASNEAAKVTPIGNVVEDSTTTVEIKIKTLDSQTYTLRVDKFVPVPALKEQIASVTGVLS 2349 M S A KV P + E S TT+EIKIKTLDSQTYTLRVDK +PVPALKEQIASVTGVLS Sbjct: 1 MGSTGADKV-PRDSETEGSETTIEIKIKTLDSQTYTLRVDKQMPVPALKEQIASVTGVLS 59 Query: 2348 EQQRLICRGRVLKDDQLLSAYHVEDGHTLHLVARXXXXXXXXXTNQMESEAASGHPATSS 2169 EQQRLICRG+VLKDDQLLSAYHVEDGHTLH+V R + +++ASG TS Sbjct: 60 EQQRLICRGKVLKDDQLLSAYHVEDGHTLHMVVRQPVPPSSDGSPHSANDSASG---TSR 116 Query: 2168 SGGAHIRSPIAHSFVFESVNV-DQGDMDTSVIGRIISSMLDSIGATSSGTGNTS-DLREI 1995 H+ A S V E+ NV DQGD I RI+S++L S G + G+GN D+RE Sbjct: 117 GHSNHV----APSVVIETFNVPDQGDGVPPEISRIVSAVLGSFGFANVGSGNIGGDVREH 172 Query: 1994 ISERLGRAASDGGLSDSTQPHPNPTNQ--QRERRQADVRFSSTVPLGPQNPTVIPDALTT 1821 S+RL R + G+ DS+Q + Q +R + + V LGP P VIPD+L T Sbjct: 173 GSQRLERTSGASGMPDSSQAQTEQASMRGQSDRVHSAFGLPAAVSLGPLQPPVIPDSLAT 232 Query: 1820 LSQYIGLMRDEFRREGFGSNASEQINNSDAADVRQGRSN-ASNSPLAQVGLPTPASLAEI 1644 LSQY+ +R EF +G G E + + SN ASNS Q GLPTPASLAE+ Sbjct: 233 LSQYLSHLRREF--DGIGRAGGEDPQAASLSRTGDRDSNPASNSGTVQEGLPTPASLAEV 290 Query: 1643 LLSARQLLMEQIEDCLSQLAGQLGDHTSITDPLTRMNMQSSAMRSGLLMRNLGSMFLELG 1464 LL+ RQLL+EQ +CL QLA QL D ++TD R++ QS A R+G+L++NLGS+FLELG Sbjct: 291 LLATRQLLIEQAGECLQQLARQLEDQGNVTDSSARLSAQSIAWRTGVLLQNLGSLFLELG 350 Query: 1463 RATMTLRMGQSPAEAVVNAGPANFISANGPNPLMVQAVPFHPGSSPGVTNMGSVNSGHGL 1284 R TMT+R+GQ+P+EAVVNAGPA FIS +GPNPLMVQA+PF PG+S G MG+V G GL Sbjct: 351 RTTMTIRLGQTPSEAVVNAGPAVFISPSGPNPLMVQALPFQPGTSFGAFPMGTVQPGSGL 410 Query: 1283 EGEPPEAMFIPRNIEIRIRTGRAAPVAATNASEQAPGQQEQEQMDPARNSTGVNVVHQTF 1104 + +PR I+I+IR G + N E+ Q+ Q +P+ S N QT Sbjct: 411 VNGLGTGL-LPRRIDIQIRRGSSVATPNVNREERGDTAQQSGQRNPSMGSGSENRSTQTS 469 Query: 1103 P--------GGDSGVRVMPLRTVV-AXXXXXXXXXXXXXXXXVRLFYPLLARVRQASTGT 951 G+SGVRV+P+RT+V A V L+YP L R + ++G Sbjct: 470 SRVSDTPSFAGESGVRVVPIRTMVAAVPTPFGRLPSDSSGNSVGLYYPFLGRFQHIASGH 529 Query: 950 ASDTRGSQASGQLNQGAPEVGRLLLHQSPVQRENLE--------PNGGGVAGDDSHFPAN 795 S RGSQ SG+ + + L+ +S Q+++ E PN + S+ + Sbjct: 530 VSGERGSQGSGENLSHGVQSEQHLIPESTAQQQSFEESTRDGSLPNPNSRQQERSNTRSV 589 Query: 794 
AVPFVSGSSPPANDS------TAYQGVSAVFDSSQQNPLNNDDESRTQANDDAARGTESH 633 ++ ++ N ++ Q + A+F + N + + GT S Sbjct: 590 SINILAAGRTQNNQDSERQIPSSVQFLRAIFPGGEINVEEASVQGAATGSVQEQAGTSSG 649 Query: 632 DAARVDS--EQGVLFSNVLRHIMPLISQGREHGVSSTIP--NSSTAXXXXXXXXXXXXXQ 465 A S +QGV SN+L IMP + Q + ST+P ++T+ + Sbjct: 650 APAAEPSITDQGVFLSNLLHQIMPYVPQ-QASSQQSTVPTEEANTSTQQAEHTSPGSSRR 708 Query: 464 HQRDPPEAPSPKRTRRD 414 P +P+ KR + + Sbjct: 709 PSDSEPNSPNSKRQKTE 725 >ref|XP_007013658.1| Ubiquitin-like superfamily protein, putative isoform 4 [Theobroma cacao] gi|508784021|gb|EOY31277.1| Ubiquitin-like superfamily protein, putative isoform 4 [Theobroma cacao] Length = 724 Score = 449 bits (1156), Expect = e-123 Identities = 304/736 (41%), Positives = 405/736 (55%), Gaps = 31/736 (4%) Frame = -3 Query: 2528 MASNEAAKVTPIGNVVEDSTTTVEIKIKTLDSQTYTLRVDKFVPVPALKEQIASVTGVLS 2349 M S A KV P + E S TT+EIKIKTLDSQTYTLRVDK +PVPALKEQIASVTGVLS Sbjct: 1 MGSTGADKV-PRDSETEGSETTIEIKIKTLDSQTYTLRVDKQMPVPALKEQIASVTGVLS 59 Query: 2348 EQQRLICRGRVLKDDQLLSAYHVEDGHTLHLVARXXXXXXXXXTNQMESEAASGHPATSS 2169 EQQRLICRG+VLKDDQLLSAYHVEDGHTLH+V R + +++ASG TS Sbjct: 60 EQQRLICRGKVLKDDQLLSAYHVEDGHTLHMVVRQPVPPSSDGSPHSANDSASG---TSR 116 Query: 2168 SGGAHIRSPIAHSFVFESVNV-DQGDMDTSVIGRIISSMLDSIGATSSGTGNTS-DLREI 1995 H+ A S V E+ NV DQGD I RI+S++L S G + G+GN D+RE Sbjct: 117 GHSNHV----APSVVIETFNVPDQGDGVPPEISRIVSAVLGSFGFANVGSGNIGGDVREH 172 Query: 1994 ISERLGRAASDGGLSDSTQPHPNPTNQ--QRERRQADVRFSSTVPLGPQNPTVIPDALTT 1821 S+RL R + G+ DS+Q + Q +R + + V LGP P VIPD+L T Sbjct: 173 GSQRLERTSGASGMPDSSQAQTEQASMRGQSDRVHSAFGLPAAVSLGPLQPPVIPDSLAT 232 Query: 1820 LSQYIGLMRDEFRREGFGSNASEQINNSDAADVRQGRSN-ASNSPLAQVGLPTPASLAEI 1644 LSQY+ +R EF +G G E + + SN ASNS Q GLPTPASLAE+ Sbjct: 233 LSQYLSHLRREF--DGIGRAGGEDPQAASLSRTGDRDSNPASNSGTVQEGLPTPASLAEV 290 Query: 1643 LLSARQLLMEQIEDCLSQLAGQLGDHTSITDPLTRMNMQSSAMRSGLLMRNLGSMFLELG 1464 LL+ RQLL+EQ +CL QLA QL D ++TD R++ QS A R+G+L++NLGS+FLELG Sbjct: 291 LLATRQLLIEQAGECLQQLARQLEDQGNVTDSSARLSAQSIAWRTGVLLQNLGSLFLELG 350 Query: 
1463 RATMTLRMGQSPAEAVVNAGPANFISANGPNPLMVQAVPFHPGSSPGVTNMGSVNSGHGL 1284 R TMT+R+GQ+P+EAVVNAGPA FIS +GPNPLMVQA+PF PG+S G MG+V G GL Sbjct: 351 RTTMTIRLGQTPSEAVVNAGPAVFISPSGPNPLMVQALPFQPGTSFGAFPMGTVQPGSGL 410 Query: 1283 EGEPPEAMFIPRNIEIRIRTGRAAPVAATNASEQAPGQQEQEQMDPARNSTGVNVVHQTF 1104 + +PR I+I+IR G + N E+ Q+ Q +P+ S N QT Sbjct: 411 VNGLGTGL-LPRRIDIQIRRGSSVATPNVNREERGDTAQQSGQRNPSMGSGSENRSTQTS 469 Query: 1103 P--------GGDSGVRVMPLRTVV-AXXXXXXXXXXXXXXXXVRLFYPLLARVRQASTGT 951 G+SGVRV+P+RT+V A V L+YP L R + ++G Sbjct: 470 SRVSDTPSFAGESGVRVVPIRTMVAAVPTPFGRLPSDSSGNSVGLYYPFLGRFQHIASGH 529 Query: 950 ASDTRGSQASGQLNQGAPEVGRLLLHQSPVQRENLE--------PNGGGVAGDDSHFPAN 795 S RGSQ SG+ + + L+ +S Q+++ E PN + S+ + Sbjct: 530 VSGERGSQGSGENLSHGVQSEQHLIPESTAQQQSFEESTRDGSLPNPNSRQQERSNTRSV 589 Query: 794 AVPFVSGSSPPANDS------TAYQGVSAVFDSSQQNPLNNDDESRTQANDDAARGTESH 633 ++ ++ N ++ Q + A+F + N + + GT S Sbjct: 590 SINILAAGRTQNNQDSERQIPSSVQFLRAIFPGGEINVEEASVQGAATGSVQEQAGTSSG 649 Query: 632 DAARVDS--EQGVLFSNVLRHIMPLISQGREHGVSSTIPNSSTAXXXXXXXXXXXXXQHQ 459 A S +QGV SN+L IMP + Q + ST+P + Sbjct: 650 APAAEPSITDQGVFLSNLLHQIMPYVPQ-QASSQQSTVPTEEANTSTQAEHTSPGSSRRP 708 Query: 458 RD-PPEAPSPKRTRRD 414 D P +P+ KR + + Sbjct: 709 SDSEPNSPNSKRQKTE 724 >ref|XP_010244866.1| PREDICTED: large proline-rich protein BAG6-like [Nelumbo nucifera] Length = 794 Score = 448 bits (1152), Expect = e-122 Identities = 309/714 (43%), Positives = 402/714 (56%), Gaps = 55/714 (7%) Frame = -3 Query: 2531 RMASNEAAKVTPIGN-VVEDSTTTVEIKIKTLDSQTYTLRVDKFVPVPALKEQIASVTGV 2355 RM SN+A ++ G+ E S TVEIKIKTLDSQTYTLRV+K VPVPALKEQIA++TGV Sbjct: 29 RMGSNDANELMISGSDEAECSEATVEIKIKTLDSQTYTLRVNKCVPVPALKEQIATITGV 88 Query: 2354 LSEQQRLICRGRVLKDDQLLSAYHVEDGHTLHLVARXXXXXXXXXT-NQMESEAASGHPA 2178 LSEQQRLICRG+VLKDDQLLSAYHVEDGHTLHLV R T + E + HPA Sbjct: 89 LSEQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPVPPSSASTMGFIGPEGSPDHPA 148 Query: 2177 TS-SSGGAHIR-SPIAHSFVFESVNV-DQGDMDTSVIGRIISSMLDSIGATSSGTGNTSD 2007 + ++ +H + + +AHS V + N+ DQGD I RI+S++L SI ++ Sbjct: 149 
SEPTTNTSHSQGNQVAHSVVLGTFNIADQGDGVLPDINRIVSAVLGSIFTNIGSGSEGAN 208 Query: 2006 LREIISERLGRAASDGGLSDSTQPHPN-PT-NQQRERRQADVRFSSTVPLGPQNPTVIPD 1833 RE SERL R + SDS + P PT Q + R + LGP VIPD Sbjct: 209 HREPASERLERTSGASVPSDSVRSQPGQPTAGVQSDPLHGAFRLPTPASLGPLQAPVIPD 268 Query: 1832 ALTTLSQYIGLMRDEFRREGFGSNASEQINNSDAADVRQGRSNASNSPLAQVGLPTPASL 1653 +L TLSQYI MR EFR +++ Q ++ + R S AS S Q GLPTPASL Sbjct: 269 SLATLSQYINRMRHEFRVIARSHSSNSQPASTPGNEGRDYDS-ASRSNEGQAGLPTPASL 327 Query: 1652 AEILLSARQLLMEQIEDCLSQLAGQLGDHTSITDPLTRMNMQSSAMRSGLLMRNLGSMFL 1473 AE++LS RQ+L++Q +CL QL QL D +ITDPL RM +QS+AMRSG+ ++NLG++ L Sbjct: 328 AEVILSTRQMLIDQAGECLYQLTRQLEDQGNITDPLMRMTIQSNAMRSGVFLQNLGALLL 387 Query: 1472 ELGRATMTLRMGQSPAEAVVNAGPANFISANGPNPLMVQAVPFHPGSSPGVTNMGSVNSG 1293 ELGRATM LRMG++P+EAVVNAGP+ FIS +GPNP+MVQ +PF PG+S G +G+V+ G Sbjct: 388 ELGRATMMLRMGRTPSEAVVNAGPSVFISNSGPNPIMVQPLPFQPGTSFGAVPIGAVHPG 447 Query: 1292 HGLEGEPPEAMFIPRNIEIRIR--TGRAAPVAATNASEQAPGQQEQEQMDPARNSTGVNV 1119 L G + FIPRNI+IRIR TG + P A N EQA +Q + R S + Sbjct: 448 SSLVGGTLGSGFIPRNIDIRIRTATGSSVPTANVNQGEQAGVRQPSGPTNSVRPSGSASG 507 Query: 1118 VHQTFP-GGDSGVRVMPLRTVV-AXXXXXXXXXXXXXXXXVRLFYPLLARVRQASTGTAS 945 + G+SGVRV+P+RTVV A + LFYPLLARV+ ++G + Sbjct: 508 ASGSPSFAGESGVRVVPIRTVVAAVPAPVNRPASDSSGSSIGLFYPLLARVQHVTSGHFN 567 Query: 944 DTRGSQASGQLNQGAPEVGRLLLHQSPVQRENLEPNGGGVAGDDSHFPANAVPFVSGSSP 765 TR SQ SG PE R + +S VQ +N+ + G DS AN V S Sbjct: 568 STRDSQVSGDRPPSVPETVRHPIPESVVQNQNISLHIGTSRDADS---ANVVVQNQQGSL 624 Query: 764 PANDSTAYQGVSAVFDSSQQNPLNNDDESRTQANDDAAR--------------------- 648 P + S Q S +++ N NN R Q ++A Sbjct: 625 PTSSSRQSQ-PSDSNNNNNNNNNNNLSSGRIQNGQESAAHISNRFDQLLRTIFPGEQISV 683 Query: 647 -----------------GTESHDA------ARVDSEQGVLFSNVLRHIMPLISQ 555 GT A +RV+S+ G+ SN+LR +MP+ISQ Sbjct: 684 GEANFPGMSTGSATEHVGTAGSTANAREATSRVESDDGIFLSNLLRQVMPVISQ 737 >ref|XP_002283083.2| PREDICTED: large proline-rich protein bag6-A isoform X1 [Vitis vinifera] Length = 708 Score = 447 bits (1151), Expect = e-122 Identities = 301/718 (41%), Positives 
= 388/718 (54%), Gaps = 31/718 (4%) Frame = -3 Query: 2474 STTTVEIKIKTLDSQTYTLRVDKFVPVPALKEQIASVTGVLSEQQRLICRGRVLKDDQLL 2295 S TVEIKIKTLDSQTYTLRVDK +PVPALKEQIASVTGVLSEQQRLICRGRVLKDDQLL Sbjct: 20 SEATVEIKIKTLDSQTYTLRVDKCMPVPALKEQIASVTGVLSEQQRLICRGRVLKDDQLL 79 Query: 2294 SAYHVEDGHTLHLVARXXXXXXXXXTNQMESEAASGHPATSSSGGAHIRSPIAHSFVFES 2115 SAYHVEDGHTLHLV R + ++ A T + G H+ S + Sbjct: 80 SAYHVEDGHTLHLVVRQPFPPSSE--SLPDNSATDPASNTLRNQGFHVGSSVV------- 130 Query: 2114 VNVDQGDMDTSVIGRIISSMLDSIGATSSGTGNT-SDLREIISERLGRAASDGGLSDSTQ 1938 V +QGD + RI+S++L S G + +G+ +D R+ + ER GL DS++ Sbjct: 131 VLSEQGD-GVPDLSRIVSAVLSSFGVNNVRSGSEGADPRDPVPERGSGTPGLSGLRDSSR 189 Query: 1937 PHPNP--TNQQRERRQADVRFSSTVPLGPQNPTVIPDALTTLSQYIGLMRDEFRRE--GF 1770 PN T Q + V L P VIPD+LTTLSQY+ MR EF G Sbjct: 190 QQPNQAATIDQSNPLNGASLLPNDVTLEQLQPPVIPDSLTTLSQYLRNMRHEFGGSVRGH 249 Query: 1769 GSNASEQINNSDAADVRQGRSNASNSPLAQVGLPTPASLAEILLSARQLLMEQIEDCLSQ 1590 G+N++ I+ D Q S + Q GLPTPASLAE++LS RQ+L+EQ + LSQ Sbjct: 250 GNNSAAGIHGCDV----QNSEATLQSDVTQGGLPTPASLAEVILSTRQILIEQAAEDLSQ 305 Query: 1589 LAGQLGDHTSITDPLTRMNMQSSAMRSGLLMRNLGSMFLELGRATMTLRMGQSPAEAVVN 1410 L QL +H ++TDPL R ++QS+A+R G ++RNLG++ LELGR TMTLRMGQ+P +AVVN Sbjct: 306 LTRQLENHANVTDPLARRSIQSNALRLGAILRNLGALLLELGRTTMTLRMGQTPNDAVVN 365 Query: 1409 AGPANFISANGPNPLMVQAVPFHPGSSPGVTNMGSVNSGHGLEGEPPEAMFIPRNIEIRI 1230 AGPA FIS +GPNP+MVQ +PFHPG+S G MG+V G G + F+PRNI+IRI Sbjct: 366 AGPALFISTSGPNPIMVQPLPFHPGTSFGAIPMGTVQPGSGFSSGTLRSGFLPRNIDIRI 425 Query: 1229 RTGRAAPVAATNASEQAPGQQEQEQMDPARNSTGVNVVHQTFPGG--------DSGVRVM 1074 RTG P + N E A GQ Q Q PA S GVN +HQ G ++ VRV+ Sbjct: 426 RTGSMMPPSVINQREPAGGQTSQGQTSPAL-SGGVNSIHQPAAGASRSSSSTREAEVRVV 484 Query: 1073 PLRTVVAXXXXXXXXXXXXXXXXVR-LFYPLLARVRQASTGTASDTRGSQAS-------- 921 P+RTVVA LFYP+LARV+ +G + RGSQAS Sbjct: 485 PIRTVVAAIPAAARHSPSDSSRSSMGLFYPVLARVQHVMSGNYNGARGSQASDEHQPRGL 544 Query: 920 GQLNQGAPEVGRLLLHQSPVQRENLEPNGGGVAGDDSHFPANAVPFVSGSSPPANDSTAY 741 G Q PE ++N+E G G++ +F + SG Sbjct: 545 
GAQQQSVPE---------SASQQNIESQGRD-GGENPNFQTASTQLRSGLDQLLRTIFPV 594 Query: 740 QGVSAVFDSSQQNPLNNDDESRTQANDDAARGTESHDAARVDSEQGVLFSNVLRHIMPLI 561 + + D + Q + +AA TE + RV ++G FSN+L HIMPLI Sbjct: 595 EQIHVGSDVNFQG--TGTSSTGITGTTEAAANTEEIE-PRV-GDEGTFFSNLLHHIMPLI 650 Query: 560 SQGREHGVS---------STIPNSSTAXXXXXXXXXXXXXQHQRDPPEAPSPKRTRRD 414 S+ G S +I + QRDPP P+ KR +++ Sbjct: 651 SENSAMGSSDAAADRADVGSINGQDSTTHPQENSDVGTSSGRQRDPPSPPNSKRQKKE 708 >ref|XP_007013659.1| Ubiquitin-like superfamily protein, putative isoform 5 [Theobroma cacao] gi|508784022|gb|EOY31278.1| Ubiquitin-like superfamily protein, putative isoform 5 [Theobroma cacao] Length = 729 Score = 447 bits (1150), Expect = e-122 Identities = 296/688 (43%), Positives = 390/688 (56%), Gaps = 30/688 (4%) Frame = -3 Query: 2528 MASNEAAKVTPIGNVVEDSTTTVEIKIKTLDSQTYTLRVDKFVPVPALKEQIASVTGVLS 2349 M S A KV P + E S TT+EIKIKTLDSQTYTLRVDK +PVPALKEQIASVTGVLS Sbjct: 1 MGSTGADKV-PRDSETEGSETTIEIKIKTLDSQTYTLRVDKQMPVPALKEQIASVTGVLS 59 Query: 2348 EQQRLICRGRVLKDDQLLSAYHVEDGHTLHLVARXXXXXXXXXTNQMESEAASGHPATSS 2169 EQQRLICRG+VLKDDQLLSAYHVEDGHTLH+V R + +++ASG TS Sbjct: 60 EQQRLICRGKVLKDDQLLSAYHVEDGHTLHMVVRQPVPPSSDGSPHSANDSASG---TSR 116 Query: 2168 SGGAHIRSPIAHSFVFESVNV-DQGDMDTSVIGRIISSMLDSIGATSSGTGNTS-DLREI 1995 H+ A S V E+ NV DQGD I RI+S++L S G + G+GN D+RE Sbjct: 117 GHSNHV----APSVVIETFNVPDQGDGVPPEISRIVSAVLGSFGFANVGSGNIGGDVREH 172 Query: 1994 ISERLGRAASDGGLSDSTQPHPNPTNQ--QRERRQADVRFSSTVPLGPQNPTVIPDALTT 1821 S+RL R + G+ DS+Q + Q +R + + V LGP P VIPD+L T Sbjct: 173 GSQRLERTSGASGMPDSSQAQTEQASMRGQSDRVHSAFGLPAAVSLGPLQPPVIPDSLAT 232 Query: 1820 LSQYIGLMRDEFRREGFGSNASEQINNSDAADVRQGRSN-ASNSPLAQVGLPTPASLAEI 1644 LSQY+ +R EF +G G E + + SN ASNS Q GLPTPASLAE+ Sbjct: 233 LSQYLSHLRREF--DGIGRAGGEDPQAASLSRTGDRDSNPASNSGTVQEGLPTPASLAEV 290 Query: 1643 LLSARQLLMEQIEDCLSQLAGQLGDHTSITDPLTRMNMQSSAMRSGLLMRNLGSMFLELG 1464 LL+ RQLL+EQ +CL QLA QL D ++TD R++ QS A R+G+L++NLGS+FLELG Sbjct: 291 LLATRQLLIEQAGECLQQLARQLEDQGNVTDSSARLSAQSIAWRTGVLLQNLGSLFLELG 350 Query: 1463 
RATMTLRMGQSPAEAVVNAGPANFISANGPNPLMVQAVPFHPGSSPGVTNMGSVNSGHGL 1284 R TMT+R+GQ+P+EAVVNAGPA FIS +GPNPLMVQA+PF PG+S G MG+V G GL Sbjct: 351 RTTMTIRLGQTPSEAVVNAGPAVFISPSGPNPLMVQALPFQPGTSFGAFPMGTVQPGSGL 410 Query: 1283 EGEPPEAMFIPRNIEIRIRTGRAAPVAATNASEQAPGQQEQEQMDPARNSTGVNVVHQTF 1104 + +PR I+I+IR G + N E+ Q+ Q +P+ S N QT Sbjct: 411 VNGLGTGL-LPRRIDIQIRRGSSVATPNVNREERGDTAQQSGQRNPSMGSGSENRSTQTS 469 Query: 1103 P--------GGDSGVRVMPLRTVV-AXXXXXXXXXXXXXXXXVRLFYPLLARVRQASTGT 951 G+SGVRV+P+RT+V A V L+YP L R + ++G Sbjct: 470 SRVSDTPSFAGESGVRVVPIRTMVAAVPTPFGRLPSDSSGNSVGLYYPFLGRFQHIASGH 529 Query: 950 ASDTRGSQASGQLNQGAPEVGRLLLHQSPVQRENLE--------PNGGGVAGDDSHFPAN 795 S RGSQ SG+ + + L+ +S Q+++ E PN + S+ + Sbjct: 530 VSGERGSQGSGENLSHGVQSEQHLIPESTAQQQSFEESTRDGSLPNPNSRQQERSNTRSV 589 Query: 794 AVPFVSGSSPPANDS------TAYQGVSAVFDSSQQNPLNNDDESRTQANDDAARGTESH 633 ++ ++ N ++ Q + A+F + N + + GT S Sbjct: 590 SINILAAGRTQNNQDSERQIPSSVQFLRAIFPGGEINVEEASVQGAATGSVQEQAGTSSG 649 Query: 632 DAARVDS--EQGVLFSNVLRHIMPLISQ 555 A S +QGV SN+L IMP + Q Sbjct: 650 APAAEPSITDQGVFLSNLLHQIMPYVPQ 677 >gb|KJB64610.1| hypothetical protein B456_010G057100 [Gossypium raimondii] Length = 720 Score = 437 bits (1123), Expect = e-119 Identities = 306/708 (43%), Positives = 394/708 (55%), Gaps = 35/708 (4%) Frame = -3 Query: 2528 MASNEAAKVTPIGNVVEDSTTTVEIKIKTLDSQTYTLRVDKFVPVPALKEQIASVTGVLS 2349 M S A KV P +E S T+EIKIKTLDSQTYTLRVDK +PVPALKEQIASVTGVLS Sbjct: 1 MGSTSAHKV-PSDTEMEGSEATIEIKIKTLDSQTYTLRVDKQMPVPALKEQIASVTGVLS 59 Query: 2348 EQQRLICRGRVLKDDQLLSAYHVEDGHTLHLVARXXXXXXXXXTNQMESEAASGHPATS- 2172 EQQRLICRG+VLKDDQLLSAYHVEDGHTLHLV R S+ + H A Sbjct: 60 EQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPVPP--------SSDGSPYHSANDP 111 Query: 2171 SSGGAHIRSPIAHSFVFESVNV-DQGDMDTSVIGRIISSMLDSIGATSSGTGNT-SDLRE 1998 +SG + S A SFV E+ NV DQGD I RI+S++L S G + +GNT SD R+ Sbjct: 112 ASGTSRGHSNHAPSFVIETFNVPDQGDGVPPEISRIVSAVLGSFGFANMASGNTGSDARD 171 Query: 1997 IISERLGRAASDGGLSDSTQPHPNPTNQ--QRERRQADVRFSSTVPLGPQNPTVIPDALT 1824 S+R R + G+ DS+Q + Q +R + + V LG P 
VIPD+L Sbjct: 172 HGSQRQERTSGGSGMPDSSQAQTELASMTSQSDRAHSAFGLPAAVSLGSMQPPVIPDSLA 231 Query: 1823 TLSQYIGLMRDEFRREGFGSNASEQINNSDAADVRQGRSNASNSPLAQVGLPTPASLAEI 1644 TLSQY+ +R+EF G Q R S ASNS GLPTPASLAE+ Sbjct: 232 TLSQYLSHIRNEFDALGRAGGNDSQTAPMSRTGSRDSNS-ASNSGTVHEGLPTPASLAEV 290 Query: 1643 LLSARQLLMEQIEDCLSQLAGQLGDHTSITDPLTRMNMQSSAMRSGLLMRNLGSMFLELG 1464 LLS RQ+L+EQ + + QLA QL D ++TDP R+ Q++A+R+G L+ NLGS+ LELG Sbjct: 291 LLSTRQMLIEQAGESVQQLARQLEDQVNVTDPSARLIAQTNALRTGALLHNLGSLLLELG 350 Query: 1463 RATMTLRMGQSPAEAVVNAGPANFISANGPNPLMVQAVPFHPGSSPGVTNMGSVNSGHGL 1284 R TMTLR+GQ+P+EAVVNAGPA FIS +GPNPLMVQA+PF PG+S G +G+V G GL Sbjct: 351 RTTMTLRLGQTPSEAVVNAGPAVFISPSGPNPLMVQALPFQPGTSFGAIPVGTVQPGSGL 410 Query: 1283 EGEPPEAMFIPRNIEIRIRTGRAAPVAATNASEQAP----GQQEQEQMDPARN---STGV 1125 F+PR I+I+IR R + +A N E+ P GQ Q + + N T Sbjct: 411 VNGLGTG-FVPRRIDIQIR--RGSSMATPNTREEHPPNQSGQSNQSMVSDSENRSSQTTS 467 Query: 1124 NVVHQTFPGGDSGVRVMPLRTVV-AXXXXXXXXXXXXXXXXVRLFYPLLARVRQASTGTA 948 V G+SGVRV+P+RT+V A V ++YPLL R++ + G Sbjct: 468 RVSDTPSFAGESGVRVVPIRTMVAAVPAPLGRLPSESSGNSVGVYYPLLGRLQNIAPGHV 527 Query: 947 SDTRGSQASGQLNQGAPEVGRLLLHQSPVQRENLEPNGGGVAGDDSHFPANAVPFVSGSS 768 S RG QASG+ + L + +S VQ ++ E + A D S AN+ ++ Sbjct: 528 SGERGPQASGEHPSSGAQPELLRIPESAVQHQSSEES----ARDGSLPNANSRQQERPNT 583 Query: 767 PPANDSTAYQG-VSAVFDSSQQNPLNNDDESRT-------QANDDAARGTESHDAARVDS 612 N S G DS +Q+P N RT Q + +++GT + D+ R + Sbjct: 584 RSVNISILAAGRTENNQDSERQSPSNVLQFLRTIFPGGEIQVEEASSQGT-ARDSVRGQA 642 Query: 611 E--------------QGVLFSNVLRHIMPLISQGREHGVSSTIPNSST 510 E QGV SN+L IMP ISQ G + P +T Sbjct: 643 EASNVAPAAETSITNQGVFLSNLLHQIMPYISQ--HAGSQRSTPEEAT 688 >ref|XP_007013662.1| Ubiquitin-like superfamily protein, putative isoform 8 [Theobroma cacao] gi|590578991|ref|XP_007013663.1| Ubiquitin-like superfamily protein, putative isoform 8 [Theobroma cacao] gi|590578994|ref|XP_007013664.1| Ubiquitin-like superfamily protein, putative isoform 8 [Theobroma cacao] gi|508784025|gb|EOY31281.1| Ubiquitin-like superfamily protein, putative 
isoform 8 [Theobroma cacao] gi|508784026|gb|EOY31282.1| Ubiquitin-like superfamily protein, putative isoform 8 [Theobroma cacao] gi|508784027|gb|EOY31283.1| Ubiquitin-like superfamily protein, putative isoform 8 [Theobroma cacao] Length = 575 Score = 434 bits (1117), Expect = e-118 Identities = 273/576 (47%), Positives = 350/576 (60%), Gaps = 14/576 (2%) Frame = -3 Query: 2528 MASNEAAKVTPIGNVVEDSTTTVEIKIKTLDSQTYTLRVDKFVPVPALKEQIASVTGVLS 2349 M S A KV P + E S TT+EIKIKTLDSQTYTLRVDK +PVPALKEQIASVTGVLS Sbjct: 1 MGSTGADKV-PRDSETEGSETTIEIKIKTLDSQTYTLRVDKQMPVPALKEQIASVTGVLS 59 Query: 2348 EQQRLICRGRVLKDDQLLSAYHVEDGHTLHLVARXXXXXXXXXTNQMESEAASGHPATSS 2169 EQQRLICRG+VLKDDQLLSAYHVEDGHTLH+V R + +++ASG TS Sbjct: 60 EQQRLICRGKVLKDDQLLSAYHVEDGHTLHMVVRQPVPPSSDGSPHSANDSASG---TSR 116 Query: 2168 SGGAHIRSPIAHSFVFESVNV-DQGDMDTSVIGRIISSMLDSIGATSSGTGNTS-DLREI 1995 H+ A S V E+ NV DQGD I RI+S++L S G + G+GN D+RE Sbjct: 117 GHSNHV----APSVVIETFNVPDQGDGVPPEISRIVSAVLGSFGFANVGSGNIGGDVREH 172 Query: 1994 ISERLGRAASDGGLSDSTQPHPNPTNQ--QRERRQADVRFSSTVPLGPQNPTVIPDALTT 1821 S+RL R + G+ DS+Q + Q +R + + V LGP P VIPD+L T Sbjct: 173 GSQRLERTSGASGMPDSSQAQTEQASMRGQSDRVHSAFGLPAAVSLGPLQPPVIPDSLAT 232 Query: 1820 LSQYIGLMRDEFRREGFGSNASEQINNSDAADVRQGRSN-ASNSPLAQVGLPTPASLAEI 1644 LSQY+ +R EF +G G E + + SN ASNS Q GLPTPASLAE+ Sbjct: 233 LSQYLSHLRREF--DGIGRAGGEDPQAASLSRTGDRDSNPASNSGTVQEGLPTPASLAEV 290 Query: 1643 LLSARQLLMEQIEDCLSQLAGQLGDHTSITDPLTRMNMQSSAMRSGLLMRNLGSMFLELG 1464 LL+ RQLL+EQ +CL QLA QL D ++TD R++ QS A R+G+L++NLGS+FLELG Sbjct: 291 LLATRQLLIEQAGECLQQLARQLEDQGNVTDSSARLSAQSIAWRTGVLLQNLGSLFLELG 350 Query: 1463 RATMTLRMGQSPAEAVVNAGPANFISANGPNPLMVQAVPFHPGSSPGVTNMGSVNSGHGL 1284 R TMT+R+GQ+P+EAVVNAGPA FIS +GPNPLMVQA+PF PG+S G MG+V G GL Sbjct: 351 RTTMTIRLGQTPSEAVVNAGPAVFISPSGPNPLMVQALPFQPGTSFGAFPMGTVQPGSGL 410 Query: 1283 EGEPPEAMFIPRNIEIRIRTGRAAPVAATNASEQAPGQQEQEQMDPARNSTGVNVVHQTF 1104 + +PR I+I+IR G + N E+ Q+ Q +P+ S N QT Sbjct: 411 VNGLGTGL-LPRRIDIQIRRGSSVATPNVNREERGDTAQQSGQRNPSMGSGSENRSTQTS 469 Query: 
1103 P--------GGDSGVRVMPLRTVV-AXXXXXXXXXXXXXXXXVRLFYPLLARVRQASTGT 951 G+SGVRV+P+RT+V A V L+YP L R + ++G Sbjct: 470 SRVSDTPSFAGESGVRVVPIRTMVAAVPTPFGRLPSDSSGNSVGLYYPFLGRFQHIASGH 529 Query: 950 ASDTRGSQASGQLNQGAPEVGRLLLHQSPVQRENLE 843 S RGSQ SG+ + + L+ +S Q+++ E Sbjct: 530 VSGERGSQGSGENLSHGVQSEQHLIPESTAQQQSFE 565 >ref|XP_007013656.1| Ubiquitin-like superfamily protein, putative isoform 2 [Theobroma cacao] gi|590578998|ref|XP_007013665.1| Ubiquitin-like superfamily protein, putative isoform 2 [Theobroma cacao] gi|508784019|gb|EOY31275.1| Ubiquitin-like superfamily protein, putative isoform 2 [Theobroma cacao] gi|508784028|gb|EOY31284.1| Ubiquitin-like superfamily protein, putative isoform 2 [Theobroma cacao] Length = 579 Score = 434 bits (1117), Expect = e-118 Identities = 273/576 (47%), Positives = 350/576 (60%), Gaps = 14/576 (2%) Frame = -3 Query: 2528 MASNEAAKVTPIGNVVEDSTTTVEIKIKTLDSQTYTLRVDKFVPVPALKEQIASVTGVLS 2349 M S A KV P + E S TT+EIKIKTLDSQTYTLRVDK +PVPALKEQIASVTGVLS Sbjct: 1 MGSTGADKV-PRDSETEGSETTIEIKIKTLDSQTYTLRVDKQMPVPALKEQIASVTGVLS 59 Query: 2348 EQQRLICRGRVLKDDQLLSAYHVEDGHTLHLVARXXXXXXXXXTNQMESEAASGHPATSS 2169 EQQRLICRG+VLKDDQLLSAYHVEDGHTLH+V R + +++ASG TS Sbjct: 60 EQQRLICRGKVLKDDQLLSAYHVEDGHTLHMVVRQPVPPSSDGSPHSANDSASG---TSR 116 Query: 2168 SGGAHIRSPIAHSFVFESVNV-DQGDMDTSVIGRIISSMLDSIGATSSGTGNTS-DLREI 1995 H+ A S V E+ NV DQGD I RI+S++L S G + G+GN D+RE Sbjct: 117 GHSNHV----APSVVIETFNVPDQGDGVPPEISRIVSAVLGSFGFANVGSGNIGGDVREH 172 Query: 1994 ISERLGRAASDGGLSDSTQPHPNPTNQ--QRERRQADVRFSSTVPLGPQNPTVIPDALTT 1821 S+RL R + G+ DS+Q + Q +R + + V LGP P VIPD+L T Sbjct: 173 GSQRLERTSGASGMPDSSQAQTEQASMRGQSDRVHSAFGLPAAVSLGPLQPPVIPDSLAT 232 Query: 1820 LSQYIGLMRDEFRREGFGSNASEQINNSDAADVRQGRSN-ASNSPLAQVGLPTPASLAEI 1644 LSQY+ +R EF +G G E + + SN ASNS Q GLPTPASLAE+ Sbjct: 233 LSQYLSHLRREF--DGIGRAGGEDPQAASLSRTGDRDSNPASNSGTVQEGLPTPASLAEV 290 Query: 1643 LLSARQLLMEQIEDCLSQLAGQLGDHTSITDPLTRMNMQSSAMRSGLLMRNLGSMFLELG 1464 LL+ RQLL+EQ +CL QLA QL D ++TD R++ QS A R+G+L++NLGS+FLELG 
Sbjct: 291 LLATRQLLIEQAGECLQQLARQLEDQGNVTDSSARLSAQSIAWRTGVLLQNLGSLFLELG 350 Query: 1463 RATMTLRMGQSPAEAVVNAGPANFISANGPNPLMVQAVPFHPGSSPGVTNMGSVNSGHGL 1284 R TMT+R+GQ+P+EAVVNAGPA FIS +GPNPLMVQA+PF PG+S G MG+V G GL Sbjct: 351 RTTMTIRLGQTPSEAVVNAGPAVFISPSGPNPLMVQALPFQPGTSFGAFPMGTVQPGSGL 410 Query: 1283 EGEPPEAMFIPRNIEIRIRTGRAAPVAATNASEQAPGQQEQEQMDPARNSTGVNVVHQTF 1104 + +PR I+I+IR G + N E+ Q+ Q +P+ S N QT Sbjct: 411 VNGLGTGL-LPRRIDIQIRRGSSVATPNVNREERGDTAQQSGQRNPSMGSGSENRSTQTS 469 Query: 1103 P--------GGDSGVRVMPLRTVV-AXXXXXXXXXXXXXXXXVRLFYPLLARVRQASTGT 951 G+SGVRV+P+RT+V A V L+YP L R + ++G Sbjct: 470 SRVSDTPSFAGESGVRVVPIRTMVAAVPTPFGRLPSDSSGNSVGLYYPFLGRFQHIASGH 529 Query: 950 ASDTRGSQASGQLNQGAPEVGRLLLHQSPVQRENLE 843 S RGSQ SG+ + + L+ +S Q+++ E Sbjct: 530 VSGERGSQGSGENLSHGVQSEQHLIPESTAQQQSFE 565 >gb|KJB64613.1| hypothetical protein B456_010G057100 [Gossypium raimondii] Length = 698 Score = 431 bits (1108), Expect = e-117 Identities = 305/708 (43%), Positives = 391/708 (55%), Gaps = 35/708 (4%) Frame = -3 Query: 2528 MASNEAAKVTPIGNVVEDSTTTVEIKIKTLDSQTYTLRVDKFVPVPALKEQIASVTGVLS 2349 M S A KV P +E S T+EIKIKTLDSQTYTLRVDK +PVPALKEQIASVTGVLS Sbjct: 1 MGSTSAHKV-PSDTEMEGSEATIEIKIKTLDSQTYTLRVDKQMPVPALKEQIASVTGVLS 59 Query: 2348 EQQRLICRGRVLKDDQLLSAYHVEDGHTLHLVARXXXXXXXXXTNQMESEAASGHPATS- 2172 EQQRLICRG+VLKDDQLLSAYHVEDGHTLHLV R S+ + H A Sbjct: 60 EQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPVPP--------SSDGSPYHSANDP 111 Query: 2171 SSGGAHIRSPIAHSFVFESVNV-DQGDMDTSVIGRIISSMLDSIGATSSGTGNT-SDLRE 1998 +SG + S A SFV E+ NV DQGD I RI+S++L S G + +GNT SD RE Sbjct: 112 ASGTSRGHSNHAPSFVIETFNVPDQGDGVPPEISRIVSAVLGSFGFANMASGNTGSDARE 171 Query: 1997 IISERLGRAASDGGLSDSTQPHPNPTNQ--QRERRQADVRFSSTVPLGPQNPTVIPDALT 1824 R + G+ DS+Q + Q +R + + V LG P VIPD+L Sbjct: 172 -------RTSGGSGMPDSSQAQTELASMTSQSDRAHSAFGLPAAVSLGSMQPPVIPDSLA 224 Query: 1823 TLSQYIGLMRDEFRREGFGSNASEQINNSDAADVRQGRSNASNSPLAQVGLPTPASLAEI 1644 TLSQY+ +R+EF G Q R S ASNS GLPTPASLAE+ Sbjct: 225 
TLSQYLSHIRNEFDALGRAGGNDSQTAPMSRTGSRDSNS-ASNSGTVHEGLPTPASLAEV 283 Query: 1643 LLSARQLLMEQIEDCLSQLAGQLGDHTSITDPLTRMNMQSSAMRSGLLMRNLGSMFLELG 1464 LLS RQ+L+EQ + + QLA QL D ++TDP R+ Q++A+R+G L+ NLGS+ LELG Sbjct: 284 LLSTRQMLIEQAGESVQQLARQLEDQVNVTDPSARLIAQTNALRTGALLHNLGSLLLELG 343 Query: 1463 RATMTLRMGQSPAEAVVNAGPANFISANGPNPLMVQAVPFHPGSSPGVTNMGSVNSGHGL 1284 R TMTLR+GQ+P+EAVVNAGPA FIS +GPNPLMVQA+PF PG+S G +G+V G GL Sbjct: 344 RTTMTLRLGQTPSEAVVNAGPAVFISPSGPNPLMVQALPFQPGTSFGAIPVGTVQPGSGL 403 Query: 1283 EGEPPEAMFIPRNIEIRIRTGRAAPVAATNASEQAP----GQQEQEQMDPARN---STGV 1125 F+PR I+I+IR R + +A N E+ P GQ Q + + N T Sbjct: 404 VNGLGTG-FVPRRIDIQIR--RGSSMATPNTREEHPPNQSGQSNQSMVSDSENRSSQTTS 460 Query: 1124 NVVHQTFPGGDSGVRVMPLRTVV-AXXXXXXXXXXXXXXXXVRLFYPLLARVRQASTGTA 948 V G+SGVRV+P+RT+V A V ++YPLL R++ + G Sbjct: 461 RVSDTPSFAGESGVRVVPIRTMVAAVPAPLGRLPSESSGNSVGVYYPLLGRLQNIAPGHV 520 Query: 947 SDTRGSQASGQLNQGAPEVGRLLLHQSPVQRENLEPNGGGVAGDDSHFPANAVPFVSGSS 768 S RG QASG+ + L + +S VQ ++ E + A D S AN+ ++ Sbjct: 521 SGERGPQASGEHPSSGAQPELLRIPESAVQHQSSEES----ARDGSLPNANSRQQERPNT 576 Query: 767 PPANDSTAYQG-VSAVFDSSQQNPLNNDDESRT-------QANDDAARGTESHDAARVDS 612 N S G DS +Q+P N RT Q + +++GT + D+ R + Sbjct: 577 RSVNISILAAGRTENNQDSERQSPSNVLQFLRTIFPGGEIQVEEASSQGT-ARDSVRGQA 635 Query: 611 E--------------QGVLFSNVLRHIMPLISQGREHGVSSTIPNSST 510 E QGV SN+L IMP ISQ G + P +T Sbjct: 636 EASNVAPAAETSITNQGVFLSNLLHQIMPYISQ--HAGSQRSTPEEAT 681 >gb|KJB64612.1| hypothetical protein B456_010G057100 [Gossypium raimondii] Length = 714 Score = 431 bits (1108), Expect = e-117 Identities = 305/708 (43%), Positives = 391/708 (55%), Gaps = 35/708 (4%) Frame = -3 Query: 2528 MASNEAAKVTPIGNVVEDSTTTVEIKIKTLDSQTYTLRVDKFVPVPALKEQIASVTGVLS 2349 M S A KV P +E S T+EIKIKTLDSQTYTLRVDK +PVPALKEQIASVTGVLS Sbjct: 1 MGSTSAHKV-PSDTEMEGSEATIEIKIKTLDSQTYTLRVDKQMPVPALKEQIASVTGVLS 59 Query: 2348 EQQRLICRGRVLKDDQLLSAYHVEDGHTLHLVARXXXXXXXXXTNQMESEAASGHPATS- 2172 EQQRLICRG+VLKDDQLLSAYHVEDGHTLHLV R S+ + H A Sbjct: 60 
EQQRLICRGKVLKDDQLLSAYHVEDGHTLHLVVRQPVPP--------SSDGSPYHSANDP 111 Query: 2171 SSGGAHIRSPIAHSFVFESVNV-DQGDMDTSVIGRIISSMLDSIGATSSGTGNT-SDLRE 1998 +SG + S A SFV E+ NV DQGD I RI+S++L S G + +GNT SD RE Sbjct: 112 ASGTSRGHSNHAPSFVIETFNVPDQGDGVPPEISRIVSAVLGSFGFANMASGNTGSDARE 171 Query: 1997 IISERLGRAASDGGLSDSTQPHPNPTNQ--QRERRQADVRFSSTVPLGPQNPTVIPDALT 1824 R + G+ DS+Q + Q +R + + V LG P VIPD+L Sbjct: 172 -------RTSGGSGMPDSSQAQTELASMTSQSDRAHSAFGLPAAVSLGSMQPPVIPDSLA 224 Query: 1823 TLSQYIGLMRDEFRREGFGSNASEQINNSDAADVRQGRSNASNSPLAQVGLPTPASLAEI 1644 TLSQY+ +R+EF G Q R S ASNS GLPTPASLAE+ Sbjct: 225 TLSQYLSHIRNEFDALGRAGGNDSQTAPMSRTGSRDSNS-ASNSGTVHEGLPTPASLAEV 283 Query: 1643 LLSARQLLMEQIEDCLSQLAGQLGDHTSITDPLTRMNMQSSAMRSGLLMRNLGSMFLELG 1464 LLS RQ+L+EQ + + QLA QL D ++TDP R+ Q++A+R+G L+ NLGS+ LELG Sbjct: 284 LLSTRQMLIEQAGESVQQLARQLEDQVNVTDPSARLIAQTNALRTGALLHNLGSLLLELG 343 Query: 1463 RATMTLRMGQSPAEAVVNAGPANFISANGPNPLMVQAVPFHPGSSPGVTNMGSVNSGHGL 1284 R TMTLR+GQ+P+EAVVNAGPA FIS +GPNPLMVQA+PF PG+S G +G+V G GL Sbjct: 344 RTTMTLRLGQTPSEAVVNAGPAVFISPSGPNPLMVQALPFQPGTSFGAIPVGTVQPGSGL 403 Query: 1283 EGEPPEAMFIPRNIEIRIRTGRAAPVAATNASEQAP----GQQEQEQMDPARN---STGV 1125 F+PR I+I+IR R + +A N E+ P GQ Q + + N T Sbjct: 404 VNGLGTG-FVPRRIDIQIR--RGSSMATPNTREEHPPNQSGQSNQSMVSDSENRSSQTTS 460 Query: 1124 NVVHQTFPGGDSGVRVMPLRTVV-AXXXXXXXXXXXXXXXXVRLFYPLLARVRQASTGTA 948 V G+SGVRV+P+RT+V A V ++YPLL R++ + G Sbjct: 461 RVSDTPSFAGESGVRVVPIRTMVAAVPAPLGRLPSESSGNSVGVYYPLLGRLQNIAPGHV 520 Query: 947 SDTRGSQASGQLNQGAPEVGRLLLHQSPVQRENLEPNGGGVAGDDSHFPANAVPFVSGSS 768 S RG QASG+ + L + +S VQ ++ E + A D S AN+ ++ Sbjct: 521 SGERGPQASGEHPSSGAQPELLRIPESAVQHQSSEES----ARDGSLPNANSRQQERPNT 576 Query: 767 PPANDSTAYQG-VSAVFDSSQQNPLNNDDESRT-------QANDDAARGTESHDAARVDS 612 N S G DS +Q+P N RT Q + +++GT + D+ R + Sbjct: 577 RSVNISILAAGRTENNQDSERQSPSNVLQFLRTIFPGGEIQVEEASSQGT-ARDSVRGQA 635 Query: 611 E--------------QGVLFSNVLRHIMPLISQGREHGVSSTIPNSST 510 E QGV SN+L IMP ISQ G + P +T Sbjct: 636 EASNVAPAAETSITNQGVFLSNLLHQIMPYISQ--HAGSQRSTPEEAT 681