BLASTX nr result
ID: Rehmannia28_contig00045787
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia28_contig00045787
         (771 letters)

Database: ./nr
           84,704,028 sequences; 31,038,470,784 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_008779954.1| PREDICTED: uncharacterized protein LOC103699...   264   1e-82
ref|XP_008777304.1| PREDICTED: uncharacterized protein LOC103697...   265   1e-82
ref|XP_007017136.1| Uncharacterized protein TCM_033758 [Theobrom...   197   2e-58
ref|XP_007033573.1| Uncharacterized protein TCM_019740 [Theobrom...   187   6e-56
ref|XP_008806397.1| PREDICTED: uncharacterized protein LOC103719...   177   6e-50
ref|XP_007029160.1| Uncharacterized protein isoform 1 [Theobroma...   171   6e-49
ref|XP_007024403.1| Uncharacterized protein TCM_028976 [Theobrom...   172   9e-49
ref|XP_007029161.1| Uncharacterized protein isoform 2 [Theobroma...   169   1e-48
emb|CAN65820.1| hypothetical protein VITISV_042324 [Vitis vinifera]   176   7e-47
ref|XP_012835096.1| PREDICTED: uncharacterized protein LOC105955...   169   9e-46
ref|XP_012833844.1| PREDICTED: uncharacterized protein LOC105954...   171   1e-45
ref|XP_007044837.1| Uncharacterized protein TCM_010591 [Theobrom...   162   5e-45
ref|XP_007049075.1| Uncharacterized protein TCM_002073 [Theobrom...   168   3e-44
gb|KHN28108.1| hypothetical protein glysoja_039628, partial [Gly...   156   8e-44
gb|KYP49735.1| Retrovirus-related Pol polyprotein from transposo...   162   9e-44
ref|XP_012856897.1| PREDICTED: uncharacterized protein LOC105976...   166   2e-43
ref|XP_013615493.1| PREDICTED: uncharacterized protein LOC106321...   159   2e-43
ref|XP_013674504.1| PREDICTED: uncharacterized protein LOC106379...
156 3e-43 gb|KYP59020.1| hypothetical protein KK1_014445 [Cajanus cajan] 155 5e-43 ref|XP_013691399.1| PREDICTED: uncharacterized protein LOC106395... 157 6e-43 >ref|XP_008779954.1| PREDICTED: uncharacterized protein LOC103699729, partial [Phoenix dactylifera] Length = 490 Score = 264 bits (675), Expect = 1e-82 Identities = 141/274 (51%), Positives = 187/274 (68%), Gaps = 18/274 (6%) Frame = -2 Query: 770 AQETWEKLKVRFSQPDNVRIFQLQQQLSSIVQGTNSVNEYFTQLNAVWEELRNYRPLPCC 591 A+E W KLK RF+QPDNVRI+QL+QQLSSI Q + SV+EYFTQLNA+WEELRNYRPLP C Sbjct: 117 AKEVWNKLKSRFAQPDNVRIYQLKQQLSSITQRSLSVSEYFTQLNAIWEELRNYRPLPYC 176 Query: 590 SCGLCTCNALRSLGEIQSSDYVFKFLMGLNESFEGIRGHILLMSLTPSLDKVFSMILQEE 411 SCG C C+AL+ +GE D++F+FLMGLN++++ +RG I+LMS PSLDK FS++LQEE Sbjct: 177 SCGHCICDALKGVGEDLELDHIFQFLMGLNDTYDTVRGQIILMSPLPSLDKTFSLVLQEE 236 Query: 410 RQREARVPFSPTMDSSSFAVNFVADXXXXXXSVVCEHCGITGHRRDKCFRLIGFPPNFKF 231 RQR+AR P +SS+ A V + + C HCG +GH ++KC+RLIGFPPNFKF Sbjct: 237 RQRQARAIIFPAPESSALAA--VLNKSKNRAEITCYHCGKSGHTKEKCYRLIGFPPNFKF 294 Query: 230 TKSKPRHFGQK----HSAH-LISSQENQGVSNEKQNEGNVPFTQDQIQKLMALINSDSMQ 66 TK+K K HSA+ +ISS + +G+S + + +Q QIQ+L+AL+NS Q Sbjct: 295 TKTKFPSVNNKSVAPHSANQVISSTQGKGLSAPQ-----LSLSQTQIQQLLALVNSGIPQ 349 Query: 65 LSQN-------------TPQSQSGNIHPHLSNMA 3 +S N TP +++GN SNMA Sbjct: 350 MSLNSASTQQEPILPMVTPTTETGNNSAPSSNMA 383 >ref|XP_008777304.1| PREDICTED: uncharacterized protein LOC103697258 [Phoenix dactylifera] Length = 514 Score = 265 bits (676), Expect = 1e-82 Identities = 137/254 (53%), Positives = 176/254 (69%), Gaps = 5/254 (1%) Frame = -2 Query: 767 QETWEKLKVRFSQPDNVRIFQLQQQLSSIVQGTNSVNEYFTQLNAVWEELRNYRPLPCCS 588 +E W KLK RF+QPDNVRI+QL+QQLSSI QGT SV+EYFTQLNA+WEELRNYRPLP CS Sbjct: 118 KEVWNKLKSRFAQPDNVRIYQLKQQLSSITQGTLSVSEYFTQLNAIWEELRNYRPLPYCS 177 Query: 587 CGLCTCNALRSLGEIQSSDYVFKFLMGLNESFEGIRGHILLMSLTPSLDKVFSMILQEER 408 CG C C+AL+ +GE DY+F+FLM LN +F+ +RG I+LMS PSLDK FS++LQEER Sbjct: 178 CGHCICDALKGVGENLELDYIFQFLMELNNTFDSVRGQIILMSPLPSLDKTFSLVLQEER 237 Query: 407 
QREARVPFSPTMDSSSFAVNFVADXXXXXXSVVCEHCGITGHRRDKCFRLIGFPPNFKFT 228 QR+AR P +SS+ A V + + C HCG GH R+KC+RLIGFPPNFKFT Sbjct: 238 QRQARAIIFPAPESSALAA--VLNKPKNKAKITCYHCGKPGHTREKCYRLIGFPPNFKFT 295 Query: 227 KSKPRHFGQK----HSAH-LISSQENQGVSNEKQNEGNVPFTQDQIQKLMALINSDSMQL 63 K+K K HSA+ +IS + +G++ + + +Q Q+Q+L AL+NS QL Sbjct: 296 KTKSPSVNNKSVASHSANQVISPTQGKGLAAPQ-----LSLSQAQVQQLFALVNSGITQL 350 Query: 62 SQNTPQSQSGNIHP 21 + N+ SQ I P Sbjct: 351 NLNSASSQQEPIPP 364 >ref|XP_007017136.1| Uncharacterized protein TCM_033758 [Theobroma cacao] gi|508722464|gb|EOY14361.1| Uncharacterized protein TCM_033758 [Theobroma cacao] Length = 328 Score = 197 bits (500), Expect = 2e-58 Identities = 104/232 (44%), Positives = 144/232 (62%), Gaps = 2/232 (0%) Frame = -2 Query: 770 AQETWEKLKVRFSQPDNVRIFQLQQQLSSIVQGTNSVNEYFTQLNAVWEELRNYRPLPCC 591 A E WE LK RFSQPD+ RI LQ L +I QGT SV+ YFT+LN +WEELRNYRPLP C Sbjct: 83 AYEVWETLKERFSQPDDARICNLQFNLYNISQGTRSVDAYFTELNCIWEELRNYRPLPHC 142 Query: 590 SCGLCTCNALRSLGEIQSSDYVFKFLMGLNESFEGIRGHILLMSLTPSLDKVFSMILQEE 411 SCG+C ++ + D VF+FL GLNESF +R IL+M PSL+K +++++++E Sbjct: 143 SCGICNSACFQTYIDQYQKDSVFRFLNGLNESFSALRSQILMMKPFPSLNKAYNLVIRDE 202 Query: 410 RQREARVPFSPTMDSSSFAVNFVADXXXXXXSVVCEHCGITGHRRDKCFRLIGFPPNFKF 231 QR + P ++SS+ A VVC +C GH +DKC+RLIGFPP+FKF Sbjct: 203 SQRNLYLHTMPIIESSAMA-TMTEGKVKSKVDVVCSYCHKKGHTKDKCYRLIGFPPDFKF 261 Query: 230 TKSK-PRHFGQKHSAHLISSQENQGVSNEK-QNEGNVPFTQDQIQKLMALIN 81 K K P G S + + ++ +E ++ ++ ++ QIQKLM+LIN Sbjct: 262 LKGKSPLKKGNVWSINNVGPVTSKEECDESTKSLSSLTLSKHQIQKLMSLIN 313 >ref|XP_007033573.1| Uncharacterized protein TCM_019740 [Theobroma cacao] gi|508712602|gb|EOY04499.1| Uncharacterized protein TCM_019740 [Theobroma cacao] Length = 211 Score = 187 bits (474), Expect = 6e-56 Identities = 91/209 (43%), Positives = 129/209 (61%) Frame = -2 Query: 770 AQETWEKLKVRFSQPDNVRIFQLQQQLSSIVQGTNSVNEYFTQLNAVWEELRNYRPLPCC 591 A + W+ LK FSQPD+ RI LQ L +I Q T V+ YFT+LN +WEEL+NYRPLP C Sbjct: 4 AADIWQTLKNHFSQPDDTRICNLQYSLCNITQDTRPVDSYFTKLNGIWEELKNYRPLPYC 
63 Query: 590 SCGLCTCNALRSLGEIQSSDYVFKFLMGLNESFEGIRGHILLMSLTPSLDKVFSMILQEE 411 CG CT + + E+ D VF+FL GLNESF +R HI+++ PSLD+ ++++L+EE Sbjct: 64 ECGKCTQSCFQKYIELWEKDRVFRFLNGLNESFSALRSHIIMIKPFPSLDEAYNLVLREE 123 Query: 410 RQREARVPFSPTMDSSSFAVNFVADXXXXXXSVVCEHCGITGHRRDKCFRLIGFPPNFKF 231 QR + P +D++ AV VVC HC GH ++KC+ +IGFPP+FKF Sbjct: 124 SQRSILMQSQPLLDTTVVAV-VTESKIRVKNEVVCSHCAKNGHVKEKCYCIIGFPPDFKF 182 Query: 230 TKSKPRHFGQKHSAHLISSQENQGVSNEK 144 TK K +F +K + + +S V N++ Sbjct: 183 TKGK-GNFSRKAMSAVANSTNQSQVENQE 210 >ref|XP_008806397.1| PREDICTED: uncharacterized protein LOC103719098 [Phoenix dactylifera] Length = 406 Score = 177 bits (449), Expect = 6e-50 Identities = 104/262 (39%), Positives = 154/262 (58%), Gaps = 19/262 (7%) Frame = -2 Query: 770 AQETWEKLKVRFSQPDNVRIFQLQQQLSSIVQGTNSVNEYFTQLNAVWEELRNYRPLPCC 591 A+E W L+ RFSQ + RIFQ+Q+ ++S+ Q +SV+ YFT+L +WEEL NYRP P C Sbjct: 24 AREIWNNLQERFSQGNGPRIFQIQKSIASLSQDQSSVSAYFTKLKGLWEELWNYRPNPIC 83 Query: 590 SCGLCTCNALRSLGEIQSSDYVFKFLMGLNESFEGIRGHILLMSLTPSLDKVFSMILQEE 411 S G A++ L E Q+ + +FLMGLN+S+ IRG ILL+ PS +KVFS++LQEE Sbjct: 84 SSG-----AMKQLIEYQNQECTMQFLMGLNDSYSQIRGQILLIDPLPSTNKVFSLVLQEE 138 Query: 410 RQREARVPFSPTMDSSSF--AVNF------------VADXXXXXXSVVCEHCGITGHRRD 273 +QRE P +P M+ ++F N+ A+ VC HCG+TGH ++ Sbjct: 139 KQREITSPVNPNMNIAAFLGRTNYNNAPSILAKYGAGANQFQRRERSVCSHCGVTGHTKE 198 Query: 272 KCFRLIGFPPNFKFTKSKPRHFGQKHSAHLISSQENQGVSNEKQNEGNVPFTQDQIQKLM 93 +C++L G+P +K +K+K G + + + SSQ S + N +PFT +Q Q+L+ Sbjct: 199 RCYKLHGYPRGYK-SKNK----GSQITVNQASSQTG---SKQFTNAPQLPFTVEQCQQLL 250 Query: 92 ALINSDSM-----QLSQNTPQS 42 A+IN S S TPQ+ Sbjct: 251 AMINHSSSSDAGHSNSSTTPQN 272 >ref|XP_007029160.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508717765|gb|EOY09662.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 263 Score = 171 bits (432), Expect = 6e-49 Identities = 94/254 (37%), Positives = 137/254 (53%), Gaps = 13/254 (5%) Frame = -2 Query: 770 AQETWEKLKVRFSQPDNVRIFQLQQQLSSIVQGTNSVNEYFTQLNAVWEELRNYRPLPCC 591 A E W 
LK F+QPD+ R+ LQ L ++ QG +V+ YF +L +WEELRNYRPLP C Sbjct: 4 AAEIWNTLKQNFAQPDDTRVCNLQYTLGNVSQGARTVDVYFIELKGIWEELRNYRPLPHC 63 Query: 590 SCGLCTCNALRSLGEIQSSDYVFKFLMGLNESFEGIRGHILLMSLTPSLDKVFSMILQEE 411 CG + + D VF+FL GLN+SF IR ILLM P LDKV+S+IL+EE Sbjct: 64 ECGSYNPGCFKKYTDQFQKDMVFRFLNGLNKSFSAIRSQILLMDPIPGLDKVYSLILREE 123 Query: 410 RQREARVPFSPTMDSSSFAVNFVAD-XXXXXXSVVCEHCGITGHRRDKCFRLIGFPPNFK 234 QR V P ++ SFA+ AD ++C HCG GH +DKC+++I F +FK Sbjct: 124 SQRNILVQPQPLLE--SFAMFTAADNKKKARKDIICNHCGKKGHTKDKCYKIISFLDDFK 181 Query: 233 FTKS------KPRHFGQKHSAHLISSQENQGVSNEKQNEGNVPFT------QDQIQKLMA 90 FTK K ++ A +S +++ K+ + + F + Q+ KLM Sbjct: 182 FTKGGRSNPRKGKNLVNNVFAVSDASTDSESQVETKEEQASAGFVCQLSMIKQQVNKLMQ 241 Query: 89 LINSDSMQLSQNTP 48 ++ + + ++ P Sbjct: 242 FLSENGISSNEGHP 255 >ref|XP_007024403.1| Uncharacterized protein TCM_028976 [Theobroma cacao] gi|508779769|gb|EOY27025.1| Uncharacterized protein TCM_028976 [Theobroma cacao] Length = 318 Score = 172 bits (435), Expect = 9e-49 Identities = 84/185 (45%), Positives = 109/185 (58%), Gaps = 2/185 (1%) Frame = -2 Query: 764 ETWEKLKVRFSQPDNVRIFQLQQQLSSIVQGTNSVNEYFTQLNAVWEELRNYRPLPCCSC 585 E W LK+ ++QPDN + LQ L S+ Q V YF +L +WEELRNYRPLP C C Sbjct: 122 EIWNTLKLNYAQPDNTCVCNLQYTLGSVTQRVKIVYAYFIELKCIWEELRNYRPLPHCEC 181 Query: 584 GLCTCNALRSLGEIQSSDYVFKFLMGLNESFEGIRGHILLMSLTPSLDKVFSMILQEERQ 405 G C N + + D VF+FL GLNESF IR I+LM PSLDKV+SM+L+EE Q Sbjct: 182 GKCNANCFKKFSDQYQKDMVFRFLNGLNESFSAIRSQIILMDPIPSLDKVYSMVLREESQ 241 Query: 404 REARVPFSPTMDSSSF--AVNFVADXXXXXXSVVCEHCGITGHRRDKCFRLIGFPPNFKF 231 + + P ++S + A N + C HCG GH ++KC+R+I FP +FKF Sbjct: 242 KNMFLQSQPFLESLAMLAATNV---KKKPMKDLTCTHCGKKGHVKEKCYRIIRFPEDFKF 298 Query: 230 TKSKP 216 TK KP Sbjct: 299 TKGKP 303 >ref|XP_007029161.1| Uncharacterized protein isoform 2 [Theobroma cacao] gi|508717766|gb|EOY09663.1| Uncharacterized protein isoform 2 [Theobroma cacao] Length = 260 Score = 169 bits (429), Expect = 1e-48 Identities = 97/260 (37%), Positives = 139/260 (53%), Gaps = 13/260 (5%) Frame = -2 
Query: 770 AQETWEKLKVRFSQPDNVRIFQLQQQLSSIVQGTNSVNEYFTQLNAVWEELRNYRPLPCC 591 A E W LK F+QPD+ R+ LQ L ++ QG +V+ YF +L +WEELRNYRPLP C Sbjct: 4 AAEIWNTLKQNFAQPDDTRVCNLQYTLGNVSQGARTVDVYFIELKGIWEELRNYRPLPHC 63 Query: 590 SCGLCTCNALRSLGEIQSSDYVFKFLMGLNESFEGIRGHILLMSLTPSLDKVFSMILQEE 411 CG + + D VF+FL GLN+SF IR ILLM P LDKV+S+IL+EE Sbjct: 64 ECGSYNPGCFKKYTDQFQKDMVFRFLNGLNKSFSAIRSQILLMDPIPGLDKVYSLILREE 123 Query: 410 RQREARVPFSPTMDSSSFAVNFVAD-XXXXXXSVVCEHCGITGHRRDKCFRLIGFPPNFK 234 QR V P ++ SFA+ AD ++C HCG GH +DKC+++I F +FK Sbjct: 124 SQRNILVQPQPLLE--SFAMFTAADNKKKARKDIICNHCGKKGHTKDKCYKIISFLDDFK 181 Query: 233 FTKS------KPRHFGQKHSAHLISSQENQGVSNEKQNEGNVPFT------QDQIQKLMA 90 FTK K ++ A +S +++ K+ + + F + Q+ KLM Sbjct: 182 FTKGGRSNPRKGKNLVNNVFAVSDASTDSESQVETKEEQASAGFVCQLSMIKQQVNKLMQ 241 Query: 89 LINSDSMQLSQNTPQSQSGN 30 ++ + +S N + S N Sbjct: 242 FLSENG--ISSNEGKGISSN 259 >emb|CAN65820.1| hypothetical protein VITISV_042324 [Vitis vinifera] Length = 1262 Score = 176 bits (447), Expect = 7e-47 Identities = 97/260 (37%), Positives = 145/260 (55%), Gaps = 17/260 (6%) Frame = -2 Query: 755 EKLKVRFSQPDNVRIFQLQQQLSSIVQGTNSVNEYFTQLNAVWEELRNYRPLPCCSCG-- 582 E+LK+R+ + D R+F L++ LSSI Q + S+ EYF++ A+W+E +YRP+P C CG Sbjct: 89 EELKIRYLRSDGPRVFSLEKSLSSISQNSKSITEYFSEFKALWDEYISYRPIPSCRCGNL 148 Query: 581 -LCTCNALRSLGEIQSSDYVFKFLMGLNESFEGIRGHILLMSLTPSLDKVFSMILQEERQ 405 C+CN L+ L + Q SDYV KFL+GL++S+ IR +LL S PS+ +VFS++LQEE Q Sbjct: 149 NRCSCNILKDLTDRQQSDYVMKFLVGLHDSYSAIRSQLLLQSPLPSMSRVFSLLLQEESQ 208 Query: 404 REARVPFSPTMDSSSFAV----------NFVADXXXXXXSVVCEHCGITGHRRDKCFRLI 255 R ++DS + N +C HCG +GH DKCF+LI Sbjct: 209 RSLTNAVGISIDSQAMVAEQSSRTVSTSNTQFTKQKGKSDAICSHCGYSGHLVDKCFQLI 268 Query: 254 GFPPNFKFTKSKPRHFGQKHSA----HLISSQENQGVSNEKQNEGNVPFTQDQIQKLMAL 87 G+PP +K + K F +A + + N V + + N+ F+Q+QIQ L+ L Sbjct: 269 GYPPRWKGPRGK--IFNSTPTAAKNFQRLPTANNTNVLEQNSSNSNMIFSQEQIQNLLTL 326 Query: 86 INSDSMQLSQNTPQSQSGNI 27 NS S + NT + N+ Sbjct: 327 ANSLS---NSNTNFNAXSNV 343 >ref|XP_012835096.1| PREDICTED: 
uncharacterized protein LOC105955841, partial [Erythranthe guttata] Length = 514 Score = 169 bits (427), Expect = 9e-46 Identities = 98/270 (36%), Positives = 139/270 (51%), Gaps = 14/270 (5%) Frame = -2 Query: 770 AQETWEKLKVRFSQPDNVRIFQLQQQLSSIVQGTNSVNEYFTQLNAVWEELRNYRPLPCC 591 ++E W+ LK RFSQ + RIFQL++ L+++ QG+ SVN YFT++ A+W+EL NYRP CC Sbjct: 105 SKEIWDDLKTRFSQTNGPRIFQLRRDLANLTQGSQSVNVYFTKVKAIWDELVNYRP--CC 162 Query: 590 SCGLCTCNALRSLGEIQSSDYVFKFLMGLNESFEGIRGHILLMSLTPSLDKVFSMILQEE 411 SCG C C L + +YV FLMGLNES RG ILLM P + KVF+ + QEE Sbjct: 163 SCGKCDCGGFEKLQAHYNQEYVMSFLMGLNESLASTRGQILLMDPLPPISKVFAFVSQEE 222 Query: 410 RQREARVPFSPTMDSSSFAV-----------NFVADXXXXXXSVVCEHCGITGHRRDKCF 264 RQR V S F+V F C HC + GH +KC+ Sbjct: 223 RQRSV-VSSHVESSGSVFSVKNEGFKRSINNQFYNTGFKKKERSFCTHCNMQGHTVEKCY 281 Query: 263 RLIGFPPNFKFTKSK---PRHFGQKHSAHLISSQENQGVSNEKQNEGNVPFTQDQIQKLM 93 +L G+PP++K KS+ P + + L S + GVS++ + T Q Q+ M Sbjct: 282 KLHGYPPSYKPQKSRFSSPANQVSGFDSSLDSHSSDSGVSSQHVDGYLQSMTPSQCQQFM 341 Query: 92 ALINSDSMQLSQNTPQSQSGNIHPHLSNMA 3 ++ +S Q + S H ++ A Sbjct: 342 SMFSSHMAAQQQQSAASAQPQSSAHGADTA 371 >ref|XP_012833844.1| PREDICTED: uncharacterized protein LOC105954710 [Erythranthe guttata] Length = 659 Score = 171 bits (432), Expect = 1e-45 Identities = 102/266 (38%), Positives = 144/266 (54%), Gaps = 17/266 (6%) Frame = -2 Query: 770 AQETWEKLKVRFSQPDNVRIFQLQQQLSSIVQGTNSVNEYFTQLNAVWEELRNYRPLPCC 591 ++E W+ LK RFSQ + RIFQL++ L+++ QG+ SVN YFT++ A+W+EL NYRP CC Sbjct: 105 SKEIWDDLKTRFSQTNGPRIFQLRRDLANLTQGSQSVNVYFTKVKAIWDELANYRP--CC 162 Query: 590 SCGLCTCNALRSLGEIQSSDYVFKFLMGLNESFEGIRGHILLMSLTPSLDKVFSMILQEE 411 SCG C C L + +YV FLMGLN+S RG ILLM P + KVF+ I QEE Sbjct: 163 SCGKCDCGGFEKLQAHYNQEYVMSFLMGLNDSLASTRGQILLMDPLPPISKVFAFISQEE 222 Query: 410 RQREARVPFSPTMDSSS--FAV-----------NFVADXXXXXXSVVCEHCGITGHRRDK 270 RQR S +DSS F+V F C HC + GH +K Sbjct: 223 RQRSV---VSSHVDSSGSVFSVKNEGFKRSINNQFYNPGLKKRERSFCTHCNMQGHTVEK 279 Query: 269 CFRLIGFPPNFKFTKSK-PRHFGQ--KHSAHLISSQENQGVSNEKQNEGNVPFTQDQIQK 99 C++L 
G+PP++K KS+ H Q + L S + GVS+++ + T Q Q+ Sbjct: 280 CYKLHGYPPSYKPQKSRFSSHVNQVSGFDSSLDSHSSDAGVSSQQVDGYLQSMTPSQCQQ 339 Query: 98 LMALINSD-SMQLSQNTPQSQSGNIH 24 M++ +S + Q Q+T Q + H Sbjct: 340 FMSMFSSHMAAQQQQSTASIQPQSAH 365 >ref|XP_007044837.1| Uncharacterized protein TCM_010591 [Theobroma cacao] gi|508708772|gb|EOY00669.1| Uncharacterized protein TCM_010591 [Theobroma cacao] Length = 336 Score = 162 bits (411), Expect = 5e-45 Identities = 94/232 (40%), Positives = 139/232 (59%), Gaps = 2/232 (0%) Frame = -2 Query: 770 AQETWEKLKVRFSQPDNVRIFQLQQQLSSIVQGTNSVNEYFTQLNAVWEELRNYRPLPCC 591 A+E E LK RFSQP I LQ QL +I+QGT SVN YFT+LN+VW+EL+N+RPLP C Sbjct: 108 AKEILETLKNRFSQPYETIICNLQFQLRNILQGTRSVNTYFTELNSVWQELKNFRPLPQC 167 Query: 590 SCGLCTCNALRSLGEIQSSDYVFKFLMGLNESFEGIRGHILLMSLTPSLDKVFSMILQEE 411 N + + Q+ D VF FL GLNESF +R HIL++ S+D+ +S+++++ Sbjct: 168 DYEGRKNNCYKKYADQQNKDAVFCFLNGLNESFSCLRSHILMLKPFLSIDQAYSLVIKKM 227 Query: 410 RQREARVPFSPTMDSSSFAVNFVADXXXXXXSVVCEHCGITGHRRDKCFRLIGFPPNFKF 231 QR + + SP +S+ V + + ++VC HCG GH ++K + +IGFP NFKF Sbjct: 228 LQR-SLILQSPVENSTMATV--ITEEKRKNTNLVCSHCGKKGHSKEKYYCIIGFPENFKF 284 Query: 230 TKSK--PRHFGQKHSAHLISSQENQGVSNEKQNEGNVPFTQDQIQKLMALIN 81 TK K R G ++ + S++++ + + T+ QIQKLM LI+ Sbjct: 285 TKLKRNMRKGGSSVNSAISGSEQDEYDETVTNSISQLSLTKAQIQKLMTLIS 336 >ref|XP_007049075.1| Uncharacterized protein TCM_002073 [Theobroma cacao] gi|508701336|gb|EOX93232.1| Uncharacterized protein TCM_002073 [Theobroma cacao] Length = 817 Score = 168 bits (426), Expect = 3e-44 Identities = 85/226 (37%), Positives = 126/226 (55%) Frame = -2 Query: 758 WEKLKVRFSQPDNVRIFQLQQQLSSIVQGTNSVNEYFTQLNAVWEELRNYRPLPCCSCGL 579 W LK ++QPD+ R+ LQ L +I QGT SV+ YF +L AV EE+R+YRPLP C CG Sbjct: 87 WNTLKQNYAQPDDTRLCNLQYTLGNITQGTRSVDSYFIELKAVREEIRSYRPLPHCECGR 146 Query: 578 CTCNALRSLGEIQSSDYVFKFLMGLNESFEGIRGHILLMSLTPSLDKVFSMILQEERQRE 399 C N + + D VF+FL GLNESF IR HI+LM P+LD+V++ +L+EE Q+ Sbjct: 147 CNANCFKRYIDQYHKDMVFRFLNGLNESFSAIRSHIILMDPIPTLDRVYNFMLREETQKN 206 Query: 398 
ARVPFSPTMDSSSFAVNFVADXXXXXXSVVCEHCGITGHRRDKCFRLIGFPPNFKFTKSK 219 ++SS+ + +VC HCG GH ++KC+RLIGFP +FKFT K Sbjct: 207 LLFQSQSVLESSTM-LTTTDSKKKLKKDLVCSHCGKKGHNKEKCYRLIGFPYDFKFTTRK 265 Query: 218 PRHFGQKHSAHLISSQENQGVSNEKQNEGNVPFTQDQIQKLMALIN 81 K + + +++ + + + + + Q +L+N Sbjct: 266 ANIKKGKTAVNNVTASNEISIDEFQVDSDGKGISSNSQQGKQSLVN 311 >gb|KHN28108.1| hypothetical protein glysoja_039628, partial [Glycine soja] Length = 230 Score = 156 bits (395), Expect = 8e-44 Identities = 80/187 (42%), Positives = 115/187 (61%), Gaps = 10/187 (5%) Frame = -2 Query: 767 QETWEKLKVRFSQPDNVRIFQLQQQLSSIVQGTNSVNEYFTQLNAVWEELRNYRPLPCCS 588 +E W+ LK F + + RIFQL++QL S+ QGT+ +N Y T+L ++WEEL Y+P C+ Sbjct: 49 KEIWDDLKTWFLRKNGPRIFQLKRQLMSLQQGTDDINTYHTKLKSIWEELTGYKPTFSCT 108 Query: 587 CGLCTCNALRSLGEIQSSDYVFKFLMGLNESFEGIRGHILLMSLTPSLDKVFSMILQEER 408 CG L+ + +YV FLMGLN+SF IRG ILL + PS+ VFS+ILQEE Sbjct: 109 CG-----GLQQIHTHHEFEYVMSFLMGLNDSFSQIRGQILLSNPLPSIGNVFSLILQEEA 163 Query: 407 QREARVPFSPT--MDSSSFAVNFVA--------DXXXXXXSVVCEHCGITGHRRDKCFRL 258 +RE V SPT +D+ +F+VN+V+ C HC + GH +DKC++L Sbjct: 164 KREIVVTHSPTNSLDNIAFSVNYVSKNQYENTKGKYIKKERPKCAHCDMLGHTKDKCYKL 223 Query: 257 IGFPPNF 237 +G+PPN+ Sbjct: 224 VGYPPNY 230 >gb|KYP49735.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 434 Score = 162 bits (409), Expect = 9e-44 Identities = 96/241 (39%), Positives = 134/241 (55%), Gaps = 10/241 (4%) Frame = -2 Query: 770 AQETWEKLKVRFSQPDNVRIFQLQQQLSSIVQGTNSVNEYFTQLNAVWEELRNYRPLPCC 591 A+E W+ LK RFS+ + RIF L++QL S+ QG++ V+ Y+T+L ++WEEL Y+P C Sbjct: 60 AKENWDDLKTRFSRKNGPRIFHLKRQLMSLQQGSDDVSTYYTKLKSIWEELAGYKPNFQC 119 Query: 590 SCGLCTCNALRSLGEIQSSDYVFKFLMGLNESFEGIRGHILLMSLTPSLDKVFSMILQEE 411 +CG L SL + S+YV FLMGLN+SF IRG ILL PS+ VFS+ILQEE Sbjct: 120 TCG-----GLESLHKHTQSEYVMSFLMGLNDSFSQIRGQILLSDPLPSIGNVFSLILQEE 174 Query: 410 RQREARVPF--SPTMDSSSFAVNFVA--------DXXXXXXSVVCEHCGITGHRRDKCFR 261 Q+E V S D +FAVN + + C HC + GH +DKC++ Sbjct: 175 
TQKEIAVTHATSAHSDDMAFAVNQCSKTNFDNNKGKFVKKDRLKCAHCEMFGHTKDKCYK 234 Query: 260 LIGFPPNFKFTKSKPRHFGQKHSAHLISSQENQGVSNEKQNEGNVPFTQDQIQKLMALIN 81 L+G+PPN+ F +P+ Q +H SN N T Q Q+LM L+N Sbjct: 235 LVGYPPNY-FKNRQPQVVNQVDISH------ESSTSNTALN-----LTPAQCQQLMTLLN 282 Query: 80 S 78 + Sbjct: 283 N 283 >ref|XP_012856897.1| PREDICTED: uncharacterized protein LOC105976150 [Erythranthe guttata] Length = 746 Score = 166 bits (419), Expect = 2e-43 Identities = 96/272 (35%), Positives = 142/272 (52%), Gaps = 25/272 (9%) Frame = -2 Query: 770 AQETWEKLKVRFSQPDNVRIFQLQQQLSSIVQGTNSVNEYFTQLNAVWEELRNYRPLPCC 591 A E W+ L RFSQ + RIFQL+++LS++ Q T SVN YFT+L A+W+EL N+RP C Sbjct: 103 AHEMWKDLNTRFSQTNGPRIFQLRRELSNLTQDTQSVNVYFTKLKAIWDELSNFRP--SC 160 Query: 590 SCGLCTCNALRSLGEIQSSDYVFKFLMGLNESFEGIRGHILLMSLTPSLDKVFSMILQEE 411 +CG CTC ++ L E + ++V FLMGLNES RG ILLM P ++KVF+++ QEE Sbjct: 161 TCGACTCGGVQKLNEHYNLEHVMAFLMGLNESLTSTRGQILLMDPLPPINKVFALVSQEE 220 Query: 410 RQREARVPFSPTMDSSSFAVNFVADXXXXXXSVV------------CEHCGITGHRRDKC 267 RQR + +S +F++ + V C HC I GH DKC Sbjct: 221 RQRSIHSSHNEVQNSLAFSIRGDQSVQRSVHNQVYTSAPKRKERGFCTHCNIYGHTIDKC 280 Query: 266 FRLIGFPPNFKFTKSKPRHFG---QKHSAHLISSQENQGVSNEKQNEGNVPFTQD----- 111 ++L G+PP + K+KPR+ + S + +++ E+ + PF Sbjct: 281 YKLHGYPPGY---KAKPRYSSLPQSRFSVNQVAAMESPLDYATSGSTSQPPFVSSDPVLA 337 Query: 110 -----QIQKLMALINSDSMQLSQNTPQSQSGN 30 Q Q+LMA ++ Q + Q G+ Sbjct: 338 NMSAAQCQQLMAYFSNQMAAKKQVSTQQSHGD 369 >ref|XP_013615493.1| PREDICTED: uncharacterized protein LOC106321802, partial [Brassica oleracea var. 
oleracea] Length = 353 Score = 159 bits (402), Expect = 2e-43 Identities = 89/234 (38%), Positives = 125/234 (53%), Gaps = 7/234 (2%) Frame = -2 Query: 770 AQETWEKLKVRFSQPDNVRIFQLQQQLSSIVQGTNSVNEYFTQLNAVWEELRNYRPLPCC 591 A+ W+ L RF Q D RIF+++Q+LS+I QG+ V+ Y+T+L +WEE +NY LP C Sbjct: 108 AELIWKNLMSRFKQDDAPRIFEIEQKLSNIQQGSLDVSTYYTELVTLWEEFQNYVDLPVC 167 Query: 590 SCGLCTCNALRSLGEIQSSDYVFKFLMGLNESFEGIRGHILLMSLTPSLDKVFSMILQEE 411 +CG C CNA S IQ V KFLMGLNES++ R HIL++ PS+++VF+M+ Q+E Sbjct: 168 TCGKCECNAAASWELIQQRSRVTKFLMGLNESYDATRRHILMLKPIPSIEEVFNMVAQDE 227 Query: 410 RQREAR-------VPFSPTMDSSSFAVNFVADXXXXXXSVVCEHCGITGHRRDKCFRLIG 252 RQ+ R V F + S+ A VC HCG+ GH KCF+L G Sbjct: 228 RQKIIRPSLKTDSVVFQTSATESASPHYAAAVAYRPKQRPVCTHCGMAGHIVQKCFKLHG 287 Query: 251 FPPNFKFTKSKPRHFGQKHSAHLISSQENQGVSNEKQNEGNVPFTQDQIQKLMA 90 +PP +F + SSQ+ + Q+ G V + Q Q A Sbjct: 288 YPPGHRFYNTN------------ASSQQRLSAPSNNQSRGPVSQSSHQHQSTTA 329 >ref|XP_013674504.1| PREDICTED: uncharacterized protein LOC106379017 [Brassica napus] gi|923870047|ref|XP_013709354.1| PREDICTED: uncharacterized protein LOC106413059 [Brassica napus] Length = 267 Score = 156 bits (394), Expect = 3e-43 Identities = 86/237 (36%), Positives = 127/237 (53%), Gaps = 26/237 (10%) Frame = -2 Query: 770 AQETWEKLKVRFSQPDNVRIFQLQQQLSSIVQGTNSVNEYFTQLNAVWEELRNYRPLPCC 591 A+ W+ + RF Q D R+++++QQLSSI QG+ V+ Y+T L +WEE +NY LP C Sbjct: 27 AEAIWKNILSRFKQDDAPRVYEIEQQLSSIQQGSMDVSAYYTALVTLWEEHKNYVELPVC 86 Query: 590 SCGLCTCNALRSLGEIQSSDYVFKFLMGLNESFEGIRGHILLMSLTPSLDKVFSMILQEE 411 SCG C CNA +Q V KFLMGLNES+E R HIL++ P+++ VF+++ Q+E Sbjct: 87 SCGKCECNAAELWERLQQRSRVTKFLMGLNESYESTRRHILMLKPIPTIEDVFNLVTQDE 146 Query: 410 RQREAR-----VPFS----------PTM-----DSSSFAVNFVADXXXXXXSVVCEHCGI 291 RQR + +P + PT+ D S+FA +C +C Sbjct: 147 RQRGIKPSTTSIPVALQASGPTESLPTIDVVAPDHSAFATTHNNSGYRPKQRPLCTYCNQ 206 Query: 290 TGHRRDKCFRLIGFPPNFKFTKSKPRHFG------QKHSAHLISSQENQGVSNEKQN 138 GH DKCFRL G+PP K+ KS + G + + Q +Q ++++QN Sbjct: 207 
LGHVVDKCFRLHGYPPGHKYNKSSHPNAGFAPRGQNNYQQRPVQQQNSQYFASQQQN 263 >gb|KYP59020.1| hypothetical protein KK1_014445 [Cajanus cajan] Length = 262 Score = 155 bits (392), Expect = 5e-43 Identities = 80/186 (43%), Positives = 111/186 (59%), Gaps = 8/186 (4%) Frame = -2 Query: 770 AQETWEKLKVRFSQPDNVRIFQLQQQLSSIVQGTNSVNEYFTQLNAVWEELRNYRPLPCC 591 AQE W+ LK RFS+ + RIFQL+ QL S+ QG + ++ Y+T+L ++WEEL Y+P C Sbjct: 60 AQEIWDDLKTRFSRKNGPRIFQLRCQLMSLHQGMDDISTYYTKLKSIWEELSGYKPTFQC 119 Query: 590 SCGLCTCNALRSLGEIQSSDYVFKFLMGLNESFEGIRGHILLMSLTPSLDKVFSMILQEE 411 +CG L+ L S+YV FLMGLN+SF IRG ILL PS++ FS++LQ+E Sbjct: 120 TCG-----GLQQLQSFTESEYVMSFLMGLNDSFSQIRGQILLSDPLPSIENFFSLVLQDE 174 Query: 410 RQREARVPFSPTM---DSSSFAVNFVADXXXXXXSVV-----CEHCGITGHRRDKCFRLI 255 QRE V SP + D+ +F VN C HC I GH +D C++L+ Sbjct: 175 AQREIAVTSSPPVANSDNIAFTVNSSQPATSRNRFTKKERPRCAHCDILGHTKDTCYKLV 234 Query: 254 GFPPNF 237 G+PPN+ Sbjct: 235 GYPPNY 240 >ref|XP_013691399.1| PREDICTED: uncharacterized protein LOC106395505 [Brassica napus] Length = 344 Score = 157 bits (398), Expect = 6e-43 Identities = 80/200 (40%), Positives = 110/200 (55%), Gaps = 17/200 (8%) Frame = -2 Query: 770 AQETWEKLKVRFSQPDNVRIFQLQQQLSSIVQGTNSVNEYFTQLNAVWEELRNYRPLPCC 591 A+ W+ + RF Q D R++++ Q+LSSI QG++ V Y+T L +WEE +NY LP C Sbjct: 106 AESIWKNILSRFKQDDAPRVYEIDQKLSSIQQGSDDVTTYYTALVTLWEEHKNYVELPVC 165 Query: 590 SCGLCTCNALRSLGEIQSSDYVFKFLMGLNESFEGIRGHILLMSLTPSLDKVFSMILQEE 411 SCG C CNA +Q V KFLMGLNES+E HIL++ P +++VF+++ Q+E Sbjct: 166 SCGKCECNAAELWERLQERSRVTKFLMGLNESYESTHRHILMLKPIPPIEEVFNLVTQDE 225 Query: 410 RQREARVPFSPTM-----------------DSSSFAVNFVADXXXXXXSVVCEHCGITGH 282 RQR + +PT D S+FA +C +CG GH Sbjct: 226 RQRAIKPSSTPTSVVFQASGPDETLLSAPPDHSAFAAAHANSGYRPKQRPLCTYCGQLGH 285 Query: 281 RRDKCFRLIGFPPNFKFTKS 222 DKCFRL G+PP KF KS Sbjct: 286 IVDKCFRLHGYPPGHKFNKS 305