BLASTX nr result
ID: Mentha22_contig00040609
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Mentha22_contig00040609 (1373 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|EYU21331.1| hypothetical protein MIMGU_mgv1a000085mg [Mimulus... 450 e-124 ref|XP_004246882.1| PREDICTED: uncharacterized protein LOC101258... 340 1e-90 ref|XP_006604941.1| PREDICTED: uncharacterized protein LOC100816... 337 1e-89 ref|XP_006604940.1| PREDICTED: uncharacterized protein LOC100816... 337 1e-89 ref|XP_006604939.1| PREDICTED: uncharacterized protein LOC100816... 337 1e-89 ref|XP_006604938.1| PREDICTED: uncharacterized protein LOC100816... 337 1e-89 ref|XP_006604937.1| PREDICTED: uncharacterized protein LOC100816... 337 1e-89 ref|XP_003553813.1| PREDICTED: uncharacterized protein LOC100816... 337 1e-89 ref|XP_007024605.1| B-block binding subunit of TFIIIC, putative ... 336 2e-89 ref|XP_007024604.1| B-block binding subunit of TFIIIC, putative ... 336 2e-89 ref|XP_006574488.1| PREDICTED: uncharacterized protein LOC100814... 334 5e-89 ref|XP_006574487.1| PREDICTED: uncharacterized protein LOC100814... 334 5e-89 ref|XP_006574486.1| PREDICTED: uncharacterized protein LOC100814... 334 5e-89 ref|XP_003519701.1| PREDICTED: uncharacterized protein LOC100814... 334 5e-89 ref|XP_002264494.2| PREDICTED: uncharacterized protein LOC100267... 334 6e-89 ref|XP_006465928.1| PREDICTED: uncharacterized protein LOC102628... 328 4e-87 ref|XP_006426643.1| hypothetical protein CICLE_v10024687mg [Citr... 326 1e-86 ref|XP_004499551.1| PREDICTED: uncharacterized protein LOC101494... 326 1e-86 ref|XP_004499550.1| PREDICTED: uncharacterized protein LOC101494... 326 1e-86 ref|XP_007217094.1| hypothetical protein PRUPE_ppa000094mg [Prun... 317 8e-84 >gb|EYU21331.1| hypothetical protein MIMGU_mgv1a000085mg [Mimulus guttatus] Length = 1865 Score = 450 bits (1157), Expect = e-124 Identities = 249/452 (55%), Positives = 301/452 (66%), Gaps = 7/452 (1%) Frame = +3 Query: 9 LGVENLSEQDKESYSLIHKRALSMLNPARPKKFFWSEEADRQLVIEYARYRAARGANFFR 188 LG + +E KE + HK+ALS L AR KKF W+EEADRQLVIEYAR+RAA GA + Sbjct: 1076 LGYDKGAELLKEDDEVHHKQALSRLKSARQKKFLWTEEADRQLVIEYARHRAALGAKYQG 1135 Query: 189 TDWVTISNLPAPPDACKRRMSILNNFIPFREAVMKLCTMLSEHYAKYLEKFQDKMLIHAN 368 DW ++ NLPAP +CKRRM+ L +IPFR+A+MKLC ML+E Y +YLEKFQ K L + Sbjct: 1136 VDWASLQNLPAPLQSCKRRMASLKRYIPFRKALMKLCNMLAERYRQYLEKFQSKTLNPGD 1195 Query: 369 SGQMIRGPASGE----SSAEMLEEWANFDEDSIKVALDDVMRCKRMAKLNAAQETFPGQE 536 +M+R AS + SSA M E WANFD+ IKVALD+V+R K+MAKL+ Q+T E Sbjct: 1196 PRKMVRDTASEKDSFCSSAPMSENWANFDDSVIKVALDNVLRYKKMAKLDTVQDTSSDHE 1255 Query: 537 NSEDDDMEECXXXXXXXXXXXXXXXIRKNLYVFTGARIFKQMHESVAVANAAELFKLIFL 716 + EDD E + + GA + K MHESVA+ANAAELFKLIFL Sbjct: 1256 DIEDDVFEGFDGKVSGQRSSAQHLSRKYMKLLSKGASVGKWMHESVAIANAAELFKLIFL 1315 Query: 717 SKSKAPEAPTLLAETLRRYSEHDLCAAFNCLREKKIMVGG-GNGQFELSQHFLHGITSSA 893 S S APE T LAETLRRYSEHDL AAFN LREKKIM+GG N F LSQ FL I+SS Sbjct: 1316 SNSMAPEVSTFLAETLRRYSEHDLFAAFNYLREKKIMIGGSSNSPFALSQPFLQSISSSK 1375 Query: 894 FPSDTGSRAAKLATWXXXXXXXXXXXXXXVPSDVQCGEVFSLCDLLSSGELSITPSLPIE 1073 FP+DTG RAAK ++W VP D+QCGEVF+LC L+ SGE+SIT LP E Sbjct: 1376 FPTDTGERAAKFSSWLHEKQKDLMEEGIDVPLDMQCGEVFTLCTLVYSGEVSITSCLPSE 1435 Query: 1074 GVGEAEDNR-PKRKSENTEPEVG-SSKKFRKTFAGDSEIISRREKGFPGIKLCLHRERIS 1247 GVGEAED R KRK + + + +SKK + F G+ E+I+RREKGFPGI LCLHRE++ Sbjct: 1436 GVGEAEDYRTSKRKWDGSVSDCAENSKKSKTPFTGEGELIARREKGFPGITLCLHREKLP 1495 Query: 1248 RSLAIDSFGKENMHPAPFLGGKDQTNASSGLD 1343 R LAIDSF E+M+ P GG DQ N SGLD Sbjct: 1496 RGLAIDSFKDEDMYTTPPFGGNDQNNTLSGLD 1527 >ref|XP_004246882.1| PREDICTED: uncharacterized protein LOC101258404 [Solanum lycopersicum] Length = 1854 Score = 340 bits (871), Expect = 1e-90 Identities = 197/441 (44%), Positives = 259/441 (58%), Gaps = 27/441 (6%) Frame = +3 Query: 24 LSEQDKESYSLIHKRALSMLNPARPKKFFWSEEADRQLVIEYARYRAARGANFFRTDWVT 203 L E D + + K ALS P R +F+W+++ DRQLVIEYAR+RA+ GA F R DW Sbjct: 1051 LPEDDGVGRAFLDKIALSRAKPTRKGRFWWTDDVDRQLVIEYARHRASLGAKFNRVDWGK 1110 Query: 204 ISNLPAPPDACKRRMSILNNFIPFREAVMKLCTMLSEHYAKYLEKFQDKMLIHANSGQMI 383 + NLPAPPDAC+RRM++L FR+++ +LC +LS+ Y YLEK +DK L H Sbjct: 1111 LHNLPAPPDACRRRMALLRTNRQFRKSITRLCNVLSQRYVDYLEKSKDKQLNHEGHQATQ 1170 Query: 384 RGPASGESSAEMLEEWANFDEDSIKVALDDVMRCKRMAKLNAAQETFPGQENSED----- 548 S+ + W NFD+ IK+AL+D +R K+++K ++ P +N+ D Sbjct: 1171 CCCLKNTSNFLAQDPWDNFDDADIKLALEDALRYKKISKSETFKDVHPFFDNNSDVNTDE 1230 Query: 549 -------------------DDMEECXXXXXXXXXXXXXXXIRKNLYVFTGARIFKQMHES 671 D+ E NL + G + K+++ES Sbjct: 1231 KDVSCGPQSVLPVSCGQYVDNFSENTEDSGTPISSNRIAQKYVNLTI-GGIPVSKRLYES 1289 Query: 672 VAVANAAELFKLIFLSKSKAPEAPTLLAETLRRYSEHDLCAAFNCLREKKIMVGG-GNGQ 848 AVANAAELFKLIFL SK+P PTLLAETLRRYSEHDL AAFN LREKK+++GG N Sbjct: 1290 AAVANAAELFKLIFLCSSKSPLVPTLLAETLRRYSEHDLFAAFNYLREKKVLIGGHSNCP 1349 Query: 849 FELSQHFLHGITSSAFPSDTGSRAAKLATWXXXXXXXXXXXXXXVPSDVQCGEVFSLCDL 1028 F LSQ FL+ I S FPSDTG RAAK A+W +P+D+QCG+V+ L L Sbjct: 1350 FVLSQTFLNCIEFSPFPSDTGKRAAKFASWLCEREKELIAEGVDLPTDLQCGDVYHLLAL 1409 Query: 1029 LSSGELSITPSLPIEGVGEAEDNR-PKRKSENTE-PEVGSSKKFRKTFAGDSEIISRREK 1202 LSSGELSI P LP EGVGE ED+R KRK++++E + KK + + A DSE+ SRR K Sbjct: 1410 LSSGELSIAPCLPDEGVGEVEDSRTSKRKNDDSEFSDSDRYKKLKTSMASDSELCSRRAK 1469 Query: 1203 GFPGIKLCLHRERISRSLAID 1265 GFPGI+LCL + R +D Sbjct: 1470 GFPGIRLCLRHATLPRIKIMD 1490 >ref|XP_006604941.1| PREDICTED: uncharacterized protein LOC100816444 isoform X7 [Glycine max] Length = 1491 Score = 337 bits (863), Expect = 1e-89 Identities = 197/457 (43%), Positives = 269/457 (58%), Gaps = 39/457 (8%) Frame = +3 Query: 54 LIHKRALSMLNPARPKKFFWSEEADRQLVIEYARYRAARGANFFRTDWVTISNLPAPPDA 233 LI +R L+ + P R ++F WS++ DRQLVI+Y ++RA GA + R DW +IS+LPA P A Sbjct: 687 LISQRVLTKMKPTRQRRFIWSDKTDRQLVIQYVKHRAVLGAKYHRIDWTSISDLPATPIA 746 Query: 234 CKRRMSILNNFIPFREAVMKLCTMLSEHYAKYLEKFQD-------KMLIHANSGQMIRGP 392 C RRM++LN+ + FR+AV KLC MLSE YAK LEK Q K + + S + I Sbjct: 747 CTRRMNLLNSNMRFRKAVNKLCNMLSERYAKQLEKSQHSSLNNDCKQFVRSQSCEGILNN 806 Query: 393 ASGESSAEML----EEWANFDEDSIKVALDDVMRCKRMAKL-----------------NA 509 +S ++ ++ E W +F+ +IK+ALD+++RCK MAKL NA Sbjct: 807 SSPDAEIQITSLNKEAWDDFENKNIKMALDEILRCKMMAKLGASSQKGQLQYDGWSDANA 866 Query: 510 AQETFPGQENSE------DDDMEECXXXXXXXXXXXXXXXIRKNLYVFTG--ARIFKQMH 665 + F QEN E D+++ + KN F ++ Q++ Sbjct: 867 NADGFESQENEEITSAIPCDNIQSHGKPHTFSAQRSRRRRLDKNFTRFLNNMVNVYGQVN 926 Query: 666 ESVAVANAAELFKLIFLSKSKAPEAPTLLAETLRRYSEHDLCAAFNCLREKKIMVGG-GN 842 ES+A++N ELFKL+FLS S P+AP LL + LRRYS+HDL AAFN L+EKK+MVGG GN Sbjct: 927 ESLAISNVVELFKLVFLSTSTDPQAPKLLDDILRRYSQHDLFAAFNYLKEKKVMVGGTGN 986 Query: 843 GQFELSQHFLHGITSSAFPSDTGSRAAKLATWXXXXXXXXXXXXXXVPSDVQCGEVFSLC 1022 +FELSQ FL ++ S FP +TG +A K + W + D+QCG++F L Sbjct: 987 ERFELSQQFLQSVSKSPFPFNTGKQAVKFSAWLEERGKDLTEVGTNLAEDLQCGDIFHLF 1046 Query: 1023 DLLSSGELSITPSLPIEGVGEAEDNR-PKRKSENTEPEVGSSKKFRKTFAG-DSEIISRR 1196 L+SSGELSI+P LP GVGEAED R KRKS+ TE K K+F G + EIISRR Sbjct: 1047 ALVSSGELSISPFLPDNGVGEAEDLRSAKRKSDTTESSYSDKAKKSKSFFGVEGEIISRR 1106 Query: 1197 EKGFPGIKLCLHRERISRSLAIDSFGKENMHPAPFLG 1307 EKGFPGI + HR ISR+ ++ F + + PF G Sbjct: 1107 EKGFPGIIISAHRTTISRADILNLFKDNDNYGQPFEG 1143 >ref|XP_006604940.1| PREDICTED: uncharacterized protein LOC100816444 isoform X6 [Glycine max] Length = 1502 Score = 337 bits (863), Expect = 1e-89 Identities = 197/457 (43%), Positives = 269/457 (58%), Gaps = 39/457 (8%) Frame = +3 Query: 54 LIHKRALSMLNPARPKKFFWSEEADRQLVIEYARYRAARGANFFRTDWVTISNLPAPPDA 233 LI +R L+ + P R ++F WS++ DRQLVI+Y ++RA GA + R DW +IS+LPA P A Sbjct: 698 LISQRVLTKMKPTRQRRFIWSDKTDRQLVIQYVKHRAVLGAKYHRIDWTSISDLPATPIA 757 Query: 234 CKRRMSILNNFIPFREAVMKLCTMLSEHYAKYLEKFQD-------KMLIHANSGQMIRGP 392 C RRM++LN+ + FR+AV KLC MLSE YAK LEK Q K + + S + I Sbjct: 758 CTRRMNLLNSNMRFRKAVNKLCNMLSERYAKQLEKSQHSSLNNDCKQFVRSQSCEGILNN 817 Query: 393 ASGESSAEML----EEWANFDEDSIKVALDDVMRCKRMAKL-----------------NA 509 +S ++ ++ E W +F+ +IK+ALD+++RCK MAKL NA Sbjct: 818 SSPDAEIQITSLNKEAWDDFENKNIKMALDEILRCKMMAKLGASSQKGQLQYDGWSDANA 877 Query: 510 AQETFPGQENSE------DDDMEECXXXXXXXXXXXXXXXIRKNLYVFTG--ARIFKQMH 665 + F QEN E D+++ + KN F ++ Q++ Sbjct: 878 NADGFESQENEEITSAIPCDNIQSHGKPHTFSAQRSRRRRLDKNFTRFLNNMVNVYGQVN 937 Query: 666 ESVAVANAAELFKLIFLSKSKAPEAPTLLAETLRRYSEHDLCAAFNCLREKKIMVGG-GN 842 ES+A++N ELFKL+FLS S P+AP LL + LRRYS+HDL AAFN L+EKK+MVGG GN Sbjct: 938 ESLAISNVVELFKLVFLSTSTDPQAPKLLDDILRRYSQHDLFAAFNYLKEKKVMVGGTGN 997 Query: 843 GQFELSQHFLHGITSSAFPSDTGSRAAKLATWXXXXXXXXXXXXXXVPSDVQCGEVFSLC 1022 +FELSQ FL ++ S FP +TG +A K + W + D+QCG++F L Sbjct: 998 ERFELSQQFLQSVSKSPFPFNTGKQAVKFSAWLEERGKDLTEVGTNLAEDLQCGDIFHLF 1057 Query: 1023 DLLSSGELSITPSLPIEGVGEAEDNR-PKRKSENTEPEVGSSKKFRKTFAG-DSEIISRR 1196 L+SSGELSI+P LP GVGEAED R KRKS+ TE K K+F G + EIISRR Sbjct: 1058 ALVSSGELSISPFLPDNGVGEAEDLRSAKRKSDTTESSYSDKAKKSKSFFGVEGEIISRR 1117 Query: 1197 EKGFPGIKLCLHRERISRSLAIDSFGKENMHPAPFLG 1307 EKGFPGI + HR ISR+ ++ F + + PF G Sbjct: 1118 EKGFPGIIISAHRTTISRADILNLFKDNDNYGQPFEG 1154 >ref|XP_006604939.1| PREDICTED: uncharacterized protein LOC100816444 isoform X5 [Glycine max] Length = 1774 Score = 337 bits (863), Expect = 1e-89 Identities = 197/457 (43%), Positives = 269/457 (58%), Gaps = 39/457 (8%) Frame = +3 Query: 54 LIHKRALSMLNPARPKKFFWSEEADRQLVIEYARYRAARGANFFRTDWVTISNLPAPPDA 233 LI +R L+ + P R ++F WS++ DRQLVI+Y ++RA GA + R DW +IS+LPA P A Sbjct: 1022 LISQRVLTKMKPTRQRRFIWSDKTDRQLVIQYVKHRAVLGAKYHRIDWTSISDLPATPIA 1081 Query: 234 CKRRMSILNNFIPFREAVMKLCTMLSEHYAKYLEKFQD-------KMLIHANSGQMIRGP 392 C RRM++LN+ + FR+AV KLC MLSE YAK LEK Q K + + S + I Sbjct: 1082 CTRRMNLLNSNMRFRKAVNKLCNMLSERYAKQLEKSQHSSLNNDCKQFVRSQSCEGILNN 1141 Query: 393 ASGESSAEML----EEWANFDEDSIKVALDDVMRCKRMAKL-----------------NA 509 +S ++ ++ E W +F+ +IK+ALD+++RCK MAKL NA Sbjct: 1142 SSPDAEIQITSLNKEAWDDFENKNIKMALDEILRCKMMAKLGASSQKGQLQYDGWSDANA 1201 Query: 510 AQETFPGQENSE------DDDMEECXXXXXXXXXXXXXXXIRKNLYVFTG--ARIFKQMH 665 + F QEN E D+++ + KN F ++ Q++ Sbjct: 1202 NADGFESQENEEITSAIPCDNIQSHGKPHTFSAQRSRRRRLDKNFTRFLNNMVNVYGQVN 1261 Query: 666 ESVAVANAAELFKLIFLSKSKAPEAPTLLAETLRRYSEHDLCAAFNCLREKKIMVGG-GN 842 ES+A++N ELFKL+FLS S P+AP LL + LRRYS+HDL AAFN L+EKK+MVGG GN Sbjct: 1262 ESLAISNVVELFKLVFLSTSTDPQAPKLLDDILRRYSQHDLFAAFNYLKEKKVMVGGTGN 1321 Query: 843 GQFELSQHFLHGITSSAFPSDTGSRAAKLATWXXXXXXXXXXXXXXVPSDVQCGEVFSLC 1022 +FELSQ FL ++ S FP +TG +A K + W + D+QCG++F L Sbjct: 1322 ERFELSQQFLQSVSKSPFPFNTGKQAVKFSAWLEERGKDLTEVGTNLAEDLQCGDIFHLF 1381 Query: 1023 DLLSSGELSITPSLPIEGVGEAEDNR-PKRKSENTEPEVGSSKKFRKTFAG-DSEIISRR 1196 L+SSGELSI+P LP GVGEAED R KRKS+ TE K K+F G + EIISRR Sbjct: 1382 ALVSSGELSISPFLPDNGVGEAEDLRSAKRKSDTTESSYSDKAKKSKSFFGVEGEIISRR 1441 Query: 1197 EKGFPGIKLCLHRERISRSLAIDSFGKENMHPAPFLG 1307 EKGFPGI + HR ISR+ ++ F + + PF G Sbjct: 1442 EKGFPGIIISAHRTTISRADILNLFKDNDNYGQPFEG 1478 >ref|XP_006604938.1| PREDICTED: uncharacterized protein LOC100816444 isoform X4 [Glycine max] Length = 1812 Score = 337 bits (863), Expect = 1e-89 Identities = 197/457 (43%), Positives = 269/457 (58%), Gaps = 39/457 (8%) Frame = +3 Query: 54 LIHKRALSMLNPARPKKFFWSEEADRQLVIEYARYRAARGANFFRTDWVTISNLPAPPDA 233 LI +R L+ + P R ++F WS++ DRQLVI+Y ++RA GA + R DW +IS+LPA P A Sbjct: 1008 LISQRVLTKMKPTRQRRFIWSDKTDRQLVIQYVKHRAVLGAKYHRIDWTSISDLPATPIA 1067 Query: 234 CKRRMSILNNFIPFREAVMKLCTMLSEHYAKYLEKFQD-------KMLIHANSGQMIRGP 392 C RRM++LN+ + FR+AV KLC MLSE YAK LEK Q K + + S + I Sbjct: 1068 CTRRMNLLNSNMRFRKAVNKLCNMLSERYAKQLEKSQHSSLNNDCKQFVRSQSCEGILNN 1127 Query: 393 ASGESSAEML----EEWANFDEDSIKVALDDVMRCKRMAKL-----------------NA 509 +S ++ ++ E W +F+ +IK+ALD+++RCK MAKL NA Sbjct: 1128 SSPDAEIQITSLNKEAWDDFENKNIKMALDEILRCKMMAKLGASSQKGQLQYDGWSDANA 1187 Query: 510 AQETFPGQENSE------DDDMEECXXXXXXXXXXXXXXXIRKNLYVFTG--ARIFKQMH 665 + F QEN E D+++ + KN F ++ Q++ Sbjct: 1188 NADGFESQENEEITSAIPCDNIQSHGKPHTFSAQRSRRRRLDKNFTRFLNNMVNVYGQVN 1247 Query: 666 ESVAVANAAELFKLIFLSKSKAPEAPTLLAETLRRYSEHDLCAAFNCLREKKIMVGG-GN 842 ES+A++N ELFKL+FLS S P+AP LL + LRRYS+HDL AAFN L+EKK+MVGG GN Sbjct: 1248 ESLAISNVVELFKLVFLSTSTDPQAPKLLDDILRRYSQHDLFAAFNYLKEKKVMVGGTGN 1307 Query: 843 GQFELSQHFLHGITSSAFPSDTGSRAAKLATWXXXXXXXXXXXXXXVPSDVQCGEVFSLC 1022 +FELSQ FL ++ S FP +TG +A K + W + D+QCG++F L Sbjct: 1308 ERFELSQQFLQSVSKSPFPFNTGKQAVKFSAWLEERGKDLTEVGTNLAEDLQCGDIFHLF 1367 Query: 1023 DLLSSGELSITPSLPIEGVGEAEDNR-PKRKSENTEPEVGSSKKFRKTFAG-DSEIISRR 1196 L+SSGELSI+P LP GVGEAED R KRKS+ TE K K+F G + EIISRR Sbjct: 1368 ALVSSGELSISPFLPDNGVGEAEDLRSAKRKSDTTESSYSDKAKKSKSFFGVEGEIISRR 1427 Query: 1197 EKGFPGIKLCLHRERISRSLAIDSFGKENMHPAPFLG 1307 EKGFPGI + HR ISR+ ++ F + + PF G Sbjct: 1428 EKGFPGIIISAHRTTISRADILNLFKDNDNYGQPFEG 1464 >ref|XP_006604937.1| PREDICTED: uncharacterized protein LOC100816444 isoform X3 [Glycine max] Length = 1813 Score = 337 bits (863), Expect = 1e-89 Identities = 197/457 (43%), Positives = 269/457 (58%), Gaps = 39/457 (8%) Frame = +3 Query: 54 LIHKRALSMLNPARPKKFFWSEEADRQLVIEYARYRAARGANFFRTDWVTISNLPAPPDA 233 LI +R L+ + P R ++F WS++ DRQLVI+Y ++RA GA + R DW +IS+LPA P A Sbjct: 1022 LISQRVLTKMKPTRQRRFIWSDKTDRQLVIQYVKHRAVLGAKYHRIDWTSISDLPATPIA 1081 Query: 234 CKRRMSILNNFIPFREAVMKLCTMLSEHYAKYLEKFQD-------KMLIHANSGQMIRGP 392 C RRM++LN+ + FR+AV KLC MLSE YAK LEK Q K + + S + I Sbjct: 1082 CTRRMNLLNSNMRFRKAVNKLCNMLSERYAKQLEKSQHSSLNNDCKQFVRSQSCEGILNN 1141 Query: 393 ASGESSAEML----EEWANFDEDSIKVALDDVMRCKRMAKL-----------------NA 509 +S ++ ++ E W +F+ +IK+ALD+++RCK MAKL NA Sbjct: 1142 SSPDAEIQITSLNKEAWDDFENKNIKMALDEILRCKMMAKLGASSQKGQLQYDGWSDANA 1201 Query: 510 AQETFPGQENSE------DDDMEECXXXXXXXXXXXXXXXIRKNLYVFTG--ARIFKQMH 665 + F QEN E D+++ + KN F ++ Q++ Sbjct: 1202 NADGFESQENEEITSAIPCDNIQSHGKPHTFSAQRSRRRRLDKNFTRFLNNMVNVYGQVN 1261 Query: 666 ESVAVANAAELFKLIFLSKSKAPEAPTLLAETLRRYSEHDLCAAFNCLREKKIMVGG-GN 842 ES+A++N ELFKL+FLS S P+AP LL + LRRYS+HDL AAFN L+EKK+MVGG GN Sbjct: 1262 ESLAISNVVELFKLVFLSTSTDPQAPKLLDDILRRYSQHDLFAAFNYLKEKKVMVGGTGN 1321 Query: 843 GQFELSQHFLHGITSSAFPSDTGSRAAKLATWXXXXXXXXXXXXXXVPSDVQCGEVFSLC 1022 +FELSQ FL ++ S FP +TG +A K + W + D+QCG++F L Sbjct: 1322 ERFELSQQFLQSVSKSPFPFNTGKQAVKFSAWLEERGKDLTEVGTNLAEDLQCGDIFHLF 1381 Query: 1023 DLLSSGELSITPSLPIEGVGEAEDNR-PKRKSENTEPEVGSSKKFRKTFAG-DSEIISRR 1196 L+SSGELSI+P LP GVGEAED R KRKS+ TE K K+F G + EIISRR Sbjct: 1382 ALVSSGELSISPFLPDNGVGEAEDLRSAKRKSDTTESSYSDKAKKSKSFFGVEGEIISRR 1441 Query: 1197 EKGFPGIKLCLHRERISRSLAIDSFGKENMHPAPFLG 1307 EKGFPGI + HR ISR+ ++ F + + PF G Sbjct: 1442 EKGFPGIIISAHRTTISRADILNLFKDNDNYGQPFEG 1478 >ref|XP_003553813.1| PREDICTED: uncharacterized protein LOC100816444 isoform X1 [Glycine max] gi|571560952|ref|XP_006604936.1| PREDICTED: uncharacterized protein LOC100816444 isoform X2 [Glycine max] Length = 1826 Score = 337 bits (863), Expect = 1e-89 Identities = 197/457 (43%), Positives = 269/457 (58%), Gaps = 39/457 (8%) Frame = +3 Query: 54 LIHKRALSMLNPARPKKFFWSEEADRQLVIEYARYRAARGANFFRTDWVTISNLPAPPDA 233 LI +R L+ + P R ++F WS++ DRQLVI+Y ++RA GA + R DW +IS+LPA P A Sbjct: 1022 LISQRVLTKMKPTRQRRFIWSDKTDRQLVIQYVKHRAVLGAKYHRIDWTSISDLPATPIA 1081 Query: 234 CKRRMSILNNFIPFREAVMKLCTMLSEHYAKYLEKFQD-------KMLIHANSGQMIRGP 392 C RRM++LN+ + FR+AV KLC MLSE YAK LEK Q K + + S + I Sbjct: 1082 CTRRMNLLNSNMRFRKAVNKLCNMLSERYAKQLEKSQHSSLNNDCKQFVRSQSCEGILNN 1141 Query: 393 ASGESSAEML----EEWANFDEDSIKVALDDVMRCKRMAKL-----------------NA 509 +S ++ ++ E W +F+ +IK+ALD+++RCK MAKL NA Sbjct: 1142 SSPDAEIQITSLNKEAWDDFENKNIKMALDEILRCKMMAKLGASSQKGQLQYDGWSDANA 1201 Query: 510 AQETFPGQENSE------DDDMEECXXXXXXXXXXXXXXXIRKNLYVFTG--ARIFKQMH 665 + F QEN E D+++ + KN F ++ Q++ Sbjct: 1202 NADGFESQENEEITSAIPCDNIQSHGKPHTFSAQRSRRRRLDKNFTRFLNNMVNVYGQVN 1261 Query: 666 ESVAVANAAELFKLIFLSKSKAPEAPTLLAETLRRYSEHDLCAAFNCLREKKIMVGG-GN 842 ES+A++N ELFKL+FLS S P+AP LL + LRRYS+HDL AAFN L+EKK+MVGG GN Sbjct: 1262 ESLAISNVVELFKLVFLSTSTDPQAPKLLDDILRRYSQHDLFAAFNYLKEKKVMVGGTGN 1321 Query: 843 GQFELSQHFLHGITSSAFPSDTGSRAAKLATWXXXXXXXXXXXXXXVPSDVQCGEVFSLC 1022 +FELSQ FL ++ S FP +TG +A K + W + D+QCG++F L Sbjct: 1322 ERFELSQQFLQSVSKSPFPFNTGKQAVKFSAWLEERGKDLTEVGTNLAEDLQCGDIFHLF 1381 Query: 1023 DLLSSGELSITPSLPIEGVGEAEDNR-PKRKSENTEPEVGSSKKFRKTFAG-DSEIISRR 1196 L+SSGELSI+P LP GVGEAED R KRKS+ TE K K+F G + EIISRR Sbjct: 1382 ALVSSGELSISPFLPDNGVGEAEDLRSAKRKSDTTESSYSDKAKKSKSFFGVEGEIISRR 1441 Query: 1197 EKGFPGIKLCLHRERISRSLAIDSFGKENMHPAPFLG 1307 EKGFPGI + HR ISR+ ++ F + + PF G Sbjct: 1442 EKGFPGIIISAHRTTISRADILNLFKDNDNYGQPFEG 1478 >ref|XP_007024605.1| B-block binding subunit of TFIIIC, putative isoform 2 [Theobroma cacao] gi|508779971|gb|EOY27227.1| B-block binding subunit of TFIIIC, putative isoform 2 [Theobroma cacao] Length = 1648 Score = 336 bits (861), Expect = 2e-89 Identities = 207/481 (43%), Positives = 276/481 (57%), Gaps = 34/481 (7%) Frame = +3 Query: 30 EQDKESYSLIHKRALSMLNPARPKKFFWSEEADRQLVIEYARYRAARGANFFRTDWVTIS 209 E+D + YSLI + A + P R K+F W++EADR+LV +YARYRAA GA F R DW +I+ Sbjct: 1072 EEDDDCYSLISQYAFPKMKPTRKKRFSWTDEADRELVTQYARYRAALGAKFHRVDWTSIA 1131 Query: 210 NLPAPPDACKRRMSILNNFIPFREAVMKLCTMLSEHYAKYLEKFQDK--------MLIHA 365 LPAPP AC RRM+ L I FR+A+MKLC MLSE Y +LEK Q++ L+ + Sbjct: 1132 GLPAPPRACARRMTSLKKSIKFRKALMKLCNMLSERYVIHLEKNQNRAFNNNDCGFLVRS 1191 Query: 366 NSGQMIRGPASGESSAEMLEEWANFDEDSIKVALDDVMRCKRMAKLNAAQ-------ETF 524 +S + G GE + E W +FD+ I+ AL+DV+R K++AKL A++ E Sbjct: 1192 SSVEFSSGIEHGEDAGFEEERWDDFDDRKIRRALEDVLRFKQIAKLEASKRVGSVSAEWS 1251 Query: 525 PGQENSED---------------DDMEECXXXXXXXXXXXXXXXIRKNLYVF--TGARIF 653 NSED +DM + L G + Sbjct: 1252 NMNMNSEDYNLQGPEMVSQTTLGEDMGTGAGQLKSSIQSSRHHRFHQKLVKLWNIGHGVG 1311 Query: 654 KQMHESVAVANAAELFKLIFLSKSKAPEAPTLLAETLRRYSEHDLCAAFNCLREKKIMVG 833 +Q+HES+AV+NA ELFKL+FLS S A P LLAETLRRYSEHDL AAF+ LR++KIM+G Sbjct: 1312 RQVHESLAVSNAVELFKLVFLSTSTAAPFPNLLAETLRRYSEHDLFAAFSYLRDRKIMIG 1371 Query: 834 GGNGQ-FELSQHFLHGITSSAFPSDTGSRAAKLATWXXXXXXXXXXXXXXVPSDVQCGEV 1010 G GQ F LSQ FLH I+ S FP +TG RAA + W + D+QCG++ Sbjct: 1372 GTCGQPFVLSQQFLHSISKSPFPRNTGKRAANFSAWLHQREKDLMQGGINLTEDLQCGDI 1431 Query: 1011 FSLCDLLSSGELSITPSLPIEGVGEAEDNRP-KRKSENTEPEVGSSKKFRKTFAGDSEII 1187 F L L+SSGELS++PSLP EGVGEAED R K ++E++E K K+ A + E + Sbjct: 1432 FHLFSLVSSGELSVSPSLPDEGVGEAEDLRSLKCRAEDSELCDADKAKKLKSIA-EGEFV 1490 Query: 1188 SRREKGFPGIKLCLHRERISRSLAIDSFGKENMHPAPFLGGKDQTNASSGLDVNSSPLQS 1367 SRREKGFPGI + ++ +S + A++ F E F G D+T + VN S S Sbjct: 1491 SRREKGFPGIMVSVYSSTVSTANALELFNDEETCTLAF--GNDETTSQK---VNISSTNS 1545 Query: 1368 D 1370 D Sbjct: 1546 D 1546 >ref|XP_007024604.1| B-block binding subunit of TFIIIC, putative isoform 1 [Theobroma cacao] gi|508779970|gb|EOY27226.1| B-block binding subunit of TFIIIC, putative isoform 1 [Theobroma cacao] Length = 1845 Score = 336 bits (861), Expect = 2e-89 Identities = 207/481 (43%), Positives = 276/481 (57%), Gaps = 34/481 (7%) Frame = +3 Query: 30 EQDKESYSLIHKRALSMLNPARPKKFFWSEEADRQLVIEYARYRAARGANFFRTDWVTIS 209 E+D + YSLI + A + P R K+F W++EADR+LV +YARYRAA GA F R DW +I+ Sbjct: 1072 EEDDDCYSLISQYAFPKMKPTRKKRFSWTDEADRELVTQYARYRAALGAKFHRVDWTSIA 1131 Query: 210 NLPAPPDACKRRMSILNNFIPFREAVMKLCTMLSEHYAKYLEKFQDK--------MLIHA 365 LPAPP AC RRM+ L I FR+A+MKLC MLSE Y +LEK Q++ L+ + Sbjct: 1132 GLPAPPRACARRMTSLKKSIKFRKALMKLCNMLSERYVIHLEKNQNRAFNNNDCGFLVRS 1191 Query: 366 NSGQMIRGPASGESSAEMLEEWANFDEDSIKVALDDVMRCKRMAKLNAAQ-------ETF 524 +S + G GE + E W +FD+ I+ AL+DV+R K++AKL A++ E Sbjct: 1192 SSVEFSSGIEHGEDAGFEEERWDDFDDRKIRRALEDVLRFKQIAKLEASKRVGSVSAEWS 1251 Query: 525 PGQENSED---------------DDMEECXXXXXXXXXXXXXXXIRKNLYVF--TGARIF 653 NSED +DM + L G + Sbjct: 1252 NMNMNSEDYNLQGPEMVSQTTLGEDMGTGAGQLKSSIQSSRHHRFHQKLVKLWNIGHGVG 1311 Query: 654 KQMHESVAVANAAELFKLIFLSKSKAPEAPTLLAETLRRYSEHDLCAAFNCLREKKIMVG 833 +Q+HES+AV+NA ELFKL+FLS S A P LLAETLRRYSEHDL AAF+ LR++KIM+G Sbjct: 1312 RQVHESLAVSNAVELFKLVFLSTSTAAPFPNLLAETLRRYSEHDLFAAFSYLRDRKIMIG 1371 Query: 834 GGNGQ-FELSQHFLHGITSSAFPSDTGSRAAKLATWXXXXXXXXXXXXXXVPSDVQCGEV 1010 G GQ F LSQ FLH I+ S FP +TG RAA + W + D+QCG++ Sbjct: 1372 GTCGQPFVLSQQFLHSISKSPFPRNTGKRAANFSAWLHQREKDLMQGGINLTEDLQCGDI 1431 Query: 1011 FSLCDLLSSGELSITPSLPIEGVGEAEDNRP-KRKSENTEPEVGSSKKFRKTFAGDSEII 1187 F L L+SSGELS++PSLP EGVGEAED R K ++E++E K K+ A + E + Sbjct: 1432 FHLFSLVSSGELSVSPSLPDEGVGEAEDLRSLKCRAEDSELCDADKAKKLKSIA-EGEFV 1490 Query: 1188 SRREKGFPGIKLCLHRERISRSLAIDSFGKENMHPAPFLGGKDQTNASSGLDVNSSPLQS 1367 SRREKGFPGI + ++ +S + A++ F E F G D+T + VN S S Sbjct: 1491 SRREKGFPGIMVSVYSSTVSTANALELFNDEETCTLAF--GNDETTSQK---VNISSTNS 1545 Query: 1368 D 1370 D Sbjct: 1546 D 1546 >ref|XP_006574488.1| PREDICTED: uncharacterized protein LOC100814813 isoform X4 [Glycine max] Length = 1570 Score = 334 bits (857), Expect = 5e-89 Identities = 196/457 (42%), Positives = 269/457 (58%), Gaps = 39/457 (8%) Frame = +3 Query: 54 LIHKRALSMLNPARPKKFFWSEEADRQLVIEYARYRAARGANFFRTDWVTISNLPAPPDA 233 LI +R L+ + P R ++F WS++ DRQLVI+Y ++RA GA + R DW +IS+LPA P A Sbjct: 1022 LISQRVLTKMKPTRLRRFIWSDKTDRQLVIQYVKHRAVLGAKYHRIDWASISDLPASPIA 1081 Query: 234 CKRRMSILNNFIPFREAVMKLCTMLSEHYAKYLEKFQ-------DKMLIHANSGQMIRGP 392 C RRM++LN+ + FR+AV KLC+MLSE YAK LEK Q K + + S + I Sbjct: 1082 CMRRMNLLNSNMRFRKAVNKLCSMLSERYAKQLEKSQYSSLNNDRKQFVRSQSCEGILNN 1141 Query: 393 ASGESSAEML----EEWANFDEDSIKVALDDVMRCKRMAKL-----------------NA 509 +S ++ ++ E W +F+ +IK+ LD+++RCK MAKL NA Sbjct: 1142 SSPDAEIQITSLNKEAWDDFENKNIKMVLDEILRCKMMAKLGASSQKGQLQYDGWSDANA 1201 Query: 510 AQETFPGQENSE------DDDMEECXXXXXXXXXXXXXXXIRKNLYVFTG--ARIFKQMH 665 + F QEN E D+++ + KN F ++ Q++ Sbjct: 1202 NADGFESQENEEITSAIPCDNIQSHGKPHTFSAQRSRRRRLDKNFTRFLNNMVNVYGQVN 1261 Query: 666 ESVAVANAAELFKLIFLSKSKAPEAPTLLAETLRRYSEHDLCAAFNCLREKKIMVGG-GN 842 ES+A++N ELFKL+FLS S P+AP LL + LRRYS+HDL AAFN L+EKK+MVGG GN Sbjct: 1262 ESLAISNVVELFKLVFLSTSTDPQAPKLLDDILRRYSQHDLFAAFNYLKEKKVMVGGTGN 1321 Query: 843 GQFELSQHFLHGITSSAFPSDTGSRAAKLATWXXXXXXXXXXXXXXVPSDVQCGEVFSLC 1022 +FELSQ FL ++ S FP +TG +A K + W + D+QCG++F L Sbjct: 1322 ERFELSQQFLQSVSKSPFPFNTGKQAVKFSAWLEERGKDLTEVGANLAEDLQCGDIFHLF 1381 Query: 1023 DLLSSGELSITPSLPIEGVGEAEDNR-PKRKSENTEPEVGSSKKFRKTFAG-DSEIISRR 1196 L+SSGELSI+P LP GVGEAED R KRKS+ TE K K+F G + EIISRR Sbjct: 1382 ALVSSGELSISPFLPDNGVGEAEDLRSAKRKSDTTESSYSDKAKKSKSFFGVEGEIISRR 1441 Query: 1197 EKGFPGIKLCLHRERISRSLAIDSFGKENMHPAPFLG 1307 EKGFPGI + HR ISR+ ++ F + + PF G Sbjct: 1442 EKGFPGIIISAHRTTISRADILNLFKDNDNYGQPFEG 1478 >ref|XP_006574487.1| PREDICTED: uncharacterized protein LOC100814813 isoform X3 [Glycine max] Length = 1812 Score = 334 bits (857), Expect = 5e-89 Identities = 196/457 (42%), Positives = 269/457 (58%), Gaps = 39/457 (8%) Frame = +3 Query: 54 LIHKRALSMLNPARPKKFFWSEEADRQLVIEYARYRAARGANFFRTDWVTISNLPAPPDA 233 LI +R L+ + P R ++F WS++ DRQLVI+Y ++RA GA + R DW +IS+LPA P A Sbjct: 1008 LISQRVLTKMKPTRLRRFIWSDKTDRQLVIQYVKHRAVLGAKYHRIDWASISDLPASPIA 1067 Query: 234 CKRRMSILNNFIPFREAVMKLCTMLSEHYAKYLEKFQ-------DKMLIHANSGQMIRGP 392 C RRM++LN+ + FR+AV KLC+MLSE YAK LEK Q K + + S + I Sbjct: 1068 CMRRMNLLNSNMRFRKAVNKLCSMLSERYAKQLEKSQYSSLNNDRKQFVRSQSCEGILNN 1127 Query: 393 ASGESSAEML----EEWANFDEDSIKVALDDVMRCKRMAKL-----------------NA 509 +S ++ ++ E W +F+ +IK+ LD+++RCK MAKL NA Sbjct: 1128 SSPDAEIQITSLNKEAWDDFENKNIKMVLDEILRCKMMAKLGASSQKGQLQYDGWSDANA 1187 Query: 510 AQETFPGQENSE------DDDMEECXXXXXXXXXXXXXXXIRKNLYVFTG--ARIFKQMH 665 + F QEN E D+++ + KN F ++ Q++ Sbjct: 1188 NADGFESQENEEITSAIPCDNIQSHGKPHTFSAQRSRRRRLDKNFTRFLNNMVNVYGQVN 1247 Query: 666 ESVAVANAAELFKLIFLSKSKAPEAPTLLAETLRRYSEHDLCAAFNCLREKKIMVGG-GN 842 ES+A++N ELFKL+FLS S P+AP LL + LRRYS+HDL AAFN L+EKK+MVGG GN Sbjct: 1248 ESLAISNVVELFKLVFLSTSTDPQAPKLLDDILRRYSQHDLFAAFNYLKEKKVMVGGTGN 1307 Query: 843 GQFELSQHFLHGITSSAFPSDTGSRAAKLATWXXXXXXXXXXXXXXVPSDVQCGEVFSLC 1022 +FELSQ FL ++ S FP +TG +A K + W + D+QCG++F L Sbjct: 1308 ERFELSQQFLQSVSKSPFPFNTGKQAVKFSAWLEERGKDLTEVGANLAEDLQCGDIFHLF 1367 Query: 1023 DLLSSGELSITPSLPIEGVGEAEDNR-PKRKSENTEPEVGSSKKFRKTFAG-DSEIISRR 1196 L+SSGELSI+P LP GVGEAED R KRKS+ TE K K+F G + EIISRR Sbjct: 1368 ALVSSGELSISPFLPDNGVGEAEDLRSAKRKSDTTESSYSDKAKKSKSFFGVEGEIISRR 1427 Query: 1197 EKGFPGIKLCLHRERISRSLAIDSFGKENMHPAPFLG 1307 EKGFPGI + HR ISR+ ++ F + + PF G Sbjct: 1428 EKGFPGIIISAHRTTISRADILNLFKDNDNYGQPFEG 1464 >ref|XP_006574486.1| PREDICTED: uncharacterized protein LOC100814813 isoform X2 [Glycine max] Length = 1813 Score = 334 bits (857), Expect = 5e-89 Identities = 196/457 (42%), Positives = 269/457 (58%), Gaps = 39/457 (8%) Frame = +3 Query: 54 LIHKRALSMLNPARPKKFFWSEEADRQLVIEYARYRAARGANFFRTDWVTISNLPAPPDA 233 LI +R L+ + P R ++F WS++ DRQLVI+Y ++RA GA + R DW +IS+LPA P A Sbjct: 1022 LISQRVLTKMKPTRLRRFIWSDKTDRQLVIQYVKHRAVLGAKYHRIDWASISDLPASPIA 1081 Query: 234 CKRRMSILNNFIPFREAVMKLCTMLSEHYAKYLEKFQ-------DKMLIHANSGQMIRGP 392 C RRM++LN+ + FR+AV KLC+MLSE YAK LEK Q K + + S + I Sbjct: 1082 CMRRMNLLNSNMRFRKAVNKLCSMLSERYAKQLEKSQYSSLNNDRKQFVRSQSCEGILNN 1141 Query: 393 ASGESSAEML----EEWANFDEDSIKVALDDVMRCKRMAKL-----------------NA 509 +S ++ ++ E W +F+ +IK+ LD+++RCK MAKL NA Sbjct: 1142 SSPDAEIQITSLNKEAWDDFENKNIKMVLDEILRCKMMAKLGASSQKGQLQYDGWSDANA 1201 Query: 510 AQETFPGQENSE------DDDMEECXXXXXXXXXXXXXXXIRKNLYVFTG--ARIFKQMH 665 + F QEN E D+++ + KN F ++ Q++ Sbjct: 1202 NADGFESQENEEITSAIPCDNIQSHGKPHTFSAQRSRRRRLDKNFTRFLNNMVNVYGQVN 1261 Query: 666 ESVAVANAAELFKLIFLSKSKAPEAPTLLAETLRRYSEHDLCAAFNCLREKKIMVGG-GN 842 ES+A++N ELFKL+FLS S P+AP LL + LRRYS+HDL AAFN L+EKK+MVGG GN Sbjct: 1262 ESLAISNVVELFKLVFLSTSTDPQAPKLLDDILRRYSQHDLFAAFNYLKEKKVMVGGTGN 1321 Query: 843 GQFELSQHFLHGITSSAFPSDTGSRAAKLATWXXXXXXXXXXXXXXVPSDVQCGEVFSLC 1022 +FELSQ FL ++ S FP +TG +A K + W + D+QCG++F L Sbjct: 1322 ERFELSQQFLQSVSKSPFPFNTGKQAVKFSAWLEERGKDLTEVGANLAEDLQCGDIFHLF 1381 Query: 1023 DLLSSGELSITPSLPIEGVGEAEDNR-PKRKSENTEPEVGSSKKFRKTFAG-DSEIISRR 1196 L+SSGELSI+P LP GVGEAED R KRKS+ TE K K+F G + EIISRR Sbjct: 1382 ALVSSGELSISPFLPDNGVGEAEDLRSAKRKSDTTESSYSDKAKKSKSFFGVEGEIISRR 1441 Query: 1197 EKGFPGIKLCLHRERISRSLAIDSFGKENMHPAPFLG 1307 EKGFPGI + HR ISR+ ++ F + + PF G Sbjct: 1442 EKGFPGIIISAHRTTISRADILNLFKDNDNYGQPFEG 1478 >ref|XP_003519701.1| PREDICTED: uncharacterized protein LOC100814813 isoform X1 [Glycine max] Length = 1826 Score = 334 bits (857), Expect = 5e-89 Identities = 196/457 (42%), Positives = 269/457 (58%), Gaps = 39/457 (8%) Frame = +3 Query: 54 LIHKRALSMLNPARPKKFFWSEEADRQLVIEYARYRAARGANFFRTDWVTISNLPAPPDA 233 LI +R L+ + P R ++F WS++ DRQLVI+Y ++RA GA + R DW +IS+LPA P A Sbjct: 1022 LISQRVLTKMKPTRLRRFIWSDKTDRQLVIQYVKHRAVLGAKYHRIDWASISDLPASPIA 1081 Query: 234 CKRRMSILNNFIPFREAVMKLCTMLSEHYAKYLEKFQ-------DKMLIHANSGQMIRGP 392 C RRM++LN+ + FR+AV KLC+MLSE YAK LEK Q K + + S + I Sbjct: 1082 CMRRMNLLNSNMRFRKAVNKLCSMLSERYAKQLEKSQYSSLNNDRKQFVRSQSCEGILNN 1141 Query: 393 ASGESSAEML----EEWANFDEDSIKVALDDVMRCKRMAKL-----------------NA 509 +S ++ ++ E W +F+ +IK+ LD+++RCK MAKL NA Sbjct: 1142 SSPDAEIQITSLNKEAWDDFENKNIKMVLDEILRCKMMAKLGASSQKGQLQYDGWSDANA 1201 Query: 510 AQETFPGQENSE------DDDMEECXXXXXXXXXXXXXXXIRKNLYVFTG--ARIFKQMH 665 + F QEN E D+++ + KN F ++ Q++ Sbjct: 1202 NADGFESQENEEITSAIPCDNIQSHGKPHTFSAQRSRRRRLDKNFTRFLNNMVNVYGQVN 1261 Query: 666 ESVAVANAAELFKLIFLSKSKAPEAPTLLAETLRRYSEHDLCAAFNCLREKKIMVGG-GN 842 ES+A++N ELFKL+FLS S P+AP LL + LRRYS+HDL AAFN L+EKK+MVGG GN Sbjct: 1262 ESLAISNVVELFKLVFLSTSTDPQAPKLLDDILRRYSQHDLFAAFNYLKEKKVMVGGTGN 1321 Query: 843 GQFELSQHFLHGITSSAFPSDTGSRAAKLATWXXXXXXXXXXXXXXVPSDVQCGEVFSLC 1022 +FELSQ FL ++ S FP +TG +A K + W + D+QCG++F L Sbjct: 1322 ERFELSQQFLQSVSKSPFPFNTGKQAVKFSAWLEERGKDLTEVGANLAEDLQCGDIFHLF 1381 Query: 1023 DLLSSGELSITPSLPIEGVGEAEDNR-PKRKSENTEPEVGSSKKFRKTFAG-DSEIISRR 1196 L+SSGELSI+P LP GVGEAED R KRKS+ TE K K+F G + EIISRR Sbjct: 1382 ALVSSGELSISPFLPDNGVGEAEDLRSAKRKSDTTESSYSDKAKKSKSFFGVEGEIISRR 1441 Query: 1197 EKGFPGIKLCLHRERISRSLAIDSFGKENMHPAPFLG 1307 EKGFPGI + HR ISR+ ++ F + + PF G Sbjct: 1442 EKGFPGIIISAHRTTISRADILNLFKDNDNYGQPFEG 1478 >ref|XP_002264494.2| PREDICTED: uncharacterized protein LOC100267761 [Vitis vinifera] Length = 1884 Score = 334 bits (856), Expect = 6e-89 Identities = 201/492 (40%), Positives = 284/492 (57%), Gaps = 40/492 (8%) Frame = +3 Query: 15 VENLS-EQDKESYSLIHKRALSMLNPARPKKFFWSEEADRQLVIEYARYRAARGANFFRT 191 VE L E+++E S + + A + + P R ++F W+E+ADRQLV++Y R+RAA GA F R Sbjct: 1063 VEELGPEEEQEDCSSVSQFAFTRMKPTRQRRFLWTEKADRQLVMQYVRHRAALGAKFHRI 1122 Query: 192 DWVTISNLPAPPDACKRRMSILNNFIPFREAVMKLCTMLSEHYAKYLEKFQDKMLIHANS 371 DW ++ +LP PP C +RM+ LN I FR+AVM+LC MLS+ YA +LEK +K+L + + Sbjct: 1123 DWSSLPDLPGPPGPCGKRMASLNTNIKFRKAVMRLCNMLSQRYANHLEKTPNKLL-NLDD 1181 Query: 372 GQMIRGPASG------------ESSAEMLEEWANFDEDSIKVALDDVMRCKRMAKLNAAQ 515 + +RG +G E+S E W +F++ +IK+ALD+V++CK M+K+ + + Sbjct: 1182 CRQVRGSLAGLNKNLSVGVEHAEASNSEGERWDDFEDKNIKIALDEVIQCKWMSKVESLK 1241 Query: 516 ETFPGQEN----------------------SEDDDMEECXXXXXXXXXXXXXXXIRKNLY 629 + E ED RK + Sbjct: 1242 QVRTLSEEWSNLNMDAEGNDPHKTKLVSTPGEDVQTHRGRQCGTSGRRSSRRCLPRKFIK 1301 Query: 630 VFTGA-RIFKQMHESVAVANAAELFKLIFLSKSKAPEAPTLLAETLRRYSEHDLCAAFNC 806 + + ++ HES+AV+NA ELFKL+FLS S APE P LLAETLRRYSEHDL +AFN Sbjct: 1302 ILNERISVTRRAHESLAVSNAVELFKLVFLSTSTAPEVPNLLAETLRRYSEHDLISAFNY 1361 Query: 807 LREKKIMVGG-GNGQFELSQHFLHGITSSAFPSDTGSRAAKLATWXXXXXXXXXXXXXXV 983 LREKKIMVGG G+ F LSQ FL ++SS FP+DTG RAAK A+W + Sbjct: 1362 LREKKIMVGGNGSDPFVLSQQFLQSVSSSPFPTDTGRRAAKFASWLHEREKDLTEEGINL 1421 Query: 984 PSDVQCGEVFSLCDLLSSGELSITPSLPIEGVGEAEDNR-PKRKSENTE-PEVGSSKKFR 1157 D+QCG++F L L+S GEL ++P LP EGVGEAED+R KRK+++ E V KK + Sbjct: 1422 SQDLQCGDIFHLFALVSLGELCLSPRLPDEGVGEAEDSRTSKRKTDSNESSNVNMIKKLK 1481 Query: 1158 KTFAGDSEIISRREKGFPGIKLCLHRERISRSLAIDSFGKENM-HPAPFLGGKDQTNASS 1334 + + EI+SRREKGFPGI + + R +SR+ +D F + A DQ + +S Sbjct: 1482 TSLVTEGEIVSRREKGFPGIMVSVSRATMSRTNVVDLFKDGKICTGAHDFEENDQWHVTS 1541 Query: 1335 GLDVNSSPLQSD 1370 ++SS SD Sbjct: 1542 DKKIDSSSSHSD 1553 >ref|XP_006465928.1| PREDICTED: uncharacterized protein LOC102628666 isoform X1 [Citrus sinensis] gi|568823033|ref|XP_006465929.1| PREDICTED: uncharacterized protein LOC102628666 isoform X2 [Citrus sinensis] Length = 1499 Score = 328 bits (840), Expect = 4e-87 Identities = 207/487 (42%), Positives = 279/487 (57%), Gaps = 31/487 (6%) Frame = +3 Query: 3 ERLGVENLSEQDKESYSLIHKRALSMLNPARPKKFFWSEEADRQLVIEYARYRAARGANF 182 E +G LS++D E +SL+ + A S L P+R K+F W++EADRQLVI+Y R+R+A GA F Sbjct: 688 EMVGEPGLSDEDDECHSLLSQLAFSKLRPSRQKRFSWTDEADRQLVIQYVRHRSALGAKF 747 Query: 183 FRTDWVTISNLPAPPDACKRRMSILNNFIPFREAVMKLCTMLSEHYAKYLEKFQDKMLIH 362 R DW ++ NLPA P AC RRMS L I FR+AVMKLC MLSE YAK+LEK Q+ + + Sbjct: 748 HRVDWASVPNLPASPGACARRMSSLKRSIQFRKAVMKLCNMLSERYAKHLEKIQNMSMDN 807 Query: 363 ANSGQMIRG------PASGESSAEMLEE-------WANFDEDSIKVALDDVMRCKRMAKL 503 +SG + R + +S E E+ W +FD+ I AL+ V+R K++AKL Sbjct: 808 IDSGVLRRSSFKEGLKLNSSNSVEHTEDAGFGKERWDDFDDKDIGSALEGVLRLKQIAKL 867 Query: 504 NAAQE-----------------TFPGQENSEDDDMEECXXXXXXXXXXXXXXXIRKNLYV 632 A++ P + ++ ME+ I K L Sbjct: 868 GASENVESIYEECSNNLEESGLASPTTFSDQNLGMEQHKDAARRTKYHHRHRKIIKLLNE 927 Query: 633 FTGARIFKQMHESVAVANAAELFKLIFLSKSKAPEAPTLLAETLRRYSEHDLCAAFNCLR 812 A K++ ES+AV++A ELFK++FLS S PE LLAETLRRYSEHDL AAF+ LR Sbjct: 928 RINAS--KEVFESLAVSSAIELFKIVFLSTSTTPELQNLLAETLRRYSEHDLFAAFSYLR 985 Query: 813 EKKIMVGGGNGQFELSQHFLHGITSSAFPSDTGSRAAKLATWXXXXXXXXXXXXXXVPSD 992 E+K M+GG F LSQ FL ++ S FP +TG RAAK ++W + +D Sbjct: 986 ERKFMIGGNGNPFVLSQLFLQSLSKSPFPMNTGKRAAKFSSWLHEKEKDLKAGGVNLNAD 1045 Query: 993 VQCGEVFSLCDLLSSGELSITPSLPIEGVGEAEDNR-PKRKSENTEPEVGSSKKFRKTFA 1169 +QCG++F L L+SSGEL I+P LP EGVGEAED R KRK+E E V K K+ Sbjct: 1046 LQCGDIFHLLALVSSGELYISPCLPDEGVGEAEDLRCLKRKNEEKELYVTDKGKKLKSLM 1105 Query: 1170 GDSEIISRREKGFPGIKLCLHRERISRSLAIDSFGKENMHPAPFLGGKDQTNASSGLDVN 1349 + E++SRREKGFPGI + + R IS + AI+ F K+ L G + +S + Sbjct: 1106 -EGELVSRREKGFPGIMVSVCRATISVANAIEMF-KDGQSCTGELHGNSEFKTTSEKNGG 1163 Query: 1350 SSPLQSD 1370 SS QSD Sbjct: 1164 SS-CQSD 1169 >ref|XP_006426643.1| hypothetical protein CICLE_v10024687mg [Citrus clementina] gi|557528633|gb|ESR39883.1| hypothetical protein CICLE_v10024687mg [Citrus clementina] Length = 1849 Score = 326 bits (836), Expect = 1e-86 Identities = 198/454 (43%), Positives = 265/454 (58%), Gaps = 31/454 (6%) Frame = +3 Query: 3 ERLGVENLSEQDKESYSLIHKRALSMLNPARPKKFFWSEEADRQLVIEYARYRAARGANF 182 E +G LS++D E +SL+ + A S L P+R K+F W++EADRQLVI+Y R+R+A GA F Sbjct: 1038 EMVGEPGLSDEDDECHSLLSQLAFSKLRPSRQKRFSWTDEADRQLVIQYVRHRSALGAKF 1097 Query: 183 FRTDWVTISNLPAPPDACKRRMSILNNFIPFREAVMKLCTMLSEHYAKYLEKFQDKMLIH 362 R DW ++ NLPA P AC RRMS L I FR+AVMKLC ML E YAK+LEK Q+ + + Sbjct: 1098 HRVDWASVPNLPASPGACARRMSSLKRSIQFRKAVMKLCNMLCERYAKHLEKIQNMSMDN 1157 Query: 363 ANSGQMIRG------PASGESSAEMLEE-------WANFDEDSIKVALDDVMRCKRMAKL 503 +SG + R + +S E E+ W +FD+ I AL+ V+R K+MAKL Sbjct: 1158 IDSGVLRRSSFKEGLKLNSSNSVEHTEDAGFGKERWDDFDDKDIGSALEGVLRLKQMAKL 1217 Query: 504 NAAQE-----------------TFPGQENSEDDDMEECXXXXXXXXXXXXXXXIRKNLYV 632 A++ P + ++ ME+ I K L Sbjct: 1218 GASENVESIYEECSNNLEESGLASPTTFSDQNLGMEQHKDAARRTKYHHRHRKIIKLLNE 1277 Query: 633 FTGARIFKQMHESVAVANAAELFKLIFLSKSKAPEAPTLLAETLRRYSEHDLCAAFNCLR 812 A K++ ES+AV++A ELFK++FLS S PE LLAETLRRYSEHDL AAF+ LR Sbjct: 1278 RINAS--KEVFESLAVSSAIELFKIVFLSTSTTPELQNLLAETLRRYSEHDLFAAFSYLR 1335 Query: 813 EKKIMVGGGNGQFELSQHFLHGITSSAFPSDTGSRAAKLATWXXXXXXXXXXXXXXVPSD 992 E+K M+GG F LSQ FL ++ S FP +TG RAAK ++W + +D Sbjct: 1336 ERKFMIGGNGNPFVLSQLFLQSLSKSPFPMNTGKRAAKFSSWLHEKEKDLKAGGVNLNAD 1395 Query: 993 VQCGEVFSLCDLLSSGELSITPSLPIEGVGEAEDNR-PKRKSENTEPEVGSSKKFRKTFA 1169 +QCG++F L L+SSGEL I+P LP EGVGEAED R KRK+E E V K K+ Sbjct: 1396 LQCGDIFHLLALVSSGELYISPCLPDEGVGEAEDLRCLKRKNEEKELYVTDKGKKLKSLM 1455 Query: 1170 GDSEIISRREKGFPGIKLCLHRERISRSLAIDSF 1271 + E++SRREKGFPGI + + R IS + AI+ F Sbjct: 1456 -EGELVSRREKGFPGIMVSVCRATISVANAIEMF 1488 >ref|XP_004499551.1| PREDICTED: uncharacterized protein LOC101494281 isoform X2 [Cicer arietinum] Length = 1794 Score = 326 bits (836), Expect = 1e-86 Identities = 185/447 (41%), Positives = 258/447 (57%), Gaps = 35/447 (7%) Frame = +3 Query: 72 LSMLNPARPKKFFWSEEADRQLVIEYARYRAARGANFFRTDWVTISNLPAPPDACKRRMS 251 L+ + P R +F WS++ DRQLVI+Y R+RAA GAN+ R DW ++S+LPAPP C RRM+ Sbjct: 1001 LTGMKPPRQSRFIWSDKTDRQLVIQYVRHRAALGANYHRIDWASLSDLPAPPRVCMRRMN 1060 Query: 252 ILNNFIPFREAVMKLCTMLSEHYAKYLEKFQD--------KMLIHANSGQMIRGPASGES 407 LN + FR+AV +LC MLSE YAK L+K Q+ ++ + + S + + + Sbjct: 1061 FLNGNLRFRKAVNRLCNMLSERYAKQLDKSQNLSSNKDDCRLFVQSQSSKGVHNSFCPDV 1120 Query: 408 SAEML----EEWANFDEDSIKVALDDVMRCKRMAKLNAAQETFPGQENSEDDDMEECXXX 575 +M E W +F+ SIK ALD+++RCK MAKL+A+ + Q + + Sbjct: 1121 DIQMSSLNGEAWDDFENKSIKTALDEILRCKTMAKLDASYQNVQSQNEGWNRYESQEHEK 1180 Query: 576 XXXXXXXXXXXXIRKNLYVFTGAR-------------------IFKQMHESVAVANAAEL 698 + + F+ R I+ Q+H+S+AV+NA EL Sbjct: 1181 TTSAIPSKIFQSHSEKAHTFSSQRSRHCRLDMKFSRFLNNRPSIYGQVHDSLAVSNAVEL 1240 Query: 699 FKLIFLSKSKAPEAPTLLAETLRRYSEHDLCAAFNCLREKKIMVGGGNG--QFELSQHFL 872 FKL+FLS + +P+AP LLA+ LR YSEHDL AAF+ LREKKIMVGG + +FELS FL Sbjct: 1241 FKLVFLSTATSPQAPNLLADILRHYSEHDLFAAFSYLREKKIMVGGSDSDERFELSLQFL 1300 Query: 873 HGITSSAFPSDTGSRAAKLATWXXXXXXXXXXXXXXVPSDVQCGEVFSLCDLLSSGELSI 1052 H ++ S FP DTG++A K + W + D+QCG+ F L L+SSGELSI Sbjct: 1301 HSVSKSPFPCDTGNQAVKFSAWLKERDKDLTEMGTDLAEDLQCGDTFHLLALISSGELSI 1360 Query: 1053 TPSLPIEGVGEAEDNR-PKRKSENTEPEVG-SSKKFRKTFAGDSEIISRREKGFPGIKLC 1226 +PSLP GVGEA D R KRKS+ + +KK + G+ EIISRREKGFPGI + Sbjct: 1361 SPSLPDNGVGEAGDLRSAKRKSDASGSSFNEKAKKLKSLSGGEGEIISRREKGFPGINIS 1420 Query: 1227 LHRERISRSLAIDSFGKENMHPAPFLG 1307 +HR +SR+ +D F + + F G Sbjct: 1421 VHRTAVSRADILDLFKDNDNNDQHFEG 1447 >ref|XP_004499550.1| PREDICTED: uncharacterized protein LOC101494281 isoform X1 [Cicer arietinum] Length = 1817 Score = 326 bits (836), Expect = 1e-86 Identities = 185/447 (41%), Positives = 258/447 (57%), Gaps = 35/447 (7%) Frame = +3 Query: 72 LSMLNPARPKKFFWSEEADRQLVIEYARYRAARGANFFRTDWVTISNLPAPPDACKRRMS 251 L+ + P R +F WS++ DRQLVI+Y R+RAA GAN+ R DW ++S+LPAPP C RRM+ Sbjct: 1024 LTGMKPPRQSRFIWSDKTDRQLVIQYVRHRAALGANYHRIDWASLSDLPAPPRVCMRRMN 1083 Query: 252 ILNNFIPFREAVMKLCTMLSEHYAKYLEKFQD--------KMLIHANSGQMIRGPASGES 407 LN + FR+AV +LC MLSE YAK L+K Q+ ++ + + S + + + Sbjct: 1084 FLNGNLRFRKAVNRLCNMLSERYAKQLDKSQNLSSNKDDCRLFVQSQSSKGVHNSFCPDV 1143 Query: 408 SAEML----EEWANFDEDSIKVALDDVMRCKRMAKLNAAQETFPGQENSEDDDMEECXXX 575 +M E W +F+ SIK ALD+++RCK MAKL+A+ + Q + + Sbjct: 1144 DIQMSSLNGEAWDDFENKSIKTALDEILRCKTMAKLDASYQNVQSQNEGWNRYESQEHEK 1203 Query: 576 XXXXXXXXXXXXIRKNLYVFTGAR-------------------IFKQMHESVAVANAAEL 698 + + F+ R I+ Q+H+S+AV+NA EL Sbjct: 1204 TTSAIPSKIFQSHSEKAHTFSSQRSRHCRLDMKFSRFLNNRPSIYGQVHDSLAVSNAVEL 1263 Query: 699 FKLIFLSKSKAPEAPTLLAETLRRYSEHDLCAAFNCLREKKIMVGGGNG--QFELSQHFL 872 FKL+FLS + +P+AP LLA+ LR YSEHDL AAF+ LREKKIMVGG + +FELS FL Sbjct: 1264 FKLVFLSTATSPQAPNLLADILRHYSEHDLFAAFSYLREKKIMVGGSDSDERFELSLQFL 1323 Query: 873 HGITSSAFPSDTGSRAAKLATWXXXXXXXXXXXXXXVPSDVQCGEVFSLCDLLSSGELSI 1052 H ++ S FP DTG++A K + W + D+QCG+ F L L+SSGELSI Sbjct: 1324 HSVSKSPFPCDTGNQAVKFSAWLKERDKDLTEMGTDLAEDLQCGDTFHLLALISSGELSI 1383 Query: 1053 TPSLPIEGVGEAEDNR-PKRKSENTEPEVG-SSKKFRKTFAGDSEIISRREKGFPGIKLC 1226 +PSLP GVGEA D R KRKS+ + +KK + G+ EIISRREKGFPGI + Sbjct: 1384 SPSLPDNGVGEAGDLRSAKRKSDASGSSFNEKAKKLKSLSGGEGEIISRREKGFPGINIS 1443 Query: 1227 LHRERISRSLAIDSFGKENMHPAPFLG 1307 +HR +SR+ +D F + + F G Sbjct: 1444 VHRTAVSRADILDLFKDNDNNDQHFEG 1470 >ref|XP_007217094.1| hypothetical protein PRUPE_ppa000094mg [Prunus persica] gi|462413244|gb|EMJ18293.1| hypothetical protein PRUPE_ppa000094mg [Prunus persica] Length = 1843 Score = 317 bits (812), Expect = 8e-84 Identities = 192/465 (41%), Positives = 272/465 (58%), Gaps = 41/465 (8%) Frame = +3 Query: 81 LNPARPKKFFWSEEADRQLVIEYARYRAARGANFFRTDWVTISNLPAPPDACKRRMSILN 260 L R ++F W+EEADRQL+I+Y R+RA G + R DW ++ +LPAPP C++RM++L Sbjct: 1105 LQSTRQRRFSWTEEADRQLIIQYVRHRATLGPKYHRIDWTSLPDLPAPPSTCQKRMALLK 1164 Query: 261 NFIPFREAVMKLCTMLSEHYAKYLEKFQDKMLIHANSGQMIRGPASGESSAEML------ 422 + FR AVM+LC ++ E YAK+LEK Q++ L + ++RG ++GE + L Sbjct: 1165 SNKRFRIAVMRLCNVIGERYAKFLEKTQNRSLTKDDCRLLLRG-STGEDNDRNLPNISNH 1223 Query: 423 --------EEWANFDEDSIKVALDDVMRCKRMAKLNAA-------QETFPGQENSEDDDM 557 E W +FD+++IK AL++V+ KRMAKL+A+ Q+ N+E+ D Sbjct: 1224 NQGTGVQEEPWDDFDDNNIKRALEEVLHYKRMAKLDASKRVGSTCQDWSDLNTNAEEYDP 1283 Query: 558 EECXXXXXXXXXXXXXXXIRKNLYV-----------------FTGARIFKQMHESVAVAN 686 +E + L + G + Q+++S+AV+N Sbjct: 1284 QESELIASTTPYEDVQNHSGRGLKISARRSCCQHLNEKFFKLLHGVNVSTQVYKSLAVSN 1343 Query: 687 AAELFKLIFLSKSKAPEAPTLLAETLRRYSEHDLCAAFNCLREKKIMVGGGNGQ-FELSQ 863 A ELFKL+FLS S APE P LLAE LRRYSE DL AAFN LR++KIMVGG + Q F LSQ Sbjct: 1344 AVELFKLVFLSISTAPEVPNLLAEILRRYSECDLFAAFNYLRDRKIMVGGNDSQHFSLSQ 1403 Query: 864 HFLHGITSSAFPSDTGSRAAKLATWXXXXXXXXXXXXXXVPSDVQCGEVFSLCDLLSSGE 1043 FLH I+ S FP+++G RA K A W + +D+QCG++F L L+SSGE Sbjct: 1404 QFLHNISMSPFPTNSGKRATKFAHWLREREKDLMEGGIDLSADLQCGDIFHLFALVSSGE 1463 Query: 1044 LSITPSLPIEGVGEAEDNR-PKRKSENTEPEVGSSKKFRKTF-AGDSEIISRREKGFPGI 1217 LSI+P LP EG+GEAED R KRK ++ E G K K+F A + EIISRREKGFPGI Sbjct: 1464 LSISPCLPDEGMGEAEDLRSSKRKIDSNEFLDGDKTKKLKSFVAAEGEIISRREKGFPGI 1523 Query: 1218 KLCLHRERISRSLAIDSFGKENMHPAPFLGGKDQTNASSGLDVNS 1352 K+ ++R S + A+D F + +GG Q +++ G ++ S Sbjct: 1524 KVSVYRASFSTADAVDLFTNDT-PCVKKIGGSYQLDSTCGQNILS 1567