BLASTX nr result
ID: Cinnamomum23_contig00007613
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Cinnamomum23_contig00007613
         (1547 letters)

Database: ./nr
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done

                                                                     Score   E
Sequences producing significant alignments:                          (bits)  Value

ref|XP_010241959.1| PREDICTED: uncharacterized protein LOC104586...  509   e-141
ref|XP_010241958.1| PREDICTED: uncharacterized protein LOC104586...  509   e-141
ref|XP_010934861.1| PREDICTED: uncharacterized protein LOC105054...  489   e-135
ref|XP_009408148.1| PREDICTED: uncharacterized protein LOC103990...  485   e-134
ref|XP_008810464.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri...  473   e-130
ref|XP_003529319.1| PREDICTED: uncharacterized protein LOC100778...  462   e-127
ref|XP_006467813.1| PREDICTED: uncharacterized protein LOC102631...  459   e-126
gb|KJB69499.1| hypothetical protein B456_011G026900 [Gossypium r...  459   e-126
ref|XP_012454722.1| PREDICTED: uncharacterized protein LOC105776...  459   e-126
gb|KDO75937.1| hypothetical protein CISIN_1g003258mg [Citrus sin...  457   e-126
gb|KDO75936.1| hypothetical protein CISIN_1g003258mg [Citrus sin...  457   e-126
gb|KDO75935.1| hypothetical protein CISIN_1g003258mg [Citrus sin...  457   e-126
gb|KDO75934.1| hypothetical protein CISIN_1g003258mg [Citrus sin...  457   e-126
ref|XP_007025648.1| DNA mismatch repair protein MutS isoform 1 [...  457   e-125
ref|XP_007159320.1| hypothetical protein PHAVU_002G228200g [Phas...  456   e-125
ref|XP_002305805.1| DNA mismatch repair MutS family protein [Pop...  456   e-125
gb|KHG26053.1| MutS2 [Gossypium arboreum]                            455   e-125
ref|XP_007025649.1| DNA mismatch repair protein MutS, type 2, pu...  455   e-125
ref|XP_004505047.1| PREDICTED: uncharacterized protein LOC101503...
454 e-125 ref|XP_009795021.1| PREDICTED: DNA mismatch repair protein MSH2 ... 452 e-124 >ref|XP_010241959.1| PREDICTED: uncharacterized protein LOC104586426 isoform X2 [Nelumbo nucifera] Length = 908 Score = 509 bits (1312), Expect = e-141 Identities = 279/466 (59%), Positives = 340/466 (72%), Gaps = 3/466 (0%) Frame = -1 Query: 1391 HKSIKLTNVKSKSSLDKIRVXXXXXXXXXXXXEWQSICXXXXXXXXXXXXXXXXXXGNLP 1212 H I+ +++ + K++V EW S+C G LP Sbjct: 22 HGFIRKSSLTNSPGSSKVKVAEDLQKESEEILEWHSVCRQVSAFTSTSMGLSIAREGKLP 81 Query: 1211 FGRDREESQKLLEQTTAAFLLPQPLDFSGIEDLSEIVGSAVSGELLTVRQLCAVARSLQS 1032 FGR +ESQKLL QTTAA LLP+PLDFSGIEDLSEIV S+V G+L T+R+LCAV R+LQS Sbjct: 82 FGRSLQESQKLLNQTTAAMLLPRPLDFSGIEDLSEIVSSSVVGQLRTIRELCAVKRTLQS 141 Query: 1031 ARGVLEQLEKMSSENQGDSQWYSPLLEILKNCNFLTELEQKIGFCIDCSLSVVKDQASEK 852 AR + EQLE+ S ++Y PL+EIL+NCNFLTELEQKIGFCIDC+LSVV D+ASE Sbjct: 142 ARELFEQLEEASLNGDSSDRYY-PLIEILQNCNFLTELEQKIGFCIDCNLSVVLDRASED 200 Query: 851 LGFIRSERKRNMESLESLLKVVSTRIFQAGGIDSPLVTKRRSRMCVGIRASRRSLLNGGV 672 L IRSERKRNM++LESLLK V+T+IF+AGGIDSPL+TKRRSRMCVGI+AS +SLL G+ Sbjct: 201 LQIIRSERKRNMDNLESLLKEVATQIFRAGGIDSPLITKRRSRMCVGIKASYKSLLPDGI 260 Query: 671 ILGVSSSGATYFMEPRDAVELNNMEVRLSNSEKAEELAILNLLTSEIAGSEMEIRYLMDK 492 +L SSSGATYFMEP+DAVELNNMEVRLSNSEKAEEL IL+LLTSEIAGSE EI YL+++ Sbjct: 261 VLNASSSGATYFMEPKDAVELNNMEVRLSNSEKAEELGILSLLTSEIAGSETEIIYLLER 320 Query: 491 VVELDLASARGSFAHWINGVCPVLGAIHERVELNNTGESLSVDIECIRHP---XXXXXXX 321 ++ELDLA AR ++A +NGVCP+LG + +N E+L VDI+ I+HP Sbjct: 321 ILELDLACARAAYARSLNGVCPILGVEICKGARSNKTENLLVDIKGIQHPVLLESSLGSL 380 Query: 320 XXXXXXXXENSIHFDGSNGMAKSGRLPEHGADFPVPLDIKLGDTTKVVVISGPNTGGKTA 141 E+S+ N +S R G+ FPVP+DIK+G TKVVVISGPNTGGKTA Sbjct: 381 HMLSISESESSVQSHRENIKLESDR-STGGSVFPVPIDIKVGHATKVVVISGPNTGGKTA 439 Query: 140 TMKTLGLAALMSKAGLYLPAKPSPKLPWFDHVLADIGDHQSLEHNL 3 +MKTLGLA+LMSKAG+YLPA+ P+LPWFD VLADIGD+QSLE NL Sbjct: 440 SMKTLGLASLMSKAGMYLPARNCPRLPWFDLVLADIGDNQSLEQNL 485 >ref|XP_010241958.1| PREDICTED: uncharacterized protein LOC104586426 isoform X1 [Nelumbo 
nucifera] Length = 910 Score = 509 bits (1312), Expect = e-141 Identities = 279/466 (59%), Positives = 340/466 (72%), Gaps = 3/466 (0%) Frame = -1 Query: 1391 HKSIKLTNVKSKSSLDKIRVXXXXXXXXXXXXEWQSICXXXXXXXXXXXXXXXXXXGNLP 1212 H I+ +++ + K++V EW S+C G LP Sbjct: 22 HGFIRKSSLTNSPGSSKVKVAEDLQKESEEILEWHSVCRQVSAFTSTSMGLSIAREGKLP 81 Query: 1211 FGRDREESQKLLEQTTAAFLLPQPLDFSGIEDLSEIVGSAVSGELLTVRQLCAVARSLQS 1032 FGR +ESQKLL QTTAA LLP+PLDFSGIEDLSEIV S+V G+L T+R+LCAV R+LQS Sbjct: 82 FGRSLQESQKLLNQTTAAMLLPRPLDFSGIEDLSEIVSSSVVGQLRTIRELCAVKRTLQS 141 Query: 1031 ARGVLEQLEKMSSENQGDSQWYSPLLEILKNCNFLTELEQKIGFCIDCSLSVVKDQASEK 852 AR + EQLE+ S ++Y PL+EIL+NCNFLTELEQKIGFCIDC+LSVV D+ASE Sbjct: 142 ARELFEQLEEASLNGDSSDRYY-PLIEILQNCNFLTELEQKIGFCIDCNLSVVLDRASED 200 Query: 851 LGFIRSERKRNMESLESLLKVVSTRIFQAGGIDSPLVTKRRSRMCVGIRASRRSLLNGGV 672 L IRSERKRNM++LESLLK V+T+IF+AGGIDSPL+TKRRSRMCVGI+AS +SLL G+ Sbjct: 201 LQIIRSERKRNMDNLESLLKEVATQIFRAGGIDSPLITKRRSRMCVGIKASYKSLLPDGI 260 Query: 671 ILGVSSSGATYFMEPRDAVELNNMEVRLSNSEKAEELAILNLLTSEIAGSEMEIRYLMDK 492 +L SSSGATYFMEP+DAVELNNMEVRLSNSEKAEEL IL+LLTSEIAGSE EI YL+++ Sbjct: 261 VLNASSSGATYFMEPKDAVELNNMEVRLSNSEKAEELGILSLLTSEIAGSETEIIYLLER 320 Query: 491 VVELDLASARGSFAHWINGVCPVLGAIHERVELNNTGESLSVDIECIRHP---XXXXXXX 321 ++ELDLA AR ++A +NGVCP+LG + +N E+L VDI+ I+HP Sbjct: 321 ILELDLACARAAYARSLNGVCPILGVEICKGARSNKTENLLVDIKGIQHPVLLESSLGSL 380 Query: 320 XXXXXXXXENSIHFDGSNGMAKSGRLPEHGADFPVPLDIKLGDTTKVVVISGPNTGGKTA 141 E+S+ N +S R G+ FPVP+DIK+G TKVVVISGPNTGGKTA Sbjct: 381 HMLSISESESSVQSHRENIKLESDR-STGGSVFPVPIDIKVGHATKVVVISGPNTGGKTA 439 Query: 140 TMKTLGLAALMSKAGLYLPAKPSPKLPWFDHVLADIGDHQSLEHNL 3 +MKTLGLA+LMSKAG+YLPA+ P+LPWFD VLADIGD+QSLE NL Sbjct: 440 SMKTLGLASLMSKAGMYLPARNCPRLPWFDLVLADIGDNQSLEQNL 485 >ref|XP_010934861.1| PREDICTED: uncharacterized protein LOC105054914 [Elaeis guineensis] Length = 1486 Score = 489 bits (1258), Expect = e-135 Identities = 258/433 (59%), Positives = 325/433 (75%), Gaps = 3/433 (0%) Frame = -1 Query: 1292 
WQSICXXXXXXXXXXXXXXXXXXGNLPFGRDREESQKLLEQTTAAFLLPQPLDFSGIEDL 1113 W +C G+LP GRDREES KLL+QT A LLPQPLDFSGI+D+ Sbjct: 632 WSLVCSQVCAFVSTSAGKALCRSGSLPIGRDREESLKLLDQTAAVVLLPQPLDFSGIDDV 691 Query: 1112 SEIVGSAVSGELLTVRQLCAVARSLQSARGVLEQLEKMSSENQGDSQWYSPLLEILKNCN 933 SEIV AV G+LLT+R+LCAV RSL+SAR V EQLE++S+ + + +PLL+IL++C+ Sbjct: 692 SEIVRLAVDGQLLTIRELCAVERSLRSARRVFEQLEQVSAAAESPDR-LAPLLDILQDCD 750 Query: 932 FLTELEQKIGFCIDCSLSVVKDQASEKLGFIRSERKRNMESLESLLKVVSTRIFQAGGID 753 FLT++ KIGFCIDC+LSVV D+AS KL +R ERK+NME LESLL+ +S +FQAGGID Sbjct: 751 FLTDIANKIGFCIDCTLSVVLDRASVKLESVRLERKQNMERLESLLREISMNVFQAGGID 810 Query: 752 SPLVTKRRSRMCVGIRASRRSLLNGGVILGVSSSGATYFMEPRDAVELNNMEVRLSNSEK 573 SPL+TKRRSRMC+GI+AS +SLL G++L SSSGATYFMEPRDAVELNNMEVRL N EK Sbjct: 811 SPLITKRRSRMCIGIKASHKSLLPEGIVLSSSSSGATYFMEPRDAVELNNMEVRLLNDEK 870 Query: 572 AEELAILNLLTSEIAGSEMEIRYLMDKVVELDLASARGSFAHWINGVCPVLGAIHERVEL 393 EELAIL L+SEIA SE + R LM+K++ELDLASARG++A W+NGV PV H+ ++ Sbjct: 871 DEELAILGFLSSEIACSETKFRLLMEKILELDLASARGAYALWMNGVRPVFSEGHQIIKS 930 Query: 392 NNTGESLSVDIECIRHP---XXXXXXXXXXXXXXXENSIHFDGSNGMAKSGRLPEHGADF 222 + + +SLS+DI+ I+HP +S + +G+ +S LPE A+ Sbjct: 931 SISADSLSIDIQGIQHPLLLQPSLRSLSSISIPEAGSSEMLNRRDGLMESEDLPE--AET 988 Query: 221 PVPLDIKLGDTTKVVVISGPNTGGKTATMKTLGLAALMSKAGLYLPAKPSPKLPWFDHVL 42 PVP+D+++G TTKV+VISGPNTGGKTATMKTLGLAALMSKAG++LPA+ P+LPWFD +L Sbjct: 989 PVPIDVRIGYTTKVLVISGPNTGGKTATMKTLGLAALMSKAGMFLPARGRPRLPWFDQIL 1048 Query: 41 ADIGDHQSLEHNL 3 ADIGDHQSLEHNL Sbjct: 1049 ADIGDHQSLEHNL 1061 >ref|XP_009408148.1| PREDICTED: uncharacterized protein LOC103990661 [Musa acuminata subsp. 
malaccensis] Length = 954 Score = 485 bits (1248), Expect = e-134 Identities = 265/451 (58%), Positives = 326/451 (72%), Gaps = 3/451 (0%) Frame = -1 Query: 1346 DKIRVXXXXXXXXXXXXEWQSICXXXXXXXXXXXXXXXXXXGNLPFGRDREESQKLLEQT 1167 +++R+ EW S+C GNLP GRDREES+KLL+QT Sbjct: 84 ERVRIREELRRETEETLEWGSVCSQVSAFVSTSVGRALCRSGNLPVGRDREESEKLLDQT 143 Query: 1166 TAAFLLPQPLDFSGIEDLSEIVGSAVSGELLTVRQLCAVARSLQSARGVLEQLEKMSSEN 987 AA LLP+PLDFSGI+D+SEIV +AV+GELL +R+LCA+ RSLQSAR V EQLE++S++ Sbjct: 144 AAAVLLPRPLDFSGIDDVSEIVRAAVAGELLGIRELCAIERSLQSARRVFEQLEQISADE 203 Query: 986 QGDSQWYSPLLEILKNCNFLTELEQKIGFCIDCSLSVVKDQASEKLGFIRSERKRNMESL 807 D Y+ LLEIL++C+FL EL +I FCID LS+V DQAS KL IR ER++NME L Sbjct: 204 SSDR--YTSLLEILQDCDFLVELANQIAFCIDGKLSIVLDQASMKLESIRMERRKNMEKL 261 Query: 806 ESLLKVVSTRIFQAGGIDSPLVTKRRSRMCVGIRASRRSLLNGGVILGVSSSGATYFMEP 627 ES LK VS ++FQ+GGIDSPLVTKRRSRMCVGI+AS +SLL G++L SSSGATYF+EP Sbjct: 262 ESFLKEVSMKVFQSGGIDSPLVTKRRSRMCVGIKASHKSLLPEGIVLSSSSSGATYFIEP 321 Query: 626 RDAVELNNMEVRLSNSEKAEELAILNLLTSEIAGSEMEIRYLMDKVVELDLASARGSFAH 447 RDA+ELNNMEVRL N EKAEELAIL +LTSEIA +E +IRYLM+K++ELDLA ARG++A Sbjct: 322 RDAIELNNMEVRLFNDEKAEELAILGVLTSEIAHAETKIRYLMEKILELDLAVARGAYAL 381 Query: 446 WINGVCPVLGAIHERVELNNTGESLSVDIECIRHP---XXXXXXXXXXXXXXXENSIHFD 276 W GV P L +ER + TG++LSVDIE I+HP +SI FD Sbjct: 382 WNGGVRPYLIQDYERFKSIITGDTLSVDIESIQHPLLLEPSLRHLPSVSEKGGGSSILFD 441 Query: 275 GSNGMAKSGRLPEHGADFPVPLDIKLGDTTKVVVISGPNTGGKTATMKTLGLAALMSKAG 96 N S E + PVP+D K+ ++TKVVVISGPNTGGKTATMKTLGLA++MSKAG Sbjct: 442 RRNLSIDSEEFLE--VEPPVPVDFKIENSTKVVVISGPNTGGKTATMKTLGLASIMSKAG 499 Query: 95 LYLPAKPSPKLPWFDHVLADIGDHQSLEHNL 3 ++L A+ PKLPWFD +LADIGDHQSLEHNL Sbjct: 500 MFLSARDQPKLPWFDQILADIGDHQSLEHNL 530 >ref|XP_008810464.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103721871 [Phoenix dactylifera] Length = 1716 Score = 473 bits (1218), Expect = e-130 Identities = 256/437 (58%), Positives = 319/437 (72%), Gaps = 7/437 (1%) Frame = -1 Query: 1292 
WQSICXXXXXXXXXXXXXXXXXXGNLPFGRDREESQKLLEQTTAAFLLPQPLDFSGIEDL 1113 W IC G+LP GRDREES KLL+QT AA LLPQPLDFSGI+D+ Sbjct: 862 WSLICSQVSAFVCTSAGKALCRSGSLPIGRDREESMKLLDQTAAAVLLPQPLDFSGIDDV 921 Query: 1112 SEIVGSAVSGELLTVRQLCAVARSLQSARGVLEQLEKMSSENQGDSQWYSPLLEILKNCN 933 SEIV SAV G+LLT+ +LCAV RSL+SAR V E LE++ + + + +SPLL+IL++C+ Sbjct: 922 SEIVRSAVDGQLLTIGELCAVERSLRSARRVFELLEQIWAAGESPDR-FSPLLDILQDCD 980 Query: 932 FLTELEQKIGFCIDCSLSVVKDQASEKLGFIRSERKRNMESLESLLKVVSTRIFQAGGID 753 FLT++ KI FCIDC+LS+V D+AS KL +R ERK+NME LESLL+ +S +FQ GGID Sbjct: 981 FLTDIANKIRFCIDCTLSIVLDRASMKLESLRLERKQNMERLESLLRKISMEVFQVGGID 1040 Query: 752 SPLVTKRRSRMCVGIRASRRSLLNGGVILGVSSSGATYFMEPRDAVELNNMEVRLSNSEK 573 PL+TKRRSRMC+GIRAS +SLL G++L SSSGATYFMEPRDAV LNNMEVRL N EK Sbjct: 1041 RPLITKRRSRMCIGIRASHKSLLPEGIVLSSSSSGATYFMEPRDAVVLNNMEVRLLNDEK 1100 Query: 572 AEELAILNLLTSEIAGSEMEIRYLMDKVVELDLASARGSFAHWINGVCPVLGAIHERVEL 393 EELAIL+ L+SEIA SE + R LM+K++ELDLASARG++A W+NGV P+ H+ + Sbjct: 1101 DEELAILSYLSSEIARSETKFRLLMEKILELDLASARGAYALWMNGVHPLFSEGHQIINS 1160 Query: 392 NNTGESLSVDIECIRHPXXXXXXXXXXXXXXXENSIHFDGSNGM-------AKSGRLPEH 234 N + SLS+DI+ I+HP SI GS+ M +S LP+ Sbjct: 1161 NISANSLSIDIQGIQHP----LLLQPSLRSLSSTSIPEAGSSEMLSRRDRAMESEDLPK- 1215 Query: 233 GADFPVPLDIKLGDTTKVVVISGPNTGGKTATMKTLGLAALMSKAGLYLPAKPSPKLPWF 54 A+ PVP+DI++G TTKV+VISGPNTGGKTATMKT GLAALMSKAG++LPA+ P+LPWF Sbjct: 1216 -AETPVPIDIRIGYTTKVLVISGPNTGGKTATMKTXGLAALMSKAGMFLPARGRPRLPWF 1274 Query: 53 DHVLADIGDHQSLEHNL 3 D +LADIGDHQ+LEHNL Sbjct: 1275 DQILADIGDHQTLEHNL 1291 >ref|XP_003529319.1| PREDICTED: uncharacterized protein LOC100778373 isoformX1 [Glycine max] gi|571467012|ref|XP_006583816.1| PREDICTED: uncharacterized protein LOC100778373 isoform X2 [Glycine max] Length = 914 Score = 462 bits (1190), Expect = e-127 Identities = 245/443 (55%), Positives = 314/443 (70%), Gaps = 13/443 (2%) Frame = -1 Query: 1292 WQSICXXXXXXXXXXXXXXXXXXGNLPFGRDREESQKLLEQTTAAFLLPQPLDFSGIEDL 1113 W S+C LP GR R +SQ+LL+QT+AA L+ +PLDFSG+ DL Sbjct: 46 
WGSVCKQLSAFTSTSMGSAAALNARLPIGRTRRDSQRLLDQTSAARLVAEPLDFSGVHDL 105 Query: 1112 SEIVGSAVSGELLTVRQLCAVARSLQSARGVLEQLEKMSSENQGDSQWYSPLLEILKNCN 933 +EI+G A SG LLT+R+LC V +L +AR + + L++++S + Q Y PLL+IL+NCN Sbjct: 106 TEILGVATSGHLLTIRELCTVRHTLAAARELFDALKRVASASN-HPQRYLPLLDILQNCN 164 Query: 932 FLTELEQKIGFCIDCSLSVVKDQASEKLGFIRSERKRNMESLESLLKVVSTRIFQAGGID 753 F LE+KI FCIDC LS++ D+ASE L IRSERKRN+E L+SLLK VS++IFQAGGID Sbjct: 165 FQVGLERKIEFCIDCKLSIILDRASEDLEIIRSERKRNIEILDSLLKEVSSQIFQAGGID 224 Query: 752 SPLVTKRRSRMCVGIRASRRSLLNGGVILGVSSSGATYFMEPRDAVELNNMEVRLSNSEK 573 PL+ KRRSRMCVGIRAS R LL GV+L VSSSGATYFMEP+DA++LNN+EVRLS+SEK Sbjct: 225 RPLIVKRRSRMCVGIRASHRYLLPDGVVLNVSSSGATYFMEPKDAIDLNNLEVRLSSSEK 284 Query: 572 AEELAILNLLTSEIAGSEMEIRYLMDKVVELDLASARGSFAHWINGVCPV--LGAIHERV 399 AEE IL++L SEIA SE +I +L+DK++++DLA AR ++A W+NGVCP+ LG R Sbjct: 285 AEESVILSMLASEIANSESDINHLLDKILKVDLAFARAAYAQWMNGVCPIFSLGNFEGRD 344 Query: 398 ELNNTGES--------LSVDIECIRHP---XXXXXXXXXXXXXXXENSIHFDGSNGMAKS 252 + + ++ L+VDI IRHP N+ F NG S Sbjct: 345 SVEDDDDTLVTQEDDDLTVDIVGIRHPLLLESSLENISDNLTLRSGNAAEFGNGNGTMAS 404 Query: 251 GRLPEHGADFPVPLDIKLGDTTKVVVISGPNTGGKTATMKTLGLAALMSKAGLYLPAKPS 72 +P+ +DFPVP+D K+G T+VVVISGPNTGGKTA+MKTLGLA+LMSKAG++LPAK + Sbjct: 405 KYMPQGISDFPVPVDFKIGHGTRVVVISGPNTGGKTASMKTLGLASLMSKAGMHLPAKKN 464 Query: 71 PKLPWFDHVLADIGDHQSLEHNL 3 PKLPWFD +LADIGDHQSLE NL Sbjct: 465 PKLPWFDLILADIGDHQSLEQNL 487 >ref|XP_006467813.1| PREDICTED: uncharacterized protein LOC102631102 [Citrus sinensis] Length = 907 Score = 459 bits (1181), Expect = e-126 Identities = 248/434 (57%), Positives = 312/434 (71%), Gaps = 4/434 (0%) Frame = -1 Query: 1292 WQSICXXXXXXXXXXXXXXXXXXGNLPFGRDREESQKLLEQTTAAFLL--PQPLDFSGIE 1119 W ++C +PFG+ EESQKLL QT+AA + QPLD S IE Sbjct: 58 WPTLCHQLSSFTQTSMGHAVVQKAQIPFGKSLEESQKLLNQTSAALAMMQSQPLDLSAIE 117 Query: 1118 DLSEIVGSAVSGELLTVRQLCAVARSLQSARGVLEQLEKMSSENQGDS-QWYSPLLEILK 942 D++ I+ SAVSG+LL+ ++CAV R+L++ V ++L + ++E GDS Q YSPLLE+LK Sbjct: 118 
DIAGILNSAVSGQLLSPSEICAVRRTLRAVNNVWKKLTE-AAELDGDSLQRYSPLLELLK 176 Query: 941 NCNFLTELEQKIGFCIDCSLSVVKDQASEKLGFIRSERKRNMESLESLLKVVSTRIFQAG 762 NCNFLTELE+KIGFCIDC L ++ D+ASE L IR+ERKRNME+L+SLLK V+ +IFQAG Sbjct: 177 NCNFLTELEEKIGFCIDCKLLIILDRASEDLELIRAERKRNMENLDSLLKKVAAQIFQAG 236 Query: 761 GIDSPLVTKRRSRMCVGIRASRRSLLNGGVILGVSSSGATYFMEPRDAVELNNMEVRLSN 582 GID PL+TKRRSRMCVGI+AS + LL G+ L VSSSGATYFMEP++AVE NNMEVRLSN Sbjct: 237 GIDKPLITKRRSRMCVGIKASHKYLLPDGIALNVSSSGATYFMEPKEAVEFNNMEVRLSN 296 Query: 581 SEKAEELAILNLLTSEIAGSEMEIRYLMDKVVELDLASARGSFAHWINGVCPVLGAIHER 402 SE AEE AIL+LLT+EIA SE +I+YLMD+V+E+DLA AR FA W++GVCP+L + Sbjct: 297 SEIAEETAILSLLTAEIAKSERKIKYLMDRVLEIDLAFARAGFAQWMDGVCPILSS---- 352 Query: 401 VELNNTGESLSVDIECIRHP-XXXXXXXXXXXXXXXENSIHFDGSNGMAKSGRLPEHGAD 225 ++ S++IE I+HP N + D N G L + +D Sbjct: 353 --QSHVSFDSSINIEGIKHPLLLGSSLRSLSAASSNSNPLKSDVENSEMTVGSLSKGISD 410 Query: 224 FPVPLDIKLGDTTKVVVISGPNTGGKTATMKTLGLAALMSKAGLYLPAKPSPKLPWFDHV 45 FPVP+DIK+ T+VVVI+GPNTGGKTA+MKTLGLA+LMSKAGLYLPAK P+LPWFD + Sbjct: 411 FPVPIDIKVECETRVVVITGPNTGGKTASMKTLGLASLMSKAGLYLPAKNHPRLPWFDLI 470 Query: 44 LADIGDHQSLEHNL 3 LADIGDHQSLE NL Sbjct: 471 LADIGDHQSLEQNL 484 >gb|KJB69499.1| hypothetical protein B456_011G026900 [Gossypium raimondii] Length = 671 Score = 459 bits (1180), Expect = e-126 Identities = 249/447 (55%), Positives = 320/447 (71%), Gaps = 17/447 (3%) Frame = -1 Query: 1292 WQSICXXXXXXXXXXXXXXXXXXGNLPFGRDREESQKLLEQTTAAF-----LLPQPLDFS 1128 W S+C +P G+ RE+SQKLL+QTT+A L +PLD S Sbjct: 65 WPSLCNYLSPFTSTSMAFSLTKAAAIPVGQSREDSQKLLDQTTSALHALEALKSEPLDLS 124 Query: 1127 GIEDLSEIVGSAVSGELLTVRQLCAVARSLQSARGVLEQLEKMSSENQGDSQWYSPLLEI 948 IED+SEI+ SA SG++LTVR+LC V R L +AR V E+L ++ G + Y+PLLEI Sbjct: 125 VIEDVSEILHSAASGQVLTVRELCRVRRMLGAARAVSEKLAAIAEG--GSLERYTPLLEI 182 Query: 947 LKNCNFLTELEQKIGFCIDCSLSVVKDQASEKLGFIRSERKRNMESLESLLKVVSTRIFQ 768 L+ CNF ELE+KIGFCIDCSLS + +ASE+L IR ERKRNME+L+SLLK VS IFQ Sbjct: 183 LQGCNFQLELERKIGFCIDCSLSTILGRASEELELIREERKRNMENLDSLLKEVSVSIFQ 242 Query: 767 
AGGIDSPLVTKRRSRMCVGIRASRRSLLNGGVILGVSSSGATYFMEPRDAVELNNMEVRL 588 AGGID PL+TKRRSRMCVG++A+ + LL GGV+L VSSSGATYFMEP++AVELNNMEV+L Sbjct: 243 AGGIDKPLITKRRSRMCVGVKATHKYLLPGGVVLNVSSSGATYFMEPKEAVELNNMEVKL 302 Query: 587 SNSEKAEELAILNLLTSEIAGSEMEIRYLMDKVVELDLASARGSFAHWINGVCPVLGAIH 408 SNSEKAEE+AIL++LTSEIA SE EI+YL+D+++E+DLA AR ++A W+NGVCP+L + Sbjct: 303 SNSEKAEEMAILSMLTSEIAESEAEIKYLLDRLIEVDLAFARAAYAQWVNGVCPILSSKE 362 Query: 407 ERVELNNTGE-SLSVDIECIRHPXXXXXXXXXXXXXXXENSIHFDGSNGMA------KSG 249 + ++N + +LS+DIE ++HP NS F SN M KSG Sbjct: 363 SEMLISNGADNALSIDIEGMQHP--------LLLGSFLSNSTDFITSNSMGPSVLGNKSG 414 Query: 248 RL-----PEHGADFPVPLDIKLGDTTKVVVISGPNTGGKTATMKTLGLAALMSKAGLYLP 84 + + ++FP+P+DIK+ T+VV+ISGPNTGGKTA+MKTLGLA++MSKAG+YLP Sbjct: 415 EMTPIKSSKVVSNFPIPIDIKVQCGTRVVIISGPNTGGKTASMKTLGLASIMSKAGMYLP 474 Query: 83 AKPSPKLPWFDHVLADIGDHQSLEHNL 3 AK P+LPWFD VLADIGD QSLE +L Sbjct: 475 AKKQPRLPWFDLVLADIGDSQSLEQSL 501 >ref|XP_012454722.1| PREDICTED: uncharacterized protein LOC105776552 [Gossypium raimondii] gi|763802560|gb|KJB69498.1| hypothetical protein B456_011G026900 [Gossypium raimondii] Length = 927 Score = 459 bits (1180), Expect = e-126 Identities = 249/447 (55%), Positives = 320/447 (71%), Gaps = 17/447 (3%) Frame = -1 Query: 1292 WQSICXXXXXXXXXXXXXXXXXXGNLPFGRDREESQKLLEQTTAAF-----LLPQPLDFS 1128 W S+C +P G+ RE+SQKLL+QTT+A L +PLD S Sbjct: 65 WPSLCNYLSPFTSTSMAFSLTKAAAIPVGQSREDSQKLLDQTTSALHALEALKSEPLDLS 124 Query: 1127 GIEDLSEIVGSAVSGELLTVRQLCAVARSLQSARGVLEQLEKMSSENQGDSQWYSPLLEI 948 IED+SEI+ SA SG++LTVR+LC V R L +AR V E+L ++ G + Y+PLLEI Sbjct: 125 VIEDVSEILHSAASGQVLTVRELCRVRRMLGAARAVSEKLAAIAEG--GSLERYTPLLEI 182 Query: 947 LKNCNFLTELEQKIGFCIDCSLSVVKDQASEKLGFIRSERKRNMESLESLLKVVSTRIFQ 768 L+ CNF ELE+KIGFCIDCSLS + +ASE+L IR ERKRNME+L+SLLK VS IFQ Sbjct: 183 LQGCNFQLELERKIGFCIDCSLSTILGRASEELELIREERKRNMENLDSLLKEVSVSIFQ 242 Query: 767 AGGIDSPLVTKRRSRMCVGIRASRRSLLNGGVILGVSSSGATYFMEPRDAVELNNMEVRL 588 AGGID PL+TKRRSRMCVG++A+ + LL GGV+L VSSSGATYFMEP++AVELNNMEV+L Sbjct: 243 
AGGIDKPLITKRRSRMCVGVKATHKYLLPGGVVLNVSSSGATYFMEPKEAVELNNMEVKL 302 Query: 587 SNSEKAEELAILNLLTSEIAGSEMEIRYLMDKVVELDLASARGSFAHWINGVCPVLGAIH 408 SNSEKAEE+AIL++LTSEIA SE EI+YL+D+++E+DLA AR ++A W+NGVCP+L + Sbjct: 303 SNSEKAEEMAILSMLTSEIAESEAEIKYLLDRLIEVDLAFARAAYAQWVNGVCPILSSKE 362 Query: 407 ERVELNNTGE-SLSVDIECIRHPXXXXXXXXXXXXXXXENSIHFDGSNGMA------KSG 249 + ++N + +LS+DIE ++HP NS F SN M KSG Sbjct: 363 SEMLISNGADNALSIDIEGMQHP--------LLLGSFLSNSTDFITSNSMGPSVLGNKSG 414 Query: 248 RL-----PEHGADFPVPLDIKLGDTTKVVVISGPNTGGKTATMKTLGLAALMSKAGLYLP 84 + + ++FP+P+DIK+ T+VV+ISGPNTGGKTA+MKTLGLA++MSKAG+YLP Sbjct: 415 EMTPIKSSKVVSNFPIPIDIKVQCGTRVVIISGPNTGGKTASMKTLGLASIMSKAGMYLP 474 Query: 83 AKPSPKLPWFDHVLADIGDHQSLEHNL 3 AK P+LPWFD VLADIGD QSLE +L Sbjct: 475 AKKQPRLPWFDLVLADIGDSQSLEQSL 501 >gb|KDO75937.1| hypothetical protein CISIN_1g003258mg [Citrus sinensis] Length = 620 Score = 457 bits (1177), Expect = e-126 Identities = 247/409 (60%), Positives = 307/409 (75%), Gaps = 4/409 (0%) Frame = -1 Query: 1217 LPFGRDREESQKLLEQTTAAFLL--PQPLDFSGIEDLSEIVGSAVSGELLTVRQLCAVAR 1044 +PFG+ EESQKLL QT+AA + QPLD S IED++ I+ SAVSG+LL+ ++CAV R Sbjct: 11 IPFGKSLEESQKLLNQTSAALAMMQSQPLDLSTIEDIAGILNSAVSGQLLSPSEICAVRR 70 Query: 1043 SLQSARGVLEQLEKMSSENQGDS-QWYSPLLEILKNCNFLTELEQKIGFCIDCSLSVVKD 867 +L++ V ++L + ++E GDS Q YSPLLE+LKNCNFLTELE+KIGFCIDC L ++ D Sbjct: 71 TLRAVNNVWKKLTE-AAELDGDSLQRYSPLLELLKNCNFLTELEEKIGFCIDCKLLIILD 129 Query: 866 QASEKLGFIRSERKRNMESLESLLKVVSTRIFQAGGIDSPLVTKRRSRMCVGIRASRRSL 687 +ASE L IR+ERKRNME+L+SLLK V+ +IFQAGGID PL+TKRRSRMCVGI+AS + L Sbjct: 130 RASEDLELIRAERKRNMENLDSLLKKVAAQIFQAGGIDKPLITKRRSRMCVGIKASHKYL 189 Query: 686 LNGGVILGVSSSGATYFMEPRDAVELNNMEVRLSNSEKAEELAILNLLTSEIAGSEMEIR 507 L G+ L VSSSGATYFMEP+ AVE NNMEVRLSNSE AEE AIL+LLT+EIA SE EI+ Sbjct: 190 LPDGIALNVSSSGATYFMEPKGAVEFNNMEVRLSNSEIAEETAILSLLTAEIAKSEREIK 249 Query: 506 YLMDKVVELDLASARGSFAHWINGVCPVLGAIHERVELNNTGESLSVDIECIRHP-XXXX 330 YLMD+V+E+DLA AR FA W++GVCP+L + ++ S++IE I+HP Sbjct: 250 
YLMDRVLEIDLAFARAGFAQWMDGVCPILSS------QSHVSFDSSINIEGIKHPLLLGS 303 Query: 329 XXXXXXXXXXXENSIHFDGSNGMAKSGRLPEHGADFPVPLDIKLGDTTKVVVISGPNTGG 150 N + D N G L + +DFPVP+DIK+ T+VVVI+GPNTGG Sbjct: 304 SLRSLSAASSNSNPLKSDVENSEMTVGSLSKGISDFPVPIDIKVECETRVVVITGPNTGG 363 Query: 149 KTATMKTLGLAALMSKAGLYLPAKPSPKLPWFDHVLADIGDHQSLEHNL 3 KTA+MKTLGLA+LMSKAGLYLPAK P+LPWFD +LADIGDHQSLE NL Sbjct: 364 KTASMKTLGLASLMSKAGLYLPAKNHPRLPWFDLILADIGDHQSLEQNL 412 >gb|KDO75936.1| hypothetical protein CISIN_1g003258mg [Citrus sinensis] Length = 623 Score = 457 bits (1177), Expect = e-126 Identities = 247/409 (60%), Positives = 307/409 (75%), Gaps = 4/409 (0%) Frame = -1 Query: 1217 LPFGRDREESQKLLEQTTAAFLL--PQPLDFSGIEDLSEIVGSAVSGELLTVRQLCAVAR 1044 +PFG+ EESQKLL QT+AA + QPLD S IED++ I+ SAVSG+LL+ ++CAV R Sbjct: 11 IPFGKSLEESQKLLNQTSAALAMMQSQPLDLSTIEDIAGILNSAVSGQLLSPSEICAVRR 70 Query: 1043 SLQSARGVLEQLEKMSSENQGDS-QWYSPLLEILKNCNFLTELEQKIGFCIDCSLSVVKD 867 +L++ V ++L + ++E GDS Q YSPLLE+LKNCNFLTELE+KIGFCIDC L ++ D Sbjct: 71 TLRAVNNVWKKLTE-AAELDGDSLQRYSPLLELLKNCNFLTELEEKIGFCIDCKLLIILD 129 Query: 866 QASEKLGFIRSERKRNMESLESLLKVVSTRIFQAGGIDSPLVTKRRSRMCVGIRASRRSL 687 +ASE L IR+ERKRNME+L+SLLK V+ +IFQAGGID PL+TKRRSRMCVGI+AS + L Sbjct: 130 RASEDLELIRAERKRNMENLDSLLKKVAAQIFQAGGIDKPLITKRRSRMCVGIKASHKYL 189 Query: 686 LNGGVILGVSSSGATYFMEPRDAVELNNMEVRLSNSEKAEELAILNLLTSEIAGSEMEIR 507 L G+ L VSSSGATYFMEP+ AVE NNMEVRLSNSE AEE AIL+LLT+EIA SE EI+ Sbjct: 190 LPDGIALNVSSSGATYFMEPKGAVEFNNMEVRLSNSEIAEETAILSLLTAEIAKSEREIK 249 Query: 506 YLMDKVVELDLASARGSFAHWINGVCPVLGAIHERVELNNTGESLSVDIECIRHP-XXXX 330 YLMD+V+E+DLA AR FA W++GVCP+L + ++ S++IE I+HP Sbjct: 250 YLMDRVLEIDLAFARAGFAQWMDGVCPILSS------QSHVSFDSSINIEGIKHPLLLGS 303 Query: 329 XXXXXXXXXXXENSIHFDGSNGMAKSGRLPEHGADFPVPLDIKLGDTTKVVVISGPNTGG 150 N + D N G L + +DFPVP+DIK+ T+VVVI+GPNTGG Sbjct: 304 SLRSLSAASSNSNPLKSDVENSEMTVGSLSKGISDFPVPIDIKVECETRVVVITGPNTGG 363 Query: 149 KTATMKTLGLAALMSKAGLYLPAKPSPKLPWFDHVLADIGDHQSLEHNL 3 KTA+MKTLGLA+LMSKAGLYLPAK P+LPWFD +LADIGDHQSLE NL Sbjct: 364 
KTASMKTLGLASLMSKAGLYLPAKNHPRLPWFDLILADIGDHQSLEQNL 412 >gb|KDO75935.1| hypothetical protein CISIN_1g003258mg [Citrus sinensis] Length = 742 Score = 457 bits (1177), Expect = e-126 Identities = 247/409 (60%), Positives = 307/409 (75%), Gaps = 4/409 (0%) Frame = -1 Query: 1217 LPFGRDREESQKLLEQTTAAFLL--PQPLDFSGIEDLSEIVGSAVSGELLTVRQLCAVAR 1044 +PFG+ EESQKLL QT+AA + QPLD S IED++ I+ SAVSG+LL+ ++CAV R Sbjct: 11 IPFGKSLEESQKLLNQTSAALAMMQSQPLDLSTIEDIAGILNSAVSGQLLSPSEICAVRR 70 Query: 1043 SLQSARGVLEQLEKMSSENQGDS-QWYSPLLEILKNCNFLTELEQKIGFCIDCSLSVVKD 867 +L++ V ++L + ++E GDS Q YSPLLE+LKNCNFLTELE+KIGFCIDC L ++ D Sbjct: 71 TLRAVNNVWKKLTE-AAELDGDSLQRYSPLLELLKNCNFLTELEEKIGFCIDCKLLIILD 129 Query: 866 QASEKLGFIRSERKRNMESLESLLKVVSTRIFQAGGIDSPLVTKRRSRMCVGIRASRRSL 687 +ASE L IR+ERKRNME+L+SLLK V+ +IFQAGGID PL+TKRRSRMCVGI+AS + L Sbjct: 130 RASEDLELIRAERKRNMENLDSLLKKVAAQIFQAGGIDKPLITKRRSRMCVGIKASHKYL 189 Query: 686 LNGGVILGVSSSGATYFMEPRDAVELNNMEVRLSNSEKAEELAILNLLTSEIAGSEMEIR 507 L G+ L VSSSGATYFMEP+ AVE NNMEVRLSNSE AEE AIL+LLT+EIA SE EI+ Sbjct: 190 LPDGIALNVSSSGATYFMEPKGAVEFNNMEVRLSNSEIAEETAILSLLTAEIAKSEREIK 249 Query: 506 YLMDKVVELDLASARGSFAHWINGVCPVLGAIHERVELNNTGESLSVDIECIRHP-XXXX 330 YLMD+V+E+DLA AR FA W++GVCP+L + ++ S++IE I+HP Sbjct: 250 YLMDRVLEIDLAFARAGFAQWMDGVCPILSS------QSHVSFDSSINIEGIKHPLLLGS 303 Query: 329 XXXXXXXXXXXENSIHFDGSNGMAKSGRLPEHGADFPVPLDIKLGDTTKVVVISGPNTGG 150 N + D N G L + +DFPVP+DIK+ T+VVVI+GPNTGG Sbjct: 304 SLRSLSAASSNSNPLKSDVENSEMTVGSLSKGISDFPVPIDIKVECETRVVVITGPNTGG 363 Query: 149 KTATMKTLGLAALMSKAGLYLPAKPSPKLPWFDHVLADIGDHQSLEHNL 3 KTA+MKTLGLA+LMSKAGLYLPAK P+LPWFD +LADIGDHQSLE NL Sbjct: 364 KTASMKTLGLASLMSKAGLYLPAKNHPRLPWFDLILADIGDHQSLEQNL 412 >gb|KDO75934.1| hypothetical protein CISIN_1g003258mg [Citrus sinensis] Length = 835 Score = 457 bits (1177), Expect = e-126 Identities = 247/409 (60%), Positives = 307/409 (75%), Gaps = 4/409 (0%) Frame = -1 Query: 1217 LPFGRDREESQKLLEQTTAAFLL--PQPLDFSGIEDLSEIVGSAVSGELLTVRQLCAVAR 1044 +PFG+ EESQKLL QT+AA + QPLD S IED++ I+ 
SAVSG+LL+ ++CAV R Sbjct: 11 IPFGKSLEESQKLLNQTSAALAMMQSQPLDLSTIEDIAGILNSAVSGQLLSPSEICAVRR 70 Query: 1043 SLQSARGVLEQLEKMSSENQGDS-QWYSPLLEILKNCNFLTELEQKIGFCIDCSLSVVKD 867 +L++ V ++L + ++E GDS Q YSPLLE+LKNCNFLTELE+KIGFCIDC L ++ D Sbjct: 71 TLRAVNNVWKKLTE-AAELDGDSLQRYSPLLELLKNCNFLTELEEKIGFCIDCKLLIILD 129 Query: 866 QASEKLGFIRSERKRNMESLESLLKVVSTRIFQAGGIDSPLVTKRRSRMCVGIRASRRSL 687 +ASE L IR+ERKRNME+L+SLLK V+ +IFQAGGID PL+TKRRSRMCVGI+AS + L Sbjct: 130 RASEDLELIRAERKRNMENLDSLLKKVAAQIFQAGGIDKPLITKRRSRMCVGIKASHKYL 189 Query: 686 LNGGVILGVSSSGATYFMEPRDAVELNNMEVRLSNSEKAEELAILNLLTSEIAGSEMEIR 507 L G+ L VSSSGATYFMEP+ AVE NNMEVRLSNSE AEE AIL+LLT+EIA SE EI+ Sbjct: 190 LPDGIALNVSSSGATYFMEPKGAVEFNNMEVRLSNSEIAEETAILSLLTAEIAKSEREIK 249 Query: 506 YLMDKVVELDLASARGSFAHWINGVCPVLGAIHERVELNNTGESLSVDIECIRHP-XXXX 330 YLMD+V+E+DLA AR FA W++GVCP+L + ++ S++IE I+HP Sbjct: 250 YLMDRVLEIDLAFARAGFAQWMDGVCPILSS------QSHVSFDSSINIEGIKHPLLLGS 303 Query: 329 XXXXXXXXXXXENSIHFDGSNGMAKSGRLPEHGADFPVPLDIKLGDTTKVVVISGPNTGG 150 N + D N G L + +DFPVP+DIK+ T+VVVI+GPNTGG Sbjct: 304 SLRSLSAASSNSNPLKSDVENSEMTVGSLSKGISDFPVPIDIKVECETRVVVITGPNTGG 363 Query: 149 KTATMKTLGLAALMSKAGLYLPAKPSPKLPWFDHVLADIGDHQSLEHNL 3 KTA+MKTLGLA+LMSKAGLYLPAK P+LPWFD +LADIGDHQSLE NL Sbjct: 364 KTASMKTLGLASLMSKAGLYLPAKNHPRLPWFDLILADIGDHQSLEQNL 412 >ref|XP_007025648.1| DNA mismatch repair protein MutS isoform 1 [Theobroma cacao] gi|508781014|gb|EOY28270.1| DNA mismatch repair protein MutS isoform 1 [Theobroma cacao] Length = 921 Score = 457 bits (1175), Expect = e-125 Identities = 253/441 (57%), Positives = 315/441 (71%), Gaps = 11/441 (2%) Frame = -1 Query: 1292 WQSICXXXXXXXXXXXXXXXXXXGNLPFGRDREESQKLLEQTTAAF-----LLPQPLDFS 1128 W S+C P G+ +EESQKLL+QTTAA L +PLD S Sbjct: 63 WPSLCNYLSPFTSTSMALSLTKSAAFPIGQSQEESQKLLDQTTAALHAMEALKSEPLDLS 122 Query: 1127 GIEDLSEIVGSAVSGELLTVRQLCAVARSLQSARGVLEQLEKMSSENQGDSQWYSPLLEI 948 IED+S I+ SA SG+LLTVR+LC V R+L +AR V E+L ++ G + Y+PLLEI Sbjct: 123 AIEDVSGILRSAGSGQLLTVRELCRVRRTLGAARAVSEKLAAVAEG--GSLKRYTPLLEI 180 
Query: 947 LKNCNFLTELEQKIGFCIDCSLSVVKDQASEKLGFIRSERKRNMESLESLLKVVSTRIFQ 768 L+NCNF ELE+KIGFCIDC+LS V D+ASE+L IR+ERKRNM +L+SLLK VS +FQ Sbjct: 181 LQNCNFQKELEKKIGFCIDCNLSTVLDRASEELELIRAERKRNMGNLDSLLKEVSVNVFQ 240 Query: 767 AGGIDSPLVTKRRSRMCVGIRASRRSLLNGGVILGVSSSGATYFMEPRDAVELNNMEVRL 588 AGGID PL+TKRRSRMCVG+RAS + LL GV+L VSSSGATYFMEP++AVELNNMEV+L Sbjct: 241 AGGIDRPLITKRRSRMCVGVRASHKYLLPDGVVLNVSSSGATYFMEPKEAVELNNMEVKL 300 Query: 587 SNSEKAEELAILNLLTSEIAGSEMEIRYLMDKVVELDLASARGSFAHWINGVCPVLGAIH 408 SNSEKAEE+AIL+LLTSEIA SE EI+YL+DK++E+DLA A+ ++A W+NGVCP+ + Sbjct: 301 SNSEKAEEMAILSLLTSEIAESEAEIKYLLDKLLEVDLAFAKAAYAQWMNGVCPIFSSTE 360 Query: 407 ERVELNNTGESL-SVDIECIRHPXXXXXXXXXXXXXXXENSIHFDGSNGMAKSGRL---- 243 V ++N ++ SVDIE I+HP +S D S KSG + Sbjct: 361 SEVLISNGADNAWSVDIEGIQHPLLLGSSLRNFTDFIASSS--GDPSITEEKSGAMAAVK 418 Query: 242 -PEHGADFPVPLDIKLGDTTKVVVISGPNTGGKTATMKTLGLAALMSKAGLYLPAKPSPK 66 + + FPVP+DIK+ T+VVVISGPNTGGKTA+MKTLGLA+LMSKAG+YLPAK P+ Sbjct: 419 SSKGVSSFPVPIDIKVQCGTRVVVISGPNTGGKTASMKTLGLASLMSKAGMYLPAKKQPR 478 Query: 65 LPWFDHVLADIGDHQSLEHNL 3 LPWFD VLADIGD QSLE +L Sbjct: 479 LPWFDLVLADIGDSQSLERSL 499 >ref|XP_007159320.1| hypothetical protein PHAVU_002G228200g [Phaseolus vulgaris] gi|561032735|gb|ESW31314.1| hypothetical protein PHAVU_002G228200g [Phaseolus vulgaris] Length = 908 Score = 456 bits (1173), Expect = e-125 Identities = 245/442 (55%), Positives = 307/442 (69%), Gaps = 12/442 (2%) Frame = -1 Query: 1292 WQSICXXXXXXXXXXXXXXXXXXGNLPFGRDREESQKLLEQTTAAFLLPQPLDFSGIEDL 1113 W S+C LP GR SQKLL+QT+AA LL QPLDFS I DL Sbjct: 44 WSSVCKQLSPFTSTSMASAAALNARLPVGRTPAHSQKLLDQTSAARLLAQPLDFSAIHDL 103 Query: 1112 SEIVGSAVSGELLTVRQLCAVARSLQSARGVLEQLEKMSSENQGDSQWYSPLLEILKNCN 933 ++I+ A SG+LLT R+LC V R+L +AR + + L++ +S + Q Y PLLEIL+NCN Sbjct: 104 TDILRVATSGQLLTTRELCTVRRTLAAARELFDSLKRFASASN-HPQRYLPLLEILQNCN 162 Query: 932 FLTELEQKIGFCIDCSLSVVKDQASEKLGFIRSERKRNMESLESLLKVVSTRIFQAGGID 753 FL LE KI FCIDC+LS++ D+ASE L IRSERKRN E L+S+LK V+++IFQAGGID Sbjct: 163 
FLAGLESKIEFCIDCTLSIILDRASEDLEIIRSERKRNTEILDSMLKEVASQIFQAGGID 222 Query: 752 SPLVTKRRSRMCVGIRASRRSLLNGGVILGVSSSGATYFMEPRDAVELNNMEVRLSNSEK 573 PL+TKRRSRMCVGIRAS R LL GGV+L VSSSGATYFMEP+DA++LNN+EVRLS+SEK Sbjct: 223 RPLITKRRSRMCVGIRASHRYLLPGGVVLNVSSSGATYFMEPKDAIDLNNLEVRLSSSEK 282 Query: 572 AEELAILNLLTSEIAGSEMEIRYLMDKVVELDLASARGSFAHWINGVCPVL--------- 420 AEE AIL++L SEIA SE +I L+DK++E+DLA AR ++A W+NGVCP+ Sbjct: 283 AEESAILSMLASEIANSESDISNLLDKIMEIDLAFARAAYAQWMNGVCPIFRLDCFEGCD 342 Query: 419 GAIHERVELNNTGESLSVDIECIRHP---XXXXXXXXXXXXXXXENSIHFDGSNGMAKSG 249 + + +SL+V+I I+HP N++ F NG + Sbjct: 343 SNVDSDILDPQEDDSLNVNIVGIQHPLLLESSLEIISDNLALRSGNAVKFGDGNGEMATK 402 Query: 248 RLPEHGADFPVPLDIKLGDTTKVVVISGPNTGGKTATMKTLGLAALMSKAGLYLPAKPSP 69 +DFPVP+D K+G T+VVVISGPNTGGKTA+MKTLGLA+LMSKAG+YLPAK +P Sbjct: 403 YTSHSISDFPVPVDFKIGRGTRVVVISGPNTGGKTASMKTLGLASLMSKAGMYLPAKNNP 462 Query: 68 KLPWFDHVLADIGDHQSLEHNL 3 KLPWFD +LADIGDHQSLE NL Sbjct: 463 KLPWFDLILADIGDHQSLEQNL 484 >ref|XP_002305805.1| DNA mismatch repair MutS family protein [Populus trichocarpa] gi|222848769|gb|EEE86316.1| DNA mismatch repair MutS family protein [Populus trichocarpa] Length = 908 Score = 456 bits (1172), Expect = e-125 Identities = 247/436 (56%), Positives = 320/436 (73%), Gaps = 6/436 (1%) Frame = -1 Query: 1292 WQSICXXXXXXXXXXXXXXXXXXGNLPFGRDREESQKLLEQTTAAFLLPQ--PLDFSGIE 1119 W S+C +P G+ +EESQKLL+QT AA + + PLDFSGIE Sbjct: 57 WSSLCNQLTPFTSTSMGQSITRNAKIPIGKSKEESQKLLDQTAAALAVMESGPLDFSGIE 116 Query: 1118 DLSEIVGSAVSGELLTVRQLCAVARSLQSARGVLEQLEKMSSENQGD-SQWYSPLLEILK 942 D++ I+ SAVSG LLTV +LCAV R+L++AR VLE+L+ + GD S+ Y+PLLEIL+ Sbjct: 117 DITRILDSAVSGTLLTVGELCAVRRTLRAARAVLERLK-----DSGDCSERYAPLLEILQ 171 Query: 941 NCNFLTELEQKIGFCIDCSLSVVKDQASEKLGFIRSERKRNMESLESLLKVVSTRIFQAG 762 NC+F ELE+K+GFCIDC+LS + D+ASE L IRSERKRNME+L+ LLK +S RIFQAG Sbjct: 172 NCSFQIELEKKVGFCIDCNLSKILDRASEDLEIIRSERKRNMENLDRLLKGISARIFQAG 231 Query: 761 GIDSPLVTKRRSRMCVGIRASRRSLLNGGVILGVSSSGATYFMEPRDAVELNNMEVRLSN 582 GID PLVTKRRSR+CVG+RAS R L+ GV+L VSSSG 
TYFMEP +AVELNN+EV LS+ Sbjct: 232 GIDKPLVTKRRSRLCVGVRASHRYLIPDGVVLNVSSSGVTYFMEPGEAVELNNLEVMLSD 291 Query: 581 SEKAEELAILNLLTSEIAGSEMEIRYLMDKVVELDLASARGSFAHWINGVCPVLGAIHER 402 SEKAEE+AIL+LLTSEIA S +I+Y++D ++E+DL+ AR ++A+W+NGV P+ + Sbjct: 292 SEKAEEIAILSLLTSEIAESARDIKYMLDGIIEVDLSFARAAYAYWMNGVRPIWTSEGCG 351 Query: 401 VELNNTGE-SLSVDIECIRHPXXXXXXXXXXXXXXXENSIHF--DGSNGMAKSGRLPEHG 231 ++ G+ LS+DIE IRHP NS++ + M +G+ ++ Sbjct: 352 GISSSGGDYLLSIDIEGIRHPLLNGTSRKRLSNILGSNSLNSMEVDEDSMLDTGKPSKNV 411 Query: 230 ADFPVPLDIKLGDTTKVVVISGPNTGGKTATMKTLGLAALMSKAGLYLPAKPSPKLPWFD 51 ++FPVP++IK+ T+VVVISGPNTGGKTA+MKTLG+A+LMSKAGLYLPAK +PKLPWFD Sbjct: 412 SEFPVPINIKVECGTRVVVISGPNTGGKTASMKTLGVASLMSKAGLYLPAKNTPKLPWFD 471 Query: 50 HVLADIGDHQSLEHNL 3 VLADIGDHQSLE NL Sbjct: 472 FVLADIGDHQSLEQNL 487 >gb|KHG26053.1| MutS2 [Gossypium arboreum] Length = 1230 Score = 455 bits (1171), Expect = e-125 Identities = 245/442 (55%), Positives = 317/442 (71%), Gaps = 12/442 (2%) Frame = -1 Query: 1292 WQSICXXXXXXXXXXXXXXXXXXGNLPFGRDREESQKLLEQTTAAF-----LLPQPLDFS 1128 W S+C +P G+ REESQKLL+QTT+A L +PLD S Sbjct: 65 WPSLCNYLSPFTSTSMAFSLTKTAAVPVGQSREESQKLLDQTTSALHALEALKSEPLDLS 124 Query: 1127 GIEDLSEIVGSAVSGELLTVRQLCAVARSLQSARGVLEQLEKMSSENQGDSQWYSPLLEI 948 IED+SEI+ SA SG++LTVR+LC V R L +AR V E+L ++ G + Y+PLLEI Sbjct: 125 VIEDVSEILHSAASGQVLTVRELCRVRRMLGAARAVSEKLAAIAEG--GSLERYTPLLEI 182 Query: 947 LKNCNFLTELEQKIGFCIDCSLSVVKDQASEKLGFIRSERKRNMESLESLLKVVSTRIFQ 768 L+ CNF ELE+KIGFCIDCSLS + +ASE+L IR ERKRNME+L+ LLK VS IFQ Sbjct: 183 LQGCNFQLELERKIGFCIDCSLSTILGRASEELELIREERKRNMENLDFLLKEVSVSIFQ 242 Query: 767 AGGIDSPLVTKRRSRMCVGIRASRRSLLNGGVILGVSSSGATYFMEPRDAVELNNMEVRL 588 AGGID PL+TKRRSRMCVG++A+ + LL GGV+L VSSSGATYFMEP++AVELNN+EV+L Sbjct: 243 AGGIDKPLITKRRSRMCVGVKATHKYLLPGGVVLNVSSSGATYFMEPKEAVELNNIEVKL 302 Query: 587 SNSEKAEELAILNLLTSEIAGSEMEIRYLMDKVVELDLASARGSFAHWINGVCPVLGAIH 408 SNSEKAEE+AIL+LLTSEIA SE EI+YL+D+++E+DLA AR ++A W+NGVCP+L + Sbjct: 303 SNSEKAEEMAILSLLTSEIAESEAEIKYLLDRLIEVDLAFARAAYAQWVNGVCPILSSKE 362 Query: 
407 ERVELNNTGE-SLSVDIECIRHPXXXXXXXXXXXXXXXENSI------HFDGSNGMAKSG 249 + ++N + +LS+DIE ++HP NS+ + G KS Sbjct: 363 SEMLISNGADNALSIDIEGMQHPLLLGSFLSNSTDFITSNSMGPSVLGNTSGEMTPIKSS 422 Query: 248 RLPEHGADFPVPLDIKLGDTTKVVVISGPNTGGKTATMKTLGLAALMSKAGLYLPAKPSP 69 ++ ++FP+P+DIK+ T+VV+ISGPNTGGKTA+MKTLGLA++MSKAG+YLPAK P Sbjct: 423 KVV---SNFPIPIDIKVQCGTRVVIISGPNTGGKTASMKTLGLASIMSKAGMYLPAKKQP 479 Query: 68 KLPWFDHVLADIGDHQSLEHNL 3 +LPWFD VLADIGD QSLE +L Sbjct: 480 RLPWFDLVLADIGDSQSLEQSL 501 >ref|XP_007025649.1| DNA mismatch repair protein MutS, type 2, putative isoform 2 [Theobroma cacao] gi|508781015|gb|EOY28271.1| DNA mismatch repair protein MutS, type 2, putative isoform 2 [Theobroma cacao] Length = 694 Score = 455 bits (1171), Expect = e-125 Identities = 250/415 (60%), Positives = 311/415 (74%), Gaps = 11/415 (2%) Frame = -1 Query: 1214 PFGRDREESQKLLEQTTAAF-----LLPQPLDFSGIEDLSEIVGSAVSGELLTVRQLCAV 1050 P G+ +EESQKLL+QTTAA L +PLD S IED+S I+ SA SG+LLTVR+LC V Sbjct: 12 PIGQSQEESQKLLDQTTAALHAMEALKSEPLDLSAIEDVSGILRSAGSGQLLTVRELCRV 71 Query: 1049 ARSLQSARGVLEQLEKMSSENQGDSQWYSPLLEILKNCNFLTELEQKIGFCIDCSLSVVK 870 R+L +AR V E+L ++ G + Y+PLLEIL+NCNF ELE+KIGFCIDC+LS V Sbjct: 72 RRTLGAARAVSEKLAAVAEG--GSLKRYTPLLEILQNCNFQKELEKKIGFCIDCNLSTVL 129 Query: 869 DQASEKLGFIRSERKRNMESLESLLKVVSTRIFQAGGIDSPLVTKRRSRMCVGIRASRRS 690 D+ASE+L IR+ERKRNM +L+SLLK VS +FQAGGID PL+TKRRSRMCVG+RAS + Sbjct: 130 DRASEELELIRAERKRNMGNLDSLLKEVSVNVFQAGGIDRPLITKRRSRMCVGVRASHKY 189 Query: 689 LLNGGVILGVSSSGATYFMEPRDAVELNNMEVRLSNSEKAEELAILNLLTSEIAGSEMEI 510 LL GV+L VSSSGATYFMEP++AVELNNMEV+LSNSEKAEE+AIL+LLTSEIA SE EI Sbjct: 190 LLPDGVVLNVSSSGATYFMEPKEAVELNNMEVKLSNSEKAEEMAILSLLTSEIAESEAEI 249 Query: 509 RYLMDKVVELDLASARGSFAHWINGVCPVLGAIHERVELNNTGESL-SVDIECIRHPXXX 333 +YL+DK++E+DLA A+ ++A W+NGVCP+ + V ++N ++ SVDIE I+HP Sbjct: 250 KYLLDKLLEVDLAFAKAAYAQWMNGVCPIFSSTESEVLISNGADNAWSVDIEGIQHPLLL 309 Query: 332 XXXXXXXXXXXXENSIHFDGSNGMAKSGRL-----PEHGADFPVPLDIKLGDTTKVVVIS 168 +S D S KSG + + + FPVP+DIK+ T+VVVIS Sbjct: 310 
GSSLRNFTDFIASSS--GDPSITEEKSGAMAAVKSSKGVSSFPVPIDIKVQCGTRVVVIS 367 Query: 167 GPNTGGKTATMKTLGLAALMSKAGLYLPAKPSPKLPWFDHVLADIGDHQSLEHNL 3 GPNTGGKTA+MKTLGLA+LMSKAG+YLPAK P+LPWFD VLADIGD QSLE +L Sbjct: 368 GPNTGGKTASMKTLGLASLMSKAGMYLPAKKQPRLPWFDLVLADIGDSQSLERSL 422 >ref|XP_004505047.1| PREDICTED: uncharacterized protein LOC101503544 [Cicer arietinum] Length = 944 Score = 454 bits (1168), Expect = e-125 Identities = 248/444 (55%), Positives = 309/444 (69%), Gaps = 14/444 (3%) Frame = -1 Query: 1292 WQSICXXXXXXXXXXXXXXXXXXGNLPFGRDREESQKLLEQTTAAFLLPQP-LDFSGIED 1116 W SIC L GR +SQKLL+QT+AA L+PQ +DFSGI D Sbjct: 74 WSSICKQLSSFTSTSMGSSAANNARLLIGRTPHQSQKLLDQTSAARLIPQQHIDFSGIHD 133 Query: 1115 LSEIVGSAVSGELLTVRQLCAVARSLQSARGVLEQLEKMSSENQGDSQWYSPLLEILKNC 936 L++I+ AVSG LLT+ +LC V R+L +AR + L+ ++SE SQ YSPLLEIL+NC Sbjct: 134 LTDILSLAVSGHLLTIPELCKVRRTLTAARELFHTLKHVASE-ANHSQRYSPLLEILQNC 192 Query: 935 NFLTELEQKIGFCIDCSLSVVKDQASEKLGFIRSERKRNMESLESLLKVVSTRIFQAGGI 756 NFL LE+KI +C+DC+LS + D+ASE L IRSERKRN+E L+SLLK VS++IF+AGGI Sbjct: 193 NFLVGLERKIEYCVDCNLSTILDRASEDLEIIRSERKRNLEILDSLLKEVSSQIFRAGGI 252 Query: 755 DSPLVTKRRSRMCVGIRASRRSLLNGGVILGVSSSGATYFMEPRDAVELNNMEVRLSNSE 576 D P +TKRRSRMCVGIRASR+ LL G++L VSSSGATYFMEP++A++LNNMEVRLSNSE Sbjct: 253 DRPFITKRRSRMCVGIRASRKYLLPEGIVLNVSSSGATYFMEPKEAIDLNNMEVRLSNSE 312 Query: 575 KAEELAILNLLTSEIAGSEMEIRYLMDKVVELDLASARGSFAHWINGVCPVL--GAIHER 402 KAEE AIL++L SEIA SE EI YL+DK++E+DLA AR ++A W+NGVCP+ G + R Sbjct: 313 KAEERAILSMLASEIANSESEINYLLDKILEVDLAFARAAYAQWMNGVCPIFSSGTLEGR 372 Query: 401 VELNNTG--------ESLSVDIECIRHPXXXXXXXXXXXXXXXENS---IHFDGSNGMAK 255 + + L+V+IE IRHP + S + NG Sbjct: 373 DSVGEDNDILVVQEDDDLTVNIEGIRHPLLLEKSLENISDNLTQKSGTAVELGNGNGTMA 432 Query: 254 SGRLPEHGADFPVPLDIKLGDTTKVVVISGPNTGGKTATMKTLGLAALMSKAGLYLPAKP 75 S + DFPVP+D K+ TKVVVISGPNTGGKTA+MKTLGLA+LMSKAG++LPAK Sbjct: 433 SNGTSQGITDFPVPVDFKIRHGTKVVVISGPNTGGKTASMKTLGLASLMSKAGMHLPAKR 492 Query: 74 SPKLPWFDHVLADIGDHQSLEHNL 3 SPKLPWFD +LADIGD QSLE NL Sbjct: 493 SPKLPWFDLILADIGDQQSLEQNL 516 
>ref|XP_009795021.1| PREDICTED: DNA mismatch repair protein MSH2 [Nicotiana sylvestris] Length = 908 Score = 452 bits (1163), Expect = e-124 Identities = 244/453 (53%), Positives = 313/453 (69%) Frame = -1 Query: 1361 SKSSLDKIRVXXXXXXXXXXXXEWQSICXXXXXXXXXXXXXXXXXXGNLPFGRDREESQK 1182 S S ++++ EW ++C +P G+ EES K Sbjct: 37 SSESTHRVKLAESLQSETLKLLEWPAVCRQLSAFTSTSMGFAAAQSAVIPVGKTPEESGK 96 Query: 1181 LLEQTTAAFLLPQPLDFSGIEDLSEIVGSAVSGELLTVRQLCAVARSLQSARGVLEQLEK 1002 LL QT+AA +P+PLDFSGIED+S IV ++++G +L++R+LC+V R+L +AR +L+QLE+ Sbjct: 97 LLSQTSAAVAVPRPLDFSGIEDVSPIVNASIAGGVLSIRELCSVKRTLGAARFLLQQLEE 156 Query: 1001 MSSENQGDSQWYSPLLEILKNCNFLTELEQKIGFCIDCSLSVVKDQASEKLGFIRSERKR 822 ++S N + YSPL EIL NC+FL ELEQKI FCIDCS S + D+ASE L IRSERKR Sbjct: 157 IASLNDFSDR-YSPLKEILHNCDFLVELEQKIEFCIDCSFSAILDRASEDLEIIRSERKR 215 Query: 821 NMESLESLLKVVSTRIFQAGGIDSPLVTKRRSRMCVGIRASRRSLLNGGVILGVSSSGAT 642 NME+LESLLK +ST++FQ GG D PLVTKRRSRMCV +RAS RSLL VIL SSSG+T Sbjct: 216 NMENLESLLKQLSTQVFQGGGFDRPLVTKRRSRMCVAVRASHRSLLPNAVILDTSSSGST 275 Query: 641 YFMEPRDAVELNNMEVRLSNSEKAEELAILNLLTSEIAGSEMEIRYLMDKVVELDLASAR 462 YFMEP++AVELNNMEV+LS+SE+ EE IL+LLTSEIA S M+I++L+D+++E+DLA AR Sbjct: 276 YFMEPKEAVELNNMEVKLSSSERIEEQTILSLLTSEIAESNMKIKHLLDRILEIDLAFAR 335 Query: 461 GSFAHWINGVCPVLGAIHERVELNNTGESLSVDIECIRHPXXXXXXXXXXXXXXXENSIH 282 + A WI G CP A+ R N+ E LS+D+E IRHP S Sbjct: 336 AAHAQWIGGACP---ALSSRNCNNSQSELLSIDVEGIRHPLLLESSLRNLSTDVSPRSPD 392 Query: 281 FDGSNGMAKSGRLPEHGADFPVPLDIKLGDTTKVVVISGPNTGGKTATMKTLGLAALMSK 102 D NG+ + A FPVP+DIK+G TKVVVISGPNTGGKTA+MKTLGLA++M K Sbjct: 393 LDQGNGVMNF--KTKSRARFPVPIDIKVGHGTKVVVISGPNTGGKTASMKTLGLASMMLK 450 Query: 101 AGLYLPAKPSPKLPWFDHVLADIGDHQSLEHNL 3 AG+YLPA+ P+LPWFD +LADIGD QSLE +L Sbjct: 451 AGMYLPAQNQPRLPWFDLILADIGDQQSLEQSL 483
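Note: each hit above carries the same summary fields (Length, Score, Expect, Identities, Positives, an optional Gaps field, and Frame). The following is a minimal sketch, not part of the BLAST output, of how those fields could be pulled out of the flattened text with a regular expression. The names `HIT_RE`, `parse_hits`, and `SAMPLE` are hypothetical, and the pattern assumes the legacy BLASTX 2.2.25 text layout shown here, including its bare-exponent E-values such as `e-124`.

```python
import re
from typing import Dict, List

# One hit summary as it appears in the flattened report text.
# Assumption: legacy BLAST text layout; the Gaps field is optional.
HIT_RE = re.compile(
    r">(?P<id>\S+)\s+(?P<desc>.*?)\s+Length = (?P<length>\d+)\s+"
    r"Score = (?P<bits>[\d.]+) bits \((?P<raw>\d+)\), "
    r"Expect = (?P<evalue>[\d.e+-]+)\s+"
    r"Identities = (?P<ident>\d+)/(?P<aln>\d+) \(\d+%\), "
    r"Positives = (?P<pos>\d+)/\d+ \(\d+%\)"
    r"(?:, Gaps = (?P<gaps>\d+)/\d+ \(\d+%\))?"
    r"\s+Frame = (?P<frame>[+-]?\d)",
    re.DOTALL,
)


def parse_hits(text: str) -> List[Dict[str, object]]:
    """Return one dict per hit summary found in the report text."""
    hits = []
    for m in HIT_RE.finditer(text):
        evalue = m.group("evalue")
        # Legacy BLAST prints "e-124" for 1e-124; add the missing mantissa.
        if evalue.startswith("e"):
            evalue = "1" + evalue
        ident = int(m.group("ident"))
        aln = int(m.group("aln"))
        hits.append({
            "id": m.group("id"),
            "description": m.group("desc"),
            "length": int(m.group("length")),
            "bits": float(m.group("bits")),
            "evalue": float(evalue),
            "identities": ident,
            "positives": int(m.group("pos")),
            "alignment_length": aln,
            "gaps": int(m.group("gaps") or 0),
            "frame": int(m.group("frame")),
            # Unrounded percent identity, recomputed from the counts.
            "percent_identity": 100.0 * ident / aln,
        })
    return hits


# Two summaries copied from the report above, alignment rows truncated.
SAMPLE = (
    ">gb|KHG26053.1| MutS2 [Gossypium arboreum] Length = 1230 "
    "Score = 455 bits (1171), Expect = e-125 "
    "Identities = 245/442 (55%), Positives = 317/442 (71%), "
    "Gaps = 12/442 (2%) Frame = -1 Query: 1292 WQSIC "
    ">ref|XP_009795021.1| PREDICTED: DNA mismatch repair protein MSH2 "
    "[Nicotiana sylvestris] Length = 908 "
    "Score = 452 bits (1163), Expect = e-124 "
    "Identities = 244/453 (53%), Positives = 313/453 (69%) "
    "Frame = -1 Query: 1361 SKSS"
)

if __name__ == "__main__":
    for hit in parse_hits(SAMPLE):
        print(hit["id"], hit["bits"], hit["evalue"], hit["frame"])
```

The sketch reads only the per-hit summary fields; it does not attempt to rebuild the Query/Sbjct alignment rows, whose column spacing is lost in the flattened text.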