BLASTX nr result

ID: Cornus23_contig00031859 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cornus23_contig00031859
         (1215 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_010649804.1| PREDICTED: uncharacterized protein LOC100266...   481   e-133
ref|XP_007035442.1| BRCT domain-containing DNA repair protein, p...   439   e-120
ref|XP_007035440.1| BRCT domain-containing DNA repair protein, p...   439   e-120
ref|XP_011071499.1| PREDICTED: uncharacterized protein LOC105156...   438   e-120
ref|XP_008223149.1| PREDICTED: uncharacterized protein LOC103322...   432   e-118
ref|XP_009624053.1| PREDICTED: uncharacterized protein LOC104115...   430   e-117
ref|XP_007035445.1| BRCT domain-containing DNA repair protein, p...   423   e-115
ref|XP_010244657.1| PREDICTED: uncharacterized protein LOC104588...   421   e-115
ref|XP_012455779.1| PREDICTED: uncharacterized protein LOC105777...   421   e-115
ref|XP_012455778.1| PREDICTED: uncharacterized protein LOC105777...   421   e-115
ref|XP_012482072.1| PREDICTED: uncharacterized protein LOC105796...   421   e-115
emb|CDO98931.1| unnamed protein product [Coffea canephora]            419   e-114
ref|XP_007035446.1| BRCT domain-containing DNA repair protein, p...   419   e-114
ref|XP_009779332.1| PREDICTED: uncharacterized protein LOC104228...   418   e-114
ref|XP_009369092.1| PREDICTED: uncharacterized protein LOC103958...   417   e-113
ref|XP_009369091.1| PREDICTED: uncharacterized protein LOC103958...   417   e-113
ref|XP_009369089.1| PREDICTED: uncharacterized protein LOC103958...   417   e-113
ref|XP_007227074.1| hypothetical protein PRUPE_ppa000432mg [Prun...   415   e-113
ref|XP_010321010.1| PREDICTED: uncharacterized protein LOC101247...   414   e-113
gb|KHG13951.1| PAX-interacting 1 [Gossypium arboreum]                 412   e-112

>ref|XP_010649804.1| PREDICTED: uncharacterized protein LOC100266667 [Vitis vinifera]
          Length = 1239

 Score =  481 bits (1237), Expect = e-133
 Identities = 259/404 (64%), Positives = 303/404 (75%)
 Frame = -3

Query: 1213 RSHRKLSGQFNITDNQHGSFKQCVGQEANAQSTARLKRSKTDSISTCMDLSIKRQTRSST 1034
            R+HR L G+ N   +  G  K   GQEA      R  RSK+ +  T     +KR+ +SS+
Sbjct: 800  RTHRNLLGRANSITDLDGPPKPFAGQEAIEPFIPRQTRSKSKARGTFSGFDMKRKIQSSS 859

Query: 1033 CAGSVLSSLDLPSEQRSLQQSVVRGGSGDAALNHNYVDINGNTILKDVDGVQASQQVDRK 854
             A   LSSLD  SE   L+QS+ + G+GDA LN + V++N   I +D  G +AS+  +  
Sbjct: 860  NASLGLSSLDQNSEGILLKQSLDKPGAGDAMLNRSSVNLNRKKISRDPTGERASKHSEGN 919

Query: 853  CDADKTSSAESIGVNVRLDTSPREGCKPSGSECATPVNSRTPINEASPICMGDEYLKQSC 674
             DAD +S AE    N  L    RE CKPSGS C TPVNS TP N ASP+CMG+EY+KQSC
Sbjct: 920  SDADPSSPAEGREGNAGL----REMCKPSGSVCTTPVNSVTPTNAASPVCMGNEYVKQSC 975

Query: 673  RKNLARACLMKEIDSLIGDGLERISAFKDSRKRRDMANVRALFSHHLDDDIIKQQKKILT 494
            +KNL R  L+KEI++L   G    SA KDSR+RR+++NVR LFS HLDDDIIKQQKKILT
Sbjct: 976  KKNL-RTSLLKEINNLTDTGPGPTSAVKDSRRRREISNVRVLFSQHLDDDIIKQQKKILT 1034

Query: 493  RLGASIASSISDATHFITDNFVRTRNMLEAIAFGKPVVTHLWLESCGQASCFIDEKNHIL 314
            RLG S+ASSISDATHFITD FVRTRNMLEAIA+GKPVVTHLWLESC QA CFIDEK +IL
Sbjct: 1035 RLGVSVASSISDATHFITDAFVRTRNMLEAIAYGKPVVTHLWLESCVQARCFIDEKGYIL 1094

Query: 313  RDAKKEKEFGFSLPVSLARACQSPLLQGQKVLITPNTKPGKEVLASLVKAVHGLAVERIG 134
            RDAKKEKE GFS+PVSLARACQ PLLQG+KVLITPNTKPGKE++ASLVKAV G  VERIG
Sbjct: 1095 RDAKKEKELGFSMPVSLARACQHPLLQGRKVLITPNTKPGKEIIASLVKAVDGQPVERIG 1154

Query: 133  RSTLKDDRIPDDLLVLSCEEDYAICVPFLEKGAAVYSSELLLNG 2
            RS LKD + PDDLL+LSC+EDYA+C P+LEKGAAVYSSELLLNG
Sbjct: 1155 RSVLKDGKFPDDLLILSCDEDYAVCEPYLEKGAAVYSSELLLNG 1198


>ref|XP_007035442.1| BRCT domain-containing DNA repair protein, putative isoform 3
            [Theobroma cacao] gi|508714471|gb|EOY06368.1| BRCT
            domain-containing DNA repair protein, putative isoform 3
            [Theobroma cacao]
          Length = 1200

 Score =  439 bits (1128), Expect = e-120
 Identities = 240/407 (58%), Positives = 289/407 (71%), Gaps = 3/407 (0%)
 Frame = -3

Query: 1213 RSHRKLSGQFNITDNQHGSFKQCVGQEANAQSTARLKRSKTDSISTCMDLSIKRQTRSST 1034
            RS  K+      +DN     K+ V QE   QS A LKRS++++ ST +  S +R TRSS 
Sbjct: 779  RSSWKMCVDVGESDNLKAQSKRSVLQEDKGQSIAVLKRSRSNNRSTHIHSSTRRITRSSV 838

Query: 1033 CAGSVLSSLDLPSEQRSLQQSVVRGGSGDAALNHNYVDINGNTILKDVDG---VQASQQV 863
             +  VL   D   E +   QS  + GS D  +N+N  ++NG  +   + G    ++++  
Sbjct: 839  NSRPVLYFSDQNPEGKLSHQSSDKEGSEDDVINYNSTEMNGRMVSTRITGPEPAKSAKHS 898

Query: 862  DRKCDADKTSSAESIGVNVRLDTSPREGCKPSGSECATPVNSRTPINEASPICMGDEYLK 683
            D   DA  +  AES+ VNV LD SP+E  K  GS+C TPVN  TPIN ASP+CMG+EY K
Sbjct: 899  DGNRDAVSSPIAESVAVNVTLDKSPKEKSKSPGSKCTTPVNCPTPINAASPVCMGEEYYK 958

Query: 682  QSCRKNLARACLMKEIDSLIGDGLERISAFKDSRKRRDMANVRALFSHHLDDDIIKQQKK 503
            QSC+KNL+++ L KE+ SL     E IS  KD RKRRD+ANVR LFS+HLD+DIIKQQKK
Sbjct: 959  QSCKKNLSKSSLNKELKSLSPIEPEPISPLKDMRKRRDLANVRVLFSNHLDEDIIKQQKK 1018

Query: 502  ILTRLGASIASSISDATHFITDNFVRTRNMLEAIAFGKPVVTHLWLESCGQASCFIDEKN 323
            IL RLG S  SSI DATHFITD FVRTRNMLEAIA GKPVVT+LWLES GQ +  IDE+ 
Sbjct: 1019 ILARLGISEVSSILDATHFITDKFVRTRNMLEAIASGKPVVTYLWLESIGQVNIHIDEEA 1078

Query: 322  HILRDAKKEKEFGFSLPVSLARACQSPLLQGQKVLITPNTKPGKEVLASLVKAVHGLAVE 143
            +ILRD +KEKE GF +PVSLARA + PLLQG++V ITPNTKPGKE ++ LV AV G AVE
Sbjct: 1079 YILRDIRKEKELGFCMPVSLARARKRPLLQGRRVFITPNTKPGKETISHLVTAVGGQAVE 1138

Query: 142  RIGRSTLKDDRIPDDLLVLSCEEDYAICVPFLEKGAAVYSSELLLNG 2
            RIGRS  KDD++PDDLLVLSCEEDY ICVPFLEKGAAVYSSELLLNG
Sbjct: 1139 RIGRSATKDDKVPDDLLVLSCEEDYVICVPFLEKGAAVYSSELLLNG 1185


>ref|XP_007035440.1| BRCT domain-containing DNA repair protein, putative isoform 1
            [Theobroma cacao] gi|590660596|ref|XP_007035441.1| BRCT
            domain-containing DNA repair protein, putative isoform 1
            [Theobroma cacao] gi|508714469|gb|EOY06366.1| BRCT
            domain-containing DNA repair protein, putative isoform 1
            [Theobroma cacao] gi|508714470|gb|EOY06367.1| BRCT
            domain-containing DNA repair protein, putative isoform 1
            [Theobroma cacao]
          Length = 1225

 Score =  439 bits (1128), Expect = e-120
 Identities = 240/407 (58%), Positives = 289/407 (71%), Gaps = 3/407 (0%)
 Frame = -3

Query: 1213 RSHRKLSGQFNITDNQHGSFKQCVGQEANAQSTARLKRSKTDSISTCMDLSIKRQTRSST 1034
            RS  K+      +DN     K+ V QE   QS A LKRS++++ ST +  S +R TRSS 
Sbjct: 779  RSSWKMCVDVGESDNLKAQSKRSVLQEDKGQSIAVLKRSRSNNRSTHIHSSTRRITRSSV 838

Query: 1033 CAGSVLSSLDLPSEQRSLQQSVVRGGSGDAALNHNYVDINGNTILKDVDG---VQASQQV 863
             +  VL   D   E +   QS  + GS D  +N+N  ++NG  +   + G    ++++  
Sbjct: 839  NSRPVLYFSDQNPEGKLSHQSSDKEGSEDDVINYNSTEMNGRMVSTRITGPEPAKSAKHS 898

Query: 862  DRKCDADKTSSAESIGVNVRLDTSPREGCKPSGSECATPVNSRTPINEASPICMGDEYLK 683
            D   DA  +  AES+ VNV LD SP+E  K  GS+C TPVN  TPIN ASP+CMG+EY K
Sbjct: 899  DGNRDAVSSPIAESVAVNVTLDKSPKEKSKSPGSKCTTPVNCPTPINAASPVCMGEEYYK 958

Query: 682  QSCRKNLARACLMKEIDSLIGDGLERISAFKDSRKRRDMANVRALFSHHLDDDIIKQQKK 503
            QSC+KNL+++ L KE+ SL     E IS  KD RKRRD+ANVR LFS+HLD+DIIKQQKK
Sbjct: 959  QSCKKNLSKSSLNKELKSLSPIEPEPISPLKDMRKRRDLANVRVLFSNHLDEDIIKQQKK 1018

Query: 502  ILTRLGASIASSISDATHFITDNFVRTRNMLEAIAFGKPVVTHLWLESCGQASCFIDEKN 323
            IL RLG S  SSI DATHFITD FVRTRNMLEAIA GKPVVT+LWLES GQ +  IDE+ 
Sbjct: 1019 ILARLGISEVSSILDATHFITDKFVRTRNMLEAIASGKPVVTYLWLESIGQVNIHIDEEA 1078

Query: 322  HILRDAKKEKEFGFSLPVSLARACQSPLLQGQKVLITPNTKPGKEVLASLVKAVHGLAVE 143
            +ILRD +KEKE GF +PVSLARA + PLLQG++V ITPNTKPGKE ++ LV AV G AVE
Sbjct: 1079 YILRDIRKEKELGFCMPVSLARARKRPLLQGRRVFITPNTKPGKETISHLVTAVGGQAVE 1138

Query: 142  RIGRSTLKDDRIPDDLLVLSCEEDYAICVPFLEKGAAVYSSELLLNG 2
            RIGRS  KDD++PDDLLVLSCEEDY ICVPFLEKGAAVYSSELLLNG
Sbjct: 1139 RIGRSATKDDKVPDDLLVLSCEEDYVICVPFLEKGAAVYSSELLLNG 1185


>ref|XP_011071499.1| PREDICTED: uncharacterized protein LOC105156929 [Sesamum indicum]
          Length = 1158

 Score =  438 bits (1126), Expect = e-120
 Identities = 227/378 (60%), Positives = 277/378 (73%)
 Frame = -3

Query: 1135 EANAQSTARLKRSKTDSISTCMDLSIKRQTRSSTCAGSVLSSLDLPSEQRSLQQSVVRGG 956
            +AN Q+  RLKRS+  + ST    S       +   GS LSSLD  S    L Q++V G 
Sbjct: 746  KANTQNHGRLKRSREVAASTVNPGSSHLNQLHN---GSALSSLDTQSGGMLLHQTIVNGS 802

Query: 955  SGDAALNHNYVDINGNTILKDVDGVQASQQVDRKCDADKTSSAESIGVNVRLDTSPREGC 776
            S + +  H+   ++    L D  G   S+Q D K D +  +SAE    N + + SPRE C
Sbjct: 803  SRNDSAEHDSNCMDAKASLHDAAGTSTSKQHDEKTDDE--TSAEGAETNGKAEASPRERC 860

Query: 775  KPSGSECATPVNSRTPINEASPICMGDEYLKQSCRKNLARACLMKEIDSLIGDGLERISA 596
              S S C TP    TPIN  SPICMGDEY KQSCRK+L+R  L++EI++L+       S 
Sbjct: 861  GISSSACVTPATCTTPINNVSPICMGDEYHKQSCRKSLSRFSLIREINNLVTGSPGPYST 920

Query: 595  FKDSRKRRDMANVRALFSHHLDDDIIKQQKKILTRLGASIASSISDATHFITDNFVRTRN 416
             KDSRKR+D+ N++ LFS HLD D+ K QK+IL RLG ++ASS++DATHF+ D FVRTRN
Sbjct: 921  MKDSRKRKDITNIKVLFSQHLDVDVTKLQKRILARLGGAVASSMADATHFVADEFVRTRN 980

Query: 415  MLEAIAFGKPVVTHLWLESCGQASCFIDEKNHILRDAKKEKEFGFSLPVSLARACQSPLL 236
            MLEAIA+GKPVVTHLWLESCGQASC IDEKN+ILRDA+KE+E+GFSLP SL+RACQ PLL
Sbjct: 981  MLEAIAYGKPVVTHLWLESCGQASCLIDEKNYILRDARKEREYGFSLPGSLSRACQHPLL 1040

Query: 235  QGQKVLITPNTKPGKEVLASLVKAVHGLAVERIGRSTLKDDRIPDDLLVLSCEEDYAICV 56
            QGQKVL+TPNTKPGK++LA+LVKAV GLAVER+GRS LKD+++PDDLL+LSCEEDY ICV
Sbjct: 1041 QGQKVLVTPNTKPGKDILANLVKAVGGLAVERLGRSVLKDEKLPDDLLILSCEEDYDICV 1100

Query: 55   PFLEKGAAVYSSELLLNG 2
            PFLEKG AVYSSELLLNG
Sbjct: 1101 PFLEKGGAVYSSELLLNG 1118


>ref|XP_008223149.1| PREDICTED: uncharacterized protein LOC103322970 [Prunus mume]
          Length = 1266

 Score =  432 bits (1111), Expect = e-118
 Identities = 229/404 (56%), Positives = 281/404 (69%)
 Frame = -3

Query: 1213 RSHRKLSGQFNITDNQHGSFKQCVGQEANAQSTARLKRSKTDSISTCMDLSIKRQTRSST 1034
            RS R +S Q    +N  G  +  V  +   Q     KR ++   + C D+ + R+ RSST
Sbjct: 837  RSRRNMSIQVYGPNNSDGPSEPSVQADKIGQRVNSHKRLRSGVKNICNDIKLTRRMRSST 896

Query: 1033 CAGSVLSSLDLPSEQRSLQQSVVRGGSGDAALNHNYVDINGNTILKDVDGVQASQQVDRK 854
            C    L             Q +++GG G+A L+ N    +G  I + ++G +     DRK
Sbjct: 897  CGEQNLDG--------KFAQEILKGGPGEAPLHCNSSHKDGRMISEIINGKKVVGISDRK 948

Query: 853  CDADKTSSAESIGVNVRLDTSPREGCKPSGSECATPVNSRTPINEASPICMGDEYLKQSC 674
             DA+ +S+ +        D  PRE CKPS S C TPVN++ P+N ASP+CMG+EY KQ+C
Sbjct: 949  SDANFSSATKMS------DEFPREKCKPSDSSCTTPVNNKVPVNAASPVCMGNEYFKQTC 1002

Query: 673  RKNLARACLMKEIDSLIGDGLERISAFKDSRKRRDMANVRALFSHHLDDDIIKQQKKILT 494
            ++ L  + L+KEI  L     E  S   + R+RRDM +VR L+SHHLD+DIIK+QKKIL 
Sbjct: 1003 KRRLLGSSLLKEIRGLSATVCEPTST-PELRRRRDMTDVRVLYSHHLDEDIIKKQKKILA 1061

Query: 493  RLGASIASSISDATHFITDNFVRTRNMLEAIAFGKPVVTHLWLESCGQASCFIDEKNHIL 314
            RLG S+ASS++DATHFI D FVRTRNMLEAIA GKPVVTHLWLESCGQA CF+DEK+HIL
Sbjct: 1062 RLGVSVASSMTDATHFIADQFVRTRNMLEAIAVGKPVVTHLWLESCGQAGCFVDEKSHIL 1121

Query: 313  RDAKKEKEFGFSLPVSLARACQSPLLQGQKVLITPNTKPGKEVLASLVKAVHGLAVERIG 134
            RD KKEKEFGFS+P SLARACQ PLLQ +KV ITPNTKPGKE++++LVKAV G AVERIG
Sbjct: 1122 RDNKKEKEFGFSMPASLARACQHPLLQDRKVFITPNTKPGKEIISNLVKAVKGQAVERIG 1181

Query: 133  RSTLKDDRIPDDLLVLSCEEDYAICVPFLEKGAAVYSSELLLNG 2
            RSTL  D+IPDDLLVLSCEEDY ICVP LEKGAAVYSSELLLNG
Sbjct: 1182 RSTLNADKIPDDLLVLSCEEDYEICVPLLEKGAAVYSSELLLNG 1225


>ref|XP_009624053.1| PREDICTED: uncharacterized protein LOC104115175 [Nicotiana
            tomentosiformis]
          Length = 1168

 Score =  430 bits (1106), Expect = e-117
 Identities = 229/404 (56%), Positives = 282/404 (69%)
 Frame = -3

Query: 1213 RSHRKLSGQFNITDNQHGSFKQCVGQEANAQSTARLKRSKTDSISTCMDLSIKRQTRSST 1034
            RSHRK+                 +GQE   QS  R KR + D  ST +++S K++  SS 
Sbjct: 747  RSHRKMP---------------TMGQETTIQSCRRSKRLRGDQTSTSINVSTKKRKCSSE 791

Query: 1033 CAGSVLSSLDLPSEQRSLQQSVVRGGSGDAALNHNYVDINGNTILKDVDGVQASQQVDRK 854
            C    ++S +  S ++ LQ+ + +      + N  + D +  TIL      ++ +  +RK
Sbjct: 792  CTLPDIASSERGSHKKLLQEGIDKRHLDGNSTNDAFADGSAKTILH-----KSIKDSNRK 846

Query: 853  CDADKTSSAESIGVNVRLDTSPREGCKPSGSECATPVNSRTPINEASPICMGDEYLKQSC 674
             + + T S +        ++S  E CK S S C TP NS+   N  SPICMGDEY KQSC
Sbjct: 847  TNVEITRSVDEAQGT---ESSTGEQCKASASACTTPTNSKIQKNAVSPICMGDEYHKQSC 903

Query: 673  RKNLARACLMKEIDSLIGDGLERISAFKDSRKRRDMANVRALFSHHLDDDIIKQQKKILT 494
            RKN++R+ L++EI SL   G +  S  KDSRKRR+M NVR LFS HLD DIIKQQKKIL 
Sbjct: 904  RKNMSRSSLLREITSLHSTGTQIGSTIKDSRKRREMTNVRVLFSQHLDADIIKQQKKILA 963

Query: 493  RLGASIASSISDATHFITDNFVRTRNMLEAIAFGKPVVTHLWLESCGQASCFIDEKNHIL 314
            RLGAS  S +SDATHF+ D FVRTRN+LEAIA GKPVVTHLWLESCGQASC IDEKN+IL
Sbjct: 964  RLGASSVSCMSDATHFVADEFVRTRNVLEAIAVGKPVVTHLWLESCGQASCLIDEKNYIL 1023

Query: 313  RDAKKEKEFGFSLPVSLARACQSPLLQGQKVLITPNTKPGKEVLASLVKAVHGLAVERIG 134
            RDA+KEKEFGFS+PVSLARACQ PLLQG +V  TPNTKPGK++LASLVKAVHGLAVER+G
Sbjct: 1024 RDARKEKEFGFSMPVSLARACQHPLLQGYRVFTTPNTKPGKDILASLVKAVHGLAVERLG 1083

Query: 133  RSTLKDDRIPDDLLVLSCEEDYAICVPFLEKGAAVYSSELLLNG 2
            RS +K++ +PDDLLVLSCEEDY +C+PFLEKG+ VYSSELLLNG
Sbjct: 1084 RSVMKEEVVPDDLLVLSCEEDYEVCIPFLEKGSTVYSSELLLNG 1127


>ref|XP_007035445.1| BRCT domain-containing DNA repair protein, putative isoform 6
            [Theobroma cacao] gi|508714474|gb|EOY06371.1| BRCT
            domain-containing DNA repair protein, putative isoform 6
            [Theobroma cacao]
          Length = 1254

 Score =  423 bits (1088), Expect = e-115
 Identities = 240/436 (55%), Positives = 289/436 (66%), Gaps = 32/436 (7%)
 Frame = -3

Query: 1213 RSHRKLSGQFNITDNQHGSFKQCVGQEANAQSTARLKRSKTDSISTCMDLSIKRQTRSST 1034
            RS  K+      +DN     K+ V QE   QS A LKRS++++ ST +  S +R TRSS 
Sbjct: 779  RSSWKMCVDVGESDNLKAQSKRSVLQEDKGQSIAVLKRSRSNNRSTHIHSSTRRITRSSV 838

Query: 1033 CAGSVLSSLDLPSEQRSLQQSVVRGGSGDAALNHNYVDINGNTILKDVDG---VQASQQV 863
             +  VL   D   E +   QS  + GS D  +N+N  ++NG  +   + G    ++++  
Sbjct: 839  NSRPVLYFSDQNPEGKLSHQSSDKEGSEDDVINYNSTEMNGRMVSTRITGPEPAKSAKHS 898

Query: 862  DRKCDADKTSSAESIGVNVRLDTSPREGCKPSGSECATPVNSRTPINEASPICMGDEYLK 683
            D   DA  +  AES+ VNV LD SP+E  K  GS+C TPVN  TPIN ASP+CMG+EY K
Sbjct: 899  DGNRDAVSSPIAESVAVNVTLDKSPKEKSKSPGSKCTTPVNCPTPINAASPVCMGEEYYK 958

Query: 682  QSCRKNLARACLMKEIDSLIGDGLERISAFKDSRKRRDMANVRALFSHHLDDDIIKQQKK 503
            QSC+KNL+++ L KE+ SL     E IS  KD RKRRD+ANVR LFS+HLD+DIIKQQKK
Sbjct: 959  QSCKKNLSKSSLNKELKSLSPIEPEPISPLKDMRKRRDLANVRVLFSNHLDEDIIKQQKK 1018

Query: 502  ILTRLGASIASSISDATHFITDNFVRTRNMLEAIAFGKPVVTHLWLESCGQASCFIDEKN 323
            IL RLG S  SSI DATHFITD FVRTRNMLEAIA GKPVVT+LWLES GQ +  IDE+ 
Sbjct: 1019 ILARLGISEVSSILDATHFITDKFVRTRNMLEAIASGKPVVTYLWLESIGQVNIHIDEEA 1078

Query: 322  HILRDAKKEKEFGFSLPVSLARACQSPLLQGQKVLITPNTKPGKEVLASLVKAVHGLAVE 143
            +ILRD +KEKE GF +PVSLARA + PLLQG++V ITPNTKPGKE ++ LV AV G AVE
Sbjct: 1079 YILRDIRKEKELGFCMPVSLARARKRPLLQGRRVFITPNTKPGKETISHLVTAVGGQAVE 1138

Query: 142  RIGRSTLKDDRIPDDLLVLSCEEDYAICVPFLEK-------------------------- 41
            RIGRS  KDD++PDDLLVLSCEEDY ICVPFLEK                          
Sbjct: 1139 RIGRSATKDDKVPDDLLVLSCEEDYVICVPFLEKGYKCFLSYLLACLMKFGLLLESFAAF 1198

Query: 40   ---GAAVYSSELLLNG 2
               GAAVYSSELLLNG
Sbjct: 1199 MLSGAAVYSSELLLNG 1214


>ref|XP_010244657.1| PREDICTED: uncharacterized protein LOC104588427 isoform X1 [Nelumbo
            nucifera]
          Length = 1228

 Score =  421 bits (1082), Expect = e-115
 Identities = 233/407 (57%), Positives = 283/407 (69%), Gaps = 3/407 (0%)
 Frame = -3

Query: 1213 RSHRKLSGQFNITDNQHGSFKQCVGQEANAQSTARLKRSKTDSISTCMDLSIKRQTRSST 1034
            R+ R +S  FN      G      G+E   QST   KRS +D+     +L ++++TRS  
Sbjct: 797  RTRRSISSHFNEVGLLDGPSIVVKGKEPKEQSTIWRKRSNSDT-GINFNLDMRKRTRSG- 854

Query: 1033 CAGSVLSSLDLPSEQRSLQQSVVRGGSGDAALNHNYVDINGNTILKDVDGVQASQQVDRK 854
                V     LP  ++S ++ +      D+A +H+   +N     K + GV     VD K
Sbjct: 855  ----VYPHPFLPFPEKSSKRLMGHKPGSDSAGSHSLDVVNR----KVIPGV-----VDAK 901

Query: 853  CDAD---KTSSAESIGVNVRLDTSPREGCKPSGSECATPVNSRTPINEASPICMGDEYLK 683
               D   K  S E    N +   SP E  K   SEC+TPVN+ TPIN ASP+CMGDEY K
Sbjct: 902  VSPDSGSKNESVEGAKGNAKFVESPNEKAKRPCSECSTPVNATTPINAASPVCMGDEYHK 961

Query: 682  QSCRKNLARACLMKEIDSLIGDGLERISAFKDSRKRRDMANVRALFSHHLDDDIIKQQKK 503
            QSC KNL+++ LMKE+  L          +KD R+RRD++++R LFSHHLD+DIIKQQKK
Sbjct: 962  QSC-KNLSKSFLMKELVRLDASEAVPTPVWKDMRRRRDLSSIRVLFSHHLDEDIIKQQKK 1020

Query: 502  ILTRLGASIASSISDATHFITDNFVRTRNMLEAIAFGKPVVTHLWLESCGQASCFIDEKN 323
            ILTRLG SIAS  SDATHF+ D FVRTRNMLEAIA GKPVVTHLWLESCGQASCFIDEKN
Sbjct: 1021 ILTRLGISIASCSSDATHFVADKFVRTRNMLEAIALGKPVVTHLWLESCGQASCFIDEKN 1080

Query: 322  HILRDAKKEKEFGFSLPVSLARACQSPLLQGQKVLITPNTKPGKEVLASLVKAVHGLAVE 143
            +ILRD+KKEKE GFS+PVSLARACQ P+LQG++V +TPN KP KEV+ASLV+AV G AVE
Sbjct: 1081 YILRDSKKEKEIGFSMPVSLARACQHPILQGKRVFVTPNIKPSKEVVASLVRAVQGQAVE 1140

Query: 142  RIGRSTLKDDRIPDDLLVLSCEEDYAICVPFLEKGAAVYSSELLLNG 2
            RIGRS +KDD+IPDDLLVLSCEEDYA+CVP LEKGAA+YSSEL+LNG
Sbjct: 1141 RIGRSVVKDDKIPDDLLVLSCEEDYAVCVPILEKGAAIYSSELVLNG 1187


>ref|XP_012455779.1| PREDICTED: uncharacterized protein LOC105777205 isoform X2 [Gossypium
            raimondii]
          Length = 1113

 Score =  421 bits (1081), Expect = e-115
 Identities = 231/407 (56%), Positives = 285/407 (70%), Gaps = 3/407 (0%)
 Frame = -3

Query: 1213 RSHRKLSGQFNITDNQHGSFKQCVGQEANAQSTARLKRSKTDSISTCMDLSIKRQTRSST 1034
            RS RK+S     +DN      + V Q  N  S   +K+S+ ++ STC+  +  R TRSS 
Sbjct: 668  RSSRKMSVHVGESDNLEAPSGKSV-QLDNEPSIPVVKKSRRNNRSTCIRSTTVRITRSSR 726

Query: 1033 CAGSVLSSLDLPSEQRSLQQSVVRGGSGDAALNHNYVDINGNTILKDVDGVQASQQVDRK 854
                VL   D  SE +  +QS  + GS D A+N N   +N   I   + G +A++++   
Sbjct: 727  NTCPVLHFPDQNSEGKLSRQSSDKQGSKDNAVNCNSTKMNRRMISTSITGPEAAKEIQHS 786

Query: 853  CD---ADKTSSAESIGVNVRLDTSPREGCKPSGSECATPVNSRTPINEASPICMGDEYLK 683
                 A  +  AE++ VNV  + SP E  +  GS C TPVN  TPIN ASP+CMG+EY K
Sbjct: 787  GGNHVAVSSPIAENLAVNVATNKSPEEKSRSLGSLCTTPVNCPTPINAASPVCMGEEYFK 846

Query: 682  QSCRKNLARACLMKEIDSLIGDGLERISAFKDSRKRRDMANVRALFSHHLDDDIIKQQKK 503
            QSC+KNL+++ L+KE+ SL     E IS  KD RKRRD+A++R LFS+HLD+DIIKQQKK
Sbjct: 847  QSCKKNLSKSLLIKELRSLNPIDPEPISPSKDMRKRRDLADIRVLFSNHLDEDIIKQQKK 906

Query: 502  ILTRLGASIASSISDATHFITDNFVRTRNMLEAIAFGKPVVTHLWLESCGQASCFIDEKN 323
            IL RLG S ASSI  ATHF+TD FVRTRNMLEAIA GKPVVTHLWLES GQ +  IDE+ 
Sbjct: 907  ILARLGISEASSILAATHFVTDKFVRTRNMLEAIASGKPVVTHLWLESVGQVNIHIDEEA 966

Query: 322  HILRDAKKEKEFGFSLPVSLARACQSPLLQGQKVLITPNTKPGKEVLASLVKAVHGLAVE 143
            +ILRD KKEKEFGF +P SLARAC+ PLLQG++VLITPNTKP KE +  LV  +HG A+E
Sbjct: 967  YILRDIKKEKEFGFCMPASLARACRRPLLQGRRVLITPNTKPNKETIVHLVAVLHGQALE 1026

Query: 142  RIGRSTLKDDRIPDDLLVLSCEEDYAICVPFLEKGAAVYSSELLLNG 2
            RIGRS +KDD++ DDLL+LSCEEDYAICVPFLEKGAAVYSSELLLNG
Sbjct: 1027 RIGRSAMKDDKVLDDLLILSCEEDYAICVPFLEKGAAVYSSELLLNG 1073


>ref|XP_012455778.1| PREDICTED: uncharacterized protein LOC105777205 isoform X1 [Gossypium
            raimondii] gi|763805618|gb|KJB72556.1| hypothetical
            protein B456_011G184800 [Gossypium raimondii]
          Length = 1215

 Score =  421 bits (1081), Expect = e-115
 Identities = 231/407 (56%), Positives = 285/407 (70%), Gaps = 3/407 (0%)
 Frame = -3

Query: 1213 RSHRKLSGQFNITDNQHGSFKQCVGQEANAQSTARLKRSKTDSISTCMDLSIKRQTRSST 1034
            RS RK+S     +DN      + V Q  N  S   +K+S+ ++ STC+  +  R TRSS 
Sbjct: 770  RSSRKMSVHVGESDNLEAPSGKSV-QLDNEPSIPVVKKSRRNNRSTCIRSTTVRITRSSR 828

Query: 1033 CAGSVLSSLDLPSEQRSLQQSVVRGGSGDAALNHNYVDINGNTILKDVDGVQASQQVDRK 854
                VL   D  SE +  +QS  + GS D A+N N   +N   I   + G +A++++   
Sbjct: 829  NTCPVLHFPDQNSEGKLSRQSSDKQGSKDNAVNCNSTKMNRRMISTSITGPEAAKEIQHS 888

Query: 853  CD---ADKTSSAESIGVNVRLDTSPREGCKPSGSECATPVNSRTPINEASPICMGDEYLK 683
                 A  +  AE++ VNV  + SP E  +  GS C TPVN  TPIN ASP+CMG+EY K
Sbjct: 889  GGNHVAVSSPIAENLAVNVATNKSPEEKSRSLGSLCTTPVNCPTPINAASPVCMGEEYFK 948

Query: 682  QSCRKNLARACLMKEIDSLIGDGLERISAFKDSRKRRDMANVRALFSHHLDDDIIKQQKK 503
            QSC+KNL+++ L+KE+ SL     E IS  KD RKRRD+A++R LFS+HLD+DIIKQQKK
Sbjct: 949  QSCKKNLSKSLLIKELRSLNPIDPEPISPSKDMRKRRDLADIRVLFSNHLDEDIIKQQKK 1008

Query: 502  ILTRLGASIASSISDATHFITDNFVRTRNMLEAIAFGKPVVTHLWLESCGQASCFIDEKN 323
            IL RLG S ASSI  ATHF+TD FVRTRNMLEAIA GKPVVTHLWLES GQ +  IDE+ 
Sbjct: 1009 ILARLGISEASSILAATHFVTDKFVRTRNMLEAIASGKPVVTHLWLESVGQVNIHIDEEA 1068

Query: 322  HILRDAKKEKEFGFSLPVSLARACQSPLLQGQKVLITPNTKPGKEVLASLVKAVHGLAVE 143
            +ILRD KKEKEFGF +P SLARAC+ PLLQG++VLITPNTKP KE +  LV  +HG A+E
Sbjct: 1069 YILRDIKKEKEFGFCMPASLARACRRPLLQGRRVLITPNTKPNKETIVHLVAVLHGQALE 1128

Query: 142  RIGRSTLKDDRIPDDLLVLSCEEDYAICVPFLEKGAAVYSSELLLNG 2
            RIGRS +KDD++ DDLL+LSCEEDYAICVPFLEKGAAVYSSELLLNG
Sbjct: 1129 RIGRSAMKDDKVLDDLLILSCEEDYAICVPFLEKGAAVYSSELLLNG 1175


>ref|XP_012482072.1| PREDICTED: uncharacterized protein LOC105796804 isoform X1 [Gossypium
            raimondii] gi|763761329|gb|KJB28583.1| hypothetical
            protein B456_005G056800 [Gossypium raimondii]
          Length = 1136

 Score =  421 bits (1081), Expect = e-115
 Identities = 229/407 (56%), Positives = 284/407 (69%), Gaps = 3/407 (0%)
 Frame = -3

Query: 1213 RSHRKLSGQFNITDNQHGSFKQCVGQEANAQSTARLKRSKTDSISTCMDLSIKRQTRSST 1034
            RS RK+      +D      ++    + N + TA  KRS+ ++ STC+  S +R  RSS 
Sbjct: 691  RSSRKMPVGLGESDKMEAQSRKPAQPDDNGKPTAMQKRSRGNNRSTCIPSSTRRTARSSV 750

Query: 1033 CAGSVLSSLDLPSEQRSLQQSVVRGGSGDAALNHNYVDINGNTILKDVDGVQASQQVDR- 857
                +    D  SE +  +QS+ + GS    LN N+ D NG  I K   G +A++ +   
Sbjct: 751  NTCPLPYFSDQNSEGKLSRQSLDKQGSDADELNCNFSDKNGRMISKRKIGPKAAKAITHA 810

Query: 856  --KCDADKTSSAESIGVNVRLDTSPREGCKPSGSECATPVNSRTPINEASPICMGDEYLK 683
                DA   S+AE++ VNV  D SP+E  +  GS C TP N  TPIN ASP+CMG+EY K
Sbjct: 811  GGNPDAISLSNAENLTVNVDSDKSPKEKSRSPGSLCTTPTNHLTPINAASPVCMGEEYYK 870

Query: 682  QSCRKNLARACLMKEIDSLIGDGLERISAFKDSRKRRDMANVRALFSHHLDDDIIKQQKK 503
             SC+KNL +A L+KE+ SL  +  E IS  KD RKRR++A+VR LFS+HLD+DI+KQQKK
Sbjct: 871  MSCKKNLLKASLIKELRSLCPNEAEPISPLKDMRKRRNLADVRVLFSNHLDEDILKQQKK 930

Query: 502  ILTRLGASIASSISDATHFITDNFVRTRNMLEAIAFGKPVVTHLWLESCGQASCFIDEKN 323
            IL RLG   AS+I DATHFITD FVRTRNMLEAIA GK VV+HLWLES GQ +  IDE+ 
Sbjct: 931  ILARLGIHEASTILDATHFITDKFVRTRNMLEAIASGKSVVSHLWLESIGQVNIHIDEEA 990

Query: 322  HILRDAKKEKEFGFSLPVSLARACQSPLLQGQKVLITPNTKPGKEVLASLVKAVHGLAVE 143
            +ILRD KKEKE GF +PVSLARA + PLLQG++VLITP TKPGKE ++ LV AVHG A+E
Sbjct: 991  YILRDIKKEKELGFCMPVSLARARKRPLLQGRRVLITPKTKPGKETISRLVTAVHGQAIE 1050

Query: 142  RIGRSTLKDDRIPDDLLVLSCEEDYAICVPFLEKGAAVYSSELLLNG 2
            R G+S++KDD+IPDDLLVLSCEEDY ICVPFLEKGAAVYSSELLLNG
Sbjct: 1051 RTGKSSMKDDKIPDDLLVLSCEEDYEICVPFLEKGAAVYSSELLLNG 1097


>emb|CDO98931.1| unnamed protein product [Coffea canephora]
          Length = 1158

 Score =  419 bits (1078), Expect = e-114
 Identities = 228/380 (60%), Positives = 268/380 (70%), Gaps = 1/380 (0%)
 Frame = -3

Query: 1138 QEANAQSTARLKRSKTDSISTCMDLSIKRQTRSSTCAGSVLSSLDLPSEQRSLQQSVVRG 959
            QEA+AQ+  R KRSK D  S+ M+    +  R+S   G ++           L      G
Sbjct: 757  QEASAQNITRFKRSKRDVTSSSMNPVENQDERTSVSGGKII-----------LADRTDAG 805

Query: 958  GSGDAALNHNYVDINGNTILKDVDGVQASQQVDRKCDADKTSSAESIGVNVRLDTSPREG 779
             S    L+ N  +I  N +   +     S     K D D + SAE   +N   D SP++ 
Sbjct: 806  SS----LHGNLSNIQENVVKSII-----SNHSGIKIDMDNSRSAEGEIMNGSEDASPKDR 856

Query: 778  CKPSGSECATPVNSRTPINEASPICMGDEYLKQSCRKNLARACLMKEIDSLIGDGLERIS 599
             KP  S   TPV+  TPI+ ASPICMGDEY KQSCRKNL    LM+E++S         +
Sbjct: 857  RKPEASTSTTPVSFTTPISAASPICMGDEYHKQSCRKNLLGLSLMRELNSRTNTTSPLFT 916

Query: 598  A-FKDSRKRRDMANVRALFSHHLDDDIIKQQKKILTRLGASIASSISDATHFITDNFVRT 422
               KD R+RRDM  VRA+FS HLD D +KQQKKIL R GA IASS+S+ATHFITD FVRT
Sbjct: 917  GGVKDLRRRRDMTTVRAMFSRHLDADTVKQQKKILARFGALIASSMSEATHFITDEFVRT 976

Query: 421  RNMLEAIAFGKPVVTHLWLESCGQASCFIDEKNHILRDAKKEKEFGFSLPVSLARACQSP 242
            RNMLEAIAFGKPVVTHLWLESCGQA+CFIDE+N+ILRDA+KEKEFGFS+PVSL+RACQ P
Sbjct: 977  RNMLEAIAFGKPVVTHLWLESCGQANCFIDERNYILRDARKEKEFGFSMPVSLSRACQHP 1036

Query: 241  LLQGQKVLITPNTKPGKEVLASLVKAVHGLAVERIGRSTLKDDRIPDDLLVLSCEEDYAI 62
            LLQG +VLITPNTKPGKE+L SLVKAVHGLAVER+GRS  KD+R+PDD+L+LSCEEDY I
Sbjct: 1037 LLQGLRVLITPNTKPGKEILGSLVKAVHGLAVERLGRSAWKDERLPDDILILSCEEDYEI 1096

Query: 61   CVPFLEKGAAVYSSELLLNG 2
            CVPFLEKGAAVYSSELLLNG
Sbjct: 1097 CVPFLEKGAAVYSSELLLNG 1116


>ref|XP_007035446.1| BRCT domain-containing DNA repair protein, putative isoform 7
            [Theobroma cacao] gi|508714475|gb|EOY06372.1| BRCT
            domain-containing DNA repair protein, putative isoform 7
            [Theobroma cacao]
          Length = 1035

 Score =  419 bits (1076), Expect = e-114
 Identities = 240/437 (54%), Positives = 289/437 (66%), Gaps = 33/437 (7%)
 Frame = -3

Query: 1213 RSHRKLSGQFNITDNQHGSFKQCVGQEANAQSTARLKRSKTDSISTCMDLSIKRQTRSST 1034
            RS  K+      +DN     K+ V QE   QS A LKRS++++ ST +  S +R TRSS 
Sbjct: 559  RSSWKMCVDVGESDNLKAQSKRSVLQEDKGQSIAVLKRSRSNNRSTHIHSSTRRITRSSV 618

Query: 1033 CAGSVLSSLDLPSEQRSLQQSVVRGGSGDAALNHNYVDINGNTILKDVDG---VQASQQV 863
             +  VL   D   E +   QS  + GS D  +N+N  ++NG  +   + G    ++++  
Sbjct: 619  NSRPVLYFSDQNPEGKLSHQSSDKEGSEDDVINYNSTEMNGRMVSTRITGPEPAKSAKHS 678

Query: 862  DRKCDADKTSSAESIGVNVRLDTSPREGCKPSGSECATPVNSRTPINEASPICMGDEYLK 683
            D   DA  +  AES+ VNV LD SP+E  K  GS+C TPVN  TPIN ASP+CMG+EY K
Sbjct: 679  DGNRDAVSSPIAESVAVNVTLDKSPKEKSKSPGSKCTTPVNCPTPINAASPVCMGEEYYK 738

Query: 682  QSCRKNLARACLMKEIDSLIGDGLERISAFKDSRKRRDMANVRALFSHHLDDDIIKQQKK 503
            QSC+KNL+++ L KE+ SL     E IS  KD RKRRD+ANVR LFS+HLD+DIIKQQKK
Sbjct: 739  QSCKKNLSKSSLNKELKSLSPIEPEPISPLKDMRKRRDLANVRVLFSNHLDEDIIKQQKK 798

Query: 502  ILTRLGASIASSISDATHFITDNFVRTRNMLEAIAFGKPVVTHLWLESCGQASCFIDEKN 323
            IL RLG S  SSI DATHFITD FVRTRNMLEAIA GKPVVT+LWLES GQ +  IDE+ 
Sbjct: 799  ILARLGISEVSSILDATHFITDKFVRTRNMLEAIASGKPVVTYLWLESIGQVNIHIDEEA 858

Query: 322  HILRDAKKEKEFGFSLPVSLARACQSPLL-QGQKVLITPNTKPGKEVLASLVKAVHGLAV 146
            +ILRD +KEKE GF +PVSLARA + PLL QG++V ITPNTKPGKE ++ LV AV G AV
Sbjct: 859  YILRDIRKEKELGFCMPVSLARARKRPLLQQGRRVFITPNTKPGKETISHLVTAVGGQAV 918

Query: 145  ERIGRSTLKDDRIPDDLLVLSCEEDYAICVPFLEK------------------------- 41
            ERIGRS  KDD++PDDLLVLSCEEDY ICVPFLEK                         
Sbjct: 919  ERIGRSATKDDKVPDDLLVLSCEEDYVICVPFLEKGYKCFLSYLLACLMKFGLLLESFAA 978

Query: 40   ----GAAVYSSELLLNG 2
                GAAVYSSELLLNG
Sbjct: 979  FMLSGAAVYSSELLLNG 995


>ref|XP_009779332.1| PREDICTED: uncharacterized protein LOC104228553 [Nicotiana
            sylvestris]
          Length = 1165

 Score =  418 bits (1074), Expect = e-114
 Identities = 226/404 (55%), Positives = 277/404 (68%)
 Frame = -3

Query: 1213 RSHRKLSGQFNITDNQHGSFKQCVGQEANAQSTARLKRSKTDSISTCMDLSIKRQTRSST 1034
            RSHRK+                 +GQE   QS  R KR + D  ST +++S KR+  +  
Sbjct: 745  RSHRKMP---------------TMGQETTTQSCRRSKRLRGDQTSTSINVSTKRRKCTPE 789

Query: 1033 CAGSVLSSLDLPSEQRSLQQSVVRGGSGDAALNHNYVDINGNTILKDVDGVQASQQVDRK 854
            C    ++S +  S ++ LQ+ + +      + N  + D +  TIL      ++ +  +RK
Sbjct: 790  CTLPNIASSERGSRKKLLQEGIDKRHLDGNSTNDAFADGSAKTILH-----KSIKDSNRK 844

Query: 853  CDADKTSSAESIGVNVRLDTSPREGCKPSGSECATPVNSRTPINEASPICMGDEYLKQSC 674
             + + T S +        ++S  E CK S S C TP NS+   N  SPICMGDEY KQSC
Sbjct: 845  TNVEITRSVDEAQGT---ESSTGEQCKASASACTTPTNSKIQKNAVSPICMGDEYHKQSC 901

Query: 673  RKNLARACLMKEIDSLIGDGLERISAFKDSRKRRDMANVRALFSHHLDDDIIKQQKKILT 494
            RKN++R+ L++EI SL   G +  S  KDSRKRR+M NVR LFS HLD D  KQQKKIL 
Sbjct: 902  RKNMSRSSLLREITSLHSTGTQIGSTLKDSRKRREMTNVRVLFSQHLDADSTKQQKKILA 961

Query: 493  RLGASIASSISDATHFITDNFVRTRNMLEAIAFGKPVVTHLWLESCGQASCFIDEKNHIL 314
            RLGAS  S +SDATHF+ D FVRTRNMLEAIA GKPVVTHLWLESCGQASC IDEKN+IL
Sbjct: 962  RLGASSVSCMSDATHFVADEFVRTRNMLEAIAVGKPVVTHLWLESCGQASCLIDEKNYIL 1021

Query: 313  RDAKKEKEFGFSLPVSLARACQSPLLQGQKVLITPNTKPGKEVLASLVKAVHGLAVERIG 134
            RDA+KEKEF FS+PVSLARACQ PLL G +V  TPNTKPGK++LASLVKAVHGLAVER+G
Sbjct: 1022 RDARKEKEFCFSMPVSLARACQHPLLLGYRVFTTPNTKPGKDILASLVKAVHGLAVERLG 1081

Query: 133  RSTLKDDRIPDDLLVLSCEEDYAICVPFLEKGAAVYSSELLLNG 2
            RS +K+D +PDDLLVLSCEEDY +C+PFLEKG+ VYSSELLLNG
Sbjct: 1082 RSVMKED-VPDDLLVLSCEEDYEVCIPFLEKGSTVYSSELLLNG 1124


>ref|XP_009369092.1| PREDICTED: uncharacterized protein LOC103958544 isoform X4 [Pyrus x
            bretschneideri]
          Length = 1218

 Score =  417 bits (1071), Expect = e-113
 Identities = 229/404 (56%), Positives = 276/404 (68%)
 Frame = -3

Query: 1213 RSHRKLSGQFNITDNQHGSFKQCVGQEANAQSTARLKRSKTDSISTCMDLSIKRQTRSST 1034
            RS RKLS Q    DN +      V  +     T R  R +  + S  +D+   R+TRS+T
Sbjct: 789  RSRRKLSDQVYGPDNLNDPPTPSVHPDKVGHIT-RHTRLQGAAQSIFVDVKSTRRTRSAT 847

Query: 1033 CAGSVLSSLDLPSEQRSLQQSVVRGGSGDAALNHNYVDINGNTILKDVDGVQASQQVDRK 854
                  +        R L    ++     A L  N    +G  I +   G +A   +DR 
Sbjct: 848  RGDKNCA--------RKLAHQSLKTDPWKAPLRCNSSHKDGIMISEITTGGEAVGILDRM 899

Query: 853  CDADKTSSAESIGVNVRLDTSPREGCKPSGSECATPVNSRTPINEASPICMGDEYLKQSC 674
             DA+ +S+ +        D SP   CKP  S CATPVNS+ P+N+ASP+CMG+EY KQSC
Sbjct: 900  SDANPSSATKM------RDESPLGKCKPLDSACATPVNSKVPVNDASPVCMGNEYFKQSC 953

Query: 673  RKNLARACLMKEIDSLIGDGLERISAFKDSRKRRDMANVRALFSHHLDDDIIKQQKKILT 494
            +K  +R  L+KEI  L  +G    SA KD RKRRDM +VR L+SHHLDD +IK QKKIL 
Sbjct: 954  KKTPSRPSLLKEIRDLSANGHTPTSASKDLRKRRDMTDVRVLYSHHLDDYVIKHQKKILA 1013

Query: 493  RLGASIASSISDATHFITDNFVRTRNMLEAIAFGKPVVTHLWLESCGQASCFIDEKNHIL 314
            RLG S+ASS++DATHFI D FVRTRNMLEAIA GKPVVTHLWL+SCGQASCFIDEKN++L
Sbjct: 1014 RLGVSVASSMTDATHFIADQFVRTRNMLEAIAAGKPVVTHLWLDSCGQASCFIDEKNYVL 1073

Query: 313  RDAKKEKEFGFSLPVSLARACQSPLLQGQKVLITPNTKPGKEVLASLVKAVHGLAVERIG 134
            RD KKEKEFGF++P SL RACQ PLL+G+KV ITPNTKPGKE+++SLVKAVHG A+ERIG
Sbjct: 1074 RDTKKEKEFGFNMPTSLVRACQHPLLEGRKVFITPNTKPGKEIISSLVKAVHGQAIERIG 1133

Query: 133  RSTLKDDRIPDDLLVLSCEEDYAICVPFLEKGAAVYSSELLLNG 2
            RS L+ D+IPDDLLVLSCEEDY ICVP LEKGA VYSSELLLNG
Sbjct: 1134 RSVLEADKIPDDLLVLSCEEDYEICVPLLEKGAPVYSSELLLNG 1177


>ref|XP_009369091.1| PREDICTED: uncharacterized protein LOC103958544 isoform X3 [Pyrus x
            bretschneideri]
          Length = 1221

 Score =  417 bits (1071), Expect = e-113
 Identities = 229/404 (56%), Positives = 276/404 (68%)
 Frame = -3

Query: 1213 RSHRKLSGQFNITDNQHGSFKQCVGQEANAQSTARLKRSKTDSISTCMDLSIKRQTRSST 1034
            RS RKLS Q    DN +      V  +     T R  R +  + S  +D+   R+TRS+T
Sbjct: 792  RSRRKLSDQVYGPDNLNDPPTPSVHPDKVGHIT-RHTRLQGAAQSIFVDVKSTRRTRSAT 850

Query: 1033 CAGSVLSSLDLPSEQRSLQQSVVRGGSGDAALNHNYVDINGNTILKDVDGVQASQQVDRK 854
                  +        R L    ++     A L  N    +G  I +   G +A   +DR 
Sbjct: 851  RGDKNCA--------RKLAHQSLKTDPWKAPLRCNSSHKDGIMISEITTGGEAVGILDRM 902

Query: 853  CDADKTSSAESIGVNVRLDTSPREGCKPSGSECATPVNSRTPINEASPICMGDEYLKQSC 674
             DA+ +S+ +        D SP   CKP  S CATPVNS+ P+N+ASP+CMG+EY KQSC
Sbjct: 903  SDANPSSATKM------RDESPLGKCKPLDSACATPVNSKVPVNDASPVCMGNEYFKQSC 956

Query: 673  RKNLARACLMKEIDSLIGDGLERISAFKDSRKRRDMANVRALFSHHLDDDIIKQQKKILT 494
            +K  +R  L+KEI  L  +G    SA KD RKRRDM +VR L+SHHLDD +IK QKKIL 
Sbjct: 957  KKTPSRPSLLKEIRDLSANGHTPTSASKDLRKRRDMTDVRVLYSHHLDDYVIKHQKKILA 1016

Query: 493  RLGASIASSISDATHFITDNFVRTRNMLEAIAFGKPVVTHLWLESCGQASCFIDEKNHIL 314
            RLG S+ASS++DATHFI D FVRTRNMLEAIA GKPVVTHLWL+SCGQASCFIDEKN++L
Sbjct: 1017 RLGVSVASSMTDATHFIADQFVRTRNMLEAIAAGKPVVTHLWLDSCGQASCFIDEKNYVL 1076

Query: 313  RDAKKEKEFGFSLPVSLARACQSPLLQGQKVLITPNTKPGKEVLASLVKAVHGLAVERIG 134
            RD KKEKEFGF++P SL RACQ PLL+G+KV ITPNTKPGKE+++SLVKAVHG A+ERIG
Sbjct: 1077 RDTKKEKEFGFNMPTSLVRACQHPLLEGRKVFITPNTKPGKEIISSLVKAVHGQAIERIG 1136

Query: 133  RSTLKDDRIPDDLLVLSCEEDYAICVPFLEKGAAVYSSELLLNG 2
            RS L+ D+IPDDLLVLSCEEDY ICVP LEKGA VYSSELLLNG
Sbjct: 1137 RSVLEADKIPDDLLVLSCEEDYEICVPLLEKGAPVYSSELLLNG 1180


>ref|XP_009369089.1| PREDICTED: uncharacterized protein LOC103958544 isoform X1 [Pyrus x
            bretschneideri]
          Length = 1247

 Score =  417 bits (1071), Expect = e-113
 Identities = 229/404 (56%), Positives = 276/404 (68%)
 Frame = -3

Query: 1213 RSHRKLSGQFNITDNQHGSFKQCVGQEANAQSTARLKRSKTDSISTCMDLSIKRQTRSST 1034
            RS RKLS Q    DN +      V  +     T R  R +  + S  +D+   R+TRS+T
Sbjct: 818  RSRRKLSDQVYGPDNLNDPPTPSVHPDKVGHIT-RHTRLQGAAQSIFVDVKSTRRTRSAT 876

Query: 1033 CAGSVLSSLDLPSEQRSLQQSVVRGGSGDAALNHNYVDINGNTILKDVDGVQASQQVDRK 854
                  +        R L    ++     A L  N    +G  I +   G +A   +DR 
Sbjct: 877  RGDKNCA--------RKLAHQSLKTDPWKAPLRCNSSHKDGIMISEITTGGEAVGILDRM 928

Query: 853  CDADKTSSAESIGVNVRLDTSPREGCKPSGSECATPVNSRTPINEASPICMGDEYLKQSC 674
             DA+ +S+ +        D SP   CKP  S CATPVNS+ P+N+ASP+CMG+EY KQSC
Sbjct: 929  SDANPSSATKM------RDESPLGKCKPLDSACATPVNSKVPVNDASPVCMGNEYFKQSC 982

Query: 673  RKNLARACLMKEIDSLIGDGLERISAFKDSRKRRDMANVRALFSHHLDDDIIKQQKKILT 494
            +K  +R  L+KEI  L  +G    SA KD RKRRDM +VR L+SHHLDD +IK QKKIL 
Sbjct: 983  KKTPSRPSLLKEIRDLSANGHTPTSASKDLRKRRDMTDVRVLYSHHLDDYVIKHQKKILA 1042

Query: 493  RLGASIASSISDATHFITDNFVRTRNMLEAIAFGKPVVTHLWLESCGQASCFIDEKNHIL 314
            RLG S+ASS++DATHFI D FVRTRNMLEAIA GKPVVTHLWL+SCGQASCFIDEKN++L
Sbjct: 1043 RLGVSVASSMTDATHFIADQFVRTRNMLEAIAAGKPVVTHLWLDSCGQASCFIDEKNYVL 1102

Query: 313  RDAKKEKEFGFSLPVSLARACQSPLLQGQKVLITPNTKPGKEVLASLVKAVHGLAVERIG 134
            RD KKEKEFGF++P SL RACQ PLL+G+KV ITPNTKPGKE+++SLVKAVHG A+ERIG
Sbjct: 1103 RDTKKEKEFGFNMPTSLVRACQHPLLEGRKVFITPNTKPGKEIISSLVKAVHGQAIERIG 1162

Query: 133  RSTLKDDRIPDDLLVLSCEEDYAICVPFLEKGAAVYSSELLLNG 2
            RS L+ D+IPDDLLVLSCEEDY ICVP LEKGA VYSSELLLNG
Sbjct: 1163 RSVLEADKIPDDLLVLSCEEDYEICVPLLEKGAPVYSSELLLNG 1206


>ref|XP_007227074.1| hypothetical protein PRUPE_ppa000432mg [Prunus persica]
            gi|462424010|gb|EMJ28273.1| hypothetical protein
            PRUPE_ppa000432mg [Prunus persica]
          Length = 1188

 Score =  415 bits (1067), Expect = e-113
 Identities = 219/400 (54%), Positives = 274/400 (68%)
 Frame = -3

Query: 1213 RSHRKLSGQFNITDNQHGSFKQCVGQEANAQSTARLKRSKTDSISTCMDLSIKRQTRSST 1034
            RS R +S Q    +N  G  +  V  +   Q     KR ++ + + C D+ + R+TRSST
Sbjct: 796  RSRRNMSIQVYGPNNSDGPSEPSVQADKIGQRVNSHKRLQSGAKNICNDIKLTRRTRSST 855

Query: 1033 CAGSVLSSLDLPSEQRSLQQSVVRGGSGDAALNHNYVDINGNTILKDVDGVQASQQVDRK 854
            C    L             + +++GG G+A L+ N    +G  I + + G +     DRK
Sbjct: 856  CGDQNLDG--------KFAREILKGGPGEAPLHCNSSHKDGRMISEIITGKRVVGISDRK 907

Query: 853  CDADKTSSAESIGVNVRLDTSPREGCKPSGSECATPVNSRTPINEASPICMGDEYLKQSC 674
             DA+ +S+ +        D  PRE CKPS S C TPVN++ P+N ASP+CMG+EY KQ+C
Sbjct: 908  SDANCSSATKMS------DEFPRENCKPSDSSCTTPVNNKVPVNAASPVCMGNEYFKQTC 961

Query: 673  RKNLARACLMKEIDSLIGDGLERISAFKDSRKRRDMANVRALFSHHLDDDIIKQQKKILT 494
            ++ L  + L+KEI  L     E  S   + RKRRDM +VR L+SHHLD+DIIK+QKKIL 
Sbjct: 962  KRRLLGSSLLKEIRGLSATVCEPTST-PELRKRRDMTDVRVLYSHHLDEDIIKKQKKILA 1020

Query: 493  RLGASIASSISDATHFITDNFVRTRNMLEAIAFGKPVVTHLWLESCGQASCFIDEKNHIL 314
            RLG S+A S++DATHFI D FVRTRNMLEAIAFGKPVVTHLWLESCGQA CF+DEK+HIL
Sbjct: 1021 RLGVSVALSMTDATHFIADQFVRTRNMLEAIAFGKPVVTHLWLESCGQAGCFVDEKSHIL 1080

Query: 313  RDAKKEKEFGFSLPVSLARACQSPLLQGQKVLITPNTKPGKEVLASLVKAVHGLAVERIG 134
            RD KKEKEFGFS+P SLARACQ PLLQ +KV ITPNTKPGKE++++LVKAV G AVERIG
Sbjct: 1081 RDNKKEKEFGFSMPASLARACQHPLLQDRKVFITPNTKPGKEIISNLVKAVKGQAVERIG 1140

Query: 133  RSTLKDDRIPDDLLVLSCEEDYAICVPFLEKGAAVYSSEL 14
            RSTL  D+IPDDLLVLSCEEDY ICVP LEKG + +  +L
Sbjct: 1141 RSTLNADKIPDDLLVLSCEEDYEICVPLLEKGISSFPIKL 1180


>ref|XP_010321010.1| PREDICTED: uncharacterized protein LOC101247749 isoform X1 [Solanum
            lycopersicum]
          Length = 1169

 Score =  414 bits (1064), Expect = e-113
 Identities = 229/405 (56%), Positives = 276/405 (68%), Gaps = 1/405 (0%)
 Frame = -3

Query: 1213 RSHRKLSGQFNITDNQHGSFKQCVGQEANAQSTARLKRSKTDSISTCMDLSIKRQTRSST 1034
            RSHRK+                 +GQEA  Q   R KR   D  ST +D+S K++  S  
Sbjct: 749  RSHRKIPA---------------MGQEATTQPCRRSKRLSGDQTSTSIDVSAKKRKCSPE 793

Query: 1033 CAGSVLSSLDLPSEQRSLQQSVVRGGSGDAALNHNYVDINGNTILKDVDGVQASQQVDRK 854
                + SS    S ++   + + +G      ++  + D N   +       ++ +  + K
Sbjct: 794  TPSGIASS-GRGSRKKLSNEGINKGHPEGTNISDAFADGNTKALR-----YKSPEDSNMK 847

Query: 853  CD-ADKTSSAESIGVNVRLDTSPREGCKPSGSECATPVNSRTPINEASPICMGDEYLKQS 677
             D A K S  E+ GV    ++   + CK   S C TP NS+   +  SPICMGDEY KQS
Sbjct: 848  ADVATKQSVDEAHGV----ESLTGDQCKAPASACTTPTNSKILKSSVSPICMGDEYQKQS 903

Query: 676  CRKNLARACLMKEIDSLIGDGLERISAFKDSRKRRDMANVRALFSHHLDDDIIKQQKKIL 497
            CRKN +R+ LM+EI SL   G +  S  KDSRKRR+M NVR LFS HLD DIIKQQKKI+
Sbjct: 904  CRKNTSRSSLMREIISLHTTGTQVDSTLKDSRKRREMTNVRILFSQHLDPDIIKQQKKII 963

Query: 496  TRLGASIASSISDATHFITDNFVRTRNMLEAIAFGKPVVTHLWLESCGQASCFIDEKNHI 317
             RLGAS ASS+SDATHF+ D FVRTRNMLEAIA GKPVVTHLWLESCGQASC IDEKN+I
Sbjct: 964  ARLGASSASSMSDATHFMADEFVRTRNMLEAIAAGKPVVTHLWLESCGQASCLIDEKNYI 1023

Query: 316  LRDAKKEKEFGFSLPVSLARACQSPLLQGQKVLITPNTKPGKEVLASLVKAVHGLAVERI 137
            LRDA+KEKEFGFS+PVSLARACQ P+LQG KV ITPNTKPGKE+LASLVKAVHGLAVER+
Sbjct: 1024 LRDARKEKEFGFSMPVSLARACQHPILQGYKVFITPNTKPGKEILASLVKAVHGLAVERL 1083

Query: 136  GRSTLKDDRIPDDLLVLSCEEDYAICVPFLEKGAAVYSSELLLNG 2
             RS +K++ IPD+LLVLSCEEDY +C+PFLEKG+ VYSSELLLNG
Sbjct: 1084 CRSAMKEEVIPDNLLVLSCEEDYEVCIPFLEKGSTVYSSELLLNG 1128


>gb|KHG13951.1| PAX-interacting 1 [Gossypium arboreum]
          Length = 1134

 Score =  412 bits (1060), Expect = e-112
 Identities = 227/407 (55%), Positives = 280/407 (68%), Gaps = 3/407 (0%)
 Frame = -3

Query: 1213 RSHRKLSGQFNITDNQHGSFKQCVGQEANAQSTARLKRSKTDSISTCMDLSIKRQTRSST 1034
            RS RK+      +D      ++    + N + TA  K S+ ++ STC+  S +R  RSS 
Sbjct: 689  RSSRKVPVGLGESDKMEAQPRKPAQPDDNGKPTAMQKTSRGNNRSTCIPSSTRRTARSSV 748

Query: 1033 CAGSVLSSLDLPSEQRSLQQSVVRGGSGDAALNHNYVDINGNTILKDVDGVQASQQVDR- 857
                +    D  SE +   QS+ + GS    LN N  D NG  I K   G +A++ +   
Sbjct: 749  NTCPLPYFSDQNSEGKLSHQSLDKQGSDADELNCNLSDKNGRMISKRKIGPKAAKAITHA 808

Query: 856  --KCDADKTSSAESIGVNVRLDTSPREGCKPSGSECATPVNSRTPINEASPICMGDEYLK 683
                 A   S+AE++ VNV  + SP+E  +  GS C TP N  TPIN ASP+CMG+EY K
Sbjct: 809  GGNPGAISLSNAENLTVNVDSEKSPKEKSRSPGSLCTTPTNHLTPINAASPVCMGEEYYK 868

Query: 682  QSCRKNLARACLMKEIDSLIGDGLERISAFKDSRKRRDMANVRALFSHHLDDDIIKQQKK 503
            QSC+KNL +A L+KE+ SL  +  E IS  KD RKRR++A+VR LFS+HLD+DI+KQQKK
Sbjct: 869  QSCKKNLLKASLIKELRSLCPNEAEPISPLKDMRKRRNLADVRVLFSNHLDEDILKQQKK 928

Query: 502  ILTRLGASIASSISDATHFITDNFVRTRNMLEAIAFGKPVVTHLWLESCGQASCFIDEKN 323
            IL RLG   AS+I DATHFITD FVRTRNMLEAIA GK VV+HLWLES GQ +  IDE+ 
Sbjct: 929  ILARLGIHEASTILDATHFITDKFVRTRNMLEAIASGKSVVSHLWLESIGQVNIHIDEEA 988

Query: 322  HILRDAKKEKEFGFSLPVSLARACQSPLLQGQKVLITPNTKPGKEVLASLVKAVHGLAVE 143
            +ILRD KKEKE GF +PVSLARA + PLLQG++VLITP TKPGKE ++ LV AVHG A+E
Sbjct: 989  YILRDIKKEKELGFCMPVSLARARKRPLLQGRRVLITPKTKPGKETISRLVTAVHGQAIE 1048

Query: 142  RIGRSTLKDDRIPDDLLVLSCEEDYAICVPFLEKGAAVYSSELLLNG 2
            R GRS++KDD+IPDDLLVLSCEEDY ICVPFLEKG AVYSSELLLNG
Sbjct: 1049 RTGRSSMKDDKIPDDLLVLSCEEDYEICVPFLEKGTAVYSSELLLNG 1095

