BLASTX nr result

ID: Atropa21_contig00035918 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Atropa21_contig00035918
         (804 letters)

Database: nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006350160.1| PREDICTED: uncharacterized protein LOC102591...   250   1e-66
ref|XP_004231725.1| PREDICTED: uncharacterized protein LOC101244...   231   1e-60
ref|XP_006481108.1| PREDICTED: uncharacterized protein LOC102628...   106   1e-20
ref|XP_006429488.1| hypothetical protein CICLE_v10013403mg [Citr...   106   1e-20
ref|XP_002320595.2| hypothetical protein POPTR_0014s19050g [Popu...   101   8e-20
gb|AAC19315.1| contains similarity to breast cancer susceptibili...   101   3e-19
ref|XP_002264351.2| PREDICTED: uncharacterized protein LOC100241...   101   3e-19
ref|NP_001154192.1| breast cancer protein 2 like 2A [Arabidopsis...   101   3e-19
emb|CAN83105.1| hypothetical protein VITISV_007645 [Vitis vinifera]   101   3e-19
ref|NP_191913.3| breast cancer protein 2 like 2A [Arabidopsis th...   101   3e-19
gb|EOY07083.1| BREAST CANCER 2 like 2A, putative isoform 1 [Theo...   100   5e-19
gb|EOY07085.1| BREAST CANCER 2 like 2A, putative isoform 3 [Theo...   100   6e-19
gb|EOY07084.1| BRCA2-like B, putative isoform 2 [Theobroma cacao]     100   6e-19
emb|CBI18109.3| unnamed protein product [Vitis vinifera]               99   1e-18
ref|NP_195783.3| protein BRCA2-like B [Arabidopsis thaliana] gi|...    99   2e-18
emb|CAB82279.1| putative protein [Arabidopsis thaliana]                99   2e-18
ref|XP_002870909.1| hypothetical protein ARALYDRAFT_486909 [Arab...    93   1e-16
ref|XP_004156673.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri...    91   6e-16
ref|XP_004137896.1| PREDICTED: uncharacterized protein LOC101215...    91   6e-16
gb|EXB46338.1| Breast cancer type 2 susceptibility-like protein ...    88   3e-15

>ref|XP_006350160.1| PREDICTED: uncharacterized protein LOC102591010 [Solanum tuberosum]
          Length = 1126

 Score =  250 bits (638), Expect(2) = 1e-66
 Identities = 134/197 (68%), Positives = 150/197 (76%), Gaps = 10/197 (5%)
 Frame = -2

Query: 563 FQFFSTGSGKPVPLKQSSLSRARSILRDA---VFDTGQ*TGKENGFGFEKAVFQKGSGKT 393
           F  F TGSGKPV +K SS+S A SIL D    + DTG  TG+++   F++AVFQKGSGK 
Sbjct: 65  FPIFRTGSGKPVSVKHSSISTALSILGDEDKPILDTGIGTGRQDVLAFQEAVFQKGSGKA 124

Query: 392 LNAPESFCPAGLEKRFNMSNSLFQTGSGKAVNISSAGLNRAKALLGLEENGDHETFPASE 213
           LNAP+SF P+ L K+F+MSNSLFQTGSGK VNISS GLNRAKALLGL+ENGDHETFP S 
Sbjct: 125 LNAPQSFSPSSLNKQFSMSNSLFQTGSGKPVNISSTGLNRAKALLGLDENGDHETFPGSG 184

Query: 212 KKNQNPLPFVAVKGIANTGSTNVSEASFSPFD---NSSVCPAEE----FLNCADKPPPIK 54
           KKN         +GIA+TGSTNVS AS SPFD   NS VCPAEE    FL+CADKPPPIK
Sbjct: 185 KKNTTSDELFGFQGIASTGSTNVSAASLSPFDVKFNSPVCPAEELVADFLHCADKPPPIK 244

Query: 53  FHIAGGRSILVSCEALK 3
           FH AGGRSI VSCEALK
Sbjct: 245 FHTAGGRSITVSCEALK 261



 Score = 30.4 bits (67), Expect(2) = 1e-66
 Identities = 25/79 (31%), Positives = 37/79 (46%), Gaps = 6/79 (7%)
 Frame = -1

Query: 771 MPTWPLYSVSGNDLVWRECNNERIELC---PSSSNPCHISCGKVPYLPTCL*SYSL*HTS 601
           M TW LYSVS +D  W+  + E +      PS + P      ++  +P  L         
Sbjct: 1   MSTWQLYSVSVSDFRWKVSDGESLTEALEEPSLTLPPQ----QLQSMPDLL-------RQ 49

Query: 600 GSLRLVENSDA---KFPVF 553
           GS RL  N+D+   +FP+F
Sbjct: 50  GSSRLAGNTDSTSTQFPIF 68


>ref|XP_004231725.1| PREDICTED: uncharacterized protein LOC101244820 [Solanum
           lycopersicum]
          Length = 1131

 Score =  231 bits (589), Expect(2) = 1e-60
 Identities = 128/198 (64%), Positives = 147/198 (74%), Gaps = 19/198 (9%)
 Frame = -2

Query: 539 GKPVPLKQSSLSRARSILRDA---VFDTGQ*TGKENGFGFEKAVFQKGSGKTLNAPESFC 369
           GKPV +K SS+S A SIL D    + DTG  TG+++   F++AVFQKGSG+ LNAP+SF 
Sbjct: 66  GKPVSVKHSSISTALSILDDEDKPILDTGIGTGRQDVLTFQEAVFQKGSGEPLNAPQSFS 125

Query: 368 PAGLEKRFNMSNSLFQTGSGKAVNISSAGLNRAKALLGLEENGDHETFPASEKKN----- 204
           P+ L K+F+MSNSLFQT SGK VNIS  GLN+AKALLGLEENGDHETFP S KKN     
Sbjct: 126 PSSLNKQFSMSNSLFQTASGKPVNISCTGLNKAKALLGLEENGDHETFPGSGKKNTTPDE 185

Query: 203 ----QNPLPFVAVKGIANTGSTNVSEASFSPFD---NSSVCPAEE----FLNCADKPPPI 57
               +N  P V V+GIA+TGSTNVS AS SPFD   NS+VCPAEE    FL+ A KPPPI
Sbjct: 186 LFGFRNSFPIVEVEGIASTGSTNVSAASLSPFDVKFNSTVCPAEELVADFLHSAGKPPPI 245

Query: 56  KFHIAGGRSILVSCEALK 3
           KFH AGGRSI VSCEALK
Sbjct: 246 KFHTAGGRSITVSCEALK 263



 Score = 29.3 bits (64), Expect(2) = 1e-60
 Identities = 24/76 (31%), Positives = 36/76 (47%), Gaps = 3/76 (3%)
 Frame = -1

Query: 771 MPTWPLYSVSGNDLVWRECNNERIELCPSSSNPCHISCGKVPYLPTCL*SYSL*HTSGSL 592
           M  W LYSVS ND  W+  + E +   PS + P      ++  +P  L         G+ 
Sbjct: 1   MSMWQLYSVSVNDFRWK-VSGESLTEEPSLTLPPQ----QLQSIPDLL-------RQGTS 48

Query: 591 RLVENSDA---KFPVF 553
           RL  N+D+   +FP+F
Sbjct: 49  RLAGNTDSTSTRFPIF 64


>ref|XP_006481108.1| PREDICTED: uncharacterized protein LOC102628548 [Citrus sinensis]
          Length = 1112

 Score =  106 bits (264), Expect = 1e-20
 Identities = 83/202 (41%), Positives = 101/202 (50%), Gaps = 18/202 (8%)
 Frame = -2

Query: 554 FSTGSGKPVPLKQSSLSRARSILRD----AVFDTGQ*TGKENGFGFEKAVFQKGSGKTLN 387
           F TGSGK VPLKQSS+ +A S+L       +   G+   +ENGFGF              
Sbjct: 71  FKTGSGKVVPLKQSSIEKALSVLGTDNDCGISFAGEEHPRENGFGF-------------- 116

Query: 386 APESFCPAGLEKRFNMSNSLFQTGSGKAVNISSAGLNRAKALLGLEENGDHETFPASEKK 207
                           SNSLFQTGSGK VNISSAGL RAK+LLGLEE  +  +F   +  
Sbjct: 117 ----------------SNSLFQTGSGKTVNISSAGLVRAKSLLGLEEGRNDWSFEGLQHT 160

Query: 206 NQNPLPFVAVK-GIANT---------GSTNVSEASF--SPFDN--SSVCPAEEFLNCADK 69
                P   VK G+              +++S+A F  S F N  SS     E LN A K
Sbjct: 161 RMTSTPRFEVKEGVKGNVFESDTSVLRPSSISKAGFAESRFKNKISSNMMQTEGLNSAPK 220

Query: 68  PPPIKFHIAGGRSILVSCEALK 3
           PP IKF  AGGRS+ VS +AL+
Sbjct: 221 PPQIKFQTAGGRSLSVSSDALQ 242


>ref|XP_006429488.1| hypothetical protein CICLE_v10013403mg [Citrus clementina]
           gi|557531545|gb|ESR42728.1| hypothetical protein
           CICLE_v10013403mg [Citrus clementina]
          Length = 1112

 Score =  106 bits (264), Expect = 1e-20
 Identities = 83/202 (41%), Positives = 101/202 (50%), Gaps = 18/202 (8%)
 Frame = -2

Query: 554 FSTGSGKPVPLKQSSLSRARSILRD----AVFDTGQ*TGKENGFGFEKAVFQKGSGKTLN 387
           F TGSGK VPLKQSS+ +A S+L       +   G+   +ENGFGF              
Sbjct: 71  FKTGSGKVVPLKQSSIEKALSVLGTDNDCGISFAGEEHPRENGFGF-------------- 116

Query: 386 APESFCPAGLEKRFNMSNSLFQTGSGKAVNISSAGLNRAKALLGLEENGDHETFPASEKK 207
                           SNSLFQTGSGK VNISSAGL RAK+LLGLEE  +  +F   +  
Sbjct: 117 ----------------SNSLFQTGSGKTVNISSAGLVRAKSLLGLEEGRNDWSFEGLQHT 160

Query: 206 NQNPLPFVAVK-GIANT---------GSTNVSEASF--SPFDN--SSVCPAEEFLNCADK 69
                P   VK G+              +++S+A F  S F N  SS     E LN A K
Sbjct: 161 RMTSTPRFEVKEGVKGNVFESDTSVLRPSSISKAGFAESRFKNKISSNMMQTEGLNSAPK 220

Query: 68  PPPIKFHIAGGRSILVSCEALK 3
           PP IKF  AGGRS+ VS +AL+
Sbjct: 221 PPQIKFQTAGGRSLSVSTDALQ 242


>ref|XP_002320595.2| hypothetical protein POPTR_0014s19050g [Populus trichocarpa]
           gi|550324536|gb|EEE98910.2| hypothetical protein
           POPTR_0014s19050g [Populus trichocarpa]
          Length = 1186

 Score =  101 bits (251), Expect(2) = 8e-20
 Identities = 87/245 (35%), Positives = 114/245 (46%), Gaps = 57/245 (23%)
 Frame = -2

Query: 566 NFQFFSTGSGKPVPLKQSSLSRARSILRDAVFDTGQ*TGKENGFGFEKA----------- 420
           N   F TGSGK V LKQSS+++A S+LRD   D G+  G EN   F K            
Sbjct: 64  NAPIFRTGSGKSVALKQSSIAKALSVLRDDD-DAGEACGGENELSFSKLRKKGNEDNGNA 122

Query: 419 -VFQKGSGKTLNAPESFC----------------PAGLEKRFN---MSNSLFQTGSGKAV 300
            +F  GSGK++   +S                  P  +  R N    SNSLF TGSGK+V
Sbjct: 123 PIFHTGSGKSVVLKQSSIAKALSVLGDDDGYSGNPGEVHGRNNERCFSNSLFHTGSGKSV 182

Query: 299 NISSAGLNRAKALLGLEE---NGDHETFPASEKKN--------QNPLPFVAVKGIANTG- 156
           +ISSAGL RAK LLG+EE   + + + F    K +        Q+ +       + N G 
Sbjct: 183 DISSAGLVRAKRLLGMEEENYSSNFQGFKCPRKSSTVNEQFGWQDVMHSGTKVSMKNNGV 242

Query: 155 --------------STNVSEASFSPFDNSSVCPAEEFLNCADKPPPIKFHIAGGRSILVS 18
                          T + E+  +   N+++   E       KPPPIKFH AGGRS+ VS
Sbjct: 243 IGDDLPAPRSSLVSKTVILESELTKEVNTNLLEPE-----IQKPPPIKFHTAGGRSLSVS 297

Query: 17  CEALK 3
            EALK
Sbjct: 298 SEALK 302



 Score = 22.7 bits (47), Expect(2) = 8e-20
 Identities = 23/83 (27%), Positives = 35/83 (42%), Gaps = 10/83 (12%)
 Frame = -1

Query: 771 MPTWPLYSVSGNDLVWRECNNERIE----------LCPSSSNPCHISCGKVPYLPTCL*S 622
           M +W ++S SGN+  W E   + I           L P SS+  H+     P +   L  
Sbjct: 1   MSSWKIFSDSGNNFRW-EVTGQIIHTKPEPKQSGALIPPSSSKTHL-----PSMADLL-- 52

Query: 621 YSL*HTSGSLRLVENSDAKFPVF 553
                  G  +L+EN +A  P+F
Sbjct: 53  -----LQGCPKLLENGNA--PIF 68


>gb|AAC19315.1| contains similarity to breast cancer susceptibility (Brca2)
           [Arabidopsis thaliana] gi|7267089|emb|CAB80760.1|
           putative BRCA2 homolog [Arabidopsis thaliana]
          Length = 765

 Score =  101 bits (251), Expect = 3e-19
 Identities = 91/247 (36%), Positives = 124/247 (50%), Gaps = 6/247 (2%)
 Frame = -2

Query: 725 GESVIMKESSFALAAPIHAISLAARYLTYLLVCEVTHFDIHQEV*D*SRIVMPNFQFFST 546
           G+SV++KESS A A      S+ A  +TY  +   T+  I Q     +   +P F+   T
Sbjct: 75  GKSVVLKESSIAKAK-----SILAEKVTYSDLRN-TNCSIPQMRQVDTAETLPMFR---T 125

Query: 545 GSGKPVPLKQSSLSRARSIL-RDAVFDTGQ*TGKENGFGFEKAVFQKGSGKTLNAPESFC 369
            SGK VPLK+SS+++A SIL  D + D+     +E+GFG                     
Sbjct: 126 ASGKSVPLKESSIAKAMSILGSDKIIDSDNVLPRESGFG--------------------- 164

Query: 368 PAGLEKRFNMSNSLFQTGSGKAVNISSAGLNRAKALLGLEE---NGDHETFPASEKKNQN 198
                    +SNSLFQT S K VN+SSAGL RAKALLGLEE   NG +    +S    Q+
Sbjct: 165 ---------VSNSLFQTASNKKVNVSSAGLARAKALLGLEEDDLNGFNHVNQSSSSSQQH 215

Query: 197 PLPFVAVKGIANTGSTNVSEASFSP--FDNSSVCPAEEFLNCADKPPPIKFHIAGGRSIL 24
              +  +K      +T V   S +P  +++       E LN + K PP KF  AGG+S+ 
Sbjct: 216 --GWSGLKTHEEFDATVVKHHSGTPGQYEDYVSGKRSEVLNPSLKVPPTKFQTAGGKSLS 273

Query: 23  VSCEALK 3
           VS EALK
Sbjct: 274 VSAEALK 280


>ref|XP_002264351.2| PREDICTED: uncharacterized protein LOC100241398 [Vitis vinifera]
          Length = 1126

 Score =  101 bits (251), Expect = 3e-19
 Identities = 78/205 (38%), Positives = 94/205 (45%), Gaps = 21/205 (10%)
 Frame = -2

Query: 554 FSTGSGKPVPLKQSSLSRARSILRDAVFDTG-Q*TGKENGFGFEKAVFQKGSGKTLNAPE 378
           F TG GK V +KQSS+++A S+L D  F  G Q   ++NG GF                 
Sbjct: 73  FRTGLGKSVTVKQSSIAKALSVLGDDDFGAGGQDHDRDNGCGF----------------- 115

Query: 377 SFCPAGLEKRFNMSNSLFQTGSGKAVNISSAGLNRAKALLGLEENGDHETFPASEKKNQ- 201
                        SNSLFQTGSGK VNISSAGL RAK LLGLEEN +H +      K   
Sbjct: 116 -------------SNSLFQTGSGKMVNISSAGLVRAKTLLGLEENSNHHSCQEHITKQSV 162

Query: 200 -------------------NPLPFVAVKGIANTGSTNVSEASFSPFDNSSVCPAEEFLNC 78
                              N +     K +    ST+ S  + S  +        E  N 
Sbjct: 163 MDGLDGGQNSSCLEMQEDLNSIKSEDAKPVPRPFSTSTSWRTESINEAVPHLKQSEMYNP 222

Query: 77  ADKPPPIKFHIAGGRSILVSCEALK 3
           A  PPPIKFH AGGRSI VS +AL+
Sbjct: 223 APNPPPIKFHTAGGRSISVSSDALQ 247


>ref|NP_001154192.1| breast cancer protein 2 like 2A [Arabidopsis thaliana]
           gi|332656414|gb|AEE81814.1| breast cancer protein 2 like
           2A [Arabidopsis thaliana]
          Length = 1187

 Score =  101 bits (251), Expect = 3e-19
 Identities = 91/247 (36%), Positives = 124/247 (50%), Gaps = 6/247 (2%)
 Frame = -2

Query: 725 GESVIMKESSFALAAPIHAISLAARYLTYLLVCEVTHFDIHQEV*D*SRIVMPNFQFFST 546
           G+SV++KESS A A      S+ A  +TY  +   T+  I Q     +   +P F+   T
Sbjct: 75  GKSVVLKESSIAKAK-----SILAEKVTYSDLRN-TNCSIPQMRQVDTAETLPMFR---T 125

Query: 545 GSGKPVPLKQSSLSRARSIL-RDAVFDTGQ*TGKENGFGFEKAVFQKGSGKTLNAPESFC 369
            SGK VPLK+SS+++A SIL  D + D+     +E+GFG                     
Sbjct: 126 ASGKSVPLKESSIAKAMSILGSDKIIDSDNVLPRESGFG--------------------- 164

Query: 368 PAGLEKRFNMSNSLFQTGSGKAVNISSAGLNRAKALLGLEE---NGDHETFPASEKKNQN 198
                    +SNSLFQT S K VN+SSAGL RAKALLGLEE   NG +    +S    Q+
Sbjct: 165 ---------VSNSLFQTASNKKVNVSSAGLARAKALLGLEEDDLNGFNHVNQSSSSSQQH 215

Query: 197 PLPFVAVKGIANTGSTNVSEASFSP--FDNSSVCPAEEFLNCADKPPPIKFHIAGGRSIL 24
              +  +K      +T V   S +P  +++       E LN + K PP KF  AGG+S+ 
Sbjct: 216 --GWSGLKTHEEFDATVVKHHSGTPGQYEDYVSGKRSEVLNPSLKVPPTKFQTAGGKSLS 273

Query: 23  VSCEALK 3
           VS EALK
Sbjct: 274 VSAEALK 280


>emb|CAN83105.1| hypothetical protein VITISV_007645 [Vitis vinifera]
          Length = 288

 Score =  101 bits (251), Expect = 3e-19
 Identities = 78/205 (38%), Positives = 94/205 (45%), Gaps = 21/205 (10%)
 Frame = -2

Query: 554 FSTGSGKPVPLKQSSLSRARSILRDAVFDTG-Q*TGKENGFGFEKAVFQKGSGKTLNAPE 378
           F TG GK V +KQSS+++A S+L D  F  G Q   ++NG GF                 
Sbjct: 73  FRTGLGKSVTVKQSSIAKALSVLGDDDFGAGGQDHDRDNGCGF----------------- 115

Query: 377 SFCPAGLEKRFNMSNSLFQTGSGKAVNISSAGLNRAKALLGLEENGDHETFPASEKKNQ- 201
                        SNSLFQTGSGK VNISSAGL RAK LLGLEEN +H +      K   
Sbjct: 116 -------------SNSLFQTGSGKMVNISSAGLVRAKTLLGLEENSNHHSCQEHITKQSV 162

Query: 200 -------------------NPLPFVAVKGIANTGSTNVSEASFSPFDNSSVCPAEEFLNC 78
                              N +     K +    ST+ S  + S  +        E  N 
Sbjct: 163 MDGLDGGQNSSCLEMQEDLNSIKSEDAKPVPRPFSTSTSWRTESINEAVPHLKQSEMYNP 222

Query: 77  ADKPPPIKFHIAGGRSILVSCEALK 3
           A  PPPIKFH AGGRSI VS +AL+
Sbjct: 223 APNPPPIKFHTAGGRSISVSSDALQ 247


>ref|NP_191913.3| breast cancer protein 2 like 2A [Arabidopsis thaliana]
           gi|31335360|emb|CAD32571.1| breast cancer susceptibility
           protein 2a [Arabidopsis thaliana]
           gi|332656413|gb|AEE81813.1| breast cancer protein 2 like
           2A [Arabidopsis thaliana]
          Length = 1151

 Score =  101 bits (251), Expect = 3e-19
 Identities = 91/247 (36%), Positives = 124/247 (50%), Gaps = 6/247 (2%)
 Frame = -2

Query: 725 GESVIMKESSFALAAPIHAISLAARYLTYLLVCEVTHFDIHQEV*D*SRIVMPNFQFFST 546
           G+SV++KESS A A      S+ A  +TY  +   T+  I Q     +   +P F+   T
Sbjct: 75  GKSVVLKESSIAKAK-----SILAEKVTYSDLRN-TNCSIPQMRQVDTAETLPMFR---T 125

Query: 545 GSGKPVPLKQSSLSRARSIL-RDAVFDTGQ*TGKENGFGFEKAVFQKGSGKTLNAPESFC 369
            SGK VPLK+SS+++A SIL  D + D+     +E+GFG                     
Sbjct: 126 ASGKSVPLKESSIAKAMSILGSDKIIDSDNVLPRESGFG--------------------- 164

Query: 368 PAGLEKRFNMSNSLFQTGSGKAVNISSAGLNRAKALLGLEE---NGDHETFPASEKKNQN 198
                    +SNSLFQT S K VN+SSAGL RAKALLGLEE   NG +    +S    Q+
Sbjct: 165 ---------VSNSLFQTASNKKVNVSSAGLARAKALLGLEEDDLNGFNHVNQSSSSSQQH 215

Query: 197 PLPFVAVKGIANTGSTNVSEASFSP--FDNSSVCPAEEFLNCADKPPPIKFHIAGGRSIL 24
              +  +K      +T V   S +P  +++       E LN + K PP KF  AGG+S+ 
Sbjct: 216 --GWSGLKTHEEFDATVVKHHSGTPGQYEDYVSGKRSEVLNPSLKVPPTKFQTAGGKSLS 273

Query: 23  VSCEALK 3
           VS EALK
Sbjct: 274 VSAEALK 280


>gb|EOY07083.1| BREAST CANCER 2 like 2A, putative isoform 1 [Theobroma cacao]
          Length = 1155

 Score =  100 bits (250), Expect = 5e-19
 Identities = 81/218 (37%), Positives = 103/218 (47%), Gaps = 30/218 (13%)
 Frame = -2

Query: 566 NFQFFSTGSGKPVPLKQSSLSRARSILRDAVFDTGQ*TGKENGFGFEKAVFQKGSGKT-- 393
           N   F TG GK V LK+SS+++A SIL D    T   + K     F  ++F   +     
Sbjct: 63  NCPMFRTGLGKSVALKESSIAKALSILGDDDVGTAVTSSKR----FSLSLFSFNNVHLAF 118

Query: 392 ----LNAPESFCPAGLEKRFNMSNSLFQTGSGKAVNISSAGLNRAKALLGLEENGDHETF 225
               L+      P      F  SNSLFQTGSGK VNISSAGL RAK LLGLE++ +H +F
Sbjct: 119 HILILSFIWEVVPGN--NGFGCSNSLFQTGSGKMVNISSAGLVRAKTLLGLEQDNEHHSF 176

Query: 224 PASEKKNQNPLP-----------FVAVKGIANTG-------------STNVSEASFSPFD 117
              +   + P                 +G+ NTG             S N    S    +
Sbjct: 177 EGFQHPKKLPATNEPCGWQSFSHSEKKEGLRNTGVADFFSESRHLLNSRNGFVGSTVGSE 236

Query: 116 NSSVCPAEEFLNCADKPPPIKFHIAGGRSILVSCEALK 3
           N S     +  + A KPPPIKFH AGGRS+ VS +ALK
Sbjct: 237 NDSTPVHSKEFDSAPKPPPIKFHTAGGRSLSVSSDALK 274


>gb|EOY07085.1| BREAST CANCER 2 like 2A, putative isoform 3 [Theobroma cacao]
          Length = 982

 Score =  100 bits (249), Expect = 6e-19
 Identities = 81/217 (37%), Positives = 99/217 (45%), Gaps = 29/217 (13%)
 Frame = -2

Query: 566 NFQFFSTGSGKPVPLKQSSLSRARSILRDAVFDTGQ*TGKE-----NGFGFEKAVFQKGS 402
           N   F TG GK V LK+SS+++A SIL D    T   T +E     NGFG          
Sbjct: 63  NCPMFRTGLGKSVALKESSIAKALSILGDDDVGTAV-TSREVVPGNNGFG---------- 111

Query: 401 GKTLNAPESFCPAGLEKRFNMSNSLFQTGSGKAVNISSAGLNRAKALLGLEENGDHETFP 222
                                SNSLFQTGSGK VNISSAGL RAK LLGLE++ +H +F 
Sbjct: 112 --------------------CSNSLFQTGSGKMVNISSAGLVRAKTLLGLEQDNEHHSFE 151

Query: 221 ASEKKNQNPLP-----------FVAVKGIANTG-------------STNVSEASFSPFDN 114
             +   + P                 +G+ NTG             S N    S    +N
Sbjct: 152 GFQHPKKLPATNEPCGWQSFSHSEKKEGLRNTGVADFFSESRHLLNSRNGFVGSTVGSEN 211

Query: 113 SSVCPAEEFLNCADKPPPIKFHIAGGRSILVSCEALK 3
            S     +  + A KPPPIKFH AGGRS+ VS +ALK
Sbjct: 212 DSTPVHSKEFDSAPKPPPIKFHTAGGRSLSVSSDALK 248


>gb|EOY07084.1| BRCA2-like B, putative isoform 2 [Theobroma cacao]
          Length = 1111

 Score =  100 bits (249), Expect = 6e-19
 Identities = 81/217 (37%), Positives = 99/217 (45%), Gaps = 29/217 (13%)
 Frame = -2

Query: 566 NFQFFSTGSGKPVPLKQSSLSRARSILRDAVFDTGQ*TGKE-----NGFGFEKAVFQKGS 402
           N   F TG GK V LK+SS+++A SIL D    T   T +E     NGFG          
Sbjct: 63  NCPMFRTGLGKSVALKESSIAKALSILGDDDVGTAV-TSREVVPGNNGFG---------- 111

Query: 401 GKTLNAPESFCPAGLEKRFNMSNSLFQTGSGKAVNISSAGLNRAKALLGLEENGDHETFP 222
                                SNSLFQTGSGK VNISSAGL RAK LLGLE++ +H +F 
Sbjct: 112 --------------------CSNSLFQTGSGKMVNISSAGLVRAKTLLGLEQDNEHHSFE 151

Query: 221 ASEKKNQNPLP-----------FVAVKGIANTG-------------STNVSEASFSPFDN 114
             +   + P                 +G+ NTG             S N    S    +N
Sbjct: 152 GFQHPKKLPATNEPCGWQSFSHSEKKEGLRNTGVADFFSESRHLLNSRNGFVGSTVGSEN 211

Query: 113 SSVCPAEEFLNCADKPPPIKFHIAGGRSILVSCEALK 3
            S     +  + A KPPPIKFH AGGRS+ VS +ALK
Sbjct: 212 DSTPVHSKEFDSAPKPPPIKFHTAGGRSLSVSSDALK 248


>emb|CBI18109.3| unnamed protein product [Vitis vinifera]
          Length = 1134

 Score = 99.4 bits (246), Expect = 1e-18
 Identities = 81/208 (38%), Positives = 97/208 (46%), Gaps = 24/208 (11%)
 Frame = -2

Query: 554 FSTGSGKPVPLKQSSLSRARSILRDAVFDTGQ*TGKENGFGFEKAVFQKGSGKTLNAPES 375
           F TG GK V +KQSS+++A S+L D  F  G        +  +   F    G T++  E 
Sbjct: 73  FRTGLGKSVTVKQSSIAKALSVLGDDDFGAGGAQCSLFFYHLDYLSFADAIGSTISFKEH 132

Query: 374 FC---------------PAGLEKRFN---MSNSLFQTGSGKAVNISSAGLNRAKALLGLE 249
                            P     R N    SNSLFQTGSGK VNISSAGL RAK LLGLE
Sbjct: 133 CSGQDQNISQKDLLLPGPDPDHDRDNGCGFSNSLFQTGSGKMVNISSAGLVRAKTLLGLE 192

Query: 248 ENGDHETFPASEKKN------QNPLPFVAVKGIANTGSTNVSEASFSPFDNSSVCPAEEF 87
           EN +H +      K         P PF          ST+ S  + S  +        E 
Sbjct: 193 ENSNHHSCQEHITKQSVMDGLDVPRPF----------STSTSWRTESINEAVPHLKQSEM 242

Query: 86  LNCADKPPPIKFHIAGGRSILVSCEALK 3
            N A  PPPIKFH AGGRSI VS +AL+
Sbjct: 243 YNPAPNPPPIKFHTAGGRSISVSSDALQ 270


>ref|NP_195783.3| protein BRCA2-like B [Arabidopsis thaliana]
           gi|31335362|emb|CAD32572.1| breast cancer susceptibility
           protein 2b [Arabidopsis thaliana]
           gi|332002986|gb|AED90369.1| protein BRCA2-like B
           [Arabidopsis thaliana]
          Length = 1155

 Score = 99.0 bits (245), Expect = 2e-18
 Identities = 90/247 (36%), Positives = 121/247 (48%), Gaps = 6/247 (2%)
 Frame = -2

Query: 725 GESVIMKESSFALAAPIHAISLAARYLTYLLVCEVTHFDIHQEV*D*SRIVMPNFQFFST 546
           G+SV++KESS A A  I A ++A   L      + T+  I Q     +   MP F+   T
Sbjct: 75  GKSVVLKESSIAKAKSILAENVAYSDL------QNTNCSIPQTRQVDTAETMPMFR---T 125

Query: 545 GSGKPVPLKQSSLSRARSIL-RDAVFDTGQ*TGKENGFGFEKAVFQKGSGKTLNAPESFC 369
             GK VPLK+SS+++  SIL  D + D+     +E+GFG                     
Sbjct: 126 ALGKTVPLKESSIAKPLSILGSDMIIDSDNVLPRESGFG--------------------- 164

Query: 368 PAGLEKRFNMSNSLFQTGSGKAVNISSAGLNRAKALLGLEE---NGDHETFPASEKKNQN 198
                    + NSLFQT S K VN+SSAGL RAKALLGLEE   NG +    +S    Q+
Sbjct: 165 ---------VPNSLFQTASNKKVNVSSAGLARAKALLGLEEDDLNGFNHVNQSSSSLQQH 215

Query: 197 PLPFVAVKGIANTGSTNVSEASFSP--FDNSSVCPAEEFLNCADKPPPIKFHIAGGRSIL 24
              +  +K      +T V   S +P  ++N       E LN + K PP KF  AGG+S+ 
Sbjct: 216 --GWSGLKTHEEFDATVVKHHSGTPGQYENYVSGKRSEILNPSLKVPPTKFQTAGGKSLS 273

Query: 23  VSCEALK 3
           VS EALK
Sbjct: 274 VSAEALK 280


>emb|CAB82279.1| putative protein [Arabidopsis thaliana]
          Length = 1136

 Score = 99.0 bits (245), Expect = 2e-18
 Identities = 90/247 (36%), Positives = 121/247 (48%), Gaps = 6/247 (2%)
 Frame = -2

Query: 725 GESVIMKESSFALAAPIHAISLAARYLTYLLVCEVTHFDIHQEV*D*SRIVMPNFQFFST 546
           G+SV++KESS A A  I A ++A   L      + T+  I Q     +   MP F+   T
Sbjct: 75  GKSVVLKESSIAKAKSILAENVAYSDL------QNTNCSIPQTRQVDTAETMPMFR---T 125

Query: 545 GSGKPVPLKQSSLSRARSIL-RDAVFDTGQ*TGKENGFGFEKAVFQKGSGKTLNAPESFC 369
             GK VPLK+SS+++  SIL  D + D+     +E+GFG                     
Sbjct: 126 ALGKTVPLKESSIAKPLSILGSDMIIDSDNVLPRESGFG--------------------- 164

Query: 368 PAGLEKRFNMSNSLFQTGSGKAVNISSAGLNRAKALLGLEE---NGDHETFPASEKKNQN 198
                    + NSLFQT S K VN+SSAGL RAKALLGLEE   NG +    +S    Q+
Sbjct: 165 ---------VPNSLFQTASNKKVNVSSAGLARAKALLGLEEDDLNGFNHVNQSSSSLQQH 215

Query: 197 PLPFVAVKGIANTGSTNVSEASFSP--FDNSSVCPAEEFLNCADKPPPIKFHIAGGRSIL 24
              +  +K      +T V   S +P  ++N       E LN + K PP KF  AGG+S+ 
Sbjct: 216 --GWSGLKTHEEFDATVVKHHSGTPGQYENYVSGKRSEILNPSLKVPPTKFQTAGGKSLS 273

Query: 23  VSCEALK 3
           VS EALK
Sbjct: 274 VSAEALK 280


>ref|XP_002870909.1| hypothetical protein ARALYDRAFT_486909 [Arabidopsis lyrata subsp.
           lyrata] gi|297316746|gb|EFH47168.1| hypothetical protein
           ARALYDRAFT_486909 [Arabidopsis lyrata subsp. lyrata]
          Length = 1151

 Score = 92.8 bits (229), Expect = 1e-16
 Identities = 79/213 (37%), Positives = 105/213 (49%), Gaps = 29/213 (13%)
 Frame = -2

Query: 554 FSTGSGKPVPLKQSSLSRARSILRDA-VFDTGQ*TGKEN------GFGFEKAVFQKGSGK 396
           F TG GK VPLK+SS+++A+S+L D+  F   Q T   N             +F+   GK
Sbjct: 70  FRTGLGKSVPLKESSMAKAKSLLADSGTFLDLQNTNCSNPQMRQVDSAETLPMFRTALGK 129

Query: 395 TLNAPESFCPAGL-----------------EKRFNMSNSLFQTGSGKAVNISSAGLNRAK 267
           ++   ES     L                 E  F + N+LFQT S K VN+SSAGL RAK
Sbjct: 130 SVPLKESSIAKALSILASDKIIDSDYVLPRESGFGVPNTLFQTASNKKVNVSSAGLARAK 189

Query: 266 ALLGLEE---NGDHETFPASEKKNQNPLPFVAVKGIANTGSTNVSEASFSP--FDNSSVC 102
           ALLGLEE   NG +    +S    Q+ L    +K      +T V   S +P  +++    
Sbjct: 190 ALLGLEEDDLNGFNHVNQSSSSLQQHGLS--VLKTHEEFDATVVKHHSGTPGQYEDYVSG 247

Query: 101 PAEEFLNCADKPPPIKFHIAGGRSILVSCEALK 3
              E LN + K PP KF  AGG+S+ VS EALK
Sbjct: 248 KRPEILNPSLKVPPTKFQTAGGKSLSVSAEALK 280


>ref|XP_004156673.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein
           LOC101230245 [Cucumis sativus]
          Length = 1111

 Score = 90.5 bits (223), Expect = 6e-16
 Identities = 76/203 (37%), Positives = 98/203 (48%), Gaps = 19/203 (9%)
 Frame = -2

Query: 554 FSTGSGKPVPLKQSSLSRARSILRDAVFDTGQ*TGKENGFGFEKAVFQKGSGKTLNAPES 375
           F TG GK V +KQSS+ +A S+L D   D     G+ +  G                   
Sbjct: 73  FRTGLGKSVSVKQSSIDKALSLLSD---DKAPDIGRLHNGG------------------- 110

Query: 374 FCPAGLEKRFNMSNSLFQTGSGKAVNISSAGLNRAKALLGLEENGDHETFPASEKKNQNP 195
                     N SNSLFQTGSGK+VN+SS GL RAK LLGLEE+   +T  + ++  Q  
Sbjct: 111 ----------NFSNSLFQTGSGKSVNVSSEGLLRAKTLLGLEED---DTCSSFQRFGQAI 157

Query: 194 LP------FVAVKGIANTGSTNVSEASFSP------FDNSS----VCPA---EEFLNCAD 72
            P      F+  KG+    + + +  S SP      F  SS      P+    E  N A 
Sbjct: 158 SPYDVKGEFLESKGVCGMENMSGASVSISPLVFNTCFSRSSSENQASPSFRQIELPNKAP 217

Query: 71  KPPPIKFHIAGGRSILVSCEALK 3
           K PPIKFH AGGRS+ VS +AL+
Sbjct: 218 KAPPIKFHTAGGRSLSVSSDALQ 240


>ref|XP_004137896.1| PREDICTED: uncharacterized protein LOC101215906 [Cucumis sativus]
          Length = 1111

 Score = 90.5 bits (223), Expect = 6e-16
 Identities = 76/203 (37%), Positives = 98/203 (48%), Gaps = 19/203 (9%)
 Frame = -2

Query: 554 FSTGSGKPVPLKQSSLSRARSILRDAVFDTGQ*TGKENGFGFEKAVFQKGSGKTLNAPES 375
           F TG GK V +KQSS+ +A S+L D   D     G+ +  G                   
Sbjct: 73  FRTGLGKSVSVKQSSIDKALSLLSD---DKAPDIGRLHNGG------------------- 110

Query: 374 FCPAGLEKRFNMSNSLFQTGSGKAVNISSAGLNRAKALLGLEENGDHETFPASEKKNQNP 195
                     N SNSLFQTGSGK+VN+SS GL RAK LLGLEE+   +T  + ++  Q  
Sbjct: 111 ----------NFSNSLFQTGSGKSVNVSSEGLLRAKTLLGLEED---DTCSSFQRFGQAI 157

Query: 194 LP------FVAVKGIANTGSTNVSEASFSP------FDNSS----VCPA---EEFLNCAD 72
            P      F+  KG+    + + +  S SP      F  SS      P+    E  N A 
Sbjct: 158 SPYDVKGEFLESKGVCGMENMSGASVSISPLVFNTCFSRSSSENQASPSFRQIELPNKAP 217

Query: 71  KPPPIKFHIAGGRSILVSCEALK 3
           K PPIKFH AGGRS+ VS +AL+
Sbjct: 218 KAPPIKFHTAGGRSLSVSSDALQ 240


>gb|EXB46338.1| Breast cancer type 2 susceptibility-like protein [Morus notabilis]
          Length = 1155

 Score = 88.2 bits (217), Expect = 3e-15
 Identities = 75/208 (36%), Positives = 97/208 (46%), Gaps = 24/208 (11%)
 Frame = -2

Query: 554 FSTGSGKPVPLKQSSLSRARSILRD-AVFDTGQ*TGKENGFGFEKAVFQKGSGKTLNAPE 378
           F TG G+ VP+KQSS+++A S+L D +V DTGQ   ++N   F                 
Sbjct: 78  FKTGLGRFVPVKQSSITKALSVLGDDSVTDTGQIQARDNVCDFP---------------- 121

Query: 377 SFCPAGLEKRFNMSNSLFQTGSGKAVNISSAGLNRAKALLGLEENGD---HETFPASEKK 207
                         NSLFQTGSGK VNISS GL RAK LLGL E  D    + F  S K 
Sbjct: 122 --------------NSLFQTGSGKKVNISSDGLARAKTLLGLVEESDPCNFQGFRNSRKS 167

Query: 206 NQ--------NPLPFVAVKGIANTGSTNV-----------SEASFSPFDNSSVCPAEEFL 84
           +         N   F   +G+ + G+ +            ++   S F N +  P    +
Sbjct: 168 SNIDSSFGWPNISNFEKGEGVNHFGTVHSASGPRSSPICRTDIGHSRFGNEAKQPTHSRM 227

Query: 83  -NCADKPPPIKFHIAGGRSILVSCEALK 3
            N A  P PIKF  AGGRSI VS +AL+
Sbjct: 228 PNSATTPSPIKFQTAGGRSISVSSDALQ 255

