BLASTX nr result

ID: Forsythia23_contig00007664 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Forsythia23_contig00007664
         (823 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011072884.1| PREDICTED: uncharacterized protein LOC105157...   328   3e-87
ref|XP_007209129.1| hypothetical protein PRUPE_ppa005552mg [Prun...   304   5e-80
emb|CDP12040.1| unnamed protein product [Coffea canephora]            303   7e-80
ref|XP_008238921.1| PREDICTED: uncharacterized protein LOC103337...   303   7e-80
ref|XP_012086872.1| PREDICTED: uncharacterized protein LOC105645...   296   1e-77
ref|XP_010089083.1| hypothetical protein L484_024256 [Morus nota...   291   4e-76
ref|XP_006476541.1| PREDICTED: uncharacterized protein LOC102626...   291   4e-76
gb|KDO76253.1| hypothetical protein CISIN_1g012593mg [Citrus sin...   289   2e-75
ref|XP_006439523.1| hypothetical protein CICLE_v10020073mg [Citr...   288   4e-75
ref|XP_012439688.1| PREDICTED: uncharacterized protein LOC105765...   285   2e-74
ref|XP_007040283.1| Hydroxyproline-rich glycoprotein family prot...   283   9e-74
ref|XP_002509822.1| conserved hypothetical protein [Ricinus comm...   283   1e-73
ref|XP_002270742.2| PREDICTED: uncharacterized protein LOC100241...   281   5e-73
ref|XP_012439689.1| PREDICTED: uncharacterized protein LOC105765...   280   1e-72
ref|XP_011080935.1| PREDICTED: uncharacterized protein LOC105164...   279   1e-72
gb|KHG14068.1| hypothetical protein F383_19964 [Gossypium arboreum]   278   3e-72
ref|XP_011022665.1| PREDICTED: uncharacterized protein LOC105124...   276   9e-72
ref|XP_002298027.2| hypothetical protein POPTR_0001s09590g [Popu...   274   4e-71
ref|XP_012836270.1| PREDICTED: COPII coat assembly protein sec16...   274   6e-71
ref|XP_002304504.1| hypothetical protein POPTR_0003s12950g [Popu...   273   1e-70

>ref|XP_011072884.1| PREDICTED: uncharacterized protein LOC105157994 [Sesamum indicum]
          Length = 455

 Score =  328 bits (840), Expect = 3e-87
 Identities = 171/260 (65%), Positives = 187/260 (71%)
 Frame = -3

Query: 785 GVNGTDALETINAAASVIASAENRVPQVSAQKRRWRSYWSLFWCFGSDKNKQRIGHAVLV 606
           GVNG DALETINAAA+ IAS E R  Q S QKRRW  +WSL+WCFGS+K K RIGHAVLV
Sbjct: 5   GVNGADALETINAAATAIASVETRGLQDSVQKRRWGRWWSLYWCFGSNKTK-RIGHAVLV 63

Query: 605 PETPTTRAEGPTAENPSQPLSIXXXXXXXXXXXXXXXXXXXXSATQSPTGLISLTSISAN 426
           PET    A+ PTAE+P+QP SI                    SA QSPTGL+SLTS+SA+
Sbjct: 64  PETTAAGADAPTAEHPAQPPSIVLPFVAPPSSPASFLPSEPPSAAQSPTGLLSLTSVSAS 123

Query: 425 MYSPGGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPYTPPPESVHLTTPSSPEVPFA 246
           MYSPGGP+SIFAIGPYAHETQLVSPPVFSTFTTEPSTAPYTPPPESVHLTTPSSPEVPFA
Sbjct: 124 MYSPGGPASIFAIGPYAHETQLVSPPVFSTFTTEPSTAPYTPPPESVHLTTPSSPEVPFA 183

Query: 245 RLLEPNFQSGEGGQRYPLSQYEFQSYQLQXXXXXXXXXXXXXXXXXXXXXXPFPDLECAP 66
           RLLEPN Q+ E GQR+ +S YEFQSYQLQ                      PFPD E A 
Sbjct: 184 RLLEPNLQNSEAGQRFHISPYEFQSYQLQPGSPVSHLISPSSGISGSGTSSPFPDREFAA 243

Query: 65  GHPLFVEFRTGNHTKFLNLD 6
           GHP F+EFRTGN  K L+LD
Sbjct: 244 GHPFFLEFRTGNPPKLLDLD 263


>ref|XP_007209129.1| hypothetical protein PRUPE_ppa005552mg [Prunus persica]
           gi|462404864|gb|EMJ10328.1| hypothetical protein
           PRUPE_ppa005552mg [Prunus persica]
          Length = 455

 Score =  304 bits (778), Expect = 5e-80
 Identities = 159/257 (61%), Positives = 180/257 (70%)
 Frame = -3

Query: 776 GTDALETINAAASVIASAENRVPQVSAQKRRWRSYWSLFWCFGSDKNKQRIGHAVLVPET 597
           G +ALETINAAAS IA+AENRVPQ + QKRRW S+WS++WCFG  ++K+RIGHAVLVPET
Sbjct: 11  GNNALETINAAASAIAAAENRVPQATVQKRRWGSWWSMYWCFGFQRHKKRIGHAVLVPET 70

Query: 596 PTTRAEGPTAENPSQPLSIXXXXXXXXXXXXXXXXXXXXSATQSPTGLISLTSISANMYS 417
                + P AENP Q  SI                    SATQSP G  SLT   A+MYS
Sbjct: 71  TDRGGDAPRAENPIQTPSIVLPFVAPPSSPASFLQSEPPSATQSPAGFFSLT---ASMYS 127

Query: 416 PGGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPYTPPPESVHLTTPSSPEVPFARLL 237
           P GP+SIFAIGPYAHETQLVSPPVFSTFTTEPSTAP+TPPPESVHLTTPSSPEVPFA+LL
Sbjct: 128 PSGPTSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLL 187

Query: 236 EPNFQSGEGGQRYPLSQYEFQSYQLQXXXXXXXXXXXXXXXXXXXXXXPFPDLECAPGHP 57
           +P+F++GEGGQR+PLS YEFQSYQL                       PFPDLE A    
Sbjct: 188 DPHFRNGEGGQRFPLSHYEFQSYQLYPGSPVGQLISPSSGISGSGTSSPFPDLEFAARGH 247

Query: 56  LFVEFRTGNHTKFLNLD 6
            F+EFRTG+  K LNLD
Sbjct: 248 HFLEFRTGDPPKLLNLD 264


>emb|CDP12040.1| unnamed protein product [Coffea canephora]
          Length = 466

 Score =  303 bits (777), Expect = 7e-80
 Identities = 160/260 (61%), Positives = 183/260 (70%)
 Frame = -3

Query: 782 VNGTDALETINAAASVIASAENRVPQVSAQKRRWRSYWSLFWCFGSDKNKQRIGHAVLVP 603
           VN T  LETINAAA+ IA+AENRVPQV  QKRRW S WSL+WCFGS K+ +RIGHAVLVP
Sbjct: 16  VNNT--LETINAAANAIAAAENRVPQVGVQKRRWASCWSLYWCFGSYKHTKRIGHAVLVP 73

Query: 602 ETPTTRAEGPTAENPSQPLSIXXXXXXXXXXXXXXXXXXXXSATQSPTGLISLTSISANM 423
           E    RA+ P  EN +Q  S+                    SATQSP GL+SLTS+SA+M
Sbjct: 74  EPIAPRADPPAVENQTQAASVALPFIAPPSSPASFLQSEPPSATQSPPGLLSLTSMSASM 133

Query: 422 YSPGGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPYTPPPESVHLTTPSSPEVPFAR 243
           YSPGGP+S+FAIGPYAHETQLV+PPVFSTFTTEPSTAP+TPPPESVH+TTPSSPEVPFAR
Sbjct: 134 YSPGGPASMFAIGPYAHETQLVTPPVFSTFTTEPSTAPFTPPPESVHMTTPSSPEVPFAR 193

Query: 242 LLEPNFQSGEGGQRYPLSQYEFQSYQLQXXXXXXXXXXXXXXXXXXXXXXPFPDLECAPG 63
           LL+P  Q+ + GQRYPL QYEFQSYQLQ                      PFPD E   G
Sbjct: 194 LLDPIDQNCQDGQRYPLPQYEFQSYQLQPGSPASHLISPSSGISGSGTSSPFPDGEFVYG 253

Query: 62  HPLFVEFRTGNHTKFLNLDK 3
            P F+EFR+G+  K LNL+K
Sbjct: 254 RPHFLEFRSGDPPKLLNLEK 273


>ref|XP_008238921.1| PREDICTED: uncharacterized protein LOC103337539 [Prunus mume]
          Length = 455

 Score =  303 bits (777), Expect = 7e-80
 Identities = 159/257 (61%), Positives = 179/257 (69%)
 Frame = -3

Query: 776 GTDALETINAAASVIASAENRVPQVSAQKRRWRSYWSLFWCFGSDKNKQRIGHAVLVPET 597
           G +ALETINAAAS IA+AENRVPQ + QKRRW S+WS++WCFG  ++K+RIGHAVLVPET
Sbjct: 11  GNNALETINAAASAIAAAENRVPQATVQKRRWGSWWSMYWCFGFQRHKKRIGHAVLVPET 70

Query: 596 PTTRAEGPTAENPSQPLSIXXXXXXXXXXXXXXXXXXXXSATQSPTGLISLTSISANMYS 417
                + P AENP Q  SI                    SATQSP G  SLT   A+MYS
Sbjct: 71  TDRGGDAPRAENPIQTPSIVLPFVAPPSSPASFLQSEPPSATQSPAGFFSLT---ASMYS 127

Query: 416 PGGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPYTPPPESVHLTTPSSPEVPFARLL 237
           P GP+SIFAIGPYAHETQLVSPPVFSTFTTEPSTAP+TPPPESVHLTTPSSPEVPFA+LL
Sbjct: 128 PSGPTSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLL 187

Query: 236 EPNFQSGEGGQRYPLSQYEFQSYQLQXXXXXXXXXXXXXXXXXXXXXXPFPDLECAPGHP 57
           +P F++GEGGQR+PLS YEFQSYQL                       PFPDLE A    
Sbjct: 188 DPQFRNGEGGQRFPLSHYEFQSYQLYPGSPVGQLISPSSGISGSGTSSPFPDLEFAARGH 247

Query: 56  LFVEFRTGNHTKFLNLD 6
            F+EFRTG+  K LNLD
Sbjct: 248 HFLEFRTGDPPKLLNLD 264


>ref|XP_012086872.1| PREDICTED: uncharacterized protein LOC105645786 [Jatropha curcas]
           gi|643711987|gb|KDP25415.1| hypothetical protein
           JCGZ_20571 [Jatropha curcas]
          Length = 455

 Score =  296 bits (758), Expect = 1e-77
 Identities = 160/267 (59%), Positives = 185/267 (69%), Gaps = 5/267 (1%)
 Frame = -3

Query: 788 RGVNG-----TDALETINAAASVIASAENRVPQVSAQKRRWRSYWSLFWCFGSDKNKQRI 624
           R VNG      +AL+TINAAAS IASAENRVPQ + QKRRW S +S++WCFG +++++RI
Sbjct: 2   RAVNGDSRPSNNALDTINAAASAIASAENRVPQATVQKRRWGSCFSVYWCFGYNRHRKRI 61

Query: 623 GHAVLVPETPTTRAEGPTAENPSQPLSIXXXXXXXXXXXXXXXXXXXXSATQSPTGLISL 444
           GHAVLVPETP  R +   AEN +Q  +I                    SA+QSPTG++SL
Sbjct: 62  GHAVLVPETPGPRNDSSAAENSTQTPTITLPFVAPPSSPASFLQSEPPSASQSPTGVLSL 121

Query: 443 TSISANMYSPGGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPYTPPPESVHLTTPSS 264
           TSISANMYSP GPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAP+TPPPESVHLTTPSS
Sbjct: 122 TSISANMYSPSGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSS 181

Query: 263 PEVPFARLLEPNFQSGEGGQRYPLSQYEFQSYQLQXXXXXXXXXXXXXXXXXXXXXXPFP 84
           PEVPFA+LL+P+ ++ E G R+PLS YEFQSYQL                       PFP
Sbjct: 182 PEVPFAQLLDPSIRNVEAGLRFPLSNYEFQSYQLYPGSPVGQLISPSSGISGSGTSSPFP 241

Query: 83  DLECAPGHPLFVEFRTGNHTKFLNLDK 3
           D E A G   F+EFR G   K LNLDK
Sbjct: 242 DGEFAAG---FLEFRMGEPPKLLNLDK 265


>ref|XP_010089083.1| hypothetical protein L484_024256 [Morus notabilis]
           gi|587846890|gb|EXB37330.1| hypothetical protein
           L484_024256 [Morus notabilis]
          Length = 455

 Score =  291 bits (744), Expect = 4e-76
 Identities = 154/263 (58%), Positives = 178/263 (67%)
 Frame = -3

Query: 791 SRGVNGTDALETINAAASVIASAENRVPQVSAQKRRWRSYWSLFWCFGSDKNKQRIGHAV 612
           SR +N  +ALETINAAA+ IA AENRVPQ + +KRRW    S++WCFG+ KN+ RIGH V
Sbjct: 10  SRTMN--NALETINAAATAIAMAENRVPQATVRKRRWGGCLSIYWCFGTPKNRTRIGHGV 67

Query: 611 LVPETPTTRAEGPTAENPSQPLSIXXXXXXXXXXXXXXXXXXXXSATQSPTGLISLTSIS 432
           LVPET       P AEN +Q  ++                    SATQSP GL+SLTS+S
Sbjct: 68  LVPETAQPGNSAPRAENSTQTHAVILPFIAPPSSPASFLQSEPPSATQSPAGLLSLTSVS 127

Query: 431 ANMYSPGGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPYTPPPESVHLTTPSSPEVP 252
           A+MYSPGGP+SIFAIGPYAHETQLVSPPVFSTFTTEPSTAP+TPPPESVHLTTPSSPEVP
Sbjct: 128 ASMYSPGGPASIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVP 187

Query: 251 FARLLEPNFQSGEGGQRYPLSQYEFQSYQLQXXXXXXXXXXXXXXXXXXXXXXPFPDLEC 72
           FA+LL+PN  +GE GQR+P+   EFQSY  Q                      PFPD E 
Sbjct: 188 FAQLLDPNIHNGEPGQRFPIFHNEFQSYYFQPGSPIGQLISPSSGISGSGTSSPFPDPEF 247

Query: 71  APGHPLFVEFRTGNHTKFLNLDK 3
           A   P F+EFRTG+  K LNLDK
Sbjct: 248 AARGPHFLEFRTGDPPKLLNLDK 270


>ref|XP_006476541.1| PREDICTED: uncharacterized protein LOC102626793 [Citrus sinensis]
          Length = 460

 Score =  291 bits (744), Expect = 4e-76
 Identities = 155/268 (57%), Positives = 177/268 (66%), Gaps = 6/268 (2%)
 Frame = -3

Query: 788 RGVNGTDA------LETINAAASVIASAENRVPQVSAQKRRWRSYWSLFWCFGSDKNKQR 627
           RGVNG D+      LETINAAA+ IASAENRV Q ++QKRRW   WS+ WCFG  K+++R
Sbjct: 2   RGVNGGDSRALNNSLETINAAATAIASAENRVHQATSQKRRWGGCWSISWCFGFQKHRKR 61

Query: 626 IGHAVLVPETPTTRAEGPTAENPSQPLSIXXXXXXXXXXXXXXXXXXXXSATQSPTGLIS 447
           IGHAVLVPE   +R+    A N +Q  +I                    SATQSP GL+S
Sbjct: 62  IGHAVLVPEPTASRSNASEAVNSTQAAAISLPFVAPPSSPASFLQSEPPSATQSPAGLVS 121

Query: 446 LTSISANMYSPGGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPYTPPPESVHLTTPS 267
           L SIS NMYSPGGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAP+TPPPESVHLTTPS
Sbjct: 122 LNSISGNMYSPGGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPS 181

Query: 266 SPEVPFARLLEPNFQSGEGGQRYPLSQYEFQSYQLQXXXXXXXXXXXXXXXXXXXXXXPF 87
           SPEVPFA+LL+P+ + GE GQ++P S YEFQSY L                       PF
Sbjct: 182 SPEVPFAQLLDPSLRFGEQGQKFPFSYYEFQSYHLHPGSPVGNLISPSSGISGSGTSSPF 241

Query: 86  PDLECAPGHPLFVEFRTGNHTKFLNLDK 3
           PD E A   P F +F  G+  K LNLDK
Sbjct: 242 PDGEFATAGPQFPDFHRGDPPKLLNLDK 269


>gb|KDO76253.1| hypothetical protein CISIN_1g012593mg [Citrus sinensis]
          Length = 460

 Score =  289 bits (739), Expect = 2e-75
 Identities = 154/268 (57%), Positives = 177/268 (66%), Gaps = 6/268 (2%)
 Frame = -3

Query: 788 RGVNGTDA------LETINAAASVIASAENRVPQVSAQKRRWRSYWSLFWCFGSDKNKQR 627
           RGVNG D+      LETI+AAA+ IASAENRV Q ++QKRRW   WS+ WCFG  K+++R
Sbjct: 2   RGVNGGDSRALNNSLETISAAATAIASAENRVHQATSQKRRWGGCWSISWCFGFQKHRKR 61

Query: 626 IGHAVLVPETPTTRAEGPTAENPSQPLSIXXXXXXXXXXXXXXXXXXXXSATQSPTGLIS 447
           IGHAVLVPE   +R+    A N +Q  +I                    SATQSP GL+S
Sbjct: 62  IGHAVLVPEPTASRSNASEAVNSTQAAAISLPFVAPPSSPASFLQSEPPSATQSPAGLVS 121

Query: 446 LTSISANMYSPGGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPYTPPPESVHLTTPS 267
           L SIS NMYSPGGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAP+TPPPESVHLTTPS
Sbjct: 122 LNSISGNMYSPGGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPS 181

Query: 266 SPEVPFARLLEPNFQSGEGGQRYPLSQYEFQSYQLQXXXXXXXXXXXXXXXXXXXXXXPF 87
           SPEVPFA+LL+P+ + GE GQ++P S YEFQSY L                       PF
Sbjct: 182 SPEVPFAQLLDPSLRFGEQGQKFPFSYYEFQSYHLHPGSPVGNLISPSSGISGSGTSSPF 241

Query: 86  PDLECAPGHPLFVEFRTGNHTKFLNLDK 3
           PD E A   P F +F  G+  K LNLDK
Sbjct: 242 PDGEFATAGPQFPDFHRGDPPKLLNLDK 269


>ref|XP_006439523.1| hypothetical protein CICLE_v10020073mg [Citrus clementina]
           gi|557541785|gb|ESR52763.1| hypothetical protein
           CICLE_v10020073mg [Citrus clementina]
          Length = 460

 Score =  288 bits (736), Expect = 4e-75
 Identities = 153/268 (57%), Positives = 177/268 (66%), Gaps = 6/268 (2%)
 Frame = -3

Query: 788 RGVNGTDA------LETINAAASVIASAENRVPQVSAQKRRWRSYWSLFWCFGSDKNKQR 627
           RGVNG D+      LETI+AAA+ IASAENRV Q ++QKRRW   W++ WCFG  K+++R
Sbjct: 2   RGVNGGDSRALNNSLETISAAATAIASAENRVHQATSQKRRWGGCWNISWCFGFQKHRKR 61

Query: 626 IGHAVLVPETPTTRAEGPTAENPSQPLSIXXXXXXXXXXXXXXXXXXXXSATQSPTGLIS 447
           IGHAVLVPE   +R+    A N +Q  +I                    SATQSP GL+S
Sbjct: 62  IGHAVLVPEPTASRSNASEAVNSTQATAISLPFVAPPSSPASFLQSEPPSATQSPAGLVS 121

Query: 446 LTSISANMYSPGGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPYTPPPESVHLTTPS 267
           L SIS NMYSPGGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAP+TPPPESVHLTTPS
Sbjct: 122 LNSISGNMYSPGGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPS 181

Query: 266 SPEVPFARLLEPNFQSGEGGQRYPLSQYEFQSYQLQXXXXXXXXXXXXXXXXXXXXXXPF 87
           SPEVPFA+LL+P+ + GE GQ++P S YEFQSY L                       PF
Sbjct: 182 SPEVPFAQLLDPSLRFGEQGQKFPFSYYEFQSYHLHPGSPVGNLISPSSGISGSGTSSPF 241

Query: 86  PDLECAPGHPLFVEFRTGNHTKFLNLDK 3
           PD E A   P F +F  G+  K LNLDK
Sbjct: 242 PDGEFATAGPQFPDFHRGDPPKLLNLDK 269


>ref|XP_012439688.1| PREDICTED: uncharacterized protein LOC105765237 isoform X3
           [Gossypium raimondii] gi|763785055|gb|KJB52126.1|
           hypothetical protein B456_008G247900 [Gossypium
           raimondii]
          Length = 457

 Score =  285 bits (729), Expect = 2e-74
 Identities = 157/267 (58%), Positives = 180/267 (67%), Gaps = 5/267 (1%)
 Frame = -3

Query: 788 RGVNG-----TDALETINAAASVIASAENRVPQVSAQKRRWRSYWSLFWCFGSDKNKQRI 624
           RG NG      + LETI+AAA+ IASAENRVPQ + QK+RW  +WS +WCFGS K K+RI
Sbjct: 2   RGANGESRAMNNPLETIHAAANAIASAENRVPQSTVQKKRWGGWWSKYWCFGSYKQKKRI 61

Query: 623 GHAVLVPETPTTRAEGPTAENPSQPLSIXXXXXXXXXXXXXXXXXXXXSATQSPTGLISL 444
           G AV V ET ++RA  P AE P+Q +++                    SATQSP GL+SL
Sbjct: 62  GPAVPVSETSSSRANMPAAEIPTQAVTVTLPFVAPPSSPASFLPSEPPSATQSPAGLVSL 121

Query: 443 TSISANMYSPGGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPYTPPPESVHLTTPSS 264
           TSISA+MYSPG P+SIFAIGPYAHETQLVSPPVFSTFTTEPSTAP+TPPPESVHLTTPSS
Sbjct: 122 TSISASMYSPG-PASIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSS 180

Query: 263 PEVPFARLLEPNFQSGEGGQRYPLSQYEFQSYQLQXXXXXXXXXXXXXXXXXXXXXXPFP 84
           PEVPFA+ L PN Q GE GQR+P+ QYEFQSYQL                       PFP
Sbjct: 181 PEVPFAQFLGPNLQYGEAGQRFPIYQYEFQSYQLHPGSPIGQLISPSSGISGSGTSSPFP 240

Query: 83  DLECAPGHPLFVEFRTGNHTKFLNLDK 3
           D + A G   F EFR G+  K LNLDK
Sbjct: 241 DGDFAAG-LRFPEFRMGDPPKLLNLDK 266


>ref|XP_007040283.1| Hydroxyproline-rich glycoprotein family protein [Theobroma cacao]
           gi|508777528|gb|EOY24784.1| Hydroxyproline-rich
           glycoprotein family protein [Theobroma cacao]
          Length = 458

 Score =  283 bits (724), Expect = 9e-74
 Identities = 160/268 (59%), Positives = 178/268 (66%), Gaps = 6/268 (2%)
 Frame = -3

Query: 788 RGVNG-----TDALETINAAASVIASAENRVPQVSAQKRRWRSYWSLFWCFGSDKNKQRI 624
           RG NG      + LETI+AAA+ IASAENRVPQ + QKRRW   WS++WCFGS K K+RI
Sbjct: 2   RGANGESIAMNNTLETIHAAANAIASAENRVPQATVQKRRWGGCWSIYWCFGSYKQKKRI 61

Query: 623 GHAVLVPETPTTRAEGPTAENPSQPLSIXXXXXXXXXXXXXXXXXXXXSATQSPTGLISL 444
           G AVL  ET  + A  P AENP+Q  +I                    SATQSP GL+SL
Sbjct: 62  GPAVLTSETSFSGANVPAAENPTQAPAIALPFVAPPSSPASFLPSEPPSATQSPAGLVSL 121

Query: 443 TSISANMYSPGGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPYTPPPESVHLTTPSS 264
           TSISA+MYSPG P+SIFAIGPYAHETQLVSPPVFSTFTTEPSTAP+TPPPESVHLTTPSS
Sbjct: 122 TSISASMYSPG-PASIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSS 180

Query: 263 PEVPFARLLEPNFQSGEGGQRYPLSQYEFQSYQLQXXXXXXXXXXXXXXXXXXXXXXPFP 84
           PEVPFA+LL PN Q GEG QR+P+S YEFQSYQL                       PF 
Sbjct: 181 PEVPFAQLLGPNLQYGEGVQRFPISHYEFQSYQLHPGSPVGQLISPSSGISGSGTSSPFR 240

Query: 83  DLECAPG-HPLFVEFRTGNHTKFLNLDK 3
           D E A   H  F EFR G+  K LNLDK
Sbjct: 241 DGEFAASLH--FPEFRMGDPPKLLNLDK 266


>ref|XP_002509822.1| conserved hypothetical protein [Ricinus communis]
           gi|223549721|gb|EEF51209.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 459

 Score =  283 bits (723), Expect = 1e-73
 Identities = 153/272 (56%), Positives = 179/272 (65%), Gaps = 8/272 (2%)
 Frame = -3

Query: 794 MSRGVNG-------TDALETINAAASVIASAENRVPQVSAQKRRWRSYWSLFWCFGSDKN 636
           M R VNG        +AL+TINAAASVIASAENRVPQ + QKRRW S WS++WCFG  ++
Sbjct: 1   MMRNVNGGADSRPSNNALDTINAAASVIASAENRVPQATIQKRRWGSCWSVYWCFGYHRH 60

Query: 635 KQRIGHAVLVPETPTTRAEGPTAENPS-QPLSIXXXXXXXXXXXXXXXXXXXXSATQSPT 459
           ++RIGHAVLVPE      +   AENP+ Q  +I                    SA+QSP 
Sbjct: 61  RKRIGHAVLVPENSAPGNDSSAAENPTTQAPTITLPFVAPPSSPASFLQSEPPSASQSPA 120

Query: 458 GLISLTSISANMYSPGGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPYTPPPESVHL 279
           G++SLTS+SA+MYSP GP+SIFAIGPYAHETQLVSPP FSTFTTEPSTAP+TPPPESV L
Sbjct: 121 GILSLTSVSASMYSPSGPASIFAIGPYAHETQLVSPPAFSTFTTEPSTAPFTPPPESVQL 180

Query: 278 TTPSSPEVPFARLLEPNFQSGEGGQRYPLSQYEFQSYQLQXXXXXXXXXXXXXXXXXXXX 99
           TTPSSPEVPFA+LLEP+ ++GE G R+P S YEFQSYQ                      
Sbjct: 181 TTPSSPEVPFAQLLEPSNRNGEAGLRFPFSNYEFQSYQFYPGSPVGQLISPSSGISGSGT 240

Query: 98  XXPFPDLECAPGHPLFVEFRTGNHTKFLNLDK 3
             PFPD E A   P F+EF+     K LNLDK
Sbjct: 241 SSPFPDGEFAAAGPRFLEFQMAVPPKLLNLDK 272


>ref|XP_002270742.2| PREDICTED: uncharacterized protein LOC100241023 [Vitis vinifera]
          Length = 479

 Score =  281 bits (718), Expect = 5e-73
 Identities = 156/268 (58%), Positives = 177/268 (66%), Gaps = 6/268 (2%)
 Frame = -3

Query: 788 RGVNG-----TDALETINAAASVIASAENRVPQVSAQKRRWRSYWSLFWCFGSDKNKQRI 624
           R VNG       ALETINAAA+ IASAENRVPQ + QKRRW S W  +WCF S K+K RI
Sbjct: 2   RSVNGDTRSMNSALETINAAATAIASAENRVPQPTVQKRRWGSCWGEYWCFRSPKDK-RI 60

Query: 623 GHAVLVPETPTTRAEGPTAENPSQPLSIXXXXXXXXXXXXXXXXXXXXSATQSPTGLISL 444
           GHAVL PE+    +  P AEN +Q  +I                    SATQSP+GL+SL
Sbjct: 61  GHAVLAPESRAPGSGVPAAENLTQAPTIVLPFVAPPSSPASFLQSEPPSATQSPSGLLSL 120

Query: 443 TSISANMYSPGGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPYTPPPESVHLTTPSS 264
           TSI+AN+YSPGGP+SIFAIGPYAHETQLVSPPVFSTFTTEPSTAP+TPPPESVHLTTPSS
Sbjct: 121 TSINANIYSPGGPASIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSS 180

Query: 263 PEVPFARLLEPNFQSGEGGQRYPLSQYEFQSYQLQXXXXXXXXXXXXXXXXXXXXXXPFP 84
           PEVPFA+L +PN ++GE G R+ LSQYEFQSYQL                       PFP
Sbjct: 181 PEVPFAQLFDPNNRNGEAGHRFLLSQYEFQSYQLYPGSPVGHLISPSSGISGSGTSSPFP 240

Query: 83  DLE-CAPGHPLFVEFRTGNHTKFLNLDK 3
           D +    G   F+EFR G   K L LDK
Sbjct: 241 DRDFVCSGSSQFLEFRAGGPPKLLTLDK 268


>ref|XP_012439689.1| PREDICTED: uncharacterized protein LOC105765237 isoform X4
           [Gossypium raimondii] gi|763785056|gb|KJB52127.1|
           hypothetical protein B456_008G247900 [Gossypium
           raimondii]
          Length = 456

 Score =  280 bits (715), Expect = 1e-72
 Identities = 157/267 (58%), Positives = 179/267 (67%), Gaps = 5/267 (1%)
 Frame = -3

Query: 788 RGVNG-----TDALETINAAASVIASAENRVPQVSAQKRRWRSYWSLFWCFGSDKNKQRI 624
           RG NG      + LETI+AAA+ IASAENRVPQ + QKR W  +WS +WCFGS K K+RI
Sbjct: 2   RGANGESRAMNNPLETIHAAANAIASAENRVPQSTVQKR-WGGWWSKYWCFGSYKQKKRI 60

Query: 623 GHAVLVPETPTTRAEGPTAENPSQPLSIXXXXXXXXXXXXXXXXXXXXSATQSPTGLISL 444
           G AV V ET ++RA  P AE P+Q +++                    SATQSP GL+SL
Sbjct: 61  GPAVPVSETSSSRANMPAAEIPTQAVTVTLPFVAPPSSPASFLPSEPPSATQSPAGLVSL 120

Query: 443 TSISANMYSPGGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPYTPPPESVHLTTPSS 264
           TSISA+MYSPG P+SIFAIGPYAHETQLVSPPVFSTFTTEPSTAP+TPPPESVHLTTPSS
Sbjct: 121 TSISASMYSPG-PASIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSS 179

Query: 263 PEVPFARLLEPNFQSGEGGQRYPLSQYEFQSYQLQXXXXXXXXXXXXXXXXXXXXXXPFP 84
           PEVPFA+ L PN Q GE GQR+P+ QYEFQSYQL                       PFP
Sbjct: 180 PEVPFAQFLGPNLQYGEAGQRFPIYQYEFQSYQLHPGSPIGQLISPSSGISGSGTSSPFP 239

Query: 83  DLECAPGHPLFVEFRTGNHTKFLNLDK 3
           D + A G   F EFR G+  K LNLDK
Sbjct: 240 DGDFAAG-LRFPEFRMGDPPKLLNLDK 265


>ref|XP_011080935.1| PREDICTED: uncharacterized protein LOC105164075 isoform X1 [Sesamum
           indicum]
          Length = 444

 Score =  279 bits (714), Expect = 1e-72
 Identities = 154/263 (58%), Positives = 178/263 (67%)
 Frame = -3

Query: 791 SRGVNGTDALETINAAASVIASAENRVPQVSAQKRRWRSYWSLFWCFGSDKNKQRIGHAV 612
           S G+NGTD+LETINAAA+ I   E+RV + S QKRRW S WSL+ CFGS K+ +RIG AV
Sbjct: 3   SGGINGTDSLETINAAAAAI---ESRVRRASVQKRRWGSCWSLYRCFGSYKHNKRIGRAV 59

Query: 611 LVPETPTTRAEGPTAENPSQPLSIXXXXXXXXXXXXXXXXXXXXSATQSPTGLISLTSIS 432
           +VPET  +  + PTAE+P +P  +                    S+ QSPTG++SLTS+S
Sbjct: 60  IVPETSASVMDVPTAEHPPRPPPLELPFVVPPSSPASFLPSDPPSSAQSPTGVLSLTSVS 119

Query: 431 ANMYSPGGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPYTPPPESVHLTTPSSPEVP 252
           ANMYSPGGP SIFAIGPYAHETQLVSPPVFSTF TEPSTAPYT PPESVHLTTPSSPEVP
Sbjct: 120 ANMYSPGGPPSIFAIGPYAHETQLVSPPVFSTFATEPSTAPYT-PPESVHLTTPSSPEVP 178

Query: 251 FARLLEPNFQSGEGGQRYPLSQYEFQSYQLQXXXXXXXXXXXXXXXXXXXXXXPFPDLEC 72
           F+RLLEP  Q+GE  QRY  SQYEFQSYQLQ                      P P+LE 
Sbjct: 179 FSRLLEPTLQNGEACQRYGFSQYEFQSYQLQ-----PGSPVSHLISPSSGTSSPLPELEF 233

Query: 71  APGHPLFVEFRTGNHTKFLNLDK 3
           A G P  + F TG+  K L+LDK
Sbjct: 234 ATGIPFLLGFTTGHPPKLLDLDK 256


>gb|KHG14068.1| hypothetical protein F383_19964 [Gossypium arboreum]
          Length = 457

 Score =  278 bits (711), Expect = 3e-72
 Identities = 154/267 (57%), Positives = 179/267 (67%), Gaps = 5/267 (1%)
 Frame = -3

Query: 788 RGVNG-----TDALETINAAASVIASAENRVPQVSAQKRRWRSYWSLFWCFGSDKNKQRI 624
           RG NG      + LETI+AAA+ IASAENRVPQ + QK+RW  +WS +WCF S K K+RI
Sbjct: 2   RGANGESRAMNNPLETIHAAANAIASAENRVPQSTVQKKRWGGWWSKYWCFRSYKQKKRI 61

Query: 623 GHAVLVPETPTTRAEGPTAENPSQPLSIXXXXXXXXXXXXXXXXXXXXSATQSPTGLISL 444
           G AV V ET ++RA  P AE P+Q +++                    SATQSP GL+SL
Sbjct: 62  GPAVPVSETTSSRANIPAAEIPTQAVTVTLPFVAPPSSPASFLPSEPPSATQSPAGLVSL 121

Query: 443 TSISANMYSPGGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPYTPPPESVHLTTPSS 264
           TSISA+MYSPG P+SIFAIGPYAHETQLVSPPVFSTFTTEPSTAP+TPPPESVHLTTPSS
Sbjct: 122 TSISASMYSPG-PASIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSS 180

Query: 263 PEVPFARLLEPNFQSGEGGQRYPLSQYEFQSYQLQXXXXXXXXXXXXXXXXXXXXXXPFP 84
           PEVPFA+ L PN Q GE GQR+P+ QYEFQSYQL                       PFP
Sbjct: 181 PEVPFAQFLGPNLQYGEAGQRFPIYQYEFQSYQLHPGSPIGQLISPSSGISGSGTSSPFP 240

Query: 83  DLECAPGHPLFVEFRTGNHTKFLNLDK 3
           D + A G   F EFR G+  K L+L+K
Sbjct: 241 DGDFASG-LRFPEFRMGDPPKLLSLEK 266


>ref|XP_011022665.1| PREDICTED: uncharacterized protein LOC105124362 isoform X2 [Populus
           euphratica]
          Length = 454

 Score =  276 bits (707), Expect = 9e-72
 Identities = 150/265 (56%), Positives = 171/265 (64%), Gaps = 2/265 (0%)
 Frame = -3

Query: 791 SRGVNGTDALETINAAASVIASAENRVPQVSAQKRRWRSYWSLFWCFGSDKNKQRIGHAV 612
           SR  N T  LETINAAA+ IASAENRVPQ + QKRRWRS WS++WCFG  K+K +IGHAV
Sbjct: 8   SRAANNT--LETINAAATAIASAENRVPQATVQKRRWRSRWSMYWCFGYQKHKSQIGHAV 65

Query: 611 LVPETPTTRAEGPTAENPSQPLSIXXXXXXXXXXXXXXXXXXXXSATQSPTGLISLTSIS 432
           L PE P   +  P AEN +Q   +                    S TQSP GL+S TSIS
Sbjct: 66  LFPEPPAPGSGAPAAENSAQAPEVTFPFVAPPSSPASFFQSEPPSVTQSPAGLVSRTSIS 125

Query: 431 ANMYSPGGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPYTPPPESVHLTTPSSPEVP 252
           A+MYSP GP+SIFAIGPYAHETQLVSPPVFSTFTTEPSTAP+TPPPESVHLTTPSSPEVP
Sbjct: 126 ASMYSPSGPASIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVP 185

Query: 251 FARLLEP--NFQSGEGGQRYPLSQYEFQSYQLQXXXXXXXXXXXXXXXXXXXXXXPFPDL 78
           FA+ ++P    ++G  G R+P   ++FQSYQ                        PFPD 
Sbjct: 186 FAQFIDPTATLRNGVTGLRFP---FDFQSYQFHPGSSGGQLISPSSGVSGSGTSSPFPDG 242

Query: 77  ECAPGHPLFVEFRTGNHTKFLNLDK 3
           E A G P F EFR G   K LNLDK
Sbjct: 243 EFAVGGPHFPEFRMGEPPKLLNLDK 267


>ref|XP_002298027.2| hypothetical protein POPTR_0001s09590g [Populus trichocarpa]
           gi|550346901|gb|EEE82832.2| hypothetical protein
           POPTR_0001s09590g [Populus trichocarpa]
          Length = 453

 Score =  274 bits (701), Expect = 4e-71
 Identities = 148/267 (55%), Positives = 173/267 (64%), Gaps = 5/267 (1%)
 Frame = -3

Query: 788 RGVNG-----TDALETINAAASVIASAENRVPQVSAQKRRWRSYWSLFWCFGSDKNKQRI 624
           RG NG      + LETINAAA+ IASAENRVPQ + QKRRW S WS++ CFG  K+K++I
Sbjct: 2   RGFNGESRAANNTLETINAAATAIASAENRVPQATVQKRRWGSCWSIYLCFGYQKHKKQI 61

Query: 623 GHAVLVPETPTTRAEGPTAENPSQPLSIXXXXXXXXXXXXXXXXXXXXSATQSPTGLISL 444
           GHAVL PE        P +ENP+Q  ++                    S TQSP GL+SL
Sbjct: 62  GHAVLFPEPSAPGNGAPASENPTQAPAVTLPFAAPPSSPASFFQSEPPSVTQSPAGLVSL 121

Query: 443 TSISANMYSPGGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPYTPPPESVHLTTPSS 264
           TSISA+MYSP GP+SIFAIGPYAHETQLVSPPVFSTFTTEPSTAP+TPPPESVHLTTPSS
Sbjct: 122 TSISASMYSPSGPASIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSS 181

Query: 263 PEVPFARLLEPNFQSGEGGQRYPLSQYEFQSYQLQXXXXXXXXXXXXXXXXXXXXXXPFP 84
           PEVPFA+ L+P+ ++G+ G R+P   ++FQSYQ                        PFP
Sbjct: 182 PEVPFAQFLDPSLRNGDTGLRFP---FDFQSYQFHPGSPVGQLISPSSGISGSGTSSPFP 238

Query: 83  DLECAPGHPLFVEFRTGNHTKFLNLDK 3
           D E A G   F EFR G   K LNLDK
Sbjct: 239 DGEFAVGGAHFPEFRIGEPPKLLNLDK 265


>ref|XP_012836270.1| PREDICTED: COPII coat assembly protein sec16 [Erythranthe guttatus]
           gi|604334238|gb|EYU38335.1| hypothetical protein
           MIMGU_mgv1a007082mg [Erythranthe guttata]
          Length = 420

 Score =  274 bits (700), Expect = 6e-71
 Identities = 160/259 (61%), Positives = 174/259 (67%), Gaps = 4/259 (1%)
 Frame = -3

Query: 794 MSRGVN-GTDALETINAAASVIASAENRVPQVSA-QKRRWRSYWSLFWCFGSDKNKQRIG 621
           M RGVN GTDALETI+AAAS IASAE      S+ QKRRWRS+WSL+WCF  + NK RIG
Sbjct: 1   MRRGVNNGTDALETISAAASAIASAEAHGAHASSLQKRRWRSFWSLYWCFRPNNNK-RIG 59

Query: 620 HAVLVPETPTT-RAEGPTAENPSQPLSIXXXXXXXXXXXXXXXXXXXXSATQSPTGLISL 444
           HAVLV ET ++  A  PTAE P QP SI                    S+TQSPTGL+SL
Sbjct: 60  HAVLVTETSSSDTAYTPTAERPFQPPSIVLPFTAPPSSPASFIPSEPPSSTQSPTGLLSL 119

Query: 443 TSISANMYSPGGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPYTPPPE-SVHLTTPS 267
           +S S N+YSP GP+SIFAIGPYAHETQLVSPPVFSTFTTEPSTAPYTPPPE S HLTTPS
Sbjct: 120 SSPSGNIYSPSGPASIFAIGPYAHETQLVSPPVFSTFTTEPSTAPYTPPPEFSAHLTTPS 179

Query: 266 SPEVPFARLLEPNFQSGEGGQRYPLSQYEFQSYQLQXXXXXXXXXXXXXXXXXXXXXXPF 87
           SPEVPFARLLEPN       QRYPLSQYEFQSYQLQ                      PF
Sbjct: 180 SPEVPFARLLEPN-------QRYPLSQYEFQSYQLQPGSPVSHLISPCSGISGSGASSPF 232

Query: 86  PDLECAPGHPLFVEFRTGN 30
            D + A  HP F+EF  GN
Sbjct: 233 LDRDFAAVHPFFLEFGGGN 251


>ref|XP_002304504.1| hypothetical protein POPTR_0003s12950g [Populus trichocarpa]
           gi|222841936|gb|EEE79483.1| hypothetical protein
           POPTR_0003s12950g [Populus trichocarpa]
          Length = 441

 Score =  273 bits (698), Expect = 1e-70
 Identities = 148/263 (56%), Positives = 172/263 (65%)
 Frame = -3

Query: 791 SRGVNGTDALETINAAASVIASAENRVPQVSAQKRRWRSYWSLFWCFGSDKNKQRIGHAV 612
           SR  N T  LETINAAA+ IASAENRVPQ   QK+RWRS+WS++WCFG  K+K++IGHAV
Sbjct: 8   SRAANNT--LETINAAATAIASAENRVPQAMVQKQRWRSHWSIYWCFGYQKSKRQIGHAV 65

Query: 611 LVPETPTTRAEGPTAENPSQPLSIXXXXXXXXXXXXXXXXXXXXSATQSPTGLISLTSIS 432
           L PE+    +  P AEN +Q   +                    S TQSP GL+S TSIS
Sbjct: 66  LFPESSAPGSGAPAAENSAQAPEVTFPFVAPPSSPASFFQSEPPSVTQSPAGLVSRTSIS 125

Query: 431 ANMYSPGGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPYTPPPESVHLTTPSSPEVP 252
           A+MYSP GP+SIFAIGPYAHETQLVSPPVFSTFTTEPSTAP+TPPPESVHLTTPSSPEVP
Sbjct: 126 ASMYSPSGPASIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVP 185

Query: 251 FARLLEPNFQSGEGGQRYPLSQYEFQSYQLQXXXXXXXXXXXXXXXXXXXXXXPFPDLEC 72
           FA+L++P  ++G  G R+P   ++FQSYQ                        PFPD E 
Sbjct: 186 FAQLIDPTLRNGVTGLRFP---FDFQSYQFHPGSSVGQLISPSSGISGSGTSSPFPDGEF 242

Query: 71  APGHPLFVEFRTGNHTKFLNLDK 3
           A G P   EFR G   K LNLDK
Sbjct: 243 AVGGPHSPEFRMG--PKLLNLDK 263

