BLASTX nr result

ID: Forsythia23_contig00007663 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Forsythia23_contig00007663
         (823 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011072884.1| PREDICTED: uncharacterized protein LOC105157...   342   2e-91
emb|CDP12040.1| unnamed protein product [Coffea canephora]            320   9e-85
ref|XP_008238921.1| PREDICTED: uncharacterized protein LOC103337...   316   1e-83
ref|XP_007209129.1| hypothetical protein PRUPE_ppa005552mg [Prun...   316   1e-83
ref|XP_012086872.1| PREDICTED: uncharacterized protein LOC105645...   310   9e-82
ref|XP_006476541.1| PREDICTED: uncharacterized protein LOC102626...   302   1e-79
gb|KDO76253.1| hypothetical protein CISIN_1g012593mg [Citrus sin...   300   6e-79
ref|XP_011080935.1| PREDICTED: uncharacterized protein LOC105164...   299   1e-78
ref|XP_006439523.1| hypothetical protein CICLE_v10020073mg [Citr...   299   1e-78
ref|XP_010089083.1| hypothetical protein L484_024256 [Morus nota...   298   3e-78
ref|XP_002509822.1| conserved hypothetical protein [Ricinus comm...   296   1e-77
ref|XP_002270742.2| PREDICTED: uncharacterized protein LOC100241...   293   9e-77
ref|XP_012439688.1| PREDICTED: uncharacterized protein LOC105765...   293   1e-76
ref|XP_012836270.1| PREDICTED: COPII coat assembly protein sec16...   291   3e-76
ref|XP_007040283.1| Hydroxyproline-rich glycoprotein family prot...   291   3e-76
ref|XP_012439689.1| PREDICTED: uncharacterized protein LOC105765...   287   5e-75
gb|KHG14068.1| hypothetical protein F383_19964 [Gossypium arboreum]   286   1e-74
ref|XP_002298027.2| hypothetical protein POPTR_0001s09590g [Popu...   285   2e-74
ref|XP_009589061.1| PREDICTED: uncharacterized protein At1g76660...   284   5e-74
ref|XP_011029307.1| PREDICTED: uncharacterized protein LOC105129...   283   7e-74

>ref|XP_011072884.1| PREDICTED: uncharacterized protein LOC105157994 [Sesamum indicum]
          Length = 455

 Score =  342 bits (876), Expect = 2e-91
 Identities = 176/260 (67%), Positives = 192/260 (73%)
 Frame = -3

Query: 785 GVNGTDALETINAAASVIASAENRVPQVSAQKRRWRSYWSLFWCFGSDNNKQRIGHAVLV 606
           GVNG DALETINAAA+ IAS E R  Q S QKRRW  +WSL+WCFGS+  K RIGHAVLV
Sbjct: 5   GVNGADALETINAAATAIASVETRGLQDSVQKRRWGRWWSLYWCFGSNKTK-RIGHAVLV 63

Query: 605 PETPTTRADGPTAENPPQPLSIXXXXXXXXXXXXXXXXXXXXSATQSPTGLISLTSISAN 426
           PET    AD PTAE+P QP SI                    SA QSPTGL+SLTS+SA+
Sbjct: 64  PETTAAGADAPTAEHPAQPPSIVLPFVAPPSSPASFLPSEPPSAAQSPTGLLSLTSVSAS 123

Query: 425 MYSPGGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPYTPPPESVHLTTPSSPEVPFA 246
           MYSPGGP+SIFAIGPYAHETQLVSPPVFSTFTTEPSTAPYTPPPESVHLTTPSSPEVPFA
Sbjct: 124 MYSPGGPASIFAIGPYAHETQLVSPPVFSTFTTEPSTAPYTPPPESVHLTTPSSPEVPFA 183

Query: 245 RLLEPSFQSGEGGQRYPLSQYEFQSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDLECTP 66
           RLLEP+ Q+ E GQR+ +S YEFQSYQLQPGSPVSHL              PFPD E   
Sbjct: 184 RLLEPNLQNSEAGQRFHISPYEFQSYQLQPGSPVSHLISPSSGISGSGTSSPFPDREFAA 243

Query: 65  GHPLFVEFRTGNHTQFLNLD 6
           GHP F+EFRTGN  + L+LD
Sbjct: 244 GHPFFLEFRTGNPPKLLDLD 263


>emb|CDP12040.1| unnamed protein product [Coffea canephora]
          Length = 466

 Score =  320 bits (819), Expect = 9e-85
 Identities = 166/260 (63%), Positives = 188/260 (72%)
 Frame = -3

Query: 782 VNGTDALETINAAASVIASAENRVPQVSAQKRRWRSYWSLFWCFGSDNNKQRIGHAVLVP 603
           VN T  LETINAAA+ IA+AENRVPQV  QKRRW S WSL+WCFGS  + +RIGHAVLVP
Sbjct: 16  VNNT--LETINAAANAIAAAENRVPQVGVQKRRWASCWSLYWCFGSYKHTKRIGHAVLVP 73

Query: 602 ETPTTRADGPTAENPPQPLSIXXXXXXXXXXXXXXXXXXXXSATQSPTGLISLTSISANM 423
           E    RAD P  EN  Q  S+                    SATQSP GL+SLTS+SA+M
Sbjct: 74  EPIAPRADPPAVENQTQAASVALPFIAPPSSPASFLQSEPPSATQSPPGLLSLTSMSASM 133

Query: 422 YSPGGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPYTPPPESVHLTTPSSPEVPFAR 243
           YSPGGP+S+FAIGPYAHETQLV+PPVFSTFTTEPSTAP+TPPPESVH+TTPSSPEVPFAR
Sbjct: 134 YSPGGPASMFAIGPYAHETQLVTPPVFSTFTTEPSTAPFTPPPESVHMTTPSSPEVPFAR 193

Query: 242 LLEPSFQSGEGGQRYPLSQYEFQSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDLECTPG 63
           LL+P  Q+ + GQRYPL QYEFQSYQLQPGSP SHL              PFPD E   G
Sbjct: 194 LLDPIDQNCQDGQRYPLPQYEFQSYQLQPGSPASHLISPSSGISGSGTSSPFPDGEFVYG 253

Query: 62  HPLFVEFRTGNHTQFLNLDK 3
            P F+EFR+G+  + LNL+K
Sbjct: 254 RPHFLEFRSGDPPKLLNLEK 273


>ref|XP_008238921.1| PREDICTED: uncharacterized protein LOC103337539 [Prunus mume]
          Length = 455

 Score =  316 bits (810), Expect = 1e-83
 Identities = 164/257 (63%), Positives = 183/257 (71%)
 Frame = -3

Query: 776 GTDALETINAAASVIASAENRVPQVSAQKRRWRSYWSLFWCFGSDNNKQRIGHAVLVPET 597
           G +ALETINAAAS IA+AENRVPQ + QKRRW S+WS++WCFG   +K+RIGHAVLVPET
Sbjct: 11  GNNALETINAAASAIAAAENRVPQATVQKRRWGSWWSMYWCFGFQRHKKRIGHAVLVPET 70

Query: 596 PTTRADGPTAENPPQPLSIXXXXXXXXXXXXXXXXXXXXSATQSPTGLISLTSISANMYS 417
                D P AENP Q  SI                    SATQSP G  SLT   A+MYS
Sbjct: 71  TDRGGDAPRAENPIQTPSIVLPFVAPPSSPASFLQSEPPSATQSPAGFFSLT---ASMYS 127

Query: 416 PGGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPYTPPPESVHLTTPSSPEVPFARLL 237
           P GP+SIFAIGPYAHETQLVSPPVFSTFTTEPSTAP+TPPPESVHLTTPSSPEVPFA+LL
Sbjct: 128 PSGPTSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLL 187

Query: 236 EPSFQSGEGGQRYPLSQYEFQSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDLECTPGHP 57
           +P F++GEGGQR+PLS YEFQSYQL PGSPV  L              PFPDLE      
Sbjct: 188 DPQFRNGEGGQRFPLSHYEFQSYQLYPGSPVGQLISPSSGISGSGTSSPFPDLEFAARGH 247

Query: 56  LFVEFRTGNHTQFLNLD 6
            F+EFRTG+  + LNLD
Sbjct: 248 HFLEFRTGDPPKLLNLD 264


>ref|XP_007209129.1| hypothetical protein PRUPE_ppa005552mg [Prunus persica]
           gi|462404864|gb|EMJ10328.1| hypothetical protein
           PRUPE_ppa005552mg [Prunus persica]
          Length = 455

 Score =  316 bits (809), Expect = 1e-83
 Identities = 164/257 (63%), Positives = 183/257 (71%)
 Frame = -3

Query: 776 GTDALETINAAASVIASAENRVPQVSAQKRRWRSYWSLFWCFGSDNNKQRIGHAVLVPET 597
           G +ALETINAAAS IA+AENRVPQ + QKRRW S+WS++WCFG   +K+RIGHAVLVPET
Sbjct: 11  GNNALETINAAASAIAAAENRVPQATVQKRRWGSWWSMYWCFGFQRHKKRIGHAVLVPET 70

Query: 596 PTTRADGPTAENPPQPLSIXXXXXXXXXXXXXXXXXXXXSATQSPTGLISLTSISANMYS 417
                D P AENP Q  SI                    SATQSP G  SLT   A+MYS
Sbjct: 71  TDRGGDAPRAENPIQTPSIVLPFVAPPSSPASFLQSEPPSATQSPAGFFSLT---ASMYS 127

Query: 416 PGGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPYTPPPESVHLTTPSSPEVPFARLL 237
           P GP+SIFAIGPYAHETQLVSPPVFSTFTTEPSTAP+TPPPESVHLTTPSSPEVPFA+LL
Sbjct: 128 PSGPTSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLL 187

Query: 236 EPSFQSGEGGQRYPLSQYEFQSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDLECTPGHP 57
           +P F++GEGGQR+PLS YEFQSYQL PGSPV  L              PFPDLE      
Sbjct: 188 DPHFRNGEGGQRFPLSHYEFQSYQLYPGSPVGQLISPSSGISGSGTSSPFPDLEFAARGH 247

Query: 56  LFVEFRTGNHTQFLNLD 6
            F+EFRTG+  + LNLD
Sbjct: 248 HFLEFRTGDPPKLLNLD 264


>ref|XP_012086872.1| PREDICTED: uncharacterized protein LOC105645786 [Jatropha curcas]
           gi|643711987|gb|KDP25415.1| hypothetical protein
           JCGZ_20571 [Jatropha curcas]
          Length = 455

 Score =  310 bits (793), Expect = 9e-82
 Identities = 166/267 (62%), Positives = 188/267 (70%), Gaps = 5/267 (1%)
 Frame = -3

Query: 788 RGVNG-----TDALETINAAASVIASAENRVPQVSAQKRRWRSYWSLFWCFGSDNNKQRI 624
           R VNG      +AL+TINAAAS IASAENRVPQ + QKRRW S +S++WCFG + +++RI
Sbjct: 2   RAVNGDSRPSNNALDTINAAASAIASAENRVPQATVQKRRWGSCFSVYWCFGYNRHRKRI 61

Query: 623 GHAVLVPETPTTRADGPTAENPPQPLSIXXXXXXXXXXXXXXXXXXXXSATQSPTGLISL 444
           GHAVLVPETP  R D   AEN  Q  +I                    SA+QSPTG++SL
Sbjct: 62  GHAVLVPETPGPRNDSSAAENSTQTPTITLPFVAPPSSPASFLQSEPPSASQSPTGVLSL 121

Query: 443 TSISANMYSPGGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPYTPPPESVHLTTPSS 264
           TSISANMYSP GPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAP+TPPPESVHLTTPSS
Sbjct: 122 TSISANMYSPSGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSS 181

Query: 263 PEVPFARLLEPSFQSGEGGQRYPLSQYEFQSYQLQPGSPVSHLXXXXXXXXXXXXXXPFP 84
           PEVPFA+LL+PS ++ E G R+PLS YEFQSYQL PGSPV  L              PFP
Sbjct: 182 PEVPFAQLLDPSIRNVEAGLRFPLSNYEFQSYQLYPGSPVGQLISPSSGISGSGTSSPFP 241

Query: 83  DLECTPGHPLFVEFRTGNHTQFLNLDK 3
           D E   G   F+EFR G   + LNLDK
Sbjct: 242 DGEFAAG---FLEFRMGEPPKLLNLDK 265


>ref|XP_006476541.1| PREDICTED: uncharacterized protein LOC102626793 [Citrus sinensis]
          Length = 460

 Score =  302 bits (774), Expect = 1e-79
 Identities = 159/268 (59%), Positives = 182/268 (67%), Gaps = 6/268 (2%)
 Frame = -3

Query: 788 RGVNGTDA------LETINAAASVIASAENRVPQVSAQKRRWRSYWSLFWCFGSDNNKQR 627
           RGVNG D+      LETINAAA+ IASAENRV Q ++QKRRW   WS+ WCFG   +++R
Sbjct: 2   RGVNGGDSRALNNSLETINAAATAIASAENRVHQATSQKRRWGGCWSISWCFGFQKHRKR 61

Query: 626 IGHAVLVPETPTTRADGPTAENPPQPLSIXXXXXXXXXXXXXXXXXXXXSATQSPTGLIS 447
           IGHAVLVPE   +R++   A N  Q  +I                    SATQSP GL+S
Sbjct: 62  IGHAVLVPEPTASRSNASEAVNSTQAAAISLPFVAPPSSPASFLQSEPPSATQSPAGLVS 121

Query: 446 LTSISANMYSPGGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPYTPPPESVHLTTPS 267
           L SIS NMYSPGGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAP+TPPPESVHLTTPS
Sbjct: 122 LNSISGNMYSPGGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPS 181

Query: 266 SPEVPFARLLEPSFQSGEGGQRYPLSQYEFQSYQLQPGSPVSHLXXXXXXXXXXXXXXPF 87
           SPEVPFA+LL+PS + GE GQ++P S YEFQSY L PGSPV +L              PF
Sbjct: 182 SPEVPFAQLLDPSLRFGEQGQKFPFSYYEFQSYHLHPGSPVGNLISPSSGISGSGTSSPF 241

Query: 86  PDLECTPGHPLFVEFRTGNHTQFLNLDK 3
           PD E     P F +F  G+  + LNLDK
Sbjct: 242 PDGEFATAGPQFPDFHRGDPPKLLNLDK 269


>gb|KDO76253.1| hypothetical protein CISIN_1g012593mg [Citrus sinensis]
          Length = 460

 Score =  300 bits (769), Expect = 6e-79
 Identities = 158/268 (58%), Positives = 182/268 (67%), Gaps = 6/268 (2%)
 Frame = -3

Query: 788 RGVNGTDA------LETINAAASVIASAENRVPQVSAQKRRWRSYWSLFWCFGSDNNKQR 627
           RGVNG D+      LETI+AAA+ IASAENRV Q ++QKRRW   WS+ WCFG   +++R
Sbjct: 2   RGVNGGDSRALNNSLETISAAATAIASAENRVHQATSQKRRWGGCWSISWCFGFQKHRKR 61

Query: 626 IGHAVLVPETPTTRADGPTAENPPQPLSIXXXXXXXXXXXXXXXXXXXXSATQSPTGLIS 447
           IGHAVLVPE   +R++   A N  Q  +I                    SATQSP GL+S
Sbjct: 62  IGHAVLVPEPTASRSNASEAVNSTQAAAISLPFVAPPSSPASFLQSEPPSATQSPAGLVS 121

Query: 446 LTSISANMYSPGGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPYTPPPESVHLTTPS 267
           L SIS NMYSPGGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAP+TPPPESVHLTTPS
Sbjct: 122 LNSISGNMYSPGGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPS 181

Query: 266 SPEVPFARLLEPSFQSGEGGQRYPLSQYEFQSYQLQPGSPVSHLXXXXXXXXXXXXXXPF 87
           SPEVPFA+LL+PS + GE GQ++P S YEFQSY L PGSPV +L              PF
Sbjct: 182 SPEVPFAQLLDPSLRFGEQGQKFPFSYYEFQSYHLHPGSPVGNLISPSSGISGSGTSSPF 241

Query: 86  PDLECTPGHPLFVEFRTGNHTQFLNLDK 3
           PD E     P F +F  G+  + LNLDK
Sbjct: 242 PDGEFATAGPQFPDFHRGDPPKLLNLDK 269


>ref|XP_011080935.1| PREDICTED: uncharacterized protein LOC105164075 isoform X1 [Sesamum
           indicum]
          Length = 444

 Score =  299 bits (766), Expect = 1e-78
 Identities = 161/263 (61%), Positives = 186/263 (70%)
 Frame = -3

Query: 791 SRGVNGTDALETINAAASVIASAENRVPQVSAQKRRWRSYWSLFWCFGSDNNKQRIGHAV 612
           S G+NGTD+LETINAAA+ I   E+RV + S QKRRW S WSL+ CFGS  + +RIG AV
Sbjct: 3   SGGINGTDSLETINAAAAAI---ESRVRRASVQKRRWGSCWSLYRCFGSYKHNKRIGRAV 59

Query: 611 LVPETPTTRADGPTAENPPQPLSIXXXXXXXXXXXXXXXXXXXXSATQSPTGLISLTSIS 432
           +VPET  +  D PTAE+PP+P  +                    S+ QSPTG++SLTS+S
Sbjct: 60  IVPETSASVMDVPTAEHPPRPPPLELPFVVPPSSPASFLPSDPPSSAQSPTGVLSLTSVS 119

Query: 431 ANMYSPGGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPYTPPPESVHLTTPSSPEVP 252
           ANMYSPGGP SIFAIGPYAHETQLVSPPVFSTF TEPSTAPYT PPESVHLTTPSSPEVP
Sbjct: 120 ANMYSPGGPPSIFAIGPYAHETQLVSPPVFSTFATEPSTAPYT-PPESVHLTTPSSPEVP 178

Query: 251 FARLLEPSFQSGEGGQRYPLSQYEFQSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDLEC 72
           F+RLLEP+ Q+GE  QRY  SQYEFQSYQLQPGSPVSHL              P P+LE 
Sbjct: 179 FSRLLEPTLQNGEACQRYGFSQYEFQSYQLQPGSPVSHL-----ISPSSGTSSPLPELEF 233

Query: 71  TPGHPLFVEFRTGNHTQFLNLDK 3
             G P  + F TG+  + L+LDK
Sbjct: 234 ATGIPFLLGFTTGHPPKLLDLDK 256


>ref|XP_006439523.1| hypothetical protein CICLE_v10020073mg [Citrus clementina]
           gi|557541785|gb|ESR52763.1| hypothetical protein
           CICLE_v10020073mg [Citrus clementina]
          Length = 460

 Score =  299 bits (766), Expect = 1e-78
 Identities = 157/268 (58%), Positives = 182/268 (67%), Gaps = 6/268 (2%)
 Frame = -3

Query: 788 RGVNGTDA------LETINAAASVIASAENRVPQVSAQKRRWRSYWSLFWCFGSDNNKQR 627
           RGVNG D+      LETI+AAA+ IASAENRV Q ++QKRRW   W++ WCFG   +++R
Sbjct: 2   RGVNGGDSRALNNSLETISAAATAIASAENRVHQATSQKRRWGGCWNISWCFGFQKHRKR 61

Query: 626 IGHAVLVPETPTTRADGPTAENPPQPLSIXXXXXXXXXXXXXXXXXXXXSATQSPTGLIS 447
           IGHAVLVPE   +R++   A N  Q  +I                    SATQSP GL+S
Sbjct: 62  IGHAVLVPEPTASRSNASEAVNSTQATAISLPFVAPPSSPASFLQSEPPSATQSPAGLVS 121

Query: 446 LTSISANMYSPGGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPYTPPPESVHLTTPS 267
           L SIS NMYSPGGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAP+TPPPESVHLTTPS
Sbjct: 122 LNSISGNMYSPGGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPS 181

Query: 266 SPEVPFARLLEPSFQSGEGGQRYPLSQYEFQSYQLQPGSPVSHLXXXXXXXXXXXXXXPF 87
           SPEVPFA+LL+PS + GE GQ++P S YEFQSY L PGSPV +L              PF
Sbjct: 182 SPEVPFAQLLDPSLRFGEQGQKFPFSYYEFQSYHLHPGSPVGNLISPSSGISGSGTSSPF 241

Query: 86  PDLECTPGHPLFVEFRTGNHTQFLNLDK 3
           PD E     P F +F  G+  + LNLDK
Sbjct: 242 PDGEFATAGPQFPDFHRGDPPKLLNLDK 269


>ref|XP_010089083.1| hypothetical protein L484_024256 [Morus notabilis]
           gi|587846890|gb|EXB37330.1| hypothetical protein
           L484_024256 [Morus notabilis]
          Length = 455

 Score =  298 bits (763), Expect = 3e-78
 Identities = 155/263 (58%), Positives = 181/263 (68%)
 Frame = -3

Query: 791 SRGVNGTDALETINAAASVIASAENRVPQVSAQKRRWRSYWSLFWCFGSDNNKQRIGHAV 612
           SR +N  +ALETINAAA+ IA AENRVPQ + +KRRW    S++WCFG+  N+ RIGH V
Sbjct: 10  SRTMN--NALETINAAATAIAMAENRVPQATVRKRRWGGCLSIYWCFGTPKNRTRIGHGV 67

Query: 611 LVPETPTTRADGPTAENPPQPLSIXXXXXXXXXXXXXXXXXXXXSATQSPTGLISLTSIS 432
           LVPET       P AEN  Q  ++                    SATQSP GL+SLTS+S
Sbjct: 68  LVPETAQPGNSAPRAENSTQTHAVILPFIAPPSSPASFLQSEPPSATQSPAGLLSLTSVS 127

Query: 431 ANMYSPGGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPYTPPPESVHLTTPSSPEVP 252
           A+MYSPGGP+SIFAIGPYAHETQLVSPPVFSTFTTEPSTAP+TPPPESVHLTTPSSPEVP
Sbjct: 128 ASMYSPGGPASIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVP 187

Query: 251 FARLLEPSFQSGEGGQRYPLSQYEFQSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDLEC 72
           FA+LL+P+  +GE GQR+P+   EFQSY  QPGSP+  L              PFPD E 
Sbjct: 188 FAQLLDPNIHNGEPGQRFPIFHNEFQSYYFQPGSPIGQLISPSSGISGSGTSSPFPDPEF 247

Query: 71  TPGHPLFVEFRTGNHTQFLNLDK 3
               P F+EFRTG+  + LNLDK
Sbjct: 248 AARGPHFLEFRTGDPPKLLNLDK 270


>ref|XP_002509822.1| conserved hypothetical protein [Ricinus communis]
           gi|223549721|gb|EEF51209.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 459

 Score =  296 bits (758), Expect = 1e-77
 Identities = 159/272 (58%), Positives = 182/272 (66%), Gaps = 8/272 (2%)
 Frame = -3

Query: 794 MSRGVNG-------TDALETINAAASVIASAENRVPQVSAQKRRWRSYWSLFWCFGSDNN 636
           M R VNG        +AL+TINAAASVIASAENRVPQ + QKRRW S WS++WCFG   +
Sbjct: 1   MMRNVNGGADSRPSNNALDTINAAASVIASAENRVPQATIQKRRWGSCWSVYWCFGYHRH 60

Query: 635 KQRIGHAVLVPETPTTRADGPTAENPP-QPLSIXXXXXXXXXXXXXXXXXXXXSATQSPT 459
           ++RIGHAVLVPE      D   AENP  Q  +I                    SA+QSP 
Sbjct: 61  RKRIGHAVLVPENSAPGNDSSAAENPTTQAPTITLPFVAPPSSPASFLQSEPPSASQSPA 120

Query: 458 GLISLTSISANMYSPGGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPYTPPPESVHL 279
           G++SLTS+SA+MYSP GP+SIFAIGPYAHETQLVSPP FSTFTTEPSTAP+TPPPESV L
Sbjct: 121 GILSLTSVSASMYSPSGPASIFAIGPYAHETQLVSPPAFSTFTTEPSTAPFTPPPESVQL 180

Query: 278 TTPSSPEVPFARLLEPSFQSGEGGQRYPLSQYEFQSYQLQPGSPVSHLXXXXXXXXXXXX 99
           TTPSSPEVPFA+LLEPS ++GE G R+P S YEFQSYQ  PGSPV  L            
Sbjct: 181 TTPSSPEVPFAQLLEPSNRNGEAGLRFPFSNYEFQSYQFYPGSPVGQLISPSSGISGSGT 240

Query: 98  XXPFPDLECTPGHPLFVEFRTGNHTQFLNLDK 3
             PFPD E     P F+EF+     + LNLDK
Sbjct: 241 SSPFPDGEFAAAGPRFLEFQMAVPPKLLNLDK 272


>ref|XP_002270742.2| PREDICTED: uncharacterized protein LOC100241023 [Vitis vinifera]
          Length = 479

 Score =  293 bits (750), Expect = 9e-77
 Identities = 160/268 (59%), Positives = 182/268 (67%), Gaps = 6/268 (2%)
 Frame = -3

Query: 788 RGVNG-----TDALETINAAASVIASAENRVPQVSAQKRRWRSYWSLFWCFGSDNNKQRI 624
           R VNG       ALETINAAA+ IASAENRVPQ + QKRRW S W  +WCF S  +K RI
Sbjct: 2   RSVNGDTRSMNSALETINAAATAIASAENRVPQPTVQKRRWGSCWGEYWCFRSPKDK-RI 60

Query: 623 GHAVLVPETPTTRADGPTAENPPQPLSIXXXXXXXXXXXXXXXXXXXXSATQSPTGLISL 444
           GHAVL PE+    +  P AEN  Q  +I                    SATQSP+GL+SL
Sbjct: 61  GHAVLAPESRAPGSGVPAAENLTQAPTIVLPFVAPPSSPASFLQSEPPSATQSPSGLLSL 120

Query: 443 TSISANMYSPGGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPYTPPPESVHLTTPSS 264
           TSI+AN+YSPGGP+SIFAIGPYAHETQLVSPPVFSTFTTEPSTAP+TPPPESVHLTTPSS
Sbjct: 121 TSINANIYSPGGPASIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSS 180

Query: 263 PEVPFARLLEPSFQSGEGGQRYPLSQYEFQSYQLQPGSPVSHLXXXXXXXXXXXXXXPFP 84
           PEVPFA+L +P+ ++GE G R+ LSQYEFQSYQL PGSPV HL              PFP
Sbjct: 181 PEVPFAQLFDPNNRNGEAGHRFLLSQYEFQSYQLYPGSPVGHLISPSSGISGSGTSSPFP 240

Query: 83  DLE-CTPGHPLFVEFRTGNHTQFLNLDK 3
           D +    G   F+EFR G   + L LDK
Sbjct: 241 DRDFVCSGSSQFLEFRAGGPPKLLTLDK 268


>ref|XP_012439688.1| PREDICTED: uncharacterized protein LOC105765237 isoform X3
           [Gossypium raimondii] gi|763785055|gb|KJB52126.1|
           hypothetical protein B456_008G247900 [Gossypium
           raimondii]
          Length = 457

 Score =  293 bits (749), Expect = 1e-76
 Identities = 158/267 (59%), Positives = 184/267 (68%), Gaps = 5/267 (1%)
 Frame = -3

Query: 788 RGVNG-----TDALETINAAASVIASAENRVPQVSAQKRRWRSYWSLFWCFGSDNNKQRI 624
           RG NG      + LETI+AAA+ IASAENRVPQ + QK+RW  +WS +WCFGS   K+RI
Sbjct: 2   RGANGESRAMNNPLETIHAAANAIASAENRVPQSTVQKKRWGGWWSKYWCFGSYKQKKRI 61

Query: 623 GHAVLVPETPTTRADGPTAENPPQPLSIXXXXXXXXXXXXXXXXXXXXSATQSPTGLISL 444
           G AV V ET ++RA+ P AE P Q +++                    SATQSP GL+SL
Sbjct: 62  GPAVPVSETSSSRANMPAAEIPTQAVTVTLPFVAPPSSPASFLPSEPPSATQSPAGLVSL 121

Query: 443 TSISANMYSPGGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPYTPPPESVHLTTPSS 264
           TSISA+MYSPG P+SIFAIGPYAHETQLVSPPVFSTFTTEPSTAP+TPPPESVHLTTPSS
Sbjct: 122 TSISASMYSPG-PASIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSS 180

Query: 263 PEVPFARLLEPSFQSGEGGQRYPLSQYEFQSYQLQPGSPVSHLXXXXXXXXXXXXXXPFP 84
           PEVPFA+ L P+ Q GE GQR+P+ QYEFQSYQL PGSP+  L              PFP
Sbjct: 181 PEVPFAQFLGPNLQYGEAGQRFPIYQYEFQSYQLHPGSPIGQLISPSSGISGSGTSSPFP 240

Query: 83  DLECTPGHPLFVEFRTGNHTQFLNLDK 3
           D +   G   F EFR G+  + LNLDK
Sbjct: 241 DGDFAAG-LRFPEFRMGDPPKLLNLDK 266


>ref|XP_012836270.1| PREDICTED: COPII coat assembly protein sec16 [Erythranthe guttatus]
           gi|604334238|gb|EYU38335.1| hypothetical protein
           MIMGU_mgv1a007082mg [Erythranthe guttata]
          Length = 420

 Score =  291 bits (746), Expect = 3e-76
 Identities = 167/259 (64%), Positives = 182/259 (70%), Gaps = 4/259 (1%)
 Frame = -3

Query: 794 MSRGVN-GTDALETINAAASVIASAENRVPQVSA-QKRRWRSYWSLFWCFGSDNNKQRIG 621
           M RGVN GTDALETI+AAAS IASAE      S+ QKRRWRS+WSL+WCF  +NNK RIG
Sbjct: 1   MRRGVNNGTDALETISAAASAIASAEAHGAHASSLQKRRWRSFWSLYWCFRPNNNK-RIG 59

Query: 620 HAVLVPETPTT-RADGPTAENPPQPLSIXXXXXXXXXXXXXXXXXXXXSATQSPTGLISL 444
           HAVLV ET ++  A  PTAE P QP SI                    S+TQSPTGL+SL
Sbjct: 60  HAVLVTETSSSDTAYTPTAERPFQPPSIVLPFTAPPSSPASFIPSEPPSSTQSPTGLLSL 119

Query: 443 TSISANMYSPGGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPYTPPPE-SVHLTTPS 267
           +S S N+YSP GP+SIFAIGPYAHETQLVSPPVFSTFTTEPSTAPYTPPPE S HLTTPS
Sbjct: 120 SSPSGNIYSPSGPASIFAIGPYAHETQLVSPPVFSTFTTEPSTAPYTPPPEFSAHLTTPS 179

Query: 266 SPEVPFARLLEPSFQSGEGGQRYPLSQYEFQSYQLQPGSPVSHLXXXXXXXXXXXXXXPF 87
           SPEVPFARLLEP+       QRYPLSQYEFQSYQLQPGSPVSHL              PF
Sbjct: 180 SPEVPFARLLEPN-------QRYPLSQYEFQSYQLQPGSPVSHLISPCSGISGSGASSPF 232

Query: 86  PDLECTPGHPLFVEFRTGN 30
            D +    HP F+EF  GN
Sbjct: 233 LDRDFAAVHPFFLEFGGGN 251


>ref|XP_007040283.1| Hydroxyproline-rich glycoprotein family protein [Theobroma cacao]
           gi|508777528|gb|EOY24784.1| Hydroxyproline-rich
           glycoprotein family protein [Theobroma cacao]
          Length = 458

 Score =  291 bits (745), Expect = 3e-76
 Identities = 162/268 (60%), Positives = 182/268 (67%), Gaps = 6/268 (2%)
 Frame = -3

Query: 788 RGVNG-----TDALETINAAASVIASAENRVPQVSAQKRRWRSYWSLFWCFGSDNNKQRI 624
           RG NG      + LETI+AAA+ IASAENRVPQ + QKRRW   WS++WCFGS   K+RI
Sbjct: 2   RGANGESIAMNNTLETIHAAANAIASAENRVPQATVQKRRWGGCWSIYWCFGSYKQKKRI 61

Query: 623 GHAVLVPETPTTRADGPTAENPPQPLSIXXXXXXXXXXXXXXXXXXXXSATQSPTGLISL 444
           G AVL  ET  + A+ P AENP Q  +I                    SATQSP GL+SL
Sbjct: 62  GPAVLTSETSFSGANVPAAENPTQAPAIALPFVAPPSSPASFLPSEPPSATQSPAGLVSL 121

Query: 443 TSISANMYSPGGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPYTPPPESVHLTTPSS 264
           TSISA+MYSPG P+SIFAIGPYAHETQLVSPPVFSTFTTEPSTAP+TPPPESVHLTTPSS
Sbjct: 122 TSISASMYSPG-PASIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSS 180

Query: 263 PEVPFARLLEPSFQSGEGGQRYPLSQYEFQSYQLQPGSPVSHLXXXXXXXXXXXXXXPFP 84
           PEVPFA+LL P+ Q GEG QR+P+S YEFQSYQL PGSPV  L              PF 
Sbjct: 181 PEVPFAQLLGPNLQYGEGVQRFPISHYEFQSYQLHPGSPVGQLISPSSGISGSGTSSPFR 240

Query: 83  DLECTPG-HPLFVEFRTGNHTQFLNLDK 3
           D E     H  F EFR G+  + LNLDK
Sbjct: 241 DGEFAASLH--FPEFRMGDPPKLLNLDK 266


>ref|XP_012439689.1| PREDICTED: uncharacterized protein LOC105765237 isoform X4
           [Gossypium raimondii] gi|763785056|gb|KJB52127.1|
           hypothetical protein B456_008G247900 [Gossypium
           raimondii]
          Length = 456

 Score =  287 bits (735), Expect = 5e-75
 Identities = 158/267 (59%), Positives = 183/267 (68%), Gaps = 5/267 (1%)
 Frame = -3

Query: 788 RGVNG-----TDALETINAAASVIASAENRVPQVSAQKRRWRSYWSLFWCFGSDNNKQRI 624
           RG NG      + LETI+AAA+ IASAENRVPQ + QKR W  +WS +WCFGS   K+RI
Sbjct: 2   RGANGESRAMNNPLETIHAAANAIASAENRVPQSTVQKR-WGGWWSKYWCFGSYKQKKRI 60

Query: 623 GHAVLVPETPTTRADGPTAENPPQPLSIXXXXXXXXXXXXXXXXXXXXSATQSPTGLISL 444
           G AV V ET ++RA+ P AE P Q +++                    SATQSP GL+SL
Sbjct: 61  GPAVPVSETSSSRANMPAAEIPTQAVTVTLPFVAPPSSPASFLPSEPPSATQSPAGLVSL 120

Query: 443 TSISANMYSPGGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPYTPPPESVHLTTPSS 264
           TSISA+MYSPG P+SIFAIGPYAHETQLVSPPVFSTFTTEPSTAP+TPPPESVHLTTPSS
Sbjct: 121 TSISASMYSPG-PASIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSS 179

Query: 263 PEVPFARLLEPSFQSGEGGQRYPLSQYEFQSYQLQPGSPVSHLXXXXXXXXXXXXXXPFP 84
           PEVPFA+ L P+ Q GE GQR+P+ QYEFQSYQL PGSP+  L              PFP
Sbjct: 180 PEVPFAQFLGPNLQYGEAGQRFPIYQYEFQSYQLHPGSPIGQLISPSSGISGSGTSSPFP 239

Query: 83  DLECTPGHPLFVEFRTGNHTQFLNLDK 3
           D +   G   F EFR G+  + LNLDK
Sbjct: 240 DGDFAAG-LRFPEFRMGDPPKLLNLDK 265


>gb|KHG14068.1| hypothetical protein F383_19964 [Gossypium arboreum]
          Length = 457

 Score =  286 bits (731), Expect = 1e-74
 Identities = 155/267 (58%), Positives = 183/267 (68%), Gaps = 5/267 (1%)
 Frame = -3

Query: 788 RGVNG-----TDALETINAAASVIASAENRVPQVSAQKRRWRSYWSLFWCFGSDNNKQRI 624
           RG NG      + LETI+AAA+ IASAENRVPQ + QK+RW  +WS +WCF S   K+RI
Sbjct: 2   RGANGESRAMNNPLETIHAAANAIASAENRVPQSTVQKKRWGGWWSKYWCFRSYKQKKRI 61

Query: 623 GHAVLVPETPTTRADGPTAENPPQPLSIXXXXXXXXXXXXXXXXXXXXSATQSPTGLISL 444
           G AV V ET ++RA+ P AE P Q +++                    SATQSP GL+SL
Sbjct: 62  GPAVPVSETTSSRANIPAAEIPTQAVTVTLPFVAPPSSPASFLPSEPPSATQSPAGLVSL 121

Query: 443 TSISANMYSPGGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPYTPPPESVHLTTPSS 264
           TSISA+MYSPG P+SIFAIGPYAHETQLVSPPVFSTFTTEPSTAP+TPPPESVHLTTPSS
Sbjct: 122 TSISASMYSPG-PASIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSS 180

Query: 263 PEVPFARLLEPSFQSGEGGQRYPLSQYEFQSYQLQPGSPVSHLXXXXXXXXXXXXXXPFP 84
           PEVPFA+ L P+ Q GE GQR+P+ QYEFQSYQL PGSP+  L              PFP
Sbjct: 181 PEVPFAQFLGPNLQYGEAGQRFPIYQYEFQSYQLHPGSPIGQLISPSSGISGSGTSSPFP 240

Query: 83  DLECTPGHPLFVEFRTGNHTQFLNLDK 3
           D +   G   F EFR G+  + L+L+K
Sbjct: 241 DGDFASG-LRFPEFRMGDPPKLLSLEK 266


>ref|XP_002298027.2| hypothetical protein POPTR_0001s09590g [Populus trichocarpa]
           gi|550346901|gb|EEE82832.2| hypothetical protein
           POPTR_0001s09590g [Populus trichocarpa]
          Length = 453

 Score =  285 bits (730), Expect = 2e-74
 Identities = 152/267 (56%), Positives = 176/267 (65%), Gaps = 5/267 (1%)
 Frame = -3

Query: 788 RGVNG-----TDALETINAAASVIASAENRVPQVSAQKRRWRSYWSLFWCFGSDNNKQRI 624
           RG NG      + LETINAAA+ IASAENRVPQ + QKRRW S WS++ CFG   +K++I
Sbjct: 2   RGFNGESRAANNTLETINAAATAIASAENRVPQATVQKRRWGSCWSIYLCFGYQKHKKQI 61

Query: 623 GHAVLVPETPTTRADGPTAENPPQPLSIXXXXXXXXXXXXXXXXXXXXSATQSPTGLISL 444
           GHAVL PE        P +ENP Q  ++                    S TQSP GL+SL
Sbjct: 62  GHAVLFPEPSAPGNGAPASENPTQAPAVTLPFAAPPSSPASFFQSEPPSVTQSPAGLVSL 121

Query: 443 TSISANMYSPGGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPYTPPPESVHLTTPSS 264
           TSISA+MYSP GP+SIFAIGPYAHETQLVSPPVFSTFTTEPSTAP+TPPPESVHLTTPSS
Sbjct: 122 TSISASMYSPSGPASIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSS 181

Query: 263 PEVPFARLLEPSFQSGEGGQRYPLSQYEFQSYQLQPGSPVSHLXXXXXXXXXXXXXXPFP 84
           PEVPFA+ L+PS ++G+ G R+P   ++FQSYQ  PGSPV  L              PFP
Sbjct: 182 PEVPFAQFLDPSLRNGDTGLRFP---FDFQSYQFHPGSPVGQLISPSSGISGSGTSSPFP 238

Query: 83  DLECTPGHPLFVEFRTGNHTQFLNLDK 3
           D E   G   F EFR G   + LNLDK
Sbjct: 239 DGEFAVGGAHFPEFRIGEPPKLLNLDK 265


>ref|XP_009589061.1| PREDICTED: uncharacterized protein At1g76660 [Nicotiana
           tomentosiformis]
          Length = 446

 Score =  284 bits (726), Expect = 5e-74
 Identities = 155/267 (58%), Positives = 173/267 (64%), Gaps = 5/267 (1%)
 Frame = -3

Query: 788 RGVNGTD-----ALETINAAASVIASAENRVPQVSAQKRRWRSYWSLFWCFGSDNNKQRI 624
           RGVNG        LETINAAA+ IAS ENRVPQ S QKRRW S WS++WCFGS    +RI
Sbjct: 2   RGVNGEQRGVDSTLETINAAATAIASVENRVPQASVQKRRWGSCWSMYWCFGSQKQTKRI 61

Query: 623 GHAVLVPETPTTRADGPTAENPPQPLSIXXXXXXXXXXXXXXXXXXXXSATQSPTGLISL 444
           GHAV VPET    AD P A+N  Q  SI                    SAT SP G   L
Sbjct: 62  GHAVFVPETTAPGADRPAADNSTQAPSIVLPFIAPPSSPASFLPSEPPSATHSPVGSKCL 121

Query: 443 TSISANMYSPGGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPYTPPPESVHLTTPSS 264
           +S+S   YSP GP+SIFAIGPYAHE QLVSPPVFSTFTTEPSTAP+TPPPESVHLTTPSS
Sbjct: 122 SSMST--YSPSGPASIFAIGPYAHEPQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSS 179

Query: 263 PEVPFARLLEPSFQSGEGGQRYPLSQYEFQSYQLQPGSPVSHLXXXXXXXXXXXXXXPFP 84
           PEVPFA+LL+P+ Q+ + G R+P +QYEFQSYQLQPGSPVS+L              PF 
Sbjct: 180 PEVPFAKLLDPNHQNVDAGNRFPFAQYEFQSYQLQPGSPVSNLISPGSAISVSGTSSPFL 239

Query: 83  DLECTPGHPLFVEFRTGNHTQFLNLDK 3
           D E  PG P           QFLNL+K
Sbjct: 240 DRESNPGRP-----------QFLNLEK 255


>ref|XP_011029307.1| PREDICTED: uncharacterized protein LOC105129075 isoform X1 [Populus
           euphratica]
          Length = 453

 Score =  283 bits (725), Expect = 7e-74
 Identities = 151/267 (56%), Positives = 176/267 (65%), Gaps = 5/267 (1%)
 Frame = -3

Query: 788 RGVNG-----TDALETINAAASVIASAENRVPQVSAQKRRWRSYWSLFWCFGSDNNKQRI 624
           RG NG      + LETINAAA+ IASAENRVPQ + QKRRW S WS++ CFG   +K++I
Sbjct: 2   RGFNGESRAANNTLETINAAATAIASAENRVPQATVQKRRWGSCWSIYLCFGYQKHKKQI 61

Query: 623 GHAVLVPETPTTRADGPTAENPPQPLSIXXXXXXXXXXXXXXXXXXXXSATQSPTGLISL 444
           GHAVL PE        P +ENP Q   +                    S TQSP GL+SL
Sbjct: 62  GHAVLFPEPSAPGNGAPASENPTQAPVVTLPFAAPPSSPASFFQSEPPSVTQSPAGLVSL 121

Query: 443 TSISANMYSPGGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPYTPPPESVHLTTPSS 264
           TSISA+MYSP GP+SIFAIGPYAHETQLVSPPVFSTFTTEPSTAP+TPPPESVHLTTPSS
Sbjct: 122 TSISASMYSPSGPASIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSS 181

Query: 263 PEVPFARLLEPSFQSGEGGQRYPLSQYEFQSYQLQPGSPVSHLXXXXXXXXXXXXXXPFP 84
           PEVPFA+ L+PS ++G+ G R+P   ++FQSYQ  PGSPV  L              PFP
Sbjct: 182 PEVPFAQFLDPSLRNGDKGLRFP---FDFQSYQFHPGSPVGQLISPSSGISGSGTSSPFP 238

Query: 83  DLECTPGHPLFVEFRTGNHTQFLNLDK 3
           D E + G   F EFR G   + L+LDK
Sbjct: 239 DGEFSVGGAHFTEFRMGEPPKLLSLDK 265


Top