BLASTX nr result

ID: Forsythia22_contig00027648 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Forsythia22_contig00027648
         (1326 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011070539.1| PREDICTED: uncharacterized protein LOC105156...   332   5e-88
ref|XP_011070538.1| PREDICTED: uncharacterized protein LOC105156...   330   2e-87
ref|XP_007225122.1| hypothetical protein PRUPE_ppa002630mg [Prun...   308   5e-81
ref|XP_006355042.1| PREDICTED: uncharacterized protein LOC102600...   308   8e-81
ref|XP_004297399.1| PREDICTED: uncharacterized protein LOC101309...   308   8e-81
ref|XP_004236917.1| PREDICTED: uncharacterized protein LOC101261...   304   9e-80
ref|XP_009607025.1| PREDICTED: uncharacterized protein LOC104101...   303   2e-79
ref|XP_009772695.1| PREDICTED: uncharacterized protein LOC104223...   299   3e-78
ref|XP_012846113.1| PREDICTED: uncharacterized protein LOC105966...   298   5e-78
ref|XP_007045468.1| Hydroxyproline-rich glycoprotein family prot...   296   3e-77
ref|XP_007045467.1| Hydroxyproline-rich glycoprotein family prot...   291   8e-76
ref|XP_008220943.1| PREDICTED: uncharacterized protein LOC103320...   286   3e-74
ref|XP_011021729.1| PREDICTED: uncharacterized protein LOC105123...   280   2e-72
ref|XP_011021730.1| PREDICTED: uncharacterized protein LOC105123...   279   4e-72
ref|XP_011021728.1| PREDICTED: uncharacterized protein LOC105123...   279   4e-72
gb|KDO60855.1| hypothetical protein CISIN_1g006789mg [Citrus sin...   275   7e-71
ref|XP_012448358.1| PREDICTED: uncharacterized protein LOC105771...   272   5e-70
ref|XP_012448359.1| PREDICTED: uncharacterized protein LOC105771...   272   5e-70
ref|XP_012448356.1| PREDICTED: uncharacterized protein LOC105771...   270   2e-69
ref|XP_012448357.1| PREDICTED: uncharacterized protein LOC105771...   270   2e-69

>ref|XP_011070539.1| PREDICTED: uncharacterized protein LOC105156173 isoform X2 [Sesamum
           indicum]
          Length = 650

 Score =  332 bits (850), Expect = 5e-88
 Identities = 178/336 (52%), Positives = 218/336 (64%), Gaps = 28/336 (8%)
 Frame = -3

Query: 925 MAMQSGNVAAPEKMPGPGEAVAVRGQWFHYQQ-----HQQLDERDGFLMWLRGEFAAANA 761
           MAMQS  V  PEK+P          QW+++Q      H Q+DER+GFLMWLRGEFAAANA
Sbjct: 1   MAMQSAAVVVPEKIPV---------QWYNHQHQQQQPHHQMDEREGFLMWLRGEFAAANA 51

Query: 760 IIDSLCHHLRVVGEQGEYDGVIGCIQQRRCNWNPVLHMQQYFPVAEVIYALQQVGWRRQQ 581
           IID+LCHHLR VG+ GEYDGVIG IQQRRCNWNP+LHMQQYF VAEV+YALQQVGWRRQQ
Sbjct: 52  IIDALCHHLRTVGDPGEYDGVIGSIQQRRCNWNPILHMQQYFSVAEVLYALQQVGWRRQQ 111

Query: 580 RAAGFDGGVKM-GSWKXXXXXXXXXXXXXXGQNSSVEMNTKNL-NVYVKPNADTNE--NA 413
           R   F+G V+M G  K                N   E+N K+L N Y K N + N+  + 
Sbjct: 112 RTVAFEGSVRMGGGGKEFRRGGRGQRGSVEVHNLGGEVNGKDLNNGYAKSNLNVNDKLDG 171

Query: 412 RENNNLDXXXXXXXXXXXXXGSCIVRKE-------------------SRSIQILNEKQNL 290
            E   ++                +V ++                     S   L EK+NL
Sbjct: 172 GEKAKVEEKEEKKELNEKSEADSLVTRQGSTQGAVHHADEVEGSCGVDASASALEEKRNL 231

Query: 289 NTTPKTFVSTELYDGKLVNVVEGLKLYEELFDDLEVSKLVNLVNDLRAAGRRGQLQGQTF 110
           + +PKTFV+ E+ DGK VN+VEG+KLYE+  +D E+SKL+ LVNDLRAAGRRGQLQG +F
Sbjct: 232 DVSPKTFVANEICDGKSVNIVEGMKLYEDQVNDSEISKLIALVNDLRAAGRRGQLQGHSF 291

Query: 109 IVSKRPTRGHGREMIQLGVPIADAPPEDESAAGISK 2
           ++SKRP +GHGREMIQLGVPIADAPPEDE+A+G S+
Sbjct: 292 VISKRPMKGHGREMIQLGVPIADAPPEDEAASGASR 327


>ref|XP_011070538.1| PREDICTED: uncharacterized protein LOC105156173 isoform X1 [Sesamum
           indicum]
          Length = 652

 Score =  330 bits (845), Expect = 2e-87
 Identities = 179/338 (52%), Positives = 217/338 (64%), Gaps = 30/338 (8%)
 Frame = -3

Query: 925 MAMQSGNVAAPEKMPGPGEAVAVRGQWFHYQQ-----HQQLDERDGFLMWLRGEFAAANA 761
           MAMQS  V  PEK+P          QW+++Q      H Q+DER+GFLMWLRGEFAAANA
Sbjct: 1   MAMQSAAVVVPEKIPV---------QWYNHQHQQQQPHHQMDEREGFLMWLRGEFAAANA 51

Query: 760 IIDSLCHHLRVVGEQGEYDGVIGCIQQRRCNWNPVLHMQQYFPVAEVIYALQQVGWRRQQ 581
           IID+LCHHLR VG+ GEYDGVIG IQQRRCNWNP+LHMQQYF VAEV+YALQQVGWRRQQ
Sbjct: 52  IIDALCHHLRTVGDPGEYDGVIGSIQQRRCNWNPILHMQQYFSVAEVLYALQQVGWRRQQ 111

Query: 580 RAAGFDGGVKM-GSWKXXXXXXXXXXXXXXGQNSSVEMNTKNL-NVYVKPNADTNE---- 419
           R   F+G V+M G  K                N   E+N K+L N Y K N + N+    
Sbjct: 112 RTVAFEGSVRMGGGGKEFRRGGRGQRGSVEVHNLGGEVNGKDLNNGYAKSNLNVNDKLDG 171

Query: 418 -NARENNNLDXXXXXXXXXXXXXGSCIVRKES------------------RSIQILNEKQ 296
               +    +              S + R+ S                   S   L EK+
Sbjct: 172 GEKAKVEEKEEKKVTELNEKSEADSLVTRQGSTQGAVHHADEVEGSCGVDASASALEEKR 231

Query: 295 NLNTTPKTFVSTELYDGKLVNVVEGLKLYEELFDDLEVSKLVNLVNDLRAAGRRGQLQGQ 116
           NL+ +PKTFV+ E+ DGK VN+VEG+KLYE+  +D E+SKL+ LVNDLRAAGRRGQLQG 
Sbjct: 232 NLDVSPKTFVANEICDGKSVNIVEGMKLYEDQVNDSEISKLIALVNDLRAAGRRGQLQGH 291

Query: 115 TFIVSKRPTRGHGREMIQLGVPIADAPPEDESAAGISK 2
           +F++SKRP +GHGREMIQLGVPIADAPPEDE+A+G S+
Sbjct: 292 SFVISKRPMKGHGREMIQLGVPIADAPPEDEAASGASR 329


>ref|XP_007225122.1| hypothetical protein PRUPE_ppa002630mg [Prunus persica]
           gi|462422058|gb|EMJ26321.1| hypothetical protein
           PRUPE_ppa002630mg [Prunus persica]
          Length = 650

 Score =  308 bits (790), Expect = 5e-81
 Identities = 179/321 (55%), Positives = 208/321 (64%), Gaps = 13/321 (4%)
 Frame = -3

Query: 925 MAMQSGNVAAPEKM--PGPGEAVAVRGQWFHYQQHQQL--DERDGFLMWLRGEFAAANAI 758
           M M SGNV   +KM  P  G   AV G     Q H+Q   DERDGF+ WLRGEFAAANAI
Sbjct: 1   MTMPSGNVVLSDKMQFPSGGGGGAVGGGEIA-QHHRQWFPDERDGFISWLRGEFAAANAI 59

Query: 757 IDSLCHHLRVVGEQGEYDGVIGCIQQRRCNWNPVLHMQQYFPVAEVIYALQQVGWRRQQR 578
           IDSLCHHLR VGE GEYD VIGCIQQRRCNWNPVLHMQQYF VAEVIYALQ V WRRQQR
Sbjct: 60  IDSLCHHLRAVGEPGEYDVVIGCIQQRRCNWNPVLHMQQYFSVAEVIYALQHVAWRRQQR 119

Query: 577 AAGFDGGVKMGSWKXXXXXXXXXXXXXXGQ------NSSVEMNTKNLNVYVKPNADTNEN 416
              +   VK G+ +               +      NS++E ++ + N       +  E 
Sbjct: 120 ---YYDPVKAGAKEFKRSGVGFNKGQQRAEAFKEGHNSTLESHSNDGNSSGVVAPEKFER 176

Query: 415 ARE-NNNLDXXXXXXXXXXXXXGSCIVRK--ESRSIQILNEKQNLNTTPKTFVSTELYDG 245
             E    ++                  +K  ES SIQI N+KQNL+  PKTF+  E+ DG
Sbjct: 177 GSEVGEEVEPGGEVGKLNDKGLAPAGEKKVNESHSIQIQNQKQNLSIVPKTFIGNEISDG 236

Query: 244 KLVNVVEGLKLYEELFDDLEVSKLVNLVNDLRAAGRRGQLQGQTFIVSKRPTRGHGREMI 65
           K VNVV+GLKLYE+   D EVSKLV+LVNDLRAAG+R QLQGQT++VSKRP +GHGREMI
Sbjct: 237 KTVNVVDGLKLYEDFLGDTEVSKLVSLVNDLRAAGKRRQLQGQTYVVSKRPMKGHGREMI 296

Query: 64  QLGVPIADAPPEDESAAGISK 2
           QLG+PIADAPPEDE +AG SK
Sbjct: 297 QLGIPIADAPPEDEISAGTSK 317


>ref|XP_006355042.1| PREDICTED: uncharacterized protein LOC102600383 [Solanum tuberosum]
          Length = 638

 Score =  308 bits (788), Expect = 8e-81
 Identities = 185/351 (52%), Positives = 218/351 (62%), Gaps = 45/351 (12%)
 Frame = -3

Query: 919  MQSGN--VAAPEKMPGPG---EAVAV--------RGQWFHYQQHQQLDERDGFLMWLRGE 779
            MQSGN  VA PEKM G G   EAVAV        + QWFH    QQ+DERDGF+ WLRGE
Sbjct: 1    MQSGNAAVAVPEKMNGNGVGGEAVAVALPRQHQHQQQWFH---PQQVDERDGFISWLRGE 57

Query: 778  FAAANAIIDSLCHHLRVVGEQGEYDGVIGCIQQRRCNWNPVLHMQQYFPVAEVIYALQQV 599
            FAA+NAIID+LCHHLR+VGE GEYDGVIGC+QQRR NWN VLHMQQY  VAEVIY+L QV
Sbjct: 58   FAASNAIIDALCHHLRLVGEPGEYDGVIGCVQQRRANWNSVLHMQQYHSVAEVIYSLHQV 117

Query: 598  GWRRQQRAAGFDGGVKM-----------GSWKXXXXXXXXXXXXXXGQNSSVEMNTKNLN 452
             W +QQ+  GFDGGVK            G WK              GQN S++ ++K   
Sbjct: 118  EWMKQQK--GFDGGVKKVEKRNGSRGGGGGWK---SEGLKDGKESQGQNFSLDAHSKTNG 172

Query: 451  V--------------YVKPNADTNENARE-------NNNLDXXXXXXXXXXXXXGSCIVR 335
            V               +  N + N + +        ++  +             GS  V 
Sbjct: 173  VEKIDVVEVKQGEKKELAANPEANSSVKSSVCTEAGDSQGEVDKTDDKRDSNSEGSSNVE 232

Query: 334  KESRSIQILNEKQNLNTTPKTFVSTELYDGKLVNVVEGLKLYEELFDDLEVSKLVNLVND 155
             ES SIQ+  EKQN+   PKTFV+TE+YDGK VNVV+G+KLYEEL    EVSKL+ LVND
Sbjct: 233  SESHSIQVPTEKQNV--VPKTFVATEIYDGKPVNVVDGMKLYEELLSSSEVSKLLTLVND 290

Query: 154  LRAAGRRGQLQGQTFIVSKRPTRGHGREMIQLGVPIADAPPEDESAAGISK 2
            LRAAGRRGQL  Q FIVSKRP +GHGREM+QLG+PI DAPPE+E+A    K
Sbjct: 291  LRAAGRRGQLPAQAFIVSKRPMKGHGREMVQLGLPIVDAPPEEEAAISTYK 341


>ref|XP_004297399.1| PREDICTED: uncharacterized protein LOC101309147 [Fragaria vesca
            subsp. vesca]
          Length = 682

 Score =  308 bits (788), Expect = 8e-81
 Identities = 182/352 (51%), Positives = 202/352 (57%), Gaps = 44/352 (12%)
 Frame = -3

Query: 925  MAMQSGNVAAPEKMPGPGEA-VAVRGQWFHYQQHQQL-DERDGFLMWLRGEFAAANAIID 752
            M M SGNV   +KM  P  A  AV G   H Q  Q   DERDGF+ WLRGEFAAANAIID
Sbjct: 1    MTMPSGNVVLSDKMQYPSVAGAAVSGGEIHQQPRQWFPDERDGFISWLRGEFAAANAIID 60

Query: 751  SLCHHLRVVGEQGEYDGVIGCIQQRRCNWNPVLHMQQYFPVAEVIYALQQVGWRRQQR-- 578
            SLCHHLR VGE  EYD VIGC+QQRRCNW PVLHMQQYF VAEVIYALQQV WRRQQR  
Sbjct: 61   SLCHHLRAVGEPSEYDMVIGCVQQRRCNWTPVLHMQQYFSVAEVIYALQQVAWRRQQRYY 120

Query: 577  ---------------AAGF----------------------DGGVKMGSWKXXXXXXXXX 509
                             GF                       G  K+GS           
Sbjct: 121  EPVKMGNKDYKRSNSGVGFKPRNEPVKEWHTASVEYRSYDGSGLEKVGSEMREEVKPGGE 180

Query: 508  XXXXXGQNSSVEMNTKNLNVYVKPNADTNENARENNNLDXXXXXXXXXXXXXGSC---IV 338
                  + S+    TK   V  KP+   +  +  N+                  C   I 
Sbjct: 181  AGKVDDKGSAAGAVTK--GVLTKPHEYISSRSSANSQGTISGNSESEDAVVNEGCTSSIK 238

Query: 337  RKESRSIQILNEKQNLNTTPKTFVSTELYDGKLVNVVEGLKLYEELFDDLEVSKLVNLVN 158
              ES SIQI NEKQNL+  PKTFV  E +DGK VNVV+GLKLYEE   D EVSKL +LVN
Sbjct: 239  ENESNSIQIQNEKQNLSLIPKTFVGNETFDGKTVNVVDGLKLYEEFLGDTEVSKLFSLVN 298

Query: 157  DLRAAGRRGQLQGQTFIVSKRPTRGHGREMIQLGVPIADAPPEDESAAGISK 2
            DLR  GRRGQLQGQT+++SKRP +GHGREMIQLG+PIAD P EDE +AGISK
Sbjct: 299  DLRTTGRRGQLQGQTYVLSKRPMKGHGREMIQLGIPIADGPQEDEISAGISK 350


>ref|XP_004236917.1| PREDICTED: uncharacterized protein LOC101261013 [Solanum
            lycopersicum]
          Length = 641

 Score =  304 bits (779), Expect = 9e-80
 Identities = 186/351 (52%), Positives = 215/351 (61%), Gaps = 45/351 (12%)
 Frame = -3

Query: 919  MQSGN------VAAPEKMP---GPGEAVAVRGQWFHYQQH---QQLDERDGFLMWLRGEF 776
            MQSGN      VA PEK     G GEAVAV  Q  H QQ    QQ+DERDGF+ WLRGEF
Sbjct: 1    MQSGNAAVAVAVAVPEKKHSNGGGGEAVAVPRQHQHQQQWFHPQQVDERDGFISWLRGEF 60

Query: 775  AAANAIIDSLCHHLRVVGEQGEYDGVIGCIQQRRCNWNPVLHMQQYFPVAEVIYALQQVG 596
            AA+NAIID+LCHHLR+VGE GEYDGVIGC+QQRR NWN VLHMQQY  VAEVIY+L QV 
Sbjct: 61   AASNAIIDALCHHLRLVGEPGEYDGVIGCVQQRRANWNSVLHMQQYHSVAEVIYSLHQVE 120

Query: 595  WRRQQRAAGFDGGVKM------------GSWKXXXXXXXXXXXXXXGQNSSVEMNTK--- 461
            W +QQ+  GFDGGV              G WK              GQN S++ ++K   
Sbjct: 121  WMKQQK--GFDGGVNKVGKRNGSKGGGGGGWK---SEGLKDGKESQGQNFSLDAHSKTNG 175

Query: 460  --NLNVYVKPNADTNENARE----------------NNNLDXXXXXXXXXXXXXGSCIVR 335
               ++V  +   D  E A +                ++  +             GS  V 
Sbjct: 176  VEKIDVVEEKQGDKKELAAKPEANSSVKGSVCTEAGDSQGEVDKTDDKRDSNSEGSSNVE 235

Query: 334  KESRSIQILNEKQNLNTTPKTFVSTELYDGKLVNVVEGLKLYEELFDDLEVSKLVNLVND 155
             ES S QI  EKQN+   PKTFV+TE+YDGK VNVV+G+KLYEEL    EVSKLV LVND
Sbjct: 236  SESHSFQIPTEKQNV--VPKTFVATEIYDGKPVNVVDGMKLYEELLSSSEVSKLVTLVND 293

Query: 154  LRAAGRRGQLQGQTFIVSKRPTRGHGREMIQLGVPIADAPPEDESAAGISK 2
            LRAAGRRGQL  Q FIVSKRP +GHGREM+QLG+PI DAPPE+ESA    K
Sbjct: 294  LRAAGRRGQLPAQAFIVSKRPMKGHGREMVQLGLPIVDAPPEEESAISTYK 344


>ref|XP_009607025.1| PREDICTED: uncharacterized protein LOC104101279 [Nicotiana
           tomentosiformis]
          Length = 638

 Score =  303 bits (776), Expect = 2e-79
 Identities = 179/330 (54%), Positives = 208/330 (63%), Gaps = 29/330 (8%)
 Frame = -3

Query: 919 MQSGNVA-APEKMPGPGEAVAVRGQWFHYQQHQQLDERDGFLMWLRGEFAAANAIIDSLC 743
           MQSGN    PEKM G G   AV     H QQ    DERDGF+ WLRGEFAAANAIID+LC
Sbjct: 1   MQSGNATLVPEKMHGGGGGEAVEPPP-HQQQWFMGDERDGFISWLRGEFAAANAIIDALC 59

Query: 742 HHLRVVGEQ-GEYDGVIGCIQQRRCNWNPVLHMQQYFPVAEVIYALQQVGWRRQQ----- 581
           HHLR+VGEQ GEYDGVIGC+QQRR NW+ VLHMQQYF VAEVIYAL QV WR+QQ     
Sbjct: 60  HHLRLVGEQPGEYDGVIGCVQQRRGNWSTVLHMQQYFSVAEVIYALHQVEWRKQQKGGFN 119

Query: 580 --RAAGFDGGVKMGSWKXXXXXXXXXXXXXXGQNSS-----VEMNTKNLNV--------Y 446
             R +G   G   G W+                + S     +++  K ++V         
Sbjct: 120 NKRNSGGSRGGGGGGWRSEGHNFTMDANSKEFYSKSNGVEKIDVIEKEIDVKQGEKKELV 179

Query: 445 VKPNADTN-------ENARENNNLDXXXXXXXXXXXXXGSCIVRKESRSIQILNEKQNLN 287
             P  D++       E A   + +D             GSC    ESRS ++ NEKQN+ 
Sbjct: 180 GNPEGDSSMKSSVCIEAADSQSEMD--KTDHKRDSNSDGSCKAENESRSSEVPNEKQNVT 237

Query: 286 TTPKTFVSTELYDGKLVNVVEGLKLYEELFDDLEVSKLVNLVNDLRAAGRRGQLQGQTFI 107
             PKTFV+TE+YDGK VNVV+G+KLYEEL    EVSKLV LVNDLRA+GRRGQL  QTFI
Sbjct: 238 VVPKTFVATEIYDGKPVNVVDGMKLYEELLSSSEVSKLVTLVNDLRASGRRGQLSAQTFI 297

Query: 106 VSKRPTRGHGREMIQLGVPIADAPPEDESA 17
           VSKRP +GHGREMIQLG+PIADAPPEDE+A
Sbjct: 298 VSKRPMKGHGREMIQLGLPIADAPPEDEAA 327


>ref|XP_009772695.1| PREDICTED: uncharacterized protein LOC104223045 [Nicotiana
           sylvestris]
          Length = 633

 Score =  299 bits (766), Expect = 3e-78
 Identities = 175/329 (53%), Positives = 205/329 (62%), Gaps = 28/329 (8%)
 Frame = -3

Query: 919 MQSGNVA-APEKMPGPGEAVAVRGQWFHYQQHQQLDERDGFLMWLRGEFAAANAIIDSLC 743
           MQSGN    PEKM G G   AV     H QQ    DERDGF+ WLRGEFAAANAIID+LC
Sbjct: 1   MQSGNATLVPEKMHGGGGGEAVAPPP-HQQQWFMGDERDGFISWLRGEFAAANAIIDALC 59

Query: 742 HHLRVVGEQ-GEYDGVIGCIQQRRCNWNPVLHMQQYFPVAEVIYALQQVGWRRQQR---- 578
           HHLR+VGEQ GEYDGVIGC+QQRR NW+ VLHMQQYF VAEVIYAL QV WR+QQ+    
Sbjct: 60  HHLRLVGEQPGEYDGVIGCVQQRRGNWSTVLHMQQYFSVAEVIYALHQVEWRKQQKGGFN 119

Query: 577 ----AAGFDGGVKMGSWKXXXXXXXXXXXXXXGQNSS-----VEMNTKNLNVYVKPNADT 425
                 G  GG   G W+                + S     +++  K ++V      + 
Sbjct: 120 NKRNGGGSRGGGGGGGWRSEGHNFSMDANSKEFYSKSNGVGKIDVIEKEIDVKQGEKKEL 179

Query: 424 NENARENNNL-------------DXXXXXXXXXXXXXGSCIVRKESRSIQILNEKQNLNT 284
             N   N+++             +             GS  V  ESRS Q+ NEKQN+  
Sbjct: 180 VGNPEGNSSMKSSVCIEAADSQSEIDKTDHKRDSKSDGSWNVENESRSSQVPNEKQNVTI 239

Query: 283 TPKTFVSTELYDGKLVNVVEGLKLYEELFDDLEVSKLVNLVNDLRAAGRRGQLQGQTFIV 104
            PKTFV+TE+ DGK VNVV+G+KLYEEL    EVSKLV LVNDLRA+GRRGQL  QTFI+
Sbjct: 240 VPKTFVATEICDGKPVNVVDGMKLYEELLSSSEVSKLVTLVNDLRASGRRGQLSAQTFII 299

Query: 103 SKRPTRGHGREMIQLGVPIADAPPEDESA 17
           SKRP +GHGREMIQLG+PIADAPPEDE+A
Sbjct: 300 SKRPMKGHGREMIQLGLPIADAPPEDEAA 328


>ref|XP_012846113.1| PREDICTED: uncharacterized protein LOC105966112 [Erythranthe
           guttatus]
          Length = 655

 Score =  298 bits (764), Expect = 5e-78
 Identities = 169/324 (52%), Positives = 193/324 (59%), Gaps = 20/324 (6%)
 Frame = -3

Query: 925 MAMQSGNVAAPEKMPGPGEAVAVRGQWFHYQQ-------HQQLDERDGFLMWLRGEFAAA 767
           MAMQ G V  P+K P          QW++ QQ       HQQ+DE+   LMWLRGEFAAA
Sbjct: 1   MAMQPGAVVVPDKTPA---------QWYNPQQQQQSPPPHQQMDEKKALLMWLRGEFAAA 51

Query: 766 NAIIDSLCHHLRVVGEQGEYDGVIGCIQQRRCNWNPVLHMQQYFPVAEVIYALQQVGWRR 587
           NAIID+LCHHLR VG  GEYDGVIG IQQRRCNWNPVLHMQQYFPV EV+Y+LQQVGWRR
Sbjct: 52  NAIIDALCHHLRAVGGPGEYDGVIGSIQQRRCNWNPVLHMQQYFPVTEVVYSLQQVGWRR 111

Query: 586 QQRAAGFDGGVKMGSWKXXXXXXXXXXXXXXGQNSSVEMNTKNL--NVYVKPNADTN--- 422
           +Q+ AGF+G +  G  K               Q    E+       N Y K N + N   
Sbjct: 112 EQKPAGFEGRIGGGGGKDFRRGGRGQRVGVEVQKLGGEVTNGKYSNNAYAKSNVNGNGKL 171

Query: 421 -----ENARENNNLDXXXXXXXXXXXXXGSCIVRKESRSIQIL---NEKQNLNTTPKTFV 266
                 N  E                   +    KE      L   +EK NL  +PK+F 
Sbjct: 172 DGGDKANVEEKGEKKDSSEMKQGSTQGAVANADDKEDAVGDFLAPTSEKHNLEVSPKSFT 231

Query: 265 STELYDGKLVNVVEGLKLYEELFDDLEVSKLVNLVNDLRAAGRRGQLQGQTFIVSKRPTR 86
            TE  +GKLVN+ EG+KLYE + DD E+SKL  LVN LRAAGRRGQL GQTFIVSKRP +
Sbjct: 232 VTETCEGKLVNIAEGMKLYENVLDDSEISKLNTLVNALRAAGRRGQLHGQTFIVSKRPMK 291

Query: 85  GHGREMIQLGVPIADAPPEDESAA 14
           G GRE IQLGVPIADAP E ESAA
Sbjct: 292 GRGREFIQLGVPIADAPLEYESAA 315


>ref|XP_007045468.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 2
            [Theobroma cacao] gi|590697545|ref|XP_007045470.1|
            Hydroxyproline-rich glycoprotein family protein, putative
            isoform 2 [Theobroma cacao] gi|508709403|gb|EOY01300.1|
            Hydroxyproline-rich glycoprotein family protein, putative
            isoform 2 [Theobroma cacao] gi|508709405|gb|EOY01302.1|
            Hydroxyproline-rich glycoprotein family protein, putative
            isoform 2 [Theobroma cacao]
          Length = 680

 Score =  296 bits (757), Expect = 3e-77
 Identities = 177/354 (50%), Positives = 203/354 (57%), Gaps = 46/354 (12%)
 Frame = -3

Query: 925  MAMQSGNVAAPEKMPGPGEAVAVRGQW---------------FHYQQHQQL--DERDGFL 797
            MAM SGNV   +KM  P  A A  G                  H   H+Q   DERDGF+
Sbjct: 1    MAMPSGNVVLSDKMQFPATAAAGAGGGGAVGAVGGGGGGGGEIHQHHHRQWLPDERDGFI 60

Query: 796  MWLRGEFAAANAIIDSLCHHLRVVGEQGEYDGVIGCIQQRRCNWNPVLHMQQYFPVAEVI 617
             WLRGEFAA+NAIIDSLCHHLR VGE GEY+ VI CIQQRRCNWNPVLHMQQYF VAEV 
Sbjct: 61   YWLRGEFAASNAIIDSLCHHLREVGEVGEYEAVIACIQQRRCNWNPVLHMQQYFSVAEVS 120

Query: 616  YALQQVGWRRQQR--AAGFDGGVKMGSWKXXXXXXXXXXXXXXGQNSSVEMNTKNLNVYV 443
            YALQQV WRR+QR   +G  GG K                   GQNS V+ +  +    V
Sbjct: 121  YALQQVAWRRRQRHYESGKVGG-KEFKRSGMGFKGQRMEVAKEGQNSGVDSDGNSTVTAV 179

Query: 442  KPNADTNENAREN---------------------------NNLDXXXXXXXXXXXXXGSC 344
                +     RE                             +                S 
Sbjct: 180  SERNERGSEKREEVKSCGEVGKVEDKCSTFTEDKKDTGSKPHAGDAESVTEDVNGGCTSS 239

Query: 343  IVRKESRSIQILNEKQNLNTTPKTFVSTELYDGKLVNVVEGLKLYEELFDDLEVSKLVNL 164
                +  SIQ  NEKQNL   PKTFV  E++DGK+VNVV+GLKLYEELFDD EV  LV+L
Sbjct: 240  YKENDLCSIQNQNEKQNLAAGPKTFVGNEMFDGKMVNVVDGLKLYEELFDDKEVLDLVSL 299

Query: 163  VNDLRAAGRRGQLQGQTFIVSKRPTRGHGREMIQLGVPIADAPPEDESAAGISK 2
            VNDLRAAG+RGQLQGQT++ +KRP +GHGREMIQLG+PIADAP +DE+AAG SK
Sbjct: 300  VNDLRAAGKRGQLQGQTYVAAKRPMKGHGREMIQLGLPIADAPLDDENAAGTSK 353


>ref|XP_007045467.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 1
            [Theobroma cacao] gi|590697542|ref|XP_007045469.1|
            Hydroxyproline-rich glycoprotein family protein, putative
            isoform 1 [Theobroma cacao] gi|508709402|gb|EOY01299.1|
            Hydroxyproline-rich glycoprotein family protein, putative
            isoform 1 [Theobroma cacao] gi|508709404|gb|EOY01301.1|
            Hydroxyproline-rich glycoprotein family protein, putative
            isoform 1 [Theobroma cacao]
          Length = 681

 Score =  291 bits (745), Expect = 8e-76
 Identities = 177/355 (49%), Positives = 203/355 (57%), Gaps = 47/355 (13%)
 Frame = -3

Query: 925  MAMQSGNVAAPEKMPGPGEAVAVRGQW---------------FHYQQHQQL--DERDGFL 797
            MAM SGNV   +KM  P  A A  G                  H   H+Q   DERDGF+
Sbjct: 1    MAMPSGNVVLSDKMQFPATAAAGAGGGGAVGAVGGGGGGGGEIHQHHHRQWLPDERDGFI 60

Query: 796  MWLRGEFAAANAIIDSLCHHLRVVGEQGEYDGVIGCIQQRRCNWNPVLHMQQYFPVAEVI 617
             WLRGEFAA+NAIIDSLCHHLR VGE GEY+ VI CIQQRRCNWNPVLHMQQYF VAEV 
Sbjct: 61   YWLRGEFAASNAIIDSLCHHLREVGEVGEYEAVIACIQQRRCNWNPVLHMQQYFSVAEVS 120

Query: 616  YALQQVGWRRQQR--AAGFDGGVKMGSWKXXXXXXXXXXXXXXGQNSSVEMNTKNLNVYV 443
            YALQQV WRR+QR   +G  GG K                   GQNS V+ +  +    V
Sbjct: 121  YALQQVAWRRRQRHYESGKVGG-KEFKRSGMGFKGQRMEVAKEGQNSGVDSDGNSTVTAV 179

Query: 442  KPNADTNENARE---------------------------NNNLDXXXXXXXXXXXXXGSC 344
                +     RE                             +                S 
Sbjct: 180  SERNERGSEKREEVKSCGEVGKVEDKCSTFTEDKKDTGSKPHAGDAESVTEDVNGGCTSS 239

Query: 343  IVRKESRSIQILNEKQNLNTTPKTFVSTELYDGKLVNVVEGLKLYEELFDDLEVSKLVNL 164
                +  SIQ  NEKQNL   PKTFV  E++DGK+VNVV+GLKLYEELFDD EV  LV+L
Sbjct: 240  YKENDLCSIQNQNEKQNLAAGPKTFVGNEMFDGKMVNVVDGLKLYEELFDDKEVLDLVSL 299

Query: 163  VNDLRAAGRRGQLQ-GQTFIVSKRPTRGHGREMIQLGVPIADAPPEDESAAGISK 2
            VNDLRAAG+RGQLQ GQT++ +KRP +GHGREMIQLG+PIADAP +DE+AAG SK
Sbjct: 300  VNDLRAAGKRGQLQAGQTYVAAKRPMKGHGREMIQLGLPIADAPLDDENAAGTSK 354


>ref|XP_008220943.1| PREDICTED: uncharacterized protein LOC103320980 [Prunus mume]
          Length = 691

 Score =  286 bits (731), Expect = 3e-74
 Identities = 177/359 (49%), Positives = 203/359 (56%), Gaps = 51/359 (14%)
 Frame = -3

Query: 925  MAMQSGNVAAPEKM--PGPGEAVAVRGQWFHYQQHQQL--DERDGFLMWLRGEFAAANAI 758
            M M SGNV   +KM  P  G   AV G     Q H+Q   DERDGF+ WLRGEFAAANAI
Sbjct: 1    MTMPSGNVVLSDKMQFPSGGGGGAVGGGEIP-QHHRQWFPDERDGFISWLRGEFAAANAI 59

Query: 757  IDSLCHHLRVVGEQGEYDGVIGCIQQRRCNWNPVLHMQQYFPVAEVIYALQQVGWRRQQR 578
            IDSLCHHLR VGE GEYD VIGCIQQRRCNWNPVLHMQQYF VAEVIYALQ V WRRQQR
Sbjct: 60   IDSLCHHLRAVGEPGEYDVVIGCIQQRRCNWNPVLHMQQYFSVAEVIYALQHVAWRRQQR 119

Query: 577  ---------------AAGFDGGVKMGSWKXXXXXXXXXXXXXXGQNSSV----------- 476
                             GF+ G +                   G +S V           
Sbjct: 120  YYDPVKAGAKEFKRSGVGFNKGQQRAEAFKEGHNSTLESHSNDGNSSGVVAPEKFERGSE 179

Query: 475  ------------EMNTKNL-------NVYVKPNADTNENARENNNLDXXXXXXXXXXXXX 353
                        ++N K L       +   KP  D+N  +  N+                
Sbjct: 180  VGEEVEPGGEVGKLNDKGLAPAGEKKDALTKPQEDSNLRSFGNSQGTISENSEPEVVEVD 239

Query: 352  GSCIVRKESRSIQILNEKQ--NLNTTPKTFVSTELYDGKLVNVVEGLKLYEELFDDLEVS 179
            G     K + S  I  + Q  NL+  PKTF+  E  DGK VN V+GLKLYE+   D EVS
Sbjct: 240  GCTPSSKVNESHSIQIQNQKQNLSIVPKTFIGNETSDGKTVNAVDGLKLYEDFLGDTEVS 299

Query: 178  KLVNLVNDLRAAGRRGQLQGQTFIVSKRPTRGHGREMIQLGVPIADAPPEDESAAGISK 2
            KL++LVNDLRAAG+R QLQGQT++VSKRP +GHGREMIQLG+PIADAPPEDE +AG SK
Sbjct: 300  KLLSLVNDLRAAGKRRQLQGQTYVVSKRPMKGHGREMIQLGIPIADAPPEDEISAGTSK 358


>ref|XP_011021729.1| PREDICTED: uncharacterized protein LOC105123720 isoform X3 [Populus
            euphratica]
          Length = 689

 Score =  280 bits (715), Expect = 2e-72
 Identities = 172/364 (47%), Positives = 198/364 (54%), Gaps = 56/364 (15%)
 Frame = -3

Query: 925  MAMQSGNVAAPEKMP---GPGEAVAVRGQWFHY---QQHQ--QLDERDGFLMWLRGEFAA 770
            MAM  GNV  P+KM    G G   A  G   H    Q+HQ   +DERDGF+ WLRGEFAA
Sbjct: 1    MAMPPGNVVIPDKMQFPAGAGGGAASAGNEIHQHHPQRHQWFPVDERDGFISWLRGEFAA 60

Query: 769  ANAIIDSLCHHLRVVGEQGEYDGVIGCIQQRRCNWNPVLHMQQYFPVAEVIYALQQVGWR 590
            ANAIIDSLCHHLR VGE GEYD V+GCIQQRRCNWNPVLHMQQYF V EV+ ALQQ   R
Sbjct: 61   ANAIIDSLCHHLRAVGEPGEYDLVVGCIQQRRCNWNPVLHMQQYFSVGEVVAALQQAVLR 120

Query: 589  RQQR-----------------------------AAGFDGGVKMGSWKXXXXXXXXXXXXX 497
            RQQ+                             +AGF+ G + G                
Sbjct: 121  RQQQQQQQNHHHHQHKFYHDQGKVGGKDFKRSSSAGFNRGYRTGGGGEAVKEGVNYSVEN 180

Query: 496  XGQNSSVEMNT-------------------KNLNVYVKPNADTNENARENNNLDXXXXXX 374
               N +   N                    K  +V  KP+ D   N   N+         
Sbjct: 181  HTSNGNSSENVRSEKFEEVKSGGDYGNSDDKRADVTAKPHTDNLLNILGNSQGTFSGNPE 240

Query: 373  XXXXXXXGSCIVRKESRSIQILNEKQNLNTTPKTFVSTELYDGKLVNVVEGLKLYEELFD 194
                    S     +S      NEKQNL  TPK FV+ E+ D + VNVV+GLKLYE L D
Sbjct: 241  AVVVDERCS-PKESDSHPSNNQNEKQNLAITPKIFVAEEMIDEQKVNVVDGLKLYENLLD 299

Query: 193  DLEVSKLVNLVNDLRAAGRRGQLQGQTFIVSKRPTRGHGREMIQLGVPIADAPPEDESAA 14
             LEV KLV+LVN+LRAAGRRGQ QGQT+I+SKRP +GHGREMIQ G+PIADAP E E+  
Sbjct: 300  GLEVPKLVSLVNELRAAGRRGQFQGQTYILSKRPMKGHGREMIQFGLPIADAPAETENET 359

Query: 13   GISK 2
            GISK
Sbjct: 360  GISK 363


>ref|XP_011021730.1| PREDICTED: uncharacterized protein LOC105123720 isoform X4 [Populus
            euphratica]
          Length = 688

 Score =  279 bits (713), Expect = 4e-72
 Identities = 171/363 (47%), Positives = 199/363 (54%), Gaps = 55/363 (15%)
 Frame = -3

Query: 925  MAMQSGNVAAPEKMP---GPGEAVAVRGQWFHY---QQHQ--QLDERDGFLMWLRGEFAA 770
            MAM  GNV  P+KM    G G   A  G   H    Q+HQ   +DERDGF+ WLRGEFAA
Sbjct: 1    MAMPPGNVVIPDKMQFPAGAGGGAASAGNEIHQHHPQRHQWFPVDERDGFISWLRGEFAA 60

Query: 769  ANAIIDSLCHHLRVVGEQGEYDGVIGCIQQRRCNWNPVLHMQQYFPVAEVIYALQQVGWR 590
            ANAIIDSLCHHLR VGE GEYD V+GCIQQRRCNWNPVLHMQQYF V EV+ ALQQ   R
Sbjct: 61   ANAIIDSLCHHLRAVGEPGEYDLVVGCIQQRRCNWNPVLHMQQYFSVGEVVAALQQAVLR 120

Query: 589  RQQR-----------------------------AAGFDGGVKMGSWKXXXXXXXXXXXXX 497
            RQQ+                             +AGF+ G + G                
Sbjct: 121  RQQQQQQQNHHHHQHKFYHDQGKVGGKDFKRSSSAGFNRGYRTGGGGEAVKEGVNYSVEN 180

Query: 496  XGQNSSVEMNTKN------------------LNVYVKPNADTNENARENNNLDXXXXXXX 371
               N +   N ++                   +V  KP+ D   N   N+          
Sbjct: 181  HTSNGNSSENVRSEKFEEVKSGGDYGNSDDKRDVTAKPHTDNLLNILGNSQGTFSGNPEA 240

Query: 370  XXXXXXGSCIVRKESRSIQILNEKQNLNTTPKTFVSTELYDGKLVNVVEGLKLYEELFDD 191
                   S     +S      NEKQNL  TPK FV+ E+ D + VNVV+GLKLYE L D 
Sbjct: 241  VVVDERCS-PKESDSHPSNNQNEKQNLAITPKIFVAEEMIDEQKVNVVDGLKLYENLLDG 299

Query: 190  LEVSKLVNLVNDLRAAGRRGQLQGQTFIVSKRPTRGHGREMIQLGVPIADAPPEDESAAG 11
            LEV KLV+LVN+LRAAGRRGQ QGQT+I+SKRP +GHGREMIQ G+PIADAP E E+  G
Sbjct: 300  LEVPKLVSLVNELRAAGRRGQFQGQTYILSKRPMKGHGREMIQFGLPIADAPAETENETG 359

Query: 10   ISK 2
            ISK
Sbjct: 360  ISK 362


>ref|XP_011021728.1| PREDICTED: uncharacterized protein LOC105123720 isoform X2 [Populus
            euphratica]
          Length = 694

 Score =  279 bits (713), Expect = 4e-72
 Identities = 171/368 (46%), Positives = 200/368 (54%), Gaps = 60/368 (16%)
 Frame = -3

Query: 925  MAMQSGNVAAPEKMP---GPGEAVAVRGQWFHY---QQHQ--QLDERDGFLMWLRGEFAA 770
            MAM  GNV  P+KM    G G   A  G   H    Q+HQ   +DERDGF+ WLRGEFAA
Sbjct: 1    MAMPPGNVVIPDKMQFPAGAGGGAASAGNEIHQHHPQRHQWFPVDERDGFISWLRGEFAA 60

Query: 769  ANAIIDSLCHHLRVVGEQGEYDGVIGCIQQRRCNWNPVLHMQQYFPVAEVIYALQQVGWR 590
            ANAIIDSLCHHLR VGE GEYD V+GCIQQRRCNWNPVLHMQQYF V EV+ ALQQ   R
Sbjct: 61   ANAIIDSLCHHLRAVGEPGEYDLVVGCIQQRRCNWNPVLHMQQYFSVGEVVAALQQAVLR 120

Query: 589  RQQR-----------------------------AAGFDGGVKMGSWKXXXXXXXXXXXXX 497
            RQQ+                             +AGF+ G + G                
Sbjct: 121  RQQQQQQQNHHHHQHKFYHDQGKVGGKDFKRSSSAGFNRGYRTGGGGEAVKEGVNYSVEN 180

Query: 496  XGQNSSVEMNTKN------------------LNVYVKPNADTNENARENNNLDXXXXXXX 371
               N +   N ++                   +V  KP+ D   N   N+          
Sbjct: 181  HTSNGNSSENVRSEKFEEVKSGGDYGNSDDKRDVTAKPHTDNLLNILGNSQGTFSGNPEA 240

Query: 370  XXXXXXGS-----CIVRKESRSIQILNEKQNLNTTPKTFVSTELYDGKLVNVVEGLKLYE 206
                   S      +   +S      NEKQNL  TPK FV+ E+ D + VNVV+GLKLYE
Sbjct: 241  VVVDERCSPKDLVVLPESDSHPSNNQNEKQNLAITPKIFVAEEMIDEQKVNVVDGLKLYE 300

Query: 205  ELFDDLEVSKLVNLVNDLRAAGRRGQLQGQTFIVSKRPTRGHGREMIQLGVPIADAPPED 26
             L D LEV KLV+LVN+LRAAGRRGQ QGQT+I+SKRP +GHGREMIQ G+PIADAP E 
Sbjct: 301  NLLDGLEVPKLVSLVNELRAAGRRGQFQGQTYILSKRPMKGHGREMIQFGLPIADAPAET 360

Query: 25   ESAAGISK 2
            E+  GISK
Sbjct: 361  ENETGISK 368


>gb|KDO60855.1| hypothetical protein CISIN_1g006789mg [Citrus sinensis]
          Length = 631

 Score =  275 bits (702), Expect = 7e-71
 Identities = 146/294 (49%), Positives = 187/294 (63%), Gaps = 16/294 (5%)
 Frame = -3

Query: 835 QQHQQLDERDGFLMWLRGEFAAANAIIDSLCHHLRVVGEQGEYDGVIGCIQQRRCNWNPV 656
           ++ QQL   D F+MWLRGEFAAANAIID+LCHHLRV+GE GEYD  I CIQQRRCNWN V
Sbjct: 5   EKMQQLPADDPFVMWLRGEFAAANAIIDTLCHHLRVIGEPGEYDFAINCIQQRRCNWNSV 64

Query: 655 LHMQQYFPVAEVIYALQQVGWRRQQRAAGFD-------------GGVKMGSWKXXXXXXX 515
           LH+QQYF V+EV+ ALQQV WR+QQR+  FD                K  ++        
Sbjct: 65  LHLQQYFSVSEVMLALQQVAWRKQQRS--FDHHHHHHHQQQHHLNRTKRSAFVKKDFHNN 122

Query: 514 XXXXXXXGQNSSVEMNTKNLNVYVKPNADTNENARENNNLDXXXXXXXXXXXXXGSC--- 344
                    ++S   + K  +V +K + D +  +  N+ +                C   
Sbjct: 123 NNNNNHAFDSNSSAFDDKKADVVMKAHDDGSAKSLGNSEITQVGDAEPKAEALDDGCTPG 182

Query: 343 IVRKESRSIQILNEKQNLNTTPKTFVSTELYDGKLVNVVEGLKLYEELFDDLEVSKLVNL 164
           +   +S+S+Q  NEKQN +   K+FV TE+ DGK+VNVV+GLKLYEE+  + EVSKLV+L
Sbjct: 183 LKENDSQSVQSQNEKQNQSMAAKSFVGTEMVDGKMVNVVDGLKLYEEVSGNSEVSKLVSL 242

Query: 163 VNDLRAAGRRGQLQGQTFIVSKRPTRGHGREMIQLGVPIADAPPEDESAAGISK 2
           VNDLR AG+RGQ+QG  ++VSKRP RGHGRE+IQLG+PI D PPEDE A G S+
Sbjct: 243 VNDLRTAGKRGQIQGPAYVVSKRPIRGHGREVIQLGLPIVDGPPEDEIAGGTSR 296


>ref|XP_012448358.1| PREDICTED: uncharacterized protein LOC105771436 isoform X3
           [Gossypium raimondii] gi|763793869|gb|KJB60865.1|
           hypothetical protein B456_009G328700 [Gossypium
           raimondii]
          Length = 646

 Score =  272 bits (695), Expect = 5e-70
 Identities = 160/336 (47%), Positives = 195/336 (58%), Gaps = 28/336 (8%)
 Frame = -3

Query: 925 MAMQSGNVAAPEKMPGPGEAVAVRGQW----------------------FHYQQHQQL-- 818
           MA+ SGN    +KM  P  + AV G                        F+   H+    
Sbjct: 1   MAVPSGNAVLSDKMQFPAPSAAVAGAGGGDAGAVGGAGGGGGGGGGGAEFNQHHHRNWFP 60

Query: 817 DERDGFLMWLRGEFAAANAIIDSLCHHLRVVGEQGEYDGVIGCIQQRRCNWNPVLHMQQY 638
           DERDGF+ WLRGEFAAANA+IDSLCHHLR VGE GEY+ VI CIQQRRC+WNPVLHMQQY
Sbjct: 61  DERDGFIYWLRGEFAAANAMIDSLCHHLREVGEVGEYEAVIACIQQRRCHWNPVLHMQQY 120

Query: 637 FPVAEVIYALQQVGWRRQQRAA--GFDGGVKMGSWKXXXXXXXXXXXXXXGQNSSVEMNT 464
           F +AEV YALQQV WR +QR    G  GG K                    QNS V+ N 
Sbjct: 121 FSIAEVSYALQQVAWRCRQRHCDHGKVGG-KDFKRSGFGFKGHRVEVAKEIQNSVVDTNG 179

Query: 463 KNLNVYVKPNADTNENARENNNLDXXXXXXXXXXXXXGSCIVRKE--SRSIQILNEKQNL 290
            +        A +  N R +   +                +V +E  S   Q  NE Q L
Sbjct: 180 NS-----TVTAVSERNERGSEKYEELKLGGELGKVEDKGSVVTEEHDSHPAQNQNENQTL 234

Query: 289 NTTPKTFVSTELYDGKLVNVVEGLKLYEELFDDLEVSKLVNLVNDLRAAGRRGQLQGQTF 110
              PKTFV  E++DG +VNVV+GLKLYE+LFD+ EVS LV+L+N+LRAAG+RG  Q QT+
Sbjct: 235 ALLPKTFVGNEMFDGNMVNVVDGLKLYEKLFDEKEVSDLVSLINELRAAGKRGHFQVQTY 294

Query: 109 IVSKRPTRGHGREMIQLGVPIADAPPEDESAAGISK 2
           + SK+P +GHGREMIQLG+PIADAP + E+AA  SK
Sbjct: 295 VASKKPMKGHGREMIQLGLPIADAPLDGETAARTSK 330


>ref|XP_012448359.1| PREDICTED: uncharacterized protein LOC105771436 isoform X4
           [Gossypium raimondii] gi|763793866|gb|KJB60862.1|
           hypothetical protein B456_009G328700 [Gossypium
           raimondii]
          Length = 645

 Score =  272 bits (695), Expect = 5e-70
 Identities = 160/336 (47%), Positives = 195/336 (58%), Gaps = 28/336 (8%)
 Frame = -3

Query: 925 MAMQSGNVAAPEKMPGPGEAVAVRGQW----------------------FHYQQHQQL-- 818
           MA+ SGN    +KM  P  + AV G                        F+   H+    
Sbjct: 1   MAVPSGNAVLSDKMQFPAPSAAVAGAGGGDAGAVGGAGGGGGGGGGGAEFNQHHHRNWFP 60

Query: 817 DERDGFLMWLRGEFAAANAIIDSLCHHLRVVGEQGEYDGVIGCIQQRRCNWNPVLHMQQY 638
           DERDGF+ WLRGEFAAANA+IDSLCHHLR VGE GEY+ VI CIQQRRC+WNPVLHMQQY
Sbjct: 61  DERDGFIYWLRGEFAAANAMIDSLCHHLREVGEVGEYEAVIACIQQRRCHWNPVLHMQQY 120

Query: 637 FPVAEVIYALQQVGWRRQQRAA--GFDGGVKMGSWKXXXXXXXXXXXXXXGQNSSVEMNT 464
           F +AEV YALQQV WR +QR    G  GG K                    QNS V+ N 
Sbjct: 121 FSIAEVSYALQQVAWRCRQRHCDHGKVGG-KDFKRSGFGFKGHRVEVAKEIQNSVVDTNG 179

Query: 463 KNLNVYVKPNADTNENARENNNLDXXXXXXXXXXXXXGSCIVRKE--SRSIQILNEKQNL 290
            +        A +  N R +   +                +V +E  S   Q  NE Q L
Sbjct: 180 NS-----TVTAVSERNERGSEKYEELKLGGELGKVEDKGSVVTEEHDSHPAQNQNENQTL 234

Query: 289 NTTPKTFVSTELYDGKLVNVVEGLKLYEELFDDLEVSKLVNLVNDLRAAGRRGQLQGQTF 110
              PKTFV  E++DG +VNVV+GLKLYE+LFD+ EVS LV+L+N+LRAAG+RG  Q QT+
Sbjct: 235 ALLPKTFVGNEMFDGNMVNVVDGLKLYEKLFDEKEVSDLVSLINELRAAGKRGHFQVQTY 294

Query: 109 IVSKRPTRGHGREMIQLGVPIADAPPEDESAAGISK 2
           + SK+P +GHGREMIQLG+PIADAP + E+AA  SK
Sbjct: 295 VASKKPMKGHGREMIQLGLPIADAPLDGETAARTSK 330


>ref|XP_012448356.1| PREDICTED: uncharacterized protein LOC105771436 isoform X1 [Gossypium
            raimondii]
          Length = 662

 Score =  270 bits (689), Expect = 2e-69
 Identities = 161/347 (46%), Positives = 194/347 (55%), Gaps = 39/347 (11%)
 Frame = -3

Query: 925  MAMQSGNVAAPEKMPGPGEAVAVRGQW----------------------FHYQQHQQL-- 818
            MA+ SGN    +KM  P  + AV G                        F+   H+    
Sbjct: 1    MAVPSGNAVLSDKMQFPAPSAAVAGAGGGDAGAVGGAGGGGGGGGGGAEFNQHHHRNWFP 60

Query: 817  DERDGFLMWLRGEFAAANAIIDSLCHHLRVVGEQGEYDGVIGCIQQRRCNWNPVLHMQQY 638
            DERDGF+ WLRGEFAAANA+IDSLCHHLR VGE GEY+ VI CIQQRRC+WNPVLHMQQY
Sbjct: 61   DERDGFIYWLRGEFAAANAMIDSLCHHLREVGEVGEYEAVIACIQQRRCHWNPVLHMQQY 120

Query: 637  FPVAEVIYALQQVGWRRQQRAA--GFDGGVKMGSWKXXXXXXXXXXXXXXGQNSSVEMNT 464
            F +AEV YALQQV WR +QR    G  GG K                    QNS V+ N 
Sbjct: 121  FSIAEVSYALQQVAWRCRQRHCDHGKVGG-KDFKRSGFGFKGHRVEVAKEIQNSVVDTNG 179

Query: 463  KNLNVYVKPNADTNENARENNNLDXXXXXXXXXXXXXG------SCIVRK-------ESR 323
             +    V    +      E   L                     SC+ R        +S 
Sbjct: 180  NSTVTAVSERNERGSEKYEELKLGGELGKVEDKGSVVTEEKSSLSCLARNSWNRLEHDSH 239

Query: 322  SIQILNEKQNLNTTPKTFVSTELYDGKLVNVVEGLKLYEELFDDLEVSKLVNLVNDLRAA 143
              Q  NE Q L   PKTFV  E++DG +VNVV+GLKLYE+LFD+ EVS LV+L+N+LRAA
Sbjct: 240  PAQNQNENQTLALLPKTFVGNEMFDGNMVNVVDGLKLYEKLFDEKEVSDLVSLINELRAA 299

Query: 142  GRRGQLQGQTFIVSKRPTRGHGREMIQLGVPIADAPPEDESAAGISK 2
            G+RG  Q QT++ SK+P +GHGREMIQLG+PIADAP + E+AA  SK
Sbjct: 300  GKRGHFQVQTYVASKKPMKGHGREMIQLGLPIADAPLDGETAARTSK 346


>ref|XP_012448357.1| PREDICTED: uncharacterized protein LOC105771436 isoform X2 [Gossypium
            raimondii] gi|763793868|gb|KJB60864.1| hypothetical
            protein B456_009G328700 [Gossypium raimondii]
          Length = 661

 Score =  270 bits (689), Expect = 2e-69
 Identities = 161/347 (46%), Positives = 194/347 (55%), Gaps = 39/347 (11%)
 Frame = -3

Query: 925  MAMQSGNVAAPEKMPGPGEAVAVRGQW----------------------FHYQQHQQL-- 818
            MA+ SGN    +KM  P  + AV G                        F+   H+    
Sbjct: 1    MAVPSGNAVLSDKMQFPAPSAAVAGAGGGDAGAVGGAGGGGGGGGGGAEFNQHHHRNWFP 60

Query: 817  DERDGFLMWLRGEFAAANAIIDSLCHHLRVVGEQGEYDGVIGCIQQRRCNWNPVLHMQQY 638
            DERDGF+ WLRGEFAAANA+IDSLCHHLR VGE GEY+ VI CIQQRRC+WNPVLHMQQY
Sbjct: 61   DERDGFIYWLRGEFAAANAMIDSLCHHLREVGEVGEYEAVIACIQQRRCHWNPVLHMQQY 120

Query: 637  FPVAEVIYALQQVGWRRQQRAA--GFDGGVKMGSWKXXXXXXXXXXXXXXGQNSSVEMNT 464
            F +AEV YALQQV WR +QR    G  GG K                    QNS V+ N 
Sbjct: 121  FSIAEVSYALQQVAWRCRQRHCDHGKVGG-KDFKRSGFGFKGHRVEVAKEIQNSVVDTNG 179

Query: 463  KNLNVYVKPNADTNENARENNNLDXXXXXXXXXXXXXG------SCIVRK-------ESR 323
             +    V    +      E   L                     SC+ R        +S 
Sbjct: 180  NSTVTAVSERNERGSEKYEELKLGGELGKVEDKGSVVTEEKSSLSCLARNSWNRLEHDSH 239

Query: 322  SIQILNEKQNLNTTPKTFVSTELYDGKLVNVVEGLKLYEELFDDLEVSKLVNLVNDLRAA 143
              Q  NE Q L   PKTFV  E++DG +VNVV+GLKLYE+LFD+ EVS LV+L+N+LRAA
Sbjct: 240  PAQNQNENQTLALLPKTFVGNEMFDGNMVNVVDGLKLYEKLFDEKEVSDLVSLINELRAA 299

Query: 142  GRRGQLQGQTFIVSKRPTRGHGREMIQLGVPIADAPPEDESAAGISK 2
            G+RG  Q QT++ SK+P +GHGREMIQLG+PIADAP + E+AA  SK
Sbjct: 300  GKRGHFQVQTYVASKKPMKGHGREMIQLGLPIADAPLDGETAARTSK 346

