BLASTX nr result

ID: Papaver32_contig00003796 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Papaver32_contig00003796
         (3153 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

XP_010252088.1 PREDICTED: uncharacterized protein LOC104593785 [...   572   0.0  
KDO49671.1 hypothetical protein CISIN_1g002779mg [Citrus sinensis]    545   e-178
KDO49672.1 hypothetical protein CISIN_1g002779mg [Citrus sinensis]    545   e-178
KDO49669.1 hypothetical protein CISIN_1g002779mg [Citrus sinensis]    545   e-176
XP_008792209.2 PREDICTED: LOW QUALITY PROTEIN: uncharacterized p...   545   e-175
EOY04484.1 NT domain of poly(A) polymerase and terminal uridylyl...   545   e-175
XP_006429558.1 hypothetical protein CICLE_v10011044mg [Citrus cl...   544   e-175
XP_007033558.2 PREDICTED: uncharacterized protein LOC18602238 [T...   541   e-174
XP_010931351.1 PREDICTED: uncharacterized protein LOC105052283 [...   542   e-174
KJB27692.1 hypothetical protein B456_005G005000 [Gossypium raimo...   535   e-174
XP_012089694.1 PREDICTED: uncharacterized protein LOC105648043 [...   541   e-174
XP_008798572.1 PREDICTED: uncharacterized protein LOC103713424 i...   540   e-173
KHG19864.1 Poly (A) RNA polymerase cid14 [Gossypium arboreum]         535   e-172
XP_012481362.1 PREDICTED: uncharacterized protein LOC105796291 i...   535   e-172
XP_007208169.1 hypothetical protein PRUPE_ppa001915mg [Prunus pe...   529   e-171
XP_017633396.1 PREDICTED: uncharacterized protein LOC108475915 i...   533   e-171
OAY44712.1 hypothetical protein MANES_08G174000 [Manihot esculenta]   533   e-170
XP_016685214.1 PREDICTED: uncharacterized protein LOC107903624 i...   532   e-170
XP_016724078.1 PREDICTED: uncharacterized protein LOC107935961 [...   531   e-170
KJB27693.1 hypothetical protein B456_005G005100 [Gossypium raimo...   526   e-170

>XP_010252088.1 PREDICTED: uncharacterized protein LOC104593785 [Nelumbo nucifera]
          Length = 914

 Score =  572 bits (1474), Expect = 0.0
 Identities = 294/430 (68%), Positives = 336/430 (78%), Gaps = 1/430 (0%)
 Frame = +2

Query: 1784 MGDLEIWPPTANGISIDEEEEEDNLFLXXXXXXXXXXXXXXXICIDSQCWSKAEETTQEI 1963
            MGDL+ W P  +GI  ++ +                        I +  WS+AE TT EI
Sbjct: 1    MGDLQAWLPLPDGILTEDRQ--------FPAPSSSSSPNPNPFSIGAGSWSRAELTTHEI 52

Query: 1964 IHKIQPTVVSEHRRQIVIQFVQNLIRGYLGCEVCPFGSVPLKTYLPDGDIDLTALSIQNV 2143
            + +IQPTVVSE RR+ VI +VQ LIRGYLG EV PFGSVPLKTYLPDGDIDLTALS QNV
Sbjct: 53   VCRIQPTVVSEERRKAVIDYVQRLIRGYLGSEVFPFGSVPLKTYLPDGDIDLTALSYQNV 112

Query: 2144 EDALANDVRIVLETEERNKSSEFEVKDVQYIQAEVKLVKCLVQNIIVDISFNQIGGLSTL 2323
            EDALANDVR VLE EE+N ++EFEVKDVQYI AEVKLVKCLVQNI+VDISFNQ+GGL TL
Sbjct: 113  EDALANDVRTVLEGEEQNNAAEFEVKDVQYIHAEVKLVKCLVQNIVVDISFNQLGGLCTL 172

Query: 2324 CFLEQVDRRIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHSS 2503
            CFLE++D+ IGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHL+HSS
Sbjct: 173  CFLERIDQLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLYHSS 232

Query: 2504 LNGPLAVLYRFLDYYSKFDWDNYCISLNGPVNVRSLPEIVAEAPDNXXXXXXXXXXFLRR 2683
            L+GPLAVLYRFLDY+SKFDWDNYCISLNGPV + SLPEIVAE P+N          FL+ 
Sbjct: 233  LDGPLAVLYRFLDYFSKFDWDNYCISLNGPVFLSSLPEIVAEVPENGRTDLLLSKEFLKN 292

Query: 2684 CADMFALPSKGLETNSRAFIQKYLNIVDPLKDNNNLGRSVSKGNFFRIRSAFSYGARKLG 2863
            C D+F++P++G ETNSRAF +K+LNI+DPLK+NNNLGRSVSKGNF+RIRSAF+YGARKLG
Sbjct: 293  CMDVFSVPARGNETNSRAFPKKHLNIIDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLG 352

Query: 2864 RILLLPGD-IHDELTKFFGNTLDRHGTGVRPDVQNSAPDFIAYGSQHTSYQSSTVKCDEG 3040
            RILLLPG+ +  EL KFF NTLDRHG G RPDVQ+  P F   GS  TS +S   K  E 
Sbjct: 353  RILLLPGESLEVELKKFFMNTLDRHGNGQRPDVQDPVPHFCDNGSGLTSSKSGIGKIRED 412

Query: 3041 RRVSALSAID 3070
            +  S   +ID
Sbjct: 413  KSHSESPSID 422


>KDO49671.1 hypothetical protein CISIN_1g002779mg [Citrus sinensis]
          Length = 710

 Score =  545 bits (1405), Expect = e-178
 Identities = 279/423 (65%), Positives = 320/423 (75%), Gaps = 1/423 (0%)
 Frame = +2

Query: 1784 MGDLEIWPPTANGISIDEEEEEDNLFLXXXXXXXXXXXXXXXICIDSQCWSKAEETTQEI 1963
            MGDL  W P  NG    E     +  +                 I ++ W +AEE TQ I
Sbjct: 1    MGDLRDWSPEPNGAVFGERPSSSSSSVPSNQT-----------AIGAEYWQRAEEATQGI 49

Query: 1964 IHKIQPTVVSEHRRQIVIQFVQNLIRGYLGCEVCPFGSVPLKTYLPDGDIDLTALSIQNV 2143
            I ++QPTVVSE RR+ VI +VQ LIR YLGCEV PFGSVPLKTYLPDGDIDLTA    NV
Sbjct: 50   IAQVQPTVVSEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNV 109

Query: 2144 EDALANDVRIVLETEERNKSSEFEVKDVQYIQAEVKLVKCLVQNIIVDISFNQIGGLSTL 2323
            E+ALANDV  VLE E++NK++EF VKD Q I+AEVKLVKCLVQNI+VDISFNQ+GGLSTL
Sbjct: 110  EEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTL 169

Query: 2324 CFLEQVDRRIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHSS 2503
            CFLEQVDR IGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHSS
Sbjct: 170  CFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHSS 229

Query: 2504 LNGPLAVLYRFLDYYSKFDWDNYCISLNGPVNVRSLPEIVAEAPDNXXXXXXXXXXFLRR 2683
            LNGPLAVLY+FLDY+SKFDWD+YCISLNGPV + SLPE+V E P+N          FL+ 
Sbjct: 230  LNGPLAVLYKFLDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLKE 289

Query: 2684 CADMFALPSKGLETNSRAFIQKYLNIVDPLKDNNNLGRSVSKGNFFRIRSAFSYGARKLG 2863
            C + F++PS+G +TNSR+F  K+LNIVDPLK+NNNLGRSVSKGNF+RIRSAF+YGARKLG
Sbjct: 290  CVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLG 349

Query: 2864 RILLLPGD-IHDELTKFFGNTLDRHGTGVRPDVQNSAPDFIAYGSQHTSYQSSTVKCDEG 3040
             IL  P + + DEL KFF NTLDRHG+G RPDVQ+  P     G   +S  S T  C E 
Sbjct: 350  HILSQPEESLTDELRKFFSNTLDRHGSGQRPDVQDPVPLSRYNGFGVSSTFSGTELCRED 409

Query: 3041 RRV 3049
            + +
Sbjct: 410  QTI 412


>KDO49672.1 hypothetical protein CISIN_1g002779mg [Citrus sinensis]
          Length = 729

 Score =  545 bits (1405), Expect = e-178
 Identities = 279/423 (65%), Positives = 320/423 (75%), Gaps = 1/423 (0%)
 Frame = +2

Query: 1784 MGDLEIWPPTANGISIDEEEEEDNLFLXXXXXXXXXXXXXXXICIDSQCWSKAEETTQEI 1963
            MGDL  W P  NG    E     +  +                 I ++ W +AEE TQ I
Sbjct: 1    MGDLRDWSPEPNGAVFGERPSSSSSSVPSNQT-----------AIGAEYWQRAEEATQGI 49

Query: 1964 IHKIQPTVVSEHRRQIVIQFVQNLIRGYLGCEVCPFGSVPLKTYLPDGDIDLTALSIQNV 2143
            I ++QPTVVSE RR+ VI +VQ LIR YLGCEV PFGSVPLKTYLPDGDIDLTA    NV
Sbjct: 50   IAQVQPTVVSEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNV 109

Query: 2144 EDALANDVRIVLETEERNKSSEFEVKDVQYIQAEVKLVKCLVQNIIVDISFNQIGGLSTL 2323
            E+ALANDV  VLE E++NK++EF VKD Q I+AEVKLVKCLVQNI+VDISFNQ+GGLSTL
Sbjct: 110  EEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTL 169

Query: 2324 CFLEQVDRRIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHSS 2503
            CFLEQVDR IGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHSS
Sbjct: 170  CFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHSS 229

Query: 2504 LNGPLAVLYRFLDYYSKFDWDNYCISLNGPVNVRSLPEIVAEAPDNXXXXXXXXXXFLRR 2683
            LNGPLAVLY+FLDY+SKFDWD+YCISLNGPV + SLPE+V E P+N          FL+ 
Sbjct: 230  LNGPLAVLYKFLDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLKE 289

Query: 2684 CADMFALPSKGLETNSRAFIQKYLNIVDPLKDNNNLGRSVSKGNFFRIRSAFSYGARKLG 2863
            C + F++PS+G +TNSR+F  K+LNIVDPLK+NNNLGRSVSKGNF+RIRSAF+YGARKLG
Sbjct: 290  CVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLG 349

Query: 2864 RILLLPGD-IHDELTKFFGNTLDRHGTGVRPDVQNSAPDFIAYGSQHTSYQSSTVKCDEG 3040
             IL  P + + DEL KFF NTLDRHG+G RPDVQ+  P     G   +S  S T  C E 
Sbjct: 350  HILSQPEESLTDELRKFFSNTLDRHGSGQRPDVQDPVPLSRYNGFGVSSTFSGTELCRED 409

Query: 3041 RRV 3049
            + +
Sbjct: 410  QTI 412


>KDO49669.1 hypothetical protein CISIN_1g002779mg [Citrus sinensis]
          Length = 882

 Score =  545 bits (1405), Expect = e-176
 Identities = 279/423 (65%), Positives = 320/423 (75%), Gaps = 1/423 (0%)
 Frame = +2

Query: 1784 MGDLEIWPPTANGISIDEEEEEDNLFLXXXXXXXXXXXXXXXICIDSQCWSKAEETTQEI 1963
            MGDL  W P  NG    E     +  +                 I ++ W +AEE TQ I
Sbjct: 1    MGDLRDWSPEPNGAVFGERPSSSSSSVPSNQT-----------AIGAEYWQRAEEATQGI 49

Query: 1964 IHKIQPTVVSEHRRQIVIQFVQNLIRGYLGCEVCPFGSVPLKTYLPDGDIDLTALSIQNV 2143
            I ++QPTVVSE RR+ VI +VQ LIR YLGCEV PFGSVPLKTYLPDGDIDLTA    NV
Sbjct: 50   IAQVQPTVVSEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNV 109

Query: 2144 EDALANDVRIVLETEERNKSSEFEVKDVQYIQAEVKLVKCLVQNIIVDISFNQIGGLSTL 2323
            E+ALANDV  VLE E++NK++EF VKD Q I+AEVKLVKCLVQNI+VDISFNQ+GGLSTL
Sbjct: 110  EEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTL 169

Query: 2324 CFLEQVDRRIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHSS 2503
            CFLEQVDR IGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHSS
Sbjct: 170  CFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHSS 229

Query: 2504 LNGPLAVLYRFLDYYSKFDWDNYCISLNGPVNVRSLPEIVAEAPDNXXXXXXXXXXFLRR 2683
            LNGPLAVLY+FLDY+SKFDWD+YCISLNGPV + SLPE+V E P+N          FL+ 
Sbjct: 230  LNGPLAVLYKFLDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLKE 289

Query: 2684 CADMFALPSKGLETNSRAFIQKYLNIVDPLKDNNNLGRSVSKGNFFRIRSAFSYGARKLG 2863
            C + F++PS+G +TNSR+F  K+LNIVDPLK+NNNLGRSVSKGNF+RIRSAF+YGARKLG
Sbjct: 290  CVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLG 349

Query: 2864 RILLLPGD-IHDELTKFFGNTLDRHGTGVRPDVQNSAPDFIAYGSQHTSYQSSTVKCDEG 3040
             IL  P + + DEL KFF NTLDRHG+G RPDVQ+  P     G   +S  S T  C E 
Sbjct: 350  HILSQPEESLTDELRKFFSNTLDRHGSGQRPDVQDPVPLSRYNGFGVSSTFSGTELCRED 409

Query: 3041 RRV 3049
            + +
Sbjct: 410  QTI 412


>XP_008792209.2 PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103708884
            [Phoenix dactylifera]
          Length = 905

 Score =  545 bits (1405), Expect = e-175
 Identities = 279/428 (65%), Positives = 325/428 (75%), Gaps = 9/428 (2%)
 Frame = +2

Query: 1796 EIWPPTANGISIDEEEEEDNLFLXXXXXXXXXXXXXXXICIDSQCWSKAEETTQEIIHKI 1975
            E W P +NG + D   +  +                    I ++ W +AE+ TQE+I  I
Sbjct: 8    EAWVPQSNGAAGDGNRQPPSA----------QQSNPEPSAISAERWRQAEQATQEVIQCI 57

Query: 1976 QPTVVSEHRRQIVIQFVQNLIRGYLGCEVCPFGSVPLKTYLPDGDIDLTALSIQNVEDAL 2155
            QPTV+SE RR+ V+++VQ LIRGYL  E+ PFGSVPLKTYLPDGDIDLTAL + N ED L
Sbjct: 58   QPTVISEQRRRAVLEYVQKLIRGYLATEIFPFGSVPLKTYLPDGDIDLTALGVPNSEDVL 117

Query: 2156 ANDVRIVLETEERNKSSEFEVKDVQYIQAEVKLVKCLVQNIIVDISFNQIGGLSTLCFLE 2335
            AN+VR VLE EE+NK +EFEVKDVQYI AEVKLVKC+VQNI+VDISFNQIGGL TLCFLE
Sbjct: 118  ANEVRSVLEVEEQNKDAEFEVKDVQYIHAEVKLVKCIVQNIVVDISFNQIGGLCTLCFLE 177

Query: 2336 QVDRRIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHSSLNGP 2515
            QVD +IGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFH SL GP
Sbjct: 178  QVDSQIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHISLEGP 237

Query: 2516 LAVLYRFLDYYSKFDWDNYCISLNGPVNVRSLPEIVAEAPDNXXXXXXXXXXFLRRCADM 2695
            LAVLYRFLDYYSKFDWDNYCISL+GP+ + SLPE+VAE P+           FL++C DM
Sbjct: 238  LAVLYRFLDYYSKFDWDNYCISLHGPIPISSLPELVAEPPETHESDLLLSKEFLKKCVDM 297

Query: 2696 FALPSKGLETNSRAFIQKYLNIVDPLKDNNNLGRSVSKGNFFRIRSAFSYGARKLGRILL 2875
            F++PS+G E NSR F QK+LNIVDPLK+NNNLGRSVSKGNF RIRSAF+YGARKLGRILL
Sbjct: 298  FSVPSRGSENNSRIFSQKHLNIVDPLKENNNLGRSVSKGNFHRIRSAFTYGARKLGRILL 357

Query: 2876 LPGD-IHDELTKFFGNTLDRHGTGVRPDVQNSAP--------DFIAYGSQHTSYQSSTVK 3028
            LP + + DE+T FF NTLDRHG+G RPDVQ+  P        D    GS  ++ +    K
Sbjct: 358  LPAENMADEVTMFFTNTLDRHGSGDRPDVQDVFPSHSDSTMIDHDGLGSMSSNLK--VEK 415

Query: 3029 CDEGRRVS 3052
            CD+ + +S
Sbjct: 416  CDKDKLMS 423


>EOY04484.1 NT domain of poly(A) polymerase and terminal uridylyl
            transferase-containing protein, putative [Theobroma
            cacao]
          Length = 890

 Score =  545 bits (1403), Expect = e-175
 Identities = 277/420 (65%), Positives = 320/420 (76%), Gaps = 2/420 (0%)
 Frame = +2

Query: 1784 MGDLEIWPPTANGISIDEEEEEDNLFLXXXXXXXXXXXXXXXICIDSQCWSKAEETTQEI 1963
            MGDL  W P  NG++ +E     +                    I ++ W KAEE TQ I
Sbjct: 4    MGDLRDWSPEPNGVASEERSSSSS------------SSSSNQAGIAAEYWKKAEEATQGI 51

Query: 1964 IHKIQPTVVSEHRRQIVIQFVQNLIRGYLGCEVCPFGSVPLKTYLPDGDIDLTALSIQNV 2143
            I ++QPTVVSE RR+ VI +VQ LI  YLGC V PFGSVPLKTYLPDGDIDLTA    N 
Sbjct: 52   IAQVQPTVVSEERRKAVIDYVQRLIGNYLGCGVFPFGSVPLKTYLPDGDIDLTAFGGLNF 111

Query: 2144 EDALANDVRIVLETEERNKSSEFEVKDVQYIQAEVKLVKCLVQNIIVDISFNQIGGLSTL 2323
            E+ALANDV  VLE E+ N+++EF VKDVQ I+AEVKLVKCLVQNI+VDISFNQ+GGL TL
Sbjct: 112  EEALANDVCSVLEREDHNRAAEFVVKDVQLIRAEVKLVKCLVQNIVVDISFNQLGGLCTL 171

Query: 2324 CFLEQVDRRIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHSS 2503
            CFLE+VDRRIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHSS
Sbjct: 172  CFLEKVDRRIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHSS 231

Query: 2504 LNGPLAVLYRFLDYYSKFDWDNYCISLNGPVNVRSLPEIVAEAPDNXXXXXXXXXXFLRR 2683
            L+GPLAVLY+FLDY+SKFDWDNYCISLNGP+++ SLPE+V E P+N          FL+ 
Sbjct: 232  LDGPLAVLYKFLDYFSKFDWDNYCISLNGPIHISSLPEVVVETPENGGGDLLLSNDFLKE 291

Query: 2684 CADMFALPSKGLETNSRAFIQKYLNIVDPLKDNNNLGRSVSKGNFFRIRSAFSYGARKLG 2863
            C +MF++PS+G ETNSR F QK+LNIVDPL++NNNLGRSVSKGNF+RIRSAF+YGARKLG
Sbjct: 292  CVEMFSVPSRGFETNSRTFPQKHLNIVDPLRENNNLGRSVSKGNFYRIRSAFTYGARKLG 351

Query: 2864 RILLLPGD-IHDELTKFFGNTLDRHGTGVRPDVQNSAPDFIAY-GSQHTSYQSSTVKCDE 3037
            +IL    + + DEL KFF NTLDRHG+G RPDVQ+  P    + G   TS  S T  C E
Sbjct: 352  KILSQAEESMADELRKFFSNTLDRHGSGQRPDVQDCIPSLSRFSGFGATSSVSGTESCQE 411


>XP_006429558.1 hypothetical protein CICLE_v10011044mg [Citrus clementina]
            XP_006481174.1 PREDICTED: uncharacterized protein
            LOC102622468 [Citrus sinensis] ESR42798.1 hypothetical
            protein CICLE_v10011044mg [Citrus clementina]
          Length = 882

 Score =  544 bits (1401), Expect = e-175
 Identities = 273/398 (68%), Positives = 311/398 (78%), Gaps = 1/398 (0%)
 Frame = +2

Query: 1784 MGDLEIWPPTANGISIDEEEEEDNLFLXXXXXXXXXXXXXXXICIDSQCWSKAEETTQEI 1963
            MGDL  W P  NG    E     +  +                 I ++ W +AEE TQ I
Sbjct: 1    MGDLRDWSPEPNGAVFGERPSSSSSSVPSNQT-----------AIGAEYWQRAEEATQAI 49

Query: 1964 IHKIQPTVVSEHRRQIVIQFVQNLIRGYLGCEVCPFGSVPLKTYLPDGDIDLTALSIQNV 2143
            I ++QPTVVSE RR+ VI +VQ LIR YLGCEV PFGSVPLKTYLPDGDIDLTA    NV
Sbjct: 50   IAQVQPTVVSEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNV 109

Query: 2144 EDALANDVRIVLETEERNKSSEFEVKDVQYIQAEVKLVKCLVQNIIVDISFNQIGGLSTL 2323
            E+ALANDV  VLE E++NK++EF VKD Q I+AEVKLVKCLVQNI+VDISFNQ+GGLSTL
Sbjct: 110  EEALANDVCSVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTL 169

Query: 2324 CFLEQVDRRIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHSS 2503
            CFLEQVDR IGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHSS
Sbjct: 170  CFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHSS 229

Query: 2504 LNGPLAVLYRFLDYYSKFDWDNYCISLNGPVNVRSLPEIVAEAPDNXXXXXXXXXXFLRR 2683
            LNGPLAVLY+FLDY+SKFDWD+YCISLNGPV + SLPE+V E P+N          FL+ 
Sbjct: 230  LNGPLAVLYKFLDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLKE 289

Query: 2684 CADMFALPSKGLETNSRAFIQKYLNIVDPLKDNNNLGRSVSKGNFFRIRSAFSYGARKLG 2863
            C + F++PS+G +TNSR+F  K+LNIVDPLK+NNNLGRSVSKGNF+RIRSAF+YGARKLG
Sbjct: 290  CVEQFSVPSRGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLG 349

Query: 2864 RILLLPGD-IHDELTKFFGNTLDRHGTGVRPDVQNSAP 2974
             IL  P + + DEL KFF NTLDRHG+G RPDVQ+  P
Sbjct: 350  HILSQPEESLTDELRKFFSNTLDRHGSGQRPDVQDPVP 387


>XP_007033558.2 PREDICTED: uncharacterized protein LOC18602238 [Theobroma cacao]
          Length = 890

 Score =  541 bits (1395), Expect = e-174
 Identities = 276/420 (65%), Positives = 319/420 (75%), Gaps = 2/420 (0%)
 Frame = +2

Query: 1784 MGDLEIWPPTANGISIDEEEEEDNLFLXXXXXXXXXXXXXXXICIDSQCWSKAEETTQEI 1963
            MGDL  W P  NG++ +E     +                    I ++ W KAEE TQ I
Sbjct: 4    MGDLRDWSPEPNGVASEERSSSSS------------SSSSNQAGIAAEYWKKAEEATQGI 51

Query: 1964 IHKIQPTVVSEHRRQIVIQFVQNLIRGYLGCEVCPFGSVPLKTYLPDGDIDLTALSIQNV 2143
            I ++QPTVVSE RR+ VI +VQ LI  YLGC V PFGSVPLKTYLPDGDIDLTA    N 
Sbjct: 52   IAQVQPTVVSEERRKAVIDYVQRLIGNYLGCGVFPFGSVPLKTYLPDGDIDLTAFGGLNF 111

Query: 2144 EDALANDVRIVLETEERNKSSEFEVKDVQYIQAEVKLVKCLVQNIIVDISFNQIGGLSTL 2323
            E+ALANDV  VLE E+ N+++EF VKDVQ I+AEVKLVKCLVQNI+VDISFNQ+GGL TL
Sbjct: 112  EEALANDVCSVLEREDHNRAAEFVVKDVQLIRAEVKLVKCLVQNIVVDISFNQLGGLCTL 171

Query: 2324 CFLEQVDRRIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHSS 2503
            CFLE+VDR IGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHSS
Sbjct: 172  CFLEKVDRCIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHSS 231

Query: 2504 LNGPLAVLYRFLDYYSKFDWDNYCISLNGPVNVRSLPEIVAEAPDNXXXXXXXXXXFLRR 2683
            L+GPLAVLY+FLDY+SKFDWDNYCISLNGP+++ SLPE+V E P+N          FL+ 
Sbjct: 232  LDGPLAVLYKFLDYFSKFDWDNYCISLNGPIHISSLPEVVVETPENGGGDLLLSNDFLKE 291

Query: 2684 CADMFALPSKGLETNSRAFIQKYLNIVDPLKDNNNLGRSVSKGNFFRIRSAFSYGARKLG 2863
            C +MF++PS+G ETNSR F QK+LNIVDPL++NNNLGRSVSKGNF+RIRSAF+YGARKLG
Sbjct: 292  CVEMFSVPSRGFETNSRTFPQKHLNIVDPLRENNNLGRSVSKGNFYRIRSAFTYGARKLG 351

Query: 2864 RILLLPGD-IHDELTKFFGNTLDRHGTGVRPDVQNSAPDFIAY-GSQHTSYQSSTVKCDE 3037
            +IL    + + DEL KFF NTLDRHG+G RPDVQ+  P    + G   TS  S T  C E
Sbjct: 352  KILSQAEESMADELRKFFSNTLDRHGSGQRPDVQDCIPSLSRFSGFGATSSVSGTESCQE 411


>XP_010931351.1 PREDICTED: uncharacterized protein LOC105052283 [Elaeis guineensis]
          Length = 905

 Score =  542 bits (1396), Expect = e-174
 Identities = 267/398 (67%), Positives = 315/398 (79%), Gaps = 1/398 (0%)
 Frame = +2

Query: 1784 MGDLEIWPPTANGISIDEEEEEDNLFLXXXXXXXXXXXXXXXICIDSQCWSKAEETTQEI 1963
            MGDL+ W P +NG + D   +  +                    I ++ W +AE+ TQE+
Sbjct: 1    MGDLQAWVPQSNGAAGDGNTQTPSA----------QQSNPEPSAISAEGWQQAEQATQEV 50

Query: 1964 IHKIQPTVVSEHRRQIVIQFVQNLIRGYLGCEVCPFGSVPLKTYLPDGDIDLTALSIQNV 2143
            I  IQPTV+SE RR++V+++VQ LI+GYL  E+ PFGSVPLKTYLPDGDIDL AL + N 
Sbjct: 51   IQCIQPTVISEQRRRVVVEYVQKLIQGYLATEIFPFGSVPLKTYLPDGDIDLIALGMPNS 110

Query: 2144 EDALANDVRIVLETEERNKSSEFEVKDVQYIQAEVKLVKCLVQNIIVDISFNQIGGLSTL 2323
            ED LAN+VR VLE EE+NK +EFEVKDVQYI AEVKLVKC+VQNI+VDISFNQIGGL TL
Sbjct: 111  EDVLANEVRSVLEVEEQNKDAEFEVKDVQYIHAEVKLVKCIVQNIVVDISFNQIGGLCTL 170

Query: 2324 CFLEQVDRRIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHSS 2503
            CFLEQVD +IG+DHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFH S
Sbjct: 171  CFLEQVDNQIGRDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHKS 230

Query: 2504 LNGPLAVLYRFLDYYSKFDWDNYCISLNGPVNVRSLPEIVAEAPDNXXXXXXXXXXFLRR 2683
            L+GPL VLYRFLDYYSKFDWDNYCISL+GP+ + SLPE+VAE P+           FL++
Sbjct: 231  LDGPLVVLYRFLDYYSKFDWDNYCISLHGPIPISSLPELVAEPPETHESDSLLSKDFLKK 290

Query: 2684 CADMFALPSKGLETNSRAFIQKYLNIVDPLKDNNNLGRSVSKGNFFRIRSAFSYGARKLG 2863
            C DMF++P +GLE NSR F QK+LNIVDPLK+NNNLGRS+SKGN +RIRSAF+YGARKLG
Sbjct: 291  CVDMFSVPLRGLENNSRTFSQKHLNIVDPLKENNNLGRSISKGNSYRIRSAFTYGARKLG 350

Query: 2864 RILLLPGD-IHDELTKFFGNTLDRHGTGVRPDVQNSAP 2974
            RILLLP + + D++T FF NTL+RHG+G RPDVQ   P
Sbjct: 351  RILLLPPENMADQVTMFFTNTLERHGSGDRPDVQGVFP 388


>KJB27692.1 hypothetical protein B456_005G005000 [Gossypium raimondii]
          Length = 737

 Score =  535 bits (1379), Expect = e-174
 Identities = 275/419 (65%), Positives = 313/419 (74%), Gaps = 1/419 (0%)
 Frame = +2

Query: 1784 MGDLEIWPPTANGISIDEEEEEDNLFLXXXXXXXXXXXXXXXICIDSQCWSKAEETTQEI 1963
            MGDL  W P  NG+S  +     +                    I ++ W KAEE TQ I
Sbjct: 4    MGDLRDWSPEPNGVSSRDRYSSSS------------SSSSNQAGISAEYWRKAEEATQGI 51

Query: 1964 IHKIQPTVVSEHRRQIVIQFVQNLIRGYLGCEVCPFGSVPLKTYLPDGDIDLTALSIQNV 2143
            I ++QPTVVSE RR+ VI +VQ LIR YLGCEV PFGSVPLKTYLPDGDIDLTA    N 
Sbjct: 52   IARVQPTVVSEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNF 111

Query: 2144 EDALANDVRIVLETEERNKSSEFEVKDVQYIQAEVKLVKCLVQNIIVDISFNQIGGLSTL 2323
            E+ALAND   VLE E+RN ++EF VKDVQ I+AEVKLVKCLVQNI+VDISFNQ+GGL TL
Sbjct: 112  EEALANDACSVLEREDRNTAAEFVVKDVQLIRAEVKLVKCLVQNIVVDISFNQLGGLCTL 171

Query: 2324 CFLEQVDRRIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHSS 2503
            CFLEQVDR IG+DHLFKRSIILIKAWCYYESRILGAHHGLISTY LETLVLYIFHLFHSS
Sbjct: 172  CFLEQVDRLIGQDHLFKRSIILIKAWCYYESRILGAHHGLISTYGLETLVLYIFHLFHSS 231

Query: 2504 LNGPLAVLYRFLDYYSKFDWDNYCISLNGPVNVRSLPEIVAEAPDNXXXXXXXXXXFLRR 2683
            L+GPLAVLY+FLDY+SKFDW+NYCISLNGP+ + SLP+IV E P+N          FLR 
Sbjct: 232  LDGPLAVLYKFLDYFSKFDWENYCISLNGPIPISSLPDIVVETPENGGGDLLLSNDFLRE 291

Query: 2684 CADMFALPSKGLETNSRAFIQKYLNIVDPLKDNNNLGRSVSKGNFFRIRSAFSYGARKLG 2863
            C + F++PS+G + NSR F QK+LNIVDPLK+NNNLGRSVSKGNF+RIRSAF+YGARKLG
Sbjct: 292  CVETFSVPSRGFDANSRIFPQKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLG 351

Query: 2864 RILLLPGD-IHDELTKFFGNTLDRHGTGVRPDVQNSAPDFIAYGSQHTSYQSSTVKCDE 3037
            +IL    + + DEL KFF NTLDRHG G RPDVQ+ AP     G   T   S T  C E
Sbjct: 352  QILSQSEETLGDELHKFFSNTLDRHGNGQRPDVQDPAPLSRFRGLGATPSVSGTESCQE 410


>XP_012089694.1 PREDICTED: uncharacterized protein LOC105648043 [Jatropha curcas]
            KDP22776.1 hypothetical protein JCGZ_00363 [Jatropha
            curcas]
          Length = 900

 Score =  541 bits (1393), Expect = e-174
 Identities = 272/404 (67%), Positives = 314/404 (77%), Gaps = 1/404 (0%)
 Frame = +2

Query: 1784 MGDLEIWPPTANGISIDEEEEEDNLFLXXXXXXXXXXXXXXXICIDSQCWSKAEETTQEI 1963
            MGDL  W P  NG+ ++E     +                  + I ++ W KAE+ TQ I
Sbjct: 1    MGDLRAWSPEPNGVVLEERPSWSS-----------SSQGNQTVIISAEYWQKAEDLTQGI 49

Query: 1964 IHKIQPTVVSEHRRQIVIQFVQNLIRGYLGCEVCPFGSVPLKTYLPDGDIDLTALSIQNV 2143
            I ++QPTVVSE RR+ VI +VQ LIR  +GCEV PFGSVPLKTYLPDGDIDLTA    NV
Sbjct: 50   IAQVQPTVVSEERRKAVIDYVQRLIRKSIGCEVFPFGSVPLKTYLPDGDIDLTAFGGMNV 109

Query: 2144 EDALANDVRIVLETEERNKSSEFEVKDVQYIQAEVKLVKCLVQNIIVDISFNQIGGLSTL 2323
            E+ LANDV  VLE E++N+++EF VKDVQ I+AEVKLVKCLVQNI+VDISFNQ+GGL TL
Sbjct: 110  EEVLANDVCSVLEREDKNRTAEFIVKDVQLIRAEVKLVKCLVQNIVVDISFNQLGGLCTL 169

Query: 2324 CFLEQVDRRIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHSS 2503
            CFLE+VDR IGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHSS
Sbjct: 170  CFLEKVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHSS 229

Query: 2504 LNGPLAVLYRFLDYYSKFDWDNYCISLNGPVNVRSLPEIVAEAPDNXXXXXXXXXXFLRR 2683
            LNGPLAVLY+FLDY+SKFDWD YCISLNGPV + SLPE++ E P+N          FL+ 
Sbjct: 230  LNGPLAVLYKFLDYFSKFDWDTYCISLNGPVRISSLPEVLVETPENGTCDLLLTNDFLKE 289

Query: 2684 CADMFALPSKGLETNSRAFIQKYLNIVDPLKDNNNLGRSVSKGNFFRIRSAFSYGARKLG 2863
            C D F++P++G ETNSRAF  K+LNIVDPLK+NNNLGRSVSKGNF+RIRSAFSYGARKLG
Sbjct: 290  CVDTFSVPARGYETNSRAFSPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFSYGARKLG 349

Query: 2864 RILLLPGD-IHDELTKFFGNTLDRHGTGVRPDVQNSAPDFIAYG 2992
             IL  P + I  EL+KFF NTLDRHG+G RPDVQ+ AP    +G
Sbjct: 350  LILSQPEEIIAAELSKFFSNTLDRHGSGQRPDVQDPAPSESQHG 393


>XP_008798572.1 PREDICTED: uncharacterized protein LOC103713424 isoform X1 [Phoenix
            dactylifera]
          Length = 904

 Score =  540 bits (1392), Expect = e-173
 Identities = 284/459 (61%), Positives = 332/459 (72%), Gaps = 4/459 (0%)
 Frame = +2

Query: 1784 MGDLEIWPPTANGISIDEEEEEDNLFLXXXXXXXXXXXXXXXICIDSQCWSKAEETTQEI 1963
            MGDL+ W P  NG + D   +   +                   I ++ W +AE+ TQE+
Sbjct: 1    MGDLQAWVPQPNGAAGDGNPQPPTV----------QPSNPHPSAIRAESWRRAEQATQEV 50

Query: 1964 IHKIQPTVVSEHRRQIVIQFVQNLIRGYLGCEVCPFGSVPLKTYLPDGDIDLTALSIQNV 2143
            I  IQPTVVSE RR+ V+++VQ LIRGYL  E+ PFGSVPLKTYLPDGDIDLTA  I   
Sbjct: 51   IQCIQPTVVSEQRRRAVVEYVQKLIRGYLAIEIFPFGSVPLKTYLPDGDIDLTAAGIP-- 108

Query: 2144 EDALANDVRIVLETEERNKSSEFEVKDVQYIQAEVKLVKCLVQNIIVDISFNQIGGLSTL 2323
            EDALA++V  VLE EE+NK +EFEVKDVQYI AEVKLVKC+VQNI+VDISFNQIGGL TL
Sbjct: 109  EDALASEVHSVLEVEEQNKDAEFEVKDVQYIHAEVKLVKCIVQNIVVDISFNQIGGLCTL 168

Query: 2324 CFLEQVDRRIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHSS 2503
            CFLEQVD +IGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVL IFH FH S
Sbjct: 169  CFLEQVDNQIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLCIFHFFHKS 228

Query: 2504 LNGPLAVLYRFLDYYSKFDWDNYCISLNGPVNVRSLPEIVAEAPDNXXXXXXXXXXFLRR 2683
            L+GPLAVLYRFLDYYSKFDWDNYCISL GP+ V SLPE+VAE  +           FL+ 
Sbjct: 229  LDGPLAVLYRFLDYYSKFDWDNYCISLRGPIPVSSLPELVAEPLETQGGDLLLGEEFLKN 288

Query: 2684 CADMFALPSKGLETNSRAFIQKYLNIVDPLKDNNNLGRSVSKGNFFRIRSAFSYGARKLG 2863
            C D F++P +GLE NSR F QK+LNIVDPLK+NNNLGRSVSKGNF+RIRSAF+YGARKLG
Sbjct: 289  CVDKFSVPPRGLENNSRTFSQKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLG 348

Query: 2864 RILLLPGD-IHDELTKFFGNTLDRHGTGVRPDVQNSAP---DFIAYGSQHTSYQSSTVKC 3031
            RILLLP D I DE+  FF NTL+RHG+GVRPDVQ+ +P   D          + SS +K 
Sbjct: 349  RILLLPADNIADEVKMFFTNTLERHGSGVRPDVQDVSPSPSDRTMIDYDGLGFMSSDLKV 408

Query: 3032 DEGRRVSALSAIDPKPIECGIGADLHGLLNNRIRDIDIS 3148
            ++G     +S +           D +G L+ +  +I IS
Sbjct: 409  EKGNDDELISGLPT--------TDSYGALSEKNNNIKIS 439


>KHG19864.1 Poly (A) RNA polymerase cid14 [Gossypium arboreum]
          Length = 881

 Score =  535 bits (1379), Expect = e-172
 Identities = 275/419 (65%), Positives = 312/419 (74%), Gaps = 1/419 (0%)
 Frame = +2

Query: 1784 MGDLEIWPPTANGISIDEEEEEDNLFLXXXXXXXXXXXXXXXICIDSQCWSKAEETTQEI 1963
            MGDL  W P  NG+S  +     +                      ++ W KAEE TQ I
Sbjct: 4    MGDLRDWSPEPNGVSSRDRYSSSS------------SSSSNQTGTSAEYWRKAEEATQGI 51

Query: 1964 IHKIQPTVVSEHRRQIVIQFVQNLIRGYLGCEVCPFGSVPLKTYLPDGDIDLTALSIQNV 2143
            I ++QPTVVSE RR+ VI +VQ LIR YLGCEV PFGSVPLKTYLPDGDIDLTA    N 
Sbjct: 52   IARVQPTVVSEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNF 111

Query: 2144 EDALANDVRIVLETEERNKSSEFEVKDVQYIQAEVKLVKCLVQNIIVDISFNQIGGLSTL 2323
            E+ALAND   VLE E+RN ++EF VKDVQ I+AEVKLVKCLVQNI+VDISFNQ+GGL TL
Sbjct: 112  EEALANDACSVLEREDRNTAAEFVVKDVQLIRAEVKLVKCLVQNIVVDISFNQLGGLCTL 171

Query: 2324 CFLEQVDRRIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHSS 2503
            CFLEQVDR IGKDHLFKRSIILIKAWCYYESRILGAHHGLISTY LETLVLYIFHLFHSS
Sbjct: 172  CFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYGLETLVLYIFHLFHSS 231

Query: 2504 LNGPLAVLYRFLDYYSKFDWDNYCISLNGPVNVRSLPEIVAEAPDNXXXXXXXXXXFLRR 2683
            L+GPLAVLY+FLDY+SKFDW+NYCISLNGP+ + SLP+IV E P+N          FLR 
Sbjct: 232  LDGPLAVLYKFLDYFSKFDWENYCISLNGPIPISSLPDIVVETPENGGGDLLLSNDFLRE 291

Query: 2684 CADMFALPSKGLETNSRAFIQKYLNIVDPLKDNNNLGRSVSKGNFFRIRSAFSYGARKLG 2863
            C + F++PS+G + NSR F QK+LNIVDPLK+NNNLGRSVSKGNF+RIRSAF+YGARKLG
Sbjct: 292  CVETFSVPSRGFDANSRIFPQKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLG 351

Query: 2864 RILLLPGD-IHDELTKFFGNTLDRHGTGVRPDVQNSAPDFIAYGSQHTSYQSSTVKCDE 3037
            +IL    + + DEL KFF NTLDRHG G RPDVQ+ AP     G   T   S T  C E
Sbjct: 352  QILSQSEETLGDELRKFFSNTLDRHGNGQRPDVQDPAPLSRFRGLGATPSVSGTESCQE 410


>XP_012481362.1 PREDICTED: uncharacterized protein LOC105796291 isoform X1 [Gossypium
            raimondii] KJB27691.1 hypothetical protein
            B456_005G005000 [Gossypium raimondii]
          Length = 884

 Score =  535 bits (1379), Expect = e-172
 Identities = 275/419 (65%), Positives = 313/419 (74%), Gaps = 1/419 (0%)
 Frame = +2

Query: 1784 MGDLEIWPPTANGISIDEEEEEDNLFLXXXXXXXXXXXXXXXICIDSQCWSKAEETTQEI 1963
            MGDL  W P  NG+S  +     +                    I ++ W KAEE TQ I
Sbjct: 4    MGDLRDWSPEPNGVSSRDRYSSSS------------SSSSNQAGISAEYWRKAEEATQGI 51

Query: 1964 IHKIQPTVVSEHRRQIVIQFVQNLIRGYLGCEVCPFGSVPLKTYLPDGDIDLTALSIQNV 2143
            I ++QPTVVSE RR+ VI +VQ LIR YLGCEV PFGSVPLKTYLPDGDIDLTA    N 
Sbjct: 52   IARVQPTVVSEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNF 111

Query: 2144 EDALANDVRIVLETEERNKSSEFEVKDVQYIQAEVKLVKCLVQNIIVDISFNQIGGLSTL 2323
            E+ALAND   VLE E+RN ++EF VKDVQ I+AEVKLVKCLVQNI+VDISFNQ+GGL TL
Sbjct: 112  EEALANDACSVLEREDRNTAAEFVVKDVQLIRAEVKLVKCLVQNIVVDISFNQLGGLCTL 171

Query: 2324 CFLEQVDRRIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHSS 2503
            CFLEQVDR IG+DHLFKRSIILIKAWCYYESRILGAHHGLISTY LETLVLYIFHLFHSS
Sbjct: 172  CFLEQVDRLIGQDHLFKRSIILIKAWCYYESRILGAHHGLISTYGLETLVLYIFHLFHSS 231

Query: 2504 LNGPLAVLYRFLDYYSKFDWDNYCISLNGPVNVRSLPEIVAEAPDNXXXXXXXXXXFLRR 2683
            L+GPLAVLY+FLDY+SKFDW+NYCISLNGP+ + SLP+IV E P+N          FLR 
Sbjct: 232  LDGPLAVLYKFLDYFSKFDWENYCISLNGPIPISSLPDIVVETPENGGGDLLLSNDFLRE 291

Query: 2684 CADMFALPSKGLETNSRAFIQKYLNIVDPLKDNNNLGRSVSKGNFFRIRSAFSYGARKLG 2863
            C + F++PS+G + NSR F QK+LNIVDPLK+NNNLGRSVSKGNF+RIRSAF+YGARKLG
Sbjct: 292  CVETFSVPSRGFDANSRIFPQKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLG 351

Query: 2864 RILLLPGD-IHDELTKFFGNTLDRHGTGVRPDVQNSAPDFIAYGSQHTSYQSSTVKCDE 3037
            +IL    + + DEL KFF NTLDRHG G RPDVQ+ AP     G   T   S T  C E
Sbjct: 352  QILSQSEETLGDELHKFFSNTLDRHGNGQRPDVQDPAPLSRFRGLGATPSVSGTESCQE 410


>XP_007208169.1 hypothetical protein PRUPE_ppa001915mg [Prunus persica]
          Length = 742

 Score =  529 bits (1363), Expect = e-171
 Identities = 284/460 (61%), Positives = 328/460 (71%), Gaps = 6/460 (1%)
 Frame = +2

Query: 1784 MGDL-EIWPPTANGISIDEEEEEDNLFLXXXXXXXXXXXXXXXIC--IDSQCWSKAEETT 1954
            MGDL E W    NG  ++E     +                      I ++ W KAEE T
Sbjct: 1    MGDLREDWSSELNGAVVEERPSSASSLSSSTSLLFSSNPASAAAAAGISAEYWKKAEEAT 60

Query: 1955 QEIIHKIQPTVVSEHRRQIVIQFVQNLIRGYLGCEVCPFGSVPLKTYLPDGDIDLTALSI 2134
            Q +I ++QPT VSE RR+ VI +VQ LIRG LGCEV PFGSVPLKTYLPDGDIDLTA   
Sbjct: 61   QGVIAQVQPTDVSERRRKAVIDYVQRLIRGCLGCEVFPFGSVPLKTYLPDGDIDLTAFGG 120

Query: 2135 QNVEDALANDVRIVLETEERNKSSEFEVKDVQYIQAEVKLVKCLVQNIIVDISFNQIGGL 2314
             NVE+ALANDV  VLE E +N ++EF VKDVQ I+AEVKLVKCLVQNI+VDISFNQ+GGL
Sbjct: 121  INVEEALANDVCSVLEREVQNGTAEFMVKDVQLIRAEVKLVKCLVQNIVVDISFNQLGGL 180

Query: 2315 STLCFLEQVDRRIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLF 2494
             TLCFLEQVDR IGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLF
Sbjct: 181  CTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLF 240

Query: 2495 HSSLNGPLAVLYRFLDYYSKFDWDNYCISLNGPVNVRSLPEIVAEAPDNXXXXXXXXXXF 2674
            H+SLNGPLAVLY+FLDY+SKFDWDNYCISL+GPV + SLPE++ E P+N          F
Sbjct: 241  HASLNGPLAVLYKFLDYFSKFDWDNYCISLSGPVRISSLPELLVETPENGGNDLLLSNDF 300

Query: 2675 LRRCADMFALPSKGLETNSRAFIQKYLNIVDPLKDNNNLGRSVSKGNFFRIRSAFSYGAR 2854
            L+ C  MF++PS+G ETN R F  K+ NIVDPLKDNNNLGRSVSKGNF+RIRSAF+YGAR
Sbjct: 301  LKECVQMFSVPSRGYETNYRTFPPKHFNIVDPLKDNNNLGRSVSKGNFYRIRSAFTYGAR 360

Query: 2855 KLGRILLLPGD-IHDELTKFFGNTLDRHGTGVRPDVQNSAP--DFIAYGSQHTSYQSSTV 3025
            KLGRIL    D I DE+ KFF NTLDRHG G RPDVQ+  P   +  YGS   S  + T 
Sbjct: 361  KLGRILSQTEDNIDDEIRKFFANTLDRHGGGQRPDVQDLVPLSRYDGYGS--VSLFAGTE 418

Query: 3026 KCDEGRRVSALSAIDPKPIECGIGADLHGLLNNRIRDIDI 3145
              D+    S  +       ECG+ ++  G  N  + ++ I
Sbjct: 419  SQDQINYESESAYSSGMIGECGLNSE--GSWNGEVTNVQI 456


>XP_017633396.1 PREDICTED: uncharacterized protein LOC108475915 isoform X2 [Gossypium
            arboreum]
          Length = 885

 Score =  533 bits (1372), Expect = e-171
 Identities = 274/419 (65%), Positives = 312/419 (74%), Gaps = 1/419 (0%)
 Frame = +2

Query: 1784 MGDLEIWPPTANGISIDEEEEEDNLFLXXXXXXXXXXXXXXXICIDSQCWSKAEETTQEI 1963
            MGDL  W P  NG+S  +     +                    I ++ W KAEE TQ I
Sbjct: 4    MGDLRDWSPEPNGVSSRDRYSSSS------------SSSSNQTGISAEYWRKAEEATQGI 51

Query: 1964 IHKIQPTVVSEHRRQIVIQFVQNLIRGYLGCEVCPFGSVPLKTYLPDGDIDLTALSIQNV 2143
            I ++QPTVVSE RR+ VI +VQ LIR YLGCEV PFGSVPLKTYLPDGDIDLTA      
Sbjct: 52   IARVQPTVVSEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLIF 111

Query: 2144 EDALANDVRIVLETEERNKSSEFEVKDVQYIQAEVKLVKCLVQNIIVDISFNQIGGLSTL 2323
            E+ALANDV  VLE E+ N ++EF VKDVQ I+AEVKLVKCLVQNI+VDISFNQ+GGL TL
Sbjct: 112  EEALANDVCSVLEREDHNTAAEFVVKDVQLIRAEVKLVKCLVQNIVVDISFNQLGGLCTL 171

Query: 2324 CFLEQVDRRIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHSS 2503
            CFLEQVDR IGKDHLFKRSI+LIKAWCYYESRILGAHHGLISTY LETLVLYIFHLFHSS
Sbjct: 172  CFLEQVDRLIGKDHLFKRSIVLIKAWCYYESRILGAHHGLISTYGLETLVLYIFHLFHSS 231

Query: 2504 LNGPLAVLYRFLDYYSKFDWDNYCISLNGPVNVRSLPEIVAEAPDNXXXXXXXXXXFLRR 2683
            L+GPLAVLY+FLDY+SKFDW+NYCISLNGP+ + SLP+IV E P+N          FLR 
Sbjct: 232  LDGPLAVLYKFLDYFSKFDWENYCISLNGPIPISSLPDIVVETPENGGGDLLLSNDFLRE 291

Query: 2684 CADMFALPSKGLETNSRAFIQKYLNIVDPLKDNNNLGRSVSKGNFFRIRSAFSYGARKLG 2863
            C + F++PS+G E NSR F QK+LNIVDPL++NNNLGRSVSKGNF+RIRSAF+YGARKLG
Sbjct: 292  CVEKFSVPSRGFEANSRIFPQKHLNIVDPLRENNNLGRSVSKGNFYRIRSAFTYGARKLG 351

Query: 2864 RILLLPGD-IHDELTKFFGNTLDRHGTGVRPDVQNSAPDFIAYGSQHTSYQSSTVKCDE 3037
            +IL    + + DEL KFF NTLDRHG G RPDVQ+ AP     G   T   S T  C E
Sbjct: 352  QILSQSEETLGDELHKFFSNTLDRHGNGQRPDVQDPAPLSRFRGLGATPSVSGTESCQE 410


>OAY44712.1 hypothetical protein MANES_08G174000 [Manihot esculenta]
          Length = 905

 Score =  533 bits (1373), Expect = e-170
 Identities = 267/398 (67%), Positives = 310/398 (77%), Gaps = 1/398 (0%)
 Frame = +2

Query: 1784 MGDLEIWPPTANGISIDEEEEEDNLFLXXXXXXXXXXXXXXXICIDSQCWSKAEETTQEI 1963
            MGDL  W P  NG  ++E     +L L                 I ++ W +AE  TQ I
Sbjct: 1    MGDLRAWSPELNGAVLEERPSSSSLSLANQAG------------ISAESWQRAEAVTQGI 48

Query: 1964 IHKIQPTVVSEHRRQIVIQFVQNLIRGYLGCEVCPFGSVPLKTYLPDGDIDLTALSIQNV 2143
            I ++QPT+VSE RR+ VI +VQ LIR  LGCEV PFGSVPL+TYLPDGDIDLTA    ++
Sbjct: 49   IGQVQPTLVSEERRKAVIDYVQRLIRNSLGCEVFPFGSVPLRTYLPDGDIDLTAFGGMHI 108

Query: 2144 EDALANDVRIVLETEERNKSSEFEVKDVQYIQAEVKLVKCLVQNIIVDISFNQIGGLSTL 2323
            E+ALANDV  VLE E++N+ +EF VKDVQ I+AEVKLVKCLVQNI+VDISFNQ+GGL TL
Sbjct: 109  EEALANDVCSVLEREDQNRIAEFVVKDVQLIRAEVKLVKCLVQNIVVDISFNQLGGLCTL 168

Query: 2324 CFLEQVDRRIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHSS 2503
            CFLEQVDR IG+DHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHSS
Sbjct: 169  CFLEQVDRLIGRDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHSS 228

Query: 2504 LNGPLAVLYRFLDYYSKFDWDNYCISLNGPVNVRSLPEIVAEAPDNXXXXXXXXXXFLRR 2683
            L GPLAVLY+FLDY+SKFDWDNYCISLNGPV + SLPE+V E P+N          FL+ 
Sbjct: 229  LAGPLAVLYKFLDYFSKFDWDNYCISLNGPVRISSLPEVVVETPENGGFDLLLSNDFLKE 288

Query: 2684 CADMFALPSKGLETNSRAFIQKYLNIVDPLKDNNNLGRSVSKGNFFRIRSAFSYGARKLG 2863
            C +MF++P++  ETNSR F  K+LNIVDPLK+NNNLGRSVSKGNF+RI+SAF+YGARKLG
Sbjct: 289  CVEMFSVPARAYETNSRTFPPKHLNIVDPLKENNNLGRSVSKGNFYRIKSAFTYGARKLG 348

Query: 2864 RILLLPGD-IHDELTKFFGNTLDRHGTGVRPDVQNSAP 2974
            RIL  P + I  EL KFF NTL+RHG+G RPDVQ+ AP
Sbjct: 349  RILSQPEESISSELHKFFSNTLERHGSGRRPDVQDPAP 386


>XP_016685214.1 PREDICTED: uncharacterized protein LOC107903624 isoform X1 [Gossypium
            hirsutum]
          Length = 884

 Score =  532 bits (1370), Expect = e-170
 Identities = 273/419 (65%), Positives = 311/419 (74%), Gaps = 1/419 (0%)
 Frame = +2

Query: 1784 MGDLEIWPPTANGISIDEEEEEDNLFLXXXXXXXXXXXXXXXICIDSQCWSKAEETTQEI 1963
            MGDL  W P  NG+S  +     +                    I ++ W KAEE TQ I
Sbjct: 4    MGDLRDWSPEPNGVSSRDRYSSSS------------SSSSNQAGISAEYWRKAEEATQGI 51

Query: 1964 IHKIQPTVVSEHRRQIVIQFVQNLIRGYLGCEVCPFGSVPLKTYLPDGDIDLTALSIQNV 2143
            I ++QPTVVSE RR+ VI +VQ LIR YLGCEV PFGSVPLKTYLPDGDIDLTA    N 
Sbjct: 52   IARVQPTVVSEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNF 111

Query: 2144 EDALANDVRIVLETEERNKSSEFEVKDVQYIQAEVKLVKCLVQNIIVDISFNQIGGLSTL 2323
            E+ALAND   VLE E+ N ++EF VKDVQ I+AEVKLVKCLVQNI+VDISFNQ+GGL TL
Sbjct: 112  EEALANDACSVLEREDHNTAAEFVVKDVQLIRAEVKLVKCLVQNIVVDISFNQLGGLCTL 171

Query: 2324 CFLEQVDRRIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHSS 2503
            CFLEQVDR IG+DHLFKRSIILIKAWCYYESRILGAHHGLISTY LETLVLYIFHLFHSS
Sbjct: 172  CFLEQVDRLIGQDHLFKRSIILIKAWCYYESRILGAHHGLISTYGLETLVLYIFHLFHSS 231

Query: 2504 LNGPLAVLYRFLDYYSKFDWDNYCISLNGPVNVRSLPEIVAEAPDNXXXXXXXXXXFLRR 2683
            L+GPLAVLY+FLDY+SKFDW+NYCISLNGP+ + SLP+IV E P+N          FLR 
Sbjct: 232  LDGPLAVLYKFLDYFSKFDWENYCISLNGPIPISSLPDIVVETPENGGGDLLLSNDFLRE 291

Query: 2684 CADMFALPSKGLETNSRAFIQKYLNIVDPLKDNNNLGRSVSKGNFFRIRSAFSYGARKLG 2863
            C + F++PS+G + N R F QK+LNIVDPLK+NNNLGRSVSKGNF+RIRSAF+YGARKLG
Sbjct: 292  CVETFSVPSRGFDANPRIFPQKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLG 351

Query: 2864 RILLLPGD-IHDELTKFFGNTLDRHGTGVRPDVQNSAPDFIAYGSQHTSYQSSTVKCDE 3037
            +IL    + + DEL KFF NTLDRHG G RPDVQ+ AP     G   T   S T  C E
Sbjct: 352  QILSQSEETLGDELHKFFSNTLDRHGNGQRPDVQDPAPLSRFRGLGATQSVSGTESCQE 410


>XP_016724078.1 PREDICTED: uncharacterized protein LOC107935961 [Gossypium hirsutum]
          Length = 885

 Score =  531 bits (1368), Expect = e-170
 Identities = 274/419 (65%), Positives = 311/419 (74%), Gaps = 1/419 (0%)
 Frame = +2

Query: 1784 MGDLEIWPPTANGISIDEEEEEDNLFLXXXXXXXXXXXXXXXICIDSQCWSKAEETTQEI 1963
            MGDL  W P  NG+S  +     +                    I ++ W KAEE TQ I
Sbjct: 4    MGDLRDWSPEPNGVSSRDRYSSSS------------SSSSNQTGISAEYWRKAEEATQGI 51

Query: 1964 IHKIQPTVVSEHRRQIVIQFVQNLIRGYLGCEVCPFGSVPLKTYLPDGDIDLTALSIQNV 2143
            I ++QPTVVSE RR+ VI  V+ L R YLGCEV PFGSVPLKTYLPDGDIDLTA    N 
Sbjct: 52   IARVQPTVVSEERRKAVIDDVRRLSRNYLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNF 111

Query: 2144 EDALANDVRIVLETEERNKSSEFEVKDVQYIQAEVKLVKCLVQNIIVDISFNQIGGLSTL 2323
            E+ALAND   VLE E+RN ++EF VKDVQ I+AEVKLVKCLVQNI+VDISFNQ+GGL TL
Sbjct: 112  EEALANDACSVLEREDRNTAAEFVVKDVQLIRAEVKLVKCLVQNIVVDISFNQLGGLCTL 171

Query: 2324 CFLEQVDRRIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHSS 2503
            CFLEQVDR IGKDHLFKRSIILIKAWCYYESRILGAHHGLISTY LETLVLYIFHLFHSS
Sbjct: 172  CFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYGLETLVLYIFHLFHSS 231

Query: 2504 LNGPLAVLYRFLDYYSKFDWDNYCISLNGPVNVRSLPEIVAEAPDNXXXXXXXXXXFLRR 2683
            L+GPLAVLY+FLDY+SKFDW+NYCISLNGP+ + SLP+IV E P+N          FLR 
Sbjct: 232  LDGPLAVLYKFLDYFSKFDWENYCISLNGPIPISSLPDIVVETPENGGGNLLLSNDFLRE 291

Query: 2684 CADMFALPSKGLETNSRAFIQKYLNIVDPLKDNNNLGRSVSKGNFFRIRSAFSYGARKLG 2863
            C + F++PS+G + NSR F QK+LNIVDPLK+NNNLGRSVSKGNF+RIRSAF+YGARKLG
Sbjct: 292  CVETFSVPSRGFDANSRIFPQKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLG 351

Query: 2864 RILLLPGD-IHDELTKFFGNTLDRHGTGVRPDVQNSAPDFIAYGSQHTSYQSSTVKCDE 3037
            +IL    + + DEL KFF NTLDRHG G RPDVQ+ AP     G   T   S T  C E
Sbjct: 352  QILSQSEETLGDELRKFFSNTLDRHGNGQRPDVQDPAPLSRFRGLGATPSVSGTESCQE 410


>KJB27693.1 hypothetical protein B456_005G005100 [Gossypium raimondii]
          Length = 736

 Score =  526 bits (1354), Expect = e-170
 Identities = 272/428 (63%), Positives = 312/428 (72%), Gaps = 1/428 (0%)
 Frame = +2

Query: 1784 MGDLEIWPPTANGISIDEEEEEDNLFLXXXXXXXXXXXXXXXICIDSQCWSKAEETTQEI 1963
            MGDL  W P  NG+S  +                          I ++ W KAEE TQ I
Sbjct: 4    MGDLRDWSPEPNGVSSRDSYSSS------------PSSSSNQTGISAEYWRKAEEATQGI 51

Query: 1964 IHKIQPTVVSEHRRQIVIQFVQNLIRGYLGCEVCPFGSVPLKTYLPDGDIDLTALSIQNV 2143
            I ++QPTVVSE RR+ V  +VQ LIR YLGCEV PFGSVPLKTYLPDGDIDLTA      
Sbjct: 52   IARVQPTVVSEERRKAVTDYVQRLIRNYLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLIF 111

Query: 2144 EDALANDVRIVLETEERNKSSEFEVKDVQYIQAEVKLVKCLVQNIIVDISFNQIGGLSTL 2323
            E+ALANDV  VLE E+ N ++EF VKDVQ I+AEVKLVKCLVQNI+VDISFNQ+GGL TL
Sbjct: 112  EEALANDVCSVLEREDHNTAAEFVVKDVQLIRAEVKLVKCLVQNIVVDISFNQLGGLCTL 171

Query: 2324 CFLEQVDRRIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHSS 2503
            CFLEQVDR IGK+HLFKRSI+LIKAWCYYESRILGAHHGLISTY LETLVLYIFHLFHS 
Sbjct: 172  CFLEQVDRLIGKNHLFKRSILLIKAWCYYESRILGAHHGLISTYGLETLVLYIFHLFHSF 231

Query: 2504 LNGPLAVLYRFLDYYSKFDWDNYCISLNGPVNVRSLPEIVAEAPDNXXXXXXXXXXFLRR 2683
            L+GPLAVLY+FLDY+SKFDW+NYCISLNGP+ + SLP+IV E P+N          FLR 
Sbjct: 232  LDGPLAVLYKFLDYFSKFDWENYCISLNGPIPISSLPDIVVETPENGGGDLLLSNDFLRE 291

Query: 2684 CADMFALPSKGLETNSRAFIQKYLNIVDPLKDNNNLGRSVSKGNFFRIRSAFSYGARKLG 2863
            C + F++PS+G E NSR F QK+LNIVDPL++NNNLGRSVSKGNF+RIRSAF+YGARKLG
Sbjct: 292  CVEKFSVPSRGFEANSRIFPQKHLNIVDPLRENNNLGRSVSKGNFYRIRSAFTYGARKLG 351

Query: 2864 RILLLPGD-IHDELTKFFGNTLDRHGTGVRPDVQNSAPDFIAYGSQHTSYQSSTVKCDEG 3040
            +IL    + + DEL KFF NTLDRHG G RPDVQ+ AP     G   T   S T  C E 
Sbjct: 352  QILSQSEETLGDELHKFFSNTLDRHGNGQRPDVQDPAPLSRFRGLGATPSVSGTESCQED 411

Query: 3041 RRVSALSA 3064
            +    L +
Sbjct: 412  QNFYELES 419


Top