BLASTX nr result

ID: Forsythia21_contig00002748 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Forsythia21_contig00002748
         (1411 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011089608.1| PREDICTED: uncharacterized protein DDB_G0271...   259   4e-66
ref|XP_011089607.1| PREDICTED: uncharacterized protein DDB_G0271...   259   4e-66
ref|XP_012833582.1| PREDICTED: uncharacterized protein LOC105954...   244   9e-62
emb|CDP03981.1| unnamed protein product [Coffea canephora]            234   1e-58
ref|XP_003549295.1| PREDICTED: proline-rich protein PRCC-like [G...   205   6e-50
ref|XP_007014432.1| C-terminal, putative [Theobroma cacao] gi|50...   203   2e-49
gb|KHN17634.1| Proline-rich protein PRCC [Glycine soja]               202   5e-49
gb|ACU18505.1| unknown [Glycine max]                                  201   1e-48
ref|XP_007154707.1| hypothetical protein PHAVU_003G140900g [Phas...   197   2e-47
ref|XP_012068474.1| PREDICTED: uncharacterized protein LOC105631...   194   1e-46
ref|XP_006453238.1| hypothetical protein CICLE_v10008415mg [Citr...   192   7e-46
ref|XP_004296496.1| PREDICTED: proline-rich protein PRCC [Fragar...   192   7e-46
ref|XP_009626418.1| PREDICTED: proline-rich protein PRCC [Nicoti...   191   9e-46
ref|XP_012463463.1| PREDICTED: uncharacterized protein LOC105782...   191   1e-45
gb|KHN22658.1| hypothetical protein glysoja_027546 [Glycine soja]     191   1e-45
ref|XP_002308111.2| hypothetical protein POPTR_0006s07450g [Popu...   190   3e-45
gb|KHG24356.1| Proline-rich PRCC [Gossypium arboreum]                 189   4e-45
ref|XP_011020044.1| PREDICTED: proline-rich protein PRCC [Populu...   189   6e-45
ref|XP_010046926.1| PREDICTED: proline-rich protein PRCC [Eucaly...   187   1e-44
ref|XP_003542837.1| PREDICTED: proline-rich protein PRCC-like [G...   186   4e-44

>ref|XP_011089608.1| PREDICTED: uncharacterized protein DDB_G0271670 isoform X2 [Sesamum
            indicum] gi|747084403|ref|XP_011089609.1| PREDICTED:
            uncharacterized protein DDB_G0271670 isoform X3 [Sesamum
            indicum]
          Length = 477

 Score =  259 bits (662), Expect = 4e-66
 Identities = 199/513 (38%), Positives = 232/513 (45%), Gaps = 73/513 (14%)
 Frame = -3

Query: 1322 MDSLLANYASSDXXXXXXXXXXXEPKQLEESLNPEKFKVEDEGENDFLSKPSRK---ILD 1152
            MDSLLANYASSD           +P +LE     EK       + +FLS+ S K   I  
Sbjct: 1    MDSLLANYASSDDEEREEQPPSDKPVKLETGAGVEK-------DAEFLSESSAKRGGIFS 53

Query: 1151 SLPPPKSSLFNSLPPPKSHLAQTPFDFKEPSMPKPQPQFKR-----EELEEIPENRNPKX 987
            SLPPPKSSLFNSLPPPKS     P         KPQ +F+      E  E+I E+  PK 
Sbjct: 54   SLPPPKSSLFNSLPPPKSQSLPNP---------KPQAEFEHQRDADEHDEQIVESSKPKS 104

Query: 986  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSDISETPQILPNXXXXXXXXXXXSKRVVQ 807
                                              S     LP            SKRVVQ
Sbjct: 105  SSS-------------------------------SSLFASLPPPKSSSSSSSSASKRVVQ 133

Query: 806  FRPPPIMNPISTNAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSIPAPKNSTTLGA 627
            FRPP I  P S                                    SIPAP+NS TLGA
Sbjct: 134  FRPPTIAKPYSGTFDDEDEDDDEGEQERERKRSKESISTSSAKSFLSSIPAPRNSATLGA 193

Query: 626  LPSASGTGRRSMLETDASASNLSNVGTMGSDAAVNSSIGYSEDQSGDGSSLSYDHSNWNS 447
            LPSASG GRRS+LET+A AS++  V   G+DA VN ++G   DQS   S L+Y +S+W+S
Sbjct: 194  LPSASGAGRRSILETEAPASSV--VSKPGNDAVVNPNVGSLLDQS---SELNYGYSSWSS 248

Query: 446  --------------------------------------------GSESYDGYGA------ 417
                                                        G ESY  YGA      
Sbjct: 249  ESESHAYYSGYGAVADDNVGLAPVGSSSTGNDQFHEVYDHSSSLGGESYAYYGAYGVGST 308

Query: 416  ------------GYDDNVGTGEGVNYSNWKGQTXXXXXXXXXXXXXXXY---EGNWIDES 282
                        G + N G+ E V+YS   GQ                    E NW   S
Sbjct: 309  AVGTVATAGSDAGMNSNEGSYEAVDYSYGNGQHVEYTNHGGSYGDYGNDAEYENNW---S 365

Query: 281  GATAVAEASRLAESGLQVPGKRGRNDTPEQIVEVKQDELLKNRPREDQVKLTGIAFGPAY 102
              TAV E   +  + L +P KRGR D P +IVEVKQDEL+KNRPREDQVKLTGIAFGPAY
Sbjct: 366  STTAVHEVPGIVGNALPLPVKRGRKDVPPEIVEVKQDELMKNRPREDQVKLTGIAFGPAY 425

Query: 101  KPASTKGKPTKLHKRKHQIGSLLYDLRQKEMEL 3
            +P STKGKP+KLHKRKHQIGSL +D+RQKEMEL
Sbjct: 426  QPTSTKGKPSKLHKRKHQIGSLYFDMRQKEMEL 458


>ref|XP_011089607.1| PREDICTED: uncharacterized protein DDB_G0271670 isoform X1 [Sesamum
            indicum]
          Length = 491

 Score =  259 bits (662), Expect = 4e-66
 Identities = 199/513 (38%), Positives = 232/513 (45%), Gaps = 73/513 (14%)
 Frame = -3

Query: 1322 MDSLLANYASSDXXXXXXXXXXXEPKQLEESLNPEKFKVEDEGENDFLSKPSRK---ILD 1152
            MDSLLANYASSD           +P +LE     EK       + +FLS+ S K   I  
Sbjct: 1    MDSLLANYASSDDEEREEQPPSDKPVKLETGAGVEK-------DAEFLSESSAKRGGIFS 53

Query: 1151 SLPPPKSSLFNSLPPPKSHLAQTPFDFKEPSMPKPQPQFKR-----EELEEIPENRNPKX 987
            SLPPPKSSLFNSLPPPKS     P         KPQ +F+      E  E+I E+  PK 
Sbjct: 54   SLPPPKSSLFNSLPPPKSQSLPNP---------KPQAEFEHQRDADEHDEQIVESSKPKS 104

Query: 986  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSDISETPQILPNXXXXXXXXXXXSKRVVQ 807
                                              S     LP            SKRVVQ
Sbjct: 105  SSS-------------------------------SSLFASLPPPKSSSSSSSSASKRVVQ 133

Query: 806  FRPPPIMNPISTNAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSIPAPKNSTTLGA 627
            FRPP I  P S                                    SIPAP+NS TLGA
Sbjct: 134  FRPPTIAKPYSGTFDDEDEDDDEGEQERERKRSKESISTSSAKSFLSSIPAPRNSATLGA 193

Query: 626  LPSASGTGRRSMLETDASASNLSNVGTMGSDAAVNSSIGYSEDQSGDGSSLSYDHSNWNS 447
            LPSASG GRRS+LET+A AS++  V   G+DA VN ++G   DQS   S L+Y +S+W+S
Sbjct: 194  LPSASGAGRRSILETEAPASSV--VSKPGNDAVVNPNVGSLLDQS---SELNYGYSSWSS 248

Query: 446  --------------------------------------------GSESYDGYGA------ 417
                                                        G ESY  YGA      
Sbjct: 249  ESESHAYYSGYGAVADDNVGLAPVGSSSTGNDQFHEVYDHSSSLGGESYAYYGAYGVGST 308

Query: 416  ------------GYDDNVGTGEGVNYSNWKGQTXXXXXXXXXXXXXXXY---EGNWIDES 282
                        G + N G+ E V+YS   GQ                    E NW   S
Sbjct: 309  AVGTVATAGSDAGMNSNEGSYEAVDYSYGNGQHVEYTNHGGSYGDYGNDAEYENNW---S 365

Query: 281  GATAVAEASRLAESGLQVPGKRGRNDTPEQIVEVKQDELLKNRPREDQVKLTGIAFGPAY 102
              TAV E   +  + L +P KRGR D P +IVEVKQDEL+KNRPREDQVKLTGIAFGPAY
Sbjct: 366  STTAVHEVPGIVGNALPLPVKRGRKDVPPEIVEVKQDELMKNRPREDQVKLTGIAFGPAY 425

Query: 101  KPASTKGKPTKLHKRKHQIGSLLYDLRQKEMEL 3
            +P STKGKP+KLHKRKHQIGSL +D+RQKEMEL
Sbjct: 426  QPTSTKGKPSKLHKRKHQIGSLYFDMRQKEMEL 458


>ref|XP_012833582.1| PREDICTED: uncharacterized protein LOC105954458 [Erythranthe
            guttatus] gi|604341320|gb|EYU40672.1| hypothetical
            protein MIMGU_mgv1a007852mg [Erythranthe guttata]
          Length = 393

 Score =  244 bits (624), Expect = 9e-62
 Identities = 174/448 (38%), Positives = 222/448 (49%), Gaps = 8/448 (1%)
 Frame = -3

Query: 1322 MDSLLANYASSDXXXXXXXXXXXEPKQLEESLNPEKFKVEDEGENDFLSKPSRK---ILD 1152
            MDSLLANYASSD            P +   S++      E   + DFL+ P+ K   I +
Sbjct: 1    MDSLLANYASSDDEEPSPVQRRIVPARTVNSVS------EAGKDGDFLANPTSKHGGIFN 54

Query: 1151 SLPPPKSSLFNSLPPPKSHLAQTPFDFKEPSMPKPQPQF--KREELEEIPENRNPKXXXX 978
            SLPPPKSSLFNSLPPPK                 PQ  F   R+  E+I E   PK    
Sbjct: 55   SLPPPKSSLFNSLPPPK-----------------PQSGFAKNRDFDEQIVEKSKPKPSSS 97

Query: 977  XXXXXXXXXXXXXXXXXXXXXXXXXXXXSDISETPQILPNXXXXXXXXXXXSKRVVQFRP 798
                                           +  P   P+            K+VVQFRP
Sbjct: 98   SSL---------------------------FTSLPPPKPSSSSS--------KKVVQFRP 122

Query: 797  PPIMNPISTNAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSIPAPKNSTTLGALPS 618
            P I NP S+                                   SIPAP+++ TLG + S
Sbjct: 123  PTITNPNSSKFDDEDEDADEGELERQRKRAKESISTASPASFLSSIPAPRHTATLGTMSS 182

Query: 617  ASGTGRRSMLETDASASNLSNVGTM--GSDAAVNSSIG-YSEDQSGDGSSLSYDHSNWNS 447
            ASGT RRS++ET+A +SN +  GTM   +D  VN+S   Y +++    + ++      N 
Sbjct: 183  ASGTNRRSIIETEAPSSNANKTGTMKNNTDTIVNNSNAKYLKEEEDPTNEIT------NG 236

Query: 446  GSESYDGYGAGYDDNVGTGEGVNYSNWKGQTXXXXXXXXXXXXXXXYEGNWIDESGATAV 267
            G+  Y   G+ YD + G G+ V+Y+N  G                 YE NW   + +  +
Sbjct: 237  GAVDYTA-GSSYDYSYGDGQYVDYTNSGGS-------YGNYGDHGQYENNW---ANSIPL 285

Query: 266  AEASRLAESGLQVPGKRGRNDTPEQIVEVKQDELLKNRPREDQVKLTGIAFGPAYKPAST 87
             E S +AE  L+VPG+RGR DTP QI+EVKQDEL+KNRPR+DQVK TGIAFGP Y+P ST
Sbjct: 286  PEVSAVAEEALRVPGRRGRKDTPLQIIEVKQDELMKNRPRQDQVKSTGIAFGPQYEPTST 345

Query: 86   KGKPTKLHKRKHQIGSLLYDLRQKEMEL 3
            KGKPTKLHKRKHQIGSLL+D+RQKE EL
Sbjct: 346  KGKPTKLHKRKHQIGSLLFDMRQKETEL 373


>emb|CDP03981.1| unnamed protein product [Coffea canephora]
          Length = 413

 Score =  234 bits (597), Expect = 1e-58
 Identities = 177/466 (37%), Positives = 220/466 (47%), Gaps = 26/466 (5%)
 Frame = -3

Query: 1322 MDSLLANYASSDXXXXXXXXXXXEPKQLEESLNPEKFKVEDEGENDFLSKP-SRKILDSL 1146
            MDSLLA+YASSD                           E++ +   LS P S   L SL
Sbjct: 1    MDSLLASYASSD---------------------------EEQEDKPQLSNPKSAGFLSSL 33

Query: 1145 PPPKSSLFNSLPPPKSHLAQTPFDFKEPSMPKPQPQFKREELEEIPENRNPKXXXXXXXX 966
            PPPKSS  +S      HLA         S+PKP           +P+ ++          
Sbjct: 34   PPPKSSSSSS---SSGHLA---------SLPKPSSSL----FASLPQPKSSSTSSLFS-- 75

Query: 965  XXXXXXXXXXXXXXXXXXXXXXXXSDISETPQILPNXXXXXXXXXXXSKRVVQFRPPPIM 786
                                      + +  + L              KRVVQF+PPP+ 
Sbjct: 76   -------------------------SLPQPTKTLNPDARAPPPAQSAGKRVVQFKPPPVY 110

Query: 785  NPISTNAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSIPAPKNSTTLGALPSASGT 606
            +  STN                                  SIPAP++S TLGALPSASG+
Sbjct: 111  S--STNVGNEDDDEDDDDDEEQEQKKQPVVQTASVKSFLSSIPAPRHSATLGALPSASGS 168

Query: 605  GRRSMLETDASASNLSNV--GTMGSDAAVN-SSIGYSEDQSGD-------GSSLSY---- 468
            GRRS ++ D      S V     GS+A V+ SSIGY E QS +       G  LS     
Sbjct: 169  GRRSTIDADVPGLKDSKVVNAASGSEAGVSTSSIGYYEGQSSNDQMSISSGGDLSNSSGY 228

Query: 467  -----DHSNWNSGSESYD---GYGAGYDDNVGTGEGVNYSNW---KGQTXXXXXXXXXXX 321
                 D+S+W  GSE+Y    GYGA Y++N G+G   +Y NW    G +           
Sbjct: 229  ANGGGDYSSWGHGSENYANHAGYGA-YENNGGSGVAGDYQNWDGGNGDSVNYNGDYGSYA 287

Query: 320  XXXXYEGNWIDESGATAVAEASRLAESGLQVPGKRGRNDTPEQIVEVKQDELLKNRPRED 141
                YE NW D   A    E S  AE+  +V GKRGRN+ PE+IVEVKQDEL+K+RPRED
Sbjct: 288  NYGQYENNWADVPTAAVGPEVSGFAENAWRVSGKRGRNNAPEEIVEVKQDELMKDRPRED 347

Query: 140  QVKLTGIAFGPAYKPASTKGKPTKLHKRKHQIGSLLYDLRQKEMEL 3
            QVKLTGIAFGPAY+P STKGKP+KLHKRKHQIGSL +D++QKEMEL
Sbjct: 348  QVKLTGIAFGPAYQPTSTKGKPSKLHKRKHQIGSLFFDMKQKEMEL 393


>ref|XP_003549295.1| PREDICTED: proline-rich protein PRCC-like [Glycine max]
          Length = 372

 Score =  205 bits (522), Expect = 6e-50
 Identities = 160/445 (35%), Positives = 200/445 (44%), Gaps = 5/445 (1%)
 Frame = -3

Query: 1322 MDSLLANYASSDXXXXXXXXXXXEPKQLEESLNPEKFKVEDEGENDFLSKPSRKILDSLP 1143
            MDSLLANYASSD                            +E +    S P      SLP
Sbjct: 1    MDSLLANYASSD----------------------------EEEDQQQPSPPKTTSFSSLP 32

Query: 1142 PPKSSLFNSLPPPKSHLAQTPFDFKEPSMPKP-QPQFKREELEEIPENRNPKXXXXXXXX 966
             PKSSLF SLP PKS    +PF     S+P P QP  +   L     N NPK        
Sbjct: 33   QPKSSLFQSLPQPKS----SPFSSLFQSLPPPKQPSSESASLPNPNPNPNPK-------- 80

Query: 965  XXXXXXXXXXXXXXXXXXXXXXXXSDISETPQILPNXXXXXXXXXXXSKRVVQFRPP--P 792
                                          PQI               KRVVQFRPP  P
Sbjct: 81   ------------------------------PQI----------EEPRPKRVVQFRPPIIP 100

Query: 791  IMNPISTNAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSIPAPKNSTTLGALPSAS 612
            + NP   +                                  SIPAP+N+ TLG + ++S
Sbjct: 101  LPNPTQLD---DDDDDEEEERNRRKNKLESSTQTSSVKSFLASIPAPRNTATLG-VQASS 156

Query: 611  GTGRRSMLETD--ASASNLSNVGTMGSDAAVNSSIGYSEDQSGDGSSLSYDHSNWNSGSE 438
            G+GRRS+LET+  A ASN         D +      Y   Q       +Y + N+ SG+E
Sbjct: 157  GSGRRSILETESPAPASNSGGSNNFPVDQSTGDYENYENYQYATDQYANY-YGNYGSGAE 215

Query: 437  SYDGYGAGYDDNVGTGEGVNYSNWKGQTXXXXXXXXXXXXXXXYEGNWIDESGATAVAEA 258
                 G   +  V       Y N+                   Y  NW D S AT V EA
Sbjct: 216  PGSS-GTESEAGVAAYGTEQYGNY-------GDAYAAYGDYGQYGNNWGDVSAATPVPEA 267

Query: 257  SRLAESGLQVPGKRGRNDTPEQIVEVKQDELLKNRPREDQVKLTGIAFGPAYKPASTKGK 78
            S +++S +++PGKRGR++ P +++EVKQ+EL+KNRPREDQ KLTGIAFGP Y+PASTKGK
Sbjct: 268  SGISDSVMRIPGKRGRHEIPTEVIEVKQEELIKNRPREDQAKLTGIAFGPTYQPASTKGK 327

Query: 77   PTKLHKRKHQIGSLLYDLRQKEMEL 3
            PTKLHKRKHQIGSL +D++Q EM+L
Sbjct: 328  PTKLHKRKHQIGSLYFDMKQNEMKL 352


>ref|XP_007014432.1| C-terminal, putative [Theobroma cacao] gi|508784795|gb|EOY32051.1|
            C-terminal, putative [Theobroma cacao]
          Length = 542

 Score =  203 bits (517), Expect = 2e-49
 Identities = 162/457 (35%), Positives = 216/457 (47%), Gaps = 17/457 (3%)
 Frame = -3

Query: 1322 MDSLLANYASSDXXXXXXXXXXXEPKQLEESLNPEKFKVEDEGENDFLSKPSRKILDSLP 1143
            M+SLLANYASSD                           E+E +      P    + SLP
Sbjct: 139  MESLLANYASSD---------------------------EEEEQQHRQPPPPTSHVSSLP 171

Query: 1142 PPKSS-LFNSLPPPKSHLAQTPFDFKEPSMPKPQPQFKREELE----EIPENRNPKXXXX 978
             PKSS LF+SLP PK    QT    + P++P      +RE++E     +P  + P     
Sbjct: 172  QPKSSSLFSSLPHPK----QTS---QAPNIPIDHAN-QREDVEIPKLSVPHPKTPSNL-- 221

Query: 977  XXXXXXXXXXXXXXXXXXXXXXXXXXXXSDISETPQILPNXXXXXXXXXXXSKRVVQFRP 798
                                           S  PQ  P             KR+VQF+P
Sbjct: 222  ------------------------------FSSRPQ--PKSQAPQQQQPTNVKRIVQFKP 249

Query: 797  PPIMNPISTNAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSIPAPKNSTTLGALPS 618
            P I  P + +                                  SIPAP+NSTTLG  P+
Sbjct: 250  PII--PTNHDDDDDEDDDDEKKERRRRRESETLAQGPSVKSFLSSIPAPRNSTTLGVAPT 307

Query: 617  ASGTGRRSMLETDASASNLSNVGTMGSDAAVNSSI-GYSEDQSGDGSSLSYDHSNWNSGS 441
             SG+GRRS++ET    S  S V    ++A++N +   YS  +SG GS+     +   S S
Sbjct: 308  -SGSGRRSIIETQVPTST-SAVFEDKNEASINQNAPNYSNYESGIGSNAGNSGNYQTSVS 365

Query: 440  ES---YDGYGAGYDDNVGT-GEGVNYSNWK-------GQTXXXXXXXXXXXXXXXYEGNW 294
             +   Y  Y +  D NVG      +Y +++       G                 YE  W
Sbjct: 366  HNAGNYGNYESVVDQNVGHYATYADYGSYQSSSGPNIGSIGGVTSYGTCGDFHGQYENTW 425

Query: 293  IDESGATAVAEASRLAESGLQVPGKRGRNDTPEQIVEVKQDELLKNRPREDQVKLTGIAF 114
            +D S AT + E + +AE G++V GKRGRN+ P +IVEV+QDEL+KNRPREDQVK+TGIAF
Sbjct: 426  VDGSAATTLPEITGMAEIGVKVKGKRGRNELPTEIVEVRQDELMKNRPREDQVKMTGIAF 485

Query: 113  GPAYKPASTKGKPTKLHKRKHQIGSLLYDLRQKEMEL 3
            GP+Y+PA+TKGKP+KLHKRKHQIGSL +D++QKEMEL
Sbjct: 486  GPSYQPAATKGKPSKLHKRKHQIGSLYFDMKQKEMEL 522


>gb|KHN17634.1| Proline-rich protein PRCC [Glycine soja]
          Length = 370

 Score =  202 bits (514), Expect = 5e-49
 Identities = 160/445 (35%), Positives = 200/445 (44%), Gaps = 5/445 (1%)
 Frame = -3

Query: 1322 MDSLLANYASSDXXXXXXXXXXXEPKQLEESLNPEKFKVEDEGENDFLSKPSRKILDSLP 1143
            MDSLLANYASSD                            +E +    S P      SLP
Sbjct: 1    MDSLLANYASSD----------------------------EEEDQQQPSPPKTTSFSSLP 32

Query: 1142 PPKSSLFNSLPPPKSHLAQTPFDFKEPSMPKP-QPQFKREELEEIPENRNPKXXXXXXXX 966
             PKSSLF SLP PKS    +PF     S+P P QP  +   L     N NPK        
Sbjct: 33   QPKSSLFQSLPQPKS----SPFSSLFQSLPPPKQPSSESASLPN--PNPNPK-------- 78

Query: 965  XXXXXXXXXXXXXXXXXXXXXXXXSDISETPQILPNXXXXXXXXXXXSKRVVQFRPP--P 792
                                          PQI               KRVVQFRPP  P
Sbjct: 79   ------------------------------PQI----------EEPRPKRVVQFRPPIIP 98

Query: 791  IMNPISTNAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSIPAPKNSTTLGALPSAS 612
            + NP   +                                  SIPAP+N+ TLG + ++S
Sbjct: 99   LPNPTQLD---DDDDDEEEERNRRKNKLESSTQTSSVKSFLASIPAPRNTATLG-VQASS 154

Query: 611  GTGRRSMLETD--ASASNLSNVGTMGSDAAVNSSIGYSEDQSGDGSSLSYDHSNWNSGSE 438
            G+GRRS+LET+  A ASN         D +      Y   Q       +Y + N+ SG+E
Sbjct: 155  GSGRRSILETESPAPASNSGGSNNFPVDQSTGDYENYENYQYATDQYANY-YGNYGSGAE 213

Query: 437  SYDGYGAGYDDNVGTGEGVNYSNWKGQTXXXXXXXXXXXXXXXYEGNWIDESGATAVAEA 258
                 G   +  V       Y N+                   Y  NW D S AT V EA
Sbjct: 214  PGSS-GTESEAGVAAYGTEQYGNY-------GDAYAAYGDYGQYGNNWGDVSAATPVPEA 265

Query: 257  SRLAESGLQVPGKRGRNDTPEQIVEVKQDELLKNRPREDQVKLTGIAFGPAYKPASTKGK 78
            S +++S +++PGKRGR++ P +++EVKQ+EL+KNRPREDQ KLTGIAFGP Y+PASTKGK
Sbjct: 266  SGISDSVMRIPGKRGRHEIPTEVIEVKQEELIKNRPREDQAKLTGIAFGPTYQPASTKGK 325

Query: 77   PTKLHKRKHQIGSLLYDLRQKEMEL 3
            PTKLHKRKHQIGSL +D++Q EM+L
Sbjct: 326  PTKLHKRKHQIGSLYFDMKQNEMKL 350


>gb|ACU18505.1| unknown [Glycine max]
          Length = 372

 Score =  201 bits (511), Expect = 1e-48
 Identities = 159/445 (35%), Positives = 198/445 (44%), Gaps = 5/445 (1%)
 Frame = -3

Query: 1322 MDSLLANYASSDXXXXXXXXXXXEPKQLEESLNPEKFKVEDEGENDFLSKPSRKILDSLP 1143
            MDSLLANYASS                             +E +    S P      SLP
Sbjct: 1    MDSLLANYASSG----------------------------EEEDQQQPSPPKTTSFSSLP 32

Query: 1142 PPKSSLFNSLPPPKSHLAQTPFDFKEPSMPKP-QPQFKREELEEIPENRNPKXXXXXXXX 966
             PKSSLF SLP PKS    +PF     S+P P QP  +   L     N NPK        
Sbjct: 33   QPKSSLFQSLPQPKS----SPFSSLFQSLPPPKQPSSESASLPNPNPNPNPK-------- 80

Query: 965  XXXXXXXXXXXXXXXXXXXXXXXXSDISETPQILPNXXXXXXXXXXXSKRVVQFRPP--P 792
                                          PQI               KRVVQFRPP  P
Sbjct: 81   ------------------------------PQI----------EEPRPKRVVQFRPPIIP 100

Query: 791  IMNPISTNAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSIPAPKNSTTLGALPSAS 612
            + NP   +                                  SIPAP+N+ TLG + ++S
Sbjct: 101  LPNPTQLD---DDDDDEEEERNRRKNKLESSTQTSSVKSFLASIPAPRNTATLG-VQASS 156

Query: 611  GTGRRSMLETD--ASASNLSNVGTMGSDAAVNSSIGYSEDQSGDGSSLSYDHSNWNSGSE 438
            G+GRRS+LET+  A ASN         D +      Y   Q       +Y + N+ SG+E
Sbjct: 157  GSGRRSILETESPAPASNSGGSNNFPVDQSTGDYENYENYQYATDQYANY-YGNYGSGAE 215

Query: 437  SYDGYGAGYDDNVGTGEGVNYSNWKGQTXXXXXXXXXXXXXXXYEGNWIDESGATAVAEA 258
                 G   +  V       Y N+                   Y  NW D S AT V EA
Sbjct: 216  PGSS-GTESEAGVAAYGTEQYGNY-------GDAYAAYGDYGQYGNNWGDVSAATPVPEA 267

Query: 257  SRLAESGLQVPGKRGRNDTPEQIVEVKQDELLKNRPREDQVKLTGIAFGPAYKPASTKGK 78
            S +++S +++PGKRGR++ P + +EVKQ+EL+KNRPREDQ KLTGIAFGP Y+PASTKGK
Sbjct: 268  SGISDSVMRIPGKRGRHEIPTEAIEVKQEELIKNRPREDQAKLTGIAFGPTYQPASTKGK 327

Query: 77   PTKLHKRKHQIGSLLYDLRQKEMEL 3
            PTKLHKRKHQIGSL +D++Q EM+L
Sbjct: 328  PTKLHKRKHQIGSLYFDMKQNEMKL 352


>ref|XP_007154707.1| hypothetical protein PHAVU_003G140900g [Phaseolus vulgaris]
            gi|561028061|gb|ESW26701.1| hypothetical protein
            PHAVU_003G140900g [Phaseolus vulgaris]
          Length = 375

 Score =  197 bits (500), Expect = 2e-47
 Identities = 156/456 (34%), Positives = 209/456 (45%), Gaps = 16/456 (3%)
 Frame = -3

Query: 1322 MDSLLANYASSDXXXXXXXXXXXEPKQLEESLNPEKFKVEDEGENDFLSKPSRKILDSLP 1143
            MDSLLANYASS+                           E+E +   +   +     SLP
Sbjct: 1    MDSLLANYASSEE--------------------------EEEEQQQPIPPKTTTSFSSLP 34

Query: 1142 PPKSSLFNSLPPPKSHLAQTPFDFKEPSMPKPQPQFKREELEEIPENRNPKXXXXXXXXX 963
             PKSSLF SL  PKS  + + F     S+P+P+        + +P  + P          
Sbjct: 35   QPKSSLFQSLSQPKS--SSSFFQ----SLPQPKSS-SSSFFQSLPPPKQPSLATSSETA- 86

Query: 962  XXXXXXXXXXXXXXXXXXXXXXXSDISETPQILPNXXXXXXXXXXXSKRVVQFRPP--PI 789
                                    D    PQI               KRVVQFRPP  P+
Sbjct: 87   ------------------------DPKPKPQI----------PQPQPKRVVQFRPPIIPL 112

Query: 788  MNPISTNAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSIPAPKNSTTLGALPSASG 609
             NP  T                                   +IPAP+N+ TLG + ++SG
Sbjct: 113  TNP--TQLLDDDDEEEEEERDRRKKKLVSSTQTSSVKSFLANIPAPRNAATLG-VHASSG 169

Query: 608  TGRRSMLETDASASNLSNVGTMGSDAAVNSSIG-YSEDQSGDGSSLSYD--HSNWNS--- 447
            +GRRS++ET++ A   ++     S   V+ S+G Y  D++   ++  Y   + N+ S   
Sbjct: 170  SGRRSIIETESPALETASNSGGSSSVTVDQSVGDYGNDENYQYATDQYAGYYGNYGSVPE 229

Query: 446  --------GSESYDGYGAGYDDNVGTGEGVNYSNWKGQTXXXXXXXXXXXXXXXYEGNWI 291
                    G+E Y  YG  Y      G+   Y N                       NW 
Sbjct: 230  PEAGAAAYGTEQYGNYGEAY------GDYGQYGN-----------------------NWG 260

Query: 290  DESGATAVAEASRLAESGLQVPGKRGRNDTPEQIVEVKQDELLKNRPREDQVKLTGIAFG 111
            D S A  V+EAS ++ES +++PGKRGR++ P +++EVKQDEL+KNRPREDQVKLTGIAFG
Sbjct: 261  DVSAAP-VSEASGISESVVRIPGKRGRHEVPMEVIEVKQDELIKNRPREDQVKLTGIAFG 319

Query: 110  PAYKPASTKGKPTKLHKRKHQIGSLLYDLRQKEMEL 3
            P Y+PASTKGKPTKLHKRKHQIGSL +D+RQ EM+L
Sbjct: 320  PTYQPASTKGKPTKLHKRKHQIGSLYFDMRQNEMKL 355


>ref|XP_012068474.1| PREDICTED: uncharacterized protein LOC105631086 [Jatropha curcas]
            gi|643734366|gb|KDP41111.1| hypothetical protein
            JCGZ_03241 [Jatropha curcas]
          Length = 403

 Score =  194 bits (493), Expect = 1e-46
 Identities = 155/448 (34%), Positives = 201/448 (44%), Gaps = 8/448 (1%)
 Frame = -3

Query: 1322 MDSLLANYASSDXXXXXXXXXXXEPKQLEESLNPEKFKVEDEGENDFLSKPSRKILDSLP 1143
            MDSLLANYASSD                         + E+    + +S P  KI  S  
Sbjct: 1    MDSLLANYASSD-------------------------EEEENQHQNSISYP--KITSS-- 31

Query: 1142 PPKSSLFNSLPPPKSHLAQTPFDFKEPSMPKPQPQFKREELEEIPENRNPKXXXXXXXXX 963
               +S F+SLP PKS L  +           PQPQ +        +N             
Sbjct: 32   ---ASHFSSLPQPKSSLLFSAI---------PQPQHQLSTCVVHHDNHGNNKNIEEDEDD 79

Query: 962  XXXXXXXXXXXXXXXXXXXXXXXSDISETPQILPNXXXXXXXXXXXSKRVVQFRPPPIMN 783
                                     IS    + P             KRVVQF+PP  + 
Sbjct: 80   PKRSSKSSSLFSFLPQPKTQAPQQPISSVSSLDPTP-----------KRVVQFKPPINLT 128

Query: 782  PIS-TNAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSIPAPKNSTTLGALPSASGT 606
             +  ++                                  SIPAPKNS+TLG LPSA+G+
Sbjct: 129  YVKPSDLDDEDDEDEGEMEKKWRKESEALPQSSSVKSFLSSIPAPKNSSTLGVLPSATGS 188

Query: 605  GRRSMLETDASASNLSNVGTMGSDAAVNSSIGYSEDQSGDGSSLSYDHS-NWNSGSE-SY 432
            GRRS++ET    S+  + G        N         S DG+SLSY+   + N GS  +Y
Sbjct: 189  GRRSIVETKTPTSSSGSFGAENDQTMGNYG-------SYDGTSLSYESGPDKNGGSNLNY 241

Query: 431  DGYGAGYDDNVGT-----GEGVNYSNWKGQTXXXXXXXXXXXXXXXYEGNWIDESGATAV 267
              Y +G   ++G       +G +Y +++  T                   W DE  A AV
Sbjct: 242  GSYESGISQDIGQKVNAGDDGSSYGSYENYTSYGTYNDYQQFG-----NTWSDELAA-AV 295

Query: 266  AEASRLAESGLQVPGKRGRNDTPEQIVEVKQDELLKNRPREDQVKLTGIAFGPAYKPAST 87
             E +  +ES L+VPGKRGR D   +++EVKQDEL KNRPREDQVKLTGIAFGP+Y+P ST
Sbjct: 296  PERTGPSESALRVPGKRGRKDIVTEVIEVKQDELTKNRPREDQVKLTGIAFGPSYEPTST 355

Query: 86   KGKPTKLHKRKHQIGSLLYDLRQKEMEL 3
            KGKP+KLHKRKHQIGSL +D++QKEMEL
Sbjct: 356  KGKPSKLHKRKHQIGSLYFDMKQKEMEL 383


>ref|XP_006453238.1| hypothetical protein CICLE_v10008415mg [Citrus clementina]
            gi|568840653|ref|XP_006474280.1| PREDICTED: suppressor
            protein SRP40-like [Citrus sinensis]
            gi|557556464|gb|ESR66478.1| hypothetical protein
            CICLE_v10008415mg [Citrus clementina]
          Length = 420

 Score =  192 bits (487), Expect = 7e-46
 Identities = 155/453 (34%), Positives = 205/453 (45%), Gaps = 13/453 (2%)
 Frame = -3

Query: 1322 MDSLLANYASSDXXXXXXXXXXXEPKQLEESLNPEKFKVEDEGENDFLSKPSRKILDSLP 1143
            MDSLLANYASSD             +Q      P  F           +K +  +  SLP
Sbjct: 1    MDSLLANYASSDEEEEQQK------QQQSSHSKPVSFSSSSS------TKAASSLFSSLP 48

Query: 1142 PPKSS-LFNSLPPPKSHLAQTPFDFKEPSMPKPQPQFKREELEEIPENRNPKXXXXXXXX 966
             PKSS LF+S+P PK          K P           +  EE  E++ P         
Sbjct: 49   QPKSSPLFSSIPQPKQQQQTN----KNPVSKTLTSNNFDDHDEEEKESKKPTSNSNKPSS 104

Query: 965  XXXXXXXXXXXXXXXXXXXXXXXXSDISETPQILPNXXXXXXXXXXXSKRVVQFRPP-PI 789
                                          P+   +           +KRVVQF+PP PI
Sbjct: 105  IFSSLPQ-----------------------PKTQTSSQQTLNPLEKTTKRVVQFKPPLPI 141

Query: 788  MNPISTNAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSIPAPKNSTTLGALPSASG 609
                S+N                                  SIPAPK+S  LG+  S+SG
Sbjct: 142  Q---SSNFSDDDDEDDEEKERKKRKQAEFFNQSSSVKSFLSSIPAPKSSAALGSGHSSSG 198

Query: 608  TGRRSMLETDASASNLSNVGT-----MGSDAAVNSSIGYSEDQSGDGSSLSYDHS----- 459
            +GRRS+++T+A AS+    G       G DA  + +     DQ+ D S  +YD S     
Sbjct: 199  SGRRSIIDTEAPASSSVGFGAENEAGTGQDAVNHENYDVGSDQNVD-SYANYDQSVENYA 257

Query: 458  NWNSGSE-SYDGYGAGYDDNVGTGEGVNYSNWKGQTXXXXXXXXXXXXXXXYEGNWIDES 282
            N+ +G + SY  YG   D NV +G+  +Y N+ G +                      E+
Sbjct: 258  NYEAGIDPSYVNYGI--DQNVHSGDASSYMNYGGYSSYGDYNGYGDYGQY--------EA 307

Query: 281  GATAVAEASRLAESGLQVPGKRGRNDTPEQIVEVKQDELLKNRPREDQVKLTGIAFGPAY 102
              T V E + +AES +++ GKRGR + P +IVEVKQDEL+KNRP ED+ KLTGIAFGP+Y
Sbjct: 308  ATTTVQEPAMVAESVVRMEGKRGRKEIPTEIVEVKQDELMKNRPSEDKAKLTGIAFGPSY 367

Query: 101  KPASTKGKPTKLHKRKHQIGSLLYDLRQKEMEL 3
             P S KGKP+KLHKRKHQIGSL +D++QKEMEL
Sbjct: 368  LPVSVKGKPSKLHKRKHQIGSLFFDMKQKEMEL 400


>ref|XP_004296496.1| PREDICTED: proline-rich protein PRCC [Fragaria vesca subsp. vesca]
          Length = 372

 Score =  192 bits (487), Expect = 7e-46
 Identities = 158/451 (35%), Positives = 205/451 (45%), Gaps = 11/451 (2%)
 Frame = -3

Query: 1322 MDSLLANYASSDXXXXXXXXXXXEPKQLEESLNPEKFKVEDEGENDFLSKPSRKILDSLP 1143
            MDSLLA+YASSD              QL+   NP              S PS     SLP
Sbjct: 1    MDSLLASYASSDEEEDQKP-------QLQPPPNP--------------SSPSSS---SLP 36

Query: 1142 PPK-SSLFNSLPPPKSHLAQTPFDFKEPSMPKPQPQFKREELEEIPENRNPKXXXXXXXX 966
             PK SSLFNSLPPPK     + F     S+PKP+ +         P++  PK        
Sbjct: 37   KPKPSSLFNSLPPPKPTSRSSLFS----SLPKPKLE---------PQDIKPKIPNKPIS- 82

Query: 965  XXXXXXXXXXXXXXXXXXXXXXXXSDISETPQILPNXXXXXXXXXXXSKRVVQFRPPPIM 786
                                     + S + QI               KRVVQF+PP   
Sbjct: 83   -------------------------NSSSSLQI--------------PKRVVQFKPPIFA 103

Query: 785  NPISTNAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSIPAPKNSTTLGA---LPSA 615
            N    +                                   IPAP+NS TLG    L S 
Sbjct: 104  NSTHLDEDDDDDNEEEERRRRKASEASSAQPQSVTSFLSA-IPAPRNSATLGVSSGLGSG 162

Query: 614  SGTGRRSMLETDASASNLSNVGTMGSDAAVNSSIGYSEDQSGDGSSLSYDHSNWNSGSES 435
            +G+GRR+++ET++S               V S +G  ++++      SYD+SN++   ES
Sbjct: 163  AGSGRRAIVETESSFK-------------VESDVGLDQNEN----VASYDNSNFDPNVES 205

Query: 434  YDGYGA------GYDDNVGTGEGVNYSNWKGQTXXXXXXXXXXXXXXXYEGN-WIDESGA 276
            Y+GYG       G D NV +G  +  +   G                   GN W D S A
Sbjct: 206  YEGYGGYESYQYGVDHNVDSGVQLQ-TGVSGSDALSYGGYGGYSGYAGKHGNEWADGSQA 264

Query: 275  TAVAEASRLAESGLQVPGKRGRNDTPEQIVEVKQDELLKNRPREDQVKLTGIAFGPAYKP 96
              +  +    ++G++V GKR RND P +I+EVKQDEL+KNRPREDQ K+TGIAFGP+Y+P
Sbjct: 265  AVMGMSG---DAGVRVSGKRRRNDVPTEILEVKQDELMKNRPREDQAKVTGIAFGPSYQP 321

Query: 95   ASTKGKPTKLHKRKHQIGSLLYDLRQKEMEL 3
             S KGKPTKLHKRKHQIGSL +D++QKEMEL
Sbjct: 322  VSAKGKPTKLHKRKHQIGSLYFDMKQKEMEL 352


>ref|XP_009626418.1| PREDICTED: proline-rich protein PRCC [Nicotiana tomentosiformis]
          Length = 432

 Score =  191 bits (486), Expect = 9e-46
 Identities = 164/477 (34%), Positives = 210/477 (44%), Gaps = 37/477 (7%)
 Frame = -3

Query: 1322 MDSLLANYASSDXXXXXXXXXXXEPKQLEESLNPEKFKVEDEGENDFLSKPSRKILDSLP 1143
            M+SL ANYASSD                +ES   +  + + +  N   S PS     SLP
Sbjct: 1    MESLFANYASSDEDDE------------QESQQDKHQQKQVQSSNS--SNPS-VFSSSLP 45

Query: 1142 PPKSSLFNSLPPPKSHLAQTPFDFKEPSMPKPQPQFKR-EELEEIPENRNPKXXXXXXXX 966
            P KSS   SLPPPKS    +P +   PS  KP  + +  EE +EIP+  +          
Sbjct: 46   PTKSSF--SLPPPKSQSKTSP-NLTTPSAVKPLNEKQHFEEEDEIPDKPSS--------- 93

Query: 965  XXXXXXXXXXXXXXXXXXXXXXXXSDISETPQILPNXXXXXXXXXXXSKRVVQFRPPPIM 786
                                      +   P+  P             KRVVQF+PP   
Sbjct: 94   ------FFSSLPQPNKSTPSSSSLFSVLPPPKTTP----LSSSDPKPKKRVVQFKPPA-- 141

Query: 785  NPISTNAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXS----IPAPKNSTTLGALPS 618
            NP S  +                                 S    IPAPKNST+LG L  
Sbjct: 142  NPFSVKSNNSVDEDEDDDDEGEKEKQRKRSESFTQTPSVKSFLSSIPAPKNSTSLGVL-- 199

Query: 617  ASGTGRRSMLETDASASNLSNVGTMGSDAAVNSSIGYSEDQSGDGS-------------- 480
             SG+GRRS +E D    N +   T  S+  V+S+ GY+E Q  DGS              
Sbjct: 200  GSGSGRRSTIEADVPVPNSATSNTQ-SEVLVSSNTGYNESQQVDGSLESSMGGIGGPTEY 258

Query: 479  ----SLSYDHS-NW------------NSGSESYDGYGAGYDDNVGTGEGVNYSNWKGQTX 351
                 ++ D+S NW            N G+  Y+ Y  GY+ N GT  G    + K    
Sbjct: 259  SASLGVAGDYSSNWGAHGYVNPESCANDGTSGYENY-PGYESNSGTYTGYEQYDHK---- 313

Query: 350  XXXXXXXXXXXXXXYEGNWIDESGATAVAEA-SRLAESGLQVPGKRGRNDTPEQIVEVKQ 174
                              W D S  TA A A +  AE  L +PGKRGR D P++ VEV Q
Sbjct: 314  ------------------WTDGSSTTAAAAAITETAEVALTLPGKRGRKDAPQKFVEVNQ 355

Query: 173  DELLKNRPREDQVKLTGIAFGPAYKPASTKGKPTKLHKRKHQIGSLLYDLRQKEMEL 3
            DEL+KNRPREDQ +LTGIAFGP+Y+P S+KGKP+KL KRKHQI +L +D++QKEMEL
Sbjct: 356  DELMKNRPREDQSRLTGIAFGPSYQPVSSKGKPSKLLKRKHQISTLYFDMKQKEMEL 412


>ref|XP_012463463.1| PREDICTED: uncharacterized protein LOC105782915 [Gossypium raimondii]
            gi|763817092|gb|KJB83944.1| hypothetical protein
            B456_013G243000 [Gossypium raimondii]
          Length = 414

 Score =  191 bits (485), Expect = 1e-45
 Identities = 169/488 (34%), Positives = 214/488 (43%), Gaps = 48/488 (9%)
 Frame = -3

Query: 1322 MDSLLANYASSDXXXXXXXXXXXEPKQLEESLNPEKFKVEDEGENDFLSKPSRKILDSLP 1143
            M+SLLANYASSD            P+Q+     P   KV                  SLP
Sbjct: 1    MESLLANYASSDDDE---------PQQITHPPPPPPPKVS-----------------SLP 34

Query: 1142 PPKSS-LFNSLPPPKSHLAQTPFDFKEPS-----------MPKPQPQFKREELEEIPENR 999
             PKSS LF +LP PK  L        E             +PKP           +P  +
Sbjct: 35   QPKSSSLFTNLPQPKQSLKSFTKHHDEDGNGGGGGEVAVRVPKPA----------VPHPK 84

Query: 998  NPKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSDISETPQILPNXXXXXXXXXXXSK 819
            NP                                    S  PQ  P             K
Sbjct: 85   NPSNL--------------------------------FSHLPQPKPQQPPNPPVA----K 108

Query: 818  RVVQFRPPPIMNPISTNAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSIPAPKNST 639
            R+VQF+PP  +NP                                      SIPAP+NST
Sbjct: 109  RIVQFKPP--INP--NTHVDSDDDDEEENERPKRGESETLAQGPSVKSFLSSIPAPRNST 164

Query: 638  TLGALPSASGTGRRSMLET-------------------DASASNLSNVGTMGSDAAVNSS 516
            TLG  PS SG+GRRS+++T                   D +  N SN    GSD    ++
Sbjct: 165  TLGVAPS-SGSGRRSIIDTQVIPTSTSSTFEDKKEASIDNNPPNYSNY-EWGSDVNAGTT 222

Query: 515  IGYSE----DQSG-DGSSLSYDHSNWNSGSES----YDGYGAGYDDNVGT-------GEG 384
            +GY+     DQS  D +S +Y +++ N GS +    Y  Y +  D N+G        G  
Sbjct: 223  VGYNNYVNYDQSSVDQNSGNYGNNDQNIGSYANYADYSSYQSSSDPNIGGVDAATSYGSY 282

Query: 383  VNYSNWKGQTXXXXXXXXXXXXXXXYEGNWIDESG-ATAVAEASRLAESGLQVPGKRGRN 207
             +Y N+  Q                YE NW D S  A+ + E   +A+ G++V GKRGRN
Sbjct: 283  ESYGNYHVQ----------------YENNWGDGSTTASMLPETKGIADFGVKVKGKRGRN 326

Query: 206  DTPEQIVEVKQDELLKNRPREDQVKLTGIAFGPAYKPASTKGKPTKLHKRKHQIGSLLYD 27
            D P +IVEVKQD+L KNRPREDQVK+TGIAFGP+Y+PAS+KGKPTKLHKRKHQIGSL +D
Sbjct: 327  DLPVEIVEVKQDDLTKNRPREDQVKMTGIAFGPSYQPASSKGKPTKLHKRKHQIGSLYFD 386

Query: 26   LRQKEMEL 3
            ++QKEMEL
Sbjct: 387  MKQKEMEL 394


>gb|KHN22658.1| hypothetical protein glysoja_027546 [Glycine soja]
          Length = 369

 Score =  191 bits (485), Expect = 1e-45
 Identities = 159/461 (34%), Positives = 195/461 (42%), Gaps = 21/461 (4%)
 Frame = -3

Query: 1322 MDSLLANYASSDXXXXXXXXXXXEPKQLEESLNPEKFKVEDEGENDFLSKPSRKILDSLP 1143
            MDSLLANYASSD                EE   P              S P      SLP
Sbjct: 1    MDSLLANYASSDE---------------EEDQQP--------------SPPKTTTFSSLP 31

Query: 1142 PPKSSLFNSLPPPKSHLAQTPFDFKEPSMPKP-QPQFKREELEEIPENRNPKXXXXXXXX 966
             PK SLF SLP PKS L Q        S+P P QP  +   L     N NP         
Sbjct: 32   QPKLSLFQSLPQPKSSLFQ--------SLPPPKQPSTESSSLPNPNPNPNP--------- 74

Query: 965  XXXXXXXXXXXXXXXXXXXXXXXXSDISETPQILPNXXXXXXXXXXXSKRVVQFRPPPIM 786
                                          PQI               KRVVQFRPP I 
Sbjct: 75   ---------------------------DPKPQI----------EKTQPKRVVQFRPPIIP 97

Query: 785  NPISTNAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSIPAPKNSTTLGALPSASGT 606
             P  +                                   SIPAP+N+ TLG + ++SG+
Sbjct: 98   LPHPSQHDDDDDDDEEEERNRRKKKLEFSTQTSSVKSFLASIPAPRNTATLG-VQASSGS 156

Query: 605  GRRSMLETDAS--ASNLSNVGTMGSDAAVNSSIGYSEDQSGDGSSLSYDHSNWNSGSE-- 438
            GR+S+LET+    ASN      +  D +      + + Q       SY + N+ SG+E  
Sbjct: 157  GRKSILETETPPPASNSGGFSNVPVDQSTGDYENFDDYQYATDQYASY-YGNFGSGAEPG 215

Query: 437  ----------------SYDGYGAGYDDNVGTGEGVNYSNWKGQTXXXXXXXXXXXXXXXY 306
                             Y  YG  Y      G+   Y N                     
Sbjct: 216  SSGTEPKAGVAAYGTEQYGNYGDAY---ASYGDYGQYGN--------------------- 251

Query: 305  EGNWIDESGATAVAEASRLAESGLQVPGKRGRNDTPEQIVEVKQDELLKNRPREDQVKLT 126
              NW D S A  V EAS +  S +++PGKRGR++ P +++EVKQ+EL+KNRPREDQVKLT
Sbjct: 252  --NWGDVS-APPVLEASGIDVSVIRIPGKRGRHEIPTEVIEVKQEELIKNRPREDQVKLT 308

Query: 125  GIAFGPAYKPASTKGKPTKLHKRKHQIGSLLYDLRQKEMEL 3
            GIAFGP Y+PASTKGKPTKLHKRKHQIGSL +D++Q EM+L
Sbjct: 309  GIAFGPTYQPASTKGKPTKLHKRKHQIGSLYFDMKQNEMKL 349


>ref|XP_002308111.2| hypothetical protein POPTR_0006s07450g [Populus trichocarpa]
            gi|550335710|gb|EEE91634.2| hypothetical protein
            POPTR_0006s07450g [Populus trichocarpa]
          Length = 403

 Score =  190 bits (482), Expect = 3e-45
 Identities = 146/405 (36%), Positives = 184/405 (45%), Gaps = 5/405 (1%)
 Frame = -3

Query: 1202 DEGENDFLSKPSRKILDSLPPPKSSLFNSLPPPKSHLAQTPFDFKEPSMPKPQPQFKREE 1023
            DE E D      R    + PP K SLF+SLP PKS  +     F     P  +P  K + 
Sbjct: 11   DEEEKDQPQPQQRHQTPASPPGKPSLFSSLPQPKSSSSL----FSSLPQPTQEPTSKPQV 66

Query: 1022 LEEIPENRNPKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSDISETPQILPNXXXXX 843
               IP+N N +                                   S    +  N     
Sbjct: 67   ---IPQNNNLRIANFKEEDKRPTFKSTTSLFSSLPQPKTETLQQPTSNLTPVDSNP---- 119

Query: 842  XXXXXXSKRVVQFRPPPIMNPISTNAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXS 663
                   KRVVQF+PP     I                                     S
Sbjct: 120  -------KRVVQFKPPINRPSILDEEDEDEEEKKEKERKRKKTESLLQSDSSSVKGFLSS 172

Query: 662  IPAPKNSTTLGALPSASGTGRRSMLETDASASNLSNVGTMGSDAAVNSSIGYSEDQSGDG 483
            IPAP+NS+TLG     SG+GRRS++E++   S+   VG         SS G+    S DG
Sbjct: 173  IPAPRNSSTLGVGSLGSGSGRRSVIESEGPTSSSGGVGAENESGVDQSSEGHV---SYDG 229

Query: 482  SSLSYDHSNW---NSGSESYDGYGAGYDDNVGTGEGVNYSNWKGQTXXXXXXXXXXXXXX 312
              + +DH+     N GS    G G     NVG G+GV+Y  ++                 
Sbjct: 230  GYVGFDHNGGDYVNYGSYE-SGAGQSVAQNVG-GDGVSYGGYESYGGYGDSGQYG----- 282

Query: 311  XYEGNWIDESGATAVAEASR--LAESGLQVPGKRGRNDTPEQIVEVKQDELLKNRPREDQ 138
                NW D S A AVAE      AES L++ GKR RN+ P +I+EVKQDEL+KNRPREDQ
Sbjct: 283  ---SNWDDRSVA-AVAETGSGGAAESALRMMGKRRRNEIPTEIIEVKQDELIKNRPREDQ 338

Query: 137  VKLTGIAFGPAYKPASTKGKPTKLHKRKHQIGSLLYDLRQKEMEL 3
            VK TGIAFGPAY+PAS+KGKP+KLHKRKHQIG+L +D++QKE EL
Sbjct: 339  VKSTGIAFGPAYQPASSKGKPSKLHKRKHQIGTLYFDMKQKETEL 383


>gb|KHG24356.1| Proline-rich PRCC [Gossypium arboreum]
          Length = 414

 Score =  189 bits (480), Expect = 4e-45
 Identities = 163/477 (34%), Positives = 214/477 (44%), Gaps = 37/477 (7%)
 Frame = -3

Query: 1322 MDSLLANYASSDXXXXXXXXXXXEPKQLEESLNPEKFKVEDEGENDFLSKPSRKILDSLP 1143
            M+SLLANYASSD            P+Q+     P                P  K+   L 
Sbjct: 1    MESLLANYASSDDDE---------PQQITHPPTP----------------PPPKVSSLLQ 35

Query: 1142 PPKSSLFNSLP-PPKSHLAQTPFDFKEPSMPKPQPQFKREELEEIPENRNPKXXXXXXXX 966
            P  SSLF SLP P +S  + T    ++ +         R     +P  +N          
Sbjct: 36   PKSSSLFTSLPQPKQSLKSSTKHHDEDRNGGGGGEVAVRVPKPSLPHPKNSSNL------ 89

Query: 965  XXXXXXXXXXXXXXXXXXXXXXXXSDISETPQILPNXXXXXXXXXXXSKRVVQFRPPPIM 786
                                       S  PQ  P             KR+VQF+PP  +
Sbjct: 90   --------------------------FSHLPQPKPQQPPNPPVA----KRIVQFKPP--I 117

Query: 785  NPISTNAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSIPAPKNSTTLGALPSASGT 606
            NP                                      SIPAP+NSTTLG  PS SG+
Sbjct: 118  NP--NTHVDSNDDEEEEKERPKRGESETLAQGPSVKSFLSSIPAPRNSTTLGVAPS-SGS 174

Query: 605  GRRSMLET-------------------DASASNLSNVGTMGSDAAVNSSIGYSE----DQ 495
            GRRS+++T                   D +A N SN    GSD    +++GY+     DQ
Sbjct: 175  GRRSIIDTQVIPTLTSSTFEDKKEASIDNNAPNYSNY-EWGSDVNAGTTVGYNNYVNYDQ 233

Query: 494  SG-DGSSLSYDHSNWNSGSES----YDGYGAGYDDNVGT-------GEGVNYSNWKGQTX 351
            S  D +S +Y +++ N GS +    Y  Y +  D N+G        G   +Y N+  Q  
Sbjct: 234  SSVDQNSGNYGNNDQNIGSYANYADYSSYQSSSDPNIGGVDAATSYGSYESYGNYHVQ-- 291

Query: 350  XXXXXXXXXXXXXXYEGNWIDESG-ATAVAEASRLAESGLQVPGKRGRNDTPEQIVEVKQ 174
                          YE NW D S  A+ + E + +A+ G+++ GKRGRND P +IVEVKQ
Sbjct: 292  --------------YENNWGDGSTTASMLPETTGIADFGVKIKGKRGRNDLPVEIVEVKQ 337

Query: 173  DELLKNRPREDQVKLTGIAFGPAYKPASTKGKPTKLHKRKHQIGSLLYDLRQKEMEL 3
            D+L KNRPREDQVK+TGIAFGP+Y+PAS+KGKPTKLHKRKHQIGSL +D++QKEMEL
Sbjct: 338  DDLTKNRPREDQVKMTGIAFGPSYQPASSKGKPTKLHKRKHQIGSLYFDMKQKEMEL 394


>ref|XP_011020044.1| PREDICTED: proline-rich protein PRCC [Populus euphratica]
          Length = 397

 Score =  189 bits (479), Expect = 6e-45
 Identities = 147/408 (36%), Positives = 185/408 (45%), Gaps = 8/408 (1%)
 Frame = -3

Query: 1202 DEGENDFLSKPSRKILDSLPPPKSSLFNSLPPPKSHLAQTPFDFKEPSMPKPQPQFKREE 1023
            DE E D      R    + PP K SLF+SLP PKS  +     F     PK +P  K + 
Sbjct: 11   DEEEKDQPRPQQRHQTPASPPGKPSLFSSLPQPKSSSSL----FSSLPQPKQEPASKPQV 66

Query: 1022 LEEIPENRNPKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSDISETPQ----ILPNX 855
               IPEN N +                                   S  PQ     L   
Sbjct: 67   ---IPENNNLRIANFKEEDKRPTVKSRTSL---------------FSSLPQPKTETLQQP 108

Query: 854  XXXXXXXXXXSKRVVQFRPPPIMNPISTNAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 675
                       KRVVQF+PP I  P   +                               
Sbjct: 109  TSNLTPADSNPKRVVQFKPP-INRPSILDEEEDEEEKKEKERKRKKTESLLQSDSSSVKG 167

Query: 674  XXXSIPAPKNSTTLGALPSASGTGRRSMLETDASASNLSNVGTMGSDAAVNSSIGYSEDQ 495
               SIPAP+NS++LG     SG+GRRS++E++   S    VG         SS GY   +
Sbjct: 168  FLSSIPAPRNSSSLGVGSLGSGSGRRSVIESEGPTSISGGVGAEKESGVDQSSEGY---E 224

Query: 494  SGDGSSLSYDHSNWNSGSE-SYDGYGAGYDDNVGTGEGVN-YSNWKGQTXXXXXXXXXXX 321
            S DG  + +DH   N G   +Y  Y +G D +V    G   Y ++ G             
Sbjct: 225  SYDGGYVGFDH---NGGDYVNYGSYESGTDQSVAQNVGGGGYESYGGYGDSGQYG----- 276

Query: 320  XXXXYEGNWIDESGATAVAEASR--LAESGLQVPGKRGRNDTPEQIVEVKQDELLKNRPR 147
                   NW D+    AVAE      AES L++ GKR RN+ P +I+EVKQDEL+KNRPR
Sbjct: 277  ------SNW-DDGSVAAVAETGSGGAAESALRMMGKRRRNEMPTEIIEVKQDELIKNRPR 329

Query: 146  EDQVKLTGIAFGPAYKPASTKGKPTKLHKRKHQIGSLLYDLRQKEMEL 3
            EDQVK TGIAFGPAY+PAS+KGKP+KLHKRKHQIG+L +D++QKE EL
Sbjct: 330  EDQVKSTGIAFGPAYQPASSKGKPSKLHKRKHQIGTLYFDMKQKETEL 377


>ref|XP_010046926.1| PREDICTED: proline-rich protein PRCC [Eucalyptus grandis]
            gi|702289030|ref|XP_010046927.1| PREDICTED: proline-rich
            protein PRCC [Eucalyptus grandis]
            gi|629113969|gb|KCW78644.1| hypothetical protein
            EUGRSUZ_C00107 [Eucalyptus grandis]
            gi|629113970|gb|KCW78645.1| hypothetical protein
            EUGRSUZ_C00107 [Eucalyptus grandis]
          Length = 404

 Score =  187 bits (476), Expect = 1e-44
 Identities = 153/456 (33%), Positives = 197/456 (43%), Gaps = 16/456 (3%)
 Frame = -3

Query: 1322 MDSLLANYASSDXXXXXXXXXXXEPKQLEESLNPEKFKVEDEGENDFLSKPSRKILDSLP 1143
            M+SL+ NYASSD                           ED  +      PSR    SLP
Sbjct: 1    MESLMVNYASSDEDE------------------------EDRRDEPQPHPPSRPPFSSLP 36

Query: 1142 PPKSSLFNSLPPPKSHLAQTPFDFKEPSMPKPQPQFKREELEEIPENRNPKXXXXXXXXX 963
            PPKSS  +S     S L Q       P+        + +E    P               
Sbjct: 37   PPKSSSSSSSASLFSSLPQPKQTLTSPTAGPDAKAVRSDEGSSRPH-------------- 82

Query: 962  XXXXXXXXXXXXXXXXXXXXXXXSDISETPQILPNXXXXXXXXXXXSKRVVQFRPPPIMN 783
                                         P+  P            +KR+VQFRPP + +
Sbjct: 83   ---------APVASGSSSSSSRLFSSLPQPKQRPPSELPPAGANAGAKRIVQFRPPVLPS 133

Query: 782  PISTNAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXS----IPAPKNSTTLGALPSA 615
              + +A                                 S    IPAP+NS+TLGALP+A
Sbjct: 134  LANPSAIDDDEEEDDEGEKEKERKRRRESESAAQTSSVTSFLSSIPAPRNSSTLGALPTA 193

Query: 614  SGTGRRSMLETDASASNLSNVGTMGSDAAVNSSIGYSEDQSGDGSSL-SYDHS-NWNSGS 441
             G+GRRS++ETD  A     VG+ G    + SS G ++   G+ S+  +YD   + N G+
Sbjct: 194  -GSGRRSVIETDTPA-----VGSTG----LESSNGGNDQNVGNNSTYGTYDSGIDQNGGA 243

Query: 440  ----ESYDGYGAGYDDNVGTGEG------VNYSNWKGQTXXXXXXXXXXXXXXXYEGNWI 291
                E Y  Y  GYD N    +       VNY N+                      N  
Sbjct: 244  YEYHEVYGSYEGGYDQNASGSDSSYYGGYVNYGNY---------------GEYGNYANHS 288

Query: 290  DESGATAVAEASRLAESGLQVPGKRGRNDTPEQIVEVKQDELLKNRPREDQVKLTGIAFG 111
            D + A +      +++ G  V GKRGRN+ P +IVEVKQDEL+KNRPR+DQ KLTGIAFG
Sbjct: 289  DYATAASTGVVQGMSDRGATVSGKRGRNEVPAEIVEVKQDELMKNRPRQDQAKLTGIAFG 348

Query: 110  PAYKPASTKGKPTKLHKRKHQIGSLLYDLRQKEMEL 3
            P+Y+PASTKGKPTKLHKRKHQIGSL +D+RQKEMEL
Sbjct: 349  PSYQPASTKGKPTKLHKRKHQIGSLYFDMRQKEMEL 384


>ref|XP_003542837.1| PREDICTED: proline-rich protein PRCC-like [Glycine max]
          Length = 370

 Score =  186 bits (472), Expect = 4e-44
 Identities = 159/464 (34%), Positives = 198/464 (42%), Gaps = 24/464 (5%)
 Frame = -3

Query: 1322 MDSLLANYASSDXXXXXXXXXXXEPKQLEESLNPEKFKVEDEGENDFLSKPSRKILDSLP 1143
            MDSLLANYASSD                EE   P              S P      SLP
Sbjct: 1    MDSLLANYASSDE---------------EEDQQP--------------SPPKTTTFSSLP 31

Query: 1142 PPKSSLFNSLPPPKSHLAQTPFDFKEPSMPKP-QPQFKREELEEIPENRNPKXXXXXXXX 966
             PK SLF SLP PKS L Q        S+P P QP  +   L     N +PK        
Sbjct: 32   QPKLSLFQSLPQPKSSLFQ--------SLPPPKQPSTESSSLPNPNPNPDPK-------- 75

Query: 965  XXXXXXXXXXXXXXXXXXXXXXXXSDISETPQILPNXXXXXXXXXXXSKRVVQFRPP--P 792
                                          PQI               KRVVQFRPP  P
Sbjct: 76   ------------------------------PQI----------EKTQPKRVVQFRPPIIP 95

Query: 791  IMNPIS-TNAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSIPAPKNSTTLGALPSA 615
            + +P    +                                  SIPAP+N+ TLG + ++
Sbjct: 96   LPHPSQHDDDDDDDDDDEEEERNRRKKKLESSTQTSSVKSFLASIPAPRNTATLG-VQAS 154

Query: 614  SGTGRRSMLETDAS--ASNLSNVGTMGSDAAVNSSIGYSEDQSGDGSSLSYDHSNWNSGS 441
            SG+GR+S+LET+    ASN      +  D +      + + Q       SY + N+ SG+
Sbjct: 155  SGSGRKSILETETPPPASNSGGFSNVPVDQSTGDYENFDDYQYATDQYASY-YGNFGSGA 213

Query: 440  E------------------SYDGYGAGYDDNVGTGEGVNYSNWKGQTXXXXXXXXXXXXX 315
            E                   Y  YG  Y      G+   Y N                  
Sbjct: 214  EPGSSGTEPKAGVAAYGTEQYGNYGDAY---ASYGDYGQYGN------------------ 252

Query: 314  XXYEGNWIDESGATAVAEASRLAESGLQVPGKRGRNDTPEQIVEVKQDELLKNRPREDQV 135
                 NW D S A  V EAS +  S +++PGKRGR++ P +++EVKQ+EL+KNRPREDQV
Sbjct: 253  -----NWGDVS-APPVLEASGIDVSVVRIPGKRGRHEIPTEVIEVKQEELIKNRPREDQV 306

Query: 134  KLTGIAFGPAYKPASTKGKPTKLHKRKHQIGSLLYDLRQKEMEL 3
            KLTGIAFGP Y+PASTKGKPTKLHKRKHQIGSL +D++Q EM+L
Sbjct: 307  KLTGIAFGPTYQPASTKGKPTKLHKRKHQIGSLYFDMKQNEMKL 350

