BLASTX nr result

ID: Paeonia25_contig00005977 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Paeonia25_contig00005977
         (1835 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003634177.1| PREDICTED: uncharacterized protein LOC100853...   140   3e-30
ref|XP_002300521.2| hypothetical protein POPTR_0001s45660g [Popu...   134   1e-28
ref|XP_004309511.1| PREDICTED: uncharacterized protein LOC101295...   120   2e-24
ref|XP_007210890.1| hypothetical protein PRUPE_ppa001807mg [Prun...   119   4e-24
ref|XP_007039310.1| Uncharacterized protein isoform 5 [Theobroma...   118   1e-23
ref|XP_007039306.1| Uncharacterized protein isoform 1 [Theobroma...   118   1e-23
gb|EXC02134.1| hypothetical protein L484_024100 [Morus notabilis]     112   7e-22
ref|XP_002518949.1| conserved hypothetical protein [Ricinus comm...   105   5e-20
ref|XP_006377881.1| hypothetical protein POPTR_0011s15260g, part...    91   2e-15
ref|XP_007148023.1| hypothetical protein PHAVU_006G174000g [Phas...    78   2e-11
ref|XP_007148022.1| hypothetical protein PHAVU_006G174000g [Phas...    78   2e-11
ref|XP_007148021.1| hypothetical protein PHAVU_006G174000g [Phas...    78   2e-11
ref|XP_006594542.1| PREDICTED: uncharacterized protein LOC100804...    74   2e-10
ref|XP_006594540.1| PREDICTED: uncharacterized protein LOC100804...    74   2e-10
ref|XP_002893751.1| predicted protein [Arabidopsis lyrata subsp....    68   1e-08
ref|XP_002321950.2| hypothetical protein POPTR_0015s00600g [Popu...    67   3e-08
ref|XP_007220585.1| hypothetical protein PRUPE_ppa000352mg [Prun...    67   3e-08
ref|XP_004972797.1| PREDICTED: uncharacterized protein LOC101763...    66   5e-08
gb|AAF31283.1|AC006424_12 CDS [Arabidopsis thaliana]                   65   8e-08
ref|NP_001117402.1| uncharacterized protein [Arabidopsis thalian...    65   8e-08

>ref|XP_003634177.1| PREDICTED: uncharacterized protein LOC100853355 [Vitis vinifera]
            gi|302143995|emb|CBI23100.3| unnamed protein product
            [Vitis vinifera]
          Length = 1167

 Score =  140 bits (352), Expect = 3e-30
 Identities = 174/662 (26%), Positives = 270/662 (40%), Gaps = 120/662 (18%)
 Frame = +1

Query: 193  HLSPEFHCKTPLSVHNQSDFTALSTST------DIPLTDYRRDSSGL------------L 318
            +++P     +PL V N+ ++  LSTS          L DY +  SGL            L
Sbjct: 157  YVAPAIEDNSPLVVLNEPNYDLLSTSHAAHLNGSSSLDDYTQSMSGLEYPSRWCGFWNGL 216

Query: 319  KGLSYGDD---RRSFSCKDNNGSVSLPNESLLKQGVPAAEGSQAFLNSASLCTNGSA-VL 486
              +  G       S   K++N   S    S + QG P AEG       + L       +L
Sbjct: 217  ADIEQGKKVELDESLCSKESNFVGSSIYRSYINQGDPTAEGVSNSEEGSVLSDRKYVDIL 276

Query: 487  GRDHQIGSRGMEQPGADSSSSPVEISNVATLKRPST--LCSTAILQDVP--KLPYLAPVV 654
            GRD+ +GS   +     S   P     V +L  P T  L ST++L + P  + P L PV 
Sbjct: 277  GRDNCVGSLSPDHFNNKSFYEPKANPMVVSLDFPRTSFLGSTSVLPETPHPRAPSLEPVT 336

Query: 655  T------PQ----------VNGSIGGVMAFPVSSP--VLSEDVNFSDGFAVN-------- 756
                   PQ          ++  +   ++   SSP  V+    N      VN        
Sbjct: 337  NSWNYRKPQSALYEKCFRKIDSCVDDPVSKAKSSPAIVIRPPANSPSSLGVNSFSSRNMI 396

Query: 757  ---NNDNSFAYTTFCLKVPDFVWNSEGKEFNQDGSLIDTEKEIKDHLFVESSSTKEAQVS 927
               N++N   +    ++ P     SEG+E   D S ++   +  DHL +ESSSTK+ ++ 
Sbjct: 397  CTDNSENVSGHHLSNMEEPHIPVISEGRELYSDTSQLNGHWQRNDHLSMESSSTKKHELL 456

Query: 928  NKK------DPLFIKESELFVSH--PQNHLTEEPHCPERCVSIESSSEALDN-NSEVDSP 1080
            N +      D L    SEL + H   ++  +  P+  E   SI+++SE LD+ N  VDSP
Sbjct: 457  NNEMGVKETDNLLRARSELQIPHLNVEDGFSFSPNSIEAVNSIDNTSETLDHYNPAVDSP 516

Query: 1081 CWKGT-QSRCSPFECSRPVNSELLKHEVEAGKSLNPLAPQFFPQ-----------KVKES 1224
            CWKG+  S  SPFE S  ++   L  ++EA    N      FP            K  E+
Sbjct: 517  CWKGSITSHFSPFEVSEALSPHNLMEQLEALDGFNLQGHHIFPLNSDDAVNVSSLKPNEN 576

Query: 1225 SDYHGIECCQKSI------------------SLDAAEGGPCPSKVITEVGALCLDEVYAS 1350
            ++YH   C +  +                  SLDA + GP   K+ +  G    +++   
Sbjct: 577  TEYHKNVCGENGLLPSWKRPSVVNHPSREQRSLDAFKTGPYCQKLSSGDGNQSSNDIIQP 636

Query: 1351 KKEPALNNSKTSPEIISSQMAIPNVMEDYFRS-----------VTGDN------------ 1461
            K++ +L NS  S  +  S     +  E  F S           VTG+N            
Sbjct: 637  KRDHSLLNSSKSDNLELSHTMRQSFEEVKFTSERKLSSGVGVEVTGNNINDVSRDGSSHE 696

Query: 1462 TYGSVTGIKGAAPTGSFSGAVWDNYHPSST---IDIQVAVNALYKISELLVQNCSSDSNS 1632
            TY     I  +  +G  +         S +   ID+ + +N +  +S LL+ +CS ++ S
Sbjct: 697  TYHLTENISCSPLSGDDASTKLTKQPASESTPKIDVHMLINTVQDLSVLLLSHCSDNAFS 756

Query: 1633 LKEQDQNILQHVINNLYFCMRKSGGQRSSSIETTPYAGTSNFEGYDVMYYVQKTQSESVP 1812
            LKEQD   L+ VI+N   C+ K G + +         G+S+F G   +  + K+ S S P
Sbjct: 757  LKEQDHETLKRVIDNFDACLTKKGQKIAEQ-------GSSHFLG--ELPDLNKSASASWP 807

Query: 1813 HG 1818
             G
Sbjct: 808  LG 809


>ref|XP_002300521.2| hypothetical protein POPTR_0001s45660g [Populus trichocarpa]
            gi|550349961|gb|EEE85326.2| hypothetical protein
            POPTR_0001s45660g [Populus trichocarpa]
          Length = 911

 Score =  134 bits (338), Expect = 1e-28
 Identities = 175/657 (26%), Positives = 267/657 (40%), Gaps = 154/657 (23%)
 Frame = +1

Query: 226  LSVHNQSDFTALSTSTDIPLTDYRRDSSGLLKGLSYGDDRRSFSCKDNNGSVSLPNESLL 405
            L   ++SDF A+  +    L  Y    SG L+G+ +         KD +G  S+ N+   
Sbjct: 122  LFTESKSDFDAVPVTKSTEL-GYEAHKSGDLRGILHW--------KDKHGGFSMFNDDST 172

Query: 406  KQGV-------PAAEGSQAFL----NSASLCTNGSAVLGRDHQIGSRGMEQPGADSSSSP 552
            KQ +       PA   ++        S SLC   S +  +DH++  +   +   DS   P
Sbjct: 173  KQALVHMLFIFPAGSPAEGLKLSPETSDSLCGKLSGISLKDHEVRPKRTRE--IDSQCVP 230

Query: 553  VEISNVATLKRPSTLCSTAILQDVPK-LPYLAPVVT------------------------ 657
            + +    T    S L S+AILQD    + YL P V+                        
Sbjct: 231  ISLKFSTT----SDLNSSAILQDPQSGINYLPPSVSWSSCDTNIAYFGRSLSQQLDFHAA 286

Query: 658  PQVNGSIGGVMAFPV--SSP------------VLSEDVNFSDGFAVNNNDNSFAYTTFCL 795
             Q       + + PV  S P            VLSE+++ SDG    + +N   Y    L
Sbjct: 287  KQNVPPSSDINSLPVLVSEPSVASTGYLPFNHVLSENLD-SDGDGGVSKNNFLGYGQASL 345

Query: 796  KVPDFVWNSEGKEFNQDGSLIDTEKEIK-----DHLFVES--SSTKEAQVSNKKDPLFI- 951
            K P  V + + KE   +  L D  KE K      H  +E    +  E Q++    P+ + 
Sbjct: 346  KKPHAVVD-KSKEVFHNKVLTDKGKEGKMGKPVTHKVMEPVPMAKSELQITCPSPPIDLT 404

Query: 952  ----KESELF-------------VSHPQNHLTEEP----------HCPERCVS------- 1029
                K  E+F             +  P  H   EP           CP   +        
Sbjct: 405  LEVDKSKEVFHHKVLADKGKEGKLGKPVTHEVMEPVPMAKSELQITCPSLLIDLTLESLG 464

Query: 1030 ------IESSSEAL-DNNSEVDSPCWKGT----QSRCSPFECSRPVNSELLKHEVEAGKS 1176
                  IE+SS+ + +N+S++DSPCWKG     QS C   E S P N + LK E EA   
Sbjct: 465  IKESDPIENSSKIINENDSDLDSPCWKGKLAAEQSSC---EVSVPDNFQHLKSEQEACSY 521

Query: 1177 LNPLAPQFFPQKVKESSDYHGIE-------CCQKSIS------------LDAAEGGPCPS 1299
            LNPLAP FFP   K+  +Y G E         QK+ S              +A  G   S
Sbjct: 522  LNPLAPHFFPSSDKQKVNYCGNEGDGNDCFSFQKTASSVVNLVSREQRLQHSATAGSSSS 581

Query: 1300 KVITEVGALCLDEVYASKKE-PALNNSKTSPEIISSQMAIPNVMEDYFRS----VTGDNT 1464
            +  +   A C  +++   KE   L +S +S    SS + +P+V+EDYF S    +TG   
Sbjct: 582  EQSSITEAHCYSDMHVPNKEYELLTDSSSSSMHGSSCVVLPSVLEDYFTSSGQLLTGQCV 641

Query: 1465 YGSVTGIKGAAPTGSFSGAVWDNYHPSST---------------------------IDIQ 1563
             G    IK  AP GS S +++ + H   +                           +D Q
Sbjct: 642  GGFGKAIKDTAPNGSTSVSLFASKHVFDSSSCREGVSTDLSETYGGATKPLCSPPRLDFQ 701

Query: 1564 VAVNALYKISELLVQNCSSDSNSLKEQDQNILQHVINNLYFCMRKSGGQRSSSIETT 1734
            + V  + ++SELL+QNC++D +SL E + +I++ +I+NL  C+R   G+ +   E++
Sbjct: 702  IVVKTMNELSELLMQNCTNDLDSLNEHEHDIIKRIIHNLTLCIRNRVGEHTLMSESS 758


>ref|XP_004309511.1| PREDICTED: uncharacterized protein LOC101295876 [Fragaria vesca
            subsp. vesca]
          Length = 674

 Score =  120 bits (301), Expect = 2e-24
 Identities = 140/547 (25%), Positives = 221/547 (40%), Gaps = 66/547 (12%)
 Frame = +1

Query: 316  LKGLSYGDDRRSFSCKDNNGSVSLPNESLLKQGVPAAEGSQAFLNSASLCTNGSAV-LGR 492
            L+G S+G    S +C          N    +QG P  +      NS S     S + +G+
Sbjct: 70   LQGSSFGRHEASLAC----------NNYAYEQGKPVKKSKLYDNNSGSARDKCSHLTMGK 119

Query: 493  DHQIGSRGMEQPGADSSSSPVEISNVATLKRP-STLCSTAILQDV--PKLPYLAPVV--- 654
            ++   SR   Q  A   S  V  S     + P S  CS ++LQ    P+LPY  PV    
Sbjct: 120  ENPFTSRSTNQVDAGIFSFSVVNSVATPFEFPMSVKCSASMLQSYSQPELPYTTPVAGWN 179

Query: 655  ---------------------------TPQVN-------GSIGGVMAFPVSSPVLSEDVN 732
                                       +P+ N       GS    + F  S  +L ++  
Sbjct: 180  QTNSTMTFGESGLTKSDPCTDNFTVSRSPRDNAFPDVESGSSDTCITFSPSKSILLKNAE 239

Query: 733  FSDGFAVNNNDNSFAYTTFCLKVPDFVWNSEGKEFNQDGSLIDTEKEIKDHLFVESSSTK 912
             + G AV + DNS  Y++  +     +   EGK+ + D S      E          +  
Sbjct: 240  VTGGSAVIHKDNSSKYSSHDIMDLHQLLYGEGKKNDHDKSSSYKGNE---------RTCV 290

Query: 913  EAQVSNKKDPLFIKESE--LFVSHPQNHLTEEPHCPERCVSIESSSEALDNNSEVDSPCW 1086
            EA  S   DPL   +S+  + +  P +  + E    E  +S+ +  +   N+S+VDSPCW
Sbjct: 291  EAVSSEGSDPLLTDKSDPQVTLKKPHDKSSLEHQDAEEAISLSTKLDG--NDSDVDSPCW 348

Query: 1087 KGT-QSRCSPFECSRPVNSELLKHEVEAGKSLNPLAPQFFPQKVK----------ESSDY 1233
            +G+  SR +P   SR ++S  +++  EA  SLNPLAP FFP+  K          ++ D+
Sbjct: 349  RGSLASRQTPLGVSRSLSSHSIENVQEASYSLNPLAPHFFPRPSKAIDNCYANEYDADDF 408

Query: 1234 HGI------------ECCQKSISLDAAEGGPCPSKVITEVGALCLDEVYASKKEPALNNS 1377
                              +++IS+D A  G   S  I  +G    + ++ SK+E AL N 
Sbjct: 409  SSFIKSDSGAVGAVSSFSKENISVDKA--GAKSSLSINGMGTQTSNNIHESKREYALLNK 466

Query: 1378 KTSPEIISSQMAIPNVMEDYFRSVTGDNTYGSVTGIKGAAPTGSFSGAVWDNYHPSSTID 1557
              S   +S                            KG +   S            S ID
Sbjct: 467  SGSDSALS----------------------------KGVSKLLS----------TDSKID 488

Query: 1558 IQVAVNALYKISELLVQNCSSDSNSLKEQDQNILQHVINNLYFCMRKSGGQRSSSIETTP 1737
            +   ++ ++ +S  LVQNCS+D   +   D +++QH+INNL  C++   G + S  + T 
Sbjct: 489  VSTVLDMMHDLSSFLVQNCSND---VLLDDHDLIQHIINNLRMCIQHRAGGKCSIPDFT- 544

Query: 1738 YAGTSNF 1758
             +GTSNF
Sbjct: 545  VSGTSNF 551


>ref|XP_007210890.1| hypothetical protein PRUPE_ppa001807mg [Prunus persica]
            gi|462406625|gb|EMJ12089.1| hypothetical protein
            PRUPE_ppa001807mg [Prunus persica]
          Length = 762

 Score =  119 bits (299), Expect = 4e-24
 Identities = 169/637 (26%), Positives = 270/637 (42%), Gaps = 88/637 (13%)
 Frame = +1

Query: 166  KDHPLQTIYHLSPEF----HCKTPLSVHNQSDFTALSTSTDIPLTDYRRDSSGLLKGLSY 333
            +D P  T  +   EF    H     S +  SD   + +++   LT+Y   SS       +
Sbjct: 59   EDDPFSTAPYSFLEFVEDSHFPQYPSANAASDLGFMPSASKESLTNYTELSS-------F 111

Query: 334  GDDRRSFSCKDNNGSVSLPNESLLKQG------------------------------VPA 423
            G  + SFS   +N + SL  E+LL+QG                               PA
Sbjct: 112  GHSQASFS---SNKNASLAYETLLEQGPLLSCMEGTLSMRVVRNESYFLWLTCYDVNTPA 168

Query: 424  AEGSQA-FLNSASLCTNGSAV-LGRDHQIGSRGMEQPGADSSSSPVEISNVATLKRP--- 588
             +GS+    NS S+    S + +G ++Q  SR  +Q  A   S     S V T+  P   
Sbjct: 169  VKGSKPNHENSESVHEKCSDLTIGTENQFISRSTDQVDAGFFS----FSAVNTMATPHEF 224

Query: 589  --STLCSTAILQDVPK--LPYLAPVVT------------------------------PQV 666
              S   ST+ LQD  +  LPY AP VT                              P  
Sbjct: 225  PMSVTSSTSRLQDYSQAQLPYTAPNVTWSHCNSEIALCDSGFTKLDALTAKSTVFHLPTN 284

Query: 667  NGSIGGVMAFPVSSPV------LSEDVNFSDGFAVNNNDNSFAYTTFCLKVPDFVWNSEG 828
            N     ++    S+ V      LS++V+F   +  NN D+S   +   +K    + +SEG
Sbjct: 285  NSFPAVLLESDTSTTVSPLNLALSKNVDFKGNYPPNNYDSSSKCSPSGIKDLHDLISSEG 344

Query: 829  KEFNQDGSLIDTEKEIKDHLFVESSSTKEAQVSNKKDPLFIKESELFVSHPQNHLTEEPH 1008
            KE + DGS  D  K  KD   + S     A +    +PL         + P +   + P 
Sbjct: 345  KEIHHDGSPNDKGKGGKDGKPLSSEGIG-ALLKATSEPLIT-----LTNIPDDFSLKHPG 398

Query: 1009 CPERCVSIESSSEALDNNSEVDSPCWKGTQSRCSPFECSRPVNSELLKHEVEAGKSLNPL 1188
             P+  VSI  + +  +N+S++DSPCWKGT +    +  SR ++S+ + +E E   SLNPL
Sbjct: 399  -PKGAVSISKNLD--ENDSDLDSPCWKGTLAS-RQYGVSRSLSSDFVGNEQEVRNSLNPL 454

Query: 1189 APQFFPQKVKESSDYHGIECCQKSISLDAAEGGPCPSKVITEVGALCLDEVYASKKEPAL 1368
            APQFFP+  K   DYH  +            G    S   +E  A+          + A 
Sbjct: 455  APQFFPRHAKAIVDYHANDYV----------GDDFSSFQKSESSAVNSSSKGHGPVDQAG 504

Query: 1369 NNSKTSPEIISSQMAIPNVMEDYFR--SVTGDNTYGSVTGIKGAAPTGSFSGAVWDNYHP 1542
            + S +S + I +Q +  N + D  R   +  ++  GSV  +    P G     +      
Sbjct: 505  SKSSSSIKGIGTQTS--NDIHDLERVYPLLNNSESGSVLNL----PEG-----LSKLLST 553

Query: 1543 SSTIDIQVAVNALYKISELLVQNCSSDSNSLKEQDQNILQHVINNLYFCMRKSGGQR--- 1713
             S +D+   +N ++ +SELLVQ CS+D +SL E  ++++Q++INNL   ++   G +   
Sbjct: 554  HSKLDVPTILNMMHDLSELLVQKCSNDLDSLNEH-KHVMQNIINNLCTYIQHGDGGKVPI 612

Query: 1714 -SSSIETTPYAGTSNFEGY---DVMYYVQKTQSESVP 1812
               ++  TPY    + E +   ++ + V K ++ +VP
Sbjct: 613  SDITLTGTPYCPVKSTELHKCSNMGFQVTKKKALAVP 649


>ref|XP_007039310.1| Uncharacterized protein isoform 5 [Theobroma cacao]
            gi|590674956|ref|XP_007039311.1| Uncharacterized protein
            isoform 5 [Theobroma cacao] gi|508776555|gb|EOY23811.1|
            Uncharacterized protein isoform 5 [Theobroma cacao]
            gi|508776556|gb|EOY23812.1| Uncharacterized protein
            isoform 5 [Theobroma cacao]
          Length = 835

 Score =  118 bits (295), Expect = 1e-23
 Identities = 157/639 (24%), Positives = 235/639 (36%), Gaps = 128/639 (20%)
 Frame = +1

Query: 202  PEFHCKTPLSVHNQSDFTALSTSTDIPLTDYRRDSSGLLKGLSYGDDRRSFSCKDNNGSV 381
            P  H    LS H  S    + +S+         +  GL     +        C     S 
Sbjct: 105  PPLHTHFTLSTHQSSQTNFIPSSSSFGNVG---NKGGLQGTAVHQQGTEILRCNRQVASA 161

Query: 382  -SLPNESLLKQGVPAAEGSQAFLNSASLCTNGSAVLGRDHQIGSRGMEQPGADSSSSPVE 558
             SL + + L+QG            S  L   GS V+G+D+QI     E+   +SS  P+ 
Sbjct: 162  GSLSSNNPLEQGTTLEGSKLVSETSFVLRGKGSVVIGKDNQIRPEDKEKIHTESSIFPLA 221

Query: 559  ISNVATLKRPSTLCSTAILQDVP----------KLPYLAPVVTPQVNGS----------- 675
             S V  L +  T    +I  D+P          +L Y A  +   + GS           
Sbjct: 222  NSEVNLLMKCVTK-PFSISSDLPFPPRPQDTQSQLLYSAESIACSLFGSTIFPYESCFPH 280

Query: 676  IGGVMAF--------------------------PVSSPVLSEDVNFSDGFAVNNNDNSFA 777
            +G   A                           P+ +PV   +V      AV++ D+ F 
Sbjct: 281  LGSCHAETLVSHAPECFSYSAQICKPSSAGSNPPIVNPVPLVNVASGGSDAVSSRDSYFD 340

Query: 778  YTTFCLKVPDFVWNSEGKEFNQDGSLIDTEKEIKDHLFVESSSTKEAQVSNKKDPLFIKE 957
            Y    +     V N   K    D  +I+           E     E       +P    +
Sbjct: 341  YVLPGMMDTSTVHNPVDKVACHDQVIIEKG---------EKGKIVEPFHDETNNPSIRAK 391

Query: 958  SELFVSHPQ--NHLTEEPHCPERCVSIESSSEALDNNSEVDSPCWKGTQSRCSP------ 1113
            S+L ++ P     LT E H  +  +  + SS +   +S+VDSPCWKGTQ+  SP      
Sbjct: 392  SKLRIACPNVPQDLTLEQHGAKPGIPDDKSSTS-HGDSDVDSPCWKGTQANKSPLSDSVP 450

Query: 1114 -----------FECSRPVNSELLKHEVEAGKSLNPLAPQFFPQKVKESSDYHGIE----- 1245
                       F  S P+ SE  K+E  A  SLNP AP F P   K   D+H  E     
Sbjct: 451  ANSEDSKGQSPFRVSMPLKSEHSKNEKVARSSLNPQAPVFIPGNSKPKVDHHQKEGHGDS 510

Query: 1246 --CCQKSISLDAAEGGP------------CPSKVITEVGALCLDEVYASKKEPALNNSKT 1383
                QKS +LD                  CPS+ I ++G     +V+ SKKE  +     
Sbjct: 511  SLSSQKSAALDVTSSSSEHRSTDSVNAVKCPSERIDDIGIQSSSDVHDSKKECGIPYKSF 570

Query: 1384 SPEIISSQMAI-PNVMEDYFRS----VTGDNTYGSVTGIKGAAPTGSFSGAVWDNYHPSS 1548
                ++S  +  P + E+Y  S    V G N  GS+ GI  AA  G  S     ++ PS+
Sbjct: 571  RSSAVNSSCSFQPYLREEYVTSASQLVRGTNVAGSMEGIADAAHNGLDSVEDIAHHGPST 630

Query: 1549 T-------------------------------------IDIQVAVNALYKISELLVQNCS 1617
            +                                     ID+++ +N +  +SELL+QN S
Sbjct: 631  SFSFLETETALNSHSTGVGVFSDFTERPQEPSKSTPPKIDVKLMINTMQYLSELLLQNSS 690

Query: 1618 SDSNSLKEQDQNILQHVINNLYFCMRKSGGQRSSSIETT 1734
             D  SL E + + L  ++NNLY  +R   G  +  +E++
Sbjct: 691  FDLGSLSEHEYDKLLTIMNNLYVLIRNKAGLMAVRLESS 729


>ref|XP_007039306.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|590674942|ref|XP_007039307.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
            gi|590674946|ref|XP_007039308.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
            gi|590674950|ref|XP_007039309.1| Uncharacterized protein
            isoform 1 [Theobroma cacao] gi|508776551|gb|EOY23807.1|
            Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508776552|gb|EOY23808.1| Uncharacterized protein
            isoform 1 [Theobroma cacao] gi|508776553|gb|EOY23809.1|
            Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508776554|gb|EOY23810.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 848

 Score =  118 bits (295), Expect = 1e-23
 Identities = 157/639 (24%), Positives = 235/639 (36%), Gaps = 128/639 (20%)
 Frame = +1

Query: 202  PEFHCKTPLSVHNQSDFTALSTSTDIPLTDYRRDSSGLLKGLSYGDDRRSFSCKDNNGSV 381
            P  H    LS H  S    + +S+         +  GL     +        C     S 
Sbjct: 105  PPLHTHFTLSTHQSSQTNFIPSSSSFGNVG---NKGGLQGTAVHQQGTEILRCNRQVASA 161

Query: 382  -SLPNESLLKQGVPAAEGSQAFLNSASLCTNGSAVLGRDHQIGSRGMEQPGADSSSSPVE 558
             SL + + L+QG            S  L   GS V+G+D+QI     E+   +SS  P+ 
Sbjct: 162  GSLSSNNPLEQGTTLEGSKLVSETSFVLRGKGSVVIGKDNQIRPEDKEKIHTESSIFPLA 221

Query: 559  ISNVATLKRPSTLCSTAILQDVP----------KLPYLAPVVTPQVNGS----------- 675
             S V  L +  T    +I  D+P          +L Y A  +   + GS           
Sbjct: 222  NSEVNLLMKCVTK-PFSISSDLPFPPRPQDTQSQLLYSAESIACSLFGSTIFPYESCFPH 280

Query: 676  IGGVMAF--------------------------PVSSPVLSEDVNFSDGFAVNNNDNSFA 777
            +G   A                           P+ +PV   +V      AV++ D+ F 
Sbjct: 281  LGSCHAETLVSHAPECFSYSAQICKPSSAGSNPPIVNPVPLVNVASGGSDAVSSRDSYFD 340

Query: 778  YTTFCLKVPDFVWNSEGKEFNQDGSLIDTEKEIKDHLFVESSSTKEAQVSNKKDPLFIKE 957
            Y    +     V N   K    D  +I+           E     E       +P    +
Sbjct: 341  YVLPGMMDTSTVHNPVDKVACHDQVIIEKG---------EKGKIVEPFHDETNNPSIRAK 391

Query: 958  SELFVSHPQ--NHLTEEPHCPERCVSIESSSEALDNNSEVDSPCWKGTQSRCSP------ 1113
            S+L ++ P     LT E H  +  +  + SS +   +S+VDSPCWKGTQ+  SP      
Sbjct: 392  SKLRIACPNVPQDLTLEQHGAKPGIPDDKSSTS-HGDSDVDSPCWKGTQANKSPLSDSVP 450

Query: 1114 -----------FECSRPVNSELLKHEVEAGKSLNPLAPQFFPQKVKESSDYHGIE----- 1245
                       F  S P+ SE  K+E  A  SLNP AP F P   K   D+H  E     
Sbjct: 451  ANSEDSKGQSPFRVSMPLKSEHSKNEKVARSSLNPQAPVFIPGNSKPKVDHHQKEGHGDS 510

Query: 1246 --CCQKSISLDAAEGGP------------CPSKVITEVGALCLDEVYASKKEPALNNSKT 1383
                QKS +LD                  CPS+ I ++G     +V+ SKKE  +     
Sbjct: 511  SLSSQKSAALDVTSSSSEHRSTDSVNAVKCPSERIDDIGIQSSSDVHDSKKECGIPYKSF 570

Query: 1384 SPEIISSQMAI-PNVMEDYFRS----VTGDNTYGSVTGIKGAAPTGSFSGAVWDNYHPSS 1548
                ++S  +  P + E+Y  S    V G N  GS+ GI  AA  G  S     ++ PS+
Sbjct: 571  RSSAVNSSCSFQPYLREEYVTSASQLVRGTNVAGSMEGIADAAHNGLDSVEDIAHHGPST 630

Query: 1549 T-------------------------------------IDIQVAVNALYKISELLVQNCS 1617
            +                                     ID+++ +N +  +SELL+QN S
Sbjct: 631  SFSFLETETALNSHSTGVGVFSDFTERPQEPSKSTPPKIDVKLMINTMQYLSELLLQNSS 690

Query: 1618 SDSNSLKEQDQNILQHVINNLYFCMRKSGGQRSSSIETT 1734
             D  SL E + + L  ++NNLY  +R   G  +  +E++
Sbjct: 691  FDLGSLSEHEYDKLLTIMNNLYVLIRNKAGLMAVRLESS 729


>gb|EXC02134.1| hypothetical protein L484_024100 [Morus notabilis]
          Length = 753

 Score =  112 bits (279), Expect = 7e-22
 Identities = 120/438 (27%), Positives = 191/438 (43%), Gaps = 54/438 (12%)
 Frame = +1

Query: 658  PQVNGSIGGVMAFPVSSPVLSEDVNFSDGFAVNNNDNSFAYTTFCLKVPDFVWNSEGKEF 837
            P +  S  G   F  SS +L ++V+       +NN +S        +    + N+   E 
Sbjct: 195  PMLGSSANGT-DFTTSSCILPKNVDLPGNSVASNNKSSSGRIISGNRDIHGLPNAYSNEG 253

Query: 838  NQDGSLIDTEKEIKDHLFVESSSTKEAQVSNKKDPLFIKESEL--FVSHPQNHLTEEPHC 1011
            +QD  L D   EIK+   V   +          DP+ I +SE+   ++   +    E   
Sbjct: 254  HQDKGLGDEGMEIKNAKSVPCKAL---------DPVVIAKSEVRFAINDIFDGSVMERVG 304

Query: 1012 PERCVSIESSSEALDNN-SEVDSPCWKGTQ-SRCSPFECSRPVNSELLKHEVEAGKSLNP 1185
                +S + SS+ LD + S++DSPCWKG Q S  SP   +   ++  +++E EAG SLNP
Sbjct: 305  TLAAISTKGSSKLLDEDESDLDSPCWKGIQNSTKSPNIVAESSSTHSIRNESEAGTSLNP 364

Query: 1186 LAPQFFPQKVKESSDY------HGI------ECCQKSIS------LDAAEGGPCPSKVIT 1311
             APQFFP   K S DY       G+      EC    +S      +D+ + G        
Sbjct: 365  RAPQFFPSHSKGSIDYLQNNTVGGVPYFGKGECSAFDLSYKETPIVDSYKAGLETRGSTN 424

Query: 1312 EVGALCLDEVYASKKEPA-LNNSKTSPEIISSQMAIPNVMEDYFR----SVTGDNTYGSV 1476
             VG    + V    KE A L +SK+S  +   QM  P +++ +F     SV G +  G  
Sbjct: 425  AVGYQYSNGVNEPGKESAMLKDSKSSSALSPPQMIKPYLVDGFFTSKEVSVKGVDFEGFA 484

Query: 1477 TGIKGAA---------------PTGSFSG------------AVWDNYHPSSTIDIQVAVN 1575
             GI  AA               P  S SG             + ++       ++ V VN
Sbjct: 485  DGIMDAANKNPRNLSALAAEYVPHLSSSGVGALSDCSELLQCLTESLSKCPKTNVAVTVN 544

Query: 1576 ALYKISELLVQNCSSDSNSLKEQDQNILQHVINNLYFCMRKSGGQRSSSIETTPYAGTSN 1755
            A+  +S+LLV+NCS+D +SL E +  +++H+INNLY  ++   G+ +  ++   + G+ +
Sbjct: 545  AIRCLSDLLVENCSNDLDSLNEHEHEMIRHIINNLYALIKHRVGEETPILDLL-HTGSLD 603

Query: 1756 FEGYDVMYYVQKTQSESV 1809
            +       Y Q      V
Sbjct: 604  YRDKSTATYEQSNMEFQV 621


>ref|XP_002518949.1| conserved hypothetical protein [Ricinus communis]
            gi|223541936|gb|EEF43482.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 605

 Score =  105 bits (263), Expect = 5e-20
 Identities = 87/303 (28%), Positives = 141/303 (46%), Gaps = 51/303 (16%)
 Frame = +1

Query: 979  PQNHLTEEPHCPERCVSIESSSEALDNNSEVDSPCWKGT-QSRCSPFECSRPVNSELLKH 1155
            PQ   +  PH  E  V ++ + E    NS++DSPCWKGT  +  S  E S PVN + L+ 
Sbjct: 254  PQVPYSSVPH--ELTVKLQGAEEG---NSDLDSPCWKGTLAANQSILEDSGPVNGQQLRS 308

Query: 1156 EVEAGKSLNPLAPQFFPQKVKESSDYHGIECCQKSISL-------------------DAA 1278
              E   SL+ LA + F    K++  Y   EC + S S                    ++ 
Sbjct: 309  GQEELNSLSLLASELFASSDKQNC-YRVNECDEDSSSFFHKTASSAVPLQPVEQRSANSV 367

Query: 1279 EGGPCPSKVITEVGALCLDEVYASKKEPALNNSKTSPEIISSQMAIPNVMEDYFRS---- 1446
              G   S++   + + C ++V    KE A+  +  +  ++ S +  P+ +ED+  S    
Sbjct: 368  TTGSAFSELTNVIWSCCTNDVCLPDKEDAILKNSNNSSMLKSCILEPSSVEDHCYSNSQL 427

Query: 1447 VTGDNTYGSVTGIKGAAPTGSFSGAVWDNYHPSST------------------------- 1551
            VTG N  G++ GI+ +   GS S   ++N +  S+                         
Sbjct: 428  VTGPNIAGTLRGIRESVQHGS-SRISFENKNVISSSSCRIHIPSDFTETCQGASRSFSCP 486

Query: 1552 --IDIQVAVNALYKISELLVQNCSSDSNSLKEQDQNILQHVINNLYFCMRKSGGQRSSSI 1725
              + IQ  VN + ++SELL+ NCS+D +SL E + +I++H+INNL  C+R   G+R+   
Sbjct: 487  PRLHIQKVVNTMNELSELLLHNCSNDLDSLNEHEHDIIEHIINNLTACIRNRNGRRTLMP 546

Query: 1726 ETT 1734
            E T
Sbjct: 547  EAT 549


>ref|XP_006377881.1| hypothetical protein POPTR_0011s15260g, partial [Populus trichocarpa]
            gi|550328449|gb|ERP55678.1| hypothetical protein
            POPTR_0011s15260g, partial [Populus trichocarpa]
          Length = 873

 Score = 90.9 bits (224), Expect = 2e-15
 Identities = 86/276 (31%), Positives = 132/276 (47%), Gaps = 35/276 (12%)
 Frame = +1

Query: 856  IDTEKEI-KDHLFVESSSTKEAQVSNKK--DPLFIKESELFVSHPQN--HLTEEPHCPER 1020
            +D  KE+  D +  + S  K ++ + ++  +PL +  SEL ++ P +   L  +    + 
Sbjct: 578  VDKRKEVFHDKVLTDKSKGKMSKPATQEVMEPLSMTVSELQITCPSHPIELASKSLGVKE 637

Query: 1021 CVSIESSSEAL-DNNSEVDSPCWKG----TQSRCSPFECSRPVNSELLKHEVEAGKSLNP 1185
               I +SSE + +N+S++DSPCWKG     QS C   E SRP + + LK    A  +LNP
Sbjct: 638  SDPIGNSSEIINENDSDLDSPCWKGKLSANQSTC---EVSRPDDFQHLKSARGACSNLNP 694

Query: 1186 LAPQFFPQKVKESSDYHGIEC-------CQK----SISLDAAE----------GGPCPSK 1302
            LAP F P   K+  +Y G EC        QK    ++SL + E                 
Sbjct: 695  LAPHFVPSCGKQKVNYRGTECEGDDSLTFQKTESSAVSLFSREHTLQKPGTAGSSSSDRS 754

Query: 1303 VITEVGALCLDEVYASKKEPALNNSKTSPEIISSQMAIPNVMEDYFRS----VTGDNTYG 1470
             ITE     +D    +K+   L NS TS  + SS +  P++ EDYF S    +TG    G
Sbjct: 755  SITETHC-SIDNHVRNKEYEPLTNSSTSSMLSSSCLVQPSIPEDYFISNGQLLTGKKVGG 813

Query: 1471 SVTGIKGAAPTGSFSGAVWDNYHPSSTIDIQVAVNA 1578
            S   IK A   GS S ++  + H +S+   +V V++
Sbjct: 814  SGKDIKDAVSNGSTSVSLLASEHVTSSSSCRVGVSS 849



 Score = 86.7 bits (213), Expect = 3e-14
 Identities = 86/272 (31%), Positives = 129/272 (47%), Gaps = 37/272 (13%)
 Frame = +1

Query: 856  IDTEKEI-KDHLFVESSSTKEAQVSNKK--DPLFIKESELFVSHPQN--HLTEEPHCPER 1020
            +D  KE+  D +  + S  K ++ + ++  +PL +  SEL ++ P +   L  +    + 
Sbjct: 181  VDKRKEVFHDEVLTDKSKVKMSKPATQEVMEPLSMTVSELQITCPSHPIELASKSLGVKE 240

Query: 1021 CVSIESSSEAL-DNNSEVDSPCWKG----TQSRCSPFECSRPVNSELLKHEVEAGKSLNP 1185
               I +SSE + +N+S++DSPCWKG     QS C   E SRP + + LK    A  +LNP
Sbjct: 241  SDPIGNSSEIINENDSDLDSPCWKGKLSANQSTC---EVSRPDDFQHLKSARGACSNLNP 297

Query: 1186 LAPQFFPQKVKESSDYHGIEC-------CQK----SISLDAAE----------GGPCPSK 1302
            LAP F P   ++  +Y G EC        QK    ++SL + E                 
Sbjct: 298  LAPHFVPSCGQQKVNYRGTECEGDDSLTFQKTESSAVSLFSREHTLQKPGTAGSSSSDRS 357

Query: 1303 VITEVGALCLDEVYASKKEPALNNSKTSPEIISSQMAIPNVMEDYFRS----VTGDNTYG 1470
             ITE      + V   + EP L NS TS  + SS +  P+++EDYF S    +T     G
Sbjct: 358  SITETHCSIDNHVRNEEYEP-LTNSSTSSMLSSSCVVQPSILEDYFTSNGQLLTRQKVGG 416

Query: 1471 SVTGIKGAAPTGSFSGAVWDNYH--PSSTIDI 1560
            S   I+ A P GS S ++  + H  P ST  I
Sbjct: 417  SGKVIEDAVPNGSTSVSLLASKHVRPISTRQI 448


>ref|XP_007148023.1| hypothetical protein PHAVU_006G174000g [Phaseolus vulgaris]
            gi|561021246|gb|ESW20017.1| hypothetical protein
            PHAVU_006G174000g [Phaseolus vulgaris]
          Length = 572

 Score = 77.8 bits (190), Expect = 2e-11
 Identities = 77/294 (26%), Positives = 120/294 (40%), Gaps = 41/294 (13%)
 Frame = +1

Query: 979  PQNHLTEEPHCPERCVSIESSSEALDNNSEVDSPCWKGTQSRC-SPFECSRPVNSELLKH 1155
            P   LT +    +     +SS   ++N+S+VDSPCWKGT++ C +  E S  V    ++ 
Sbjct: 148  PVKSLTTDMSSAKNTYLDQSSKTLVENDSDVDSPCWKGTRAFCQTSIENSGSVQINNVEK 207

Query: 1156 EVEAGKSLNPLAPQFFPQKVKESSD--------------YHGIECCQKSISLDA------ 1275
              E   SLNPLAPQFFP+      D              + G     K++  ++      
Sbjct: 208  ATEKHNSLNPLAPQFFPRIAYVKDDFGSSNSSSPVATNFFSGEHMLMKTVMAESPVELNM 267

Query: 1276 -AEGGPCPSKVITEVGALCLDEVYASKKEPALN-----NSKTSPEIISSQMAIPNVMEDY 1437
              E  P  +    E     +++   S  +P LN        +S E  S     P  + D 
Sbjct: 268  GIELQPSSNTRGKEKAINMINDPKNSYVDPVLNLHCKVTKSSSKEDCSMSKGKPEAVVDA 327

Query: 1438 FRSVTGDNTYGSVTGIKGAAPTGSFSG------------AVWDNYHPSSTIDIQVAVNAL 1581
               V G     S      ++ + S SG             V  +   S   D+ + V+A+
Sbjct: 328  DNFVKGATKSSSPISTLASSSSSSSSGVAVVTDLMKTFEGVSKSLSKSPKPDVGMVVSAI 387

Query: 1582 YKISELLVQNCSS--DSNSLKEQDQNILQHVINNLYFCMRKSGGQRSSSIETTP 1737
            + +SELLVQ       SN+    D+ ++Q  INNL     K   QR  ++++TP
Sbjct: 388  HVLSELLVQTSMDGVGSNNEHGHDEIMIQQTINNLNDFRTKRCVQRIPTLKSTP 441


>ref|XP_007148022.1| hypothetical protein PHAVU_006G174000g [Phaseolus vulgaris]
            gi|561021245|gb|ESW20016.1| hypothetical protein
            PHAVU_006G174000g [Phaseolus vulgaris]
          Length = 571

 Score = 77.8 bits (190), Expect = 2e-11
 Identities = 77/294 (26%), Positives = 120/294 (40%), Gaps = 41/294 (13%)
 Frame = +1

Query: 979  PQNHLTEEPHCPERCVSIESSSEALDNNSEVDSPCWKGTQSRC-SPFECSRPVNSELLKH 1155
            P   LT +    +     +SS   ++N+S+VDSPCWKGT++ C +  E S  V    ++ 
Sbjct: 148  PVKSLTTDMSSAKNTYLDQSSKTLVENDSDVDSPCWKGTRAFCQTSIENSGSVQINNVEK 207

Query: 1156 EVEAGKSLNPLAPQFFPQKVKESSD--------------YHGIECCQKSISLDA------ 1275
              E   SLNPLAPQFFP+      D              + G     K++  ++      
Sbjct: 208  ATEKHNSLNPLAPQFFPRIAYVKDDFGSSNSSSPVATNFFSGEHMLMKTVMAESPVELNM 267

Query: 1276 -AEGGPCPSKVITEVGALCLDEVYASKKEPALN-----NSKTSPEIISSQMAIPNVMEDY 1437
              E  P  +    E     +++   S  +P LN        +S E  S     P  + D 
Sbjct: 268  GIELQPSSNTRGKEKAINMINDPKNSYVDPVLNLHCKVTKSSSKEDCSMSKGKPEAVVDA 327

Query: 1438 FRSVTGDNTYGSVTGIKGAAPTGSFSG------------AVWDNYHPSSTIDIQVAVNAL 1581
               V G     S      ++ + S SG             V  +   S   D+ + V+A+
Sbjct: 328  DNFVKGATKSSSPISTLASSSSSSSSGVAVVTDLMKTFEGVSKSLSKSPKPDVGMVVSAI 387

Query: 1582 YKISELLVQNCSS--DSNSLKEQDQNILQHVINNLYFCMRKSGGQRSSSIETTP 1737
            + +SELLVQ       SN+    D+ ++Q  INNL     K   QR  ++++TP
Sbjct: 388  HVLSELLVQTSMDGVGSNNEHGHDEIMIQQTINNLNDFRTKRCVQRIPTLKSTP 441


>ref|XP_007148021.1| hypothetical protein PHAVU_006G174000g [Phaseolus vulgaris]
            gi|561021244|gb|ESW20015.1| hypothetical protein
            PHAVU_006G174000g [Phaseolus vulgaris]
          Length = 460

 Score = 77.8 bits (190), Expect = 2e-11
 Identities = 77/294 (26%), Positives = 120/294 (40%), Gaps = 41/294 (13%)
 Frame = +1

Query: 979  PQNHLTEEPHCPERCVSIESSSEALDNNSEVDSPCWKGTQSRC-SPFECSRPVNSELLKH 1155
            P   LT +    +     +SS   ++N+S+VDSPCWKGT++ C +  E S  V    ++ 
Sbjct: 148  PVKSLTTDMSSAKNTYLDQSSKTLVENDSDVDSPCWKGTRAFCQTSIENSGSVQINNVEK 207

Query: 1156 EVEAGKSLNPLAPQFFPQKVKESSD--------------YHGIECCQKSISLDA------ 1275
              E   SLNPLAPQFFP+      D              + G     K++  ++      
Sbjct: 208  ATEKHNSLNPLAPQFFPRIAYVKDDFGSSNSSSPVATNFFSGEHMLMKTVMAESPVELNM 267

Query: 1276 -AEGGPCPSKVITEVGALCLDEVYASKKEPALN-----NSKTSPEIISSQMAIPNVMEDY 1437
              E  P  +    E     +++   S  +P LN        +S E  S     P  + D 
Sbjct: 268  GIELQPSSNTRGKEKAINMINDPKNSYVDPVLNLHCKVTKSSSKEDCSMSKGKPEAVVDA 327

Query: 1438 FRSVTGDNTYGSVTGIKGAAPTGSFSG------------AVWDNYHPSSTIDIQVAVNAL 1581
               V G     S      ++ + S SG             V  +   S   D+ + V+A+
Sbjct: 328  DNFVKGATKSSSPISTLASSSSSSSSGVAVVTDLMKTFEGVSKSLSKSPKPDVGMVVSAI 387

Query: 1582 YKISELLVQNCSS--DSNSLKEQDQNILQHVINNLYFCMRKSGGQRSSSIETTP 1737
            + +SELLVQ       SN+    D+ ++Q  INNL     K   QR  ++++TP
Sbjct: 388  HVLSELLVQTSMDGVGSNNEHGHDEIMIQQTINNLNDFRTKRCVQRIPTLKSTP 441


>ref|XP_006594542.1| PREDICTED: uncharacterized protein LOC100804726 isoform X3 [Glycine
            max]
          Length = 568

 Score = 73.9 bits (180), Expect = 2e-10
 Identities = 87/325 (26%), Positives = 133/325 (40%), Gaps = 47/325 (14%)
 Frame = +1

Query: 904  STKEAQVSNKKDPLFIKESELFVSHPQNHLTEEPHCPERCVSIESSSEALDNNSEVDSPC 1083
            S  +  + +  + + +  S   VS  +  +TE P       + +SS    +++S+VDSPC
Sbjct: 113  SNSKGLMHSANESISVPLSNFKVSPLKLPITELPSAKNTSQN-QSSKNLGESDSDVDSPC 171

Query: 1084 WKGTQSRC-SPFECSRPVNSELLKHEVEAGKSLNPLAPQFFPQKVKESSDY-HGIECCQK 1257
            WKGT + C +P E S  +    ++   E   SLNPLAPQFFP       D+   I C   
Sbjct: 172  WKGTMAFCLTPIENSGSIQISNVEKATEKHNSLNPLAPQFFPGIGYVKDDFGSSISCTPV 231

Query: 1258 SISLDA----------AEGGPCPSKVI----------TEVGALCLDEVYASKKEPALN-- 1371
            + +L +          AE    P K I           E      +    S  +P LN  
Sbjct: 232  ATNLLSGEDMLIKTVMAESPVEPRKGIELQSSSNTCGREKAFNMFNNPKNSSVDPVLNLH 291

Query: 1372 ----NSKTSPEIISSQMAIPNVME-DYFRSVTGDNTYGSVTGIKGAAP---TGSFSGAVW 1527
                 S +  +   S   +  V++ D F   T D+   +    KG  P     + S  V 
Sbjct: 292  CMVTQSSSKEDCSISNGKLETVVDVDNFVKGTKDSRVCNAFPAKGHFPFPTQAALSSGVN 351

Query: 1528 DNYHPSSTI-------------DIQVAVNALYKISELLVQNCSS--DSNSLKEQDQNILQ 1662
                P  T              D+   V+A++ +SELLVQ      DSNS    D+ ++Q
Sbjct: 352  AVPDPLKTFEGLSKTLIKSPKPDVGTIVSAIHVLSELLVQTSMDGVDSNSEHGHDEIMIQ 411

Query: 1663 HVINNLYFCMRKSGGQRSSSIETTP 1737
             +INNL     K  G R  ++++TP
Sbjct: 412  QIINNLNDFSTKRCGLRIPTLDSTP 436


>ref|XP_006594540.1| PREDICTED: uncharacterized protein LOC100804726 isoform X1 [Glycine
            max] gi|571499822|ref|XP_006594541.1| PREDICTED:
            uncharacterized protein LOC100804726 isoform X2 [Glycine
            max]
          Length = 570

 Score = 73.9 bits (180), Expect = 2e-10
 Identities = 87/325 (26%), Positives = 133/325 (40%), Gaps = 47/325 (14%)
 Frame = +1

Query: 904  STKEAQVSNKKDPLFIKESELFVSHPQNHLTEEPHCPERCVSIESSSEALDNNSEVDSPC 1083
            S  +  + +  + + +  S   VS  +  +TE P       + +SS    +++S+VDSPC
Sbjct: 113  SNSKGLMHSANESISVPLSNFKVSPLKLPITELPSAKNTSQN-QSSKNLGESDSDVDSPC 171

Query: 1084 WKGTQSRC-SPFECSRPVNSELLKHEVEAGKSLNPLAPQFFPQKVKESSDY-HGIECCQK 1257
            WKGT + C +P E S  +    ++   E   SLNPLAPQFFP       D+   I C   
Sbjct: 172  WKGTMAFCLTPIENSGSIQISNVEKATEKHNSLNPLAPQFFPGIGYVKDDFGSSISCTPV 231

Query: 1258 SISLDA----------AEGGPCPSKVI----------TEVGALCLDEVYASKKEPALN-- 1371
            + +L +          AE    P K I           E      +    S  +P LN  
Sbjct: 232  ATNLLSGEDMLIKTVMAESPVEPRKGIELQSSSNTCGREKAFNMFNNPKNSSVDPVLNLH 291

Query: 1372 ----NSKTSPEIISSQMAIPNVME-DYFRSVTGDNTYGSVTGIKGAAP---TGSFSGAVW 1527
                 S +  +   S   +  V++ D F   T D+   +    KG  P     + S  V 
Sbjct: 292  CMVTQSSSKEDCSISNGKLETVVDVDNFVKGTKDSRVCNAFPAKGHFPFPTQAALSSGVN 351

Query: 1528 DNYHPSSTI-------------DIQVAVNALYKISELLVQNCSS--DSNSLKEQDQNILQ 1662
                P  T              D+   V+A++ +SELLVQ      DSNS    D+ ++Q
Sbjct: 352  AVPDPLKTFEGLSKTLIKSPKPDVGTIVSAIHVLSELLVQTSMDGVDSNSEHGHDEIMIQ 411

Query: 1663 HVINNLYFCMRKSGGQRSSSIETTP 1737
             +INNL     K  G R  ++++TP
Sbjct: 412  QIINNLNDFSTKRCGLRIPTLDSTP 436


>ref|XP_002893751.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
            gi|297339593|gb|EFH70010.1| predicted protein
            [Arabidopsis lyrata subsp. lyrata]
          Length = 606

 Score = 68.2 bits (165), Expect = 1e-08
 Identities = 48/180 (26%), Positives = 84/180 (46%), Gaps = 3/180 (1%)
 Frame = +1

Query: 736  SDGFAVNNNDNSFAYTTFCLKVPDFVWNSEGKEFNQDGSLIDTEKEI---KDHLFVESSS 906
            S  F++ +      +  + L +     + +    + D SL +  +++   K+ L +E   
Sbjct: 29   SPSFSLKSEHEDSIWGDYTLDLSFLSSDDQRSGLDDDDSLSNLSRDVETKKEGLVLEEKI 88

Query: 907  TKEAQVSNKKDPLFIKESELFVSHPQNHLTEEPHCPERCVSIESSSEALDNNSEVDSPCW 1086
                +V    +P+F K  E+ +  P N +  +      CVS +SS+E+ +++SE DSPCW
Sbjct: 89   ASSGKVLVNPNPIFSKLPEVLIK-PSN-VAGDAKLGLSCVSEKSSTESDEDDSEEDSPCW 146

Query: 1087 KGTQSRCSPFECSRPVNSELLKHEVEAGKSLNPLAPQFFPQKVKESSDYHGIECCQKSIS 1266
             G  S  S    ++ V S     ++     LNPLAPQF P   K+  +  G +C + S S
Sbjct: 147  IGMHSHKSLASGAKAVASRRSTDDLSGFHRLNPLAPQFIPSNSKKKVETDGEKCEENSSS 206


>ref|XP_002321950.2| hypothetical protein POPTR_0015s00600g [Populus trichocarpa]
            gi|550321678|gb|EEF06077.2| hypothetical protein
            POPTR_0015s00600g [Populus trichocarpa]
          Length = 1236

 Score = 67.0 bits (162), Expect = 3e-08
 Identities = 99/387 (25%), Positives = 163/387 (42%), Gaps = 43/387 (11%)
 Frame = +1

Query: 193  HLSPEFHCKTPLSVHNQSDFTALSTS--------------TDIPLTDYRRDSSGLLKGLS 330
            ++SP       L + NQS +  LSTS                + + ++    SGL +G++
Sbjct: 133  YVSPAIASDGSLKIPNQSGYELLSTSHVGTSNGSSRDDYSQSLVVLEHPAQWSGLWEGVT 192

Query: 331  YGDDRRSFSCKDNNGSVSLPNESLLKQGVPA------AEGSQAFLNSASLCTNG-SAVLG 489
              D  +S   + + G  +   E+ + QG  A       E +   +N     T+  SA  G
Sbjct: 193  --DWHQSKKMQLDGGFSA--KENFINQGFSAFKDISKCEETSLGINVVGRQTHTESASTG 248

Query: 490  R-DHQ--IGSRGMEQPGADSSSSPVEISNVATLKRPSTLCSTAILQDVPKLP--YLAPVV 654
            + D++  +G +    P   S+ SP+   +VA    P    S  +   + ++P   L    
Sbjct: 249  QMDYKAFLGEKPKFMPAGYSTPSPLVFPSVAPQAYPQVPSSNVVNSPINQMPDVILYGKS 308

Query: 655  TPQVNGSIGGVMAFPVSSPVLSEDVNFSDGFAVNN-----NDNSFAYTTFCLKVPDFVWN 819
            + + + S    M     SPV+       D ++  N     + +     +  ++ P+   +
Sbjct: 309  SRKRDASPNDSMPVTKPSPVVVVRSPGQDTYSFKNMNTGCDGDEKGNNSSSVQEPNPFIS 368

Query: 820  SEGKEFNQDGSLIDTEKEIKDHLFVESSSTKEAQVSNKKDPLFIKESELFVSHPQN---- 987
            SEGK F  D S I+   +  D    E SS      SNK   +   + +LF +   N    
Sbjct: 369  SEGKVF-YDSSQINFHLKQNDDYLAEISSKNNELPSNKNISVDFFD-QLFKAKMDNKVLR 426

Query: 988  ------HLTEEPHCPERCVSIESSSEALDN-NSEVDSPCWKGTQ-SRCSPFECSRPVNSE 1143
                  +L  + H  E   S+E++SE+LD+ N  VDSPCWKG   S  S FE S  V+  
Sbjct: 427  RNLDFFNLAMDGH--EAIGSVENTSESLDHYNPAVDSPCWKGAPVSHLSAFEISEVVD-P 483

Query: 1144 LLKHEVEAGKSLNPLAPQFFPQKVKES 1224
            L+  +VEA   L+P  PQ FP    ++
Sbjct: 484  LIPKKVEACNGLSPQGPQIFPSATNDA 510


>ref|XP_007220585.1| hypothetical protein PRUPE_ppa000352mg [Prunus persica]
            gi|462417047|gb|EMJ21784.1| hypothetical protein
            PRUPE_ppa000352mg [Prunus persica]
          Length = 1254

 Score = 67.0 bits (162), Expect = 3e-08
 Identities = 111/397 (27%), Positives = 150/397 (37%), Gaps = 61/397 (15%)
 Frame = +1

Query: 193  HLSPEFHCKTPLSVHNQSDFTALSTSTDIPLTD--------------YRRDSSGLLKGLS 330
            +LSP  H  +PL V +Q  +  LST+   PL                Y     GL  GLS
Sbjct: 143  YLSPTIHGDSPLVVPDQPSYDWLSTTHFAPLDGCSRKDYTQRPPDLKYTAQWGGLWNGLS 202

Query: 331  ------YGDDRRSFSCKDNNGSVSLPNESLLKQGVPAAEGSQAFLNSASLCTN------- 471
                   GD   SF  K  + S S   ++ + Q  P +  S      AS   N       
Sbjct: 203  EWEQGKQGDFDGSFCSKKTDVSGSFLYKNFMNQE-PHSSNSLNSFEEASHGINTLGWEKP 261

Query: 472  ---GSAVLGRDHQIGSRGMEQPGADSSSSPVEISNVAT--LKRPSTLCSTAILQDVPKLP 636
               G+A LG    +G      P   S S    +S V    LK PS+ C T       K P
Sbjct: 262  GGSGNAHLGDKSLVGKNSKFTPSDFSKSVMGSLSVVPEPHLKAPSSQCVTKTSNC--KTP 319

Query: 637  YLAPVVTPQVNGSIGGVMAFPVSSPV-----------LSED-------VNFSDGFAVNNN 762
            Y     T Q++ S+  + +   SSP            LSE        +NF    A  ++
Sbjct: 320  YSVSSETQQLDASLDYITSISESSPAFATRTPALGTKLSEPGTGLFRRLNFISDAADTDH 379

Query: 763  DNSFAYTTFCLKVPDFVWNSEGKEFNQDGSLIDTEKEIKDHLFVESSSTKEAQVSNKK-- 936
             + ++       +P     SEGK    D S +      KD    ESSS +  ++SN +  
Sbjct: 380  GDYYSSGVQESHLPQI---SEGKVLF-DSSQLGFHLGAKDCFSAESSSARNEELSNNRNI 435

Query: 937  ------DPLFIKESELFVSHPQ-NHLTEEPHCPERCVSIESSSEALD-NNSEVDSPCWKG 1092
                  D +F  +  L  SH   +         E   S  SSS+ +D NN  VDSPCWKG
Sbjct: 436  INKDAWDKVFKAKPGLQNSHVGLDGFKMAFKTNETINSFLSSSDNVDPNNPGVDSPCWKG 495

Query: 1093 TQSRC-SPFECSRPVNSELLKHEVEAGKSLNPLAPQF 1200
                C SPF  S     E +K ++E    LN   P F
Sbjct: 496  VPGSCFSPFGASEDGVPEQIK-KLEDCSGLNIHMPMF 531


>ref|XP_004972797.1| PREDICTED: uncharacterized protein LOC101763969 [Setaria italica]
          Length = 713

 Score = 66.2 bits (160), Expect = 5e-08
 Identities = 67/242 (27%), Positives = 104/242 (42%), Gaps = 8/242 (3%)
 Frame = +1

Query: 1000 EPH----CPERCVSIESSSEALDNNSEVDSPCWKGTQSRCSPFECSRPVNSELLKHEVEA 1167
            +PH    C   CV++        +   VDSPCW+GT SR SPF+  + + ++ +K E  A
Sbjct: 222  KPHGPSACSSPCVTVADDVNPDPSECSVDSPCWRGTASRLSPFDIHQTLVAQSVKQESVA 281

Query: 1168 GKSLNPLAPQFFPQKVKESSDYHGIECCQKS---ISLDAAEGGPCPSKVITEVGALCLDE 1338
              +          Q+   S DY      +KS    S    E G   SK   ++G   + +
Sbjct: 282  SDA---------GQEQSSSIDYLQNFVTRKSKQNHSQPHVESG--LSKAPGDIGTNLIQD 330

Query: 1339 VYASKKEPALNN-SKTSPEIISSQMAIPNVMEDYFRSVTGDNTYGSVTGIKGAAPTGSFS 1515
             +  + E   +  +K + E   S++    +      S   D    SV     +  TGS S
Sbjct: 331  SHGKELEFVKHGAAKCNSEKQCSEVIDDLIKRSGLNSAAPDFIPFSVRKSNTSNVTGSCS 390

Query: 1516 GAVWDNYHPSSTIDIQVAVNALYKISELLVQNCSSDSNSLKEQDQNILQHVINNLYFCMR 1695
                     SS ++I   + A+  +SE+L  N  SD   +KE D N+LQ VI NL  C+ 
Sbjct: 391  ---------SSGLNISGILKAIKSMSEVLCGN-YSDEIEMKEHDYNLLQSVIENLQSCLH 440

Query: 1696 KS 1701
            K+
Sbjct: 441  KA 442


>gb|AAF31283.1|AC006424_12 CDS [Arabidopsis thaliana]
          Length = 607

 Score = 65.5 bits (158), Expect = 8e-08
 Identities = 46/180 (25%), Positives = 82/180 (45%), Gaps = 3/180 (1%)
 Frame = +1

Query: 736  SDGFAVNNNDNSFAYTTFCLKVPDFVWNSEGKEFNQDGSLIDTEKEI---KDHLFVESSS 906
            S  F++ +      +  + L +     + +    + D SL +  +++   K+ L +E   
Sbjct: 29   SPAFSLKSEHEDSIWGDYTLDLSFLSSDDQRSGLDDDDSLSNLSRDVETNKEGLVLEEKI 88

Query: 907  TKEAQVSNKKDPLFIKESELFVSHPQNHLTEEPHCPERCVSIESSSEALDNNSEVDSPCW 1086
                +V    +P F K  E+ +    + +  +      CVS +SS+E+  + SE DSPCW
Sbjct: 89   ASSGKVLVNPNPSFSKLPEVLMK--SSDVVGDAKLGLSCVSEKSSTESDLDESEEDSPCW 146

Query: 1087 KGTQSRCSPFECSRPVNSELLKHEVEAGKSLNPLAPQFFPQKVKESSDYHGIECCQKSIS 1266
            KG  S  S    ++ + S     ++   + LNPLAPQF P   K+  +  G +C + S S
Sbjct: 147  KGMLSHKSLASGTKSMTSRRSTDDLSGFRKLNPLAPQFIPSSSKKKLETDGEKCEETSSS 206


>ref|NP_001117402.1| uncharacterized protein [Arabidopsis thaliana]
            gi|332193434|gb|AEE31555.1| uncharacterized protein
            AT1G33050 [Arabidopsis thaliana]
          Length = 644

 Score = 65.5 bits (158), Expect = 8e-08
 Identities = 46/180 (25%), Positives = 82/180 (45%), Gaps = 3/180 (1%)
 Frame = +1

Query: 736  SDGFAVNNNDNSFAYTTFCLKVPDFVWNSEGKEFNQDGSLIDTEKEI---KDHLFVESSS 906
            S  F++ +      +  + L +     + +    + D SL +  +++   K+ L +E   
Sbjct: 29   SPAFSLKSEHEDSIWGDYTLDLSFLSSDDQRSGLDDDDSLSNLSRDVETNKEGLVLEEKI 88

Query: 907  TKEAQVSNKKDPLFIKESELFVSHPQNHLTEEPHCPERCVSIESSSEALDNNSEVDSPCW 1086
                +V    +P F K  E+ +    + +  +      CVS +SS+E+  + SE DSPCW
Sbjct: 89   ASSGKVLVNPNPSFSKLPEVLMK--SSDVVGDAKLGLSCVSEKSSTESDLDESEEDSPCW 146

Query: 1087 KGTQSRCSPFECSRPVNSELLKHEVEAGKSLNPLAPQFFPQKVKESSDYHGIECCQKSIS 1266
            KG  S  S    ++ + S     ++   + LNPLAPQF P   K+  +  G +C + S S
Sbjct: 147  KGMLSHKSLASGTKSMTSRRSTDDLSGFRKLNPLAPQFIPSSSKKKLETDGEKCEETSSS 206


Top