BLASTX nr result

ID: Forsythia22_contig00004395 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Forsythia22_contig00004395
         (1002 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002533836.1| conserved hypothetical protein [Ricinus comm...   117   1e-23
ref|XP_002881116.1| hypothetical protein ARALYDRAFT_481963 [Arab...   117   1e-23
ref|NP_973566.1| late embryogenesis abundant hydroxyproline-rich...   115   5e-23
gb|KDO54911.1| hypothetical protein CISIN_1g036355mg [Citrus sin...   114   8e-23
ref|XP_010469849.1| PREDICTED: uncharacterized protein LOC104749...   107   2e-20
gb|EYU36893.1| hypothetical protein MIMGU_mgv1a014780mg [Erythra...   107   2e-20
ref|XP_010414260.1| PREDICTED: uncharacterized protein LOC104700...   105   5e-20
gb|AAU05499.1| At2g30505 [Arabidopsis thaliana] gi|51972134|gb|A...   105   5e-20
gb|KHN24723.1| hypothetical protein glysoja_029792 [Glycine soja]     103   2e-19
ref|XP_006591267.1| PREDICTED: uncharacterized protein LOC102666...   103   2e-19
ref|XP_006294612.1| hypothetical protein CARUB_v10023650mg [Caps...   103   3e-19
ref|XP_010510349.1| PREDICTED: uncharacterized protein LOC104786...   102   3e-19
ref|XP_010522019.1| PREDICTED: uncharacterized protein LOC104800...   100   2e-18
gb|KFK22951.1| hypothetical protein AALP_AAs47561U001000 [Arabis...   100   2e-18
gb|KHN18650.1| hypothetical protein glysoja_033757 [Glycine soja]     100   3e-18
ref|XP_006603113.1| PREDICTED: uncharacterized protein LOC102662...   100   3e-18
ref|XP_006410155.1| hypothetical protein EUTSA_v10016822mg [Eutr...   100   3e-18
ref|XP_010034559.1| PREDICTED: uncharacterized protein LOC104423...    98   8e-18
ref|XP_007146902.1| hypothetical protein PHAVU_006G080100g [Phas...    98   8e-18
emb|CDY69716.1| BnaUnng03380D [Brassica napus]                         97   1e-17

>ref|XP_002533836.1| conserved hypothetical protein [Ricinus communis]
           gi|223526228|gb|EEF28550.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 287

 Score =  117 bits (293), Expect = 1e-23
 Identities = 71/242 (29%), Positives = 121/242 (50%)
 Frame = -1

Query: 909 SMSSGAAANRKPQQSSQKNLSRRVSFNESTLAKSRAEPYGGDLEGQGEKGRCSSRFNMCC 730
           S+ S    + +  +  ++N    +   E    K   E    + E   E+GR   R ++CC
Sbjct: 45  SLPSVTEESEQELEGQRENQHEDIENEEEAKLKEEEEE---EEEEDEEEGRHRPRCSLCC 101

Query: 729 ACGSXXXXXXXXXXXXXXXIYFAFLQSNLPRFHVQRLDVPKLKINESKTDTFLTSEFNIY 550
           A                  I+   L+S LP F+V R+D PKL +     + FL ++ +I 
Sbjct: 102 AWMFFGSLAAVLIVLIIILIFVVTLRSALPEFYVLRMDFPKLHLASENNELFLDADVHIR 161

Query: 549 LNATNNNDKIVLVYSSMHVEVTSEGVNLGKMHVPDFSQKPSTTTELKVHTSVKDMAVEDA 370
           + A N N+++ L YS + V+V+SE + LG+  +P FSQ P   T L++ T V +    + 
Sbjct: 162 IQALNINEEVELEYSELKVQVSSEDIPLGETKIPGFSQNPKDKTVLRMRTRVWNSPANED 221

Query: 369 DAQDLHDKSKVHQLLMDIVLIGHIDFLLHGKKMNGFPFRVLCKSIFQSDLDNGHPAACNV 190
           DA+ L + ++  Q ++DI+L G+I F +   K+N  P  + C+ I Q+++D      CNV
Sbjct: 222 DAKALKENAENRQTVVDILLNGNIGFHVGVVKLNWVPALIACQKIKQAEVDFAQKPKCNV 281

Query: 189 KM 184
           K+
Sbjct: 282 KI 283


>ref|XP_002881116.1| hypothetical protein ARALYDRAFT_481963 [Arabidopsis lyrata subsp.
           lyrata] gi|297326955|gb|EFH57375.1| hypothetical protein
           ARALYDRAFT_481963 [Arabidopsis lyrata subsp. lyrata]
          Length = 316

 Score =  117 bits (293), Expect = 1e-23
 Identities = 83/255 (32%), Positives = 124/255 (48%), Gaps = 2/255 (0%)
 Frame = -1

Query: 945 SAAKPSKNGQIPSMSSGAAANRKPQQSSQKNLSRRVSFNESTLA-KSRAEPYGGDLEGQG 769
           S+   S+NG     + G A++ K   S     SR   F E   A +SR      D  GQ 
Sbjct: 63  SSPSVSRNGFDDVENPGKASSFKRPLSG----SRLSGFREEEEADRSRNSGSFVDHIGQE 118

Query: 768 EKGRCSSR-FNMCCACGSXXXXXXXXXXXXXXXIYFAFLQSNLPRFHVQRLDVPKLKINE 592
           +K  C+S  F  CCAC                    + ++S LP+  V  L   +L + +
Sbjct: 119 DKRICASGCFRKCCACTCMFVSIVLIIVLLVGLSANSSIKSILPQVLVTNLKFSRLDVAK 178

Query: 591 SKTDTFLTSEFNIYLNATNNNDKIVLVYSSMHVEVTSEGVNLGKMHVPDFSQKPSTTTEL 412
           S TD  + +  N  L  +NNNDK VL YS M  +++SE +NLGK  +P F Q P   T L
Sbjct: 179 SSTDLLMNANLNTVLQLSNNNDKTVLYYSPMKADISSENINLGKKMLPGFKQDPGNVTSL 238

Query: 411 KVHTSVKDMAVEDADAQDLHDKSKVHQLLMDIVLIGHIDFLLHGKKMNGFPFRVLCKSIF 232
           K+ T ++   V D DA  L +K K  + ++D+ L G +     G K++  P  + C+++ 
Sbjct: 239 KILTRLRKSKVYDVDATLLTNKEKTLEAVVDVFLRGKLSVDWLGFKVH-IPIVIACENVK 297

Query: 231 QSDLDNGHPAACNVK 187
           QSD+ NG   AC+V+
Sbjct: 298 QSDVINGLKPACDVR 312


>ref|NP_973566.1| late embryogenesis abundant hydroxyproline-rich glycoprotein
           [Arabidopsis thaliana] gi|110739946|dbj|BAF01878.1|
           hypothetical protein [Arabidopsis thaliana]
           gi|330253306|gb|AEC08400.1| late embryogenesis abundant
           hydroxyproline-rich glycoprotein [Arabidopsis thaliana]
          Length = 321

 Score =  115 bits (288), Expect = 5e-23
 Identities = 77/224 (34%), Positives = 111/224 (49%), Gaps = 2/224 (0%)
 Frame = -1

Query: 849 SRRVSFNESTLA-KSRAEPYGGDLEGQGEKGRCSSR-FNMCCACGSXXXXXXXXXXXXXX 676
           SR   F E   A +SR      D  GQ +K  C+S  F  CCAC                
Sbjct: 96  SRLSGFREEEEADRSRKSGSFVDHIGQEDKRICASGCFRKCCACTCMFVSVVLIIVLLVG 155

Query: 675 XIYFAFLQSNLPRFHVQRLDVPKLKINESKTDTFLTSEFNIYLNATNNNDKIVLVYSSMH 496
               + ++S LP+  V  L   +L I +S TD  + +  N  L  +NNNDK VL YS M 
Sbjct: 156 LSANSSIKSILPQVLVTNLKFSRLDIAKSSTDLLMNANLNTVLQLSNNNDKTVLYYSPMK 215

Query: 495 VEVTSEGVNLGKMHVPDFSQKPSTTTELKVHTSVKDMAVEDADAQDLHDKSKVHQLLMDI 316
            +++SE +NLGK  +  F Q P   T LK+ T ++   V D DA  L +K K  + L+D+
Sbjct: 216 ADISSENINLGKKTLSGFKQDPGNVTSLKILTRLRKSKVYDVDATLLTNKEKTLEALVDV 275

Query: 315 VLIGHIDFLLHGKKMNGFPFRVLCKSIFQSDLDNGHPAACNVKM 184
            L G +     G K++  P  + C+S+ QSD+ NG   AC+V++
Sbjct: 276 FLRGKLSVDWLGFKVH-IPIVIACESVKQSDVINGLKPACDVRI 318


>gb|KDO54911.1| hypothetical protein CISIN_1g036355mg [Citrus sinensis]
          Length = 255

 Score =  114 bits (286), Expect = 8e-23
 Identities = 64/231 (27%), Positives = 114/231 (49%)
 Frame = -1

Query: 876 PQQSSQKNLSRRVSFNESTLAKSRAEPYGGDLEGQGEKGRCSSRFNMCCACGSXXXXXXX 697
           PQ   Q+  ++RV+F+E    ++  E    D++    KG  S R   CCA          
Sbjct: 23  PQHQHQQQRTKRVAFSEIPGKRTMYE----DIQDPNSKGHRSCRCFTCCAWICISVTAFL 78

Query: 696 XXXXXXXXIYFAFLQSNLPRFHVQRLDVPKLKINESKTDTFLTSEFNIYLNATNNNDKIV 517
                   +    L+S+LP  +V ++    + I+ES    FLT E ++ L   N NDK+ 
Sbjct: 79  IIIFVLGFVAVGILRSSLPHINVLKVHSSNISISESSRQKFLTMEISVKLEFENENDKMS 138

Query: 516 LVYSSMHVEVTSEGVNLGKMHVPDFSQKPSTTTELKVHTSVKDMAVEDADAQDLHDKSKV 337
           L Y  + VE+ +E V +G   +P FSQ+    T++ V+T+  +  ++DA+A+ L  + + 
Sbjct: 139 LHYEKLRVEIKAENVRIGHTAIPGFSQRSGNETKIDVNTTTTNTRIDDANAEILKLRYRQ 198

Query: 336 HQLLMDIVLIGHIDFLLHGKKMNGFPFRVLCKSIFQSDLDNGHPAACNVKM 184
             +L+D+ + G + F   G  + G PF V C+ + ++ + +  P  C  K+
Sbjct: 199 KAMLVDLFMTGDVGFSFGGLNIKGLPFEVNCQKVNEAYIKSDDPPKCYTKL 249


>ref|XP_010469849.1| PREDICTED: uncharacterized protein LOC104749838 isoform X1
           [Camelina sativa] gi|727593641|ref|XP_010469850.1|
           PREDICTED: uncharacterized protein LOC104749838 isoform
           X2 [Camelina sativa]
          Length = 327

 Score =  107 bits (266), Expect = 2e-20
 Identities = 74/234 (31%), Positives = 110/234 (47%), Gaps = 4/234 (1%)
 Frame = -1

Query: 873 QQSSQKNLSRRVSFNESTLA-KSRAEPYGGDLEGQGEKGR---CSSRFNMCCACGSXXXX 706
           ++SS   LS    F E   A +SR      D  GQGE+ +    S  F  CC C      
Sbjct: 95  EESSSLKLS---GFREDEEADRSRNSGSLVDRIGQGEEDKRLCASGCFRKCCPCTCMLVF 151

Query: 705 XXXXXXXXXXXIYFAFLQSNLPRFHVQRLDVPKLKINESKTDTFLTSEFNIYLNATNNND 526
                           ++S LP+  V  L   +L + +S TD  + +  N  L  +N ND
Sbjct: 152 IVLVIVLLTGLSVNTSIKSKLPQVLVMNLKFSRLDVVKSTTDLLMNANLNTVLQLSNQND 211

Query: 525 KIVLVYSSMHVEVTSEGVNLGKMHVPDFSQKPSTTTELKVHTSVKDMAVEDADAQDLHDK 346
           K VL YS M  +V+SE +NLGK  +  F Q P   T LK+ T ++   V D DA  L +K
Sbjct: 212 KTVLYYSPMKADVSSENINLGKKTLHGFKQDPGNVTSLKIPTRLRQSKVYDVDATLLTNK 271

Query: 345 SKVHQLLMDIVLIGHIDFLLHGKKMNGFPFRVLCKSIFQSDLDNGHPAACNVKM 184
            K  + ++D+ L G + F   G  ++  P  + C+ + QSD+ NG    C+V++
Sbjct: 272 EKNLEAVVDVYLRGKLSFDWLGFSVH-IPIVIACEEVKQSDVINGLKPTCDVRI 324


>gb|EYU36893.1| hypothetical protein MIMGU_mgv1a014780mg [Erythranthe guttata]
          Length = 178

 Score =  107 bits (266), Expect = 2e-20
 Identities = 71/172 (41%), Positives = 89/172 (51%), Gaps = 18/172 (10%)
 Frame = -1

Query: 891 AANRKPQQSS---QKNLSRRVSFNESTLAK----------SRAEPYGGDLEGQGE----K 763
           AA+  P+ S    QK+LSRRVSFNE+TL K          S A  Y  D E Q      +
Sbjct: 2   AADSAPKGSRLPLQKSLSRRVSFNENTLPKPPDHRRPRPPSTAGDYDSDTESQTNARPRR 61

Query: 762 GRCSSRFNMCCACGSXXXXXXXXXXXXXXXIYFAFLQSNLPRFHVQRLDVPKLKINESKT 583
             C    N CCA  S               IYFAFLQSNLP   +QRLD+  L +N +  
Sbjct: 62  RGCGPLCNSCCAWTSLTVGILLILFLLLGGIYFAFLQSNLPEVRLQRLDINALAVNTTAA 121

Query: 582 -DTFLTSEFNIYLNATNNNDKIVLVYSSMHVEVTSEGVNLGKMHVPDFSQKP 430
            DT LT++F + LNATN +  I L YSSM   ++S GVN G++ + D  Q P
Sbjct: 122 GDTLLTADFEVRLNATNGSGNIELGYSSMTATISSAGVNFGEVRIADMRQAP 173


>ref|XP_010414260.1| PREDICTED: uncharacterized protein LOC104700436 [Camelina sativa]
          Length = 325

 Score =  105 bits (262), Expect = 5e-20
 Identities = 75/233 (32%), Positives = 109/233 (46%), Gaps = 3/233 (1%)
 Frame = -1

Query: 873 QQSSQKNLSRRVSFNESTLA-KSRAEPYGGDLEGQGEKGR--CSSRFNMCCACGSXXXXX 703
           ++SS   LS    F E   A +SR      D  GQ E  R   S  F  CCAC       
Sbjct: 94  EESSSLKLS---GFREDEEANRSRNSGSFVDHIGQEEDKRLCASGCFRKCCACTCMFVSI 150

Query: 702 XXXXXXXXXXIYFAFLQSNLPRFHVQRLDVPKLKINESKTDTFLTSEFNIYLNATNNNDK 523
                          ++S LP+  V  L   +L + +S TD  + +  N  L  +N NDK
Sbjct: 151 VLVIVLLTGLSVNTSIKSKLPQVLVMNLKFSRLDVVKSATDLLMNANLNTVLQLSNQNDK 210

Query: 522 IVLVYSSMHVEVTSEGVNLGKMHVPDFSQKPSTTTELKVHTSVKDMAVEDADAQDLHDKS 343
            VL YS M  +V+SE +NLGK  +  F Q P   T LK+ T ++   V D DA  L +K 
Sbjct: 211 TVLYYSPMKADVSSENINLGKKTLHGFKQDPGNVTSLKIPTRLRKSKVYDVDATLLTNKE 270

Query: 342 KVHQLLMDIVLIGHIDFLLHGKKMNGFPFRVLCKSIFQSDLDNGHPAACNVKM 184
           K  + ++D+ L G + F   G  ++  P  + C+ + QSD+ NG    C+V++
Sbjct: 271 KNLEAVVDVYLRGKLSFDWLGFSVH-IPIVIACEEVKQSDVLNGLKPTCDVRI 322


>gb|AAU05499.1| At2g30505 [Arabidopsis thaliana] gi|51972134|gb|AAU15171.1|
           At2g30505 [Arabidopsis thaliana]
          Length = 180

 Score =  105 bits (262), Expect = 5e-20
 Identities = 59/158 (37%), Positives = 89/158 (56%)
 Frame = -1

Query: 657 LQSNLPRFHVQRLDVPKLKINESKTDTFLTSEFNIYLNATNNNDKIVLVYSSMHVEVTSE 478
           ++S LP+  V  L   +L I +S TD  + +  N  L  +NNNDK VL YS M  +++SE
Sbjct: 21  IKSILPQVLVTNLKFSRLDIAKSSTDLLMNANLNTVLQLSNNNDKTVLYYSPMKADISSE 80

Query: 477 GVNLGKMHVPDFSQKPSTTTELKVHTSVKDMAVEDADAQDLHDKSKVHQLLMDIVLIGHI 298
            +NLGK  +  F Q P   T LK+ T ++   V D DA  L +K K  + L+D+ L G +
Sbjct: 81  NINLGKKTLSGFKQDPGNVTSLKILTRLRKSKVYDVDATLLTNKEKTLEALVDVFLRGKL 140

Query: 297 DFLLHGKKMNGFPFRVLCKSIFQSDLDNGHPAACNVKM 184
                G K++  P  + C+S+ QSD+ NG   AC+V++
Sbjct: 141 SVDWLGFKVH-IPIVIACESVKQSDVINGLKPACDVRI 177


>gb|KHN24723.1| hypothetical protein glysoja_029792 [Glycine soja]
          Length = 212

 Score =  103 bits (257), Expect = 2e-19
 Identities = 59/195 (30%), Positives = 97/195 (49%)
 Frame = -1

Query: 768 EKGRCSSRFNMCCACGSXXXXXXXXXXXXXXXIYFAFLQSNLPRFHVQRLDVPKLKINES 589
           +KGR       CCA                   Y AFL+S +P+ +V+  ++ K ++++ 
Sbjct: 16  DKGRYHPCCFACCAWSCLIVFILIIAILFLGITYLAFLKSGMPKINVRAFNITKFQVDDG 75

Query: 588 KTDTFLTSEFNIYLNATNNNDKIVLVYSSMHVEVTSEGVNLGKMHVPDFSQKPSTTTELK 409
                + S   + L  +N NDK+ L+Y  + V+VTSE V LGK     FSQKP   T L 
Sbjct: 76  SQK--MNSVIGLGLIFSNKNDKLKLLYGPLDVDVTSEDVLLGKKKQGGFSQKPLNVTNLD 133

Query: 408 VHTSVKDMAVEDADAQDLHDKSKVHQLLMDIVLIGHIDFLLHGKKMNGFPFRVLCKSIFQ 229
           +  ++++  V+   A++L    K ++++ D+ + GHI F +   +MN  PF   C  I +
Sbjct: 134 MTMTLENADVDKYAAEELKSDIKAYEMVFDLYVGGHIGFQVGKLQMNNVPFLASCNQIKR 193

Query: 228 SDLDNGHPAACNVKM 184
            D+D G    C VK+
Sbjct: 194 EDVDFGRKPECEVKL 208


>ref|XP_006591267.1| PREDICTED: uncharacterized protein LOC102666272 [Glycine max]
          Length = 249

 Score =  103 bits (257), Expect = 2e-19
 Identities = 59/195 (30%), Positives = 97/195 (49%)
 Frame = -1

Query: 768 EKGRCSSRFNMCCACGSXXXXXXXXXXXXXXXIYFAFLQSNLPRFHVQRLDVPKLKINES 589
           +KGR       CCA                   Y AFL+S +P+ +V+  ++ K ++++ 
Sbjct: 49  DKGRYHPCCFACCAWSCLIVFILIIAILFLGITYLAFLKSGMPKINVRAFNITKFQVDDG 108

Query: 588 KTDTFLTSEFNIYLNATNNNDKIVLVYSSMHVEVTSEGVNLGKMHVPDFSQKPSTTTELK 409
                + S   + L  +N NDK+ L+Y  + V+VTSE V LGK     FSQKP   T L 
Sbjct: 109 SQK--MNSVIGLGLIFSNKNDKLKLLYGPLDVDVTSEDVLLGKKKQGGFSQKPLNVTNLD 166

Query: 408 VHTSVKDMAVEDADAQDLHDKSKVHQLLMDIVLIGHIDFLLHGKKMNGFPFRVLCKSIFQ 229
           +  ++++  V+   A++L    K ++++ D+ + GHI F +   +MN  PF   C  I +
Sbjct: 167 MTMTLENADVDKYAAEELKSDIKAYEMVFDLYVGGHIGFQVGKLQMNNVPFLASCNQIKR 226

Query: 228 SDLDNGHPAACNVKM 184
            D+D G    C VK+
Sbjct: 227 EDVDFGRKPECEVKL 241


>ref|XP_006294612.1| hypothetical protein CARUB_v10023650mg [Capsella rubella]
           gi|482563320|gb|EOA27510.1| hypothetical protein
           CARUB_v10023650mg [Capsella rubella]
          Length = 320

 Score =  103 bits (256), Expect = 3e-19
 Identities = 71/239 (29%), Positives = 111/239 (46%), Gaps = 2/239 (0%)
 Frame = -1

Query: 894 AAANRKPQQSSQKNLSRRVSFNESTLA-KSRAEPYGGDLEGQGEKGRCSSR-FNMCCACG 721
           A++ + P    +++  +   F E   A +SR      D  G+ +K  C +  F  CCAC 
Sbjct: 80  ASSLKLPPSGMRESPLKLSGFREEEEADRSRNSGSFVDHIGREDKRICPAGCFKKCCACT 139

Query: 720 SXXXXXXXXXXXXXXXIYFAFLQSNLPRFHVQRLDVPKLKINESKTDTFLTSEFNIYLNA 541
                                ++S LP   V  L   +L + +S TD  + +  N  L  
Sbjct: 140 CMFVSIVLIIVVLTGLSVNTSIKSKLPEVLVMNLKFSRLDVAKSSTDLLMNANLNTVLQL 199

Query: 540 TNNNDKIVLVYSSMHVEVTSEGVNLGKMHVPDFSQKPSTTTELKVHTSVKDMAVEDADAQ 361
           +N NDK VL YS M  +V+SE +NLGK  +  F Q P   T LK+ T ++   V D DA 
Sbjct: 200 SNKNDKTVLYYSPMKADVSSENINLGKKTLLGFKQDPGNITSLKIPTRLRKSKVYDVDAT 259

Query: 360 DLHDKSKVHQLLMDIVLIGHIDFLLHGKKMNGFPFRVLCKSIFQSDLDNGHPAACNVKM 184
            L +K K  +  +D+ L G + F   G  ++  P  + C+ + QSD+ NG    C+V++
Sbjct: 260 LLTNKEKNLEAEVDVFLRGKLSFDWLGFNVH-IPIVIACEDVKQSDVINGLKPTCDVRI 317


>ref|XP_010510349.1| PREDICTED: uncharacterized protein LOC104786608 [Camelina sativa]
          Length = 324

 Score =  102 bits (255), Expect = 3e-19
 Identities = 73/237 (30%), Positives = 109/237 (45%), Gaps = 4/237 (1%)
 Frame = -1

Query: 882 RKPQQSSQKNLSRRVSFNESTLA-KSRAEPYGGDLEGQGEKGR---CSSRFNMCCACGSX 715
           RK   S  +  S    F E   A +SR      D  GQ E+ +    S  F  CCA    
Sbjct: 86  RKLPPSGLREESSSSGFREDDEADRSRNSGSFVDRIGQEEEDKRLCASGCFRKCCAWTCM 145

Query: 714 XXXXXXXXXXXXXXIYFAFLQSNLPRFHVQRLDVPKLKINESKTDTFLTSEFNIYLNATN 535
                            + ++S LP+  V  L   +L + +S TD  + +  N  L  +N
Sbjct: 146 FVSIVLVIVLLTGLTVNSSIKSKLPQVLVMNLKFSRLDVVKSATDLLMNANLNTVLQLSN 205

Query: 534 NNDKIVLVYSSMHVEVTSEGVNLGKMHVPDFSQKPSTTTELKVHTSVKDMAVEDADAQDL 355
            NDK VL YS M  +V+SE +NLGK  +  F Q P   T LK+ T ++   V D DA  L
Sbjct: 206 QNDKTVLYYSPMKADVSSENINLGKKTLHGFKQDPGNVTSLKIPTRLRKSKVYDVDATLL 265

Query: 354 HDKSKVHQLLMDIVLIGHIDFLLHGKKMNGFPFRVLCKSIFQSDLDNGHPAACNVKM 184
            +K K  + ++D+ L G + F   G  ++  P  + C+ + QSD+ NG    C+V++
Sbjct: 266 TNKEKNLEAVVDVYLRGKLSFDWLGFSVH-IPIVIACEEVKQSDVINGLKPTCDVRI 321


>ref|XP_010522019.1| PREDICTED: uncharacterized protein LOC104800796 [Tarenaya
           hassleriana]
          Length = 272

 Score =  100 bits (249), Expect = 2e-18
 Identities = 57/197 (28%), Positives = 96/197 (48%)
 Frame = -1

Query: 780 EGQGEKGRCSSRFNMCCACGSXXXXXXXXXXXXXXXIYFAFLQSNLPRFHVQRLDVPKLK 601
           E +  + RC   F MCCAC                    + ++S+LP  +V  L  PK++
Sbjct: 71  EERARRRRCPPCFRMCCACTCLIVSLFLLILLIVGVSLVSSVKSSLPLVNVANLRFPKME 130

Query: 600 INESKTDTFLTSEFNIYLNATNNNDKIVLVYSSMHVEVTSEGVNLGKMHVPDFSQKPSTT 421
            N +  +  L ++    L  +N NDK VL YS+M  +V+SE ++LG   +  F Q P   
Sbjct: 131 FNGTSAELVLNADSIAVLQFSNKNDKTVLYYSAMTADVSSESIDLGHTKIQGFRQDPDDV 190

Query: 420 TELKVHTSVKDMAVEDADAQDLHDKSKVHQLLMDIVLIGHIDFLLHGKKMNGFPFRVLCK 241
             +++ T V+  +V D DA  L DK K  + ++D+ L G +     G ++N  P  + C+
Sbjct: 191 RTVRISTRVRKSSVYDVDATLLRDKLKRFEAVVDVFLSGKMGLDFWGVRIN-LPTVIACE 249

Query: 240 SIFQSDLDNGHPAACNV 190
            + Q+++  G    C+V
Sbjct: 250 DVNQNEVIQGLKPKCHV 266


>gb|KFK22951.1| hypothetical protein AALP_AAs47561U001000 [Arabis alpina]
          Length = 308

 Score =  100 bits (248), Expect = 2e-18
 Identities = 68/216 (31%), Positives = 99/216 (45%), Gaps = 1/216 (0%)
 Frame = -1

Query: 828 ESTLAKSRAEPYGGDLEGQGEKGRCSSRFNMCCACGSXXXXXXXXXXXXXXXIYFAFLQS 649
           ES+  +   E   G    Q EK  CS  F  CCA                       ++S
Sbjct: 92  ESSTFREEDERNSGSFVDQEEKRDCSGCFRKCCAYTCMFLSIVLIIVLLTGLSVNTSIKS 151

Query: 648 NLPRFHVQRLDVPKLKINESKTDTFLTSEFNIYLNATN-NNDKIVLVYSSMHVEVTSEGV 472
            LP   V  L   +L    S +D  + +  N  L  +N NNDK VL YS M  +V+SE +
Sbjct: 152 RLPEVLVTNLKFSRLDFAIS-SDLLMNANMNTVLQLSNKNNDKTVLYYSPMKADVSSENI 210

Query: 471 NLGKMHVPDFSQKPSTTTELKVHTSVKDMAVEDADAQDLHDKSKVHQLLMDIVLIGHIDF 292
           NLG+  +  F Q P   T LK+ T V+   V D DA  L +K K  + ++D+ L G + F
Sbjct: 211 NLGQKRLVGFEQNPGNVTSLKIATRVRKSKVYDVDATLLRNKEKNREAVVDVFLRGKLSF 270

Query: 291 LLHGKKMNGFPFRVLCKSIFQSDLDNGHPAACNVKM 184
              G +++  P    C+ + QSD+ NG    C+V++
Sbjct: 271 DWLGFRIH-IPVVFACEDVKQSDVLNGLKPNCDVRI 305


>gb|KHN18650.1| hypothetical protein glysoja_033757 [Glycine soja]
          Length = 233

 Score = 99.8 bits (247), Expect = 3e-18
 Identities = 55/184 (29%), Positives = 95/184 (51%)
 Frame = -1

Query: 735 CCACGSXXXXXXXXXXXXXXXIYFAFLQSNLPRFHVQRLDVPKLKINESKTDTFLTSEFN 556
           CCA                   Y AFL+S +P+ +V+  ++ KL++++      + +   
Sbjct: 48  CCAWSCLIMFLLIIAILFLGITYLAFLKSGMPKVNVRAFNITKLQVDDGSQK--MNAVIG 105

Query: 555 IYLNATNNNDKIVLVYSSMHVEVTSEGVNLGKMHVPDFSQKPSTTTELKVHTSVKDMAVE 376
           + L  +N NDK+ L+Y  + V+VTSE V LGK  +  F Q P   T+L +  ++++  V+
Sbjct: 106 LGLIFSNKNDKLKLLYGPLDVDVTSEDVLLGKKKLGGFYQNPLNDTQLDMTMTLENADVD 165

Query: 375 DADAQDLHDKSKVHQLLMDIVLIGHIDFLLHGKKMNGFPFRVLCKSIFQSDLDNGHPAAC 196
              A+DL    K ++++ D+ + GHI F +   +MN  PF   C  I + D+D G    C
Sbjct: 166 KYAAEDLISDIKAYEMVFDLYVGGHIGFQVGKLQMNNVPFLASCHQIKREDVDFGRKPEC 225

Query: 195 NVKM 184
           +VK+
Sbjct: 226 DVKL 229


>ref|XP_006603113.1| PREDICTED: uncharacterized protein LOC102662992 [Glycine max]
          Length = 237

 Score = 99.8 bits (247), Expect = 3e-18
 Identities = 55/184 (29%), Positives = 95/184 (51%)
 Frame = -1

Query: 735 CCACGSXXXXXXXXXXXXXXXIYFAFLQSNLPRFHVQRLDVPKLKINESKTDTFLTSEFN 556
           CCA                   Y AFL+S +P+ +V+  ++ KL++++      + +   
Sbjct: 48  CCAWSCLIMFLLIIAILFLGITYLAFLKSGMPKVNVRAFNITKLQVDDGSQK--MNAVIG 105

Query: 555 IYLNATNNNDKIVLVYSSMHVEVTSEGVNLGKMHVPDFSQKPSTTTELKVHTSVKDMAVE 376
           + L  +N NDK+ L+Y  + V+VTSE V LGK  +  F Q P   T+L +  ++++  V+
Sbjct: 106 LGLIFSNKNDKLKLLYGPLDVDVTSEDVLLGKKKLGGFYQNPLNDTQLDMTMTLENADVD 165

Query: 375 DADAQDLHDKSKVHQLLMDIVLIGHIDFLLHGKKMNGFPFRVLCKSIFQSDLDNGHPAAC 196
              A+DL    K ++++ D+ + GHI F +   +MN  PF   C  I + D+D G    C
Sbjct: 166 KYAAEDLISDIKAYEMVFDLYVGGHIGFQVGKLQMNNVPFLASCHQIKREDVDFGRKPEC 225

Query: 195 NVKM 184
           +VK+
Sbjct: 226 DVKL 229


>ref|XP_006410155.1| hypothetical protein EUTSA_v10016822mg [Eutrema salsugineum]
           gi|557111324|gb|ESQ51608.1| hypothetical protein
           EUTSA_v10016822mg [Eutrema salsugineum]
          Length = 371

 Score = 99.8 bits (247), Expect = 3e-18
 Identities = 62/215 (28%), Positives = 100/215 (46%)
 Frame = -1

Query: 828 ESTLAKSRAEPYGGDLEGQGEKGRCSSRFNMCCACGSXXXXXXXXXXXXXXXIYFAFLQS 649
           E  + +SR      D  G+ EK +CS  F  CCAC                    + ++S
Sbjct: 155 EEDIDRSRNSGSFVDRIGREEKRKCSGCFRKCCACTCMFLSIVLIIVLLTGLSVNSSIKS 214

Query: 648 NLPRFHVQRLDVPKLKINESKTDTFLTSEFNIYLNATNNNDKIVLVYSSMHVEVTSEGVN 469
            LP   V  L   +L    S TD  L +     L  +N NDK +L YS M   V+SE +N
Sbjct: 215 RLPLVFVTNLKFSRLDFANSTTDLLLNANLKTVLQLSNKNDKTLLYYSPMKASVSSENIN 274

Query: 468 LGKMHVPDFSQKPSTTTELKVHTSVKDMAVEDADAQDLHDKSKVHQLLMDIVLIGHIDFL 289
           LG+  +  F+Q P   T L + T ++   V D DA  L +K K  + ++++ + G + F 
Sbjct: 275 LGQKTLLGFTQSPGNDTYLMIPTRLRKAKVYDVDASLLRNKEKKLEAVVNVFVSGKLGFD 334

Query: 288 LHGKKMNGFPFRVLCKSIFQSDLDNGHPAACNVKM 184
             G +++  P  + C+++ QSD+       C+V++
Sbjct: 335 WLGFRVH-LPIVIACENVNQSDVIKALKPKCDVRI 368


>ref|XP_010034559.1| PREDICTED: uncharacterized protein LOC104423812 [Eucalyptus
           grandis] gi|629086090|gb|KCW52447.1| hypothetical
           protein EUGRSUZ_J01845 [Eucalyptus grandis]
          Length = 247

 Score = 98.2 bits (243), Expect = 8e-18
 Identities = 63/245 (25%), Positives = 112/245 (45%)
 Frame = -1

Query: 918 QIPSMSSGAAANRKPQQSSQKNLSRRVSFNESTLAKSRAEPYGGDLEGQGEKGRCSSRFN 739
           + P +S G     K Q     N +RRV+F++ +   S    Y      +  +   S  F 
Sbjct: 5   ETPPLSEGGHPAGKKQ-----NETRRVAFSDVSECSSVPGTYPRKKSRKCCRSGGSCCF- 58

Query: 738 MCCACGSXXXXXXXXXXXXXXXIYFAFLQSNLPRFHVQRLDVPKLKINESKTDTFLTSEF 559
           +CCA                   + +F++S+LP   V  L  P L++     +T L +  
Sbjct: 59  ICCAWACLGVVAVILGLLVLGVWFVSFVKSDLPEVRVHSLSFPGLRVANLSKETRLDTGV 118

Query: 558 NIYLNATNNNDKIVLVYSSMHVEVTSEGVNLGKMHVPDFSQKPSTTTELKVHTSVKDMAV 379
           ++ LN TN N K+ + Y  M  E++ E + +G+ HV  F Q+P + + LK+ T      +
Sbjct: 119 DLRLNFTNKNSKVGISYEKMTAEISVEEIRIGRAHVSGFLQEPRSRSILKIATQATRSVI 178

Query: 378 EDADAQDLHDKSKVHQLLMDIVLIGHIDFLLHGKKMNGFPFRVLCKSIFQSDLDNGHPAA 199
              D QDL   +K   +++D+VL G +       K++  PF + C  + Q+++D G    
Sbjct: 179 PKDDGQDLPSNAKGKAIVLDVVLNGMLGLNFGSFKLHRLPFVIKCTDLKQAEVDIGVEPR 238

Query: 198 CNVKM 184
           C +++
Sbjct: 239 CGIRI 243


>ref|XP_007146902.1| hypothetical protein PHAVU_006G080100g [Phaseolus vulgaris]
           gi|561020125|gb|ESW18896.1| hypothetical protein
           PHAVU_006G080100g [Phaseolus vulgaris]
          Length = 243

 Score = 98.2 bits (243), Expect = 8e-18
 Identities = 57/195 (29%), Positives = 95/195 (48%)
 Frame = -1

Query: 768 EKGRCSSRFNMCCACGSXXXXXXXXXXXXXXXIYFAFLQSNLPRFHVQRLDVPKLKINES 589
           +KGR    + +CCA                   Y AFL+S +P  +V+  ++   +++E 
Sbjct: 43  DKGRHHPYWFVCCAWSCLLLFFLVIAFLFACITYLAFLKSGMPLVYVRAFNITGFQVDEG 102

Query: 588 KTDTFLTSEFNIYLNATNNNDKIVLVYSSMHVEVTSEGVNLGKMHVPDFSQKPSTTTELK 409
                + +   + L  +N NDK+ L+Y  + ++VTSE + LGK  V  F+Q P   T L 
Sbjct: 103 SQK--MDAVIGLKLLFSNKNDKLQLLYGPLSIDVTSENILLGKNKVDAFTQMPLNDTNLD 160

Query: 408 VHTSVKDMAVEDADAQDLHDKSKVHQLLMDIVLIGHIDFLLHGKKMNGFPFRVLCKSIFQ 229
           +  +V +  V    A+DL    K ++++ D+   GHI F +   +MN  PF   C  I  
Sbjct: 161 MIMTVNNADVNPYAAEDLKADIKANEMVFDVYAGGHIGFKVGSLEMNNVPFVACCHEINL 220

Query: 228 SDLDNGHPAACNVKM 184
            D+D G   +C+VK+
Sbjct: 221 MDVDFGRRPSCDVKL 235


>emb|CDY69716.1| BnaUnng03380D [Brassica napus]
          Length = 322

 Score = 97.4 bits (241), Expect = 1e-17
 Identities = 57/197 (28%), Positives = 93/197 (47%)
 Frame = -1

Query: 774 QGEKGRCSSRFNMCCACGSXXXXXXXXXXXXXXXIYFAFLQSNLPRFHVQRLDVPKLKIN 595
           + EK   S  F  CCAC                    + L S LP+ +V  L   +L + 
Sbjct: 124 EDEKRDWSGCFRKCCACTCMFLSIVLIIVLLTGLSVNSSLHSRLPQVYVTNLKFSRLGLA 183

Query: 594 ESKTDTFLTSEFNIYLNATNNNDKIVLVYSSMHVEVTSEGVNLGKMHVPDFSQKPSTTTE 415
            + TD  L +     L  +N ND+  L YS M   V+SE +NLG+  +  F+Q P   T 
Sbjct: 184 NTTTDLLLNANVKTVLELSNKNDQPTLYYSPMKASVSSENINLGQKKLLGFTQSPGNVTY 243

Query: 414 LKVHTSVKDMAVEDADAQDLHDKSKVHQLLMDIVLIGHIDFLLHGKKMNGFPFRVLCKSI 235
           L + T ++   V D D+  L +K K H+ ++ +++ G + F   G ++   P  + C+ +
Sbjct: 244 LNIATRIRKSKVYDVDSTLLRNKEKNHEAVVKVLVSGKLGFDWLGFRVR-LPVVIACEDV 302

Query: 234 FQSDLDNGHPAACNVKM 184
            QSD+ NG    C+V++
Sbjct: 303 KQSDVINGLKPMCDVRI 319