BLASTX nr result

ID: Rheum21_contig00017849 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rheum21_contig00017849
         (3167 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002276607.2| PREDICTED: uncharacterized protein LOC100253...   482   e-133
gb|EOY34688.1| NT domain of poly(A) polymerase and terminal urid...   459   e-126
gb|EOY34687.1| NT domain of poly(A) polymerase and terminal urid...   459   e-126
ref|XP_002319410.2| hypothetical protein POPTR_0013s15100g [Popu...   459   e-126
ref|XP_006429558.1| hypothetical protein CICLE_v10011044mg [Citr...   458   e-126
ref|XP_006371669.1| hypothetical protein POPTR_0019s14930g [Popu...   454   e-124
ref|XP_002325647.1| predicted protein [Populus trichocarpa]           454   e-124
ref|XP_006575451.1| PREDICTED: uncharacterized protein LOC100814...   450   e-123
ref|XP_006575450.1| PREDICTED: uncharacterized protein LOC100814...   450   e-123
ref|XP_003519288.1| PREDICTED: uncharacterized protein LOC100814...   450   e-123
gb|EOY04484.1| NT domain of poly(A) polymerase and terminal urid...   450   e-123
ref|XP_002518281.1| nucleic acid binding protein, putative [Rici...   447   e-122
gb|ESW14042.1| hypothetical protein PHAVU_008G248100g [Phaseolus...   447   e-122
ref|XP_006596466.1| PREDICTED: uncharacterized protein LOC100816...   447   e-122
ref|XP_006596465.1| PREDICTED: uncharacterized protein LOC100816...   447   e-122
ref|XP_003544929.1| PREDICTED: uncharacterized protein LOC100816...   447   e-122
ref|XP_006350879.1| PREDICTED: uncharacterized protein LOC102602...   446   e-122
ref|XP_004490712.1| PREDICTED: uncharacterized protein LOC101490...   446   e-122
gb|EMJ09368.1| hypothetical protein PRUPE_ppa001915mg [Prunus pe...   445   e-122
ref|XP_004142733.1| PREDICTED: uncharacterized protein LOC101207...   445   e-122

>ref|XP_002276607.2| PREDICTED: uncharacterized protein LOC100253523 [Vitis vinifera]
          Length = 854

 Score =  482 bits (1241), Expect = e-133
 Identities = 247/382 (64%), Positives = 291/382 (76%), Gaps = 7/382 (1%)
 Frame = -3

Query: 2700 MGDLRV-SPRRPNGAVW------PLEVSSCVGADVAGDLRWTAVEEAAAEVLRKIQPTLS 2542
            MGDL++ SP  PNG V        L  S  + A +AGD  W A E A  E++ K+QPTL 
Sbjct: 1    MGDLKLPSPFLPNGVVSYRGASRSLSSSPPLPASIAGD-SWAAAERATQEIVAKMQPTLG 59

Query: 2541 SERKRAHVISYVQRLLSCSLGCQVFPYGSVPLKTYLPDGDIDLTVLSKPSAEDALISDIF 2362
            S R+R  VI YVQRL+ C LGC+VFPYGSVPLKTYL DGDIDLT L   + E+AL SD+ 
Sbjct: 60   SMRERQEVIDYVQRLIGCCLGCEVFPYGSVPLKTYLLDGDIDLTALCSSNVEEALASDVH 119

Query: 2361 YVLQQEELNEHAEFEVQDVQLISAEVKLVKCLIQDIIVDISCNQLGGLCTLCFLEQIDRF 2182
             VL+ EE NE+AEFEV+D+Q I+AEVKLVKCL++DI++DIS NQLGGL TLCFLEQ+DR 
Sbjct: 120  AVLKGEEQNENAEFEVKDIQFITAEVKLVKCLVKDIVIDISFNQLGGLSTLCFLEQVDRL 179

Query: 2181 VDKDHLFKRSIILIKAWCYYESRILGATHGLFSTYALEILVLYIFHIYHSSLRGPLSVLY 2002
            + KDHLFKRSIILIK+WCYYESRILGA HGL STYALEILVLYIFH++H SL GPL+VLY
Sbjct: 180  IGKDHLFKRSIILIKSWCYYESRILGAHHGLISTYALEILVLYIFHLFHLSLDGPLAVLY 239

Query: 2001 RFLDYFSNFDWENYCVSLHGLVRKSCLPDIVVHTPEVARYSRLLSKQFLRNSTSMFKVPP 1822
            RFLDYFS FDW+NYC+SL+G V KS LPDIV   PE  +   LLS++FLRN   MF VP 
Sbjct: 240  RFLDYFSKFDWDNYCISLNGPVCKSSLPDIVAELPENGQDDLLLSEEFLRNCVDMFSVPF 299

Query: 1821 RDHGNQSRPFQQKFINIIDPLKDDNNLGRSVNQGNFYRIRSALKFGARKLGQILSLPTES 1642
            R     SR F  K +NIIDPL+++NNLGRSVN+GNFYRIRSA K+G+ KLGQILSLP E 
Sbjct: 300  RGLETNSRTFPLKHLNIIDPLRENNNLGRSVNKGNFYRIRSAFKYGSHKLGQILSLPREV 359

Query: 1641 TRRELKNFFTNTLARHGGRLQA 1576
             + ELKNFF +TL RH  +  A
Sbjct: 360  IQDELKNFFASTLERHRSKYMA 381



 Score = 84.3 bits (207), Expect = 3e-13
 Identities = 89/333 (26%), Positives = 136/333 (40%), Gaps = 46/333 (13%)
 Frame = -2

Query: 1330 SIDIVGEPMENSRAATNSLPPCNHYRRTHSVSSGRKPYTARVSSG----HRSWSLIDHNE 1163
            SI +  E  EN   A  S    +++   +S+ S     TA +S       R      +  
Sbjct: 528  SIVLQQESKENHFVANTSFSSHSYHEGHNSIGSIISRPTANISENTALAFRGRDFACNAG 587

Query: 1162 ISGPLDPFSDLTGDYESHLKSLLYGQSFHGNATIQPMVYNLPVWDMQCQTN--------- 1010
              G L+   DL+GDY+SH++SL YGQ  +G+A   P++ + P+   Q Q N         
Sbjct: 588  SLGSLETLLDLSGDYDSHIRSLQYGQCCYGHALPPPLLPSPPLSPSQLQINTPWDKVRQH 647

Query: 1009 --FGQDCYFQMNPNHVMWEQPFP-----------------KVRGTGTYIPRTDLGSWKGG 887
              F Q+ + QM+ N V+    FP                 K RGTGTY P  ++      
Sbjct: 648  LQFTQNLHSQMDSNGVILGNHFPVKHPARSITAFGLEDKQKPRGTGTYFP--NMSHLPNR 705

Query: 886  KLPVRKGRKRTQGSPTF-QRYNHDHKFGVGMITVQANVTDQIYHHVDVPESKRSTAYPSS 710
              PV  G++R Q   +  Q +   H+ G+     + N+ ++  H +          YP  
Sbjct: 706  DRPV--GQRRNQALESHSQLHRRKHRNGLVAAQQEMNLIEETSHELS------QLQYPVL 757

Query: 709  VYAASILSEQS----EGCEFRSSENLAGEKDGPDERASDDSE---------PNPKAPAML 569
             +  SI +  S    +  EF S   ++     PD     DS           +P    M 
Sbjct: 758  GHGKSIHANGSSLPPKRLEFGSFGTMSSGLPTPDRCTKPDSSGTLPAWGATASPVGSRMQ 817

Query: 568  IPDQKLADTEGTLERVAGKSYQLKDEEDFPPLA 470
             P   L + E   +R  G SY LK+E+DFPPL+
Sbjct: 818  SPKPVLGNEE---KRFEGLSYHLKNEDDFPPLS 847


>gb|EOY34688.1| NT domain of poly(A) polymerase and terminal uridylyl
            transferase-containing protein, putative isoform 2
            [Theobroma cacao]
          Length = 836

 Score =  459 bits (1181), Expect = e-126
 Identities = 226/337 (67%), Positives = 262/337 (77%)
 Frame = -3

Query: 2601 WTAVEEAAAEVLRKIQPTLSSERKRAHVISYVQRLLSCSLGCQVFPYGSVPLKTYLPDGD 2422
            W + EE A  ++  +QPTL ++RKR  ++ YVQRL+   LG QVFPYGSVPLKTYLPDGD
Sbjct: 47   WDSAEETARRIVWSVQPTLDADRKRKEIVEYVQRLIQDGLGYQVFPYGSVPLKTYLPDGD 106

Query: 2421 IDLTVLSKPSAEDALISDIFYVLQQEELNEHAEFEVQDVQLISAEVKLVKCLIQDIIVDI 2242
            IDLT LS P+ ED L+SD+  +L+ EE N+ A + V+DV  I AEVKLVKCL+QDI+VDI
Sbjct: 107  IDLTTLSSPAIEDTLVSDVHAILRGEEHNQKAPYRVKDVHCIDAEVKLVKCLVQDIVVDI 166

Query: 2241 SCNQLGGLCTLCFLEQIDRFVDKDHLFKRSIILIKAWCYYESRILGATHGLFSTYALEIL 2062
            S NQLGGLCTLCFLEQIDR V KDHLFKRSIILIKAWCYYESRILGA HGL STYALE L
Sbjct: 167  SFNQLGGLCTLCFLEQIDRLVGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETL 226

Query: 2061 VLYIFHIYHSSLRGPLSVLYRFLDYFSNFDWENYCVSLHGLVRKSCLPDIVVHTPEVARY 1882
            VLYIFH++HSSL GP++VLYRFLDYFS FDWENYC+SL+G V KS LPDIV   PE    
Sbjct: 227  VLYIFHLFHSSLTGPIAVLYRFLDYFSKFDWENYCISLNGPVCKSSLPDIVAEVPENVGN 286

Query: 1881 SRLLSKQFLRNSTSMFKVPPRDHGNQSRPFQQKFINIIDPLKDDNNLGRSVNQGNFYRIR 1702
            + LLS++FLR   +MF VP +     SR F  K +NIIDPLK++NNLGRSVN+GN+YRIR
Sbjct: 287  NPLLSEEFLRKCINMFSVPSKGVETNSRLFPLKHLNIIDPLKENNNLGRSVNRGNYYRIR 346

Query: 1701 SALKFGARKLGQILSLPTESTRRELKNFFTNTLARHG 1591
            SA K+GA KL QIL LP E    EL  FF NTL RHG
Sbjct: 347  SAFKYGAHKLEQILILPRERIPDELVKFFANTLERHG 383


>gb|EOY34687.1| NT domain of poly(A) polymerase and terminal uridylyl
            transferase-containing protein, putative isoform 1
            [Theobroma cacao]
          Length = 836

 Score =  459 bits (1181), Expect = e-126
 Identities = 226/337 (67%), Positives = 262/337 (77%)
 Frame = -3

Query: 2601 WTAVEEAAAEVLRKIQPTLSSERKRAHVISYVQRLLSCSLGCQVFPYGSVPLKTYLPDGD 2422
            W + EE A  ++  +QPTL ++RKR  ++ YVQRL+   LG QVFPYGSVPLKTYLPDGD
Sbjct: 47   WDSAEETARRIVWSVQPTLDADRKRKEIVEYVQRLIQDGLGYQVFPYGSVPLKTYLPDGD 106

Query: 2421 IDLTVLSKPSAEDALISDIFYVLQQEELNEHAEFEVQDVQLISAEVKLVKCLIQDIIVDI 2242
            IDLT LS P+ ED L+SD+  +L+ EE N+ A + V+DV  I AEVKLVKCL+QDI+VDI
Sbjct: 107  IDLTTLSSPAIEDTLVSDVHAILRGEEHNQKAPYRVKDVHCIDAEVKLVKCLVQDIVVDI 166

Query: 2241 SCNQLGGLCTLCFLEQIDRFVDKDHLFKRSIILIKAWCYYESRILGATHGLFSTYALEIL 2062
            S NQLGGLCTLCFLEQIDR V KDHLFKRSIILIKAWCYYESRILGA HGL STYALE L
Sbjct: 167  SFNQLGGLCTLCFLEQIDRLVGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETL 226

Query: 2061 VLYIFHIYHSSLRGPLSVLYRFLDYFSNFDWENYCVSLHGLVRKSCLPDIVVHTPEVARY 1882
            VLYIFH++HSSL GP++VLYRFLDYFS FDWENYC+SL+G V KS LPDIV   PE    
Sbjct: 227  VLYIFHLFHSSLTGPIAVLYRFLDYFSKFDWENYCISLNGPVCKSSLPDIVAEVPENVGN 286

Query: 1881 SRLLSKQFLRNSTSMFKVPPRDHGNQSRPFQQKFINIIDPLKDDNNLGRSVNQGNFYRIR 1702
            + LLS++FLR   +MF VP +     SR F  K +NIIDPLK++NNLGRSVN+GN+YRIR
Sbjct: 287  NPLLSEEFLRKCINMFSVPSKGVETNSRLFPLKHLNIIDPLKENNNLGRSVNRGNYYRIR 346

Query: 1701 SALKFGARKLGQILSLPTESTRRELKNFFTNTLARHG 1591
            SA K+GA KL QIL LP E    EL  FF NTL RHG
Sbjct: 347  SAFKYGAHKLEQILILPRERIPDELVKFFANTLERHG 383



 Score = 68.6 bits (166), Expect = 2e-08
 Identities = 78/270 (28%), Positives = 109/270 (40%), Gaps = 40/270 (14%)
 Frame = -2

Query: 1159 SGPLDPFSDLTGDYESHLKSLLYGQSFH---GNATIQPMVYNLPVWD-MQCQTNFGQDCY 992
            S  L    DLTGDY+    SLLYGQ  H    ++ + P + N   W+ ++      QD Y
Sbjct: 573  SESLKSLLDLTGDYDGQFWSLLYGQYCHLFSVSSPVSPHLQNENHWETIEQSIPLKQDLY 632

Query: 991  FQMNPN----------------HVMWEQPFPKVRGTGTYIPRTDLGSWKGGKLPVRKGRK 860
             Q + N                H   +    K RGTGTYIP      ++  +     GR 
Sbjct: 633  SQRDSNGILGSQFCFSKPPVAVHTALDSEDKKKRGTGTYIPSI---KYRSNRERHSSGRG 689

Query: 859  RTQGSPTF---QRYNHDHKFGVGMITVQANV--TDQIYHHVDVPE-----------SKRS 728
              Q S  +   QRY ++     G  TVQ  +  + +  H +   E               
Sbjct: 690  IFQASRAYSQLQRYTNNK----GSATVQQEMALSQEGSHELSPKEYPALGPVKFGPPNTH 745

Query: 727  TAYPS--SVYAASILSEQSEGCEFRSSENLAGEKDGPDERASDDSEPNPKAPAMLIPDQK 554
              YPS   + AAS L+   E  E  SS       + P++ A  D       P+++IP  +
Sbjct: 746  PPYPSVWGLCAASGLNCPPERFESESSSLELQSTNMPEDNALPDPCTCGSTPSVMIPAAQ 805

Query: 553  LADT--EGTLERVAGKSYQLKDEEDFPPLA 470
             A    E   E  AG SY LK+E DFPPL+
Sbjct: 806  SAKPVLESNQESDAGLSYHLKNEHDFPPLS 835


>ref|XP_002319410.2| hypothetical protein POPTR_0013s15100g [Populus trichocarpa]
            gi|550325888|gb|EEE95333.2| hypothetical protein
            POPTR_0013s15100g [Populus trichocarpa]
          Length = 681

 Score =  459 bits (1180), Expect = e-126
 Identities = 223/337 (66%), Positives = 265/337 (78%)
 Frame = -3

Query: 2601 WTAVEEAAAEVLRKIQPTLSSERKRAHVISYVQRLLSCSLGCQVFPYGSVPLKTYLPDGD 2422
            W   EE A E++ +I PT+ S  KR  VI YVQRL+  SLG +VFPYGSVPLKTYLPDGD
Sbjct: 58   WERAEEVATEIVYRIHPTVESSFKRKQVIDYVQRLIRYSLGFEVFPYGSVPLKTYLPDGD 117

Query: 2421 IDLTVLSKPSAEDALISDIFYVLQQEELNEHAEFEVQDVQLISAEVKLVKCLIQDIIVDI 2242
            IDLT +S P+ E+AL+SD++ VL+ EELNE A +EV+DV  I AEVKL+KC++Q+ +VDI
Sbjct: 118  IDLTAISSPAIEEALVSDVYTVLRGEELNEDALYEVKDVHCIDAEVKLIKCIVQNTVVDI 177

Query: 2241 SCNQLGGLCTLCFLEQIDRFVDKDHLFKRSIILIKAWCYYESRILGATHGLFSTYALEIL 2062
            S NQLGGLCTLCFLE++DR V K+HLFKRSIILIKAWCYYESRILGA HGL STYALE L
Sbjct: 178  SFNQLGGLCTLCFLEEVDRLVGKNHLFKRSIILIKAWCYYESRILGAHHGLISTYALETL 237

Query: 2061 VLYIFHIYHSSLRGPLSVLYRFLDYFSNFDWENYCVSLHGLVRKSCLPDIVVHTPEVARY 1882
            +LYIFH++HSSL GPL+VLY+FLDYFS FDWENYC+SL+G V KS LP+IV   PE    
Sbjct: 238  ILYIFHLFHSSLNGPLAVLYKFLDYFSKFDWENYCISLNGPVCKSSLPNIVAKPPENVSG 297

Query: 1881 SRLLSKQFLRNSTSMFKVPPRDHGNQSRPFQQKFINIIDPLKDDNNLGRSVNQGNFYRIR 1702
              LLS +FL++    F VP R     SRPF QK +NI+DPLK++NNLGRSVN+GNF+RIR
Sbjct: 298  ELLLSDEFLKDCVDRFYVPSRKPEMNSRPFPQKHLNIVDPLKENNNLGRSVNRGNFFRIR 357

Query: 1701 SALKFGARKLGQILSLPTESTRRELKNFFTNTLARHG 1591
            SA K+G RKLG+IL LP E    ELK FF NTL RHG
Sbjct: 358  SAFKYGGRKLGRILLLPREKIADELKTFFANTLDRHG 394


>ref|XP_006429558.1| hypothetical protein CICLE_v10011044mg [Citrus clementina]
            gi|568855155|ref|XP_006481174.1| PREDICTED:
            uncharacterized protein LOC102622468 [Citrus sinensis]
            gi|557531615|gb|ESR42798.1| hypothetical protein
            CICLE_v10011044mg [Citrus clementina]
          Length = 882

 Score =  458 bits (1179), Expect = e-126
 Identities = 233/375 (62%), Positives = 281/375 (74%), Gaps = 5/375 (1%)
 Frame = -3

Query: 2700 MGDLRVSPRRPNGAVW---PLEVSSCVGAD--VAGDLRWTAVEEAAAEVLRKIQPTLSSE 2536
            MGDLR     PNGAV+   P   SS V ++    G   W   EEA   ++ ++QPT+ SE
Sbjct: 1    MGDLRDWSPEPNGAVFGERPSSSSSSVPSNQTAIGAEYWQRAEEATQAIIAQVQPTVVSE 60

Query: 2535 RKRAHVISYVQRLLSCSLGCQVFPYGSVPLKTYLPDGDIDLTVLSKPSAEDALISDIFYV 2356
             +R  VI YVQRL+   LGC+VFP+GSVPLKTYLPDGDIDLT     + E+AL +D+  V
Sbjct: 61   ERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEEALANDVCSV 120

Query: 2355 LQQEELNEHAEFEVQDVQLISAEVKLVKCLIQDIIVDISCNQLGGLCTLCFLEQIDRFVD 2176
            L++E+ N+ AEF V+D QLI AEVKLVKCL+Q+I+VDIS NQLGGL TLCFLEQ+DR + 
Sbjct: 121  LEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRLIG 180

Query: 2175 KDHLFKRSIILIKAWCYYESRILGATHGLFSTYALEILVLYIFHIYHSSLRGPLSVLYRF 1996
            KDHLFKRSIILIKAWCYYESRILGA HGL STYALE LVLYIFH++HSSL GPL+VLY+F
Sbjct: 181  KDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHSSLNGPLAVLYKF 240

Query: 1995 LDYFSNFDWENYCVSLHGLVRKSCLPDIVVHTPEVARYSRLLSKQFLRNSTSMFKVPPRD 1816
            LDYFS FDW++YC+SL+G VR S LP++VV TPE +    LLS +FL+     F VP R 
Sbjct: 241  LDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLKECVEQFSVPSRG 300

Query: 1815 HGNQSRPFQQKFINIIDPLKDDNNLGRSVNQGNFYRIRSALKFGARKLGQILSLPTESTR 1636
                SR F  K +NI+DPLK++NNLGRSV++GNFYRIRSA  +GARKLG ILS P ES  
Sbjct: 301  FDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHILSQPEESLT 360

Query: 1635 RELKNFFTNTLARHG 1591
             EL+ FF+NTL RHG
Sbjct: 361  DELRKFFSNTLDRHG 375


>ref|XP_006371669.1| hypothetical protein POPTR_0019s14930g [Populus trichocarpa]
            gi|550317591|gb|ERP49466.1| hypothetical protein
            POPTR_0019s14930g [Populus trichocarpa]
          Length = 808

 Score =  454 bits (1167), Expect = e-124
 Identities = 222/337 (65%), Positives = 264/337 (78%)
 Frame = -3

Query: 2601 WTAVEEAAAEVLRKIQPTLSSERKRAHVISYVQRLLSCSLGCQVFPYGSVPLKTYLPDGD 2422
            W   EE   E++ +I PT+ S  KR  +I YVQRL+  SLG +VFPYGSVPLKTYLPDGD
Sbjct: 58   WERAEEFTREIVYRIHPTVESNFKRKQIIGYVQRLIKSSLGFEVFPYGSVPLKTYLPDGD 117

Query: 2421 IDLTVLSKPSAEDALISDIFYVLQQEELNEHAEFEVQDVQLISAEVKLVKCLIQDIIVDI 2242
            IDLT +S P+ E+AL+SDI  VL++EELNE + FEV+DV  I AEVKL+KC++Q+ +VDI
Sbjct: 118  IDLTSISSPAIEEALVSDIHAVLRREELNEDSTFEVKDVHCIDAEVKLIKCIVQNTVVDI 177

Query: 2241 SCNQLGGLCTLCFLEQIDRFVDKDHLFKRSIILIKAWCYYESRILGATHGLFSTYALEIL 2062
            S NQLGGLCTLCFLE++DR V K+HLFKRSIILIKAWCYYESRILGA HGL STYALE L
Sbjct: 178  SFNQLGGLCTLCFLEEVDRLVGKNHLFKRSIILIKAWCYYESRILGAHHGLISTYALETL 237

Query: 2061 VLYIFHIYHSSLRGPLSVLYRFLDYFSNFDWENYCVSLHGLVRKSCLPDIVVHTPEVARY 1882
            +LYIFH++H SL GPL+VLYRFL+YFS FDWENYC+SL+G V KS LP+IV    E  + 
Sbjct: 238  ILYIFHLFHCSLNGPLAVLYRFLEYFSKFDWENYCISLNGPVCKSSLPNIVAEPLENGQG 297

Query: 1881 SRLLSKQFLRNSTSMFKVPPRDHGNQSRPFQQKFINIIDPLKDDNNLGRSVNQGNFYRIR 1702
              LLS +FL++    F VP R     SRPF QK +NI+DPLK++NNLGRSVN+GNF+RIR
Sbjct: 298  ELLLSDEFLKDCADRFSVPSRKPEMNSRPFPQKHLNIVDPLKENNNLGRSVNRGNFFRIR 357

Query: 1701 SALKFGARKLGQILSLPTESTRRELKNFFTNTLARHG 1591
            SA K+GARKLGQIL LP E    ELK FF NTL RHG
Sbjct: 358  SAFKYGARKLGQILLLPKERIADELKIFFANTLDRHG 394



 Score = 64.3 bits (155), Expect = 3e-07
 Identities = 92/335 (27%), Positives = 134/335 (40%), Gaps = 55/335 (16%)
 Frame = -2

Query: 1312 EPMENSRAATNSLPPCN-HYRRTHSVSSGRKPY--------TARVSSGHRSWSLIDHNEI 1160
            EP +N    +NS+  C  H     SVS+   P         T RV    + ++ I  N  
Sbjct: 484  EPKQNHFQNSNSVCSCTKHEGIAPSVSTTPNPADNVPENLSTTRVE---KDFAGITGN-- 538

Query: 1159 SGPLDPFSDLTGDYESHLKSLLYGQSFHGNAT---IQPMVYNLPV------WD-MQCQTN 1010
            S PL     L GD+  HL+SL Y Q  H +A    I P    LP+      W+ +Q    
Sbjct: 539  SQPLKSLLGLRGDHNGHLQSLAYSQYCHMHAVSAPIPPCPSMLPLSENKNRWETVQQSLQ 598

Query: 1009 FGQDCYFQMNPNHVMWEQ--------PFPKV---------RGTGTYIPRTDLGSWKGGKL 881
              Q+ + QMN NH+   Q        PF            RGTGTYIP     S +G +L
Sbjct: 599  LKQNGHSQMNTNHIFGTQLYCVNPGGPFRAATDSEEKKIRRGTGTYIPNMSYHSSRGDRL 658

Query: 880  PVRKGRKRTQGSPTFQRYNHDHKFGVGMITVQANVTDQIYHHVDVPES------------ 737
             + +GR + Q +   Q + + H+ G+     + N+++   H  D+ E+            
Sbjct: 659  SLGRGRTQPQANHG-QLHKYTHENGLPTTLQEKNLSE---HGHDLSEAEYPHLGNGKPVP 714

Query: 736  -KRSTAYPSSVYAASILSEQSEG-----CEFRSSENLAGEKDGPDERASDDSEPNPKAPA 575
             +   +YP SV+ +S  +  S       C  R  ++  G     D      S P   A +
Sbjct: 715  LEAHHSYP-SVWGSSNANGSSRAFVRTDCGSRGLQHPEGPPSTSDLVVL--SCPGTSATS 771

Query: 574  MLIPDQK-LADTEGTLERVAGKSYQLKDEEDFPPL 473
             +    K L   E   ER   + Y LKD   FPPL
Sbjct: 772  PVASTAKDLEILENEQERALLQQYHLKDNVHFPPL 806


>ref|XP_002325647.1| predicted protein [Populus trichocarpa]
          Length = 533

 Score =  454 bits (1167), Expect = e-124
 Identities = 222/337 (65%), Positives = 264/337 (78%)
 Frame = -3

Query: 2601 WTAVEEAAAEVLRKIQPTLSSERKRAHVISYVQRLLSCSLGCQVFPYGSVPLKTYLPDGD 2422
            W   EE   E++ +I PT+ S  KR  +I YVQRL+  SLG +VFPYGSVPLKTYLPDGD
Sbjct: 58   WERAEEFTREIVYRIHPTVESNFKRKQIIGYVQRLIKSSLGFEVFPYGSVPLKTYLPDGD 117

Query: 2421 IDLTVLSKPSAEDALISDIFYVLQQEELNEHAEFEVQDVQLISAEVKLVKCLIQDIIVDI 2242
            IDLT +S P+ E+AL+SDI  VL++EELNE + FEV+DV  I AEVKL+KC++Q+ +VDI
Sbjct: 118  IDLTSISSPAIEEALVSDIHAVLRREELNEDSTFEVKDVHCIDAEVKLIKCIVQNTVVDI 177

Query: 2241 SCNQLGGLCTLCFLEQIDRFVDKDHLFKRSIILIKAWCYYESRILGATHGLFSTYALEIL 2062
            S NQLGGLCTLCFLE++DR V K+HLFKRSIILIKAWCYYESRILGA HGL STYALE L
Sbjct: 178  SFNQLGGLCTLCFLEEVDRLVGKNHLFKRSIILIKAWCYYESRILGAHHGLISTYALETL 237

Query: 2061 VLYIFHIYHSSLRGPLSVLYRFLDYFSNFDWENYCVSLHGLVRKSCLPDIVVHTPEVARY 1882
            +LYIFH++H SL GPL+VLYRFL+YFS FDWENYC+SL+G V KS LP+IV    E  + 
Sbjct: 238  ILYIFHLFHCSLNGPLAVLYRFLEYFSKFDWENYCISLNGPVCKSSLPNIVAEPLENGQG 297

Query: 1881 SRLLSKQFLRNSTSMFKVPPRDHGNQSRPFQQKFINIIDPLKDDNNLGRSVNQGNFYRIR 1702
              LLS +FL++    F VP R     SRPF QK +NI+DPLK++NNLGRSVN+GNF+RIR
Sbjct: 298  ELLLSDEFLKDCADRFSVPSRKPEMNSRPFPQKHLNIVDPLKENNNLGRSVNRGNFFRIR 357

Query: 1701 SALKFGARKLGQILSLPTESTRRELKNFFTNTLARHG 1591
            SA K+GARKLGQIL LP E    ELK FF NTL RHG
Sbjct: 358  SAFKYGARKLGQILLLPKERIADELKIFFANTLDRHG 394


>ref|XP_006575451.1| PREDICTED: uncharacterized protein LOC100814626 isoform X3 [Glycine
            max]
          Length = 782

 Score =  450 bits (1158), Expect = e-123
 Identities = 236/383 (61%), Positives = 277/383 (72%), Gaps = 13/383 (3%)
 Frame = -3

Query: 2700 MGDLRV-------------SPRRPNGAVWPLEVSSCVGADVAGDLRWTAVEEAAAEVLRK 2560
            MGDL V             SP  P    W  + SS V AD      W A E   AE+LR+
Sbjct: 1    MGDLHVNGVVFGEDRPCASSPPSPPLPPWNPDPSS-VAADA-----WAAAERNTAEILRR 54

Query: 2559 IQPTLSSERKRAHVISYVQRLLSCSLGCQVFPYGSVPLKTYLPDGDIDLTVLSKPSAEDA 2380
            I+PTL+++R+R  V+ YVQRL+     C+VFPYGSVPLKTYLPDGDIDLT LS  + ED 
Sbjct: 55   IRPTLAADRRRREVVDYVQRLIRYGARCEVFPYGSVPLKTYLPDGDIDLTALSCENIEDG 114

Query: 2379 LISDIFYVLQQEELNEHAEFEVQDVQLISAEVKLVKCLIQDIIVDISCNQLGGLCTLCFL 2200
            L+SD+  VL  EE+NE AE+EV+DV+ I AEVKLVKC++QDI+VDIS NQLGGL TLCFL
Sbjct: 115  LVSDVRAVLHGEEINEAAEYEVKDVRFIDAEVKLVKCIVQDIVVDISFNQLGGLSTLCFL 174

Query: 2199 EQIDRFVDKDHLFKRSIILIKAWCYYESRILGATHGLFSTYALEILVLYIFHIYHSSLRG 2020
            E++DR V KDHLFKRSIILIKAWCYYESR+LGA HGL STYALE LVLYIFH +H SL G
Sbjct: 175  EKVDRLVAKDHLFKRSIILIKAWCYYESRVLGAHHGLISTYALETLVLYIFHQFHVSLDG 234

Query: 2019 PLSVLYRFLDYFSNFDWENYCVSLHGLVRKSCLPDIVVHTPEVARYSRLLSKQFLRNSTS 1840
            PL+VLYRFLDYFS FDW+NYCVSL G V K+ LP+IV   PE    + LL+++F+R+   
Sbjct: 235  PLAVLYRFLDYFSKFDWDNYCVSLKGPVSKTSLPNIVAEVPENGG-NTLLTEEFIRSCVE 293

Query: 1839 MFKVPPRDHGNQSRPFQQKFINIIDPLKDDNNLGRSVNQGNFYRIRSALKFGARKLGQIL 1660
             F VP R      R F QK +NIIDPLK++NNLGRSVN+GNFYRIRSA K+GARKLG IL
Sbjct: 294  SFSVPSRGADLNLRAFPQKHLNIIDPLKENNNLGRSVNKGNFYRIRSAFKYGARKLGWIL 353

Query: 1659 SLPTESTRRELKNFFTNTLARHG 1591
             LP +    EL  FF NTL RHG
Sbjct: 354  RLPEDRIAEELIRFFANTLERHG 376


>ref|XP_006575450.1| PREDICTED: uncharacterized protein LOC100814626 isoform X2 [Glycine
            max]
          Length = 783

 Score =  450 bits (1158), Expect = e-123
 Identities = 236/383 (61%), Positives = 277/383 (72%), Gaps = 13/383 (3%)
 Frame = -3

Query: 2700 MGDLRV-------------SPRRPNGAVWPLEVSSCVGADVAGDLRWTAVEEAAAEVLRK 2560
            MGDL V             SP  P    W  + SS V AD      W A E   AE+LR+
Sbjct: 1    MGDLHVNGVVFGEDRPCASSPPSPPLPPWNPDPSS-VAADA-----WAAAERNTAEILRR 54

Query: 2559 IQPTLSSERKRAHVISYVQRLLSCSLGCQVFPYGSVPLKTYLPDGDIDLTVLSKPSAEDA 2380
            I+PTL+++R+R  V+ YVQRL+     C+VFPYGSVPLKTYLPDGDIDLT LS  + ED 
Sbjct: 55   IRPTLAADRRRREVVDYVQRLIRYGARCEVFPYGSVPLKTYLPDGDIDLTALSCENIEDG 114

Query: 2379 LISDIFYVLQQEELNEHAEFEVQDVQLISAEVKLVKCLIQDIIVDISCNQLGGLCTLCFL 2200
            L+SD+  VL  EE+NE AE+EV+DV+ I AEVKLVKC++QDI+VDIS NQLGGL TLCFL
Sbjct: 115  LVSDVRAVLHGEEINEAAEYEVKDVRFIDAEVKLVKCIVQDIVVDISFNQLGGLSTLCFL 174

Query: 2199 EQIDRFVDKDHLFKRSIILIKAWCYYESRILGATHGLFSTYALEILVLYIFHIYHSSLRG 2020
            E++DR V KDHLFKRSIILIKAWCYYESR+LGA HGL STYALE LVLYIFH +H SL G
Sbjct: 175  EKVDRLVAKDHLFKRSIILIKAWCYYESRVLGAHHGLISTYALETLVLYIFHQFHVSLDG 234

Query: 2019 PLSVLYRFLDYFSNFDWENYCVSLHGLVRKSCLPDIVVHTPEVARYSRLLSKQFLRNSTS 1840
            PL+VLYRFLDYFS FDW+NYCVSL G V K+ LP+IV   PE    + LL+++F+R+   
Sbjct: 235  PLAVLYRFLDYFSKFDWDNYCVSLKGPVSKTSLPNIVAEVPENGG-NTLLTEEFIRSCVE 293

Query: 1839 MFKVPPRDHGNQSRPFQQKFINIIDPLKDDNNLGRSVNQGNFYRIRSALKFGARKLGQIL 1660
             F VP R      R F QK +NIIDPLK++NNLGRSVN+GNFYRIRSA K+GARKLG IL
Sbjct: 294  SFSVPSRGADLNLRAFPQKHLNIIDPLKENNNLGRSVNKGNFYRIRSAFKYGARKLGWIL 353

Query: 1659 SLPTESTRRELKNFFTNTLARHG 1591
             LP +    EL  FF NTL RHG
Sbjct: 354  RLPEDRIAEELIRFFANTLERHG 376


>ref|XP_003519288.1| PREDICTED: uncharacterized protein LOC100814626 isoform X1 [Glycine
            max]
          Length = 780

 Score =  450 bits (1158), Expect = e-123
 Identities = 236/383 (61%), Positives = 277/383 (72%), Gaps = 13/383 (3%)
 Frame = -3

Query: 2700 MGDLRV-------------SPRRPNGAVWPLEVSSCVGADVAGDLRWTAVEEAAAEVLRK 2560
            MGDL V             SP  P    W  + SS V AD      W A E   AE+LR+
Sbjct: 1    MGDLHVNGVVFGEDRPCASSPPSPPLPPWNPDPSS-VAADA-----WAAAERNTAEILRR 54

Query: 2559 IQPTLSSERKRAHVISYVQRLLSCSLGCQVFPYGSVPLKTYLPDGDIDLTVLSKPSAEDA 2380
            I+PTL+++R+R  V+ YVQRL+     C+VFPYGSVPLKTYLPDGDIDLT LS  + ED 
Sbjct: 55   IRPTLAADRRRREVVDYVQRLIRYGARCEVFPYGSVPLKTYLPDGDIDLTALSCENIEDG 114

Query: 2379 LISDIFYVLQQEELNEHAEFEVQDVQLISAEVKLVKCLIQDIIVDISCNQLGGLCTLCFL 2200
            L+SD+  VL  EE+NE AE+EV+DV+ I AEVKLVKC++QDI+VDIS NQLGGL TLCFL
Sbjct: 115  LVSDVRAVLHGEEINEAAEYEVKDVRFIDAEVKLVKCIVQDIVVDISFNQLGGLSTLCFL 174

Query: 2199 EQIDRFVDKDHLFKRSIILIKAWCYYESRILGATHGLFSTYALEILVLYIFHIYHSSLRG 2020
            E++DR V KDHLFKRSIILIKAWCYYESR+LGA HGL STYALE LVLYIFH +H SL G
Sbjct: 175  EKVDRLVAKDHLFKRSIILIKAWCYYESRVLGAHHGLISTYALETLVLYIFHQFHVSLDG 234

Query: 2019 PLSVLYRFLDYFSNFDWENYCVSLHGLVRKSCLPDIVVHTPEVARYSRLLSKQFLRNSTS 1840
            PL+VLYRFLDYFS FDW+NYCVSL G V K+ LP+IV   PE    + LL+++F+R+   
Sbjct: 235  PLAVLYRFLDYFSKFDWDNYCVSLKGPVSKTSLPNIVAEVPENGG-NTLLTEEFIRSCVE 293

Query: 1839 MFKVPPRDHGNQSRPFQQKFINIIDPLKDDNNLGRSVNQGNFYRIRSALKFGARKLGQIL 1660
             F VP R      R F QK +NIIDPLK++NNLGRSVN+GNFYRIRSA K+GARKLG IL
Sbjct: 294  SFSVPSRGADLNLRAFPQKHLNIIDPLKENNNLGRSVNKGNFYRIRSAFKYGARKLGWIL 353

Query: 1659 SLPTESTRRELKNFFTNTLARHG 1591
             LP +    EL  FF NTL RHG
Sbjct: 354  RLPEDRIAEELIRFFANTLERHG 376


>gb|EOY04484.1| NT domain of poly(A) polymerase and terminal uridylyl
            transferase-containing protein, putative [Theobroma
            cacao]
          Length = 890

 Score =  450 bits (1157), Expect = e-123
 Identities = 230/375 (61%), Positives = 276/375 (73%), Gaps = 5/375 (1%)
 Frame = -3

Query: 2700 MGDLRVSPRRPNGAVWPLEVSSCVG-----ADVAGDLRWTAVEEAAAEVLRKIQPTLSSE 2536
            MGDLR     PNG       SS        A +A +  W   EEA   ++ ++QPT+ SE
Sbjct: 4    MGDLRDWSPEPNGVASEERSSSSSSSSSNQAGIAAEY-WKKAEEATQGIIAQVQPTVVSE 62

Query: 2535 RKRAHVISYVQRLLSCSLGCQVFPYGSVPLKTYLPDGDIDLTVLSKPSAEDALISDIFYV 2356
             +R  VI YVQRL+   LGC VFP+GSVPLKTYLPDGDIDLT     + E+AL +D+  V
Sbjct: 63   ERRKAVIDYVQRLIGNYLGCGVFPFGSVPLKTYLPDGDIDLTAFGGLNFEEALANDVCSV 122

Query: 2355 LQQEELNEHAEFEVQDVQLISAEVKLVKCLIQDIIVDISCNQLGGLCTLCFLEQIDRFVD 2176
            L++E+ N  AEF V+DVQLI AEVKLVKCL+Q+I+VDIS NQLGGLCTLCFLE++DR + 
Sbjct: 123  LEREDHNRAAEFVVKDVQLIRAEVKLVKCLVQNIVVDISFNQLGGLCTLCFLEKVDRRIG 182

Query: 2175 KDHLFKRSIILIKAWCYYESRILGATHGLFSTYALEILVLYIFHIYHSSLRGPLSVLYRF 1996
            KDHLFKRSIILIKAWCYYESRILGA HGL STYALE LVLYIFH++HSSL GPL+VLY+F
Sbjct: 183  KDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHSSLDGPLAVLYKF 242

Query: 1995 LDYFSNFDWENYCVSLHGLVRKSCLPDIVVHTPEVARYSRLLSKQFLRNSTSMFKVPPRD 1816
            LDYFS FDW+NYC+SL+G +  S LP++VV TPE      LLS  FL+    MF VP R 
Sbjct: 243  LDYFSKFDWDNYCISLNGPIHISSLPEVVVETPENGGGDLLLSNDFLKECVEMFSVPSRG 302

Query: 1815 HGNQSRPFQQKFINIIDPLKDDNNLGRSVNQGNFYRIRSALKFGARKLGQILSLPTESTR 1636
                SR F QK +NI+DPL+++NNLGRSV++GNFYRIRSA  +GARKLG+ILS   ES  
Sbjct: 303  FETNSRTFPQKHLNIVDPLRENNNLGRSVSKGNFYRIRSAFTYGARKLGKILSQAEESMA 362

Query: 1635 RELKNFFTNTLARHG 1591
             EL+ FF+NTL RHG
Sbjct: 363  DELRKFFSNTLDRHG 377


>ref|XP_002518281.1| nucleic acid binding protein, putative [Ricinus communis]
            gi|223542501|gb|EEF44041.1| nucleic acid binding protein,
            putative [Ricinus communis]
          Length = 821

 Score =  447 bits (1151), Expect = e-122
 Identities = 215/337 (63%), Positives = 258/337 (76%)
 Frame = -3

Query: 2601 WTAVEEAAAEVLRKIQPTLSSERKRAHVISYVQRLLSCSLGCQVFPYGSVPLKTYLPDGD 2422
            W   E+A  +++ +I PT+ ++  R HV+ YVQ L+  SLG QVFPYGSVPLKTYLPDGD
Sbjct: 51   WERAEQATLQIVYRIHPTVEADCNRKHVVEYVQSLIQSSLGFQVFPYGSVPLKTYLPDGD 110

Query: 2421 IDLTVLSKPSAEDALISDIFYVLQQEELNEHAEFEVQDVQLISAEVKLVKCLIQDIIVDI 2242
            IDLT +  P+  DA +SD+  VL++EE N  A ++V+DV  I AEVKL+KC++ DI+VDI
Sbjct: 111  IDLTAIINPAGVDASVSDVHAVLRREEQNRDAPYKVKDVHFIDAEVKLIKCIVHDIVVDI 170

Query: 2241 SCNQLGGLCTLCFLEQIDRFVDKDHLFKRSIILIKAWCYYESRILGATHGLFSTYALEIL 2062
            S NQLGGL TLCFLEQ+D+ + K HLFKRSIILIKAWCYYESRILGA HGL STYALE L
Sbjct: 171  SFNQLGGLSTLCFLEQVDQLIGKSHLFKRSIILIKAWCYYESRILGAHHGLISTYALETL 230

Query: 2061 VLYIFHIYHSSLRGPLSVLYRFLDYFSNFDWENYCVSLHGLVRKSCLPDIVVHTPEVARY 1882
            +LYIFH++HSSL GPL VLYRFLDYFS FDW+NYC+SL+G V KS LP IV   PE  R 
Sbjct: 231  ILYIFHLFHSSLNGPLMVLYRFLDYFSKFDWDNYCISLNGPVCKSSLPKIVAEPPETGRG 290

Query: 1881 SRLLSKQFLRNSTSMFKVPPRDHGNQSRPFQQKFINIIDPLKDDNNLGRSVNQGNFYRIR 1702
            + LL  +FLRNS  M  VP R     SRPF QK +NI+DPL+++NNLGRSVN+GNFYRIR
Sbjct: 291  NLLLDDEFLRNSVKMLSVPSRSPEMNSRPFTQKHLNIVDPLRENNNLGRSVNRGNFYRIR 350

Query: 1701 SALKFGARKLGQILSLPTESTRRELKNFFTNTLARHG 1591
            SA K+GARKLG ILSL ++    EL  FF NTL RHG
Sbjct: 351  SAFKYGARKLGHILSLQSDRMINELDKFFANTLDRHG 387



 Score = 69.7 bits (169), Expect = 8e-09
 Identities = 82/333 (24%), Positives = 127/333 (38%), Gaps = 52/333 (15%)
 Frame = -2

Query: 1312 EPMENSRAATNSLPPCNHYRRTHSVSSGRKPYTARVS------SGHRSWSLIDHNEISGP 1151
            E  EN     NS   C+++    S+ S        +S      +  R ++ I  ++I   
Sbjct: 488  ESKENHFVINNSACSCSNHEGKTSLCSTIPSLVNNISENLAPTTAERDFASI--SQIPRS 545

Query: 1150 LDPFSDLTGDYESHLKSLLYGQS---FHGNATIQPMVYNLP------VWDMQCQT-NFGQ 1001
                 DLTGDY+SHLKS+ +GQ    F  +A + P     P       W+   Q+    +
Sbjct: 546  FKSLLDLTGDYDSHLKSVKFGQGCCFFAVSAPVLPCSPTAPHSKNKNPWETVRQSLQLKR 605

Query: 1000 DCYFQMNPNHVMWEQ--------PFP---------KVRGTGTYIPRTDLGSWKGGKLPVR 872
            + + Q+N N +   Q        PF          K RGTGTYIP     S +      R
Sbjct: 606  NVHSQINTNGIFGHQQHFLNHLVPFTTAFSSEEKRKQRGTGTYIPNMSYHSNRERPSSER 665

Query: 871  KGRKRTQGSPTFQRYNHDHKFGVGMITVQANVTDQIYHH----VDVPESKRSTAYPSSVY 704
            +    T  +    R   D+    G+   +  +    + H     + P        PS V 
Sbjct: 666  RKNHVTANNGDLHRRTRDN----GLAATRPGINSYQHGHELSEAEYPYLGNGKPVPSEVQ 721

Query: 703  ----------AASILSEQSEGCEFRSSENLAGEKDGPDERASDDSEPN-----PKAPAML 569
                      +A+  S  SE  +F   E    E    +   + DS  +     P +P + 
Sbjct: 722  LSQSFVWGPSSANGFSRPSERIDFGGQELQLQEASLQERVPTQDSSTSSTLVFPSSPEVT 781

Query: 568  IPDQKLADTEGTLERVAGKSYQLKDEEDFPPLA 470
              +++    +   ER A +SY LKDE DFPPL+
Sbjct: 782  AAERREPVLQNVQERAASESYHLKDEVDFPPLS 814


>gb|ESW14042.1| hypothetical protein PHAVU_008G248100g [Phaseolus vulgaris]
          Length = 803

 Score =  447 bits (1150), Expect = e-122
 Identities = 227/354 (64%), Positives = 268/354 (75%)
 Frame = -3

Query: 2652 PLEVSSCVGADVAGDLRWTAVEEAAAEVLRKIQPTLSSERKRAHVISYVQRLLSCSLGCQ 2473
            PL +S+   + V  D  W A E+   E+LR IQPTL+++R+R  V+ YVQRL+     C+
Sbjct: 25   PLPISNPDPSSVVADA-WAAAEQTTGEILRSIQPTLAADRRRREVVDYVQRLIRYGARCE 83

Query: 2472 VFPYGSVPLKTYLPDGDIDLTVLSKPSAEDALISDIFYVLQQEELNEHAEFEVQDVQLIS 2293
            VFPYGSVPLKTYLPDGDIDLT LS  + ED L+SD+  VL  EE NE AE+EV+DV+ I 
Sbjct: 84   VFPYGSVPLKTYLPDGDIDLTALSCQNIEDGLVSDVRAVLHGEENNEAAEYEVKDVRFID 143

Query: 2292 AEVKLVKCLIQDIIVDISCNQLGGLCTLCFLEQIDRFVDKDHLFKRSIILIKAWCYYESR 2113
            AEVKLVKC++QDI+VDIS NQLGGL TLCFLE++DR V KDHLFKRSIILIKAWCYYESR
Sbjct: 144  AEVKLVKCIVQDIVVDISFNQLGGLSTLCFLEKVDRLVAKDHLFKRSIILIKAWCYYESR 203

Query: 2112 ILGATHGLFSTYALEILVLYIFHIYHSSLRGPLSVLYRFLDYFSNFDWENYCVSLHGLVR 1933
            +LGA HGL STYALE LVLYIFH +H SL GPL+VLYRFLDYFS FDW+NYCVSL G V 
Sbjct: 204  VLGAHHGLISTYALETLVLYIFHQFHVSLDGPLAVLYRFLDYFSKFDWDNYCVSLKGPVS 263

Query: 1932 KSCLPDIVVHTPEVARYSRLLSKQFLRNSTSMFKVPPRDHGNQSRPFQQKFINIIDPLKD 1753
            KS LP+IV   PE    + LL+++F+R+    F VP R      R F QK +NIIDPLK+
Sbjct: 264  KSSLPNIVAEGPENGG-NTLLTEEFIRSCVESFSVPSRGPDLNLRVFPQKHLNIIDPLKE 322

Query: 1752 DNNLGRSVNQGNFYRIRSALKFGARKLGQILSLPTESTRRELKNFFTNTLARHG 1591
            +NNLGRSVN+GNF+RIRSA K+GARKLG IL LP +    EL  FF NTL RHG
Sbjct: 323  NNNLGRSVNKGNFFRIRSAFKYGARKLGWILMLPDDRIADELIRFFANTLERHG 376


>ref|XP_006596466.1| PREDICTED: uncharacterized protein LOC100816328 isoform X3 [Glycine
            max]
          Length = 780

 Score =  447 bits (1149), Expect = e-122
 Identities = 226/354 (63%), Positives = 270/354 (76%)
 Frame = -3

Query: 2652 PLEVSSCVGADVAGDLRWTAVEEAAAEVLRKIQPTLSSERKRAHVISYVQRLLSCSLGCQ 2473
            PL  S+   + VA D  W A E+  AE+L +I+PTL+++R+R  V+ YVQRL+     C+
Sbjct: 25   PLPPSNPDPSSVAADA-WAAAEKTTAEILSRIRPTLAADRRRREVVDYVQRLIRYGARCE 83

Query: 2472 VFPYGSVPLKTYLPDGDIDLTVLSKPSAEDALISDIFYVLQQEELNEHAEFEVQDVQLIS 2293
            VFPYGSVPLKTYLPDGDIDLT LS  + ED L+SD+  VL  EE+NE +E+EV+DV+ I 
Sbjct: 84   VFPYGSVPLKTYLPDGDIDLTALSCQNIEDGLVSDVRAVLHGEEINEASEYEVKDVRFID 143

Query: 2292 AEVKLVKCLIQDIIVDISCNQLGGLCTLCFLEQIDRFVDKDHLFKRSIILIKAWCYYESR 2113
            AEVKLVKC++QDI+VDIS NQLGGL TLCFLE++DR V KDHLFKRSIILIKAWCYYESR
Sbjct: 144  AEVKLVKCIVQDIVVDISFNQLGGLSTLCFLEKVDRLVAKDHLFKRSIILIKAWCYYESR 203

Query: 2112 ILGATHGLFSTYALEILVLYIFHIYHSSLRGPLSVLYRFLDYFSNFDWENYCVSLHGLVR 1933
            +LGA HGL STYALE LVLYIFH +H SL GPL+VLYRFLDYFS FDW+NYCVSL G V 
Sbjct: 204  VLGAHHGLISTYALETLVLYIFHQFHVSLDGPLAVLYRFLDYFSKFDWDNYCVSLKGPVG 263

Query: 1932 KSCLPDIVVHTPEVARYSRLLSKQFLRNSTSMFKVPPRDHGNQSRPFQQKFINIIDPLKD 1753
            KS  P+IV   PE    + LL+++F+R+    F +P R      R F QK +NIIDPLK+
Sbjct: 264  KSSPPNIVAEVPENGG-NTLLTEEFIRSCVESFSLPSRGADLNLRAFPQKHLNIIDPLKE 322

Query: 1752 DNNLGRSVNQGNFYRIRSALKFGARKLGQILSLPTESTRRELKNFFTNTLARHG 1591
            +NNLGRSVN+GNFYRIRSA K+GARKLG IL LP +    EL  FFTNTL RHG
Sbjct: 323  NNNLGRSVNKGNFYRIRSAFKYGARKLGWILMLPEDRITEELIRFFTNTLERHG 376


>ref|XP_006596465.1| PREDICTED: uncharacterized protein LOC100816328 isoform X2 [Glycine
            max]
          Length = 781

 Score =  447 bits (1149), Expect = e-122
 Identities = 226/354 (63%), Positives = 270/354 (76%)
 Frame = -3

Query: 2652 PLEVSSCVGADVAGDLRWTAVEEAAAEVLRKIQPTLSSERKRAHVISYVQRLLSCSLGCQ 2473
            PL  S+   + VA D  W A E+  AE+L +I+PTL+++R+R  V+ YVQRL+     C+
Sbjct: 25   PLPPSNPDPSSVAADA-WAAAEKTTAEILSRIRPTLAADRRRREVVDYVQRLIRYGARCE 83

Query: 2472 VFPYGSVPLKTYLPDGDIDLTVLSKPSAEDALISDIFYVLQQEELNEHAEFEVQDVQLIS 2293
            VFPYGSVPLKTYLPDGDIDLT LS  + ED L+SD+  VL  EE+NE +E+EV+DV+ I 
Sbjct: 84   VFPYGSVPLKTYLPDGDIDLTALSCQNIEDGLVSDVRAVLHGEEINEASEYEVKDVRFID 143

Query: 2292 AEVKLVKCLIQDIIVDISCNQLGGLCTLCFLEQIDRFVDKDHLFKRSIILIKAWCYYESR 2113
            AEVKLVKC++QDI+VDIS NQLGGL TLCFLE++DR V KDHLFKRSIILIKAWCYYESR
Sbjct: 144  AEVKLVKCIVQDIVVDISFNQLGGLSTLCFLEKVDRLVAKDHLFKRSIILIKAWCYYESR 203

Query: 2112 ILGATHGLFSTYALEILVLYIFHIYHSSLRGPLSVLYRFLDYFSNFDWENYCVSLHGLVR 1933
            +LGA HGL STYALE LVLYIFH +H SL GPL+VLYRFLDYFS FDW+NYCVSL G V 
Sbjct: 204  VLGAHHGLISTYALETLVLYIFHQFHVSLDGPLAVLYRFLDYFSKFDWDNYCVSLKGPVG 263

Query: 1932 KSCLPDIVVHTPEVARYSRLLSKQFLRNSTSMFKVPPRDHGNQSRPFQQKFINIIDPLKD 1753
            KS  P+IV   PE    + LL+++F+R+    F +P R      R F QK +NIIDPLK+
Sbjct: 264  KSSPPNIVAEVPENGG-NTLLTEEFIRSCVESFSLPSRGADLNLRAFPQKHLNIIDPLKE 322

Query: 1752 DNNLGRSVNQGNFYRIRSALKFGARKLGQILSLPTESTRRELKNFFTNTLARHG 1591
            +NNLGRSVN+GNFYRIRSA K+GARKLG IL LP +    EL  FFTNTL RHG
Sbjct: 323  NNNLGRSVNKGNFYRIRSAFKYGARKLGWILMLPEDRITEELIRFFTNTLERHG 376


>ref|XP_003544929.1| PREDICTED: uncharacterized protein LOC100816328 isoform X1 [Glycine
            max]
          Length = 779

 Score =  447 bits (1149), Expect = e-122
 Identities = 226/354 (63%), Positives = 270/354 (76%)
 Frame = -3

Query: 2652 PLEVSSCVGADVAGDLRWTAVEEAAAEVLRKIQPTLSSERKRAHVISYVQRLLSCSLGCQ 2473
            PL  S+   + VA D  W A E+  AE+L +I+PTL+++R+R  V+ YVQRL+     C+
Sbjct: 25   PLPPSNPDPSSVAADA-WAAAEKTTAEILSRIRPTLAADRRRREVVDYVQRLIRYGARCE 83

Query: 2472 VFPYGSVPLKTYLPDGDIDLTVLSKPSAEDALISDIFYVLQQEELNEHAEFEVQDVQLIS 2293
            VFPYGSVPLKTYLPDGDIDLT LS  + ED L+SD+  VL  EE+NE +E+EV+DV+ I 
Sbjct: 84   VFPYGSVPLKTYLPDGDIDLTALSCQNIEDGLVSDVRAVLHGEEINEASEYEVKDVRFID 143

Query: 2292 AEVKLVKCLIQDIIVDISCNQLGGLCTLCFLEQIDRFVDKDHLFKRSIILIKAWCYYESR 2113
            AEVKLVKC++QDI+VDIS NQLGGL TLCFLE++DR V KDHLFKRSIILIKAWCYYESR
Sbjct: 144  AEVKLVKCIVQDIVVDISFNQLGGLSTLCFLEKVDRLVAKDHLFKRSIILIKAWCYYESR 203

Query: 2112 ILGATHGLFSTYALEILVLYIFHIYHSSLRGPLSVLYRFLDYFSNFDWENYCVSLHGLVR 1933
            +LGA HGL STYALE LVLYIFH +H SL GPL+VLYRFLDYFS FDW+NYCVSL G V 
Sbjct: 204  VLGAHHGLISTYALETLVLYIFHQFHVSLDGPLAVLYRFLDYFSKFDWDNYCVSLKGPVG 263

Query: 1932 KSCLPDIVVHTPEVARYSRLLSKQFLRNSTSMFKVPPRDHGNQSRPFQQKFINIIDPLKD 1753
            KS  P+IV   PE    + LL+++F+R+    F +P R      R F QK +NIIDPLK+
Sbjct: 264  KSSPPNIVAEVPENGG-NTLLTEEFIRSCVESFSLPSRGADLNLRAFPQKHLNIIDPLKE 322

Query: 1752 DNNLGRSVNQGNFYRIRSALKFGARKLGQILSLPTESTRRELKNFFTNTLARHG 1591
            +NNLGRSVN+GNFYRIRSA K+GARKLG IL LP +    EL  FFTNTL RHG
Sbjct: 323  NNNLGRSVNKGNFYRIRSAFKYGARKLGWILMLPEDRITEELIRFFTNTLERHG 376


>ref|XP_006350879.1| PREDICTED: uncharacterized protein LOC102602843 [Solanum tuberosum]
          Length = 844

 Score =  446 bits (1148), Expect = e-122
 Identities = 211/336 (62%), Positives = 260/336 (77%)
 Frame = -3

Query: 2601 WTAVEEAAAEVLRKIQPTLSSERKRAHVISYVQRLLSCSLGCQVFPYGSVPLKTYLPDGD 2422
            W   EEA  EV+  + PTL +E KR  V+ YVQRL+ C+LGC+VF YGSVPLKTYLPDGD
Sbjct: 32   WAVAEEAVQEVVNCVHPTLDTEEKRKDVVDYVQRLIRCTLGCEVFSYGSVPLKTYLPDGD 91

Query: 2421 IDLTVLSKPSAEDALISDIFYVLQQEELNEHAEFEVQDVQLISAEVKLVKCLIQDIIVDI 2242
            IDLTV   P  E+ L  D+  VLQ+EEL E+ E++V+D Q I AEVKLVKC++++ ++DI
Sbjct: 92   IDLTVFGSPVIEETLARDVLAVLQEEELKENTEYDVKDPQFIDAEVKLVKCIVRNTVIDI 151

Query: 2241 SCNQLGGLCTLCFLEQIDRFVDKDHLFKRSIILIKAWCYYESRILGATHGLFSTYALEIL 2062
            S NQLGGL TLCFLEQ+DR V K+HLFKRSIILIKAWCYYESR+LGA HGL STYALE L
Sbjct: 152  SFNQLGGLSTLCFLEQVDRLVGKNHLFKRSIILIKAWCYYESRVLGAHHGLISTYALETL 211

Query: 2061 VLYIFHIYHSSLRGPLSVLYRFLDYFSNFDWENYCVSLHGLVRKSCLPDIVVHTPEVARY 1882
            VL+IF ++HSSL GPL+VLYRFLDY+S FDW+ YC+SL+G V KS LP++ V  P+    
Sbjct: 212  VLFIFQLFHSSLNGPLAVLYRFLDYYSKFDWDKYCISLNGPVCKSSLPELFVEMPDYISN 271

Query: 1881 SRLLSKQFLRNSTSMFKVPPRDHGNQSRPFQQKFINIIDPLKDDNNLGRSVNQGNFYRIR 1702
              LLS++FLRNS  MF VP R   + +RPFQQK++NIIDPLK++NNLGRSV++GN YRI+
Sbjct: 272  ELLLSEEFLRNSAEMFSVPSRGLESDTRPFQQKYLNIIDPLKENNNLGRSVSKGNLYRIQ 331

Query: 1701 SALKFGARKLGQILSLPTESTRRELKNFFTNTLARH 1594
             A K+GARKLG IL  P +    E+K FF NT+ RH
Sbjct: 332  RAFKYGARKLGDILLSPDDKVADEIKKFFANTIERH 367


>ref|XP_004490712.1| PREDICTED: uncharacterized protein LOC101490873 [Cicer arietinum]
          Length = 811

 Score =  446 bits (1146), Expect = e-122
 Identities = 222/337 (65%), Positives = 262/337 (77%)
 Frame = -3

Query: 2601 WTAVEEAAAEVLRKIQPTLSSERKRAHVISYVQRLLSCSLGCQVFPYGSVPLKTYLPDGD 2422
            W A EE  A++LR+IQPTL+++R+R  V+ YVQRL+     C+VFPYGSVPLKTYLPDGD
Sbjct: 41   WFAAEETTADILRRIQPTLAADRRRREVVDYVQRLIRFGARCEVFPYGSVPLKTYLPDGD 100

Query: 2421 IDLTVLSKPSAEDALISDIFYVLQQEELNEHAEFEVQDVQLISAEVKLVKCLIQDIIVDI 2242
            IDLT LS  + ED L+S++  VL+ EE NE AE+EV+DV+ I AEVKLVKCL+Q+I+VDI
Sbjct: 101  IDLTALSCQNIEDGLVSEVHAVLRGEENNEAAEYEVKDVRFIDAEVKLVKCLVQNIVVDI 160

Query: 2241 SCNQLGGLCTLCFLEQIDRFVDKDHLFKRSIILIKAWCYYESRILGATHGLFSTYALEIL 2062
            S NQLGGL TLCFLE++DR V KDH+FKRSIILIKAWCYYESRILGA HGL STYALE L
Sbjct: 161  SFNQLGGLSTLCFLEKVDRLVAKDHIFKRSIILIKAWCYYESRILGAHHGLISTYALETL 220

Query: 2061 VLYIFHIYHSSLRGPLSVLYRFLDYFSNFDWENYCVSLHGLVRKSCLPDIVVHTPEVARY 1882
            VLYIFH +H SL GPL+VLYRFLDYFS FDW+NYCVSL G V KS + D+V   PE    
Sbjct: 221  VLYIFHRFHVSLDGPLAVLYRFLDYFSKFDWDNYCVSLKGPVGKSSVSDVVAEAPENGG- 279

Query: 1881 SRLLSKQFLRNSTSMFKVPPRDHGNQSRPFQQKFINIIDPLKDDNNLGRSVNQGNFYRIR 1702
            + LL+ +F+R+    F VPPR      R F QK +NIIDPLK++NNLGRSVN+GNFYRIR
Sbjct: 280  NTLLTDEFIRSCVESFSVPPRGLELNLRSFPQKHLNIIDPLKENNNLGRSVNKGNFYRIR 339

Query: 1701 SALKFGARKLGQILSLPTESTRRELKNFFTNTLARHG 1591
            SA K+GARKLG IL LP +    EL  FF NTL RHG
Sbjct: 340  SAFKYGARKLGWILMLPEDRIADELNRFFANTLDRHG 376


>gb|EMJ09368.1| hypothetical protein PRUPE_ppa001915mg [Prunus persica]
          Length = 742

 Score =  445 bits (1144), Expect = e-122
 Identities = 221/351 (62%), Positives = 266/351 (75%)
 Frame = -3

Query: 2640 SSCVGADVAGDLRWTAVEEAAAEVLRKIQPTLSSERKRAHVISYVQRLLSCSLGCQVFPY 2461
            S+   A ++ +  W   EEA   V+ ++QPT  SER+R  VI YVQRL+   LGC+VFP+
Sbjct: 41   SAAAAAGISAEY-WKKAEEATQGVIAQVQPTDVSERRRKAVIDYVQRLIRGCLGCEVFPF 99

Query: 2460 GSVPLKTYLPDGDIDLTVLSKPSAEDALISDIFYVLQQEELNEHAEFEVQDVQLISAEVK 2281
            GSVPLKTYLPDGDIDLT     + E+AL +D+  VL++E  N  AEF V+DVQLI AEVK
Sbjct: 100  GSVPLKTYLPDGDIDLTAFGGINVEEALANDVCSVLEREVQNGTAEFMVKDVQLIRAEVK 159

Query: 2280 LVKCLIQDIIVDISCNQLGGLCTLCFLEQIDRFVDKDHLFKRSIILIKAWCYYESRILGA 2101
            LVKCL+Q+I+VDIS NQLGGLCTLCFLEQ+DR + KDHLFKRSIILIKAWCYYESRILGA
Sbjct: 160  LVKCLVQNIVVDISFNQLGGLCTLCFLEQVDRLIGKDHLFKRSIILIKAWCYYESRILGA 219

Query: 2100 THGLFSTYALEILVLYIFHIYHSSLRGPLSVLYRFLDYFSNFDWENYCVSLHGLVRKSCL 1921
             HGL STYALE LVLYIFH++H+SL GPL+VLY+FLDYFS FDW+NYC+SL G VR S L
Sbjct: 220  HHGLISTYALETLVLYIFHLFHASLNGPLAVLYKFLDYFSKFDWDNYCISLSGPVRISSL 279

Query: 1920 PDIVVHTPEVARYSRLLSKQFLRNSTSMFKVPPRDHGNQSRPFQQKFINIIDPLKDDNNL 1741
            P+++V TPE      LLS  FL+    MF VP R +    R F  K  NI+DPLKD+NNL
Sbjct: 280  PELLVETPENGGNDLLLSNDFLKECVQMFSVPSRGYETNYRTFPPKHFNIVDPLKDNNNL 339

Query: 1740 GRSVNQGNFYRIRSALKFGARKLGQILSLPTESTRRELKNFFTNTLARHGG 1588
            GRSV++GNFYRIRSA  +GARKLG+ILS   ++   E++ FF NTL RHGG
Sbjct: 340  GRSVSKGNFYRIRSAFTYGARKLGRILSQTEDNIDDEIRKFFANTLDRHGG 390


>ref|XP_004142733.1| PREDICTED: uncharacterized protein LOC101207419 [Cucumis sativus]
          Length = 898

 Score =  445 bits (1144), Expect = e-122
 Identities = 230/381 (60%), Positives = 277/381 (72%), Gaps = 10/381 (2%)
 Frame = -3

Query: 2700 MGDLRVSPRRPNGAVWPLEVSSCVGADVAGDLR----------WTAVEEAAAEVLRKIQP 2551
            MGDLR      NGAV   + SS   +  +  L           W   EEA   ++ ++QP
Sbjct: 1    MGDLRSWSLEQNGAVAEDKPSSSSFSSFSSLLPSNPTPIGVDYWRRAEEATQAIISQVQP 60

Query: 2550 TLSSERKRAHVISYVQRLLSCSLGCQVFPYGSVPLKTYLPDGDIDLTVLSKPSAEDALIS 2371
            T+ SER+R  VI YVQRL+   L C+VFP+GSVPLKTYLPDGDIDLT L   + E+AL S
Sbjct: 61   TVVSERRRKAVIDYVQRLIRGRLRCEVFPFGSVPLKTYLPDGDIDLTALGGSNVEEALAS 120

Query: 2370 DIFYVLQQEELNEHAEFEVQDVQLISAEVKLVKCLIQDIIVDISCNQLGGLCTLCFLEQI 2191
            D+  VL  E+ N  AEF V+DVQLI AEVKLVKCL+Q+I+VDIS NQLGGLCTLCFLE+I
Sbjct: 121  DVCSVLNSEDQNGAAEFVVKDVQLIRAEVKLVKCLVQNIVVDISFNQLGGLCTLCFLEKI 180

Query: 2190 DRFVDKDHLFKRSIILIKAWCYYESRILGATHGLFSTYALEILVLYIFHIYHSSLRGPLS 2011
            DR + KDHLFKRSIILIKAWCYYESRILGA HGL STYALE LVLYIFH++HS+L GPL 
Sbjct: 181  DRRIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHSALNGPLQ 240

Query: 2010 VLYRFLDYFSNFDWENYCVSLHGLVRKSCLPDIVVHTPEVARYSRLLSKQFLRNSTSMFK 1831
            VLY+FLDYFS FDW+NYC+SL+G VR S LP++V  TP+      LLS  FL++    F 
Sbjct: 241  VLYKFLDYFSKFDWDNYCISLNGPVRISSLPELVAETPDNGGGDLLLSTDFLQSCLETFS 300

Query: 1830 VPPRDHGNQSRPFQQKFINIIDPLKDDNNLGRSVNQGNFYRIRSALKFGARKLGQILSLP 1651
            VP R +   SR F  K +NI+DPLK++NNLGRSV++GNFYRIRSA  +GARKLG ILS P
Sbjct: 301  VPARGYEANSRAFPIKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFSYGARKLGFILSHP 360

Query: 1650 TESTRRELKNFFTNTLARHGG 1588
             ++   E++ FF+NTL RHGG
Sbjct: 361  EDNVVDEVRKFFSNTLDRHGG 381



 Score = 67.8 bits (164), Expect = 3e-08
 Identities = 76/285 (26%), Positives = 115/285 (40%), Gaps = 55/285 (19%)
 Frame = -2

Query: 1156 GPLDPF---SDLTGDYESHLKSLLYGQSFH----GNATIQPMVYNLPV-------WDM-Q 1022
            GP + F   SDL GDYESH  SL  G+ ++      A + P+   LP        WD+ +
Sbjct: 626  GPPEAFNALSDLNGDYESHCNSLQIGRWYYEYALSAAALSPIPPPLPSQYPNKNPWDIIR 685

Query: 1021 CQTNFGQDCYFQMNPNHVMWEQPF-------------------PKVRGTGTYIP-----R 914
                  Q+ + Q+N N ++    F                   PK RGTGTY P     R
Sbjct: 686  RSVQVKQNAFAQINSNGLLARPAFYPMPSPILPGGATLAMEEMPKPRGTGTYFPNMNHYR 745

Query: 913  TDLGSWKGG-----KLPVRKGRKRT---------QGSPTFQRYNHDHKFGVGMITVQANV 776
                S +G      + P   GR  T          G   +Q    +H  G+GM++  ++ 
Sbjct: 746  DRPASARGRNQVSVRSPRNNGRSLTPLETTVAEKSGQDLYQVPTVNHGGGIGMLSSSSSP 805

Query: 775  TDQIYHHVD--VPESKRSTAYPSSVYAASILSEQSEGCEFRSSENLAGEKDGPDERASDD 602
              + +H+ +  +P   R+  + S  +               SS + +GE         + 
Sbjct: 806  VRKAHHNGNGAMPRPDRAVEFGSFGHLP-----------IESSVDCSGEPTPATAHFQNS 854

Query: 601  SEPNPKAPAMLIPDQKLADTEGTLERVAGKSYQLKDEEDFPPLAS 467
            S  N  +P M    Q L   +  L  V  +SY+LKDEEDFPPL++
Sbjct: 855  SALNVSSPKMQKAKQTLITDQDRLS-VHMQSYELKDEEDFPPLSN 898

