BLASTX nr result

ID: Rheum21_contig00028728

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rheum21_contig00028728
         (752 letters)

Database: nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004253224.1| PREDICTED: uncharacterized protein LOC101268...   185   1e-44
ref|XP_002331075.1| predicted protein [Populus trichocarpa]           184   4e-44
ref|XP_006582615.1| PREDICTED: uncharacterized protein LOC102665...   181   2e-43
ref|XP_006584238.1| PREDICTED: uncharacterized protein LOC102664...   179   9e-43
ref|XP_006605183.1| PREDICTED: uncharacterized protein LOC102663...   176   7e-42
ref|XP_004293181.1| PREDICTED: uncharacterized protein LOC101298...   174   2e-41
gb|AAC95175.1| putative non-LTR retroelement reverse transcripta...   172   1e-40
gb|AAF98181.1|AC000107_4 F17F8.5 [Arabidopsis thaliana]               172   1e-40
emb|CAB72467.1| putative protein [Arabidopsis thaliana]               171   3e-40
ref|XP_006590026.1| PREDICTED: uncharacterized protein LOC102660...   170   5e-40
dbj|BAB08270.1| non-LTR retroelement reverse transcriptase-like ...   169   7e-40
ref|XP_006579104.1| PREDICTED: uncharacterized protein LOC102661...   169   1e-39
dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like ...   167   3e-39
gb|AAG50886.1|AC025294_24 hypothetical protein [Arabidopsis thal...   167   3e-39
gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]       165   1e-38
gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transc...   165   2e-38
emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulga...   164   2e-38
gb|AAF87143.1|AC002423_8 T23E23.16 [Arabidopsis thaliana]             164   4e-38
gb|AAG51098.1|AC025295_6 hypothetical protein [Arabidopsis thali...   162   9e-38
ref|XP_006584200.1| PREDICTED: putative ribonuclease H protein A...   162   1e-37

>ref|XP_004253224.1| PREDICTED: uncharacterized protein LOC101268376 [Solanum
            lycopersicum]
          Length = 717

 Score =  185 bits (470), Expect = 1e-44
 Identities = 87/222 (39%), Positives = 144/222 (64%), Gaps = 1/222 (0%)
 Frame = -2

Query: 733  DDLFIFIKGDIPSIQSIRETLNMFCNTSGLMININKSRIFCAGLDSSTKEQILDILNMEL 554
            DDL +F +GD+ SI+++++    F   SGL  N+NKS I+C G+    ++QI+  L   +
Sbjct: 498  DDLLLFSRGDLNSIKALQKCFTEFSQASGLQANLNKSSIYCGGVQMEVRQQIIQQLGYTI 557

Query: 553  GDLPVTYLGVPLCSCRATKLEYQPLISKITSKVNAWASRKLSYAGRLELVKGVLYGVIHY 374
             +LP  YLGVPL S +   +++ PLI K+ +++N+W ++KLSYAGR +LVK VL+GV   
Sbjct: 558  EELPFKYLGVPLSSKKLNTIQWYPLIEKVMARINSWTAKKLSYAGRAQLVKTVLFGVQAL 617

Query: 373  WCKVFLMPKKVVKVMVASMRNFLWSGSNDV-RRAPIAWKAVCKPRDEGGLGIKEVLSWN* 197
            W ++F++P K++K++    R++LWSG   V ++A IAW  VC P+ EGGLG+  +  WN 
Sbjct: 618  WAQLFIIPAKIIKLIEGLCRSYLWSGVGYVTKKALIAWDKVCSPKYEGGLGLINLKIWNR 677

Query: 196  TSLCELIYDLAQDTNSIWHTCGHANLLRGNIIWTVQRKSTSS 71
            +++ +L +DLA   + +W    HA  ++G   W  ++ +T+S
Sbjct: 678  SAVTKLCWDLANKEDKLWIKWIHAYYIKGQREW--KKSNTAS 717


>ref|XP_002331075.1| predicted protein [Populus trichocarpa]
          Length = 517

 Score =  184 bits (466), Expect = 4e-44
 Identities = 99/226 (43%), Positives = 140/226 (61%), Gaps = 3/226 (1%)
 Frame = -2

Query: 685 IRETLNMFCNTSGLMININKSRIFCAGLDSSTKEQILDILNMELGDLPVTYLGVPLCSCR 506
           IR  L  F + SGL  N NKS IF +G+ ++ +EQI+ IL    G+LP+ YLGVPL S R
Sbjct: 8   IRTVLTKFQDLSGLYPNPNKSDIFLSGVLNAEREQIIHILGFREGELPMKYLGVPLLSSR 67

Query: 505 ATKLEYQPLISKITSKVNAWASRKLSYAGRLELVKGVLYGVIHYWCKVFLMPKKVVKVMV 326
              +  + L+ +ITSKV  W  R LSYAGR++L+  VL+ +  YW  +FL+P +V+K + 
Sbjct: 68  LKAIYCKGLVDRITSKVRHWTCRTLSYAGRVQLINSVLFSIQVYWASLFLLPGQVIKNVE 127

Query: 325 ASMRNFLWSGSNDVRR--APIAWKAVCKPRDEGGLGIKEVLSWN*TSLCELIYDLAQDTN 152
             M++FLWSGS D+R   A +AW  VC P+ EGGLGIK +  WN  +L + I++L  D++
Sbjct: 128 QIMKSFLWSGS-DMRTTGAKVAWDQVCLPKKEGGLGIKSIKEWNKIALLKHIWNLCNDSD 186

Query: 151 -SIWHTCGHANLLRGNIIWTVQRKSTSSAIWQTILHIRDEIQSRME 17
            SIW T   +NLLRG   WT++     S  W  IL +R     +M+
Sbjct: 187 GSIWSTWIRSNLLRGRNFWTIKTPQNCSWAWGKILKLRSLAWPKMK 232


>ref|XP_006582615.1| PREDICTED: uncharacterized protein LOC102665746 [Glycine max]
          Length = 506

 Score =  181 bits (459), Expect = 2e-43
 Identities = 87/236 (36%), Positives = 142/236 (60%), Gaps = 1/236 (0%)
 Frame = -2

Query: 733 DDLFIFIKGDIPSIQSIRETLNMFCNTSGLMININKSRIFCAGLDSSTKEQILDILNMEL 554
           DDL +F +GD  S+  +      F   +GL++N  K  + CAG+D+ TK +IL++   + 
Sbjct: 76  DDLLLFSRGDKISVGMMMRAYESFSKATGLLVNPQKCSLLCAGIDAVTKREILEVSGFQE 135

Query: 553 GDLPVTYLGVPLCSCRATKLEYQPLISKITSKVNAWASRKLSYAGRLELVKGVLYGVIHY 374
           G LP  YLGVP+ S + + + Y PLI KI  K+  W +R LSYAGRL+LV  V++ + +Y
Sbjct: 136 GQLPFKYLGVPVTSKKLSTIHYSPLIDKIVGKIKHWTARLLSYAGRLQLVNSVMFALTNY 195

Query: 373 WCKVFLMPKKVVKVMVASMRNFLWSGS-NDVRRAPIAWKAVCKPRDEGGLGIKEVLSWN* 197
           W   F  PK V++ + A  R FLW+G     R++P+AWK +C PR  GGL I ++  WN 
Sbjct: 196 WLNCFPFPKSVLQKIEAICRIFLWTGGFEGSRKSPVAWKQICSPRSCGGLNIIDIDIWNK 255

Query: 196 TSLCELIYDLAQDTNSIWHTCGHANLLRGNIIWTVQRKSTSSAIWQTILHIRDEIQ 29
            +L +L+++L+   +S+W     A  ++ + +  ++ K+T S I + IL  R++++
Sbjct: 256 ANLMKLLWNLSSKEDSLWVKWIQAYYVKRSELMHIEMKNTDSWIMKAILKQREDLE 311


>ref|XP_006584238.1| PREDICTED: uncharacterized protein LOC102664824 [Glycine max]
          Length = 939

 Score =  179 bits (454), Expect = 9e-43
 Identities = 87/244 (35%), Positives = 150/244 (61%), Gaps = 4/244 (1%)
 Frame = -2

Query: 733  DDLFIFIKGDIPSIQSIRETLNMFCNTSGLMININKSRIFCAGLDSSTKEQILDILNMEL 554
            DDL +F +GDI S+Q + +  N F  + GL +N +K  I+C  +D + KEQ+L I   + 
Sbjct: 518  DDLLLFSRGDIGSVQIMLDKFNTFLRSMGLHVNPSKCNIYCGSVDINVKEQLLLISGFKE 577

Query: 553  GDLPVTYLGVPLCSCRATKLEYQPLISKITSKVNAWASRKLSYAGRLELVKGVLYGVIHY 374
            G +P  YLG+PL S +     YQ LI KI  ++  W++  LSYAGR++L++ V++  I++
Sbjct: 578  GKMPFRYLGIPLSSKKLNIKHYQVLIDKIVGRITHWSAGLLSYAGRVQLIQSVIFATINF 637

Query: 373  WCKVFLMPKKVVKVMVASMRNFLWSGSNDV-RRAPIAWKAVCKPRDEGGLGIKEVLSWN* 197
            W +   +PK V+  + A  R+FLW G++++ R++PIAW+ VC P+  GGL I  +  WN 
Sbjct: 638  WMQCLPLPKFVIMRINAICRSFLWIGNSNISRKSPIAWEKVCSPKINGGLNIINLAIWNK 697

Query: 196  TSLCELIYDLAQDTNSIWHTCGHANLLRGNIIWTVQRKSTSSAIWQTILHIRD---EIQS 26
             S+ +L++++   ++++W    H   +RG  IW++  K + S I  +++ +R    + QS
Sbjct: 698  ISILKLLWNVCNKSDNLWIKWLHTYYIRGQSIWSMVLKKSHSWIMSSMMKLRPLLLQYQS 757

Query: 25   RMEN 14
            RM++
Sbjct: 758  RMQD 761


>ref|XP_006605183.1| PREDICTED: uncharacterized protein LOC102663533 [Glycine max]
          Length = 514

 Score =  176 bits (446), Expect = 7e-42
 Identities = 87/240 (36%), Positives = 145/240 (60%), Gaps = 1/240 (0%)
 Frame = -2

Query: 742 STPDDLFIFIKGDIPSIQSIRETLNMFCNTSGLMININKSRIFCAGLDSSTKEQILDILN 563
           S  DD+F+  +GD  SI+ I +  + F  ++GL IN  K ++FC GL+  + + I  I  
Sbjct: 176 SFADDVFLLCRGDKKSIKMIIKAFSFFSKSTGLQINPAKCKVFCGGLNCDSIQVITKITG 235

Query: 562 MELGDLPVTYLGVPLCSCRATKLEYQPLISKITSKVNAWASRKLSYAGRLELVKGVLYGV 383
            E G LPV YLGVPL   +     Y PL+ KI  K+  W+S+ LS AGR++LV+ ++  +
Sbjct: 236 FEEGTLPVRYLGVPLSCKKLNVHHYLPLVEKIVGKIRHWSSKLLSIAGRIQLVRSIITAI 295

Query: 382 IHYWCKVFLMPKKVVKVMVASMRNFLWSGSNDV-RRAPIAWKAVCKPRDEGGLGIKEVLS 206
             YW  VF MPKKV++ + +  R+F+WSGS +V R++ +AWK VCKP   GGL +  +  
Sbjct: 296 AQYWMSVFPMPKKVIQKIDSICRSFIWSGSAEVKRKSLVAWKQVCKPARCGGLNLINLEL 355

Query: 205 WN*TSLCELIYDLAQDTNSIWHTCGHANLLRGNIIWTVQRKSTSSAIWQTILHIRDEIQS 26
           WN T++ + ++++    +++W    HA  L+G+ + +   KS S+ I ++++  R ++ +
Sbjct: 356 WNVTAMLKCLWNICSKEDNLWVKWIHAYFLKGDNVMSATIKSNSTWILKSVMKQRPQVNN 415


>ref|XP_004293181.1| PREDICTED: uncharacterized protein LOC101298394 [Fragaria vesca
            subsp. vesca]
          Length = 958

 Score =  174 bits (442), Expect = 2e-41
 Identities = 82/233 (35%), Positives = 139/233 (59%), Gaps = 1/233 (0%)
 Frame = -2

Query: 733  DDLFIFIKGDIPSIQSIRETLNMFCNTSGLMININKSRIFCAGLDSSTKEQILDILNMEL 554
            DDL +F  GD  S++++ +  + F + S L  N+++S+IF AG+D ++ + +L + N  L
Sbjct: 527  DDLLMFCNGDENSVRTLHDAFSNFESLSSLKANVSESKIFLAGVDGNSSDSVLQVTNFSL 586

Query: 553  GDLPVTYLGVPLCSCRATKLEYQPLISKITSKVNAWASRKLSYAGRLELVKGVLYGVIHY 374
            G  PV YLG+PL + +    +  PL+ +I +++ +W ++ LS+AGRL+L++ VL  +  Y
Sbjct: 587  GTCPVRYLGIPLITSKLRMQDCSPLLDRIETRIKSWENKVLSFAGRLQLIQSVLSSIQVY 646

Query: 373  WCKVFLMPKKVVKVMVASMRNFLWSGSNDVRRA-PIAWKAVCKPRDEGGLGIKEVLSWN* 197
            W    ++PKKV+K +   +R FLW+G+   R A  +AW  +C P+ EGGLGIK++  WN 
Sbjct: 647  WASHLILPKKVLKDIEKRLRCFLWAGNCSGRAATKVAWSEICLPKCEGGLGIKDLHCWNK 706

Query: 196  TSLCELIYDLAQDTNSIWHTCGHANLLRGNIIWTVQRKSTSSAIWQTILHIRD 38
              +   I++L   +++ W       LL+GN  W     S  S  W+ +L IR+
Sbjct: 707  ALMISHIWNLVSSSSNFWTDWVKVYLLKGNSFWNAPLPSICSWNWRKLLKIRE 759


>gb|AAC95175.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1352

 Score =  172 bits (436), Expect = 1e-40
 Identities = 94/246 (38%), Positives = 143/246 (58%), Gaps = 4/246 (1%)
 Frame = -2

Query: 733  DDLFIFIKGDIPSIQSIRETLNMFCNTSGLMININKSRIFCAGLDSSTKEQILDILNMEL 554
            DD+ +F  G   SIQ        F   S L I++ KS IF AG+  + K  IL     EL
Sbjct: 852  DDIMVFSDGTSKSIQGTLAIFEKFAAMSWLKISLEKSTIFMAGISPNAKTSILQQFPFEL 911

Query: 553  GDLPVTYLGVPLCSCRATKLEYQPLISKITSKVNAWASRKLSYAGRLELVKGVLYGVIHY 374
            G LPV YLG+PL + R T+ +Y PL+ KI +++ +W +R LS+AGRL+L+K VL  + ++
Sbjct: 912  GTLPVKYLGLPLLTKRMTQSDYLPLVEKIRARITSWTNRFLSFAGRLQLIKSVLSSITNF 971

Query: 373  WCKVFLMPKKVVKVMVASMRNFLWSGSN-DVRRAPIAWKAVCKPRDEGGLGIKEVLSWN* 197
            W  VF +PK  ++ +      FLWSG + + ++A IAW  VCK ++EGGLG+K +   N 
Sbjct: 972  WLSVFRLPKACLQEIEKMFSAFLWSGPDLNTKKAKIAWSEVCKLKEEGGLGLKPLKEANE 1031

Query: 196  TSLCELIYDLAQDTNSIWHTCGHANLLRGNIIWTVQRKS-TSSAIWQTILHIRDEIQ--S 26
             SL +LI+ +    +S+W    + +L+R    W+V+  +   S +W+ IL  RD+ +   
Sbjct: 1032 VSLLKLIWRILSARDSLWVKWVNKHLIRKETFWSVKENTGLGSWLWRKILKQRDKARLFH 1091

Query: 25   RMENNS 8
            RME  S
Sbjct: 1092 RMEVRS 1097


>gb|AAF98181.1|AC000107_4 F17F8.5 [Arabidopsis thaliana]
          Length = 872

 Score =  172 bits (435), Expect = 1e-40
 Identities = 88/241 (36%), Positives = 138/241 (57%), Gaps = 2/241 (0%)
 Frame = -2

Query: 742  STPDDLFIFIKGDIPSIQSIRETLNMFCNTSGLMININKSRIFCAGLDSSTKEQILDILN 563
            S  DDL +   G   SI+ I E  + FC  SGL I++ KS ++ AG+    K++I     
Sbjct: 349  SFADDLMVLSDGKTRSIEGILEVFDEFCKRSGLRISLEKSTLYMAGVSPIIKQEIAAKFL 408

Query: 562  MELGDLPVTYLGVPLCSCRATKLEYQPLISKITSKVNAWASRKLSYAGRLELVKGVLYGV 383
             ++G LPV YLG+PL + R T  +Y PL+ +I  ++  W  R  S+AGR  L+K VL+ +
Sbjct: 409  FDVGQLPVRYLGLPLVTKRLTSADYSPLLEQIKKRIATWTFRFFSFAGRFNLIKSVLWSI 468

Query: 382  IHYWCKVFLMPKKVVKVMVASMRNFLWSGSN-DVRRAPIAWKAVCKPRDEGGLGIKEVLS 206
             ++W   F +P++ ++ +     +FLWSGS     +A I+W  VCKP+ EGGLG++ +  
Sbjct: 469  CNFWLAAFRLPRQCIREIDKLCSSFLWSGSEMSSHKAKISWDIVCKPKAEGGLGLRNLKE 528

Query: 205  WN*TSLCELIYDLAQDTNSIWHTCGHANLLRGNIIWTV-QRKSTSSAIWQTILHIRDEIQ 29
             N  S  +L++ +  ++NS+W       L+R   IW++ Q  S  S IW+ IL IRD  +
Sbjct: 529  ANDVSCLKLVWRIISNSNSLWTKWVAEYLIRKKSIWSLKQSTSMGSWIWRKILKIRDVAK 588

Query: 28   S 26
            S
Sbjct: 589  S 589


>emb|CAB72467.1| putative protein [Arabidopsis thaliana]
          Length = 762

 Score =  171 bits (432), Expect = 3e-40
 Identities = 85/236 (36%), Positives = 139/236 (58%), Gaps = 2/236 (0%)
 Frame = -2

Query: 742 STPDDLFIFIKGDIPSIQSIRETLNMFCNTSGLMININKSRIFCAGLDSSTKEQILDILN 563
           S  DDL I   G   SI+ I E  ++F   SGL I++ KS IF AGL S+++ Q+     
Sbjct: 255 SFADDLMILTDGQCRSIEGIIEVFDLFSKWSGLKISMEKSTIFSAGLSSTSRAQLHTHFP 314

Query: 562 MELGDLPVTYLGVPLCSCRATKLEYQPLISKITSKVNAWASRKLSYAGRLELVKGVLYGV 383
            E+G+LP+ YLG+PL + R + ++Y PLI +I  ++ +W+SR LS+AGR  L+  +++  
Sbjct: 315 FEVGELPIRYLGLPLVTKRLSSVDYAPLIEQIRKRIGSWSSRFLSFAGRFNLISSIIWSS 374

Query: 382 IHYWCKVFLMPKKVVKVMVASMRNFLWSGSN-DVRRAPIAWKAVCKPRDEGGLGIKEVLS 206
            ++W   F +P+  ++ +     +FLWSG+N + ++A I+W  VCKP+ EGGLG++ +  
Sbjct: 375 CNFWLSAFQLPRACIQEIEKLCSSFLWSGTNLNSKKAKISWNQVCKPKSEGGLGLRSLKE 434

Query: 205 WN*TSLCELIYDLAQDTNSIWHTCGHANLLRGNIIWTVQRKST-SSAIWQTILHIR 41
            N     +L++ +    +S+W      NLL+  I W V+  +   S IW+ IL  R
Sbjct: 435 ANDVCCLKLVWRIISHGDSLWVKWVEHNLLKREIFWIVKENANLGSWIWKKILKYR 490


>ref|XP_006590026.1| PREDICTED: uncharacterized protein LOC102660482 [Glycine max]
          Length = 303

 Score =  170 bits (430), Expect = 5e-40
 Identities = 85/238 (35%), Positives = 135/238 (56%), Gaps = 1/238 (0%)
 Frame = -2

Query: 733 DDLFIFIKGDIPSIQSIRETLNMFCNTSGLMININKSRIFCAGLDSSTKEQILDILNMEL 554
           DD+ +  +GDIPS+ ++   L  FC   GL I+ +KS I+ + + +     I  +    L
Sbjct: 28  DDIMLLSRGDIPSMSTMFAKLQHFCRVLGLSISSDKSSIYSSSIRTHELSHIQQLTGFSL 87

Query: 553 GDLPVTYLGVPLCSCRATKLEYQPLISKITSKVNAWASRKLSYAGRLELVKGVLYGVIHY 374
           G  P  YLGVPL S R     Y PL+SKIT  +  W+ + LSYAG+LEL++ V+ G++++
Sbjct: 88  GGFPFRYLGVPLLSSRLNVCHYAPLLSKITGLIQGWSRKSLSYAGKLELIRAVIQGIVNF 147

Query: 373 WCKVFLMPKKVVKVMVASMRNFLWSGSNDVRRAP-IAWKAVCKPRDEGGLGIKEVLSWN* 197
           W  +F +P+ V+  + AS RNFLW  ++  ++ P +AW  VC P+ EGGLG+  +  WN 
Sbjct: 148 WIGIFPLPQSVLDRINASCRNFLWGKADIGKKKPLVAWSVVCSPKREGGLGLFNLKDWNL 207

Query: 196 TSLCELIYDLAQDTNSIWHTCGHANLLRGNIIWTVQRKSTSSAIWQTILHIRDEIQSR 23
             L  +++D     +S+W    H    R + +W     S+ S + + I+ IRD I S+
Sbjct: 208 ALLSCILWDFHCKKDSLW---VHHYYFRRSDVWNYNTSSSYSVLIKKIIQIRDFIISK 262


>dbj|BAB08270.1| non-LTR retroelement reverse transcriptase-like protein
           [Arabidopsis thaliana]
          Length = 489

 Score =  169 bits (429), Expect = 7e-40
 Identities = 85/237 (35%), Positives = 137/237 (57%), Gaps = 2/237 (0%)
 Frame = -2

Query: 742 STPDDLFIFIKGDIPSIQSIRETLNMFCNTSGLMININKSRIFCAGLDSSTKEQILDILN 563
           S  DDL +   G + SI+ I E    F   SGL I++ KS ++ AGL  ++ ++++    
Sbjct: 84  SFADDLMVLSDGKVRSIEGIVEVFETFAKCSGLRISMEKSTVYFAGLSHTSPQEVMAHFP 143

Query: 562 MELGDLPVTYLGVPLCSCRATKLEYQPLISKITSKVNAWASRKLSYAGRLELVKGVLYGV 383
             +G LPV YLG+PL + + +  +Y PLI  I  K+ +W++R LSYAGRL L+  VL+ +
Sbjct: 144 FAVGTLPVRYLGLPLVTKQLSSTDYLPLIEHIKKKIGSWSARFLSYAGRLNLISSVLWSI 203

Query: 382 IHYWCKVFLMPKKVVKVMVASMRNFLWSGSN-DVRRAPIAWKAVCKPRDEGGLGIKEVLS 206
            ++W   F +P++ ++ +      +LWSG + +  +A IAW  VCKP+DEGGLG++ +  
Sbjct: 204 CNFWMGAFRLPRECIREIDKMCSAYLWSGGDLNTSKAKIAWTDVCKPKDEGGLGLRSLKE 263

Query: 205 WN*TSLCELIYDLAQDTNSIWHTCGHANLLRGNIIWTV-QRKSTSSAIWQTILHIRD 38
            N  S  +LI+ +    +S+W    HA LL+    W V +  S  S +W+ +L  RD
Sbjct: 264 ANDVSCLKLIWRIISHADSLWVKWIHATLLKQVSFWAVRENTSLGSWMWKKVLKFRD 320


>ref|XP_006579104.1| PREDICTED: uncharacterized protein LOC102661523 [Glycine max]
          Length = 947

 Score =  169 bits (427), Expect = 1e-39
 Identities = 78/237 (32%), Positives = 143/237 (60%), Gaps = 1/237 (0%)
 Frame = -2

Query: 733  DDLFIFIKGDIPSIQSIRETLNMFCNTSGLMININKSRIFCAGLDSSTKEQILDILNMEL 554
            DD+ +F +GD+ S++ +   +N F  T+GL++N NK RI+  G+D +TK +I  I + E 
Sbjct: 518  DDVLLFCRGDVMSVEMMLHVINKFSATTGLVVNPNKCRIYFGGVDGTTKNKIQQISSYEE 577

Query: 553  GDLPVTYLGVPLCSCRATKLEYQPLISKITSKVNAWASRKLSYAGRLELVKGVLYGVIHY 374
            G LPV YLGVPL S +     Y PLI KIT+++  W S+ L+  GR+++V   +  ++ +
Sbjct: 578  GQLPVRYLGVPLTSKKLNIKYYLPLIDKITTRIRHWTSKLLNMTGRVQMVNCTITAIVQF 637

Query: 373  WCKVFLMPKKVVKVMVASMRNFLWSGSNDV-RRAPIAWKAVCKPRDEGGLGIKEVLSWN* 197
            W +   +P  V+K + +  R+F+WS S ++ R++PIAW +VC+P+ +GGL I  +  WN 
Sbjct: 638  WMQCLPIPMSVIKKIDSMCRSFVWSRSTEITRKSPIAWNSVCRPKGQGGLNIFNLKVWNH 697

Query: 196  TSLCELIYDLAQDTNSIWHTCGHANLLRGNIIWTVQRKSTSSAIWQTILHIRDEIQS 26
             ++   +++L +  +++W    HA+ ++ + +      +  S + + +L  R+ I +
Sbjct: 698  ITVLNCLWNLCKKVDNLWVKWIHAHYIKNSSVMNTMVTNNFSWVLKNVLSQREYIHT 754


>dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis
            thaliana]
          Length = 1223

 Score =  167 bits (424), Expect = 3e-39
 Identities = 84/237 (35%), Positives = 135/237 (56%), Gaps = 2/237 (0%)
 Frame = -2

Query: 742  STPDDLFIFIKGDIPSIQSIRETLNMFCNTSGLMININKSRIFCAGLDSSTKEQILDILN 563
            S  DDL +   G I SI+ I +  + F   SGL I++ KS ++ AGL ++ + ++ D   
Sbjct: 702  SFADDLMVLSDGKIRSIERIIKVFDEFAKWSGLRISLEKSTVYLAGLSATARNEVADRFP 761

Query: 562  MELGDLPVTYLGVPLCSCRATKLEYQPLISKITSKVNAWASRKLSYAGRLELVKGVLYGV 383
               G LPV YLG+PL + R +  +  PL+ ++  ++ +W SR LSYAGRL L+  VL+ +
Sbjct: 762  FSSGQLPVRYLGLPLITKRLSTTDCLPLLEQVRKRIGSWTSRFLSYAGRLNLISSVLWSI 821

Query: 382  IHYWCKVFLMPKKVVKVMVASMRNFLWSGSN-DVRRAPIAWKAVCKPRDEGGLGIKEVLS 206
             ++W   F +P+K ++ +      FLWSG+  +  +A I+W  VCKP+DEGGLG++ +  
Sbjct: 822  CNFWLAAFRLPRKCIRELEKMCSAFLWSGTEMNSNKAKISWHMVCKPKDEGGLGLRSLKE 881

Query: 205  WN*TSLCELIYDLAQDTNSIWHTCGHANLLRGNIIWTV-QRKSTSSAIWQTILHIRD 38
             N     +L++ +   +NS+W      +LLR    W V Q  S  S IW+ +L  R+
Sbjct: 882  ANDVCCLKLVWKIVSHSNSLWVKWVDQHLLRNASFWEVKQTVSQGSWIWKKLLKYRE 938


>gb|AAG50886.1|AC025294_24 hypothetical protein [Arabidopsis thaliana]
          Length = 629

 Score =  167 bits (423), Expect = 3e-39
 Identities = 86/248 (34%), Positives = 140/248 (56%), Gaps = 4/248 (1%)
 Frame = -2

Query: 733 DDLFIFIKGDIPSIQSIRETLNMFCNTSGLMININKSRIFCAGLDSSTKEQILDILNMEL 554
           DDL I   G + S+  I E +N+F   SGL IN+ K+ ++ AG+    +  ++      L
Sbjct: 110 DDLMILTDGKVRSVDGIVEVMNLFAKRSGLQINMEKTTLYTAGVSDHNRYMMISRYPFGL 169

Query: 553 GDLPVTYLGVPLCSCRATKLEYQPLISKITSKVNAWASRKLSYAGRLELVKGVLYGVIHY 374
           G LPV YLG+PL + R TK +  PL  +I +++  W SR LS+AGRL L+  VL+  +++
Sbjct: 170 GQLPVRYLGLPLVTKRLTKEDLSPLFEQIRNRIGTWTSRYLSFAGRLNLISSVLWSTMNF 229

Query: 373 WCKVFLMPKKVVKVMVASMRNFLWSGSN-DVRRAPIAWKAVCKPRDEGGLGIKEVLSWN* 197
           W   F +P   +K + +    FLWSG     R+A ++W  +CKP+ EGGLG++ +   N 
Sbjct: 230 WMSAFRLPSACLKEINSICSAFLWSGPELHRRKAKVSWDDICKPKQEGGLGLRSLTEANV 289

Query: 196 TSLCELIYDLAQDTNSIWHTCGHANLLRGNIIWTV-QRKSTSSAIWQTILHIRDEIQ--S 26
            S+ +LI+ +  + +S+W      NLL+    W++    S  S +W+ +L  R+  +  S
Sbjct: 290 VSVLKLIWRVTSNDDSLWVKWSKMNLLKQESFWSLTPNSSLGSWMWKKMLKYRETAKPFS 349

Query: 25  RMENNSRA 2
           R+E N+ A
Sbjct: 350 RVEVNNGA 357


>gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]
          Length = 1213

 Score =  165 bits (418), Expect = 1e-38
 Identities = 82/232 (35%), Positives = 134/232 (57%), Gaps = 1/232 (0%)
 Frame = -2

Query: 733  DDLFIFIKGDIPSIQSIRETLNMFCNTSGLMININKSRIFCAGLDSSTKEQILDILNMEL 554
            DD+ IF  G   S+  I ETL+ F + SGL +N +KS ++ AGL+   +          +
Sbjct: 699  DDVMIFFDGGSFSLHGICETLDDFASWSGLKVNKDKSHLYLAGLNQ-LESNANAAYGFPI 757

Query: 553  GDLPVTYLGVPLCSCRATKLEYQPLISKITSKVNAWASRKLSYAGRLELVKGVLYGVIHY 374
            G LP+ YLG+PL + +    EY+PL+ KIT++  +W ++ LS+AGR++L+  V++G I++
Sbjct: 758  GTLPIRYLGLPLMNRKLRIAEYEPLLEKITARFRSWVNKCLSFAGRIQLISSVIFGSINF 817

Query: 373  WCKVFLMPKKVVKVMVASMRNFLWSGS-NDVRRAPIAWKAVCKPRDEGGLGIKEVLSWN* 197
            W   FL+PK  +K + +    FLWSG+    +   ++W A+C P+ EGGLG++ +L WN 
Sbjct: 818  WMSTFLLPKGCIKRIESLCSRFLWSGNIEQAKGIKVSWAALCLPKSEGGLGLRRLLEWNK 877

Query: 196  TSLCELIYDLAQDTNSIWHTCGHANLLRGNIIWTVQRKSTSSAIWQTILHIR 41
            T    LI+ L    +S+W    H + L     W V+   + S  W+ +L +R
Sbjct: 878  TLSMRLIWRLFVAKDSLWADWQHLHHLSRGSFWAVEGGQSDSWTWKRLLSLR 929


>gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transcriptase [Brassica napus]
          Length = 1214

 Score =  165 bits (417), Expect = 2e-38
 Identities = 87/234 (37%), Positives = 128/234 (54%), Gaps = 3/234 (1%)
 Frame = -2

Query: 733  DDLFIFIKGDIPSIQSIRETLNMFCNTSGLMININKSRIFCAGLDSSTKEQILDILNMEL 554
            DDL IF  G   S++ I+  L  F N SGL +N  KS ++ AGL+ + KE  L       
Sbjct: 698  DDLMIFYDGKASSLRGIKSVLESFKNLSGLEMNTEKSAVYTAGLEDTDKEDTL-AFGFVN 756

Query: 553  GDLPVTYLGVPLCSCRATKLEYQPLISKITSKVNAWASRKLSYAGRLELVKGVLYGVIHY 374
            G  P  YLG+PL   +  + +Y  LI KI ++ N WA++ LS+AGRL+L+  V+Y  +++
Sbjct: 757  GTFPFRYLGLPLLHRKLRRSDYSQLIDKIAARFNHWATKTLSFAGRLQLISSVIYSTVNF 816

Query: 373  WCKVFLMPKKVVKVMVASMRNFLWSGSNDVRR---APIAWKAVCKPRDEGGLGIKEVLSW 203
            W   F++PK  +K +      FLW   ND+ R     ++W+  C P+ EGGLG++   +W
Sbjct: 817  WLSSFILPKCCLKTIEQMCNRFLW--GNDITRRGDIKVSWQNSCLPKAEGGLGLRNFWTW 874

Query: 202  N*TSLCELIYDLAQDTNSIWHTCGHANLLRGNIIWTVQRKSTSSAIWQTILHIR 41
            N T    LI+ L    +S+W    HAN LR    W  +  S  S IW+ IL +R
Sbjct: 875  NKTLNLRLIWMLFARRDSLWVAWNHANRLRHVNFWNAEAASHHSWIWKAILGLR 928


>emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1114

 Score =  164 bits (416), Expect = 2e-38
 Identities = 84/233 (36%), Positives = 133/233 (57%), Gaps = 1/233 (0%)
 Frame = -2

Query: 733  DDLFIFIKGDIPSIQSIRETLNMFCNTSGLMININKSRIFCAGLDSSTKEQILDILNMEL 554
            DDL +F + D  SI  I    N F   SGL  +I KS I+  G+     EQ+ D + M +
Sbjct: 693  DDLLMFARADASSISKIMAAFNSFSKASGLQASIEKSCIYFGGVCHEEAEQLADRIQMPI 752

Query: 553  GDLPVTYLGVPLCSCRATKLEYQPLISKITSKVNAWASRKLSYAGRLELVKGVLYGVIHY 374
            G LP  YLGVPL S +    + +PLI KIT++   W +  LSYAGRL+LVK +LY + +Y
Sbjct: 753  GSLPFRYLGVPLASKKLNFSQCKPLIDKITTRAQGWVAHLLSYAGRLQLVKTILYSMQNY 812

Query: 373  WCKVFLMPKKVVKVMVASMRNFLWSGSNDVR-RAPIAWKAVCKPRDEGGLGIKEVLSWN* 197
            W ++F +PKK++K +  + R FLW+G+ D   +AP+AW  + +P+  GGL +  ++ WN 
Sbjct: 813  WGQIFPLPKKLIKAVETTCRKFLWTGTVDTSYKAPVAWDFLQQPKSTGGLNVTNMVLWNK 872

Query: 196  TSLCELIYDLAQDTNSIWHTCGHANLLRGNIIWTVQRKSTSSAIWQTILHIRD 38
             ++ +L++ +    + +W    +A  ++   I  V   S +S I + I   R+
Sbjct: 873  AAILKLLWAITFKQDKLWVRWVNAYYIKRQNIENVTVSSNTSWILRKIFESRE 925


>gb|AAF87143.1|AC002423_8 T23E23.16 [Arabidopsis thaliana]
          Length = 653

 Score =  164 bits (414), Expect = 4e-38
 Identities = 82/230 (35%), Positives = 129/230 (56%), Gaps = 2/230 (0%)
 Frame = -2

Query: 742 STPDDLFIFIKGDIPSIQSIRETLNMFCNTSGLMININKSRIFCAGLDSSTKEQILDILN 563
           S  DDL +   G + SI  I E  ++F   SGL I++ KS I+ AG+      +I +   
Sbjct: 229 SFADDLMVLSDGKVRSIDGIVEVFDIFAKFSGLKISMEKSTIYLAGVTEDVYHEIQNRYQ 288

Query: 562 MELGDLPVTYLGVPLCSCRATKLEYQPLISKITSKVNAWASRKLSYAGRLELVKGVLYGV 383
            ++G LPV YLG+PL + R T  +Y PL+  I  K+  W +R LSYAGRL L+  VL+ +
Sbjct: 289 FDVGQLPVRYLGLPLVTKRLTATDYSPLLEHIKKKIGTWTTRYLSYAGRLNLITSVLWSI 348

Query: 382 IHYWCKVFLMPKKVVKVMVASMRNFLWSGSN-DVRRAPIAWKAVCKPRDEGGLGIKEVLS 206
            ++W   F +P++ ++ +      FLWSG + + R+  + W  VCKP+ EGGLG++ +  
Sbjct: 349 CNFWLAAFRLPRECIREIDKICSAFLWSGPDLNPRKTRVCWGDVCKPKQEGGLGLRSLKE 408

Query: 205 WN*TSLCELIYDLAQDTNSIWHTCGHANLLRGNIIWTVQRKST-SSAIWQ 59
            N  S  +LI+ +   TNS+W       LL+ +  W+VQ  +   S +W+
Sbjct: 409 MNEVSCLKLIWRIVSHTNSLWVRWIEQYLLKHDTFWSVQTTTNMDSVLWR 458


>gb|AAG51098.1|AC025295_6 hypothetical protein [Arabidopsis thaliana]
          Length = 504

 Score =  162 bits (411), Expect = 9e-38
 Identities = 80/237 (33%), Positives = 136/237 (57%), Gaps = 2/237 (0%)
 Frame = -2

Query: 742 STPDDLFIFIKGDIPSIQSIRETLNMFCNTSGLMININKSRIFCAGLDSSTKEQILDILN 563
           S  DDL +   G + SI+ I +  + F   S L I++ KS ++ AGL  +T+++++D  +
Sbjct: 32  SFADDLMVLSDGKVRSIEGIVDVFDTFAKCSDLKISMEKSTVYLAGLSHTTRQEVIDRFS 91

Query: 562 MELGDLPVTYLGVPLCSCRATKLEYQPLISKITSKVNAWASRKLSYAGRLELVKGVLYGV 383
             +G LPV YLG+PL + + +  +Y PLI  I  K+ +W++R LSY GRL L+  +L+ +
Sbjct: 92  FAVGTLPVRYLGLPLVTKQFSSTDYLPLIDHIKQKICSWSARFLSYTGRLNLISSILWSI 151

Query: 382 IHYWCKVFLMPKKVVKVMVASMRNFLWSGSN-DVRRAPIAWKAVCKPRDEGGLGIKEVLS 206
            ++W   F +P+  ++ +      +LWSG   +  +A IAW  VCKP++EGGLG++ +  
Sbjct: 152 CNFWMGAFRLPRDCIREIDKMCSAYLWSGGELNTSKAKIAWAFVCKPKEEGGLGLRSLKE 211

Query: 205 WN*TSLCELIYDLAQDTNSIWHTCGHANLLRGNIIWTV-QRKSTSSAIWQTILHIRD 38
            N     +LI+ +    +S+W     ++LL+    W V +  S  S +W+ IL  RD
Sbjct: 212 ANDVCCLKLIWRIISHADSLWVKWIQSSLLKKVFFWAVRENTSLGSWMWRKILKFRD 268


>ref|XP_006584200.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Glycine
           max]
          Length = 239

 Score =  162 bits (410), Expect = 1e-37
 Identities = 79/199 (39%), Positives = 116/199 (58%), Gaps = 2/199 (1%)
 Frame = -2

Query: 733 DDLFIFIKGDIPSIQSIRETLNMFCNTSGLMININKSRIFCAGLDSSTKEQILDILNMEL 554
           DD+    +GDIPS+ ++   L  FC  SGL IN +KS I+ AG+       I  +    L
Sbjct: 28  DDIMFLSRGDIPSVSTMFAKLQHFCRVSGLSINSDKSAIYSAGIRPHELSHIQQLTGFNL 87

Query: 553 GDLPVTYLGVPLCSCRATKLEYQPLISKITSKVNAWASRKLSYAGRLELVKGVLYGVIHY 374
           G  P  YLGVPL S R     Y PL+SKIT  +  W+ + LSYAG+LEL++ V+ G++++
Sbjct: 88  GGFPFRYLGVPLLSSRLNVCHYAPLLSKITGLIQGWSRKSLSYAGKLELIRAVIQGIVNF 147

Query: 373 WCKVFLMPKKVVKVMVASMRNFLWSGSNDV--RRAPIAWKAVCKPRDEGGLGIKEVLSWN 200
           W K+F + + V+  + AS  NFLW G  D+   ++ IAW  VC P+ EGGLG+  +  WN
Sbjct: 148 WMKIFPLSQSVLDRINASCCNFLW-GKADIGKNKSLIAWSVVCSPKKEGGLGLFNLKDWN 206

Query: 199 *TSLCELIYDLAQDTNSIW 143
            T L  +++D     + +W
Sbjct: 207 LTLLSRILWDFHCKKDFLW 225