BLASTX nr result

ID: Mentha22_contig00004964 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha22_contig00004964
         (940 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006579213.1| PREDICTED: uncharacterized protein LOC102670...   157   4e-36
ref|XP_006584238.1| PREDICTED: uncharacterized protein LOC102664...   150   9e-34
ref|XP_006582615.1| PREDICTED: uncharacterized protein LOC102665...   147   6e-33
emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulga...   129   2e-27
gb|AAD08951.1| putative reverse transcriptase [Arabidopsis thali...   127   8e-27
ref|XP_006579104.1| PREDICTED: uncharacterized protein LOC102661...   125   2e-26
emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulga...   120   1e-24
ref|XP_006605183.1| PREDICTED: uncharacterized protein LOC102663...   115   2e-23
ref|XP_006595311.1| PREDICTED: uncharacterized protein LOC102668...   105   2e-20
ref|XP_004173856.1| PREDICTED: putative ribonuclease H protein A...   102   2e-19
ref|XP_004293181.1| PREDICTED: uncharacterized protein LOC101298...   100   6e-19
gb|EMT09892.1| Branched-chain-amino-acid aminotransferase-like p...   100   1e-18
gb|AAC63678.1| putative non-LTR retroelement reverse transcripta...    96   2e-17
ref|XP_006584390.1| PREDICTED: putative ribonuclease H protein A...    96   3e-17
ref|XP_006590026.1| PREDICTED: uncharacterized protein LOC102660...    94   1e-16
ref|XP_002459639.1| hypothetical protein SORBIDRAFT_02g007880 [S...    90   1e-15
ref|XP_007201486.1| hypothetical protein PRUPE_ppa016462mg, part...    87   9e-15
emb|CAN69470.1| hypothetical protein VITISV_014371 [Vitis vinifera]    86   3e-14
gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]        85   4e-14
ref|XP_004173049.1| PREDICTED: putative ribonuclease H protein A...    84   8e-14

>ref|XP_006579213.1| PREDICTED: uncharacterized protein LOC102670237 [Glycine max]
          Length = 383

 Score =  157 bits (398), Expect = 4e-36
 Identities = 86/254 (33%), Positives = 134/254 (52%), Gaps = 8/254 (3%)
 Frame = -3

Query: 938 LQALPLPATVIDRITKLLRKFLWV----GNYCP-VAWTQVCLPRHEGGLGLRDLSAWNKA 774
           +   PLP +V+D I    R FLW     G   P VAW++VC P+ EGGLGL +L  WN A
Sbjct: 137 MSIFPLPQSVLDTIIATCRNFLWGKADGGKIKPLVAWSEVCTPKKEGGLGLFNLKDWNIA 196

Query: 773 LHSKTLWNIHAKSDSLWIQWVHGEYIRDKSVWDVSFPKRDAP--HFKNILLIRDQILHDC 600
           L S  LW++H+K DSLW++ VH  Y +  +VWD      D+   H ++I++ +++     
Sbjct: 197 LLSCILWDLHSKKDSLWVRLVHHYYFKGGNVWDFISSSSDSVFIHIRDIIISKEE----- 251

Query: 599 GGNLTDAQSKLASWFAGDRG-TKEAYEHFRAKGEKKFWHKAIWRSYIPPKFSVTLWLAMR 423
             N+  A+  L SW   ++    + Y++ R       W   IW   IP K S  LWLA +
Sbjct: 252 --NIEVAKLMLNSWGCNEQTLAGKMYDYIRGTRPVVHWSSIIWNPVIPSKMSFILWLATK 309

Query: 422 GRLKTFDRLKFSDIPR*CMLCKAAEETNDHLFFQCPRTVEIWSGICSWLKIRQRISTISS 243
            RL   DR  F +    C LC    E++ HLFF C  ++ +W+ I  W+ ++++  ++  
Sbjct: 310 NRLLALDRAAFLNKGFLCPLCTNEAESHAHLFFSCRTSLRVWAHIRDWIPLKRQSISLQH 369

Query: 242 AIRRFQQEKAGSGI 201
           +I    + +A SG+
Sbjct: 370 SISALIRRRATSGV 383


>ref|XP_006584238.1| PREDICTED: uncharacterized protein LOC102664824 [Glycine max]
          Length = 939

 Score =  150 bits (378), Expect = 9e-34
 Identities = 97/299 (32%), Positives = 144/299 (48%), Gaps = 7/299 (2%)
 Frame = -3

Query: 938  LQALPLPATVIDRITKLLRKFLWVGNY-----CPVAWTQVCLPRHEGGLGLRDLSAWNKA 774
            +Q LPLP  VI RI  + R FLW+GN       P+AW +VC P+  GGL + +L+ WNK 
Sbjct: 639  MQCLPLPKFVIMRINAICRSFLWIGNSNISRKSPIAWEKVCSPKINGGLNIINLAIWNKI 698

Query: 773  LHSKTLWNIHAKSDSLWIQWVHGEYIRDKSVWDVSFPKRDAPHFKNILLIRDQILHDCGG 594
               K LWN+  KSD+LWI+W+H  YIR +S+W +   K  +    +++ +R  +L     
Sbjct: 699  SILKLLWNVCNKSDNLWIKWLHTYYIRGQSIWSMVLKKSHSWIMSSMMKLRPLLL----- 753

Query: 593  NLTDAQSKLASWFAGDRGTKEAYEHFRAKGEKKFWHKAIWRSYIPPKFSVTLWLAMRGRL 414
                 QS++   F      K+ Y     + EK  W   +  +   P+    LW A   RL
Sbjct: 754  ---QYQSRMQDVFK----MKKIYLALFEESEKMSWRTLMCNNLARPRALFCLWQACHFRL 806

Query: 413  KTFDRL-KFS-DIPR*CMLCKAAEETNDHLFFQCPRTVEIWSGICSWLKIRQRISTISSA 240
             + DRL KF  ++   C  C ++ E+++HLFF C     IW+ + +WL+I    ST S  
Sbjct: 807  ASKDRLIKFGLNVDANCAFC-SSMESHEHLFFGCIELKTIWTAVLNWLQIIHMPSTWSEE 865

Query: 239  IRRFQQEKAGSGIVRKAKWIALGATVSYIWYARNSLYTEGKSPVSSAIIKEIKTDVYRV 63
            +    ++  G G        A   T+ +IW  RN     G           I T +YRV
Sbjct: 866  LNWITRKCKGKGWRAMLLKCAFTETIYHIWAYRNHRVFGGNVNNRKVEDSIINTIIYRV 924


>ref|XP_006582615.1| PREDICTED: uncharacterized protein LOC102665746 [Glycine max]
          Length = 506

 Score =  147 bits (371), Expect = 6e-33
 Identities = 85/275 (30%), Positives = 131/275 (47%), Gaps = 7/275 (2%)
 Frame = -3

Query: 938 LQALPLPATVIDRITKLLRKFLWVGNY-----CPVAWTQVCLPRHEGGLGLRDLSAWNKA 774
           L   P P +V+ +I  + R FLW G +      PVAW Q+C PR  GGL + D+  WNKA
Sbjct: 197 LNCFPFPKSVLQKIEAICRIFLWTGGFEGSRKSPVAWKQICSPRSCGGLNIIDIDIWNKA 256

Query: 773 LHSKTLWNIHAKSDSLWIQWVHGEYIRDKSVWDVSFPKRDAPHFKNILLIRDQILHDCGG 594
              K LWN+ +K DSLW++W+   Y++   +  +     D+   K IL  R+ +      
Sbjct: 257 NLMKLLWNLSSKEDSLWVKWIQAYYVKRSELMHIEMKNTDSWIMKAILKQREDL------ 310

Query: 593 NLTDAQSKLASWFAGDRGTKEAYEHFRAKGEKKFWHKAIWRSYIPPKFSVTLWLAMRGRL 414
              D   +L     G     + Y   +  G++K W   ++ +   P+ +  LWLA  GRL
Sbjct: 311 EKIDNMEEL--MIRGSINMGKLYRKLQDCGQRKEWKNLLYGNTARPRANFILWLACHGRL 368

Query: 413 KTFDRL-KFSDI-PR*CMLCKAAEETNDHLFFQCPRTVEIWSGICSWLKIRQRISTISSA 240
            T DRL K+  I  + C  C + EE+ +HLFF C  +  +W  +  W++IR   S   + 
Sbjct: 369 STKDRLCKYGMIDDKSCCFC-SEEESMNHLFFVCDNSKRVWMEVLQWVQIRHDPSDWPNE 427

Query: 239 IRRFQQEKAGSGIVRKAKWIALGATVSYIWYARNS 135
           +        G G       +A+  T+  IW  RN+
Sbjct: 428 LHWLTHHTKGKGTRAAVLKMAIAETIYEIWNIRNN 462


>emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1110

 Score =  129 bits (323), Expect = 2e-27
 Identities = 85/296 (28%), Positives = 130/296 (43%), Gaps = 15/296 (5%)
 Frame = -3

Query: 926  PLPATVIDRITKLLRKFLWVGNY-----CPVAWTQVCLPRHEGGLGLRDLSAWNKALHSK 762
            PL   VI  + K+ RKFLW G        PVAW  +  P+  GG  + ++  WN+A   K
Sbjct: 815  PLSKKVIQAVEKVCRKFLWTGKTEETKKAPVAWATIQRPKSRGGWNVINMKYWNRAAMLK 874

Query: 761  TLWNIHAKSDSLWIQWVHGEYIRDKSVWDVSFPKRDAPHFKNILLIRDQILHDCGGNLTD 582
             LW I  K D LW++W+H  YI+ + +  V+   +     + I+  RD +          
Sbjct: 875  LLWAIEFKRDKLWVRWIHSYYIKRQDILTVNISNQTTWILRKIVKARDHL---------- 924

Query: 581  AQSKLASW---FAGDR-GTKEAYEHFRAKGEKKFWHKAIWRSYIPPKFSVTLWLAMRGRL 414
              S +  W     GD+   K+AY+     GE+  W + I  +Y  PK    LW+ +  RL
Sbjct: 925  --SNIGDWDEICIGDKFSMKKAYKKISENGERVRWRRLICNNYATPKSKFILWMMLHERL 982

Query: 413  KTFDRLKFSDIPR*C----MLCKAAEETNDHLFFQCPRTVEIWSGICSWLKIRQRISTIS 246
             T DR+    +   C     LC+   ET  HLFF C  +  +WS IC  +    R     
Sbjct: 983  PTVDRISRWGVQ--CDLNYRLCRNDGETIQHLFFSCSYSAGVWSKICYIM----RFPNSG 1036

Query: 245  SAIRRFQQEKAGSGIVRKAKWIALGAT--VSYIWYARNSLYTEGKSPVSSAIIKEI 84
             + +       G    +K K I +  T  V  IW  RN     G++   + ++++I
Sbjct: 1037 VSHQEIISSVCGQARKKKGKLIVMLYTEFVYAIWKQRNKRTFTGENKDENEVLRKI 1092


>gb|AAD08951.1| putative reverse transcriptase [Arabidopsis thaliana]
            gi|20197043|gb|AAM14892.1| putative reverse transcriptase
            [Arabidopsis thaliana]
          Length = 1412

 Score =  127 bits (318), Expect = 8e-27
 Identities = 100/306 (32%), Positives = 131/306 (42%), Gaps = 21/306 (6%)
 Frame = -3

Query: 938  LQALPLPATVIDRITKLLRKFLWVG-----NYCPVAWTQVCLPRHEGGLGLRDLSAWNKA 774
            + A  LP   I  I ++   FLW G     +   VAW  VC P+ EGGLGLR L   NK 
Sbjct: 1081 ISAFRLPRACIREIEQISAAFLWSGTDLNPHKAKVAWHDVCKPKSEGGLGLRSLVDANKI 1140

Query: 773  LHSKTLWNIHAKSDSLWIQWVHGEYIRDKSVWDVSFPKRDAPHFKNILLIRDQILHD--- 603
               K +W + +   SLW+ W+    IR  +V +     R   H       RD IL+D   
Sbjct: 1141 CCFKLIWRLVSAKHSLWVNWIQNNLIR--TVAEALSSHRRRSH-------RDDILNDIEE 1191

Query: 602  ------CGGNLTDAQSKLASWFAGDRGTK----EAYEHFRAKGEKKFWHKAIWRSYIPPK 453
                  C G  T+    L     G    K    E +   R +G  K WHKAIW S   PK
Sbjct: 1192 ELEKLLCRGICTEQDRSLCRSIGGQFKAKFFSPEIWHQIREQGLVKQWHKAIWFSGATPK 1251

Query: 452  FSVTLWLAMRGRLKTFDRLKF--SDIPR*CMLCKAAEETNDHLFFQCPRTVEIWSGICSW 279
            F+   WLA   RL T D++      I   C+LC  + E+ DHLFF C  +  IW  +   
Sbjct: 1252 FTFISWLAAHDRLTTGDKMASWNRGISSVCVLCNISAESRDHLFFSCNFSSHIWDRLTRR 1311

Query: 278  LKIRQRISTISSAIRRFQQEKAGSGIVRKAKWIALGATVSYIWYARNSLYTEGKSPV-SS 102
            L +  R +T   A+      +  SG  R        AT+  +W  RN     G  P+ S 
Sbjct: 1312 L-LLCRYTTNFPALLLLLSGQDFSGTKRFLLRYVFQATIHTLWRERNK-RRHGDLPIPSD 1369

Query: 101  AIIKEI 84
             IIK I
Sbjct: 1370 HIIKFI 1375


>ref|XP_006579104.1| PREDICTED: uncharacterized protein LOC102661523 [Glycine max]
          Length = 947

 Score =  125 bits (314), Expect = 2e-26
 Identities = 90/309 (29%), Positives = 141/309 (45%), Gaps = 18/309 (5%)
 Frame = -3

Query: 938  LQALPLPATVIDRITKLLRKFLW-----VGNYCPVAWTQVCLPRHEGGLGLRDLSAWNKA 774
            +Q LP+P +VI +I  + R F+W     +    P+AW  VC P+ +GGL + +L  WN  
Sbjct: 639  MQCLPIPMSVIKKIDSMCRSFVWSRSTEITRKSPIAWNSVCRPKGQGGLNIFNLKVWNHI 698

Query: 773  LHSKTLWNIHAKSDSLWIQWVHGEYIRDKSVWDVSFPKRDAPHFKNILLIRDQI--LHDC 600
                 LWN+  K D+LW++W+H  YI++ SV +       +   KN+L  R+ I  L   
Sbjct: 699  TVLNCLWNLCKKVDNLWVKWIHAHYIKNSSVMNTMVTNNFSWVLKNVLSQREYIHTLQPV 758

Query: 599  GGNLTDAQSKLASWFAGDRGTKEAYEHFRAKGEKKFWHKAIWRSYIPPKFSVTLWLAMRG 420
               L +++             K+AY+    + ++  W   + ++   P+   T WLA  G
Sbjct: 759  WDELLNSER---------FKMKKAYDKM-MEADRVHWSGLMRKNCARPRAIHTTWLACHG 808

Query: 419  RLKTFDRL-KFSDI-PR*CMLCKAAEETNDHLFFQCPRTVEIWSGICSWL---KIRQRIS 255
            RL T DRL +F  I  +   LCK  EET +H+ F C    +IWS + + +    + Q   
Sbjct: 809  RLGTKDRLVRFGMITDKIWSLCKEVEETQNHILFSCKVATDIWSNVLNRIGIDHVPQEWP 868

Query: 254  TISSAIRRFQQEKAGSGIVRKAKWIALGATVSYIWYARNS------LYTEGKSPVSSAII 93
                 +      K     + K   +++  T+  IW  RNS       Y      VS  II
Sbjct: 869  LELDWLLNLTNRKGWRAYLLK---LSVTETIYGIWINRNSKIFGDNTYRNTSKDVSDGII 925

Query: 92   KEIKTDVYR 66
            + I   VYR
Sbjct: 926  ENI---VYR 931


>emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1114

 Score =  120 bits (300), Expect = 1e-24
 Identities = 82/292 (28%), Positives = 127/292 (43%), Gaps = 8/292 (2%)
 Frame = -3

Query: 935  QALPLPATVIDRITKLLRKFLWVGNY-----CPVAWTQVCLPRHEGGLGLRDLSAWNKAL 771
            Q  PLP  +I  +    RKFLW G        PVAW  +  P+  GGL + ++  WNKA 
Sbjct: 815  QIFPLPKKLIKAVETTCRKFLWTGTVDTSYKAPVAWDFLQQPKSTGGLNVTNMVLWNKAA 874

Query: 770  HSKTLWNIHAKSDSLWIQWVHGEYIRDKSVWDVSFPKRDAPHFKNILLIRDQILHDCGGN 591
              K LW I  K D LW++WV+  YI+ +++ +V+     +   + I   R+ +    G  
Sbjct: 875  ILKLLWAITFKQDKLWVRWVNAYYIKRQNIENVTVSSNTSWILRKIFESRELLTRTGGWE 934

Query: 590  LTDAQSKLASWFAGDRGTKEAYEHFRAKGEKKFWHKAIWRSYIPPKFSVTLWLAMRGRLK 411
                    +         K+ Y+  +   E   W + I  +   PK    LWLAM  RL 
Sbjct: 935  AVSNHMNFS--------IKKTYKLLQEDYENVVWKRLICNNKATPKSQFILWLAMLNRLA 986

Query: 410  TFDRLK--FSDIPR*CMLCKAAEETNDHLFFQCPRTVEIWSGICSWLKIRQRISTISSAI 237
            T +R+     D+   C +C    ET  HLFF C  + EIW  +  +L ++ +    + A 
Sbjct: 987  TAERVSRWNRDVSPLCKMCGNEIETIQHLFFNCIYSKEIWGKVLLYLNLQPQAD--AQAK 1044

Query: 236  RRFQQEKAGSGIVRKAKWIALGATVSY-IWYARNSLYTEGKSPVSSAIIKEI 84
            +    +KA S   R   ++ +     Y IW  RN+    G     +  +K I
Sbjct: 1045 KELAIKKARSTKDRNKLYVMMFTESVYAIWLLRNAKVFRGIEINQNQAVKSI 1096


>ref|XP_006605183.1| PREDICTED: uncharacterized protein LOC102663533 [Glycine max]
          Length = 514

 Score =  115 bits (289), Expect = 2e-23
 Identities = 70/221 (31%), Positives = 104/221 (47%), Gaps = 7/221 (3%)
 Frame = -3

Query: 938 LQALPLPATVIDRITKLLRKFLWVGNY-----CPVAWTQVCLPRHEGGLGLRDLSAWNKA 774
           +   P+P  VI +I  + R F+W G+        VAW QVC P   GGL L +L  WN  
Sbjct: 300 MSVFPMPKKVIQKIDSICRSFIWSGSAEVKRKSLVAWKQVCKPARCGGLNLINLELWNVT 359

Query: 773 LHSKTLWNIHAKSDSLWIQWVHGEYIRDKSVWDVSFPKRDAPHFKNILLIRDQILHDCGG 594
              K LWNI +K D+LW++W+H  +++  +V   +         K+++  R Q+      
Sbjct: 360 AMLKCLWNICSKEDNLWVKWIHAYFLKGDNVMSATIKSNSTWILKSVMKQRPQV-----N 414

Query: 593 NLTDAQSKLASWFAGDRGTKEAYEHFRAKGEKKFWHKAIWRSYIPPKFSVTLWLAMRGRL 414
           NL     ++          K+ Y        K  W + +  +   P+ +VTLWLA + RL
Sbjct: 415 NLQLVWIEMLR--KRKFSMKQVYMELVEDHNKIDWFRLLRYNRARPRANVTLWLACQNRL 472

Query: 413 KTFDRLKFSDIPR--*CMLCKAAEETNDHLFFQCPRTVEIW 297
            T  RLK  ++ +   C LCK  +E  DHL F C  T  IW
Sbjct: 473 ATKTRLKNMNMIQCSLCSLCKEQDEDLDHLMFSCRVTKAIW 513


>ref|XP_006595311.1| PREDICTED: uncharacterized protein LOC102668530 [Glycine max]
          Length = 477

 Score =  105 bits (263), Expect = 2e-20
 Identities = 66/217 (30%), Positives = 104/217 (47%), Gaps = 1/217 (0%)
 Frame = -3

Query: 713 VHGEYIRDKSVWDVSFPKRDAPHFKNILLIRDQILHDCGGNLTDAQSKLASWFAGDRGTK 534
           VH  Y +  +VWD      D+   K I+ IRD I+     N+  A+  L SW + ++   
Sbjct: 242 VHHNYFKGGNVWDFISSASDSVLIKKIIHIRD-IITIKEDNVEAAKQTLNSWNSNEQLLA 300

Query: 533 -EAYEHFRAKGEKKFWHKAIWRSYIPPKFSVTLWLAMRGRLKTFDRLKFSDIPR*CMLCK 357
            +AY++ R       W+  +W   IP K S  LWLA +  L T DR  F +    C LC+
Sbjct: 301 GKAYDYIRGVKPAVNWNSVVWNPAIPSKMSFILWLATKNHLLTLDRAAFLNKGLLCPLCR 360

Query: 356 AAEETNDHLFFQCPRTVEIWSGICSWLKIRQRISTISSAIRRFQQEKAGSGIVRKAKWIA 177
              +++ HLFF C  ++++W+ I  W+ + ++  ++   I      +A SG   K + +A
Sbjct: 361 TKAKSHAHLFFSCRISLQVWANIRDWIPLHRQTISLQCTINSRICGRATSGTWGKFRCLA 420

Query: 176 LGATVSYIWYARNSLYTEGKSPVSSAIIKEIKTDVYR 66
           L   V   W +RN L  E        II +IK  VY+
Sbjct: 421 LAIAVYCTWISRNLLLFENSPFSVINIINKIKFLVYK 457


>ref|XP_004173856.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Cucumis
           sativus]
          Length = 342

 Score =  102 bits (254), Expect = 2e-19
 Identities = 89/315 (28%), Positives = 125/315 (39%), Gaps = 61/315 (19%)
 Frame = -3

Query: 899 ITKLLRKFLWVGNY-----CPVAWTQVCLPRHEGGLGLRDLSAWNKALHSKTLWNIHAKS 735
           + K+LR +LW G         V W +VCLP  EGGL +RD S+WN A   K LW +  KS
Sbjct: 7   VDKILRAYLWRGKEEGRSGAKVGWDEVCLPFDEGGLNIRDGSSWNIASTLKILWLLLVKS 66

Query: 734 DSLWIQWVHGEYIRDKSVWDVSFPKRD----------APHFKNILLIR---DQILHDCG- 597
            SLW+ WV    ++ +S+W++                 P  +   +I+   +++++D G 
Sbjct: 67  GSLWVSWVESYILKGRSLWEIDAGMEVGNGRKCRVWLVPWIQGGPIIQQFGERVIYDAGS 126

Query: 596 ----------GNLTDAQSKLAS------------------------WFAGDRGT---KEA 528
                     G   D +  L S                        W  G R +     A
Sbjct: 127 RWDVRLVDFMGRNGDWRWSLVSLDLMDIWDRVQGVRPSPSVEDRWVWVPGSRDSFSITSA 186

Query: 527 YEHFRAKGEKKFWHKAIWRSYIPPKFSVTLWLAMRGRLKTFDRLKFSD--IPR*CMLCKA 354
           +E  R    +  W   +W     PK S   WLA+R RL T DRL   D  IP  C+LC  
Sbjct: 187 WETIRPHSSRVGWSGLLWGGGNIPKHSFCAWLAIRDRLSTRDRLSRWDRSIPLSCLLCGG 246

Query: 353 AEETNDHLFFQCPRTVEIWSG---ICSWLKIRQRISTISSAIRRFQQEKAGSGIVRKAKW 183
             E+ +HLFF          G   +C  L        I S I        G  + RK   
Sbjct: 247 NYESRNHLFFLVILGGRFGRGSFCLCHLL--------IESGI--------GKSVRRKLLR 290

Query: 182 IALGATVSYIWYARN 138
           +   AT+ +IW  RN
Sbjct: 291 LLWCATIYFIWQERN 305


>ref|XP_004293181.1| PREDICTED: uncharacterized protein LOC101298394 [Fragaria vesca
            subsp. vesca]
          Length = 958

 Score =  100 bits (250), Expect = 6e-19
 Identities = 82/311 (26%), Positives = 129/311 (41%), Gaps = 19/311 (6%)
 Frame = -3

Query: 929  LPLPATVIDRITKLLRKFLWVGNYC-----PVAWTQVCLPRHEGGLGLRDLSAWNKALHS 765
            L LP  V+  I K LR FLW GN        VAW+++CLP+ EGGLG++DL  WNKAL  
Sbjct: 651  LILPKKVLKDIEKRLRCFLWAGNCSGRAATKVAWSEICLPKCEGGLGIKDLHCWNKALMI 710

Query: 764  KTLWNIHAKSDSLWIQWVHGEYIRDKSVWDVSFPKRDAPHFKNILLIRDQILHDCGGNLT 585
              +WN+ + S + W  WV    ++  S W+   P   + +++ +L IR+         + 
Sbjct: 711  SHIWNLVSSSSNFWTDWVKVYLLKGNSFWNAPLPSICSWNWRKLLKIRELCCSFFVNIIG 770

Query: 584  DAQSKLASWF-----AGDRGTKEAYEHFRAKGEKK---------FWHKAIWRSYIPPKFS 447
            D ++  + WF      G    + +       G  K         +   + W +  P +F 
Sbjct: 771  DGRA-TSLWFDNWHPLGPLTLRWSSNIIGESGLSKSAMLTPNGFYSTSSAWNTLRPSRFI 829

Query: 446  VTLWLAMRGRLKTFDRLKFSDIPR*CMLCKAAEETNDHLFFQCPRTVEIWSGICSWLKIR 267
            V  +     RL  F                   ET++HLFF C  +  IW+ + S   + 
Sbjct: 830  VPWY-----RLVWF-----------------VAETHNHLFFDCAYSFGIWTHVLSKCDVS 867

Query: 266  QRISTISSAIRRFQQEKAGSGIVRKAKWIALGATVSYIWYARNSLYTEGKSPVSSAIIKE 87
            + +   S  I        G+ +      +AL A V  IW  RN+     +S   + + K 
Sbjct: 868  KPLLPWSDFIFWVATNWKGNSLPVVILKLALQAVVYAIWRERNNRRFRNESLPPAVVFKG 927

Query: 86   IKTDVYRVLYS 54
            I   +   L S
Sbjct: 928  IVESIRLCLLS 938


>gb|EMT09892.1| Branched-chain-amino-acid aminotransferase-like protein 3,
           chloroplastic [Aegilops tauschii]
          Length = 600

 Score = 99.8 bits (247), Expect = 1e-18
 Identities = 81/283 (28%), Positives = 119/283 (42%), Gaps = 26/283 (9%)
 Frame = -3

Query: 908 IDRITKLLRKFLWVGN------YCPVAWTQVCLPRHEGGLGLRDLSAWNKALHSKTLWNI 747
           + ++  LLR F W G        C VAW  V LPR  GGLG+R L A N+A+  K +  I
Sbjct: 4   LGKLECLLRAFFWQGKSKVKGGQCLVAWDTVSLPRINGGLGIRQLQAHNQAMMCKFVSKI 63

Query: 746 HAKSDSLWIQW---------------VHGEYIRDKSVWDVSFPKRDAPHFKNILLIRDQI 612
              SD    +W               VH +Y+     W +      +      LL   ++
Sbjct: 64  LQSSDIPCYKWFATHYCRAALPQACSVHSQYVN--GAWAIQLHPNLSQMASTELLALHEL 121

Query: 611 LHDCGGNLTDAQSKLASWFAGDRGTKEAYEHFRAKGEKKFWHKAIWRSYIPPKFSVTLWL 432
           L D   NL +   ++ S  +G   T+  Y     +G    +   +W S IP K  + LWL
Sbjct: 122 LSDVTPNLLNEDKRIPSLGSGQLSTRHFYSLLTFRGVLTTFEPWVWDSLIPLKHRIFLWL 181

Query: 431 AMRGRLKTFDRL--KFSDIPR*CMLCKA--AEETNDHLFFQCPRTVEIWSGICSWLKIRQ 264
           A RGRL T D +  K   +      C A  A E+ DHL  +C     +W  +     +  
Sbjct: 182 AFRGRLNTRDNMVKKGWSVVAPFAHCDACPAVESADHLLLRCASASVLWGKL-----VLD 236

Query: 263 RISTISSAIRRFQQEKAGSGIVRKAKW-IALGATVSYIWYARN 138
            ++  +  I  F  E+A   +  K KW +A  A    +W+ARN
Sbjct: 237 TLACSAPDILAF-VEQAQHQLSFKRKWNVAFAACALTLWHARN 278


>gb|AAC63678.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1216

 Score = 96.3 bits (238), Expect = 2e-17
 Identities = 54/165 (32%), Positives = 82/165 (49%), Gaps = 2/165 (1%)
 Frame = -3

Query: 539  TKEAYEHFRAKGEKKFWHKAIWRSYIPPKFSVTLWLAMRGRLKTFDRLKF--SDIPR*CM 366
            TK+ + H R    ++ WHK +W ++  PKFS   WLA+R RL T DR+    +  P  C+
Sbjct: 762  TKDTWNHIRTSSNQRAWHKGVWFAHATPKFSFCAWLAIRNRLSTGDRMMTWNNGTPTTCV 821

Query: 365  LCKAAEETNDHLFFQCPRTVEIWSGICSWLKIRQRISTISSAIRRFQQEKAGSGIVRKAK 186
             C +  ET DHLFFQC  + EIW+ I   +  + R ST  SA+  +  +     I     
Sbjct: 822  FCSSPMETRDHLFFQCCYSSEIWTSIAKNV-YKDRFSTKWSAVVNYISDSQPDRIQSFLS 880

Query: 185  WIALGATVSYIWYARNSLYTEGKSPVSSAIIKEIKTDVYRVLYSL 51
                  ++  IW  RNS     KS  +S +I++I   +   L ++
Sbjct: 881  RYTFQVSIHSIWRERNSRRHGEKSRSASNLIRQIDKTIRNQLSTI 925



 Score = 67.8 bits (164), Expect = 6e-09
 Identities = 30/94 (31%), Positives = 48/94 (51%), Gaps = 5/94 (5%)
 Frame = -3

Query: 938 LQALPLPATVIDRITKLLRKFLWVG-----NYCPVAWTQVCLPRHEGGLGLRDLSAWNKA 774
           + A  LP   I+ I ++    LW G         V+W ++C P+ EGGLGL+ L   NK 
Sbjct: 547 MNAFRLPRECINEINRISSALLWSGPELNPKKAKVSWDEICKPKKEGGLGLQSLREANKV 606

Query: 773 LHSKTLWNIHAKSDSLWIQWVHGEYIRDKSVWDV 672
              K +W + +  DSLW++W     ++ +S W +
Sbjct: 607 SSLKLIWRLLSCQDSLWVKWTRMNLLKKESFWSI 640


>ref|XP_006584390.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Glycine
           max]
          Length = 316

 Score = 95.5 bits (236), Expect = 3e-17
 Identities = 51/132 (38%), Positives = 73/132 (55%), Gaps = 5/132 (3%)
 Frame = -3

Query: 938 LQALPLPATVIDRITKLLRKFLW----VGNYCP-VAWTQVCLPRHEGGLGLRDLSAWNKA 774
           ++  PLP +V+DRI      FLW    +G   P VAW  VC P+ EGGLGL +L  WN A
Sbjct: 182 MRIFPLPQSVLDRINASCCNFLWSKADIGKNKPLVAWPVVCSPKQEGGLGLFNLKDWNLA 241

Query: 773 LHSKTLWNIHAKSDSLWIQWVHGEYIRDKSVWDVSFPKRDAPHFKNILLIRDQILHDCGG 594
           L S  LW+ H K DSL ++WVH  Y R    W+ +    ++   K I+ IRD I+     
Sbjct: 242 LLSHILWDFHCKKDSLRVRWVHHYYFRRSDEWNYNISSSNSVLIKKIIQIRDFIISK-EL 300

Query: 593 NLTDAQSKLASW 558
           ++ + + ++ SW
Sbjct: 301 SMEETKKRIQSW 312


>ref|XP_006590026.1| PREDICTED: uncharacterized protein LOC102660482 [Glycine max]
          Length = 303

 Score = 93.6 bits (231), Expect = 1e-16
 Identities = 58/144 (40%), Positives = 77/144 (53%), Gaps = 6/144 (4%)
 Frame = -3

Query: 926 PLPATVIDRITKLLRKFLW----VGNYCP-VAWTQVCLPRHEGGLGLRDLSAWNKALHSK 762
           PLP +V+DRI    R FLW    +G   P VAW+ VC P+ EGGLGL +L  WN AL S 
Sbjct: 153 PLPQSVLDRINASCRNFLWGKADIGKKKPLVAWSVVCSPKREGGLGLFNLKDWNLALLSC 212

Query: 761 TLWNIHAKSDSLWIQWVHGEYIRDKSVWDVSFPKRDAPHFKNILLIRDQILHDCGGNLTD 582
            LW+ H K DSL   WVH  Y R   VW+ +     +   K I+ IRD I+     +  +
Sbjct: 213 ILWDFHCKKDSL---WVHHYYFRRSDVWNYNTSSSYSVLIKKIIQIRDFIISK-ELSTEE 268

Query: 581 AQSKLASWFA-GDRGTKEAYEHFR 513
           A+ ++ SW   G     + YE+ R
Sbjct: 269 AKKRIQSWRTNGQLLVGKVYEYIR 292


>ref|XP_002459639.1| hypothetical protein SORBIDRAFT_02g007880 [Sorghum bicolor]
           gi|241923016|gb|EER96160.1| hypothetical protein
           SORBIDRAFT_02g007880 [Sorghum bicolor]
          Length = 475

 Score = 90.1 bits (222), Expect = 1e-15
 Identities = 72/231 (31%), Positives = 106/231 (45%), Gaps = 11/231 (4%)
 Frame = -3

Query: 938 LQALPLPATVIDRITKLLRKFLWVGNY------CPVAWTQVCLPRHEGGLGLRDLSAWNK 777
           + ++ +P  + + I K+ R+FLW G+       C VAWT V  P   GGLG+ DL  +++
Sbjct: 183 MASMKVPRQLKEDIDKIRRRFLWAGDKELTGGKCKVAWTTVAKPIDFGGLGIIDLERFSR 242

Query: 776 ALHSKTLWNIHAKSDSLWIQWVHGEYIRDKSVWDVSFPKRDAPHFKNILLIRDQILH--D 603
           AL  +  W        LW QW + E   + +   V     +A +F       D I+   +
Sbjct: 243 ALRIR--W--------LWFQWANPERPGNGTEMPVD-KSIEAANFNPEETEEDSIVWTLE 291

Query: 602 CGGNLTDAQSKLASWFAGDRGTKEAYEHFRAKGEKKFWHKA-IWRSYIPPKFSVTLWLAM 426
             G  T A+S  A  FAG+  +                H A IWR +  PK    +WL +
Sbjct: 292 SSGEYT-AKSAYAVQFAGNIVSN---------------HPALIWRVWATPKCKYFIWLLL 335

Query: 425 RGRLKTFDRLKFSDIPR*--CMLCKAAEETNDHLFFQCPRTVEIWSGICSW 279
           + RL T  RL+         C LC+   ET  HLFF+CP ++E+W GI  W
Sbjct: 336 QNRLWTAARLQLRRWTNNYFCALCERNLETAHHLFFECPFSLEVWHGIAVW 386


>ref|XP_007201486.1| hypothetical protein PRUPE_ppa016462mg, partial [Prunus persica]
            gi|462396886|gb|EMJ02685.1| hypothetical protein
            PRUPE_ppa016462mg, partial [Prunus persica]
          Length = 983

 Score = 87.0 bits (214), Expect = 9e-15
 Identities = 69/253 (27%), Positives = 110/253 (43%), Gaps = 45/253 (17%)
 Frame = -3

Query: 920  PATVIDRITKLLRKFLWVG----NYCP-VAWTQVCLPRHEGGLGLRDLSAWNKALHSKTL 756
            P  V  ++ +L+R FLW G      C  V W +V   + EGGLG+  L   N+AL +K L
Sbjct: 671  PIGVATKVEQLMRNFLWEGLEDGKKCHLVRWERVTKSKEEGGLGIGSLRERNEALRAKWL 730

Query: 755  WNIHAKSDSLWIQWVHGEYIRDKSVWDVS---------------------FP-------- 663
            W    +S+SLW + +  +Y  D + + V                      FP        
Sbjct: 731  WRFPLESNSLWHRIIKSKYGIDSNGFSVGNGEKIRFWEDLWLKEWILKNLFPRLSSLSRR 790

Query: 662  KRDAPHFKNILLIRDQILHDCGGN--LTDAQSKLASWFAGDRGTK--EAYEHFRAKGEKK 495
            K+   +     +    IL D  G   L  ++S   SW   ++G+   +++  F     + 
Sbjct: 791  KKSKRNLSEAEIAEVVILLDILGKVRLYGSRSDRRSWEIEEQGSFSCKSFRSFLLSTTRD 850

Query: 494  FWHK--AIWRSYIPPKFSVTLWLAMRGRLKTFDRL-----KFSDIPR*CMLCKAAEETND 336
             +    +IW++  PPK    +WLA+ GR+ T D +     K    P  C+LCK   E  D
Sbjct: 851  VFPPFISIWKAKTPPKIQFFVWLAVNGRINTCDCIQRRQPKMCLYPSWCVLCKENAENID 910

Query: 335  HLFFQCPRTVEIW 297
            HLF  C  ++++W
Sbjct: 911  HLFIHCSYSLKLW 923


>emb|CAN69470.1| hypothetical protein VITISV_014371 [Vitis vinifera]
          Length = 492

 Score = 85.5 bits (210), Expect = 3e-14
 Identities = 61/195 (31%), Positives = 87/195 (44%), Gaps = 15/195 (7%)
 Frame = -3

Query: 836 VCLPRHEGGLGLRDLSAWNKALHSKTLWNIHAKSDSLWIQ----WVHGEYIRDK--SVWD 675
           +C  + EGGLG+R L+ +NKALH K LW    +++SLW Q    WV   +  D+    W 
Sbjct: 180 ICADKKEGGLGIRSLATFNKALHGKWLWRFANENESLWKQIIFRWVAEAWEEDEGGDSWG 239

Query: 674 VSFPKR----DAPHFKNILLIRDQILHDCGGNLTDAQSKLASWFAGDRGT---KEAYEHF 516
           + F +     +    +++L      LH     +      L  W     GT   K  Y  F
Sbjct: 240 LRFNRHLNDWEVGEVESLL----SKLHPL--TIRRGVEDLFRWKENKNGTFFVKSFYSSF 293

Query: 515 RAKGEKKFWHKAIWRSYIPPKFSVTLWLAMRGRLKTFDRLKFS--DIPR*CMLCKAAEET 342
               +  F  + IW  ++P + S   W A   R+ T DRLK     IP  C LCK  EET
Sbjct: 294 SRDTKPPFPARTIWTPWVPIRASFFGWEAAWSRVLTTDRLKRFGWSIPNKCFLCKYKEET 353

Query: 341 NDHLFFQCPRTVEIW 297
            +HL   C +   +W
Sbjct: 354 TNHLLLFCNKARMLW 368


>gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]
          Length = 1213

 Score = 85.1 bits (209), Expect = 4e-14
 Identities = 42/106 (39%), Positives = 56/106 (52%), Gaps = 5/106 (4%)
 Frame = -3

Query: 923  LPATVIDRITKLLRKFLWVGNY-----CPVAWTQVCLPRHEGGLGLRDLSAWNKALHSKT 759
            LP   I RI  L  +FLW GN        V+W  +CLP+ EGGLGLR L  WNK L  + 
Sbjct: 824  LPKGCIKRIESLCSRFLWSGNIEQAKGIKVSWAALCLPKSEGGLGLRRLLEWNKTLSMRL 883

Query: 758  LWNIHAKSDSLWIQWVHGEYIRDKSVWDVSFPKRDAPHFKNILLIR 621
            +W +    DSLW  W H  ++   S W V   + D+  +K +L +R
Sbjct: 884  IWRLFVAKDSLWADWQHLHHLSRGSFWAVEGGQSDSWTWKRLLSLR 929



 Score = 58.9 bits (141), Expect = 3e-06
 Identities = 45/169 (26%), Positives = 73/169 (43%), Gaps = 9/169 (5%)
 Frame = -3

Query: 533  EAYEHFRAKGEKKFWHKAIWRSYIPPKFSVTLWLAMRGRLKTFDRLKF-----SDIPR*C 369
            + +E  R K   K W  +IW     PK++  +W++   RL T  RL       SD    C
Sbjct: 1037 KTWEAIRPKATVKSWASSIWFKGAVPKYAFNMWVSHLNRLLTRQRLASWGHIQSDA---C 1093

Query: 368  MLCKAAEETNDHLFFQCPRTVEIWSGICSWLKIRQRI----STISSAIRRFQQEKAGSGI 201
            +LC  A E+ DHL   C  + ++W  +   +  RQR+    S + S +R  Q       +
Sbjct: 1094 VLCSFASESRDHLLLICEFSAQVWRLVFRRICPRQRLFSSWSELLSWVR--QSSPEAPPL 1151

Query: 200  VRKAKWIALGATVSYIWYARNSLYTEGKSPVSSAIIKEIKTDVYRVLYS 54
            +RK   I     V  +W  RN+L         + I K +  ++  ++ S
Sbjct: 1152 LRK---IVSQVVVYNLWRQRNNLLHNSLRLAPAVIFKLVDREIRNIISS 1197


>ref|XP_004173049.1| PREDICTED: putative ribonuclease H protein At1g65750-like, partial
           [Cucumis sativus]
          Length = 647

 Score = 84.0 bits (206), Expect = 8e-14
 Identities = 48/133 (36%), Positives = 68/133 (51%), Gaps = 8/133 (6%)
 Frame = -3

Query: 923 LPATVIDRITKLLRKFLWVGNY-----CPVAWTQVCLPRHEGGLGLRDLSAWNKALHSKT 759
           LP  V   + K+LR +LW G         VAW +VCLP  EGGL + D S+WNKA   K 
Sbjct: 283 LPMKVHKDVDKILRSYLWRGKEEGRGGAKVAWDEVCLPFDEGGLAICDGSSWNKASTLKI 342

Query: 758 LWNIHAKSDSLWIQWVHGEYIRDKSVWDVSFPKRDAPHFKNILLIRDQILHDCG---GNL 588
           LW +  KS SLW+ WV    ++ +S+W++      +  F+ IL  RD +        GN+
Sbjct: 343 LWLLLVKSGSLWVAWVEAYILKGRSLWEIDAGAGRSWCFRAILRKRDILKAHVEMKLGNV 402

Query: 587 TDAQSKLASWFAG 549
              +  L +W  G
Sbjct: 403 RKCRMLLDAWIQG 415

