BLASTX nr result

ID: Astragalus23_contig00029829 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus23_contig00029829
         (889 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

dbj|GAU31768.1| hypothetical protein TSUD_22140 [Trifolium subte...   199   4e-54
gb|KYP66749.1| LINE-1 reverse transcriptase isogeny, partial [Ca...   181   7e-48
gb|KYP35971.1| Putative ribonuclease H protein At1g65750 family ...   167   1e-43
dbj|GAU48398.1| hypothetical protein TSUD_405430 [Trifolium subt...   155   5e-41
ref|XP_020230539.1| uncharacterized protein LOC109811261 [Cajanu...   151   4e-40
dbj|GAU36460.1| hypothetical protein TSUD_166260 [Trifolium subt...   158   7e-40
dbj|GAU17471.1| hypothetical protein TSUD_340140 [Trifolium subt...   153   2e-39
dbj|GAU25895.1| hypothetical protein TSUD_376140 [Trifolium subt...   150   3e-39
ref|XP_020225471.1| uncharacterized protein LOC109807365 [Cajanu...   147   2e-38
dbj|GAU37771.1| hypothetical protein TSUD_102880 [Trifolium subt...   150   7e-37
dbj|GAU39798.1| hypothetical protein TSUD_219730 [Trifolium subt...   135   8e-35
dbj|GAU44059.1| hypothetical protein TSUD_399580 [Trifolium subt...   144   9e-35
ref|XP_020225309.1| uncharacterized protein LOC109807197 [Cajanu...   132   3e-33
dbj|GAU37566.1| hypothetical protein TSUD_153990 [Trifolium subt...   134   4e-33
gb|PNX68200.1| pentatricopeptide repeat-containing protein, part...   129   9e-33
dbj|GAU27275.1| hypothetical protein TSUD_125560 [Trifolium subt...   130   6e-32
gb|KYP48455.1| hypothetical protein KK1_029830 [Cajanus cajan]        133   9e-32
gb|ABD28710.1| Polynucleotidyl transferase, Ribonuclease H fold ...   131   1e-31
dbj|GAU36374.1| hypothetical protein TSUD_151410 [Trifolium subt...   131   3e-31
gb|KYP45089.1| Putative ribonuclease H protein At1g65750 family ...   129   5e-31

>dbj|GAU31768.1| hypothetical protein TSUD_22140 [Trifolium subterraneum]
          Length = 1601

 Score =  199 bits (507), Expect = 4e-54
 Identities = 110/327 (33%), Positives = 167/327 (51%), Gaps = 39/327 (11%)
 Frame = -2

Query: 864  AMEDLIDNSHLRVQGAWMNIWRLKIPQKVKLLLWRVGEGVY--RSVAISGYVPCQEECPT 691
            AM+ LI+N   ++ G WM IW+L IPQ+VK+ LWR+  G    R    S  V C + CP 
Sbjct: 1266 AMDTLINNEQYKIPGDWMLIWKLSIPQRVKIFLWRIAIGCLPTRDRLQSRGVQCTDLCPH 1325

Query: 690  CESGVEDEWHIFFSCPHATALWDYTGLELEVSRPMQDATSFNTCFCYLLEQLSVHNRYIF 511
            CE+  E++WH+F SC  A  +W    L  EV   ++  +         L  L+   R  F
Sbjct: 1326 CETTYENDWHLFVSCNKAHEVWREANLWDEVCSVVETVSCIKDFIFAALAALAEPRRSEF 1385

Query: 510  AMIIWALWKSRNEKFWEGIVKPYEVSAIIAMDQLREWEQARLRSTSLVRSSIAKPSIELW 331
             M++W LWK RN+K WE  V+P  V   +A D L +W  AR R  +            + 
Sbjct: 1386 VMMLWCLWKCRNDKIWEDKVQPVRVGMQLARDMLYQWRNARRREDTTGHHDSHN---VIQ 1442

Query: 330  WRKPSIGGLKCNVDAMIF*EENKYGIG-CIR*E--------------------------- 235
            W+ P IG +KCN+DA +F E++K+G+G CIR +                           
Sbjct: 1443 WQPPPIGKVKCNIDAALFNEQHKFGLGMCIRDDHGIFVKARTKWFHGSPPPVEAEAWALK 1502

Query: 234  ---------QITSVVIEMDCLSILNGINNHSTLNNKFGVLFYHCRTIIVQHQSYRISYVR 82
                     +++ VVIE+DCL ++N I ++S   ++FG +   C  ++  + ++ IS+V+
Sbjct: 1503 EAITWMGELELSRVVIELDCLLVVNAIKSNSNNQSEFGHIISDCHRLLENYPNFEISFVK 1562

Query: 81   RQTNLVAHTLVRVSRSYASSCVHDFSP 1
            RQ N VAH+L R S+SYAS+   +  P
Sbjct: 1563 RQANFVAHSLARASKSYASTHTFNLIP 1589


>gb|KYP66749.1| LINE-1 reverse transcriptase isogeny, partial [Cajanus cajan]
          Length = 816

 Score =  181 bits (458), Expect = 7e-48
 Identities = 99/326 (30%), Positives = 157/326 (48%), Gaps = 39/326 (11%)
 Frame = -2

Query: 861  MEDLIDNSHLRVQGAWMNIWRLKIPQKVKLLLWRVGEGVY--RSVAISGYVPCQEECPTC 688
            ME +I N+ LRVQG WM +W LKIP   ++ LWR+  G    R       VPC   CP C
Sbjct: 484  MEHVISNNTLRVQGDWMKLWSLKIPHSTQIFLWRLLRGCIPTRLNLQQKGVPCTSSCPHC 543

Query: 687  ESGVEDEWHIFFSCPHATALWDYTGLELEVSRPMQDATSFNTCFCYLLEQLSVHNRYIFA 508
             +  E+EWH+F+SCP A ++W  +G    ++  ++   SF      LL  L+  +   F 
Sbjct: 544  SANQENEWHLFYSCPAALSIWIDSGCWPRIAHIVEQGISFIDTTWKLLGHLTGSDLTSFT 603

Query: 507  MIIWALWKSRNEKFWEGIVKPYEVSAIIAMDQLREWEQARLRSTSLVRSSIAKPSIELWW 328
            +++W +W+ RN+K W+    P + S  +       W     RS     +  A P +   W
Sbjct: 604  LMLWCIWRRRNDKVWKEGAPPPKTSIQLTEQHFHAW-----RSAHRNLAQTASPVVNHRW 658

Query: 327  RKPSIGGLKCNVDAMIF*EENKYGIG-CIR*EQ--------------------------- 232
             KP      CNVDA++F + + +G G C+R  +                           
Sbjct: 659  TKPPADTFTCNVDAVLFNDSSTFGFGICVRDTRGLFQTAISGWKHGLPPPHEAEAAAMLE 718

Query: 231  ---------ITSVVIEMDCLSILNGINNHSTLNNKFGVLFYHCRTIIVQHQSYRISYVRR 79
                       +V +E DC  + + +N+   L++++G++   CR+++  HQ+ ++ ++RR
Sbjct: 719  AIQYLIHSPYDNVCVETDCKQVADHLNSTQVLHSEYGIIINQCRSLLRSHQNLQVRFIRR 778

Query: 78   QTNLVAHTLVRVSRSYASSCVHDFSP 1
            Q N VAHTL RV+RS AS    DF P
Sbjct: 779  QANRVAHTLARVARSSASHHFFDFIP 804


>gb|KYP35971.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 606

 Score =  167 bits (422), Expect = 1e-43
 Identities = 102/331 (30%), Positives = 155/331 (46%), Gaps = 41/331 (12%)
 Frame = -2

Query: 870  FAAMEDLIDNSHLRVQGAWMNIWRLKIPQKVKLLLWRVGEGVY--RSVAISGYVPCQEEC 697
            +  ME L  N  LR+ G W  +W +K P   K+ LWRV  G    R      +VPC   C
Sbjct: 270  YQLMEHLTPNVDLRIPGNWSMLWSMKAPNTKKIFLWRVLRGCLPTRLNLQRRHVPCTMLC 329

Query: 696  PTCESGVEDEWHIFFSCPHATALWDYTGLELEVSRPMQDATSFNTCFCYLLEQLSVHNRY 517
            PTC +G+E+EWHIFF C  A  +W  +G   ++S+ + D+         LL+ LS     
Sbjct: 330  PTCSAGIENEWHIFFECVEAKDIWAASGFWPKISQIIADSDGIQQAIFQLLQCLSPSEAL 389

Query: 516  IFAMIIWALWKSRNEKFWEGIVKPYEVSAIIAMDQLREWEQARLRSTSLVRSSIAKPSIE 337
                ++W +W+ RN+  W   V P      +A  ++ EW  AR  +  + + +   P   
Sbjct: 390  DLLCLMWGIWRKRNDILWNNKVTPSHTVIFLARQRISEWMSAR-ETQQIPKVARNDP--- 445

Query: 336  LWWRKPSIGGLKCNVDAMIF*EENKYG-------------------------------IG 250
            + W KP    +KCNVD  IF + N  G                               + 
Sbjct: 446  ICWFKPPPEYMKCNVDVTIFTDSNCCGFAFYIRDDLGRFKAATTGWYNGSLPPNEAEAMA 505

Query: 249  CIR*EQIT--------SVVIEMDCLSILNGINNHSTLNNKFGVLFYHCRTIIVQHQSYRI 94
            C+  E IT         V+IE+DC  +++ + + ++L +++G L Y  R+++  H++  +
Sbjct: 506  CL--EAITWLANSHYEKVLIELDCKKVVDDLYDSTSLFSEYGRLSYKGRSLLALHKNLEV 563

Query: 93   SYVRRQTNLVAHTLVRVSRSYASSCVHDFSP 1
             +VRRQ N VA TL RVSR YAS    DF P
Sbjct: 564  RFVRRQANHVARTLARVSRLYASPHYFDFIP 594


>dbj|GAU48398.1| hypothetical protein TSUD_405430 [Trifolium subterraneum]
          Length = 395

 Score =  155 bits (393), Expect = 5e-41
 Identities = 96/319 (30%), Positives = 145/319 (45%), Gaps = 40/319 (12%)
 Frame = -2

Query: 861  MEDLIDNSHLRVQGAWMNIWRLKIPQKVKLLLWRVGEGVY--RSVAISGYVPCQEECPTC 688
            +++LID SHLRV G W  +W++K P KVK L+WR+       R+      V C   C  C
Sbjct: 59   VQELIDTSHLRVNGDWNLLWKIKAPPKVKNLIWRICRRCVSTRARLQDKGVNCPNLCALC 118

Query: 687  ESGVEDEWHIFFSCPHATALWDYTGLELEVSRPMQDATSFNTCFCYLLEQLSVHNRYIFA 508
                ED  H+FF CP +  +W  T     VS  + +    +     +L QLS  +  +FA
Sbjct: 119  NIEGEDSLHVFFKCPSSQNVWSMTSFFQVVSSVINNENEASAIVFQILRQLSKEDAALFA 178

Query: 507  MIIWALWKSRNEKFWEGIVKPYEVSAIIAMDQLREWEQARLRSTSLVRSSIAKPSIELWW 328
             I+W++WK RN + W  +          A + L+EW    +R+ +    S  +P     W
Sbjct: 179  CILWSIWKQRNNQIWNNVTDAQSFVFSRANNMLQEWN--TVRNVAATPVSNQQPGAACIW 236

Query: 327  RKPSIGGLKCNVDAMIF*EENKYGIG-CIR*EQ--------------------------- 232
            RKPS G +KCNVDA      NK GIG CIR +Q                           
Sbjct: 237  RKPSAGHVKCNVDASFLPHNNKVGIGICIRDDQGAFILAKTEWFSPKSEVHTGEALGLLA 296

Query: 231  ---------ITSVVIEMDCLSILNGINNHSTLNNKFGVLFYHCRTII-VQHQSYRISYVR 82
                     +  V  E+D   +++  ++      +FGV+  HC++I    +++  + +VR
Sbjct: 297  ALNWVHELNLGPVEFELDSKRVVDSFHSSKRDFTEFGVIVEHCKSIFSTYYRNSSVEFVR 356

Query: 81   RQTNLVAHTLVRVSRSYAS 25
            RQ N VAH L + +   AS
Sbjct: 357  RQANEVAHKLAKAATLSAS 375


>ref|XP_020230539.1| uncharacterized protein LOC109811261 [Cajanus cajan]
          Length = 307

 Score =  151 bits (381), Expect = 4e-40
 Identities = 89/300 (29%), Positives = 144/300 (48%), Gaps = 40/300 (13%)
 Frame = -2

Query: 780 VKLLLWRVGEGVY--RSVAISGYVPCQEECPTCESGVEDEWHIFFSCPHATALWDYTGLE 607
           +K+ LWR+  G    R      +VPC   C +C S +E+EWH+FF+C  A  +W  +G+ 
Sbjct: 1   MKIFLWRLLRGCLPTRINLQRKHVPCTTLCVSCNSELENEWHVFFTCAAAKDIWTSSGMW 60

Query: 606 LEVSRPMQDATSFNTCFCYLLEQLSVHNRYIFAMIIWALWKSRNEKFWEGIVKPYEVSAI 427
            ++   ++           LL  L          ++W +W+ RN+K W  +  P  VS  
Sbjct: 61  DKIKNIVEQGEGTTDTVFQLLNHLDTKEATELLALLWCIWRRRNDKLWNDVSSPIGVSIF 120

Query: 426 IAMDQLREWEQARLRSTSLVRS-SIAKPSIELWWRKPSIGGLKCNVDAMIF*EENKYG-- 256
           +A  +L EW  A  R+T+L  S  +A+P+   +W KP  G +KCN DA IF + N Y   
Sbjct: 121 LARQRLEEWLAA--RTTNLAPSPRVAEPN---YWVKPQPGFMKCNTDAAIFKDTNSYSFA 175

Query: 255 -----------------------------IGCIR------*EQITSVVIEMDCLSILNGI 181
                                        I CI            +V+IE+DC ++++ +
Sbjct: 176 FCLRDNHGRFKAATTGWYHGLSPRHEAEVIACIEAMSWLTNSSYENVLIELDCKTVVDDL 235

Query: 180 NNHSTLNNKFGVLFYHCRTIIVQHQSYRISYVRRQTNLVAHTLVRVSRSYASSCVHDFSP 1
           +  + L +++G+L    R+I+  H++  + ++RRQ N VAH+L R +RSYAS    DF P
Sbjct: 236 HGSNQLLSEYGLLIQKGRSILASHKNLSVRFIRRQANHVAHSLARAARSYASPHTFDFIP 295


>dbj|GAU36460.1| hypothetical protein TSUD_166260 [Trifolium subterraneum]
          Length = 1012

 Score =  158 bits (400), Expect = 7e-40
 Identities = 82/211 (38%), Positives = 119/211 (56%), Gaps = 3/211 (1%)
 Frame = -2

Query: 864  AMEDLIDNSHLRVQGAWMNIWRLKIPQKVKLLLWRVGEGVY--RSVAISGYVPCQEECPT 691
            +ME LIDN   ++ G WM IW LKIPQ+VK  +WRV  G    R       V C + CP 
Sbjct: 758  SMETLIDNEGYKLPGDWMQIWNLKIPQRVKKFMWRVLRGCLPTRDKLQRKGVQCTDLCPH 817

Query: 690  CESGVEDEWHIFFSCPHATALWDYTGLELEVSRPMQDATSFNTCFCYLLEQLSVHNRYIF 511
            CE+  E+EWH+F  C  A  +W   GL  ++++ +  A SFN+     +          F
Sbjct: 818  CETTYENEWHVFLGCEKAKRIWIEAGLWDDIAQLVVAANSFNSLVFSFMTVNLEQKCSDF 877

Query: 510  AMIIWALWKSRNEKFWEGIVKPYEVSAIIAMDQLREWEQARLRSTSLVRSSIAKPSIELW 331
             MI+W LWK RNEK WEG+ KP  +S   A + L +W + + R  ++  ++I   + ++ 
Sbjct: 878  VMIMWCLWKRRNEKIWEGVEKPVHLSINTAREYLVQWREIKARQENVRPAAI---NTQVV 934

Query: 330  WRKPSIGGLKCNVDAMIF*EENKYGIG-CIR 241
            W+ P+ G  KCNVDA +F EE ++G+G CIR
Sbjct: 935  WQPPADGEFKCNVDAALFNEEQQFGLGMCIR 965


>dbj|GAU17471.1| hypothetical protein TSUD_340140 [Trifolium subterraneum]
          Length = 479

 Score =  153 bits (387), Expect = 2e-39
 Identities = 100/328 (30%), Positives = 149/328 (45%), Gaps = 49/328 (14%)
 Frame = -2

Query: 846  DNSHLRVQGAWMNIWRLKIPQKVKLLLWRVGEGVY--RSVAISGYVPCQEECPTCESGVE 673
            DNS   + G W  IWR KIP KVK LLWR+G  V   R+   S  V C   C  C    E
Sbjct: 150  DNSG--IAGNWHQIWRAKIPPKVKNLLWRIGRNVLPTRATLNSRSVQCLVHCAVCNDSAE 207

Query: 672  DEWHIFFSCPHATALWDYTGLELEVSRPMQDATSFNTCFCYLLEQLSVHNRYIFAMIIWA 493
            D  HI F CP +T  W   GL  ++   +  + +      ++L+ L+   + IF++++W+
Sbjct: 208  DSIHILFLCPRSTECWQQAGLWNQIDAGLNTSNNIADILLFILQSLNKEQQEIFSVLLWS 267

Query: 492  LWKSRNEKFWEGIVKPYEVSAIIAMDQLREWEQA-RLRSTSLVRSSIAKPSIELWWRKPS 316
            +WK RN K W+ I +        A   L  W+QA + RS +     I + +    W KPS
Sbjct: 268  IWKRRNAKVWDNITESNTNVYERAQHLLTSWKQAQQTRSYANTPQPIQQRTN---WEKPS 324

Query: 315  IGGLKCNVDAMIF*EENKYGIG-CIR*EQ------------------------------- 232
             G  KCN+DA      NK GIG CIR +Q                               
Sbjct: 325  QGRYKCNIDASFSSTHNKVGIGMCIRDDQGRYVAAKTEWLEPILDVEIGEAMGLFSAVKW 384

Query: 231  -----ITSVVIEMDCLSILNGINNHSTLNNKFGVLFYHCRTIIVQH-QSYRISYVRRQTN 70
                 ++ V  EMDC  +++ +++  T N+  G +   CR I+  +  +  + ++RRQ N
Sbjct: 385  VDELRLSDVDFEMDCKRVVDCLHSSRTYNSDLGDILRDCRVILATNLVNSHVKFIRRQAN 444

Query: 69   LVAHTLVRVSRSYAS--------SCVHD 10
             VAH L R +   AS        +C++D
Sbjct: 445  EVAHRLAREATCLASFHIFIDIPTCIYD 472


>dbj|GAU25895.1| hypothetical protein TSUD_376140 [Trifolium subterraneum]
          Length = 372

 Score =  150 bits (380), Expect = 3e-39
 Identities = 79/213 (37%), Positives = 114/213 (53%), Gaps = 11/213 (5%)
 Frame = -2

Query: 813 MNIWRLKIPQKVKLLLWRVGEGVY--RSVAISGYVPCQEECPTCESGVEDEWHIFFSCPH 640
           M IW +KIPQK+K+ LWR   G    R    +  V C + C  CE   E++WH+FF C  
Sbjct: 1   MQIWNMKIPQKIKVFLWRAARGCLPTRERLRTRGVQCTDRCVHCEQSFENDWHVFFGCNK 60

Query: 639 ATALWDYTGLELEVSRPMQDATSFNTCFCYLLEQLSVHNRYIFAMIIWALWKSRNEKFWE 460
              +W   GL   +   ++ A  F   F  LLE LS HN ++FAM +W++WK RN+K W 
Sbjct: 61  VEEVWAEAGLWSFIRDKLEIADGFVALFFQLLELLSQHNLHMFAMTMWSIWKRRNDKLWN 120

Query: 459 GIVKPYEVSAIIAMDQLREWE---QARLRSTSLVRS-----SIAKPSIELWWRKPSIGGL 304
           GI     VS ++A D L +W+   Q R  + ++  S     ++   S  + WRKP  G +
Sbjct: 121 GIETRPTVSIMLARDSLHQWQLIRQKRQHTAAVTGSDSSAATLHSSSNTIRWRKPGTGEV 180

Query: 303 KCNVDAMIF*EENKYGIG-CIR*EQITSVVIEM 208
           KCNVDA IF +   YG+G C+R +    +  +M
Sbjct: 181 KCNVDAAIFKDHGCYGVGICLRGDNCEFIAAKM 213


>ref|XP_020225471.1| uncharacterized protein LOC109807365 [Cajanus cajan]
          Length = 319

 Score =  147 bits (371), Expect = 2e-38
 Identities = 88/308 (28%), Positives = 137/308 (44%), Gaps = 39/308 (12%)
 Frame = -2

Query: 807 IWRLKIPQKVKLLLWRVGEGVY--RSVAISGYVPCQEECPTCESGVEDEWHIFFSCPHAT 634
           +W L IP  +K+ LWR+       R       VPC   CP CE+  E+ WHIFF C  A 
Sbjct: 4   LWALPIPHNMKIFLWRLLRDCLPSRQRLQQKGVPCTSLCPHCEAAQENNWHIFFGCQEAQ 63

Query: 633 ALWDYTGLELEVSRPMQDATSFNTCFCYLLEQLSVHNRYIFAMIIWALWKSRNEKFWEGI 454
            +W  TG+   +   +            LL  +S  +     + +  +W+ RN K W+  
Sbjct: 64  TVWQATGIWQHIKSLVDVGEGIVEVIFSLLGSISQSHIVEVVVTLSCIWRRRNAKVWDQG 123

Query: 453 VKPYEVSAIIAMDQLREWEQARLRSTSLVRSSIAKPSIELWWRKPSIGGLKCNVDAMIF* 274
             P  V+   A    R+W+ A+ RS+    +    P  +L W+KP  G   CN+DA +F 
Sbjct: 124 APPSGVATSQAKQYFRDWQAAQARSS----TQRTPPVHDLQWKKPHAGTFTCNIDAALFQ 179

Query: 273 EENKYGIG-CIR*E------------------------------------QITSVVIEMD 205
           + + +G   CIR +                                     +T V IE D
Sbjct: 180 DSSYFGYSMCIRNDHGQFLTAKTGWAHGLPPVHEAEATALLTAIQWIVTLSLTHVTIESD 239

Query: 204 CLSILNGINNHSTLNNKFGVLFYHCRTIIVQHQSYRISYVRRQTNLVAHTLVRVSRSYAS 25
           C S+L+ ++   + ++++G L   CR ++  H +  + ++ RQ N VAH L RVSR YAS
Sbjct: 240 CKSVLDALSGTQSHHSEYGSLLNKCRGLLHNHPNLSLKFIPRQANRVAHCLARVSRCYAS 299

Query: 24  SCVHDFSP 1
           S + +F P
Sbjct: 300 SHIFEFIP 307


>dbj|GAU37771.1| hypothetical protein TSUD_102880 [Trifolium subterraneum]
          Length = 1688

 Score =  150 bits (378), Expect = 7e-37
 Identities = 83/233 (35%), Positives = 118/233 (50%), Gaps = 23/233 (9%)
 Frame = -2

Query: 870  FAAMEDLIDNSHLRVQGAWMNIWRLKIPQKVKLLLWRVGEGVY--RSVAISGYVPCQEEC 697
            +  ME+L+DN+ LRV+G W  IW LKIPQK+K+ LWR   G    R       V C   C
Sbjct: 1398 YYTMENLVDNTGLRVEGNWGKIWELKIPQKMKVFLWRAARGCLPTRYRLQQKGVNCPHTC 1457

Query: 696  PTCESGVEDEWHIFFSCPHATALWDYTGLELEVSRPMQDATSFNTCFCYLLEQLSVHNRY 517
              C++  E++WH+FF C  A  +W+  GL   +    +    F + F  LLE LS H   
Sbjct: 1458 AYCQNNFENDWHVFFGCVKAQEIWEEAGLWSFIEGMFESTEGFVSLFFSLLELLSQHKII 1517

Query: 516  IFAMIIWALWKSRNEKFWEGIVKPYEVSAIIAMDQLREWEQAR------------LRSTS 373
            +F    W +WK RN+K WE I     VS  +A D + +W+ A+            L  ++
Sbjct: 1518 LFVAAFWCIWKRRNQKIWEDIELHPSVSLQLASDIIYQWKTAQTSHQRQQTSAAILPHSA 1577

Query: 372  LVRS--------SIAKPSIELWWRKPSIGGLKCNVDAMIF*EENKYGIG-CIR 241
              R+        S+   ++ + W  P  G LKCNVDA IF E+N +G G C+R
Sbjct: 1578 ATRNASGEERSVSVTTSAVRVIWTPPVQGMLKCNVDAAIFKEQNCFGAGMCLR 1630


>dbj|GAU39798.1| hypothetical protein TSUD_219730 [Trifolium subterraneum]
          Length = 249

 Score =  135 bits (341), Expect = 8e-35
 Identities = 73/197 (37%), Positives = 104/197 (52%), Gaps = 11/197 (5%)
 Frame = -2

Query: 798 LKIPQKVKLLLWRVGEGVY--RSVAISGYVPCQEECPTCESGVEDEWHIFFSCPHATALW 625
           +KIPQKVK+ LWR   G    R    +  V C + C  CE   E++WH+FF C     +W
Sbjct: 1   MKIPQKVKVFLWRAARGCLPTRERLRTRGVQCTDRCVHCEQSFENDWHVFFGCNKVEEVW 60

Query: 624 DYTGLELEVSRPMQDATSFNTCFCYLLEQLSVHNRYIFAMIIWALWKSRNEKFWEGIVKP 445
               L   +   ++ A  F   F  LLE LS HN ++FAM +W +WK RN+K W GI   
Sbjct: 61  AEARLWSFIRDKLEIADGFVALFFQLLELLSQHNLHMFAMTMWCIWKRRNDKLWNGIETR 120

Query: 444 YEVSAIIAMDQLREWE---QARLRSTSLVRSSIAKPSIE-----LWWRKPSIGGLKCNVD 289
             VS ++A D L +W+   Q R  + ++  S  +  ++      + WRKP  G +KCNVD
Sbjct: 121 PTVSIMLACDSLHQWQLIRQKRQHTAAVTGSDSSAATLHSSNNTIRWRKPGTGEVKCNVD 180

Query: 288 AMIF*EENKYGIG-CIR 241
           A IF +    G+G C+R
Sbjct: 181 AAIFKDHGCCGVGICLR 197


>dbj|GAU44059.1| hypothetical protein TSUD_399580 [Trifolium subterraneum]
          Length = 1229

 Score =  144 bits (362), Expect = 9e-35
 Identities = 89/311 (28%), Positives = 139/311 (44%), Gaps = 40/311 (12%)
 Frame = -2

Query: 852  LIDNSHLRVQGAWMNIWRLKIPQKVKLLLWRVGEGVYRS---VAISGYVPCQEECPTCES 682
            L  +    V G W  IW ++IP K+K   WR+      +   + I G V CQ  C  C +
Sbjct: 900  LTSHDSFNVSGDWRKIWTMQIPPKLKHFCWRMLRYCLPTRLKLHIRG-VNCQTTCAVCSN 958

Query: 681  GVEDEWHIFFSCPHATALWDYTGLELEVSRPMQDATSFNTCFCYLLEQLSVHNRYIFAMI 502
              EDE H+FF CPHA + W    L   + + M  + SF++    +L  L    +  F  I
Sbjct: 959  ATEDELHLFFDCPHAISCWKELNLWQRLEQKMHQSGSFSSIIFAILADLDADTQARFVAI 1018

Query: 501  IWALWKSRNEKFWEGIVKPYEVSAIIAMDQLREWEQARLRSTSLVRSSIAKPSIELWWRK 322
            +W++W++RN+  WE        +  +A D + ++        +++ ++ + P +   W+K
Sbjct: 1019 LWSIWRTRNDCLWEHKQPSTVTTCRLATDIVSDYTWC----CNMLDTTQSSPPVHR-WKK 1073

Query: 321  PSIGGLKCNVDAMIF*EENKYGIG-CIR*EQ----------------------------- 232
            P    LKCNVD  IF  E K+GIG C R +Q                             
Sbjct: 1074 PEANWLKCNVDGAIFSTEGKFGIGICFRNDQGILVQAHTMYFPFEVTVNECEASALKYAL 1133

Query: 231  -------ITSVVIEMDCLSILNGINNHSTLNNKFGVLFYHCRTIIVQHQSYRISYVRRQT 73
                      V+ E D  +++N I N     N+ G L   C++++    SY +++VRRQ 
Sbjct: 1134 LIALSSGFERVIFESDSQTVVNSILNDYRYENELGSLLSACKSLLSVIASYNVAFVRRQA 1193

Query: 72   NLVAHTLVRVS 40
            N VAH L R S
Sbjct: 1194 NRVAHNLARAS 1204


>ref|XP_020225309.1| uncharacterized protein LOC109807197 [Cajanus cajan]
          Length = 273

 Score =  132 bits (332), Expect = 3e-33
 Identities = 76/264 (28%), Positives = 120/264 (45%), Gaps = 37/264 (14%)
 Frame = -2

Query: 681 GVEDEWHIFFSCPHATALWDYTGLELEVSRPMQDATSFNTCFCYLLEQLSVHNRYIFAMI 502
           G+E+EWH+FF C  A A+W+ +G+   +S  + +   F     +LL  LS  N     + 
Sbjct: 2   GLENEWHLFFDCAEAQAIWNASGIWTLISHAVNNGNDFKETLGHLLNSLSHENIVKMVVS 61

Query: 501 IWALWKSRNEKFWEGIVKPYEVSAIIAMDQLREWEQARLRSTSLVRSSIAKPSIELWWRK 322
           +W +W+  N K W     P  +    +M +  EW+ AR +       S +  S    W +
Sbjct: 62  LWCIWQRHNNKIWSNTTTPPHLVISQSMQKFEEWQHARAKEHPPPTQSSSPGS----WTR 117

Query: 321 PSIGGLKCNVDAMIF*EENKYGIG-CIR-------------------------------- 241
           P +G +K NVDA IF E+NK G G C+R                                
Sbjct: 118 PQVGFIKGNVDATIFKEDNKVGFGICLRDATGSLIKAKSGWLYGVAPPHEEEATTLLESI 177

Query: 240 ----*EQITSVVIEMDCLSILNGINNHSTLNNKFGVLFYHCRTIIVQHQSYRISYVRRQT 73
                +  T V++E D   ++  I N +   +++G   + C +++  H +  + ++RRQ 
Sbjct: 178 RWVCDQGYTRVILESDSKQVVEDILNSNIYYSEYGHTLHRCHSLLNSHPNLLVRFIRRQA 237

Query: 72  NLVAHTLVRVSRSYASSCVHDFSP 1
           N VAH+L R SR YASS V  F P
Sbjct: 238 NHVAHSLTRTSRYYASSHVFYFIP 261


>dbj|GAU37566.1| hypothetical protein TSUD_153990 [Trifolium subterraneum]
          Length = 343

 Score =  134 bits (336), Expect = 4e-33
 Identities = 72/213 (33%), Positives = 110/213 (51%), Gaps = 3/213 (1%)
 Frame = -2

Query: 861 MEDLIDNSHLRVQGAWMNIWRLKIPQKVKLLLWRVGEGVY--RSVAISGYVPCQEECPTC 688
           +++LID S+LRV G W  +W +K+P KVK L+WR+       R       V C + C  C
Sbjct: 124 VQELIDTSYLRVNGNWNLVWNIKVPPKVKNLIWRICRRCLPTRVRLRDKGVECTQTCALC 183

Query: 687 ESGVEDEWHIFFSCPHATALWDYTGLELEVSRPMQDATSFNTCFCYLLEQLSVHNRYIFA 508
               ED  HIFF CP +  +W  TG    VS  + +  +      ++L+QLS  +  +FA
Sbjct: 184 NEENEDSEHIFFKCPSSRNVWSMTGFFHVVSNAINNNNNAQDIIFHILQQLSKDDSTVFA 243

Query: 507 MIIWALWKSRNEKFWEGIVKPYEVSAIIAMDQLREWEQARLRSTSLVRSSIAKPSIELWW 328
            I+W++WK RN + W  +          A++ L+EW+   + +++    S  +  +   W
Sbjct: 244 CILWSIWKQRNNQIWNNVTDAQNFVLSRAVNMLQEWKAVCIVASN--PDSKTQEPLARKW 301

Query: 327 RKPSIGGLKCNVDAMIF*EENKYGIG-CIR*EQ 232
           RKP  G +KCN+DA      +  GIG CIR EQ
Sbjct: 302 RKPMAGRVKCNIDASFPANSDIVGIGICIRDEQ 334


>gb|PNX68200.1| pentatricopeptide repeat-containing protein, partial [Trifolium
           pratense]
          Length = 220

 Score =  129 bits (325), Expect = 9e-33
 Identities = 63/164 (38%), Positives = 89/164 (54%), Gaps = 2/164 (1%)
 Frame = -2

Query: 870 FAAMEDLIDNSHLRVQGAWMNIWRLKIPQKVKLLLWRVGEGVY--RSVAISGYVPCQEEC 697
           +  ME+L+DN+ LRV+G W  IW LKIPQK+K+ LWR   G    R       V C   C
Sbjct: 23  YYTMENLVDNTGLRVEGNWGKIWGLKIPQKMKVFLWRAARGCLPTRYRLQRKGVNCPHTC 82

Query: 696 PTCESGVEDEWHIFFSCPHATALWDYTGLELEVSRPMQDATSFNTCFCYLLEQLSVHNRY 517
             C++  E++WH+FF C  A  +W+  GL   +    + A  F + F  LLE LS HN  
Sbjct: 83  AYCQNNFENDWHVFFGCVKAQEIWEEAGLWSLIEGMFESAEGFVSLFFSLLELLSQHNII 142

Query: 516 IFAMIIWALWKSRNEKFWEGIVKPYEVSAIIAMDQLREWEQARL 385
           +F    W +WK RN+K WE I     VS  +A D + +W+  ++
Sbjct: 143 LFVAAFWCIWKRRNQKIWEDIELRPSVSLQLATDIIYQWKTTQI 186


>dbj|GAU27275.1| hypothetical protein TSUD_125560 [Trifolium subterraneum]
          Length = 330

 Score =  130 bits (327), Expect = 6e-32
 Identities = 86/322 (26%), Positives = 143/322 (44%), Gaps = 47/322 (14%)
 Frame = -2

Query: 825 QGAWMNIWRLKIPQKVKLLLWRVGEGVY--RSVAISGYVPCQEECPTCESGVEDEWHIFF 652
           Q  W ++W++  P K K LLWR+ +G    R+     +VPC   CP C+   ED+WH+FF
Sbjct: 7   QEDWSSLWKIHAPPKAKHLLWRICKGCIPTRTRLHERFVPCPLICPVCDQCNEDDWHVFF 66

Query: 651 SCPHATALWDYTGLELEVSRPMQDATS-----FNTCFCYLLEQLSVHNRYI---FAMIIW 496
           +C  +       GLE  +S  +Q   +     FN C           +R I   FA+++W
Sbjct: 67  TCNDSIHARQAAGLEHVISTRLQQLRTTQEVIFNIC--------KGEDRMIAGQFAVLLW 118

Query: 495 ALWKSRNEKFWEGIVKPYEVSAIIAMDQLREWEQARLRSTSLVRSSIAKPSIELWWRKPS 316
            LW +RN+K W     P     I A     EW   +       R++  +  I+  W KP 
Sbjct: 119 TLWNNRNDKVWNESRTPGRSLGIKASQFWHEWFAIQKVQQQSPRAAQQQQFIK--WEKPP 176

Query: 315 IGGLKCNVDAMIF*EENKYGIG-CIR*EQ------------------------------- 232
           +G  KCNVDA  +   ++   G C+R  Q                               
Sbjct: 177 MGWHKCNVDAGFYHNLHRTTAGWCLRDHQGSFVRAGTSWSNGNYYIAEGEAAAVLDAMKA 236

Query: 231 -----ITSVVIEMDCLSILNGINNHSTLNNKFGVLFYHCRTIIVQHQSYRISYVRRQTNL 67
                +T V+ E D  S+++ I N    +++F  +  + +  ++ + ++ + +++RQ N+
Sbjct: 237 VENQGVTHVIFETDSKSVVDAIYNFHGGSSEFSSIICNIKNALLSNPNFVVKFIKRQANM 296

Query: 66  VAHTLVRVSRSYASSCVHDFSP 1
           VAHTL R + S+++ C  D  P
Sbjct: 297 VAHTLARAAISWSNRCTFDLLP 318


>gb|KYP48455.1| hypothetical protein KK1_029830 [Cajanus cajan]
          Length = 536

 Score =  133 bits (335), Expect = 9e-32
 Identities = 68/210 (32%), Positives = 104/210 (49%), Gaps = 3/210 (1%)
 Frame = -2

Query: 861 MEDLIDNSHLRVQGAWMNIWRLKIPQKVKLLLWRVGEGVYRSVAISGY--VPCQEECPTC 688
           ME +I N+ LRVQG WM +W LKIP   ++ LWR+  G   +        V C   CP C
Sbjct: 329 MEHVISNNTLRVQGDWMKLWSLKIPHSTQIFLWRLLRGCIPTCLNLQQKGVSCTSSCPHC 388

Query: 687 ESGVEDEWHIFFSCPHATALWDYTGLELEVSRPMQDATSFNTCFCYLLEQLSVHNRYIFA 508
            +  E+EWH+F+SCP A ++W  +G    ++R ++   SF      LL  L+  +   F 
Sbjct: 389 SANQENEWHLFYSCPAAISIWIDSGCWPRIARIVEQGISFIDTTWKLLGHLTSSDLTSFT 448

Query: 507 MIIWALWKSRNEKFWEGIVKPYEVSAIIAMDQLREWEQARLRSTSLVRSSIAKPSIELWW 328
           +++W +W+ RN+K W+    P   S  +       W+ A    T       A P +   W
Sbjct: 449 LMLWCIWRWRNDKVWKESAPPPRTSIQLTEQHFHAWQSAHRNLT-----QNASPVVNHRW 503

Query: 327 RKPSIGGLKCNVDAMIF*EENKYGIG-CIR 241
            KP      CNVDA +F + + +G+  C+R
Sbjct: 504 TKPPANTFTCNVDAALFKDSSTFGLSICVR 533


>gb|ABD28710.1| Polynucleotidyl transferase, Ribonuclease H fold [Medicago
            truncatula]
          Length = 393

 Score =  131 bits (329), Expect = 1e-31
 Identities = 99/344 (28%), Positives = 146/344 (42%), Gaps = 57/344 (16%)
 Frame = -2

Query: 861  MEDLIDNSHLRVQGAWMNIWRLKIPQKVKLLLWRVGEGVY--RSVAISGYVPCQEECPTC 688
            +ED+++N+HLR  G W  IWRLK+P +VK L+WRV    +  R   IS  V C   C  C
Sbjct: 41   VEDVVNNAHLRKPGYWSGIWRLKVPPRVKKLVWRVCRECFPTRVRLISRGVNCPSACVKC 100

Query: 687  ESGVEDEWHIFFSCPHATALWDYTGLELEVSRPMQDATSFNTCFCYLLEQLSVHNRYIFA 508
            E   ED +HIFF C  A  +W+   +   ++  +    +       LL++LS        
Sbjct: 101  EDPHEDCYHIFFHCRTAIDVWNTANVWHLIAPSLSQFDNAPDIIFNLLQKLSASQMESIV 160

Query: 507  MIIWALWKSRNEKFWEGIVKPYEVSAIIAMDQLREWEQARLRSTSLVR---SSIAKPSI- 340
             I+W++WKSRN K W+ + +        A   L  W +A  +   L +    + ++P   
Sbjct: 161  TIMWSIWKSRNLKLWQQVSESSVTILERAKHLLEGWRKANHKQGLLGQVHSPTNSRPQTH 220

Query: 339  ----------ELWWRKPSIGGLKCNVDAMIF*EENKYGIG-CIR*E-------------- 235
                       + WRKP  G LKCNVDA      NK GIG CIR                
Sbjct: 221  DSQNTDNRYGNIRWRKPKSGRLKCNVDASFSTSSNKVGIGMCIRDSEGNHVRSKTMWFSP 280

Query: 234  ----------------------QITSVVIEMDCLSILNGINNHSTLNNKFGVLFYH---- 133
                                  Q+T+V  E+D  +I +  N     N +FG +  +    
Sbjct: 281  LCPVNIGEALGLYHATRWINELQLTNVDFEVDSKTIADYFNKARGDNTEFGSIMENTIQF 340

Query: 132  CRTIIVQHQSYRISYVRRQTNLVAHTLVRVSRSYASSCVHDFSP 1
            C   +    +  + + RRQ N VAH L + +    S  + D SP
Sbjct: 341  CNIFLT---NSHVEFTRRQANEVAHELAKAATLGPSFHIFDESP 381


>dbj|GAU36374.1| hypothetical protein TSUD_151410 [Trifolium subterraneum]
          Length = 474

 Score =  131 bits (329), Expect = 3e-31
 Identities = 85/320 (26%), Positives = 144/320 (45%), Gaps = 41/320 (12%)
 Frame = -2

Query: 861  MEDLIDNSHLRVQGAWMNIWRLKIPQKVKLLLWRVGEGVY--RSVAISGYVPCQEECPTC 688
            +++ +D SHL++ G W  IW+LK+P +VK L+WRV       R+   +  V C   C  C
Sbjct: 139  VQEELDTSHLKMTGDWNLIWKLKVPPRVKNLVWRVCRQCIPTRTNLQNRGVNCTTVCALC 198

Query: 687  ESGVEDEWHIFFSCPHATALWDYTGLELEVSRPMQDATSFNTCFCYLLEQLSVHNRYIFA 508
                ED  HIFF C  ++ +W        ++  +Q           +L+QL+V    + A
Sbjct: 199  NEYDEDSGHIFFDCLSSSNIWSMCTFNHVITAGLQHYAGVTELIFAVLQQLNVDEAALMA 258

Query: 507  MIIWALWKSRNEKFWEGIVKPYEVSAIIAMDQLREWEQAR-LRSTSLVRSSIAKPSIELW 331
             IIW++WK RN + W  +     V    A+  L +W   + +R+ +  +  I    IE  
Sbjct: 259  CIIWSIWKQRNNQIWNNVTDAQSVVFSRAVTTLHDWCVVQVIRNDTREQQRI----IEHK 314

Query: 330  WRKPSIGGLKCNVDAMIF*EENKYGIG-CIR*E--------------------------- 235
            W+KP+ G +KCN+DA      N+ GIG CIR E                           
Sbjct: 315  WKKPNNGRVKCNIDASFSRNLNRVGIGICIRDEYGIYVMAKYDQFSPICDVRIGEALGLL 374

Query: 234  ---------QITSVVIEMDCLSILNGINNHSTLNNKFGVLFYHCRTII-VQHQSYRISYV 85
                         V  E+D   +++   ++   +++FG +  HCR +  + + +  + ++
Sbjct: 375  SALRWVHELNFGPVDFELDSKLVVDSFRSNKYNDSEFGEIIAHCRRLFSLLYNNSSVEFI 434

Query: 84   RRQTNLVAHTLVRVSRSYAS 25
            RRQ N + H+L + +   AS
Sbjct: 435  RRQANKIVHSLSKAATYVAS 454


>gb|KYP45089.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 406

 Score =  129 bits (325), Expect = 5e-31
 Identities = 73/213 (34%), Positives = 105/213 (49%), Gaps = 3/213 (1%)
 Frame = -2

Query: 870 FAAMEDLIDNSHLRVQGAWMNIWRLKIPQKVKLLLWRVGEGVY--RSVAISGYVPCQEEC 697
           +  ME LI N+HL V G W  IW LK+   +K+ LWR+       R       +P    C
Sbjct: 165 YYVMESLISNTHLHVPGNWKQIWSLKVLNTMKIFLWRIARRCLPSRMNLQQRGIPRTSLC 224

Query: 696 PTCESGVEDEWHIFFSCPHATALWDYTGLELEVSRPMQDATSFNTCFCYLLEQLSVHNRY 517
             C    E+EWHIFF C  A ++W   GL    +  + +   F      L+  L      
Sbjct: 225 AHCSLNQENEWHIFFGCQTAESIWMTFGLWPSTNAYIDNGEDFKDTIFSLISNLHHDIAC 284

Query: 516 IFAMIIWALWKSRNEKFWEGIVKPYEVSAIIAMDQLREWEQARLRSTSLVRSSIAKPSIE 337
              +I+W++W++RN+K W     P  ++   AM +  EW+ A+++  S   +S  +P + 
Sbjct: 285 KVIIILWSIWRNRNDKVWSDTTTPPGIAVHKAMQRYSEWQFAKVKDKS---TSQQQPHVN 341

Query: 336 LWWRKPSIGGLKCNVDAMIF*EENKYGIG-CIR 241
             W KP  G LKCNVDA +F EEN  G G CIR
Sbjct: 342 T-WTKPLPGLLKCNVDAAVFKEENIMGFGLCIR 373


Top