BLASTX nr result

ID: Astragalus22_contig00033023 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus22_contig00033023
         (887 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

dbj|GAU31768.1| hypothetical protein TSUD_22140 [Trifolium subte...   199   6e-54
dbj|GAU39798.1| hypothetical protein TSUD_219730 [Trifolium subt...   167   4e-47
ref|XP_020230539.1| uncharacterized protein LOC109811261 [Cajanu...   167   4e-46
dbj|GAU25895.1| hypothetical protein TSUD_376140 [Trifolium subt...   167   9e-46
ref|XP_020225471.1| uncharacterized protein LOC109807365 [Cajanu...   165   2e-45
dbj|GAU36460.1| hypothetical protein TSUD_166260 [Trifolium subt...   172   1e-44
dbj|GAU18498.1| hypothetical protein TSUD_366810 [Trifolium subt...   156   5e-42
gb|KYP35971.1| Putative ribonuclease H protein At1g65750 family ...   160   2e-41
dbj|GAU48398.1| hypothetical protein TSUD_405430 [Trifolium subt...   152   8e-40
dbj|GAU36374.1| hypothetical protein TSUD_151410 [Trifolium subt...   154   9e-40
dbj|GAU17471.1| hypothetical protein TSUD_340140 [Trifolium subt...   154   1e-39
gb|KYP66749.1| LINE-1 reverse transcriptase isogeny, partial [Ca...   156   3e-39
dbj|GAU44081.1| hypothetical protein TSUD_399630 [Trifolium subt...   148   3e-37
dbj|GAU10454.1| hypothetical protein TSUD_423510, partial [Trifo...   140   3e-36
gb|KYP45089.1| Putative ribonuclease H protein At1g65750 family ...   142   8e-36
gb|PNY17850.1| ribonuclease H [Trifolium pratense]                    140   2e-35
dbj|GAU50352.1| hypothetical protein TSUD_288030 [Trifolium subt...   142   2e-35
dbj|GAU47271.1| hypothetical protein TSUD_280940 [Trifolium subt...   143   1e-34
gb|KYP36545.1| hypothetical protein KK1_042329 [Cajanus cajan]        135   2e-34
gb|KYP46236.1| Putative ribonuclease H protein At1g65750 family ...   140   3e-34

>dbj|GAU31768.1| hypothetical protein TSUD_22140 [Trifolium subterraneum]
          Length = 1601

 Score =  199 bits (506), Expect = 6e-54
 Identities = 109/269 (40%), Positives = 143/269 (53%)
 Frame = -2

Query: 859  SIPNKVKLLLWRVARGCLPTRMNLQRRHVPCEEVYPMCGVGIEDEIHLFFKCVEAKPMWE 680
            SIP +VK+ LWR+A GCLPTR  LQ R V C ++ P C    E++ HLF  C +A  +W 
Sbjct: 1289 SIPQRVKIFLWRIAIGCLPTRDRLQSRGVQCTDLCPHCETTYENDWHLFVSCNKAHEVWR 1348

Query: 679  RCRXXXXXXXXXXXXXVFAYCFFNLFDSLTEIKCKETSVIFWAIWKRHNGKVWEDVLKPV 500
                                  F    +L E +  E  ++ W +WK  N K+WED ++PV
Sbjct: 1349 EANLWDEVCSVVETVSCIKDFIFAALAALAEPRRSEFVMMLWCLWKCRNDKIWEDKVQPV 1408

Query: 499  AVSQHAALDYLCDWELLRL*PRHXXXXXXXXXXAVQQQWQRPEPGVLKCNIDPALIDQEG 320
             V    A D L  W   R   R            +Q  WQ P  G +KCNID AL +++ 
Sbjct: 1409 RVGMQLARDMLYQWRNARR--REDTTGHHDSHNVIQ--WQPPPIGKVKCNIDAALFNEQH 1464

Query: 319  KYGVAFCIRDENGIFMEAKTMWFEGVPEPQEVEAMGLWQALKWSMELGLEAVVIETDCLL 140
            K+G+  CIRD++GIF++A+T WF G P P E EA  L +A+ W  EL L  VVIE DCLL
Sbjct: 1465 KFGLGMCIRDDHGIFVKARTKWFHGSPPPVEAEAWALKEAITWMGELELSRVVIELDCLL 1524

Query: 139  VVNAVNKASILNTEFDVIISHCKIRILLN 53
            VVNA+   S   +EF  IIS C  R+L N
Sbjct: 1525 VVNAIKSNSNNQSEFGHIISDCH-RLLEN 1552


>dbj|GAU39798.1| hypothetical protein TSUD_219730 [Trifolium subterraneum]
          Length = 249

 Score =  167 bits (424), Expect = 4e-47
 Identities = 89/243 (36%), Positives = 122/243 (50%), Gaps = 7/243 (2%)
 Frame = -2

Query: 856 IPNKVKLLLWRVARGCLPTRMNLQRRHVPCEEVYPMCGVGIEDEIHLFFKCVEAKPMWER 677
           IP KVK+ LWR ARGCLPTR  L+ R V C +    C    E++ H+FF C + + +W  
Sbjct: 3   IPQKVKVFLWRAARGCLPTRERLRTRGVQCTDRCVHCEQSFENDWHVFFGCNKVEEVWAE 62

Query: 676 CRXXXXXXXXXXXXXVFAYCFFNLFDSLTEIKCKETSVIFWAIWKRHNGKVWEDVLKPVA 497
            R              F   FF L + L++      ++  W IWKR N K+W  +     
Sbjct: 63  ARLWSFIRDKLEIADGFVALFFQLLELLSQHNLHMFAMTMWCIWKRRNDKLWNGIETRPT 122

Query: 496 VSQHAALDYLCDWELLRL*PRHXXXXXXXXXXAVQQ-------QWQRPEPGVLKCNIDPA 338
           VS   A D L  W+L+R   +H          A          +W++P  G +KCN+D A
Sbjct: 123 VSIMLACDSLHQWQLIRQKRQHTAAVTGSDSSAATLHSSNNTIRWRKPGTGEVKCNVDAA 182

Query: 337 LIDQEGKYGVAFCIRDENGIFMEAKTMWFEGVPEPQEVEAMGLWQALKWSMELGLEAVVI 158
           +    G  GV  C+R +NG F+ AKT WF G+P+PQE EA GL + + W  + GL AV I
Sbjct: 183 IFKDHGCCGVGICLRGDNGEFIAAKTAWFYGLPQPQEAEACGLRETILWLGDRGLTAVSI 242

Query: 157 ETD 149
           E D
Sbjct: 243 ELD 245


>ref|XP_020230539.1| uncharacterized protein LOC109811261 [Cajanus cajan]
          Length = 307

 Score =  167 bits (422), Expect = 4e-46
 Identities = 86/254 (33%), Positives = 124/254 (48%)
 Frame = -2

Query: 844 VKLLLWRVARGCLPTRMNLQRRHVPCEEVYPMCGVGIEDEIHLFFKCVEAKPMWERCRXX 665
           +K+ LWR+ RGCLPTR+NLQR+HVPC  +   C   +E+E H+FF C  AK +W      
Sbjct: 1   MKIFLWRLLRGCLPTRINLQRKHVPCTTLCVSCNSELENEWHVFFTCAAAKDIWTSSGMW 60

Query: 664 XXXXXXXXXXXVFAYCFFNLFDSLTEIKCKETSVIFWAIWKRHNGKVWEDVLKPVAVSQH 485
                            F L + L   +  E   + W IW+R N K+W DV  P+ VS  
Sbjct: 61  DKIKNIVEQGEGTTDTVFQLLNHLDTKEATELLALLWCIWRRRNDKLWNDVSSPIGVSIF 120

Query: 484 AALDYLCDWELLRL*PRHXXXXXXXXXXAVQQQWQRPEPGVLKCNIDPALIDQEGKYGVA 305
            A   L +W   R               A    W +P+PG +KCN D A+      Y  A
Sbjct: 121 LARQRLEEWLAART-----TNLAPSPRVAEPNYWVKPQPGFMKCNTDAAIFKDTNSYSFA 175

Query: 304 FCIRDENGIFMEAKTMWFEGVPEPQEVEAMGLWQALKWSMELGLEAVVIETDCLLVVNAV 125
           FC+RD +G F  A T W+ G+    E E +   +A+ W      E V+IE DC  VV+ +
Sbjct: 176 FCLRDNHGRFKAATTGWYHGLSPRHEAEVIACIEAMSWLTNSSYENVLIELDCKTVVDDL 235

Query: 124 NKASILNTEFDVII 83
           + ++ L +E+ ++I
Sbjct: 236 HGSNQLLSEYGLLI 249


>dbj|GAU25895.1| hypothetical protein TSUD_376140 [Trifolium subterraneum]
          Length = 372

 Score =  167 bits (424), Expect = 9e-46
 Identities = 91/254 (35%), Positives = 128/254 (50%), Gaps = 7/254 (2%)
 Frame = -2

Query: 856 IPNKVKLLLWRVARGCLPTRMNLQRRHVPCEEVYPMCGVGIEDEIHLFFKCVEAKPMWER 677
           IP K+K+ LWR ARGCLPTR  L+ R V C +    C    E++ H+FF C + + +W  
Sbjct: 8   IPQKIKVFLWRAARGCLPTRERLRTRGVQCTDRCVHCEQSFENDWHVFFGCNKVEEVWAE 67

Query: 676 CRXXXXXXXXXXXXXVFAYCFFNLFDSLTEIKCKETSVIFWAIWKRHNGKVWEDVLKPVA 497
                           F   FF L + L++      ++  W+IWKR N K+W  +     
Sbjct: 68  AGLWSFIRDKLEIADGFVALFFQLLELLSQHNLHMFAMTMWSIWKRRNDKLWNGIETRPT 127

Query: 496 VSQHAALDYLCDWELLRL*PRHXXXXXXXXXXAVQQ-------QWQRPEPGVLKCNIDPA 338
           VS   A D L  W+L+R   +H          A          +W++P  G +KCN+D A
Sbjct: 128 VSIMLARDSLHQWQLIRQKRQHTAAVTGSDSSAATLHSSSNTIRWRKPGTGEVKCNVDAA 187

Query: 337 LIDQEGKYGVAFCIRDENGIFMEAKTMWFEGVPEPQEVEAMGLWQALKWSMELGLEAVVI 158
           +    G YGV  C+R +N  F+ AK  WF G+P+PQE EA GL +A+ W  + GL AV I
Sbjct: 188 IFKDHGCYGVGICLRGDNCEFIAAKMAWFYGLPQPQEAEACGLREAILWLGDRGLTAVSI 247

Query: 157 ETDCLLVVNAVNKA 116
           E D L  V+ V K+
Sbjct: 248 ELDYLCGVSLVAKS 261


>ref|XP_020225471.1| uncharacterized protein LOC109807365 [Cajanus cajan]
          Length = 319

 Score =  165 bits (418), Expect = 2e-45
 Identities = 83/262 (31%), Positives = 129/262 (49%)
 Frame = -2

Query: 856 IPNKVKLLLWRVARGCLPTRMNLQRRHVPCEEVYPMCGVGIEDEIHLFFKCVEAKPMWER 677
           IP+ +K+ LWR+ R CLP+R  LQ++ VPC  + P C    E+  H+FF C EA+ +W+ 
Sbjct: 9   IPHNMKIFLWRLLRDCLPSRQRLQQKGVPCTSLCPHCEAAQENNWHIFFGCQEAQTVWQA 68

Query: 676 CRXXXXXXXXXXXXXVFAYCFFNLFDSLTEIKCKETSVIFWAIWKRHNGKVWEDVLKPVA 497
                                F+L  S+++    E  V    IW+R N KVW+    P  
Sbjct: 69  TGIWQHIKSLVDVGEGIVEVIFSLLGSISQSHIVEVVVTLSCIWRRRNAKVWDQGAPPSG 128

Query: 496 VSQHAALDYLCDWELLRL*PRHXXXXXXXXXXAVQQQWQRPEPGVLKCNIDPALIDQEGK 317
           V+   A  Y  DW+  +                   QW++P  G   CNID AL      
Sbjct: 129 VATSQAKQYFRDWQAAQA-----RSSTQRTPPVHDLQWKKPHAGTFTCNIDAALFQDSSY 183

Query: 316 YGVAFCIRDENGIFMEAKTMWFEGVPEPQEVEAMGLWQALKWSMELGLEAVVIETDCLLV 137
           +G + CIR+++G F+ AKT W  G+P   E EA  L  A++W + L L  V IE+DC  V
Sbjct: 184 FGYSMCIRNDHGQFLTAKTGWAHGLPPVHEAEATALLTAIQWIVTLSLTHVTIESDCKSV 243

Query: 136 VNAVNKASILNTEFDVIISHCK 71
           ++A++     ++E+  +++ C+
Sbjct: 244 LDALSGTQSHHSEYGSLLNKCR 265


>dbj|GAU36460.1| hypothetical protein TSUD_166260 [Trifolium subterraneum]
          Length = 1012

 Score =  172 bits (436), Expect = 1e-44
 Identities = 89/231 (38%), Positives = 121/231 (52%)
 Frame = -2

Query: 856  IPNKVKLLLWRVARGCLPTRMNLQRRHVPCEEVYPMCGVGIEDEIHLFFKCVEAKPMWER 677
            IP +VK  +WRV RGCLPTR  LQR+ V C ++ P C    E+E H+F  C +AK +W  
Sbjct: 782  IPQRVKKFMWRVLRGCLPTRDKLQRKGVQCTDLCPHCETTYENEWHVFLGCEKAKRIWIE 841

Query: 676  CRXXXXXXXXXXXXXVFAYCFFNLFDSLTEIKCKETSVIFWAIWKRHNGKVWEDVLKPVA 497
                            F    F+      E KC +  +I W +WKR N K+WE V KPV 
Sbjct: 842  AGLWDDIAQLVVAANSFNSLVFSFMTVNLEQKCSDFVMIMWCLWKRRNEKIWEGVEKPVH 901

Query: 496  VSQHAALDYLCDWELLRL*PRHXXXXXXXXXXAVQQQWQRPEPGVLKCNIDPALIDQEGK 317
            +S + A +YL  W  ++    +            Q  WQ P  G  KCN+D AL ++E +
Sbjct: 902  LSINTAREYLVQWREIKARQENVRPAAIN----TQVVWQPPADGEFKCNVDAALFNEEQQ 957

Query: 316  YGVAFCIRDENGIFMEAKTMWFEGVPEPQEVEAMGLWQALKWSMELGLEAV 164
            +G+  CIR  +G F++A+TM FEG P P E EA  L +AL W  ELG+  V
Sbjct: 958  FGLGMCIRGAHGTFVKARTMVFEGTPPPLEAEAYALKEALIWLEELGISRV 1008


>dbj|GAU18498.1| hypothetical protein TSUD_366810 [Trifolium subterraneum]
          Length = 319

 Score =  156 bits (395), Expect = 5e-42
 Identities = 89/261 (34%), Positives = 121/261 (46%)
 Frame = -2

Query: 853 PNKVKLLLWRVARGCLPTRMNLQRRHVPCEEVYPMCGVGIEDEIHLFFKCVEAKPMWERC 674
           P K+K LLWR+ R C PTR+ LQ + + C     +C    ED  HLFFKC  +  +W + 
Sbjct: 51  PPKIKNLLWRICRHCCPTRVRLQDKGIECPTDCVLCEDHDEDSFHLFFKCRNSLNIWNQT 110

Query: 673 RXXXXXXXXXXXXXVFAYCFFNLFDSLTEIKCKETSVIFWAIWKRHNGKVWEDVLKPVAV 494
                           A   F L   + + K    ++I W+IW + N KVW +   P   
Sbjct: 111 NIAQAVLQASEEQSDAAAVIFTLLQQVDKDKTGIFAIIIWSIWNQRNDKVWRNKDTPQQT 170

Query: 493 SQHAALDYLCDWELLRL*PRHXXXXXXXXXXAVQQQWQRPEPGVLKCNIDPALIDQEGKY 314
               A+++L DW+   +                  +W++P PG +KCNID A        
Sbjct: 171 VILRAMNFLNDWK--NIISVQTSTSVDMQAETTLTKWKKPSPGRIKCNIDVAFPSNTNLI 228

Query: 313 GVAFCIRDENGIFMEAKTMWFEGVPEPQEVEAMGLWQALKWSMELGLEAVVIETDCLLVV 134
           G+  CIRDE G F+ AKT WFE   E    EA+G   AL+W  EL L  V  E D  L+V
Sbjct: 229 GIGICIRDETGAFVRAKTEWFEPKCEVHVGEALGFLSALRWVHELNLGPVDFELDSKLMV 288

Query: 133 NAVNKASILNTEFDVIISHCK 71
           ++        TEF  II HCK
Sbjct: 289 DSSRYHRKDFTEFGAIIQHCK 309


>gb|KYP35971.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 606

 Score =  160 bits (406), Expect = 2e-41
 Identities = 86/253 (33%), Positives = 120/253 (47%)
 Frame = -2

Query: 853  PNKVKLLLWRVARGCLPTRMNLQRRHVPCEEVYPMCGVGIEDEIHLFFKCVEAKPMWERC 674
            PN  K+ LWRV RGCLPTR+NLQRRHVPC  + P C  GIE+E H+FF+CVEAK +W   
Sbjct: 297  PNTKKIFLWRVLRGCLPTRLNLQRRHVPCTMLCPTCSAGIENEWHIFFECVEAKDIWAAS 356

Query: 673  RXXXXXXXXXXXXXVFAYCFFNLFDSLTEIKCKETSVIFWAIWKRHNGKVWEDVLKPVAV 494
                                F L   L+  +  +   + W IW++ N  +W + + P   
Sbjct: 357  GFWPKISQIIADSDGIQQAIFQLLQCLSPSEALDLLCLMWGIWRKRNDILWNNKVTPSHT 416

Query: 493  SQHAALDYLCDWELLRL*PRHXXXXXXXXXXAVQQQWQRPEPGVLKCNIDPALIDQEGKY 314
                A   + +W   R   +                W +P P  +KCN+D  +       
Sbjct: 417  VIFLARQRISEWMSARETQQIPKVARNDPIC-----WFKPPPEYMKCNVDVTIFTDSNCC 471

Query: 313  GVAFCIRDENGIFMEAKTMWFEGVPEPQEVEAMGLWQALKWSMELGLEAVVIETDCLLVV 134
            G AF IRD+ G F  A T W+ G   P E EAM   +A+ W      E V+IE DC  VV
Sbjct: 472  GFAFYIRDDLGRFKAATTGWYNGSLPPNEAEAMACLEAITWLANSHYEKVLIELDCKKVV 531

Query: 133  NAVNKASILNTEF 95
            + +  ++ L +E+
Sbjct: 532  DDLYDSTSLFSEY 544


>dbj|GAU48398.1| hypothetical protein TSUD_405430 [Trifolium subterraneum]
          Length = 395

 Score =  152 bits (385), Expect = 8e-40
 Identities = 82/261 (31%), Positives = 124/261 (47%)
 Frame = -2

Query: 853 PNKVKLLLWRVARGCLPTRMNLQRRHVPCEEVYPMCGVGIEDEIHLFFKCVEAKPMWERC 674
           P KVK L+WR+ R C+ TR  LQ + V C  +  +C +  ED +H+FFKC  ++ +W   
Sbjct: 83  PPKVKNLIWRICRRCVSTRARLQDKGVNCPNLCALCNIEGEDSLHVFFKCPSSQNVWSMT 142

Query: 673 RXXXXXXXXXXXXXVFAYCFFNLFDSLTEIKCKETSVIFWAIWKRHNGKVWEDVLKPVAV 494
                           +   F +   L++      + I W+IWK+ N ++W +V    + 
Sbjct: 143 SFFQVVSSVINNENEASAIVFQILRQLSKEDAALFACILWSIWKQRNNQIWNNVTDAQSF 202

Query: 493 SQHAALDYLCDWELLRL*PRHXXXXXXXXXXAVQQQWQRPEPGVLKCNIDPALIDQEGKY 314
               A + L +W  +R                +   W++P  G +KCN+D + +    K 
Sbjct: 203 VFSRANNMLQEWNTVRNVAATPVSNQQPGAACI---WRKPSAGHVKCNVDASFLPHNNKV 259

Query: 313 GVAFCIRDENGIFMEAKTMWFEGVPEPQEVEAMGLWQALKWSMELGLEAVVIETDCLLVV 134
           G+  CIRD+ G F+ AKT WF    E    EA+GL  AL W  EL L  V  E D   VV
Sbjct: 260 GIGICIRDDQGAFILAKTEWFSPKSEVHTGEALGLLAALNWVHELNLGPVEFELDSKRVV 319

Query: 133 NAVNKASILNTEFDVIISHCK 71
           ++ + +    TEF VI+ HCK
Sbjct: 320 DSFHSSKRDFTEFGVIVEHCK 340


>dbj|GAU36374.1| hypothetical protein TSUD_151410 [Trifolium subterraneum]
          Length = 474

 Score =  154 bits (389), Expect = 9e-40
 Identities = 86/272 (31%), Positives = 133/272 (48%), Gaps = 2/272 (0%)
 Frame = -2

Query: 856 IPNKVKLLLWRVARGCLPTRMNLQRRHVPCEEVYPMCGVGIEDEIHLFFKCVEAKPMWER 677
           +P +VK L+WRV R C+PTR NLQ R V C  V  +C    ED  H+FF C+ +  +W  
Sbjct: 162 VPPRVKNLVWRVCRQCIPTRTNLQNRGVNCTTVCALCNEYDEDSGHIFFDCLSSSNIWSM 221

Query: 676 CRXXXXXXXXXXXXXVFAYCFFNLFDSLTEIKCKETSVIFWAIWKRHNGKVWEDVLKPVA 497
           C                    F +   L   +    + I W+IWK+ N ++W +V    +
Sbjct: 222 CTFNHVITAGLQHYAGVTELIFAVLQQLNVDEAALMACIIWSIWKQRNNQIWNNVTDAQS 281

Query: 496 VSQHAALDYLCDWELLRL*PRHXXXXXXXXXXAVQQQWQRPEPGVLKCNIDPALIDQEGK 317
           V    A+  L DW ++++               ++ +W++P  G +KCNID +      +
Sbjct: 282 VVFSRAVTTLHDWCVVQV----IRNDTREQQRIIEHKWKKPNNGRVKCNIDASFSRNLNR 337

Query: 316 YGVAFCIRDENGIFMEAKTMWFEGVPEPQEVEAMGLWQALKWSMELGLEAVVIETDCLLV 137
            G+  CIRDE GI++ AK   F  + + +  EA+GL  AL+W  EL    V  E D  LV
Sbjct: 338 VGIGICIRDEYGIYVMAKYDQFSPICDVRIGEALGLLSALRWVHELNFGPVDFELDSKLV 397

Query: 136 VNAVNKASILNTEFDVIISHCK--IRILLNSS 47
           V++       ++EF  II+HC+    +L N+S
Sbjct: 398 VDSFRSNKYNDSEFGEIIAHCRRLFSLLYNNS 429


>dbj|GAU17471.1| hypothetical protein TSUD_340140 [Trifolium subterraneum]
          Length = 479

 Score =  154 bits (389), Expect = 1e-39
 Identities = 86/270 (31%), Positives = 128/270 (47%), Gaps = 2/270 (0%)
 Frame = -2

Query: 856 IPNKVKLLLWRVARGCLPTRMNLQRRHVPCEEVYPMCGVGIEDEIHLFFKCVEAKPMWER 677
           IP KVK LLWR+ R  LPTR  L  R V C     +C    ED IH+ F C  +   W++
Sbjct: 166 IPPKVKNLLWRIGRNVLPTRATLNSRSVQCLVHCAVCNDSAEDSIHILFLCPRSTECWQQ 225

Query: 676 CRXXXXXXXXXXXXXVFAYCFFNLFDSLTEIKCKETSVIFWAIWKRHNGKVWEDVLKPVA 497
                            A     +  SL + + +  SV+ W+IWKR N KVW+++ +   
Sbjct: 226 AGLWNQIDAGLNTSNNIADILLFILQSLNKEQQEIFSVLLWSIWKRRNAKVWDNITESNT 285

Query: 496 VSQHAALDYLCDWELLRL*PRHXXXXXXXXXXAVQQQ--WQRPEPGVLKCNIDPALIDQE 323
                A   L  W+  +                +QQ+  W++P  G  KCNID +     
Sbjct: 286 NVYERAQHLLTSWKQAQ-----QTRSYANTPQPIQQRTNWEKPSQGRYKCNIDASFSSTH 340

Query: 322 GKYGVAFCIRDENGIFMEAKTMWFEGVPEPQEVEAMGLWQALKWSMELGLEAVVIETDCL 143
            K G+  CIRD+ G ++ AKT W E + + +  EAMGL+ A+KW  EL L  V  E DC 
Sbjct: 341 NKVGIGMCIRDDQGRYVAAKTEWLEPILDVEIGEAMGLFSAVKWVDELRLSDVDFEMDCK 400

Query: 142 LVVNAVNKASILNTEFDVIISHCKIRILLN 53
            VV+ ++ +   N++   I+  C++ +  N
Sbjct: 401 RVVDCLHSSRTYNSDLGDILRDCRVILATN 430


>gb|KYP66749.1| LINE-1 reverse transcriptase isogeny, partial [Cajanus cajan]
          Length = 816

 Score =  156 bits (395), Expect = 3e-39
 Identities = 76/262 (29%), Positives = 121/262 (46%)
 Frame = -2

Query: 856  IPNKVKLLLWRVARGCLPTRMNLQRRHVPCEEVYPMCGVGIEDEIHLFFKCVEAKPMWER 677
            IP+  ++ LWR+ RGC+PTR+NLQ++ VPC    P C    E+E HLF+ C  A  +W  
Sbjct: 507  IPHSTQIFLWRLLRGCIPTRLNLQQKGVPCTSSCPHCSANQENEWHLFYSCPAALSIWID 566

Query: 676  CRXXXXXXXXXXXXXVFAYCFFNLFDSLTEIKCKETSVIFWAIWKRHNGKVWEDVLKPVA 497
                            F    + L   LT       +++ W IW+R N KVW++   P  
Sbjct: 567  SGCWPRIAHIVEQGISFIDTTWKLLGHLTGSDLTSFTLMLWCIWRRRNDKVWKEGAPPPK 626

Query: 496  VSQHAALDYLCDWELLRL*PRHXXXXXXXXXXAVQQQWQRPEPGVLKCNIDPALIDQEGK 317
             S      +   W                    V  +W +P      CN+D  L +    
Sbjct: 627  TSIQLTEQHFHAWRSAH------RNLAQTASPVVNHRWTKPPADTFTCNVDAVLFNDSST 680

Query: 316  YGVAFCIRDENGIFMEAKTMWFEGVPEPQEVEAMGLWQALKWSMELGLEAVVIETDCLLV 137
            +G   C+RD  G+F  A + W  G+P P E EA  + +A+++ +    + V +ETDC  V
Sbjct: 681  FGFGICVRDTRGLFQTAISGWKHGLPPPHEAEAAAMLEAIQYLIHSPYDNVCVETDCKQV 740

Query: 136  VNAVNKASILNTEFDVIISHCK 71
             + +N   +L++E+ +II+ C+
Sbjct: 741  ADHLNSTQVLHSEYGIIINQCR 762


>dbj|GAU44081.1| hypothetical protein TSUD_399630 [Trifolium subterraneum]
          Length = 539

 Score =  148 bits (374), Expect = 3e-37
 Identities = 79/261 (30%), Positives = 125/261 (47%)
 Frame = -2

Query: 853 PNKVKLLLWRVARGCLPTRMNLQRRHVPCEEVYPMCGVGIEDEIHLFFKCVEAKPMWERC 674
           P +V+ LLWR+ R C+PTR+NL+ R + C  V  +C    ED  H+FF C  ++ +W  C
Sbjct: 228 PPRVRNLLWRICRRCVPTRVNLRSRGMNCTTVCSLCNDQDEDSRHIFFDCPSSRNVWSMC 287

Query: 673 RXXXXXXXXXXXXXVFAYCFFNLFDSLTEIKCKETSVIFWAIWKRHNGKVWEDVLKPVAV 494
                           +Y  F+L   L+       + + W+IWK+ N ++W +V+     
Sbjct: 288 CFGNKIIAALHNDYAASYLIFDLLQQLSNEDASLMACVIWSIWKQRNSRIWNNVIDAQNF 347

Query: 493 SQHAALDYLCDWELLRL*PRHXXXXXXXXXXAVQQQWQRPEPGVLKCNIDPALIDQEGKY 314
               A+  + DW  ++                 + +W +P  G +KCNID +      + 
Sbjct: 348 VLSRAVALINDWCDVQ----QARPDAMGQHTTTEIKWNKPANGRVKCNIDASFSSHNNRV 403

Query: 313 GVAFCIRDENGIFMEAKTMWFEGVPEPQEVEAMGLWQALKWSMELGLEAVVIETDCLLVV 134
           G++ CIRDE G ++ AK   F  + + +  EA+G   AL W  EL L  V  E D  LV+
Sbjct: 404 GISVCIRDEKGAYVSAKLDQFSPICDVRVGEALGFLSALSWIHELNLGPVDFELDSKLVI 463

Query: 133 NAVNKASILNTEFDVIISHCK 71
           +  +  +   TEF  IISHC+
Sbjct: 464 DGFHSNNHDITEFREIISHCR 484


>dbj|GAU10454.1| hypothetical protein TSUD_423510, partial [Trifolium subterraneum]
          Length = 280

 Score =  140 bits (353), Expect = 3e-36
 Identities = 82/270 (30%), Positives = 122/270 (45%)
 Frame = -2

Query: 856 IPNKVKLLLWRVARGCLPTRMNLQRRHVPCEEVYPMCGVGIEDEIHLFFKCVEAKPMWER 677
           +P++VK  LWR+A  CLPTR  L  R + C++   +C   +E ++H FF C +A   WE+
Sbjct: 3   VPSRVKSFLWRMAHNCLPTRDQLATRGIHCDDTCVVCEQLMETQMHTFFACSKAVKCWEK 62

Query: 676 CRXXXXXXXXXXXXXVFAYCFFNLFDSLTEIKCKETSVIFWAIWKRHNGKVWEDVLKPVA 497
                           F   FF LFD L   +    ++  W++WK  N K+WE +     
Sbjct: 63  INMDGLVRELLLVANNFTTMFFTLFDRLAINQQAIVAMTLWSLWKCRNMKLWEGIDTSPH 122

Query: 496 VSQHAALDYLCDWELLRL*PRHXXXXXXXXXXAVQQQWQRPEPGVLKCNIDPALIDQEGK 317
           +    A D L +W  ++   +H               W +P    +KCN+D A  +    
Sbjct: 123 MIITRAKDALYEWSTIQT-AKHPVHKGTNHDI----SWTKPPLNTVKCNVDCAFFNNNTI 177

Query: 316 YGVAFCIRDENGIFMEAKTMWFEGVPEPQEVEAMGLWQALKWSMELGLEAVVIETDCLLV 137
            G   C RD  G FM  ++ W +      E EA  L  ++K S   G + V  ETDC LV
Sbjct: 178 MGYGLCFRDATGQFMHGESSWKQCFMTTAEAEATALLASIKASFAQGYQKVFFETDCKLV 237

Query: 136 VNAVNKASILNTEFDVIISHCKIRILLNSS 47
           V+A+   S    E   IIS CK  +  N++
Sbjct: 238 VDALYSHSAPQNELGDIISLCKNLLSTNNN 267


>gb|KYP45089.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 406

 Score =  142 bits (358), Expect = 8e-36
 Identities = 77/224 (34%), Positives = 115/224 (51%), Gaps = 6/224 (2%)
 Frame = -2

Query: 850 NKVKLLLWRVARGCLPTRMNLQRRHVPCEEVYPMCGVGIEDEIHLFFKCVEAKPMWERCR 671
           N +K+ LWR+AR CLP+RMNLQ+R +P   +   C +  E+E H+FF C  A+ +W    
Sbjct: 193 NTMKIFLWRIARRCLPSRMNLQQRGIPRTSLCAHCSLNQENEWHIFFGCQTAESIWMTFG 252

Query: 670 XXXXXXXXXXXXXVFAYCFFNLFDSL-TEIKCKETSVIFWAIWKRHNGKVWEDVLKPVAV 494
                         F    F+L  +L  +I CK   +I W+IW+  N KVW D   P  +
Sbjct: 253 LWPSTNAYIDNGEDFKDTIFSLISNLHHDIACK-VIIILWSIWRNRNDKVWSDTTTPPGI 311

Query: 493 SQHAALDYLCDWELLRL*PRHXXXXXXXXXXAVQQQ-----WQRPEPGVLKCNIDPALID 329
           + H A+    +W+  ++  +             QQQ     W +P PG+LKCN+D A+  
Sbjct: 312 AVHKAMQRYSEWQFAKVKDKSTS----------QQQPHVNTWTKPLPGLLKCNVDAAVFK 361

Query: 328 QEGKYGVAFCIRDENGIFMEAKTMWFEGVPEPQEVEAMGLWQAL 197
           +E   G   CIR+ +G F++AK+ W  G    QE EA+ L +AL
Sbjct: 362 EENIMGFGLCIRNADGSFIKAKSGWQHGFINFQEAEALTLLEAL 405


>gb|PNY17850.1| ribonuclease H [Trifolium pratense]
          Length = 363

 Score =  140 bits (353), Expect = 2e-35
 Identities = 76/270 (28%), Positives = 120/270 (44%)
 Frame = -2

Query: 856 IPNKVKLLLWRVARGCLPTRMNLQRRHVPCEEVYPMCGVGIEDEIHLFFKCVEAKPMWER 677
           +P KV+ L+WR+AR CLPTR+ L  RHVPC     +C   +E + HL F+C  +   W+ 
Sbjct: 50  VPPKVRSLIWRIARNCLPTRLRLNERHVPCPINCEICNDSVESDWHLLFQCDTSIQSWQT 109

Query: 676 CRXXXXXXXXXXXXXVFAYCFFNLFDSLTEIKCKETSVIFWAIWKRHNGKVWEDVLKPVA 497
                                 ++     E       +I W +W   N  +W        
Sbjct: 110 EGLWPQIRDRVQRMNSAIEVVLDICSREVEAVVNRFMIIVWGLWHNRNEWIWNQKQMNPD 169

Query: 496 VSQHAALDYLCDWELLRL*PRHXXXXXXXXXXAVQQQWQRPEPGVLKCNIDPALIDQEGK 317
              H       +W   +   +           +V ++W +P  G LKCN+D A      K
Sbjct: 170 QINHWTKARWSEWNAAQ---QRRVTADATEYSSVHRRWVKPITGELKCNVDAAFHHSIDK 226

Query: 316 YGVAFCIRDENGIFMEAKTMWFEGVPEPQEVEAMGLWQALKWSMELGLEAVVIETDCLLV 137
                C+RD NG F++A + W        E EA+G+WQ + W   LG   V+ E+D   +
Sbjct: 227 TSYGCCLRDSNGDFIQALSGWCNPELSVCEGEALGMWQVMSWVQNLGWSKVIFESDSKTL 286

Query: 136 VNAVNKASILNTEFDVIISHCKIRILLNSS 47
           V+AVN  S+  +EF V++S+ +  + LN++
Sbjct: 287 VDAVNSKSVGGSEFHVLVSNIRTLLSLNNN 316


>dbj|GAU50352.1| hypothetical protein TSUD_288030 [Trifolium subterraneum]
          Length = 452

 Score =  142 bits (357), Expect = 2e-35
 Identities = 78/242 (32%), Positives = 111/242 (45%)
 Frame = -2

Query: 856 IPNKVKLLLWRVARGCLPTRMNLQRRHVPCEEVYPMCGVGIEDEIHLFFKCVEAKPMWER 677
           +P K+K   WR+ RGCLPTR NL RR V C+ +  +C    EDE+H F  C  A   W+ 
Sbjct: 203 LPPKLKHFCWRLLRGCLPTRFNLHRRGVQCQTICALCNNATEDELHPFTDCAHAILCWKE 262

Query: 676 CRXXXXXXXXXXXXXVFAYCFFNLFDSLTEIKCKETSVIFWAIWKRHNGKVWEDVLKPVA 497
                           F+   F++  S+ E K      + W+IW+  N  +WE+      
Sbjct: 263 VNLWQSLEPQFLQSGSFSSIIFSIISSMEETKQSVFVAVLWSIWRARNECIWENKQANPV 322

Query: 496 VSQHAALDYLCDWELLRL*PRHXXXXXXXXXXAVQQQWQRPEPGVLKCNIDPALIDQEGK 317
            S     D + D+        H           V   W++P    LKCN+D A+   EGK
Sbjct: 323 ASCRLDFDLIRDFNWC-----HNMLNADHMPTHV-HTWEKPPTSWLKCNVDGAIFMTEGK 376

Query: 316 YGVAFCIRDENGIFMEAKTMWFEGVPEPQEVEAMGLWQALKWSMELGLEAVVIETDCLLV 137
           +G+  C RD +G F++A TM F       E EA  +  AL  ++  G E V+ E+DC  V
Sbjct: 377 FGIGICFRDSSGSFVQAHTMTFPFEVTAAECEATAMKHALALALSNGFERVLFESDCKQV 436

Query: 136 VN 131
           VN
Sbjct: 437 VN 438


>dbj|GAU47271.1| hypothetical protein TSUD_280940 [Trifolium subterraneum]
          Length = 780

 Score =  143 bits (360), Expect = 1e-34
 Identities = 78/244 (31%), Positives = 114/244 (46%)
 Frame = -2

Query: 856  IPNKVKLLLWRVARGCLPTRMNLQRRHVPCEEVYPMCGVGIEDEIHLFFKCVEAKPMWER 677
            +P K+K   WR+ RGCLPTR NL RR V C+ +  +C    EDE+HLF  C  A   W+ 
Sbjct: 521  LPPKLKHFCWRLLRGCLPTRFNLHRRGVQCQTICALCNNATEDELHLFTDCANAILCWKE 580

Query: 676  CRXXXXXXXXXXXXXVFAYCFFNLFDSLTEIKCKETSVIFWAIWKRHNGKVWEDVLKPVA 497
                            F+   F++  S+ E K    + + W+IW+  N  +WE+      
Sbjct: 581  VNLWQSLEHQFLQSGSFSSIIFSIISSMEETKQSLFAAVLWSIWRARNECIWENKQANPV 640

Query: 496  VSQHAALDYLCDWELLRL*PRHXXXXXXXXXXAVQQQWQRPEPGVLKCNIDPALIDQEGK 317
             S   A D + D+        H           V   W++P    LKCN+D A+   E K
Sbjct: 641  ASCRLAFDLIRDFNWC-----HNMLNAYHMPTHV-HTWEKPLVNWLKCNVDGAIFTTEAK 694

Query: 316  YGVAFCIRDENGIFMEAKTMWFEGVPEPQEVEAMGLWQALKWSMELGLEAVVIETDCLLV 137
            +G+  C RD +G F++A TM F       E EA  +  AL  ++    E V+ E+DC  V
Sbjct: 695  FGIGICFRDSSGSFVQAHTMTFPFEVTAVECEATAMKHALALALSNAFERVLFESDCQQV 754

Query: 136  VNAV 125
            +NA+
Sbjct: 755  MNAL 758


>gb|KYP36545.1| hypothetical protein KK1_042329 [Cajanus cajan]
          Length = 291

 Score =  135 bits (341), Expect = 2e-34
 Identities = 77/265 (29%), Positives = 121/265 (45%), Gaps = 7/265 (2%)
 Frame = -2

Query: 844 VKLLLWRVARGCLPTRMNLQRRHVPCEEVYPMCGVGIEDEIHLFFKCVEAKPMWERCRXX 665
           +K+ LWR+ R CLP+R  LQ++ VPC     +C             C EA+ +W+     
Sbjct: 1   MKIFLWRLLRDCLPSRQRLQQKGVPCTS---LC-------------CQEAQTVWQATGIW 44

Query: 664 XXXXXXXXXXXVFAYCFFNLFDSLTEIKCKETSVIFWAIWKRHNGKVWEDVLKPVAVSQH 485
                            F+L  S+++    E  V    IW+R N KVW+    P  V+  
Sbjct: 45  QHIKSFVDVGEGIVEVIFSLLGSISQSHIVEVVVTLGCIWRRRNAKVWDQGAPPSGVAIS 104

Query: 484 AALDYLCDWELL-------RL*PRHXXXXXXXXXXAVQQQWQRPEPGVLKCNIDPALIDQ 326
            A  +  DW+         R+ P H              QW++P  G   CNID AL   
Sbjct: 105 QAKQHFRDWQAAQARSSTQRIPPVHDL------------QWKKPHVGTFTCNIDAALFQD 152

Query: 325 EGKYGVAFCIRDENGIFMEAKTMWFEGVPEPQEVEAMGLWQALKWSMELGLEAVVIETDC 146
              +G + CIR+++G F+ AKT W   +P   E EA  L  A++W   L L  V IE+DC
Sbjct: 153 SSYFGYSMCIRNDHGQFLTAKTGWAHSLPPVHEAEATALLTAIQWIENLSLTHVTIESDC 212

Query: 145 LLVVNAVNKASILNTEFDVIISHCK 71
             V++A+++    ++E+  +++ C+
Sbjct: 213 KSVLDALSRTQSQHSEYGSLLNKCR 237


>gb|KYP46236.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 507

 Score =  140 bits (352), Expect = 3e-34
 Identities = 71/221 (32%), Positives = 104/221 (47%)
 Frame = -2

Query: 856 IPNKVKLLLWRVARGCLPTRMNLQRRHVPCEEVYPMCGVGIEDEIHLFFKCVEAKPMWER 677
           IP+ +K+ LWR+ R CLP+R  LQ++ VPC  + P C    E+  H+FF C EA+ +W+ 
Sbjct: 292 IPHNMKIFLWRLLRDCLPSRQRLQQKGVPCTSLCPHCEAAQENNWHIFFGCQEAQTVWQA 351

Query: 676 CRXXXXXXXXXXXXXVFAYCFFNLFDSLTEIKCKETSVIFWAIWKRHNGKVWEDVLKPVA 497
                                F+L  S+++    E  V    IW+R N KVW+    P  
Sbjct: 352 TGIWQHIKSLIDVGEGIVEVIFSLLGSISQSHIVEVVVTLSCIWRRRNAKVWDQGAPPSG 411

Query: 496 VSQHAALDYLCDWELLRL*PRHXXXXXXXXXXAVQQQWQRPEPGVLKCNIDPALIDQEGK 317
           V+   A  Y  DW+  +                   QW++P  G   CNID AL      
Sbjct: 412 VATSQAKQYFRDWQAAQ-----ARSSTQRTPPVHDLQWKKPHAGTFTCNIDAALFQDSSY 466

Query: 316 YGVAFCIRDENGIFMEAKTMWFEGVPEPQEVEAMGLWQALK 194
           +G + CIR+++G F+ AKT W  G+P   E EA  L  A++
Sbjct: 467 FGYSMCIRNDHGQFLTAKTGWAHGLPPVHEAEATALLTAIQ 507

