BLASTX nr result

ID: Astragalus22_contig00014905 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus22_contig00014905
         (1431 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004502023.1| PREDICTED: zinc finger CCHC domain-containin...   594   0.0  
gb|KHN42223.1| Cellular nucleic acid-binding protein like [Glyci...   561   0.0  
ref|XP_006605629.1| PREDICTED: zinc finger CCHC domain-containin...   562   0.0  
dbj|GAU44155.1| hypothetical protein TSUD_399770 [Trifolium subt...   561   0.0  
ref|XP_007146032.1| hypothetical protein PHAVU_006G006900g [Phas...   551   0.0  
ref|XP_012571773.1| PREDICTED: protein AIR1-like isoform X2 [Cic...   548   0.0  
gb|PNY14547.1| cellular nucleic acid-binding protein [Trifolium ...   550   0.0  
ref|XP_017437156.1| PREDICTED: zinc finger CCHC domain-containin...   535   0.0  
ref|XP_020223827.1| zinc finger CCHC domain-containing protein 7...   526   0.0  
gb|KRH17934.1| hypothetical protein GLYMA_13G028000 [Glycine max]     525   0.0  
ref|XP_014516861.1| zinc finger CCHC domain-containing protein 7...   523   e-180
ref|XP_003542327.1| PREDICTED: protein AIR1-like isoform X1 [Gly...   520   e-178
gb|ACU17716.1| unknown [Glycine max]                                  502   e-174
ref|XP_006593907.1| PREDICTED: protein AIR1-like isoform X2 [Gly...   505   e-173
gb|KRG89903.1| hypothetical protein GLYMA_20G055200 [Glycine max]     483   e-164
gb|KRG89906.1| hypothetical protein GLYMA_20G055200 [Glycine max]     395   e-156
ref|XP_015973610.1| protein AIR2 [Arachis duranensis]                 451   e-152
ref|XP_016166718.1| protein AIR2 [Arachis ipaensis]                   448   e-151
gb|KRG89905.1| hypothetical protein GLYMA_20G055200 [Glycine max]     416   e-139
ref|XP_015873156.1| PREDICTED: zinc finger CCHC domain-containin...   419   e-139

>ref|XP_004502023.1| PREDICTED: zinc finger CCHC domain-containing protein 7-like isoform
            X1 [Cicer arietinum]
          Length = 528

 Score =  594 bits (1532), Expect = 0.0
 Identities = 279/330 (84%), Positives = 294/330 (89%), Gaps = 6/330 (1%)
 Frame = +3

Query: 456  KTTEKHESVEASAVQIGDNIVLRKLLRGPRYFDPPDSSWGACYNCGEEGHAAVNCTAARR 635
            KTTEK ESVEASAVQIGDN VLRKLLRGPRYFDPPDSSWGACYNCGEEGHAAVNCTAA+R
Sbjct: 95   KTTEKDESVEASAVQIGDNAVLRKLLRGPRYFDPPDSSWGACYNCGEEGHAAVNCTAAKR 154

Query: 636  QKPCYVCGGLGHNAKQCTKAQDCFICKKGGHRAKDCPEKHISA--SKSLTVCLKCGNSGH 809
             KPCYVCGGLGH AKQCTKAQ CFICKKGGHRAKDCPEK ++A  SKSLT+CLKCGNSGH
Sbjct: 155  MKPCYVCGGLGHGAKQCTKAQSCFICKKGGHRAKDCPEKLMTARVSKSLTICLKCGNSGH 214

Query: 810  DMFSCKNEYSLDDLKEIQCYLCKTFGHICCANTDDATPGEISCYKCGQLGHTGLACSRLR 989
            DMFSCKN+YS DDLKEIQCYLCKTFGH+CC NT DA PGEISCYKCGQ+GHTGLACSRL+
Sbjct: 215  DMFSCKNDYSRDDLKEIQCYLCKTFGHLCCVNTVDAIPGEISCYKCGQMGHTGLACSRLQ 274

Query: 990  GETTSTATPSSCYRCGEGGHFARECTSSIKAGKKNREFSNTKNRRSYKENDYKGHRSAPH 1169
             ETT  A+PS CYRCGE GHFARECTSS KAGKKN EFSNTK RRSYKEND++GH SAPH
Sbjct: 275  SETTGAASPSLCYRCGEVGHFARECTSSTKAGKKNSEFSNTKKRRSYKENDFRGHWSAPH 334

Query: 1170 DLGKMHKKKRPLTEERGFTTPKKSKSRGGWMRELPTEERGFSTPKKSRKSRGGWTREHPT 1349
            D GKMHKKKRPL +ERGFTTPKKSKSRGGW RELPTEERG +TP KSR SRGGWT EHP 
Sbjct: 335  DAGKMHKKKRPLPDERGFTTPKKSKSRGGWSRELPTEERGLTTPTKSR-SRGGWTMEHPA 393

Query: 1350 EE----RGFSTPKKSRKSRGGWTTEHPREE 1427
            EE    RGF+TPKKS KSRGGWT EHP +E
Sbjct: 394  EERNFKRGFTTPKKS-KSRGGWTMEHPADE 422



 Score = 88.6 bits (218), Expect = 5e-15
 Identities = 54/98 (55%), Positives = 61/98 (62%), Gaps = 2/98 (2%)
 Frame = +1

Query: 73  ASEGEEVGELNDDVDGVSTPLLVFSSDDD--EANQDLSLKIVEKAMRMREAKLSPNDALF 246
           A + E+V E+ND++DG STP LVFSSDDD  EANQDLSLKIVEKAMR REAKLSPND + 
Sbjct: 3   ARDHEQVAEINDNLDGASTPSLVFSSDDDDEEANQDLSLKIVEKAMRTREAKLSPNDDV- 61

Query: 247 NGIDCVDSPLQQPELVVAPTGAGSDGPEGIAALEVMEK 360
                                  SD P G AA EVME+
Sbjct: 62  -----------------------SDEPSGPAAFEVMEE 76


>gb|KHN42223.1| Cellular nucleic acid-binding protein like [Glycine soja]
          Length = 432

 Score =  561 bits (1447), Expect = 0.0
 Identities = 257/322 (79%), Positives = 287/322 (89%)
 Frame = +3

Query: 462  TEKHESVEASAVQIGDNIVLRKLLRGPRYFDPPDSSWGACYNCGEEGHAAVNCTAARRQK 641
            TE HE VE S V IGDN+VLRKLLRGPRYFDPPDSSWGAC+NCGE+GHAAVNC+AA+R+K
Sbjct: 8    TENHEFVEGSPVLIGDNMVLRKLLRGPRYFDPPDSSWGACFNCGEDGHAAVNCSAAKRKK 67

Query: 642  PCYVCGGLGHNAKQCTKAQDCFICKKGGHRAKDCPEKHISASKSLTVCLKCGNSGHDMFS 821
            PCYVCGGLGHNA+QCTKAQDCFICKKGGHRAKDC EKH S SKS+ +CLKCGNSGHDMFS
Sbjct: 68   PCYVCGGLGHNARQCTKAQDCFICKKGGHRAKDCLEKHTSRSKSVAICLKCGNSGHDMFS 127

Query: 822  CKNEYSLDDLKEIQCYLCKTFGHICCANTDDATPGEISCYKCGQLGHTGLACSRLRGETT 1001
            C+N+YS DDLKEIQCY+CK  GH+CC NTDDATPGEISCYKCGQLGHTGLACSRLR E T
Sbjct: 128  CRNDYSPDDLKEIQCYVCKRVGHLCCVNTDDATPGEISCYKCGQLGHTGLACSRLRDEIT 187

Query: 1002 STATPSSCYRCGEGGHFARECTSSIKAGKKNREFSNTKNRRSYKENDYKGHRSAPHDLGK 1181
            S ATPSSC++CGE GHFARECTSSIK+GK+N E S+TK++RS KENDY G+RSA +D+  
Sbjct: 188  SGATPSSCFKCGEEGHFARECTSSIKSGKRNWESSHTKDKRSQKENDYMGNRSASNDMVG 247

Query: 1182 MHKKKRPLTEERGFTTPKKSKSRGGWMRELPTEERGFSTPKKSRKSRGGWTREHPTEERG 1361
              +KKR  TEERGF+TPKKSKSRGGW  E PTEERGF+TPKKS KSRGGWT EHPTEERG
Sbjct: 248  ARRKKRSPTEERGFSTPKKSKSRGGWTAEYPTEERGFTTPKKS-KSRGGWTTEHPTEERG 306

Query: 1362 FSTPKKSRKSRGGWTTEHPREE 1427
            F+TPKKS K+RGGWT+EHP E+
Sbjct: 307  FTTPKKS-KNRGGWTSEHPLEQ 327


>ref|XP_006605629.1| PREDICTED: zinc finger CCHC domain-containing protein 7-like [Glycine
            max]
 gb|KRG89904.1| hypothetical protein GLYMA_20G055200 [Glycine max]
          Length = 552

 Score =  562 bits (1449), Expect = 0.0
 Identities = 260/342 (76%), Positives = 291/342 (85%)
 Frame = +3

Query: 402  SGDQSXXXXXXXXXXXXXKTTEKHESVEASAVQIGDNIVLRKLLRGPRYFDPPDSSWGAC 581
            +GDQS               TE HE VE S V IG N+VLRKLLRGPRYFDPPDSSWGAC
Sbjct: 108  AGDQSVVIAEEQEMEETSNATENHEFVEGSPVLIGHNMVLRKLLRGPRYFDPPDSSWGAC 167

Query: 582  YNCGEEGHAAVNCTAARRQKPCYVCGGLGHNAKQCTKAQDCFICKKGGHRAKDCPEKHIS 761
            +NCGE+GHAAVNC+AA+R+KPCYVCGGLGHNA+QCTKAQDCFICKKGGHRAKDC EKH S
Sbjct: 168  FNCGEDGHAAVNCSAAKRKKPCYVCGGLGHNARQCTKAQDCFICKKGGHRAKDCLEKHTS 227

Query: 762  ASKSLTVCLKCGNSGHDMFSCKNEYSLDDLKEIQCYLCKTFGHICCANTDDATPGEISCY 941
             SKS+ +CLKCGNSGHDMFSC+N+YS DDLKEIQCY+CK  GH+CC NTDDATPGEISCY
Sbjct: 228  RSKSVAICLKCGNSGHDMFSCRNDYSPDDLKEIQCYVCKRVGHLCCVNTDDATPGEISCY 287

Query: 942  KCGQLGHTGLACSRLRGETTSTATPSSCYRCGEGGHFARECTSSIKAGKKNREFSNTKNR 1121
            KCGQLGHTGLACSRLR E TS ATPSSC++CGE GHFARECTSSIK+GK+N E S+TK++
Sbjct: 288  KCGQLGHTGLACSRLRDEITSGATPSSCFKCGEEGHFARECTSSIKSGKRNWESSHTKDK 347

Query: 1122 RSYKENDYKGHRSAPHDLGKMHKKKRPLTEERGFTTPKKSKSRGGWMRELPTEERGFSTP 1301
            RS KENDY G+RSA +D+    +KKR  TEERGF+TPKKSKSRGGW  E PTEERGF+TP
Sbjct: 348  RSQKENDYMGNRSASNDMVGARRKKRSPTEERGFSTPKKSKSRGGWTAEYPTEERGFTTP 407

Query: 1302 KKSRKSRGGWTREHPTEERGFSTPKKSRKSRGGWTTEHPREE 1427
            KKS KSRGGWT EHPTEERGF+TPKKS K+RGGWT+EHP E+
Sbjct: 408  KKS-KSRGGWTTEHPTEERGFTTPKKS-KNRGGWTSEHPLEQ 447



 Score = 97.4 bits (241), Expect = 7e-18
 Identities = 65/114 (57%), Positives = 73/114 (64%), Gaps = 1/114 (0%)
 Frame = +1

Query: 43  MGRKEKPKVKASEGEEVGELNDDVDGVSTPLLVFSSDDDEANQDLSLKIVEKAMRMREAK 222
           MGRKEK   KA + EE   +N   +G STP LVFSSDDDEANQDLSLKIVEKAMRMR AK
Sbjct: 1   MGRKEKQNAKAID-EEHDVVN--FNGASTPSLVFSSDDDEANQDLSLKIVEKAMRMRAAK 57

Query: 223 LSPNDALFNGIDCVDSPL-QQPELVVAPTGAGSDGPEGIAALEVMEKEETTKLR 381
            +PND        V SP  Q+ EL V      SD P  IA  EV EK++T KL+
Sbjct: 58  HAPNDD-------VSSPFSQKSELAVPLNDVVSDLPSAIADSEVTEKKKTAKLK 104


>dbj|GAU44155.1| hypothetical protein TSUD_399770 [Trifolium subterraneum]
          Length = 658

 Score =  561 bits (1446), Expect = 0.0
 Identities = 276/408 (67%), Positives = 298/408 (73%), Gaps = 65/408 (15%)
 Frame = +3

Query: 399  ESGDQSXXXXXXXXXXXXXKTTEKHESVEASAVQIGDNIVLRKLLRGPRYFDPPDSSWGA 578
            ESG+ S             KTTE  ESVEASA QI DN+VLRKLLRGPRYFD PDSSWGA
Sbjct: 114  ESGNPSVIIAAEQEVEEMMKTTENDESVEASADQISDNMVLRKLLRGPRYFDTPDSSWGA 173

Query: 579  CYNCGEEGHAAVNCTAARRQKPCYVCGGLGHNAKQCTKAQDCFICKKGGHRAKDCPEKHI 758
            CYNCGEEGHAAVNCTAA+R+KPCYVCGGLGH AKQCTKAQ C+ICKKG HRAKDCPEKH+
Sbjct: 174  CYNCGEEGHAAVNCTAAKRKKPCYVCGGLGHAAKQCTKAQSCYICKKGDHRAKDCPEKHM 233

Query: 759  SA--SKSLTVCLKCGNSGHDMFSCKNEYSLDDLKEIQCYLCKTFGHICCANTDDATPGEI 932
            +A  SKSLTVCLKCGNSGHDMFSC+N+YS DDLKEIQCY+CKTFGH+CC NT DA  GEI
Sbjct: 234  TARASKSLTVCLKCGNSGHDMFSCRNDYSPDDLKEIQCYVCKTFGHLCCVNTADAVLGEI 293

Query: 933  SCYKCGQLGHTGLACSRLRGETTSTAT--------------------------------- 1013
            SCYKCGQ+GHTGLACSRL+GETT  AT                                 
Sbjct: 294  SCYKCGQMGHTGLACSRLQGETTGNATATLCYRCGEGGHFARECTSSIKVQLMQACSRFQ 353

Query: 1014 --------PSSCYRCGEGGHFARECTSSIKAGKKNREFSNTKNRRSYKENDYKGHRSAPH 1169
                    P+ CYRCGEGGHFARECT+SIKAG+K  EFSNTKNRRSYKENDY GH SAP 
Sbjct: 354  GETTGAATPTLCYRCGEGGHFARECTNSIKAGRKVSEFSNTKNRRSYKENDYTGHFSAPP 413

Query: 1170 DLGKMHKKKRPLTEERGFTTPKKSKSRGGWMRELPTEERGFSTPKKSR------------ 1313
            D+GK+HKKKRP T+ERGFTTPKKSK RGGWM E PTEER F+TPKKS+            
Sbjct: 414  DMGKIHKKKRPFTDERGFTTPKKSKKRGGWMTEFPTEERSFTTPKKSKHRGGRTRELPTE 473

Query: 1314 ----------KSRGGWTREHPTEERGFSTPKKSRKSRGGWTTEHPREE 1427
                      KSRGGW  EH TEERGF+TPKKS KSRGGWTTEHP EE
Sbjct: 474  GRAFTAPKKSKSRGGWMAEHSTEERGFTTPKKS-KSRGGWTTEHPTEE 520



 Score =  106 bits (264), Expect = 1e-20
 Identities = 61/116 (52%), Positives = 77/116 (66%), Gaps = 4/116 (3%)
 Frame = +1

Query: 43  MGRKEKPKVKASEGEEVGELNDDVDGVSTPLLVFSSDDD----EANQDLSLKIVEKAMRM 210
           MGRKEK K KA + E++ E++D++DG STP LVFS DDD    EANQDL+LKIVEKA+R 
Sbjct: 1   MGRKEKSKAKARDQEQLDEISDNLDGASTPSLVFSGDDDDDDEEANQDLTLKIVEKALRT 60

Query: 211 REAKLSPNDALFNGIDCVDSPLQQPELVVAPTGAGSDGPEGIAALEVMEKEETTKL 378
           RE K++ NDA+ NGI       QQ + VV        G  GIA LEVME+++   L
Sbjct: 61  REEKIASNDAVLNGIGS-----QQSKFVVTQNDDVLGGMNGIADLEVMEEKKMENL 111


>ref|XP_007146032.1| hypothetical protein PHAVU_006G006900g [Phaseolus vulgaris]
 gb|ESW18026.1| hypothetical protein PHAVU_006G006900g [Phaseolus vulgaris]
          Length = 527

 Score =  551 bits (1421), Expect = 0.0
 Identities = 250/353 (70%), Positives = 290/353 (82%), Gaps = 1/353 (0%)
 Frame = +3

Query: 369  DKAEXXXXXXESGDQSXXXXXXXXXXXXXKTTEKHESVEASAVQIGDNIVLRKLLRGPRY 548
            +K +      E+GDQ              K TE HESVE  AVQ+GDN+VLRKLLRGPRY
Sbjct: 94   EKKKTTKLKIETGDQRVVIANEHETEEIIKDTENHESVEGGAVQLGDNMVLRKLLRGPRY 153

Query: 549  FDPPDSSWGACYNCGEEGHAAVNCTAARRQKPCYVCGGLGHNAKQCTKAQDCFICKKGGH 728
            FDPP+SSWGAC+NCGEEGHAAVNC+ A+R+KPCYVCG LGHNAKQCTK QDCFICK+GGH
Sbjct: 154  FDPPNSSWGACFNCGEEGHAAVNCSVAKRKKPCYVCGVLGHNAKQCTKTQDCFICKQGGH 213

Query: 729  RAKDCPEKHISASKSLTVCLKCGNSGHDMFSCKNEYSLDDLKEIQCYLCKTFGHICCANT 908
            RA+DCPEKH S  +S+ +CLKCGNSGHDMF CKN+YSLDDL+EIQCY+CK  GH+CC N+
Sbjct: 214  RARDCPEKHTSTPRSIAICLKCGNSGHDMFGCKNDYSLDDLEEIQCYVCKRLGHLCCVNS 273

Query: 909  DDATPGEISCYKCGQLGHTGLACSRLRGETTSTATPSSCYRCGEGGHFARECTSSIKAGK 1088
            DDATPGEISCYKCG+LGHTGLACSRL+ E  S ATPSSC++CGE GHFARECTS++K GK
Sbjct: 274  DDATPGEISCYKCGRLGHTGLACSRLQDEIASGATPSSCFKCGEEGHFARECTSAVKTGK 333

Query: 1089 KNREFSNTKNRRSYKENDYKGHRSAPHDLGKMHKKKRPLTEER-GFTTPKKSKSRGGWMR 1265
            +NR+ S TK++R YKENDY G+RSAP+D+G   +KKR   EER GF+ PKKSKSRGGWM+
Sbjct: 334  RNRDSSRTKDKRPYKENDYIGNRSAPNDMGVARRKKRSPAEERGGFSLPKKSKSRGGWMQ 393

Query: 1266 ELPTEERGFSTPKKSRKSRGGWTREHPTEERGFSTPKKSRKSRGGWTTEHPRE 1424
            E P EERGF+TPKKS KSRGGWT EHP E  G++TPKKS KSRGGWTT+HP E
Sbjct: 394  EHPAEERGFTTPKKS-KSRGGWTTEHPAEHNGYTTPKKS-KSRGGWTTDHPEE 444



 Score = 90.9 bits (224), Expect = 9e-16
 Identities = 63/114 (55%), Positives = 72/114 (63%), Gaps = 1/114 (0%)
 Frame = +1

Query: 43  MGRKEKPKVKASEGEEVGELNDDVDGVSTPLLVFSSDDDEANQDLSLKIVEKAMRMREAK 222
           MGRKEK K KA E  E G+  +   G STP LVFSSDDD+ANQDLSLKIVEKAMR+R AK
Sbjct: 1   MGRKEKAKAKAIE--ENGD--NHFGGASTPSLVFSSDDDDANQDLSLKIVEKAMRIRAAK 56

Query: 223 -LSPNDALFNGIDCVDSPLQQPELVVAPTGAGSDGPEGIAALEVMEKEETTKLR 381
             +PND +        S  Q  EL VA      D P  IA  EV EK++TTKL+
Sbjct: 57  RAAPNDNV--------SSSQTLELSVARNVGVLDVPSAIADSEVTEKKKTTKLK 102


>ref|XP_012571773.1| PREDICTED: protein AIR1-like isoform X2 [Cicer arietinum]
          Length = 478

 Score =  548 bits (1411), Expect = 0.0
 Identities = 257/314 (81%), Positives = 274/314 (87%), Gaps = 2/314 (0%)
 Frame = +3

Query: 456  KTTEKHESVEASAVQIGDNIVLRKLLRGPRYFDPPDSSWGACYNCGEEGHAAVNCTAARR 635
            KTTEK ESVEASAVQIGDN VLRKLLRGPRYFDPPDSSWGACYNCGEEGHAAVNCTAA+R
Sbjct: 95   KTTEKDESVEASAVQIGDNAVLRKLLRGPRYFDPPDSSWGACYNCGEEGHAAVNCTAAKR 154

Query: 636  QKPCYVCGGLGHNAKQCTKAQDCFICKKGGHRAKDCPEKHISA--SKSLTVCLKCGNSGH 809
             KPCYVCGGLGH AKQCTKAQ CFICKKGGHRAKDCPEK ++A  SKSLT+CLKCGNSGH
Sbjct: 155  MKPCYVCGGLGHGAKQCTKAQSCFICKKGGHRAKDCPEKLMTARVSKSLTICLKCGNSGH 214

Query: 810  DMFSCKNEYSLDDLKEIQCYLCKTFGHICCANTDDATPGEISCYKCGQLGHTGLACSRLR 989
            DMFSCKN+YS DDLKEIQCYLCKTFGH+CC NT DA PGEISCYKCGQ+GHTGLACSRL+
Sbjct: 215  DMFSCKNDYSRDDLKEIQCYLCKTFGHLCCVNTVDAIPGEISCYKCGQMGHTGLACSRLQ 274

Query: 990  GETTSTATPSSCYRCGEGGHFARECTSSIKAGKKNREFSNTKNRRSYKENDYKGHRSAPH 1169
             ETT  A+PS CYRCGE GHFARECTSS KAGKKN EFSNTK RRSYKEND++GH SAPH
Sbjct: 275  SETTGAASPSLCYRCGEVGHFARECTSSTKAGKKNSEFSNTKKRRSYKENDFRGHWSAPH 334

Query: 1170 DLGKMHKKKRPLTEERGFTTPKKSKSRGGWMRELPTEERGFSTPKKSRKSRGGWTREHPT 1349
            D GKMHKKKRPL +ERGFTTPKKSKSRGGW  E P +ER F++PKK  KSRGGWT E+  
Sbjct: 335  DAGKMHKKKRPLPDERGFTTPKKSKSRGGWTMEHPADERDFNSPKKF-KSRGGWTAEYAG 393

Query: 1350 EERGFSTPKKSRKS 1391
            E   FS+ K  R S
Sbjct: 394  E---FSSSKSKRSS 404



 Score = 88.6 bits (218), Expect = 4e-15
 Identities = 54/98 (55%), Positives = 61/98 (62%), Gaps = 2/98 (2%)
 Frame = +1

Query: 73  ASEGEEVGELNDDVDGVSTPLLVFSSDDD--EANQDLSLKIVEKAMRMREAKLSPNDALF 246
           A + E+V E+ND++DG STP LVFSSDDD  EANQDLSLKIVEKAMR REAKLSPND + 
Sbjct: 3   ARDHEQVAEINDNLDGASTPSLVFSSDDDDEEANQDLSLKIVEKAMRTREAKLSPNDDV- 61

Query: 247 NGIDCVDSPLQQPELVVAPTGAGSDGPEGIAALEVMEK 360
                                  SD P G AA EVME+
Sbjct: 62  -----------------------SDEPSGPAAFEVMEE 76


>gb|PNY14547.1| cellular nucleic acid-binding protein [Trifolium pratense]
          Length = 653

 Score =  550 bits (1417), Expect = 0.0
 Identities = 276/412 (66%), Positives = 296/412 (71%), Gaps = 88/412 (21%)
 Frame = +3

Query: 456  KTTEKHESVEASAVQIGDNIVLRKLLRGPRYFDPPDSSWGACYNCGEEGHAAVNCTAARR 635
            KTTE  ESVEASA QIGDN+VLRKLLRGPRYFDPPDSSWGACYNCGEEGHAAVNCTAA+R
Sbjct: 135  KTTENLESVEASADQIGDNMVLRKLLRGPRYFDPPDSSWGACYNCGEEGHAAVNCTAAKR 194

Query: 636  QKPCYVCGGLGHNAKQCTKAQDCFICKKGGHRAKDCPEKHIS--ASKSLTVCLKCGNSGH 809
            +KPCYVCGGLGH AKQCTKAQ C+ICKKG HRAKDCPEKH +  ASKSLTVCLKCGNSGH
Sbjct: 195  KKPCYVCGGLGHAAKQCTKAQSCYICKKGDHRAKDCPEKHTTPRASKSLTVCLKCGNSGH 254

Query: 810  DMFSCKNEYSLDDLKEIQCYLCKTFGHICCANTDDATPGEISCYKCGQLGHTGLACSRLR 989
            DMFSC+N+YS DDLKEIQCY+CKTFGH+CC NT D   GEISCYKCGQ+GHTGLACSRL+
Sbjct: 255  DMFSCRNDYSPDDLKEIQCYVCKTFGHLCCVNTADVILGEISCYKCGQMGHTGLACSRLQ 314

Query: 990  GETTSTATPSSCYRCGEGGHFARECTSSIK------------------------------ 1079
            GETT  ATP+ CYRCGEGGHFARECTSSIK                              
Sbjct: 315  GETTGAATPTLCYRCGEGGHFARECTSSIKVQLMQACSRFQGETTGAATPTLCYRCGEGG 374

Query: 1080 -----------AGKKNREFSNTKNRRSYKENDYKGHRSAPHDLGKMHKKKRPLTEERGFT 1226
                       AGKK  EFS+TKNRRSYKENDY GH SAPHD+GKM+KKKRPLT+ERGFT
Sbjct: 375  HFARECTNSVKAGKKVSEFSSTKNRRSYKENDYTGHWSAPHDMGKMNKKKRPLTDERGFT 434

Query: 1227 TPKK-----------------------SKSRGGWMRELPTEERGFSTPKKSR-------- 1313
            TPKK                       SKSRGG  RELPTEER F +PKKS+        
Sbjct: 435  TPKKSKRRGGWMTEFPTEERSFTTPKKSKSRGGRTRELPTEERAFKSPKKSKSRAGWMAD 494

Query: 1314 --------------KSRGGWTREHPTEERGFSTPKKSRKSRGGWTTEHPREE 1427
                          K+RGGWT EHPTEERGF+TPKKS KSRGGWT EHP EE
Sbjct: 495  HSTVERGFTTPKKSKNRGGWTTEHPTEERGFTTPKKS-KSRGGWTMEHPAEE 545



 Score =  101 bits (251), Expect = 5e-19
 Identities = 60/117 (51%), Positives = 77/117 (65%), Gaps = 5/117 (4%)
 Frame = +1

Query: 43  MGRKEKPKVKASEGEE-VGELNDDVDGVSTPLLVFSSDDD----EANQDLSLKIVEKAMR 207
           MGR+EK K KA + E+ + E++D++DG STP LVFS D+D    EANQDLSLKIVEKA+R
Sbjct: 1   MGREEKSKAKARDHEQQLDEISDNLDGASTPSLVFSGDEDDDDEEANQDLSLKIVEKALR 60

Query: 208 MREAKLSPNDALFNGIDCVDSPLQQPELVVAPTGAGSDGPEGIAALEVMEKEETTKL 378
            REAKL+PNDA+ N I       QQ E  V        G  G+A +EVME+++   L
Sbjct: 61  TREAKLAPNDAVLNDIGS-----QQSEFTVTQNDDVLGGLNGVADMEVMEEKKMENL 112


>ref|XP_017437156.1| PREDICTED: zinc finger CCHC domain-containing protein 7-like [Vigna
            angularis]
 gb|KOM51460.1| hypothetical protein LR48_Vigan09g011900 [Vigna angularis]
 dbj|BAT88912.1| hypothetical protein VIGAN_05255800 [Vigna angularis var. angularis]
          Length = 526

 Score =  535 bits (1377), Expect = 0.0
 Identities = 243/355 (68%), Positives = 286/355 (80%), Gaps = 1/355 (0%)
 Frame = +3

Query: 363  GNDKAEXXXXXXESGDQSXXXXXXXXXXXXXKTTEKHESVEASAVQIGDNIVLRKLLRGP 542
            G +K +      E+ D+              K TE  ES E  AVQ G+N+VLRKLLRGP
Sbjct: 92   GKEKKKTTKLKTETRDERVVIANEQEMEETIKDTENQESAEGGAVQTGNNMVLRKLLRGP 151

Query: 543  RYFDPPDSSWGACYNCGEEGHAAVNCTAARRQKPCYVCGGLGHNAKQCTKAQDCFICKKG 722
            RYFDPP++SWGAC+NCGEEGHAAVNC+ A+R+KPCYVCG LGHNAKQCTK QDCFICKKG
Sbjct: 152  RYFDPPNNSWGACFNCGEEGHAAVNCSVAKRKKPCYVCGVLGHNAKQCTKTQDCFICKKG 211

Query: 723  GHRAKDCPEKHISASKSLTVCLKCGNSGHDMFSCKNEYSLDDLKEIQCYLCKTFGHICCA 902
            GHRA+DCPEKH S  + + +CLKCGNSGHDMF CKN+YSLDDL+EIQCY+CK  GH+CC 
Sbjct: 212  GHRARDCPEKHASTPRIIAICLKCGNSGHDMFGCKNDYSLDDLQEIQCYVCKRLGHLCCV 271

Query: 903  NTDDATPGEISCYKCGQLGHTGLACSRLRGETTSTATPSSCYRCGEGGHFARECTSSIKA 1082
            N+DDATPGEISCYKCG+LGHTGLACSRL+ E  S ATPSSC++CGE GHFARECTS++K 
Sbjct: 272  NSDDATPGEISCYKCGRLGHTGLACSRLQDEIASGATPSSCFKCGEEGHFARECTSAVKT 331

Query: 1083 GKKNREFSNTKNRRSYKENDYKGHRSAPHDLGKMHKKKRPLTEER-GFTTPKKSKSRGGW 1259
            GK++R+ S TK++R +KENDY G+RSAP+D+G   +KKR  TEER GF+ PKKSKSRGGW
Sbjct: 332  GKRSRDSSRTKDKRFHKENDYIGNRSAPNDMGVARRKKRLPTEERGGFSLPKKSKSRGGW 391

Query: 1260 MRELPTEERGFSTPKKSRKSRGGWTREHPTEERGFSTPKKSRKSRGGWTTEHPRE 1424
            M+E P EERGF+TPKKS K RGGWT EHP E +G++TPKKS KSR GWTTEHP E
Sbjct: 392  MQEHPAEERGFTTPKKS-KGRGGWTTEHPAEHKGYTTPKKS-KSRDGWTTEHPEE 444



 Score = 86.7 bits (213), Expect = 2e-14
 Identities = 62/115 (53%), Positives = 70/115 (60%), Gaps = 2/115 (1%)
 Frame = +1

Query: 43  MGRKEKPKVKASEGEEVGELNDD-VDGVSTPLLVFSSDDDEANQDLSLKIVEKAMRMREA 219
           MGRKEK K K      + EL+DD    VSTP LVFSSDDDEANQDLSLKIVEKAMRMR A
Sbjct: 1   MGRKEKSKSKT-----MVELDDDHFTSVSTPSLVFSSDDDEANQDLSLKIVEKAMRMRTA 55

Query: 220 K-LSPNDALFNGIDCVDSPLQQPELVVAPTGAGSDGPEGIAALEVMEKEETTKLR 381
           K  +P+D +           Q  EL VA      D P  IA  E  EK++TTKL+
Sbjct: 56  KCAAPSDDVLLS--------QTLELAVARNVDVPDVPSAIADSEGKEKKKTTKLK 102


>ref|XP_020223827.1| zinc finger CCHC domain-containing protein 7 [Cajanus cajan]
          Length = 382

 Score =  526 bits (1356), Expect = 0.0
 Identities = 238/304 (78%), Positives = 268/304 (88%)
 Frame = +3

Query: 513  IVLRKLLRGPRYFDPPDSSWGACYNCGEEGHAAVNCTAARRQKPCYVCGGLGHNAKQCTK 692
            +VLRKLLRGPRYFDPP +SWGAC+NCGEEGHAAVNC+AA+R+KPCYVCGGLGHNAKQCTK
Sbjct: 1    MVLRKLLRGPRYFDPPGNSWGACFNCGEEGHAAVNCSAAKRKKPCYVCGGLGHNAKQCTK 60

Query: 693  AQDCFICKKGGHRAKDCPEKHISASKSLTVCLKCGNSGHDMFSCKNEYSLDDLKEIQCYL 872
             QDCFICKKGGHRAKDCPEKH S SKS+ +CLKCGNSGHDMF CK++YSLDDLKEIQCY+
Sbjct: 61   TQDCFICKKGGHRAKDCPEKHASTSKSICICLKCGNSGHDMFFCKSDYSLDDLKEIQCYV 120

Query: 873  CKTFGHICCANTDDATPGEISCYKCGQLGHTGLACSRLRGETTSTATPSSCYRCGEGGHF 1052
            CK  GH+CC N+DDATPGEISCYKCG LGHTGLACSRLRGE  S ATPSSC++CGE GHF
Sbjct: 121  CKRLGHLCCVNSDDATPGEISCYKCGHLGHTGLACSRLRGEIASGATPSSCFKCGEEGHF 180

Query: 1053 ARECTSSIKAGKKNREFSNTKNRRSYKENDYKGHRSAPHDLGKMHKKKRPLTEERGFTTP 1232
            ARECTSSIKAGK+N E SNTK++R  KENDY G+RSAP+D+   H+KKR  TEERGF+TP
Sbjct: 181  ARECTSSIKAGKRNYESSNTKDKRPQKENDYMGNRSAPNDV--RHRKKRSSTEERGFSTP 238

Query: 1233 KKSKSRGGWMRELPTEERGFSTPKKSRKSRGGWTREHPTEERGFSTPKKSRKSRGGWTTE 1412
            KKSKSRGGWM E P EE+ +++PKKS K RGGWT EHP E+RG++TP KS KSRGGW TE
Sbjct: 239  KKSKSRGGWMVEHPAEEKDYTSPKKS-KHRGGWTSEHPPEQRGYTTPMKS-KSRGGWKTE 296

Query: 1413 HPRE 1424
            HP E
Sbjct: 297  HPEE 300


>gb|KRH17934.1| hypothetical protein GLYMA_13G028000 [Glycine max]
          Length = 525

 Score =  525 bits (1353), Expect = 0.0
 Identities = 245/352 (69%), Positives = 283/352 (80%)
 Frame = +3

Query: 369  DKAEXXXXXXESGDQSXXXXXXXXXXXXXKTTEKHESVEASAVQIGDNIVLRKLLRGPRY 548
            +K +      E+GDQS               TE H  VE    +IGDN+VLRKLLRGPRY
Sbjct: 96   EKKKTAKLKVEAGDQSVVIAEEQEMEETINATENH--VEGRP-EIGDNMVLRKLLRGPRY 152

Query: 549  FDPPDSSWGACYNCGEEGHAAVNCTAARRQKPCYVCGGLGHNAKQCTKAQDCFICKKGGH 728
            FDPPD+SWGAC+NCGEEGHAAVNC+A +R+KPCYVCG LGHNA+QC+K QDCFICKKGGH
Sbjct: 153  FDPPDNSWGACFNCGEEGHAAVNCSAVKRKKPCYVCGCLGHNARQCSKVQDCFICKKGGH 212

Query: 729  RAKDCPEKHISASKSLTVCLKCGNSGHDMFSCKNEYSLDDLKEIQCYLCKTFGHICCANT 908
            RAKDCPEKH S SKS+ +CLKCGNSGHD+FSC+N+YS DDLKEIQCY+CK  GH+CC NT
Sbjct: 213  RAKDCPEKHTSTSKSIAICLKCGNSGHDIFSCRNDYSQDDLKEIQCYVCKRLGHLCCVNT 272

Query: 909  DDATPGEISCYKCGQLGHTGLACSRLRGETTSTATPSSCYRCGEGGHFARECTSSIKAGK 1088
            DDAT GEISCYKCGQLGH GLAC RL+ E  S ATPSSC++CGE GHFARECTSSI +GK
Sbjct: 273  DDATAGEISCYKCGQLGHMGLACLRLQDEIASGATPSSCFKCGEEGHFARECTSSINSGK 332

Query: 1089 KNREFSNTKNRRSYKENDYKGHRSAPHDLGKMHKKKRPLTEERGFTTPKKSKSRGGWMRE 1268
             N E S TK++RS KEN Y G+RSAP+D+    +KKR  TEERGF+TPKKSKSRGGWM E
Sbjct: 333  GNWESSRTKDKRSQKENRYMGNRSAPNDISGARRKKRSPTEERGFSTPKKSKSRGGWMAE 392

Query: 1269 LPTEERGFSTPKKSRKSRGGWTREHPTEERGFSTPKKSRKSRGGWTTEHPRE 1424
             PT+ERGF+TPKKS KSRGG T EHP+E++ ++TPKKS KSRGGWT+EHP E
Sbjct: 393  YPTKERGFTTPKKS-KSRGGCTTEHPSEQQDYATPKKS-KSRGGWTSEHPEE 442



 Score =  112 bits (281), Expect = 5e-23
 Identities = 66/114 (57%), Positives = 78/114 (68%), Gaps = 1/114 (0%)
 Frame = +1

Query: 43  MGRKEKPKVKASEGEEVGELNDDVDGVSTPLLVFSSDDDEANQDLSLKIVEKAMRMREAK 222
           MGRKEK   KA E E      D+ +G STP LVFSSDDDEANQDLSLKI++KAMRMR AK
Sbjct: 1   MGRKEKQNTKAIEEERD---QDNFNGASTPPLVFSSDDDEANQDLSLKIIKKAMRMRTAK 57

Query: 223 LSPNDALFNGIDCVDSPL-QQPELVVAPTGAGSDGPEGIAALEVMEKEETTKLR 381
            +PND        V SP  Q+P+L + P+G  SDGP  IA  EVMEK++T KL+
Sbjct: 58  HAPNDD-------VSSPFSQKPDLALPPSGGVSDGPSAIADSEVMEKKKTAKLK 104


>ref|XP_014516861.1| zinc finger CCHC domain-containing protein 7 [Vigna radiata var.
            radiata]
          Length = 522

 Score =  523 bits (1347), Expect = e-180
 Identities = 239/327 (73%), Positives = 278/327 (85%), Gaps = 4/327 (1%)
 Frame = +3

Query: 456  KTTEKHESVEASAVQIGDNIVLRKLLRGPRYFDPPDSSWGACYNC---GEEGHAAVNCTA 626
            K TE ++S E  AVQ  DN+VLRKLLRGPRYFDPP+ SWGAC+NC   GEEGHAAVNC+ 
Sbjct: 117  KDTE-NQSAEGGAVQTVDNMVLRKLLRGPRYFDPPNDSWGACFNCFNCGEEGHAAVNCSV 175

Query: 627  ARRQKPCYVCGGLGHNAKQCTKAQDCFICKKGGHRAKDCPEKHISASKSLTVCLKCGNSG 806
            A+R+KPCYVCG LGHNAKQCTK QDCFICKKGGHRA+DCPEKH S  + + +CLKCGNSG
Sbjct: 176  AKRKKPCYVCGVLGHNAKQCTKTQDCFICKKGGHRARDCPEKHASTPRIIAICLKCGNSG 235

Query: 807  HDMFSCKNEYSLDDLKEIQCYLCKTFGHICCANTDDATPGEISCYKCGQLGHTGLACSRL 986
            HDMF CKN+YSLDDL+EIQCY+CK  GH+CC N+DDATPGEISCYKCG+LGHTGLACSRL
Sbjct: 236  HDMFGCKNDYSLDDLQEIQCYVCKRLGHLCCVNSDDATPGEISCYKCGRLGHTGLACSRL 295

Query: 987  RGETTSTATPSSCYRCGEGGHFARECTSSIKAGKKNREFSNTKNRRSYKENDYKGHRSAP 1166
            + E  S ATPSSC++CGE GHFARECTS++K GK++R+ S TK++RS+KENDY G+RSAP
Sbjct: 296  QDEIASGATPSSCFKCGEEGHFARECTSAVKTGKRSRDSSRTKDKRSHKENDYIGNRSAP 355

Query: 1167 HDLGKMHKKKRPLTEER-GFTTPKKSKSRGGWMRELPTEERGFSTPKKSRKSRGGWTREH 1343
            +D+G   +KKR  TEER GF+ PKKSKSRGGWM+E P EERGF+TPKKS   RGGWT EH
Sbjct: 356  NDMGVARRKKRLPTEERGGFSLPKKSKSRGGWMQEHPAEERGFTTPKKS-NGRGGWTTEH 414

Query: 1344 PTEERGFSTPKKSRKSRGGWTTEHPRE 1424
            P E +G++TPKKS KSRGGWTTEHP E
Sbjct: 415  PAEHKGYTTPKKS-KSRGGWTTEHPEE 440



 Score = 90.5 bits (223), Expect = 1e-15
 Identities = 65/115 (56%), Positives = 73/115 (63%), Gaps = 2/115 (1%)
 Frame = +1

Query: 43  MGRKEKPKVKASEGEEVGELNDD-VDGVSTPLLVFSSDDDEANQDLSLKIVEKAMRMREA 219
           MGRKEK K KA     V EL+DD   G STP LVFSSD+DEANQDLSLKIVEKAMRMREA
Sbjct: 1   MGRKEKSKSKA-----VVELHDDQFTGASTPSLVFSSDEDEANQDLSLKIVEKAMRMREA 55

Query: 220 K-LSPNDALFNGIDCVDSPLQQPELVVAPTGAGSDGPEGIAALEVMEKEETTKLR 381
           K  +P+D +        S  Q  E  VA      D P  IA  EV EK++TTKL+
Sbjct: 56  KCATPSDHV--------SLSQTLEFAVA-RNVVPDVPSAIADSEVKEKKKTTKLK 101


>ref|XP_003542327.1| PREDICTED: protein AIR1-like isoform X1 [Glycine max]
 gb|KHN03658.1| Cellular nucleic acid-binding protein like [Glycine soja]
 gb|KRH17935.1| hypothetical protein GLYMA_13G028000 [Glycine max]
          Length = 529

 Score =  520 bits (1339), Expect = e-178
 Identities = 245/356 (68%), Positives = 284/356 (79%), Gaps = 4/356 (1%)
 Frame = +3

Query: 369  DKAEXXXXXXESGDQSXXXXXXXXXXXXXKTTEKHESVEASAVQIGDNIVLRKLLRGPRY 548
            +K +      E+GDQS               TE H  VE    +IGDN+VLRKLLRGPRY
Sbjct: 96   EKKKTAKLKVEAGDQSVVIAEEQEMEETINATENH--VEGRP-EIGDNMVLRKLLRGPRY 152

Query: 549  FDPPDSSWGACYNCGEEGHAAVNCTAARRQKPCYVCGGLGHNAKQCTKAQDCFICKKGGH 728
            FDPPD+SWGAC+NCGEEGHAAVNC+A +R+KPCYVCG LGHNA+QC+K QDCFICKKGGH
Sbjct: 153  FDPPDNSWGACFNCGEEGHAAVNCSAVKRKKPCYVCGCLGHNARQCSKVQDCFICKKGGH 212

Query: 729  RAKDCPEKHISASKSLTVCLKCGNSGHDMFSCKNEYSLDDLKEIQCYLCKTFGHICCANT 908
            RAKDCPEKH S SKS+ +CLKCGNSGHD+FSC+N+YS DDLKEIQCY+CK  GH+CC NT
Sbjct: 213  RAKDCPEKHTSTSKSIAICLKCGNSGHDIFSCRNDYSQDDLKEIQCYVCKRLGHLCCVNT 272

Query: 909  DDATPGEISCYKCGQLGHTGLACSRLRGETTSTATPSSCYRCGEGGHFARECTSSI---- 1076
            DDAT GEISCYKCGQLGH GLAC RL+ E  S ATPSSC++CGE GHFARECTSSI    
Sbjct: 273  DDATAGEISCYKCGQLGHMGLACLRLQDEIASGATPSSCFKCGEEGHFARECTSSINFPP 332

Query: 1077 KAGKKNREFSNTKNRRSYKENDYKGHRSAPHDLGKMHKKKRPLTEERGFTTPKKSKSRGG 1256
            ++GK N E S TK++RS KEN Y G+RSAP+D+    +KKR  TEERGF+TPKKSKSRGG
Sbjct: 333  QSGKGNWESSRTKDKRSQKENRYMGNRSAPNDISGARRKKRSPTEERGFSTPKKSKSRGG 392

Query: 1257 WMRELPTEERGFSTPKKSRKSRGGWTREHPTEERGFSTPKKSRKSRGGWTTEHPRE 1424
            WM E PT+ERGF+TPKKS KSRGG T EHP+E++ ++TPKKS KSRGGWT+EHP E
Sbjct: 393  WMAEYPTKERGFTTPKKS-KSRGGCTTEHPSEQQDYATPKKS-KSRGGWTSEHPEE 446



 Score =  112 bits (281), Expect = 5e-23
 Identities = 66/114 (57%), Positives = 78/114 (68%), Gaps = 1/114 (0%)
 Frame = +1

Query: 43  MGRKEKPKVKASEGEEVGELNDDVDGVSTPLLVFSSDDDEANQDLSLKIVEKAMRMREAK 222
           MGRKEK   KA E E      D+ +G STP LVFSSDDDEANQDLSLKI++KAMRMR AK
Sbjct: 1   MGRKEKQNTKAIEEERD---QDNFNGASTPPLVFSSDDDEANQDLSLKIIKKAMRMRTAK 57

Query: 223 LSPNDALFNGIDCVDSPL-QQPELVVAPTGAGSDGPEGIAALEVMEKEETTKLR 381
            +PND        V SP  Q+P+L + P+G  SDGP  IA  EVMEK++T KL+
Sbjct: 58  HAPNDD-------VSSPFSQKPDLALPPSGGVSDGPSAIADSEVMEKKKTAKLK 104


>gb|ACU17716.1| unknown [Glycine max]
          Length = 389

 Score =  502 bits (1293), Expect = e-174
 Identities = 229/308 (74%), Positives = 264/308 (85%), Gaps = 4/308 (1%)
 Frame = +3

Query: 513  IVLRKLLRGPRYFDPPDSSWGACYNCGEEGHAAVNCTAARRQKPCYVCGGLGHNAKQCTK 692
            +VLRKLLRGPRYFDPPD+SWGAC+NCGEEGHAAVNC+A +R+KPCYVCG LGHNA+QC+K
Sbjct: 1    MVLRKLLRGPRYFDPPDNSWGACFNCGEEGHAAVNCSAVKRKKPCYVCGCLGHNARQCSK 60

Query: 693  AQDCFICKKGGHRAKDCPEKHISASKSLTVCLKCGNSGHDMFSCKNEYSLDDLKEIQCYL 872
             QDCFICKK GHRAKDCPEKH S SKS+ +CLKCGNSGHD+FSC+N+YS DDLKEIQCY+
Sbjct: 61   VQDCFICKKDGHRAKDCPEKHTSTSKSIAICLKCGNSGHDIFSCRNDYSQDDLKEIQCYV 120

Query: 873  CKTFGHICCANTDDATPGEISCYKCGQLGHTGLACSRLRGETTSTATPSSCYRCGEGGHF 1052
            CK  GH+CC NTDDAT GEISCYKCGQLGH GLAC RL+ E  S ATPSSC++CGE GHF
Sbjct: 121  CKRLGHLCCVNTDDATAGEISCYKCGQLGHMGLACLRLQDEIASGATPSSCFKCGEEGHF 180

Query: 1053 ARECTSSI----KAGKKNREFSNTKNRRSYKENDYKGHRSAPHDLGKMHKKKRPLTEERG 1220
            ARECTSSI    ++GK N E S TK++RS KEN Y G+RSAP+D+    +KKR  TEERG
Sbjct: 181  ARECTSSINFPPQSGKGNWESSRTKDKRSQKENRYMGNRSAPNDISGARRKKRSPTEERG 240

Query: 1221 FTTPKKSKSRGGWMRELPTEERGFSTPKKSRKSRGGWTREHPTEERGFSTPKKSRKSRGG 1400
            F+TPKKSKSRGGWM E PT+ERGF+TPKKS KSRGG T EHP+E++ ++TPKKS KSRGG
Sbjct: 241  FSTPKKSKSRGGWMAEYPTKERGFTTPKKS-KSRGGCTTEHPSEQQDYATPKKS-KSRGG 298

Query: 1401 WTTEHPRE 1424
            WT+EHP E
Sbjct: 299  WTSEHPEE 306


>ref|XP_006593907.1| PREDICTED: protein AIR1-like isoform X2 [Glycine max]
 gb|KRH17933.1| hypothetical protein GLYMA_13G028000 [Glycine max]
          Length = 523

 Score =  505 bits (1301), Expect = e-173
 Identities = 241/356 (67%), Positives = 279/356 (78%), Gaps = 4/356 (1%)
 Frame = +3

Query: 369  DKAEXXXXXXESGDQSXXXXXXXXXXXXXKTTEKHESVEASAVQIGDNIVLRKLLRGPRY 548
            +K +      E+GDQS               TE H  VE    +IGDN+VLRKLLRGPRY
Sbjct: 96   EKKKTAKLKVEAGDQSVVIAEEQEMEETINATENH--VEGRP-EIGDNMVLRKLLRGPRY 152

Query: 549  FDPPDSSWGACYNCGEEGHAAVNCTAARRQKPCYVCGGLGHNAKQCTKAQDCFICKKGGH 728
            FDPPD+SWGAC+NCGEEGHAAVNC+A +R+KPCYVCG LGHNA+QC+K QDCFICKKGGH
Sbjct: 153  FDPPDNSWGACFNCGEEGHAAVNCSAVKRKKPCYVCGCLGHNARQCSKVQDCFICKKGGH 212

Query: 729  RAKDCPEKHISASKSLTVCLKCGNSGHDMFSCKNEYSLDDLKEIQCYLCKTFGHICCANT 908
            RAKDCPEKH S SKS+ +CLKCGNSGHD+FSC+N+YS DDLKEIQCY+CK  GH+CC NT
Sbjct: 213  RAKDCPEKHTSTSKSIAICLKCGNSGHDIFSCRNDYSQDDLKEIQCYVCKRLGHLCCVNT 272

Query: 909  DDATPGEISCYKCGQLGHTGLACSRLRGETTSTATPSSCYRCGEGGHFARECTSSI---- 1076
            DDAT GEISCYKCGQLGH GL       E  S ATPSSC++CGE GHFARECTSSI    
Sbjct: 273  DDATAGEISCYKCGQLGHMGL------DEIASGATPSSCFKCGEEGHFARECTSSINFPP 326

Query: 1077 KAGKKNREFSNTKNRRSYKENDYKGHRSAPHDLGKMHKKKRPLTEERGFTTPKKSKSRGG 1256
            ++GK N E S TK++RS KEN Y G+RSAP+D+    +KKR  TEERGF+TPKKSKSRGG
Sbjct: 327  QSGKGNWESSRTKDKRSQKENRYMGNRSAPNDISGARRKKRSPTEERGFSTPKKSKSRGG 386

Query: 1257 WMRELPTEERGFSTPKKSRKSRGGWTREHPTEERGFSTPKKSRKSRGGWTTEHPRE 1424
            WM E PT+ERGF+TPKKS KSRGG T EHP+E++ ++TPKKS KSRGGWT+EHP E
Sbjct: 387  WMAEYPTKERGFTTPKKS-KSRGGCTTEHPSEQQDYATPKKS-KSRGGWTSEHPEE 440



 Score =  112 bits (281), Expect = 5e-23
 Identities = 66/114 (57%), Positives = 78/114 (68%), Gaps = 1/114 (0%)
 Frame = +1

Query: 43  MGRKEKPKVKASEGEEVGELNDDVDGVSTPLLVFSSDDDEANQDLSLKIVEKAMRMREAK 222
           MGRKEK   KA E E      D+ +G STP LVFSSDDDEANQDLSLKI++KAMRMR AK
Sbjct: 1   MGRKEKQNTKAIEEERD---QDNFNGASTPPLVFSSDDDEANQDLSLKIIKKAMRMRTAK 57

Query: 223 LSPNDALFNGIDCVDSPL-QQPELVVAPTGAGSDGPEGIAALEVMEKEETTKLR 381
            +PND        V SP  Q+P+L + P+G  SDGP  IA  EVMEK++T KL+
Sbjct: 58  HAPNDD-------VSSPFSQKPDLALPPSGGVSDGPSAIADSEVMEKKKTAKLK 104


>gb|KRG89903.1| hypothetical protein GLYMA_20G055200 [Glycine max]
          Length = 483

 Score =  483 bits (1242), Expect = e-164
 Identities = 224/306 (73%), Positives = 251/306 (82%)
 Frame = +3

Query: 402  SGDQSXXXXXXXXXXXXXKTTEKHESVEASAVQIGDNIVLRKLLRGPRYFDPPDSSWGAC 581
            +GDQS               TE HE VE S V IG N+VLRKLLRGPRYFDPPDSSWGAC
Sbjct: 108  AGDQSVVIAEEQEMEETSNATENHEFVEGSPVLIGHNMVLRKLLRGPRYFDPPDSSWGAC 167

Query: 582  YNCGEEGHAAVNCTAARRQKPCYVCGGLGHNAKQCTKAQDCFICKKGGHRAKDCPEKHIS 761
            +NCGE+GHAAVNC+AA+R+KPCYVCGGLGHNA+QCTKAQDCFICKKGGHRAKDC EKH S
Sbjct: 168  FNCGEDGHAAVNCSAAKRKKPCYVCGGLGHNARQCTKAQDCFICKKGGHRAKDCLEKHTS 227

Query: 762  ASKSLTVCLKCGNSGHDMFSCKNEYSLDDLKEIQCYLCKTFGHICCANTDDATPGEISCY 941
             SKS+ +CLKCGNSGHDMFSC+N+YS DDLKEIQCY+CK  GH+CC NTDDATPGEISCY
Sbjct: 228  RSKSVAICLKCGNSGHDMFSCRNDYSPDDLKEIQCYVCKRVGHLCCVNTDDATPGEISCY 287

Query: 942  KCGQLGHTGLACSRLRGETTSTATPSSCYRCGEGGHFARECTSSIKAGKKNREFSNTKNR 1121
            KCGQLGHTGLACSRLR E TS ATPSSC++CGE GHFARECTSSIK+GK+N E S+TK++
Sbjct: 288  KCGQLGHTGLACSRLRDEITSGATPSSCFKCGEEGHFARECTSSIKSGKRNWESSHTKDK 347

Query: 1122 RSYKENDYKGHRSAPHDLGKMHKKKRPLTEERGFTTPKKSKSRGGWMRELPTEERGFSTP 1301
            RS KENDY G+RSA +D+    +KKR  TEERGF+TPKKSKSRGGWM E P E   F  P
Sbjct: 348  RSQKENDYMGNRSASNDMVGARRKKRSPTEERGFSTPKKSKSRGGWMSEHPEE---FFPP 404

Query: 1302 KKSRKS 1319
              SR +
Sbjct: 405  MSSRSN 410



 Score = 97.4 bits (241), Expect = 5e-18
 Identities = 65/114 (57%), Positives = 73/114 (64%), Gaps = 1/114 (0%)
 Frame = +1

Query: 43  MGRKEKPKVKASEGEEVGELNDDVDGVSTPLLVFSSDDDEANQDLSLKIVEKAMRMREAK 222
           MGRKEK   KA + EE   +N   +G STP LVFSSDDDEANQDLSLKIVEKAMRMR AK
Sbjct: 1   MGRKEKQNAKAID-EEHDVVN--FNGASTPSLVFSSDDDEANQDLSLKIVEKAMRMRAAK 57

Query: 223 LSPNDALFNGIDCVDSPL-QQPELVVAPTGAGSDGPEGIAALEVMEKEETTKLR 381
            +PND        V SP  Q+ EL V      SD P  IA  EV EK++T KL+
Sbjct: 58  HAPNDD-------VSSPFSQKSELAVPLNDVVSDLPSAIADSEVTEKKKTAKLK 104


>gb|KRG89906.1| hypothetical protein GLYMA_20G055200 [Glycine max]
          Length = 482

 Score =  395 bits (1015), Expect(3) = e-156
 Identities = 177/226 (78%), Positives = 193/226 (85%)
 Frame = +3

Query: 402  SGDQSXXXXXXXXXXXXXKTTEKHESVEASAVQIGDNIVLRKLLRGPRYFDPPDSSWGAC 581
            +GDQS               TE HE VE S V IG N+VLRKLLRGPRYFDPPDSSWGAC
Sbjct: 108  AGDQSVVIAEEQEMEETSNATENHEFVEGSPVLIGHNMVLRKLLRGPRYFDPPDSSWGAC 167

Query: 582  YNCGEEGHAAVNCTAARRQKPCYVCGGLGHNAKQCTKAQDCFICKKGGHRAKDCPEKHIS 761
            +NCGE+GHAAVNC+AA+R+KPCYVCGGLGHNA+QCTKAQDCFICKKGGHRAKDC EKH S
Sbjct: 168  FNCGEDGHAAVNCSAAKRKKPCYVCGGLGHNARQCTKAQDCFICKKGGHRAKDCLEKHTS 227

Query: 762  ASKSLTVCLKCGNSGHDMFSCKNEYSLDDLKEIQCYLCKTFGHICCANTDDATPGEISCY 941
             SKS+ +CLKCGNSGHDMFSC+N+YS DDLKEIQCY+CK  GH+CC NTDDATPGEISCY
Sbjct: 228  RSKSVAICLKCGNSGHDMFSCRNDYSPDDLKEIQCYVCKRVGHLCCVNTDDATPGEISCY 287

Query: 942  KCGQLGHTGLACSRLRGETTSTATPSSCYRCGEGGHFARECTSSIK 1079
            KCGQLGHTGLACSRLR E TS ATPSSC++CGE GHFARECTSSIK
Sbjct: 288  KCGQLGHTGLACSRLRDEITSGATPSSCFKCGEEGHFARECTSSIK 333



 Score =  112 bits (279), Expect(3) = e-156
 Identities = 70/115 (60%), Positives = 79/115 (68%)
 Frame = +1

Query: 1084 GRRIVSSQTQKIEDPIKKMITRDIGLHLMIWVRCIKRNGLLQKKEALQHLKNLRVEVVG* 1263
            GR I +  TQKI+DP KKMIT +I LHLMIW+   +RN  L KKEA Q  +N RVEVVG 
Sbjct: 334  GRGIGNHHTQKIKDPKKKMITWEIDLHLMIWLGLAERNVHLPKKEAFQLPRNQRVEVVGR 393

Query: 1264 GSFLQKKEAFQPPRNQERVEVAGQESILQKKEAFQPPRNQERVEVAGQQSILEKK 1428
             S L KKEA Q  RN  +VEVAGQ+SIL KKEA    RN  R+EVAG QSIL  K
Sbjct: 394  RSILPKKEALQLQRN-PKVEVAGQQSILLKKEALPLQRN-PRIEVAGHQSILWNK 446



 Score = 97.4 bits (241), Expect(3) = e-156
 Identities = 65/114 (57%), Positives = 73/114 (64%), Gaps = 1/114 (0%)
 Frame = +1

Query: 43  MGRKEKPKVKASEGEEVGELNDDVDGVSTPLLVFSSDDDEANQDLSLKIVEKAMRMREAK 222
           MGRKEK   KA + EE   +N   +G STP LVFSSDDDEANQDLSLKIVEKAMRMR AK
Sbjct: 1   MGRKEKQNAKAID-EEHDVVN--FNGASTPSLVFSSDDDEANQDLSLKIVEKAMRMRAAK 57

Query: 223 LSPNDALFNGIDCVDSPL-QQPELVVAPTGAGSDGPEGIAALEVMEKEETTKLR 381
            +PND        V SP  Q+ EL V      SD P  IA  EV EK++T KL+
Sbjct: 58  HAPNDD-------VSSPFSQKSELAVPLNDVVSDLPSAIADSEVTEKKKTAKLK 104


>ref|XP_015973610.1| protein AIR2 [Arachis duranensis]
          Length = 510

 Score =  451 bits (1160), Expect = e-152
 Identities = 206/290 (71%), Positives = 234/290 (80%), Gaps = 2/290 (0%)
 Frame = +3

Query: 456  KTTEKHESVEASA-VQIGDNIVLRKLLRGPRYFDPPDSS-WGACYNCGEEGHAAVNCTAA 629
            K  E  E  E  A VQ+GDN VLRKLLRGPRYFD PDS  WGACYNCGEEGHAAVNCTAA
Sbjct: 152  KEAENDEPAEEPAMVQMGDNAVLRKLLRGPRYFDAPDSGGWGACYNCGEEGHAAVNCTAA 211

Query: 630  RRQKPCYVCGGLGHNAKQCTKAQDCFICKKGGHRAKDCPEKHISASKSLTVCLKCGNSGH 809
            +R+KPCYVCG L HNAK CT  +DC+ICKKGGHRAKDCPEK++  S+SL +CLKCG+SGH
Sbjct: 212  KRKKPCYVCGSLEHNAKHCTMGRDCYICKKGGHRAKDCPEKNLIGSQSLKICLKCGDSGH 271

Query: 810  DMFSCKNEYSLDDLKEIQCYLCKTFGHICCANTDDATPGEISCYKCGQLGHTGLACSRLR 989
            +MFSCKN+YS DDLKE+QCY+CK FGH+CC NT D+TP  ISCY+CGQLGHTGLAC+RLR
Sbjct: 272  EMFSCKNDYSPDDLKEVQCYVCKKFGHLCCVNTADSTPRVISCYQCGQLGHTGLACARLR 331

Query: 990  GETTSTATPSSCYRCGEGGHFARECTSSIKAGKKNREFSNTKNRRSYKENDYKGHRSAPH 1169
             E    ATPSSCYRCGE GHFARECTSS+K GK+ RE SNTK  +  KENDY GHRSAPH
Sbjct: 332  TEAADAATPSSCYRCGEAGHFARECTSSVKLGKRRRESSNTKTPKFQKENDYVGHRSAPH 391

Query: 1170 DLGKMHKKKRPLTEERGFTTPKKSKSRGGWMRELPTEERGFSTPKKSRKS 1319
            D+GK  KKK+P TEE+G TTP+K K RGGWM E P E     +P KS++S
Sbjct: 392  DIGKSWKKKKPFTEEKGLTTPRKPKHRGGWMTEHPAE----FSPSKSKRS 437



 Score = 63.9 bits (154), Expect = 4e-07
 Identities = 51/120 (42%), Positives = 69/120 (57%), Gaps = 7/120 (5%)
 Frame = +1

Query: 43  MGRKEKPKVKASEGEEVGELNDDVDGVSTPLLVF--SSDDDEANQDLSLKIVEKAMRMRE 216
           MGRK+K + K +E EE  E    +   STP LVF  SSDD+EAN+DLSL IVEKA+  R 
Sbjct: 1   MGRKDKQRAKKTEHEEEEEHGSPMG--STPPLVFEVSSDDEEANEDLSLGIVEKALMRR- 57

Query: 217 AKLSPNDALFNGIDCV---DSPLQQPELVVAPTGAGSDGPEGI--AALEVMEKEETTKLR 381
            KL  ND + NG D +    S  +Q E+ VA         EG+   A EV++  ++ +L+
Sbjct: 58  -KLPRNDVVSNGDDAIILGASSSRQDEVAVARN-------EGVLNEAREVVDVSDSEELK 109


>ref|XP_016166718.1| protein AIR2 [Arachis ipaensis]
          Length = 510

 Score =  448 bits (1153), Expect = e-151
 Identities = 206/290 (71%), Positives = 231/290 (79%), Gaps = 2/290 (0%)
 Frame = +3

Query: 456  KTTEKHESVEASA-VQIGDNIVLRKLLRGPRYFDPPDSS-WGACYNCGEEGHAAVNCTAA 629
            K  EK E  E  A VQ+GDN VLRKLLRGPRYFD PDS  WG CYNCGEEGHAAVNCTAA
Sbjct: 152  KEAEKDEPAEEPAMVQMGDNAVLRKLLRGPRYFDAPDSGGWGTCYNCGEEGHAAVNCTAA 211

Query: 630  RRQKPCYVCGGLGHNAKQCTKAQDCFICKKGGHRAKDCPEKHISASKSLTVCLKCGNSGH 809
            +R+KPCYVCG L H AK CT  +DC+ICKKGGHRAKDCPEK++  S+SL +CLKCG+SGH
Sbjct: 212  KRKKPCYVCGSLEHYAKHCTMGRDCYICKKGGHRAKDCPEKNLIGSQSLNICLKCGDSGH 271

Query: 810  DMFSCKNEYSLDDLKEIQCYLCKTFGHICCANTDDATPGEISCYKCGQLGHTGLACSRLR 989
            +MFSCKN+YS DDLKE+QCY+CK FGH+CC NT D+TP  ISCY+CGQLGHTGLAC+RLR
Sbjct: 272  EMFSCKNDYSPDDLKEVQCYVCKKFGHLCCVNTADSTPRVISCYQCGQLGHTGLACARLR 331

Query: 990  GETTSTATPSSCYRCGEGGHFARECTSSIKAGKKNREFSNTKNRRSYKENDYKGHRSAPH 1169
             E    ATPSSCYRCGE GHFARECTSS+K GK+ RE SNTK  +  KENDY GHRSAPH
Sbjct: 332  TEAADAATPSSCYRCGEAGHFARECTSSVKLGKRRRESSNTKTPKFQKENDYVGHRSAPH 391

Query: 1170 DLGKMHKKKRPLTEERGFTTPKKSKSRGGWMRELPTEERGFSTPKKSRKS 1319
            D+GK  KKK+P TEE+G TTP+K K RGGWM E P E   FS  K  R S
Sbjct: 392  DIGKSWKKKKPFTEEKGLTTPRKPKHRGGWMTEHPAE---FSPSKPKRSS 438



 Score = 64.3 bits (155), Expect = 3e-07
 Identities = 51/114 (44%), Positives = 67/114 (58%), Gaps = 6/114 (5%)
 Frame = +1

Query: 43  MGRKEKPKVKASEGEEVGELNDDVDGVSTPLLVF--SSDDDEANQDLSLKIVEKAMRMRE 216
           MGRK+K + K +E EE  E    +   STP LVF  SSDD+EAN+DLSL+IVEKA+  R 
Sbjct: 1   MGRKDKQRAKKTEHEEEEEHGSPMG--STPPLVFEVSSDDEEANEDLSLRIVEKALMRR- 57

Query: 217 AKLSPNDALFNGIDCV---DSPLQQPELVVAPT-GAGSDGPEGIAALEVMEKEE 366
            KL  ND   NG D +    S  +Q E+ VA   GA ++  E    ++V + EE
Sbjct: 58  -KLPRNDVDSNGDDAIILGGSSSRQDEVAVARNEGALNEARE---VVDVSDSEE 107


>gb|KRG89905.1| hypothetical protein GLYMA_20G055200 [Glycine max]
          Length = 391

 Score =  416 bits (1068), Expect = e-139
 Identities = 192/246 (78%), Positives = 216/246 (87%)
 Frame = +3

Query: 690  KAQDCFICKKGGHRAKDCPEKHISASKSLTVCLKCGNSGHDMFSCKNEYSLDDLKEIQCY 869
            +AQDCFICKKGGHRAKDC EKH S SKS+ +CLKCGNSGHDMFSC+N+YS DDLKEIQCY
Sbjct: 43   QAQDCFICKKGGHRAKDCLEKHTSRSKSVAICLKCGNSGHDMFSCRNDYSPDDLKEIQCY 102

Query: 870  LCKTFGHICCANTDDATPGEISCYKCGQLGHTGLACSRLRGETTSTATPSSCYRCGEGGH 1049
            +CK  GH+CC NTDDATPGEISCYKCGQLGHTGLACSRLR E TS ATPSSC++CGE GH
Sbjct: 103  VCKRVGHLCCVNTDDATPGEISCYKCGQLGHTGLACSRLRDEITSGATPSSCFKCGEEGH 162

Query: 1050 FARECTSSIKAGKKNREFSNTKNRRSYKENDYKGHRSAPHDLGKMHKKKRPLTEERGFTT 1229
            FARECTSSIK+GK+N E S+TK++RS KENDY G+RSA +D+    +KKR  TEERGF+T
Sbjct: 163  FARECTSSIKSGKRNWESSHTKDKRSQKENDYMGNRSASNDMVGARRKKRSPTEERGFST 222

Query: 1230 PKKSKSRGGWMRELPTEERGFSTPKKSRKSRGGWTREHPTEERGFSTPKKSRKSRGGWTT 1409
            PKKSKSRGGW  E PTEERGF+TPKKS KSRGGWT EHPTEERGF+TPKKS K+RGGWT+
Sbjct: 223  PKKSKSRGGWTAEYPTEERGFTTPKKS-KSRGGWTTEHPTEERGFTTPKKS-KNRGGWTS 280

Query: 1410 EHPREE 1427
            EHP E+
Sbjct: 281  EHPLEQ 286


>ref|XP_015873156.1| PREDICTED: zinc finger CCHC domain-containing protein 7-like
            [Ziziphus jujuba]
 ref|XP_015873253.1| PREDICTED: zinc finger CCHC domain-containing protein 7-like
            [Ziziphus jujuba]
          Length = 496

 Score =  419 bits (1078), Expect = e-139
 Identities = 200/327 (61%), Positives = 242/327 (74%), Gaps = 9/327 (2%)
 Frame = +3

Query: 456  KTTEKHESVEASAVQIGDNIVLRKLLRGPRYFDPPDSSWGACYNCGEEGHAAVNCTAARR 635
            K  E  E+ + + V   DNIVLRKLLRGPRYFDPPDSSWG CYNCGEEGHAAVNCTAA+R
Sbjct: 154  KAAEIAEATDPNPVDTADNIVLRKLLRGPRYFDPPDSSWGMCYNCGEEGHAAVNCTAAKR 213

Query: 636  QKPCYVCGGLGHNAKQCTKAQDCFICKKGGHRAKDCPEKHISASKSLTVCLKCGNSGHDM 815
            +KPC+VCG L HNA+QC+KAQDCFICKKGGHRAKDCP+KH   S S  +CLKCG+ GHDM
Sbjct: 214  KKPCFVCGSLEHNARQCSKAQDCFICKKGGHRAKDCPDKHKGGSSS-KICLKCGDYGHDM 272

Query: 816  FSCKNEYSLDDLKEIQCYLCKTFGHICCANTDDATPGEISCYKCGQLGHTGLACSRLRGE 995
            FSC+N+Y  DDLKEIQCY+CK FGH+CC+   D+   E+SCYKCGQLGHTGLAC+R RGE
Sbjct: 273  FSCRNDYPPDDLKEIQCYVCKNFGHLCCSEYADSFSREVSCYKCGQLGHTGLACTRFRGE 332

Query: 996  TTSTATPSSCYRCGEGGHFARECTSSIKAGKKNREFSNTKNRRSYKEN-DYKGHRSAPHD 1172
            TT   T SSC+RCG+ GHFAREC SS K GK+N   S+T N RS++E+ D+KG++S PHD
Sbjct: 333  TTGNETASSCFRCGQDGHFARECKSSAKGGKRNHA-SSTPNSRSHREDKDHKGYKSVPHD 391

Query: 1173 LGKMHKKKRPLTEERGFTTPKKSKSRGGWMRELPTEERGFSTPKKSRK--------SRGG 1328
            LGK  K+K+   EE GFTTP+KS+ RGGW+    TE+ G  + +K RK        + G 
Sbjct: 392  LGKARKRKKIQYEESGFTTPQKSRHRGGWI----TEDPGDFSLRKCRKNVFRSPATASGK 447

Query: 1329 WTREHPTEERGFSTPKKSRKSRGGWTT 1409
               +H    +GF TP +  KS+G   T
Sbjct: 448  GHSKH--SRKGFGTPTQHIKSQGSSKT 472

