BLASTX nr result

ID: Astragalus22_contig00007728 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus22_contig00007728
         (856 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|PNY02282.1| zinc finger protein, partial [Trifolium pratense]      277   4e-88
ref|XP_012573710.1| PREDICTED: protein indeterminate-domain 1 [C...   278   3e-87
ref|XP_003517239.1| PREDICTED: protein indeterminate-domain 2-li...   264   1e-81
ref|XP_020217260.1| protein indeterminate-domain 1-like isoform ...   258   1e-79
ref|XP_020217259.1| protein indeterminate-domain 2-like isoform ...   258   3e-79
gb|KOM45124.1| hypothetical protein LR48_Vigan06g043000 [Vigna a...   252   2e-77
ref|XP_017427555.1| PREDICTED: protein indeterminate-domain 2-li...   252   4e-77
ref|XP_007156918.1| hypothetical protein PHAVU_002G028100g [Phas...   252   7e-77
dbj|BAU00111.1| hypothetical protein VIGAN_10167800 [Vigna angul...   252   7e-77
ref|XP_014520893.1| protein indeterminate-domain 2 [Vigna radiat...   249   6e-76
gb|KYP66733.1| Zinc finger protein MAGPIE [Cajanus cajan]             248   1e-75
ref|XP_015964045.2| LOW QUALITY PROTEIN: protein indeterminate-d...   241   1e-72
gb|OMO56870.1| hypothetical protein CCACVL1_26206 [Corchorus cap...   230   2e-68
ref|XP_019444753.1| PREDICTED: protein indeterminate-domain 1 [L...   228   8e-68
ref|XP_017985426.1| PREDICTED: protein indeterminate-domain 2 [T...   229   9e-68
gb|EOX91151.1| C2H2-like zinc finger protein [Theobroma cacao]        228   1e-67
ref|XP_021273679.1| LOW QUALITY PROTEIN: protein indeterminate-d...   228   1e-67
gb|KHN44500.1| Zinc finger protein MAGPIE [Glycine soja]              226   3e-67
ref|XP_003547992.1| PREDICTED: protein indeterminate-domain 2-li...   226   4e-67
ref|XP_022740965.1| protein indeterminate-domain 2-like [Durio z...   225   2e-66

>gb|PNY02282.1| zinc finger protein, partial [Trifolium pratense]
          Length = 392

 Score =  277 bits (708), Expect = 4e-88
 Identities = 165/292 (56%), Positives = 174/292 (59%), Gaps = 8/292 (2%)
 Frame = +3

Query: 3   HGEXXXXXXXXXXXYAVQSDWKAHSKVCGSREYKCDYCGAVFSRRDSFITHRAFCDALAE 182
           HGE           YAVQSDWKAHSKVCGSREYKCD CG VFSRRDSFITHRAFCDALAE
Sbjct: 3   HGEKKWKCDKCSKKYAVQSDWKAHSKVCGSREYKCD-CGTVFSRRDSFITHRAFCDALAE 61

Query: 183 ENAKAQTKNMMAVKANSESDSKGLTGDDSPSKXXXXXXXXXXXXDQSNSVAVSSALQTQK 362
           ENAK+Q  N    KANSESDSK LTGD  P               QSNS  VSSAL+ QK
Sbjct: 62  ENAKSQ--NQAVGKANSESDSKVLTGDSLP----VAPTPAAITTPQSNS-GVSSALENQK 114

Query: 363 LELLENPPQIIEEQQ--------VXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVXXXXXX 518
           L+L ENPPQI+EE +                                      V      
Sbjct: 115 LDLPENPPQIVEEPEAIVTTTTAAAAAAVLNANCSSSSSTSSTSNGCAATSSGVFASLFA 174

Query: 519 XXXXXXXXXXXXQPPVFTDLIRAMGCPDRPADLSAPPSSEAISLCLSTNHGSSIFGTGGQ 698
                         PVFTDLIR+MGCPDR  D SAPPSSEAISLCLSTN GSSIFGTGGQ
Sbjct: 175 SSTASASASMQSHTPVFTDLIRSMGCPDRSTDFSAPPSSEAISLCLSTNPGSSIFGTGGQ 234

Query: 699 DHHQYAPSQQPPAMSATALLQKAAQMGATTTNASLLRGLGIMXXXXXXXXGQ 854
           +  QY P+ QPPAMSATALLQKAAQMGA  TNASLLRGLGI+        GQ
Sbjct: 235 ECRQYVPTHQPPAMSATALLQKAAQMGAAATNASLLRGLGIVSSSASTSSGQ 286


>ref|XP_012573710.1| PREDICTED: protein indeterminate-domain 1 [Cicer arietinum]
          Length = 521

 Score =  278 bits (712), Expect = 3e-87
 Identities = 167/283 (59%), Positives = 175/283 (61%), Gaps = 9/283 (3%)
 Frame = +3

Query: 3   HGEXXXXXXXXXXXYAVQSDWKAHSKVCGSREYKCDYCGAVFSRRDSFITHRAFCDALAE 182
           HGE           YAVQSDWKAHSKVCGSREYKCD CG VFSRRDSFITHRAFCDALAE
Sbjct: 131 HGEKKWKCDKCSKKYAVQSDWKAHSKVCGSREYKCD-CGTVFSRRDSFITHRAFCDALAE 189

Query: 183 ENAKAQTKNMMAVKANSESDSKGLTGDDSPSKXXXXXXXXXXXXDQSNSVAVSSALQTQK 362
           ENAK+QT      KANSESDSK LTGD SP               QSNSV VSS L+T K
Sbjct: 190 ENAKSQTVG----KANSESDSKVLTGDSSPPSMPAATVTAATTA-QSNSV-VSSGLETHK 243

Query: 363 LELLENPPQIIEEQQVXXXXXXXXXXXXXXXXXXXXXXXXXXXXX---------VXXXXX 515
           +E   NPPQIIEE QV                                      V     
Sbjct: 244 IE---NPPQIIEEPQVVVTTTTASTATTTNALNGSCSSNSASSTSNGGATTTSGVFASLF 300

Query: 516 XXXXXXXXXXXXXQPPVFTDLIRAMGCPDRPADLSAPPSSEAISLCLSTNHGSSIFGTGG 695
                        Q   FTDLIR+MGCPDRPAD SAPP+SEAISLCLSTNHGSSIFGTGG
Sbjct: 301 ASSTASTSASLQSQTLAFTDLIRSMGCPDRPADFSAPPTSEAISLCLSTNHGSSIFGTGG 360

Query: 696 QDHHQYAPSQQPPAMSATALLQKAAQMGATTTNASLLRGLGIM 824
           Q+  QYAP+ QPPAMSATALLQKAAQMGA  TNASLLRGLGI+
Sbjct: 361 QECRQYAPTPQPPAMSATALLQKAAQMGAAATNASLLRGLGIV 403


>ref|XP_003517239.1| PREDICTED: protein indeterminate-domain 2-like [Glycine max]
 gb|KRH76829.1| hypothetical protein GLYMA_01G176600 [Glycine max]
          Length = 517

 Score =  264 bits (675), Expect = 1e-81
 Identities = 156/276 (56%), Positives = 171/276 (61%), Gaps = 2/276 (0%)
 Frame = +3

Query: 3   HGEXXXXXXXXXXXYAVQSDWKAHSKVCGSREYKCDYCGAVFSRRDSFITHRAFCDALAE 182
           HGE           YAVQSDWKAHSK+CG+REYKCD CG +FSRRDSFITHRAFCDALAE
Sbjct: 132 HGEKKWKCDKCSKKYAVQSDWKAHSKICGTREYKCD-CGTLFSRRDSFITHRAFCDALAE 190

Query: 183 ENAKAQTKNMMAVKANSESDSKGLTGDDSPS-KXXXXXXXXXXXXDQSNSVAV-SSALQT 356
           E+A++Q + +   KA+SESDSK +TGD SP                QSNSV V SS+LQT
Sbjct: 191 ESARSQPQTV--AKASSESDSKAVTGDSSPPVAVEAPPPLVPPVSSQSNSVVVPSSSLQT 248

Query: 357 QKLELLENPPQIIEEQQVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVXXXXXXXXXXXX 536
           QK EL EN PQIIEE +V                             V            
Sbjct: 249 QKPELPENSPQIIEEPKVNTAMNGSCSSTSTSTTSSTSNSNSGASSSVFASLFASSSASA 308

Query: 537 XXXXXXQPPVFTDLIRAMGCPDRPADLSAPPSSEAISLCLSTNHGSSIFGTGGQDHHQYA 716
                 Q P FTDLIRAMG PD PADLS P SSE ISLCL+TNHGSSIFGTG Q+  QYA
Sbjct: 309 TASLHSQTPAFTDLIRAMGHPDHPADLSRPSSSEPISLCLATNHGSSIFGTGRQERRQYA 368

Query: 717 PSQQPPAMSATALLQKAAQMGATTTNASLLRGLGIM 824
           P  Q PAMSATALLQKAAQMGA  TNAS LRGLGI+
Sbjct: 369 PPPQ-PAMSATALLQKAAQMGAAATNASFLRGLGIV 403


>ref|XP_020217260.1| protein indeterminate-domain 1-like isoform X2 [Cajanus cajan]
          Length = 476

 Score =  258 bits (658), Expect = 1e-79
 Identities = 154/288 (53%), Positives = 166/288 (57%), Gaps = 4/288 (1%)
 Frame = +3

Query: 3   HGEXXXXXXXXXXXYAVQSDWKAHSKVCGSREYKCDYCGAVFSRRDSFITHRAFCDALAE 182
           HGE           YAVQSDWKAHSK+CG+REYKCD CG +FSRRDSFITHRAFCDALAE
Sbjct: 93  HGEKKWKCDKCSKKYAVQSDWKAHSKICGTREYKCD-CGTLFSRRDSFITHRAFCDALAE 151

Query: 183 ENAKAQTKNMMAVKANSESDSKGLTGDDSP----SKXXXXXXXXXXXXDQSNSVAVSSAL 350
           E+A++Q     A KA+SESDSK +TGD  P    +              QS SV V   L
Sbjct: 152 ESARSQPPT--AAKASSESDSKAVTGDSPPPPPPTAGAVAAPPPLPPSSQSTSVVV---L 206

Query: 351 QTQKLELLENPPQIIEEQQVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVXXXXXXXXXX 530
           QTQ  EL EN PQI+EE Q                                         
Sbjct: 207 QTQNPELPENSPQIVEEPQANTAMNGSCSSTSTTSSTSNSNSGTGSSVFASLFASSTASG 266

Query: 531 XXXXXXXXQPPVFTDLIRAMGCPDRPADLSAPPSSEAISLCLSTNHGSSIFGTGGQDHHQ 710
                   Q P FTDLIRAMG PD P DLS P SSE ISLCLSTNHGSSIFGTGGQ+  Q
Sbjct: 267 TASLSLQSQTPAFTDLIRAMGHPDHPGDLSRPSSSEPISLCLSTNHGSSIFGTGGQERRQ 326

Query: 711 YAPSQQPPAMSATALLQKAAQMGATTTNASLLRGLGIMXXXXXXXXGQ 854
           YAP  Q PAMSATALLQKAAQMGAT TNAS LRGLGI+        GQ
Sbjct: 327 YAPPPQ-PAMSATALLQKAAQMGATATNASFLRGLGIVSSSASTSSGQ 373


>ref|XP_020217259.1| protein indeterminate-domain 2-like isoform X1 [Cajanus cajan]
          Length = 515

 Score =  258 bits (658), Expect = 3e-79
 Identities = 154/288 (53%), Positives = 166/288 (57%), Gaps = 4/288 (1%)
 Frame = +3

Query: 3   HGEXXXXXXXXXXXYAVQSDWKAHSKVCGSREYKCDYCGAVFSRRDSFITHRAFCDALAE 182
           HGE           YAVQSDWKAHSK+CG+REYKCD CG +FSRRDSFITHRAFCDALAE
Sbjct: 132 HGEKKWKCDKCSKKYAVQSDWKAHSKICGTREYKCD-CGTLFSRRDSFITHRAFCDALAE 190

Query: 183 ENAKAQTKNMMAVKANSESDSKGLTGDDSP----SKXXXXXXXXXXXXDQSNSVAVSSAL 350
           E+A++Q     A KA+SESDSK +TGD  P    +              QS SV V   L
Sbjct: 191 ESARSQPPT--AAKASSESDSKAVTGDSPPPPPPTAGAVAAPPPLPPSSQSTSVVV---L 245

Query: 351 QTQKLELLENPPQIIEEQQVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVXXXXXXXXXX 530
           QTQ  EL EN PQI+EE Q                                         
Sbjct: 246 QTQNPELPENSPQIVEEPQANTAMNGSCSSTSTTSSTSNSNSGTGSSVFASLFASSTASG 305

Query: 531 XXXXXXXXQPPVFTDLIRAMGCPDRPADLSAPPSSEAISLCLSTNHGSSIFGTGGQDHHQ 710
                   Q P FTDLIRAMG PD P DLS P SSE ISLCLSTNHGSSIFGTGGQ+  Q
Sbjct: 306 TASLSLQSQTPAFTDLIRAMGHPDHPGDLSRPSSSEPISLCLSTNHGSSIFGTGGQERRQ 365

Query: 711 YAPSQQPPAMSATALLQKAAQMGATTTNASLLRGLGIMXXXXXXXXGQ 854
           YAP  Q PAMSATALLQKAAQMGAT TNAS LRGLGI+        GQ
Sbjct: 366 YAPPPQ-PAMSATALLQKAAQMGATATNASFLRGLGIVSSSASTSSGQ 412


>gb|KOM45124.1| hypothetical protein LR48_Vigan06g043000 [Vigna angularis]
          Length = 468

 Score =  252 bits (643), Expect = 2e-77
 Identities = 153/281 (54%), Positives = 169/281 (60%), Gaps = 7/281 (2%)
 Frame = +3

Query: 3   HGEXXXXXXXXXXXYAVQSDWKAHSKVCGSREYKCDYCGAVFSRRDSFITHRAFCDALAE 182
           HGE           YAVQSDWKAHSK+CG+REYKCD CG +FSRRDSFITHRAFCDALAE
Sbjct: 77  HGEKKWKCDKCSKKYAVQSDWKAHSKICGTREYKCD-CGTLFSRRDSFITHRAFCDALAE 135

Query: 183 ENAKAQTKNMMAVKANSESDSKGLTGDDSP----SKXXXXXXXXXXXXDQSNSVAVSSA- 347
           E+A++Q +   A KA+SESDSK +TGD SP    +              +SNSV VSS+ 
Sbjct: 136 ESARSQPQT--AAKASSESDSKAVTGDSSPPAAAAAATPPPPSAPPASSKSNSVVVSSSV 193

Query: 348 LQTQKLELLENPPQIIEEQQV--XXXXXXXXXXXXXXXXXXXXXXXXXXXXXVXXXXXXX 521
           LQT   EL EN PQ+IEE Q                                V       
Sbjct: 194 LQTPNPELPENSPQVIEEPQANPAVSGSCSGTSTSTSTTSSTSNSNGAASSSVFASLFAS 253

Query: 522 XXXXXXXXXXXQPPVFTDLIRAMGCPDRPADLSAPPSSEAISLCLSTNHGSSIFGTGGQD 701
                      Q P FTDLIRAMG PD PADLS P SSE ISLCL+TNHGSSIFGTG Q+
Sbjct: 254 STASATASLQSQTPAFTDLIRAMGHPDHPADLSRPSSSEPISLCLATNHGSSIFGTGLQE 313

Query: 702 HHQYAPSQQPPAMSATALLQKAAQMGATTTNASLLRGLGIM 824
             QYAP  Q PAMSATALLQKAAQMGA  TNAS LRGLGI+
Sbjct: 314 CRQYAPPPQ-PAMSATALLQKAAQMGAAATNASFLRGLGIV 353


>ref|XP_017427555.1| PREDICTED: protein indeterminate-domain 2-like [Vigna angularis]
          Length = 497

 Score =  252 bits (643), Expect = 4e-77
 Identities = 153/281 (54%), Positives = 169/281 (60%), Gaps = 7/281 (2%)
 Frame = +3

Query: 3   HGEXXXXXXXXXXXYAVQSDWKAHSKVCGSREYKCDYCGAVFSRRDSFITHRAFCDALAE 182
           HGE           YAVQSDWKAHSK+CG+REYKCD CG +FSRRDSFITHRAFCDALAE
Sbjct: 106 HGEKKWKCDKCSKKYAVQSDWKAHSKICGTREYKCD-CGTLFSRRDSFITHRAFCDALAE 164

Query: 183 ENAKAQTKNMMAVKANSESDSKGLTGDDSP----SKXXXXXXXXXXXXDQSNSVAVSSA- 347
           E+A++Q +   A KA+SESDSK +TGD SP    +              +SNSV VSS+ 
Sbjct: 165 ESARSQPQT--AAKASSESDSKAVTGDSSPPAAAAAATPPPPSAPPASSKSNSVVVSSSV 222

Query: 348 LQTQKLELLENPPQIIEEQQV--XXXXXXXXXXXXXXXXXXXXXXXXXXXXXVXXXXXXX 521
           LQT   EL EN PQ+IEE Q                                V       
Sbjct: 223 LQTPNPELPENSPQVIEEPQANPAVSGSCSGTSTSTSTTSSTSNSNGAASSSVFASLFAS 282

Query: 522 XXXXXXXXXXXQPPVFTDLIRAMGCPDRPADLSAPPSSEAISLCLSTNHGSSIFGTGGQD 701
                      Q P FTDLIRAMG PD PADLS P SSE ISLCL+TNHGSSIFGTG Q+
Sbjct: 283 STASATASLQSQTPAFTDLIRAMGHPDHPADLSRPSSSEPISLCLATNHGSSIFGTGLQE 342

Query: 702 HHQYAPSQQPPAMSATALLQKAAQMGATTTNASLLRGLGIM 824
             QYAP  Q PAMSATALLQKAAQMGA  TNAS LRGLGI+
Sbjct: 343 CRQYAPPPQ-PAMSATALLQKAAQMGAAATNASFLRGLGIV 382


>ref|XP_007156918.1| hypothetical protein PHAVU_002G028100g [Phaseolus vulgaris]
 gb|ESW28912.1| hypothetical protein PHAVU_002G028100g [Phaseolus vulgaris]
          Length = 521

 Score =  252 bits (643), Expect = 7e-77
 Identities = 153/280 (54%), Positives = 169/280 (60%), Gaps = 6/280 (2%)
 Frame = +3

Query: 3   HGEXXXXXXXXXXXYAVQSDWKAHSKVCGSREYKCDYCGAVFSRRDSFITHRAFCDALAE 182
           HGE           YAVQSDWKAHSK+CG+REYKCD CG +FSRRDSFITHRAFCDALAE
Sbjct: 132 HGEKKWKCDKCSKKYAVQSDWKAHSKICGTREYKCD-CGTLFSRRDSFITHRAFCDALAE 190

Query: 183 ENAKAQTKNMMAVKANSESDSKGLTGDDSP--SKXXXXXXXXXXXXDQSNSVAV-SSALQ 353
           E+A++Q + +   KA+SESDSK +TGD SP  +              +SNSV V SSALQ
Sbjct: 191 ESARSQPQTV--AKASSESDSKAVTGDSSPPAAVATPPPPPAPPASPKSNSVVVSSSALQ 248

Query: 354 TQKLELLENPPQIIEEQQ---VXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVXXXXXXXX 524
           TQ  EL EN PQ+IEE Q                                          
Sbjct: 249 TQNPELPENSPQVIEETQANPAMSGSCSSSGTSTSTTSSTSNSNGGGSSSVFASLFASST 308

Query: 525 XXXXXXXXXXQPPVFTDLIRAMGCPDRPADLSAPPSSEAISLCLSTNHGSSIFGTGGQDH 704
                     Q P FTDLIRAMG PD PADLS P SSE ISLCL+TNHGSSIFGTG Q+ 
Sbjct: 309 AASATASLHSQTPAFTDLIRAMGHPDHPADLSRPSSSEPISLCLATNHGSSIFGTGLQEC 368

Query: 705 HQYAPSQQPPAMSATALLQKAAQMGATTTNASLLRGLGIM 824
            QYAP  Q PAMSATALLQKAAQMGA  TNAS LRGLGI+
Sbjct: 369 RQYAPPPQ-PAMSATALLQKAAQMGAAATNASFLRGLGIV 407


>dbj|BAU00111.1| hypothetical protein VIGAN_10167800 [Vigna angularis var.
           angularis]
          Length = 523

 Score =  252 bits (643), Expect = 7e-77
 Identities = 153/281 (54%), Positives = 169/281 (60%), Gaps = 7/281 (2%)
 Frame = +3

Query: 3   HGEXXXXXXXXXXXYAVQSDWKAHSKVCGSREYKCDYCGAVFSRRDSFITHRAFCDALAE 182
           HGE           YAVQSDWKAHSK+CG+REYKCD CG +FSRRDSFITHRAFCDALAE
Sbjct: 132 HGEKKWKCDKCSKKYAVQSDWKAHSKICGTREYKCD-CGTLFSRRDSFITHRAFCDALAE 190

Query: 183 ENAKAQTKNMMAVKANSESDSKGLTGDDSP----SKXXXXXXXXXXXXDQSNSVAVSSA- 347
           E+A++Q +   A KA+SESDSK +TGD SP    +              +SNSV VSS+ 
Sbjct: 191 ESARSQPQT--AAKASSESDSKAVTGDSSPPAAAAAATPPPPSAPPASSKSNSVVVSSSV 248

Query: 348 LQTQKLELLENPPQIIEEQQV--XXXXXXXXXXXXXXXXXXXXXXXXXXXXXVXXXXXXX 521
           LQT   EL EN PQ+IEE Q                                V       
Sbjct: 249 LQTPNPELPENSPQVIEEPQANPAVSGSCSGTSTSTSTTSSTSNSNGAASSSVFASLFAS 308

Query: 522 XXXXXXXXXXXQPPVFTDLIRAMGCPDRPADLSAPPSSEAISLCLSTNHGSSIFGTGGQD 701
                      Q P FTDLIRAMG PD PADLS P SSE ISLCL+TNHGSSIFGTG Q+
Sbjct: 309 STASATASLQSQTPAFTDLIRAMGHPDHPADLSRPSSSEPISLCLATNHGSSIFGTGLQE 368

Query: 702 HHQYAPSQQPPAMSATALLQKAAQMGATTTNASLLRGLGIM 824
             QYAP  Q PAMSATALLQKAAQMGA  TNAS LRGLGI+
Sbjct: 369 CRQYAPPPQ-PAMSATALLQKAAQMGAAATNASFLRGLGIV 408


>ref|XP_014520893.1| protein indeterminate-domain 2 [Vigna radiata var. radiata]
          Length = 522

 Score =  249 bits (637), Expect = 6e-76
 Identities = 151/281 (53%), Positives = 169/281 (60%), Gaps = 7/281 (2%)
 Frame = +3

Query: 3   HGEXXXXXXXXXXXYAVQSDWKAHSKVCGSREYKCDYCGAVFSRRDSFITHRAFCDALAE 182
           HGE           YAVQSDWKAHSK+CG+REYKCD CG +FSRRDSFITHRAFCDALAE
Sbjct: 132 HGEKKWKCDKCSKKYAVQSDWKAHSKICGTREYKCD-CGTLFSRRDSFITHRAFCDALAE 190

Query: 183 ENAKAQTKNMMAVKANSESDSKGLTGDDSP----SKXXXXXXXXXXXXDQSNSVAVSSA- 347
           E+A++Q + +   KA+SESDSK +TGD SP    +              +SNSV VSS+ 
Sbjct: 191 ESARSQPQTV--AKASSESDSKAVTGDSSPPAAAAAATPPPPSAPPASSKSNSVVVSSSV 248

Query: 348 LQTQKLELLENPPQIIEEQQV--XXXXXXXXXXXXXXXXXXXXXXXXXXXXXVXXXXXXX 521
           LQT   EL EN PQ+IEE Q                                V       
Sbjct: 249 LQTPNPELPENSPQVIEEPQANPAVSGSCSGTSTSTSTTSSTSNSNGGASSSVFASLFAS 308

Query: 522 XXXXXXXXXXXQPPVFTDLIRAMGCPDRPADLSAPPSSEAISLCLSTNHGSSIFGTGGQD 701
                      Q P FTDLIRAMG PD PADLS P +SE ISLCL+TNHGSSIFGTG Q+
Sbjct: 309 STASATASLQSQTPAFTDLIRAMGHPDHPADLSRPSASEPISLCLATNHGSSIFGTGLQE 368

Query: 702 HHQYAPSQQPPAMSATALLQKAAQMGATTTNASLLRGLGIM 824
             QYAP  Q PAMSATALLQKAAQMGA  TNAS LRGLGI+
Sbjct: 369 CRQYAPPPQ-PAMSATALLQKAAQMGAAATNASFLRGLGIV 408


>gb|KYP66733.1| Zinc finger protein MAGPIE [Cajanus cajan]
          Length = 486

 Score =  248 bits (632), Expect = 1e-75
 Identities = 152/288 (52%), Positives = 164/288 (56%), Gaps = 4/288 (1%)
 Frame = +3

Query: 3   HGEXXXXXXXXXXXYAVQSDWKAHSKVCGSREYKCDYCGAVFSRRDSFITHRAFCDALAE 182
           HGE           YAVQSDWKAHSK+CG+REYKCD CG +FSRRDSFITHRAFCDALAE
Sbjct: 132 HGEKKWKCDKCSKKYAVQSDWKAHSKICGTREYKCD-CGTLFSRRDSFITHRAFCDALAE 190

Query: 183 ENAKAQTKNMMAVKANSESDSKGLTGDD----SPSKXXXXXXXXXXXXDQSNSVAVSSAL 350
           E+A++Q     A KA+SESDSK +TGD      P+              QS SV V   L
Sbjct: 191 ESARSQPPT--AAKASSESDSKAVTGDSPPPPPPTAGAVAAPPPLPPSSQSTSVVV---L 245

Query: 351 QTQKLELLENPPQIIEEQQVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVXXXXXXXXXX 530
           QTQ  EL EN PQI+EE Q                                         
Sbjct: 246 QTQNPELPENSPQIVEEPQAN-----------------------------TAMNGSCSST 276

Query: 531 XXXXXXXXQPPVFTDLIRAMGCPDRPADLSAPPSSEAISLCLSTNHGSSIFGTGGQDHHQ 710
                       FTDLIRAMG PD P DLS P SSE ISLCLSTNHGSSIFGTGGQ+  Q
Sbjct: 277 STTSSTSNSNSAFTDLIRAMGHPDHPGDLSRPSSSEPISLCLSTNHGSSIFGTGGQERRQ 336

Query: 711 YAPSQQPPAMSATALLQKAAQMGATTTNASLLRGLGIMXXXXXXXXGQ 854
           YAP  Q PAMSATALLQKAAQMGAT TNAS LRGLGI+        GQ
Sbjct: 337 YAPPPQ-PAMSATALLQKAAQMGATATNASFLRGLGIVSSSASTSSGQ 383


>ref|XP_015964045.2| LOW QUALITY PROTEIN: protein indeterminate-domain 2-like [Arachis
            duranensis]
          Length = 546

 Score =  241 bits (616), Expect = 1e-72
 Identities = 147/304 (48%), Positives = 163/304 (53%), Gaps = 30/304 (9%)
 Frame = +3

Query: 3    HGEXXXXXXXXXXXYAVQSDWKAHSKVCGSREYKCDYCGAVFSRRDSFITHRAFCDALAE 182
            HGE           YAVQSDWKAHSK+CG+REYKCD CG +FSRRDSFITHRAFCDALAE
Sbjct: 138  HGEKKWKCDKCSKKYAVQSDWKAHSKICGTREYKCD-CGTLFSRRDSFITHRAFCDALAE 196

Query: 183  ENAKAQT-----KNMMAVKANSESDSKGLTGDDS-------------------------P 272
            E+A++QT     ++ + VK +SESDSK +  + S                         P
Sbjct: 197  ESARSQTHSQTTQSQIGVKVSSESDSKAVNAESSSPQPTPPPPATTPAPAPPPPPPPVQP 256

Query: 273  SKXXXXXXXXXXXXDQSNSVAVSSALQTQKLELLENPPQIIEEQQVXXXXXXXXXXXXXX 452
                           Q NSV V   LQTQ  EL EN PQIIEE Q               
Sbjct: 257  EAPPLPATTTTTTTTQPNSVVVPLVLQTQNPELPENSPQIIEEPQANTALNGSCSSSTSS 316

Query: 453  XXXXXXXXXXXXXXXVXXXXXXXXXXXXXXXXXXQPPVFTDLIRAMGCPDRPADLSAPPS 632
                                               PP FTDL+RAMG PD P DL  P S
Sbjct: 317  TSNGGTSSSVFASLFASSTASGNLQSQT-------PPAFTDLVRAMGPPDHPTDLPGPSS 369

Query: 633  SEAISLCLSTNHGSSIFGTGGQDHHQYAPSQQPPAMSATALLQKAAQMGATTTNASLLRG 812
            SE ISLCL+TNHGSSIFGTGGQ+  QYAP  Q P MSATALLQKAAQMGA  TNASLLRG
Sbjct: 370  SEPISLCLATNHGSSIFGTGGQERRQYAPPPQ-PTMSATALLQKAAQMGAAATNASLLRG 428

Query: 813  LGIM 824
            LGI+
Sbjct: 429  LGIV 432


>gb|OMO56870.1| hypothetical protein CCACVL1_26206 [Corchorus capsularis]
          Length = 526

 Score =  230 bits (586), Expect = 2e-68
 Identities = 141/288 (48%), Positives = 160/288 (55%), Gaps = 14/288 (4%)
 Frame = +3

Query: 3   HGEXXXXXXXXXXXYAVQSDWKAHSKVCGSREYKCDYCGAVFSRRDSFITHRAFCDALAE 182
           HGE           YAVQSDWKAHSK+CG+REYKCD CG +FSRRDSFITHRAFCDALAE
Sbjct: 132 HGEKKWKCDKCSKKYAVQSDWKAHSKICGTREYKCD-CGTLFSRRDSFITHRAFCDALAE 190

Query: 183 ENAKAQTK------NMMAVKANSESDSKGLTGDDS--------PSKXXXXXXXXXXXXDQ 320
           E+A+AQT+      N  A   +SESD K    + S        P+               
Sbjct: 191 ESARAQTQPSSQNQNQAAANPSSESDPKTQAMESSSPPAPPPAPASVSAPPPAAVQVSAS 250

Query: 321 SNSVAVSSALQTQKLELLENPPQIIEEQQVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXV 500
           + SV  SS L  Q  EL ENP  I+EE                                V
Sbjct: 251 TTSVISSSVLPMQSGELQENPTPILEEDPPPPPPPAPAGLNGSCSSSNSSSSNGSSSSTV 310

Query: 501 XXXXXXXXXXXXXXXXXXQPPVFTDLIRAMGCPDRPADLSAPPSSEAISLCLSTNHGSSI 680
                             QPP FTD+IRAMG P+RPADL+   S+E ISLCLSTNHGSSI
Sbjct: 311 FASLFASSTASASLQPP-QPPAFTDVIRAMGRPERPADLAPSTSTEPISLCLSTNHGSSI 369

Query: 681 FGTGGQDHHQYAPSQQPPAMSATALLQKAAQMGATTTNASLLRGLGIM 824
           FGT GQ+  QYAP+ Q PAMSATALLQKAAQMGA  +NASLLRG GI+
Sbjct: 370 FGTAGQERRQYAPAPQ-PAMSATALLQKAAQMGAAASNASLLRGFGIV 416


>ref|XP_019444753.1| PREDICTED: protein indeterminate-domain 1 [Lupinus angustifolius]
 ref|XP_019444754.1| PREDICTED: protein indeterminate-domain 1 [Lupinus angustifolius]
 gb|OIW10994.1| hypothetical protein TanjilG_22801 [Lupinus angustifolius]
          Length = 505

 Score =  228 bits (581), Expect = 8e-68
 Identities = 143/283 (50%), Positives = 160/283 (56%), Gaps = 9/283 (3%)
 Frame = +3

Query: 3   HGEXXXXXXXXXXXYAVQSDWKAHSKVCGSREYKCDYCGAVFSRRDSFITHRAFCDALAE 182
           HGE           YAVQSDWKAHSK+CG+REYKCD CG VFSRRDSFITHRAFCDALAE
Sbjct: 130 HGEKKWKCEKCSKKYAVQSDWKAHSKICGTREYKCD-CGTVFSRRDSFITHRAFCDALAE 188

Query: 183 ENAKAQ-----TKNMMAVKANSESDSKGLTGDDSP----SKXXXXXXXXXXXXDQSNSVA 335
           E+A++Q     T+   A+KANS+SDSK +TGDDS                    QSNS A
Sbjct: 189 ESARSQPQSQTTQTQSAIKANSDSDSKAVTGDDSSPMEVPPLPPPSPPAPPAIPQSNSAA 248

Query: 336 VSSALQTQKLELLENPPQIIEEQQVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVXXXXX 515
           +S  L+ Q  EL EN PQ +EE Q                              V     
Sbjct: 249 LSD-LKVQNPELPENTPQSLEELQA-----------KNALNGSCSTSTNTTSNGVSVFAS 296

Query: 516 XXXXXXXXXXXXXQPPVFTDLIRAMGCPDRPADLSAPPSSEAISLCLSTNHGSSIFGTGG 695
                        Q P FTDLIRAMG PD   D+  P  S+ ISLCL    GSS+F TGG
Sbjct: 297 LFASSTTSENLQSQTPAFTDLIRAMGRPDHSVDIPGPSFSDPISLCL----GSSMFATGG 352

Query: 696 QDHHQYAPSQQPPAMSATALLQKAAQMGATTTNASLLRGLGIM 824
           Q+  QYAP  Q PAMSATALLQKAAQMGA  TNASLLRGLGI+
Sbjct: 353 QERRQYAPPPQ-PAMSATALLQKAAQMGAAATNASLLRGLGIV 394


>ref|XP_017985426.1| PREDICTED: protein indeterminate-domain 2 [Theobroma cacao]
          Length = 538

 Score =  229 bits (583), Expect = 9e-68
 Identities = 145/301 (48%), Positives = 158/301 (52%), Gaps = 27/301 (8%)
 Frame = +3

Query: 3    HGEXXXXXXXXXXXYAVQSDWKAHSKVCGSREYKCDYCGAVFSRRDSFITHRAFCDALAE 182
            HGE           YAVQSDWKAHSK+CG+REYKCD CG +FSRRDSFITHRAFCDALAE
Sbjct: 132  HGEKKWKCDKCSKKYAVQSDWKAHSKICGTREYKCD-CGTLFSRRDSFITHRAFCDALAE 190

Query: 183  ENAKAQT------KNMMAVKANSESDSKGLTGDDSP---------------------SKX 281
            E+A+AQT      +N      +SESD K    D S                      S  
Sbjct: 191  ESARAQTHPQPQNQNQAVANPSSESDPKVQAVDSSAPPAPAPTPAPAPASAPVQVSASAP 250

Query: 282  XXXXXXXXXXXDQSNSVAVSSALQTQKLELLENPPQIIEEQQVXXXXXXXXXXXXXXXXX 461
                        QS SV  SS L  +  EL ENP  I+EE  V                 
Sbjct: 251  APAPTPAAPTLPQSTSVISSSVLPIRSSELPENPTPIVEEAPVPAPAPAGLNGSCSTSTS 310

Query: 462  XXXXXXXXXXXXVXXXXXXXXXXXXXXXXXXQPPVFTDLIRAMGCPDRPADLSAPPSSEA 641
                                           QPP FTDLIRAMG PDRPADL+   S+E 
Sbjct: 311  SGSNGGSRSSVFASLFASSTASTSLQPP---QPPAFTDLIRAMGRPDRPADLAPSTSTEP 367

Query: 642  ISLCLSTNHGSSIFGTGGQDHHQYAPSQQPPAMSATALLQKAAQMGATTTNASLLRGLGI 821
            ISLCLSTNHGSSIFGT GQ+  QYAP  Q PAMSATALLQKAAQMGA  TNASLLRG GI
Sbjct: 368  ISLCLSTNHGSSIFGTAGQERRQYAPPPQ-PAMSATALLQKAAQMGAAATNASLLRGFGI 426

Query: 822  M 824
            +
Sbjct: 427  V 427


>gb|EOX91151.1| C2H2-like zinc finger protein [Theobroma cacao]
          Length = 534

 Score =  228 bits (582), Expect = 1e-67
 Identities = 144/297 (48%), Positives = 157/297 (52%), Gaps = 23/297 (7%)
 Frame = +3

Query: 3    HGEXXXXXXXXXXXYAVQSDWKAHSKVCGSREYKCDYCGAVFSRRDSFITHRAFCDALAE 182
            HGE           YAVQSDWKAHSK+CG+REYKCD CG +FSRRDSFITHRAFCDALAE
Sbjct: 132  HGEKKWKCDKCSKKYAVQSDWKAHSKICGTREYKCD-CGTLFSRRDSFITHRAFCDALAE 190

Query: 183  ENAKAQT------KNMMAVKANSESDSKGLTGDDSPSKXXXXXXXXXXXX---------- 314
            E+A+AQT      +N      +SESD K    D S                         
Sbjct: 191  ESARAQTHPQPQNQNQAVANPSSESDPKVQAVDSSAPPAPAPTPAPAPASAPVQVSASAP 250

Query: 315  -------DQSNSVAVSSALQTQKLELLENPPQIIEEQQVXXXXXXXXXXXXXXXXXXXXX 473
                    QS SV  SS L  +  EL ENP  I+EE  V                     
Sbjct: 251  APAAPTLPQSTSVISSSVLPIRSSELPENPTPIVEEAPVPAPAPAGLNGSCSTSTSSGSN 310

Query: 474  XXXXXXXXVXXXXXXXXXXXXXXXXXXQPPVFTDLIRAMGCPDRPADLSAPPSSEAISLC 653
                                       QPP FTDLIRAMG PDRPADL+   S+E ISLC
Sbjct: 311  GGSRSSVFASLFASSTASTSLQPP---QPPAFTDLIRAMGRPDRPADLAPSTSTEPISLC 367

Query: 654  LSTNHGSSIFGTGGQDHHQYAPSQQPPAMSATALLQKAAQMGATTTNASLLRGLGIM 824
            LSTNHGSSIFGT GQ+  QYAP  Q PAMSATALLQKAAQMGA  TNASLLRG GI+
Sbjct: 368  LSTNHGSSIFGTAGQERRQYAPPPQ-PAMSATALLQKAAQMGAAATNASLLRGFGIV 423


>ref|XP_021273679.1| LOW QUALITY PROTEIN: protein indeterminate-domain 2 [Herrania
            umbratica]
          Length = 538

 Score =  228 bits (582), Expect = 1e-67
 Identities = 144/299 (48%), Positives = 158/299 (52%), Gaps = 25/299 (8%)
 Frame = +3

Query: 3    HGEXXXXXXXXXXXYAVQSDWKAHSKVCGSREYKCDYCGAVFSRRDSFITHRAFCDALAE 182
            HGE           YAVQSDWKAHSK+CG+REYKCD CG +FSRRDSFITHRAFCDALAE
Sbjct: 132  HGEKKWKCDKCSKKYAVQSDWKAHSKICGTREYKCD-CGTLFSRRDSFITHRAFCDALAE 190

Query: 183  ENAKAQT------KNMMAVKANSESDSKGLTGDDSP-------------------SKXXX 287
            E+A+AQT      +N      +SESD K    D S                    S    
Sbjct: 191  ESARAQTHPQPQNQNQAVANPSSESDPKVQAVDSSAPPAPAPTPXSGTASAPVQVSASAP 250

Query: 288  XXXXXXXXXDQSNSVAVSSALQTQKLELLENPPQIIEEQQVXXXXXXXXXXXXXXXXXXX 467
                      QS SV  SS L  +  EL ENP  I+E+  V                   
Sbjct: 251  APTPAPPALPQSTSVISSSVLPIRSSELPENPTPIVEDAPVPAPAPAGLNGSCSTSTSSG 310

Query: 468  XXXXXXXXXXVXXXXXXXXXXXXXXXXXXQPPVFTDLIRAMGCPDRPADLSAPPSSEAIS 647
                                         QPP FTDLIRAMG PDRPADL+   S+E IS
Sbjct: 311  SNGGSRSSVFASLFASSTASASLQPP---QPPAFTDLIRAMGRPDRPADLAPSTSTEPIS 367

Query: 648  LCLSTNHGSSIFGTGGQDHHQYAPSQQPPAMSATALLQKAAQMGATTTNASLLRGLGIM 824
            LCLSTNHGSSIFGT GQ+  QYAP  Q PAMSATALLQKAAQMGA  TNASLLRG GI+
Sbjct: 368  LCLSTNHGSSIFGTAGQERRQYAPPPQ-PAMSATALLQKAAQMGAAATNASLLRGFGIV 425


>gb|KHN44500.1| Zinc finger protein MAGPIE [Glycine soja]
          Length = 507

 Score =  226 bits (577), Expect = 3e-67
 Identities = 146/278 (52%), Positives = 158/278 (56%), Gaps = 4/278 (1%)
 Frame = +3

Query: 3   HGEXXXXXXXXXXXYAVQSDWKAHSKVCGSREYKCDYCGAVFSRRDSFITHRAFCDALAE 182
           HGE           YAVQSDWKAHSKVCG+REYKCD CG VFSRRDSFITHRAFCD LAE
Sbjct: 131 HGEKKWKCDKCSKKYAVQSDWKAHSKVCGTREYKCD-CGTVFSRRDSFITHRAFCDVLAE 189

Query: 183 ENAKAQTKNMMAVKANSESDSK--GLTGDDSPSKXXXXXXXXXXXXDQSNSVAVSSALQT 356
           EN ++       VK NSE+DSK   LTGD  P +             Q+NS A+S  LQT
Sbjct: 190 ENVRSHA----VVKDNSENDSKVLTLTGDSPPLQ------PVSATTTQTNS-AMSCGLQT 238

Query: 357 QKLELLE-NPPQIIEEQQVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVXXXXXXXXXXX 533
           Q LEL E NPPQ+IEE+                                           
Sbjct: 239 QNLELPETNPPQVIEEEPQGATAVSGSCGSNSTCSTSNGGATSNSNSSSSVFAGLFASST 298

Query: 534 XXXXXXXQPPVFTDLIRAMGCPDRPADL-SAPPSSEAISLCLSTNHGSSIFGTGGQDHHQ 710
                  Q P F+DLIRAMG P+ PADL SAP SSEAISLCLST   S IF TGGQ   Q
Sbjct: 299 ASGSLQSQTPAFSDLIRAMGPPEHPADLISAPSSSEAISLCLSTTSASPIFATGGQ---Q 355

Query: 711 YAPSQQPPAMSATALLQKAAQMGATTTNASLLRGLGIM 824
           Y  S   PAMSATALLQKAAQMGA  TNASLLRG GI+
Sbjct: 356 YVSSPPQPAMSATALLQKAAQMGAAATNASLLRGFGIV 393


>ref|XP_003547992.1| PREDICTED: protein indeterminate-domain 2-like [Glycine max]
 gb|KRH08297.1| hypothetical protein GLYMA_16G141100 [Glycine max]
 gb|KRH08298.1| hypothetical protein GLYMA_16G141100 [Glycine max]
          Length = 511

 Score =  226 bits (577), Expect = 4e-67
 Identities = 146/278 (52%), Positives = 158/278 (56%), Gaps = 4/278 (1%)
 Frame = +3

Query: 3   HGEXXXXXXXXXXXYAVQSDWKAHSKVCGSREYKCDYCGAVFSRRDSFITHRAFCDALAE 182
           HGE           YAVQSDWKAHSKVCG+REYKCD CG VFSRRDSFITHRAFCD LAE
Sbjct: 131 HGEKKWKCDKCSKKYAVQSDWKAHSKVCGTREYKCD-CGTVFSRRDSFITHRAFCDVLAE 189

Query: 183 ENAKAQTKNMMAVKANSESDSK--GLTGDDSPSKXXXXXXXXXXXXDQSNSVAVSSALQT 356
           EN ++       VK NSE+DSK   LTGD  P +             Q+NS A+S  LQT
Sbjct: 190 ENVRSHA----VVKDNSENDSKVLTLTGDSPPLQ--PVSATVATTTTQTNS-AMSCGLQT 242

Query: 357 QKLELLE-NPPQIIEEQQVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVXXXXXXXXXXX 533
           Q LEL E NPPQ+IEE+                                           
Sbjct: 243 QNLELPETNPPQVIEEEPQGATAVSGSCGSNSTCSTSNGGATSNSNSSSSVFAGLFASST 302

Query: 534 XXXXXXXQPPVFTDLIRAMGCPDRPADL-SAPPSSEAISLCLSTNHGSSIFGTGGQDHHQ 710
                  Q P F+DLIRAMG P+ PADL SAP SSEAISLCLST   S IF TGGQ   Q
Sbjct: 303 ASGSLQSQTPAFSDLIRAMGPPEHPADLISAPSSSEAISLCLSTTSASPIFATGGQ---Q 359

Query: 711 YAPSQQPPAMSATALLQKAAQMGATTTNASLLRGLGIM 824
           Y  S   PAMSATALLQKAAQMGA  TNASLLRG GI+
Sbjct: 360 YVSSPPQPAMSATALLQKAAQMGAAATNASLLRGFGIV 397


>ref|XP_022740965.1| protein indeterminate-domain 2-like [Durio zibethinus]
          Length = 543

 Score =  225 bits (574), Expect = 2e-66
 Identities = 142/299 (47%), Positives = 156/299 (52%), Gaps = 25/299 (8%)
 Frame = +3

Query: 3    HGEXXXXXXXXXXXYAVQSDWKAHSKVCGSREYKCDYCGAVFSRRDSFITHRAFCDALAE 182
            HGE           YAVQSDWKAHSK+CG+REYKCD CG +FSRRDSF+THRAFCDALAE
Sbjct: 132  HGEKKWKCDKCSKKYAVQSDWKAHSKICGTREYKCD-CGTLFSRRDSFVTHRAFCDALAE 190

Query: 183  ENAKAQTK------NMMAVKANSESDSKGLTGDDSP-------------------SKXXX 287
            E+A+AQT+      N      +SESD K    D SP                   S    
Sbjct: 191  ESARAQTQPQPQNQNHPVANPSSESDLKVQAVDSSPLPAPAPVSSLAPAPAPVQVSAPAP 250

Query: 288  XXXXXXXXXDQSNSVAVSSALQTQKLELLENPPQIIEEQQVXXXXXXXXXXXXXXXXXXX 467
                      QS SV  +S L  Q  EL ENP  I+EE                      
Sbjct: 251  APTPAAPPRPQSTSVISTSVLPVQNCELPENPASIVEEALAPAPAPAGRNGSCSSSTSSG 310

Query: 468  XXXXXXXXXXVXXXXXXXXXXXXXXXXXXQPPVFTDLIRAMGCPDRPADLSAPPSSEAIS 647
                                         QPP FTDLIRAMG PD PADL+   S+E IS
Sbjct: 311  SNGSSSSSVFASLFASSAASVSLQPP---QPPAFTDLIRAMGRPDHPADLAPSTSNEPIS 367

Query: 648  LCLSTNHGSSIFGTGGQDHHQYAPSQQPPAMSATALLQKAAQMGATTTNASLLRGLGIM 824
            LCLSTNHGSSIFGT  Q+  QYAP  Q PAMSATALLQKAAQMGA  TNASLLRG GI+
Sbjct: 368  LCLSTNHGSSIFGTARQERRQYAPPPQ-PAMSATALLQKAAQMGAAATNASLLRGFGIV 425

