BLASTX nr result

ID: Astragalus24_contig00010508 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus24_contig00010508
         (1473 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|PNY02282.1| zinc finger protein, partial [Trifolium pratense]      401   e-134
ref|XP_012573710.1| PREDICTED: protein indeterminate-domain 1 [C...   387   e-126
ref|XP_003517239.1| PREDICTED: protein indeterminate-domain 2-li...   376   e-122
ref|XP_020217260.1| protein indeterminate-domain 1-like isoform ...   373   e-121
ref|XP_020217259.1| protein indeterminate-domain 2-like isoform ...   373   e-121
gb|KYP66733.1| Zinc finger protein MAGPIE [Cajanus cajan]             367   e-119
gb|KOM45124.1| hypothetical protein LR48_Vigan06g043000 [Vigna a...   364   e-118
ref|XP_017427555.1| PREDICTED: protein indeterminate-domain 2-li...   364   e-118
dbj|BAU00111.1| hypothetical protein VIGAN_10167800 [Vigna angul...   364   e-117
ref|XP_014520893.1| protein indeterminate-domain 2 [Vigna radiat...   362   e-117
ref|XP_007156918.1| hypothetical protein PHAVU_002G028100g [Phas...   362   e-116
ref|XP_015964045.2| LOW QUALITY PROTEIN: protein indeterminate-d...   354   e-113
ref|XP_019444753.1| PREDICTED: protein indeterminate-domain 1 [L...   346   e-110
ref|XP_007156028.1| hypothetical protein PHAVU_003G252400g [Phas...   335   e-106
gb|EOX91151.1| C2H2-like zinc finger protein [Theobroma cacao]        333   e-105
ref|XP_012079066.1| protein indeterminate-domain 2 [Jatropha cur...   332   e-105
ref|XP_017985426.1| PREDICTED: protein indeterminate-domain 2 [T...   332   e-104
gb|POO02574.1| TFIIH C1-like domain containing protein [Trema or...   330   e-104
ref|XP_021273679.1| LOW QUALITY PROTEIN: protein indeterminate-d...   329   e-103
gb|OMO56870.1| hypothetical protein CCACVL1_26206 [Corchorus cap...   327   e-103

>gb|PNY02282.1| zinc finger protein, partial [Trifolium pratense]
          Length = 392

 Score =  401 bits (1031), Expect = e-134
 Identities = 231/386 (59%), Positives = 247/386 (63%), Gaps = 9/386 (2%)
 Frame = +2

Query: 41   YAVQSDWKAHSKVCGSREYKCDCGTVFSRRDSFITHRAFCDALAEENAKAQTKTLTAGKA 220
            YAVQSDWKAHSKVCGSREYKCDCGTVFSRRDSFITHRAFCDALAEENAK+Q + +  GKA
Sbjct: 17   YAVQSDWKAHSKVCGSREYKCDCGTVFSRRDSFITHRAFCDALAEENAKSQNQAV--GKA 74

Query: 221  NSESDSKVLTGDXXXXXXXXXXXXXXXXXXXXXQSNSVVVSSALQTQKLELQENPPQIIE 400
            NSESDSKVLTGD                     QSNS V SSAL+ QKL+L ENPPQI+E
Sbjct: 75   NSESDSKVLTGDSLPVAPTPAAITTP-------QSNSGV-SSALENQKLDLPENPPQIVE 126

Query: 401  EPQVVT-------AAALNXXXXXXXXXXXXXXXXXXXXXXXVXXXXXXXXXXXXXXXXXX 559
            EP+ +        AAA+                        V                  
Sbjct: 127  EPEAIVTTTTAAAAAAVLNANCSSSSSTSSTSNGCAATSSGVFASLFASSTASASASMQS 186

Query: 560  QPPAFTDLIRAMGCPDRSVDLPAPPSSEAISLCLSTNHGSSFFGTGGQDHRQYAPSQQAP 739
              P FTDLIR+MGCPDRS D  APPSSEAISLCLSTN GSS FGTGGQ+ RQY P+ Q P
Sbjct: 187  HTPVFTDLIRSMGCPDRSTDFSAPPSSEAISLCLSTNPGSSIFGTGGQECRQYVPTHQPP 246

Query: 740  AMSATALLQKAAQMGAAATNASLLRGLGIVXXXXXXXXGLQNSLQWCQMQMEPESSASVP 919
            AMSATALLQKAAQMGAAATNASLLRGLGIV        G  +SL W   Q+EPE S   P
Sbjct: 247  AMSATALLQKAAQMGAAATNASLLRGLGIVSSSASTSSGQHDSLHWGLGQVEPEGSGLGP 306

Query: 920  AGLGLGLPCDGGSGLKELMLSTPSMFVPKQTTLDFXXXXXXXXXXXXXXXXXXITSI--D 1093
            AGLGLGLPCDG SGLKELML TPSMF PKQTTLDF                  ITSI   
Sbjct: 307  AGLGLGLPCDGDSGLKELMLGTPSMFGPKQTTLDFLGLGMAAGGSAGGGLSALITSIGGS 366

Query: 1094 SDMDITAAAASFGNGEFSSKDIGRRS 1171
              +D+TAA ASFGNGEFS KDIGR S
Sbjct: 367  GGLDVTAATASFGNGEFSGKDIGRNS 392


>ref|XP_012573710.1| PREDICTED: protein indeterminate-domain 1 [Cicer arietinum]
          Length = 521

 Score =  387 bits (994), Expect = e-126
 Identities = 236/392 (60%), Positives = 248/392 (63%), Gaps = 15/392 (3%)
 Frame = +2

Query: 41   YAVQSDWKAHSKVCGSREYKCDCGTVFSRRDSFITHRAFCDALAEENAKAQTKTLTAGKA 220
            YAVQSDWKAHSKVCGSREYKCDCGTVFSRRDSFITHRAFCDALAEENAK+QT     GKA
Sbjct: 145  YAVQSDWKAHSKVCGSREYKCDCGTVFSRRDSFITHRAFCDALAEENAKSQT----VGKA 200

Query: 221  NSESDSKVLTGDXXXXXXXXXXXXXXXXXXXXXQSNSVVVSSALQTQKLELQENPPQIIE 400
            NSESDSKVLTGD                     QSNSVV SS L+T K+E   NPPQIIE
Sbjct: 201  NSESDSKVLTGDSSPPSMPAATVTAATTA----QSNSVV-SSGLETHKIE---NPPQIIE 252

Query: 401  EPQVV----------TAAALNXXXXXXXXXXXXXXXXXXXXXXXVXXXXXXXXXXXXXXX 550
            EPQVV          T  ALN                       V               
Sbjct: 253  EPQVVVTTTTASTATTTNALNGSCSSNSASSTSNGGATTTSG--VFASLFASSTASTSAS 310

Query: 551  XXXQPPAFTDLIRAMGCPDRSVDLPAPPSSEAISLCLSTNHGSSFFGTGGQDHRQYAPSQ 730
               Q  AFTDLIR+MGCPDR  D  APP+SEAISLCLSTNHGSS FGTGGQ+ RQYAP+ 
Sbjct: 311  LQSQTLAFTDLIRSMGCPDRPADFSAPPTSEAISLCLSTNHGSSIFGTGGQECRQYAPTP 370

Query: 731  QAPAMSATALLQKAAQMGAAATNASLLRGLGIV-XXXXXXXXGLQNSLQWCQMQMEPESS 907
            Q PAMSATALLQKAAQMGAAATNASLLRGLGIV         G Q+ L W   Q+EPE S
Sbjct: 371  QPPAMSATALLQKAAQMGAAATNASLLRGLGIVSSSAPSSSSGQQDCLHWGLGQVEPEGS 430

Query: 908  ASVPAGLGLGLPCDGGSGLKELMLSTPSMFVPKQTTLDFXXXXXXXXXXXXXXXXXXITS 1087
            + VPAGLGLGLPCD  SGLKELML TPSMF PKQTTLDF                  ITS
Sbjct: 431  SLVPAGLGLGLPCDSDSGLKELMLGTPSMFGPKQTTLDFLGLGMAAGGSAGGGLSALITS 490

Query: 1088 I----DSDMDITAAAASFGNGEFSSKDIGRRS 1171
            I       +D+T AAASFGNGEFS KDIGR S
Sbjct: 491  IGGGGGGGLDVT-AAASFGNGEFSGKDIGRSS 521


>ref|XP_003517239.1| PREDICTED: protein indeterminate-domain 2-like [Glycine max]
 gb|KRH76829.1| hypothetical protein GLYMA_01G176600 [Glycine max]
          Length = 517

 Score =  376 bits (966), Expect = e-122
 Identities = 221/378 (58%), Positives = 242/378 (64%), Gaps = 1/378 (0%)
 Frame = +2

Query: 41   YAVQSDWKAHSKVCGSREYKCDCGTVFSRRDSFITHRAFCDALAEENAKAQTKTLTAGKA 220
            YAVQSDWKAHSK+CG+REYKCDCGT+FSRRDSFITHRAFCDALAEE+A++Q +T+   KA
Sbjct: 146  YAVQSDWKAHSKICGTREYKCDCGTLFSRRDSFITHRAFCDALAEESARSQPQTVA--KA 203

Query: 221  NSESDSKVLTGDXXXXXXXXXXXXXXXXXXXXXQSNSVVV-SSALQTQKLELQENPPQII 397
            +SESDSK +TGD                     QSNSVVV SS+LQTQK EL EN PQII
Sbjct: 204  SSESDSKAVTGDSSPPVAVEAPPPLVPPVSS--QSNSVVVPSSSLQTQKPELPENSPQII 261

Query: 398  EEPQVVTAAALNXXXXXXXXXXXXXXXXXXXXXXXVXXXXXXXXXXXXXXXXXXQPPAFT 577
            EEP+V TA                           V                  Q PAFT
Sbjct: 262  EEPKVNTAMN-GSCSSTSTSTTSSTSNSNSGASSSVFASLFASSSASATASLHSQTPAFT 320

Query: 578  DLIRAMGCPDRSVDLPAPPSSEAISLCLSTNHGSSFFGTGGQDHRQYAPSQQAPAMSATA 757
            DLIRAMG PD   DL  P SSE ISLCL+TNHGSS FGTG Q+ RQYAP  Q PAMSATA
Sbjct: 321  DLIRAMGHPDHPADLSRPSSSEPISLCLATNHGSSIFGTGRQERRQYAPPPQ-PAMSATA 379

Query: 758  LLQKAAQMGAAATNASLLRGLGIVXXXXXXXXGLQNSLQWCQMQMEPESSASVPAGLGLG 937
            LLQKAAQMGAAATNAS LRGLGIV          Q++LQW    +EPE SASVPAGLGLG
Sbjct: 380  LLQKAAQMGAAATNASFLRGLGIVSSSASTSSVQQDNLQWGHQPVEPE-SASVPAGLGLG 438

Query: 938  LPCDGGSGLKELMLSTPSMFVPKQTTLDFXXXXXXXXXXXXXXXXXXITSIDSDMDITAA 1117
            LPCD  SGLKELM+ TPSMF PKQTTLDF                  ITSI   +D+T A
Sbjct: 439  LPCDSSSGLKELMMGTPSMFGPKQTTLDFLGLGMAAGGTPGGGLSALITSIGGGLDVTTA 498

Query: 1118 AASFGNGEFSSKDIGRRS 1171
            AASF +GEF  KDIGRRS
Sbjct: 499  AASFASGEFPGKDIGRRS 516


>ref|XP_020217260.1| protein indeterminate-domain 1-like isoform X2 [Cajanus cajan]
          Length = 476

 Score =  373 bits (958), Expect = e-121
 Identities = 218/376 (57%), Positives = 236/376 (62%), Gaps = 1/376 (0%)
 Frame = +2

Query: 41   YAVQSDWKAHSKVCGSREYKCDCGTVFSRRDSFITHRAFCDALAEENAKAQTKTLTAGKA 220
            YAVQSDWKAHSK+CG+REYKCDCGT+FSRRDSFITHRAFCDALAEE+A++Q  T  A KA
Sbjct: 107  YAVQSDWKAHSKICGTREYKCDCGTLFSRRDSFITHRAFCDALAEESARSQPPT--AAKA 164

Query: 221  NSESDSKVLTGDXXXXXXXXXXXXXXXXXXXXX-QSNSVVVSSALQTQKLELQENPPQII 397
            +SESDSK +TGD                      QS SVVV   LQTQ  EL EN PQI+
Sbjct: 165  SSESDSKAVTGDSPPPPPPTAGAVAAPPPLPPSSQSTSVVV---LQTQNPELPENSPQIV 221

Query: 398  EEPQVVTAAALNXXXXXXXXXXXXXXXXXXXXXXXVXXXXXXXXXXXXXXXXXXQPPAFT 577
            EEPQ  TA                                              Q PAFT
Sbjct: 222  EEPQANTAMN-GSCSSTSTTSSTSNSNSGTGSSVFASLFASSTASGTASLSLQSQTPAFT 280

Query: 578  DLIRAMGCPDRSVDLPAPPSSEAISLCLSTNHGSSFFGTGGQDHRQYAPSQQAPAMSATA 757
            DLIRAMG PD   DL  P SSE ISLCLSTNHGSS FGTGGQ+ RQYAP  Q PAMSATA
Sbjct: 281  DLIRAMGHPDHPGDLSRPSSSEPISLCLSTNHGSSIFGTGGQERRQYAPPPQ-PAMSATA 339

Query: 758  LLQKAAQMGAAATNASLLRGLGIVXXXXXXXXGLQNSLQWCQMQMEPESSASVPAGLGLG 937
            LLQKAAQMGA ATNAS LRGLGIV        G Q+S+QW Q  +EPE + SVPAGLGLG
Sbjct: 340  LLQKAAQMGATATNASFLRGLGIVSSSASTSSGQQDSMQWGQQPVEPEGT-SVPAGLGLG 398

Query: 938  LPCDGGSGLKELMLSTPSMFVPKQTTLDFXXXXXXXXXXXXXXXXXXITSIDSDMDITAA 1117
            LPCDG SGLKELM+ TPSMF PKQTTLDF                  ITSI   +D+TAA
Sbjct: 399  LPCDGSSGLKELMMGTPSMFGPKQTTLDFLGLGMAAGGTPGGGLSALITSIGGGLDVTAA 458

Query: 1118 AASFGNGEFSSKDIGR 1165
            AASFG+GEF  KDIGR
Sbjct: 459  AASFGSGEFPGKDIGR 474


>ref|XP_020217259.1| protein indeterminate-domain 2-like isoform X1 [Cajanus cajan]
          Length = 515

 Score =  373 bits (958), Expect = e-121
 Identities = 218/376 (57%), Positives = 236/376 (62%), Gaps = 1/376 (0%)
 Frame = +2

Query: 41   YAVQSDWKAHSKVCGSREYKCDCGTVFSRRDSFITHRAFCDALAEENAKAQTKTLTAGKA 220
            YAVQSDWKAHSK+CG+REYKCDCGT+FSRRDSFITHRAFCDALAEE+A++Q  T  A KA
Sbjct: 146  YAVQSDWKAHSKICGTREYKCDCGTLFSRRDSFITHRAFCDALAEESARSQPPT--AAKA 203

Query: 221  NSESDSKVLTGDXXXXXXXXXXXXXXXXXXXXX-QSNSVVVSSALQTQKLELQENPPQII 397
            +SESDSK +TGD                      QS SVVV   LQTQ  EL EN PQI+
Sbjct: 204  SSESDSKAVTGDSPPPPPPTAGAVAAPPPLPPSSQSTSVVV---LQTQNPELPENSPQIV 260

Query: 398  EEPQVVTAAALNXXXXXXXXXXXXXXXXXXXXXXXVXXXXXXXXXXXXXXXXXXQPPAFT 577
            EEPQ  TA                                              Q PAFT
Sbjct: 261  EEPQANTAMN-GSCSSTSTTSSTSNSNSGTGSSVFASLFASSTASGTASLSLQSQTPAFT 319

Query: 578  DLIRAMGCPDRSVDLPAPPSSEAISLCLSTNHGSSFFGTGGQDHRQYAPSQQAPAMSATA 757
            DLIRAMG PD   DL  P SSE ISLCLSTNHGSS FGTGGQ+ RQYAP  Q PAMSATA
Sbjct: 320  DLIRAMGHPDHPGDLSRPSSSEPISLCLSTNHGSSIFGTGGQERRQYAPPPQ-PAMSATA 378

Query: 758  LLQKAAQMGAAATNASLLRGLGIVXXXXXXXXGLQNSLQWCQMQMEPESSASVPAGLGLG 937
            LLQKAAQMGA ATNAS LRGLGIV        G Q+S+QW Q  +EPE + SVPAGLGLG
Sbjct: 379  LLQKAAQMGATATNASFLRGLGIVSSSASTSSGQQDSMQWGQQPVEPEGT-SVPAGLGLG 437

Query: 938  LPCDGGSGLKELMLSTPSMFVPKQTTLDFXXXXXXXXXXXXXXXXXXITSIDSDMDITAA 1117
            LPCDG SGLKELM+ TPSMF PKQTTLDF                  ITSI   +D+TAA
Sbjct: 438  LPCDGSSGLKELMMGTPSMFGPKQTTLDFLGLGMAAGGTPGGGLSALITSIGGGLDVTAA 497

Query: 1118 AASFGNGEFSSKDIGR 1165
            AASFG+GEF  KDIGR
Sbjct: 498  AASFGSGEFPGKDIGR 513


>gb|KYP66733.1| Zinc finger protein MAGPIE [Cajanus cajan]
          Length = 486

 Score =  367 bits (943), Expect = e-119
 Identities = 216/376 (57%), Positives = 234/376 (62%), Gaps = 1/376 (0%)
 Frame = +2

Query: 41   YAVQSDWKAHSKVCGSREYKCDCGTVFSRRDSFITHRAFCDALAEENAKAQTKTLTAGKA 220
            YAVQSDWKAHSK+CG+REYKCDCGT+FSRRDSFITHRAFCDALAEE+A++Q  T  A KA
Sbjct: 146  YAVQSDWKAHSKICGTREYKCDCGTLFSRRDSFITHRAFCDALAEESARSQPPT--AAKA 203

Query: 221  NSESDSKVLTGDXXXXXXXXXXXXXXXXXXXXX-QSNSVVVSSALQTQKLELQENPPQII 397
            +SESDSK +TGD                      QS SVVV   LQTQ  EL EN PQI+
Sbjct: 204  SSESDSKAVTGDSPPPPPPTAGAVAAPPPLPPSSQSTSVVV---LQTQNPELPENSPQIV 260

Query: 398  EEPQVVTAAALNXXXXXXXXXXXXXXXXXXXXXXXVXXXXXXXXXXXXXXXXXXQPPAFT 577
            EEPQ  TA                                                 AFT
Sbjct: 261  EEPQANTAM------------------------------NGSCSSTSTTSSTSNSNSAFT 290

Query: 578  DLIRAMGCPDRSVDLPAPPSSEAISLCLSTNHGSSFFGTGGQDHRQYAPSQQAPAMSATA 757
            DLIRAMG PD   DL  P SSE ISLCLSTNHGSS FGTGGQ+ RQYAP  Q PAMSATA
Sbjct: 291  DLIRAMGHPDHPGDLSRPSSSEPISLCLSTNHGSSIFGTGGQERRQYAPPPQ-PAMSATA 349

Query: 758  LLQKAAQMGAAATNASLLRGLGIVXXXXXXXXGLQNSLQWCQMQMEPESSASVPAGLGLG 937
            LLQKAAQMGA ATNAS LRGLGIV        G Q+S+QW Q  +EPE + SVPAGLGLG
Sbjct: 350  LLQKAAQMGATATNASFLRGLGIVSSSASTSSGQQDSMQWGQQPVEPEGT-SVPAGLGLG 408

Query: 938  LPCDGGSGLKELMLSTPSMFVPKQTTLDFXXXXXXXXXXXXXXXXXXITSIDSDMDITAA 1117
            LPCDG SGLKELM+ TPSMF PKQTTLDF                  ITSI   +D+TAA
Sbjct: 409  LPCDGSSGLKELMMGTPSMFGPKQTTLDFLGLGMAAGGTPGGGLSALITSIGGGLDVTAA 468

Query: 1118 AASFGNGEFSSKDIGR 1165
            AASFG+GEF  KDIGR
Sbjct: 469  AASFGSGEFPGKDIGR 484


>gb|KOM45124.1| hypothetical protein LR48_Vigan06g043000 [Vigna angularis]
          Length = 468

 Score =  364 bits (935), Expect = e-118
 Identities = 220/381 (57%), Positives = 241/381 (63%), Gaps = 6/381 (1%)
 Frame = +2

Query: 41   YAVQSDWKAHSKVCGSREYKCDCGTVFSRRDSFITHRAFCDALAEENAKAQTKTLTAGKA 220
            YAVQSDWKAHSK+CG+REYKCDCGT+FSRRDSFITHRAFCDALAEE+A++Q +T  A KA
Sbjct: 91   YAVQSDWKAHSKICGTREYKCDCGTLFSRRDSFITHRAFCDALAEESARSQPQT--AAKA 148

Query: 221  NSESDSKVLTGDXXXXXXXXXXXXXXXXXXXXX-QSNSVVVSSA-LQTQKLELQENPPQI 394
            +SESDSK +TGD                      +SNSVVVSS+ LQT   EL EN PQ+
Sbjct: 149  SSESDSKAVTGDSSPPAAAAAATPPPPSAPPASSKSNSVVVSSSVLQTPNPELPENSPQV 208

Query: 395  IEEPQVVTAAALNXXXXXXXXXXXXXXXXXXXXXXX-VXXXXXXXXXXXXXXXXXXQPPA 571
            IEEPQ   A + +                        V                  Q PA
Sbjct: 209  IEEPQANPAVSGSCSGTSTSTSTTSSTSNSNGAASSSVFASLFASSTASATASLQSQTPA 268

Query: 572  FTDLIRAMGCPDRSVDLPAPPSSEAISLCLSTNHGSSFFGTGGQDHRQYAPSQQAPAMSA 751
            FTDLIRAMG PD   DL  P SSE ISLCL+TNHGSS FGTG Q+ RQYAP  Q PAMSA
Sbjct: 269  FTDLIRAMGHPDHPADLSRPSSSEPISLCLATNHGSSIFGTGLQECRQYAPPPQ-PAMSA 327

Query: 752  TALLQKAAQMGAAATNASLLRGLGIVXXXXXXXXGLQNSLQWCQMQMEPESSASVPAGLG 931
            TALLQKAAQMGAAATNAS LRGLGIV        G Q+SLQW Q   EPE  ASVPAGLG
Sbjct: 328  TALLQKAAQMGAAATNASFLRGLGIV-SSASTSSGQQDSLQWGQQPGEPE-GASVPAGLG 385

Query: 932  LGLPCDGGSGLKELMLSTPSMFVPKQTTLDFXXXXXXXXXXXXXXXXXXITSIDSDMDIT 1111
            LGLPCDG SGLKELM+ TPS+F PKQTTLDF                  ITSI   +D+T
Sbjct: 386  LGLPCDGSSGLKELMMGTPSVFGPKQTTLDFLGLGMAAGGNPGGGLSALITSIGGSLDVT 445

Query: 1112 ---AAAASFGNGEFSSKDIGR 1165
               AAAASFGNGEF  KDIGR
Sbjct: 446  AAAAAAASFGNGEFPGKDIGR 466


>ref|XP_017427555.1| PREDICTED: protein indeterminate-domain 2-like [Vigna angularis]
          Length = 497

 Score =  364 bits (935), Expect = e-118
 Identities = 220/381 (57%), Positives = 241/381 (63%), Gaps = 6/381 (1%)
 Frame = +2

Query: 41   YAVQSDWKAHSKVCGSREYKCDCGTVFSRRDSFITHRAFCDALAEENAKAQTKTLTAGKA 220
            YAVQSDWKAHSK+CG+REYKCDCGT+FSRRDSFITHRAFCDALAEE+A++Q +T  A KA
Sbjct: 120  YAVQSDWKAHSKICGTREYKCDCGTLFSRRDSFITHRAFCDALAEESARSQPQT--AAKA 177

Query: 221  NSESDSKVLTGDXXXXXXXXXXXXXXXXXXXXX-QSNSVVVSSA-LQTQKLELQENPPQI 394
            +SESDSK +TGD                      +SNSVVVSS+ LQT   EL EN PQ+
Sbjct: 178  SSESDSKAVTGDSSPPAAAAAATPPPPSAPPASSKSNSVVVSSSVLQTPNPELPENSPQV 237

Query: 395  IEEPQVVTAAALNXXXXXXXXXXXXXXXXXXXXXXX-VXXXXXXXXXXXXXXXXXXQPPA 571
            IEEPQ   A + +                        V                  Q PA
Sbjct: 238  IEEPQANPAVSGSCSGTSTSTSTTSSTSNSNGAASSSVFASLFASSTASATASLQSQTPA 297

Query: 572  FTDLIRAMGCPDRSVDLPAPPSSEAISLCLSTNHGSSFFGTGGQDHRQYAPSQQAPAMSA 751
            FTDLIRAMG PD   DL  P SSE ISLCL+TNHGSS FGTG Q+ RQYAP  Q PAMSA
Sbjct: 298  FTDLIRAMGHPDHPADLSRPSSSEPISLCLATNHGSSIFGTGLQECRQYAPPPQ-PAMSA 356

Query: 752  TALLQKAAQMGAAATNASLLRGLGIVXXXXXXXXGLQNSLQWCQMQMEPESSASVPAGLG 931
            TALLQKAAQMGAAATNAS LRGLGIV        G Q+SLQW Q   EPE  ASVPAGLG
Sbjct: 357  TALLQKAAQMGAAATNASFLRGLGIV-SSASTSSGQQDSLQWGQQPGEPE-GASVPAGLG 414

Query: 932  LGLPCDGGSGLKELMLSTPSMFVPKQTTLDFXXXXXXXXXXXXXXXXXXITSIDSDMDIT 1111
            LGLPCDG SGLKELM+ TPS+F PKQTTLDF                  ITSI   +D+T
Sbjct: 415  LGLPCDGSSGLKELMMGTPSVFGPKQTTLDFLGLGMAAGGNPGGGLSALITSIGGSLDVT 474

Query: 1112 ---AAAASFGNGEFSSKDIGR 1165
               AAAASFGNGEF  KDIGR
Sbjct: 475  AAAAAAASFGNGEFPGKDIGR 495


>dbj|BAU00111.1| hypothetical protein VIGAN_10167800 [Vigna angularis var. angularis]
          Length = 523

 Score =  364 bits (935), Expect = e-117
 Identities = 220/381 (57%), Positives = 241/381 (63%), Gaps = 6/381 (1%)
 Frame = +2

Query: 41   YAVQSDWKAHSKVCGSREYKCDCGTVFSRRDSFITHRAFCDALAEENAKAQTKTLTAGKA 220
            YAVQSDWKAHSK+CG+REYKCDCGT+FSRRDSFITHRAFCDALAEE+A++Q +T  A KA
Sbjct: 146  YAVQSDWKAHSKICGTREYKCDCGTLFSRRDSFITHRAFCDALAEESARSQPQT--AAKA 203

Query: 221  NSESDSKVLTGDXXXXXXXXXXXXXXXXXXXXX-QSNSVVVSSA-LQTQKLELQENPPQI 394
            +SESDSK +TGD                      +SNSVVVSS+ LQT   EL EN PQ+
Sbjct: 204  SSESDSKAVTGDSSPPAAAAAATPPPPSAPPASSKSNSVVVSSSVLQTPNPELPENSPQV 263

Query: 395  IEEPQVVTAAALNXXXXXXXXXXXXXXXXXXXXXXX-VXXXXXXXXXXXXXXXXXXQPPA 571
            IEEPQ   A + +                        V                  Q PA
Sbjct: 264  IEEPQANPAVSGSCSGTSTSTSTTSSTSNSNGAASSSVFASLFASSTASATASLQSQTPA 323

Query: 572  FTDLIRAMGCPDRSVDLPAPPSSEAISLCLSTNHGSSFFGTGGQDHRQYAPSQQAPAMSA 751
            FTDLIRAMG PD   DL  P SSE ISLCL+TNHGSS FGTG Q+ RQYAP  Q PAMSA
Sbjct: 324  FTDLIRAMGHPDHPADLSRPSSSEPISLCLATNHGSSIFGTGLQECRQYAPPPQ-PAMSA 382

Query: 752  TALLQKAAQMGAAATNASLLRGLGIVXXXXXXXXGLQNSLQWCQMQMEPESSASVPAGLG 931
            TALLQKAAQMGAAATNAS LRGLGIV        G Q+SLQW Q   EPE  ASVPAGLG
Sbjct: 383  TALLQKAAQMGAAATNASFLRGLGIV-SSASTSSGQQDSLQWGQQPGEPE-GASVPAGLG 440

Query: 932  LGLPCDGGSGLKELMLSTPSMFVPKQTTLDFXXXXXXXXXXXXXXXXXXITSIDSDMDIT 1111
            LGLPCDG SGLKELM+ TPS+F PKQTTLDF                  ITSI   +D+T
Sbjct: 441  LGLPCDGSSGLKELMMGTPSVFGPKQTTLDFLGLGMAAGGNPGGGLSALITSIGGSLDVT 500

Query: 1112 ---AAAASFGNGEFSSKDIGR 1165
               AAAASFGNGEF  KDIGR
Sbjct: 501  AAAAAAASFGNGEFPGKDIGR 521


>ref|XP_014520893.1| protein indeterminate-domain 2 [Vigna radiata var. radiata]
          Length = 522

 Score =  362 bits (930), Expect = e-117
 Identities = 218/380 (57%), Positives = 241/380 (63%), Gaps = 5/380 (1%)
 Frame = +2

Query: 41   YAVQSDWKAHSKVCGSREYKCDCGTVFSRRDSFITHRAFCDALAEENAKAQTKTLTAGKA 220
            YAVQSDWKAHSK+CG+REYKCDCGT+FSRRDSFITHRAFCDALAEE+A++Q +T+   KA
Sbjct: 146  YAVQSDWKAHSKICGTREYKCDCGTLFSRRDSFITHRAFCDALAEESARSQPQTVA--KA 203

Query: 221  NSESDSKVLTGDXXXXXXXXXXXXXXXXXXXXX-QSNSVVVSSA-LQTQKLELQENPPQI 394
            +SESDSK +TGD                      +SNSVVVSS+ LQT   EL EN PQ+
Sbjct: 204  SSESDSKAVTGDSSPPAAAAAATPPPPSAPPASSKSNSVVVSSSVLQTPNPELPENSPQV 263

Query: 395  IEEPQVVTAAALNXXXXXXXXXXXXXXXXXXXXXXX-VXXXXXXXXXXXXXXXXXXQPPA 571
            IEEPQ   A + +                        V                  Q PA
Sbjct: 264  IEEPQANPAVSGSCSGTSTSTSTTSSTSNSNGGASSSVFASLFASSTASATASLQSQTPA 323

Query: 572  FTDLIRAMGCPDRSVDLPAPPSSEAISLCLSTNHGSSFFGTGGQDHRQYAPSQQAPAMSA 751
            FTDLIRAMG PD   DL  P +SE ISLCL+TNHGSS FGTG Q+ RQYAP  Q PAMSA
Sbjct: 324  FTDLIRAMGHPDHPADLSRPSASEPISLCLATNHGSSIFGTGLQECRQYAPPPQ-PAMSA 382

Query: 752  TALLQKAAQMGAAATNASLLRGLGIVXXXXXXXXGLQNSLQWCQMQMEPESSASVPAGLG 931
            TALLQKAAQMGAAATNAS LRGLGIV        G Q+SLQW Q   EPE  ASVPAGLG
Sbjct: 383  TALLQKAAQMGAAATNASFLRGLGIV-SSASTSSGQQDSLQWGQQPGEPE-GASVPAGLG 440

Query: 932  LGLPCDGGSGLKELMLSTPSMFVPKQTTLDFXXXXXXXXXXXXXXXXXXITSIDSDMDIT 1111
            LGLPCDG SGLKELM+ TPS+F PKQTTLDF                  ITSI   +D+T
Sbjct: 441  LGLPCDGSSGLKELMMGTPSVFGPKQTTLDFLGLGMAAGGNPGGGLSALITSIGGSLDVT 500

Query: 1112 --AAAASFGNGEFSSKDIGR 1165
              AAAASFGNGEF  KDIGR
Sbjct: 501  AAAAAASFGNGEFPGKDIGR 520


>ref|XP_007156918.1| hypothetical protein PHAVU_002G028100g [Phaseolus vulgaris]
 gb|ESW28912.1| hypothetical protein PHAVU_002G028100g [Phaseolus vulgaris]
          Length = 521

 Score =  362 bits (929), Expect = e-116
 Identities = 214/379 (56%), Positives = 237/379 (62%), Gaps = 4/379 (1%)
 Frame = +2

Query: 41   YAVQSDWKAHSKVCGSREYKCDCGTVFSRRDSFITHRAFCDALAEENAKAQTKTLTAGKA 220
            YAVQSDWKAHSK+CG+REYKCDCGT+FSRRDSFITHRAFCDALAEE+A++Q +T+   KA
Sbjct: 146  YAVQSDWKAHSKICGTREYKCDCGTLFSRRDSFITHRAFCDALAEESARSQPQTVA--KA 203

Query: 221  NSESDSKVLTGDXXXXXXXXXXXXXXXXXXXXXQSNSVVVSSALQTQKLELQENPPQIIE 400
            +SESDSK +TGD                      ++ VV SSALQTQ  EL EN PQ+IE
Sbjct: 204  SSESDSKAVTGDSSPPAAVATPPPPPAPPASPKSNSVVVSSSALQTQNPELPENSPQVIE 263

Query: 401  EPQVVTA--AALNXXXXXXXXXXXXXXXXXXXXXXXVXXXXXXXXXXXXXXXXXXQPPAF 574
            E Q   A   + +                                          Q PAF
Sbjct: 264  ETQANPAMSGSCSSSGTSTSTTSSTSNSNGGGSSSVFASLFASSTAASATASLHSQTPAF 323

Query: 575  TDLIRAMGCPDRSVDLPAPPSSEAISLCLSTNHGSSFFGTGGQDHRQYAPSQQAPAMSAT 754
            TDLIRAMG PD   DL  P SSE ISLCL+TNHGSS FGTG Q+ RQYAP  Q PAMSAT
Sbjct: 324  TDLIRAMGHPDHPADLSRPSSSEPISLCLATNHGSSIFGTGLQECRQYAPPPQ-PAMSAT 382

Query: 755  ALLQKAAQMGAAATNASLLRGLGIVXXXXXXXXGLQNSLQWCQMQMEPESSASVPAGLGL 934
            ALLQKAAQMGAAATNAS LRGLGIV        G Q+SLQW Q   EPE  ASVPAGLGL
Sbjct: 383  ALLQKAAQMGAAATNASFLRGLGIV-SSASTSSGQQDSLQWGQQPGEPE-GASVPAGLGL 440

Query: 935  GLPCDGGSGLKELMLSTPSMFVPKQTTLDFXXXXXXXXXXXXXXXXXXITSIDSDMDIT- 1111
            GLPCDG SGLKELM+ TPS+F PKQTTLDF                  ITSI   +D+T 
Sbjct: 441  GLPCDGSSGLKELMMGTPSVFGPKQTTLDFLGLGMAAGGNPGGGLSALITSIGGSLDVTA 500

Query: 1112 -AAAASFGNGEFSSKDIGR 1165
             AAAASFG+GEF  KDIGR
Sbjct: 501  AAAAASFGSGEFPGKDIGR 519


>ref|XP_015964045.2| LOW QUALITY PROTEIN: protein indeterminate-domain 2-like [Arachis
            duranensis]
          Length = 546

 Score =  354 bits (908), Expect = e-113
 Identities = 216/403 (53%), Positives = 234/403 (58%), Gaps = 28/403 (6%)
 Frame = +2

Query: 41   YAVQSDWKAHSKVCGSREYKCDCGTVFSRRDSFITHRAFCDALAEENAKAQTKTLTAG-- 214
            YAVQSDWKAHSK+CG+REYKCDCGT+FSRRDSFITHRAFCDALAEE+A++QT + T    
Sbjct: 152  YAVQSDWKAHSKICGTREYKCDCGTLFSRRDSFITHRAFCDALAEESARSQTHSQTTQSQ 211

Query: 215  ---KANSESDSKVLTGDXXXXXXXXXXXXXXXXXXXXX---------------------- 319
               K +SESDSK +  +                                           
Sbjct: 212  IGVKVSSESDSKAVNAESSSPQPTPPPPATTPAPAPPPPPPPVQPEAPPLPATTTTTTTT 271

Query: 320  QSNSVVVSSALQTQKLELQENPPQIIEEPQVVTAAALNXXXXXXXXXXXXXXXXXXXXXX 499
            Q NSVVV   LQTQ  EL EN PQIIEEPQ  TA  LN                      
Sbjct: 272  QPNSVVVPLVLQTQNPELPENSPQIIEEPQANTA--LNGSCSSSTSSTSNGGTSSSVFAS 329

Query: 500  XVXXXXXXXXXXXXXXXXXXQPPAFTDLIRAMGCPDRSVDLPAPPSSEAISLCLSTNHGS 679
                                 PPAFTDL+RAMG PD   DLP P SSE ISLCL+TNHGS
Sbjct: 330  LFASSTASGNLQSQT------PPAFTDLVRAMGPPDHPTDLPGPSSSEPISLCLATNHGS 383

Query: 680  SFFGTGGQDHRQYAPSQQAPAMSATALLQKAAQMGAAATNASLLRGLGIVXXXXXXXXGL 859
            S FGTGGQ+ RQYAP  Q P MSATALLQKAAQMGAAATNASLLRGLGIV          
Sbjct: 384  SIFGTGGQERRQYAPPPQ-PTMSATALLQKAAQMGAAATNASLLRGLGIVSSSASSSPAQ 442

Query: 860  QNSLQWCQMQMEPESSASVPAGLGLGLPCDGGSGLKELMLSTPSMFVPKQTTLDFXXXXX 1039
            Q+ LQW Q   EPE SASVPAGLGLGL  D GSGLKELM+ TPSMF PK TTLDF     
Sbjct: 443  QDGLQWGQQPAEPE-SASVPAGLGLGLSFDSGSGLKELMMGTPSMFGPKHTTLDFLGLGM 501

Query: 1040 XXXXXXXXXXXXXITSIDSDMDIT-AAAASFGNGEFSSKDIGR 1165
                         ITSI   +D++ AAAASFGNGEFS KDIGR
Sbjct: 502  AAGGTPGGGLSALITSIGGGLDVSAAAAASFGNGEFSGKDIGR 544


>ref|XP_019444753.1| PREDICTED: protein indeterminate-domain 1 [Lupinus angustifolius]
 ref|XP_019444754.1| PREDICTED: protein indeterminate-domain 1 [Lupinus angustifolius]
 gb|OIW10994.1| hypothetical protein TanjilG_22801 [Lupinus angustifolius]
          Length = 505

 Score =  346 bits (887), Expect = e-110
 Identities = 210/383 (54%), Positives = 234/383 (61%), Gaps = 6/383 (1%)
 Frame = +2

Query: 41   YAVQSDWKAHSKVCGSREYKCDCGTVFSRRDSFITHRAFCDALAEENAKAQ-----TKTL 205
            YAVQSDWKAHSK+CG+REYKCDCGTVFSRRDSFITHRAFCDALAEE+A++Q     T+T 
Sbjct: 144  YAVQSDWKAHSKICGTREYKCDCGTVFSRRDSFITHRAFCDALAEESARSQPQSQTTQTQ 203

Query: 206  TAGKANSESDSKVLTGDXXXXXXXXXXXXXXXXXXXXX-QSNSVVVSSALQTQKLELQEN 382
            +A KANS+SDSK +TGD                      QSNS  +S  L+ Q  EL EN
Sbjct: 204  SAIKANSDSDSKAVTGDDSSPMEVPPLPPPSPPAPPAIPQSNSAALSD-LKVQNPELPEN 262

Query: 383  PPQIIEEPQVVTAAALNXXXXXXXXXXXXXXXXXXXXXXXVXXXXXXXXXXXXXXXXXXQ 562
             PQ +EE Q   A  LN                       V                  Q
Sbjct: 263  TPQSLEELQAKNA--LNGSCSTSTNTTSNG----------VSVFASLFASSTTSENLQSQ 310

Query: 563  PPAFTDLIRAMGCPDRSVDLPAPPSSEAISLCLSTNHGSSFFGTGGQDHRQYAPSQQAPA 742
             PAFTDLIRAMG PD SVD+P P  S+ ISLCL    GSS F TGGQ+ RQYAP  Q PA
Sbjct: 311  TPAFTDLIRAMGRPDHSVDIPGPSFSDPISLCL----GSSMFATGGQERRQYAPPPQ-PA 365

Query: 743  MSATALLQKAAQMGAAATNASLLRGLGIVXXXXXXXXGLQNSLQWCQMQMEPESSASVPA 922
            MSATALLQKAAQMGAAATNASLLRGLGIV          Q+SLQW Q Q+EP+S++  P 
Sbjct: 366  MSATALLQKAAQMGAAATNASLLRGLGIVSSSASTAPTQQDSLQWGQRQVEPDSASISP- 424

Query: 923  GLGLGLPCDGGSGLKELMLSTPSMFVPKQTTLDFXXXXXXXXXXXXXXXXXXITSIDSDM 1102
              GLGLPCD GSGLKELML  PS+F PKQTTLDF                  ITSI   +
Sbjct: 425  --GLGLPCDSGSGLKELMLGAPSLFGPKQTTLDFLGLGMVAGGTPGGGLSALITSIGGSL 482

Query: 1103 DITAAAASFGNGEFSSKDIGRRS 1171
            D+TAAA SFGNGEFS +DIGR S
Sbjct: 483  DVTAAATSFGNGEFSGEDIGRNS 505


>ref|XP_007156028.1| hypothetical protein PHAVU_003G252400g [Phaseolus vulgaris]
 gb|ESW28022.1| hypothetical protein PHAVU_003G252400g [Phaseolus vulgaris]
          Length = 506

 Score =  335 bits (859), Expect = e-106
 Identities = 208/381 (54%), Positives = 229/381 (60%), Gaps = 4/381 (1%)
 Frame = +2

Query: 41   YAVQSDWKAHSKVCGSREYKCDCGTVFSRRDSFITHRAFCDALAEENAKAQTKTLTAGKA 220
            YAVQSDWKAHSKVCG+REYKCDCGTVFSRRDSFITHRAFCDALA+E+ ++ T      KA
Sbjct: 143  YAVQSDWKAHSKVCGNREYKCDCGTVFSRRDSFITHRAFCDALAKESMRSHTDVT---KA 199

Query: 221  NSESDSKVLTGDXXXXXXXXXXXXXXXXXXXXXQSNSVVVSSALQTQKLELQENP-PQII 397
            N E+DSKVLT                       QSNS + SS  QTQ  EL EN  PQ+ 
Sbjct: 200  NEENDSKVLTDSPPPEVAAATATSPA-------QSNSAI-SSGGQTQNPELPENNLPQVT 251

Query: 398  EEPQVVTAAALNXXXXXXXXXXXXXXXXXXXXXXXVXXXXXXXXXXXXXXXXXXQPPAFT 577
            EEPQ +TA +                                            Q PAF+
Sbjct: 252  EEPQALTATS-GSCGSNSSNNCSTSNGGATSNSNSSSMFASLFASSTSTGTQQSQTPAFS 310

Query: 578  DLIRAMGCPDRSVDLPAPPSSEAISLCLSTNHGSSFFGTGGQDHRQYAPSQQAPAMSATA 757
            DL+RAMG PD   D+ AP SSEAISLCLST++ S  F TGGQ+HRQYA   Q PAMSATA
Sbjct: 311  DLVRAMGPPDHHADISAPSSSEAISLCLSTSNASPIFATGGQEHRQYASPPQ-PAMSATA 369

Query: 758  LLQKAAQMGAAATNASLLRGLGIV-XXXXXXXXGLQNSLQWCQMQMEPESSASVPAGLGL 934
            LLQKAAQMGAAATNASLLRG GIV         G QN LQW Q Q+E E S SVPA LGL
Sbjct: 370  LLQKAAQMGAAATNASLLRGFGIVSSSSASTSSGQQNGLQWGQPQLESE-SGSVPAALGL 428

Query: 935  GLPCDGGSGLKELMLSTPSMFVPKQTTLDFXXXXXXXXXXXXXXXXXXITSIDSDMDIT- 1111
             LPCDG SGL ELM+ TPSMF PK TTLDF                  ITSI   +D+T 
Sbjct: 429  SLPCDGDSGLNELMMGTPSMFGPKHTTLDF----LGLGMAAGGGLSALITSIGGGLDVTA 484

Query: 1112 -AAAASFGNGEFSSKDIGRRS 1171
             AAAA+FGNGEFS KD GRRS
Sbjct: 485  AAAAATFGNGEFSGKDTGRRS 505


>gb|EOX91151.1| C2H2-like zinc finger protein [Theobroma cacao]
          Length = 534

 Score =  333 bits (854), Expect = e-105
 Identities = 207/398 (52%), Positives = 228/398 (57%), Gaps = 21/398 (5%)
 Frame = +2

Query: 41   YAVQSDWKAHSKVCGSREYKCDCGTVFSRRDSFITHRAFCDALAEENAKAQT------KT 202
            YAVQSDWKAHSK+CG+REYKCDCGT+FSRRDSFITHRAFCDALAEE+A+AQT      + 
Sbjct: 146  YAVQSDWKAHSKICGTREYKCDCGTLFSRRDSFITHRAFCDALAEESARAQTHPQPQNQN 205

Query: 203  LTAGKANSESDSKVLTGDXXXXXXXXXXXXXXXXXXXXX--------------QSNSVVV 340
                  +SESD KV   D                                   QS SV+ 
Sbjct: 206  QAVANPSSESDPKVQAVDSSAPPAPAPTPAPAPASAPVQVSASAPAPAAPTLPQSTSVIS 265

Query: 341  SSALQTQKLELQENPPQIIEEPQVVTAAALNXXXXXXXXXXXXXXXXXXXXXXXVXXXXX 520
            SS L  +  EL ENP  I+EE  V   A                                
Sbjct: 266  SSVLPIRSSELPENPTPIVEEAPVPAPAPAGLNGSCSTSTSSGSNGGSRSSVFA----SL 321

Query: 521  XXXXXXXXXXXXXQPPAFTDLIRAMGCPDRSVDLPAPPSSEAISLCLSTNHGSSFFGTGG 700
                         QPPAFTDLIRAMG PDR  DL    S+E ISLCLSTNHGSS FGT G
Sbjct: 322  FASSTASTSLQPPQPPAFTDLIRAMGRPDRPADLAPSTSTEPISLCLSTNHGSSIFGTAG 381

Query: 701  QDHRQYAPSQQAPAMSATALLQKAAQMGAAATNASLLRGLGIVXXXXXXXXGLQNSLQWC 880
            Q+ RQYAP  Q PAMSATALLQKAAQMGAAATNASLLRG GIV          Q+SLQW 
Sbjct: 382  QERRQYAPPPQ-PAMSATALLQKAAQMGAAATNASLLRGFGIVSSSSSSEQ--QDSLQWG 438

Query: 881  QMQMEPESSASVPAGLGLGLPCDGGSGLKELMLSTPSMFVPKQTTLDFXXXXXXXXXXXX 1060
            Q Q+EPE +ASVPAGLGLGLPCDG SGLKELM+ TP +F PKQTTLDF            
Sbjct: 439  QRQVEPE-NASVPAGLGLGLPCDGSSGLKELMMGTP-VFGPKQTTLDFLGLGMAAGGSPN 496

Query: 1061 XXXXXXITSIDSDMDI-TAAAASFGNGEFSSKDIGRRS 1171
                  ITSI   +D+  AAAASFG G+F+ KDIGR S
Sbjct: 497  GGLSALITSIGGGLDVAAAAAASFGGGDFTGKDIGRSS 534


>ref|XP_012079066.1| protein indeterminate-domain 2 [Jatropha curcas]
 gb|KDP31793.1| hypothetical protein JCGZ_12254 [Jatropha curcas]
          Length = 522

 Score =  332 bits (851), Expect = e-105
 Identities = 204/387 (52%), Positives = 228/387 (58%), Gaps = 10/387 (2%)
 Frame = +2

Query: 41   YAVQSDWKAHSKVCGSREYKCDCGTVFSRRDSFITHRAFCDALAEENAKAQTKTLTAGKA 220
            YAVQSDWKAHSK+CG+REYKCDCGT+FSRRDSFITHRAFCDALAEE+A+A T+T     A
Sbjct: 148  YAVQSDWKAHSKICGTREYKCDCGTLFSRRDSFITHRAFCDALAEESARA-TQTPNPNPA 206

Query: 221  ------NSESDSKVLTGDXXXXXXXXXXXXXXXXXXXXXQSNSVVVSSALQTQKLELQEN 382
                  N ES+ KV                         QS  V+ SS       EL +N
Sbjct: 207  AVNLNPNQESEPKVQVDPSPPPPPPPPLAPVAAAPAPPAQSAGVISSSISPNHSPELPDN 266

Query: 383  PPQIIEE---PQVVTAAA-LNXXXXXXXXXXXXXXXXXXXXXXXVXXXXXXXXXXXXXXX 550
            P  IIEE   PQ   A A LN                                       
Sbjct: 267  PSPIIEEALAPQSALATAGLNGSSSSSTSSSSNGSTSSSVFASLFASSTASGSIQPP--- 323

Query: 551  XXXQPPAFTDLIRAMGCPDRSVDLPAPPSSEAISLCLSTNHGSSFFGTGGQDHRQYAPSQ 730
               Q PAFTDLIRAM  PDR  DL  P S+E ISLCLST+HGSS FGT GQ+ RQYAP  
Sbjct: 324  ---QTPAFTDLIRAMAHPDRPADLAPPSSTEPISLCLSTSHGSSIFGTAGQERRQYAPPP 380

Query: 731  QAPAMSATALLQKAAQMGAAATNASLLRGLGIVXXXXXXXXGLQNSLQWCQMQMEPESSA 910
            Q PAMSATALLQKAAQ+GAAATNASLLRG GIV          Q+++QW   Q+EPE++ 
Sbjct: 381  Q-PAMSATALLQKAAQIGAAATNASLLRGFGIV---SSSSSAQQDNMQWGHRQIEPENT- 435

Query: 911  SVPAGLGLGLPCDGGSGLKELMLSTPSMFVPKQTTLDFXXXXXXXXXXXXXXXXXXITSI 1090
            SV AGLGLGLPCDGGSGLKELM+ TPS+F PKQTTLDF                  ITSI
Sbjct: 436  SVTAGLGLGLPCDGGSGLKELMMGTPSVFGPKQTTLDFLGLGMAAGGSPSSGLSALITSI 495

Query: 1091 DSDMDITAAAASFGNGEFSSKDIGRRS 1171
             S MD+ AAAASFG GE+S KD+GR S
Sbjct: 496  GSGMDVAAAAASFGGGEYSGKDLGRSS 522


>ref|XP_017985426.1| PREDICTED: protein indeterminate-domain 2 [Theobroma cacao]
          Length = 538

 Score =  332 bits (850), Expect = e-104
 Identities = 207/402 (51%), Positives = 228/402 (56%), Gaps = 25/402 (6%)
 Frame = +2

Query: 41   YAVQSDWKAHSKVCGSREYKCDCGTVFSRRDSFITHRAFCDALAEENAKAQT------KT 202
            YAVQSDWKAHSK+CG+REYKCDCGT+FSRRDSFITHRAFCDALAEE+A+AQT      + 
Sbjct: 146  YAVQSDWKAHSKICGTREYKCDCGTLFSRRDSFITHRAFCDALAEESARAQTHPQPQNQN 205

Query: 203  LTAGKANSESDSKVLTGDXXXXXXXXXXXXXXXXXXXXX------------------QSN 328
                  +SESD KV   D                                       QS 
Sbjct: 206  QAVANPSSESDPKVQAVDSSAPPAPAPTPAPAPASAPVQVSASAPAPAPTPAAPTLPQST 265

Query: 329  SVVVSSALQTQKLELQENPPQIIEEPQVVTAAALNXXXXXXXXXXXXXXXXXXXXXXXVX 508
            SV+ SS L  +  EL ENP  I+EE  V   A                            
Sbjct: 266  SVISSSVLPIRSSELPENPTPIVEEAPVPAPAPAGLNGSCSTSTSSGSNGGSRSSVFA-- 323

Query: 509  XXXXXXXXXXXXXXXXXQPPAFTDLIRAMGCPDRSVDLPAPPSSEAISLCLSTNHGSSFF 688
                             QPPAFTDLIRAMG PDR  DL    S+E ISLCLSTNHGSS F
Sbjct: 324  --SLFASSTASTSLQPPQPPAFTDLIRAMGRPDRPADLAPSTSTEPISLCLSTNHGSSIF 381

Query: 689  GTGGQDHRQYAPSQQAPAMSATALLQKAAQMGAAATNASLLRGLGIVXXXXXXXXGLQNS 868
            GT GQ+ RQYAP  Q PAMSATALLQKAAQMGAAATNASLLRG GIV          Q+S
Sbjct: 382  GTAGQERRQYAPPPQ-PAMSATALLQKAAQMGAAATNASLLRGFGIVSSSSSSEQ--QDS 438

Query: 869  LQWCQMQMEPESSASVPAGLGLGLPCDGGSGLKELMLSTPSMFVPKQTTLDFXXXXXXXX 1048
            LQW Q Q+EPE +ASVPAGLGLGLPCDG SGLKELM+ TP +F PKQTTLDF        
Sbjct: 439  LQWGQRQVEPE-NASVPAGLGLGLPCDGSSGLKELMMGTP-VFGPKQTTLDFLGLGMAAG 496

Query: 1049 XXXXXXXXXXITSIDSDMDI-TAAAASFGNGEFSSKDIGRRS 1171
                      ITSI   +D+  AAAASFG G+F+ KDIGR S
Sbjct: 497  GSPNGGLSALITSIGGGLDVAAAAAASFGGGDFTGKDIGRSS 538


>gb|POO02574.1| TFIIH C1-like domain containing protein [Trema orientalis]
          Length = 540

 Score =  330 bits (846), Expect = e-104
 Identities = 209/401 (52%), Positives = 231/401 (57%), Gaps = 24/401 (5%)
 Frame = +2

Query: 41   YAVQSDWKAHSKVCGSREYKCDCGTVFSRRDSFITHRAFCDALAEENAK----------- 187
            YAVQSDWKAHSK+CG+REYKCDCGT+FSRRDSFITHRAFCDALAEE+AK           
Sbjct: 151  YAVQSDWKAHSKICGTREYKCDCGTLFSRRDSFITHRAFCDALAEESAKTQNPTPAQNQT 210

Query: 188  -AQTKTLTAGKANSESDSKVLT-GDXXXXXXXXXXXXXXXXXXXXXQSNSV--------- 334
             AQ +T TAG   SES+++V T                         S+SV         
Sbjct: 211  SAQNQTRTAGNMKSESETQVRTVKSSSPPPPPPPPPPAPLVAPAAPPSSSVPSGPPQSTG 270

Query: 335  VVSSALQTQK-LELQENPPQIIEE-PQVVTAAALNXXXXXXXXXXXXXXXXXXXXXXXVX 508
            V+SS L  Q   EL ENPP+ +EE P   T  +                           
Sbjct: 271  VLSSTLPIQTPAELPENPPRPLEEAPSAATGLS-------GSSSSSTTSSSSNGSTSSTV 323

Query: 509  XXXXXXXXXXXXXXXXXQPPAFTDLIRAMGCPDRSVDLPAPPSSEAISLCLSTNHGSSFF 688
                             QPPAFTDLIRAM  PD   DL    S E ISL LST+HGSS F
Sbjct: 324  FASLFASSTTSASLQPPQPPAFTDLIRAMSRPDCPTDLAPSSSLEPISLGLSTSHGSSIF 383

Query: 689  GTGGQDHRQYAPSQQAPAMSATALLQKAAQMGAAATNASLLRGLGIVXXXXXXXXGLQNS 868
            GT GQ+ RQYAP  Q PAMSATALLQKAAQMGAAATNASLLRGLGIV        G Q +
Sbjct: 384  GTAGQERRQYAPPPQ-PAMSATALLQKAAQMGAAATNASLLRGLGIV--SSSSSSGRQEN 440

Query: 869  LQWCQMQMEPESSASVPAGLGLGLPCDGGSGLKELMLSTPSMFVPKQTTLDFXXXXXXXX 1048
            LQW Q Q+EP+S+ SVP GLGLGLPCDG SGLKELM+ TPSMF PKQTTLDF        
Sbjct: 441  LQWPQQQVEPDSN-SVPPGLGLGLPCDGSSGLKELMMGTPSMFGPKQTTLDFLGLGMAAG 499

Query: 1049 XXXXXXXXXXITSIDSDMDITAAAASFGNGEFSSKDIGRRS 1171
                      ITSI   +D+ AAAASFG GEFS KDIGR S
Sbjct: 500  GTPSGGLSALITSIGGGLDVAAAAASFGGGEFSGKDIGRSS 540


>ref|XP_021273679.1| LOW QUALITY PROTEIN: protein indeterminate-domain 2 [Herrania
            umbratica]
          Length = 538

 Score =  329 bits (844), Expect = e-103
 Identities = 206/400 (51%), Positives = 228/400 (57%), Gaps = 25/400 (6%)
 Frame = +2

Query: 41   YAVQSDWKAHSKVCGSREYKCDCGTVFSRRDSFITHRAFCDALAEENAKAQT------KT 202
            YAVQSDWKAHSK+CG+REYKCDCGT+FSRRDSFITHRAFCDALAEE+A+AQT      + 
Sbjct: 146  YAVQSDWKAHSKICGTREYKCDCGTLFSRRDSFITHRAFCDALAEESARAQTHPQPQNQN 205

Query: 203  LTAGKANSESDSKVLTGDXXXXXXXXXXXXXXXXXXXXX----------------QSNSV 334
                  +SESD KV   D                                     QS SV
Sbjct: 206  QAVANPSSESDPKVQAVDSSAPPAPAPTPXSGTASAPVQVSASAPAPTPAPPALPQSTSV 265

Query: 335  VVSSALQTQKLELQENPPQIIEEPQVVTAAALNXXXXXXXXXXXXXXXXXXXXXXXVXXX 514
            + SS L  +  EL ENP  I+E+  V   A                              
Sbjct: 266  ISSSVLPIRSSELPENPTPIVEDAPVPAPAPAGLNGSCSTSTSSGSNGGSRSSVFA---- 321

Query: 515  XXXXXXXXXXXXXXXQPPAFTDLIRAMGCPDRSVDLPAPPSSEAISLCLSTNHGSSFFGT 694
                           QPPAFTDLIRAMG PDR  DL    S+E ISLCLSTNHGSS FGT
Sbjct: 322  SLFASSTASASLQPPQPPAFTDLIRAMGRPDRPADLAPSTSTEPISLCLSTNHGSSIFGT 381

Query: 695  GGQDHRQYAPSQQAPAMSATALLQKAAQMGAAATNASLLRGLGIVXXXXXXXXGLQNSLQ 874
             GQ+ RQYAP  Q PAMSATALLQKAAQMGAAATNASLLRG GIV          Q+SLQ
Sbjct: 382  AGQERRQYAPPPQ-PAMSATALLQKAAQMGAAATNASLLRGFGIVSSSSSSEQ--QDSLQ 438

Query: 875  WCQMQMEPESSASVPAGLGLGLPCDGGSGLKELMLSTPSMFVPKQTTLDFXXXXXXXXXX 1054
            W Q Q+EPE +ASVPAGLGLGLPCDG SGLKELM+ TP +F PKQTTLDF          
Sbjct: 439  WGQRQVEPE-NASVPAGLGLGLPCDGSSGLKELMMGTP-VFGPKQTTLDFLGLGMAAGGN 496

Query: 1055 XXXXXXXXITSIDSDMDI---TAAAASFGNGEFSSKDIGR 1165
                    ITSI   +D+    AAAASFG G+F+ KDIGR
Sbjct: 497  PSGGLSALITSIGGGLDVAAAAAAAASFGGGDFTGKDIGR 536


>gb|OMO56870.1| hypothetical protein CCACVL1_26206 [Corchorus capsularis]
          Length = 526

 Score =  327 bits (837), Expect = e-103
 Identities = 200/392 (51%), Positives = 230/392 (58%), Gaps = 15/392 (3%)
 Frame = +2

Query: 41   YAVQSDWKAHSKVCGSREYKCDCGTVFSRRDSFITHRAFCDALAEENAKAQTKTLT---- 208
            YAVQSDWKAHSK+CG+REYKCDCGT+FSRRDSFITHRAFCDALAEE+A+AQT+  +    
Sbjct: 146  YAVQSDWKAHSKICGTREYKCDCGTLFSRRDSFITHRAFCDALAEESARAQTQPSSQNQN 205

Query: 209  --AGKANSESDSKVLTGDXXXXXXXXXXXXXXXXXXXXX-----QSNSVVVSSALQTQKL 367
              A   +SESD K    +                           + SV+ SS L  Q  
Sbjct: 206  QAAANPSSESDPKTQAMESSSPPAPPPAPASVSAPPPAAVQVSASTTSVISSSVLPMQSG 265

Query: 368  ELQENPPQIIEE----PQVVTAAALNXXXXXXXXXXXXXXXXXXXXXXXVXXXXXXXXXX 535
            ELQENP  I+EE    P     A LN                                  
Sbjct: 266  ELQENPTPILEEDPPPPPPPAPAGLNGSCSSSNSSSSNGSSSS------TVFASLFASST 319

Query: 536  XXXXXXXXQPPAFTDLIRAMGCPDRSVDLPAPPSSEAISLCLSTNHGSSFFGTGGQDHRQ 715
                    QPPAFTD+IRAMG P+R  DL    S+E ISLCLSTNHGSS FGT GQ+ RQ
Sbjct: 320  ASASLQPPQPPAFTDVIRAMGRPERPADLAPSTSTEPISLCLSTNHGSSIFGTAGQERRQ 379

Query: 716  YAPSQQAPAMSATALLQKAAQMGAAATNASLLRGLGIVXXXXXXXXGLQNSLQWCQMQME 895
            YAP+ Q PAMSATALLQKAAQMGAAA+NASLLRG GIV          Q++L W Q Q+E
Sbjct: 380  YAPAPQ-PAMSATALLQKAAQMGAAASNASLLRGFGIV--SSSSSSAQQDNLPWGQRQVE 436

Query: 896  PESSASVPAGLGLGLPCDGGSGLKELMLSTPSMFVPKQTTLDFXXXXXXXXXXXXXXXXX 1075
            P+ +ASVPAGLGLGLP DG SGLKELM+ TP +F PKQTTLDF                 
Sbjct: 437  PD-NASVPAGLGLGLPIDGSSGLKELMMGTP-VFGPKQTTLDFLGLGMAAGGSPNGGLSA 494

Query: 1076 XITSIDSDMDITAAAASFGNGEFSSKDIGRRS 1171
             ITSI   +D+ AAAASFG G+++ KDIGR S
Sbjct: 495  LITSIGGGLDVAAAAASFGGGDYTGKDIGRNS 526


Top