BLASTX nr result

ID: Rehmannia31_contig00009240 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia31_contig00009240
         (1631 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011097265.1| UPF0400 protein C337.03 isoform X2 [Sesamum ...   556   0.0  
ref|XP_011097264.1| UPF0400 protein C337.03 isoform X1 [Sesamum ...   551   0.0  
gb|PIN20758.1| Regulator of nuclear mRNA [Handroanthus impetigin...   548   0.0  
ref|XP_011097266.1| UPF0400 protein C337.03 isoform X3 [Sesamum ...   542   0.0  
ref|XP_011097267.1| UPF0400 protein C337.03 isoform X4 [Sesamum ...   542   0.0  
ref|XP_012844029.1| PREDICTED: regulation of nuclear pre-mRNA do...   511   e-174
emb|CBI21316.3| unnamed protein product, partial [Vitis vinifera]     487   e-165
ref|XP_022880268.1| UPF0400 protein C337.03-like [Olea europaea ...   476   e-160
emb|CDP16715.1| unnamed protein product [Coffea canephora]            472   e-159
ref|XP_019183948.1| PREDICTED: regulation of nuclear pre-mRNA do...   471   e-158
gb|KZV42059.1| ENTH/VHS family protein isoform 1 [Dorcoceras hyg...   466   e-156
ref|XP_010660722.1| PREDICTED: regulation of nuclear pre-mRNA do...   463   e-155
ref|XP_022774225.1| UPF0400 protein C337.03-like isoform X2 [Dur...   456   e-152
gb|EOY02866.1| ENTH/VHS family protein isoform 2 [Theobroma cacao]    454   e-151
ref|XP_007031940.2| PREDICTED: regulation of nuclear pre-mRNA do...   452   e-151
ref|XP_022774223.1| UPF0400 protein C337.03-like isoform X1 [Dur...   452   e-151
ref|XP_021299053.1| UPF0400 protein C337.03 isoform X2 [Herrania...   450   e-150
gb|EOY02865.1| ENTH/VHS family protein isoform 1 [Theobroma cacao]    449   e-150
ref|XP_019264842.1| PREDICTED: regulation of nuclear pre-mRNA do...   449   e-149
ref|XP_007031939.2| PREDICTED: regulation of nuclear pre-mRNA do...   448   e-149

>ref|XP_011097265.1| UPF0400 protein C337.03 isoform X2 [Sesamum indicum]
          Length = 525

 Score =  556 bits (1433), Expect = 0.0
 Identities = 280/369 (75%), Positives = 312/369 (84%)
 Frame = +1

Query: 1    QQSIETLSHWCIFHMNKAKQVVETWDRQFHCAPREQRLAFLYLANDILQNSRRKGAEFVA 180
            QQSIETLSHWCIFHMNKA QVVETWDRQFHCAPREQRLAFLYLANDI+QNSRRKGAEFVA
Sbjct: 21   QQSIETLSHWCIFHMNKATQVVETWDRQFHCAPREQRLAFLYLANDIIQNSRRKGAEFVA 80

Query: 181  EFWKVLPDSLRDVIENGDEFGKNAALRLISIWEERKVFGSRGQILKEEFGKKQLDSDNRN 360
            EFWKVLPD LRDVIENGDEFG+NAALRLISIWEERKVFGSRGQILKEEF K+ LD+ +RN
Sbjct: 81   EFWKVLPDCLRDVIENGDEFGRNAALRLISIWEERKVFGSRGQILKEEFVKRPLDNGSRN 140

Query: 361  GKHAGFKLRSPGGNALDKLISGFQAVYSSQLDEDAIFNKCRNAISFIEKVDKDVEGDYGL 540
             K AG KLR PGGNALDKL+SG+QAVY SQLDEDAIFNKCRNAI FIEKV+KD  GDY L
Sbjct: 141  PKLAGHKLRPPGGNALDKLVSGYQAVYGSQLDEDAIFNKCRNAIGFIEKVNKDTNGDYRL 200

Query: 541  GHVNSSITDELKGQHAILRDCMEQLAVVETSRANLVSLLREALQDQVYKLEKVRDQLQAA 720
             HVN +I DELKGQ AILRDC+EQL VVE+SR NL+S+LREALQ+Q YKLE+VR+QLQAA
Sbjct: 201  AHVNGNIRDELKGQQAILRDCIEQLTVVESSRENLLSILREALQEQEYKLEEVRNQLQAA 260

Query: 721  QTHSEQASTICRQLLGSNSDGQALTERSGNANIPTSQNPQNYIPGASEQSTSVTYTRQLP 900
            Q+HSEQAS+ICRQL G N DGQ L+E+SGN + P SQ PQN IPG +EQSTSVTYTRQ+ 
Sbjct: 261  QSHSEQASSICRQLHGGNGDGQILSEQSGNGS-PASQAPQNPIPGTAEQSTSVTYTRQVS 319

Query: 901  FAEKSSRTQEDPXXXXXXXXXXXXXXXXXXQMLTFVLSSLASEGVIGNSIRDSSTEYPFE 1080
            F  KS   +EDP                  QMLT+VLSSLASEGVIGN +++SS++YP E
Sbjct: 320  FGGKSGGVEEDPKSAAAAVAAKLAASASSAQMLTYVLSSLASEGVIGNPVKESSSDYPSE 379

Query: 1081 KKAKIENEH 1107
            K+AKIEN+H
Sbjct: 380  KRAKIENDH 388



 Score = 64.7 bits (156), Expect = 3e-07
 Identities = 34/69 (49%), Positives = 37/69 (53%)
 Frame = +1

Query: 1243 AMPILSGPYGYNXXXXXXXXXXXXXXXXXXXXXXXSTFAPPSNSYQSYQPESGFYGQPSS 1422
            AMP+ SGPYGYN                       STFAPP N Y+SY  E GFY QPS+
Sbjct: 458  AMPMPSGPYGYNMNQQLPATLHSYASVGPPISGA-STFAPPPNGYESYPTEGGFYDQPST 516

Query: 1423 LPMAPMSRQ 1449
            LPMAPM  Q
Sbjct: 517  LPMAPMGHQ 525


>ref|XP_011097264.1| UPF0400 protein C337.03 isoform X1 [Sesamum indicum]
 ref|XP_020554289.1| UPF0400 protein C337.03 isoform X1 [Sesamum indicum]
          Length = 526

 Score =  551 bits (1421), Expect = 0.0
 Identities = 280/370 (75%), Positives = 312/370 (84%), Gaps = 1/370 (0%)
 Frame = +1

Query: 1    QQSIETLSHWCIFHMNKAKQVVETWDRQFHCAPREQRLAFLYLANDILQNSRRKGAEFVA 180
            QQSIETLSHWCIFHMNKA QVVETWDRQFHCAPREQRLAFLYLANDI+QNSRRKGAEFVA
Sbjct: 21   QQSIETLSHWCIFHMNKATQVVETWDRQFHCAPREQRLAFLYLANDIIQNSRRKGAEFVA 80

Query: 181  EFWKVLPDSLRDVIENGDEFGKNAALRLISIWEERKVFGSRGQILKEEFGKKQLDSDNRN 360
            EFWKVLPD LRDVIENGDEFG+NAALRLISIWEERKVFGSRGQILKEEF K+ LD+ +RN
Sbjct: 81   EFWKVLPDCLRDVIENGDEFGRNAALRLISIWEERKVFGSRGQILKEEFVKRPLDNGSRN 140

Query: 361  GKHAGFKLRSPGGNALDKLISGFQAVYSSQLDEDAIFNKCRNAISFIEKVDKDVEGDYGL 540
             K AG KLR PGGNALDKL+SG+QAVY SQLDEDAIFNKCRNAI FIEKV+KD  GDY L
Sbjct: 141  PKLAGHKLRPPGGNALDKLVSGYQAVYGSQLDEDAIFNKCRNAIGFIEKVNKDTNGDYRL 200

Query: 541  -GHVNSSITDELKGQHAILRDCMEQLAVVETSRANLVSLLREALQDQVYKLEKVRDQLQA 717
              HVN +I DELKGQ AILRDC+EQL VVE+SR NL+S+LREALQ+Q YKLE+VR+QLQA
Sbjct: 201  AAHVNGNIRDELKGQQAILRDCIEQLTVVESSRENLLSILREALQEQEYKLEEVRNQLQA 260

Query: 718  AQTHSEQASTICRQLLGSNSDGQALTERSGNANIPTSQNPQNYIPGASEQSTSVTYTRQL 897
            AQ+HSEQAS+ICRQL G N DGQ L+E+SGN + P SQ PQN IPG +EQSTSVTYTRQ+
Sbjct: 261  AQSHSEQASSICRQLHGGNGDGQILSEQSGNGS-PASQAPQNPIPGTAEQSTSVTYTRQV 319

Query: 898  PFAEKSSRTQEDPXXXXXXXXXXXXXXXXXXQMLTFVLSSLASEGVIGNSIRDSSTEYPF 1077
             F  KS   +EDP                  QMLT+VLSSLASEGVIGN +++SS++YP 
Sbjct: 320  SFGGKSGGVEEDPKSAAAAVAAKLAASASSAQMLTYVLSSLASEGVIGNPVKESSSDYPS 379

Query: 1078 EKKAKIENEH 1107
            EK+AKIEN+H
Sbjct: 380  EKRAKIENDH 389



 Score = 64.7 bits (156), Expect = 3e-07
 Identities = 34/69 (49%), Positives = 37/69 (53%)
 Frame = +1

Query: 1243 AMPILSGPYGYNXXXXXXXXXXXXXXXXXXXXXXXSTFAPPSNSYQSYQPESGFYGQPSS 1422
            AMP+ SGPYGYN                       STFAPP N Y+SY  E GFY QPS+
Sbjct: 459  AMPMPSGPYGYNMNQQLPATLHSYASVGPPISGA-STFAPPPNGYESYPTEGGFYDQPST 517

Query: 1423 LPMAPMSRQ 1449
            LPMAPM  Q
Sbjct: 518  LPMAPMGHQ 526


>gb|PIN20758.1| Regulator of nuclear mRNA [Handroanthus impetiginosus]
          Length = 498

 Score =  548 bits (1413), Expect = 0.0
 Identities = 293/494 (59%), Positives = 336/494 (68%), Gaps = 11/494 (2%)
 Frame = +1

Query: 1    QQSIETLSHWCIFHMNKAKQVVETWDRQFHCAPREQRLAFLYLANDILQNSRRKGAEFVA 180
            QQSIETLSHWCIFHMNKA QVVETW RQFHCAPR+QRLAFLYLANDI+QNSRRKGAEFVA
Sbjct: 21   QQSIETLSHWCIFHMNKATQVVETWARQFHCAPRDQRLAFLYLANDIIQNSRRKGAEFVA 80

Query: 181  EFWKVLPDSLRDVIENGDEFGKNAALRLISIWEERKVFGSRGQILKEEFGKKQLDSDNRN 360
            EFWKVLPDSLRDVIENG+E  +NAALRLISIWEERKVFGSRGQILKEEF K+ LD  NRN
Sbjct: 81   EFWKVLPDSLRDVIENGNESERNAALRLISIWEERKVFGSRGQILKEEFVKRPLDVGNRN 140

Query: 361  GKHAGFKLRSPGGNALDKLISGFQAVYSSQLDEDAIFNKCRNAISFIEKVDKDVEGDYGL 540
             KHAGFK+R PGGNALDKL+SG+QAVY SQLDE+AIF+KCRNA++F+EKVDKD+E     
Sbjct: 141  SKHAGFKVRPPGGNALDKLMSGYQAVYGSQLDEEAIFDKCRNAVNFVEKVDKDIE----- 195

Query: 541  GHVNSSITDELKGQHAILRDCMEQLAVVETSRANLVSLLREALQDQVYKLEKVRDQLQAA 720
            GH++    DELKGQHA+L DC++QL VVE+SR NL+S+LREA+++Q YKL++VR+QLQAA
Sbjct: 196  GHIDGRTKDELKGQHAVLVDCIQQLTVVESSRENLLSILREAVREQEYKLDQVRNQLQAA 255

Query: 721  QTHSEQASTICRQLLGSNSDGQALTERSGNAN---IPTSQNPQNYIPGASEQSTSVTYTR 891
            Q+HSEQA +ICRQL G N          GN N    P SQ PQN+IPG  EQ+TSV YTR
Sbjct: 256  QSHSEQADSICRQLFGGN----------GNENENESPVSQAPQNFIPGTGEQTTSVIYTR 305

Query: 892  QLPFAEKSSRTQEDPXXXXXXXXXXXXXXXXXXQMLTFVLSSLASEGVIGNSIRDSSTEY 1071
            Q+PFAEKS   +EDP                  QMLT+VLSSLASEGVI N I+D+  +Y
Sbjct: 306  QIPFAEKSDHIEEDPKSAAAAVAAKLAASTSSAQMLTYVLSSLASEGVINNPIKDAPNDY 365

Query: 1072 PFEKKAKIENEH--------XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1227
            P EK+A+IEN+H                                                
Sbjct: 366  PSEKRARIENDHPSYVPSQVPQSNLSITSQELTPSEPPPPPSSPPPLPPLPPPLQPYQVP 425

Query: 1228 XXXXXAMPILSGPYGYNXXXXXXXXXXXXXXXXXXXXXXXSTFAPPSNSYQSYQPESGFY 1407
                 AMPI SGPYGY+                       S F PPSN YQSY  E+  Y
Sbjct: 426  QYMQTAMPISSGPYGYS-MNQQPPVPFQGYVPVGPPANAVSAFVPPSNGYQSYPTEASLY 484

Query: 1408 GQPSSLPMAPMSRQ 1449
            GQPSSLPMAPM RQ
Sbjct: 485  GQPSSLPMAPMGRQ 498


>ref|XP_011097266.1| UPF0400 protein C337.03 isoform X3 [Sesamum indicum]
          Length = 521

 Score =  542 bits (1397), Expect = 0.0
 Identities = 276/369 (74%), Positives = 308/369 (83%)
 Frame = +1

Query: 1    QQSIETLSHWCIFHMNKAKQVVETWDRQFHCAPREQRLAFLYLANDILQNSRRKGAEFVA 180
            QQSIETLSHWCIFHMNKA QVVETWDRQFHCAPREQRLAFLYLANDI+QNSRRKGAEFVA
Sbjct: 21   QQSIETLSHWCIFHMNKATQVVETWDRQFHCAPREQRLAFLYLANDIIQNSRRKGAEFVA 80

Query: 181  EFWKVLPDSLRDVIENGDEFGKNAALRLISIWEERKVFGSRGQILKEEFGKKQLDSDNRN 360
            EFWKVLPD LRDVIENGDEFG+NAALRLISIWEERKVFGSRGQILKEEF K+ LD+ +RN
Sbjct: 81   EFWKVLPDCLRDVIENGDEFGRNAALRLISIWEERKVFGSRGQILKEEFVKRPLDNGSRN 140

Query: 361  GKHAGFKLRSPGGNALDKLISGFQAVYSSQLDEDAIFNKCRNAISFIEKVDKDVEGDYGL 540
             K AG KLR PGGNALDKL+SG+QAVY SQLDEDAIFNKCRNAI FIEKV+KD       
Sbjct: 141  PKLAGHKLRPPGGNALDKLVSGYQAVYGSQLDEDAIFNKCRNAIGFIEKVNKDTNA---- 196

Query: 541  GHVNSSITDELKGQHAILRDCMEQLAVVETSRANLVSLLREALQDQVYKLEKVRDQLQAA 720
             HVN +I DELKGQ AILRDC+EQL VVE+SR NL+S+LREALQ+Q YKLE+VR+QLQAA
Sbjct: 197  AHVNGNIRDELKGQQAILRDCIEQLTVVESSRENLLSILREALQEQEYKLEEVRNQLQAA 256

Query: 721  QTHSEQASTICRQLLGSNSDGQALTERSGNANIPTSQNPQNYIPGASEQSTSVTYTRQLP 900
            Q+HSEQAS+ICRQL G N DGQ L+E+SGN + P SQ PQN IPG +EQSTSVTYTRQ+ 
Sbjct: 257  QSHSEQASSICRQLHGGNGDGQILSEQSGNGS-PASQAPQNPIPGTAEQSTSVTYTRQVS 315

Query: 901  FAEKSSRTQEDPXXXXXXXXXXXXXXXXXXQMLTFVLSSLASEGVIGNSIRDSSTEYPFE 1080
            F  KS   +EDP                  QMLT+VLSSLASEGVIGN +++SS++YP E
Sbjct: 316  FGGKSGGVEEDPKSAAAAVAAKLAASASSAQMLTYVLSSLASEGVIGNPVKESSSDYPSE 375

Query: 1081 KKAKIENEH 1107
            K+AKIEN+H
Sbjct: 376  KRAKIENDH 384



 Score = 64.7 bits (156), Expect = 3e-07
 Identities = 34/69 (49%), Positives = 37/69 (53%)
 Frame = +1

Query: 1243 AMPILSGPYGYNXXXXXXXXXXXXXXXXXXXXXXXSTFAPPSNSYQSYQPESGFYGQPSS 1422
            AMP+ SGPYGYN                       STFAPP N Y+SY  E GFY QPS+
Sbjct: 454  AMPMPSGPYGYNMNQQLPATLHSYASVGPPISGA-STFAPPPNGYESYPTEGGFYDQPST 512

Query: 1423 LPMAPMSRQ 1449
            LPMAPM  Q
Sbjct: 513  LPMAPMGHQ 521


>ref|XP_011097267.1| UPF0400 protein C337.03 isoform X4 [Sesamum indicum]
          Length = 520

 Score =  542 bits (1396), Expect = 0.0
 Identities = 276/369 (74%), Positives = 308/369 (83%)
 Frame = +1

Query: 1    QQSIETLSHWCIFHMNKAKQVVETWDRQFHCAPREQRLAFLYLANDILQNSRRKGAEFVA 180
            QQSIETLSHWCIFHMNKA QVVETWDRQFHCAPREQRLAFLYLANDI+QNSRRKGAEFVA
Sbjct: 21   QQSIETLSHWCIFHMNKATQVVETWDRQFHCAPREQRLAFLYLANDIIQNSRRKGAEFVA 80

Query: 181  EFWKVLPDSLRDVIENGDEFGKNAALRLISIWEERKVFGSRGQILKEEFGKKQLDSDNRN 360
            EFWKVLPD LRDVIENGDEFG+NAALRLISIWEERKVFGSRGQILKEEF K+ LD+ +RN
Sbjct: 81   EFWKVLPDCLRDVIENGDEFGRNAALRLISIWEERKVFGSRGQILKEEFVKRPLDNGSRN 140

Query: 361  GKHAGFKLRSPGGNALDKLISGFQAVYSSQLDEDAIFNKCRNAISFIEKVDKDVEGDYGL 540
             K AG KLR PGGNALDKL+SG+QAVY SQLDEDAIFNKCRNAI FIEKV+KD       
Sbjct: 141  PKLAGHKLRPPGGNALDKLVSGYQAVYGSQLDEDAIFNKCRNAIGFIEKVNKDTN----- 195

Query: 541  GHVNSSITDELKGQHAILRDCMEQLAVVETSRANLVSLLREALQDQVYKLEKVRDQLQAA 720
             HVN +I DELKGQ AILRDC+EQL VVE+SR NL+S+LREALQ+Q YKLE+VR+QLQAA
Sbjct: 196  AHVNGNIRDELKGQQAILRDCIEQLTVVESSRENLLSILREALQEQEYKLEEVRNQLQAA 255

Query: 721  QTHSEQASTICRQLLGSNSDGQALTERSGNANIPTSQNPQNYIPGASEQSTSVTYTRQLP 900
            Q+HSEQAS+ICRQL G N DGQ L+E+SGN + P SQ PQN IPG +EQSTSVTYTRQ+ 
Sbjct: 256  QSHSEQASSICRQLHGGNGDGQILSEQSGNGS-PASQAPQNPIPGTAEQSTSVTYTRQVS 314

Query: 901  FAEKSSRTQEDPXXXXXXXXXXXXXXXXXXQMLTFVLSSLASEGVIGNSIRDSSTEYPFE 1080
            F  KS   +EDP                  QMLT+VLSSLASEGVIGN +++SS++YP E
Sbjct: 315  FGGKSGGVEEDPKSAAAAVAAKLAASASSAQMLTYVLSSLASEGVIGNPVKESSSDYPSE 374

Query: 1081 KKAKIENEH 1107
            K+AKIEN+H
Sbjct: 375  KRAKIENDH 383



 Score = 64.7 bits (156), Expect = 3e-07
 Identities = 34/69 (49%), Positives = 37/69 (53%)
 Frame = +1

Query: 1243 AMPILSGPYGYNXXXXXXXXXXXXXXXXXXXXXXXSTFAPPSNSYQSYQPESGFYGQPSS 1422
            AMP+ SGPYGYN                       STFAPP N Y+SY  E GFY QPS+
Sbjct: 453  AMPMPSGPYGYNMNQQLPATLHSYASVGPPISGA-STFAPPPNGYESYPTEGGFYDQPST 511

Query: 1423 LPMAPMSRQ 1449
            LPMAPM  Q
Sbjct: 512  LPMAPMGHQ 520


>ref|XP_012844029.1| PREDICTED: regulation of nuclear pre-mRNA domain-containing protein
            1B [Erythranthe guttata]
 gb|EYU31847.1| hypothetical protein MIMGU_mgv1a005149mg [Erythranthe guttata]
          Length = 495

 Score =  511 bits (1316), Expect = e-174
 Identities = 293/496 (59%), Positives = 332/496 (66%), Gaps = 13/496 (2%)
 Frame = +1

Query: 1    QQSIETLSHWCIFHMNKAKQVVETWDRQFHCAPREQRLAFLYLANDILQNSRRKGAEFVA 180
            QQSIETLSHWCIFHMNKAKQVVETWDRQF  APREQRLAFLYLANDILQNSRRKGAEFVA
Sbjct: 21   QQSIETLSHWCIFHMNKAKQVVETWDRQFRSAPREQRLAFLYLANDILQNSRRKGAEFVA 80

Query: 181  EFWKVLPDSLRDVIENGDEFGKNAALRLISIWEERKVFGSRGQILKEEFGKKQLDSDNRN 360
            EFWKVLPD+LR V+E+GDEFGKNAA RLISIWEERKVFGSRGQILKEEF K+QLD+ NRN
Sbjct: 81   EFWKVLPDALRHVVEHGDEFGKNAAFRLISIWEERKVFGSRGQILKEEFVKRQLDNGNRN 140

Query: 361  GKHA--GFKLRSPGGNALDKLISGFQAVYSSQLDEDAIFNKCRNAISFIEKVDKDVEGDY 534
             KHA    K+RSP GN LDKL+S +QAVYSSQLDED  FN+C++AI+FI+KVDK+ EGDY
Sbjct: 141  WKHAAPAPKMRSPVGNTLDKLVSAYQAVYSSQLDEDGTFNRCKSAITFIQKVDKNTEGDY 200

Query: 535  GLGHVNSSITDELKGQHAILRDCMEQLAVVETSRANLVSLLREALQDQVYKLEKVRDQLQ 714
             L      ITDELKGQH +LRDC+EQL+VVE+SRANLVSLLRE LQ+QVYKL++VRDQLQ
Sbjct: 201  RL----DGITDELKGQHTVLRDCIEQLSVVESSRANLVSLLREVLQEQVYKLDQVRDQLQ 256

Query: 715  AAQTHSEQASTICRQLLGSN-SDGQALT-ERSGNANIPTSQNPQNYIPGASEQSTSVTYT 888
            AAQTHSEQA  ICRQL+ SN ++GQ L+ E+S N              G SEQSTSV+YT
Sbjct: 257  AAQTHSEQADGICRQLIASNGNNGQVLSHEQSAN--------------GVSEQSTSVSYT 302

Query: 889  RQLPFAEKSSRTQEDPXXXXXXXXXXXXXXXXXXQMLTFVLSSLASEGVIGNSI--RDSS 1062
            R +P    ++  +E+P                  +MLT VLSSLASEGVIGN+I    SS
Sbjct: 303  RHVP----AAHVEENPKSAAAAVAAKLAASTSSAEMLTLVLSSLASEGVIGNAIIKESSS 358

Query: 1063 TEYPFEKKAKIENEH------XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1224
            +EYP EK+AKIENE                                              
Sbjct: 359  SEYPSEKRAKIENEQHTSYFPQNPQPPETNSDEAPPPPPSSPPPPQPPPPMQQQQQPYQV 418

Query: 1225 XXXXXXAMPIL-SGPYGYNXXXXXXXXXXXXXXXXXXXXXXXSTFAPPSNSYQSYQPESG 1401
                  AMPI+  GPYGY                        + F P SN+YQSY  E G
Sbjct: 419  PQYMQTAMPIIGGGPYGYT--NNVSQQQTAAQPGYAPVGGGVTAFDPSSNTYQSYPAEGG 476

Query: 1402 FYGQPSSLPMAPMSRQ 1449
             YGQPSSLPMAPMSRQ
Sbjct: 477  LYGQPSSLPMAPMSRQ 492


>emb|CBI21316.3| unnamed protein product, partial [Vitis vinifera]
          Length = 499

 Score =  487 bits (1254), Expect = e-165
 Identities = 262/486 (53%), Positives = 317/486 (65%), Gaps = 3/486 (0%)
 Frame = +1

Query: 1    QQSIETLSHWCIFHMNKAKQVVETWDRQFHCAPREQRLAFLYLANDILQNSRRKGAEFVA 180
            QQSIETLSHWCIFHMNKAKQVVETWDRQFHC+P EQRLAFLYLANDILQNSRRKG+EFV 
Sbjct: 21   QQSIETLSHWCIFHMNKAKQVVETWDRQFHCSPSEQRLAFLYLANDILQNSRRKGSEFVG 80

Query: 181  EFWKVLPDSLRDVIENGDEFGKNAALRLISIWEERKVFGSRGQILKEEFGKKQLDSDNRN 360
            EFWKVLPD+LRDV+ENGDEFG+NA LRLI IWEERKVFGSRGQILKEEFG +QL++ NRN
Sbjct: 81   EFWKVLPDALRDVMENGDEFGRNAVLRLIGIWEERKVFGSRGQILKEEFGGRQLENSNRN 140

Query: 361  GKHAGFKLRSPGGNALDKLISGFQAVYSSQLDEDAIFNKCRNAISFIEKVDKDVEGDYGL 540
            GKH GFKL+   GN L+K++SG+Q +Y  QLDED I  KC NAIS++EK DK ++G    
Sbjct: 141  GKHLGFKLKQSAGNTLEKIVSGYQVIYGGQLDEDVILRKCTNAISYVEKADKGIDGGINS 200

Query: 541  GHVN-SSITDELKGQHAILRDCMEQLAVVETSRANLVSLLREALQDQVYKLEKVRDQLQA 717
               N S   +EL+GQH ILRDC+EQL +VE+SRA+LVS LREALQ+Q +KL++VR+QLQA
Sbjct: 201  AQQNGSGFVEELQGQHTILRDCIEQLTLVESSRASLVSNLREALQEQEFKLDQVRNQLQA 260

Query: 718  AQTHSEQASTICRQLLGSNSDGQALTERSGNANIPTSQNPQNYIPGASEQSTSVTYTRQL 897
            AQ  +EQA  +CR+LL  N++ Q L E+S      T++   +++PG  EQS  V +TRQ+
Sbjct: 261  AQFQAEQAGNMCRRLLKCNNNTQLLAEQS-LKETRTTEALLSFVPGTVEQSAPVMFTRQV 319

Query: 898  PFAEKSSRTQEDP-XXXXXXXXXXXXXXXXXXQMLTFVLSSLASEGVIGNSIRDSSTEYP 1074
             F EK    +EDP                   QMLTFVLSSLASEGVIGN  ++SS +YP
Sbjct: 320  SFPEKPGHIEEDPRKSAAAAVAAKLTASTSSAQMLTFVLSSLASEGVIGNPTKESSGDYP 379

Query: 1075 FEKKAKIENEHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAMPI 1254
             EK+ K+EN+                                              A  +
Sbjct: 380  AEKRTKLENDQ------SAYTPQNPQPSQPPPPSSPPPLPPMPPMPPYQVPQYMQAAGSM 433

Query: 1255 LSGPYGYNXXXXXXXXXXXXXXXXXXXXXXXSTFAPPSNSYQSYQ-PESGFYGQPSSLPM 1431
             + PY Y                        S   PP+NSYQS+Q  E GFYGQPSSLPM
Sbjct: 434  TNIPYSYGMTQQQPPSAANYPTVGPPVSSISSFTTPPANSYQSFQGSEGGFYGQPSSLPM 493

Query: 1432 APMSRQ 1449
            AP+SR+
Sbjct: 494  APISRR 499


>ref|XP_022880268.1| UPF0400 protein C337.03-like [Olea europaea var. sylvestris]
 ref|XP_022880269.1| UPF0400 protein C337.03-like [Olea europaea var. sylvestris]
          Length = 513

 Score =  476 bits (1225), Expect = e-160
 Identities = 240/370 (64%), Positives = 283/370 (76%), Gaps = 1/370 (0%)
 Frame = +1

Query: 1    QQSIETLSHWCIFHMNKAKQVVETWDRQFHCAPREQRLAFLYLANDILQNSRRKGAEFVA 180
            QQSIETLSHWCIFHMN AKQVVETWDRQFHC+P EQRLAFLYLANDI+QNSRRKGAEFV 
Sbjct: 21   QQSIETLSHWCIFHMNNAKQVVETWDRQFHCSPHEQRLAFLYLANDIIQNSRRKGAEFVT 80

Query: 181  EFWKVLPDSLRDVIENGDEFGKNAALRLISIWEERKVFGSRGQILKEEFGKKQLDSDNRN 360
            EFWKVLPD+LRDVIE+GDEFGKNAA RLI IWEERKVFGSRGQ+LKEEF K QLD+DN+N
Sbjct: 81   EFWKVLPDALRDVIEHGDEFGKNAAFRLIGIWEERKVFGSRGQLLKEEFTKNQLDNDNKN 140

Query: 361  GKHAGFKLRSPGGNALDKLISGFQAVYSSQLDEDAIFNKCRNAISFIEKVDKDVEGDYGL 540
            GKH GFKL+   GN LDK++SG+Q VY  QL+EDA+ N C NAISFIEK++KD+  +   
Sbjct: 141  GKHVGFKLKPTAGNTLDKIVSGYQVVYGGQLNEDAVLNNCSNAISFIEKIEKDIGSENRS 200

Query: 541  GHVN-SSITDELKGQHAILRDCMEQLAVVETSRANLVSLLREALQDQVYKLEKVRDQLQA 717
            GH N S I DELKG+HAILRDC++QL VVE+SR  LVSLLREALQ+Q YKL++VR+QLQA
Sbjct: 201  GHPNQSGIMDELKGKHAILRDCIQQLTVVESSRTKLVSLLREALQEQEYKLDQVRNQLQA 260

Query: 718  AQTHSEQASTICRQLLGSNSDGQALTERSGNANIPTSQNPQNYIPGASEQSTSVTYTRQL 897
            AQ H+EQA  I +Q +    D  +L ++S      TSQ   +++ G  EQS  + YTRQ+
Sbjct: 261  AQAHTEQAGNIFQQEVDGKGDEPSLAKQS-RKETQTSQMSHDFMSGTREQSAPIMYTRQV 319

Query: 898  PFAEKSSRTQEDPXXXXXXXXXXXXXXXXXXQMLTFVLSSLASEGVIGNSIRDSSTEYPF 1077
            PF EKS  T EDP                  +MLT+VLSSLASEGVI N I++   +Y  
Sbjct: 320  PFTEKSGHT-EDPKSAAAAVAAKLTASTSSAEMLTYVLSSLASEGVISNQIKELPGDYSS 378

Query: 1078 EKKAKIENEH 1107
            EK+AKIEN+H
Sbjct: 379  EKRAKIENDH 388


>emb|CDP16715.1| unnamed protein product [Coffea canephora]
          Length = 513

 Score =  472 bits (1215), Expect = e-159
 Identities = 259/495 (52%), Positives = 318/495 (64%), Gaps = 12/495 (2%)
 Frame = +1

Query: 1    QQSIETLSHWCIFHMNKAKQVVETWDRQFHCAPREQRLAFLYLANDILQNSRRKGAEFVA 180
            QQSIETLSHWCIFHMNKAKQVVETWDRQFHC+PREQRLA+LYLANDILQNSRRKG+EFV 
Sbjct: 21   QQSIETLSHWCIFHMNKAKQVVETWDRQFHCSPREQRLAYLYLANDILQNSRRKGSEFVG 80

Query: 181  EFWKVLPDSLRDVIENGDEFGKNAALRLISIWEERKVFGSRGQILKEEFGKKQLDSDNRN 360
            EFWKVLPD+LRDVIENGDE G+ AA+RL+SIWEERKVFGSRGQILKEEF  +Q+D+ N N
Sbjct: 81   EFWKVLPDALRDVIENGDESGRTAAVRLVSIWEERKVFGSRGQILKEEFVGRQVDNVNGN 140

Query: 361  GKHAGFKL-RSPGGNALDKLISGFQAVYSSQLDEDAIFNKCRNAISFIEKVDKDVEGDYG 537
             KH G KL R   G+ALD+++S +Q VYSSQLDED++ NKCR+AI+ I+K DK++ GD  
Sbjct: 141  LKHTGSKLQRHAAGDALDRIVSSYQLVYSSQLDEDSVINKCRSAINCIQKADKEIGGDLR 200

Query: 538  LGHVN-SSITDELKGQHAILRDCMEQLAVVETSRANLVSLLREALQDQVYKLEKVRDQLQ 714
             G V+ S + DELKGQHA LRDC+  L  +E+SRANL++ LREALQ++ YKL++VR+QLQ
Sbjct: 201  SGPVDGSGVVDELKGQHATLRDCIGLLTSIESSRANLMAHLREALQEEEYKLDQVRNQLQ 260

Query: 715  AAQTHSEQASTICRQLLGSNSDGQALTERSGNANIPTSQNPQNYIPGASEQSTSVTYTRQ 894
            AA+ HSEQA   CRQLL  +++ Q L E+    N   SQ   ++  G+ EQS  V Y++Q
Sbjct: 261  AARVHSEQAGNKCRQLLSCDNNEQVLAEQDRQEN-QLSQGTHDFDSGSKEQSAPVMYSQQ 319

Query: 895  LPFAEKSSRTQEDPXXXXXXXXXXXXXXXXXXQMLTFVLSSLASEGVIGNSIRDSSTEYP 1074
            + F EKSS  +EDP                  QMLTFVLSSLASEGVIGN +++S ++YP
Sbjct: 320  VSFTEKSSHLEEDPKSAAAAVAAKLTASSSSAQMLTFVLSSLASEGVIGNPVKESPSDYP 379

Query: 1075 FEKKAKIENEHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAMP- 1251
             EK+ K+ENE                                                P 
Sbjct: 380  SEKRPKLENERSSYVPLQNSEAFQQNIPVFSQDATSNEQPPPPSSPPPLPPLPPMQPYPV 439

Query: 1252 ---------ILSGPYGYNXXXXXXXXXXXXXXXXXXXXXXXSTFAPPSNSYQSYQPESGF 1404
                     I + P+GY+                          AP +N+YQSY  E GF
Sbjct: 440  PQYMSSAGTIANVPFGYSTIQQQQVAVPGYSPTFPVNGVAPFAAAP-TNTYQSYPTEGGF 498

Query: 1405 YGQPSSLPMAPMSRQ 1449
            YGQ  SLPMAP+SRQ
Sbjct: 499  YGQQPSLPMAPVSRQ 513


>ref|XP_019183948.1| PREDICTED: regulation of nuclear pre-mRNA domain-containing protein
            1A [Ipomoea nil]
 ref|XP_019183953.1| PREDICTED: regulation of nuclear pre-mRNA domain-containing protein
            1A [Ipomoea nil]
          Length = 509

 Score =  471 bits (1211), Expect = e-158
 Identities = 240/373 (64%), Positives = 290/373 (77%), Gaps = 4/373 (1%)
 Frame = +1

Query: 1    QQSIETLSHWCIFHMNKAKQVVETWDRQFHCAPREQRLAFLYLANDILQNSRRKGAEFVA 180
            QQSIETLSHWCIFHMNKAKQVVETWDRQFHCAPREQRLAFLYLANDILQNSRRKG+EFV 
Sbjct: 21   QQSIETLSHWCIFHMNKAKQVVETWDRQFHCAPREQRLAFLYLANDILQNSRRKGSEFVG 80

Query: 181  EFWKVLPDSLRDVIENGDEFGKNAALRLISIWEERKVFGSRGQILKEEFGKKQLDSDNRN 360
            EFWKVLP +LR V++NGDEFG+NAALRLI IWEERKVFGSRGQILKEEF  K  D+ NRN
Sbjct: 81   EFWKVLPGALRKVVDNGDEFGRNAALRLIDIWEERKVFGSRGQILKEEFAGKHGDTSNRN 140

Query: 361  GKHAGFKLRSPGGNALDKLISGFQAVYSSQLDEDAIFNKCRNAISFIEKVDKDVEGDYGL 540
            GK +GF+LR   GN LDK++S +  +Y  QLDEDAI  +CRNAI+ +EK+DK++ GD   
Sbjct: 141  GKPSGFRLRPSAGNGLDKIVSSYHVLYGGQLDEDAILTRCRNAINSVEKIDKEIGGDLNS 200

Query: 541  GHVN-SSITDELKGQHAILRDCMEQLAVVETSRANLVSLLREALQDQVYKLEKVRDQLQA 717
            GH+N S    ELKG   +LR+C+EQL +VE+SRANLVS LREAL++Q YKLE VR +LQA
Sbjct: 201  GHLNGSGFAHELKGPQTMLRECIEQLIMVESSRANLVSQLREALREQEYKLELVRSELQA 260

Query: 718  AQTHSEQASTICRQLLGSNSDGQALTERSGNANIPTSQ-NPQNYIPGA-SEQSTSVTYTR 891
            AQ+HSEQAS IC+QL+  ++  Q L E  G  + PTSQ  P++++ G+  +QS  VTYTR
Sbjct: 261  AQSHSEQASNICKQLVNGDNAMQILGEH-GRKDGPTSQAPPRSFVSGSGGDQSAPVTYTR 319

Query: 892  QLPFAEKSSRTQEDPXXXXXXXXXXXXXXXXXXQMLTFVLSSLASEGVI-GNSIRDSSTE 1068
            Q+ F EKS R +EDP                  QMLT+VLSSLASEG+I GN +++S+T+
Sbjct: 320  QVSFGEKSGRLEEDPKSAAAAVAAKLTASTSSAQMLTYVLSSLASEGMIGGNQMKESATD 379

Query: 1069 YPFEKKAKIENEH 1107
            YP EK+AK+ENEH
Sbjct: 380  YPAEKRAKLENEH 392


>gb|KZV42059.1| ENTH/VHS family protein isoform 1 [Dorcoceras hygrometricum]
          Length = 519

 Score =  466 bits (1200), Expect = e-156
 Identities = 238/369 (64%), Positives = 286/369 (77%)
 Frame = +1

Query: 1    QQSIETLSHWCIFHMNKAKQVVETWDRQFHCAPREQRLAFLYLANDILQNSRRKGAEFVA 180
            QQSIETLSHWCIFHMNKAK+VVETWDRQFHCAPR+QRLAFLYLANDILQNSRRKG EFV 
Sbjct: 21   QQSIETLSHWCIFHMNKAKEVVETWDRQFHCAPRDQRLAFLYLANDILQNSRRKGVEFVV 80

Query: 181  EFWKVLPDSLRDVIENGDEFGKNAALRLISIWEERKVFGSRGQILKEEFGKKQLDSDNRN 360
            EFWKVLP SLRDVIENGDE G+NA+LRLI+IWEERKVFGSRG ILKEE  K+Q D DNRN
Sbjct: 81   EFWKVLPGSLRDVIENGDEVGRNASLRLINIWEERKVFGSRGHILKEEIVKRQPDYDNRN 140

Query: 361  GKHAGFKLRSPGGNALDKLISGFQAVYSSQLDEDAIFNKCRNAISFIEKVDKDVEGDYGL 540
             KHA +KL+  G + LDKL+SG+QAVY+SQ DEDAI  KCRNA+SF+EKVDK++E D   
Sbjct: 141  WKHAEYKLKPAGADTLDKLVSGYQAVYNSQQDEDAILYKCRNAVSFVEKVDKNIERDNRS 200

Query: 541  GHVNSSITDELKGQHAILRDCMEQLAVVETSRANLVSLLREALQDQVYKLEKVRDQLQAA 720
            G+VN SI DEL+GQ+A LRDC+EQL   E SR++LV+ LREALQ+Q YKL++VRDQLQAA
Sbjct: 201  GNVNGSIADELRGQNATLRDCIEQLQGFEASRSDLVAFLREALQEQEYKLDQVRDQLQAA 260

Query: 721  QTHSEQASTICRQLLGSNSDGQALTERSGNANIPTSQNPQNYIPGASEQSTSVTYTRQLP 900
            QT +EQA  ICRQLLGS+ +GQ L E++GN +   SQ PQN +PGA EQ+++ T+  Q  
Sbjct: 261  QTCTEQAGRICRQLLGSSGNGQVLPEQTGNRS-SASQVPQNLVPGAREQTSTTTHAHQSS 319

Query: 901  FAEKSSRTQEDPXXXXXXXXXXXXXXXXXXQMLTFVLSSLASEGVIGNSIRDSSTEYPFE 1080
              +KSS    DP                  Q+L++ ++SL   GVIGN  +D  ++YP E
Sbjct: 320  LVDKSSYV-GDPKSAAAAVAAQLTALTSSAQVLSYAIASL-EPGVIGNPPKDLPSDYPSE 377

Query: 1081 KKAKIENEH 1107
            K+AK EN+H
Sbjct: 378  KRAKFENDH 386


>ref|XP_010660722.1| PREDICTED: regulation of nuclear pre-mRNA domain-containing protein 2
            [Vitis vinifera]
 ref|XP_010660723.1| PREDICTED: regulation of nuclear pre-mRNA domain-containing protein 2
            [Vitis vinifera]
 ref|XP_019080869.1| PREDICTED: regulation of nuclear pre-mRNA domain-containing protein 2
            [Vitis vinifera]
          Length = 524

 Score =  463 bits (1191), Expect = e-155
 Identities = 233/370 (62%), Positives = 282/370 (76%), Gaps = 2/370 (0%)
 Frame = +1

Query: 1    QQSIETLSHWCIFHMNKAKQVVETWDRQFHCAPREQRLAFLYLANDILQNSRRKGAEFVA 180
            QQSIETLSHWCIFHMNKAKQVVETWDRQFHC+P EQRLAFLYLANDILQNSRRKG+EFV 
Sbjct: 21   QQSIETLSHWCIFHMNKAKQVVETWDRQFHCSPSEQRLAFLYLANDILQNSRRKGSEFVG 80

Query: 181  EFWKVLPDSLRDVIENGDEFGKNAALRLISIWEERKVFGSRGQILKEEFGKKQLDSDNRN 360
            EFWKVLPD+LRDV+ENGDEFG+NA LRLI IWEERKVFGSRGQILKEEFG +QL++ NRN
Sbjct: 81   EFWKVLPDALRDVMENGDEFGRNAVLRLIGIWEERKVFGSRGQILKEEFGGRQLENSNRN 140

Query: 361  GKHAGFKLRSPGGNALDKLISGFQAVYSSQLDEDAIFNKCRNAISFIEKVDKDVEGDYGL 540
            GKH GFKL+   GN L+K++SG+Q +Y  QLDED I  KC NAIS++EK DK ++G    
Sbjct: 141  GKHLGFKLKQSAGNTLEKIVSGYQVIYGGQLDEDVILRKCTNAISYVEKADKGIDGGINS 200

Query: 541  GHVN-SSITDELKGQHAILRDCMEQLAVVETSRANLVSLLREALQDQVYKLEKVRDQLQA 717
               N S   +EL+GQH ILRDC+EQL +VE+SRA+LVS LREALQ+Q +KL++VR+QLQA
Sbjct: 201  AQQNGSGFVEELQGQHTILRDCIEQLTLVESSRASLVSNLREALQEQEFKLDQVRNQLQA 260

Query: 718  AQTHSEQASTICRQLLGSNSDGQALTERSGNANIPTSQNPQNYIPGASEQSTSVTYTRQL 897
            AQ  +EQA  +CR+LL  N++ Q L E+S      T++   +++PG  EQS  V +TRQ+
Sbjct: 261  AQFQAEQAGNMCRRLLKCNNNTQLLAEQS-LKETRTTEALLSFVPGTVEQSAPVMFTRQV 319

Query: 898  PFAEKSSRTQEDP-XXXXXXXXXXXXXXXXXXQMLTFVLSSLASEGVIGNSIRDSSTEYP 1074
             F EK    +EDP                   QMLTFVLSSLASEGVIGN  ++SS +YP
Sbjct: 320  SFPEKPGHIEEDPRKSAAAAVAAKLTASTSSAQMLTFVLSSLASEGVIGNPTKESSGDYP 379

Query: 1075 FEKKAKIENE 1104
             EK+ K+EN+
Sbjct: 380  AEKRTKLEND 389


>ref|XP_022774225.1| UPF0400 protein C337.03-like isoform X2 [Durio zibethinus]
          Length = 524

 Score =  456 bits (1174), Expect = e-152
 Identities = 231/370 (62%), Positives = 283/370 (76%), Gaps = 2/370 (0%)
 Frame = +1

Query: 1    QQSIETLSHWCIFHMNKAKQVVETWDRQFHCAPREQRLAFLYLANDILQNSRRKGAEFVA 180
            Q SIETLSHWCIFHMNKAKQVVETWDRQFHC+PREQRLAFLYLANDILQNSRRKG+EFV 
Sbjct: 21   QASIETLSHWCIFHMNKAKQVVETWDRQFHCSPREQRLAFLYLANDILQNSRRKGSEFVG 80

Query: 181  EFWKVLPDSLRDVIENGDEFGKNAALRLISIWEERKVFGSRGQILKEEFGKKQLDSDNRN 360
            EFWKVLPD+LRDVIE+GDEFG+NAALRLISIWEERKVFGSRGQILKEE   +Q +++NRN
Sbjct: 81   EFWKVLPDALRDVIESGDEFGRNAALRLISIWEERKVFGSRGQILKEELVGRQSENNNRN 140

Query: 361  GKHAGFKLRSPGGNALDKLISGFQAVYSSQLDEDAIFNKCRNAISFIEKVDKDVEGDYGL 540
            G+H G KL+ P G+ +DK++SG+Q VY SQ+DED IF++CRNAI+ IEKVDK+   D   
Sbjct: 141  GRHIGAKLKQPVGSTVDKIVSGYQVVYGSQMDEDVIFSRCRNAINCIEKVDKETRTDVNS 200

Query: 541  GHVN-SSITDELKGQHAILRDCMEQLAVVETSRANLVSLLREALQDQVYKLEKVRDQLQA 717
            G  + S++ +E++G HA+LRDC+EQL +V +SRA+LVS LREALQ+Q  KLE+V  QLQA
Sbjct: 201  GQFHGSALVEEVQGHHAVLRDCIEQLTIVASSRASLVSHLREALQEQELKLEQVHTQLQA 260

Query: 718  AQTHSEQASTICRQLLGSNSDGQALTERSGNANIPTSQNPQNYIPGASEQSTSVTYTRQL 897
            AQ+ SEQA  ICRQLL  N D   L     +    TS  PQ++ PGA+EQS  V Y RQ+
Sbjct: 261  AQSQSEQAGNICRQLL--NCDNPQLVAEPSSKESHTSVAPQSFFPGAAEQSAPVMYARQV 318

Query: 898  PFAEKSSRTQEDP-XXXXXXXXXXXXXXXXXXQMLTFVLSSLASEGVIGNSIRDSSTEYP 1074
             F + S   +EDP                   +ML++VLSSLASEGVIGN +++SS +YP
Sbjct: 319  SFPDNSGHIEEDPRKSAAAAVAAKLTASTSSAEMLSYVLSSLASEGVIGNPMKESSGDYP 378

Query: 1075 FEKKAKIENE 1104
             EK+ K+EN+
Sbjct: 379  SEKRPKLEND 388


>gb|EOY02866.1| ENTH/VHS family protein isoform 2 [Theobroma cacao]
          Length = 525

 Score =  454 bits (1167), Expect = e-151
 Identities = 233/371 (62%), Positives = 286/371 (77%), Gaps = 3/371 (0%)
 Frame = +1

Query: 1    QQSIETLSHWCIFHMNKAKQVVETWDRQFHCAPREQRLAFLYLANDILQNSRRKGAEFVA 180
            Q SIETLSHWCIFHMNKAKQVVETWDRQFHC+PREQRLAFLYLANDILQNSRRKG+EFV 
Sbjct: 21   QASIETLSHWCIFHMNKAKQVVETWDRQFHCSPREQRLAFLYLANDILQNSRRKGSEFVG 80

Query: 181  EFWKVLPDSLRDVIENGDEFGKNAALRLISIWEERKVFGSRGQILKEEF-GKKQLDSDNR 357
            EFWKVLPD+LRDVIENGDEFG+NAA RLISIWEERKVFGSRGQILKEE  G++  ++ +R
Sbjct: 81   EFWKVLPDALRDVIENGDEFGRNAAFRLISIWEERKVFGSRGQILKEELVGRQSENNSSR 140

Query: 358  NGKHAGFKLRSPGGNALDKLISGFQAVYSSQLDEDAIFNKCRNAISFIEKVDKDVEGDYG 537
            NG+H G KL+ P G+ +DK++SG+Q VY SQ+DED I +KCRNA+S IEKVDK++  D  
Sbjct: 141  NGRHLGLKLKQPVGSTVDKIVSGYQVVYGSQMDEDVILSKCRNAMSCIEKVDKEIGSDVN 200

Query: 538  LGHVN-SSITDELKGQHAILRDCMEQLAVVETSRANLVSLLREALQDQVYKLEKVRDQLQ 714
             G  + S++ +E++GQHA+LRDC+EQL    +SRA+L+S LREALQ+Q +KLE+VR QLQ
Sbjct: 201  SGQFHGSALVEEVQGQHAVLRDCIEQLTAAASSRASLISHLREALQEQEFKLEQVRTQLQ 260

Query: 715  AAQTHSEQASTICRQLLGSNSDGQALTERSGNANIPTSQNPQNYIPGASEQSTSVTYTRQ 894
            +AQ+ SEQA  ICRQLL S  + Q L E S   +  TS  PQ++IP A+EQS  V Y+RQ
Sbjct: 261  SAQSQSEQAGNICRQLL-SCENPQLLAEESSKES-QTSIAPQSFIPAATEQSAPVMYSRQ 318

Query: 895  LPFAEKSSRTQEDP-XXXXXXXXXXXXXXXXXXQMLTFVLSSLASEGVIGNSIRDSSTEY 1071
            L F + S   +EDP                   QML++VLSSLASEGVIGN  ++SS +Y
Sbjct: 319  LSFPDNSGHIEEDPRKSAAAAVAAKLTASTSSAQMLSYVLSSLASEGVIGNPTKESSGDY 378

Query: 1072 PFEKKAKIENE 1104
            P EK+ K+EN+
Sbjct: 379  PSEKRPKLEND 389


>ref|XP_007031940.2| PREDICTED: regulation of nuclear pre-mRNA domain-containing protein 2
            isoform X2 [Theobroma cacao]
          Length = 525

 Score =  452 bits (1164), Expect = e-151
 Identities = 232/371 (62%), Positives = 286/371 (77%), Gaps = 3/371 (0%)
 Frame = +1

Query: 1    QQSIETLSHWCIFHMNKAKQVVETWDRQFHCAPREQRLAFLYLANDILQNSRRKGAEFVA 180
            Q SIETLSHWCIFHMNKAKQVVETWDRQFHC+PREQRLAFLYLANDILQNSRRKG+EFV 
Sbjct: 21   QASIETLSHWCIFHMNKAKQVVETWDRQFHCSPREQRLAFLYLANDILQNSRRKGSEFVG 80

Query: 181  EFWKVLPDSLRDVIENGDEFGKNAALRLISIWEERKVFGSRGQILKEEF-GKKQLDSDNR 357
            EFWKVLPD+LRDVIENGDEFG+NAA RLISIWEERKVFGSRGQILKEE  G++  ++ +R
Sbjct: 81   EFWKVLPDALRDVIENGDEFGRNAAFRLISIWEERKVFGSRGQILKEELVGRQSENNSSR 140

Query: 358  NGKHAGFKLRSPGGNALDKLISGFQAVYSSQLDEDAIFNKCRNAISFIEKVDKDVEGDYG 537
            NG+H G KL+ P G+ +DK++SG+Q VY SQ+DED I +KCRNA+S IEKVDK++  D  
Sbjct: 141  NGRHLGLKLKQPVGSTVDKIVSGYQVVYGSQMDEDVILSKCRNAMSCIEKVDKEIGSDVN 200

Query: 538  LGHVN-SSITDELKGQHAILRDCMEQLAVVETSRANLVSLLREALQDQVYKLEKVRDQLQ 714
             G  + S++ +E++GQHA+LRDC+EQL    +SRA+L+S LREALQ+Q +KLE+VR QLQ
Sbjct: 201  SGQFHGSALVEEVQGQHAVLRDCIEQLTAAASSRASLISHLREALQEQEFKLEQVRTQLQ 260

Query: 715  AAQTHSEQASTICRQLLGSNSDGQALTERSGNANIPTSQNPQNYIPGASEQSTSVTYTRQ 894
            +AQ+ SEQA  ICRQLL S  + Q + E S   +  TS  PQ++IP A+EQS  V Y+RQ
Sbjct: 261  SAQSQSEQAGNICRQLL-SCENPQLVAEESSKES-QTSIAPQSFIPAATEQSAPVMYSRQ 318

Query: 895  LPFAEKSSRTQEDP-XXXXXXXXXXXXXXXXXXQMLTFVLSSLASEGVIGNSIRDSSTEY 1071
            L F + S   +EDP                   QML++VLSSLASEGVIGN  ++SS +Y
Sbjct: 319  LSFPDNSGHIEEDPRKSAAAAVAAKLTASTSSAQMLSYVLSSLASEGVIGNPTKESSGDY 378

Query: 1072 PFEKKAKIENE 1104
            P EK+ K+EN+
Sbjct: 379  PSEKRPKLEND 389


>ref|XP_022774223.1| UPF0400 protein C337.03-like isoform X1 [Durio zibethinus]
 ref|XP_022774224.1| UPF0400 protein C337.03-like isoform X1 [Durio zibethinus]
          Length = 525

 Score =  452 bits (1162), Expect = e-151
 Identities = 231/371 (62%), Positives = 283/371 (76%), Gaps = 3/371 (0%)
 Frame = +1

Query: 1    QQSIETLSHWCIFHMNKAKQVVETWDRQFHCAPREQRLAFLYLANDILQNSRRKGAEFVA 180
            Q SIETLSHWCIFHMNKAKQVVETWDRQFHC+PREQRLAFLYLANDILQNSRRKG+EFV 
Sbjct: 21   QASIETLSHWCIFHMNKAKQVVETWDRQFHCSPREQRLAFLYLANDILQNSRRKGSEFVG 80

Query: 181  EFWKVLPDSLRDVIENGDEFGKNAALRLISIWEERKVFGSRGQILKEEFGKKQLDSDNRN 360
            EFWKVLPD+LRDVIE+GDEFG+NAALRLISIWEERKVFGSRGQILKEE   +Q +++NRN
Sbjct: 81   EFWKVLPDALRDVIESGDEFGRNAALRLISIWEERKVFGSRGQILKEELVGRQSENNNRN 140

Query: 361  GKHAGFKL-RSPGGNALDKLISGFQAVYSSQLDEDAIFNKCRNAISFIEKVDKDVEGDYG 537
            G+H G KL + P G+ +DK++SG+Q VY SQ+DED IF++CRNAI+ IEKVDK+   D  
Sbjct: 141  GRHIGAKLQKQPVGSTVDKIVSGYQVVYGSQMDEDVIFSRCRNAINCIEKVDKETRTDVN 200

Query: 538  LGHVN-SSITDELKGQHAILRDCMEQLAVVETSRANLVSLLREALQDQVYKLEKVRDQLQ 714
             G  + S++ +E++G HA+LRDC+EQL +V +SRA+LVS LREALQ+Q  KLE+V  QLQ
Sbjct: 201  SGQFHGSALVEEVQGHHAVLRDCIEQLTIVASSRASLVSHLREALQEQELKLEQVHTQLQ 260

Query: 715  AAQTHSEQASTICRQLLGSNSDGQALTERSGNANIPTSQNPQNYIPGASEQSTSVTYTRQ 894
            AAQ+ SEQA  ICRQLL  N D   L     +    TS  PQ++ PGA+EQS  V Y RQ
Sbjct: 261  AAQSQSEQAGNICRQLL--NCDNPQLVAEPSSKESHTSVAPQSFFPGAAEQSAPVMYARQ 318

Query: 895  LPFAEKSSRTQEDP-XXXXXXXXXXXXXXXXXXQMLTFVLSSLASEGVIGNSIRDSSTEY 1071
            + F + S   +EDP                   +ML++VLSSLASEGVIGN +++SS +Y
Sbjct: 319  VSFPDNSGHIEEDPRKSAAAAVAAKLTASTSSAEMLSYVLSSLASEGVIGNPMKESSGDY 378

Query: 1072 PFEKKAKIENE 1104
            P EK+ K+EN+
Sbjct: 379  PSEKRPKLEND 389


>ref|XP_021299053.1| UPF0400 protein C337.03 isoform X2 [Herrania umbratica]
          Length = 525

 Score =  450 bits (1157), Expect = e-150
 Identities = 231/371 (62%), Positives = 287/371 (77%), Gaps = 3/371 (0%)
 Frame = +1

Query: 1    QQSIETLSHWCIFHMNKAKQVVETWDRQFHCAPREQRLAFLYLANDILQNSRRKGAEFVA 180
            Q SIETLSHWCIFHMNKAKQVVETWDRQFHC+PREQRLAFLYLANDILQNSRRKG+EFV 
Sbjct: 21   QASIETLSHWCIFHMNKAKQVVETWDRQFHCSPREQRLAFLYLANDILQNSRRKGSEFVG 80

Query: 181  EFWKVLPDSLRDVIENGDEFGKNAALRLISIWEERKVFGSRGQILKEEF-GKKQLDSDNR 357
            EFWKVLPD+LRDVIE GDEFG+NAA RLISIWEERKVFGSRGQILKEE  G++  ++ +R
Sbjct: 81   EFWKVLPDALRDVIEIGDEFGRNAAFRLISIWEERKVFGSRGQILKEELVGRQSENNSSR 140

Query: 358  NGKHAGFKLRSPGGNALDKLISGFQAVYSSQLDEDAIFNKCRNAISFIEKVDKDVEGDYG 537
            NG+H G KL+ P G+ +DK++S +Q VY SQ+DED I +KCRNAIS IEKVDK++  D  
Sbjct: 141  NGRHVGLKLKQPVGSTVDKIVSAYQVVYGSQMDEDVILSKCRNAISCIEKVDKEIGTDIN 200

Query: 538  LGHVN-SSITDELKGQHAILRDCMEQLAVVETSRANLVSLLREALQDQVYKLEKVRDQLQ 714
             G  + S++ +E++GQHA+LRDC+EQL    +SRA+L+S LREALQ+Q +KL++VR QLQ
Sbjct: 201  SGQFHGSALVEEVQGQHAVLRDCIEQLTAAASSRASLISHLREALQEQEFKLKQVRTQLQ 260

Query: 715  AAQTHSEQASTICRQLLGSNSDGQALTERSGNANIPTSQNPQNYIPGASEQSTSVTYTRQ 894
            AAQ+ SEQA  ICRQLL S  + Q + E+S   ++ TS  PQ++IP A+EQS  V Y+RQ
Sbjct: 261  AAQSQSEQADNICRQLL-SCENPQLVAEQSSKESL-TSIAPQSFIPAATEQSAPVMYSRQ 318

Query: 895  LPFAEKSSRTQEDP-XXXXXXXXXXXXXXXXXXQMLTFVLSSLASEGVIGNSIRDSSTEY 1071
            + F + S  T+EDP                   QML++VLSSLASEGVIGN  ++SS +Y
Sbjct: 319  VSFPDNSGHTEEDPRKSAAAAVAAKLTASTSSAQMLSYVLSSLASEGVIGNPTKESSGDY 378

Query: 1072 PFEKKAKIENE 1104
            P EK+ K+EN+
Sbjct: 379  PSEKRPKLEND 389


>gb|EOY02865.1| ENTH/VHS family protein isoform 1 [Theobroma cacao]
          Length = 526

 Score =  449 bits (1155), Expect = e-150
 Identities = 233/372 (62%), Positives = 286/372 (76%), Gaps = 4/372 (1%)
 Frame = +1

Query: 1    QQSIETLSHWCIFHMNKAKQVVETWDRQFHCAPREQRLAFLYLANDILQNSRRKGAEFVA 180
            Q SIETLSHWCIFHMNKAKQVVETWDRQFHC+PREQRLAFLYLANDILQNSRRKG+EFV 
Sbjct: 21   QASIETLSHWCIFHMNKAKQVVETWDRQFHCSPREQRLAFLYLANDILQNSRRKGSEFVG 80

Query: 181  EFWKVLPDSLRDVIENGDEFGKNAALRLISIWEERKVFGSRGQILKEEF-GKKQLDSDNR 357
            EFWKVLPD+LRDVIENGDEFG+NAA RLISIWEERKVFGSRGQILKEE  G++  ++ +R
Sbjct: 81   EFWKVLPDALRDVIENGDEFGRNAAFRLISIWEERKVFGSRGQILKEELVGRQSENNSSR 140

Query: 358  NGKHAGFKL-RSPGGNALDKLISGFQAVYSSQLDEDAIFNKCRNAISFIEKVDKDVEGDY 534
            NG+H G KL + P G+ +DK++SG+Q VY SQ+DED I +KCRNA+S IEKVDK++  D 
Sbjct: 141  NGRHLGLKLQKQPVGSTVDKIVSGYQVVYGSQMDEDVILSKCRNAMSCIEKVDKEIGSDV 200

Query: 535  GLGHVN-SSITDELKGQHAILRDCMEQLAVVETSRANLVSLLREALQDQVYKLEKVRDQL 711
              G  + S++ +E++GQHA+LRDC+EQL    +SRA+L+S LREALQ+Q +KLE+VR QL
Sbjct: 201  NSGQFHGSALVEEVQGQHAVLRDCIEQLTAAASSRASLISHLREALQEQEFKLEQVRTQL 260

Query: 712  QAAQTHSEQASTICRQLLGSNSDGQALTERSGNANIPTSQNPQNYIPGASEQSTSVTYTR 891
            Q+AQ+ SEQA  ICRQLL S  + Q L E S   +  TS  PQ++IP A+EQS  V Y+R
Sbjct: 261  QSAQSQSEQAGNICRQLL-SCENPQLLAEESSKES-QTSIAPQSFIPAATEQSAPVMYSR 318

Query: 892  QLPFAEKSSRTQEDP-XXXXXXXXXXXXXXXXXXQMLTFVLSSLASEGVIGNSIRDSSTE 1068
            QL F + S   +EDP                   QML++VLSSLASEGVIGN  ++SS +
Sbjct: 319  QLSFPDNSGHIEEDPRKSAAAAVAAKLTASTSSAQMLSYVLSSLASEGVIGNPTKESSGD 378

Query: 1069 YPFEKKAKIENE 1104
            YP EK+ K+EN+
Sbjct: 379  YPSEKRPKLEND 390


>ref|XP_019264842.1| PREDICTED: regulation of nuclear pre-mRNA domain-containing protein
            1B [Nicotiana attenuata]
 gb|OIT36137.1| hypothetical protein A4A49_30938 [Nicotiana attenuata]
          Length = 528

 Score =  449 bits (1154), Expect = e-149
 Identities = 232/381 (60%), Positives = 286/381 (75%), Gaps = 12/381 (3%)
 Frame = +1

Query: 1    QQSIETLSHWCIFHMNKAKQVVETWDRQFHCAPREQRLAFLYLANDILQNSRRKGAEFVA 180
            QQSIETLSHWCIFHMNKAKQVVETW +QFHC+PREQRLAFLYLANDILQNSRRKG+EFV 
Sbjct: 21   QQSIETLSHWCIFHMNKAKQVVETWAQQFHCSPREQRLAFLYLANDILQNSRRKGSEFVG 80

Query: 181  EFWKVLPDSLRDVIENGDEFGKNAALRLISIWEERKVFGSRGQILKEEFGKKQLDSDNRN 360
            EFWKVLPD+LRDVIENG+EFG+NAALRLI+IWEERKVFGSRGQ+LKEEF  K +     N
Sbjct: 81   EFWKVLPDALRDVIENGNEFGRNAALRLITIWEERKVFGSRGQLLKEEFAGKHVG----N 136

Query: 361  GKHAGFKL-----------RSPGGNALDKLISGFQAVYSSQLDEDAIFNKCRNAISFIEK 507
            GKH+G K+           R+  G+ALDK++S +Q +Y  Q+DEDAI ++C+NAIS ++K
Sbjct: 137  GKHSGVKVLDRTFSSHSLRRNSTGDALDKIVSSYQMLYGGQIDEDAILSRCKNAISSVDK 196

Query: 508  VDKDVEGDYGLGHVN-SSITDELKGQHAILRDCMEQLAVVETSRANLVSLLREALQDQVY 684
            +DK++ GD   GH+N S I DELKGQH IL+DC+EQL  VE+SRANL+S LRE LQ+Q Y
Sbjct: 197  LDKEIGGDLNPGHLNGSGIADELKGQHTILKDCIEQLTTVESSRANLISHLREVLQEQEY 256

Query: 685  KLEKVRDQLQAAQTHSEQASTICRQLLGSNSDGQALTERSGNANIPTSQNPQNYIPGASE 864
            KL++VR+QLQAAQ+H++QA  IC+QLL  +++GQ L E++      TSQ  Q Y+ G  E
Sbjct: 257  KLDQVRNQLQAAQSHADQAGNICKQLLNCDANGQILAEQN-RKEASTSQAAQAYVAGNRE 315

Query: 865  QSTSVTYTRQLPFAEKSSRTQEDPXXXXXXXXXXXXXXXXXXQMLTFVLSSLASEGVIGN 1044
            QS  V YTRQ+ + EKS    ED                   QMLT+VLSSLASEGVIGN
Sbjct: 316  QSAPVMYTRQVSY-EKSGNLDEDMKSAAAAVAAKLTASTSSAQMLTYVLSSLASEGVIGN 374

Query: 1045 SIRDSSTEYPFEKKAKIENEH 1107
            S ++SS +   EK+ K+EN+H
Sbjct: 375  SSKESSHDNQSEKRVKLENDH 395


>ref|XP_007031939.2| PREDICTED: regulation of nuclear pre-mRNA domain-containing protein 2
            isoform X1 [Theobroma cacao]
          Length = 526

 Score =  448 bits (1152), Expect = e-149
 Identities = 232/372 (62%), Positives = 286/372 (76%), Gaps = 4/372 (1%)
 Frame = +1

Query: 1    QQSIETLSHWCIFHMNKAKQVVETWDRQFHCAPREQRLAFLYLANDILQNSRRKGAEFVA 180
            Q SIETLSHWCIFHMNKAKQVVETWDRQFHC+PREQRLAFLYLANDILQNSRRKG+EFV 
Sbjct: 21   QASIETLSHWCIFHMNKAKQVVETWDRQFHCSPREQRLAFLYLANDILQNSRRKGSEFVG 80

Query: 181  EFWKVLPDSLRDVIENGDEFGKNAALRLISIWEERKVFGSRGQILKEEF-GKKQLDSDNR 357
            EFWKVLPD+LRDVIENGDEFG+NAA RLISIWEERKVFGSRGQILKEE  G++  ++ +R
Sbjct: 81   EFWKVLPDALRDVIENGDEFGRNAAFRLISIWEERKVFGSRGQILKEELVGRQSENNSSR 140

Query: 358  NGKHAGFKL-RSPGGNALDKLISGFQAVYSSQLDEDAIFNKCRNAISFIEKVDKDVEGDY 534
            NG+H G KL + P G+ +DK++SG+Q VY SQ+DED I +KCRNA+S IEKVDK++  D 
Sbjct: 141  NGRHLGLKLQKQPVGSTVDKIVSGYQVVYGSQMDEDVILSKCRNAMSCIEKVDKEIGSDV 200

Query: 535  GLGHVN-SSITDELKGQHAILRDCMEQLAVVETSRANLVSLLREALQDQVYKLEKVRDQL 711
              G  + S++ +E++GQHA+LRDC+EQL    +SRA+L+S LREALQ+Q +KLE+VR QL
Sbjct: 201  NSGQFHGSALVEEVQGQHAVLRDCIEQLTAAASSRASLISHLREALQEQEFKLEQVRTQL 260

Query: 712  QAAQTHSEQASTICRQLLGSNSDGQALTERSGNANIPTSQNPQNYIPGASEQSTSVTYTR 891
            Q+AQ+ SEQA  ICRQLL S  + Q + E S   +  TS  PQ++IP A+EQS  V Y+R
Sbjct: 261  QSAQSQSEQAGNICRQLL-SCENPQLVAEESSKES-QTSIAPQSFIPAATEQSAPVMYSR 318

Query: 892  QLPFAEKSSRTQEDP-XXXXXXXXXXXXXXXXXXQMLTFVLSSLASEGVIGNSIRDSSTE 1068
            QL F + S   +EDP                   QML++VLSSLASEGVIGN  ++SS +
Sbjct: 319  QLSFPDNSGHIEEDPRKSAAAAVAAKLTASTSSAQMLSYVLSSLASEGVIGNPTKESSGD 378

Query: 1069 YPFEKKAKIENE 1104
            YP EK+ K+EN+
Sbjct: 379  YPSEKRPKLEND 390

