BLASTX nr result

ID: Paeonia23_contig00013757 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Paeonia23_contig00013757
         (1076 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007027070.1| RNA-binding family protein [Theobroma cacao]...   173   1e-40
ref|XP_007009396.1| RNA-binding family protein, putative isoform...   173   1e-40
ref|XP_002531375.1| RNA and export factor binding protein, putat...   172   2e-40
ref|XP_002284215.1| PREDICTED: RNA and export factor-binding pro...   171   4e-40
ref|XP_006846043.1| hypothetical protein AMTR_s00012p00035940 [A...   170   8e-40
ref|XP_002316299.1| hypothetical protein POPTR_0010s21510g [Popu...   169   2e-39
ref|XP_006488282.1| PREDICTED: THO complex subunit 4-like isofor...   166   2e-38
gb|EXB52242.1| RNA and export factor-binding protein 2 [Morus no...   165   3e-38
ref|XP_006424776.1| hypothetical protein CICLE_v10029160mg [Citr...   164   6e-38
emb|CAN83715.1| hypothetical protein VITISV_023787 [Vitis vinifera]   163   1e-37
ref|XP_006288186.1| hypothetical protein CARUB_v10001422mg, part...   162   2e-37
ref|XP_003538006.1| PREDICTED: THO complex subunit 4-like isofor...   162   2e-37
ref|XP_002873059.1| hypothetical protein ARALYDRAFT_487031 [Arab...   162   2e-37
ref|XP_006480779.1| PREDICTED: THO complex subunit 4-like isofor...   161   4e-37
ref|XP_006363386.1| PREDICTED: THO complex subunit 4-like [Solan...   161   4e-37
ref|XP_007220341.1| hypothetical protein PRUPE_ppa014850mg [Prun...   161   4e-37
ref|XP_006592204.1| PREDICTED: THO complex subunit 4-like isofor...   161   5e-37
ref|NP_974965.1| RNA recognition motif-containing protein [Arabi...   161   5e-37
ref|XP_007016490.1| RNA-binding family protein [Theobroma cacao]...   160   7e-37
emb|CBI31707.3| unnamed protein product [Vitis vinifera]              160   9e-37

>ref|XP_007027070.1| RNA-binding family protein [Theobroma cacao]
           gi|508715675|gb|EOY07572.1| RNA-binding family protein
           [Theobroma cacao]
          Length = 240

 Score =  173 bits (439), Expect = 1e-40
 Identities = 115/255 (45%), Positives = 130/255 (50%), Gaps = 10/255 (3%)
 Frame = +2

Query: 101 MSSGLDMSLDDLINKNNTSDGLTSNXXXXXXXXXXXXXXXXXXXXXXFANHVPIRNAPYL 280
           M++ LDMSLDDLI K+N   GLT                        F N    R APY 
Sbjct: 1   MTNPLDMSLDDLI-KSNRKSGLTRTRPPLNSGNGPSRR---------FPNRAANRTAPY- 49

Query: 281 THPPAQASGLMLQPQMIMAGG----------SVAESGTKLYVSNLDYGVSNEDIKVLFSE 430
              P QA     Q  M +  G          S  E+GTKLY+SNLDYGVSNEDIK LFSE
Sbjct: 50  -SKPVQAPDTTWQHDMFVDDGAGFPSAAGRASSIETGTKLYISNLDYGVSNEDIKELFSE 108

Query: 431 VGELERYSIHYDRSGRSKGTAEVVFLRQTDALAAIKRYNTVQLDGKPMQIELVGMNVVIP 610
           VG+++RYS+HYDRSGRSKGTAEVVF R+TDA  A KRYN VQLDGKPM+IE+VG NV  P
Sbjct: 109 VGDMKRYSVHYDRSGRSKGTAEVVFSRRTDAAVAFKRYNGVQLDGKPMKIEIVGTNVATP 168

Query: 611 AALPPNAXXXXXXXXXXXXXXXXXAFRSVQERXXXXXXXXXXXXXXXXXXXXXXXXXXXX 790
            ALPP                      S + R                            
Sbjct: 169 VALPPATNGKFANPNGVPRSGQGRGSFSGRSR-----------GPGRGARRGRGQGRGQG 217

Query: 791 EKISAEDLDADLEKY 835
           EK+SAEDLDADLE Y
Sbjct: 218 EKVSAEDLDADLENY 232


>ref|XP_007009396.1| RNA-binding family protein, putative isoform 2 [Theobroma cacao]
           gi|508726309|gb|EOY18206.1| RNA-binding family protein,
           putative isoform 2 [Theobroma cacao]
          Length = 242

 Score =  173 bits (438), Expect = 1e-40
 Identities = 113/251 (45%), Positives = 134/251 (53%), Gaps = 6/251 (2%)
 Frame = +2

Query: 101 MSSGLDMSLDDLINKNNTSDGLTSNXXXXXXXXXXXXXXXXXXXXXXFANHVPIRNAPYL 280
           MS  LDMSLD++I     S+G   +                         H P+R  PY 
Sbjct: 1   MSGSLDMSLDEIIRNRGRSEGHFRDSRRKPHGSGPGPGPDRRGP-----THDPLRTNPYP 55

Query: 281 THPPAQASGLMLQPQMIMAGGSVAESGTKLYVSNLDYGVSNEDIKVLFSEVGELERYSIH 460
             P   A+      Q++ +GGS  E+  KL +SNLDYGVSNED+KVLFSEVG+L+RYSI+
Sbjct: 56  VRPVPTAAAW--HGQLVSSGGSDMEA--KLCISNLDYGVSNEDVKVLFSEVGDLKRYSIN 111

Query: 461 YDRSGRSKGTAEVVFLRQTDALAAIKRYNTVQLDGKPMQIELVGMNVVIPAALPPNAXXX 640
           YDRSGRSKGTAEVVF RQTDALAAIKRYN VQLDGKPM IELVG NVV+ A +PP     
Sbjct: 112 YDRSGRSKGTAEVVFYRQTDALAAIKRYNNVQLDGKPMTIELVGANVVMSAPIPPT---- 167

Query: 641 XXXXXXXXXXXXXXAFRSVQER------XXXXXXXXXXXXXXXXXXXXXXXXXXXXEKIS 802
                         AFR  QE+                                  +K+S
Sbjct: 168 ----NSSIVRNPNVAFRRDQEKVGGSRWVHGGGNGPNGGGAGRGFARRRRQGGHVGQKLS 223

Query: 803 AEDLDADLEKY 835
           AEDLDADL+KY
Sbjct: 224 AEDLDADLDKY 234


>ref|XP_002531375.1| RNA and export factor binding protein, putative [Ricinus communis]
           gi|223529035|gb|EEF31023.1| RNA and export factor
           binding protein, putative [Ricinus communis]
          Length = 247

 Score =  172 bits (436), Expect = 2e-40
 Identities = 118/259 (45%), Positives = 131/259 (50%), Gaps = 12/259 (4%)
 Frame = +2

Query: 101 MSSGLDMSLDDLINKNNTSDGLTSNXXXXXXXXXXXXXXXXXXXXXXFANHVPIRNAPYL 280
           MSS LDMSLDD+I  N       S                       F N    R APY 
Sbjct: 1   MSSALDMSLDDIIKSNKKPGSGNSRGRGRSSGPGPTRR---------FNNRGTNRAAPYA 51

Query: 281 T---------HPPAQASGLMLQPQMIMAGGSVA---ESGTKLYVSNLDYGVSNEDIKVLF 424
                     H      G M   Q    GGS A   E+GTKLY+SNL+YGVSNEDIK LF
Sbjct: 52  AAKAPESTWQHDMFTDQGGMGMFQGQGGGGSRASGIETGTKLYISNLEYGVSNEDIKELF 111

Query: 425 SEVGELERYSIHYDRSGRSKGTAEVVFLRQTDALAAIKRYNTVQLDGKPMQIELVGMNVV 604
           SEVG+L+RYSIHYDRSGRSKGTAEVVF R+TDALAA+KRYN VQLDGKPM+IE+VG N+ 
Sbjct: 112 SEVGDLKRYSIHYDRSGRSKGTAEVVFSRRTDALAAVKRYNNVQLDGKPMKIEIVGTNIA 171

Query: 605 IPAALPPNAXXXXXXXXXXXXXXXXXAFRSVQERXXXXXXXXXXXXXXXXXXXXXXXXXX 784
            PAA P                    A R  Q R                          
Sbjct: 172 TPAATP---------AANGNFGNSNGAPRGGQGRGGTMRRPRGASSDGRGFGRGRGRGRG 222

Query: 785 XXEKISAEDLDADLEKYRA 841
             EK+SAEDLDADLEKY +
Sbjct: 223 RGEKVSAEDLDADLEKYHS 241


>ref|XP_002284215.1| PREDICTED: RNA and export factor-binding protein 2 [Vitis vinifera]
           gi|297746115|emb|CBI16171.3| unnamed protein product
           [Vitis vinifera]
          Length = 243

 Score =  171 bits (434), Expect = 4e-40
 Identities = 111/247 (44%), Positives = 127/247 (51%)
 Frame = +2

Query: 101 MSSGLDMSLDDLINKNNTSDGLTSNXXXXXXXXXXXXXXXXXXXXXXFANHVPIRNAPYL 280
           MS+ LDMSLDDLI KNN   G   N                            +  AP  
Sbjct: 1   MSNALDMSLDDLI-KNNKRSGGGGNARGRGRGSGPGPARRLPNRGANRITPYSVSKAPET 59

Query: 281 THPPAQASGLMLQPQMIMAGGSVAESGTKLYVSNLDYGVSNEDIKVLFSEVGELERYSIH 460
           T      +             S  E+GTKLY+SNLDYGVSNEDIK LFSEVG+L+RYSIH
Sbjct: 60  TWQHDMFADQAAAYPAQAGRTSAIETGTKLYISNLDYGVSNEDIKELFSEVGDLKRYSIH 119

Query: 461 YDRSGRSKGTAEVVFLRQTDALAAIKRYNTVQLDGKPMQIELVGMNVVIPAALPPNAXXX 640
           YDRSGRSKGTAEVVF R+ DA+AA+KRYN VQLDGKPM+IE+VG N+  PAA+PP     
Sbjct: 120 YDRSGRSKGTAEVVFSRRGDAVAAVKRYNNVQLDGKPMKIEIVGTNIATPAAVPP----- 174

Query: 641 XXXXXXXXXXXXXXAFRSVQERXXXXXXXXXXXXXXXXXXXXXXXXXXXXEKISAEDLDA 820
                           RS Q R                            EK+SAEDLDA
Sbjct: 175 ---VTNGTFGNSNGGLRSAQGR-VGSQGRPRGGSGGRGFGRGRGRGRGRGEKVSAEDLDA 230

Query: 821 DLEKYRA 841
           DLEKY +
Sbjct: 231 DLEKYHS 237


>ref|XP_006846043.1| hypothetical protein AMTR_s00012p00035940 [Amborella trichopoda]
           gi|548848813|gb|ERN07718.1| hypothetical protein
           AMTR_s00012p00035940 [Amborella trichopoda]
          Length = 253

 Score =  170 bits (431), Expect = 8e-40
 Identities = 117/260 (45%), Positives = 132/260 (50%), Gaps = 13/260 (5%)
 Frame = +2

Query: 101 MSSGLDMSLDDLINKNNTSDGLTSNXXXXXXXXXXXXXXXXXXXXXXFANHVPIRNAPYL 280
           MSS LDMSLDDLI  N  S G                            N    R APY 
Sbjct: 1   MSSALDMSLDDLIKNNKKSGG-----GGGGGVMRGRNRGGTSGPVRRIQNRTANRAAPYS 55

Query: 281 THPPAQAS-------GLMLQPQ----MIMAGGSVAESGTKLYVSNLDYGVSNEDIKVLFS 427
                QA+        L   P     +  A  S  E+GTKLY+SNLDYGVSNEDIK LFS
Sbjct: 56  MGKAFQAAPETTWQHDLFADPVGPYGVQAARPSAIETGTKLYISNLDYGVSNEDIKELFS 115

Query: 428 EVGELERYSIHYDRSGRSKGTAEVVFLRQTDALAAIKRYNTVQLDGKPMQIELVGMNVVI 607
           EVG+L+RYSIHYDRSGRSKGTAEVVF R+ DA+AA+KRYN VQLDGKPM+IE+VG N+V 
Sbjct: 116 EVGDLKRYSIHYDRSGRSKGTAEVVFSRRADAVAAVKRYNNVQLDGKPMKIEIVGTNLVT 175

Query: 608 PAALPPNAXXXXXXXXXXXXXXXXXAFRSVQER--XXXXXXXXXXXXXXXXXXXXXXXXX 781
           PA +P  A                   RSVQ R                           
Sbjct: 176 PAPIPQVANGLLATPNGIP--------RSVQPRGAGIIGRGPRGGGRGGGRGRGRGGRGG 227

Query: 782 XXXEKISAEDLDADLEKYRA 841
              EK+SA DLDADLEKY +
Sbjct: 228 GRGEKVSAADLDADLEKYHS 247


>ref|XP_002316299.1| hypothetical protein POPTR_0010s21510g [Populus trichocarpa]
           gi|222865339|gb|EEF02470.1| hypothetical protein
           POPTR_0010s21510g [Populus trichocarpa]
          Length = 250

 Score =  169 bits (428), Expect = 2e-39
 Identities = 119/260 (45%), Positives = 138/260 (53%), Gaps = 11/260 (4%)
 Frame = +2

Query: 101 MSSGLDMSLDDLINKNNTSDGLTSNXXXXXXXXXXXXXXXXXXXXXXFANHVPIRNAPYL 280
           MSS LDMSLDDLI K   + G  S+                      F    P R  PY 
Sbjct: 1   MSSLLDMSLDDLIRKGKENGGRDSDFRGSGRGAGSGSVLGPGPDRLVFRRD-PTRPKPYS 59

Query: 281 THPPAQASGLMLQPQMIMAG-GSVAESGTKLYVSNLDYGVSNEDIKVLFSEVGELERYSI 457
             P  Q   +  +P M+ A  GS  E+  KLY+SNLDYGVSNEDIKVLFSEVGEL RYS+
Sbjct: 60  VRP-VQVMQVQQEPLMLAASEGSNGEA--KLYISNLDYGVSNEDIKVLFSEVGELLRYSL 116

Query: 458 HYDRSGRSKGTAEVVFLRQTDALAAIKRYNTVQLDGKPMQIELVGMNVV--IPAALPPNA 631
           HYD SGRSKGTAEVVF RQTDALAAI+RYN VQLDGKP++IELVG+NV+  +P  +P  A
Sbjct: 117 HYDMSGRSKGTAEVVFSRQTDALAAIRRYNNVQLDGKPLKIELVGVNVITPVPVTVPVTA 176

Query: 632 XXXXXXXXXXXXXXXXXAFRSVQE------RXXXXXXXXXXXXXXXXXXXXXXXXXXXXE 793
                            A RSV E      R                            E
Sbjct: 177 --------ITNVANPNGAVRSVHERIGARGRGHGGGAGGRGGGSVQEFARGQGQVRRRVE 228

Query: 794 KISAEDLDADLEKY--RAMK 847
           K++AE LD+DL+KY   AMK
Sbjct: 229 KLTAEALDSDLDKYHFEAMK 248


>ref|XP_006488282.1| PREDICTED: THO complex subunit 4-like isoform X1 [Citrus sinensis]
          Length = 247

 Score =  166 bits (420), Expect = 2e-38
 Identities = 112/254 (44%), Positives = 130/254 (51%), Gaps = 9/254 (3%)
 Frame = +2

Query: 101 MSSGLDMSLDDLINKNNTSDGLTSNXXXXXXXXXXXXXXXXXXXXXXFANHVPIRNAPYL 280
           MSS LDMSLDD+I KNN   G  +                         N    R APY 
Sbjct: 1   MSSALDMSLDDII-KNNKKSGSGN--------FRGRGRGSGPGPARRIPNRGANRVAPYT 51

Query: 281 THPPAQ--------ASGLMLQPQMIMAGGSVAESGTKLYVSNLDYGVSNEDIKVLFSEVG 436
           T    +        A  +   P       S  E+GTKLY+SNLDYGVSNEDIK LFSEVG
Sbjct: 52  TAKAPETTWQHDMFADQVSAFPVQQAGRASAIETGTKLYISNLDYGVSNEDIKELFSEVG 111

Query: 437 ELERYSIHYDRSGRSKGTAEVVFLRQTDALAAIKRYNTVQLDGKPMQIELVGMNVVIPAA 616
           +L+RYSIHYDRSGRSKGTAEVV+ R+ DA+AA+KRYNTVQLDGKPM+IE+VG N+    A
Sbjct: 112 DLKRYSIHYDRSGRSKGTAEVVYSRRADAVAAVKRYNTVQLDGKPMKIEIVGTNIATRTA 171

Query: 617 LP-PNAXXXXXXXXXXXXXXXXXAFRSVQERXXXXXXXXXXXXXXXXXXXXXXXXXXXXE 793
            P  N                  AFR ++                              E
Sbjct: 172 APAANVNFGNSNGVPRGGQGRGGAFRRLR---------GGGGGGGRGFGRGRGRGRERNE 222

Query: 794 KISAEDLDADLEKY 835
           KISAEDLDADL+KY
Sbjct: 223 KISAEDLDADLDKY 236


>gb|EXB52242.1| RNA and export factor-binding protein 2 [Morus notabilis]
          Length = 248

 Score =  165 bits (417), Expect = 3e-38
 Identities = 113/260 (43%), Positives = 129/260 (49%), Gaps = 13/260 (5%)
 Frame = +2

Query: 101 MSSGLDMSLDDLINKNNTSDGLTSNXXXXXXXXXXXXXXXXXXXXXXFANHVPIRNAPYL 280
           MS  L MSLDD+I  N  +                              N  P R APY 
Sbjct: 1   MSDPLSMSLDDIIKTNKKTGS--------GNPRGRGRLSGGPGPARRVPNRAPNRAAPYA 52

Query: 281 THPPAQASGLMLQPQMIMAGG----------SVAESGTKLYVSNLDYGVSNEDIKVLFSE 430
             P A  +    Q  M M  G          S  ++GTKLY+SNL+YGVSNEDIK LFSE
Sbjct: 53  AAPKAPET--TWQHDMYMDQGTAFAAQAGRASAIQTGTKLYISNLEYGVSNEDIKELFSE 110

Query: 431 VGELERYSIHYDRSGRSKGTAEVVFLRQTDALAAIKRYNTVQLDGKPMQIELVGMNVVIP 610
           VG+L+RY+IHYDRSGRSKGTAEVVF R+ DA+AA+KRYN VQLDGKPM+IE+VG NV  P
Sbjct: 111 VGDLKRYAIHYDRSGRSKGTAEVVFSRRADAVAAVKRYNNVQLDGKPMKIEIVGTNVATP 170

Query: 611 AALPPNAXXXXXXXXXXXXXXXXXAFRSVQER---XXXXXXXXXXXXXXXXXXXXXXXXX 781
           AA PP A                   R  Q R                            
Sbjct: 171 AAPPPPANGSFGNSNGLP--------RGGQGRGGGAFGRPRGGGGGGRGPRRGRGRGQGR 222

Query: 782 XXXEKISAEDLDADLEKYRA 841
              EKISA+DLDADLEKY A
Sbjct: 223 GTGEKISADDLDADLEKYHA 242


>ref|XP_006424776.1| hypothetical protein CICLE_v10029160mg [Citrus clementina]
           gi|567864256|ref|XP_006424777.1| hypothetical protein
           CICLE_v10029160mg [Citrus clementina]
           gi|567864258|ref|XP_006424778.1| hypothetical protein
           CICLE_v10029160mg [Citrus clementina]
           gi|567864260|ref|XP_006424779.1| hypothetical protein
           CICLE_v10029160mg [Citrus clementina]
           gi|557526710|gb|ESR38016.1| hypothetical protein
           CICLE_v10029160mg [Citrus clementina]
           gi|557526711|gb|ESR38017.1| hypothetical protein
           CICLE_v10029160mg [Citrus clementina]
           gi|557526712|gb|ESR38018.1| hypothetical protein
           CICLE_v10029160mg [Citrus clementina]
           gi|557526713|gb|ESR38019.1| hypothetical protein
           CICLE_v10029160mg [Citrus clementina]
          Length = 247

 Score =  164 bits (415), Expect = 6e-38
 Identities = 111/254 (43%), Positives = 129/254 (50%), Gaps = 9/254 (3%)
 Frame = +2

Query: 101 MSSGLDMSLDDLINKNNTSDGLTSNXXXXXXXXXXXXXXXXXXXXXXFANHVPIRNAPYL 280
           MSS LDMSLDD+I KNN   G  +                         N    R APY 
Sbjct: 1   MSSALDMSLDDII-KNNKKSGSGN--------FRGRGRGSGPGPARRIPNRGANRVAPYT 51

Query: 281 THPPAQ--------ASGLMLQPQMIMAGGSVAESGTKLYVSNLDYGVSNEDIKVLFSEVG 436
           T    +        A  +   P       S  E+GTKLY+SNLDYGVSNEDIK LFSEVG
Sbjct: 52  TAKAPETTWQHDMFADQVSAFPVQQAGRASAIETGTKLYISNLDYGVSNEDIKELFSEVG 111

Query: 437 ELERYSIHYDRSGRSKGTAEVVFLRQTDALAAIKRYNTVQLDGKPMQIELVGMNVVIPAA 616
           +L+RYSIHYDRSGRSKGTAEVV+ R+ DA+AA+KRYN VQLDGKPM+IE+VG N+    A
Sbjct: 112 DLKRYSIHYDRSGRSKGTAEVVYSRRADAVAAVKRYNNVQLDGKPMKIEIVGTNIATRTA 171

Query: 617 LP-PNAXXXXXXXXXXXXXXXXXAFRSVQERXXXXXXXXXXXXXXXXXXXXXXXXXXXXE 793
            P  N                  AFR ++                              E
Sbjct: 172 APAANVNFGNSNGVPRGGQGRGGAFRRLR---------GGGGGGGRGFGRGRGRGRERNE 222

Query: 794 KISAEDLDADLEKY 835
           KISAEDLDADL+KY
Sbjct: 223 KISAEDLDADLDKY 236


>emb|CAN83715.1| hypothetical protein VITISV_023787 [Vitis vinifera]
          Length = 281

 Score =  163 bits (412), Expect = 1e-37
 Identities = 93/175 (53%), Positives = 107/175 (61%)
 Frame = +2

Query: 101 MSSGLDMSLDDLINKNNTSDGLTSNXXXXXXXXXXXXXXXXXXXXXXFANHVPIRNAPYL 280
           MS+ LDMSLDDLI KNN   G   N                            +  AP  
Sbjct: 1   MSNALDMSLDDLI-KNNKRSGGGGNARGRGRGSGPGPARRLPNRGANRITPYSVSKAPET 59

Query: 281 THPPAQASGLMLQPQMIMAGGSVAESGTKLYVSNLDYGVSNEDIKVLFSEVGELERYSIH 460
           T      +             S  E+GTKLY+SNLDYGVSNEDIK LFSEVG+L+RYSIH
Sbjct: 60  TWQHDMFADQAAAYPAQAGRTSAIETGTKLYISNLDYGVSNEDIKELFSEVGDLKRYSIH 119

Query: 461 YDRSGRSKGTAEVVFLRQTDALAAIKRYNTVQLDGKPMQIELVGMNVVIPAALPP 625
           YDRSGRSKGTAEVVF R+ DA+AA+KRYN VQLDGKPM+IE+VG N+  PAA+PP
Sbjct: 120 YDRSGRSKGTAEVVFSRRGDAVAAVKRYNNVQLDGKPMKIEIVGTNIATPAAVPP 174


>ref|XP_006288186.1| hypothetical protein CARUB_v10001422mg, partial [Capsella rubella]
           gi|482556892|gb|EOA21084.1| hypothetical protein
           CARUB_v10001422mg, partial [Capsella rubella]
          Length = 329

 Score =  162 bits (411), Expect = 2e-37
 Identities = 96/189 (50%), Positives = 108/189 (57%), Gaps = 18/189 (9%)
 Frame = +2

Query: 98  KMSSGLDMSLDDLINKNNTSDGLTSNXXXXXXXXXXXXXXXXXXXXXX--FANHVPIRNA 271
           KMS GLDMSLDD+I  N    G                            FAN V  R A
Sbjct: 41  KMSGGLDMSLDDIIKSNRKPTGSRGRGGVGGNSTGGRGGFGGSGSGPSRRFANRVGNRTA 100

Query: 272 PYLTHPPAQASGLMLQPQMI----------------MAGGSVAESGTKLYVSNLDYGVSN 403
           PY      QA   M Q  +                 + GGS  E+GTKLY+SNLDYGVSN
Sbjct: 101 PYSRPVQLQAQDAMWQNDVFATDASVAAAFGHQPAAVVGGSSIETGTKLYISNLDYGVSN 160

Query: 404 EDIKVLFSEVGELERYSIHYDRSGRSKGTAEVVFLRQTDALAAIKRYNTVQLDGKPMQIE 583
           EDIK LFSEVG+L+RY IHYDRSGRSKGTAEVVF R+ D LAA+KRYN VQLDGK M+IE
Sbjct: 161 EDIKELFSEVGDLKRYGIHYDRSGRSKGTAEVVFSRRGDGLAAVKRYNNVQLDGKLMKIE 220

Query: 584 LVGMNVVIP 610
           +VG N+  P
Sbjct: 221 IVGTNIPAP 229


>ref|XP_003538006.1| PREDICTED: THO complex subunit 4-like isoform X1 [Glycine max]
           gi|571488643|ref|XP_006590992.1| PREDICTED: THO complex
           subunit 4-like isoform X2 [Glycine max]
          Length = 247

 Score =  162 bits (411), Expect = 2e-37
 Identities = 108/258 (41%), Positives = 134/258 (51%), Gaps = 11/258 (4%)
 Frame = +2

Query: 101 MSSGLDMSLDDLINKNNTSDGLTSNXXXXXXXXXXXXXXXXXXXXXXFANHVPIRNAPYL 280
           MS+ +DMSLDD+I KNN   G  S+                      F N    R APY 
Sbjct: 1   MSAAMDMSLDDII-KNNKKSGSGSSRGRIRPSGSGPSRR--------FPNRAANRAAPYA 51

Query: 281 THPPAQASGL--MLQPQMIMAGGSVA--------ESGTKLYVSNLDYGVSNEDIKVLFSE 430
           T    +A+    +   Q + A G  A        E+GTKLY+SNLDYGVS++DIK LF+E
Sbjct: 52  TAKAPEATWQHDLYADQQVAAAGYPAQGGRAASIETGTKLYISNLDYGVSSDDIKELFAE 111

Query: 431 VGELERYSIHYDRSGRSKGTAEVVFLRQTDALAAIKRYNTVQLDGKPMQIELVGMNVVIP 610
           VG+L+R+++HYDRSGRSKGTAEVVF R+ DA+AA+KRYN VQLDGKPM+IE+VG N+  P
Sbjct: 112 VGDLKRHAVHYDRSGRSKGTAEVVFSRRADAVAAVKRYNNVQLDGKPMKIEIVGTNISTP 171

Query: 611 AALPP-NAXXXXXXXXXXXXXXXXXAFRSVQERXXXXXXXXXXXXXXXXXXXXXXXXXXX 787
              P  N                  A R    R                           
Sbjct: 172 GVAPARNGAIGNFDGVPRSGQGRGGALRRPGGR--------GQGVRRDRGRGRGRGGAGR 223

Query: 788 XEKISAEDLDADLEKYRA 841
            EK+SA+DLDADLEKY A
Sbjct: 224 GEKVSADDLDADLEKYHA 241


>ref|XP_002873059.1| hypothetical protein ARALYDRAFT_487031 [Arabidopsis lyrata subsp.
           lyrata] gi|297318896|gb|EFH49318.1| hypothetical protein
           ARALYDRAFT_487031 [Arabidopsis lyrata subsp. lyrata]
          Length = 290

 Score =  162 bits (410), Expect = 2e-37
 Identities = 96/189 (50%), Positives = 109/189 (57%), Gaps = 19/189 (10%)
 Frame = +2

Query: 101 MSSGLDMSLDDLINKNNTSDGLTSNXXXXXXXXXXXXXXXXXXXXXX--FANHVPIRNAP 274
           MS GLDMSLDD+I  N    G                            FAN V  R AP
Sbjct: 1   MSGGLDMSLDDIIKSNRKPTGSRGRGGIGGGNNTGGRGGSGSNSGPSRRFANRVGARTAP 60

Query: 275 YLTHPPAQASGLMLQPQM-----------------IMAGGSVAESGTKLYVSNLDYGVSN 403
           Y      QA   M Q  +                 ++ GGS  E+GTKLY+SNLDYGVSN
Sbjct: 61  YSRPIQQQAHDAMWQNDVFATDASVAAAFGHHQTAVVGGGSSIETGTKLYISNLDYGVSN 120

Query: 404 EDIKVLFSEVGELERYSIHYDRSGRSKGTAEVVFLRQTDALAAIKRYNTVQLDGKPMQIE 583
           EDIK LFSEVG+L+RY IHYDRSGRSKGTAEVVF R+ DALAA+KRYN VQLDGK M+IE
Sbjct: 121 EDIKELFSEVGDLKRYGIHYDRSGRSKGTAEVVFSRRGDALAAVKRYNNVQLDGKLMKIE 180

Query: 584 LVGMNVVIP 610
           +VG N+  P
Sbjct: 181 IVGTNLSAP 189


>ref|XP_006480779.1| PREDICTED: THO complex subunit 4-like isoform X1 [Citrus sinensis]
          Length = 287

 Score =  161 bits (408), Expect = 4e-37
 Identities = 112/273 (41%), Positives = 131/273 (47%), Gaps = 10/273 (3%)
 Frame = +2

Query: 62  SEALHTFSTCSPKMSSGLDMSLDDLINKNNTSDGLTSNXXXXXXXXXXXXXXXXXXXXXX 241
           S  LH+    S +M+S LDMSLDD+I  N  S                            
Sbjct: 19  SRNLHSPFRVSSRMTSALDMSLDDIIKSNKRSSS--------RWRGGGGRGSGLGPARHT 70

Query: 242 FANHVPIRNAPYLTHPPAQASGLMLQPQMIMAGGSVA--------ESGTKLYVSNLDYGV 397
           F   V  R APY    P QA        MI  G   A        ESGTKLY+SNL+YGV
Sbjct: 71  FKRSVN-RTAPY--SKPVQAPQATWPQNMIFNGAVAAAAARSSSIESGTKLYISNLEYGV 127

Query: 398 SNEDIKVLFSEVGELERYSIHYDRSGRSKGTAEVVFLRQTDALAAIKRYNTVQLDGKPMQ 577
           SNEDIK LFSEVGEL+R S+H+DRSGRSKGTAEVV  R+ DA+AA+KRYN VQLDGKPM+
Sbjct: 128 SNEDIKELFSEVGELKRCSVHFDRSGRSKGTAEVVLTRRADAIAAVKRYNNVQLDGKPMK 187

Query: 578 IELVGMNVVIPAALPP--NAXXXXXXXXXXXXXXXXXAFRSVQERXXXXXXXXXXXXXXX 751
           IE++G N+  PAA+ P  N                  A    + R               
Sbjct: 188 IEIIGTNIGPPAAVLPITNVMYGNEIGTSRSHLRMGGAIPIERPRHGRGGLGGRGGRGGA 247

Query: 752 XXXXXXXXXXXXXEKISAEDLDADLEKYRAMKK 850
                          +S EDLDADLEKY    K
Sbjct: 248 RGGRGRGRGQGAKPNLSVEDLDADLEKYHIEAK 280


>ref|XP_006363386.1| PREDICTED: THO complex subunit 4-like [Solanum tuberosum]
          Length = 254

 Score =  161 bits (408), Expect = 4e-37
 Identities = 107/251 (42%), Positives = 127/251 (50%), Gaps = 8/251 (3%)
 Frame = +2

Query: 107 SGLDMSLDDLINKNNTSDGLTSNXXXXXXXXXXXXXXXXXXXXXXFANHVPIRNAPYLTH 286
           + LDM+LDDLI KN T  G                          F N    R APY T 
Sbjct: 4   AALDMTLDDLIKKNKTGTG-----GKPRGRGCGAASTSSAGPSRRFPNRSANRAAPYSTA 58

Query: 287 PPAQAS---GLMLQPQMI---MAGG--SVAESGTKLYVSNLDYGVSNEDIKVLFSEVGEL 442
              +AS    +    Q +    AGG  S  E+GTKLY+SNLDYGVS EDIK LFSE+G+L
Sbjct: 59  KAPEASWNHDMFAADQAVAFGQAGGRASSIETGTKLYISNLDYGVSKEDIKELFSEIGDL 118

Query: 443 ERYSIHYDRSGRSKGTAEVVFLRQTDALAAIKRYNTVQLDGKPMQIELVGMNVVIPAALP 622
           +RY++HYDRSGRSKGT EVVF R+ D LA +KR+N VQLDGKPM+IE+VG N+V P A  
Sbjct: 119 KRYAVHYDRSGRSKGTTEVVFSRRQDTLAGVKRFNNVQLDGKPMKIEIVGTNIVTPTAPF 178

Query: 623 PNAXXXXXXXXXXXXXXXXXAFRSVQERXXXXXXXXXXXXXXXXXXXXXXXXXXXXEKIS 802
            N                  AF  V+                              EK+S
Sbjct: 179 SNG---AFGFGDTNGAPRRYAFGQVRGGGFGRSRGARGRGRGFRGGNRGWGRGGRGEKVS 235

Query: 803 AEDLDADLEKY 835
           AEDLDADL KY
Sbjct: 236 AEDLDADLMKY 246


>ref|XP_007220341.1| hypothetical protein PRUPE_ppa014850mg [Prunus persica]
           gi|462416803|gb|EMJ21540.1| hypothetical protein
           PRUPE_ppa014850mg [Prunus persica]
          Length = 259

 Score =  161 bits (408), Expect = 4e-37
 Identities = 84/175 (48%), Positives = 111/175 (63%)
 Frame = +2

Query: 101 MSSGLDMSLDDLINKNNTSDGLTSNXXXXXXXXXXXXXXXXXXXXXXFANHVPIRNAPYL 280
           M  GLDMSLD+LI K     G                            N   +R APY 
Sbjct: 1   MPRGLDMSLDELIAKRKKPGGYHG---YFRGRGRGRGRGYGPGPTRRLMNRNTVRTAPYS 57

Query: 281 THPPAQASGLMLQPQMIMAGGSVAESGTKLYVSNLDYGVSNEDIKVLFSEVGELERYSIH 460
             P  Q     ++ +M ++GG+  E GTKLY+SNLDY VSN DI++LFSE+G ++R+++H
Sbjct: 58  AQPIMQVVRTTVEQEMEVSGGTDTEEGTKLYLSNLDYDVSNSDIELLFSEIGHVKRHTVH 117

Query: 461 YDRSGRSKGTAEVVFLRQTDALAAIKRYNTVQLDGKPMQIELVGMNVVIPAALPP 625
           YDRSGRSKGTAEV+F+  +DALAAI++YN VQLDGKP++IELVG+N V P ++PP
Sbjct: 118 YDRSGRSKGTAEVIFVHHSDALAAIEKYNNVQLDGKPLKIELVGVNPVAPISVPP 172


>ref|XP_006592204.1| PREDICTED: THO complex subunit 4-like isoform X1 [Glycine max]
           gi|571492362|ref|XP_006592205.1| PREDICTED: THO complex
           subunit 4-like isoform X2 [Glycine max]
          Length = 247

 Score =  161 bits (407), Expect = 5e-37
 Identities = 107/258 (41%), Positives = 133/258 (51%), Gaps = 11/258 (4%)
 Frame = +2

Query: 101 MSSGLDMSLDDLINKNNTSDGLTSNXXXXXXXXXXXXXXXXXXXXXXFANHVPIRNAPYL 280
           MS+ +DMSLDD+I KNN   G  S+                        N    R APY 
Sbjct: 1   MSAAMDMSLDDII-KNNKKSGSGSSRGRTRPSGSGPTRR--------LPNRAANRAAPYA 51

Query: 281 THPPAQASGL--MLQPQMIMAGGSVA--------ESGTKLYVSNLDYGVSNEDIKVLFSE 430
                +A+    +   Q + A G  A        E+GTKLY+SNLDYGVSN+DIK LF+E
Sbjct: 52  PAKAPEATWQHDLYADQHVAAAGYPAQGGRAASIETGTKLYISNLDYGVSNDDIKELFAE 111

Query: 431 VGELERYSIHYDRSGRSKGTAEVVFLRQTDALAAIKRYNTVQLDGKPMQIELVGMNVVIP 610
           VG+L+R+++HYDRSGRSKGTAEVVF R+ DA++A+KRYN VQLDGKPM+IE+VG N+  P
Sbjct: 112 VGDLKRHAVHYDRSGRSKGTAEVVFSRRADAVSAVKRYNNVQLDGKPMKIEIVGTNISTP 171

Query: 611 AALP-PNAXXXXXXXXXXXXXXXXXAFRSVQERXXXXXXXXXXXXXXXXXXXXXXXXXXX 787
              P PN                  A R    R                           
Sbjct: 172 GVAPAPNGAIGNFNGVPRSGQGRGGALRRPGGR--------GQGIRRDRGRGRGRGGGGR 223

Query: 788 XEKISAEDLDADLEKYRA 841
            EK+SA+DLDADLEKY A
Sbjct: 224 GEKVSADDLDADLEKYHA 241


>ref|NP_974965.1| RNA recognition motif-containing protein [Arabidopsis thaliana]
           gi|332009873|gb|AED97256.1| RNA recognition
           motif-containing protein [Arabidopsis thaliana]
          Length = 242

 Score =  161 bits (407), Expect = 5e-37
 Identities = 108/254 (42%), Positives = 133/254 (52%), Gaps = 7/254 (2%)
 Frame = +2

Query: 101 MSSGLDMSLDDLINKNNTSDGLTSNXXXXXXXXXXXXXXXXXXXXXXFANHVPIRNAPYL 280
           MS+GLDMSLDD+I KN  S G                            N    R+APY 
Sbjct: 1   MSTGLDMSLDDMIAKNRKSRGGAGPARGTGSGSGPGPTRRNNP------NRKSTRSAPYQ 54

Query: 281 THPPAQASGLMLQPQMI--MAGGSVA--ESGTKLYVSNLDYGVSNEDIKVLFSEVGELER 448
           + P +     M   +     +G S A  E+GTKLY+SNLDYGV NEDIK LF+EVGEL+R
Sbjct: 55  SAPESTWGHDMFSDRSEDHRSGRSSAGIETGTKLYISNLDYGVMNEDIKELFAEVGELKR 114

Query: 449 YSIHYDRSGRSKGTAEVVFLRQTDALAAIKRYNTVQLDGKPMQIELVGMNV---VIPAAL 619
           Y++H+DRSGRSKGTAEVV+ R+ DALAA+K+YN VQLDGKPM+IE+VG N+     P+  
Sbjct: 115 YTVHFDRSGRSKGTAEVVYSRRGDALAAVKKYNDVQLDGKPMKIEIVGTNLQTAAAPSGR 174

Query: 620 PPNAXXXXXXXXXXXXXXXXXAFRSVQERXXXXXXXXXXXXXXXXXXXXXXXXXXXXEKI 799
           P N                   +R  Q R                            EKI
Sbjct: 175 PANG------------NSNGAPWRGGQGRGGQQRGGGRGGGGRGGGGRGRRPGKGPAEKI 222

Query: 800 SAEDLDADLEKYRA 841
           SAEDLDADL+KY +
Sbjct: 223 SAEDLDADLDKYHS 236


>ref|XP_007016490.1| RNA-binding family protein [Theobroma cacao]
           gi|508786853|gb|EOY34109.1| RNA-binding family protein
           [Theobroma cacao]
          Length = 241

 Score =  160 bits (406), Expect = 7e-37
 Identities = 111/256 (43%), Positives = 131/256 (51%), Gaps = 9/256 (3%)
 Frame = +2

Query: 101 MSSGLDMSLDDLINKNNTSDGLTSNXXXXXXXXXXXXXXXXXXXXXXFANHVPIRNAPYL 280
           MSS L+MSLDDLI +N  S    S                       F N    R+ PY 
Sbjct: 1   MSSALEMSLDDLIKRNRKSGSGNSRGRGRGSGPGPARR---------FPNRGANRSGPYT 51

Query: 281 T---------HPPAQASGLMLQPQMIMAGGSVAESGTKLYVSNLDYGVSNEDIKVLFSEV 433
                     H      G   Q Q   A  S  E+GTKLY+SNLDYGVSN+DIK LF+EV
Sbjct: 52  AAKAPETTWQHDMYSDKGAAFQGQAGRA--SAIETGTKLYISNLDYGVSNDDIKELFAEV 109

Query: 434 GELERYSIHYDRSGRSKGTAEVVFLRQTDALAAIKRYNTVQLDGKPMQIELVGMNVVIPA 613
           G+L+R++IHYDRSGRSKGTAEVVF R+TDA+AA+KRYN VQLDGKPM+IE+VG NV  P 
Sbjct: 110 GDLKRFTIHYDRSGRSKGTAEVVFSRRTDAMAAVKRYNNVQLDGKPMKIEIVGTNVATPG 169

Query: 614 ALPPNAXXXXXXXXXXXXXXXXXAFRSVQERXXXXXXXXXXXXXXXXXXXXXXXXXXXXE 793
           A  P+A                 A R    R                            E
Sbjct: 170 A--PSA-------GNGAFGNSNGAPRGGHGR-GGGFGKQRGGGGGRGFGRGRGRGKGRGE 219

Query: 794 KISAEDLDADLEKYRA 841
           K+SAEDLDA+LEKY +
Sbjct: 220 KVSAEDLDAELEKYHS 235


>emb|CBI31707.3| unnamed protein product [Vitis vinifera]
          Length = 160

 Score =  160 bits (405), Expect = 9e-37
 Identities = 87/166 (52%), Positives = 105/166 (63%)
 Frame = +2

Query: 101 MSSGLDMSLDDLINKNNTSDGLTSNXXXXXXXXXXXXXXXXXXXXXXFANHVPIRNAPYL 280
           M   LDMSLD++I     S G++SN                      F N   +R  PY 
Sbjct: 1   MPDPLDMSLDEIIRNKKKSAGVSSNVRGIGSGPGPGPARR-------FGNRELLRTTPYS 53

Query: 281 THPPAQASGLMLQPQMIMAGGSVAESGTKLYVSNLDYGVSNEDIKVLFSEVGELERYSIH 460
             P  Q      + ++   G S  E+GTKLY+SNL+YGVSN+DIK LFSEVGEL++YSIH
Sbjct: 54  VAPVFQVLEAAWKQEVFTGGVSTMETGTKLYISNLEYGVSNDDIKELFSEVGELKQYSIH 113

Query: 461 YDRSGRSKGTAEVVFLRQTDALAAIKRYNTVQLDGKPMQIELVGMN 598
           YD+SG SKGT EVVFLRQTDALAAIKRYN VQLDGKP +I+L+G N
Sbjct: 114 YDKSGISKGTGEVVFLRQTDALAAIKRYNNVQLDGKPQKIDLIGAN 159


Top