BLASTX nr result

ID: Paeonia22_contig00014301 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Paeonia22_contig00014301
         (1033 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007027070.1| RNA-binding family protein [Theobroma cacao]...   173   9e-41
ref|XP_007009396.1| RNA-binding family protein, putative isoform...   173   1e-40
ref|XP_002531375.1| RNA and export factor binding protein, putat...   172   2e-40
ref|XP_002284215.1| PREDICTED: RNA and export factor-binding pro...   171   3e-40
ref|XP_006846043.1| hypothetical protein AMTR_s00012p00035940 [A...   170   8e-40
ref|XP_002316299.1| hypothetical protein POPTR_0010s21510g [Popu...   169   2e-39
ref|XP_006488282.1| PREDICTED: THO complex subunit 4-like isofor...   166   1e-38
gb|EXB52242.1| RNA and export factor-binding protein 2 [Morus no...   165   3e-38
ref|XP_006424776.1| hypothetical protein CICLE_v10029160mg [Citr...   164   6e-38
emb|CAN83715.1| hypothetical protein VITISV_023787 [Vitis vinifera]   163   1e-37
ref|XP_006288186.1| hypothetical protein CARUB_v10001422mg, part...   162   2e-37
ref|XP_003538006.1| PREDICTED: THO complex subunit 4-like isofor...   162   2e-37
ref|XP_002873059.1| hypothetical protein ARALYDRAFT_487031 [Arab...   162   2e-37
ref|XP_006480779.1| PREDICTED: THO complex subunit 4-like isofor...   161   4e-37
ref|XP_006363386.1| PREDICTED: THO complex subunit 4-like [Solan...   161   4e-37
ref|XP_007220341.1| hypothetical protein PRUPE_ppa014850mg [Prun...   161   4e-37
ref|XP_006592204.1| PREDICTED: THO complex subunit 4-like isofor...   161   5e-37
ref|NP_974965.1| RNA recognition motif-containing protein [Arabi...   161   5e-37
ref|XP_007016490.1| RNA-binding family protein [Theobroma cacao]...   160   6e-37
emb|CBI31707.3| unnamed protein product [Vitis vinifera]              160   8e-37

>ref|XP_007027070.1| RNA-binding family protein [Theobroma cacao]
           gi|508715675|gb|EOY07572.1| RNA-binding family protein
           [Theobroma cacao]
          Length = 240

 Score =  173 bits (439), Expect = 9e-41
 Identities = 115/255 (45%), Positives = 130/255 (50%), Gaps = 10/255 (3%)
 Frame = -2

Query: 924 MSSGLDMSLDDLINKNNTSDGLTSNXXXXXXXXXXXXXXXXXXXXXRFANHVPIRNAPYL 745
           M++ LDMSLDDLI K+N   GLT                        F N    R APY 
Sbjct: 1   MTNPLDMSLDDLI-KSNRKSGLTRTRPPLNSGNGPSRR---------FPNRAANRTAPY- 49

Query: 744 THPPAQASGLMLQPQMIMAGG----------SVAESGTKLYVSNLDYGVSNEDIKVLFSE 595
              P QA     Q  M +  G          S  E+GTKLY+SNLDYGVSNEDIK LFSE
Sbjct: 50  -SKPVQAPDTTWQHDMFVDDGAGFPSAAGRASSIETGTKLYISNLDYGVSNEDIKELFSE 108

Query: 594 VGELERYSIHYDRSGRSKGTAEVVFLRQTDALAAIKRYNTVQLDGKPMQIELVGMNVVIP 415
           VG+++RYS+HYDRSGRSKGTAEVVF R+TDA  A KRYN VQLDGKPM+IE+VG NV  P
Sbjct: 109 VGDMKRYSVHYDRSGRSKGTAEVVFSRRTDAAVAFKRYNGVQLDGKPMKIEIVGTNVATP 168

Query: 414 AALPPNAXXXXXXXXXXXXXXXXGAFRSVQERXXXXXXXXXXXXXXXXXXXXXXXXXXXD 235
            ALPP                      S + R                            
Sbjct: 169 VALPPATNGKFANPNGVPRSGQGRGSFSGRSR-----------GPGRGARRGRGQGRGQG 217

Query: 234 EKISAEDLDADLEKY 190
           EK+SAEDLDADLE Y
Sbjct: 218 EKVSAEDLDADLENY 232


>ref|XP_007009396.1| RNA-binding family protein, putative isoform 2 [Theobroma cacao]
           gi|508726309|gb|EOY18206.1| RNA-binding family protein,
           putative isoform 2 [Theobroma cacao]
          Length = 242

 Score =  173 bits (438), Expect = 1e-40
 Identities = 113/251 (45%), Positives = 134/251 (53%), Gaps = 6/251 (2%)
 Frame = -2

Query: 924 MSSGLDMSLDDLINKNNTSDGLTSNXXXXXXXXXXXXXXXXXXXXXRFANHVPIRNAPYL 745
           MS  LDMSLD++I     S+G   +                         H P+R  PY 
Sbjct: 1   MSGSLDMSLDEIIRNRGRSEGHFRDSRRKPHGSGPGPGPDRRGP-----THDPLRTNPYP 55

Query: 744 THPPAQASGLMLQPQMIMAGGSVAESGTKLYVSNLDYGVSNEDIKVLFSEVGELERYSIH 565
             P   A+      Q++ +GGS  E+  KL +SNLDYGVSNED+KVLFSEVG+L+RYSI+
Sbjct: 56  VRPVPTAAAW--HGQLVSSGGSDMEA--KLCISNLDYGVSNEDVKVLFSEVGDLKRYSIN 111

Query: 564 YDRSGRSKGTAEVVFLRQTDALAAIKRYNTVQLDGKPMQIELVGMNVVIPAALPPNAXXX 385
           YDRSGRSKGTAEVVF RQTDALAAIKRYN VQLDGKPM IELVG NVV+ A +PP     
Sbjct: 112 YDRSGRSKGTAEVVFYRQTDALAAIKRYNNVQLDGKPMTIELVGANVVMSAPIPPT---- 167

Query: 384 XXXXXXXXXXXXXGAFRSVQER------XXXXXXXXXXXXXXXXXXXXXXXXXXXDEKIS 223
                         AFR  QE+                                  +K+S
Sbjct: 168 ----NSSIVRNPNVAFRRDQEKVGGSRWVHGGGNGPNGGGAGRGFARRRRQGGHVGQKLS 223

Query: 222 AEDLDADLEKY 190
           AEDLDADL+KY
Sbjct: 224 AEDLDADLDKY 234


>ref|XP_002531375.1| RNA and export factor binding protein, putative [Ricinus communis]
           gi|223529035|gb|EEF31023.1| RNA and export factor
           binding protein, putative [Ricinus communis]
          Length = 247

 Score =  172 bits (436), Expect = 2e-40
 Identities = 119/259 (45%), Positives = 132/259 (50%), Gaps = 12/259 (4%)
 Frame = -2

Query: 924 MSSGLDMSLDDLINKNNTSDGLTSNXXXXXXXXXXXXXXXXXXXXXRFANHVPIRNAPYL 745
           MSS LDMSLDD+I  N       S                       F N    R APY 
Sbjct: 1   MSSALDMSLDDIIKSNKKPGSGNSRGRGRSSGPGPTRR---------FNNRGTNRAAPYA 51

Query: 744 T---------HPPAQASGLMLQPQMIMAGGSVA---ESGTKLYVSNLDYGVSNEDIKVLF 601
                     H      G M   Q    GGS A   E+GTKLY+SNL+YGVSNEDIK LF
Sbjct: 52  AAKAPESTWQHDMFTDQGGMGMFQGQGGGGSRASGIETGTKLYISNLEYGVSNEDIKELF 111

Query: 600 SEVGELERYSIHYDRSGRSKGTAEVVFLRQTDALAAIKRYNTVQLDGKPMQIELVGMNVV 421
           SEVG+L+RYSIHYDRSGRSKGTAEVVF R+TDALAA+KRYN VQLDGKPM+IE+VG N+ 
Sbjct: 112 SEVGDLKRYSIHYDRSGRSKGTAEVVFSRRTDALAAVKRYNNVQLDGKPMKIEIVGTNIA 171

Query: 420 IPAALPPNAXXXXXXXXXXXXXXXXGAFRSVQERXXXXXXXXXXXXXXXXXXXXXXXXXX 241
            PAA P                   GA R  Q R                          
Sbjct: 172 TPAATP---------AANGNFGNSNGAPRGGQGRGGTMRRPRGASSDGRGFGRGRGRGRG 222

Query: 240 XDEKISAEDLDADLEKYRA 184
             EK+SAEDLDADLEKY +
Sbjct: 223 RGEKVSAEDLDADLEKYHS 241


>ref|XP_002284215.1| PREDICTED: RNA and export factor-binding protein 2 [Vitis vinifera]
           gi|297746115|emb|CBI16171.3| unnamed protein product
           [Vitis vinifera]
          Length = 243

 Score =  171 bits (434), Expect = 3e-40
 Identities = 112/247 (45%), Positives = 128/247 (51%)
 Frame = -2

Query: 924 MSSGLDMSLDDLINKNNTSDGLTSNXXXXXXXXXXXXXXXXXXXXXRFANHVPIRNAPYL 745
           MS+ LDMSLDDLI KNN   G   N                            +  AP  
Sbjct: 1   MSNALDMSLDDLI-KNNKRSGGGGNARGRGRGSGPGPARRLPNRGANRITPYSVSKAPET 59

Query: 744 THPPAQASGLMLQPQMIMAGGSVAESGTKLYVSNLDYGVSNEDIKVLFSEVGELERYSIH 565
           T      +             S  E+GTKLY+SNLDYGVSNEDIK LFSEVG+L+RYSIH
Sbjct: 60  TWQHDMFADQAAAYPAQAGRTSAIETGTKLYISNLDYGVSNEDIKELFSEVGDLKRYSIH 119

Query: 564 YDRSGRSKGTAEVVFLRQTDALAAIKRYNTVQLDGKPMQIELVGMNVVIPAALPPNAXXX 385
           YDRSGRSKGTAEVVF R+ DA+AA+KRYN VQLDGKPM+IE+VG N+  PAA+PP     
Sbjct: 120 YDRSGRSKGTAEVVFSRRGDAVAAVKRYNNVQLDGKPMKIEIVGTNIATPAAVPP----- 174

Query: 384 XXXXXXXXXXXXXGAFRSVQERXXXXXXXXXXXXXXXXXXXXXXXXXXXDEKISAEDLDA 205
                        G  RS Q R                            EK+SAEDLDA
Sbjct: 175 ---VTNGTFGNSNGGLRSAQGR-VGSQGRPRGGSGGRGFGRGRGRGRGRGEKVSAEDLDA 230

Query: 204 DLEKYRA 184
           DLEKY +
Sbjct: 231 DLEKYHS 237


>ref|XP_006846043.1| hypothetical protein AMTR_s00012p00035940 [Amborella trichopoda]
           gi|548848813|gb|ERN07718.1| hypothetical protein
           AMTR_s00012p00035940 [Amborella trichopoda]
          Length = 253

 Score =  170 bits (431), Expect = 8e-40
 Identities = 118/260 (45%), Positives = 133/260 (51%), Gaps = 13/260 (5%)
 Frame = -2

Query: 924 MSSGLDMSLDDLINKNNTSDGLTSNXXXXXXXXXXXXXXXXXXXXXRFANHVPIRNAPYL 745
           MSS LDMSLDDLI  N  S G                         R  N    R APY 
Sbjct: 1   MSSALDMSLDDLIKNNKKSGG-----GGGGGVMRGRNRGGTSGPVRRIQNRTANRAAPYS 55

Query: 744 THPPAQAS-------GLMLQPQ----MIMAGGSVAESGTKLYVSNLDYGVSNEDIKVLFS 598
                QA+        L   P     +  A  S  E+GTKLY+SNLDYGVSNEDIK LFS
Sbjct: 56  MGKAFQAAPETTWQHDLFADPVGPYGVQAARPSAIETGTKLYISNLDYGVSNEDIKELFS 115

Query: 597 EVGELERYSIHYDRSGRSKGTAEVVFLRQTDALAAIKRYNTVQLDGKPMQIELVGMNVVI 418
           EVG+L+RYSIHYDRSGRSKGTAEVVF R+ DA+AA+KRYN VQLDGKPM+IE+VG N+V 
Sbjct: 116 EVGDLKRYSIHYDRSGRSKGTAEVVFSRRADAVAAVKRYNNVQLDGKPMKIEIVGTNLVT 175

Query: 417 PAALPPNAXXXXXXXXXXXXXXXXGAFRSVQER--XXXXXXXXXXXXXXXXXXXXXXXXX 244
           PA +P  A                   RSVQ R                           
Sbjct: 176 PAPIPQVANGLLATPNGIP--------RSVQPRGAGIIGRGPRGGGRGGGRGRGRGGRGG 227

Query: 243 XXDEKISAEDLDADLEKYRA 184
              EK+SA DLDADLEKY +
Sbjct: 228 GRGEKVSAADLDADLEKYHS 247


>ref|XP_002316299.1| hypothetical protein POPTR_0010s21510g [Populus trichocarpa]
           gi|222865339|gb|EEF02470.1| hypothetical protein
           POPTR_0010s21510g [Populus trichocarpa]
          Length = 250

 Score =  169 bits (428), Expect = 2e-39
 Identities = 120/260 (46%), Positives = 139/260 (53%), Gaps = 11/260 (4%)
 Frame = -2

Query: 924 MSSGLDMSLDDLINKNNTSDGLTSNXXXXXXXXXXXXXXXXXXXXXRFANHVPIRNAPYL 745
           MSS LDMSLDDLI K   + G  S+                      F    P R  PY 
Sbjct: 1   MSSLLDMSLDDLIRKGKENGGRDSDFRGSGRGAGSGSVLGPGPDRLVFRRD-PTRPKPYS 59

Query: 744 THPPAQASGLMLQPQMIMAG-GSVAESGTKLYVSNLDYGVSNEDIKVLFSEVGELERYSI 568
             P  Q   +  +P M+ A  GS  E+  KLY+SNLDYGVSNEDIKVLFSEVGEL RYS+
Sbjct: 60  VRP-VQVMQVQQEPLMLAASEGSNGEA--KLYISNLDYGVSNEDIKVLFSEVGELLRYSL 116

Query: 567 HYDRSGRSKGTAEVVFLRQTDALAAIKRYNTVQLDGKPMQIELVGMNVV--IPAALPPNA 394
           HYD SGRSKGTAEVVF RQTDALAAI+RYN VQLDGKP++IELVG+NV+  +P  +P  A
Sbjct: 117 HYDMSGRSKGTAEVVFSRQTDALAAIRRYNNVQLDGKPLKIELVGVNVITPVPVTVPVTA 176

Query: 393 XXXXXXXXXXXXXXXXGAFRSVQE------RXXXXXXXXXXXXXXXXXXXXXXXXXXXDE 232
                           GA RSV E      R                            E
Sbjct: 177 --------ITNVANPNGAVRSVHERIGARGRGHGGGAGGRGGGSVQEFARGQGQVRRRVE 228

Query: 231 KISAEDLDADLEKY--RAMK 178
           K++AE LD+DL+KY   AMK
Sbjct: 229 KLTAEALDSDLDKYHFEAMK 248


>ref|XP_006488282.1| PREDICTED: THO complex subunit 4-like isoform X1 [Citrus sinensis]
          Length = 247

 Score =  166 bits (420), Expect = 1e-38
 Identities = 114/254 (44%), Positives = 133/254 (52%), Gaps = 9/254 (3%)
 Frame = -2

Query: 924 MSSGLDMSLDDLINKNNTSDGLTSNXXXXXXXXXXXXXXXXXXXXXRFANHVPIRNAPYL 745
           MSS LDMSLDD+I KNN   G  +                      R  N    R APY 
Sbjct: 1   MSSALDMSLDDII-KNNKKSGSGN--------FRGRGRGSGPGPARRIPNRGANRVAPYT 51

Query: 744 THPPAQ--------ASGLMLQPQMIMAGGSVAESGTKLYVSNLDYGVSNEDIKVLFSEVG 589
           T    +        A  +   P       S  E+GTKLY+SNLDYGVSNEDIK LFSEVG
Sbjct: 52  TAKAPETTWQHDMFADQVSAFPVQQAGRASAIETGTKLYISNLDYGVSNEDIKELFSEVG 111

Query: 588 ELERYSIHYDRSGRSKGTAEVVFLRQTDALAAIKRYNTVQLDGKPMQIELVGMNVVIPAA 409
           +L+RYSIHYDRSGRSKGTAEVV+ R+ DA+AA+KRYNTVQLDGKPM+IE+VG N+    A
Sbjct: 112 DLKRYSIHYDRSGRSKGTAEVVYSRRADAVAAVKRYNTVQLDGKPMKIEIVGTNIATRTA 171

Query: 408 LP-PNAXXXXXXXXXXXXXXXXGAFRSVQERXXXXXXXXXXXXXXXXXXXXXXXXXXXDE 232
            P  N                 GAFR ++                             +E
Sbjct: 172 APAANVNFGNSNGVPRGGQGRGGAFRRLR---------GGGGGGGRGFGRGRGRGRERNE 222

Query: 231 KISAEDLDADLEKY 190
           KISAEDLDADL+KY
Sbjct: 223 KISAEDLDADLDKY 236


>gb|EXB52242.1| RNA and export factor-binding protein 2 [Morus notabilis]
          Length = 248

 Score =  165 bits (417), Expect = 3e-38
 Identities = 114/260 (43%), Positives = 130/260 (50%), Gaps = 13/260 (5%)
 Frame = -2

Query: 924 MSSGLDMSLDDLINKNNTSDGLTSNXXXXXXXXXXXXXXXXXXXXXRFANHVPIRNAPYL 745
           MS  L MSLDD+I  N  +                           R  N  P R APY 
Sbjct: 1   MSDPLSMSLDDIIKTNKKTGS--------GNPRGRGRLSGGPGPARRVPNRAPNRAAPYA 52

Query: 744 THPPAQASGLMLQPQMIMAGG----------SVAESGTKLYVSNLDYGVSNEDIKVLFSE 595
             P A  +    Q  M M  G          S  ++GTKLY+SNL+YGVSNEDIK LFSE
Sbjct: 53  AAPKAPET--TWQHDMYMDQGTAFAAQAGRASAIQTGTKLYISNLEYGVSNEDIKELFSE 110

Query: 594 VGELERYSIHYDRSGRSKGTAEVVFLRQTDALAAIKRYNTVQLDGKPMQIELVGMNVVIP 415
           VG+L+RY+IHYDRSGRSKGTAEVVF R+ DA+AA+KRYN VQLDGKPM+IE+VG NV  P
Sbjct: 111 VGDLKRYAIHYDRSGRSKGTAEVVFSRRADAVAAVKRYNNVQLDGKPMKIEIVGTNVATP 170

Query: 414 AALPPNAXXXXXXXXXXXXXXXXGAFRSVQER---XXXXXXXXXXXXXXXXXXXXXXXXX 244
           AA PP A                   R  Q R                            
Sbjct: 171 AAPPPPANGSFGNSNGLP--------RGGQGRGGGAFGRPRGGGGGGRGPRRGRGRGQGR 222

Query: 243 XXDEKISAEDLDADLEKYRA 184
              EKISA+DLDADLEKY A
Sbjct: 223 GTGEKISADDLDADLEKYHA 242


>ref|XP_006424776.1| hypothetical protein CICLE_v10029160mg [Citrus clementina]
           gi|567864256|ref|XP_006424777.1| hypothetical protein
           CICLE_v10029160mg [Citrus clementina]
           gi|567864258|ref|XP_006424778.1| hypothetical protein
           CICLE_v10029160mg [Citrus clementina]
           gi|567864260|ref|XP_006424779.1| hypothetical protein
           CICLE_v10029160mg [Citrus clementina]
           gi|557526710|gb|ESR38016.1| hypothetical protein
           CICLE_v10029160mg [Citrus clementina]
           gi|557526711|gb|ESR38017.1| hypothetical protein
           CICLE_v10029160mg [Citrus clementina]
           gi|557526712|gb|ESR38018.1| hypothetical protein
           CICLE_v10029160mg [Citrus clementina]
           gi|557526713|gb|ESR38019.1| hypothetical protein
           CICLE_v10029160mg [Citrus clementina]
          Length = 247

 Score =  164 bits (415), Expect = 6e-38
 Identities = 113/254 (44%), Positives = 132/254 (51%), Gaps = 9/254 (3%)
 Frame = -2

Query: 924 MSSGLDMSLDDLINKNNTSDGLTSNXXXXXXXXXXXXXXXXXXXXXRFANHVPIRNAPYL 745
           MSS LDMSLDD+I KNN   G  +                      R  N    R APY 
Sbjct: 1   MSSALDMSLDDII-KNNKKSGSGN--------FRGRGRGSGPGPARRIPNRGANRVAPYT 51

Query: 744 THPPAQ--------ASGLMLQPQMIMAGGSVAESGTKLYVSNLDYGVSNEDIKVLFSEVG 589
           T    +        A  +   P       S  E+GTKLY+SNLDYGVSNEDIK LFSEVG
Sbjct: 52  TAKAPETTWQHDMFADQVSAFPVQQAGRASAIETGTKLYISNLDYGVSNEDIKELFSEVG 111

Query: 588 ELERYSIHYDRSGRSKGTAEVVFLRQTDALAAIKRYNTVQLDGKPMQIELVGMNVVIPAA 409
           +L+RYSIHYDRSGRSKGTAEVV+ R+ DA+AA+KRYN VQLDGKPM+IE+VG N+    A
Sbjct: 112 DLKRYSIHYDRSGRSKGTAEVVYSRRADAVAAVKRYNNVQLDGKPMKIEIVGTNIATRTA 171

Query: 408 LP-PNAXXXXXXXXXXXXXXXXGAFRSVQERXXXXXXXXXXXXXXXXXXXXXXXXXXXDE 232
            P  N                 GAFR ++                             +E
Sbjct: 172 APAANVNFGNSNGVPRGGQGRGGAFRRLR---------GGGGGGGRGFGRGRGRGRERNE 222

Query: 231 KISAEDLDADLEKY 190
           KISAEDLDADL+KY
Sbjct: 223 KISAEDLDADLDKY 236


>emb|CAN83715.1| hypothetical protein VITISV_023787 [Vitis vinifera]
          Length = 281

 Score =  163 bits (412), Expect = 1e-37
 Identities = 93/175 (53%), Positives = 107/175 (61%)
 Frame = -2

Query: 924 MSSGLDMSLDDLINKNNTSDGLTSNXXXXXXXXXXXXXXXXXXXXXRFANHVPIRNAPYL 745
           MS+ LDMSLDDLI KNN   G   N                            +  AP  
Sbjct: 1   MSNALDMSLDDLI-KNNKRSGGGGNARGRGRGSGPGPARRLPNRGANRITPYSVSKAPET 59

Query: 744 THPPAQASGLMLQPQMIMAGGSVAESGTKLYVSNLDYGVSNEDIKVLFSEVGELERYSIH 565
           T      +             S  E+GTKLY+SNLDYGVSNEDIK LFSEVG+L+RYSIH
Sbjct: 60  TWQHDMFADQAAAYPAQAGRTSAIETGTKLYISNLDYGVSNEDIKELFSEVGDLKRYSIH 119

Query: 564 YDRSGRSKGTAEVVFLRQTDALAAIKRYNTVQLDGKPMQIELVGMNVVIPAALPP 400
           YDRSGRSKGTAEVVF R+ DA+AA+KRYN VQLDGKPM+IE+VG N+  PAA+PP
Sbjct: 120 YDRSGRSKGTAEVVFSRRGDAVAAVKRYNNVQLDGKPMKIEIVGTNIATPAAVPP 174


>ref|XP_006288186.1| hypothetical protein CARUB_v10001422mg, partial [Capsella rubella]
           gi|482556892|gb|EOA21084.1| hypothetical protein
           CARUB_v10001422mg, partial [Capsella rubella]
          Length = 329

 Score =  162 bits (411), Expect = 2e-37
 Identities = 96/189 (50%), Positives = 108/189 (57%), Gaps = 18/189 (9%)
 Frame = -2

Query: 927 KMSSGLDMSLDDLINKNNTSDGLTSNXXXXXXXXXXXXXXXXXXXXXR--FANHVPIRNA 754
           KMS GLDMSLDD+I  N    G                            FAN V  R A
Sbjct: 41  KMSGGLDMSLDDIIKSNRKPTGSRGRGGVGGNSTGGRGGFGGSGSGPSRRFANRVGNRTA 100

Query: 753 PYLTHPPAQASGLMLQPQMI----------------MAGGSVAESGTKLYVSNLDYGVSN 622
           PY      QA   M Q  +                 + GGS  E+GTKLY+SNLDYGVSN
Sbjct: 101 PYSRPVQLQAQDAMWQNDVFATDASVAAAFGHQPAAVVGGSSIETGTKLYISNLDYGVSN 160

Query: 621 EDIKVLFSEVGELERYSIHYDRSGRSKGTAEVVFLRQTDALAAIKRYNTVQLDGKPMQIE 442
           EDIK LFSEVG+L+RY IHYDRSGRSKGTAEVVF R+ D LAA+KRYN VQLDGK M+IE
Sbjct: 161 EDIKELFSEVGDLKRYGIHYDRSGRSKGTAEVVFSRRGDGLAAVKRYNNVQLDGKLMKIE 220

Query: 441 LVGMNVVIP 415
           +VG N+  P
Sbjct: 221 IVGTNIPAP 229


>ref|XP_003538006.1| PREDICTED: THO complex subunit 4-like isoform X1 [Glycine max]
           gi|571488643|ref|XP_006590992.1| PREDICTED: THO complex
           subunit 4-like isoform X2 [Glycine max]
          Length = 247

 Score =  162 bits (411), Expect = 2e-37
 Identities = 109/258 (42%), Positives = 135/258 (52%), Gaps = 11/258 (4%)
 Frame = -2

Query: 924 MSSGLDMSLDDLINKNNTSDGLTSNXXXXXXXXXXXXXXXXXXXXXRFANHVPIRNAPYL 745
           MS+ +DMSLDD+I KNN   G  S+                      F N    R APY 
Sbjct: 1   MSAAMDMSLDDII-KNNKKSGSGSSRGRIRPSGSGPSRR--------FPNRAANRAAPYA 51

Query: 744 THPPAQASGL--MLQPQMIMAGGSVA--------ESGTKLYVSNLDYGVSNEDIKVLFSE 595
           T    +A+    +   Q + A G  A        E+GTKLY+SNLDYGVS++DIK LF+E
Sbjct: 52  TAKAPEATWQHDLYADQQVAAAGYPAQGGRAASIETGTKLYISNLDYGVSSDDIKELFAE 111

Query: 594 VGELERYSIHYDRSGRSKGTAEVVFLRQTDALAAIKRYNTVQLDGKPMQIELVGMNVVIP 415
           VG+L+R+++HYDRSGRSKGTAEVVF R+ DA+AA+KRYN VQLDGKPM+IE+VG N+  P
Sbjct: 112 VGDLKRHAVHYDRSGRSKGTAEVVFSRRADAVAAVKRYNNVQLDGKPMKIEIVGTNISTP 171

Query: 414 AALPP-NAXXXXXXXXXXXXXXXXGAFRSVQERXXXXXXXXXXXXXXXXXXXXXXXXXXX 238
              P  N                 GA R    R                           
Sbjct: 172 GVAPARNGAIGNFDGVPRSGQGRGGALRRPGGR--------GQGVRRDRGRGRGRGGAGR 223

Query: 237 DEKISAEDLDADLEKYRA 184
            EK+SA+DLDADLEKY A
Sbjct: 224 GEKVSADDLDADLEKYHA 241


>ref|XP_002873059.1| hypothetical protein ARALYDRAFT_487031 [Arabidopsis lyrata subsp.
           lyrata] gi|297318896|gb|EFH49318.1| hypothetical protein
           ARALYDRAFT_487031 [Arabidopsis lyrata subsp. lyrata]
          Length = 290

 Score =  162 bits (410), Expect = 2e-37
 Identities = 96/189 (50%), Positives = 109/189 (57%), Gaps = 19/189 (10%)
 Frame = -2

Query: 924 MSSGLDMSLDDLINKNNTSDGLTSNXXXXXXXXXXXXXXXXXXXXXR--FANHVPIRNAP 751
           MS GLDMSLDD+I  N    G                            FAN V  R AP
Sbjct: 1   MSGGLDMSLDDIIKSNRKPTGSRGRGGIGGGNNTGGRGGSGSNSGPSRRFANRVGARTAP 60

Query: 750 YLTHPPAQASGLMLQPQM-----------------IMAGGSVAESGTKLYVSNLDYGVSN 622
           Y      QA   M Q  +                 ++ GGS  E+GTKLY+SNLDYGVSN
Sbjct: 61  YSRPIQQQAHDAMWQNDVFATDASVAAAFGHHQTAVVGGGSSIETGTKLYISNLDYGVSN 120

Query: 621 EDIKVLFSEVGELERYSIHYDRSGRSKGTAEVVFLRQTDALAAIKRYNTVQLDGKPMQIE 442
           EDIK LFSEVG+L+RY IHYDRSGRSKGTAEVVF R+ DALAA+KRYN VQLDGK M+IE
Sbjct: 121 EDIKELFSEVGDLKRYGIHYDRSGRSKGTAEVVFSRRGDALAAVKRYNNVQLDGKLMKIE 180

Query: 441 LVGMNVVIP 415
           +VG N+  P
Sbjct: 181 IVGTNLSAP 189


>ref|XP_006480779.1| PREDICTED: THO complex subunit 4-like isoform X1 [Citrus sinensis]
          Length = 287

 Score =  161 bits (408), Expect = 4e-37
 Identities = 113/273 (41%), Positives = 132/273 (48%), Gaps = 10/273 (3%)
 Frame = -2

Query: 963 SEALHTFSTCSPKMSSGLDMSLDDLINKNNTSDGLTSNXXXXXXXXXXXXXXXXXXXXXR 784
           S  LH+    S +M+S LDMSLDD+I  N  S                            
Sbjct: 19  SRNLHSPFRVSSRMTSALDMSLDDIIKSNKRSSS--------RWRGGGGRGSGLGPARHT 70

Query: 783 FANHVPIRNAPYLTHPPAQASGLMLQPQMIMAGGSVA--------ESGTKLYVSNLDYGV 628
           F   V  R APY    P QA        MI  G   A        ESGTKLY+SNL+YGV
Sbjct: 71  FKRSVN-RTAPY--SKPVQAPQATWPQNMIFNGAVAAAAARSSSIESGTKLYISNLEYGV 127

Query: 627 SNEDIKVLFSEVGELERYSIHYDRSGRSKGTAEVVFLRQTDALAAIKRYNTVQLDGKPMQ 448
           SNEDIK LFSEVGEL+R S+H+DRSGRSKGTAEVV  R+ DA+AA+KRYN VQLDGKPM+
Sbjct: 128 SNEDIKELFSEVGELKRCSVHFDRSGRSKGTAEVVLTRRADAIAAVKRYNNVQLDGKPMK 187

Query: 447 IELVGMNVVIPAALPP--NAXXXXXXXXXXXXXXXXGAFRSVQERXXXXXXXXXXXXXXX 274
           IE++G N+  PAA+ P  N                 GA    + R               
Sbjct: 188 IEIIGTNIGPPAAVLPITNVMYGNEIGTSRSHLRMGGAIPIERPRHGRGGLGGRGGRGGA 247

Query: 273 XXXXXXXXXXXXDEKISAEDLDADLEKYRAMKK 175
                          +S EDLDADLEKY    K
Sbjct: 248 RGGRGRGRGQGAKPNLSVEDLDADLEKYHIEAK 280


>ref|XP_006363386.1| PREDICTED: THO complex subunit 4-like [Solanum tuberosum]
          Length = 254

 Score =  161 bits (408), Expect = 4e-37
 Identities = 108/251 (43%), Positives = 128/251 (50%), Gaps = 8/251 (3%)
 Frame = -2

Query: 918 SGLDMSLDDLINKNNTSDGLTSNXXXXXXXXXXXXXXXXXXXXXRFANHVPIRNAPYLTH 739
           + LDM+LDDLI KN T  G                         RF N    R APY T 
Sbjct: 4   AALDMTLDDLIKKNKTGTG-----GKPRGRGCGAASTSSAGPSRRFPNRSANRAAPYSTA 58

Query: 738 PPAQAS---GLMLQPQMI---MAGG--SVAESGTKLYVSNLDYGVSNEDIKVLFSEVGEL 583
              +AS    +    Q +    AGG  S  E+GTKLY+SNLDYGVS EDIK LFSE+G+L
Sbjct: 59  KAPEASWNHDMFAADQAVAFGQAGGRASSIETGTKLYISNLDYGVSKEDIKELFSEIGDL 118

Query: 582 ERYSIHYDRSGRSKGTAEVVFLRQTDALAAIKRYNTVQLDGKPMQIELVGMNVVIPAALP 403
           +RY++HYDRSGRSKGT EVVF R+ D LA +KR+N VQLDGKPM+IE+VG N+V P A  
Sbjct: 119 KRYAVHYDRSGRSKGTTEVVFSRRQDTLAGVKRFNNVQLDGKPMKIEIVGTNIVTPTAPF 178

Query: 402 PNAXXXXXXXXXXXXXXXXGAFRSVQERXXXXXXXXXXXXXXXXXXXXXXXXXXXDEKIS 223
            N                  AF  V+                              EK+S
Sbjct: 179 SNG---AFGFGDTNGAPRRYAFGQVRGGGFGRSRGARGRGRGFRGGNRGWGRGGRGEKVS 235

Query: 222 AEDLDADLEKY 190
           AEDLDADL KY
Sbjct: 236 AEDLDADLMKY 246


>ref|XP_007220341.1| hypothetical protein PRUPE_ppa014850mg [Prunus persica]
           gi|462416803|gb|EMJ21540.1| hypothetical protein
           PRUPE_ppa014850mg [Prunus persica]
          Length = 259

 Score =  161 bits (408), Expect = 4e-37
 Identities = 85/175 (48%), Positives = 112/175 (64%)
 Frame = -2

Query: 924 MSSGLDMSLDDLINKNNTSDGLTSNXXXXXXXXXXXXXXXXXXXXXRFANHVPIRNAPYL 745
           M  GLDMSLD+LI K     G                         R  N   +R APY 
Sbjct: 1   MPRGLDMSLDELIAKRKKPGGYHG---YFRGRGRGRGRGYGPGPTRRLMNRNTVRTAPYS 57

Query: 744 THPPAQASGLMLQPQMIMAGGSVAESGTKLYVSNLDYGVSNEDIKVLFSEVGELERYSIH 565
             P  Q     ++ +M ++GG+  E GTKLY+SNLDY VSN DI++LFSE+G ++R+++H
Sbjct: 58  AQPIMQVVRTTVEQEMEVSGGTDTEEGTKLYLSNLDYDVSNSDIELLFSEIGHVKRHTVH 117

Query: 564 YDRSGRSKGTAEVVFLRQTDALAAIKRYNTVQLDGKPMQIELVGMNVVIPAALPP 400
           YDRSGRSKGTAEV+F+  +DALAAI++YN VQLDGKP++IELVG+N V P ++PP
Sbjct: 118 YDRSGRSKGTAEVIFVHHSDALAAIEKYNNVQLDGKPLKIELVGVNPVAPISVPP 172


>ref|XP_006592204.1| PREDICTED: THO complex subunit 4-like isoform X1 [Glycine max]
           gi|571492362|ref|XP_006592205.1| PREDICTED: THO complex
           subunit 4-like isoform X2 [Glycine max]
          Length = 247

 Score =  161 bits (407), Expect = 5e-37
 Identities = 108/258 (41%), Positives = 134/258 (51%), Gaps = 11/258 (4%)
 Frame = -2

Query: 924 MSSGLDMSLDDLINKNNTSDGLTSNXXXXXXXXXXXXXXXXXXXXXRFANHVPIRNAPYL 745
           MS+ +DMSLDD+I KNN   G  S+                        N    R APY 
Sbjct: 1   MSAAMDMSLDDII-KNNKKSGSGSSRGRTRPSGSGPTRR--------LPNRAANRAAPYA 51

Query: 744 THPPAQASGL--MLQPQMIMAGGSVA--------ESGTKLYVSNLDYGVSNEDIKVLFSE 595
                +A+    +   Q + A G  A        E+GTKLY+SNLDYGVSN+DIK LF+E
Sbjct: 52  PAKAPEATWQHDLYADQHVAAAGYPAQGGRAASIETGTKLYISNLDYGVSNDDIKELFAE 111

Query: 594 VGELERYSIHYDRSGRSKGTAEVVFLRQTDALAAIKRYNTVQLDGKPMQIELVGMNVVIP 415
           VG+L+R+++HYDRSGRSKGTAEVVF R+ DA++A+KRYN VQLDGKPM+IE+VG N+  P
Sbjct: 112 VGDLKRHAVHYDRSGRSKGTAEVVFSRRADAVSAVKRYNNVQLDGKPMKIEIVGTNISTP 171

Query: 414 AALP-PNAXXXXXXXXXXXXXXXXGAFRSVQERXXXXXXXXXXXXXXXXXXXXXXXXXXX 238
              P PN                 GA R    R                           
Sbjct: 172 GVAPAPNGAIGNFNGVPRSGQGRGGALRRPGGR--------GQGIRRDRGRGRGRGGGGR 223

Query: 237 DEKISAEDLDADLEKYRA 184
            EK+SA+DLDADLEKY A
Sbjct: 224 GEKVSADDLDADLEKYHA 241


>ref|NP_974965.1| RNA recognition motif-containing protein [Arabidopsis thaliana]
           gi|332009873|gb|AED97256.1| RNA recognition
           motif-containing protein [Arabidopsis thaliana]
          Length = 242

 Score =  161 bits (407), Expect = 5e-37
 Identities = 108/254 (42%), Positives = 133/254 (52%), Gaps = 7/254 (2%)
 Frame = -2

Query: 924 MSSGLDMSLDDLINKNNTSDGLTSNXXXXXXXXXXXXXXXXXXXXXRFANHVPIRNAPYL 745
           MS+GLDMSLDD+I KN  S G                            N    R+APY 
Sbjct: 1   MSTGLDMSLDDMIAKNRKSRGGAGPARGTGSGSGPGPTRRNNP------NRKSTRSAPYQ 54

Query: 744 THPPAQASGLMLQPQMI--MAGGSVA--ESGTKLYVSNLDYGVSNEDIKVLFSEVGELER 577
           + P +     M   +     +G S A  E+GTKLY+SNLDYGV NEDIK LF+EVGEL+R
Sbjct: 55  SAPESTWGHDMFSDRSEDHRSGRSSAGIETGTKLYISNLDYGVMNEDIKELFAEVGELKR 114

Query: 576 YSIHYDRSGRSKGTAEVVFLRQTDALAAIKRYNTVQLDGKPMQIELVGMNV---VIPAAL 406
           Y++H+DRSGRSKGTAEVV+ R+ DALAA+K+YN VQLDGKPM+IE+VG N+     P+  
Sbjct: 115 YTVHFDRSGRSKGTAEVVYSRRGDALAAVKKYNDVQLDGKPMKIEIVGTNLQTAAAPSGR 174

Query: 405 PPNAXXXXXXXXXXXXXXXXGAFRSVQERXXXXXXXXXXXXXXXXXXXXXXXXXXXDEKI 226
           P N                   +R  Q R                            EKI
Sbjct: 175 PANG------------NSNGAPWRGGQGRGGQQRGGGRGGGGRGGGGRGRRPGKGPAEKI 222

Query: 225 SAEDLDADLEKYRA 184
           SAEDLDADL+KY +
Sbjct: 223 SAEDLDADLDKYHS 236


>ref|XP_007016490.1| RNA-binding family protein [Theobroma cacao]
           gi|508786853|gb|EOY34109.1| RNA-binding family protein
           [Theobroma cacao]
          Length = 241

 Score =  160 bits (406), Expect = 6e-37
 Identities = 112/256 (43%), Positives = 132/256 (51%), Gaps = 9/256 (3%)
 Frame = -2

Query: 924 MSSGLDMSLDDLINKNNTSDGLTSNXXXXXXXXXXXXXXXXXXXXXRFANHVPIRNAPYL 745
           MSS L+MSLDDLI +N  S    S                       F N    R+ PY 
Sbjct: 1   MSSALEMSLDDLIKRNRKSGSGNSRGRGRGSGPGPARR---------FPNRGANRSGPYT 51

Query: 744 T---------HPPAQASGLMLQPQMIMAGGSVAESGTKLYVSNLDYGVSNEDIKVLFSEV 592
                     H      G   Q Q   A  S  E+GTKLY+SNLDYGVSN+DIK LF+EV
Sbjct: 52  AAKAPETTWQHDMYSDKGAAFQGQAGRA--SAIETGTKLYISNLDYGVSNDDIKELFAEV 109

Query: 591 GELERYSIHYDRSGRSKGTAEVVFLRQTDALAAIKRYNTVQLDGKPMQIELVGMNVVIPA 412
           G+L+R++IHYDRSGRSKGTAEVVF R+TDA+AA+KRYN VQLDGKPM+IE+VG NV  P 
Sbjct: 110 GDLKRFTIHYDRSGRSKGTAEVVFSRRTDAMAAVKRYNNVQLDGKPMKIEIVGTNVATPG 169

Query: 411 ALPPNAXXXXXXXXXXXXXXXXGAFRSVQERXXXXXXXXXXXXXXXXXXXXXXXXXXXDE 232
           A  P+A                GA R    R                            E
Sbjct: 170 A--PSA-------GNGAFGNSNGAPRGGHGR-GGGFGKQRGGGGGRGFGRGRGRGKGRGE 219

Query: 231 KISAEDLDADLEKYRA 184
           K+SAEDLDA+LEKY +
Sbjct: 220 KVSAEDLDAELEKYHS 235


>emb|CBI31707.3| unnamed protein product [Vitis vinifera]
          Length = 160

 Score =  160 bits (405), Expect = 8e-37
 Identities = 87/166 (52%), Positives = 105/166 (63%)
 Frame = -2

Query: 924 MSSGLDMSLDDLINKNNTSDGLTSNXXXXXXXXXXXXXXXXXXXXXRFANHVPIRNAPYL 745
           M   LDMSLD++I     S G++SN                      F N   +R  PY 
Sbjct: 1   MPDPLDMSLDEIIRNKKKSAGVSSNVRGIGSGPGPGPARR-------FGNRELLRTTPYS 53

Query: 744 THPPAQASGLMLQPQMIMAGGSVAESGTKLYVSNLDYGVSNEDIKVLFSEVGELERYSIH 565
             P  Q      + ++   G S  E+GTKLY+SNL+YGVSN+DIK LFSEVGEL++YSIH
Sbjct: 54  VAPVFQVLEAAWKQEVFTGGVSTMETGTKLYISNLEYGVSNDDIKELFSEVGELKQYSIH 113

Query: 564 YDRSGRSKGTAEVVFLRQTDALAAIKRYNTVQLDGKPMQIELVGMN 427
           YD+SG SKGT EVVFLRQTDALAAIKRYN VQLDGKP +I+L+G N
Sbjct: 114 YDKSGISKGTGEVVFLRQTDALAAIKRYNNVQLDGKPQKIDLIGAN 159

