BLASTX nr result

ID: Paeonia25_contig00001100 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Paeonia25_contig00001100
         (1175 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007045493.1| THO complex subunit 2 isoform 1 [Theobroma c...   231   2e-65
ref|XP_007045494.1| THO complex subunit 2 isoform 2 [Theobroma c...   226   7e-64
ref|XP_007045497.1| THO complex subunit 2 isoform 5 [Theobroma c...   226   7e-64
emb|CBI26799.3| unnamed protein product [Vitis vinifera]              244   7e-62
ref|XP_002281541.2| PREDICTED: THO complex subunit 2-like [Vitis...   243   1e-61
ref|XP_006469280.1| PREDICTED: THO complex subunit 2-like [Citru...   216   5e-60
ref|XP_006448121.1| hypothetical protein CICLE_v10014076mg [Citr...   216   1e-59
ref|XP_007045496.1| THO complex subunit 2 isoform 4 [Theobroma c...   207   3e-58
ref|XP_007217095.1| hypothetical protein PRUPE_ppa000084mg [Prun...   207   6e-58
ref|XP_002527536.1| tho2 protein, putative [Ricinus communis] gi...   206   2e-56
ref|XP_002325475.1| F5A9.22 family protein [Populus trichocarpa]...   219   2e-54
ref|XP_006376042.1| F5A9.22 family protein [Populus trichocarpa]...   217   9e-54
ref|XP_006586338.1| PREDICTED: THO complex subunit 2-like [Glyci...   214   4e-53
ref|XP_006580422.1| PREDICTED: THO complex subunit 2-like isofor...   206   2e-50
ref|XP_006580421.1| PREDICTED: THO complex subunit 2-like isofor...   206   2e-50
ref|XP_007160466.1| hypothetical protein PHAVU_002G324500g [Phas...   204   5e-50
ref|XP_004503324.1| PREDICTED: LOW QUALITY PROTEIN: THO complex ...   198   3e-48
ref|XP_003631008.1| THO complex subunit [Medicago truncatula] gi...   197   7e-48
ref|XP_004142861.1| PREDICTED: THO complex subunit 2-like [Cucum...   177   2e-47
ref|XP_007045495.1| THO2 isoform 3 [Theobroma cacao] gi|50870943...   164   3e-45

>ref|XP_007045493.1| THO complex subunit 2 isoform 1 [Theobroma cacao]
            gi|508709428|gb|EOY01325.1| THO complex subunit 2 isoform
            1 [Theobroma cacao]
          Length = 1853

 Score =  231 bits (589), Expect(2) = 2e-65
 Identities = 166/398 (41%), Positives = 215/398 (54%), Gaps = 59/398 (14%)
 Frame = -3

Query: 1173 REDLKXXXXXXXXXXXARKPSWVTDEEFGMGLVDFKTQSN-ASKSLSGTFDSQSEGRTIS 997
            REDLK           ARK SWVTDEEFGMG ++ K  ++ ASKSL+G   S   G +I+
Sbjct: 1203 REDLKVLATGVAAALAARKSSWVTDEEFGMGYLELKPATSLASKSLAGNTVSVQNGSSIN 1262

Query: 996  TGASDSAGS--------------VKDQVLRTKPVDGRLERTESAS---GHMKVKGGDSQS 868
               S++AG+              VKDQ+ RTK  DGRLER E+AS     +K KGG S +
Sbjct: 1263 VSQSEAAGARAVALGTQQSDVNLVKDQIPRTKS-DGRLERAENASLGKSDLKTKGGTSAN 1321

Query: 867  SSTSAVP-------AGTSKSLENKKQMDEPAIKTLDENISKAGPKISAESELRASGKRTV 709
             S + +        AGT KSLEN+KQ+DE + K LDE+++K   K SAE E +AS KR+ 
Sbjct: 1322 GSDAVLSVVLATSQAGTGKSLENQKQLDESSNK-LDEHLAKVPAKNSAELESKASAKRSA 1380

Query: 708  PV-TVLKPSKQEIAKDDSKFAKPMDRTSTSSAIDKDIPTHPSEGRPGGANNLISTVTANG 532
            P  ++ K  KQ+  KDD K  K + RTS +  ID+D+P+H +EGR GG  N+ S VT+NG
Sbjct: 1381 PAGSLTKTQKQDPGKDDGKSGKAVGRTSVTCVIDRDVPSH-TEGRQGGTTNVPSAVTSNG 1439

Query: 531  NTVPXXXXXXXXXXXXXXXXXXXXXXAKFSALKDDNAELPDI--PRVRPVHSPRHDSSSI 358
            N V                             KDD +ELPD   P  R VHSPRHDSS+ 
Sbjct: 1440 NAVSAPPKG-----------------------KDDGSELPDASRPSSRIVHSPRHDSSA- 1475

Query: 357  PPSKSVDKQVKRTSPAEEPDRLSKRRKGDRDLED--------------EPRFVTDEKSGN 220
              SKS DK  KRT+P EE DRL+KRRKGD +L+D              +P+    +K G 
Sbjct: 1476 TVSKSSDKLQKRTTPVEETDRLTKRRKGDVELKDLDGEVRLSDRERSTDPQLADFDKPGT 1535

Query: 219  EE---HRS--------------KEHREYRERLERSDKS 157
            +E   HR+              +  R+YRERLER +KS
Sbjct: 1536 DELTSHRAVDKPLDRSKDKGSERHDRDYRERLERPEKS 1573



 Score = 46.2 bits (108), Expect(2) = 2e-65
 Identities = 29/55 (52%), Positives = 34/55 (61%), Gaps = 15/55 (27%)
 Frame = -1

Query: 149  RYGRERSVEKVQERNFS---DKAKD--------KPRHTETSS----VDDRFHGQS 30
            RYGRERSVE+  +RN     DKAKD        K R+ +TS+    VDDRFHGQS
Sbjct: 1590 RYGRERSVERSTDRNLERLGDKAKDERSKDERSKVRYADTSTEKSHVDDRFHGQS 1644


>ref|XP_007045494.1| THO complex subunit 2 isoform 2 [Theobroma cacao]
            gi|508709429|gb|EOY01326.1| THO complex subunit 2 isoform
            2 [Theobroma cacao]
          Length = 1844

 Score =  226 bits (576), Expect(2) = 7e-64
 Identities = 164/398 (41%), Positives = 213/398 (53%), Gaps = 59/398 (14%)
 Frame = -3

Query: 1173 REDLKXXXXXXXXXXXARKPSWVTDEEFGMGLVDFKTQSN-ASKSLSGTFDSQSEGRTIS 997
            REDLK           ARK SWVTDEEFGMG ++ K  ++ ASKSL+G   S   G +I+
Sbjct: 1203 REDLKVLATGVAAALAARKSSWVTDEEFGMGYLELKPATSLASKSLAGNTVSVQNGSSIN 1262

Query: 996  TGASDSAGS--------------VKDQVLRTKPVDGRLERTESAS---GHMKVKGGDSQS 868
               S++AG+              VKDQ+ RTK  DGRLER E+AS     +K KGG S +
Sbjct: 1263 VSQSEAAGARAVALGTQQSDVNLVKDQIPRTKS-DGRLERAENASLGKSDLKTKGGTSAN 1321

Query: 867  SSTSAVP-------AGTSKSLENKKQMDEPAIKTLDENISKAGPKISAESELRASGKRTV 709
             S + +        AGT KSLEN+KQ+DE + K LDE+++K   K SAE E +AS KR+ 
Sbjct: 1322 GSDAVLSVVLATSQAGTGKSLENQKQLDESSNK-LDEHLAKVPAKNSAELESKASAKRSA 1380

Query: 708  PV-TVLKPSKQEIAKDDSKFAKPMDRTSTSSAIDKDIPTHPSEGRPGGANNLISTVTANG 532
            P  ++ K  KQ+  KDD K  K + RTS +  ID+D+P+H +EGR GG  N+ S VT+NG
Sbjct: 1381 PAGSLTKTQKQDPGKDDGKSGKAVGRTSVTCVIDRDVPSH-TEGRQGGTTNVPSAVTSNG 1439

Query: 531  NTVPXXXXXXXXXXXXXXXXXXXXXXAKFSALKDDNAELPDI--PRVRPVHSPRHDSSSI 358
                                            KDD +ELPD   P  R VHSPRHDSS+ 
Sbjct: 1440 --------------------------------KDDGSELPDASRPSSRIVHSPRHDSSAT 1467

Query: 357  PPSKSVDKQVKRTSPAEEPDRLSKRRKGDRDLED--------------EPRFVTDEKSGN 220
              SKS DK  KRT+P EE DRL+KRRKGD +L+D              +P+    +K G 
Sbjct: 1468 V-SKSSDKLQKRTTPVEETDRLTKRRKGDVELKDLDGEVRLSDRERSTDPQLADFDKPGT 1526

Query: 219  EE---HRS--------------KEHREYRERLERSDKS 157
            +E   HR+              +  R+YRERLER +KS
Sbjct: 1527 DELTSHRAVDKPLDRSKDKGSERHDRDYRERLERPEKS 1564



 Score = 46.2 bits (108), Expect(2) = 7e-64
 Identities = 29/55 (52%), Positives = 34/55 (61%), Gaps = 15/55 (27%)
 Frame = -1

Query: 149  RYGRERSVEKVQERNFS---DKAKD--------KPRHTETSS----VDDRFHGQS 30
            RYGRERSVE+  +RN     DKAKD        K R+ +TS+    VDDRFHGQS
Sbjct: 1581 RYGRERSVERSTDRNLERLGDKAKDERSKDERSKVRYADTSTEKSHVDDRFHGQS 1635


>ref|XP_007045497.1| THO complex subunit 2 isoform 5 [Theobroma cacao]
            gi|508709432|gb|EOY01329.1| THO complex subunit 2 isoform
            5 [Theobroma cacao]
          Length = 1824

 Score =  226 bits (576), Expect(2) = 7e-64
 Identities = 164/398 (41%), Positives = 213/398 (53%), Gaps = 59/398 (14%)
 Frame = -3

Query: 1173 REDLKXXXXXXXXXXXARKPSWVTDEEFGMGLVDFKTQSN-ASKSLSGTFDSQSEGRTIS 997
            REDLK           ARK SWVTDEEFGMG ++ K  ++ ASKSL+G   S   G +I+
Sbjct: 1203 REDLKVLATGVAAALAARKSSWVTDEEFGMGYLELKPATSLASKSLAGNTVSVQNGSSIN 1262

Query: 996  TGASDSAGS--------------VKDQVLRTKPVDGRLERTESAS---GHMKVKGGDSQS 868
               S++AG+              VKDQ+ RTK  DGRLER E+AS     +K KGG S +
Sbjct: 1263 VSQSEAAGARAVALGTQQSDVNLVKDQIPRTKS-DGRLERAENASLGKSDLKTKGGTSAN 1321

Query: 867  SSTSAVP-------AGTSKSLENKKQMDEPAIKTLDENISKAGPKISAESELRASGKRTV 709
             S + +        AGT KSLEN+KQ+DE + K LDE+++K   K SAE E +AS KR+ 
Sbjct: 1322 GSDAVLSVVLATSQAGTGKSLENQKQLDESSNK-LDEHLAKVPAKNSAELESKASAKRSA 1380

Query: 708  PV-TVLKPSKQEIAKDDSKFAKPMDRTSTSSAIDKDIPTHPSEGRPGGANNLISTVTANG 532
            P  ++ K  KQ+  KDD K  K + RTS +  ID+D+P+H +EGR GG  N+ S VT+NG
Sbjct: 1381 PAGSLTKTQKQDPGKDDGKSGKAVGRTSVTCVIDRDVPSH-TEGRQGGTTNVPSAVTSNG 1439

Query: 531  NTVPXXXXXXXXXXXXXXXXXXXXXXAKFSALKDDNAELPDI--PRVRPVHSPRHDSSSI 358
                                            KDD +ELPD   P  R VHSPRHDSS+ 
Sbjct: 1440 --------------------------------KDDGSELPDASRPSSRIVHSPRHDSSAT 1467

Query: 357  PPSKSVDKQVKRTSPAEEPDRLSKRRKGDRDLED--------------EPRFVTDEKSGN 220
              SKS DK  KRT+P EE DRL+KRRKGD +L+D              +P+    +K G 
Sbjct: 1468 V-SKSSDKLQKRTTPVEETDRLTKRRKGDVELKDLDGEVRLSDRERSTDPQLADFDKPGT 1526

Query: 219  EE---HRS--------------KEHREYRERLERSDKS 157
            +E   HR+              +  R+YRERLER +KS
Sbjct: 1527 DELTSHRAVDKPLDRSKDKGSERHDRDYRERLERPEKS 1564



 Score = 46.2 bits (108), Expect(2) = 7e-64
 Identities = 29/55 (52%), Positives = 34/55 (61%), Gaps = 15/55 (27%)
 Frame = -1

Query: 149  RYGRERSVEKVQERNFS---DKAKD--------KPRHTETSS----VDDRFHGQS 30
            RYGRERSVE+  +RN     DKAKD        K R+ +TS+    VDDRFHGQS
Sbjct: 1581 RYGRERSVERSTDRNLERLGDKAKDERSKDERSKVRYADTSTEKSHVDDRFHGQS 1635


>emb|CBI26799.3| unnamed protein product [Vitis vinifera]
          Length = 1767

 Score =  244 bits (622), Expect = 7e-62
 Identities = 161/358 (44%), Positives = 206/358 (57%), Gaps = 19/358 (5%)
 Frame = -3

Query: 1173 REDLKXXXXXXXXXXXARKPSWVTDEEFGMGLVDFK-TQSNASKSLSGTFDSQSEGRTIS 997
            REDLK           ARKPSWVTDEEFGMG ++ K   S ASK+++             
Sbjct: 1203 REDLKVLATGVAAALAARKPSWVTDEEFGMGYLELKPAPSLASKTVAS-----------G 1251

Query: 996  TGASDSAGSVKDQVLRTKPVDGRLERTESAS------GHMKVKGGDS-------QSSSTS 856
            T   D+  SVK+QVLR K VDGRLERTES S       H KVKGG S       QS  ++
Sbjct: 1252 TQHLDAGNSVKEQVLRAKTVDGRLERTESVSLVKSDPVHAKVKGGSSVNGSDIQQSMPSA 1311

Query: 855  AVPAGTSKSLENKKQMDEPAIKTLDENISKAGPKISAESELRASGKRTVPV-TVLKPSKQ 679
            A   GTS+S EN++ +DE   +TLDE+  K   + S ESELRA+GKR++P  ++ K  K 
Sbjct: 1312 ASHTGTSRSGENQRPVDESTNRTLDESTVKVSSRASTESELRATGKRSLPSGSLTKQPKL 1371

Query: 678  EIAKDDSKFAKPMDRTSTSSAIDKDIPTHPSEGRPGGANNLISTVTANGNTVPXXXXXXX 499
            ++AKDDSK  K + RTS SS  D+D+P H  EGR  G  N+ S  TA+G++         
Sbjct: 1372 DVAKDDSKSGKGVGRTSGSSTSDRDLPAHQLEGRQSGVTNVSSAGTADGSSA-------- 1423

Query: 498  XXXXXXXXXXXXXXXAKFSALKDDNAELPD-IPRVRPVHSPRHDSSSIPPSKSVDKQVKR 322
                            + SA+KDD  E+ D  P  RP+HSPRHD+S+    KS DKQ KR
Sbjct: 1424 --------------DLRLSAVKDDGNEVSDRAPSSRPIHSPRHDNSA--TIKSGDKQQKR 1467

Query: 321  TSPAEEPDRLSKRRKGD---RDLEDEPRFVTDEKSGNEEHRSKEHREYRERLERSDKS 157
            TSPAEEP+R++KRRKGD   RD E E RF       +++   +  R++RERLER DKS
Sbjct: 1468 TSPAEEPERVNKRRKGDTEVRDFEGEVRF-------SDKESERYERDHRERLERPDKS 1518


>ref|XP_002281541.2| PREDICTED: THO complex subunit 2-like [Vitis vinifera]
          Length = 1849

 Score =  243 bits (619), Expect = 1e-61
 Identities = 172/406 (42%), Positives = 215/406 (52%), Gaps = 67/406 (16%)
 Frame = -3

Query: 1173 REDLKXXXXXXXXXXXARKPSWVTDEEFGMGLVDFK-TQSNASKSLSGTF---------- 1027
            REDLK           ARKPSWVTDEEFGMG ++ K   S ASKSL+G            
Sbjct: 1203 REDLKVLATGVAAALAARKPSWVTDEEFGMGYLELKPAPSLASKSLAGNLVAVPNGSGLN 1262

Query: 1026 ---DSQSEGRTISTGAS--DSAGSVKDQVLRTKPVDGRLERTESAS------GHMKVKGG 880
               +  S GRT+++G    D+  SVK+QVLR K VDGRLERTES S       H KVKGG
Sbjct: 1263 IFQNESSGGRTVASGTQHLDAGNSVKEQVLRAKTVDGRLERTESVSLVKSDPVHAKVKGG 1322

Query: 879  DS-------QSSSTSAVPAGTSKSLENKKQMDEPAIKTLDENISKAGPKISAESELRASG 721
             S       QS  ++A   GTS+S EN++ +DE   +TLDE+  K   + S ESELRA+G
Sbjct: 1323 SSVNGSDIQQSMPSAASHTGTSRSGENQRPVDESTNRTLDESTVKVSSRASTESELRATG 1382

Query: 720  KRTVPV-TVLKPSKQEIAKDDSKFAKPMDRTSTSSAIDKDIPTHPSEGRPGGANNLISTV 544
            KR++P  ++ K  K ++AKDDSK  K + RTS SS  D+D+P H  EGR  G  N+ S  
Sbjct: 1383 KRSLPSGSLTKQPKLDVAKDDSKSGKGVGRTSGSSTSDRDLPAHQLEGRQSGVTNVSSAG 1442

Query: 543  TANGNTVPXXXXXXXXXXXXXXXXXXXXXXAKFSALKDDNAELPD-IPRVRPVHSPRHDS 367
            TA+G                             S +KDD  E+ D  P  RP+HSPRHD+
Sbjct: 1443 TADG-----------------------------SVVKDDGNEVSDRAPSSRPIHSPRHDN 1473

Query: 366  SSIPPSKSVDKQVKRTSPAEEPDRLSKRRKGD---RDLEDEPRF---------------- 244
            S+    KS DKQ KRTSPAEEP+R++KRRKGD   RD E E RF                
Sbjct: 1474 SA--TIKSGDKQQKRTSPAEEPERVNKRRKGDTEVRDFEGEVRFSDKERSMDPRLDKSHA 1531

Query: 243  VTDEKSGNEEH-----------------RSKEHREYRERLERSDKS 157
            V  +KSG +E                    +  R++RERLER DKS
Sbjct: 1532 VDLDKSGTDEQGISRATDKPSDRLKDKGSERYERDHRERLERPDKS 1577


>ref|XP_006469280.1| PREDICTED: THO complex subunit 2-like [Citrus sinensis]
          Length = 1874

 Score =  216 bits (550), Expect(2) = 5e-60
 Identities = 157/390 (40%), Positives = 207/390 (53%), Gaps = 51/390 (13%)
 Frame = -3

Query: 1173 REDLKXXXXXXXXXXXARKPSWVTDEEFGMGLVDFK-TQSNASKSLSGTFDS-QSEGRTI 1000
            REDLK            RK  WVTDEEFGMG ++ K   S ASKSLSG   + Q     +
Sbjct: 1203 REDLKVLATGVAAALANRKSFWVTDEEFGMGYLELKPAPSLASKSLSGNVVAVQGSAINV 1262

Query: 999  STGASDSAGSVKDQVLRTKPVDGRLERTESAS----GHMKVKGGDSQSSS-------TSA 853
            S     +  SVKD + R KP DGRLERTES S     ++K+KG    + S       ++A
Sbjct: 1263 SQSEPGTGNSVKDHISRAKPGDGRLERTESISHVKSDNVKLKGSSLTNGSDIHSSVPSTA 1322

Query: 852  VPAGTSKSLENKKQMDEPAIKTLDENISKAGPKISAESELRASGKRTVP-VTVLKPSKQE 676
            V A  S+ +EN+KQ+DE      DEN++K   K SAESE +AS KR+VP  ++ K  KQ+
Sbjct: 1323 VQAEMSRVVENQKQVDE------DENMAKVAMKNSAESESKASVKRSVPSASLTKAPKQD 1376

Query: 675  IAKDDSKFAKPMDRTSTSSAIDKDIPTHPSEGRPGGANNLIST-------VTANGNTVPX 517
            +AKDD+K AK + RTS SSA D+D  +H +EG+ GGA  + S        V+A G++   
Sbjct: 1377 LAKDDNKSAKAVGRTSGSSANDRDFSSHAAEGKQGGATTVSSAAAVTANLVSAKGSSSSS 1436

Query: 516  XXXXXXXXXXXXXXXXXXXXXAKFSALKDDNAELPDIPR---VRPVHSPRHDSSSIPPSK 346
                                  + S  K D  E+ D P+    R +HSPRHD SS+  SK
Sbjct: 1437 RASDMHGNESKTDGGVAKSSEVRLSTGKSDGNEVSDAPKSSSSRAMHSPRHD-SSVATSK 1495

Query: 345  SVDKQVKRTSPAEEPDRLSKRRKGDRDLED--------------EPRFVTDEKSGNEEH- 211
            S D+  KRTSP+E+PDR SKR KGD +L D              +PRF   +K G +E  
Sbjct: 1496 SGDRLQKRTSPSEDPDRPSKRYKGDTELRDSDGEVRVPDRERSADPRFADLDKIGTDEQS 1555

Query: 210  ------RSKE------HREYRERLERSDKS 157
                  RSK+       R++RERL+R DKS
Sbjct: 1556 MYRTTDRSKDKGNERYERDHRERLDRLDKS 1585



 Score = 43.5 bits (101), Expect(2) = 5e-60
 Identities = 29/59 (49%), Positives = 35/59 (59%), Gaps = 19/59 (32%)
 Frame = -1

Query: 149  RYGRERSVEKVQER-------NFSDKAKD--------KPRHTETSS----VDDRFHGQS 30
            RYGRERSVE+ QER         +DKAKD        K R+ ++SS    VD+RFHGQS
Sbjct: 1602 RYGRERSVERGQERGADRAFDRLADKAKDDRNKDDRSKLRYNDSSSEKSHVDERFHGQS 1660


>ref|XP_006448121.1| hypothetical protein CICLE_v10014076mg [Citrus clementina]
            gi|557550732|gb|ESR61361.1| hypothetical protein
            CICLE_v10014076mg [Citrus clementina]
          Length = 1193

 Score =  216 bits (550), Expect(2) = 1e-59
 Identities = 159/390 (40%), Positives = 203/390 (52%), Gaps = 51/390 (13%)
 Frame = -3

Query: 1173 REDLKXXXXXXXXXXXARKPSWVTDEEFGMGLVDFK-TQSNASKSLSGTFDS-QSEGRTI 1000
            REDLK            RK  WVTDEEFGMG ++ K   S ASKSLSG   + Q     +
Sbjct: 522  REDLKVLATGVAAALANRKSFWVTDEEFGMGYLELKPAPSLASKSLSGNVVAVQGSAINV 581

Query: 999  STGASDSAGSVKDQVLRTKPVDGRLERTESAS----------GHMKVKGGDSQSSSTS-A 853
            S     +  SVKD + R KP DGRLERTES S          G     G D  SS  S A
Sbjct: 582  SQSEPGTGNSVKDHISRAKPGDGRLERTESISHVKSDNVKLKGSSLTNGSDIHSSMPSTA 641

Query: 852  VPAGTSKSLENKKQMDEPAIKTLDENISKAGPKISAESELRASGKRTVP-VTVLKPSKQE 676
            V A  S+ +EN+KQ+DE      DEN++K   K SAESE +AS KR+VP  ++ K  KQ+
Sbjct: 642  VQAEMSRVVENQKQVDE------DENMAKVAMKNSAESESKASVKRSVPSASLTKAPKQD 695

Query: 675  IAKDDSKFAKPMDRTSTSSAIDKDIPTHPSEGRPGGANNLIST-------VTANGNTVPX 517
            +AKDD+K AK + RTS SSA D+D  +H +EG+ GGA  + S        V+A G++   
Sbjct: 696  LAKDDNKSAKAVGRTSGSSANDRDFSSHAAEGKQGGATTVSSAAAVTANLVSAKGSSSSS 755

Query: 516  XXXXXXXXXXXXXXXXXXXXXAKFSALKDDNAELPDIPR---VRPVHSPRHDSSSIPPSK 346
                                  + S  K D  E+ D P+    R +HSPRHD SS+  SK
Sbjct: 756  RASDMHGNESKTDGGVAKSSEVRLSTGKSDGNEVSDAPKSSSSRTMHSPRHD-SSVAASK 814

Query: 345  SVDKQVKRTSPAEEPDRLSKRRKGDRDLED--------------EPRFVTDEKSGNEEH- 211
            S D+  KRTSP+E+PDR SKR KGD +L D              +PRF   +K G +E  
Sbjct: 815  SGDRLQKRTSPSEDPDRPSKRYKGDTELRDSDGEVRVPDRERSADPRFADLDKIGTDEQS 874

Query: 210  ------RSKE------HREYRERLERSDKS 157
                  RSK+       R++RERL+R DKS
Sbjct: 875  MYRTTDRSKDKGNERYERDHRERLDRLDKS 904



 Score = 42.0 bits (97), Expect(2) = 1e-59
 Identities = 28/59 (47%), Positives = 35/59 (59%), Gaps = 19/59 (32%)
 Frame = -1

Query: 149  RYGRERSVEKVQER-------NFSDKAKD--------KPRHTETSS----VDDRFHGQS 30
            RYGRERSVE+ QER         ++KAKD        K R+ ++SS    VD+RFHGQS
Sbjct: 921  RYGRERSVERGQERGADRAFDRLAEKAKDDRNKDDRSKLRYNDSSSEKSHVDERFHGQS 979


>ref|XP_007045496.1| THO complex subunit 2 isoform 4 [Theobroma cacao]
            gi|508709431|gb|EOY01328.1| THO complex subunit 2 isoform
            4 [Theobroma cacao]
          Length = 1831

 Score =  207 bits (527), Expect(2) = 3e-58
 Identities = 158/398 (39%), Positives = 203/398 (51%), Gaps = 59/398 (14%)
 Frame = -3

Query: 1173 REDLKXXXXXXXXXXXARKPSWVTDEEFGMGLVDFK-TQSNASKSLSGTFDSQSEGRTIS 997
            REDLK           ARK SWVTDEEFGMG ++ K   S ASKSL+G   S   G +I+
Sbjct: 1203 REDLKVLATGVAAALAARKSSWVTDEEFGMGYLELKPATSLASKSLAGNTVSVQNGSSIN 1262

Query: 996  TGASDSAGS--------------VKDQVLRTKPVDGRLERTESAS---GHMKVKGGDSQS 868
               S++AG+              VKDQ+ RTK  DGRLER E+AS     +K KGG S +
Sbjct: 1263 VSQSEAAGARAVALGTQQSDVNLVKDQIPRTKS-DGRLERAENASLGKSDLKTKGGTSAN 1321

Query: 867  SSTSAV-------PAGTSKSLENKKQMDEPAIKTLDENISKAGPKISAESELRASGKRTV 709
             S + +        AGT KSLEN+KQ+DE + K LDE+++K   K SAE E +AS KR+ 
Sbjct: 1322 GSDAVLSVVLATSQAGTGKSLENQKQLDESSNK-LDEHLAKVPAKNSAELESKASAKRSA 1380

Query: 708  PV-TVLKPSKQEIAKDDSKFAKPMDRTSTSSAIDKDIPTHPSEGRPGGANNLISTVTANG 532
            P  ++ K  KQ+  KDD K  K + RTS +  ID+D+P+H +EGR G             
Sbjct: 1381 PAGSLTKTQKQDPGKDDGKSGKAVGRTSVTCVIDRDVPSH-TEGRQG------------- 1426

Query: 531  NTVPXXXXXXXXXXXXXXXXXXXXXXAKFSALKDDNAELPDI--PRVRPVHSPRHDSSSI 358
                                            KDD +ELPD   P  R VHSPRHDSS+ 
Sbjct: 1427 --------------------------------KDDGSELPDASRPSSRIVHSPRHDSSA- 1453

Query: 357  PPSKSVDKQVKRTSPAEEPDRLSKRRKGDRDLED--------------EPRFVTDEKSGN 220
              SKS DK  KRT+P EE DRL+KRRKGD +L+D              +P+    +K G 
Sbjct: 1454 TVSKSSDKLQKRTTPVEETDRLTKRRKGDVELKDLDGEVRLSDRERSTDPQLADFDKPGT 1513

Query: 219  EE---HRS--------------KEHREYRERLERSDKS 157
            +E   HR+              +  R+YRERLER +KS
Sbjct: 1514 DELTSHRAVDKPLDRSKDKGSERHDRDYRERLERPEKS 1551



 Score = 46.2 bits (108), Expect(2) = 3e-58
 Identities = 29/55 (52%), Positives = 34/55 (61%), Gaps = 15/55 (27%)
 Frame = -1

Query: 149  RYGRERSVEKVQERNFS---DKAKD--------KPRHTETSS----VDDRFHGQS 30
            RYGRERSVE+  +RN     DKAKD        K R+ +TS+    VDDRFHGQS
Sbjct: 1568 RYGRERSVERSTDRNLERLGDKAKDERSKDERSKVRYADTSTEKSHVDDRFHGQS 1622


>ref|XP_007217095.1| hypothetical protein PRUPE_ppa000084mg [Prunus persica]
            gi|462413245|gb|EMJ18294.1| hypothetical protein
            PRUPE_ppa000084mg [Prunus persica]
          Length = 1878

 Score =  207 bits (528), Expect(2) = 6e-58
 Identities = 157/410 (38%), Positives = 208/410 (50%), Gaps = 71/410 (17%)
 Frame = -3

Query: 1173 REDLKXXXXXXXXXXXARKPSWVTDEEFGMGLVDFKTQSNASKSLSGTFDSQSEGRTIST 994
            REDLK           ARK SW+TDEEFG G ++ K+   ASKS +G   +   G TI+ 
Sbjct: 1201 REDLKVLATGVAAALAARKSSWITDEEFGNGYLELKSAPLASKSSAGNSAATHSGSTINI 1260

Query: 993  GASD---------------SAGSVKDQVLRTKPVDGRLERTESAS------GHMKVK--- 886
              S+               S+ SVKDQ+L+TK  DGRLER ES S      GH+K+K   
Sbjct: 1261 SQSEPIGGKVGALPSQHPESSNSVKDQILKTKTSDGRLERVESISTVKSDQGHLKLKVGS 1320

Query: 885  ---GGDSQS-SSTSAVPAGTSKSLENKKQMDEPAIKTLDENISKAGPKISAESELRASGK 718
               G D QS  S+ A+ +GTS+S+ENKKQ++E + +T DEN+ KA PK S+ESELRA  K
Sbjct: 1321 LVSGSDGQSLMSSPALQSGTSRSMENKKQVNESSNRTSDENMGKAAPKNSSESELRAQAK 1380

Query: 717  RTVPV-TVLKPSKQEIAKDDSKFAKPMDRTSTSSAIDKDIPTHPSEGRPGGANNLISTVT 541
            R+ P  ++ KP KQ++AKDD +  K          I +D+  H S      + N+   + 
Sbjct: 1381 RSGPAGSLAKPPKQDLAKDDGRSGK---------GIGRDVLCHAS----AVSTNVSPAIA 1427

Query: 540  ANGNTV--------PXXXXXXXXXXXXXXXXXXXXXXAKFSALKDDNAELPDIPR---VR 394
            ANGNTV                                + SA K+D  E  D  R    R
Sbjct: 1428 ANGNTVSASAKGSFAKTSVEIHGIDSKVDVGAAKASNTRVSAPKEDGPETSDALRPHSSR 1487

Query: 393  PVHSPRHDSSSIPPSKSVDKQVKRTSPAEEPDRLSKRRKGDRDLED-------------- 256
             VHSPRHD+S+   SKS DK  KRTSPAEE DR SKRRKG+ ++ D              
Sbjct: 1488 LVHSPRHDNSA-SASKSSDKLQKRTSPAEETDRQSKRRKGETEMRDFEGEARLSDRERSV 1546

Query: 255  EPRFVTDEKSGNEEH-----------RSKE------HREYRERLERSDKS 157
            + R +  +KSG ++            RSK+       ++YRERL+R DKS
Sbjct: 1547 DARLLDLDKSGTDDQSVYKATDKPSDRSKDKGSERHDKDYRERLDRPDKS 1596



 Score = 45.1 bits (105), Expect(2) = 6e-58
 Identities = 29/54 (53%), Positives = 34/54 (62%), Gaps = 14/54 (25%)
 Frame = -1

Query: 149  RYGRERSVEKVQER-------NFSDKAKD---KPRH----TETSSVDDRFHGQS 30
            R+GRE SVEKVQER         SDK+KD   K R+    TE S VD+R+HGQS
Sbjct: 1612 RHGREHSVEKVQERGMDRSVDRLSDKSKDDRGKVRYNDISTEKSHVDERYHGQS 1665


>ref|XP_002527536.1| tho2 protein, putative [Ricinus communis] gi|223533086|gb|EEF34845.1|
            tho2 protein, putative [Ricinus communis]
          Length = 1828

 Score =  206 bits (524), Expect(2) = 2e-56
 Identities = 156/395 (39%), Positives = 196/395 (49%), Gaps = 56/395 (14%)
 Frame = -3

Query: 1173 REDLKXXXXXXXXXXXARKPSWVTDEEFGMGLVDFKTQSNASKSLSGTFD---------- 1024
            REDLK           ARKPSWVTDEEFGMG +D +  + ASKS+SG             
Sbjct: 1198 REDLKVLATSVASALAARKPSWVTDEEFGMGYLDIRPPA-ASKSVSGNISVGQNSSGLNA 1256

Query: 1023 SQSE---GRTISTGAS--DSAGSVKDQVLRTKPVDGR--LERTESASGHMKVKGG----- 880
            SQ E   GR +ST     D   S K+ + R KP D +  +   +S S + KVKGG     
Sbjct: 1257 SQGESAGGRAVSTTTQHGDVGNSAKEHISRAKPADKQESVSYVKSDSVNQKVKGGSLVIQ 1316

Query: 879  -DSQSSSTSAV-PAGTSKSLENKKQMDEPAIKTLDENISKAGPKISAESELRASGKRTVP 706
             D QSS+      AG S+S EN+KQM E  I   D       PK SAESE +ASGKR +P
Sbjct: 1317 SDLQSSAALVTGQAGASRSAENQKQMSESPIIIPD------APKNSAESESKASGKRAMP 1370

Query: 705  VTVLKPSKQEIAKDDSKFAKPMDRTSTSSAIDKDIPTHPSEGRPGGANNLISTVTANGNT 526
               +K  +Q++AKDD K  K + R   +S+ DKD+P+H SE R G   N+ ST T+N   
Sbjct: 1371 AGSVKTPRQDVAKDDLKSGKTVGRVPVASSSDKDMPSHLSESRLGNGTNVSSTGTSNDGA 1430

Query: 525  VPXXXXXXXXXXXXXXXXXXXXXXAKFSALKDDNAELPDI--PRVRPVHSPRHDSSSIPP 352
                                       S +KDD  E+ D+  P  R VHSPRHD S    
Sbjct: 1431 AK-------------------------SVVKDDATEVGDVQKPPSRVVHSPRHDGSFASS 1465

Query: 351  SKSVDKQVKRTSPAEEPDRLSKRRKGDRDLED--------------EPRFVTDEKSGNEE 214
            SKS DK  KR SP ++PDRLSKRRKGD +L D              + R V  +K G++E
Sbjct: 1466 SKSSDKLQKRASPGDDPDRLSKRRKGDTELRDLDGDIRFSDRERPMDSRLVDLDKIGSDE 1525

Query: 213  --HRSKE--------------HREYRERLERSDKS 157
              HRS +               R++RER ER DKS
Sbjct: 1526 RVHRSMDKPLDRSKDKGMERYDRDHRERSERPDKS 1560



 Score = 41.2 bits (95), Expect(2) = 2e-56
 Identities = 28/56 (50%), Positives = 33/56 (58%), Gaps = 16/56 (28%)
 Frame = -1

Query: 149  RYGRERSVEKVQER--------NFSDKA-----KDKPRHTETSSV---DDRFHGQS 30
            RYGRERSVE+ QER         FSDK      KDK R+ +TS     DDRF+GQ+
Sbjct: 1577 RYGRERSVERGQERGGADRSFDRFSDKTKDERNKDKVRYGDTSVEKLHDDRFYGQN 1632


>ref|XP_002325475.1| F5A9.22 family protein [Populus trichocarpa]
            gi|222862350|gb|EEE99856.1| F5A9.22 family protein
            [Populus trichocarpa]
          Length = 1836

 Score =  219 bits (557), Expect = 2e-54
 Identities = 162/401 (40%), Positives = 209/401 (52%), Gaps = 62/401 (15%)
 Frame = -3

Query: 1173 REDLKXXXXXXXXXXXARKPSWVTDEEFGMGLVDFKTQSNASKSLSGTFDSQ-------- 1018
            REDLK           ARKPSW+TDEEFGMG ++ K  S ASKSLSG   +         
Sbjct: 1183 REDLKVLATGVAAALAARKPSWITDEEFGMGYLEIKPPSAASKSLSGNAAAAQNSSALNV 1242

Query: 1017 -----SEGRTISTGAS--DSAGSVKDQVLRTKPVDGRLERTESAS------GHMKVKGGD 877
                 +EGR   TG+   D   S ++Q+ R K  DGR +RT++ S      GH K KGG 
Sbjct: 1243 SQGEPAEGRAPHTGSQHGDPGNSTREQISRAKHADGRSDRTDNVSHSKFDQGHQKSKGGS 1302

Query: 876  S-------QSSSTSAVPAGTSKSLENKKQMDEPAIKTLDENISKAGPKISAESELRASGK 718
            S        + S +AV  G S+S EN+K +D+ + +TL++   +A PK  AESE++ S K
Sbjct: 1303 STNGSNAQSAGSAAAVHVGASRS-ENRKGVDDSSNRTLEDGTVRAAPKNLAESEMKISTK 1361

Query: 717  RTVPVTVLKPSKQEIAKDDSKFAKPMDRTSTSSAIDKDIPTHPSEGRPGGANNLISTVTA 538
            R V     K  KQ++ KDD+K  K + RT +SS  DKDI  H SEGR GGA N+ S +T 
Sbjct: 1362 RLVS----KTPKQDVVKDDNKSGKAVGRTPSSSTSDKDIQVHLSEGRQGGAANVSSALTL 1417

Query: 537  NGNTVPXXXXXXXXXXXXXXXXXXXXXXAKFSAL--KDDNAELPDIPR-VRPVHSPRHDS 367
            NGN V                        K S L  +  ++ + D+ +  + VHSPRHD 
Sbjct: 1418 NGNAV--------------------STSGKISTLSTRASDSYVADVQKPPQLVHSPRHD- 1456

Query: 366  SSIPPSKSVDKQVKRTSPAEEPDRLSKRRKGD---RDLEDEPRFVTDEKS---------- 226
            +S+  SKS DK  KR SPAEEPDR SKRRKGD   RDLE E +F   E+S          
Sbjct: 1457 NSVAASKSSDKLQKRASPAEEPDRSSKRRKGDGELRDLEGEVKFSERERSTDTRSADLDK 1516

Query: 225  -GNEE---HRSKE--------------HREYRERLERSDKS 157
             GN+E   HRS +               R++RER ER DKS
Sbjct: 1517 VGNDEQNKHRSTDKPLDRSKDKGNDRYDRDHRERSERPDKS 1557


>ref|XP_006376042.1| F5A9.22 family protein [Populus trichocarpa]
            gi|550325266|gb|ERP53839.1| F5A9.22 family protein
            [Populus trichocarpa]
          Length = 1805

 Score =  217 bits (552), Expect = 9e-54
 Identities = 159/387 (41%), Positives = 202/387 (52%), Gaps = 48/387 (12%)
 Frame = -3

Query: 1173 REDLKXXXXXXXXXXXARKPSWVTDEEFGMGLVDFKTQSNASKSLSGTFDSQ-------- 1018
            REDLK           ARKPSWVTDEEFGMG +D K  S ASKSLSG   +         
Sbjct: 1173 REDLKVLATGVAAALAARKPSWVTDEEFGMGYLDIKPPSVASKSLSGNVAAAQNSSALNV 1232

Query: 1017 -----SEGRTISTGAS--DSAGSVKDQVLRTKPVDGRLERTESASGHMKVKGGDSQSSST 859
                 ++GR + TG+   D   S +D + R K  DGR +RTE+ S H+K   G  +S   
Sbjct: 1233 SQGEPADGRALVTGSQHGDPGNSNRDPISRAKHADGRSDRTENIS-HLKSDLGHQKSK-- 1289

Query: 858  SAVPAGTSKSLENKKQMDEPAIKTLDENISKAGPKISAESELRASGKRTVPVTVLKPSKQ 679
                 G S+S EN+K MD+   +TL+++  +   K  AESEL+ S KR V     K  KQ
Sbjct: 1290 -----GASRSAENQKGMDDSTNRTLEDSTVRVAAKNLAESELKVSTKRPVS----KTPKQ 1340

Query: 678  EIAKDDSKFAKPMDRTSTSSAIDKDIPTHPSEGRPGGANNLISTVTANGNTVPXXXXXXX 499
            ++ KDD+K  K + RT +SS  DKDI  H SEGR GGA+N+ S +T+N +          
Sbjct: 1341 DVVKDDNKSGKGVGRTLSSSTSDKDIQVHLSEGRQGGASNVSSVLTSNESKPDSGGNKPM 1400

Query: 498  XXXXXXXXXXXXXXXAKFSALKDDNAELPDI--PRVRPVHSPRHDSSSIPPSKSVDKQVK 325
                                LKD+  E+ D+  P  R VHSPRHD+S +  SKS DK  K
Sbjct: 1401 --------------------LKDEATEVADVQKPPSRLVHSPRHDNS-VAASKSSDKLQK 1439

Query: 324  RTSPAEEPDRLSKRRKGD---RDLEDEPRFVTDEKS-----------GNEEH-------- 211
            R SPAEEPDRLSKR+KGD   RDLE E +F   E+S           GN+EH        
Sbjct: 1440 RASPAEEPDRLSKRQKGDVELRDLEGEVKFSERERSTDTRSADLDKVGNDEHNLYRSVDK 1499

Query: 210  ---RSKE------HREYRERLERSDKS 157
               RSK+       R++RER ER DKS
Sbjct: 1500 PLDRSKDKGNDRYDRDHRERSERPDKS 1526


>ref|XP_006586338.1| PREDICTED: THO complex subunit 2-like [Glycine max]
          Length = 1778

 Score =  214 bits (546), Expect = 4e-53
 Identities = 158/401 (39%), Positives = 208/401 (51%), Gaps = 62/401 (15%)
 Frame = -3

Query: 1173 REDLKXXXXXXXXXXXARKPSWVTDEEFGMGLVDFKTQSNASKSLSGTFDSQSEGRTIST 994
            REDLK           ARKPSWVTDEEFGMG ++ K   + +KS +G   +   G  ++ 
Sbjct: 1110 REDLKVLATGVAAALAARKPSWVTDEEFGMGYLELKPAPSVTKSSAGNSATVQSGINLNV 1169

Query: 993  GASDSAGS--------VKDQVLRTKPVDGRLERTESAS------GHMKVK------GGDS 874
              ++SA          VKDQ +RTK  DGR ERTES +      GH+K+K      G D+
Sbjct: 1170 SQTESASGKHVDSGNIVKDQAMRTKTADGRSERTESITVTKSDTGHIKLKSSSMVNGLDA 1229

Query: 873  QSS-STSAVPAGTSKSLENKKQMDEPAIKTLDENISKAGPKISAESELRASGKRTVPVTV 697
            QSS + S+V +GTSKS+EN KQ++E   +  DE+ ++        +ELR S KR+VP   
Sbjct: 1230 QSSLAPSSVQSGTSKSMENPKQVEESINRASDEHGTRT-------TELRTSAKRSVPAGS 1282

Query: 696  L-KPSKQEIAKDDSKFAKPMDRTSTSSAIDKDIPTHPSEGR-------PGGANNLISTVT 541
            L KPSKQ+  K+D +  KP+ RTS SS+ DK++ TH  EGR       P    N IS  T
Sbjct: 1283 LSKPSKQDPVKEDGRSGKPVARTSGSSSSDKELQTHALEGRYTGTTNVPSSNGNTISGST 1342

Query: 540  ANGNTVPXXXXXXXXXXXXXXXXXXXXXXAKFSALKDDNAELPDIPR---VRPVHSPRHD 370
               N                          + S +KDD  ++ D PR    R VHSPR++
Sbjct: 1343 KGSNPPVKISLDGPGNESKAEVGVAKSSDIRASMVKDDGNDITDNPRGASSRVVHSPRYE 1402

Query: 369  SSSIPPSKSVDKQVKRTSPAEEPDRLSKRRKGD---RDLEDEPRF----------VTDEK 229
            ++ +  SKS DK  KR S AEEPDRL KRRKGD   RD E E RF            D+K
Sbjct: 1403 NTGVT-SKSNDKVQKRASSAEEPDRLGKRRKGDVELRDFETEVRFSEREKMMDPRFADDK 1461

Query: 228  SGNEEH-----------RSKE------HREYRERLERSDKS 157
            SG EEH           R+K+       R++RER++R DKS
Sbjct: 1462 SGPEEHGLYRAGDKPLERAKDKGNERYERDHRERMDRLDKS 1502


>ref|XP_006580422.1| PREDICTED: THO complex subunit 2-like isoform X2 [Glycine max]
          Length = 1845

 Score =  206 bits (523), Expect = 2e-50
 Identities = 155/404 (38%), Positives = 211/404 (52%), Gaps = 65/404 (16%)
 Frame = -3

Query: 1173 REDLKXXXXXXXXXXXARKPSWVTDEEFGMGLVDFKTQSNASKSLSGTFDSQSEGRTIST 994
            REDLK           ARKPSWVTDEEFGMG ++ K   + +KS +G   +   G  ++ 
Sbjct: 1202 REDLKVLATGVAAALAARKPSWVTDEEFGMGYLELKPSPSMTKSSAGNSATVQSGINLNV 1261

Query: 993  GAS--------DSAGSVKDQVLRTKPVDGRLER------TESASGHMKVK------GGDS 874
              +        DS  +VKDQ +RTK VDG+ ER      T+S +GH+K+K      G D+
Sbjct: 1262 SQTESVSGKHVDSGNTVKDQAIRTKTVDGKSERIESITVTKSDAGHIKLKSSSMVNGLDA 1321

Query: 873  QSS-STSAVPAGTSKSLENKKQMDEPAIKTLDENISKAGPKISAESELRASGKRTVPVTV 697
            QSS + S+V +G  KS+EN KQ++E   +  DE+ +++       +ELR S KR+VP + 
Sbjct: 1322 QSSMAPSSVQSGMPKSMENPKQVEESINRASDEHGTRS-------TELRTSAKRSVPASS 1374

Query: 696  L-KPSKQEIAKDDSKFAKPMDRTSTSSAIDKDIPTHPSEGRPGGANNLISTVTANGNTVP 520
            L KPSKQ+  K+D +  KP+ RTS S + DKD+ TH  EGR  G  N+ S+   NGNT+ 
Sbjct: 1375 LAKPSKQDPVKEDGRSGKPVARTSGSLSSDKDLQTHALEGRHTGTTNVPSS---NGNTIS 1431

Query: 519  XXXXXXXXXXXXXXXXXXXXXXAKF----------SALKDDNAELPDIPR---VRPVHSP 379
                                  A+           S +KDD  ++ D PR    R VHSP
Sbjct: 1432 GSTKGSNPPVKISLDGPGNESKAEVGVAKSSDIRASMVKDDGNDITDNPRGSSSRIVHSP 1491

Query: 378  RHDSSSIPPSKSVDKQVKRTSPAEEPDRLSKRRKGD---RDLEDEPRF----------VT 238
            RH+++ +  SKS D+  KR S  EEPDRL KRRKGD   RD E E RF            
Sbjct: 1492 RHENTVVT-SKSNDRVQKRASSVEEPDRLGKRRKGDVELRDFETELRFSEREKMMDPRFA 1550

Query: 237  DEKSGNEEH-----------RSKE------HREYRERLERSDKS 157
            D+K G EEH           R+K+       R++RER++R DKS
Sbjct: 1551 DDKLGPEEHGLYRASDKPLERTKDKGNERYERDHRERMDRLDKS 1594


>ref|XP_006580421.1| PREDICTED: THO complex subunit 2-like isoform X1 [Glycine max]
          Length = 1870

 Score =  206 bits (523), Expect = 2e-50
 Identities = 155/404 (38%), Positives = 211/404 (52%), Gaps = 65/404 (16%)
 Frame = -3

Query: 1173 REDLKXXXXXXXXXXXARKPSWVTDEEFGMGLVDFKTQSNASKSLSGTFDSQSEGRTIST 994
            REDLK           ARKPSWVTDEEFGMG ++ K   + +KS +G   +   G  ++ 
Sbjct: 1202 REDLKVLATGVAAALAARKPSWVTDEEFGMGYLELKPSPSMTKSSAGNSATVQSGINLNV 1261

Query: 993  GAS--------DSAGSVKDQVLRTKPVDGRLER------TESASGHMKVK------GGDS 874
              +        DS  +VKDQ +RTK VDG+ ER      T+S +GH+K+K      G D+
Sbjct: 1262 SQTESVSGKHVDSGNTVKDQAIRTKTVDGKSERIESITVTKSDAGHIKLKSSSMVNGLDA 1321

Query: 873  QSS-STSAVPAGTSKSLENKKQMDEPAIKTLDENISKAGPKISAESELRASGKRTVPVTV 697
            QSS + S+V +G  KS+EN KQ++E   +  DE+ +++       +ELR S KR+VP + 
Sbjct: 1322 QSSMAPSSVQSGMPKSMENPKQVEESINRASDEHGTRS-------TELRTSAKRSVPASS 1374

Query: 696  L-KPSKQEIAKDDSKFAKPMDRTSTSSAIDKDIPTHPSEGRPGGANNLISTVTANGNTVP 520
            L KPSKQ+  K+D +  KP+ RTS S + DKD+ TH  EGR  G  N+ S+   NGNT+ 
Sbjct: 1375 LAKPSKQDPVKEDGRSGKPVARTSGSLSSDKDLQTHALEGRHTGTTNVPSS---NGNTIS 1431

Query: 519  XXXXXXXXXXXXXXXXXXXXXXAKF----------SALKDDNAELPDIPR---VRPVHSP 379
                                  A+           S +KDD  ++ D PR    R VHSP
Sbjct: 1432 GSTKGSNPPVKISLDGPGNESKAEVGVAKSSDIRASMVKDDGNDITDNPRGSSSRIVHSP 1491

Query: 378  RHDSSSIPPSKSVDKQVKRTSPAEEPDRLSKRRKGD---RDLEDEPRF----------VT 238
            RH+++ +  SKS D+  KR S  EEPDRL KRRKGD   RD E E RF            
Sbjct: 1492 RHENTVVT-SKSNDRVQKRASSVEEPDRLGKRRKGDVELRDFETELRFSEREKMMDPRFA 1550

Query: 237  DEKSGNEEH-----------RSKE------HREYRERLERSDKS 157
            D+K G EEH           R+K+       R++RER++R DKS
Sbjct: 1551 DDKLGPEEHGLYRASDKPLERTKDKGNERYERDHRERMDRLDKS 1594


>ref|XP_007160466.1| hypothetical protein PHAVU_002G324500g [Phaseolus vulgaris]
            gi|561033881|gb|ESW32460.1| hypothetical protein
            PHAVU_002G324500g [Phaseolus vulgaris]
          Length = 1864

 Score =  204 bits (520), Expect = 5e-50
 Identities = 154/404 (38%), Positives = 207/404 (51%), Gaps = 65/404 (16%)
 Frame = -3

Query: 1173 REDLKXXXXXXXXXXXARKPSWVTDEEFGMGLVDFKTQSNASKSLSGTFDSQSEGRTIST 994
            REDLK           ARKPSWVTDEEFGMG ++ K   + +KS +G   +   G  ++ 
Sbjct: 1201 REDLKVLATGVAAALAARKPSWVTDEEFGMGYLELKPAPSGTKSSAGNPSTVHSGMNLNV 1260

Query: 993  GASDSAG--------SVKDQVLRTKPVDGRLERTESA------SGHMKVKGG------DS 874
              ++SA         +VKDQV+RTK  DG+ ERTES       SGH KVK G      D 
Sbjct: 1261 SQTESASGKHVDSGNTVKDQVIRTKTTDGKSERTESMTATKSDSGHTKVKTGAMVNGFDG 1320

Query: 873  QSSS-TSAVPAGTSKSLENKKQMDEPAIKTLDENISKAGPKISAESELRASGKRTVPVTV 697
            Q+SS +S++ +G SKS+EN KQ++E   +  D++ ++        +E RAS KR+VP   
Sbjct: 1321 QTSSISSSIQSGMSKSMENSKQVEELINRASDDHGTRT-------AESRASAKRSVPTGS 1373

Query: 696  L-KPSKQEIAKDDSKFAKPMDRTSTSSAIDKDIPTHPSEGRPGGANNLISTVTANGNTVP 520
            L KPSKQ+  K+DS+  KP+ RTS S + DKD+ +        G  N+ S+V+ANGNT+ 
Sbjct: 1374 LSKPSKQDPLKEDSRSGKPVARTSGSLSSDKDLHS--------GTTNVTSSVSANGNTIT 1425

Query: 519  XXXXXXXXXXXXXXXXXXXXXXAKF----------SALKDDNAELPDIPR---VRPVHSP 379
                                  A+           S +KDD  +  D+ R    R VHSP
Sbjct: 1426 GSTKGSNAPVRISLDGPGNESKAEVGVSKSSDIRASVVKDDGNDTADLTRGSSSRVVHSP 1485

Query: 378  RHDSSSIPPSKSVDKQVKRTSPAEEPDRLSKRRKGD---RDLEDEPRF----------VT 238
            RH+++ +  SKS +K  KR S AEEPDRL KRRKGD   RD E E RF            
Sbjct: 1486 RHENTGVA-SKSNEKVQKRASSAEEPDRLGKRRKGDVELRDFESEVRFSDRDKLMDPRFA 1544

Query: 237  DEKSGNEEH-----------------RSKEHREYRERLERSDKS 157
            D+K G EEH                   +  R++RERL+R DKS
Sbjct: 1545 DDKLGPEEHGLYRAGDKSLERPKDKGNERYERDHRERLDRVDKS 1588


>ref|XP_004503324.1| PREDICTED: LOW QUALITY PROTEIN: THO complex subunit 2-like [Cicer
            arietinum]
          Length = 2058

 Score =  198 bits (504), Expect = 3e-48
 Identities = 157/401 (39%), Positives = 205/401 (51%), Gaps = 62/401 (15%)
 Frame = -3

Query: 1173 REDLKXXXXXXXXXXXARKPSWVTDEEFGMGLVDFKTQSNASKSLSGTFDSQSEGRTIST 994
            REDLK           ARK SWVTDEEFGMG +D K   + +KS +    +   G +++ 
Sbjct: 1399 REDLKVLATGVAAALAARKSSWVTDEEFGMGYLDLKAAPSTTKSSAXNSAAVQSGISLNV 1458

Query: 993  GASDSAG--------SVKDQVLRTKPVDGRLERTESA------SGHMKVKGG------DS 874
              ++S          + KDQ +RTK  DG+ ERTES       SGH+K+KGG      D+
Sbjct: 1459 SQTESTSGKHLESGNTAKDQTIRTKTADGKSERTESITATKYDSGHVKLKGGSMVNGLDA 1518

Query: 873  QSSSTSAVPAGTS---KSLENKKQMDEPAIKTLDENISKAGPKISAESELRASGKRTVPV 703
            QSS  S  PAG S   KS+EN KQM+E   K  D++ ++         E R S KR+V  
Sbjct: 1519 QSSLPS--PAGQSGALKSVENPKQMEESISKAPDDHTTR-------NVESRTSTKRSVAA 1569

Query: 702  TVL-KPSKQEIAKDDSKFAKPMDRTSTSSAIDKDIPTHPSEGRPGGANNLISTVTANGNT 526
              L KPSKQ+  K+D +F K + RTS S   DKD+ TH S+GR  G  N+ ++V+ANGN+
Sbjct: 1570 GSLSKPSKQDPVKEDGRFGKTVIRTSGSLCSDKDLQTHVSDGRHTGI-NISTSVSANGNS 1628

Query: 525  VP-----XXXXXXXXXXXXXXXXXXXXXXAKFSALKDDNAELPDIPR---VRPVHSPRHD 370
            V                            +K S +KDD +++ D  R    R VHSPRH+
Sbjct: 1629 VSGSAKGLAPLAKISFDGSGNESKAEVGASKSSLVKDDGSDIADFTRGSSSRVVHSPRHE 1688

Query: 369  SSSIPPSKSVDKQVKRTSPAEEPDRLSKRRKGD---RDLEDEPRF----------VTDEK 229
            +++   SKS DK  KR   A+E DRL KRRKGD   RDLE E RF          V D+K
Sbjct: 1689 NTA--TSKSSDKIQKRAGSADELDRLGKRRKGDVDLRDLEGEVRFSEREKLLDPRVDDDK 1746

Query: 228  SGNEE-----------HRSKE------HREYRERLERSDKS 157
             G +E            R KE       RE+RERL+R DKS
Sbjct: 1747 GGPDELGLYRAGDKTLERPKEKGNERYEREHRERLDRLDKS 1787


>ref|XP_003631008.1| THO complex subunit [Medicago truncatula] gi|355525030|gb|AET05484.1|
            THO complex subunit [Medicago truncatula]
          Length = 2048

 Score =  197 bits (501), Expect = 7e-48
 Identities = 156/400 (39%), Positives = 203/400 (50%), Gaps = 61/400 (15%)
 Frame = -3

Query: 1173 REDLKXXXXXXXXXXXARKPSWVTDEEFGMGLVDFKTQSNASKSLSGTFDSQSEG----- 1009
            REDLK           ARKPSWVTDEEFGMG ++ K   + +KS +G   +   G     
Sbjct: 1391 REDLKVLATGVAAALAARKPSWVTDEEFGMGYLELKPAPSMTKSAAGNSAAVQSGIGLQF 1450

Query: 1008 ---RTISTGASDSAGSVKDQVLRTKPVDGRLERTESA------SGHMKVKGG------DS 874
                + S    DS  +VKDQ ++TK  DG+ ERTES       SGH K+KG       D+
Sbjct: 1451 SQTESASGKHLDSGNTVKDQTVKTKTADGKSERTESLTATKSDSGHGKLKGSSMVNGVDA 1510

Query: 873  QSSSTSAVPAGTS---KSLENKKQMDEPAIKTLDENISKAGPKISAESELRASGKRTVPV 703
            QSS  S  PAG S   KS+EN+KQ++E   +  DE+I++     + ES      +     
Sbjct: 1511 QSSLAS--PAGQSGALKSVENQKQVEESISRAPDEHITR-----NVESRPSVKQRSVATG 1563

Query: 702  TVLKPSKQEIAKDDSKFAKPMDRTSTSSAIDKDIPTHPSEGRPGGANNLISTVTANGNTV 523
            ++LKPSKQ+  K+D +  K + RTS SS+ DKD+ TH S+GR  G  N+ S+ +ANGN+V
Sbjct: 1564 SLLKPSKQDPLKEDGRSGKTVTRTSGSSSSDKDLQTHASDGRHTG-TNISSSFSANGNSV 1622

Query: 522  P-----XXXXXXXXXXXXXXXXXXXXXXAKFSALKDDNAELPDIPR---VRPVHSPRHDS 367
                                        AKFS +KDD  E  D  R    R VHSPRH++
Sbjct: 1623 SGSAKGLAQAATTAFDGSGNESKAEVGAAKFSMVKDDVNEFADFTRGSSSRVVHSPRHEN 1682

Query: 366  SSIPPSKSVDKQVKRTSPAEEPDRLSKRRKGD---RDLEDEPRF----------VTDEKS 226
            ++   SKS DK  KR    +E DRL KRRKGD   RDLE E RF          + D+K 
Sbjct: 1683 TA--TSKSSDKIQKRAGSVDELDRLGKRRKGDIDLRDLEGEVRFSEREKLMDPRLADDKV 1740

Query: 225  GNEE-----------HRSKE------HREYRERLERSDKS 157
            G +E            R KE       RE+RERL+R DKS
Sbjct: 1741 GPDELGVYRTGDKTLERPKEKGTDRYEREHRERLDRLDKS 1780


>ref|XP_004142861.1| PREDICTED: THO complex subunit 2-like [Cucumis sativus]
            gi|449506883|ref|XP_004162874.1| PREDICTED: THO complex
            subunit 2-like [Cucumis sativus]
          Length = 1887

 Score =  177 bits (449), Expect(2) = 2e-47
 Identities = 148/410 (36%), Positives = 201/410 (49%), Gaps = 71/410 (17%)
 Frame = -3

Query: 1173 REDLKXXXXXXXXXXXARKPSWVTDEEFGMGLVDFKTQSNASKSLSGT---------FDS 1021
            REDLK           ARKPSWVTDEEFGMG ++ KT S ASK  +           F S
Sbjct: 1204 REDLKVLATGVAAALAARKPSWVTDEEFGMGYLELKTPSLASKPSASNLASSQNNSIFVS 1263

Query: 1020 QSEGRTISTGA-----SDSAGSVKDQVLRTKPVDGRLERTESAS------GH-----MKV 889
            Q+E     T A     SDS    KD  LR++  D R ++ +  S      GH     M +
Sbjct: 1264 QNEPVGGKTSALPIPNSDSGNMAKDHSLRSRTSDVRTDKIDGLSVPKSELGHGKQKGMSL 1323

Query: 888  KGGDSQSSSTS-AVPAGTSKSLENKKQMDEPAIKTLDENISKAGPKISAESELRASGKRT 712
             G DSQ    S +V +G+ K ++++K  D+ + +TLDE  SK   K S+ESELR S KR+
Sbjct: 1324 NGPDSQPLVPSTSVHSGSLKMVDSQKPGDD-STRTLDEGSSKVVSKTSSESELRGSTKRS 1382

Query: 711  VPVTVL-KPSKQEIAKDDSKFAKPMDRTSTSSAIDKDIPTHPSEG-RPGGANNLISTVTA 538
             PVT L K  KQ+I KD+ +  K   +   SS  ++++P H ++G R GG +N  S + +
Sbjct: 1383 GPVTSLNKAPKQDITKDEIRSGKAASKNPGSSTSERELPVHATDGGRHGGPSNSPS-IMS 1441

Query: 537  NGNT---------VPXXXXXXXXXXXXXXXXXXXXXXAKFSALKDDNAELPDIPRV---R 394
            NGNT         +                        + S++KDD  E  D+ R    R
Sbjct: 1442 NGNTQNSLTKGSSLTVKASDGHTIESKAESGVGRTSDGRVSSVKDDGPEALDVSRSSSSR 1501

Query: 393  PVHSPRHDSSSIPPSKSVDKQVKRTSPAEEPDRLSKRRKGDRDLED-------------- 256
              HSPRHD+S+   S+S DK  KR SPAEEPDR  KRRKGD ++ D              
Sbjct: 1502 LGHSPRHDNSA-SGSRSSDKLQKRASPAEEPDRQGKRRKGDGEIRDVDGDFRISDKDRSM 1560

Query: 255  EPRFVTDEKSGNEEH-----------RSKE------HREYRERLERSDKS 157
            +PR +  +K G EE            R+K+       R+YR+R ER +KS
Sbjct: 1561 DPRSIDADKIGMEEQSGYRGLDKPLDRTKDKVNERYDRDYRDRAERPEKS 1610



 Score = 40.0 bits (92), Expect(2) = 2e-47
 Identities = 25/54 (46%), Positives = 30/54 (55%), Gaps = 14/54 (25%)
 Frame = -1

Query: 149  RYGRERSVEKVQ------------ERNFSDKAKDK--PRHTETSSVDDRFHGQS 30
            RYGRERSVEKV+            ERN  D++K +      + S  DDRFHGQS
Sbjct: 1627 RYGRERSVEKVERVSDRYPEKSKDERNKDDRSKLRYSDSTVDKSHTDDRFHGQS 1680


>ref|XP_007045495.1| THO2 isoform 3 [Theobroma cacao] gi|508709430|gb|EOY01327.1| THO2
            isoform 3 [Theobroma cacao]
          Length = 1762

 Score =  164 bits (415), Expect(2) = 3e-45
 Identities = 116/289 (40%), Positives = 155/289 (53%), Gaps = 36/289 (12%)
 Frame = -3

Query: 915  ESASGHMKVKGGDSQSSSTSAVP--AGTSKSLENKKQMDEPAIKTLDENISKAGPKISAE 742
            E   G++++K   S +S + A    AGT KSLEN+KQ+DE + K LDE+++K   K SAE
Sbjct: 1229 EFGMGYLELKPATSLASKSLAATSQAGTGKSLENQKQLDESSNK-LDEHLAKVPAKNSAE 1287

Query: 741  SELRASGKRTVPV-TVLKPSKQEIAKDDSKFAKPMDRTSTSSAIDKDIPTHPSEGRPGGA 565
             E +AS KR+ P  ++ K  KQ+  KDD K  K + RTS +  ID+D+P+H +EGR GG 
Sbjct: 1288 LESKASAKRSAPAGSLTKTQKQDPGKDDGKSGKAVGRTSVTCVIDRDVPSH-TEGRQGGT 1346

Query: 564  NNLISTVTANGNTVPXXXXXXXXXXXXXXXXXXXXXXAKFSALKDDNAELPDI--PRVRP 391
             N+ S VT+NG                                KDD +ELPD   P  R 
Sbjct: 1347 TNVPSAVTSNG--------------------------------KDDGSELPDASRPSSRI 1374

Query: 390  VHSPRHDSSSIPPSKSVDKQVKRTSPAEEPDRLSKRRKGDRDLED--------------E 253
            VHSPRHDSS+   SKS DK  KRT+P EE DRL+KRRKGD +L+D              +
Sbjct: 1375 VHSPRHDSSA-TVSKSSDKLQKRTTPVEETDRLTKRRKGDVELKDLDGEVRLSDRERSTD 1433

Query: 252  PRFVTDEKSGNEE---HRS--------------KEHREYRERLERSDKS 157
            P+    +K G +E   HR+              +  R+YRERLER +KS
Sbjct: 1434 PQLADFDKPGTDELTSHRAVDKPLDRSKDKGSERHDRDYRERLERPEKS 1482



 Score = 46.2 bits (108), Expect(2) = 3e-45
 Identities = 29/55 (52%), Positives = 34/55 (61%), Gaps = 15/55 (27%)
 Frame = -1

Query: 149  RYGRERSVEKVQERNFS---DKAKD--------KPRHTETSS----VDDRFHGQS 30
            RYGRERSVE+  +RN     DKAKD        K R+ +TS+    VDDRFHGQS
Sbjct: 1499 RYGRERSVERSTDRNLERLGDKAKDERSKDERSKVRYADTSTEKSHVDDRFHGQS 1553

