BLASTX nr result

ID: Catharanthus22_contig00001599 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00001599
         (2227 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004234688.1| PREDICTED: uncharacterized protein LOC101255...   414   e-113
ref|XP_006346854.1| PREDICTED: uncharacterized protein LOC102582...   412   e-112
emb|CBI16864.3| unnamed protein product [Vitis vinifera]              402   e-109
ref|XP_002279557.2| PREDICTED: uncharacterized protein LOC100247...   401   e-109
ref|XP_006482308.1| PREDICTED: uncharacterized protein LOC102626...   396   e-107
gb|EMJ18348.1| hypothetical protein PRUPE_ppa003054mg [Prunus pe...   393   e-106
ref|XP_006430828.1| hypothetical protein CICLE_v10013582mg, part...   388   e-105
ref|XP_002525972.1| nucleic acid binding protein, putative [Rici...   385   e-104
ref|XP_006373469.1| hypothetical protein POPTR_0017s14060g [Popu...   380   e-102
gb|EOY04289.1| Proline-rich spliceosome-associated family protei...   378   e-102
ref|XP_004304149.1| PREDICTED: uncharacterized protein LOC101295...   369   3e-99
gb|EXB72259.1| Zinc finger CCHC domain-containing protein 8 [Mor...   367   2e-98
ref|XP_006593391.1| PREDICTED: uncharacterized protein LOC100527...   364   1e-97
gb|ADN34281.1| nucleic acid binding protein [Cucumis melo subsp....   362   3e-97
ref|XP_004169819.1| PREDICTED: uncharacterized protein LOC101230...   360   1e-96
ref|XP_004141493.1| PREDICTED: uncharacterized protein LOC101212...   360   1e-96
ref|XP_002305958.1| proline-rich spliceosome-associated family p...   359   3e-96
ref|XP_006603953.1| PREDICTED: uncharacterized protein LOC100805...   356   2e-95
ref|XP_004514436.1| PREDICTED: uncharacterized protein LOC101500...   354   8e-95
ref|XP_002329267.1| predicted protein [Populus trichocarpa]           354   1e-94

>ref|XP_004234688.1| PREDICTED: uncharacterized protein LOC101255771 [Solanum
            lycopersicum]
          Length = 530

 Score =  414 bits (1065), Expect = e-113
 Identities = 218/452 (48%), Positives = 286/452 (63%), Gaps = 18/452 (3%)
 Frame = +1

Query: 295  METEDVISLPASSSPANGGDNEELNDSGCQPSEQNCQSLDVLE--------VNVNESHGS 450
            M TED  +LPAS +   G +N E+  +G      N +  +  E        +++    GS
Sbjct: 1    MGTEDPNNLPASDNLERGIENIEVGANGDTSKTTNFEVSESNEPLRESDSDMDLESDPGS 60

Query: 451  ST--DIVNEDNQLLLDADNKSENQDKLEVI-----ADAGLME--DGNGLCNHAEDAFQDS 603
                D+    +Q+ ++        +++ ++     A+ GL+   D N   N  ED    S
Sbjct: 61   QVGVDLTGTPSQVCVELAETVGITEEVTMVDSVIHAENGLLSLPDANYSSNQTEDQDHVS 120

Query: 604  CPQTRANSLSGAFKRPRLAEDEQQPSVQVIYKSLTRDSKRKLEELLQQWSQWHAQNCSSA 783
              +          KRPR   D +QPSV V+Y SLTR+S++ LE LLQQWS+WHA++CSSA
Sbjct: 121  TQEIGGVKCLSGVKRPRATLDVEQPSVHVVYDSLTRESRKMLEGLLQQWSEWHAKHCSSA 180

Query: 784  NDSGSGMVSGEETYFPALQVGADKPSAMSFWMENETSGQNKELIRFDGNSVPLYDRGYXX 963
             DS   + SGEETYFPAL VG +KPSA+++W++ + S    E I  DGNS+PLYDRGY  
Sbjct: 181  QDSRELLESGEETYFPALHVGLEKPSAVTYWVDKQASNNKSEFIPLDGNSIPLYDRGYSF 240

Query: 964  XXXXXXXXXXXERVRERAEDSRCFNCGSYSHSLKDCPKPRDNVAVNNARKQHKVRRNQNA 1143
                       ER  E  + SRCFNCGSY H+LK+CPKPRDN AVN+ARKQHK RRNQ+A
Sbjct: 241  ALTATDSSTNVERGIEMVDSSRCFNCGSYGHALKECPKPRDNAAVNSARKQHKSRRNQSA 300

Query: 1144 AARNPTRYYQNTPGGKYDGLKPGTLEAETRKLLGLGELDPPPWLNRMREIGYPPGYLEVD 1323
            ++RNPTRYYQ++P GKYDGL+PG L++ETRKLLGLGELDPPPW+NRMR++GYPPGYLE D
Sbjct: 301  SSRNPTRYYQDSPRGKYDGLRPGALDSETRKLLGLGELDPPPWINRMRQMGYPPGYLEDD 360

Query: 1324 -DQPSGITIFXXXXXXXXXXXXXXXXXXHNKAPKKKSVEFPGVNAPIPENADEERWAASG 1500
             DQPSGITIF                      P+K +V+FPGVNAPIPE+ADE RW A+ 
Sbjct: 361  EDQPSGITIFADEKNKEETEEGEILDKSLPNLPRKMTVDFPGVNAPIPEHADERRWEAAP 420

Query: 1501 SSDLYMSRNHSYSRYNRPSEPISRGRHHEQQR 1596
            SS  Y SR+HS++RYN   + ++RG +HEQ+R
Sbjct: 421  SSSRY-SRSHSHNRYNHAQDYVNRGHYHEQRR 451


>ref|XP_006346854.1| PREDICTED: uncharacterized protein LOC102582187 [Solanum tuberosum]
          Length = 530

 Score =  412 bits (1059), Expect = e-112
 Identities = 214/452 (47%), Positives = 283/452 (62%), Gaps = 18/452 (3%)
 Frame = +1

Query: 295  METEDVISLPASSSPANGGDNEELNDSG----------CQPSEQNCQSLDVLEVNVNESH 444
            M TED  + PAS +   G +N E+  +G           +  E   +S   +++  +   
Sbjct: 1    MGTEDPNNCPASDNLERGIENSEVGANGDTSKPTNFVVSESKEPQQESDSDMDLESDPGS 60

Query: 445  GSSTDIVNEDNQLLLDADNKSENQDKLEVI-----ADAGLME--DGNGLCNHAEDAFQDS 603
                D+    +Q+ ++     E  +++  +     A+ GL+   D N   N  +D    S
Sbjct: 61   QVGVDLTGTPSQVGVELAETVEITEEVTTLDSVVHAENGLLSLPDRNNSSNQTKDQDHVS 120

Query: 604  CPQTRANSLSGAFKRPRLAEDEQQPSVQVIYKSLTRDSKRKLEELLQQWSQWHAQNCSSA 783
              +          KRPR   D +QPSV V+Y SLTR+S+  LE LLQQWS+WHA++CSSA
Sbjct: 121  TQEIGGVKCLSGVKRPRATLDVEQPSVHVVYDSLTRESRNMLEGLLQQWSEWHAKHCSSA 180

Query: 784  NDSGSGMVSGEETYFPALQVGADKPSAMSFWMENETSGQNKELIRFDGNSVPLYDRGYXX 963
            +DS   + SGEETYFPAL VG + PSA+++W++ + S    E I  DGNS+PLYDRGY  
Sbjct: 181  HDSRELLESGEETYFPALHVGLENPSAVTYWVDKQASNNKSEFIPLDGNSIPLYDRGYSF 240

Query: 964  XXXXXXXXXXXERVRERAEDSRCFNCGSYSHSLKDCPKPRDNVAVNNARKQHKVRRNQNA 1143
                       ER  E  + SRCFNCGSY H+LK+CPKPRDN AVN+ARKQHK RRNQ+A
Sbjct: 241  ALTATDSSTNVERGMEMVDSSRCFNCGSYGHALKECPKPRDNAAVNSARKQHKSRRNQSA 300

Query: 1144 AARNPTRYYQNTPGGKYDGLKPGTLEAETRKLLGLGELDPPPWLNRMREIGYPPGYLEVD 1323
            ++RNPTRYYQ++P GKYDGL+PG L++ETRKLLGLGELDPPPW+NRMR++GYPPGYLE D
Sbjct: 301  SSRNPTRYYQDSPRGKYDGLRPGALDSETRKLLGLGELDPPPWINRMRQMGYPPGYLEED 360

Query: 1324 -DQPSGITIFXXXXXXXXXXXXXXXXXXHNKAPKKKSVEFPGVNAPIPENADEERWAASG 1500
             DQPSGITIF                      P+K SV+FPGVNAPIPE+ADE RW A+ 
Sbjct: 361  EDQPSGITIFADEKNKEETEEGEILDKSFPNPPRKMSVDFPGVNAPIPEHADERRWEAAP 420

Query: 1501 SSDLYMSRNHSYSRYNRPSEPISRGRHHEQQR 1596
            SS  Y SR++S++RYN   + ++RG +HEQ+R
Sbjct: 421  SSSRY-SRSYSHNRYNHAQDYVNRGHYHEQRR 451


>emb|CBI16864.3| unnamed protein product [Vitis vinifera]
          Length = 1165

 Score =  402 bits (1032), Expect = e-109
 Identities = 230/465 (49%), Positives = 292/465 (62%), Gaps = 26/465 (5%)
 Frame = +1

Query: 277  PERML---EMETEDVISLPASSSPANGGDNEELNDSGCQPSEQNCQSL--DVLE--VNVN 435
            P+++L   +M TE++I+ PA S    G ++ EL++S  +P E +  S   +V E  +N+ 
Sbjct: 582  PQKLLLDSDMGTEELINPPAPSGSVCGSEDNELHNSNPEPGEADSSSSNSEVKEDKLNIE 641

Query: 436  ESHGSSTDIVNEDNQL----LLD---ADNKSENQDKLEV----IADAGLMEDGNGL---- 570
                +  D    D++L    +LD    D +  +Q  +EV    +    +    +G+    
Sbjct: 642  SLMQNKVDFEKVDSRLTPGVVLDKDLVDKQLTSQGSVEVTETIVVTKLINSSSSGVPTEN 701

Query: 571  -CNHAEDAFQDSCPQTRANSLSGAFKRPRLAEDEQQPSVQVIYKSLTRDSKRKLEELLQQ 747
             C  A D            S+SG  KR RL  DEQQPSV VIY SLTRDSKRKLEELLQQ
Sbjct: 702  GCLTAPDEGPIGNHMIDGTSISGV-KRARLTIDEQQPSVHVIYNSLTRDSKRKLEELLQQ 760

Query: 748  WSQWHAQNCSSANDSGSGMVSGEETYFPALQVGADKPSAMSFWMENET-SGQNKELIRFD 924
            WS+WHA+  SS++D    + SGE+TYFPAL VG +K SA+SFW++N+T   Q+KE I  D
Sbjct: 761  WSEWHAKYVSSSHDPKGQLDSGEKTYFPALHVGLNKSSAVSFWVDNQTRKQQDKEFISLD 820

Query: 925  GNSVPLYDRGYXXXXXXXXXXXXXERVRERAEDSRCFNCGSYSHSLKDCPKPRDNVAVNN 1104
            G+SVPLYDRG+             E   E  + SRCFNCGSY+HS+K+CPKPRDNVAVNN
Sbjct: 821  GDSVPLYDRGFALGLVSEDGQSKPEGALEIIDASRCFNCGSYNHSMKECPKPRDNVAVNN 880

Query: 1105 ARKQHKVRRNQNAAARNPTRYYQNTPGGKYDGLKPGTLEAETRKLLGLGELDPPPWLNRM 1284
            ARKQHK RRNQN  +RNPTRYYQN+PGG+YDGL+PG L  ETR+LLGLGELDPPPWLNRM
Sbjct: 881  ARKQHKSRRNQNPGSRNPTRYYQNSPGGRYDGLRPGALGVETRELLGLGELDPPPWLNRM 940

Query: 1285 REIGYPPGYL--EVDDQPSGITIFXXXXXXXXXXXXXXXXXXHNKAPKKKSVEFPGVNAP 1458
            RE+GYPPGYL  E ++QPSGITI+                  + +  +K SVEFPG+NAP
Sbjct: 941  REMGYPPGYLDPEEEEQPSGITIYADEEVKDEQEDGEILETEYLEPQRKMSVEFPGINAP 1000

Query: 1459 IPENADEERWAASGSSDLYMSRNHSYSRYNRPSEPISRGRHHEQQ 1593
            IP+NADE RWAA   S  +   NHSY       EP SR   HEQ+
Sbjct: 1001 IPKNADERRWAA--GSRPHRRLNHSY-------EPSSRRNSHEQR 1036


>ref|XP_002279557.2| PREDICTED: uncharacterized protein LOC100247996 [Vitis vinifera]
          Length = 575

 Score =  401 bits (1030), Expect = e-109
 Identities = 230/458 (50%), Positives = 287/458 (62%), Gaps = 23/458 (5%)
 Frame = +1

Query: 295  METEDVISLPASSSPANGGDNEELNDSGCQPSEQNCQSL--DVLE--VNVNESHGSSTDI 462
            M TE++I+ PA S    G ++ EL++S  +P E +  S   +V E  +N+     +  D 
Sbjct: 1    MGTEELINPPAPSGSVCGSEDNELHNSNPEPGEADSSSSNSEVKEDKLNIESLMQNKVDF 60

Query: 463  VNEDNQL----LLD---ADNKSENQDKLEV----IADAGLMEDGNGL-----CNHAEDAF 594
               D++L    +LD    D +  +Q  +EV    +    +    +G+     C  A D  
Sbjct: 61   EKVDSRLTPGVVLDKDLVDKQLTSQGSVEVTETIVVTKLINSSSSGVPTENGCLTAPDEG 120

Query: 595  QDSCPQTRANSLSGAFKRPRLAEDEQQPSVQVIYKSLTRDSKRKLEELLQQWSQWHAQNC 774
                      S+SG  KR RL  DEQQPSV VIY SLTRDSKRKLEELLQQWS+WHA+  
Sbjct: 121  PIGNHMIDGTSISGV-KRARLTIDEQQPSVHVIYNSLTRDSKRKLEELLQQWSEWHAKYV 179

Query: 775  SSANDSGSGMVSGEETYFPALQVGADKPSAMSFWMENET-SGQNKELIRFDGNSVPLYDR 951
            SS++D    + SGE+TYFPAL VG +K SA+SFW++N+T   Q+KE I  DG+SVPLYDR
Sbjct: 180  SSSHDPKGQLDSGEKTYFPALHVGLNKSSAVSFWVDNQTRKQQDKEFISLDGDSVPLYDR 239

Query: 952  GYXXXXXXXXXXXXXERVRERAEDSRCFNCGSYSHSLKDCPKPRDNVAVNNARKQHKVRR 1131
            G+             E   E  + SRCFNCGSY+HS+K+CPKPRDNVAVNNARKQHK RR
Sbjct: 240  GFALGLVSEDGQSKPEGALEIIDASRCFNCGSYNHSMKECPKPRDNVAVNNARKQHKSRR 299

Query: 1132 NQNAAARNPTRYYQNTPGGKYDGLKPGTLEAETRKLLGLGELDPPPWLNRMREIGYPPGY 1311
            NQN  +RNPTRYYQN+PGG+YDGL+PG L  ETR+LLGLGELDPPPWLNRMRE+GYPPGY
Sbjct: 300  NQNPGSRNPTRYYQNSPGGRYDGLRPGALGVETRELLGLGELDPPPWLNRMREMGYPPGY 359

Query: 1312 L--EVDDQPSGITIFXXXXXXXXXXXXXXXXXXHNKAPKKKSVEFPGVNAPIPENADEER 1485
            L  E ++QPSGITI+                  + +  +K SVEFPG+NAPIP+NADE R
Sbjct: 360  LDPEEEEQPSGITIYADEEVKDEQEDGEILETEYLEPQRKMSVEFPGINAPIPKNADERR 419

Query: 1486 WAASGSSDLYMSRNHSYSRYNRPSEPISRGRHHEQQRW 1599
            WAA   S  +   NHSY       EP SR   HE QRW
Sbjct: 420  WAA--GSRPHRRLNHSY-------EPSSRRNSHE-QRW 447


>ref|XP_006482308.1| PREDICTED: uncharacterized protein LOC102626617 [Citrus sinensis]
          Length = 553

 Score =  396 bits (1018), Expect = e-107
 Identities = 229/458 (50%), Positives = 283/458 (61%), Gaps = 23/458 (5%)
 Frame = +1

Query: 295  METEDVISLPASSSPANGGDNEELNDSGCQPSEQNCQSLDVLEVNVNESHGSSTDIVNED 474
            ME EDVI L ASS      +N E+     +P + + Q  D  E   ++S+G S ++ NE 
Sbjct: 1    MEAEDVIDLLASSPSGCEEENNEMPGRDGEPGKSDFQPNDS-EKKEDDSNGESMEL-NEL 58

Query: 475  NQLLLDADNKSENQDKLEVIADAGLMEDGNGLCNHAEDAF--------QDSCPQTR---- 618
            N  + D     E +   +V+ D+ +  +G      AE           Q+ C +      
Sbjct: 59   NVEIEDGQLIEEGEVGKDVVDDSNVNVEGTTTVELAETIVESDSRIHVQNGCLEVGNRSP 118

Query: 619  -------ANSLSGAFKRPRLAEDEQQPSVQVIYKSLTRDSKRKLEELLQQWSQWHAQNCS 777
                    +S SG  KR R+  DE+QPSV VIY SLTR SK+KLEELLQQWS+W AQ  S
Sbjct: 119  NHNRMKDVSSTSGV-KRARMTLDEEQPSVHVIYNSLTRASKQKLEELLQQWSEWQAQFGS 177

Query: 778  SANDSGSGMVSGEETYFPALQVGADKPSAMSFWMENETSGQ-NKELIRFDGNSVPLYDRG 954
            S+ND   G+  GE+T+FPA++VG  K  A+SFW++N+T  Q NK  I  D +S PLYDRG
Sbjct: 178  SSNDPNEGIEFGEQTFFPAIRVGKAKGPAVSFWIDNQTRNQQNKNFIPSDSHSTPLYDRG 237

Query: 955  YXXXXXXXXXXXXXERVRERAED-SRCFNCGSYSHSLKDCPKPRDNVAVNNARKQHKVRR 1131
            Y             E   E  +D SRCFNCGSYSHSLK+CPKPRD  AVNNARKQHK +R
Sbjct: 238  YALGLTSGDGSSNLEGGLEIIDDASRCFNCGSYSHSLKECPKPRDKDAVNNARKQHKSKR 297

Query: 1132 NQNAAARNPTRYYQNTPGGKYDGLKPGTLEAETRKLLGLGELDPPPWLNRMREIGYPPGY 1311
            NQN+A+RNP RYYQN+ GGKYDGL+PG L+AETR+LLGLGELDPPPWL+RMRE+GYPPGY
Sbjct: 298  NQNSASRNPMRYYQNSAGGKYDGLRPGALDAETRQLLGLGELDPPPWLHRMRELGYPPGY 357

Query: 1312 L--EVDDQPSGITIFXXXXXXXXXXXXXXXXXXHNKAPKKKSVEFPGVNAPIPENADEER 1485
            L  E DDQPSGITI+                     + +K + EFPG+NAPIPENADE  
Sbjct: 358  LDSEDDDQPSGITIYADREIKEGQEDGEIIETGRPASKRKMTAEFPGINAPIPENADERL 417

Query: 1486 WAASGSSDLYMSRNHSYSRYNRPSEPISRGRHHEQQRW 1599
            WAA  SS    SR+ S+ R N  SE ISRGR+HE QRW
Sbjct: 418  WAARPSSS-DSSRDRSHHRLNHHSESISRGRYHE-QRW 453


>gb|EMJ18348.1| hypothetical protein PRUPE_ppa003054mg [Prunus persica]
          Length = 608

 Score =  393 bits (1009), Expect = e-106
 Identities = 242/509 (47%), Positives = 292/509 (57%), Gaps = 74/509 (14%)
 Frame = +1

Query: 295  METEDVISLPASSSPANGGDNEELNDSGCQPSEQNCQSLD-------------------- 414
            ME+ED I LP S        N+E N+S C   E N Q  +                    
Sbjct: 1    MESEDFIGLPPSGDSLCRNVNDEPNNSNCDSKEVNSQPTNSEDREDKPKSENLGNDSDAQ 60

Query: 415  ------VLEVNV-NESHGSSTDIVNEDNQLLLDADNKSENQDKLEVIADAGLMEDGNGLC 573
                  V E N+ NE  GS +D+  ED   L  A N+S++ D  E I   G  +DG+  C
Sbjct: 61   REVSHCVPEENLENELVGSGSDMEIEDISNL-PALNRSDSAD--EEIKIKG-NKDGDAHC 116

Query: 574  ----NHAEDAF-----------------------------------QDSCP----QTRAN 624
                NH  D F                                   QD+ P    +T   
Sbjct: 117  LQQANHNNDLFDESSLLSVAQSETVTVAQESNVFCSKVHKNGCLPVQDASPFGTHKTGGT 176

Query: 625  SLSGAFKRPRLAEDEQQPSVQVIYKSLTRDSKRKLEELLQQWSQWHAQNCSSANDSGSGM 804
            ++SG  KR R+  DE+QPSV+V YKSLTR SK KLEELLQQWS+WHAQ   S+ D    +
Sbjct: 177  TISGV-KRARITVDERQPSVRVTYKSLTRASKHKLEELLQQWSEWHAQYVPSSQDPIEVV 235

Query: 805  VSGEETYFPALQVGADKPSAMSFWMENET-SGQNKELIRFDGNSVPLYDRGYXXXXXXXX 981
             SGE+T+FPAL VG +K SA+SFWM+N+T   ++KE    D N VPLYDRGY        
Sbjct: 236  ESGEDTFFPALHVGTEKTSAVSFWMDNQTRKAESKESTPLDSNYVPLYDRGYALGLTLAG 295

Query: 982  XXXXXERVRERAED-SRCFNCGSYSHSLKDCPKPRDNVAVNNARKQHKVRRNQNAAARNP 1158
                 E   E  +D SRCFNCGSY+HSLKDCPKPR++VAVNNARKQ K +RNQNA +RN 
Sbjct: 296  GSSNLEGGLEIIDDASRCFNCGSYNHSLKDCPKPRNHVAVNNARKQLKFKRNQNANSRNS 355

Query: 1159 TRYYQNTPGGKYDGLKPGTLEAETRKLLGLGELDPPPWLNRMREIGYPPGYLEVD--DQP 1332
            TRYYQN+P GKYDGL+PG L+AETRKLLG+GELDPPPWLNRMREIGYPPGYL+ D  DQP
Sbjct: 356  TRYYQNSPAGKYDGLRPGALDAETRKLLGIGELDPPPWLNRMREIGYPPGYLDPDDEDQP 415

Query: 1333 SGITIFXXXXXXXXXXXXXXXXXXHNKAPKKKSVEFPGVNAPIPENADEERWAASGSSDL 1512
            SGI I+                  + +  +K +VEFPG+N PIPE+ADE  W A G S  
Sbjct: 416  SGIIIYADEEIKGEQEDGEIIETDYPEPQRKMTVEFPGLNGPIPEDADERLW-APGPSFS 474

Query: 1513 YMSRNHSYSRYNRPSEPISRGRHHEQQRW 1599
              SRN SYSR N  SEP+SRG HH +QRW
Sbjct: 475  DHSRNRSYSRSNHYSEPVSRG-HHREQRW 502


>ref|XP_006430828.1| hypothetical protein CICLE_v10013582mg, partial [Citrus clementina]
            gi|557532885|gb|ESR44068.1| hypothetical protein
            CICLE_v10013582mg, partial [Citrus clementina]
          Length = 1076

 Score =  388 bits (996), Expect = e-105
 Identities = 226/460 (49%), Positives = 282/460 (61%), Gaps = 23/460 (5%)
 Frame = +1

Query: 289  LEMETEDVISLPASSSPANGGDNEELNDSGCQPSEQNCQSLDVLEVNVNESHGSSTDIVN 468
            L ME EDVI L ASS      +N E+ D   +P + + Q  D  E   ++S+G S ++ N
Sbjct: 522  LYMEAEDVIDLLASSPSGCEEENNEMPDRDGEPGKSDFQPNDS-EKKEDDSNGESMEL-N 579

Query: 469  EDNQLLLDADNKSENQDKLEVIADAGLMEDGNGLCNHAEDAF--------QDSCPQTR-- 618
            E N  + D     E +   +V+ D+ +  +G      AE           Q+ C +    
Sbjct: 580  ELNVEIEDGQLIEEGEVGKDVVDDSNVNVEGTTTVELAETIVESDSRIHVQNGCLEVGNR 639

Query: 619  ---------ANSLSGAFKRPRLAEDEQQPSVQVIYKSLTRDSKRKLEELLQQWSQWHAQN 771
                      +S+SG  KR R+  DE+QPSV VIY SLTR SK+KLEELLQQWS+W AQ 
Sbjct: 640  SPNHNRMKDVSSISGV-KRARMTLDEEQPSVHVIYNSLTRASKQKLEELLQQWSEWQAQF 698

Query: 772  CSSANDSGSGMVSGEETYFPALQVGADKPSAMSFWMENETSGQ-NKELIRFDGNSVPLYD 948
             SS+ND   G+  GE+T+FPA++VG  K  A+  +++ +   Q NK  I  D +S PLYD
Sbjct: 699  GSSSNDPNEGIEFGEQTFFPAIRVGKAKGPAVVIFLDRQPKQQQNKNFIPSDSHSTPLYD 758

Query: 949  RGYXXXXXXXXXXXXXERVRERAED-SRCFNCGSYSHSLKDCPKPRDNVAVNNARKQHKV 1125
            RGY             E   E  +D SRCFNCGSYSHSLK+CPKPRD  AVNNARKQHK 
Sbjct: 759  RGYALGLTSGDGSSNLEGGLEIIDDASRCFNCGSYSHSLKECPKPRDKDAVNNARKQHKS 818

Query: 1126 RRNQNAAARNPTRYYQNTPGGKYDGLKPGTLEAETRKLLGLGELDPPPWLNRMREIGYPP 1305
            +RNQN+A+RNP RYYQN+ GGKYDGL+PG L+AETR+LLGLGELDPPPWL+RMRE+GYPP
Sbjct: 819  KRNQNSASRNPMRYYQNSAGGKYDGLRPGALDAETRQLLGLGELDPPPWLHRMRELGYPP 878

Query: 1306 GYL--EVDDQPSGITIFXXXXXXXXXXXXXXXXXXHNKAPKKKSVEFPGVNAPIPENADE 1479
            GYL  E DDQPSGITI+                     + +K + EFPG+NAPIPENADE
Sbjct: 879  GYLDSEDDDQPSGITIYADGEIKEGQEDGEIIETGRPASKRKMTTEFPGINAPIPENADE 938

Query: 1480 ERWAASGSSDLYMSRNHSYSRYNRPSEPISRGRHHEQQRW 1599
              WAA  SS    SR+ S+ R N  SE ISRGR+HE QRW
Sbjct: 939  RLWAARPSSS-DSSRDRSHHRLNHHSESISRGRYHE-QRW 976


>ref|XP_002525972.1| nucleic acid binding protein, putative [Ricinus communis]
            gi|223534704|gb|EEF36396.1| nucleic acid binding protein,
            putative [Ricinus communis]
          Length = 693

 Score =  385 bits (989), Expect = e-104
 Identities = 232/534 (43%), Positives = 293/534 (54%), Gaps = 98/534 (18%)
 Frame = +1

Query: 286  MLEMETEDVISLPASSSPANGGDNEELNDSGCQPSEQNCQ-------------------S 408
            +L METED+ISLP S++  +G +N EL+     P E   Q                    
Sbjct: 36   ILSMETEDMISLPDSTNSGDGIENNELDQPESGPGEAESQPSNYEAEEGMIDGHNMGLNE 95

Query: 409  LDV-----------LEVNVNE------SHGSSTDIVNEDN--------------QLLLDA 495
            +D+           LE+N N+      + GS    +NE+N              + L+DA
Sbjct: 96   VDIGNKTETSDPEKLELNQNDFGAEECTKGSKDSELNEENVKTEECSAVQENLGENLVDA 155

Query: 496  DNKSENQDKLEVIADAGLMEDGNGLCNHAEDA---------------------------- 591
              + +  D+  +  + G++ +    C    D                             
Sbjct: 156  VTEEDTIDRDYLFLNQGVVREEGAQCLVETDVDMDLVDSPVMQVNIEVAEAVAVSGNLSS 215

Query: 592  ------FQDSCPQTRANSLS----------GAFKRPRLAEDEQQPSVQVIYKSLTRDSKR 723
                   Q+SC  T+  SL              KR R+A +EQQPSV V Y SLTR SKR
Sbjct: 216  FGFRLNAQNSCLDTQNESLIQNHMMKGGHVSGVKRARIAYNEQQPSVHVTYNSLTRASKR 275

Query: 724  KLEELLQQWSQWHAQNCSSANDSGSGMVSGEETYFPALQVGADKPSAMSFWMENETSGQ- 900
            KLEELLQQWS+WH Q  SS+ D    + SGEETYFPAL VG +K SA+SFW+EN+T  Q 
Sbjct: 276  KLEELLQQWSEWHVQRGSSSQDLNEVLESGEETYFPALCVGTEKSSAVSFWIENQTKKQL 335

Query: 901  NKELIRFDGNSVPLYDRGYXXXXXXXXXXXXXERVRERA-EDSRCFNCGSYSHSLKDCPK 1077
            N +LI  D +SVPLYDRG+             E   E   E +RCFNCGSYSH+LK+CPK
Sbjct: 336  NNDLISSDSDSVPLYDRGFAIGLTSTDGPSNVEGGLEIVNEAARCFNCGSYSHALKECPK 395

Query: 1078 PRDNVAVNNARKQHKVRRNQNAAARNPTRYYQNTPGGKYDGLKPGTLEAETRKLLGLGEL 1257
            PR+N AVNNARKQHK +RNQNA +RN TRYYQ++ GGKY+GLKPG+L+AETR+LLGLGEL
Sbjct: 396  PRNNAAVNNARKQHKSKRNQNAGSRNGTRYYQSSSGGKYEGLKPGSLDAETRRLLGLGEL 455

Query: 1258 DPPPWLNRMREIGYPPGYLEVD--DQPSGITIFXXXXXXXXXXXXXXXXXXHNKAPKKKS 1431
            DPPPWLNRMRE+GYPPGYL+ D  DQPSGI IF                  +   P+K +
Sbjct: 456  DPPPWLNRMRELGYPPGYLDPDDEDQPSGIIIFADGDIKDEQEDGEIIETENPDPPRKMA 515

Query: 1432 VEFPGVNAPIPENADEERWAASGSSDLYMSRNHSYSRYNRPSEPISRGRHHEQQ 1593
            VEFPG+NAPIPENADE  W  +G S     RN  + + +  SE ISR  HHEQ+
Sbjct: 516  VEFPGINAPIPENADERLW-ETGPSSYNSFRNRPFRKSDHSSETISRWHHHEQR 568


>ref|XP_006373469.1| hypothetical protein POPTR_0017s14060g [Populus trichocarpa]
            gi|550320291|gb|ERP51266.1| hypothetical protein
            POPTR_0017s14060g [Populus trichocarpa]
          Length = 615

 Score =  380 bits (977), Expect = e-102
 Identities = 225/501 (44%), Positives = 289/501 (57%), Gaps = 66/501 (13%)
 Frame = +1

Query: 295  METEDVISLPASSSPANGGDNEELNDSGCQPSEQN------------------------- 399
            MET+D+I LP S       +N+EL+ S   PSE                           
Sbjct: 1    METDDMIGLPGSIDFGYKNENDELSKSDFGPSESRSQPCSNDGKESKDDEEGLGLCEGVV 60

Query: 400  ----------CQSLDVLEVNVNESHGSSTDIVNEDNQLL-------------LDADNKSE 510
                      C  L+V +    E+    +++V E+  +              +D      
Sbjct: 61   GNEEGIVDPGCSGLNVGDTGTEEAATDQSNLVLEERDIGSKGVQFAVETEADMDLVVSPV 120

Query: 511  NQDKLEVIADAGLME---DGNGLCNHAEDAFQDSCPQTRANSLS----------GAFKRP 651
             Q  L+V+ DA ++    D + +  + ED F D    T+ NSL              KR 
Sbjct: 121  RQVNLDVV-DAVIVSKKPDISSIIGNVEDCFLD----TQNNSLVQQGKVDGSHISGVKRK 175

Query: 652  RLAEDEQQPSVQVIYKSLTRDSKRKLEELLQQWSQWHAQNCSSANDSGSGMVSGEETYFP 831
            R+A DEQQPSV V+Y SLTR  K+KLEELLQQWS+WHAQ  +S++DS   + SGE+TYFP
Sbjct: 176  RMAYDEQQPSVHVMYNSLTRSGKQKLEELLQQWSEWHAQQ-NSSHDSDEMLQSGEDTYFP 234

Query: 832  ALQVGADKPSAMSFWMENET-SGQNKELIRFDGNSVPLYDRGYXXXXXXXXXXXXXERVR 1008
            AL+VG +K SA+SFW+EN+    Q+ +LI    N VPLYDRGY             E   
Sbjct: 235  ALRVGMEKSSAVSFWIENQARKQQDNDLILQHSNFVPLYDRGYVLGLTSADGPINVEGGL 294

Query: 1009 ERAEDS-RCFNCGSYSHSLKDCPKPRDNVAVNNARKQHKVRRNQNAAARNPTRYYQNTPG 1185
            E  + + RCFNCG+Y+HSLK+CPKPRDN AVNNARKQHK +RNQN+++RNPTRYYQ++ G
Sbjct: 295  EIVDAAARCFNCGAYNHSLKECPKPRDNAAVNNARKQHKFKRNQNSSSRNPTRYYQSSSG 354

Query: 1186 GKYDGLKPGTLEAETRKLLGLGELDPPPWLNRMREIGYPPGYLEVD--DQPSGITIFXXX 1359
            GKYDGLKPG+L+ ETR+LLGLGELDPPPWLNRMRE+GYPPGYL+ D  DQPSGITIF   
Sbjct: 355  GKYDGLKPGSLDTETRQLLGLGELDPPPWLNRMRELGYPPGYLDPDDEDQPSGITIFDDG 414

Query: 1360 XXXXXXXXXXXXXXXHNKAPKKKSVEFPGVNAPIPENADEERW-AASGSSDLYMSRNHSY 1536
                           H + P+K SVEFPG+NAPIPENA++  W     SSD +  R+ S 
Sbjct: 415  DVEEEQEDGEIMETDHPEPPRKMSVEFPGINAPIPENANQRFWEVGPSSSDPF--RHRSR 472

Query: 1537 SRYNRPSEPISRGRHHEQQRW 1599
             R N  SE   R  HHEQ+++
Sbjct: 473  HRSNHSSEATGRWHHHEQRQY 493


>gb|EOY04289.1| Proline-rich spliceosome-associated family protein / zinc knuckle
            family protein, putative isoform 1 [Theobroma cacao]
            gi|508712393|gb|EOY04290.1| Proline-rich
            spliceosome-associated family protein / zinc knuckle
            family protein, putative isoform 1 [Theobroma cacao]
          Length = 595

 Score =  378 bits (971), Expect = e-102
 Identities = 224/477 (46%), Positives = 286/477 (59%), Gaps = 44/477 (9%)
 Frame = +1

Query: 295  METEDVISLPASS--SPANGGDNEELNDSGCQPSEQ--NCQSLD------VLEVNVNESH 444
            ME +D+I+LPASS  S +  G+  +L+D  CQ   Q  N ++ D       LEVN     
Sbjct: 1    MEGQDIINLPASSNSSGSESGELRDLDDGPCQVGSQPNNAETKDGEGKVESLEVNEGVIK 60

Query: 445  GSSTDIVNE---DNQLLLDADNKSENQDKLEVIADAGLMEDGNGLCNHAEDAF------- 594
               +D++ E   DN L+   D+ S+ Q   E+     + E   GL   A  A+       
Sbjct: 61   NPQSDLIVETEVDNTLV---DDSSDMQISDEITETVRVKETLEGLSFGAHSAYFTADEKM 117

Query: 595  ---QDSCP--QTRANSLSGA--------------FKRPRLAEDEQQPSVQVIYKSLTRDS 717
                 S P  + R ++ +G+               KRPR+  D+QQPSV ++Y  LTR S
Sbjct: 118  DGLSSSVPTKKRRLDAQNGSPIQNDMMDGIPISGVKRPRMTFDDQQPSVHIVYNFLTRAS 177

Query: 718  KRKLEELLQQWSQWHAQNCSSANDSGSGMVSGEETYFPALQVGADKPSAMSFWMENETSG 897
            K+KLEELLQ+WS+W A++ + + D    + SGEETYFPAL+VGA+KPS +SFW++N+T  
Sbjct: 178  KQKLEELLQKWSEWQAEHGTLSPDENELIESGEETYFPALRVGAEKPSTVSFWIDNQTRN 237

Query: 898  -QNKELIRFDGNSVPLYDRGYXXXXXXXXXXXXXERVRERAED-SRCFNCGSYSHSLKDC 1071
             ++ E+I  D N VPLYDRGY             E   E  +D SRCFNCGSYSHSLK C
Sbjct: 238  PRDTEIITLDSNIVPLYDRGYAMCLTSADGSSNLEGGLEIKDDASRCFNCGSYSHSLKQC 297

Query: 1072 PKPRDNVAVNNARKQH-KVRRNQNAAARNPTRYYQNTPGGKYDGLKPGTLEAETRKLLGL 1248
            PKPRDN+AVN ARKQH K +RNQN  +RN  RYYQ++ GGKYD LKPG L A+TR+LLGL
Sbjct: 298  PKPRDNLAVNAARKQHYKSKRNQNTGSRNAIRYYQSSQGGKYDDLKPGVLSADTRQLLGL 357

Query: 1249 GELDPPPWLNRMREIGYPPGYLEVD--DQPSGITIFXXXXXXXXXXXXXXXXXXHNKAPK 1422
            GE DPPPWLNRMREIGYP GYL  D  DQPSGITI+                  H +  K
Sbjct: 358  GEFDPPPWLNRMREIGYPTGYLAPDDEDQPSGITIYADGETNEEQEDGEITEVVHAEPEK 417

Query: 1423 KKSVEFPGVNAPIPENADEERWAASGSSDLYMSRNHSYSRYNRPSEPISRGRHHEQQ 1593
            K +VEFPG+NAPIP  ADE+ W A GSS    SR+ S+ R +  SEP SRG HHE++
Sbjct: 418  KMTVEFPGINAPIPVEADEKLW-APGSSSSESSRSRSHRRLHHSSEPGSRGHHHERR 473


>ref|XP_004304149.1| PREDICTED: uncharacterized protein LOC101295545 [Fragaria vesca
            subsp. vesca]
          Length = 553

 Score =  369 bits (947), Expect = 3e-99
 Identities = 219/461 (47%), Positives = 280/461 (60%), Gaps = 26/461 (5%)
 Frame = +1

Query: 295  METEDVISLPASSSPANGGDNEELNDSGCQPSEQNCQSLDVLEVNV-------NESHGSS 453
            ME+ED I+LP S    +G ++ ELND      E++ + +D   +N        N    SS
Sbjct: 1    MESEDFIALPDSGD--SGFEDGELND------EKDAKEVDAQPINSEDKEDKPNSERESS 52

Query: 454  TDIVNED--------NQLL-LDADNKSENQDKLEVIADAGLMEDG--NGLCNHAEDAFQD 600
                  +        N+L+  D+D + E+ + L  +  +G  E    N   +     +++
Sbjct: 53   AQREESECIPEASPANELVDNDSDMEIEDINNLPALTSSGPKEGDVQNVNIDLHSTLYEN 112

Query: 601  SCPQTRANSLSGAFKRPRLAEDEQQPSVQVIYKSLTRDSKRKLEELLQQWSQWHAQNCSS 780
                 +A       KR R   DEQQ SV+V Y  LTR SK KLEELLQQWS+WHA++ SS
Sbjct: 113  GHLAVQAKR---GVKRARTTVDEQQASVRVTYSHLTRASKHKLEELLQQWSEWHAKHVSS 169

Query: 781  ANDSGSGMVSGEETYFPALQVGADKPSAMSFWMENET-SGQNKELIRFDGNSVPLYDRGY 957
            + D+   + SGEET FPAL VG ++ S +SFWM+N+T + QN E +  D N  PLYDRGY
Sbjct: 170  SQDTPQVLESGEETLFPALHVGTERTSGVSFWMDNQTGTAQNMESLPLDSNYAPLYDRGY 229

Query: 958  XXXXXXXXXXXXXERVRERAED-SRCFNCGSYSHSLKDCPKPRDNVAVNNARKQHKVRRN 1134
                         E   E  +D SRCFNCGSY+H+L++CPKPRD+VAVN ARKQ K+++N
Sbjct: 230  ALGLTVAGSSTNQEGGLEIIDDASRCFNCGSYNHALRECPKPRDHVAVNKARKQLKIKKN 289

Query: 1135 QNAAARNPTRYYQNTPGGKYDGLKPGTLEAETRKLLGLGELDPPPWLNRMREIGYPPGYL 1314
            Q   +RN TRYYQN+P GKYDGL+PG LEAETRKLLGLGELDPPPWLNRMREIGYPPGYL
Sbjct: 290  QTPNSRNSTRYYQNSPAGKYDGLRPGALEAETRKLLGLGELDPPPWLNRMREIGYPPGYL 349

Query: 1315 EVD--DQPSGITIFXXXXXXXXXXXXXXXXXXHNKAP---KKKSVEFPGVNAPIPENADE 1479
            +VD  DQPSGI I+                    + P   +K +V FPG+NAPIPENADE
Sbjct: 350  DVDDEDQPSGIIIYGVEETKGEQEDGEIIETDLPEPPEPRRKMTVGFPGMNAPIPENADE 409

Query: 1480 ERWAASGS-SDLYMSRNHSYSRYNRPSEPISRGRHHEQQRW 1599
             RW  S S SD   SRNHS++R N   EP+SRG H+ +QRW
Sbjct: 410  RRWTPSPSVSD--PSRNHSHNRPNHYYEPVSRG-HYREQRW 447


>gb|EXB72259.1| Zinc finger CCHC domain-containing protein 8 [Morus notabilis]
          Length = 660

 Score =  367 bits (941), Expect = 2e-98
 Identities = 217/466 (46%), Positives = 271/466 (58%), Gaps = 30/466 (6%)
 Frame = +1

Query: 292  EMETEDVISLPASSSPANGGDNEELNDSGCQPSEQNCQSLDVLEVNV---NESHGSSTD- 459
            +ME ED+  +P  ++   G +N  ++     P +   QS+ + + ++   +E+  S+ D 
Sbjct: 80   DMEIEDLNGVPVLTAAGYGLENNGIDSFSNDPRQAGSQSVTLADKDMKTTSENLVSNMDG 139

Query: 460  IVNEDNQLLLDA-------DNKSENQDKLEVIADAGLMEDGNGL--CNHAEDAFQDS--- 603
               ED    L A       DN S  Q  + +   A + E       C     A QD    
Sbjct: 140  AQREDGTWKLKAIQEKDLADNSSLLQVNVNLTDTAAVAEASKTTFGCEIGRVAVQDKISI 199

Query: 604  --------CPQTRANSLSGAFKRPRLAEDEQQPSVQVIYKSLTRDSKRKLEELLQQWSQW 759
                    C     N      KR R+  +EQQPSV V +  LTR SK KLEEL+QQWS+W
Sbjct: 200  RTKKREGYCILCLVNYTISGVKRSRVMFEEQQPSVCVKFNFLTRSSKYKLEELMQQWSEW 259

Query: 760  HAQNCSSANDSGSGMVSGEETYFPALQVGADKPSAMSFWMENETSGQ-NKELIRFDGNSV 936
             AQ+ SS+ D    + SGEETYF AL +G +K S++ FW++ +T  Q N EL   D NSV
Sbjct: 260  QAQHHSSSQDPPEALESGEETYFSALHIGLEKASSVPFWIDKQTGKQQNNELSPLDCNSV 319

Query: 937  PLYDRGYXXXXXXXXXXXXXERVRERAEDS-RCFNCGSYSHSLKDCPKPRDNVAVNNARK 1113
            PLYDRG+             E   E  ED+ RCFNCGSY+H+LK+CPKPRDNVAVNNARK
Sbjct: 320  PLYDRGFALGLTSDGGSSNVEGGLEIVEDAVRCFNCGSYNHALKECPKPRDNVAVNNARK 379

Query: 1114 QHKVRRNQNAAARNPTRYYQNTPGGKYDGLKPGTLEAETRKLLGLGELDPPPWLNRMREI 1293
            Q K +RNQN ++RNPTRYYQN+P GKYDGLKPGTL+ ETRKLLGL ELDPPPWL RMREI
Sbjct: 380  QLKSKRNQNPSSRNPTRYYQNSPAGKYDGLKPGTLDPETRKLLGLRELDPPPWLGRMREI 439

Query: 1294 GYPPGYLEVD--DQPSGITIFXXXXXXXXXXXXXXXXXXHNK-APKKKSVEFPGVNAPIP 1464
            GYPPGYL+ D  DQPSGITI+                   N+  P+K +VEFPG+N PIP
Sbjct: 440  GYPPGYLDPDEEDQPSGITIYADGEGNKAEQEDGEIIEADNREPPRKMTVEFPGINGPIP 499

Query: 1465 ENADEERW-AASGSSDLYMSRNHSYSRYNRPSEPISRGRHHEQQRW 1599
            ENAD   W AA  SSD+Y  RN    R N  SEP  R  HH ++RW
Sbjct: 500  ENADRRIWTAAPASSDIY--RNRLLRRSNHSSEPTGRS-HHREERW 542


>ref|XP_006593391.1| PREDICTED: uncharacterized protein LOC100527170 isoform X1 [Glycine
            max] gi|571495821|ref|XP_006593392.1| PREDICTED:
            uncharacterized protein LOC100527170 isoform X2 [Glycine
            max]
          Length = 517

 Score =  364 bits (934), Expect = 1e-97
 Identities = 200/400 (50%), Positives = 261/400 (65%), Gaps = 10/400 (2%)
 Frame = +1

Query: 424  VNVNESHGSSTDIVNEDNQLLLDADNKSENQDKLEVIAD---AGLMEDGNGLCNHAEDAF 594
            +N +ES    +D + E  ++L D     ++  KL V+ +    G++ + NG C   ED  
Sbjct: 7    MNPSESSNLGSDSL-EKEKILEDETEDLQDGLKLSVVTEEMSGGVLAE-NG-CISLEDGS 63

Query: 595  QDSCPQTRANSLSGAFKRPRLAEDEQQPSVQVIYKSLTRDSKRKLEELLQQWSQWHAQNC 774
                 +T   S+SGA KR R+  DE QPSV   Y SLTR S++KL+ELLQQWS+WHA++ 
Sbjct: 64   LKRSIETVETSVSGA-KRARITVDEDQPSVHFTYNSLTRASRQKLQELLQQWSEWHAKHV 122

Query: 775  SSANDSGSGMVSGEETYFPALQVGADKPSAMSFWMENET-SGQNKELIRFDGNSVPLYDR 951
             S+ND+   + SGEET+FPAL VG +K SA+SFWMEN+T   +NK+ I    NSVPLYDR
Sbjct: 123  LSSNDASEVLESGEETFFPALHVGLEKTSAVSFWMENQTRKDKNKDFIPLADNSVPLYDR 182

Query: 952  GYXXXXXXXXXXXXXERVRERAEDS-RCFNCGSYSHSLKDCPKPRDNVAVNNARKQHKVR 1128
            GY             +   E  + + RCFNCGSY+HSL++CP+PRDN AVNNAR +HK R
Sbjct: 183  GYTLGLTSADGSSNVDGGLEIIDAAARCFNCGSYNHSLRECPRPRDNTAVNNARNKHKSR 242

Query: 1129 RNQNAAARNPTRYYQNTPGGKYDGLKPGTLEAETRKLLGLGELDPPPWLNRMREIGYPPG 1308
            RNQN+++RNPTRYYQN+P GKYDGL+PG L+  TR+LLGLGELDPPPWLNRMRE+GYPPG
Sbjct: 243  RNQNSSSRNPTRYYQNSPAGKYDGLRPGALDDATRQLLGLGELDPPPWLNRMRELGYPPG 302

Query: 1309 YLEVD--DQPSGITIFXXXXXXXXXXXXXXXXXXHNKAPKKKSVEFPGVNAPIPENADEE 1482
            YL+VD  DQPSGITI+                   +K  +KK+V+FPG+NAPIP+NADE 
Sbjct: 303  YLDVDDEDQPSGITIYTDREIADQEDGEIMEADA-SKPKRKKTVKFPGINAPIPDNADER 361

Query: 1483 RW---AASGSSDLYMSRNHSYSRYNRPSEPISRGRHHEQQ 1593
             W   A   SSD+  + +    R N  ++  SRG H E +
Sbjct: 362  LWGTRAGPSSSDISRNLSLPQHRSNYSTDYGSRGYHREHR 401


>gb|ADN34281.1| nucleic acid binding protein [Cucumis melo subsp. melo]
          Length = 610

 Score =  362 bits (930), Expect = 3e-97
 Identities = 212/450 (47%), Positives = 269/450 (59%), Gaps = 14/450 (3%)
 Frame = +1

Query: 292  EMETEDVISLPASSSPANGGDNEELNDSGCQPSEQNCQSLDVLEVNV----NESHGSSTD 459
            +ME ED+ +LP  S   +  +N E+  S  +    N    ++L  N     NE H    D
Sbjct: 81   DMEIEDLNNLPDFSKTRSRSENSEIL-SKAEDLPVNSADGNILPSNEPLQQNELHTRYED 139

Query: 460  IVNEDNQLLLD--ADNKSENQDKLEVIADAGLMEDGNGLCNHAEDAFQDSCPQTRANSLS 633
            + + ++Q       DN S ++   ++    G+  D N L + A      +          
Sbjct: 140  VCHVESQNFQKDLVDNSSFSKTGGQLTVMNGVSIDFNELNSGAPMENGSATSHHHGGPRI 199

Query: 634  GAFKRPRLAE---DEQQPSVQVIYKSLTRDSKRKLEELLQQWSQWHAQNCSSANDSGS-- 798
               KRPR+A    DEQQPSV ++Y SLTRDSK+KL+ELL+QWS+WHAQ  S + D     
Sbjct: 200  SGVKRPRMAMEAMDEQQPSVHIVYTSLTRDSKQKLDELLKQWSEWHAQQGSLSRDDKDTE 259

Query: 799  GMVSGEETYFPALQVGADKPSAMSFWMENETSGQNKELIRFDGNSVPLYDRGYXXXXXXX 978
             + SGEET+FPAL VG  K SA++FWM+N+ S Q +  +  D NSVPLYDRG+       
Sbjct: 260  NLESGEETFFPALCVGTKKTSAVTFWMDNQKSEQQQTFVPIDDNSVPLYDRGFTLGLTSA 319

Query: 979  XXXXXXERVRERAED-SRCFNCGSYSHSLKDCPKPRDNVAVNNARKQHKVRRNQNAAARN 1155
                  E  ++  +D SRCFNCGSY+HSLKDC KPRDN AVNNAR ++K  +  N+A+RN
Sbjct: 320  NDSSNVEGGQKIIDDASRCFNCGSYNHSLKDCRKPRDNAAVNNARNKYK--KQHNSASRN 377

Query: 1156 PTRYYQNTPGGKYDGLKPGTLEAETRKLLGLGELDPPPWLNRMREIGYPPGYL--EVDDQ 1329
             TRYYQN+ GGKYD L+PGTL+AETR+LLGL ELDPPPWLNRMRE+GYPPGYL  E +DQ
Sbjct: 378  STRYYQNSRGGKYDDLRPGTLDAETRQLLGLKELDPPPWLNRMRELGYPPGYLDPEDEDQ 437

Query: 1330 PSGITIFXXXXXXXXXXXXXXXXXXHNKAPKKKSVEFPGVNAPIPENADEERWAASGSSD 1509
            PSGITI+                  + K  KK SVEFPG+NAPIPENADE  WA   SS 
Sbjct: 438  PSGITIY-ADEKTDEQEDGEITEAEYRKPQKKMSVEFPGINAPIPENADERLWAPEPSSS 496

Query: 1510 LYMSRNHSYSRYNRPSEPISRGRHHEQQRW 1599
              + RN S  R N   E  +RG  H QQRW
Sbjct: 497  -GLPRNRSNQRLNHYPEYDTRGNDHHQQRW 525


>ref|XP_004169819.1| PREDICTED: uncharacterized protein LOC101230973 [Cucumis sativus]
          Length = 610

 Score =  360 bits (925), Expect = 1e-96
 Identities = 212/457 (46%), Positives = 268/457 (58%), Gaps = 21/457 (4%)
 Frame = +1

Query: 292  EMETEDVISLPASSSPANGGDNEEL-----------NDSGCQPSEQNCQSLDVLEVNVNE 438
            +ME ED+ +LP  S   +  +N E+            D    PS +  Q         NE
Sbjct: 81   DMEIEDLNNLPDFSKTRSRSENSEILSKAADLPVNSADGNILPSSEPLQQ--------NE 132

Query: 439  SHGSSTDIVNEDNQLLLD--ADNKSENQDKLEVIADAGLMEDGNGLCNHAEDAFQDSCPQ 612
             H    D+ + +++       DN S  +   ++    G+  D N L + A      +   
Sbjct: 133  FHTRYEDVCHVESKNFQKDLVDNSSFLKTGGQLTVMNGVSIDFNELNSGAPMENGSATSH 192

Query: 613  TRANSLSGAFKRPRLAE---DEQQPSVQVIYKSLTRDSKRKLEELLQQWSQWHAQNCSSA 783
                      KRPR+A    DEQQPSV ++Y SLTRDSK+KL+ELL+QWS+WHAQ  S +
Sbjct: 193  HHGGPRISGVKRPRMAMEAMDEQQPSVHIVYTSLTRDSKQKLDELLKQWSEWHAQQGSLS 252

Query: 784  NDSGS--GMVSGEETYFPALQVGADKPSAMSFWMENETSGQNKELIRFDGNSVPLYDRGY 957
             D      + SGEET+FPAL VG  K SA++FWM+N+ S Q +  +  D NSVPLYDRG+
Sbjct: 253  CDDKDTENLESGEETFFPALCVGTKKTSAVTFWMDNQKSEQQQNFVPIDDNSVPLYDRGF 312

Query: 958  XXXXXXXXXXXXXERVRERAED-SRCFNCGSYSHSLKDCPKPRDNVAVNNARKQHKVRRN 1134
                         E  ++  +D SRCFNCGSY+HSLKDC KPRDN AVNNAR ++K  + 
Sbjct: 313  TLGLTSANDSSNVEGGQKIIDDASRCFNCGSYNHSLKDCRKPRDNAAVNNARNKYK--KQ 370

Query: 1135 QNAAARNPTRYYQNTPGGKYDGLKPGTLEAETRKLLGLGELDPPPWLNRMREIGYPPGYL 1314
             N+A+RN TRYYQN+ GGKYD L+PGTL+AETR+LLGL ELDPPPWLNRMRE+GYPPGYL
Sbjct: 371  HNSASRNSTRYYQNSRGGKYDDLRPGTLDAETRQLLGLKELDPPPWLNRMRELGYPPGYL 430

Query: 1315 --EVDDQPSGITIFXXXXXXXXXXXXXXXXXXHNKAPKKKSVEFPGVNAPIPENADEERW 1488
              E +DQPSGITI+                  + K  KKKSVEFPG+NAPIPENADE  W
Sbjct: 431  DPEDEDQPSGITIY-ADEKTDEQEDGEITEAEYRKPRKKKSVEFPGINAPIPENADERLW 489

Query: 1489 AASGSSDLYMSRNHSYSRYNRPSEPISRGRHHEQQRW 1599
            A   S+   +SRN S  R N   E  +RG  H QQRW
Sbjct: 490  APEPSNS-GLSRNRSNQRLNHYPEYDTRGNDHHQQRW 525


>ref|XP_004141493.1| PREDICTED: uncharacterized protein LOC101212144 [Cucumis sativus]
          Length = 610

 Score =  360 bits (925), Expect = 1e-96
 Identities = 212/457 (46%), Positives = 268/457 (58%), Gaps = 21/457 (4%)
 Frame = +1

Query: 292  EMETEDVISLPASSSPANGGDNEEL-----------NDSGCQPSEQNCQSLDVLEVNVNE 438
            +ME ED+ +LP  S   +  +N E+            D    PS +  Q         NE
Sbjct: 81   DMEIEDLNNLPDFSKTRSRSENSEILSKAADLPVNSADGNILPSSELLQQ--------NE 132

Query: 439  SHGSSTDIVNEDNQLLLD--ADNKSENQDKLEVIADAGLMEDGNGLCNHAEDAFQDSCPQ 612
             H    D+ + +++       DN S  +   ++    G+  D N L + A      +   
Sbjct: 133  LHTRYEDVCHVESKKFQKDLVDNSSFLKTGGQLTVMNGVSIDFNELNSGAPMENGSATSH 192

Query: 613  TRANSLSGAFKRPRLAE---DEQQPSVQVIYKSLTRDSKRKLEELLQQWSQWHAQNCSSA 783
                      KRPR+A    DEQQPSV ++Y SLTRDSK+KL+ELL+QWS+WHAQ  S +
Sbjct: 193  HHGGPRISGVKRPRMAMEAMDEQQPSVHIVYTSLTRDSKQKLDELLKQWSEWHAQQGSLS 252

Query: 784  NDSGS--GMVSGEETYFPALQVGADKPSAMSFWMENETSGQNKELIRFDGNSVPLYDRGY 957
             D      + SGEET+FPAL VG  K SA++FWM+N+ S Q +  +  D NSVPLYDRG+
Sbjct: 253  CDDKDTENLESGEETFFPALCVGTKKTSAVTFWMDNQKSEQQQNFVPIDDNSVPLYDRGF 312

Query: 958  XXXXXXXXXXXXXERVRERAED-SRCFNCGSYSHSLKDCPKPRDNVAVNNARKQHKVRRN 1134
                         E  ++  +D SRCFNCGSY+HSLKDC KPRDN AVNNAR ++K  + 
Sbjct: 313  TLGLTSANDSSNAEGGQKIIDDASRCFNCGSYNHSLKDCRKPRDNAAVNNARNKYK--KQ 370

Query: 1135 QNAAARNPTRYYQNTPGGKYDGLKPGTLEAETRKLLGLGELDPPPWLNRMREIGYPPGYL 1314
             N+A+RN TRYYQN+ GGKYD L+PGTL+AETR+LLGL ELDPPPWLNRMRE+GYPPGYL
Sbjct: 371  HNSASRNSTRYYQNSRGGKYDDLRPGTLDAETRQLLGLKELDPPPWLNRMRELGYPPGYL 430

Query: 1315 --EVDDQPSGITIFXXXXXXXXXXXXXXXXXXHNKAPKKKSVEFPGVNAPIPENADEERW 1488
              E +DQPSGITI+                  + K  KKKSVEFPG+NAPIPENADE  W
Sbjct: 431  DPEDEDQPSGITIY-ADEKTDEQEDGEITEAEYRKPRKKKSVEFPGINAPIPENADERLW 489

Query: 1489 AASGSSDLYMSRNHSYSRYNRPSEPISRGRHHEQQRW 1599
            A   S+   +SRN S  R N   E  +RG  H QQRW
Sbjct: 490  APEPSNS-GLSRNRSNQRLNHYPEYDTRGNDHHQQRW 525


>ref|XP_002305958.1| proline-rich spliceosome-associated family protein [Populus
            trichocarpa] gi|222848922|gb|EEE86469.1| proline-rich
            spliceosome-associated family protein [Populus
            trichocarpa]
          Length = 531

 Score =  359 bits (921), Expect = 3e-96
 Identities = 202/431 (46%), Positives = 267/431 (61%), Gaps = 16/431 (3%)
 Frame = +1

Query: 355  NEELNDSGCQPSEQNCQSLDVLEVNVNESHGSSTDIVNEDNQLLLDADNKSENQDKLEVI 534
            + +L  +  + S+ + +S+++ E  V            E N+  +  D + +N + +E+ 
Sbjct: 19   HSQLGSNETKESKDDEESVELNEGAVGNDERMKNGESVELNEGAVGNDERMKNGESVELN 78

Query: 535  ADAGLMEDGN----------GLCNHAEDAFQDSCPQTRANSLSGAFKRPRLAEDEQQPSV 684
              A    +G           G+  + E          +AN +SG  KR R+  +E+QPSV
Sbjct: 79   EGAVGNNEGTKNGEGFELNVGVIGNDEVTVDPGYSALKAN-VSGV-KRKRITYNEEQPSV 136

Query: 685  QVIYKSLTRDSKRKLEELLQQWSQWHAQNCSSANDSGSGMVSGEETYFPALQVGADKPSA 864
             V+Y SLTR SK+KLEELLQQWS+WHAQ  SS++DS   + SGE+TYFPAL++G  K SA
Sbjct: 137  HVMYNSLTRASKKKLEELLQQWSEWHAQQNSSSHDSDEMLQSGEDTYFPALRIGMVKSSA 196

Query: 865  MSFWMENET-SGQNKELIRFDGNSVPLYDRGYXXXXXXXXXXXXXERVRERAEDS-RCFN 1038
            ++FW+EN+T   Q+  +I    N VPLYDRGY             ER  E   D+ RC+N
Sbjct: 197  VTFWIENQTRKQQDNAIIPLQSNYVPLYDRGYALGLTSADGPINIERGLEIVGDAARCYN 256

Query: 1039 CGSYSHSLKDCPKPRDNVAVNNARKQHKVRRNQNAAARNPTRYYQNTPGGKYDGLKPGTL 1218
            C SY+HSLK+CPKPRDN AVNNARKQHK +RNQN+++RNPTRYYQ++ GGKYDGLKPG+L
Sbjct: 257  CASYNHSLKECPKPRDNAAVNNARKQHKFKRNQNSSSRNPTRYYQSSSGGKYDGLKPGSL 316

Query: 1219 EAETRKLLGLGELDPPPWLNRMREIGYPPGYLEVD--DQPSGITIFXXXXXXXXXXXXXX 1392
            + ET+KLLGLGELDPPPWLNRM+E+GYPPGYL+ D  DQPSGITIF              
Sbjct: 317  DTETQKLLGLGELDPPPWLNRMQELGYPPGYLDPDDEDQPSGITIFADGDVNEEQEDGEI 376

Query: 1393 XXXXHNKAPKKK-SVEFPGVNAPIPENADEERW-AASGSSDLYMSRNHSYSRYNRPSEPI 1566
                    P++K SVEFPG+NA IPENAD+  W     SSD +  R+ S  R    SE  
Sbjct: 377  TETDPPPEPQRKMSVEFPGINAAIPENADQRLWEVGPTSSDPW--RHRSQHRLKYSSEAT 434

Query: 1567 SRGRHHEQQRW 1599
             R  HHEQ+++
Sbjct: 435  GRWHHHEQRQY 445


>ref|XP_006603953.1| PREDICTED: uncharacterized protein LOC100805423 isoform X1 [Glycine
            max] gi|571554248|ref|XP_006603954.1| PREDICTED:
            uncharacterized protein LOC100805423 isoform X2 [Glycine
            max]
          Length = 519

 Score =  356 bits (914), Expect = 2e-95
 Identities = 199/399 (49%), Positives = 262/399 (65%), Gaps = 12/399 (3%)
 Frame = +1

Query: 433  NESHGSSTDIVNEDNQLLLDADNKSENQD--KLEVIAD---AGLMEDGNGLCNHAEDAFQ 597
            +E+    +D + ++N L    D K + QD  KL+V+ +    GL+ + NG C   ED   
Sbjct: 12   SENSNLGSDSLEKENIL---EDEKEDLQDGLKLKVVTEEVSGGLLAE-NG-CISLEDGSL 66

Query: 598  DSCPQTRANSLSGAFKRPRLAEDEQQPSVQVIYKSLTRDSKRKLEELLQQWSQWHAQNCS 777
                +T   S+SGA KR R+  DE QPSV   Y SLTR S++KL+ELLQ+WS WHA++ S
Sbjct: 67   KRSLETVGTSVSGA-KRARITVDEYQPSVHFTYNSLTRASRQKLQELLQKWSAWHAKHVS 125

Query: 778  SANDSGSGMVSGEETYFPALQVGADKPSAMSFWMENET-SGQNKELIRFDGNSVPLYDRG 954
            S++D+   + SGEET+FPAL VG +K SA+SFWMEN+T + +NK+ I    N+VPLYDRG
Sbjct: 126  SSSDASEVLESGEETFFPALHVGLEKTSAVSFWMENQTRNDKNKDFIPLADNTVPLYDRG 185

Query: 955  YXXXXXXXXXXXXXERVRERAEDS-RCFNCGSYSHSLKDCPKPRDNVAVNNARKQHKVRR 1131
            Y             +   E  + + RCFNCGSY+HSL++CP+PRDN+AVNNAR + K RR
Sbjct: 186  YALGLTSADGSSNVDGGLEIIDAAARCFNCGSYNHSLRECPRPRDNIAVNNARDKLKSRR 245

Query: 1132 NQNAAARNPTRYYQNTPGGKYDGLKPGTLEAETRKLLGLGELDPPPWLNRMREIGYPPGY 1311
            NQN+++R+PTRYYQN+P GKYDGL+PG+L+  TRKLLGL ELDPPPWLNRMRE+GYPPGY
Sbjct: 246  NQNSSSRHPTRYYQNSPAGKYDGLRPGSLDDATRKLLGLRELDPPPWLNRMRELGYPPGY 305

Query: 1312 LEVD--DQPSGITIFXXXXXXXXXXXXXXXXXXHNKAPKKKSVEFPGVNAPIPENADEER 1485
            L+VD  DQPSGITIF                   +K  +KK+V+FPG+NAPIPE ADE  
Sbjct: 306  LDVDNEDQPSGITIFTDSEIADQEDGEIMEANA-SKPKRKKTVKFPGINAPIPEKADERL 364

Query: 1486 W---AASGSSDLYMSRNHSYSRYNRPSEPISRGRHHEQQ 1593
            W   A   SSD+  + +    R N  ++  SRG H E +
Sbjct: 365  WGTRAGPSSSDISRNLSLPQHRSNYSTDYGSRGYHREHR 403


>ref|XP_004514436.1| PREDICTED: uncharacterized protein LOC101500938 isoform X1 [Cicer
            arietinum] gi|502168650|ref|XP_004514437.1| PREDICTED:
            uncharacterized protein LOC101500938 isoform X2 [Cicer
            arietinum]
          Length = 532

 Score =  354 bits (909), Expect = 8e-95
 Identities = 216/442 (48%), Positives = 274/442 (61%), Gaps = 10/442 (2%)
 Frame = +1

Query: 298  ETEDVISLPASSSPANGGDNEELNDSGCQPSEQNCQSLDVLEVNVNESHGSSTDIVNEDN 477
            E E +  +  +S   +G + E  +D   + S+    SLD +++ V +S    T IVN D 
Sbjct: 3    EEEHMNEVLKNSVGISGSEAENKSDKNMEISD----SLDEVKM-VEKSSLLETSIVNTDL 57

Query: 478  QLLLDADNKSENQDKLEVIADAGLMEDGNGLCNHAEDAFQDSCPQTRANSLSGAFKRPRL 657
            QL +           LE+     + E+        +++   S    R  +     KR R+
Sbjct: 58   QLEVG----------LELTDTVSISEEEGVRGTVHDESLNGSIEIDRRGT-----KRARI 102

Query: 658  A-EDEQQPSVQVIYKSLTRDSKRKLEELLQQWSQWHAQNCSSANDSGSGMVSGEETYFPA 834
              +DE QPSV  IYKSLTR SK+KLEELLQQWS WHA++ SS+ND    + SGEET+FPA
Sbjct: 103  TVDDENQPSVHFIYKSLTRASKKKLEELLQQWSHWHAKHVSSSNDPSEVLESGEETFFPA 162

Query: 835  LQVGADKPSAMSFWMENET-SGQNKELIRFDGNSVPLYDRGYXXXXXXXXXXXXXERVRE 1011
            L VG +  SA+SFWMEN+T +  NK +I  DG+SVPLYDRGY             +   E
Sbjct: 163  LCVGHETTSAVSFWMENQTVNDTNKYVIPIDGDSVPLYDRGYALGLTSSSNNA--DGGLE 220

Query: 1012 RAED-SRCFNCGSYSHSLKDCPKPRDNVAVNNARKQHKVRRNQNAAARNPTRYYQNTPGG 1188
              +D SRCFNCGSY+H+L++CP+PRDNVAVNNARKQ K RRNQN+++R+PTRYYQ++P G
Sbjct: 221  IIDDPSRCFNCGSYNHALRECPRPRDNVAVNNARKQLKSRRNQNSSSRHPTRYYQSSPAG 280

Query: 1189 KYDGLKPGTLEAETRKLLGLGELDPPPWLNRMREIGYPPGYLEVD--DQPSGITIFXXXX 1362
            KYDGLKPG L+  TR+LLGLGELDPPPWLNRMRE+GYPPGYL+ D  D+PSGITIF    
Sbjct: 281  KYDGLKPGALDDATRQLLGLGELDPPPWLNRMRELGYPPGYLDADDEDEPSGITIF--TD 338

Query: 1363 XXXXXXXXXXXXXXHNKAPKKK-SVEFPGVNAPIPENADEERWAA----SGSSDLYMSRN 1527
                           +  PK+K SVEFPG+NAPIPE ADE  WAA      SSD+  S+N
Sbjct: 339  KDMEEQEDGEIVGADSSQPKRKMSVEFPGINAPIPEKADERLWAARVGPPSSSDI--SKN 396

Query: 1528 HSYSRYNRPSEPISRGRHHEQQ 1593
             S     R S   SRG H EQ+
Sbjct: 397  WS---QQRSSSYGSRGHHREQR 415


>ref|XP_002329267.1| predicted protein [Populus trichocarpa]
          Length = 289

 Score =  354 bits (908), Expect = 1e-94
 Identities = 175/286 (61%), Positives = 212/286 (74%), Gaps = 4/286 (1%)
 Frame = +1

Query: 643  KRPRLAEDEQQPSVQVIYKSLTRDSKRKLEELLQQWSQWHAQNCSSANDSGSGMVSGEET 822
            KR R+A DEQQPSV V+Y SLTR  K+KLEELLQQWS+WHAQ  +S++DS   + SGE+T
Sbjct: 4    KRKRMAYDEQQPSVHVMYNSLTRSGKQKLEELLQQWSEWHAQQ-NSSHDSNEMLQSGEDT 62

Query: 823  YFPALQVGADKPSAMSFWMENET-SGQNKELIRFDGNSVPLYDRGYXXXXXXXXXXXXXE 999
            YFPAL+VG +K SA+SFW+EN+    Q+ +LI    N VPLYDRGY             E
Sbjct: 63   YFPALRVGMEKSSAVSFWIENQARKQQDNDLILQHSNFVPLYDRGYVLGLTSADGPINVE 122

Query: 1000 RVRERAEDS-RCFNCGSYSHSLKDCPKPRDNVAVNNARKQHKVRRNQNAAARNPTRYYQN 1176
               E  + + RCFNCG+Y+HSLK+CPKPRDN AVNNARKQHK +RNQN+++RNPTRYYQ+
Sbjct: 123  GGLEIVDAAARCFNCGAYNHSLKECPKPRDNAAVNNARKQHKFKRNQNSSSRNPTRYYQS 182

Query: 1177 TPGGKYDGLKPGTLEAETRKLLGLGELDPPPWLNRMREIGYPPGYLEVD--DQPSGITIF 1350
            + GGKYDGLKPG+L+ ETR+LLGLGELDPPPWLNRMRE+GYPPGYL+ D  DQPSGITIF
Sbjct: 183  SSGGKYDGLKPGSLDTETRQLLGLGELDPPPWLNRMRELGYPPGYLDPDDEDQPSGITIF 242

Query: 1351 XXXXXXXXXXXXXXXXXXHNKAPKKKSVEFPGVNAPIPENADEERW 1488
                              H + P+K SVEFPG+NAPIPENA++  W
Sbjct: 243  DDGDVEEEQEDGEIMETDHPEPPRKMSVEFPGINAPIPENANQRFW 288

