BLASTX nr result

ID: Anemarrhena21_contig00005502 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Anemarrhena21_contig00005502
         (3158 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_008802399.1| PREDICTED: splicing factor U2af large subuni...   834   0.0  
ref|XP_010920327.1| PREDICTED: uncharacterized protein LOC105044...   833   0.0  
ref|XP_010920328.1| PREDICTED: uncharacterized protein LOC105044...   761   0.0  
ref|XP_009382209.1| PREDICTED: uncharacterized protein LOC103970...   697   0.0  
ref|XP_010278013.1| PREDICTED: splicing factor U2af large subuni...   635   e-179
ref|XP_010278014.1| PREDICTED: splicing factor U2af large subuni...   630   e-177
ref|XP_010650126.1| PREDICTED: uncharacterized protein LOC100266...   609   e-171
ref|XP_010650125.1| PREDICTED: uncharacterized protein LOC100266...   609   e-171
ref|XP_010650124.1| PREDICTED: uncharacterized protein LOC100266...   609   e-171
ref|XP_006857448.2| PREDICTED: splicing factor U2af large subuni...   566   e-158
gb|ERN18915.1| hypothetical protein AMTR_s00067p00176230 [Ambore...   566   e-158
ref|XP_012069885.1| PREDICTED: splicing factor U2af large subuni...   558   e-155
gb|KDP40369.1| hypothetical protein JCGZ_02367 [Jatropha curcas]      558   e-155
ref|XP_011030446.1| PREDICTED: uncharacterized protein LOC105129...   557   e-155
emb|CBI23686.3| unnamed protein product [Vitis vinifera]              552   e-154
ref|XP_006489671.1| PREDICTED: splicing factor U2af large subuni...   546   e-152
gb|KDO54082.1| hypothetical protein CISIN_1g002093mg [Citrus sin...   543   e-151
ref|XP_006420295.1| hypothetical protein CICLE_v10004248mg [Citr...   542   e-151
ref|XP_010091889.1| Splicing factor U2AF 50 kDa subunit [Morus n...   542   e-151
ref|XP_006489672.1| PREDICTED: splicing factor U2af large subuni...   540   e-150

>ref|XP_008802399.1| PREDICTED: splicing factor U2af large subunit A [Phoenix dactylifera]
          Length = 945

 Score =  834 bits (2154), Expect = 0.0
 Identities = 495/962 (51%), Positives = 604/962 (62%), Gaps = 68/962 (7%)
 Frame = -1

Query: 2894 MSKTNNSGEVAHKNSNEGTAARTRPISLEEVLSRRKKKL-----------NTEVKERASE 2748
            M +T  S E   +NSNEG+AARTRP+S EE++ RRKKKL              VK  A  
Sbjct: 1    MKRTERSKEAHRRNSNEGSAARTRPLSFEEIMLRRKKKLAAGAEGASGLREPYVKHDAKG 60

Query: 2747 RRDSSEKDDN----GVLSD------RGTEXXXXXXXXXXXXXK--RLEMTISKKEQEHVE 2604
              D +E D+      VL D      R TE                ++E  +SK++    E
Sbjct: 61   ASDRAEPDEAYKSPRVLKDTVRGSSRRTEDVVIKREGDQFKGSSRKMEEVVSKRDGYRAE 120

Query: 2603 NKDKGNCDSVTKTKLKGNYSKSSRDKESKNEKQRHHKSRTSNRLGADSEKESEKKQAKVS 2424
             K+KG  D     K K +YSK S D+ESK E+Q HH+SRT + L ADSEKE EKK+ K  
Sbjct: 121  GKEKGKLDHEANIKAKSSYSKRSGDRESKTERQSHHRSRTGDWLEADSEKEFEKKRPKDI 180

Query: 2423 AXXXXXXXXXXXXXXXXXXXXXSRTDDKHRSVVDDSIPKRHDSGKLRDAEYSERKDRKKE 2244
                                     D+K RS +D S  K+HDSGK RD EYSER DRKKE
Sbjct: 181  VEKDKGKERDRRPKGEAKRKYHGHNDEKSRSEIDGSNSKKHDSGKSRD-EYSERNDRKKE 239

Query: 2243 HSRTHTEEPKAKRRRSRSREYDWERDR-SASMSPRAYGRTNHGRDYDEPVFQXXXXXXXX 2067
            HS+ + EE ++KRRRSRSRE+D ERDR S S SPRA  R+ HGRDY+E  F         
Sbjct: 240  HSKAYFEELRSKRRRSRSREFDRERDRRSVSHSPRANKRSFHGRDYEESPFPSLKDKSRG 299

Query: 2066 XXXXXXKYRTSGNGGYSSGHYRKHGSGLGGYSPRKRRTEAAVRTPSPVKRSPEKKSSTWD 1887
                  K+R SGNGGY+SGH+RK+GSGLGGYSPRKRRTEAA+RTPSP  RSPEKK++TWD
Sbjct: 300  KYSDGDKHRASGNGGYASGHHRKYGSGLGGYSPRKRRTEAAIRTPSPPIRSPEKKTATWD 359

Query: 1886 QPPAGIIHGASGSMFASLQSPSSKILELTSSAAVTPTVSKPQPVLPLDATSTLISATIDS 1707
            QPP G  H   GS  A+ QS            ++TP  +K +P    + TS ++SA++DS
Sbjct: 360  QPPPGANHAGLGSTLANFQS------------SITPATAKSRPTPSKETTSVVMSASVDS 407

Query: 1706 VQLTQATRPMRRLYIENLPSSASEKSVIDCLNDFFLSSGVNHIQGAKPCISCIINKEKCQ 1527
            VQLTQATRP+RRLYIENLP  ASEK+VIDCLNDF LS GVN IQGA PCISC+INKEKCQ
Sbjct: 408  VQLTQATRPIRRLYIENLPPLASEKTVIDCLNDFLLSYGVNRIQGANPCISCLINKEKCQ 467

Query: 1526 AVVEFLTPEDATSAVSFDGRSLSGSVLKIRRPKDFVEAAIGVPEKPVPTVKAISEVVKDS 1347
            A+VEFLTPEDAT+A+SFDGRSL GSVLKIRRP+DFVEAA G PEKPV   KAIS+VVKDS
Sbjct: 468  ALVEFLTPEDATAALSFDGRSLFGSVLKIRRPRDFVEAATGTPEKPVEEAKAISDVVKDS 527

Query: 1346 PHKIFIGGISKALSSDMLLEVVGAFGLLRAYHYEFNEELNGPCAFLEYEDHSITQKACAG 1167
            P KIFIGGISKALSSDMLLE+V AFG LRAYHY+FNEELNGPCAFLEY D S+T+KACAG
Sbjct: 528  PQKIFIGGISKALSSDMLLEIVSAFGSLRAYHYKFNEELNGPCAFLEYADPSVTRKACAG 587

Query: 1166 LNGMKLGGHVLTAVQAFLDSNGEEDDAENIPSYDIPSHARSLLTDSTQVLQLKNMLNPEE 987
            LNGMKLGG +LTAVQA L S   E++ E+ PSY IP HA+ LL DST+VLQLKN+ N EE
Sbjct: 588  LNGMKLGGCILTAVQA-LPSAHVEENTESAPSYGIPMHAKPLLADSTKVLQLKNVFNREE 646

Query: 986  FMXXXXXXXXXXXEDVRLECTRFGTVKSINLVRFNSKIEGATESSEQRVPHADSAKNLSA 807
            F+           ED+RLEC RFG VKS+N++RF S +E A ++SE     A S +  S 
Sbjct: 647  FLLLSESELEETLEDIRLECGRFGAVKSVNIIRFTSTVETAPKASEPET-LAGSTEIEST 705

Query: 806  VTSPISNCKNGDVDVHTSNNVREHQDVRDNTENTNSNIENLMEDYERKESAAPSTIDG-- 633
              SP S+ +  D ++  +N+  E QD RD +E+  SN E   +D E KESA  S  D   
Sbjct: 706  AASPRSDSEE-DGNIPATNDANEPQDARDVSEDRQSNHETSTKDLE-KESADFSPSDSVV 763

Query: 632  -GGDSSKLEEPSSH-----DKGADTNIEESICLESGLVSEISLQ-EVDVTGDKTTNL--- 483
               D+ +L EPS       D   D   EES+ +E+ +V + S + EVD T  +  +L   
Sbjct: 764  VFQDAQQLNEPSGDPRTELDDNVDAEREESVGVENDIVPKSSPKLEVDTTIAEDADLNGS 823

Query: 482  ----EARASNGKSV---QIPVETTNPNIFSADEGD------------------------- 399
                EA   +G+ +    +  +TT  +  +  +GD                         
Sbjct: 824  AAKQEASRGDGEHMLMENVDSKTTVRDGANLSKGDYDCMSIENATRSTAVDKAVDKTGAS 883

Query: 398  KRECNNQTHDLEVFEPGSVLVEFLRKEAACAAAHSLHRRCYGERIVSVSYVSHDLYITRF 219
              + +NQ  DL+VFEPGSVLVEF+RKEAAC AAHSLH R Y ERIV+  YV  DLY++RF
Sbjct: 884  NGDDDNQVQDLDVFEPGSVLVEFMRKEAACMAAHSLHGRTYSERIVTAGYVPPDLYLSRF 943

Query: 218  PR 213
            PR
Sbjct: 944  PR 945


>ref|XP_010920327.1| PREDICTED: uncharacterized protein LOC105044207 isoform X1 [Elaeis
            guineensis]
          Length = 940

 Score =  833 bits (2151), Expect = 0.0
 Identities = 486/959 (50%), Positives = 586/959 (61%), Gaps = 65/959 (6%)
 Frame = -1

Query: 2894 MSKTNNSGEVAHKNSNEGTAARTRPISLEEVLSRRKKKLNTEVKERASERRDSSEKDDNG 2715
            MS+T  S E   +NSNEGTAARTRP+S EE++ RRKKKL    +  +  R  S + D  G
Sbjct: 1    MSRTERSKEAHRRNSNEGTAARTRPLSFEEIMLRRKKKLTAGAEGASGSREPSVKHDAKG 60

Query: 2714 VLS----DRGTEXXXXXXXXXXXXXKRLEMTISKKEQE-------------------HVE 2604
                   D G +             +R+E  +SK+E++                    +E
Sbjct: 61   ASDCAEPDEGYKSTRVLKDTAKGSSRRMEEVVSKRERDWLEGTSRKMEEVVSKREGDRLE 120

Query: 2603 NKDKGNCDSVTKTKLKGNYSKSSRDKESKNEKQRHHKSRTSNRLGADSEKESEKKQAKVS 2424
             K+KG  D     K K +YSK S D+ESK E+Q H++SRT   L ADSEKE EKK+ K  
Sbjct: 121  GKEKGKLDYEANIKAKSSYSKRSGDRESKTERQSHYRSRTGYGLEADSEKEFEKKRPKDI 180

Query: 2423 AXXXXXXXXXXXXXXXXXXXXXSRTDDKHRSVVDDSIPKRHDSGKLRDAEYSERKDRKKE 2244
                                     D+K RS +D S  K+HDSGK RD EYSER DRKKE
Sbjct: 181  VEKDKVKERDRKLKGEGKRKYHGHNDEKSRSEIDGSKLKKHDSGKSRD-EYSERNDRKKE 239

Query: 2243 HSRTHTEEPKAKRRRSRSREYDWERDRSASMSPRAYGRTNHGRDYDEPVFQXXXXXXXXX 2064
            HS+ + E+P++KRRRSRSRE D ER+   S SPRA  R+ H RDY+E  F          
Sbjct: 240  HSKAYYEDPRSKRRRSRSRENDQERE--VSFSPRANKRSYHVRDYEESSFPSLKDKSRRK 297

Query: 2063 XXXXXKYRTSGNGGYSSGHYRKHGSGLGGYSPRKRRTEAAVRTPSPVKRSPEKKSSTWDQ 1884
                 K+R SGNGGY+SGH+RK+GSGLGGYSPRKRRTEAAVRTPSP  RSPEKK++TWDQ
Sbjct: 298  YSDGDKHRASGNGGYASGHHRKYGSGLGGYSPRKRRTEAAVRTPSPTIRSPEKKTATWDQ 357

Query: 1883 PPAGIIHGASGSMFASLQSPSSKILELTSSAAVTPTVSKPQPVLPLDATSTLISATIDSV 1704
            PP G  H   GS  A+ QS            + TP ++K QP    +  S ++SA++DSV
Sbjct: 358  PPPGANHTGFGSTLANFQS------------STTPAIAKSQPTPSKETVSVVMSASVDSV 405

Query: 1703 QLTQATRPMRRLYIENLPSSASEKSVIDCLNDFFLSSGVNHIQGAKPCISCIINKEKCQA 1524
            QLTQATRP RRLYIENLP SASEK+VID LND  LSSGVN IQG  PCISC+INKEKCQA
Sbjct: 406  QLTQATRPKRRLYIENLPPSASEKTVIDSLNDCLLSSGVNQIQGTSPCISCLINKEKCQA 465

Query: 1523 VVEFLTPEDATSAVSFDGRSLSGSVLKIRRPKDFVEAAIGVPEKPVPTVKAISEVVKDSP 1344
            +VEFLTPEDAT+A+SFDGRSL GSVLKIRRPKDFVEAA G  EKP+   KAIS+VVKDSP
Sbjct: 466  LVEFLTPEDATAALSFDGRSLFGSVLKIRRPKDFVEAATGAQEKPMEEAKAISDVVKDSP 525

Query: 1343 HKIFIGGISKALSSDMLLEVVGAFGLLRAYHYEFNEELNGPCAFLEYEDHSITQKACAGL 1164
             KIFIGGISK LSSDML+E+V AFG LRAYHYEFNEELNGPCAFLEY DHS+TQKACAGL
Sbjct: 526  QKIFIGGISKTLSSDMLMEIVSAFGPLRAYHYEFNEELNGPCAFLEYVDHSVTQKACAGL 585

Query: 1163 NGMKLGGHVLTAVQAFLDSNGEEDDAENIPSYDIPSHARSLLTDSTQVLQLKNMLNPEEF 984
            NGMKLGG VLTAVQAF  ++ EE + E+ PSY IP HA+ LL DST+VLQLKN+   EEF
Sbjct: 586  NGMKLGGCVLTAVQAFPSAHVEE-NTESAPSYGIPMHAKPLLADSTKVLQLKNVFKREEF 644

Query: 983  MXXXXXXXXXXXEDVRLECTRFGTVKSINLVRFNSKIEGATESSEQRVPHADSAKNLSAV 804
            +           EDVRLEC RFGTVKS+N+VR+ S +E A ++SE        +K + + 
Sbjct: 645  LLLSESELEETLEDVRLECGRFGTVKSVNIVRYTSTVETAPKASEPET--IGGSKKIEST 702

Query: 803  TSPISNCKNGDVDVHTSNNVREHQDVRDNTENTNSNIENLMEDYERKESAAPSTIDG--G 630
            TS   +    D ++  +NN  E QD RD +E+  +N E   +D E KESA  S  DG   
Sbjct: 703  TSSPKSDSEEDGNIPVTNNANEPQDARDASEDRQNNHETSTKDLE-KESAGFSPSDGVVF 761

Query: 629  GDSSKLEEPSSH-----DKGADTNIEESICLESGLVSEISLQEVDVT-----GDKTTNLE 480
             D  +L EPS       D   D   EES      +   +S  EVD T     G  ++  +
Sbjct: 762  QDVQQLNEPSGDPRAELDDNVDAEREESGVENDMVPKSLSKLEVDTTIAEDAGLNSSAAK 821

Query: 479  ARASNGKSVQIPVETTNPNIFSAD-----EGD-------------------------KRE 390
              AS G    + +E  +P     D     +GD                           +
Sbjct: 822  QEASRGDGEHMSMENVDPKTTVRDGANSSKGDDDCMPIDNANLSNAVDEAVDKTGVSNGD 881

Query: 389  CNNQTHDLEVFEPGSVLVEFLRKEAACAAAHSLHRRCYGERIVSVSYVSHDLYITRFPR 213
             N+Q  DL+VFEPGSVLVEFLRKEA C AAH LH R Y E+IV+  YV HDLY+ RFPR
Sbjct: 882  DNHQAQDLDVFEPGSVLVEFLRKEATCMAAHCLHGRTYSEQIVTAGYVPHDLYLARFPR 940


>ref|XP_010920328.1| PREDICTED: uncharacterized protein LOC105044207 isoform X2 [Elaeis
            guineensis]
          Length = 910

 Score =  761 bits (1966), Expect = 0.0
 Identities = 460/959 (47%), Positives = 558/959 (58%), Gaps = 65/959 (6%)
 Frame = -1

Query: 2894 MSKTNNSGEVAHKNSNEGTAARTRPISLEEVLSRRKKKLNTEVKERASERRDSSEKDDNG 2715
            MS+T  S E   +NSNEGTAARTRP+S EE++ RRKKKL    +  +  R  S + D  G
Sbjct: 1    MSRTERSKEAHRRNSNEGTAARTRPLSFEEIMLRRKKKLTAGAEGASGSREPSVKHDAKG 60

Query: 2714 VLS----DRGTEXXXXXXXXXXXXXKRLEMTISKKEQE-------------------HVE 2604
                   D G +             +R+E  +SK+E++                    +E
Sbjct: 61   ASDCAEPDEGYKSTRVLKDTAKGSSRRMEEVVSKRERDWLEGTSRKMEEVVSKREGDRLE 120

Query: 2603 NKDKGNCDSVTKTKLKGNYSKSSRDKESKNEKQRHHKSRTSNRLGADSEKESEKKQAKVS 2424
             K+KG  D     K K +YSK S D+ESK E+Q H++SRT   L ADSEKE EKK+ K  
Sbjct: 121  GKEKGKLDYEANIKAKSSYSKRSGDRESKTERQSHYRSRTGYGLEADSEKEFEKKRPKDI 180

Query: 2423 AXXXXXXXXXXXXXXXXXXXXXSRTDDKHRSVVDDSIPKRHDSGKLRDAEYSERKDRKKE 2244
                                     D+K RS +D S  K+HDSGK RD EYSER DRKKE
Sbjct: 181  VEKDKVKERDRKLKGEGKRKYHGHNDEKSRSEIDGSKLKKHDSGKSRD-EYSERNDRKKE 239

Query: 2243 HSRTHTEEPKAKRRRSRSREYDWERDRSASMSPRAYGRTNHGRDYDEPVFQXXXXXXXXX 2064
            HS+ + E+P++KRRRSRSRE D ER+   S SPRA  R+ H RDY+E  F          
Sbjct: 240  HSKAYYEDPRSKRRRSRSRENDQERE--VSFSPRANKRSYHVRDYEESSFPSLKDKSRRK 297

Query: 2063 XXXXXKYRTSGNGGYSSGHYRKHGSGLGGYSPRKRRTEAAVRTPSPVKRSPEKKSSTWDQ 1884
                 K+R SGNGGY+SGH+RK+GSGLGGYSPRKRRTEAAVRTPSP  RSPEKK++TWDQ
Sbjct: 298  YSDGDKHRASGNGGYASGHHRKYGSGLGGYSPRKRRTEAAVRTPSPTIRSPEKKTATWDQ 357

Query: 1883 PPAGIIHGASGSMFASLQSPSSKILELTSSAAVTPTVSKPQPVLPLDATSTLISATIDSV 1704
            PP G  H   GS  A+ QS            + TP ++K QP    +  S ++SA++DSV
Sbjct: 358  PPPGANHTGFGSTLANFQS------------STTPAIAKSQPTPSKETVSVVMSASVDSV 405

Query: 1703 QLTQATRPMRRLYIENLPSSASEKSVIDCLNDFFLSSGVNHIQGAKPCISCIINKEKCQA 1524
            QLTQATRP RRLYIENLP SASEK+VID LND  LSSGVN IQG  PCISC+INKEKCQA
Sbjct: 406  QLTQATRPKRRLYIENLPPSASEKTVIDSLNDCLLSSGVNQIQGTSPCISCLINKEKCQA 465

Query: 1523 VVEFLTPEDATSAVSFDGRSLSGSVLKIRRPKDFVEAAIGVPEKPVPTVKAISEVVKDSP 1344
            +VEFLTPEDAT+A+SFDGRSL GSVLKIRRPKDFVEAA G  EKP+   KAIS+VVKDSP
Sbjct: 466  LVEFLTPEDATAALSFDGRSLFGSVLKIRRPKDFVEAATGAQEKPMEEAKAISDVVKDSP 525

Query: 1343 HKIFIGGISKALSSDMLLEVVGAFGLLRAYHYEFNEELNGPCAFLEYEDHSITQKACAGL 1164
             KIFIGGISK LSSDM                              Y DHS+TQKACAGL
Sbjct: 526  QKIFIGGISKTLSSDM------------------------------YVDHSVTQKACAGL 555

Query: 1163 NGMKLGGHVLTAVQAFLDSNGEEDDAENIPSYDIPSHARSLLTDSTQVLQLKNMLNPEEF 984
            NGMKLGG VLTAVQAF  ++ EE + E+ PSY IP HA+ LL DST+VLQLKN+   EEF
Sbjct: 556  NGMKLGGCVLTAVQAFPSAHVEE-NTESAPSYGIPMHAKPLLADSTKVLQLKNVFKREEF 614

Query: 983  MXXXXXXXXXXXEDVRLECTRFGTVKSINLVRFNSKIEGATESSEQRVPHADSAKNLSAV 804
            +           EDVRLEC RFGTVKS+N+VR+ S +E A ++SE        +K + + 
Sbjct: 615  LLLSESELEETLEDVRLECGRFGTVKSVNIVRYTSTVETAPKASEPET--IGGSKKIEST 672

Query: 803  TSPISNCKNGDVDVHTSNNVREHQDVRDNTENTNSNIENLMEDYERKESAAPSTIDG--G 630
            TS   +    D ++  +NN  E QD RD +E+  +N E   +D E KESA  S  DG   
Sbjct: 673  TSSPKSDSEEDGNIPVTNNANEPQDARDASEDRQNNHETSTKDLE-KESAGFSPSDGVVF 731

Query: 629  GDSSKLEEPSSH-----DKGADTNIEESICLESGLVSEISLQEVDVT-----GDKTTNLE 480
             D  +L EPS       D   D   EES      +   +S  EVD T     G  ++  +
Sbjct: 732  QDVQQLNEPSGDPRAELDDNVDAEREESGVENDMVPKSLSKLEVDTTIAEDAGLNSSAAK 791

Query: 479  ARASNGKSVQIPVETTNPNIFSAD-----EGD-------------------------KRE 390
              AS G    + +E  +P     D     +GD                           +
Sbjct: 792  QEASRGDGEHMSMENVDPKTTVRDGANSSKGDDDCMPIDNANLSNAVDEAVDKTGVSNGD 851

Query: 389  CNNQTHDLEVFEPGSVLVEFLRKEAACAAAHSLHRRCYGERIVSVSYVSHDLYITRFPR 213
             N+Q  DL+VFEPGSVLVEFLRKEA C AAH LH R Y E+IV+  YV HDLY+ RFPR
Sbjct: 852  DNHQAQDLDVFEPGSVLVEFLRKEATCMAAHCLHGRTYSEQIVTAGYVPHDLYLARFPR 910


>ref|XP_009382209.1| PREDICTED: uncharacterized protein LOC103970240 [Musa acuminata
            subsp. malaccensis] gi|695070373|ref|XP_009382210.1|
            PREDICTED: uncharacterized protein LOC103970240 [Musa
            acuminata subsp. malaccensis]
          Length = 908

 Score =  697 bits (1799), Expect = 0.0
 Identities = 415/936 (44%), Positives = 548/936 (58%), Gaps = 42/936 (4%)
 Frame = -1

Query: 2894 MSKTNNSGEVAHKNSNEGTAARTRPISLEEVLSRRKKKLNTEVKERASERRDSSEKDDNG 2715
            MS+TN   E    N NEGTAARTRPIS EE++ RRKKKL  +  ER S  ++   KDD  
Sbjct: 1    MSRTNRRKEAPGIN-NEGTAARTRPISFEEIMLRRKKKLTADGNERESRFKEHFVKDDAK 59

Query: 2714 VLSDRGTEXXXXXXXXXXXXXKRLEMTISKKEQEHVENKDKGNCDSVTKTKL-------- 2559
            V  DR                +      SKK +E +     G+ + + K KL        
Sbjct: 60   VTLDRSDADRAGMKDFKDTMKES-----SKKTKEKIYKVKDGDLEGMHKHKLELYADHKP 114

Query: 2558 KGNYSKSSRDKESKNEKQRHHKSRTSNRLGADSEKESEKKQAKVSAXXXXXXXXXXXXXX 2379
            K  YS+S++DKE  NEKQ H +SR  ++LG  S KE EKKQ K +               
Sbjct: 115  KSIYSRSNKDKERLNEKQNHPRSRNGDKLGNSSAKEFEKKQPKYTTEKERHDDRDRKSRG 174

Query: 2378 XXXXXXXSRTDDKHRSVVDDSIPKRHDSGKLRDAEYSERKDRKKEHSRTHTEEPKAKRRR 2199
                   S  D+++R  +D S  +RHD GK R  EY+ER DR+K+ S+ + E+ K+KRRR
Sbjct: 175  EMKKKEHSYADERNRLEIDYSTLRRHDLGKSRHPEYAERNDRRKDGSKHYFEQLKSKRRR 234

Query: 2198 SRSREYDWERDRSASMSPRAYGRTNHGRDYDEPVFQXXXXXXXXXXXXXXKYRTSGNGGY 2019
            SRS E + ER RS S SPR + R+ HGR+Y+E                    +      +
Sbjct: 235  SRSPECERERGRSVSSSPRGHKRSYHGREYEES--------------SSVSLKEKSRKKH 280

Query: 2018 SSG--HYRKHGSGLGGYSPRKRRTEAAVRTPSPVKRSPEKKSSTWDQPPAGIIHGASGSM 1845
            S G  HY KHGSGLGGYSPRKR+ EAAVRTPSP    PE+KS+ WDQPP    +  +G  
Sbjct: 281  SDGDKHYHKHGSGLGGYSPRKRKPEAAVRTPSPTIPYPERKSAKWDQPPPDAKNSGAGVP 340

Query: 1844 FASLQSPSSKILELTSSAAVTPTVSKPQPVLPLDATSTLISATIDSVQLTQATRPMRRLY 1665
              + +SP++K+LE    + VTP     Q     +  S ++S +IDSVQLTQATR  RRLY
Sbjct: 341  PDTFKSPATKMLEPALVSPVTPAAKIHQHAPSSEIASVVMSTSIDSVQLTQATRRTRRLY 400

Query: 1664 IENLPSSASEKSVIDCLNDFFLSSGVNHIQGAKPCISCIINKEKCQAVVEFLTPEDATSA 1485
            +ENLP+SASEK+++D LN+  +SS VNHI+G  PCISCI+NKEK QA+VEFLTP+DAT+A
Sbjct: 401  VENLPASASEKTLVDFLNNLLVSSSVNHIKGTSPCISCILNKEKSQALVEFLTPQDATAA 460

Query: 1484 VSFDGRSLSGSVLKIRRPKDFVEAAIGVPEKPVPTVKAISEVVKDSPHKIFIGGISKALS 1305
            +S+DGRS+SGSVLKIRRPKDFVE A   PEKP   +K + + VKDS HKIFIGGIS+A+S
Sbjct: 461  LSYDGRSISGSVLKIRRPKDFVETATVAPEKPKEELKVVDDFVKDSSHKIFIGGISEAIS 520

Query: 1304 SDMLLEVVGAFGLLRAYHYEFNEELNGPCAFLEYEDHSITQKACAGLNGMKLGGHVLTAV 1125
            S+ML+E+VGAFG L AYH+EFN+ELNGPCAFLEY DHSIT KACAGLNGMKLGG VLTAV
Sbjct: 521  SNMLMEIVGAFGTLNAYHFEFNKELNGPCAFLEYVDHSITSKACAGLNGMKLGGRVLTAV 580

Query: 1124 QAFLDSNGEEDDAENIPSYDIPSHARSLLTDSTQVLQLKNMLNPEEFMXXXXXXXXXXXE 945
            Q   D+   +++AE +  Y IP HA+SLL  ST+VLQLKN+ N EEF+           E
Sbjct: 581  QTLPDA---QENAEIVSCYGIPVHAKSLLGPSTKVLQLKNVFNKEEFLLLSESELEEVIE 637

Query: 944  DVRLECTRFGTVKSINLVRFNSKIEGATESSEQRVPHADSAKNLSAVTSPISNCKNGDVD 765
            D+RLECTRFGTVK++N++R+ S +  ATE+ E   P     K  S++  P SN    DVD
Sbjct: 638  DIRLECTRFGTVKAVNIIRYISNLGTATENFE-TYPSGYLVKAESSMECP-SNDSKQDVD 695

Query: 764  VHTSNNVREHQDVRDNTENTNSNIENLMEDYERKESAAPSTIDGGGDSSKLEEPSSHDKG 585
            +    +     D+R++     +N  N M   E +E+     IDG   S   ++       
Sbjct: 696  MAPHKDAERPGDIRNDVVEIKNN--NNMPLKEIEENIVFGEIDGSTTSKDAQQIGDVAAD 753

Query: 584  ADTNIEESICLESGL-----------------------VSEISLQEVDVTGD----KTTN 486
                 E+ + +E G                        +S I+ ++ +++ D     T N
Sbjct: 754  QPMETEKDVIVEIGAEIGLFSETPKPGEDAAEVEQNADLSSITAEKAELSSDVDPASTEN 813

Query: 485  LEARASNGKSVQIPVE-----TTNPNIFSADEGDKRECNNQTHDLEVFEPGSVLVEFLRK 321
               + S+ K  ++  E     +T P     +    +E ++Q  D+ +FEPGSV VEFLR 
Sbjct: 814  TCLQTSSAKEAELTKEDDEHLSTYPTGIDKEATSDKE-DHQDFDVSLFEPGSVFVEFLRT 872

Query: 320  EAACAAAHSLHRRCYGERIVSVSYVSHDLYITRFPR 213
            EA C AAH LH R YGE+ V+  +  HD+Y+ RF R
Sbjct: 873  EATCMAAHCLHGRTYGEQTVTAGFFPHDMYLARFRR 908


>ref|XP_010278013.1| PREDICTED: splicing factor U2af large subunit A-like isoform X1
            [Nelumbo nucifera]
          Length = 943

 Score =  635 bits (1637), Expect = e-179
 Identities = 403/932 (43%), Positives = 542/932 (58%), Gaps = 46/932 (4%)
 Frame = -1

Query: 2870 EVAHKNSNEGTAARTRPISLEEVLSRRK-KKLNTEVKERASERRDSSEKDDNGVLSDRGT 2694
            E++  N NEGTAARTRP S EE++ RR+ KK  ++ KE   E    S    + V S    
Sbjct: 19   ELSRDNCNEGTAARTRPFSFEEIMLRRQNKKPTSDAKEGTGELGKLSVSGKDNVESTHSE 78

Query: 2693 EXXXXXXXXXXXXXKRLEMT---ISK--------KEQEHVENKDKGNCDSVTKTKLKGNY 2547
                             E T   ISK        K+ + V++KD G+ D   K   + N 
Sbjct: 79   AAGGYKWSKDAIAMNASEDTAKRISKNQEGNTPIKKSKLVKDKDDGSHDIENKLSARSNN 138

Query: 2546 SKSSRDKESKNEKQRHHKSRTSNRLGADSEKESEKKQAKVSAXXXXXXXXXXXXXXXXXX 2367
            +  SR K  KNEKQ   KSR+ +R+    E ESEK+ +K +                   
Sbjct: 139  NMGSRSKGDKNEKQSQLKSRSYDRMRDYFEDESEKRHSKNTTVKDKYSDRDRGKSERETK 198

Query: 2366 XXXSRTDD-KHRSVVDDSIPKRHDSGKLRDA-EYSERKDRKKEHSRTHTEEPKAKRRRSR 2193
                  DD K RS ++ S  K++DSGK  D+ E SERK RK E S++  +E + KRRRSR
Sbjct: 199  RKQRTGDDEKKRSDINGSDVKKYDSGKWHDSSEPSERKGRK-ESSQSRYDEGRQKRRRSR 257

Query: 2192 SREYDWERDR-SASMSPRAYGRTN-HGRDYDEPVFQXXXXXXXXXXXXXXKYRTSGNGGY 2019
            SRE+D +RDR S S+SPR++ R++ HG+++ +  F               ++RTS NG Y
Sbjct: 258  SREHDRDRDRRSHSLSPRSHKRSSYHGQEHGDSSFNSSKDRPRKQHSDADRHRTSNNG-Y 316

Query: 2018 SSGHYRKHG---SGLGGYSPRKRRTEAAVRTPSPVKRSPEKKSSTWDQPPAGIIHGASGS 1848
             S H+R+HG   SGLGGYSPRKRRTEAA +TPSP  RSPE+K+  WD PP    + ++GS
Sbjct: 317  PSSHHRRHGGSTSGLGGYSPRKRRTEAAAKTPSPTVRSPERKTVGWDLPPKDTDNASAGS 376

Query: 1847 MFASLQSPSSKI---LELTSSAAVTPTVSKPQPVLPLDATSTLISATIDSVQLTQATRPM 1677
            +  + QS +  +   +EL +    T  V++    +  +  S   + +IDS+QLTQATRPM
Sbjct: 377  LLVNFQSSNQTVTTNIELPNVVQATLNVARALSGVSPNTLSMTKTESIDSIQLTQATRPM 436

Query: 1676 RRLYIENLPSSASEKSVIDCLNDFFLSSGVNHIQGAKPCISCIINKEKCQAVVEFLTPED 1497
            RRLY+EN+P+SAS+K+VI+C+N F LSSGVNHIQG  PCISCIINKEK  A++EFLT ED
Sbjct: 437  RRLYVENVPASASDKAVIECVNGFLLSSGVNHIQGTHPCISCIINKEKGHAILEFLTAED 496

Query: 1496 ATSAVSFDGRSLSGSVLKIRRPKDFVEAAIGVPEKPVPT-VKAISEVVKDSPHKIFIGGI 1320
            AT+A+SFDGRS SGS+LKIRRPKDFVEAA GVP+KPV T   AIS++V DSPHKIFIGGI
Sbjct: 497  ATAALSFDGRSFSGSILKIRRPKDFVEAATGVPQKPVATSADAISDIVNDSPHKIFIGGI 556

Query: 1319 SKALSSDMLLEVVGAFGLLRAYHYEFNEELNGPCAFLEYEDHSITQKACAGLNGMKLGGH 1140
            S+ LSSDML+E+ GAFG L+A+    NE+L    AFLEY D SIT KACAGLNGMKLGG 
Sbjct: 557  SRDLSSDMLMEIAGAFGHLKAFCIHVNEDLKEQIAFLEYVDKSITLKACAGLNGMKLGGQ 616

Query: 1139 VLTAVQAFLDSNGEEDDAENIPSYDIPSHARSLLTDSTQVLQLKNMLNPEEFMXXXXXXX 960
            +LT VQA  D++ E+++ EN PSY+IP HA+ LL   TQVL+LKN+ N EE         
Sbjct: 617  ILTVVQAVPDASSEQENTENPPSYEIPDHAKPLLDKPTQVLKLKNVFNQEELASLSGPEL 676

Query: 959  XXXXEDVRLECTRFGTVKSINLVRFNSKIEGATESSEQRVPH----------ADSAKNLS 810
                ED+RLEC RFGTVKS+N+V+  S    A  +SE    +           D  K + 
Sbjct: 677  EETLEDIRLECARFGTVKSVNIVKDRSNYAFALGTSEVTSQNDSRDLLYPEDDDHIKEIP 736

Query: 809  AVTSPISNCKNGDVDVHTSNNVREHQDVRDNTENTNSNIENLMEDYERKESAAPSTIDGG 630
             +   +      +      N+ +E        E  +++++  ++D  +  S+ P   D  
Sbjct: 737  RMGDSLHLSSEDNSKPEHPNDAKEFSGGGGTAEENSTSVDMPVQDLAKDGSSEPGHPDKR 796

Query: 629  GDSSKLEEPSSHDKGADTNIEESICLESGLVSEISLQEVDVTGDKT----TNL---EARA 471
                +L    + D        +    E  LV   + +++DV   K     TNL   E RA
Sbjct: 797  AGLVELICHLNADGALQEPAVQLDATEGQLVH--NKEDIDVLWAKESGMGTNLMVEEFRA 854

Query: 470  ---SNGKSVQIPVETTNPNIFS--ADEGDKRECNNQTHDLE-VFEPGSVLVEFLRKEAAC 309
               ++ K V I ++ T   I S   D+GDK++   +  DL  +FEPG +LVE+ R EA+C
Sbjct: 855  EETNDKKDVSIELDATATQIESGVTDKGDKKQ---EASDLSYIFEPGCILVEYARTEASC 911

Query: 308  AAAHSLHRRCYGERIVSVSYVSHDLYITRFPR 213
             AAH LHRR +G R V V YV+HDLY+  FP+
Sbjct: 912  MAAHCLHRRPFGNRNVEVGYVAHDLYLAMFPK 943


>ref|XP_010278014.1| PREDICTED: splicing factor U2af large subunit A-like isoform X2
            [Nelumbo nucifera]
          Length = 942

 Score =  630 bits (1626), Expect = e-177
 Identities = 404/932 (43%), Positives = 541/932 (58%), Gaps = 46/932 (4%)
 Frame = -1

Query: 2870 EVAHKNSNEGTAARTRPISLEEVLSRRK-KKLNTEVKERASERRDSSEKDDNGVLSDRGT 2694
            E++  N NEGTAARTRP S EE++ RR+ KK  ++ KE   E    S    + V S    
Sbjct: 19   ELSRDNCNEGTAARTRPFSFEEIMLRRQNKKPTSDAKEGTGELGKLSVSGKDNVESTHSE 78

Query: 2693 EXXXXXXXXXXXXXKRLEMT---ISK--------KEQEHVENKDKGNCDSVTKTKLKGNY 2547
                             E T   ISK        K+ + V++KD G+ D   K   + N 
Sbjct: 79   AAGGYKWSKDAIAMNASEDTAKRISKNQEGNTPIKKSKLVKDKDDGSHDIENKLSARSNN 138

Query: 2546 SKSSRDKESKNEKQRHHKSRTSNRLGADSEKESEKKQAKVSAXXXXXXXXXXXXXXXXXX 2367
            +  SR K  KNEKQ   KSR+ +R+    E ESEK+ +K +                   
Sbjct: 139  NMGSRSKGDKNEKQSQLKSRSYDRMRDYFEDESEKRHSKNTTVKDKYSDRDRGKSERETK 198

Query: 2366 XXXSRTDD-KHRSVVDDSIPKRHDSGKLRDA-EYSERKDRKKEHSRTHTEEPKAKRRRSR 2193
                  DD K RS ++ S  K++DSGK  D+ E SERK RK E S++  +E + KRRRSR
Sbjct: 199  RKQRTGDDEKKRSDINGSDVKKYDSGKWHDSSEPSERKGRK-ESSQSRYDEGRQKRRRSR 257

Query: 2192 SREYDWERDR-SASMSPRAYGRTN-HGRDYDEPVFQXXXXXXXXXXXXXXKYRTSGNGGY 2019
            SRE+D +RDR S S+SPR++ R++ HG+++ +  F               ++RTS NG Y
Sbjct: 258  SREHDRDRDRRSHSLSPRSHKRSSYHGQEHGDSSFNSSKDRPRKQHSDADRHRTSNNG-Y 316

Query: 2018 SSGHYRKHG---SGLGGYSPRKRRTEAAVRTPSPVKRSPEKKSSTWDQPPAGIIHGASGS 1848
             S H+R+HG   SGLGGYSPRKRRTEAA +TPSP  RSPE+K+  WD PP    + ++GS
Sbjct: 317  PSSHHRRHGGSTSGLGGYSPRKRRTEAAAKTPSPTVRSPERKTVGWDLPPKDTDNASAGS 376

Query: 1847 MFASLQSPSSKI---LELTSSAAVTPTVSKPQPVLPLDATSTLISATIDSVQLTQATRPM 1677
            +  + QS +  +   +EL +    T  V++    +  +  S   + +IDS+QLTQATRPM
Sbjct: 377  LLVNFQSSNQTVTTNIELPNVVQATLNVARALSGVSPNTLSMTKTESIDSIQLTQATRPM 436

Query: 1676 RRLYIENLPSSASEKSVIDCLNDFFLSSGVNHIQGAKPCISCIINKEKCQAVVEFLTPED 1497
            RRLY+EN+P+SAS+K+VI+C+N F LSSGVNHIQG  PCISCIINKEK  A++EFLT ED
Sbjct: 437  RRLYVENVPASASDKAVIECVNGFLLSSGVNHIQGTHPCISCIINKEKGHAILEFLTAED 496

Query: 1496 ATSAVSFDGRSLSGSVLKIRRPKDFVEAAIGVPEKPVPT-VKAISEVVKDSPHKIFIGGI 1320
            AT+A+SFDGRS SGS+LKIRRPKDFVEAA GVP+KPV T   AIS++V DSPHKIFIGGI
Sbjct: 497  ATAALSFDGRSFSGSILKIRRPKDFVEAATGVPQKPVATSADAISDIVNDSPHKIFIGGI 556

Query: 1319 SKALSSDMLLEVVGAFGLLRAYHYEFNEELNGPCAFLEYEDHSITQKACAGLNGMKLGGH 1140
            S+ LSSDML+E+ GAFG L+A+    NE+L    AFLEY D SIT KACAGLNGMKLGG 
Sbjct: 557  SRDLSSDMLMEIAGAFGHLKAFCIHVNEDLKEQIAFLEYVDKSITLKACAGLNGMKLGGQ 616

Query: 1139 VLTAVQAFLDSNGEEDDAENIPSYDIPSHARSLLTDSTQVLQLKNMLNPEEFMXXXXXXX 960
            +LT VQA  D++ EE + EN PSY+IP HA+ LL   TQVL+LKN+ N EE         
Sbjct: 617  ILTVVQAVPDASSEE-NTENPPSYEIPDHAKPLLDKPTQVLKLKNVFNQEELASLSGPEL 675

Query: 959  XXXXEDVRLECTRFGTVKSINLVRFNSKIEGATESSEQRVPH----------ADSAKNLS 810
                ED+RLEC RFGTVKS+N+V+  S    A  +SE    +           D  K + 
Sbjct: 676  EETLEDIRLECARFGTVKSVNIVKDRSNYAFALGTSEVTSQNDSRDLLYPEDDDHIKEIP 735

Query: 809  AVTSPISNCKNGDVDVHTSNNVREHQDVRDNTENTNSNIENLMEDYERKESAAPSTIDGG 630
             +   +      +      N+ +E        E  +++++  ++D  +  S+ P   D  
Sbjct: 736  RMGDSLHLSSEDNSKPEHPNDAKEFSGGGGTAEENSTSVDMPVQDLAKDGSSEPGHPDKR 795

Query: 629  GDSSKLEEPSSHDKGADTNIEESICLESGLVSEISLQEVDVTGDKT----TNL---EARA 471
                +L    + D        +    E  LV   + +++DV   K     TNL   E RA
Sbjct: 796  AGLVELICHLNADGALQEPAVQLDATEGQLVH--NKEDIDVLWAKESGMGTNLMVEEFRA 853

Query: 470  ---SNGKSVQIPVETTNPNIFS--ADEGDKRECNNQTHDLE-VFEPGSVLVEFLRKEAAC 309
               ++ K V I ++ T   I S   D+GDK++   +  DL  +FEPG +LVE+ R EA+C
Sbjct: 854  EETNDKKDVSIELDATATQIESGVTDKGDKKQ---EASDLSYIFEPGCILVEYARTEASC 910

Query: 308  AAAHSLHRRCYGERIVSVSYVSHDLYITRFPR 213
             AAH LHRR +G R V V YV+HDLY+  FP+
Sbjct: 911  MAAHCLHRRPFGNRNVEVGYVAHDLYLAMFPK 942


>ref|XP_010650126.1| PREDICTED: uncharacterized protein LOC100266510 isoform X3 [Vitis
            vinifera] gi|731389815|ref|XP_010650127.1| PREDICTED:
            uncharacterized protein LOC100266510 isoform X3 [Vitis
            vinifera]
          Length = 936

 Score =  609 bits (1570), Expect = e-171
 Identities = 390/940 (41%), Positives = 547/940 (58%), Gaps = 48/940 (5%)
 Frame = -1

Query: 2888 KTNNSGEVAHKNSNEGTAARTRPISLEEVLSRRK-KKLNTEVKERASERRDSSEKD---- 2724
            K   S E+   N  EGTAARTRP S +E++ RRK KKL  +VK+ A E  + S KD    
Sbjct: 10   KYGKSTELYLDNFKEGTAARTRPFSFDEIMLRRKNKKLYGDVKDGAGEEGNISRKDIVKN 69

Query: 2723 -DNGVLSDRGTEXXXXXXXXXXXXXKRLEMTISKKEQEHVEN---------KDKGNCDSV 2574
              +   SD+G +                 + +S ++++   +         KDK + +S 
Sbjct: 70   VSDCYESDKGYKHNEDSFPGAINHSSEDFVKVSSRKKDENTSMKEGNLGNVKDKESHNSE 129

Query: 2573 TKTKLKGNYSKSSRDKESKNEKQRHHKSRT---SNRLGADSEKESEKKQAKVSAXXXXXX 2403
             K K K N   + + KE K  ++ H + +    S R   DSE E EKK ++ S       
Sbjct: 130  DKLKAKPNKGMTDKSKEGKINQRVHGRKKIDERSRRSSDDSESEPEKKFSRDSVGKDRYA 189

Query: 2402 XXXXXXXXXXXXXXXSRTDDKHRSVVDDSIPKRHDSGKLRDAEYSERKDRKKEHSRTHTE 2223
                           +  D+K+R   + +  K+HD GK  ++E+ +RK+R+ E   +  E
Sbjct: 190  DRSRKSEKESKRKHRTGEDEKNR---ERNSMKKHDPGKRHESEFLDRKERR-ESPPSRYE 245

Query: 2222 EPKAKRRRSRSREYDWERD-RSASMSPRAYGRTNH-GRDYDEPVFQXXXXXXXXXXXXXX 2049
            E + KRRRSRSRE D +RD RS+S+SPRA  RT+H GR++ E                  
Sbjct: 246  ESRPKRRRSRSRERDKDRDKRSSSLSPRAQKRTSHHGREHAELSLHSLKDRSGRQHSDAD 305

Query: 2048 KYRTSGNGGYSSGHYRKHG---SGLGGYSPRKRRTEAAVRTPSPVKRSPEKKSSTWDQPP 1878
            + R S NG  SS H+R+HG   SGLGGYSPRKRRTEAA++TPSP  RSPEKKS+ WD PP
Sbjct: 306  RNRISNNG--SSSHFRRHGGSASGLGGYSPRKRRTEAAIKTPSPTNRSPEKKSAGWDLPP 363

Query: 1877 AGIIHGASGSMFASLQ----SPSSKILELTSSAAV----TPTVSKPQ-PVLPLDATSTLI 1725
            +      +GS+ +SLQ    + SS   EL S+  V    T T +KP  P +  DA S   
Sbjct: 364  SRTDGMNAGSVLSSLQVLKPTVSSNADELPSAVPVAVPVTATTAKPPLPRIYSDAVSKNK 423

Query: 1724 SATIDSVQLTQATRPMRRLYIENLPSSASEKSVIDCLNDFFLSSGVNHIQGAKPCISCII 1545
            + +IDS+QLTQATRPMRRLY+ENLP S+SEK++++CLN+F LSSG+NH+QG  PCISCII
Sbjct: 424  NVSIDSIQLTQATRPMRRLYVENLPVSSSEKALMECLNNFLLSSGINHVQGTPPCISCII 483

Query: 1544 NKEKCQAVVEFLTPEDATSAVSFDGRSLSGSVLKIRRPKDFVEAAIGVPEKPVPTVKAIS 1365
            +KEK QA+VEFLTPEDA++A+SFDG S SGS+LKIRRPKDFV+   GV EK V    AIS
Sbjct: 484  HKEKGQALVEFLTPEDASAALSFDGISFSGSILKIRRPKDFVDMT-GVQEKLVAAPDAIS 542

Query: 1364 EVVKDSPHKIFIGGISKALSSDMLLEVVGAFGLLRAYHYEFNEELNGPCAFLEYEDHSIT 1185
            ++VKDSPHKIFIGGIS+ALSSDML+E+  AFG L+AY ++ NE+L  PCAFLEY D S+T
Sbjct: 543  DIVKDSPHKIFIGGISRALSSDMLMEIAAAFGPLKAYRFQVNEDLGEPCAFLEYVDQSVT 602

Query: 1184 QKACAGLNGMKLGGHVLTAVQAFLDSNGEEDDAENIPSYDIPSHARSLLTDSTQVLQLKN 1005
             KACAGLNGMKLGG VLT VQA  ++   E+   N+P Y IP HA+ LL   TQVL+LKN
Sbjct: 603  LKACAGLNGMKLGGQVLTVVQAIPNALAMENTG-NLPFYGIPEHAKPLLERPTQVLKLKN 661

Query: 1004 MLNPEEFMXXXXXXXXXXXEDVRLECTRFGTVKSINLVRFNS------KIEGATESSEQR 843
            ++NP++             ED+RLECTRFGTVKS+N+V++N+      ++  A +++   
Sbjct: 662  VVNPDDLSSLSEAELEEILEDIRLECTRFGTVKSVNIVKYNNSHVSTLEVYEAADNTGSN 721

Query: 842  VPHADSAKNLSAVTSPISNCKNGDVDVHTSNNVREHQDVRDNTENTNSNIENLMEDYERK 663
            +    ++     +     N  +    +    +V++ ++V +  E  + + +  + D  + 
Sbjct: 722  LGCDGNSMKAETLGGGTDNGSSDISGIKPPTDVKDLKEVDEVVERNSISDDKSLTDLIKN 781

Query: 662  ESAAPSTIDGGGDSSKLEEPSSHDKGAD---------TNIEESICLESGLVSEISLQEVD 510
            E   PS ID    ++ ++EP   D   D          N++  + L +   +++ +QE  
Sbjct: 782  ELCEPSHID---SNTAVKEPGCPDGSDDIPRGLPDQLNNMKHEVELRNDKAADV-IQEDF 837

Query: 509  VTGDKTTNLEARASNGKSVQIPVETTNPNIFSADEGDKRECNNQTHDL-EVFEPGSVLVE 333
            +  +K   +E   +N K +    E  +     +D   K +      DL ++FE G VLVE
Sbjct: 838  IIKNKLMTVE-EETNRKLLGTSAELDSSPGIKSDFTGKNDSEKGLCDLDDMFEVGCVLVE 896

Query: 332  FLRKEAACAAAHSLHRRCYGERIVSVSYVSHDLYITRFPR 213
            + R EA+C AAH LH R + +R+V V YV+ DLY  +FPR
Sbjct: 897  YGRTEASCMAAHCLHGRYFDDRVVVVGYVALDLYRMKFPR 936


>ref|XP_010650125.1| PREDICTED: uncharacterized protein LOC100266510 isoform X2 [Vitis
            vinifera]
          Length = 963

 Score =  609 bits (1570), Expect = e-171
 Identities = 390/940 (41%), Positives = 547/940 (58%), Gaps = 48/940 (5%)
 Frame = -1

Query: 2888 KTNNSGEVAHKNSNEGTAARTRPISLEEVLSRRK-KKLNTEVKERASERRDSSEKD---- 2724
            K   S E+   N  EGTAARTRP S +E++ RRK KKL  +VK+ A E  + S KD    
Sbjct: 37   KYGKSTELYLDNFKEGTAARTRPFSFDEIMLRRKNKKLYGDVKDGAGEEGNISRKDIVKN 96

Query: 2723 -DNGVLSDRGTEXXXXXXXXXXXXXKRLEMTISKKEQEHVEN---------KDKGNCDSV 2574
              +   SD+G +                 + +S ++++   +         KDK + +S 
Sbjct: 97   VSDCYESDKGYKHNEDSFPGAINHSSEDFVKVSSRKKDENTSMKEGNLGNVKDKESHNSE 156

Query: 2573 TKTKLKGNYSKSSRDKESKNEKQRHHKSRT---SNRLGADSEKESEKKQAKVSAXXXXXX 2403
             K K K N   + + KE K  ++ H + +    S R   DSE E EKK ++ S       
Sbjct: 157  DKLKAKPNKGMTDKSKEGKINQRVHGRKKIDERSRRSSDDSESEPEKKFSRDSVGKDRYA 216

Query: 2402 XXXXXXXXXXXXXXXSRTDDKHRSVVDDSIPKRHDSGKLRDAEYSERKDRKKEHSRTHTE 2223
                           +  D+K+R   + +  K+HD GK  ++E+ +RK+R+ E   +  E
Sbjct: 217  DRSRKSEKESKRKHRTGEDEKNR---ERNSMKKHDPGKRHESEFLDRKERR-ESPPSRYE 272

Query: 2222 EPKAKRRRSRSREYDWERD-RSASMSPRAYGRTNH-GRDYDEPVFQXXXXXXXXXXXXXX 2049
            E + KRRRSRSRE D +RD RS+S+SPRA  RT+H GR++ E                  
Sbjct: 273  ESRPKRRRSRSRERDKDRDKRSSSLSPRAQKRTSHHGREHAELSLHSLKDRSGRQHSDAD 332

Query: 2048 KYRTSGNGGYSSGHYRKHG---SGLGGYSPRKRRTEAAVRTPSPVKRSPEKKSSTWDQPP 1878
            + R S NG  SS H+R+HG   SGLGGYSPRKRRTEAA++TPSP  RSPEKKS+ WD PP
Sbjct: 333  RNRISNNG--SSSHFRRHGGSASGLGGYSPRKRRTEAAIKTPSPTNRSPEKKSAGWDLPP 390

Query: 1877 AGIIHGASGSMFASLQ----SPSSKILELTSSAAV----TPTVSKPQ-PVLPLDATSTLI 1725
            +      +GS+ +SLQ    + SS   EL S+  V    T T +KP  P +  DA S   
Sbjct: 391  SRTDGMNAGSVLSSLQVLKPTVSSNADELPSAVPVAVPVTATTAKPPLPRIYSDAVSKNK 450

Query: 1724 SATIDSVQLTQATRPMRRLYIENLPSSASEKSVIDCLNDFFLSSGVNHIQGAKPCISCII 1545
            + +IDS+QLTQATRPMRRLY+ENLP S+SEK++++CLN+F LSSG+NH+QG  PCISCII
Sbjct: 451  NVSIDSIQLTQATRPMRRLYVENLPVSSSEKALMECLNNFLLSSGINHVQGTPPCISCII 510

Query: 1544 NKEKCQAVVEFLTPEDATSAVSFDGRSLSGSVLKIRRPKDFVEAAIGVPEKPVPTVKAIS 1365
            +KEK QA+VEFLTPEDA++A+SFDG S SGS+LKIRRPKDFV+   GV EK V    AIS
Sbjct: 511  HKEKGQALVEFLTPEDASAALSFDGISFSGSILKIRRPKDFVDMT-GVQEKLVAAPDAIS 569

Query: 1364 EVVKDSPHKIFIGGISKALSSDMLLEVVGAFGLLRAYHYEFNEELNGPCAFLEYEDHSIT 1185
            ++VKDSPHKIFIGGIS+ALSSDML+E+  AFG L+AY ++ NE+L  PCAFLEY D S+T
Sbjct: 570  DIVKDSPHKIFIGGISRALSSDMLMEIAAAFGPLKAYRFQVNEDLGEPCAFLEYVDQSVT 629

Query: 1184 QKACAGLNGMKLGGHVLTAVQAFLDSNGEEDDAENIPSYDIPSHARSLLTDSTQVLQLKN 1005
             KACAGLNGMKLGG VLT VQA  ++   E+   N+P Y IP HA+ LL   TQVL+LKN
Sbjct: 630  LKACAGLNGMKLGGQVLTVVQAIPNALAMENTG-NLPFYGIPEHAKPLLERPTQVLKLKN 688

Query: 1004 MLNPEEFMXXXXXXXXXXXEDVRLECTRFGTVKSINLVRFNS------KIEGATESSEQR 843
            ++NP++             ED+RLECTRFGTVKS+N+V++N+      ++  A +++   
Sbjct: 689  VVNPDDLSSLSEAELEEILEDIRLECTRFGTVKSVNIVKYNNSHVSTLEVYEAADNTGSN 748

Query: 842  VPHADSAKNLSAVTSPISNCKNGDVDVHTSNNVREHQDVRDNTENTNSNIENLMEDYERK 663
            +    ++     +     N  +    +    +V++ ++V +  E  + + +  + D  + 
Sbjct: 749  LGCDGNSMKAETLGGGTDNGSSDISGIKPPTDVKDLKEVDEVVERNSISDDKSLTDLIKN 808

Query: 662  ESAAPSTIDGGGDSSKLEEPSSHDKGAD---------TNIEESICLESGLVSEISLQEVD 510
            E   PS ID    ++ ++EP   D   D          N++  + L +   +++ +QE  
Sbjct: 809  ELCEPSHID---SNTAVKEPGCPDGSDDIPRGLPDQLNNMKHEVELRNDKAADV-IQEDF 864

Query: 509  VTGDKTTNLEARASNGKSVQIPVETTNPNIFSADEGDKRECNNQTHDL-EVFEPGSVLVE 333
            +  +K   +E   +N K +    E  +     +D   K +      DL ++FE G VLVE
Sbjct: 865  IIKNKLMTVE-EETNRKLLGTSAELDSSPGIKSDFTGKNDSEKGLCDLDDMFEVGCVLVE 923

Query: 332  FLRKEAACAAAHSLHRRCYGERIVSVSYVSHDLYITRFPR 213
            + R EA+C AAH LH R + +R+V V YV+ DLY  +FPR
Sbjct: 924  YGRTEASCMAAHCLHGRYFDDRVVVVGYVALDLYRMKFPR 963


>ref|XP_010650124.1| PREDICTED: uncharacterized protein LOC100266510 isoform X1 [Vitis
            vinifera]
          Length = 976

 Score =  609 bits (1570), Expect = e-171
 Identities = 390/940 (41%), Positives = 547/940 (58%), Gaps = 48/940 (5%)
 Frame = -1

Query: 2888 KTNNSGEVAHKNSNEGTAARTRPISLEEVLSRRK-KKLNTEVKERASERRDSSEKD---- 2724
            K   S E+   N  EGTAARTRP S +E++ RRK KKL  +VK+ A E  + S KD    
Sbjct: 50   KYGKSTELYLDNFKEGTAARTRPFSFDEIMLRRKNKKLYGDVKDGAGEEGNISRKDIVKN 109

Query: 2723 -DNGVLSDRGTEXXXXXXXXXXXXXKRLEMTISKKEQEHVEN---------KDKGNCDSV 2574
              +   SD+G +                 + +S ++++   +         KDK + +S 
Sbjct: 110  VSDCYESDKGYKHNEDSFPGAINHSSEDFVKVSSRKKDENTSMKEGNLGNVKDKESHNSE 169

Query: 2573 TKTKLKGNYSKSSRDKESKNEKQRHHKSRT---SNRLGADSEKESEKKQAKVSAXXXXXX 2403
             K K K N   + + KE K  ++ H + +    S R   DSE E EKK ++ S       
Sbjct: 170  DKLKAKPNKGMTDKSKEGKINQRVHGRKKIDERSRRSSDDSESEPEKKFSRDSVGKDRYA 229

Query: 2402 XXXXXXXXXXXXXXXSRTDDKHRSVVDDSIPKRHDSGKLRDAEYSERKDRKKEHSRTHTE 2223
                           +  D+K+R   + +  K+HD GK  ++E+ +RK+R+ E   +  E
Sbjct: 230  DRSRKSEKESKRKHRTGEDEKNR---ERNSMKKHDPGKRHESEFLDRKERR-ESPPSRYE 285

Query: 2222 EPKAKRRRSRSREYDWERD-RSASMSPRAYGRTNH-GRDYDEPVFQXXXXXXXXXXXXXX 2049
            E + KRRRSRSRE D +RD RS+S+SPRA  RT+H GR++ E                  
Sbjct: 286  ESRPKRRRSRSRERDKDRDKRSSSLSPRAQKRTSHHGREHAELSLHSLKDRSGRQHSDAD 345

Query: 2048 KYRTSGNGGYSSGHYRKHG---SGLGGYSPRKRRTEAAVRTPSPVKRSPEKKSSTWDQPP 1878
            + R S NG  SS H+R+HG   SGLGGYSPRKRRTEAA++TPSP  RSPEKKS+ WD PP
Sbjct: 346  RNRISNNG--SSSHFRRHGGSASGLGGYSPRKRRTEAAIKTPSPTNRSPEKKSAGWDLPP 403

Query: 1877 AGIIHGASGSMFASLQ----SPSSKILELTSSAAV----TPTVSKPQ-PVLPLDATSTLI 1725
            +      +GS+ +SLQ    + SS   EL S+  V    T T +KP  P +  DA S   
Sbjct: 404  SRTDGMNAGSVLSSLQVLKPTVSSNADELPSAVPVAVPVTATTAKPPLPRIYSDAVSKNK 463

Query: 1724 SATIDSVQLTQATRPMRRLYIENLPSSASEKSVIDCLNDFFLSSGVNHIQGAKPCISCII 1545
            + +IDS+QLTQATRPMRRLY+ENLP S+SEK++++CLN+F LSSG+NH+QG  PCISCII
Sbjct: 464  NVSIDSIQLTQATRPMRRLYVENLPVSSSEKALMECLNNFLLSSGINHVQGTPPCISCII 523

Query: 1544 NKEKCQAVVEFLTPEDATSAVSFDGRSLSGSVLKIRRPKDFVEAAIGVPEKPVPTVKAIS 1365
            +KEK QA+VEFLTPEDA++A+SFDG S SGS+LKIRRPKDFV+   GV EK V    AIS
Sbjct: 524  HKEKGQALVEFLTPEDASAALSFDGISFSGSILKIRRPKDFVDMT-GVQEKLVAAPDAIS 582

Query: 1364 EVVKDSPHKIFIGGISKALSSDMLLEVVGAFGLLRAYHYEFNEELNGPCAFLEYEDHSIT 1185
            ++VKDSPHKIFIGGIS+ALSSDML+E+  AFG L+AY ++ NE+L  PCAFLEY D S+T
Sbjct: 583  DIVKDSPHKIFIGGISRALSSDMLMEIAAAFGPLKAYRFQVNEDLGEPCAFLEYVDQSVT 642

Query: 1184 QKACAGLNGMKLGGHVLTAVQAFLDSNGEEDDAENIPSYDIPSHARSLLTDSTQVLQLKN 1005
             KACAGLNGMKLGG VLT VQA  ++   E+   N+P Y IP HA+ LL   TQVL+LKN
Sbjct: 643  LKACAGLNGMKLGGQVLTVVQAIPNALAMENTG-NLPFYGIPEHAKPLLERPTQVLKLKN 701

Query: 1004 MLNPEEFMXXXXXXXXXXXEDVRLECTRFGTVKSINLVRFNS------KIEGATESSEQR 843
            ++NP++             ED+RLECTRFGTVKS+N+V++N+      ++  A +++   
Sbjct: 702  VVNPDDLSSLSEAELEEILEDIRLECTRFGTVKSVNIVKYNNSHVSTLEVYEAADNTGSN 761

Query: 842  VPHADSAKNLSAVTSPISNCKNGDVDVHTSNNVREHQDVRDNTENTNSNIENLMEDYERK 663
            +    ++     +     N  +    +    +V++ ++V +  E  + + +  + D  + 
Sbjct: 762  LGCDGNSMKAETLGGGTDNGSSDISGIKPPTDVKDLKEVDEVVERNSISDDKSLTDLIKN 821

Query: 662  ESAAPSTIDGGGDSSKLEEPSSHDKGAD---------TNIEESICLESGLVSEISLQEVD 510
            E   PS ID    ++ ++EP   D   D          N++  + L +   +++ +QE  
Sbjct: 822  ELCEPSHID---SNTAVKEPGCPDGSDDIPRGLPDQLNNMKHEVELRNDKAADV-IQEDF 877

Query: 509  VTGDKTTNLEARASNGKSVQIPVETTNPNIFSADEGDKRECNNQTHDL-EVFEPGSVLVE 333
            +  +K   +E   +N K +    E  +     +D   K +      DL ++FE G VLVE
Sbjct: 878  IIKNKLMTVE-EETNRKLLGTSAELDSSPGIKSDFTGKNDSEKGLCDLDDMFEVGCVLVE 936

Query: 332  FLRKEAACAAAHSLHRRCYGERIVSVSYVSHDLYITRFPR 213
            + R EA+C AAH LH R + +R+V V YV+ DLY  +FPR
Sbjct: 937  YGRTEASCMAAHCLHGRYFDDRVVVVGYVALDLYRMKFPR 976


>ref|XP_006857448.2| PREDICTED: splicing factor U2af large subunit A [Amborella
            trichopoda]
          Length = 930

 Score =  566 bits (1459), Expect = e-158
 Identities = 366/926 (39%), Positives = 505/926 (54%), Gaps = 40/926 (4%)
 Frame = -1

Query: 2870 EVAHKNSNEGTAARTRPISLEEVLSRRK-KKLNTEVKERASERRDSSEKDDNGVLSDRGT 2694
            E++    NEGT+ARTRP+++E+++ RR  KKL     ER   +   S  + +      G+
Sbjct: 18   ELSRNERNEGTSARTRPLTIEDLILRRNNKKLLETFGERTIGKTKKSVPETDATSDHSGS 77

Query: 2693 EXXXXXXXXXXXXXKRLEMTISKKEQEHVENKDKGNCDSVTKTKLKGNYSKSSRDKE--- 2523
            +              + ++  SKK+      K  G   +      KG   K  RDK    
Sbjct: 78   DTIHKRDSSSKDVKGKHDLDDSKKKGS--SKKKNGRLPTKEDGYSKGKEEKLHRDKGRDT 135

Query: 2522 -SKNEKQRHHKSRTSNRLGADSEKESEKKQAKVSAXXXXXXXXXXXXXXXXXXXXXSRTD 2346
              KNEK  HH+ +       D      KK                           S +D
Sbjct: 136  GGKNEKHGHHRGKL------DDHNTGSKKHHFSEVGVKDRHEERDKYKKESKKKHKSESD 189

Query: 2345 DKHRSVVDDSIPKRHDSGKLRDAEYSERKDRKKEHSRT-HTEEPKAKRRRSRSREYDWER 2169
            +K++   D  + ++ +  +  D +Y E   RKK+ +++ + +E + KRRRS SRE +  R
Sbjct: 190  EKYKLEKDGVVARKQEPSRREDDDYLEGNSRKKQSNQSSYHDETRPKRRRSESREPNRGR 249

Query: 2168 DR-SASMSPRAYGRTNH-GRDYDEPVFQXXXXXXXXXXXXXXKYRTSGNGGYSSGHYRKH 1995
            +R S S+SPR+  RT++ G  +DE  +               + R   NG  S+GHYR+H
Sbjct: 250  ERRSVSLSPRSRKRTSYRGWGHDESTYYSIKERVGRHHSETERSRKGSNGSSSNGHYRRH 309

Query: 1994 G--SGLGGYSPRKRRTEAAVRTPSPVKRSPEKKSSTWDQPPAGIIHGASGSMFASLQSPS 1821
            G  SGLGGYSPRKRR+EAAVRTPSP+ RSPE+KS+ WD PP G+      S   SLQS S
Sbjct: 310  GNASGLGGYSPRKRRSEAAVRTPSPMVRSPERKSAAWDLPPVGLDTTGVISNVGSLQSSS 369

Query: 1820 SKILELTSSAAVTPTVSKPQPVLP---LDATST---LISATIDSVQLTQATRPMRRLYIE 1659
            S+ +  + +  +   VS     L    L++T T   +     DSVQLTQATRP RRLY+E
Sbjct: 370  SRQVVTSQTHELPKVVSFASSALNSSMLNSTKTGILIAENPFDSVQLTQATRPSRRLYLE 429

Query: 1658 NLPSSASEKSVIDCLNDFFLSSGVNHIQGAKPCISCIINKEKCQAVVEFLTPEDATSAVS 1479
            N+P+SAS++SV++CLN+F LSSG   I+G  PCISC+INKEK QA+VEFLTPE+AT+A++
Sbjct: 430  NIPASASDESVVECLNNFLLSSGAIRIKGTHPCISCLINKEKGQALVEFLTPENATAALA 489

Query: 1478 FDGRSLSGSVLKIRRPKDFVEAAIGVPEKPVPTVKAISEVVKDSPHKIFIGGISKALSSD 1299
            FDG+S+SGS++KIRRPKDF+E      EKPV TV A+S++VKDSPHKIFIGGI K+LSSD
Sbjct: 490  FDGKSISGSIVKIRRPKDFIETPAVATEKPVATVDAVSDIVKDSPHKIFIGGIPKSLSSD 549

Query: 1298 MLLEVVGAFGLLRAYHYEFNEELNGPCAFLEYEDHSITQKACAGLNGMKLGGHVLTAVQA 1119
             L E+V  FG L+AYH+E N E  G CAFLEY D SIT KACAGLNGMKLGG VLT VQA
Sbjct: 550  KLQEIVSVFGHLKAYHFEVNRESGGSCAFLEYTDQSITLKACAGLNGMKLGGCVLTVVQA 609

Query: 1118 FLDSNGEEDDAENIPSYDIPSHARSLLTDSTQVLQLKNMLNPEEFMXXXXXXXXXXXEDV 939
            F D + EE  ++  PSY IP HA+ LL + TQ+L+LKN+ N ++             ED+
Sbjct: 610  FPDVSAEE-ISKGPPSYGIPQHAKPLLKEPTQILKLKNVFNMDDL---SESEIEESLEDI 665

Query: 938  RLECTRFGTVKSINLVRFNSKIEGA------TESSEQRVPHADSA---KNLSAVTSPISN 786
            R+ECTRFGTVKS+N++R +   E A      T +++   P  D     + L +V S I  
Sbjct: 666  RIECTRFGTVKSVNIIRLSKSSEEAPNMTITTGNNDSPGPKQDPTQIMEKLDSVNSDILG 725

Query: 785  CKNGDV-DVHTSNNVREHQDVRDN---------TENTNSNIENLMEDYERKESAAPSTID 636
             K   + ++  S+ V     + D              + N+E +    E+       T D
Sbjct: 726  AKQDSLHELEKSDPVNCDMQMSDQDPIQEIEIWEPGYSENVEIVASIDEKTRDLEMITDD 785

Query: 635  GGGDSSKLEEPSSHDKGADTNIEESICLESGLVSEISLQEVDVTGDKTTNLEARASNGKS 456
                  K +E  S     +            L   +SLQ  +   + T +L  +    + 
Sbjct: 786  KDEHLLKNKEDESGTSNCEQTTLAGDDASDQLPCSLSLQ-YNNAHEPTFSLSQQDRVSEE 844

Query: 455  VQIPVETT-NPNIFSADEG----DKRECNNQTHDLEVFEPGSVLVEFLRKEAACAAAHSL 291
             Q   E   +  +   D G    D++   N + D + F+PG VLVE+ RKEAAC AAH L
Sbjct: 845  FQKKCEAPGSMKLEDFDMGSSGDDQKTMINPSSDFDAFQPGCVLVEYSRKEAACLAAHCL 904

Query: 290  HRRCYGERIVSVSYVSHDLYITRFPR 213
            H R YG+  V+V YV++DLY  RFPR
Sbjct: 905  HGRLYGDHRVAVEYVAYDLYRARFPR 930


>gb|ERN18915.1| hypothetical protein AMTR_s00067p00176230 [Amborella trichopoda]
          Length = 928

 Score =  566 bits (1459), Expect = e-158
 Identities = 366/926 (39%), Positives = 505/926 (54%), Gaps = 40/926 (4%)
 Frame = -1

Query: 2870 EVAHKNSNEGTAARTRPISLEEVLSRRK-KKLNTEVKERASERRDSSEKDDNGVLSDRGT 2694
            E++    NEGT+ARTRP+++E+++ RR  KKL     ER   +   S  + +      G+
Sbjct: 16   ELSRNERNEGTSARTRPLTIEDLILRRNNKKLLETFGERTIGKTKKSVPETDATSDHSGS 75

Query: 2693 EXXXXXXXXXXXXXKRLEMTISKKEQEHVENKDKGNCDSVTKTKLKGNYSKSSRDKE--- 2523
            +              + ++  SKK+      K  G   +      KG   K  RDK    
Sbjct: 76   DTIHKRDSSSKDVKGKHDLDDSKKKGS--SKKKNGRLPTKEDGYSKGKEEKLHRDKGRDT 133

Query: 2522 -SKNEKQRHHKSRTSNRLGADSEKESEKKQAKVSAXXXXXXXXXXXXXXXXXXXXXSRTD 2346
              KNEK  HH+ +       D      KK                           S +D
Sbjct: 134  GGKNEKHGHHRGKL------DDHNTGSKKHHFSEVGVKDRHEERDKYKKESKKKHKSESD 187

Query: 2345 DKHRSVVDDSIPKRHDSGKLRDAEYSERKDRKKEHSRT-HTEEPKAKRRRSRSREYDWER 2169
            +K++   D  + ++ +  +  D +Y E   RKK+ +++ + +E + KRRRS SRE +  R
Sbjct: 188  EKYKLEKDGVVARKQEPSRREDDDYLEGNSRKKQSNQSSYHDETRPKRRRSESREPNRGR 247

Query: 2168 DR-SASMSPRAYGRTNH-GRDYDEPVFQXXXXXXXXXXXXXXKYRTSGNGGYSSGHYRKH 1995
            +R S S+SPR+  RT++ G  +DE  +               + R   NG  S+GHYR+H
Sbjct: 248  ERRSVSLSPRSRKRTSYRGWGHDESTYYSIKERVGRHHSETERSRKGSNGSSSNGHYRRH 307

Query: 1994 G--SGLGGYSPRKRRTEAAVRTPSPVKRSPEKKSSTWDQPPAGIIHGASGSMFASLQSPS 1821
            G  SGLGGYSPRKRR+EAAVRTPSP+ RSPE+KS+ WD PP G+      S   SLQS S
Sbjct: 308  GNASGLGGYSPRKRRSEAAVRTPSPMVRSPERKSAAWDLPPVGLDTTGVISNVGSLQSSS 367

Query: 1820 SKILELTSSAAVTPTVSKPQPVLP---LDATST---LISATIDSVQLTQATRPMRRLYIE 1659
            S+ +  + +  +   VS     L    L++T T   +     DSVQLTQATRP RRLY+E
Sbjct: 368  SRQVVTSQTHELPKVVSFASSALNSSMLNSTKTGILIAENPFDSVQLTQATRPSRRLYLE 427

Query: 1658 NLPSSASEKSVIDCLNDFFLSSGVNHIQGAKPCISCIINKEKCQAVVEFLTPEDATSAVS 1479
            N+P+SAS++SV++CLN+F LSSG   I+G  PCISC+INKEK QA+VEFLTPE+AT+A++
Sbjct: 428  NIPASASDESVVECLNNFLLSSGAIRIKGTHPCISCLINKEKGQALVEFLTPENATAALA 487

Query: 1478 FDGRSLSGSVLKIRRPKDFVEAAIGVPEKPVPTVKAISEVVKDSPHKIFIGGISKALSSD 1299
            FDG+S+SGS++KIRRPKDF+E      EKPV TV A+S++VKDSPHKIFIGGI K+LSSD
Sbjct: 488  FDGKSISGSIVKIRRPKDFIETPAVATEKPVATVDAVSDIVKDSPHKIFIGGIPKSLSSD 547

Query: 1298 MLLEVVGAFGLLRAYHYEFNEELNGPCAFLEYEDHSITQKACAGLNGMKLGGHVLTAVQA 1119
             L E+V  FG L+AYH+E N E  G CAFLEY D SIT KACAGLNGMKLGG VLT VQA
Sbjct: 548  KLQEIVSVFGHLKAYHFEVNRESGGSCAFLEYTDQSITLKACAGLNGMKLGGCVLTVVQA 607

Query: 1118 FLDSNGEEDDAENIPSYDIPSHARSLLTDSTQVLQLKNMLNPEEFMXXXXXXXXXXXEDV 939
            F D + EE  ++  PSY IP HA+ LL + TQ+L+LKN+ N ++             ED+
Sbjct: 608  FPDVSAEE-ISKGPPSYGIPQHAKPLLKEPTQILKLKNVFNMDDL---SESEIEESLEDI 663

Query: 938  RLECTRFGTVKSINLVRFNSKIEGA------TESSEQRVPHADSA---KNLSAVTSPISN 786
            R+ECTRFGTVKS+N++R +   E A      T +++   P  D     + L +V S I  
Sbjct: 664  RIECTRFGTVKSVNIIRLSKSSEEAPNMTITTGNNDSPGPKQDPTQIMEKLDSVNSDILG 723

Query: 785  CKNGDV-DVHTSNNVREHQDVRDN---------TENTNSNIENLMEDYERKESAAPSTID 636
             K   + ++  S+ V     + D              + N+E +    E+       T D
Sbjct: 724  AKQDSLHELEKSDPVNCDMQMSDQDPIQEIEIWEPGYSENVEIVASIDEKTRDLEMITDD 783

Query: 635  GGGDSSKLEEPSSHDKGADTNIEESICLESGLVSEISLQEVDVTGDKTTNLEARASNGKS 456
                  K +E  S     +            L   +SLQ  +   + T +L  +    + 
Sbjct: 784  KDEHLLKNKEDESGTSNCEQTTLAGDDASDQLPCSLSLQ-YNNAHEPTFSLSQQDRVSEE 842

Query: 455  VQIPVETT-NPNIFSADEG----DKRECNNQTHDLEVFEPGSVLVEFLRKEAACAAAHSL 291
             Q   E   +  +   D G    D++   N + D + F+PG VLVE+ RKEAAC AAH L
Sbjct: 843  FQKKCEAPGSMKLEDFDMGSSGDDQKTMINPSSDFDAFQPGCVLVEYSRKEAACLAAHCL 902

Query: 290  HRRCYGERIVSVSYVSHDLYITRFPR 213
            H R YG+  V+V YV++DLY  RFPR
Sbjct: 903  HGRLYGDHRVAVEYVAYDLYRARFPR 928


>ref|XP_012069885.1| PREDICTED: splicing factor U2af large subunit A [Jatropha curcas]
          Length = 908

 Score =  558 bits (1437), Expect = e-155
 Identities = 365/917 (39%), Positives = 509/917 (55%), Gaps = 25/917 (2%)
 Frame = -1

Query: 2888 KTNNSGEVAHKNSNEGTAARTRPISLEEVLSRRKKKLNTEVKERASERRDSSEKDDNGVL 2709
            K   S E+   + +EGTAARTRP+S +E++S+RK K      E+     D S   +   L
Sbjct: 14   KYEKSSEIPKDHYHEGTAARTRPLSFDEIMSKRKNKKKLFENEKVGVVEDISRDGNTEKL 73

Query: 2708 SDRGTEXXXXXXXXXXXXXKRL-----EMTISKKEQ------EHVENKDKGNCDSVTKTK 2562
            +D+                K       + ++ KKE+      ++ +  D+   DS  + K
Sbjct: 74   NDQSRRGNSRSKDSSHGVKKHFPEGDAKASLKKKEKNTFMKDDYSKRNDRELGDS--EIK 131

Query: 2561 LKGNYSKSSRDKESKNEKQRHHKSRTSNRLGADSEKESEKKQAKVSAXXXXXXXXXXXXX 2382
            LK    K  + K   +EK    + R   R      +  +K                    
Sbjct: 132  LKAKVDKDLKAKGKSDEKNYGSRKRDEGRSNNVENEALKKHSRDFPEKDRHMNGTVGKSE 191

Query: 2381 XXXXXXXXSRTDDKHRSVVDDSIPKRHDSGKLRDAEYSERKDRKKEHSRTHTEEPKAKRR 2202
                    S  D+K+R   D    ++HD GK+ D+E S+RK+RK E S++  EE   KRR
Sbjct: 192  RENKRKYRSGADEKNR---DRYTTRKHDLGKVHDSETSDRKNRK-ELSKSRYEELNLKRR 247

Query: 2201 RSRSREYDWERDRSASMSPRAYGRTNH-GRDYDEPVFQXXXXXXXXXXXXXXKYRTSGNG 2025
            RSRSRE+   + RS S  PR+    ++  R+++EP                   R   NG
Sbjct: 248  RSRSREHVDGKKRSISPFPRSQKHVSYYSREHEEPTSSLKGRSERPHSDTDKS-RVLNNG 306

Query: 2024 GYSSGHYRKHG---SGLGGYSPRKRRTEAAVRTPSPVKRSPEKKSSTWDQPPAGIIHGAS 1854
              SSGHY++HG   SGLGGYSPRKRRT+ A +TPSP KRSPEKKS+ WD  PA   +  S
Sbjct: 307  --SSGHYKRHGGSTSGLGGYSPRKRRTDNAAKTPSPPKRSPEKKSAKWDLAPAVADNTFS 364

Query: 1853 GSMFASLQSPS----SKILELTSSAAVTPTVSKPQPVLPLDATSTLISATIDSVQLTQAT 1686
             S+ ++ Q  +    S + E  S+ +   T+ KP  V P    ST  + +IDSVQLTQAT
Sbjct: 365  VSIPSNFQLSNQLTPSNMHEAISAVSFASTILKPLSV-PFGILSTNKNDSIDSVQLTQAT 423

Query: 1685 RPMRRLYIENLPSSASEKSVIDCLNDFFLSSGVNHIQGAKPCISCIINKEKCQAVVEFLT 1506
            RPMRRLY+EN+P+SASEK+V++ LN+F +SSGVNHIQG +PCISCII+KEK QA+VEFLT
Sbjct: 424  RPMRRLYVENIPASASEKAVMEFLNNFLISSGVNHIQGTQPCISCIIHKEKGQALVEFLT 483

Query: 1505 PEDATSAVSFDGRSLSGSVLKIRRPKDFVEAAIGVPEKPVPTVKAISEVVKDSPHKIFIG 1326
            PEDA++A+SFDGRS  GS++KIRRPKDFVEAA G  EK V  V AIS +V D+PHKIFIG
Sbjct: 484  PEDASAALSFDGRSFCGSIIKIRRPKDFVEAATGELEKSVAAVDAISCIVNDTPHKIFIG 543

Query: 1325 GISKALSSDMLLEVVGAFGLLRAYHYEFNEELNGPCAFLEYEDHSITQKACAGLNGMKLG 1146
            G SKA SS M++E+  AFG L+AYH+E +++L+ PCAFLEY D SIT KACAGLNGMKLG
Sbjct: 544  GFSKAFSSKMIMEIASAFGPLKAYHFENSDDLSEPCAFLEYADQSITLKACAGLNGMKLG 603

Query: 1145 GHVLTAVQAFLDSNGEEDDAENIPSYDIPSHARSLLTDSTQVLQLKNMLNPEEFMXXXXX 966
            G V+TAVQA  ++    +   N PSY IP  A++LL   T+VL+LKN+ +P+ F      
Sbjct: 604  GQVVTAVQAVPNAPALANSG-NPPSYGIPEQAKALLKKPTEVLRLKNVFDPDAFPSLSHV 662

Query: 965  XXXXXXEDVRLECTRFGTVKSINLVRFNSKIEGATESSEQRVPHADSAKNLSAVTSPISN 786
                  EDVRLECTRFGTVKS+N+V++++    ++ +    +   D   +L  +      
Sbjct: 663  EIEEVLEDVRLECTRFGTVKSVNVVKYSAAPISSSVACGV-IEDVDLPGSLQKLV----- 716

Query: 785  CKNGDVDVHTSNNVREHQDVRDNTENTNSNIENLMEDYERKESAAPSTIDGG---GDSSK 615
            C     +  T     E + V  N    +    ++MED    E+  P   D      + S 
Sbjct: 717  CNEAYAETVTIKQTAEPKIVESNGIEDDKPGGSVMED----ETCHPGQSDSNVVVDNQSA 772

Query: 614  LEEPSSHDKGADTNIEESICLESGLVSEISLQEVDVTGDKTTNLEA--RASNGKSVQIPV 441
               P S +    T+ +ES C    ++  I +++ ++        E+     +GKS ++ V
Sbjct: 773  NSIPDSQEHFQKTSKDESECFGDKVIDNIQIKDRNLEDQLPIREESDLEEVSGKSKELLV 832

Query: 440  ETTNPNIFSADEGDKRECNNQTHDLE-VFEPGSVLVEFLRKEAACAAAHSLHRRCYGERI 264
            +  +  +  +D  +K +C  Q  D + +FE G V VEF R EA+C AAH LH R +    
Sbjct: 833  D-DHDYMIGSDAIEKGDCEQQNCDPDHIFESGCVFVEFRRTEASCMAAHCLHGRLFDSHT 891

Query: 263  VSVSYVSHDLYITRFPR 213
            V+V YV  D+Y  RFP+
Sbjct: 892  VTVEYVPLDVYRARFPK 908


>gb|KDP40369.1| hypothetical protein JCGZ_02367 [Jatropha curcas]
          Length = 904

 Score =  558 bits (1437), Expect = e-155
 Identities = 365/917 (39%), Positives = 509/917 (55%), Gaps = 25/917 (2%)
 Frame = -1

Query: 2888 KTNNSGEVAHKNSNEGTAARTRPISLEEVLSRRKKKLNTEVKERASERRDSSEKDDNGVL 2709
            K   S E+   + +EGTAARTRP+S +E++S+RK K      E+     D S   +   L
Sbjct: 10   KYEKSSEIPKDHYHEGTAARTRPLSFDEIMSKRKNKKKLFENEKVGVVEDISRDGNTEKL 69

Query: 2708 SDRGTEXXXXXXXXXXXXXKRL-----EMTISKKEQ------EHVENKDKGNCDSVTKTK 2562
            +D+                K       + ++ KKE+      ++ +  D+   DS  + K
Sbjct: 70   NDQSRRGNSRSKDSSHGVKKHFPEGDAKASLKKKEKNTFMKDDYSKRNDRELGDS--EIK 127

Query: 2561 LKGNYSKSSRDKESKNEKQRHHKSRTSNRLGADSEKESEKKQAKVSAXXXXXXXXXXXXX 2382
            LK    K  + K   +EK    + R   R      +  +K                    
Sbjct: 128  LKAKVDKDLKAKGKSDEKNYGSRKRDEGRSNNVENEALKKHSRDFPEKDRHMNGTVGKSE 187

Query: 2381 XXXXXXXXSRTDDKHRSVVDDSIPKRHDSGKLRDAEYSERKDRKKEHSRTHTEEPKAKRR 2202
                    S  D+K+R   D    ++HD GK+ D+E S+RK+RK E S++  EE   KRR
Sbjct: 188  RENKRKYRSGADEKNR---DRYTTRKHDLGKVHDSETSDRKNRK-ELSKSRYEELNLKRR 243

Query: 2201 RSRSREYDWERDRSASMSPRAYGRTNH-GRDYDEPVFQXXXXXXXXXXXXXXKYRTSGNG 2025
            RSRSRE+   + RS S  PR+    ++  R+++EP                   R   NG
Sbjct: 244  RSRSREHVDGKKRSISPFPRSQKHVSYYSREHEEPTSSLKGRSERPHSDTDKS-RVLNNG 302

Query: 2024 GYSSGHYRKHG---SGLGGYSPRKRRTEAAVRTPSPVKRSPEKKSSTWDQPPAGIIHGAS 1854
              SSGHY++HG   SGLGGYSPRKRRT+ A +TPSP KRSPEKKS+ WD  PA   +  S
Sbjct: 303  --SSGHYKRHGGSTSGLGGYSPRKRRTDNAAKTPSPPKRSPEKKSAKWDLAPAVADNTFS 360

Query: 1853 GSMFASLQSPS----SKILELTSSAAVTPTVSKPQPVLPLDATSTLISATIDSVQLTQAT 1686
             S+ ++ Q  +    S + E  S+ +   T+ KP  V P    ST  + +IDSVQLTQAT
Sbjct: 361  VSIPSNFQLSNQLTPSNMHEAISAVSFASTILKPLSV-PFGILSTNKNDSIDSVQLTQAT 419

Query: 1685 RPMRRLYIENLPSSASEKSVIDCLNDFFLSSGVNHIQGAKPCISCIINKEKCQAVVEFLT 1506
            RPMRRLY+EN+P+SASEK+V++ LN+F +SSGVNHIQG +PCISCII+KEK QA+VEFLT
Sbjct: 420  RPMRRLYVENIPASASEKAVMEFLNNFLISSGVNHIQGTQPCISCIIHKEKGQALVEFLT 479

Query: 1505 PEDATSAVSFDGRSLSGSVLKIRRPKDFVEAAIGVPEKPVPTVKAISEVVKDSPHKIFIG 1326
            PEDA++A+SFDGRS  GS++KIRRPKDFVEAA G  EK V  V AIS +V D+PHKIFIG
Sbjct: 480  PEDASAALSFDGRSFCGSIIKIRRPKDFVEAATGELEKSVAAVDAISCIVNDTPHKIFIG 539

Query: 1325 GISKALSSDMLLEVVGAFGLLRAYHYEFNEELNGPCAFLEYEDHSITQKACAGLNGMKLG 1146
            G SKA SS M++E+  AFG L+AYH+E +++L+ PCAFLEY D SIT KACAGLNGMKLG
Sbjct: 540  GFSKAFSSKMIMEIASAFGPLKAYHFENSDDLSEPCAFLEYADQSITLKACAGLNGMKLG 599

Query: 1145 GHVLTAVQAFLDSNGEEDDAENIPSYDIPSHARSLLTDSTQVLQLKNMLNPEEFMXXXXX 966
            G V+TAVQA  ++    +   N PSY IP  A++LL   T+VL+LKN+ +P+ F      
Sbjct: 600  GQVVTAVQAVPNAPALANSG-NPPSYGIPEQAKALLKKPTEVLRLKNVFDPDAFPSLSHV 658

Query: 965  XXXXXXEDVRLECTRFGTVKSINLVRFNSKIEGATESSEQRVPHADSAKNLSAVTSPISN 786
                  EDVRLECTRFGTVKS+N+V++++    ++ +    +   D   +L  +      
Sbjct: 659  EIEEVLEDVRLECTRFGTVKSVNVVKYSAAPISSSVACGV-IEDVDLPGSLQKLV----- 712

Query: 785  CKNGDVDVHTSNNVREHQDVRDNTENTNSNIENLMEDYERKESAAPSTIDGG---GDSSK 615
            C     +  T     E + V  N    +    ++MED    E+  P   D      + S 
Sbjct: 713  CNEAYAETVTIKQTAEPKIVESNGIEDDKPGGSVMED----ETCHPGQSDSNVVVDNQSA 768

Query: 614  LEEPSSHDKGADTNIEESICLESGLVSEISLQEVDVTGDKTTNLEA--RASNGKSVQIPV 441
               P S +    T+ +ES C    ++  I +++ ++        E+     +GKS ++ V
Sbjct: 769  NSIPDSQEHFQKTSKDESECFGDKVIDNIQIKDRNLEDQLPIREESDLEEVSGKSKELLV 828

Query: 440  ETTNPNIFSADEGDKRECNNQTHDLE-VFEPGSVLVEFLRKEAACAAAHSLHRRCYGERI 264
            +  +  +  +D  +K +C  Q  D + +FE G V VEF R EA+C AAH LH R +    
Sbjct: 829  D-DHDYMIGSDAIEKGDCEQQNCDPDHIFESGCVFVEFRRTEASCMAAHCLHGRLFDSHT 887

Query: 263  VSVSYVSHDLYITRFPR 213
            V+V YV  D+Y  RFP+
Sbjct: 888  VTVEYVPLDVYRARFPK 904


>ref|XP_011030446.1| PREDICTED: uncharacterized protein LOC105129896 [Populus euphratica]
            gi|743858395|ref|XP_011030447.1| PREDICTED:
            uncharacterized protein LOC105129896 [Populus euphratica]
            gi|743858399|ref|XP_011030448.1| PREDICTED:
            uncharacterized protein LOC105129896 [Populus euphratica]
            gi|743858403|ref|XP_011030449.1| PREDICTED:
            uncharacterized protein LOC105129896 [Populus euphratica]
            gi|743858407|ref|XP_011030450.1| PREDICTED:
            uncharacterized protein LOC105129896 [Populus euphratica]
          Length = 919

 Score =  557 bits (1436), Expect = e-155
 Identities = 368/925 (39%), Positives = 509/925 (55%), Gaps = 33/925 (3%)
 Frame = -1

Query: 2888 KTNNSGEVAHKNSNEGTAARTRPISLEEVLSRRKKKLNTEV-------------KERASE 2748
            K   S E++  +  EGTAARTRP S +E++S RK K  +E+              E+AS+
Sbjct: 11   KYERSSEMSQDHQYEGTAARTRPFSFDEIMSIRKNKKASEILEGELKDILGGVINEKASD 70

Query: 2747 RRDSSEKDDNGVLSDRGTEXXXXXXXXXXXXXKR-LEMTISKKEQEHVENKDKGNCDSVT 2571
             R  SE+  NG   +  T               R  E  +S KE   V+ +D+   DS T
Sbjct: 71   HR--SERG-NGHNEESSTGLRQHLSEEHGKASYREKEDNVSMKEDYIVKGRDRDVRDSET 127

Query: 2570 KTKLKGNYSKSSRDKESKNEKQRHHKSRTSNRLGADSEKESEKKQAKVSAXXXXXXXXXX 2391
             +K K N    +  KE  NEK  H + +   R    SE E+ KK ++             
Sbjct: 128  NSKSKMNEDMRTEIKEKTNEKI-HDRRKVDKRPSNISESEAVKKHSRDMQKDRHVDKSRG 186

Query: 2390 XXXXXXXXXXXSRTDDKHRSVVDDSIPKRHDSGKLRDAEYSERKDRKKEHSRTHTEEPKA 2211
                       +  DDK R   D +  K+HD GK    E SERK+RK E S+ H EE + 
Sbjct: 187  KSERDRKEKYRNGIDDKSR---DRNAAKKHDLGKGHHLETSERKERK-ESSKYHHEELRL 242

Query: 2210 KRRRSRSREYDWERDRSASMSPRAYGRTN-HGRDYDEPVFQXXXXXXXXXXXXXXKYRTS 2034
            KRRRSRSRE++    RS S SPRA+   + H R++ E                    + +
Sbjct: 243  KRRRSRSREHEDRNRRSISRSPRAHKHGSYHKREHVELSSHSVKERSGRQQSDAENNQLT 302

Query: 2033 GNGGYSSGHYRKHG---SGLGGYSPRKRRTEAAVRTPSPVKRSPEKKSSTWDQPPAGIIH 1863
             +   SS H R+HG   SGLGGYSPRKR+TEAAV+TPSP KRSPEKKS+ WD  P     
Sbjct: 303  NSS--SSRHQRRHGGFASGLGGYSPRKRKTEAAVKTPSPTKRSPEKKSAKWDLAPEETSS 360

Query: 1862 GASGSMFASLQSP----SSKILELTSSAAVTPTVSKPQPVLPLDATSTLISATIDSVQLT 1695
                 + ++ QSP    SS I E+ S+  V     KP   + L + ST +  + +S+QLT
Sbjct: 361  VFPAVILSNFQSPNETASSNIHEVVSAVPVVSVPMKPLSGVSLSSLSTAMKVSTESIQLT 420

Query: 1694 QATRPMRRLYIENLPSSASEKSVIDCLNDFFLSSGVNHIQGAKPCISCIINKEKCQAVVE 1515
            QAT P+RRLY+EN+P+SASEK+V+DCLN+F +SSGVNHIQG +PCISCI+ KEK QA+VE
Sbjct: 421  QATHPIRRLYMENIPASASEKAVMDCLNNFLISSGVNHIQGTQPCISCIMQKEKGQALVE 480

Query: 1514 FLTPEDATSAVSFDGRSLSGSVLKIRRPKDFVEAAIGVPEKPVPTVKAISEVVKDSPHKI 1335
            FLTPEDA++A+SFDGRS SGS++K+RRPKDF+E A G  EK    + AI ++VKDSPHKI
Sbjct: 481  FLTPEDASAALSFDGRSFSGSIIKVRRPKDFIEVATGELEKSAAAIDAIGDIVKDSPHKI 540

Query: 1334 FIGGISKALSSDMLLEVVGAFGLLRAYHYEFNEELNGPCAFLEYEDHSITQKACAGLNGM 1155
            FIGGISK LSS ML+E+  AFG L+AY +E +++ + P AFLEY D SIT KACAGLNGM
Sbjct: 541  FIGGISKVLSSKMLMEIASAFGPLKAYQFENSKDSDEPFAFLEYADKSITFKACAGLNGM 600

Query: 1154 KLGGHVLTAVQAFLDSNGEEDDAENIPSYDIPSHARSLLTDSTQVLQLKNMLNPEEFMXX 975
            KLGG V+TA++A  +++    D  N     I  HA++LL   T+VL+LKN+ + E     
Sbjct: 601  KLGGQVITAIRAVPNASSSGSDG-NPQFGQISQHAKALLEKPTEVLKLKNVFDSESLSSL 659

Query: 974  XXXXXXXXXEDVRLECTRFGTVKSINLVRFNSKIEGATESSE--QRVPHADSAKNLSA-V 804
                     +DVRLEC RFG+VKSIN+V++ +     ++S E        ++ ++L    
Sbjct: 660  SNTEVEEVLDDVRLECARFGSVKSINVVKYAAITISTSKSCEFNDDTVSTEATQSLGCDG 719

Query: 803  TSPISNCKNGDVD--VHTSNNVREHQDVRDNTENTNSNIENLMEDYERKESAAPSTIDGG 630
            T+P +   +G +D      N++ + +   D  E        +  D   ++ A  S+ D  
Sbjct: 720  TNPRTRNISGSIDQKFMEGNSIGDDKPASDVMEEEPCQPGQVDSDMAVQDLACKSSSDSQ 779

Query: 629  GDSSKLEEPSSHDKGADTNIEESICLE--SGLVSEISLQEVD----VTGDKTTNLEARAS 468
                 + + S+ DK  D    E + +E  S    +++L+EV     + G++    E    
Sbjct: 780  EPPQDVSD-SNVDKVTDDIEIEGVHVENKSKAGEDLNLKEVGDNKLMAGEELNPEEVSGD 838

Query: 467  NGKSVQIPVETTNPNIFSADEGDKRECNNQTHDLEVFEPGSVLVEFLRKEAACAAAHSLH 288
              K+         PN     +  +++CN       +FEPG V VEF R EAAC AAH LH
Sbjct: 839  VEKAFVNDSLEMKPNSIEKGDCKEQDCNLGL----IFEPGCVFVEFRRTEAACMAAHCLH 894

Query: 287  RRCYGERIVSVSYVSHDLYITRFPR 213
             R + +R V V YV  D+Y+ RFP+
Sbjct: 895  GRLFDDRAVVVEYVPLDIYLARFPK 919


>emb|CBI23686.3| unnamed protein product [Vitis vinifera]
          Length = 882

 Score =  552 bits (1423), Expect = e-154
 Identities = 361/915 (39%), Positives = 507/915 (55%), Gaps = 34/915 (3%)
 Frame = -1

Query: 2855 NSNEGTAARTRPISLEEVLSRRK-KKLNTEVKERASERRDSSEKD-----DNGVLSDRGT 2694
            N  EGTAARTRP S +E++ RRK KKL  +VK+ A E  + S KD      +   SD+G 
Sbjct: 52   NFKEGTAARTRPFSFDEIMLRRKNKKLYGDVKDGAGEEGNISRKDIVKNVSDCYESDKGY 111

Query: 2693 EXXXXXXXXXXXXXKRLEMTISKKEQEHVEN---------KDKGNCDSVTKTKLKGNYSK 2541
            +                 + +S ++++   +         KDK + +S  K K K N   
Sbjct: 112  KHNEDSFPGAINHSSEDFVKVSSRKKDENTSMKEGNLGNVKDKESHNSEDKLKAKPNKGM 171

Query: 2540 SSRDKESKNEKQRHHKSR---TSNRLGADSEKESEKKQAKVSAXXXXXXXXXXXXXXXXX 2370
            + + KE K  ++ H + +    S R   DSE E EKK ++ S                  
Sbjct: 172  TDKSKEGKINQRVHGRKKIDERSRRSSDDSESEPEKKFSRDSVGKDRYADRSRKSEKESK 231

Query: 2369 XXXXSRTDDKHRSVVDDSIPKRHDSGKLRDAEYSERKDRKKEHSRTHTEEPKAKRRRSRS 2190
                +  D+K+R   + +  K+HD GK  ++E+ +RK+R+        E P ++R+ S  
Sbjct: 232  RKHRTGEDEKNR---ERNSMKKHDPGKRHESEFLDRKERR--------ESPPSRRQHS-- 278

Query: 2189 REYDWERDRSASMSPRAYGRTNHGRDYDEPVFQXXXXXXXXXXXXXXKYRTSGNGGYSSG 2010
                 + DR+                                       R S NG  SS 
Sbjct: 279  -----DADRN---------------------------------------RISNNG--SSS 292

Query: 2009 HYRKHG---SGLGGYSPRKRRTEAAVRTPSPVKRSPEKKSSTWDQPPAGIIHGASGSMFA 1839
            H+R+HG   SGLGGYSPRKRRTEAA++TPSP  RSPEKKS+ WD PP+      +GS+ +
Sbjct: 293  HFRRHGGSASGLGGYSPRKRRTEAAIKTPSPTNRSPEKKSAGWDLPPSRTDGMNAGSVLS 352

Query: 1838 SLQSPSSKILELTSSAAVTPTVSKPQ-PVLPLDATSTLISATIDSVQLTQATRPMRRLYI 1662
            + + PS+    +  +  VT T +KP  P +  DA S   + +IDS+QLTQATRPMRRLY+
Sbjct: 353  N-ELPSA----VPVAVPVTATTAKPPLPRIYSDAVSKNKNVSIDSIQLTQATRPMRRLYV 407

Query: 1661 ENLPSSASEKSVIDCLNDFFLSSGVNHIQGAKPCISCIINKEKCQAVVEFLTPEDATSAV 1482
            ENLP S+SEK++++CLN+F LSSG+NH+QG  PCISCII+KEK QA+VEFLTPEDA++A+
Sbjct: 408  ENLPVSSSEKALMECLNNFLLSSGINHVQGTPPCISCIIHKEKGQALVEFLTPEDASAAL 467

Query: 1481 SFDGRSLSGSVLKIRRPKDFVEAAIGVPEKPVPTVKAISEVVKDSPHKIFIGGISKALSS 1302
            SFDG S SGS+LKIRRPKDFV+   GV EK V    AIS++VKDSPHKIFIGGIS+ALSS
Sbjct: 468  SFDGISFSGSILKIRRPKDFVDMT-GVQEKLVAAPDAISDIVKDSPHKIFIGGISRALSS 526

Query: 1301 DMLLEVVGAFGLLRAYHYEFNEELNGPCAFLEYEDHSITQKACAGLNGMKLGGHVLTAVQ 1122
            DML+E+  AFG L+AY ++ NE+L  PCAFLEY D S+T KACAGLNGMKLGG VLT VQ
Sbjct: 527  DMLMEIAAAFGPLKAYRFQVNEDLGEPCAFLEYVDQSVTLKACAGLNGMKLGGQVLTVVQ 586

Query: 1121 AFLDSNGEEDDAENIPSYDIPSHARSLLTDSTQVLQLKNMLNPEEFMXXXXXXXXXXXED 942
            A  ++   E+   N+P Y IP HA+ LL   TQVL+LKN++NP++             ED
Sbjct: 587  AIPNALAMENTG-NLPFYGIPEHAKPLLERPTQVLKLKNVVNPDDLSSLSEAELEEILED 645

Query: 941  VRLECTRFGTVKSINLVRFNSKIEGATESSEQRVPHADSAKNLSAVTSPISN--CKNGDV 768
            +RLECTRFGTVKS+N+V++N+              H  + +   A  +  SN  C    +
Sbjct: 646  IRLECTRFGTVKSVNIVKYNNS-------------HVSTLEVYEAADNTGSNLGCDGNSM 692

Query: 767  DVHTSNNVREHQDVRDNTENTNSNIENLMEDYERKESAAPSTIDGGGDSSKLEEPSSHDK 588
               T     ++  + +  E  + + +  + D  + E   PS ID    ++ ++EP   D 
Sbjct: 693  KAETLGGGTDNGSIDEVVERNSISDDKSLTDLIKNELCEPSHID---SNTAVKEPGCPDG 749

Query: 587  GAD---------TNIEESICLESGLVSEISLQEVDVTGDKTTNLEARASNGKSVQIPVET 435
              D          N++  + L +   +++ +QE  +  +K   +E   +N K +    E 
Sbjct: 750  SDDIPRGLPDQLNNMKHEVELRNDKAADV-IQEDFIIKNKLMTVE-EETNRKLLGTSAEL 807

Query: 434  TNPNIFSADEGDKRECNNQTHDL-EVFEPGSVLVEFLRKEAACAAAHSLHRRCYGERIVS 258
             +     +D   K +      DL ++FE G VLVE+ R EA+C AAH LH R + +R+V 
Sbjct: 808  DSSPGIKSDFTGKNDSEKGLCDLDDMFEVGCVLVEYGRTEASCMAAHCLHGRYFDDRVVV 867

Query: 257  VSYVSHDLYITRFPR 213
            V YV+ DLY  +FPR
Sbjct: 868  VGYVALDLYRMKFPR 882


>ref|XP_006489671.1| PREDICTED: splicing factor U2af large subunit B-like isoform X1
            [Citrus sinensis]
          Length = 967

 Score =  546 bits (1408), Expect = e-152
 Identities = 373/986 (37%), Positives = 526/986 (53%), Gaps = 91/986 (9%)
 Frame = -1

Query: 2897 KMSKTNNSGEVAHKNSNEGTAARTRPISLEEVLSRRK-KKLNTEVKERASE--------- 2748
            K  K    GE++  +SNEG+AARTRP S +E++ RRK K L+  +KE A +         
Sbjct: 7    KKEKHGKRGELSLADSNEGSAARTRPFSFDEIMIRRKTKNLSENIKEDAMDVSNIPGDCT 66

Query: 2747 -------------------------------RRDSSEKDDNGVLSD-----------RGT 2694
                                           +  S +K+DN +  +           R  
Sbjct: 67   IENVSDDHKSGRHHGHKKSPGVDVHTSEEYVKVGSGKKEDNALTKNVDSSRQRDGENRDL 126

Query: 2693 EXXXXXXXXXXXXXKR-LEMTISKKEQEHVENKDKGNCDSVTKTKLKGNYSKSSRDKES- 2520
            E             +R L+  I  K++     KD+GN  S   TKLK + +K +  K + 
Sbjct: 127  ETKLKEDYVAKSNHRRTLKSEIRLKDRSPGNEKDRGNRGS--DTKLKADITKETSSKPNV 184

Query: 2519 KNEKQRHHKSRTSNRLGADSEKESEKKQAK-VSAXXXXXXXXXXXXXXXXXXXXXSRTDD 2343
            K+EK    + R+ N+   DS  E+ KK ++ ++                      +R D+
Sbjct: 185  KSEKLIPSQGRSHNQSIVDSRDEATKKHSRDLTGKDRHADKSGEKSERESKRKHRNRDDE 244

Query: 2342 KHRSVVDDSIPKRHDSGKLRDAEYSERKDRKKEHSRTHTEEPKAKRRRSRSREYDWERDR 2163
            K+R   D S  K+ D GK  D + SERK+ KKE  ++     + KRRRSRSRE +    R
Sbjct: 245  KNR---DKSAAKKVDLGKGHDLKVSERKE-KKESPKSRHGNSRLKRRRSRSREREDRNRR 300

Query: 2162 SASMSPRAYGRTNH-GRDYDEPVFQXXXXXXXXXXXXXXKYRTSGNGGYSSGHYRKHG-- 1992
            S S+SPRA  RT++  R+++                     R +GNG   SGHYR+H   
Sbjct: 301  SLSLSPRAQKRTSYYDREHEGLPSHSTKISSGRQHSDIDSSRVTGNG--LSGHYRRHDGS 358

Query: 1991 -SGLGGYSPRKRRTEAAVRTPSPVKRSPEKKSSTWDQPPAGIIHGASGSMFASLQSPSSK 1815
             SGLGGYSPRKRRTEAA +TPSP+ RSPEKKS+ WD  P    +    ++  S Q+ SS 
Sbjct: 359  TSGLGGYSPRKRRTEAAAKTPSPINRSPEKKSAKWDVAPVET-YSVPSNVHTSNQAASSN 417

Query: 1814 ILELTSSAAVTPTVSKPQPVLPLDATSTLISATIDSVQLTQATRPMRRLYIENLPSSASE 1635
              E+ SS  VT T  KP   + + A+   ++ ++DSVQLTQ+ RPMRRL +ENLP SASE
Sbjct: 418  AHEMVSSDPVTSTTQKPLAGISVSASLAKLNVSMDSVQLTQSNRPMRRLCVENLPVSASE 477

Query: 1634 KSVIDCLNDFFLSSGVNHIQGAKPCISCIINKEKCQAVVEFLTPEDATSAVSFDGRSLSG 1455
            K++++ LN+F LSSGV H+ G+ PCI C+I +EK QA VEFLT EDA++A+  DG S SG
Sbjct: 478  KALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSG 537

Query: 1454 SVLKIRRPKDFVEAAIGVPEKPVPTVKAISEVVKDSPHKIFIGGISKALSSDMLLEVVGA 1275
            S+LKI+RPK+FVE A G  EK V +V ++S +VKDSPHKIFIGGIS+ LSS M++E+V A
Sbjct: 538  SILKIKRPKEFVEVASGEAEKSVASVDSVSGIVKDSPHKIFIGGISRTLSSKMVMEIVCA 597

Query: 1274 FGLLRAYHYEFNEELNGPCAFLEYEDHSITQKACAGLNGMKLGGHVLTAVQAFLDSNGEE 1095
            FG L+AYH+E NE+   PCAF+EY D  +T KA AGLNG+K+GG VLTAVQA LD +   
Sbjct: 598  FGPLKAYHFEVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGS-IM 656

Query: 1094 DDAENIPSYDIPSHARSLLTDSTQVLQLKNMLNPEEFMXXXXXXXXXXXEDVRLECTRFG 915
            D++ N P + IP HA  LL   T+VL+LKN+ NPE F            EDVRLEC RFG
Sbjct: 657  DNSGNPPFHGIPKHALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFG 716

Query: 914  TVKSINLVRFNSK-------IEG----------------ATESSEQRVPHADSAKNLSAV 804
            TVKS+N+V++           EG                 T   ++R+      K++   
Sbjct: 717  TVKSVNVVKYGDSNIFTIQACEGNENTASAGVGQNLTNDETNEKQERLEEVTDHKSIKNN 776

Query: 803  TSPISNCKNGDVDVHTSNNVREHQDVRDNTENTNSNIENLMEDYERKESAAPST--IDGG 630
               I N     ++    NNV++++    +  +  S +  L  D   +  A  ST  I   
Sbjct: 777  ELEILNDSKEVMEAGEVNNVKDNRPASGSMGDEPSQLCELDTDMAVEYQAHDSTSEIVSQ 836

Query: 629  GDSSKL----EEPSSHDKGADTNIEESICLE---SGLVSEISLQEVDVTGDKTTNLEARA 471
            G  +++    +EP +HD     NI+     E   S    +++L+EV        N  + A
Sbjct: 837  GVPTQVNTLKDEPCAHDDKVTCNIQLEHMGEENKSSAKEDLNLEEV--------NGNSEA 888

Query: 470  SNGKSVQIPVETTNPNIFSADEGDKRECNNQTHDLEVFEPGSVLVEFLRKEAACAAAHSL 291
              G S ++ ++++     + + GD    N   +   +FEPG V VE+ R EA+C AAHSL
Sbjct: 889  FTGASNEMGMQSS-----AVENGDNE--NQDPNQGHIFEPGCVFVEYRRAEASCMAAHSL 941

Query: 290  HRRCYGERIVSVSYVSHDLYITRFPR 213
            HRR + +RIV+V Y+  +LY  RF +
Sbjct: 942  HRRLFDDRIVAVEYIPLNLYRARFSK 967


>gb|KDO54082.1| hypothetical protein CISIN_1g002093mg [Citrus sinensis]
          Length = 967

 Score =  543 bits (1399), Expect = e-151
 Identities = 369/986 (37%), Positives = 523/986 (53%), Gaps = 91/986 (9%)
 Frame = -1

Query: 2897 KMSKTNNSGEVAHKNSNEGTAARTRPISLEEVLSRRK-KKLNTEVKERASE--------- 2748
            K  K    GE +  +SNEG+AARTRP S +E++ RRK K L+  +KE A +         
Sbjct: 7    KKEKHGKRGEPSLADSNEGSAARTRPFSFDEIMIRRKTKNLSENIKEDAMDVSNIPGDCT 66

Query: 2747 -------------------------------RRDSSEKDDNGVLSD-----------RGT 2694
                                           +  S +K+DN +  +           R  
Sbjct: 67   IENVSDDHKSGRHHGHKKSPGVDVHTSEEYVKVGSGKKEDNALTKNVDSSRQRDGENRDL 126

Query: 2693 EXXXXXXXXXXXXXKR-LEMTISKKEQEHVENKDKGNCDSVTKTKLKGNYSKSSRDKES- 2520
            E             +R L+  I  K++     KD+GN  S   TKLK + +K +  K + 
Sbjct: 127  ETKLKEDYVAKSNHRRTLKSEIRLKDRSPGNEKDRGNRGS--DTKLKADVTKETSSKPNV 184

Query: 2519 KNEKQRHHKSRTSNRLGADSEKESEKKQAK-VSAXXXXXXXXXXXXXXXXXXXXXSRTDD 2343
            K+EK    + R+ N+   DS  E+ KK ++ ++                      +R D+
Sbjct: 185  KSEKLIPSQGRSHNQSIVDSRDEATKKHSRDLTGKDRHADKSGEKSERESKRKHRNRDDE 244

Query: 2342 KHRSVVDDSIPKRHDSGKLRDAEYSERKDRKKEHSRTHTEEPKAKRRRSRSREYDWERDR 2163
            K+R   D S  K+ D GK  D + SERK+ KKE  ++     + KRRRSRSRE +    R
Sbjct: 245  KNR---DKSAAKKVDLGKGHDLKVSERKE-KKESPKSRHGNSRLKRRRSRSREREDRNRR 300

Query: 2162 SASMSPRAYGRTNH-GRDYDEPVFQXXXXXXXXXXXXXXKYRTSGNGGYSSGHYRKHG-- 1992
            S S+SPRA  RT++  R+++                     R +GNG   SGHYR+H   
Sbjct: 301  SLSLSPRAQKRTSYYDREHEGLPSHSTKISSGRQHSDIDSSRVTGNG--LSGHYRRHDGS 358

Query: 1991 -SGLGGYSPRKRRTEAAVRTPSPVKRSPEKKSSTWDQPPAGIIHGASGSMFASLQSPSSK 1815
             SGLGGYSPRKRRTEAA +TPSP+ RSPEKKS+ WD  P    +    ++  S Q+ SS 
Sbjct: 359  TSGLGGYSPRKRRTEAAAKTPSPINRSPEKKSAKWDVAPVET-YSVPSNVHTSNQAASSN 417

Query: 1814 ILELTSSAAVTPTVSKPQPVLPLDATSTLISATIDSVQLTQATRPMRRLYIENLPSSASE 1635
              E+ SS  VT T  KP   + + A+   ++ ++DSVQLTQ+ RPMRRL +ENLP SASE
Sbjct: 418  AHEMVSSDPVTSTTQKPLAGISVSASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASE 477

Query: 1634 KSVIDCLNDFFLSSGVNHIQGAKPCISCIINKEKCQAVVEFLTPEDATSAVSFDGRSLSG 1455
            K++++ LN+F LSSGV H+ G+ PCI C+I +EK QA VEFLT EDA++A+  DG S SG
Sbjct: 478  KALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSG 537

Query: 1454 SVLKIRRPKDFVEAAIGVPEKPVPTVKAISEVVKDSPHKIFIGGISKALSSDMLLEVVGA 1275
            S+LKI+RPK+FVE A G  EK V +V ++S +VKDSPHKIFIGGIS+ LSS M++E+V A
Sbjct: 538  SILKIKRPKEFVEVASGEAEKSVASVDSVSGIVKDSPHKIFIGGISRTLSSKMVMEIVCA 597

Query: 1274 FGLLRAYHYEFNEELNGPCAFLEYEDHSITQKACAGLNGMKLGGHVLTAVQAFLDSNGEE 1095
            FG L+AYH+E NE+   PCAF+EY D  +T KA AGLNG+K+GG VLTAVQA LD +   
Sbjct: 598  FGPLKAYHFEVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGS-IM 656

Query: 1094 DDAENIPSYDIPSHARSLLTDSTQVLQLKNMLNPEEFMXXXXXXXXXXXEDVRLECTRFG 915
            D++ N P + IP HA  LL   T+VL+LKN+ NPE F            EDVRLEC RFG
Sbjct: 657  DNSGNPPFHGIPKHALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFG 716

Query: 914  TVKSINLVRFNSK-------IEG----------------ATESSEQRVPHADSAKNLSAV 804
            +VKS+N+V++           EG                 T    +R+      K++   
Sbjct: 717  SVKSVNVVKYGDSNISTIQACEGNENTASAGVGQNLTNDETNEKGERLEEVTDHKSIKNN 776

Query: 803  TSPISNCKNGDVDVHTSNNVREHQDVRDNTENTNSNIENLMED----YERKESAAPSTID 636
               I N     ++    NNV++++       +  S +  L  D    Y+ ++S +     
Sbjct: 777  ELEILNDSKEVMEAGEVNNVKDNRPASGTMGDEPSQLCELDTDMAVEYQARDSTSEIVSQ 836

Query: 635  GGGD--SSKLEEPSSHDKGADTNIEESICLE---SGLVSEISLQEVDVTGDKTTNLEARA 471
            G     ++  + P +HD     NI+     E   S    +++L+EV        N  + A
Sbjct: 837  GVPTQVNTLKDSPCAHDDKVTCNIQLEHMSEENKSSAKEDLNLEEV--------NGNSEA 888

Query: 470  SNGKSVQIPVETTNPNIFSADEGDKRECNNQTHDLEVFEPGSVLVEFLRKEAACAAAHSL 291
              G S ++ ++++     + + GD    N   +   +FEPG V VE++R EA+C AAHSL
Sbjct: 889  FTGASNEMGMQSS-----AVENGDNE--NQDPNQGHIFEPGCVFVEYMRAEASCMAAHSL 941

Query: 290  HRRCYGERIVSVSYVSHDLYITRFPR 213
            HRR + +RIV+V Y+  +LY  RF +
Sbjct: 942  HRRLFDDRIVAVEYIPLNLYRARFSK 967


>ref|XP_006420295.1| hypothetical protein CICLE_v10004248mg [Citrus clementina]
            gi|557522168|gb|ESR33535.1| hypothetical protein
            CICLE_v10004248mg [Citrus clementina]
          Length = 967

 Score =  542 bits (1397), Expect = e-151
 Identities = 368/986 (37%), Positives = 523/986 (53%), Gaps = 91/986 (9%)
 Frame = -1

Query: 2897 KMSKTNNSGEVAHKNSNEGTAARTRPISLEEVLSRRK-KKLNTEVKERASE--------- 2748
            K  K    GE +  +SNEG+AARTRP S +E++ RRK K L+  +KE A +         
Sbjct: 7    KKEKHGKRGEPSLADSNEGSAARTRPFSFDEIMIRRKAKNLSENIKEDAMDISNIPGDCT 66

Query: 2747 -------------------------------RRDSSEKDDNGVLSD-----------RGT 2694
                                           +  S +K+DN +  +           R  
Sbjct: 67   IENVSDDHKSGRHHGHKKYPGVDVHTSEEYVKVGSGKKEDNALTKNVDSSRQRDGENRDL 126

Query: 2693 EXXXXXXXXXXXXXKR-LEMTISKKEQEHVENKDKGNCDSVTKTKLKGNYSKSSRDKES- 2520
            E             +R L+  I  K++     KD+GN  S   TKLK + +K +  K + 
Sbjct: 127  ETKLKEDYVAKSNHRRTLKSEIRLKDRSPGNEKDRGNRGS--DTKLKADVTKETSSKPNV 184

Query: 2519 KNEKQRHHKSRTSNRLGADSEKESEKKQAK-VSAXXXXXXXXXXXXXXXXXXXXXSRTDD 2343
            K+EK    + R+ ++   DS  E+ KK ++ ++                      +R D+
Sbjct: 185  KSEKLIPSQGRSHDQSIVDSRDEATKKHSRDLTRKDRHADKSGEKSERESKRKHRNRDDE 244

Query: 2342 KHRSVVDDSIPKRHDSGKLRDAEYSERKDRKKEHSRTHTEEPKAKRRRSRSREYDWERDR 2163
            K+R   D S  K+ D GK  D + SERK+ KKE  ++     + KRRRSRSRE +    R
Sbjct: 245  KNR---DKSAAKKVDLGKGHDLKVSERKE-KKESPKSRHGNSRLKRRRSRSREREDRNRR 300

Query: 2162 SASMSPRAYGRTNH-GRDYDEPVFQXXXXXXXXXXXXXXKYRTSGNGGYSSGHYRKHG-- 1992
            S S+SPRA  RT++  R+++                     R + NG   SGHYR+H   
Sbjct: 301  SLSLSPRAQKRTSYYDREHEGLPSHSTKISSGRQHSDIDSSRVTSNG--LSGHYRRHDGS 358

Query: 1991 -SGLGGYSPRKRRTEAAVRTPSPVKRSPEKKSSTWDQPPAGIIHGASGSMFASLQSPSSK 1815
             SGLGGYSPRKRRTEAA +TPSP+ RSPEKKS+ WD  P    +    ++  S Q+ SS 
Sbjct: 359  TSGLGGYSPRKRRTEAAAKTPSPINRSPEKKSAKWDVAPVET-YSVPSNVHTSNQAASSN 417

Query: 1814 ILELTSSAAVTPTVSKPQPVLPLDATSTLISATIDSVQLTQATRPMRRLYIENLPSSASE 1635
              E+ SS  VT T  KP   + + A+   ++ ++DSVQLTQ+ RPMRRL +ENLP SASE
Sbjct: 418  AREMVSSDPVTSTTQKPLAGISVSASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASE 477

Query: 1634 KSVIDCLNDFFLSSGVNHIQGAKPCISCIINKEKCQAVVEFLTPEDATSAVSFDGRSLSG 1455
            K++++ LN+F LSSGV H+ G+ PCI C+I +EK QA VEFLT EDA++A+  DGRS SG
Sbjct: 478  KALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGRSFSG 537

Query: 1454 SVLKIRRPKDFVEAAIGVPEKPVPTVKAISEVVKDSPHKIFIGGISKALSSDMLLEVVGA 1275
            S+LKI+RPK+FVE A G  EK V +V ++S +VKDSPHKIFIGGIS+ LSS M++E+V A
Sbjct: 538  SILKIKRPKEFVEVASGEAEKSVASVDSVSGIVKDSPHKIFIGGISRTLSSKMVMEIVCA 597

Query: 1274 FGLLRAYHYEFNEELNGPCAFLEYEDHSITQKACAGLNGMKLGGHVLTAVQAFLDSNGEE 1095
            FG L+AYH+E NE+   PCAF+EY D  +T KA AGLNG+K+GG +LTAVQA LD +   
Sbjct: 598  FGPLKAYHFEVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGRLLTAVQAVLDGS-IM 656

Query: 1094 DDAENIPSYDIPSHARSLLTDSTQVLQLKNMLNPEEFMXXXXXXXXXXXEDVRLECTRFG 915
            D++ N P + IP HA  LL   T+VL+LKN+ NPE F            EDVRLEC RFG
Sbjct: 657  DNSGNPPFHGIPKHALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFG 716

Query: 914  TVKSINLVRFNSK-------IEG----------------ATESSEQRVPHADSAKNLSAV 804
            TVKS+N+V++           EG                 T    +R+      K++   
Sbjct: 717  TVKSVNVVKYGDSNISTIQACEGNENTASAGVGQNLTNDETNEKGERLEEVTDHKSIKNN 776

Query: 803  TSPISNCKNGDVDVHTSNNVREHQDVRDNTENTNSNIENLMED----YERKESAAPSTID 636
               I N     ++    NNV++++       +  S +  L  D    Y+ ++S +     
Sbjct: 777  ELEILNDSKEVMEAGEVNNVKDNRPASGTMGDEPSQLCELDTDMAVEYQARDSTSEIVSQ 836

Query: 635  GGGD--SSKLEEPSSHDKGADTNIEESICLE---SGLVSEISLQEVDVTGDKTTNLEARA 471
            G     ++  + P +HD     NI+     E   S    +++L+EV        N  + A
Sbjct: 837  GVPTQVNTLKDSPCAHDDKVTCNIQLEHMSEENKSSAKEDLNLEEV--------NGNSEA 888

Query: 470  SNGKSVQIPVETTNPNIFSADEGDKRECNNQTHDLEVFEPGSVLVEFLRKEAACAAAHSL 291
              G S ++ ++++     + + GD    N   +   +FEPG V VE++R EA+C AAHSL
Sbjct: 889  FTGASNEMGMQSS-----AVENGDNE--NQDPNQGHIFEPGCVFVEYMRAEASCMAAHSL 941

Query: 290  HRRCYGERIVSVSYVSHDLYITRFPR 213
            HRR + +RIV+V Y+  +LY  RF +
Sbjct: 942  HRRLFDDRIVAVEYIPLNLYRARFSK 967


>ref|XP_010091889.1| Splicing factor U2AF 50 kDa subunit [Morus notabilis]
            gi|587856766|gb|EXB46745.1| Splicing factor U2AF 50 kDa
            subunit [Morus notabilis]
          Length = 931

 Score =  542 bits (1396), Expect = e-151
 Identities = 380/956 (39%), Positives = 509/956 (53%), Gaps = 60/956 (6%)
 Frame = -1

Query: 2900 SKMSKTNN--SGEVAHKNSNEGTAARTRPISLEEVLSRRK-KKLNTEVKERASERR-DSS 2733
            S   K NN  S   +  N++EG+AARTRP S EE++ RRK K  + +VKE+  E +  SS
Sbjct: 4    SHRQKENNEKSRRPSLHNNDEGSAARTRPFSFEEIMLRRKGKSFSEDVKEKVVEEQIVSS 63

Query: 2732 EKDDNGVLSDRGTEXXXXXXXXXXXXXKR---------------LEMTIS-KKEQEHVEN 2601
            E     V    G+E             +R                + ++S K E+   + 
Sbjct: 64   ENVVKSVAHCLGSERVYRHYTNSLPVAERHVVEEEKRGSFRKEEKKTSVSDKSEESRAKR 123

Query: 2600 KDKGNCDSVTKTKL---KGNYSKSSRDKESKNEKQRHHKSRTSNRLGADSEKESEKKQAK 2430
            K++G     +K ++   + N    +  K  KN+K+ H +     R   + +KE+ K+ ++
Sbjct: 124  KERGTRRLESKVEVVFSRPNNETRNEIKGGKNDKKMHDRRENDKRSTDNIQKEAGKRHSR 183

Query: 2429 VSAXXXXXXXXXXXXXXXXXXXXXSRTDDKHRSVVDDSIPKRHDSGKLRDAEYSERKDRK 2250
             S                         DD+   + D +  K+ D+G+  + + S R +++
Sbjct: 184  DSRGKERHTKSSRGKSERESKRKYINGDDE--KIKDRNPAKKLDTGRHHETDNSAR-NKR 240

Query: 2249 KEHSRTHTEEPKAKRRRSRSREYDWERDRSASMSPRAYGRTNHGRDYDEPVFQXXXXXXX 2070
            KE S+   EEP+ K  RSRSR++D    RS S S + +   ++ R     V         
Sbjct: 241  KEPSQYRFEEPRPKTERSRSRDHDRRSRRSKSPSLKDHKTASYDRMTYREVASHSHKDKS 300

Query: 2069 XXXXXXXKYRTSGNGGYSSGHYRKHGSGLGGYSPRKRRTEAAVRTPSPVKRSPEKKSSTW 1890
                   + R S NG  SS    +  S LGGYSPRKR TEAA +TP P   S EKK + W
Sbjct: 301  RKPHHADRNRLSSNG--SSRRRDESPSALGGYSPRKRITEAAAKTPPPPDHSSEKKIAKW 358

Query: 1889 DQPPAGIIHGASGSMFASLQS----PSSKILELTSSAAVTPTVSKPQPVLPLDATSTLIS 1722
            D PPAG  +  S S+ ++ QS     S+ + EL S+A +  T  +    +P  A ST   
Sbjct: 359  DVPPAGTDNVLSASVPSNFQSSNNIESTGVQELASAAPIASTFPQLPSAVPSIAVSTRSF 418

Query: 1721 ATIDSVQLTQATRPMRRLYIENLPSSASEKSVIDCLNDFFLSSGVNHIQGAKPCISCIIN 1542
            A+ID+VQLTQATRPMRRLY+EN+PSS SEK++++  ND FLSS VNHIQG +PCISCIIN
Sbjct: 419  ASIDTVQLTQATRPMRRLYVENIPSSTSEKALVEWFNDLFLSSRVNHIQGTQPCISCIIN 478

Query: 1541 KEKCQAVVEFLTPEDATSAVSFDGRSLSGSVLKIRRPKDFVEAAIGVPEKPVPTVKAISE 1362
            KEK QA+VEFLTPEDA++A+SF+G S+SGSVLKIRRPKDFVE A G  EK    V  IS+
Sbjct: 479  KEKSQALVEFLTPEDASAALSFNGSSISGSVLKIRRPKDFVEVATGDLEKSTDAVDTISD 538

Query: 1361 VVKDSPHKIFIGGISKALSSDMLLEVVGAFGLLRAYHYEFNEELNGPCAFLEYEDHSITQ 1182
            VVKDSP+KIFIGGISKALSS ML+E+V AFG L+AYH+E N+ELN PCAFLEY D SI  
Sbjct: 539  VVKDSPNKIFIGGISKALSSKMLMEIVSAFGPLKAYHFEVNDELNDPCAFLEYVDQSIAP 598

Query: 1181 KACAGLNGMKLGGHVLTAVQAFLDSNGEEDDAENIPSYDIPSHARSLLTDSTQVLQLKNM 1002
            KACAGLNGMKLGG VLT +QA   +    + AE+   Y IP HA+ LL   TQVL+LKNM
Sbjct: 599  KACAGLNGMKLGGKVLTVIQAIRGAESLGNSAES-SLYKIPEHAKPLLKQPTQVLKLKNM 657

Query: 1001 LNPEEFMXXXXXXXXXXXEDVRLECTRFGTVKSINLVR-FNSKI--EGATESSEQRVPHA 831
             N   F            EDVRLEC RFG VKS+N+V+  NS+I   G  E +  R    
Sbjct: 658  FNLVGFSSLSEPEVEEVIEDVRLECVRFGNVKSVNVVKQSNSQITSSGICELN-NRAQTG 716

Query: 830  DSAKNL-----SAVTSPISNCKNGDVDVHTSNNVREHQDVRDNTENTNSNIENLMEDYER 666
            +   NL     +A T     C NG+                    +  + +E +  D E 
Sbjct: 717  EFGPNLGCEGNNAKTENFGGCTNGE-------------------PSGIAALEFVKNDQEL 757

Query: 665  KESAAPSTIDGGGDSSKLEEPSSHDKGADTN-----------IEESICLESGLVSEISLQ 519
            KE+  P   D G D+ +L+   + DK   T            I E +  +     E+S Q
Sbjct: 758  KENEVPK--DSGTDNRQLDNIIAEDKSCQTGQLTSDENEPNIIPEELPTQLNSPREVSEQ 815

Query: 518  EVDVTGDKT---TNLEARASNGKSVQIPVETTNPNIFSADEGD---KRECNNQTHD---- 369
              D  G  T   T+   +   G+      +T +    + +E D   + E N++  D    
Sbjct: 816  LDDKVGSATPTDTHGMEKKITGEDNSTRGDTDSKKQGTVEEFDGFMETESNDKVMDDSKE 875

Query: 368  ----LEVFEPGSVLVEFLRKEAACAAAHSLHRRCYGERIVSVSYVSHDLYITRFPR 213
                  +FE G VLVEF R EAAC AAH LH R + +RIVSV YV+ D Y TRFP+
Sbjct: 876  QFDLGSIFEVGCVLVEFGRTEAACTAAHCLHGRLFDDRIVSVEYVALDHYKTRFPK 931


>ref|XP_006489672.1| PREDICTED: splicing factor U2af large subunit B-like isoform X2
            [Citrus sinensis]
          Length = 965

 Score =  540 bits (1391), Expect = e-150
 Identities = 372/986 (37%), Positives = 525/986 (53%), Gaps = 91/986 (9%)
 Frame = -1

Query: 2897 KMSKTNNSGEVAHKNSNEGTAARTRPISLEEVLSRRK-KKLNTEVKERASE--------- 2748
            K  K    GE++  +SNEG+AARTRP S +E++ RRK K L+  +KE A +         
Sbjct: 7    KKEKHGKRGELSLADSNEGSAARTRPFSFDEIMIRRKTKNLSENIKEDAMDVSNIPGDCT 66

Query: 2747 -------------------------------RRDSSEKDDNGVLSD-----------RGT 2694
                                           +  S +K+DN +  +           R  
Sbjct: 67   IENVSDDHKSGRHHGHKKSPGVDVHTSEEYVKVGSGKKEDNALTKNVDSSRQRDGENRDL 126

Query: 2693 EXXXXXXXXXXXXXKR-LEMTISKKEQEHVENKDKGNCDSVTKTKLKGNYSKSSRDKES- 2520
            E             +R L+  I  K++     KD+GN  S   TKLK + +K +  K + 
Sbjct: 127  ETKLKEDYVAKSNHRRTLKSEIRLKDRSPGNEKDRGNRGS--DTKLKADITKETSSKPNV 184

Query: 2519 KNEKQRHHKSRTSNRLGADSEKESEKKQAK-VSAXXXXXXXXXXXXXXXXXXXXXSRTDD 2343
            K+EK    + R+ N+   DS  E+ KK ++ ++                      +R D+
Sbjct: 185  KSEKLIPSQGRSHNQSIVDSRDEATKKHSRDLTGKDRHADKSGEKSERESKRKHRNRDDE 244

Query: 2342 KHRSVVDDSIPKRHDSGKLRDAEYSERKDRKKEHSRTHTEEPKAKRRRSRSREYDWERDR 2163
            K+R   D S  K+ D GK  D + SERK+ KKE  ++     + KRRRSRSRE +    R
Sbjct: 245  KNR---DKSAAKKVDLGKGHDLKVSERKE-KKESPKSRHGNSRLKRRRSRSREREDRNRR 300

Query: 2162 SASMSPRAYGRTNH-GRDYDEPVFQXXXXXXXXXXXXXXKYRTSGNGGYSSGHYRKHG-- 1992
            S S+SPRA  RT++  R+++                     R +GNG   SGHYR+H   
Sbjct: 301  SLSLSPRAQKRTSYYDREHEGLPSHSTKISSGRQHSDIDSSRVTGNG--LSGHYRRHDGS 358

Query: 1991 -SGLGGYSPRKRRTEAAVRTPSPVKRSPEKKSSTWDQPPAGIIHGASGSMFASLQSPSSK 1815
             SGLGGYSPRKRRTEAA +TPSP+ RSPEKKS+ WD  P    +    ++  S Q+ SS 
Sbjct: 359  TSGLGGYSPRKRRTEAAAKTPSPINRSPEKKSAKWDVAPVET-YSVPSNVHTSNQAASSN 417

Query: 1814 ILELTSSAAVTPTVSKPQPVLPLDATSTLISATIDSVQLTQATRPMRRLYIENLPSSASE 1635
              E+ SS  VT T  KP   + + A+   ++ ++DSVQLTQ+ RPMRRL +ENLP SASE
Sbjct: 418  AHEMVSSDPVTSTTQKPLAGISVSASLAKLNVSMDSVQLTQSNRPMRRLCVENLPVSASE 477

Query: 1634 KSVIDCLNDFFLSSGVNHIQGAKPCISCIINKEKCQAVVEFLTPEDATSAVSFDGRSLSG 1455
            K++++ LN+F LSSGV H+ G+ PCI C+  +EK QA VEFLT EDA++A+  DG S SG
Sbjct: 478  KALMEFLNNFLLSSGVQHVHGSLPCIGCV--REKGQAFVEFLTAEDASAALCCDGCSFSG 535

Query: 1454 SVLKIRRPKDFVEAAIGVPEKPVPTVKAISEVVKDSPHKIFIGGISKALSSDMLLEVVGA 1275
            S+LKI+RPK+FVE A G  EK V +V ++S +VKDSPHKIFIGGIS+ LSS M++E+V A
Sbjct: 536  SILKIKRPKEFVEVASGEAEKSVASVDSVSGIVKDSPHKIFIGGISRTLSSKMVMEIVCA 595

Query: 1274 FGLLRAYHYEFNEELNGPCAFLEYEDHSITQKACAGLNGMKLGGHVLTAVQAFLDSNGEE 1095
            FG L+AYH+E NE+   PCAF+EY D  +T KA AGLNG+K+GG VLTAVQA LD +   
Sbjct: 596  FGPLKAYHFEVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGS-IM 654

Query: 1094 DDAENIPSYDIPSHARSLLTDSTQVLQLKNMLNPEEFMXXXXXXXXXXXEDVRLECTRFG 915
            D++ N P + IP HA  LL   T+VL+LKN+ NPE F            EDVRLEC RFG
Sbjct: 655  DNSGNPPFHGIPKHALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFG 714

Query: 914  TVKSINLVRFNSK-------IEG----------------ATESSEQRVPHADSAKNLSAV 804
            TVKS+N+V++           EG                 T   ++R+      K++   
Sbjct: 715  TVKSVNVVKYGDSNIFTIQACEGNENTASAGVGQNLTNDETNEKQERLEEVTDHKSIKNN 774

Query: 803  TSPISNCKNGDVDVHTSNNVREHQDVRDNTENTNSNIENLMEDYERKESAAPST--IDGG 630
               I N     ++    NNV++++    +  +  S +  L  D   +  A  ST  I   
Sbjct: 775  ELEILNDSKEVMEAGEVNNVKDNRPASGSMGDEPSQLCELDTDMAVEYQAHDSTSEIVSQ 834

Query: 629  GDSSKL----EEPSSHDKGADTNIEESICLE---SGLVSEISLQEVDVTGDKTTNLEARA 471
            G  +++    +EP +HD     NI+     E   S    +++L+EV        N  + A
Sbjct: 835  GVPTQVNTLKDEPCAHDDKVTCNIQLEHMGEENKSSAKEDLNLEEV--------NGNSEA 886

Query: 470  SNGKSVQIPVETTNPNIFSADEGDKRECNNQTHDLEVFEPGSVLVEFLRKEAACAAAHSL 291
              G S ++ ++++     + + GD    N   +   +FEPG V VE+ R EA+C AAHSL
Sbjct: 887  FTGASNEMGMQSS-----AVENGDNE--NQDPNQGHIFEPGCVFVEYRRAEASCMAAHSL 939

Query: 290  HRRCYGERIVSVSYVSHDLYITRFPR 213
            HRR + +RIV+V Y+  +LY  RF +
Sbjct: 940  HRRLFDDRIVAVEYIPLNLYRARFSK 965


Top