BLASTX nr result

ID: Cornus23_contig00003828 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cornus23_contig00003828
         (2641 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_010278733.1| PREDICTED: splicing factor U2AF-associated p...   660   0.0  
ref|XP_010278734.1| PREDICTED: splicing factor U2AF-associated p...   660   0.0  
ref|XP_010038454.1| PREDICTED: splicing factor U2AF-associated p...   648   0.0  
ref|XP_003635108.2| PREDICTED: HIV Tat-specific factor 1 [Vitis ...   647   0.0  
ref|XP_011077113.1| PREDICTED: splicing factor U2AF-associated p...   645   0.0  
ref|XP_003536163.1| PREDICTED: HIV Tat-specific factor 1 homolog...   645   0.0  
ref|XP_011077112.1| PREDICTED: splicing factor U2AF-associated p...   644   0.0  
ref|XP_011077114.1| PREDICTED: splicing factor U2AF-associated p...   643   0.0  
ref|XP_014510975.1| PREDICTED: splicing factor U2AF-associated p...   640   e-180
gb|KOM29717.1| hypothetical protein LR48_Vigan747s001900 [Vigna ...   637   e-179
ref|XP_002316170.1| hypothetical protein POPTR_0010s18610g [Popu...   635   e-179
ref|XP_003556435.1| PREDICTED: HIV Tat-specific factor 1 homolog...   635   e-179
ref|XP_007143970.1| hypothetical protein PHAVU_007G117900g [Phas...   625   e-176
gb|KCW84610.1| hypothetical protein EUGRSUZ_B01442 [Eucalyptus g...   623   e-175
emb|CDP04154.1| unnamed protein product [Coffea canephora]            621   e-174
ref|XP_012470592.1| PREDICTED: HIV Tat-specific factor 1 isoform...   620   e-174
ref|XP_012470590.1| PREDICTED: HIV Tat-specific factor 1 isoform...   619   e-174
ref|XP_012470595.1| PREDICTED: HIV Tat-specific factor 1 isoform...   619   e-174
ref|XP_012470593.1| PREDICTED: HIV Tat-specific factor 1 isoform...   618   e-174
ref|XP_012470596.1| PREDICTED: HIV Tat-specific factor 1 isoform...   614   e-172

>ref|XP_010278733.1| PREDICTED: splicing factor U2AF-associated protein 2 isoform X1
            [Nelumbo nucifera]
          Length = 499

 Score =  660 bits (1704), Expect = 0.0
 Identities = 322/477 (67%), Positives = 386/477 (80%), Gaps = 19/477 (3%)
 Frame = -2

Query: 2568 AENGFDQNSETSTEVGWYVLGGDQQHIGPYAFSELHEHFTNGFLSESTLVWSEGRSDWQP 2389
            +ENG D +SE   EVGWY+LG +Q+H+GPYA SEL EHF NG+LSE+TL+WSEGRSDW P
Sbjct: 17   SENGGDADSENVPEVGWYILGENQEHVGPYAISELQEHFLNGYLSENTLLWSEGRSDWMP 76

Query: 2388 LSSIFGLMTEVSQQVP-----TNKDDEFEKWQKEVREAEAEALKHEAVNSN--------- 2251
            LS I  L T +SQQ P     ++ DDEF KWQKEV+EAEAEA   +A  ++         
Sbjct: 77   LSLIPELFTSISQQGPDPTVTSDNDDEFLKWQKEVKEAEAEAEALKACGTSGHVGDADHL 136

Query: 2250 -----DDNERPSXXXXXXXXXXXXXXXTYKWDRGLRAWVPQDNISTETEQYGLEEMTFLE 2086
                 D ++RP                TYKWDRGLRAWVPQDN  +  ++YG+E+M +L+
Sbjct: 137  NDMVGDADDRPQTPPDGEEEFTDDDGTTYKWDRGLRAWVPQDNSFSRGKEYGVEDMIYLQ 196

Query: 2085 EQELFSTVNAVDTSVKEDVIDTSEVMEGEAKHNDKRKLPDKLAEKKEANKPPDSWFELKV 1906
            E+E+F+T    + S KE+   T+EV++  AK + KRKLPDK  EKK+ANKPPDSWF+LKV
Sbjct: 197  EEEVFATPKVAEPSKKEEASGTTEVVD--AKPDVKRKLPDKQTEKKQANKPPDSWFDLKV 254

Query: 1905 NTHIYVTGLPDDVTFDEVVEAFSKCGIIKEDPETKKPRVKIYVDKETGRKKGDALISYMK 1726
            NTH+Y+TGLPDDVT +E+VE FSKCG+IKEDPET+KPRVKIYVDKETGRKKGDAL+SY+K
Sbjct: 255  NTHVYITGLPDDVTAEEIVEVFSKCGVIKEDPETRKPRVKIYVDKETGRKKGDALVSYLK 314

Query: 1725 EPSVALAIQILDGAPLRPGGKIPMSVTQAKFEQKGDKFISKQIDKRKKRKLQKVEQKMLG 1546
            EPSV LAIQILDG PLRPGGK+PMSVTQAKFEQKGDKFI+KQ+DK+KK+KL+K E+K+LG
Sbjct: 315  EPSVVLAIQILDGTPLRPGGKVPMSVTQAKFEQKGDKFIAKQVDKKKKKKLKKAEEKILG 374

Query: 1545 WGGRDDAKLLIPATVILRYMFAPAEMRSDENLRSELEADVQEECVKLGPVESVKVCENHP 1366
            WGGRDDAKL IPATV+LR+MF PAEMRSD +LRSELEADV+EECVKLGPV+ ++VCENHP
Sbjct: 375  WGGRDDAKLSIPATVVLRHMFTPAEMRSDADLRSELEADVKEECVKLGPVDLIRVCENHP 434

Query: 1365 QGVVLVRFKDRKDAQKCIDLMNGRWFGGRQIHASEDDGAVNHAKVRDLQDDAERLEK 1195
            QGVVLV+FKDRKDAQKCI+LMNGRWFGGRQIHASEDDG+VNHA VRDL +DA RLE+
Sbjct: 435  QGVVLVKFKDRKDAQKCIELMNGRWFGGRQIHASEDDGSVNHALVRDLDEDAARLEQ 491


>ref|XP_010278734.1| PREDICTED: splicing factor U2AF-associated protein 2 isoform X2
            [Nelumbo nucifera]
          Length = 497

 Score =  660 bits (1703), Expect = 0.0
 Identities = 322/476 (67%), Positives = 385/476 (80%), Gaps = 19/476 (3%)
 Frame = -2

Query: 2565 ENGFDQNSETSTEVGWYVLGGDQQHIGPYAFSELHEHFTNGFLSESTLVWSEGRSDWQPL 2386
            ENG D +SE   EVGWY+LG +Q+H+GPYA SEL EHF NG+LSE+TL+WSEGRSDW PL
Sbjct: 16   ENGGDADSENVPEVGWYILGENQEHVGPYAISELQEHFLNGYLSENTLLWSEGRSDWMPL 75

Query: 2385 SSIFGLMTEVSQQVP-----TNKDDEFEKWQKEVREAEAEALKHEAVNSN---------- 2251
            S I  L T +SQQ P     ++ DDEF KWQKEV+EAEAEA   +A  ++          
Sbjct: 76   SLIPELFTSISQQGPDPTVTSDNDDEFLKWQKEVKEAEAEAEALKACGTSGHVGDADHLN 135

Query: 2250 ----DDNERPSXXXXXXXXXXXXXXXTYKWDRGLRAWVPQDNISTETEQYGLEEMTFLEE 2083
                D ++RP                TYKWDRGLRAWVPQDN  +  ++YG+E+M +L+E
Sbjct: 136  DMVGDADDRPQTPPDGEEEFTDDDGTTYKWDRGLRAWVPQDNSFSRGKEYGVEDMIYLQE 195

Query: 2082 QELFSTVNAVDTSVKEDVIDTSEVMEGEAKHNDKRKLPDKLAEKKEANKPPDSWFELKVN 1903
            +E+F+T    + S KE+   T+EV++  AK + KRKLPDK  EKK+ANKPPDSWF+LKVN
Sbjct: 196  EEVFATPKVAEPSKKEEASGTTEVVD--AKPDVKRKLPDKQTEKKQANKPPDSWFDLKVN 253

Query: 1902 THIYVTGLPDDVTFDEVVEAFSKCGIIKEDPETKKPRVKIYVDKETGRKKGDALISYMKE 1723
            TH+Y+TGLPDDVT +E+VE FSKCG+IKEDPET+KPRVKIYVDKETGRKKGDAL+SY+KE
Sbjct: 254  THVYITGLPDDVTAEEIVEVFSKCGVIKEDPETRKPRVKIYVDKETGRKKGDALVSYLKE 313

Query: 1722 PSVALAIQILDGAPLRPGGKIPMSVTQAKFEQKGDKFISKQIDKRKKRKLQKVEQKMLGW 1543
            PSV LAIQILDG PLRPGGK+PMSVTQAKFEQKGDKFI+KQ+DK+KK+KL+K E+K+LGW
Sbjct: 314  PSVVLAIQILDGTPLRPGGKVPMSVTQAKFEQKGDKFIAKQVDKKKKKKLKKAEEKILGW 373

Query: 1542 GGRDDAKLLIPATVILRYMFAPAEMRSDENLRSELEADVQEECVKLGPVESVKVCENHPQ 1363
            GGRDDAKL IPATV+LR+MF PAEMRSD +LRSELEADV+EECVKLGPV+ ++VCENHPQ
Sbjct: 374  GGRDDAKLSIPATVVLRHMFTPAEMRSDADLRSELEADVKEECVKLGPVDLIRVCENHPQ 433

Query: 1362 GVVLVRFKDRKDAQKCIDLMNGRWFGGRQIHASEDDGAVNHAKVRDLQDDAERLEK 1195
            GVVLV+FKDRKDAQKCI+LMNGRWFGGRQIHASEDDG+VNHA VRDL +DA RLE+
Sbjct: 434  GVVLVKFKDRKDAQKCIELMNGRWFGGRQIHASEDDGSVNHALVRDLDEDAARLEQ 489


>ref|XP_010038454.1| PREDICTED: splicing factor U2AF-associated protein 2 isoform X1
            [Eucalyptus grandis]
          Length = 474

 Score =  648 bits (1672), Expect = 0.0
 Identities = 319/466 (68%), Positives = 381/466 (81%), Gaps = 19/466 (4%)
 Frame = -2

Query: 2535 STEVGWYVLGGDQQHIGPYAFSELHEHFTNGFLSESTLVWSEGRSDWQPLSSIFGLMTEV 2356
            ++E GWY+LG +QQ++GPYA +EL EH  NG+LSESTLVW+EGR+DWQPLSS+  LM  +
Sbjct: 2    ASEAGWYILGDNQQNVGPYAAAELLEHLKNGYLSESTLVWAEGRADWQPLSSVPELMLPL 61

Query: 2355 SQ-----QVPT--NKDDEFEKWQKEVREAEAEALKHEAVNSNDDNERPSXXXXXXXXXXX 2197
            S      Q P   N ++EFEKWQ+EVRE+EA  L + + ++ DD  RPS           
Sbjct: 62   SDNGDGSQNPAVLNSEEEFEKWQREVRESEAVGLNNGSQSAEDDLIRPSTPPEGEEEFVD 121

Query: 2196 XXXXTYKWDRGLRAWVPQDNISTETEQYGLEEMTFLEEQELFSTVNAVDTSVKEDVIDTS 2017
                 YKWDRGLRAW PQDNIS  +++YGLEEMTFLEE+E+F + N  D   KE+V + +
Sbjct: 122  DDGTRYKWDRGLRAWAPQDNISANSDRYGLEEMTFLEEEEVFPSGNW-DEPTKEEVNEPA 180

Query: 2016 EVMEG-------EAKHNDKRKLPDKLAEKKEA-----NKPPDSWFELKVNTHIYVTGLPD 1873
            ++ E        EAK N KRK P+K A +KEA     NKPPDSWF+LKVNTH+YVTGLP+
Sbjct: 181  DIAEAKTVSDSEEAKPNAKRKQPEKEASEKEASKKEPNKPPDSWFDLKVNTHVYVTGLPE 240

Query: 1872 DVTFDEVVEAFSKCGIIKEDPETKKPRVKIYVDKETGRKKGDALISYMKEPSVALAIQIL 1693
            DVT +EVVE FSKCGI+KEDPETKKPRVKIYVDKETGRKKGDAL++Y+KEPSVALAIQIL
Sbjct: 241  DVTMEEVVEVFSKCGILKEDPETKKPRVKIYVDKETGRKKGDALVTYLKEPSVALAIQIL 300

Query: 1692 DGAPLRPGGKIPMSVTQAKFEQKGDKFISKQIDKRKKRKLQKVEQKMLGWGGRDDAKLLI 1513
            DGAP RPGGK+PMSV+QAKFEQKGDKFISKQ+D +KK+KL+KVE+KMLGWGGRDDAK+L+
Sbjct: 301  DGAPFRPGGKVPMSVSQAKFEQKGDKFISKQVDGKKKKKLKKVEEKMLGWGGRDDAKVLV 360

Query: 1512 PATVILRYMFAPAEMRSDENLRSELEADVQEECVKLGPVESVKVCENHPQGVVLVRFKDR 1333
            P TV+LRYMFAPAEMR+D+NLR ELE D++EECVKLGPV+SVKVCENHPQGVVLV+FKDR
Sbjct: 361  PTTVVLRYMFAPAEMRADDNLRPELEEDIREECVKLGPVDSVKVCENHPQGVVLVKFKDR 420

Query: 1332 KDAQKCIDLMNGRWFGGRQIHASEDDGAVNHAKVRDLQDDAERLEK 1195
            KDAQKCI+LMNGRWFGGRQIHASEDDG+VNHA VRDL++DA RLE+
Sbjct: 421  KDAQKCIELMNGRWFGGRQIHASEDDGSVNHALVRDLEEDAARLEQ 466


>ref|XP_003635108.2| PREDICTED: HIV Tat-specific factor 1 [Vitis vinifera]
          Length = 488

 Score =  647 bits (1670), Expect = 0.0
 Identities = 321/459 (69%), Positives = 377/459 (82%), Gaps = 15/459 (3%)
 Frame = -2

Query: 2529 EVGWYVLGGDQQHIGPYAFSELHEHFTNGFLSESTLVWSEGRSDWQPLSSIFGLMTEVSQ 2350
            EVGWY+LG +QQ++GPYAFSEL EHF NG+LSE++L+WSEGRSDWQPLSSI  L T +SQ
Sbjct: 25   EVGWYILGENQQNLGPYAFSELREHFLNGYLSENSLLWSEGRSDWQPLSSIPELTTAISQ 84

Query: 2349 Q--------VPTNKDDEFEKWQKEVREAEAEALKHEAVNSN-------DDNERPSXXXXX 2215
                      P N +DEFEKWQKEVREAEA  LK+ + + +       +DNERPS     
Sbjct: 85   PGVDCSSAGPPINDEDEFEKWQKEVREAEA--LKNGSASGSVGGDFGDEDNERPSTPPDG 142

Query: 2214 XXXXXXXXXXTYKWDRGLRAWVPQDNISTETEQYGLEEMTFLEEQELFSTVNAVDTSVKE 2035
                      TYKWDRGLRAWVPQDN ST +++Y  EEMTF  E+E+F T+   + SVKE
Sbjct: 143  EDEFTDDDGTTYKWDRGLRAWVPQDNPSTRSDEYKPEEMTFSVEEEIFPTIQVAEDSVKE 202

Query: 2034 DVIDTSEVMEGEAKHNDKRKLPDKLAEKKEANKPPDSWFELKVNTHIYVTGLPDDVTFDE 1855
              ++ ++V+E E KH+ KRKLP++ AEKKEANKPPDSWF+LKVNTH+YVTGLPDDVT DE
Sbjct: 203  --VNGTDVVE-ETKHDAKRKLPEQQAEKKEANKPPDSWFDLKVNTHVYVTGLPDDVTVDE 259

Query: 1854 VVEAFSKCGIIKEDPETKKPRVKIYVDKETGRKKGDALISYMKEPSVALAIQILDGAPLR 1675
            VVE FSKCG+IKEDPET++PRVK+Y+DK TGRKKGDAL+SY+KEPSVALAIQILDG PLR
Sbjct: 260  VVEVFSKCGLIKEDPETRRPRVKLYIDKNTGRKKGDALVSYLKEPSVALAIQILDGTPLR 319

Query: 1674 PGGKIPMSVTQAKFEQKGDKFISKQIDKRKKRKLQKVEQKMLGWGGRDDAKLLIPATVIL 1495
            P G IPMSVT AKFEQKG+KF++KQIDKRKK+KL++VE K+LGWGG DDAKL IPATV+L
Sbjct: 320  PVGTIPMSVTLAKFEQKGEKFVAKQIDKRKKKKLKRVEDKILGWGGHDDAKLSIPATVVL 379

Query: 1494 RYMFAPAEMRSDENLRSELEADVQEECVKLGPVESVKVCENHPQGVVLVRFKDRKDAQKC 1315
            RYMF PAEMR+D NLRSELE DVQEEC+KLG V+ VKVCE+HPQGVVLV++KDR+DAQKC
Sbjct: 380  RYMFTPAEMRADPNLRSELEGDVQEECIKLGSVDLVKVCESHPQGVVLVKYKDRRDAQKC 439

Query: 1314 IDLMNGRWFGGRQIHASEDDGAVNHAKVRDLQDDAERLE 1198
            I+LMNGRWFGGRQIHASEDDG+VNHA VRDL  DAERLE
Sbjct: 440  IELMNGRWFGGRQIHASEDDGSVNHALVRDLDADAERLE 478


>ref|XP_011077113.1| PREDICTED: splicing factor U2AF-associated protein 2 isoform X2
            [Sesamum indicum]
          Length = 469

 Score =  645 bits (1664), Expect = 0.0
 Identities = 320/462 (69%), Positives = 377/462 (81%), Gaps = 10/462 (2%)
 Frame = -2

Query: 2550 QNSETSTEVGWYVLGGDQQHIGPYAFSELHEHFTNGFLSESTLVWSEGRSDWQPLSSIFG 2371
            Q  E  T  GWY+LG DQQ IGPY  SEL EH+++G+ S+STLVWSEG SDWQPLSS+ G
Sbjct: 6    QVPEMVTGAGWYILGQDQQLIGPYTVSELQEHYSSGYFSQSTLVWSEGYSDWQPLSSVPG 65

Query: 2370 LMTEVSQQ-----VPTNKDDEFEKWQKEVREAEAEALKHEAVNSNDDNERPSXXXXXXXX 2206
            L+T+   Q     V +N++DEFEKWQ+EVREAEAEA     VN NDD +RP+        
Sbjct: 66   LLTDAPPQNALVPVTSNEEDEFEKWQREVREAEAEA----EVNKNDDQDRPTTPPEGEEE 121

Query: 2205 XXXXXXXTYKWDRGLRAWVPQDNISTETEQYGLEEMTFLEEQELFSTVNAVDTSVKEDVI 2026
                   TYKWDR LRAWVPQ+N +  TE Y  E+MTF++E+E+F T++A    VKE+  
Sbjct: 122  FTDDDGTTYKWDRTLRAWVPQENTTQNTEDYHPEDMTFVQEEEVFPTLDADHLPVKEEDS 181

Query: 2025 DTSEVMEGEAKHNDKRKLPDKLAEKK-----EANKPPDSWFELKVNTHIYVTGLPDDVTF 1861
              +EV+E   K N KRKLP+K ++KK     EANKPPD+WFELKVNTH+YVTGLPDDVT 
Sbjct: 182  AANEVVE--EKQNGKRKLPEKTSDKKNVDKKEANKPPDAWFELKVNTHVYVTGLPDDVTT 239

Query: 1860 DEVVEAFSKCGIIKEDPETKKPRVKIYVDKETGRKKGDALISYMKEPSVALAIQILDGAP 1681
            +EVVE FSKCGIIKEDPETKKPRVKIYVDKETGRKKGDAL+SY+KEPSVALAIQILDGAP
Sbjct: 240  EEVVEVFSKCGIIKEDPETKKPRVKIYVDKETGRKKGDALVSYLKEPSVALAIQILDGAP 299

Query: 1680 LRPGGKIPMSVTQAKFEQKGDKFISKQIDKRKKRKLQKVEQKMLGWGGRDDAKLLIPATV 1501
            LRP GKIPM+VT+AKFEQKGD+FISKQ+DK KKRKLQKVEQKMLGWGGRDDAK+ IPATV
Sbjct: 300  LRPDGKIPMTVTKAKFEQKGDRFISKQVDKNKKRKLQKVEQKMLGWGGRDDAKVSIPATV 359

Query: 1500 ILRYMFAPAEMRSDENLRSELEADVQEECVKLGPVESVKVCENHPQGVVLVRFKDRKDAQ 1321
            ILRYMF PAE+R++E+LRSELE DV++EC KLGP++SVKVCENHPQGV+LV+FKD KDA 
Sbjct: 360  ILRYMFTPAELRAEEDLRSELEEDVRDECGKLGPLDSVKVCENHPQGVILVKFKDSKDAH 419

Query: 1320 KCIDLMNGRWFGGRQIHASEDDGAVNHAKVRDLQDDAERLEK 1195
            KCI+LMNGRWFGG+QIHAS DDG+VNHA VRDL+++ +RLEK
Sbjct: 420  KCIELMNGRWFGGKQIHASIDDGSVNHALVRDLEEETDRLEK 461


>ref|XP_003536163.1| PREDICTED: HIV Tat-specific factor 1 homolog [Glycine max]
            gi|734358277|gb|KHN14770.1| HIV Tat-specific factor 1
            like [Glycine soja]
          Length = 503

 Score =  645 bits (1664), Expect = 0.0
 Identities = 320/478 (66%), Positives = 375/478 (78%), Gaps = 28/478 (5%)
 Frame = -2

Query: 2544 SETSTEVGWYVLGGDQQHIGPYAFSELHEHFTNGFLSESTLVWSEGRSDWQPLSSIFGLM 2365
            +E  TEVGWYVLG DQQ IGPYAFSEL EHF NG+LSE+T VWSEGRS+WQPLSS+  L 
Sbjct: 18   AEKITEVGWYVLGEDQQQIGPYAFSELREHFLNGYLSENTFVWSEGRSEWQPLSSVSDLW 77

Query: 2364 TEVSQQVPTNKD-------DEFEKWQKEVREAEAEALKHE---------AVNSNDDNERP 2233
             +++QQ P +         DEFE+WQKE++EAEA+    E         +  + +D+ERP
Sbjct: 78   AQINQQGPDSSTTVSAPDVDEFERWQKEIQEAEAQVEGSEFGSLSGNAGSTGAGEDSERP 137

Query: 2232 SXXXXXXXXXXXXXXXTYKWDRGLRAWVPQDNISTETEQYGLEEMTFLEEQELFSTVNAV 2053
            S                YKWDR LRAWVPQ++ +  TE YG++EMTFLEE+E+F T+   
Sbjct: 138  STPPEGEEEFTDDDGTVYKWDRNLRAWVPQEHPTGSTEPYGVQEMTFLEEEEVFPTIPIS 197

Query: 2052 DTSVK-EDVI-----------DTSEVMEGEAKHNDKRKLPDKLAEKKEANKPPDSWFELK 1909
            D S K ED             +T+E         +KRKL D+  +KKEANKPPDSWFELK
Sbjct: 198  DASEKFEDSPKLSVSVPPLKEETNEANNTNVVSGEKRKLSDQQTDKKEANKPPDSWFELK 257

Query: 1908 VNTHIYVTGLPDDVTFDEVVEAFSKCGIIKEDPETKKPRVKIYVDKETGRKKGDALISYM 1729
            +NTH+YVTGLP+DVT DE+VE FSKCGIIKEDPETKKPRVK+YVDK TGRKKGDAL++Y+
Sbjct: 258  INTHVYVTGLPEDVTTDEIVEVFSKCGIIKEDPETKKPRVKLYVDKGTGRKKGDALVTYL 317

Query: 1728 KEPSVALAIQILDGAPLRPGGKIPMSVTQAKFEQKGDKFISKQIDKRKKRKLQKVEQKML 1549
            KEPSVALAIQILDGAPLRP GKIPMSV+QAKFEQKGDKF+SKQ+D +KK+KL+KVE KML
Sbjct: 318  KEPSVALAIQILDGAPLRPNGKIPMSVSQAKFEQKGDKFVSKQVDNKKKKKLKKVEDKML 377

Query: 1548 GWGGRDDAKLLIPATVILRYMFAPAEMRSDENLRSELEADVQEECVKLGPVESVKVCENH 1369
            GWGGRDDAK+ IPATVILRYMFAPAEMR+DENLR ELE DV+EEC KLGP++SVK+CENH
Sbjct: 378  GWGGRDDAKVSIPATVILRYMFAPAEMRADENLRLELEEDVKEECTKLGPLDSVKICENH 437

Query: 1368 PQGVVLVRFKDRKDAQKCIDLMNGRWFGGRQIHASEDDGAVNHAKVRDLQDDAERLEK 1195
            PQGVVLVRFKDRKDAQKCI+LMNGRWFGGRQIHASEDDG+VNHA VRDL++DA RLE+
Sbjct: 438  PQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGSVNHALVRDLEEDAIRLEQ 495


>ref|XP_011077112.1| PREDICTED: splicing factor U2AF-associated protein 2 isoform X1
            [Sesamum indicum]
          Length = 471

 Score =  644 bits (1662), Expect = 0.0
 Identities = 320/464 (68%), Positives = 377/464 (81%), Gaps = 12/464 (2%)
 Frame = -2

Query: 2550 QNSETSTEVGWYVLGGDQQHIGPYAFSELHEHFTNGFLSESTLVWSEGRSDWQPLSSIFG 2371
            Q  E  T  GWY+LG DQQ IGPY  SEL EH+++G+ S+STLVWSEG SDWQPLSS+ G
Sbjct: 6    QVPEMVTGAGWYILGQDQQLIGPYTVSELQEHYSSGYFSQSTLVWSEGYSDWQPLSSVPG 65

Query: 2370 LMTEVSQQ-------VPTNKDDEFEKWQKEVREAEAEALKHEAVNSNDDNERPSXXXXXX 2212
            L+T+   Q       V +N++DEFEKWQ+EVREAEAEA     VN NDD +RP+      
Sbjct: 66   LLTDAPPQNALGSVPVTSNEEDEFEKWQREVREAEAEA----EVNKNDDQDRPTTPPEGE 121

Query: 2211 XXXXXXXXXTYKWDRGLRAWVPQDNISTETEQYGLEEMTFLEEQELFSTVNAVDTSVKED 2032
                     TYKWDR LRAWVPQ+N +  TE Y  E+MTF++E+E+F T++A    VKE+
Sbjct: 122  EEFTDDDGTTYKWDRTLRAWVPQENTTQNTEDYHPEDMTFVQEEEVFPTLDADHLPVKEE 181

Query: 2031 VIDTSEVMEGEAKHNDKRKLPDKLAEKK-----EANKPPDSWFELKVNTHIYVTGLPDDV 1867
                +EV+E   K N KRKLP+K ++KK     EANKPPD+WFELKVNTH+YVTGLPDDV
Sbjct: 182  DSAANEVVE--EKQNGKRKLPEKTSDKKNVDKKEANKPPDAWFELKVNTHVYVTGLPDDV 239

Query: 1866 TFDEVVEAFSKCGIIKEDPETKKPRVKIYVDKETGRKKGDALISYMKEPSVALAIQILDG 1687
            T +EVVE FSKCGIIKEDPETKKPRVKIYVDKETGRKKGDAL+SY+KEPSVALAIQILDG
Sbjct: 240  TTEEVVEVFSKCGIIKEDPETKKPRVKIYVDKETGRKKGDALVSYLKEPSVALAIQILDG 299

Query: 1686 APLRPGGKIPMSVTQAKFEQKGDKFISKQIDKRKKRKLQKVEQKMLGWGGRDDAKLLIPA 1507
            APLRP GKIPM+VT+AKFEQKGD+FISKQ+DK KKRKLQKVEQKMLGWGGRDDAK+ IPA
Sbjct: 300  APLRPDGKIPMTVTKAKFEQKGDRFISKQVDKNKKRKLQKVEQKMLGWGGRDDAKVSIPA 359

Query: 1506 TVILRYMFAPAEMRSDENLRSELEADVQEECVKLGPVESVKVCENHPQGVVLVRFKDRKD 1327
            TVILRYMF PAE+R++E+LRSELE DV++EC KLGP++SVKVCENHPQGV+LV+FKD KD
Sbjct: 360  TVILRYMFTPAELRAEEDLRSELEEDVRDECGKLGPLDSVKVCENHPQGVILVKFKDSKD 419

Query: 1326 AQKCIDLMNGRWFGGRQIHASEDDGAVNHAKVRDLQDDAERLEK 1195
            A KCI+LMNGRWFGG+QIHAS DDG+VNHA VRDL+++ +RLEK
Sbjct: 420  AHKCIELMNGRWFGGKQIHASIDDGSVNHALVRDLEEETDRLEK 463


>ref|XP_011077114.1| PREDICTED: splicing factor U2AF-associated protein 2 isoform X3
            [Sesamum indicum]
          Length = 462

 Score =  643 bits (1659), Expect = 0.0
 Identities = 318/458 (69%), Positives = 375/458 (81%), Gaps = 12/458 (2%)
 Frame = -2

Query: 2532 TEVGWYVLGGDQQHIGPYAFSELHEHFTNGFLSESTLVWSEGRSDWQPLSSIFGLMTEVS 2353
            T  GWY+LG DQQ IGPY  SEL EH+++G+ S+STLVWSEG SDWQPLSS+ GL+T+  
Sbjct: 3    TGAGWYILGQDQQLIGPYTVSELQEHYSSGYFSQSTLVWSEGYSDWQPLSSVPGLLTDAP 62

Query: 2352 QQ-------VPTNKDDEFEKWQKEVREAEAEALKHEAVNSNDDNERPSXXXXXXXXXXXX 2194
             Q       V +N++DEFEKWQ+EVREAEAEA     VN NDD +RP+            
Sbjct: 63   PQNALGSVPVTSNEEDEFEKWQREVREAEAEA----EVNKNDDQDRPTTPPEGEEEFTDD 118

Query: 2193 XXXTYKWDRGLRAWVPQDNISTETEQYGLEEMTFLEEQELFSTVNAVDTSVKEDVIDTSE 2014
               TYKWDR LRAWVPQ+N +  TE Y  E+MTF++E+E+F T++A    VKE+    +E
Sbjct: 119  DGTTYKWDRTLRAWVPQENTTQNTEDYHPEDMTFVQEEEVFPTLDADHLPVKEEDSAANE 178

Query: 2013 VMEGEAKHNDKRKLPDKLAEKK-----EANKPPDSWFELKVNTHIYVTGLPDDVTFDEVV 1849
            V+E   K N KRKLP+K ++KK     EANKPPD+WFELKVNTH+YVTGLPDDVT +EVV
Sbjct: 179  VVE--EKQNGKRKLPEKTSDKKNVDKKEANKPPDAWFELKVNTHVYVTGLPDDVTTEEVV 236

Query: 1848 EAFSKCGIIKEDPETKKPRVKIYVDKETGRKKGDALISYMKEPSVALAIQILDGAPLRPG 1669
            E FSKCGIIKEDPETKKPRVKIYVDKETGRKKGDAL+SY+KEPSVALAIQILDGAPLRP 
Sbjct: 237  EVFSKCGIIKEDPETKKPRVKIYVDKETGRKKGDALVSYLKEPSVALAIQILDGAPLRPD 296

Query: 1668 GKIPMSVTQAKFEQKGDKFISKQIDKRKKRKLQKVEQKMLGWGGRDDAKLLIPATVILRY 1489
            GKIPM+VT+AKFEQKGD+FISKQ+DK KKRKLQKVEQKMLGWGGRDDAK+ IPATVILRY
Sbjct: 297  GKIPMTVTKAKFEQKGDRFISKQVDKNKKRKLQKVEQKMLGWGGRDDAKVSIPATVILRY 356

Query: 1488 MFAPAEMRSDENLRSELEADVQEECVKLGPVESVKVCENHPQGVVLVRFKDRKDAQKCID 1309
            MF PAE+R++E+LRSELE DV++EC KLGP++SVKVCENHPQGV+LV+FKD KDA KCI+
Sbjct: 357  MFTPAELRAEEDLRSELEEDVRDECGKLGPLDSVKVCENHPQGVILVKFKDSKDAHKCIE 416

Query: 1308 LMNGRWFGGRQIHASEDDGAVNHAKVRDLQDDAERLEK 1195
            LMNGRWFGG+QIHAS DDG+VNHA VRDL+++ +RLEK
Sbjct: 417  LMNGRWFGGKQIHASIDDGSVNHALVRDLEEETDRLEK 454


>ref|XP_014510975.1| PREDICTED: splicing factor U2AF-associated protein 2 [Vigna radiata
            var. radiata]
          Length = 506

 Score =  640 bits (1652), Expect = e-180
 Identities = 313/480 (65%), Positives = 373/480 (77%), Gaps = 31/480 (6%)
 Frame = -2

Query: 2541 ETSTEVGWYVLGGDQQHIGPYAFSELHEHFTNGFLSESTLVWSEGRSDWQPLSSIFGLMT 2362
            E  TEVGWYVLG DQQ +GPYAFSEL EHF NG+LSE+T VWSEGRS+WQPLSS+  L T
Sbjct: 19   EKVTEVGWYVLGEDQQQVGPYAFSELREHFLNGYLSENTFVWSEGRSEWQPLSSVSDLWT 78

Query: 2361 EVSQQ-------VPTNKDDEFEKWQKEVREAEAEALKHE---------AVNSNDDNERPS 2230
            +++QQ       V  +  DEFE+W+KE++EAEA+    +            + +D+ERPS
Sbjct: 79   QINQQGLDSSTAVSAHDVDEFERWEKEIKEAEAQVEGSDFGSFSGNVGGTAAGEDSERPS 138

Query: 2229 XXXXXXXXXXXXXXXTYKWDRGLRAWVPQDNISTETEQYGLEEMTFLEEQELFSTVNAVD 2050
                            YKWDR LRAWVPQ+  +  TE YG+E+MTFL+E+E+F T+   D
Sbjct: 139  TPPEGEEEFTDDDGTVYKWDRNLRAWVPQEYPTGSTEPYGVEDMTFLQEEEVFPTITNSD 198

Query: 2049 TS---------------VKEDVIDTSEVMEGEAKHNDKRKLPDKLAEKKEANKPPDSWFE 1915
             S               +KE+  +T+E          KRKL D+  +KKEANKPPDSWFE
Sbjct: 199  ASEKIEDSSELVISDPSLKEEANETNETNNASVVAGGKRKLSDQQTDKKEANKPPDSWFE 258

Query: 1914 LKVNTHIYVTGLPDDVTFDEVVEAFSKCGIIKEDPETKKPRVKIYVDKETGRKKGDALIS 1735
            LK+NTH+YV GLP+DVT DE+VE FSKCGIIKEDPETK+PRVK+YVDKETGRKKGDAL++
Sbjct: 259  LKINTHVYVNGLPEDVTTDEIVEVFSKCGIIKEDPETKRPRVKLYVDKETGRKKGDALVT 318

Query: 1734 YMKEPSVALAIQILDGAPLRPGGKIPMSVTQAKFEQKGDKFISKQIDKRKKRKLQKVEQK 1555
            Y+KEPSVALAIQILDGAP RPGGKIPMSV+QAKFEQKGDKF+SKQ+D +KK+KL++VE K
Sbjct: 319  YLKEPSVALAIQILDGAPFRPGGKIPMSVSQAKFEQKGDKFVSKQVDNKKKKKLKRVEDK 378

Query: 1554 MLGWGGRDDAKLLIPATVILRYMFAPAEMRSDENLRSELEADVQEECVKLGPVESVKVCE 1375
            MLGWGGRDDAK+ IPATVILR+MF+PAEMR+DENLR ELE DV+EEC KLGPV+SVK+CE
Sbjct: 379  MLGWGGRDDAKVSIPATVILRFMFSPAEMRADENLRLELEEDVKEECTKLGPVDSVKICE 438

Query: 1374 NHPQGVVLVRFKDRKDAQKCIDLMNGRWFGGRQIHASEDDGAVNHAKVRDLQDDAERLEK 1195
            NHPQGVVLV+FKDRKDAQKCI+LMNGRWFGGRQIHASEDDG+VNHA VRDLQ+DA RLE+
Sbjct: 439  NHPQGVVLVKFKDRKDAQKCIELMNGRWFGGRQIHASEDDGSVNHALVRDLQEDAIRLEQ 498


>gb|KOM29717.1| hypothetical protein LR48_Vigan747s001900 [Vigna angularis]
          Length = 506

 Score =  637 bits (1644), Expect = e-179
 Identities = 311/480 (64%), Positives = 374/480 (77%), Gaps = 31/480 (6%)
 Frame = -2

Query: 2541 ETSTEVGWYVLGGDQQHIGPYAFSELHEHFTNGFLSESTLVWSEGRSDWQPLSSIFGLMT 2362
            E  TEVGWYVLG DQQ +GPYAFSEL EHF NG+LSE+T VWSEGRS+WQPLSS+  L T
Sbjct: 19   EKVTEVGWYVLGEDQQQVGPYAFSELREHFLNGYLSENTFVWSEGRSEWQPLSSVSDLWT 78

Query: 2361 EVSQQ-------VPTNKDDEFEKWQKEVREAEAEALKHE---------AVNSNDDNERPS 2230
            +++QQ       V  +  DEFE+W+KE++EAEA+    +         +  + +D+ERPS
Sbjct: 79   QINQQGSDFSTAVSAHDVDEFERWEKEIKEAEAQVEGSDFGSFSGNVGSTAAGEDSERPS 138

Query: 2229 XXXXXXXXXXXXXXXTYKWDRGLRAWVPQDNISTETEQYGLEEMTFLEEQELFSTVN--- 2059
                            YKWDR LRAWVPQ+  +  TE YG+E+MTFL+E+E+F T+    
Sbjct: 139  TPPEGEEEFTDDDGTVYKWDRNLRAWVPQEYPTGSTEPYGVEDMTFLQEEEVFPTITNSD 198

Query: 2058 ------------AVDTSVKEDVIDTSEVMEGEAKHNDKRKLPDKLAEKKEANKPPDSWFE 1915
                          D S+KE+  +T+E          KRKL D+  +KKEANKPPDSWFE
Sbjct: 199  ASEKIEDSSELVVSDPSLKEEPNETNETNNASVVAGGKRKLSDQQTDKKEANKPPDSWFE 258

Query: 1914 LKVNTHIYVTGLPDDVTFDEVVEAFSKCGIIKEDPETKKPRVKIYVDKETGRKKGDALIS 1735
            LK+NTH+YV GLP+DVT DE+VE FSKCGIIKEDPETK+PRVK+YVDKETGRKKGDAL++
Sbjct: 259  LKINTHVYVNGLPEDVTTDEIVEVFSKCGIIKEDPETKRPRVKLYVDKETGRKKGDALVT 318

Query: 1734 YMKEPSVALAIQILDGAPLRPGGKIPMSVTQAKFEQKGDKFISKQIDKRKKRKLQKVEQK 1555
            Y+KEPSVALAIQILDGAP RPGGKIPMSV+QAKFEQKGDKF+S+Q+D +KK+KL++VE+K
Sbjct: 319  YLKEPSVALAIQILDGAPFRPGGKIPMSVSQAKFEQKGDKFVSRQVDNKKKKKLKRVEEK 378

Query: 1554 MLGWGGRDDAKLLIPATVILRYMFAPAEMRSDENLRSELEADVQEECVKLGPVESVKVCE 1375
            MLGWGGRDDAK+ IPATVILR+MF+PAEMR+DENLR ELE DV+EEC KLGPV+SVK+CE
Sbjct: 379  MLGWGGRDDAKVSIPATVILRFMFSPAEMRADENLRLELEEDVKEECTKLGPVDSVKICE 438

Query: 1374 NHPQGVVLVRFKDRKDAQKCIDLMNGRWFGGRQIHASEDDGAVNHAKVRDLQDDAERLEK 1195
            NHPQGVVLV+FKDRKDAQKCI+LMNGRWFGGR IHASEDDG+VNHA VRDLQ+DA RLE+
Sbjct: 439  NHPQGVVLVKFKDRKDAQKCIELMNGRWFGGRLIHASEDDGSVNHALVRDLQEDAIRLEQ 498


>ref|XP_002316170.1| hypothetical protein POPTR_0010s18610g [Populus trichocarpa]
            gi|222865210|gb|EEF02341.1| hypothetical protein
            POPTR_0010s18610g [Populus trichocarpa]
          Length = 497

 Score =  635 bits (1639), Expect = e-179
 Identities = 323/484 (66%), Positives = 379/484 (78%), Gaps = 23/484 (4%)
 Frame = -2

Query: 2577 CITAENGFDQNSETSTEVGWYVLGGDQQHIGPYAFSELHEHFTNGFLSESTLVWSEGRSD 2398
            C  A NG+D N  T  EVGWY+LG DQQ +GPY FSEL EHF NG+L ESTLVWSEGRSD
Sbjct: 8    CTGAGNGYDGNYNTVAEVGWYILGEDQQQVGPYVFSELREHFLNGYLLESTLVWSEGRSD 67

Query: 2397 WQPLSSIFGLMTEVSQQ-------VPTNKD-DEFEKWQKEVREAEAEA--LKHEAVNSN- 2251
            WQPLSSI  LM+  SQQ       V +N D DEFEKWQ+EV+EAEAEA  LK+ ++  N 
Sbjct: 68   WQPLSSIPELMSGTSQQGSDYSRAVSSNDDEDEFEKWQREVKEAEAEAERLKNGSLPGNT 127

Query: 2250 ------DDNERPSXXXXXXXXXXXXXXXTYKWDRGLRAWVPQDNISTETEQYGLEEMTFL 2089
                  DD++R                 TYKWDR LRAWVPQDN+S+ + QYG+E+MTF 
Sbjct: 128  GDDFGIDDSDRILSPPDGEDEFTDDDGTTYKWDRSLRAWVPQDNLSSVSGQYGVEQMTFH 187

Query: 2088 EEQELFSTVNAVDTSVKEDVIDTSEVMEGEAKHNDKRKLPD------KLAEKKEANKPPD 1927
            E++E+F  VNA D S+K++   T EV+E +   +DKRKL D      K A+KKEANK PD
Sbjct: 188  EQEEVFLNVNAADASLKDEANGTGEVVESQ--RSDKRKLQDEQADKDKQADKKEANKAPD 245

Query: 1926 SWFELKVNTHIYVTGLPDDVTFDEVVEAFSKCGIIKEDPETKKPRVKIYVDKETGRKKGD 1747
            SWFELKVNTH+YVTGLPDDVT +EVVE FSKCG+IKEDPE KKPRVKIYVDKETGR KGD
Sbjct: 246  SWFELKVNTHVYVTGLPDDVTAEEVVEVFSKCGVIKEDPEKKKPRVKIYVDKETGRIKGD 305

Query: 1746 ALISYMKEPSVALAIQILDGAPLRPGGKIPMSVTQAKFEQKGDKFISKQIDKRKKRKLQK 1567
            AL++Y+KEPSV LA+QILDG PLRPGG IPMSVTQAKFEQKGD+FI+KQ+D +KKRKL+K
Sbjct: 306  ALVTYLKEPSVDLAMQILDGTPLRPGGTIPMSVTQAKFEQKGDRFITKQVDSKKKRKLKK 365

Query: 1566 VEQKMLGWGGRDDAKLLIPATVILRYMFAPAEMRSDENLRSELEADVQEECVKLGPVESV 1387
            VE ++LGWGGRDDAK+ IPATV+LR MF  +EMR+DE+LRSELE DV+EEC KLGPV+SV
Sbjct: 366  VEDRILGWGGRDDAKVSIPATVVLRQMFTLSEMRADESLRSELEVDVREECAKLGPVDSV 425

Query: 1386 KVCENHPQGVVLVRFKDRKDAQKCIDLMNGRWFGGRQIHASEDDGAVNHAKVRDLQDDAE 1207
            KVCEN+P GVVLV+FKDRKDAQ CI+LMNGRWFGGRQ+ ASEDDG +NHA VRD  +DA 
Sbjct: 426  KVCENNPHGVVLVKFKDRKDAQSCIELMNGRWFGGRQVDASEDDGLINHALVRDHDEDAA 485

Query: 1206 RLEK 1195
            RLE+
Sbjct: 486  RLEQ 489


>ref|XP_003556435.1| PREDICTED: HIV Tat-specific factor 1 homolog isoform X1 [Glycine max]
            gi|734324209|gb|KHN04984.1| HIV Tat-specific factor 1
            like [Glycine soja] gi|947042838|gb|KRG92562.1|
            hypothetical protein GLYMA_20G218800 [Glycine max]
          Length = 500

 Score =  635 bits (1638), Expect = e-179
 Identities = 320/492 (65%), Positives = 378/492 (76%), Gaps = 32/492 (6%)
 Frame = -2

Query: 2574 ITAENGFDQN-------SETSTEVGWYVLGGDQQHIGPYAFSELHEHFTNGFLSESTLVW 2416
            +T++NG + +       +E  TEVGWYVLG DQQ IGPYAFSEL +HF NG+LSE+T VW
Sbjct: 1    MTSQNGDESHHPPPQAQAEKVTEVGWYVLGEDQQQIGPYAFSELCQHFLNGYLSENTFVW 60

Query: 2415 SEGRSDWQPLSSIFGLMTEVSQQVPTNKD-------DEFEKWQKEVREAEAEALKHE--- 2266
            SEG S+WQPLSS+  L  ++++Q P +         DEFE+WQKE++E EA+    E   
Sbjct: 61   SEGSSEWQPLSSVSDLWAQINRQGPDSSTTVSAPDVDEFERWQKEIQEVEAQVEGSEFGS 120

Query: 2265 ------AVNSNDDNERPSXXXXXXXXXXXXXXXTYKWDRGLRAWVPQDNISTETEQYGLE 2104
                     + +D+ERPS                YKWDR LRAWVPQD  +  T+ YG+E
Sbjct: 121  LSGNVGGTGAGEDSERPSTPPEGEEGFTDDDGTVYKWDRSLRAWVPQDYPTGSTKPYGVE 180

Query: 2103 EMTFLEEQELFSTVNAVDTSVK-EDV----IDTSEVMEGEAKHN----DKRKLPDKLAEK 1951
            EMTFLEE+E+F T+   D S K ED     +    + E E   N     KR L D+  +K
Sbjct: 181  EMTFLEEEEVFPTIPNSDASEKFEDSPKLSVSVPPLKEEENNTNVISGGKRMLSDQQTDK 240

Query: 1950 KEANKPPDSWFELKVNTHIYVTGLPDDVTFDEVVEAFSKCGIIKEDPETKKPRVKIYVDK 1771
            KEANKPPDSWFELK+NTH+YVTGLP+DVT DE+VE FSKCGIIKEDPETK+PRVK+YVDK
Sbjct: 241  KEANKPPDSWFELKINTHVYVTGLPEDVTTDEIVEVFSKCGIIKEDPETKRPRVKLYVDK 300

Query: 1770 ETGRKKGDALISYMKEPSVALAIQILDGAPLRPGGKIPMSVTQAKFEQKGDKFISKQIDK 1591
            ETGRKKGDAL++Y+KEPSVALAIQILDGAPLRPGGKIPMSV+QAKFEQKGDKF+SKQ+D 
Sbjct: 301  ETGRKKGDALVTYLKEPSVALAIQILDGAPLRPGGKIPMSVSQAKFEQKGDKFVSKQVDG 360

Query: 1590 RKKRKLQKVEQKMLGWGGRDDAKLLIPATVILRYMFAPAEMRSDENLRSELEADVQEECV 1411
            +KK+KL+KVE KMLGWGGRDDAK+ IPATVILRYMFAPAEMR+DENL  ELE DV+EEC 
Sbjct: 361  KKKKKLKKVEDKMLGWGGRDDAKVSIPATVILRYMFAPAEMRADENLHLELEEDVKEECT 420

Query: 1410 KLGPVESVKVCENHPQGVVLVRFKDRKDAQKCIDLMNGRWFGGRQIHASEDDGAVNHAKV 1231
            KLGPV+SVK+CENHPQGVVLVRFKDRKDAQKCI+LMNGRWFGGRQIHASEDDG+VNHA V
Sbjct: 421  KLGPVDSVKICENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGSVNHALV 480

Query: 1230 RDLQDDAERLEK 1195
            RDL++D  RLE+
Sbjct: 481  RDLEEDVIRLEQ 492


>ref|XP_007143970.1| hypothetical protein PHAVU_007G117900g [Phaseolus vulgaris]
            gi|561017160|gb|ESW15964.1| hypothetical protein
            PHAVU_007G117900g [Phaseolus vulgaris]
          Length = 509

 Score =  625 bits (1611), Expect = e-176
 Identities = 306/483 (63%), Positives = 374/483 (77%), Gaps = 34/483 (7%)
 Frame = -2

Query: 2541 ETSTEVGWYVLGGDQQHIGPYAFSELHEHFTNGFLSESTLVWSEGRSDWQPLSSIFGLMT 2362
            E  TEVGWYVLG DQQ +GPYAFSEL EHF NG+LSE+T VWSEGRS+WQPLSS+  L T
Sbjct: 19   EKVTEVGWYVLGEDQQQVGPYAFSELREHFLNGYLSENTFVWSEGRSEWQPLSSVSDLWT 78

Query: 2361 EVSQQ-------VPTNKDDEFEKWQKEVREAEAEALKHE---------AVNSNDDNERPS 2230
            ++++Q       V  +  DEFE+W+KE++EAEA+    +            + +D+ERPS
Sbjct: 79   QINRQGLDSSAAVSAHDVDEFERWEKEIQEAEAQVEGSDFGSFAGNVGGTAAGEDSERPS 138

Query: 2229 XXXXXXXXXXXXXXXTYKWDRGLRAWVPQDNISTETEQYGLEEMTFLEEQELFSTVN--- 2059
                            YKWDR LRAWVPQ+  +  TE Y +E+MTFL+E+E+F T+    
Sbjct: 139  TPPEGEEEFTDDDGTVYKWDRNLRAWVPQEYPTGSTEPYRVEDMTFLQEEEVFPTITNSD 198

Query: 2058 ------------AVDTSVKEDVID---TSEVMEGEAKHNDKRKLPDKLAEKKEANKPPDS 1924
                          D S+KE+V +   T+E  +       KRKL D+  +KKEANKPPDS
Sbjct: 199  ASEKFEDSSKLGVSDPSLKEEVNNANKTNEANDISVVAGGKRKLSDQQTDKKEANKPPDS 258

Query: 1923 WFELKVNTHIYVTGLPDDVTFDEVVEAFSKCGIIKEDPETKKPRVKIYVDKETGRKKGDA 1744
            WFELK+NTH+YV GLP+DVT DE+VE FSKCGIIKEDPETK+PRVK+YVDKETG+ KGDA
Sbjct: 259  WFELKINTHVYVNGLPEDVTTDEIVEVFSKCGIIKEDPETKRPRVKLYVDKETGKNKGDA 318

Query: 1743 LISYMKEPSVALAIQILDGAPLRPGGKIPMSVTQAKFEQKGDKFISKQIDKRKKRKLQKV 1564
            L++Y+KEPSVALAIQILDGAP RPGGKIPMSV+QAKF+QKGD+F+SKQ+D +KK+KL++V
Sbjct: 319  LVTYLKEPSVALAIQILDGAPFRPGGKIPMSVSQAKFQQKGDRFVSKQVDNKKKKKLKRV 378

Query: 1563 EQKMLGWGGRDDAKLLIPATVILRYMFAPAEMRSDENLRSELEADVQEECVKLGPVESVK 1384
            E+KMLGWGGRDDAK+ IPAT+ILR+MF+PAEMR+DENLR ELE DV+EEC KLGPV+SVK
Sbjct: 379  EEKMLGWGGRDDAKVSIPATMILRFMFSPAEMRADENLRLELEEDVKEECTKLGPVDSVK 438

Query: 1383 VCENHPQGVVLVRFKDRKDAQKCIDLMNGRWFGGRQIHASEDDGAVNHAKVRDLQDDAER 1204
            +CENHPQGVVLV+FKDRKDAQKCI+LMNGRWFGGRQ+HASEDDG+VNHA VRDLQ+DA R
Sbjct: 439  ICENHPQGVVLVKFKDRKDAQKCIELMNGRWFGGRQVHASEDDGSVNHALVRDLQEDAIR 498

Query: 1203 LEK 1195
            LE+
Sbjct: 499  LEQ 501


>gb|KCW84610.1| hypothetical protein EUGRSUZ_B01442 [Eucalyptus grandis]
          Length = 472

 Score =  623 bits (1606), Expect = e-175
 Identities = 306/449 (68%), Positives = 366/449 (81%), Gaps = 19/449 (4%)
 Frame = -2

Query: 2535 STEVGWYVLGGDQQHIGPYAFSELHEHFTNGFLSESTLVWSEGRSDWQPLSSIFGLMTEV 2356
            ++E GWY+LG +QQ++GPYA +EL EH  NG+LSESTLVW+EGR+DWQPLSS+  LM  +
Sbjct: 2    ASEAGWYILGDNQQNVGPYAAAELLEHLKNGYLSESTLVWAEGRADWQPLSSVPELMLPL 61

Query: 2355 SQ-----QVPT--NKDDEFEKWQKEVREAEAEALKHEAVNSNDDNERPSXXXXXXXXXXX 2197
            S      Q P   N ++EFEKWQ+EVRE+EA  L + + ++ DD  RPS           
Sbjct: 62   SDNGDGSQNPAVLNSEEEFEKWQREVRESEAVGLNNGSQSAEDDLIRPSTPPEGEEEFVD 121

Query: 2196 XXXXTYKWDRGLRAWVPQDNISTETEQYGLEEMTFLEEQELFSTVNAVDTSVKEDVIDTS 2017
                 YKWDRGLRAW PQDNIS  +++YGLEEMTFLEE+E+F + N  D   KE+V + +
Sbjct: 122  DDGTRYKWDRGLRAWAPQDNISANSDRYGLEEMTFLEEEEVFPSGNW-DEPTKEEVNEPA 180

Query: 2016 EVMEG-------EAKHNDKRKLPDKLAEKKEA-----NKPPDSWFELKVNTHIYVTGLPD 1873
            ++ E        EAK N KRK P+K A +KEA     NKPPDSWF+LKVNTH+YVTGLP+
Sbjct: 181  DIAEAKTVSDSEEAKPNAKRKQPEKEASEKEASKKEPNKPPDSWFDLKVNTHVYVTGLPE 240

Query: 1872 DVTFDEVVEAFSKCGIIKEDPETKKPRVKIYVDKETGRKKGDALISYMKEPSVALAIQIL 1693
            DVT +EVVE FSKCGI+KEDPETKKPRVKIYVDKETGRKKGDAL++Y+KEPSVALAIQIL
Sbjct: 241  DVTMEEVVEVFSKCGILKEDPETKKPRVKIYVDKETGRKKGDALVTYLKEPSVALAIQIL 300

Query: 1692 DGAPLRPGGKIPMSVTQAKFEQKGDKFISKQIDKRKKRKLQKVEQKMLGWGGRDDAKLLI 1513
            DGAP RPGGK+PMSV+QAKFEQKGDKFISKQ+D +KK+KL+KVE+KMLGWGGRDDAK+L+
Sbjct: 301  DGAPFRPGGKVPMSVSQAKFEQKGDKFISKQVDGKKKKKLKKVEEKMLGWGGRDDAKVLV 360

Query: 1512 PATVILRYMFAPAEMRSDENLRSELEADVQEECVKLGPVESVKVCENHPQGVVLVRFKDR 1333
            P TV+LRYMFAPAEMR+D+NLR ELE D++EECVKLGPV+SVKVCENHPQGVVLV+FKDR
Sbjct: 361  PTTVVLRYMFAPAEMRADDNLRPELEEDIREECVKLGPVDSVKVCENHPQGVVLVKFKDR 420

Query: 1332 KDAQKCIDLMNGRWFGGRQIHASEDDGAV 1246
            KDAQKCI+LMNGRWFGGRQIHASEDDG++
Sbjct: 421  KDAQKCIELMNGRWFGGRQIHASEDDGSL 449


>emb|CDP04154.1| unnamed protein product [Coffea canephora]
          Length = 477

 Score =  621 bits (1601), Expect = e-174
 Identities = 304/465 (65%), Positives = 376/465 (80%), Gaps = 9/465 (1%)
 Frame = -2

Query: 2562 NGFDQNSETSTEVGWYVLGGDQQHIGPYAFSELHEHFTNGFLSESTLVWSEGRSDWQPLS 2383
            NG    + ++T+V W+VLG DQQ IGPY+ SEL EH+++G+LS++TLVW +G ++WQP+S
Sbjct: 11   NGLTSLTTSATDVAWFVLGPDQQPIGPYSSSELREHYSSGYLSDATLVWFQGATNWQPVS 70

Query: 2382 SIFGLMTEVSQQ-------VP--TNKDDEFEKWQKEVREAEAEALKHEAVNSNDDNERPS 2230
            S+ GL+T++  Q       VP  +N++DEFEKWQ+EVREAEAEA +   +    + E+PS
Sbjct: 71   SVPGLLTDLPVQNAQIQLAVPKTSNEEDEFEKWQREVREAEAEAERAVTI----EPEKPS 126

Query: 2229 XXXXXXXXXXXXXXXTYKWDRGLRAWVPQDNISTETEQYGLEEMTFLEEQELFSTVNAVD 2050
                            YKWDR LRAWVPQ++ S  T  YG+++M F++E+E+F T+ A D
Sbjct: 127  TPPEGEEEFTDDDGTLYKWDRTLRAWVPQEDNSENTANYGVDDMIFVKEEEVFPTIKADD 186

Query: 2049 TSVKEDVIDTSEVMEGEAKHNDKRKLPDKLAEKKEANKPPDSWFELKVNTHIYVTGLPDD 1870
              V+E++  TS+ +E  A  N KRKLP+K AEKKEANKPPDSWFELKVNTH+YVTGLPDD
Sbjct: 187  FPVEEEIKGTSDTVE--ANPNGKRKLPEKTAEKKEANKPPDSWFELKVNTHVYVTGLPDD 244

Query: 1869 VTFDEVVEAFSKCGIIKEDPETKKPRVKIYVDKETGRKKGDALISYMKEPSVALAIQILD 1690
            VT DEVVE FSKCGIIKEDPE KKPRVKIYVDKE+GR+KGDAL++++KEPSV LAIQILD
Sbjct: 245  VTVDEVVEVFSKCGIIKEDPEMKKPRVKIYVDKESGRQKGDALVTFLKEPSVDLAIQILD 304

Query: 1689 GAPLRPGGKIPMSVTQAKFEQKGDKFISKQIDKRKKRKLQKVEQKMLGWGGRDDAKLLIP 1510
            G P R GGKIPMSVT+AKFEQKG+ F+ K++DKRKK+KLQ +E+KMLGWGG DDAKLLIP
Sbjct: 305  GTPFRAGGKIPMSVTKAKFEQKGETFLPKKVDKRKKKKLQHLERKMLGWGGLDDAKLLIP 364

Query: 1509 ATVILRYMFAPAEMRSDENLRSELEADVQEECVKLGPVESVKVCENHPQGVVLVRFKDRK 1330
            ATVILRYMF P E+R+DENLRSELE DV++EC KLGP+ESVKVCENHPQGV+LV+FKDRK
Sbjct: 365  ATVILRYMFTPDEIRADENLRSELEEDVRDECTKLGPLESVKVCENHPQGVILVKFKDRK 424

Query: 1329 DAQKCIDLMNGRWFGGRQIHASEDDGAVNHAKVRDLQDDAERLEK 1195
            DA KCI+LMNGRWFG RQIHASEDDG+VNHA VRDL+ +A+RLE+
Sbjct: 425  DALKCIELMNGRWFGKRQIHASEDDGSVNHALVRDLEAEADRLEQ 469


>ref|XP_012470592.1| PREDICTED: HIV Tat-specific factor 1 isoform X2 [Gossypium raimondii]
            gi|763751794|gb|KJB19182.1| hypothetical protein
            B456_003G087900 [Gossypium raimondii]
          Length = 518

 Score =  620 bits (1600), Expect = e-174
 Identities = 320/514 (62%), Positives = 379/514 (73%), Gaps = 44/514 (8%)
 Frame = -2

Query: 2604 NHPFLQSEACITAENGFDQNSETSTEVGWYVLGGDQQHIGPYAFSELHEHFTNGFLSEST 2425
            NHP   +E C+ A  G          VGWY+LG DQQ++GPYA SEL EHF NG+L+EST
Sbjct: 9    NHPQSGTENCLNAVAG----------VGWYILGEDQQNVGPYAISELREHFLNGYLTEST 58

Query: 2424 LVWSEGRSDWQPLSSIFGLMTEVSQQ-------------------------VPTNK---D 2329
            L WSEGRS WQPLSSI   ++ +S Q                         VP+N     
Sbjct: 59   LAWSEGRSQWQPLSSIPEFVSVISHQANNFSATGDDDAFLNSMKEGDNSNAVPSNDGDGS 118

Query: 2328 DEFEKWQKEVREAEAEA--LKHEAVNSN-------DDNERPSXXXXXXXXXXXXXXXTYK 2176
            DEFEKWQ+E+REAEAE   LK  +V+ +       DD +RP                 YK
Sbjct: 119  DEFEKWQREIREAEAETERLKTGSVSRSTGDAFGFDDQDRPLTPPEGEEEFTDDDGTRYK 178

Query: 2175 WDRGLRAWVPQDNISTETEQYGLEEMTFLEEQELFSTVNAVDTS-------VKEDVIDTS 2017
            WDR LRAWVPQD++ST+   YG+EEMTFLEE E+F T++A+D S       V+E+V    
Sbjct: 179  WDRNLRAWVPQDDMSTKNGNYGVEEMTFLEEDEVFPTISAIDASAAVADASVRENVNGGG 238

Query: 2016 EVMEGEAKHNDKRKLPDKLAEKKEANKPPDSWFELKVNTHIYVTGLPDDVTFDEVVEAFS 1837
            E  + E   N KRKL +K  +KKEANKPPDSWF+LKVNTH+YVTGLPDDVT +E+VE FS
Sbjct: 239  E--QTEVNCNAKRKLLEKPVDKKEANKPPDSWFQLKVNTHVYVTGLPDDVTAEELVEVFS 296

Query: 1836 KCGIIKEDPETKKPRVKIYVDKETGRKKGDALISYMKEPSVALAIQILDGAPLRPGGKIP 1657
            KCGIIKEDPETK+PRVKIYVDKETGRKKGDAL++Y+KEPSVALA+QILDG P RP GKIP
Sbjct: 297  KCGIIKEDPETKRPRVKIYVDKETGRKKGDALVTYLKEPSVALAVQILDGTPFRPDGKIP 356

Query: 1656 MSVTQAKFEQKGDKFISKQIDKRKKRKLQKVEQKMLGWGGRDDAKLLIPATVILRYMFAP 1477
            MSV+QAKFEQKGDKFI+KQ+D RKK+KL+KVE++ML WGGRDDAK+ IPATV+LR MF P
Sbjct: 357  MSVSQAKFEQKGDKFIAKQVDSRKKKKLKKVEERMLSWGGRDDAKVTIPATVVLRNMFTP 416

Query: 1476 AEMRSDENLRSELEADVQEECVKLGPVESVKVCENHPQGVVLVRFKDRKDAQKCIDLMNG 1297
            AEMR+DENL SELE DV+EEC+KLG ++SVKVC N+PQGVVLV++KDRKDAQKCI+LMNG
Sbjct: 417  AEMRADENLCSELEEDVKEECLKLGLLDSVKVCSNNPQGVVLVKYKDRKDAQKCIELMNG 476

Query: 1296 RWFGGRQIHASEDDGAVNHAKVRDLQDDAERLEK 1195
            RWFGGRQIHASEDDG VNHA VRDL +DA RLE+
Sbjct: 477  RWFGGRQIHASEDDGVVNHALVRDLDEDASRLEQ 510


>ref|XP_012470590.1| PREDICTED: HIV Tat-specific factor 1 isoform X1 [Gossypium raimondii]
            gi|823141545|ref|XP_012470591.1| PREDICTED: HIV
            Tat-specific factor 1 isoform X1 [Gossypium raimondii]
          Length = 521

 Score =  619 bits (1597), Expect = e-174
 Identities = 320/517 (61%), Positives = 379/517 (73%), Gaps = 47/517 (9%)
 Frame = -2

Query: 2604 NHPFLQSEACITAENGFDQNSETSTEVGWYVLGGDQQHIGPYAFSELHEHFTNGFLSEST 2425
            NHP   +E C+ A  G          VGWY+LG DQQ++GPYA SEL EHF NG+L+EST
Sbjct: 9    NHPQSGTENCLNAVAG----------VGWYILGEDQQNVGPYAISELREHFLNGYLTEST 58

Query: 2424 LVWSEGRSDWQPLSSIFGLMTEVSQQ----------------------------VPTNK- 2332
            L WSEGRS WQPLSSI   ++ +S Q                            VP+N  
Sbjct: 59   LAWSEGRSQWQPLSSIPEFVSVISHQANNFSATVSLGDDDAFLNSMKEGDNSNAVPSNDG 118

Query: 2331 --DDEFEKWQKEVREAEAEA--LKHEAVNSN-------DDNERPSXXXXXXXXXXXXXXX 2185
               DEFEKWQ+E+REAEAE   LK  +V+ +       DD +RP                
Sbjct: 119  DGSDEFEKWQREIREAEAETERLKTGSVSRSTGDAFGFDDQDRPLTPPEGEEEFTDDDGT 178

Query: 2184 TYKWDRGLRAWVPQDNISTETEQYGLEEMTFLEEQELFSTVNAV-------DTSVKEDVI 2026
             YKWDR LRAWVPQD++ST+   YG+EEMTFLEE E+F T++A+       D SV+E+V 
Sbjct: 179  RYKWDRNLRAWVPQDDMSTKNGNYGVEEMTFLEEDEVFPTISAIDASAAVADASVRENVN 238

Query: 2025 DTSEVMEGEAKHNDKRKLPDKLAEKKEANKPPDSWFELKVNTHIYVTGLPDDVTFDEVVE 1846
               E  + E   N KRKL +K  +KKEANKPPDSWF+LKVNTH+YVTGLPDDVT +E+VE
Sbjct: 239  GGGE--QTEVNCNAKRKLLEKPVDKKEANKPPDSWFQLKVNTHVYVTGLPDDVTAEELVE 296

Query: 1845 AFSKCGIIKEDPETKKPRVKIYVDKETGRKKGDALISYMKEPSVALAIQILDGAPLRPGG 1666
             FSKCGIIKEDPETK+PRVKIYVDKETGRKKGDAL++Y+KEPSVALA+QILDG P RP G
Sbjct: 297  VFSKCGIIKEDPETKRPRVKIYVDKETGRKKGDALVTYLKEPSVALAVQILDGTPFRPDG 356

Query: 1665 KIPMSVTQAKFEQKGDKFISKQIDKRKKRKLQKVEQKMLGWGGRDDAKLLIPATVILRYM 1486
            KIPMSV+QAKFEQKGDKFI+KQ+D RKK+KL+KVE++ML WGGRDDAK+ IPATV+LR M
Sbjct: 357  KIPMSVSQAKFEQKGDKFIAKQVDSRKKKKLKKVEERMLSWGGRDDAKVTIPATVVLRNM 416

Query: 1485 FAPAEMRSDENLRSELEADVQEECVKLGPVESVKVCENHPQGVVLVRFKDRKDAQKCIDL 1306
            F PAEMR+DENL SELE DV+EEC+KLG ++SVKVC N+PQGVVLV++KDRKDAQKCI+L
Sbjct: 417  FTPAEMRADENLCSELEEDVKEECLKLGLLDSVKVCSNNPQGVVLVKYKDRKDAQKCIEL 476

Query: 1305 MNGRWFGGRQIHASEDDGAVNHAKVRDLQDDAERLEK 1195
            MNGRWFGGRQIHASEDDG VNHA VRDL +DA RLE+
Sbjct: 477  MNGRWFGGRQIHASEDDGVVNHALVRDLDEDASRLEQ 513


>ref|XP_012470595.1| PREDICTED: HIV Tat-specific factor 1 isoform X4 [Gossypium raimondii]
            gi|763751795|gb|KJB19183.1| hypothetical protein
            B456_003G087900 [Gossypium raimondii]
            gi|763751799|gb|KJB19187.1| hypothetical protein
            B456_003G087900 [Gossypium raimondii]
          Length = 510

 Score =  619 bits (1597), Expect = e-174
 Identities = 315/502 (62%), Positives = 377/502 (75%), Gaps = 44/502 (8%)
 Frame = -2

Query: 2568 AENGFDQNSETSTEVGWYVLGGDQQHIGPYAFSELHEHFTNGFLSESTLVWSEGRSDWQP 2389
            + +G D + ++   VGWY+LG DQQ++GPYA SEL EHF NG+L+ESTL WSEGRS WQP
Sbjct: 3    SSDGSDNHPQSVAGVGWYILGEDQQNVGPYAISELREHFLNGYLTESTLAWSEGRSQWQP 62

Query: 2388 LSSIFGLMTEVSQQ-------------------------VPTNK---DDEFEKWQKEVRE 2293
            LSSI   ++ +S Q                         VP+N     DEFEKWQ+E+RE
Sbjct: 63   LSSIPEFVSVISHQANNFSATGDDDAFLNSMKEGDNSNAVPSNDGDGSDEFEKWQREIRE 122

Query: 2292 AEAEA--LKHEAVNSN-------DDNERPSXXXXXXXXXXXXXXXTYKWDRGLRAWVPQD 2140
            AEAE   LK  +V+ +       DD +RP                 YKWDR LRAWVPQD
Sbjct: 123  AEAETERLKTGSVSRSTGDAFGFDDQDRPLTPPEGEEEFTDDDGTRYKWDRNLRAWVPQD 182

Query: 2139 NISTETEQYGLEEMTFLEEQELFSTVNAVDTS-------VKEDVIDTSEVMEGEAKHNDK 1981
            ++ST+   YG+EEMTFLEE E+F T++A+D S       V+E+V    E  + E   N K
Sbjct: 183  DMSTKNGNYGVEEMTFLEEDEVFPTISAIDASAAVADASVRENVNGGGE--QTEVNCNAK 240

Query: 1980 RKLPDKLAEKKEANKPPDSWFELKVNTHIYVTGLPDDVTFDEVVEAFSKCGIIKEDPETK 1801
            RKL +K  +KKEANKPPDSWF+LKVNTH+YVTGLPDDVT +E+VE FSKCGIIKEDPETK
Sbjct: 241  RKLLEKPVDKKEANKPPDSWFQLKVNTHVYVTGLPDDVTAEELVEVFSKCGIIKEDPETK 300

Query: 1800 KPRVKIYVDKETGRKKGDALISYMKEPSVALAIQILDGAPLRPGGKIPMSVTQAKFEQKG 1621
            +PRVKIYVDKETGRKKGDAL++Y+KEPSVALA+QILDG P RP GKIPMSV+QAKFEQKG
Sbjct: 301  RPRVKIYVDKETGRKKGDALVTYLKEPSVALAVQILDGTPFRPDGKIPMSVSQAKFEQKG 360

Query: 1620 DKFISKQIDKRKKRKLQKVEQKMLGWGGRDDAKLLIPATVILRYMFAPAEMRSDENLRSE 1441
            DKFI+KQ+D RKK+KL+KVE++ML WGGRDDAK+ IPATV+LR MF PAEMR+DENL SE
Sbjct: 361  DKFIAKQVDSRKKKKLKKVEERMLSWGGRDDAKVTIPATVVLRNMFTPAEMRADENLCSE 420

Query: 1440 LEADVQEECVKLGPVESVKVCENHPQGVVLVRFKDRKDAQKCIDLMNGRWFGGRQIHASE 1261
            LE DV+EEC+KLG ++SVKVC N+PQGVVLV++KDRKDAQKCI+LMNGRWFGGRQIHASE
Sbjct: 421  LEEDVKEECLKLGLLDSVKVCSNNPQGVVLVKYKDRKDAQKCIELMNGRWFGGRQIHASE 480

Query: 1260 DDGAVNHAKVRDLQDDAERLEK 1195
            DDG VNHA VRDL +DA RLE+
Sbjct: 481  DDGVVNHALVRDLDEDASRLEQ 502


>ref|XP_012470593.1| PREDICTED: HIV Tat-specific factor 1 isoform X3 [Gossypium raimondii]
            gi|823141551|ref|XP_012470594.1| PREDICTED: HIV
            Tat-specific factor 1 isoform X3 [Gossypium raimondii]
          Length = 513

 Score =  618 bits (1594), Expect = e-174
 Identities = 315/505 (62%), Positives = 377/505 (74%), Gaps = 47/505 (9%)
 Frame = -2

Query: 2568 AENGFDQNSETSTEVGWYVLGGDQQHIGPYAFSELHEHFTNGFLSESTLVWSEGRSDWQP 2389
            + +G D + ++   VGWY+LG DQQ++GPYA SEL EHF NG+L+ESTL WSEGRS WQP
Sbjct: 3    SSDGSDNHPQSVAGVGWYILGEDQQNVGPYAISELREHFLNGYLTESTLAWSEGRSQWQP 62

Query: 2388 LSSIFGLMTEVSQQ----------------------------VPTNK---DDEFEKWQKE 2302
            LSSI   ++ +S Q                            VP+N     DEFEKWQ+E
Sbjct: 63   LSSIPEFVSVISHQANNFSATVSLGDDDAFLNSMKEGDNSNAVPSNDGDGSDEFEKWQRE 122

Query: 2301 VREAEAEA--LKHEAVNSN-------DDNERPSXXXXXXXXXXXXXXXTYKWDRGLRAWV 2149
            +REAEAE   LK  +V+ +       DD +RP                 YKWDR LRAWV
Sbjct: 123  IREAEAETERLKTGSVSRSTGDAFGFDDQDRPLTPPEGEEEFTDDDGTRYKWDRNLRAWV 182

Query: 2148 PQDNISTETEQYGLEEMTFLEEQELFSTVNAV-------DTSVKEDVIDTSEVMEGEAKH 1990
            PQD++ST+   YG+EEMTFLEE E+F T++A+       D SV+E+V    E  + E   
Sbjct: 183  PQDDMSTKNGNYGVEEMTFLEEDEVFPTISAIDASAAVADASVRENVNGGGE--QTEVNC 240

Query: 1989 NDKRKLPDKLAEKKEANKPPDSWFELKVNTHIYVTGLPDDVTFDEVVEAFSKCGIIKEDP 1810
            N KRKL +K  +KKEANKPPDSWF+LKVNTH+YVTGLPDDVT +E+VE FSKCGIIKEDP
Sbjct: 241  NAKRKLLEKPVDKKEANKPPDSWFQLKVNTHVYVTGLPDDVTAEELVEVFSKCGIIKEDP 300

Query: 1809 ETKKPRVKIYVDKETGRKKGDALISYMKEPSVALAIQILDGAPLRPGGKIPMSVTQAKFE 1630
            ETK+PRVKIYVDKETGRKKGDAL++Y+KEPSVALA+QILDG P RP GKIPMSV+QAKFE
Sbjct: 301  ETKRPRVKIYVDKETGRKKGDALVTYLKEPSVALAVQILDGTPFRPDGKIPMSVSQAKFE 360

Query: 1629 QKGDKFISKQIDKRKKRKLQKVEQKMLGWGGRDDAKLLIPATVILRYMFAPAEMRSDENL 1450
            QKGDKFI+KQ+D RKK+KL+KVE++ML WGGRDDAK+ IPATV+LR MF PAEMR+DENL
Sbjct: 361  QKGDKFIAKQVDSRKKKKLKKVEERMLSWGGRDDAKVTIPATVVLRNMFTPAEMRADENL 420

Query: 1449 RSELEADVQEECVKLGPVESVKVCENHPQGVVLVRFKDRKDAQKCIDLMNGRWFGGRQIH 1270
             SELE DV+EEC+KLG ++SVKVC N+PQGVVLV++KDRKDAQKCI+LMNGRWFGGRQIH
Sbjct: 421  CSELEEDVKEECLKLGLLDSVKVCSNNPQGVVLVKYKDRKDAQKCIELMNGRWFGGRQIH 480

Query: 1269 ASEDDGAVNHAKVRDLQDDAERLEK 1195
            ASEDDG VNHA VRDL +DA RLE+
Sbjct: 481  ASEDDGVVNHALVRDLDEDASRLEQ 505


>ref|XP_012470596.1| PREDICTED: HIV Tat-specific factor 1 isoform X5 [Gossypium raimondii]
          Length = 509

 Score =  614 bits (1583), Expect = e-172
 Identities = 313/497 (62%), Positives = 372/497 (74%), Gaps = 47/497 (9%)
 Frame = -2

Query: 2544 SETSTEVGWYVLGGDQQHIGPYAFSELHEHFTNGFLSESTLVWSEGRSDWQPLSSIFGLM 2365
            S+   + GWY+LG DQQ++GPYA SEL EHF NG+L+ESTL WSEGRS WQPLSSI   +
Sbjct: 7    SDNHPQSGWYILGEDQQNVGPYAISELREHFLNGYLTESTLAWSEGRSQWQPLSSIPEFV 66

Query: 2364 TEVSQQ----------------------------VPTNK---DDEFEKWQKEVREAEAEA 2278
            + +S Q                            VP+N     DEFEKWQ+E+REAEAE 
Sbjct: 67   SVISHQANNFSATVSLGDDDAFLNSMKEGDNSNAVPSNDGDGSDEFEKWQREIREAEAET 126

Query: 2277 --LKHEAVNSN-------DDNERPSXXXXXXXXXXXXXXXTYKWDRGLRAWVPQDNISTE 2125
              LK  +V+ +       DD +RP                 YKWDR LRAWVPQD++ST+
Sbjct: 127  ERLKTGSVSRSTGDAFGFDDQDRPLTPPEGEEEFTDDDGTRYKWDRNLRAWVPQDDMSTK 186

Query: 2124 TEQYGLEEMTFLEEQELFSTVNAV-------DTSVKEDVIDTSEVMEGEAKHNDKRKLPD 1966
               YG+EEMTFLEE E+F T++A+       D SV+E+V    E  + E   N KRKL +
Sbjct: 187  NGNYGVEEMTFLEEDEVFPTISAIDASAAVADASVRENVNGGGE--QTEVNCNAKRKLLE 244

Query: 1965 KLAEKKEANKPPDSWFELKVNTHIYVTGLPDDVTFDEVVEAFSKCGIIKEDPETKKPRVK 1786
            K  +KKEANKPPDSWF+LKVNTH+YVTGLPDDVT +E+VE FSKCGIIKEDPETK+PRVK
Sbjct: 245  KPVDKKEANKPPDSWFQLKVNTHVYVTGLPDDVTAEELVEVFSKCGIIKEDPETKRPRVK 304

Query: 1785 IYVDKETGRKKGDALISYMKEPSVALAIQILDGAPLRPGGKIPMSVTQAKFEQKGDKFIS 1606
            IYVDKETGRKKGDAL++Y+KEPSVALA+QILDG P RP GKIPMSV+QAKFEQKGDKFI+
Sbjct: 305  IYVDKETGRKKGDALVTYLKEPSVALAVQILDGTPFRPDGKIPMSVSQAKFEQKGDKFIA 364

Query: 1605 KQIDKRKKRKLQKVEQKMLGWGGRDDAKLLIPATVILRYMFAPAEMRSDENLRSELEADV 1426
            KQ+D RKK+KL+KVE++ML WGGRDDAK+ IPATV+LR MF PAEMR+DENL SELE DV
Sbjct: 365  KQVDSRKKKKLKKVEERMLSWGGRDDAKVTIPATVVLRNMFTPAEMRADENLCSELEEDV 424

Query: 1425 QEECVKLGPVESVKVCENHPQGVVLVRFKDRKDAQKCIDLMNGRWFGGRQIHASEDDGAV 1246
            +EEC+KLG ++SVKVC N+PQGVVLV++KDRKDAQKCI+LMNGRWFGGRQIHASEDDG V
Sbjct: 425  KEECLKLGLLDSVKVCSNNPQGVVLVKYKDRKDAQKCIELMNGRWFGGRQIHASEDDGVV 484

Query: 1245 NHAKVRDLQDDAERLEK 1195
            NHA VRDL +DA RLE+
Sbjct: 485  NHALVRDLDEDASRLEQ 501

