BLASTX nr result
ID: Cornus23_contig00003828
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Cornus23_contig00003828 (2641 letters)

Database: ./nr
77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done

                                                                   Score   E
Sequences producing significant alignments:                        (bits)  Value

ref|XP_010278733.1| PREDICTED: splicing factor U2AF-associated p...  660   0.0
ref|XP_010278734.1| PREDICTED: splicing factor U2AF-associated p...  660   0.0
ref|XP_010038454.1| PREDICTED: splicing factor U2AF-associated p...  648   0.0
ref|XP_003635108.2| PREDICTED: HIV Tat-specific factor 1 [Vitis ...  647   0.0
ref|XP_011077113.1| PREDICTED: splicing factor U2AF-associated p...  645   0.0
ref|XP_003536163.1| PREDICTED: HIV Tat-specific factor 1 homolog...  645   0.0
ref|XP_011077112.1| PREDICTED: splicing factor U2AF-associated p...  644   0.0
ref|XP_011077114.1| PREDICTED: splicing factor U2AF-associated p...  643   0.0
ref|XP_014510975.1| PREDICTED: splicing factor U2AF-associated p...  640   e-180
gb|KOM29717.1| hypothetical protein LR48_Vigan747s001900 [Vigna ...  637   e-179
ref|XP_002316170.1| hypothetical protein POPTR_0010s18610g [Popu...  635   e-179
ref|XP_003556435.1| PREDICTED: HIV Tat-specific factor 1 homolog...  635   e-179
ref|XP_007143970.1| hypothetical protein PHAVU_007G117900g [Phas...  625   e-176
gb|KCW84610.1| hypothetical protein EUGRSUZ_B01442 [Eucalyptus g...  623   e-175
emb|CDP04154.1| unnamed protein product [Coffea canephora]           621   e-174
ref|XP_012470592.1| PREDICTED: HIV Tat-specific factor 1 isoform...  620   e-174
ref|XP_012470590.1| PREDICTED: HIV Tat-specific factor 1 isoform...  619   e-174
ref|XP_012470595.1| PREDICTED: HIV Tat-specific factor 1 isoform...  619   e-174
ref|XP_012470593.1| PREDICTED: HIV Tat-specific factor 1 isoform...
618 e-174 ref|XP_012470596.1| PREDICTED: HIV Tat-specific factor 1 isoform... 614 e-172 >ref|XP_010278733.1| PREDICTED: splicing factor U2AF-associated protein 2 isoform X1 [Nelumbo nucifera] Length = 499 Score = 660 bits (1704), Expect = 0.0 Identities = 322/477 (67%), Positives = 386/477 (80%), Gaps = 19/477 (3%) Frame = -2 Query: 2568 AENGFDQNSETSTEVGWYVLGGDQQHIGPYAFSELHEHFTNGFLSESTLVWSEGRSDWQP 2389 +ENG D +SE EVGWY+LG +Q+H+GPYA SEL EHF NG+LSE+TL+WSEGRSDW P Sbjct: 17 SENGGDADSENVPEVGWYILGENQEHVGPYAISELQEHFLNGYLSENTLLWSEGRSDWMP 76 Query: 2388 LSSIFGLMTEVSQQVP-----TNKDDEFEKWQKEVREAEAEALKHEAVNSN--------- 2251 LS I L T +SQQ P ++ DDEF KWQKEV+EAEAEA +A ++ Sbjct: 77 LSLIPELFTSISQQGPDPTVTSDNDDEFLKWQKEVKEAEAEAEALKACGTSGHVGDADHL 136 Query: 2250 -----DDNERPSXXXXXXXXXXXXXXXTYKWDRGLRAWVPQDNISTETEQYGLEEMTFLE 2086 D ++RP TYKWDRGLRAWVPQDN + ++YG+E+M +L+ Sbjct: 137 NDMVGDADDRPQTPPDGEEEFTDDDGTTYKWDRGLRAWVPQDNSFSRGKEYGVEDMIYLQ 196 Query: 2085 EQELFSTVNAVDTSVKEDVIDTSEVMEGEAKHNDKRKLPDKLAEKKEANKPPDSWFELKV 1906 E+E+F+T + S KE+ T+EV++ AK + KRKLPDK EKK+ANKPPDSWF+LKV Sbjct: 197 EEEVFATPKVAEPSKKEEASGTTEVVD--AKPDVKRKLPDKQTEKKQANKPPDSWFDLKV 254 Query: 1905 NTHIYVTGLPDDVTFDEVVEAFSKCGIIKEDPETKKPRVKIYVDKETGRKKGDALISYMK 1726 NTH+Y+TGLPDDVT +E+VE FSKCG+IKEDPET+KPRVKIYVDKETGRKKGDAL+SY+K Sbjct: 255 NTHVYITGLPDDVTAEEIVEVFSKCGVIKEDPETRKPRVKIYVDKETGRKKGDALVSYLK 314 Query: 1725 EPSVALAIQILDGAPLRPGGKIPMSVTQAKFEQKGDKFISKQIDKRKKRKLQKVEQKMLG 1546 EPSV LAIQILDG PLRPGGK+PMSVTQAKFEQKGDKFI+KQ+DK+KK+KL+K E+K+LG Sbjct: 315 EPSVVLAIQILDGTPLRPGGKVPMSVTQAKFEQKGDKFIAKQVDKKKKKKLKKAEEKILG 374 Query: 1545 WGGRDDAKLLIPATVILRYMFAPAEMRSDENLRSELEADVQEECVKLGPVESVKVCENHP 1366 WGGRDDAKL IPATV+LR+MF PAEMRSD +LRSELEADV+EECVKLGPV+ ++VCENHP Sbjct: 375 WGGRDDAKLSIPATVVLRHMFTPAEMRSDADLRSELEADVKEECVKLGPVDLIRVCENHP 434 Query: 1365 QGVVLVRFKDRKDAQKCIDLMNGRWFGGRQIHASEDDGAVNHAKVRDLQDDAERLEK 1195 QGVVLV+FKDRKDAQKCI+LMNGRWFGGRQIHASEDDG+VNHA VRDL +DA RLE+ Sbjct: 435 QGVVLVKFKDRKDAQKCIELMNGRWFGGRQIHASEDDGSVNHALVRDLDEDAARLEQ 491 
>ref|XP_010278734.1| PREDICTED: splicing factor U2AF-associated protein 2 isoform X2 [Nelumbo nucifera] Length = 497 Score = 660 bits (1703), Expect = 0.0 Identities = 322/476 (67%), Positives = 385/476 (80%), Gaps = 19/476 (3%) Frame = -2 Query: 2565 ENGFDQNSETSTEVGWYVLGGDQQHIGPYAFSELHEHFTNGFLSESTLVWSEGRSDWQPL 2386 ENG D +SE EVGWY+LG +Q+H+GPYA SEL EHF NG+LSE+TL+WSEGRSDW PL Sbjct: 16 ENGGDADSENVPEVGWYILGENQEHVGPYAISELQEHFLNGYLSENTLLWSEGRSDWMPL 75 Query: 2385 SSIFGLMTEVSQQVP-----TNKDDEFEKWQKEVREAEAEALKHEAVNSN---------- 2251 S I L T +SQQ P ++ DDEF KWQKEV+EAEAEA +A ++ Sbjct: 76 SLIPELFTSISQQGPDPTVTSDNDDEFLKWQKEVKEAEAEAEALKACGTSGHVGDADHLN 135 Query: 2250 ----DDNERPSXXXXXXXXXXXXXXXTYKWDRGLRAWVPQDNISTETEQYGLEEMTFLEE 2083 D ++RP TYKWDRGLRAWVPQDN + ++YG+E+M +L+E Sbjct: 136 DMVGDADDRPQTPPDGEEEFTDDDGTTYKWDRGLRAWVPQDNSFSRGKEYGVEDMIYLQE 195 Query: 2082 QELFSTVNAVDTSVKEDVIDTSEVMEGEAKHNDKRKLPDKLAEKKEANKPPDSWFELKVN 1903 +E+F+T + S KE+ T+EV++ AK + KRKLPDK EKK+ANKPPDSWF+LKVN Sbjct: 196 EEVFATPKVAEPSKKEEASGTTEVVD--AKPDVKRKLPDKQTEKKQANKPPDSWFDLKVN 253 Query: 1902 THIYVTGLPDDVTFDEVVEAFSKCGIIKEDPETKKPRVKIYVDKETGRKKGDALISYMKE 1723 TH+Y+TGLPDDVT +E+VE FSKCG+IKEDPET+KPRVKIYVDKETGRKKGDAL+SY+KE Sbjct: 254 THVYITGLPDDVTAEEIVEVFSKCGVIKEDPETRKPRVKIYVDKETGRKKGDALVSYLKE 313 Query: 1722 PSVALAIQILDGAPLRPGGKIPMSVTQAKFEQKGDKFISKQIDKRKKRKLQKVEQKMLGW 1543 PSV LAIQILDG PLRPGGK+PMSVTQAKFEQKGDKFI+KQ+DK+KK+KL+K E+K+LGW Sbjct: 314 PSVVLAIQILDGTPLRPGGKVPMSVTQAKFEQKGDKFIAKQVDKKKKKKLKKAEEKILGW 373 Query: 1542 GGRDDAKLLIPATVILRYMFAPAEMRSDENLRSELEADVQEECVKLGPVESVKVCENHPQ 1363 GGRDDAKL IPATV+LR+MF PAEMRSD +LRSELEADV+EECVKLGPV+ ++VCENHPQ Sbjct: 374 GGRDDAKLSIPATVVLRHMFTPAEMRSDADLRSELEADVKEECVKLGPVDLIRVCENHPQ 433 Query: 1362 GVVLVRFKDRKDAQKCIDLMNGRWFGGRQIHASEDDGAVNHAKVRDLQDDAERLEK 1195 GVVLV+FKDRKDAQKCI+LMNGRWFGGRQIHASEDDG+VNHA VRDL +DA RLE+ Sbjct: 434 GVVLVKFKDRKDAQKCIELMNGRWFGGRQIHASEDDGSVNHALVRDLDEDAARLEQ 489 >ref|XP_010038454.1| PREDICTED: splicing factor U2AF-associated protein 2 isoform X1 [Eucalyptus 
grandis] Length = 474 Score = 648 bits (1672), Expect = 0.0 Identities = 319/466 (68%), Positives = 381/466 (81%), Gaps = 19/466 (4%) Frame = -2 Query: 2535 STEVGWYVLGGDQQHIGPYAFSELHEHFTNGFLSESTLVWSEGRSDWQPLSSIFGLMTEV 2356 ++E GWY+LG +QQ++GPYA +EL EH NG+LSESTLVW+EGR+DWQPLSS+ LM + Sbjct: 2 ASEAGWYILGDNQQNVGPYAAAELLEHLKNGYLSESTLVWAEGRADWQPLSSVPELMLPL 61 Query: 2355 SQ-----QVPT--NKDDEFEKWQKEVREAEAEALKHEAVNSNDDNERPSXXXXXXXXXXX 2197 S Q P N ++EFEKWQ+EVRE+EA L + + ++ DD RPS Sbjct: 62 SDNGDGSQNPAVLNSEEEFEKWQREVRESEAVGLNNGSQSAEDDLIRPSTPPEGEEEFVD 121 Query: 2196 XXXXTYKWDRGLRAWVPQDNISTETEQYGLEEMTFLEEQELFSTVNAVDTSVKEDVIDTS 2017 YKWDRGLRAW PQDNIS +++YGLEEMTFLEE+E+F + N D KE+V + + Sbjct: 122 DDGTRYKWDRGLRAWAPQDNISANSDRYGLEEMTFLEEEEVFPSGNW-DEPTKEEVNEPA 180 Query: 2016 EVMEG-------EAKHNDKRKLPDKLAEKKEA-----NKPPDSWFELKVNTHIYVTGLPD 1873 ++ E EAK N KRK P+K A +KEA NKPPDSWF+LKVNTH+YVTGLP+ Sbjct: 181 DIAEAKTVSDSEEAKPNAKRKQPEKEASEKEASKKEPNKPPDSWFDLKVNTHVYVTGLPE 240 Query: 1872 DVTFDEVVEAFSKCGIIKEDPETKKPRVKIYVDKETGRKKGDALISYMKEPSVALAIQIL 1693 DVT +EVVE FSKCGI+KEDPETKKPRVKIYVDKETGRKKGDAL++Y+KEPSVALAIQIL Sbjct: 241 DVTMEEVVEVFSKCGILKEDPETKKPRVKIYVDKETGRKKGDALVTYLKEPSVALAIQIL 300 Query: 1692 DGAPLRPGGKIPMSVTQAKFEQKGDKFISKQIDKRKKRKLQKVEQKMLGWGGRDDAKLLI 1513 DGAP RPGGK+PMSV+QAKFEQKGDKFISKQ+D +KK+KL+KVE+KMLGWGGRDDAK+L+ Sbjct: 301 DGAPFRPGGKVPMSVSQAKFEQKGDKFISKQVDGKKKKKLKKVEEKMLGWGGRDDAKVLV 360 Query: 1512 PATVILRYMFAPAEMRSDENLRSELEADVQEECVKLGPVESVKVCENHPQGVVLVRFKDR 1333 P TV+LRYMFAPAEMR+D+NLR ELE D++EECVKLGPV+SVKVCENHPQGVVLV+FKDR Sbjct: 361 PTTVVLRYMFAPAEMRADDNLRPELEEDIREECVKLGPVDSVKVCENHPQGVVLVKFKDR 420 Query: 1332 KDAQKCIDLMNGRWFGGRQIHASEDDGAVNHAKVRDLQDDAERLEK 1195 KDAQKCI+LMNGRWFGGRQIHASEDDG+VNHA VRDL++DA RLE+ Sbjct: 421 KDAQKCIELMNGRWFGGRQIHASEDDGSVNHALVRDLEEDAARLEQ 466 >ref|XP_003635108.2| PREDICTED: HIV Tat-specific factor 1 [Vitis vinifera] Length = 488 Score = 647 bits (1670), Expect = 0.0 Identities = 321/459 (69%), Positives = 377/459 (82%), Gaps = 15/459 (3%) Frame = -2 Query: 2529 
EVGWYVLGGDQQHIGPYAFSELHEHFTNGFLSESTLVWSEGRSDWQPLSSIFGLMTEVSQ 2350 EVGWY+LG +QQ++GPYAFSEL EHF NG+LSE++L+WSEGRSDWQPLSSI L T +SQ Sbjct: 25 EVGWYILGENQQNLGPYAFSELREHFLNGYLSENSLLWSEGRSDWQPLSSIPELTTAISQ 84 Query: 2349 Q--------VPTNKDDEFEKWQKEVREAEAEALKHEAVNSN-------DDNERPSXXXXX 2215 P N +DEFEKWQKEVREAEA LK+ + + + +DNERPS Sbjct: 85 PGVDCSSAGPPINDEDEFEKWQKEVREAEA--LKNGSASGSVGGDFGDEDNERPSTPPDG 142 Query: 2214 XXXXXXXXXXTYKWDRGLRAWVPQDNISTETEQYGLEEMTFLEEQELFSTVNAVDTSVKE 2035 TYKWDRGLRAWVPQDN ST +++Y EEMTF E+E+F T+ + SVKE Sbjct: 143 EDEFTDDDGTTYKWDRGLRAWVPQDNPSTRSDEYKPEEMTFSVEEEIFPTIQVAEDSVKE 202 Query: 2034 DVIDTSEVMEGEAKHNDKRKLPDKLAEKKEANKPPDSWFELKVNTHIYVTGLPDDVTFDE 1855 ++ ++V+E E KH+ KRKLP++ AEKKEANKPPDSWF+LKVNTH+YVTGLPDDVT DE Sbjct: 203 --VNGTDVVE-ETKHDAKRKLPEQQAEKKEANKPPDSWFDLKVNTHVYVTGLPDDVTVDE 259 Query: 1854 VVEAFSKCGIIKEDPETKKPRVKIYVDKETGRKKGDALISYMKEPSVALAIQILDGAPLR 1675 VVE FSKCG+IKEDPET++PRVK+Y+DK TGRKKGDAL+SY+KEPSVALAIQILDG PLR Sbjct: 260 VVEVFSKCGLIKEDPETRRPRVKLYIDKNTGRKKGDALVSYLKEPSVALAIQILDGTPLR 319 Query: 1674 PGGKIPMSVTQAKFEQKGDKFISKQIDKRKKRKLQKVEQKMLGWGGRDDAKLLIPATVIL 1495 P G IPMSVT AKFEQKG+KF++KQIDKRKK+KL++VE K+LGWGG DDAKL IPATV+L Sbjct: 320 PVGTIPMSVTLAKFEQKGEKFVAKQIDKRKKKKLKRVEDKILGWGGHDDAKLSIPATVVL 379 Query: 1494 RYMFAPAEMRSDENLRSELEADVQEECVKLGPVESVKVCENHPQGVVLVRFKDRKDAQKC 1315 RYMF PAEMR+D NLRSELE DVQEEC+KLG V+ VKVCE+HPQGVVLV++KDR+DAQKC Sbjct: 380 RYMFTPAEMRADPNLRSELEGDVQEECIKLGSVDLVKVCESHPQGVVLVKYKDRRDAQKC 439 Query: 1314 IDLMNGRWFGGRQIHASEDDGAVNHAKVRDLQDDAERLE 1198 I+LMNGRWFGGRQIHASEDDG+VNHA VRDL DAERLE Sbjct: 440 IELMNGRWFGGRQIHASEDDGSVNHALVRDLDADAERLE 478 >ref|XP_011077113.1| PREDICTED: splicing factor U2AF-associated protein 2 isoform X2 [Sesamum indicum] Length = 469 Score = 645 bits (1664), Expect = 0.0 Identities = 320/462 (69%), Positives = 377/462 (81%), Gaps = 10/462 (2%) Frame = -2 Query: 2550 QNSETSTEVGWYVLGGDQQHIGPYAFSELHEHFTNGFLSESTLVWSEGRSDWQPLSSIFG 2371 Q E T GWY+LG DQQ IGPY SEL EH+++G+ S+STLVWSEG SDWQPLSS+ G Sbjct: 6 
QVPEMVTGAGWYILGQDQQLIGPYTVSELQEHYSSGYFSQSTLVWSEGYSDWQPLSSVPG 65 Query: 2370 LMTEVSQQ-----VPTNKDDEFEKWQKEVREAEAEALKHEAVNSNDDNERPSXXXXXXXX 2206 L+T+ Q V +N++DEFEKWQ+EVREAEAEA VN NDD +RP+ Sbjct: 66 LLTDAPPQNALVPVTSNEEDEFEKWQREVREAEAEA----EVNKNDDQDRPTTPPEGEEE 121 Query: 2205 XXXXXXXTYKWDRGLRAWVPQDNISTETEQYGLEEMTFLEEQELFSTVNAVDTSVKEDVI 2026 TYKWDR LRAWVPQ+N + TE Y E+MTF++E+E+F T++A VKE+ Sbjct: 122 FTDDDGTTYKWDRTLRAWVPQENTTQNTEDYHPEDMTFVQEEEVFPTLDADHLPVKEEDS 181 Query: 2025 DTSEVMEGEAKHNDKRKLPDKLAEKK-----EANKPPDSWFELKVNTHIYVTGLPDDVTF 1861 +EV+E K N KRKLP+K ++KK EANKPPD+WFELKVNTH+YVTGLPDDVT Sbjct: 182 AANEVVE--EKQNGKRKLPEKTSDKKNVDKKEANKPPDAWFELKVNTHVYVTGLPDDVTT 239 Query: 1860 DEVVEAFSKCGIIKEDPETKKPRVKIYVDKETGRKKGDALISYMKEPSVALAIQILDGAP 1681 +EVVE FSKCGIIKEDPETKKPRVKIYVDKETGRKKGDAL+SY+KEPSVALAIQILDGAP Sbjct: 240 EEVVEVFSKCGIIKEDPETKKPRVKIYVDKETGRKKGDALVSYLKEPSVALAIQILDGAP 299 Query: 1680 LRPGGKIPMSVTQAKFEQKGDKFISKQIDKRKKRKLQKVEQKMLGWGGRDDAKLLIPATV 1501 LRP GKIPM+VT+AKFEQKGD+FISKQ+DK KKRKLQKVEQKMLGWGGRDDAK+ IPATV Sbjct: 300 LRPDGKIPMTVTKAKFEQKGDRFISKQVDKNKKRKLQKVEQKMLGWGGRDDAKVSIPATV 359 Query: 1500 ILRYMFAPAEMRSDENLRSELEADVQEECVKLGPVESVKVCENHPQGVVLVRFKDRKDAQ 1321 ILRYMF PAE+R++E+LRSELE DV++EC KLGP++SVKVCENHPQGV+LV+FKD KDA Sbjct: 360 ILRYMFTPAELRAEEDLRSELEEDVRDECGKLGPLDSVKVCENHPQGVILVKFKDSKDAH 419 Query: 1320 KCIDLMNGRWFGGRQIHASEDDGAVNHAKVRDLQDDAERLEK 1195 KCI+LMNGRWFGG+QIHAS DDG+VNHA VRDL+++ +RLEK Sbjct: 420 KCIELMNGRWFGGKQIHASIDDGSVNHALVRDLEEETDRLEK 461 >ref|XP_003536163.1| PREDICTED: HIV Tat-specific factor 1 homolog [Glycine max] gi|734358277|gb|KHN14770.1| HIV Tat-specific factor 1 like [Glycine soja] Length = 503 Score = 645 bits (1664), Expect = 0.0 Identities = 320/478 (66%), Positives = 375/478 (78%), Gaps = 28/478 (5%) Frame = -2 Query: 2544 SETSTEVGWYVLGGDQQHIGPYAFSELHEHFTNGFLSESTLVWSEGRSDWQPLSSIFGLM 2365 +E TEVGWYVLG DQQ IGPYAFSEL EHF NG+LSE+T VWSEGRS+WQPLSS+ L Sbjct: 18 AEKITEVGWYVLGEDQQQIGPYAFSELREHFLNGYLSENTFVWSEGRSEWQPLSSVSDLW 77 Query: 2364 
TEVSQQVPTNKD-------DEFEKWQKEVREAEAEALKHE---------AVNSNDDNERP 2233 +++QQ P + DEFE+WQKE++EAEA+ E + + +D+ERP Sbjct: 78 AQINQQGPDSSTTVSAPDVDEFERWQKEIQEAEAQVEGSEFGSLSGNAGSTGAGEDSERP 137 Query: 2232 SXXXXXXXXXXXXXXXTYKWDRGLRAWVPQDNISTETEQYGLEEMTFLEEQELFSTVNAV 2053 S YKWDR LRAWVPQ++ + TE YG++EMTFLEE+E+F T+ Sbjct: 138 STPPEGEEEFTDDDGTVYKWDRNLRAWVPQEHPTGSTEPYGVQEMTFLEEEEVFPTIPIS 197 Query: 2052 DTSVK-EDVI-----------DTSEVMEGEAKHNDKRKLPDKLAEKKEANKPPDSWFELK 1909 D S K ED +T+E +KRKL D+ +KKEANKPPDSWFELK Sbjct: 198 DASEKFEDSPKLSVSVPPLKEETNEANNTNVVSGEKRKLSDQQTDKKEANKPPDSWFELK 257 Query: 1908 VNTHIYVTGLPDDVTFDEVVEAFSKCGIIKEDPETKKPRVKIYVDKETGRKKGDALISYM 1729 +NTH+YVTGLP+DVT DE+VE FSKCGIIKEDPETKKPRVK+YVDK TGRKKGDAL++Y+ Sbjct: 258 INTHVYVTGLPEDVTTDEIVEVFSKCGIIKEDPETKKPRVKLYVDKGTGRKKGDALVTYL 317 Query: 1728 KEPSVALAIQILDGAPLRPGGKIPMSVTQAKFEQKGDKFISKQIDKRKKRKLQKVEQKML 1549 KEPSVALAIQILDGAPLRP GKIPMSV+QAKFEQKGDKF+SKQ+D +KK+KL+KVE KML Sbjct: 318 KEPSVALAIQILDGAPLRPNGKIPMSVSQAKFEQKGDKFVSKQVDNKKKKKLKKVEDKML 377 Query: 1548 GWGGRDDAKLLIPATVILRYMFAPAEMRSDENLRSELEADVQEECVKLGPVESVKVCENH 1369 GWGGRDDAK+ IPATVILRYMFAPAEMR+DENLR ELE DV+EEC KLGP++SVK+CENH Sbjct: 378 GWGGRDDAKVSIPATVILRYMFAPAEMRADENLRLELEEDVKEECTKLGPLDSVKICENH 437 Query: 1368 PQGVVLVRFKDRKDAQKCIDLMNGRWFGGRQIHASEDDGAVNHAKVRDLQDDAERLEK 1195 PQGVVLVRFKDRKDAQKCI+LMNGRWFGGRQIHASEDDG+VNHA VRDL++DA RLE+ Sbjct: 438 PQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGSVNHALVRDLEEDAIRLEQ 495 >ref|XP_011077112.1| PREDICTED: splicing factor U2AF-associated protein 2 isoform X1 [Sesamum indicum] Length = 471 Score = 644 bits (1662), Expect = 0.0 Identities = 320/464 (68%), Positives = 377/464 (81%), Gaps = 12/464 (2%) Frame = -2 Query: 2550 QNSETSTEVGWYVLGGDQQHIGPYAFSELHEHFTNGFLSESTLVWSEGRSDWQPLSSIFG 2371 Q E T GWY+LG DQQ IGPY SEL EH+++G+ S+STLVWSEG SDWQPLSS+ G Sbjct: 6 QVPEMVTGAGWYILGQDQQLIGPYTVSELQEHYSSGYFSQSTLVWSEGYSDWQPLSSVPG 65 Query: 2370 LMTEVSQQ-------VPTNKDDEFEKWQKEVREAEAEALKHEAVNSNDDNERPSXXXXXX 2212 L+T+ Q V +N++DEFEKWQ+EVREAEAEA VN NDD +RP+ Sbjct: 
66 LLTDAPPQNALGSVPVTSNEEDEFEKWQREVREAEAEA----EVNKNDDQDRPTTPPEGE 121 Query: 2211 XXXXXXXXXTYKWDRGLRAWVPQDNISTETEQYGLEEMTFLEEQELFSTVNAVDTSVKED 2032 TYKWDR LRAWVPQ+N + TE Y E+MTF++E+E+F T++A VKE+ Sbjct: 122 EEFTDDDGTTYKWDRTLRAWVPQENTTQNTEDYHPEDMTFVQEEEVFPTLDADHLPVKEE 181 Query: 2031 VIDTSEVMEGEAKHNDKRKLPDKLAEKK-----EANKPPDSWFELKVNTHIYVTGLPDDV 1867 +EV+E K N KRKLP+K ++KK EANKPPD+WFELKVNTH+YVTGLPDDV Sbjct: 182 DSAANEVVE--EKQNGKRKLPEKTSDKKNVDKKEANKPPDAWFELKVNTHVYVTGLPDDV 239 Query: 1866 TFDEVVEAFSKCGIIKEDPETKKPRVKIYVDKETGRKKGDALISYMKEPSVALAIQILDG 1687 T +EVVE FSKCGIIKEDPETKKPRVKIYVDKETGRKKGDAL+SY+KEPSVALAIQILDG Sbjct: 240 TTEEVVEVFSKCGIIKEDPETKKPRVKIYVDKETGRKKGDALVSYLKEPSVALAIQILDG 299 Query: 1686 APLRPGGKIPMSVTQAKFEQKGDKFISKQIDKRKKRKLQKVEQKMLGWGGRDDAKLLIPA 1507 APLRP GKIPM+VT+AKFEQKGD+FISKQ+DK KKRKLQKVEQKMLGWGGRDDAK+ IPA Sbjct: 300 APLRPDGKIPMTVTKAKFEQKGDRFISKQVDKNKKRKLQKVEQKMLGWGGRDDAKVSIPA 359 Query: 1506 TVILRYMFAPAEMRSDENLRSELEADVQEECVKLGPVESVKVCENHPQGVVLVRFKDRKD 1327 TVILRYMF PAE+R++E+LRSELE DV++EC KLGP++SVKVCENHPQGV+LV+FKD KD Sbjct: 360 TVILRYMFTPAELRAEEDLRSELEEDVRDECGKLGPLDSVKVCENHPQGVILVKFKDSKD 419 Query: 1326 AQKCIDLMNGRWFGGRQIHASEDDGAVNHAKVRDLQDDAERLEK 1195 A KCI+LMNGRWFGG+QIHAS DDG+VNHA VRDL+++ +RLEK Sbjct: 420 AHKCIELMNGRWFGGKQIHASIDDGSVNHALVRDLEEETDRLEK 463 >ref|XP_011077114.1| PREDICTED: splicing factor U2AF-associated protein 2 isoform X3 [Sesamum indicum] Length = 462 Score = 643 bits (1659), Expect = 0.0 Identities = 318/458 (69%), Positives = 375/458 (81%), Gaps = 12/458 (2%) Frame = -2 Query: 2532 TEVGWYVLGGDQQHIGPYAFSELHEHFTNGFLSESTLVWSEGRSDWQPLSSIFGLMTEVS 2353 T GWY+LG DQQ IGPY SEL EH+++G+ S+STLVWSEG SDWQPLSS+ GL+T+ Sbjct: 3 TGAGWYILGQDQQLIGPYTVSELQEHYSSGYFSQSTLVWSEGYSDWQPLSSVPGLLTDAP 62 Query: 2352 QQ-------VPTNKDDEFEKWQKEVREAEAEALKHEAVNSNDDNERPSXXXXXXXXXXXX 2194 Q V +N++DEFEKWQ+EVREAEAEA VN NDD +RP+ Sbjct: 63 PQNALGSVPVTSNEEDEFEKWQREVREAEAEA----EVNKNDDQDRPTTPPEGEEEFTDD 118 Query: 2193 XXXTYKWDRGLRAWVPQDNISTETEQYGLEEMTFLEEQELFSTVNAVDTSVKEDVIDTSE 
2014 TYKWDR LRAWVPQ+N + TE Y E+MTF++E+E+F T++A VKE+ +E Sbjct: 119 DGTTYKWDRTLRAWVPQENTTQNTEDYHPEDMTFVQEEEVFPTLDADHLPVKEEDSAANE 178 Query: 2013 VMEGEAKHNDKRKLPDKLAEKK-----EANKPPDSWFELKVNTHIYVTGLPDDVTFDEVV 1849 V+E K N KRKLP+K ++KK EANKPPD+WFELKVNTH+YVTGLPDDVT +EVV Sbjct: 179 VVE--EKQNGKRKLPEKTSDKKNVDKKEANKPPDAWFELKVNTHVYVTGLPDDVTTEEVV 236 Query: 1848 EAFSKCGIIKEDPETKKPRVKIYVDKETGRKKGDALISYMKEPSVALAIQILDGAPLRPG 1669 E FSKCGIIKEDPETKKPRVKIYVDKETGRKKGDAL+SY+KEPSVALAIQILDGAPLRP Sbjct: 237 EVFSKCGIIKEDPETKKPRVKIYVDKETGRKKGDALVSYLKEPSVALAIQILDGAPLRPD 296 Query: 1668 GKIPMSVTQAKFEQKGDKFISKQIDKRKKRKLQKVEQKMLGWGGRDDAKLLIPATVILRY 1489 GKIPM+VT+AKFEQKGD+FISKQ+DK KKRKLQKVEQKMLGWGGRDDAK+ IPATVILRY Sbjct: 297 GKIPMTVTKAKFEQKGDRFISKQVDKNKKRKLQKVEQKMLGWGGRDDAKVSIPATVILRY 356 Query: 1488 MFAPAEMRSDENLRSELEADVQEECVKLGPVESVKVCENHPQGVVLVRFKDRKDAQKCID 1309 MF PAE+R++E+LRSELE DV++EC KLGP++SVKVCENHPQGV+LV+FKD KDA KCI+ Sbjct: 357 MFTPAELRAEEDLRSELEEDVRDECGKLGPLDSVKVCENHPQGVILVKFKDSKDAHKCIE 416 Query: 1308 LMNGRWFGGRQIHASEDDGAVNHAKVRDLQDDAERLEK 1195 LMNGRWFGG+QIHAS DDG+VNHA VRDL+++ +RLEK Sbjct: 417 LMNGRWFGGKQIHASIDDGSVNHALVRDLEEETDRLEK 454 >ref|XP_014510975.1| PREDICTED: splicing factor U2AF-associated protein 2 [Vigna radiata var. 
radiata] Length = 506 Score = 640 bits (1652), Expect = e-180 Identities = 313/480 (65%), Positives = 373/480 (77%), Gaps = 31/480 (6%) Frame = -2 Query: 2541 ETSTEVGWYVLGGDQQHIGPYAFSELHEHFTNGFLSESTLVWSEGRSDWQPLSSIFGLMT 2362 E TEVGWYVLG DQQ +GPYAFSEL EHF NG+LSE+T VWSEGRS+WQPLSS+ L T Sbjct: 19 EKVTEVGWYVLGEDQQQVGPYAFSELREHFLNGYLSENTFVWSEGRSEWQPLSSVSDLWT 78 Query: 2361 EVSQQ-------VPTNKDDEFEKWQKEVREAEAEALKHE---------AVNSNDDNERPS 2230 +++QQ V + DEFE+W+KE++EAEA+ + + +D+ERPS Sbjct: 79 QINQQGLDSSTAVSAHDVDEFERWEKEIKEAEAQVEGSDFGSFSGNVGGTAAGEDSERPS 138 Query: 2229 XXXXXXXXXXXXXXXTYKWDRGLRAWVPQDNISTETEQYGLEEMTFLEEQELFSTVNAVD 2050 YKWDR LRAWVPQ+ + TE YG+E+MTFL+E+E+F T+ D Sbjct: 139 TPPEGEEEFTDDDGTVYKWDRNLRAWVPQEYPTGSTEPYGVEDMTFLQEEEVFPTITNSD 198 Query: 2049 TS---------------VKEDVIDTSEVMEGEAKHNDKRKLPDKLAEKKEANKPPDSWFE 1915 S +KE+ +T+E KRKL D+ +KKEANKPPDSWFE Sbjct: 199 ASEKIEDSSELVISDPSLKEEANETNETNNASVVAGGKRKLSDQQTDKKEANKPPDSWFE 258 Query: 1914 LKVNTHIYVTGLPDDVTFDEVVEAFSKCGIIKEDPETKKPRVKIYVDKETGRKKGDALIS 1735 LK+NTH+YV GLP+DVT DE+VE FSKCGIIKEDPETK+PRVK+YVDKETGRKKGDAL++ Sbjct: 259 LKINTHVYVNGLPEDVTTDEIVEVFSKCGIIKEDPETKRPRVKLYVDKETGRKKGDALVT 318 Query: 1734 YMKEPSVALAIQILDGAPLRPGGKIPMSVTQAKFEQKGDKFISKQIDKRKKRKLQKVEQK 1555 Y+KEPSVALAIQILDGAP RPGGKIPMSV+QAKFEQKGDKF+SKQ+D +KK+KL++VE K Sbjct: 319 YLKEPSVALAIQILDGAPFRPGGKIPMSVSQAKFEQKGDKFVSKQVDNKKKKKLKRVEDK 378 Query: 1554 MLGWGGRDDAKLLIPATVILRYMFAPAEMRSDENLRSELEADVQEECVKLGPVESVKVCE 1375 MLGWGGRDDAK+ IPATVILR+MF+PAEMR+DENLR ELE DV+EEC KLGPV+SVK+CE Sbjct: 379 MLGWGGRDDAKVSIPATVILRFMFSPAEMRADENLRLELEEDVKEECTKLGPVDSVKICE 438 Query: 1374 NHPQGVVLVRFKDRKDAQKCIDLMNGRWFGGRQIHASEDDGAVNHAKVRDLQDDAERLEK 1195 NHPQGVVLV+FKDRKDAQKCI+LMNGRWFGGRQIHASEDDG+VNHA VRDLQ+DA RLE+ Sbjct: 439 NHPQGVVLVKFKDRKDAQKCIELMNGRWFGGRQIHASEDDGSVNHALVRDLQEDAIRLEQ 498 >gb|KOM29717.1| hypothetical protein LR48_Vigan747s001900 [Vigna angularis] Length = 506 Score = 637 bits (1644), Expect = e-179 Identities = 311/480 (64%), Positives = 374/480 (77%), Gaps = 31/480 (6%) 
Frame = -2 Query: 2541 ETSTEVGWYVLGGDQQHIGPYAFSELHEHFTNGFLSESTLVWSEGRSDWQPLSSIFGLMT 2362 E TEVGWYVLG DQQ +GPYAFSEL EHF NG+LSE+T VWSEGRS+WQPLSS+ L T Sbjct: 19 EKVTEVGWYVLGEDQQQVGPYAFSELREHFLNGYLSENTFVWSEGRSEWQPLSSVSDLWT 78 Query: 2361 EVSQQ-------VPTNKDDEFEKWQKEVREAEAEALKHE---------AVNSNDDNERPS 2230 +++QQ V + DEFE+W+KE++EAEA+ + + + +D+ERPS Sbjct: 79 QINQQGSDFSTAVSAHDVDEFERWEKEIKEAEAQVEGSDFGSFSGNVGSTAAGEDSERPS 138 Query: 2229 XXXXXXXXXXXXXXXTYKWDRGLRAWVPQDNISTETEQYGLEEMTFLEEQELFSTVN--- 2059 YKWDR LRAWVPQ+ + TE YG+E+MTFL+E+E+F T+ Sbjct: 139 TPPEGEEEFTDDDGTVYKWDRNLRAWVPQEYPTGSTEPYGVEDMTFLQEEEVFPTITNSD 198 Query: 2058 ------------AVDTSVKEDVIDTSEVMEGEAKHNDKRKLPDKLAEKKEANKPPDSWFE 1915 D S+KE+ +T+E KRKL D+ +KKEANKPPDSWFE Sbjct: 199 ASEKIEDSSELVVSDPSLKEEPNETNETNNASVVAGGKRKLSDQQTDKKEANKPPDSWFE 258 Query: 1914 LKVNTHIYVTGLPDDVTFDEVVEAFSKCGIIKEDPETKKPRVKIYVDKETGRKKGDALIS 1735 LK+NTH+YV GLP+DVT DE+VE FSKCGIIKEDPETK+PRVK+YVDKETGRKKGDAL++ Sbjct: 259 LKINTHVYVNGLPEDVTTDEIVEVFSKCGIIKEDPETKRPRVKLYVDKETGRKKGDALVT 318 Query: 1734 YMKEPSVALAIQILDGAPLRPGGKIPMSVTQAKFEQKGDKFISKQIDKRKKRKLQKVEQK 1555 Y+KEPSVALAIQILDGAP RPGGKIPMSV+QAKFEQKGDKF+S+Q+D +KK+KL++VE+K Sbjct: 319 YLKEPSVALAIQILDGAPFRPGGKIPMSVSQAKFEQKGDKFVSRQVDNKKKKKLKRVEEK 378 Query: 1554 MLGWGGRDDAKLLIPATVILRYMFAPAEMRSDENLRSELEADVQEECVKLGPVESVKVCE 1375 MLGWGGRDDAK+ IPATVILR+MF+PAEMR+DENLR ELE DV+EEC KLGPV+SVK+CE Sbjct: 379 MLGWGGRDDAKVSIPATVILRFMFSPAEMRADENLRLELEEDVKEECTKLGPVDSVKICE 438 Query: 1374 NHPQGVVLVRFKDRKDAQKCIDLMNGRWFGGRQIHASEDDGAVNHAKVRDLQDDAERLEK 1195 NHPQGVVLV+FKDRKDAQKCI+LMNGRWFGGR IHASEDDG+VNHA VRDLQ+DA RLE+ Sbjct: 439 NHPQGVVLVKFKDRKDAQKCIELMNGRWFGGRLIHASEDDGSVNHALVRDLQEDAIRLEQ 498 >ref|XP_002316170.1| hypothetical protein POPTR_0010s18610g [Populus trichocarpa] gi|222865210|gb|EEF02341.1| hypothetical protein POPTR_0010s18610g [Populus trichocarpa] Length = 497 Score = 635 bits (1639), Expect = e-179 Identities = 323/484 (66%), Positives = 379/484 (78%), Gaps = 23/484 (4%) Frame = -2 Query: 2577 
CITAENGFDQNSETSTEVGWYVLGGDQQHIGPYAFSELHEHFTNGFLSESTLVWSEGRSD 2398 C A NG+D N T EVGWY+LG DQQ +GPY FSEL EHF NG+L ESTLVWSEGRSD Sbjct: 8 CTGAGNGYDGNYNTVAEVGWYILGEDQQQVGPYVFSELREHFLNGYLLESTLVWSEGRSD 67 Query: 2397 WQPLSSIFGLMTEVSQQ-------VPTNKD-DEFEKWQKEVREAEAEA--LKHEAVNSN- 2251 WQPLSSI LM+ SQQ V +N D DEFEKWQ+EV+EAEAEA LK+ ++ N Sbjct: 68 WQPLSSIPELMSGTSQQGSDYSRAVSSNDDEDEFEKWQREVKEAEAEAERLKNGSLPGNT 127 Query: 2250 ------DDNERPSXXXXXXXXXXXXXXXTYKWDRGLRAWVPQDNISTETEQYGLEEMTFL 2089 DD++R TYKWDR LRAWVPQDN+S+ + QYG+E+MTF Sbjct: 128 GDDFGIDDSDRILSPPDGEDEFTDDDGTTYKWDRSLRAWVPQDNLSSVSGQYGVEQMTFH 187 Query: 2088 EEQELFSTVNAVDTSVKEDVIDTSEVMEGEAKHNDKRKLPD------KLAEKKEANKPPD 1927 E++E+F VNA D S+K++ T EV+E + +DKRKL D K A+KKEANK PD Sbjct: 188 EQEEVFLNVNAADASLKDEANGTGEVVESQ--RSDKRKLQDEQADKDKQADKKEANKAPD 245 Query: 1926 SWFELKVNTHIYVTGLPDDVTFDEVVEAFSKCGIIKEDPETKKPRVKIYVDKETGRKKGD 1747 SWFELKVNTH+YVTGLPDDVT +EVVE FSKCG+IKEDPE KKPRVKIYVDKETGR KGD Sbjct: 246 SWFELKVNTHVYVTGLPDDVTAEEVVEVFSKCGVIKEDPEKKKPRVKIYVDKETGRIKGD 305 Query: 1746 ALISYMKEPSVALAIQILDGAPLRPGGKIPMSVTQAKFEQKGDKFISKQIDKRKKRKLQK 1567 AL++Y+KEPSV LA+QILDG PLRPGG IPMSVTQAKFEQKGD+FI+KQ+D +KKRKL+K Sbjct: 306 ALVTYLKEPSVDLAMQILDGTPLRPGGTIPMSVTQAKFEQKGDRFITKQVDSKKKRKLKK 365 Query: 1566 VEQKMLGWGGRDDAKLLIPATVILRYMFAPAEMRSDENLRSELEADVQEECVKLGPVESV 1387 VE ++LGWGGRDDAK+ IPATV+LR MF +EMR+DE+LRSELE DV+EEC KLGPV+SV Sbjct: 366 VEDRILGWGGRDDAKVSIPATVVLRQMFTLSEMRADESLRSELEVDVREECAKLGPVDSV 425 Query: 1386 KVCENHPQGVVLVRFKDRKDAQKCIDLMNGRWFGGRQIHASEDDGAVNHAKVRDLQDDAE 1207 KVCEN+P GVVLV+FKDRKDAQ CI+LMNGRWFGGRQ+ ASEDDG +NHA VRD +DA Sbjct: 426 KVCENNPHGVVLVKFKDRKDAQSCIELMNGRWFGGRQVDASEDDGLINHALVRDHDEDAA 485 Query: 1206 RLEK 1195 RLE+ Sbjct: 486 RLEQ 489 >ref|XP_003556435.1| PREDICTED: HIV Tat-specific factor 1 homolog isoform X1 [Glycine max] gi|734324209|gb|KHN04984.1| HIV Tat-specific factor 1 like [Glycine soja] gi|947042838|gb|KRG92562.1| hypothetical protein GLYMA_20G218800 [Glycine max] Length = 500 Score = 635 bits (1638), Expect = e-179 
Identities = 320/492 (65%), Positives = 378/492 (76%), Gaps = 32/492 (6%) Frame = -2 Query: 2574 ITAENGFDQN-------SETSTEVGWYVLGGDQQHIGPYAFSELHEHFTNGFLSESTLVW 2416 +T++NG + + +E TEVGWYVLG DQQ IGPYAFSEL +HF NG+LSE+T VW Sbjct: 1 MTSQNGDESHHPPPQAQAEKVTEVGWYVLGEDQQQIGPYAFSELCQHFLNGYLSENTFVW 60 Query: 2415 SEGRSDWQPLSSIFGLMTEVSQQVPTNKD-------DEFEKWQKEVREAEAEALKHE--- 2266 SEG S+WQPLSS+ L ++++Q P + DEFE+WQKE++E EA+ E Sbjct: 61 SEGSSEWQPLSSVSDLWAQINRQGPDSSTTVSAPDVDEFERWQKEIQEVEAQVEGSEFGS 120 Query: 2265 ------AVNSNDDNERPSXXXXXXXXXXXXXXXTYKWDRGLRAWVPQDNISTETEQYGLE 2104 + +D+ERPS YKWDR LRAWVPQD + T+ YG+E Sbjct: 121 LSGNVGGTGAGEDSERPSTPPEGEEGFTDDDGTVYKWDRSLRAWVPQDYPTGSTKPYGVE 180 Query: 2103 EMTFLEEQELFSTVNAVDTSVK-EDV----IDTSEVMEGEAKHN----DKRKLPDKLAEK 1951 EMTFLEE+E+F T+ D S K ED + + E E N KR L D+ +K Sbjct: 181 EMTFLEEEEVFPTIPNSDASEKFEDSPKLSVSVPPLKEEENNTNVISGGKRMLSDQQTDK 240 Query: 1950 KEANKPPDSWFELKVNTHIYVTGLPDDVTFDEVVEAFSKCGIIKEDPETKKPRVKIYVDK 1771 KEANKPPDSWFELK+NTH+YVTGLP+DVT DE+VE FSKCGIIKEDPETK+PRVK+YVDK Sbjct: 241 KEANKPPDSWFELKINTHVYVTGLPEDVTTDEIVEVFSKCGIIKEDPETKRPRVKLYVDK 300 Query: 1770 ETGRKKGDALISYMKEPSVALAIQILDGAPLRPGGKIPMSVTQAKFEQKGDKFISKQIDK 1591 ETGRKKGDAL++Y+KEPSVALAIQILDGAPLRPGGKIPMSV+QAKFEQKGDKF+SKQ+D Sbjct: 301 ETGRKKGDALVTYLKEPSVALAIQILDGAPLRPGGKIPMSVSQAKFEQKGDKFVSKQVDG 360 Query: 1590 RKKRKLQKVEQKMLGWGGRDDAKLLIPATVILRYMFAPAEMRSDENLRSELEADVQEECV 1411 +KK+KL+KVE KMLGWGGRDDAK+ IPATVILRYMFAPAEMR+DENL ELE DV+EEC Sbjct: 361 KKKKKLKKVEDKMLGWGGRDDAKVSIPATVILRYMFAPAEMRADENLHLELEEDVKEECT 420 Query: 1410 KLGPVESVKVCENHPQGVVLVRFKDRKDAQKCIDLMNGRWFGGRQIHASEDDGAVNHAKV 1231 KLGPV+SVK+CENHPQGVVLVRFKDRKDAQKCI+LMNGRWFGGRQIHASEDDG+VNHA V Sbjct: 421 KLGPVDSVKICENHPQGVVLVRFKDRKDAQKCIELMNGRWFGGRQIHASEDDGSVNHALV 480 Query: 1230 RDLQDDAERLEK 1195 RDL++D RLE+ Sbjct: 481 RDLEEDVIRLEQ 492 >ref|XP_007143970.1| hypothetical protein PHAVU_007G117900g [Phaseolus vulgaris] gi|561017160|gb|ESW15964.1| hypothetical protein PHAVU_007G117900g [Phaseolus vulgaris] Length = 509 Score = 625 
bits (1611), Expect = e-176 Identities = 306/483 (63%), Positives = 374/483 (77%), Gaps = 34/483 (7%) Frame = -2 Query: 2541 ETSTEVGWYVLGGDQQHIGPYAFSELHEHFTNGFLSESTLVWSEGRSDWQPLSSIFGLMT 2362 E TEVGWYVLG DQQ +GPYAFSEL EHF NG+LSE+T VWSEGRS+WQPLSS+ L T Sbjct: 19 EKVTEVGWYVLGEDQQQVGPYAFSELREHFLNGYLSENTFVWSEGRSEWQPLSSVSDLWT 78 Query: 2361 EVSQQ-------VPTNKDDEFEKWQKEVREAEAEALKHE---------AVNSNDDNERPS 2230 ++++Q V + DEFE+W+KE++EAEA+ + + +D+ERPS Sbjct: 79 QINRQGLDSSAAVSAHDVDEFERWEKEIQEAEAQVEGSDFGSFAGNVGGTAAGEDSERPS 138 Query: 2229 XXXXXXXXXXXXXXXTYKWDRGLRAWVPQDNISTETEQYGLEEMTFLEEQELFSTVN--- 2059 YKWDR LRAWVPQ+ + TE Y +E+MTFL+E+E+F T+ Sbjct: 139 TPPEGEEEFTDDDGTVYKWDRNLRAWVPQEYPTGSTEPYRVEDMTFLQEEEVFPTITNSD 198 Query: 2058 ------------AVDTSVKEDVID---TSEVMEGEAKHNDKRKLPDKLAEKKEANKPPDS 1924 D S+KE+V + T+E + KRKL D+ +KKEANKPPDS Sbjct: 199 ASEKFEDSSKLGVSDPSLKEEVNNANKTNEANDISVVAGGKRKLSDQQTDKKEANKPPDS 258 Query: 1923 WFELKVNTHIYVTGLPDDVTFDEVVEAFSKCGIIKEDPETKKPRVKIYVDKETGRKKGDA 1744 WFELK+NTH+YV GLP+DVT DE+VE FSKCGIIKEDPETK+PRVK+YVDKETG+ KGDA Sbjct: 259 WFELKINTHVYVNGLPEDVTTDEIVEVFSKCGIIKEDPETKRPRVKLYVDKETGKNKGDA 318 Query: 1743 LISYMKEPSVALAIQILDGAPLRPGGKIPMSVTQAKFEQKGDKFISKQIDKRKKRKLQKV 1564 L++Y+KEPSVALAIQILDGAP RPGGKIPMSV+QAKF+QKGD+F+SKQ+D +KK+KL++V Sbjct: 319 LVTYLKEPSVALAIQILDGAPFRPGGKIPMSVSQAKFQQKGDRFVSKQVDNKKKKKLKRV 378 Query: 1563 EQKMLGWGGRDDAKLLIPATVILRYMFAPAEMRSDENLRSELEADVQEECVKLGPVESVK 1384 E+KMLGWGGRDDAK+ IPAT+ILR+MF+PAEMR+DENLR ELE DV+EEC KLGPV+SVK Sbjct: 379 EEKMLGWGGRDDAKVSIPATMILRFMFSPAEMRADENLRLELEEDVKEECTKLGPVDSVK 438 Query: 1383 VCENHPQGVVLVRFKDRKDAQKCIDLMNGRWFGGRQIHASEDDGAVNHAKVRDLQDDAER 1204 +CENHPQGVVLV+FKDRKDAQKCI+LMNGRWFGGRQ+HASEDDG+VNHA VRDLQ+DA R Sbjct: 439 ICENHPQGVVLVKFKDRKDAQKCIELMNGRWFGGRQVHASEDDGSVNHALVRDLQEDAIR 498 Query: 1203 LEK 1195 LE+ Sbjct: 499 LEQ 501 >gb|KCW84610.1| hypothetical protein EUGRSUZ_B01442 [Eucalyptus grandis] Length = 472 Score = 623 bits (1606), Expect = e-175 Identities = 306/449 (68%), Positives = 366/449 (81%), Gaps = 19/449 
(4%) Frame = -2 Query: 2535 STEVGWYVLGGDQQHIGPYAFSELHEHFTNGFLSESTLVWSEGRSDWQPLSSIFGLMTEV 2356 ++E GWY+LG +QQ++GPYA +EL EH NG+LSESTLVW+EGR+DWQPLSS+ LM + Sbjct: 2 ASEAGWYILGDNQQNVGPYAAAELLEHLKNGYLSESTLVWAEGRADWQPLSSVPELMLPL 61 Query: 2355 SQ-----QVPT--NKDDEFEKWQKEVREAEAEALKHEAVNSNDDNERPSXXXXXXXXXXX 2197 S Q P N ++EFEKWQ+EVRE+EA L + + ++ DD RPS Sbjct: 62 SDNGDGSQNPAVLNSEEEFEKWQREVRESEAVGLNNGSQSAEDDLIRPSTPPEGEEEFVD 121 Query: 2196 XXXXTYKWDRGLRAWVPQDNISTETEQYGLEEMTFLEEQELFSTVNAVDTSVKEDVIDTS 2017 YKWDRGLRAW PQDNIS +++YGLEEMTFLEE+E+F + N D KE+V + + Sbjct: 122 DDGTRYKWDRGLRAWAPQDNISANSDRYGLEEMTFLEEEEVFPSGNW-DEPTKEEVNEPA 180 Query: 2016 EVMEG-------EAKHNDKRKLPDKLAEKKEA-----NKPPDSWFELKVNTHIYVTGLPD 1873 ++ E EAK N KRK P+K A +KEA NKPPDSWF+LKVNTH+YVTGLP+ Sbjct: 181 DIAEAKTVSDSEEAKPNAKRKQPEKEASEKEASKKEPNKPPDSWFDLKVNTHVYVTGLPE 240 Query: 1872 DVTFDEVVEAFSKCGIIKEDPETKKPRVKIYVDKETGRKKGDALISYMKEPSVALAIQIL 1693 DVT +EVVE FSKCGI+KEDPETKKPRVKIYVDKETGRKKGDAL++Y+KEPSVALAIQIL Sbjct: 241 DVTMEEVVEVFSKCGILKEDPETKKPRVKIYVDKETGRKKGDALVTYLKEPSVALAIQIL 300 Query: 1692 DGAPLRPGGKIPMSVTQAKFEQKGDKFISKQIDKRKKRKLQKVEQKMLGWGGRDDAKLLI 1513 DGAP RPGGK+PMSV+QAKFEQKGDKFISKQ+D +KK+KL+KVE+KMLGWGGRDDAK+L+ Sbjct: 301 DGAPFRPGGKVPMSVSQAKFEQKGDKFISKQVDGKKKKKLKKVEEKMLGWGGRDDAKVLV 360 Query: 1512 PATVILRYMFAPAEMRSDENLRSELEADVQEECVKLGPVESVKVCENHPQGVVLVRFKDR 1333 P TV+LRYMFAPAEMR+D+NLR ELE D++EECVKLGPV+SVKVCENHPQGVVLV+FKDR Sbjct: 361 PTTVVLRYMFAPAEMRADDNLRPELEEDIREECVKLGPVDSVKVCENHPQGVVLVKFKDR 420 Query: 1332 KDAQKCIDLMNGRWFGGRQIHASEDDGAV 1246 KDAQKCI+LMNGRWFGGRQIHASEDDG++ Sbjct: 421 KDAQKCIELMNGRWFGGRQIHASEDDGSL 449 >emb|CDP04154.1| unnamed protein product [Coffea canephora] Length = 477 Score = 621 bits (1601), Expect = e-174 Identities = 304/465 (65%), Positives = 376/465 (80%), Gaps = 9/465 (1%) Frame = -2 Query: 2562 NGFDQNSETSTEVGWYVLGGDQQHIGPYAFSELHEHFTNGFLSESTLVWSEGRSDWQPLS 2383 NG + ++T+V W+VLG DQQ IGPY+ SEL EH+++G+LS++TLVW +G ++WQP+S Sbjct: 11 NGLTSLTTSATDVAWFVLGPDQQPIGPYSSSELREHYSSGYLSDATLVWFQGATNWQPVS 
70 Query: 2382 SIFGLMTEVSQQ-------VP--TNKDDEFEKWQKEVREAEAEALKHEAVNSNDDNERPS 2230 S+ GL+T++ Q VP +N++DEFEKWQ+EVREAEAEA + + + E+PS Sbjct: 71 SVPGLLTDLPVQNAQIQLAVPKTSNEEDEFEKWQREVREAEAEAERAVTI----EPEKPS 126 Query: 2229 XXXXXXXXXXXXXXXTYKWDRGLRAWVPQDNISTETEQYGLEEMTFLEEQELFSTVNAVD 2050 YKWDR LRAWVPQ++ S T YG+++M F++E+E+F T+ A D Sbjct: 127 TPPEGEEEFTDDDGTLYKWDRTLRAWVPQEDNSENTANYGVDDMIFVKEEEVFPTIKADD 186 Query: 2049 TSVKEDVIDTSEVMEGEAKHNDKRKLPDKLAEKKEANKPPDSWFELKVNTHIYVTGLPDD 1870 V+E++ TS+ +E A N KRKLP+K AEKKEANKPPDSWFELKVNTH+YVTGLPDD Sbjct: 187 FPVEEEIKGTSDTVE--ANPNGKRKLPEKTAEKKEANKPPDSWFELKVNTHVYVTGLPDD 244 Query: 1869 VTFDEVVEAFSKCGIIKEDPETKKPRVKIYVDKETGRKKGDALISYMKEPSVALAIQILD 1690 VT DEVVE FSKCGIIKEDPE KKPRVKIYVDKE+GR+KGDAL++++KEPSV LAIQILD Sbjct: 245 VTVDEVVEVFSKCGIIKEDPEMKKPRVKIYVDKESGRQKGDALVTFLKEPSVDLAIQILD 304 Query: 1689 GAPLRPGGKIPMSVTQAKFEQKGDKFISKQIDKRKKRKLQKVEQKMLGWGGRDDAKLLIP 1510 G P R GGKIPMSVT+AKFEQKG+ F+ K++DKRKK+KLQ +E+KMLGWGG DDAKLLIP Sbjct: 305 GTPFRAGGKIPMSVTKAKFEQKGETFLPKKVDKRKKKKLQHLERKMLGWGGLDDAKLLIP 364 Query: 1509 ATVILRYMFAPAEMRSDENLRSELEADVQEECVKLGPVESVKVCENHPQGVVLVRFKDRK 1330 ATVILRYMF P E+R+DENLRSELE DV++EC KLGP+ESVKVCENHPQGV+LV+FKDRK Sbjct: 365 ATVILRYMFTPDEIRADENLRSELEEDVRDECTKLGPLESVKVCENHPQGVILVKFKDRK 424 Query: 1329 DAQKCIDLMNGRWFGGRQIHASEDDGAVNHAKVRDLQDDAERLEK 1195 DA KCI+LMNGRWFG RQIHASEDDG+VNHA VRDL+ +A+RLE+ Sbjct: 425 DALKCIELMNGRWFGKRQIHASEDDGSVNHALVRDLEAEADRLEQ 469 >ref|XP_012470592.1| PREDICTED: HIV Tat-specific factor 1 isoform X2 [Gossypium raimondii] gi|763751794|gb|KJB19182.1| hypothetical protein B456_003G087900 [Gossypium raimondii] Length = 518 Score = 620 bits (1600), Expect = e-174 Identities = 320/514 (62%), Positives = 379/514 (73%), Gaps = 44/514 (8%) Frame = -2 Query: 2604 NHPFLQSEACITAENGFDQNSETSTEVGWYVLGGDQQHIGPYAFSELHEHFTNGFLSEST 2425 NHP +E C+ A G VGWY+LG DQQ++GPYA SEL EHF NG+L+EST Sbjct: 9 NHPQSGTENCLNAVAG----------VGWYILGEDQQNVGPYAISELREHFLNGYLTEST 58 Query: 2424 
LVWSEGRSDWQPLSSIFGLMTEVSQQ-------------------------VPTNK---D 2329 L WSEGRS WQPLSSI ++ +S Q VP+N Sbjct: 59 LAWSEGRSQWQPLSSIPEFVSVISHQANNFSATGDDDAFLNSMKEGDNSNAVPSNDGDGS 118 Query: 2328 DEFEKWQKEVREAEAEA--LKHEAVNSN-------DDNERPSXXXXXXXXXXXXXXXTYK 2176 DEFEKWQ+E+REAEAE LK +V+ + DD +RP YK Sbjct: 119 DEFEKWQREIREAEAETERLKTGSVSRSTGDAFGFDDQDRPLTPPEGEEEFTDDDGTRYK 178 Query: 2175 WDRGLRAWVPQDNISTETEQYGLEEMTFLEEQELFSTVNAVDTS-------VKEDVIDTS 2017 WDR LRAWVPQD++ST+ YG+EEMTFLEE E+F T++A+D S V+E+V Sbjct: 179 WDRNLRAWVPQDDMSTKNGNYGVEEMTFLEEDEVFPTISAIDASAAVADASVRENVNGGG 238 Query: 2016 EVMEGEAKHNDKRKLPDKLAEKKEANKPPDSWFELKVNTHIYVTGLPDDVTFDEVVEAFS 1837 E + E N KRKL +K +KKEANKPPDSWF+LKVNTH+YVTGLPDDVT +E+VE FS Sbjct: 239 E--QTEVNCNAKRKLLEKPVDKKEANKPPDSWFQLKVNTHVYVTGLPDDVTAEELVEVFS 296 Query: 1836 KCGIIKEDPETKKPRVKIYVDKETGRKKGDALISYMKEPSVALAIQILDGAPLRPGGKIP 1657 KCGIIKEDPETK+PRVKIYVDKETGRKKGDAL++Y+KEPSVALA+QILDG P RP GKIP Sbjct: 297 KCGIIKEDPETKRPRVKIYVDKETGRKKGDALVTYLKEPSVALAVQILDGTPFRPDGKIP 356 Query: 1656 MSVTQAKFEQKGDKFISKQIDKRKKRKLQKVEQKMLGWGGRDDAKLLIPATVILRYMFAP 1477 MSV+QAKFEQKGDKFI+KQ+D RKK+KL+KVE++ML WGGRDDAK+ IPATV+LR MF P Sbjct: 357 MSVSQAKFEQKGDKFIAKQVDSRKKKKLKKVEERMLSWGGRDDAKVTIPATVVLRNMFTP 416 Query: 1476 AEMRSDENLRSELEADVQEECVKLGPVESVKVCENHPQGVVLVRFKDRKDAQKCIDLMNG 1297 AEMR+DENL SELE DV+EEC+KLG ++SVKVC N+PQGVVLV++KDRKDAQKCI+LMNG Sbjct: 417 AEMRADENLCSELEEDVKEECLKLGLLDSVKVCSNNPQGVVLVKYKDRKDAQKCIELMNG 476 Query: 1296 RWFGGRQIHASEDDGAVNHAKVRDLQDDAERLEK 1195 RWFGGRQIHASEDDG VNHA VRDL +DA RLE+ Sbjct: 477 RWFGGRQIHASEDDGVVNHALVRDLDEDASRLEQ 510 >ref|XP_012470590.1| PREDICTED: HIV Tat-specific factor 1 isoform X1 [Gossypium raimondii] gi|823141545|ref|XP_012470591.1| PREDICTED: HIV Tat-specific factor 1 isoform X1 [Gossypium raimondii] Length = 521 Score = 619 bits (1597), Expect = e-174 Identities = 320/517 (61%), Positives = 379/517 (73%), Gaps = 47/517 (9%) Frame = -2 Query: 2604 NHPFLQSEACITAENGFDQNSETSTEVGWYVLGGDQQHIGPYAFSELHEHFTNGFLSEST 2425 NHP +E C+ A G VGWY+LG DQQ++GPYA 
SEL EHF NG+L+EST Sbjct: 9 NHPQSGTENCLNAVAG----------VGWYILGEDQQNVGPYAISELREHFLNGYLTEST 58 Query: 2424 LVWSEGRSDWQPLSSIFGLMTEVSQQ----------------------------VPTNK- 2332 L WSEGRS WQPLSSI ++ +S Q VP+N Sbjct: 59 LAWSEGRSQWQPLSSIPEFVSVISHQANNFSATVSLGDDDAFLNSMKEGDNSNAVPSNDG 118 Query: 2331 --DDEFEKWQKEVREAEAEA--LKHEAVNSN-------DDNERPSXXXXXXXXXXXXXXX 2185 DEFEKWQ+E+REAEAE LK +V+ + DD +RP Sbjct: 119 DGSDEFEKWQREIREAEAETERLKTGSVSRSTGDAFGFDDQDRPLTPPEGEEEFTDDDGT 178 Query: 2184 TYKWDRGLRAWVPQDNISTETEQYGLEEMTFLEEQELFSTVNAV-------DTSVKEDVI 2026 YKWDR LRAWVPQD++ST+ YG+EEMTFLEE E+F T++A+ D SV+E+V Sbjct: 179 RYKWDRNLRAWVPQDDMSTKNGNYGVEEMTFLEEDEVFPTISAIDASAAVADASVRENVN 238 Query: 2025 DTSEVMEGEAKHNDKRKLPDKLAEKKEANKPPDSWFELKVNTHIYVTGLPDDVTFDEVVE 1846 E + E N KRKL +K +KKEANKPPDSWF+LKVNTH+YVTGLPDDVT +E+VE Sbjct: 239 GGGE--QTEVNCNAKRKLLEKPVDKKEANKPPDSWFQLKVNTHVYVTGLPDDVTAEELVE 296 Query: 1845 AFSKCGIIKEDPETKKPRVKIYVDKETGRKKGDALISYMKEPSVALAIQILDGAPLRPGG 1666 FSKCGIIKEDPETK+PRVKIYVDKETGRKKGDAL++Y+KEPSVALA+QILDG P RP G Sbjct: 297 VFSKCGIIKEDPETKRPRVKIYVDKETGRKKGDALVTYLKEPSVALAVQILDGTPFRPDG 356 Query: 1665 KIPMSVTQAKFEQKGDKFISKQIDKRKKRKLQKVEQKMLGWGGRDDAKLLIPATVILRYM 1486 KIPMSV+QAKFEQKGDKFI+KQ+D RKK+KL+KVE++ML WGGRDDAK+ IPATV+LR M Sbjct: 357 KIPMSVSQAKFEQKGDKFIAKQVDSRKKKKLKKVEERMLSWGGRDDAKVTIPATVVLRNM 416 Query: 1485 FAPAEMRSDENLRSELEADVQEECVKLGPVESVKVCENHPQGVVLVRFKDRKDAQKCIDL 1306 F PAEMR+DENL SELE DV+EEC+KLG ++SVKVC N+PQGVVLV++KDRKDAQKCI+L Sbjct: 417 FTPAEMRADENLCSELEEDVKEECLKLGLLDSVKVCSNNPQGVVLVKYKDRKDAQKCIEL 476 Query: 1305 MNGRWFGGRQIHASEDDGAVNHAKVRDLQDDAERLEK 1195 MNGRWFGGRQIHASEDDG VNHA VRDL +DA RLE+ Sbjct: 477 MNGRWFGGRQIHASEDDGVVNHALVRDLDEDASRLEQ 513 >ref|XP_012470595.1| PREDICTED: HIV Tat-specific factor 1 isoform X4 [Gossypium raimondii] gi|763751795|gb|KJB19183.1| hypothetical protein B456_003G087900 [Gossypium raimondii] gi|763751799|gb|KJB19187.1| hypothetical protein B456_003G087900 [Gossypium raimondii] Length = 510 Score = 619 bits (1597), Expect = e-174 Identities = 
315/502 (62%), Positives = 377/502 (75%), Gaps = 44/502 (8%) Frame = -2 Query: 2568 AENGFDQNSETSTEVGWYVLGGDQQHIGPYAFSELHEHFTNGFLSESTLVWSEGRSDWQP 2389 + +G D + ++ VGWY+LG DQQ++GPYA SEL EHF NG+L+ESTL WSEGRS WQP Sbjct: 3 SSDGSDNHPQSVAGVGWYILGEDQQNVGPYAISELREHFLNGYLTESTLAWSEGRSQWQP 62 Query: 2388 LSSIFGLMTEVSQQ-------------------------VPTNK---DDEFEKWQKEVRE 2293 LSSI ++ +S Q VP+N DEFEKWQ+E+RE Sbjct: 63 LSSIPEFVSVISHQANNFSATGDDDAFLNSMKEGDNSNAVPSNDGDGSDEFEKWQREIRE 122 Query: 2292 AEAEA--LKHEAVNSN-------DDNERPSXXXXXXXXXXXXXXXTYKWDRGLRAWVPQD 2140 AEAE LK +V+ + DD +RP YKWDR LRAWVPQD Sbjct: 123 AEAETERLKTGSVSRSTGDAFGFDDQDRPLTPPEGEEEFTDDDGTRYKWDRNLRAWVPQD 182 Query: 2139 NISTETEQYGLEEMTFLEEQELFSTVNAVDTS-------VKEDVIDTSEVMEGEAKHNDK 1981 ++ST+ YG+EEMTFLEE E+F T++A+D S V+E+V E + E N K Sbjct: 183 DMSTKNGNYGVEEMTFLEEDEVFPTISAIDASAAVADASVRENVNGGGE--QTEVNCNAK 240 Query: 1980 RKLPDKLAEKKEANKPPDSWFELKVNTHIYVTGLPDDVTFDEVVEAFSKCGIIKEDPETK 1801 RKL +K +KKEANKPPDSWF+LKVNTH+YVTGLPDDVT +E+VE FSKCGIIKEDPETK Sbjct: 241 RKLLEKPVDKKEANKPPDSWFQLKVNTHVYVTGLPDDVTAEELVEVFSKCGIIKEDPETK 300 Query: 1800 KPRVKIYVDKETGRKKGDALISYMKEPSVALAIQILDGAPLRPGGKIPMSVTQAKFEQKG 1621 +PRVKIYVDKETGRKKGDAL++Y+KEPSVALA+QILDG P RP GKIPMSV+QAKFEQKG Sbjct: 301 RPRVKIYVDKETGRKKGDALVTYLKEPSVALAVQILDGTPFRPDGKIPMSVSQAKFEQKG 360 Query: 1620 DKFISKQIDKRKKRKLQKVEQKMLGWGGRDDAKLLIPATVILRYMFAPAEMRSDENLRSE 1441 DKFI+KQ+D RKK+KL+KVE++ML WGGRDDAK+ IPATV+LR MF PAEMR+DENL SE Sbjct: 361 DKFIAKQVDSRKKKKLKKVEERMLSWGGRDDAKVTIPATVVLRNMFTPAEMRADENLCSE 420 Query: 1440 LEADVQEECVKLGPVESVKVCENHPQGVVLVRFKDRKDAQKCIDLMNGRWFGGRQIHASE 1261 LE DV+EEC+KLG ++SVKVC N+PQGVVLV++KDRKDAQKCI+LMNGRWFGGRQIHASE Sbjct: 421 LEEDVKEECLKLGLLDSVKVCSNNPQGVVLVKYKDRKDAQKCIELMNGRWFGGRQIHASE 480 Query: 1260 DDGAVNHAKVRDLQDDAERLEK 1195 DDG VNHA VRDL +DA RLE+ Sbjct: 481 DDGVVNHALVRDLDEDASRLEQ 502 >ref|XP_012470593.1| PREDICTED: HIV Tat-specific factor 1 isoform X3 [Gossypium raimondii] gi|823141551|ref|XP_012470594.1| PREDICTED: HIV Tat-specific factor 1 isoform X3 [Gossypium 
raimondii] Length = 513 Score = 618 bits (1594), Expect = e-174 Identities = 315/505 (62%), Positives = 377/505 (74%), Gaps = 47/505 (9%) Frame = -2 Query: 2568 AENGFDQNSETSTEVGWYVLGGDQQHIGPYAFSELHEHFTNGFLSESTLVWSEGRSDWQP 2389 + +G D + ++ VGWY+LG DQQ++GPYA SEL EHF NG+L+ESTL WSEGRS WQP Sbjct: 3 SSDGSDNHPQSVAGVGWYILGEDQQNVGPYAISELREHFLNGYLTESTLAWSEGRSQWQP 62 Query: 2388 LSSIFGLMTEVSQQ----------------------------VPTNK---DDEFEKWQKE 2302 LSSI ++ +S Q VP+N DEFEKWQ+E Sbjct: 63 LSSIPEFVSVISHQANNFSATVSLGDDDAFLNSMKEGDNSNAVPSNDGDGSDEFEKWQRE 122 Query: 2301 VREAEAEA--LKHEAVNSN-------DDNERPSXXXXXXXXXXXXXXXTYKWDRGLRAWV 2149 +REAEAE LK +V+ + DD +RP YKWDR LRAWV Sbjct: 123 IREAEAETERLKTGSVSRSTGDAFGFDDQDRPLTPPEGEEEFTDDDGTRYKWDRNLRAWV 182 Query: 2148 PQDNISTETEQYGLEEMTFLEEQELFSTVNAV-------DTSVKEDVIDTSEVMEGEAKH 1990 PQD++ST+ YG+EEMTFLEE E+F T++A+ D SV+E+V E + E Sbjct: 183 PQDDMSTKNGNYGVEEMTFLEEDEVFPTISAIDASAAVADASVRENVNGGGE--QTEVNC 240 Query: 1989 NDKRKLPDKLAEKKEANKPPDSWFELKVNTHIYVTGLPDDVTFDEVVEAFSKCGIIKEDP 1810 N KRKL +K +KKEANKPPDSWF+LKVNTH+YVTGLPDDVT +E+VE FSKCGIIKEDP Sbjct: 241 NAKRKLLEKPVDKKEANKPPDSWFQLKVNTHVYVTGLPDDVTAEELVEVFSKCGIIKEDP 300 Query: 1809 ETKKPRVKIYVDKETGRKKGDALISYMKEPSVALAIQILDGAPLRPGGKIPMSVTQAKFE 1630 ETK+PRVKIYVDKETGRKKGDAL++Y+KEPSVALA+QILDG P RP GKIPMSV+QAKFE Sbjct: 301 ETKRPRVKIYVDKETGRKKGDALVTYLKEPSVALAVQILDGTPFRPDGKIPMSVSQAKFE 360 Query: 1629 QKGDKFISKQIDKRKKRKLQKVEQKMLGWGGRDDAKLLIPATVILRYMFAPAEMRSDENL 1450 QKGDKFI+KQ+D RKK+KL+KVE++ML WGGRDDAK+ IPATV+LR MF PAEMR+DENL Sbjct: 361 QKGDKFIAKQVDSRKKKKLKKVEERMLSWGGRDDAKVTIPATVVLRNMFTPAEMRADENL 420 Query: 1449 RSELEADVQEECVKLGPVESVKVCENHPQGVVLVRFKDRKDAQKCIDLMNGRWFGGRQIH 1270 SELE DV+EEC+KLG ++SVKVC N+PQGVVLV++KDRKDAQKCI+LMNGRWFGGRQIH Sbjct: 421 CSELEEDVKEECLKLGLLDSVKVCSNNPQGVVLVKYKDRKDAQKCIELMNGRWFGGRQIH 480 Query: 1269 ASEDDGAVNHAKVRDLQDDAERLEK 1195 ASEDDG VNHA VRDL +DA RLE+ Sbjct: 481 ASEDDGVVNHALVRDLDEDASRLEQ 505 >ref|XP_012470596.1| PREDICTED: HIV Tat-specific factor 1 isoform X5 [Gossypium raimondii] Length = 
509 Score = 614 bits (1583), Expect = e-172 Identities = 313/497 (62%), Positives = 372/497 (74%), Gaps = 47/497 (9%) Frame = -2 Query: 2544 SETSTEVGWYVLGGDQQHIGPYAFSELHEHFTNGFLSESTLVWSEGRSDWQPLSSIFGLM 2365 S+ + GWY+LG DQQ++GPYA SEL EHF NG+L+ESTL WSEGRS WQPLSSI + Sbjct: 7 SDNHPQSGWYILGEDQQNVGPYAISELREHFLNGYLTESTLAWSEGRSQWQPLSSIPEFV 66 Query: 2364 TEVSQQ----------------------------VPTNK---DDEFEKWQKEVREAEAEA 2278 + +S Q VP+N DEFEKWQ+E+REAEAE Sbjct: 67 SVISHQANNFSATVSLGDDDAFLNSMKEGDNSNAVPSNDGDGSDEFEKWQREIREAEAET 126 Query: 2277 --LKHEAVNSN-------DDNERPSXXXXXXXXXXXXXXXTYKWDRGLRAWVPQDNISTE 2125 LK +V+ + DD +RP YKWDR LRAWVPQD++ST+ Sbjct: 127 ERLKTGSVSRSTGDAFGFDDQDRPLTPPEGEEEFTDDDGTRYKWDRNLRAWVPQDDMSTK 186 Query: 2124 TEQYGLEEMTFLEEQELFSTVNAV-------DTSVKEDVIDTSEVMEGEAKHNDKRKLPD 1966 YG+EEMTFLEE E+F T++A+ D SV+E+V E + E N KRKL + Sbjct: 187 NGNYGVEEMTFLEEDEVFPTISAIDASAAVADASVRENVNGGGE--QTEVNCNAKRKLLE 244 Query: 1965 KLAEKKEANKPPDSWFELKVNTHIYVTGLPDDVTFDEVVEAFSKCGIIKEDPETKKPRVK 1786 K +KKEANKPPDSWF+LKVNTH+YVTGLPDDVT +E+VE FSKCGIIKEDPETK+PRVK Sbjct: 245 KPVDKKEANKPPDSWFQLKVNTHVYVTGLPDDVTAEELVEVFSKCGIIKEDPETKRPRVK 304 Query: 1785 IYVDKETGRKKGDALISYMKEPSVALAIQILDGAPLRPGGKIPMSVTQAKFEQKGDKFIS 1606 IYVDKETGRKKGDAL++Y+KEPSVALA+QILDG P RP GKIPMSV+QAKFEQKGDKFI+ Sbjct: 305 IYVDKETGRKKGDALVTYLKEPSVALAVQILDGTPFRPDGKIPMSVSQAKFEQKGDKFIA 364 Query: 1605 KQIDKRKKRKLQKVEQKMLGWGGRDDAKLLIPATVILRYMFAPAEMRSDENLRSELEADV 1426 KQ+D RKK+KL+KVE++ML WGGRDDAK+ IPATV+LR MF PAEMR+DENL SELE DV Sbjct: 365 KQVDSRKKKKLKKVEERMLSWGGRDDAKVTIPATVVLRNMFTPAEMRADENLCSELEEDV 424 Query: 1425 QEECVKLGPVESVKVCENHPQGVVLVRFKDRKDAQKCIDLMNGRWFGGRQIHASEDDGAV 1246 +EEC+KLG ++SVKVC N+PQGVVLV++KDRKDAQKCI+LMNGRWFGGRQIHASEDDG V Sbjct: 425 KEECLKLGLLDSVKVCSNNPQGVVLVKYKDRKDAQKCIELMNGRWFGGRQIHASEDDGVV 484 Query: 1245 NHAKVRDLQDDAERLEK 1195 NHA VRDL +DA RLE+ Sbjct: 485 NHALVRDLDEDASRLEQ 501