BLASTX nr result
ID: Papaver31_contig00017178
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Papaver31_contig00017178
         (3269 letters)

Database: ./nr
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|XP_011465851.1| PREDICTED: uncharacterized protein LOC105351...  106  1e-19
ref|XP_007026458.1| Uncharacterized protein TCM_021522 [Theobrom...  103  1e-18
ref|XP_013452104.1| DUF4283 domain protein [Medicago truncatula]...  102  2e-18
ref|XP_007022832.1| Uncharacterized protein TCM_026877 [Theobrom...  102  3e-18
ref|XP_007010391.1| Uncharacterized protein TCM_044158 [Theobrom...  100  7e-18
ref|XP_009341595.1| PREDICTED: uncharacterized protein LOC103933...   99  2e-17
ref|XP_007017130.1| Uncharacterized protein TCM_042329 [Theobrom...   99  2e-17
ref|XP_007031319.1| Uncharacterized protein TCM_016772 [Theobrom...   99  3e-17
ref|XP_009355139.1| PREDICTED: uncharacterized protein LOC103946...   99  3e-17
ref|XP_007031313.1| Uncharacterized protein TCM_016763 [Theobrom...   98  4e-17
ref|XP_007040951.1| Uncharacterized protein TCM_016760 [Theobrom...   98  6e-17
ref|XP_007026457.1| Uncharacterized protein TCM_021521 [Theobrom...   97  1e-16
ref|XP_010690660.1| PREDICTED: uncharacterized protein LOC104904...   96  2e-16
ref|XP_008350100.1| PREDICTED: uncharacterized protein LOC103413...   96  2e-16
ref|XP_007017131.1| Uncharacterized protein TCM_033752 [Theobrom...   96  2e-16
ref|XP_007206965.1| hypothetical protein PRUPE_ppb019035mg [Prun...   96  2e-16
ref|XP_011471050.1| PREDICTED: uncharacterized protein LOC105353...   96  3e-16
gb|ABE91952.1| Zinc finger, CCHC-type [Medicago truncatula]           96  3e-16
gb|ABE87590.1| non-LTR retrolelement reverse transcriptase-like ...
95 4e-16 gb|KRH05184.1| hypothetical protein GLYMA_17G212000 [Glycine max] 95 5e-16 >ref|XP_011465851.1| PREDICTED: uncharacterized protein LOC105351930 [Fragaria vesca subsp. vesca] Length = 500 Score = 106 bits (265), Expect = 1e-19 Identities = 65/224 (29%), Positives = 109/224 (48%), Gaps = 5/224 (2%) Frame = -3 Query: 1824 SLPEASTYLDEPALVLPNELLQEGRDIFQFSLIGRL----NFKGLKLADVQKRLEEQWGF 1657 SLP + + + + E Q G + + L+GR+ N K +D+ ++L WG Sbjct: 37 SLPLPEIHDGKTTVTISEEGYQSGLEKCRNMLLGRVHLASNEKPYSPSDLSRKLGLLWG- 95 Query: 1656 GPGSSKLVPMTKGFFIIKLTSAENKERVWHGDTWVIQNQQLRVFNFYPNFDPEKQQTSRA 1477 GS +++PM KG++ S + +VW + ++ LR + PNF P Q+ + A Sbjct: 96 DIGSWRIIPMGKGYYTFNFASEATRSKVWEKGSIALKPGVLRFMQWTPNFSPASQKNTNA 155 Query: 1476 SVWVSFPALYIELWTKKILISIAKIIGKPIAIDQKTLDHDVGNYAAVLINIDFAKSIPKR 1297 VWV+ L +E W + L IA IG P+ ID T + G +A VL++ID + P+ Sbjct: 156 QVWVNLWDLGLEFWEPRTLFEIAHGIGVPVKIDHNTSERKFGLFARVLVDIDLSYDPPRE 215 Query: 1296 IRL-EANGKEFWQYIEIQDLENVKFCTHCKFMGHKFENCLAAKK 1168 + + NG+ +E + L + C+HC +GH C +K Sbjct: 216 LAVRRKNGETVIMEVEYERLPYL--CSHCGNVGHMVTTCKLLRK 257 >ref|XP_007026458.1| Uncharacterized protein TCM_021522 [Theobroma cacao] gi|508715063|gb|EOY06960.1| Uncharacterized protein TCM_021522 [Theobroma cacao] Length = 3503 Score = 103 bits (256), Expect = 1e-18 Identities = 106/458 (23%), Positives = 190/458 (41%), Gaps = 14/458 (3%) Frame = -3 Query: 1884 SFADLVKDKKPYMLTDSDLLSLPEASTYLDEPALVLPNELLQEGRDIFQFSLIGRLNFKG 1705 SF ++ +KP ++ + + + D PA + +Q F+ SL+G+ + + Sbjct: 1758 SFLSIITGEKPSVVPLTR-----DPFVFKDRPAAAFFEDEIQTLAKPFKLSLVGKFS-RM 1811 Query: 1704 LKLADVQKRLEEQWGFG-PGSSKLVPMTKGFFIIKLTSAENKERVWHGDTWVIQNQQLRV 1528 KL DV+ + G G G+ ++ + +I L++ ++ R+W W I Q++RV Sbjct: 1812 PKLQDVRAAFK---GIGLAGAYEVRWLDYKHVLIHLSNEQDFNRIWTKQNWFIATQKMRV 1868 Query: 1527 FNFYPNFDPEKQQTSRASVWVSFPALYIELWTKKILISIAKIIGKPIAIDQKTLDHDVGN 1348 F + P F+PEK+ ++ VW+SFP L L+ K L+ IAK +GKP+ +D+ T + + Sbjct: 1869 FKWTPEFEPEKE-SAVVPVWISFPNLKAHLFEKSALLLIAKTVGKPLFVDEATANGSRPS 1927 Query: 1347 
YAAVLINIDFAKSIPKRIRLEANGKE-------FWQYIEIQDLENVKFCTHCKFMGHKFE 1189 A V + D + ++ + ++ + Q +E + +C HC +GHK Sbjct: 1928 VARVCVEFDCRQPPLDQVWIVVQNRKTGEITNGYSQRVEFAQMP--AYCDHCCHVGHKET 1985 Query: 1188 NCLAAKKILGGSHAELKPPTVNIKSGKEEKPSKPVWKEVGKISHAGQANNVNNVQPGKES 1009 +C IL G+ A T S E+ + KE G+ + + N N+ +P + Sbjct: 1986 DC-----ILLGNKARPPGITKQPNSRLEDGGRRVGSKEDGEFTTEKRKNIENSKKPQNDK 2040 Query: 1008 INSAHEVAVHNAWXXXXXXXXXXXXXXXXXDLPRS------EPILSANAFEVLVETDEDN 847 I E H +S E I +N F ++ E +ED Sbjct: 2041 ILYPEEPPKHQKRGQPANKGSTSGTKIWQGKKVQSDKASKDENISVSNRFHIISEEEEDE 2100 Query: 846 PELEADQLEMELAKASVEFREAQMKLIRCKNVIAEKKDIAVRKGATKAQQENAAKGTSHV 667 A + + K + K K +R+G T+ + A Sbjct: 2101 HSRTA--------------QNGKEKKEKNKEKDEGGKTEGIRRGTTEERTTGA------- 2139 Query: 666 NTQLQEGKKDTNGWKQGSTPFKLTGIAEEVLQSSEHFY 553 ++Q G G + +TP L+ I E+ Q + H Y Sbjct: 2140 --EIQTGSGKPEGAEMTATPSALSQILEDNTQGTLHEY 2175 Score = 83.2 bits (204), Expect = 1e-12 Identities = 56/191 (29%), Positives = 90/191 (47%), Gaps = 19/191 (9%) Frame = -3 Query: 1611 IIKLTSAENKERVWHGDTWVIQNQQLRVFNFYPNFDPEKQQTSRASVWVSFPALYIELWT 1432 +I L++ ++ R+W W I NQ++RVF + P+F+ EK+ VW+SFP L L+ Sbjct: 31 LIHLSNEQDFNRIWTKQQWFIANQKMRVFKWSPDFEAEKESPI-VPVWISFPNLKAHLYE 89 Query: 1431 KKILISIAKIIGKPIAIDQKTLDHDVGNYAAVLINIDFAKS--------IPKRIRLEANG 1276 K L+ IAK +GKP+ ID+ T + + A V + + + I R+ G Sbjct: 90 KSALLLIAKTVGKPLFIDEATSNASRPSVARVCVEYNCRNAPVEEIWIVIKDRVTGTVTG 149 Query: 1275 KEFWQYIEIQDLENVKFCTHCKFMGHKFENCLA--------AKKILGGSHAEL---KPPT 1129 + Q +E + + +C HC +GH CL K+ L H++ K T Sbjct: 150 -GYAQKVEFSKMPD--YCEHCGHVGHSVSTCLVLGNRSENLRKEKLSNVHSKSLAGKKQT 206 Query: 1128 VNIKSGKEEKP 1096 N G + KP Sbjct: 207 ENDDKGLDSKP 217 >ref|XP_013452104.1| DUF4283 domain protein [Medicago truncatula] gi|657382204|gb|KEH26132.1| DUF4283 domain protein [Medicago truncatula] Length = 480 Score = 102 bits (254), Expect = 2e-18 Identities = 83/276 (30%), Positives = 128/276 (46%), Gaps = 11/276 (3%) Frame = -3 Query: 1779 
LPNELLQEGRDIFQFSLIGRLNF-KGLK---LADVQKRLEEQWGFGPGSSKLVPMTKGFF 1612 + +E ++G D + +L GRL KG K +VQ +L++ W G K+ P+ KG+F Sbjct: 65 ISDETYEQGIDACKINLRGRLILSKGDKPYGFREVQTKLQQLWK-NVGPWKMTPLGKGYF 123 Query: 1611 IIKLTSAENKERVWHGDTWVIQNQQLRVFNFYPNFDPEKQQTSRASVWVSFPALYIELWT 1432 +S E+ VW +T ++ LR+F + +F Q+ + A VW+ L E W Sbjct: 124 EFYFSSYEDMRSVWSKETQNLKPSLLRLFEWSKDFTARTQRQTHAQVWIRLLELPQEYWM 183 Query: 1431 KKILISIAKIIGKPIAIDQKTLDHDVGNYAAVLINIDFAKSIPKRIRLEANGKEFWQYIE 1252 + L I IG P+ ID T + G+Y VL+++D +K I + +E G F I Sbjct: 184 DRTLKEIGSAIGTPVLIDSATQNRVFGHYVRVLVDMDLSKHIFNEVMIERTGFSFSIEIT 243 Query: 1251 IQDLENVKFCTHCKFMGHKFENCLAAKKILGGSHAELKPPTVNIK--SGKEEKPSKPVWK 1078 + L FCTH +GH +C + K P ++ K S +KP P W+ Sbjct: 244 YECLP--AFCTHYGNIGHHISSCRWLHPV--------KEPVIDKKKQSIVLQKPRPPKWQ 293 Query: 1077 -----EVGKISHAGQANNVNNVQPGKESINSAHEVA 985 +V S A +A NN E I + HEVA Sbjct: 294 PKDNLDVIGSSKAFEAPVGNN---EVEDIPTPHEVA 326 >ref|XP_007022832.1| Uncharacterized protein TCM_026877 [Theobroma cacao] gi|508778198|gb|EOY25454.1| Uncharacterized protein TCM_026877 [Theobroma cacao] Length = 2367 Score = 102 bits (253), Expect = 3e-18 Identities = 86/327 (26%), Positives = 145/327 (44%), Gaps = 33/327 (10%) Frame = -3 Query: 1884 SFADLVKDKKPYMLTDSDLLSLPEASTYLDEPALVLPNELLQEGRDIFQFSLIGRLNFKG 1705 SF +V +KP ++ S + + D PA + +Q + SL+G+ + + Sbjct: 92 SFLSIVSGQKPPVVPLSR-----DPFVFKDRPAAAFYEDEIQTLAQPLKLSLVGKFS-RM 145 Query: 1704 LKLADVQKRLEEQWGFG-PGSSKLVPMTKGFFIIKLTSAENKERVWHGDTWVIQNQQLRV 1528 KL DV+ + G G G+ ++ + +I LT+ + RVW W I NQ++RV Sbjct: 146 PKLQDVRSAFK---GIGLAGAYEVRWLDYKHILIHLTNEHDCNRVWTKQVWFIANQKMRV 202 Query: 1527 FNFYPNFDPEKQQTSRASVWVSFPALYIELWTKKILISIAKIIGKPIAIDQKTLDHDVGN 1348 F + P F+PEK+ ++ VW++FP L L+ K L+ IAK +GKP+ +D+ T + + Sbjct: 203 FKWTPEFEPEKE-SAMVPVWIAFPNLKAHLFEKSALLLIAKTVGKPLFVDEATANGSRPS 261 Query: 1347 YAAVLINIDFAKSIPKRIRLEANGKE-------FWQYIEIQDLENVKFCTHCKFMGHKFE 1189 A V I D K ++ + +E + Q +E + +C HC +GHK Sbjct: 262 VARVCIEYDCRKPPIDQVWIVVQNRETGTVTSGYPQKVEFSQMP--AYCDHCCHVGHKEI 319 Query: 
1188 NCLA----------------------AKKILGGS---HAELKPPTVNIKSGKEEKPSKPV 1084 +C+ KK GGS + E K ++E+P Sbjct: 320 DCIVLGNKDKPLGSSKSQFLRVLEAEKKKGYGGSSEKNLEKSKNPEKEKIARQEEPVSQR 379 Query: 1083 WKEVGKISHAGQANNVNNVQPGKESIN 1003 W+ V K +G + Q GKE ++ Sbjct: 380 WQPVNKAGTSGTKD-----QQGKEIVS 401 >ref|XP_007010391.1| Uncharacterized protein TCM_044158 [Theobroma cacao] gi|508727304|gb|EOY19201.1| Uncharacterized protein TCM_044158 [Theobroma cacao] Length = 830 Score = 100 bits (250), Expect = 7e-18 Identities = 71/269 (26%), Positives = 131/269 (48%), Gaps = 21/269 (7%) Frame = -3 Query: 1803 YLDEPALVLPNELLQEGRDIFQFSLIGRLNFKGLKLADVQKRLEEQWGFGP-GSSKLVPM 1627 Y D PA ++ + F+FS++G+ + + L++ +++ + G G G+ ++ + Sbjct: 79 YKDRPAASFFDDEISTLAQPFKFSMVGKFS-RMLRMQEIRVAFK---GIGLIGAYEIRWL 134 Query: 1626 TKGFFIIKLTSAENKERVWHGDTWVIQNQQLRVFNFYPNFDPEKQQTSRASVWVSFPALY 1447 +I+L++ + R+W W I NQ++RVF + P F PEK+ +S VW+SFP L Sbjct: 135 DYKHILIQLSNEHDLNRIWLKQVWFISNQKMRVFKWSPEFQPEKE-SSMVPVWISFPNLK 193 Query: 1446 IELWTKKILISIAKIIGKPIAIDQKTLDHDVGNYAAVLINIDFAKSIPKRIRLEANGKE- 1270 L+ K L +I K +G+P+ +D+ T + + A V + D + ++ + ++ Sbjct: 194 AHLYEKSALSAIVKTVGRPLMVDEATANGTRPSVARVCVEFDCQQPPIDQVWIVTRNRQS 253 Query: 1269 ------FWQYIEIQDLENVKFCTHCKFMGHKFENCLAA-------KKILGGSHAELK--P 1135 + Q +E L +FCTHC +GH +C+ K+ +GG K Sbjct: 254 GSVMGGYMQKVEFARLS--EFCTHCSHVGHGVSSCMVIGNRPEKNKQPMGGKKQLKKEDK 311 Query: 1134 PTVNIKSG----KEEKPSKPVWKEVGKIS 1060 N + G +EEK ++P+ E K S Sbjct: 312 DRTNARKGDLKPQEEKETEPIQAEQQKQS 340 >ref|XP_009341595.1| PREDICTED: uncharacterized protein LOC103933633 [Pyrus x bretschneideri] Length = 572 Score = 99.4 bits (246), Expect = 2e-17 Identities = 71/242 (29%), Positives = 116/242 (47%), Gaps = 6/242 (2%) Frame = -3 Query: 1833 DLLSLPEASTYLDEPALVLPNELLQEGRDIFQFSLIGRLNFKG----LKLADVQKRLEEQ 1666 +L LP + D + + +L QE + +LIGRL K +K ++ L Sbjct: 121 NLSQLPSPAVRGDVTYVKISEDLYQEQLRSCRTNLIGRLLLKKGTMPMKTEFLKSALASL 180 Query: 1665 WGFGPGSS-KLVPMTKGFFIIKLTSAENKERVWHGDTWVIQNQQLRVFNFYPNFD-PEKQ 1492 W P +S KLVP+ KG+F + ++ E+ 
RVW G T +Q R+ + P+F + Sbjct: 181 WK--PHNSWKLVPLGKGYFDLHFSNEEDVRRVWGGGTCTLQFGLFRLSQWQPDFKLGDAL 238 Query: 1491 QTSRASVWVSFPALYIELWTKKILISIAKIIGKPIAIDQKTLDHDVGNYAAVLINIDFAK 1312 + A VW+ L E W +IL+ IA+ +G P+ +D T + G YA +L+++D + Sbjct: 239 PQTHAQVWIKIYGLSQEYWHPRILMEIARGVGTPLQLDHATREKLYGYYARILVDVDLSA 298 Query: 1311 SIPKRIRLEANGKEFWQYIEIQDLENVKFCTHCKFMGHKFENCLAAKKILGGSHAELKPP 1132 +P I +E F I ++L C HC +GH C + L G+H++ P Sbjct: 299 DLPSSIMVEREQHGFSVDIIYENLPPT--CGHCGVIGHNANKC----RHLKGNHSDAMHP 352 Query: 1131 TV 1126 V Sbjct: 353 DV 354 >ref|XP_007017130.1| Uncharacterized protein TCM_042329 [Theobroma cacao] gi|508787493|gb|EOY34749.1| Uncharacterized protein TCM_042329 [Theobroma cacao] Length = 2606 Score = 99.4 bits (246), Expect = 2e-17 Identities = 85/326 (26%), Positives = 149/326 (45%), Gaps = 33/326 (10%) Frame = -3 Query: 1884 SFADLVKDKKPYMLTDSDLLSLPEASTYLDEPALVLPNELLQEGRDIFQFSLIGRLNFKG 1705 SF +V KP ++ S + + D PA + +Q + SL+G+ + + Sbjct: 1689 SFLSIVSGDKPPVIPLSR-----DPLVFKDRPAAAFFEDEIQTLAQPLKLSLVGKFS-RM 1742 Query: 1704 LKLADVQKRLEEQWGFG-PGSSKLVPMTKGFFIIKLTSAENKERVWHGDTWVIQNQQLRV 1528 KL DV+ + G G G+ ++ + +I L++ ++ RVW W I NQ++RV Sbjct: 1743 PKLQDVRSAFK---GIGLTGAYEVRWLDYKHVLIHLSNEQDCNRVWTKQVWFIANQKMRV 1799 Query: 1527 FNFYPNFDPEKQQTSRASVWVSFPALYIELWTKKILISIAKIIGKPIAIDQKTLDHDVGN 1348 F + P F+PEK+ ++ VW++FP L L+ K L+ IAK +GKP+ +D+ T + + Sbjct: 1800 FKWTPEFEPEKE-SAVVPVWIAFPNLKAHLFEKSALLLIAKTVGKPLFVDEATANGSRPS 1858 Query: 1347 YAAVLINIDFAKSIPKRIRLEANGKE-------FWQYIEIQDLENVKFCTHCKFMGHKFE 1189 A V I D + ++ + +E + Q +E + +C HC +GHK Sbjct: 1859 VARVCIEFDCRRPPIDQVWIVVQNRETGTVTSGYPQRVEFSQMP--AYCDHCCHVGHKEN 1916 Query: 1188 NCLA---AKKILGGSHAE-LKPPTVNIKSG---------------------KEEKPSKPV 1084 +C+ K LG S ++ L+ V K+G + E+P+ Sbjct: 1917 DCIVLGNKDKSLGLSKSQSLRTLAVEKKTGYGGGSEKNLEKRKNPEKEKIVRPEEPASLR 1976 Query: 1083 WKEVGKISHAGQANNVNNVQPGKESI 1006 W++V K +G + Q GKE + Sbjct: 1977 WQQVSKAGISGTKD-----QQGKEIV 1997 Score = 95.1 bits (235), Expect = 4e-16 Identities = 63/226 (27%), 
Positives = 106/226 (46%), Gaps = 18/226 (7%) Frame = -3 Query: 1803 YLDEPALVLPNELLQEGRDIFQFSLIGRLN-----------FKGLKLADVQKRLEEQWGF 1657 Y D PA+ + + F+ S++G+ + FKG+ L V E +W Sbjct: 85 YRDRPAVAFFEDEIVALAQPFKHSMVGKFSRMPKLNDIRAAFKGISLVGVY---EIRW-- 139 Query: 1656 GPGSSKLVPMTKGFFIIKLTSAENKERVWHGDTWVIQNQQLRVFNFYPNFDPEKQQTSRA 1477 + +I L++ ++ R+W W I NQ++RVF + P+F PEK+ +S Sbjct: 140 ---------LDYKHILIHLSNEQDLNRLWMRQAWFIANQKMRVFKWTPDFQPEKE-SSLV 189 Query: 1476 SVWVSFPALYIELWTKKILISIAKIIGKPIAIDQKTLDHDVGNYAAVLINIDFAKSIPKR 1297 VW+SFP L L+ K L+ IAK +G+P+ +D+ T + + A V + D + ++ Sbjct: 190 PVWISFPNLRAHLYEKSALLMIAKSVGRPLFVDEATANGTRPSVARVCVEYDCQQPPLEQ 249 Query: 1296 IRLEANGKE-------FWQYIEIQDLENVKFCTHCKFMGHKFENCL 1180 I + + F Q ++ L N +CTHC +GH CL Sbjct: 250 IWIVTRDRRTGDITGGFQQKVDFAKLPN--YCTHCCHVGHSASTCL 293 >ref|XP_007031319.1| Uncharacterized protein TCM_016772 [Theobroma cacao] gi|508710348|gb|EOY02245.1| Uncharacterized protein TCM_016772 [Theobroma cacao] Length = 1296 Score = 99.0 bits (245), Expect = 3e-17 Identities = 66/218 (30%), Positives = 110/218 (50%), Gaps = 6/218 (2%) Frame = -3 Query: 1815 EASTYLDEPALVLPNELLQEGRDIFQFSLIGRLNFKGLKLADVQKRLEEQWGFG-PGSSK 1639 E S Y D PA + + F+FS+IG+ + KL +++ + G G G+ Sbjct: 81 EPSWYRDRPAASFFDNEIATLALSFKFSMIGKFT-RMPKLQEIRTAFK---GIGLVGAYN 136 Query: 1638 LVPMTKGFFIIKLTSAENKERVWHGDTWVIQNQQLRVFNFYPNFDPEKQQTSRASVWVSF 1459 + + +I L++ + R+W W I N+++RVF + P F PEK+ +S VW+SF Sbjct: 137 IRWLDYKHILIHLSNEHDLNRIWMKQNWFIVNKKMRVFKWTPEFHPEKE-SSLVPVWISF 195 Query: 1458 PALYIELWTKKILISIAKIIGKPIAIDQKTLDHDVGNYAAVLINIDFAKSIPKRIRLEAN 1279 P L + K L+ IAK +G+P+ +D+ T + N A + + D KS+ +I + Sbjct: 196 PNLRAHFYEKSTLMMIAKSVGRPLFVDEATANGTRPNVARICVEYDCQKSLLDQIWIVTR 255 Query: 1278 GKEFWQYIE--IQDLENVK---FCTHCKFMGHKFENCL 1180 ++ + IQ +E VK +CTHC +GH CL Sbjct: 256 SRQTGEVTGGFIQKVEFVKMPDYCTHCCHVGHNASACL 293 >ref|XP_009355139.1| PREDICTED: uncharacterized protein LOC103946202 [Pyrus x bretschneideri] Length = 572 Score = 98.6 bits (244), Expect = 3e-17 Identities = 78/294 
(26%), Positives = 137/294 (46%), Gaps = 8/294 (2%) Frame = -3 Query: 1833 DLLSLPEASTYLDEPALVLPNELLQEGRDIFQFSLIGRLNFKG----LKLADVQKRLEEQ 1666 +L LP + D + + +L QE + +LIGRL K +K ++ L Sbjct: 121 NLSQLPSPTVRGDVTYVKISEDLYQEQLRSCRTNLIGRLLLKKGTMPMKTEFLKSALASL 180 Query: 1665 WGFGPGSS-KLVPMTKGFFIIKLTSAENKERVWHGDTWVIQNQQLRVFNFYPNFD-PEKQ 1492 W P +S KLVP+ KG+F + ++ E+ RVW G T +Q + R+ + P+F + Sbjct: 181 WK--PHNSWKLVPLGKGYFDLHFSNEEDVRRVWGGGTCTLQFGRFRLSQWQPDFKLGDVL 238 Query: 1491 QTSRASVWVSFPALYIELWTKKILISIAKIIGKPIAIDQKTLDHDVGNYAAVLINIDFAK 1312 + A VW+ L E W +IL+ IA+ +G P+ +D T + G YA +L+++D + Sbjct: 239 PQTHAQVWIKIYGLSQEYWHPRILMEIARGVGTPLQLDHATREKLYGYYARILVDVDLSA 298 Query: 1311 SIPKRIRLEANGKEFWQYIEIQDLENVKFCTHCKFMGHKFENCLAAKKILGGSHAELKPP 1132 +P I +E F I + L C HC +GH C + L G+H++ P Sbjct: 299 DLPSSIMVEREQHGFSADIIYEILPPT--CGHCGVIGHNRNKC----RHLKGNHSDGMHP 352 Query: 1131 TVNIKSGKEEKPSKPVWKEVGKISHAGQANNVNNVQ--PGKESINSAHEVAVHN 976 V + P++ V++ A + + +++ P E I++ ++N Sbjct: 353 DVEAHH-RGRSPTRQVYRPKPSAKTASPSGDPIHMECSPSVEIIHNKCSTGLNN 405 >ref|XP_007031313.1| Uncharacterized protein TCM_016763 [Theobroma cacao] gi|508710342|gb|EOY02239.1| Uncharacterized protein TCM_016763 [Theobroma cacao] Length = 2127 Score = 98.2 bits (243), Expect = 4e-17 Identities = 89/335 (26%), Positives = 154/335 (45%), Gaps = 33/335 (9%) Frame = -3 Query: 1884 SFADLVKDKKPYMLTDSDLLSLPEASTYLDEPALVLPNELLQEGRDIFQFSLIGRLNFKG 1705 SF +V +KP ++ + + Y D PA + + F+ SL+G+ + + Sbjct: 86 SFLSIVSGEKPSVVPLTR-----DPFVYKDRPAAAFFEDEIHILAQPFKLSLVGKFS-RM 139 Query: 1704 LKLADVQKRLEEQWGFG-PGSSKLVPMTKGFFIIKLTSAENKERVWHGDTWVIQNQQLRV 1528 KL +V+ + G G GS ++ + +I L++ ++ R W W I NQ++RV Sbjct: 140 PKLQEVRSAFK---GIGLAGSYEIRWLDYKHILIHLSNEQDFNRFWTKQAWFIANQKMRV 196 Query: 1527 FNFYPNFDPEKQQTSRASVWVSFPALYIELWTKKILISIAKIIGKPIAIDQKTLDHDVGN 1348 F + P F+PEK+ ++ VW+SFP L L+ K L+ IAK +GKP+ ID+ T + + Sbjct: 197 FKWTPEFEPEKE-SAVVPVWISFPNLKAHLFEKSALLLIAKTVGKPLFIDEATANGSRPS 255 Query: 1347 YAAVLINIDFAKSIPKRIRL----EANGKEFWQYIE-IQDLENVKFCTHCKFMGHKFENC 
1183 A V I D + ++ + A G Y + ++ + +C HC +GHK NC Sbjct: 256 VARVCIEYDCREPPVDQVWIVVQNRATGAVTSGYPQKVEFAQMPAYCDHCCHVGHKEINC 315 Query: 1182 --LAAKKILGGS-----HAEL------------KPPTVNIKSGKEEKPSKPVWKEVGKIS 1060 L K L GS H+ + P I S +++ + W+ VGK+ Sbjct: 316 IVLGNKNGLQGSGKPQPHSVVDADKLRNLEKIKNPDKGKIVSTEDQAKHQQKWQPVGKVG 375 Query: 1059 HAG----QANNVNNVQPGKES----INSAHEVAVH 979 +G Q +++ + KE+ N H ++ H Sbjct: 376 TSGTKDRQGKEIDSDKGTKEANVPISNRFHGISGH 410 >ref|XP_007040951.1| Uncharacterized protein TCM_016760 [Theobroma cacao] gi|508778196|gb|EOY25452.1| Uncharacterized protein TCM_016760 [Theobroma cacao] Length = 1109 Score = 97.8 bits (242), Expect = 6e-17 Identities = 67/246 (27%), Positives = 123/246 (50%), Gaps = 8/246 (3%) Frame = -3 Query: 1893 IRVSFADLVKDKKPYMLTDSDLLSLPEASTYLDEPALVLPNELLQEGRDIFQFSLIGRLN 1714 ++ SF + ++P ++ S + S Y D PA + + +Q F SL+G+ + Sbjct: 58 LKKSFLTVAVGERPPVIPPSR-----DPSVYKDRPAAIFYEDEIQTLARPFSHSLVGKFS 112 Query: 1713 FKGLKLADVQKRLEEQWGFG-PGSSKLVPMTKGFFIIKLTSAENKERVWHGDTWVIQNQQ 1537 + KL +++ + G G G+ ++ M +I L++ ++ RVW W I NQ+ Sbjct: 113 -RMPKLQEIRHAFK---GIGLSGAYEIRWMDYKHVLIHLSNEQDFNRVWVKQQWFIVNQK 168 Query: 1536 LRVFNFYPNFDPEKQQTSRASVWVSFPALYIELWTKKILISIAKIIGKPIAIDQKTLDHD 1357 +RVF + P+F+ EK+ ++ VW+SFP L L+ K L+ IAK +GKP+ +D+ T + Sbjct: 169 MRVFKWAPDFEAEKE-SAMVPVWISFPNLKAHLYEKSALLLIAKTVGKPLYVDEATANGS 227 Query: 1356 VGNYAAVLINIDFAKSIPKRIRLEANGKE-------FWQYIEIQDLENVKFCTHCKFMGH 1198 + A V + D K + I + +E + Q +E + + +C +C +GH Sbjct: 228 RPSVARVCVEYDCRKQPVEEIWIVIRNRETGAVTGGYSQRVEFARMPD--YCGYCSHVGH 285 Query: 1197 KFENCL 1180 K C+ Sbjct: 286 KENECI 291 >ref|XP_007026457.1| Uncharacterized protein TCM_021521 [Theobroma cacao] gi|508715062|gb|EOY06959.1| Uncharacterized protein TCM_021521 [Theobroma cacao] Length = 1951 Score = 96.7 bits (239), Expect = 1e-16 Identities = 67/243 (27%), Positives = 120/243 (49%), Gaps = 8/243 (3%) Frame = -3 Query: 1884 SFADLVKDKKPYMLTDSDLLSLPEASTYLDEPALVLPNELLQEGRDIFQFSLIGRLNFKG 1705 SF + +KP ++ + E Y D PA+ + + F+ S++G+ + + Sbjct: 
63 SFLSVAAGEKPPIIPTNR-----EPFWYRDRPAVAFFEDEIVALAQPFKHSMVGKFS-RM 116 Query: 1704 LKLADVQKRLEEQWGFG-PGSSKLVPMTKGFFIIKLTSAENKERVWHGDTWVIQNQQLRV 1528 KL D++ + G G G ++ + +I L++ ++ R+W W I NQ++RV Sbjct: 117 PKLNDIRAAFK---GIGLVGVYEIRWLDYKHILIHLSNEQDLNRLWMRQAWFIANQKMRV 173 Query: 1527 FNFYPNFDPEKQQTSRASVWVSFPALYIELWTKKILISIAKIIGKPIAIDQKTLDHDVGN 1348 F + P+F PEK+ +S VW+SFP L L+ K L+ IAK +G+P+ +D+ T + + Sbjct: 174 FKWSPDFQPEKE-SSLVPVWISFPNLRAHLYEKSALLMIAKSVGRPLFVDEATANGTRPS 232 Query: 1347 YAAVLINIDFAKSIPKRIRLEANGKE-------FWQYIEIQDLENVKFCTHCKFMGHKFE 1189 A V + D + ++I + + + F Q ++ L N +CTHC +GH Sbjct: 233 VARVCVEYDCQQPPLEQIWIVSRDRRTGDITGGFQQKVDFAKLPN--YCTHCCHVGHSAS 290 Query: 1188 NCL 1180 CL Sbjct: 291 TCL 293 >ref|XP_010690660.1| PREDICTED: uncharacterized protein LOC104904165 [Beta vulgaris subsp. vulgaris] Length = 442 Score = 95.9 bits (237), Expect = 2e-16 Identities = 63/244 (25%), Positives = 119/244 (48%), Gaps = 9/244 (3%) Frame = -3 Query: 1884 SFADLVKDKKPYMLTDSDLLSLPEASTYLDEPALVLPN---------ELLQEGRDIFQFS 1732 SF D+V + + L + S +E A+ +P+ E LQ R+ ++ + Sbjct: 73 SFRDIVAGSSQWF---KEAKQLVQTSMEWEEEAVEIPDSQLAVSFSKEKLQSLREPWRNT 129 Query: 1731 LIGRLNFKGLKLADVQKRLEEQWGFGPGSSKLVPMTKGFFIIKLTSAENKERVWHGDTWV 1552 L+ ++ + + R+ W +++ + +G F++K ++++ ER +G W Sbjct: 130 LMAKVLGMPINRNFLVDRVNRMWKT-KDRLEVIDLGQGIFLLKFHNSDDMERALYGSPWF 188 Query: 1551 IQNQQLRVFNFYPNFDPEKQQTSRASVWVSFPALYIELWTKKILISIAKIIGKPIAIDQK 1372 I N L + + P+F P + VW+ FP L +E + K L +IA+ +GKPI +D Sbjct: 189 ILNHYLMLTKWKPDFRPSSSSFDKIMVWIRFPELPLEYYEKDALFAIAEKVGKPIKVDYA 248 Query: 1371 TLDHDVGNYAAVLINIDFAKSIPKRIRLEANGKEFWQYIEIQDLENVKFCTHCKFMGHKF 1192 T G YA V I ++ +K++ R+ + + WQ +E ++L+ V C C +GH+ Sbjct: 249 TDTVVRGRYARVCIELELSKALVTRVWV----AKAWQTVEYENLDLV--CFKCGRIGHRQ 302 Query: 1191 ENCL 1180 + CL Sbjct: 303 DQCL 306 >ref|XP_008350100.1| PREDICTED: uncharacterized protein LOC103413411 [Malus domestica] Length = 405 Score = 95.9 bits (237), Expect = 2e-16 Identities = 58/191 (30%), Positives = 96/191 (50%), Gaps = 1/191 
(0%) Frame = -3 Query: 1647 SSKLVPMTKGFFIIKLTSAENKERVWHGDTWVIQNQQLRVFNFYPNFDP-EKQQTSRASV 1471 S KLVP+ KG+F + +S E+ RVW G T +Q R+ + P+F P + + A V Sbjct: 21 SWKLVPLGKGYFDLHFSSEEDMRRVWGGGTCTLQFGLFRLSQWQPDFKPGDVLPQTHAQV 80 Query: 1470 WVSFPALYIELWTKKILISIAKIIGKPIAIDQKTLDHDVGNYAAVLINIDFAKSIPKRIR 1291 W+ L E W +IL+ IA+ +G P+ +D T + G YA +L+++D + +P I Sbjct: 81 WIKIYGLSQEYWHPRILMEIARGVGTPLQLDHATREKLYGYYARILVDVDLSADLPSTIM 140 Query: 1290 LEANGKEFWQYIEIQDLENVKFCTHCKFMGHKFENCLAAKKILGGSHAELKPPTVNIKSG 1111 +E F I ++L C HC +GH C + L G+H++ P V Sbjct: 141 VEREQHGFSVDIIYENLPPT--CGHCGVIGHNTNKC----RHLKGNHSDGMHPHVEAHR- 193 Query: 1110 KEEKPSKPVWK 1078 + P++ V++ Sbjct: 194 RGRSPTRQVYR 204 >ref|XP_007017131.1| Uncharacterized protein TCM_033752 [Theobroma cacao] gi|508722459|gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao] Length = 2251 Score = 95.9 bits (237), Expect = 2e-16 Identities = 64/219 (29%), Positives = 108/219 (49%), Gaps = 32/219 (14%) Frame = -3 Query: 1611 IIKLTSAENKERVWHGDTWVIQNQQLRVFNFYPNFDPEKQQTSRASVWVSFPALYIELWT 1432 +I L++ ++ RVW W I NQ++RVF + P+F+PEK+ ++ VW++FP L L+ Sbjct: 31 LIHLSNEQDCNRVWTKQVWFIANQKMRVFKWTPDFEPEKE-SAVVPVWIAFPNLKAHLFE 89 Query: 1431 KKILISIAKIIGKPIAIDQKTLDHDVGNYAAVLINIDFAKSIPKRIRLEANGKE------ 1270 K L+ IAK +GKP+ +D+ T + + A V I D +S ++ + +E Sbjct: 90 KSALLLIAKTVGKPLFVDEATANGSRPSVARVCIEYDCRRSPIDQVWIVVQNRETGTVTS 149 Query: 1269 -FWQYIEIQDLENVKFCTHCKFMGHKFENCLA---AKKILGGSHAE-LKPPTVNIKSG-- 1111 + Q +E + +C HC +GHK +C+ K LG S ++ L+ TV K+G Sbjct: 150 GYPQRVEFSQMP--AYCDHCCHVGHKEIDCIVLGNKDKSLGRSKSQSLRALTVEKKTGYG 207 Query: 1110 -------------------KEEKPSKPVWKEVGKISHAG 1051 + E+P+ WK+V K +G Sbjct: 208 GGSEKNLEKRKNPEKEKIVRPEEPASLRWKQVSKAGTSG 246 >ref|XP_007206965.1| hypothetical protein PRUPE_ppb019035mg [Prunus persica] gi|462402607|gb|EMJ08164.1| hypothetical protein PRUPE_ppb019035mg [Prunus persica] Length = 409 Score = 95.9 bits (237), Expect = 2e-16 Identities = 67/262 (25%), Positives = 118/262 (45%) Frame = -3 Query: 1899 
GNIRVSFADLVKDKKPYMLTDSDLLSLPEASTYLDEPALVLPNELLQEGRDIFQFSLIGR 1720 G+ R + DK + + D D + S P++ ++ + ++ S+I + Sbjct: 98 GSTRTGLGGMADDK--FTIEDDDFI----VSEGEKGPSIRFSEQVKERLYRPWRTSIIIK 151 Query: 1719 LNFKGLKLADVQKRLEEQWGFGPGSSKLVPMTKGFFIIKLTSAENKERVWHGDTWVIQNQ 1540 L K V RL+ +WG G KL+ + GFFI++ E+ + + G WVI Q Sbjct: 152 LMGKAHTYNFVLARLQHRWGMIKGPWKLIDLENGFFIVRFVLEEDMKAILCGGPWVIAGQ 211 Query: 1539 QLRVFNFYPNFDPEKQQTSRASVWVSFPALYIELWTKKILISIAKIIGKPIAIDQKTLDH 1360 L + + P FDP +Q +R +VWV L++E + + ++ I ++G +D T+ Sbjct: 212 YLVMQRWKPGFDPLVEQVTRMTVWVRIIGLHVEWFRPEAMLRIGDLLGTTFKVDSNTVAQ 271 Query: 1359 DVGNYAAVLINIDFAKSIPKRIRLEANGKEFWQYIEIQDLENVKFCTHCKFMGHKFENCL 1180 G YA V + ID + + +++E N W +E + + V C C GH C Sbjct: 272 VRGKYARVCVEIDLTQPLQAFVQVEDN----WYGLEYEGIHLV--CFACGCYGHNRNVCP 325 Query: 1179 AAKKILGGSHAELKPPTVNIKS 1114 + K KP T N+++ Sbjct: 326 SVIK---------KPHTENVQN 338 >ref|XP_011471050.1| PREDICTED: uncharacterized protein LOC105353507 [Fragaria vesca subsp. vesca] Length = 341 Score = 95.5 bits (236), Expect = 3e-16 Identities = 57/197 (28%), Positives = 98/197 (49%), Gaps = 1/197 (0%) Frame = -3 Query: 1650 GSSKLVPMTKGFFIIKLTSAENKERVWHGDTWVIQNQQLRVFNFYPNFDPEKQQTSRASV 1471 G KL+PM +G+F SA+ +W ++ LR+ + PNF P + + A V Sbjct: 98 GDWKLIPMGRGYFSFCFPSADCVSAIWAKGAVNLKPGILRIMRWVPNFSPASHRNTNAQV 157 Query: 1470 WVSFPALYIELWTKKILISIAKIIGKPIAIDQKTLDHDVGNYAAVLINIDFAKSIPKRIR 1291 WV F L +E W + L IA IG PI +D TL+ G +A +LI+ID P + Sbjct: 158 WVRFWDLGLEFWETQTLFEIASGIGIPIKVDNYTLERRYGLFARILIDIDLTIDPPLDLV 217 Query: 1290 LE-ANGKEFWQYIEIQDLENVKFCTHCKFMGHKFENCLAAKKILGGSHAELKPPTVNIKS 1114 +E +G+ ++E + L ++ CT+C +GH C + K+ + +E++ + Sbjct: 218 VERESGEALVLHVEYEKLPSL--CTNCGNLGHVVSGCNSVKRSKENNVSEVE------RR 269 Query: 1113 GKEEKPSKPVWKEVGKI 1063 G+ KP K + + ++ Sbjct: 270 GRSRKPCKRKHRGISQV 286 >gb|ABE91952.1| Zinc finger, CCHC-type [Medicago truncatula] Length = 504 Score = 95.5 bits (236), Expect = 3e-16 Identities = 58/204 (28%), Positives = 103/204 (50%) Frame = -3 Query: 1791 
PALVLPNELLQEGRDIFQFSLIGRLNFKGLKLADVQKRLEEQWGFGPGSSKLVPMTKGFF 1612 P + + + QE ++ +L+ +L K L ++ RL++ W G ++ GFF Sbjct: 71 PKIYIEPQTFQELCTPWKDALVVKLLGKSLGYNTMKDRLQKIWKL-QGGFDIMDNDNGFF 129 Query: 1611 IIKLTSAENKERVWHGDTWVIQNQQLRVFNFYPNFDPEKQQTSRASVWVSFPALYIELWT 1432 ++K A +KE+V G W+I + L V ++ P F + R VWV FP L + + Sbjct: 130 MVKFDQAADKEKVITGGPWLIFDHCLAVTHWTPEFASPNAKVDRTVVWVRFPGLNLVYYD 189 Query: 1431 KKILISIAKIIGKPIAIDQKTLDHDVGNYAAVLINIDFAKSIPKRIRLEANGKEFWQYIE 1252 + L+++A +G+PI +D TL + G +A V + ID ++P ++ NG W ++ Sbjct: 190 ESFLLAMASALGRPIKVDTNTLKVERGKFARVCVEIDL--TVPVVGKIWVNG--HWYKVQ 245 Query: 1251 IQDLENVKFCTHCKFMGHKFENCL 1180 + L + CT+C GH NC+ Sbjct: 246 YEGLHLI--CTNCGCYGHLGRNCM 267 >gb|ABE87590.1| non-LTR retrolelement reverse transcriptase-like protein, related [Medicago truncatula] Length = 497 Score = 95.1 bits (235), Expect = 4e-16 Identities = 68/256 (26%), Positives = 113/256 (44%), Gaps = 4/256 (1%) Frame = -3 Query: 1821 LPEASTYLDEPALVLPNELLQEGRDIFQFSLIGRLNF----KGLKLADVQKRLEEQWGFG 1654 LP S + A+ + G D + +L GRL K D+ +L++ W Sbjct: 63 LPLPSILGETLAVAISTTAYFRGVDYCKINLRGRLVLSKGDKPYATKDITAKLQKLWKV- 121 Query: 1653 PGSSKLVPMTKGFFIIKLTSAENKERVWHGDTWVIQNQQLRVFNFYPNFDPEKQQTSRAS 1474 G ++ + +GF+ S E+ VW T ++ LR+F + +F+ Q+ + Sbjct: 122 KGPWHMLSLGRGFYEFFFASQEDMRTVWAAGTVSLKPGLLRLFEWTKDFNLHTQRQTHTQ 181 Query: 1473 VWVSFPALYIELWTKKILISIAKIIGKPIAIDQKTLDHDVGNYAAVLINIDFAKSIPKRI 1294 VW+ L E W ++ L IA +G P+ ID T + G+YA +L+++D +K I + Sbjct: 182 VWIRLWELPQEYWMERTLYEIAGAVGTPLLIDNVTRNRLYGHYARILVDLDLSKKIFYEV 241 Query: 1293 RLEANGKEFWQYIEIQDLENVKFCTHCKFMGHKFENCLAAKKILGGSHAELKPPTVNIKS 1114 +E G F IE + L +FCTHC +GH C + L E N+K Sbjct: 242 LVEREGFSFPIAIEYEGLP--EFCTHCHSIGHNINLC----RRLHPRRPETHEQPTNVKK 295 Query: 1113 GKEEKPSKPVWKEVGK 1066 + +PV + K Sbjct: 296 NTGDNGKQPVHSQQPK 311 >gb|KRH05184.1| hypothetical protein GLYMA_17G212000 [Glycine max] Length = 516 Score = 94.7 bits (234), Expect = 5e-16 Identities = 86/342 (25%), Positives = 144/342 (42%), Gaps = 22/342 (6%) Frame = -3 Query: 
1929 ENPDSNVSNSGNIRVSF----------------ADLVKDKKPYMLTDSDLLSLPEASTYL 1798 E PD SG RVSF DL+K+K ++ + D P ++ Sbjct: 23 EPPDGG--GSGQTRVSFKEMAMANREALPQRPKVDLIKEKLAKIVFEDDNPLKP--IVHI 78 Query: 1797 DEPALVLPNELLQEGRDIFQFSLIGR-LNFKGLKLADVQKRLEEQWGFGPGSSKLVPMTK 1621 D+ N L +D L+G+ + F+ +K L W G ++ + Sbjct: 79 DDSIF---NGLCAPWQDALVVKLLGKNIGFQAMK-----DHLTRIWKLVAGFD-ILDIGN 129 Query: 1620 GFFIIKLTSAENKERVWHGDTWVIQNQQLRVFNFYPNFDPEKQQTSRASVWVSFPALYIE 1441 F+++K + E++++V G W+I + L + + P+F + + VWV FP+L + Sbjct: 130 HFYMVKFDTTEDRQKVIEGGPWMIFDHYLTIQTWTPDFISPTAKIDKTMVWVRFPSLNLI 189 Query: 1440 LWTKKILISIAKIIGKPIAIDQKTLDHDVGNYAAVLINIDFAKSIPKRIRLEANGKEFWQ 1261 + + IL+++A+ IG PI +D TLD G++A + + ID K + ++ L K +W Sbjct: 190 YYDENILLALARAIGTPIKVDSNTLDVRRGHFARICVQIDLNKPVVGKVGL----KGYWY 245 Query: 1260 YIEIQDLENVKFCTHCKFMGHKFENCLAAKKILGGSHAELKPPTVNIKSGKEEKPSKPVW 1081 +E + L + C+ C GH C K L PT K P P Sbjct: 246 KVEYEGLHRI--CSSCGCYGHLARECPTPAKT--PMMKNLSAPT------KFNGPKIP-- 293 Query: 1080 KEVGKISHAGQANNVN-----NVQPGKESINSAHEVAVHNAW 970 I+HA N N GK+ N+ +V +H W Sbjct: 294 ---ASITHANNCGNNGTVAQVNAITGKDVANNEKDV-LHGEW 331