BLASTX nr result
ID: Catharanthus22_contig00011275
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00011275
         (1698 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                      Score    E
Sequences producing significant alignments:                           (bits) Value

ref|XP_002279676.2| PREDICTED: uncharacterized protein LOC100250...    137   2e-29
ref|XP_006346563.1| PREDICTED: uncharacterized protein LOC102595...    130   2e-27
ref|XP_004243957.1| PREDICTED: uncharacterized protein LOC101261...    127   2e-26
ref|XP_006475430.1| PREDICTED: uncharacterized protein LOC102627...    120   2e-24
emb|CBI27917.3| unnamed protein product [Vitis vinifera]               108   9e-21
ref|XP_004488037.1| PREDICTED: uncharacterized protein LOC101498...    103   3e-19
ref|XP_004488036.1| PREDICTED: uncharacterized protein LOC101498...    102   4e-19
ref|XP_004488034.1| PREDICTED: uncharacterized protein LOC101498...     99   4e-18
ref|XP_004488033.1| PREDICTED: uncharacterized protein LOC101498...     99   4e-18
ref|XP_006451456.1| hypothetical protein CICLE_v10010605mg, part...     99   7e-18
ref|XP_004288940.1| PREDICTED: uncharacterized protein LOC101298...     98   9e-18
gb|EOY30469.1| Uncharacterized protein isoform 2 [Theobroma cacao]      96   6e-17
gb|EOY30468.1| Uncharacterized protein isoform 1 [Theobroma cacao]      96   6e-17
gb|EOY30470.1| Uncharacterized protein isoform 3 [Theobroma cacao]      93   4e-16
gb|EXB38835.1| hypothetical protein L484_027269 [Morus notabilis]       80   2e-12
ref|XP_002514104.1| conserved hypothetical protein [Ricinus comm...     79   5e-12
ref|XP_006381679.1| hypothetical protein POPTR_0006s15560g [Popu...     73   3e-10
ref|XP_004141054.1| PREDICTED: uncharacterized protein LOC101204...     72   6e-10
ref|XP_003534912.1| PREDICTED: transcriptional regulator ATRX ho...
59 5e-06 >ref|XP_002279676.2| PREDICTED: uncharacterized protein LOC100250764 [Vitis vinifera] Length = 597 Score = 137 bits (344), Expect = 2e-29 Identities = 117/357 (32%), Positives = 175/357 (49%), Gaps = 32/357 (8%) Frame = +1 Query: 433 AKSTSVSSKEGHVDKSEQQGPLEKTPSNVRKLVSAFESSVAQDSKSPIKPSAXXXXXXXX 612 + S S +E H D +E+Q L KTPSNVRK++SAFE+S+ Q+ + P Sbjct: 251 SNSASPRLEENHADNTEKQSSLRKTPSNVRKMISAFENSLTQEMGHRVAPPVTKSQSTKS 310 Query: 613 XXXXYLKDSDPV-ETFSGRPK---------KPFIDRG-LKATG----KGGEKIDSGRIIV 747 L+ + ET + PK K F+ +G + T K GE+IDS R + Sbjct: 311 WREVLLRGPQKLKETETWNPKVTQSTSEEAKDFVLKGEFQQTAAYIRKRGEQIDSDRAMD 370 Query: 748 DDMS-MDTKRLKQSASANIESRIKNPSST-------EWAKSSLDL----ETQTSAKSGSV 891 + + + K+ ++ +I+S+ + PS E KS +L +T+ SG + Sbjct: 371 KSKAPLYAGQSKELSAKHIQSKNETPSDKNRQVHKKEERKSFENLIKKFPIETATASGGI 430 Query: 892 SGRKLDHGSGSSYSFAEQKGSYGTSHVE----GRSENGSAEISDHVASSGKSKYLEYREE 1059 R+ S +++ S GTS +E G S+EI + K + Y ++ Sbjct: 431 FNRQ--GRLQPSNLVTDERDSGGTSVIEKDGVGVQSRFSSEIISQGGTENTPKPVLYCKD 488 Query: 1060 ELFHFEKSGMWIFPSDQKQLCITTAGKRAMNVLGSRKTEAGYHQKRKTGFLSGHVDKDGT 1239 E F FE G IF D ++LCITT+ K+ MNV+G TE HQ++ L H G+ Sbjct: 489 ENFSFESCGSCIFLDDTRRLCITTSDKQVMNVMGGSPTEVDIHQRK----LKVH----GS 540 Query: 1240 EHEKKKDHNKTQNQKKSRPESSSG-DASGGLVAQAIKVAVVVGFGILVFLTRQREPR 1407 + K KT ++ K R ESS+ + S G VA+AIKVA++ GFG VFLTRQR R Sbjct: 541 SDIEGKQAQKTSHRVKKRLESSADVEPSRGPVARAIKVAIMAGFGTFVFLTRQRSSR 597 >ref|XP_006346563.1| PREDICTED: uncharacterized protein LOC102595100 [Solanum tuberosum] Length = 663 Score = 130 bits (327), Expect = 2e-27 Identities = 130/406 (32%), Positives = 188/406 (46%), Gaps = 51/406 (12%) Frame = +1 Query: 427 VAAKSTSVSSKEGHVDKSEQ---QGPLEKTPSNVRKLVSAFESSVAQ--DSKSPIKPSAX 591 VAAK SVS K VD S + QG LEKTPS+VRK++SAFE+ + Q +S + A Sbjct: 269 VAAKKESVSQKN-IVDLSNEPDDQGVLEKTPSSVRKMISAFETGLTQKKGGRSLTRTRAS 327 Query: 592 XXXXXXXXXXXYLKDSDPVETFSGRPKKPF---IDRGLKATGKGGEKIDSGR-------- 738 LKD D RP K ++R L +I+ G+ Sbjct: 328 KSQPNLVGIGGSLKDLDSDNI--SRPNKMSALRLERPLNTVDLPEPQINIGKRGQNSSPE 
385 Query: 739 --IIVDDMSMDTKRLKQSA------------------SANIESRIKNPSSTEWAKSSLDL 858 + + + + LKQS+ SA ES S + + S +L Sbjct: 386 QDFVGTEQPVFHEELKQSSVHIVRFNEAGSSQQETFVSAKKESNTVATSPVDLMRLS-NL 444 Query: 859 ETQTSAKSGSV--------SGRKLDHGSGSSYSFAEQKGSYGTSHVEGRSENGSAEISDH 1014 ET TS+++ SV + + S + S AE++ E RSE E+ Sbjct: 445 ETATSSQTTSVAHPDVLKANNLAANRDSFNGPSVAEKQNR------EIRSET-LPEVHFE 497 Query: 1015 VASSGKSKYLEYREEELFHFEKSGMWIFPSDQKQLCITTAGKRAMNVLGSRKTEAGYHQK 1194 AS+ K K + RE+EL+ E SG WIFP ++K+LC+T+AGK +++ + HQ Sbjct: 498 KASNVKPKLIACREDELYDSENSGAWIFPDNKKRLCMTSAGKNIVHLSEDCRIGVDDHQS 557 Query: 1195 RK-------TGFLSGHVDKDGTEHEKKKDHNKTQNQKKSRPESSSGDASGGLVAQAIKVA 1353 K TG S D + ++ K +NQ KS E + S G V Q + +A Sbjct: 558 NKRPSMQETTGKRSFFHRSDSMTKKGREKPQKPRNQSKSFGE----NGSSGPVRQVMNIA 613 Query: 1354 VVVGFGILVFLTRQREPRKSKKDRNDLFFESSKYMDERILSLREEE 1491 +VVGFGILV LTRQRE RK+ + D +F S +MD+ L+ EE+ Sbjct: 614 LVVGFGILVLLTRQRETRKNDRKSKDFYFTSPDFMDQ--LASSEEQ 657 >ref|XP_004243957.1| PREDICTED: uncharacterized protein LOC101261837 [Solanum lycopersicum] Length = 516 Score = 127 bits (318), Expect = 2e-26 Identities = 118/398 (29%), Positives = 187/398 (46%), Gaps = 42/398 (10%) Frame = +1 Query: 424 EVAAKSTSVSSKE--GHVDKSEQQGPLEKTPSNVRKLVSAFESSVAQDS--KSPIKPSAX 591 ++AAK SVS K ++ E QG LEKTPS+VRK++SAFE+ + Q +S + A Sbjct: 121 DIAAKKESVSRKNVVASSNEPEDQGVLEKTPSSVRKMISAFETGLTQKKGRRSLTRTRAS 180 Query: 592 XXXXXXXXXXXYLKDSDPVETFSGRPKKPF---IDRGLKATGKGGEKIDSGRIIVDDM-- 756 LKD D + RP K ++R L +I+ G+ + + Sbjct: 181 KSQPNLVGIGGSLKDLDSDKI--SRPNKISALRLERPLNTVDLPEPQINIGKRVQNSSPA 238 Query: 757 ------------------SMDTKRLKQSASANIESRIK-NPSSTEWAKSSLDL------E 861 S+ + ++ S+ E+ + S A S +DL E Sbjct: 239 QDFVGTEQPVFHEQFKQSSVHIVQFNEAGSSQQETFVSAKKDSNTVAASPVDLIRLSNLE 298 Query: 862 TQTSAKSGSVSGRKLDHGSGSSYSFAEQKGSYGTSHVEGRSENGSAEISDHV----ASSG 1029 T S+++ SV+ + S + A+Q G + E R+ +E S V AS+ Sbjct: 299 TAISSQTTSVAHPDMLKASNLA---ADQDFFNGPAVAEKRNREIRSETSPEVHFEKASNV 355 Query: 1030 
KSKYLEYREEELFHFEKSGMWIFPSDQKQLCITTAGKRAMNVLGSRKTEAGYHQKRKTGF 1209 K K + R++ELF E SG WIFP ++++LC+T+AGK +++ HQ+ K Sbjct: 356 KPKLIACRKDELFDSENSGAWIFPDNKRRLCMTSAGKNIVHLSEDCHIGVDDHQRNKRPS 415 Query: 1210 LSGHVDKDG----TEHEKKKDHNKTQNQKKSRPESSSGDASGGLVAQAIKVAVVVGFGIL 1377 + K +E K K Q + +++ ES + S G V Q + +A+VVGFGIL Sbjct: 416 MQETTGKRSFFRRSESTTMKGREKPQ-KPRTQSESFGENGSSGPVRQVMNIALVVGFGIL 474 Query: 1378 VFLTRQREPRKSKKDRNDLFFESSKYMDERILSLREEE 1491 V LTRQRE RK+ + ++F S +MD+ L+ EE+ Sbjct: 475 VLLTRQRETRKNDRKSKVVYFNSPDFMDQ--LASSEEQ 510 >ref|XP_006475430.1| PREDICTED: uncharacterized protein LOC102627547 isoform X1 [Citrus sinensis] Length = 583 Score = 120 bits (300), Expect = 2e-24 Identities = 107/367 (29%), Positives = 171/367 (46%), Gaps = 30/367 (8%) Frame = +1 Query: 388 GQEASCKLEYSPE--VAAKSTSVSSK---EGHVDKSEQQGPLEKTPSNVRKLVSAFESSV 552 G + + L S E VAAK+ ++ K G +++ E+Q P EK PSNVR +++AFESS+ Sbjct: 218 GSKEASSLSKSSELKVAAKNKLIARKLEEAGSINR-EKQSPAEKIPSNVRNMINAFESSL 276 Query: 553 AQDSKSPIKP----------SAXXXXXXXXXXXXYLKDSDPVETFSGRPKKPFIDRGL-K 699 +QD + IKP S+ + P SGR K PF+ + Sbjct: 277 SQDIRPYIKPAPAKSQLRKISSEASLTSLSADEFKTEQIKPAALMSGRIKTPFLTGEFQQ 336 Query: 700 ATGKGGEKIDS-GRIIVDDMSMDTKRLKQ--SASANIESRIKNPSSTEWAKSSLDLETQT 870 AT K D G + D + +Q + ++ + +KNP ++ SS L ++ Sbjct: 337 ATMHTRAKEDQLGYVKAFDGYTAHQGTRQFGLSPVDVRTEVKNPDMSD-KNSSEGLMRES 395 Query: 871 SAKSGSVSGRKLD-----HGSGSSYSFAEQKGSYGT--SHVEGRSENGSAEISDHVASSG 1029 + K+ +VSGR +D G +Q G + ++G + S E+ S Sbjct: 396 TGKAATVSGRMVDEHIRRQHPGKLLLNEQQSGGKSSIKESMKGVRQEYSLEVDSKGTSIN 455 Query: 1030 KSKYLEYREEELFHFEKSGMWIFPSDQKQLCITTAGKRAMNVLGSRKTEAGYHQKRKTGF 1209 K KY E ++ + G W+FP+ + LCIT GK ++++G E H++ K Sbjct: 456 KLKYKENWKDIHYSSNCPGTWMFPTGSRNLCITAGGKHLIDLMGICHAEVEIHREEKFP- 514 Query: 1210 LSGHVDKDGTEHEKKKDHNKTQNQKKSR-PE---SSSGDASGGLVAQAIKVAVVVGFGIL 1377 +V+K T K + ++ KK+R PE S + S G V Q +KVA+++GF L Sbjct: 515 APENVEKSSTRPGCNKGNEVDESSKKARKPELVNSEDNENSRGAVGQVLKVAIMIGFAAL 574 Query: 1378 VFLTRQR 1398 V TRQR Sbjct: 575 VLFTRQR 581 
>emb|CBI27917.3| unnamed protein product [Vitis vinifera] Length = 657 Score = 108 bits (269), Expect = 9e-21 Identities = 105/356 (29%), Positives = 167/356 (46%), Gaps = 32/356 (8%) Frame = +1 Query: 433 AKSTSVSSKEGHVDKSEQQGPLEKTPSNVRKLVSAFESSVAQDSKSPIKPSAXXXXXXXX 612 + S S +E H D +E+Q L KTPSNVRK++SAFE+S+ Q+ + P Sbjct: 311 SNSASPRLEENHADNTEKQSSLRKTPSNVRKMISAFENSLTQEMGHRVAPPVTKSQSTKS 370 Query: 613 XXXXYLKDSDPV-ETFSGRPK---------KPFIDRG-LKATG----KGGEKIDSGRIIV 747 L+ + ET + PK K F+ +G + T K GE+IDS R + Sbjct: 371 WREVLLRGPQKLKETETWNPKVTQSTSEEAKDFVLKGEFQQTAAYIRKRGEQIDSDRAMD 430 Query: 748 DDMS-MDTKRLKQSASANIESRIKNPSST-------EWAKSSLDL----ETQTSAKSGSV 891 + + + K+ ++ +I+S+ + PS E KS +L +T+ SG + Sbjct: 431 KSKAPLYAGQSKELSAKHIQSKNETPSDKNRQVHKKEERKSFENLIKKFPIETATASGGI 490 Query: 892 SGRKLDHGSGSSYSFAEQKGSYGTSHVE----GRSENGSAEISDHVASSGKSKYLEYREE 1059 R+ S +++ S GTS +E G S+EI + K + Y ++ Sbjct: 491 FNRQ--GRLQPSNLVTDERDSGGTSVIEKDGVGVQSRFSSEIISQGGTENTPKPVLYCKD 548 Query: 1060 ELFHFEKSGMWIFPSDQKQLCITTAGKRAMNVLGSRKTEAGYHQKRKTGFLSGHVDKDGT 1239 E F FE G IF D ++LCITT+ K+ MNV+G TE HQ++ L H G+ Sbjct: 549 ENFSFESCGSCIFLDDTRRLCITTSDKQVMNVMGGSPTEVDIHQRK----LKVH----GS 600 Query: 1240 EHEKKKDHNKTQNQKKSRPESSSG-DASGGLVAQAIKVAVVVGFGILVFLTRQREP 1404 + K KT ++ K R ESS+ + S G VA+ A+++ I++ T + P Sbjct: 601 SDIEGKQAQKTSHRVKKRLESSADVEPSRGPVARV--RALILAPKIMIQTTNSKPP 654 >ref|XP_004488037.1| PREDICTED: uncharacterized protein LOC101498984 isoform X5 [Cicer arietinum] Length = 337 Score = 103 bits (256), Expect = 3e-19 Identities = 88/334 (26%), Positives = 152/334 (45%), Gaps = 21/334 (6%) Frame = +1 Query: 478 SEQQGPLEKTPSNVRKLVSAFESSVAQDSKSPIKPSAXXXXXXXXXXXXYLKDSDPVETF 657 SE++ P ++TPSNV+K+++AFES + +D +S IKP ++ +E Sbjct: 30 SEKKAPPKRTPSNVKKMITAFESGLPKDMRSHIKPPPTKYQVSPIEKEDS-SEAQHLEQD 88 Query: 658 SGRPKKPFIDRGLKATGKGGEKIDSGRIIVDDMSMDTKRLKQSASANIESRIKNPSSTEW 837 K+P +G E++ S ++ + S+ +L A Sbjct: 89 KSLNKEP--------SGFLQERVKSASLVTKEESIGQIKLLNYAQP-------------- 126 Query: 838 
AKSSLDLE---TQTSAKSGSVSGRKLDHGSGSSYSFAEQKGSYGTSHV-EGRSENGSAEI 1005 K+++ LE T T K + R D ++ + A K T+ + E + +G + Sbjct: 127 -KNTMQLELSTTNTLNKQTDSNARNKDQVEETNNNEAYSKHHMMTTSIFETVTVSGKMPL 185 Query: 1006 SDHVASSGKSKYLEYR-------EEELFHFEKSGMWIFPSDQKQLCITTAGKRAMNVLGS 1164 + GK+ + Y + + + FE S WIFP + +++C+TT+GK M++L + Sbjct: 186 KEE-KRRGKAPEVSYETCTERDLDNKYYSFESSEAWIFPHESRRICVTTSGKSVMDILEN 244 Query: 1165 RKTEAGYHQKRKTGFLSGHVDKD---------GTEHEKKKDHNKTQNQKKSRPESSSGDA 1317 T+ H ++ F V+ GTE K + K + +++GD Sbjct: 245 EDTK---HLSQQRSFDFPKVENKEKNATYIGTGTEGSKYEKVQDILESKTTTASNNNGDE 301 Query: 1318 -SGGLVAQAIKVAVVVGFGILVFLTRQREPRKSK 1416 SGG Q IK A+++GFG+LV LTRQR+ RK K Sbjct: 302 NSGGPFDQVIKAAIIIGFGLLVLLTRQRKKRKEK 335 >ref|XP_004488036.1| PREDICTED: uncharacterized protein LOC101498984 isoform X4 [Cicer arietinum] Length = 338 Score = 102 bits (255), Expect = 4e-19 Identities = 87/336 (25%), Positives = 154/336 (45%), Gaps = 21/336 (6%) Frame = +1 Query: 478 SEQQGPLEKTPSNVRKLVSAFESSVAQDSKSPIKPSAXXXXXXXXXXXXYLKDSDPVETF 657 SE++ P ++TPSNV+K+++AFES + +D +S IKP ++ +E Sbjct: 30 SEKKAPPKRTPSNVKKMITAFESGLPKDMRSHIKPPPTKYQVSPIEKEDS-SEAQHLEQD 88 Query: 658 SGRPKKPFIDRGLKATGKGGEKIDSGRIIVDDMSMDTKRLKQSASANIESRIKNPSSTEW 837 K+P +G E++ S ++ + S+ +L A Sbjct: 89 KSLNKEP--------SGFLQERVKSASLVTKEESIGQIKLLNYAQP-------------- 126 Query: 838 AKSSLDLE---TQTSAKSGSVSGRKLDHGSGSSYSFAEQKGSYGTSHV-EGRSENGSAEI 1005 K+++ LE T T K + R D ++ + A K T+ + E + +G + Sbjct: 127 -KNTMQLELSTTNTLNKQTDSNARNKDQVEETNNNEAYSKHHMMTTSIFETVTVSGKMPL 185 Query: 1006 SDHVASSGKSKYLEYR-------EEELFHFEKSGMWIFPSDQKQLCITTAGKRAMNVLGS 1164 + GK+ + Y + + + FE S WIFP + +++C+TT+GK M++L + Sbjct: 186 KEE-KRRGKAPEVSYETCTERDLDNKYYSFESSEAWIFPHESRRICVTTSGKSVMDILEN 244 Query: 1165 RKTEAGYHQKRKTGFLSGHVDKD---------GTEHEKKKDHNKTQNQKKSRPESSSGDA 1317 T+ H ++ F V+ GTE K + K + +++GD Sbjct: 245 EDTK---HLSQQRSFDFPKVENKEKNATYIGTGTEGSKYEKVQDILESKTTTASNNNGDE 301 Query: 1318 -SGGLVAQAIKVAVVVGFGILVFLTRQREPRKSKKD 1422 SGG Q IK A+++GFG+LV LTRQR+ R+ +K+ Sbjct: 302 
NSGGPFDQVIKAAIIIGFGLLVLLTRQRKKRRKEKN 337 >ref|XP_004488034.1| PREDICTED: uncharacterized protein LOC101498984 isoform X2 [Cicer arietinum] Length = 346 Score = 99.4 bits (246), Expect = 4e-18 Identities = 86/331 (25%), Positives = 150/331 (45%), Gaps = 21/331 (6%) Frame = +1 Query: 478 SEQQGPLEKTPSNVRKLVSAFESSVAQDSKSPIKPSAXXXXXXXXXXXXYLKDSDPVETF 657 SE++ P ++TPSNV+K+++AFES + +D +S IKP ++ +E Sbjct: 21 SEKKAPPKRTPSNVKKMITAFESGLPKDMRSHIKPPPTKYQVSPIEKEDS-SEAQHLEQD 79 Query: 658 SGRPKKPFIDRGLKATGKGGEKIDSGRIIVDDMSMDTKRLKQSASANIESRIKNPSSTEW 837 K+P +G E++ S ++ + S+ +L A Sbjct: 80 KSLNKEP--------SGFLQERVKSASLVTKEESIGQIKLLNYAQP-------------- 117 Query: 838 AKSSLDLE---TQTSAKSGSVSGRKLDHGSGSSYSFAEQKGSYGTSHV-EGRSENGSAEI 1005 K+++ LE T T K + R D ++ + A K T+ + E + +G + Sbjct: 118 -KNTMQLELSTTNTLNKQTDSNARNKDQVEETNNNEAYSKHHMMTTSIFETVTVSGKMPL 176 Query: 1006 SDHVASSGKSKYLEYR-------EEELFHFEKSGMWIFPSDQKQLCITTAGKRAMNVLGS 1164 + GK+ + Y + + + FE S WIFP + +++C+TT+GK M++L + Sbjct: 177 KEE-KRRGKAPEVSYETCTERDLDNKYYSFESSEAWIFPHESRRICVTTSGKSVMDILEN 235 Query: 1165 RKTEAGYHQKRKTGFLSGHVDK---------DGTEHEKKKDHNKTQNQKKSRPESSSGDA 1317 T+ H ++ F V+ GTE K + K + +++GD Sbjct: 236 EDTK---HLSQQRSFDFPKVENKEKNATYIGTGTEGSKYEKVQDILESKTTTASNNNGDE 292 Query: 1318 -SGGLVAQAIKVAVVVGFGILVFLTRQREPR 1407 SGG Q IK A+++GFG+LV LTRQR+ R Sbjct: 293 NSGGPFDQVIKAAIIIGFGLLVLLTRQRKKR 323 >ref|XP_004488033.1| PREDICTED: uncharacterized protein LOC101498984 isoform X1 [Cicer arietinum] Length = 355 Score = 99.4 bits (246), Expect = 4e-18 Identities = 86/331 (25%), Positives = 150/331 (45%), Gaps = 21/331 (6%) Frame = +1 Query: 478 SEQQGPLEKTPSNVRKLVSAFESSVAQDSKSPIKPSAXXXXXXXXXXXXYLKDSDPVETF 657 SE++ P ++TPSNV+K+++AFES + +D +S IKP ++ +E Sbjct: 30 SEKKAPPKRTPSNVKKMITAFESGLPKDMRSHIKPPPTKYQVSPIEKEDS-SEAQHLEQD 88 Query: 658 SGRPKKPFIDRGLKATGKGGEKIDSGRIIVDDMSMDTKRLKQSASANIESRIKNPSSTEW 837 K+P +G E++ S ++ + S+ +L A Sbjct: 89 KSLNKEP--------SGFLQERVKSASLVTKEESIGQIKLLNYAQP-------------- 126 Query: 838 
AKSSLDLE---TQTSAKSGSVSGRKLDHGSGSSYSFAEQKGSYGTSHV-EGRSENGSAEI 1005 K+++ LE T T K + R D ++ + A K T+ + E + +G + Sbjct: 127 -KNTMQLELSTTNTLNKQTDSNARNKDQVEETNNNEAYSKHHMMTTSIFETVTVSGKMPL 185 Query: 1006 SDHVASSGKSKYLEYR-------EEELFHFEKSGMWIFPSDQKQLCITTAGKRAMNVLGS 1164 + GK+ + Y + + + FE S WIFP + +++C+TT+GK M++L + Sbjct: 186 KEE-KRRGKAPEVSYETCTERDLDNKYYSFESSEAWIFPHESRRICVTTSGKSVMDILEN 244 Query: 1165 RKTEAGYHQKRKTGFLSGHVDK---------DGTEHEKKKDHNKTQNQKKSRPESSSGDA 1317 T+ H ++ F V+ GTE K + K + +++GD Sbjct: 245 EDTK---HLSQQRSFDFPKVENKEKNATYIGTGTEGSKYEKVQDILESKTTTASNNNGDE 301 Query: 1318 -SGGLVAQAIKVAVVVGFGILVFLTRQREPR 1407 SGG Q IK A+++GFG+LV LTRQR+ R Sbjct: 302 NSGGPFDQVIKAAIIIGFGLLVLLTRQRKKR 332 >ref|XP_006451456.1| hypothetical protein CICLE_v10010605mg, partial [Citrus clementina] gi|557554682|gb|ESR64696.1| hypothetical protein CICLE_v10010605mg, partial [Citrus clementina] Length = 577 Score = 98.6 bits (244), Expect = 7e-18 Identities = 96/353 (27%), Positives = 159/353 (45%), Gaps = 30/353 (8%) Frame = +1 Query: 388 GQEASCKLEYSPE--VAAKSTSVSSK---EGHVDKSEQQGPLEKTPSNVRKLVSAFESSV 552 G + + L S E VAAK+ ++ K G +++ E+Q P EK PSNVR +++AFESS+ Sbjct: 211 GSKEASSLSKSSELKVAAKNKLIARKLEEAGSINR-EKQSPAEKIPSNVRNMINAFESSL 269 Query: 553 AQDSKSPIKP----------SAXXXXXXXXXXXXYLKDSDPVETFSGRPKKPFIDRGL-K 699 +QD + IKP S+ + P SGR K PF+ + Sbjct: 270 SQDIRPYIKPAPAKSQLRKISSEASLTSLSADEFKTEQIKPAALMSGRIKTPFLTGEFQQ 329 Query: 700 ATGKGGEKIDS-GRIIVDDMSMDTKRLKQ--SASANIESRIKNPSSTEWAKSSLDLETQT 870 AT K D G + D + +Q + ++ + +KNP ++ SS L ++ Sbjct: 330 ATMHTRAKEDQLGYVKAFDGYTAHQGTRQFGLSPVDVRTEVKNPDMSD-KNSSEGLMRES 388 Query: 871 SAKSGSVSGRKLD-----HGSGSSYSFAEQKGSYGT--SHVEGRSENGSAEISDHVASSG 1029 + K+ +VSGR +D G +Q G + ++G + S E+ S Sbjct: 389 TGKAATVSGRMVDEHIRRQHPGKLLLNEQQSGGKSSIKESMKGVRQEYSLEVDSKGTSIN 448 Query: 1030 KSKYLEYREEELFHFEKSGMWIFPSDQKQLCITTAGKRAMNVLGSRKTEAGYHQKRKTGF 1209 K KY E ++ + G W+FP+ + LCIT GK ++++G E H++ K Sbjct: 449 KLKYKENWKDIHYSSNCPGTWMFPTGSRNLCITAGGKHLIDLMGICHAEVEIHREEKFP- 507 Query: 1210 
LSGHVDKDGTEHEKKKDHNKTQNQKKSR-PE---SSSGDASGGLVAQAIKVAV 1356 +V+K T K + ++ KK+R PE S + S G V Q + ++ Sbjct: 508 APENVEKSSTRPGCNKGNEVDESSKKARKPELVNSEDNENSRGAVGQVLSYSI 560 >ref|XP_004288940.1| PREDICTED: uncharacterized protein LOC101298548 [Fragaria vesca subsp. vesca] Length = 558 Score = 98.2 bits (243), Expect = 9e-18 Identities = 95/337 (28%), Positives = 149/337 (44%), Gaps = 30/337 (8%) Frame = +1 Query: 481 EQQGPLEKTPSNVRKLVSAFESSVAQDSKSPIKPSAXXXXXXXXXXXXYLKDSDPVETFS 660 +QQ PL+K+ S+V+ ++SAFE+ +A+D K IKPS + K+ + E Sbjct: 197 KQQSPLKKS-SSVKNMISAFETGLAEDKKPHIKPSPMKVQSNKTLTGNFPKNQNLKED-- 253 Query: 661 GRPKKPFIDRGLKATGKGGEKIDSGRIIVD-------DMSMDTKRL---KQSASANIESR 810 KK + + + + G++ D + ++ + L K S I+SR Sbjct: 254 --KKKVNTETSESVSKRAVNSLPPGQLQHDRTYGGSSEKNISLEALGGTKSSHDTGIKSR 311 Query: 811 IK---NPSSTEWAKSSLDLETQTSAKSGSVSGRKLD-HGSGSSYSFAEQKGSYGTSHVEG 978 K K D T +S VS R LD H S ++ S G+ +E Sbjct: 312 SKLLHQEVHINEDKVLTDFSTTSSEIVVKVSERVLDDHRHQPSKMLHGKRDSGGSPVIE- 370 Query: 979 RSENGSAEISDHVASSGKSKYLEYR--------EEELFHFEKSGMWIFPSDQKQLCITTA 1134 E+ EIS + + + + E+ FE+SG WIFP + CITT+ Sbjct: 371 --ESHQPEISSRNSQKSNIQGVAAQICTSTAKCEDRHNPFERSGGWIFPDEAIHFCITTS 428 Query: 1135 GKRAMNVLGSRK-TEAGYHQKRKTGFLSGHVDKDGTEHEKKKDHNKTQNQKKSRP----- 1296 GK+ M+ LG + G HQ K+ + + + +N+KK+RP Sbjct: 429 GKKVMDFLGGHNLIKPGMHQDGKSLSPPENAVDQQENADSSNGNEVNRNEKKNRPREQHK 488 Query: 1297 --ESSSGDASGGLVAQAIKVAVVVGFGILVFLTRQRE 1401 +S + SG V QAIK A+++GFG LV LTRQR+ Sbjct: 489 VQKSEETETSGVAVGQAIKAAIMIGFGTLVLLTRQRK 525 >gb|EOY30469.1| Uncharacterized protein isoform 2 [Theobroma cacao] Length = 576 Score = 95.5 bits (236), Expect = 6e-17 Identities = 98/353 (27%), Positives = 155/353 (43%), Gaps = 32/353 (9%) Frame = +1 Query: 445 SVSSKEGHVDKSEQQGPLEKTPSNVRKLVSAFESSVAQDSKSPIKPSAXXXXXXXXXXXX 624 SV S+ + S++ G K SNVRK++SAFE + QD KS IKP Sbjct: 224 SVPSELEKANNSKKLGAAGKAHSNVRKMISAFEDGLNQDMKSSIKPLPKKPQTRNIGMDS 283 Query: 625 YLKDS--DPVETFSGRPKKPFIDR-GLKATGKGG----EKIDS----GRIIVDDMSMDTK 771 +L +S + VET P K + R K + EK+ + I S 
+T+ Sbjct: 284 FLANSQLNEVETEKIIPPKANLGRINTKEFEQTNIYFREKVQTIGCVKPIYEAASSKETQ 343 Query: 772 RLKQSASANIESRIKN-----------PSSTEWAKSSLDLETQTSAKSGSVSGRKLD-HG 915 +LK+S +A I++ KN S E + E + + + + S R LD H Sbjct: 344 QLKESNAACIQTERKNLDLKNKFKVIQKESDEKEEKKYSEEFKRALEKAAFSRRMLDKHS 403 Query: 916 SGSSYSFAEQKGSYGTSHVEGRSENGSAEISDHVASSG----KSKYLEYREEELFHFEKS 1083 G+ K + + ++ + + D + G K K + ++ S Sbjct: 404 KGNQSWNLFSKKQHSSRNLVAKEGGDEIFLKDPRGAEGNLNEKLKSVAIWSDDHCSIGSS 463 Query: 1084 GMWIFPSDQKQLCITTAGKRAMNVLGSRKTEAGYHQ---KRKTGFLSGHVDKD-GTEHEK 1251 G+WIFP + K LCITT GK+ M++ G E HQ + +G V+ D GT +E Sbjct: 464 GLWIFPGEAKCLCITTGGKQIMDLTGGFWDETNTHQIKLSARDPKNTGEVNADAGTGNEA 523 Query: 1252 KKDHNKTQNQKKSRPESS-SGDASGGLVAQAIKVAVVVGFGILVFLTRQREPR 1407 D + + + + E+S + + G V Q I+ ++V F LV LTR+R R Sbjct: 524 NGDAKSSSQKLRPKLENSRDPEHTIGPVGQVIRAIIMVSFATLVLLTRKRTYR 576 >gb|EOY30468.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 575 Score = 95.5 bits (236), Expect = 6e-17 Identities = 98/353 (27%), Positives = 155/353 (43%), Gaps = 32/353 (9%) Frame = +1 Query: 445 SVSSKEGHVDKSEQQGPLEKTPSNVRKLVSAFESSVAQDSKSPIKPSAXXXXXXXXXXXX 624 SV S+ + S++ G K SNVRK++SAFE + QD KS IKP Sbjct: 223 SVPSELEKANNSKKLGAAGKAHSNVRKMISAFEDGLNQDMKSSIKPLPKKPQTRNIGMDS 282 Query: 625 YLKDS--DPVETFSGRPKKPFIDR-GLKATGKGG----EKIDS----GRIIVDDMSMDTK 771 +L +S + VET P K + R K + EK+ + I S +T+ Sbjct: 283 FLANSQLNEVETEKIIPPKANLGRINTKEFEQTNIYFREKVQTIGCVKPIYEAASSKETQ 342 Query: 772 RLKQSASANIESRIKN-----------PSSTEWAKSSLDLETQTSAKSGSVSGRKLD-HG 915 +LK+S +A I++ KN S E + E + + + + S R LD H Sbjct: 343 QLKESNAACIQTERKNLDLKNKFKVIQKESDEKEEKKYSEEFKRALEKAAFSRRMLDKHS 402 Query: 916 SGSSYSFAEQKGSYGTSHVEGRSENGSAEISDHVASSG----KSKYLEYREEELFHFEKS 1083 G+ K + + ++ + + D + G K K + ++ S Sbjct: 403 KGNQSWNLFSKKQHSSRNLVAKEGGDEIFLKDPRGAEGNLNEKLKSVAIWSDDHCSIGSS 462 Query: 1084 GMWIFPSDQKQLCITTAGKRAMNVLGSRKTEAGYHQ---KRKTGFLSGHVDKD-GTEHEK 1251 G+WIFP + K LCITT GK+ M++ G E HQ + +G V+ D GT +E Sbjct: 463 
GLWIFPGEAKCLCITTGGKQIMDLTGGFWDETNTHQIKLSARDPKNTGEVNADAGTGNEA 522 Query: 1252 KKDHNKTQNQKKSRPESS-SGDASGGLVAQAIKVAVVVGFGILVFLTRQREPR 1407 D + + + + E+S + + G V Q I+ ++V F LV LTR+R R Sbjct: 523 NGDAKSSSQKLRPKLENSRDPEHTIGPVGQVIRAIIMVSFATLVLLTRKRTYR 575 >gb|EOY30470.1| Uncharacterized protein isoform 3 [Theobroma cacao] Length = 577 Score = 92.8 bits (229), Expect = 4e-16 Identities = 97/354 (27%), Positives = 153/354 (43%), Gaps = 33/354 (9%) Frame = +1 Query: 445 SVSSKEGHVDKSEQQGPLEKTPSNVRKLVSAFESSVAQDSKSPIKPSAXXXXXXXXXXXX 624 SV S+ + S++ G K SNVRK++SAFE + QD KS IKP Sbjct: 224 SVPSELEKANNSKKLGAAGKAHSNVRKMISAFEDGLNQDMKSSIKPLPKKPQTRNIGMDS 283 Query: 625 YLKDS--DPVETFSGRPKKPFIDR-GLKATGKGG----EKIDS----GRIIVDDMSMDTK 771 +L +S + VET P K + R K + EK+ + I S +T+ Sbjct: 284 FLANSQLNEVETEKIIPPKANLGRINTKEFEQTNIYFREKVQTIGCVKPIYEAASSKETQ 343 Query: 772 RLKQSASANIESRIKN-----------PSSTEWAKSSLDLETQTSAKSGSVSGRKLD-HG 915 +LK+S +A I++ KN S E + E + + + + S R LD H Sbjct: 344 QLKESNAACIQTERKNLDLKNKFKVIQKESDEKEEKKYSEEFKRALEKAAFSRRMLDKHS 403 Query: 916 SGSSYSFAEQKGSYGTSHVEGRSENGSAEISDHVASSG----KSKYLEYREEELFHFEKS 1083 G+ K + + ++ + + D + G K K + ++ S Sbjct: 404 KGNQSWNLFSKKQHSSRNLVAKEGGDEIFLKDPRGAEGNLNEKLKSVAIWSDDHCSIGSS 463 Query: 1084 GMWIFPSDQKQLCITTAGKRAMNVLGSRKTEAGYHQ---KRKTGFLSGHVDKD-GTEHEK 1251 G+WIFP + K LCITT GK+ M++ G E HQ + +G V+ D GT +E Sbjct: 464 GLWIFPGEAKCLCITTGGKQIMDLTGGFWDETNTHQIKLSARDPKNTGEVNADAGTGNEA 523 Query: 1252 KKDHNKTQNQKKSRPESSSG--DASGGLVAQAIKVAVVVGFGILVFLTRQREPR 1407 D + + + + E+S G + Q I+ ++V F LV LTR+R R Sbjct: 524 NGDAKSSSQKLRPKLENSRDPEHTIGPVGQQVIRAIIMVSFATLVLLTRKRTYR 577 >gb|EXB38835.1| hypothetical protein L484_027269 [Morus notabilis] Length = 608 Score = 80.5 bits (197), Expect = 2e-12 Identities = 88/315 (27%), Positives = 140/315 (44%), Gaps = 10/315 (3%) Frame = +1 Query: 439 STSVSSKEGHVDKSEQQGPLEKTPSNVRKLVSAFESSVAQDSKSPIKPSAXXXXXXXXXX 618 S+ SS+ G S ++ LE+ PS+VR ++SAFES+++QD+KS IKPS Sbjct: 321 SSMGSSESGSY--SRKRSSLERIPSSVRNMISAFESNLSQDTKSHIKPSP---------- 368 
Query: 619 XXYLKDSDPVETFSGRPKKPFIDRGLKATGKGGEKIDSGRIIVDDMSMDTKRLKQSASAN 798 P + S + I K + G + ++++S R K + + + Sbjct: 369 --------PTKVQSNK-----IGSQSKEAKTENTETMRGSVQLEELSTTKVRGKGTNNTS 415 Query: 799 IESRIK---NPSSTEWAKSSLDLETQ----TSAKSGSVSGRKLDHGSGSSYSFAEQKGSY 957 +S++K +TE KS D TS+ +VS R+ H S S + + Sbjct: 416 TKSKLKFTHKELATEEEKSHGDSTRTPMITTSSAVDNVSDRQ--HSSKSQTKISSETSPS 473 Query: 958 GTSHVEGRSENGSAEISDHVASSGKSKYLEYREEELFHFEKSGMWIFPSDQKQLCITTAG 1137 T EI+ H + +E+ E+ F+ WIFP ++ C+TT G Sbjct: 474 FTMR----------EITTH-----NLRPVEHYEDMHDSFDSFSAWIFPDQMRRFCVTTGG 518 Query: 1138 KRAMNVL---GSRKTEAGYHQKRKTGFLSGHVDKDGTEHEKKKDHNKTQNQKKSRPESSS 1308 K+ M+ + GS K+E K D GT +E K+ KT +KS+PES Sbjct: 519 KKLMDFVGGYGSIKSEV-RQGKVNISIPEKSNDNGGTRNEIKRS-KKTHKARKSKPESED 576 Query: 1309 GDASGGLVAQAIKVA 1353 ++S G V Q IK+A Sbjct: 577 VESSTGPVGQ-IKLA 590 >ref|XP_002514104.1| conserved hypothetical protein [Ricinus communis] gi|223546560|gb|EEF48058.1| conserved hypothetical protein [Ricinus communis] Length = 556 Score = 79.3 bits (194), Expect = 5e-12 Identities = 84/330 (25%), Positives = 145/330 (43%), Gaps = 20/330 (6%) Frame = +1 Query: 481 EQQGPLEKTPSNVRKLVSAFESSV--AQDSKSPIKPSAXXXXXXXXXXXXYLKDSDPVET 654 E+Q ++KTPS +R ++SAFES+V QD K I+P++ D E Sbjct: 252 EKQSTIKKTPSKIRNMISAFESNVNQKQDMKPKIRPTSIKSE----------SDKTRAEG 301 Query: 655 FSGRPKKPFIDRGLKATGKGGEKIDSGRIIVDDMSMD-TKRLKQSASANIESRIK----N 819 S R ++ + + DS R + ++ T L + I R K N Sbjct: 302 SSSR----HLNEVKVENAELAQLSDSVRNAIHTRDLELTPSLIREREEQIGIRTKGTTQN 357 Query: 820 PSSTEWAKSSLDLETQTSAKSG----SVSGRKLDHGSGSSYSFAEQKGSYGTSHVEGRSE 987 K ++++ + ++ G S SGR + S +KG + ++ GR Sbjct: 358 LKDKTKVKQKVNIQEEKTSYEGFARVSTSGR-----ASVSGRMLNEKGRHPLRNLIGRRR 412 Query: 988 NGSAEISDHVASSG----KSKYLEYREEELFH----FEKSGMWIFPSDQKQLCITTAGKR 1143 +I + + G ++L ++ + E +G WIFP K++CITT K+ Sbjct: 413 LSGDKIVEQKSVKGIQPKDMQHLNNQDRASSNDPCPSECNGAWIFPDGGKRMCITTNAKQ 472 Query: 1144 AMNVLGSRKTEAGYHQKRKTGFLSGHVDKDGTEHEKKKDHNKTQNQKKSR-PESSSGDAS 1320 MN++G EA K + G L+ V ++ + E +D +Q K+ + +S+ + 
Sbjct: 473 MMNLMGGFFAEA----KNQLGNLTSPVAENAKQVE--EDGEASQRYKEPKVDDSTDAETP 526 Query: 1321 GGLVAQAIKVAVVVGFGILVFLTRQREPRK 1410 G V Q ++V ++VGF LV RQR+ K Sbjct: 527 RGPVGQVMRVVIMVGFATLVLFARQRKSDK 556 >ref|XP_006381679.1| hypothetical protein POPTR_0006s15560g [Populus trichocarpa] gi|550336409|gb|ERP59476.1| hypothetical protein POPTR_0006s15560g [Populus trichocarpa] Length = 565 Score = 73.2 bits (178), Expect = 3e-10 Identities = 69/276 (25%), Positives = 120/276 (43%), Gaps = 28/276 (10%) Frame = +1 Query: 421 PEVAAKSTSVSSKEGHVDKS-EQQGPLEKTPSNVRKLVSAFESSVAQDSKS-----PIKP 582 P+V A +GH D +Q P+ KTPSNVR +++AFESS+ QD K PIK Sbjct: 264 PKVTASDKIPVKLKGHGDSVLGKQNPVNKTPSNVRNMITAFESSLNQDVKPKETPPPIKS 323 Query: 583 SAXXXXXXXXXXXXYLKD-----SDPVETFSGRPKKPFIDRGLKATGKG---GEKIDSGR 738 ++ + + + P ++ G+ + P++ ++ K GE+ G Sbjct: 324 ASGRLEMEFSPKCFWSDEVRTEKNIPEQSLPGKDRSPYLIEDMQGASKNIREGEE-HVGF 382 Query: 739 IIVDDMSMDTKRLKQSASANIESRIKNPSSTEWAKSSLDLETQTSA---KSGSVSGRKL- 906 + ++ ++ +S ++ +N S K+ L L + K+ V R L Sbjct: 383 VRAPTVATSSQGTGKSEEELSDASFRNKGSNVVLKNKLQLMDKADIGKEKTSDVLLRALV 442 Query: 907 -DHGSGSSYSFAEQKGSYGTSHVEGRSENGSAEISDHVASSGKSKYLEYRE--------- 1056 D S S E G + + +N + + SGK + + E Sbjct: 443 GDKASNSGRMLNEYLGKHPYCKLLAGKKNSGGTLL--ITKSGKETHSKDLERISIQEGSG 500 Query: 1057 EELFHFEKSGMWIFPSDQKQLCITTAGKRAMNVLGS 1164 + + E G WIFP ++++LCITTAG + +N++GS Sbjct: 501 DAHYSSECYGAWIFPYERRRLCITTAGTQILNLMGS 536 >ref|XP_004141054.1| PREDICTED: uncharacterized protein LOC101204948 [Cucumis sativus] gi|449488032|ref|XP_004157922.1| PREDICTED: uncharacterized protein LOC101224166 [Cucumis sativus] Length = 137 Score = 72.4 bits (176), Expect = 6e-10 Identities = 39/135 (28%), Positives = 75/135 (55%), Gaps = 1/135 (0%) Frame = +1 Query: 1009 DHVASSGKSKYLEYREEELFHFEKSGMWIFPSDQKQLCITTAGKRAMNVLGSRKTEAGYH 1188 D + Y +++ F ++SG WIFP ++++LC+TT+ + ++ G + + Sbjct: 6 DDIDVKNDESYQSRVQDKQFLSKRSGGWIFPDERRRLCVTTSDNQIQDLAGGGRISYTFV 65 Query: 1189 QKRKTGFLSGHVDKDGTEHEKKKDHNKTQNQKKSRPESSSG-DASGGLVAQAIKVAVVVG 1365 +K G 
+ ++ E K + K+++Q+ +P+SS G +A+A+K+A++VG Sbjct: 66 RK---GEMKISTEESRGTSETKANGGKSEHQEMIKPDSSDDVKPFEGALAKALKIAIMVG 122 Query: 1366 FGILVFLTRQREPRK 1410 FG LV TRQR+ +K Sbjct: 123 FGTLVLFTRQRKKKK 137 >ref|XP_003534912.1| PREDICTED: transcriptional regulator ATRX homolog isoform X1 [Glycine max] Length = 429 Score = 59.3 bits (142), Expect = 5e-06 Identities = 68/264 (25%), Positives = 114/264 (43%), Gaps = 26/264 (9%) Frame = +1 Query: 706 GKGGEKIDSGRIIVDDMSMDTKRLKQSASANIESRIKNPSSTEWAKSSLDLET------- 864 G G K R +V K+L + NI + + S T+ ++ L Sbjct: 171 GSGDGKESETRSVVGAQLEQKKQL----TTNIADQYRESSYTKPVSQNVTLTQLQQKGKK 226 Query: 865 ---QTSAKSGSVSGRKLDHGSGS---SYSFAEQKGSYGTSHV-EGRSENGSAEISDHVAS 1023 Q+S+ S S + G GS + F +K TS++ + + E + S S Sbjct: 227 PAYQSSSPDRSPSEKHPQRGIGSEEIAVLFGSEKKDALTSNLTQPKIEESDLQYSKKRTS 286 Query: 1024 SGK-----SKYLEYREEELFHFEKSGMWIFPSDQKQLCITTAGKRAMNVLGSRKTE---- 1176 G+ K + E L ++ + P+ Q++ I L K++ Sbjct: 287 LGRIPSNVKKMISAFEGGLDQDKRPQIKPPPTKQQESSIERRDSSKTQHLEQDKSKNIDP 346 Query: 1177 AGYHQKRKTGFLSGHVDKDGTEHEKKKDHNKTQNQKKSRPESSSGDA---SGGLVAQAIK 1347 A ++ K+ L+ D GTE + ++ K Q K+S+P++S + SGG Q +K Sbjct: 347 ADLQERVKSASLNEANDGTGTE---ESEYEKIQETKESKPKTSDNNGDENSGGPFNQVVK 403 Query: 1348 VAVVVGFGILVFLTRQREPRKSKK 1419 VA+++GFG+LV LTRQR+ R KK Sbjct: 404 VAIIIGFGLLVLLTRQRKRRMKKK 427