BLASTX nr result

ID: Forsythia21_contig00002118 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Forsythia21_contig00002118
         (1327 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011089175.1| PREDICTED: uncharacterized protein LOC105170...   370   1e-99
ref|XP_002285757.1| PREDICTED: uncharacterized protein LOC100265...   363   2e-97
ref|XP_002303980.2| hypothetical protein POPTR_0003s20820g [Popu...   357   1e-95
ref|XP_007205626.1| hypothetical protein PRUPE_ppa009156mg [Prun...   356   2e-95
ref|XP_011025116.1| PREDICTED: uncharacterized protein LOC105126...   355   4e-95
ref|XP_006488956.1| PREDICTED: uncharacterized protein LOC102627...   355   6e-95
ref|XP_006445559.1| hypothetical protein CICLE_v10015880mg [Citr...   355   6e-95
ref|XP_012091719.1| PREDICTED: uncharacterized protein LOC105649...   354   7e-95
gb|KDP21050.1| hypothetical protein JCGZ_21521 [Jatropha curcas]      354   7e-95
gb|KDO54511.1| hypothetical protein CISIN_1g021860mg [Citrus sin...   352   3e-94
emb|CBI16564.3| unnamed protein product [Vitis vinifera]              350   1e-93
ref|XP_006359198.1| PREDICTED: uncharacterized protein LOC102583...   349   2e-93
gb|KDO54510.1| hypothetical protein CISIN_1g021860mg [Citrus sin...   349   3e-93
emb|CDP18571.1| unnamed protein product [Coffea canephora]            348   5e-93
ref|XP_007014028.1| Antitermination NusB domain-containing prote...   347   9e-93
ref|XP_004228949.1| PREDICTED: uncharacterized protein LOC101254...   345   3e-92
gb|KHG01681.1| N utilization substance B [Gossypium arboreum]         345   6e-92
ref|XP_012835305.1| PREDICTED: uncharacterized protein LOC105956...   343   2e-91
ref|XP_010103088.1| hypothetical protein L484_005067 [Morus nota...   342   4e-91
ref|XP_012473006.1| PREDICTED: uncharacterized protein LOC105790...   342   5e-91

>ref|XP_011089175.1| PREDICTED: uncharacterized protein LOC105170208 [Sesamum indicum]
          Length = 293

 Score =  370 bits (950), Expect = 1e-99
 Identities = 189/253 (74%), Positives = 214/253 (84%), Gaps = 9/253 (3%)
 Frame = -2

Query: 1128 PRLLLSCPKSSLLRPWALTTDEIVEN---------SNSKEMMPKIDKSGRFCSPRAAREL 976
            PRLL SCP  S      LT  ++VEN         SNSK+M+PKIDKSGRFCSPRAAREL
Sbjct: 44   PRLLFSCPCPSTS---PLTIAQVVENPTLHSLIPTSNSKDMLPKIDKSGRFCSPRAAREL 100

Query: 975  ALMILYAACLEGSDPVRLFDKRLNARRESGYEFDRASLMEYNHMSFGGPPVXXXXXXEAN 796
            ALMILYAACL+GSDPVRLF+KRLNARRE+GYEFD+A LMEYNHMSFGGPPV      EA+
Sbjct: 101  ALMILYAACLDGSDPVRLFEKRLNARRETGYEFDKAYLMEYNHMSFGGPPVTAETPEEAD 160

Query: 795  ELLSNDEKESEIEAEVLSAPPKLVYSKLILHFTRKLLVAVVEKWDSHVLVINKVAPPNWK 616
            E+L ND+KES+IEAEVLSAPPKLVYSKL+L FTRKLLVAV EKWDS+VL I+KVAP NWK
Sbjct: 161  EILQNDQKESDIEAEVLSAPPKLVYSKLVLRFTRKLLVAVAEKWDSNVLAIDKVAPHNWK 220

Query: 615  DEPAGRILELSILHLAMSEIAVLGTRHQIVINEAVDLAKRFCDGAAPRVINGCLRTFMED 436
            +EPAGRILE S+LHLAMSE+AVLGTRHQIVINEAVDLAKRFCDGAAPR+INGCLR F+++
Sbjct: 221  NEPAGRILEFSVLHLAMSEMAVLGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRNFLKE 280

Query: 435  IKGNVNARELKIK 397
            +K N  A  ++IK
Sbjct: 281  LKENGEAENVEIK 293


>ref|XP_002285757.1| PREDICTED: uncharacterized protein LOC100265613 [Vitis vinifera]
          Length = 288

 Score =  363 bits (931), Expect = 2e-97
 Identities = 186/247 (75%), Positives = 214/247 (86%), Gaps = 1/247 (0%)
 Frame = -2

Query: 1119 LLSCPKSSLLRPWALTTDEIVEN-SNSKEMMPKIDKSGRFCSPRAARELALMILYAACLE 943
            LLS P++SL R  ALT ++ ++  S  +EM+P+IDKSGRFCSPRAARELAL+I YAACLE
Sbjct: 43   LLSSPRTSL-RTSALTVEKPLDKPSEPREMLPRIDKSGRFCSPRAARELALLIAYAACLE 101

Query: 942  GSDPVRLFDKRLNARRESGYEFDRASLMEYNHMSFGGPPVXXXXXXEANELLSNDEKESE 763
            GSDPVRLF++R+NARRE GYEFD+ SL+EYNHMSFGGPPV      EA+ELL N+EKES 
Sbjct: 102  GSDPVRLFERRMNARREPGYEFDKDSLLEYNHMSFGGPPVTTETVEEADELLRNNEKESA 161

Query: 762  IEAEVLSAPPKLVYSKLILHFTRKLLVAVVEKWDSHVLVINKVAPPNWKDEPAGRILELS 583
            IEAEVLSAPPKLVY KLIL FTRKLLVAVV+KW+SHVLVI+KVAPPNWK+EPAGRILEL 
Sbjct: 162  IEAEVLSAPPKLVYGKLILRFTRKLLVAVVDKWNSHVLVIDKVAPPNWKNEPAGRILELC 221

Query: 582  ILHLAMSEIAVLGTRHQIVINEAVDLAKRFCDGAAPRVINGCLRTFMEDIKGNVNARELK 403
            ILHLAMSEIAVLGTRHQIVINEAVDLAKRFCDGAAPR+INGCLRTF++D++G    R  +
Sbjct: 222  ILHLAMSEIAVLGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDLEGTGITRASE 281

Query: 402  IKQTVIS 382
              Q V+S
Sbjct: 282  TTQEVVS 288


>ref|XP_002303980.2| hypothetical protein POPTR_0003s20820g [Populus trichocarpa]
            gi|550343660|gb|EEE78959.2| hypothetical protein
            POPTR_0003s20820g [Populus trichocarpa]
          Length = 308

 Score =  357 bits (915), Expect = 1e-95
 Identities = 177/228 (77%), Positives = 201/228 (88%), Gaps = 3/228 (1%)
 Frame = -2

Query: 1101 SSLLRPWALTTDEIVENS---NSKEMMPKIDKSGRFCSPRAARELALMILYAACLEGSDP 931
            +S LR  A   D+ +++S   N KEMMPKIDKSGRFCSPRAARELAL+I+YAACLEGSDP
Sbjct: 66   ASSLRTSAFVVDKALDDSSPTNYKEMMPKIDKSGRFCSPRAARELALLIIYAACLEGSDP 125

Query: 930  VRLFDKRLNARRESGYEFDRASLMEYNHMSFGGPPVXXXXXXEANELLSNDEKESEIEAE 751
            +RLF+KR+NARRE GYEFD+ASL+EYNHMSFGGPPV      EA+EL  +DEKES IEAE
Sbjct: 126  IRLFEKRMNARREPGYEFDKASLLEYNHMSFGGPPVTTETVEEADELQLSDEKESAIEAE 185

Query: 750  VLSAPPKLVYSKLILHFTRKLLVAVVEKWDSHVLVINKVAPPNWKDEPAGRILELSILHL 571
            VLSAPPKLVYSKL+L FTRKLLVAVV+KWDSHVLVI+KV+PPNWK+EPAGRILE  ILH+
Sbjct: 186  VLSAPPKLVYSKLLLRFTRKLLVAVVDKWDSHVLVIDKVSPPNWKNEPAGRILEFCILHM 245

Query: 570  AMSEIAVLGTRHQIVINEAVDLAKRFCDGAAPRVINGCLRTFMEDIKG 427
            AMSEI VLGTRHQIVINEAVDLAKRFCDGA PR+INGCLRTF++D+ G
Sbjct: 246  AMSEITVLGTRHQIVINEAVDLAKRFCDGAGPRIINGCLRTFLKDLSG 293


>ref|XP_007205626.1| hypothetical protein PRUPE_ppa009156mg [Prunus persica]
            gi|645215178|ref|XP_008221137.1| PREDICTED:
            uncharacterized protein LOC103321114 [Prunus mume]
            gi|462401268|gb|EMJ06825.1| hypothetical protein
            PRUPE_ppa009156mg [Prunus persica]
          Length = 304

 Score =  356 bits (914), Expect = 2e-95
 Identities = 184/231 (79%), Positives = 203/231 (87%), Gaps = 2/231 (0%)
 Frame = -2

Query: 1116 LSCPKSSLLRPWALTTDEIVE--NSNSKEMMPKIDKSGRFCSPRAARELALMILYAACLE 943
            LS P++SL R    T DE +E  NS+S+EM+PKIDKSGRFCSPRAARELAL I+YAACLE
Sbjct: 58   LSSPRTSL-RTSTFTLDEALEKPNSDSREMLPKIDKSGRFCSPRAARELALSIVYAACLE 116

Query: 942  GSDPVRLFDKRLNARRESGYEFDRASLMEYNHMSFGGPPVXXXXXXEANELLSNDEKESE 763
            GSDPVRLF+KR+N RRE GYEFDRASL+EYN MSFGGPPV      EA+ELL NDEKES 
Sbjct: 117  GSDPVRLFEKRMNVRREPGYEFDRASLLEYNPMSFGGPPVTVETVEEADELLRNDEKESA 176

Query: 762  IEAEVLSAPPKLVYSKLILHFTRKLLVAVVEKWDSHVLVINKVAPPNWKDEPAGRILELS 583
            IEAEVL+APPKLVYSKLIL FTRKLLVAV++KWDSHVLVI+KVAPPNWKDEPAGRILEL 
Sbjct: 177  IEAEVLAAPPKLVYSKLILRFTRKLLVAVMDKWDSHVLVIDKVAPPNWKDEPAGRILELC 236

Query: 582  ILHLAMSEIAVLGTRHQIVINEAVDLAKRFCDGAAPRVINGCLRTFMEDIK 430
            ILHLAMSEI VL TRH IVINEAVDLAKRFCDG+APRVINGCLRTF++ I+
Sbjct: 237  ILHLAMSEITVLETRHPIVINEAVDLAKRFCDGSAPRVINGCLRTFVKGIE 287


>ref|XP_011025116.1| PREDICTED: uncharacterized protein LOC105126073 [Populus euphratica]
          Length = 308

 Score =  355 bits (911), Expect = 4e-95
 Identities = 176/228 (77%), Positives = 201/228 (88%), Gaps = 3/228 (1%)
 Frame = -2

Query: 1101 SSLLRPWALTTDEIVENS---NSKEMMPKIDKSGRFCSPRAARELALMILYAACLEGSDP 931
            +S LR  A   D+ +++S   N KEMMPKIDKSGRFCSPRAARELAL+I+YAACLEGSDP
Sbjct: 66   ASSLRTSAFVVDKALDDSSPTNYKEMMPKIDKSGRFCSPRAARELALLIIYAACLEGSDP 125

Query: 930  VRLFDKRLNARRESGYEFDRASLMEYNHMSFGGPPVXXXXXXEANELLSNDEKESEIEAE 751
            +RLF+KR+NARRE GYEFD++SL+EYNHMSFGGPPV      EA+EL  +DEKES IEAE
Sbjct: 126  IRLFEKRMNARREPGYEFDKSSLLEYNHMSFGGPPVTTETVEEADELQLSDEKESAIEAE 185

Query: 750  VLSAPPKLVYSKLILHFTRKLLVAVVEKWDSHVLVINKVAPPNWKDEPAGRILELSILHL 571
            VLSAPPKLVYSKL+L FTRKLLVAVV+KWDSHVLVI+KV+PPNWK+EPAGRILE  ILH+
Sbjct: 186  VLSAPPKLVYSKLLLRFTRKLLVAVVDKWDSHVLVIDKVSPPNWKNEPAGRILEFCILHM 245

Query: 570  AMSEIAVLGTRHQIVINEAVDLAKRFCDGAAPRVINGCLRTFMEDIKG 427
            AMSEI VLGTRHQIVINEAVDLAKRFCDGA PR+INGCLRTF++D+ G
Sbjct: 246  AMSEITVLGTRHQIVINEAVDLAKRFCDGAGPRIINGCLRTFLKDLPG 293


>ref|XP_006488956.1| PREDICTED: uncharacterized protein LOC102627108 [Citrus sinensis]
            gi|641835534|gb|KDO54509.1| hypothetical protein
            CISIN_1g021860mg [Citrus sinensis]
          Length = 302

 Score =  355 bits (910), Expect = 6e-95
 Identities = 175/232 (75%), Positives = 202/232 (87%)
 Frame = -2

Query: 1071 TDEIVENSNSKEMMPKIDKSGRFCSPRAARELALMILYAACLEGSDPVRLFDKRLNARRE 892
            T E V  S++KEMMPKIDKSGRFCSPRAARELAL+++YAACLEGSDP+RLF+KRLN+RRE
Sbjct: 71   TRESVMASSAKEMMPKIDKSGRFCSPRAARELALLVVYAACLEGSDPIRLFEKRLNSRRE 130

Query: 891  SGYEFDRASLMEYNHMSFGGPPVXXXXXXEANELLSNDEKESEIEAEVLSAPPKLVYSKL 712
             GYEFD++SL+EYNHMSFGGPPV      EA+ELL +DE+ES IEAEVLSAPPKLVYSKL
Sbjct: 131  PGYEFDKSSLLEYNHMSFGGPPVTTETVEEADELLRSDEEESAIEAEVLSAPPKLVYSKL 190

Query: 711  ILHFTRKLLVAVVEKWDSHVLVINKVAPPNWKDEPAGRILELSILHLAMSEIAVLGTRHQ 532
            +L FTRKLLVAVV+KWD+HV +I+KV PP WKD+PAGRILELSILHLAMSEI V+GTRHQ
Sbjct: 191  LLRFTRKLLVAVVDKWDAHVHIIDKVVPPIWKDQPAGRILELSILHLAMSEITVVGTRHQ 250

Query: 531  IVINEAVDLAKRFCDGAAPRVINGCLRTFMEDIKGNVNARELKIKQTVISSV 376
            IVINEAVDLAKRFCDGAAPR+INGCLRTF+ +++G  N    K  + V S V
Sbjct: 251  IVINEAVDLAKRFCDGAAPRIINGCLRTFVRNLEGTANIEASKASKEVPSEV 302


>ref|XP_006445559.1| hypothetical protein CICLE_v10015880mg [Citrus clementina]
            gi|557548170|gb|ESR58799.1| hypothetical protein
            CICLE_v10015880mg [Citrus clementina]
          Length = 333

 Score =  355 bits (910), Expect = 6e-95
 Identities = 175/232 (75%), Positives = 202/232 (87%)
 Frame = -2

Query: 1071 TDEIVENSNSKEMMPKIDKSGRFCSPRAARELALMILYAACLEGSDPVRLFDKRLNARRE 892
            T E V  S++KEMMPKIDKSGRFCSPRAARELAL+++YAACLEGSDP+RLF+KRLN+RRE
Sbjct: 102  TRESVMASSAKEMMPKIDKSGRFCSPRAARELALLVVYAACLEGSDPIRLFEKRLNSRRE 161

Query: 891  SGYEFDRASLMEYNHMSFGGPPVXXXXXXEANELLSNDEKESEIEAEVLSAPPKLVYSKL 712
             GYEFD++SL+EYNHMSFGGPPV      EA+ELL +DE+ES IEAEVLSAPPKLVYSKL
Sbjct: 162  PGYEFDKSSLLEYNHMSFGGPPVTTETVEEADELLRSDEEESAIEAEVLSAPPKLVYSKL 221

Query: 711  ILHFTRKLLVAVVEKWDSHVLVINKVAPPNWKDEPAGRILELSILHLAMSEIAVLGTRHQ 532
            +L FTRKLLVAVV+KWD+HV +I+KV PP WKD+PAGRILELSILHLAMSEI V+GTRHQ
Sbjct: 222  LLRFTRKLLVAVVDKWDAHVHIIDKVVPPIWKDQPAGRILELSILHLAMSEITVVGTRHQ 281

Query: 531  IVINEAVDLAKRFCDGAAPRVINGCLRTFMEDIKGNVNARELKIKQTVISSV 376
            IVINEAVDLAKRFCDGAAPR+INGCLRTF+ +++G  N    K  + V S V
Sbjct: 282  IVINEAVDLAKRFCDGAAPRIINGCLRTFVRNLEGTANIEASKASKEVPSEV 333


>ref|XP_012091719.1| PREDICTED: uncharacterized protein LOC105649626 [Jatropha curcas]
          Length = 310

 Score =  354 bits (909), Expect = 7e-95
 Identities = 177/241 (73%), Positives = 204/241 (84%), Gaps = 1/241 (0%)
 Frame = -2

Query: 1107 PKSSLLRPWALTTDEIVENSNS-KEMMPKIDKSGRFCSPRAARELALMILYAACLEGSDP 931
            P+SSL R  A   ++  ENS   KEMMPKIDKSGRFCSPRAARELAL+ +YAACLEGSDP
Sbjct: 69   PRSSL-RTSAFVIEKATENSTKGKEMMPKIDKSGRFCSPRAARELALLTIYAACLEGSDP 127

Query: 930  VRLFDKRLNARRESGYEFDRASLMEYNHMSFGGPPVXXXXXXEANELLSNDEKESEIEAE 751
            +RLF+KR+NARRE GYEF++ASLMEYNHMSFGGPPV      EA++L  +DEKES +E E
Sbjct: 128  IRLFEKRMNARREPGYEFNKASLMEYNHMSFGGPPVTAETVEEADKLQQSDEKESAVEEE 187

Query: 750  VLSAPPKLVYSKLILHFTRKLLVAVVEKWDSHVLVINKVAPPNWKDEPAGRILELSILHL 571
            VLSAPPKLVYSKL+L FTRKLLVAVV+KWDSHVL+I+KV+PPNWK++PAGRILE  ILHL
Sbjct: 188  VLSAPPKLVYSKLLLRFTRKLLVAVVDKWDSHVLIIDKVSPPNWKNQPAGRILEFCILHL 247

Query: 570  AMSEIAVLGTRHQIVINEAVDLAKRFCDGAAPRVINGCLRTFMEDIKGNVNARELKIKQT 391
            AMSEI VLGTRHQIVINEA+DLAKRFCDG+APR+INGCLR+FM+D+ G   AR     Q 
Sbjct: 248  AMSEITVLGTRHQIVINEAIDLAKRFCDGSAPRIINGCLRSFMKDLSGTNEARATDANQK 307

Query: 390  V 388
            V
Sbjct: 308  V 308


>gb|KDP21050.1| hypothetical protein JCGZ_21521 [Jatropha curcas]
          Length = 260

 Score =  354 bits (909), Expect = 7e-95
 Identities = 177/241 (73%), Positives = 204/241 (84%), Gaps = 1/241 (0%)
 Frame = -2

Query: 1107 PKSSLLRPWALTTDEIVENSNS-KEMMPKIDKSGRFCSPRAARELALMILYAACLEGSDP 931
            P+SSL R  A   ++  ENS   KEMMPKIDKSGRFCSPRAARELAL+ +YAACLEGSDP
Sbjct: 19   PRSSL-RTSAFVIEKATENSTKGKEMMPKIDKSGRFCSPRAARELALLTIYAACLEGSDP 77

Query: 930  VRLFDKRLNARRESGYEFDRASLMEYNHMSFGGPPVXXXXXXEANELLSNDEKESEIEAE 751
            +RLF+KR+NARRE GYEF++ASLMEYNHMSFGGPPV      EA++L  +DEKES +E E
Sbjct: 78   IRLFEKRMNARREPGYEFNKASLMEYNHMSFGGPPVTAETVEEADKLQQSDEKESAVEEE 137

Query: 750  VLSAPPKLVYSKLILHFTRKLLVAVVEKWDSHVLVINKVAPPNWKDEPAGRILELSILHL 571
            VLSAPPKLVYSKL+L FTRKLLVAVV+KWDSHVL+I+KV+PPNWK++PAGRILE  ILHL
Sbjct: 138  VLSAPPKLVYSKLLLRFTRKLLVAVVDKWDSHVLIIDKVSPPNWKNQPAGRILEFCILHL 197

Query: 570  AMSEIAVLGTRHQIVINEAVDLAKRFCDGAAPRVINGCLRTFMEDIKGNVNARELKIKQT 391
            AMSEI VLGTRHQIVINEA+DLAKRFCDG+APR+INGCLR+FM+D+ G   AR     Q 
Sbjct: 198  AMSEITVLGTRHQIVINEAIDLAKRFCDGSAPRIINGCLRSFMKDLSGTNEARATDANQK 257

Query: 390  V 388
            V
Sbjct: 258  V 258


>gb|KDO54511.1| hypothetical protein CISIN_1g021860mg [Citrus sinensis]
          Length = 227

 Score =  352 bits (904), Expect = 3e-94
 Identities = 172/225 (76%), Positives = 199/225 (88%)
 Frame = -2

Query: 1050 SNSKEMMPKIDKSGRFCSPRAARELALMILYAACLEGSDPVRLFDKRLNARRESGYEFDR 871
            S++KEMMPKIDKSGRFCSPRAARELAL+++YAACLEGSDP+RLF+KRLN+RRE GYEFD+
Sbjct: 3    SSAKEMMPKIDKSGRFCSPRAARELALLVVYAACLEGSDPIRLFEKRLNSRREPGYEFDK 62

Query: 870  ASLMEYNHMSFGGPPVXXXXXXEANELLSNDEKESEIEAEVLSAPPKLVYSKLILHFTRK 691
            +SL+EYNHMSFGGPPV      EA+ELL +DE+ES IEAEVLSAPPKLVYSKL+L FTRK
Sbjct: 63   SSLLEYNHMSFGGPPVTTETVEEADELLRSDEEESAIEAEVLSAPPKLVYSKLLLRFTRK 122

Query: 690  LLVAVVEKWDSHVLVINKVAPPNWKDEPAGRILELSILHLAMSEIAVLGTRHQIVINEAV 511
            LLVAVV+KWD+HV +I+KV PP WKD+PAGRILELSILHLAMSEI V+GTRHQIVINEAV
Sbjct: 123  LLVAVVDKWDAHVHIIDKVVPPIWKDQPAGRILELSILHLAMSEITVVGTRHQIVINEAV 182

Query: 510  DLAKRFCDGAAPRVINGCLRTFMEDIKGNVNARELKIKQTVISSV 376
            DLAKRFCDGAAPR+INGCLRTF+ +++G  N    K  + V S V
Sbjct: 183  DLAKRFCDGAAPRIINGCLRTFVRNLEGTANIEASKASKEVPSEV 227


>emb|CBI16564.3| unnamed protein product [Vitis vinifera]
          Length = 218

 Score =  350 bits (899), Expect = 1e-93
 Identities = 174/218 (79%), Positives = 195/218 (89%)
 Frame = -2

Query: 1035 MMPKIDKSGRFCSPRAARELALMILYAACLEGSDPVRLFDKRLNARRESGYEFDRASLME 856
            M+P+IDKSGRFCSPRAARELAL+I YAACLEGSDPVRLF++R+NARRE GYEFD+ SL+E
Sbjct: 1    MLPRIDKSGRFCSPRAARELALLIAYAACLEGSDPVRLFERRMNARREPGYEFDKDSLLE 60

Query: 855  YNHMSFGGPPVXXXXXXEANELLSNDEKESEIEAEVLSAPPKLVYSKLILHFTRKLLVAV 676
            YNHMSFGGPPV      EA+ELL N+EKES IEAEVLSAPPKLVY KLIL FTRKLLVAV
Sbjct: 61   YNHMSFGGPPVTTETVEEADELLRNNEKESAIEAEVLSAPPKLVYGKLILRFTRKLLVAV 120

Query: 675  VEKWDSHVLVINKVAPPNWKDEPAGRILELSILHLAMSEIAVLGTRHQIVINEAVDLAKR 496
            V+KW+SHVLVI+KVAPPNWK+EPAGRILEL ILHLAMSEIAVLGTRHQIVINEAVDLAKR
Sbjct: 121  VDKWNSHVLVIDKVAPPNWKNEPAGRILELCILHLAMSEIAVLGTRHQIVINEAVDLAKR 180

Query: 495  FCDGAAPRVINGCLRTFMEDIKGNVNARELKIKQTVIS 382
            FCDGAAPR+INGCLRTF++D++G    R  +  Q V+S
Sbjct: 181  FCDGAAPRIINGCLRTFVKDLEGTGITRASETTQEVVS 218


>ref|XP_006359198.1| PREDICTED: uncharacterized protein LOC102583345 [Solanum tuberosum]
          Length = 291

 Score =  349 bits (896), Expect = 2e-93
 Identities = 180/244 (73%), Positives = 206/244 (84%)
 Frame = -2

Query: 1113 SCPKSSLLRPWALTTDEIVENSNSKEMMPKIDKSGRFCSPRAARELALMILYAACLEGSD 934
            S  + SLLRP ALT +E V+  +   + PKIDKSGRFCSPRAARELALM +YAACLEGSD
Sbjct: 54   SVQRFSLLRPSALTFEETVDTFS---VQPKIDKSGRFCSPRAARELALMTIYAACLEGSD 110

Query: 933  PVRLFDKRLNARRESGYEFDRASLMEYNHMSFGGPPVXXXXXXEANELLSNDEKESEIEA 754
            PVR F+KRLN RRE GY FD+  LM+YNHMSFGGPPV      EA+ELL  DE +SEIEA
Sbjct: 111  PVRFFEKRLNIRREPGYIFDKEWLMKYNHMSFGGPPVKTETVEEADELLKADENDSEIEA 170

Query: 753  EVLSAPPKLVYSKLILHFTRKLLVAVVEKWDSHVLVINKVAPPNWKDEPAGRILELSILH 574
            EVLSAPPKLVYSKLIL FTRKLLVAV EKWDSHVLVINKVAP NWK+EPAGRILELSILH
Sbjct: 171  EVLSAPPKLVYSKLILRFTRKLLVAVEEKWDSHVLVINKVAPDNWKNEPAGRILELSILH 230

Query: 573  LAMSEIAVLGTRHQIVINEAVDLAKRFCDGAAPRVINGCLRTFMEDIKGNVNARELKIKQ 394
            LAMSEI+VLGTRHQIVINEAVDLAKRFCDG+APR++NGCLRTF++D++    A++ ++ Q
Sbjct: 231  LAMSEISVLGTRHQIVINEAVDLAKRFCDGSAPRIVNGCLRTFVKDLE---IAKDARVNQ 287

Query: 393  TVIS 382
            +V +
Sbjct: 288  SVFT 291


>gb|KDO54510.1| hypothetical protein CISIN_1g021860mg [Citrus sinensis]
          Length = 306

 Score =  349 bits (895), Expect = 3e-93
 Identities = 175/236 (74%), Positives = 202/236 (85%), Gaps = 4/236 (1%)
 Frame = -2

Query: 1071 TDEIVENSNSKEMMPKIDKSGRFCSPRAARELALMILYAACLEGSDPVRLFDKRLNARRE 892
            T E V  S++KEMMPKIDKSGRFCSPRAARELAL+++YAACLEGSDP+RLF+KRLN+RRE
Sbjct: 71   TRESVMASSAKEMMPKIDKSGRFCSPRAARELALLVVYAACLEGSDPIRLFEKRLNSRRE 130

Query: 891  SGYEFDRASLMEYNHMSFGGPPVXXXXXXEANELLSNDEKESEIEAEVLSAPPKLVYSKL 712
             GYEFD++SL+EYNHMSFGGPPV      EA+ELL +DE+ES IEAEVLSAPPKLVYSKL
Sbjct: 131  PGYEFDKSSLLEYNHMSFGGPPVTTETVEEADELLRSDEEESAIEAEVLSAPPKLVYSKL 190

Query: 711  ILHFTRKLLVAVVEKWDSHVLVINKVAPPNWK----DEPAGRILELSILHLAMSEIAVLG 544
            +L FTRKLLVAVV+KWD+HV +I+KV PP WK    D+PAGRILELSILHLAMSEI V+G
Sbjct: 191  LLRFTRKLLVAVVDKWDAHVHIIDKVVPPIWKVWKMDQPAGRILELSILHLAMSEITVVG 250

Query: 543  TRHQIVINEAVDLAKRFCDGAAPRVINGCLRTFMEDIKGNVNARELKIKQTVISSV 376
            TRHQIVINEAVDLAKRFCDGAAPR+INGCLRTF+ +++G  N    K  + V S V
Sbjct: 251  TRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVRNLEGTANIEASKASKEVPSEV 306


>emb|CDP18571.1| unnamed protein product [Coffea canephora]
          Length = 211

 Score =  348 bits (893), Expect = 5e-93
 Identities = 171/203 (84%), Positives = 189/203 (93%)
 Frame = -2

Query: 1035 MMPKIDKSGRFCSPRAARELALMILYAACLEGSDPVRLFDKRLNARRESGYEFDRASLME 856
            M+PKIDKSGRFCSPRAARELAL+ILYA+CLEGSDPVRLF+KR+NARRE GY+FD+ SL+E
Sbjct: 1    MLPKIDKSGRFCSPRAARELALLILYASCLEGSDPVRLFEKRINARREPGYDFDKESLVE 60

Query: 855  YNHMSFGGPPVXXXXXXEANELLSNDEKESEIEAEVLSAPPKLVYSKLILHFTRKLLVAV 676
            YNHMSF GPPV      EA+ELL +D+KESEIEAEVLSAPPKLVYSKLIL FTRKLLVAV
Sbjct: 61   YNHMSFAGPPVTANTLEEADELLLHDQKESEIEAEVLSAPPKLVYSKLILRFTRKLLVAV 120

Query: 675  VEKWDSHVLVINKVAPPNWKDEPAGRILELSILHLAMSEIAVLGTRHQIVINEAVDLAKR 496
             EKWDSHV VI+KVAPPNWK+EPA RILELSILHLAMSEIAVLGTRHQIVINEAVDLAKR
Sbjct: 121  AEKWDSHVFVIDKVAPPNWKNEPAARILELSILHLAMSEIAVLGTRHQIVINEAVDLAKR 180

Query: 495  FCDGAAPRVINGCLRTFMEDIKG 427
            FCDGAAPR+INGCLRTF++D++G
Sbjct: 181  FCDGAAPRIINGCLRTFIKDLQG 203


>ref|XP_007014028.1| Antitermination NusB domain-containing protein isoform 1 [Theobroma
            cacao] gi|508784391|gb|EOY31647.1| Antitermination NusB
            domain-containing protein isoform 1 [Theobroma cacao]
          Length = 290

 Score =  347 bits (891), Expect = 9e-93
 Identities = 174/232 (75%), Positives = 200/232 (86%)
 Frame = -2

Query: 1098 SLLRPWALTTDEIVENSNSKEMMPKIDKSGRFCSPRAARELALMILYAACLEGSDPVRLF 919
            S L   AL   +      SKEM+PKIDKSGRFCSPRAARELAL+I+YAACL+GSDP+RLF
Sbjct: 54   SCLPTLALQVQDQQSFHKSKEMLPKIDKSGRFCSPRAARELALLIVYAACLQGSDPIRLF 113

Query: 918  DKRLNARRESGYEFDRASLMEYNHMSFGGPPVXXXXXXEANELLSNDEKESEIEAEVLSA 739
            +KR+NA RE GYEFD+ASL++YNHMSFGGPPV      EA+ELL +DE++S IEAEVLSA
Sbjct: 114  EKRVNATREPGYEFDKASLLQYNHMSFGGPPVTTQSAEEADELLRSDEQDSAIEAEVLSA 173

Query: 738  PPKLVYSKLILHFTRKLLVAVVEKWDSHVLVINKVAPPNWKDEPAGRILELSILHLAMSE 559
            PPKLVYSKL+L FTRKLLVA+V+KWDSHVLVI+K+AP NWK+EPAGRILELSILHLAMSE
Sbjct: 174  PPKLVYSKLLLRFTRKLLVAIVDKWDSHVLVIDKIAPLNWKNEPAGRILELSILHLAMSE 233

Query: 558  IAVLGTRHQIVINEAVDLAKRFCDGAAPRVINGCLRTFMEDIKGNVNARELK 403
            + VLGTRHQIVINEAVDLAKRFCDGAAPRVINGCLRTF++D+ G   A+  K
Sbjct: 234  MTVLGTRHQIVINEAVDLAKRFCDGAAPRVINGCLRTFVKDLAGTGIAQASK 285


>ref|XP_004228949.1| PREDICTED: uncharacterized protein LOC101254024 isoform X1 [Solanum
            lycopersicum]
          Length = 291

 Score =  345 bits (886), Expect = 3e-92
 Identities = 177/234 (75%), Positives = 200/234 (85%)
 Frame = -2

Query: 1113 SCPKSSLLRPWALTTDEIVENSNSKEMMPKIDKSGRFCSPRAARELALMILYAACLEGSD 934
            S  + SLLRP ALT +E V+  +   + PKIDKSGRFCSPRAARELALM +YAACLEGSD
Sbjct: 54   SVERFSLLRPSALTFEETVDTFS---VQPKIDKSGRFCSPRAARELALMTIYAACLEGSD 110

Query: 933  PVRLFDKRLNARRESGYEFDRASLMEYNHMSFGGPPVXXXXXXEANELLSNDEKESEIEA 754
            PVR F+K+LN RRE GYEFD+  LM+YNHMSFGGPPV      EA+ELL  DE +S IEA
Sbjct: 111  PVRFFEKQLNIRREPGYEFDKDWLMKYNHMSFGGPPVKTETVEEADELLKADENDSVIEA 170

Query: 753  EVLSAPPKLVYSKLILHFTRKLLVAVVEKWDSHVLVINKVAPPNWKDEPAGRILELSILH 574
            EVLSAPPKLVYSKLIL FTRKLLVAV EKWDSHVLVINKVAP NWK+EPAGRILELSILH
Sbjct: 171  EVLSAPPKLVYSKLILRFTRKLLVAVEEKWDSHVLVINKVAPDNWKNEPAGRILELSILH 230

Query: 573  LAMSEIAVLGTRHQIVINEAVDLAKRFCDGAAPRVINGCLRTFMEDIKGNVNAR 412
            LAMSEI+VLGTRHQIV+NEAVDLAKRFCDG+APR++NGCLRTF++D++   +AR
Sbjct: 231  LAMSEISVLGTRHQIVVNEAVDLAKRFCDGSAPRIVNGCLRTFIKDLEIVKDAR 284


>gb|KHG01681.1| N utilization substance B [Gossypium arboreum]
          Length = 290

 Score =  345 bits (884), Expect = 6e-92
 Identities = 175/248 (70%), Positives = 203/248 (81%), Gaps = 1/248 (0%)
 Frame = -2

Query: 1128 PRLLLSCPKSSLLRP-WALTTDEIVENSNSKEMMPKIDKSGRFCSPRAARELALMILYAA 952
            PR   S PK  +  P  AL   +      SKEM+PKIDKSGRFCSPRAARELAL+I+YA+
Sbjct: 39   PRHPASRPKVRISLPSMALQVQDQQSFHKSKEMLPKIDKSGRFCSPRAARELALLIVYAS 98

Query: 951  CLEGSDPVRLFDKRLNARRESGYEFDRASLMEYNHMSFGGPPVXXXXXXEANELLSNDEK 772
            CL+GSDP+RLF+KR+NA RE GYEFD+ASL++YNHMSFGGPPV      EA+ELL +DE+
Sbjct: 99   CLQGSDPIRLFEKRINAAREPGYEFDKASLLQYNHMSFGGPPVTTHSVEEADELLRSDEQ 158

Query: 771  ESEIEAEVLSAPPKLVYSKLILHFTRKLLVAVVEKWDSHVLVINKVAPPNWKDEPAGRIL 592
            +S IEAEVLSAPPKLVYSKL+L FTRKLLVA+V+KWD+HVL I+KV P NWK+EPAGRIL
Sbjct: 159  DSAIEAEVLSAPPKLVYSKLLLSFTRKLLVAIVDKWDNHVLAIDKVVPSNWKNEPAGRIL 218

Query: 591  ELSILHLAMSEIAVLGTRHQIVINEAVDLAKRFCDGAAPRVINGCLRTFMEDIKGNVNAR 412
            ELSILHLAMSEI VLGTRH IVINEAVDLAKRFCDGAAPR+INGCLRTF++D+ G   A+
Sbjct: 219  ELSILHLAMSEITVLGTRHPIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDLAGTSTAQ 278

Query: 411  ELKIKQTV 388
                K  V
Sbjct: 279  TSNSKLEV 286


>ref|XP_012835305.1| PREDICTED: uncharacterized protein LOC105956032 [Erythranthe
            guttatus] gi|604335340|gb|EYU39282.1| hypothetical
            protein MIMGU_mgv1a011150mg [Erythranthe guttata]
          Length = 290

 Score =  343 bits (880), Expect = 2e-91
 Identities = 172/248 (69%), Positives = 206/248 (83%), Gaps = 2/248 (0%)
 Frame = -2

Query: 1125 RLLLSCPKSSLLRPWALTTDEIVENSNS--KEMMPKIDKSGRFCSPRAARELALMILYAA 952
            RLL SCP  S   P  +     + +S+S  K+M+PKIDKSGRFCSPRAARELALMILYAA
Sbjct: 45   RLLFSCPTRS--SPLTVENPPTLTDSSSGTKDMLPKIDKSGRFCSPRAARELALMILYAA 102

Query: 951  CLEGSDPVRLFDKRLNARRESGYEFDRASLMEYNHMSFGGPPVXXXXXXEANELLSNDEK 772
            CL+GSDPVRLF+KR+N+RRESGY+FD  +L+EY+HMSFGGPPV      EA E+L  D+K
Sbjct: 103  CLDGSDPVRLFEKRVNSRRESGYKFDNGALLEYDHMSFGGPPVATETDEEAEEILRIDQK 162

Query: 771  ESEIEAEVLSAPPKLVYSKLILHFTRKLLVAVVEKWDSHVLVINKVAPPNWKDEPAGRIL 592
            ES+IEA+VLSAPPKLVYSKLIL F RKLL+AV E WD++VL I+KVAP NWK EPAGRIL
Sbjct: 163  ESDIEAQVLSAPPKLVYSKLILRFARKLLLAVAENWDNNVLAIDKVAPDNWKKEPAGRIL 222

Query: 591  ELSILHLAMSEIAVLGTRHQIVINEAVDLAKRFCDGAAPRVINGCLRTFMEDIKGNVNAR 412
            E S+LH+AMSEIA+LGT+HQIVINEAVDLAKRFCDGAAPR INGC+RTF++++  +  A 
Sbjct: 223  EFSVLHIAMSEIAILGTKHQIVINEAVDLAKRFCDGAAPRTINGCMRTFVKELNRSGEAH 282

Query: 411  ELKIKQTV 388
             +KIKQT+
Sbjct: 283  TVKIKQTL 290


>ref|XP_010103088.1| hypothetical protein L484_005067 [Morus notabilis]
            gi|587906766|gb|EXB94811.1| hypothetical protein
            L484_005067 [Morus notabilis]
          Length = 301

 Score =  342 bits (877), Expect = 4e-91
 Identities = 176/253 (69%), Positives = 201/253 (79%), Gaps = 4/253 (1%)
 Frame = -2

Query: 1122 LLLSCPKSSLLRPWALTTDEIVENSNS----KEMMPKIDKSGRFCSPRAARELALMILYA 955
            L L CP++SL             NSNS     + +PK D+ GRFCSPRAARELAL I+YA
Sbjct: 48   LSLLCPRASLRSSTFFVETPNTHNSNSTSSSSDALPKTDRFGRFCSPRAARELALSIVYA 107

Query: 954  ACLEGSDPVRLFDKRLNARRESGYEFDRASLMEYNHMSFGGPPVXXXXXXEANELLSNDE 775
            +CLEGSDPVRLF+KR+NARRE GYEFD+ SL++YNHMSFGGPPV      E  EL  ND+
Sbjct: 108  SCLEGSDPVRLFEKRINARREPGYEFDKESLLQYNHMSFGGPPVTVETLEEEEELTRNDK 167

Query: 774  KESEIEAEVLSAPPKLVYSKLILHFTRKLLVAVVEKWDSHVLVINKVAPPNWKDEPAGRI 595
            KES+IEAEVL APPKLVYSKLIL  TRKLLVAV ++WDSHV+VI+KVAPPNWK+EPAGRI
Sbjct: 168  KESDIEAEVLGAPPKLVYSKLILRLTRKLLVAVSDQWDSHVIVIDKVAPPNWKNEPAGRI 227

Query: 594  LELSILHLAMSEIAVLGTRHQIVINEAVDLAKRFCDGAAPRVINGCLRTFMEDIKGNVNA 415
            LE  ILHLAMSEI+VLGTRHQIVINEAVDLAKRFCDGAAPRVINGCLRTF++DI+     
Sbjct: 228  LEFCILHLAMSEISVLGTRHQIVINEAVDLAKRFCDGAAPRVINGCLRTFVKDIEETGLP 287

Query: 414  RELKIKQTVISSV 376
              L+ KQ  +SSV
Sbjct: 288  LALEDKQKAVSSV 300


>ref|XP_012473006.1| PREDICTED: uncharacterized protein LOC105790127 [Gossypium raimondii]
            gi|763754535|gb|KJB21866.1| hypothetical protein
            B456_004G020300 [Gossypium raimondii]
          Length = 292

 Score =  342 bits (876), Expect = 5e-91
 Identities = 174/248 (70%), Positives = 201/248 (81%), Gaps = 1/248 (0%)
 Frame = -2

Query: 1128 PRLLLSCPKSSLLRP-WALTTDEIVENSNSKEMMPKIDKSGRFCSPRAARELALMILYAA 952
            PR   S PK  +  P  AL   +      SKEM+PKIDKSGRFCSPRAARELAL+I+YA+
Sbjct: 41   PRHPASRPKVRISLPSMALQVQDHQSFHKSKEMLPKIDKSGRFCSPRAARELALLIVYAS 100

Query: 951  CLEGSDPVRLFDKRLNARRESGYEFDRASLMEYNHMSFGGPPVXXXXXXEANELLSNDEK 772
            CL+GSDP+RLF+KR+NA RE GYEFD+ASL++YNHMSFGGPPV      EA+ELL +DE+
Sbjct: 101  CLQGSDPIRLFEKRINAAREPGYEFDKASLLQYNHMSFGGPPVTTHSVEEADELLRSDEQ 160

Query: 771  ESEIEAEVLSAPPKLVYSKLILHFTRKLLVAVVEKWDSHVLVINKVAPPNWKDEPAGRIL 592
            +S IEAEVLSAPPKLVYSKL+L FTRKLLVA V+KWD+HVL I+KV P NWK+EPAGRIL
Sbjct: 161  DSAIEAEVLSAPPKLVYSKLLLSFTRKLLVATVDKWDNHVLAIDKVVPSNWKNEPAGRIL 220

Query: 591  ELSILHLAMSEIAVLGTRHQIVINEAVDLAKRFCDGAAPRVINGCLRTFMEDIKGNVNAR 412
            ELSILHLAMSEI VLGTRH IVINEAVDLA RFCDGAAPR+INGCLRTF++D+ G   A+
Sbjct: 221  ELSILHLAMSEITVLGTRHPIVINEAVDLANRFCDGAAPRIINGCLRTFVKDLAGTSTAQ 280

Query: 411  ELKIKQTV 388
                K  V
Sbjct: 281  TSNSKLEV 288


Top