BLASTX nr result

ID: Rehmannia28_contig00045787 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia28_contig00045787
         (771 letters)

Database: ./nr 
           84,704,028 sequences; 31,038,470,784 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_008779954.1| PREDICTED: uncharacterized protein LOC103699...   264   1e-82
ref|XP_008777304.1| PREDICTED: uncharacterized protein LOC103697...   265   1e-82
ref|XP_007017136.1| Uncharacterized protein TCM_033758 [Theobrom...   197   2e-58
ref|XP_007033573.1| Uncharacterized protein TCM_019740 [Theobrom...   187   6e-56
ref|XP_008806397.1| PREDICTED: uncharacterized protein LOC103719...   177   6e-50
ref|XP_007029160.1| Uncharacterized protein isoform 1 [Theobroma...   171   6e-49
ref|XP_007024403.1| Uncharacterized protein TCM_028976 [Theobrom...   172   9e-49
ref|XP_007029161.1| Uncharacterized protein isoform 2 [Theobroma...   169   1e-48
emb|CAN65820.1| hypothetical protein VITISV_042324 [Vitis vinifera]   176   7e-47
ref|XP_012835096.1| PREDICTED: uncharacterized protein LOC105955...   169   9e-46
ref|XP_012833844.1| PREDICTED: uncharacterized protein LOC105954...   171   1e-45
ref|XP_007044837.1| Uncharacterized protein TCM_010591 [Theobrom...   162   5e-45
ref|XP_007049075.1| Uncharacterized protein TCM_002073 [Theobrom...   168   3e-44
gb|KHN28108.1| hypothetical protein glysoja_039628, partial [Gly...   156   8e-44
gb|KYP49735.1| Retrovirus-related Pol polyprotein from transposo...   162   9e-44
ref|XP_012856897.1| PREDICTED: uncharacterized protein LOC105976...   166   2e-43
ref|XP_013615493.1| PREDICTED: uncharacterized protein LOC106321...   159   2e-43
ref|XP_013674504.1| PREDICTED: uncharacterized protein LOC106379...   156   3e-43
gb|KYP59020.1| hypothetical protein KK1_014445 [Cajanus cajan]        155   5e-43
ref|XP_013691399.1| PREDICTED: uncharacterized protein LOC106395...   157   6e-43

>ref|XP_008779954.1| PREDICTED: uncharacterized protein LOC103699729, partial [Phoenix
           dactylifera]
          Length = 490

 Score =  264 bits (675), Expect = 1e-82
 Identities = 141/274 (51%), Positives = 187/274 (68%), Gaps = 18/274 (6%)
 Frame = -2

Query: 770 AQETWEKLKVRFSQPDNVRIFQLQQQLSSIVQGTNSVNEYFTQLNAVWEELRNYRPLPCC 591
           A+E W KLK RF+QPDNVRI+QL+QQLSSI Q + SV+EYFTQLNA+WEELRNYRPLP C
Sbjct: 117 AKEVWNKLKSRFAQPDNVRIYQLKQQLSSITQRSLSVSEYFTQLNAIWEELRNYRPLPYC 176

Query: 590 SCGLCTCNALRSLGEIQSSDYVFKFLMGLNESFEGIRGHILLMSLTPSLDKVFSMILQEE 411
           SCG C C+AL+ +GE    D++F+FLMGLN++++ +RG I+LMS  PSLDK FS++LQEE
Sbjct: 177 SCGHCICDALKGVGEDLELDHIFQFLMGLNDTYDTVRGQIILMSPLPSLDKTFSLVLQEE 236

Query: 410 RQREARVPFSPTMDSSSFAVNFVADXXXXXXSVVCEHCGITGHRRDKCFRLIGFPPNFKF 231
           RQR+AR    P  +SS+ A   V +       + C HCG +GH ++KC+RLIGFPPNFKF
Sbjct: 237 RQRQARAIIFPAPESSALAA--VLNKSKNRAEITCYHCGKSGHTKEKCYRLIGFPPNFKF 294

Query: 230 TKSKPRHFGQK----HSAH-LISSQENQGVSNEKQNEGNVPFTQDQIQKLMALINSDSMQ 66
           TK+K      K    HSA+ +ISS + +G+S  +     +  +Q QIQ+L+AL+NS   Q
Sbjct: 295 TKTKFPSVNNKSVAPHSANQVISSTQGKGLSAPQ-----LSLSQTQIQQLLALVNSGIPQ 349

Query: 65  LSQN-------------TPQSQSGNIHPHLSNMA 3
           +S N             TP +++GN     SNMA
Sbjct: 350 MSLNSASTQQEPILPMVTPTTETGNNSAPSSNMA 383


>ref|XP_008777304.1| PREDICTED: uncharacterized protein LOC103697258 [Phoenix
           dactylifera]
          Length = 514

 Score =  265 bits (676), Expect = 1e-82
 Identities = 137/254 (53%), Positives = 176/254 (69%), Gaps = 5/254 (1%)
 Frame = -2

Query: 767 QETWEKLKVRFSQPDNVRIFQLQQQLSSIVQGTNSVNEYFTQLNAVWEELRNYRPLPCCS 588
           +E W KLK RF+QPDNVRI+QL+QQLSSI QGT SV+EYFTQLNA+WEELRNYRPLP CS
Sbjct: 118 KEVWNKLKSRFAQPDNVRIYQLKQQLSSITQGTLSVSEYFTQLNAIWEELRNYRPLPYCS 177

Query: 587 CGLCTCNALRSLGEIQSSDYVFKFLMGLNESFEGIRGHILLMSLTPSLDKVFSMILQEER 408
           CG C C+AL+ +GE    DY+F+FLM LN +F+ +RG I+LMS  PSLDK FS++LQEER
Sbjct: 178 CGHCICDALKGVGENLELDYIFQFLMELNNTFDSVRGQIILMSPLPSLDKTFSLVLQEER 237

Query: 407 QREARVPFSPTMDSSSFAVNFVADXXXXXXSVVCEHCGITGHRRDKCFRLIGFPPNFKFT 228
           QR+AR    P  +SS+ A   V +       + C HCG  GH R+KC+RLIGFPPNFKFT
Sbjct: 238 QRQARAIIFPAPESSALAA--VLNKPKNKAKITCYHCGKPGHTREKCYRLIGFPPNFKFT 295

Query: 227 KSKPRHFGQK----HSAH-LISSQENQGVSNEKQNEGNVPFTQDQIQKLMALINSDSMQL 63
           K+K      K    HSA+ +IS  + +G++  +     +  +Q Q+Q+L AL+NS   QL
Sbjct: 296 KTKSPSVNNKSVASHSANQVISPTQGKGLAAPQ-----LSLSQAQVQQLFALVNSGITQL 350

Query: 62  SQNTPQSQSGNIHP 21
           + N+  SQ   I P
Sbjct: 351 NLNSASSQQEPIPP 364


>ref|XP_007017136.1| Uncharacterized protein TCM_033758 [Theobroma cacao]
           gi|508722464|gb|EOY14361.1| Uncharacterized protein
           TCM_033758 [Theobroma cacao]
          Length = 328

 Score =  197 bits (500), Expect = 2e-58
 Identities = 104/232 (44%), Positives = 144/232 (62%), Gaps = 2/232 (0%)
 Frame = -2

Query: 770 AQETWEKLKVRFSQPDNVRIFQLQQQLSSIVQGTNSVNEYFTQLNAVWEELRNYRPLPCC 591
           A E WE LK RFSQPD+ RI  LQ  L +I QGT SV+ YFT+LN +WEELRNYRPLP C
Sbjct: 83  AYEVWETLKERFSQPDDARICNLQFNLYNISQGTRSVDAYFTELNCIWEELRNYRPLPHC 142

Query: 590 SCGLCTCNALRSLGEIQSSDYVFKFLMGLNESFEGIRGHILLMSLTPSLDKVFSMILQEE 411
           SCG+C     ++  +    D VF+FL GLNESF  +R  IL+M   PSL+K +++++++E
Sbjct: 143 SCGICNSACFQTYIDQYQKDSVFRFLNGLNESFSALRSQILMMKPFPSLNKAYNLVIRDE 202

Query: 410 RQREARVPFSPTMDSSSFAVNFVADXXXXXXSVVCEHCGITGHRRDKCFRLIGFPPNFKF 231
            QR   +   P ++SS+ A             VVC +C   GH +DKC+RLIGFPP+FKF
Sbjct: 203 SQRNLYLHTMPIIESSAMA-TMTEGKVKSKVDVVCSYCHKKGHTKDKCYRLIGFPPDFKF 261

Query: 230 TKSK-PRHFGQKHSAHLISSQENQGVSNEK-QNEGNVPFTQDQIQKLMALIN 81
            K K P   G   S + +    ++   +E  ++  ++  ++ QIQKLM+LIN
Sbjct: 262 LKGKSPLKKGNVWSINNVGPVTSKEECDESTKSLSSLTLSKHQIQKLMSLIN 313


>ref|XP_007033573.1| Uncharacterized protein TCM_019740 [Theobroma cacao]
           gi|508712602|gb|EOY04499.1| Uncharacterized protein
           TCM_019740 [Theobroma cacao]
          Length = 211

 Score =  187 bits (474), Expect = 6e-56
 Identities = 91/209 (43%), Positives = 129/209 (61%)
 Frame = -2

Query: 770 AQETWEKLKVRFSQPDNVRIFQLQQQLSSIVQGTNSVNEYFTQLNAVWEELRNYRPLPCC 591
           A + W+ LK  FSQPD+ RI  LQ  L +I Q T  V+ YFT+LN +WEEL+NYRPLP C
Sbjct: 4   AADIWQTLKNHFSQPDDTRICNLQYSLCNITQDTRPVDSYFTKLNGIWEELKNYRPLPYC 63

Query: 590 SCGLCTCNALRSLGEIQSSDYVFKFLMGLNESFEGIRGHILLMSLTPSLDKVFSMILQEE 411
            CG CT +  +   E+   D VF+FL GLNESF  +R HI+++   PSLD+ ++++L+EE
Sbjct: 64  ECGKCTQSCFQKYIELWEKDRVFRFLNGLNESFSALRSHIIMIKPFPSLDEAYNLVLREE 123

Query: 410 RQREARVPFSPTMDSSSFAVNFVADXXXXXXSVVCEHCGITGHRRDKCFRLIGFPPNFKF 231
            QR   +   P +D++  AV            VVC HC   GH ++KC+ +IGFPP+FKF
Sbjct: 124 SQRSILMQSQPLLDTTVVAV-VTESKIRVKNEVVCSHCAKNGHVKEKCYCIIGFPPDFKF 182

Query: 230 TKSKPRHFGQKHSAHLISSQENQGVSNEK 144
           TK K  +F +K  + + +S     V N++
Sbjct: 183 TKGK-GNFSRKAMSAVANSTNQSQVENQE 210


>ref|XP_008806397.1| PREDICTED: uncharacterized protein LOC103719098 [Phoenix
           dactylifera]
          Length = 406

 Score =  177 bits (449), Expect = 6e-50
 Identities = 104/262 (39%), Positives = 154/262 (58%), Gaps = 19/262 (7%)
 Frame = -2

Query: 770 AQETWEKLKVRFSQPDNVRIFQLQQQLSSIVQGTNSVNEYFTQLNAVWEELRNYRPLPCC 591
           A+E W  L+ RFSQ +  RIFQ+Q+ ++S+ Q  +SV+ YFT+L  +WEEL NYRP P C
Sbjct: 24  AREIWNNLQERFSQGNGPRIFQIQKSIASLSQDQSSVSAYFTKLKGLWEELWNYRPNPIC 83

Query: 590 SCGLCTCNALRSLGEIQSSDYVFKFLMGLNESFEGIRGHILLMSLTPSLDKVFSMILQEE 411
           S G     A++ L E Q+ +   +FLMGLN+S+  IRG ILL+   PS +KVFS++LQEE
Sbjct: 84  SSG-----AMKQLIEYQNQECTMQFLMGLNDSYSQIRGQILLIDPLPSTNKVFSLVLQEE 138

Query: 410 RQREARVPFSPTMDSSSF--AVNF------------VADXXXXXXSVVCEHCGITGHRRD 273
           +QRE   P +P M+ ++F    N+             A+        VC HCG+TGH ++
Sbjct: 139 KQREITSPVNPNMNIAAFLGRTNYNNAPSILAKYGAGANQFQRRERSVCSHCGVTGHTKE 198

Query: 272 KCFRLIGFPPNFKFTKSKPRHFGQKHSAHLISSQENQGVSNEKQNEGNVPFTQDQIQKLM 93
           +C++L G+P  +K +K+K    G + + +  SSQ     S +  N   +PFT +Q Q+L+
Sbjct: 199 RCYKLHGYPRGYK-SKNK----GSQITVNQASSQTG---SKQFTNAPQLPFTVEQCQQLL 250

Query: 92  ALINSDSM-----QLSQNTPQS 42
           A+IN  S        S  TPQ+
Sbjct: 251 AMINHSSSSDAGHSNSSTTPQN 272


>ref|XP_007029160.1| Uncharacterized protein isoform 1 [Theobroma cacao]
           gi|508717765|gb|EOY09662.1| Uncharacterized protein
           isoform 1 [Theobroma cacao]
          Length = 263

 Score =  171 bits (432), Expect = 6e-49
 Identities = 94/254 (37%), Positives = 137/254 (53%), Gaps = 13/254 (5%)
 Frame = -2

Query: 770 AQETWEKLKVRFSQPDNVRIFQLQQQLSSIVQGTNSVNEYFTQLNAVWEELRNYRPLPCC 591
           A E W  LK  F+QPD+ R+  LQ  L ++ QG  +V+ YF +L  +WEELRNYRPLP C
Sbjct: 4   AAEIWNTLKQNFAQPDDTRVCNLQYTLGNVSQGARTVDVYFIELKGIWEELRNYRPLPHC 63

Query: 590 SCGLCTCNALRSLGEIQSSDYVFKFLMGLNESFEGIRGHILLMSLTPSLDKVFSMILQEE 411
            CG       +   +    D VF+FL GLN+SF  IR  ILLM   P LDKV+S+IL+EE
Sbjct: 64  ECGSYNPGCFKKYTDQFQKDMVFRFLNGLNKSFSAIRSQILLMDPIPGLDKVYSLILREE 123

Query: 410 RQREARVPFSPTMDSSSFAVNFVAD-XXXXXXSVVCEHCGITGHRRDKCFRLIGFPPNFK 234
            QR   V   P ++  SFA+   AD        ++C HCG  GH +DKC+++I F  +FK
Sbjct: 124 SQRNILVQPQPLLE--SFAMFTAADNKKKARKDIICNHCGKKGHTKDKCYKIISFLDDFK 181

Query: 233 FTKS------KPRHFGQKHSAHLISSQENQGVSNEKQNEGNVPFT------QDQIQKLMA 90
           FTK       K ++      A   +S +++     K+ + +  F       + Q+ KLM 
Sbjct: 182 FTKGGRSNPRKGKNLVNNVFAVSDASTDSESQVETKEEQASAGFVCQLSMIKQQVNKLMQ 241

Query: 89  LINSDSMQLSQNTP 48
            ++ + +  ++  P
Sbjct: 242 FLSENGISSNEGHP 255


>ref|XP_007024403.1| Uncharacterized protein TCM_028976 [Theobroma cacao]
           gi|508779769|gb|EOY27025.1| Uncharacterized protein
           TCM_028976 [Theobroma cacao]
          Length = 318

 Score =  172 bits (435), Expect = 9e-49
 Identities = 84/185 (45%), Positives = 109/185 (58%), Gaps = 2/185 (1%)
 Frame = -2

Query: 764 ETWEKLKVRFSQPDNVRIFQLQQQLSSIVQGTNSVNEYFTQLNAVWEELRNYRPLPCCSC 585
           E W  LK+ ++QPDN  +  LQ  L S+ Q    V  YF +L  +WEELRNYRPLP C C
Sbjct: 122 EIWNTLKLNYAQPDNTCVCNLQYTLGSVTQRVKIVYAYFIELKCIWEELRNYRPLPHCEC 181

Query: 584 GLCTCNALRSLGEIQSSDYVFKFLMGLNESFEGIRGHILLMSLTPSLDKVFSMILQEERQ 405
           G C  N  +   +    D VF+FL GLNESF  IR  I+LM   PSLDKV+SM+L+EE Q
Sbjct: 182 GKCNANCFKKFSDQYQKDMVFRFLNGLNESFSAIRSQIILMDPIPSLDKVYSMVLREESQ 241

Query: 404 REARVPFSPTMDSSSF--AVNFVADXXXXXXSVVCEHCGITGHRRDKCFRLIGFPPNFKF 231
           +   +   P ++S +   A N           + C HCG  GH ++KC+R+I FP +FKF
Sbjct: 242 KNMFLQSQPFLESLAMLAATNV---KKKPMKDLTCTHCGKKGHVKEKCYRIIRFPEDFKF 298

Query: 230 TKSKP 216
           TK KP
Sbjct: 299 TKGKP 303


>ref|XP_007029161.1| Uncharacterized protein isoform 2 [Theobroma cacao]
           gi|508717766|gb|EOY09663.1| Uncharacterized protein
           isoform 2 [Theobroma cacao]
          Length = 260

 Score =  169 bits (429), Expect = 1e-48
 Identities = 97/260 (37%), Positives = 139/260 (53%), Gaps = 13/260 (5%)
 Frame = -2

Query: 770 AQETWEKLKVRFSQPDNVRIFQLQQQLSSIVQGTNSVNEYFTQLNAVWEELRNYRPLPCC 591
           A E W  LK  F+QPD+ R+  LQ  L ++ QG  +V+ YF +L  +WEELRNYRPLP C
Sbjct: 4   AAEIWNTLKQNFAQPDDTRVCNLQYTLGNVSQGARTVDVYFIELKGIWEELRNYRPLPHC 63

Query: 590 SCGLCTCNALRSLGEIQSSDYVFKFLMGLNESFEGIRGHILLMSLTPSLDKVFSMILQEE 411
            CG       +   +    D VF+FL GLN+SF  IR  ILLM   P LDKV+S+IL+EE
Sbjct: 64  ECGSYNPGCFKKYTDQFQKDMVFRFLNGLNKSFSAIRSQILLMDPIPGLDKVYSLILREE 123

Query: 410 RQREARVPFSPTMDSSSFAVNFVAD-XXXXXXSVVCEHCGITGHRRDKCFRLIGFPPNFK 234
            QR   V   P ++  SFA+   AD        ++C HCG  GH +DKC+++I F  +FK
Sbjct: 124 SQRNILVQPQPLLE--SFAMFTAADNKKKARKDIICNHCGKKGHTKDKCYKIISFLDDFK 181

Query: 233 FTKS------KPRHFGQKHSAHLISSQENQGVSNEKQNEGNVPFT------QDQIQKLMA 90
           FTK       K ++      A   +S +++     K+ + +  F       + Q+ KLM 
Sbjct: 182 FTKGGRSNPRKGKNLVNNVFAVSDASTDSESQVETKEEQASAGFVCQLSMIKQQVNKLMQ 241

Query: 89  LINSDSMQLSQNTPQSQSGN 30
            ++ +   +S N  +  S N
Sbjct: 242 FLSENG--ISSNEGKGISSN 259


>emb|CAN65820.1| hypothetical protein VITISV_042324 [Vitis vinifera]
          Length = 1262

 Score =  176 bits (447), Expect = 7e-47
 Identities = 97/260 (37%), Positives = 145/260 (55%), Gaps = 17/260 (6%)
 Frame = -2

Query: 755 EKLKVRFSQPDNVRIFQLQQQLSSIVQGTNSVNEYFTQLNAVWEELRNYRPLPCCSCG-- 582
           E+LK+R+ + D  R+F L++ LSSI Q + S+ EYF++  A+W+E  +YRP+P C CG  
Sbjct: 89  EELKIRYLRSDGPRVFSLEKSLSSISQNSKSITEYFSEFKALWDEYISYRPIPSCRCGNL 148

Query: 581 -LCTCNALRSLGEIQSSDYVFKFLMGLNESFEGIRGHILLMSLTPSLDKVFSMILQEERQ 405
             C+CN L+ L + Q SDYV KFL+GL++S+  IR  +LL S  PS+ +VFS++LQEE Q
Sbjct: 149 NRCSCNILKDLTDRQQSDYVMKFLVGLHDSYSAIRSQLLLQSPLPSMSRVFSLLLQEESQ 208

Query: 404 REARVPFSPTMDSSSFAV----------NFVADXXXXXXSVVCEHCGITGHRRDKCFRLI 255
           R        ++DS +             N            +C HCG +GH  DKCF+LI
Sbjct: 209 RSLTNAVGISIDSQAMVAEQSSRTVSTSNTQFTKQKGKSDAICSHCGYSGHLVDKCFQLI 268

Query: 254 GFPPNFKFTKSKPRHFGQKHSA----HLISSQENQGVSNEKQNEGNVPFTQDQIQKLMAL 87
           G+PP +K  + K   F    +A      + +  N  V  +  +  N+ F+Q+QIQ L+ L
Sbjct: 269 GYPPRWKGPRGK--IFNSTPTAAKNFQRLPTANNTNVLEQNSSNSNMIFSQEQIQNLLTL 326

Query: 86  INSDSMQLSQNTPQSQSGNI 27
            NS S   + NT  +   N+
Sbjct: 327 ANSLS---NSNTNFNAXSNV 343


>ref|XP_012835096.1| PREDICTED: uncharacterized protein LOC105955841, partial
           [Erythranthe guttata]
          Length = 514

 Score =  169 bits (427), Expect = 9e-46
 Identities = 98/270 (36%), Positives = 139/270 (51%), Gaps = 14/270 (5%)
 Frame = -2

Query: 770 AQETWEKLKVRFSQPDNVRIFQLQQQLSSIVQGTNSVNEYFTQLNAVWEELRNYRPLPCC 591
           ++E W+ LK RFSQ +  RIFQL++ L+++ QG+ SVN YFT++ A+W+EL NYRP  CC
Sbjct: 105 SKEIWDDLKTRFSQTNGPRIFQLRRDLANLTQGSQSVNVYFTKVKAIWDELVNYRP--CC 162

Query: 590 SCGLCTCNALRSLGEIQSSDYVFKFLMGLNESFEGIRGHILLMSLTPSLDKVFSMILQEE 411
           SCG C C     L    + +YV  FLMGLNES    RG ILLM   P + KVF+ + QEE
Sbjct: 163 SCGKCDCGGFEKLQAHYNQEYVMSFLMGLNESLASTRGQILLMDPLPPISKVFAFVSQEE 222

Query: 410 RQREARVPFSPTMDSSSFAV-----------NFVADXXXXXXSVVCEHCGITGHRRDKCF 264
           RQR   V        S F+V            F            C HC + GH  +KC+
Sbjct: 223 RQRSV-VSSHVESSGSVFSVKNEGFKRSINNQFYNTGFKKKERSFCTHCNMQGHTVEKCY 281

Query: 263 RLIGFPPNFKFTKSK---PRHFGQKHSAHLISSQENQGVSNEKQNEGNVPFTQDQIQKLM 93
           +L G+PP++K  KS+   P +      + L S   + GVS++  +      T  Q Q+ M
Sbjct: 282 KLHGYPPSYKPQKSRFSSPANQVSGFDSSLDSHSSDSGVSSQHVDGYLQSMTPSQCQQFM 341

Query: 92  ALINSDSMQLSQNTPQSQSGNIHPHLSNMA 3
           ++ +S      Q +  S       H ++ A
Sbjct: 342 SMFSSHMAAQQQQSAASAQPQSSAHGADTA 371


>ref|XP_012833844.1| PREDICTED: uncharacterized protein LOC105954710 [Erythranthe
           guttata]
          Length = 659

 Score =  171 bits (432), Expect = 1e-45
 Identities = 102/266 (38%), Positives = 144/266 (54%), Gaps = 17/266 (6%)
 Frame = -2

Query: 770 AQETWEKLKVRFSQPDNVRIFQLQQQLSSIVQGTNSVNEYFTQLNAVWEELRNYRPLPCC 591
           ++E W+ LK RFSQ +  RIFQL++ L+++ QG+ SVN YFT++ A+W+EL NYRP  CC
Sbjct: 105 SKEIWDDLKTRFSQTNGPRIFQLRRDLANLTQGSQSVNVYFTKVKAIWDELANYRP--CC 162

Query: 590 SCGLCTCNALRSLGEIQSSDYVFKFLMGLNESFEGIRGHILLMSLTPSLDKVFSMILQEE 411
           SCG C C     L    + +YV  FLMGLN+S    RG ILLM   P + KVF+ I QEE
Sbjct: 163 SCGKCDCGGFEKLQAHYNQEYVMSFLMGLNDSLASTRGQILLMDPLPPISKVFAFISQEE 222

Query: 410 RQREARVPFSPTMDSSS--FAV-----------NFVADXXXXXXSVVCEHCGITGHRRDK 270
           RQR      S  +DSS   F+V            F            C HC + GH  +K
Sbjct: 223 RQRSV---VSSHVDSSGSVFSVKNEGFKRSINNQFYNPGLKKRERSFCTHCNMQGHTVEK 279

Query: 269 CFRLIGFPPNFKFTKSK-PRHFGQ--KHSAHLISSQENQGVSNEKQNEGNVPFTQDQIQK 99
           C++L G+PP++K  KS+   H  Q     + L S   + GVS+++ +      T  Q Q+
Sbjct: 280 CYKLHGYPPSYKPQKSRFSSHVNQVSGFDSSLDSHSSDAGVSSQQVDGYLQSMTPSQCQQ 339

Query: 98  LMALINSD-SMQLSQNTPQSQSGNIH 24
            M++ +S  + Q  Q+T   Q  + H
Sbjct: 340 FMSMFSSHMAAQQQQSTASIQPQSAH 365


>ref|XP_007044837.1| Uncharacterized protein TCM_010591 [Theobroma cacao]
           gi|508708772|gb|EOY00669.1| Uncharacterized protein
           TCM_010591 [Theobroma cacao]
          Length = 336

 Score =  162 bits (411), Expect = 5e-45
 Identities = 94/232 (40%), Positives = 139/232 (59%), Gaps = 2/232 (0%)
 Frame = -2

Query: 770 AQETWEKLKVRFSQPDNVRIFQLQQQLSSIVQGTNSVNEYFTQLNAVWEELRNYRPLPCC 591
           A+E  E LK RFSQP    I  LQ QL +I+QGT SVN YFT+LN+VW+EL+N+RPLP C
Sbjct: 108 AKEILETLKNRFSQPYETIICNLQFQLRNILQGTRSVNTYFTELNSVWQELKNFRPLPQC 167

Query: 590 SCGLCTCNALRSLGEIQSSDYVFKFLMGLNESFEGIRGHILLMSLTPSLDKVFSMILQEE 411
                  N  +   + Q+ D VF FL GLNESF  +R HIL++    S+D+ +S+++++ 
Sbjct: 168 DYEGRKNNCYKKYADQQNKDAVFCFLNGLNESFSCLRSHILMLKPFLSIDQAYSLVIKKM 227

Query: 410 RQREARVPFSPTMDSSSFAVNFVADXXXXXXSVVCEHCGITGHRRDKCFRLIGFPPNFKF 231
            QR + +  SP  +S+   V  + +      ++VC HCG  GH ++K + +IGFP NFKF
Sbjct: 228 LQR-SLILQSPVENSTMATV--ITEEKRKNTNLVCSHCGKKGHSKEKYYCIIGFPENFKF 284

Query: 230 TKSK--PRHFGQKHSAHLISSQENQGVSNEKQNEGNVPFTQDQIQKLMALIN 81
           TK K   R  G   ++ +  S++++       +   +  T+ QIQKLM LI+
Sbjct: 285 TKLKRNMRKGGSSVNSAISGSEQDEYDETVTNSISQLSLTKAQIQKLMTLIS 336


>ref|XP_007049075.1| Uncharacterized protein TCM_002073 [Theobroma cacao]
           gi|508701336|gb|EOX93232.1| Uncharacterized protein
           TCM_002073 [Theobroma cacao]
          Length = 817

 Score =  168 bits (426), Expect = 3e-44
 Identities = 85/226 (37%), Positives = 126/226 (55%)
 Frame = -2

Query: 758 WEKLKVRFSQPDNVRIFQLQQQLSSIVQGTNSVNEYFTQLNAVWEELRNYRPLPCCSCGL 579
           W  LK  ++QPD+ R+  LQ  L +I QGT SV+ YF +L AV EE+R+YRPLP C CG 
Sbjct: 87  WNTLKQNYAQPDDTRLCNLQYTLGNITQGTRSVDSYFIELKAVREEIRSYRPLPHCECGR 146

Query: 578 CTCNALRSLGEIQSSDYVFKFLMGLNESFEGIRGHILLMSLTPSLDKVFSMILQEERQRE 399
           C  N  +   +    D VF+FL GLNESF  IR HI+LM   P+LD+V++ +L+EE Q+ 
Sbjct: 147 CNANCFKRYIDQYHKDMVFRFLNGLNESFSAIRSHIILMDPIPTLDRVYNFMLREETQKN 206

Query: 398 ARVPFSPTMDSSSFAVNFVADXXXXXXSVVCEHCGITGHRRDKCFRLIGFPPNFKFTKSK 219
                   ++SS+  +            +VC HCG  GH ++KC+RLIGFP +FKFT  K
Sbjct: 207 LLFQSQSVLESSTM-LTTTDSKKKLKKDLVCSHCGKKGHNKEKCYRLIGFPYDFKFTTRK 265

Query: 218 PRHFGQKHSAHLISSQENQGVSNEKQNEGNVPFTQDQIQKLMALIN 81
                 K + + +++     +   + +      + +  Q   +L+N
Sbjct: 266 ANIKKGKTAVNNVTASNEISIDEFQVDSDGKGISSNSQQGKQSLVN 311


>gb|KHN28108.1| hypothetical protein glysoja_039628, partial [Glycine soja]
          Length = 230

 Score =  156 bits (395), Expect = 8e-44
 Identities = 80/187 (42%), Positives = 115/187 (61%), Gaps = 10/187 (5%)
 Frame = -2

Query: 767 QETWEKLKVRFSQPDNVRIFQLQQQLSSIVQGTNSVNEYFTQLNAVWEELRNYRPLPCCS 588
           +E W+ LK  F + +  RIFQL++QL S+ QGT+ +N Y T+L ++WEEL  Y+P   C+
Sbjct: 49  KEIWDDLKTWFLRKNGPRIFQLKRQLMSLQQGTDDINTYHTKLKSIWEELTGYKPTFSCT 108

Query: 587 CGLCTCNALRSLGEIQSSDYVFKFLMGLNESFEGIRGHILLMSLTPSLDKVFSMILQEER 408
           CG      L+ +      +YV  FLMGLN+SF  IRG ILL +  PS+  VFS+ILQEE 
Sbjct: 109 CG-----GLQQIHTHHEFEYVMSFLMGLNDSFSQIRGQILLSNPLPSIGNVFSLILQEEA 163

Query: 407 QREARVPFSPT--MDSSSFAVNFVA--------DXXXXXXSVVCEHCGITGHRRDKCFRL 258
           +RE  V  SPT  +D+ +F+VN+V+                  C HC + GH +DKC++L
Sbjct: 164 KREIVVTHSPTNSLDNIAFSVNYVSKNQYENTKGKYIKKERPKCAHCDMLGHTKDKCYKL 223

Query: 257 IGFPPNF 237
           +G+PPN+
Sbjct: 224 VGYPPNY 230


>gb|KYP49735.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94
           [Cajanus cajan]
          Length = 434

 Score =  162 bits (409), Expect = 9e-44
 Identities = 96/241 (39%), Positives = 134/241 (55%), Gaps = 10/241 (4%)
 Frame = -2

Query: 770 AQETWEKLKVRFSQPDNVRIFQLQQQLSSIVQGTNSVNEYFTQLNAVWEELRNYRPLPCC 591
           A+E W+ LK RFS+ +  RIF L++QL S+ QG++ V+ Y+T+L ++WEEL  Y+P   C
Sbjct: 60  AKENWDDLKTRFSRKNGPRIFHLKRQLMSLQQGSDDVSTYYTKLKSIWEELAGYKPNFQC 119

Query: 590 SCGLCTCNALRSLGEIQSSDYVFKFLMGLNESFEGIRGHILLMSLTPSLDKVFSMILQEE 411
           +CG      L SL +   S+YV  FLMGLN+SF  IRG ILL    PS+  VFS+ILQEE
Sbjct: 120 TCG-----GLESLHKHTQSEYVMSFLMGLNDSFSQIRGQILLSDPLPSIGNVFSLILQEE 174

Query: 410 RQREARVPF--SPTMDSSSFAVNFVA--------DXXXXXXSVVCEHCGITGHRRDKCFR 261
            Q+E  V    S   D  +FAVN  +                + C HC + GH +DKC++
Sbjct: 175 TQKEIAVTHATSAHSDDMAFAVNQCSKTNFDNNKGKFVKKDRLKCAHCEMFGHTKDKCYK 234

Query: 260 LIGFPPNFKFTKSKPRHFGQKHSAHLISSQENQGVSNEKQNEGNVPFTQDQIQKLMALIN 81
           L+G+PPN+ F   +P+   Q   +H          SN   N      T  Q Q+LM L+N
Sbjct: 235 LVGYPPNY-FKNRQPQVVNQVDISH------ESSTSNTALN-----LTPAQCQQLMTLLN 282

Query: 80  S 78
           +
Sbjct: 283 N 283


>ref|XP_012856897.1| PREDICTED: uncharacterized protein LOC105976150 [Erythranthe
           guttata]
          Length = 746

 Score =  166 bits (419), Expect = 2e-43
 Identities = 96/272 (35%), Positives = 142/272 (52%), Gaps = 25/272 (9%)
 Frame = -2

Query: 770 AQETWEKLKVRFSQPDNVRIFQLQQQLSSIVQGTNSVNEYFTQLNAVWEELRNYRPLPCC 591
           A E W+ L  RFSQ +  RIFQL+++LS++ Q T SVN YFT+L A+W+EL N+RP   C
Sbjct: 103 AHEMWKDLNTRFSQTNGPRIFQLRRELSNLTQDTQSVNVYFTKLKAIWDELSNFRP--SC 160

Query: 590 SCGLCTCNALRSLGEIQSSDYVFKFLMGLNESFEGIRGHILLMSLTPSLDKVFSMILQEE 411
           +CG CTC  ++ L E  + ++V  FLMGLNES    RG ILLM   P ++KVF+++ QEE
Sbjct: 161 TCGACTCGGVQKLNEHYNLEHVMAFLMGLNESLTSTRGQILLMDPLPPINKVFALVSQEE 220

Query: 410 RQREARVPFSPTMDSSSFAVNFVADXXXXXXSVV------------CEHCGITGHRRDKC 267
           RQR      +   +S +F++           + V            C HC I GH  DKC
Sbjct: 221 RQRSIHSSHNEVQNSLAFSIRGDQSVQRSVHNQVYTSAPKRKERGFCTHCNIYGHTIDKC 280

Query: 266 FRLIGFPPNFKFTKSKPRHFG---QKHSAHLISSQENQGVSNEKQNEGNVPFTQD----- 111
           ++L G+PP +   K+KPR+      + S + +++ E+        +    PF        
Sbjct: 281 YKLHGYPPGY---KAKPRYSSLPQSRFSVNQVAAMESPLDYATSGSTSQPPFVSSDPVLA 337

Query: 110 -----QIQKLMALINSDSMQLSQNTPQSQSGN 30
                Q Q+LMA  ++      Q + Q   G+
Sbjct: 338 NMSAAQCQQLMAYFSNQMAAKKQVSTQQSHGD 369


>ref|XP_013615493.1| PREDICTED: uncharacterized protein LOC106321802, partial [Brassica
           oleracea var. oleracea]
          Length = 353

 Score =  159 bits (402), Expect = 2e-43
 Identities = 89/234 (38%), Positives = 125/234 (53%), Gaps = 7/234 (2%)
 Frame = -2

Query: 770 AQETWEKLKVRFSQPDNVRIFQLQQQLSSIVQGTNSVNEYFTQLNAVWEELRNYRPLPCC 591
           A+  W+ L  RF Q D  RIF+++Q+LS+I QG+  V+ Y+T+L  +WEE +NY  LP C
Sbjct: 108 AELIWKNLMSRFKQDDAPRIFEIEQKLSNIQQGSLDVSTYYTELVTLWEEFQNYVDLPVC 167

Query: 590 SCGLCTCNALRSLGEIQSSDYVFKFLMGLNESFEGIRGHILLMSLTPSLDKVFSMILQEE 411
           +CG C CNA  S   IQ    V KFLMGLNES++  R HIL++   PS+++VF+M+ Q+E
Sbjct: 168 TCGKCECNAAASWELIQQRSRVTKFLMGLNESYDATRRHILMLKPIPSIEEVFNMVAQDE 227

Query: 410 RQREAR-------VPFSPTMDSSSFAVNFVADXXXXXXSVVCEHCGITGHRRDKCFRLIG 252
           RQ+  R       V F  +   S+      A         VC HCG+ GH   KCF+L G
Sbjct: 228 RQKIIRPSLKTDSVVFQTSATESASPHYAAAVAYRPKQRPVCTHCGMAGHIVQKCFKLHG 287

Query: 251 FPPNFKFTKSKPRHFGQKHSAHLISSQENQGVSNEKQNEGNVPFTQDQIQKLMA 90
           +PP  +F  +              SSQ+     +  Q+ G V  +  Q Q   A
Sbjct: 288 YPPGHRFYNTN------------ASSQQRLSAPSNNQSRGPVSQSSHQHQSTTA 329


>ref|XP_013674504.1| PREDICTED: uncharacterized protein LOC106379017 [Brassica napus]
           gi|923870047|ref|XP_013709354.1| PREDICTED:
           uncharacterized protein LOC106413059 [Brassica napus]
          Length = 267

 Score =  156 bits (394), Expect = 3e-43
 Identities = 86/237 (36%), Positives = 127/237 (53%), Gaps = 26/237 (10%)
 Frame = -2

Query: 770 AQETWEKLKVRFSQPDNVRIFQLQQQLSSIVQGTNSVNEYFTQLNAVWEELRNYRPLPCC 591
           A+  W+ +  RF Q D  R+++++QQLSSI QG+  V+ Y+T L  +WEE +NY  LP C
Sbjct: 27  AEAIWKNILSRFKQDDAPRVYEIEQQLSSIQQGSMDVSAYYTALVTLWEEHKNYVELPVC 86

Query: 590 SCGLCTCNALRSLGEIQSSDYVFKFLMGLNESFEGIRGHILLMSLTPSLDKVFSMILQEE 411
           SCG C CNA      +Q    V KFLMGLNES+E  R HIL++   P+++ VF+++ Q+E
Sbjct: 87  SCGKCECNAAELWERLQQRSRVTKFLMGLNESYESTRRHILMLKPIPTIEDVFNLVTQDE 146

Query: 410 RQREAR-----VPFS----------PTM-----DSSSFAVNFVADXXXXXXSVVCEHCGI 291
           RQR  +     +P +          PT+     D S+FA              +C +C  
Sbjct: 147 RQRGIKPSTTSIPVALQASGPTESLPTIDVVAPDHSAFATTHNNSGYRPKQRPLCTYCNQ 206

Query: 290 TGHRRDKCFRLIGFPPNFKFTKSKPRHFG------QKHSAHLISSQENQGVSNEKQN 138
            GH  DKCFRL G+PP  K+ KS   + G        +    +  Q +Q  ++++QN
Sbjct: 207 LGHVVDKCFRLHGYPPGHKYNKSSHPNAGFAPRGQNNYQQRPVQQQNSQYFASQQQN 263


>gb|KYP59020.1| hypothetical protein KK1_014445 [Cajanus cajan]
          Length = 262

 Score =  155 bits (392), Expect = 5e-43
 Identities = 80/186 (43%), Positives = 111/186 (59%), Gaps = 8/186 (4%)
 Frame = -2

Query: 770 AQETWEKLKVRFSQPDNVRIFQLQQQLSSIVQGTNSVNEYFTQLNAVWEELRNYRPLPCC 591
           AQE W+ LK RFS+ +  RIFQL+ QL S+ QG + ++ Y+T+L ++WEEL  Y+P   C
Sbjct: 60  AQEIWDDLKTRFSRKNGPRIFQLRCQLMSLHQGMDDISTYYTKLKSIWEELSGYKPTFQC 119

Query: 590 SCGLCTCNALRSLGEIQSSDYVFKFLMGLNESFEGIRGHILLMSLTPSLDKVFSMILQEE 411
           +CG      L+ L     S+YV  FLMGLN+SF  IRG ILL    PS++  FS++LQ+E
Sbjct: 120 TCG-----GLQQLQSFTESEYVMSFLMGLNDSFSQIRGQILLSDPLPSIENFFSLVLQDE 174

Query: 410 RQREARVPFSPTM---DSSSFAVNFVADXXXXXXSVV-----CEHCGITGHRRDKCFRLI 255
            QRE  V  SP +   D+ +F VN                  C HC I GH +D C++L+
Sbjct: 175 AQREIAVTSSPPVANSDNIAFTVNSSQPATSRNRFTKKERPRCAHCDILGHTKDTCYKLV 234

Query: 254 GFPPNF 237
           G+PPN+
Sbjct: 235 GYPPNY 240


>ref|XP_013691399.1| PREDICTED: uncharacterized protein LOC106395505 [Brassica napus]
          Length = 344

 Score =  157 bits (398), Expect = 6e-43
 Identities = 80/200 (40%), Positives = 110/200 (55%), Gaps = 17/200 (8%)
 Frame = -2

Query: 770 AQETWEKLKVRFSQPDNVRIFQLQQQLSSIVQGTNSVNEYFTQLNAVWEELRNYRPLPCC 591
           A+  W+ +  RF Q D  R++++ Q+LSSI QG++ V  Y+T L  +WEE +NY  LP C
Sbjct: 106 AESIWKNILSRFKQDDAPRVYEIDQKLSSIQQGSDDVTTYYTALVTLWEEHKNYVELPVC 165

Query: 590 SCGLCTCNALRSLGEIQSSDYVFKFLMGLNESFEGIRGHILLMSLTPSLDKVFSMILQEE 411
           SCG C CNA      +Q    V KFLMGLNES+E    HIL++   P +++VF+++ Q+E
Sbjct: 166 SCGKCECNAAELWERLQERSRVTKFLMGLNESYESTHRHILMLKPIPPIEEVFNLVTQDE 225

Query: 410 RQREARVPFSPTM-----------------DSSSFAVNFVADXXXXXXSVVCEHCGITGH 282
           RQR  +   +PT                  D S+FA              +C +CG  GH
Sbjct: 226 RQRAIKPSSTPTSVVFQASGPDETLLSAPPDHSAFAAAHANSGYRPKQRPLCTYCGQLGH 285

Query: 281 RRDKCFRLIGFPPNFKFTKS 222
             DKCFRL G+PP  KF KS
Sbjct: 286 IVDKCFRLHGYPPGHKFNKS 305

