BLASTX nr result

ID: Perilla23_contig00007736 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Perilla23_contig00007736
         (888 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011093988.1| PREDICTED: uncharacterized protein LOC105173...   268   4e-69
ref|XP_011093021.1| PREDICTED: uncharacterized protein LOC105173...   252   3e-64
ref|XP_010661958.1| PREDICTED: uncharacterized protein LOC100260...   192   4e-46
ref|XP_002265230.1| PREDICTED: uncharacterized protein LOC100260...   192   4e-46
ref|XP_012083190.1| PREDICTED: uncharacterized protein LOC105642...   186   2e-44
ref|XP_012083189.1| PREDICTED: uncharacterized protein LOC105642...   186   2e-44
ref|XP_012083184.1| PREDICTED: uncharacterized protein LOC105642...   186   2e-44
ref|XP_012835590.1| PREDICTED: uncharacterized protein LOC105956...   184   8e-44
ref|XP_009590542.1| PREDICTED: uncharacterized protein LOC104087...   177   7e-42
emb|CDP08657.1| unnamed protein product [Coffea canephora]            175   4e-41
ref|XP_012480955.1| PREDICTED: uncharacterized protein LOC105795...   174   6e-41
gb|KHG01749.1| hypothetical protein F383_21898 [Gossypium arboreum]   172   4e-40
ref|XP_011038435.1| PREDICTED: uncharacterized protein LOC105135...   170   2e-39
ref|XP_012490137.1| PREDICTED: uncharacterized protein LOC105802...   165   4e-38
ref|XP_010100718.1| hypothetical protein L484_023487 [Morus nota...   165   5e-38
gb|KHG18669.1| Alanine--tRNA ligase [Gossypium arboreum]              162   2e-37
ref|XP_008376151.1| PREDICTED: uncharacterized protein LOC103439...   161   7e-37
ref|XP_007217991.1| hypothetical protein PRUPE_ppa005611mg [Prun...   159   3e-36
ref|XP_010523818.1| PREDICTED: uncharacterized protein LOC104802...   156   2e-35
ref|XP_010266583.1| PREDICTED: uncharacterized protein LOC104604...   154   1e-34

>ref|XP_011093988.1| PREDICTED: uncharacterized protein LOC105173804 [Sesamum indicum]
           gi|747045438|ref|XP_011093996.1| PREDICTED:
           uncharacterized protein LOC105173804 [Sesamum indicum]
           gi|747045440|ref|XP_011094003.1| PREDICTED:
           uncharacterized protein LOC105173804 [Sesamum indicum]
          Length = 454

 Score =  268 bits (685), Expect = 4e-69
 Identities = 156/273 (57%), Positives = 178/273 (65%), Gaps = 23/273 (8%)
 Frame = -1

Query: 750 MALAEARAALHRTANRCLVQEDAKRAPKLACCSSVPTCVKQADT-VSSASGGQDVPGAVS 574
           MALAEARAA  RTANRC VQEDAKRAPKLACCSSV   VKQA+T  ++A+ GQD P   S
Sbjct: 2   MALAEARAAWQRTANRCFVQEDAKRAPKLACCSSVTPSVKQAETGTNTAAAGQDTPSTGS 61

Query: 573 ---NLNPSFSHLSPNSKWWLYLQPNYGYQKGLMDERFTSMEIN----QKHESSGAFARNE 415
              NL PS+S+LSPNSKWWL LQP+Y YQKGLMDE+F S+E N    Q  ES GA AR E
Sbjct: 62  LPLNL-PSYSNLSPNSKWWLQLQPSYVYQKGLMDEQFNSLEANMGTCQMQESLGALARKE 120

Query: 414 DNPGLIDQTSCKKLSCG---RNMETSDKLEFGVKEDGLRPFYSWNCQDPLKIVEDVCPEV 244
           D+PGL +Q +    S     R+  T DK  FGVKED             LK+   V  EV
Sbjct: 121 DDPGLSNQMTSNTFSSDSQYRSFTTGDKKVFGVKED----------VGELKLGGLVGDEV 170

Query: 243 PKGANEPCFDPGSSWIGDEKNVPWWRTADTDELASLVAQRSLDYIENCDLPWPQNARVKK 64
            K ANE   D  SSWIG E+N PWWRTADT ELA LV++  LD+IENCDLP PQNA VKK
Sbjct: 171 LKNANELYLDSESSWIGGERNTPWWRTADTGELAFLVSRSLLDHIENCDLPSPQNACVKK 230

Query: 63  DVGVNICCFGHDGIPN------------HHYSS 1
           D+ VN C FGHD IP+            HH+S+
Sbjct: 231 DMDVNTCSFGHDRIPSSLLDPNLKSGSPHHFST 263


>ref|XP_011093021.1| PREDICTED: uncharacterized protein LOC105173071 [Sesamum indicum]
           gi|747090650|ref|XP_011093022.1| PREDICTED:
           uncharacterized protein LOC105173071 [Sesamum indicum]
          Length = 478

 Score =  252 bits (643), Expect = 3e-64
 Identities = 147/263 (55%), Positives = 171/263 (65%), Gaps = 20/263 (7%)
 Frame = -1

Query: 750 MALAEARAALHRTANRCLVQEDAKRAPKLACCSS-VPTCVKQADT-VSSASGGQDVPGAV 577
           MA AEARAA  RTANRCLVQEDAKRAPKLACCSS VP  VKQA+T  + A+GGQD+P + 
Sbjct: 1   MAFAEARAAWQRTANRCLVQEDAKRAPKLACCSSSVPPSVKQAETGPTGAAGGQDIPNSP 60

Query: 576 S---NLNPSFSHLSPNSKWWLYLQPNYGYQKGLMDERFTSMEIN----QKHESSGAFARN 418
           S   N  PS+S+LSPNS+WWL LQPNYGYQKGL DE+FTS E      Q  E+SG  A N
Sbjct: 61  SLPFNHYPSYSNLSPNSRWWLQLQPNYGYQKGLTDEQFTSTEGKIGTCQIQENSGDVASN 120

Query: 417 EDNPGLIDQTSCKKLSCGRNMETSDKLEFGVKEDGLRPFYSWNCQDPLKIV--------- 265
                ++D+TS  + S         K EF V++     F   +CQDPLK+          
Sbjct: 121 -----VVDRTSHVESSFDNQF---GKEEFSVEDGKFGTFCGRDCQDPLKLEVKEDFRELR 172

Query: 264 -EDV-CPEVPKGANEPCFDPGSSWIGDEKNVPWWRTADTDELASLVAQRSLDYIENCDLP 91
            ED+ C EV K AN+   D   SWIG EKN PWWRTADT+ELA LVAQRSLD IENCDLP
Sbjct: 173 GEDLFCCEVSKNANDLYLDSELSWIGPEKNTPWWRTADTEELALLVAQRSLDLIENCDLP 232

Query: 90  WPQNARVKKDVGVNICCFGHDGI 22
            PQN    KD+ +N  C+ HD I
Sbjct: 233 RPQNTHFTKDLRINASCYSHDQI 255


>ref|XP_010661958.1| PREDICTED: uncharacterized protein LOC100260339 isoform X1 [Vitis
           vinifera] gi|731422024|ref|XP_010661959.1| PREDICTED:
           uncharacterized protein LOC100260339 isoform X1 [Vitis
           vinifera] gi|731422026|ref|XP_010661960.1| PREDICTED:
           uncharacterized protein LOC100260339 isoform X1 [Vitis
           vinifera]
          Length = 479

 Score =  192 bits (487), Expect = 4e-46
 Identities = 122/276 (44%), Positives = 150/276 (54%), Gaps = 34/276 (12%)
 Frame = -1

Query: 750 MALAEARAALHRTANRCLVQEDAKRAPKLACCSSVPTCVKQADT--VSSASGGQDVPGAV 577
           MA AEARA   R ANRC VQEDAKRAPKLACC S  +  KQAD    ++A G    P   
Sbjct: 1   MAAAEARAVWQRAANRCFVQEDAKRAPKLACCPSSSSSSKQADAGHANAADGPDHPPVGF 60

Query: 576 SNLN-PSFSHLSPNSKWWLYLQPNYGYQKGLMDERFTSMEI----------NQKHESSGA 430
             LN  S+S+L P+++WWL LQPNYGYQKGL  E+  ++E           ++  E  GA
Sbjct: 61  MPLNRTSYSNLPPDTRWWLQLQPNYGYQKGLTSEQLNALEAEVEMLIDGTASKTSELDGA 120

Query: 429 FARNEDNPGLIDQTSCKKLSCGRNMETS---DKLEFG--VKEDG------LRPFYSWNCQ 283
           +A+NED  G +D         G+N E+    D + F   V++D       +    S N Q
Sbjct: 121 YAQNEDGSGRVDG--------GKNTESFFDVDNINFAGCVEKDPDFGKQEVNALDSKNAQ 172

Query: 282 D----------PLKIVEDVCPEVPKGANEPCFDPGSSWIGDEKNVPWWRTADTDELASLV 133
           D           L   E +     K  +E   D  SSWIG EKN PWWRTADTDELASLV
Sbjct: 173 DLEVNNMWKYYELVETEPIGSSASKQPSELYLDSESSWIGVEKNEPWWRTADTDELASLV 232

Query: 132 AQRSLDYIENCDLPWPQNARVKKDVGVNICCFGHDG 25
            Q+SLD+IENCDLP PQ   V+ D    +  F H G
Sbjct: 233 VQKSLDHIENCDLPPPQKMHVRSDPFAPLGSFVHKG 268


>ref|XP_002265230.1| PREDICTED: uncharacterized protein LOC100260339 isoform X2 [Vitis
           vinifera]
          Length = 478

 Score =  192 bits (487), Expect = 4e-46
 Identities = 122/276 (44%), Positives = 150/276 (54%), Gaps = 34/276 (12%)
 Frame = -1

Query: 750 MALAEARAALHRTANRCLVQEDAKRAPKLACCSSVPTCVKQADT--VSSASGGQDVPGAV 577
           MA AEARA   R ANRC VQEDAKRAPKLACC S  +  KQAD    ++A G    P   
Sbjct: 1   MAAAEARAVWQRAANRCFVQEDAKRAPKLACCPSSSSSSKQADAGHANAADGPDHPPVGF 60

Query: 576 SNLN-PSFSHLSPNSKWWLYLQPNYGYQKGLMDERFTSMEI----------NQKHESSGA 430
             LN  S+S+L P+++WWL LQPNYGYQKGL  E+  ++E           ++  E  GA
Sbjct: 61  MPLNRTSYSNLPPDTRWWLQLQPNYGYQKGLTSEQLNALEAEVEMLIDGTASKTSELDGA 120

Query: 429 FARNEDNPGLIDQTSCKKLSCGRNMETS---DKLEFG--VKEDG------LRPFYSWNCQ 283
           +A+NED  G +D         G+N E+    D + F   V++D       +    S N Q
Sbjct: 121 YAQNEDGSGRVDG--------GKNTESFFDVDNINFAGCVEKDPDFGKQEVNALDSKNAQ 172

Query: 282 D----------PLKIVEDVCPEVPKGANEPCFDPGSSWIGDEKNVPWWRTADTDELASLV 133
           D           L   E +     K  +E   D  SSWIG EKN PWWRTADTDELASLV
Sbjct: 173 DLEVNNMWKYYELVETEPIGSSASKQPSELYLDSESSWIGVEKNEPWWRTADTDELASLV 232

Query: 132 AQRSLDYIENCDLPWPQNARVKKDVGVNICCFGHDG 25
            Q+SLD+IENCDLP PQ   V+ D    +  F H G
Sbjct: 233 VQKSLDHIENCDLPPPQKMHVRSDPFAPLGSFVHKG 268


>ref|XP_012083190.1| PREDICTED: uncharacterized protein LOC105642829 isoform X3
           [Jatropha curcas]
          Length = 352

 Score =  186 bits (473), Expect = 2e-44
 Identities = 108/268 (40%), Positives = 144/268 (53%), Gaps = 27/268 (10%)
 Frame = -1

Query: 750 MALAEARAALHRTANRCLVQEDAKRAPKLACCSSVPTCVKQADTVSS-ASGGQDVPGA-- 580
           MA+AEARA   R ANRC VQEDAKRAPKLACC S  +  +Q D  S+ A+   D PG   
Sbjct: 1   MAVAEARAVWQRVANRCFVQEDAKRAPKLACCQSSSSSSRQIDGGSTDAADMTDNPGVGF 60

Query: 579 -VSNLNPSFSHLSPNSKWWLYLQPNYGYQKGLMDERFTSMEINQKH-ESSGAFARNEDNP 406
              + NPS+S+L P+++WWL LQPNYGYQKGL  E+  ++E   +   +  A + ++   
Sbjct: 61  MPLHRNPSYSNLPPDTRWWLQLQPNYGYQKGLTYEQLNALEAEMESLRAEIADSTSKIGE 120

Query: 405 GLIDQTSCKKLSCGRNMETSDKLEFGVKEDGLRPFYSWNCQDPLKIVEDVCPEV------ 244
              D   C+     +N E+S    +    D ++       Q+ + + +   PE       
Sbjct: 121 VCPDDDRCRCFDGSKNSESSFDAHWKTAADCMKKDLEVKRQETIGLYDKNAPESIELKDT 180

Query: 243 ----------------PKGANEPCFDPGSSWIGDEKNVPWWRTADTDELASLVAQRSLDY 112
                           P+  NE CFDP S WI DEK VPWWRT D D+L SLVAQ+SLDY
Sbjct: 181 RVNSNWMDMDPIECCGPQKTNEYCFDPESPWIEDEKTVPWWRTTDKDDLVSLVAQKSLDY 240

Query: 111 IENCDLPWPQNARVKKDVGVNICCFGHD 28
            +NCDLP PQ   V++   V +    HD
Sbjct: 241 FQNCDLPPPQKMHVRRYPSVRVGSSDHD 268


>ref|XP_012083189.1| PREDICTED: uncharacterized protein LOC105642829 isoform X2
           [Jatropha curcas]
          Length = 475

 Score =  186 bits (473), Expect = 2e-44
 Identities = 108/268 (40%), Positives = 144/268 (53%), Gaps = 27/268 (10%)
 Frame = -1

Query: 750 MALAEARAALHRTANRCLVQEDAKRAPKLACCSSVPTCVKQADTVSS-ASGGQDVPGA-- 580
           MA+AEARA   R ANRC VQEDAKRAPKLACC S  +  +Q D  S+ A+   D PG   
Sbjct: 1   MAVAEARAVWQRVANRCFVQEDAKRAPKLACCQSSSSSSRQIDGGSTDAADMTDNPGVGF 60

Query: 579 -VSNLNPSFSHLSPNSKWWLYLQPNYGYQKGLMDERFTSMEINQKH-ESSGAFARNEDNP 406
              + NPS+S+L P+++WWL LQPNYGYQKGL  E+  ++E   +   +  A + ++   
Sbjct: 61  MPLHRNPSYSNLPPDTRWWLQLQPNYGYQKGLTYEQLNALEAEMESLRAEIADSTSKIGE 120

Query: 405 GLIDQTSCKKLSCGRNMETSDKLEFGVKEDGLRPFYSWNCQDPLKIVEDVCPEV------ 244
              D   C+     +N E+S    +    D ++       Q+ + + +   PE       
Sbjct: 121 VCPDDDRCRCFDGSKNSESSFDAHWKTAADCMKKDLEVKRQETIGLYDKNAPESIELKDT 180

Query: 243 ----------------PKGANEPCFDPGSSWIGDEKNVPWWRTADTDELASLVAQRSLDY 112
                           P+  NE CFDP S WI DEK VPWWRT D D+L SLVAQ+SLDY
Sbjct: 181 RVNSNWMDMDPIECCGPQKTNEYCFDPESPWIEDEKTVPWWRTTDKDDLVSLVAQKSLDY 240

Query: 111 IENCDLPWPQNARVKKDVGVNICCFGHD 28
            +NCDLP PQ   V++   V +    HD
Sbjct: 241 FQNCDLPPPQKMHVRRYPSVRVGSSDHD 268


>ref|XP_012083184.1| PREDICTED: uncharacterized protein LOC105642829 isoform X1
           [Jatropha curcas] gi|802694482|ref|XP_012083185.1|
           PREDICTED: uncharacterized protein LOC105642829 isoform
           X1 [Jatropha curcas] gi|802694486|ref|XP_012083186.1|
           PREDICTED: uncharacterized protein LOC105642829 isoform
           X1 [Jatropha curcas] gi|802694489|ref|XP_012083187.1|
           PREDICTED: uncharacterized protein LOC105642829 isoform
           X1 [Jatropha curcas] gi|802694493|ref|XP_012083188.1|
           PREDICTED: uncharacterized protein LOC105642829 isoform
           X1 [Jatropha curcas] gi|643716844|gb|KDP28470.1|
           hypothetical protein JCGZ_14241 [Jatropha curcas]
          Length = 480

 Score =  186 bits (473), Expect = 2e-44
 Identities = 108/268 (40%), Positives = 144/268 (53%), Gaps = 27/268 (10%)
 Frame = -1

Query: 750 MALAEARAALHRTANRCLVQEDAKRAPKLACCSSVPTCVKQADTVSS-ASGGQDVPGA-- 580
           MA+AEARA   R ANRC VQEDAKRAPKLACC S  +  +Q D  S+ A+   D PG   
Sbjct: 1   MAVAEARAVWQRVANRCFVQEDAKRAPKLACCQSSSSSSRQIDGGSTDAADMTDNPGVGF 60

Query: 579 -VSNLNPSFSHLSPNSKWWLYLQPNYGYQKGLMDERFTSMEINQKH-ESSGAFARNEDNP 406
              + NPS+S+L P+++WWL LQPNYGYQKGL  E+  ++E   +   +  A + ++   
Sbjct: 61  MPLHRNPSYSNLPPDTRWWLQLQPNYGYQKGLTYEQLNALEAEMESLRAEIADSTSKIGE 120

Query: 405 GLIDQTSCKKLSCGRNMETSDKLEFGVKEDGLRPFYSWNCQDPLKIVEDVCPEV------ 244
              D   C+     +N E+S    +    D ++       Q+ + + +   PE       
Sbjct: 121 VCPDDDRCRCFDGSKNSESSFDAHWKTAADCMKKDLEVKRQETIGLYDKNAPESIELKDT 180

Query: 243 ----------------PKGANEPCFDPGSSWIGDEKNVPWWRTADTDELASLVAQRSLDY 112
                           P+  NE CFDP S WI DEK VPWWRT D D+L SLVAQ+SLDY
Sbjct: 181 RVNSNWMDMDPIECCGPQKTNEYCFDPESPWIEDEKTVPWWRTTDKDDLVSLVAQKSLDY 240

Query: 111 IENCDLPWPQNARVKKDVGVNICCFGHD 28
            +NCDLP PQ   V++   V +    HD
Sbjct: 241 FQNCDLPPPQKMHVRRYPSVRVGSSDHD 268


>ref|XP_012835590.1| PREDICTED: uncharacterized protein LOC105956292 [Erythranthe
           guttatus] gi|604334813|gb|EYU38879.1| hypothetical
           protein MIMGU_mgv1a007152mg [Erythranthe guttata]
          Length = 417

 Score =  184 bits (467), Expect = 8e-44
 Identities = 119/248 (47%), Positives = 139/248 (56%), Gaps = 13/248 (5%)
 Frame = -1

Query: 750 MALAEARAALHRTANRCLVQEDAKRAPKLACCSSVPT-CVKQADTVSSASGG-QDVPGAV 577
           MALAEARAA  RT NRCLVQEDAKRAPKLA  SS+P  C K +DT  + S   QD+P   
Sbjct: 1   MALAEARAAWQRTGNRCLVQEDAKRAPKLAYSSSLPPPCSKPSDTGPTTSPSPQDIPPPS 60

Query: 576 SNLNP-----SFSHLSPNSKWWLYLQPNYGYQKGLMDERFTSMEINQKHESSGAFARNED 412
           ++ NP     S+S+LSPNS+WWL+LQPNY  QKG  DE      I+      G F   E 
Sbjct: 61  ASSNPFDRNSSYSNLSPNSRWWLHLQPNYPCQKGFTDE------IDTCQIKDGNFVARES 114

Query: 411 NPGLIDQTSCKKLSCGRNMETSDKLEFGVKEDGLRPFYSWNCQ-----DPLKIVEDVCPE 247
                D T                    VKE+  R F   N Q     +P      +   
Sbjct: 115 K----DST--------------------VKEEKFRSFCDINPQGRYFDEPKDECVVISCG 150

Query: 246 VPKGANEPCFDPGSSWIG-DEKNVPWWRTADTDELASLVAQRSLDYIENCDLPWPQNARV 70
           V K  NE CF   SSWIG  EKN PWWRTADT+ELA LVAQ+S D+IENCDLP PQN  +
Sbjct: 151 VSKNTNEHCFYSESSWIGAGEKNSPWWRTADTEELALLVAQKSHDFIENCDLPSPQNTHL 210

Query: 69  KKDVGVNI 46
           KK+  +NI
Sbjct: 211 KKETSMNI 218


>ref|XP_009590542.1| PREDICTED: uncharacterized protein LOC104087700 [Nicotiana
           tomentosiformis] gi|697163439|ref|XP_009590543.1|
           PREDICTED: uncharacterized protein LOC104087700
           [Nicotiana tomentosiformis]
          Length = 474

 Score =  177 bits (450), Expect = 7e-42
 Identities = 113/264 (42%), Positives = 146/264 (55%), Gaps = 29/264 (10%)
 Frame = -1

Query: 750 MALAEARAALHRTANRCLVQEDAKRAPKLACCSSVPTCVKQADTVSSASGGQDVPGAVS- 574
           MA+AEAR A  R ANRCLVQEDAKRAPKLACCSS     KQ DT  +       P +   
Sbjct: 1   MAVAEARTAWQRAANRCLVQEDAKRAPKLACCSSASPSSKQVDTGPANGADAQNPSSTCF 60

Query: 573 ---NLNPSFSHLSPNSKWWLYLQPNYGYQKGLMDERFTSMEINQKH---------ESSGA 430
              N N S+  LSPN++WWL+LQPNYGYQ+GL+ E   S E   ++         ++S  
Sbjct: 61  LPFNRNSSYCDLSPNTRWWLHLQPNYGYQRGLVSETVDSQEAEMENIGPVLDSTPKNSKF 120

Query: 429 FARNEDNPGLIDQTSCK-----KLSCGRNMETSDKLEFGVKEDGLRPFYSWNCQDPLKIV 265
             ++E + G +D+ +       ++    +   SD LE G KE  L   ++   +D L + 
Sbjct: 121 CDQSEADGGYMDEVTVGGSLDYQVKRSASHVNSD-LEVGSKE--LIDVFTEISKDGLHLE 177

Query: 264 EDVCP-----------EVPKGANEPCFDPGSSWIGDEKNVPWWRTADTDELASLVAQRSL 118
           +   P            V K  +E  FD    WIG EK  PWWRTAD +ELA LVAQRS 
Sbjct: 178 DTGYPYEESKKDMVDFTVCKQVDELSFDREYPWIGVEKTEPWWRTADREELALLVAQRSH 237

Query: 117 DYIENCDLPWPQNARVKKDVGVNI 46
           D+IENCDLP PQN  VK+D  V++
Sbjct: 238 DFIENCDLPQPQNNFVKRDHDVDV 261


>emb|CDP08657.1| unnamed protein product [Coffea canephora]
          Length = 476

 Score =  175 bits (444), Expect = 4e-41
 Identities = 113/264 (42%), Positives = 146/264 (55%), Gaps = 23/264 (8%)
 Frame = -1

Query: 750 MALAEARAALHRTANRCLVQEDAKRAPKLACCSSVPTCVKQADT-VSSASGGQDVPGAVS 574
           MA+AEARAA  RT NRCLVQEDAKRAPKLA C      V++ +   ++A+  QD+     
Sbjct: 1   MAVAEARAAWQRTVNRCLVQEDAKRAPKLAYCPLASPFVREVEVGPANAAEAQDISSVAF 60

Query: 573 ---NLNPSFSHLSPNSKWWLYLQPNYGYQKGLMDERF--TSMEINQKHESSGAFAR---N 418
              N + SFS+LSPNSKWWL L  NY +Q+GL D++   T  E+   H+ + +  +   +
Sbjct: 61  PPFNQSTSFSNLSPNSKWWLQLPSNYRHQRGLTDQQLNCTDSEMETFHDRTSSALKMPES 120

Query: 417 EDNPGLID---QTSCKKLSCGRNMETSDKLEFGVKEDGLRPFYSWN--CQDPLKIVEDVC 253
           ED   L     +T     S  R + T  K +  V +  L P    N  C   L+ V D+ 
Sbjct: 121 EDGSALFYDSIETESFVDSDLRILSTGLKKDTEVGDKDLTPMNKLNPQCSPKLEDVGDLY 180

Query: 252 PE---------VPKGANEPCFDPGSSWIGDEKNVPWWRTADTDELASLVAQRSLDYIENC 100
                      V K  NE   D  S WIGDEK  PWWRTAD DELA LV++ S   IENC
Sbjct: 181 ERAEIGTYGCTVSKKKNELFPDSESPWIGDEKIGPWWRTADQDELALLVSRGSFGLIENC 240

Query: 99  DLPWPQNARVKKDVGVNICCFGHD 28
           DLP PQN  V+++  V++CCF HD
Sbjct: 241 DLPQPQNTCVEREAFVDLCCFDHD 264


>ref|XP_012480955.1| PREDICTED: uncharacterized protein LOC105795842 [Gossypium
           raimondii] gi|823124621|ref|XP_012480961.1| PREDICTED:
           uncharacterized protein LOC105795842 [Gossypium
           raimondii] gi|763742133|gb|KJB09632.1| hypothetical
           protein B456_001G153700 [Gossypium raimondii]
           gi|763742134|gb|KJB09633.1| hypothetical protein
           B456_001G153700 [Gossypium raimondii]
           gi|763742135|gb|KJB09634.1| hypothetical protein
           B456_001G153700 [Gossypium raimondii]
          Length = 425

 Score =  174 bits (442), Expect = 6e-41
 Identities = 100/232 (43%), Positives = 129/232 (55%), Gaps = 3/232 (1%)
 Frame = -1

Query: 750 MALAEARAALHRTANRCLVQEDAKRAPKLACCSSVPTCVKQADTVSSASGGQDVPGA--- 580
           MA AEARA   RTANRC VQEDAKRAPKLACC S   C +   + +  +  +D P     
Sbjct: 1   MAAAEARAVWQRTANRCFVQEDAKRAPKLACCQSSSHCKQADSSPTGVADARDHPAVDLN 60

Query: 579 VSNLNPSFSHLSPNSKWWLYLQPNYGYQKGLMDERFTSMEINQKHESSGAFARNEDNPGL 400
             N NPS+S+L P+ +WWL LQP+YG QKGL +E+  ++E   + ES     ++  N   
Sbjct: 61  ALNRNPSYSNLPPDMRWWLQLQPSYGPQKGLKNEQLYALE--DEVESLKGEIKSPSNVSR 118

Query: 399 IDQTSCKKLSCGRNMETSDKLEFGVKEDGLRPFYSWNCQDPLKIVEDVCPEVPKGANEPC 220
           +     +  S         K   G   D      ++   +    +E V   V K  N+ C
Sbjct: 119 VQPHDAQDAS-----GVDRKRNNGGSLDSTETVRNYELLE----MESVECHVSKKINDCC 169

Query: 219 FDPGSSWIGDEKNVPWWRTADTDELASLVAQRSLDYIENCDLPWPQNARVKK 64
           +DP S W GD K  PWWRT D DELASLVAQ+SLD+IENCDLP PQ   V++
Sbjct: 170 YDPESPWAGDGKAEPWWRTTDKDELASLVAQKSLDFIENCDLPPPQKMHVRR 221


>gb|KHG01749.1| hypothetical protein F383_21898 [Gossypium arboreum]
          Length = 432

 Score =  172 bits (435), Expect = 4e-40
 Identities = 100/232 (43%), Positives = 129/232 (55%), Gaps = 3/232 (1%)
 Frame = -1

Query: 750 MALAEARAALHRTANRCLVQEDAKRAPKLACCSSVPTCVKQADTVSSASGGQDVPGA--- 580
           MA AEARA   RTANRC VQEDAKRAPKLACC S   C +   + +  +  +D P     
Sbjct: 1   MAAAEARAVWQRTANRCFVQEDAKRAPKLACCQSSSHCKQADSSPTGVADARDHPAVDLN 60

Query: 579 VSNLNPSFSHLSPNSKWWLYLQPNYGYQKGLMDERFTSMEINQKHESSGAFARNEDNPGL 400
             N NPS+S+L  + +WWL LQP+YG QKGL +E+  ++E   + ES  A  ++  N   
Sbjct: 61  ALNRNPSYSNLPLDMRWWLQLQPSYGPQKGLTNEQLYALE--DEVESLKAEIKSPSNVSR 118

Query: 399 IDQTSCKKLSCGRNMETSDKLEFGVKEDGLRPFYSWNCQDPLKIVEDVCPEVPKGANEPC 220
           +     +  S         K   G   D      ++   +    +E V   V K  N+ C
Sbjct: 119 VHPHDAQDAS-----GVDRKKNNGGSLDSTETVRNYELLE----MESVECHVSKKINDCC 169

Query: 219 FDPGSSWIGDEKNVPWWRTADTDELASLVAQRSLDYIENCDLPWPQNARVKK 64
           +DP S W GD K  PWWRT D DELASLVAQ+SLD+IENCDLP PQ   V++
Sbjct: 170 YDPESPWAGDGKAEPWWRTTDKDELASLVAQKSLDFIENCDLPPPQKMHVRR 221


>ref|XP_011038435.1| PREDICTED: uncharacterized protein LOC105135311 [Populus
           euphratica]
          Length = 480

 Score =  170 bits (430), Expect = 2e-39
 Identities = 107/278 (38%), Positives = 139/278 (50%), Gaps = 34/278 (12%)
 Frame = -1

Query: 750 MALAEARAALHRTANRCLVQEDAKRAPKLACCSSVPTCVKQADTVSSASGGQDVPGAVSN 571
           MA AEARA   RTANRC VQEDAKRAPKLACC S  +  KQ D     +  QD+P   S 
Sbjct: 1   MAAAEARAVWQRTANRCFVQEDAKRAPKLACCQSSSSSSKQLD--GGPTSAQDMPDQSSG 58

Query: 570 ------LNPSFSHLSPNSKWWLYLQPNYGYQKGLMDERFTSMEINQKH--------ESSG 433
                   PS+S L P+++WWL LQP+YGYQK    E+  ++E   +          S  
Sbjct: 59  GFMPLRRYPSYSSLPPDTRWWLQLQPSYGYQKCFTLEQLNALEAELESLRADIVDSPSKS 118

Query: 432 AFARNEDNPGLI---------DQTSCKKLSCGRNMETSDKLEFGVKEDGLRPFYSWNCQD 280
            F + +D   +             SC  +     M+  D     VK+  L+  Y  + Q+
Sbjct: 119 KFCQQDDTDSIFVDGSKNSESSLDSCCMIPADYVMKDHD-----VKKQELKVLYDKDFQE 173

Query: 279 PLKI-----------VEDVCPEVPKGANEPCFDPGSSWIGDEKNVPWWRTADTDELASLV 133
             ++           +E     + +  +E  FDP S+WIG EKN+PWWR  D D+LASLV
Sbjct: 174 FKELKGARKNSKSTEIEPTGWPISQKDSEYAFDPESAWIGGEKNMPWWRVTDKDDLASLV 233

Query: 132 AQRSLDYIENCDLPWPQNARVKKDVGVNICCFGHDGIP 19
           AQ+SLDYI NCDLP PQ   + K        F HD  P
Sbjct: 234 AQKSLDYITNCDLPPPQKMNIGKYPCARPGSFQHDNTP 271


>ref|XP_012490137.1| PREDICTED: uncharacterized protein LOC105802813 [Gossypium
           raimondii] gi|763774447|gb|KJB41570.1| hypothetical
           protein B456_007G109800 [Gossypium raimondii]
           gi|763774448|gb|KJB41571.1| hypothetical protein
           B456_007G109800 [Gossypium raimondii]
          Length = 436

 Score =  165 bits (418), Expect = 4e-38
 Identities = 101/247 (40%), Positives = 133/247 (53%), Gaps = 8/247 (3%)
 Frame = -1

Query: 750 MALAEARAALHRTANRCLVQEDAKRAPKLACCSSVPTCVKQADTVSSASGGQDVPGAVS- 574
           MA AEARAA  RTANRC VQEDAKRAPKLACC S  +  KQAD+  S    +    A+  
Sbjct: 1   MAAAEARAAWQRTANRCFVQEDAKRAPKLACCQS-SSSTKQADSNPSGVADKHNHPAIGF 59

Query: 573 ---NLNPSFSHLSPNSKWWLYLQPNYGYQKGLMDERFTSMEINQKHESSGAFARNEDNPG 403
              N NPS+S+L P+++WWL LQPNYG Q GL +E+  ++E              ++   
Sbjct: 60  MPLNRNPSYSNLPPDTRWWLQLQPNYGPQMGLTNEQLNALE--------------DEVES 105

Query: 402 LIDQTSCKKLSCGRNMETSDKLEFGVKEDGLRPFYSWNCQDPLKIVE----DVCPEVPKG 235
           L  + +  K+S     +  D      K++      S    +  + +E    + C  +   
Sbjct: 106 LKAEINSSKVSSDLQQDAHDSSITDRKKNNSYSLDSKETMESFEFLEMESVECCASMK-- 163

Query: 234 ANEPCFDPGSSWIGDEKNVPWWRTADTDELASLVAQRSLDYIENCDLPWPQNARVKKDVG 55
            N+ C +P S W G  K  PWWRT D DEL SLVAQ+SLD+IENCDLP PQ   V+   G
Sbjct: 164 TNDFCSEPESPWSGGGKAEPWWRTTDKDELTSLVAQKSLDFIENCDLPPPQKVHVR---G 220

Query: 54  VNICCFG 34
            +  C G
Sbjct: 221 YSHVCSG 227


>ref|XP_010100718.1| hypothetical protein L484_023487 [Morus notabilis]
           gi|587895379|gb|EXB83880.1| hypothetical protein
           L484_023487 [Morus notabilis]
          Length = 472

 Score =  165 bits (417), Expect = 5e-38
 Identities = 103/262 (39%), Positives = 141/262 (53%), Gaps = 24/262 (9%)
 Frame = -1

Query: 750 MALAEARAALHRTANRCLVQEDAKRAPKLACCSSVPTCVK-QADTVSSASGGQDVPGA-- 580
           MA+AEARA   R ANRC VQEDAKRAPKLACC S  T  + +A   ++A+ G D P    
Sbjct: 1   MAVAEARAVWQRAANRCFVQEDAKRAPKLACCQSSSTSKQVEAGGHATATDGPDHPAVGF 60

Query: 579 -VSNLNPSFSHLSPNSKWWLYLQPNYGYQKGLMDERFTSME----------INQKHESSG 433
             +N  PS+S+L P+++WWL++QPNYG QKG   E+  ++E          +N     S 
Sbjct: 61  MPTNRCPSYSNLPPDTRWWLHMQPNYGCQKGFTYEQMNALENEEGTKNAGVVNSTSRISE 120

Query: 432 AFARNEDNPG----LIDQTSCKKLS--CGRNMETSDKLEF----GVKEDGLRPFYSWNCQ 283
           A  R  D        +   + KK S    +N++  D  +     G+++  +    SW   
Sbjct: 121 AHKRKGDKNNECFVSVHNAAQKKASEVGKKNVKALDGKDIEELIGLEDSTV----SWEIM 176

Query: 282 DPLKIVEDVCPEVPKGANEPCFDPGSSWIGDEKNVPWWRTADTDELASLVAQRSLDYIEN 103
                V+ +     K +NE CF+P  SW+G EK+ PWWR  D DEL SLVAQ+SLD + N
Sbjct: 177 Q----VDSIDCSDTKQSNEMCFEPEYSWMGSEKSEPWWRMTDRDELVSLVAQKSLDRVGN 232

Query: 102 CDLPWPQNARVKKDVGVNICCF 37
           CDLP PQ    ++     I CF
Sbjct: 233 CDLPPPQKTSHRRHPYARIGCF 254


>gb|KHG18669.1| Alanine--tRNA ligase [Gossypium arboreum]
          Length = 436

 Score =  162 bits (411), Expect = 2e-37
 Identities = 106/252 (42%), Positives = 134/252 (53%), Gaps = 13/252 (5%)
 Frame = -1

Query: 750 MALAEARAALHRTANRCLVQEDAKRAPKLACCSSVPTCVKQADTVSSASGGQDVPGAVS- 574
           MA AEARAA  RTANRC VQEDAKRAPKLACC S  +  KQAD+  S    +    A+  
Sbjct: 1   MAAAEARAAWQRTANRCFVQEDAKRAPKLACCQS-SSSTKQADSNPSGVADKHNHPAIGF 59

Query: 573 ---NLNPSFSHLSPNSKWWLYLQPNYGYQKGLMDERFTSME---------INQKHESSGA 430
              N  PS+S+L P+++WWL LQPNYG Q  L +E+  ++E         IN    SS  
Sbjct: 60  MPLNRTPSYSNLPPDTRWWLQLQPNYGPQMSLTNEQLNALEDEVESLKAEINSSKVSSD- 118

Query: 429 FARNEDNPGLIDQTSCKKLSCGRNMETSDKLEFGVKEDGLRPFYSWNCQDPLKIVEDVCP 250
             ++  +  + D+      S   + ET +  EF   E       S  C+  +K       
Sbjct: 119 LQQDAHDISITDRKKNNSYSLD-SKETMESFEFLEME-------SVECRASMK------- 163

Query: 249 EVPKGANEPCFDPGSSWIGDEKNVPWWRTADTDELASLVAQRSLDYIENCDLPWPQNARV 70
                 N+ C +P S W G  K  PWWRT D DELASLVAQ+SLD+IENCDLP PQ   V
Sbjct: 164 -----TNDFCSEPESPWSGGGKAEPWWRTTDKDELASLVAQKSLDFIENCDLPPPQKVHV 218

Query: 69  KKDVGVNICCFG 34
           +   G +  C G
Sbjct: 219 R---GYSHVCSG 227


>ref|XP_008376151.1| PREDICTED: uncharacterized protein LOC103439371 [Malus domestica]
           gi|657968877|ref|XP_008376152.1| PREDICTED:
           uncharacterized protein LOC103439371 [Malus domestica]
          Length = 450

 Score =  161 bits (407), Expect = 7e-37
 Identities = 100/256 (39%), Positives = 136/256 (53%), Gaps = 19/256 (7%)
 Frame = -1

Query: 750 MALAEARAALHRTANRCLVQEDAKRAPKLACCSSVPTCVKQADT-VSSASGGQDVPGA-- 580
           MA AEARA   RTANRC VQEDAKRAPKLACC S  +  +Q D   ++ + G D P    
Sbjct: 1   MAAAEARAVWQRTANRCFVQEDAKRAPKLACCQSSSSTTRQVDAGPATVAEGPDHPATGF 60

Query: 579 -VSNLNPSFSHLSPNSKWWLYLQPNYGYQKGLMDERFTSMEINQKHESSGAFARNEDNPG 403
              N NPS+S L P+++WWL +QP+YGYQK    E+ +++E + +   +G F ++     
Sbjct: 61  MPINRNPSYSSLPPDTRWWLQMQPSYGYQKDFTYEQLSALEADMETLRAG-FVKSTPKTS 119

Query: 402 LIDQ-----TSCKKLSCGRNMETSDKLEFGVKEDGLRPFYSWNCQDPLKI---------- 268
            + Q     T      C +    + K +   K       YS N Q+PL+           
Sbjct: 120 EVHQQKAEFTDAVSAVCMKTGYEAQKQDVSAK-------YSKNMQEPLQYEMKEKYEIMG 172

Query: 267 VEDVCPEVPKGANEPCFDPGSSWIGDEKNVPWWRTADTDELASLVAQRSLDYIENCDLPW 88
           ++ +   V     E C D    WIG  +  PWWRT D DELASLVAQ+SL+++ENCDLP 
Sbjct: 173 MDTIDCPVSNQPKEFCCD--YPWIGGGRAEPWWRTTDRDELASLVAQKSLNHMENCDLPP 230

Query: 87  PQNARVKKDVGVNICC 40
           PQ    K+    +I C
Sbjct: 231 PQKTYHKRHPYADIGC 246


>ref|XP_007217991.1| hypothetical protein PRUPE_ppa005611mg [Prunus persica]
           gi|462414453|gb|EMJ19190.1| hypothetical protein
           PRUPE_ppa005611mg [Prunus persica]
          Length = 451

 Score =  159 bits (402), Expect = 3e-36
 Identities = 98/252 (38%), Positives = 136/252 (53%), Gaps = 9/252 (3%)
 Frame = -1

Query: 750 MALAEARAALHRTANRCLVQEDAKRAPKLACCSSVPTCVKQADT-VSSASGGQDVPGA-- 580
           MA AEARA   R ANRC VQEDAKRAPKLACC S  +  KQ D   ++A+ G D P A  
Sbjct: 1   MAAAEARAVWQRVANRCFVQEDAKRAPKLACCQSSSSTTKQVDAGPATAAEGPDHPAAGF 60

Query: 579 -VSNLNPSFSHLSPNSKWWLYLQPNYGYQKGLMDERFTSMEINQKHESSGAFARNEDNPG 403
              N NPS+S L P+++WWL +QP+YGYQK    E+  ++E + +   +G F ++     
Sbjct: 61  VPLNRNPSYSSLPPDARWWLQMQPSYGYQKDFTYEQLNALEADMETLRAG-FVKSTPKTS 119

Query: 402 LIDQTS--CKKLSCGRNMETSDK---LEFGVKEDGLRPFYSWNCQDPLKIVEDVCPEVPK 238
            + Q    C      +N +   +    ++G     L  +     +  +  ++ +     K
Sbjct: 120 EVRQQKGECTDADGHKNSKVQKQDVNAQYGKDMKELVQYKDVREKYEIMGMDTIDYPFSK 179

Query: 237 GANEPCFDPGSSWIGDEKNVPWWRTADTDELASLVAQRSLDYIENCDLPWPQNARVKKDV 58
              E C D    WIG  +  PWWRT D DELASLVAQ+SL+++ENCDLP PQ    K+  
Sbjct: 180 QPEEFCCD--YPWIGGGRAEPWWRTTDRDELASLVAQKSLNHVENCDLPPPQKMYHKRHP 237

Query: 57  GVNICCFGHDGI 22
             +I C  H+ I
Sbjct: 238 YADIGCSDHNVI 249


>ref|XP_010523818.1| PREDICTED: uncharacterized protein LOC104802082 [Tarenaya
           hassleriana] gi|729453706|ref|XP_010523819.1| PREDICTED:
           uncharacterized protein LOC104802082 [Tarenaya
           hassleriana]
          Length = 430

 Score =  156 bits (394), Expect = 2e-35
 Identities = 103/265 (38%), Positives = 131/265 (49%), Gaps = 22/265 (8%)
 Frame = -1

Query: 750 MALAEARAALHRTANRCLVQEDAKRAPKLACC-SSVPTCVKQADTVSSASGGQDVPGAVS 574
           MA AEAR    R  NRC VQEDAKRAPKLACC SS     KQ D+  S     D+    S
Sbjct: 1   MAAAEARTLWQRAVNRCFVQEDAKRAPKLACCQSSSSASTKQFDSSGSTGTAPDLHDQSS 60

Query: 573 ------NLNPSFSHLSPNSKWWLYLQPNYGYQKGLMDERFTSMEINQKHESSGAFARNED 412
                 N NP F  L P+++WW ++QP Y  +K L+ ++   +E N             D
Sbjct: 61  ACFMPLNRNPRFPDLPPDTRWWAHIQPGYPDRKDLVKDQMRPLEANA------------D 108

Query: 411 NPGLIDQTSCKKLSCGRNMETSDKLEFGVKEDGLRP-------------FYSWNCQDPLK 271
            PG    T  +  S        D   FG ++D  RP               +  C++  +
Sbjct: 109 IPGEGSATVIRSPS-------KDDCSFGPEDDQGRPRCKKRNPGTGENEVEALTCENFQE 161

Query: 270 IVEDVCPEVPKGANEPCFDPGSSW-IGDEKNVPWWRTA-DTDELASLVAQRSLDYIENCD 97
            +E +  E  K +NE  FD GS W +  EK  PWWRT  D DELASLVAQ+SLDY+ENCD
Sbjct: 162 YIELIGYEPSKKSNELSFDMGSPWNLSSEKAEPWWRTTTDKDELASLVAQKSLDYVENCD 221

Query: 96  LPWPQNARVKKDVGVNICCFGHDGI 22
           LP PQ    K+    +  CF  DG+
Sbjct: 222 LPTPQKIHKKRPSYDSPRCFNSDGL 246


>ref|XP_010266583.1| PREDICTED: uncharacterized protein LOC104604051 isoform X3 [Nelumbo
           nucifera]
          Length = 481

 Score =  154 bits (388), Expect = 1e-34
 Identities = 105/273 (38%), Positives = 139/273 (50%), Gaps = 30/273 (10%)
 Frame = -1

Query: 750 MALAEARAALHRTANRCLVQEDAKRAPKLACCSSVPTCVKQADTVSSAS--GGQD--VPG 583
           MA AE RAA  R ANRC VQEDAKRAPKLACC S P+  K    + +     G D  + G
Sbjct: 1   MAAAEVRAAWQRAANRCFVQEDAKRAPKLACCPSSPSSSKPQVDIGAGDLPNGPDHSIAG 60

Query: 582 AVS-NLNPSFSHLSPNSKWWLYLQPNYGYQKGLMDERFTSMEINQKHESSGAFARN---- 418
            +  N N S S+L P+SKWWL LQPN+GYQK    E+  ++E   +    G   +N    
Sbjct: 61  FIPLNWNISNSNLPPDSKWWLQLQPNFGYQKDFTCEQLNALETELEVLKGGEVNKNYELG 120

Query: 417 EDNPGLIDQTSCKKL---------SCGRNMETSDKLEFGVKEDGLRPFYSWNCQDPLKIV 265
            ++P   + ++  +          SC +   T  KL    K   L+   + N Q+ LK  
Sbjct: 121 REHPLSKEDSTYAESGKNADSYLDSCWQVPATCAKLGAEAKMRELKAASTKNLQEMLKHK 180

Query: 264 E-----------DVCPE-VPKGANEPCFDPGSSWIGDEKNVPWWRTADTDELASLVAQRS 121
           +           D  P  + +   +   D  S W+G EK  PWWR  D D+LASLVAQ+S
Sbjct: 181 DIGNYWFQDEEMDFDPSLISEQPEKLTSDSKSPWMGAEKTEPWWRAVDKDDLASLVAQKS 240

Query: 120 LDYIENCDLPWPQNARVKKDVGVNICCFGHDGI 22
           L++IENCDLP PQ   V +   V+  CF  D I
Sbjct: 241 LEHIENCDLPRPQTMHVSRGPFVSHECFNCDKI 273


Top