BLASTX nr result

ID: Lithospermum23_contig00006834 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Lithospermum23_contig00006834
         (1886 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

XP_012844576.1 PREDICTED: UPF0481 protein At3g47200-like [Erythr...   290   3e-88
ACH87183.1 unknown protein [Camellia sinensis]                        284   5e-86
XP_011101987.1 PREDICTED: UPF0481 protein At3g47200-like [Sesamu...   281   2e-84
XP_011086479.1 PREDICTED: UPF0481 protein At3g47200-like isoform...   256   2e-75
EOY05383.1 Uncharacterized protein TCM_020393 isoform 1 [Theobro...   254   1e-74
XP_017610281.1 PREDICTED: UPF0481 protein At3g47200-like [Gossyp...   254   1e-74
XP_016672368.1 PREDICTED: UPF0481 protein At3g47200-like isoform...   253   3e-74
EYU24251.1 hypothetical protein MIMGU_mgv1a019478mg [Erythranthe...   250   4e-74
XP_012484204.1 PREDICTED: UPF0481 protein At3g47200-like isoform...   252   8e-74
OMO53035.1 hypothetical protein CCACVL1_28930 [Corchorus capsula...   252   9e-74
XP_016703022.1 PREDICTED: UPF0481 protein At3g47200-like isoform...   247   5e-72
XP_007034456.2 PREDICTED: UPF0481 protein At3g47200 [Theobroma c...   240   2e-69
XP_015388586.1 PREDICTED: UPF0481 protein At3g47200-like [Citrus...   239   5e-69
KCW60073.1 hypothetical protein EUGRSUZ_H02803, partial [Eucalyp...   238   1e-68
KDO36306.1 hypothetical protein CISIN_1g014530mg [Citrus sinensis]    238   2e-68
XP_010023720.1 PREDICTED: UPF0481 protein At3g47200 isoform X5 [...   238   3e-68
EOY05382.1 Uncharacterized protein TCM_020392 [Theobroma cacao]       237   3e-68
XP_010023544.1 PREDICTED: UPF0481 protein At3g47200 [Eucalyptus ...   236   2e-67
KCW60065.1 hypothetical protein EUGRSUZ_H02794, partial [Eucalyp...   232   2e-66
XP_010026753.1 PREDICTED: UPF0481 protein At3g47200 [Eucalyptus ...   232   3e-66

>XP_012844576.1 PREDICTED: UPF0481 protein At3g47200-like [Erythranthe guttata]
            EYU31471.1 hypothetical protein MIMGU_mgv1a022973mg
            [Erythranthe guttata]
          Length = 426

 Score =  290 bits (742), Expect = 3e-88
 Identities = 183/419 (43%), Positives = 248/419 (59%), Gaps = 9/419 (2%)
 Frame = +2

Query: 515  SICKVHRNLRSVKNDAYDPYIVSIGPYHHGNVNYKMMQDQKLRYLAELDQAVTNKCRD-- 688
            +I +VH++LR+V   AY+P +VSIGPYH    N  MM+D+KL YL  L Q       +  
Sbjct: 29   TIYRVHKHLRNVNMKAYEPEVVSIGPYHRDKDNLTMMEDKKLCYLHLLLQRKNESIENYV 88

Query: 689  -AIKKLSGQVKKSYDEPLITPGPGSSYDIDEMMLEDGVFIIQLVIK----NLLKPNDPIF 853
             AI+    + +K Y EP+      ++ +  EM++ DG FII LV K     L + ND IF
Sbjct: 89   AAIEPFEFEARKCYAEPISL----NAVEFIEMLVLDGCFIIDLVRKCNMIYLREKNDSIF 144

Query: 854  NSAWIMNSVQRDLLLLENQIPYFILETLFGLIEKPNK-TRLIQLLIDFFDSLIPEPSLGG 1030
               WI+NS+QRDL+L ENQ+P+FIL  LF LIE PN+ +RLI LL++FF+SL P      
Sbjct: 145  LMDWIINSLQRDLMLFENQVPFFILCKLFDLIEVPNQHSRLIYLLLNFFNSLFPGKVYRD 204

Query: 1031 EKLTNHQSASP-NHLVDLIHICWCPQVSGSIAIGINTNDRNQDMVEGKRKGRSHNRNQIP 1207
             K  +  SA    HL+DLIH  W P       +    N +++     K++ R  N     
Sbjct: 205  SK--DRSSAEVIKHLLDLIHSNWHPSFDW---LDFEKNKKSE-----KKRWRFVNN---- 250

Query: 1208 TATELTEAGVKLKNKTKGKNLFDITFQCGTLHIPPFSVDEGTECFLRNLVAYEQHYFGDN 1387
             ATEL E  VK K + +G +LFDI FQ G + + P ++++ TE   RNL+AYEQ YFGD+
Sbjct: 251  -ATELRETNVKFK-RIEGISLFDIRFQDGNMLLAPLTIEDRTESLFRNLMAYEQ-YFGDS 307

Query: 1388 RKTYVTDYVKLLDCLINSPKDVEILRVKGIAENWLGDNGGVANIFNTITDSVVLPSEDKF 1567
            + +YVTDYVK LDCLI+S +DV IL   GI +NWLGD+  VAN+FN +TDS+  P   +F
Sbjct: 308  QISYVTDYVKFLDCLIDSSRDVSILSWHGIIDNWLGDDEVVANMFNKLTDSIAGPG-IRF 366

Query: 1568 LYAEVFKKVNGHCESPYHSSMASLKRNYLNGAWGYISXXXXXXXXXXXXXQTWLSYLQV 1744
            +YA+VF+ VN HC    +  MA L+RNY+N  W  IS             QT  S LQV
Sbjct: 367  VYADVFEDVNKHCNRRRNKWMAKLRRNYMNSPWAVISILFAVVLLLLTVAQTVCSILQV 425


>ACH87183.1 unknown protein [Camellia sinensis]
          Length = 417

 Score =  284 bits (726), Expect = 5e-86
 Identities = 177/406 (43%), Positives = 231/406 (56%), Gaps = 12/406 (2%)
 Frame = +2

Query: 497  HQPIKSSICKVHRNLRSVKNDAYDPYIVSIGPYHHGNVNYKMMQDQKLRYLAELDQAVTN 676
            H P +  I ++H  LR + + AYDP I+SIGPYH G  N +MM+  KLRY   L Q    
Sbjct: 23   HPPTEFFIFRLHEELRQLNDKAYDPEIISIGPYHRGKQNLQMMERHKLRYFHSLLQEKNL 82

Query: 677  KCRD---AIKKLSGQVKKSYDEPLITPGPGSSYDIDEM---MLEDGVFIIQLVIKN---- 826
               D   AI  L       Y EP+       S D DEM   M+ DG FII+L+ K     
Sbjct: 83   SPEDFVYAIGSLELHACDFYAEPI-------SLDSDEMIKMMVLDGCFIIELLRKFDMEF 135

Query: 827  LLKPNDPIFNSAWIMNSVQRDLLLLENQIPYFILETLFGLIEKP-NKTRLIQLLIDFFDS 1003
            L   NDPIF   WI N +QRDL+L ENQIP+F+L  LF +IE P N  RLI L + FF  
Sbjct: 136  LRDENDPIFKRDWIFNRLQRDLMLFENQIPFFVLCKLFDMIEAPGNHKRLIYLALRFFSD 195

Query: 1004 LIPEPSLGGEKLTNHQSASPNHLVDLIHICWCPQVSGSIAIGINTNDRNQDMVEGKRKGR 1183
            L+P    G  +         +HL+ LIH  W P  +G   +        +D    K KG 
Sbjct: 196  LLP--GTGKREDGKESQGKISHLLGLIHSNWHPSFAGVEPV--------EDASNKKGKG- 244

Query: 1184 SHNRNQIPTATELTEAGVKL-KNKTKGKNLFDITFQCGTLHIPPFSVDEGTECFLRNLVA 1360
              N   IP+  EL E+GVK+ K +  G +LFDI F+ G + IPP +++  TE F RNL+A
Sbjct: 245  --NWRFIPSTRELQESGVKIEKFEVTGGSLFDIEFKNGVMQIPPLTIEGRTESFFRNLIA 302

Query: 1361 YEQHYFGDNRKTYVTDYVKLLDCLINSPKDVEILRVKGIAENWLGDNGGVANIFNTITDS 1540
            YEQ Y  DN+ +YV DYVK LD LI+SPKDV+IL  +GI +NWLGD+  V+N+FN I+D+
Sbjct: 303  YEQ-YSPDNQFSYVADYVKFLDFLIDSPKDVKILSRRGIIDNWLGDDEAVSNLFNKISDT 361

Query: 1541 VVLPSEDKFLYAEVFKKVNGHCESPYHSSMASLKRNYLNGAWGYIS 1678
            V   S   F YA++F +VN HC  P++   A+L RNY N  W  I+
Sbjct: 362  VSGTSM-HFRYADIFNRVNIHCSQPWNLYRATLNRNYFNNPWAMIA 406


>XP_011101987.1 PREDICTED: UPF0481 protein At3g47200-like [Sesamum indicum]
          Length = 447

 Score =  281 bits (718), Expect = 2e-84
 Identities = 181/435 (41%), Positives = 245/435 (56%), Gaps = 25/435 (5%)
 Frame = +2

Query: 515  SICKVHRNLRSVKNDAYDPYIVSIGPYHHGNVNYKMMQDQKLRYLAELDQAVTN---KCR 685
            +I +VH++LRSV + AY+P +++IGPYH    N KMM+D KLR L  L Q   +   K  
Sbjct: 27   TIYRVHKHLRSVNDKAYEPEVIAIGPYHRDKGNLKMMEDHKLRNLHLLLQRKNDDVEKYV 86

Query: 686  DAIKKLSGQVKKSYDEPLITPGPGSSYDIDEMMLEDGVFIIQLVIKN----LLKPNDPIF 853
             AI  L    +K Y E +      S+ D  EM++ DG FII L  K+    L + NDPIF
Sbjct: 87   SAIGPLEPLARKCYAETISL----SAADFIEMLVLDGCFIIDLARKSNMPHLREKNDPIF 142

Query: 854  NSAWIMNSVQRDLLLLENQIPYFILETLFGLIEKPNK-TRLIQLLIDFFDSLIPEPSLGG 1030
            +  WIMNS+QRDL+L ENQIP+F+L  LF LIE PN+ +RLI LL+ FFD+L P     G
Sbjct: 143  HMEWIMNSLQRDLMLFENQIPFFVLCKLFDLIEVPNQHSRLIYLLLSFFDNLYP-----G 197

Query: 1031 EKLTNHQSASPN---HLVDLIHICWCPQVSGSIAIGINTNDRNQDM----------VEGK 1171
            +     +S S +   HL+DLIH  W P       + ++ N  N+            + G 
Sbjct: 198  KVYVESRSWSSHEIKHLLDLIHRNWLPSFDW---LDVSKNGENRKKRWRFVNCGTCLGGA 254

Query: 1172 RKGRSHNRNQ----IPTATELTEAGVKLKNKTKGKNLFDITFQCGTLHIPPFSVDEGTEC 1339
               +  N       I  AT L EA VK   +     LF+I F+ GT+ + P +V++ T+ 
Sbjct: 255  NVSKGENSKTRWRFINCATWLREANVKFARRDD-VTLFNIRFKNGTMFLAPLTVEDRTDS 313

Query: 1340 FLRNLVAYEQHYFGDNRKTYVTDYVKLLDCLINSPKDVEILRVKGIAENWLGDNGGVANI 1519
            F RN++AYEQ YF D   ++VTDYVK LDCLI+S +DV+IL   GI +NWLGD+  VAN+
Sbjct: 314  FFRNVIAYEQ-YFQDTEFSFVTDYVKFLDCLIDSSRDVQILGRAGIIDNWLGDDEVVANM 372

Query: 1520 FNTITDSVVLPSEDKFLYAEVFKKVNGHCESPYHSSMASLKRNYLNGAWGYISXXXXXXX 1699
             N ++DSV  P    F+YA++F  VN HC    +  MA L+RNYLN  W  +S       
Sbjct: 373  INKLSDSVTGPG-SSFVYAKIFDSVNKHCNKRRNRWMAKLRRNYLNSPWAIMSIVVAVVL 431

Query: 1700 XXXXXXQTWLSYLQV 1744
                  QT  S LQV
Sbjct: 432  VLLTITQTVFSILQV 446


>XP_011086479.1 PREDICTED: UPF0481 protein At3g47200-like isoform X1 [Sesamum
            indicum]
          Length = 423

 Score =  256 bits (655), Expect = 2e-75
 Identities = 163/413 (39%), Positives = 225/413 (54%), Gaps = 6/413 (1%)
 Frame = +2

Query: 515  SICKVHRNLRSVKNDAYDPYIVSIGPYHHGNVNYKMMQDQKLRYLAELDQAVTNKCR--- 685
            SI +VH++LR V N AY+P I++IGPYH    + KMM++ KL YL  L +          
Sbjct: 28   SIYRVHKDLRDVNNKAYEPEIIAIGPYHRDKDHLKMMEEHKLWYLHLLLKRKKENVEAYI 87

Query: 686  DAIKKLSGQVKKSYDEPLITPGPGSSYDIDEMMLEDGVFIIQLVIKNLL--KPNDPIFNS 859
             A+ +L  + +  Y EP+      +S    +M++ D  FII+LV K+      +DPIF  
Sbjct: 88   SAMGELEQEARNCYAEPVSL----NSAKFIKMLVLDSCFIIELVRKDHEGDHEDDPIFEM 143

Query: 860  AWIMNSVQRDLLLLENQIPYFILETLFGLIEKPNK-TRLIQLLIDFFDSLIPEPSLGGEK 1036
             WIMNS+Q D++L ENQIP+FIL  LF +IE PN+   LI  L+ F+ +    P    + 
Sbjct: 144  GWIMNSLQPDIILFENQIPFFILCRLFDMIEGPNRHNMLIDRLLLFYQN---NPGGRLKN 200

Query: 1037 LTNHQSASPNHLVDLIHICWCPQVSGSIAIGINTNDRNQDMVEGKRKGRSHNRNQIPTAT 1216
             T        HL+DLIH         S+   +     N + V   R+ +      IP AT
Sbjct: 201  KTERSLQEIKHLLDLIH--------SSLIDSVEELYVNVEEVPKTREWQF-----IPPAT 247

Query: 1217 ELTEAGVKLKNKTKGKNLFDITFQCGTLHIPPFSVDEGTECFLRNLVAYEQHYFGDNRKT 1396
            ELT+A V  KN  +  N F + F+ G + IP F +D+  EC  RNL+A+EQ+     R  
Sbjct: 248  ELTDANVAFKN-VESDNFFKVDFRDGIMLIPHFIIDDSKECLFRNLIAHEQYSSTTTRPN 306

Query: 1397 YVTDYVKLLDCLINSPKDVEILRVKGIAENWLGDNGGVANIFNTITDSVVLPSEDKFLYA 1576
            +VTDYV+ +DCLINS KDVEIL   GI ENWLGDN  VANIFN +  SV L + D F Y 
Sbjct: 307  FVTDYVRFMDCLINSSKDVEILSECGIIENWLGDNEEVANIFNQLFTSVTL-TRDNFFYG 365

Query: 1577 EVFKKVNGHCESPYHSSMASLKRNYLNGAWGYISXXXXXXXXXXXXXQTWLSY 1735
             +  +VN H +  ++ +MA L+R+Y N  W Y S             QT LS+
Sbjct: 366  TICDRVNTHYKKRWNRAMALLRRDYFNSPWSYFSFSAALAILLLTVAQTVLSF 418


>EOY05383.1 Uncharacterized protein TCM_020393 isoform 1 [Theobroma cacao]
            EOY05384.1 Uncharacterized protein TCM_020393 isoform 1
            [Theobroma cacao]
          Length = 418

 Score =  254 bits (649), Expect = 1e-74
 Identities = 155/401 (38%), Positives = 228/401 (56%), Gaps = 9/401 (2%)
 Frame = +2

Query: 503  PIKSSIC--KVHRNLRSVKNDAYDPYIVSIGPYHHGNVNYKMMQDQKLRYLA----ELDQ 664
            PI S  C  KV   LR V   AY+P +V+IGP+H G  + K M+++K+R+L     E  +
Sbjct: 22   PISSDCCIFKVPNYLRKVNEKAYEPEVVAIGPFHRGKDHLKPMEERKIRFLQLILQERGE 81

Query: 665  AVTNKCRDAIKKLSGQVKKSYDEPLITPGPGSSYDIDEMMLEDGVFIIQLVIKNLLKPN- 841
                K    +++L  + +K Y EP+     G      EMML DG  IIQL+ K+    + 
Sbjct: 82   NDITKYVVVMRELEERARKCYAEPVSLDSDG----FVEMMLLDGCLIIQLIRKSARTTSI 137

Query: 842  -DPIFNSAWIMNSVQRDLLLLENQIPYFILETLFGLIEKPNKTRLIQLLIDFFDSLIPEP 1018
             DPIF  +     + RD+LL+ENQ+P F+L  LF +I  P + R I  +I FF  ++P  
Sbjct: 138  DDPIFKMSGFHGILCRDMLLIENQLPLFVLWELFCVIAVPREDRFIDDIIKFFTVVLPGK 197

Query: 1019 SLGGEKLTNHQSASPN-HLVDLIHICWCPQVSGSIAIGINTNDRNQDMVEGKRKGRSHNR 1195
                + L   +S + N HL+ LI+ CW P                    E + K ++   
Sbjct: 198  GCIRKSL---RSITENKHLLGLIYDCWHPSA-----------------FEMEVKTKTIEC 237

Query: 1196 NQIPTATELTEAGVKLKNKTKGKNLFDITFQCGTLHIPPFSVDEGTECFLRNLVAYEQHY 1375
            + +  ATEL EAG++ K K +G+++FDI F+ GT+ IP   +D+ TE FLRN++AYEQ +
Sbjct: 238  SFMHCATELKEAGIRFK-KVEGRSIFDIKFENGTMKIPTLEIDDDTEWFLRNVIAYEQ-F 295

Query: 1376 FGDNRKTYVTDYVKLLDCLINSPKDVEILRVKGIAENWLGDNGGVANIFNTITDSVVLPS 1555
            F  +   +VTDY+  +DCLINS KDVEILR +GI +NWLGD+  +A +FN + DSV +P+
Sbjct: 296  FSGSSLNHVTDYMNFMDCLINSRKDVEILRQRGIVKNWLGDDEVIATMFNRLGDSVTIPA 355

Query: 1556 EDKFLYAEVFKKVNGHCESPYHSSMASLKRNYLNGAWGYIS 1678
                LY+EVF  VN +C   ++   A+LK NY N  W ++S
Sbjct: 356  FS--LYSEVFNNVNMYCSGRWNKRFANLKHNYFNSPWAFLS 394


>XP_017610281.1 PREDICTED: UPF0481 protein At3g47200-like [Gossypium arboreum]
          Length = 420

 Score =  254 bits (649), Expect = 1e-74
 Identities = 155/420 (36%), Positives = 228/420 (54%), Gaps = 8/420 (1%)
 Frame = +2

Query: 503  PIKSSIC--KVHRNLRSVKNDAYDPYIVSIGPYHHGNVNYKMMQDQKLRYLA----ELDQ 664
            PI S+ C  +V   LR V   AY+P +V+IGPYHHG  + K M++ K R+L     E+ +
Sbjct: 22   PISSNCCIFRVPNYLRKVNEKAYEPEVVAIGPYHHGKHHLKPMEEHKFRFLGMLLNEMKE 81

Query: 665  AVTNKCRDAIKKLSGQVKKSYDEPLITPGPGSSYDIDEMMLEDGVFIIQLVIKNLLKP-- 838
             VTN     +++   + +K Y E L       + D  EMML D  FIIQL+ K  +    
Sbjct: 82   DVTNYVM-VMRESEDRARKCYAEQLDL----DTDDFVEMMLIDACFIIQLIRKFAMTTVM 136

Query: 839  NDPIFNSAWIMNSVQRDLLLLENQIPYFILETLFGLIEKPNKTRLIQLLIDFFDSLIPEP 1018
            +DP F        + RDLLL+ENQ+P+FIL  LF +IE PN    + + ++FF  ++P  
Sbjct: 137  DDPFFKIGGFHGILCRDLLLVENQLPFFILWELFCMIEIPNPDIFLYITMNFFGIILPGK 196

Query: 1019 SLGGEKLTNHQSASPNHLVDLIHICWCPQVSGSIAIGINTNDRNQDMVEGKRKGRSHNRN 1198
                + L +       HL+ L++ CW P  S              +MV  ++K +    +
Sbjct: 197  GCTRDSLKSIMEIK--HLLGLVNDCWQPSES--------------EMVAYRKKTKPIEWS 240

Query: 1199 QIPTATELTEAGVKLKNKTKGKNLFDITFQCGTLHIPPFSVDEGTECFLRNLVAYEQHYF 1378
             +  ATEL E G++ + K  G ++FDI F+ GT+ IP   +D+ TECF RN++A+EQ +F
Sbjct: 241  FMHCATELQEDGIRFE-KADGSSIFDIKFENGTMKIPTLEIDDHTECFFRNVIAFEQ-FF 298

Query: 1379 GDNRKTYVTDYVKLLDCLINSPKDVEILRVKGIAENWLGDNGGVANIFNTITDSVVLPSE 1558
                  +VTDY+  +DCLINSPKDVE+LR +GI  NWLG++  +A +FN + DSV +   
Sbjct: 299  PGRSLNHVTDYMNFMDCLINSPKDVELLRRRGIINNWLGNDEVIATMFNRLGDSVSISRY 358

Query: 1559 DKFLYAEVFKKVNGHCESPYHSSMASLKRNYLNGAWGYISXXXXXXXXXXXXXQTWLSYL 1738
                Y+EVF  VNG+C   ++  +A+LK NY N  W  +S             QT  S L
Sbjct: 359  S--FYSEVFSNVNGYCSKQWNKWIANLKHNYFNSPWALVSVLAAVLLLLLTMVQTIFSVL 416


>XP_016672368.1 PREDICTED: UPF0481 protein At3g47200-like isoform X1 [Gossypium
            hirsutum] XP_016672369.1 PREDICTED: UPF0481 protein
            At3g47200-like isoform X1 [Gossypium hirsutum]
            XP_016672370.1 PREDICTED: UPF0481 protein At3g47200-like
            isoform X1 [Gossypium hirsutum]
          Length = 420

 Score =  253 bits (646), Expect = 3e-74
 Identities = 158/420 (37%), Positives = 226/420 (53%), Gaps = 8/420 (1%)
 Frame = +2

Query: 503  PIKSSIC--KVHRNLRSVKNDAYDPYIVSIGPYHHGNVNYKMMQDQKLRYLA----ELDQ 664
            PI S+ C  KV   LR V   AY+P +V+IGPYHHG  + K M++ K R+L     E+ Q
Sbjct: 22   PISSNCCIFKVPNYLRKVNEKAYEPEVVAIGPYHHGKHHLKPMEEHKFRFLRMLLNEMKQ 81

Query: 665  AVTNKCRDAIKKLSGQVKKSYDEPLITPGPGSSYDIDEMMLEDGVFIIQLVIKNLLKP-- 838
             VTN     ++ L  + +K Y E +       + D  EMML D  FIIQL+ K  +    
Sbjct: 82   DVTNYVM-VMRGLEDRARKCYAEQIGL----DTNDFVEMMLIDACFIIQLIRKFAMTTVM 136

Query: 839  NDPIFNSAWIMNSVQRDLLLLENQIPYFILETLFGLIEKPNKTRLIQLLIDFFDSLIPEP 1018
            +DP F      N + RDLLL+ENQ P FIL  LFG+IE PN      + ++FF  ++P  
Sbjct: 137  DDPFFKIGGFHNLLCRDLLLVENQFPLFILWKLFGMIEIPNPDIFRYITMNFFAIILPGK 196

Query: 1019 SLGGEKLTNHQSASPNHLVDLIHICWCPQVSGSIAIGINTNDRNQDMVEGKRKGRSHNRN 1198
                + L +       HL+ L++ CW P  S              +MV  ++K +    +
Sbjct: 197  GCNRDSLKSIMEIK--HLLGLVNDCWQPSES--------------EMVAYRKKTKPIEWS 240

Query: 1199 QIPTATELTEAGVKLKNKTKGKNLFDITFQCGTLHIPPFSVDEGTECFLRNLVAYEQHYF 1378
             +  ATEL E G++ + K  G ++FDI F+ GT+ IP   +D+ TECF RN++A+EQ +F
Sbjct: 241  FMHCATELQEDGIRFE-KADGSSIFDIKFENGTMKIPKLKIDDHTECFFRNVIAFEQ-FF 298

Query: 1379 GDNRKTYVTDYVKLLDCLINSPKDVEILRVKGIAENWLGDNGGVANIFNTITDSVVLPSE 1558
                  +VTDY+  +DCLINS KDVE+LR +GI +NWLG++  VA +FN + DSV +   
Sbjct: 299  PGRSLNHVTDYMNFMDCLINSSKDVELLRRRGIIKNWLGNDEVVATMFNRLGDSVSISRY 358

Query: 1559 DKFLYAEVFKKVNGHCESPYHSSMASLKRNYLNGAWGYISXXXXXXXXXXXXXQTWLSYL 1738
                Y+EVF  VN +C   ++  +A+LK NY N  W  +S             QT  S L
Sbjct: 359  S--FYSEVFSNVNRYCSKQWNKWIANLKHNYFNSPWALVSVLAAVLLLLLTMVQTIFSVL 416


>EYU24251.1 hypothetical protein MIMGU_mgv1a019478mg [Erythranthe guttata]
          Length = 350

 Score =  250 bits (639), Expect = 4e-74
 Identities = 165/415 (39%), Positives = 218/415 (52%), Gaps = 5/415 (1%)
 Frame = +2

Query: 515  SICKVHRNLRSVKNDAYDPYIVSIGPYHHGNVNYKMMQDQKLRYLAE-LDQAVTNKCR-- 685
            +I +VH++LR+V   AY+P +++IGPYH    N +MM+D KL YL   L++   N     
Sbjct: 7    TIYRVHKHLRNVNMKAYEPEVIAIGPYHRSKDNLQMMEDHKLCYLRLILERKEINNIETY 66

Query: 686  -DAIKKLSGQVKKSYDEPLITPGPGSSYDIDEMMLEDGVFIIQLVIKNLLKPNDPIFNSA 862
              AI+ L  + +K Y EP+    P    D  EM++ DG FII L                
Sbjct: 67   VSAIEPLEEEARKCYAEPITLNSP----DFLEMLILDGCFIIDL---------------- 106

Query: 863  WIMNSVQRDLLLLENQIPYFILETLFGLIEKPNK-TRLIQLLIDFFDSLIPEPSLGGEKL 1039
                   RDL+L ENQ+P+FIL  LF LIE PN+ +RLI LL+ FF+S+ P         
Sbjct: 107  -------RDLILFENQVPFFILCKLFDLIEVPNEHSRLIYLLLIFFNSIFPG-------- 151

Query: 1040 TNHQSASPNHLVDLIHICWCPQVSGSIAIGINTNDRNQDMVEGKRKGRSHNRNQIPTATE 1219
                                          IN N+  +D    +R+ R  N      ATE
Sbjct: 152  -----------------------------NINRNNFREDEDNERRRWRFINN-----ATE 177

Query: 1220 LTEAGVKLKNKTKGKNLFDITFQCGTLHIPPFSVDEGTECFLRNLVAYEQHYFGDNRKTY 1399
            L EA VK K +T+G +LFD+ F+ G + + P ++++ TE   RNLVAYEQ YFGD +  Y
Sbjct: 178  LREANVKFK-RTEGVSLFDLRFEDGNMLLSPLTIEDRTESLFRNLVAYEQ-YFGDGQTNY 235

Query: 1400 VTDYVKLLDCLINSPKDVEILRVKGIAENWLGDNGGVANIFNTITDSVVLPSEDKFLYAE 1579
            VTDYVK LDCLI+S +DV IL   GI +NWLGD+  VAN+FN +TDSV  P    F+YA 
Sbjct: 236  VTDYVKFLDCLIDSSRDVAILSRHGIIDNWLGDDEVVANMFNKLTDSVAGPG-THFVYAN 294

Query: 1580 VFKKVNGHCESPYHSSMASLKRNYLNGAWGYISXXXXXXXXXXXXXQTWLSYLQV 1744
            +F  VN HC    +  MA L+RNYLN  W  IS             QT  S LQV
Sbjct: 295  IFHVVNKHCNGRRNRWMAKLRRNYLNSPWAVISVLFAVLLLLLTVTQTVCSILQV 349


>XP_012484204.1 PREDICTED: UPF0481 protein At3g47200-like isoform X1 [Gossypium
            raimondii] XP_012484205.1 PREDICTED: UPF0481 protein
            At3g47200-like isoform X1 [Gossypium raimondii]
            XP_012484206.1 PREDICTED: UPF0481 protein At3g47200-like
            isoform X1 [Gossypium raimondii] KJB38844.1 hypothetical
            protein B456_006G055900 [Gossypium raimondii] KJB38845.1
            hypothetical protein B456_006G055900 [Gossypium
            raimondii]
          Length = 420

 Score =  252 bits (643), Expect = 8e-74
 Identities = 158/420 (37%), Positives = 227/420 (54%), Gaps = 8/420 (1%)
 Frame = +2

Query: 503  PIKSSIC--KVHRNLRSVKNDAYDPYIVSIGPYHHGNVNYKMMQDQKLRYLA----ELDQ 664
            PI S+ C  KV   LR V   AY+P +VSIGPYHHG  + K M++ K R+L     E+ +
Sbjct: 22   PISSNCCIFKVPNYLRKVNEKAYEPDVVSIGPYHHGKHHLKPMEEHKFRFLRMLLNEMKE 81

Query: 665  AVTNKCRDAIKKLSGQVKKSYDEPLITPGPGSSYDIDEMMLEDGVFIIQLVIKNLLKP-- 838
             VTN     ++ L  + +K Y E +       + D  EMML D  FIIQL+ K  +    
Sbjct: 82   DVTNYVM-VMRGLEDRARKCYAEQIGL----DTDDFIEMMLIDACFIIQLIRKFAMTTVM 136

Query: 839  NDPIFNSAWIMNSVQRDLLLLENQIPYFILETLFGLIEKPNKTRLIQLLIDFFDSLIPEP 1018
            +DP F      N + RDLLL+ENQ+P FIL  LFG+IE PN      + ++FF  ++P  
Sbjct: 137  DDPFFKIDGFHNLLCRDLLLVENQLPLFILWKLFGMIEIPNPDIFRYITMNFFAIILPGK 196

Query: 1019 SLGGEKLTNHQSASPNHLVDLIHICWCPQVSGSIAIGINTNDRNQDMVEGKRKGRSHNRN 1198
                + L +     P  L+ L++ CW P  S              +MV  ++K +    +
Sbjct: 197  GCNRDSLKSIMEIKP--LLGLVNECWQPSES--------------EMVAYRKKTKPIEWS 240

Query: 1199 QIPTATELTEAGVKLKNKTKGKNLFDITFQCGTLHIPPFSVDEGTECFLRNLVAYEQHYF 1378
             +  ATEL E G++ + K  G ++FDI F+ GT+ IP   +D+ TECF RN++A+EQ +F
Sbjct: 241  FMHCATELQEDGIRFE-KADGSSIFDIKFENGTMKIPKLKIDDHTECFFRNVIAFEQ-FF 298

Query: 1379 GDNRKTYVTDYVKLLDCLINSPKDVEILRVKGIAENWLGDNGGVANIFNTITDSVVLPSE 1558
                  +VTDY+  +DCLINS KDVE+LR +GI +NWLG++  VA +FN + DSV +   
Sbjct: 299  PGRSLNHVTDYMNFMDCLINSSKDVELLRRRGIIKNWLGNDEVVATMFNRLGDSVSISRY 358

Query: 1559 DKFLYAEVFKKVNGHCESPYHSSMASLKRNYLNGAWGYISXXXXXXXXXXXXXQTWLSYL 1738
                Y+EVF  VN +C   ++  +A+LK NY N  W  +S             QT  S L
Sbjct: 359  S--FYSEVFSNVNRYCSKQWNKWIANLKHNYFNSPWALVSVLAAVLLLLLTMVQTIFSVL 416


>OMO53035.1 hypothetical protein CCACVL1_28930 [Corchorus capsularis]
          Length = 421

 Score =  252 bits (643), Expect = 9e-74
 Identities = 155/426 (36%), Positives = 232/426 (54%), Gaps = 14/426 (3%)
 Frame = +2

Query: 503  PIKSSIC--KVHRNLRSVKNDAYDPYIVSIGPYHHGNVNYKMMQDQKLRYLA-------E 655
            PI S  C  KV   LR V   AY+P +V+IGPYHHG  + + M++ KLR+L        E
Sbjct: 22   PISSDCCIFKVPNYLRKVNEKAYEPEVVAIGPYHHGKDHLEPMEEHKLRFLQLLLQERRE 81

Query: 656  LDQAVTNKCRDAIKKLSGQVKKSYDEPLITPGPGSSYDIDEMMLEDGVFIIQLVIKNLLK 835
            +D  +  K    ++KL G+V++ Y EP+       + D  EMML DG  I+QL+ K  + 
Sbjct: 82   IDVTMYVK---VMRKLGGRVRRCYAEPISL----DTDDFVEMMLLDGCLIVQLIRKFAMT 134

Query: 836  P--NDPIFNSAWIMNSVQRDLLLLENQIPYFILETLFGLIEKPNKTRLIQLLIDFFDSLI 1009
               +DP+F        + RD+LL+ENQ+P F++  LF +IE  N+   I  +I+FF  ++
Sbjct: 135  TLNDDPVFKMGGFHGILCRDMLLVENQLPLFVVWELFCMIENSNQDIFIYSVINFFTIIL 194

Query: 1010 PEPSLGGEKLTNHQSASP---NHLVDLIHICWCPQVSGSIAIGINTNDRNQDMVEGKRKG 1180
            P     G+    H   S     HL+ L++ CW P                 +M   +++ 
Sbjct: 195  P-----GKGCIRHNLKSILEIKHLLGLVNDCWHPSAL--------------EMEAYRKET 235

Query: 1181 RSHNRNQIPTATELTEAGVKLKNKTKGKNLFDITFQCGTLHIPPFSVDEGTECFLRNLVA 1360
            +  + + +  ATEL EAG++ + K +G ++FD  F+ GT+ IP   +D+ TECFLRNL+A
Sbjct: 236  KKIDWSFMHCATELQEAGIRFR-KAEGSSIFDFKFEDGTMKIPTLEIDDHTECFLRNLIA 294

Query: 1361 YEQHYFGDNRKTYVTDYVKLLDCLINSPKDVEILRVKGIAENWLGDNGGVANIFNTITDS 1540
            +EQ +F      +VTDY+  +DCLINS KDVE+LR +GI  NWLG++  +A +FNT+ DS
Sbjct: 295  FEQ-FFPGRSLNHVTDYMNFMDCLINSTKDVELLRQRGIINNWLGNDEVIATMFNTLGDS 353

Query: 1541 VVLPSEDKFLYAEVFKKVNGHCESPYHSSMASLKRNYLNGAWGYISXXXXXXXXXXXXXQ 1720
            V +       Y+EVF  VN +C   ++  +A+LK NY N  W  +S             Q
Sbjct: 354  VSISRYS--FYSEVFNNVNIYCSRRWNKWIANLKHNYFNSPWALVSILAAVVLLQLTLLQ 411

Query: 1721 TWLSYL 1738
            T  S L
Sbjct: 412  TVFSIL 417


>XP_016703022.1 PREDICTED: UPF0481 protein At3g47200-like isoform X1 [Gossypium
            hirsutum]
          Length = 420

 Score =  247 bits (631), Expect = 5e-72
 Identities = 152/420 (36%), Positives = 226/420 (53%), Gaps = 8/420 (1%)
 Frame = +2

Query: 503  PIKSSIC--KVHRNLRSVKNDAYDPYIVSIGPYHHGNVNYKMMQDQKLRYLA----ELDQ 664
            PI S+ C  KV   LR V   AY+P +V+IGPYH G  + K M++ K R+L     E+ +
Sbjct: 22   PISSNCCIFKVPNYLRKVNEKAYEPEVVAIGPYHQGKHHLKPMEEHKFRFLGMLLNEMKE 81

Query: 665  AVTNKCRDAIKKLSGQVKKSYDEPLITPGPGSSYDIDEMMLEDGVFIIQLVIKNLLKP-- 838
             VTN     +++   + +K Y E L       + D  EMML D  FIIQL+ K  +    
Sbjct: 82   DVTNYVM-VMRESEDRARKCYAEQLDL----DTDDFVEMMLIDACFIIQLIRKFAMTTVM 136

Query: 839  NDPIFNSAWIMNSVQRDLLLLENQIPYFILETLFGLIEKPNKTRLIQLLIDFFDSLIPEP 1018
            +DP F        + RDLLL+ENQ+P+FIL  LF ++E PN    + + ++FF  ++P  
Sbjct: 137  DDPFFKIGGFHGILCRDLLLVENQLPFFILWELFCMVEIPNPDIFLYITMNFFGIILPGK 196

Query: 1019 SLGGEKLTNHQSASPNHLVDLIHICWCPQVSGSIAIGINTNDRNQDMVEGKRKGRSHNRN 1198
                + L +       HL+ L++ CW P  S              +MV  ++K +    +
Sbjct: 197  GCTRDSLKSIMEIK--HLLGLVNDCWQPSES--------------EMVAYRKKTKPIEWS 240

Query: 1199 QIPTATELTEAGVKLKNKTKGKNLFDITFQCGTLHIPPFSVDEGTECFLRNLVAYEQHYF 1378
             +  ATEL E G++ + K  G ++FDI F+ GT+ IP   +D+ TECF RN++A+EQ +F
Sbjct: 241  FMHCATELQEDGIRFE-KADGSSIFDIKFENGTMKIPTLEIDDHTECFFRNVIAFEQ-FF 298

Query: 1379 GDNRKTYVTDYVKLLDCLINSPKDVEILRVKGIAENWLGDNGGVANIFNTITDSVVLPSE 1558
                  +VTDY+  +DCLINSPKDVE+L+ +GI  NWLG++  +A +FN + DSV +   
Sbjct: 299  PGRSLNHVTDYMNFMDCLINSPKDVELLQRRGIINNWLGNDEVIATMFNRLGDSVSISRY 358

Query: 1559 DKFLYAEVFKKVNGHCESPYHSSMASLKRNYLNGAWGYISXXXXXXXXXXXXXQTWLSYL 1738
                Y+EVF  VN +C   ++  +A+LK NY N  W  +S             QT  S L
Sbjct: 359  S--FYSEVFSNVNRYCSKQWNKWIANLKHNYFNSPWALVSVLAAVLLLLLTMVQTIFSVL 416


>XP_007034456.2 PREDICTED: UPF0481 protein At3g47200 [Theobroma cacao] XP_017974424.1
            PREDICTED: UPF0481 protein At3g47200 [Theobroma cacao]
          Length = 425

 Score =  240 bits (613), Expect = 2e-69
 Identities = 152/416 (36%), Positives = 221/416 (53%), Gaps = 9/416 (2%)
 Frame = +2

Query: 518  ICKVHRNLRSVKNDAYDPYIVSIGPYHHGNVNYKMMQDQKLRY----LAELDQAVTNKCR 685
            I +V   LR     AY+P +++IGPYHH   + K M++ K+RY    L E  +   ++  
Sbjct: 29   IARVPNYLRKANEQAYEPELIAIGPYHHAKPHLKAMEEHKIRYFQLLLQERRENDVSRYV 88

Query: 686  DAIKKLSGQVKKSYDEPLITPGPGSSYDIDEMMLEDGVFIIQLVIK----NLLKPNDPIF 853
              I+ L  Q +K Y +P        S D  +M+L DG FI+QL+ K     L   +DPIF
Sbjct: 89   MIIRSLEEQARKCYSDPFAL----ESDDFVKMLLLDGCFIVQLIRKFSEIRLRDESDPIF 144

Query: 854  NSAWIMNSVQRDLLLLENQIPYFILETLFGLIEKPNKTRLIQLLIDFFDSLIPEPSLGGE 1033
                +  +++RD LL+ENQ+P F+L  L+ +IE P++   + ++  FF  ++P       
Sbjct: 145  KLVSLRGTIRRDTLLVENQLPLFVLWELYAMIEYPDQRTFMAIVFSFFCHILPGEGWPQN 204

Query: 1034 KLTNHQSASPNHLVDLIHICWCPQVSGSIAIGINTNDRNQDMVEGKRKGRSHNRNQIPTA 1213
             L + +    NHLVDL+H CW P                 ++   +   ++   N I   
Sbjct: 205  SLNSIRVI--NHLVDLVHECWHPSPL--------------ELKAYQNLNKNVPWNFIHCV 248

Query: 1214 TELTEAGVKLKNKTKGKNLFDITFQCGTLHIPPFSVDEGTECFLRNLVAYEQHYFGDNRK 1393
            TEL EAG+K + K +G +LFD+ F+ GT+ IP   + +  E  LRNL+A+EQ  F  +R 
Sbjct: 249  TELKEAGIKFQMK-RGNSLFDLKFENGTMKIPTLRIYDRLEGTLRNLIAFEQ--FSSHRG 305

Query: 1394 -TYVTDYVKLLDCLINSPKDVEILRVKGIAENWLGDNGGVANIFNTITDSVVLPSEDKFL 1570
              +VTDYV L  CL+NS KDVEILR  GI EN LGD+  VA + N +  SV   S D F 
Sbjct: 306  LNHVTDYVLLFHCLVNSTKDVEILRQSGIIENMLGDDEEVARMLNRLGVSVFF-SPDNFY 364

Query: 1571 YAEVFKKVNGHCESPYHSSMASLKRNYLNGAWGYISXXXXXXXXXXXXXQTWLSYL 1738
            Y+E+F KVN +C+  ++  +A+LK NYLN  W  IS             QT  S L
Sbjct: 365  YSELFNKVNKYCDRRWNKWIANLKHNYLNSPWALISFLAAVVLLLLTLVQTVFSVL 420


>XP_015388586.1 PREDICTED: UPF0481 protein At3g47200-like [Citrus sinensis]
            XP_015388587.1 PREDICTED: UPF0481 protein At3g47200-like
            [Citrus sinensis]
          Length = 430

 Score =  239 bits (611), Expect = 5e-69
 Identities = 150/410 (36%), Positives = 217/410 (52%), Gaps = 18/410 (4%)
 Frame = +2

Query: 503  PIKSSICKVHRNLRSVKNDAYDPYIVSIGPYHHGNVNYKMMQDQKLRYLAEL--DQAVTN 676
            P + SI +V   LR +   AY+P +++IGPYHHG  +    ++ K RYL  L   +++  
Sbjct: 25   PSQFSIFRVPNQLRKINATAYEPEMLAIGPYHHGKDHLMAFEEHKTRYLQNLLHRRSLNR 84

Query: 677  KCRD---AIKKLSGQVKKSYDEPLITPGPGSSYDID-----EMMLEDGVFIIQLVIKNLL 832
               D    ++ L  + +K Y         G S  +D     EMML DG FI++++ KNL 
Sbjct: 85   SLSDYVTTLRALEEKARKCY---------GGSISLDKEEFVEMMLLDGCFIVEIIRKNLR 135

Query: 833  KP----NDPIFNSAWIMNSVQRDLLLLENQIPYFILETLFGLIEKPNKTR----LIQLLI 988
            +     NDPIF   W++  + RD+ L+ENQ+P+F+L  LF + E  N  +       +++
Sbjct: 136  QESREDNDPIFKLGWMLPFIARDMFLVENQLPFFVLWELFSMTEVTNNNQTNYSFFYMIL 195

Query: 989  DFFDSLIPEPSLGGEKLTNHQSASPNHLVDLIHICWCPQVSGSIAIGINTNDRNQDMVEG 1168
             FF  ++P    G  ++  +      HLV  IH  W P  +G  A  IN +  ++     
Sbjct: 196  YFFYGILP--GKGYPRVDVYPIEEIKHLVGFIHNNWLPSPTGIDAFKINASKNSEWRF-- 251

Query: 1169 KRKGRSHNRNQIPTATELTEAGVKLKNKTKGKNLFDITFQCGTLHIPPFSVDEGTECFLR 1348
                       I  ATE+ EAGVK +    G  LFDI F+ G + IP  ++ + TE  LR
Sbjct: 252  -----------ICCATEIQEAGVKFQKVEDGL-LFDIKFENGVMKIPTLAIGDTTEAVLR 299

Query: 1349 NLVAYEQHYFGDNRKTYVTDYVKLLDCLINSPKDVEILRVKGIAENWLGDNGGVANIFNT 1528
            NL+AYEQ     N K ++ DYVK LDCLINS KD E+LR  GI +NWLGD+  +A + + 
Sbjct: 300  NLIAYEQFSHDQNPK-HILDYVKFLDCLINSSKDAELLRRCGIIDNWLGDDEVIAGLISR 358

Query: 1529 ITDSVVLPSEDKFLYAEVFKKVNGHCESPYHSSMASLKRNYLNGAWGYIS 1678
            + D+VVL   D+F Y+EVF KVN HC    +   A L+ NY N  W  IS
Sbjct: 359  LGDAVVL--SDQFYYSEVFNKVNLHCSRRVNKWKAKLRHNYFNTPWAIIS 406


>KCW60073.1 hypothetical protein EUGRSUZ_H02803, partial [Eucalyptus grandis]
          Length = 395

 Score =  238 bits (606), Expect = 1e-68
 Identities = 149/397 (37%), Positives = 219/397 (55%), Gaps = 9/397 (2%)
 Frame = +2

Query: 515  SICKVHRNLRSVKNDAYDPYIVSIGPYHHGNVNYKMMQDQKLRYLAELDQAVTNKCRD-- 688
            SI +V   LR V N AY+P I+ +GP+H+GN  +K M++QK+RY+ +L Q    +  D  
Sbjct: 2    SIFRVRPQLRRVNNKAYEPEILVVGPHHYGNDKFKSMEEQKMRYVQQLLQRRKEESIDRY 61

Query: 689  --AIKKLSGQVKKSYDEPLITPGPGSSYDIDEMMLEDGVFIIQLVIK----NLLKPNDPI 850
               +++L   V+  Y E +      S      MM  DG FI++L  K     L   + P+
Sbjct: 62   MPTLRELEQLVRNCYAETINL----SQEKFLAMMFIDGCFIVELFRKYNMEKLRNKDGPL 117

Query: 851  FNSAWIMNSVQRDLLLLENQIPYFILETLFGLIEKPNKTR-LIQLLIDFFDSLIPEPSLG 1027
              + WI   +QRDLLLLENQ+P F L  L+ L ++P++ R LI +   +FD  + +    
Sbjct: 118  MEADWIRYCLQRDLLLLENQLPLFFLNKLYDLTKRPDEPRELIDIATTYFDFKLGD---S 174

Query: 1028 GEKLTNHQSASPNHLVDLIHICWCPQVSGSIAIGINTNDRNQDMVEGKRKGRSHNRNQIP 1207
            G+  T  +S    HL+ L+H CW   +             N   + G+          + 
Sbjct: 175  GQCPTLRES---KHLLHLMHTCWTSGLP------------NVPRLSGRAPPTKEKLMFMS 219

Query: 1208 TATELTEAGVKLKNKTKGKNLFDITFQCGTLHIPPFSVDEGTECFLRNLVAYEQHYFGDN 1387
            +ATEL E+GVKL+   +G+++ DI F+ G L IP   V + TE   RNL+AYEQH  G  
Sbjct: 220  SATELRESGVKLR-AVRGRHMKDIRFENGKLEIPVLIVQDHTESQFRNLIAYEQHRQGGG 278

Query: 1388 RKTYVTDYVKLLDCLINSPKDVEILRVKGIAENWLGDNGGVANIFNTITDSVVLPSEDKF 1567
              +Y TDYV L+DCLINS KDVE+LR  GI +N+LGD+  +A +FN + D V LP+   F
Sbjct: 279  I-SYFTDYVTLMDCLINSSKDVEVLRRAGIIKNYLGDDEVIAQMFNRMGDYVTLPN---F 334

Query: 1568 LYAEVFKKVNGHCESPYHSSMASLKRNYLNGAWGYIS 1678
             Y+E+FK VN +C    +  MA L+R Y +  W ++S
Sbjct: 335  YYSEIFKTVNAYCNKRRNVWMAKLRREYFHSPWAFLS 371


>KDO36306.1 hypothetical protein CISIN_1g014530mg [Citrus sinensis]
          Length = 423

 Score =  238 bits (606), Expect = 2e-68
 Identities = 149/406 (36%), Positives = 214/406 (52%), Gaps = 18/406 (4%)
 Frame = +2

Query: 515  SICKVHRNLRSVKNDAYDPYIVSIGPYHHGNVNYKMMQDQKLRYLAEL--DQAVTNKCRD 688
            SI +V   LR +   AY+P +++IGPYHHG  +    ++ K RYL  L   +++     D
Sbjct: 17   SIFRVPNQLRKINATAYEPEMLAIGPYHHGKDHLMAFEEHKTRYLQNLLHRRSLNRSLSD 76

Query: 689  ---AIKKLSGQVKKSYDEPLITPGPGSSYDID-----EMMLEDGVFIIQLVIKNLLKP-- 838
                ++ L  + +K Y         G S  +D     EMML DG FI++++ KNL +   
Sbjct: 77   YVTTLRALEEKARKCY---------GGSISLDKEEFVEMMLLDGCFIVEIIRKNLRQESR 127

Query: 839  --NDPIFNSAWIMNSVQRDLLLLENQIPYFILETLFGLIEKPNKTR----LIQLLIDFFD 1000
              NDPIF   W++  + RD+ L+ENQ+P+F+L  LF + E  N  +       +++ FF 
Sbjct: 128  EDNDPIFKLGWMLPFIARDMFLVENQLPFFVLWELFSMTEVTNNNQTNYSFFYMILYFFY 187

Query: 1001 SLIPEPSLGGEKLTNHQSASPNHLVDLIHICWCPQVSGSIAIGINTNDRNQDMVEGKRKG 1180
             ++P    G  ++  +      HLV  IH  W P  +G  A  IN +  ++         
Sbjct: 188  GILP--GKGYPRVDVYPIEEIKHLVGFIHNNWLPSPTGIDAFKINASKNSEWKF------ 239

Query: 1181 RSHNRNQIPTATELTEAGVKLKNKTKGKNLFDITFQCGTLHIPPFSVDEGTECFLRNLVA 1360
                   I  ATE+ EAGVK +    G  LFDI F  G + IP  ++ + TE  LRNL+A
Sbjct: 240  -------ICCATEIQEAGVKFQKVEDGL-LFDIKFDNGVMKIPTLAIGDTTEAVLRNLIA 291

Query: 1361 YEQHYFGDNRKTYVTDYVKLLDCLINSPKDVEILRVKGIAENWLGDNGGVANIFNTITDS 1540
            YEQ     N K ++ DYVK LDCLINS KD E+LR  GI +NWLGD+  +A + + + D+
Sbjct: 292  YEQFSHDQNSK-HILDYVKFLDCLINSSKDAELLRRCGIIDNWLGDDEVIAGLISRLGDA 350

Query: 1541 VVLPSEDKFLYAEVFKKVNGHCESPYHSSMASLKRNYLNGAWGYIS 1678
            VVL   D+F Y+EVF KVN HC    +   A L+ NY N  W  IS
Sbjct: 351  VVL--SDQFYYSEVFNKVNLHCSRRVNKWKAKLRHNYFNTPWAIIS 394


>XP_010023720.1 PREDICTED: UPF0481 protein At3g47200 isoform X5 [Eucalyptus grandis]
          Length = 434

 Score =  238 bits (606), Expect = 3e-68
 Identities = 149/397 (37%), Positives = 219/397 (55%), Gaps = 9/397 (2%)
 Frame = +2

Query: 515  SICKVHRNLRSVKNDAYDPYIVSIGPYHHGNVNYKMMQDQKLRYLAELDQAVTNKCRD-- 688
            SI +V   LR V N AY+P I+ +GP+H+GN  +K M++QK+RY+ +L Q    +  D  
Sbjct: 41   SIFRVRPQLRRVNNKAYEPEILVVGPHHYGNDKFKSMEEQKMRYVQQLLQRRKEESIDRY 100

Query: 689  --AIKKLSGQVKKSYDEPLITPGPGSSYDIDEMMLEDGVFIIQLVIK----NLLKPNDPI 850
               +++L   V+  Y E +      S      MM  DG FI++L  K     L   + P+
Sbjct: 101  MPTLRELEQLVRNCYAETINL----SQEKFLAMMFIDGCFIVELFRKYNMEKLRNKDGPL 156

Query: 851  FNSAWIMNSVQRDLLLLENQIPYFILETLFGLIEKPNKTR-LIQLLIDFFDSLIPEPSLG 1027
              + WI   +QRDLLLLENQ+P F L  L+ L ++P++ R LI +   +FD  + +    
Sbjct: 157  MEADWIRYCLQRDLLLLENQLPLFFLNKLYDLTKRPDEPRELIDIATTYFDFKLGD---S 213

Query: 1028 GEKLTNHQSASPNHLVDLIHICWCPQVSGSIAIGINTNDRNQDMVEGKRKGRSHNRNQIP 1207
            G+  T  +S    HL+ L+H CW   +             N   + G+          + 
Sbjct: 214  GQCPTLRES---KHLLHLMHTCWTSGLP------------NVPRLSGRAPPTKEKLMFMS 258

Query: 1208 TATELTEAGVKLKNKTKGKNLFDITFQCGTLHIPPFSVDEGTECFLRNLVAYEQHYFGDN 1387
            +ATEL E+GVKL+   +G+++ DI F+ G L IP   V + TE   RNL+AYEQH  G  
Sbjct: 259  SATELRESGVKLR-AVRGRHMKDIRFENGKLEIPVLIVQDHTESQFRNLIAYEQHRQGGG 317

Query: 1388 RKTYVTDYVKLLDCLINSPKDVEILRVKGIAENWLGDNGGVANIFNTITDSVVLPSEDKF 1567
              +Y TDYV L+DCLINS KDVE+LR  GI +N+LGD+  +A +FN + D V LP+   F
Sbjct: 318  I-SYFTDYVTLMDCLINSSKDVEVLRRAGIIKNYLGDDEVIAQMFNRMGDYVTLPN---F 373

Query: 1568 LYAEVFKKVNGHCESPYHSSMASLKRNYLNGAWGYIS 1678
             Y+E+FK VN +C    +  MA L+R Y +  W ++S
Sbjct: 374  YYSEIFKTVNAYCNKRRNVWMAKLRREYFHSPWAFLS 410


>EOY05382.1 Uncharacterized protein TCM_020392 [Theobroma cacao]
          Length = 425

 Score =  237 bits (605), Expect = 3e-68
 Identities = 150/416 (36%), Positives = 220/416 (52%), Gaps = 9/416 (2%)
 Frame = +2

Query: 518  ICKVHRNLRSVKNDAYDPYIVSIGPYHHGNVNYKMMQDQKLRY----LAELDQAVTNKCR 685
            I +V   LR     AY+P +++IGPYHH   + K M++ K+RY    L E  +   ++  
Sbjct: 29   IARVPNYLRKANEQAYEPELIAIGPYHHAKPHLKAMEEHKIRYFQLLLQERRENDVSRYV 88

Query: 686  DAIKKLSGQVKKSYDEPLITPGPGSSYDIDEMMLEDGVFIIQLVIK----NLLKPNDPIF 853
              I+ L  + +K Y +P        S D  +M+L DG FI+QL+ K     L   +DPIF
Sbjct: 89   MIIRSLEEKARKCYSDPFAL----ESDDFVKMLLLDGCFIVQLIRKFSEIRLRDESDPIF 144

Query: 854  NSAWIMNSVQRDLLLLENQIPYFILETLFGLIEKPNKTRLIQLLIDFFDSLIPEPSLGGE 1033
                +  +++RD LL+ENQ+P F+L  L+ +IE P++   + ++  FF  ++P       
Sbjct: 145  KLVSLRGTIRRDTLLVENQLPLFVLWELYAMIEYPDQRTFMAIVFSFFCHILPGEGWPQN 204

Query: 1034 KLTNHQSASPNHLVDLIHICWCPQVSGSIAIGINTNDRNQDMVEGKRKGRSHNRNQIPTA 1213
             L + +     HLVDL+H CW P                 ++   +   ++   N I   
Sbjct: 205  SLNSIRVIK--HLVDLVHECWHPSPL--------------ELKAYQNLNKNVPWNFIHCV 248

Query: 1214 TELTEAGVKLKNKTKGKNLFDITFQCGTLHIPPFSVDEGTECFLRNLVAYEQHYFGDNRK 1393
            TEL EAG+K + K +G +LFD+ F+ GT+ IP   + +  E  LRNL+A+EQ  F  +R 
Sbjct: 249  TELKEAGIKFQMK-RGNSLFDLKFENGTMKIPTLRIYDSLEGTLRNLIAFEQ--FSSHRG 305

Query: 1394 -TYVTDYVKLLDCLINSPKDVEILRVKGIAENWLGDNGGVANIFNTITDSVVLPSEDKFL 1570
              +VTDYV L  CL+NS KDVEILR  GI EN LGD+  VA + N +  SV   S D F 
Sbjct: 306  LNHVTDYVLLFHCLVNSTKDVEILRQSGIIENMLGDDEEVARMLNRLGVSVFF-SPDNFY 364

Query: 1571 YAEVFKKVNGHCESPYHSSMASLKRNYLNGAWGYISXXXXXXXXXXXXXQTWLSYL 1738
            Y+E+F KVN +C+  ++  +A+LK NYLN  W  IS             QT  S L
Sbjct: 365  YSELFNKVNKYCDRRWNKWIANLKHNYLNSPWALISFLAAVVLLLLTLVQTIFSVL 420


>XP_010023544.1 PREDICTED: UPF0481 protein At3g47200 [Eucalyptus grandis] KCW59838.1
            hypothetical protein EUGRSUZ_H02583 [Eucalyptus grandis]
          Length = 434

 Score =  236 bits (601), Expect = 2e-67
 Identities = 151/421 (35%), Positives = 223/421 (52%), Gaps = 9/421 (2%)
 Frame = +2

Query: 515  SICKVHRNLRSVKNDAYDPYIVSIGPYHHGNVNYKMMQDQKLRYLAEL----DQAVTNKC 682
            SI +V R LR V   AY+P I++IGPYH GN  +K M++QKLRY+  L     +   +K 
Sbjct: 41   SIFRVRRQLRGVNEKAYEPEILAIGPYHSGNDKFKFMEEQKLRYVKRLLWRRREGSVDKY 100

Query: 683  RDAIKKLSGQVKKSYDEPLITPGPGSSYDIDEMMLEDGVFIIQLVIKNLLKP----NDPI 850
              A++ +    +  Y E +      S  +   MML DG+F+++L  KN +K     +DP+
Sbjct: 101  MPALRSMEQWARDCYAEAVDL----SQENFLAMMLIDGLFLVELFRKNSIKELRDEDDPV 156

Query: 851  FNSAWIMNSVQRDLLLLENQIPYFILETLFGLIEKPNK-TRLIQLLIDFFDSLIPEPSLG 1027
                WI   + RDL+LLENQIP+ ILE L+GL + P + + LI +   + +  +P  S  
Sbjct: 157  MKEDWIRFCLPRDLVLLENQIPFLILEELYGLTKGPEEHSELIDVATRYLN-FVPSDSNR 215

Query: 1028 GEKLTNHQSASPNHLVDLIHICWCPQVSGSIAIGINTNDRNQDMVEGKRKGRSHNRNQIP 1207
            G            HL+ L+H C        +      N R + M E           +  
Sbjct: 216  G------MLRESKHLLHLMHTCLT-----GLPRRHRLNPRTKPMTE-----------KFM 253

Query: 1208 TATELTEAGVKLKNKTKGKNLFDITFQCGTLHIPPFSVDEGTECFLRNLVAYEQHYFGDN 1387
            +  EL E GV+ + K K  +L DITF+ GTL IP  +V + TE  LRNL+AYEQH     
Sbjct: 254  SPAELREFGVRFRVK-KSPDLLDITFKDGTLEIPVLTVQDHTESQLRNLIAYEQHR-PSG 311

Query: 1388 RKTYVTDYVKLLDCLINSPKDVEILRVKGIAENWLGDNGGVANIFNTITDSVVLPSEDKF 1567
               Y+TDYV  +DCLI+S  DVE+LR  GI +N++GD+  VA +FN + D V L     F
Sbjct: 312  EVNYMTDYVTFMDCLIDSSTDVELLRAAGIIKNYMGDDEAVAQMFNKMGDYVTL---SNF 368

Query: 1568 LYAEVFKKVNGHCESPYHSSMASLKRNYLNGAWGYISXXXXXXXXXXXXXQTWLSYLQVK 1747
             Y ++F+++N HC+  ++ SMA L+R +L+  W  +S             QT L+YL  +
Sbjct: 369  YYDDIFRRLNAHCKKRWNRSMAKLRREHLHSPWALLSISAATMLLLLTVAQTVLTYLAYR 428

Query: 1748 H 1750
            +
Sbjct: 429  N 429


>KCW60065.1 hypothetical protein EUGRSUZ_H02794, partial [Eucalyptus grandis]
          Length = 421

 Score =  232 bits (592), Expect = 2e-66
 Identities = 154/423 (36%), Positives = 221/423 (52%), Gaps = 35/423 (8%)
 Frame = +2

Query: 515  SICKVHRNLRSVKNDAYDPYIVSIGPYHHGNVNYKMMQDQKLRY---------------- 646
            SI +V   LR V N AY+P I+ IGPYH+GN  +K M++QKLRY                
Sbjct: 2    SIFRVRPQLRRVNNKAYEPEILVIGPYHYGNDKFKSMEEQKLRYVQQLLQRRKEESVDWY 61

Query: 647  ---LAELDQAVT-----------NKCRDAIKKLSGQVKKSYDEPLITPGPGSSYDIDEMM 784
               L EL+Q V            ++    +++L   V+  Y E +      S      MM
Sbjct: 62   MPTLRELEQLVQQLLQRRKEESIDRYMPTLRELEQLVRNCYAETINL----SQEKFLAMM 117

Query: 785  LEDGVFIIQLVIK----NLLKPNDPIFNSAWIMNSVQRDLLLLENQIPYFILETLFGLIE 952
              DG FI++L  K     L   + P+  + WI   +QRDLLLLENQ+P F L  L+ L +
Sbjct: 118  FIDGCFIVELFRKYNMEKLRNKDGPLMEADWIRYCLQRDLLLLENQLPLFFLNKLYDLTK 177

Query: 953  KPNKTR-LIQLLIDFFDSLIPEPSLGGEKLTNHQSASPNHLVDLIHICWCPQVSGSIAIG 1129
            +P++ R LI +   +FD  + +    G+  T  +S    HL+ L+H CW   +       
Sbjct: 178  RPDEPRELIDIATTYFDFKLGD---SGQCPTLRES---KHLLHLMHTCWTSGLP------ 225

Query: 1130 INTNDRNQDMVEGKRKGRSHNRNQIPTATELTEAGVKLKNKTKGKNLFDITFQCGTLHIP 1309
                  N   + G+          + +ATEL E+GVKL+   +G+++ DI F+ G L IP
Sbjct: 226  ------NVPRLSGRAPPTKEKLMFMSSATELRESGVKLR-AVRGRHMKDIRFENGKLEIP 278

Query: 1310 PFSVDEGTECFLRNLVAYEQHYFGDNRKTYVTDYVKLLDCLINSPKDVEILRVKGIAENW 1489
               V + TE   RNL+AYEQH  G    +Y TDYV L+DCLINS KDVE+LR  GI +N+
Sbjct: 279  VLIVQDHTESQFRNLIAYEQHRQGGGI-SYFTDYVTLMDCLINSSKDVEVLRRAGIIKNY 337

Query: 1490 LGDNGGVANIFNTITDSVVLPSEDKFLYAEVFKKVNGHCESPYHSSMASLKRNYLNGAWG 1669
            LGD+  +A +FN + D V LP+   F Y+E+FK VN +C    +  MA L+R Y +  W 
Sbjct: 338  LGDDEVIAQMFNRMGDYVTLPN---FYYSEIFKTVNAYCNKRRNVWMAKLRREYFHSPWA 394

Query: 1670 YIS 1678
            ++S
Sbjct: 395  FLS 397


>XP_010026753.1 PREDICTED: UPF0481 protein At3g47200 [Eucalyptus grandis]
          Length = 434

 Score =  232 bits (592), Expect = 3e-66
 Identities = 150/401 (37%), Positives = 214/401 (53%), Gaps = 13/401 (3%)
 Frame = +2

Query: 515  SICKVHRNLRSVKNDAYDPYIVSIGPYHHGNVNYKMMQDQKLRYLAELDQAVTNKCRD-- 688
            SI +V   LR V N AY+P I+ IGPYH+GN  +K M++QK+RY+ +L Q    +  D  
Sbjct: 41   SIFRVRPQLRRVNNKAYEPEILVIGPYHYGNDKFKSMEEQKMRYVQQLLQRRKEESVDRY 100

Query: 689  --AIKKLSGQVKKSYDEPLITPGPGSSYDIDEMMLEDGVFIIQLVIK----NLLKPNDPI 850
               +++L   V+  Y E +      S      MM  DG FI++L  K     L   + P+
Sbjct: 101  MPTLRELEQLVRNCYAETINL----SQEKFLAMMFIDGCFIVELFRKYNMEKLRNKDGPL 156

Query: 851  FNSAWIMNSVQRDLLLLENQIPYFILETLFGLIEKPNKTR-LIQLLIDFFDSLIPE---- 1015
              + WI   +QRDLLLLENQ+P F L  L+ L + P++ R LI +   +FD  + +    
Sbjct: 157  MEADWIRYCLQRDLLLLENQLPLFFLNKLYDLTKGPDEPRKLIDIATTYFDFKLGDSDQC 216

Query: 1016 PSLGGEKLTNHQSASPNHLVDLIHICWCPQVSGSIAIGINTNDRNQDMVEGKRKGRSHNR 1195
            P+L   K          HL+ L+H CW   +             N   + G+        
Sbjct: 217  PTLRKSK----------HLLHLMHTCWTSGLP------------NVPRLYGRAPPTKEKL 254

Query: 1196 NQIPTATELTEAGVKLKNKTKGKNLFDITFQCGTLHIPPFSVDEGTECFLRNLVAYEQHY 1375
              + +ATEL E+GVKL+   +G+ + DI F+ G L IP   V + TE   RNL+AYEQH 
Sbjct: 255  MFMSSATELRESGVKLR-AVRGRRMKDIRFENGKLEIPVLIVQDHTESQFRNLIAYEQHR 313

Query: 1376 FGDNRKTYVTDYVKLLDCLINSPKDVEILRVKGIAENWLGDNGGVANIFNTITDSVVLPS 1555
             G    +Y TDYV L+DCLINS KDVE+LR  GI +N+LGD+  +A +FN + D V L  
Sbjct: 314  QGGGI-SYFTDYVTLMDCLINSSKDVEVLRRAGIIKNYLGDDEVIAQMFNRMGDYVTL-- 370

Query: 1556 EDKFLYAEVFKKVNGHCESPYHSSMASLKRNYLNGAWGYIS 1678
               F Y+E+FK VN +C    +  MA L+R Y +  W ++S
Sbjct: 371  -SNFYYSEIFKTVNAYCNKRRNVWMAKLRREYFHSPWAFLS 410

