BLASTX nr result

ID: Forsythia21_contig00013370 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Forsythia21_contig00013370
         (2279 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011091511.1| PREDICTED: protein INVOLVED IN DE NOVO 2 [Se...  1053   0.0  
ref|XP_012829115.1| PREDICTED: protein INVOLVED IN DE NOVO 2 [Er...   961   0.0  
ref|XP_009589398.1| PREDICTED: golgin candidate 4 [Nicotiana tom...   960   0.0  
ref|XP_009799105.1| PREDICTED: golgin candidate 4 [Nicotiana syl...   949   0.0  
ref|XP_006362466.1| PREDICTED: myosin-2 heavy chain-like isoform...   948   0.0  
gb|EYU17877.1| hypothetical protein MIMGU_mgv1a002871mg [Erythra...   947   0.0  
ref|XP_004240949.1| PREDICTED: protein INVOLVED IN DE NOVO 2 [So...   943   0.0  
ref|XP_006362467.1| PREDICTED: myosin-2 heavy chain-like isoform...   942   0.0  
emb|CDP00105.1| unnamed protein product [Coffea canephora]            903   0.0  
ref|XP_002278500.1| PREDICTED: protein INVOLVED IN DE NOVO 2 iso...   871   0.0  
ref|XP_007009298.1| XH/XS domain-containing protein, putative is...   867   0.0  
ref|XP_010658558.1| PREDICTED: protein INVOLVED IN DE NOVO 2 iso...   861   0.0  
ref|XP_007009299.1| XH/XS domain-containing protein, putative is...   859   0.0  
gb|KHG15173.1| Forkhead-associated domain-containing 1 [Gossypiu...   857   0.0  
ref|XP_006435548.1| hypothetical protein CICLE_v10030937mg [Citr...   851   0.0  
gb|KDO69291.1| hypothetical protein CISIN_1g0065972mg [Citrus si...   850   0.0  
ref|XP_012459360.1| PREDICTED: protein INVOLVED IN DE NOVO 2-lik...   848   0.0  
ref|XP_012830016.1| PREDICTED: protein INVOLVED IN DE NOVO 2-lik...   847   0.0  
ref|XP_012089069.1| PREDICTED: protein INVOLVED IN DE NOVO 2 [Ja...   836   0.0  
ref|XP_002533154.1| conserved hypothetical protein [Ricinus comm...   833   0.0  

>ref|XP_011091511.1| PREDICTED: protein INVOLVED IN DE NOVO 2 [Sesamum indicum]
          Length = 641

 Score = 1053 bits (2723), Expect = 0.0
 Identities = 516/641 (80%), Positives = 573/641 (89%), Gaps = 1/641 (0%)
 Frame = -3

Query: 2106 MGSSMEYSSGDDTDVSDSXXXXXXXXXXE-LKNGNHQYKVANDAYTCPFCPTKRKRDFLY 1930
            MGSS E+SSG++TD+SDS          E LK+G HQ K+++ AYTCP+CP KRKRDF Y
Sbjct: 1    MGSSAEHSSGEETDISDSETEDYQEKAYEELKSGKHQVKISDQAYTCPYCPKKRKRDFQY 60

Query: 1929 KELLQHAMGVGKCNSQKRTRKDKANHLALAKHLENDVVVDAGPSQQATMVDPLADHDRDE 1750
            KELLQHA  +G C+S KRT +DKANHLALAK+LEND  VDAGPS+ +  VD LAD DRDE
Sbjct: 61   KELLQHASAIGSCSSHKRTARDKANHLALAKYLENDTAVDAGPSKPSAEVDALADQDRDE 120

Query: 1749 MFVWPWIGIVVNIPTEFKDGRYVGESGTKLRDQLTRRGFNPTRVRPLWNYQGHSGTALVE 1570
            +FVWPWIGIVVNIPT+FKDGRYVGESG+KLRDQLTRRGFNPTRVRPLWNYQGHSGTA+VE
Sbjct: 121  LFVWPWIGIVVNIPTDFKDGRYVGESGSKLRDQLTRRGFNPTRVRPLWNYQGHSGTAIVE 180

Query: 1569 FQKDWFGFNNAMSFEKAYEANHHGKRSWHSKNDLKSDLYGWVARADDYNSDAIFGENLRK 1390
            FQKDW GF NAMSFEK YEANHHG+++W +KN +KSDLY WVARADDYNS+ I GENLRK
Sbjct: 181  FQKDWSGFTNAMSFEKFYEANHHGQKNWLAKNGMKSDLYAWVARADDYNSNNIVGENLRK 240

Query: 1389 IGDLRTISDIMEDEARKTQKLVGNLTNVIEAKKMHLLEMENKFKETENSLSQLITEKDKL 1210
            IGDLRTISDIME+EARKT KLVGNLTNVIEAKKMHLLEMENKFKETE+SL QLI EKD L
Sbjct: 241  IGDLRTISDIMEEEARKTNKLVGNLTNVIEAKKMHLLEMENKFKETESSLLQLIMEKDNL 300

Query: 1209 LQEYNDEIKKIQSSAQDHFQKIFNDHEKLKLQLETQKRELEHRGHELAKRETHNENERRK 1030
             Q YN+EIKKI+SSA+DHF+KIFNDHEKLKLQLETQKR+LE RG EL KRETHNE ER+K
Sbjct: 301  HQAYNEEIKKIESSARDHFKKIFNDHEKLKLQLETQKRDLEFRGQELMKRETHNEIERKK 360

Query: 1029 LSEELEENAVKNCSLQAATNEQRKADEKVMKLAEEQKKQKENLHKKIIQLEKQLDAKQAV 850
            LSE+LE+NAVKNCSLQAA  EQRKADEKVMKLAEEQKKQKENLHK+II LEKQLDAKQAV
Sbjct: 361  LSEDLEQNAVKNCSLQAAAEEQRKADEKVMKLAEEQKKQKENLHKRIILLEKQLDAKQAV 420

Query: 849  ELEIEQLRGKLNVMRHMEDEGDLEVLEKVDSLLKSLREKEGELEDVLALNQTLVVQERNS 670
            +LEIEQLRGKLNVM+HM DEGDLEVL KVD LLK+LREKEGELED+  LNQTL+VQER  
Sbjct: 421  QLEIEQLRGKLNVMKHMGDEGDLEVLNKVDLLLKALREKEGELEDLEVLNQTLIVQERKR 480

Query: 669  NDELQDARKELVNGLKELSTNGQIGVKRMGELNSKPFHEAMKRKYNEVEADERATELCSL 490
            NDELQDARKELVNGLKE+STNG IGVKRMGEL+SKPFH+AMKRKY+E EADERATELCSL
Sbjct: 481  NDELQDARKELVNGLKEMSTNGHIGVKRMGELDSKPFHDAMKRKYSEAEADERATELCSL 540

Query: 489  WEEYLRDPEWHPIKVVEINGKHQAVIDQDDEKLKDLRNNYGDEMYDAVTSALTEVNEYNP 310
            WEEYLRDPEWHPIKVV INGKHQAVI +DDEKL+DL+ NYGDE+Y+AVT+AL E+NEYNP
Sbjct: 541  WEEYLRDPEWHPIKVVSINGKHQAVIKEDDEKLRDLKENYGDEVYNAVTAALCEINEYNP 600

Query: 309  SGRYIISELWNYSEGRKATLQEGVAHIMKLWRTYKRKRGMD 187
            SGRYIISELWNY+EGR+ATL+EGV  ++K WR  KRKRGMD
Sbjct: 601  SGRYIISELWNYNEGRRATLKEGVEVLLKQWRYQKRKRGMD 641


>ref|XP_012829115.1| PREDICTED: protein INVOLVED IN DE NOVO 2 [Erythranthe guttatus]
          Length = 639

 Score =  961 bits (2484), Expect = 0.0
 Identities = 476/642 (74%), Positives = 550/642 (85%), Gaps = 2/642 (0%)
 Frame = -3

Query: 2106 MGSSMEYSSGDDTDVSDSXXXXXXXXXXE-LKNGNHQYKVANDAYTCPFCPTKRKRDFLY 1930
            MGSS E+SSG+DTD+SDS          E LK G HQ K+++ AYTCP+CPTK+K  F Y
Sbjct: 1    MGSSGEHSSGEDTDISDSETEEYTEKSYEELKKGKHQVKLSDQAYTCPYCPTKKKA-FQY 59

Query: 1929 KELLQHAMGVGKCNSQKRTRKDKANHLALAKHLENDVVVDAG-PSQQATMVDPLADHDRD 1753
            K+L QHA  +    S KR  +++ANHLAL K+LE D  VD+G PS+ A  +D LADHD D
Sbjct: 60   KDLFQHANAIA--TSPKRNARERANHLALTKYLETDSSVDSGGPSKPAGKLDTLADHDGD 117

Query: 1752 EMFVWPWIGIVVNIPTEFKDGRYVGESGTKLRDQLTRRGFNPTRVRPLWNYQGHSGTALV 1573
            EMFVWPWIGIVVNIPT+ KDGRYVGESG+KLRDQL RRGFNPTRVRPLWNYQGHSGTA+V
Sbjct: 118  EMFVWPWIGIVVNIPTDLKDGRYVGESGSKLRDQLARRGFNPTRVRPLWNYQGHSGTAIV 177

Query: 1572 EFQKDWFGFNNAMSFEKAYEANHHGKRSWHSKNDLKSDLYGWVARADDYNSDAIFGENLR 1393
            EF K+W GF NAMSFEK YEANHHG+++W +KN+ KSDLY WVAR DDYN + I GENLR
Sbjct: 178  EFHKNWSGFTNAMSFEKDYEANHHGRKNWLAKNEKKSDLYAWVARTDDYNLNNIVGENLR 237

Query: 1392 KIGDLRTISDIMEDEARKTQKLVGNLTNVIEAKKMHLLEMENKFKETENSLSQLITEKDK 1213
            KIGDLRTISDIME+EARKT KLVGNLTNVIEAKKMHL+EME+KFKET++SL Q+I EKD 
Sbjct: 238  KIGDLRTISDIMEEEARKTNKLVGNLTNVIEAKKMHLIEMESKFKETDSSLRQMIMEKDN 297

Query: 1212 LLQEYNDEIKKIQSSAQDHFQKIFNDHEKLKLQLETQKRELEHRGHELAKRETHNENERR 1033
            L Q YN+EIKKI+SSA+DHF+KIFNDHEKLK QLETQKR+LE RG EL KRETHNE E++
Sbjct: 298  LHQAYNEEIKKIESSARDHFKKIFNDHEKLKFQLETQKRDLELRGQELMKRETHNEIEKK 357

Query: 1032 KLSEELEENAVKNCSLQAATNEQRKADEKVMKLAEEQKKQKENLHKKIIQLEKQLDAKQA 853
            KL+E+LE+NAVKNCSLQAA  EQRKADEKVMKLAEE K +KE LHK+IIQLEKQLDAKQA
Sbjct: 358  KLAEDLEQNAVKNCSLQAAAEEQRKADEKVMKLAEEHKNEKEKLHKRIIQLEKQLDAKQA 417

Query: 852  VELEIEQLRGKLNVMRHMEDEGDLEVLEKVDSLLKSLREKEGELEDVLALNQTLVVQERN 673
            V+LEIEQLRGKLNV++HM DEGDLEVL KVD +LK+LREKEGEL+D+ ALNQTLVVQER 
Sbjct: 418  VQLEIEQLRGKLNVVKHMGDEGDLEVLNKVDLVLKALREKEGELDDLEALNQTLVVQERK 477

Query: 672  SNDELQDARKELVNGLKELSTNGQIGVKRMGELNSKPFHEAMKRKYNEVEADERATELCS 493
            SNDELQDARKELVNGLK++S N  IGVKRMGEL+SKPFH+AMKRKYNE EADERATELCS
Sbjct: 478  SNDELQDARKELVNGLKDMSLNSHIGVKRMGELDSKPFHDAMKRKYNEAEADERATELCS 537

Query: 492  LWEEYLRDPEWHPIKVVEINGKHQAVIDQDDEKLKDLRNNYGDEMYDAVTSALTEVNEYN 313
            LWEEYL+DPEW PIKVV I+G  + VI++DDEKL DLR NYG+E+Y+AV + L E+NEYN
Sbjct: 538  LWEEYLKDPEWDPIKVVTIDGTPKNVINEDDEKLIDLRENYGEEVYNAVKATLFEINEYN 597

Query: 312  PSGRYIISELWNYSEGRKATLQEGVAHIMKLWRTYKRKRGMD 187
            PSGRY+ISELWNY EGR+A L+EGVA ++K W+ +KRK+GMD
Sbjct: 598  PSGRYVISELWNYKEGRRAYLKEGVAVLLKQWKFFKRKKGMD 639


>ref|XP_009589398.1| PREDICTED: golgin candidate 4 [Nicotiana tomentosiformis]
            gi|697161248|ref|XP_009589399.1| PREDICTED: golgin
            candidate 4 [Nicotiana tomentosiformis]
          Length = 638

 Score =  960 bits (2481), Expect = 0.0
 Identities = 481/644 (74%), Positives = 543/644 (84%), Gaps = 8/644 (1%)
 Frame = -3

Query: 2094 MEYSSGDDTDVSDSXXXXXXXXXXE-LKNGNHQYKVANDAYTCPFCPTKRKRDFLYKELL 1918
            M++SSG+DTD+S+S          E LK GNH  K+++ AYTCP+CP KRKRDF YKEL+
Sbjct: 1    MDHSSGEDTDISESEIEEYEEKSYEDLKRGNHSLKISDVAYTCPYCPKKRKRDFGYKELV 60

Query: 1917 QHAMGVGKCNSQKRTRKDKANHLALAKHLENDVVV-------DAGPSQQATMVDPLADHD 1759
            QHA GVG CNS KRT +DKANHLALAK+LENDV V       DA P  QA   DPLADHD
Sbjct: 61   QHASGVGSCNSNKRTARDKANHLALAKYLENDVAVAGDSSKPDAKPDPQA---DPLADHD 117

Query: 1758 RDEMFVWPWIGIVVNIPTEFKDGRYVGESGTKLRDQLTRRGFNPTRVRPLWNYQGHSGTA 1579
            RDEMFVWPWIGIVVNI TE+KDGRYVGESG+KLRDQL RRGFNPTRVRPLWNYQGHSGTA
Sbjct: 118  RDEMFVWPWIGIVVNILTEYKDGRYVGESGSKLRDQLARRGFNPTRVRPLWNYQGHSGTA 177

Query: 1578 LVEFQKDWFGFNNAMSFEKAYEANHHGKRSWHSKNDLKSDLYGWVARADDYNSDAIFGEN 1399
            LVEF KDW GF NAMSFEKAYEA+HHGKR W +    KSDLY W+ARADDY +  I GEN
Sbjct: 178  LVEFNKDWSGFGNAMSFEKAYEADHHGKREWKANIGKKSDLYAWIARADDYKALNIVGEN 237

Query: 1398 LRKIGDLRTISDIMEDEARKTQKLVGNLTNVIEAKKMHLLEMENKFKETENSLSQLITEK 1219
            LRK+GDLRTISDIM++EARKT KLV NLTNVIE KK+HL EMENKFKET  SL QL+ EK
Sbjct: 238  LRKVGDLRTISDIMDEEARKTNKLVSNLTNVIEVKKLHLKEMENKFKETAQSLQQLVEEK 297

Query: 1218 DKLLQEYNDEIKKIQSSAQDHFQKIFNDHEKLKLQLETQKRELEHRGHELAKRETHNENE 1039
            DKL Q YN+EI+KIQSSA+DHFQKIFNDHEKLKLQLE+QK+ELE RG EL +RE  NE++
Sbjct: 298  DKLHQAYNEEIRKIQSSARDHFQKIFNDHEKLKLQLESQKKELELRGRELEQREAKNESD 357

Query: 1038 RRKLSEELEENAVKNCSLQAATNEQRKADEKVMKLAEEQKKQKENLHKKIIQLEKQLDAK 859
            R+KLSEELE+NAV N SL AA  EQRK DEKV+KLAEEQK+QKE+LHK+II LEKQLDAK
Sbjct: 358  RKKLSEELEQNAVLNTSLSAAAEEQRKVDEKVLKLAEEQKRQKEDLHKRIIHLEKQLDAK 417

Query: 858  QAVELEIEQLRGKLNVMRHMEDEGDLEVLEKVDSLLKSLREKEGELEDVLALNQTLVVQE 679
            QAVELEIEQLRG LNVM+H+EDEGD EVL+KVD+LLKSLREKE E E + ALNQTL+V+E
Sbjct: 418  QAVELEIEQLRGSLNVMKHIEDEGDQEVLKKVDTLLKSLREKEEEYEGLEALNQTLIVKE 477

Query: 678  RNSNDELQDARKELVNGLKELSTNGQIGVKRMGELNSKPFHEAMKRKYNEVEADERATEL 499
            RNSNDELQDARKELVNGLKEL   G IGVKRMGEL+++PFHEAMKRKYNE EADERATEL
Sbjct: 478  RNSNDELQDARKELVNGLKELPRVGPIGVKRMGELDNRPFHEAMKRKYNESEADERATEL 537

Query: 498  CSLWEEYLRDPEWHPIKVVEINGKHQAVIDQDDEKLKDLRNNYGDEMYDAVTSALTEVNE 319
            CSLWEEYLRDP WHPIKVV +NGK + VID +DEKLKDL+ NYG+E+Y AVT+AL E+N+
Sbjct: 538  CSLWEEYLRDPGWHPIKVVTVNGKLENVIDDEDEKLKDLKKNYGEEVYKAVTAALMEIND 597

Query: 318  YNPSGRYIISELWNYSEGRKATLQEGVAHIMKLWRTYKRKRGMD 187
            YNPSGRYIISELWNY+  +KATL+EGV  ++ LW   K+KRG D
Sbjct: 598  YNPSGRYIISELWNYAVNQKATLEEGVTVLLNLW---KKKRGPD 638


>ref|XP_009799105.1| PREDICTED: golgin candidate 4 [Nicotiana sylvestris]
            gi|698507540|ref|XP_009799106.1| PREDICTED: golgin
            candidate 4 [Nicotiana sylvestris]
          Length = 639

 Score =  949 bits (2453), Expect = 0.0
 Identities = 478/645 (74%), Positives = 540/645 (83%), Gaps = 12/645 (1%)
 Frame = -3

Query: 2094 MEYSSGDDTDVSDSXXXXXXXXXXE-LKNGNHQYKVANDAYTCPFCPTKRKRDFLYKELL 1918
            M++SSG+DTD+S+S          E LK GNH  K+++ AYTCP+CP KRKRDF YKEL+
Sbjct: 1    MDHSSGEDTDISESEIEEYEEKSYEDLKRGNHSLKISDVAYTCPYCPKKRKRDFGYKELV 60

Query: 1917 QHAMGVGKCNSQKRTRKDKANHLALAKHLENDVVV-----------DAGPSQQATMVDPL 1771
            QHA GVG  NS KRT +DKANHLALAK+LENDVVV           DA P  QA   DPL
Sbjct: 61   QHASGVGSGNSNKRTARDKANHLALAKYLENDVVVAGDSSNDSSKPDAKPDPQA---DPL 117

Query: 1770 ADHDRDEMFVWPWIGIVVNIPTEFKDGRYVGESGTKLRDQLTRRGFNPTRVRPLWNYQGH 1591
            ADHDRDEMFVWPWIGIVVNI TE+KDGRYVGESG+KLRDQL RRGFNPTRVRPLWNYQGH
Sbjct: 118  ADHDRDEMFVWPWIGIVVNILTEYKDGRYVGESGSKLRDQLARRGFNPTRVRPLWNYQGH 177

Query: 1590 SGTALVEFQKDWFGFNNAMSFEKAYEANHHGKRSWHSKNDLKSDLYGWVARADDYNSDAI 1411
            SGTALVEF KDW GF NAMSFEKAYEA+HHGKR W +    KSDLY W+ARADDY +  I
Sbjct: 178  SGTALVEFNKDWSGFGNAMSFEKAYEADHHGKREWKANIGKKSDLYAWIARADDYKALNI 237

Query: 1410 FGENLRKIGDLRTISDIMEDEARKTQKLVGNLTNVIEAKKMHLLEMENKFKETENSLSQL 1231
             GENLRK+GDLRTISDIME+EARKT KLV NLTNVIE KK+HL EMENKFKET  SL QL
Sbjct: 238  VGENLRKVGDLRTISDIMEEEARKTNKLVSNLTNVIEVKKLHLKEMENKFKETAQSLQQL 297

Query: 1230 ITEKDKLLQEYNDEIKKIQSSAQDHFQKIFNDHEKLKLQLETQKRELEHRGHELAKRETH 1051
            + EKDKL Q YN+EI+KIQSSA+DHFQKIFNDHEKLKLQLE+QK+ELE RG EL +RE  
Sbjct: 298  VEEKDKLHQAYNEEIRKIQSSARDHFQKIFNDHEKLKLQLESQKKELELRGRELEQREAK 357

Query: 1050 NENERRKLSEELEENAVKNCSLQAATNEQRKADEKVMKLAEEQKKQKENLHKKIIQLEKQ 871
            NE++R+KLSEELE+NAV N SL AA  EQRK DEKV+KLAEEQK+QKE+LHK+II LEKQ
Sbjct: 358  NESDRKKLSEELEQNAVLNTSLSAAAEEQRKVDEKVLKLAEEQKRQKEDLHKRIIHLEKQ 417

Query: 870  LDAKQAVELEIEQLRGKLNVMRHMEDEGDLEVLEKVDSLLKSLREKEGELEDVLALNQTL 691
            LDAKQAVELEIEQLRG L VM+H+EDEGD EVL+KVD+LLKSLREKE E E + ALNQTL
Sbjct: 418  LDAKQAVELEIEQLRGSLKVMKHIEDEGDQEVLKKVDTLLKSLREKEEEYEGLEALNQTL 477

Query: 690  VVQERNSNDELQDARKELVNGLKELSTNGQIGVKRMGELNSKPFHEAMKRKYNEVEADER 511
            +V+ERNSNDELQDARKELVNGLKEL   G IGVKRMGEL+++PFHEAMKRKYNE EADER
Sbjct: 478  IVKERNSNDELQDARKELVNGLKELPRVGPIGVKRMGELDNRPFHEAMKRKYNESEADER 537

Query: 510  ATELCSLWEEYLRDPEWHPIKVVEINGKHQAVIDQDDEKLKDLRNNYGDEMYDAVTSALT 331
            ATELCSLWEEYLRDP WHPIKVV +NGK + VID +DEKLKDL+ NYG+++Y AVT+AL 
Sbjct: 538  ATELCSLWEEYLRDPGWHPIKVVTVNGKLENVIDDEDEKLKDLKRNYGEDVYKAVTAALI 597

Query: 330  EVNEYNPSGRYIISELWNYSEGRKATLQEGVAHIMKLWRTYKRKR 196
            E+N+YNPSGRYIISELWNY+  +KATL+EGV  ++ LW   K+KR
Sbjct: 598  EINDYNPSGRYIISELWNYAVNQKATLEEGVTVLLNLW---KKKR 639


>ref|XP_006362466.1| PREDICTED: myosin-2 heavy chain-like isoform X1 [Solanum tuberosum]
          Length = 653

 Score =  948 bits (2450), Expect = 0.0
 Identities = 470/644 (72%), Positives = 541/644 (84%), Gaps = 5/644 (0%)
 Frame = -3

Query: 2103 GSSMEYSSGDDTDVSDSXXXXXXXXXXE-LKNGNHQYKVANDAYTCPFCPTKRKRDFLYK 1927
            GSSM++SSG+DTD+S+S          E LK+G+H  K+++ AYTCP+CP KRKRDFLYK
Sbjct: 13   GSSMDHSSGEDTDISESEIEEYGEKSYEDLKSGSHSLKISDVAYTCPYCPKKRKRDFLYK 72

Query: 1926 ELLQHAMGVGKCNSQKRTRKDKANHLALAKHLENDVVVDAGPS----QQATMVDPLADHD 1759
            EL+QHA GVG C+S KRT ++KANHL LAK+LE D  V A  S    +  +  DPLADHD
Sbjct: 73   ELIQHASGVGSCSSNKRTAREKANHLGLAKYLETDAAVAADSSKPDAEPDSQTDPLADHD 132

Query: 1758 RDEMFVWPWIGIVVNIPTEFKDGRYVGESGTKLRDQLTRRGFNPTRVRPLWNYQGHSGTA 1579
            RDEMFVWPWIGIVVNIPTEFKDGR VGESG+KLRDQLTRRGFNPTRVRPLWNYQGHSGTA
Sbjct: 133  RDEMFVWPWIGIVVNIPTEFKDGRNVGESGSKLRDQLTRRGFNPTRVRPLWNYQGHSGTA 192

Query: 1578 LVEFQKDWFGFNNAMSFEKAYEANHHGKRSWHSKNDLKSDLYGWVARADDYNSDAIFGEN 1399
            LVEF KDW GF NAM++EKAYEA+HHGK+ W   +   SDLY W+ARADDY +  I GEN
Sbjct: 193  LVEFNKDWSGFGNAMAYEKAYEADHHGKKDWKVNHCKNSDLYAWIARADDYKAINIIGEN 252

Query: 1398 LRKIGDLRTISDIMEDEARKTQKLVGNLTNVIEAKKMHLLEMENKFKETENSLSQLITEK 1219
            LRK+GDLRTISDIME+EARKT KLV NLTNVIE KK+HL EM +KFKET  SL QL+ EK
Sbjct: 253  LRKVGDLRTISDIMEEEARKTNKLVSNLTNVIEVKKLHLKEMADKFKETSQSLKQLVEEK 312

Query: 1218 DKLLQEYNDEIKKIQSSAQDHFQKIFNDHEKLKLQLETQKRELEHRGHELAKRETHNENE 1039
            DKL Q YN+EI+KIQSSA+DHFQKIFNDHEKLKLQLE+QK+ELE RG EL KRE  NE++
Sbjct: 313  DKLHQAYNEEIRKIQSSARDHFQKIFNDHEKLKLQLESQKKELELRGRELEKREAKNESD 372

Query: 1038 RRKLSEELEENAVKNCSLQAATNEQRKADEKVMKLAEEQKKQKENLHKKIIQLEKQLDAK 859
            R+KLSE+LE+NA  N SL AA  EQRK DEKV+KLAEEQK+QKE+LHK+IIQLEKQLDAK
Sbjct: 373  RKKLSEDLEQNATLNTSLSAAAEEQRKVDEKVLKLAEEQKRQKEDLHKRIIQLEKQLDAK 432

Query: 858  QAVELEIEQLRGKLNVMRHMEDEGDLEVLEKVDSLLKSLREKEGELEDVLALNQTLVVQE 679
            QAVELEIEQLRG LNVM+H+EDEGD EVL+KVD+LLKSLREKE E + + ALNQTL+V+E
Sbjct: 433  QAVELEIEQLRGSLNVMKHIEDEGDQEVLKKVDTLLKSLREKEEEYDGLEALNQTLIVKE 492

Query: 678  RNSNDELQDARKELVNGLKELSTNGQIGVKRMGELNSKPFHEAMKRKYNEVEADERATEL 499
            RNSNDELQDARKELVNGLKEL   G IGVKRMGEL+++PFHEAMKR YNE EADERATEL
Sbjct: 493  RNSNDELQDARKELVNGLKELPRVGPIGVKRMGELDNRPFHEAMKRSYNESEADERATEL 552

Query: 498  CSLWEEYLRDPEWHPIKVVEINGKHQAVIDQDDEKLKDLRNNYGDEMYDAVTSALTEVNE 319
            CSLWEEYLRDP WHPIKVV +NGK + VID +DEKLKDL+ NYG+E+  AVT+AL EVN+
Sbjct: 553  CSLWEEYLRDPGWHPIKVVMVNGKPENVIDDEDEKLKDLKKNYGEEVCKAVTAALMEVND 612

Query: 318  YNPSGRYIISELWNYSEGRKATLQEGVAHIMKLWRTYKRKRGMD 187
            YNPSGRYIISELWNY+  +KATL+EGV  ++ +W   K+KRG D
Sbjct: 613  YNPSGRYIISELWNYAVNKKATLEEGVTVLLNMW---KKKRGSD 653


>gb|EYU17877.1| hypothetical protein MIMGU_mgv1a002871mg [Erythranthe guttata]
          Length = 629

 Score =  947 bits (2449), Expect = 0.0
 Identities = 470/641 (73%), Positives = 542/641 (84%), Gaps = 1/641 (0%)
 Frame = -3

Query: 2106 MGSSMEYSSGDDTDVSDSXXXXXXXXXXELKNGNHQYKVANDAYTCPFCPTKRKRDFLYK 1927
            MGSS E+SSG+DTD+SDS                  Y+    AYTCP+CPTK+K  F YK
Sbjct: 1    MGSSGEHSSGEDTDISDSETEEYT---------EKSYEELKKAYTCPYCPTKKKA-FQYK 50

Query: 1926 ELLQHAMGVGKCNSQKRTRKDKANHLALAKHLENDVVVDAG-PSQQATMVDPLADHDRDE 1750
            +L QHA  +    S KR  +++ANHLAL K+LE D  VD+G PS+ A  +D LADHD DE
Sbjct: 51   DLFQHANAIA--TSPKRNARERANHLALTKYLETDSSVDSGGPSKPAGKLDTLADHDGDE 108

Query: 1749 MFVWPWIGIVVNIPTEFKDGRYVGESGTKLRDQLTRRGFNPTRVRPLWNYQGHSGTALVE 1570
            MFVWPWIGIVVNIPT+ KDGRYVGESG+KLRDQL RRGFNPTRVRPLWNYQGHSGTA+VE
Sbjct: 109  MFVWPWIGIVVNIPTDLKDGRYVGESGSKLRDQLARRGFNPTRVRPLWNYQGHSGTAIVE 168

Query: 1569 FQKDWFGFNNAMSFEKAYEANHHGKRSWHSKNDLKSDLYGWVARADDYNSDAIFGENLRK 1390
            F K+W GF NAMSFEK YEANHHG+++W +KN+ KSDLY WVAR DDYN + I GENLRK
Sbjct: 169  FHKNWSGFTNAMSFEKDYEANHHGRKNWLAKNEKKSDLYAWVARTDDYNLNNIVGENLRK 228

Query: 1389 IGDLRTISDIMEDEARKTQKLVGNLTNVIEAKKMHLLEMENKFKETENSLSQLITEKDKL 1210
            IGDLRTISDIME+EARKT KLVGNLTNVIEAKKMHL+EME+KFKET++SL Q+I EKD L
Sbjct: 229  IGDLRTISDIMEEEARKTNKLVGNLTNVIEAKKMHLIEMESKFKETDSSLRQMIMEKDNL 288

Query: 1209 LQEYNDEIKKIQSSAQDHFQKIFNDHEKLKLQLETQKRELEHRGHELAKRETHNENERRK 1030
             Q YN+EIKKI+SSA+DHF+KIFNDHEKLK QLETQKR+LE RG EL KRETHNE E++K
Sbjct: 289  HQAYNEEIKKIESSARDHFKKIFNDHEKLKFQLETQKRDLELRGQELMKRETHNEIEKKK 348

Query: 1029 LSEELEENAVKNCSLQAATNEQRKADEKVMKLAEEQKKQKENLHKKIIQLEKQLDAKQAV 850
            L+E+LE+NAVKNCSLQAA  EQRKADEKVMKLAEE K +KE LHK+IIQLEKQLDAKQAV
Sbjct: 349  LAEDLEQNAVKNCSLQAAAEEQRKADEKVMKLAEEHKNEKEKLHKRIIQLEKQLDAKQAV 408

Query: 849  ELEIEQLRGKLNVMRHMEDEGDLEVLEKVDSLLKSLREKEGELEDVLALNQTLVVQERNS 670
            +LEIEQLRGKLNV++HM DEGDLEVL KVD +LK+LREKEGEL+D+ ALNQTLVVQER S
Sbjct: 409  QLEIEQLRGKLNVVKHMGDEGDLEVLNKVDLVLKALREKEGELDDLEALNQTLVVQERKS 468

Query: 669  NDELQDARKELVNGLKELSTNGQIGVKRMGELNSKPFHEAMKRKYNEVEADERATELCSL 490
            NDELQDARKELVNGLK++S N  IGVKRMGEL+SKPFH+AMKRKYNE EADERATELCSL
Sbjct: 469  NDELQDARKELVNGLKDMSLNSHIGVKRMGELDSKPFHDAMKRKYNEAEADERATELCSL 528

Query: 489  WEEYLRDPEWHPIKVVEINGKHQAVIDQDDEKLKDLRNNYGDEMYDAVTSALTEVNEYNP 310
            WEEYL+DPEW PIKVV I+G  + VI++DDEKL DLR NYG+E+Y+AV + L E+NEYNP
Sbjct: 529  WEEYLKDPEWDPIKVVTIDGTPKNVINEDDEKLIDLRENYGEEVYNAVKATLFEINEYNP 588

Query: 309  SGRYIISELWNYSEGRKATLQEGVAHIMKLWRTYKRKRGMD 187
            SGRY+ISELWNY EGR+A L+EGVA ++K W+ +KRK+GMD
Sbjct: 589  SGRYVISELWNYKEGRRAYLKEGVAVLLKQWKFFKRKKGMD 629


>ref|XP_004240949.1| PREDICTED: protein INVOLVED IN DE NOVO 2 [Solanum lycopersicum]
          Length = 638

 Score =  943 bits (2438), Expect = 0.0
 Identities = 467/641 (72%), Positives = 538/641 (83%), Gaps = 5/641 (0%)
 Frame = -3

Query: 2094 MEYSSGDDTDVSDSXXXXXXXXXXE-LKNGNHQYKVANDAYTCPFCPTKRKRDFLYKELL 1918
            M+YSSG+DTD+S+S          E LK+G+   K+++ AYTCP+CP KRKRDFLYKEL+
Sbjct: 1    MDYSSGEDTDISESEIEEYGEKSYEDLKSGSRSLKISDVAYTCPYCPKKRKRDFLYKELV 60

Query: 1917 QHAMGVGKCNSQKRTRKDKANHLALAKHLENDVVVDAGPS----QQATMVDPLADHDRDE 1750
            QHA GVG C+S KRT ++KANHL LAK+LE D  V A  S    +  T  DPLADHDRDE
Sbjct: 61   QHASGVGSCSSNKRTAREKANHLGLAKYLETDAAVAADSSKPDAEPDTQTDPLADHDRDE 120

Query: 1749 MFVWPWIGIVVNIPTEFKDGRYVGESGTKLRDQLTRRGFNPTRVRPLWNYQGHSGTALVE 1570
            MFVWPWIG+VVNIPTE+KDGR VGESG+KLRDQLTRRGFNPTRVRPLWNYQGHSGTALVE
Sbjct: 121  MFVWPWIGVVVNIPTEYKDGRNVGESGSKLRDQLTRRGFNPTRVRPLWNYQGHSGTALVE 180

Query: 1569 FQKDWFGFNNAMSFEKAYEANHHGKRSWHSKNDLKSDLYGWVARADDYNSDAIFGENLRK 1390
            F KDW GF NAM++EKAYEA+HHGK+ W   +   SDLY W+ARADDY +  I GENLRK
Sbjct: 181  FNKDWSGFGNAMAYEKAYEADHHGKKDWKVNHSKNSDLYAWIARADDYKAINIIGENLRK 240

Query: 1389 IGDLRTISDIMEDEARKTQKLVGNLTNVIEAKKMHLLEMENKFKETENSLSQLITEKDKL 1210
            +GDLRTISDIME+EARKT KLV NLTNVIE KK+HL EM +KFKET  SL QL+ EKDKL
Sbjct: 241  VGDLRTISDIMEEEARKTNKLVSNLTNVIEVKKLHLKEMADKFKETSQSLKQLVEEKDKL 300

Query: 1209 LQEYNDEIKKIQSSAQDHFQKIFNDHEKLKLQLETQKRELEHRGHELAKRETHNENERRK 1030
             Q YN+EI+KIQSSA+DHFQKIFNDHEKLKLQLE+QK+ELE RG EL KRE  NE++R+K
Sbjct: 301  HQAYNEEIRKIQSSARDHFQKIFNDHEKLKLQLESQKKELELRGRELEKREAKNESDRKK 360

Query: 1029 LSEELEENAVKNCSLQAATNEQRKADEKVMKLAEEQKKQKENLHKKIIQLEKQLDAKQAV 850
            LSE+LE+NA  N SL AA  EQRK DEKV+KLAEEQK+QKE+LHK+IIQLEKQLDAKQAV
Sbjct: 361  LSEDLEQNATLNTSLSAAAEEQRKVDEKVLKLAEEQKRQKEDLHKRIIQLEKQLDAKQAV 420

Query: 849  ELEIEQLRGKLNVMRHMEDEGDLEVLEKVDSLLKSLREKEGELEDVLALNQTLVVQERNS 670
            ELEIEQLRG LNVM+H+EDEGD EVL+KVD+LLKSLREKE E + + ALNQTL+V+ERNS
Sbjct: 421  ELEIEQLRGSLNVMKHIEDEGDQEVLKKVDTLLKSLREKEEEYDGLEALNQTLIVKERNS 480

Query: 669  NDELQDARKELVNGLKELSTNGQIGVKRMGELNSKPFHEAMKRKYNEVEADERATELCSL 490
            NDELQDARKELVNGLKEL   G IGVKRMGEL+++PFHEAMKR YNE EADERATELCSL
Sbjct: 481  NDELQDARKELVNGLKELPRVGPIGVKRMGELDNRPFHEAMKRNYNESEADERATELCSL 540

Query: 489  WEEYLRDPEWHPIKVVEINGKHQAVIDQDDEKLKDLRNNYGDEMYDAVTSALTEVNEYNP 310
            WEEYLRDP WHPIKVV INGK + VID++DEKLKDL+ NYG+E+  AVT+AL EVN+YNP
Sbjct: 541  WEEYLRDPGWHPIKVVMINGKPENVIDEEDEKLKDLKRNYGEEVCKAVTAALMEVNDYNP 600

Query: 309  SGRYIISELWNYSEGRKATLQEGVAHIMKLWRTYKRKRGMD 187
            SGRYIISELWNY+  +KATL+EGV  ++ +W   K+KRG D
Sbjct: 601  SGRYIISELWNYAVNKKATLEEGVTVLLNMW---KKKRGSD 638


>ref|XP_006362467.1| PREDICTED: myosin-2 heavy chain-like isoform X2 [Solanum tuberosum]
          Length = 638

 Score =  942 bits (2436), Expect = 0.0
 Identities = 467/641 (72%), Positives = 538/641 (83%), Gaps = 5/641 (0%)
 Frame = -3

Query: 2094 MEYSSGDDTDVSDSXXXXXXXXXXE-LKNGNHQYKVANDAYTCPFCPTKRKRDFLYKELL 1918
            M++SSG+DTD+S+S          E LK+G+H  K+++ AYTCP+CP KRKRDFLYKEL+
Sbjct: 1    MDHSSGEDTDISESEIEEYGEKSYEDLKSGSHSLKISDVAYTCPYCPKKRKRDFLYKELI 60

Query: 1917 QHAMGVGKCNSQKRTRKDKANHLALAKHLENDVVVDAGPS----QQATMVDPLADHDRDE 1750
            QHA GVG C+S KRT ++KANHL LAK+LE D  V A  S    +  +  DPLADHDRDE
Sbjct: 61   QHASGVGSCSSNKRTAREKANHLGLAKYLETDAAVAADSSKPDAEPDSQTDPLADHDRDE 120

Query: 1749 MFVWPWIGIVVNIPTEFKDGRYVGESGTKLRDQLTRRGFNPTRVRPLWNYQGHSGTALVE 1570
            MFVWPWIGIVVNIPTEFKDGR VGESG+KLRDQLTRRGFNPTRVRPLWNYQGHSGTALVE
Sbjct: 121  MFVWPWIGIVVNIPTEFKDGRNVGESGSKLRDQLTRRGFNPTRVRPLWNYQGHSGTALVE 180

Query: 1569 FQKDWFGFNNAMSFEKAYEANHHGKRSWHSKNDLKSDLYGWVARADDYNSDAIFGENLRK 1390
            F KDW GF NAM++EKAYEA+HHGK+ W   +   SDLY W+ARADDY +  I GENLRK
Sbjct: 181  FNKDWSGFGNAMAYEKAYEADHHGKKDWKVNHCKNSDLYAWIARADDYKAINIIGENLRK 240

Query: 1389 IGDLRTISDIMEDEARKTQKLVGNLTNVIEAKKMHLLEMENKFKETENSLSQLITEKDKL 1210
            +GDLRTISDIME+EARKT KLV NLTNVIE KK+HL EM +KFKET  SL QL+ EKDKL
Sbjct: 241  VGDLRTISDIMEEEARKTNKLVSNLTNVIEVKKLHLKEMADKFKETSQSLKQLVEEKDKL 300

Query: 1209 LQEYNDEIKKIQSSAQDHFQKIFNDHEKLKLQLETQKRELEHRGHELAKRETHNENERRK 1030
             Q YN+EI+KIQSSA+DHFQKIFNDHEKLKLQLE+QK+ELE RG EL KRE  NE++R+K
Sbjct: 301  HQAYNEEIRKIQSSARDHFQKIFNDHEKLKLQLESQKKELELRGRELEKREAKNESDRKK 360

Query: 1029 LSEELEENAVKNCSLQAATNEQRKADEKVMKLAEEQKKQKENLHKKIIQLEKQLDAKQAV 850
            LSE+LE+NA  N SL AA  EQRK DEKV+KLAEEQK+QKE+LHK+IIQLEKQLDAKQAV
Sbjct: 361  LSEDLEQNATLNTSLSAAAEEQRKVDEKVLKLAEEQKRQKEDLHKRIIQLEKQLDAKQAV 420

Query: 849  ELEIEQLRGKLNVMRHMEDEGDLEVLEKVDSLLKSLREKEGELEDVLALNQTLVVQERNS 670
            ELEIEQLRG LNVM+H+EDEGD EVL+KVD+LLKSLREKE E + + ALNQTL+V+ERNS
Sbjct: 421  ELEIEQLRGSLNVMKHIEDEGDQEVLKKVDTLLKSLREKEEEYDGLEALNQTLIVKERNS 480

Query: 669  NDELQDARKELVNGLKELSTNGQIGVKRMGELNSKPFHEAMKRKYNEVEADERATELCSL 490
            NDELQDARKELVNGLKEL   G IGVKRMGEL+++PFHEAMKR YNE EADERATELCSL
Sbjct: 481  NDELQDARKELVNGLKELPRVGPIGVKRMGELDNRPFHEAMKRSYNESEADERATELCSL 540

Query: 489  WEEYLRDPEWHPIKVVEINGKHQAVIDQDDEKLKDLRNNYGDEMYDAVTSALTEVNEYNP 310
            WEEYLRDP WHPIKVV +NGK + VID +DEKLKDL+ NYG+E+  AVT+AL EVN+YNP
Sbjct: 541  WEEYLRDPGWHPIKVVMVNGKPENVIDDEDEKLKDLKKNYGEEVCKAVTAALMEVNDYNP 600

Query: 309  SGRYIISELWNYSEGRKATLQEGVAHIMKLWRTYKRKRGMD 187
            SGRYIISELWNY+  +KATL+EGV  ++ +W   K+KRG D
Sbjct: 601  SGRYIISELWNYAVNKKATLEEGVTVLLNMW---KKKRGSD 638


>emb|CDP00105.1| unnamed protein product [Coffea canephora]
          Length = 658

 Score =  903 bits (2334), Expect = 0.0
 Identities = 436/644 (67%), Positives = 529/644 (82%), Gaps = 2/644 (0%)
 Frame = -3

Query: 2124 FIGLRLMGSSMEYSSGDDTDVSDSXXXXXXXXXXE-LKNGNHQYKVANDAYTCPFCPTKR 1948
            F+ L +MG S+++SSG+DTD+S+S          + LK+G  + KV++ A+TCP+CP KR
Sbjct: 12   FVSLNVMGGSVDHSSGEDTDISESEIEEYEDKSYQALKSGKKEVKVSDGAFTCPYCPKKR 71

Query: 1947 KRDFLYKELLQHAMGVGKCNSQKRTRKDKANHLALAKHLENDVVVDAGPSQQATMVDPLA 1768
            K DFLYK++LQHA GVG   S+KR+ +DKANHLALAK+LE D+ V AGPSQ    VDPLA
Sbjct: 72   KADFLYKDILQHASGVGSSTSKKRSARDKANHLALAKYLEKDMSVTAGPSQATVEVDPLA 131

Query: 1767 DHDRDEMFVWPWIGIVVNIPTEFKDGRYVGESGTKLRDQLTRRGFNPTRVRPLWNYQGHS 1588
            DHDRDEMFVWPWIGIVVN+ T+FKDGRYVG+SG+ LRD+LT RGFNPTRVRPLWNYQGHS
Sbjct: 132  DHDRDEMFVWPWIGIVVNLATDFKDGRYVGKSGSNLRDELTSRGFNPTRVRPLWNYQGHS 191

Query: 1587 GTALVEFQKDWFGFNNAMSFEKAYEANHHGKRSWHSKNDLKSDLYGWVARADDYNSDAIF 1408
            GTALVEF KDWFGF NAMSFEKAYEA+ HGKR W + N LKSDLY WVAR DDY SD I 
Sbjct: 192  GTALVEFTKDWFGFKNAMSFEKAYEADRHGKRDWMANNTLKSDLYAWVARVDDYKSDGII 251

Query: 1407 GENLRKIGDLRTISDIMEDEARKTQKLVGNLTNVIEAKKMHLLEMENKFKETENSLSQLI 1228
            GENLRKIGD+RTISDI+E+E RKT KLV NLTNVIE K M + EME+KF+ETE+SL  LI
Sbjct: 252  GENLRKIGDIRTISDIIEEEVRKTSKLVNNLTNVIEVKNMQVKEMESKFQETESSLGILI 311

Query: 1227 TEKDKLLQEYNDEIKKIQSSAQDHFQKIFNDHEKLKLQLETQKRELEHRGHELAKRETHN 1048
             EK+KL Q YN+E+KK+QS+A++HFQKI +DHEK+K QLE QK+ELE RG +L +RE  N
Sbjct: 312  EEKNKLHQAYNEELKKLQSTAREHFQKILSDHEKMKSQLENQKKELELRGKQLEQREAKN 371

Query: 1047 ENERRKLSEELEENAVKNCSLQAATNEQRKADEKVMKLAEEQKKQKENLHKKIIQLEKQL 868
            E++R+KL EELEENA +N  + AA+ EQRK D KV+KLAEEQ+KQKE L  +IIQLEK L
Sbjct: 372  ESDRKKLLEELEENAAQNSLIDAASEEQRKVDAKVLKLAEEQRKQKEELLNRIIQLEKDL 431

Query: 867  DAKQAVELEIEQLRGKLNVMRHMEDEGDLEVLEKVDSLLKSLREKEGELEDVLALNQTLV 688
            D KQA++LEIE+L+G +NVM+HM DEGDLEVL+K DSL K LREKEGELED   LNQTL+
Sbjct: 432  DTKQALQLEIERLKGTINVMKHMGDEGDLEVLQKADSLNKLLREKEGELEDFDRLNQTLI 491

Query: 687  VQERNSNDELQDARKELVNGLKELSTNGQIGVKRMGELNSKPFHEAMKRKY-NEVEADER 511
            V+ER SN+ELQDARKEL+NGLKE   +  IG+KRMGEL++KPF+EAMK KY +E +A+ER
Sbjct: 492  VKERKSNEELQDARKELINGLKEYPNSAYIGIKRMGELDNKPFYEAMKLKYKHEADAEER 551

Query: 510  ATELCSLWEEYLRDPEWHPIKVVEINGKHQAVIDQDDEKLKDLRNNYGDEMYDAVTSALT 331
            A+ELCSLW+EYL+DPEWHP KV+ +NGKH+ VID+DDEKL DL+  YG E+Y AVT+A  
Sbjct: 552  ASELCSLWDEYLKDPEWHPTKVILVNGKHEEVIDEDDEKLNDLKEKYGHEVYKAVTTAFI 611

Query: 330  EVNEYNPSGRYIISELWNYSEGRKATLQEGVAHIMKLWRTYKRK 199
            E+NEYN SGRYI SELWNY+EGRKATL+EG A ++K W+ +  K
Sbjct: 612  EINEYNASGRYITSELWNYAEGRKATLKEGAAFLLKKWKLHMEK 655


>ref|XP_002278500.1| PREDICTED: protein INVOLVED IN DE NOVO 2 isoform X2 [Vitis vinifera]
            gi|296086223|emb|CBI31664.3| unnamed protein product
            [Vitis vinifera]
          Length = 641

 Score =  871 bits (2250), Expect = 0.0
 Identities = 433/644 (67%), Positives = 517/644 (80%), Gaps = 4/644 (0%)
 Frame = -3

Query: 2106 MGSSMEYSSGDDTDVSDSXXXXXXXXXXELKNGNHQYKVANDAYTCPFCPTKRKRDFLYK 1927
            MG+S + S  +D   S+           ELKNGNH  K + ++++CP+C  K+KRD+LYK
Sbjct: 1    MGNSSDDSLEEDISESE-IEEYEDKSYEELKNGNHHVKTSGESFSCPYCSKKKKRDYLYK 59

Query: 1926 ELLQHAMGVGKCNSQKRTRKDKANHLALAKHLENDVV-VDA---GPSQQATMVDPLADHD 1759
            ELLQHA+GVGK +S+KR+ KDKANHLALAK+LE D++ VD    GPS+  +      DHD
Sbjct: 60   ELLQHAIGVGKSSSEKRSMKDKANHLALAKYLEKDIMDVDGPSGGPSKPKSEAPLGCDHD 119

Query: 1758 RDEMFVWPWIGIVVNIPTEFKDGRYVGESGTKLRDQLTRRGFNPTRVRPLWNYQGHSGTA 1579
              EMFVWPW G+VVNIPTE +DGRY+G SG+KLRD+LT RGFNP RV PLWNY+GHSG A
Sbjct: 120  --EMFVWPWTGVVVNIPTELRDGRYIGGSGSKLRDELTARGFNPIRVHPLWNYRGHSGCA 177

Query: 1578 LVEFQKDWFGFNNAMSFEKAYEANHHGKRSWHSKNDLKSDLYGWVARADDYNSDAIFGEN 1399
             VEF KDW G +NAMSFEK YEA+HHGK+ W + N   S LY WVARADDY + +I GE+
Sbjct: 178  AVEFNKDWPGLHNAMSFEKEYEADHHGKKDWIASNGRGSGLYAWVARADDYKAASIIGEH 237

Query: 1398 LRKIGDLRTISDIMEDEARKTQKLVGNLTNVIEAKKMHLLEMENKFKETENSLSQLITEK 1219
            LRKIGDL+TISDIM +EARK  KLV NLTNVIE K  HL EM+    E   SL+ LI EK
Sbjct: 238  LRKIGDLKTISDIMAEEARKQSKLVSNLTNVIEVKNKHLEEMKRIVSEASVSLNNLIEEK 297

Query: 1218 DKLLQEYNDEIKKIQSSAQDHFQKIFNDHEKLKLQLETQKRELEHRGHELAKRETHNENE 1039
            DKL Q YN+EI+KIQ SA+DHFQKIFNDHEKLKLQLE+ KREL+ RG EL KRE HNENE
Sbjct: 298  DKLHQAYNEEIRKIQMSARDHFQKIFNDHEKLKLQLESHKRELDLRGRELEKREAHNENE 357

Query: 1038 RRKLSEELEENAVKNCSLQAATNEQRKADEKVMKLAEEQKKQKENLHKKIIQLEKQLDAK 859
            R+KL EE+E+N +KN SLQ A  EQ+KADEKV KLAE+QKKQKENLH++IIQLEKQLDAK
Sbjct: 358  RKKLCEEIEKNVMKNSSLQLAAVEQQKADEKVYKLAEDQKKQKENLHRRIIQLEKQLDAK 417

Query: 858  QAVELEIEQLRGKLNVMRHMEDEGDLEVLEKVDSLLKSLREKEGELEDVLALNQTLVVQE 679
            QA+ELEIE+LRG LNVM+HM D+GD+E+L+K+DS+LK LREKEGELED+ ALNQTL+V+E
Sbjct: 418  QALELEIERLRGTLNVMKHMGDDGDMEILKKMDSMLKVLREKEGELEDLEALNQTLIVKE 477

Query: 678  RNSNDELQDARKELVNGLKELSTNGQIGVKRMGELNSKPFHEAMKRKYNEVEADERATEL 499
            R SNDELQ+ARKEL++GLKE+S    IGVKRMGEL++KPFHEA KRKY   E +ERA EL
Sbjct: 478  RKSNDELQEARKELISGLKEMSGRAHIGVKRMGELDNKPFHEACKRKYGVAEPEERALEL 537

Query: 498  CSLWEEYLRDPEWHPIKVVEINGKHQAVIDQDDEKLKDLRNNYGDEMYDAVTSALTEVNE 319
            CSLWEE+LRD EWHP KVVEI GKHQ VID +DEKL+ +R+  GDE+Y AV +AL E+NE
Sbjct: 538  CSLWEEFLRDSEWHPFKVVEIEGKHQGVIDDNDEKLRSVRDELGDEVYTAVRTALIEINE 597

Query: 318  YNPSGRYIISELWNYSEGRKATLQEGVAHIMKLWRTYKRKRGMD 187
            YNPSGRYIISELWNY EGRKATLQEGVA I++ W+TY+ KRG++
Sbjct: 598  YNPSGRYIISELWNYGEGRKATLQEGVAFILRKWKTYRDKRGIN 641


>ref|XP_007009298.1| XH/XS domain-containing protein, putative isoform 1 [Theobroma cacao]
            gi|508726211|gb|EOY18108.1| XH/XS domain-containing
            protein, putative isoform 1 [Theobroma cacao]
          Length = 640

 Score =  867 bits (2240), Expect = 0.0
 Identities = 426/638 (66%), Positives = 514/638 (80%), Gaps = 4/638 (0%)
 Frame = -3

Query: 2094 MEYSSGDDTDVSDSXXXXXXXXXXE-LKNGNHQYKVANDAYTCPFCPTKRKRDFLYKELL 1918
            M+ SSG+D+D+S+S          E LKNG H  KV+ + YTCP+CP K+KRD+LYKELL
Sbjct: 1    MDISSGEDSDISESEMEEYEDKSYEKLKNGKHNIKVSEETYTCPYCPKKKKRDYLYKELL 60

Query: 1917 QHAMGVGKCNSQKRTRKDKANHLALAKHLENDVVVDAGPSQQATMVDPLADHDRDEMFVW 1738
            QHA GVG  NS+KR+ K+KANHLAL K+LE D+V     S+ A   DPL+ +D DE  VW
Sbjct: 61   QHASGVGNSNSEKRSAKEKANHLALVKYLEKDLVAVGSSSKTAAEEDPLSGYDHDEKIVW 120

Query: 1737 PWIGIVVNIPTE-FKDGRYVGESGTKLRDQLTRRGFNPTRVRPLWNYQGHSGTALVEFQK 1561
            PW GIVVNIPT   +DGR VGESG+KLRD+L RRGFNP RV PLWNY+GHSGTA+VEF K
Sbjct: 121  PWTGIVVNIPTRRSEDGRSVGESGSKLRDELIRRGFNPIRVLPLWNYRGHSGTAVVEFHK 180

Query: 1560 DWFGFNNAMSFEKAYEANHHGKRSWHSKNDLKSDLYGWVARADDYNSDAIFGENLRKIGD 1381
            DW G +NA+SFEKAY+A+HHGK+ W + ND+K  LY WVARADDY S  I GENLRK  D
Sbjct: 181  DWPGLHNALSFEKAYQADHHGKKEWCANNDVKFGLYAWVARADDYKSSGIIGENLRKTSD 240

Query: 1380 LRTISDIMEDEARKTQKLVGNLTNVIEAKKMHLLEMENKFKETENSLSQLITEKDKLLQE 1201
            L+TIS IME+EARK  KLV NLTN+IE K  H+ EME +  ET  SL  L+ EKD LLQ 
Sbjct: 241  LKTISGIMEEEARKQDKLVSNLTNIIETKNKHIKEMEARCSETSKSLEVLMDEKDNLLQA 300

Query: 1200 YNDEIKKIQSSAQDHFQKIFNDHEKLKLQLETQKRELEHRGHELAKRETHNENERRKLSE 1021
            YN+EIKKIQ SA++HF +IFNDHEKLK QLE+ KR+LE RG EL KRE  NE+ER+KL+E
Sbjct: 301  YNEEIKKIQLSAREHFLRIFNDHEKLKSQLESHKRDLELRGVELEKREALNESERKKLAE 360

Query: 1020 ELEENAVKNCSLQAATNEQRKADEKVMKLAEEQKKQKENLHKKIIQLEKQLDAKQAVELE 841
            ELE+NAV+N +LQ A+ EQ+KADE VMKLAE+QK+QKE LH +IIQLEKQLD KQA+ELE
Sbjct: 361  ELEQNAVQNSALQLASLEQKKADENVMKLAEDQKRQKEELHNRIIQLEKQLDQKQALELE 420

Query: 840  IEQLRGKLNVMRHMEDEGDLEVLEKVDSLLKSLREKEGELEDVLALNQTLVVQERNSNDE 661
            IEQLRG LNV+RHM DE D+EVL K+++ LK LREKEGELEDV ALNQTL+V+ER SNDE
Sbjct: 421  IEQLRGSLNVIRHMGDEDDIEVLRKMEATLKELREKEGELEDVEALNQTLIVRERKSNDE 480

Query: 660  LQDARKELVNGLKELSTNGQIGVKRMGELNSKPFHEAMKRKYNEVEADERATELCSLWEE 481
            LQ+ARKEL+NGLKE+S+   IGVKRMGEL+SKPF E MKR+YNE +A+ERA+ELCSLW+E
Sbjct: 481  LQEARKELINGLKEISSRAHIGVKRMGELDSKPFFEVMKRRYNEEQAEERASELCSLWDE 540

Query: 480  YLRDPEWHPIKVVEING--KHQAVIDQDDEKLKDLRNNYGDEMYDAVTSALTEVNEYNPS 307
            YL+DP+WHP K +++ G  ++Q VI+ +DEKL+DLRN  G+E+Y  VTSA+ E+NEYNPS
Sbjct: 541  YLKDPDWHPFKRIKLEGEEEYQEVINDEDEKLRDLRNQMGNEVYKVVTSAIKEINEYNPS 600

Query: 306  GRYIISELWNYSEGRKATLQEGVAHIMKLWRTYKRKRG 193
            GRYIISELWNY EGRKATLQEGV +++KLW T KRKRG
Sbjct: 601  GRYIISELWNYGEGRKATLQEGVIYLLKLWNTAKRKRG 638


>ref|XP_010658558.1| PREDICTED: protein INVOLVED IN DE NOVO 2 isoform X1 [Vitis vinifera]
          Length = 656

 Score =  861 bits (2224), Expect = 0.0
 Identities = 433/659 (65%), Positives = 517/659 (78%), Gaps = 19/659 (2%)
 Frame = -3

Query: 2106 MGSSMEYSSGDDTDVSDSXXXXXXXXXXELKNGNHQYKVANDAYTCPFCPTKRKRDFLYK 1927
            MG+S + S  +D   S+           ELKNGNH  K + ++++CP+C  K+KRD+LYK
Sbjct: 1    MGNSSDDSLEEDISESE-IEEYEDKSYEELKNGNHHVKTSGESFSCPYCSKKKKRDYLYK 59

Query: 1926 ELLQHAMGVGKCNSQKRTRKDKANHLALAKHLENDVV-VDA---GPSQQATMVDPLADHD 1759
            ELLQHA+GVGK +S+KR+ KDKANHLALAK+LE D++ VD    GPS+  +      DHD
Sbjct: 60   ELLQHAIGVGKSSSEKRSMKDKANHLALAKYLEKDIMDVDGPSGGPSKPKSEAPLGCDHD 119

Query: 1758 RDEMFVWPWIGIVVNIPTEFKDGRYVGESGTKLRDQLTRRGFNPTRVRPLWNYQGHSGTA 1579
              EMFVWPW G+VVNIPTE +DGRY+G SG+KLRD+LT RGFNP RV PLWNY+GHSG A
Sbjct: 120  --EMFVWPWTGVVVNIPTELRDGRYIGGSGSKLRDELTARGFNPIRVHPLWNYRGHSGCA 177

Query: 1578 LVEFQKDWFGFNNAMSFEKAYEANHHGKRSWHSKNDLKSDLYGWVARADDYNSDAIFGEN 1399
             VEF KDW G +NAMSFEK YEA+HHGK+ W + N   S LY WVARADDY + +I GE+
Sbjct: 178  AVEFNKDWPGLHNAMSFEKEYEADHHGKKDWIASNGRGSGLYAWVARADDYKAASIIGEH 237

Query: 1398 LRKIGDLRTISDIMEDEARKTQKLVGNLTNVIEAKKMHLLEMENKFKETENSLSQLITEK 1219
            LRKIGDL+TISDIM +EARK  KLV NLTNVIE K  HL EM+    E   SL+ LI EK
Sbjct: 238  LRKIGDLKTISDIMAEEARKQSKLVSNLTNVIEVKNKHLEEMKRIVSEASVSLNNLIEEK 297

Query: 1218 DKLLQEYNDEIKKIQSSAQDHFQKIFNDHEKLKLQLETQKRELEHRGHELAKRETHNENE 1039
            DKL Q YN+EI+KIQ SA+DHFQKIFNDHEKLKLQLE+ KREL+ RG EL KRE HNENE
Sbjct: 298  DKLHQAYNEEIRKIQMSARDHFQKIFNDHEKLKLQLESHKRELDLRGRELEKREAHNENE 357

Query: 1038 RRKLSEELEE---------------NAVKNCSLQAATNEQRKADEKVMKLAEEQKKQKEN 904
            R+KL EE+E+               N +KN SLQ A  EQ+KADEKV KLAE+QKKQKEN
Sbjct: 358  RKKLCEEIEKICVNCQVVTLLFCLHNVMKNSSLQLAAVEQQKADEKVYKLAEDQKKQKEN 417

Query: 903  LHKKIIQLEKQLDAKQAVELEIEQLRGKLNVMRHMEDEGDLEVLEKVDSLLKSLREKEGE 724
            LH++IIQLEKQLDAKQA+ELEIE+LRG LNVM+HM D+GD+E+L+K+DS+LK LREKEGE
Sbjct: 418  LHRRIIQLEKQLDAKQALELEIERLRGTLNVMKHMGDDGDMEILKKMDSMLKVLREKEGE 477

Query: 723  LEDVLALNQTLVVQERNSNDELQDARKELVNGLKELSTNGQIGVKRMGELNSKPFHEAMK 544
            LED+ ALNQTL+V+ER SNDELQ+ARKEL++GLKE+S    IGVKRMGEL++KPFHEA K
Sbjct: 478  LEDLEALNQTLIVKERKSNDELQEARKELISGLKEMSGRAHIGVKRMGELDNKPFHEACK 537

Query: 543  RKYNEVEADERATELCSLWEEYLRDPEWHPIKVVEINGKHQAVIDQDDEKLKDLRNNYGD 364
            RKY   E +ERA ELCSLWEE+LRD EWHP KVVEI GKHQ VID +DEKL+ +R+  GD
Sbjct: 538  RKYGVAEPEERALELCSLWEEFLRDSEWHPFKVVEIEGKHQGVIDDNDEKLRSVRDELGD 597

Query: 363  EMYDAVTSALTEVNEYNPSGRYIISELWNYSEGRKATLQEGVAHIMKLWRTYKRKRGMD 187
            E+Y AV +AL E+NEYNPSGRYIISELWNY EGRKATLQEGVA I++ W+TY+ KRG++
Sbjct: 598  EVYTAVRTALIEINEYNPSGRYIISELWNYGEGRKATLQEGVAFILRKWKTYRDKRGIN 656


>ref|XP_007009299.1| XH/XS domain-containing protein, putative isoform 2 [Theobroma cacao]
            gi|508726212|gb|EOY18109.1| XH/XS domain-containing
            protein, putative isoform 2 [Theobroma cacao]
          Length = 638

 Score =  859 bits (2220), Expect = 0.0
 Identities = 425/638 (66%), Positives = 512/638 (80%), Gaps = 4/638 (0%)
 Frame = -3

Query: 2094 MEYSSGDDTDVSDSXXXXXXXXXXE-LKNGNHQYKVANDAYTCPFCPTKRKRDFLYKELL 1918
            M+ SSG+D+D+S+S          E LKNG H  KV+ + YTCP+CP K+KRD+LYKELL
Sbjct: 1    MDISSGEDSDISESEMEEYEDKSYEKLKNGKHNIKVSEETYTCPYCPKKKKRDYLYKELL 60

Query: 1917 QHAMGVGKCNSQKRTRKDKANHLALAKHLENDVVVDAGPSQQATMVDPLADHDRDEMFVW 1738
            QHA GVG  NS+KR+ K+KANHLAL K+LE D+V     S+ A   DPL+ +D DE  VW
Sbjct: 61   QHASGVGNSNSEKRSAKEKANHLALVKYLEKDLVAVGSSSKTAAEEDPLSGYDHDEKIVW 120

Query: 1737 PWIGIVVNIPTE-FKDGRYVGESGTKLRDQLTRRGFNPTRVRPLWNYQGHSGTALVEFQK 1561
            PW GIVVNIPT   +DGR VGESG+KLRD+L RRGFNP RV PLWNY+GHSGTA+VEF K
Sbjct: 121  PWTGIVVNIPTRRSEDGRSVGESGSKLRDELIRRGFNPIRVLPLWNYRGHSGTAVVEFHK 180

Query: 1560 DWFGFNNAMSFEKAYEANHHGKRSWHSKNDLKSDLYGWVARADDYNSDAIFGENLRKIGD 1381
            DW G +NA+SFEKAY+A+HHGK+ W + ND+K  LY WVARADDY S  I GENLRK  D
Sbjct: 181  DWPGLHNALSFEKAYQADHHGKKEWCANNDVKFGLYAWVARADDYKSSGIIGENLRKTSD 240

Query: 1380 LRTISDIMEDEARKTQKLVGNLTNVIEAKKMHLLEMENKFKETENSLSQLITEKDKLLQE 1201
            L+TIS IME+EARK  KLV NLTN+IE K  H+ EME +  ET  SL  L+ EKD LLQ 
Sbjct: 241  LKTISGIMEEEARKQDKLVSNLTNIIETKNKHIKEMEARCSETSKSLEVLMDEKDNLLQA 300

Query: 1200 YNDEIKKIQSSAQDHFQKIFNDHEKLKLQLETQKRELEHRGHELAKRETHNENERRKLSE 1021
            YN+EIKKIQ SA++HF +IFNDHEKLK QLE+ KR+LE RG EL KRE  NE+ER+KL+E
Sbjct: 301  YNEEIKKIQLSAREHFLRIFNDHEKLKSQLESHKRDLELRGVELEKREALNESERKKLAE 360

Query: 1020 ELEENAVKNCSLQAATNEQRKADEKVMKLAEEQKKQKENLHKKIIQLEKQLDAKQAVELE 841
            ELE+NAV+N +LQ A+ EQ+KADE VMKLAE+QK  KE LH +IIQLEKQLD KQA+ELE
Sbjct: 361  ELEQNAVQNSALQLASLEQKKADENVMKLAEDQK--KEELHNRIIQLEKQLDQKQALELE 418

Query: 840  IEQLRGKLNVMRHMEDEGDLEVLEKVDSLLKSLREKEGELEDVLALNQTLVVQERNSNDE 661
            IEQLRG LNV+RHM DE D+EVL K+++ LK LREKEGELEDV ALNQTL+V+ER SNDE
Sbjct: 419  IEQLRGSLNVIRHMGDEDDIEVLRKMEATLKELREKEGELEDVEALNQTLIVRERKSNDE 478

Query: 660  LQDARKELVNGLKELSTNGQIGVKRMGELNSKPFHEAMKRKYNEVEADERATELCSLWEE 481
            LQ+ARKEL+NGLKE+S+   IGVKRMGEL+SKPF E MKR+YNE +A+ERA+ELCSLW+E
Sbjct: 479  LQEARKELINGLKEISSRAHIGVKRMGELDSKPFFEVMKRRYNEEQAEERASELCSLWDE 538

Query: 480  YLRDPEWHPIKVVEING--KHQAVIDQDDEKLKDLRNNYGDEMYDAVTSALTEVNEYNPS 307
            YL+DP+WHP K +++ G  ++Q VI+ +DEKL+DLRN  G+E+Y  VTSA+ E+NEYNPS
Sbjct: 539  YLKDPDWHPFKRIKLEGEEEYQEVINDEDEKLRDLRNQMGNEVYKVVTSAIKEINEYNPS 598

Query: 306  GRYIISELWNYSEGRKATLQEGVAHIMKLWRTYKRKRG 193
            GRYIISELWNY EGRKATLQEGV +++KLW T KRKRG
Sbjct: 599  GRYIISELWNYGEGRKATLQEGVIYLLKLWNTAKRKRG 636


>gb|KHG15173.1| Forkhead-associated domain-containing 1 [Gossypium arboreum]
          Length = 645

 Score =  857 bits (2214), Expect = 0.0
 Identities = 424/644 (65%), Positives = 517/644 (80%), Gaps = 9/644 (1%)
 Frame = -3

Query: 2094 MEYSSGDDTDVSDSXXXXXXXXXXE-LKNGNHQYKVANDAYTCPFCPTKRKRDFLYKELL 1918
            M+ SSG+D+D+S+S          E LKNGN++ KV+N+ YTCPFCP K+K+DFLYK+LL
Sbjct: 1    MDISSGEDSDISESEMEEYEDKYYEKLKNGNYKIKVSNEKYTCPFCPKKKKQDFLYKDLL 60

Query: 1917 QHAMGVGKCNSQKRTRKDKANHLALAKHLENDVVVDAGPSQQ-----ATMVDPLADHDRD 1753
            QHA GVGK NS KR+ ++KANHLAL K+LEND+    G S       A + DPL+  D D
Sbjct: 61   QHASGVGKSNSDKRSAREKANHLALFKYLENDLRGTVGSSSSSAAAAAEVEDPLSGCDHD 120

Query: 1752 EMFVWPWIGIVVNIPTE-FKDGRYVGESGTKLRDQLTRRGFNPTRVRPLWNYQGHSGTAL 1576
            E  VWPW G+VVNIPT+  +DGR VG SG+KLRD+L RRGFNP RV PLWNY+GHSGTA+
Sbjct: 121  EKIVWPWTGVVVNIPTQKLEDGRSVGGSGSKLRDELIRRGFNPLRVHPLWNYRGHSGTAV 180

Query: 1575 VEFQKDWFGFNNAMSFEKAYEANHHGKRSWHSKNDLKSDLYGWVARADDYNSDAIFGENL 1396
            VEF+KDW G +NA+SFEKAYEA+HHGK+ W + N +K  LY WVARADDY S  I GE+L
Sbjct: 181  VEFRKDWPGLHNALSFEKAYEADHHGKKDWFANNGVKEGLYAWVARADDYKSSTIIGEHL 240

Query: 1395 RKIGDLRTISDIMEDEARKTQKLVGNLTNVIEAKKMHLLEMENKFKETENSLSQLITEKD 1216
            RKIGDL+T+S++ME+EARK  +LV NLTN+IE K  H+ EME +  ET  SL  L+ EKD
Sbjct: 241  RKIGDLKTVSELMEEEARKQDRLVTNLTNIIETKNKHIQEMEQRCSETSKSLEALMEEKD 300

Query: 1215 KLLQEYNDEIKKIQSSAQDHFQKIFNDHEKLKLQLETQKRELEHRGHELAKRETHNENER 1036
             L Q YN+EIKKIQ SA+DHFQ+IF+DHEKLK QLE+ K++LE RG EL KRE  NE+ER
Sbjct: 301  NLSQAYNEEIKKIQVSARDHFQRIFSDHEKLKSQLESHKKDLELRGVELEKREALNESER 360

Query: 1035 RKLSEELEENAVKNCSLQAATNEQRKADEKVMKLAEEQKKQKENLHKKIIQLEKQLDAKQ 856
            +KL+EELEENAV+N +L  A  EQ++ADE VMKLAE+QK+QKE LH +IIQLEK+LD KQ
Sbjct: 361  KKLAEELEENAVQNSALHLAALEQKRADENVMKLAEDQKRQKEELHNRIIQLEKKLDQKQ 420

Query: 855  AVELEIEQLRGKLNVMRHMEDEGDLEVLEKVDSLLKSLREKEGELEDVLALNQTLVVQER 676
            A+ELEIEQLRG LNV+RHM DE D+EVLEKVD+ LK LREKE ELED+ ALNQTL+V+ER
Sbjct: 421  ALELEIEQLRGSLNVIRHMGDEDDMEVLEKVDASLKELREKEAELEDLEALNQTLIVRER 480

Query: 675  NSNDELQDARKELVNGLKELSTNGQIGVKRMGELNSKPFHEAMKRKYNEVEADERATELC 496
             SNDELQDARKEL+NGLKE+ST  QIGVKRMGEL+SKPF EAMKR+YNE  A+ERA+E+C
Sbjct: 481  KSNDELQDARKELINGLKEISTRSQIGVKRMGELDSKPFLEAMKRRYNEELAEERASEVC 540

Query: 495  SLWEEYLRDPEWHPIKVVEING--KHQAVIDQDDEKLKDLRNNYGDEMYDAVTSALTEVN 322
            SLWEEYL+DP+WHP K +++ G  ++Q VID +DEKL+DL++  G E Y +VTSA+ E+N
Sbjct: 541  SLWEEYLKDPDWHPFKRIKLEGGEEYQEVIDDEDEKLRDLKDQMGIEAYKSVTSAIKEIN 600

Query: 321  EYNPSGRYIISELWNYSEGRKATLQEGVAHIMKLWRTYKRKRGM 190
            EYNPSGRYIISELWNY EGRKATL+EGV  ++KLW   KRKRGM
Sbjct: 601  EYNPSGRYIISELWNYGEGRKATLKEGVTFLLKLWDNAKRKRGM 644


>ref|XP_006435548.1| hypothetical protein CICLE_v10030937mg [Citrus clementina]
            gi|568866258|ref|XP_006486474.1| PREDICTED: intracellular
            protein transport protein USO1-like isoform X1 [Citrus
            sinensis] gi|557537744|gb|ESR48788.1| hypothetical
            protein CICLE_v10030937mg [Citrus clementina]
          Length = 639

 Score =  851 bits (2198), Expect = 0.0
 Identities = 411/636 (64%), Positives = 515/636 (80%), Gaps = 2/636 (0%)
 Frame = -3

Query: 2094 MEYSSGDDTDVSDSXXXXXXXXXXE-LKNGNHQYKVANDAYTCPFCPTKRKRDFLYKELL 1918
            ME+SS +D+D+S+S          + LK+GNH  K++++A+TCP+CP KRK+++LYK+LL
Sbjct: 1    MEHSSEEDSDISESEMLKYEDKSYKKLKSGNHSVKISDEAFTCPYCPKKRKQEYLYKDLL 60

Query: 1917 QHAMGVGKCNSQKRTRKDKANHLALAKHLENDVVVDAGPSQQATMVDPLADHDRDEMFVW 1738
            QHA GVG   S KR+ K+KANHLALAK+LE D+     PS+     DPL     DE FVW
Sbjct: 61   QHASGVGNSTSNKRSAKEKANHLALAKYLEKDLRDAGSPSKPVNEGDPLTGCSHDEKFVW 120

Query: 1737 PWIGIVVNIPTE-FKDGRYVGESGTKLRDQLTRRGFNPTRVRPLWNYQGHSGTALVEFQK 1561
            PW GIVVNIPT   +DGR VGESG+KLRD+L RRGFNPTRV PLWN++GHSG A+VEF K
Sbjct: 121  PWTGIVVNIPTRRAEDGRSVGESGSKLRDELIRRGFNPTRVHPLWNFRGHSGCAVVEFHK 180

Query: 1560 DWFGFNNAMSFEKAYEANHHGKRSWHSKNDLKSDLYGWVARADDYNSDAIFGENLRKIGD 1381
            DW G +NAMSFEKAYEA+HHGK+ W++ N  KS LY WVAR+DDYN   I G++LRKIGD
Sbjct: 181  DWPGLHNAMSFEKAYEADHHGKKDWYASNQEKSGLYAWVARSDDYNLKNIIGDHLRKIGD 240

Query: 1380 LRTISDIMEDEARKTQKLVGNLTNVIEAKKMHLLEMENKFKETENSLSQLITEKDKLLQE 1201
            L+TIS++ME+EARK   LV NLTN+IE K  HL EM+ +F ET NS+ +L+ EKD+LLQ 
Sbjct: 241  LKTISEMMEEEARKQNLLVSNLTNMIEVKDKHLEEMKERFTETSNSVEKLMEEKDRLLQS 300

Query: 1200 YNDEIKKIQSSAQDHFQKIFNDHEKLKLQLETQKRELEHRGHELAKRETHNENERRKLSE 1021
            YN+EIKKIQ SA+DHFQ+IF DHEKLKLQLE+QK+ELE RG EL KRET NEN+R+ L+E
Sbjct: 301  YNEEIKKIQLSARDHFQRIFTDHEKLKLQLESQKKELELRGEELEKRETQNENDRKILAE 360

Query: 1020 ELEENAVKNCSLQAATNEQRKADEKVMKLAEEQKKQKENLHKKIIQLEKQLDAKQAVELE 841
            E+E+NA++N SLQ A+  Q+KADE V KLAE+QKKQKE+LH +IIQLEKQLDAKQA+ LE
Sbjct: 361  EIEKNAMRNNSLQLASLVQQKADENVRKLAEDQKKQKEDLHNRIIQLEKQLDAKQALALE 420

Query: 840  IEQLRGKLNVMRHMEDEGDLEVLEKVDSLLKSLREKEGELEDVLALNQTLVVQERNSNDE 661
            IE+L+G LNVM+HM D+GD+EVL+K++++LK LREKEGEL+D+ ALNQTL+++ER SNDE
Sbjct: 421  IERLKGSLNVMKHMGDDGDIEVLQKMETVLKDLREKEGELDDLEALNQTLIIRERKSNDE 480

Query: 660  LQDARKELVNGLKELSTNGQIGVKRMGELNSKPFHEAMKRKYNEVEADERATELCSLWEE 481
            LQDARKEL+N LKEL+    IG+KRMGEL++KPF E M RKYNE EA+ERA+ELCSLWEE
Sbjct: 481  LQDARKELINALKELAGRAHIGLKRMGELDNKPFLEVMNRKYNEEEAEERASELCSLWEE 540

Query: 480  YLRDPEWHPIKVVEINGKHQAVIDQDDEKLKDLRNNYGDEMYDAVTSALTEVNEYNPSGR 301
            YL+DP+WHP KV+   GKH+ +I+++DEKLK L+   G+E+Y AVT+AL E+NEYNPSGR
Sbjct: 541  YLKDPDWHPFKVITAEGKHKEIINEEDEKLKGLKKEMGEEVYIAVTTALVEINEYNPSGR 600

Query: 300  YIISELWNYSEGRKATLQEGVAHIMKLWRTYKRKRG 193
            YI SELWNY EGRKATLQEGVA +MK W+    ++G
Sbjct: 601  YITSELWNYKEGRKATLQEGVAFLMKQWKLLVHRKG 636


>gb|KDO69291.1| hypothetical protein CISIN_1g0065972mg [Citrus sinensis]
            gi|641850419|gb|KDO69292.1| hypothetical protein
            CISIN_1g0065972mg [Citrus sinensis]
          Length = 639

 Score =  850 bits (2195), Expect = 0.0
 Identities = 411/636 (64%), Positives = 515/636 (80%), Gaps = 2/636 (0%)
 Frame = -3

Query: 2094 MEYSSGDDTDVSDSXXXXXXXXXXE-LKNGNHQYKVANDAYTCPFCPTKRKRDFLYKELL 1918
            ME+SS +D+D+S+S          + LK+GNH  K++++A+TCP+CP KRK+++LYK+LL
Sbjct: 1    MEHSSEEDSDISESEMLKYEDKSYKKLKSGNHSVKISDEAFTCPYCPKKRKQEYLYKDLL 60

Query: 1917 QHAMGVGKCNSQKRTRKDKANHLALAKHLENDVVVDAGPSQQATMVDPLADHDRDEMFVW 1738
            QHA GVG   S KR+ K+KANHLALAK+LE D+     PS+     DPL     DE FVW
Sbjct: 61   QHASGVGNSTSNKRSAKEKANHLALAKYLEKDLRDAGSPSKPVNEGDPLTGCSHDEKFVW 120

Query: 1737 PWIGIVVNIPTE-FKDGRYVGESGTKLRDQLTRRGFNPTRVRPLWNYQGHSGTALVEFQK 1561
            PW GIVVNIPT   +DGR VGESG+KLRD+L RRGFNPTRV PLWN++GHSG A+VEF K
Sbjct: 121  PWTGIVVNIPTRRAEDGRSVGESGSKLRDELIRRGFNPTRVHPLWNFRGHSGCAVVEFHK 180

Query: 1560 DWFGFNNAMSFEKAYEANHHGKRSWHSKNDLKSDLYGWVARADDYNSDAIFGENLRKIGD 1381
            DW G +NAMSFEKAYEA+H+GK+ W++ N  KS LY WVAR+DDYN   I G++LRKIGD
Sbjct: 181  DWPGLHNAMSFEKAYEADHYGKKDWYASNQEKSGLYAWVARSDDYNLKNIIGDHLRKIGD 240

Query: 1380 LRTISDIMEDEARKTQKLVGNLTNVIEAKKMHLLEMENKFKETENSLSQLITEKDKLLQE 1201
            L+TIS++ME+EARK   LV NLTN+IE K  HL EM+ +F ET NS+ +L+ EKD+LLQ 
Sbjct: 241  LKTISEMMEEEARKQNLLVSNLTNMIEVKDKHLEEMKERFTETSNSVEKLMEEKDRLLQS 300

Query: 1200 YNDEIKKIQSSAQDHFQKIFNDHEKLKLQLETQKRELEHRGHELAKRETHNENERRKLSE 1021
            YN+EIKKIQ SA+DHFQ+IF DHEKLKLQLE+QK+ELE RG EL KRET NEN+R+ L+E
Sbjct: 301  YNEEIKKIQLSARDHFQRIFTDHEKLKLQLESQKKELELRGEELEKRETQNENDRKILAE 360

Query: 1020 ELEENAVKNCSLQAATNEQRKADEKVMKLAEEQKKQKENLHKKIIQLEKQLDAKQAVELE 841
            E+E+NA++N SLQ A+  Q+KADE V KLAE+QKKQKE+LH +IIQLEKQLDAKQA+ LE
Sbjct: 361  EIEKNAMRNNSLQLASLVQQKADENVRKLAEDQKKQKEDLHNRIIQLEKQLDAKQALALE 420

Query: 840  IEQLRGKLNVMRHMEDEGDLEVLEKVDSLLKSLREKEGELEDVLALNQTLVVQERNSNDE 661
            IE+L+G LNVM+HM D+GD+EVL+K++++LK LREKEGEL+D+ ALNQTL+++ER SNDE
Sbjct: 421  IERLKGSLNVMKHMGDDGDIEVLQKMETVLKDLREKEGELDDLEALNQTLIIRERKSNDE 480

Query: 660  LQDARKELVNGLKELSTNGQIGVKRMGELNSKPFHEAMKRKYNEVEADERATELCSLWEE 481
            LQDARKEL+N LKELS    IG+KRMGEL++KPF E M RKYNE EA+ERA+ELCSLWEE
Sbjct: 481  LQDARKELINALKELSGRAHIGLKRMGELDNKPFLEVMNRKYNEEEAEERASELCSLWEE 540

Query: 480  YLRDPEWHPIKVVEINGKHQAVIDQDDEKLKDLRNNYGDEMYDAVTSALTEVNEYNPSGR 301
            YL+DP+WHP KV+   GKH+ +I+++DEKLK L+   G+E+Y AVT+AL E+NEYNPSGR
Sbjct: 541  YLKDPDWHPFKVITAEGKHKEIINEEDEKLKGLKKEMGEEVYIAVTTALVEINEYNPSGR 600

Query: 300  YIISELWNYSEGRKATLQEGVAHIMKLWRTYKRKRG 193
            YI SELWNY EGRKATLQEGVA +MK W+    ++G
Sbjct: 601  YITSELWNYKEGRKATLQEGVAFLMKQWKLLVHRKG 636


>ref|XP_012459360.1| PREDICTED: protein INVOLVED IN DE NOVO 2-like [Gossypium raimondii]
            gi|763809191|gb|KJB76093.1| hypothetical protein
            B456_012G071000 [Gossypium raimondii]
          Length = 645

 Score =  848 bits (2192), Expect = 0.0
 Identities = 420/644 (65%), Positives = 513/644 (79%), Gaps = 9/644 (1%)
 Frame = -3

Query: 2094 MEYSSGDDTDVSDSXXXXXXXXXXE-LKNGNHQYKVANDAYTCPFCPTKRKRDFLYKELL 1918
            M+ SSG+ +D+S+S          E LKNGN++ K++N+ YTCPFCP K+K+DFLYK+LL
Sbjct: 1    MDISSGEGSDISESEMEEYEDKYYEKLKNGNYKIKISNEKYTCPFCPKKKKQDFLYKDLL 60

Query: 1917 QHAMGVGKCNSQKRTRKDKANHLALAKHLENDVVVDAGPSQQATMV-----DPLADHDRD 1753
            QHA GVGK NS KR+ ++KANHLAL K+LEND+    G S  +        DPL+  D D
Sbjct: 61   QHASGVGKSNSDKRSAREKANHLALFKYLENDLRGTVGSSSSSAAAAAEAEDPLSGCDHD 120

Query: 1752 EMFVWPWIGIVVNIPTE-FKDGRYVGESGTKLRDQLTRRGFNPTRVRPLWNYQGHSGTAL 1576
            E  VWPW G+VVNIPT+  +DGR VG SG+KLRD+L RRGFNP RV PLWNY+GHSGTA+
Sbjct: 121  EKIVWPWTGVVVNIPTQKLEDGRSVGGSGSKLRDELIRRGFNPLRVHPLWNYRGHSGTAV 180

Query: 1575 VEFQKDWFGFNNAMSFEKAYEANHHGKRSWHSKNDLKSDLYGWVARADDYNSDAIFGENL 1396
            VEF+KDW G +NA+SFEKAYEA+H GK+ W + N +K  LY WVARADDY S  I GE+L
Sbjct: 181  VEFRKDWPGLHNALSFEKAYEADHRGKKDWFANNAVKEGLYAWVARADDYKSSTIIGEHL 240

Query: 1395 RKIGDLRTISDIMEDEARKTQKLVGNLTNVIEAKKMHLLEMENKFKETENSLSQLITEKD 1216
            RKIGDL+T+S++ME+EARK  +LV NLTN+IE K  H+ EME +  ET  SL  L+ EKD
Sbjct: 241  RKIGDLKTVSELMEEEARKQDRLVTNLTNIIETKNKHIQEMEQRCSETSKSLEALMEEKD 300

Query: 1215 KLLQEYNDEIKKIQSSAQDHFQKIFNDHEKLKLQLETQKRELEHRGHELAKRETHNENER 1036
             L Q YN+EIKKIQ SA+DHFQ+IF+DHEKLK QLE+ K++LE RG EL KRE  NE+ER
Sbjct: 301  NLSQAYNEEIKKIQVSARDHFQRIFSDHEKLKSQLESHKKDLELRGVELEKREALNESER 360

Query: 1035 RKLSEELEENAVKNCSLQAATNEQRKADEKVMKLAEEQKKQKENLHKKIIQLEKQLDAKQ 856
            +KL+EELEENAV+N +L  A  EQ++ADE VMKLAE+QK+QKE LH +IIQLEK+LD KQ
Sbjct: 361  KKLAEELEENAVQNSALHLAALEQKRADENVMKLAEDQKRQKEELHNRIIQLEKKLDQKQ 420

Query: 855  AVELEIEQLRGKLNVMRHMEDEGDLEVLEKVDSLLKSLREKEGELEDVLALNQTLVVQER 676
            A+ELEIEQLRG LNV+RHM DE D+EVLEKVD+ LK LREKE ELED+ ALNQTL+V+ER
Sbjct: 421  ALELEIEQLRGSLNVIRHMGDEDDIEVLEKVDASLKELREKEAELEDLEALNQTLIVRER 480

Query: 675  NSNDELQDARKELVNGLKELSTNGQIGVKRMGELNSKPFHEAMKRKYNEVEADERATELC 496
             SNDELQDARKEL+NGLKE+ST  QIGVKRMGEL+SKPF EAMKR+YNE  A+ERA+E+C
Sbjct: 481  KSNDELQDARKELINGLKEISTRSQIGVKRMGELDSKPFLEAMKRRYNEELAEERASEVC 540

Query: 495  SLWEEYLRDPEWHPIKVVEING--KHQAVIDQDDEKLKDLRNNYGDEMYDAVTSALTEVN 322
            SLWEEYL+DP+WHP K +++ G  ++Q VID +DEKL+DL +  G E Y +VTSA+ E+N
Sbjct: 541  SLWEEYLKDPDWHPFKRIKLEGGEEYQEVIDDEDEKLRDLTDQMGIEAYKSVTSAIKEIN 600

Query: 321  EYNPSGRYIISELWNYSEGRKATLQEGVAHIMKLWRTYKRKRGM 190
            EYNPSGRYIISELWNY EGRKATL+EGV  ++KLW   KRKRGM
Sbjct: 601  EYNPSGRYIISELWNYGEGRKATLKEGVTFLLKLWDNAKRKRGM 644


>ref|XP_012830016.1| PREDICTED: protein INVOLVED IN DE NOVO 2-like isoform X1 [Erythranthe
            guttatus]
          Length = 862

 Score =  847 bits (2187), Expect = 0.0
 Identities = 411/598 (68%), Positives = 496/598 (82%), Gaps = 5/598 (0%)
 Frame = -3

Query: 1971 CPFCPTKRKRDFLYKELLQHAMGVGKCNSQKRTRKDKANHLALAKHLENDVVVDAG---- 1804
            CP+CP +R  +F YK+LLQHA  +  C+S+KRT KDKANHL+LAK+L+ D  V       
Sbjct: 262  CPYCPWRRTINFQYKDLLQHASAIATCSSKKRTAKDKANHLSLAKYLQIDFAVACAGSST 321

Query: 1803 -PSQQATMVDPLADHDRDEMFVWPWIGIVVNIPTEFKDGRYVGESGTKLRDQLTRRGFNP 1627
             P   + +V+ + D+D  EM VWPW G+VVNIP E  DGRYVGESG+KLRD+L+RRGFNP
Sbjct: 322  KPPSSSVVVEAVTDYDPSEMLVWPWSGVVVNIPVELIDGRYVGESGSKLRDRLSRRGFNP 381

Query: 1626 TRVRPLWNYQGHSGTALVEFQKDWFGFNNAMSFEKAYEANHHGKRSWHSKNDLKSDLYGW 1447
             RVR LW  + HSGTA+VEF+KDW GF +AM F+K YE N+HGK +W +K D  SDLY W
Sbjct: 382  IRVRTLWIDKWHSGTAIVEFRKDWSGFADAMFFDKDYEVNNHGKTNWLAKTDKSSDLYAW 441

Query: 1446 VARADDYNSDAIFGENLRKIGDLRTISDIMEDEARKTQKLVGNLTNVIEAKKMHLLEMEN 1267
            +AR DDYN++ + GE LR+ GD RT+ DIME+EARKT KLV +LTN  E+KK  LLEME+
Sbjct: 442  IARVDDYNANDVIGETLRRFGDPRTVLDIMEEEARKTNKLVISLTNTFESKKRCLLEMES 501

Query: 1266 KFKETENSLSQLITEKDKLLQEYNDEIKKIQSSAQDHFQKIFNDHEKLKLQLETQKRELE 1087
            K KETE+SLSQLI E+D + + YN+EI+KIQSSA DH++KIF DHEKLK++LETQKR+LE
Sbjct: 502  KVKETESSLSQLIVERDNIRRVYNEEIRKIQSSAWDHYKKIFKDHEKLKVELETQKRDLE 561

Query: 1086 HRGHELAKRETHNENERRKLSEELEENAVKNCSLQAATNEQRKADEKVMKLAEEQKKQKE 907
             RG EL KRETHN+ ER +L+EELE+NAVKNCSLQAA  EQR+ADE VMKLAEE KKQKE
Sbjct: 562  RRGQELMKRETHNKIERERLAEELEQNAVKNCSLQAAAKEQRRADENVMKLAEEHKKQKE 621

Query: 906  NLHKKIIQLEKQLDAKQAVELEIEQLRGKLNVMRHMEDEGDLEVLEKVDSLLKSLREKEG 727
             LHK+IIQLEKQLDAKQAV+LE+EQLRGKLNVM HM  EGDL VL KV+ L+K+LREKE 
Sbjct: 622  KLHKRIIQLEKQLDAKQAVQLEVEQLRGKLNVMMHMGHEGDLRVLNKVELLIKALREKEE 681

Query: 726  ELEDVLALNQTLVVQERNSNDELQDARKELVNGLKELSTNGQIGVKRMGELNSKPFHEAM 547
            EL+D+ ALNQ L++QERN N++LQDARKELVNG KE+S NG+IGVKRMGEL+SKPFHEAM
Sbjct: 682  ELQDLEALNQYLILQERNRNNDLQDARKELVNGFKEISINGRIGVKRMGELDSKPFHEAM 741

Query: 546  KRKYNEVEADERATELCSLWEEYLRDPEWHPIKVVEINGKHQAVIDQDDEKLKDLRNNYG 367
            K+KYNE EA+ERA+ELCSL+EEYLRDPEWHP+KVV++NG HQAVI +DDEKL+DLR NYG
Sbjct: 742  KKKYNEAEANERASELCSLFEEYLRDPEWHPVKVVDVNGTHQAVIKEDDEKLRDLRENYG 801

Query: 366  DEMYDAVTSALTEVNEYNPSGRYIISELWNYSEGRKATLQEGVAHIMKLWRTYKRKRG 193
            +E+Y+AVTSAL E+NEYNPSGRYIISELWNY EGR+A L+EGVA ++K  R YKRKRG
Sbjct: 802  EEVYNAVTSALCEINEYNPSGRYIISELWNYDEGRRANLKEGVAVLLKQLRLYKRKRG 859



 Score =  293 bits (751), Expect = 3e-76
 Identities = 141/239 (58%), Positives = 181/239 (75%), Gaps = 11/239 (4%)
 Frame = -3

Query: 2019 LKNGNHQYKVANDAYTCPFCPTKRKRDFLYKELLQHAMGVGKCNSQ--KRTRKDKANHLA 1846
            LKNG +Q K++N +Y CP+CP KR RDF Y++LLQHA  +G C+SQ  KRT +DKANH+A
Sbjct: 21   LKNGKYQVKISNKSYACPYCPEKRTRDFRYEDLLQHATAIGSCSSQNVKRTERDKANHVA 80

Query: 1845 LAKHLENDV-VVDAG-----PSQQATMVDPLAD--HDRDEMFVWPWIGIVVNIPTEFKDG 1690
            L K+L+ D+ V+ AG     PS ++     L      RDE+FVWPW G++VNIPT+ K+G
Sbjct: 81   LTKYLQTDIAVLPAGGGSKPPSAKSHAAAVLEQGLQGRDEVFVWPWTGVMVNIPTDLKNG 140

Query: 1689 RYVGESGTKLRDQLTRRGFNPTRVRPLWNYQGHSGTALVEFQKDWFGFNNAMSFEKAYEA 1510
             YVG+ G KLR+QL +R FNP RVRPLW+YQGHSGTA+VEF KDW G  +AMSFEKAY++
Sbjct: 141  HYVGKGGIKLRNQLAKRKFNPIRVRPLWSYQGHSGTAIVEFHKDWSGLTDAMSFEKAYDS 200

Query: 1509 NHHGKRSWHSKNDLKS-DLYGWVARADDYNSDAIFGENLRKIGDLRTISDIMEDEARKT 1336
             +HGKR+W +K D  S D+Y W+AR DDY S++I GE+LRKIG LR ISDI+E+EARKT
Sbjct: 201  TNHGKRNWLAKTDKSSDDVYAWIARVDDYKSNSIIGEDLRKIGYLRPISDILEEEARKT 259


>ref|XP_012089069.1| PREDICTED: protein INVOLVED IN DE NOVO 2 [Jatropha curcas]
            gi|643739131|gb|KDP44945.1| hypothetical protein
            JCGZ_01445 [Jatropha curcas]
          Length = 636

 Score =  836 bits (2160), Expect = 0.0
 Identities = 408/635 (64%), Positives = 511/635 (80%), Gaps = 2/635 (0%)
 Frame = -3

Query: 2094 MEYSSGDDTDVSDSXXXXXXXXXXE-LKNGNHQYKVANDAYTCPFCPTKRKRDFLYKELL 1918
            M+YSS +DTDVSDS          E LKNG    K++++ ++CP+CP KRKRD+LYK+LL
Sbjct: 1    MDYSSDEDTDVSDSEMEEYEAQSYEELKNGTRSVKISDEIFSCPYCPKKRKRDYLYKDLL 60

Query: 1917 QHAMGVGKCNSQKRTRKDKANHLALAKHLENDVVVDAGPSQQATMVDPLADHDRDEMFVW 1738
            QHA+GVGK  S KR+ K+KANHLAL K+LE D+     PS+  +  DPL++ D  E  VW
Sbjct: 61   QHAVGVGKSPSNKRSAKEKANHLALVKYLEKDLGATGSPSEPKSDTDPLSECDHYEKLVW 120

Query: 1737 PWIGIVVNIPT-EFKDGRYVGESGTKLRDQLTRRGFNPTRVRPLWNYQGHSGTALVEFQK 1561
            PW GIVVN+PT    DGR+VG SG+K RD+L  RGFNPTRV PLWNY+GHSG+A+VEF+K
Sbjct: 121  PWTGIVVNLPTTRTDDGRFVGASGSKFRDELISRGFNPTRVHPLWNYRGHSGSAVVEFRK 180

Query: 1560 DWFGFNNAMSFEKAYEANHHGKRSWHSKNDLKSDLYGWVARADDYNSDAIFGENLRKIGD 1381
            DW G +NA+SFEKAYEA+HHGK+ W +  + KS +Y WVARADDY +D I GE+LRKIGD
Sbjct: 181  DWPGLHNALSFEKAYEADHHGKKEWFTGGE-KSGVYCWVARADDYKADNIIGEHLRKIGD 239

Query: 1380 LRTISDIMEDEARKTQKLVGNLTNVIEAKKMHLLEMENKFKETENSLSQLITEKDKLLQE 1201
            L+T+S+IME+EARK  KL+ NL N+IE K  HL EME K  ET  SL +L+ EKD+LLQ 
Sbjct: 240  LKTVSEIMEEEARKQDKLISNLNNIIEIKNKHLQEMEEKCSETTVSLQKLMGEKDRLLQA 299

Query: 1200 YNDEIKKIQSSAQDHFQKIFNDHEKLKLQLETQKRELEHRGHELAKRETHNENERRKLSE 1021
            YN+EIKKIQ SA++HFQKIFNDHEKLKLQLE+QKRELE RG EL +RE  NE++RR LSE
Sbjct: 300  YNEEIKKIQMSAREHFQKIFNDHEKLKLQLESQKRELEMRGSELEQREARNESDRRLLSE 359

Query: 1020 ELEENAVKNCSLQAATNEQRKADEKVMKLAEEQKKQKENLHKKIIQLEKQLDAKQAVELE 841
            E+E+NA++N SLQ A+ EQ+KADE V+KLAE+QK+QKE LH +IIQLEKQLDAKQA+ELE
Sbjct: 360  EIEKNAIRNSSLQLASLEQQKADESVLKLAEDQKRQKEELHNRIIQLEKQLDAKQALELE 419

Query: 840  IEQLRGKLNVMRHMEDEGDLEVLEKVDSLLKSLREKEGELEDVLALNQTLVVQERNSNDE 661
            IE+LRG LNV++HM D+GD EVL+K+D+++++LREKEGELE++  LNQ L+V+ER SNDE
Sbjct: 420  IERLRGSLNVIKHMGDDGDAEVLKKMDTIIQNLREKEGELEELETLNQALIVRERKSNDE 479

Query: 660  LQDARKELVNGLKELSTNGQIGVKRMGELNSKPFHEAMKRKYNEVEADERATELCSLWEE 481
            LQ+ARKEL+ GLKE+S    IGVKRMGEL+SKPF EAMK+K+ E EA+ RA+ELCSLW E
Sbjct: 480  LQEARKELITGLKEISNRASIGVKRMGELDSKPFLEAMKKKFVEDEAEVRASELCSLWME 539

Query: 480  YLRDPEWHPIKVVEINGKHQAVIDQDDEKLKDLRNNYGDEMYDAVTSALTEVNEYNPSGR 301
            YL+DP+WHP K V ++GKH+ VI+ +DEKLK LR    +E+Y AVT AL E+NEYNPSGR
Sbjct: 540  YLKDPDWHPFKFVMVDGKHKEVINDEDEKLKGLRKEMSNEVYKAVTDALMEINEYNPSGR 599

Query: 300  YIISELWNYSEGRKATLQEGVAHIMKLWRTYKRKR 196
            YIISELWNY EG+KATL+EGV+ ++K W+  KRKR
Sbjct: 600  YIISELWNYKEGKKATLKEGVSFLLKQWQVAKRKR 634


>ref|XP_002533154.1| conserved hypothetical protein [Ricinus communis]
            gi|223527049|gb|EEF29235.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 640

 Score =  833 bits (2153), Expect = 0.0
 Identities = 403/639 (63%), Positives = 515/639 (80%), Gaps = 2/639 (0%)
 Frame = -3

Query: 2106 MGSSMEYSSGDDTDVSDSXXXXXXXXXXE-LKNGNHQYKVANDAYTCPFCPTKRKRDFLY 1930
            MGSS+++SS +DTD+S+S          E LKNG H  K++++ +TCP+CP KRKR++LY
Sbjct: 1    MGSSVDHSSDEDTDMSESELDEYEAQCYEELKNGTHHVKISDETFTCPYCPKKRKREYLY 60

Query: 1929 KELLQHAMGVGKCNSQKRTRKDKANHLALAKHLENDVVVDAGPSQQATMVDPLADHDRDE 1750
            ++LLQHA GVG+  S+KR+ K+KANHLAL K+LE D+     PS+     DPL   + DE
Sbjct: 61   RDLLQHASGVGRSASKKRSTKEKANHLALVKYLEKDIADLGSPSKPKGESDPLDSCNHDE 120

Query: 1749 MFVWPWIGIVVNIPT-EFKDGRYVGESGTKLRDQLTRRGFNPTRVRPLWNYQGHSGTALV 1573
              VWPW GIV+NIPT +  DGR+VG SG+K RD+L  RGFNPTRV PLWNY+GHSG+A+V
Sbjct: 121  KIVWPWTGIVINIPTTKAPDGRFVGASGSKFRDELISRGFNPTRVHPLWNYRGHSGSAVV 180

Query: 1572 EFQKDWFGFNNAMSFEKAYEANHHGKRSWHSKNDLKSDLYGWVARADDYNSDAIFGENLR 1393
            EF KDW G +NA+SFEKAYEA+HHGK+ + +  + KS +Y WVARADDY +D I G++LR
Sbjct: 181  EFHKDWPGLHNALSFEKAYEADHHGKKDYFTTGE-KSGVYCWVARADDYKADNIIGDHLR 239

Query: 1392 KIGDLRTISDIMEDEARKTQKLVGNLTNVIEAKKMHLLEMENKFKETENSLSQLITEKDK 1213
            K GDL+TIS+IME+EARK  KL+ NL N+IE K  H+ EM++KF ET  SL++L+ EKD+
Sbjct: 240  KTGDLKTISEIMEEEARKQDKLISNLNNIIEIKNKHIQEMQDKFSETSVSLNKLMEEKDR 299

Query: 1212 LLQEYNDEIKKIQSSAQDHFQKIFNDHEKLKLQLETQKRELEHRGHELAKRETHNENERR 1033
            LLQ YN+EI+KIQ SA++HFQKIFNDHEKLKLQ+++QKRELE RG EL KRE  NEN+RR
Sbjct: 300  LLQAYNEEIRKIQMSAREHFQKIFNDHEKLKLQVDSQKRELEMRGSELEKREAKNENDRR 359

Query: 1032 KLSEELEENAVKNCSLQAATNEQRKADEKVMKLAEEQKKQKENLHKKIIQLEKQLDAKQA 853
            KLSEE+E+NA++N SLQ A  EQ+KADE V+KLAE+QK+QKE LH +IIQL+KQLDAKQA
Sbjct: 360  KLSEEIEKNAIRNSSLQLAAFEQQKADENVLKLAEDQKRQKEELHNRIIQLQKQLDAKQA 419

Query: 852  VELEIEQLRGKLNVMRHMEDEGDLEVLEKVDSLLKSLREKEGELEDVLALNQTLVVQERN 673
            +ELEIE+LRG LNVM+HM D+GD+EVL+K+++++++LREKEGELED+  LNQ L+V ER 
Sbjct: 420  LELEIERLRGTLNVMKHMGDDGDVEVLQKMETIIQNLREKEGELEDLETLNQALIVSERK 479

Query: 672  SNDELQDARKELVNGLKELSTNGQIGVKRMGELNSKPFHEAMKRKYNEVEADERATELCS 493
            SNDELQ+ARKEL+NGLKE+S   QIGVKRMGEL+SKPF EAMKRKY E EA+ RA+ELCS
Sbjct: 480  SNDELQEARKELINGLKEISNRAQIGVKRMGELDSKPFLEAMKRKYTEEEAEVRASELCS 539

Query: 492  LWEEYLRDPEWHPIKVVEINGKHQAVIDQDDEKLKDLRNNYGDEMYDAVTSALTEVNEYN 313
            LW EYL+DP WHP KV  ++GK++ VID  DEKL  L++  GDE+Y AVT A+ E+N+YN
Sbjct: 540  LWVEYLKDPGWHPFKVAMVDGKNKEVIDDKDEKLNGLKDELGDEVYKAVTDAVKEINDYN 599

Query: 312  PSGRYIISELWNYSEGRKATLQEGVAHIMKLWRTYKRKR 196
            PSGRYI SELWNY E +KATL+EGV+ ++K W+  KR+R
Sbjct: 600  PSGRYITSELWNYKEEKKATLKEGVSFLLKQWQIAKRRR 638


Top