BLASTX nr result

ID: Zingiber24_contig00020199 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Zingiber24_contig00020199
         (1537 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002302270.1| hypothetical protein POPTR_0002s09150g [Popu...   468   e-129
ref|XP_002533875.1| DNA-directed RNA polymerase II, putative [Ri...   467   e-129
ref|XP_006472675.1| PREDICTED: uncharacterized protein LOC102626...   466   e-129
ref|XP_002285818.1| PREDICTED: uncharacterized protein LOC100260...   464   e-128
ref|XP_004300664.1| PREDICTED: uncharacterized protein LOC101292...   463   e-128
ref|XP_006434072.1| hypothetical protein CICLE_v10001001mg [Citr...   462   e-127
gb|EMJ26908.1| hypothetical protein PRUPE_ppa005050mg [Prunus pe...   462   e-127
gb|EOY16120.1| DNA-directed RNA polymerase II protein isoform 1 ...   460   e-127
ref|XP_003542641.1| PREDICTED: uncharacterized protein LOC100813...   454   e-125
gb|ESW13116.1| hypothetical protein PHAVU_008G169200g [Phaseolus...   453   e-125
ref|XP_003544777.1| PREDICTED: uncharacterized protein LOC100776...   453   e-124
ref|XP_006386390.1| hypothetical protein POPTR_0002s09150g [Popu...   451   e-124
ref|XP_003614901.1| hypothetical protein MTR_5g061040 [Medicago ...   447   e-123
gb|EXC06677.1| hypothetical protein L484_021513 [Morus notabilis]     447   e-123
ref|XP_004237862.1| PREDICTED: uncharacterized protein LOC101264...   432   e-118
ref|XP_006354053.1| PREDICTED: uncharacterized protein LOC102590...   430   e-118
ref|XP_006397280.1| hypothetical protein EUTSA_v10028627mg [Eutr...   424   e-116
ref|XP_004138644.1| PREDICTED: uncharacterized protein LOC101217...   419   e-114
ref|NP_192594.2| DNA-directed RNA polymerase II protein [Arabido...   416   e-113
gb|AAL59980.1| unknown protein [Arabidopsis thaliana]                 416   e-113

>ref|XP_002302270.1| hypothetical protein POPTR_0002s09150g [Populus trichocarpa]
            gi|566157047|ref|XP_006386388.1| hypothetical protein
            POPTR_0002s09150g [Populus trichocarpa]
            gi|566157050|ref|XP_006386389.1| hypothetical protein
            POPTR_0002s09150g [Populus trichocarpa]
            gi|222843996|gb|EEE81543.1| hypothetical protein
            POPTR_0002s09150g [Populus trichocarpa]
            gi|550344610|gb|ERP64185.1| hypothetical protein
            POPTR_0002s09150g [Populus trichocarpa]
            gi|550344611|gb|ERP64186.1| hypothetical protein
            POPTR_0002s09150g [Populus trichocarpa]
          Length = 475

 Score =  468 bits (1203), Expect = e-129
 Identities = 257/473 (54%), Positives = 314/473 (66%), Gaps = 3/473 (0%)
 Frame = -2

Query: 1524 SSCCAICEGSNLASICAPCVNHRLNEYSALLKSLNTVCKSHYSXXXXXXXXXXXXDQQIN 1345
            SSCCAICE SN ASIC  CVN+RLNEY  LLKSLN+   S YS            D Q N
Sbjct: 5    SSCCAICENSNRASICPICVNYRLNEYGTLLKSLNSRRDSLYSKLSVVLIAKGKADDQFN 64

Query: 1344 WRAVQNVELKKLKERLTYLKKKLAEDKAKVSKESNDLKLKYGSLELAFSTMKNKQTEVLD 1165
            WR  QN +L   +E+L   K++LA+ KAKV K S DLK K G LE A + ++  + E L+
Sbjct: 65   WRVQQNEKLASSREKLHRNKEQLAQGKAKVEKLSQDLKKKNGMLESARNVLEKNRMEQLE 124

Query: 1164 KLRSDLMSNQRLVDTGITPECIRRQAVVIENICKLFPMCRVNPDKIKDRSNNMSNQICNA 985
            K   +L+  Q L    IT E + +Q+VVI+ ICKLFP  RVN D  ++ S    +QICNA
Sbjct: 125  KFYPNLICTQSLGHMAITSELLHKQSVVIKQICKLFPQRRVNVDGERNFSGQY-DQICNA 183

Query: 984  RLPRGLDPHSVPPEELAASLGYMVQXXXXXXXXXXXXXLHNSGFAGSCSRIWQRNSYWDA 805
            RLPRGLDPHSV  EELAASLGYMVQ             LHN+GFAGSCSRIWQR+SYW+A
Sbjct: 184  RLPRGLDPHSVSSEELAASLGYMVQLLNLVAHNLAAPTLHNAGFAGSCSRIWQRDSYWNA 243

Query: 804  RPSSQSKEYPLFIPRQNFC-XXXXXXXXXXXSLNFGVASVESERRPHLESSRNSSFNYSL 628
             PSS+S EYPLFIPRQN+C            S NFGVAS+ESERRPHL+S+R++SFNYS 
Sbjct: 244  CPSSRSNEYPLFIPRQNYCSTSSENSWTDKSSSNFGVASMESERRPHLDSTRSNSFNYSS 303

Query: 627  ASPHSLENHTDLQKGILLLKKSVACITAYSCNSLGLDIPSEASTFEAFGKLLFILSSSKE 448
             SPHS+E H DLQKG+ LLKKSVAC+TAY  N L LD+PS+ STFEAF KLL  LSSSKE
Sbjct: 304  VSPHSVETHKDLQKGVSLLKKSVACVTAYCYNLLCLDVPSDTSTFEAFAKLLSTLSSSKE 363

Query: 447  LQSIRNSLKTASSSTEKQAQQLKRXXXXXXXXXXXXNTLAESEQ-TITSTVDDDNLQNPD 271
            ++S+ N LK A S + KQ Q+L +              L  +    +     D+NL N  
Sbjct: 364  VRSVFN-LKMACSRSCKQVQKLNKSVWNVNSAISSSALLESAHALQLMKNTSDNNLPNSA 422

Query: 270  SSFMF-TAVMINKSESIVEGWNIIEHPTLPPPPSKTEDVEHWMRAMFIDATKK 115
            +SF+F T +   K+ES ++GW+++EHPT PPPPS+ ED+EHW RAMFIDATKK
Sbjct: 423  ASFLFATGISDGKNESFIDGWDLVEHPTFPPPPSQVEDIEHWTRAMFIDATKK 475


>ref|XP_002533875.1| DNA-directed RNA polymerase II, putative [Ricinus communis]
            gi|223526176|gb|EEF28506.1| DNA-directed RNA polymerase
            II, putative [Ricinus communis]
          Length = 478

 Score =  467 bits (1202), Expect = e-129
 Identities = 258/476 (54%), Positives = 321/476 (67%), Gaps = 6/476 (1%)
 Frame = -2

Query: 1524 SSCCAICEGSNLASICAPCVNHRLNEYSALLKSLNTVCKSHYSXXXXXXXXXXXXDQQIN 1345
            SSCCAICE SN ASIC  CVN+RLNEYS LLKSL +     YS            D Q+N
Sbjct: 5    SSCCAICENSNRASICTVCVNYRLNEYSTLLKSLKSRRDLLYSRLSEVLVAKGKADDQLN 64

Query: 1344 WRAVQNVELKKLKERLTYLKKKLAEDKAKVSKESNDLKLKYGSLELAFSTMKNKQTEVLD 1165
            WR  QN +L  L+E+L   K++L + KAK  K S+DL  KYG LE + S ++  + + L+
Sbjct: 65   WRVHQNEKLANLREKLLRSKEQLIQAKAKTEKMSSDLNAKYGLLESSRSALEKNRVDQLE 124

Query: 1164 KLRSDLMSNQRLVDTGITPECIRRQAVVIENICKLFPMCRVNPD-KIKDRSNNMSNQICN 988
            K   +L+  Q L    IT E +   +V ++ ICKLFP  RV  + + KD S+   +QICN
Sbjct: 125  KYFPNLICTQSLGHMAITSELLHNLSVTVKQICKLFPQRRVIVEGEKKDGSSGQYDQICN 184

Query: 987  ARLPRGLDPHSVPPEELAASLGYMVQXXXXXXXXXXXXXLHNSGFAGSCSRIWQRNSYWD 808
            ARLPRGLDPHS+P EELAASLGYMVQ             LHNSGFAGSCSRIWQR+SYW+
Sbjct: 185  ARLPRGLDPHSIPSEELAASLGYMVQLLNLVVHNLAAPALHNSGFAGSCSRIWQRDSYWN 244

Query: 807  ARPSSQSKEYPLFIPRQNFC-XXXXXXXXXXXSLNFGVASVESERRPHLESSRNSSFNYS 631
            ARPSS+S EYPLFIPRQ +C            S NFGVAS+ESERR  L+SSR+SSFNY+
Sbjct: 245  ARPSSRSNEYPLFIPRQRYCSTSGENSWTDRSSSNFGVASMESERRARLDSSRSSSFNYN 304

Query: 630  LASPHSLENHTDLQKGILLLKKSVACITAYSCNSLGLDIPSEASTFEAFGKLLFILSSSK 451
             ASPHS+E H DLQKGI L+KKSVAC+TAY  N L LD+P+EASTFEAF KLL  LSSSK
Sbjct: 305  SASPHSVETHKDLQKGISLMKKSVACVTAYGYNLLCLDVPAEASTFEAFAKLLATLSSSK 364

Query: 450  ELQSIRNSLKTASSSTEKQAQQLKRXXXXXXXXXXXXNTLAESEQT--ITSTVDDDNLQN 277
            E++S+  SLK A S + KQ Q+L +            +TL ES     +T  ++D+NL+N
Sbjct: 365  EVRSV-FSLKMACSRSCKQVQKLNK-SVWNVNSIISSSTLMESAHAPHLTKNINDNNLRN 422

Query: 276  PDSSFMFTAVM--INKSESIVEGWNIIEHPTLPPPPSKTEDVEHWMRAMFIDATKK 115
              +SF+F   +    K+ES+++GW+++EHPT PPPPS+TEDVEHW RAMFIDATKK
Sbjct: 423  SATSFLFANEISDAGKNESLIDGWDLVEHPTFPPPPSQTEDVEHWTRAMFIDATKK 478


>ref|XP_006472675.1| PREDICTED: uncharacterized protein LOC102626964 isoform X1 [Citrus
            sinensis] gi|568837325|ref|XP_006472676.1| PREDICTED:
            uncharacterized protein LOC102626964 isoform X2 [Citrus
            sinensis]
          Length = 478

 Score =  466 bits (1200), Expect = e-129
 Identities = 258/476 (54%), Positives = 323/476 (67%), Gaps = 6/476 (1%)
 Frame = -2

Query: 1524 SSCCAICEGSNLASICAPCVNHRLNEYSALLKSLNTVCKSHYSXXXXXXXXXXXXDQQIN 1345
            +S CAICE SN ASICA CVN+RL+E + LLKSL +   + Y             D Q+N
Sbjct: 5    ASNCAICENSNRASICAACVNYRLSECNTLLKSLKSRRDALYMRLSEVLVAKGKADDQLN 64

Query: 1344 WRAVQNVELKKLKERLTYLKKKLAEDKAKVSKESNDLKLKYGSLELAFSTMKNKQTEVLD 1165
            WR +QN +L  L+E+L   K++L++ K K+ K S DLK++Y  L+ A S M+  + E L+
Sbjct: 65   WRVLQNEKLTNLREKLRRSKEQLSQGKLKIEKSSYDLKVRYAILDSARSMMEKNRAEQLE 124

Query: 1164 KLRSDLMSNQRLVDTGITPECIRRQAVVIENICKLFPMCRVNPD-KIKDRSNNMSNQICN 988
            K   +++  Q L    I  E + +Q+VVI+ ICKLFP  RVN D + +D S+   +QIC 
Sbjct: 125  KFYPNIICTQSLGHMAIVSELLHKQSVVIKQICKLFPQRRVNIDGERRDGSSGQYDQICG 184

Query: 987  ARLPRGLDPHSVPPEELAASLGYMVQXXXXXXXXXXXXXLHNSGFAGSCSRIWQRNSYWD 808
            ARLP+GLDPHSVP EELAASLGYMVQ             LHNSGFAGSCSRIWQR+SYWD
Sbjct: 185  ARLPKGLDPHSVPSEELAASLGYMVQLLNLVVLNLAVPILHNSGFAGSCSRIWQRDSYWD 244

Query: 807  ARPSSQSKEYPLFIPRQNFC-XXXXXXXXXXXSLNFGVASVESERRPHLESSRNSSFNYS 631
            ARPSS+S EYPLFIPRQN+C            S NFGVAS+ESERRP L+SSR++SFNY+
Sbjct: 245  ARPSSRSNEYPLFIPRQNYCSTSGENSWTDRSSSNFGVASMESERRPQLDSSRSTSFNYT 304

Query: 630  LASPHSLENHTDLQKGILLLKKSVACITAYSCNSLGLDIPSEASTFEAFGKLLFILSSSK 451
             AS HS+E H DLQKGI LLKKSVAC+TAY  NSL LD+P+EASTFEAF KLL  LSSSK
Sbjct: 305  SASTHSVETHKDLQKGISLLKKSVACLTAYCYNSLCLDVPAEASTFEAFAKLLATLSSSK 364

Query: 450  ELQSIRNSLKTASSSTEKQAQQLKRXXXXXXXXXXXXNTLAESEQ--TITSTVDDDNLQN 277
            E++S+  SLK A S + KQ Q+L R             TL ES     IT  + D+NL +
Sbjct: 365  EVRSV-FSLKMACSRSCKQVQKLNR-SVWNMNSAISSTTLLESAHMFPITKNLSDNNLPS 422

Query: 276  PDSSFMFTAVM--INKSESIVEGWNIIEHPTLPPPPSKTEDVEHWMRAMFIDATKK 115
              +SF++   M  I K+ES+++GW+++EHPT PPPPS+TEDVEHW RAM IDATKK
Sbjct: 423  SAASFLYATEMSDIGKNESLIDGWDLVEHPTFPPPPSQTEDVEHWTRAMIIDATKK 478


>ref|XP_002285818.1| PREDICTED: uncharacterized protein LOC100260778 [Vitis vinifera]
            gi|302141899|emb|CBI19102.3| unnamed protein product
            [Vitis vinifera]
          Length = 478

 Score =  464 bits (1195), Expect = e-128
 Identities = 262/476 (55%), Positives = 320/476 (67%), Gaps = 6/476 (1%)
 Frame = -2

Query: 1524 SSCCAICEGSNLASICAPCVNHRLNEYSALLKSLNTVCKSHYSXXXXXXXXXXXXDQQIN 1345
            +S C+ICE SNLASICA CVN+RLNEY+  LKS      S Y             D QIN
Sbjct: 5    TSSCSICEKSNLASICAVCVNYRLNEYNTSLKSSKGRRDSLYLRLSEVLVAKGKADDQIN 64

Query: 1344 WRAVQNVELKKLKERLTYLKKKLAEDKAKVSKESNDLKLKYGSLELAFSTMKNKQTEVLD 1165
            WR +QN +L +L+E+L + K++  + KAKV K SNDLKLKYG LE A S ++  + E L+
Sbjct: 65   WRVLQNEKLARLREKLRHRKEQYLDGKAKVEKMSNDLKLKYGLLESAMSMLEKNRVEQLE 124

Query: 1164 KLRSDLMSNQRLVDTGITPECIRRQAVVIENICKLFPMCRVNPD-KIKDRSNNMSNQICN 988
            K   +L+  Q L    IT E   +Q+VVI+ ICKLFP  RVN D + KD S+   +QICN
Sbjct: 125  KFYPNLICTQNLGLMAITSERFHKQSVVIKQICKLFPQRRVNIDGEKKDGSSRPYDQICN 184

Query: 987  ARLPRGLDPHSVPPEELAASLGYMVQXXXXXXXXXXXXXLHNSGFAGSCSRIWQRNSYWD 808
             RLPR LDPHSVP +ELAASLGYMVQ             LHNSGFAGSCSRIWQR SYW+
Sbjct: 185  VRLPRVLDPHSVPSDELAASLGYMVQLLNLVVYNLAAPALHNSGFAGSCSRIWQRESYWN 244

Query: 807  ARPSSQSKEYPLFIPRQNFC-XXXXXXXXXXXSLNFGVASVESERRPHLESSRNSSFNYS 631
             RPSS+S EYPLFIPRQN C            S NFG+AS+ES+R+P LESS +SSFNYS
Sbjct: 245  PRPSSRSNEYPLFIPRQNLCSTNGENSWSERSSSNFGIASMESDRKPRLESSGSSSFNYS 304

Query: 630  LASPHSLENHTDLQKGILLLKKSVACITAYSCNSLGLDIPSEASTFEAFGKLLFILSSSK 451
             AS HS+E H DLQKGI LLKKSVAC+T Y  +SL LD+P+EASTFEAF KLL ILSSSK
Sbjct: 305  SASLHSVETHKDLQKGISLLKKSVACLTTYCYSSLCLDVPTEASTFEAFAKLLAILSSSK 364

Query: 450  ELQSIRNSLKTASSSTEKQAQQLKRXXXXXXXXXXXXNTLAESEQTI--TSTVDDDNLQN 277
            E++S+  SLK A S + KQ QQL +            +TL ES  T+  T  + D+NL N
Sbjct: 365  EVRSV-FSLKMACSRSCKQVQQLNK-SIWNMNSAISSSTLLESAHTLPMTRNIFDNNLPN 422

Query: 276  PDSSFMFTAVM--INKSESIVEGWNIIEHPTLPPPPSKTEDVEHWMRAMFIDATKK 115
              +SF++T  M  I K+ES++E W+++EH   PPPPS+TED+EHW RAM IDATKK
Sbjct: 423  SAASFLYTTEMSDIGKNESLIEEWDLVEHANFPPPPSQTEDIEHWTRAMIIDATKK 478


>ref|XP_004300664.1| PREDICTED: uncharacterized protein LOC101292418 [Fragaria vesca
            subsp. vesca]
          Length = 478

 Score =  463 bits (1192), Expect = e-128
 Identities = 261/476 (54%), Positives = 323/476 (67%), Gaps = 6/476 (1%)
 Frame = -2

Query: 1524 SSCCAICEGSNLASICAPCVNHRLNEYSALLKSLNTVCKSHYSXXXXXXXXXXXXDQQIN 1345
            SS CAICE SNLASICA CVN+RLN+Y+  LK+L +     YS            D Q+N
Sbjct: 6    SSNCAICENSNLASICAVCVNYRLNDYNNSLKALKSRRDLLYSRLSDALVAKGKADDQLN 65

Query: 1344 WRAVQNVELKKLKERLTYLKKKLAEDKAKVSKESNDLKLKYGSLELAFSTMKNKQTEVLD 1165
            WR +Q+ +L +L+E+L   K++L + KAK+ K S DLK+KYG LE A S ++  + E L+
Sbjct: 66   WRILQDEKLVRLREKLRRNKEQLVQGKAKIEKTSYDLKVKYGVLESALSMLEKNRAEQLE 125

Query: 1164 KLRSDLMSNQRLVDTGITPECIRRQAVVIENICKLFPMCRVNPD-KIKDRSNNMSNQICN 988
            K   +L+  Q L    IT E + +Q+VVI+ ICKLFP  RV  D K K+ S    +QICN
Sbjct: 126  KFYPNLICTQSLGHMAITSERLHKQSVVIKQICKLFPQRRVTVDAKRKEGSGGQYDQICN 185

Query: 987  ARLPRGLDPHSVPPEELAASLGYMVQXXXXXXXXXXXXXLHNSGFAGSCSRIWQRNSYWD 808
            A LPRGLDPHSVP EELAASLGYMVQ             LHNSGFAGSCSRIWQR+SYWD
Sbjct: 186  ASLPRGLDPHSVPSEELAASLGYMVQLLNLVVQNLGAPALHNSGFAGSCSRIWQRDSYWD 245

Query: 807  ARPSSQSKEYPLFIPRQNFC-XXXXXXXXXXXSLNFGVASVESERRPHLESSRNSSFNYS 631
            ARPSS+S EYPLFIPRQN+C            S NFGVAS+ESER+P L+SS +SSFNYS
Sbjct: 246  ARPSSRSNEYPLFIPRQNYCSTSGENSWSDRSSSNFGVASIESERKPRLDSSGSSSFNYS 305

Query: 630  LASPHSLENHTDLQKGILLLKKSVACITAYSCNSLGLDIPSEASTFEAFGKLLFILSSSK 451
             AS HS+E H DLQ+GI LLKKSVACITAY  NSL LD+PSEASTFEAF KLL  LSSSK
Sbjct: 306  SASQHSVETHKDLQRGISLLKKSVACITAYCYNSLCLDVPSEASTFEAFAKLLSTLSSSK 365

Query: 450  ELQSIRNSLKTASSSTEKQAQQLKRXXXXXXXXXXXXNTLAESEQTITSTVD--DDNLQN 277
            E+ S+  SLK A S + KQ QQL +             TL +S  T+T T +  ++N+ N
Sbjct: 366  EVHSV-FSLKMACSRSCKQVQQLNK-SVWNVNSAISSTTLLDSAHTMTMTKNFYENNIPN 423

Query: 276  PDSSFMFTAVM--INKSESIVEGWNIIEHPTLPPPPSKTEDVEHWMRAMFIDATKK 115
              +SF+ +  M  + K+E  +EGW+++EHPTL PPPS++ED+EHW RAMFID TK+
Sbjct: 424  YATSFLSSTEMSDVGKNECTIEGWDLVEHPTL-PPPSQSEDIEHWTRAMFIDVTKR 478


>ref|XP_006434072.1| hypothetical protein CICLE_v10001001mg [Citrus clementina]
            gi|567883029|ref|XP_006434073.1| hypothetical protein
            CICLE_v10001001mg [Citrus clementina]
            gi|567883031|ref|XP_006434074.1| hypothetical protein
            CICLE_v10001001mg [Citrus clementina]
            gi|567883033|ref|XP_006434075.1| hypothetical protein
            CICLE_v10001001mg [Citrus clementina]
            gi|557536194|gb|ESR47312.1| hypothetical protein
            CICLE_v10001001mg [Citrus clementina]
            gi|557536195|gb|ESR47313.1| hypothetical protein
            CICLE_v10001001mg [Citrus clementina]
            gi|557536196|gb|ESR47314.1| hypothetical protein
            CICLE_v10001001mg [Citrus clementina]
            gi|557536197|gb|ESR47315.1| hypothetical protein
            CICLE_v10001001mg [Citrus clementina]
          Length = 478

 Score =  462 bits (1189), Expect = e-127
 Identities = 257/476 (53%), Positives = 321/476 (67%), Gaps = 6/476 (1%)
 Frame = -2

Query: 1524 SSCCAICEGSNLASICAPCVNHRLNEYSALLKSLNTVCKSHYSXXXXXXXXXXXXDQQIN 1345
            +S CAICE SN ASICA CVN+RL+E + LLKSL +   + Y             D Q+N
Sbjct: 5    ASNCAICENSNRASICAACVNYRLSECNTLLKSLKSRRDALYMRLSEVLVAKGKADDQLN 64

Query: 1344 WRAVQNVELKKLKERLTYLKKKLAEDKAKVSKESNDLKLKYGSLELAFSTMKNKQTEVLD 1165
            WR +QN +L  L+E+L   K++L++ K K+ K S DLK +Y  L+ A S M+  + E L+
Sbjct: 65   WRVLQNEKLTNLREKLRRSKEQLSQGKLKIEKSSYDLKGRYAILDSARSMMEKNRAEQLE 124

Query: 1164 KLRSDLMSNQRLVDTGITPECIRRQAVVIENICKLFPMCRVNPD-KIKDRSNNMSNQICN 988
            K   +++  Q L    I  E + +Q+VVI+ ICKLFP  RVN D + +D S+   +QIC 
Sbjct: 125  KFYPNIICTQSLGHMAIVSELLHKQSVVIKQICKLFPQRRVNIDGERRDGSSGQYDQICG 184

Query: 987  ARLPRGLDPHSVPPEELAASLGYMVQXXXXXXXXXXXXXLHNSGFAGSCSRIWQRNSYWD 808
            ARLP+GLDPHSVP EELAASLGYMVQ             LHNSGFAGSCSRIWQR+SYWD
Sbjct: 185  ARLPKGLDPHSVPSEELAASLGYMVQLLNLVVLNLAVPVLHNSGFAGSCSRIWQRDSYWD 244

Query: 807  ARPSSQSKEYPLFIPRQNFC-XXXXXXXXXXXSLNFGVASVESERRPHLESSRNSSFNYS 631
            ARPSS+S EYPLFIPRQN+C            S NFGVAS+ESERRP L+SSR++SFNY+
Sbjct: 245  ARPSSRSNEYPLFIPRQNYCSTSGENSWTDRSSSNFGVASMESERRPQLDSSRSASFNYT 304

Query: 630  LASPHSLENHTDLQKGILLLKKSVACITAYSCNSLGLDIPSEASTFEAFGKLLFILSSSK 451
             AS HS+E H DLQKGI LLKKSVAC+TAY  NSL LD+P+EASTFEAF KLL  LS SK
Sbjct: 305  SASTHSVETHKDLQKGISLLKKSVACLTAYCYNSLCLDVPAEASTFEAFAKLLATLSLSK 364

Query: 450  ELQSIRNSLKTASSSTEKQAQQLKRXXXXXXXXXXXXNTLAESEQ--TITSTVDDDNLQN 277
            E++S+  SLK A S + KQ Q+L R             TL ES     IT  + D+NL +
Sbjct: 365  EVRSV-FSLKMACSRSCKQVQKLNR-SVWNMNSAISSTTLLESAHMFPITKNLSDNNLPS 422

Query: 276  PDSSFMFTAVM--INKSESIVEGWNIIEHPTLPPPPSKTEDVEHWMRAMFIDATKK 115
              +SF++   M  I K+ES+++GW+++EHPT PPPPS+TEDVEHW RAM IDATKK
Sbjct: 423  SAASFLYATEMSDIGKNESLIDGWDLVEHPTFPPPPSQTEDVEHWTRAMIIDATKK 478


>gb|EMJ26908.1| hypothetical protein PRUPE_ppa005050mg [Prunus persica]
            gi|462422646|gb|EMJ26909.1| hypothetical protein
            PRUPE_ppa005050mg [Prunus persica]
          Length = 479

 Score =  462 bits (1188), Expect = e-127
 Identities = 256/475 (53%), Positives = 319/475 (67%), Gaps = 5/475 (1%)
 Frame = -2

Query: 1524 SSCCAICEGSNLASICAPCVNHRLNEYSALLKSLNTVCKSHYSXXXXXXXXXXXXDQQIN 1345
            SS CAICE SNLAS+CA CVN+RL EY++ LK+L +   S YS            D Q+N
Sbjct: 6    SSNCAICESSNLASVCAICVNYRLTEYNSSLKALKSRRDSLYSRLTEALVAKGKADDQLN 65

Query: 1344 WRAVQNVELKKLKERLTYLKKKLAEDKAKVSKESNDLKLKYGSLELAFSTMKNKQTEVLD 1165
            WR +QN +L +L+E+L   K++L + KAK+ K S DLK+K G LE A + ++  + E L+
Sbjct: 66   WRVLQNEKLVRLREKLRCNKEQLVQGKAKIEKTSYDLKVKSGVLESALAVLEKNRAEQLE 125

Query: 1164 KLRSDLMSNQRLVDTGITPECIRRQAVVIENICKLFPMCRVNPD-KIKDRSNNMSNQICN 988
            K   + +  Q L    IT E + +Q+VVI+ ICKLFP  RV  D K KD S    +QICN
Sbjct: 126  KFYPNFICTQNLGHMAITSERLHKQSVVIKQICKLFPQRRVTVDAKRKDASGGQYDQICN 185

Query: 987  ARLPRGLDPHSVPPEELAASLGYMVQXXXXXXXXXXXXXLHNSGFAGSCSRIWQRNSYWD 808
            A LPRGLDPHSVP EELAASLGYMVQ             LHNSGFAGSCSRIWQR+SYWD
Sbjct: 186  ACLPRGLDPHSVPSEELAASLGYMVQLLNLVVQNLAAPALHNSGFAGSCSRIWQRDSYWD 245

Query: 807  ARPSSQSKEYPLFIPRQNFC-XXXXXXXXXXXSLNFGVASVESERRPHLESSRNSSFNYS 631
            ARPSS+S EYPLFIPRQN+C            S NFGVAS++SER+PHL+SS +SSFNY+
Sbjct: 246  ARPSSRSNEYPLFIPRQNYCSTSGENSWSDRSSSNFGVASIDSERKPHLDSSGSSSFNYT 305

Query: 630  LASPHSLENHTDLQKGILLLKKSVACITAYSCNSLGLDIPSEASTFEAFGKLLFILSSSK 451
             AS HS+E H DLQ+GI LLKKSVACITAY  NSL LD+PSEASTFEAF KLL  LSSSK
Sbjct: 306  SASQHSVETHKDLQRGISLLKKSVACITAYCYNSLCLDVPSEASTFEAFAKLLATLSSSK 365

Query: 450  ELQSIRNSLKTASSSTEKQAQQLKRXXXXXXXXXXXXNTLAESE-QTITSTVDDDNLQNP 274
            E+ S+  SLK A S + KQ QQL +              L  +   T+T  + + NL   
Sbjct: 366  EVHSV-FSLKMACSRSCKQVQQLNKSVWNVNSAISSTTLLDSAHAMTMTKNLYEYNLPTY 424

Query: 273  DSSFMFTAVMIN--KSESIVEGWNIIEHPTLPPPPSKTEDVEHWMRAMFIDATKK 115
             +S + +  + +  K+ES+VEGW+++EHPT PPPPS++ED+EHW RAMFIDA +K
Sbjct: 425  ATSSLCSTELSDSGKNESLVEGWDLVEHPTFPPPPSQSEDIEHWTRAMFIDAKRK 479


>gb|EOY16120.1| DNA-directed RNA polymerase II protein isoform 1 [Theobroma cacao]
          Length = 479

 Score =  460 bits (1184), Expect = e-127
 Identities = 256/475 (53%), Positives = 320/475 (67%), Gaps = 5/475 (1%)
 Frame = -2

Query: 1524 SSCCAICEGSNLASICAPCVNHRLNEYSALLKSLNTVCKSHYSXXXXXXXXXXXXDQQIN 1345
            +S CAIC+ SN ASICA CVN+RLNEY++LLKSL +     YS            D Q+N
Sbjct: 6    ASNCAICDNSNRASICAVCVNYRLNEYNSLLKSLKSRRDFLYSKLDEVLAAKRKADDQLN 65

Query: 1344 WRAVQNVELKKLKERLTYLKKKLAEDKAKVSKESNDLKLKYGSLELAFSTMKNKQTEVLD 1165
            W+ +QN +L  LKE+L   K++LA+ KAK+ + S DLK+KYG LE A   ++  + E L+
Sbjct: 66   WKILQNEKLTDLKEKLRRSKEQLAQGKAKIERVSYDLKVKYGVLESARGMLEKNRVEKLE 125

Query: 1164 KLRSDLMSNQRLVDTGITPECIRRQAVVIENICKLFPMCRVNPD-KIKDRSNNMSNQICN 988
            K   +L+  Q L    IT E + +Q+VVI+ ICKLFP  RVN D + +D S    + ICN
Sbjct: 126  KFYPNLICTQSLGLMAITSERLHKQSVVIKQICKLFPQRRVNLDGEGRDGSCGQYDLICN 185

Query: 987  ARLPRGLDPHSVPPEELAASLGYMVQXXXXXXXXXXXXXLHNSGFAGSCSRIWQRNSYWD 808
              LPRGLDPHSVP E+LAASLGYMVQ             LHNSGFAGSCSRIWQR+SYW+
Sbjct: 186  VGLPRGLDPHSVPSEQLAASLGYMVQLLNLVVHNLAAPALHNSGFAGSCSRIWQRDSYWN 245

Query: 807  ARPSSQSKEYPLFIPRQNFC-XXXXXXXXXXXSLNFGVASVESERRPHLESSRNSSFNYS 631
            ARPSS+S EYPLFIPRQN+C            S NFGVAS+ESERRP L+SS ++SFNYS
Sbjct: 246  ARPSSRSNEYPLFIPRQNYCSTSGDNSWTDRSSSNFGVASMESERRPRLDSSGSNSFNYS 305

Query: 630  LASPHSLENHTDLQKGILLLKKSVACITAYSCNSLGLDIPSEASTFEAFGKLLFILSSSK 451
             AS H++E H DLQ GI LLKKSVACITA+  NSL LD+P+EASTFEAF KLL  LSS+K
Sbjct: 306  SASSHTVETHKDLQIGISLLKKSVACITAFCYNSLCLDVPTEASTFEAFSKLLATLSSTK 365

Query: 450  ELQSIRNSLKTASSSTEKQAQQLKRXXXXXXXXXXXXNTLAESEQ-TITSTVDDDNLQNP 274
            E++S+  SLK A S + KQAQQL +              L  +    +T  + D NL + 
Sbjct: 366  EVRSV-FSLKMACSRSSKQAQQLNKSVWNVNSAMSSSMLLESAHMLPLTKNLSDHNLPSS 424

Query: 273  DSSFMFTAVM--INKSESIVEGWNIIEHPTLPPPPSKTEDVEHWMRAMFIDATKK 115
             +SF+F   M  I K+ES++E W+++EHPT PPPPS+TEDVEHW RAMFIDATK+
Sbjct: 425  AASFLFATEMPDIGKNESLIEEWDLVEHPTFPPPPSQTEDVEHWTRAMFIDATKR 479


>ref|XP_003542641.1| PREDICTED: uncharacterized protein LOC100813297 [Glycine max]
          Length = 474

 Score =  454 bits (1169), Expect = e-125
 Identities = 255/472 (54%), Positives = 314/472 (66%), Gaps = 2/472 (0%)
 Frame = -2

Query: 1524 SSCCAICEGSNLASICAPCVNHRLNEYSALLKSLNTVCKSHYSXXXXXXXXXXXXDQQIN 1345
            +S CAICE SN ASIC+ CVN+RLNEY+  LK L     S YS            D Q N
Sbjct: 5    TSNCAICENSNQASICSICVNYRLNEYNTSLKLLKDRRDSLYSKLSEVLVRKGKGDDQAN 64

Query: 1344 WRAVQNVELKKLKERLTYLKKKLAEDKAKVSKESNDLKLKYGSLELAFSTMKNKQTEVLD 1165
            WR +Q+ +L +LKE+L   K+++ + +AK+  +S DLKLKYG LE A ST++  + E L+
Sbjct: 65   WRVLQHEKLARLKEKLRQGKEQVTQGRAKIETKSADLKLKYGLLESALSTLEKNRVEQLE 124

Query: 1164 KLRSDLMSNQRLVDTGITPECIRRQAVVIENICKLFPMCRVNPDKIK-DRSNNMSNQICN 988
            K   +L+  Q L    IT E + +Q+VVI+ ICKLFP  RV  +  + D      +QICN
Sbjct: 125  KFYPNLICTQSLGHVAITSERLHKQSVVIKQICKLFPQRRVVIEGERGDGCCGQFDQICN 184

Query: 987  ARLPRGLDPHSVPPEELAASLGYMVQXXXXXXXXXXXXXLHNSGFAGSCSRIWQRNSYWD 808
            ARLPR LDP SVP EEL+ SLGYMVQ             LHNSGFAGSCSRIWQR+SYWD
Sbjct: 185  ARLPRALDPRSVPSEELSTSLGYMVQLLNLIVHNLAAPALHNSGFAGSCSRIWQRDSYWD 244

Query: 807  ARPSSQSKEYPLFIPRQNFC-XXXXXXXXXXXSLNFGVASVESERRPHLESSRNSSFNYS 631
            ARPSS+S EYPLFIPRQN+C            S NFGVAS+ESERR  L+SS +SSFNYS
Sbjct: 245  ARPSSRSNEYPLFIPRQNYCSTGGENSWSERSSSNFGVASMESERRHRLDSSGSSSFNYS 304

Query: 630  LASPHSLENHTDLQKGILLLKKSVACITAYSCNSLGLDIPSEASTFEAFGKLLFILSSSK 451
            LAS HS++ H DLQKGI LLKKSVACITAY  NSL LD+PSEASTFEAF KLL  LSSSK
Sbjct: 305  LASSHSVQTHKDLQKGISLLKKSVACITAYCYNSLCLDVPSEASTFEAFAKLLATLSSSK 364

Query: 450  ELQSIRNSLKTASSSTEKQAQQLKRXXXXXXXXXXXXNTLAESEQTITSTVDDDNLQNPD 271
            E++S+  SLK   S T KQ QQL +             TL ES  ++ +T  ++ L +  
Sbjct: 365  EVRSV-FSLKMPRSRTCKQVQQLNK-SVWNMNSAISSTTLLESAHSVPTTRIENYLPSAT 422

Query: 270  SSFMFTAVMINKSESIVEGWNIIEHPTLPPPPSKTEDVEHWMRAMFIDATKK 115
            +SF++      K+E +VEGW+I+EHPT PPPPS++EDVEHW RAMFIDA +K
Sbjct: 423  ASFLYATDSDGKNECLVEGWDIVEHPTFPPPPSQSEDVEHWTRAMFIDAKRK 474


>gb|ESW13116.1| hypothetical protein PHAVU_008G169200g [Phaseolus vulgaris]
            gi|561014256|gb|ESW13117.1| hypothetical protein
            PHAVU_008G169200g [Phaseolus vulgaris]
          Length = 476

 Score =  453 bits (1166), Expect = e-125
 Identities = 257/474 (54%), Positives = 319/474 (67%), Gaps = 4/474 (0%)
 Frame = -2

Query: 1524 SSCCAICEGSNLASICAPCVNHRLNEYSALLKSLNTVCKSHYSXXXXXXXXXXXXDQQIN 1345
            +S CAICE SN ASIC+ CVN+RLNEY+  LKSL     S YS            D Q N
Sbjct: 5    TSNCAICENSNQASICSICVNYRLNEYNTSLKSLKDRRDSLYSKLSEVLVQKGKGDDQEN 64

Query: 1344 WRAVQNVELKKLKERLTYLKKKLAEDKAKVSKESNDLKLKYGSLELAFSTMKNKQTEVLD 1165
            +  +QN +L +LKE+L   K+++ + +AK+   S DLK KYG LE A ST++  + E L+
Sbjct: 65   YIVLQNEKLARLKEKLHRSKEQVTQGRAKIETVSADLKHKYGLLESALSTLEKNRVEQLE 124

Query: 1164 KLRSDLMSNQRLVDTGITPECIRRQAVVIENICKLFPMCRVNPD-KIKDRSNNMSNQICN 988
            K   +L+  Q L    IT E + +Q+VVI+ ICKLFP  RV  + +I+D  +   +QICN
Sbjct: 125  KFYPNLICTQSLGHVAITSERLHKQSVVIKQICKLFPQRRVVIEGEIRDGCSGQYDQICN 184

Query: 987  ARLPRGLDPHSVPPEELAASLGYMVQXXXXXXXXXXXXXLHNSGFAGSCSRIWQRNSYWD 808
            ARLPR LDPHSVP EEL+ASLGYMVQ             LHNSGFAGSCSRIWQR+SYWD
Sbjct: 185  ARLPRALDPHSVPSEELSASLGYMVQLLNLVVHNLAAPALHNSGFAGSCSRIWQRDSYWD 244

Query: 807  ARPSSQSKEYPLFIPRQNFC--XXXXXXXXXXXSLNFGVASVESERRPHLESSRNSSFNY 634
            ARPSS+S EYPLFIPRQN+C             S NFGVAS+ESE+R  L+SS NS+FNY
Sbjct: 245  ARPSSRSNEYPLFIPRQNYCSTAGENSWSTDKSSSNFGVASMESEKRNRLDSSGNSNFNY 304

Query: 633  SLASPHSLENHTDLQKGILLLKKSVACITAYSCNSLGLDIPSEASTFEAFGKLLFILSSS 454
            SLAS HS++ H DLQKGI LLKKSVACITAY  NSL LD PSEASTFE+F KLL  LSSS
Sbjct: 305  SLASLHSVQTHKDLQKGISLLKKSVACITAYCYNSLCLDAPSEASTFESFAKLLATLSSS 364

Query: 453  KELQSIRNSLKTASSSTEKQAQQLKRXXXXXXXXXXXXNTLAESEQTITSTVDDDNLQNP 274
            KE++S+  SLK A S T KQ QQL +             TL ES  ++ +T  ++ L + 
Sbjct: 365  KEVRSV-FSLKMAQSRTCKQVQQLNK-SVWNMNSVISSTTLLESAHSVPTTRIENYLPSS 422

Query: 273  DSSFMF-TAVMINKSESIVEGWNIIEHPTLPPPPSKTEDVEHWMRAMFIDATKK 115
             +SF++ T +   K+E ++EGW+IIEHPT PPPPS++EDVEHW RAMFIDA +K
Sbjct: 423  TASFLYATDLNDGKNECLIEGWDIIEHPTFPPPPSQSEDVEHWTRAMFIDAKRK 476


>ref|XP_003544777.1| PREDICTED: uncharacterized protein LOC100776426 isoformX1 [Glycine
            max]
          Length = 475

 Score =  453 bits (1165), Expect = e-124
 Identities = 253/473 (53%), Positives = 316/473 (66%), Gaps = 3/473 (0%)
 Frame = -2

Query: 1524 SSCCAICEGSNLASICAPCVNHRLNEYSALLKSLNTVCKSHYSXXXXXXXXXXXXDQQIN 1345
            +S CAICE SN ASIC+ CVN+RLNEY+  LK L     S Y             D Q N
Sbjct: 5    TSNCAICENSNQASICSICVNYRLNEYNTSLKLLKDRRDSLYLKLSEVLVRKGKGDDQAN 64

Query: 1344 WRAVQNVELKKLKERLTYLKKKLAEDKAKVSKESNDLKLKYGSLELAFSTMKNKQTEVLD 1165
            WR +Q+ +L +LKE+L   K+++ + +AK+   S DLKLKYG LE A ST++  + E L+
Sbjct: 65   WRVLQHEKLARLKEKLRQSKEQVTQGRAKIETMSADLKLKYGLLESALSTLEKNRVEQLE 124

Query: 1164 KLRSDLMSNQRLVDTGITPECIRRQAVVIENICKLFPMCRVNPD-KIKDRSNNMSNQICN 988
            K   +L+  Q L    IT E + +++VVI+ ICKLFP  RV  + + +D  +   +QICN
Sbjct: 125  KFYPNLICTQSLGHVAITSELLHKESVVIKQICKLFPQRRVVIEGERRDGCSGQYDQICN 184

Query: 987  ARLPRGLDPHSVPPEELAASLGYMVQXXXXXXXXXXXXXLHNSGFAGSCSRIWQRNSYWD 808
            ARLPR LDPHSVP EEL+ SLGYMVQ             LHNSGFAGSCSRIWQR+SYWD
Sbjct: 185  ARLPRALDPHSVPSEELSTSLGYMVQLLNLVIHNLAAPALHNSGFAGSCSRIWQRDSYWD 244

Query: 807  ARPSSQSKEYPLFIPRQNFC-XXXXXXXXXXXSLNFGVASVESERRPHLESSRNSSFNYS 631
            ARPSS+S EYPLFIPRQN+C            S NFGVASVESERR  L+SS ++SFNYS
Sbjct: 245  ARPSSRSNEYPLFIPRQNYCSTDGENSWSERSSSNFGVASVESERRHRLDSSGSTSFNYS 304

Query: 630  LASPHSLENHTDLQKGILLLKKSVACITAYSCNSLGLDIPSEASTFEAFGKLLFILSSSK 451
            LAS HS++ H DLQKGI LLKKSV CITAY  NSL LD+PSEASTFEAF KLL  L+SSK
Sbjct: 305  LASSHSVQTHKDLQKGISLLKKSVVCITAYCYNSLCLDVPSEASTFEAFAKLLATLASSK 364

Query: 450  ELQSIRNSLKTASSSTEKQAQQLKRXXXXXXXXXXXXNTLAESEQTITSTVDDDNLQNPD 271
            E++S+  SLK A S T KQ QQL +             TL ES  ++ +T  ++ L +  
Sbjct: 365  EVRSV-FSLKMARSRTCKQVQQLNK-SVWNMNSAISSTTLLESAHSVPTTRIENYLPSST 422

Query: 270  SSFMFTAVMIN-KSESIVEGWNIIEHPTLPPPPSKTEDVEHWMRAMFIDATKK 115
             SF++ A + + K+E ++EGW+I+EHPT PPPPS++EDVEHW RAMFIDA  K
Sbjct: 423  GSFLYAADLSDGKNECLIEGWDIVEHPTFPPPPSQSEDVEHWTRAMFIDAKGK 475


>ref|XP_006386390.1| hypothetical protein POPTR_0002s09150g [Populus trichocarpa]
            gi|550344612|gb|ERP64187.1| hypothetical protein
            POPTR_0002s09150g [Populus trichocarpa]
          Length = 506

 Score =  451 bits (1161), Expect = e-124
 Identities = 257/504 (50%), Positives = 314/504 (62%), Gaps = 34/504 (6%)
 Frame = -2

Query: 1524 SSCCAICEGSNLASICAPCVNHRLNEYSALLKSLNTVCKSHYSXXXXXXXXXXXXDQQIN 1345
            SSCCAICE SN ASIC  CVN+RLNEY  LLKSLN+   S YS            D Q N
Sbjct: 5    SSCCAICENSNRASICPICVNYRLNEYGTLLKSLNSRRDSLYSKLSVVLIAKGKADDQFN 64

Query: 1344 WRAVQNVELKKLKERLTYLKKKLAEDKAKVSKESNDLKLKYGSLELAFSTMKNKQTEVLD 1165
            WR  QN +L   +E+L   K++LA+ KAKV K S DLK K G LE A + ++  + E L+
Sbjct: 65   WRVQQNEKLASSREKLHRNKEQLAQGKAKVEKLSQDLKKKNGMLESARNVLEKNRMEQLE 124

Query: 1164 KLRSDLMSNQRLVDTGITPECIRRQAVVIENICKLFPMCRVNPDKIKDRSNNMSNQICNA 985
            K   +L+  Q L    IT E + +Q+VVI+ ICKLFP  RVN D  ++ S    +QICNA
Sbjct: 125  KFYPNLICTQSLGHMAITSELLHKQSVVIKQICKLFPQRRVNVDGERNFSGQY-DQICNA 183

Query: 984  RLPRGLDPHSVPPEELAASLGYMVQXXXXXXXXXXXXXLHNSGFAGSCSRIWQRNSYWDA 805
            RLPRGLDPHSV  EELAASLGYMVQ             LHN+GFAGSCSRIWQR+SYW+A
Sbjct: 184  RLPRGLDPHSVSSEELAASLGYMVQLLNLVAHNLAAPTLHNAGFAGSCSRIWQRDSYWNA 243

Query: 804  RPSSQ-------------------------------SKEYPLFIPRQNFC-XXXXXXXXX 721
             PSS+                               S EYPLFIPRQN+C          
Sbjct: 244  CPSSRRYFDWKSLCFGISVAKFELLLLSELNILCACSNEYPLFIPRQNYCSTSSENSWTD 303

Query: 720  XXSLNFGVASVESERRPHLESSRNSSFNYSLASPHSLENHTDLQKGILLLKKSVACITAY 541
              S NFGVAS+ESERRPHL+S+R++SFNYS  SPHS+E H DLQKG+ LLKKSVAC+TAY
Sbjct: 304  KSSSNFGVASMESERRPHLDSTRSNSFNYSSVSPHSVETHKDLQKGVSLLKKSVACVTAY 363

Query: 540  SCNSLGLDIPSEASTFEAFGKLLFILSSSKELQSIRNSLKTASSSTEKQAQQLKRXXXXX 361
              N L LD+PS+ STFEAF KLL  LSSSKE++S+ N LK A S + KQ Q+L +     
Sbjct: 364  CYNLLCLDVPSDTSTFEAFAKLLSTLSSSKEVRSVFN-LKMACSRSCKQVQKLNKSVWNV 422

Query: 360  XXXXXXXNTLAESEQ-TITSTVDDDNLQNPDSSFMF-TAVMINKSESIVEGWNIIEHPTL 187
                     L  +    +     D+NL N  +SF+F T +   K+ES ++GW+++EHPT 
Sbjct: 423  NSAISSSALLESAHALQLMKNTSDNNLPNSAASFLFATGISDGKNESFIDGWDLVEHPTF 482

Query: 186  PPPPSKTEDVEHWMRAMFIDATKK 115
            PPPPS+ ED+EHW RAMFIDATKK
Sbjct: 483  PPPPSQVEDIEHWTRAMFIDATKK 506


>ref|XP_003614901.1| hypothetical protein MTR_5g061040 [Medicago truncatula]
            gi|355516236|gb|AES97859.1| hypothetical protein
            MTR_5g061040 [Medicago truncatula]
          Length = 501

 Score =  447 bits (1151), Expect = e-123
 Identities = 256/483 (53%), Positives = 318/483 (65%), Gaps = 16/483 (3%)
 Frame = -2

Query: 1524 SSCCAICEGSNLASICAPCVNHRLNEYSALLKSLNTVCKSHYSXXXXXXXXXXXXDQQIN 1345
            S+ CAICE  N  SIC+ CVN+RLNEY++ LKSL     S YS            D Q N
Sbjct: 5    STNCAICENLNQPSICSVCVNYRLNEYNSSLKSLKERRDSLYSKLSEVLVRKGKGDDQTN 64

Query: 1344 WRAVQNVELKKLKERLTYLKKKLAEDKAKVSKESNDLKLKYGSLELAFSTMKNKQTEVLD 1165
            WR +++ +L + +E+L + K+++ + +AK+   S DLKLKYG LE A S ++  + E L+
Sbjct: 65   WRVLRHEKLARSREKLRHNKEQVTQGRAKIQAMSADLKLKYGVLESALSMLEKNRVEQLE 124

Query: 1164 KLRSDLMSNQRLVDTGITPECIRRQAVVIENICKLFPMCRVNPDKIK-DRSNNMSNQICN 988
            K   +L+  Q L    IT E + +Q+VVI+ ICKLFP  RV  +  K D  +   +QICN
Sbjct: 125  KFYPNLICTQSLGHVAITSERLHKQSVVIKQICKLFPQRRVVIEGEKGDDCSGQYDQICN 184

Query: 987  ARLPRGLDPHSVPPEELAASLGYMVQXXXXXXXXXXXXXLHNSGFAGSCSRIWQRNSYWD 808
            ARLPR LDPHSVP EEL+ASLGYMVQ             LHNSGFAGSCSRIWQR+SYWD
Sbjct: 185  ARLPRALDPHSVPSEELSASLGYMVQLLNLVAHNLAAPALHNSGFAGSCSRIWQRDSYWD 244

Query: 807  ARPSSQSK-------------EYPLFIPRQNFCXXXXXXXXXXXSL-NFGVASVESERRP 670
            ARPSS+SK             EYPLFIPRQN+C           S  NFGVAS+ES+RRP
Sbjct: 245  ARPSSRSKNFFNLKYSLFFSNEYPLFIPRQNYCSTSGENSWSEKSSSNFGVASMESDRRP 304

Query: 669  HLESSRNSSFNYSLASPHSLENHTDLQKGILLLKKSVACITAYSCNSLGLDIPSEASTFE 490
             L+SS +SSFNYSLAS HS+++H DLQKGI LLKKSVACITAY  NSL  DIPSEASTFE
Sbjct: 305  RLDSSGSSSFNYSLASSHSVQSHKDLQKGISLLKKSVACITAYCYNSLCFDIPSEASTFE 364

Query: 489  AFGKLLFILSSSKELQSIRNSLKTASSSTEKQAQQLKRXXXXXXXXXXXXNTLAESEQTI 310
            AF KLL  LSSSKE++S+  SLK A S T KQ QQL +             TL ES  ++
Sbjct: 365  AFAKLLATLSSSKEVRSV-FSLKMARSRTCKQVQQLNK-SVWNMNSANSSTTLLESTHSV 422

Query: 309  TSTVDDDNLQNPDSSFMF-TAVMINKSESIVEGWNIIEHPTLPPPPSKTEDVEHWMRAMF 133
             +T  ++ + N  +SF++ T     KSE ++EGW+I+EHPTLPPPPS++EDVEHW RAMF
Sbjct: 423  PTTRIENYMPNSAASFLYPTDSSDRKSECLIEGWDIVEHPTLPPPPSQSEDVEHWTRAMF 482

Query: 132  IDA 124
            IDA
Sbjct: 483  IDA 485


>gb|EXC06677.1| hypothetical protein L484_021513 [Morus notabilis]
          Length = 478

 Score =  447 bits (1149), Expect = e-123
 Identities = 254/475 (53%), Positives = 314/475 (66%), Gaps = 5/475 (1%)
 Frame = -2

Query: 1524 SSCCAICEGSNLASICAPCVNHRLNEYSALLKSLNTVCKSHYSXXXXXXXXXXXXDQQIN 1345
            S+ CA+CE SNL SIC+ CVN+RL ++  +LKS  +   S YS            D Q+ 
Sbjct: 5    STSCALCENSNLPSICSICVNYRLADHYNILKSNKSHRDSLYSRLEEVLLAKGKADDQVG 64

Query: 1344 WRAVQNVELKKLKERLTYLKKKLAEDKAKVSKESNDLKLKYGSLELAFSTMKNKQTEVLD 1165
            WR  QN +L KL+E+    K++L + KAKV +   DLK+K G LE A S ++N + E L+
Sbjct: 65   WRMSQNEKLAKLREKHRRSKERLVQGKAKVERMHYDLKVKSGVLEAARSMLENNRMEQLE 124

Query: 1164 KLRSDLMSNQRLVDTGITPECIRRQAVVIENICKLFPMCRVNPD-KIKDRSNNMSNQICN 988
            K   + +  Q L    IT E + +Q+VVI+ ICKLFP  RV  D + K+ S    +QICN
Sbjct: 125  KFYPNFICTQTLGHMAITSERLHKQSVVIKQICKLFPHRRVIIDGERKNGSAEQYDQICN 184

Query: 987  ARLPRGLDPHSVPPEELAASLGYMVQXXXXXXXXXXXXXLHNSGFAGSCSRIWQRNSYWD 808
            ARLPRG+DPHSV  EEL ASLGYMVQ             LHNSGFAGS SRIWQR+SYWD
Sbjct: 185  ARLPRGVDPHSVASEELGASLGYMVQLLNLIVRILAAPALHNSGFAGSNSRIWQRDSYWD 244

Query: 807  ARPSSQSKEYPLFIPRQNFCXXXXXXXXXXXSL-NFGVASVESERRPHLESSRNSSFNYS 631
            ARPSS+S EYPLFIPRQN+C           S  NFGV S+ESER+  L+SS ++SFNYS
Sbjct: 245  ARPSSRSNEYPLFIPRQNYCSTSVENSWSDRSSSNFGVTSIESERKVRLDSSGSNSFNYS 304

Query: 630  LASPHSLENHTDLQKGILLLKKSVACITAYSCNSLGLDIPSEASTFEAFGKLLFILSSSK 451
             ASPHS+E H DLQKGI LLKKSVACIT Y  NSL LD+PSEASTFEAF KLL  LSSSK
Sbjct: 305  SASPHSIETHKDLQKGISLLKKSVACITTYCYNSLCLDVPSEASTFEAFAKLLATLSSSK 364

Query: 450  ELQSIRNSLKTASSSTEKQAQQLKRXXXXXXXXXXXXNTLAESEQTITS--TVDDDNLQN 277
            EL+S+  S+K+A S + KQ QQL +             TL +S  T+ S   + ++NL N
Sbjct: 365  ELRSV-CSIKSACSRSNKQVQQLNK-SVWNVNSAFASTTLLDSAHTVASMKNIGENNLPN 422

Query: 276  PDSSFMF-TAVMINKSESIVEGWNIIEHPTLPPPPSKTEDVEHWMRAMFIDATKK 115
            P +SF++ T     K+E I+EGW++IEHPT PPPPS+ EDVEHW RAMFIDATKK
Sbjct: 423  PATSFLYATESDAGKNEFIIEGWDLIEHPTFPPPPSQCEDVEHWTRAMFIDATKK 477


>ref|XP_004237862.1| PREDICTED: uncharacterized protein LOC101264619 [Solanum
            lycopersicum]
          Length = 481

 Score =  432 bits (1110), Expect = e-118
 Identities = 239/480 (49%), Positives = 311/480 (64%), Gaps = 10/480 (2%)
 Frame = -2

Query: 1524 SSCCAICEGSNLASICAPCVNHRLNEYSALLKSLNTVCKSHYSXXXXXXXXXXXXDQQIN 1345
            +SCC ICE SNL S+C  CVN+RLNEYS +LKSL    ++               D Q++
Sbjct: 5    TSCCGICENSNLPSVCTLCVNYRLNEYSTVLKSLKGRREALCGQLSEILLAKGKADDQLS 64

Query: 1344 WRAVQNVELKKLKERLTYLKKKLAEDKAKVSKESNDLKLKYGSLELAFSTMKNKQTEVLD 1165
            WR  +N +L +L+E+L   K+++++ KAK+ K S+DLK++Y  L  A   ++  + E L+
Sbjct: 65   WRVPRNEKLARLREKLRQQKEQVSQGKAKIEKMSHDLKVQYELLGSATRMLEKNRAEQLE 124

Query: 1164 KLRSDLMSNQRLVDTGITPECIRRQAVVIENICKLFPMCRVNPD-KIKDRSNNMSNQICN 988
            K   +L+  Q L    IT E + +Q+VV++ ICKLFP  RV  D   KD S+   + ICN
Sbjct: 125  KFYPNLICTQNLGHMAITSELLHKQSVVVKQICKLFPQRRVTIDGDKKDGSSGQYDSICN 184

Query: 987  ARLPRGLDPHSVPPEELAASLGYMVQXXXXXXXXXXXXXLHNSGFAGSCSRIWQRNSYWD 808
            ARLP+GLDPHSVP +EL+ASLGYMVQ             LHNSGFAGSCSRIWQR+SYWD
Sbjct: 185  ARLPKGLDPHSVPSDELSASLGYMVQLLNLVVRCVCAPALHNSGFAGSCSRIWQRDSYWD 244

Query: 807  ARPSSQSKEYPLFIPRQNFC-------XXXXXXXXXXXSLNFGVASVESERRPHLESSRN 649
            ARPSS+S EYPLFIPRQNFC                  S NFGV S+ES+R+P L+SS +
Sbjct: 245  ARPSSRSGEYPLFIPRQNFCSSGGEASWYDRSSSNSGTSSNFGVTSMESDRKPRLDSSSS 304

Query: 648  SSFNYSLASPHSLENHTDLQKGILLLKKSVACITAYSCNSLGLDIPSEASTFEAFGKLLF 469
            SSFNY+ AS HS+E H DLQKGI LLKKSVACITAY  N+L L++P+EASTFE F +LL 
Sbjct: 305  SSFNYASASLHSIETHKDLQKGIALLKKSVACITAYCYNTLCLEVPAEASTFETFARLLA 364

Query: 468  ILSSSKELQSIRNSLKTASSSTEKQAQQLKRXXXXXXXXXXXXNTLAESEQTITSTVDDD 289
             LSSSKE++S+  SLK + S   KQ Q L +            +TL ES     +T  + 
Sbjct: 365  TLSSSKEVRSV-FSLKMSGSRASKQVQPLNK-SVWNVDSAGSSSTLMESGHVPRNTF-EK 421

Query: 288  NLQNPDSSFMFTAVMIN--KSESIVEGWNIIEHPTLPPPPSKTEDVEHWMRAMFIDATKK 115
            +L +   + M+   + N  ++E+++E W++IEHP  PPPPS TEDVEHW RAMFIDATKK
Sbjct: 422  SLPSSGGNLMYATEVSNVGRNENLIEDWDLIEHPPFPPPPSHTEDVEHWTRAMFIDATKK 481


>ref|XP_006354053.1| PREDICTED: uncharacterized protein LOC102590673 isoform X1 [Solanum
            tuberosum] gi|565375051|ref|XP_006354054.1| PREDICTED:
            uncharacterized protein LOC102590673 isoform X2 [Solanum
            tuberosum]
          Length = 483

 Score =  430 bits (1105), Expect = e-118
 Identities = 232/480 (48%), Positives = 305/480 (63%), Gaps = 10/480 (2%)
 Frame = -2

Query: 1524 SSCCAICEGSNLASICAPCVNHRLNEYSALLKSLNTVCKSHYSXXXXXXXXXXXXDQQIN 1345
            +SCC ICE SNL S+C  CVN+RLNEYS +LKSL    ++               D Q++
Sbjct: 5    TSCCGICENSNLPSVCTLCVNYRLNEYSTVLKSLKGRREALCGKLSEILLAKGKADDQLS 64

Query: 1344 WRAVQNVELKKLKERLTYLKKKLAEDKAKVSKESNDLKLKYGSLELAFSTMKNKQTEVLD 1165
            WR  +N +L +L+E+L   K+++++ KAK+ K S+DLK++Y  L  A   ++  + E L+
Sbjct: 65   WRVPRNEKLARLREKLRQQKEQISQGKAKIEKMSHDLKVQYELLGSATRMLEKNRAEQLE 124

Query: 1164 KLRSDLMSNQRLVDTGITPECIRRQAVVIENICKLFPMCRVNPD-KIKDRSNNMSNQICN 988
            K   +L+  Q L    IT E + +Q+VV++ ICKLFP  RV  D   KD S+   + ICN
Sbjct: 125  KFYPNLICTQNLGHMAITSELLHKQSVVVKQICKLFPQRRVTIDGDKKDGSSGQYDSICN 184

Query: 987  ARLPRGLDPHSVPPEELAASLGYMVQXXXXXXXXXXXXXLHNSGFAGSCSRIWQRNSYWD 808
            ARLP+GLDPHSVP +EL+ASLGYMVQ             LHNSGFAGSCSRIWQR+SYWD
Sbjct: 185  ARLPKGLDPHSVPSDELSASLGYMVQLLNLVIRCVCAPALHNSGFAGSCSRIWQRDSYWD 244

Query: 807  ARPSSQSKEYPLFIPRQNFCXXXXXXXXXXXSL-------NFGVASVESERRPHLESSRN 649
            ARPSS+S EYPLFIPRQNFC           S        NFGV S+ES+R+P L+SS +
Sbjct: 245  ARPSSRSGEYPLFIPRQNFCSSGGEASWYDRSCSNSGTSSNFGVTSMESDRKPRLDSSSS 304

Query: 648  SSFNYSLASPHSLENHTDLQKGILLLKKSVACITAYSCNSLGLDIPSEASTFEAFGKLLF 469
            SSFNY+ AS HS+E H DLQKGI LLKKSVACITAY  N+L L++P+EASTFE F +LL 
Sbjct: 305  SSFNYASASLHSIETHKDLQKGIALLKKSVACITAYCYNTLCLEVPAEASTFETFARLLA 364

Query: 468  ILSSSKELQSIRNSLKTASSSTEKQAQQLKRXXXXXXXXXXXXNTLAESEQTITSTVDDD 289
             LSSSKE++S+  SLK + S   KQ Q L +              +      +     ++
Sbjct: 365  TLSSSKEVRSV-FSLKMSGSRASKQVQPLNKSVWNVDSAGSSSTLMESGHVPVLRNTFEN 423

Query: 288  NLQNPDSSFMFTAVM--INKSESIVEGWNIIEHPTLPPPPSKTEDVEHWMRAMFIDATKK 115
             L +   + ++   +    ++E+++E W++IEHP  PPPPS TEDVEHW RAMFIDATKK
Sbjct: 424  ALPSSSGNLIYATEVSDARRNENLIEDWDLIEHPPFPPPPSHTEDVEHWTRAMFIDATKK 483


>ref|XP_006397280.1| hypothetical protein EUTSA_v10028627mg [Eutrema salsugineum]
            gi|557098297|gb|ESQ38733.1| hypothetical protein
            EUTSA_v10028627mg [Eutrema salsugineum]
          Length = 474

 Score =  424 bits (1090), Expect = e-116
 Identities = 240/473 (50%), Positives = 311/473 (65%), Gaps = 3/473 (0%)
 Frame = -2

Query: 1524 SSCCAICEGSNLASICAPCVNHRLNEYSALLKSLNTVCKSHYSXXXXXXXXXXXXDQQIN 1345
            SS CAICE +N ASIC+ CVN+RL EYS LLKSL T   + YS            D Q N
Sbjct: 5    SSNCAICENTNRASICSVCVNYRLIEYSTLLKSLKTRRDALYSKLSELLEAKGKADDQKN 64

Query: 1344 WRAVQNVELKKLKERLTYLKKKLAEDKAKVSKESNDLKLKYGSLELAFSTMKNKQTEVLD 1165
            W+ +QN +L  LK  L   K+++ + KAK+ +ES DLKLKYG L+ A ST++  + E ++
Sbjct: 65   WKLIQNEKLSGLKNNLRRNKEQVTQGKAKIERESRDLKLKYGVLDSARSTLERIRVEQVE 124

Query: 1164 KLRSDLMSNQRLVDTGITPECIRRQAVVIENICKLFPMCRVNPD-KIKDRSNNMSNQICN 988
            K   +L+  Q L    I+ E + +Q+VV++ +CKLFP  RV+ D + ++ S    N ICN
Sbjct: 125  KYFPNLICTQSLGHMAISSERLHKQSVVMKQVCKLFPQRRVSFDGESQNGSVGQYNLICN 184

Query: 987  ARLPRGLDPHSVPPEELAASLGYMVQXXXXXXXXXXXXXLHNSGFAGSCSRIWQRNSYWD 808
            +RLP+GLDPHS+P EELAASLG MVQ             LHNSGFAGSCSRIWQR+SYWD
Sbjct: 185  SRLPKGLDPHSIPSEELAASLGLMVQLLNLVVHNLAAPALHNSGFAGSCSRIWQRDSYWD 244

Query: 807  ARPSSQSKEYPLFIPRQNFC-XXXXXXXXXXXSLNFGVASVESERR-PHLESSRNSSFNY 634
            ARPS++S EYPLFIPRQN+C            S NFGVAS+ES+R+   L+S+  +SFNY
Sbjct: 245  ARPSTRSNEYPLFIPRQNYCSTSVENSWTDKNSSNFGVASMESDRKEARLDSTGRNSFNY 304

Query: 633  SLASPHSLENHTDLQKGILLLKKSVACITAYSCNSLGLDIPSEASTFEAFGKLLFILSSS 454
            S ASPHS+E+H DLQKGI LLKKSVAC+TAY  NSL L++P EASTFEAF KLL  LSSS
Sbjct: 305  SSASPHSVESHRDLQKGIALLKKSVACLTAYCYNSLCLEVPPEASTFEAFAKLLATLSSS 364

Query: 453  KELQSIRNSLKTASSSTEKQAQQLKRXXXXXXXXXXXXNTLAESEQTITSTVDDDNLQNP 274
            KE++S+  SLK ASS + KQAQQL +            +++ ES     +   + +  + 
Sbjct: 365  KEVRSV-FSLKMASSRSCKQAQQLNK--SIWNAHSVISSSILESSHLPRNASYNQDPNSA 421

Query: 273  DSSFMFTAVMINKSESIVEGWNIIEHPTLPPPPSKTEDVEHWMRAMFIDATKK 115
             S    T +   +  + + GW+++EHP  PPPPS++EDVEHW RAMFIDA KK
Sbjct: 422  ASYLSGTELSEIRKSNDMNGWDLVEHPKYPPPPSQSEDVEHWTRAMFIDAKKK 474


>ref|XP_004138644.1| PREDICTED: uncharacterized protein LOC101217421 [Cucumis sativus]
            gi|449524750|ref|XP_004169384.1| PREDICTED:
            uncharacterized LOC101217421 [Cucumis sativus]
          Length = 476

 Score =  419 bits (1078), Expect = e-114
 Identities = 238/472 (50%), Positives = 301/472 (63%), Gaps = 5/472 (1%)
 Frame = -2

Query: 1515 CAICEGSNLASICAPCVNHRLNEYSALLKSLNTVCKSHYSXXXXXXXXXXXXDQQINWRA 1336
            CAICE SN ASIC  CVN RLN+Y++ LKSL       YS            D Q+NWR 
Sbjct: 8    CAICENSNQASICTGCVNLRLNDYNSSLKSLRARRDVLYSRLSDVLVAKGKADDQLNWRV 67

Query: 1335 VQNVELKKLKERLTYLKKKLAEDKAKVSKESNDLKLKYGSLELAFSTMKNKQTEVLDKLR 1156
             +N +L  L+E+L   +++L + KA++  +S DL+LKY  LE A S ++ ++ E L+K  
Sbjct: 68   TRNEKLTSLREKLRRSREQLEQGKAEIEMKSFDLQLKYAMLESARSVLEKQRLEQLEKAY 127

Query: 1155 SDLMSNQRLVDTGITPECIRRQAVVIENICKLFPMCRVNPDKIKDRS-NNMSNQICNARL 979
             DL+S + L    IT E + +Q+VVI+ +CKLFP  RV     K+       +QICN  L
Sbjct: 128  PDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRRVLVRGEKEVGPGEPFDQICNVSL 187

Query: 978  PRGLDPHSVPPEELAASLGYMVQXXXXXXXXXXXXXLHNSGFAGSCSRIWQRNSYWDARP 799
            PR LDPHSV P EL+ASLGYMVQ             LH SGFAGSCSRIWQR+SYW+A P
Sbjct: 188  PRSLDPHSVEPYELSASLGYMVQLLNLVVQYLAAPALHTSGFAGSCSRIWQRDSYWNACP 247

Query: 798  SSQSKEYPLFIPRQNFCXXXXXXXXXXXSL-NFGVASVESERRPHLESSRNSSFNYSLAS 622
            SS+S EYP+F+PRQ++C           S  NFGVAS+ESER+P L S  N SFNYS AS
Sbjct: 248  SSRSNEYPVFMPRQSYCSTSGENSWSDKSSSNFGVASLESERKPQLSSLENRSFNYSSAS 307

Query: 621  PHSLENHTDLQKGILLLKKSVACITAYSCNSLGLDIPSEASTFEAFGKLLFILSSSKELQ 442
            PHS+E+H DLQKGI LLKKSVAC+TAY  NSL LD+PSEASTFEAF KLL  LSSSKE++
Sbjct: 308  PHSIESHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLSSSKEVR 367

Query: 441  SIRNSLKTASSSTEKQAQQLKRXXXXXXXXXXXXNTLAES-EQTITSTVDDDNLQNPDSS 265
            S+  SLK ASS + K  Q  K             + L ES    I  T  + NL +  SS
Sbjct: 368  SV-FSLKMASSRSTKHIQ--KPIKSTWNVNSIASSMLFESGHSQIMKTNYESNLPSSASS 424

Query: 264  FMFTAVM--INKSESIVEGWNIIEHPTLPPPPSKTEDVEHWMRAMFIDATKK 115
            +++        K++S +EGW+++EHPT PPPPS+ ED+EHW RAM IDATK+
Sbjct: 425  YLYATEFSDTGKNDSSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMIIDATKQ 476


>ref|NP_192594.2| DNA-directed RNA polymerase II protein [Arabidopsis thaliana]
            gi|23297675|gb|AAN13006.1| unknown protein [Arabidopsis
            thaliana] gi|332657255|gb|AEE82655.1| DNA-directed RNA
            polymerase II protein [Arabidopsis thaliana]
          Length = 473

 Score =  416 bits (1070), Expect = e-113
 Identities = 233/473 (49%), Positives = 310/473 (65%), Gaps = 3/473 (0%)
 Frame = -2

Query: 1524 SSCCAICEGSNLASICAPCVNHRLNEYSALLKSLNTVCKSHYSXXXXXXXXXXXXDQQIN 1345
            SS CAIC+ +N   IC  CVNHRL EY+ LLKSL T   S  S            D Q N
Sbjct: 5    SSNCAICDNTNRPCICTACVNHRLIEYNTLLKSLKTRRDSLLSRFNELLESKGKADDQKN 64

Query: 1344 WRAVQNVELKKLKERLTYLKKKLAEDKAKVSKESNDLKLKYGSLELAFSTMKNKQTEVLD 1165
            WR +QN ++ KLK++L   K+ + + K K+ + S+DLK+KYG L+ A ST++  + E ++
Sbjct: 65   WRLIQNEKISKLKKKLKSNKELVTQGKVKIERGSSDLKVKYGVLDSARSTLEKTRVEQVE 124

Query: 1164 KLRSDLMSNQRLVDTGITPECIRRQAVVIENICKLFPMCRVNPD-KIKDRSNNMSNQICN 988
            K   +L+  Q L    I+ E + +Q+VV++ ICKLFP+ RV+ D + ++ S    + ICN
Sbjct: 125  KYFPNLICTQSLGHMAISSERLHKQSVVVKQICKLFPLRRVSFDGESQNGSVRQYDVICN 184

Query: 987  ARLPRGLDPHSVPPEELAASLGYMVQXXXXXXXXXXXXXLHNSGFAGSCSRIWQRNSYWD 808
            +RLP GLDPHS+P EELA SLGYMVQ             LH+SGFAGSCSRIWQR+SYWD
Sbjct: 185  SRLPSGLDPHSIPSEELAVSLGYMVQLLNLVVHNLAAPALHSSGFAGSCSRIWQRDSYWD 244

Query: 807  ARPSSQSKEYPLFIPRQNFC-XXXXXXXXXXXSLNFGVASVESERR-PHLESSRNSSFNY 634
             R S++S EYPLFIPR+N+C            S NFGVAS+ES+R+ P L+S  ++SF Y
Sbjct: 245  GRTSTRSNEYPLFIPRRNYCSTSVENSWTDKNSSNFGVASMESDRKEPRLDSPGSNSFKY 304

Query: 633  SLASPHSLENHTDLQKGILLLKKSVACITAYSCNSLGLDIPSEASTFEAFGKLLFILSSS 454
            S ASPHS+E+H DLQKGI LLKKSVAC+TAY  NSL L++P EASTFEAF KLL  LSSS
Sbjct: 305  SSASPHSIESHRDLQKGIALLKKSVACLTAYCYNSLCLEVPPEASTFEAFAKLLATLSSS 364

Query: 453  KELQSIRNSLKTASSSTEKQAQQLKRXXXXXXXXXXXXNTLAESEQTITSTVDDDNLQNP 274
            KE++S+  SLK ASS + KQAQQL +            ++L ES     +T  + +  +P
Sbjct: 365  KEVRSV-FSLKMASSRSGKQAQQLNK--SIWNAHSVISSSLLESAHLPRNTSYNQDPNSP 421

Query: 273  DSSFMFTAVMINKSESIVEGWNIIEHPTLPPPPSKTEDVEHWMRAMFIDATKK 115
             +S++    +  +  + + GW+++EHP  PPPPS++EDVEHW RAMFIDA KK
Sbjct: 422  -ASYLSATELSTRKNNDMNGWDLVEHPKYPPPPSQSEDVEHWTRAMFIDAKKK 473


>gb|AAL59980.1| unknown protein [Arabidopsis thaliana]
          Length = 473

 Score =  416 bits (1068), Expect = e-113
 Identities = 233/473 (49%), Positives = 310/473 (65%), Gaps = 3/473 (0%)
 Frame = -2

Query: 1524 SSCCAICEGSNLASICAPCVNHRLNEYSALLKSLNTVCKSHYSXXXXXXXXXXXXDQQIN 1345
            SS CAIC+ +N   IC  CVNHRL EY+ LLKSL T   S  S            D Q N
Sbjct: 5    SSNCAICDNTNRPCICTACVNHRLIEYNTLLKSLKTRRDSLLSRFNELLESKGKADDQKN 64

Query: 1344 WRAVQNVELKKLKERLTYLKKKLAEDKAKVSKESNDLKLKYGSLELAFSTMKNKQTEVLD 1165
            WR +QN ++ KLK++L   K+ + + K K+ + S+DLK+KYG L+ A ST++  + E ++
Sbjct: 65   WRLIQNEKISKLKKKLKSNKELVTQGKVKIERGSSDLKVKYGVLDSARSTLEKTRVEQVE 124

Query: 1164 KLRSDLMSNQRLVDTGITPECIRRQAVVIENICKLFPMCRVNPD-KIKDRSNNMSNQICN 988
            K   +L+  Q L    I+ E + +Q+VV++ ICKLFP+ RV+ D + ++ S    + ICN
Sbjct: 125  KYFPNLICTQSLGHMAISSERLHKQSVVVKQICKLFPLRRVSFDGESQNGSVRQYDVICN 184

Query: 987  ARLPRGLDPHSVPPEELAASLGYMVQXXXXXXXXXXXXXLHNSGFAGSCSRIWQRNSYWD 808
            +RLP GLDPHS+P EELA SLGYMVQ             LH+SGFAGSCSRIWQR+SYWD
Sbjct: 185  SRLPSGLDPHSIPSEELAVSLGYMVQLLNLVVHNLAAPALHSSGFAGSCSRIWQRDSYWD 244

Query: 807  ARPSSQSKEYPLFIPRQNFC-XXXXXXXXXXXSLNFGVASVESERR-PHLESSRNSSFNY 634
             R S++S EYPLFIPR+N+C            S NFGVAS+ES+R+ P L+S  ++SF Y
Sbjct: 245  GRTSTRSNEYPLFIPRRNYCSTSVENSWTDKNSSNFGVASMESDRKEPRLDSPGSNSFMY 304

Query: 633  SLASPHSLENHTDLQKGILLLKKSVACITAYSCNSLGLDIPSEASTFEAFGKLLFILSSS 454
            S ASPHS+E+H DLQKGI LLKKSVAC+TAY  NSL L++P EASTFEAF KLL  LSSS
Sbjct: 305  SSASPHSIESHRDLQKGIALLKKSVACLTAYCYNSLCLEVPPEASTFEAFAKLLATLSSS 364

Query: 453  KELQSIRNSLKTASSSTEKQAQQLKRXXXXXXXXXXXXNTLAESEQTITSTVDDDNLQNP 274
            KE++S+  SLK ASS + KQAQQL +            ++L ES     +T  + +  +P
Sbjct: 365  KEVRSV-FSLKMASSRSGKQAQQLNK--SIWNAHSVISSSLLESAHLPRNTSYNQDPNSP 421

Query: 273  DSSFMFTAVMINKSESIVEGWNIIEHPTLPPPPSKTEDVEHWMRAMFIDATKK 115
             +S++    +  +  + + GW+++EHP  PPPPS++EDVEHW RAMFIDA KK
Sbjct: 422  -ASYLSATELSTRKNNDMNGWDLVEHPKYPPPPSQSEDVEHWTRAMFIDAKKK 473


Top