BLASTX nr result

ID: Papaver25_contig00020877 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Papaver25_contig00020877
         (3264 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CCA66040.1| hypothetical protein [Beta vulgaris subsp. vulga...   203   4e-49
gb|AFP55574.1| non-ltr retroelement reverse transcriptase [Rosa ...   201   1e-48
gb|EPS61587.1| hypothetical protein M569_13207, partial [Genlise...   194   2e-46
ref|XP_007217321.1| hypothetical protein PRUPE_ppa019733mg [Prun...   192   9e-46
ref|XP_007203452.1| hypothetical protein PRUPE_ppa022115mg [Prun...   192   1e-45
gb|EPS63383.1| hypothetical protein M569_11401 [Genlisea aurea]       190   4e-45
gb|AAS55787.1| hypothetical protein [Oryza sativa Japonica Group...   190   4e-45
ref|XP_002450418.1| hypothetical protein SORBIDRAFT_05g005061 [S...   187   2e-44
ref|XP_004980332.1| PREDICTED: uncharacterized protein LOC101786...   186   6e-44
emb|CCA65997.1| hypothetical protein [Beta vulgaris subsp. vulga...   185   1e-43
emb|CCA66054.1| hypothetical protein [Beta vulgaris subsp. vulga...   183   4e-43
gb|AAD21778.1| putative non-LTR retroelement reverse transcripta...   181   3e-42
ref|XP_007022832.1| Uncharacterized protein TCM_026877 [Theobrom...   180   4e-42
pir||S65812 RNA-directed DNA polymerase (EC 2.7.7.49) (clone DW1...   180   4e-42
ref|XP_007008704.1| Uncharacterized protein TCM_042330 [Theobrom...   179   6e-42
gb|AAB82639.1| putative non-LTR retroelement reverse transcripta...   179   6e-42
gb|EEE61581.1| hypothetical protein OsJ_15963 [Oryza sativa Japo...   178   1e-41
emb|CCA66050.1| hypothetical protein [Beta vulgaris subsp. vulga...   177   3e-41
emb|CCA66044.1| hypothetical protein [Beta vulgaris subsp. vulga...   176   5e-41
emb|CCA66036.1| hypothetical protein [Beta vulgaris subsp. vulga...   176   6e-41

>emb|CCA66040.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1362

 Score =  203 bits (517), Expect = 4e-49
 Identities = 124/474 (26%), Positives = 215/474 (45%), Gaps = 4/474 (0%)
 Frame = -2

Query: 2753 MKILSWNCQGFGNPSTRDYLHYCIQKHDPDILFLSKTKITDSQMHFYLQQYSYPNFWYAP 2574
            MK+LSWNCQG  NP T + LH    +  P+I+F+ +T +    +    ++  + N     
Sbjct: 1    MKLLSWNCQGLANPWTVNALHSLCWRDRPNIVFVMETMVDSQVLEKIRKRCGFMNGLCLS 60

Query: 2573 PVGQSGGISLAWKNGVDLEIMHTSRHNVHAIIHTHSHNLDFILSCMYGANNPMECQEQWQ 2394
              G SGG+ L W N +D+ +   S H++HA++   + N  +    +YG          W 
Sbjct: 61   SNGNSGGMGLWW-NEMDVTVESFSAHHIHAVVLDENKNPIWNAMGIYGWPETSNKHLTWS 119

Query: 2393 YLLDMQPFVDLPWILMGDLNFTMLDSEIXXXXXXXXXXXSMIRHIIQQLGLIDLGFTGSS 2214
             L  ++    LP +  GD N      E               R +I    + DLG+ G+ 
Sbjct: 120  LLRRLKQQCSLPVLFFGDFNEITSIEEKEGGAPRCERVMDAFREVIDDCAVKDLGYVGNR 179

Query: 2213 TTWSNHRQGNDYTAVRLDRALGNISWINSYSTAHLIHIPPVASDHYPILLST--SSGMTK 2040
             TW      +     RLDR L N  W +++ +  ++H+P   SDH P+LL T  +    +
Sbjct: 180  FTWQRGNSPSTLIRERLDRMLANDEWCDNFPSWEVVHLPRYRSDHAPLLLKTGVNDSFRR 239

Query: 2039 KQFPFRLQRYWFSIASCTDIINTSWSHQVLGSPSYHLSSKLHYTKHQLQLWKKITFGNIE 1860
                F+ +  W S   C  I+  +W+    GS    ++++L      L  W   TFGN++
Sbjct: 240  GNKLFKFEAMWLSKEECGKIVEEAWN----GSAGEDITNRLDEVSRSLSTWATKTFGNLK 295

Query: 1859 TQIQTIQTHLDQCYNRNMPTSNP*VVNLTQS-LQKWLAIQKDFYMQKSGAH-FYDADRNT 1686
             + +   T L+    R+   S      +    L +   +++ ++  ++ A+   D D+NT
Sbjct: 296  KRKKEALTLLNGLQQRDPDASTLEQCRIVSGDLDEIHRLEESYWHARARANEIRDGDKNT 355

Query: 1685 SYFHSKANFNKKISHIFAIQDSLGIWHDSRFGIENTLLNHFSAISKSTKLASDNSIFSEF 1506
             YFH KA+  K+ + I  + D  G+W   R  I   + ++F  +  +    +        
Sbjct: 356  KYFHHKASQRKRRNTINELLDENGVWKKGREEICGVVQHYFEGLFATDSPVNMELALEGL 415

Query: 1505 QSCITDEENSLLLQLPSEQEIKDVVFQIKPWSSPGNDGFQAAFYQQCWSPVGKE 1344
              C++ + N+ LL LPS  E+K+ +F + P  +PG DG  A F+Q+ W  +G +
Sbjct: 416  SHCVSTDMNTALLMLPSGDEVKEALFAMHPNKAPGIDGLHALFFQKFWHILGSD 469



 Score = 72.0 bits (175), Expect = 2e-09
 Identities = 71/274 (25%), Positives = 117/274 (42%), Gaps = 14/274 (5%)
 Frame = -1

Query: 858  ELIHLE*GQ*NIDLLRNLFTSQQVDSILTIPLV-LQTQDTLIWTLTSSGVFTTKSTCH-- 688
            +LI +  G  NI+ ++  F  ++ + +L+IPL      D   W  + +G+F+ +S C+  
Sbjct: 966  DLIDVARGAWNIESVQQTFVEEEWELVLSIPLSRFLPDDHRYWWPSRNGIFSVRS-CYWL 1024

Query: 687  -HLCDIDLQDSALSSLSKTYWFKFWKLKIPYKLLIFLWKIINNVLPVKSNL--RHIPGID 517
              L  +              W + W+L+ P KL  FLW+     L VK  L  RHI    
Sbjct: 1025 GRLGPVRTWQLQHGERETELWRRVWQLQGPPKLSHFLWRACKGSLAVKGRLFSRHISV-- 1082

Query: 516  DFSCPLYNTANVEDINHLLFTCPFATAVWKASLPQHFSLLLQHNTL------IDWIRTWS 355
            D +C +    + E INH LF C FA A+W+ S    F+ L+ +  L      ++W+   +
Sbjct: 1083 DATCSVCGDPD-ESINHALFDCTFARAIWQVS---GFASLMMNAPLSSFSERLEWLAKHA 1138

Query: 354  STDVVINFSSVSPSIYGMIATMWHIWRYICQVPFRHVQINLNSVLLPMFKYLANITATLA 175
            + +              M + MW  W    ++ F +    L+   L + K  + + A   
Sbjct: 1139 TKE----------EFRTMCSFMWAGWFCRNKLIFEN---ELSDAPL-VAKRFSKLVADYC 1184

Query: 174  SHRHTCHHNQ--NIGQSHHWNPPPPDTLKVNIDA 79
             +  +         G S  W+PPP    KVN DA
Sbjct: 1185 EYAGSVFRGSGGGCGSSALWSPPPTGMFKVNFDA 1218


>gb|AFP55574.1| non-ltr retroelement reverse transcriptase [Rosa rugosa]
          Length = 1656

 Score =  201 bits (512), Expect = 1e-48
 Identities = 117/441 (26%), Positives = 207/441 (46%), Gaps = 1/441 (0%)
 Frame = -2

Query: 2681 QKHDPDILFLSKTKITDSQMHFYLQQYSYPNFWYAPPVGQSGGISLAWKNGVDLEIMHTS 2502
            +KH+P+ILFL +T+  +  +  + +   + +     P+    G++L W + V + I+ +S
Sbjct: 629  KKHNPEILFLMETRQQEGIIKEWKRNLKFTDHHVVDPIATGRGLALFWGDAVQVSILDSS 688

Query: 2501 RHNVHAIIHTHSHNLDFILSCMYGANNPMECQEQWQYLLDMQPFVDLPWILMGDLNFTML 2322
             + V  ++   S      ++ MYG  +  E +  W+ +    P   LPW+++GD N  + 
Sbjct: 689  PNYVDTVVSFLSDAFVCKITWMYGNPHDNEKRAFWRLMYSRFPVQSLPWLVLGDFNEVLD 748

Query: 2321 DSEIXXXXXXXXXXXSMIRHIIQQLGLIDLGFTGSSTTWSNHRQGNDYTAVRLDRALGNI 2142
             SE             + R  +    L DL F G   +W   R G  +   RLDRALGNI
Sbjct: 749  PSEKWGGGPPLPWRIKLFRDFLNNGHLRDLHFKGPGFSWFAMRHGRVFIKERLDRALGNI 808

Query: 2141 SWINSYSTAHLIHIPPVASDHYPILLSTSSGMTKKQFPFRLQRYWFSIASCTDIINTSWS 1962
            +W +S     ++H+P + SDH P+LL ++  M  K   FR ++ W +    +D+I  SW 
Sbjct: 809  AWSSSQPNTQILHLPKIGSDHRPLLLDSNPKMLNKTRLFRFEQMWTTHEEYSDVIQRSWP 868

Query: 1961 HQVLGSPSYHLSSKLHYTKHQLQLWKKITFGNIETQIQTIQTHLDQCYNRNMPTSNP*VV 1782
                GS     +  L      L++W K  F N   Q+  + + +++ +  N P ++  + 
Sbjct: 869  PAFGGSAMRSWNRNLLSCGKALKMWSKEKFSNPSVQVADLLSDIEKLHQSNPPDAHHQIN 928

Query: 1781 NLTQSLQKWLAIQKDFYMQKSGAHFYD-ADRNTSYFHSKANFNKKISHIFAIQDSLGIWH 1605
             LT  + K     + ++ Q+S  ++    D+N+S+FH      ++ + I  ++D  G W 
Sbjct: 929  ILTDQVTKLWTQDEMYWHQRSRVNWLKLGDQNSSFFHQTTIQRRQYNKIVRLKDDHGNWL 988

Query: 1604 DSRFGIENTLLNHFSAISKSTKLASDNSIFSEFQSCITDEENSLLLQLPSEQEIKDVVFQ 1425
            DS   +    L++F+A+ +S        +     + +T E N +L    S  E+K  VF 
Sbjct: 989  DSEADVALQFLDYFTALYQSNGPQQWEEVLDFVDTAVTAEMNKILSSPVSLLEVKKAVFD 1048

Query: 1424 IKPWSSPGNDGFQAAFYQQCW 1362
            +    +PG DGF   FYQ  W
Sbjct: 1049 LGATKAPGPDGFSGIFYQNQW 1069



 Score = 62.8 bits (151), Expect = 1e-06
 Identities = 31/105 (29%), Positives = 54/105 (51%), Gaps = 4/105 (3%)
 Frame = -2

Query: 1166 KLQNTTIQQMDKIQRDYRWGKTK----HFIAWNKVCFPIRHGGIGLKYLSHYNSALLAKL 999
            K  +T  ++++ I  D+ WG       H+ +W+ +  P + GG+G + L  +N++LLAK 
Sbjct: 1418 KFPSTLCKELNGILADFWWGNVDTRGIHWKSWDFLARPKKDGGMGFRNLEDFNNSLLAKQ 1477

Query: 998  AWNMIHSSTELWVQLLNGKYFSLYELTHEPPPNCENASWIWQSIM 864
            AW +  +   LW ++L   Y+          P   N SWIW S++
Sbjct: 1478 AWRLHQNPFALWARVLEQLYYPRSSFLE--APKGPNPSWIWNSLL 1520


>gb|EPS61587.1| hypothetical protein M569_13207, partial [Genlisea aurea]
          Length = 475

 Score =  194 bits (494), Expect = 2e-46
 Identities = 141/483 (29%), Positives = 228/483 (47%), Gaps = 18/483 (3%)
 Frame = -2

Query: 2738 WNCQGFGNPSTRDYLHYCIQKHDPDILFLSKTKITDSQMHFYLQQYSYPNFWYAPPVGQS 2559
            WNCQG G+P T   L   I +  P  +FL +TK +  ++ +    Y Y  F +   VG S
Sbjct: 1    WNCQGMGSPWTVRRLKELIVQFSPSFIFLCETKCSSCKLSWVKNHYPYFGF-FVDAVGAS 59

Query: 2558 GGISLAWKNGVDLEIMHTSRHNVHAIIHTHSHNLDFILSCMYGANNPMECQ--EQWQYLL 2385
            GG++L WK  +D+ ++  S+  +   I+    ++   ++  Y  +NP+     + W  L 
Sbjct: 60   GGLALFWKKELDVSLLSYSKWYIDVSINISFGDVQCRVTGFY--DNPISSSRPDSWNLLR 117

Query: 2384 DMQPFVDLPWILMGDLNFTMLDSEIXXXXXXXXXXXSMIRHIIQQLGLIDLGFTGSSTTW 2205
             +      PWI +GD N  +   E+              R  +   GL DL + G   TW
Sbjct: 118  RLHRHSLRPWICVGDFNEVLFPHEVSSLASRPTAQMVGFRRALMDCGLTDLPYHGHPFTW 177

Query: 2204 SNHRQGNDYTAVRLDRALGNISWINSYSTAHLIHIPPVASDHYPILL-----STSSGMTK 2040
            SN R+       RLDRA+ + SW+  Y      H+    SDH P+ +     S   G+ K
Sbjct: 178  SNKRKHPQTVRARLDRAVASTSWLQCYPCTSTSHLSFGGSDHAPLWIQSAPPSAGDGLRK 237

Query: 2039 KQFPFRLQRYWFSIASCTDIINTSWSHQVLGSPSYHLSSKLHYTKHQLQLWKKITFGNIE 1860
             +  FR +  W  +  C   I T+W+    G PS  L  KL  T+  L  W +     I+
Sbjct: 238  SR-RFRFEARWMQLPGCDSTIRTAWTSS-QGPPS-TLQPKLGTTRISLLKWYQHQISPIK 294

Query: 1859 TQIQTIQTHLDQCYNRNMPTSNP*VVNLTQSLQKWLA--IQKD--FYMQKSGAHFY-DAD 1695
              I+ ++T L       + T +   +   + LQ  LA  +Q++  F+ Q+S  H+    D
Sbjct: 295  ANIKRVETELATI---AVSTRDDIFMAREKQLQCELAGYLQQEELFWKQRSKTHWLAKGD 351

Query: 1694 RNTSYFHSKANFNKKISHIFAIQDSLGIWHDSRFGIENTLLNHFSAISKSTK------LA 1533
            RNT+YFH+ A+  ++ + I +I DS G   DS  GI + +L++F+ I  ST        A
Sbjct: 352  RNTAYFHACASGRRECNRISSIMDSDGRQQDSPHGIHSAILDYFNRIFSSTMPPPELLAA 411

Query: 1532 SDNSIFSEFQSCITDEENSLLLQLPSEQEIKDVVFQIKPWSSPGNDGFQAAFYQQCWSPV 1353
            +  SI +     +TD   ++L    +++E+   + Q+KP S+PG DGF   F+Q+ W  V
Sbjct: 412  TTRSISNR----LTDSMKTMLSVPFTKEEVWPAIKQMKPLSAPGPDGFPPIFFQRYWPIV 467

Query: 1352 GKE 1344
             +E
Sbjct: 468  HEE 470


>ref|XP_007217321.1| hypothetical protein PRUPE_ppa019733mg [Prunus persica]
            gi|462413471|gb|EMJ18520.1| hypothetical protein
            PRUPE_ppa019733mg [Prunus persica]
          Length = 1275

 Score =  192 bits (488), Expect = 9e-46
 Identities = 125/412 (30%), Positives = 194/412 (47%), Gaps = 4/412 (0%)
 Frame = -2

Query: 2567 GQSGGISLAWKNGVDLEIMHTSRHNVHAIIHTHSHNLDFILSCMYGANNPMECQEQWQYL 2388
            G SGG++L WK  VD+ +   S H +   I ++     + L+  YG     + ++ W  L
Sbjct: 20   GYSGGLALLWKEEVDVHVCAFSDHFIDVKIGSNGGGDRWRLTVFYGFPAVQDREKSWILL 79

Query: 2387 LDMQPFVDLPWILMGDLNFTMLDSEIXXXXXXXXXXXSMIRHIIQQLGLIDLGFTGSSTT 2208
              +     LPW+ +GD N  +   E               R+I+ +LG  DLGF G   T
Sbjct: 80   DQLGHHNQLPWLCVGDFNEILSTDEKEGGPLRNNRQMQGFRNIVDKLGFRDLGFNGYKFT 139

Query: 2207 WSNHRQGNDYTAVRLDRALGNISWINSYSTAHLIHIPPVASDHYPILLSTSSGMTKKQ-- 2034
            W   R G+ +  VRLDRAL   SW N +    + H+ P  SDH PIL+       +K   
Sbjct: 140  WKC-RFGDGFVRVRLDRALATTSWQNLFPGFSVQHLDPSRSDHLPILVRIRHATCQKSRY 198

Query: 2033 FPFRLQRYWFSIASCTDIINTSWSHQVLGSPSYHLSSKLHYTKHQLQLWKKITFGNIETQ 1854
              F  +  W +   C   I   W       P   L  K+      LQ W K TFG+I+ +
Sbjct: 199  HRFHFEAMWTTHVDCEKTIKQVWESVGDLDPMVGLDKKIKQMTWVLQRWSKSTFGHIKEE 258

Query: 1853 IQTIQTHLDQCYNRNMPTSNP*VVNLTQ-SLQKWLAIQKDFYMQKSGAHFYDA-DRNTSY 1680
             + ++  L   +             + Q SL + LA  + ++ Q+S  ++  A D+NTSY
Sbjct: 259  TRVLRAKLASLFQAPYSERVEEDRRVVQKSLDELLAKNELYWCQRSRENWLKAGDKNTSY 318

Query: 1679 FHSKANFNKKISHIFAIQDSLGIWHDSRFGIENTLLNHFSAISKSTKLASDNSIFSEFQS 1500
            FH KA   ++ + I  ++DS G W  SR GI + ++++F  + +S+  +    I S  + 
Sbjct: 319  FHQKATNRRRRNIIKGLEDSNGCWRTSRQGITSIVIDYFGDLFRSSGSSMMEEILSALEP 378

Query: 1499 CITDEENSLLLQLPSEQEIKDVVFQIKPWSSPGNDGFQAAFYQQCWSPVGKE 1344
             +T +   +L+   S QEIKD VFQ++P  +PG DG    FYQ+ W  VG +
Sbjct: 379  KVTADMQQVLIADFSYQEIKDAVFQMQPSKAPGPDGLPPLFYQKYWRIVGDD 430



 Score = 75.9 bits (185), Expect = 1e-10
 Identities = 47/144 (32%), Positives = 68/144 (47%), Gaps = 5/144 (3%)
 Frame = -1

Query: 834  Q*NIDLLRNLFTSQQVDSILTIPLVLQTQ-DTLIWTLTSSGVFTTKSTCHHLCDIDLQDS 658
            Q ++  L NLF    V  I+ IPL ++   D ++W     G+FT KS       +   D 
Sbjct: 910  QWDLQKLNNLFLPVDVVDIVRIPLSIRAPPDRIVWNYDKHGLFTVKSAYRVALRVTSGDE 969

Query: 657  ALSSLSKT----YWFKFWKLKIPYKLLIFLWKIINNVLPVKSNLRHIPGIDDFSCPLYNT 490
              SS S +     W   W   +P KL IF W++ +++LP K+NL    G+D     ++  
Sbjct: 970  DESSSSNSDTGMLWRHIWNATVPTKLKIFAWRVAHDILPTKANLIK-KGVDMQDMCMFCG 1028

Query: 489  ANVEDINHLLFTCPFATAVWKASL 418
               E   H+L  CPFA A W  SL
Sbjct: 1029 DITESALHVLAMCPFAVATWNISL 1052


>ref|XP_007203452.1| hypothetical protein PRUPE_ppa022115mg [Prunus persica]
            gi|462398983|gb|EMJ04651.1| hypothetical protein
            PRUPE_ppa022115mg [Prunus persica]
          Length = 1755

 Score =  192 bits (487), Expect = 1e-45
 Identities = 125/412 (30%), Positives = 194/412 (47%), Gaps = 4/412 (0%)
 Frame = -2

Query: 2567 GQSGGISLAWKNGVDLEIMHTSRHNVHAIIHTHSHNLDFILSCMYGANNPMECQEQWQYL 2388
            G SGG++L WK  VD+ +   S H +   I ++     + L+  YG     + ++ W  L
Sbjct: 474  GYSGGLALLWKEEVDVHVCAFSDHFIDVQIGSNGGGDRWRLTVFYGFPAVQDREKSWILL 533

Query: 2387 LDMQPFVDLPWILMGDLNFTMLDSEIXXXXXXXXXXXSMIRHIIQQLGLIDLGFTGSSTT 2208
              +     LPW+ +GD N  +   E               R+I+ +LG  DLGF G   T
Sbjct: 534  DQLGHHNQLPWLCVGDFNEILSTDEKEGGPLRNNRQMQGFRNIVDKLGFRDLGFNGYKFT 593

Query: 2207 WSNHRQGNDYTAVRLDRALGNISWINSYSTAHLIHIPPVASDHYPILLSTSSGMTKKQF- 2031
            W   R G+ +  VRLDRAL   SW N +    + H+ P  SDH PIL+       +K   
Sbjct: 594  WKC-RFGDGFVRVRLDRALATTSWQNLFPGFSVQHLDPSRSDHLPILVRIRHATCQKSRY 652

Query: 2030 -PFRLQRYWFSIASCTDIINTSWSHQVLGSPSYHLSSKLHYTKHQLQLWKKITFGNIETQ 1854
              F  +  W +   C   I   W       P   L  K+      LQ W K TFG+I+ +
Sbjct: 653  RRFHFEAMWTTHVDCEKTIKQVWESVGNLDPMVGLDKKIKQMTWVLQRWSKSTFGHIKEE 712

Query: 1853 IQTIQTHLDQCYNRNMPTSNP*VVNLTQ-SLQKWLAIQKDFYMQKSGAHFYDA-DRNTSY 1680
             + ++  L   +             + Q SL + LA  + ++ Q+S  ++  A D+NTSY
Sbjct: 713  TRVLRAKLASLFQAPYSERVEEDRRVVQKSLDELLAKNELYWCQRSRENWLKAGDKNTSY 772

Query: 1679 FHSKANFNKKISHIFAIQDSLGIWHDSRFGIENTLLNHFSAISKSTKLASDNSIFSEFQS 1500
            FH KA   ++ + I  ++DS G W  SR GI + ++++F  + +S+  +    I S  + 
Sbjct: 773  FHQKATNRRRRNIIKGLEDSNGCWRTSRQGITSIVIDYFGDLFRSSGSSMMEEILSALEP 832

Query: 1499 CITDEENSLLLQLPSEQEIKDVVFQIKPWSSPGNDGFQAAFYQQCWSPVGKE 1344
             +T +   +L+   S QEIKD VFQ++P  +PG DG    FYQ+ W  VG +
Sbjct: 833  KVTADMQQVLIADFSYQEIKDAVFQMQPSKAPGPDGLPPLFYQKYWRIVGDD 884



 Score = 75.9 bits (185), Expect = 1e-10
 Identities = 47/144 (32%), Positives = 68/144 (47%), Gaps = 5/144 (3%)
 Frame = -1

Query: 834  Q*NIDLLRNLFTSQQVDSILTIPLVLQTQ-DTLIWTLTSSGVFTTKSTCHHLCDIDLQDS 658
            Q ++  L NLF    V  I+ IPL ++   D ++W     G+FT KS       +   D 
Sbjct: 1390 QWDLQKLNNLFLPVDVVDIVRIPLSIRAPPDRIVWNYDKHGLFTVKSAYRVALRVTSGDE 1449

Query: 657  ALSSLSKT----YWFKFWKLKIPYKLLIFLWKIINNVLPVKSNLRHIPGIDDFSCPLYNT 490
              SS S +     W   W   +P KL IF W++ +++LP K+NL    G+D     ++  
Sbjct: 1450 DESSSSNSDTGMLWRHIWNATVPTKLKIFAWRVAHDILPTKANLIK-KGVDMQDMCMFCG 1508

Query: 489  ANVEDINHLLFTCPFATAVWKASL 418
               E   H+L  CPFA A W  SL
Sbjct: 1509 DITESALHVLAMCPFAVATWNISL 1532


>gb|EPS63383.1| hypothetical protein M569_11401 [Genlisea aurea]
          Length = 1469

 Score =  190 bits (482), Expect = 4e-45
 Identities = 133/496 (26%), Positives = 234/496 (47%), Gaps = 12/496 (2%)
 Frame = -2

Query: 2753 MKILSWNCQGFGNPSTRDYLHYCIQKHDPDILFLSKTKITDSQMHFYLQQYSYPNFWYA- 2577
            M +L+WNC+G  + ST   L   I    P ++FLS+TK   S + +  +  SY  F  A 
Sbjct: 369  MSLLAWNCRGLRSASTVRRLRDVISSDAPSMIFLSETKCLASHVEWLKECLSY--FGVAV 426

Query: 2576 PPVGQSGGISLAWKNGVDLEIMHTSRHNVHAIIHTHSHNLDFILSCMYGANNPMECQEQW 2397
               G SGG++L W+  V + ++      +  ++       ++  +  YG          W
Sbjct: 427  SATGLSGGLALFWRKDVCVSLLSFCSSYIDVLVRLTPTLPEWRFTGFYGNPAVQLRPRSW 486

Query: 2396 QYLLDMQPFVDLPWILMGDLNFTMLDSEIXXXXXXXXXXXSMIRHIIQQLGLIDLGFTGS 2217
              L  ++     PW++ GD N  ++ +E+              R  +    L D+GFTG 
Sbjct: 487  DLLRQIRHHSICPWLVAGDFNEVVMQNEVESLNSRPASQMRAFRDALLDCQLQDIGFTGF 546

Query: 2216 STTWSNHRQGNDYTAVRLDRALGNISWINSYSTAHLIHIPPVASDHYPILL------STS 2055
              TW N R+  D    RLDRA+   +W N +  A + H+P  +SDH P+L+       TS
Sbjct: 547  PFTWCNKRKAPDTVRARLDRAVATTTWNNLFPRAIVKHLPYGSSDHLPLLIFLDPAAPTS 606

Query: 2054 SGMTKKQFPFRLQRYWFSIASCTDIINTSWSHQVLGSPSYHLSSKLHYTKHQLQLWKKIT 1875
                K++F F  + +W +I  C D+I+ SW+     S   + + ++  T+  L  W +  
Sbjct: 607  IRPNKRRFKF--EAFWTTIPGCADVIHQSWAP---NSQPTNFNYRIQKTRMSLLKWYQSK 661

Query: 1874 FGNIETQIQTIQTHLDQCYNRNMPTSNP*VVNLTQSLQKWLAIQKDFYMQKSGAHFY--D 1701
             G I++++Q I T LD    +++        +  +  Q  L  Q++ Y ++ G   +   
Sbjct: 662  VGPIKSRLQKIATELDLLARQSITDDIKHCESALKEEQASLWKQEEMYWKQRGKIHWLRC 721

Query: 1700 ADRNTSYFHSKANFNKKISHIFAIQDSLGIWHDSRFGIENTLLNHFSAISKST---KLAS 1530
             DRNT++FH+ A+  +  + I  I+++ G+W      +  T+L+++  +  S+    +  
Sbjct: 722  GDRNTAFFHASASEKRTQNRIAGIKNAHGLWITRGPEVITTMLSYYQDLFTSSPPDPIEM 781

Query: 1529 DNSIFSEFQSCITDEENSLLLQLPSEQEIKDVVFQIKPWSSPGNDGFQAAFYQQCWSPVG 1350
            + ++ S     ITD+  ++L +  +  E+   V ++KP SSPG DGF   FYQ+ W  VG
Sbjct: 782  ERAL-SIIPRTITDDMRAILERPYNAAEVWPAVRRMKPLSSPGPDGFPPVFYQKYWPTVG 840

Query: 1349 KEERITTTKVSLLING 1302
              +      + LL NG
Sbjct: 841  --QATVEAVLKLLNNG 854


>gb|AAS55787.1| hypothetical protein [Oryza sativa Japonica Group]
            gi|54291856|gb|AAV32224.1| hypothetical protein [Oryza
            sativa Japonica Group]
          Length = 1936

 Score =  190 bits (482), Expect = 4e-45
 Identities = 122/467 (26%), Positives = 208/467 (44%), Gaps = 3/467 (0%)
 Frame = -2

Query: 2753 MKILSWNCQGFGNPSTRDYLHYCIQKHDPDILFLSKTKITDSQMHFYLQQYSYPNFWYAP 2574
            M  L+WNC+G GN +T   L   IQK    ++FL +T+ +  +M    ++ ++  F    
Sbjct: 636  MSCLAWNCRGLGNTATVQDLRALIQKAGSQLVFLCETRQSVEKMSRLRRKLAFRGFVGVS 695

Query: 2573 PVGQSGGISLAWKNGVDLEIMHTSRHNVHAIIHTHSHNLDFILSCMYGANNPMECQEQWQ 2394
              G+SGG++L W   V +++   ++  + A +        + ++ +YG          W 
Sbjct: 696  SEGKSGGLALYWDESVSVDVKDINKRYIDAYVRLSPDEPQWHITFVYGEPRVENRHRMWS 755

Query: 2393 YLLDMQPFVDLPWILMGDLNFTMLDSEIXXXXXXXXXXXSMIRHIIQQLGLIDLGFTGSS 2214
             L  ++    LPW+++GD N T+   E               R  +    L DLGF G  
Sbjct: 756  LLRTIRQSSALPWMVIGDFNETLWQFEHFSKNPRCETQMQNFRDALYDCDLQDLGFKGVP 815

Query: 2213 TTWSNHRQGNDYTAVRLDRALGNISWINSYSTAHLIHIPPVASDHYPILLS--TSSGMTK 2040
             T+ N R G     VRLDRA+ +  W + +  A + H+    SDH PILL          
Sbjct: 816  HTYDNRRDGWRNVKVRLDRAVADDKWRDLFPEAQVSHLVSPCSDHSPILLEFIVKDTTRP 875

Query: 2039 KQFPFRLQRYWFSIASCTDIINTSWSHQVLGSPSYHLSSKLHYTKHQLQLWKKITFGNIE 1860
            +Q     +  W        +I  +W +  + +    ++  L      L+ W K    N+ 
Sbjct: 876  RQKCLHYEIVWEREPESVQVIEEAWINAGVKTDLGDINIALGRVMSALRSWSKTKVKNVG 935

Query: 1859 TQIQTIQTHLDQCYNRNMPTSNP*VVNLTQSLQKWLAIQKDFYMQKSGAHFY-DADRNTS 1683
             +++  +  L+     N   S+  +   T  + + L  ++  ++Q+S  ++  + DRNT 
Sbjct: 936  KELEKARKKLEDLIASNAARSS--IRQATDHMNEMLYREEMLWLQRSRVNWLKEGDRNTR 993

Query: 1682 YFHSKANFNKKISHIFAIQDSLGIWHDSRFGIENTLLNHFSAISKSTKLASDNSIFSEFQ 1503
            +FHS+A +  K + I  ++D  G  H +   +E     +F  + K+    +  S+   FQ
Sbjct: 994  FFHSRAVWRAKKNKISKLRDENGAIHSTTSVLETMATEYFQGVYKADPSLNPESVTRLFQ 1053

Query: 1502 SCITDEENSLLLQLPSEQEIKDVVFQIKPWSSPGNDGFQAAFYQQCW 1362
              +TD  N  L Q   E+EI   +FQI P  SP  DGF A FYQ+ W
Sbjct: 1054 EKVTDAMNEKLCQEFKEEEIAQAIFQIGPLKSPRPDGFPARFYQRNW 1100


>ref|XP_002450418.1| hypothetical protein SORBIDRAFT_05g005061 [Sorghum bicolor]
            gi|241936261|gb|EES09406.1| hypothetical protein
            SORBIDRAFT_05g005061 [Sorghum bicolor]
          Length = 753

 Score =  187 bits (476), Expect = 2e-44
 Identities = 131/494 (26%), Positives = 225/494 (45%), Gaps = 10/494 (2%)
 Frame = -2

Query: 2753 MKILSWNCQGFGNPSTRDYLHYCIQKHDPDILFLSKTKITDSQMHFYLQQYSYPNFWYAP 2574
            MK L WNC+G GNP+T   L    + + P ++F+ +T+I+  ++       S+ N +   
Sbjct: 1    MKTLCWNCRGIGNPATVKELRDLAKDYAPSVMFIMETQISKYRVENLRYTLSFDNSFAVN 60

Query: 2573 PVGQSGGISLAWKNGVDLEIMHTSRHNVHAIIHTHSHNLDFILSCMYGANNPMECQEQWQ 2394
              G+SGG+ L W N V L I   S +++  II  H       +S +YG  N       W 
Sbjct: 61   SSGRSGGLGLFWNNDVLLSIQKYSNYHIDTIISEHGKE-PRRMSFIYGEPNRSFRYRTWD 119

Query: 2393 YLLDMQPFVDLPWILMGDLNFTMLDSEIXXXXXXXXXXXSMIRHIIQQLGLIDLGFTGSS 2214
             +  M+   DLPW+ MGD N  +   E              +R  +    L D+G+ G  
Sbjct: 120  IMKQMRSDTDLPWVCMGDFNEILRREEQLGPNEREEYLMEGVRDAVDMCQLRDIGYIGLD 179

Query: 2213 TTWSNHRQGNDYTAVRLDRALGNISWINSYSTAHLIHIPPVASDHYPILLS-----TSSG 2049
             T+     G  +  VRLDRAL +++W   +  A + H+  V SDH PILLS      + G
Sbjct: 180  WTFEKKVAGGHFVRVRLDRALASVNWCARFPLAAVQHLTAVKSDHCPILLSHVPDERNEG 239

Query: 2048 MTKKQFPFRLQRYWFSIASCTDIINTSWSHQVLGSPSYHLSSKLHYTKHQLQLWKKITFG 1869
               +  PFR +  W +    + +I   W +    +    +  KL +   +L+ W   TFG
Sbjct: 240  GGCQGKPFRYELMWETNERLSSLIEQIWKNGQHCNSVKDMKDKLFHLGEELKSWGGKTFG 299

Query: 1868 NIETQIQTIQTHLDQCYNRNMPTSNP*VVNLTQSLQKWLAI----QKDFYMQKSG-AHFY 1704
             +  +++  +  L+Q   R  P+ N  V    Q +   + +    ++  + Q+S     +
Sbjct: 300  VVRKELRVQKKRLEQL--RADPSRNT-VSEEEQKIVNRIILLNYQEEIMWRQRSRITWLH 356

Query: 1703 DADRNTSYFHSKANFNKKISHIFAIQDSLGIWHDSRFGIENTLLNHFSAISKSTKLASDN 1524
            + D NT +FH +A+  +  + I  +    G    +   +   +++ +  + +S   ++ N
Sbjct: 357  EGDSNTKFFHQRASRRRIRNRIDKLNRPDGSECTNVDELHQMVVDFYRNLFESEGTSNMN 416

Query: 1523 SIFSEFQSCITDEENSLLLQLPSEQEIKDVVFQIKPWSSPGNDGFQAAFYQQCWSPVGKE 1344
             +       +TD+ N  L    +E E+K+ +FQ+ P  SPG DGF A F+Q+ W   G +
Sbjct: 417  LVLDHIPRKVTDDMNLFLCAPFNETEVKNALFQMFPTKSPGPDGFPAHFFQRNWDVCGDD 476

Query: 1343 ERITTTKVSLLING 1302
                T  V  ++NG
Sbjct: 477  ---LTRMVLNVLNG 487


>ref|XP_004980332.1| PREDICTED: uncharacterized protein LOC101786154 [Setaria italica]
          Length = 720

 Score =  186 bits (472), Expect = 6e-44
 Identities = 130/502 (25%), Positives = 233/502 (46%), Gaps = 9/502 (1%)
 Frame = -2

Query: 2753 MKILSWNCQGFGNPSTRDYLHYCIQKHDPDILFLSKTKITDSQMHFYLQQYSYPNFWYAP 2574
            MKI++WNC+G GN +    L    ++ DPDILFLS+TK+ + ++     +    N     
Sbjct: 1    MKIIAWNCRGLGNGAAIRGLLNVQKEEDPDILFLSETKLDEHRIKGLRWKLGLTNVVVKD 60

Query: 2573 PVGQSGGISLAWKNGVDLEIMHTSRHNVHAIIHTHSHNLDFILSCMYGANNPMECQEQWQ 2394
             VG+SGG++L W+ GVD+ +   SR  + A +   + +  + L+  YG          W+
Sbjct: 61   CVGRSGGLALFWRTGVDVHVRWISRLYIDADVQ-EADSFSWRLTGFYGEPRTENKHLSWK 119

Query: 2393 YLLDMQPFVDLPWILMGDLNFTMLDSEIXXXXXXXXXXXSMIRHIIQQLGLIDLGFTGS- 2217
             L  +      PW+ +GD N  ++ +E               R  ++   L DLGF G  
Sbjct: 120  ALRTLNAARRKPWLCLGDFNEILMGAEKEGGLPRGQACMDRFRSALEDCELSDLGFAGDL 179

Query: 2216 STTWSNHRQGNDYTAVRLDRALGNISWINSYSTAHLIHIPPVASDHYPILLSTS----SG 2049
             T W+N    N+Y   RLDRA+ N  W   +    +I+  P  SDH P+++ T     SG
Sbjct: 180  FTWWNNSHNSNEYVRERLDRAVANEEWRAHFPLYKVINGEPRHSDHRPVVVLTEEDVHSG 239

Query: 2048 MTKKQFPFRLQRYWFSIASCTDIINTSWSHQVLGSPSYHLSSKLHYTKHQLQLWKKITFG 1869
              +    FR + +W     C  I+  +W   + G     + + +      L  W +   G
Sbjct: 240  ARRGGQSFRFEAHWVEEEQCAPIVENAWKTVMEGRRGTVMEA-VQSVATDLGDWNRNVLG 298

Query: 1868 NIETQIQTIQTHLDQCYNRNMPTSNP*VVNLTQSLQKW--LAIQKDFY-MQKSGAHFYD- 1701
            ++E +++ ++  ++ C  R  P +   +  +     +W  L  QKD Y  Q++ AH+   
Sbjct: 299  DLEKRMKQLRKAMEAC--RRGPINEQSLRRMELLKFRWERLEEQKDLYWRQRAKAHWVSK 356

Query: 1700 ADRNTSYFHSKANFNKKISHIFAIQDSLGIWHDSRFGIENTLLNHFSAISKSTKLASDNS 1521
             DRNT +FH  A+  +K S I  +    G   +    +E  + N++  +  +      + 
Sbjct: 357  GDRNTHFFHQYASERRKRSRIKRLVQDDGRVVEEDGELETLITNYYRNLFTTNAGDRMDE 416

Query: 1520 IFSEFQSCITDEENSLLLQLPSEQEIKDVVFQIKPWSSPGNDGFQAAFYQQCWSPVGKEE 1341
            +    +  +++E N  LL+  +E+E++  +  I    +PG DG  + FY++ W+ VG + 
Sbjct: 417  LLQHVKQVVSNEMNDQLLRDFTEEEVRQGLDAIGDLKAPGADGMSSLFYKKHWNIVGAD- 475

Query: 1340 RITTTKVSLLINGSPTASFSPT 1275
             +    ++ L +GS  A ++ T
Sbjct: 476  -VVREVLNFLNDGSMPAKWNET 496


>emb|CCA65997.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1363

 Score =  185 bits (469), Expect = 1e-43
 Identities = 129/479 (26%), Positives = 219/479 (45%), Gaps = 11/479 (2%)
 Frame = -2

Query: 2753 MKILSWNCQGFGNPSTRDYLHYCIQKHDPDILFLSKTKITDSQMHFYLQQYSYPNFWYAP 2574
            MK + WN +G  + +   +    ++ H PD+L L +TK +  +     ++  Y NF   P
Sbjct: 1    MKAIIWNVRGANSKAFLWHALDLVKMHKPDLLILLETKCSSLRADQATKRLGYVNFRIIP 60

Query: 2573 PVGQSGGISLAWKNGVDLEIMHTSRH---NVHAIIHTHSHNLDFILSCMYGANNPMECQE 2403
              G+ GGI L WK   D+ ++H + +   + HA+    S   + +L+ M+  +   E  +
Sbjct: 61   AFGKRGGIWLMWK--ADIALVHYADYQPNHFHALFKLRSDIPEVLLTGMHAPSVVSERNK 118

Query: 2402 QWQYLLDMQPFVDLPWILMGDLNFTMLDSEIXXXXXXXXXXXSMIRHIIQQLGLIDLGFT 2223
             W  L +  P    PW++ GD+N  +  +E               +  I    L+DLGF 
Sbjct: 119  YWVDLTEDSPPRGTPWLVAGDMNEVLHGNEKMGGRQVGKEQGKQCKDWIAANALLDLGFQ 178

Query: 2222 GSSTTWSNHRQGNDYTAVRLDRALGNISWINSYSTAHLIHIPPVASDHYPILLSTSSGMT 2043
            G   TW+N R G      RLDRAL N  W++ +    +IH+P   SDH P+L+  +    
Sbjct: 179  GPKFTWTNGRTGGSLIKERLDRALVNSEWLDLFPDTKVIHLPRTFSDHCPLLILFNENPR 238

Query: 2042 KKQFPFRLQRYWFSIASCTDIINTSW-SHQVLGSPSYHLSSKLHYTKHQLQLWKKITFGN 1866
             + FPFR +  W      T++I  +W SH      SY  +  L  +   ++ W K  FG+
Sbjct: 239  SESFPFRCKEVWAYHPDFTNVIEETWGSHH----NSYVAARDLFLS--SVKSWSKYVFGS 292

Query: 1865 IETQIQTIQTHLDQCYNRNMPTSNP*V------VNLTQSLQKWLAIQKDFYMQKSGAHFY 1704
            I  + + I   L     +   + +P V      ++L   L +    ++ F+ QK+G    
Sbjct: 293  IFQKKKRILARLGGI--QKSLSIHPSVFLSKLEIDLLVELNELSKQERVFWAQKAGIDRA 350

Query: 1703 D-ADRNTSYFHSKANFNKKISHIFAIQDSLGIWHDSRFGIENTLLNHFSAISKSTKLASD 1527
               D NT YFH+ A        I  +++    W  +   ++  +++HF  I  ++  +  
Sbjct: 351  KLGDMNTKYFHTLAKIRTCKRKISCLKNDNHDWVSNNEDLKKMMMSHFEKIFTTSMYSHQ 410

Query: 1526 NSIFSEFQSCITDEENSLLLQLPSEQEIKDVVFQIKPWSSPGNDGFQAAFYQQCWSPVG 1350
             +     +  I+DE N  L +   E EIK+ + Q+ P  SPG DG QA F+++ W  +G
Sbjct: 411  RNNSFRGECRISDEWNKRLARRVEEDEIKEALAQMAPLKSPGPDGIQAFFFKKYWEQMG 469



 Score = 61.6 bits (148), Expect = 2e-06
 Identities = 28/86 (32%), Positives = 51/86 (59%), Gaps = 5/86 (5%)
 Frame = -2

Query: 1178 LQVLKLQNTTIQQMDKIQRDYRWGKT--KHFIA---WNKVCFPIRHGGIGLKYLSHYNSA 1014
            +Q   L  + + +++K  R + W K    H++A   W+++C P   GG+G + L ++N A
Sbjct: 808  MQSSLLPVSVMNEIEKDCRKFLWNKMDKSHYLARMSWDRICSPTGKGGLGFRRLHNWNLA 867

Query: 1013 LLAKLAWNMIHSSTELWVQLLNGKYF 936
             +AKL W +I   T+LWV++L  +Y+
Sbjct: 868  FMAKLGWMIIKDETKLWVRILKARYW 893


>emb|CCA66054.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1355

 Score =  183 bits (465), Expect = 4e-43
 Identities = 138/495 (27%), Positives = 222/495 (44%), Gaps = 6/495 (1%)
 Frame = -2

Query: 2753 MKILSWNCQGFGNPSTRDYLHYCIQKHDPDILFLSKTKITDSQMHFYLQQYSYPNFWYAP 2574
            M IL WNC+G GNP +   L     +  PDI+F+S+T I   ++        + N +   
Sbjct: 1    MNILCWNCRGLGNPWSVRQLRSWSNQFAPDIIFVSETMINKIEVEALKSWLGFSNAFGVA 60

Query: 2573 PVGQSGGISLAWKNGVDLEIMHTSRHNVHAIIHTHSHNLDFILSCMYGANNPMECQEQWQ 2394
             VG++GG+ L WK  V   ++  S+H++   +   +    F+   +YG     E    W 
Sbjct: 61   SVGRAGGLCLYWKEEVMFSLVSFSQHHICGDVEDGNKKWRFV--GVYGWAKEEEKHLTWS 118

Query: 2393 YLLDMQPFVDLPWILMGDLNFTMLDSEIXXXXXXXXXXXSMIRHIIQQLGLIDLGFTGSS 2214
             L  +     LP +L GD N  +  +E               R  +  L L DLG+ G+ 
Sbjct: 119  LLRHLCEDTSLPILLGGDFNEILSAAEKEGGANRVRREMINFRDTLDTLALRDLGYVGTW 178

Query: 2213 TTWSNHRQGNDYTAVRLDRALGNISWINSYSTAHLIHIPPVASDHYPILL-STSSGMTK- 2040
             TW   R  +     RLDR L + SW++ Y  +   H     SDH  I+L S  +G  + 
Sbjct: 179  YTWERGRSPSTCIRERLDRYLCSNSWLDLYPDSVPEHTIRYKSDHSAIVLRSQRAGRPRG 238

Query: 2039 KQFPFRLQRYWFSIASCTDIINTSWSHQVLGSPSYHLSSKLHYTKHQLQLWKKITFGNIE 1860
            K      +  W     C  ++  SW +    S    ++ ++      L  W    F N+ 
Sbjct: 239  KTRRLHFETSWLLDDECEAVVRESWEN----SEGEVMTGRVASMGQCLVRWSTKKFKNLS 294

Query: 1859 TQIQTIQTHLDQCYNRNMPTSN-P*VVNLTQSLQKWLAIQKDF-YMQKSGAHFYDADRNT 1686
             QI+T +  L    N  +  S     V L + L +  A  + + Y++   A   D D+NT
Sbjct: 295  KQIETAEKALSVAQNNPISESACQECVLLEKKLDELHAKHEAYWYLRSRVAEVKDGDKNT 354

Query: 1685 SYFHSKANFNKKISHIFAIQDSLGIWHDSRFGIENTLLNHFSAISKSTKLA--SDNSIFS 1512
             YFH KA+  KK + +  + D LG W +    IEN   ++FS+I  S+  +  S  ++ S
Sbjct: 355  KYFHHKASQRKKRNFVKGLFDGLGTWREEADHIENIFTSYFSSIFTSSNPSDLSLEAVMS 414

Query: 1511 EFQSCITDEENSLLLQLPSEQEIKDVVFQIKPWSSPGNDGFQAAFYQQCWSPVGKEERIT 1332
              +  +T+E N  LL+  S+ EI   + Q+ P  +PG DG    FYQ+ W  VG +    
Sbjct: 415  VIEPVVTEEHNLKLLEPFSKDEILAALQQMHPCKAPGPDGMHVIFYQRFWHIVGDD---V 471

Query: 1331 TTKVSLLINGSPTAS 1287
            T+ +S +++G  + S
Sbjct: 472  TSFISNILHGHSSPS 486



 Score = 75.9 bits (185), Expect = 1e-10
 Identities = 64/265 (24%), Positives = 104/265 (39%), Gaps = 1/265 (0%)
 Frame = -1

Query: 867  YASELIHLE*GQ*NIDLLRNLFTSQQVDSILTIPL-VLQTQDTLIWTLTSSGVFTTKSTC 691
            + SELI  +  +    LL +    + +  IL  PL      D L W  T    ++ K+  
Sbjct: 962  WVSELIDFDRMEWKTSLLESFLNERDLRCILASPLSATPVPDELTWAFTKDATYSVKTAY 1021

Query: 690  HHLCDIDLQDSALSSLSKTYWFKFWKLKIPYKLLIFLWKIINNVLPVKSNLRHIPGIDDF 511
                 +  +   L +  +  W   W L +  K+  FLW++    LPV+S L+H    DD 
Sbjct: 1022 -----MIGKGGNLDNFHQA-WVDIWSLDVSPKVRHFLWRLCTTSLPVRSLLKHRHLTDDD 1075

Query: 510  SCPLYNTANVEDINHLLFTCPFATAVWKASLPQHFSLLLQHNTLIDWIRTWSSTDVVINF 331
             CP +    +E   H +F CP    +W  S  Q+        ++ D + +W S D  +  
Sbjct: 1076 LCP-WGCGEIETQRHAIFDCPKMRDLWLDSGCQNLCSRDASMSMCDLLVSWRSLDGKLRI 1134

Query: 330  SSVSPSIYGMIATMWHIWRYICQVPFRHVQINLNSVLLPMFKYLANITATLASHRHTCHH 151
                          W IW       F + +   +SVL+     L     + A   +    
Sbjct: 1135 KGA--------YLAWCIWGERNAKIFNN-KTTPSSVLMQRVSRLVEENGSHARRIYQPLV 1185

Query: 150  NQNIGQSHHWNPPPPDTLKVNIDAS 76
             +  G    W  PP D++K+N+DAS
Sbjct: 1186 PRRTGSPRQWIAPPADSIKLNVDAS 1210


>gb|AAD21778.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1715

 Score =  181 bits (458), Expect = 3e-42
 Identities = 127/472 (26%), Positives = 213/472 (45%), Gaps = 2/472 (0%)
 Frame = -2

Query: 2753 MKILSWNCQGFGNPSTRDYLHYCIQKHDPDILFLSKTKITDSQMHFYLQQYSYPNFWYAP 2574
            M++  WNCQG G P T   L    + +  D+LFL +TK  D+       +  + +     
Sbjct: 363  MRVGFWNCQGLGQPLTVRRLEEVQRVYFLDMLFLIETKQQDNYTRDLGVKMGFEDMCIIS 422

Query: 2573 PVGQSGGISLAWKNGVDLEIMHTSRHNVHAI-IHTHSHNLDFILSCMYGANNPMECQEQW 2397
            P G SGG+ + WK  + ++++    H+V  + ++    N +F LSC+YG   P E    W
Sbjct: 423  PRGLSGGLVVYWKKHLSIQVIS---HDVRLVDLYVEYKNFNFYLSCIYGHPIPSERHHLW 479

Query: 2396 QYLLDMQPFVDLPWILMGDLNFTMLDSEIXXXXXXXXXXXSMIRHIIQQLGLIDLGFTGS 2217
            + L  +      PW++ GD N  +  +E                ++I    + DL   G+
Sbjct: 480  EKLQRVSAHRSGPWMMCGDFNEILNLNEKKGGRRRSIGSLQNFTNMINCCNMKDLKSKGN 539

Query: 2216 STTWSNHRQGNDYTAVRLDRALGNISWINSYSTAHLIHIPPVASDHYPILLSTSSGMTKK 2037
              +W   RQ N+     LDR   N  W  S+       +P   SDH P+++  +  +  K
Sbjct: 540  PYSWVGKRQ-NETIESCLDRVFINSDWQASFPAFETEFLPIAGSDHAPVIIDIAEEVCTK 598

Query: 2036 QFPFRLQRYWFSIASCTDIINTSWSHQVLGSPSYHLSSKLHYTKHQLQLWKKITFGNIET 1857
            +  FR  R  F      D +   W+     S   +   KLH  + +L  WK+ T  N   
Sbjct: 599  RGQFRYDRRHFQFEDFVDSVQRGWNRGRSDSHGGYYE-KLHCCRQELAKWKRRTKTNTAE 657

Query: 1856 QIQTIQTHLDQCYNRNMPTSNP*VVNLTQSLQKWLAIQKDFYMQKSGAHFYD-ADRNTSY 1680
            +I+T++  +D    R+    +  ++ L Q L +    ++ ++  KS   +    DRNT +
Sbjct: 658  KIETLKYRVDAA-ERDHTLPHQTILRLRQDLNQAYRDEELYWHLKSRNRWMLLGDRNTMF 716

Query: 1679 FHSKANFNKKISHIFAIQDSLGIWHDSRFGIENTLLNHFSAISKSTKLASDNSIFSEFQS 1500
            F++     K  + I AI D+ GI +     I     N+F+ +  +T+ +    I S    
Sbjct: 717  FYASTKLRKSRNRIKAITDAQGIENFRDDTIGKVAENYFADLFTTTQTSDWEEIISGIAP 776

Query: 1499 CITDEENSLLLQLPSEQEIKDVVFQIKPWSSPGNDGFQAAFYQQCWSPVGKE 1344
             +T++ N  LLQ  ++QE++D VF I    +PG DGF AAFY   W  +G +
Sbjct: 777  KVTEQMNHELLQSVTDQEVRDAVFAIGADRAPGFDGFTAAFYHHLWDLIGND 828



 Score = 63.2 bits (152), Expect = 8e-07
 Identities = 59/239 (24%), Positives = 106/239 (44%), Gaps = 13/239 (5%)
 Frame = -1

Query: 750  QDTLIWTLTSSGVFTTKSTCHHLCDIDL-QDSALSSLSKTYWFK--FWKLKIPYKLLIFL 580
            +D+  W  T +  +T +S       ++L ++  ++ L      K   W+LKI  K+  F+
Sbjct: 1342 RDSYKWAYTRNTQYTVRSGYWVATHVNLTEEEIINPLEGDVPLKQEIWRLKITPKIKHFI 1401

Query: 579  WKIINNVLPVKSNLRHIPGIDDFSCPLYNTANVEDINHLLFTCPFATAVWKASLPQHFSL 400
            W+ ++  L   + LR+     D +C     A+ E INH++FTC +A  VW+++     + 
Sbjct: 1402 WRCLSGALSTTTQLRNRNIPADPTCQRCCNAD-ETINHIIFTCSYAQVVWRSANFSGSNR 1460

Query: 399  LLQHNTLIDWIRTWSSTDVVINFSSVSPSIYGMIA--TMWHIWR----YICQ----VPFR 250
            L   + L + IR         N     P + G++    MW +W+    Y+ Q     P++
Sbjct: 1461 LCFTDNLEENIRLILQGKKNQNL----PILNGLMPFWIMWRLWKSRNEYLFQQLDRFPWK 1516

Query: 249  HVQINLNSVLLPMFKYLANITATLASHRHTCHHNQNIGQSHHWNPPPPDTLKVNIDASY 73
              Q           + + N TA   SH     +++ + +S  W+ PP   LK N D+ Y
Sbjct: 1517 VAQ-KAEQEATEWVETMVNDTA--ISHNTAQSNDRPLSRSKQWSSPPEGFLKCNFDSGY 1572


>ref|XP_007022832.1| Uncharacterized protein TCM_026877 [Theobroma cacao]
            gi|508778198|gb|EOY25454.1| Uncharacterized protein
            TCM_026877 [Theobroma cacao]
          Length = 2367

 Score =  180 bits (456), Expect = 4e-42
 Identities = 107/358 (29%), Positives = 177/358 (49%), Gaps = 6/358 (1%)
 Frame = -2

Query: 2399 WQYLLDMQPFVDLPWILMGDLNFTMLDSEIXXXXXXXXXXXSMIRHIIQQLGLIDLGFTG 2220
            W  L  +   +++PW++ GD N  +   E                  +   GL+D GF G
Sbjct: 1171 WDCLRRLAADIEVPWLVGGDFNIILKREERLYGSAPHEGAMEDFASTLLDCGLLDGGFEG 1230

Query: 2219 SSTTWSNHRQGNDYTAVRLDRALGNISWINSYSTAHLIHIPPVASDHYPILLSTSSGMTK 2040
            +  TW+N+R        RLDR + N  WIN +    + H+    SDH P+L+S  +   K
Sbjct: 1231 NPFTWTNNRMFQ-----RLDRIVYNHHWINKFPITRIQHLNRDGSDHCPLLISCFNSSEK 1285

Query: 2039 KQFPFRLQRYWFSIASCTDIINTSWSHQVLGSPSYHLSSKLHYTKHQLQLWKKITFGNIE 1860
                FR Q  W         + ++W+  + GS      SK H  K  L+ W K+ FG+I 
Sbjct: 1286 APSSFRFQHAWVLHHDFKTSVESNWNLPINGSGLQAFWSKQHRLKQHLKWWNKVMFGDIF 1345

Query: 1859 TQIQTIQTHLDQC--YNRNMPTSNP*VVNLTQS---LQKWLAIQKDFYMQKSGAHFY-DA 1698
            ++++  +  +++C   ++N  T    ++ L +S   L K L I++ F+ QKSG  +  + 
Sbjct: 1346 SKLKEAEKRVEECEILHQNEQTVES-IIKLNKSYAQLNKQLNIEEIFWKQKSGVKWVVEG 1404

Query: 1697 DRNTSYFHSKANFNKKISHIFAIQDSLGIWHDSRFGIENTLLNHFSAISKSTKLASDNSI 1518
            +RNT +FH++    +  SHIF +Q+  G W + +  ++ + + +FS++ K          
Sbjct: 1405 ERNTKFFHTRMQKKRIRSHIFKVQEPDGRWIEDQEQLKQSAIKYFSSLLKFEPCDDSRFQ 1464

Query: 1517 FSEFQSCITDEENSLLLQLPSEQEIKDVVFQIKPWSSPGNDGFQAAFYQQCWSPVGKE 1344
             S   S I++ EN LL   P+ QE+KD VF I P S+ G DGF + FYQQCW+ +  +
Sbjct: 1465 RSLIPSIISNSENELLCAEPNLQEVKDAVFGIDPESAAGPDGFSSYFYQQCWNIIAHD 1522


>pir||S65812 RNA-directed DNA polymerase (EC 2.7.7.49) (clone DW15) - Arabidopsis
            thaliana retrotransposon Ta11-1 gi|976278|gb|AAA75254.1|
            reverse transcriptase [Arabidopsis thaliana]
          Length = 1333

 Score =  180 bits (456), Expect = 4e-42
 Identities = 126/468 (26%), Positives = 216/468 (46%), Gaps = 3/468 (0%)
 Frame = -2

Query: 2753 MKILSWNCQGFG--NPSTRDYLHYCIQKHDPDILFLSKTKITDSQMHFYLQQYSYPNFWY 2580
            M ++SWNCQG G     T   L      H P++LFL +TK   + +    +   Y   + 
Sbjct: 1    MSLVSWNCQGLGWSQDLTIPRLMEMRLSHFPEVLFLMETKNCSNVVVDLQEWLGYERVFT 60

Query: 2579 APPVGQSGGISLAWKNGVDLEIMHTSRHNVHAIIHTHSHNLDFILSCMYGANNPMECQEQ 2400
              P+G SGG++L WK GVD+ I +  ++ +   I   SH  +F +SC+YG     +    
Sbjct: 61   VNPIGLSGGLALFWKKGVDIVIKYADKNLIDFQIQFGSH--EFYVSCVYGNPAFSDKHLV 118

Query: 2399 WQYLLDMQPFVDLPWILMGDLNFTMLDSEIXXXXXXXXXXXSMIRHIIQQLGLIDLGFTG 2220
            W+ +  +      PW ++GD N  + + E                 ++    +++L   G
Sbjct: 119  WEKITRIGINRKEPWCMLGDFNPILHNGEKRGGPRRGDSSFLPFTDMLDSCDMLELPSIG 178

Query: 2219 SSTTWSNHRQGNDYTAVRLDRALGNISWINSYSTAHLIHIPPVASDHYPILLSTSSGMTK 2040
            +  TW   +    +   RLDR  GN +W   +  ++   +    SDH P+L+  +    +
Sbjct: 179  NPFTWGG-KTNEMWIQSRLDRCFGNKNWFRFFPISNQEFLDKRGSDHRPVLVRLTKTKEE 237

Query: 2039 KQFPFRLQRYWFSIASCTDIINTSWSHQVLGSPSYHLSSKLHYTKHQLQLWKKITFGNIE 1860
             +  FR  +  F+  +  + I  +W+          L  KL + +  L  WKK    N  
Sbjct: 238  YRGNFRFDKRLFNQPNVKETIVQAWNGSQRNENLLVLD-KLKHCRSALSRWKKENNINSS 296

Query: 1859 TQIQTIQTHLDQCYNRNMPTSNP*VVNLTQSLQKWLAIQKDFYMQKSGAHF-YDADRNTS 1683
            T+I   +  L+   +   P ++  V +L   L K    ++ F+ QKS A + +  D+NTS
Sbjct: 297  TRITQARAALELEQSSGFPRADL-VFSLKNDLCKANHDEEVFWSQKSRAKWMHSGDKNTS 355

Query: 1682 YFHSKANFNKKISHIFAIQDSLGIWHDSRFGIENTLLNHFSAISKSTKLASDNSIFSEFQ 1503
            +FH+    N+   HI  + D  G++H            +FS + KST  +S   +F ++Q
Sbjct: 356  FFHASVKDNRGKQHIDQLCDVNGLFHKDEMNKGAIAEAYFSDLFKSTDPSSFVDLFEDYQ 415

Query: 1502 SCITDEENSLLLQLPSEQEIKDVVFQIKPWSSPGNDGFQAAFYQQCWS 1359
              +T+  N+ L+   S+ EI++ VF I+  S+PG DGF   F+Q+ WS
Sbjct: 416  PRVTESMNNTLIAAVSKNEIREAVFAIRSSSAPGVDGFTGFFFQKYWS 463


>ref|XP_007008704.1| Uncharacterized protein TCM_042330 [Theobroma cacao]
            gi|508725617|gb|EOY17514.1| Uncharacterized protein
            TCM_042330 [Theobroma cacao]
          Length = 2249

 Score =  179 bits (455), Expect = 6e-42
 Identities = 109/375 (29%), Positives = 181/375 (48%), Gaps = 5/375 (1%)
 Frame = -2

Query: 2453 FILSCMYGANNPMECQEQWQYLLDMQPFVDLPWILMGDLNFTMLDSEIXXXXXXXXXXXS 2274
            F ++ +Y      E    W  L  +   +++PW++ GD N  +   E             
Sbjct: 981  FFVTIVYAKCTRSERTLLWDCLRRLADDIEVPWLVGGDFNVILKREERLYGSAPHEGAME 1040

Query: 2273 MIRHIIQQLGLIDLGFTGSSTTWSNHRQGNDYTAVRLDRALGNISWINSYSTAHLIHIPP 2094
                 +   GL+D GF G+S TW+N+R        RLDR + N  WIN +    + H+  
Sbjct: 1041 DFASTLLDCGLLDGGFEGNSFTWTNNRMFQ-----RLDRIVYNHHWINKFPVTRIQHLNR 1095

Query: 2093 VASDHYPILLSTSSGMTKKQFPFRLQRYWFSIASCTDIINTSWSHQVLGSPSYHLSSKLH 1914
              SDH P+L+S  +   K    FR Q  W         + ++W+  + GS      SK H
Sbjct: 1096 DGSDHCPLLISCFNSSEKAPSSFRFQHAWVLHHDFKTSVESNWNLPINGSGLQAFWSKQH 1155

Query: 1913 YTKHQLQLWKKITFGNIETQIQTIQTHLDQC---YNRNMPTSNP*VVNLTQS-LQKWLAI 1746
              K  L+ W K  FG+I ++++  +  +++C   + +     +   +N + + L K L I
Sbjct: 1156 RLKQHLKWWNKAVFGDIFSKLKEAEKRVEECEILHQQEQTFESRIKLNKSYAQLNKQLNI 1215

Query: 1745 QKDFYMQKSGAHFY-DADRNTSYFHSKANFNKKISHIFAIQDSLGIWHDSRFGIENTLLN 1569
            ++ F+ QKSG  +  + +RNT +FH +    +  SHIF +QD  G W + +  ++++ + 
Sbjct: 1216 EELFWKQKSGVKWVVEGERNTKFFHMRMQKKRIRSHIFKVQDPEGRWIEDQEQLKHSAIE 1275

Query: 1568 HFSAISKSTKLASDNSIFSEFQSCITDEENSLLLQLPSEQEIKDVVFQIKPWSSPGNDGF 1389
            +FS++ K           S   S I++ EN LL   PS QE+KD VF I   S+ G DGF
Sbjct: 1276 YFSSLLKVEPCYDSRFQSSLIPSIISNSENELLCAEPSLQEVKDAVFGINSESAAGPDGF 1335

Query: 1388 QAAFYQQCWSPVGKE 1344
             + FYQQCW+ + ++
Sbjct: 1336 SSYFYQQCWNIIAQD 1350


>gb|AAB82639.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1374

 Score =  179 bits (455), Expect = 6e-42
 Identities = 117/486 (24%), Positives = 221/486 (45%), Gaps = 1/486 (0%)
 Frame = -2

Query: 2753 MKILSWNCQGFGNPSTRDYLHYCIQKHDPDILFLSKTKITDSQMHFYLQQYSYPNFWYAP 2574
            M+ILSWNCQG GN  T  +L      + P+++FL +TK   + +   +    + +     
Sbjct: 1    MRILSWNCQGVGNTPTVRHLREIRGLYFPEVIFLCETKKRRNYLENVVGHLGFFDLHTVE 60

Query: 2573 PVGQSGGISLAWKNGVDLEIMHTSRHNVHAIIHTHSHNLDFILSCMYGANNPMECQEQWQ 2394
            P+G+SGG++L WK+ V ++++ + +  + A++     + +F L+C+YG     E  E W+
Sbjct: 61   PIGKSGGLALMWKDSVQIKVLQSDKRLIDALLIW--QDKEFYLTCIYGEPVQAERGELWE 118

Query: 2393 YLLDMQPFVDLPWILMGDLNFTMLDSEIXXXXXXXXXXXSMIRHIIQQLGLIDLGFTGSS 2214
             L  +      PW+L GD N  +  SE               R ++   GL ++  +G  
Sbjct: 119  RLTRLGLSRSGPWMLTGDFNELVDPSEKIGGPARKESSCLEFRQMLNSCGLWEVNHSGYQ 178

Query: 2213 TTWSNHRQGNDYTAVRLDRALGNISWINSYSTAHLIHIPPVASDHYPILLSTSSGMTKKQ 2034
             +W  +R  ++    RLDR + N +W+  +  A   ++  + SDH P++ +      +K 
Sbjct: 179  FSWYGNR-NDELVQCRLDRTVANQAWMELFPQAKATYLQKICSDHSPLINNLVGDNWRKW 237

Query: 2033 FPFRLQRYWFSIASCTDIINTSWSHQVLGSPSYHLSSKLHYTKHQLQLWKKITFGNIETQ 1854
              F+  + W       D++   WS Q   + +  +  K+   + ++  WK+++  +   +
Sbjct: 238  AGFKYDKRWVQREGFKDLLCNFWSQQSTKTNAL-MMEKIASCRREISKWKRVSKPSSAVR 296

Query: 1853 IQTIQTHLDQCYNRNMPTSNP*VVNLTQSLQKWLAIQKDFYMQKSGAHFY-DADRNTSYF 1677
            IQ +Q  LD    + +P     +  L + L +    ++ F+ +KS   +  + DRNT YF
Sbjct: 297  IQELQFKLDAA-TKQIPFDRRELARLKKELSQEYNNEEQFWQEKSRIMWMRNGDRNTKYF 355

Query: 1676 HSKANFNKKISHIFAIQDSLGIWHDSRFGIENTLLNHFSAISKSTKLASDNSIFSEFQSC 1497
            H+     +  + I  + D  G    S   +      +F  +  S  +             
Sbjct: 356  HAATKNRRAQNRIQKLIDEEGREWTSDEDLGRVAEAYFKKLFASEDVGYTVEELENLTPL 415

Query: 1496 ITDEENSLLLQLPSEQEIKDVVFQIKPWSSPGNDGFQAAFYQQCWSPVGKEERITTTKVS 1317
            ++D+ N+ LL   +++E++   F I P   PG DG     YQQ W  +G  ++IT    +
Sbjct: 416  VSDQMNNNLLAPITKEEVQRATFSINPHKCPGPDGMNGFLYQQFWETMG--DQITEMVQA 473

Query: 1316 LLINGS 1299
               +GS
Sbjct: 474  FFRSGS 479



 Score = 68.6 bits (166), Expect = 2e-08
 Identities = 48/155 (30%), Positives = 75/155 (48%), Gaps = 5/155 (3%)
 Frame = -1

Query: 870  HYASELIHLE*GQ*NIDLLRNLFTSQQVDSILTI-PLVLQTQDTLIWTLTSSGVFTTKST 694
            H   +L+  +    N +L+  LF     ++IL + P   +T+D   W  + SG ++ KS 
Sbjct: 966  HVVKDLLLPDGRDWNWNLVSLLFPDNTQENILALRPGGKETRDRFTWEYSRSGHYSVKSG 1025

Query: 693  CHHLCDIDLQ----DSALSSLSKTYWFKFWKLKIPYKLLIFLWKIINNVLPVKSNLRHIP 526
               + +I  Q       L       + + WKL +P K+  FLW+ +NN L V SNL +  
Sbjct: 1026 YWVMTEIINQRNNPQEVLQPSLDPIFQQIWKLDVPPKIHHFLWRCVNNCLSVASNLAYRH 1085

Query: 525  GIDDFSCPLYNTANVEDINHLLFTCPFATAVWKAS 421
               + SC +   ++ E +NHLLF CPFA   W  S
Sbjct: 1086 LAREKSC-VRCPSHGETVNHLLFKCPFARLTWAIS 1119


>gb|EEE61581.1| hypothetical protein OsJ_15963 [Oryza sativa Japonica Group]
          Length = 1494

 Score =  178 bits (452), Expect = 1e-41
 Identities = 119/454 (26%), Positives = 209/454 (46%), Gaps = 7/454 (1%)
 Frame = -2

Query: 2684 IQKHDPDILFLSKTKITDSQMHFYLQQYSYP----NFWYAPPVGQSGGISLAWKNGVDLE 2517
            +Q H   I+FLS+T+    Q  FY+    +       +    VG+ GG++L W   + +E
Sbjct: 270  VQTHSSKIVFLSETR----QDQFYVSNLKWRLGLRRCFVVNGVGKGGGLALFWDESLKVE 325

Query: 2516 IMHTSRHNVHAIIHTHSHNLDFILSCMYGANNPMECQEQWQYLLDMQPFVDLPWILMGDL 2337
            +   +  ++  II T      +  + +YG        E W  L  ++     PW+++GD 
Sbjct: 326  LKSYNMRHIDVII-TEPEGARWTATFVYGEPKAQNRHEMWNLLRRIRLNASDPWLMIGDF 384

Query: 2336 NFTMLDSEIXXXXXXXXXXXSMIRHIIQQLGLIDLGFTGSSTTWSNHRQGNDYTAVRLDR 2157
            N  M   E               R ++ +  L D+GF G   T+ N++   +   VRLDR
Sbjct: 385  NEAMWQIEHRSRVKHSERQMRDFREVLVECDLQDIGFQGVPWTYDNNQASPNNVKVRLDR 444

Query: 2156 ALGNISWINSYSTAHLIHIPPVASDHYPILLSTSSGMTKKQFPFR--LQRYWFSIASCTD 1983
            A+ +  W   +  A+++H+    SDH P+LL     M +++       +  W  + S   
Sbjct: 445  AVASPVWRAMFDQANIMHLTTACSDHVPLLLEKGGNMQQRRRSKINCFEAVWERVKSFNS 504

Query: 1982 IINTSWSHQVLGSPSYHLSSKLHYTKHQLQLWKKITFGNIETQIQTIQTHLDQCYNRNMP 1803
            I + SW    L      + +KL YT   L+ W +   GNI+  I+  +  L++   R   
Sbjct: 505  IEHESWDDGGLAKNLGDVRTKLAYTMENLKRWSRDKIGNIKKSIERCRRELEEMRMRGRE 564

Query: 1802 TSNP*VVNLTQSLQKWLAIQKDFYMQKSGAHFY-DADRNTSYFHSKANFNKKISHIFAIQ 1626
             S P V  L   LQ+ L  ++ ++ Q+S   +  + DRNT YFH KA++  + + I  ++
Sbjct: 565  DSEPDVHRLKIFLQELLHREEIWWKQRSRITWLKEGDRNTRYFHLKASWRARKNLIKKLR 624

Query: 1625 DSLGIWHDSRFGIENTLLNHFSAISKSTKLASDNSIFSEFQSCITDEENSLLLQLPSEQE 1446
             S G+       +     + F  +    +  +   + + F+  ITDE N +L +  +++E
Sbjct: 625  RSDGMMCSKEEELGEIARSFFRDLYTKDESLNPGELLNMFEPKITDEMNGMLTKPFTDEE 684

Query: 1445 IKDVVFQIKPWSSPGNDGFQAAFYQQCWSPVGKE 1344
            I D +FQI P  +PG DGF A F+Q+ W  + ++
Sbjct: 685  ISDALFQIGPLKAPGPDGFPARFFQRNWGVLKRD 718



 Score = 63.2 bits (152), Expect = 8e-07
 Identities = 33/109 (30%), Positives = 58/109 (53%), Gaps = 5/109 (4%)
 Frame = -2

Query: 1172 VLKLQNTTIQQMDKIQRDYRWGKTK-----HFIAWNKVCFPIRHGGIGLKYLSHYNSALL 1008
            V K+     ++ +++ R++ WG  K     H+IAW K+  P   GG+G + +  +N ALL
Sbjct: 1059 VFKMPERFCEEYEQLVRNFWWGHEKGEKKVHWIAWEKLTSPKLLGGLGFRDIRCFNQALL 1118

Query: 1007 AKLAWNMIHSSTELWVQLLNGKYFSLYELTHEPPPNCENASWIWQSIMH 861
            A+ AW +I S   L  ++L  KY+    +T    P+  + +  W+ I+H
Sbjct: 1119 ARQAWRLIESPDSLCARVLKAKYYPNGTITDTAFPSVSSPT--WKGIVH 1165


>emb|CCA66050.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1357

 Score =  177 bits (449), Expect = 3e-41
 Identities = 126/478 (26%), Positives = 219/478 (45%), Gaps = 8/478 (1%)
 Frame = -2

Query: 2753 MKILSWNCQGFGNPSTRDYLHYCIQKHDPDILFLSKTKITDSQMHFYLQQYSYPNFWYAP 2574
            MKIL WNCQG GNP T   L   +  + PD LF+S+TK+T + +    +   +   +   
Sbjct: 1    MKILCWNCQGMGNPWTVRQLRRLMASNTPDSLFMSETKVTKNIVEQKKESLGFSGAFGVS 60

Query: 2573 PVGQSGGISLAWK-NGVDLEIMHTSRHNVHAIIHTHSHNLDFILSCMYGANNPMECQEQW 2397
             VG++GG+ + WK   +   ++  S++++   + ++  ++ +    +YG        + W
Sbjct: 61   CVGRAGGLCMFWKEETISFRMVSFSQNHICGDVGSNG-DVRWRFVGIYGWPEEENKHKTW 119

Query: 2396 QYLLDMQPFVDLPWILMGDLNFTMLDSEIXXXXXXXXXXXSMIRHIIQQLGLIDLGFTGS 2217
              +  +    + P +  GD N  +   E               R+++    L DL F G 
Sbjct: 120  ALIKGLCDEYEGPIVFGGDFNEILSYDEKEGGASRERRAIVGFRNVMDDCSLGDLRFVGQ 179

Query: 2216 STTWSNHRQGNDYTAVRLDRALGNISWINSYSTAHLIHIPPVASDHYPILLST--SSGMT 2043
              TW   R        RLDR + + SW++ +  A + H     SDH  I+L    + GM 
Sbjct: 180  WHTWERGRSPESRIRERLDRFIVSRSWLHLFPEAFIDHQVRYCSDHAAIVLRCLGNEGMP 239

Query: 2042 KKQF-PFRLQRYWFSIASCTDIINTSWSHQVLGSPSYHLSSKLHYTKHQLQLWKKITFGN 1866
            +++   F  + +W    +C +++  +W+     +    +  KL     +LQ W K TFG+
Sbjct: 240  RRRAGGFWFETFWLLDDTCEEVVRGAWN----AAEGGRICEKLGAVARELQGWSKKTFGS 295

Query: 1865 IETQIQTIQTHLDQCYNRNMPTSN-P*VVNLTQSLQKWLAIQKDF-YMQKSGAHFYDADR 1692
            +  +I+ ++  L           +    V L + L +  A  + + Y++   A   D DR
Sbjct: 296  LRKKIEAVEKKLHAAQGEATSIDSWERCVGLERELDELHAKNEAYWYLRSRVAEVKDGDR 355

Query: 1691 NTSYFHSKANFNKKISHIFAIQDSLGIWHDSRFGIENTLLNHFSAISKSTKLASDN--SI 1518
            NTSYFH KA+  KK + I  I D  G W      IE  +  +F  I  S++ +S++   +
Sbjct: 356  NTSYFHHKASQRKKRNLIHGIFDGGGRWQTEGEEIECVVERYFQEIFTSSEPSSNDFQEV 415

Query: 1517 FSEFQSCITDEENSLLLQLPSEQEIKDVVFQIKPWSSPGNDGFQAAFYQQCWSPVGKE 1344
                +  +T E N +LL+  S++EI   +  + P  +PG DG  A FYQ+ W  +G E
Sbjct: 416  LQHVKRSVTQEYNDILLKPYSKEEIFAALSDMHPCKAPGPDGMHAIFYQRFWHIIGDE 473



 Score = 66.6 bits (161), Expect = 7e-08
 Identities = 59/262 (22%), Positives = 102/262 (38%), Gaps = 1/262 (0%)
 Frame = -1

Query: 858  ELIHLE*GQ*NIDLLRNLFTSQQVDSILTIPLVLQT-QDTLIWTLTSSGVFTTKSTCHHL 682
            +L+ +E  + N++L+   F  +    IL IPL  +  QD L W  +  G ++ K+     
Sbjct: 968  DLMDVERKEWNVELIERHFNERDQQCILAIPLSTRCLQDELTWAYSKDGTYSVKTAYMLG 1027

Query: 681  CDIDLQDSALSSLSKTYWFKFWKLKIPYKLLIFLWKIINNVLPVKSNLRHIPGIDDFSCP 502
               +L D          W   W L +  K+  FLW+   + LPV+  L+    ID+  CP
Sbjct: 1028 KGGNLDDF------HRVWNILWSLNVSPKVRHFLWRACTSSLPVRKVLQRRHLIDEAGCP 1081

Query: 501  LYNTANVEDINHLLFTCPFATAVWKASLPQHFSLLLQHNTLIDWIRTWSSTDVVINFSSV 322
                 + E   HL + CP +  +W+          ++   + D +  WS  D  +    V
Sbjct: 1082 CCARED-ETQFHLFYRCPMSLKLWEELGSYILLPGIEDEAMCDTLVRWSQMDAKV----V 1136

Query: 321  SPSIYGMIATMWHIWRYICQVPFRHVQINLNSVLLPMFKYLANITATLASHRHTCHHNQN 142
                Y     +W++W    +  F H       V   + + + +              +  
Sbjct: 1137 QKGCY----ILWNVWVERNRRVFEHTSQPATVVGQRIMRQVEDFNNYAVKIYGGMRSSAA 1192

Query: 141  IGQSHHWNPPPPDTLKVNIDAS 76
            +  S  W  PP   +K+N DAS
Sbjct: 1193 LSPS-RWYAPPVGAIKLNTDAS 1213



 Score = 60.8 bits (146), Expect = 4e-06
 Identities = 32/109 (29%), Positives = 56/109 (51%), Gaps = 7/109 (6%)
 Frame = -2

Query: 1181 LLQVLKLQNTTIQQMDKIQRDYRWG-----KTKHFIAWNKVCFPIRHGGIGLKYLSHYNS 1017
            L+ V KL    IQ++      + WG     +  H+++W K+C P   GG+G K L+ +N 
Sbjct: 811  LMGVYKLPVAVIQEIHSAMARFWWGGKGDERKMHWLSWEKMCKPKCMGGMGFKDLAVFND 870

Query: 1016 ALLAKLAWNMIHSSTELWVQLLNGKYFSLYELTHEPPPNCENASW--IW 876
            ALL K  W ++H+   L  ++++ KY+   ++ +       + SW  IW
Sbjct: 871  ALLGKQVWRLLHNKESLLSRVMSAKYYPHGDVRYARLGYSHSYSWRSIW 919


>emb|CCA66044.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1355

 Score =  176 bits (447), Expect = 5e-41
 Identities = 136/498 (27%), Positives = 222/498 (44%), Gaps = 9/498 (1%)
 Frame = -2

Query: 2753 MKILSWNCQGFGNPSTRDYLHYCIQKHDPDILFLSKTKITDSQMHFYLQQYSYPNFWYAP 2574
            M IL WNC+G GNP T   L      + PDI+FLS+T I  ++      +  + N +   
Sbjct: 1    MNILCWNCRGVGNPRTVRQLRKWSTFYAPDIMFLSETMINKTESEALKSRLGFANAFGVS 60

Query: 2573 PVGQSGGISLAWKNGVDLEIMHTSRHNVHAIIHTHSHNLDFILSCMYGANNPMECQEQWQ 2394
              G++GG+ + W+  +   ++  S+H++   I   +    F+   +YG     E    W 
Sbjct: 61   SRGRAGGLCVFWREELSFSLVSFSQHHICGDIDDGAKKWRFV--GIYGWAKEEEKHHTWS 118

Query: 2393 YLLDMQPFVDLPWILMGDLNFTMLDSEIXXXXXXXXXXXSMIRHIIQQLGLIDLGFTGSS 2214
             +  +   +  P ++ GD N  M   E               R  +  L L DLG+ G  
Sbjct: 119  LMRFLCEDLSRPILMGGDFNEIMSYEEKEGGADRVRRGMYQFRETMDDLFLRDLGYNGVW 178

Query: 2213 TTWSNHRQGNDYTAVRLDRALGNISWINSYSTAHLIHIPPVASDHYPILLSTSSGM--TK 2040
             TW      +     RLDR + + SW   Y    + H     SDH  I L ++     T 
Sbjct: 179  HTWERGNSLSTCIRERLDRFVCSPSWATMYPNTIVDHSMRYKSDHLAICLRSNRTRRPTS 238

Query: 2039 KQFPFRLQRYWFSIASCTDIINTSWSHQVLGSPSYHLSSKLHYTKHQLQLWKKITFGNIE 1860
            KQ  F  +  W    +C + I  +W+     S    L+ +L     +L+ W     GNI 
Sbjct: 239  KQRRFFFETSWLLDPTCEETIRDAWT----DSAGDSLTGRLDLLALKLKSWSSEKGGNIG 294

Query: 1859 TQIQTIQTHLDQCYNRNMPTSNP*V---VNLTQSLQKWLAIQK-DFYMQKSGAHFYDADR 1692
             Q+  +++  D C  +  P S+      + L + L +  A Q+  +Y++       D DR
Sbjct: 295  KQLGRVES--DLCRLQQQPISSANCEARLTLEKKLDELHAKQEARWYLRSRAMEVRDGDR 352

Query: 1691 NTSYFHSKANFNKKISHIFAIQDSLGIWHDSRFGIENTLLNHFSAISKSTKLASD---NS 1521
            NT YFH KA+  KK + +  + D+ G W +    IE    ++F++I  ST   SD   N 
Sbjct: 353  NTKYFHHKASQRKKRNFVKGLFDASGTWCEEVDDIECVFTDYFTSIFTSTN-PSDVQLND 411

Query: 1520 IFSEFQSCITDEENSLLLQLPSEQEIKDVVFQIKPWSSPGNDGFQAAFYQQCWSPVGKEE 1341
            +       +T+E N+ LL+  S++E+   + Q+ P  +PG DG  A FYQ+ W  +G + 
Sbjct: 412  VLCCVDPVVTEECNTWLLKPFSKEELYVALSQMHPCKAPGPDGMHAIFYQKFWHIIGDD- 470

Query: 1340 RITTTKVSLLINGSPTAS 1287
               T  VS +++GS + S
Sbjct: 471  --VTQFVSSILHGSISPS 486



 Score = 90.1 bits (222), Expect = 6e-15
 Identities = 71/267 (26%), Positives = 118/267 (44%), Gaps = 5/267 (1%)
 Frame = -1

Query: 861  SELIHLE*GQ*NIDLLRNLFTSQQVDSILTIPLV-LQTQDTLIWTLTSSGVFTTKSTCHH 685
            SELI  +  +  + L+  +F  + +  IL+IPL  L  +D L W  T +  ++ K T + 
Sbjct: 964  SELIDFDRMEWKVSLIETVFNERDIKCILSIPLSSLPLKDELTWAFTKNAHYSVK-TAYM 1022

Query: 684  LCDIDLQDSALSSLSKTYWFKFWKLKIPYKLLIFLWKIINNVLPVKSNLRHIPGIDDFSC 505
            L      DS   +     W   W +++  K+  FLW++  N LPV+S L+H   +DD  C
Sbjct: 1023 LGKGGNLDSFHQA-----WIDIWSMEVSPKVKHFLWRLGTNTLPVRSLLKHRHMLDDDLC 1077

Query: 504  PLYNTANVEDINHLLFTCPFATAVWKASLPQHFSLLLQHNTLIDWIRTWSSTDVVINFSS 325
            P       E   H +F CPF   +W  S   +F  L     +         T+ ++N   
Sbjct: 1078 P-RGCGEPESQFHAIFGCPFIRDLWVDSGCDNFRALTTDTAM---------TEALVNSHG 1127

Query: 324  VSPSIYGMIATM-WHIWRYICQVPFRHVQINLNSVLLPMFKYL---ANITATLASHRHTC 157
            +  S+    A M W +W     + F       + +L  + + +      TA +  +R+ C
Sbjct: 1128 LDASVRTKGAFMAWVLWSERNSIVFNQSSTPPHILLARVSRLVEEHGTYTARIYPNRNCC 1187

Query: 156  HHNQNIGQSHHWNPPPPDTLKVNIDAS 76
                 I  +  W  PPP+ +K+N+DAS
Sbjct: 1188 ----AIPSARVWAAPPPEVIKLNVDAS 1210


>emb|CCA66036.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1369

 Score =  176 bits (446), Expect = 6e-41
 Identities = 115/492 (23%), Positives = 219/492 (44%), Gaps = 11/492 (2%)
 Frame = -2

Query: 2747 ILSWNCQGFGNPSTRDYLHYCIQKHDPDILFLSKTKITDSQMHFYLQQYSYPNFWYAPPV 2568
            ILSWNC+G G+PS    L   +   +P I+FLS+TK+   +M    ++  + +       
Sbjct: 4    ILSWNCRGMGSPSALSALRRLLASENPQIVFLSETKLKSYEMESVKKKLKWEHMVAVDCE 63

Query: 2567 GQS----GGISLAWKNGVDLEIMHTSRHNVHAIIHTHSHNLDFILSCMYGANNPMECQEQ 2400
            G+     GG+++ W++ + +++M  S +++  ++   +   ++  + +YG        + 
Sbjct: 64   GECRKRRGGLAMLWRSEIKVQVMSMSSNHIDIVVGEEAQG-EWRFTGIYGYPEEEHKDKT 122

Query: 2399 WQYLLDMQPFVDLPWILMGDLNFTMLDSEIXXXXXXXXXXXSMIRHIIQQLGLIDLGFTG 2220
               L  +      PW+  GD N  ++ SE             + R+ +++   +DLGF G
Sbjct: 123  GALLSALARASRRPWLCGGDFNLMLVASEKKGGDGFNSREADIFRNAMEECHFMDLGFVG 182

Query: 2219 SSTTWSNHRQGNDYTAVRLDRALGNISWINSYSTAHLIHIPPVASDHYPILLS-----TS 2055
               TW+N+R G+     RLDR + N  W   +  + + H+P   SDH PI+ S     ++
Sbjct: 183  YEFTWTNNRGGDANIQERLDRFVANDLWKIKFPGSFVSHLPKRKSDHVPIVASVKGAQSA 242

Query: 2054 SGMTKKQFPFRLQRYWFSIASCTDIINTSWSHQVLGSPSYHLSSKLHYTKHQLQLWKKIT 1875
            +  TKK   FR +  W       +++  +W               L  T ++L  W K  
Sbjct: 243  ATRTKKSKRFRFEAMWLREGESDEVVKETWMR------GTDAGINLARTANKLLSWSKQK 296

Query: 1874 FGNIETQIQTIQTHLDQCYNRNMPTSNP*VVNLTQSLQKWLAIQKDFYMQKSGAHFY--D 1701
            FG++  +I+  Q  +           N   +    +    L  +++ Y  +     +   
Sbjct: 297  FGHVAKEIRMCQHQMKVLMESEPSEDNIMHMRALDARMDELEKREEVYWHQRSRQDWIKS 356

Query: 1700 ADRNTSYFHSKANFNKKISHIFAIQDSLGIWHDSRFGIENTLLNHFSAISKSTKLASDNS 1521
             D+NT +FH KA+  ++ +++  I++  G W +    +     ++F  + +S      + 
Sbjct: 357  GDKNTKFFHQKASHREQRNNVRRIRNEAGEWFEDEDDVTECFAHYFENLFQSGNNCEMDP 416

Query: 1520 IFSEFQSCITDEENSLLLQLPSEQEIKDVVFQIKPWSSPGNDGFQAAFYQQCWSPVGKEE 1341
            I +  +  ITDE  + L      +E+   + Q+ P  +PG DG  A FYQ  W  +G++ 
Sbjct: 417  ILNIVKPQITDELGTQLDAPFRREEVSAALAQMHPNKAPGPDGMNALFYQHFWDTIGED- 475

Query: 1340 RITTTKVSLLIN 1305
               TTKV  ++N
Sbjct: 476  --VTTKVLNMLN 485



 Score = 80.1 bits (196), Expect = 6e-12
 Identities = 69/259 (26%), Positives = 114/259 (44%), Gaps = 5/259 (1%)
 Frame = -1

Query: 828  NIDLLRNLFTSQQVDSILTIPLVLQTQ-DTLIWTLTSSGVFTTKSTCHH--LCDIDLQDS 658
            N++LL  LF   +  +I  IP+ LQ + D  +W ++ +G FT +S  +H  L D     S
Sbjct: 982  NVELLNTLFQPWESTAIQRIPVALQKKPDQWMWMMSKNGQFTVRSAYYHELLEDRKTGPS 1041

Query: 657  ALSSLSKTYWFKFWKLKIPYKLLIFLWKIINNVLPVKSNLRHIPGIDDFSCPLYNTANVE 478
                 +   W K WK KIP K+ +F WK I+N L V +N+R      D +CP       E
Sbjct: 1042 TSRGPNLKLWQKIWKAKIPPKVKLFSWKAIHNGLAVYTNMRKRGMNIDGACPRCGEKE-E 1100

Query: 477  DINHLLFTCPFATAVWKASLPQHFSLLLQHNTLID--WIRTWSSTDVVINFSSVSPSIYG 304
               HL++ C  ++  W      + S L  H   I+    R W  + +  +  +   +++ 
Sbjct: 1101 TTEHLIWGCDESSRAW------YISPLRIHTGNIEAGSFRIWVESLLDTHKDTEWWALFW 1154

Query: 303  MIATMWHIWRYICQVPFRHVQINLNSVLLPMFKYLANITATLASHRHTCHHNQNIGQSHH 124
            MI   W+IW    +  F   ++    V+    + +       A   HT          + 
Sbjct: 1155 MIC--WNIWLGRNKWVFEKKKLAFQEVVERAVRGVMEFEEECA---HTSPVETLNTHENG 1209

Query: 123  WNPPPPDTLKVNIDASYHK 67
            W+ PP   +K+N+DA+  K
Sbjct: 1210 WSVPPVGMVKLNVDAAVFK 1228

