BLASTX nr result

ID: Papaver27_contig00038071 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Papaver27_contig00038071
         (642 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CCA65997.1| hypothetical protein [Beta vulgaris subsp. vulga...   114   3e-23
gb|EEE65289.1| hypothetical protein OsJ_20518 [Oryza sativa Japo...   111   2e-22
ref|NP_001057661.2| Os06g0484800 [Oryza sativa Japonica Group] g...   111   2e-22
gb|EEC76821.1| hypothetical protein OsI_14959 [Oryza sativa Indi...   108   1e-21
gb|EEC77009.1| hypothetical protein OsI_15342 [Oryza sativa Indi...   106   7e-21
ref|XP_007036030.1| Uncharacterized protein TCM_021518 [Theobrom...   103   3e-20
emb|CCA66040.1| hypothetical protein [Beta vulgaris subsp. vulga...   103   3e-20
emb|CCA66009.1| hypothetical protein [Beta vulgaris subsp. vulga...   101   2e-19
emb|CCA66140.1| hypothetical protein [Beta vulgaris subsp. vulga...   100   4e-19
gb|EPS65078.1| hypothetical protein M569_09701, partial [Genlise...   100   5e-19
gb|EEC76169.1| hypothetical protein OsI_13484 [Oryza sativa Indi...   100   6e-19
gb|AAK71569.2|AC087852_29 putative reverse transcriptase [Oryza ...   100   6e-19
gb|EEC66781.1| hypothetical protein OsI_33174 [Oryza sativa Indi...    99   8e-19
ref|XP_007020288.1| Uncharacterized protein TCM_036737 [Theobrom...    98   2e-18
ref|XP_007031312.1| Uncharacterized protein TCM_016762 [Theobrom...    98   2e-18
ref|XP_007046404.1| Uncharacterized protein TCM_011923 [Theobrom...    98   2e-18
ref|XP_007224193.1| hypothetical protein PRUPE_ppa017155mg, part...    98   2e-18
ref|XP_007203701.1| hypothetical protein PRUPE_ppa020995mg, part...    98   2e-18
gb|AAD37019.2| putative non-LTR retrolelement reverse transcript...    98   2e-18
ref|XP_007022832.1| Uncharacterized protein TCM_026877 [Theobrom...    97   3e-18

>emb|CCA65997.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1363

 Score =  114 bits (284), Expect = 3e-23
 Identities = 70/211 (33%), Positives = 103/211 (48%), Gaps = 9/211 (4%)
 Frame = -3

Query: 622 FSGPDYTWSNGRQFQSLIRRRLDRVFASPDWCLIFEKSGVLQLPRISSDHSPILLNTCRN 443
           F GP +TW+NGR   SLI+ RLDR   + +W  +F  + V+ LPR  SDH P+L+    N
Sbjct: 177 FQGPKFTWTNGRTGGSLIKERLDRALVNSEWLDLFPDTKVIHLPRTFSDHCPLLILFNEN 236

Query: 442 TPRNPPSYRS------HPDFLKVVLESWNINLSGDLVDKLNFLGSFLKSWSKNKIGCI-- 287
                  +R       HPDF  V+ E+W  + +  +  +  FL S +KSWSK   G I  
Sbjct: 237 PRSESFPFRCKEVWAYHPDFTNVIEETWGSHHNSYVAARDLFLSS-VKSWSKYVFGSIFQ 295

Query: 286 -KNKIAKMXXXXXXXXXXXLTSNIIQKKKKTSAQLNEYIIMEATYWEQRMKQIWLKTGDQ 110
            K +I               +  + + +     +LNE    E  +W Q+      K GD 
Sbjct: 296 KKKRILARLGGIQKSLSIHPSVFLSKLEIDLLVELNELSKQERVFWAQKAGIDRAKLGDM 355

Query: 109 NTK*FHISVQ*KRRKNQISCLQNNNQQWLTN 17
           NTK FH   + +  K +ISCL+N+N  W++N
Sbjct: 356 NTKYFHTLAKIRTCKRKISCLKNDNHDWVSN 386


>gb|EEE65289.1| hypothetical protein OsJ_20518 [Oryza sativa Japonica Group]
          Length = 826

 Score =  111 bits (277), Expect = 2e-22
 Identities = 71/213 (33%), Positives = 110/213 (51%), Gaps = 6/213 (2%)
 Frame = -3

Query: 622 FSGPDYTWSNGRQFQSLIRRRLDRVFASPDWCLIFEKSGVLQLPRISSDHSP--ILLNTC 449
           ++GP YTW+N R   ++I  RLDR  A+ +WC  F  + V  LP I  DH+P  ILLN  
Sbjct: 3   YNGPAYTWTNKRTGNNVIYERLDRCLANVEWCSKFPYTTVYHLPLIYGDHAPILILLNPT 62

Query: 448 RNTPRNPPSYR----SHPDFLKVVLESWNINLSGDLVDKLNFLGSFLKSWSKNKIGCIKN 281
              P+    +     S  DF  +   SWN  ++G  V K   LG  L +W + K   +++
Sbjct: 63  HRKPKKSFKFENWWLSENDFHDLAKNSWNSIVNGSFVAKAKNLGQNLLTWCRKK-KPLQD 121

Query: 280 KIAKMXXXXXXXXXXXLTSNIIQKKKKTSAQLNEYIIMEATYWEQRMKQIWLKTGDQNTK 101
           +IA                +  +K+K+   + +  +   + Y +QR K+ W+K GD+NT 
Sbjct: 122 QIASTEQEILDIQSSNNRQDQQEKEKELITKHDSLLHKLSDYHKQRAKKHWVKDGDRNTS 181

Query: 100 *FHISVQ*KRRKNQISCLQNNNQQWLTNPGAIA 2
            FH +   +RRKN+IS + +N+ Q +TNP  IA
Sbjct: 182 FFHQAAIKRRRKNRISSIISND-QLITNPDEIA 213


>ref|NP_001057661.2| Os06g0484800 [Oryza sativa Japonica Group]
           gi|125597263|gb|EAZ37043.1| hypothetical protein
           OsJ_21387 [Oryza sativa Japonica Group]
           gi|255677053|dbj|BAF19575.2| Os06g0484800 [Oryza sativa
           Japonica Group]
          Length = 935

 Score =  111 bits (277), Expect = 2e-22
 Identities = 70/213 (32%), Positives = 110/213 (51%), Gaps = 6/213 (2%)
 Frame = -3

Query: 622 FSGPDYTWSNGRQFQSLIRRRLDRVFASPDWCLIFEKSGVLQLPRISSDHSPI--LLNTC 449
           ++GP YTW+N R+   +I  RLDR  A+ +WC  F  + V  +P I  DH+PI  LLN  
Sbjct: 3   YNGPAYTWTNKRKGNEVIFERLDRCLANVEWCHHFPNTNVYHIPLIYGDHAPILVLLNPN 62

Query: 448 RNTPRNPPSYRS----HPDFLKVVLESWNINLSGDLVDKLNFLGSFLKSWSKNKIGCIKN 281
              P+    + +      DF  V   SWN + +G  V K  FL   L +WSK K   I++
Sbjct: 63  FRKPKRSFKFENWWLLEEDFNTVAKNSWN-DCNGSFVSKTKFLSKNLSTWSKKK-KPIQD 120

Query: 280 KIAKMXXXXXXXXXXXLTSNIIQKKKKTSAQLNEYIIMEATYWEQRMKQIWLKTGDQNTK 101
           +I                 N++ K+K+  A+ +  +   + +  QR K+ W+K GD+NT 
Sbjct: 121 QIVSTEEDIKQIQASQDRHNLVDKEKELIAKYDLLLEKLSEFHRQRAKKDWIKDGDRNTS 180

Query: 100 *FHISVQ*KRRKNQISCLQNNNQQWLTNPGAIA 2
            F  +   +RRKN+I+ + +N+  ++TNP  IA
Sbjct: 181 FFQQAAIKRRRKNRIASIVSND-VYITNPDDIA 212


>gb|EEC76821.1| hypothetical protein OsI_14959 [Oryza sativa Indica Group]
          Length = 405

 Score =  108 bits (271), Expect = 1e-21
 Identities = 64/214 (29%), Positives = 116/214 (54%), Gaps = 7/214 (3%)
 Frame = -3

Query: 622 FSGPDYTWSNGRQFQSLIRRRLDRVFASPDWCLIFEKSGVLQLPRISSDHSPI--LLNTC 449
           ++GP YTWSN +    L+  RLDR  A+ +WC++F  + V  LP + SDH+PI  +LN  
Sbjct: 41  YNGPAYTWSNKKNGCDLVLERLDRCLANVEWCMLFPHTTVYHLPMLYSDHAPIIAILNPI 100

Query: 448 RNTPRNPPSYR----SHPDFLKVVLESWNINLSGDLVDKLNFLGSFLKSWSKNKIGCIKN 281
            + P+    +     S  DF K     W+  ++ +   K   LG  L +W+K K G + +
Sbjct: 101 HHHPKKSFKFENWWISEKDFQKEAQAGWSA-INSNFHSKAIHLGKHLSTWNKKK-GSLHS 158

Query: 280 KIAKMXXXXXXXXXXXLTSNIIQKKKKTSAQLNEYIIME-ATYWEQRMKQIWLKTGDQNT 104
           ++ ++             +N++ ++K+   QL++ ++ + + +++QR K+ W++ GD+NT
Sbjct: 159 QLKQVENQILQVQSNANRANLLHEEKRLE-QLHDALMSKLSDFYKQRAKKYWIQQGDRNT 217

Query: 103 K*FHISVQ*KRRKNQISCLQNNNQQWLTNPGAIA 2
             F  +V  +RRKN+I+ +   +  W  NP  IA
Sbjct: 218 SFFQQAVHKRRRKNRIAGIMTRD-GWTINPDNIA 250


>gb|EEC77009.1| hypothetical protein OsI_15342 [Oryza sativa Indica Group]
          Length = 815

 Score =  106 bits (264), Expect = 7e-21
 Identities = 71/220 (32%), Positives = 107/220 (48%), Gaps = 13/220 (5%)
 Frame = -3

Query: 622 FSGPDYTWSNGRQFQSLIRRRLDRVFASPDWCLIFEKSGVLQLPRISSDHSPI--LLNTC 449
           ++GP YTWSN +  + L+ +RLDR  A+ +WC+ F  + V  LP + SDH+PI  +LN  
Sbjct: 41  YNGPAYTWSNKQHGKDLVLQRLDRCLANVEWCMNFPNTTVYHLPMLYSDHAPIIAILNPK 100

Query: 448 RNTPRNPPSYRS----HPDFLKVVLESWNINLSGDLVDKLNFLGSFLKSWSKNK------ 299
              PR    + +      DF +    +W          +   L  FL SWSK K      
Sbjct: 101 SRRPRRSFKFENWWLLESDFNQEAKAAWQKTERYHFQRRTTLLARFLTSWSKKKKPLQQQ 160

Query: 298 IGCIKNKIAKMXXXXXXXXXXXLTSNIIQKKKKTSAQLNEYIIMEATYWEQRMKQIWLKT 119
           +  I+N + K+               + Q+   T  +L       A Y++QR K+ W++ 
Sbjct: 161 LDQIENDLLKIQDSPNRDLYQIEEQRLEQQHDSTMQKL-------ADYYKQRSKKHWVQQ 213

Query: 118 GDQNTK*FHISVQ*KRRKNQISCLQNNNQQWLTN-PGAIA 2
           GD+NT  FH + Q +RRKN+IS +  NN   +TN P  IA
Sbjct: 214 GDRNTSFFHQAAQKRRRKNRISTIIQNNS--ITNDPDEIA 251


>ref|XP_007036030.1| Uncharacterized protein TCM_021518 [Theobroma cacao]
            gi|508715059|gb|EOY06956.1| Uncharacterized protein
            TCM_021518 [Theobroma cacao]
          Length = 1702

 Score =  103 bits (258), Expect = 3e-20
 Identities = 72/224 (32%), Positives = 108/224 (48%), Gaps = 15/224 (6%)
 Frame = -3

Query: 640  DVDMVD--FSGPDYTWSNGRQFQSLIRRRLDRVFASPDWCLIFEKSGVLQLPRISSDHSP 467
            D  ++D  F G  YTW+N   FQ     RLDRV  +P+W   F  + V  L R  SDH P
Sbjct: 409  DCGLIDAGFEGNSYTWTNNHMFQ-----RLDRVVYNPEWVHFFSSTRVQHLNRDGSDHCP 463

Query: 466  ILLNTCRNTPRNPPSYR------SHPDFLKVVLESWNINLSGDLVD----KLNFLGSFLK 317
            +L++    + + P ++R       H DFL  V  SW + L+   +     K   L   LK
Sbjct: 464  LLISCATASQKGPSTFRFLHAWTKHHDFLPFVERSWQVPLNSSGLTAFWTKQQRLKRDLK 523

Query: 316  SWSKNKIGCI--KNKIAKMXXXXXXXXXXXLTSNIIQK-KKKTSAQLNEYIIMEATYWEQ 146
             W+K   G I  K K+A++             S II+    K  A+LN  + +E  YW+Q
Sbjct: 524  WWNKQIFGDIFEKLKLAEIEAEKREMDFQQDLSLIIRNLMHKAYAKLNRQLSIEELYWQQ 583

Query: 145  RMKQIWLKTGDQNTK*FHISVQ*KRRKNQISCLQNNNQQWLTNP 14
            +    WL  G++NTK FH+ ++ KR +N I  +Q++      +P
Sbjct: 584  KSGVKWLVEGERNTKFFHLRMRKKRVRNNIFRIQDSKGNVYEDP 627


>emb|CCA66040.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1362

 Score =  103 bits (258), Expect = 3e-20
 Identities = 59/207 (28%), Positives = 102/207 (49%), Gaps = 8/207 (3%)
 Frame = -3

Query: 622 FSGPDYTWSNGRQFQSLIRRRLDRVFASPDWCLIFEKSGVLQLPRISSDHSPILLNTCRN 443
           + G  +TW  G    +LIR RLDR+ A+ +WC  F    V+ LPR  SDH+P+LL T  N
Sbjct: 175 YVGNRFTWQRGNSPSTLIRERLDRMLANDEWCDNFPSWEVVHLPRYRSDHAPLLLKTGVN 234

Query: 442 TP--------RNPPSYRSHPDFLKVVLESWNINLSGDLVDKLNFLGSFLKSWSKNKIGCI 287
                     +    + S  +  K+V E+WN +   D+ ++L+ +   L +W+    G +
Sbjct: 235 DSFRRGNKLFKFEAMWLSKEECGKIVEEAWNGSAGEDITNRLDEVSRSLSTWATKTFGNL 294

Query: 286 KNKIAKMXXXXXXXXXXXLTSNIIQKKKKTSAQLNEYIIMEATYWEQRMKQIWLKTGDQN 107
           K +  +              ++ +++ +  S  L+E   +E +YW  R +   ++ GD+N
Sbjct: 295 KKRKKEALTLLNGLQQRDPDASTLEQCRIVSGDLDEIHRLEESYWHARARANEIRDGDKN 354

Query: 106 TK*FHISVQ*KRRKNQISCLQNNNQQW 26
           TK FH     ++R+N I+ L + N  W
Sbjct: 355 TKYFHHKASQRKRRNTINELLDENGVW 381


>emb|CCA66009.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1378

 Score =  101 bits (251), Expect = 2e-19
 Identities = 63/214 (29%), Positives = 107/214 (50%), Gaps = 11/214 (5%)
 Frame = -3

Query: 622 FSGPDYTWSNGRQFQSLIRRRLDRVFASPDWCLIFEKSGVLQLPRISSDHSPILLNTCRN 443
           F+GP +TWS G    +    RLDR  A+ +W L F +  V  LP+  SDH PIL++T   
Sbjct: 177 FTGPAHTWSRGLSPTTFKSARLDRGLANSEWKLKFTEGVVRNLPKSQSDHCPILISTSGF 236

Query: 442 TP--------RNPPSYRSHPDFLKVVLESWNINLSGDLVDKLNFLGSFLKSWSKNKIGCI 287
            P        R   ++ +H  F + V ++WN +    +V  L      L  W+K +   I
Sbjct: 237 APVPRIIKPFRFQAAWLNHQVFCEFVRKNWNAD--APIVPFLKSFADKLNKWNKEEFYNI 294

Query: 286 KNKIAKMXXXXXXXXXXXLT---SNIIQKKKKTSAQLNEYIIMEATYWEQRMKQIWLKTG 116
             K +++            T   +++I+ + K   +++  +  E T W Q+ +   +  G
Sbjct: 295 FRKKSELWARISGVQALLSTGRQNHLIKLEAKLRREMDIVLDDEETLWFQKSRMEAICDG 354

Query: 115 DQNTK*FHISVQ*KRRKNQISCLQNNNQQWLTNP 14
           D+NT+ FH+S   +R +N+I  LQNN+ +W++NP
Sbjct: 355 DRNTRYFHLSTVIRRSRNRIDMLQNNDGEWISNP 388


>emb|CCA66140.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1381

 Score =  100 bits (249), Expect = 4e-19
 Identities = 63/201 (31%), Positives = 99/201 (49%), Gaps = 9/201 (4%)
 Frame = -3

Query: 619 SGPDYTWSNGRQFQSLIRRRLDRVFASPDWCLIFEKSGVLQLPRISSDHSPILL-NTCRN 443
           S   +TW  G       +  LDR+F +P+W ++F    +  L R  SDH P+L+ N  +N
Sbjct: 179 SSGGFTWFRGNS-----KSLLDRLFINPEWLILFPGLKLSLLMRGLSDHCPLLVHNEDKN 233

Query: 442 TPRNPPSYR----SHPDFLKVVLESWNINLSGDLVDKLNFLGSFLKSWSKNKIGCIKNKI 275
               P  ++    S P+ LK+V E W  +     V KL  +   LK W++ + G I N+I
Sbjct: 234 WGPKPFRFQNCWLSDPNCLKIVKEVWQASSGVSAVGKLKAVRKRLKVWNQEEYGNIDNRI 293

Query: 274 AKMXXXXXXXXXXXL----TSNIIQKKKKTSAQLNEYIIMEATYWEQRMKQIWLKTGDQN 107
           +KM                T + +++K+K   +L +++     YW Q  +  WLK GD+N
Sbjct: 294 SKMENLIQQYDEISNQRILTEDELEEKQKAQVELWKWMKRREVYWAQNARISWLKEGDRN 353

Query: 106 TK*FHISVQ*KRRKNQISCLQ 44
           T+ FH     KRRKN I C++
Sbjct: 354 TRFFHTIASNKRRKNSIICIE 374


>gb|EPS65078.1| hypothetical protein M569_09701, partial [Genlisea aurea]
          Length = 314

 Score =  100 bits (248), Expect = 5e-19
 Identities = 62/213 (29%), Positives = 98/213 (46%), Gaps = 12/213 (5%)
 Frame = -3

Query: 622 FSGPDYTWSNGRQFQSLIRRRLDRVFASPDWCLIFEKSGVLQLPRISSDHSPILLN---- 455
           F G  YTWSN RQ  + +R RLDR  +S  W ++F  + V  L    SDHSPIL+     
Sbjct: 10  FIGFPYTWSNKRQQPATVRARLDRALSSDSWNILFPNATVRHLSFGGSDHSPILIQKDFQ 69

Query: 454 -TCRNTPRN-----PPSYRSHPDFLKVVLESWNI--NLSGDLVDKLNFLGSFLKSWSKNK 299
            +  + PR+        +   P   + + E W +       ++ +L      L  W K  
Sbjct: 70  GSLGHVPRSRRFRFEAYWAEIPGCEESIREGWTMVHRRQSPMIGRLGRTRISLLKWQKRT 129

Query: 298 IGCIKNKIAKMXXXXXXXXXXXLTSNIIQKKKKTSAQLNEYIIMEATYWEQRMKQIWLKT 119
           +G +K  I ++           ++ +   ++K    +LN  + +E TYW QR K  W   
Sbjct: 130 LGSVKAAINRIEDEIALLAQEVISESNWNREKDLKVELNSLLKLEETYWRQRSKSHWFIN 189

Query: 118 GDQNTK*FHISVQ*KRRKNQISCLQNNNQQWLT 20
           GD+NT  FH     +R  N+I  L+N+ +Q L+
Sbjct: 190 GDRNTAFFHAHASRRRSMNRIGSLRNDEEQLLS 222


>gb|EEC76169.1| hypothetical protein OsI_13484 [Oryza sativa Indica Group]
          Length = 1874

 Score = 99.8 bits (247), Expect = 6e-19
 Identities = 66/215 (30%), Positives = 105/215 (48%), Gaps = 13/215 (6%)
 Frame = -3

Query: 640 DVDMVDFSGPDYTWSNGRQFQSLIRRRLDRVFASPDWCLIFEKSGVLQLPRISSDHSPIL 461
           D+  + F G  +T+ N ++    ++ RLDR  ASP W   F ++ +  L   SSDH+P+L
Sbjct: 124 DLHDIGFQGAPWTFCNMQREGRNVKVRLDRGVASPAWSSRFPQAVITHLTTPSSDHAPLL 183

Query: 460 LNTCRNTPRNPPSYRSHPDFLK-------VVLESWNINLS----GDLVDKLNFLGSFLKS 314
           L     T   P     + +  +       V+ E+W +       GD+ DK+    + L S
Sbjct: 184 LEREETTLARPMKIMRYEEVWERESSLPEVIQEAWTMGADASTLGDINDKMKVTMTKLVS 243

Query: 313 WSKNKIGCIKNKIAKMXXXXXXXXXXXL--TSNIIQKKKKTSAQLNEYIIMEATYWEQRM 140
           WSK+KIG ++ KI  +           L  T N +   KK   +L E +  E  +W+QR 
Sbjct: 244 WSKDKIGNVRKKIKDLREKLGELRNIGLLDTDNEVHSVKK---ELEEMLHREEIWWKQRS 300

Query: 139 KQIWLKTGDQNTK*FHISVQ*KRRKNQISCLQNNN 35
           +  WLK GD NT+ FH+    + +KN+I  L+ N+
Sbjct: 301 RITWLKEGDLNTRYFHLKASWRAKKNKIKKLKKND 335


>gb|AAK71569.2|AC087852_29 putative reverse transcriptase [Oryza sativa Japonica Group]
          Length = 1833

 Score = 99.8 bits (247), Expect = 6e-19
 Identities = 66/215 (30%), Positives = 105/215 (48%), Gaps = 13/215 (6%)
 Frame = -3

Query: 640 DVDMVDFSGPDYTWSNGRQFQSLIRRRLDRVFASPDWCLIFEKSGVLQLPRISSDHSPIL 461
           D+  + F G  +T+ N ++    ++ RLDR  ASP W   F ++ +  L   SSDH+P+L
Sbjct: 149 DLHDIGFQGAPWTFCNMQREGRNVKVRLDRGVASPAWSSRFPQAVITHLTTPSSDHAPLL 208

Query: 460 LNTCRNTPRNPPSYRSHPDFLK-------VVLESWNINLS----GDLVDKLNFLGSFLKS 314
           L     T   P     + +  +       V+ E+W +       GD+ DK+    + L S
Sbjct: 209 LEREETTLARPMKIMRYEEVWERESSLPEVIQEAWTMGADASTLGDINDKMKVTMTKLVS 268

Query: 313 WSKNKIGCIKNKIAKMXXXXXXXXXXXL--TSNIIQKKKKTSAQLNEYIIMEATYWEQRM 140
           WSK+KIG ++ KI  +           L  T N +   KK   +L E +  E  +W+QR 
Sbjct: 269 WSKDKIGNVRKKIKDLREKLGELRNIGLLDTDNEVHSVKK---ELEEMLHREEIWWKQRS 325

Query: 139 KQIWLKTGDQNTK*FHISVQ*KRRKNQISCLQNNN 35
           +  WLK GD NT+ FH+    + +KN+I  L+ N+
Sbjct: 326 RITWLKEGDLNTRYFHLKASWRAKKNKIKKLKKND 360


>gb|EEC66781.1| hypothetical protein OsI_33174 [Oryza sativa Indica Group]
          Length = 1144

 Score = 99.4 bits (246), Expect = 8e-19
 Identities = 66/219 (30%), Positives = 107/219 (48%), Gaps = 12/219 (5%)
 Frame = -3

Query: 622  FSGPDYTWSNGRQFQSLIRRRLDRVFASPDWCLIFEKSGVLQLPRISSDHSPI--LLNTC 449
            ++GP YTWSN +Q + L+  RLDR  A+ +WC  +  + V  LP + SDH+PI  +LN  
Sbjct: 716  YNGPAYTWSNKQQGKDLVLERLDRCLANVEWCFNYPNTTVYHLPMLYSDHAPIIAILNPK 775

Query: 448  RNTPRNPPSYRS----HPDFLKVVLESWNINLSGDLVDKLNFLGSFLKSWSKNK------ 299
               P+    + +     PDF +    +W  +++     +   L   L SWSK K      
Sbjct: 776  NRRPKRSFMFENWWLLEPDFNQHAHSAWLQSVNCHFQRRTTLLERSLTSWSKKKKPLQQQ 835

Query: 298  IGCIKNKIAKMXXXXXXXXXXXLTSNIIQKKKKTSAQLNEYIIMEATYWEQRMKQIWLKT 119
            +  ++  + K+               I+Q+   T  +L       A Y +QR K+ W++ 
Sbjct: 836  LDQLEEDLLKIQSSPDREHLYFEEKRIVQQHDITMQKL-------ADYHKQRSKKHWVQK 888

Query: 118  GDQNTK*FHISVQ*KRRKNQISCLQNNNQQWLTNPGAIA 2
            GD+NT  F  + Q +RRKN+IS + +NN   + +P  IA
Sbjct: 889  GDRNTSFFQKATQKRRRKNRISSVIHNN-SIINDPDEIA 926


>ref|XP_007020288.1| Uncharacterized protein TCM_036737 [Theobroma cacao]
            gi|508725616|gb|EOY17513.1| Uncharacterized protein
            TCM_036737 [Theobroma cacao]
          Length = 2215

 Score = 98.2 bits (243), Expect = 2e-18
 Identities = 64/216 (29%), Positives = 103/216 (47%), Gaps = 13/216 (6%)
 Frame = -3

Query: 622  FSGPDYTWSNGRQFQSLIRRRLDRVFASPDWCLIFEKSGVLQLPRISSDHSPILLNTCRN 443
            F G  +TW+N R FQ     RLDR+  +  W   F  + +  L R  SDH P+LL+   +
Sbjct: 1021 FEGNPFTWTNNRMFQ-----RLDRMVYNQQWINKFPITRIQHLNRDGSDHCPLLLSCSNS 1075

Query: 442  TPRNPPSYRS------HPDFLKVVLESWNINLSGDLV----DKLNFLGSFLKSWSKNKIG 293
            + + P S+R       H +F   V  +WN+ ++G  +     K   L   LK W+K   G
Sbjct: 1076 SEKAPSSFRFLHAWALHHNFNASVEGNWNLPINGSGLMAFWSKQKRLKQHLKWWNKTVFG 1135

Query: 292  CIKNKIAKMXXXXXXXXXXXLTSNIIQKK---KKTSAQLNEYIIMEATYWEQRMKQIWLK 122
             I + I +                 I  +    K+ AQLN+ + ME  +W+Q+    W+ 
Sbjct: 1136 DIFSNIKEAEKRVEECEILHQQEQTIGSRIQLNKSYAQLNKQLSMEEIFWKQKSGVKWVV 1195

Query: 121  TGDQNTK*FHISVQ*KRRKNQISCLQNNNQQWLTNP 14
             G++NTK FH+ +Q KR ++ I  +Q  +  W+ +P
Sbjct: 1196 EGERNTKFFHMRMQKKRIRSHIFKIQEQDGNWIEDP 1231


>ref|XP_007031312.1| Uncharacterized protein TCM_016762 [Theobroma cacao]
            gi|508710341|gb|EOY02238.1| Uncharacterized protein
            TCM_016762 [Theobroma cacao]
          Length = 2214

 Score = 98.2 bits (243), Expect = 2e-18
 Identities = 67/220 (30%), Positives = 102/220 (46%), Gaps = 14/220 (6%)
 Frame = -3

Query: 622  FSGPDYTWSNGRQFQSLIRRRLDRVFASPDWCLIFEKSGVLQLPRISSDHSPILLNTCRN 443
            F G  +TW+N R FQ     RLDRV  + +W   F  + V  L R  SDH P+L++    
Sbjct: 1022 FEGNSFTWTNNRMFQ-----RLDRVVYNQEWAEFFSSTRVQHLNRDGSDHCPLLISCSNT 1076

Query: 442  TPRNPPSYR------SHPDFLKVVLESWNINLSGDLVD----KLNFLGSFLKSWSKNKIG 293
              R P ++R       H DF+  V +SWN  +  + ++    K   L   LK W+K+  G
Sbjct: 1077 NQRGPATFRFLHAWTKHHDFISFVEKSWNTPIHAEGLNAFWTKQQRLKRDLKWWNKHIFG 1136

Query: 292  CIKNKIAKMXXXXXXXXXXXLTSNIIQKKK----KTSAQLNEYIIMEATYWEQRMKQIWL 125
             I  KI ++              N     +    K  A+LN  + +E  +W+Q+    WL
Sbjct: 1137 DIF-KILRLAEVEAEQRELNFQQNPSAANRELMHKAYAKLNRQLSIEELFWQQKSGVKWL 1195

Query: 124  KTGDQNTK*FHISVQ*KRRKNQISCLQNNNQQWLTNPGAI 5
              G++NTK FH+ ++ KR +N I  +Q+     L  P  I
Sbjct: 1196 VEGERNTKFFHMRMRKKRMRNHIFRIQDQEGNVLEEPHLI 1235


>ref|XP_007046404.1| Uncharacterized protein TCM_011923 [Theobroma cacao]
            gi|508710339|gb|EOY02236.1| Uncharacterized protein
            TCM_011923 [Theobroma cacao]
          Length = 1954

 Score = 98.2 bits (243), Expect = 2e-18
 Identities = 68/224 (30%), Positives = 107/224 (47%), Gaps = 15/224 (6%)
 Frame = -3

Query: 640  DVDMVD--FSGPDYTWSNGRQFQSLIRRRLDRVFASPDWCLIFEKSGVLQLPRISSDHSP 467
            D  ++D  F G  +TW+N   FQ     RLDRV  +P+W   F  + V  L R  SDH P
Sbjct: 753  DCGLIDAGFEGNSFTWTNNHMFQ-----RLDRVVYNPEWAHCFSSTRVQHLNRDGSDHCP 807

Query: 466  ILLNTCRNTPRNPPSYR------SHPDFLKVVLESWNINLSGDLVD----KLNFLGSFLK 317
            +L++    + + P ++R       H DFL  V  SW + L+   +     K   L   LK
Sbjct: 808  LLISCATASQKGPSTFRFLHAWTKHHDFLPFVERSWQVPLNSSGLTAFWIKQQRLKRDLK 867

Query: 316  SWSKNKIGCIKNKI--AKMXXXXXXXXXXXLTSNIIQK-KKKTSAQLNEYIIMEATYWEQ 146
             W+K   G I  K+  A++             S+I +    K  A+LN  + +E  +W+Q
Sbjct: 868  WWNKQIFGDIFEKLKRAEIEAEKREKEFQQDPSSINRNLMNKAYAKLNRQLSIEELFWQQ 927

Query: 145  RMKQIWLKTGDQNTK*FHISVQ*KRRKNQISCLQNNNQQWLTNP 14
            +    WL  G++NTK FH+ ++ KR +N I  +Q++      +P
Sbjct: 928  KSGVKWLVEGERNTKFFHLRMRKKRVRNNIFRIQDSEGNIYEDP 971


>ref|XP_007224193.1| hypothetical protein PRUPE_ppa017155mg, partial [Prunus persica]
           gi|462421129|gb|EMJ25392.1| hypothetical protein
           PRUPE_ppa017155mg, partial [Prunus persica]
          Length = 916

 Score = 98.2 bits (243), Expect = 2e-18
 Identities = 63/214 (29%), Positives = 94/214 (43%), Gaps = 13/214 (6%)
 Frame = -3

Query: 622 FSGPDYTWSNGRQFQSLIRRRLDRVFASPDWCLIFEKSGVLQLPRISSDHSPI------- 464
           ++GP YTW      +  IR RLDRV A+ DWC  F  + V+ L    SDH P+       
Sbjct: 52  YTGPKYTWWRNNPME--IRIRLDRVLATADWCSRFLGTKVIHLNPTKSDHLPLKVTISER 109

Query: 463 -LLNTCRNTP-RNPPSYRSHPDFLKVVLESWNINLSGDL----VDKLNFLGSFLKSWSKN 302
            LLN  R    R    +  H + ++ + + W     G       +KL      L  WSK 
Sbjct: 110 MLLNGRRKKLFRFEEMWAEHVNCMQTIQDGWQRTCRGSAPFTTTEKLKCTRHQLLGWSKC 169

Query: 301 KIGCIKNKIAKMXXXXXXXXXXXLTSNIIQKKKKTSAQLNEYIIMEATYWEQRMKQIWLK 122
             G + N+I               + + ++ +   + QL+  +     YW QR +  WLK
Sbjct: 170 NFGHLPNQIKITREKLGELLDAPPSHHTVELRNALTKQLDSLMAKNEVYWRQRSRATWLK 229

Query: 121 TGDQNTK*FHISVQ*KRRKNQISCLQNNNQQWLT 20
            GD+N+K FH      RR+N IS L++ +  W T
Sbjct: 230 AGDRNSKFFHYKASSCRRRNTISALEDEHGHWQT 263


>ref|XP_007203701.1| hypothetical protein PRUPE_ppa020995mg, partial [Prunus persica]
            gi|462399232|gb|EMJ04900.1| hypothetical protein
            PRUPE_ppa020995mg, partial [Prunus persica]
          Length = 1367

 Score = 98.2 bits (243), Expect = 2e-18
 Identities = 58/205 (28%), Positives = 91/205 (44%), Gaps = 4/205 (1%)
 Frame = -3

Query: 622  FSGPDYTWSNGRQFQSLIRRRLDRVFASPDWCLIFEKSGVLQLPRISSDHSPILLNTCRN 443
            ++GP YTW      +  IR RLDR  A+ DWC  F  + V+ L    SDH P+     + 
Sbjct: 457  YTGPKYTWWRNNPME--IRIRLDRALATADWCSRFLGTKVIHLNPTKSDHLPL-----KK 509

Query: 442  TPRNPPSYRSHPDFLKVVLESWNINLSGDL----VDKLNFLGSFLKSWSKNKIGCIKNKI 275
              R    +  H + ++ + + W     G       +KL      L  WSK   G + N+I
Sbjct: 510  LFRFEEMWAEHVNCMQTIQDGWQRTSRGSAPFTTTEKLKCTCHQLLGWSKCNFGHLPNQI 569

Query: 274  AKMXXXXXXXXXXXLTSNIIQKKKKTSAQLNEYIIMEATYWEQRMKQIWLKTGDQNTK*F 95
                           + + ++ +   + QL+  +     YW QR +  WLK GD+N+K F
Sbjct: 570  KITQEKLGELLDAPPSHHTVELRNVLTKQLDSLMAKNEVYWRQRSRATWLKAGDRNSKFF 629

Query: 94   HISVQ*KRRKNQISCLQNNNQQWLT 20
            H     +RR+N IS L++ +  W T
Sbjct: 630  HYKASSRRRRNTISALEDEHGHWQT 654


>gb|AAD37019.2| putative non-LTR retrolelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 855

 Score = 98.2 bits (243), Expect = 2e-18
 Identities = 58/212 (27%), Positives = 101/212 (47%), Gaps = 11/212 (5%)
 Frame = -3

Query: 622  FSGPDYTWSNGRQFQSLIRRRLDRVFASPDWCLIFEKSGVLQLPRISSDHSPILL----- 458
            F G  +TW  GR   + + +RLDRV   P   L ++++ V  LP  +SDH+PI +     
Sbjct: 588  FKGNKFTWKRGRVESTFVAKRLDRVLCRPQTRLKWQEASVTHLPFFASDHAPIYIQLEPE 647

Query: 457  ---NTCRNTPRNPPSYRSHPDFLKVVLESWNINLSGDLVDKLNFLGSFLKSWSKNKIGCI 287
               N  R   R   ++ +H  F  ++  SWN    G+    L  L S LK W++   G +
Sbjct: 648  VRSNPLRRPFRFEAAWLTHSGFKDLLQASWNTE--GETPVALAALKSKLKKWNREVFGDV 705

Query: 286  ---KNKIAKMXXXXXXXXXXXLTSNIIQKKKKTSAQLNEYIIMEATYWEQRMKQIWLKTG 116
               K  +               T N++ K+++   + +  +  E   W Q+ ++ W++ G
Sbjct: 706  NRRKESLMNEIKVVQELLEINQTDNLLSKEEELIKEFDVVLEQEEVLWFQKSREKWVELG 765

Query: 115  DQNTK*FHISVQ*KRRKNQISCLQNNNQQWLT 20
            D+NTK FH     +RR+N+I  L+ ++  W++
Sbjct: 766  DRNTKYFHTMTVVRRRRNRIEMLKADDGSWVS 797


>ref|XP_007022832.1| Uncharacterized protein TCM_026877 [Theobroma cacao]
            gi|508778198|gb|EOY25454.1| Uncharacterized protein
            TCM_026877 [Theobroma cacao]
          Length = 2367

 Score = 97.4 bits (241), Expect = 3e-18
 Identities = 65/213 (30%), Positives = 104/213 (48%), Gaps = 13/213 (6%)
 Frame = -3

Query: 622  FSGPDYTWSNGRQFQSLIRRRLDRVFASPDWCLIFEKSGVLQLPRISSDHSPILLNTCRN 443
            F G  +TW+N R FQ     RLDR+  +  W   F  + +  L R  SDH P+L++   +
Sbjct: 1228 FEGNPFTWTNNRMFQ-----RLDRIVYNHHWINKFPITRIQHLNRDGSDHCPLLISCFNS 1282

Query: 442  TPRNPPSYRS------HPDFLKVVLESWNINLSGDLVD----KLNFLGSFLKSWSKNKIG 293
            + + P S+R       H DF   V  +WN+ ++G  +     K + L   LK W+K   G
Sbjct: 1283 SEKAPSSFRFQHAWVLHHDFKTSVESNWNLPINGSGLQAFWSKQHRLKQHLKWWNKVMFG 1342

Query: 292  CIKNKIA---KMXXXXXXXXXXXLTSNIIQKKKKTSAQLNEYIIMEATYWEQRMKQIWLK 122
             I +K+    K             T   I K  K+ AQLN+ + +E  +W+Q+    W+ 
Sbjct: 1343 DIFSKLKEAEKRVEECEILHQNEQTVESIIKLNKSYAQLNKQLNIEEIFWKQKSGVKWVV 1402

Query: 121  TGDQNTK*FHISVQ*KRRKNQISCLQNNNQQWL 23
             G++NTK FH  +Q KR ++ I  +Q  + +W+
Sbjct: 1403 EGERNTKFFHTRMQKKRIRSHIFKVQEPDGRWI 1435

