BLASTX nr result

ID: Forsythia22_contig00056955

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Forsythia22_contig00056955
         (564 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007207981.1| hypothetical protein PRUPE_ppa015715mg, part...   192   8e-47
ref|XP_007206823.1| hypothetical protein PRUPE_ppa025991mg [Prun...   188   1e-45
emb|CAN71532.1| hypothetical protein VITISV_018180 [Vitis vinifera]   187   3e-45
emb|CAN79339.1| hypothetical protein VITISV_044312 [Vitis vinifera]   184   2e-44
emb|CAN79389.1| hypothetical protein VITISV_004909 [Vitis vinifera]   181   2e-43
ref|XP_007019612.1| Uncharacterized protein TCM_035725 [Theobrom...   179   7e-43
gb|AAK94516.1| gag-pol polyprotein [Hordeum vulgare]                  177   2e-42
ref|XP_007048683.1| DNA/RNA polymerases superfamily protein [The...   177   3e-42
ref|XP_007010495.1| Uncharacterized protein TCM_044370 [Theobrom...   176   5e-42
gb|AAK94517.1| gag-pol polyprotein [Hordeum vulgare]                  176   5e-42
ref|XP_007019474.1| Uncharacterized protein TCM_035549 [Theobrom...   176   8e-42
ref|XP_008779530.1| PREDICTED: uncharacterized protein LOC103699...   174   2e-41
ref|XP_007051412.1| DNA/RNA polymerases superfamily protein [The...   173   4e-41
gb|ABI96971.1| putative gag-pol polyprotein [Triticum monococcum...   173   5e-41
ref|XP_007023480.1| Uncharacterized protein TCM_027505 [Theobrom...   173   5e-41
gb|AAK51582.1|AC022352_18 Putative retroelement [Oryza sativa Ja...   172   7e-41
dbj|BAA89466.1| gag-pol polyprotein, partial [Oryza sativa Indic...   172   9e-41
gb|AAM94350.1| gag-pol polyprotein [Zea mays]                         171   2e-40
ref|XP_007037024.1| Uncharacterized protein TCM_013224 [Theobrom...   171   2e-40
gb|AAK91332.1|AC090441_14 Putative gag-pol polyprotein [Oryza sa...   170   3e-40

>ref|XP_007207981.1| hypothetical protein PRUPE_ppa015715mg, partial [Prunus persica]
            gi|462403623|gb|EMJ09180.1| hypothetical protein
            PRUPE_ppa015715mg, partial [Prunus persica]
          Length = 1445

 Score =  192 bits (488), Expect = 8e-47
 Identities = 98/190 (51%), Positives = 124/190 (65%), Gaps = 5/190 (2%)
 Frame = -2

Query: 557  TSLDFSSAYHPQTDGQTEVVNRSLGNLLRCLVGEHIKKWAQVLSQAEFAHNQALNRSTGY 378
            T+L FSSA+HPQTDGQTEVVNRSLG+LLRCLVG+    W  +L  AEFA+N ++NRSTG 
Sbjct: 1143 TTLKFSSAFHPQTDGQTEVVNRSLGDLLRCLVGDKPGNWDLLLPVAEFAYNNSVNRSTGK 1202

Query: 377  CPFEVVYGLLPRGPLDLVSLPASSPVHSSAAELLEHIQQVHQSTSAHLKTANDKYKLAAD 198
             PFEVV+G  PR P+DLV+LP ++    SA    EHI+Q+H      +    D YKLAA+
Sbjct: 1203 SPFEVVHGFSPRSPVDLVALPVAARTSDSATSFAEHIRQLHDDVRRQISMHTDTYKLAAN 1262

Query: 197  GSRRHLEFEVGDLVWAVLTKDRFPAHEYNKLKA-----VRILKKINSNAYRLELPPGMKT 33
              RR  EF  GD V   +  +RFP H + KL A      RI+KK+ SNAY +ELP  M  
Sbjct: 1263 AHRRQQEFREGDFVMVRVCPERFPKHSFKKLHARSMGPYRIIKKLGSNAYLIELPADMHI 1322

Query: 32   ADVFNVKHLA 3
            + +FNV  L+
Sbjct: 1323 SPIFNVSDLS 1332


>ref|XP_007206823.1| hypothetical protein PRUPE_ppa025991mg [Prunus persica]
            gi|462402465|gb|EMJ08022.1| hypothetical protein
            PRUPE_ppa025991mg [Prunus persica]
          Length = 1274

 Score =  188 bits (478), Expect = 1e-45
 Identities = 96/190 (50%), Positives = 122/190 (64%), Gaps = 5/190 (2%)
 Frame = -2

Query: 557  TSLDFSSAYHPQTDGQTEVVNRSLGNLLRCLVGEHIKKWAQVLSQAEFAHNQALNRSTGY 378
            T+L FSSA+HPQTDGQTEVVNRSLG+LL CLVG+    W  +L  AEF +N ++NRSTG 
Sbjct: 972  TTLKFSSAFHPQTDGQTEVVNRSLGDLLHCLVGDKPGNWDLLLPVAEFTYNNSVNRSTGK 1031

Query: 377  CPFEVVYGLLPRGPLDLVSLPASSPVHSSAAELLEHIQQVHQSTSAHLKTANDKYKLAAD 198
             PFEVV+G  PR P+DLV+LP ++    SA    EHI+Q+H      +    D YKLAA+
Sbjct: 1032 SPFEVVHGFSPRSPVDLVALPVAARSSDSATSFAEHIRQLHDDVRRQISMHTDTYKLAAN 1091

Query: 197  GSRRHLEFEVGDLVWAVLTKDRFPAHEYNKLKA-----VRILKKINSNAYRLELPPGMKT 33
              RR  EF  GD V   +  +RFP H + KL A      RI+KK+ SNAY +ELP  M  
Sbjct: 1092 AHRRQQEFREGDFVMVRVCPERFPKHSFKKLHARSMGPYRIIKKLGSNAYLIELPANMHI 1151

Query: 32   ADVFNVKHLA 3
            + +FNV  L+
Sbjct: 1152 SPIFNVSDLS 1161


>emb|CAN71532.1| hypothetical protein VITISV_018180 [Vitis vinifera]
          Length = 1323

 Score =  187 bits (474), Expect = 3e-45
 Identities = 92/189 (48%), Positives = 121/189 (64%), Gaps = 5/189 (2%)
 Frame = -2

Query: 557  TSLDFSSAYHPQTDGQTEVVNRSLGNLLRCLVGEHIKKWAQVLSQAEFAHNQALNRSTGY 378
            T L FSS++HPQTDGQTEVVNRSLGNLLRC+V + ++ W  VL QAEFA N + NR+TGY
Sbjct: 1024 TQLKFSSSFHPQTDGQTEVVNRSLGNLLRCIVRDQLRNWDNVLPQAEFAFNSSTNRTTGY 1083

Query: 377  CPFEVVYGLLPRGPLDLVSLPASSPVHSSAAELLEHIQQVHQSTSAHLKTANDKYKLAAD 198
             PFEV YGL P+ P+DL+ LP S            HI+ +H+     +K +N+ YK A D
Sbjct: 1084 LPFEVAYGLKPKQPVDLIPLPTSVRTSQDGDAFARHIRDIHEKVREKIKISNENYKEAXD 1143

Query: 197  GSRRHLEFEVGDLVWAVLTKDRFPAHEYNKLKA-----VRILKKINSNAYRLELPPGMKT 33
              RR+++F+ G LV   L  +RF    Y KL+A      R+LK++  NAY LELP  +  
Sbjct: 1144 AHRRYIQFQEGGLVMVRLRPERFHPSTYQKLQAKKAGPFRVLKRLGENAYLLELPSNLXF 1203

Query: 32   ADVFNVKHL 6
            + +FNVK L
Sbjct: 1204 SPIFNVKDL 1212


>emb|CAN79339.1| hypothetical protein VITISV_044312 [Vitis vinifera]
          Length = 354

 Score =  184 bits (467), Expect = 2e-44
 Identities = 91/189 (48%), Positives = 122/189 (64%), Gaps = 5/189 (2%)
 Frame = -2

Query: 557 TSLDFSSAYHPQTDGQTEVVNRSLGNLLRCLVGEHIKKWAQVLSQAEFAHNQALNRSTGY 378
           T L FSS++HPQTDGQ EVVNRSLGNLLRC+V + ++ W  VL QAEFA N + NR+TGY
Sbjct: 56  TQLKFSSSFHPQTDGQIEVVNRSLGNLLRCIVRDQLRNWDNVLPQAEFAFNSSTNRTTGY 115

Query: 377 CPFEVVYGLLPRGPLDLVSLPASSPVHSSAAELLEHIQQVHQSTSAHLKTANDKYKLAAD 198
            PFEV YGL P+  +DL+ LP S            HIQ +H++    +K +N+ YK AAD
Sbjct: 116 SPFEVAYGLKPKQLVDLIPLPTSVHTSQDGDAFTRHIQDIHENVREKIKISNENYKEAAD 175

Query: 197 GSRRHLEFEVGDLVWAVLTKDRFPAHEYNKLKA-----VRILKKINSNAYRLELPPGMKT 33
             RR+++F+ GDLV   L  +RF    Y KL+A      ++LK++  NAY LELP  +  
Sbjct: 176 AHRRYIQFQEGDLVMVRLRPERFHPSTYQKLQAKKAGPFQVLKRLGENAYLLELPSNLHF 235

Query: 32  ADVFNVKHL 6
           + +FNV+ L
Sbjct: 236 SPIFNVEDL 244


>emb|CAN79389.1| hypothetical protein VITISV_004909 [Vitis vinifera]
          Length = 895

 Score =  181 bits (458), Expect = 2e-43
 Identities = 90/187 (48%), Positives = 120/187 (64%), Gaps = 5/187 (2%)
 Frame = -2

Query: 551  LDFSSAYHPQTDGQTEVVNRSLGNLLRCLVGEHIKKWAQVLSQAEFAHNQALNRSTGYCP 372
            L FSS++HPQTDGQTEVVNRSLGNLLRC+V + ++ W  VL QAEFA N + NR+ G+ P
Sbjct: 599  LKFSSSFHPQTDGQTEVVNRSLGNLLRCIVRDQLRNWDNVLPQAEFAFNSSTNRTIGHSP 658

Query: 371  FEVVYGLLPRGPLDLVSLPASSPVHSSAAELLEHIQQVHQSTSAHLKTANDKYKLAADGS 192
            FEV YGL P+ P+DL+ L  S            HI+ +H+     +K +N+ YK AAD  
Sbjct: 659  FEVAYGLKPKQPIDLIPLSTSVRTSQDGDAFARHIRDIHEKVREKIKISNENYKEAADAH 718

Query: 191  RRHLEFEVGDLVWAVLTKDRFPAHEYNKLKA-----VRILKKINSNAYRLELPPGMKTAD 27
            RR+++F+ GDLV A L  +RF    Y KL+A      R+LK +  NAY LELP  +  + 
Sbjct: 719  RRYIQFQEGDLVMARLRPERFHPSTYQKLQAKKAGPFRVLKWLGENAYLLELPSNLHFSP 778

Query: 26   VFNVKHL 6
            +FNV+ L
Sbjct: 779  IFNVEDL 785


>ref|XP_007019612.1| Uncharacterized protein TCM_035725 [Theobroma cacao]
           gi|508724940|gb|EOY16837.1| Uncharacterized protein
           TCM_035725 [Theobroma cacao]
          Length = 499

 Score =  179 bits (454), Expect = 7e-43
 Identities = 92/189 (48%), Positives = 124/189 (65%), Gaps = 5/189 (2%)
 Frame = -2

Query: 557 TSLDFSSAYHPQTDGQTEVVNRSLGNLLRCLVGEHIKKWAQVLSQAEFAHNQALNRSTGY 378
           T L +SS  HPQTDGQTEVVNRSLGN+LRCL+  + K W  V+ QAEFA+N ++NRS   
Sbjct: 221 TELKYSSTCHPQTDGQTEVVNRSLGNMLRCLIQNNPKTWDLVIPQAEFAYNNSVNRSIKK 280

Query: 377 CPFEVVYGLLPRGPLDLVSLPASSPVHSSAAELLEHIQQVHQSTSAHLKTANDKYKLAAD 198
            PFEV YGL P+  LDLV LP  + V +      +HI+++H+   A LK +N +Y   A+
Sbjct: 281 TPFEVAYGLKPQHVLDLVPLPQEARVSNEGELFADHIRKIHEEVKAALKASNAEYSFTAN 340

Query: 197 GSRRHLEFEVGDLVWAVLTKDRFPAHEYNKLKA-----VRILKKINSNAYRLELPPGMKT 33
             RR  EFE GD V   L ++RFP   Y+KLK+      ++LKKI+SNAY +ELPP ++ 
Sbjct: 341 QHRRKQEFEEGDQVLVHLRQERFPKGTYHKLKSRKFGPCKVLKKISSNAYLIELPPELQI 400

Query: 32  ADVFNVKHL 6
           + +FN+  L
Sbjct: 401 SHIFNILDL 409


>gb|AAK94516.1| gag-pol polyprotein [Hordeum vulgare]
          Length = 1720

 Score =  177 bits (450), Expect = 2e-42
 Identities = 90/189 (47%), Positives = 122/189 (64%), Gaps = 5/189 (2%)
 Frame = -2

Query: 557  TSLDFSSAYHPQTDGQTEVVNRSLGNLLRCLVGEHIKKWAQVLSQAEFAHNQALNRSTGY 378
            T L FS+  HPQTDGQTEVVNRSL  +LR ++  +IK W + L   EFA+N++L+ +T  
Sbjct: 1409 TKLLFSTTCHPQTDGQTEVVNRSLSTMLRAVLKNNIKLWEECLPHIEFAYNRSLHSTTKM 1468

Query: 377  CPFEVVYGLLPRGPLDLVSLPASSPVHSSAAELLEHIQQVHQSTSAHLKTANDKYKLAAD 198
            CPFE+VYG LPR P+DL+ +P+S  V+  A E  E I ++H+ T  +++  N +YKLA D
Sbjct: 1469 CPFEIVYGFLPRAPIDLLPIPSSEKVNFDAKERAELILKMHELTKENIERMNARYKLAGD 1528

Query: 197  GSRRHLEFEVGDLVWAVLTKDRFPAHEYNKLK-----AVRILKKINSNAYRLELPPGMKT 33
              R+H+ F  GDLVW  L KDRFP    +KL        ++L+KIN NAYRLELP     
Sbjct: 1529 KGRKHVVFAPGDLVWLHLRKDRFPDLRKSKLMPRAGGPFKVLEKINDNAYRLELPADFGV 1588

Query: 32   ADVFNVKHL 6
            +  FN+  L
Sbjct: 1589 SPTFNIADL 1597


>ref|XP_007048683.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
            gi|508700944|gb|EOX92840.1| DNA/RNA polymerases
            superfamily protein [Theobroma cacao]
          Length = 647

 Score =  177 bits (449), Expect = 3e-42
 Identities = 92/189 (48%), Positives = 123/189 (65%), Gaps = 5/189 (2%)
 Frame = -2

Query: 557  TSLDFSSAYHPQTDGQTEVVNRSLGNLLRCLVGEHIKKWAQVLSQAEFAHNQALNRSTGY 378
            T L +SS  HPQTDGQTEVVNRSLGN+LRCL+  + K W  V+ QAEFA+N ++NRS   
Sbjct: 442  TELKYSSTCHPQTDGQTEVVNRSLGNMLRCLIQNNPKTWDLVIPQAEFAYNNSVNRSIKK 501

Query: 377  CPFEVVYGLLPRGPLDLVSLPASSPVHSSAAELLEHIQQVHQSTSAHLKTANDKYKLAAD 198
             PFEV YGL P+  LDLV LP  + V +       HI+++H+   A LK +N +Y   A+
Sbjct: 502  TPFEVAYGLKPQHVLDLVPLPQEARVSNEGELFAYHIRKIHEEVKAALKASNAEYSFTAN 561

Query: 197  GSRRHLEFEVGDLVWAVLTKDRFPAHEYNKLKA-----VRILKKINSNAYRLELPPGMKT 33
              RR  EFE GD V   L ++RFP   Y+KLK+      +++KKI+SNAY +ELPP ++ 
Sbjct: 562  QHRRKQEFEEGDQVLVHLRQERFPKGTYHKLKSRKFGPCKVIKKISSNAYLIELPPELQI 621

Query: 32   ADVFNVKHL 6
            + +FNV  L
Sbjct: 622  SPIFNVLDL 630


>ref|XP_007010495.1| Uncharacterized protein TCM_044370 [Theobroma cacao]
            gi|508727408|gb|EOY19305.1| Uncharacterized protein
            TCM_044370 [Theobroma cacao]
          Length = 1306

 Score =  176 bits (447), Expect = 5e-42
 Identities = 92/189 (48%), Positives = 121/189 (64%), Gaps = 5/189 (2%)
 Frame = -2

Query: 557  TSLDFSSAYHPQTDGQTEVVNRSLGNLLRCLVGEHIKKWAQVLSQAEFAHNQALNRSTGY 378
            T L +SS  HPQTD QTEVVNRSLGN+LRCL+  + K W  V  QAEFA+N ++NRS   
Sbjct: 1070 TELKYSSTCHPQTDSQTEVVNRSLGNILRCLIQNNPKTWDLVKPQAEFAYNNSVNRSIKK 1129

Query: 377  CPFEVVYGLLPRGPLDLVSLPASSPVHSSAAELLEHIQQVHQSTSAHLKTANDKYKLAAD 198
             PFE  YGL P+  LDLV LP  + V +      +HIQ++H+   A LK +N +Y   A+
Sbjct: 1130 TPFEAAYGLKPQHVLDLVPLPQEARVSNEGELFADHIQKIHEEVKAALKASNAEYSFTAN 1189

Query: 197  GSRRHLEFEVGDLVWAVLTKDRFPAHEYNKLKA-----VRILKKINSNAYRLELPPGMKT 33
              RR  EFE GD V   L ++RFP   Y+KLK+      ++LKKI+SNAY +ELPP ++ 
Sbjct: 1190 QHRRKQEFEEGDQVLVYLRQERFPKGTYHKLKSRKFGPCKVLKKISSNAYLIELPPELQI 1249

Query: 32   ADVFNVKHL 6
            + +FNV  L
Sbjct: 1250 SHIFNVLDL 1258


>gb|AAK94517.1| gag-pol polyprotein [Hordeum vulgare]
          Length = 1717

 Score =  176 bits (447), Expect = 5e-42
 Identities = 89/189 (47%), Positives = 122/189 (64%), Gaps = 5/189 (2%)
 Frame = -2

Query: 557  TSLDFSSAYHPQTDGQTEVVNRSLGNLLRCLVGEHIKKWAQVLSQAEFAHNQALNRSTGY 378
            T L FS+  HPQTDGQTEVVNRSL  +LR ++  ++K W + L   EFA+N++L+ +T  
Sbjct: 1406 TKLLFSTTCHPQTDGQTEVVNRSLSTMLRAVLKTNLKLWEECLPHIEFAYNRSLHSTTKM 1465

Query: 377  CPFEVVYGLLPRGPLDLVSLPASSPVHSSAAELLEHIQQVHQSTSAHLKTANDKYKLAAD 198
            CPFE+VYG LPR P+DL+ +P+S  V+  A E  E I ++H+ T  +++  N +YKLA D
Sbjct: 1466 CPFEIVYGFLPRAPIDLLPIPSSEKVNFDAKERAELILKMHELTKENIERMNARYKLAGD 1525

Query: 197  GSRRHLEFEVGDLVWAVLTKDRFPAHEYNKLK-----AVRILKKINSNAYRLELPPGMKT 33
              R+H+ F  GDLVW  L KDRFP    +KL        ++L+KIN NAYRLELP     
Sbjct: 1526 KGRKHVVFAPGDLVWLHLRKDRFPDLRKSKLMPRAGGPFKVLEKINDNAYRLELPXDFGV 1585

Query: 32   ADVFNVKHL 6
            +  FN+  L
Sbjct: 1586 SPTFNIADL 1594


>ref|XP_007019474.1| Uncharacterized protein TCM_035549 [Theobroma cacao]
            gi|508724802|gb|EOY16699.1| Uncharacterized protein
            TCM_035549 [Theobroma cacao]
          Length = 1392

 Score =  176 bits (445), Expect = 8e-42
 Identities = 91/189 (48%), Positives = 121/189 (64%), Gaps = 5/189 (2%)
 Frame = -2

Query: 557  TSLDFSSAYHPQTDGQTEVVNRSLGNLLRCLVGEHIKKWAQVLSQAEFAHNQALNRSTGY 378
            T L +SS  HPQTDGQTEVVNRSLGN+LRCL+  + K W  V+ QAEFA+N ++NRS   
Sbjct: 1114 TELKYSSTCHPQTDGQTEVVNRSLGNMLRCLIQNNPKTWDLVIPQAEFAYNNSVNRSIKK 1173

Query: 377  CPFEVVYGLLPRGPLDLVSLPASSPVHSSAAELLEHIQQVHQSTSAHLKTANDKYKLAAD 198
             PFE  YGL P+  LDLV LP    V +      +HI+++H+     LK +N +Y   A+
Sbjct: 1174 TPFEAAYGLKPQHVLDLVPLPQEPRVSNEGELFADHIRKIHEEVKTALKASNAQYSFTAN 1233

Query: 197  GSRRHLEFEVGDLVWAVLTKDRFPAHEYNKLKA-----VRILKKINSNAYRLELPPGMKT 33
              RR  EFE GD V   L ++RFP   Y+KLK+      ++LKKI+SNAY +ELPP ++ 
Sbjct: 1234 QHRRKQEFEEGDQVLVHLRQERFPKGTYHKLKSRKFGPCKVLKKISSNAYLIELPPELQI 1293

Query: 32   ADVFNVKHL 6
            + +FNV  L
Sbjct: 1294 SPIFNVLDL 1302


>ref|XP_008779530.1| PREDICTED: uncharacterized protein LOC103699270, partial [Phoenix
            dactylifera]
          Length = 1140

 Score =  174 bits (441), Expect = 2e-41
 Identities = 87/189 (46%), Positives = 124/189 (65%), Gaps = 5/189 (2%)
 Frame = -2

Query: 557  TSLDFSSAYHPQTDGQTEVVNRSLGNLLRCLVGEHIKKWAQVLSQAEFAHNQALNRSTGY 378
            T L +S+AYHPQTDGQTEVVNRSLGNLLRCLVG+H   W  +LS AEFA+N ++NR++G 
Sbjct: 816  TKLKYSTAYHPQTDGQTEVVNRSLGNLLRCLVGDHPGNWDLLLSTAEFAYNSSVNRTSGL 875

Query: 377  CPFEVVYGLLPRGPLDLVSLPASSPVHSSAAELLEHIQQVHQSTSAHLKTANDKYKLAAD 198
             PFE+V G +PR P+DL+ +  ++ +  +A    +H+Q +H+  +  ++  N +YK+A D
Sbjct: 876  SPFEIVLGYVPRKPVDLIPVAPNNRISETAESFAQHMQNLHKEINKKIEINNARYKMAVD 935

Query: 197  GSRRHLEFEVGDLVWAVLTKDRFPAHEYNKLKA-----VRILKKINSNAYRLELPPGMKT 33
              RR+ EF VGD V   +  +RFP     KL A      +ILK++ SNAY +++P     
Sbjct: 936  LRRRYQEFRVGDDVMIRIRPERFPPGTVRKLHARSMGPYKILKRVGSNAYVVDIPSDFGI 995

Query: 32   ADVFNVKHL 6
              VFNV+ L
Sbjct: 996  NPVFNVEDL 1004


>ref|XP_007051412.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
            gi|508703673|gb|EOX95569.1| DNA/RNA polymerases
            superfamily protein [Theobroma cacao]
          Length = 1452

 Score =  173 bits (439), Expect = 4e-41
 Identities = 90/189 (47%), Positives = 121/189 (64%), Gaps = 5/189 (2%)
 Frame = -2

Query: 557  TSLDFSSAYHPQTDGQTEVVNRSLGNLLRCLVGEHIKKWAQVLSQAEFAHNQALNRSTGY 378
            T L +SS  HPQTDGQTEVVNRSLGN+LRCL+  + K W  V+ QAEFA+N ++NRS   
Sbjct: 1174 TELKYSSTCHPQTDGQTEVVNRSLGNMLRCLIQNNPKTWDLVIPQAEFAYNNSVNRSIKK 1233

Query: 377  CPFEVVYGLLPRGPLDLVSLPASSPVHSSAAELLEHIQQVHQSTSAHLKTANDKYKLAAD 198
             PFE  YGL P+  LDLV LP  + V +      + I+++H+   A LK +N +Y   A+
Sbjct: 1234 TPFEAAYGLKPQHVLDLVPLPQEARVSNEGELFADQIRKIHEEVKAALKASNAEYSFTAN 1293

Query: 197  GSRRHLEFEVGDLVWAVLTKDRFPAHEYNKLKA-----VRILKKINSNAYRLELPPGMKT 33
              RR  EFE GD V   L ++RFP   Y+KLK+      ++LKKI+SNAY +ELPP ++ 
Sbjct: 1294 QHRRKQEFEEGDQVLVHLRQERFPKGTYHKLKSRKFGPCKVLKKISSNAYLIELPPELQI 1353

Query: 32   ADVFNVKHL 6
              +FN+  L
Sbjct: 1354 NPIFNILDL 1362


>gb|ABI96971.1| putative gag-pol polyprotein [Triticum monococcum subsp.
            aegilopoides]
          Length = 1704

 Score =  173 bits (438), Expect = 5e-41
 Identities = 87/185 (47%), Positives = 119/185 (64%), Gaps = 5/185 (2%)
 Frame = -2

Query: 545  FSSAYHPQTDGQTEVVNRSLGNLLRCLVGEHIKKWAQVLSQAEFAHNQALNRSTGYCPFE 366
            FS+  HPQTDGQTEVVNR+L  +LR ++  + K W + L   EFA+N++L+ +T  CPFE
Sbjct: 1398 FSTTCHPQTDGQTEVVNRTLSTMLRAVLKNNKKMWEECLPHIEFAYNRSLHSTTKMCPFE 1457

Query: 365  VVYGLLPRGPLDLVSLPASSPVHSSAAELLEHIQQVHQSTSAHLKTANDKYKLAADGSRR 186
            +VYG LPR P+DL+ LP+S  V+  A E  E I ++H+ T  +++  N KYKLA D  R+
Sbjct: 1458 IVYGFLPRAPIDLLPLPSSEKVNFDAKERSELILKIHELTKENIERMNAKYKLARDKGRK 1517

Query: 185  HLEFEVGDLVWAVLTKDRFPAHEYNKLK-----AVRILKKINSNAYRLELPPGMKTADVF 21
            H+ F  GDLVW  L KDRFP    +KL        ++L+KIN NAY+LELP     +  F
Sbjct: 1518 HVVFAPGDLVWLHLRKDRFPNLRKSKLMPRADGPFKVLEKINDNAYKLELPADFGVSPTF 1577

Query: 20   NVKHL 6
            N+  L
Sbjct: 1578 NIADL 1582


>ref|XP_007023480.1| Uncharacterized protein TCM_027505 [Theobroma cacao]
           gi|508778846|gb|EOY26102.1| Uncharacterized protein
           TCM_027505 [Theobroma cacao]
          Length = 292

 Score =  173 bits (438), Expect = 5e-41
 Identities = 90/189 (47%), Positives = 120/189 (63%), Gaps = 5/189 (2%)
 Frame = -2

Query: 557 TSLDFSSAYHPQTDGQTEVVNRSLGNLLRCLVGEHIKKWAQVLSQAEFAHNQALNRSTGY 378
           T L + S  HPQTDGQTEVVNRSLGN+LRCL+  + K W  V+ QAEFA+N  +NRS   
Sbjct: 14  TELKYFSTCHPQTDGQTEVVNRSLGNMLRCLIQNNPKTWDLVIPQAEFAYNNFVNRSIKK 73

Query: 377 CPFEVVYGLLPRGPLDLVSLPASSPVHSSAAELLEHIQQVHQSTSAHLKTANDKYKLAAD 198
            PFE  YGL P+  LDLV LP  + V +      +HI+++H+   A LK +N +Y   A+
Sbjct: 74  TPFEAAYGLKPQHVLDLVPLPQEARVSNEGELFADHIRKIHEEVKAALKASNAEYSFTAN 133

Query: 197 GSRRHLEFEVGDLVWAVLTKDRFPAHEYNKLKA-----VRILKKINSNAYRLELPPGMKT 33
             RR  EFE GD V   L ++RFP   Y+KLK+      ++LKKI+SNAY +ELPP ++ 
Sbjct: 134 QHRRKQEFEEGDQVLVHLRQERFPKGTYHKLKSRKFGPCKVLKKISSNAYLIELPPELQI 193

Query: 32  ADVFNVKHL 6
             +FN+  L
Sbjct: 194 NPIFNILDL 202


>gb|AAK51582.1|AC022352_18 Putative retroelement [Oryza sativa Japonica Group]
            gi|31431012|gb|AAP52850.1| retrotransposon protein,
            putative, Ty3-gypsy subclass [Oryza sativa Japonica
            Group]
          Length = 2447

 Score =  172 bits (437), Expect = 7e-41
 Identities = 86/189 (45%), Positives = 123/189 (65%), Gaps = 5/189 (2%)
 Frame = -2

Query: 557  TSLDFSSAYHPQTDGQTEVVNRSLGNLLRCLVGEHIKKWAQVLSQAEFAHNQALNRSTGY 378
            T L FS+  HPQTDGQTEVVNR+L  +LR ++ ++IK W + L   EFA+N++L+ +T  
Sbjct: 1350 TKLLFSTTCHPQTDGQTEVVNRTLSTMLRAVLKKNIKMWEECLPHIEFAYNRSLHSTTKM 1409

Query: 377  CPFEVVYGLLPRGPLDLVSLPASSPVHSSAAELLEHIQQVHQSTSAHLKTANDKYKLAAD 198
            CPF++VYGLLPR P+DL+ LP+S  ++  A +  E + ++H++T  +++  N KYK A D
Sbjct: 1410 CPFQIVYGLLPRAPIDLMPLPSSEKLNFDAKQRAELMLKLHETTKENIERMNAKYKFAGD 1469

Query: 197  GSRRHLEFEVGDLVWAVLTKDRFPAHEYNKLK-----AVRILKKINSNAYRLELPPGMKT 33
              RR L FE GDLVW  L K+RFP    +KL        ++L KIN NAY+++LP     
Sbjct: 1470 KGRRELTFEPGDLVWLHLRKERFPDLRKSKLMPRADGPFKVLAKINENAYKIDLPADFGV 1529

Query: 32   ADVFNVKHL 6
            +  FNV  L
Sbjct: 1530 SPTFNVADL 1538



 Score = 95.1 bits (235), Expect = 2e-17
 Identities = 69/192 (35%), Positives = 98/192 (51%), Gaps = 8/192 (4%)
 Frame = -2

Query: 557  TSLDFSSAYHPQTDGQTEVVNRSLGNLLRCLVGEHIKKWAQVLSQAEFAHNQALNRSTGY 378
            T L+FS+AYHPQTDGQTE +N+ L ++L   V +  K W + L  AEF++N +   S   
Sbjct: 2172 TRLNFSTAYHPQTDGQTERLNQILEDMLHACVLDFGKTWDKSLPYAEFSYNNSYQASIQM 2231

Query: 377  CPFEVVYGLLPRGPLDLVSLPASSPVHSSAAELLEHIQQVHQSTSAHLKTANDKYKLAAD 198
             P+E +YG   R PL L      S V  +  ++L   +   ++   +LK A  + K  AD
Sbjct: 2232 APYEALYGRKCRTPL-LWDQVGESQVFGT--DILREAEAKVRTIWDNLKVAQSRQKSYAD 2288

Query: 197  GSRRHLEFEVGDLVWAVLTKDRFPAHEYNKLKAV-------RILKKINSNAYRLELPPGM 39
              RR+LEF V D V+  +T  R       K K         RI+ +    AY+LELP  +
Sbjct: 2289 NRRRNLEFAVDDFVYLRVTPLRGVHRFQTKGKLAPRFVGPFRIIARRGEVAYQLELPASL 2348

Query: 38   -KTADVFNVKHL 6
                DVF+V  L
Sbjct: 2349 GNVHDVFHVSQL 2360


>dbj|BAA89466.1| gag-pol polyprotein, partial [Oryza sativa Indica Group]
          Length = 1587

 Score =  172 bits (436), Expect = 9e-41
 Identities = 87/189 (46%), Positives = 122/189 (64%), Gaps = 5/189 (2%)
 Frame = -2

Query: 557  TSLDFSSAYHPQTDGQTEVVNRSLGNLLRCLVGEHIKKWAQVLSQAEFAHNQALNRSTGY 378
            T L FS+  HPQTDGQTEVVNR+L  +LR ++ ++IK W + L   EFA+N++ + +T  
Sbjct: 1378 TKLLFSTTCHPQTDGQTEVVNRTLSTMLRAVLKKNIKMWEECLPHVEFAYNRSQHSTTKK 1437

Query: 377  CPFEVVYGLLPRGPLDLVSLPASSPVHSSAAELLEHIQQVHQSTSAHLKTANDKYKLAAD 198
            CPFE+VYGLLPR P+DL+ LP S  V+  A    E + ++H++T  +++  N KYKLA  
Sbjct: 1438 CPFEIVYGLLPRAPIDLLPLPTSERVNFDAKYRAELMLKLHETTKENIERMNIKYKLAGS 1497

Query: 197  GSRRHLEFEVGDLVWAVLTKDRFPAHEYNKL-----KAVRILKKINSNAYRLELPPGMKT 33
              ++H+ FE GDLVW  L KDRFP    +KL        ++L+KIN NAY+LELP     
Sbjct: 1498 KGKKHVAFEPGDLVWLHLRKDRFPNLRKSKLLPRADGPFKVLQKINDNAYKLELPADFGV 1557

Query: 32   ADVFNVKHL 6
            +  FN+  L
Sbjct: 1558 SPTFNIADL 1566


>gb|AAM94350.1| gag-pol polyprotein [Zea mays]
          Length = 1618

 Score =  171 bits (434), Expect = 2e-40
 Identities = 84/189 (44%), Positives = 123/189 (65%), Gaps = 5/189 (2%)
 Frame = -2

Query: 557  TSLDFSSAYHPQTDGQTEVVNRSLGNLLRCLVGEHIKKWAQVLSQAEFAHNQALNRSTGY 378
            T L FS+  HPQTDGQTEVVNR+L  +LR ++ ++IK W   L   EFA+N++L+ +T  
Sbjct: 1353 TKLLFSTTCHPQTDGQTEVVNRTLSTMLRAVLKKNIKMWEDCLPHIEFAYNRSLHSTTKM 1412

Query: 377  CPFEVVYGLLPRGPLDLVSLPASSPVHSSAAELLEHIQQVHQSTSAHLKTANDKYKLAAD 198
            CPF++VYGLLPR P+DL+ LP+S  ++  A    E + ++H++T  +++  N +YK A+D
Sbjct: 1413 CPFQIVYGLLPRAPIDLMPLPSSEKLNFDATRRAELMLKLHETTKENIERMNARYKFASD 1472

Query: 197  GSRRHLEFEVGDLVWAVLTKDRFPAHEYNKL-----KAVRILKKINSNAYRLELPPGMKT 33
              R+ + FE GDLVW  L K+RFP    +KL        ++L+KIN NAYRL+LP     
Sbjct: 1473 KGRKEINFEPGDLVWLHLRKERFPELRKSKLLPRADGPFKVLEKINDNAYRLDLPADFGV 1532

Query: 32   ADVFNVKHL 6
            +  FN+  L
Sbjct: 1533 SPTFNIADL 1541


>ref|XP_007037024.1| Uncharacterized protein TCM_013224 [Theobroma cacao]
           gi|508774269|gb|EOY21525.1| Uncharacterized protein
           TCM_013224 [Theobroma cacao]
          Length = 412

 Score =  171 bits (433), Expect = 2e-40
 Identities = 89/189 (47%), Positives = 122/189 (64%), Gaps = 5/189 (2%)
 Frame = -2

Query: 557 TSLDFSSAYHPQTDGQTEVVNRSLGNLLRCLVGEHIKKWAQVLSQAEFAHNQALNRSTGY 378
           T L +SS  HPQTDGQT+VVNRSLGN+LR L+  + K W  V+ QAEFA+N ++NRS   
Sbjct: 134 TELKYSSTCHPQTDGQTKVVNRSLGNMLRYLIQNNPKTWDLVIPQAEFAYNNSVNRSIKK 193

Query: 377 CPFEVVYGLLPRGPLDLVSLPASSPVHSSAAELLEHIQQVHQSTSAHLKTANDKYKLAAD 198
            PFE  YGL P+  LDLV LP  + V +      +HI+++H+   A LK +N +Y   A+
Sbjct: 194 TPFEAAYGLKPQHVLDLVPLPQEARVSNKGELFADHIRKIHEEVKAALKASNAEYSFTAN 253

Query: 197 GSRRHLEFEVGDLVWAVLTKDRFPAHEYNKLKA-----VRILKKINSNAYRLELPPGMKT 33
             RR  EF+ GD V   L ++RFP   Y+KLK+      ++LKKI+SNAY +ELPP ++ 
Sbjct: 254 QHRRKQEFDEGDQVLVHLRQERFPKGTYHKLKSRKFGPCKVLKKISSNAYLIELPPELQI 313

Query: 32  ADVFNVKHL 6
           + +FNV  L
Sbjct: 314 SPIFNVLDL 322


>gb|AAK91332.1|AC090441_14 Putative gag-pol polyprotein [Oryza sativa Japonica Group]
            gi|15217296|gb|AAK92640.1|AC079634_1 Putative
            retroelement [Oryza sativa Japonica Group]
            gi|31431373|gb|AAP53161.1| retrotransposon protein,
            putative, Ty3-gypsy subclass [Oryza sativa Japonica
            Group]
          Length = 1708

 Score =  170 bits (431), Expect = 3e-40
 Identities = 86/189 (45%), Positives = 121/189 (64%), Gaps = 5/189 (2%)
 Frame = -2

Query: 557  TSLDFSSAYHPQTDGQTEVVNRSLGNLLRCLVGEHIKKWAQVLSQAEFAHNQALNRSTGY 378
            T L FS+  HPQTDGQTEVVNR+L  +LR ++ ++IK W + L   EFA+N++ + +T  
Sbjct: 1378 TKLLFSTTCHPQTDGQTEVVNRTLSTMLRAVLKKNIKMWEECLPHVEFAYNRSQHSTTKK 1437

Query: 377  CPFEVVYGLLPRGPLDLVSLPASSPVHSSAAELLEHIQQVHQSTSAHLKTANDKYKLAAD 198
            CPFE+VYGLLPR P+DL+ LP S  V+  A    E + ++H++T  +++  N KYKLA  
Sbjct: 1438 CPFEIVYGLLPRAPIDLLPLPTSERVNFDAKYHAELMLKLHETTKENIERMNIKYKLAGS 1497

Query: 197  GSRRHLEFEVGDLVWAVLTKDRFPAHEYNKL-----KAVRILKKINSNAYRLELPPGMKT 33
              ++H+ FE GDLVW  L KDRFP    +KL        ++L+KIN N Y+LELP     
Sbjct: 1498 KGKKHVAFEPGDLVWLHLRKDRFPNLRKSKLLPRADGPFKVLQKINDNTYKLELPADFGV 1557

Query: 32   ADVFNVKHL 6
            +  FN+  L
Sbjct: 1558 SPTFNIADL 1566

