BLASTX nr result

ID: Catharanthus23_contig00006355 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00006355
         (1855 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|AAU90169.1| putative polyprotein [Oryza sativa Japonica Group]     438   e-174
gb|ADP20178.1| gag-pol polyprotein [Silene latifolia]                 364   e-155
gb|ADP20179.1| gag-pol polyprotein [Silene latifolia]                 362   e-152
gb|EOX95569.1| DNA/RNA polymerases superfamily protein [Theobrom...   328   e-134
gb|AAW28578.1| Putative gag-pol polyprotein, identical [Solanum ...   484   e-134
gb|AAW28577.1| Putative gag-pol polyprotein, identical [Solanum ...   483   e-133
gb|ABE60891.1| putative polyprotein [Oryza sativa Japonica Group]     478   e-132
gb|AAT85159.1| unknown protein [Oryza sativa Japonica Group] gi|...   478   e-132
gb|EOX92840.1| DNA/RNA polymerases superfamily protein [Theobrom...   328   e-132
ref|NP_001063540.1| Os09g0491900 [Oryza sativa Japonica Group] g...   475   e-131
gb|AAW28576.2| Gag-pol polyprotein, putative [Solanum demissum]       466   e-128
gb|EOY01158.1| DNA/RNA polymerases superfamily protein [Theobrom...   303   e-127
emb|CAN84067.1| hypothetical protein VITISV_041979 [Vitis vinifera]   237   e-124
emb|CAN77900.1| hypothetical protein VITISV_037350 [Vitis vinifera]   450   e-124
gb|AAF79348.1|AC007887_7 F15O4.13 [Arabidopsis thaliana]              449   e-123
gb|AAK51582.1|AC022352_18 Putative retroelement [Oryza sativa Ja...   449   e-123
gb|AAQ56407.1| putative gag-pol polyprotein [Oryza sativa Japoni...   449   e-123
dbj|BAA89466.1| gag-pol polyprotein [Oryza sativa Indica Group]       443   e-121
gb|ABI96971.1| putative gag-pol polyprotein [Triticum monococcum...   442   e-121
gb|AAQ56388.1| putative gag-pol polyprotein [Oryza sativa Japoni...   442   e-121

>gb|AAU90169.1| putative polyprotein [Oryza sativa Japonica Group]
          Length = 1154

 Score =  438 bits (1126), Expect(2) = e-174
 Identities = 205/325 (63%), Positives = 251/325 (77%), Gaps = 1/325 (0%)
 Frame = -3

Query: 1277 LKRDVERFVGSCIECKKAKSRTLPHGLYTPLEVPKEPWVDISMDFILGLPRTSKGIDSIF 1098
            L+ DVER+V  C+   KAKS+  PHGLYTPL VP  PW DISMDF+LGLPRT +G DSIF
Sbjct: 829  LRHDVERYVQRCVTSHKAKSKLNPHGLYTPLPVPNAPWEDISMDFVLGLPRTRRGRDSIF 888

Query: 1097 VVVDRFSKMAHFIPCRKSDDASHVASLFFTWIVRFHGIPRSIVSDRDTKFLSHFWRVLWK 918
            V VDRFSKMAHFIPC KSDDASHVA LFF  +VR HG+PR+IVSDRD KF+S+FW+ LW 
Sbjct: 889  VAVDRFSKMAHFIPCNKSDDASHVADLFFREVVRLHGVPRTIVSDRDVKFMSYFWKTLWA 948

Query: 917  KLGTNLLYSTACHLQTDGQTEVVNRTIGQLLRAILKKKLRQWEECLPHIEFAYNRAPHST 738
            KLGT LL+ST CH Q DGQ EVVNRT+  LLR ++KK L++WE+CLPH+EFAYNR  HST
Sbjct: 949  KLGTKLLFSTTCHSQIDGQMEVVNRTLSMLLRMMIKKNLKEWEDCLPHVEFAYNRVVHST 1008

Query: 737  TSHSPFQVVYGFNPLTPFDLVPLPIDQALSKDASKKANFMKAIHIKVHEAIVNSNKKLVE 558
            T  SPF+VVYGFNP+TP DL+PLP+ +  + +A+K+A+++K +H K  E I    +    
Sbjct: 1009 TQLSPFEVVYGFNPITPLDLLPLPLQERANMEATKRADYVKKMHEKTKETIERIIQSYAA 1068

Query: 557  KRNVGRRKVVFKPGD*VWVHFRKERFPQQRKSKLDDRGDGQFQILEKINDNAYKVDLPGH 378
            K N  R+K++F+PG+ VWVH RK+RFP++RKSKL   GDG F++LEKI DNAYK+DLPG 
Sbjct: 1069 KANKDRKKMLFQPGELVWVHLRKDRFPEKRKSKLMPHGDGPFRVLEKITDNAYKIDLPGD 1128

Query: 377  YNVSATFNVSDLS-LFDIGDGDSRT 306
            Y VS TFNV DLS  F   + DSRT
Sbjct: 1129 YTVSNTFNVVDLSPFFGTEETDSRT 1153



 Score =  204 bits (520), Expect(2) = e-174
 Identities = 93/161 (57%), Positives = 120/161 (74%), Gaps = 1/161 (0%)
 Frame = -2

Query: 1854 GKPLAYFSEKLNGAAVRYSTYDKELYALIRTLKHWQHYLWPKEFVVHNDHEALKFLKGQH 1675
            GKP+AYFSEKL  A + Y  YDKELYAL+R L+ WQHYLWPKEFV+H++HEALK+L+GQ 
Sbjct: 670  GKPIAYFSEKLGSAQLNYPVYDKELYALVRALETWQHYLWPKEFVIHSNHEALKYLRGQA 729

Query: 1674 KLNKRHARWMKFIETFPYAIHYKKGKENVVADALSRRYTLLSTLQTKILGFEMLKEMYSS 1495
             LN+RHA+W++FIE+FPY + YKKGKENVVADALSR+  LL+ L  K+   E LKE+YS 
Sbjct: 730  NLNRRHAKWVEFIESFPYIVRYKKGKENVVADALSRKSVLLTQLDVKVSSLESLKELYSK 789

Query: 1494 DHYFKEIFEKCLLAP-FGKYFLHEGFFYCEGRLCISSCSTR 1375
            D  F + + KCL    + KY +H+GF +   +LC+   S R
Sbjct: 790  DSEFSDPYSKCLDGKGWEKYHVHDGFLFRADKLCVPESSLR 830


>gb|ADP20178.1| gag-pol polyprotein [Silene latifolia]
          Length = 1518

 Score =  364 bits (935), Expect(2) = e-155
 Identities = 182/353 (51%), Positives = 239/353 (67%)
 Frame = -3

Query: 1325 VSKTYEILHEHVFWPHLKRDVERFVGSCIECKKAKSRTLPHGLYTPLEVPKEPWVDISMD 1146
            V KT EIL +  +WP +  DV+  +  C +C+ +KS   P G YTPL VP +PW D+SMD
Sbjct: 1110 VQKTLEILQDQFYWPRMMGDVQIILRRCSKCQLSKSSFQP-GPYTPLPVPSKPWEDLSMD 1168

Query: 1145 FILGLPRTSKGIDSIFVVVDRFSKMAHFIPCRKSDDASHVASLFFTWIVRFHGIPRSIVS 966
            FI+ LPRT +G DS+ VVVDRFSKMAHF+ C+K++DA  VA LF   IVR HG+P++IVS
Sbjct: 1169 FIVALPRTQRGKDSVMVVVDRFSKMAHFVACKKTEDAVSVAELFLKEIVRLHGVPKTIVS 1228

Query: 965  DRDTKFLSHFWRVLWKKLGTNLLYSTACHLQTDGQTEVVNRTIGQLLRAILKKKLRQWEE 786
            DRDTKF+ +FW+ LWK L T LL+ST+ H QTDGQTEV NRT+G++LR ++ K L+ W+ 
Sbjct: 1229 DRDTKFMGYFWKTLWKLLKTKLLFSTSHHPQTDGQTEVTNRTLGRILRCLVSKSLKDWDL 1288

Query: 785  CLPHIEFAYNRAPHSTTSHSPFQVVYGFNPLTPFDLVPLPIDQALSKDASKKANFMKAIH 606
             L   EFA+NRAP + T HSPF+VVYG NPL P DL  +P  + ++ DA K+A  +  +H
Sbjct: 1289 KLAAAEFAFNRAPSTATGHSPFEVVYGVNPLMPLDLSSVP-KENINLDAMKRAEQLLKLH 1347

Query: 605  IKVHEAIVNSNKKLVEKRNVGRRKVVFKPGD*VWVHFRKERFPQQRKSKLDDRGDGQFQI 426
              V   I  +N++  +   V + K  F+PGD VW+H RKERFP +RK+KL  R DG F++
Sbjct: 1348 ETVKRQIERTNEQYQKHLKVPKGKKEFEPGDLVWIHLRKERFPAKRKNKLMPRSDGPFEV 1407

Query: 425  LEKINDNAYKVDLPGHYNVSATFNVSDLSLFDIGDGDSRTGRSCESVRHARTS 267
            +EKI  +AYK+DLPG Y V  TFNV DLS +    GD       E V   RTS
Sbjct: 1408 VEKIGPSAYKIDLPGDYGVHGTFNVGDLSPYYEDSGD-------EEVTGLRTS 1453



 Score =  214 bits (545), Expect(2) = e-155
 Identities = 100/181 (55%), Positives = 134/181 (74%), Gaps = 3/181 (1%)
 Frame = -2

Query: 1851 KPLAYFSEKLNGAAVRYSTYDKELYALIRTLKHWQHYLWPKEFVVHNDHEALKFLKGQHK 1672
            KP+AYFSEKLNGA ++YSTYDKE YA+IR L HW HYL PK FV+H+DHEALK++ GQHK
Sbjct: 931  KPVAYFSEKLNGAKLKYSTYDKEFYAIIRALMHWNHYLKPKPFVLHSDHEALKYINGQHK 990

Query: 1671 LNKRHARWMKFIETFPYAIHYKKGKENVVADALSRRYTLLSTLQTKILGFEMLKEMYSSD 1492
            LN RHA+W++F+++F ++  YK+GK+NVVADALSRR++LLS +  ++LGFE +KE+Y  D
Sbjct: 991  LNFRHAKWVEFLQSFTFSSKYKEGKKNVVADALSRRHSLLSVMSNRVLGFEFMKELYKED 1050

Query: 1491 HYFKEIF---EKCLLAPFGKYFLHEGFFYCEGRLCISSCSTRILLVKEAHCGGLMGHFGV 1321
              F E +    +       KY L EGF +   +LC+   S R LL++E H GG+ GHFGV
Sbjct: 1051 PDFSEEWITQTEGHKNQGSKYLLQEGFLFQGNKLCVPRGSYRDLLIREVHSGGMGGHFGV 1110

Query: 1320 K 1318
            +
Sbjct: 1111 Q 1111


>gb|ADP20179.1| gag-pol polyprotein [Silene latifolia]
          Length = 1475

 Score =  362 bits (930), Expect(2) = e-152
 Identities = 176/329 (53%), Positives = 229/329 (69%)
 Frame = -3

Query: 1325 VSKTYEILHEHVFWPHLKRDVERFVGSCIECKKAKSRTLPHGLYTPLEVPKEPWVDISMD 1146
            + KTY+IL E  +WP +  DV+  +  C  C+++KS     G YTPL VP +PW DISMD
Sbjct: 1098 IQKTYDILQEQFYWPKMLGDVQDVIKRCAPCQQSKSY-FQTGPYTPLPVPNQPWEDISMD 1156

Query: 1145 FILGLPRTSKGIDSIFVVVDRFSKMAHFIPCRKSDDASHVASLFFTWIVRFHGIPRSIVS 966
            FI+ LPRT +G DSI VVVDRFSKMAHFI C+K++DA+ VA L+F  +V+ HGIP+SIVS
Sbjct: 1157 FIVALPRTQRGKDSIMVVVDRFSKMAHFIACKKTEDATSVAELYFKEVVKLHGIPKSIVS 1216

Query: 965  DRDTKFLSHFWRVLWKKLGTNLLYSTACHLQTDGQTEVVNRTIGQLLRAILKKKLRQWEE 786
            DRD+KF+SHFWR LWK L T LL+ST+ H QTDGQTEV N+T+G++LR  + + L+ W+ 
Sbjct: 1217 DRDSKFMSHFWRTLWKLLKTRLLFSTSHHPQTDGQTEVTNKTLGRILRCTVARSLKDWDL 1276

Query: 785  CLPHIEFAYNRAPHSTTSHSPFQVVYGFNPLTPFDLVPLPIDQALSKDASKKANFMKAIH 606
             L   EFA+NRAP +TT  SPF+VVYG NP+ P DL P+     +  DA K+   M  IH
Sbjct: 1277 KLAQAEFAFNRAPSTTTGKSPFEVVYGVNPMMPTDLAPIK-RNTIDYDAKKRVEQMLHIH 1335

Query: 605  IKVHEAIVNSNKKLVEKRNVGRRKVVFKPGD*VWVHFRKERFPQQRKSKLDDRGDGQFQI 426
             +V + I  +N+    K    +    F+PGD VW+H RKERFP++RK+KL  R DG F++
Sbjct: 1336 EQVKKQIEKANEAHKGKSKGVKGTKSFEPGDLVWIHLRKERFPEKRKNKLMPRADGPFEV 1395

Query: 425  LEKINDNAYKVDLPGHYNVSATFNVSDLS 339
            LEK   NAYK++LPG Y V  TFNV DLS
Sbjct: 1396 LEKFGSNAYKINLPGEYGVHGTFNVGDLS 1424



 Score =  205 bits (522), Expect(2) = e-152
 Identities = 91/181 (50%), Positives = 131/181 (72%), Gaps = 3/181 (1%)
 Frame = -2

Query: 1851 KPLAYFSEKLNGAAVRYSTYDKELYALIRTLKHWQHYLWPKEFVVHNDHEALKFLKGQHK 1672
            KP+AYFSEKL+GA + YSTYDKE YA++R L HW HYL P+ FV+H+DHEALK++ GQHK
Sbjct: 919  KPIAYFSEKLSGAKLNYSTYDKEFYAIVRALNHWSHYLKPRPFVLHSDHEALKYINGQHK 978

Query: 1671 LNKRHARWMKFIETFPYAIHYKKGKENVVADALSRRYTLLSTLQTKILGFEMLKEMYSSD 1492
            LN RHA+W++F+++F ++  Y +GK+N+VADALSRR+ +LS ++ ++LGFE +KE+Y  D
Sbjct: 979  LNHRHAKWVEFLQSFNFSSKYIEGKDNIVADALSRRFIMLSFMEQRVLGFEYMKELYVED 1038

Query: 1491 HYFK---EIFEKCLLAPFGKYFLHEGFFYCEGRLCISSCSTRILLVKEAHCGGLMGHFGV 1321
              FK   E+ +   +    KY +  GF +   +LC+     R LL++E H  GL GHFG+
Sbjct: 1039 PDFKGEWELLQSGQIKLKSKYLVQNGFLFFGNKLCVPRGPYRNLLIREVHSNGLAGHFGI 1098

Query: 1320 K 1318
            +
Sbjct: 1099 Q 1099


>gb|EOX95569.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 1452

 Score =  328 bits (840), Expect(2) = e-134
 Identities = 161/338 (47%), Positives = 216/338 (63%)
 Frame = -3

Query: 1319 KTYEILHEHVFWPHLKRDVERFVGSCIECKKAKSRTLPHGLYTPLEVPKEPWVDISMDFI 1140
            KT  ++ +  +WP ++RDVER V  C  C   K      GLY PL  P  PW+ +SMDF+
Sbjct: 1037 KTLVMVADRYYWPKMRRDVERLVKRCPACLFGKGSAQNTGLYVPLPEPDAPWIHLSMDFV 1096

Query: 1139 LGLPRTSKGIDSIFVVVDRFSKMAHFIPCRKSDDASHVASLFFTWIVRFHGIPRSIVSDR 960
            LGLP+T+KG DSIFVVVDRFSKMAHFIPC ++ DA+H+A LFF  IV  HGIP SIVSDR
Sbjct: 1097 LGLPKTTKGFDSIFVVVDRFSKMAHFIPCFRTSDATHIAELFFREIVILHGIPTSIVSDR 1156

Query: 959  DTKFLSHFWRVLWKKLGTNLLYSTACHLQTDGQTEVVNRTIGQLLRAILKKKLRQWEECL 780
              KF+ +FWR LW+K GT L YS+ CH QTDGQTEVVNR++G +LR +++   + W+  +
Sbjct: 1157 HVKFMGYFWRTLWRKFGTELKYSSTCHPQTDGQTEVVNRSLGNMLRCLIQNNPKTWDLVI 1216

Query: 779  PHIEFAYNRAPHSTTSHSPFQVVYGFNPLTPFDLVPLPIDQALSKDASKKANFMKAIHIK 600
            P  EFAYN + + +   +PF+  YG  P    DLVPLP +  +S +    A+ ++ IH +
Sbjct: 1217 PQAEFAYNNSVNRSIKKTPFEAAYGLKPQHVLDLVPLPQEARVSNEGELFADQIRKIHEE 1276

Query: 599  VHEAIVNSNKKLVEKRNVGRRKVVFKPGD*VWVHFRKERFPQQRKSKLDDRGDGQFQILE 420
            V  A+  SN +     N  RRK  F+ GD V VH R+ERFP+    KL  R  G  ++L+
Sbjct: 1277 VKAALKASNAEYSFTANQHRRKQEFEEGDQVLVHLRQERFPKGTYHKLKSRKFGPCKVLK 1336

Query: 419  KINDNAYKVDLPGHYNVSATFNVSDLSLFDIGDGDSRT 306
            KI+ NAY ++LP    ++  FN+ DL  FD  DG + T
Sbjct: 1337 KISSNAYLIELPPELQINPIFNILDLYPFDGCDGTAST 1374



 Score =  180 bits (456), Expect(2) = e-134
 Identities = 86/180 (47%), Positives = 119/180 (66%), Gaps = 3/180 (1%)
 Frame = -2

Query: 1854 GKPLAYFSEKLNGAAVRYSTYDKELYALIRTLKHWQHYLWPKEFVVHNDHEALKFLKGQH 1675
            G+P+ +FSEKL  +  RYSTYD E YAL+R ++HWQHYL  +EF V++DH+AL++L  Q 
Sbjct: 855  GRPIEFFSEKLTDSRRRYSTYDLEFYALVRAIRHWQHYLAYREFAVYSDHQALRYLHSQK 914

Query: 1674 KLNKRHARWMKFIETFPYAIHYKKGKENVVADALSRRYTLLSTLQTKILGFEMLKEMYSS 1495
            KL+ +HA+W  F+  F +++ YK G+ N VADALSRR  +LS + T++ GFE LK  YSS
Sbjct: 915  KLSNQHAKWSSFLNEFNFSLKYKSGQSNTVADALSRRCKMLSVMSTQVTGFEELKNQYSS 974

Query: 1494 DHYFKEI---FEKCLLAPFGKYFLHEGFFYCEGRLCISSCSTRILLVKEAHCGGLMGHFG 1324
            D YF +I    +  L A    Y LHE + +   +LCI   S R  +++E H  GL GHFG
Sbjct: 975  DSYFSKIIADLQGSLQAENLPYRLHEDYLFKGNQLCIPEGSLREQIIRELHGNGLGGHFG 1034


>gb|AAW28578.1| Putative gag-pol polyprotein, identical [Solanum demissum]
          Length = 1588

 Score =  484 bits (1245), Expect = e-134
 Identities = 236/379 (62%), Positives = 288/379 (75%), Gaps = 2/379 (0%)
 Frame = -3

Query: 1436 FCMKDFFIVKVD--CAYHLVLLEFYLSKKHIVGV*WDILVSKTYEILHEHVFWPHLKRDV 1263
            F ++D F+ K +  C  +  L E ++ + H  G+     V KT EIL EH +WP +++DV
Sbjct: 1125 FNLQDEFLFKENKLCVPNCSLRELFVREAHCGGLMGHFGVPKTLEILSEHFYWPSMRKDV 1184

Query: 1262 ERFVGSCIECKKAKSRTLPHGLYTPLEVPKEPWVDISMDFILGLPRTSKGIDSIFVVVDR 1083
            E+    C+ECK+AKSRTLPHGLYTPL V   PW+DISMDFILGLPRT  G DSIFVVVDR
Sbjct: 1185 EKVCSYCLECKQAKSRTLPHGLYTPLPVSNSPWIDISMDFILGLPRTKYGKDSIFVVVDR 1244

Query: 1082 FSKMAHFIPCRKSDDASHVASLFFTWIVRFHGIPRSIVSDRDTKFLSHFWRVLWKKLGTN 903
            FSKMA FIPC+K++DASHVA LF   +V+ HGIPR+IVSDRD KFLSHFWR+LW KLGT 
Sbjct: 1245 FSKMARFIPCKKTNDASHVADLFVKEVVKLHGIPRTIVSDRDAKFLSHFWRILWGKLGTK 1304

Query: 902  LLYSTACHLQTDGQTEVVNRTIGQLLRAILKKKLRQWEECLPHIEFAYNRAPHSTTSHSP 723
            LL+ST+CH QTDGQTEVVNRT+G +LRAILK KL  WE+ LP +EFAYNR  HS+T  +P
Sbjct: 1305 LLFSTSCHPQTDGQTEVVNRTLGNMLRAILKGKLTSWEDYLPIVEFAYNRTFHSSTGKTP 1364

Query: 722  FQVVYGFNPLTPFDLVPLPIDQALSKDASKKANFMKAIHIKVHEAIVNSNKKLVEKRNVG 543
            F+VVYGFNPLTP DL+PLP +   + D  KKA+ MK IH +   AI   NK++  +RN G
Sbjct: 1365 FEVVYGFNPLTPLDLLPLPTNDFANLDGKKKADMMKKIHEQTRLAIEKKNKEVALRRNKG 1424

Query: 542  RRKVVFKPGD*VWVHFRKERFPQQRKSKLDDRGDGQFQILEKINDNAYKVDLPGHYNVSA 363
            R+ V+FKPGD VWVH RKERFP +RK+KLD RG G +++LE+I DNAYK+DLPG + VSA
Sbjct: 1425 RKYVIFKPGDLVWVHMRKERFPSKRKTKLDPRGSGPYKVLERIGDNAYKLDLPGEFQVSA 1484

Query: 362  TFNVSDLSLFDIGDGDSRT 306
            TFNVSDLS +D  D DSRT
Sbjct: 1485 TFNVSDLSHYD-ADLDSRT 1502



 Score =  192 bits (488), Expect = 4e-46
 Identities = 101/201 (50%), Positives = 130/201 (64%), Gaps = 1/201 (0%)
 Frame = -2

Query: 1851 KPLAYFSEKLNGAAVRYSTYDKELYALIRTLKHWQHYLWPKEFVVHNDHEALKFLKGQHK 1672
            KP+AYFSEKL+GA + YST DKELYAL                              Q K
Sbjct: 1017 KPIAYFSEKLSGATLNYSTNDKELYAL-----------------------------SQGK 1047

Query: 1671 LNKRHARWMKFIETFPYAIHYKKGKENVVADALSRRYTLLSTLQTKILGFEMLKEMYSSD 1492
            L++RHA+W++FIETFPY I YK+GKENVVADALSRRY L+STL +K+LGF+ +K +Y++D
Sbjct: 1048 LSRRHAKWVEFIETFPYVIAYKQGKENVVADALSRRYVLISTLTSKLLGFDQIKFLYAND 1107

Query: 1491 HYFKEIFEKCLLAPFGKYFLHEGFFYCEGRLCISSCSTRILLVKEAHCGGLMGHFGV-KN 1315
              F EIF +C L PF K+ L + F + E +LC+ +CS R L V+EAHCGGLMGHFGV K 
Sbjct: 1108 SDFGEIFAECKLGPFEKFNLQDEFLFKENKLCVPNCSLRELFVREAHCGGLMGHFGVPKT 1167

Query: 1314 L*NSS*TCFLATFKKRCRKIC 1252
            L   S   +  + +K   K+C
Sbjct: 1168 LEILSEHFYWPSMRKDVEKVC 1188


>gb|AAW28577.1| Putative gag-pol polyprotein, identical [Solanum demissum]
          Length = 1588

 Score =  483 bits (1242), Expect = e-133
 Identities = 236/379 (62%), Positives = 288/379 (75%), Gaps = 2/379 (0%)
 Frame = -3

Query: 1436 FCMKDFFIVKVD--CAYHLVLLEFYLSKKHIVGV*WDILVSKTYEILHEHVFWPHLKRDV 1263
            F ++D F+ K +  C  +  L E ++ + H  G+     V KT EIL EH +WP +++DV
Sbjct: 1125 FNLQDEFLFKENKLCVPNCSLRELFVREAHCGGLMGHFGVPKTLEILSEHFYWPSMRKDV 1184

Query: 1262 ERFVGSCIECKKAKSRTLPHGLYTPLEVPKEPWVDISMDFILGLPRTSKGIDSIFVVVDR 1083
            E+    C+ECK+AKSRTLPHGLYTPL V   PW+DISMDFILGLPRT  G DSIFVVVDR
Sbjct: 1185 EKVCSYCLECKQAKSRTLPHGLYTPLPVSNFPWIDISMDFILGLPRTKYGKDSIFVVVDR 1244

Query: 1082 FSKMAHFIPCRKSDDASHVASLFFTWIVRFHGIPRSIVSDRDTKFLSHFWRVLWKKLGTN 903
            FSKMA FIPC+K++DASHVA LF   +V+ HGIPR+IVSDRD KFLSHFWR+LW KLGT 
Sbjct: 1245 FSKMARFIPCKKTNDASHVADLFVKEVVKLHGIPRTIVSDRDAKFLSHFWRILWGKLGTK 1304

Query: 902  LLYSTACHLQTDGQTEVVNRTIGQLLRAILKKKLRQWEECLPHIEFAYNRAPHSTTSHSP 723
            LL+ST+CH QTDGQTEVVNRT+G +LRAILK KL  WE+ LP +EFAYNR  HS+T  +P
Sbjct: 1305 LLFSTSCHPQTDGQTEVVNRTLGNMLRAILKGKLTSWEDYLPIVEFAYNRTFHSSTGKTP 1364

Query: 722  FQVVYGFNPLTPFDLVPLPIDQALSKDASKKANFMKAIHIKVHEAIVNSNKKLVEKRNVG 543
            F+VVYGFNPLTP DL+PLP +   + D  KKA+ MK IH +   AI   NK++  +RN G
Sbjct: 1365 FEVVYGFNPLTPLDLLPLPTNDFANLDGKKKADMMKKIHEQTRLAIEKKNKEVALRRNKG 1424

Query: 542  RRKVVFKPGD*VWVHFRKERFPQQRKSKLDDRGDGQFQILEKINDNAYKVDLPGHYNVSA 363
            R+ V+FKPGD VWVH RKERFP +RK+KLD RG G +++LE+I DNAYK+DLPG + VSA
Sbjct: 1425 RKYVIFKPGDLVWVHMRKERFPSKRKTKLDPRGSGPYKVLERIGDNAYKLDLPGEFQVSA 1484

Query: 362  TFNVSDLSLFDIGDGDSRT 306
            TFNVSDLS +D  D DSRT
Sbjct: 1485 TFNVSDLSHYD-ADLDSRT 1502



 Score =  192 bits (488), Expect = 4e-46
 Identities = 101/201 (50%), Positives = 130/201 (64%), Gaps = 1/201 (0%)
 Frame = -2

Query: 1851 KPLAYFSEKLNGAAVRYSTYDKELYALIRTLKHWQHYLWPKEFVVHNDHEALKFLKGQHK 1672
            KP+AYFSEKL+GA + YST DKELYAL                              Q K
Sbjct: 1017 KPIAYFSEKLSGATLNYSTNDKELYAL-----------------------------SQGK 1047

Query: 1671 LNKRHARWMKFIETFPYAIHYKKGKENVVADALSRRYTLLSTLQTKILGFEMLKEMYSSD 1492
            L++RHA+W++FIETFPY I YK+GKENVVADALSRRY L+STL +K+LGF+ +K +Y++D
Sbjct: 1048 LSRRHAKWVEFIETFPYVIAYKQGKENVVADALSRRYVLISTLTSKLLGFDQIKFLYAND 1107

Query: 1491 HYFKEIFEKCLLAPFGKYFLHEGFFYCEGRLCISSCSTRILLVKEAHCGGLMGHFGV-KN 1315
              F EIF +C L PF K+ L + F + E +LC+ +CS R L V+EAHCGGLMGHFGV K 
Sbjct: 1108 SDFGEIFAECKLGPFEKFNLQDEFLFKENKLCVPNCSLRELFVREAHCGGLMGHFGVPKT 1167

Query: 1314 L*NSS*TCFLATFKKRCRKIC 1252
            L   S   +  + +K   K+C
Sbjct: 1168 LEILSEHFYWPSMRKDVEKVC 1188


>gb|ABE60891.1| putative polyprotein [Oryza sativa Japonica Group]
          Length = 1713

 Score =  478 bits (1230), Expect = e-132
 Identities = 228/368 (61%), Positives = 279/368 (75%)
 Frame = -3

Query: 1400 CAYHLVLLEFYLSKKHIVGV*WDILVSKTYEILHEHVFWPHLKRDVERFVGSCIECKKAK 1221
            C  H  +    L + H  G+       KTY++L +H +WP ++RDV+R V  C+ C KAK
Sbjct: 1189 CVPHCSVRLLLLQETHAGGLMGHFGWRKTYDMLADHFYWPKMRRDVQRLVQRCVTCHKAK 1248

Query: 1220 SRTLPHGLYTPLEVPKEPWVDISMDFILGLPRTSKGIDSIFVVVDRFSKMAHFIPCRKSD 1041
            S+  PHGLYTPL VP  PW DISMDF+LGLPRT +G DSIFVVVDRFSKMAHFIPC KSD
Sbjct: 1249 SKLNPHGLYTPLPVPSAPWEDISMDFVLGLPRTKRGRDSIFVVVDRFSKMAHFIPCHKSD 1308

Query: 1040 DASHVASLFFTWIVRFHGIPRSIVSDRDTKFLSHFWRVLWKKLGTNLLYSTACHLQTDGQ 861
            DASH+ASLFF+ IVR HG+P++IVSDRDTKFLS+FW+ LW KLGT LL+ST CH QTDGQ
Sbjct: 1309 DASHIASLFFSEIVRLHGMPKTIVSDRDTKFLSYFWKTLWAKLGTRLLFSTTCHPQTDGQ 1368

Query: 860  TEVVNRTIGQLLRAILKKKLRQWEECLPHIEFAYNRAPHSTTSHSPFQVVYGFNPLTPFD 681
            TEVVNRT+  LLRA++KK L++WEECLPH+EFAYNRA HSTT+  PF+VVYGF PL+P D
Sbjct: 1369 TEVVNRTLSMLLRALIKKNLKEWEECLPHVEFAYNRAVHSTTNMCPFEVVYGFKPLSPID 1428

Query: 680  LVPLPIDQALSKDASKKANFMKAIHIKVHEAIVNSNKKLVEKRNVGRRKVVFKPGD*VWV 501
            L+PLP+ +    +ASK+A ++K IH K  EAI   +K      N  R+KV F+PGD VWV
Sbjct: 1429 LLPLPLQERSDMEASKRATYVKKIHEKTKEAIEKRSKYYAAWANKNRKKVTFEPGDLVWV 1488

Query: 500  HFRKERFPQQRKSKLDDRGDGQFQILEKINDNAYKVDLPGHYNVSATFNVSDLSLFDIGD 321
            H RK+RFPQ+RKSKL  RGDG F++L KINDNAYK++LP  Y VS+TFNV+DL+ F  G 
Sbjct: 1489 HLRKDRFPQKRKSKLMPRGDGPFRVLSKINDNAYKIELPEDYGVSSTFNVADLTPF-FGL 1547

Query: 320  GDSRTGRS 297
             DS + RS
Sbjct: 1548 EDSESSRS 1555



 Score =  247 bits (631), Expect = 1e-62
 Identities = 110/178 (61%), Positives = 141/178 (79%), Gaps = 1/178 (0%)
 Frame = -2

Query: 1854 GKPLAYFSEKLNGAAVRYSTYDKELYALIRTLKHWQHYLWPKEFVVHNDHEALKFLKGQH 1675
            G+P+AYFSEKL GA + YS YDKELYAL+R L+ WQHYLWPKEFV+H+DHEALK+LKGQ 
Sbjct: 1036 GQPVAYFSEKLGGAQLNYSVYDKELYALVRALETWQHYLWPKEFVIHSDHEALKYLKGQA 1095

Query: 1674 KLNKRHARWMKFIETFPYAIHYKKGKENVVADALSRRYTLLSTLQTKILGFEMLKEMYSS 1495
            KLN+RHA+W++FIETFPY + YKKGKEN+VADALSR+  LL+ L+ K+ G E +KE+YS+
Sbjct: 1096 KLNRRHAKWVEFIETFPYVVKYKKGKENIVADALSRKNVLLNQLEVKVTGIESIKELYSA 1155

Query: 1494 DHYFKEIFEKCLLAP-FGKYFLHEGFFYCEGRLCISSCSTRILLVKEAHCGGLMGHFG 1324
            D  F E + KC     + KY +H+GF +   +LC+  CS R+LL++E H GGLMGHFG
Sbjct: 1156 DLDFSEPYAKCTAGKGWEKYHIHDGFLFRANKLCVPHCSVRLLLLQETHAGGLMGHFG 1213


>gb|AAT85159.1| unknown protein [Oryza sativa Japonica Group]
            gi|52353557|gb|AAU44123.1| putative polyprotein [Oryza
            sativa Japonica Group]
          Length = 681

 Score =  478 bits (1230), Expect = e-132
 Identities = 228/368 (61%), Positives = 279/368 (75%)
 Frame = -3

Query: 1400 CAYHLVLLEFYLSKKHIVGV*WDILVSKTYEILHEHVFWPHLKRDVERFVGSCIECKKAK 1221
            C  H  +    L + H  G+       KTY++L +H +WP ++RDV+R V  C+ C KAK
Sbjct: 157  CVPHCSVRLLLLQETHAGGLMGHFGWRKTYDMLADHFYWPKMRRDVQRLVQRCVTCHKAK 216

Query: 1220 SRTLPHGLYTPLEVPKEPWVDISMDFILGLPRTSKGIDSIFVVVDRFSKMAHFIPCRKSD 1041
            S+  PHGLYTPL VP  PW DISMDF+LGLPRT +G DSIFVVVDRFSKMAHFIPC KSD
Sbjct: 217  SKLNPHGLYTPLPVPSAPWEDISMDFVLGLPRTKRGRDSIFVVVDRFSKMAHFIPCHKSD 276

Query: 1040 DASHVASLFFTWIVRFHGIPRSIVSDRDTKFLSHFWRVLWKKLGTNLLYSTACHLQTDGQ 861
            DASH+ASLFF+ IVR HG+P++IVSDRDTKFLS+FW+ LW KLGT LL+ST CH QTDGQ
Sbjct: 277  DASHIASLFFSEIVRLHGMPKTIVSDRDTKFLSYFWKTLWAKLGTRLLFSTTCHPQTDGQ 336

Query: 860  TEVVNRTIGQLLRAILKKKLRQWEECLPHIEFAYNRAPHSTTSHSPFQVVYGFNPLTPFD 681
            TEVVNRT+  LLRA++KK L++WEECLPH+EFAYNRA HSTT+  PF+VVYGF PL+P D
Sbjct: 337  TEVVNRTLSMLLRALIKKNLKEWEECLPHVEFAYNRAVHSTTNMCPFEVVYGFKPLSPID 396

Query: 680  LVPLPIDQALSKDASKKANFMKAIHIKVHEAIVNSNKKLVEKRNVGRRKVVFKPGD*VWV 501
            L+PLP+ +    +ASK+A ++K IH K  EAI   +K      N  R+KV F+PGD VWV
Sbjct: 397  LLPLPLQERSDMEASKRATYVKKIHEKTKEAIEKRSKYYAAWANKNRKKVTFEPGDLVWV 456

Query: 500  HFRKERFPQQRKSKLDDRGDGQFQILEKINDNAYKVDLPGHYNVSATFNVSDLSLFDIGD 321
            H RK+RFPQ+RKSKL  RGDG F++L KINDNAYK++LP  Y VS+TFNV+DL+ F  G 
Sbjct: 457  HLRKDRFPQKRKSKLMPRGDGPFRVLSKINDNAYKIELPEDYGVSSTFNVADLTPF-FGL 515

Query: 320  GDSRTGRS 297
             DS + RS
Sbjct: 516  EDSESSRS 523



 Score =  247 bits (631), Expect = 1e-62
 Identities = 110/178 (61%), Positives = 141/178 (79%), Gaps = 1/178 (0%)
 Frame = -2

Query: 1854 GKPLAYFSEKLNGAAVRYSTYDKELYALIRTLKHWQHYLWPKEFVVHNDHEALKFLKGQH 1675
            G+P+AYFSEKL GA + YS YDKELYAL+R L+ WQHYLWPKEFV+H+DHEALK+LKGQ 
Sbjct: 4    GQPVAYFSEKLGGAQLNYSVYDKELYALVRALETWQHYLWPKEFVIHSDHEALKYLKGQA 63

Query: 1674 KLNKRHARWMKFIETFPYAIHYKKGKENVVADALSRRYTLLSTLQTKILGFEMLKEMYSS 1495
            KLN+RHA+W++FIETFPY + YKKGKEN+VADALSR+  LL+ L+ K+ G E +KE+YS+
Sbjct: 64   KLNRRHAKWVEFIETFPYVVKYKKGKENIVADALSRKNVLLNQLEVKVTGIESIKELYSA 123

Query: 1494 DHYFKEIFEKCLLAP-FGKYFLHEGFFYCEGRLCISSCSTRILLVKEAHCGGLMGHFG 1324
            D  F E + KC     + KY +H+GF +   +LC+  CS R+LL++E H GGLMGHFG
Sbjct: 124  DLDFSEPYAKCTAGKGWEKYHIHDGFLFRANKLCVPHCSVRLLLLQETHAGGLMGHFG 181


>gb|EOX92840.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 647

 Score =  328 bits (842), Expect(2) = e-132
 Identities = 162/334 (48%), Positives = 213/334 (63%)
 Frame = -3

Query: 1319 KTYEILHEHVFWPHLKRDVERFVGSCIECKKAKSRTLPHGLYTPLEVPKEPWVDISMDFI 1140
            KT  ++ +  +WP + RDVER V  C  C   K      GLY PL  P  PW+ +SMDF+
Sbjct: 305  KTLAMVADRYYWPKMHRDVERLVKRCSTCLFGKGSAQNTGLYVPLLEPDAPWIHLSMDFV 364

Query: 1139 LGLPRTSKGIDSIFVVVDRFSKMAHFIPCRKSDDASHVASLFFTWIVRFHGIPRSIVSDR 960
            LGLP+ +KG DSIFVVV +FSKMAHFIPC K+ DA+H+A LFF  +VR HGIP SIVSDR
Sbjct: 365  LGLPKIAKGFDSIFVVVYQFSKMAHFIPCFKTSDATHIAELFFCEVVRLHGIPTSIVSDR 424

Query: 959  DTKFLSHFWRVLWKKLGTNLLYSTACHLQTDGQTEVVNRTIGQLLRAILKKKLRQWEECL 780
            D KF+ HFWR LW+K GT L YS+ CH QTDGQTEVVNR++G +LR +++   + W+  +
Sbjct: 425  DVKFMGHFWRTLWRKFGTELKYSSTCHPQTDGQTEVVNRSLGNMLRCLIQNNPKTWDLVI 484

Query: 779  PHIEFAYNRAPHSTTSHSPFQVVYGFNPLTPFDLVPLPIDQALSKDASKKANFMKAIHIK 600
            P  EFAYN + + +   +PF+V YG  P    DLVPLP +  +S +    A  ++ IH +
Sbjct: 485  PQAEFAYNNSVNRSIKKTPFEVAYGLKPQHVLDLVPLPQEARVSNEGELFAYHIRKIHEE 544

Query: 599  VHEAIVNSNKKLVEKRNVGRRKVVFKPGD*VWVHFRKERFPQQRKSKLDDRGDGQFQILE 420
            V  A+  SN +     N  RRK  F+ GD V VH R+ERFP+    KL  R  G  ++++
Sbjct: 545  VKAALKASNAEYSFTANQHRRKQEFEEGDQVLVHLRQERFPKGTYHKLKSRKFGPCKVIK 604

Query: 419  KINDNAYKVDLPGHYNVSATFNVSDLSLFDIGDG 318
            KI+ NAY ++LP    +S  FNV DL  FD  DG
Sbjct: 605  KISSNAYLIELPPELQISPIFNVLDLYPFDGCDG 638



 Score =  171 bits (434), Expect(2) = e-132
 Identities = 85/180 (47%), Positives = 116/180 (64%), Gaps = 3/180 (1%)
 Frame = -2

Query: 1854 GKPLAYFSEKLNGAAVRYSTYDKELYALIRTLKHWQHYLWPKEFVVHNDHEALKFLKGQH 1675
            G+ + +FSEKL  +  RYSTYD E YAL+R ++HWQHYL   EF V++DH+AL++L  Q 
Sbjct: 123  GRSIEFFSEKLTDSRRRYSTYDLEFYALVRAIRHWQHYLAYCEFAVYSDHQALRYLHSQK 182

Query: 1674 KLNKRHARWMKFIETFPYAIHYKKGKENVVADALSRRYTLLSTLQTKILGFEMLKEMYSS 1495
            KL+ +HA+W  F+  F +++ YK G+ N VADALSRR  +LS + T++ GFE LK  YSS
Sbjct: 183  KLSNQHAKWSFFLNEFNFSLKYKSGQSNTVADALSRRCKMLSVMSTQVTGFEELKNQYSS 242

Query: 1494 DHYFKEI---FEKCLLAPFGKYFLHEGFFYCEGRLCISSCSTRILLVKEAHCGGLMGHFG 1324
            D YF +I    +  L A    Y LHE + +   +LCI   S R  ++ E H  GL GHFG
Sbjct: 243  DSYFSKIIADLQGSLQAGNLPYRLHEDYLFKGNQLCILEGSLREQIIGELHGNGLGGHFG 302


>ref|NP_001063540.1| Os09g0491900 [Oryza sativa Japonica Group]
            gi|113631773|dbj|BAF25454.1| Os09g0491900 [Oryza sativa
            Japonica Group]
          Length = 681

 Score =  475 bits (1223), Expect = e-131
 Identities = 228/368 (61%), Positives = 276/368 (75%)
 Frame = -3

Query: 1400 CAYHLVLLEFYLSKKHIVGV*WDILVSKTYEILHEHVFWPHLKRDVERFVGSCIECKKAK 1221
            C  H  +    L + H  G+       KTY++L +H +WP ++RDV+R V  C+ C KAK
Sbjct: 157  CVPHCSVRLLLLQETHAGGLMGHFGWRKTYDMLADHFYWPKMRRDVQRLVQRCVTCHKAK 216

Query: 1220 SRTLPHGLYTPLEVPKEPWVDISMDFILGLPRTSKGIDSIFVVVDRFSKMAHFIPCRKSD 1041
            S+  PHGLYTPL VP  PW DISMDF+LGLPRT +G DSIFVVVDRFSKMAHFIPC KSD
Sbjct: 217  SKLNPHGLYTPLPVPSAPWEDISMDFVLGLPRTKRGRDSIFVVVDRFSKMAHFIPCHKSD 276

Query: 1040 DASHVASLFFTWIVRFHGIPRSIVSDRDTKFLSHFWRVLWKKLGTNLLYSTACHLQTDGQ 861
            DASH+ASLFF+ IVR HG+P++IVSDRDTKFLS+FW+ LW KLGT LL+ST CH QTDGQ
Sbjct: 277  DASHIASLFFSEIVRLHGMPKTIVSDRDTKFLSYFWKTLWAKLGTRLLFSTTCHPQTDGQ 336

Query: 860  TEVVNRTIGQLLRAILKKKLRQWEECLPHIEFAYNRAPHSTTSHSPFQVVYGFNPLTPFD 681
            TEVVNRT+  LLRA++KK L++WEECLPH+EFAYNRA HSTT+  PF+VVYGF PL P D
Sbjct: 337  TEVVNRTLSMLLRALIKKNLKEWEECLPHVEFAYNRAVHSTTNMCPFEVVYGFKPLAPID 396

Query: 680  LVPLPIDQALSKDASKKANFMKAIHIKVHEAIVNSNKKLVEKRNVGRRKVVFKPGD*VWV 501
            L+PLP+ +    +ASK A ++K IH K  EAI   +K      N  R+KV F+PGD VWV
Sbjct: 397  LLPLPLQERSDMEASKHATYVKKIHEKTKEAIEKRSKYYAAWANKDRKKVTFEPGDLVWV 456

Query: 500  HFRKERFPQQRKSKLDDRGDGQFQILEKINDNAYKVDLPGHYNVSATFNVSDLSLFDIGD 321
            H RK+RFPQ+RKSKL  RGDG F++L KINDNAYK++LP  Y VS TFNV+DL+ F  G 
Sbjct: 457  HLRKDRFPQKRKSKLMPRGDGPFRVLSKINDNAYKIELPEDYGVSPTFNVADLTPF-FGL 515

Query: 320  GDSRTGRS 297
             DS + RS
Sbjct: 516  EDSESSRS 523



 Score =  244 bits (624), Expect = 7e-62
 Identities = 109/178 (61%), Positives = 140/178 (78%), Gaps = 1/178 (0%)
 Frame = -2

Query: 1854 GKPLAYFSEKLNGAAVRYSTYDKELYALIRTLKHWQHYLWPKEFVVHNDHEALKFLKGQH 1675
            G+P+AYFSEKL GA + YS YDKELYAL+R L+ WQHYLWPKEFV+H+DHEALK+LKGQ 
Sbjct: 4    GQPVAYFSEKLGGAQLNYSVYDKELYALVRALETWQHYLWPKEFVIHSDHEALKYLKGQA 63

Query: 1674 KLNKRHARWMKFIETFPYAIHYKKGKENVVADALSRRYTLLSTLQTKILGFEMLKEMYSS 1495
            KLN+RHA+W++FIETFPY + YKKGKEN+VADALSR+  LL+ L+ K+ G E +KE+Y +
Sbjct: 64   KLNRRHAKWVEFIETFPYVVKYKKGKENIVADALSRKNVLLNQLEVKVPGIESIKELYPA 123

Query: 1494 DHYFKEIFEKCLLAP-FGKYFLHEGFFYCEGRLCISSCSTRILLVKEAHCGGLMGHFG 1324
            D  F E + KC     + KY +H+GF +   +LC+  CS R+LL++E H GGLMGHFG
Sbjct: 124  DLDFSEPYAKCTAGKGWEKYHIHDGFLFRANKLCVPHCSVRLLLLQETHAGGLMGHFG 181


>gb|AAW28576.2| Gag-pol polyprotein, putative [Solanum demissum]
          Length = 1096

 Score =  466 bits (1200), Expect = e-128
 Identities = 233/379 (61%), Positives = 282/379 (74%), Gaps = 2/379 (0%)
 Frame = -3

Query: 1436 FCMKDFFIVKVD--CAYHLVLLEFYLSKKHIVGV*WDILVSKTYEILHEHVFWPHLKRDV 1263
            F ++D F+ K +  C  +  L E ++ + H  G+     V KT EIL EH +WP +++DV
Sbjct: 643  FNLQDEFLFKENKLCVPNCSLRELFVREAHCGGLMGHFGVPKTLEILSEHFYWPSMRKDV 702

Query: 1262 ERFVGSCIECKKAKSRTLPHGLYTPLEVPKEPWVDISMDFILGLPRTSKGIDSIFVVVDR 1083
            E          KAKSRTLPHGLYTPL V   PW+DISMDFILGLPRT  G DSIFVVVDR
Sbjct: 703  E----------KAKSRTLPHGLYTPLPVSNSPWIDISMDFILGLPRTKYGKDSIFVVVDR 752

Query: 1082 FSKMAHFIPCRKSDDASHVASLFFTWIVRFHGIPRSIVSDRDTKFLSHFWRVLWKKLGTN 903
            FSKMA FIPC+K++DASHVA LF   +V+ HGIPR+IVSDRD KFLSHFWR+LW KLGT 
Sbjct: 753  FSKMARFIPCKKTNDASHVADLFVKEVVKLHGIPRTIVSDRDAKFLSHFWRILWGKLGTK 812

Query: 902  LLYSTACHLQTDGQTEVVNRTIGQLLRAILKKKLRQWEECLPHIEFAYNRAPHSTTSHSP 723
            LL+ST+CH QTDGQTEVVNRT+G +LRAILK KL  WE+ LP +EFAYNR  HS+T  +P
Sbjct: 813  LLFSTSCHPQTDGQTEVVNRTLGNMLRAILKGKLTSWEDYLPIVEFAYNRTFHSSTGKTP 872

Query: 722  FQVVYGFNPLTPFDLVPLPIDQALSKDASKKANFMKAIHIKVHEAIVNSNKKLVEKRNVG 543
            F+VVYGFNPLTP DL+PLP +   + D  KKA+ MK IH +   AI   NK++  +RN G
Sbjct: 873  FEVVYGFNPLTPLDLLPLPTNDFANLDGKKKADMMKKIHEQTRLAIEKKNKEVALRRNKG 932

Query: 542  RRKVVFKPGD*VWVHFRKERFPQQRKSKLDDRGDGQFQILEKINDNAYKVDLPGHYNVSA 363
            R+ V+FKPGD VWVH RKERFP +RK+KLD RG G +++LE+I DNAYK+DLPG + VSA
Sbjct: 933  RKYVIFKPGDLVWVHMRKERFPSKRKTKLDPRGSGPYKVLERIGDNAYKLDLPGEFQVSA 992

Query: 362  TFNVSDLSLFDIGDGDSRT 306
            TFNVSDLS +D  D DSRT
Sbjct: 993  TFNVSDLSHYD-ADLDSRT 1010



 Score = 65.9 bits (159), Expect = 6e-08
 Identities = 33/77 (42%), Positives = 45/77 (58%)
 Frame = -2

Query: 1551 STLQTKILGFEMLKEMYSSDHYFKEIFEKCLLAPFGKYFLHEGFFYCEGRLCISSCSTRI 1372
            S  Q    G E  K+ +       E+ +   L+PF K+ L + F + E +LC+ +CS R 
Sbjct: 606  SNWQKNWRGEERRKQFWRCAKSSFELLDINQLSPFEKFNLQDEFLFKENKLCVPNCSLRE 665

Query: 1371 LLVKEAHCGGLMGHFGV 1321
            L V+EAHCGGLMGHFGV
Sbjct: 666  LFVREAHCGGLMGHFGV 682


>gb|EOY01158.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 786

 Score =  303 bits (775), Expect(2) = e-127
 Identities = 146/298 (48%), Positives = 193/298 (64%)
 Frame = -3

Query: 1319 KTYEILHEHVFWPHLKRDVERFVGSCIECKKAKSRTLPHGLYTPLEVPKEPWVDISMDFI 1140
            KT  ++ +  +WP ++RDVER V  C  C   K      GLY PL  P  PW+ +SMDF+
Sbjct: 489  KTLAMVADRYYWPKMRRDVERLVKRCPACLFGKGSAQNTGLYVPLPEPDAPWIHLSMDFV 548

Query: 1139 LGLPRTSKGIDSIFVVVDRFSKMAHFIPCRKSDDASHVASLFFTWIVRFHGIPRSIVSDR 960
            LGLP+T+KG DSIFVVVDRFSKMAHFIPC ++ +A+H+A LFF  IVR HGIP SIVSDR
Sbjct: 549  LGLPKTAKGFDSIFVVVDRFSKMAHFIPCFRTSNATHIAELFFREIVRLHGIPTSIVSDR 608

Query: 959  DTKFLSHFWRVLWKKLGTNLLYSTACHLQTDGQTEVVNRTIGQLLRAILKKKLRQWEECL 780
            D KF+ HFWR LW+K GT L YS+ CH QTDGQTEVVNR++G +LR +++   + W+  +
Sbjct: 609  DVKFMGHFWRTLWRKFGTELKYSSTCHPQTDGQTEVVNRSLGNMLRCLIQNNPKTWDLVI 668

Query: 779  PHIEFAYNRAPHSTTSHSPFQVVYGFNPLTPFDLVPLPIDQALSKDASKKANFMKAIHIK 600
            P  EFAYN + + +   +PF+  YG  P    DLVPLP +  +S +    A+ ++ IH +
Sbjct: 669  PQAEFAYNNSVNRSIKKTPFEAAYGLKPQHVLDLVPLPQEARVSNEGELFADHIRKIHEE 728

Query: 599  VHEAIVNSNKKLVEKRNVGRRKVVFKPGD*VWVHFRKERFPQQRKSKLDDRGDGQFQI 426
            V  A+  SN +     N  RRK  F+ GD V VH R+ERFP+    KL  R  G  ++
Sbjct: 729  VKAALKASNAEYSFTANQHRRKQEFEEGDQVLVHLRQERFPKGTYHKLKSRKFGPCKV 786



 Score =  180 bits (456), Expect(2) = e-127
 Identities = 86/180 (47%), Positives = 119/180 (66%), Gaps = 3/180 (1%)
 Frame = -2

Query: 1854 GKPLAYFSEKLNGAAVRYSTYDKELYALIRTLKHWQHYLWPKEFVVHNDHEALKFLKGQH 1675
            G+P+ +FSEKL  +  RYSTYD E YAL+R ++HWQHYL  +EF V++DH+AL++L  Q 
Sbjct: 307  GRPIEFFSEKLTDSRRRYSTYDLEFYALVRAIRHWQHYLAYREFAVYSDHQALRYLHSQK 366

Query: 1674 KLNKRHARWMKFIETFPYAIHYKKGKENVVADALSRRYTLLSTLQTKILGFEMLKEMYSS 1495
            KL+ +HA+W  F+  F +++ YK G+ N VADALSRR  +LS + T++ GFE LK  YSS
Sbjct: 367  KLSNQHAKWSSFLNEFNFSLKYKSGQSNTVADALSRRCKMLSVMSTQVTGFEELKNQYSS 426

Query: 1494 DHYFKEI---FEKCLLAPFGKYFLHEGFFYCEGRLCISSCSTRILLVKEAHCGGLMGHFG 1324
            D YF +I    +  L A    Y LHE + +   +LCI   S R  +++E H  GL GHFG
Sbjct: 427  DSYFSKIIADLQGSLQAENLPYRLHEDYLFKGNQLCIPEGSLREQIIRELHGNGLGGHFG 486


>emb|CAN84067.1| hypothetical protein VITISV_041979 [Vitis vinifera]
          Length = 364

 Score =  237 bits (604), Expect(2) = e-124
 Identities = 107/178 (60%), Positives = 138/178 (77%)
 Frame = -2

Query: 1851 KPLAYFSEKLNGAAVRYSTYDKELYALIRTLKHWQHYLWPKEFVVHNDHEALKFLKGQHK 1672
            +P  YFSEKLNGA + Y TYDKELYAL+R L+ WQHYLWPKEFV+H DHE+LK LKGQ K
Sbjct: 5    RPTTYFSEKLNGATLNYPTYDKELYALVRALETWQHYLWPKEFVIHTDHESLKHLKGQGK 64

Query: 1671 LNKRHARWMKFIETFPYAIHYKKGKENVVADALSRRYTLLSTLQTKILGFEMLKEMYSSD 1492
            LN+RHA+W++FIETFPY I YK+ KEN+VADALSRRY L+STL  K+LGFE +KE+Y++D
Sbjct: 65   LNRRHAKWVEFIETFPYVIKYKQCKENIVADALSRRYALVSTLNAKLLGFEYVKELYAND 124

Query: 1491 HYFKEIFEKCLLAPFGKYFLHEGFFYCEGRLCISSCSTRILLVKEAHCGGLMGHFGVK 1318
            + F  ++  C    FGK++  +G+ + +  LC+ + S   LLV+EAH G  MGHFGV+
Sbjct: 125  NDFASVYGACEKTAFGKFYRLDGYLFRKNILCVPNSSMCELLVREAHGGDXMGHFGVR 182



 Score =  236 bits (603), Expect(2) = e-124
 Identities = 108/161 (67%), Positives = 129/161 (80%)
 Frame = -3

Query: 1325 VSKTYEILHEHVFWPHLKRDVERFVGSCIECKKAKSRTLPHGLYTPLEVPKEPWVDISMD 1146
            V KT ++LHEH FWP +KRDVER    CI C++ KSR LPHGLYT L VP  PWV+I MD
Sbjct: 181  VRKTLDVLHEHFFWPKMKRDVERACARCITCRRTKSRVLPHGLYTLLPVPSAPWVNIYMD 240

Query: 1145 FILGLPRTSKGIDSIFVVVDRFSKMAHFIPCRKSDDASHVASLFFTWIVRFHGIPRSIVS 966
            F+LGLPR+  G DSIFVVVDRFSKM HFI C K++DA+H+A+LFF  IV  +G+P+SIVS
Sbjct: 241  FVLGLPRSRNGRDSIFVVVDRFSKMTHFISCHKTNDATHIANLFFREIVWLYGVPKSIVS 300

Query: 965  DRDTKFLSHFWRVLWKKLGTNLLYSTACHLQTDGQTEVVNR 843
            DRD KFL +FW+VLW+KLGT LL+ST CH QTDGQTEV+ R
Sbjct: 301  DRDVKFLRYFWKVLWRKLGTKLLFSTTCHPQTDGQTEVMIR 341


>emb|CAN77900.1| hypothetical protein VITISV_037350 [Vitis vinifera]
          Length = 1173

 Score =  450 bits (1158), Expect = e-124
 Identities = 223/365 (61%), Positives = 267/365 (73%)
 Frame = -3

Query: 1400 CAYHLVLLEFYLSKKHIVGV*WDILVSKTYEILHEHVFWPHLKRDVERFVGSCIECKKAK 1221
            C  +  + E  + + H  G+     V KT ++LHEH+FWP +KRDVER    CI  + AK
Sbjct: 778  CVPNSSMRELLVREAHEGGLMGHFGVRKTLDVLHEHIFWPKMKRDVERACARCITYRHAK 837

Query: 1220 SRTLPHGLYTPLEVPKEPWVDISMDFILGLPRTSKGIDSIFVVVDRFSKMAHFIPCRKSD 1041
            S+ LPHGLYT L VP  PWVDISMDF+LGL R+  G DSIFVVVDRFSKM HFI C K+D
Sbjct: 838  SKVLPHGLYTTLLVPSAPWVDISMDFVLGLLRSRNGRDSIFVVVDRFSKMTHFISCHKTD 897

Query: 1040 DASHVASLFFTWIVRFHGIPRSIVSDRDTKFLSHFWRVLWKKLGTNLLYSTACHLQTDGQ 861
            DA+H+A+LFF  IVR HGIPRSIVSDRD KFLS FW+VLW KLGT LL+ST CH QTDGQ
Sbjct: 898  DATHIANLFFRKIVRLHGIPRSIVSDRDVKFLSCFWKVLWGKLGTKLLFSTTCHPQTDGQ 957

Query: 860  TEVVNRTIGQLLRAILKKKLRQWEECLPHIEFAYNRAPHSTTSHSPFQVVYGFNPLTPFD 681
             EVVNRT+  LLR I++K L+ WE+CLP  EFAYNR+ HSTTS SPF++VYGFNPLTP D
Sbjct: 958  IEVVNRTLSTLLRTIIQKNLKNWEDCLPFTEFAYNRSVHSTTSFSPFEIVYGFNPLTPLD 1017

Query: 680  LVPLPIDQALSKDASKKANFMKAIHIKVHEAIVNSNKKLVEKRNVGRRKVVFKPGD*VWV 501
            L+PLP+++  S D  K                   N++ V K N GRR+V+F+ GD VWV
Sbjct: 1018 LLPLPVNEMTSLDEKK-------------------NEQYVTKANKGRRQVLFESGDWVWV 1058

Query: 500  HFRKERFPQQRKSKLDDRGDGQFQILEKINDNAYKVDLPGHYNVSATFNVSDLSLFDIGD 321
            H RKERFP +R+SKL  RGDG FQ+LE+INDNAYK+DL G YN+SATF VSDLS F++GD
Sbjct: 1059 HMRKERFPTRRQSKLHPRGDGPFQVLERINDNAYKLDLLGEYNISATFKVSDLSPFNVGD 1118

Query: 320  GDSRT 306
             DSRT
Sbjct: 1119 -DSRT 1122



 Score =  186 bits (472), Expect = 3e-44
 Identities = 90/172 (52%), Positives = 122/172 (70%), Gaps = 4/172 (2%)
 Frame = -2

Query: 1821 NGAAVRYSTYDKELYALIRTL----KHWQHYLWPKEFVVHNDHEALKFLKGQHKLNKRHA 1654
            +G A  Y  + K+   L+  L    K +  + W K FV+H DHE+LK+LKGQ KLN+RHA
Sbjct: 634  HGLASFYRRFVKDFSTLVAPLTEIVKKFVGFKWGK-FVIHTDHESLKYLKGQGKLNRRHA 692

Query: 1653 RWMKFIETFPYAIHYKKGKENVVADALSRRYTLLSTLQTKILGFEMLKEMYSSDHYFKEI 1474
            +W++FIETFPY I YK+GKEN+V DALSRRY L+STL  K+LGFE +KE+Y++D  F  +
Sbjct: 693  KWVEFIETFPYVIKYKQGKENIVVDALSRRYALVSTLNAKLLGFEYVKELYANDDDFASV 752

Query: 1473 FEKCLLAPFGKYFLHEGFFYCEGRLCISSCSTRILLVKEAHCGGLMGHFGVK 1318
            +  C    FGK++  +G+ + E RLC+ + S R LLV+EAH GGLMGHFGV+
Sbjct: 753  YGACEKVAFGKFYRLDGYLFRENRLCVPNSSMRELLVREAHEGGLMGHFGVR 804


>gb|AAF79348.1|AC007887_7 F15O4.13 [Arabidopsis thaliana]
          Length = 1887

 Score =  449 bits (1156), Expect = e-123
 Identities = 208/365 (56%), Positives = 270/365 (73%)
 Frame = -3

Query: 1400 CAYHLVLLEFYLSKKHIVGV*WDILVSKTYEILHEHVFWPHLKRDVERFVGSCIECKKAK 1221
            C  +  L E ++ + H  G+     VSKT +++ +H  WPH+KRDVER    C  CK+AK
Sbjct: 1377 CIPNSSLRELFIREAHGGGLMGHFGVSKTIKVMQDHFHWPHMKRDVERICERCPTCKQAK 1436

Query: 1220 SRTLPHGLYTPLEVPKEPWVDISMDFILGLPRTSKGIDSIFVVVDRFSKMAHFIPCRKSD 1041
            +++ PHGLYTPL +P  PW DISMDF++GLPRT  G DSIFVVVDRFSKMAHFIPC K+D
Sbjct: 1437 AKSQPHGLYTPLPIPSHPWNDISMDFVVGLPRTRTGKDSIFVVVDRFSKMAHFIPCHKTD 1496

Query: 1040 DASHVASLFFTWIVRFHGIPRSIVSDRDTKFLSHFWRVLWKKLGTNLLYSTACHLQTDGQ 861
            DA H+A+LFF  +VR HG+P++IVSDRDTKFLS+FW+ LW KLGT LL+ST CH QTDGQ
Sbjct: 1497 DAIHIANLFFREVVRLHGMPKTIVSDRDTKFLSYFWKTLWSKLGTKLLFSTTCHPQTDGQ 1556

Query: 860  TEVVNRTIGQLLRAILKKKLRQWEECLPHIEFAYNRAPHSTTSHSPFQVVYGFNPLTPFD 681
            TEVVNRT+  LLRA++KK L+ WE+CLPH+EFAYN + HS +  SPFQ+VYGFNP TP D
Sbjct: 1557 TEVVNRTLSTLLRALIKKNLKTWEDCLPHVEFAYNHSMHSASKFSPFQIVYGFNPTTPLD 1616

Query: 680  LVPLPIDQALSKDASKKANFMKAIHIKVHEAIVNSNKKLVEKRNVGRRKVVFKPGD*VWV 501
            L+PLP+ + +S D  KKA  ++ IH +  + I    K+  +  N  R++V+F  GD VW+
Sbjct: 1617 LMPLPLSERVSLDGKKKAELVQQIHEQAKKNIEEKTKQYAKHANKSRKEVIFNEGDLVWI 1676

Query: 500  HFRKERFPQQRKSKLDDRGDGQFQILEKINDNAYKVDLPGHYNVSATFNVSDLSLFDIGD 321
            H RKERFP++RKSKL  R DG F++L++IN+NAY +DL G YNVS +FNV+DL  F   +
Sbjct: 1677 HLRKERFPKERKSKLMSRIDGPFKVLKRINNNAYSLDLQGKYNVSNSFNVADLFPFIADN 1736

Query: 320  GDSRT 306
             D R+
Sbjct: 1737 TDLRS 1741



 Score =  256 bits (653), Expect = 3e-65
 Identities = 117/177 (66%), Positives = 142/177 (80%)
 Frame = -2

Query: 1851 KPLAYFSEKLNGAAVRYSTYDKELYALIRTLKHWQHYLWPKEFVVHNDHEALKFLKGQHK 1672
            KP+AYFSEKL GA + Y TYDKELYAL+R L+  QHYLWPKEFV+H DHE+LK LKGQ K
Sbjct: 1226 KPIAYFSEKLGGATLNYPTYDKELYALVRALQTGQHYLWPKEFVIHTDHESLKHLKGQQK 1285

Query: 1671 LNKRHARWMKFIETFPYAIHYKKGKENVVADALSRRYTLLSTLQTKILGFEMLKEMYSSD 1492
            LNKRHARW++FIETFPY I YKKGK+NVVADALSRRY LLS+L  K+LGFE +K +Y++D
Sbjct: 1286 LNKRHARWVEFIETFPYVIKYKKGKDNVVADALSRRYVLLSSLDAKLLGFEHIKSLYAND 1345

Query: 1491 HYFKEIFEKCLLAPFGKYFLHEGFFYCEGRLCISSCSTRILLVKEAHCGGLMGHFGV 1321
              F++I+  C    FGKY+ H+GF + + RLCI + S R L ++EAH GGLMGHFGV
Sbjct: 1346 SDFEKIYSSCEKFAFGKYYRHDGFLFYDNRLCIPNSSLRELFIREAHGGGLMGHFGV 1402


>gb|AAK51582.1|AC022352_18 Putative retroelement [Oryza sativa Japonica Group]
            gi|31431012|gb|AAP52850.1| retrotransposon protein,
            putative, Ty3-gypsy subclass [Oryza sativa Japonica
            Group]
          Length = 2447

 Score =  449 bits (1156), Expect = e-123
 Identities = 221/386 (57%), Positives = 269/386 (69%), Gaps = 2/386 (0%)
 Frame = -3

Query: 1442 NIFCMKDFFIVKVD--CAYHLVLLEFYLSKKHIVGV*WDILVSKTYEILHEHVFWPHLKR 1269
            N F + D F+ + +  C     +    L + H  G+       KT++IL  H FWP ++R
Sbjct: 1170 NKFVINDGFVFRANKLCIPASSVRLLLLQEAHGGGLMGHFGAKKTHDILASHFFWPQMRR 1229

Query: 1268 DVERFVGSCIECKKAKSRTLPHGLYTPLEVPKEPWVDISMDFILGLPRTSKGIDSIFVVV 1089
            DV RFV  C  C+KAKSR  PHGLY PL VP  PW DISMDF+LGLPRT +G DSIFVVV
Sbjct: 1230 DVGRFVARCATCQKAKSRLHPHGLYMPLPVPTVPWEDISMDFVLGLPRTKRGRDSIFVVV 1289

Query: 1088 DRFSKMAHFIPCRKSDDASHVASLFFTWIVRFHGIPRSIVSDRDTKFLSHFWRVLWKKLG 909
            DRFSKMAHFIPC K+DDASH+A LFF  IVR HG+P +IVSDRDTKFLSHFWR LW KLG
Sbjct: 1290 DRFSKMAHFIPCHKTDDASHIADLFFREIVRLHGVPNTIVSDRDTKFLSHFWRTLWAKLG 1349

Query: 908  TNLLYSTACHLQTDGQTEVVNRTIGQLLRAILKKKLRQWEECLPHIEFAYNRAPHSTTSH 729
            T LL+ST CH QTDGQTEVVNRT+  +LRA+LKK ++ WEECLPHIEFAYNR+ HSTT  
Sbjct: 1350 TKLLFSTTCHPQTDGQTEVVNRTLSTMLRAVLKKNIKMWEECLPHIEFAYNRSLHSTTKM 1409

Query: 728  SPFQVVYGFNPLTPFDLVPLPIDQALSKDASKKANFMKAIHIKVHEAIVNSNKKLVEKRN 549
             PFQ+VYG  P  P DL+PLP  + L+ DA ++A  M  +H    E I   N K     +
Sbjct: 1410 CPFQIVYGLLPRAPIDLMPLPSSEKLNFDAKQRAELMLKLHETTKENIERMNAKYKFAGD 1469

Query: 548  VGRRKVVFKPGD*VWVHFRKERFPQQRKSKLDDRGDGQFQILEKINDNAYKVDLPGHYNV 369
             GRR++ F+PGD VW+H RKERFP  RKSKL  R DG F++L KIN+NAYK+DLP  + V
Sbjct: 1470 KGRRELTFEPGDLVWLHLRKERFPDLRKSKLMPRADGPFKVLAKINENAYKIDLPADFGV 1529

Query: 368  SATFNVSDLSLFDIGDGDSRTGRSCE 291
            S TFNV+DL  + +G+ D    R+ +
Sbjct: 1530 SPTFNVADLKPY-LGEEDELESRTTQ 1554



 Score =  234 bits (598), Expect = 7e-59
 Identities = 106/180 (58%), Positives = 141/180 (78%), Gaps = 1/180 (0%)
 Frame = -2

Query: 1854 GKPLAYFSEKLNGAAVRYSTYDKELYALIRTLKHWQHYLWPKEFVVHNDHEALKFLKGQH 1675
            GKP+AYFSEKL+G  + YSTYDKELYAL+RTL+ WQHYLWPKEFV+H+DHE+LK ++ Q 
Sbjct: 1033 GKPVAYFSEKLSGPVLNYSTYDKELYALVRTLETWQHYLWPKEFVIHSDHESLKHIRSQG 1092

Query: 1674 KLNKRHARWMKFIETFPYAIHYKKGKENVVADALSRRYTLLSTLQTKILGFEMLKEMYSS 1495
            KLN+RHA+W++FIE+FPY I +KKGKEN++ADALSRRYTLL+ L  KI G E +K+ Y+ 
Sbjct: 1093 KLNRRHAKWVEFIESFPYVIKHKKGKENIIADALSRRYTLLTQLDYKIFGLETIKDQYAH 1152

Query: 1494 DHYFKEIFEKCLLA-PFGKYFLHEGFFYCEGRLCISSCSTRILLVKEAHCGGLMGHFGVK 1318
            D  F ++   C     + K+ +++GF +   +LCI + S R+LL++EAH GGLMGHFG K
Sbjct: 1153 DADFNDVLLHCKDGRTWNKFVINDGFVFRANKLCIPASSVRLLLLQEAHGGGLMGHFGAK 1212



 Score =  206 bits (523), Expect = 4e-50
 Identities = 123/331 (37%), Positives = 186/331 (56%), Gaps = 4/331 (1%)
 Frame = -3

Query: 1322 SKTYEILHEHVFWPHLKRDVERFVGSCIECKKAKSR-TLPHGLYTPLEVPKEPWVDISMD 1146
            +K Y  L E  +W  +KR++  FV  C  C++ K+    P GL  PL+VP+  W +I MD
Sbjct: 2033 TKMYLDLKEKYWWVSMKREIAEFVALCDVCQRVKAEHQRPAGLLQPLQVPEWKWDEIGMD 2092

Query: 1145 FILGLPRTSKGIDSIFVVVDRFSKMAHFIPCRKSDDASHVASLFFTWIVRFHGIPRSIVS 966
            FI GLP+T  G DSI+VVVDR +K+A FIP + +   + +A L+F  IV  HG+P+ IVS
Sbjct: 2093 FITGLPKTQGGYDSIWVVVDRLTKVARFIPVKTTYGGNKLAELYFARIVSLHGVPKKIVS 2152

Query: 965  DRDTKFLSHFWRVLWKKLGTNLLYSTACHLQTDGQTEVVNRTIGQLLRAILKKKLRQWEE 786
            DR+++F SHFW+ L ++LGT L +STA H QTDGQTE +N+ +  +L A +    + W++
Sbjct: 2153 DRESQFTSHFWKKLQEELGTRLNFSTAYHPQTDGQTERLNQILEDMLHACVLDFGKTWDK 2212

Query: 785  CLPHIEFAYNRAPHSTTSHSPFQVVYGFNPLTPFDLVPLPIDQALSKDASKKANFMKAIH 606
             LP+ EF+YN +  ++   +P++ +YG    TP     +   Q    D  ++A   K   
Sbjct: 2213 SLPYAEFSYNNSYQASIQMAPYEALYGRKCRTPLLWDQVGESQVFGTDILREAE-AKVRT 2271

Query: 605  IKVHEAIVNSNKKLVEKRNVGRRKVVFKPGD*VWVHFRKERFPQ--QRKSKLDDRGDGQF 432
            I  +  +  S +K        RR + F   D V++     R     Q K KL  R  G F
Sbjct: 2272 IWDNLKVAQSRQKSYADNR--RRNLEFAVDDFVYLRVTPLRGVHRFQTKGKLAPRFVGPF 2329

Query: 431  QILEKINDNAYKVDLPGHY-NVSATFNVSDL 342
            +I+ +  + AY+++LP    NV   F+VS L
Sbjct: 2330 RIIARRGEVAYQLELPASLGNVHDVFHVSQL 2360



 Score = 85.1 bits (209), Expect = 9e-14
 Identities = 42/97 (43%), Positives = 59/97 (60%)
 Frame = -2

Query: 1854 GKPLAYFSEKLNGAAVRYSTYDKELYALIRTLKHWQHYLWPKEFVVHNDHEALKFLKGQH 1675
            G  +AY S +L      Y T+D EL A++  LK W+HYL      ++ DH++LK++  Q 
Sbjct: 1825 GHVVAYASRQLWPHEGNYPTHDLELAAVVHALKIWRHYLIGNRCEIYTDHKSLKYIFTQS 1884

Query: 1674 KLNKRHARWMKFIETFPYAIHYKKGKENVVADALSRR 1564
             LN R  RW++ I+ +   IHY  GK NVVADALSR+
Sbjct: 1885 DLNLRQRRWLELIKDYDVGIHYHPGKANVVADALSRK 1921


>gb|AAQ56407.1| putative gag-pol polyprotein [Oryza sativa Japonica Group]
          Length = 1619

 Score =  449 bits (1156), Expect = e-123
 Identities = 217/367 (59%), Positives = 266/367 (72%)
 Frame = -3

Query: 1391 HLVLLEFYLSKKHIVGV*WDILVSKTYEILHEHVFWPHLKRDVERFVGSCIECKKAKSRT 1212
            H++LL+    + H  G+     V KT +IL +H+FWP ++RDVERFV  C  C+KAKSR 
Sbjct: 1117 HMLLLQ----EAHGGGLMGHFGVKKTEDILADHLFWPKMRRDVERFVARCTTCQKAKSRL 1172

Query: 1211 LPHGLYTPLEVPKEPWVDISMDFILGLPRTSKGIDSIFVVVDRFSKMAHFIPCRKSDDAS 1032
             PHGLY PL VP  PW DISMDF+LGLPRT KG DSIFVVVDRFSKMAHFIPC KSDDA+
Sbjct: 1173 NPHGLYMPLPVPSVPWEDISMDFVLGLPRTKKGRDSIFVVVDRFSKMAHFIPCHKSDDAT 1232

Query: 1031 HVASLFFTWIVRFHGIPRSIVSDRDTKFLSHFWRVLWKKLGTNLLYSTACHLQTDGQTEV 852
            HVA LFF  IVR HG+P +IVSDRDTKFLSHFWR LW KLGT LL+ST CH QTDGQTEV
Sbjct: 1233 HVADLFFREIVRLHGVPNTIVSDRDTKFLSHFWRTLWAKLGTKLLFSTTCHPQTDGQTEV 1292

Query: 851  VNRTIGQLLRAILKKKLRQWEECLPHIEFAYNRAPHSTTSHSPFQVVYGFNPLTPFDLVP 672
            VNRT+  +LRA+LKK ++ WEECLPH+EFAYNR+ HSTT   PF++VYG  P  P DL+P
Sbjct: 1293 VNRTVSTMLRAVLKKNIKMWEECLPHVEFAYNRSQHSTTKKCPFEIVYGLLPRAPIDLLP 1352

Query: 671  LPIDQALSKDASKKANFMKAIHIKVHEAIVNSNKKLVEKRNVGRRKVVFKPGD*VWVHFR 492
            LP  + ++ DA  +A  M  +H    E I   N K     + G++ V F+PGD VW+H R
Sbjct: 1353 LPTLERVNFDAKYRAELMLKLHETTKENIERMNIKYKLAGSKGKKHVAFEPGDLVWLHLR 1412

Query: 491  KERFPQQRKSKLDDRGDGQFQILEKINDNAYKVDLPGHYNVSATFNVSDLSLFDIGDGDS 312
            K+RFP  RKSKL  R DG FQ+L+KINDNAYK++LP  + VS TFN++DL  + +G+ D 
Sbjct: 1413 KDRFPNLRKSKLPPRADGPFQVLQKINDNAYKLELPADFGVSPTFNIADLKPY-LGEEDE 1471

Query: 311  RTGRSCE 291
               R+ +
Sbjct: 1472 LESRTTQ 1478



 Score =  232 bits (591), Expect = 5e-58
 Identities = 106/180 (58%), Positives = 139/180 (77%), Gaps = 1/180 (0%)
 Frame = -2

Query: 1854 GKPLAYFSEKLNGAAVRYSTYDKELYALIRTLKHWQHYLWPKEFVVHNDHEALKFLKGQH 1675
            GKP+AYFSEKL+G ++ YSTYDK+L+AL+RTL+ WQHYLWPKEFV+H+DHE+LK ++ Q 
Sbjct: 957  GKPVAYFSEKLSGPSLNYSTYDKQLFALVRTLETWQHYLWPKEFVIHSDHESLKHIRSQA 1016

Query: 1674 KLNKRHARWMKFIETFPYAIHYKKGKENVVADALSRRYTLLSTLQTKILGFEMLKEMYSS 1495
            KLN+RHA+W++FIE+FPY I +KKGKENV+ DALSRRY +LS L  KI G E +KE Y+ 
Sbjct: 1017 KLNRRHAKWVEFIESFPYVIKHKKGKENVIVDALSRRYAMLSQLDFKIFGLETIKEQYAH 1076

Query: 1494 DHYFKEIFEKCLLA-PFGKYFLHEGFFYCEGRLCISSCSTRILLVKEAHCGGLMGHFGVK 1318
            D  FK++   C     + K+ L  GF +   +LCI + S  +LL++EAH GGLMGHFGVK
Sbjct: 1077 DDDFKDVLLNCKEGRTWNKFVLTNGFVFRANKLCIPASSVHMLLLQEAHGGGLMGHFGVK 1136


>dbj|BAA89466.1| gag-pol polyprotein [Oryza sativa Indica Group]
          Length = 1587

 Score =  443 bits (1140), Expect = e-121
 Identities = 215/367 (58%), Positives = 263/367 (71%)
 Frame = -3

Query: 1391 HLVLLEFYLSKKHIVGV*WDILVSKTYEILHEHVFWPHLKRDVERFVGSCIECKKAKSRT 1212
            H++LL+    + H  G+     V K  +IL +H FWP  +RDVERFV  C  C+KAKSR 
Sbjct: 1221 HMLLLQ----EAHGGGLMGHFGVKKMEDILADHFFWPKKRRDVERFVARCTTCQKAKSRL 1276

Query: 1211 LPHGLYTPLEVPKEPWVDISMDFILGLPRTSKGIDSIFVVVDRFSKMAHFIPCRKSDDAS 1032
             PHGLY PL VP  PW DISMDF+LGLPRT KG DSIFVVVDRFSKMAHFIPC KSDDA+
Sbjct: 1277 NPHGLYMPLPVPSVPWEDISMDFVLGLPRTKKGRDSIFVVVDRFSKMAHFIPCHKSDDAT 1336

Query: 1031 HVASLFFTWIVRFHGIPRSIVSDRDTKFLSHFWRVLWKKLGTNLLYSTACHLQTDGQTEV 852
            HVA LFF  IVR HG+P +IVSDRDTKFLSHFWR LW KLGT LL+ST CH QTDGQTEV
Sbjct: 1337 HVADLFFREIVRLHGVPNTIVSDRDTKFLSHFWRTLWAKLGTKLLFSTTCHPQTDGQTEV 1396

Query: 851  VNRTIGQLLRAILKKKLRQWEECLPHIEFAYNRAPHSTTSHSPFQVVYGFNPLTPFDLVP 672
            VNRT+  +LRA+LKK ++ WEECLPH+EFAYNR+ HSTT   PF++VYG  P  P DL+P
Sbjct: 1397 VNRTLSTMLRAVLKKNIKMWEECLPHVEFAYNRSQHSTTKKCPFEIVYGLLPRAPIDLLP 1456

Query: 671  LPIDQALSKDASKKANFMKAIHIKVHEAIVNSNKKLVEKRNVGRRKVVFKPGD*VWVHFR 492
            LP  + ++ DA  +A  M  +H    E I   N K     + G++ V F+PGD VW+H R
Sbjct: 1457 LPTSERVNFDAKYRAELMLKLHETTKENIERMNIKYKLAGSKGKKHVAFEPGDLVWLHLR 1516

Query: 491  KERFPQQRKSKLDDRGDGQFQILEKINDNAYKVDLPGHYNVSATFNVSDLSLFDIGDGDS 312
            K+RFP  RKSKL  R DG F++L+KINDNAYK++LP  + VS TFN++DL  + +G+ D 
Sbjct: 1517 KDRFPNLRKSKLLPRADGPFKVLQKINDNAYKLELPADFGVSPTFNIADLKPY-LGEEDE 1575

Query: 311  RTGRSCE 291
               R+ +
Sbjct: 1576 LESRTTQ 1582



 Score =  233 bits (593), Expect = 3e-58
 Identities = 107/182 (58%), Positives = 140/182 (76%), Gaps = 1/182 (0%)
 Frame = -2

Query: 1854 GKPLAYFSEKLNGAAVRYSTYDKELYALIRTLKHWQHYLWPKEFVVHNDHEALKFLKGQH 1675
            GKP+AYFSEKL+G ++ YSTYDKEL+AL+RTL+ WQHYLWPKEFV+H+DHE+LK ++ Q 
Sbjct: 1061 GKPVAYFSEKLSGPSLNYSTYDKELFALVRTLETWQHYLWPKEFVIHSDHESLKHIRSQA 1120

Query: 1674 KLNKRHARWMKFIETFPYAIHYKKGKENVVADALSRRYTLLSTLQTKILGFEMLKEMYSS 1495
            K N+RHA+W++FIE+FPY I +KKGKENV+ADALSRRY +LS L  KI G E +KE Y+ 
Sbjct: 1121 KHNRRHAKWVEFIESFPYVIKHKKGKENVIADALSRRYAMLSQLDFKIFGLETIKEQYAH 1180

Query: 1494 DHYFKEIFEKCLLA-PFGKYFLHEGFFYCEGRLCISSCSTRILLVKEAHCGGLMGHFGVK 1318
            D  FK++   C     + K+ L  GF +   +LCI + S  +LL++EAH GGLMGHFGVK
Sbjct: 1181 DDDFKDVLLNCKEGRTWNKFVLTNGFVFRANKLCIPASSVHMLLLQEAHGGGLMGHFGVK 1240

Query: 1317 NL 1312
             +
Sbjct: 1241 KM 1242


>gb|ABI96971.1| putative gag-pol polyprotein [Triticum monococcum subsp.
            aegilopoides]
          Length = 1704

 Score =  442 bits (1138), Expect = e-121
 Identities = 219/384 (57%), Positives = 267/384 (69%), Gaps = 2/384 (0%)
 Frame = -3

Query: 1442 NIFCMKDFFIVKVD--CAYHLVLLEFYLSKKHIVGV*WDILVSKTYEILHEHVFWPHLKR 1269
            N F + D F+ + +  C     +    L + H  G+     V KT +IL  H FWP ++R
Sbjct: 1214 NKFVLNDGFVFRANKLCIPASSVRLLLLQEAHGGGLMGHFGVKKTEDILATHFFWPKMRR 1273

Query: 1268 DVERFVGSCIECKKAKSRTLPHGLYTPLEVPKEPWVDISMDFILGLPRTSKGIDSIFVVV 1089
            DVERFV  C  C++AKSR  PHGLY PL VP  PW DISMDF+LGLPRT KG DSIFVVV
Sbjct: 1274 DVERFVARCTTCQRAKSRLNPHGLYMPLPVPSVPWEDISMDFVLGLPRTKKGRDSIFVVV 1333

Query: 1088 DRFSKMAHFIPCRKSDDASHVASLFFTWIVRFHGIPRSIVSDRDTKFLSHFWRVLWKKLG 909
            DRFSKMAHFIPC KSDDA +VA LFF  I+R HG+P +IVSDRDTKFLSHFWR LW KLG
Sbjct: 1334 DRFSKMAHFIPCHKSDDAVNVADLFFREIIRLHGVPNTIVSDRDTKFLSHFWRCLWAKLG 1393

Query: 908  TNLLYSTACHLQTDGQTEVVNRTIGQLLRAILKKKLRQWEECLPHIEFAYNRAPHSTTSH 729
              LL+ST CH QTDGQTEVVNRT+  +LRA+LK   + WEECLPHIEFAYNR+ HSTT  
Sbjct: 1394 NKLLFSTTCHPQTDGQTEVVNRTLSTMLRAVLKNNKKMWEECLPHIEFAYNRSLHSTTKM 1453

Query: 728  SPFQVVYGFNPLTPFDLVPLPIDQALSKDASKKANFMKAIHIKVHEAIVNSNKKLVEKRN 549
             PF++VYGF P  P DL+PLP  + ++ DA +++  +  IH    E I   N K    R+
Sbjct: 1454 CPFEIVYGFLPRAPIDLLPLPSSEKVNFDAKERSELILKIHELTKENIERMNAKYKLARD 1513

Query: 548  VGRRKVVFKPGD*VWVHFRKERFPQQRKSKLDDRGDGQFQILEKINDNAYKVDLPGHYNV 369
             GR+ VVF PGD VW+H RK+RFP  RKSKL  R DG F++LEKINDNAYK++LP  + V
Sbjct: 1514 KGRKHVVFAPGDLVWLHLRKDRFPNLRKSKLMPRADGPFKVLEKINDNAYKLELPADFGV 1573

Query: 368  SATFNVSDLSLFDIGDGDSRTGRS 297
            S TFN++DL  + +G+ D    R+
Sbjct: 1574 SPTFNIADLKPY-LGEEDELPSRT 1596



 Score =  243 bits (620), Expect = 2e-61
 Identities = 111/180 (61%), Positives = 143/180 (79%), Gaps = 1/180 (0%)
 Frame = -2

Query: 1854 GKPLAYFSEKLNGAAVRYSTYDKELYALIRTLKHWQHYLWPKEFVVHNDHEALKFLKGQH 1675
            GKP+AYFSEK +G ++ YSTYDKELYAL+RTL+ WQHYLWPKEFV+H+DHE+LK +K Q 
Sbjct: 1077 GKPVAYFSEKFSGPSLNYSTYDKELYALVRTLETWQHYLWPKEFVIHSDHESLKHIKSQA 1136

Query: 1674 KLNKRHARWMKFIETFPYAIHYKKGKENVVADALSRRYTLLSTLQTKILGFEMLKEMYSS 1495
            KLN+RHA+W++FIETFPY I +KKGKENV+ADALSRRYT+LS L  KI G E +K+ Y  
Sbjct: 1137 KLNRRHAKWVEFIETFPYVIKHKKGKENVIADALSRRYTMLSQLDFKIFGLETIKDQYVH 1196

Query: 1494 DHYFKEIFEKCLLA-PFGKYFLHEGFFYCEGRLCISSCSTRILLVKEAHCGGLMGHFGVK 1318
            D  FK++ + C     + K+ L++GF +   +LCI + S R+LL++EAH GGLMGHFGVK
Sbjct: 1197 DAEFKDVLQNCKEGRTWNKFVLNDGFVFRANKLCIPASSVRLLLLQEAHGGGLMGHFGVK 1256


>gb|AAQ56388.1| putative gag-pol polyprotein [Oryza sativa Japonica Group]
            gi|91795218|gb|ABE60890.1| putative polyprotein [Oryza
            sativa Japonica Group]
          Length = 1616

 Score =  442 bits (1137), Expect = e-121
 Identities = 212/359 (59%), Positives = 258/359 (71%)
 Frame = -3

Query: 1367 LSKKHIVGV*WDILVSKTYEILHEHVFWPHLKRDVERFVGSCIECKKAKSRTLPHGLYTP 1188
            L + H  G+     V KT +IL +H FWP ++RDVERFV  C  C+KAKSR  PHGLY P
Sbjct: 1225 LQEAHGGGLMGHFGVKKTEDILADHFFWPKMRRDVERFVARCTTCQKAKSRLNPHGLYMP 1284

Query: 1187 LEVPKEPWVDISMDFILGLPRTSKGIDSIFVVVDRFSKMAHFIPCRKSDDASHVASLFFT 1008
            L VP  PW DISMDF+LGLPRT KG DSIFVVVDRFSKMAHFIPC KSDDA+HVA LFF 
Sbjct: 1285 LPVPSVPWEDISMDFVLGLPRTKKGRDSIFVVVDRFSKMAHFIPCHKSDDATHVADLFFR 1344

Query: 1007 WIVRFHGIPRSIVSDRDTKFLSHFWRVLWKKLGTNLLYSTACHLQTDGQTEVVNRTIGQL 828
             IVR HG+P +IVSDRDTKFLSHFWR LW KLGT  L+ST CH QTDGQTEVVNRT+  +
Sbjct: 1345 EIVRLHGVPNTIVSDRDTKFLSHFWRTLWAKLGTKFLFSTTCHPQTDGQTEVVNRTLSTM 1404

Query: 827  LRAILKKKLRQWEECLPHIEFAYNRAPHSTTSHSPFQVVYGFNPLTPFDLVPLPIDQALS 648
            LRA+LKK ++ WEECLPH+EFAYNR+ HSTT   PF++VYG  P  P DL+P P  + ++
Sbjct: 1405 LRAVLKKNIKMWEECLPHVEFAYNRSQHSTTKKCPFEIVYGLLPRAPIDLLPHPTSERVN 1464

Query: 647  KDASKKANFMKAIHIKVHEAIVNSNKKLVEKRNVGRRKVVFKPGD*VWVHFRKERFPQQR 468
             DA  +A  M  +H    E I   N K     + G++ V F+PGD VW+H RK+RFP  R
Sbjct: 1465 FDAKYRAELMLKLHETTKENIERMNIKYKLAGSKGKKHVAFEPGDLVWLHLRKDRFPNLR 1524

Query: 467  KSKLDDRGDGQFQILEKINDNAYKVDLPGHYNVSATFNVSDLSLFDIGDGDSRTGRSCE 291
            KSKL  R DG F++L+KINDNAYK++LP  + VS TFN++DL  + +G+ D    R+ +
Sbjct: 1525 KSKLLPRADGPFKVLQKINDNAYKLELPADFGVSPTFNIADLKPY-LGEEDELESRTTQ 1582



 Score =  236 bits (601), Expect = 3e-59
 Identities = 109/180 (60%), Positives = 140/180 (77%), Gaps = 1/180 (0%)
 Frame = -2

Query: 1854 GKPLAYFSEKLNGAAVRYSTYDKELYALIRTLKHWQHYLWPKEFVVHNDHEALKFLKGQH 1675
            GKP+AYFSEKL+G ++ YSTYDKEL+AL+RTL+ WQHYLWPKEFV+H+DHE+LK ++ Q 
Sbjct: 1061 GKPVAYFSEKLSGPSLNYSTYDKELFALVRTLETWQHYLWPKEFVIHSDHESLKHIRSQA 1120

Query: 1674 KLNKRHARWMKFIETFPYAIHYKKGKENVVADALSRRYTLLSTLQTKILGFEMLKEMYSS 1495
            KLN+RHA+W++FIE+FPY I +KKGKENV+ADALSRRY +LS L  KI G E +KE Y+ 
Sbjct: 1121 KLNRRHAKWVEFIESFPYVIKHKKGKENVIADALSRRYAMLSQLDFKIFGLETIKEQYAH 1180

Query: 1494 DHYFKEIFEKCLLA-PFGKYFLHEGFFYCEGRLCISSCSTRILLVKEAHCGGLMGHFGVK 1318
            D  FK +   C     + K+ L  GF +   +LCI + S R+LL++EAH GGLMGHFGVK
Sbjct: 1181 DDDFKNVLLNCKEGRTWNKFVLTNGFVFRANKLCIPASSVRMLLLQEAHGGGLMGHFGVK 1240


Top