BLASTX nr result

ID: Angelica23_contig00001158 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica23_contig00001158
         (2645 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CAN71595.1| hypothetical protein VITISV_010143 [Vitis vinifera]   343   e-156
gb|AAD25646.1| putative retroelement pol polyprotein [Arabidopsi...   319   e-134
ref|XP_003557045.1| PREDICTED: uncharacterized protein LOC100783...   315   e-134
gb|AAB61111.1| Strong similarity to Zea mays retrotransposon Hop...   315   e-133
gb|AFN88207.1| integrase core domain containing protein [Phaseol...   285   e-130

>emb|CAN71595.1| hypothetical protein VITISV_010143 [Vitis vinifera]
          Length = 1523

 Score =  343 bits (880), Expect(4) = e-156
 Identities = 220/616 (35%), Positives = 312/616 (50%), Gaps = 53/616 (8%)
 Frame = +2

Query: 212  LYTHSYLSSQSNSYTLLKGLTPYEILFNKLPSYTHLKVFGSLWYATSVTPHKDKFADRAH 391
            + T +YL +++ +  LL+G TP+E LF+K P+Y+HL+VFG   + ++      KF  R+ 
Sbjct: 715  ILTAAYLINRTPT-PLLQGKTPFEKLFHKSPNYSHLRVFGCRCFVSTHPLRPSKFDPRSI 773

Query: 392  RCLFLGYPFAKKAYKFLNLQTRKVFVSRDVIFVEDSFPFKDISSTSTPLFSSH------- 550
              +F+GYP  +K YK  +L+ +K  +SRDV F E  FP+++  ST++P   +        
Sbjct: 774  ESVFIGYPHGQKGYKVYSLKDKKXLISRDVTFFETEFPYQNXLSTTSPSLDTFFPSLPQT 833

Query: 551  TDLXXXXXXXXXXXXXXXXXXFDSVLSDFGPSTESEF-------------------VLVQ 673
             D+                    SV     P+ ++                     V+ Q
Sbjct: 834  PDIDDDHISFNHSGSNLQPSATSSVDXHPQPTLDNSHSSSHVDPPSSPPSLNTSPPVISQ 893

Query: 674  PLPS-----TRPVRTKVVPAKFQDF---------------TGLSSHMQSTVNT-----TL 778
            P PS     +RP +T   P   QDF               T   +H   T+++     + 
Sbjct: 894  PSPSQPRRSSRPTKT---PTTLQDFHIEAALPSRPVPPSSTSEVAH-SGTIHSLSQVLSY 949

Query: 779  TNFSPAYQTFSANVASVPEPTSYYAACKHHVWCXXXXXXXXXXXXNNTWQIVPLPPGKKV 958
               SP ++ F+  +    EP S+  A     W             N TW +VPLP  KK 
Sbjct: 950  DRLSPMHKAFTVKITLAKEPRSFSQAVLDSRWREAMNTEIQALQANKTWSLVPLPSHKKP 1009

Query: 959  VSCKWLYKVKFKPNGTVDRYKARLVARGFTQTEGLDYFDTFAPVAKMVTMRVLLSLVAVN 1138
            + CKW+YK+K+ P+GT++RYKARLVA+GF+Q EG+DY +TFAPVAK+ T+RVLLSL ++ 
Sbjct: 1010 IGCKWVYKIKYNPDGTIERYKARLVAKGFSQVEGIDYRETFAPVAKLTTVRVLLSLASIQ 1069

Query: 1139 GRSVTQMDVINVFLHGHLQEEGYMSIPPDYVLSPAHLASSLDRPLVCRLIKSIYGLKQAP 1318
            G  + Q+DV N FL+G L E+ YM +PP +     H         VC+L KS+YGLKQA 
Sbjct: 1070 GWHLHQLDVNNAFLNGDLYEDVYMQLPPGFGRKGEH--------RVCKLHKSLYGLKQAS 1121

Query: 1319 QVWAKKDCSVLLDYGFTQAHTDHSLFIYRYASSXXXXXXXXXXXXXXXGNDVGLITKIKT 1498
            + W  K  S L   GF Q+ +D+SLF  R                   GN +  I + K 
Sbjct: 1122 RQWFLKLSSALKAAGFKQSWSDYSLF-XRNTQGRFTTLLVYVDDVILAGNSLEDIIETKQ 1180

Query: 1499 YLASHFKIKDLEPLKYFLGIEFARSVKGIYLNQRKYSSDIIKDIGFESAKPSLVPTSHGT 1678
            +LASHFK+KD+  L+YFLGIE ARS +GI L QRKY+ ++++D GF  AKPS  P     
Sbjct: 1181 FLASHFKLKDMGQLRYFLGIEVARSKQGIVLCQRKYALELLEDAGFLGAKPSRFPVEQSL 1240

Query: 1679 KSXXXXXXXXWIFVXXXXXXXXXXXXXXXXDYSTRP--LYAVHVLAQFMSQPRQCHLDVA 1852
                                             TRP  +YAVH+L+QFM  PRQ HLD A
Sbjct: 1241 TLTRGDGAE-----LKDASQYRRLVGRLIYLTITRPDLVYAVHILSQFMDTPRQPHLDAA 1295

Query: 1853 FKVVRYIKHTLGQGIF 1900
            +KV+RY+K T GQGIF
Sbjct: 1296 YKVLRYVKQTPGQGIF 1311



 Score =  112 bits (281), Expect(4) = e-156
 Identities = 51/88 (57%), Positives = 73/88 (82%)
 Frame = +2

Query: 2177 FCDNKSAIYLASNPVFHERTKHIEINCHLVREKLLKGVIQTAYLASKHQPADLFTKSIPS 2356
            FCDN++AI++ASNPVFHERTKHIE++CH+VREK+ +G+++T ++ ++ QPADLFTK + S
Sbjct: 1400 FCDNQAAIHIASNPVFHERTKHIEMDCHVVREKVQRGLVKTMHIRTQEQPADLFTKPLSS 1459

Query: 2357 YAMSYLLSKLGVLNLFLHPSLRGDDNDI 2440
               S LLSKLGV+N  +H +LRG + D+
Sbjct: 1460 KQFSTLLSKLGVIN--IHTNLRGSEVDV 1485



 Score = 94.4 bits (233), Expect(4) = e-156
 Identities = 45/71 (63%), Positives = 53/71 (74%)
 Frame = +1

Query: 1957 CPITRRSLTGYCITLGSSLISWKSKRQATVSRSSAEAEYRALADVCCEITWLVNLFFELG 2136
            C  TRRS TGYCI  G++ ISWK+K+Q TVSRSSAEAEYR++A  CCEITWL +L  +L 
Sbjct: 1331 CKDTRRSTTGYCIFFGNAPISWKTKKQGTVSRSSAEAEYRSMATTCCEITWLRSLLADLN 1390

Query: 2137 VPSLCPVKLFC 2169
            V     VKLFC
Sbjct: 1391 VNHAHAVKLFC 1401



 Score = 75.9 bits (185), Expect(4) = e-156
 Identities = 39/71 (54%), Positives = 45/71 (63%)
 Frame = +3

Query: 6   IQFFRSDNDPEFLNYFLQAQLSTLGIVHQKSCTYTPQQNGVVERKHKSLRNTTRALRLQP 185
           ++  RSDN PEF +       S+ GI+HQ SC  TPQQNGVVERKH+ L N  RAL  Q 
Sbjct: 648 VKIVRSDNGPEFKHTQFY---SSRGILHQTSCINTPQQNGVVERKHRHLLNVARALLFQS 704

Query: 186 SLPLSFWGDCI 218
            LP  FWGD I
Sbjct: 705 HLPKPFWGDAI 715


>gb|AAD25646.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
          Length = 1461

 Score =  319 bits (818), Expect(4) = e-134
 Identities = 197/574 (34%), Positives = 287/574 (50%), Gaps = 8/574 (1%)
 Frame = +2

Query: 257  LLKGLTPYEILFNKLPSYTHLKVFGSLWYATSVTPHKDKFADRAHRCLFLGYPFAKKAYK 436
            LL   TP+E+L  KLP Y+ LK FG L Y+++ +  + KF  R+  C+FLGYPF  K YK
Sbjct: 769  LLSNKTPFEVLTGKLPDYSQLKTFGCLCYSSTSSKQRHKFLPRSRACVFLGYPFGFKGYK 828

Query: 437  FLNLQTRKVFVSRDVIFVEDSFPFKDISSTSTPLFSSHTDLXXXXXXXXXXXXXXXXXXF 616
             L+L++  V +SR+V F E+ FP      ++T      T +                   
Sbjct: 829  LLDLESNVVHISRNVEFHEELFPLASSQQSATTASDVFTPM------------------- 869

Query: 617  DSVLSDFGPSTESEFVLVQPLPSTRPVRTKVV--PAKFQD----FTGLSSHMQSTVNTTL 778
            D + S  G S  S     Q  PST+  + ++   PA  QD    F         + + + 
Sbjct: 870  DPLSS--GNSITSHLPSPQISPSTQISKRRITKFPAHLQDYHCYFVNKDDSHPISSSLSY 927

Query: 779  TNFSPAYQTFSANVASVPEPTSYYAACKHHVWCXXXXXXXXXXXXNNTWQIVPLPPGKKV 958
            +  SP++  +  N++ +P P SY+ A     WC             +TW+I  LPPGKK 
Sbjct: 928  SQISPSHMLYINNISKIPIPQSYHEAKDSKEWCGAIDQEIGAMERTDTWEITSLPPGKKA 987

Query: 959  VSCKWLYKVKFKPNGTVDRYKARLVARGFTQTEGLDYFDTFAPVAKMVTMRVLLSLVAVN 1138
            V CKW++ VKF  +G+++R+KAR+VA+G+TQ EGLDY +TF+PVAKM T+++LL + A  
Sbjct: 988  VGCKWVFTVKFHADGSLERFKARIVAKGYTQKEGLDYTETFSPVAKMATVKLLLKVSASK 1047

Query: 1139 GRSVTQMDVINVFLHGHLQEEGYMSIPPDYVLSPAHLASSLDRPLVCRLIKSIYGLKQAP 1318
               + Q+D+ N FL+G L+E  YM +P  Y        +SL   +VCRL KSIYGLKQA 
Sbjct: 1048 KWYLNQLDISNAFLNGDLEETIYMKLPDGYA---DIKGTSLPPNVVCRLKKSIYGLKQAS 1104

Query: 1319 QVWAKKDCSVLLDYGFTQAHTDHSLFIYRYASSXXXXXXXXXXXXXXXGNDVGLITKIKT 1498
            + W  K  + LL  GF + H DH+LF+ R   S                        +  
Sbjct: 1105 RQWFLKFSNSLLALGFEKQHGDHTLFV-RCIGSEFIVLLVYVDDIVIASTTEQAAQSLTE 1163

Query: 1499 YLASHFKIKDLEPLKYFLGIEFARSVKGIYLNQRKYSSDIIKDIGFESAKPSLVPTSHGT 1678
             L + FK+++L PLKYFLG+E AR+ +GI L+QRKY+ +++        KPS +P +   
Sbjct: 1164 ALKASFKLRELGPLKYFLGLEVARTSEGISLSQRKYALELLTSADMLDCKPSSIPMTPNI 1223

Query: 1679 KSXXXXXXXXWIFVXXXXXXXXXXXXXXXXDYSTRP--LYAVHVLAQFMSQPRQCHLDVA 1852
            +            +                   TRP   +AV+ L QF S PR  HL   
Sbjct: 1224 RLSKNDG-----LLLEDKEMYRRLVGKLMYLTITRPDITFAVNKLCQFSSAPRTAHLAAV 1278

Query: 1853 FKVVRYIKHTLGQGIFXXXXXXXXXXXXXXXXWG 1954
            +KV++YIK T+GQG+F                WG
Sbjct: 1279 YKVLQYIKGTVGQGLFYSAEDDLTLKGYTDADWG 1312



 Score = 88.2 bits (217), Expect(4) = e-134
 Identities = 39/80 (48%), Positives = 59/80 (73%)
 Frame = +2

Query: 2177 FCDNKSAIYLASNPVFHERTKHIEINCHLVREKLLKGVIQTAYLASKHQPADLFTKSIPS 2356
            + D+ +A+Y+A+NPVFHERTKHIEI+CH VREKL  G ++  ++ +K Q AD+ TK +  
Sbjct: 1382 YSDSTAAVYIATNPVFHERTKHIEIDCHTVREKLDNGQLKLLHVKTKDQVADILTKPLFP 1441

Query: 2357 YAMSYLLSKLGVLNLFLHPS 2416
            Y  ++LLSK+ + N+F+  S
Sbjct: 1442 YQFAHLLSKMSIQNIFVFSS 1461



 Score = 83.6 bits (205), Expect(4) = e-134
 Identities = 42/69 (60%), Positives = 50/69 (72%)
 Frame = +1

Query: 1951 GGCPITRRSLTGYCITLGSSLISWKSKRQATVSRSSAEAEYRALADVCCEITWLVNLFFE 2130
            G CP +RRS TG+ + +GSSLISW+SK+Q TVSRSSAEAEYRALA   CE+ WL  L   
Sbjct: 1312 GTCPDSRRSTTGFTMFVGSSLISWRSKKQPTVSRSSAEAEYRALALASCEMAWLSTLLLA 1371

Query: 2131 LGVPSLCPV 2157
            L V S  P+
Sbjct: 1372 LRVHSGVPI 1380



 Score = 60.8 bits (146), Expect(4) = e-134
 Identities = 32/72 (44%), Positives = 43/72 (59%), Gaps = 1/72 (1%)
 Frame = +3

Query: 6   IQFFRSDNDPEF-LNYFLQAQLSTLGIVHQKSCTYTPQQNGVVERKHKSLRNTTRALRLQ 182
           ++  RSDN  E     F +A+    GIV   SC  TP+QN VVERKH+ + N  RAL  Q
Sbjct: 688 VKSVRSDNAKELAFTEFYKAK----GIVSFHSCPETPEQNSVVERKHQHILNVARALMFQ 743

Query: 183 PSLPLSFWGDCI 218
            ++ L +WGDC+
Sbjct: 744 SNMSLPYWGDCV 755


>ref|XP_003557045.1| PREDICTED: uncharacterized protein LOC100783177 [Glycine max]
          Length = 2219

 Score =  315 bits (808), Expect(4) = e-134
 Identities = 204/580 (35%), Positives = 288/580 (49%), Gaps = 33/580 (5%)
 Frame = +2

Query: 260  LKGLTPYEILFNKLPSYTHLKVFGSLWYATSVTPHKDKFADRAHRCLFLGYPFAKKAYKF 439
            L+  +PY +L+N  P +  LKVFGSL +A+++  H+ K   RA +C+FLGY    K    
Sbjct: 681  LQNKSPYTLLYNTAPDFDTLKVFGSLVFASTLQSHRTKLDLRARKCVFLGYKSGVKGVVL 740

Query: 440  LNLQTRKVFVSRDVIFVEDSFPFKDISSTS---------TPLFSSHT-------DLXXXX 571
            L+L    +F+SRDV   E  FP++  S  +         TP  S  T       D+    
Sbjct: 741  LDLLNNSIFLSRDVTHHEHIFPYQSSSPKTPWEYHSISPTPNDSDITLDSDISLDINAEQ 800

Query: 572  XXXXXXXXXXXXXXFDSVLSDFGPSTES-EFVLVQPLPSTRPVRTKVVPAKFQDF----T 736
                           D+V+SD   ST   +     PL  ++P+R +  P    D+    T
Sbjct: 801  SPSPPHSSLSPNISNDTVISDTSTSTPPPKDHNDSPLLHSKPIRQRRAPLHLSDYVCHNT 860

Query: 737  GLSSHMQSTVNT----------TLTNFSPAYQTFSANVASVPEPTSYYAACKHHVWCXXX 886
              +SH   T  T          +LT  SP+++ FS ++    EP SY  A KH  W    
Sbjct: 861  SPTSHESLTSGTKSKYPLSSFHSLTLLSPSHKAFSMSITHCTEPQSYEEASKHEHWVTAM 920

Query: 887  XXXXXXXXXNNTWQIVPLPPGKKVVSCKWLYKVKFKPNGTVDRYKARLVARGFTQTEGLD 1066
                     N TW+IV LPP  K + CKW+YKVK K NG ++RYKARLVA+G+ Q EG+D
Sbjct: 921  KEELNALAKNCTWKIVELPPHTKPIGCKWVYKVKHKANGQIERYKARLVAKGYNQVEGID 980

Query: 1067 YFDTFAPVAKMVTMRVLLSLVAVNGRSVTQMDVINVFLHGHLQEEGYMSIPPDYVLSPAH 1246
            YF+TF+PVAK+ T+R LL++ A+    + Q+DV N FLHG LQE+ YM IP     +  +
Sbjct: 981  YFETFSPVAKITTVRTLLAVAAIKNWHLHQLDVNNAFLHGDLQEDVYMKIPDGVTCAKPN 1040

Query: 1247 LASSLDRPLVCRLIKSIYGLKQAPQVWAKKDCSVLLDYGFTQAHTDHSLFIYRYASSXXX 1426
                     VC+L KS+YGLKQA + W +K  ++LL  G+ Q+ +D+SLF     ++   
Sbjct: 1041 --------SVCKLQKSLYGLKQASRKWYEKLTNLLLKEGYIQSISDYSLFTLTKGNT-FT 1091

Query: 1427 XXXXXXXXXXXXGNDVGLITKIKTYLASHFKIKDLEPLKYFLGIEFARSVKGIYLNQRKY 1606
                        G+ +    +IK  L   FKIK+L  LKYFLG+E A S  GI ++QRKY
Sbjct: 1092 ALLVYVDDIILAGDSIDEFDRIKNVLDLAFKIKNLGKLKYFLGLEVAHSRLGITISQRKY 1151

Query: 1607 SSDIIKDIGFESAKPSLVPTSHGTKSXXXXXXXXWIFVXXXXXXXXXXXXXXXXDYSTRP 1786
              D++KD G    KP+  P     K                               +TRP
Sbjct: 1152 CLDLLKDSGLLGCKPASTPLDTSIKLHSAAGTP-----YADISGYRRIVGKLLYLNTTRP 1206

Query: 1787 --LYAVHVLAQFMSQPRQCHLDVAFKVVRYIKHTLGQGIF 1900
               +A   L+QFM  P   H + A +V+RY+K+  GQGIF
Sbjct: 1207 DIAFATQQLSQFMQAPTNVHFNAACRVLRYLKNNPGQGIF 1246



 Score = 85.5 bits (210), Expect(4) = e-134
 Identities = 38/72 (52%), Positives = 53/72 (73%)
 Frame = +1

Query: 1954 GCPITRRSLTGYCITLGSSLISWKSKRQATVSRSSAEAEYRALADVCCEITWLVNLFFEL 2133
            GC  +R+S++GYC  +G SL+SW++K+QATVSRSS+EAEYRAL+   CE+ WL+ LF +L
Sbjct: 1265 GCMDSRKSISGYCFFIGKSLVSWRAKKQATVSRSSSEAEYRALSSAACELQWLLYLFADL 1324

Query: 2134 GVPSLCPVKLFC 2169
             V       L+C
Sbjct: 1325 RVQLTRTPTLYC 1336



 Score = 83.2 bits (204), Expect(4) = e-134
 Identities = 35/58 (60%), Positives = 49/58 (84%)
 Frame = +2

Query: 2177 FCDNKSAIYLASNPVFHERTKHIEINCHLVREKLLKGVIQTAYLASKHQPADLFTKSI 2350
            +CDN+SA+++ASNPVFHERTKH+EI+CHLVREKLLKG ++   +++  Q AD  TK++
Sbjct: 1335 YCDNQSAVHIASNPVFHERTKHLEIDCHLVREKLLKGTLKLLPVSTSDQVADFLTKAL 1392



 Score = 67.0 bits (162), Expect(4) = e-134
 Identities = 35/67 (52%), Positives = 44/67 (65%)
 Frame = +3

Query: 6   IQFFRSDNDPEFLNYFLQAQLSTLGIVHQKSCTYTPQQNGVVERKHKSLRNTTRALRLQP 185
           I  FR DN PEFL   +    ++ GI+HQ SC  +PQQNG VERKH+ + N  RAL +Q 
Sbjct: 599 IWHFRLDNGPEFL---MPDFYASKGILHQTSCVDSPQQNGRVERKHQQILNIGRALLVQS 655

Query: 186 SLPLSFW 206
           +LP SFW
Sbjct: 656 NLPKSFW 662


>gb|AAB61111.1| Strong similarity to Zea mays retrotransposon Hopscotch polyprotein
            (gb|U12626) [Arabidopsis thaliana]
          Length = 1315

 Score =  315 bits (807), Expect(4) = e-133
 Identities = 194/562 (34%), Positives = 286/562 (50%), Gaps = 14/562 (2%)
 Frame = +2

Query: 257  LLKGLTPYEILFNKLPSYTHLKVFGSLWYATSVTPHKDKFADRAHRCLFLGYPFAKKAYK 436
            +L+   P+E+L   +P+Y H+KVFG L YA++    + KF+ RA  C F+GYP   K YK
Sbjct: 607  ILEDKCPFEVLTKTVPTYDHIKVFGCLCYASTSPKDRHKFSPRAKACAFIGYPSGFKGYK 666

Query: 437  FLNLQTRKVFVSRDVIFVEDSFPF--KDISSTSTPLFSSHTDLXXXXXXXXXXXXXXXXX 610
             L+L+T  + VSR V+F E+ FPF   D+S      F                       
Sbjct: 667  LLDLETHSIIVSRHVVFHEELFPFLGSDLSQEEQNFFPDLNPTPPMQRQSSDHVNPSDSS 726

Query: 611  XFDSVLSDFGPSTESEFVLVQPLPSTRPVRTKVV-PAKFQDFTGLSSHMQSTVNTT---- 775
                +L    P+         P PS +    K   PA  QD+     +  S V++T    
Sbjct: 727  SSVEILPSANPTNNV------PEPSVQTSHRKAKKPAYLQDY-----YCHSVVSSTPHEI 775

Query: 776  -----LTNFSPAYQTFSANVASVPEPTSYYAACKHHVWCXXXXXXXXXXXXNNTWQIVPL 940
                     +  Y TF A +    EP++Y  A K  VW              +TW++  L
Sbjct: 776  RKFLSYDRINDPYLTFLACLDKTKEPSNYTEAEKLQVWRDAMGAEFDFLEGTHTWEVCSL 835

Query: 941  PPGKKVVSCKWLYKVKFKPNGTVDRYKARLVARGFTQTEGLDYFDTFAPVAKMVTMRVLL 1120
            P  K+ + C+W++K+K+  +G+V+RYKARLVA+G+TQ EG+DY +TF+PVAK+ ++++LL
Sbjct: 836  PADKRCIGCRWIFKIKYNSDGSVERYKARLVAQGYTQKEGIDYNETFSPVAKLNSVKLLL 895

Query: 1121 SLVAVNGRSVTQMDVINVFLHGHLQEEGYMSIPPDYVLSPAHLASSLDRPLVCRLIKSIY 1300
             + A    S+TQ+D+ N FL+G L EE YM +P  Y    +    SL    VCRL KS+Y
Sbjct: 896  GVAARFKLSLTQLDISNAFLNGDLDEEIYMRLPQGYA---SRQGDSLPPNAVCRLKKSLY 952

Query: 1301 GLKQAPQVWAKKDCSVLLDYGFTQAHTDHSLFIYRYASSXXXXXXXXXXXXXXXGNDVGL 1480
            GLKQA + W  K  S LL  GF Q++ DH+ F+ + +                  N+   
Sbjct: 953  GLKQASRQWYLKFSSTLLGLGFIQSYCDHTCFL-KISDGIFLCVLVYIDDIIIASNNDAA 1011

Query: 1481 ITKIKTYLASHFKIKDLEPLKYFLGIEFARSVKGIYLNQRKYSSDIIKDIGFESAKPSLV 1660
            +  +K+ + S FK++DL  LKYFLG+E  RS KGI+++QRKY+ D++ + G    KPS +
Sbjct: 1012 VDILKSQMKSFFKLRDLGELKYFLGLEIVRSDKGIHISQRKYALDLLDETGQLGCKPSSI 1071

Query: 1661 PTSHGTKSXXXXXXXXWIFVXXXXXXXXXXXXXXXXDYSTRP--LYAVHVLAQFMSQPRQ 1834
            P      S          FV                   TRP   +AV+ LAQF   PR+
Sbjct: 1072 PMD---PSMVFAHDSGGDFVEVGPYRRLIGRLMYLN--ITRPDITFAVNKLAQFSMAPRK 1126

Query: 1835 CHLDVAFKVVRYIKHTLGQGIF 1900
             HL   +K+++YIK T+GQG+F
Sbjct: 1127 AHLQAVYKILQYIKGTIGQGLF 1148



 Score = 86.7 bits (213), Expect(4) = e-133
 Identities = 40/77 (51%), Positives = 59/77 (76%)
 Frame = +2

Query: 2177 FCDNKSAIYLASNPVFHERTKHIEINCHLVREKLLKGVIQTAYLASKHQPADLFTKSIPS 2356
            FCDN++AI++A+N VFHERTKHIE +CH VRE+LLKG+ +  ++ ++ Q AD FTK +  
Sbjct: 1237 FCDNEAAIHIANNHVFHERTKHIESDCHSVRERLLKGLFELYHINTELQIADPFTKPLYP 1296

Query: 2357 YAMSYLLSKLGVLNLFL 2407
                 L+SK+G+LN+F+
Sbjct: 1297 SHFHRLISKMGLLNIFV 1313



 Score = 79.7 bits (195), Expect(4) = e-133
 Identities = 39/71 (54%), Positives = 48/71 (67%)
 Frame = +1

Query: 1957 CPITRRSLTGYCITLGSSLISWKSKRQATVSRSSAEAEYRALADVCCEITWLVNLFFELG 2136
            C  +RRS +GYC+ LG SLI WKS++Q  VS+SSAEAEYR+L+    E+ WL N   EL 
Sbjct: 1168 CRDSRRSTSGYCMFLGDSLICWKSRKQDVVSKSSAEAEYRSLSVATDELVWLTNFLKELQ 1227

Query: 2137 VPSLCPVKLFC 2169
            VP   P  LFC
Sbjct: 1228 VPLSKPTLLFC 1238



 Score = 67.0 bits (162), Expect(4) = e-133
 Identities = 39/71 (54%), Positives = 46/71 (64%)
 Frame = +3

Query: 6   IQFFRSDNDPEFLNYFLQAQLSTLGIVHQKSCTYTPQQNGVVERKHKSLRNTTRALRLQP 185
           I+  RSDN PE LN F Q   S  GIV   SC  TPQQN VVERKH+ + N  R+L  Q 
Sbjct: 526 IKGVRSDNAPE-LN-FTQFYHSK-GIVPYHSCPETPQQNSVVERKHQHILNVARSLFFQS 582

Query: 186 SLPLSFWGDCI 218
            +P+S+WGDCI
Sbjct: 583 HIPISYWGDCI 593


>gb|AFN88207.1| integrase core domain containing protein [Phaseolus vulgaris]
          Length = 1387

 Score =  285 bits (729), Expect(4) = e-130
 Identities = 184/556 (33%), Positives = 283/556 (50%), Gaps = 8/556 (1%)
 Frame = +2

Query: 254  TLLKGLTPYEILFNKLPSYT-HLKVFGSLWYATSVTPHKDKFADRAHRCLFLGYPFAKKA 430
            ++L    P+ ILF   P ++   KVFGS  +  + +P  DK + R+H+C+FLG+  ++K 
Sbjct: 681  SVLDNKIPHSILFPHDPLHSLPPKVFGSTCFVHNFSPGLDKLSPRSHKCVFLGFTRSQKG 740

Query: 431  YKFLNLQTRKVFVSRDVIFVEDSFPFKDISSTSTPLFSSHTDLXXXXXXXXXXXXXXXXX 610
            YK  +    + F+S DV F E S  FK   S S    +                      
Sbjct: 741  YKCFSPSLNRYFISADVTFSESSLYFKSCPSPSMSSSNQVNIPLVVPSAPKDSPPPPTLQ 800

Query: 611  XFDSVLSDFGPSTESEFVLVQPLPSTRPVRTKVVPAKFQDFTGLSSHMQSTVNTTLTNF- 787
             +    +   PS +S   L+ P P + P  T           G+ S    + + T  ++ 
Sbjct: 801  VYSRRQTSHRPSDDS---LLVPTPHSPPAPTVEPDLPIAIRKGIRSTRNPSPHYTALSYH 857

Query: 788  --SPAYQTFSANVASVPEPTSYYAACKHHVWCXXXXXXXXXXXXNNTWQIVPLPPGKKVV 961
              S  + T  ++++SV  P S   A  H  W             N TW++VPLP  K VV
Sbjct: 858  RLSQPFYTCLSSISSVSIPKSVGDALAHPGWRQAMLDEMNALQNNGTWELVPLPSRKSVV 917

Query: 962  SCKWLYKVKFKPNGTVDRYKARLVARGFTQTEGLDYFDTFAPVAKMVTMRVLLSLVAVNG 1141
             C+W++ +K  P+GT+DR KARLVA+G+TQ  GLDY DTF+PVAKM ++R+ +++ A+  
Sbjct: 918  GCRWVFAIKVGPDGTIDRLKARLVAKGYTQIFGLDYGDTFSPVAKMASVRLFIAMAALQQ 977

Query: 1142 RSVTQMDVINVFLHGHLQEEGYMSIPPDYVLSPAHLASSLDRPLVCRLIKSIYGLKQAPQ 1321
              + Q+DV N FL+G LQEE YM  PP +V      A      LVCRL KS+YGLKQ+P+
Sbjct: 978  WPLYQLDVKNAFLNGDLQEEIYMEQPPGFV------AQGESSGLVCRLRKSLYGLKQSPR 1031

Query: 1322 VWAKKDCSVLLDYGFTQAHTDHSLFIYRYASSXXXXXXXXXXXXXXXGNDVGLITKIKTY 1501
             W  K  +V+  +G T++  DHS+F YR++S                G+D   I+++K +
Sbjct: 1032 AWFGKFSNVVQQFGMTRSEADHSVF-YRHSSVGCIYLVVYVDDIVLTGSDHHGISQVKQH 1090

Query: 1502 LASHFKIKDLEPLKYFLGIEFARSVKGIYLNQRKYSSDIIKDIGFESAKPSLVPTSHGTK 1681
            L  +F+ KDL  L+YFLGIE A+S  GI ++QRKY+ DI+++IG  ++K    P     K
Sbjct: 1091 LCQNFQTKDLGKLRYFLGIEVAQSNTGIVISQRKYALDILEEIGLMNSKSVDTPMDPNVK 1150

Query: 1682 SXXXXXXXXWIFVXXXXXXXXXXXXXXXXDY--STRP--LYAVHVLAQFMSQPRQCHLDV 1849
                                         +Y   TRP   +AV V++QF++ P + H + 
Sbjct: 1151 LLPNQG-------EPLSDPEKYRRLVGKLNYLTVTRPDISFAVSVVSQFLNSPCEDHWNA 1203

Query: 1850 AFKVVRYIKHTLGQGI 1897
              ++++YIK + G+G+
Sbjct: 1204 VIRILKYIKGSPGKGL 1219



 Score = 93.6 bits (231), Expect(4) = e-130
 Identities = 39/75 (52%), Positives = 60/75 (80%)
 Frame = +2

Query: 2180 CDNKSAIYLASNPVFHERTKHIEINCHLVREKLLKGVIQTAYLASKHQPADLFTKSIPSY 2359
            CDN++A++++SNPVFHERTKHIEI+CH +REK++ G I+T ++ S +Q AD+FTKS+   
Sbjct: 1310 CDNQAALHISSNPVFHERTKHIEIDCHFIREKIISGDIKTEFVNSNNQLADIFTKSLRGP 1369

Query: 2360 AMSYLLSKLGVLNLF 2404
             + Y+ +KLG  +L+
Sbjct: 1370 RIDYICNKLGTYDLY 1384



 Score = 83.6 bits (205), Expect(4) = e-130
 Identities = 38/72 (52%), Positives = 50/72 (69%)
 Frame = +1

Query: 1954 GCPITRRSLTGYCITLGSSLISWKSKRQATVSRSSAEAEYRALADVCCEITWLVNLFFEL 2133
            G P  RRS +GYC+++G +LISWKSK+Q+ V+RSSAEAEYRA+A   CE+ WL  L  EL
Sbjct: 1239 GSPSDRRSTSGYCVSIGDNLISWKSKKQSVVARSSAEAEYRAMASATCELIWLKQLLKEL 1298

Query: 2134 GVPSLCPVKLFC 2169
                +  + L C
Sbjct: 1299 QFGDVTQMTLIC 1310



 Score = 75.5 bits (184), Expect(4) = e-130
 Identities = 33/71 (46%), Positives = 48/71 (67%)
 Frame = +3

Query: 6   IQFFRSDNDPEFLNYFLQAQLSTLGIVHQKSCTYTPQQNGVVERKHKSLRNTTRALRLQP 185
           I+  RSDN  E+L++  +  +++ GI+HQ SC YTPQQNGV ERK++ L  TTR + +  
Sbjct: 598 IRILRSDNGREYLSHSFKNFMASHGILHQTSCAYTPQQNGVAERKNRHLVETTRTILIHG 657

Query: 186 SLPLSFWGDCI 218
            +P  FWGD +
Sbjct: 658 DVPQHFWGDAV 668

