BLASTX nr result
ID: Cephaelis21_contig00003787
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Cephaelis21_contig00003787 (1420 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002518705.1| cysteine protease, putative [Ricinus communi... 441 e-177 ref|XP_002285299.1| PREDICTED: cysteine proteinase RD21a-like [V... 443 e-173 ref|XP_002313136.1| predicted protein [Populus trichocarpa] gi|2... 437 e-172 ref|XP_002298740.1| predicted protein [Populus trichocarpa] gi|2... 431 e-172 gb|ABQ10202.1| cysteine protease Cp4 [Actinidia deliciosa] 431 e-169 >ref|XP_002518705.1| cysteine protease, putative [Ricinus communis] gi|223542086|gb|EEF43630.1| cysteine protease, putative [Ricinus communis] Length = 471 Score = 441 bits (1133), Expect(2) = e-177 Identities = 203/289 (70%), Positives = 229/289 (79%), Gaps = 6/289 (2%) Frame = +2 Query: 572 GSCWAFSTIAAVEGINQIATGELITLSEQELVDCDTAYNQGCNGGLMDYAFDFIVKNGGI 751 GSCWAFST+ AVEGINQI TGELI+LSEQELVDCD +YNQGCNGGLMDYAF+FI+ NGGI Sbjct: 160 GSCWAFSTVGAVEGINQIVTGELISLSEQELVDCDKSYNQGCNGGLMDYAFEFIINNGGI 219 Query: 752 DTEEDYTYTARDDMCDQYRKNARVVSIDGYEDVHGNDEKSLKKALAHQPISVAIEAGGRA 931 DTEEDY Y A D++CD RKNA+VV+IDGYEDV NDE SLKKA+AHQP+SVAIEAGGRA Sbjct: 220 DTEEDYPYKASDNICDPNRKNAKVVTIDGYEDVPENDENSLKKAVAHQPVSVAIEAGGRA 279 Query: 932 FQLYHSGVFTGRCGTQLDHGVVAVGYGTENGVDYWIVRNSWGPSWGENGYIRLERNLANT 1111 FQLY SGVFTGRCGT+LDHGVVAVGYGTENGV+YWIVRNSWG +WGE+GYIR+ERN+ANT Sbjct: 280 FQLYKSGVFTGRCGTELDHGVVAVGYGTENGVNYWIVRNSWGSAWGESGYIRMERNVANT 339 Query: 1112 TTGKCGIAIEPSY------XXXXXXXXXXXXXXXXXXXXXXXXCDDYFSCPEGSTCCCIY 1273 TGKCGIAI+PSY CDDYFSCP+G+TCCCIY Sbjct: 340 
KTGKCGIAIQPSYPTKKGANPPNPGPSPPSPVNPPPPVSPSTVCDDYFSCPDGNTCCCIY 399 Query: 1274 QYGSYCFGWGCCPLEAATXXXXXXXXXXXXYPICDVDAGTCQMSKDNPL 1420 +Y YCFGWGCCPLE+AT YP+CD+ AGTC++SKDNPL Sbjct: 400 EYSGYCFGWGCCPLESATCCDDHNSCCPHEYPVCDLKAGTCRLSKDNPL 448 Score = 208 bits (529), Expect(2) = e-177 Identities = 96/142 (67%), Positives = 117/142 (82%), Gaps = 1/142 (0%) Frame = +1 Query: 97 DMSIIDYNINHGGVYPTG-EHEVKALYESWLVKHGKAYNALGEREKRFEIFKDNIQFINE 273 DMSI+DYNI HG YP + +V+ +YE WLV+HGKAYNALGE+EKRFEIFKDN++FI+E Sbjct: 25 DMSIVDYNIKHGTKYPLRTDSQVRRMYEMWLVEHGKAYNALGEKEKRFEIFKDNLRFIDE 84 Query: 274 HNAQDGTYKLGLNRFADLTNEEYRSLFVSGRMDRAGRLTSPRSNDRYAFRTGDDLPESVD 453 HN+ D +YK+GLNRFADLTNEEY+++F+ +M+R R RS RY F+ GDDLPE+VD Sbjct: 85 HNSVDRSYKVGLNRFADLTNEEYKAMFLGTKMERKNRFLGTRS-QRYLFKDGDDLPENVD 143 Query: 454 WRAKGAVPPVKDQGQCGSCWAF 519 WR KGAV PVKDQGQCGSCWAF Sbjct: 144 WREKGAVVPVKDQGQCGSCWAF 165 >ref|XP_002285299.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera] Length = 469 Score = 443 bits (1140), Expect(2) = e-173 Identities = 206/283 (72%), Positives = 226/283 (79%) Frame = +2 Query: 572 GSCWAFSTIAAVEGINQIATGELITLSEQELVDCDTAYNQGCNGGLMDYAFDFIVKNGGI 751 GSCWAFSTIAAVEGINQIATG+LI+LSEQELVDCD +YNQGCNGGLMDYAF+FI+ NGGI Sbjct: 164 GSCWAFSTIAAVEGINQIATGDLISLSEQELVDCDKSYNQGCNGGLMDYAFEFIINNGGI 223 Query: 752 DTEEDYTYTARDDMCDQYRKNARVVSIDGYEDVHGNDEKSLKKALAHQPISVAIEAGGRA 931 D+EEDY Y A D CD RKNARVVSIDGYEDV NDE+SLKKA+A+QP+SVAIEAGGRA Sbjct: 224 DSEEDYPYRAADTTCDPNRKNARVVSIDGYEDVPQNDERSLKKAVANQPVSVAIEAGGRA 283 Query: 932 FQLYHSGVFTGRCGTQLDHGVVAVGYGTENGVDYWIVRNSWGPSWGENGYIRLERNLANT 1111 FQLY SGVFTG+CGTQLDHGVVAVGYGTEN VDYWIVRNSWGP+WGE+GYI+LERNLA T Sbjct: 284 FQLYQSGVFTGQCGTQLDHGVVAVGYGTENSVDYWIVRNSWGPNWGESGYIKLERNLAGT 343 Query: 1112 TTGKCGIAIEPSYXXXXXXXXXXXXXXXXXXXXXXXXCDDYFSCPEGSTCCCIYQYGSYC 1291 TGKCGIAIEPSY CD+Y++CPE STCCCIY+Y +C Sbjct: 344 ETGKCGIAIEPSYPIKNGQNPPNPGPSPPSPSKPSVVCDEYYTCPEESTCCCIYEYAGFC 403 Query: 1292 FGWGCCPLEAATXXXXXXXXXXXXYPICDVDAGTCQMSKDNPL 1420 F WGCCPLE AT YP+CDVDAGTCQMSK NPL 
Sbjct: 404 FEWGCCPLEGATCCDDHYSCCPHEYPVCDVDAGTCQMSKGNPL 446 Score = 192 bits (487), Expect(2) = e-173 Identities = 93/141 (65%), Positives = 110/141 (78%) Frame = +1 Query: 97 DMSIIDYNINHGGVYPTGEHEVKALYESWLVKHGKAYNALGEREKRFEIFKDNIQFINEH 276 DMSII Y + + EV A+YE+WLVKHGK+YNALGERE+RFEIFKDN++FI EH Sbjct: 32 DMSIISYGDR---LEKRTDAEVMAVYEAWLVKHGKSYNALGERERRFEIFKDNLRFIEEH 88 Query: 277 NAQDGTYKLGLNRFADLTNEEYRSLFVSGRMDRAGRLTSPRSNDRYAFRTGDDLPESVDW 456 NA + TYK+GLNRFADLTNEEYRS ++ R + L + R +DRY+FR G+DLPESVDW Sbjct: 89 NAVNRTYKVGLNRFADLTNEEYRSRYLGRRDETRRGLRASRVSDRYSFRAGEDLPESVDW 148 Query: 457 RAKGAVPPVKDQGQCGSCWAF 519 R KGAV PVKDQG CGSCWAF Sbjct: 149 REKGAVVPVKDQGNCGSCWAF 169 >ref|XP_002313136.1| predicted protein [Populus trichocarpa] gi|222849544|gb|EEE87091.1| predicted protein [Populus trichocarpa] Length = 477 Score = 437 bits (1125), Expect(2) = e-172 Identities = 203/286 (70%), Positives = 225/286 (78%), Gaps = 4/286 (1%) Frame = +2 Query: 572 GSCWAFSTIAAVEGINQIATGELITLSEQELVDCDTAYNQGCNGGLMDYAFDFIVKNGGI 751 GSCWAFST+ AVEGINQI TG L +LSEQELVDCD YNQGCNGGLMDYAF+FI+KNGGI Sbjct: 160 GSCWAFSTVGAVEGINQIVTGNLTSLSEQELVDCDKVYNQGCNGGLMDYAFEFIMKNGGI 219 Query: 752 DTEEDYTYTARDDMCDQYRKNARVVSIDGYEDVHGNDEKSLKKALAHQPISVAIEAGGRA 931 DTEEDY Y A D MCD RKNARVV+IDGYEDV NDEKSL+KA+A+QP+SVAIEAGGRA Sbjct: 220 DTEEDYPYKAVDSMCDPNRKNARVVTIDGYEDVPQNDEKSLRKAVANQPVSVAIEAGGRA 279 Query: 932 FQLYHSGVFTGRCGTQLDHGVVAVGYGTENGVDYWIVRNSWGPSWGENGYIRLERNLANT 1111 FQLY SGVFTG CGTQLDHGVVAVGYGTENGVDYW+VRNSWGP+WGENGYIR+ERN+A+T Sbjct: 280 FQLYQSGVFTGSCGTQLDHGVVAVGYGTENGVDYWVVRNSWGPAWGENGYIRMERNVAST 339 Query: 1112 TTGKCGIAIEPSY----XXXXXXXXXXXXXXXXXXXXXXXXCDDYFSCPEGSTCCCIYQY 1279 TGKCGIA+E SY CDDY+SCP GSTCCCIY Y Sbjct: 340 ETGKCGIAMEASYPTKKGANPPNPGPSPPSPVNPSPPPSSECDDYYSCPAGSTCCCIYPY 399 Query: 1280 GSYCFGWGCCPLEAATXXXXXXXXXXXXYPICDVDAGTCQMSKDNP 1417 G YCFGWGCCPLE+AT YP+CD++AGTC+MSK+NP Sbjct: 400 GDYCFGWGCCPLESATCCDDHNSCCPHEYPVCDLEAGTCRMSKNNP 445 Score = 196 bits (499), Expect(2) = e-172 Identities = 93/142 (65%), 
Positives = 111/142 (78%), Gaps = 1/142 (0%) Frame = +1 Query: 97 DMSIIDYNINHGGVYPTGEHEVKALYESWLVKHGKAYNALGEREKRFEIFKDNIQFINEH 276 DMSIIDYN+ HG V E E LYE WLVK+GKAYNALGE+E+RFEIFKDN++F+++H Sbjct: 24 DMSIIDYNLKHGQVPERTEAETLRLYEMWLVKYGKAYNALGEKERRFEIFKDNLKFVDQH 83 Query: 277 NAQDG-TYKLGLNRFADLTNEEYRSLFVSGRMDRAGRLTSPRSNDRYAFRTGDDLPESVD 453 N+ +YKLGLN+FADL+NEEYR+ ++ RMD RL + RY F+ GDDLPESVD Sbjct: 84 NSVGNPSYKLGLNKFADLSNEEYRAAYLGTRMDGKRRLLGGPKSARYLFKDGDDLPESVD 143 Query: 454 WRAKGAVPPVKDQGQCGSCWAF 519 WR KGAV PVKDQGQCGSCWAF Sbjct: 144 WREKGAVAPVKDQGQCGSCWAF 165 >ref|XP_002298740.1| predicted protein [Populus trichocarpa] gi|222845998|gb|EEE83545.1| predicted protein [Populus trichocarpa] Length = 455 Score = 431 bits (1109), Expect(2) = e-172 Identities = 201/288 (69%), Positives = 222/288 (77%), Gaps = 6/288 (2%) Frame = +2 Query: 572 GSCWAFSTIAAVEGINQIATGELITLSEQELVDCDTAYNQGCNGGLMDYAFDFIVKNGGI 751 GSCWAFST+ AVEGINQI TG L +LSEQELVDCD YN GCNGGLMDYAFDFI++NGGI Sbjct: 136 GSCWAFSTVGAVEGINQIVTGNLTSLSEQELVDCDKTYNLGCNGGLMDYAFDFIIENGGI 195 Query: 752 DTEEDYTYTARDDMCDQYRKNARVVSIDGYEDVHGNDEKSLKKALAHQPISVAIEAGGRA 931 DTEEDY Y A D MCD RKNARVV+IDGYEDV NDEKSLKKA+A+QP+SVAIEAGGR Sbjct: 196 DTEEDYPYKAIDSMCDPNRKNARVVTIDGYEDVPQNDEKSLKKAVANQPVSVAIEAGGRG 255 Query: 932 FQLYHSGVFTGRCGTQLDHGVVAVGYGTENGVDYWIVRNSWGPSWGENGYIRLERNLANT 1111 FQLY SGVFTG CGTQLDHGVV VGYGTE+GVDYWIVRNSWGP+WGENGYIR+ER++A+T Sbjct: 256 FQLYQSGVFTGSCGTQLDHGVVTVGYGTEHGVDYWIVRNSWGPAWGENGYIRMERDVAST 315 Query: 1112 TTGKCGIAIEPSY------XXXXXXXXXXXXXXXXXXXXXXXXCDDYFSCPEGSTCCCIY 1273 TGKCGIA+E SY CDDY+SCP GSTCCCIY Sbjct: 316 ETGKCGIAMEASYPTKKSANPPNPGPSPPSPVNPPPPEKPSSECDDYYSCPAGSTCCCIY 375 Query: 1274 QYGSYCFGWGCCPLEAATXXXXXXXXXXXXYPICDVDAGTCQMSKDNP 1417 QYG YCFGWGCCPLE+AT YP+CD++AGTC+MSK NP Sbjct: 376 QYGDYCFGWGCCPLESATCCDDHNSCCPHEYPVCDLEAGTCRMSKSNP 423 Score = 202 bits (515), Expect(2) = e-172 Identities = 94/141 (66%), Positives = 114/141 (80%), Gaps = 1/141 (0%) Frame = +1 Query: 100 
MSIIDYNINHGGVYPTGEHEVKALYESWLVKHGKAYNALGEREKRFEIFKDNIQFINEHN 279 MSIIDYNI HG V E E + +YE WLVKHG+AYNALGE+E+RFEIFKDN++FI+EHN Sbjct: 1 MSIIDYNIKHGQVPERTEAETRRIYEMWLVKHGRAYNALGEKERRFEIFKDNLKFIDEHN 60 Query: 280 AQDG-TYKLGLNRFADLTNEEYRSLFVSGRMDRAGRLTSPRSNDRYAFRTGDDLPESVDW 456 + +YKLGLN+FADL+N+EYRS+++ RMD GRL ++RY F+ GDDLPE+VDW Sbjct: 61 SVGNPSYKLGLNKFADLSNDEYRSVYLGTRMDGKGRLLGGPKSERYLFKEGDDLPETVDW 120 Query: 457 RAKGAVPPVKDQGQCGSCWAF 519 R KGAV PVKDQGQCGSCWAF Sbjct: 121 REKGAVAPVKDQGQCGSCWAF 141 >gb|ABQ10202.1| cysteine protease Cp4 [Actinidia deliciosa] Length = 463 Score = 431 bits (1109), Expect(2) = e-169 Identities = 201/283 (71%), Positives = 219/283 (77%) Frame = +2 Query: 572 GSCWAFSTIAAVEGINQIATGELITLSEQELVDCDTAYNQGCNGGLMDYAFDFIVKNGGI 751 GSCWAFSTI+AVEGINQI TGELI+LSEQELVDCD +YN GCNGGLMDY F FI+ NGGI Sbjct: 156 GSCWAFSTISAVEGINQIVTGELISLSEQELVDCDKSYNMGCNGGLMDYGFQFIINNGGI 215 Query: 752 DTEEDYTYTARDDMCDQYRKNARVVSIDGYEDVHGNDEKSLKKALAHQPISVAIEAGGRA 931 DTEEDY Y A D CDQ+RKNARVVSI+GYEDV +DE SLKKA+A+QP+SVAIEAGGRA Sbjct: 216 DTEEDYPYRAVDGTCDQFRKNARVVSINGYEDVPEDDENSLKKAVANQPVSVAIEAGGRA 275 Query: 932 FQLYHSGVFTGRCGTQLDHGVVAVGYGTENGVDYWIVRNSWGPSWGENGYIRLERNLANT 1111 FQLY SGVFTG CGT LDHGVVAVGYGTENGVDYW VRNSWGP WGENGYI+LERN+ N Sbjct: 276 FQLYESGVFTGHCGTNLDHGVVAVGYGTENGVDYWTVRNSWGPKWGENGYIKLERNI-NA 334 Query: 1112 TTGKCGIAIEPSYXXXXXXXXXXXXXXXXXXXXXXXXCDDYFSCPEGSTCCCIYQYGSYC 1291 T+GKCGIA SY CDDY+SCPEGSTCCC+YQYG +C Sbjct: 335 TSGKCGIASMASYPTKTGSNPPNPGPSPPTPVNPPTVCDDYYSCPEGSTCCCVYQYGDFC 394 Query: 1292 FGWGCCPLEAATXXXXXXXXXXXXYPICDVDAGTCQMSKDNPL 1420 GWGCCPLE+AT YPICD+D GTC MSKDNPL Sbjct: 395 IGWGCCPLESATCCDDHSSCCPHEYPICDLDGGTCLMSKDNPL 437 Score = 192 bits (488), Expect(2) = e-169 Identities = 92/141 (65%), Positives = 109/141 (77%) Frame = +1 Query: 97 DMSIIDYNINHGGVYPTGEHEVKALYESWLVKHGKAYNALGEREKRFEIFKDNIQFINEH 276 DMSII Y+ H + E A+YE WL HGKAYNA+GE+E+RFEIFKDN++F++EH Sbjct: 24 DMSIISYDQTHPP--QRTDAEAMAIYEKWLTTHGKAYNAIGEKERRFEIFKDNLRFVDEH 81 Query: 277 
NAQDGTYKLGLNRFADLTNEEYRSLFVSGRMDRAGRLTSPRSNDRYAFRTGDDLPESVDW 456 NA G+Y++GLNRFADLTNEEYRS+F+ G M+ R S +S DRYAFR GD LP SVDW Sbjct: 82 NAVAGSYRVGLNRFADLTNEEYRSMFLGGNMEMKERSASTKS-DRYAFRAGDKLPGSVDW 140 Query: 457 RAKGAVPPVKDQGQCGSCWAF 519 R KGAV PVKDQGQCGSCWAF Sbjct: 141 REKGAVSPVKDQGQCGSCWAF 161
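The Identities, Positives and Gaps percentages in the alignments above appear to be truncated to whole numbers rather than rounded: 96/142 is about 67.6% but is reported as 67%. The following is a minimal Python sketch, assuming that truncation rule (it is inferred from the values in this report, not taken from the BLAST documentation). It parses one statistics line copied from the report and recomputes the percentages. The names `FIELD`, `parse_stats` and `truncated_percent` are illustrative only and are not part of any BLAST tool.

```python
import re

# One HSP statistics line copied verbatim from the report above.
STATS = "Identities = 96/142 (67%), Positives = 117/142 (82%), Gaps = 1/142 (0%)"

# Each field has the shape: Label = count/length (percent%)
FIELD = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_stats(line):
    """Return {label: (count, length, reported_percent)} for one statistics line."""
    return {
        m.group(1): (int(m.group(2)), int(m.group(3)), int(m.group(4)))
        for m in FIELD.finditer(line)
    }


def truncated_percent(count, length):
    """Whole-number percentage with the fraction dropped (assumed rule)."""
    return 100 * count // length


stats = parse_stats(STATS)
for label, (count, length, reported) in stats.items():
    # Compare the reported value with the recomputed, truncated value.
    print(label, f"{count}/{length}", reported, truncated_percent(count, length))
```

Under this assumption the recomputed values match the reported ones for this line, whereas ordinary rounding would give 68% for the Identities field.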