BLASTX nr result
ID: Cephaelis21_contig00006515
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00006515
         (1232 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_002275899.1| PREDICTED: uncharacterized protein LOC100262...   299   1e-78
ref|XP_004147022.1| PREDICTED: uncharacterized protein LOC101214...   271   2e-70
gb|ADN34011.1| translation initiation factor [Cucumis melo subsp...   267   5e-69
ref|NP_195583.1| glycine-rich protein [Arabidopsis thaliana] gi|...   250   6e-64
ref|XP_003537798.1| PREDICTED: uncharacterized protein LOC100798...   248   2e-63

>ref|XP_002275899.1| PREDICTED: uncharacterized protein LOC100262348 [Vitis vinifera]
          Length = 401

 Score = 299 bits (765), Expect = 1e-78
 Identities = 197/424 (46%), Positives = 235/424 (55%), Gaps = 59/424 (13%)
 Frame = -3

Query: 1170 MAATVSAWAKPGAWALDSEEHEAELVDQKRDDFHSNG-----------NANGDYPTLXXX 1024
            MAATVS W K GAWALDSEEHE EL+ Q+RDD NG A+ D+PTL
Sbjct: 1    MAATVSPWGKAGAWALDSEEHEDELLQQQRDD-KVNGEFSGGEGRQAPEASADFPTLATA 59

Query: 1023 XXXXXXXXKGQTLSLQEFTAYGTTAKQPLSQANKGLTPDELLSLPTGPRQRSAEELERSK 844
            KGQTLSL EF+A+G SQ KGLT ++L+ LPTGPRQRSAEEL+R +
Sbjct: 60   AATKSKKKKGQTLSLSEFSAFGAGKSAQPSQ-TKGLTHEDLMMLPTGPRQRSAEELDRGR 118

Query: 843  ---GFRSYGT-------------------------GDEQPRR---QRDSNRDFAPPSRAD 757
            GFRSYG+ G E+ R+ RDS+R+ A PSRAD
Sbjct: 119  LGGGFRSYGSNGSYEGGRSRYGGGEDSANPRWGPRGSEERRQGGFGRDSSRELA-PSRAD 177

Query: 756  EIDDWGAAKKFTAPNXXXXXXXXXXXXXXFSDSQSRADEVDNWASNKTFVPSEGRR---- 589
            EIDDWGAAKK T N DSQSRADE +W SNK+F PSEGRR
Sbjct: 178  EIDDWGAAKKSTVGNGFERRDRGGFF-----DSQSRADESASWVSNKSFTPSEGRRFGGG 232

Query: 588  ------NERRGVFDANS--SGGADSDNWVKRKEEEGRKIGSLGGSFDXXXXXXXXXXXXX 433
            ERRG FD+ S GGADS++W ++KEE G+ GS
Sbjct: 233  GGFESLRERRGGFDSASDGGGGADSESWGRKKEEGS---GNANGS--------------- 274

Query: 432  XXXXXXXXXXXGKKREESGGRPRLNLQPRTLPLGDGQQQNNESLVKPKGSSPFGEARPRE 253
            +G RP+L LQPRT+P+ DGQQ + S+ KPKG +PFGEARPRE
Sbjct: 275  -----------------AGSRPKLILQPRTVPVNDGQQPGSGSVAKPKGPNPFGEARPRE 317

Query: 252  EVLKEKGHDRKEIEEMLESEKIKE-----IAEERPLAFAKRGFGSGNWRGSLQEDRSERA 88
            EVL EKG D KEIEE LES K+K+ + + +F KR FGSGN R SL E RSE++
Sbjct: 318  EVLAEKGQDWKEIEEKLESVKLKDVGSPGVGQTDGPSFGKRSFGSGNARASLPESRSEKS 377

Query: 87   WRKP 76
            WRKP
Sbjct: 378  WRKP 381

>ref|XP_004147022.1| PREDICTED: uncharacterized protein LOC101214573 [Cucumis sativus]
 gi|449489695|ref|XP_004158389.1| PREDICTED: uncharacterized LOC101214573 [Cucumis sativus]
          Length = 405

 Score = 271 bits (693), Expect = 2e-70
 Identities = 174/418 (41%), Positives = 221/418 (52%), Gaps = 41/418 (9%)
 Frame = -3

Query: 1170 MAATVSAWAKPGAWALDSEEHEAELVDQKRDDFHSNGNANGDYPTLXXXXXXXXXXXKGQ 991
            MAATVS W KPGAWALD+EEHEAEL+ + + + D+P+L KGQ
Sbjct: 1    MAATVSPWGKPGAWALDAEEHEAELLKDQEEQSRHQEEPSADFPSLAAAAATKPKKKKGQ 60

Query: 990  TLSLQEFTAYGTTAKQPLSQANKGLTPDELLSLPTGPRQRSAEELERSK---GFRSYGTG 820
            ++ L EF YG S KGLT ++L+ LPTGPRQR+AEE++R++ GF+S+G
Sbjct: 61   SIPLSEFQTYGGPKPSAQSSDPKGLTAEDLMMLPTGPRQRTAEEMDRNRLGGGFKSWGQN 120

Query: 819  DEQPRRQRDSNRDFAP----------------------------PSRADEIDDWGAAKKF 724
            R R SN + +P PSRADEIDDWGA KK
Sbjct: 121  SLYDRGNRYSNSEDSPNSRRSSRVFDESRRTNDGSDREFRRESLPSRADEIDDWGAGKKP 180

Query: 723  TAPNXXXXXXXXXXXXXXFSDSQSRADEVDNWASNKTFVPSEGRRN-----ERRGVFDAN 559
            N S S S+ADE D+W S+K+F PSEGRR+ ERRG F
Sbjct: 181  MVGNGFERRERGGGGGFFDSHS-SKADESDSWVSSKSFTPSEGRRSGGFDRERRGGFPT- 238

Query: 558  SSGGADSDNWVKRKEEEGRKIGSLGGSFDXXXXXXXXXXXXXXXXXXXXXXXXGKKREES 379
            S GGADSDNW ++ + IG GGS D R
Sbjct: 239  SGGGADSDNWGRKPDGARGGIGENGGSADSENWGKRSEGV----------------RSGI 282

Query: 378  GGRPRLNLQPRTLPLGDGQQQNNESLVKPKGSSPFGEARPREEVLKEKGHDRKEIEEMLE 199
            G RPRLNLQPR++PL +G Q+ + VKPKGS+PFG ARPREEVL EKG D K+I+E LE
Sbjct: 283  GERPRLNLQPRSIPLNNGNQEASGVAVKPKGSNPFGNARPREEVLAEKGQDWKKIDEQLE 342

Query: 198  SEKIKEIAEERPLAFA-----KRGFGSGNWRGSLQEDRSERAWRKPSEIMDARPRSAK 40
            S KIK+ E + K+GFG+ + R S R WRKP E +++RP+SA+
Sbjct: 343  SVKIKDTVERAETSSGASFERKKGFGARSGR----SPDSGRTWRKP-ESVESRPQSAE 395

>gb|ADN34011.1| translation initiation factor [Cucumis melo subsp. melo]
          Length = 405

 Score = 267 bits (682), Expect = 5e-69
 Identities = 172/418 (41%), Positives = 219/418 (52%), Gaps = 41/418 (9%)
 Frame = -3

Query: 1170 MAATVSAWAKPGAWALDSEEHEAELVDQKRDDFHSNGNANGDYPTLXXXXXXXXXXXKGQ 991
            MAATVS W KPGAWALD+EEHEAEL+ ++D + D+P+L KGQ
Sbjct: 1    MAATVSPWGKPGAWALDAEEHEAELLKDQQDQSRHQSEPSADFPSLAAAAATKPKKKKGQ 60

Query: 990  TLSLQEFTAYGTTAKQPLSQANKGLTPDELLSLPTGPRQRSAEELERSK---GFRSYGTG 820
            ++ L EF YG S KGLT ++L+ LPTGPRQR+AEE++R++ GF+S+G
Sbjct: 61   SIPLSEFQTYGGPRPAAQSTDPKGLTAEDLMMLPTGPRQRTAEEMDRNRLGGGFKSWGQN 120

Query: 819  DEQPRRQRDSNRDFAP----------------------------PSRADEIDDWGAAKKF 724
            R R SN + +P PSRADEIDDWGA KK
Sbjct: 121  SLYDRGNRYSNSEDSPNSRRSSRVFDESRRSNDGSDREFRRESLPSRADEIDDWGAGKKP 180

Query: 723  TAPNXXXXXXXXXXXXXXFSDSQSRADEVDNWASNKTFVPSEGRRN-----ERRGVFDAN 559
            N S S S+ADE D+W S+K+F PSEGRR+ ERRG F
Sbjct: 181  MMGNGFERRERGGGGGFFDSHS-SKADESDSWVSSKSFTPSEGRRSGGFDRERRGGFPT- 238

Query: 558  SSGGADSDNWVKRKEEEGRKIGSLGGSFDXXXXXXXXXXXXXXXXXXXXXXXXGKKREES 379
            S GGADSDNW ++ + +G GG D R
Sbjct: 239  SGGGADSDNWGRKSDGARAGMGENGGGADSDNWGKKSEGV----------------RSGI 282

Query: 378  GGRPRLNLQPRTLPLGDGQQQNNESLVKPKGSSPFGEARPREEVLKEKGHDRKEIEEMLE 199
            G RPRLNLQPR++PL +G Q+ + VKPKGS+PFG ARPREEVL EKG D K+I+E L
Sbjct: 283  GERPRLNLQPRSIPLNNGNQEASGVAVKPKGSNPFGNARPREEVLAEKGQDWKKIDEQLG 342

Query: 198  SEKIKEIAEERPLAFA-----KRGFGSGNWRGSLQEDRSERAWRKPSEIMDARPRSAK 40
            S KIK+ E + ++GFG + R S R+WRKP E D+RP+SA+
Sbjct: 343  SMKIKDTVERAETSSGASFERRKGFGVRSGR----SPDSGRSWRKP-ESADSRPQSAE 395

>ref|NP_195583.1| glycine-rich protein [Arabidopsis thaliana]
 gi|4467158|emb|CAB37527.1| putative protein [Arabidopsis thaliana]
 gi|7270854|emb|CAB80535.1| putative protein [Arabidopsis thaliana]
 gi|17065142|gb|AAL32725.1| putative protein [Arabidopsis thaliana]
 gi|20259814|gb|AAM13254.1| putative protein [Arabidopsis thaliana]
 gi|332661567|gb|AEE86967.1| glycine-rich protein [Arabidopsis thaliana]
          Length = 452

 Score = 250 bits (638), Expect = 6e-64
 Identities = 176/418 (42%), Positives = 216/418 (51%), Gaps = 52/418 (12%)
 Frame = -3

Query: 1167 AATVSAWAKPGAWALDSEEHEAELVDQKRD-DFHSNGNANGDYPTLXXXXXXXXXXXKGQ 991
            AA S WAKPGAWAL++EEHEAEL Q + S+ + D+P+L KGQ
Sbjct: 3    AAVSSVWAKPGAWALEAEEHEAELKQQPSPTNQKSSAEDSSDFPSLAAAATTKTKKKKGQ 62

Query: 990  TLSLQEFTAYGTTAKQPLSQANKGLTPDELLSLPTGPRQRSAEELERSK---GFRSYGTG 820
            T+SL EF YGT +P Q + LT EL++LPTGPR+RSAEEL+RSK GFRSYG G
Sbjct: 63   TISLAEFATYGTAKAKPAPQTER-LTQAELVALPTGPRERSAEELDRSKLGGGFRSYGGG 121

Query: 819  -----------------DEQPRRQRDSNRDFAP-----PSRADEIDDWGAAKKFTAPNXX 706
            ++ RR NRD P PSRADE D+W AAKK + N
Sbjct: 122  RYGDESSSSRWGSSRVSEDGERRGGGFNRDREPSRDSGPSRADEDDNWAAAKKPISGNGF 181

Query: 705  XXXXXXXXXXXXFSDSQSRADEVDNWASNKTFVPSE--------GRRNERRGVFDANS-- 556
            S SQS+ADEVD+W S K P G R E+RG F++ S
Sbjct: 182  ERRERGSGGGFFESQSQSKADEVDSWVSTKPSEPRRFVSSNGGGGDRFEKRGSFESLSRN 241

Query: 555  -------SGGADSDNWVKRKEEEGRKIGSLGGSFDXXXXXXXXXXXXXXXXXXXXXXXXG 397
            GG++SD W +R+EE G GS S
Sbjct: 242  RDSQYGGGGGSESDTWGRRREESGAANGSPPPS--------------------------- 274

Query: 396  KKREESGGRPRLNLQPRTLPLGDGQQQNNESLV-----KPKGSSPFGEARPREEVLKEKG 232
            G RPRL LQPRTLP+ + ES V KPKG++PFG ARPREEVL EKG
Sbjct: 275  -----GGSRPRLVLQPRTLPVAVVEVVKPESPVLVIVEKPKGANPFGNARPREEVLAEKG 329

Query: 231  HDRKEIEEMLESEKIKEIAE--ERP--LAFAKRGFGSGNWRGSLQEDRSERAWRKPSE 70
            D KEI+E LE+EK+K+IA E+P + K GFG GN G E+R ER+WRK +E
Sbjct: 330  QDWKEIDEKLEAEKLKDIAAAMEKPNEKSTGKMGFGLGN--GRKDEERIERSWRKSTE 385

>ref|XP_003537798.1| PREDICTED: uncharacterized protein LOC100798129 [Glycine max]
          Length = 377

 Score = 248 bits (634), Expect = 2e-63
 Identities = 180/416 (43%), Positives = 218/416 (52%), Gaps = 39/416 (9%)
 Frame = -3

Query: 1170 MAATVS-AWAKPGAWALDSEEHEAELVDQKRDDFHSNGNANGDYPTLXXXXXXXXXXXKG 994
            MAATVS AW+KPGAWALDSEEHEAEL+ Q D D+P+L
Sbjct: 1    MAATVSSAWSKPGAWALDSEEHEAELLQQNND------KPLADFPSLAAAAAKPKKKK-A 53

Query: 993  QTLSLQEFTAYGTTAKQPLSQANKGLTPDELLSLPTGPRQRSAEELERSK---GFRSYGT 823
            QT SL EFTA T+ + + LPTGPRQR+AEEL+R++ GFR+YG
Sbjct: 54   QTYSLAEFTAKPDTS----------FADQDPVVLPTGPRQRTAEELDRTRLGGGFRNYGD 103

Query: 822  ---------GDE-------------QPRR----QRDSNRDFAPPSRADEIDDWGAAKKFT 721
            GDE +PRR RDSNR+ PPSRADE D+W A+KK +
Sbjct: 104  RPNRNNSGGGDESSNSRWGSSRVSDEPRRNGFGARDSNREL-PPSRADETDNWAASKKPS 162

Query: 720  APNXXXXXXXXXXXXXXFSDSQSRADEVDNWASNKTFVPSEGRRNERRG-----VFDANS 556
            DSQSRADE D+W SNK+FVPSEGRR G V S
Sbjct: 163  GGGFERRERDKGGFF----DSQSRADESDSWVSNKSFVPSEGRRFSSNGGGERRVVGFGS 218

Query: 555  SGGADSDNWVKRKEEEGRKIGSLGGSFDXXXXXXXXXXXXXXXXXXXXXXXXGKKREESG 376
            SGGADSDNW +K+ E IGS + G
Sbjct: 219  SGGADSDNWNNKKKSESN-IGS-------------------------------SESVGVG 246

Query: 375  GRPRLNLQPRTLPLGDGQQQNNESLVKPKGSSPFGEARPREEVLKEKGHDRKEIEEMLES 196
            GRP+L LQPRTL + + +++ KPKG +PFGEARPRE+VL EKG D K+I+E LES
Sbjct: 247  GRPKLVLQPRTLSVSN----EGDNVGKPKGVNPFGEARPREQVLAEKGQDWKKIDEQLES 302

Query: 195  EKIKEIAEERPLAFAKRGFGSGNWRGS----LQEDRSERAWRKPSEIMDARPRSAK 40
            KIKE + F KRGFGS N G L E R+ER+WRKP + D RP+SA+
Sbjct: 303  VKIKETSGGGGDGFGKRGFGSSNGGGGGRAILPESRTERSWRKP-QSDDDRPKSAE 357