BLASTX nr result
ID: Angelica23_contig00003413
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Angelica23_contig00003413 (1862 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002282779.2| PREDICTED: xaa-Pro dipeptidase-like [Vitis v... 795 0.0 ref|XP_002514160.1| xaa-pro dipeptidase, putative [Ricinus commu... 791 0.0 ref|XP_002308288.1| predicted protein [Populus trichocarpa] gi|2... 790 0.0 ref|NP_001241060.1| uncharacterized protein LOC100793240 [Glycin... 768 0.0 ref|NP_194678.2| Xaa-Pro dipeptidase [Arabidopsis thaliana] gi|1... 761 0.0 >ref|XP_002282779.2| PREDICTED: xaa-Pro dipeptidase-like [Vitis vinifera] gi|297738698|emb|CBI27943.3| unnamed protein product [Vitis vinifera] Length = 509 Score = 795 bits (2054), Expect = 0.0 Identities = 387/497 (77%), Positives = 434/497 (87%), Gaps = 2/497 (0%) Frame = +3 Query: 24 MGSSSLTPPPEVPMKLHLLNRNKLITSLGDHLSTSSRPHQGFVLLQGGEEQTRYCTDHAQ 203 M SSSLTPP EVPM+LH +NR KL+ SL HL+ S+ P GFVLLQGGEEQTR+ TDHA+ Sbjct: 10 MASSSLTPP-EVPMELHAINRGKLVKSLLQHLTESTHPLHGFVLLQGGEEQTRHDTDHAE 68 Query: 204 LFRQESYFAYLFGVQEPGFYGAIDIATGKSLLFAPRLPSEYAVWLGEIKPLSYYKERYMV 383 LFRQESYFAYLFGV+EPGFYGAIDIATGKS+LFAPRLP+EYAVWLGEIKPLSY+KERYMV Sbjct: 69 LFRQESYFAYLFGVREPGFYGAIDIATGKSILFAPRLPAEYAVWLGEIKPLSYFKERYMV 128 Query: 384 DLAFYTDEIVTTLLNQSQGVAQPLLFLLHGLNTDSNNYSKPAEFEGIEKFEKDLNTLHPI 563 YTDEI L ++ + +PLLFLLHGLNTDSNN+SKPAEFEGIEKF+ DLNTLHPI Sbjct: 129 SKVCYTDEIAGVLHDEYKEQGKPLLFLLHGLNTDSNNFSKPAEFEGIEKFKTDLNTLHPI 188 Query: 564 LTECRVLKTNLELAVIQYANDISSEAHVEVMKNVKAGMKEYQLESIFLHHTYMYGGCRHC 743 L ECRV K++LELA+IQYANDISSEAHVEVM+ GMKEYQLES+FLHHTYMYGGCRHC Sbjct: 189 LAECRVFKSDLELALIQYANDISSEAHVEVMRKTTVGMKEYQLESMFLHHTYMYGGCRHC 248 Query: 744 SYTCICATGGNSSVLHYGHAAAPNDRAFEDGDMALLDMGAEFKFYGSDITCSFPVNGKFT 923 SYTCICATGGNS+VLHYGHAAAPNDR FEDGDMALLDMGAE+ FYGSDITCSFPVNGKFT Sbjct: 249 SYTCICATGGNSAVLHYGHAAAPNDRTFEDGDMALLDMGAEYHFYGSDITCSFPVNGKFT 308 Query: 924 NDQRLIYNAVLDAHNAVISSMKPGVSWVDMHKLAEKIILESLKKGGLLVGDVHDMMAERV 1103 +DQRLIYNAVL AHN VIS+MKPGV+W+DMHKLAEKIIL+SLKKG ++VGDV DMM +R+ Sbjct: 309 SDQRLIYNAVLQAHNTVISAMKPGVNWIDMHKLAEKIILDSLKKGCIVVGDVDDMMVKRL 368 Query: 1104 GAVFMPHGLGHLLGIDTHDPGGYLKGAERPKEPGLKSLRTSRALLEGMVVTVEPGCYFID 1283 GAVFMPHGLGH LGIDTHD GGYL+G ERPKEPGLKSLRT R L EGMV+TVEPGCYFID Sbjct: 369 GAVFMPHGLGHFLGIDTHDTGGYLEGLERPKEPGLKSLRTVRDLQEGMVITVEPGCYFID 428 Query: 1284 ALLVPAMKSPNTTEFFSSE-INRFRGFGGVRIESDVYVNGHGCLNMTKCPREITEIESVM 1460 ALL PAM++ T++FF+ E I RF+ FGGVRIESDV+V +GC NMT PRE EIE+VM Sbjct: 429 ALLAPAMENSETSKFFNHEIIGRFKSFGGVRIESDVHVTSNGCKNMTNVPRETWEIEAVM 488 Query: 1461 AGGPWSI-KKSVYFENG 1508 AG PW + K S++ ENG Sbjct: 489 AGSPWPLDKSSIHSENG 505 >ref|XP_002514160.1| xaa-pro dipeptidase, putative [Ricinus communis] gi|223546616|gb|EEF48114.1| xaa-pro dipeptidase, putative [Ricinus communis] Length = 494 Score = 791 bits (2043), Expect = 0.0 Identities = 375/489 (76%), Positives = 434/489 (88%), Gaps = 1/489 (0%) Frame = +3 Query: 24 MGSSSLTPPPEVPMKLHLLNRNKLITSLGDHLSTSSRPHQGFVLLQGGEEQTRYCTDHAQ 203 M S+S PP+VPM+LH+ NR KL+ SL HL+ +SRP GFVLLQGGEEQTR+CTDH + Sbjct: 1 MASTSSLTPPKVPMELHVTNREKLLKSLRQHLTETSRPLHGFVLLQGGEEQTRHCTDHLE 60 Query: 204 LFRQESYFAYLFGVQEPGFYGAIDIATGKSLLFAPRLPSEYAVWLGEIKPLSYYKERYMV 383 LFRQESYFAYLFGV+EPGFYGAID+ATGKS+LFAPRL ++YAVWLGEIKPLSY++E Y+V Sbjct: 61 LFRQESYFAYLFGVKEPGFYGAIDVATGKSILFAPRLLADYAVWLGEIKPLSYFQESYVV 120 Query: 384 DLAFYTDEIVTTLLNQSQGVAQPLLFLLHGLNTDSNNYSKPAEFEGIEKFEKDLNTLHPI 563 ++ +YTDEIV L S+GVA+PLLFLLHGLNTDSNN+SKPAEFEGIEKFE DL TLHPI Sbjct: 121 NMVYYTDEIVQCLHEVSKGVAKPLLFLLHGLNTDSNNFSKPAEFEGIEKFETDLMTLHPI 180 Query: 564 LTECRVLKTNLELAVIQYANDISSEAHVEVMKNVKAGMKEYQLESIFLHHTYMYGGCRHC 743 LTECRVLK+ LELA+IQ+ANDISSEAH+EVM+ +AGMKEYQLESIFLHHTYMYGGCRHC Sbjct: 181 LTECRVLKSELELAIIQFANDISSEAHIEVMRRTQAGMKEYQLESIFLHHTYMYGGCRHC 240 Query: 744 SYTCICATGGNSSVLHYGHAAAPNDRAFEDGDMALLDMGAEFKFYGSDITCSFPVNGKFT 923 SYTCICATG NSSVLHYGHAAA NDR + GDMAL DMGAE+ FYGSDITCSFPVNG+FT Sbjct: 241 SYTCICATGENSSVLHYGHAAAANDRTLQYGDMALFDMGAEYSFYGSDITCSFPVNGRFT 300 Query: 924 NDQRLIYNAVLDAHNAVISSMKPGVSWVDMHKLAEKIILESLKKGGLLVGDVHDMMAERV 1103 +DQ L+YNAVLDAHNAVIS+M+PG+SW+DMHKLAE+ I+ESLK+G +LVGDV DMM ER+ Sbjct: 301 SDQSLVYNAVLDAHNAVISAMRPGISWLDMHKLAERTIIESLKRGLILVGDVDDMMTERL 360 Query: 1104 GAVFMPHGLGHLLGIDTHDPGGYLKGAERPKEPGLKSLRTSRALLEGMVVTVEPGCYFID 1283 GAVFMPHGLGH LGIDTHDPGGYLKG +R KEPGL+SLRT+R L EGMV+TVEPGCYFID Sbjct: 361 GAVFMPHGLGHFLGIDTHDPGGYLKGPKRSKEPGLRSLRTARELQEGMVITVEPGCYFID 420 Query: 1284 ALLVPAMKSPNTTEFFSSE-INRFRGFGGVRIESDVYVNGHGCLNMTKCPREITEIESVM 1460 A+L PA ++ +T++FF+SE I RF+GFGGVRIESDV+V +GC NMTKCPREI EIE+VM Sbjct: 421 AVLAPAKEASSTSKFFNSEAIGRFKGFGGVRIESDVHVTSNGCNNMTKCPREIWEIEAVM 480 Query: 1461 AGGPWSIKK 1487 AG PW + K Sbjct: 481 AGAPWPLNK 489 >ref|XP_002308288.1| predicted protein [Populus trichocarpa] gi|222854264|gb|EEE91811.1| predicted protein [Populus trichocarpa] Length = 488 Score = 790 bits (2041), Expect = 0.0 Identities = 377/487 (77%), Positives = 429/487 (88%), Gaps = 1/487 (0%) Frame = +3 Query: 24 MGSSSLTPPPEVPMKLHLLNRNKLITSLGDHLSTSSRPHQGFVLLQGGEEQTRYCTDHAQ 203 M SSS PPP+VPM+LH NR KL+ SL HL+ +SRP GFV LQGGEE+TRYCTDH + Sbjct: 1 MASSSRLPPPKVPMELHAKNREKLLKSLRQHLTETSRPLHGFVFLQGGEEKTRYCTDHIE 60 Query: 204 LFRQESYFAYLFGVQEPGFYGAIDIATGKSLLFAPRLPSEYAVWLGEIKPLSYYKERYMV 383 LFRQESYFAYLFGV+EPGFYGAIDIATGKS+LFAPRLP++YAVWLGEIKP S ++++YMV Sbjct: 61 LFRQESYFAYLFGVKEPGFYGAIDIATGKSILFAPRLPADYAVWLGEIKPSSCFQQQYMV 120 Query: 384 DLAFYTDEIVTTLLNQSQGVAQPLLFLLHGLNTDSNNYSKPAEFEGIEKFEKDLNTLHPI 563 + +YTDEIV L S + +PLLFLLHGLNTDSNN+SKPAEFEGIEKFEKDL TLHPI Sbjct: 121 SMVYYTDEIVGVLHELSNVLEKPLLFLLHGLNTDSNNFSKPAEFEGIEKFEKDLTTLHPI 180 Query: 564 LTECRVLKTNLELAVIQYANDISSEAHVEVMKNVKAGMKEYQLESIFLHHTYMYGGCRHC 743 LTECRVLK+++ELA+IQ+ANDISSEAHVEVM+ + GM+EYQLESIFLHHTYMYGGCRHC Sbjct: 181 LTECRVLKSDMELALIQFANDISSEAHVEVMRKTRVGMEEYQLESIFLHHTYMYGGCRHC 240 Query: 744 SYTCICATGGNSSVLHYGHAAAPNDRAFEDGDMALLDMGAEFKFYGSDITCSFPVNGKFT 923 SYTCICATG NS+VLHYGHAAAPNDR +DGDMAL DMGAE++FYGSDITCSFPVNGKFT Sbjct: 241 SYTCICATGENSAVLHYGHAAAPNDRTLQDGDMALFDMGAEYQFYGSDITCSFPVNGKFT 300 Query: 924 NDQRLIYNAVLDAHNAVISSMKPGVSWVDMHKLAEKIILESLKKGGLLVGDVHDMMAERV 1103 +DQ LIYNAVLDAHNAVIS+MKPGVSWVDMHKLAE++ILESLK G ++VG+V DMM ER+ Sbjct: 301 SDQSLIYNAVLDAHNAVISAMKPGVSWVDMHKLAEQLILESLKNGCIIVGNVDDMMIERL 360 Query: 1104 GAVFMPHGLGHLLGIDTHDPGGYLKGAERPKEPGLKSLRTSRALLEGMVVTVEPGCYFID 1283 GAVFMPHGLGH LGIDTHDPGGYLKG E+ K PGLK+LRT R L EGMV+TVEPGCYFID Sbjct: 361 GAVFMPHGLGHFLGIDTHDPGGYLKGLEKLKGPGLKALRTIRELQEGMVITVEPGCYFID 420 Query: 1284 ALLVPAMKSPNTTEFFSSE-INRFRGFGGVRIESDVYVNGHGCLNMTKCPREITEIESVM 1460 ALL PAM+S NT +FF E I+RF+GFGGVRIESDV+V GC NMTKCPR+I+EIE+VM Sbjct: 421 ALLAPAMESSNTAKFFDREAISRFKGFGGVRIESDVHVTAGGCQNMTKCPRQISEIEAVM 480 Query: 1461 AGGPWSI 1481 AG PW + Sbjct: 481 AGSPWPL 487 >ref|NP_001241060.1| uncharacterized protein LOC100793240 [Glycine max] gi|255637035|gb|ACU18850.1| unknown [Glycine max] Length = 477 Score = 768 bits (1984), Expect = 0.0 Identities = 369/476 (77%), Positives = 413/476 (86%), Gaps = 1/476 (0%) Frame = +3 Query: 63 MKLHLLNRNKLITSLGDHLSTSSRPHQGFVLLQGGEEQTRYCTDHAQLFRQESYFAYLFG 242 M+LH+ NR KL+TSL HLS SSR GFVLLQGGEEQTRY TDH +LFRQESYFAYLFG Sbjct: 1 MELHVKNREKLLTSLRQHLSDSSRSLHGFVLLQGGEEQTRYDTDHLELFRQESYFAYLFG 60 Query: 243 VQEPGFYGAIDIATGKSLLFAPRLPSEYAVWLGEIKPLSYYKERYMVDLAFYTDEIVTTL 422 V EPGFY AID+ATG S+LFAPRLPSEYAVWLGEIKPLSY+KE YMV ++DEI + L Sbjct: 61 VIEPGFYAAIDVATGNSILFAPRLPSEYAVWLGEIKPLSYFKEHYMVTTCCFSDEIESVL 120 Query: 423 LNQSQGVAQPLLFLLHGLNTDSNNYSKPAEFEGIEKFEKDLNTLHPILTECRVLKTNLEL 602 Q +PLLFLLHGLNTDS+NYSKPA+F+GI+KF+KDL TLHPILTECRV+K+ LE+ Sbjct: 121 QQHYQCSGKPLLFLLHGLNTDSDNYSKPAQFQGIDKFDKDLTTLHPILTECRVIKSELEI 180 Query: 603 AVIQYANDISSEAHVEVMKNVKAGMKEYQLESIFLHHTYMYGGCRHCSYTCICATGGNSS 782 A+IQYANDISSEAHVEVM+ K GMKEYQLESIFLHHTYMYGGCRHCSYTCICATG NS+ Sbjct: 181 ALIQYANDISSEAHVEVMRKTKVGMKEYQLESIFLHHTYMYGGCRHCSYTCICATGDNSA 240 Query: 783 VLHYGHAAAPNDRAFEDGDMALLDMGAEFKFYGSDITCSFPVNGKFTNDQRLIYNAVLDA 962 VLHYGHAAAPND+ EDGDMAL DMGAE+ FYGSDITCSFPVNGKFT+DQ LIY+AVLDA Sbjct: 241 VLHYGHAAAPNDKILEDGDMALFDMGAEYHFYGSDITCSFPVNGKFTSDQSLIYSAVLDA 300 Query: 963 HNAVISSMKPGVSWVDMHKLAEKIILESLKKGGLLVGDVHDMMAERVGAVFMPHGLGHLL 1142 HNAVIS+MKPG++WVDMH LAEK+ILESLK+G +++GDV DMMA R+GA FMPHGLGH L Sbjct: 301 HNAVISAMKPGINWVDMHILAEKVILESLKRGHVILGDVDDMMASRLGAAFMPHGLGHFL 360 Query: 1143 GIDTHDPGGYLKGAERPKEPGLKSLRTSRALLEGMVVTVEPGCYFIDALLVPAMKSPNTT 1322 GIDTHDPGGYLKG ER KEPGLKSLRT R L EGMV+TVEPGCYFIDALL+PAM SP T+ Sbjct: 361 GIDTHDPGGYLKGLERRKEPGLKSLRTIRDLREGMVITVEPGCYFIDALLLPAMNSPETS 420 Query: 1323 EFFSSE-INRFRGFGGVRIESDVYVNGHGCLNMTKCPREITEIESVMAGGPWSIKK 1487 +F + E INRF+GFGGVRIESDV V GC NMTKCPRE+ EIE+VMAG PW +K Sbjct: 421 KFLNQEAINRFKGFGGVRIESDVLVTATGCYNMTKCPREMREIEAVMAGAPWPAQK 476 >ref|NP_194678.2| Xaa-Pro dipeptidase [Arabidopsis thaliana] gi|110742445|dbj|BAE99141.1| putative prolidase [Arabidopsis thaliana] gi|332660237|gb|AEE85637.1| Xaa-Pro dipeptidase [Arabidopsis thaliana] Length = 486 Score = 761 bits (1964), Expect = 0.0 Identities = 362/482 (75%), Positives = 416/482 (86%), Gaps = 1/482 (0%) Frame = +3 Query: 33 SSLTPPPEVPMKLHLLNRNKLITSLGDHLSTSSRPHQGFVLLQGGEEQTRYCTDHAQLFR 212 SSL+PPP +PM+LH NR KL+ S+ LS+S+R GFVLLQGGEE+ RYCTDH +LFR Sbjct: 2 SSLSPPP-IPMELHAGNRKKLLESIRRQLSSSNRSLDGFVLLQGGEEKNRYCTDHTELFR 60 Query: 213 QESYFAYLFGVQEPGFYGAIDIATGKSLLFAPRLPSEYAVWLGEIKPLSYYKERYMVDLA 392 QESYFAYLFGV+EP FYGAIDI +GKS+LF PRLP +YAVWLGEIKPLS++KE YMVD+ Sbjct: 61 QESYFAYLFGVREPDFYGAIDIGSGKSILFIPRLPDDYAVWLGEIKPLSHFKETYMVDMV 120 Query: 393 FYTDEIVTTLLNQSQGVAQPLLFLLHGLNTDSNNYSKPAEFEGIEKFEKDLNTLHPILTE 572 FY DEI+ Q +G +PLL+LLHGLNTDS+N+SKPA FEGI+KFE DL TLHPIL E Sbjct: 121 FYVDEIIQVFNEQFKGSGKPLLYLLHGLNTDSSNFSKPASFEGIDKFETDLTTLHPILAE 180 Query: 573 CRVLKTNLELAVIQYANDISSEAHVEVMKNVKAGMKEYQLESIFLHHTYMYGGCRHCSYT 752 CRV+K++LEL +IQ+ANDISSEAH+EVM+ V GMKEYQ+ES+FLHH+YMYGGCRHCSYT Sbjct: 181 CRVIKSSLELQLIQFANDISSEAHIEVMRKVTPGMKEYQMESMFLHHSYMYGGCRHCSYT 240 Query: 753 CICATGGNSSVLHYGHAAAPNDRAFEDGDMALLDMGAEFKFYGSDITCSFPVNGKFTNDQ 932 CICATG NS+VLHYGHAAAPNDR FEDGD+ALLDMGAE+ FYGSDITCSFPVNGKFT+DQ Sbjct: 241 CICATGDNSAVLHYGHAAAPNDRTFEDGDLALLDMGAEYHFYGSDITCSFPVNGKFTSDQ 300 Query: 933 RLIYNAVLDAHNAVISSMKPGVSWVDMHKLAEKIILESLKKGGLLVGDVHDMMAERVGAV 1112 LIYNAVLDAHN+VIS+MKPGV+WVDMHKLAEKIILESLKKG +L GDV DMM +R+GAV Sbjct: 301 SLIYNAVLDAHNSVISAMKPGVNWVDMHKLAEKIILESLKKGSILTGDVDDMMVQRLGAV 360 Query: 1113 FMPHGLGHLLGIDTHDPGGYLKGAERPKEPGLKSLRTSRALLEGMVVTVEPGCYFIDALL 1292 FMPHGLGH +GIDTHD GGY KG ERPK+PGLKSLRT+R LLEGMV+TVEPGCYFI ALL Sbjct: 361 FMPHGLGHFMGIDTHDTGGYPKGVERPKKPGLKSLRTARDLLEGMVITVEPGCYFIKALL 420 Query: 1293 VPAMKSPNTTEFFSSE-INRFRGFGGVRIESDVYVNGHGCLNMTKCPREITEIESVMAGG 1469 PAM + T++FF+ E I RFR FGGVRIESD+ V +GC NMT PRE EIE+VMAGG Sbjct: 421 FPAMANATTSKFFNRETIERFRNFGGVRIESDLVVTANGCKNMTNVPRETWEIEAVMAGG 480 Query: 1470 PW 1475 PW Sbjct: 481 PW 482