BLASTX nr result
ID: Angelica23_contig00017754
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Angelica23_contig00017754 (1491 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002332417.1| predicted protein [Populus trichocarpa] gi|2... 173 1e-40 ref|XP_002269066.2| PREDICTED: pentatricopeptide repeat-containi... 150 6e-34 ref|XP_002513116.1| pentatricopeptide repeat-containing protein,... 147 5e-33 ref|XP_003522216.1| PREDICTED: uncharacterized protein LOC100797... 140 1e-30 ref|XP_003598926.1| hypothetical protein MTR_3g023510 [Medicago ... 140 1e-30 >ref|XP_002332417.1| predicted protein [Populus trichocarpa] gi|222832370|gb|EEE70847.1| predicted protein [Populus trichocarpa] Length = 394 Score = 173 bits (438), Expect = 1e-40 Identities = 115/375 (30%), Positives = 180/375 (48%), Gaps = 27/375 (7%) Frame = +3 Query: 222 NYYQTKVDPQEVLQNSSLP-WEQKFCLLSG-IPWYKVLAAKKYIYCHDNVLKWXXXXXXX 395 +Y + P + +P WE+KFC L G +PW KV+ AKKY+YCH N+L W Sbjct: 21 SYDYPESPPHSSFVDDGIPSWEKKFCSLIGSVPWRKVVDAKKYMYCHGNILNWDDSAGEE 80 Query: 396 XXXXXQERFCSMIYSLPHEPPLPDPDMYIDKIDWNPEIDPELMSDLDKEYFNPDEAE--- 566 ++RF + I + PDP+++ID+I WN IDPE++ DL+++ F PDE + Sbjct: 81 AFHNAKKRFWAEINGVSCGISPPDPNLFIDEIKWNAYIDPEVIKDLEQDLFVPDEGDTGG 140 Query: 567 ----------NLSSNEIPDCNNKNKSTLDNPWES-HHLENNVDIKDLAQSWNKWGDSLES 713 N S C +N + NPWES ++ ++++ + D A+SWN+W + Sbjct: 141 KVGRKNKKRRNFVSIPSNGC-YENTDDVKNPWESNNNTQSSLSLIDKAKSWNQWDSDINK 199 Query: 714 KDAMNL----WEQRDLNDDEASKDKKWGSSVNKPFGWNKGLNDTRESDKHESDHVNSWNQ 881 +N WE+ + EA K K WG NK +GWN N +S+ +++ N W Sbjct: 200 SSNLNKVDNPWERGFSQESEAVKGKTWGVCGNKSWGWNHSGNHVDQSNDW-NNNSNPWQH 258 Query: 882 GALHFKLPNEKGWGDASKNSCGWNCGNSR--SNEWGNAGNVDSWKPRPGGTSYCRQFTNY 1055 N+KGWG+ +S G+N SR +N+ ++GN + G S R++ + Sbjct: 259 SRQGVDPANDKGWGNLRDSSRGYNQHESRKWNNDCKSSGNGFF---QGSGASKDRKWEDN 315 Query: 1056 GNNSRNSKWLTNQNIDTEKFDSGIHSGACRKREEYQENTSRRKSWKHEGA-----DFEAH 1220 G+NS+ K N +T+ D H G R E SW+ EGA +E+ Sbjct: 316 GSNSQGWKQWDNYGKNTKGLDFRKHGGGWETRNE--------GSWQREGAHQHITGYEST 367 Query: 1221 QFWGKASPHGQYYRG 1265 +F G G + G Sbjct: 368 RFQGDGFQTGHSWSG 382 >ref|XP_002269066.2| PREDICTED: pentatricopeptide repeat-containing protein At4g20740-like [Vitis vinifera] Length = 1294 Score = 150 bits (380), Expect = 6e-34 Identities = 105/308 (34%), Positives = 140/308 (45%), Gaps = 37/308 (12%) Frame = +3 Query: 261 QNSSLPWEQKFCLLSGIPWYKVLAAKKYIYCHDNVLKWXXXXXXXXXXXXQERFCSMIYS 440 QNS WE++FC GIPW KV+ AKKYI+ H +VL W + RF + I Sbjct: 17 QNSVPSWEKRFCTSVGIPWGKVVDAKKYIHYHVDVLNWNDLAGEEAFHNAKRRFWAEING 76 Query: 441 LPHEPPLPDPDMYIDKIDWNPEIDPELMSDLDKEYFNPDEAENLSSNEIPDCNNKNKSTL 620 +P PDPD+YID IDWNP IDPELM +LDKE+F+PDE E DC KN ++ Sbjct: 77 IPCSISQPDPDIYIDNIDWNPXIDPELMRNLDKEFFSPDEREQ-------DC--KNPASG 127 Query: 621 DNPWESHHLENNVDIKDLAQSWNKWG---DSLESKDAMN--------------------- 728 DNPWE L +KD A +W+KWG L + D N Sbjct: 128 DNPWE---LNMPKTLKDRAWAWDKWGGCKTELRNLDKTNSQVSGYATEGHYRKPDGGDSP 184 Query: 729 --LWEQRDLNDDEASKDKKWGSSVNKPFGWNKGLNDTRESDKHESDHVNSWNQGALHFKL 902 W + + ++A +WG ++N+ ++ LN D + H +W GA+ Sbjct: 185 WEYWNVQGVLKEKARVRNQWGGNINE----SRNLN----GDDNRWKHSCTWASGAV---- 232 Query: 903 PNEKGWGDASKNSCG-WNCGNSRSNEWGNAGN-VDSWK---------PRPGGTSYCRQFT 1049 + WG+ NS WN N+ N N VD+W R Q Sbjct: 233 -RDDSWGNCEGNSWRMWNEVPKPINQLSNLDNGVDNWNSSCNQANAAQRDNACGGWSQGW 291 Query: 1050 NYGNNSRN 1073 NY N SRN Sbjct: 292 NYQNKSRN 299 >ref|XP_002513116.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223548127|gb|EEF49619.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 1128 Score = 147 bits (372), Expect = 5e-33 Identities = 101/342 (29%), Positives = 159/342 (46%), Gaps = 31/342 (9%) Frame = +3 Query: 201 HPRRSPPNYYQTKVDPQE----VLQNSSLPWEQKFCLLSG-IPWYKVLAAKKYIYCHDNV 365 H R P + Y +P ++ WE+KFC L G +PW K++ KK++YCHD V Sbjct: 13 HQYRDPASSYYNHQEPPPPYPGFAEDGVPSWEKKFCSLIGSVPWQKIVNVKKFMYCHDIV 72 Query: 366 LKWXXXXXXXXXXXXQERFCSMIYSLPHEPPLPDPDMYIDKIDWNPEIDPELMSDLDKEY 545 + W + RF + I LP + LPDPDMY+D I+W+P+IDPEL+ +L++ + Sbjct: 73 INWNDSAGADAFQNAKNRFWADINRLPCQISLPDPDMYVDDINWHPDIDPELVKELERAF 132 Query: 546 FNPDEAENLSSNEIPDCNNK---------------NKSTLDNPWE-SHHLENNVDIKDLA 677 F P+E EN N+ +C NK N + PWE +NV +++ Sbjct: 133 FAPEEGEN---NDNVECKNKKARHFLSVPSEGWNRNPDEVRIPWECEDEGGSNVAVEEKT 189 Query: 678 QSWNKWGDSLESKDAMN----LWEQRDLNDDEASKDKKWGSSVNKPFGWNKGLNDTRE-- 839 + WN+W S S +N W +EA + K WG+ +K GW+ +++ + Sbjct: 190 RGWNQWRISTNSSRNVNNGDTPWVSHFTQGNEAVEGKTWGNCADKLQGWSGSVHNQAKDW 249 Query: 840 SDKHESDHVNSWNQGALHFKLPNEKG-WGDASKNSCGW-NCGN--SRSNEWGNAGNVDSW 1007 + ++ W L KG W ++S GW + GN ++S EW + GN W Sbjct: 250 GSCNLTNDDKPWGHSYL-------KGTWRESSGKLWGWSHKGNQVNQSKEWDSGGN--PW 300 Query: 1008 KPRPGGTSYCRQFTNYGNNSRNSKWLTNQNIDTEKFDSGIHS 1133 + G + N NS + W N+ + KF G +S Sbjct: 301 EHSSQGVVLVKD--NVWGNSNHISWGKNKQVG--KFSHGENS 338 >ref|XP_003522216.1| PREDICTED: uncharacterized protein LOC100797066 [Glycine max] Length = 387 Score = 140 bits (352), Expect = 1e-30 Identities = 95/312 (30%), Positives = 142/312 (45%), Gaps = 9/312 (2%) Frame = +3 Query: 216 PPNYYQTKVDPQEVLQNSSLPWEQKFCLLSG-IPWYKVLAAKKYIYCHDNVLKWXXXXXX 392 PP +Y E Q+ WE+K+C + G +PW K++ +K ++YCH NV W Sbjct: 22 PPTFYDINAPLPEYWQDGIPLWEKKYCTIVGLVPWQKIVDSKMFVYCHSNVFDWNDSAAE 81 Query: 393 XXXXXXQERFCSMIYSLPHEPPLPDPDMYIDKIDWNPEIDPELMSDLDKEYFN-PDEAEN 569 + + + I SLP + LPDPD Y D+IDWNP IDP+++ ++DK +F PDE + Sbjct: 82 EALQNAKNHYWAKINSLPCDISLPDPDTYNDQIDWNPYIDPDMIKEIDKAFFTVPDEEQE 141 Query: 570 LSSNEIPDCNNKNKSTLDNPWE------SHHLENNVDIKDLAQSWNKWGDSLESKDAMNL 731 + I + K +NP E S LENN Q WN+ G+S + + N Sbjct: 142 TA---IKNKRTKTSVNDENPLECSDTPLSRALENNE-----VQRWNQ-GNSGDVDNTDNP 192 Query: 732 WEQRDLNDDEASKDKKWGSSVNKPFGWNKGLNDTRESDKHESDHVNSWNQGALHFKLPNE 911 WE + + D W K +GWN+G + + WN L + Sbjct: 193 WECSVTHGNGRLTDNAWEGGPVKSWGWNEG---------RDHNQCKDWNSENL-----QD 238 Query: 912 KGWGDASKNSCGWNCGNSRSNEWGNAGNVDSWKPRPGGTSYCRQFTNYGNNSRN-SKWLT 1088 KGWG A +S W C +SN N GN SW+ + + T + N+ N S W Sbjct: 239 KGWGKARDSS--W-C-QQQSNNLANFGN-SSWQCKSSQQNVTPLKTGWRNSGANGSGWKQ 293 Query: 1089 NQNIDTEKFDSG 1124 + D + + G Sbjct: 294 QEKADVSRRNYG 305 >ref|XP_003598926.1| hypothetical protein MTR_3g023510 [Medicago truncatula] gi|355487974|gb|AES69177.1| hypothetical protein MTR_3g023510 [Medicago truncatula] Length = 365 Score = 140 bits (352), Expect = 1e-30 Identities = 105/369 (28%), Positives = 163/369 (44%), Gaps = 34/369 (9%) Frame = +3 Query: 213 SPPNYYQTKVDPQEVLQNSSLP-WEQKFCLLSG-IPWYKVLAAKKYIYCHDNVLKWXXXX 386 +PP+ + P ++ +P WE+K+C LSG +PW K++ +K+ IYCH NVL W Sbjct: 18 NPPSIFYDIRAPLPEFRHDGIPVWEKKYCTLSGCVPWQKIVDSKELIYCHHNVLDWKDSG 77 Query: 387 XXXXXXXXQERFCSMIYSLPHEPPLPDPDMYIDKIDWNPEIDPELMSDLDKEYFN-PDEA 563 ++R+ + + +LP + LPDPD YI++IDWNP ID EL+ +LD +F PDE Sbjct: 78 AEEAFQNAKKRYWANVNNLPCDISLPDPDAYIEQIDWNPCIDAELIKELDNAFFTVPDEE 137 Query: 564 ENLSSNEIPDCNNKNKSTLDNPWE------SHHLENNVDIKDLAQSWNKWGDSLESKDAM 725 E N I K +NPWE LENN + Q+ + D+ E+ Sbjct: 138 E--QENAIQYKRTKISVDGENPWECAATSVGRGLENN---EVQGQNQGDYHDNSENVGTT 192 Query: 726 -NLWEQRDLNDDEASKDKKWGSSVNKPFGWNKGLNDTRESDKHESDHVNSWNQGALHFKL 902 N W + ++ D W K GWN+G + ++ + WN G L Sbjct: 193 DNPWVSSAVCGNQGLTDNAWEGGHVKSRGWNEG--------RDHNNQCSGWNSGCLQ--- 241 Query: 903 PNEKGWGDASKNSC-------------GWNCGNSRSN------EWGNAGN-VDSWKPRPG 1022 +KGWG NS W C +S+ N W N+G V WK Sbjct: 242 -TDKGWGKVRDNSWCHQKSNNLANSGNSWGCKSSQQNVIPMNTGWRNSGTIVPRWKQHE- 299 Query: 1023 GTSYCRQFTNYGNNSRNSKWLT-NQNIDTEKFDSGIHSGACRKREEYQENTSRRKSWKHE 1199 +Y + + N N W + NQ+ + S H+ + + ++++ W+ E Sbjct: 300 -NAYVTSDSQFRRN--NGGWNSGNQSYHQMRGGSNRHNPSYNGSQPQRDDSQTGHYWRRE 356 Query: 1200 GA---DFEA 1217 + DF A Sbjct: 357 QSRKRDFRA 365