BLASTX nr result
ID: Forsythia21_contig00032928
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Forsythia21_contig00032928 (714 letters)

Database: ./nr
69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done

Sequences producing significant alignments:                      Score (bits)   E Value
ref|XP_011091791.1| PREDICTED: pentatricopeptide repeat-containi...   133   2e-42
ref|XP_008375237.1| PREDICTED: pentatricopeptide repeat-containi...   121   2e-36
ref|XP_012830707.1| PREDICTED: pentatricopeptide repeat-containi...   111   1e-34
ref|XP_009368090.1| PREDICTED: pentatricopeptide repeat-containi...   115   3e-34
ref|XP_007220734.1| hypothetical protein PRUPE_ppa023145mg [Prun...   108   1e-33
ref|XP_008231523.1| PREDICTED: LOW QUALITY PROTEIN: pentatricope...   108   1e-33
ref|XP_010255813.1| PREDICTED: pentatricopeptide repeat-containi...   116   2e-33
ref|XP_009604999.1| PREDICTED: pentatricopeptide repeat-containi...   116   2e-33
ref|XP_010645700.1| PREDICTED: pentatricopeptide repeat-containi...   106   4e-33
ref|XP_008238545.1| PREDICTED: LOW QUALITY PROTEIN: pentatricope...   107   4e-33
ref|XP_009788856.1| PREDICTED: pentatricopeptide repeat-containi...   114   2e-32
ref|XP_002513116.1| pentatricopeptide repeat-containing protein,...   105   8e-32
ref|XP_012069204.1| PREDICTED: pentatricopeptide repeat-containi...   102   7e-31
ref|XP_010670382.1| PREDICTED: pentatricopeptide repeat-containi...   101   1e-30
gb|KMT17191.1| hypothetical protein BVRB_2g040990 [Beta vulgaris...   101   1e-30
gb|KHG02696.1| hypothetical protein F383_25080 [Gossypium arboreum]   111   8e-30
ref|XP_006345374.1| PREDICTED: pentatricopeptide repeat-containi...   107   2e-29
ref|XP_012449113.1| PREDICTED: pentatricopeptide repeat-containi...
106 3e-29 gb|KJB67470.1| hypothetical protein B456_010G192200 [Gossypium r... 106 3e-29 emb|CDP12559.1| unnamed protein product [Coffea canephora] 103 5e-29 >ref|XP_011091791.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20740 [Sesamum indicum] gi|747088409|ref|XP_011091792.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20740 [Sesamum indicum] Length = 720 Score = 133 bits (334), Expect(2) = 2e-42 Identities = 91/198 (45%), Positives = 118/198 (59%), Gaps = 7/198 (3%) Frame = +2 Query: 2 ADLAIYNSLIKGLCTVKRVDRAYKYFQITIQEVSSQIFLL*---ILSRCRMQS*KEWMNF 172 ADLAIYNSLI+GLC K VDRAYK FQ+TI+E F +LS ++ KE+ Sbjct: 394 ADLAIYNSLIRGLCNAKLVDRAYKLFQVTIREDLQPEFHTVNPILLSYAELKRMKEFCKL 453 Query: 173 VRCLRKCRNWVVVLLMFFQILFFFNGGEG*RSNGGIRSV*IPEKEKLR--LCFNLQFV-- 340 + L+K V+ L L FF+G + +G + ++ + E K+ LC + + Sbjct: 454 LEQLQKLGVSVIDYL-----LKFFSGMVE-KDDGVVAALEVFEYLKIHNYLCVPIYNIVM 507 Query: 341 ETLLKDGKVNKALELFHELNDSD*VPDMIIYSNAILRYVEAGNIHEACTWYNMIKESSSV 520 E L+K+ K KAL LFHEL+D VPD Y NAIL YVE ++ EACT YN IKE +SV Sbjct: 508 EALIKNSKEKKALALFHELSDLGLVPDSSTYCNAILCYVEVEDVQEACTMYNKIKEMASV 567 Query: 521 PSVAAYYFLGKGLSRIGE 574 PSVAAY L KGL +IGE Sbjct: 568 PSVAAYSSLVKGLCKIGE 585 Score = 67.4 bits (163), Expect(2) = 2e-42 Identities = 32/47 (68%), Positives = 38/47 (80%) Frame = +1 Query: 574 IHVTITLIRNCLAHVTSSPMEFKYTHTIIHVCTLNNAGKGIEVVNEM 714 I + LIR+CLAHVT+ PMEFKY+ TIIHVC LN+A K +EVVNEM Sbjct: 586 IEPAMMLIRDCLAHVTNGPMEFKYSLTIIHVCKLNDAQKIVEVVNEM 632 >ref|XP_008375237.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20740 [Malus domestica] gi|657967097|ref|XP_008375238.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20740 [Malus domestica] gi|657967099|ref|XP_008375239.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20740 [Malus domestica] Length = 717 Score = 121 bits (303), Expect(2) = 2e-36 Identities = 82/197 (41%), Positives = 111/197 (56%), Gaps = 7/197 (3%) Frame = +2 Query: 5 
DLAIYNSLIKGLCTVKRVDRAYKYFQITIQEVSSQIFLL---*ILSRCRMQS*KEWMNFV 175 DL IYNSLI+GLC VKRVD+AYK F +T+QE F ++ + ++ + Sbjct: 393 DLGIYNSLIEGLCNVKRVDKAYKIFXVTVQEGLQPDFATVNPILVFYXETRKVDKFCEML 452 Query: 176 RCLRKCRNWVVVLLMFFQILFFFNGGEG*RSNGGIRSV*IPEKEKLRLCFNL----QFVE 343 + KC V+ L F F F G + +G + + E+ K++ ++L F+E Sbjct: 453 AQMEKCGFPVIDBLXKF---FSFIVG---KEDGVTMGLEVFEELKVKGYYSLGIYNTFME 506 Query: 344 TLLKDGKVNKALELFHELNDSD*VPDMIIYSNAILRYVEAGNIHEACTWYNMIKESSSVP 523 L K GKV KAL LF+E D D PD YS AI+ +VE G+IHEAC YN I E S VP Sbjct: 507 ALHKSGKVKKALSLFNETKDVDLQPDSSTYSIAIMCFVEGGDIHEACACYNKIIEMSCVP 566 Query: 524 SVAAYYFLGKGLSRIGE 574 S+AAY L +GL +IGE Sbjct: 567 SIAAYRSLARGLCKIGE 583 Score = 59.3 bits (142), Expect(2) = 2e-36 Identities = 27/47 (57%), Positives = 34/47 (72%) Frame = +1 Query: 574 IHVTITLIRNCLAHVTSSPMEFKYTHTIIHVCTLNNAGKGIEVVNEM 714 I + L+R+CLA VTS P EFKY+ TI+H C NNA K +EV+NEM Sbjct: 584 IDAVMLLVRDCLASVTSGPSEFKYSLTILHACKSNNAEKVVEVLNEM 630 >ref|XP_012830707.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20740 [Erythranthe guttatus] gi|604344163|gb|EYU42962.1| hypothetical protein MIMGU_mgv1a021045mg [Erythranthe guttata] Length = 726 Score = 111 bits (277), Expect(2) = 1e-34 Identities = 82/199 (41%), Positives = 102/199 (51%), Gaps = 8/199 (4%) Frame = +2 Query: 2 ADLAIYNSLIKGLCTVKRVDRAYKYFQITIQEVSSQIFLL*ILSRCRMQS*KEWMNFVRC 181 ADLAIYNSLIKGLC K VDRAYK FQ I+E F K+ +F + Sbjct: 401 ADLAIYNSLIKGLCNSKLVDRAYKLFQAAIREDLQPDFNTVNPILICYAELKKLHDFCKL 460 Query: 182 LRKCRNWVVV----LLMFFQILFFFNGGEG*RSNGGIRSV*IPEKEKLRLCFNLQF---- 337 L + LL FF + N G ++ + E K+R N+ Sbjct: 461 LEQMEKLGFSINESLLDFFSCVVETNDGVA-------TALEVFEFLKIRNYINVPIYNIL 513 Query: 338 VETLLKDGKVNKALELFHELNDSD*VPDMIIYSNAILRYVEAGNIHEACTWYNMIKESSS 517 ++ L K+G KAL LFHEL D+D PD AI Y++ G++ EAC YN IKE SS Sbjct: 514 MDALFKNGDEKKALLLFHELKDADLAPDSSTLCIAISCYIKIGDVREACNTYNTIKEMSS 573 Query: 518 VPSVAAYYFLGKGLSRIGE 574 VPS+ AYY L KGLS IGE Sbjct: 574 VPSLDAYYALVKGLSDIGE 592 Score = 62.8 bits (151), Expect(2) = 1e-34 
Identities = 29/47 (61%), Positives = 34/47 (72%) Frame = +1 Query: 574 IHVTITLIRNCLAHVTSSPMEFKYTHTIIHVCTLNNAGKGIEVVNEM 714 + + L+R+CLAHVT PMEFKY TIIHVC N+A K IEVV EM Sbjct: 593 VDAAMVLVRDCLAHVTGGPMEFKYALTIIHVCKSNDARKVIEVVGEM 639 >ref|XP_009368090.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20740 [Pyrus x bretschneideri] gi|694384377|ref|XP_009368091.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20740 [Pyrus x bretschneideri] gi|694384380|ref|XP_009368092.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20740 [Pyrus x bretschneideri] Length = 717 Score = 115 bits (288), Expect(2) = 3e-34 Identities = 78/198 (39%), Positives = 109/198 (55%), Gaps = 8/198 (4%) Frame = +2 Query: 5 DLAIYNSLIKGLCTVKRVDRAYKYFQITIQEVSSQIFLL*---ILSRCRMQS*KEWMNFV 175 DL IYNSLI+GLC VKRVD+AYK F++T+QE F ++ ++ + Sbjct: 393 DLGIYNSLIEGLCNVKRVDKAYKIFRVTVQEGLQPDFATVNPILVLYAETSKVDKFCEML 452 Query: 176 RCLRKCRNWVVV-LLMFFQILFFFNGGEG*RSNGGIRSV*IPEKEKLRLCFNLQ----FV 340 + KC V+ L FF ++ + +G + + E+ K++ ++L F+ Sbjct: 453 AQMEKCGFPVIDDLSKFFSLIVG-------KEDGVTMGLEVFEELKVKGYYSLGIYNIFM 505 Query: 341 ETLLKDGKVNKALELFHELNDSD*VPDMIIYSNAILRYVEAGNIHEACTWYNMIKESSSV 520 E L K GKV KAL LF+E D PD YS AI+ +VE G+IHEAC YN I E S V Sbjct: 506 EALHKSGKVKKALSLFNETKDVGLQPDSSTYSIAIMCFVEDGDIHEACACYNKIIEMSCV 565 Query: 521 PSVAAYYFLGKGLSRIGE 574 P +AAY L +GL +IGE Sbjct: 566 PLIAAYRSLARGLCKIGE 583 Score = 57.8 bits (138), Expect(2) = 3e-34 Identities = 27/47 (57%), Positives = 34/47 (72%) Frame = +1 Query: 574 IHVTITLIRNCLAHVTSSPMEFKYTHTIIHVCTLNNAGKGIEVVNEM 714 I + L+R+CLA VTS P+EFKY+ TI+H C NNA K EV+NEM Sbjct: 584 IDAVMLLLRDCLASVTSGPLEFKYSLTILHACKSNNAEKVDEVLNEM 630 >ref|XP_007220734.1| hypothetical protein PRUPE_ppa023145mg [Prunus persica] gi|462417196|gb|EMJ21933.1| hypothetical protein PRUPE_ppa023145mg [Prunus persica] Length = 721 Score = 108 bits (271), Expect(2) = 1e-33 Identities = 72/195 (36%), Positives = 105/195 (53%), Gaps = 4/195 (2%) Frame = +2 Query: 2 
ADLAIYNSLIKGLCTVKRVDRAYKYFQITIQEVSSQIFLL*ILSRCRMQS*KEWMNFVRC 181 ADL IYNSLI+GLC KRVD+AYK F++T+QE F + NF Sbjct: 395 ADLGIYNSLIEGLCNAKRVDKAYKIFRVTVQEGLQPDFATVNPILVSYAEMRRMDNFCDM 454 Query: 182 LRKCRNWVVVLLMFFQILFFFNGGEG*RSNGGIRSV*IPEKEKLRLCFNLQFVETLL--- 352 L + + ++ F F G + +G ++ + + K++ +++ L+ Sbjct: 455 LAEMEKFDFPVIDDLSKFFSFMVG---KEDGVPLALEVFGELKVKGYYSVGIYNILMGSL 511 Query: 353 -KDGKVNKALELFHELNDSD*VPDMIIYSNAILRYVEAGNIHEACTWYNMIKESSSVPSV 529 K GKV KAL LF+E+ D D PD YS AI+ +VE +IHEAC +N I E S VPS+ Sbjct: 512 HKSGKVKKALSLFNEMKDVDLQPDASTYSIAIMCFVEDEDIHEACASHNKIIEMSCVPSI 571 Query: 530 AAYYFLGKGLSRIGE 574 +AY L +GL ++GE Sbjct: 572 SAYCSLARGLCKVGE 586 Score = 62.0 bits (149), Expect(2) = 1e-33 Identities = 29/47 (61%), Positives = 35/47 (74%) Frame = +1 Query: 574 IHVTITLIRNCLAHVTSSPMEFKYTHTIIHVCTLNNAGKGIEVVNEM 714 I + L+R+CLA VTS PMEFKY+ TI+H C NNA K IEV+NEM Sbjct: 587 IDTVMLLVRDCLASVTSGPMEFKYSLTILHACKSNNAEKVIEVLNEM 633 >ref|XP_008231523.1| PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At4g20740-like [Prunus mume] Length = 720 Score = 108 bits (271), Expect(2) = 1e-33 Identities = 71/195 (36%), Positives = 106/195 (54%), Gaps = 4/195 (2%) Frame = +2 Query: 2 ADLAIYNSLIKGLCTVKRVDRAYKYFQITIQEVSSQIFLL*ILSRCRMQS*KEWMNFVRC 181 ADL IYNSLI+GLC K+VD+AYK F++T+QE F + NF Sbjct: 394 ADLGIYNSLIEGLCNAKQVDKAYKIFRVTVQEGLQPDFATVNPILVSYAEMRRMDNFCDM 453 Query: 182 LRKCRNWVVVLLMFFQILFFFNGGEG*RSNGGIRSV*IPEKEKLRLCFNLQFVETLL--- 352 L + + ++ F F G + +G + ++ + + K++ +++ L+ Sbjct: 454 LAEMEKFDFPVIDDLSKFFSFMLG---KEDGVLLALEVFGELKVKGYYSVGIYNILMGSL 510 Query: 353 -KDGKVNKALELFHELNDSD*VPDMIIYSNAILRYVEAGNIHEACTWYNMIKESSSVPSV 529 K GKV KAL LF+E+ D D PD YS AI+ +VE +IHEAC +N I E S VPS+ Sbjct: 511 HKSGKVKKALSLFNEMKDVDLQPDASTYSIAIMCFVEDEDIHEACASHNKIIEMSCVPSI 570 Query: 530 AAYYFLGKGLSRIGE 574 +AY L +GL ++GE Sbjct: 571 SAYCSLARGLCKVGE 585 Score = 62.0 bits (149), Expect(2) = 1e-33 Identities = 29/47 (61%), Positives = 35/47 (74%) Frame = +1 Query: 574 
IHVTITLIRNCLAHVTSSPMEFKYTHTIIHVCTLNNAGKGIEVVNEM 714 I + L+R+CLA VTS PMEFKY+ TI+H C NNA K IEV+NEM Sbjct: 586 IDTVMLLVRDCLASVTSGPMEFKYSLTILHACKSNNAEKVIEVLNEM 632 >ref|XP_010255813.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20740 [Nelumbo nucifera] gi|719965226|ref|XP_010255821.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20740 [Nelumbo nucifera] Length = 733 Score = 116 bits (290), Expect(2) = 2e-33 Identities = 83/198 (41%), Positives = 116/198 (58%), Gaps = 7/198 (3%) Frame = +2 Query: 2 ADLAIYNSLIKGLCTVKRVDRAYKYFQITIQEVSSQIF--LL*ILSRCRMQS*KEWMNFV 175 ADL+IYNSLI+GLC +V++A+K FQIT+QE F + IL+ QS + +F Sbjct: 407 ADLSIYNSLIEGLCNANQVNKAFKLFQITVQEGLGPDFTTINPILASYAEQSRMD--DFY 464 Query: 176 RCLRKCRNWVVVLLMFFQILFFFNGGEG*RSNGGIRSV*IPEKEKLR-----LCFNLQFV 340 R L + + V + F F +G R ++++ + E K +N+ + Sbjct: 465 RLLEQMQMLGVPVSDDLSKFFSFMIAKGDRE---MKALEVFEHLKANGYCSVSIYNI-LI 520 Query: 341 ETLLKDGKVNKALELFHELNDSD*VPDMIIYSNAILRYVEAGNIHEACTWYNMIKESSSV 520 +L K G+V AL LF+E+NDSD PD+ YSNAI +V+ GNI EAC YN IKE S V Sbjct: 521 GSLYKIGEVKGALSLFNEMNDSDFKPDLFTYSNAIPCFVDIGNIKEACLCYNGIKEMSWV 580 Query: 521 PSVAAYYFLGKGLSRIGE 574 P+++AY L KGLSRIGE Sbjct: 581 PTISAYRSLVKGLSRIGE 598 Score = 54.3 bits (129), Expect(2) = 2e-33 Identities = 24/47 (51%), Positives = 33/47 (70%) Frame = +1 Query: 574 IHVTITLIRNCLAHVTSSPMEFKYTHTIIHVCTLNNAGKGIEVVNEM 714 I + L+R+CL +V S PMEFKY+ TI+H C +A K IEV++EM Sbjct: 599 IDAALMLVRDCLGNVVSGPMEFKYSLTILHACKSGDAQKVIEVIDEM 645 >ref|XP_009604999.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20740 [Nicotiana tomentosiformis] Length = 721 Score = 116 bits (290), Expect(2) = 2e-33 Identities = 78/192 (40%), Positives = 100/192 (52%), Gaps = 1/192 (0%) Frame = +2 Query: 2 ADLAIYNSLIKGLCTVKRVDRAYKYFQITIQEVSSQIFLL*ILSRCRMQS*KEWMNFVRC 181 ADLAIYNSLI+G C KR+DRAYK FQIT+QE F K + Sbjct: 395 ADLAIYNSLIEGFCNAKRIDRAYKLFQITVQEDLQPDFSTVRPILVSYAESKRMDEICKL 454 Query: 182 LRKCRNWVVVLLMFFQILFFFNGGEG*RSNGGIRSV*-IPEKEKLRLCFNLQFVETLLKD 358 L + + + F 
F + R + + EK+ + +E L ++ Sbjct: 455 LEELQRLSYCIRDDLSKFFTFMVEKDDRIMIALEVFEYLKEKDYCGVPIYNILMEALYRN 514 Query: 359 GKVNKALELFHELNDSD*VPDMIIYSNAILRYVEAGNIHEACTWYNMIKESSSVPSVAAY 538 G+V KAL LF EL DSD PD YSNAI +VE G++ EAC YN IKE S +PSVAAY Sbjct: 515 GEVTKALTLFSELRDSDHKPDSSTYSNAIQCFVEVGDVQEACNCYNRIKEMSLIPSVAAY 574 Query: 539 YFLGKGLSRIGE 574 L KGL +IG+ Sbjct: 575 RSLVKGLCKIGQ 586 Score = 53.9 bits (128), Expect(2) = 2e-33 Identities = 25/48 (52%), Positives = 36/48 (75%) Frame = +1 Query: 571 RIHVTITLIRNCLAHVTSSPMEFKYTHTIIHVCTLNNAGKGIEVVNEM 714 +I + LIR+CL +V S P+EFKY TIIHVC +N+A K ++V++EM Sbjct: 586 QIDPAMMLIRDCLGNVESGPIEFKYILTIIHVCKMNDAEKVMKVLDEM 633 >ref|XP_010645700.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20740 [Vitis vinifera] gi|296081308|emb|CBI17752.3| unnamed protein product [Vitis vinifera] Length = 729 Score = 106 bits (265), Expect(2) = 4e-33 Identities = 74/202 (36%), Positives = 110/202 (54%), Gaps = 11/202 (5%) Frame = +2 Query: 2 ADLAIYNSLIKGLCTVKRVDRAYKYFQITIQEVSSQIFLL*---ILSRCRMQS*KEWMNF 172 ADLAIYNSLI+G+C VK+VD+AYK FQ+T+ E FL ++S M+ ++ + Sbjct: 395 ADLAIYNSLIEGMCNVKQVDKAYKLFQVTVHESLEPNFLTVKPMLVSYAEMKRMDDFCSL 454 Query: 173 VRCLRKCRNWVVV-LLMFFQILFFFNGGEG*RSNGGIRSV*IPEKEKLRL-------CFN 328 + ++K V+ L FF ++ G + + E L+ +N Sbjct: 455 LGQMQKLGFPVIDDLSKFFSVMI---------EKGERLKLALEVFEHLKAKGYCSISIYN 505 Query: 329 LQFVETLLKDGKVNKALELFHELNDSD*VPDMIIYSNAILRYVEAGNIHEACTWYNMIKE 508 + +E + + G+V KAL LF ++ DS+ PD YSNAI+ +VE G++ EAC YN I E Sbjct: 506 I-LMEAIHRTGEVKKALSLFDDIKDSNFKPDSSTYSNAIICFVEVGDVQEACACYNKIIE 564 Query: 509 SSSVPSVAAYYFLGKGLSRIGE 574 +PSVAAY L KGL + E Sbjct: 565 MCQLPSVAAYRSLVKGLCKSEE 586 Score = 62.8 bits (151), Expect(2) = 4e-33 Identities = 29/51 (56%), Positives = 37/51 (72%) Frame = +1 Query: 562 QNRRIHVTITLIRNCLAHVTSSPMEFKYTHTIIHVCTLNNAGKGIEVVNEM 714 ++ I I L+R+CLA+VTS PMEFKYT TI+H C NA K I+V+NEM Sbjct: 583 KSEEIDAAIMLVRDCLANVTSGPMEFKYTLTILHACKSGNAEKVIDVLNEM 633 >ref|XP_008238545.1| PREDICTED: LOW QUALITY PROTEIN: 
pentatricopeptide repeat-containing protein At4g20740-like [Prunus mume] Length = 719 Score = 107 bits (267), Expect(2) = 4e-33 Identities = 71/194 (36%), Positives = 104/194 (53%), Gaps = 4/194 (2%) Frame = +2 Query: 5 DLAIYNSLIKGLCTVKRVDRAYKYFQITIQEVSSQIFLL*ILSRCRMQS*KEWMNFVRCL 184 DL IYNSLI+GLC KRVD+AYK F++T+QE F + NF L Sbjct: 394 DLGIYNSLIEGLCNAKRVDKAYKIFRVTVQEGLQPDFATVNPILVSYAEMRRMDNFCYML 453 Query: 185 RKCRNWVVVLLMFFQILFFFNGGEG*RSNGGIRSV*IPEKEKLRLCFNLQFVETLL---- 352 + + ++ F F G + +G ++ + + K++ +++ L+ Sbjct: 454 AEMEKFDFPVIDDLSKFFSFMVG---KEDGVPLALEVFGELKVKGYYSVGIYNILMGSLH 510 Query: 353 KDGKVNKALELFHELNDSD*VPDMIIYSNAILRYVEAGNIHEACTWYNMIKESSSVPSVA 532 K GKV KAL LF+E+ D D PD YS AI+ +VE +IHEAC +N I E S VPS++ Sbjct: 511 KSGKVKKALSLFNEMKDVDLQPDASTYSIAIMCFVEDEDIHEACASHNKIIEMSCVPSIS 570 Query: 533 AYYFLGKGLSRIGE 574 AY L +GL ++GE Sbjct: 571 AYCSLARGLCKVGE 584 Score = 62.0 bits (149), Expect(2) = 4e-33 Identities = 29/47 (61%), Positives = 35/47 (74%) Frame = +1 Query: 574 IHVTITLIRNCLAHVTSSPMEFKYTHTIIHVCTLNNAGKGIEVVNEM 714 I + L+R+CLA VTS PMEFKY+ TI+H C NNA K IEV+NEM Sbjct: 585 IDTVMLLVRDCLASVTSGPMEFKYSLTILHACKSNNAEKVIEVLNEM 631 >ref|XP_009788856.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20740 [Nicotiana sylvestris] Length = 721 Score = 114 bits (284), Expect(2) = 2e-32 Identities = 77/192 (40%), Positives = 99/192 (51%), Gaps = 1/192 (0%) Frame = +2 Query: 2 ADLAIYNSLIKGLCTVKRVDRAYKYFQITIQEVSSQIFLL*ILSRCRMQS*KEWMNFVRC 181 ADLAIYNSLI+G C K +DRAYK FQIT+QE F K + Sbjct: 395 ADLAIYNSLIEGFCNAKLIDRAYKLFQITVQEDLQPDFTTVRPILVSYAESKRMDEICKL 454 Query: 182 LRKCRNWVVVLLMFFQILFFFNGGEG*RSNGGIRSV*-IPEKEKLRLCFNLQFVETLLKD 358 L + R + F F + R + + EK+ + +E L K+ Sbjct: 455 LEELRRLSYCIRDDLSKFFTFMVEKDDRIMIALEVFEHLKEKDYCGVPIYNILMEALYKN 514 Query: 359 GKVNKALELFHELNDSD*VPDMIIYSNAILRYVEAGNIHEACTWYNMIKESSSVPSVAAY 538 G+VNK+L LF EL DS PD YSNA+ +VE G++ EAC YN IKE S +PSVAAY Sbjct: 515 GEVNKSLTLFSELRDSYYEPDSSTYSNAVQCFVEVGDVQEACNCYNRIKEMSLIPSVAAY 574 Query: 539 YFLGKGLSRIGE 
574 L KGL +IG+ Sbjct: 575 RSLVKGLCKIGQ 586 Score = 53.1 bits (126), Expect(2) = 2e-32 Identities = 25/48 (52%), Positives = 35/48 (72%) Frame = +1 Query: 571 RIHVTITLIRNCLAHVTSSPMEFKYTHTIIHVCTLNNAGKGIEVVNEM 714 +I + LIR+CL +V S P+EFKY TIIHVC N+A K ++V++EM Sbjct: 586 QIDPAMMLIRDCLGNVASGPIEFKYILTIIHVCKTNDAEKVMKVLDEM 633 >ref|XP_002513116.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223548127|gb|EEF49619.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 1128 Score = 105 bits (263), Expect(2) = 8e-32 Identities = 81/196 (41%), Positives = 103/196 (52%), Gaps = 5/196 (2%) Frame = +2 Query: 2 ADLAIYNSLIKGLCTVKRVDRAYKYFQITIQEVSSQIFLL*ILSRCRMQS*KEWMNFVRC 181 ADL IYNSLI+GLC VKRVD+A K FQI +QE F K F + Sbjct: 802 ADLGIYNSLIEGLCNVKRVDKARKLFQIMVQEGLELDFKTVNPMLVSYAEMKRMDEFCKL 861 Query: 182 LRKCRNWVVVLLMFFQILFFFNGGEG*RSNGGIRSV*IPEKEKLR-----LCFNLQFVET 346 L + ++ LF F R ++ + E+ K++ L +N +E Sbjct: 862 LVQMERLGFSVMDDISKLFSFLVR---REEIITLALEVFEELKVKGYISVLIYNT-LMEA 917 Query: 347 LLKDGKVNKALELFHELNDSD*VPDMIIYSNAILRYVEAGNIHEACTWYNMIKESSSVPS 526 LLK G+V KAL LF E+ D + PD YS A++ +VE GNI EAC +N I E SSVPS Sbjct: 918 LLKVGEVRKALSLFSEMKDLNCEPDSNTYSIAVICFVEDGNIQEACVCHNKIIEMSSVPS 977 Query: 527 VAAYYFLGKGLSRIGE 574 VAAY L KGL IGE Sbjct: 978 VAAYCSLTKGLCDIGE 993 Score = 58.9 bits (141), Expect(2) = 8e-32 Identities = 27/47 (57%), Positives = 35/47 (74%) Frame = +1 Query: 574 IHVTITLIRNCLAHVTSSPMEFKYTHTIIHVCTLNNAGKGIEVVNEM 714 I + L+R+CL +VTS PMEFKYT T++HVC +A K IEV+NEM Sbjct: 994 IDEAMMLVRDCLGNVTSGPMEFKYTLTVLHVCRSGDAEKVIEVLNEM 1040 >ref|XP_012069204.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20740 [Jatropha curcas] Length = 1159 Score = 102 bits (253), Expect(2) = 7e-31 Identities = 74/195 (37%), Positives = 100/195 (51%), Gaps = 4/195 (2%) Frame = +2 Query: 2 ADLAIYNSLIKGLCTVKRVDRAYKYFQITIQEVSSQIFLL*ILSRCRMQS*KEWMNFVRC 181 ADL IYNSLI+GLC VK+VD+A+K F+ + E F K +F Sbjct: 834 
ADLGIYNSLIQGLCNVKQVDKAHKLFKFLVHEGLEPDFNTVNPMLVFYSETKRMNDFCNL 893 Query: 182 LRKCRNWVVVLLMFFQILFFFNGGEG*RSNGGIRSV*IPEKEKLRLCFNLQF----VETL 349 L + L+ F F GE + ++ + E KL+ ++Q +E Sbjct: 894 LVQMDKLGFSLIDDISKFFSFLVGE----ERTMMALEVFEDLKLKGYNSVQIYNILMEAF 949 Query: 350 LKDGKVNKALELFHELNDSD*VPDMIIYSNAILRYVEAGNIHEACTWYNMIKESSSVPSV 529 LK G+VNKAL LF E+ D + PD YS A++ +VE GNI +AC +N I E S VPS+ Sbjct: 950 LKIGEVNKALSLFSEMKDLNFEPDSTTYSIAVMCFVEDGNIQQACVCHNKIIEMSCVPSI 1009 Query: 530 AAYYFLGKGLSRIGE 574 AY L KGL IGE Sbjct: 1010 PAYCSLAKGLCDIGE 1024 Score = 59.7 bits (143), Expect(2) = 7e-31 Identities = 28/47 (59%), Positives = 35/47 (74%) Frame = +1 Query: 574 IHVTITLIRNCLAHVTSSPMEFKYTHTIIHVCTLNNAGKGIEVVNEM 714 I + L+R+CL +VTS PMEFKYT TI+HVC +A K IEV+NEM Sbjct: 1025 IDEAMMLVRDCLGNVTSGPMEFKYTLTILHVCRSGDADKVIEVLNEM 1071 >ref|XP_010670382.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20740 [Beta vulgaris subsp. vulgaris] Length = 741 Score = 101 bits (252), Expect(2) = 1e-30 Identities = 67/211 (31%), Positives = 106/211 (50%), Gaps = 20/211 (9%) Frame = +2 Query: 2 ADLAIYNSLIKGLCTVKRVDRAYKYFQITIQEVSSQIFLL*---ILSRCRMQS*KEWMNF 172 ADL+I+NSLI GLC +K++D+AYK FQ+T+ + F ++S + ++ N Sbjct: 415 ADLSIFNSLIHGLCNLKQLDKAYKLFQVTVNQGLQPDFTTVNPMLVSYAESREMDDFFNL 474 Query: 173 VRCLRKCRNWVVV-LLMFFQILFFFNGGEG*RSNGGIRSV*IPEKEKLRLCFNLQ----- 334 + ++K ++V+ L FF ++ + +E+LR + Sbjct: 475 LVRMQKLGSYVIDGLSKFFSLM-------------------VEREERLRCTLEVFGDLKG 515 Query: 335 -----------FVETLLKDGKVNKALELFHELNDSD*VPDMIIYSNAILRYVEAGNIHEA 481 +E L K + +AL LF E+ S+ PD YS+AI+ V+ ++ EA Sbjct: 516 KGYCSVSIYNILLEALYKSKQAKEALSLFTEMKASNFAPDSTTYSHAIMCLVDLEDVREA 575 Query: 482 CTWYNMIKESSSVPSVAAYYFLGKGLSRIGE 574 C WYN IKE S+PS+AAY L GL +IGE Sbjct: 576 CLWYNKIKEMGSIPSIAAYCSLVNGLCKIGE 606 Score = 59.3 bits (142), Expect(2) = 1e-30 Identities = 28/47 (59%), Positives = 35/47 (74%) Frame = +1 Query: 574 IHVTITLIRNCLAHVTSSPMEFKYTHTIIHVCTLNNAGKGIEVVNEM 714 I I+L+++CLA+VTS PMEFKYT I+H C +A K IEVVNEM Sbjct: 607 
IDAAISLVQDCLANVTSGPMEFKYTLNILHACKSYDADKVIEVVNEM 653 >gb|KMT17191.1| hypothetical protein BVRB_2g040990 [Beta vulgaris subsp. vulgaris] Length = 739 Score = 101 bits (252), Expect(2) = 1e-30 Identities = 67/211 (31%), Positives = 106/211 (50%), Gaps = 20/211 (9%) Frame = +2 Query: 2 ADLAIYNSLIKGLCTVKRVDRAYKYFQITIQEVSSQIFLL*---ILSRCRMQS*KEWMNF 172 ADL+I+NSLI GLC +K++D+AYK FQ+T+ + F ++S + ++ N Sbjct: 413 ADLSIFNSLIHGLCNLKQLDKAYKLFQVTVNQGLQPDFTTVNPMLVSYAESREMDDFFNL 472 Query: 173 VRCLRKCRNWVVV-LLMFFQILFFFNGGEG*RSNGGIRSV*IPEKEKLRLCFNLQ----- 334 + ++K ++V+ L FF ++ + +E+LR + Sbjct: 473 LVRMQKLGSYVIDGLSKFFSLM-------------------VEREERLRCTLEVFGDLKG 513 Query: 335 -----------FVETLLKDGKVNKALELFHELNDSD*VPDMIIYSNAILRYVEAGNIHEA 481 +E L K + +AL LF E+ S+ PD YS+AI+ V+ ++ EA Sbjct: 514 KGYCSVSIYNILLEALYKSKQAKEALSLFTEMKASNFAPDSTTYSHAIMCLVDLEDVREA 573 Query: 482 CTWYNMIKESSSVPSVAAYYFLGKGLSRIGE 574 C WYN IKE S+PS+AAY L GL +IGE Sbjct: 574 CLWYNKIKEMGSIPSIAAYCSLVNGLCKIGE 604 Score = 59.3 bits (142), Expect(2) = 1e-30 Identities = 28/47 (59%), Positives = 35/47 (74%) Frame = +1 Query: 574 IHVTITLIRNCLAHVTSSPMEFKYTHTIIHVCTLNNAGKGIEVVNEM 714 I I+L+++CLA+VTS PMEFKYT I+H C +A K IEVVNEM Sbjct: 605 IDAAISLVQDCLANVTSGPMEFKYTLNILHACKSYDADKVIEVVNEM 651 >gb|KHG02696.1| hypothetical protein F383_25080 [Gossypium arboreum] Length = 829 Score = 111 bits (278), Expect(2) = 8e-30 Identities = 80/201 (39%), Positives = 110/201 (54%), Gaps = 10/201 (4%) Frame = +2 Query: 2 ADLAIYNSLIKGLCTVKRVDRAYKYFQITIQEVSSQIF-----LL*ILSRCRMQS*KEWM 166 ADL IYNSLI+G+C VK +DRAYK FQ+T+QE F +L + + R S Sbjct: 446 ADLGIYNSLIEGMCDVKLIDRAYKLFQVTVQEGLEPGFATVKPMLLVFAEMRRMS----- 500 Query: 167 NFVRCLRKCRNWVVVLLMFFQILFFFNGGEG*RSNGGIRSV*IPEKEKLR-----LCFNL 331 +F + L + + + F F +G R+ I +V + + K++ L +N+ Sbjct: 501 DFCKLLEQMQKLGFSVNDDLSKFFSFVVEKGERT---IMAVRVFNELKVKGYGSVLIYNI 557 Query: 332 QFVETLLKDGKVNKALELFHELNDSD*VPDMIIYSNAILRYVEAGNIHEACTWYNMIKES 511 + L K GKV +AL LF E+ D + PD YSNAI+ YVE NI EAC +N I E Sbjct: 558 
-LMGALHKTGKVKQALSLFQEMKDLNFEPDSSTYSNAIICYVEDENIKEACICHNKIIEM 616 Query: 512 SSVPSVAAYYFLGKGLSRIGE 574 S VPS+ AYY L GL +IGE Sbjct: 617 SCVPSIDAYYSLTNGLCKIGE 637 Score = 46.6 bits (109), Expect(2) = 8e-30 Identities = 22/47 (46%), Positives = 32/47 (68%) Frame = +1 Query: 574 IHVTITLIRNCLAHVTSSPMEFKYTHTIIHVCTLNNAGKGIEVVNEM 714 I + L+R+CL +VT+ PMEFKY T++ C + A K +EV+NEM Sbjct: 638 IDAAMVLVRDCLGNVTNGPMEFKYALTVLPACK-SGAEKVMEVLNEM 683 >ref|XP_006345374.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20740-like [Solanum tuberosum] Length = 720 Score = 107 bits (266), Expect(2) = 2e-29 Identities = 80/199 (40%), Positives = 110/199 (55%), Gaps = 8/199 (4%) Frame = +2 Query: 2 ADLAIYNSLIKGLCTVKRVDRAYKYFQITIQEVSSQIFLL*---ILSRCRMQS*KEWMNF 172 ADLAIYNS+I+GLC KR DRAYK FQIT+QE F ++S + E Sbjct: 394 ADLAIYNSIIEGLCNAKRTDRAYKLFQITVQEDLCPDFSTVKPILVSYAESKKMDEICKL 453 Query: 173 VRCLRKCRNWVVVLLMFFQILFFFNGGEG*RSNGGIRSV*IPEKEKLR-LC----FNLQF 337 + L++ + + L F F + +G R + + E K++ C +N+ Sbjct: 454 LEELQRLSHCISDDLSKF---FTYMVEKGDRIMIALE---VFEYLKVKDYCGVPIYNI-L 506 Query: 338 VETLLKDGKVNKALELFHELNDSD*VPDMIIYSNAILRYVEAGNIHEACTWYNMIKESSS 517 +E L ++G+VNKAL LF EL SD PD YSNA+ +VE G++ EA YN IKE S Sbjct: 507 MEALYQNGEVNKALTLFSELRSSDYEPDSSAYSNAVQCFVEVGDVQEASICYNRIKEMSL 566 Query: 518 VPSVAAYYFLGKGLSRIGE 574 +PSVAAY L GL +IG+ Sbjct: 567 IPSVAAYRSLVIGLCKIGQ 585 Score = 49.7 bits (117), Expect(2) = 2e-29 Identities = 23/48 (47%), Positives = 35/48 (72%) Frame = +1 Query: 571 RIHVTITLIRNCLAHVTSSPMEFKYTHTIIHVCTLNNAGKGIEVVNEM 714 +I + LIR+CL +V S P+EFK TIIHVC +N+A K ++V++E+ Sbjct: 585 QIDPAMMLIRDCLGNVASGPIEFKCILTIIHVCKMNDAEKVMKVLDEL 632 >ref|XP_012449113.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20740 [Gossypium raimondii] gi|823232893|ref|XP_012449114.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20740 [Gossypium raimondii] gi|763800516|gb|KJB67471.1| hypothetical protein B456_010G192200 [Gossypium raimondii] Length = 718 Score = 106 bits (264), Expect(2) = 
3e-29 Identities = 73/195 (37%), Positives = 103/195 (52%), Gaps = 4/195 (2%) Frame = +2 Query: 2 ADLAIYNSLIKGLCTVKRVDRAYKYFQITIQEVSSQIFLL*ILSRCRMQS*KEWMNFVRC 181 ADL IYN LI+G+C VK +DRAYK FQ+T+QE F + +F + Sbjct: 393 ADLGIYNPLIEGMCDVKLIDRAYKLFQVTVQEGLEPGFATVKPMLLAFAEMRRMSDFCKL 452 Query: 182 LRKCRNWVVVLLMFFQILFFFNGGEG*RSNGGIRSV*IPEKEKLRLCFNLQFVETLL--- 352 L + + + F F +G R+ I +V + + K++ +++ L+ Sbjct: 453 LEQMQKLGFSVNDDLSKFFSFVVEKGERT---IMAVRVFNELKVKGYGSVRIYSILMGAL 509 Query: 353 -KDGKVNKALELFHELNDSD*VPDMIIYSNAILRYVEAGNIHEACTWYNMIKESSSVPSV 529 K GKV +AL LF E+ D + PD YSNAI+ YVE NI +AC +N I E S VPS+ Sbjct: 510 HKTGKVKQALSLFQEMKDLNFEPDSSTYSNAIICYVEDENIKDACICHNKIIEMSCVPSI 569 Query: 530 AAYYFLGKGLSRIGE 574 AYY L GL +IGE Sbjct: 570 DAYYSLTNGLCKIGE 584 Score = 50.1 bits (118), Expect(2) = 3e-29 Identities = 23/47 (48%), Positives = 33/47 (70%) Frame = +1 Query: 574 IHVTITLIRNCLAHVTSSPMEFKYTHTIIHVCTLNNAGKGIEVVNEM 714 I + L+R+CL +VT+ PMEFKY T++H C + A K +EV+NEM Sbjct: 585 IDAAMMLVRDCLGNVTNGPMEFKYALTVLHACK-SGAEKVMEVLNEM 630 >gb|KJB67470.1| hypothetical protein B456_010G192200 [Gossypium raimondii] Length = 590 Score = 106 bits (264), Expect(2) = 3e-29 Identities = 73/195 (37%), Positives = 103/195 (52%), Gaps = 4/195 (2%) Frame = +2 Query: 2 ADLAIYNSLIKGLCTVKRVDRAYKYFQITIQEVSSQIFLL*ILSRCRMQS*KEWMNFVRC 181 ADL IYN LI+G+C VK +DRAYK FQ+T+QE F + +F + Sbjct: 265 ADLGIYNPLIEGMCDVKLIDRAYKLFQVTVQEGLEPGFATVKPMLLAFAEMRRMSDFCKL 324 Query: 182 LRKCRNWVVVLLMFFQILFFFNGGEG*RSNGGIRSV*IPEKEKLRLCFNLQFVETLL--- 352 L + + + F F +G R+ I +V + + K++ +++ L+ Sbjct: 325 LEQMQKLGFSVNDDLSKFFSFVVEKGERT---IMAVRVFNELKVKGYGSVRIYSILMGAL 381 Query: 353 -KDGKVNKALELFHELNDSD*VPDMIIYSNAILRYVEAGNIHEACTWYNMIKESSSVPSV 529 K GKV +AL LF E+ D + PD YSNAI+ YVE NI +AC +N I E S VPS+ Sbjct: 382 HKTGKVKQALSLFQEMKDLNFEPDSSTYSNAIICYVEDENIKDACICHNKIIEMSCVPSI 441 Query: 530 AAYYFLGKGLSRIGE 574 AYY L GL +IGE Sbjct: 442 DAYYSLTNGLCKIGE 456 Score = 50.1 bits (118), Expect(2) = 3e-29 Identities = 23/47 (48%), Positives = 33/47 (70%) 
Frame = +1 Query: 574 IHVTITLIRNCLAHVTSSPMEFKYTHTIIHVCTLNNAGKGIEVVNEM 714 I + L+R+CL +VT+ PMEFKY T++H C + A K +EV+NEM Sbjct: 457 IDAAMMLVRDCLGNVTNGPMEFKYALTVLHACK-SGAEKVMEVLNEM 502 >emb|CDP12559.1| unnamed protein product [Coffea canephora] Length = 727 Score = 103 bits (257), Expect(2) = 5e-29 Identities = 75/207 (36%), Positives = 100/207 (48%), Gaps = 16/207 (7%) Frame = +2 Query: 2 ADLAIYNSLIKGLCTVKRVDRAYKYFQITIQEVSSQIFLL*ILSRCRMQS*KEWMNFVRC 181 ADLAIYNSLI+GLC +RVDRAYK FQ+ I E F + + +F + Sbjct: 401 ADLAIYNSLIEGLCGAERVDRAYKLFQVMIVEDVQPDFSTVRPLLVSLAELERMDDFCKM 460 Query: 182 LRKCRNWVVVLLMFFQILFFFNGGEG*RSNGGIRSV*IPEKEKLRLCFNLQ--------- 334 L + +N ++ LF F + EK++L L Sbjct: 461 LEEMKNLGFSVIDDLSKLFEFM---------------VVNDEKIKLALELFEYLKMKDYC 505 Query: 335 -------FVETLLKDGKVNKALELFHELNDSD*VPDMIIYSNAILRYVEAGNIHEACTWY 493 +ETL + G+V KAL + EL S+ PD + YS AI + E G++HEACT Y Sbjct: 506 SVSIYNIVMETLNRIGEVRKALVVLDELKSSNFEPDSVTYSIAIQCFAEVGDVHEACTCY 565 Query: 494 NMIKESSSVPSVAAYYFLGKGLSRIGE 574 N IKE S +PS+AAY L KGL E Sbjct: 566 NKIKEISKLPSLAAYRSLVKGLCATAE 592 Score = 52.0 bits (123), Expect(2) = 5e-29 Identities = 24/47 (51%), Positives = 32/47 (68%) Frame = +1 Query: 574 IHVTITLIRNCLAHVTSSPMEFKYTHTIIHVCTLNNAGKGIEVVNEM 714 I + LIR+CL V S P+EFKYT TIIH+C +A K + V++EM Sbjct: 593 IDAAMMLIRDCLGSVASGPLEFKYTLTIIHLCKSKDAKKVVGVIDEM 639