BLASTX nr result

ID: Cephaelis21_contig00038197

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00038197
         (1875 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002454941.1| hypothetical protein SORBIDRAFT_03g001780 [S...   159   2e-64
gb|EEC83100.1| hypothetical protein OsI_28249 [Oryza sativa Indi...   132   1e-61
gb|ABA97040.1| retrotransposon protein, putative, unclassified [...   136   4e-61
ref|XP_002468588.1| hypothetical protein SORBIDRAFT_01g048580 [S...   154   3e-57
gb|EEC84753.1| hypothetical protein OsI_31756 [Oryza sativa Indi...   118   4e-57

>ref|XP_002454941.1| hypothetical protein SORBIDRAFT_03g001780 [Sorghum bicolor]
            gi|241926916|gb|EES00061.1| hypothetical protein
            SORBIDRAFT_03g001780 [Sorghum bicolor]
          Length = 631

 Score =  159 bits (403), Expect(4) = 2e-64
 Identities = 111/327 (33%), Positives = 167/327 (51%), Gaps = 13/327 (3%)
 Frame = -2

Query: 1496 PTRGIR*GDPISPYLFLIFLEGFSNMLKQAIRIGELTGVKISRLGLSITHLLFDDDSLLF 1317
            P RG+R GDP+SPYLF+I  EG S ML++A + G++ G+KI R    + HL F DDSL+ 
Sbjct: 210  PQRGLRQGDPLSPYLFIICAEGLSAMLQKAEQKGKIEGIKICRGAPKVNHLFFADDSLIL 269

Query: 1316 CKVDYQQITCIKKVLQRYEQSSGQQDNMEKFSIFFSKNTQ*EARRRICVELDGVTEQKCS 1137
             +        ++++L  YE++SGQ  N EK S+ FS NT    R  +   L    E    
Sbjct: 270  MRARASDALELRRILDVYEKASGQVINKEKSSMMFSPNTGQHDRSLMRSNLSITDEASTE 329

Query: 1136 KYLGLPLEIGKTKCQVFKFITDVVQRKIQS*KNGFLSPAGKEILIKSVTWHFLIILCLAI 957
            KYLGLP+ IGK++ + F++I   +  +IQ  +   LS AGKEILIK+V            
Sbjct: 330  KYLGLPVNIGKSRKKPFEYIKKKIWSRIQGWQEKLLSKAGKEILIKAVAQSIPTYAMSCF 389

Query: 956  EFQPRS*MRLQV-------*QLDSGGK------QQMTKEKFIGLLGKN*LRVRIGEDWVL 816
            +        L          Q D   K      +++T+ K  G LG              
Sbjct: 390  DLTKGLCDELSSIIGRYWWSQQDKANKIHWISWEKLTQSKEKGGLG-------------F 436

Query: 815  KNLGPFNDSLLAKQL*CIIIEPHLLMSRVMKARYFPTGDILQEKPRQCSS*I*KSWLSTK 636
            ++L  FN ++L++Q   ++  P  L  +V+KARYFP  DIL   PR   S   +S L   
Sbjct: 437  RDLHLFNLAMLSRQAWRLLTNPDSLCGQVLKARYFPHSDILHCSPRPRISYTWRSILKGV 496

Query: 635  YLMEKGVHF*VGDEQAIKI*EQKWHPE 555
             L+++G+ + +G+ + +KI E  W P+
Sbjct: 497  ALLKEGIIWRIGNGKNVKIWEHPWIPK 523



 Score = 80.9 bits (198), Expect(4) = 2e-64
 Identities = 39/88 (44%), Positives = 56/88 (63%)
 Frame = -3

Query: 1735 NMIVANEYVNHLNNLKQGKEGFLALKLDMAKAYKRVEWNFVAATMRKMGFHNTFVSWIMN 1556
            N+++A E V+HL ++K+G  G  A+KLDM+KAY RVEW+F+   MRKMGF   +V  IM 
Sbjct: 130  NVLLAYEIVHHLKSIKKGAGGLAAIKLDMSKAYDRVEWSFLENMMRKMGFDEKWVQLIMK 189

Query: 1555 CLSTVSYAFNENGQPEGYVFQLEELDRG 1472
            C++TV+Y    N      +F    L +G
Sbjct: 190  CVTTVTYKIKVNDSYTQKIFPQRGLRQG 217



 Score = 40.0 bits (92), Expect(4) = 2e-64
 Identities = 19/49 (38%), Positives = 30/49 (61%)
 Frame = -3

Query: 457 PQEEETIRQIPISQLGSRDKIIWHYTESGKYTANSGYQVAVSMQRAKSG 311
           PQ+ E I QIPI +    D + WH+  +G ++  S Y++AVS++ A  G
Sbjct: 563 PQDAEEILQIPIDE-HMEDWLAWHFDATGLFSVKSAYKLAVSIRDADRG 610



 Score = 35.8 bits (81), Expect(4) = 2e-64
 Identities = 16/30 (53%), Positives = 21/30 (70%)
 Frame = -1

Query: 1821 ISKILVERMKTVLPKCISENQSVFLKGRQI 1732
            ISK+L  R+K +LP  IS +QS F+ GR I
Sbjct: 98   ISKVLANRLKKILPNIISNSQSAFVPGRLI 127


>gb|EEC83100.1| hypothetical protein OsI_28249 [Oryza sativa Indica Group]
          Length = 1300

 Score =  132 bits (331), Expect(4) = 1e-61
 Identities = 109/330 (33%), Positives = 160/330 (48%), Gaps = 17/330 (5%)
 Frame = -2

Query: 1496 PTRGIR*GDPISPYLFLIFLEGFSNMLKQAIRIGELTGVKISRLGLSITHLLFDDDSLLF 1317
            PTRG+R GDP+SP+LFL   +G S +L++ +  G ++ V+I R    I+HLLF DD+LLF
Sbjct: 551  PTRGLRQGDPLSPFLFLFVADGLSLLLEEKVNQGAISPVRICRRAPGISHLLFADDTLLF 610

Query: 1316 CKVDYQQITCIKKVLQRYEQSSGQQDNMEKFSIFFSKNTQ*EARRRICVELDGVTEQKCS 1137
             K + +Q   I  VL  Y  S+GQ  N  K SI F + T    +  I   L         
Sbjct: 611  LKSNKEQAEIINSVLGDYAASTGQLVNPSKCSIMFGEATPLSEQTDIKATLQITNNVFED 670

Query: 1136 KYLGLPLEIGKTKCQVFKFITDVVQRKIQS*KNGFLSPAGKEILIKSVTWHFLIILCLAI 957
            KYLG P   G+     F+ + + V ++I      FLS  GKE+LIKSV    L +  + I
Sbjct: 671  KYLGFPTPDGRMHKGKFQSLHERVWKRIIQWGENFLSSGGKEVLIKSV-MQALPVYVMGI 729

Query: 956  EFQPRS---*MRLQV*QLDSG---GKQQ--------MTKEKFIGLLGKN*LRVRIGEDWV 819
               P S    +   V     G   GK++        +TK K  G +G    R+       
Sbjct: 730  FKLPDSVCEDLSKAVRNFWWGAGDGKRRTHWRAWDSLTKPKQCGGMGFRDFRL------- 782

Query: 818  LKNLGPFNDSLLAKQL*CIIIEPHLLMSRVMKARYFPTGDILQEKPRQCSS*I*KSWLST 639
                  FN +LLA+Q   I+  P  L +RV+KA+YFP G ++       +S     W   
Sbjct: 783  ------FNQALLARQSWRILEFPESLCARVLKAKYFPNGSLIDTSFSGNAS---PGWRGI 833

Query: 638  KY---LMEKGVHF*VGDEQAIKI*EQKWHP 558
            +Y   L+++G+ + VG+ + I+I    W P
Sbjct: 834  EYGLELLKQGIIWRVGNGRTIRIWRDPWIP 863



 Score = 73.9 bits (180), Expect(4) = 1e-61
 Identities = 36/131 (27%), Positives = 63/131 (48%)
 Frame = -3

Query: 457  PQEEETIRQIPISQLGSRDKIIWHYTESGKYTANSGYQVAVSMQRAKSGXXXXXXXXXEA 278
            P + E I  I  S+    D + WH  + G+++  S Y +A+S+    +          +A
Sbjct: 903  PIDVEKILSIHTSRFHENDFVAWHSDKLGRFSVRSAYHLALSLSNVVASSSSSGQELSKA 962

Query: 277  RRKWKLLWSLNIKQKVKIFLWKCNHGALPVKAQLKRRDMSLDSTCNLFGECEKTFENIFF 98
               W  LWS ++ QKV+IF+W+    +L      K++ +   S C++ G  E+   +   
Sbjct: 963  ---WNQLWSCHVPQKVRIFIWRAASNSLATMVNKKKKRLEHCSMCSICGTEEEDVAHALC 1019

Query: 97   HCPHTQRIWEV 65
             CPH + +WEV
Sbjct: 1020 RCPHAKYLWEV 1030



 Score = 67.4 bits (163), Expect(4) = 1e-61
 Identities = 31/73 (42%), Positives = 45/73 (61%)
 Frame = -3

Query: 1735 NMIVANEYVNHLNNLKQGKEGFLALKLDMAKAYKRVEWNFVAATMRKMGFHNTFVSWIMN 1556
            N ++A E  + +   +       A KLD++KAY RV+W F+   M KMGF + +VSWIM+
Sbjct: 471  NALLAFECFHFIQKNRSPNNAACAYKLDLSKAYDRVDWRFLEQAMYKMGFAHRWVSWIMS 530

Query: 1555 CLSTVSYAFNENG 1517
            C++TV YA   NG
Sbjct: 531  CITTVRYAIKFNG 543



 Score = 33.9 bits (76), Expect(4) = 1e-61
 Identities = 15/30 (50%), Positives = 21/30 (70%)
 Frame = -1

Query: 1821 ISKILVERMKTVLPKCISENQSVFLKGRQI 1732
            +SK LV RM+ +L + +S NQS F+ GR I
Sbjct: 439  VSKCLVNRMRPILDEVVSPNQSAFVPGRLI 468


>gb|ABA97040.1| retrotransposon protein, putative, unclassified [Oryza sativa
            Japonica Group]
          Length = 1913

 Score =  136 bits (342), Expect(4) = 4e-61
 Identities = 104/330 (31%), Positives = 164/330 (49%), Gaps = 17/330 (5%)
 Frame = -2

Query: 1496 PTRGIR*GDPISPYLFLIFLEGFSNMLKQAIRIGELTGVKISRLGLSITHLLFDDDSLLF 1317
            P+RG+R GDP+SP+LFL   +GFS ++++ +  G++T VKI R  L I+HLLF DD+LLF
Sbjct: 1204 PSRGLRQGDPLSPFLFLFVADGFSRLMEERVAAGDITPVKICRGALGISHLLFVDDTLLF 1263

Query: 1316 CKVDYQQITCIKKVLQRYEQSSGQQDNMEKFSIFFSKNTQ*EARRRICVELDGVTEQKCS 1137
             K D +Q   IK VL  Y   +GQ  N  K SI F   +  E +  I   L  V      
Sbjct: 1264 FKADCEQARLIKGVLNEYALGTGQLVNPNKCSILFDDTSHAETQDAIKNLLQIVCHNFED 1323

Query: 1136 KYLGLPLEIGKTKCQVFKFITDVVQRKIQS*KNGFLSPAGKEILIKSVTWHFLIILCLAI 957
            KYLG P   G+     F+ + + + ++I      +LS  GKEI++K V    + +  ++I
Sbjct: 1324 KYLGFPTPEGRLNKGKFQSLQEKMWKRIILWGENYLSSGGKEIMLK-VVIQAIPVYVMSI 1382

Query: 956  EFQPRS-------*MRLQV*QLDSGGKQQ-------MTKEKFIGLLGKN*LRVRIGEDWV 819
               P S         R      D G ++        +TK K  G L     R+       
Sbjct: 1383 FRLPESVCEDLNKLARNFWWGADKGKRKTHWRAWSCLTKPKHNGGLAFRDFRL------- 1435

Query: 818  LKNLGPFNDSLLAKQL*CIIIEPHLLMSRVMKARYFPTGDILQEKPRQCSS*I*KSWLST 639
                  FN +LLA+Q   ++ +P  L +RVMKA+Y+P G ++       +S     W + 
Sbjct: 1436 ------FNQALLARQAWRLLDKPDSLCARVMKAKYYPNGSLVDTAFGGNAS---PGWRAI 1486

Query: 638  KY---LMEKGVHF*VGDEQAIKI*EQKWHP 558
            ++   L++KG+ + +G+ ++++I    W P
Sbjct: 1487 EHGLALLKKGIVWRIGNGRSVRIWRDPWIP 1516



 Score = 69.7 bits (169), Expect(4) = 4e-61
 Identities = 40/159 (25%), Positives = 69/159 (43%), Gaps = 1/159 (0%)
 Frame = -3

Query: 541  HCRLSLVKDLM-VDGEXXXXXXXXXXXLSPQEEETIRQIPISQLGSRDKIIWHYTESGKY 365
            +CR+  V DL+  DG              P + + I +I  S     D + WH    G++
Sbjct: 1529 NCRVKWVSDLLGQDGSWDVQKVSRVFL--PIDADEILKIRTSVRLEEDFLSWHPDRLGQF 1586

Query: 364  TANSGYQVAVSMQRAKSGXXXXXXXXXEARRKWKLLWSLNIKQKVKIFLWKCNHGALPVK 185
            +  S Y++A+S+  A              ++ W L+W  N+ QKVK+F W+  +  L  +
Sbjct: 1587 SVRSAYKLAISLDYADESSSSSGQNP---QKIWDLIWKCNVPQKVKVFCWRAANNCLANQ 1643

Query: 184  AQLKRRDMSLDSTCNLFGECEKTFENIFFHCPHTQRIWE 68
               K+R++     C + G   +   +    CPH   +WE
Sbjct: 1644 ENKKKRNLERSEICCICGNETEDVSHALSRCPHAVHLWE 1682



 Score = 64.7 bits (156), Expect(4) = 4e-61
 Identities = 30/73 (41%), Positives = 44/73 (60%)
 Frame = -3

Query: 1735 NMIVANEYVNHLNNLKQGKEGFLALKLDMAKAYKRVEWNFVAATMRKMGFHNTFVSWIMN 1556
            N ++A E  + +   K   +   A KLD++KAY RV+W F+  TM K+GF   ++ WIM 
Sbjct: 1124 NALLAFECFHFIQRNKNANKAACAYKLDLSKAYDRVDWVFLEQTMCKLGFAPRWIKWIMT 1183

Query: 1555 CLSTVSYAFNENG 1517
            CLS+V Y+   NG
Sbjct: 1184 CLSSVRYSIKFNG 1196



 Score = 34.7 bits (78), Expect(4) = 4e-61
 Identities = 14/30 (46%), Positives = 22/30 (73%)
 Frame = -1

Query: 1821 ISKILVERMKTVLPKCISENQSVFLKGRQI 1732
            +SK LV R++ +L + +S+NQS F+ GR I
Sbjct: 1092 VSKCLVNRLRPILDELVSQNQSAFVPGRMI 1121


>ref|XP_002468588.1| hypothetical protein SORBIDRAFT_01g048580 [Sorghum bicolor]
            gi|241922442|gb|EER95586.1| hypothetical protein
            SORBIDRAFT_01g048580 [Sorghum bicolor]
          Length = 517

 Score =  154 bits (388), Expect(3) = 3e-57
 Identities = 114/331 (34%), Positives = 170/331 (51%), Gaps = 13/331 (3%)
 Frame = -2

Query: 1499 LPTRGIR*GDPISPYLFLIFLEGFSNMLKQAIRIGELTGVKISRLGLSITHLLFDDDSLL 1320
            +P RG R GDP+SPYLF++  EG S ML+ A   G++ G+KI R    + HL F DDSL+
Sbjct: 52   VPKRGSRQGDPLSPYLFILCAEGLSAMLQSAENAGKIEGIKICRGAPRVNHLFFADDSLM 111

Query: 1319 FCKVDYQQITCIKKVLQRYEQSSGQQDNMEKFSIFFSKNTQ*EARRRICVELDGVTEQKC 1140
              +   +    +K +L+ YE++SGQ  N +K SI FS NT    + ++  EL  + E + 
Sbjct: 112  LMRARKEDAHELKHILEIYERASGQVINKDKSSIMFSPNTSDYFKGKMRKELSIMQEARS 171

Query: 1139 SKYLGLPLEIGKTKCQVFKFITDVVQRKIQS*KNGFLSPAGKEILIKSVT----WHFLII 972
             KYLG P+  GK++ + F++I   +  +IQ  +   LS AGKEILIK+V      + +  
Sbjct: 172  EKYLGGPISTGKSRKKAFEYIKQRIWARIQGWQEKLLSKAGKEILIKAVAQAIPTYAMSY 231

Query: 971  LCLAIEF-QPRS*M--RLQV*QLDSGGK------QQMTKEKFIGLLGKN*LRVRIGEDWV 819
              L   F    S M  R    + D   K      +++T  K  G LG             
Sbjct: 232  FNLTKSFCDDLSSMVGRYWWSEQDKTNKIHWLSWEKLTLSKKRGGLG------------- 278

Query: 818  LKNLGPFNDSLLAKQL*CIIIEPHLLMSRVMKARYFPTGDILQEKPRQCSS*I*KSWLST 639
             ++L  FN ++LA+Q   I+IEP  L  RV+KARYFP  +ILQ K R       +S +  
Sbjct: 279  FRDLYHFNLAMLARQAWRILIEPDTLCGRVLKARYFPNSNILQCKARAGILYTWRSIIDG 338

Query: 638  KYLMEKGVHF*VGDEQAIKI*EQKWHPECQT 546
              L+++G+ + +G    + I    W P   T
Sbjct: 339  INLLKEGIIWRIGSGSNVNIWTDPWIPRGTT 369



 Score = 60.8 bits (146), Expect(3) = 3e-57
 Identities = 25/46 (54%), Positives = 34/46 (73%)
 Frame = -3

Query: 1651 MAKAYKRVEWNFVAATMRKMGFHNTFVSWIMNCLSTVSYAFNENGQ 1514
            M+KAY RVEW F+   MRK+GFH+ ++  IM C++TVSY  N NG+
Sbjct: 1    MSKAYDRVEWTFLQGMMRKLGFHDKWIHLIMKCVTTVSYRINVNGE 46



 Score = 57.0 bits (136), Expect(3) = 3e-57
 Identities = 30/104 (28%), Positives = 49/104 (47%), Gaps = 3/104 (2%)
 Frame = -3

Query: 457 PQEEETIRQIPISQLGSRDKIIWHYTESGKYTANSGYQVAVSMQRAKSGXXXXXXXXXEA 278
           P++ E I  IP +   + D   WHY   G ++  S Y++AV ++  + G           
Sbjct: 406 PEDAEAILMIP-TDYEAVDWPAWHYDSKGVFSVKSAYKLAVQIRDHQKGTDASTSMAENQ 464

Query: 277 RR---KWKLLWSLNIKQKVKIFLWKCNHGALPVKAQLKRRDMSL 155
                +W  LW LNI  K+K+FLW   H  LP++  ++  +  L
Sbjct: 465 NTYGFRWGKLWQLNIPNKIKMFLWHFAHNTLPLRRNIQGEESIL 508


>gb|EEC84753.1| hypothetical protein OsI_31756 [Oryza sativa Indica Group]
          Length = 1350

 Score =  118 bits (295), Expect(4) = 4e-57
 Identities = 96/331 (29%), Positives = 159/331 (48%), Gaps = 18/331 (5%)
 Frame = -2

Query: 1496 PTRGIR*GDPISPYLFLIFLEGFSNMLKQAIRIGELTGVKISRLGLSITHLLFDDDSLLF 1317
            P+RG+  GDP+SP+LFL   +G S +L + ++ G L+ + I      I+HLLF DD+LLF
Sbjct: 828  PSRGLCQGDPLSPFLFLFLADGLSLLLDEKVQQGVLSPIHICHSAPGISHLLFADDTLLF 887

Query: 1316 CKVDYQQITCIKKVLQRYEQSSGQQDNMEKFSIFFSKNTQ*EARRRICVELDGVTEQKCS 1137
             K   +Q   I++VL  Y  ++GQQ N  K SI F  ++    +  I   L         
Sbjct: 888  LKAVPEQAEVIREVLNDYATATGQQINPSKCSIMFGDSSSELVQEAIRDTLQIANNVFED 947

Query: 1136 KYLGLPLEIGKTKCQVFKFITDVVQRKIQS*KNGFLSPAGKEILIKSVTWHFLIILCLAI 957
            KYLG P   G+ K   F+ I + + +++      +LS  GKEILIK++    + +  + I
Sbjct: 948  KYLGFPTPDGRLKKGKFQSIQERIWKRLIQWGENYLSSGGKEILIKALI-QAIPVYVMGI 1006

Query: 956  EFQPRS*MRLQV*QLDSG---------------GKQQMTKEKFIGLLGKN*LRVRIGEDW 822
               P S +  ++ ++  G                   +TK K  G LG    R+      
Sbjct: 1007 FKLPES-VCDELTRITRGFWWGGEKGARKTHWKAWDTLTKPKNCGGLGFRDFRL------ 1059

Query: 821  VLKNLGPFNDSLLAKQL*CIIIEPHLLMSRVMKARYFPTGDILQEKPRQCSS*I*KSWLS 642
                   FN +LLA+Q   +I  P  L + V+KA+YF  G++        +S     W +
Sbjct: 1060 -------FNQALLARQAWRLIDSPDSLCAMVLKAKYFLNGNLTDTSFGGNAS---PGWRA 1109

Query: 641  TKY---LMEKGVHF*VGDEQAIKI*EQKWHP 558
             ++   L++KG+ + +G+ ++++I    W P
Sbjct: 1110 IEFGLELLKKGIIWRIGNGRSVRIWRDPWIP 1140



 Score = 71.6 bits (174), Expect(4) = 4e-57
 Identities = 37/129 (28%), Positives = 56/129 (43%)
 Frame = -3

Query: 457  PQEEETIRQIPISQLGSRDKIIWHYTESGKYTANSGYQVAVSMQRAKSGXXXXXXXXXEA 278
            P + E I +I +      D + WH    G+++  S Y +AVS+    S          + 
Sbjct: 1180 PIDAEVILKIRVPSQDVSDFVAWHPDRLGRFSVRSAYSLAVSLA---SENAFSASSAVDR 1236

Query: 277  RRKWKLLWSLNIKQKVKIFLWKCNHGALPVKAQLKRRDMSLDSTCNLFGECEKTFENIFF 98
             + W +LW  N+ QKVKIF WK     L      KRR +     C + G  E+   +   
Sbjct: 1237 SKAWNMLWKCNVPQKVKIFAWKVALDCLATMVNKKRRKLEATDVCAICGTEEEDSAHALC 1296

Query: 97   HCPHTQRIW 71
             CPH + +W
Sbjct: 1297 RCPHAKSLW 1305



 Score = 68.2 bits (165), Expect(4) = 4e-57
 Identities = 30/73 (41%), Positives = 46/73 (63%)
 Frame = -3

Query: 1735 NMIVANEYVNHLNNLKQGKEGFLALKLDMAKAYKRVEWNFVAATMRKMGFHNTFVSWIMN 1556
            N ++A EY +++   K   +   A KLD++KAY RV+W F+   M K+GF + +V WIM 
Sbjct: 748  NALLAFEYFHYIQKNKNPNKAASAYKLDLSKAYDRVDWGFLEQAMYKLGFAHRWVRWIME 807

Query: 1555 CLSTVSYAFNENG 1517
            C++TV Y+   NG
Sbjct: 808  CITTVRYSVKFNG 820



 Score = 33.9 bits (76), Expect(4) = 4e-57
 Identities = 16/30 (53%), Positives = 21/30 (70%)
 Frame = -1

Query: 1821 ISKILVERMKTVLPKCISENQSVFLKGRQI 1732
            +SK LV R+K +L + IS NQS F+ GR I
Sbjct: 716  VSKCLVNRLKPLLDELISVNQSAFVPGRMI 745

