BLASTX nr result

ID: Cocculus22_contig00017627

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cocculus22_contig00017627
         (1402 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006474564.1| PREDICTED: AT-rich interactive domain-contai...   298   5e-78
ref|XP_006452906.1| hypothetical protein CICLE_v10007563mg [Citr...   298   5e-78
ref|XP_006381551.1| hypothetical protein POPTR_0006s13780g [Popu...   296   1e-77
ref|XP_007012520.1| ARID/BRIGHT DNA-binding domain-containing pr...   289   2e-75
ref|XP_002324130.2| arid/bright DNA-binding domain-containing fa...   282   2e-73
emb|CBI35803.3| unnamed protein product [Vitis vinifera]              282   2e-73
ref|XP_004252398.1| PREDICTED: AT-rich interactive domain-contai...   268   6e-69
gb|EXB64667.1| AT-rich interactive domain-containing protein 4 [...   266   2e-68
ref|XP_002277324.1| PREDICTED: AT-rich interactive domain-contai...   265   3e-68
ref|XP_006362097.1| PREDICTED: AT-rich interactive domain-contai...   264   6e-68
ref|XP_007217035.1| hypothetical protein PRUPE_ppa001668mg [Prun...   260   9e-67
gb|EYU21278.1| hypothetical protein MIMGU_mgv1a001736mg [Mimulus...   248   5e-63
ref|XP_007012523.1| ARID/BRIGHT DNA-binding domain-containing pr...   248   6e-63
ref|XP_007012522.1| ARID/BRIGHT DNA-binding domain-containing pr...   248   6e-63
ref|XP_003546529.2| PREDICTED: AT-rich interactive domain-contai...   247   1e-62
ref|XP_003547888.1| PREDICTED: AT-rich interactive domain-contai...   246   2e-62
ref|XP_003595365.1| Fiber protein-like protein [Medicago truncat...   244   7e-62
ref|XP_006828651.1| hypothetical protein AMTR_s00129p00111730 [A...   244   9e-62
ref|XP_007138611.1| hypothetical protein PHAVU_009G223200g [Phas...   243   1e-61
ref|XP_002516200.1| DNA binding protein, putative [Ricinus commu...   242   3e-61

>ref|XP_006474564.1| PREDICTED: AT-rich interactive domain-containing protein 4-like
            [Citrus sinensis]
          Length = 745

 Score =  298 bits (762), Expect = 5e-78
 Identities = 145/207 (70%), Positives = 166/207 (80%)
 Frame = +1

Query: 781  MMFHAQGPLKNTCSLLAVLCGKVSDLKPKPGFSGKPANYPFPELSSSGRLEVHTLTKPTI 960
            MMFHAQ   +N CSLLAVL  K  D K K   +     YPFPE++SSGRLEVH L+ P+ 
Sbjct: 1    MMFHAQSSSRNHCSLLAVLSRKFVDDKQKQAATDDKPKYPFPEIASSGRLEVHLLSSPST 60

Query: 961  DEFRRVLESTEPSIVYLQGELLSNSEEIGSLVWGGVQLSTPEAISGLFGSTLPTTVYLEL 1140
            DEFRR+LES+EP+IVYLQGE +++SEEIGSLVWG V LSTPEA+ GLFGSTLPTTVYLE+
Sbjct: 61   DEFRRLLESSEPNIVYLQGEKINDSEEIGSLVWGDVDLSTPEALCGLFGSTLPTTVYLEI 120

Query: 1141 PNGEKLAEAIYSKGVPYVIYWKNTFSLYAASHFRLALLSVVQSSCSHTWDAFQLAHASFR 1320
            PNGE  AEA++S+GVPYVIYWK++FS YAA HF  ALLSVVQSSCSHTWDAFQLAHASFR
Sbjct: 121  PNGENFAEALHSRGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFR 180

Query: 1321 LYCTRNNYALPANGAKNSGMSGPHLLG 1401
            LYC RNN  + +N  K S   GPHLLG
Sbjct: 181  LYCVRNNIVMASNSQKGSSKLGPHLLG 207


>ref|XP_006452906.1| hypothetical protein CICLE_v10007563mg [Citrus clementina]
            gi|557556132|gb|ESR66146.1| hypothetical protein
            CICLE_v10007563mg [Citrus clementina]
          Length = 745

 Score =  298 bits (762), Expect = 5e-78
 Identities = 145/207 (70%), Positives = 166/207 (80%)
 Frame = +1

Query: 781  MMFHAQGPLKNTCSLLAVLCGKVSDLKPKPGFSGKPANYPFPELSSSGRLEVHTLTKPTI 960
            MMFHAQ   +N CSLLAVL  K  D K K   +     YPFPE++SSGRLEVH L+ P+ 
Sbjct: 1    MMFHAQSSSRNHCSLLAVLSRKFVDDKQKQAATDDKPKYPFPEIASSGRLEVHLLSSPST 60

Query: 961  DEFRRVLESTEPSIVYLQGELLSNSEEIGSLVWGGVQLSTPEAISGLFGSTLPTTVYLEL 1140
            DEFRR+LES+EP+IVYLQGE +++SEEIGSLVWG V LSTPEA+ GLFGSTLPTTVYLE+
Sbjct: 61   DEFRRLLESSEPNIVYLQGEKINDSEEIGSLVWGDVDLSTPEALCGLFGSTLPTTVYLEI 120

Query: 1141 PNGEKLAEAIYSKGVPYVIYWKNTFSLYAASHFRLALLSVVQSSCSHTWDAFQLAHASFR 1320
            PNGE  AEA++S+GVPYVIYWK++FS YAA HF  ALLSVVQSSCSHTWDAFQLAHASFR
Sbjct: 121  PNGENFAEALHSRGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFR 180

Query: 1321 LYCTRNNYALPANGAKNSGMSGPHLLG 1401
            LYC RNN  + +N  K S   GPHLLG
Sbjct: 181  LYCVRNNIVMASNSQKGSSKLGPHLLG 207


>ref|XP_006381551.1| hypothetical protein POPTR_0006s13780g [Populus trichocarpa]
            gi|550336257|gb|ERP59348.1| hypothetical protein
            POPTR_0006s13780g [Populus trichocarpa]
          Length = 749

 Score =  296 bits (758), Expect = 1e-77
 Identities = 146/207 (70%), Positives = 164/207 (79%)
 Frame = +1

Query: 781  MMFHAQGPLKNTCSLLAVLCGKVSDLKPKPGFSGKPANYPFPELSSSGRLEVHTLTKPTI 960
            MMFHAQGPL+N C+LLAVLCGK  D K K   S     +PFPEL+S+GRLEV  LT P+ 
Sbjct: 1    MMFHAQGPLRNHCTLLAVLCGKSGDNKQKQPLSDDKPRFPFPELASAGRLEVQVLTNPST 60

Query: 961  DEFRRVLESTEPSIVYLQGELLSNSEEIGSLVWGGVQLSTPEAISGLFGSTLPTTVYLEL 1140
            DEF+RVL S EPSIVY QGE + +SEEIG L WG + LSTPE++ GLFGSTLP TVYLE+
Sbjct: 61   DEFQRVLHSLEPSIVYFQGEQIEDSEEIGPLRWGDIDLSTPESLCGLFGSTLPPTVYLEI 120

Query: 1141 PNGEKLAEAIYSKGVPYVIYWKNTFSLYAASHFRLALLSVVQSSCSHTWDAFQLAHASFR 1320
            PNGEKLAEA++SKGVPYVIYWK+ FS YA SHFR ALLSVVQSSCSHT DAFQLA+ASFR
Sbjct: 121  PNGEKLAEALHSKGVPYVIYWKSMFSCYAVSHFRQALLSVVQSSCSHTCDAFQLAYASFR 180

Query: 1321 LYCTRNNYALPANGAKNSGMSGPHLLG 1401
            LYC RNN  L +NG K  G  GP LLG
Sbjct: 181  LYCGRNNNTLASNGQKVGGKPGPQLLG 207


>ref|XP_007012520.1| ARID/BRIGHT DNA-binding domain-containing protein isoform 1
            [Theobroma cacao] gi|590574848|ref|XP_007012521.1|
            ARID/BRIGHT DNA-binding domain-containing protein isoform
            1 [Theobroma cacao] gi|508782883|gb|EOY30139.1|
            ARID/BRIGHT DNA-binding domain-containing protein isoform
            1 [Theobroma cacao] gi|508782884|gb|EOY30140.1|
            ARID/BRIGHT DNA-binding domain-containing protein isoform
            1 [Theobroma cacao]
          Length = 746

 Score =  289 bits (739), Expect = 2e-75
 Identities = 146/208 (70%), Positives = 162/208 (77%), Gaps = 1/208 (0%)
 Frame = +1

Query: 781  MMFHAQGPLKNTCSLLAVLCG-KVSDLKPKPGFSGKPANYPFPELSSSGRLEVHTLTKPT 957
            MMF AQG  +N CSLLAVL G  VSD K K   S     YPFPEL+SSGRLEV  L  P 
Sbjct: 1    MMFSAQGSSRNHCSLLAVLSGGNVSDNKQKQPVSDDKPRYPFPELASSGRLEVQLLNSPN 60

Query: 958  IDEFRRVLESTEPSIVYLQGELLSNSEEIGSLVWGGVQLSTPEAISGLFGSTLPTTVYLE 1137
            IDE RRVLESTEP++VYLQGE  ++SEEIG L+WG V LSTPE + GLF STLPTTVYLE
Sbjct: 61   IDELRRVLESTEPNVVYLQGEQNADSEEIGPLIWGDVDLSTPETLCGLFDSTLPTTVYLE 120

Query: 1138 LPNGEKLAEAIYSKGVPYVIYWKNTFSLYAASHFRLALLSVVQSSCSHTWDAFQLAHASF 1317
             PNG+KLAEA++S+GVPYVIYWKNTFS +AA HFR ALLSV+QSSCSHTWDAFQLAHASF
Sbjct: 121  TPNGDKLAEALHSQGVPYVIYWKNTFSRFAACHFRQALLSVIQSSCSHTWDAFQLAHASF 180

Query: 1318 RLYCTRNNYALPANGAKNSGMSGPHLLG 1401
            RLYC RNN  + +N  K S   GP LLG
Sbjct: 181  RLYCVRNNNVVSSNSQKQSVKPGPRLLG 208


>ref|XP_002324130.2| arid/bright DNA-binding domain-containing family protein [Populus
            trichocarpa] gi|550318261|gb|EEF02695.2| arid/bright
            DNA-binding domain-containing family protein [Populus
            trichocarpa]
          Length = 746

 Score =  282 bits (722), Expect = 2e-73
 Identities = 144/207 (69%), Positives = 161/207 (77%)
 Frame = +1

Query: 781  MMFHAQGPLKNTCSLLAVLCGKVSDLKPKPGFSGKPANYPFPELSSSGRLEVHTLTKPTI 960
            MMFHAQGPL+N C+LLAVLCGK  + K  P    KP  YP PEL S+GRLEV  L  P+ 
Sbjct: 1    MMFHAQGPLRNHCTLLAVLCGKSGEQK-LPLSDDKP-RYPLPELESTGRLEVQVLNNPST 58

Query: 961  DEFRRVLESTEPSIVYLQGELLSNSEEIGSLVWGGVQLSTPEAISGLFGSTLPTTVYLEL 1140
            DEFR+VL+S EPSIVY QGE + + EEIGSL W  V LSTPE++ GLFGSTLP TVYLE+
Sbjct: 59   DEFRQVLQSLEPSIVYFQGEQVEDREEIGSLRWADVGLSTPESLCGLFGSTLPPTVYLEM 118

Query: 1141 PNGEKLAEAIYSKGVPYVIYWKNTFSLYAASHFRLALLSVVQSSCSHTWDAFQLAHASFR 1320
            PNGEKLAEA++SKGVPYVIYWK+ FS YAASHFR ALLSVVQSSCSHT DAFQLAHASFR
Sbjct: 119  PNGEKLAEALHSKGVPYVIYWKSAFSCYAASHFRQALLSVVQSSCSHTCDAFQLAHASFR 178

Query: 1321 LYCTRNNYALPANGAKNSGMSGPHLLG 1401
            LYC +NN    +N  K  G  GP LLG
Sbjct: 179  LYCVQNNNTPASNSQKVGGKPGPRLLG 205


>emb|CBI35803.3| unnamed protein product [Vitis vinifera]
          Length = 746

 Score =  282 bits (722), Expect = 2e-73
 Identities = 140/206 (67%), Positives = 158/206 (76%)
 Frame = +1

Query: 784  MFHAQGPLKNTCSLLAVLCGKVSDLKPKPGFSGKPANYPFPELSSSGRLEVHTLTKPTID 963
            M H QG   +TC LLAV CGK S+ K +   S     YPFP+  SSGRLEV TLT P+ D
Sbjct: 1    MLHTQGISNHTCGLLAVTCGKTSECKQEHETSNDRPRYPFPDFVSSGRLEVQTLTSPSPD 60

Query: 964  EFRRVLESTEPSIVYLQGELLSNSEEIGSLVWGGVQLSTPEAISGLFGSTLPTTVYLELP 1143
            EFRRV ES +P+ VY QGE L N +E+GSLVWGGV+LS+ E I GLFGS LPTTVYLE+P
Sbjct: 61   EFRRVFESVQPNFVYFQGEQLQN-DEVGSLVWGGVELSSAEDICGLFGSKLPTTVYLEIP 119

Query: 1144 NGEKLAEAIYSKGVPYVIYWKNTFSLYAASHFRLALLSVVQSSCSHTWDAFQLAHASFRL 1323
            NGEKLAEA++SKG+PYVIYWKN FS YAA HFR AL SVVQSS +HTWDAFQLA+ASFRL
Sbjct: 120  NGEKLAEALHSKGIPYVIYWKNAFSCYAACHFRNALFSVVQSSSTHTWDAFQLAYASFRL 179

Query: 1324 YCTRNNYALPANGAKNSGMSGPHLLG 1401
            YC RNN+ LPAN  K SG  GP LLG
Sbjct: 180  YCVRNNHVLPANSHKVSGKLGPRLLG 205


>ref|XP_004252398.1| PREDICTED: AT-rich interactive domain-containing protein 4-like
            [Solanum lycopersicum]
          Length = 771

 Score =  268 bits (684), Expect = 6e-69
 Identities = 132/206 (64%), Positives = 155/206 (75%)
 Frame = +1

Query: 784  MFHAQGPLKNTCSLLAVLCGKVSDLKPKPGFSGKPANYPFPELSSSGRLEVHTLTKPTID 963
            MFH QG  + +CSLLAVLCG+ S+   K         Y FPE+ SSGRLEV  L  P+ D
Sbjct: 1    MFHCQGASRQSCSLLAVLCGRTSEYDQKKDVHDGKPRYCFPEIVSSGRLEVQVLKNPSTD 60

Query: 964  EFRRVLESTEPSIVYLQGELLSNSEEIGSLVWGGVQLSTPEAISGLFGSTLPTTVYLELP 1143
            EF +VL+S +P+IVYLQGE LSN +E+GSLVWGG+ LS+ EAISGLF S LPT VYLELP
Sbjct: 61   EFHKVLDSWQPNIVYLQGEHLSN-DEVGSLVWGGLDLSSAEAISGLFSSVLPTAVYLELP 119

Query: 1144 NGEKLAEAIYSKGVPYVIYWKNTFSLYAASHFRLALLSVVQSSCSHTWDAFQLAHASFRL 1323
            NGEKLAEA+++KG+PYV+YWK+ FS YAASHFR A L V QSS  H WDAFQLAHASFRL
Sbjct: 120  NGEKLAEALHAKGIPYVMYWKSAFSCYAASHFRHAFLCVAQSSTCHVWDAFQLAHASFRL 179

Query: 1324 YCTRNNYALPANGAKNSGMSGPHLLG 1401
            YC RNN+AL     ++S   GPHLLG
Sbjct: 180  YCVRNNFALSEMSQRDSDNVGPHLLG 205


>gb|EXB64667.1| AT-rich interactive domain-containing protein 4 [Morus notabilis]
          Length = 779

 Score =  266 bits (680), Expect = 2e-68
 Identities = 133/206 (64%), Positives = 162/206 (78%)
 Frame = +1

Query: 784  MFHAQGPLKNTCSLLAVLCGKVSDLKPKPGFSGKPANYPFPELSSSGRLEVHTLTKPTID 963
            MFH+QG  K TCSLLAV CG VS+ K K       + YPFPEL SSGRLEV TLT P+ +
Sbjct: 1    MFHSQGSSKQTCSLLAVTCGNVSESKRKKDVPENRSLYPFPELISSGRLEVQTLTSPSKE 60

Query: 964  EFRRVLESTEPSIVYLQGELLSNSEEIGSLVWGGVQLSTPEAISGLFGSTLPTTVYLELP 1143
            EF ++LES +P++VYLQGE L+N +E+G LVWG V LSTPE++S LFG+TLPTTVYLE+P
Sbjct: 61   EFSKLLESYKPNLVYLQGEQLAN-DEVGPLVWGDVDLSTPESVSELFGTTLPTTVYLEIP 119

Query: 1144 NGEKLAEAIYSKGVPYVIYWKNTFSLYAASHFRLALLSVVQSSCSHTWDAFQLAHASFRL 1323
            + E+LAE ++SKGVPYVIYWK+ FS +AA HFR ALLSVV+SS +H WDAFQLA+ASFRL
Sbjct: 120  DCEELAEELHSKGVPYVIYWKDRFSRHAACHFRNALLSVVKSSSTHAWDAFQLAYASFRL 179

Query: 1324 YCTRNNYALPANGAKNSGMSGPHLLG 1401
            YC RNN+ LP+ G + S   GP LLG
Sbjct: 180  YCVRNNHVLPSKGHEISDEQGPCLLG 205


>ref|XP_002277324.1| PREDICTED: AT-rich interactive domain-containing protein 4 [Vitis
            vinifera] gi|297738501|emb|CBI27746.3| unnamed protein
            product [Vitis vinifera]
          Length = 739

 Score =  265 bits (678), Expect = 3e-68
 Identities = 138/209 (66%), Positives = 155/209 (74%), Gaps = 3/209 (1%)
 Frame = +1

Query: 784  MFHAQGPLKNTCSLLAVLCGKV---SDLKPKPGFSGKPANYPFPELSSSGRLEVHTLTKP 954
            MFH Q   +N C+LLAV+CGK+    D +  P        YPFPEL SSGRLEV  L  P
Sbjct: 1    MFHVQAASRNHCALLAVVCGKIPVSEDQQQHP--------YPFPELVSSGRLEVQILKNP 52

Query: 955  TIDEFRRVLESTEPSIVYLQGELLSNSEEIGSLVWGGVQLSTPEAISGLFGSTLPTTVYL 1134
            +I EF+R LES EP+ +YLQGE L  SEEIGSL WGGV LS+ EA+  LFG TLPTTVYL
Sbjct: 53   SIHEFQRSLESLEPNFLYLQGEQLPGSEEIGSLTWGGVDLSSAEALVELFGPTLPTTVYL 112

Query: 1135 ELPNGEKLAEAIYSKGVPYVIYWKNTFSLYAASHFRLALLSVVQSSCSHTWDAFQLAHAS 1314
            E PNGEKLA+A++SKGV YVIYWKN FS YAA HFR AL SVVQSSCSHTWDAFQLAHAS
Sbjct: 113  ETPNGEKLAKALHSKGVSYVIYWKNAFSCYAACHFRQALFSVVQSSCSHTWDAFQLAHAS 172

Query: 1315 FRLYCTRNNYALPANGAKNSGMSGPHLLG 1401
            FRLYC +NN  +P+N  K SG  GP LLG
Sbjct: 173  FRLYCVQNN-TVPSNNQKVSGKLGPCLLG 200


>ref|XP_006362097.1| PREDICTED: AT-rich interactive domain-containing protein 4-like
            [Solanum tuberosum]
          Length = 770

 Score =  264 bits (675), Expect = 6e-68
 Identities = 130/206 (63%), Positives = 153/206 (74%)
 Frame = +1

Query: 784  MFHAQGPLKNTCSLLAVLCGKVSDLKPKPGFSGKPANYPFPELSSSGRLEVHTLTKPTID 963
            MFH QG  + +CSLLAVLCG  S+   K         Y FPE+ SSGRLEV  L  P+ D
Sbjct: 1    MFHCQGTSRQSCSLLAVLCGSTSEYDQKKDVHDGKPRYCFPEIVSSGRLEVQVLKNPSTD 60

Query: 964  EFRRVLESTEPSIVYLQGELLSNSEEIGSLVWGGVQLSTPEAISGLFGSTLPTTVYLELP 1143
            EF +VL+S +P+IVYLQGE LSN +E+GSLVWGG+ LS+ EAISGLF S LPT VYLELP
Sbjct: 61   EFHKVLDSWQPNIVYLQGEHLSN-DEVGSLVWGGLDLSSAEAISGLFSSALPTAVYLELP 119

Query: 1144 NGEKLAEAIYSKGVPYVIYWKNTFSLYAASHFRLALLSVVQSSCSHTWDAFQLAHASFRL 1323
            NGEKLAEA+++KG+PYV+YWK+ FS YAASHFR A L V QSS  H WDAFQLA ASFRL
Sbjct: 120  NGEKLAEALHAKGIPYVMYWKSAFSCYAASHFRHAFLCVAQSSTCHVWDAFQLAQASFRL 179

Query: 1324 YCTRNNYALPANGAKNSGMSGPHLLG 1401
            YC +NN+ LP    ++S   GPHLLG
Sbjct: 180  YCVQNNFVLPEMSQRDSDNMGPHLLG 205


>ref|XP_007217035.1| hypothetical protein PRUPE_ppa001668mg [Prunus persica]
            gi|462413185|gb|EMJ18234.1| hypothetical protein
            PRUPE_ppa001668mg [Prunus persica]
          Length = 783

 Score =  260 bits (665), Expect = 9e-67
 Identities = 134/207 (64%), Positives = 159/207 (76%), Gaps = 1/207 (0%)
 Frame = +1

Query: 784  MFHAQGPLKNTCSLLAVLCGKVSDLKPKPGFSGKPANYPFPELSSSGRLEVHTLTKPTID 963
            M H+QG  K TCSLL V CGK+S+ KP      +   YPFPEL S GRLEV TLTKP+ +
Sbjct: 1    MNHSQGASKQTCSLLVVTCGKISEEKPNEDTLDEKLKYPFPELVSLGRLEVQTLTKPSKE 60

Query: 964  EFRRVLESTEPSIVYLQGELLSNSEEIGSLVWGGVQLSTPEAISGLFGSTLPTTVYLELP 1143
            EF ++LES +P++VYLQGE L N+E IGS VW  V LST EAIS +F +TLPTTVYLE+P
Sbjct: 61   EFCKMLESYKPNLVYLQGEQLENNE-IGSPVWEDVDLSTAEAISEIFSATLPTTVYLEVP 119

Query: 1144 NGEKLAEAIYSKGVPYVIYWKNTFSLYAASHFRLALLSVVQSSCSHTWDAFQLAHASFRL 1323
            NGE LA A++SKG+PYVIYWK+ FS YAA HFR ALLSVVQSS +HTWDAFQLA+ASFRL
Sbjct: 120  NGENLAAALHSKGIPYVIYWKHEFSSYAACHFRHALLSVVQSSSTHTWDAFQLAYASFRL 179

Query: 1324 YCTRNNYALPANGAKNSGMS-GPHLLG 1401
            YC  N++A+PAN  K+S    GP LLG
Sbjct: 180  YCVENSHAIPANRHKSSSAELGPCLLG 206


>gb|EYU21278.1| hypothetical protein MIMGU_mgv1a001736mg [Mimulus guttatus]
          Length = 767

 Score =  248 bits (633), Expect = 5e-63
 Identities = 117/186 (62%), Positives = 149/186 (80%)
 Frame = +1

Query: 784  MFHAQGPLKNTCSLLAVLCGKVSDLKPKPGFSGKPANYPFPELSSSGRLEVHTLTKPTID 963
            MFH QG LKNTC+LLAVLC + ++ K       +  N+PFPE+ SSGRLEV TL  PT+D
Sbjct: 1    MFHTQGALKNTCNLLAVLCNRAAENKHSQNVLDERPNFPFPEIVSSGRLEVQTLKNPTVD 60

Query: 964  EFRRVLESTEPSIVYLQGELLSNSEEIGSLVWGGVQLSTPEAISGLFGSTLPTTVYLELP 1143
            EF +VL+S++ ++VYLQGE L N ++IGS+VWGG +LS+PEAI+GLF S LPTTVYLE+P
Sbjct: 61   EFSKVLDSSQANLVYLQGEHLEN-DKIGSIVWGGFELSSPEAITGLFNSKLPTTVYLEVP 119

Query: 1144 NGEKLAEAIYSKGVPYVIYWKNTFSLYAASHFRLALLSVVQSSCSHTWDAFQLAHASFRL 1323
            NGE+LA++++SKG+PYVIYW N+FS Y ASHFR AL S +QSS  HTWD+F+LA ASFRL
Sbjct: 120  NGERLAKSLHSKGIPYVIYWNNSFSCYEASHFRHALFSSIQSSSCHTWDSFKLADASFRL 179

Query: 1324 YCTRNN 1341
            +C R N
Sbjct: 180  HCLRGN 185


>ref|XP_007012523.1| ARID/BRIGHT DNA-binding domain-containing protein isoform 4, partial
            [Theobroma cacao] gi|508782886|gb|EOY30142.1| ARID/BRIGHT
            DNA-binding domain-containing protein isoform 4, partial
            [Theobroma cacao]
          Length = 540

 Score =  248 bits (632), Expect = 6e-63
 Identities = 120/164 (73%), Positives = 135/164 (82%)
 Frame = +1

Query: 910  LSSSGRLEVHTLTKPTIDEFRRVLESTEPSIVYLQGELLSNSEEIGSLVWGGVQLSTPEA 1089
            L+SSGRLEV  L  P IDE RRVLESTEP++VYLQGE  ++SEEIG L+WG V LSTPE 
Sbjct: 1    LASSGRLEVQLLNSPNIDELRRVLESTEPNVVYLQGEQNADSEEIGPLIWGDVDLSTPET 60

Query: 1090 ISGLFGSTLPTTVYLELPNGEKLAEAIYSKGVPYVIYWKNTFSLYAASHFRLALLSVVQS 1269
            + GLF STLPTTVYLE PNG+KLAEA++S+GVPYVIYWKNTFS +AA HFR ALLSV+QS
Sbjct: 61   LCGLFDSTLPTTVYLETPNGDKLAEALHSQGVPYVIYWKNTFSRFAACHFRQALLSVIQS 120

Query: 1270 SCSHTWDAFQLAHASFRLYCTRNNYALPANGAKNSGMSGPHLLG 1401
            SCSHTWDAFQLAHASFRLYC RNN  + +N  K S   GP LLG
Sbjct: 121  SCSHTWDAFQLAHASFRLYCVRNNNVVSSNSQKQSVKPGPRLLG 164


>ref|XP_007012522.1| ARID/BRIGHT DNA-binding domain-containing protein isoform 3, partial
            [Theobroma cacao] gi|508782885|gb|EOY30141.1| ARID/BRIGHT
            DNA-binding domain-containing protein isoform 3, partial
            [Theobroma cacao]
          Length = 708

 Score =  248 bits (632), Expect = 6e-63
 Identities = 120/164 (73%), Positives = 135/164 (82%)
 Frame = +1

Query: 910  LSSSGRLEVHTLTKPTIDEFRRVLESTEPSIVYLQGELLSNSEEIGSLVWGGVQLSTPEA 1089
            L+SSGRLEV  L  P IDE RRVLESTEP++VYLQGE  ++SEEIG L+WG V LSTPE 
Sbjct: 1    LASSGRLEVQLLNSPNIDELRRVLESTEPNVVYLQGEQNADSEEIGPLIWGDVDLSTPET 60

Query: 1090 ISGLFGSTLPTTVYLELPNGEKLAEAIYSKGVPYVIYWKNTFSLYAASHFRLALLSVVQS 1269
            + GLF STLPTTVYLE PNG+KLAEA++S+GVPYVIYWKNTFS +AA HFR ALLSV+QS
Sbjct: 61   LCGLFDSTLPTTVYLETPNGDKLAEALHSQGVPYVIYWKNTFSRFAACHFRQALLSVIQS 120

Query: 1270 SCSHTWDAFQLAHASFRLYCTRNNYALPANGAKNSGMSGPHLLG 1401
            SCSHTWDAFQLAHASFRLYC RNN  + +N  K S   GP LLG
Sbjct: 121  SCSHTWDAFQLAHASFRLYCVRNNNVVSSNSQKQSVKPGPRLLG 164


>ref|XP_003546529.2| PREDICTED: AT-rich interactive domain-containing protein 4-like
            [Glycine max]
          Length = 751

 Score =  247 bits (630), Expect = 1e-62
 Identities = 131/210 (62%), Positives = 148/210 (70%), Gaps = 3/210 (1%)
 Frame = +1

Query: 781  MMFHAQGPLKNTCSLLAVLCGKVSDLKPKPG---FSGKPANYPFPELSSSGRLEVHTLTK 951
            MM H+QG  ++ CSLLAVL GK  D+K K      S     YPFPELSSSGRLEV  LTK
Sbjct: 1    MMLHSQGVSRH-CSLLAVLSGKSRDIKQKQKQGTASEDQFPYPFPELSSSGRLEVKVLTK 59

Query: 952  PTIDEFRRVLESTEPSIVYLQGELLSNSEEIGSLVWGGVQLSTPEAISGLFGSTLPTTVY 1131
            PT DE  R LE  +P  VYLQG+ L +SEEIG LVW    LS PEA+ GLF S +P TVY
Sbjct: 60   PTADELGRSLEQLQPDFVYLQGQQLEDSEEIGPLVWEDFDLSVPEALCGLFSSKIPNTVY 119

Query: 1132 LELPNGEKLAEAIYSKGVPYVIYWKNTFSLYAASHFRLALLSVVQSSCSHTWDAFQLAHA 1311
            LE P GEKLAEA+ +KGVPY IYWKN FS YAASHFR +  SV QS+ SHTWDAFQLA A
Sbjct: 120  LETPKGEKLAEALLNKGVPYTIYWKNDFSKYAASHFRHSFFSVAQSTSSHTWDAFQLALA 179

Query: 1312 SFRLYCTRNNYALPANGAKNSGMSGPHLLG 1401
            SFRLYC +NN  LP+N  K +G  GP +LG
Sbjct: 180  SFRLYCVQNN-VLPSNSHKGAGKLGPQILG 208


>ref|XP_003547888.1| PREDICTED: AT-rich interactive domain-containing protein 4-like
            [Glycine max]
          Length = 782

 Score =  246 bits (628), Expect = 2e-62
 Identities = 126/205 (61%), Positives = 151/205 (73%)
 Frame = +1

Query: 787  FHAQGPLKNTCSLLAVLCGKVSDLKPKPGFSGKPANYPFPELSSSGRLEVHTLTKPTIDE 966
            FH+QG  K+TC+LLAV C + S  + K   S     YPFPEL S+GRLEV TL  P  ++
Sbjct: 4    FHSQGTPKHTCTLLAVTC-RTSSAEHK--LSHAQRTYPFPELVSAGRLEVQTLCSPEKEQ 60

Query: 967  FRRVLESTEPSIVYLQGELLSNSEEIGSLVWGGVQLSTPEAISGLFGSTLPTTVYLELPN 1146
            FR+VLES +P+ VYL+G+ L N E +GSLVW GV+LST E I+ LFGSTLPT VYLE+PN
Sbjct: 61   FRKVLESFQPNFVYLRGDQLENGE-VGSLVWQGVELSTCEDITELFGSTLPTAVYLEIPN 119

Query: 1147 GEKLAEAIYSKGVPYVIYWKNTFSLYAASHFRLALLSVVQSSCSHTWDAFQLAHASFRLY 1326
            GE  AEA++ KG+PYVI+WKNTFS YAA HFR A LSVVQSS +HTWDAF LA ASF LY
Sbjct: 120  GESFAEALHLKGIPYVIFWKNTFSCYAACHFRQAFLSVVQSSSTHTWDAFHLARASFELY 179

Query: 1327 CTRNNYALPANGAKNSGMSGPHLLG 1401
            C +NN  LP++    S   GPHLLG
Sbjct: 180  CVQNNQVLPSDSDDASSEMGPHLLG 204


>ref|XP_003595365.1| Fiber protein-like protein [Medicago truncatula]
            gi|355484413|gb|AES65616.1| Fiber protein-like protein
            [Medicago truncatula]
          Length = 844

 Score =  244 bits (623), Expect = 7e-62
 Identities = 131/214 (61%), Positives = 147/214 (68%), Gaps = 7/214 (3%)
 Frame = +1

Query: 781  MMFHAQGPLKNTCSLLAVLCGKVSDLKPKP-------GFSGKPANYPFPELSSSGRLEVH 939
            MMFH QG  ++ CSLLAVL GK  D K K            + ++YPFPELSSSGRLEV 
Sbjct: 1    MMFHFQGVSRH-CSLLAVLSGKSHDSKQKQKQKQDDDASEDQFSSYPFPELSSSGRLEVK 59

Query: 940  TLTKPTIDEFRRVLESTEPSIVYLQGELLSNSEEIGSLVWGGVQLSTPEAISGLFGSTLP 1119
             LTKPT DE  RVLE  +P  VYLQG+ L +S EIGSLVW    LSTPEA+ GLF S LP
Sbjct: 60   VLTKPTFDELARVLEQLQPDFVYLQGQQLDDSGEIGSLVWEDFDLSTPEALCGLFSSKLP 119

Query: 1120 TTVYLELPNGEKLAEAIYSKGVPYVIYWKNTFSLYAASHFRLALLSVVQSSCSHTWDAFQ 1299
             TVYLE P GEKLAEA++SKGVPY IYWKN FS  AASHF  A  SV QS+ SHTWDAFQ
Sbjct: 120  NTVYLETPKGEKLAEALHSKGVPYTIYWKNEFSKSAASHFHQAFFSVAQSTSSHTWDAFQ 179

Query: 1300 LAHASFRLYCTRNNYALPANGAKNSGMSGPHLLG 1401
            LA +SFRLYC +N   +P N  K S   GP +LG
Sbjct: 180  LAQSSFRLYCVQNE-VIPHNSQKGSDKVGPKILG 212


>ref|XP_006828651.1| hypothetical protein AMTR_s00129p00111730 [Amborella trichopoda]
            gi|548833441|gb|ERM96067.1| hypothetical protein
            AMTR_s00129p00111730 [Amborella trichopoda]
          Length = 810

 Score =  244 bits (622), Expect = 9e-62
 Identities = 122/198 (61%), Positives = 149/198 (75%)
 Frame = +1

Query: 808  KNTCSLLAVLCGKVSDLKPKPGFSGKPANYPFPELSSSGRLEVHTLTKPTIDEFRRVLES 987
            K +C LL VLCGK SD + +     +P  YPFPEL SSGRLEV  +T P+ +EF+RVLES
Sbjct: 12   KQSCILLGVLCGKRSDKEKQENAEDRPV-YPFPELVSSGRLEVQIITNPSSEEFKRVLES 70

Query: 988  TEPSIVYLQGELLSNSEEIGSLVWGGVQLSTPEAISGLFGSTLPTTVYLELPNGEKLAEA 1167
            ++   VYLQGE   + +E+G LV G V +S+ +AI+ LFGS LP+TVYLE+PNGEKLAEA
Sbjct: 71   SDFDFVYLQGEQSLHKDEVGPLVLGDVNISSADAITRLFGSKLPSTVYLEIPNGEKLAEA 130

Query: 1168 IYSKGVPYVIYWKNTFSLYAASHFRLALLSVVQSSCSHTWDAFQLAHASFRLYCTRNNYA 1347
            ++SKGVPYVIYW+++FS YAA HFR AL+S +QSS  HTWD FQLA ASFRLYC RNN+ 
Sbjct: 131  LHSKGVPYVIYWRHSFSCYAACHFRQALVSTLQSSSCHTWDVFQLAQASFRLYCVRNNHN 190

Query: 1348 LPANGAKNSGMSGPHLLG 1401
            L  NG K SG  GP LLG
Sbjct: 191  LVLNGQKVSGKLGPRLLG 208


>ref|XP_007138611.1| hypothetical protein PHAVU_009G223200g [Phaseolus vulgaris]
            gi|561011698|gb|ESW10605.1| hypothetical protein
            PHAVU_009G223200g [Phaseolus vulgaris]
          Length = 753

 Score =  243 bits (621), Expect = 1e-61
 Identities = 130/210 (61%), Positives = 146/210 (69%), Gaps = 3/210 (1%)
 Frame = +1

Query: 781  MMFHAQGPLKNTCSLLAVLCGKVSDLKPKPGFSGKPAN---YPFPELSSSGRLEVHTLTK 951
            MMFH+QG  ++ CSLLAVL GK+ D K K   S    +   YPFPELSSSG LEV  L K
Sbjct: 1    MMFHSQGASRH-CSLLAVLSGKLRDTKQKQKQSAASEDQPPYPFPELSSSGSLEVKLLIK 59

Query: 952  PTIDEFRRVLESTEPSIVYLQGELLSNSEEIGSLVWGGVQLSTPEAISGLFGSTLPTTVY 1131
            P  DE  R LE  +P  VYLQG+LL +  EIG LVW    LS PEA+ GLFGS LP TVY
Sbjct: 60   PNTDELGRALEQLQPDFVYLQGQLLEDRGEIGPLVWEDFDLSAPEALCGLFGSKLPNTVY 119

Query: 1132 LELPNGEKLAEAIYSKGVPYVIYWKNTFSLYAASHFRLALLSVVQSSCSHTWDAFQLAHA 1311
            LE P GEKLAEA+ +KGVPY IYWKN FS YAASHFR +  SV QS+ SHTWDAFQLA A
Sbjct: 120  LETPKGEKLAEALRNKGVPYTIYWKNDFSKYAASHFRHSFFSVGQSTSSHTWDAFQLALA 179

Query: 1312 SFRLYCTRNNYALPANGAKNSGMSGPHLLG 1401
            SFRLYC +NN  LP+N  K  G  GP +LG
Sbjct: 180  SFRLYCIQNN-VLPSNSQKGVGKLGPQILG 208


>ref|XP_002516200.1| DNA binding protein, putative [Ricinus communis]
            gi|223544686|gb|EEF46202.1| DNA binding protein, putative
            [Ricinus communis]
          Length = 749

 Score =  242 bits (618), Expect = 3e-61
 Identities = 119/164 (72%), Positives = 135/164 (82%)
 Frame = +1

Query: 910  LSSSGRLEVHTLTKPTIDEFRRVLESTEPSIVYLQGELLSNSEEIGSLVWGGVQLSTPEA 1089
            L SSGRLEV  L+ P+ DEFRRVL+S+EP+IVYLQGE++ +SEEIGSL W G  LSTP+A
Sbjct: 43   LXSSGRLEVQILSSPSTDEFRRVLQSSEPNIVYLQGEIIEDSEEIGSLRWAGADLSTPDA 102

Query: 1090 ISGLFGSTLPTTVYLELPNGEKLAEAIYSKGVPYVIYWKNTFSLYAASHFRLALLSVVQS 1269
            +  LFGSTLP TVYLE+PNGEKLAEA++ KGVPYVIYWK+TFS YAA+HFR ALLSVVQS
Sbjct: 103  LCELFGSTLPPTVYLEIPNGEKLAEALHFKGVPYVIYWKSTFSCYAAAHFRQALLSVVQS 162

Query: 1270 SCSHTWDAFQLAHASFRLYCTRNNYALPANGAKNSGMSGPHLLG 1401
            SCSHT DAFQLAHASF LYC RNN  L +N  K  G  GP LLG
Sbjct: 163  SCSHTCDAFQLAHASFSLYCVRNNTGLSSNNQKVGGKPGPRLLG 206

