ALL Metrics
-
Views
-
Downloads
Get PDF
Get XML
Cite
Export
Track
Observation Article

Possible identification of CENP-C in fish and the presence of the CENP-C motif in M18BP1 of vertebrates.

[version 1; peer review: 1 approved with reservations]
PUBLISHED 05 Aug 2015
Author details Author details
OPEN PEER REVIEW
REVIEWER STATUS

Abstract

The centromeric protein CENP-C is a base component of the kinetochore. This protein, along with CENP-A has been shown to adaptively evolve in a number of animal and plant species. In order to determine if CENP-C also evolves in fish species, I attempted to retrieve fish CENP-C sequences from GenBank. No Teleostei CENP-C sequences were found either by name or by BLASTP searches with the vertebrate CENP-C motif sequence. A number of putative Teleostei protein sequences were identified in GenBank that have homology to the C-terminal cupin domain of vertebrate CENP-C. These proteins only have partial homology to the CENP-C motif, but evidence is presented that makes it likely that these fish proteins are orthologs of CENP-C. Interestingly, it was also discovered that the CENP-C motif sequence is also mostly present in M18BP1 proteins of fish and some other vertebrates but not in mammals. This finding may have implications for CENP-C and M18BP1 assembly in centromeric regions of different vertebrate taxa.

Keywords

CENP-C, M18BP1, centromeric proteins, teleostei, kinetochore, CENP-C motif, cupin domain protein

Introduction

The kinetochore is a structure that connects chromosomal centromeric DNA to microtubules during mitosis and meiosis1. The centromere is epigenetically defined by the deposition of nucleosomes that contain the histone H3 variant CENP-A2. Centromeric protein CENP-C is required for both the recruitment of new CENP-A to the centromeric region as well as the initial assembly of the kinetochore. The CENP-C protein is generally considered to be ubiquitous in all eukaryotic taxa since homologs of CENP-C have been identified in yeast3 and Drosophila4 as well as many plants and vertebrates5. While CENP-C evolves so rapidly that very little homology is observed between distantly related taxa, a conserved CENP-C motif has been identified across all lineages studied5. This conserved motif should, therefore, be of utility to identify CENP-C orthologs in other species.

CENP-A has been initially shown to evolve adaptively in Drosophila6, in members of the Bressicaceae family7 and more recently in primates8 and in percid fishes9,10. CENP-C has also been shown to evolve adaptively in a number of animal and plant species5 as well as in primates8. In an effort to determine if CENP-C also evolves adaptively in fish species, searches were conducted in GenBank for Teleostei proteins that had been already identified as CENP-C or for genes that had been annotated as coding for CENP-C. No such fish proteins or genes were found. BLASTP searches of just the Teleostei subset of GenBank were performed with the conserved vertebrate CENP-C motif and these too failed to find identified fish CENP-C proteins or genes. However, these searches did identify fish M18BP1, which, as will be discussed below, contains a sequence homologous to the CENP-C motif. A search of the Chondrichthyes for proteins that had been already identified as CENP-C, or for genes that had been annotated as coding for CENP-C, found one gene annotated as CENP-C in the elephant shark (Callorhinchus milii) genome. Interestingly, while this putative shark CENP-C protein contained a cupin domain at the C-terminal end homologous to the cupin domain found at the C-terminal end of other vertebrate CENP-C proteins, the conserved CENP-C motif was not found in the expected location upstream of the cupin domain. However, a region further upstream does have homology to some of the most conserved amino acids of the CENP-C motif.

BLASTP searches of the Teleostei subset of GenBank with the putative shark CENP-C protein sequence identified a number of genes that were primarily homologous to the C-terminal cupin domain. However, none of these fish sequences were annotated as CENP-C in GenBank. Upon closer analysis, as will be discussed below, these C-terminal cupin domain containing fish proteins do contain sequences that are partly homologous to the conserved CENP-C motif and, therefore, these fish genes could be CENP-C orthologs.

Methods

Standard BLASTP searches were performed on the NCBI blast server. The vertebrate CENP-C motif NVRRTKRXRLKPLEYWRGERVBY used in BLASTP searches in this study was obtained from an alignment of 25 species including amphibians, reptiles, birds and mammals (Supplementary File S1). Retrieved sequences were aligned with the MUSCLE alignment feature in Geneious (version 6.1) sequence analysis software.

Results and discussion

BLASTP searches with the vertebrate CENP-C motif identified CENP-C proteins from a variety of taxa, including plants, but did not identify any CENP-C in fish lineages. It is possible that CENP-C may be absent in fish, but the ubiquity of this protein in other lineages and the central role of this protein in centromeric function make this unlikely. A C-terminal cupin domain protein encoded by a shark gene annotated in GenBank as CENP-C was used to identify homologs in Teleostei genomes by BLASTP. The retrieved teleost fish homologs were annotated as either calponin homology domain containing protein, neurofilament heavy polypeptide-like protein, or myb-like protein. Within vertebrate CENP-C proteins the RxxRxxxxPLxYWxGERxxY sequence defines identities within the CENP-C motif located within about 100 amino acids upstream of the cupin domain (Figure 1). However, within the shark and teleost fish C-terminal cupin domain-containing protein sequences, only some of these CENP-C motif sequence identities were present (Figure 2) and, therefore, unambiguous identity of these proteins as CENP-C was not obvious.

d89eb324-a51f-447b-b0e7-1855b353f3d2_figure1.gif

Figure 1. Conserved domains and sequences in vertebrate CENP-C.

(A) Diagram of human CENP-C. (B) Amino acids that are identical in the CENPC motif in vertebrates in which CENP-C has been identified. (C) Conserved sequence in the CENP-C central region that is homologous to part of the CENP-C motif. Amino acid locations within human CENP-C protein of conserved sequences are indicated at the beginning and end of each sequence.

d89eb324-a51f-447b-b0e7-1855b353f3d2_figure2.gif

Figure 2. Conserved regions homologous to a portion of the CENP-C motif present in putative fish CENP-C.

(A) Diagram of putative Stegastes partitus CENP-C. (B) Alignment of fish central region sequences that contain the conserved RxxxxPxxWW sequence. (C) Alignment of the fish cupin domain proximal sequences that contain the conserved RxxxxPxxWW sequence. Amino acids matching the conserved sequence identities are highlighted in red. Amino acid locations within each species’ protein are indicated at the beginning and end of each sequence.

In a recent study that examined the interaction between CENP-C conserved domains and CENP-A containing nucleosomes (or nucleosomes containing histone H3 modified with a CENP-A C-terminal tail), Kato et al.11 identified within the conserved central region of CENP-C a RxSxxPSxWW consensus sequence (Figure 1) that is similar to the core portion of the CENP-C motif. Mutations of the arginine to alanine or the tryptophans to alanine in this sequence prevented the binding of this central region to the nucleosomes. So, functionally, the RxxxxPxxWW portion of the central region sequence is important to centromeric binding of CENP-C. Furthermore, mutations of the arginine, tyrosine and tryptophan in the core CENP-C motif RxxxxPxxYW also reduce the binding affinity the CENP-C to the nucleosomes11. A mutation of arginine to alanine in this core portion of the CENP-C motif was previously shown to prevent the binding of Xenopus CENP-C to centromeres12.

An alignment of the putative shark and teleost fish CENP-C proteins identified two conserved regions that contained the RxxxxPxxWW sequences (Figure 2). The placement of these sequences corresponds roughly to the locations of the central portion and the CENP-C motif of the vertebrate CENP-C (Figure 1). Therefore, it is likely that the combination of the C-terminal cupin domain and the presence of these centromeric nucleosome binding regions in positions generally corresponding to the locations of the central region and the CENP-C motif identifies these teleost genes as possible CENP-C orthologs. It will be necessary, of course, to verify if this protein is actually found at fish centromeres. It should be noted, however, that the distance between the cupin domain and the “CENP-C motif” position is about twice as long in the putative fish CENP-C in comparison to this distance in CENP-C of other vertebrates. It is interesting that the putative shark “CENP-C motif” location lacks the tryptophans of the RxxxxPxxWW sequence and that Poecilia reticulata has a replacement of the first tryptophan in the conserved central region sequence (Figure 2). However, depending on other factors acting in the assembly of the centromere in various taxa, it may be possible that just one of those conserved RxxxxPxxWW sequences may be necessary for centromeric binding of the putative fish CENP-C. Interestingly, no homology to the RxxxxPxxWW portion of the conserved central region is detectable in CENP-C of reptiles and birds.

Interestingly, BLASTP searches of the Teleostei subset of GenBank retrieved centromeric protein M18BP1 sequences. This protein is recruited to centromeres by CENP-C13,14 and along with centromeric proteins Mis18α and Mis18β functions in the recruitment of CENP-A to centromeres15. The M18BP1 protein contains almost the entire vertebrate CENP-C motif in all vertebrates examined except in mammals (Figure 3). It appears that the CENP-C motif sequence is not exclusive to just CENP-C. Since both CENP-C and M18BP1 associate with centromeres and with each other, it is tempting to speculate that what has generally been regarded as a CENP-C motif sequence facilitates the interaction of both of these proteins with centromeric nucleosomes. Furthermore, since mammalian M18BP1 lacks this CENP-C motif, it is possible that mammalian M18BP1 may be more dependent on association with CENP-C to localize to the centromere than the M18BP1 of other vertebrate taxa.

d89eb324-a51f-447b-b0e7-1855b353f3d2_figure3.gif

Figure 3. Alignment of regions of vertebrate CENP-C and M18BP1 that contain the CENP-C motif sequences.

The vertebrate CENP-C motif sequence identities (Figure 1A) are highlighted in red.

Comments on this article Comments (0)

Version 2
VERSION 2 PUBLISHED 05 Aug 2015
Comment
Author details Author details
Competing interests
Grant information
Copyright
Download
 
Export To
metrics
Views Downloads
F1000Research - -
PubMed Central
Data from PMC are received and updated monthly.
- -
Citations
CITE
how to cite this article
Kral LG. Possible identification of CENP-C in fish and the presence of the CENP-C motif in M18BP1 of vertebrates. [version 1; peer review: 1 approved with reservations] F1000Research 2015, 4:474 (https://doi.org/10.12688/f1000research.6823.1)
NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.
track
receive updates on this article
Track an article to receive email alerts on any updates to this article.

Open Peer Review

Current Reviewer Status: ?
Key to Reviewer Statuses VIEW
ApprovedThe paper is scientifically sound in its current form and only minor, if any, improvements are suggested
Approved with reservations A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.
Not approvedFundamental flaws in the paper seriously undermine the findings and conclusions
Version 1
VERSION 1
PUBLISHED 05 Aug 2015
Views
24
Cite
Reviewer Report 25 Sep 2015
Tatsuo Fukagawa, Graduate School of Frontier Biosciences, Osaka University, Osaka, Japan 
Approved with Reservations
VIEWS 24
The centromere protein CENP-C is well known as an essential component for functional kinetochore assembly. Due to importance of this molecule, CENP-C must be conserved in Fish species. The author performed BLASTP searches with the conserved CENP-C motif sequence, but ... Continue reading
CITE
CITE
HOW TO CITE THIS REPORT
Fukagawa T. Reviewer Report For: Possible identification of CENP-C in fish and the presence of the CENP-C motif in M18BP1 of vertebrates. [version 1; peer review: 1 approved with reservations]. F1000Research 2015, 4:474 (https://doi.org/10.5256/f1000research.7336.r10498)
NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.
  • Author Response 20 Jan 2016
    Leos Kral, Department of Biology, University of West Georgia, Carrollton, 30118, USA
    20 Jan 2016
    Author Response
    I have revised the manuscript to address concerns #2, #3 and #4. Unfortunately, I do not have the resources to carry out the localization experiment (concern #1). My main motivation ... Continue reading
COMMENTS ON THIS REPORT
  • Author Response 20 Jan 2016
    Leos Kral, Department of Biology, University of West Georgia, Carrollton, 30118, USA
    20 Jan 2016
    Author Response
    I have revised the manuscript to address concerns #2, #3 and #4. Unfortunately, I do not have the resources to carry out the localization experiment (concern #1). My main motivation ... Continue reading

Comments on this article Comments (0)

Version 2
VERSION 2 PUBLISHED 05 Aug 2015
Comment
Alongside their report, reviewers assign a status to the article:
Approved - the paper is scientifically sound in its current form and only minor, if any, improvements are suggested
Approved with reservations - A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.
Not approved - fundamental flaws in the paper seriously undermine the findings and conclusions
Sign In
If you've forgotten your password, please enter your email address below and we'll send you instructions on how to reset your password.

The email address should be the one you originally registered with F1000.

Email address not valid, please try again

You registered with F1000 via Google, so we cannot reset your password.

To sign in, please click here.

If you still need help with your Google account password, please click here.

You registered with F1000 via Facebook, so we cannot reset your password.

To sign in, please click here.

If you still need help with your Facebook account password, please click here.

Code not correct, please try again
Email us for further assistance.
Server error, please try again.