- Source: PROSER3
Gene
Chromosome 19 open reading frame 55 (C19orf55), also known as PROSER3 or Proline And Serine-Rich Protein 3, is an uncharacterized human protein-coding gene. PROSER3 (accession: NM_001039887) is found on the plus strand of chromosome 19 at gene locus q.13.12 at the span of 12,991 base pairs, see Figure "Ideogram of chromosome 19 showing PROSER3 in Humans". There are no paralogs of this gene, and it does not interact with any of its immediate gene neighborhood. In an analysis of human tissue expression, PROSER3 was found to be expressed ubiquitously at low levels with up to 30-fold expression level variation among tissues. PROSER3 takes part in regulating cell growth and apoptosis in papillary thyroid cancer (PTC) cells, and targeting it can enhance treatment efficacy and decrease PTC recurrence. The probability of PROSER3 having a role in centrosome function is considerable.
mRNA
There are 10 exons contained within PROSER3’s mRNA sequence as well as 28 different mRNA variations. Some features of isoform 2 include a major polyA site, an alternative polyA site, and 3 disordered regions once translated.5
Protein
The Proline and Serine-rich 3 protein in Homo sapiens is encoded by the PROSER3 gene. A FASTA translation in NCBI reveals that the protein-coding sequence produces 101 serine and 116 proline amino acids out of a total of 724 amino acids within that protein-coding sequence. The protein is localized to the Golgi apparatus as well as co-localized with γ-tubulin or pericentrin (PCNT) in the centrosome, suggesting Proline and Serine-rich 3 is a centrosome component.5, In mammalian cells, the centrosome and Golgi apparatus are closely positioned during interphase and have a functional relationship. The Golgi apparatus transports vesicles via microtubules arranged by the centrosome, and the positioning of the centrosome influences the organization of the Golgi apparatus, which is critical for cell polarization and migration and suggests potential significance of PROSER3's role in it. The PROSER3 protein has two partial homologous amino acid sequences that are well conserved in bacteria and viruses. One is the UL36 tegument protein, while the other is the DNA polymerase III subunit gamma/tau.
The UL36 tegument protein, which is present in viruses such as the Herpes Simplex Virus 1, includes residues linked to catalysis and is known to actively deubiquitinate host cells in vitro. Ubiquitination is essential for proteolysis and cell cycle control, both of which play vital roles in centrosome formation.
DNA polymerase III subunits gamma/tau in E. coli create a highly effective replication apparatus that concurrently duplicates both strands of the duplex DNA by assembling the core interface that joins two DNA polymerases with a single clamp-loading complex that is responsible for quick and precise DNA replication. In order to control microtubule stability, dynamics, and cytoskeletal architecture, tau, a microtubule-associated protein, possesses microtubule-binding domains that rely on sequence motifs and helices to mediate interactions with tubulin.
Gene Ontology
Mature microRNA hsa-miR-212-5p is involved in miRNA-mediated post-transcriptional gene silencing, negative regulation of canonical Wnt signaling pathway, and enables mRNA base-pairing translational repressor activity.
Evolutionary Aspects
PROSER3 retains sequence identity exclusively with the domain Eukarya, specifically the animal kingdom and, more precisely, one of the oldest invertebrates: sponges. The bacteria and archaea domains have no sequence identity, nor do the protists, plants, or fungi kingdoms. It is expected that the PROSER3 gene first arose around 758 million years ago. PROSER3 has no gene family.
PROSER3 orthologs are found in every extant class of vertebrates except birds and crocodiles, and partial orthologs are found in certain invertebrates. In decreasing order of divergence, these invertebrates include Sponges (Porifera), Mollusks, Cnidaria, and Echinoderms.
= Orthologs
=Non-primate orthologs of PROSER3 were found below 60% sequence identity and 64% sequence similarity. Although the Mammalia class has an identity and similarity score of more than 30%, the range is very wide, going from 30% to 99%. Taxonomic classes prior to Mammalia have lower similarity and identity scores, although this range is narrower despite the larger divergence (439 million years ago) compared to Mammalia's 160 million years of diversification.
= Protein Divergence
=In the figure titled "Informational context of PROSER3 human protein..." the human PROSER3 protein, in contrast to the conserved m cytochrome C—as seen by its lower slope— is evolving quickly over time similar to fibrinogen alpha. However, fibrinogen alpha continues to evolve more quickly than the PROSER3 protein.
= Multiple Sequence Alignment
=Exon 2 at amino acid K is in a highly conserved/unchanged amino acid sequence VVAKYI between aa 170 and 180. The following are highly conserved unchanged amino acid sequences even in distant orthologs: FWWL at aa 206 and DDILYQWR…RRKLEQA at aa 463. The figures titled "Multiple Sequence Alignment of Distant PROSER3 Orthologs" show a snippet of the most important conserved regions in the PROSER3 protein.
Protein Analysis
All proteins that are known to physically interact with PROSER3 exhibit a shared consensus. Based on their functions and subcellular localizations, PROSER3 plays a role in centriole biogenesis and the organization of microtubules. PROSER3 protein's subcellular location is likely on top of the Golgi apparatus. Further research into microtubule organization and location reveals that there are two microtubule populations, axonemal microtubules and cytoplasmic microtubules. Microtubule plus ends grow out from the microtubule-organizing center nucleated by the basal body and the minus ends are tethered to the adjacent centriole at the base of the connecting cilium. There are only four types of proteins that interact non-physically. The three proteins CEP128, CEP135, and NINL are proximity-based interaction types, while MIF is an association-type interaction. Considering the subcellular location of these interaction proteins is either near or in the same area where PROSER3 protein localizes, and because the activities of these interaction proteins are extremely similar to the hypothesized function of PROSER3 protein, there is a strong probability that these protein interactions with PROSER3 protein are real. Proving these interactions could be done with co-immunoprecipitation to pull the target protein out of a cell lysate along with any proteins bound to it, forming a protein complex that can then be analyzed by western blotting. Additionally, co-expression of the Proline and Serine-rich 3 protein is observed with proteins coded by the genes C19orf44, CFAP100, DNAAF3, CCDC38, and C10orf67, indicating these proteins may be functionally related or involved in similar biological processes. Co-expression between PROSER3 and C19orf44 is accompanied by a strong correlation in the co-occurence, indicating a potential functional link between the two.
Conceptual Translation
Depicted on the right is a PDF for the conceptual translation of the human PROSER3 isoform 2 (full mRNA) with annotations to the right-hand side.
SNPs
There are 6413 SNPs catalogued in NCBI's Variation Viewer, none of which have publications or clinical significance records. rs1187661251 is a single nucleotide variant that overlaps with the conserved region VVAKYI and, when mutated, prevents the RNA-binding protein FUS from binding. Potential resulting side effects may include promotion of neurodegenerative diseases like ALS or FTLD, neuronal dysfunction, decreased damage response signaling and repair, and RNA dysregulation. rs1971047914 is a C insertion variant overlapping the transcription factor ZNF384 (C2H2 zinc finger factors) and RNA-binding protein ELAVL2 which may have potential significance for chromatin remodeling and neuronal differentiation/function. If mutated by the C insertion, ZNF384 would no longer be able to bind to the DNA sequence and downregulate the cell’s ability to remodel chromatin, initiating or promoting oncogenesis. Additionally, ELAVL2 would lose its ability to bind, which could result in risk-associated diseases such as frontotemporal dementia/ALS and lung cancer.
Further Links and Articles
AlphaFold can be used to predict the tertiary structure of the PROSER3 protein.