Analysis of gene expression profiles in HeLa cells in response to overexpression or siRNA-mediated depletion of NASP
Reproductive Biology and Endocrinologyvolume 7, Article number: 45 (2009)
NASP (Nuclear Autoantigenic Sperm Protein) is a linker histone chaperone required for normal cell division. Changes in NASP expression significantly affect cell growth and development; loss of gene function results in embryonic lethality. However, the mechanism by which NASP exerts its effects in the cell cycle is not understood. To understand the pathways and networks that may involve NASP function, we evaluated gene expression in HeLa cells in which NASP was either overexpressed or depleted by siRNA.
Total RNA from HeLa cells overexpressing NASP or depleted of NASP by siRNA treatment was converted to cRNA with incorporation of Cy5-CTP (experimental samples), or Cy3-CTP (control samples). The labeled cRNA samples were hybridized to whole human genome microarrays (Agilent Technologies, Wilmington, Delaware, USA). Various gene expression analysis techniques were employed: Significance Analysis of Microarrays (SAM), Expression Analysis Systematic Explorer (EASE), and Ingenuity Pathways Analysis (IPA).
From approximately 36 thousand genes present in a total human genome microarray, we identified a set of 47 up-regulated and 7 down-regulated genes as a result of NASP overexpression. Similarly we identified a set of 56 up-regulated and 71 down-regulated genes as a result of NASP siRNA treatment. Gene ontology, molecular network and canonical pathway analysis of NASP overexpression demonstrated that the most significant changes were in proteins participating in organismal injury, immune response, and cellular growth and cancer pathways (major "hubs": TNF, FOS, EGR1, NFκB, IRF7, STAT1, IL6). Depletion of NASP elicited the changed expression of proteins involved in DNA replication, repair and development, followed by reproductive system disease, and cancer and cell cycle pathways (major "hubs": E2F8, TP53, FGF, FSH, FST, hCG, NFκB, TRAF6).
This study has demonstrated that NASP belongs to a network of genes and gene functions that are critical for cell survival. We have confirmed the previously reported interactions between NASP and HSP90, HSP70, histone H1, histone H3, and TRAF6. Overexpression and depletion of NASP identified overlapping networks that included TNF as a core protein, confirming that both high and low levels of NASP are detrimental to cell cycle progression. Networks with cancer-related functions had the highest significance, however reproductive networks containing follistatin and FSH were also significantly affected, which confirmed NASP's important role in reproductive tissues. This study revealed that, despite some overlap, each response was associated with a unique gene signature and placed NASP in important cell regulatory networks.
Nuclear autoantigenic sperm protein (NASP) is highly expressed in all dividing cells including embryonic and malignant tissues as either tNASP (testicular/embryonic isoform) or sNASP (somatic/embryonic isoform). Human tNASP contains three functional histone binding sites; sNASP is the shorter version of tNASP with two deletions in the coding region arising from alternative splicing, resulting in two histone binding sites. In embryonic and transformed cell lines both sNASP and tNASP are expressed and tNASP is present in a variety of malignant tumors . Different cancer types as well as different stages of the same cancer demonstrate specific NASP RNA expression profiles, for example expression levels are up regulated in grade 1 and 2 versus grade 3 in breast cancer , estrogen receptor positive versus negative tumor types, or sporadic versus BRCA1/BRCA2 mutation positive tumors . Thus, the NASP expression profile could be used to establish the "poor prognosis signature" which consists of genes regulating cell cycle, invasion, metastasis and angiogenesis . Indeed, NASP has been reported as a serologic marker for ovarian cancer, which could be suitable for clinical testing in high-risk populations .
NASP has interactions with a variety of chromatin remodeling proteins: 1) linker H1 histones , 2) CAF1(p150, p60, p48) and HIRA in DNA synthesis dependent or independent nucleosome assembly pathways promoted by histone H3.1 and H3.3 complexes [6, 7], 3) proteins involved in DNA repair (NASP is phosphorylated after DNA damage by irradiation of U2OS cells  and binds to Ku70/Ku80 and DNA PK in HeLa cells ). As a linker histone H1 chaperone, NASP binds linker histones in the cytoplasm and transports them into the nucleus , where NASP facilitates the incorporation of linker histones onto nucleosome arrays . The present list of NASP interacting proteins is far from complete because Sun et al.  reported 356 network connectivity episodes for NASP in developing embryonic stem cells, suggesting numerous direct protein-protein interactions.
NASP is a tightly regulated cell cycle protein because both increased levels of NASP by overexpression  and decreased levels induced by siRNA treatment  cause disruption of the cell cycle. NASP mRNA levels increase during S-phase and decline during G2concomitant with histone mRNA levels  and NASP is required for cell survival because the NASP-/- null mutation causes embryonic lethality . Because HeLa cells have well studied signaling pathways and are easy to transfect, we chose them to study the general effect of increased or decreased NASP levels. Those networks and pathways that are involved in reproductive physiology were identified and will be further studied in reproductive tissues.
All chemicals and reagents used in this study were molecular biology grade. Restriction enzymes were purchased from Roche Applied Science (Indianapolis, Indiana, USA). Purification of plasmid DNA and PCR products was carried out using QIAprep Miniprep and QIAquick PCR purification kits (Qiagen, Valencia, California, USA); sequencing was performed at the University of North Carolina at Chapel Hill automated sequencing facility. Goat antiserum to full-length human recombinant tNASP (GenBank: AAH10105) was made by Bethyl Laboratories (Montgomery, Texas, USA).
The entire coding sequence of mouse tNASP (nucleotides 92–2405, GenBank: AF034610) was amplified from mouse testis Quick-clone cDNA (Clontech, Palo Alto, California, USA) using the Expand High Fidelity PCR System (Roche Applied Science, Indianapolis, Indiana, USA) and cloned into a Kpn1/BamH1 site in the pEGFP-N1 vector (Clontech, Palo Alto, California, USA) which contains the sequence for the green fluorescent protein. To prevent possible effects of the expressed GFP , the GFP sequence was removed. All constructs were sequenced to verify the correct reading frame. Plasmid-DNA complexes were transiently transfected into HeLa cells using Effectene transfection reagent (Qiagen, Valencia, California, USA) according to the manufacturer's recommendations. This method is based on a non-liposomal lipid formulation, and resulted in low cytotoxicity and high transfection efficiency (~97%) as determined by FACS analysis . Control cells were transfected with the transfection reagent only (Effectene transfection reagent, Qiagen, Valencia, California, USA). This choice of control cell treatment was based on the data that "mock" transfection with lipid formulation transfection reagents could consistently affect gene expression . Transfection efficiency was confirmed by Western blotting: lysates from HeLa cells overexpressing tNASP were separated by SDS-PAGE, blotted and probed with goat anti-NASP affinity purified antibody.
A series of siRNAs targeting the human NASP open reading frame were designed (Dharmacon, Lafayette, CO) and one (GCACAGUUCAGCAAAUCUAdTdT) was found to effectively deplete both tNASP and sNASP from HeLa cells . Transfection with C2 siRNA, which had no cellular target, served as a negative control . HeLa cells (8.5–10 × 105 cells per well in a 24-well plate) were transfected with NASP and C2 siRNA utilizing a two-hit siRNA transfection method with Lipofectamine™2000 for 18 hr as described . Twenty four hours after the first transfection cells were trypsinized and split into 6-well plates. Forty eight hours after the first transfection cells were re-transfected. Ninety hours after the initial transfection cells were harvested for RNA purification.
RNA isolation and hybridization of RNA to oligonucleotide arrays
Total cellular RNA was purified from HeLa cells using RNeasy® Mini Kit (Qiagen, Valencia, California, USA) according to the manufacturers' instructions. RNA samples representing four separate experiments from cells overexpressing tNASP and four experiments from cells treated with NASP siRNA, along with appropriate controls, were submitted for analysis. After the RNA Quality check was performed the double-stranded cDNA was synthesized from RNA via MMLV reverse transcriptase. Amplified labeled cRNA was created via T7 RNA polymerase, which simultaneously amplifies the target material and incorporates Cy3- or Cy5-labeled CTP with at least a 100-fold RNA amplification rate. cRNA from treated cells was amplified with incorporation of Cy5-CTP (fluorescent in the red region), while cRNA from control samples was labeled by Cy3-CTP (fluorescent in the green region) and purified. cDNA synthesis, cRNA synthesis, amplification and labeling were done using the Low RNA Input Linear Amplification Kit (Agilent Technologies, Wilmington, Delaware, USA). The labeled cRNA samples were then fragmented in fragmentation buffer at 60°C for 30 min before the microarray hybridization. Each sample was hybridized to a whole separate Human Genome (4 × 44K) microarray (Agilent Technologies, Wilmington, Delaware, USA) overnight at 65°C in a hybridization oven. The hybridization slides were washed, stabilized, dried, and immediately scanned by Agilent Technologies Microarray Scanner (Agilent Technologies, Wilmington, Delaware, USA). RNA hybridization was performed in the Genomics and Bioinformatics Core Facility (Lineberger Comprehensive Cancer Center, UNC-CH) according to the protocol suggested by Agilent (Agilent Technologies, Wilmington, Delaware, USA).
During the initial analysis at UNC Microarray Database (Genomics and Bioinformatics Core Facility, Lineberger Comprehensive Cancer Center, UNC-CH), all genes were retrieved, appropriately annotated, and filtered. Eventually, only genes with an absolute value of a Log2 Red/Green Lowess Normalized Ratio of at least 1 (doubled in intensity) for all 4 arrays were selected. The complete processed and raw data were deposited in Gene Expression Omnibus (GEO) and can be found as GSE14972 .
The UNC Microarray Database analysis generated a list of genes with an altered expression (at least 2 fold increased/decreased) between overexpression/depletion and mock treated samples. To identify which of these genes were significantly differentially expressed (significant genes) we used a statistical technique called SAM (Significance Analysis of Microarrays; ). SAM assigns a score to each gene on the basis of a change in gene expression relative to the standard deviation of repeated measurements. For genes with scores greater than an adjustable threshold, SAM uses permutations of the repeated measurements to estimate the percentage of genes identified by chance – the false discovery rate (FDR). Analysis parameters (Delta) were set to result in zero FDR.
To provide a rapid biological interpretation (from "genes to themes") of the obtained data, significant genes from SAM analysis were analyzed by the Expression Analysis Systematic Explorer (EASE). EASE calculates over representation with respect to the total number of genes assayed and annotated within each system to allow comparisons of categories from categorization systems . We used three main categories: biological processes, cellular components, and molecular function. For each classification within the category the Fisher exact probability of over representation was calculated. Presented in this study the EASE score serves as a p-value to the Fisher exact probability that weights significance in favor of themes supported by more genes .
Functional interpretation of significant genes in the context of gene ontology, molecular networks and relevance to canonical pathways was generated through the use of Ingenuity Pathways Analysis (IPA 6.5 software, Ingenuity Systems® ). Gene ontology analysis was based on an approach similar to EASE analysis, but the main categories used were in connection to top biological functions: diseases and disorders, molecular and cellular functions, and physiological system development and function. The significant genes were categorized, compared to genetic categories in the IPA database, and ranked according to p-values. P-values less than 0.05 indicate a statistically significant, non-random association between a set of significant genes and a set of all genes related to a given function in Ingenuity's knowledge base . The IPA analysis determined the subcategories within each category supplied with an appropriate p-value and the number of genes identified.
The set of significant genes was used to find possible connections between genes/gene products and other genes based on interactions previously reported in the literature. Intermolecular connections were presented as molecular networks. Interacting genes (not found in SAM analysis but present in the IPA knowledge base) were used by IPA software to connect smaller groups of significant genes into a larger network. Since the size of the created network could potentially be enormous, the IPA software limited the number of molecules in the network to 35, leaving only the most important ones based on the number of connections for each focus gene (focus genes = a subset of uploaded significant genes having direct interactions with other genes in the database) to other significant genes . Focus molecules and interacting molecules are presented separately. Networks are scored based on the number of focus molecules in the network, its size, the total number of focus molecules analyzed, and the total number of molecules in the knowledge database that could potentially be included in the networks . The network score is the negative log of Fisher's Exact Test p-value. Only networks with a score of at least 10 (p-value of 10-10) were analyzed.
The final IPA analysis compared the list of significant genes with established pathways associated with metabolism and signaling (canonical pathway analysis). The results are presented in a diagram based on scoring and the ratio of significant genes present in the canonical pathway to the total number of molecules in the canonical pathway. The threshold level was set at p = 0.05. These analyses determined which pathways were involved under our experimental conditions.
Results and discussion
SAM analysis resulted in identification of groups of genes that were significantly differentially expressed after overexpression or depletion of NASP in HeLa cells at least two fold in all four experiments. Overexpression of tNASP in HeLa cells significantly affects the expression level of 54 genes (0.14% of 39,064 genes included in the microarray): 47 (0.12%) were up-regulated and 7 (0.02%) were down-regulated. Figure 1A shows a scatter plot summary of up-regulated and down-regulated genes from cells overexpressing NASP. Inhibition of NASP expression by siRNA treatment significantly affected the expression of 127 genes (0.32%): 56 (0.14%) were up-regulated and 71(0.18%) were down-regulated. Figure 1B shows a scatter plot summary of up-regulated and down-regulated genes from cells treated with NASP siRNA. The different ratios between up-regulated and down-regulated genes demonstrate that there was no bias between overexpression, siRNA treatment, and control groups due to dye affinity misbalance. The list of significant genes is presented in table 1.
All genes reported in table 1 were subjected to Gene Ontology (GO) clustering by EASE software . EASE analysis is presented in table 2 and table 3. Themes with a p value of < 0.05 and with at least three genes in the category are reported .
In HeLa cells overexpressing tNASP, EASE identified 39 up-regulated and 5 down-regulated GO categories (table 2). Fourteen percent of up-regulated genes are located on chromosome 4. In the group of up-regulated genes the highest EASE score as well as the highest number of genes was found within the categories represented by proteins participating in the immune response, the response to biotic and external stimuli, stress, and to pathogens. These results may indicate a rather non-specific reaction of HeLa cells to an excessive amount of expressed recombinant NASP. Down-regulated genes in NASP overexpressing cells represented proteins involved in transcription regulator activity, transcription factor complex structure and activity, and nucleoplasm structure.
Gene ontology categories affected in cells treated by siRNA are presented in table 3. Most of the up-regulated genes in this category are located on chromosome 1(17%), the same chromosome where the NASP gene is located. In cells with inhibited NASP expression, up-regulated categories (n = 13) represented gene products that were involved in morphogenesis and organogenesis, chromosome organization (chromosome, nucleic acid binding, chromatin, DNA packaging, DNA binding), development, and transporter activity. Down-regulated genes in NASP deficient cells represented a group of proteins involved in DNA packaging, nucleosome structure and assembly, establishment and maintenance of chromatin architecture, DNA metabolism, chromosome organization and biogenesis, chromatin assembly/disassembly, and chromatin/chromosome structure (a total of 16 categories). This result is consistent with earlier observations that histone H1 transfer between NASP and DNA affects chromatin structure .
Although EASE analysis identified general categories, it did not provide detailed subcategories or connect identified genes to discrete disorders and functions. Therefore additional functional GO analysis was carried out employing the IAP 6.5 software.
Top bio functions
Genes with altered expression in both NASP overexpression and inhibition experiments were analyzed by IPA software. The analysis of the top eight affected functional categories (highest significance) is presented in figure 2. Each group of significant genes (A: up-regulated after overexpression; B: down-regulated after overexpression; C: up-regulated after siRNA treatment; D: down-regulated after siRNA treatment) was over represented in the set of genes related to some function or disease category. The top affected subcategories are presented in Additional file 1, Additional file 2, Additional file 3 and Additional file 4.
The highest scoring category in NASP overexpressing cells was "organismal injury", subcategory "fibrosis" (p-value 6.35 × 10-14, represented by 13 genes: ATF3, IFI6, IFI27, IFITM1, IL6, IL11, IRF7, IRF9, NT5E, OAS1, PTGS2, STAT1, TNF). A high number of up-regulated genes were functionally related to the "cell growth" subcategory, part of the category "cellular proliferation" (22 molecules out of 47 analyzed, including transcription factors ATF3, DDIT3, FOSL1, RELB, STAT1, cytokines CXCL1, CXCL2, CXCL3, TNF, IL6, IL8, IL11, IL1A, and growth factors EREG, GDF15). The "apoptosis of eukaryotic cell" subcategory of the "cell death" category was represented by an overlapping with the "cell growth" set of genes (total= 24 genes). The most over represented subcategories in the "cell cycle category" were "cell division process of cells "and "arrest in cell division process of cells" with 18 and 12 molecules respectively. The category "cancer" had the highest number of genes (30 out of 47 analyzed). The leading number of molecules (16) was in the subcategory "developmental process of tumor cell lines" and "apoptosis of tumor cell lines". "Immune response", "immunological disease", and "viral functions" categories were selected as a result of the increased expression of a group of cytokines: CXCL1, CXCL3, IL6, IL8, IL11, and TNF.
Down-regulated genes in NASP overexpressing cells were over represented in a group of interrelated functions (figure 2B) mostly as a result of down-regulation of 4 significant genes: transcription regulators EGR1, FOS, LMO1, and NFE2. Their changed expression affected subcategories related to differentiation, maturation, and development within such categories as cellular development (subcategory "developmental process of blood cells", p-value 1.25 × 10-4), hematological system (subcategory "differentiation of blood cells", p-value 2.67 × 10-5), cell morphology (subcategory "morphology of tumor cells", p-value 3 × 10-5), immune system (subcategory "development of macrophages", p-value 6.69 × 10-5), gene expression (subcategory "transactivation", p-value 2.1 × 10-3). All these categories are interrelated and the different level of significance is a reflection of the total number of genes in each category. The subcategory "arrest in G0/G1 phase of eukaryotic cells" from category "cancer" had the highest significance (p-value of 1.82 × 10-4) within a given category.
Although up-regulated genes after siRNA treatment identified a list of discrete and diverse functional categories (figure 2C), the number of molecules presented in each subcategory was low (1–4). Therefore these categories and subcategories were assigned a low significance. In the "reproductive system disease" category the presence of transcription regulator HMGB2 determined the subcategories connected to the degeneration of Sertoli cells (p-value 2.93 × 10-3), testicular cells (p-value 2.93 × 10-3), and germ cells (p-value 2.03 × 10-2). Follistatin (FST) and CGA (alpha subunit of glycoprotein hormones) identified the categories related to degeneration of seminiferous tubules (p-value 4.98 × 10-5), disease process of testicular cells (p-value 1.38 × 10-3), infertility (p-value 6.67 × 10-3), and several organ morphology subcategories (size of the organ, CALB1, CGA, FST, KIF5C, p-value 7.98 × 10-4). Transmembrane receptor TNFRSF11B determined the selection of multiple subcategories associated with bone mineral density (p-value 1.42 × 10-4) within the "connective tissue function" category as well as myogenesis (p-value 2.89 × 10-2) within the "muscle development" category. Subcategories related to vascular development were determined by combined up-regulation of G-protein coupled receptor F2RL1 and growth factor receptor PDGFRB. In the cancer category cytochrome P450 (CYP1B1) and PDGFRB identified the endometrial cancer (p-value 1.37 × 10-2) and endometrial carcinoma (p-value 1.68 × 10-2) subcategories. Testis derived transcript (TES) was present in subcategories tumorigenesis, breast cancer, and neoplasia. All 5 genes (CYP1B1, FST, HMGB2, PDGFRB, and TNFRSF11B) were present in the "colorectal cancer" subcategory.
After siRNA treatment down-regulated transmembrane receptors ITGB3, ITGA5, and TLR3 determined the functional categories "cell signaling", "cellular function", "immune response", and "cancer" with subcategories related to phagocytosis, adhesion, and binding. Down-regulated transcription factor SRF (serum response factor) and FGF2 (fibroblast growth factor 2) are reported in categories "cellular assembly", "cell death" and "muscle development". Seventeen molecules are identified in the subcategory "tumorigenesis" and "neoplasia" within the "cancer" category. Along with transcription factor HOXA10 and transmembrane receptors TLR3 and ITGB3, other molecules ACTA2, CFI, CNN1, CPA4, DHTKD1, FABP5, FGF2, HIST2H2AA3, ITGA5, LIPH, MAP2K1, TNC, TUBB3, ZFYVE26 were present in these subcategories.
Network analysis assembled three networks from up-regulated genes after NASP overexpression (table 4A). The network with the highest score (table 4A, network #1) included gene products associated with TNF, which is a multifunctional proinflammatory cytokine that belongs to the tumor necrosis factor (TNF) superfamily. This cytokine is involved in the regulation of a wide spectrum of biological processes including cell proliferation, and differentiation, and has been implicated in cancer and autoimmune diseases . In response to TNF and growth factors, STAT1 protein (found up-regulated in NASP overexpressing cells), is phosphorylated and translocates to the nucleus where it acts as a transcription activator , mediating the expression of a variety of genes. In this network ATF3 (activating transcription factor) is connected with TNF, which is known to increase the expression of human ATF3 mRNA in LoVo cells .
The other two networks (table 4A, network #2 and #3) contained groups of interacting proteins that included GADD45A, interleukins IL6, IL1A, chemokines CXCL3, CXCL1, and NF-κB. NF-κB is a transcription factor that mediates the transcription of proteins involved in cell survival, proliferation, and inflammatory responses, and is the subject of active research for anti-cancer therapy . The protein encoded by the GADD45A gene induces apoptosis and cell cycle arrest by maintaining p38 and c-JNK MAPK activation in keratinocytes. The absence of Gadd45a results in loss of sustained p38/JNK MAPK activity that leads to inadequate p53 activation and loss of normal activation of G1 and G2 checkpoints .
Only one network was assembled from the down-regulated genes after NASP overexpression (table 4B). It included TNF, FOS, LMO1, EGR1, NFE2, KRT19, and AQP5. TNF, FOS, and NFκB are the "hubs" that suggests possible involvement of this network in cell proliferation, differentiation, and transformation. The presence of transcriptional regulator EGR suggests cancer suppressor activity .
Distinct networks were assembled with up-regulated genes after NASP siRNA treatment (table 5). The highest scoring network (table 5, network #1) presents E2F8 interacting with TP53. E2F8, which is in the family of E2F transcription factors, is essential for orchestrating expression of genes required for cell cycle progression and proliferation . In the assembled network E2F8 interacts with tumor protein p53 , which wasn't found as a focus gene in this study, but was added during network assemblage as an interacting gene. Protein p53 regulates target genes that induce cell cycle arrest, apoptosis, senescence, DNA repair and is postulated to function as a tumor suppressor . Multiple focus genes TMSL8, OGG1, HMGB2, GSP, ARL5A, which were reported to be acted upon by p53 [34–39], were found to be up-regulated in this study. One of them, NAP1 (nucleosome assembly protein1) plays a role in chromatin maintenance by facilitating core histone exchange (by regulating the concentration of free histones) as well as nucleosome assembly and disassembly .
Network #2 (table 5) of genes up-regulated after siRNA treatment was assembled with the "hub" gene products FSH (follicle stimulating hormone), FST (follistatin), NFκB, hCG (human chorionic gonadotropin) and focus gene products TNFRSF11B, SYTL5, ZBED2, FSTL1, CALB1, GNG2, SAA1, F2RL1. Urbanek et al.  studied 37 candidate genes for linkage and association with polycystic ovary syndrome (PCOS) or hyperandrogenemia and found evidence for linkage between PCOS and follistatin. Follistatin has been reported as a modulator of gonadal tumor progression in inhibin deficient mice .
Network #3 (table 5) of up-regulated genes after siRNA treatment identified TGFB1 as a core molecule indirectly affiliated with interacting proteins. TGFB is a multifunctional peptide that controls proliferation, differentiation, inducing transformation, and other functions in many cell types .
After siRNA treatment 71 down-regulated genes were almost evenly assembled into four networks of interacting proteins (table 6). The network with the highest score (table 6, network #1) included: HOXA10, FGF2, SRF, ITGA5, and ITGB3. HOXA10 is a DNA-binding transcription factor that may regulate gene expression, morphogenesis, and differentiation. More specifically, it may function in fertility and embryo viability . FGF2 (fibroblast growth factor) and SRF (c-fos serum response element-binding transcription factor) participate in cell cycle regulation, apoptosis, cell growth and differentiation . Interaction of SRF with other proteins, such as steroid hormone receptors, may contribute to regulation of muscle growth .
Network #2 of down regulated genes (table 6) includes TNF as a core protein along with several interacting proteins. Overlap of this network with another assembled network (table 6, network #4) revealed TRAF6 (TNF receptor associated factor), which mediates signal transduction from members of the TNF receptor superfamily and interacts with various protein kinases including IRAK1/IRAK, SRC and PKCzeta . TRAF6 was identified by mass spectrometry in a large scale immunoprecipitation as an interacting protein with NASP protein, but the functional correlation between TRAF6 and NASP is unknown . TRAF6 mediates signals from TNF apoptotic pathways, as well as other pathways (IL1, NF-κB). It is not clear if NASP regulates TRAF6, or TRAF6 transfers a signal to NASP. The focus genes interacting with NASP (table 6, network #4; figure 3) include HIST1H1C  and HIST1H2BK , which implies that these histone gene products may be regulated in concert with NASP. None of the focus genes were found to be a hub in network #3 (table 6), therefore it was not investigated further.
Canonical pathway analysis
We found that each treatment resulted in activation/inhibition of specific signaling pathways. A list of the significant genes present in canonical pathways after different treatments is presented in Additional file 5, Additional file 6, Additional file 7 and Additional file 8.
The highest activation level after NASP overexpression (figure 4A) was detected in pathways related to antiviral responses, activation of IRF (interferon activation factor), recognition of bacteria and viruses, interferon signaling and other interrelated pathways. Some pathways were inhibited by NASP overexpression (figure 4B). The significance of all top 6 canonical pathways in this category barely exceeded the threshold level and all of them were selected as a result of down-regulation of only one gene, FOS, which is an important transcription regulator of cell proliferation, differentiation, and transformation . The inhibition of a single gene makes understanding its role in any of those pathways difficult.
A similar result was observed with analysis of activated signaling pathways after siRNA treatment (figure 4C). Only one pathway reached significance above the threshold level, the remaining five canonical pathways were below the threshold level. After siRNA treatment (figure 4D) several signaling pathways were inhibited due to down-regulation of growth factor FGF2, transmembrane receptors ITGB3 and ITGA5 and other molecules such as ATCA1 and ATCA2. The two highest scores had canonical pathways related to caveolar-mediated (p-value 1.85 × 10-4, 4 out of 81 molecules were associated with a given pathway) and clathrin-mediated (p-value 2.39 × 10-4, 5/165) endocytosis. Selection of these pathways was based on down-regulation of both α- and β- integrins (ITGA5 and ITGB3) which serve as transmembrane receptors. Activation of α- and β-integrins by appropriate ECM proteins causes the activation of ERK/MAPK-mediated transcription, which eventually leads to cell proliferation . Mitogen activated protein kinase MAP2K1 (synonym MEK1/2), an important link in both those pathways, was found to be down regulated and it makes NASP's effect on this pathway plausible. Down regulation of ACTA1 and ACTA2 (actin α-1 and actin α-2) along with MAP2K1, α-integrin, and FGF2 (fibroblast growth factor 2) indicates a high probability that there is an effect on the actin cytoskeleton signaling pathway (p-value 8.17 × 10-4, 5/222), which leads to actin reorganization and plays an important role in cell motility, cytokinesis and phagocytosis . Decreased expression of kinase MAP2K1 and transcription factor SRF implies an effect on EGF (epidermal growth factor) signaling pathways (p-value 1.25 × 10-2, 2/47), which regulate cell growth and differentiation .
This study has demonstrated that NASP, a linker histone chaperone, belongs to a network of genes and gene functions that are critical for cell survival. We have confirmed the previously reported interactions between NASP and HSP90, HSP70, histone H1, histone H3, and TRAF6. Indeed, based on the number of critical pathways affected by the overexpression or inhibition of NASP expression, NASP may play a much wider role in gene expression events that require the participation of histones. Significantly this study found that during overexpression the network with the highest score of up-regulated genes included gene products associated with TNF and during inhibition network #2 of down regulated genes also included TNF. TNF receptor associated factor, TRAF6, has already been identified as interacting with NASP. Therefore these two identified networks may explain how the expression of NASP is modulated during cell proliferation and differentiation.
Of interest to reproductive biology, this study found a significant association of NASP with the FSH and follistatin gene pathways that are up-regulated after NASP siRNA treatment. This may have implications for the control of NASP expression during granulosa cell and endometrial cell proliferation.
Most of the pathways found in this study have not been previously reported to be connected to expression levels of NASP and demonstrate a rather complicated picture of changed gene expression in HeLa cells after NASP expression was increased or decreased. Gene ontology and protein network analysis identified general biological processes as well as individual genes/gene products and possible interaction networks. Some of these processes may relate to HeLa-type cells in tissue culture in which cell adhesion and migration are critical, while others may only be relevant in neoplasia. We found signaling pathways which were affected as a result of changed NASP expression and despite some overlap each reactive response was associated with a unique gene signature. The results of this study have elucidated the changes that emerge from increased and decreased NASP expression and will help our understanding of the molecular mechanisms involved in NASP function. Confirmation of NASP's role in regulating the cell cycle may contribute to the development of new pharmaceutical approaches to control the relevant pathological conditions.
fluorescent-activated cell sorting
sodium dodecyl sulfate polyacrylamide gel electrophoresis
RNA derived from cDNA
short interfering RNA
Richardson RT, Batova IN, Widgren EE, Zheng L-X, Whitfield M, Marzluff WF, O'Rand MG: Characterization of the histone H1-binding protein, NASP, as a cell cycle-regulated somatic protein. J Biol Chem. 2000, 275 (39): 30378-30386. 10.1074/jbc.M003781200.
Perou CM, Sorlie T, Eisen MB, Rijn van de M, Jeffrey SS, Rees CA, Pollack JR, Ross DT, Johnsen H, Akslen LA, Fluge O, Pergamenschikov A, Williams C, Zhu SX, Lonning PE, Borresen-Dale A-L, Brown PO, Botstein D: Molecular portraits of human breast tumours. Nature. 2000, 406 (6797): 747-752. 10.1038/35021093.
van't Veer LJ, Dai H, van de Vijver MJ, He YD, Hart AA, Mao M, Peterse HL, van der Kooy K, Marton MJ, Witteveen AT, Schreiber GJ, Kerkhoven RM, Roberts C, Linsley PS, Bernards R, Friend SH: Gene expression profiling predicts clinical outcome of breast cancer. Nature. 2002, 415 (6871): 530-536. 10.1038/415530a.
Martin KJ, Patrick DR, Bissell MJ, Fournier MV: Prognostic breast cancer signature identified from 3D culture model accurately predicts clinical outcome across independent datasets. PLoS ONE. 2008, 3 (8): e2994-10.1371/journal.pone.0002994.
Chatterjee M, Mohapatra S, Ionan A, Bawa G, Ali-Fehmi R, Wang X, Nowak J, Ye B, Nahhas FA, Lu K, Witkin SS, Fishman D, Munkarah A, Morris R, Levin NK, Shirley NN, Tromp G, Abrams J, Draghici S, Tainsky MA: Diagnostic markers of ovarian cancer by high-throughput antigen cloning and detection on arrays. Cancer Res. 2006, 66 (2): 1181-1190. 10.1158/0008-5472.CAN-04-2962.
Nakatani Y, Ray-Gallet D, Quivy J, Tagami H, Almouzni G: Two distinct nucleosome assembly pathways: dependent or independent of DNA synthesis promoted by histone H3.1 and H3.3 complexes. Cold Spring Harb Symp Quant Biol. 2004, 69: 273-280. 10.1101/sqb.2004.69.273.
Tagami H, Ray-Gallet D, Almouzni G, Nakatani Y: Histone H3.1 and H3.3 complexes mediate nucleosome assembly pathways dependent or independent of DNA synthesis. Cell. 2004, 116 (1): 51-61. 10.1016/S0092-8674(03)01064-X.
Matsuoka S, Ballif BA, Smogorzewska A, McDonald ER, Hurov KE, Luo J, Bakalarski CE, Zhao Z, Solimini N, Lerenthal Y, Shiloh Y, Gygi SP, Elledge SJ: ATM and ATR substrate analysis reveals extensive protein networks responsive to DNA damage. Science. 2007, 316 (5828): 1160-1166. 10.1126/science.1140321.
Alekseev OM, Richardson RT, Pope MR, O'Rand MG: Mass spectrometry identification of NASP binding partners in HeLa cells. Proteins. 2005, 61 (1): 1-5. 10.1002/prot.20600.
Alekseev OM, Widgren EE, Richardson RT, O'Rand MG: Association of NASP with HSP90 in mouse spermatogenic cells: stimulation of ATPase activity and transport of linker histones into nuclei. J Biol Chem. 2005, 280 (4): 2904-2911. 10.1074/jbc.M410397200.
Finn RM, Browne K, Hodgson KC, Ausio J: sNASP, a histone H1-specific eukaryotic chaperone dimer that facilitates chromatin assembly. Biophys J. 2008, 95 (3): 1314-1325. 10.1529/biophysj.108.130021.
Sun Y, Li H, Liu Y, Mattson MP, Rao MS, Zhan M: Evolutionarily conserved transcriptional co-expression guiding embryonic stem cell differentiation. PLoS ONE. 2008, 3 (10): e3406-10.1371/journal.pone.0003406.
Alekseev OM, Bencic DC, Richardson RT, Widgren EE, O'Rand MG: Overexpression of the linker histone-binding protein tNASP affects progression through the cell cycle. J Biol Chem. 2003, 278 (10): 8846-8852. 10.1074/jbc.M210352200.
Richardson RT, Alekseev OM, Grossman G, Widgren EE, Thresher R, Wagner EJ, Sullivan KD, Marzluff WF, O'Rand MG: Nuclear Autoantigenic Sperm Protein (NASP), a linker histone chaperone that is required for cell proliferation. J Biol Chem. 2006, 281 (30): 21526-21534. 10.1074/jbc.M603816200.
Chu Y-W, Wang R, Schmid I, Sakamoto KM: Analysis with flow cytometry of green fluorescent protein expression in leukemic cells. Cytometry. 1999, 36 (4): 333-339. 10.1002/(SICI)1097-0320(19990801)36:4<333::AID-CYTO8>3.0.CO;2-F.
Schaniel C, Li F, Schafer XL, Moore T, Lemischka IR, Paddison PJ: Delivery of short hairpin RNAs – triggers of gene silencing – into mouse embryonic stem cells. Nat Meth. 2006, 3 (5): 397-400. 10.1038/nmeth0506-397.
Wagner EJ, Garcia-Blanco MA: RNAi-mediated PTB depletion leads to enhanced exon definition. Molecular Cell. 2002, 10 (4): 943-949. 10.1016/S1097-2765(02)00645-7.
Gene Expression Omnibus. [http://www.ncbi.nlm.nih.gov/geo/]
Tusher V, Tibshirani R, Chu G: Significance analysis of microarrays applied to the ionizing radiation response. Proc Natl Acad Sci USA. 2001, 98 (9): 5116-5121. 10.1073/pnas.091062498.
Hosack D, Dennis G, Sherman B, Lane H, Lempicki R: Identifying biological themes within lists of genes with EASE. Genome Biology. 2003, 4 (10): R70-10.1186/gb-2003-4-10-r70.
Ingenuity Systems. [http://www.ingenuity.com]
Mori R, Xiong S, Wang Q, Tarabolous C, Shimada H, Panteris E, Danenberg KD, Danenberg PV, Pinski JK: Gene profiling and pathway analysis of neuroendocrine transdifferentiated prostate cancer cells. The Prostate. 2009, 69 (1): 12-23. 10.1002/pros.20851.
Calvano SE, Xiao W, Richards DR, Felciano RM, Baker HV, Cho RJ, Chen RO, Brownstein BH, Cobb JP, Tschoeke SK, Miller-Graziano C, Moldawer LL, Mindrinos MN, Davis RW, Tompkins RG, Lowry SF: A network-based analysis of systemic inflammation in humans. Nature. 2005, 437 (7061): 1032-1037. 10.1038/nature03985.
Petrusz P, Jeyaraj D, Grossman G: Microarray analysis of androgen-regulated gene expression in testis: the use of the androgen-binding protein (ABP)-transgenic mouse as a model. Reprod Biol Endocrinol. 2005, 3 (1): 70-10.1186/1477-7827-3-70.
Locksley RM, Killeen N, Lenardo MJ: The TNF and TNF receptor superfamilies: integrating mammalian biology. Cell. 2001, 104 (4): 487-501. 10.1016/S0092-8674(01)00237-9.
Yang J, Stark GR: Roles of unphosphorylated STATs in signaling. Cell Res. 2008, 18 (4): 443-451. 10.1038/cr.2008.41.
Masumoto J, Yang K, Varambally S, Hasegawa M, Tomlins SA, Qiu S, Fujimoto Y, Kawasaki A, Foster SJ, Horie Y, Mak TW, Nunez G, Chinnaiyan AM, Fukase K, Inohara N: Nod1 acts as an intracellular receptor to stimulate chemokine production and neutrophil recruitment in vivo. J Exp Med. 2006, 203 (1): 203-213. 10.1084/jem.20051229.
Escárcega R, Fuentes-Alexandro S, García-Carrasco M, Gatica A, Zamora A: The transcription factor nuclear factor-kappa B and cancer. Clin Oncol (R Coll Radiol). 2007, 19 (2): 154-161.
Hildesheim J, Bulavin DV, Anver MR, Alvord WG, Hollander MC, Vardanian L, Fornace AJ: Gadd45a protects against UV irradiation-induced skin tumors, and promotes apoptosis and stress signaling via MAPK and p53. Cancer Res. 2002, 62 (24): 7305-7315.
Liu J, Liu Y, Huang G, Yao C, Li S, Yang W, Yang D, Huang R: Concurrent down-regulation of Egr-1 and gelsolin in the majority of human breast cancer cells. Cancer Genomics Proteomics. 2007, 4 (6): 377-385.
Christensen J, Cloos P, Toftegaard U, Klinkenberg D, Bracken AP, Trinh E, Heeran M, Di Stefano L, Helin K: Characterization of E2F8, a novel E2F-like cell-cycle regulated repressor of E2F-activated transcription. Nucl Acids Res. 2005, 33 (17): 5458-5470. 10.1093/nar/gki855.
Hammond EM, Mandell DJ, Salim A, Krieg AJ, Johnson TM, Shirazi HA, Attardi LD, Giaccia AJ: Genome-wide analysis of p53 under hypoxic conditions. Mol Cell Biol. 2006, 26 (9): 3492-3504. 10.1128/MCB.26.9.3492-3504.2006.
Bénard J, Douc-Rasy S, Ahomadegbe J-C: TP53 family members and human cancers. Human Mutation. 2003, 21 (3): 182-191. 10.1002/humu.10172.
Daoud SS, Munson PJ, Reinhold W, Young L, Prabhu VV, Yu Q, LaRose J, Kohn KW, Weinstein JN, Pommier Y: Impact of p53 knockout and topotecan treatment on gene expression profiles in human colon carcinoma cells: a pharmacogenomic study. Cancer Res. 2003, 63 (11): 2782-2793.
Achanta G, Huang P: Role of p53 in sensing oxidative DNA damage in response to reactive oxygen species-generating agents. Cancer Res. 2004, 64 (17): 6233-6239. 10.1158/0008-5472.CAN-04-0494.
Wu Q, Kirschmeier P, Hockenberry T, Yang T-Y, Brassard DL, Wang L, McClanahan T, Black S, Rizzi G, Musco ML, Mirza A, Liu S: Transcriptional regulation during p21WAF1/CIP1-induced apoptosis in human ovarian cancer cells. J Biol Chem. 2002, 277 (39): 36329-36337. 10.1074/jbc.M204962200.
Mann K, Hainaut P: Aminothiol WR1065 induces differential gene expression in the presence of wild-type p53. Oncogene. 2005, 24 (24): 3964-3975. 10.1038/sj.onc.1208563.
Quadrini KJ, Bieker JJ: Kruppel-like zinc fingers bind to nuclear import proteins and are required for efficient nuclear localization of erythroid Kruppel-like factor. J Biol Chem. 2002, 277 (35): 32243-32252. 10.1074/jbc.M205677200.
Lin CY, Li CC, Huang PH, Lee FJ: A developmentally regulated ARF-like 5 protein (ARL5), localized to nuclei and nucleoli, interacts with heterochromatin protein 1. J Cell Sci. 2002, 115 (23): 4433-4445. 10.1242/jcs.00123.
Andrews AJ, Downing G, Brown K, Park YJ, Luger K: A thermodynamic model for Nap1-histone interactions. J Biol Chem. 2008, 283 (47): 32412-32418. 10.1074/jbc.M805918200.
Urbanek M, Legro RS, Driscoll DA, Azziz R, Ehrmann DA, Norman RJ, Strauss JF, Spielman RS, Dunaif A: Thirty-seven candidate genes for polycystic ovary syndrome: strongest evidence for linkage is with follistatin. Proc Natl Acad Sci USA. 1999, 96 (15): 8573-8578. 10.1073/pnas.96.15.8573.
Cipriano SC, Chen L, Kumar TR, Matzuk MM: Follistatin is a modulator of gonadal tumor progression and the activin-induced wasting syndrome in inhibin-deficient mice. Endocrinology. 2000, 141 (7): 2319-2327. 10.1210/en.141.7.2319.
Herpin A, Lelong C, Favrel P: Transforming growth factor-beta-related proteins: an ancestral and widespread superfamily of cytokines in metazoans. Dev Comp Immunol. 2004, 28 (5): 461-485. 10.1016/j.dci.2003.09.007.
Satokata I, Benson G, Maas R: Sexually dimorphic sterility phenotypes in HoxalO-deficient mice. Nature. 1995, 374 (6521): 460-463. 10.1038/374460a0.
Liu Y, Song Z, Zhao Y, Qin H, Cai J, Zhang H, Yu T, Jiang S, Wang G, Ding M, Deng H: A novel chemical-defined medium with bFGF and N2B27 supplements supports undifferentiated growth in human embryonic stem cells. Biochem Biophys Res Commun. 2006, 346 (1): 131-139. 10.1016/j.bbrc.2006.05.086.
Vlahopoulos S, Zimmer WE, Jenster G, Belaguli NS, Balk SP, Brinkmann AO, Lanz RB, Zoumpourlis VC, Schwartz RJ: Recruitment of the androgen receptor via serum response factor facilitates expression of a myogenic gene. J Biol Chem. 2005, 280 (9): 7786-7792. 10.1074/jbc.M413992200.
Inoue J, Ishida T, Tsukamoto N, Kobayashi N, Naito A, Azuma S, Yamamoto T: Tumor Necrosis Factor Receptor-Associated Factor (TRAF) family: adapter proteins that mediate cytokine signaling. Exp Cell Res. 2000, 254 (1): 14-24. 10.1006/excr.1999.4733.
Ewing RM, Chu P, Elisma F, Li H, Taylor P, Climie S, McBroom-Cerajewski L, Robinson MD, O'Connor L, Li M, Taylor R, Dharsee M, Ho Y, Heilbut A, Moore L, Zhang S, Ornatsky O, Bukhman YV, Ethier M, Sheng Y, Vasilescu J, Abu-Farha M, Lambert J-P, Duewel HS, Stewart II, Kuehl B, Hogue K, Colwill K, Gladwish K, Muskat B, Kinach R, Adams S-L, Moran MF, Morin GB, Topaloglou T, Figeys D: Large-scale mapping of human protein-protein interactions by mass spectrometry. Mol Syst Biol. 2007, 3: 89-10.1038/msb4100134.
Citterio E, Papait R, Nicassio F, Vecchi M, Gomiero P, Mantovani R, Di Fiore PP, Bonapace IM: Np95 is a histone-binding protein endowed with ubiquitin ligase activity. Mol Cell Biol. 2004, 24 (6): 2526-2535. 10.1128/MCB.24.6.2526-2535.2004.
Milde-Langosch K: The Fos family of transcription factors and their role in tumourigenesis. Eur J Cancer. 2005, 41 (16): 2449-2461. 10.1016/j.ejca.2005.08.008.
Hynes RO: Integrins: bidirectional, allosteric signaling machines. Cell. 2002, 110 (6): 673-687. 10.1016/S0092-8674(02)00971-6.
Kallergi G, Tsapara A, Kampa M, Papakonstanti EA, Krasagakis K, Castanas E, Stournaras C: Distinct signaling pathways regulate differential opioid effects on actin cytoskeleton in malignant MCF7 and nonmalignant MCF12A human breast epithelial cells. Exp Cell Res. 2003, 288 (1): 94-109. 10.1016/S0014-4827(03)00210-6.
Fallon J, Seroogy K, Loughlin S, Morrison R, Bradshaw R, Knaver D, Cunningham D: Epidermal growth factor immunoreactive material in the central nervous system: location and development. Science. 1984, 224 (4653): 1107-1109. 10.1126/science.6144184.
This research was supported in part by the Eunice Kennedy Shriver NICHD/NIH through cooperative agreement [U54 (HD35041)] as part of the Specialized Cooperative Centers Program in Reproduction and Infertility Research and in part by the UNC faculty development grant. We thank Professor Peter Petrusz for critical discussions and reading the manuscript. We acknowledge the assistance of Cheng Fan and Jun-Yuan Wu from UNC Lineberger Comprehensive Cancer Center Genomics and Bioinformatics Core Facility for help with statistical analysis, Thomas Randall from UNC Center for Bioinformatics for help with IAP 6.5 analysis, and Paul Amieux and Kimberly Smith from SCCPRR Gene Array Facility, University of Washington for help with EASE analysis.
The authors declare that they have no competing interests.
OMA conceived of the study, carried out the microarray statistical analysis and drafted the manuscript. RR assisted with experimental design and conceived of the study. OA performed the cell culture experiments: overexpression and siRNA treatment. MO'R conceived of the study, participated in project design and coordination, helped to draft the manuscript. All authors read and approved the final manuscript.