CAPÌTULO SEXTO Revisoría fiscal
19. La Resolución 3575 del 24 de diciembre de 1996 del DANCOOP queda derogada con este capítulo
Our previous studies regarding highly stable cytosolic soluble proteins (16) originating from thermostable archaeal organism from the order Sulfolobales have gave an insight into mechanisms and determinants of protein thermostability, but also some additional topics for discussion from ecological and evolutionary points of view. We have discussed that essential life preserving processes in the living cell are likelier to involve more stable protein sequences in order to protect the life itself and that exhaustive thermal treatment of soluble cytosolic proteome has been a good method of isolating highly stable proteome subset.
These results were in agreement with data from literature stating that per se presence or absence of certain type of amino acid residues or group is not a sufficient prerequisite for difference in stability, bur rather the position and subsequent interaction of the residues with the surrounding ones can lead towards alteration in stability properties. SCOP folds are also not very strongly biased in respect to the thermostability. General observation regarding these results is that none of the so‐called predictive thermostability measures in fact have strong predictive capabilities and the most obvious explanation is that their combination may or may not lead towards enhanced thermal stability. As a continuance of this investigation, further study provided more results and developed conclusions based on isolation and identification of highly
thermostable proteins originating from mesophilic bacterial organism E. coli as the model system, using the same initial methodology, exhaustive thermal treatment of soluble cytosolic proteome in order to isolate highly stable proteome subset.
Sequences that were identified as highly stable after thermal treatment (90oC) of E. coli cytosolic proteome, were cross‐compared with the results published by the group of Taguchi and co‐workers regarding bimodal solubility/aggregation distribution of E. coli proteome (18) of the proteins expressed in cell‐free, chaperone‐free system named PURE. The intersection of those two groups of sequences ‐ superstable cytosolic proteins that survived harsh thermal treatment remaining soluble, and chaperone‐free‐expressed soluble proteins, was investigated in order to check for common determinants between these two important properties: thermostability and solubility. Interesting conclusions emerged, guiding towards the opinion that cellular guidelines that favor protein thermostability are in common ones that favor solubility, or lack of propensity to aggregate, of important life preserving sequences.
5.3.
39BObjectives and Methodologies
There are several published studies regarding correlation of protein solubility and other sequence‐derived properties. Wilkinson and Harrison (19) proposed a method that was latter improved for calculating protein solubility from the known sequence, that is based on parameters: average charge, determined by the relative numbers of Asp, Glu, Lys and Arg residues, and the content of turn‐forming residues (Asn, Gly, Pro and Ser). It was latter demonstrated that insoluble proteins tended to have more hydrophobic stretches (longer then 20 amino acids), lower glutamine content (Q<4%), fewer negatively charged residues (DE<17%) and higher percentage of aromatic amino acids (FYW>7.5%) than soluble ones (20), allowing prediction of protein solubility with 65% accuracy. Also, high content of negative residues (DE>18%) and absence of hydrophobic patches are associated with improved solubility and also low percentage of aspartic acid, glutamic acid, asparagines and glutamine residues (DENQ<16%) increases the probability of a protein to be insoluble. Analysis of more than 27 000 proteins from multiple organisms
found that protein solubility is influenced by (in decreasing order of importance): percentage of serine (S <6.4%), fraction of negatively charged residues (DE <10.8%), percentage of S, C, T and M amino acids, and length (<516 amino acids), in decreasing order of importance (21).
Fig. 5.1. Schematic illustration of the PURE expression system (18). Each ORF in the ASKA library, which has all of the E. coli ORFs, was amplified by PCR using two common primers to translate the gene in the cell‐free translation system. The reconstituted cell‐free translation system (the PURE system) contains no chaperones. After the 60‐min translation, an aliquot of the translation mixture was centrifuged to obtain the soluble fraction. The uncentrifuged (Total) and supernatant (Sup) fractions were subjected to SDS/PAGE, and the translated products were quantified by autoradiography (18).
In the continuance with the results presented in the previous chapter of this thesis, this study further developed results and conclusions based on isolation and identification of highly thermostable proteins originating from mesophilic bacterial organism E. coli as the model system. Process of analysis started with exhaustive thermal treatment of soluble cytosolic proteome in order to isolate highly stable proteome subset. Soluble cytosolic extract was exposed to high temperature up to 90o C during prolonged period of time, less stable denatured proteins were removed by bench centrifugation, while proteins that remained soluble after this treatment were subjected to further analysis. iTRAQ protein sequences identification and quantification of this “thermoproteome” and bioinformatics analysis gave an insight into possible relationship between
thermostability on one side and various other important protein properties. Our results were subsequently compared to the results recently published in literature (18, 22, 23), revealing accordance with the most of the information present in the literature up to date in following manner: Additional point of discussion was comparing our data regarding the E. coli cytosolic proteome subset with elevated thermostability, with the information present in the literature regarding the E. coli cytosolic proteome subsets with highly pronounced solubility properties. Taguchi and co‐workers (18) have previously published interesting set of results that can relate with our findings in a mutually supporting manner. They used a model named PURE, a reconstituted system (24) containing only the essential E. coli factors responsible for protein synthesis (Fig 5.1, (18)). Complete ASCA library ‐ consisting of all ORFs ‐ was translated, but in the presence only of the essential E. coli factors responsible for protein synthesis and excluding the presence of the chaperones. They performed a comprehensive analysis, successfully quantified more than 3000 protein sequences of the initial 4000 ORFs. The fact that this cell‐free translation system contains no chaperones enabled to investigate and evaluate inherent aggregation propensities (14, 17). They examined propensity for protein aggregation by a centrifugation assay where an aliquot of the translation mixture was centrifuged and the solubility was determined as the proportion of the supernatant fraction obtained after the centrifugation of the translation mixture, to the uncentrifuged total protein. They have also invested a considerable effort in identifying residues stabilizing the native conformations of proteins. The aggregation propensities of proteins, which were evaluated under the chaperone‐free condition, showed that the proteins were categorized into two groups, soluble and aggregation‐prone. Another conclusion was that some of the SCOP folds are strongly biased to the aggregation propensity which is apparently paradoxical because aggregates formation should occur before the completion of folding. We have cross‐compared our results of the thermally (90oC) selected E. coli cytosolic proteome subset with the results published by Taguchi’s group regarding bimodal solubility/aggregation distribution of E. coli proteome(18) publicly
available at http://www.taguchi.bio.titech.ac.jp/eng/paper‐ e/assets/2009_PNAS_ecoli_proteins_solubility.xls .
Our subset of “superstable” cytosolic proteins that survived harsh thermal treatment and still remained in the solution, when compared to the published ones, obtained results available in this segment of this thesis.
Publically available PubMed, at the National Center for Biotechnology Information (NCBI) HUhttp://www.ncbi.nlm.nih.govUH was used to access the
bibliographic database. To check for easily detectable biases in our data we evaluated the classification performance of simple global sequence dependent protein features (isoelectric point, molecular weight, aliphatic index and GRAVY index) by retrieving them from the ProtParam tool of the ExPASy proteomics server of the Swiss Institute of Bioinformatics
HU
http://us.expasy.org/tools/protparam.htmlUH. Information regarding COG
functional categories was retrieved from
HU
http://www.ncbi.nlm.nih.gov/COG/old/xognitor.htmlU
5.4.
40BResults and discussion
In all of the presented results in this chapter, abbreviation “S” refers to the group of sequences that are result of overlapping data from our thermally enhanced E. coli sequences with results from Taguchi and co‐ workers (18). The intersection of those two groups of sequences: a) superstable cytosolic proteins that survived harsh thermal treatment remaining soluble, and b) chaperone‐free‐expressed soluble proteins, was investigated in order to check for common determinants between these two important properties: thermostability and solubility. Sequences “S” data are available in Appendix III.
68B