Description
Computes Residue Accessibility and SASA
Usage
acc.dssp(pdb, dssp = 'compute', aa = 'all')
Arguments
pdb
is either a PDB id, or the path to a pdb file.
dssp
string indicating the preferred method to obtain the dssp file. It must be either ‘compute’ or ‘mkdssp.
aa
one letter code for the amino acid of interest, or ‘all’ for all the protein residues.
Value
A dataframe where each row is an individual residue of the selected protein. The variables computed, among others, are: (i) the secondary structure (ss) element to which the residue belongs, (ii) the solvent accessible surface area (sasa) of each residue in square angstrom (Ų), and (iii) the accessibility (acc) computed as the percent of the sasa that the residue X would have in the tripeptide GXG with the polypeptide skeleton in an extended conformation and the side chain in the conformation most frequently observed in proteins. For the given PDB, the function obtains its corresponding DSSP file using the chosen method. The argument dssp allows two alternative methods: compute (by default, it calls to the function compute.dssp(), which in turn uses an API to run DSSP at the CMBI and
mkdssp (if you have installed DSSP on your system and in the search path for executables).
References
- Miller et al (1987) J. Mol. Biol. 196: 641-656.
- Touw et al (2015) Nucl. Ac. Res. 43(Database issue): D364-D368.
Details
Analysis of protein commonly requires the partition of their structure into regions such as the surface, interior, or interface. Whenever you are interested into analysing such a partition in a particular protein, the following function of the ptm package may be of help:
The function acc.dssp() can be of help to detect changes in accessibilities (ACC) when the analysis shifts its focus from the monomer to the complex. To illustrate this point, let’s use the methonine S-adenosyltransferase from Ureaplasma ureliticum, which is a homotetramer, as an example.
uMAT <- acc.dssp(pdb = '6rjs')
kable(head(uMAT))
resnum | respdb | chain | aa | ss | sasa_complex | sasa_chain | delta_sasa | acc_complex | acc_chain | delta_acc |
---|---|---|---|---|---|---|---|---|---|---|
1 | 5 | A | K | C | 124 | 170 | 46 | 0.588 | 0.806 | 0.218 |
2 | 6 | A | I | E | 23 | 85 | 62 | 0.126 | 0.467 | 0.341 |
3 | 7 | A | I | E | 25 | 67 | 42 | 0.137 | 0.368 | 0.231 |
4 | 8 | A | T | E | 0 | 58 | 58 | 0.000 | 0.397 | 0.397 |
5 | 9 | A | S | E | 0 | 4 | 4 | 0.000 | 0.033 | 0.033 |
6 | 10 | A | E | E | 23 | 72 | 49 | 0.126 | 0.393 | 0.267 |
If we define the protein-protein interface as the set of residues for which , we can provide some basic information about the protein-protein interface. For instance, the fraction of amino acid involved in this protein-protein interface can be computed as:
round(length(which(uMAT$delta_acc != 0))/nrow(uMAT), 3)
## [1] 0.235
Or we may know the surface (in Ų) involved in this interface:
sum(uMAT$delta_sasa)
## [1] 12153
And so on.