Description
Searchs for specific MetO sites filtering MetOSite according to the selected criteria.
Usage
meto.search(…)
Arguments
highthroughput.group
logical, when FALSE the sites described in a high-throughput study (unknown effect) are filtered out.
bodyguard.group
logical, when FALSE the sites postulated to function as ROS sink (because when oxidized no apparent effect can be detected) are filtered out.
regulatory.group
logical, when FALSE the sites whose oxidation affect the properties of the protein (and therefore may be involved in regulation) are filtered out.
gain.activity
introduce 1 or 0 to indicate whether the oxidation of the selected sites implies a gain of activity or not, respectively. If we do not wish to use this property to filter, introduce 2.
loss.activity
introduce 1 or 0 to indicate whether or not the oxidation of the selected sites implies a loss of activity or not, respectively. If we do not wish to use this property to filter, introduce 2.
gain.ppi
introduce 1 or 0 to indicate whether the oxidation of the selected sites implies a gain of protein-protein interaction or not, respectively. If we do not wish to use this property to filter, introduce 2.
loss.ppi
introduce 1 or 0 to indicate whether or not the oxidation of the selected sites implies a loss of protein-protein interaction or not, respectively. If we do not wish to use this property to filter, introduce 2.
change.stability
introduce 1 or 0 to indicate whether the oxidation of the selected sites leads to a change in the protein stability or not, respectively. If we do not wish to use this property to filter, introduce 2.
change.location
introduce 1 or 0 to indicate whether or not the oxidation of the selected sites implies a change of localization or not, respectively. If we do not wish to use this property to filter, introduce 2.
organism
a character string indicating the scientific name of the species of interest, or -1 if we do not wish to filter by species.
oxidant
a character string indicating the oxidant, or -1 if we do not wish to filter by oxidants.
Value
This function returns a dataframe with a line per MetO site.
References
Valverde et al. 2019. MetOSite: an integrated resource for the study of methionine residues sulfoxidation, Bioinformatics 35:4849-4850.
See Also
meto.scan()
, meto.list()
Details
We have search the literature to find those protein-bound methionine residues that have been detected, either in vitro or in vivo, as MetO. All this information has been collected and published in the form of a database: MetOSite.
Thus, MetOSite is a database that provides mannually curated data related to experimentally confirmed sulfoxidation sites and the effect of such PTM on the protein properties. Currently, it contains over 7300 sites found in more than 3500 different proteins from over 20 species.
The ptm packages offers three functions that help to explore and download data from MetOSite.
Thus, if you wish to know whether your protein of interest has been described to contain MetO sites, the function meto.scan() will help you.
If, on the contrary, you’re interested on sets of proteins or MetO sites, grouped according to criteria such as functional effect, species or oxidants, then you are in the right place and meto.search() is your function.
The function meto.list() provides a list with the proteins found in MetOSite whose names contain the keyword provided as argument to the function.
Each MetO site can be assigned to one of three possible groups, depending on the answer to the following question:
Does the oxidation of this methionine has an effect on any biological property of the protein?
-
Group 1: The answer is “We do not know because it has not been addressed”. This group is composed mainly for those MetO sites coming from high-througput studies and nothing is known about the effect of their sulfoxidation just because it has not been addressed (hightroughput.group).
-
Group 2: The answer is “No, we could not find any effect even though we searched for it”. This group is formed for those methionines that are postulated to function as ROS sink (bodyguard.group).
-
Group 3: The answer is “Yes, a change in some property has been described”. This group is populated for those methionines that may fulfil a role in cellular signaling by affecting at least one of the following biological properties (regulatory.group):
- Gain of Activity
- Loss of Activity
- Gain of Protein-Protein Interaction
- Loss of Protein-Protein Interaction
- Effect on Protein Stability
- Effect on Subcellular Location
The function meto.search() helps to download those MetO sites belonging to one or various of the above described groups. For those sites belonging to the “regulatory.group” we can select for those sites affecting one or various of the 6 properties listed above. In addition, data can also be filterd attending to the oxidant and/or the species of interest.
For instance, let’s say we are interested in all the methionine sites known to be oxidized by hypochlorous acid (HClO), regardless of the species and the effect on the protein properties. Then, we have to run:
hclo <- meto.search(oxidant = 'HClO')
The function returns a dataframe with a line per MetO site found. This dataframe provides information regarding the UniProt ID of the protein (prot_id), the protein’s name (prot_name), the position of the MetO site (met_pos), whether the oxidation has been observed in vivo or in vitro (met_vivo_vitro), the functional category of the site (reg_id), the species (org_sp) and the oxidant (org_oxidant).
unique(hclo$prot_name)
## [1] "Glutathione S-transferase F2" ## [2] "Glutathione S-transferase F3" ## [3] "Cytochrome c" ## [4] "Hypochlorite-responsive transcription factor" ## [5] "von Willebrand factor" ## [6] "Protein S100-A9" ## [7] "Fibronectin" ## [8] "Thrombomodulin" ## [9] "StAR-related lipid transfer protein 3" ## [10] "Phosphatidylinositol 3,4,5-trisphosphate 3-phosphatase and dual-specificity protein phosphatase PTEN" ## [11] "Alpha-enolase" ## [12] "Enolase 1" ## [13] "Hsp70 nucleotide exchange factor FES1"
Now, let’s move the focus of interest. We are going to recover all those methionine sites that when oxidized lead to the activation of the protein, regardless of the species and the oxidant. To this end, we’ll filter out those sites belonging to the group 1 (highthroughput) and group 2 (bodyguard) and select gain.activity = 1.
activ <- meto.search(highthroughput.group = FALSE,
bodyguard.group = FALSE,
gain.activity = 1)
print(head(activ[,1:3]))
## prot_id prot_name met_pos ## 1 A0A0E1M553 Hypochlorite-responsive transcription factor 123 ## 2 A0A0E1M553 Hypochlorite-responsive transcription factor 206 ## 3 A0A0E1M553 Hypochlorite-responsive transcription factor 230 ## 4 Q6PHZ2 Calcium/calmodulin-dependent protein kinase type II subunit delta 281 ## 5 Q6PHZ2 Calcium/calmodulin-dependent protein kinase type II subunit delta 285 ## 6 P11035 Nitrate reductase [NADH] 2 538
Note that all the arugments that can be passed to this function are optional. We only pass an argument to the function when we want to use that parameter to filter. Thus, meto.search() will return all the MetO sites found in the database MetOSite:
all <- meto.search()
Then, we can build, for example, a dataframe indicating the number of sites and proteins found for each species
species <- unique(all$org_sp)
mytable <- data.frame(species = species, sites = NA, proteins = NA)
counter <- 0
for (sp in species){
counter <- counter + 1
mytable$sites[counter] <- length(all$met_pos[which(all$org_sp == sp)])
mytable$proteins[counter] <- length(unique(all$prot_id[which(all$org_sp == sp)]))
}
mytable
## species sites proteins ## 1 Arabidopsis thaliana 1044 718 ## 2 Aspergillus nidulans 1 1 ## 3 Bacillus cereus 508 239 ## 4 Bacillus licheniformis 1 1 ## 5 Bos taurus 16 5 ## 6 Conus textile 1 1 ## 7 Dictyostelium discoideum 3 1 ## 8 Drosophila melanogaster 4 3 ## 9 Equus caballus 1 1 ## 10 Escherichia coli 32 5 ## 11 Gallus gallus 9 1 ## 12 Homo sapiens 5621 2594 ## 13 Human immunodeficiency virus 2 2 1 ## 14 Litchi chinensis 3 1 ## 15 Mesocricetus auratus 5 1 ## 16 Mus musculus 53 34 ## 17 Musa acuminata 2 1 ## 18 Mycobacterium tuberculosis 3 1 ## 19 Oryctolagus cuniculus 3 2 ## 20 Ovis aries 3 1 ## 21 Rattus norvegicus 11 2 ## 22 Saccharomyces cerevisiae 5 3 ## 23 Sus scrofa 2 2 ## 24 Triticum aestivum 9 1