Skip to contents

Retrieve a specific subset of the aggregated data into a single data frame by specifying which columns to take from each dataset (file_info, scans, peaks, etc.) using dplyr::select() syntax. If data from more than one dataset is selected (e.g. some columns from scans AND some from peaks), the datasets are combined with an dplyr::inner_join() using the columns listed in by (only the ones actually in the datasets). Joins that would lead to duplicated data entries (i.e. many-to-many joins) are not allowed and will throw an error to avoid unexpected replications of individual datapoints. If you really want to do such a join, you'll have to do it manually.

Usage

orbi_get_data(
  aggregated_data,
  file_info = NULL,
  scans = NULL,
  peaks = NULL,
  spectra = NULL,
  problems = NULL,
  by = c("uid", "scan")
)

Arguments

aggregated_data

datasets aggregated from orbi_aggregate_raw()

file_info

columns to get from the aggregated file_info, all dplyr::select() syntax is supported

scans

columns to get from the aggregated scans, all dplyr::select() syntax is supported

peaks

columns to get from the aggregated peaks, all dplyr::select() syntax is supported

spectra

columns to get from the aggregated spectra, all dplyr::select() syntax is supported

problems

columns to get from the aggregated problems, all dplyr::select() syntax is supported

by

which columns to look for when joining datasets together. Make sure to include the relevant by columns in the selections of the individual datasets so they are joined correctly.

Value

a tibble