API
Boosting
- boosting.boost(csm_fdr=(0.0, 1.0), pep_fdr=(0.0, 1.0), prot_fdr=(0.0, 1.0), link_fdr=(0.0, 1.0), ppi_fdr=(0.0, 1.0), boost_cols=None, neg_boost_cols=None, boost_level='ppi', boost_between=True, method='manhattan', decoy_adjunct='REV_', countdown=3, points=10, n_jobs=-1, **kwargs)
Find the best FDR cutoffs to optimize results for a certain FDR level.
- Parameters:
df (
polars.dataframe.frame.DataFrame) – CSM DataFramecsm_fdr ((
float,float)) – Search range for CSM FDR level cutoffpep_fdr ((
float,float)) – Search range for peptide FDR level cutoffprot_fdr ((
float,float)) – Search range for protein FDR level cutofflink_fdr ((
float,float)) – Search range for residue link FDR level cutoffppi_fdr ((
float,float)) – Search range for protein pair FDR level cutoffboost_cols (
list) – Columns in which to look for lower cutoffsneg_boost_cols (
list) – Columns in which to look for upper cutoffsboost_level (
str) – FDR level tp boost forboost_between (
bool) – Whether to boost for between linksmethod (
str) – Search algorithm to usecountdown (
int) – Number interation without improvement to stoppoints (
int) – Number of FDR cutoffs to search in one iterationn_jobs (
int) – Number of threads to use
- Return type:
(
float,float,float,float,float)- Returns:
Returns a tuple with the optimal FDR levels.
Multi-Level FDR calculation
- fdr.full_fdr(csm_fdr=1.0, pep_fdr=1.0, prot_fdr=1.0, link_fdr=1.0, ppi_fdr=1.0, min_len=5, decoy_adjunct='REV_', unique_csm=True, filter_back=True, prepare_column=True, td_prob=2, td_prot_prob=10, td_dd_ratio=1.0, custom_aggs=None)
- Parameters:
df (
typing.Union[polars.dataframe.frame.DataFrame,pandas.DataFrame]) – Input CSM dataframecsm_fdr (
float) – CSM level FDR cutoffpep_fdr (
float) – Peptide level FDR cutoffprot_fdr (
float) – Protein level FDR cutofflink_fdr (
float) – Link level FDR cutoffppi_fdr (
float) – Protein pair level FDR cutoffmin_len (
int) – Minimum peptide sequence lengthdecoy_adjunct (
str) – Prefix/Suffix indicating a decoy matchunique_csm (
bool) – Make CSMs uniquefilter_back (
bool) – Filter lower levels to include only matches that also pass on higher levelsprepare_column (
bool) – Perform preparation of aggregation columns like sorting ambiguous proteins and swapping protein 1/2td_prob (
int) – Minimum theoretical TD machtes for the FDR levels (except protein level)td_prot_prob (
int) – Minimum theoretical TD machtes for the protein FDR leveltd_dd_ratio (
float) – Minimum ratio of TD/DDcustom_aggs (
dict) – Custom aggregation functions for the FDR levels
- Return type:
dict[str,polars.dataframe.frame.DataFrame]- Returns:
Return a dict with keys ‘csm’, ‘pep’, ‘prot’, ‘link’, ‘ppi’ that contains the resulting polars DataFrame for each FDR level.