Help for package dRiftDM

Type:

Package

Title:

Estimating (Time-Dependent) Drift Diffusion Models

Version:

0.3.1

License:

MIT + file LICENSE

Description:

Fit and explore Drift Diffusion Models (DDMs), a common tool in psychology for describing decision processes in simple tasks. It can handle both time-independent and time-dependent DDMs. You either choose prebuilt models or create your own, and the package takes care of model predictions and parameter estimation. Model predictions are derived via the numerical solutions provided by Richter, Ulrich, and Janczyk (2023, <doi:10.1016/j.jmp.2023.102756>).

Suggests:

testthat (≥ 3.0.0), cowsay, knitr, rmarkdown, DMCfun, truncnorm, vdiffr

Config/testthat/edition:

Encoding:

UTF-8

LazyData:

true

RoxygenNote:

7.3.3

Imports:

withr, parallel, pbapply, mirai, purrr, DEoptim, dfoptim, Rcpp, Rdpack, progress, stats, lifecycle, coda

LinkingTo:

Rcpp

Depends:

R (≥ 4.1.0)

VignetteBuilder:

knitr

RdMacros:

Rdpack

URL:

https://github.com/bucky2177/dRiftDM, https://bucky2177.github.io/dRiftDM/

BugReports:

https://github.com/bucky2177/dRiftDM/issues

NeedsCompilation:

yes

Packaged:

2025-12-01 12:51:54 UTC; cognium-admin

Author:

Valentin Koob [cre, aut, cph], Thomas Richter [aut, cph], Markus Janczyk [aut]

Maintainer:

Valentin Koob <v.koob@web.de>

Repository:

CRAN

Date/Publication:

2025-12-01 13:20:12 UTC

dRiftDM: Estimating (Time-Dependent) Drift Diffusion Models

Description

Author(s)

Maintainer: Valentin Koob v.koob@web.de [copyright holder]

Authors:

Thomas Richter [copyright holder]
Markus Janczyk

Convolute the First Passage Times with the Non-Decision Time Distribution

Description

Calls stats::convolve for the first passage times and the non-decision time distribution to derive the full distribution of response times. Before convolution, I add the robustness parameter.

Usage

add_residual(pdf_nt, pdf_u, pdf_l, dt, nt)

Arguments

pdf_nt

the non-decision time density values

pdf_u, pdf_l

the first passage times

dt, nt

step size and number of steps for the time space (for input checks and scaling)

Value

a list of PDFs for one condition "pdf_u" and "pdf_l"

Aggregate Statistics Across ID

Description

aggregate_stats is a (not exported) function to aggregate stats_dm objects across IDs. Since the column names may vary by the statistic type, the behavior of aggregate depends on the subclass of stats_dm.

Usage

aggregate_stats(stat_df)

Arguments

stat_df

A data.frame of class stats_dm (see new_stats_dm())

Details

aggregate_stats calls the internal_aggregate() with the relevant arguments

Value

Returns the statistics aggregated across the relevant cols.

The Coding of the Boundaries

Description

Functions to get or set the "boundary coding" of an object.

Usage

b_coding(object, ...) <- value

## S3 replacement method for class 'drift_dm'
b_coding(object, ...) <- value

b_coding(object, ...)

## S3 method for class 'drift_dm'
b_coding(object, ...)

## S3 method for class 'fits_ids_dm'
b_coding(object, ...)

## S3 method for class 'fits_agg_dm'
b_coding(object, ...)

Arguments

object

an object of type drift_dm, fits_ids_dm, or fits_agg_dm (see estimate_dm()).

...

additional arguments.

value

a named list, specifying how boundaries are coded (see Details).

Details

b_coding() is a generic accessor function, and ⁠b_coding<-()⁠ a generic replacement function. The default methods get and set the "boundary coding", which is an attribute of drift_dm model.

The boundary coding summarizes which response time belongs to which boundary and how the boundaries shall be "labeled". The list specifies three entries:

column, contains a single character string, indicating which column in an observed data set codes the boundaries.
u_name_value, contains a numeric or character vector of length 1. The name of this vector gives a label for the upper boundary, and the entry gives the value stored in obs_data[[column]] coding the upper boundary.
l_name_value, contains a numeric or character vector of length 1. The name of this vector gives a label for the lower boundary, and the entry gives the value stored in obs_data[[column]] coding the lower boundary.

The package dRiftDM has a default boundary coding:

column = "Error"
u_name_value = c("corr" = 0)
l_name_value = c("err" = 1)

Thus, per default, dRiftDM assumes that any observed data set has a column "Error", providing the values 0 and 1 for the upper and lower boundary, respectively. The upper and lower boundaries are labeled "corr" and "err", respectively. These labels are used, for example, when calculating statistics (see calc_stats).

When calling ⁠b_coding<-()⁠ with value = NULL, the default "accuracy" coding is evoked

Value

For b_coding() a list containing the boundary coding For ⁠b_coding<-()⁠ the updated drift_dm or fits_ids_dm object

Examples

# show the default accuracy coding of dRiftDM
my_model <- ratcliff_dm() # get a pre-built model
b_coding(my_model)

# can be modified/replaced
b_coding(my_model)[["column"]] <- "Response"

# accessor method also available for fits_ids_dm objects
# get an exemplary fits_ids_dm object (see estimate_model_ids)
fits <- get_example_fits("fits_ids_dm")
names(b_coding(fits))

Constant Boundary

Description

Constant Boundary

Usage

b_constant(prms_model, prms_solve, t_vec, one_cond, ddm_opts)

Arguments

prms_model

the model parameters, containing b

prms_solve

solver settings

t_vec

time space

one_cond

one condition

ddm_opts

optional arguments attached to an object

Value

a vector of the same length as t_vec with the b value for each element of the vector.

Collapsing Boundary - Hyperbolic Ratio Function

Description

Provides the boundary, collapsing as a hyperbolic ratio function.

Usage

b_hyperbol(prms_model, prms_solve, t_vec, one_cond, ddm_opts)

dt_b_hyperbol(prms_model, prms_solve, t_vec, one_cond, ddm_opts)

Arguments

prms_model

the model parameters, containing b0, kappa, t05

prms_solve

solver settings

t_vec

time space

one_cond

one condition

ddm_opts

optional arguments attached to an object

Details

b_hyperbol and dt_b_hyperbol provide the plain boundary values and the respective derivative, respectively.

Value

a vector of the same length as t_vec with the boundary values (or the deriviative) for each element of the vector.

Collapsing Boundary - Weibull Function

Description

Provides the boundary, collapsing in accordance with a Weibull function

Usage

b_weibull(prms_model, prms_solve, t_vec, one_cond, ddm_opts)

dt_b_weibull(prms_model, prms_solve, t_vec, one_cond, ddm_opts)

Arguments

prms_model

the model parameters, containing b0, lambda, k, kappa

prms_solve

solver settings

t_vec

time space

one_cond

one condition

ddm_opts

optional arguments attached to an object

Details

b_weibull and dt_b_weibull provide the plain boundary values and the respective derivative, respectively.

Value

a vector of the same length as t_vec with the boundary values (or the deriviative) for each element of the vector.

Calculate Basic Statistics for Response Times or Probability Densities

Description

Function that calls the underlying functions calc_basic_stats_obs and calc_basic_stats_pred. Handles input checks and data wrangling.

Usage

calc_basic_stats(
  pdf_u = NULL,
  pdf_l = NULL,
  rts_u = NULL,
  rts_l = NULL,
  one_cond,
  b_coding,
  t_vec = NULL,
  dt = NULL,
  skip_if_contr_low = NULL
)

Arguments

pdf_u, pdf_l

numeric, vectors of probability density values for the upper and lower boundaries.

rts_u, rts_l

numeric, vectors of response times for the upper and lower boundaries.

one_cond

character, a label for the condition.

b_coding

list, used for accessing the upper boundary label, determines the corresponding column of the returned data.frame (e.g., P_corr).

t_vec

numeric vector, containing the time points corresponding to the probability density values.

dt

a single numeric, providing the step size in t_vec.

skip_if_contr_low

a single numeric, threshold below which probability densities are ignored (default is obtained from drift_dm_skip_if_contr_low()).

Details

If pdf_u and pdf_l are provided, the function computes statistics for the probability densities.
If rts_u and rts_l are provided, the function computes statistics for the observed RTs.
If both sets of inputs are provided, both types of statistics are computed and returned.

Value

A data.frame with columns:

Source: Indicates whether the statistics refer to observed (obs) or predicted (pred) data.
Cond: A condition label.
⁠Mean_<u_label>⁠: Mean response time for the upper boundary.
⁠Mean_<l_label>⁠: Mean response time for the lower boundary.
⁠P_<u_label>⁠: Proportion of upper-boundary responses.

Calculate Basic Mean, Standard Deviations, and Percentages for Observed and Predicted Data

Description

Backend functions to compute the mean response times, standard deviations of response times, and response proportions; for both observed RTs or the predicted probability density functions.

Usage

calc_basic_stats_obs(rts_u, rts_l, one_cond)

calc_basic_stats_pred(
  pdf_u,
  pdf_l,
  one_cond,
  t_vec,
  dt,
  skip_if_contr_low = NULL
)

Arguments

rts_u, rts_l

numeric, vectors of response times for the upper and lower boundaries.

one_cond

character, a label for the condition.

pdf_u, pdf_l

numeric, vectors of probability density values for the upper and lower boundaries.

t_vec

numeric vector, containing the time points corresponding to the probability density values.

dt

a single numeric, providing the step size in t_vec.

skip_if_contr_low

a single numeric, threshold below which probability densities are ignored (default is obtained from drift_dm_skip_if_contr_low()).

Details

For observed data, calculates mean RTs, standard deviations of RTs, and the proportion of upper responses.
The same statistics are calculated for the probability density values (via simple numerical integration)

Value

A data.frame with columns:

Cond: Condition label.
Mean_U: Mean response time for the upper boundary
Mean_L: Mean response time for the lower boundary
SD_U: Mean response time for the upper boundary
SD_L: Mean response time for the lower boundary
P_U: Proportion of upper-boundary responses.

Calculate CAFs

Description

Function that calls the underlying CAF calculation functions calc_cafs_obs and calc_cafs_pred. Does input checks and the data wrangling

Usage

calc_cafs(
  pdf_u = NULL,
  pdf_l = NULL,
  t_vec = NULL,
  rts_u = NULL,
  rts_l = NULL,
  one_cond,
  n_bins = NULL,
  b_coding
)

Arguments

pdf_u, pdf_l

density values for the upper and lower boundary

t_vec

the time space

rts_u, rts_l

vectors of RTs for the upper and lower boundary

one_cond

character label

n_bins

number of bins to use for the CAFs

b_coding

used for accessing the upper boundary label, determines the corresponding column of the returned data.frame (e.g., P_corr).

Details

if pdf_u and pdf_l are not NULL, returns CAFs of the densities

if rts_u and rts_l are not NULL, returns CAFs of the response times

if all are not NULL, returns both.

Value

a data.frame with "Source", "Cond", "Bin"s, "P_<u_label>" for the CAFs of type c("cafs", "sum_dist", "stats_dm", "data.frame")

Calculate CAFs

Description

Backend functions to calculate conditional accuracy functions for RT vectors or pdfs

Usage

calc_cafs_obs(rts_u, rts_l, one_cond, n_bins)

calc_cafs_pred(pdf_u, pdf_l, t_vec, one_cond, n_bins)

Arguments

rts_u, rts_l

vectors of RTs for the upper and lower boundary

one_cond

character label

n_bins

number of bins to use for the CAFs

pdf_u, pdf_l

density values for the upper and lower boundary

t_vec

the time space

Details

for RTs: first elements are attributed to a bin (with bins calculated across all RTs using equally spaced quantiles), then accuracy per bin is calculated.

for Densities: Add density values, calculate a CDF and force it between 0 and

Then determine the indices that cut the CDF into bins by considering equally spaced quantiles. Then calculate the ratio of probability mass per bin.

Value

a data.frame with the "Cond" label, the "Bin"s and "P_U" for the CAFs

Calculate delta function(s)

Description

Given a dataset providing the quantiles (calc_quantiles), calculates delta function(s) for the character vectors minuends and subtrahends

Usage

calc_delta_funs(
  quantiles_dat,
  minuends = NULL,
  subtrahends = NULL,
  dvs = NULL,
  b_coding
)

Arguments

quantiles_dat

a data.frame of quantiles (calc_quantiles)

minuends, subtrahends

character vectors (with equal length), specifying the conditions to use for the delta function: minuend - subtrahend

dvs

character, indicating which quantile columns to use. Default is "Quant_<u_label>". If multiple dvs are provided, then minuends and subtrahends must have the same length, and matching occurs pairwise. In this case, if only one minuend/subtrahend is specified, minuend and subtrahend are recycled to the necessary length.

b_coding

a b_coding object, necessary to build default dvs

Details

Takes the quantile data_frame, stats::reshape it to wide, and then access the relevant dv columns, together with minuends and subtrahends to calculate the delta functions.

Value

a data.frame with columns "Source", "Prob", the "Quant_<u_label>", "Quant_<l_label". May have the following additional columns:

if only one dv: as many Delta_<minuend_subtrahend> and Avg_<minuends_subtrahends> as minuends and subtrahends.
if more than one dv: as many Delta_<u/l-label><minuend_subtrahend> and Avg<u/l-label>_<minuends_subtrahends> as minuends and subtrahends.

The data.frame is of type c("delta_funs", "sum_dist", "stats_dm", "data.frame")

Calculate and Combine Density Estimates for Observed and Predicted Data

Description

Internal helper functions that return a data.frame summarizing density values for observed and predicted response times.

Usage

calc_dens_obs(
  rts_u,
  rts_l,
  one_cond,
  t_max = NULL,
  discr = NULL,
  scaling_factor = 1
)

calc_dens(
  pdf_u = NULL,
  pdf_l = NULL,
  t_vec = NULL,
  t_max = NULL,
  discr = NULL,
  rts_u = NULL,
  rts_l = NULL,
  one_cond,
  b_coding,
  scaling_factor = 1
)

Arguments

rts_u, rts_l

vectors of RTs for the upper and lower boundary

one_cond

character label

t_max

a single numeric value specifying the maximum RT to consider. Defaults to the smallest multiple of discr above the maximum RT. If t_vec is provided, t_max defaults to the maximum value of t_vec.

discr

a single numeric value defining the bin width for histogram and KDE estimation. Defaults to 0.015 (seconds).

scaling_factor

a single numeric value, multiplied with the PDFs. It is used to scale the corresponding probability mass proportional to the number of trials per condition. Defaults to 1.0.

pdf_u, pdf_l

density values for the upper and lower boundary

t_vec

the time space (required for the pdfs)

b_coding

used for accessing the upper/lower boundary labels, determines the corresponding columns of the returned data.frame (e.g., Quant_corr).

Details

calc_dens_obs() computes empirical histograms and kernel density estimates for a single condition based on observed RTs.

calc_dens() serves as a general interface that combines observed and predicted data into a single data.frame. Observed data (rts_u and rts_l) is passed to calc_dens_obs(). Predicted data (pdf_u and pdf_l) is wrapped into a data.frame that matches the structure returned by calc_dens_obs(). If both are provided, observed and predicted data are row-bound into a single data.frame.

These functions are used internally to support type = "density" in calc_stats(), providing a full distributional overview.

Value

A stats_dm object (via new_stats_dm()) containing a data.frame with columns:

Source: indicates whether the row is from observed ("obs") or predicted ("pred") data.
Cond: the condition label.
Time: the time point corresponding to the density value.
Stat: type of density summary—"hist" or "kde" (for observed data), or "pdf" (for predicted data).
⁠Dens_<U>⁠: density value for the upper response.
⁠Dens_<L>⁠: density value for the lower response.

The ⁠<U>⁠ and ⁠<L>⁠ placeholders are determined by the b_coding argument.

Calculate Fit Statistics

Description

Computes/Summarizes multiple fit statistics, inclduing Log-Likelihood, the Negative Log-Likelihood, the Akaike Information Criterion (AIC), the Bayesian Information Criterion (BIC), and the Root-Mean Squared-Error (RMSE) statistic.

Usage

calc_fit_stats(drift_dm_obj, k = 2, ...)

Arguments

drift_dm_obj

an object of type drift_dm

k

a single numeric, scaling the penality of stats::AIC)

...

additional arguments passed forward. Options are probs, n_bins, and weight_err for calculating the RMSE.

Value

A custom object of class stats_dm (c("fit_stats", "stats_dm", "data.frame")). The columns are:

Log_Like: the log-likelihood value
Neg_Log_Like: the negative log-likelihood value
AIC: the calculated AIC value
BIC: the calculated BIC value
RMSE_s: the root-mean-squared error (for RTs in seconds)
RMSE_ms: the root-mean-squared error (for RTs in milliseconds) If a respective statistic cannot be calculated, the respective column contains NA.

Calculate the Log-Likelihood

Description

Wrapper function around log_like_heart

Usage

calc_log_like(pdfs, t_vec, obs_data)

log_like_heart(pdf_u, pdf_l, t_vec, rts_u, rts_l)

Arguments

pdfs

a list of pdfs (see details)

t_vec

time space

obs_data

a list of obs_data

pdf_u, pdf_l

numeric vectors of the pdfs (unpacked)

rts_u, rts_l

numeric vectors of the observed RTs (unpacked)

Details

calc_log_like

Iterates over all conditions, and passes forward the (unpacked) arguments to log_like_heart, adding each log-likelihood of a condition.

pdfs must be a list with entries named as the conditions, and then each condition being a list of the two PDFs (named pdf_u and pdf_l)

obs_data must be a list with entries "rts_u" and "rts_l", and then each rts_* entry being a named list with the RT values for each condition

log_like_heart

Gets the density values for RTs in rts_u/rts_l via stats::approx(), takes the log of that, and then sums across both. Wraps up the calculation in a tryCatch statement, throwing warnings when log_like_values can not be calculated

Value

a single value of the log-likelihood. If no data is provided, NULL is returned. If the calculation fails, -Inf is returned.

Calculate the PDFs

Description

This method takes a model, the time and space vectors, and the unpacked parameters for solving the PDF to derive the first passage time across all conditions. It is a wrapper around the cpp implementations and add_residual. Important: This function is used in the depths of the package and the generic method is not exported.

Usage

calc_pdfs(drift_dm_obj, x_vec, t_vec, prms_solve)

## S3 method for class 'drift_dm'
calc_pdfs(drift_dm_obj, x_vec, t_vec, prms_solve)

Arguments

drift_dm_obj

a model of type drift_dm

x_vec

numeric vector, the evidence space

t_vec

numeric vector, the time space

prms_solve

the discretization (see prms_solve)

Details

calc_pdfs is a generic method which dispatches the function call (not exported). Currently, the method only considers objects of type drift_dm.

calc_pdfs.drift_dm is the function that will be called for all models. It evaluates the different components of a model, and subsequently calls the cpp implementations for the KFE or integral method. It also calls the add_residual function to convolute the non-decision time to the first passage time.

The numerical methods for deriving the PDFs are based on the code provided by (Richter et al. 2023).

Value

a list of PDFs, with named entries for each condition. Each of this entry contains a list of vectors, named "pdf_u" and "pdf_l"

References

Richter T, Ulrich R, Janczyk M (2023). “Diffusion models with time-dependent parameters: An analysis of computational effort and accuracy of different numerical methods.” Journal of Mathematical Psychology, 114, 102756. doi:10.1016/j.jmp.2023.102756.

Calculate Quantiles

Description

Function that calls the underlying quantile calculation functions calc_quantiles_obs() and calc_quantiles_pred(). Does input checks and the data wrangling

Usage

calc_quantiles(
  pdf_u = NULL,
  pdf_l = NULL,
  t_vec = NULL,
  dt = NULL,
  rts_u = NULL,
  rts_l = NULL,
  one_cond,
  probs = NULL,
  b_coding,
  skip_if_contr_low = NULL
)

Arguments

pdf_u, pdf_l

density values for the upper and lower boundary

t_vec

the time space (required for the pdfs)

dt

the step size corresponding to the time space

rts_u, rts_l

vectors of RTs for the upper and lower boundary

one_cond

character label

probs

numeric vector with values between 0 and 1 for the probability levels

b_coding

used for accessing the upper/lower boundary labels, determines the corresponding columns of the returned data.frame (e.g., Quant_corr).

skip_if_contr_low

numeric. If the contribution of the upper or lower PDF to the overall PDF is too low, return NAs for this PDF (see also drift_dm_skip_if_contr_low()).

Details

if pdf_u and pdf_l are not NULL, returns quantiles for the densities

if rts_u and rts_l are not NULL, returns quantiles for the response times

if all are not NULL, returns both.

Value

a data.frame with "Source", "Cond", "Prob"s, "Quant_<u_label>", "Quant_<l_label>" of type c("quantiles", "sum_dist", "stats_dm", "data.frame")

Calculate Quantiles

Description

Backend functions to calculate quantiles for RT vectors or pdfs

Usage

calc_quantiles_obs(rts_u, rts_l, one_cond, probs)

calc_quantiles_pred(
  pdf_u,
  pdf_l,
  t_vec,
  one_cond,
  probs,
  dt,
  skip_if_contr_low = NULL
)

Arguments

rts_u, rts_l

vectors of RTs for the upper and lower boundary

one_cond

character label

probs

numeric vector with values between 0 and 1 for the probability levels

pdf_u, pdf_l

density values for the upper and lower boundary

t_vec

the time space (required for the pdfs)

dt

the step size corresponding to the time space

skip_if_contr_low

numeric. If the contribution of the upper or lower PDF to the overall PDF is too low, return NAs for this PDF (see also drift_dm_skip_if_contr_low()).

Details

for RTs: straightforward via stats::quantile.

for Densities: Calculate CDF (for each pdf separately here), and then map the desired probability level via the CDF (y-axis) to the time space (x-axis)

Value

a data.frame with the "Cond" label, the "Prob"s and "Quant_U" and "Quant_L" for the quantiles

RMSE Calculation and Helpers

Description

Internal helpers for computing the root mean squared error (RMSE) between predicted and observed quantiles and conditional accuracy functions.

Usage

calc_rmse_eval(pdfs, t_vec, dt, stats_agg, stats_agg_info, weight_err = 1.5)

calc_rmse(quants_pred, cafs_pred, quants_obs, cafs_obs, weight_err = 1.5)

Arguments

pdfs

list of PDFs per condition (named).

t_vec

numeric time vector.

dt

numeric time step.

stats_agg

list of observed summary statistics.

stats_agg_info

list with info on quantile probabilities and CAF bins.

weight_err

non-negative numeric scalar; weight factor for CAF error relative to quantile error. Default is 1.5

quants_pred

numeric vector of predicted quantiles (already flattened).

cafs_pred

numeric vector of predicted CAFs (already flattened).

quants_obs

numeric vector of observed quantiles (already flattened).

cafs_obs

numeric vector of observed CAFs (already flattened).

Details

calc_rmse_eval() prepares observed and predicted quantiles/CAFs from PDFs and aggregated info, then calls calc_rmse().
calc_rmse() computes the weighted RMSE given predicted and observed quantiles/CAFs.

Value

A single numeric RMSE value, or NULL if no observed stats were provided, or Inf if predictions failed (contain NA).

Calculate Statistics

Description

calc_stats provides an interface for calculating statistics/metrics on model predictions and/or observed data. Supported statistics include basic statistics on mean and standard deviation, Conditional Accuracy Functions (CAFs), Quantiles, Delta Functions, and fit statistics. Results can be aggregated across individuals.

Usage

calc_stats(object, type, ...)

## S3 method for class 'data.frame'
calc_stats(
  object,
  type,
  ...,
  conds = NULL,
  resample = FALSE,
  progress = 1,
  level = "individual",
  b_coding = NULL
)

## S3 method for class 'drift_dm'
calc_stats(object, type, ..., conds = NULL, resample = FALSE)

## S3 method for class 'fits_ids_dm'
calc_stats(
  object,
  type,
  ...,
  conds = NULL,
  resample = FALSE,
  progress = 1,
  level = "individual"
)

## S3 method for class 'fits_agg_dm'
calc_stats(
  object,
  type,
  ...,
  conds = NULL,
  resample = FALSE,
  progress = 1,
  level = "group",
  messaging = TRUE
)

## S3 method for class 'stats_dm'
print(
  x,
  ...,
  round_digits = NULL,
  print_rows = NULL,
  some = NULL,
  show_header = NULL,
  show_note = NULL
)

## S3 method for class 'stats_dm_list'
print(x, ...)

Arguments

object

an object for which statistics are calculated. This can be a data.frame of observed data, a drift_dm object, a fits_ids_dm object, or a fits_agg_dm object (see estimate_dm()).

type

a character vector, specifying the statistics to calculate. Supported values include "basic_stats", "cafs", "quantiles", "delta_funs", "densities", and "fit_stats".

...

additional arguments passed to the respective method and the underlying calculation functions (see Details for mandatory arguments).

conds

optional character vector specifying conditions to include. Conditions must match those found in the object.

resample

logical. If TRUE, then data is (re-)sampled to create an uncertainty estimate for the requested summary statistic. See Details for more information. Default is FALSE. Note that resampling does not work with type = "fit_stats".

progress

integer, indicating if information about the progress should be displayed. 0 -> no information, 1 -> a progress bar. Default is 1.

level

a single character string, indicating at which "level" the statistic should be calculated. Options are "group" or "individual". If "individual", the returned stats_dm object contains an "ID" column.

b_coding

a list for boundary coding (see b_coding). Only relevant when object is a data.frame. For other object types, the b_coding of the object is used.

messaging

logical, if FALSE, no message is provided.

x

an object of type stats_dm or stats_dm_list, as returned by the function calc_stats().

round_digits

integer, controls the number of digits shown. Default is 3.

print_rows

integer, controls the number of rows shown.

some

logical. If TRUE, a subset of randomly sampled rows is shown.

show_header

logical. If TRUE, a header specifying the type of statistic will be displayed.

show_note

logical. If TRUE, a footnote is displayed indicating that the underlying data.frame can be accessed as usual.

Details

calc_stats is a generic function to handle the calculation of different statistics/metrics for the supported object types. Per default, it returns the requested statistics/metrics.

List of Supported Statistics

Basic Statistics

With "basic statistics", we refer to a summary of the mean and standard deviation of response times, including a proportion of response choices.

Conditional Accuracy Function (CAFs)

CAFs are a way to quantify response accuracy against speed. To calculate CAFs, RTs (whether correct or incorrect) are first binned and then the percent correct responses per bin is calculated.

When calculating model-based CAFs, a joint CDF combining both the pdf of correct and incorrect responses is calculated. Afterwards, this CDF is separated into even-spaced segments and the contribution of the pdf associated with a correct response relative to the joint CDF is calculated.

The number of bins can be controlled by passing the argument n_bins. The default is 5.

Quantiles

For observed response times, the function stats::quantile is used with default settings.

Which quantiles are calcuated can be controlled by providing the probabilites, probs, with values in [0, 1]. Default is seq(0.1, 0.9, 0.1).

Delta Functions

Delta functions calculate the difference between quantiles of two conditions against their mean:

Delta_i = Q_{i,j} - Q_{i,k}
Avg_i = 0.5 \cdot Q_{i,j} + 0.5 \cdot Q_{i,k}

With i indicating a quantile, and j and k two conditions.

To calculate delta functions, users have to specify:

minuends: character vector, specifying condition(s) j. Must be in conds(drift_dm_obj).
subtrahends: character vector, specifying condition(s) k. Must be in conds(drift_dm_obj)
dvs: character, indicating which quantile columns to use. Default is "Quant_<u_label>". If multiple dvs are provided, then minuends and subtrahends must have the same length, and matching occurs pairwise. In this case, if only one minuend/subtrahend is specified, minuend and subtrahend are recycled to the necessary length.
specifying probs is possible (see Quantiles)

Densities

With "densities", we refer to a summary of the distribution of observed or predicted data. For observed data, histogram values and kernel density estimates are provided. For predicted data, the model's predicted PDFs are provided.

Optional arguments are:

discr: numeric, the band-width when calculating the histogram or the kernel density estimates. Defaults to 0.015 seconds
t_max: numeric, the maximum time window when calculating the distribution summaries of observe data. Defaults to the longest RT (for observed data) or the maximum of the time domain of a model (which is the preferred choice, if possible). If necessary, t_max is slightly adjusted to match with discr.
scale_mass: logical, only relevant if observed data is available. If TRUE, density masses are scaled proportional to the number of trials per condition.

Fit Statistics

Calculates the Log-Likelihood, Akaike and Bayesian Information Criteria, and root-mean squared-error statistic.

Optional arguments are:

k: numeric, for penalizing the AIC statistic (see also stats::AIC and AIC.fits_ids_dm).
n_bins, probs: numeric vectors, see the section on CAFs and Quantiles above
weight_err: numeric scalar, determines how CAFs and quantiles are weighted. Default is 1.5.

Resampling

When resampling = TRUE, an uncertainty interval is provided via simulation. The default number of iterations is R = 100, which can be changed by passing the optional argument R.

If resampling is requested, the returned stats_dm object contains the column "Estimate", coding the interval. The interval width is controlled via the optional argument interval_level, a single numeric value between 0 and 1 (default: 0.95). The interpretation of this interval depends on the specific situation (see below).

Resampling at the Individual Level

If object is a drift_dm object (i.e., a single model created by drift_dm()), synthetic data are simulated under the model, and for each synthetic data set the requested statistic is calculated. The interval then reflects the range of these simulated statistics. To determine the number of trials for each synthetic data set, dRiftDM either uses the observed data attached to the model (if available) or the optional argument n_sim (passed to simulate_data()). Note that n_sim must be provided if no observed data are available, and that n_sim always has priority.

If object is a drift_dm object with attached observed data, resampling is also performed for the observed data. In this case, trials are bootstrapped, and for each bootstrap sample the requested statistic is calculated.

If object is a data.frame, fits_agg_dm, or fits_ids_dm object, resampling is performed for each individual if level = "individual". For both models and observed data, synthetic or bootstrapped data sets are generated as described above.

Resampling at the Group Level

Group-level resampling is possible only if object is a data.frame (with an "ID" column), fits_agg_dm, or fits_ids_dm object. To request this, set level = "group". Participants are then bootstrapped, and for each bootstrapped sample the aggregated statistic is calculated.

Interpretation of Intervals

For level = "group", intervals represent bootstrapped confidence intervals For level = "individual", intervals represent the variability in the statistic when data for a single participant are resampled or simulated under the model.

Note

For objects of type fits_agg_dm, which contain a mixture of group- and individual-level information, the level argument only affects resampling for the observed data. For the model itself, resampling is always performed under the fitted model, in the same way as for a drift_dm object.

Value

If type is a single character string, then a subclass of data.frame is returned, containing the respective statistic. Objects of type sum_dist will have an additional attribute storing the boundary encoding (see also b_coding). The reason for returning subclasses of data.frame is to provide custom plot() methods (e.g., plot.cafs). To get rid of the subclass label and additional attributes (i.e., to get just the plain underlying data.frame, users can use unpack_obj()).

If type contains multiple character strings (i.e., is a character vector) a subclass of list with the calculated statistics is returned. The list will be of type stats_dm_list (to easily create multiple panels using the respective plot.stats_dm_list() method).

The print methods print.stats_dm() and print.stats_dm_list() each invisibly return the supplied object x.

Note

When a model's predicted density function integrates to a value of less than drift_dm_skip_if_contr_low(), means and quantiles return the values NA. Users can alter this by explicitly passing the argument skip_if_contr_low when calling calc_stats() (e.g., calc_stats(..., skip_if_contr_low = -Inf))

Examples

# Example 1: Calculate CAFs and Quantiles from a model ---------------------
# get a model for demonstration purpose
a_model <- ssp_dm()
# and then calculate cafs and quantiles
some_stats <- calc_stats(a_model, type = c("cafs", "quantiles"))
print(some_stats)

# Example 2: Calculate a Delta Function from a data.frame ------------------
# get a data set for demonstration purpose
some_data <- ulrich_simon_data
conds(some_data) # relevant for minuends and subtrahends
some_stats <- calc_stats(
  a_model,
  type = "delta_funs",
  minuends = "incomp",
  subtrahends = "comp"
)
print(some_stats, print_rows = 5)


# Example 3: Calculate Quantiles from a fits_ids_dm object -----------------
# get an auxiliary fits_ids_dm object
all_fits <- get_example_fits("fits_ids_dm")
some_stats <- calc_stats(all_fits, type = "quantiles")
print(some_stats, print_rows = 5) # note the ID column

# one can also request that the statistics are averaged across individuals
print(
  calc_stats(all_fits, type = "quantiles", average = TRUE)
)

Calculate Statistics for Model Prediction and/or Observed Data

Description

This function derives statistics that can be calculated for model predictions and/or observed data. However, it does not calculate it, but rather calls the respective backend functions. Supported statistics currently include:

Basic Summary Statistics (i.e., means and response percentages calc_basic_stats())
Conditional Accuracy Functions (CAFs; calc_cafs())
Quantiles (calc_quantiles())
Delta Functions (calc_delta_funs()).
Density Estimates (calc_dens()).

Usage

calc_stats_pred_obs(type, b_coding, conds, ..., scale_mass = FALSE)

Arguments

type

character string, specifying the type of statistic to calculate. Available options are "basic_stats", "cafs", "quantiles", "delta_funs", and "densities".

b_coding

list for the boundary coding (see b_coding).

conds

character vector, specifying the conditions to include in calculations (used for labeling and subsetting the model PDFs and the observed data).

...

Additional parameters passed on to the specific statistic calculation function (see Details).

scale_mass

a single logical, only relevant for density estimation. If TRUE, PDF masses are scaled proportional to the number of trials per condition.

Details

When calling this function the arguments all_rts_u/all_rts_l and/or all_pdfs must always be specified (see re_evaluate_model, obs_data). Otherwise, the backend functions won't work properly. Further arguments are:

for CAFS: n_bins controls the number of bins, with a default of 5.
for Quantiles and Delta Functions: probs controls the quantiles to calculate. Default is seq(0.1, 0.9, 0.1) (see drift_dm_default_probs()).
for basic summary satistics, Quantiles, and Delta Function: skip_if_contr_low controls if quantiles and means are calculated for PDFs with very small contribution (see also drift_dm_skip_if_contr_low()).
for densities: discr controls the bin width for the observed data. Default is 0.015 seconds

This function gets called by calc_stats()

Value

A data frame with the calculated statistic across conds (ordered according to Source).

Metropolis Acceptance Step for Vectorized MCMC Sampling

Description

This internal function computes the Metropolis acceptance decision for a set of MCMC proposals, using either hierarchical or non-hierarchical posterior evaluation. It returns updated log-posterior and log-likelihood values, as well as a logical vector indicating accepted proposals.

Usage

call_log_posterior_m(
  proposal_mat,
  prev_prms_mat,
  prev_pis,
  prev_lls,
  level,
  re_eval,
  ...
)

Arguments

proposal_mat

a numeric matrix of proposed parameter values. Each column corresponds to one chain; rows represent parameters.

prev_prms_mat

a numeric matrix of current (previous) parameter values. Must have the same dimensions as proposal_mat.

prev_pis

a numeric vector of current log-posterior values for each chain.

prev_lls

a numeric vector of current log-likelihood values for each chain.

level

a character string specifying the sampling level, either "lower", "hyper", or "none". Determines whether to call log_posterior_lower() or log_posterior_hyper().

re_eval

logical. If TRUE, the log-posterior and log-likelihood for the current parameters are re-evaluated.

...

Additional arguments passed to log_posterior_lower() or log_posterior_hyper(), depending on the level.

Details

This function implements a vectorized Metropolis acceptance step for multiple MCMC chains simultaneously. The posterior is calculated using either log_posterior_lower() for subject-level parameters or log_posterior_hyper() for group-level parameters.

Log-posterior and log-likelihood values are only updated where proposals were accepted. In cases where proposals yield invalid posteriors (i.e., NA), they are automatically rejected.

Value

A list with three elements:

pis: A numeric vector of updated log-posterior values.
lls: A numeric vector of updated log-likelihood values.
accept: A logical vector of length equal to the number of chains, indicating which proposals were accepted.

Check a B_Coding

Description

Checks if a list satisfies the requirements to be considered a valid b_coding

Usage

check_b_coding(b_coding)

Arguments

b_coding

a named list

Details

Checks for...

input being a list
list is of length three and provides the names "column", "u_name_value", "l_name_value",
if b_coding$column provides a single string
if b_coding$u_name_value and b_coding$l_name_value provides a single named value of type character or numeric, and that both are of the same type

Value

the unmodified list for convenience

Checks if a variable/vector of length 1 is a number > 0 or 0/expression

Description

assumed that x is either a digit or an expression

Usage

check_digit_larger_0(x)

Arguments

x

a vector of length 1

Value

a single logical

Check time/space discretization via reference comparison

Description

check_discretization() helps you choose or check time (dt) and space (dx) discretization settings. It computes a high-precision reference solution of the model's PDFs with dt_ref/dx_ref, and then compares the reference PDFs to the discretization settings of the supplied object, using the Hellinger distance per condition. Smaller distances indicate closer agreement with the reference — i.e., a sufficiently fine grid.

There are not yet overall and officially published recommendations on how large the Hellinger distance can be without affecting model precision, and this very likely will depend on the model itself. Based on some preliminary simulations using dmc_dm(), we would recommend trying to keep the Hellinger Distance at best below 10 percent. However, we also observed for extreme parameter values that the Hellinger distance can be even larger without sacrificing the qualitative model behavior, and vice versa! It is thus best to iterate between plotting model predictions and calculating the Hellinger Distance, to ensure that you can best interpret this quantity for your model at hand. Furthermore, we recommend to run parameter recoveries using simulate_data() and estimate_dm(), to check if you can recover data generated under your model with fine discretization using that same model with coarse discretization.

Usage

check_discretization(object, ...)

## S3 method for class 'drift_dm'
check_discretization(
  object,
  ...,
  dt_ref = 0.001,
  dx_ref = 0.001,
  round_digits = 5
)

## S3 method for class 'fits_ids_dm'
check_discretization(object, ...)

## S3 method for class 'fits_agg_dm'
check_discretization(object, ...)

Arguments

object

a drift_dm, fits_agg_dm, or fits_ids_dm object. (the latter two are returned by estimate_dm())

...

further arguments passed forward to the respective method.

dt_ref, dx_ref

numeric scalars, providing a fine time or space step size for the reference solution. Defaults to 0.001.

round_digits

number of decimal places to which the final Hellinger distances are rounded (default: 5).

Details

Under the hood, for each condition, we concatenate the lower- and upper- boundary PDFs (pdf_l, pdf_u), interpolate the model PDFs to a time space matching with the reference PDFs, and then compute the Hellinger distance: H(p,q) = \sqrt{1 - \int \sqrt{p(t)\,q(t)}\,dt}

There are not yet overall, officially published recommendations on how large the Hellinger distance can be without affecting model precision, and this may even depend on the specific model. Based on preliminary simulations, we recommend trying to keep the average Hellinger distance below 5\

The reference discretizations (dt_ref/dx_ref) must be at least as fine as the object's current discretization settings (dt_model/dx_model). If dt_model < dt_ref or dx_model < dx_ref, an error is raised because the “reference” would not be the finest solution.

Value

a named numeric vector of Hellinger distances (one per condition) if object is of type drift_dm or fits_agg_dm. A data.frame of Hellinger distances across IDs and conditions if object is of type fits_ids_dm. Hellinger distances are in ⁠[0, 1]⁠, where 0 means identical to the reference.

Examples

# Example:
my_model <- ratcliff_dm()

# Assess current (dt=0.0075, dx=0.02) against a fine reference:
check_discretization(my_model)

# If distances are near zero across conditions, the current grid is adequate.

Check if Object is a Named Numeric Vector

Description

Validates that an object is a named numeric vector with specified attributes. Optionally checks specific names, length, and restrictions on label characters.

Usage

check_if_named_numeric_vector(
  x,
  var_name,
  labels = NULL,
  length = NULL,
  allow_non_word_chars = FALSE
)

Arguments

x

numeric vector, expected to be named.

var_name

character, the name of the variable to display in error messages.

labels

character vector, optional, specifying valid names for x. If provided, all names in x must match these labels.

length

integer, optional, specifying the exact required length of x.

allow_non_word_chars

logical, whether to permit non-word characters in names (default is FALSE).

Details

Checks for:

Numeric type of x with non-zero length
Required length, if specified
Unique, non-empty names for each entry in x
Match of all names in x to labels, if labels is specified
Absence of NA of Inf values in x
Optional absence of non-word names if allow_non_word_chars is FALSE

Value

Throws an error if the conditions are not met. If all checks pass, no output is returned.

checks if all entries of internal_list are an expression or integer. Throws an error if not (internal docu)

Description

Checks also if there are entries for each parameter and condition

Usage

check_internal_list(internal_list, prm_names, cond_names)

Arguments

internal_list

a list, referring to a (linear) internal list of a flex_prms object.

prm_names

the expected parameter of the flex_prms object

cond_names

the expected conditions of the flex_prms object

Value

the internal list for convenience

Check and Reduce the Observed Data

Description

Checks a data set that is considered an "observed data set". Used in the internals of dRiftDM. When calling this function, unncessary column names are stripped away.

Usage

check_reduce_raw_data(obs_data, b_coding_column, u_value, l_value)

Arguments

obs_data

a data.frame

b_coding_column

a single string, indicating which column of obs_data indicates how each RT corresponds to the boundaries.

u_value, l_value

the value within the b_coding_column column that specifies the upper/lower boundary

Details

Checks:

if obs_data is a data.frame
if "RT", b_coding_column, and "Cond" column are present
when IDs are present, if each ID has values on each condition. At the same time unused factor levels are dropped drop_levels_ID_column
If all columns are there, the data set is reduced to the relevant ones
for missing Values, and drops rows with missing values
if "Cond" is of type character, and if not casts it to character
if RT is of type numeric, and of not casts it to numeric
RTs are >= 0
that the values in b_coding_column match with u_value and l_value (casts the column if necessary)
if b_coding_column has only 1 or 2 unique values

Value

the obs_data for convenience (with edits as listed under Details).

Access Coefficients of a Model

Description

Extract or set the coefficients/parameters objects supported by dRiftDM.

Usage

coef(object, ...) <- value

## S3 replacement method for class 'drift_dm'
coef(object, ..., eval_model = FALSE) <- value

## S3 method for class 'drift_dm'
coef(object, ..., select_unique = TRUE, select_custom_prms = TRUE)

## S3 method for class 'fits_agg_dm'
coef(object, ...)

## S3 method for class 'fits_ids_dm'
coef(object, ...)

## S3 method for class 'mcmc_dm'
coef(object, ..., .f = mean, id = NULL)

## S3 method for class 'coefs_dm'
print(
  x,
  ...,
  round_digits = drift_dm_default_rounding(),
  print_rows = 10,
  some = FALSE,
  show_header = TRUE,
  show_note = TRUE
)

Arguments

object

an object of type drift_dm, fits_agg_dm, fits_ids_dm (see also estimate_dm()), or mcmc_dm.

...

additional arguments passed forward (to coef.drift_dm() for objects of type fits_agg_dm; to .f for objects of type mcmc_dm.

value

numerical, a vector with valid values to update the model's parameters. Must match with the number of (unique and free) parameters.

eval_model

logical, indicating if the model should be re-evaluated or not when updating the parameters (see re_evaluate_model). Default is FALSE.

select_unique

logical, indicating if only those parameters shall be returned that are considered unique (e.g., when a parameter is set to be identical across three conditions, then the parameter is only returned once). Default is TRUE. This will also return only those parameters that are estimated. The argument is currently not supported for objects of type mcmc_dm.

select_custom_prms

logical, indicating if custom parameters shall be returned as well. Only has an effect if select_unique = FALSE. The argument is currently not supported for objects of type mcmc_dm.

.f

the function to be applied to each parameter of a chain. Must either return a single value or a vector (with always the same length). Default is mean (i.e., the mean function).

id

an optional numeric or character vector specifying the IDs of participants from whom to summarize samples. Only applicable when the model was estimated hierarchically. Use id = NA as a shorthand to summarize samples for all individuals in the chain object.

x

an object of type coefs_dm, as returned by the function coef() when supplied with a fits_ids_dm object.

round_digits

integer, controls the number of digits shown. Default is 3.

print_rows

integer, controls the number of rows shown.

some

logical. If TRUE, a subset of randomly sampled rows is shown.

show_header

logical. If TRUE, a header specifying the type of statistic will be displayed.

show_note

logical. If TRUE, a footnote is displayed indicating that the underlying data.frame can be accessed as usual.

Details

⁠coef.*()⁠ are methods for the generic stats::coef() function; ⁠coefs<-()⁠ is a generic replacement function, currently supporting objects of type drift_dm.

The argument value supplied to the ⁠coefs<-()⁠ function must match with the vector returned from ⁠coef(<object>)⁠. It is possible to update just part of the (unique) parameters.

Whenever the argument select_unique is TRUE, dRiftDM tries to provide unique parameter labels.

Value

For objects of type drift_dm, coefs() returns either a named numeric vector if select_unique = TRUE, or a matrix if select_unique = FALSE. If select_custom_prms = TRUE, custom parameters are added to the matrix.

For objects of type fits_ids_dm, coefs() returns a data.frame. If select_unique = TRUE, the columns will be the (unique, free) parameters, together with a column coding IDs. If select_unique = FALSE, the columns will be the parameters as listed in the columns of prms_matrix (see drift_dm), together with columns coding the conditions and IDs. If select_custom_prms = TRUE, the data.frame will also contain columns for the custom parameters. The returned data.frame has the class label coefs_dm to easily plot histograms for each parameter (see hist.coefs_dm).

For objects of type fits_agg_dm, returns the same as coef.drift_dm() (i.e., as if calling coef() with an object of type drift_dm)

For objects of type mcmc_dm, the return type depends on the model structure and the .f output:

If the model is non-hierarchical or id is a single value (not NA), the function returns either a vector or a matrix, depending on whether .f returns a single value or a vector.
In the hierarchical case, when id is a vector or NA, the function returns a data.frame. If .f returns a single value, the data.frame will contain one row per participant (with an ID column and one column per parameter). If .f returns a vector, the data.frame will include an additional column .f_out, coding the output of .f in long format.

Examples

# get a pre-built model and a data set for demonstration purpose
# (when creating the model, set the discretization to reasonable values)
a_model <- dmc_dm()
coef(a_model) # gives the free and unique parameters
coef(a_model, select_unique = FALSE) # gives the entire parameter matrix

The Component Functions of A Model

Description

Functions to get or set the "component functions" of an object. The component functions are a list of functions providing the drift rate, boundary, starting point distribution, and non-decision time distribution They are at the heart of the package and shape the model's behavior.

Usage

comp_funs(object, ...) <- value

## S3 replacement method for class 'drift_dm'
comp_funs(object, ..., eval_model = FALSE) <- value

comp_funs(object, ...)

## S3 method for class 'drift_dm'
comp_funs(object, ...)

## S3 method for class 'fits_ids_dm'
comp_funs(object, ...)

## S3 method for class 'fits_agg_dm'
comp_funs(object, ...)

Arguments

object

an object of type drift_dm, fits_ids_dm, or fits_agg_dm (see estimate_dm()).

...

additional arguments passed down to the specific method.

value

a named list which provides the component functions to set (see Details)

eval_model

logical, indicating if the model should be re-evaluated or not when updating the component funtions (see re_evaluate_model). Default is FALSE.

Details

comp_funs() is a generic accessor function, and ⁠comp_funs<-()⁠ is a generic replacement function. The default methods get and set the "component functions". The component functions are a list of functions, with the following names (see also vignette("customize_ddms", "dRiftDM") for examples):

mu_fun and mu_int_fun, provide the drift rate and its integral, respectively, across the time space.
x_fun provides a distribution of the starting point across the evidence space.
b_fun and dt_b_fun provide the values of the upper decision boundary and its derivative, respectively, across the time space. It is assumed that boundaries are symmetric.
nt_fun provides a distribution of the non-decision component across the time space.

All of the listed functions are stored in the list comp_funs of the respective model (see also drift_dm()).

Each component function must take the model's parameters (i.e., one row of prms_matrix), the parameters for deriving the PDFs, the time or evidence space, a condition, and a list of optional values as arguments. These arguments are provided with values when dRiftDM internally calls them.

In order to work with dRiftDM, mu_fun, mu_int_fun, b_fun, dt_b_fun, and nt_fun must have the following declaration: ⁠my_fun = function(prms_model, prms_solve, t_vec, one_cond, ddm_opts⁠). Here, prms_model is one row of prms_matrix, prms_solve the parameters relevant for deriving the PDFs, t_vec the time space, going from 0 to t_max with length nt + 1 (see drift_dm), and one_cond a single character string, indicating the current condition. Finally dmm_opts may contain additional values. Each function must return a numeric vector of the same length as t_vec. For mu_fun, mu_int_fun, b_fun, dt_b_fun the returned values provide the respective boundary/drift rate (and their derivative/integral) at every time step t. For nt_fun the returned values provide the density of the non-decision time across the time space (which get convoluted with the pdfs when solving the model)

In order to work with dRiftDM, x_fun must have the following declaration: ⁠my_fun = function(prms_model, prms_solve, x_vec, one_cond, ddm_opts⁠). Here, x_vec is the evidence space, going from -1 to 1 with length nx + 1 (see drift_dm). Each function must return a numeric vector of the same length as x_vec, providing the density values of the starting points across the evidence space.

Drift rate and its integral:

The drift rate is the first derivative of the expected time-course of the diffusion process. For instance, if we assume that the diffusion process X is linear with a slope of v...

E(X) = v \cdot t

...then the drift rate at every time step t is the constant v, obtained by taking the derivative of the expected time-course with respect to t:

\mu(t) = v

Conversely, the integral of the drift rate is identical to the expected time-course:

\mu_{int}(t) = v \cdot t

For the drift rate mu_fun, the default function when calling drift_dm() is a numeric vector containing the number 3. Its integral counterpart mu_int_fun will return a numeric vector containing the values t_vec*3.

Starting Point Distribution:

The starting point of a diffusion model refers to the initial value taken by the evidence accumulation process at time t=0. This is a PDF over the evidence space.

The default function when calling drift_dm() will be a function returning a dirac delta on zero, meaning that every potential diffusion process starts at 0.

Boundary:

The Boundary refers to the values of the absorbing boundaries at every time step t in a diffusion model. In most cases, this will be a constant. For instance:

b(t) = b

In this case, its derivative with respect to t is 0.

The default function when calling drift_dm() will be function for b_fun returning a numeric vector of length length(t_vec) containing the number 0.5. Its counterpart dt_b will return a numeric vector of the same length containing its derivative, namely, 0.

Non-Decision Time:

The non-decision time refers to an additional time-requirement. Its distribution across the time space will be convoluted with the PDFs derived from the diffusion process.

In psychology, the non-decision time captures time-requirements outside the central decision process, such as stimulus perception and motor execution.

The default function when calling drift_dm() returns a dirac delta on t = 0.3.

Value

For comp_funs() the list of component functions.

For ⁠comp_funs<-()⁠ the updated drift_dm object.

Note

There is only a replacement function for drift_dm objects. This is because replacing the component functions after the model has been fitted (i.e., for a fits_ids_dm object) doesn't make sense.

Examples

# get a pre-built model for demonstration
my_model <- ratcliff_dm()
names(comp_funs(my_model))

# direct replacement (see customize_ddms for a more information on
# how to write custom component functions)
# 1. Choose a uniform non-decision time from the pre-built component_shelf()
nt_uniform <- component_shelf()$nt_uniform
# swap it in
comp_funs(my_model)[["nt_fun"]] <- nt_uniform

# now update the flex_prms object to ensure that this model has the required
# parameters
prms <- c(muc = 3, b = 0.6, non_dec = 0.3, range_non_dec = 0.05)
conds <- "null"
new_flex_prms <- flex_prms(prms, conds = conds)
flex_prms(my_model) <- new_flex_prms

# accessor method also available for fits_ids_dm objects
# (see estimate_model_ids)
# get an exemplary fits_ids_dm object
fits <- get_example_fits("fits_ids_dm")
names(comp_funs(fits))

Evaluate all Component Functions

Description

Gets/calculates all values provided by the component functions of a drift_dm object

Usage

comp_vals(
  drift_dm_obj,
  x_vec = NULL,
  t_vec = NULL,
  nt = NULL,
  dt = NULL,
  nx = NULL,
  dx = NULL,
  prms_solve = NULL,
  solver = NULL,
  prms_matrix = NULL
)

Arguments

drift_dm_obj

an object of type drift_dm

x_vec

optional, the discretized evidence space

t_vec

optional, the discretized time space

nx, nt, dx, dt

optional, the steps and step sizes of each space

prms_solve

optional, vector of solver settings

solver

optional, string controlling which component values are evaluated

prms_matrix

optional, matrix of parameters

Details

arguments are optional, because they can be extracted from the model. However, supplying these are faster than creating them.

Value

If solver "kfe", a named list with entries "mu_vals", "x_vals", "b_vals", "dt_b_vals", "nt_vals".

If solver "im_zero", the returned list will also contain "mu_int_vals".

Diffusion Model Components

Description

This function is meant as a convenient way to access pre-built model component functions.

Usage

component_shelf()

Details

The function provides the following functions:

mu_constant, provides the component function for a constant drift rate with parameter muc.
mu_dmc, provides the drift rate of the superimposed diffusion process of DMC. Necessary parameters are muc (drift rate of the controlled process), a (shape..), A (amplitude...), tau (scale of the automatic process).
mu_ssp, provides the drift rate for SSP. Necessary parameters are p (perceptual input of flankers and target), sd_0 (initial spotlight width), r (shrinking rate of the spotlight) and 'sign' (an auxiliary parameter for controlling the contribution of the flanker stimuli). Note that no mu_int_ssp exists.
mu_int_constant, provides the complementary integral to mu_constant.
mu_int_dmc, provides the complementary integral to mu_dmc.
x_dirac_0, provides a dirac delta for a starting point centered between the boundaries (no parameter required).
x_uniform, provides a uniform distribution for a start point centered between the boundaries. Requires a parameter range_start (between 0 and 2).
x_beta, provides the function component for a symmetric beta-shaped starting point distribution with parameter alpha.
b_constant, provides a constant boundary with parameter b.
b_hyperbol, provides a collapsing boundary in terms of a hyperbolic ratio function with parameters b0 as the initial value of the (upper) boundary, kappa the size of the collapse, and t05 the point in time where the boundary has collapsed by half.
b_weibull, provides a collapsing boundary in terms of a Weibull distribution with parameters b0 as the initial value of the (upper) boundary, lambda controlling the time of the collapse, k the shape of the collapse, and kappa the size of the collapse.
dt_b_constant, the first derivative of b_constant.
dt_b_hyperbol, the first derivative of b_hyperbol.
nt_constant, provides a constant non-decision time with parameter non_dec.
nt_uniform, provides a uniform distribution for the non-decision time. Requires the parameters non_dec and range_non_dec.
nt_truncated_normal, provides the component function for a normally distributed non-decision time with parameters non_dec, sd_non_dec. The Distribution is truncated to [0, t_{max}].
dummy_t a function that accepts all required arguments for mu_fun or mu_int_fun but which throws an error. Might come in handy when a user doesn't require the integral of the drift rate.

See vignette("customize_ddms", "dRiftDM") for more information on how to set/modify/customize the components of a diffusion model.

Value

A list of the respective functions; each entry/function can be accessed by "name" (see the Example and Details).

Examples

pre_built_functions <- component_shelf()
names(pre_built_functions)

The Conditions of an Object

Description

Extract the conditions from a (supported) object.

Usage

conds(object, ...) <- value

## S3 replacement method for class 'drift_dm'
conds(object, ..., eval_model = FALSE, messaging = TRUE) <- value

conds(object, ...)

## S3 method for class 'drift_dm'
conds(object, ...)

## S3 method for class 'fits_ids_dm'
conds(object, ...)

## S3 method for class 'fits_agg_dm'
conds(object, ...)

## S3 method for class 'data.frame'
conds(object, ...)

## S3 method for class 'traces_dm_list'
conds(object, ...)

Arguments

object

an R object, see details

...

additional arguments passed forward.

value

a character vector, providing labels for the model's new conditions.

eval_model

logical, indicating if the model should be re-evaluated or not when updating the conditions (see re_evaluate_model). Default is FALSE.

messaging

logical, indicating if messages shall be displayed or not.

Details

conds() is a generic accessor function and ⁠conds<-()⁠ is a generic replacement function. The replacement method currently only supports drift_dm objects. The default methods get and set the conditions of an object.

When replacing the conditions of a drift_dm object, a new flex_prms object is created and then set to the model, resetting all parameter specifications and setting all parameter values to those of the previously first condition. In addition, if data was attached to the model, the data is removed. This is because there is no meaningful way for dRiftDM to know how the model should behave for the newly introduced condition(s), and how these new conditions relate to the old ones. Messages reminding the user of this behavior are displayed per default.

Value

For conds() NULL or a character vector with the conditions. NULL is given if the object has no conditions (e.g., when a data.frame has no Cond column).

For ⁠conds<-()⁠ the updated drift_dm object.

Examples

# get a pre-built model to demonstrate the conds() function
my_model <- dmc_dm()
conds(my_model)

# accessor functions also work with other object types provided by dRiftDM
# (simulated traces; see the documentation of the respective function)
some_traces <- simulate_traces(my_model, k = 1)
conds(some_traces)

# get an exemplary fits_ids_dm object (see estimate_model_ids)
fits <- get_example_fits("fits_ids_dm")
conds(fits)

# also works with data.frames that have a "Cond" column
conds(dmc_synth_data)

Copy Class Attributes from One Object to Another

Description

This function transfers class attributes from an old object to a new object, ensuring that new inherits the class structure and missing attributes of old. The primary purpose is to enforce class consistency and restore any lost attributes when modifying or combining objects. It is used in the internals of the package and it is not exported.

Usage

copy_class_attributes(old, new)

## S3 method for class 'stats_dm'
copy_class_attributes(old, new)

Arguments

old

The source object from which class attributes will be copied.

new

The target object to which class attributes will be assigned.

Details

The function assumes that all class attributes of new can be found in old. Note also, that the order of attributes is not ensured.

Value

The modified new object with attributes and class from old.

Access/Replace the Cost Function Label and Access the Cost Function Value

Description

Functions to access/replace the cost function label of a ⁠dRiftDM object⁠ and to access the current cost function value. The cost function label codes which cost function is used during estimation (e.g., the negative log-likelihood). The cost function value indicates the current value of the cost function given the current set of parameters and the data.

Usage

cost_function(object, ...) <- value

## S3 replacement method for class 'drift_dm'
cost_function(object, ..., eval_model = FALSE) <- value

cost_function(object, ...)

## S3 method for class 'drift_dm'
cost_function(object, ...)

## S3 method for class 'fits_ids_dm'
cost_function(object, ...)

## S3 method for class 'fits_agg_dm'
cost_function(object, ...)

cost_value(object, ...)

## S3 method for class 'drift_dm'
cost_value(object, ...)

## S3 method for class 'fits_ids_dm'
cost_value(object, ...)

## S3 method for class 'fits_agg_dm'
cost_value(object, ...)

Arguments

object

an object of type drift_dm, fits_ids_dm, or fits_agg_dm (see estimate_dm()).

...

additional arguments passed down to update_stats_agg() when setting the cost function label.

value

a character string, providing the cost function label (options are "neg_log_like" or "rmse")

eval_model

logical, indicating if the model should be re-evaluated or not when updating the conditions (see re_evaluate_model). Default is FALSE.

Value

cost_function() returns a single character string, specifying the used cost function
⁠cost_function<-()⁠ returns the model object with the updated cost function.
cost_value() returns a single numeric if object is of type drift_dm or fits_agg_dm. If there is no data attached to an object of type drift_dm, the function returns NULL. If object is of type fits_ids_dm, the function returns a data.frame with all cost values across participants.

Examples

# get a pre-built model for demonstration purpose
a_model <- ratcliff_dm(obs_data = ratcliff_synth_data)
cost_function(a_model)
cost_value(a_model)

# switch the default cost function to rmse
cost_function(a_model) <- "rmse"
out <- estimate_dm(a_model, verbose = 0, messaging = FALSE)
# -> the model was estimated using the RMSE statistic

Count the number of digits > 0

Description

This function takes one entry of the internal_list (i.e., all conditions for one parameter) and counts how often there are unique digits > 0

Usage

count_unique_prms_one_internal_entry(one_internal_entry)

Arguments

one_internal_entry

such as internal_list[["A"]]

Value

an integer number

Create a matrix for lower and upper

Description

Outsourced, deep inside the package function to avoid large nesting

Usage

create_matrix_smart(input, conds, prm_labels = NULL)

Arguments

input

either a list or a vector of numeric values

conds

a character string, conceptually representing the conditions of a model

prm_labels

a character string with parameter labels. Used as a fall back when the default_values are not labeled (see details)

Details

The goal of this function is to build up a matrix, serving as the upper or lower end of a parameter space (relevant when simulating data). The function gets called by get_parameters_smart().

It assumes the following: input is either a list or a numeric vector.

The easiest case is when it is a numeric vector. In this case, the function builds a matrix with as many rows as entries in conds. The rows will also be labeled according to conds. The column names are either the names specified with the numeric vector, or the labels specified in prm_labels
The less intuitive case is when input is a list. In this case, the list requires an entry called "default_values" which specifies the named or plain numeric vector as above. If the list only contains this entry, then the behavior is as if input was already a numeric vector. However, the input list can also have entries labeled as specific conditions, which contain named (!) numeric vectors with parameter labels. This will modify the value for the upper/lower parameter space with respect to the specified parameters in the respective condition.#'

Value

a matrix indicating either the upper or lower end of a parameter space. There will be as many rows as conds implies. The number of columns depend on input (matching its length if it is a vector, or matching the length of the entry "default_values" if it is a list). If input is NULL, then NULL is returned.

Create "Temperatures" for TIDE

Description

Create "Temperatures" for TIDE

Usage

create_temperatures(n_chains, sampler)

Arguments

n_chains

numeric

sampler

"TIDE" or anything else

Value

a numeric vector of length equal to n_chains. The returned values correspond to quantiles of a Beta(0.3, 1) distribution for sampler == "TIDE". Otherwise a numeric vector of 1s is returned.

Perform Crossover Between Chains

Description

This function dispatches to either full_crossover() or migration_crossover() depending on the which argument.

Usage

crossover(which, ...)

Arguments

which

character string, Either "diff" or "migration".

...

Further arguments passed to the underlying crossover function.

Value

A list with the following components:

new_prms_across_chains: The updated parameter matrix of shape ⁠p × n⁠.
new_pis_across_chains: The updated vector of log-posterior values.
new_log_likes_across_chains: The updated vector of log-likelihood values.

Default Prior for Group-Level (Hyper) Parameters

Description

These functions define and evaluate a default prior distribution for hyperparameters at the group level.

Usage

d_default_prior_hyper(x, mean, sd, lower, upper, shape, rate, log)

r_default_prior_hyper(n, mean, sd, lower, upper, shape, rate)

Arguments

x

a numeric vector of length 2 or a matrix with 2 rows and N column. Here, x[1] or x[1,] are interpreted as the group mean(s) and x[2] or x[2,] as the group standard deviation(s).

mean, sd

mean and standard deviation of the truncated normal distribution for the group-level mean. (recycled if necessary)

lower, upper

lower and upper bounds for the truncated normal distribution. (recycled if necessary)

shape, rate

shape and rate parameters of the gamma distribution for the group-level standard deviation. (recycled if necessary)

log

logical; if TRUE, the log-density is returned.

n

number of samples to generate.

Details

d_default_prior_hyper computes the (log) density of a prior for a two-element vector or a 2xN matrix, containing the mean and standard deviation (i.e., phi_j). The mean is modeled with a truncated normal distribution, and the standard deviation with a gamma distribution.

r_default_prior_hyper samples hyperparameter values from this prior.

the arguments mean, sd, lower, upper, shape, and rate are recycled if necessary with respect to the columns of x. For example, if x has two columns, then mean might provide two values.

Value

For d_default_prior_hyper, a numeric vector representing the (log) prior density value(s), with the simplifying assumption of independence of the mean and standard deviation.

For r_default_prior_hyper, a 2-row matrix with n columns. The first row contains sampled group means; the second row contains sampled standard deviations. Samples are drawn independently. If n is 1, then a named numeric vector is returned.

Optional Arguments for the Component Functions

Description

Functions to get or set the optional, user-defined R objects attached to a model object.

Usage

ddm_opts(object, ...) <- value

## S3 replacement method for class 'drift_dm'
ddm_opts(object, ..., eval_model = FALSE) <- value

ddm_opts(object, ...)

## S3 method for class 'drift_dm'
ddm_opts(object, ...)

## S3 method for class 'fits_agg_dm'
ddm_opts(object, ...)

Arguments

object

an object of type drift_dm or fits_agg_dm (see estimate_dm()).

...

additional arguments passed down to the specific method.

value

an arbitrary R object.

eval_model

logical, indicating if the model should be re-evaluated or not after attaching the arbitrary R object to the model (see re_evaluate_model). Default is FALSE.

Details

When deriving model predictions, the model's component functions (see comp_funs()) are evaluated and the returned values are passed forward to dedicated numerical methods implemented in dRiftDM. To allow users to access arbitrary R objects within their custom component functions, models may contain a ddm_opts entry (see also drift_dm() and the end of vignette("customize_ddms", "dRiftDM") for an example).

ddm_opts() is a generic accessor function, and ⁠ddm_opts<-()⁠ is a generic replacement function. The default methods get and set the optional R object.

Value

For ddm_opts() the optional R object that was once supplied by the user, or NULL.

For ⁠ddm_opts<-()⁠ the updated drift_dm object.

Examples

# get a pre-built model for demonstration
a_model <- ratcliff_dm()
ddm_opts(a_model) <- "Hello World"
ddm_opts(a_model)

Default Values for the dRiftDM Package

Description

These functions provide default values for various settings in the dRiftDM package.

Usage

drift_dm_approx_error()

drift_dm_medium_approx_error()

drift_dm_small_approx_error()

drift_dm_rough_approx_error()

drift_dm_robust_prm()

drift_dm_default_rounding()

drift_dm_default_probs()

drift_dm_default_n_bins()

drift_dm_default_b_coding()

drift_dm_skip_if_contr_low()

drift_dm_n_id_trunc_warn()

Details

drift_dm_approx_error(): Returns the default approximation error for precise calculations (1e-20).
drift_dm_medium_approx_error(): Returns a 'medium' level of approximation error (1e-06).
drift_dm_small_approx_error(): Returns a 'small' level of approximation error (.01).
drift_dm_rough_approx_error(): Returns a rough level of approximation error (.1).
drift_dm_robust_prm(): Returns a value that is added to the PDFs after convolution with the non-decision time to make parameter estimation and the evaluation of the log-likelihood more robust (1e-8).
drift_dm_default_rounding(): Returns the default rounding precision for numerical outputs (3).
drift_dm_default_probs(): Returns the default sequence of probabilities for quantiles (0.1, 0.2, ..., 0.9)
drift_dm_default_n_bins(): Returns the default number of bins for a CAF (5)
drift_dm_default_b_coding(): Returns the default boundary coding (list(column = "Error", u_name_value = c("corr" = 0), l_name_value = c("err" = 1))
drift_dm_skip_if_contr_low(): returns the value 0.0001. If a PDF integrates to a value lower than that (i.e., if there is almost no contribution of a PDF; most likely this will be pdf_l), then summary functions returned by calc_stats() might contain the value NA for the respective PDF.
drift_dm_n_id_trunc_warn(): returns 15. If there are warnings relevant to multiple participants, the printed IDs are truncated at 15.

Value

the respective values/lists as described in the Details section

Create the Diffusion Model for Conflict Tasks

Description

This function creates a drift_dm object that corresponds to the Diffusion Model for Conflict Tasks by Ulrich et al. (2015).

Usage

dmc_dm(
  var_non_dec = TRUE,
  var_start = TRUE,
  instr = NULL,
  obs_data = NULL,
  sigma = 1,
  t_max = 3,
  dt = 0.0075,
  dx = 0.02,
  b_coding = NULL
)

Arguments

var_non_dec, var_start

logical, indicating whether the model should have a normally-distributed non-decision time or beta-shaped starting point distribution, respectively. (see nt_truncated_normal and x_beta in component_shelf). Defaults are TRUE. If FALSE, a constant non-decision time and starting point is set (see nt_constant and x_dirac_0 in component_shelf).

instr

optional string with "instructions", see modify_flex_prms().

obs_data

data.frame, an optional data.frame with the observed data. See obs_data.

sigma, t_max, dt, dx

numeric, providing the settings for the diffusion constant and discretization (see drift_dm)

b_coding

list, an optional list with the boundary encoding (see b_coding)

Details

The Diffusion Model for Conflict Tasks is a model for describing conflict tasks like the Stroop, Simon, or flanker task.

It has the following properties (see component_shelf):

a constant boundary (parameter b)
an evidence accumulation process that results from the sum of two subprocesses:
- a controlled process with drift rate muc
- a gamma-shaped process with a scale parameter tau, a shape parameter a, and an amplitude A.

If var_non_dec = TRUE, a (truncated) normally distributed non-decision with mean non_dec and standard deviation sd_non_dec is assumed. If var_start = TRUE, a beta-shaped starting point distribution is assumed with shape and scale parameter alpha.

If var_non_dec = TRUE, a constant non-decision time at non_dec is set. If var_start = FALSE, a starting point centered between the boundaries is assumed (i.e., a dirac delta over 0).

Per default the shape parameter a is set to 2 and not allowed to vary. This is because the derivative of the scaled gamma-distribution function does not exist at t = 0 for a < 2. Currently, we recommend keeping a fixed to 2. If users decide to set a != 2, then a small value of tol = 0.001 (default) is added to the time vector t_vec before calculating the derivative of the scaled gamma-distribution as originally introduced by Ulrich et al. (2015). Users can control this value by passing a value via ddm_opts() (see the example below). Note, however, that varying a can lead to large numerical inaccuracies if a gets smaller.

The model assumes the amplitude A to be negative for incompatible trials. Also, the model contains the custom parameter peak_l, containing the peak latency ((a-2)*tau).

Value

An object of type drift_dm (parent class) and dmc_dm (child class), created by the function drift_dm().

Note

The scaling of the parameters in dRiftDM is different to Ulrich et al. (2015). This is because dRiftDM works in seconds and with a diffusion constant of 1, while the original DMC parameterization is in milliseconds and with a diffusion constant of 4. We describe how to convert the parameters on our website.

References

Ulrich R, Schröter H, Leuthold H, Birngruber T (2015). “Automatic and controlled stimulus processing in conflict tasks: Superimposed diffusion processes and delta functions.” Cognitive Psychology, 78, 148–174. doi:10.1016/j.cogpsych.2015.02.005.

Examples

# the model with default settings
my_model <- dmc_dm()

# the model with no variability in the starting point and a finer
# discretization
my_model <- dmc_dm(var_start = FALSE, dt = .005, dx = .01)

# we don't recommend this, but if you really want a != 2, just do...
# (see the Details for more warnings/information about this)
my_model <- dmc_dm(instr = "a ~!")
coef(my_model)["a"] <- 1.9
# -> if you want to control the small value that is added to t_vec when
# calculating the drift rate for a != 2, just use ...
ddm_opts(my_model) <- 0.0001 # ==> t_vec + 0.0001
ddm_opts(my_model) <- NULL # default ==> t_vec + 0.001

A synthetic data set with two conditions

Description

This dataset was simulated by using the Diffusion Model for Conflict tasks (see dmc_dm()) with parameter settings that are typical for a Simon task.

Usage

dmc_synth_data

Format

A data frame with 600 rows and 3 columns:

RT: Response Times
Error: Error Coding (Error Response = 1; Correct Response = 0)
Cond: Condition ('comp' and 'incomp')

Temporarily suppress new stats generation during resampling

Description

Internal helper that sets the skip_new_stats_dm and skip_validate_stats_dm options to TRUE before evaluating an expression, and resets them to NULL afterward (see also stats.options()). Intended to prevent the (unncessary) creation/checking of stats_dm objects during resampling.

Usage

do_resampling(x)

Arguments

x

An expression to evaluate.

Value

The result of evaluated x.

Draw Samples Using Inverse Transform Sampling

Description

draw_from_pdf generates samples from a given probability density function (PDF) using inverse transform sampling. This function takes in a numeric PDF vector and a corresponding domain vector, then returns a specified number of samples.

Usage

draw_from_pdf(a_pdf, x_def, k, seed = NULL, round_to = NULL, method = "discr")

Arguments

a_pdf

a numeric vector representing the PDF values.

x_def

a numeric vector defining the domain (or x-values) corresponding to the values in a_pdf. The vector x_def must be sorted in increasing order.

k

a single integer specifying the number of samples to generate.

seed

an optional single integer value used to set the seed for random number generation, allowing for reproducibility of results.

round_to

an optional integer, indicating the number of digits to which the result should be rounded.

method

a single character string. If "discr", then simulated values match x_def. If "linear", x_def and a_pdf are linearly interpolated, so that the simulated values can lay in between the discrete values of x_def.

Details

This function implements inverse transform sampling by first constructing a cumulative distribution function (CDF) from the given PDF. Then k values between zero and one are sampled from a uniform distribution, and the corresponding values are mapped to x_def using linear interpolation.

Value

A numeric vector of length k containing the sampled values from the specified PDF. If k is 0, an empty numeric vector is returned.

Create a drift_dm object

Description

This function creates an object of type drift_dm, which serves as the parent class for all further created drift diffusion models (all of which have a child class label, e.g., dmc_dm). The objects created by drift_dm() are the backbone of the dRiftDM package. For a list of all pre-built models, see vignette("dRiftDM", "dRiftDM").

Usage

drift_dm(
  prms_model,
  conds,
  subclass,
  instr = NULL,
  obs_data = NULL,
  sigma = 1,
  t_max = 3,
  dt = 0.001,
  dx = 0.001,
  solver = "kfe",
  cost_function = "neg_log_like",
  mu_fun = NULL,
  mu_int_fun = NULL,
  x_fun = NULL,
  b_fun = NULL,
  dt_b_fun = NULL,
  nt_fun = NULL,
  b_coding = NULL
)

## S3 method for class 'drift_dm'
print(x, ..., round_digits = drift_dm_default_rounding())

Arguments

prms_model

a named numeric vector of the model parameters. The names indicate the model's parameters, and the numeric entries provide the current parameter values.

conds

a character vector, giving the names of the model's conditions. values within conds will be used when addressing the data and when deriving the model's predictions.

subclass

a character string, with a name for the newly created diffusion model (e.g., my_dmc_dm). This will be the child class.

instr

an optional character string, providing "instructions" for the underlying flex_prms object.

obs_data

an optional data.frame, providing a data set (see obs_data() for more information).

sigma

the diffusion constant. Default is 1.

t_max

the maximum of the time space. Default is set 3 (seconds).

dt, dx

the step size of the time and evidence space discretization, respectively. Default is set to .001 (which refers to seconds for dt). Note that these values are set conservatively per default. In many cases, users can increase the discretization.

solver

a character string, specifying which approach to use for deriving the first passage time. Options are kfe or im_zero. Default is kfe, which provides access to the numerical discretization of the Kolmogorov Forward Equation.

cost_function

a character string, specifying the cost function used during estimation. Options are neg_log_like (negative log-likelihood), rmse (root-mean-squared error). Default is neg_log_like.

mu_fun, mu_int_fun, x_fun, b_fun, dt_b_fun, nt_fun

Optional custom functions defining the components of a diffusion model. See comp_funs(). If an argument is NULL, dRiftDM falls back to the respective default functions, which are documented in comp_funs().

b_coding

an optional list, specifying how boundaries are coded. See b_coding(). Default refers to accuracy coding.

x

an object of type drift_dm

...

additional parameters

round_digits

integer, controls the number of digits shown for print.drift_dm(). Default is 3.

Details

To modify the entries of a model users can use the replacement methods and the modify_flex_prms() method (see also vignette("dRiftDM", "dRiftDM") and vignette("customize_ddms", "dRiftDM")).

Value

For drift_dm(), a list with the parent class label "drift_dm" and the child class label ⁠<subclass>⁠. The list contains the following entries:

An instance of the class flex_prms for controlling the model parameters. Provides information about the number of parameters, conditions etc.
Parameters used for deriving the model predictions, prms_solve, containing the diffusion constant (sigma), the maximum of the time space (t_max), the evidence and space discretization (dt and dx, respectively), and the resulting number of steps for the time and evidence space discretization (nt and nx, respectively).
A character string solver, indicating the method for deriving the model predictions.
A character string cost_function, indicating the cost function used for model estimation.
A list of functions called comp_funs, providing the components of the diffusion model (i.e., mu_fun, mu_int_fun, x_fun, b_fun, dt_b_fun, nt_fun). These functions are called in the depths of the package and will determine the behavior of the model

If (optional) observed data were passed via obs_data(), the list will contain an entry obs_data. This is a (nested) list with stored response times for the upper and lower boundary and with respect to each condition. If the cost function is a summary statistic requiring quantiles, CAFs, etc., the model also contains the entries stats_agg and stats_agg_info. The former is a (nested) list with descriptive statistics. The latter contains information about the descriptive statistics (e.g., the quantile levels).

If the model has been evaluated (see re_evaluate_model()), the list will contain...

... the cost value; can be addressed via cost_value().
... the PDFs of the first passage time; can be addressed via pdfs().

If the model was estimated (which includes its evaluation), the list will contain estimate_info. This entry contains a convergence flag (conv_flag, logical) and the optimizer (a string).

Finally, if arbitrary R objects were passed via ddm_opts() (to access these objects when evaluating the component functions) the list will contain an entry ddm_opts.

Every model also has the attribute b_coding, which summarizes how the boundaries are labeled.

For print.drift_dm(), the supplied drift_dm object x (invisible return).

Examples

# Plain call, with default component functions -----------------------------
# create parameter and condition vectors
prms <- c(muc = 4, b = 0.5)
conds <- c("one", "two")

# then call the backbone function (note that we don't provide any component
# functions, so dRiftDM uses the default functions as documented in
# comp_funs())
my_model <- drift_dm(prms_model = prms, conds = conds, subclass = "example")
print(my_model)

Available Cost Functions for Model Estimation

Description

Returns the names of implemented cost functions.

Usage

drift_dm_cost_functions()

Value

A character vector of cost function names.

Pre-built Drift Diffusion Models

Description

Returns the names of available pre-built DDMs in dRiftDM.

Usage

drift_dm_pre_built_models()

Value

A character vector of model names.

Available types of statistics

Description

Internal helper to return supported statistic types depending on the context (e.g., for observed data.frames or fitted model objects).

Usage

drift_dm_stats_types(context = NULL)

Arguments

context

a character string, indicating the context. If NULL, all available types are returned.

Value

a character vector of valid statistic types for the given context.

Maybe droplevels of ID column

Description

This function takes a data frame with an ID colmumn, and drops the unused levels from the ID column if it is factor; in this case a warning is thrown

Usage

drop_levels_ID_column(some_data)

Arguments

some_data

a data.frame with an ID column

Value

if the ID column is not of type factor, then the unmodified object is returned.

if the ID column is of type factor, droplevels is applied, and if levels were dropped, a warning is thrown

Derivative of a Constant Boundary

Description

Derivative of a Constant Boundary

Usage

dt_b_constant(prms_model, prms_solve, t_vec, one_cond, ddm_opts)

Arguments

prms_model

the model parameters, no prm label required

prms_solve

solver settings

t_vec

time space

one_cond

one condition

ddm_opts

optional arguments attached to an object

Value

a vector of the same length as t_vec only zeros.

Truncated Normal Density Function

Description

Computes the probability density function for the truncated normal distribution. This version supports both vector and matrix input for x.

Usage

dtnorm(x, mean = 0, sd = 1, lower = -Inf, upper = Inf, log = FALSE)

rtnorm(n, mean = 0, sd = 1, lower = -Inf, upper = Inf)

Arguments

x

A numeric vector or matrix of values where the density should be evaluated.

mean

Mean of the normal distribution. Can be a scalar or vector (recycled if necessary).

sd

Standard deviation of the normal distribution. Can be a scalar or vector (recycled if necessary).

lower

Lower truncation bound. Can be a scalar or vector (recycled if necessary). Default is -Inf.

upper

Upper truncation bound. Can be a scalar or vector (recycled if necessary). Default is Inf.

log

Logical; if TRUE, probabilities p are given as log(p). Default is FALSE.

Details

The function evaluates the normal density at x and scales it to reflect truncation to the interval (lower, upper). Values outside the truncation bounds are assigned a density of 0 (or -Inf on the log scale). Internally, stats::dnorm and stats::pnorm are used.

If x is a matrix, the result retains the same dimensions. All other arguments are recycled as needed. For example, if x has two rows and 5 columns, then mean might provide 2 values, so that the first/second row is evaluated under the first/second mean value.

Value

A numeric vector or matrix of the same shape as x, containing the (possibly log) densities.

(Hierarchical) Bayesian Estimation with Differential Evolution

Description

Estimate group-level and individual-level parameters with a hierarchical Bayesian approach using Differential Evolution MCMC (DE-MCMC) Turner et al. (2013). An approximation of the marginal likelihood to calculate Bayes Factors can be obtained with the Thermodynamic Integration via Differential Evolution (TIDE) sampler Evans and Annis (2019).

Usage

estimate_bayes_h(
  drift_dm_obj,
  obs_data_ids,
  sampler,
  n_chains,
  burn_in,
  samples,
  n_cores,
  prob_migration,
  prob_re_eval,
  verbose,
  seed = NULL,
  ...
)

estimate_bayes_one_subj(
  drift_dm_obj,
  sampler,
  n_chains,
  burn_in,
  samples,
  prob_migration,
  prob_re_eval,
  verbose,
  ...
)

Arguments

drift_dm_obj

an object of type drift_dm.

obs_data_ids

data.frame for the hierarchical case. An additional column ID is necessary that codes the individuals (see also obs_data).

sampler

character string, indicating the sampler to use. Must be either "DE-MCMC" (default) or "TIDE".

n_chains

numeric, number of chains for the MCMC-sampler. Default is 40.

burn_in

numeric, number of burn-in iterations. Default is 500.

samples

numeric, number of sampling iterations after burn-in. Default is 2000.

n_cores

numeric, number of threads to use for parallel processing in the hierarchical case. Default is 1.

prob_migration

numeric, probability of performing a migration crossover step during burn-in. Default is 0.1 (i.e., 10%).

prob_re_eval

numeric, probability of re-evaluating the likelihood/posterior values of the previous iteration i-1 when deciding for the acceptance of the proposal in iteration i. Only considered during burn-in. Default is 0.1 (i.e., 10%).

verbose

integer, indicating verbosity of output: 0 (none), 1 (minimal text output), or 2 (text output and progress bar). Default is 2.

seed

optional random seed for reproducibility.

...

additional arguments passed to get_default_prior_settings to customize prior settings.

Details

The function estimate_bayes_h() handles the hierarchical case. The function estimate_bayes_one_subj() handles the case for estimating a single individual. The reason for writing two functions is that the hierarchical case has some unique tweaks to it that need to be considered ... and writing one function would be quite the mess.

Prior Settings: See the wrapper estimate_bayesian() and also get_default_prior_settings()

Value

A named ist containing posterior samples for group-level and individual-level parameters, log-posterior values, and log-likelihoods. Labels: phi, pis_phi, lls_phi, theta, pis_theta, lls_theta. The first three entries are only present in the hierarchical case.

The list also has an additional attribute named data_model. In the hierarchical case, the attribute contains a named list of model copies with all the individual data sets attached. The list is named according to the individual IDs in the argument obs_data_ids. In the non-hierarchical case, the attribute contains the model and its attached data.

References

Turner BM, Sederberg PB, Brown SD, Steyvers M (2013). “A method for efficiently sampling from distributions with correlated dimensions.” Psychological Methods, 18(3), 368–384. doi:10.1037/a0032222. Evans NJ, Annis J (2019). “Thermodynamic integration via differential evolution: A method for estimating marginal likelihoods.” Behavior Research Methods, 51, 930–947. doi:10.3758/s13428-018-1172-y.

(Hierarchical) Bayesian Estimation

Description

This function provides a wrapper around the implemented samplers for Bayesian inference in dRiftDM. For parameter estimation, Differential Evolution Markov-Chain Monte-Carlo (DE-MCMC) Turner et al. (2013) is used. An approximation of the marginal likelihood to calculate Bayes Factors can be obtained with the Thermodynamic Integration via Differential Evolution (TIDE) sampler Evans and Annis (2019). However, TIDE is not yet supported fully, and is at an experimental stage.

Usage

estimate_bayesian(
  drift_dm_obj,
  obs_data_ids = NULL,
  sampler,
  n_chains,
  burn_in,
  samples,
  prob_migration,
  prob_re_eval,
  verbose = NULL,
  ...
)

Arguments

drift_dm_obj

an object of type drift_dm.

obs_data_ids

data.frame for the hierarchical case. An additional column ID is necessary that codes the individuals (see also obs_data).

sampler

character string, indicating the sampler to use. Must be either "DE-MCMC" (default) or "TIDE".

n_chains

numeric, number of chains for the MCMC-sampler. Default is 40.

burn_in

numeric, number of burn-in iterations. Default is 500.

samples

numeric, number of sampling iterations after burn-in. Default is 2000.

prob_migration

numeric, probability of performing a migration crossover step during burn-in. Default is 0.1 (i.e., 10%).

prob_re_eval

verbose

integer, indicating verbosity of output: 0 (none), 1 (minimal text output), or 2 (text output and progress bar). Default is 2.

...

additional arguments passed forward to estimate_bayes_h() and estimate_bayes_one_subj().

Details

When a data.frame is supplied, a hierarchical approach to parameter estimation is done. In this case, the supplied data set must provide data for multiple individuals. To estimate the parameters for a single individual (i.e., pursue the non-hierarchical approach), then the supplied model drift_dm_obj must have data attached to it (see obs_data()).

Value

an object of type mcmc_dm containing posterior samples for parameters, log-posterior values, and log-likelihoods. In the hierarchical case, the respective values are available at both the group-level and the individual-level. The object contains two attributes: sampler and data_model. The former simply stores the type of sampler that was used and codes whether estimation was done in a hierarchical fashion or not. The latter either contains the model and the attached data (in the non-hierarchical case) or a named list of model copies with each individual's data attached.

References

Estimate Parameters of a `drift_dm` Model via Classical Optimization

Description

estimate_classical() estimates the parameters of a drift_dm model by minimizing the model's cost function (e.g., RMSE or negative log-likelihood) using classical (non-Bayesian) optimization routines.

Available optimizers include:

Nelder-Mead (bounded or unbounded): "Nelder-Mead", "nmkb" (via stats::optim() and dfoptim::nmkb(), respectively)
BFGS and L-BFGS-B (via stats::optim())
Differential Evolution (via DEoptim::DEoptim())

Usage

estimate_classical(
  drift_dm_obj,
  optimizer,
  start_vals = NULL,
  return_runs = NULL,
  lower = NULL,
  upper = NULL,
  verbose = NULL,
  de_n_cores = 1,
  control = list(),
  round_digits = NULL,
  seed = NULL,
  use_ez = NULL,
  n_lhs = NULL
)

Arguments

drift_dm_obj

an object inheriting from drift_dm.

optimizer

a character string specifying the optimizer to use. Must be one of "Nelder-Mead", "nmkb", "BFGS", "L-BFGS-B", or "DEoptim".

start_vals

a set of starting values. Must be compatible with get_parameters_smart(). If start_vals is not NULL, the function tries to set the provided parameter values to the model, using those values as starting values for the optimization routine. Special case: If start_vals is a data.frame, the function is recursively called for each row of start_vals, providing a handy way to run an optimization routine with different starting values. Default is NULL, which implies that the current model parameters are used as starting values.

return_runs

a single logical. Only relevant when start_vals is a data.frame and the optimization routine is called multiple times with different starting values. If FALSE, the best-fitting model is returned. If TRUE, a list is returned, containing the best-fitting model, all cost values across runs, and all estimated model parameters across runs.

lower, upper

bounds on the parameters to be estimated. Can be numeric vectors, named vectors, or flexible lists (see Details).

verbose

an integer (0, 1, or 2). Controls the amount of printed output.

0 = silent
1 = starting/exiting messages
2 = all parameters and the cost value per iteration

de_n_cores

an integer > 0. Number of CPU cores to use for DEoptim.

control

a named list of control parameters passed to the chosen optimizer.

round_digits

an integer. Number of digits to round cost values in printed output. If NULL, defaults to drift_dm_default_rounding().

seed

a seed, to make the results of DEoptim reproducible.

Details

Search space specification

lower and upper can be specified flexibly:

As unnamed numeric vectors (not recommended unless you're sure of the parameter order)
As named numeric vectors matching the parameter names of the model
As lists with a default_values entry (plus optional condition-specific entries)

This design mirrors the structure used in simulate_data.drift_dm().

Optimization details

Some optimizers (i.e., "nmkb", "L-BFGS-B", "DEoptim") require both lower and upper bounds.

Differential Evolution (DEoptim) supports parallelization across cores via de_n_cores. If de_n_cores > 1, a parallel cluster is created and automatically closed after optimization.

The cost function being minimized depends on the cost_function() of the model.

During optimization, failed model evaluations yield a very high cost value (i.e., .Machine$double.xmax). In some cases, this ensures that the optimization doesn't crash, though, this is not guaranteed.

Value

The updated drift_dm_obj, with optimized parameters.

Internal wrapper for classical estimation of individuals

Description

This function wraps estimate_classical() to handle fitting multiple individuals in a consistent way. It prepares the data, distributes the estimation across individuals, and manages parallelization strategies and progress reporting. Unlike the deprecated estimate_model_ids(), this function no longer saves results to disk — instead, it directly returns an object of class fits_ids_dm.

Usage

estimate_classical_wrapper(
  drift_dm_obj,
  obs_data_ids,
  parallelization_strategy = NULL,
  progress = NULL,
  start_vals = NULL,
  optimizer,
  n_cores = NULL,
  seed = NULL,
  ...
)

Arguments

drift_dm_obj

a drift_dm object that will be estimated for each individual in obs_data_ids.

obs_data_ids

a data.frame of observed data including an ID column that uniquely identifies each individual.

parallelization_strategy

an integer, either 1 or 2. Strategy 1 parallelizes across individuals, while strategy 2 parallelizes within individuals (only supported for "DEoptim"). Default is 1

progress

an integer, controlling progress output. 0 = no progress, 1 = minimal output, 2 = progress bar. Default is 1

start_vals

an optional data.frame with starting values for each individual. Must contain an ID column matching the IDs in obs_data_ids, and one column per parameter.

seed

an optional seed to make the results reproducible

...

further arguments passed to estimate_classical(), including lower, upper, verbose, control, round_digits. Note that the argument return_runs is not supported.

Details

Convergence issues are checked automatically. If one or more individuals fail to converge, a warning is issued with the corresponding IDs and messages returned by the optimizer.

Value

an object of class fits_ids_dm, which is a list with two components:

drift_dm_fit_info — a list containing the model object, observed data, optimizer information, and convergence messages
all_fits — a list of individual estimation results

Fit a DDM to Observed Data

Description

estimate_dm() is the main function to fit a drift diffusion model (DDM) in dRiftDM. Several ways of fitting a model are supported: fitting a single participant, fitting multiple participants separately or aggregated, and fitting a (hierarchical) Bayesian model. The particular way is controlled via the approach argument.

Usage

estimate_dm(
  drift_dm_obj,
  obs_data = NULL,
  approach = NULL,
  optimizer = NULL,
  control = list(),
  n_cores = 1,
  parallelization_strategy = NULL,
  lower = NULL,
  upper = NULL,
  start_vals = NULL,
  means = NULL,
  sds = NULL,
  shapes = NULL,
  rates = NULL,
  n_chains = 40,
  burn_in = 500,
  samples = 1000,
  prob_migration = 0.1,
  prob_re_eval = 1,
  messaging = TRUE,
  seed = NULL,
  ...
)

## S3 method for class 'fits_agg_dm'
print(x, ...)

## S3 method for class 'fits_ids_dm'
print(x, ...)

## S3 method for class 'mcmc_dm'
print(x, ..., round_digits = drift_dm_default_rounding())

Arguments

drift_dm_obj

a drift_dm object containing the model to be fitted.

obs_data

an optional data.frame (see also obs_data). If no ID column is present, a single-individual setup is assumed. If an ID column is present, the model is fitted separately for each individual.

approach

an optional character string, specifying the approach to fitting the model. Options are "sep_c", "agg_c", "sep_b", "hier_b" (see the Details).

optimizer

a character string. For classical optimization, one of "nmkb", "Nelder-Mead", "BFGS", "L-BFGS-B", "DEoptim". For the Bayesian framework, only "DE-MCMC" is currently supported. If NULL and if a classical optimization approach is used, defaults to "DEoptim" or "Nelder-Mead", depending on whether lower/upper are provided or not. If NULL and if a Bayesian framework is used, defaults to ⁠"DE-MCMC⁠. Note that "BFGS" and "L-BFGS-B" are often unstable.

control

a list of control parameters passed to the optimizer (for Nelder-Mead, BFGS, and L-BFGS-B, see stats::optim; for nmkb, see dfoptim::nmkb; for DEoptim, see DEoptim::DEoptim). Per default, we set the trace control argument for DEoptim::DEoptim to FALSE. Also, we set the parscale control argument for "Nelder-Mead" via stats::optim to pmax(x0, 1e-6).

n_cores

an integer > 0, indicating the number of CPU cores/threads to use (at the moment, this doesn't have an effect when fitting a single individual within the Bayesian framework).

parallelization_strategy

an integer, controlling how parallelization is performed when fitting multiple individuals with the classical approach. If 1, parallelization is across individuals. If 2, parallelization is within individuals (currently only supported for "DEoptim"). Defaults to 1.

lower, upper

numeric vectors or lists, specifying the lower and upper bounds on each parameter to be optimized (see Details).

start_vals

optional starting values for classical single-subject fits and when using an optimizer that requires a starting value. Can be a numeric vector of model parameters when fitting a single individual, or a data.frame with columns for each model parameter. In the latter case, enables multi-start (one row per start). For 'approach = "separately"', a data.frame with an ID column is required.

means, sds, shapes, rates

optional numeric vectors for prior specification (when using the Bayesian framework, see Details).

n_chains

an integer, providing the number of MCMC chains (Bayesian framework).

burn_in

an integer, number of burn-in iterations (Bayesian framework).

samples

an integer, number of post-burn-in samples per chain ( Bayesian framework).

prob_migration

a numeric in ⁠[0,1]⁠, controlling the migration probability of the DE-MCMC algorithm (Bayesian framework).

prob_re_eval

a numeric in ⁠[0,1]⁠, probability to re-evaluate the model at current group-level parameters during sampling (Bayesian framework; only relevant for the hierarchical case).

messaging

a logical, if TRUE progress/info messages are printed

seed

an optional integer to set the RNG seed for reproducibility.

...

additional arguments forwarded to lower-level routines. Options are: progress/verbose (integers, for controlling progress bars and verbosity of estimation infos), round_digits (for controlling the number of digits for rounding when printing individual model evaluations; if verbose = 2), return_runs (when fitting a single individual and starting the estimation routine with multiple starting points; if TRUE, then a list of all routines is returned), probs/n_bins (the quantile levels and the number of CAF bins when fitting aggregated data using the RMSE cost function), use_ez/n_lhs (logical and integer; the first controls if EZ-Diffusion Parameter Estimates shall be used for determining starting points; the latter controls the number of parameters to sample per dimension for the latin hypercube sampling when searching for starting values)

x

an object of type fits_agg_dm, fits_ids_dm, or mcmc_dm

round_digits

integer, specifying the number of decimal places for rounding in the printed summary. Default is 3.

Details

Fitting Approaches

The function supports different "approaches" to fitting data.

"sep_c": This means that data is always considered separately for each participant (if there are multiple participants) and that a classical approach to parameter optimization is used. This means that a standard cost_function is minimized (e.g., the negative log-likelihood). If users provide only a single participant or a data set without an ID column, then the model is fitted just once to that data set.
"agg_c": This fits the model to aggregated data. For each individual in a data set, summary statistics (e.g., quantiles, accuracies) are calculated, and the model is fitted once to the average of these summary statistics.
"sep_b": Similar to ⁠sep_b"⁠, although a Bayesian approach is used to sample from the posterior distribution.
"hier_b": A hierarchical approach to parameter estimation. In this case all participants are considered simultaneously and samples are drawn both at the individual-level and group-level.

The optimizers "nmkb", "L-BFGS-B", and "DEoptim" (for classical parameter optimization) require the specification of the lower/upper arguments.

Fitting to Aggregated Data

For aggregated fits, aggregated statistics are set to the model and the cost function is switched to "rmse". If incompatible settings are requested, the function switches to a compatible configuration and informs the user with messages (these messages can be suppressed via the messaging argument).

Specifying `lower/upper` for Classical optimization

the function estimate_model_dm() provides a flexible way of specifying the optimization space; this is identical to specifying the parameter simulation space in simulate_data.drift_dm().

Users have three options to specify the search space (see also the examples below):

Plain numeric vectors (not very much recommended). In this case, lower/upper must be sorted in accordance with the parameters in the underlying flex_prms object of drift_dm_obj that vary for at least one condition (call print(drift_dm_obj) and have a look at the columns of the ⁠Parameter Settings⁠ output; for each column that has a number > 0, specify an entry in lower/upper).
Named numeric vectors. In this case lower/upper have to provide labels in accordance with the parameters that are considered "free" at least once across conditions (call coef(drift_dm_obj) and provide one named entry for each parameter; dRiftDM will try to recycle parameter values across conditions).
The most precise way is when lower/upper are lists. In this case, the list requires an entry called "default_values" which specifies the named or plain numeric vectors as above. If the list only contains this entry, then the behavior is as if lower/upper were already numeric vectors. However, the lower/upper lists can also provide entries labeled as specific conditions, which contain named (!) numeric vectors with parameter labels. This will modify the value for the upper/lower parameter space with respect to the specified parameters in the respective condition.

Specifying Priors for Bayesian Estimation

(Default) Prior settings in the non-hierarchical case:

Let \theta^{(j)} indicate parameter j of a model (e.g., the drift rate). The prior on \theta^{(j)} is a truncated normal distribution:

\theta^{(j)} \sim NT(\mu^{(j)}, \sigma^{(j)}, l^{(j)}, u^{(j)})

With \mu^{(j)} and \sigma^{(j)} representing the mean and standard deviation of parameter j. l^{(j)} and u^{(j)} represent the lower and upper boundary. \mu^{(j)} is taken from the mean argument or the currently set model parameters (i.e., from coef(drift_dm_obj)) when calling the function. \sigma^{(j)} is, per default, equal to \mu^{(j)}. This can be changed by passing the sd argument. The lower and upper boundaries of the truncated normal are -Inf and Inf per default. This can be altered by passing the arguments lower and upper (see the examples below).

(Default) Prior settings in the hierarchical case:

Let \theta_i^{(j)} indicate parameter j for participant i (e.g., the drift rate estimated for individual i). The prior on \theta_i^{(j)} is a truncated normal distribution:

\theta_i^{(j)} \sim NT(\mu^{(j)}, \sigma^{(j)}, l^{(j)}, u^{(j)})

With \mu^{(j)} and \sigma^{(j)} representing the mean and standard deviation of parameter j at the group level. l^{(j)} and u^{(j)} represent the lower and upper boundary. The lower and upper boundaries of the truncated normal are -Inf and Inf per default. This can be altered by passing the arguments lower and upper.

For a group-level mean parameter, \mu^{(j)}, the prior is also a truncated normal distributions:

\mu^{(j)} \sim NT(M^{(j)}, SD^{(j)}, l^{(j)}, u^{(j)})

With M^{(j)} specified by the mean argument or the currently set model parameters. SD^{(j)} is, per default, equal to M^{(j)}. This can be changed by passing the sd argument.

For a group-level standard deviation parameter, \sigma^{(j)}, the prior is a gamma distribution:

\sigma^{(j)} \sim \Gamma(shape^{(j)},rate^{(j)})

With shape^{(j)} and rate^{(j)} being 1 by default. This can be changed by passing the arguments shape and rate.

Specifying Prior Settings/Arguments

Argument specification for mean, sd, lower, upper, shape and rate is conceptually identical to specifying lower/upper for the classical optimization approach (see the subsection above and the examples below).

Value

If fitting a single individual: either a drift_dm object with fitted parameters and additional fit information (for the classical optimization framework) or an object of type mcmc_dm (for the Bayesian framework)
If fitting multiple individuals separately: a fits_ids_dm object or a list of mcmc_dm objects, containing all the individual model fits.
If fitting aggregated data: a fits_agg_dm object containing the model itself and the raw data.
If fitting multiple individuals hierarchically: an object of type mcmc_dm.

Note

estimate_dm dispatches to underlying estimation routines that are not exported:

Classical optimization of one individual via estimate_classical()
Classical optimization of multiple individuals via estimate_classical_wrapper()
Bayesian estimation via estimate_bayesian().
Aggregated fitting is handled within estimate_dm() in combination with estimate_classical()

When fitting a model with optimizer = "DEoptim", the corresponding minimization routine always runs for 200 iterations by default, irrespective of whether a minimum has already been reached (see DEoptim::DEoptim.control). Therefore, with default optimization settings, estimate_dm() returns the convergence flag NA for optimizer = "DEoptim", because the termination of the routine does not necessarily indicate convergence. However, this is typically not an issue, as 200 iterations are generally sufficient for the algorithm to find the global minimum. If users explicitly define convergence criteria via the control argument of estimate_dm() (which is passed on to DEoptim::DEoptim.control), valid convergence messages and flags are returned.

Examples

##########
# Note: The following examples were trimmed for speed to ensure they run
# within seconds. They do not always provide realistic settings.
##########

####
# Setup

# get a model for the examples (DMC with just two free parameters)
model <- dmc_dm(
  instr = "
   b <!>
   non_dec <!>
   sd_non_dec <!>
   tau <!>
   alpha <!>
   "
)

# get some data (the first two participants in the data set of Ulrich et al.)
data <- ulrich_flanker_data[ulrich_flanker_data$ID %in% 1:2, ]


####
# Fit a single individual (using unbounded Nelder-Mead)
fit <- estimate_dm(
  drift_dm_obj = model,
  obs_data = data[data$ID == 1, ],
  optimizer = "Nelder-Mead"
)
print(fit)




####
# Fit a single individual (using DEoptim)
l_u <- get_lower_upper(model)
set.seed(2)
fit <- estimate_dm(
  drift_dm_obj = model,
  obs_data = data[data$ID == 1, ],
  optimizer = "DEoptim",
  lower = l_u$lower, upper = l_u$upper,
  control = list(itermax = 5) # use default itermax in practice!
)
print(fit)


####
# Fit multiple individuals (separately; using bounded Nelder-Mead)
l_u <- get_lower_upper(model)
fit <- estimate_dm(
  drift_dm_obj = model,
  obs_data = data, # contains the data for two individuals
  optimizer = "nmkb",
  lower = l_u$lower, upper = l_u$upper,
)
print(fit)
coef(fit)


###
# Fit to aggregated data (using unbounded Nelder-Mead)
fit <- estimate_dm(
  drift_dm_obj = model,
  obs_data = data, # contains data for two individuals
  optimizer = "Nelder-Mead",
  approach = "agg_c"
)
print(fit)
coef(fit)


###
# EXPERIMENTAL
# Fit a single individual (using DE-MCMC; Bayesian; custom priors)
fit <- estimate_dm(
  drift_dm_obj = model,
  obs_data = data[data$ID == 1, ],
  approach = "sep_b",
  burn_in = 1, # higher in practice (e.g., 500)
  samples = 1, # higher in practice (e.g., 1000)
  n_chains = 5, # higher in practice (e.g., 40)
  mean = c(muc = 3, A = 0.9),
  sd = c(muc = 2, A = 0.8),
)
print(fit)
coef(fit)


###
# EXPERIMENTAL
# Fit multiple individuals (using DE-MCMC; hierarchical Bayesian)
fit <- estimate_dm(
  drift_dm_obj = model,
  approach = "hier_b",
  obs_data = data, # contains data for two individuals
  burn_in = 1, # higher in practice (e.g., 500)
  samples = 1, # higher in practice (e.g., 1000)
  n_chains = 5, # higher in practice (e.g., 40)
  n_cores = 1, # higher in practice (depending on your machine and data set)
)
print(fit)
coef(fit)

Estimate the Parameters of a drift_dm Model

Description

This function was deprecated in dRiftDM version v.0.3.0, please use the more general estimate_dm() function.

Old documentation: Find the 'best' parameter settings by fitting a drift_dm models' predicted probability density functions (PDFs) to the observed data stored within the respective object. The fitting procedure is done by minimizing the negative log-likelihood of the model.

Users have three options:

Estimate the parameters via Differential Evolution (Default)
Estimate the parameters via (bounded) Nelder-Mead
Use Differential Evolution followed by Nelder-Mead.

See also vignette("dRiftDM", "dRiftDM")

Usage

estimate_model(
  drift_dm_obj,
  lower,
  upper,
  verbose = 0,
  use_de_optim = TRUE,
  use_nmkb = FALSE,
  seed = NULL,
  de_n_cores = 1,
  de_control = list(reltol = 1e-08, steptol = 50, itermax = 200, trace = FALSE),
  nmkb_control = list(tol = 1e-06)
)

Arguments

drift_dm_obj

an object inheriting from drift_dm

lower, upper

numeric vectors or lists, specifying the lower and upper bounds on each parameter to be optimized (see Details).

verbose

numeric, indicating the amount of information displayed. If 0, no information is displayed (default). If 1, basic information about the start of Differential Evolution or Nelder-Mead and the final estimation result is given. If 2, each evaluation of the log-likelihood function is shown. Note that verbose is independent of the information displayed by DEoptim::DEoptim.

use_de_optim

logical, indicating whether Differential Evolution via DEoptim::DEoptim should be used. Default is TRUE

use_nmkb

logical, indicating whether Nelder-Mead via dfoptim::nmkb should be used. Default is FALSE.

seed

a single numeric, providing a seed for the Differential Evolution algorithm

de_n_cores

a single numeric, indicating the number of cores to use. Run parallel::detectCores() to see how many cores are available on your machine. Note that it is generally not recommended to use all of your cores as this will drastically slow down your machine for any additional task.

de_control, nmkb_control

lists of additional control parameters passed to DEoptim::DEoptim and dfoptim::nmkb.

Details

Specifying lower/upper

the function estimate_model provides a flexible way of specifying the search space; identical to specifying the parameter simulation space in simulate_data.drift_dm.

Users have three options to specify the simulation space:

Plain numeric vectors (not very much recommended). In this case, lower/upper must be sorted in accordance with the parameters in the flex_prms_obj object that vary for at least one condition (call print(drift_dm_obj) and have a look at the ⁠Parameter Settings⁠ output)
Named numeric vectors. In this case lower/upper have to provide labels in accordance with the parameters that are considered "free" at least once across conditions.
The most flexible way is when lower/upper are lists. In this case, the list requires an entry called "default_values" which specifies the named or plain numeric vectors as above. If the list only contains this entry, then the behavior is as if lower/upper were already numeric vectors. However, the lower/upper lists can also provide entries labeled as specific conditions, which contain named (!) numeric vectors with parameter labels. This will modify the value for the upper/lower parameter space with respect to the specified parameters in the respective condition.

Details on Nelder-Mead and Differential Evolution

If both use_de_optim and use_nmkb are TRUE, then Nelder-Mead follows Differential Evolution. Note that Nelder-Mead requires a set of starting parameters for which either the parameter values of drift_dm_obj or the estimated parameter values by Differential Evolution are used.

Default settings will lead DEoptim::DEoptim to stop if the algorithm is unable to reduce the negative log-likelihood by a factor of reltol * (abs(val) + reltol)after steptol = 50 steps, with reltol = 1e-8 (or if the default itermax of 200 steps is reached). Similarly, dfoptim::nmkb will stop if the absolute difference of the log-likelihood between successive iterations is below tol = 1e-6.See DEoptim::DEoptim.control and the details of dfoptim::nmkb for further information.

Value

the updated drift_dm_obj (with the estimated parameter values, log-likelihood, and probability density functions of the first passage time)

Fit Multiple Individuals and Save Results

Description

This function was deprecated in dRiftDM version 0.3.0. Please use the more general estimate_dm() instead. NOTE: dRiftDM now supports multiple ways of estimating a model. To ensure a more consistent function interface, individual fits are no longer saved to disk when fitting multiple participants. Instead, estimate_dm() directly returns an object of type fits_ids_dm, which users can save manually if desired.

Old documentation: Provides a wrapper around estimate_model to fit multiple individuals. Each individual will be stored in a folder. This folder will also contain a file drift_dm_fit_info.rds, containing the main arguments of the function call. One call to this function is considered a "fit procedure". Fit procedures can be loaded via load_fits_ids.

Usage

estimate_model_ids(
  drift_dm_obj,
  obs_data_ids,
  lower,
  upper,
  fit_procedure_name,
  fit_path,
  fit_dir = "drift_dm_fits",
  folder_name = fit_procedure_name,
  seed = NULL,
  force_refit = FALSE,
  progress = 2,
  start_vals = NULL,
  ...
)

Arguments

drift_dm_obj

an object inheriting from drift_dm that will be estimated for each individual in obs_data_ids.

obs_data_ids

data.frame, see obs_data. An additional column ID necessary, to identify a single individual.

lower, upper

numeric vectors or lists, providing the parameter space, see estimate_model.

fit_procedure_name

character, providing a name of the fitting procedure. This name will be stored in drift_dm_fit_info.rds to identify the fitting procedure, see also load_fits_ids.

fit_path

character, a path, pointing to the location where all fits shall be stored (i.e., fit_dir will be created in this location). From the user perspective, the path will likely be identical to the current working directory.

fit_dir

character, a directory where (multiple) fitting procedures can be stored. If the directory does not exist yet, it will be created via base::create.dir(fit_dir, recursive = TRUE) in the location provided by fit_path. Default is "drift_dm_fits".

folder_name

character, a folder name for storing all the individual model fits. This variable should just state the name, and should not be a path. Per default folder_name is identical to fit_procedure_name.

seed

numeric, a seed to make the fitting procedure reproducable (only relevant for differential evolution, see estimate_model). Default is NULL which means no seed.

force_refit

logical, if TRUE each individual of a fitting routine will be fitted once more. Default is FALSE.

progress

numerical, indicating if and how progress shall be displayed. If 0, no progress is shown. If 1, the currently fitted individual is printed out. If 2, a progressbar is shown. Default is 2.

start_vals

optional data.frame, providing values to be set before calling estimate_model. Can be used to control the starting values for each individual when calling Nelder-Mead. Note that this will only have an effect if DEoptim is not used (i.e., when setting use_de_optim = FALSE; see estimate_model). The data.frame must provide a column ID whose entries match the ID column in obs_data_ids, as well as a column for each parameter of the model matching with coef(drift_dm_obj, select_unique = TRUE).

...

additional arguments passed down to estimate_model.

Details

Examples and more information can also be found in vignette("dRiftDM", "dRiftDM").

When developing the fitting routine we had three levels of files/folders in mind:

In a directory/folder named fit_dir multiple fitting routines can be stored (default is "drift_dm_fits")
Each fitting routine has its own folder with a name as given by folder_name (e.g., "ulrich_flanker", "ulrich_simon", ...)
Within each folder, a file called drift_dm_fit_info.rds contains the main information about the function call. That is, the time when last modifying/calling a fitting routine, the lower and upper parameter boundaries, the drift_dm_object that was fitted to each individual, the original data set obs_data_ids, and the identifier fit_procedure_name. In the same folder each individual has its own ⁠<individual>.rds⁠ file containing the modified drift_dm_object.

Value

nothing (NULL; invisibly)

Specify custom parameters

Description

This function takes a flex_prms_obj and adds or builds the entry cust_prms to allow for custom parameters. An examplary instruction is "peak_l = (a-1)*tau"

Usage

flex_cust_prm(flex_prms_obj, formula_instr)

Arguments

flex_prms_obj

a flex_prms object

formula_instr

a string referring to "custom parameter combination" (see modify_flex_prms)

Details

The entry cust_prms is a list with entries expressions and values. Each of these is again a named list, that either contains the expression with instructions on how to calculate the custom parameter (e.g., "peak_l") or the respective values. Values are getting updated/calculated in update_special_values()

The cust_prms exists of two entries "expressions" and values". "expressions" contains a named list, with expressions referring to prms_matrix (see flex_prms) on how to calculate the custom parameter (across all conditions). The "values" contain a named list, with named numeric vectors (names are conditions, values the calculated custom parameter values)

Value

a modified flex_prms object with respect to the cust_prms entry

Exclude parameters from being modified (i.e., fix it; internal docu)

Description

This function modifies the (linear) internal list and sets the desired parameters (based on the instruction string) to 0. This indicates that this parameter is not altered within the function x2prms_vals (i.e., ' prm <!> conda')

Usage

flex_fix_prms(flex_prms_obj, formula_instr)

Arguments

flex_prms_obj

a flex_prms object

formula_instr

a string referring to "fix" (see modify_flex_prms)

Value

a modified flex_prms_obj with respect to the (linear) internal list

Flex_Prms

Description

Functions for creating, accessing replacing, or printing a flex_prms object. Any object of type flex_prms provides a user-friendly way to specify dependencies, parameter values etc. for a model.

Usage

flex_prms(object, ...) <- value

## S3 replacement method for class 'drift_dm'
flex_prms(object, ..., eval_model = FALSE) <- value

flex_prms(object, ...)

## S3 method for class 'numeric'
flex_prms(object, ..., conds, instr = NULL, messaging = NULL)

## S3 method for class 'flex_prms'
flex_prms(object, ...)

## S3 method for class 'drift_dm'
flex_prms(object, ...)

## S3 method for class 'flex_prms'
print(
  x,
  ...,
  round_digits = drift_dm_default_rounding(),
  dependencies = TRUE,
  cust_parameters = TRUE
)

Arguments

object

an R object (see Details)

...

additional arguments passed on to the specific method.

value

an object of type flex_prms.

eval_model

logical, indicating if the model should be re-evaluated or not when replacing the flex_prms object (see re_evaluate_model).

conds

A character vector, giving the names of the model's conditions. values within conds will be used when addressing the data and when deriving the model's predictions.

instr

optional string with "instructions", see modify_flex_prms().

messaging

optional logical, indicates if messages shall be displayed when processing instr.

x

an object of type flex_prms

round_digits

integer, controls the number of digits shown when printing out a flex_prms object. Default is 3.

dependencies

logical, controlling if a summary of the special dependencies shall be printed.

cust_parameters

logical, controlling if a summary of the custom parameters shall be printed.

Details

Objects of type flex_prms can be modified using the generic modify_flex_prms() function and a corresponding set of "instructions" (see the respective function for more details).

flex_prms() is a generic function. If called with a named numeric vector, then this will create an object of type flex_prms (requires conds to be specified). If called with other data types, gives the respective flex_prms object

⁠flex_prms<-()⁠ is a generic replacement function. Currently this only supports objects of type drift_dm. It will replace/update the model with a new instance of type flex_prms.

Value

The specific value returned depends on which method is called

Creating an object of type `flex_prms`

Can be achieved by calling flex_prms() with a named numeric vector, thus when calling the underlying method flex_prms.numeric (see the example below). In this case a list with the class label "flex_prms" is returned. It containts three entries:

A nested list internal_list. This list specifies the dependencies and restrains enforced upon the parameters across conditions. Integers >= 1 indicate that this parameter will be estimated for a specific condition, and conditions with the same number refer to a single parameter. Integers == 0 indicate thtat this parameter will not be esitmated for a specific condition (i.e., it is considered "fixed"). Expressions will be evaluated at run time and specify special dependencies among parameters.
A nested list linear_internal_list. This list essentially contains the same information as internal_list, but the parameters are sorted so that they can be mapped to an integer vector (relevant only in the depths of the package for the minimization routines).
A numeric matrix prms_matrix which contains the currently set values for each parameter across all conditions. Per default, the values of each parameter are set equal across all conditions. Additionally, each parameter is assumed to be restrained as equal across all conditions. The values for all parameters given a condition will be passed to the component functions (see comp_funs).
(optional) A list of additional parameters cust_prms that are derived from the parameters in prms_matrix.

Accessing an object of type `flex_prms`

Users can access/get the flex_prms object when calling flex_prms() with an object of type drift_dm, fits_ids_dm (see estimate_model_ids()), or flex_prms. In this case, the stored flex_prms object is returned.

Replacing an object of type `flex_prms`

The flex_prms object stored within an object of type drift_dm can be replaced by calling the generic ⁠flex_prms<-⁠ replacement function. In this case, the modified drift_dm object is returned.

Printing an object of type `flex_prms`

The print.flex_prms() method invisibly returns the supplied flex_prms object.

Note

There is only a replacement function for drift_dm objects. This is because replacing the solver settings after the model has been fitted (i.e., for a fits_ids_dm object) doesn't make sense.

Examples

# Create a flex_prms object -----------------------------------------------
conds <- c("one", "two")
prms <- c(muc = 3, b = 0.5)
one_instr <- "muc ~ one + two"
flex_prms_obj <- flex_prms(
  prms,
  conds = conds,
  instr = one_instr
)
print(flex_prms_obj)


# Access a flex_prms object of a model ------------------------------------
my_model <- ratcliff_dm() # the Ratcliff DDM comes with dRiftDM
print(flex_prms(my_model))


# Replace the flex_prms object of a model ---------------------------------
# create a new flex_prms object
conds <- c("one", "two")
prms <- c(muc = 3, b = 0.6, non_dec = 0.3)
new_flex_prms_obj <- flex_prms(
  prms,
  conds = conds
)

flex_prms(my_model) <- new_flex_prms_obj

# acess the new flex_prms object
print(flex_prms(my_model))


# Control the print method -------------------------------------------------
dmc_model <- dmc_dm() # another, more complex, model; comes with dRiftDM
print(flex_prms(dmc_model), round_digits = 1, cust_parameters = FALSE)

Set parameters as equal across conditions

Description

This function takes a flex_prms object and modifies the (linear) internal list so that a parameter is set as equal across multiple conditions, according to the instruction formula (i.e., ' prm ~! conda + condb)

Usage

flex_restrain_prms(flex_prms_obj, formula_instr)

Arguments

flex_prms_obj

flex_prms object

formula_instr

a string referring to "restrain" (see modify_flex_prms)

Value

a modified flex_prms object with an updated (linear) internal list

Set special dependencies (internal docu)

Description

Sets special dependencies so that parameters depend on other parameters. (i.e., 'prmX ~ conda == -(prmY ~ condb)')

Usage

flex_special_dependency(flex_prms_obj, formula_instr)

Arguments

flex_prms_obj

a flex_prms object

formula_instr

a string referring to "special dependency" (see modify_flex_prms)

Value

a modified flex_prms_object with a modified (linear) internal list and modified parameter and custom parameter matrices

Set a specific value to the parameter matrix (internal docu)

Description

This function takes a flex_prms_obj and sets certain values to the parameter matrix, based on the given instruction string (i.e., ' prm ~ conda => 0.3)

Usage

flex_specific_value(flex_prms_obj, formula_instr)

Arguments

flex_prms_obj

flex_prms object

formula_instr

a string referring to "set" (see modify_flex_prms)

Value

an updated flex_prms object with a modified prms_matrix object, and (if applicable) a modified cust_prms matrix

Allow parameters to vary

Description

This function takes an object of type flex_prms and a instruction string that refers to a "vary" instruction (i.e., ab ~ cd + ds). This string is broken down and unique parameters are introduced for the condition x parameter combinations

Usage

flex_vary_prms(flex_prms_obj, formula_instr)

Arguments

flex_prms_obj

an object of type flex_prms

formula_instr

a string referring to "vary" (see modify_flex_prms)

Value

an updated flex_prms_obj with an updated (linear) internal list

Perform a Full Crossover Step Using Differential Evolution

Description

This function updates each chain's parameters by proposing new values using a differential evolution strategy. For each chain k, two other chains m and n are randomly selected, and a proposal is generated via: prms_k + gamma * (prms_m - prms_n) + noise, where gamma = 2.38 / sqrt(2 * n_prms) and noise is uniform perturbation controlled by b. The proposal is accepted with Metropolis probability via call_log_posterior_m(), and accepted proposals replace the current values.

Usage

full_crossover(
  prms_across_chains,
  pis_across_chains,
  log_likes_across_chains,
  gamma = NULL,
  b = 0.001,
  ...
)

Arguments

prms_across_chains

a numeric matrix of dimension ⁠p × n⁠, where p is the number of parameters and n is the number of chains. Each column contains the current parameter vector of a chain.

pis_across_chains

a numeric vector of length n, containing the current log-posterior values for each chain.

log_likes_across_chains

a numeric vector of length n, containing the current log-likelihood values for each chain.

gamma

a single numeric tuning parameter, that scales the difference between parameters. If NULL, defaults to 2.38 / sqrt(2 * n_prms)

b

a small numeric value used to perturb the proposal parameters to avoid degeneracy.

...

additional arguments passed to call_log_posterior_m().

Value

A list with the following components:

new_prms_across_chains: The updated parameter matrix of shape ⁠p × n⁠.
new_pis_across_chains: The updated vector of log-posterior values.
new_log_likes_across_chains: The updated vector of log-likelihood values.

Compute average trials per condition across individuals

Description

internal helper — assumes each ID appears with the same set of conditions in column Cond. counts trials per ID×Cond and averages across ID.

Usage

get_avg_trials(obs_data_ids)

Arguments

obs_data_ids

data frame with columns ID and Cond; one row per trial.

Value

list with:

N: number of unique individuals
avg_trials: named numeric vector of average trials per condition

Get default/fall back component functions

Description

If arguments are provided that are not NULL, the respective argument is simply returned. If it is NULL, then a default/fall back component function is returned for the respective component. This function is called to fill up non-specified component functions when calling drift_dm().

Usage

get_default_functions(
  mu_fun = NULL,
  mu_int_fun = NULL,
  x_fun = NULL,
  b_fun = NULL,
  dt_b_fun = NULL,
  nt_fun = NULL
)

Arguments

mu_fun

drift rate function

mu_int_fun

integral drift rate function

x_fun

starting point function

b_fun

boundary function

dt_b_fun

derivative of boundary function

nt_fun

non-decision time function

Details

defaults...

mu_fun -> constant drift rate of 3 (i.e., vector of 0s)
mu_int_fun -> constant drift rate of 3 (i.e., vector of 3 times t_vec)
x_fun -> dirac delta on zero x_dirac_0()
b_fun -> constant boundary of 0.5 (i.e., vector of 0.5s)
dt_b_fun -> derivate of constant boundary (i.e., vector of 0s). dt_b_constant()
nt_fun -> constant non-decision time of 0.3 (i.e., vector for dirac delta on 0.5).

Value

a list of mu_fun, mu_int_fun, x_fun, b_fun, dt_b_fun, and nt_fun, with either the supplied component functions or the added/filled in default component functions (if an argument is NULL).

Generate Prior Functions for Model Parameters

Description

This function creates prior distribution functions for each model parameter in a drift diffusion model (DDM), depending on the specified hierarchical level. It returns both log-density functions and, where applicable, random-sample generators based on the user-defined prior settings.

Usage

get_default_prior_settings(
  drift_dm_obj,
  level,
  means = NULL,
  sds = NULL,
  lower = NULL,
  upper = NULL,
  shapes = NULL,
  rates = NULL
)

Arguments

drift_dm_obj

a drift_dm model object.

level

a character string, specifying the modeling level. Must be one of: "hyper" (group-level priors), "lower" (individual-level priors given group-level parameters), or "none" (non-hierarchical setting).

means

a named numeric vector or list, specifying the prior means for each parameter. Missing values will be filled up from the first matching parameter in drift_dm_obj

sds

a named numeric vector or list of standard deviations. Missing or NULL values will be replaced by corresponding values from mean.

lower, upper

optional numeric vectors or lists specifying the lower and upper truncation bounds for each prior distribution. Defaults to -Inf and Inf, respectively.

shapes, rates

optional numeric vectors or lists specifying the shape and rate parameter for group-level standard deviations (used at the hyper-level). Defaults to 1.

Details

Each prior is parameter-specific and wrapped using purrr::partial() so that downstream sampling or density evaluation can be performed easily. At the hyper-level, the functions d_default_prior_hyper() and r_default_prior_hyper() are used. At the lower-level, the functions dtnorm() and rtnorm() are used.

The input arguments means, sds, lowers, uppers, shapes, and rates are handled by the function get_parameters_smart().

Value

A named list with two elements:

log_dens_priors: A named list of functions. Each function returns the log-density for a parameter value, based on the chosen prior settings.
r_priors: A named list of functions for sampling from the specified prior distributions.

Auxiliary Function to load a `fits_ids_dm`, `fits_agg_dm`, or `mcmc_dm` object

Description

The function is merely helper functions to create an object of type fits_ids_dm, fits_agg_dm, or mcmc_dm. It is used for example code.

Usage

get_example_fits(class, hierarchical = FALSE)

Arguments

class

a string of either "fits_ids_dm", "fits_agg_dm", or "mcmc_dm" (can be abbreviated)

hierarchical

a logical, relevant when class = "mcmc_dm". If TRUE, an object from a hierarchical fit is returned. If FALSE, an object from an individual fit is returned.

Details

For "fits_ids_dm", the returned object comprises DMC (see dmc_dm()) fitted to three participants of the ulrich_flanker_data.

For "fits_agg_dm", the returned object comprises the Ratcliff model (see ratcliff_dm()) fitted to synthetic data of three participants.

For "mcmc_dm" and hierarchical = FALSE, the returned object comprises the Ratcliff model (see ratcliff_dm()) fitted to synthetic data of one participant.

For "mcmc_dm" and hierarchical = TRUE, the returned object comprises the Ratcliff model (see ratcliff_dm()) fitted to synthetic data of ten participants.

Value

An object of type fits_ids_dm, fits_agg_dm, or mcmc_dm, mimicking a result from calling estimate_dm().

Examples

get_example_fits(class = "fits_agg")

Compute EZ Diffusion parameters

Description

Internal helper that computes EZ diffusion model parameters for each condition in a drift_dm_obj. The computation is based on the equations from Wagenmakers et al. (2007) and estimates drift rate (muc), boundary separation (b), and non-decision time (non_dec).

Usage

get_ez_diffusion(drift_dm_obj)

Arguments

drift_dm_obj

a drift diffusion model object containing observed data in obs_data, including upper (rts_u) and lower (rts_l) response times per condition

Details

If Pc equals 0, 0.5, or 1, small adjustments are applied to prevent numerical issues in the logit transformation.

Value

a matrix with rows muc, b, and non_dec

Get Default Parameter Ranges for a Model

Description

get_lower_upper() returns suggested default values for parameter bounds of a drift_dm model. The function inspects the model's component functions (e.g., drift, boundary, non-decision time, start) and provides heuristic defaults for some of the pre-built components. Only parameters that are currently considered free in the model are returned.

Usage

get_lower_upper(object, ...)

## S3 method for class 'drift_dm'
get_lower_upper(object, ..., warn = TRUE)

Arguments

object

a drift_dm model.

...

additional arguments passed forward to the respective method.

warn

a single logical, if TRUE issue a warning listing components and parameters where no defaults could be provided.

Details

Supported components include: mu_constant, mu_dmc, mu_ssp, b_constant, x_uniform, x_beta, nt_constant, nt_uniform, nt_truncated_normal. For some defaults we use the model's discretization (dt, dx) to ensure sensible minima.

If a component is not recognized (or refers to currently unsupported components), no defaults are provided for that component. When warn = TRUE, a single warning lists components without defaults and any free parameters that remain unmatched. In this case, the user has to add the missing parameter ranges before attempting to fit the model.

The default ranges are heuristics intended to provide a reasonable starting point for new users. They are not guaranteed to be appropriate for every model or data set. Always review and, if needed, adjust the returned values as needed.

Value

a list with two named numeric vectors:

lower — suggested lower bounds for free parameters
upper — suggested upper bounds for free parameters

Examples

# get a model for the example
model <- dmc_dm(obs_data = dmc_synth_data)

# get the parameter ranges
lu <- get_lower_upper(model)
lu$lower
lu$upper

# then continue to estimate
# estimate_dm(model, lower = lu$lower, upper = lu$upper, optimizer = "nmkb")

Turn default/special parameter specifications to vectors

Description

The function is used in the depths to map parameter inputs to the parameters of a model. One application is to get the search space as a vector, matching with the free parameters of a model. Other applications map, for example, mean values to the free parameters of a model. Relevant when users use the "default parameters" approach where they only specify the parameter labels and assume the package figures out how each parameter relates across conditions (see simulate_data). This comes in handy, when freeing a parameter across conditions, while the search space remains the same (otherwise, a user would always have to adapt the vectors for lower/upper to match with x2prms_vals)

Usage

get_parameters_smart(
  drift_dm_obj,
  input_a,
  input_b = NULL,
  labels = TRUE,
  is_l_u = TRUE,
  fill_up_with = NULL
)

Arguments

drift_dm_obj

an object of type drift_dm

input_a, input_b

either a atomic vector or list (see create_matrix_smart)

labels

optional logical, if TRUE, then the returned vectors have the unique parameter labels according to prm_cond_combo_2_labels.

is_l_u

optional logical, if TRUE, a warning is thrown when input_a leads to larger values than input_b. Useful when input_a and input_b span a (search) space.

fill_up_with

optional values used to fill up the returned vectors for all parameters that are not specified in input_a or input_b (requires at least one parameter to specified).

Details

The function first gets all unique parameters across conditions using prms_cond_combo. The unique parameter labels are then forwarded to create_matrix_smart, together with all (!) the conditions in the model and the input_a/input_b arguments. Subsequently, the created matrices are wrangled into vectors in accordance with prms_cond_combo. The vectors are then passed back.

Value

a list with two entries named vec_a/vec_b. The length and names (if requested) matches with coef(model, select_unique = TRUE). When input_a and/or input_b is NULL, the respective entry for vec_a/vec_b will be NULL as well.

TITLE

Description

TITLE

Usage

get_starting_values(object, ...)

## S3 method for class 'drift_dm'
get_starting_values(
  object,
  ...,
  lower = NULL,
  upper = NULL,
  verbose = 0,
  use_ez = NULL,
  n_lhs = NULL
)

Arguments

object

...

Extract a Subset of MCMC Chains

Description

When calling estimate_bayesian(), the MCMC results are packed up as an mcmc_dm object. This function is used in the depths of dRiftDM to extract the relevant array of MCMC samples, depending on whether the model is hierarchical and whether a participant ID is provided.

Usage

get_subset_chains(chains_obj, id = NULL)

Arguments

chains_obj

an object of classmcmc_dm.

id

an optional single numeric or character, specifying the ID of a participant to extract individual-level samples from a hierarchical model. Ignored for non-hierarchical models.

Value

A 3D array of MCMC samples. The first dimension indicates parameters, the second dimension chains, and the third dimension iterations

Plot Parameter Distribution(s)

Description

This function creates a histogram for each parameter in a coefs_dm object, resulting from a call to coef.fits_ids_dm.

Usage

## S3 method for class 'coefs_dm'
hist(
  x,
  ...,
  conds = NULL,
  col = NULL,
  xlim = NULL,
  ylim = NULL,
  xlab = "value",
  ylab = NULL,
  bundle_plots = TRUE
)

Arguments

x

an object of class coefs_dm (see coef.fits_ids_dm)

...

additional graphical arguments passed to graphics::hist(). Not supported are the plot and probability arguments (the latter can be controlled via the supported freq argument). For further plotting arguments, see also set_default_arguments().

conds

a character vector specifying the conditions to plot. Defaults to all available conditions.

col

character vector, specifying colors for each condition, if conditions are present.

xlim

a numeric vector of length 2, specifying the x-axis limits.

ylim

a numeric vector of length 2, specifying the y-axis limits.

xlab, ylab

character strings for the x- and y-axis labels.

bundle_plots

logical, indicating whether to display separate panels in a single plot layout (FALSE), or to plot them separately (TRUE).

Details

The hist.coefs_dm function is designed for visualizing parameter distributions.

If multiple conditions are present, it overlays histograms for each condition with adjustable transparency.

When bundle_plots is set to TRUE, histograms for each parameter are displayed in a grid layout within a single graphics device.

This function has some customization options, but they are limited. If you want to have a highly customized histogram, it is best to create it on your own using R's graphics::hist() function (see the examples below).

Value

Nothing (NULL; invisibly)

Examples

# get an auxiliary fit procedure result (see the function load_fits_ids)
all_fits <- get_example_fits("fits_ids")
coefs <- coef(all_fits)
print(coefs)
hist(coefs, bundle_plots = FALSE) # calls hist.coefs_dm method of dRiftDM

# how to fall back to R's hist() function for heavy customization
coefs <- unpack_obj(coefs) # provides the plain data.frame
hist(coefs$muc, main = expression(mu[c])) # calls graphics::hist()

Aggregate Data Frame Columns by Group

Description

internal function to aggregate columns of a data frame across "ID" while considering a set of grouping columns. It retains the class and attriubtes of the input data.

Usage

internal_aggregate(data, group_cols)

Arguments

data

A data.frame containing the data to be aggregated. It should include both the grouping columns, an "ID" column, and the columns for which aggregation shall take place.

group_cols

A character vector specifying the names of the columns to group by during aggregation.

Details

internal_aggregate identifies DV columns as those not in group_cols or "ID". It then calculates the mean of these DV columns, grouped by the specified columns. Columns specified in group_cols that are not part of data are ignored silently.

Value

A data.frame containing the aggregated data.

Turn an internal list to a matrix

Description

this function wrangles the internal list to character matrix to show how each parameter relates across conditions

Usage

internal_list_to_matrix(internal_list)

Arguments

internal_list

the internal list of a flex_prms object

Value

a character matrix with all parameter labels (columns) and conditions (rows)

Numerical integration using the trapezoidal rule

Description

These are internal helper functions to perform numerical integration via the trapezoidal rule. The workhorse is internal_trapz(), which computes the full integral or returns the cumulative integral.

Usage

internal_trapz(x, y, return_cumsum = FALSE)

cumtrapz(x, y)

trapz(x, y)

Arguments

x

numeric vector of strictly increasing x-values.

y

numeric vector of function values at x.

return_cumsum

logical, if TRUE return the cumulative integral at each point in x (starting with 0), if FALSE return the total integral.

Details

internal_trapz(x, y, return_cumsum = FALSE): core implementation
trapz(x, y) wrapper for internal_trapz(x, y, FALSE), returns the total integral.
cumtrapz(x, y) wrapper for internal_trapz(x, y, TRUE), returns the cumulative integral.

Value

trapz(): a single numeric value
cumtrapz(): numeric vector of cumulative integrals (starting with 0)
internal_trapz(): either of the above, depending on return_cumsum

Check if an object of length <= 1 is either empty or contains entry with ""

Description

Check if an object of length <= 1 is either empty or contains entry with ""

Usage

is_empty(x)

Arguments

x

a single value or empty vector

Value

TRUE or FALSE otherwise

Check if an object is a valid numeric vector

Description

This function verifies whether the input is a numeric vector with no missing (NA, NaN) or infinite (Inf or -Inf) values.

Usage

is_numeric(x)

Arguments

x

An object to check.

Value

A logical value: TRUE if the input is a numeric vector without any missing or infinite values, otherwise FALSE.

Relabel the internal list

Description

The entries of the internal list are either digits (0-x) or expressions. To ensure a valid mapping of these values to an input vector (as done when an optimizer provides input parameters), we have to linearize the list (done whenever modifying the list, see the different flex_* functions )

Usage

linearize_internal_list(internal_list)

Arguments

internal_list

the internal list, with entries for each parameter x condition combination

Value

another list, but with remapped digits in increasing order, while leaving expressions or digits of 0 untouched.

Load Estimates of a Fit Procedure

Description

This function was deprecated in dRiftDM version 0.3.0, because dRiftDM no longer saves model fits to disk when fitting multiple participants. When estimating multiple individuals with the new function estimate_dm(), an object of type fits_ids_dm is returned directly.

Usage

load_fits_ids(
  path = "drift_dm_fits",
  fit_procedure_name = "",
  detailed_info = FALSE,
  check_data = TRUE,
  progress = 2
)

Arguments

path

character, a path pointing to a folder or directory containing the individual model fits.

fit_procedure_name

character, an optional name that identifies the fit procedure that should be loaded

detailed_info

logical, controls the amount of information displayed in case multiple fit procedures were found and the user is prompted to explicitly choose one

check_data

logical, should the data be checked before passing them back? This checks the observed data and the properties of the model. Default is TRUE

progress

numerical, indicating if and how progress shall be depicted. If 0, no progress is shown. If 1, basic infos about the checking progress is shown. If 2, multiple progressbars are shown. Default is 2.

Details

Old documentation: This function loads the results of a fit procedure where a model was fitted to multiple individuals (see estimate_model_ids). It is also the function that creates an object of type fits_ids_dm.

with respect to the logic outlined in the details of estimate_model_ids on the organization of fit procedures, path could either point to a directory with (potentially) multiple fit routines or to a specific folder with the individual fits. In either case the intended location is recursively searched for files named drift_dm_fit_info.rds.

If the fit procedure was uniquely located, either because only one fit routine was found in the intended location or because only one drift_dm_fit_info.rds contains the optional identifier specified in fit_procedure_name, then all individual model fits including the information fit_procedure_name are loaded and returned.

In case multiple fit procedures are identified, the user is prompted with a utils::menu, listing information about the possible candidates. The intended fit procedure can then interactively be chosen by the user. The amount of displayed information is controlled via detailed_info.

The print() method for objects of type fits_ids_dm prints out basic information about the fit procedure name, the fitted model, time of (last) call, and the number of individual data sets.

Value

For load_fits_ids(), an object of type fits_ids_dm, which essentially is a list with two entries:

drift_dm_fit_info, containing a list of the main arguments when estimate_model_ids was originally called, including a time-stamp.
all_fits, containing a list of all the modified/fitted drift_dm objects. The list's entry are named according to the individuals' identifier (i.e., ID).

For print.fits_ids_dm(), the supplied fit_ids_dm object x (invisible return).

Extract Log-Likelihood for a drift_dm Object

Description

This method extracts the log-likelihood for a drift_dm object if possible.

Usage

## S3 method for class 'drift_dm'
logLik(object, ...)

Arguments

object

a drift_dm object containing observed data

...

additional arguments

Value

A logLik object containing the log-likelihood value for the drift_dm object. This value has attributes for the number of observations (nobs) and the number of model parameters (df).

Returns NULL if the log-likelihood is not available (e.g., when the model has no observed data attached).

Examples

# get a pre-built model and a data set for demonstration purpose
# (when creating the model, set the discretization to reasonable values)
a_model <- dmc_dm()
obs_data(a_model) <- dmc_synth_data
logLik(a_model)

Extract Model Statistics for fits_ids_dm Object

Description

These methods are wrappers to extract specific model fit statistics (log-likelihood, AIC, BIC) for each model in a fits_ids_dm object.

Usage

## S3 method for class 'fits_ids_dm'
logLik(object, ...)

## S3 method for class 'fits_ids_dm'
AIC(object, ..., k = 2)

## S3 method for class 'fits_ids_dm'
BIC(object, ...)

Arguments

object

a fits_ids_dm object (see estimate_model_ids)

...

additional arguments (currently not used)

k

numeric; penalty parameter for the AIC calculation. Defaults to 2 (standard AIC).

Details

Each function retrieves the relevant statistics by calling calc_stats with type = "fit_stats" and selects the columns for ID and the required statistic.

Value

An object of type fit_stats containing the respective statistic in one column (named Log_Like, AIC, or BIC) and a corresponding ID column. If any of the statistics can't be calculated, the function returns NULL.

Examples

# get an auxiliary fits_ids object for demonstration purpose;
# such an object results from calling load_fits_ids
all_fits <- get_example_fits("fits_ids_dm")

# AICs
AIC(all_fits)

# BICs
BIC(all_fits)

# Log-Likelihoods
logLik(all_fits)

# All unique and free parameters
coef(all_fits)

# Or all parameters across all conditions
coef(all_fits, select_unique = FALSE)

Conditional Log-Posterior Distributions for MCMC Sampling

Description

These functions compute conditional log-posterior distributions used in a (hierarchical) MCMC sampler.

Usage

log_posterior_hyper(
  phi_j_mat,
  theta_j_mat,
  log_prior_hyper_fun,
  log_prior_lower_fun,
  temperatures,
  suppress_warnings = TRUE
)

log_posterior_lower(
  thetas_one_subj_mat,
  all_phis_mat,
  model_subj,
  log_prior_lower_funs,
  temperatures,
  suppress_warnings = TRUE
)

Arguments

phi_j_mat

a numeric matrix of current group-level parameters for one individual-level parameter. It must be 2 x n_chains and provide the mean and standard deviation; in that order.

theta_j_mat

a numeric matrix of individual-level parameter values across all individuals and chains for one model parameter. Must be n_chains x n_subj.

log_prior_hyper_fun

a function that returns the log-prior density of the hyperparameters. Must be a single function (not a list of functions) and it must accept phi_j_mat as input.

log_prior_lower_fun

a function that returns the log-prior density of individual parameter values given the mean and standard deviation at the group-level (as stored in phi_j_mat). Must be a single function (not a list of functions) and it must accept theta_j_mat as a first argument and input, and the mean and standard deviation with arguments mean and sd (vectorized).

temperatures

a numeric vector of temperature scaling values, one per chain, used when applying tempered inference (e.g., in TIDE).

suppress_warnings

logical, if TRUE, warnings created from log_prior_hyper_fun and log_prior_lower_fun(s) are suppressed. The default is true, because in the beginning of an MCMC sampler implausible proposals are provided which can yield missing values and warnings.

thetas_one_subj_mat

a named matrix of lower-level parameters for a single participant. Each row represents one parameter, and each column one chain.

all_phis_mat

a named matrix of all current group-level parameters. Each mean group-level parameter must be named "M-<param>" and each standard deviation "S-<param>". The ⁠<param>⁠ part must match the individual-level parameters in thetas_one_subj_mat. If this argument is NULL, this indicates that the estimation is done in a non-hierarchical fashion. Each row represents a hyper-parameter, and each column one chain.

model_subj

a drift_dm object, containing an individual's data.

log_prior_lower_funs

a named list of functions, one per parameter stored in thetas_one_subj_mat, returning the log-prior densities. It is assumed that each function can take one type of parameter across chains (i.e., a vector). In the non-hierarchical case, each function can only accept a single vector (for the respective individual-level parameter across chains). In the hierarchical case, each function must also support the arguments mean and sd for vectorized prior computation.

Details

log_posterior_hyper() computes the conditional log-posterior for a group-level hyperparameter matrix phi_j_mat, given the individual-level parameters across subjects theta_j_mat (for one type of model parameter).

log_posterior_lower() computes the conditional log-posterior for an individual participant’s parameter matrix thetas_one_subj_mat, given prior distributions. In the hierarchical setting, the prior distributions are conditioned on the group-level parameters.

Value

A list with two elements:

posterior_vals, the total log-posterior values (log-likelihood + log-prior) per chain.
log_like_vals, the log-likelihood components only, per chain.

Get the maximum number from an internal entry or flex_prms_obj (internal docu)

Description

The entries of the internal list are either digits (0-x) or expressions.

Usage

max_number_one_internal_entry(one_internal_entry)

get_number_prms(flex_prms_obj)

Arguments

one_internal_entry

one entry of multiple conditions

flex_prms_obj

a list stored as a flex_prms object

Value

the largest digit in the entry or of the linear_list in the supplied flex_prms_obj (0 if there are only expressions). The largest number of the linear_list corresponds to the number of model parameters.

Convert MCMC Chain Array to a `coda::mcmc.list` Object

Description

Converts a 3D MCMC chain array (parameters × chains × iterations) into a coda::mcmc.list object for compatibility with diagnostic and summary functions from the coda package.

Usage

mcmc_dm_to_coda_mcmc(chains)

Arguments

chains

a 3D numeric array with dimensions corresponding to parameters × chains × iterations. chains are typically obtained from a call to get_subset_chains().

Value

An object of class mcmc.list containing one mcmc object per chain.

Perform a Migration Step Between Chains

Description

In a migration step, a random subset of chains is selected. Each selected chain k receives a proposal based on the next chain (k + 1) in the sequence (cycling back to the first chain after the last). The proposed parameters are slightly perturbed by uniform noise controlled by b. All proposals are then evaluated simultaneously using Metropolis acceptance probabilities via call_log_posterior_m(), and accepted proposals replace the current values.

Usage

migration_crossover(
  prms_across_chains,
  pis_across_chains,
  log_likes_across_chains,
  b = 0.001,
  ...
)

Arguments

prms_across_chains

a numeric matrix of dimension ⁠p × n⁠, where p is the number of parameters and n is the number of chains. Each column contains the current parameter vector of a chain.

pis_across_chains

a numeric vector of length n, containing the current log-posterior values for each chain.

log_likes_across_chains

a numeric vector of length n, containing the current log-likelihood values for each chain.

b

a small numeric value used to perturb the proposal parameters to avoid degeneracy.

...

additional arguments passed to call_log_posterior_m().

Value

A list with the following components:

new_prms_across_chains: The updated parameter matrix of shape ⁠p × n⁠.
new_pis_across_chains: The updated vector of log-posterior values.
new_log_likes_across_chains: The updated vector of log-likelihood values.

Set Instructions to a flex_prms object

Description

Functions to carry out the "instructions" on how to modify a flex_prms object, specified as a string.

Usage

modify_flex_prms(object, instr, ...)

## S3 method for class 'drift_dm'
modify_flex_prms(object, instr, ..., eval_model = FALSE)

## S3 method for class 'flex_prms'
modify_flex_prms(object, instr, ..., messaging = NULL)

Arguments

object

an object of type drift_dm or flex_prms.

instr

a character string, specifying a set of instructions (see Details).

...

further arguments passed forward to the respective method.

eval_model

logical, indicating if the model should be re-evaluated or not when updating modifying the flex_prms object (see re_evaluate_model). Default is FALSE.

messaging

logical, indicating if messages shall be displayed or not. Can happen, for example, when setting a parameter value for a specific condition, although the parameter values are assumed to be the identical across conditions.

Details

modify_flex_prms is a generic function. The default methods pass forward a set of "instructions" to modify the (underlying) flex_prms object.

These instructions are inspired by the model syntax of the lavaan package. Note that specifying multiple instructions is possible, but each instruction has to be defined in its own line. Comments with '#' are possible, also line continuations are possible, if the last symbol is a "+","-", "*", "/", "(", or "[". The following instructions are implemented:

The "vary" instruction:

Looks something like "a ~ foo + bar"
This means that the parameter 'a' is allowed to vary independently for the conditions 'foo' and 'bar'
Thus, when estimating the model, the user will have independent values for 'a' in conditions 'foo' and 'bar'

The "restrain" instruction:

Looks something like "a ~! foo + bar "
This means that the parameter 'a' is assumed to be identical for the conditions 'foo' and 'bar'
Thus, when estimating the model, the user will have only a single value for 'a' in conditions 'foo' and 'bar'

The "set" instruction:

Users may not always estimate a model directly but rather explore the model behavior. In this case setting the value of a parameter is necessary.
The corresponding instruction looks something like "a ~ foo => 0.3"
This will set the value for 'a' in condition 'foo' to the value of 0.3

The "fix" instruction:

Oftentimes, certain parameters of a model are considered "fixed", so that they don't vary while the remaining parameters are estimated. An example would be the shape parameter 'a' of DMC (see dmc_dm).
The corresponding instruction looks something like "a <!> foo + bar"
Usually, users want to call the "set" instruction prior or after the "fix" instruction, to set the corresponding parameter to a certain value.

The "special dependency" instruction:

Sometimes, users want to allow one parameter to depend on another. For instance, in DMC (see dmc_dm), the parameter A is positive in the congruent condition, but negative in the incongruent condition. Thus, parameters may have a 'special depencency' which can be expressed as an equation.
To define a special dependency, users can use the operation "==". The parameter that should have the dependency is on the left-hand side, while the mathematical relationship to other parameters is defined on the right-hand side.
This then looks something like "a ~ foo == -(a ~ bar)".
This means that the parameter a in condition foo will always be -1 * the parameter a in condition bar. Thus, if a in condition bar has the value 5, then a in condition foo will be -5.
The expression on the right-side can refer to any arbitrary mathematical relation.
Important: Make sure that each 'parameter ~ condition' combination on the right-hand side of the equation are set in brackets.
Another example: Parameter a in condition foo should be the mean of the parameter b in conditions bar and baz; this would be the instruction "a ~ foo == 0.5*(b ~ bar) + 0.5*(b ~ baz)"

The "additional/custom parameter combination" instruction:

Sometimes, users may want to combine multiple parameters to summarize a certain property of the model. For example, in DMC (see dmc_dm), the shape and rate parameter jointly determine the peak latency.
To avoid having to calculate this manually, users can define "custom" parameter combinations using the ":=" operation:
An exemplary instruction might look like this: "peak_l := (a - 1) * tau"
Expressions and values that provide calculations for those parameters are stored in a separate list cust_prms.

Value

For drift_dm objects, the updated drift_dm object.

For flex_prms, the updated flex_prms object.

Examples

# Example 1: Modify a flex_prms object  directly ---------------------------
# create an auxiliary flex_prms object
a_flex_prms_obj <- flex_prms(
  c(muc = 3, b = 0.5, non_dec = 0.3),
  conds = c("foo", "bar")
)

# then carry out some "instructions". Here (arbitrary operations):
# 1.) Consider b as fixed
# 2.) Let muc vary independently for the conditions foo and bar
# 3.) Set non_dec in condition bar to be half as large as non_dec in
#     condition bar
instr <-
  "b <!>
 muc ~
 non_dec ~ bar == (non_dec ~ foo) / 2
"
modify_flex_prms(object = a_flex_prms_obj, instr = instr)


# Example 2: Modify a flex_prms object stored inside a drift_dm object -----
a_model <- ratcliff_dm() # get a model for demonstration purpose
modify_flex_prms(object = a_model, instr = "muc ~ => 4")

Constant Drift Rate

Description

Constant Drift Rate

Usage

mu_constant(prms_model, prms_solve, t_vec, one_cond, ddm_opts)

Arguments

prms_model

the model parameters, containing muc

prms_solve

solver settings

t_vec

time space

one_cond

one condition

ddm_opts

optional arguments attached to an object

Value

a vector of the same length as t_vec with the drift rate for each element of the vector.

Drift Rate for DMC

Description

Provides the drift rate of the superimposed decision process. That is the derivative of the rescaled gamma function plus a constant drift rate for the controlled process.

Usage

mu_dmc(prms_model, prms_solve, t_vec, one_cond, ddm_opts)

Arguments

prms_model

the model parameters, containing muc, tau, a, A

prms_solve

solver settings

t_vec

time space

one_cond

one condition

ddm_opts

optional arguments attached to an object

Value

provides the first derivative of the superimposed process with respect to t_vec.

Integral of Constant Drift Rate

Description

Integral of Constant Drift Rate

Usage

mu_int_constant(prms_model, prms_solve, t_vec, one_cond, ddm_opts)

Arguments

prms_model

the model parameters, containing muc

prms_solve

solver settings

t_vec

time space

one_cond

one condition

ddm_opts

optional arguments attached to an object

Value

a vector calculated as t_vec*muc

Integral of DMC's Drift Rate

Description

Provides the integral of the drift rate of the superimposed decision process. This is the sum of the rescaled gamma function and the linear function.

Usage

mu_int_dmc(prms_model, prms_solve, t_vec, one_cond, ddm_opts)

Arguments

prms_model

the model parameters, containing muc, tau, a, A

prms_solve

solver settings

t_vec

time space

one_cond

one condition

ddm_opts

optional arguments attached to an object

Value

provides the scaled gamma distribution function of the superimposed process for each time step in t_vec.

Drift Rate for SSP

Description

Provides the drift rate for the SSP model. That is, the sum of attention attributed to the flankers and central target, scaled by the perceptual input.

Usage

mu_ssp(prms_model, prms_solve, t_vec, one_cond, ddm_opts)

Arguments

prms_model

the model parameters, containing p, sd_0, r, sign

prms_solve

solver settings

t_vec

time space

one_cond

one condition

ddm_opts

optional arguments attached to an object

Value

provides the drift rate for SSP with respect to t_vec

Create a DDM model — internal

Description

This function assembles all components to create a drift_dm object.

Usage

new_drift_dm(
  flex_prms_obj,
  sigma,
  t_max,
  dt,
  dx,
  solver,
  comp_funs,
  cost_function,
  subclass,
  b_coding = NULL,
  obs_data = NULL
)

Arguments

flex_prms_obj

a flex_prms object.

sigma

the diffusion noise (sigma).

t_max

the maximum trial duration (t_max).

dt

the temporal step size (dt).

dx

the evidence step size (dx).

solver

a string identifying the solver (e.g., "kfe").

comp_funs

a list of component functions.

cost_function

a string, defining how to compute the fit cost.

subclass

a string with model info label set for the child class.

b_coding

an optional list with boundary coding (e.g., drift_dm_default_b_coding()).

obs_data

an optional data.frame with observed data.

Details

We do not perform input checks here; we just assemble the object. Any pre-wrangling is done in drift_dm(). Checks are performed by validate_drift_dm(), which is called indirectly via the setters (e.g., prms_solve() and obs_data()).

Value

A list with elements flex_prms_obj, prms_solve, solver, comp_funs, and cost_function. The object has class attributes c(subclass, "drift_dm") and an attribute "b_coding" containing the boundary coding. If obs_data is not NULL, the observed data are attached via obs_data().

Create a New stats_dm Object

Description

new_stats_dm initializes a stats_dm object to label statistic types and store necessary attributes for the custom methods (such as ⁠plot_*⁠)

Usage

new_stats_dm(stat_df, type, ...)

Arguments

stat_df

a data.frame, containing calculated statistics to be encapsulated within the stats_dm class.

type

a character string, specifying the type of statistic provided by stat_df. Valid options include "basic_stats", "cafs", "quantiles", "delta_funs", and "fit_stats".

...

Additional arguments passed to set attributes. For "cafs", "quantiles", and "delta_funs", a b_coding attribute is required.

Details

new_stats_dm sets up the stat_df object by assigning it the class stats_dm, along with additional classes based on the specified type. For "basic_stats", "cafs", "quantiles", "delta_funs", this will be c(type, "sum_dist", "stats_dm", "data.frame")". For "fit_stats", this will be c(type, "stats_dm", "data.frame")".

For basic stats, Conditional Accuracy Functions (CAFs), Quantiles, and Delta Functions, the function requires a b_coding argument, which specifies boundary coding details and is set as an attribute.

The function performs validation through validate_stats_dm to ensure that the stats_dm object is well formatted.

Value

An object of class stats_dm, with additional classes and attributes depending on type.

Get the Number of Observations for a drift_dm Object

Description

This method retrieves the total number of observations in the obs_data list of a drift_dm object.

Usage

## S3 method for class 'drift_dm'
nobs(object, ...)

Arguments

object

a drift_dm object, which potentially contains the observed data in object$obs_data.

...

additional arguments

Details

The function iterates over each element in object$obs_data, counts the entries in each nested component, and returns the cumulative sum as the total observation count.

It was written to provide an nobs method for calculating the log-likelihood (logLik), AIC (stats::AIC), and BIC (stats::BIC) statistics for objects of type drift_dm.

Value

An integer representing the total number of observations across all conditions in object$obs_data. If obs_data doesn't exist, the function returns 0

Examples

# get a pre-built model and data set for demonstration purpose
a_model <- dmc_dm()
obs_data(a_model) <- dmc_synth_data

# then get the number of observations by accessing the model
nobs(a_model)

# same number of observations as in the original data set
nrow(dmc_synth_data)

Constant Non-Decision time

Description

A dirac delta on "non_dec", to provide a constant non-decision time.

Usage

nt_constant(prms_model, prms_solve, t_vec, one_cond, ddm_opts)

Arguments

prms_model

the model parameters; containing "non_dec"

prms_solve

solver settings

t_vec

time space

one_cond

one condition

ddm_opts

optional arguments attached to an object

Value

a vector of the same length as t_vec with zeros, except for the element matching with "non_dec" with respect to "t_vec"

Truncated Normally-Distributed Non-Decision Time

Description

Truncated Normally-Distributed Non-Decision Time

Usage

nt_truncated_normal(prms_model, prms_solve, t_vec, one_cond, ddm_opts)

Arguments

prms_model

the model parameters; including "non_dec" and "sd_non_dec"

prms_solve

solver settings

t_vec

time space

one_cond

one condition

ddm_opts

optional arguments attached to an object

Value

returns the PDF of a truncated normal distribution for t_vec, with mean "non_dec" and standard deviation "sd_non_dec". Lower truncation is 0. Upper truncation is max(t_vec)

Uniform Non-Decision Time

Description

Uniform Non-Decision Time

Usage

nt_uniform(prms_model, prms_solve, t_vec, one_cond, ddm_opts)

Arguments

prms_model

the model parameters; including "non_dec" and "range_non_dec"

prms_solve

solver settings

t_vec

time space

one_cond

one condition

ddm_opts

optional arguments attached to an object

Value

returns the PDF of a uniform distribution for t_vec, centered around "non_dec" and with a range of "range_non_dec".

The Observed Data

Description

Functions to get or set the "observed data" of an object.

Usage

obs_data(object, ...) <- value

## S3 replacement method for class 'drift_dm'
obs_data(object, ..., eval_model = FALSE) <- value

obs_data(object, ...)

## S3 method for class 'drift_dm'
obs_data(object, ..., messaging = TRUE)

## S3 method for class 'fits_ids_dm'
obs_data(object, ...)

## S3 method for class 'fits_agg_dm'
obs_data(object, ...)

Arguments

object

an object of type drift_dm, fits_ids_dm, or fits_agg_dm (see estimate_dm()).

...

additional arguments passed down to the specific method.

value

a data.frame which provides three columns: (1) RT for the response times, (2) a column for boundary coding according to the model's b_coding(), (3) Cond for specifying the conditions.

eval_model

logical, indicating if the model should be re-evaluated or not when updating the solver settings (see re_evaluate_model). Default is FALSE.

messaging

logical, indicating if messages shall be displayed or not.

Details

obs_data() is a generic accessor function, and ⁠obs_data<-()⁠ is a generic replacement function. The default methods get and set the "observed data". Their behavior, however, may be a bit unexpected.

In drift_dm objects, the observed data are not stored as a data.frame. Instead, any supplied observed data set is disassembled into RTs for the upper and lower boundary and with respect to the different conditions (ensures more speed and easier programming in the depths of the package). Yet, obs_data() returns a data.frame for drift_dm objects. This implies that obs_data() does not merely access the observed data, but re-assembles it. Consequently, a returned data.frame for the observed data is likely sorted differently than the data.frame that was originally set to the model via ⁠obs_data<-()⁠. Also, when the originally supplied data set provided more conditions than the model, the unused conditions will not be part of the returned data.frame.

For fits_ids_dm (see load_fits_ids), the observed data are stored as a data.frame in the general fit procedure info. This is the data.frame that obs_data() will return. Thus, the returned data.frame will match with the data.frame that was initially supplied to estimate_model_ids, although with unused conditions being dropped.

In theory, it is possible to update parts of the "observed data". However, because obs_data() returns a re-assembled data.frame for drift_dm objects, great care has to be taken with respect to the ordering of the argument value. A message is displayed to remind the user that the returned data.frame may be sorted differently than expected.

Value

For obs_data() a data.frame of the observed data. The method obs_data.drift_dm() per default displays a message to remind the user that the returned data.frame is likely sorted differently than expected.

For ⁠obs_data<-()⁠ the updated drift_dm object.

Note

There is only a replacement function for drift_dm objects. This is because replacing the observed data after the model has been fitted (i.e., for a fits_ids_dm object) doesn't make sense.

Examples

# Set some data to a model -------------------------------------------------
my_model <- dmc_dm() # DMC is pre-built and directly available
# synthetic data suitable for DMC; comes with dRiftDM
some_data <- dmc_synth_data
obs_data(my_model) <- some_data

# Extract data from a model ------------------------------------------------
head(obs_data(my_model))

# Important: ---------------------------------------------------------------
# The returned data.frame may be sorted differently than the one initially
# supplied.
some_data <- some_data[sample(1:nrow(some_data)), ] #' # shuffle the data set
obs_data(my_model) <- some_data
all.equal(obs_data(my_model), some_data)
# so don't do obs_data(my_model)["Cond"] <- ...

# Addition: ----------------------------------------------------------------
# accessor method also available for fits_ids_dm objects
# (see estimate_model_ids)
# get an exemplary fits_ids_dm object
fits <- get_example_fits("fits_ids_dm")
head(obs_data(fits))

Disassemble an Observed Data set

Description

Takes a data.frame with columns RT, Cond, and column matching with b_coding, and disassembles it into a list of rts

Usage

obs_data_to_rt_lists(obs_data, b_coding = NULL)

Arguments

obs_data

a data.frame wth columns RT, Cond, and column matching b_coding

b_coding

a boundary coding list (see b_coding)

Details

performs checks on b_coding (check_b_coding) and checks/reduces obs_data (check_reduce_raw_data) before disassembling the data set.

Value

a list of rts with entries

rts_u -> containing a list of numeric vectors, with names according to the values in Cond
rts_l -> containing a list of numeric vectors, with names according to the values in Cond

Access the Probability Density Functions of a Model

Description

Functions to obtain the probability density functions (PDFs) of a model. These PDFs represent the convolution of the first-passage-time (decision time) with the non-decision time.

Usage

pdfs(object, ...)

## S3 method for class 'drift_dm'
pdfs(object, ...)

## S3 method for class 'fits_agg_dm'
pdfs(object, ...)

Arguments

object

an object of type drift_dm or fits_agg_dm (see estimate_dm()).

...

additional arguments passed down to the specific method.

Details

If the model has not been evaluated, re_evaluate_model() is called before returning the PDFs.

Value

A list with the entries:

pdfs, contains another named list with entries corresponding to the conditions of the model (see conds()). Each of these elements is another named list, containing the entries pdf_u and pdf_l, which are numeric vectors for the PDFs of the upper and lower boundary, respectively.
t_vec, containing a numeric vector of the time domain.

Examples

# get a pre-built model for demonstration purpose
a_model <- dmc_dm()
str(pdfs(a_model))

Plot Conditional Accuracy Functions (CAFs)

Description

Visualizes conditional accuracy functions (CAFs) for observed and/or predicted data. This is useful for assessing model fit or exploring response patterns across conditions or participants.

Usage

## S3 method for class 'cafs'
plot(
  x,
  ...,
  id = NULL,
  conds = NULL,
  col = NULL,
  xlim = NULL,
  ylim = c(0, 1),
  xlab = "Bins",
  ylab = NULL,
  interval_obs = TRUE,
  interval_pred = TRUE
)

Arguments

x

an object of type = "cafs", typically returned by calc_stats().

...

additional graphical arguments passed to plotting functions. See set_default_arguments() for the full list of supported options.

id

a numeric or character, specifying the ID of a single participant to plot. If length(id) > 1, plot.cafs() is called recursively for each entry. Each id must match an entry in the ID column of x.

conds

a character vector specifying the conditions to plot. Defaults to all available conditions.

col

a character vector specifying colors for each condition. If a single color is provided, it is repeated for all conditions.

xlim

a numeric vector of length 2, specifying the x-axis limits.

ylim

a numeric vector of length 2, specifying the y-axis limits.

xlab, ylab

character strings for the x- and y-axis labels.

interval_obs, interval_pred

logicals; if TRUE and x contains a column named Estimate, error bars for observed data and shaded contours for predicted data are drawn, respectively.

Details

If x contains multiple IDs and no specific id is provided, the function aggregates across participants before plotting.

Observed CAFs are shown as points, and predicted CAFs as lines. When interval = TRUE and the input includes interval estimates (i.e., the column Estimate exists), the plot includes error bars for observed data and shaded contours for model predictions.

Colors, symbols, and line styles can be customized via ....

Value

Returns NULL invisibly. The function is called for its side effect of generating a plot.

Examples

# Example 1: Model predictions only ---------------------------------------
a_model <- dmc_dm()
cafs <- calc_stats(a_model, type = "cafs")
plot(cafs)
plot(cafs, col = c("green", "red"), ylim = c(0.5, 1))

# Example 2: Observed and predicted data ----------------------------------
obs_data(a_model) <- dmc_synth_data
cafs <- calc_stats(a_model, type = "cafs")
plot(cafs)

# Example 3: Observed data only -------------------------------------------
cafs <- calc_stats(dmc_synth_data, type = "cafs")
plot(cafs)

# Example 4: Observed data with interval ----------------------------------
cafs <- calc_stats(dmc_synth_data, type = "cafs", resample = TRUE)
plot(cafs)

Plot Delta Functions

Description

Visualizes delta functions for observed and/or predicted data. This is useful for assessing model fit or exploring the model behavior

Usage

## S3 method for class 'delta_funs'
plot(
  x,
  ...,
  id = NULL,
  conds = NULL,
  dv = NULL,
  col = NULL,
  xlim = NULL,
  ylim = NULL,
  xlab = "RT [s]",
  ylab = expression(Delta),
  interval_obs = TRUE,
  interval_pred = TRUE
)

Arguments

x

an object of type = "delta_funs", typically returned by calc_stats().

...

additional graphical arguments passed to plotting functions. See set_default_arguments() for the full list of supported options.

id

conds

a character vector specifying the conditions to plot. Defaults to all available conditions.

dv

a character vector indicating the delta function(s) to plot. Defaults to all columns in x that begin with "Delta_".

col

a character vector specifying colors for each condition. If a single color is provided, it is repeated for all conditions.

xlim

a numeric vector of length 2, specifying the x-axis limits.

ylim

a numeric vector of length 2, specifying the y-axis limits.

xlab, ylab

character strings for the x- and y-axis labels.

interval_obs, interval_pred

logicals; if TRUE and x contains a column named Estimate, error bars for observed data and shaded contours for predicted data are drawn, respectively.

Details

If x contains multiple IDs and no specific id is provided, the function aggregates across participants before plotting.

Observed delta functions are shown as points, and predicted delta functions as lines. When interval_obs = TRUE or interval_pred = TRUE and the input includes interval estimates (i.e., the column Estimate exists), the plot includes error bars for observed data and shaded contours for model predictions.

Colors, symbols, and line styles can be customized via ....

Value

Returns NULL invisibly. The function is called for its side effect of generating a plot.

Examples

# Example 1: Model predictions only ---------------------------------------
a_model <- dmc_dm()
deltas <- calc_stats(
  a_model,
  type = "delta_funs",
  minuends = "incomp",
  subtrahends = "comp"
)
plot(deltas)
plot(deltas, col = "black", lty = 2, xlim = c(0.2, 0.65))

# Example 2: Observed and predicted data ----------------------------------
obs_data(a_model) <- dmc_synth_data
deltas <- calc_stats(
  a_model,
  type = "delta_funs",
  minuends = "incomp",
  subtrahends = "comp"
)
plot(deltas)

# Example 3: Observed data only -------------------------------------------
deltas <- calc_stats(
  dmc_synth_data,
  type = "delta_funs",
  minuends = "incomp",
  subtrahends = "comp"
)
plot(deltas)

# Example 4: Observed data with intervals ---------------------------------
deltas <- calc_stats(
  dmc_synth_data,
  type = "delta_funs",
  minuends = "incomp",
  subtrahends = "comp",
  resample = TRUE
)
plot(deltas)

Plot Distributions of Predicted and Observed Data

Description

Visualizes observed and/or predicted response time distributions. Useful for assessing model fit or exploring model behavior.

Usage

## S3 method for class 'densities'
plot(
  x,
  ...,
  id = NULL,
  conds = NULL,
  col = NULL,
  xlim = NULL,
  ylim = NULL,
  xlab = "RT [s]",
  ylab = "Density",
  obs_stats = "hist",
  interval_obs = FALSE,
  interval_pred = TRUE
)

Arguments

x

an object of type = "densities", typically returned by calc_stats().

...

additional graphical arguments passed to plotting functions. See set_default_arguments() for the full list of supported options.

id

conds

a character vector specifying the conditions to plot. Defaults to all available conditions.

col

a character vector specifying colors for each condition. If a single color is provided, it is repeated for all conditions.

xlim

a numeric vector of length 2, specifying the x-axis limits.

ylim

a numeric vector of length 2, specifying the y-axis limits.

xlab, ylab

character strings for the x- and y-axis labels.

obs_stats

a character vector specifying which observed statistics to plot. Options include "hist" for histograms and "kde" for kernel density estimates. Defaults to "hist".

interval_obs, interval_pred

logicals; if TRUE and x contains a column named Estimate, error bars for observed data and shaded contours for predicted data are drawn, respectively.

Details

If x contains multiple IDs and no specific id is provided, the function aggregates across participants before plotting. You can provide a vector of ids to produce separate plots for each participant.

Observed densities are shown as histograms (default: gray shaded areas), or KDE lines (default: black, dotted). Predicted densities are shown as lines (default: colorized). Distributions associated with the upper boundary are shown with values > 0 (i.e., the upper part of the plot), distributions associated with the lower boundary are shown with values < 0 (i.e., the lower part of the plot).

Axis limits, colors, and styling options can be customized via .... If interval information is provided (i.e., the column Estimate exists in x), error bars or shading will be added, depending on the type of statistic.

A legend is only displayed if there is predicted data.

Value

Returns NULL invisibly. The function is called for its side effect of generating a plot.

Examples

# Example 1: Predicted densities only -------------------------------------
a_model <- dmc_dm()
dens <- calc_stats(a_model, type = "densities")
plot(dens, xlim = c(0, 1))
plot(dens, xlim = c(0, 1), conds = "comp")


# Example 2: Observed and predicted densities -----------------------------
obs_data(a_model) <- dmc_synth_data
dens <- calc_stats(a_model, type = "densities")
plot(dens, xlim = c(0, 1), conds = "comp", col = "blue")

# Example 3: Observed densities only --------------------------------------
dens <- calc_stats(dmc_synth_data, type = "densities")
plot(dens, conds = "comp", obs.hist.col = "green", alpha = 1)

# Example 4: With interval estimates --------------------------------------
dens <- calc_stats(dmc_synth_data, type = "densities", resample = TRUE)
plot(dens, interval_obs = TRUE, conds = "comp")

Plot Components of a Drift Diffusion Model

Description

This function generates plots for all components of a drift diffusion model (DDM), such as drift rate, boundary, and starting condition. Each component is plotted against the time or evidence space, allowing for visual inspection of the model's behavior across different conditions.

Usage

## S3 method for class 'drift_dm'
plot(x, ..., conds = NULL, col = NULL, xlim = NULL, bundle_plots = TRUE)

Arguments

x

an object of class drift_dm

...

additional graphical arguments passed to plotting functions. See set_default_arguments() for the full list of supported options.

conds

a character vector specifying the conditions to plot. Defaults to all available conditions.

col

a character vector specifying colors for each condition. If a single color is provided, it is repeated for all conditions.

xlim

a numeric vector of length 2, specifying the x-axis limits.

bundle_plots

logical, indicating whether to display separate panels in a single plot layout (FALSE), or to plot them separately (TRUE).

Details

The plot.drift_dm function provides an overview of key DDM components, which include:

mu_fun: Drift rate over time.
mu_int_fun: Integrated drift rate over time (if required by the specified solver of the model).
x_fun: Starting condition as a density across evidence values.
b_fun: Boundary values over time.
dt_b_fun: Derivative of the boundary function over time.
nt_fun: Non-decision time as a density over time.

Value

Nothing (NULL; invisibly)

Examples

# plot the component functions of the Ratcliff DDM
plot(ratcliff_dm())
plot(ratcliff_dm(var_non_dec = TRUE))
# Note: the variability in the drift rate for the Ratcliff DDM
# is not plotted! This is because it is not actually stored as a component
# function.

# plot the component functions of the DMC model
plot(dmc_dm(), col = c("green", "red"))

Plot MCMC Results and Diagnostics for `mcmc_dm` Objects

Description

Visualize MCMC results and diagnostics for mcmc_dm objects. The function plot.mcmc() is typically called when users supply an mcmc_dm object returned by estimate_dm() to the generic base::plot() function.

Usage

## S3 method for class 'mcmc_dm'
plot(x, ..., id = NULL, what = "trace", bundle_plots = TRUE)

Arguments

x

an object of class mcmc_dm, as returned by estimate_dm().

...

optional arguments passed on to the underlying plotting functions plot_mcmc_trace(), plot_mcmc_marginal(), and plot_mcmc_auto(). See the respective documentations for a list of optional arguments and the examples below. Probably the most relevant optional argument is which_prms that allows users to select a specific subset of parameters.

id

optional character vector, specifying the id(s) of participants to plot. If length(id) > 1, plot.mcmc_dm() is called recursively, iterating over each entry in id. Each id must match with the relevant dimension names of the used chains array stored in x.

what

a character string indicating the type of plot to produce. Must be either "trace", "density", or "auto". See the Details below. Default is "trace".

bundle_plots

logical, indicating whether to display separate panels in a single plot layout (FALSE), or to plot them separately (TRUE).

Details

This function provides diagnostic and summary visualizations of MCMC samples. It handles results from both hierarchical and non-hierarchical MCMC runs:

If id is provided, the plot refers to the requested participant, with MCMC results extracted at the individual level.
If id is omitted, plots refer to group-level parameters (i.e., the hyperparameters)

The following plot types are supported:

Trace plots (what = "trace"): These plots show sampled parameter values across MCMC iterations for each chain. They are primarily used to inspect convergence and mixing behavior. Ideally, all chains should appear well-mixed (i.e., they should overlap and sample in a similar range). Lack of convergence is indicated by chains that remain in separate regions or exhibit trends over time.
Density plots (what = "density"): These plots display smoothed marginal posterior distributions for each parameter, collapsed over chains and iterations. They are useful for understanding the central tendency, variance, and shape of the posterior distributions.
Autocorrelation plots (what = "auto"): These plots display the autocorrelation at different lags, averaged across chains. They are useful to judge how quickly the chains produced independent samples.

Value

Returns NULL invisibly.

Examples

# get an examplary `mcmc_dm` object
chains_obj <- get_example_fits("mcmc")
plot(chains_obj)
plot(chains_obj, what = "density")
plot(chains_obj, what = "density", which_prm = "b", bundle_plots = FALSE)

Plot Response Time Quantiles

Description

Visualizes response time quantiles for observed and/or predicted data across experimental conditions. This is useful for assessing model fit or exploring response patterns across conditions or participants.

Usage

## S3 method for class 'quantiles'
plot(
  x,
  ...,
  id = NULL,
  conds = NULL,
  dv = NULL,
  col = NULL,
  xlim = NULL,
  ylim = c(0, 1),
  xlab = "RT [s]",
  ylab = "F(RT)",
  interval_obs = TRUE,
  interval_pred = TRUE
)

Arguments

x

an object of type = "quantiles", typically returned by calc_stats().

...

additional graphical arguments passed to plotting functions. See set_default_arguments() for the full list of supported options.

id

conds

a character vector specifying the conditions to plot. Defaults to all available conditions.

dv

a character string indicating the dependent variable to plot. Defaults to the quantiles for the upper boundary.

col

a character vector specifying colors for each condition. If a single color is provided, it is repeated for all conditions.

xlim

a numeric vector of length 2, specifying the x-axis limits.

ylim

a numeric vector of length 2, specifying the y-axis limits.

xlab, ylab

character strings for the x- and y-axis labels.

interval_obs, interval_pred

logicals; if TRUE and x contains a column named Estimate, error bars for observed data and shaded contours for predicted data are drawn, respectively.

Details

If x contains multiple IDs and no specific id is provided, the function aggregates across participants before plotting.

Observed quantiles are shown as points, and predicted quantiles as lines. When interval = TRUE and the input includes interval estimates (i.e., the column Estimate exists), the plot includes error bars for observed data and shaded contours for model predictions.

Colors, symbols, and line styles can be customized via ....

Value

Returns NULL invisibly. The function is called for its side effect of generating a plot.

Examples

# Example 1: Model predictions only ---------------------------------------
a_model <- dmc_dm()
quantiles <- calc_stats(a_model, type = "quantiles")
plot(quantiles)
plot(quantiles, col = c("green", "red"), xlim = c(0.2, 0.6))

# Example 2: Observed and predicted data ----------------------------------
obs_data(a_model) <- dmc_synth_data
quantiles <- calc_stats(a_model, type = "quantiles")
plot(quantiles)

# Example 3: Observed data only -------------------------------------------
quantiles <- calc_stats(dmc_synth_data, type = "quantiles")
plot(quantiles)

# Example 4: Observed data with interval ----------------------------------
cafs <- calc_stats(dmc_synth_data, type = "quantiles", resample = TRUE)
plot(cafs)

Plot Multiple Statistics

Description

This function iterates over a list of statistics data, resulting from a call to calc_stats(), and subsequently plots each statistic. It allows for a simple arrangement of multiple plots on a single graphics device.

Usage

## S3 method for class 'stats_dm_list'
plot(x, ..., mfrow = NULL)

Arguments

x

an object of type stats_dm_list, which is essentially a list of multiple statistics, resulting from a call to calc_stats().

...

additional arguments passed to the plot function for each individual stats_dm object in x.

mfrow

an optional numeric vector of length 2, specifying the number of rows and columns for arranging multiple panels in a single plot (e.g., c(1, 3)). Plots are provided sequentially if NULL (default), using the current graphics layout of a user.

Details

The plot.stats_dm_list() function "merely" iterates over each entry of x and calls the respective plot() method. If dRiftDM doesn't provide a plot() method for an object stored in x, the respective entry is skipped and a message is displayed.

When users want more control over each plot, it is best to call the plot() function separately for each statistic in the list (e.g., plot(x$cafs); plot(x$quantiles))

Value

Nothing (NULL; invisibly)

Examples

# get a list of statistics for demonstration purpose
all_fits <- get_example_fits("fits_ids_dm")
stats <- calc_stats(all_fits, type = c("cafs", "quantiles"))

# then call the plot function.
plot(stats, mfrow = c(1, 2))

Plot Traces of a Drift Diffusion Model

Description

Creates a plot of simulated traces (i.e., simulated evidence accumulation processes) from a drift diffusion model. Such plots are useful for exploring and testing model behavior.

Usage

## S3 method for class 'traces_dm_list'
plot(
  x,
  ...,
  conds = NULL,
  col = NULL,
  col_b = NULL,
  xlim = NULL,
  ylim = NULL,
  xlab = "Time",
  ylab = "Evidence"
)

## S3 method for class 'traces_dm'
plot(x, ...)

Arguments

x

an object of type traces_dm_list or traces_dm, containing the traces to be plotted, resulting from a call to simulate_traces().

...

additional graphical arguments passed to plotting functions. See set_default_arguments() for the full list of supported options.

conds

a character vector specifying the conditions to plot. Defaults to all available conditions.

col

a character vector specifying colors for each condition. If a single color is provided, it is repeated for all conditions.

col_b

a character vector, specifying the color of the boundary for each condition. If a single color is provided, it is repeated for all conditions. Default is "black".

xlim

a numeric vector of length 2, specifying the x-axis limits.

ylim

a numeric vector of length 2, specifying the y-axis limits.

xlab, ylab

character strings for the x- and y-axis labels.

Details

plot.traces_dm_list() iterates over all conditions and plots the traces. It includes a legend with condition labels.

plot.traces_dm plots a single set of traces. Because simulate_traces() returns an object of type traces_dm_list per default, users will likely call plot.traces_dm_list() in most cases; and not plot.traces_dm. plot.traces_dm is only relevant if users explicitly extract and provide an object of type traces_dm.

The function automatically generates the upper and lower boundaries based on the information stored within x.

Value

NULL invisibly

Examples

# get a couple of traces for demonstration purpose
a_model <- dmc_dm()
some_traces <- simulate_traces(a_model, k = 3)

# Plots for traces_dm_list objects ----------------------------------------
# basic plot
plot(some_traces)

# a slightly more beautiful plot :)
plot(some_traces,
  col = c("green", "red"),
  xlim = c(0, 0.35),
  xlab = "Time [s]",
  ylab = bquote(Realizations ~ of ~ X[t]),
  legend_pos = "bottomright"
)

# Plots for traces_dm objects ---------------------------------------------
# we can also extract a single set of traces and plot them
one_set_traces <- some_traces$comp
plot(one_set_traces)

# modifications to the plot work in the same way
plot(one_set_traces,
  col = "green",
  xlim = c(0, 0.35),
  xlab = "Time [s]",
  ylab = bquote(Realizations ~ of ~ X[t]),
  legend = "just comp"
)

Plot MCMC Chains for Drift Diffusion Model Parameters

Description

The functions provide visualizations of MCMC results obtained for drift_dm() objects. Users won't call the functions directly. Instead, they are called via plot.mcmc_dm(), and the following documentation helps to define optional arguments that can be passed on via ....

Usage

plot_mcmc_trace(
  chains,
  col_palette = grDevices::rainbow,
  col_chains = NULL,
  which_prms = NULL,
  xlab = NULL,
  ylab = NULL,
  xlim = NULL,
  ylim = NULL
)

plot_mcmc_marginal(
  chains,
  col_line = NULL,
  col_shade = NULL,
  which_prms = NULL,
  xlab = NULL,
  ylab = NULL,
  xlim = NULL,
  ylim = NULL
)

plot_mcmc_auto(
  chains,
  lags = 1:30,
  col_line = NULL,
  which_prms = NULL,
  xlab = NULL,
  ylab = NULL,
  xlim = NULL,
  ylim = NULL,
  type = NULL,
  main = NULL
)

Arguments

chains

an array of MCMC samples with three dimensions (parameters × chains × iterations; see also plot.mcmc_dm()). This argument is not optional and will be provided by plot.mcmc_dm().

col_palette

a function to generate a color palette for chains (default is grDevices::rainbow). Must be callable like this col_palette(n).

col_chains

a character vector, defining colors(s) to override col_palette for chain lines. Can be a single value or a vector matching the number of chains. Recycled if necessary to match the number of chains.

which_prms

a regular expression (string), used to select a subset of parameters to plot. For example, "^v" would match all parameters starting with v. See also the examples in plot.mcmc_dm().

xlab, ylab

character vector(s), specifying axis labels for each panel. Recycled if necessary to match the number of plots being generated.

xlim, ylim

list(s), containing length-2 numeric vectors specifying x- and y-axis limits, respectively. Recycled if necessary to match the number of plots being generated.

col_line

a character vector, defining color(s) for the line outlines in the density plots. Can be a single value or a vector matching the number of parameters to plot. Recycled if necessary to match the number of parameters being plotted.

col_shade

a character vector, defining color(s) for the shaded areas in the density plots. Can be a single value or a vector matching the number of parameters to plot. Recycled if necessary to match the number of parameters being plotted.

lags

a numeric vector, giving the lags at which to calculate autocorrelation (see also coda::autocorr.diag() and coda::autocorr()).

type

a character, specifying how autocorrelations shall be displayed. Defaults to "h" (see the type argument of base::plot()). Recycled if necessary to match the number of parameters being plotted.

main

character vector, specifying the labels above each autocorrelation plot. Defaults to parameter labels. Recycled if necessary to match the number of parameters being plotted.

Details

plot_mcmc_trace() plots one panel per parameter, with lines for each chain showing how values evolve over iterations. This is useful for diagnosing convergence and mixing.
plot_mcmc_marginal() plots smoothed marginal posterior densities collapsed over chains and iterations for each parameter, useful for inspecting posterior distributions.
plot_mcmc_auto() plots the autocorrelation among samples as function of lags.

Value

These functions are called for their side effects (producing plots). They return NULL invisibly.

Print Functions for Stats Objects

Description

when calling calc_stats(), each returned statistic will be a subclass of stats_dm and data.frame. The following print() methods will call the more generic print.stats_dm() function.

Usage

## S3 method for class 'cafs'
print(x, ...)

## S3 method for class 'basic_stats'
print(x, ...)

## S3 method for class 'quantiles'
print(x, ...)

## S3 method for class 'delta_funs'
print(x, ...)

## S3 method for class 'fit_stats'
print(x, ...)

## S3 method for class 'sum_dist'
print(x, ...)

Arguments

x

a subclass of data.frame, as returned by calc_stats().

...

additional arguments passed forward to print.stats_dm().

Value

x (invisibly)

Summary and Printing for `fits_agg_dm` Objects

Description

Methods for summarizing and printing objects of the class fits_agg_dm, which contain model fits based on aggregated data across participants.

Usage

## S3 method for class 'summary.fits_agg_dm'
print(x, ..., just_header = FALSE, round_digits = drift_dm_default_rounding())

## S3 method for class 'fits_agg_dm'
summary(object, ..., select_unique = FALSE)

Arguments

x

an object of class summary.fits_agg_dm.

...

additional arguments (currently unused).

just_header

logical, if TRUE only print the header information without details. Default is FALSE.

round_digits

an integer, specifying the number of decimal places for rounding in the printed summary. Default is 3.

object

an object of class fits_agg_dm, typically generated by a call to estimate_dm.

select_unique

logical, passed to coef.drift_dm().

Details

The summary.fits_agg_dm function creates a structured summary of a fits_agg_dm object, containing:

summary_drift_dm_obj: A list with information about the underlying drift diffusion model (as returned by summary.drift_dm()).
prms: Parameter estimates obtained from the model fit. This is equivalent to a call to coef.drift_dm() on the stored model object.
obs_data: A list providing the number of individual participants and the average number of trials per condition across participants.

The print.summary.fits_agg_dm function formats and prints the above summary in a human-readable form.

Value

summary.fits_agg_dm() returns a list of class summary.fits_agg_dm (see Details for its structure).

print.summary.fits_agg_dm() returns the input object invisibly.

Examples

# Load example fit object
fits_agg <- get_example_fits("fits_agg")
sum_obj <- summary(fits_agg)
print(sum_obj, round_digits = 2)

Summary and Printing for fits_ids_dm Objects

Description

Methods for summarizing and printing objects of the class fits_ids_dm, which contain multiple fits across individuals.

Usage

## S3 method for class 'summary.fits_ids_dm'
print(x, ..., just_header = FALSE, round_digits = drift_dm_default_rounding())

## S3 method for class 'fits_ids_dm'
summary(object, ..., select_unique = FALSE)

Arguments

x

an object of class summary.fits_ids_dm.

...

additional arguments (currently unused).

just_header

logical, if TRUE only print the header information without details. Default is FALSE.

round_digits

an integer, specifying the number of decimal places for rounding in the printed summary. Default is 3.

object

an object of class fits_ids_dm, generated by a call to load_fits_ids.

select_unique

logical, passed to coef.drift_dm().

Details

The summary.fits_ids_dm function creates a summary object. The contents of this summary object depends on whether the user supplies a fits_ids_dm object that was created with estimate_dm() or the deprecated function load_fits_ids().

In the first case, the object contains:
summary_drift_dm_obj: A list with information about the underlying drift diffusion model (as returned by summary.drift_dm()).
prms: All parameter values across all conditions (essentially a call to coef() with the argument select_unique = FALSE).
stats: A named list of matrices for each condition, including mean and standard error for each parameter.
obs_data: A list providing the number of individual participants and the average number of trials per condition across participants.
optimizer: A string of the optimizer that was used
conv_info: A list providing a summary of the convergance and messages for all IDs
In the second case, the object contains:
lower and upper: Lower and upper bounds of the search space.
model_type: Description of the model type, based on class information.
prms: All parameter values across all conditions (essentially a call to coef() with the argument select_unique = FALSE).
stats: A named list of matrices for each condition, including mean and standard error for each parameter.
N: The number of individuals.

The print.summary.fits_ids_dm function displays the summary object in a formatted manner.

Value

summary.fits_ids_dm() returns a list of class summary.fits_ids_dm (see the Details section summarizing each entry of this list).

print.summary.fits_ids_dm() returns invisibly the summary.fits_ids_dm object.

Examples

# get an auxiliary object of type fits_ids_dm for demonstration purpose
all_fits <- get_example_fits("fits_ids_dm")
sum_obj <- summary(all_fits)
print(sum_obj, round_digits = 2)

Generate Parameter-Condition Labels

Description

Creates a vector of labels from a parameter-condition combination matrix, resulting from a call to prms_cond_combo. Used, for instance, in coef.drift_dm.

Usage

prm_cond_combo_2_labels(prms_cond_combo, sep = ".")

Arguments

prms_cond_combo

a 2-row character matrix where each column represents a unique parameter-condition combination.

sep

Separator for parameter and condition labels (default: "~").

Value

A vector of labels with as many entries as the columns of prms_cond_combo had, combining parameter and condition (if necessary).

If the parameter labels are already unique (because all parameters do not vary across conditions or are selectively used for one condition), then only these parameter labels are returned

Unique Conditions-Parameter Combinations

Description

This is a helper function. It searches through the linear_internal_list of the stored flex_prms object, and keeps the first unique appearance of parameters. For example, when the parameter muc is equal for comp, neutral, and incomp, the function will provide the info "muc" and "comp", thus dropping incomp and neutral, where the parameter is the same.

Usage

prms_cond_combo(drift_dm_obj)

Arguments

drift_dm_obj

an object of type drift_dm

Value

a matrix with two rows. Each column contains a combination of the parameter name and the condition that can be considered unique. Parameter names are stored in the first row, condition labels in the second.

Extract the conditions and parameters from an instruction string

Description

This function takes an instruction as a string and then extracts the conditions and parameters, depending on what "type" of operation it is. The expected structure is listed in the details docu of flex_prms()

Usage

prms_conds_to_modify(formula_instr, operation, all_conds, all_prms)

Arguments

formula_instr

an instruction string

operation

what to expect in terms of the string's structure. Can be "vary", "restrain", "fix", "set" or "dependency"

all_conds

all potential conditions (necessary for extending missing condition specification)

all_prms

all potential paramters (necessary for extending missing condition specification)

Value

a named list prms_to_adress and conds_to_adress as character vectors.

The Parameters for Deriving Model Predictions

Description

Functions to get or set the "solver settings" of an object. This includes the diffusion constant and the discretization of the time and evidence space.

Usage

prms_solve(object, ...) <- value

## S3 replacement method for class 'drift_dm'
prms_solve(object, ..., eval_model = FALSE) <- value

prms_solve(object, ...)

## S3 method for class 'drift_dm'
prms_solve(object, ...)

## S3 method for class 'fits_ids_dm'
prms_solve(object, ...)

## S3 method for class 'fits_agg_dm'
prms_solve(object, ...)

Arguments

object

an object of type drift_dm, fits_ids_dm, or fits_agg_dm (see estimate_dm()).

...

additional arguments (i.e., eval_model).

value

a named numeric vector providing new values for the prms_solve vector (see drift_dm()).

eval_model

logical, indicating if the model should be re-evaluated or not when updating the solver settings (see re_evaluate_model). Default is FALSE.

Details

prms_solve() is a generic accessor function, and ⁠prms_solve<-()⁠ is a generic replacement function. The default methods get and set the "solver settings".

It is possible to update parts of the "solver setttings" (i.e., parts of the underlying prms_solve vector). However, modifying "nx" or "nt" is not allowed! Any attempts to modify the respective entries will silently fail (no explicit error/warning etc. is thrown).

Value

For prms_solve() the vector prms_solve (see drift_dm()).

For ⁠prms_solve<-()⁠ the updated drift_dm object.

Note

There is only a replacement function for drift_dm objects. This is because replacing the solver settings after the model has been fitted (e.g., for a fits_ids_dm object) doesn't make sense.

Examples

# get some default model to demonstrate the prms_solve() functions
my_model <- ratcliff_dm()
# show the discretization and scaling of the model
prms_solve(my_model)
# partially modify these settings
prms_solve(my_model)[c("dx", "dt")] <- c(0.005)
prms_solve(my_model)

# accessor method also available for fits_ids_dm objects
# (see estimate_model_ids)
# get an exemplary fits_ids_dm object
fits <- get_example_fits("fits_ids_dm")
prms_solve(fits)

Format Parameters as String

Description

Converts parameter values into a formatted string.

Usage

prms_to_str(x, prms = NULL, round_digits = NULL, sep = "=>", collapse = "\n")

Arguments

x

a drift_dmobject or character vector for labels.

prms

Numeric vector of values (used if x is character).

round_digits

Rounding precision (default set by drift_dm_default_rounding()).

sep

Separator between names and values (default: "=>").

collapse

String to separate each name-value pair (default: "\n").

Value

A single formatted string with parameter names and values. (e.g., "a => 0 \n b => 1")

Create a Basic Diffusion Model

Description

This function creates a drift_dm model that corresponds to the basic Ratcliff Diffusion Model

Usage

ratcliff_dm(
  var_non_dec = FALSE,
  var_start = FALSE,
  var_drift = FALSE,
  instr = NULL,
  obs_data = NULL,
  sigma = 1,
  t_max = 3,
  dt = 0.0075,
  dx = 0.02,
  solver = "kfe",
  b_coding = NULL
)

Arguments

var_non_dec, var_start, var_drift

logical, indicating whether the model should have a variable non-decision time , starting point (uniform), or drift rate (normally-distributed). (see also nt_uniform and x_uniform in component_shelf)

instr

optional string with "instructions", see modify_flex_prms().

obs_data

data.frame, an optional data.frame with the observed data. See obs_data.

sigma, t_max, dt, dx

numeric, providing the settings for the diffusion constant and discretization (see drift_dm)

solver

character, specifying the solver.

b_coding

list, an optional list with the boundary encoding (see b_coding)

Details

The classical Ratcliff Diffusion Model is a diffusion model with a constant drift rate muc and a constant boundary b. If var_non_dec = FALSE, a constant non-decision time non_dec is assumed, otherwise a uniform non-decision time with mean non_dec and range range_non_dec. If var_start = FALSE, a constant starting point centered between the boundaries is assumed (i.e., a dirac delta over 0), otherwise a uniform starting point with mean 0 and range range_start. If var_drift = FALSE, a constant drift rate is assumed, otherwise a normally distributed drift rate with mean mu_c and standard deviation sd_muc (can be computationally intensive). Important: Variable drift rate is only possible with dRiftDM's mu_constant function. No custom drift rate is yet possible in this case.

Value

An object of type drift_dm (parent class) and ratcliff_dm (child class), created by the function drift_dm().

References

Ratcliff R (1978). “A theory of memory retrieval.” Psychological Review, 85(2), 59–108. doi:10.1037/0033-295X.85.2.59.

Examples

# the model with default settings
my_model <- ratcliff_dm()

# the model with a variable non-decision time and with finer space
# discretization
my_model <- ratcliff_dm(var_non_dec = TRUE, dx = .01)

A synthetic data set with one condition

Description

This dataset was simulated by using the classical Ratcliff diffusion model (see ratcliff_dm()).

Usage

ratcliff_synth_data

Format

A data frame with 300 rows and 3 columns:

RT: Response Times
Error: Error Coding (Error Response = 1; Correct Response = 0)
Cond: Condition ('null')

Re-evaluate the model

Description

Updates the PDFs of a model. If observed data is available (e.g., via the obs_data entry or the stats_agg entry; depending on the cost_function, see also drift_dm()), the cost_value is also updated.

Usage

re_evaluate_model(drift_dm_obj, eval_model = TRUE)

Arguments

drift_dm_obj

an object of type drift_dm

eval_model

logical, indicating if the model should be evaluated or not. If FALSE, PDFs and the value of the cost function are deleted from the model. Default is True.

Details

More in-depth information about the mathematical details for deriving the PDFs can be found in Richter et al. (2023)

Value

Returns the passed drift_dm_obj object, after (re-)calculating the PDFs and (if observed data is set) the cost_value.

the PDFs an be addressed via drift_dm_obj$pdfs
the cost_value can be addressed via drift_dm_obj$cost_value

Note that if re_evaluate model is called before observed data was set, the function silently updates the pdfs, but not cost_value.

Examples

# choose a pre-built model (e.g., the Ratcliff model)
# and set the discretization as needed
my_model <- ratcliff_dm()

# then calculate the model's predicted PDF
my_model <- re_evaluate_model(my_model)
str(my_model$pdfs) # show the structure of the attached pdfs

# if you want the cost_function, make sure some data is attached to the
# model (see also the documentation of obs_data())
obs_data(my_model) <- ratcliff_synth_data # this data set comes with dRiftDM
my_model <- re_evaluate_model(my_model)
str(my_model$pdfs)
print(my_model$cost_value)

Remove flags added when calling estimate_classical

Description

This is a small internal function that ensures that the flags added after estimating a model via estimate_classical() are removed. It is used when replacing settings of a model after it has been estimated.

Usage

remove_estimate_info(drift_dm_obj)

Arguments

drift_dm_obj

a drift_dm object

Value

the model without the list entry estimate_info

Internal Helpers for Resampling of Summary Statistics

Description

These functions support the construction of intervals for descriptive statistics computed from observed or simulated data. They are used internally by stats_resample_dm() methods.

Usage

stats_resample_wrapper(
  obs_data_split,
  one_set_idxs = NULL,
  type,
  b_coding,
  ...
)

resample_assemble(resample_list, level, original)

Arguments

obs_data_split

a named list of data.frames, containing a single set of observed data, splitted by condition

one_set_idxs

a named list of numeric vectors. Each entry contains indices to shuffle the data.frames in obs_data_split. Default NULL keeps obs_data_split as is.

type

a character, passed to calc_stats_pred_obs().

b_coding

a list with boundary coding information, , required to wrangle rts to match with calc_stats_pred_obs().

...

additional arguments passed to calc_stats_pred_obs().

resample_list

a list of statistics returned by calls to stats_resample_wrapper().

level

a numeric between 0 and 1, controlling the width of the interval.

original

a stats_dm object representing the statistic computed from the original data set or model prediction.

Details

stats_resample_wrapper() wraps a call to calc_stats_pred_obs() for use in resampling.

resample_assemble() takes a list of resampled statistics and the original statistic, and computes lower and upper bounds based on the requested level. It returns a stats_dm object with an added Estimate column.

Value

stats_resample_wrapper() returns a single stats_dm object for one sample.
resample_assemble() returns a stats_dm object containing the lower and upper interval bounds along with the original estimate.

Set aggregated data to a model object

Description

Helper function that aggregates a data set across all participants and attaches the resulting group-level summary statistics to a drift_dm_obj. This is required when the user wants to fit aggregated data (e.g., via the RMSE cost function).

Usage

set_agg_data(drift_dm_obj, obs_data_ids, ...)

Arguments

drift_dm_obj

a model object (of class drift_dm) to which the aggregated data will be attached.

obs_data_ids

a data.frame containing individual-level observations. Must include an ID column identifying participants.

...

optional arguments (currently supported are n_bins and probs which are relevant when using the rmse cost function)

Details

This function is intended for internal use and is called by estimate_dm() when aggregated model fitting is requested.

Value

The updated drift_dm_obj with aggregated stats_agg and obs_data set to NULL.

Set default graphical parameters for plotting in dRiftDM

Description

This function sets/updates graphical parameters (passed through ... by ⁠plot.*()⁠ methods of dRiftDM) and handles default settings. It supports arguments relevant to graphics::plot(), graphics::points(), graphics::legend(), and graphics::arrows(), allowing users fine-grained control over plotting while avoiding argument clashes. This is an internal function, so users won't call it directly. However, the Details below list all optional arguments that users can provide when calling the ⁠plot.*⁠ methods

Usage

set_default_arguments(dots, leg, id)

Arguments

dots

a named list of graphical parameters (created from ... when passing additional arguments to ⁠plot.*()⁠ methods of dRiftDM).

leg

a character vector, used to set default legend entries.

id

a character or numeric participant value.

Details

The following list shows possible arguments and default values for...

shades, lines, points:

obs.pch: plotting symbol for points of observed data (default: 19)
obs.pt.bg: background color for points (default: NA)
pred.lty: line type for lines of predicted data (default: 1)
pred.lwd: line width for lines (default: 1)
alpha: alpha transparency for shaded prediction intervals (default: 0.2)
b.lty: line type for decision boundaries (default: 1)
b.lwd: line width for decision boundaries (default: 1)

text and scaling:

cex.axis, cex.lab, cex.main, cex.pt, cex.legend: text and point scaling factors (default: 1)
main: plot title (default: NULL, with id appended if applicable)
family: font family (default: empty string)

legend:

legend: legend labels (default: leg)
legend.pos: position of the primary legend (default: "bottomright", it is also possible to specify a numeric vector for legend.pos, providing the exact x- and y-position)
legend.bg: background color of the legend box (default: "white")
bty: box type for the legend (default: "o")
box.lwd: border line width of the legend box (default: 1)
box.col: border color of the legend box (default: "black")
box.lty: border line type of the legend box (default: 1)
lines.legend: optional labels for a separate legend for line types (only relevant for plot.densities(), default: NULL)
lines.legend.pos: position of that separate legend (default: "topright", passing a vector with exact x- and y-positions for lines.legend is possible)

error bars:

err.width: width of caps on error bars (default: 0.05)
err.col: color of error bars (default: NULL; resolved to line/point colors, black or gray, depending on the plot type; recycled across conditions)
err.eps: threshold under which an error bar is treated as too small to display (default: ⁠0.1%⁠ of the y-axis range)

observed histogram and KDE (for plot of RT distributions/densities):

obs.hist.col: fill color for histograms (default: "gray20")
obs.kde.col: color of KDE lines (default: "black")
obs.kde.lty: line type of KDE lines (default: 2)
obs.kde.lwd: line width of KDE lines (default: 1)

ETC.:

horiz.col: color of a horizontal axis line at y = 0 (default: "gray")
horiz.lwd: color of a horizontal axis line at y = 0 (default: 1)

Value

A list with all updated graphical parameters, ready to be passed to plotting functions. (done internally, not by the user)

Examples

# This is not an example of calling set_default_arguments() directly
# (because it is an internal function), but it illustrates how user-supplied
# plotting arguments are processed via this helper.

some_data <- ulrich_flanker_data
some_stats <- calc_stats(some_data, type = "quantiles")

# See also ?plot.quantiles for more detail
plot(
  some_stats,
  obs.pch = 21, # optional argument 1: point type
  obs.pt.bg = "black", # optional argument 2: point background
  legend = c("foo", "bar") # optional argument 3: custom legend labels
)
# for a full list of optional arguments, see the Details above

Set Default Colors

Description

This function assigns default colors if none are provided or adjusts the color vector to match the number n

Usage

set_default_colors(colors, n)

Arguments

colors

character vector, specifying colors for conditions. If NULL, default colors are used.

n

Value

A character vector of colors of length n.

Set one specific aspect of the solver settings

Description

Internal function to update one aspect of prms_solve or solver.

Usage

set_one_solver_setting(drift_dm_obj, name_prm_solve, value_prm_solve)

Arguments

drift_dm_obj

an object of type drift_dm

name_prm_solve

which aspect to address? ("sigma", "t_max", "dx", "dt", "solver")

value_prm_solve

either a single numeric or character string

Details

Ensures that the supplied values are reasonable and that nx and nt are updated. The functions prms_solve<- and solver<- pass their arguments forward to this function.

Value

the updated un-evaluated (!) drift_dm_obj object

Simulate Synthetic Responses

Description

This function simulates data based on the provided model. To this end, random samples from the predicted PDFs are drawn via approximate inverse CDF sampling.

Usage

simulate_data(object, ...)

## S3 method for class 'drift_dm'
simulate_data(
  object,
  ...,
  n,
  conds = NULL,
  k = 1,
  lower = NULL,
  upper = NULL,
  df_prms = NULL,
  seed = NULL,
  progress = 1
)

Arguments

object

an object inheriting from drift_dm.

...

further arguments passed on to other functions, i.e., simulate_values() and simulate_one_data_set(). This allows users to control the distribution from which original parameter values are drawn (if k > 0) and the number of decimal places that the simulated RTs should have. If users want to use a different distribution than uniform for simulate_values(), they must provide the additional arguments (e.g., means and sds) in a format like lower/upper.

n

numeric, the number of trials per condition to draw. If a single numeric, then each condition will have n trials. Can be a (named) numeric vector with the same length as there are conditions to allow a different number of trials per condition.

conds

character vector, specifying the conditions to sample from. Default NULL is equivalent to conds(object).

k

numeric larger than 0, indicating how many data sets shall be simulated. If > 1, users must specify lower/upper.

lower, upper

vectors or a list, specifying the simulation space for each parameter of the model (see Details). Only relevant for k > 1

df_prms

an optional data.frame providing the parameters that should be used for simulating the data. df_prms must provide column names matching with (coef(object, select_unique = TRUE)), plus a column ID that will identify each simulated data set.

seed

a single numeric, an optional seed for reproducible sampling

progress

an integer, indicating if information about the progress should be displayed. 0 -> no information, 1 -> a progress bar. Default is 1. Only effective when k > 1.

Details

simulate_data is a generic function for simulating data based on approximate inverse CDF sampling. CDFs are derived from the model's PDFs and data is drawn by mapping samples from a uniform distribution (in [0, 1]) to the values of the CDF. Note that sampled response times will correspond to the values of the time space (i.e., they will correspond to seq(0, t_max, dt), see drift_dm).

For drift_dm objects, the behavior of simulate_data depends on k. If k = 1 and no lower/upper or df_prms arguments are supplied, then the parameters currently set to the model are used to generate the synthetic data. If k > 1, then k parameter combinations are either randomly drawn via simulate_values or gathered from the provided data.frame df_prms, and then data is simulated for each parameter combination.

When specifying lower/upper, parameter combinations are simulated via simulate_values. This comes in handy for simple parameter recovery exercises. If df_prms is specified, then the parameter combinations from this data.frame is used. Note that the column names in df_prms must match with the (unique) parameter combinations of the model (see print(coef(object)))

Details on how to specify `lower/upper`.

When users want to simulate data with k > 1 and lower/upper, then parameter values have to be drawn. One great aspect about the flex_prms object within each drift_dm model, is that users can easily allow certain parameters to vary freely across conditions. Consequently, the actual number of parameters varies with the settings of the flex_prms object. In many cases, however, the simulation space for a parameter is the same across conditions. For instance, in a model, the parameter "mu" may vary across the conditions "easy", "medium", or "hard", but the lower/upper limits are the same across conditions. To avoid that users always have to re-specify the simulation space via the lower/upper arguments, the lower and upper arguments refer to the parameter labels, and dRiftDM figures out how to map these to all parameters that vary across conditions.

Here is an example: Assume you have the model with parameters "A" and "B", and the conditions "foo" and "bar". Now assume that "A" is allowed to vary for "foo" and "bar". Thus, there are actually three parameters; "A~foo", "A~bar", and "B". dRiftDM, however, can help with this. If we provide lower = c(A = 1, B = 2), upper = c(A = 3, B = 4), simulate_data checks the model, and creates the vectors temp_lower = c(1,1,2) and temp_upper = c(3,3,4) as a basis to simulate the parameters.

Users have three options to specify the simulation space:

Plain numeric vectors (not very much recommended). In this case, lower/upper must be sorted in accordance with the free parameters in the flex_prms_obj object (call ⁠print(<model>)⁠ and have a look at the ⁠Parameter Settings⁠ output)
Named numeric vectors. In this case lower/upper have to provide labels in accordance with the parameters that are considered "free" at least once across conditions.
The most flexible way is when lower/upper are lists. In this case, the list requires an entry called "default_values" which specifies the named or plain numeric vectors as above. If the list only contains this entry, then the behavior is as if lower/upper were already numeric vectors. However, the lower/upper lists can also provide entries labeled as specific conditions, which contain named (!) numeric vectors with parameter labels. This will modify the value for the upper/lower parameter space with respect to the specified parameters in the respective condition.

Value

The return value depends on whether a user specifies lower/upper or df_prms. If none of these are specified and if k = 1, then a data.frame containing the columns RT, Error, and Cond is returned.

If lower/upper or df_prms are provided, then a list with entries synth_data and prms is returned. The entry synth_data contains a data.frame, with the columns RT, ⁠<b_column>⁠, Cond, and ID (the name of the second column, ⁠<b_column>⁠, depends on the b_coding of the model object). The entry prms contains a data.frame with an ID column and the parameters used for simulating each synthetic data set.

Note

A function for fits_ids_dm will be provided in the future.

Examples

# Example 1 ----------------------------------------------------------------
# get a pre-built model for demonstration
a_model <- ratcliff_dm()

# define a lower and upper simulation space
lower <- c(1, 0.4, 0.1)
upper <- c(6, 0.9, 0.5)

# now simulate 5 data sets with each 100 trials
data_prms <- simulate_data(a_model,
  n = 100, k = 5, lower = lower,
  upper = upper, seed = 1, progress = 0
)
head(data_prms$synth_data)
head(data_prms$prms)

# Example 2 ----------------------------------------------------------------
# more flexibility when defining lists for lower and upper
# get a pre-built model, and allow muc to vary across conditions
a_model <- dmc_dm(instr = "muc ~ ")

# define a lower and upper simulation space
# let muc vary between 2 and 6, but in incomp conditions, let it vary
# between 1 and 4
lower <- list(
  default_values = c(
    muc = 2, b = 0.4, non_dec = 0.1,
    sd_non_dec = 0.01, tau = 0.02, A = 0.05,
    alpha = 3
  ),
  incomp = c(muc = 1)
)
upper <- list(
  default_values = c(
    muc = 6, b = 0.9, non_dec = 0.4,
    sd_non_dec = 0.15, tau = 0.15, A = 0.15,
    alpha = 7
  ),
  incomp = c(muc = 4)
)

data_prms <- simulate_data(a_model,
  n = 100, k = 5, lower = lower,
  upper = upper, seed = 1, progress = 0
)
range(data_prms$prms$muc.comp)
range(data_prms$prms$muc.incomp)

Simulate one data set

Description

Function that simulates a single data based on a model (using the prms set

Usage

simulate_one_data_set(drift_dm_obj, n, conds = NULL, round_to = NULL)

Arguments

drift_dm_obj

a drift_dm object

n

integer, specifying the number of trials per condition. Can be a single integer, or a (named) integer vector with the same length as conds

conds

character vector, specifying the conditions to sample from. Default NULL is equivalent to conds(drift_dm_obj)

round_to

integer, specifying the number of decimal places that the simulated RTs should have. Default is 3L.

Value

A data.frame with the columns "RT", "<b_column>", and "Cond"; and with n rows.

Simulate Trajectories/Traces of a Model

Description

Simulates single trajectories/traces of a model (i.e., evidence accumulation processes) using forward Euler.

Might come in handy when exploring the model's behavior or when creating figures (see also plot.traces_dm_list)

Usage

simulate_traces(object, k, ...)

## S3 method for class 'drift_dm'
simulate_traces(
  object,
  k,
  ...,
  conds = NULL,
  add_x = FALSE,
  sigma = NULL,
  seed = NULL,
  unpack = FALSE
)

## S3 method for class 'fits_ids_dm'
simulate_traces(object, k, ...)

## S3 method for class 'fits_agg_dm'
simulate_traces(object, k, ...)

## S3 method for class 'traces_dm_list'
print(x, ..., round_digits = drift_dm_default_rounding(), print_steps = 5)

## S3 method for class 'traces_dm'
print(
  x,
  ...,
  round_digits = drift_dm_default_rounding(),
  print_steps = 5,
  print_k = 4
)

Arguments

object

an object of type drift_dm, fits_ids_dm, or fits_agg_dm (see estimate_dm()).

k

numeric, the number of traces to simulate per condition. Can be a named numeric vector, to specify different number of traces per condition.

...

additional arguments passed forward to the respective method.

conds

optional character vector, conditions for which traces shall be simulated. If NULL, then traces for all conditions are simulated.

add_x

logical, indicating whether traces should contain a variable starting point. If TRUE, samples from x_fun (see comp_vals) are added to each trace. Default is FALSE.

sigma

optional numeric, providing a value >= 0 for the diffusion constant "sigma" to temporally override prms_solve. Useful for exploring the model without noise.

seed

optional numerical, a seed for reproducible sampling

unpack

logical, indicating if the traces shall be "unpacked" (see also unpack_obj and the return value below).

x

an object of type traces_dm_list or traces_dm, resulting from a call to simulate_traces.

round_digits

integer, indicating the number of decimal places (round) to be used when printing out the traces (default is 3).

print_steps

integer, indicating the number of steps to show when printing out traces (default is 5).

print_k

integer, indicating how many traces shall be shown when printing out traces (default is 4).

Details

simulate_traces() is a generic function, applicable to objects of type drift_dm or fits_ids_dm (see load_fits_ids).

For drift_dm objects, simulate_traces() performs the simulation on the parameter values currently set (see coef.drift_dm()).

For fits_ids_dm objects, simulate_traces() first extracts the model and all parameter values for all IDs (see coef.fits_ids_dm()). Subsequently, simulations are based on the averaged parameter values.

The algorithm for simulating traces is forward euler. See Richter et al. (2023) and Ulrich et al. (2015) (Appendix A) for more information.

Value

simulate_traces() returns either an object of type traces_dm_list, or directly a list of matrices across conditions, containing the traces (if unpack = TRUE). If the model has only one condition (and unpack = TRUE), then the matrix of traces for this one condition is directly returned.

The returned list has as many entries as conditions requested. For example, if only one condition is requested via the conds argument, then the list is of length 1 (if unpack = FALSE). If conds is set to NULL (default), then the list will have as many entries as conditions specified in the supplied object (see also conds). If unpack = FALSE, the list contains an additional attribute with the time space.

Each matrix of traces has k rows and nt + 1 columns, stored as an array of size (k, nt + 1). Note that nt is the number of steps in the discretization of time; see drift_dm. If unpack = FALSE, the array is of type traces_dm. It contains some additional attributes about the time space, the drift rate, the boundary, the added starting values, if starting values were added, the original model class and parameters, the boundary coding, and the solver settings.

The print methods print.traces_dm_list() and print.traces_dm() each invisibly return the supplied object x.

Note

Evidence values with traces beyond the boundary of the model are set to NA before passing them back.

The reason why simulate_traces passes back an object of type traces_dm_list (instead of simply a list of arrays) is to provide a plot.traces_dm_list and print.traces_dm_list function.

Users can unpack the traces even after calling simulate_traces() using unpack_obj().

Examples

# get a pre-built model to demonstrate the function
my_model <- dmc_dm()
some_traces <- simulate_traces(my_model, k = 1, seed = 1)
print(some_traces)

# a method is also available for fits_ids_dm objects
# (see estimate_model_ids)
# get an exemplary fits_ids_dm object
fits <- get_example_fits("fits_ids_dm")
some_traces <- simulate_traces(fits, k = 1, seed = 1)
print(some_traces)

# we can also print only the traces of one condition
print(some_traces$comp)

Simulate Traces for One Conditions

Description

The function simulates traces with forward Euler. It is the backend function to simulate_traces.

Usage

simulate_traces_one_cond(drift_dm_obj, k, one_cond, add_x, sigma)

Arguments

drift_dm_obj

a model of type drift_dm

k

a single numeric, the number of traces to simulate

one_cond

a single character string, specifying which condition shall be simulated

add_x

a single logical, indicating if starting values shall be added or not. Sometimes, when visualizing the model, one does not want to have the starting values.

sigma

a single numeric, to override the "sigma" in prms_solve

Value

An array of size k times nt + 1. The array becomes an object of type traces_dm, which allows for easier printing with print.traces_dm. Furthermore, each object has the additional attributes:

"t_vec" -> the time space from 0 to t_max
"mu_vals" -> the drift rate values by mu_fun
"b_vals" -> the boundary values by b_fun
"samp_x" -> the values of the starting points (which are always added to the traces in the array.
"add_x" -> boolean, indicating if the starting values were added or not
"orig_model_class" -> the class label of the original model
"orig_prms" -> the parameters with which the traces were simulated (for the respective condition)
"b_coding" -> the boundary coding
"prms_solve" -> the solver settings with which the traces were simulated

Simulate Values

Description

Draw values, most likely model parameters.

Usage

simulate_values(
  lower,
  upper,
  k,
  distr = NULL,
  cast_to_data_frame = TRUE,
  add_id_column = "numeric",
  seed = NULL,
  ...
)

Arguments

lower, upper

Numeric vectors, indicating the lower/upper boundary of the drawn values.

k

Numeric, the number of values to be drawn for each value pair of lower/upper. If named numeric, the labels are used for the column names of the returned object

distr

Character, indicating which distribution to draw from. Currently available are: "unif" for a uniform distribution or "tnorm" for a truncated normal distribution. NUll will lead to "unif" (default).

cast_to_data_frame

Logical, controls whether the returned object is of type data.frame (TRUE) or matrix (FALSE). Default is TRUE

add_id_column

Character, controls whether an ID column should be added. Options are "numeric", "character", or "none". If "numeric" or "character" the column ID provides values from 1 to k of the respective type. If none, no column is added. Note that "character" casts all simulated values to character if the argument cast_to_data_frame is set to FALSE.

seed

Numeric, optional seed for making the simulation reproducable (see details)

...

Further arguments relevant for the distribution to draw from

Details

When drawing from a truncated normal distribution, users must provide values for the arguments means and sds. These are numeric vectors of the same size as lower and upper, and indicate the mean and the standard deviation of the normal distributions.

Value

If cast_to_data_frame is TRUE, a data.frame with k rows and at least length(lower);length(upper) columns. Otherwise a matrix with the same number of rows and columns. Columns are labeled either from V1 to Vk or in case lower and upper are named numeric vectors using the labels of both vectors.

If add_id_column is not "none", an ID column is provided of the respective data type.

The data type of the parameters will be numeric, unless add_id_column is "character" and cast_to_data_frame is FALSE. In this case the returned matrix will be of type character.

Examples


# Example 1: Draw from uniform distributions ------------------------------
lower <- c(a = 1, b = 1, c = 1)
upper <- c(a = 3, b = 4, c = 5)
values <- simulate_values(
  lower = lower,
  upper = upper,
  k = 50,
  add_id_column = "none"
)
summary(values)

# Example 2: Draw from truncated normal distributions ---------------------
lower <- c(a = 1, b = 1, c = 1)
upper <- c(a = 3, b = 4, c = 5)
means <- c(a = 2, b = 2.5, c = 3)
sds <- c(a = 0.5, b = 0.5, c = 0.5)
values <- simulate_values(
  lower = lower,
  upper = upper,
  distr = "tnorm",
  k = 5000,
  add_id_column = "none",
  means = means,
  sds = sds
)
quantile(values$a, probs = c(0.025, 0.5, 0.975))
quantile(values$b, probs = c(0.025, 0.5, 0.975))
quantile(values$c, probs = c(0.025, 0.5, 0.975))

The Solver for Deriving Model Predictions

Description

Functions to get or set the "solver" of an object. The "solver" controls the method for deriving the model's first passage time (i.e., its predicted PDFs).

Usage

solver(object, ...) <- value

## S3 replacement method for class 'drift_dm'
solver(object, ..., eval_model = FALSE) <- value

solver(object, ...)

## S3 method for class 'drift_dm'
solver(object, ...)

## S3 method for class 'fits_ids_dm'
solver(object, ...)

## S3 method for class 'fits_agg_dm'
solver(object, ...)

Arguments

object

an object of type drift_dm, fits_ids_dm, or fits_agg_dm (see estimate_dm()).

...

additional arguments (i.e., eval_model).

value

a single character string, providing the new "solver" (i.e., approach to derive the first passage time; see drift_dm()).

eval_model

logical, indicating if the model should be re-evaluated or not when updating the solver (see re_evaluate_model). Default is FALSE.

Details

solver() is a generic accessor function, and ⁠solver<-()⁠ is a generic replacement function. The default methods get and set the "solver".

The "solver" indicates the approach with which the PDFs of a model are calculated. Supported options are "kfe" and "im_zero" (method based on the Kolmogorov-Forward-Equation or on integral equations, respectively). Note that "im_zero" is only supported for models that assume a fixed starting point from 0.

Value

For solve() the string solver (see drift_dm()).

For ⁠solver<-()⁠ the updated drift_dm object.

Note

There is only a replacement function for drift_dm objects. This is because replacing the approach for deriving PDFs after the model has been fitted (i.e., for a fits_ids_dm object) doesn't make sense.

Examples

# get some default model to demonstrate the solver() functions
my_model <- ratcliff_dm()
solver(my_model)
# change to the integral approach
solver(my_model) <- "im_zero"
solver(my_model)

# accessor method also available for fits_ids_dm objects
# (see estimate_model_ids)
# get an exemplary fits_ids_dm object
fits <- get_example_fits("fits_ids_dm")
solver(fits)

Sorts the numbers in ascending order

Description

.... within one entry (i.e., one prm across all conditions, such as when calling internal_list[["A"]]); which is a list. Used to ensure that parameter ordering remains logical

Usage

sort_one_internal_entry(one_internal_entry)

Arguments

one_internal_entry

one entry of a (linearized) internal_list, must be named

Details

The entries of the internal list are either digits (0-x) or expressions.

Value

the newly sorted entry as a list

Create the Shrinking Spotlight Model

Description

This function creates a drift_dm object that corresponds to a simple version of the shrinking spotlight model by White et al. (2011).

Usage

ssp_dm(
  var_non_dec = TRUE,
  var_start = FALSE,
  instr = NULL,
  obs_data = NULL,
  sigma = 1,
  t_max = 3,
  dt = 0.005,
  dx = 0.02,
  b_coding = NULL
)

Arguments

var_non_dec, var_start

logical, indicating whether the model should have a variable non-decision time or starting point (see also nt_uniform and x_uniform in component_shelf

instr

optional string with "instructions", see modify_flex_prms().

obs_data

data.frame, an optional data.frame with the observed data. See obs_data.

sigma, t_max, dt, dx

numeric, providing the settings for the diffusion constant and discretization (see drift_dm)

b_coding

list, an optional list with the boundary encoding (see b_coding)

Details

The shrinking spotlight model is a model developed for the flanker task.

It has the following properties (see component_shelf):

a constant boundary (parameter b)
a constant starting point in between the decision boundaries
an evidence accumulation process that is driven by an attention spotlight that covers both the flankers and the target. The area that covers the flankers and target is modeled by normal distribution with mean 0:
- At the beginning of the trial attention is wide-spread, and the width at t=0 is the standard deviation sd_0
- As the trial progresses in time, the attention spotlight narrows, reflected by a linear decline of the standard deviation with rate r (to a minimum of 0.001).
- the attention attributed to both the flankers and the target is scaled by p which controls the strength of evidence accumulation
A non-decision time that follows a truncated normal distribution with mean non_dec and standard deviation sd_non_dec.
The model also contains the auxiliary parameter sign, which is used to control the influence of the flankers across conditions. It is not really a parameter and should not be estimated!

Per default, the parameter r is assumed to be fixed (i.e., is not estimated freely). The model also contains the custom parameter interf_t, quantifying the interference time (sd_0 / r).

Value

An object of type drift_dm (parent class) and ssp_dm (child class), created by the function drift_dm().

Note

The parameters of SSP in dRiftDM differ in their size from the original publication of White et al. (2011). dRiftDM uses symmetrical boundaries around zero and a diffusion constant of 1. In the original publication, SSP was parameterized with boundaries ranging from zero to a and a diffusion constant of 0.1.

Thus, in dRiftDM, the boundary b corresponds to b = a/2 \cdot 10. Additionally, p in dRiftDM is 10 times larger than p in the original publication. Finally, r is expressed in seconds, and thus r is 1000 times larger in dRiftDM than in the original publication.

References

White CN, Ratcliff R, Starns JJ (2011). “Diffusion models of the flanker task: Discrete versus gradual attentional selection.” Cognitive psychology, 63(4), 210–238. doi:10.1016/j.cogpsych.2011.08.001.

Examples

# the model with default settings
my_model <- ssp_dm()

# the model with a finer discretization
my_model <- ssp_dm(dt = .0025, dx = .01)

A synthetic data set with two conditions

Description

This dataset was simulated by using the Shrinking Spotlight Model (see ssp_dm()) with parameter settings that are typical for a Flanker task.

Usage

ssp_synth_data

Format

A data frame with 600 rows and 3 columns:

RT: Response Times
Error: Error Coding (Error Response = 1; Correct Response = 0)
Cond: Condition ('comp' and 'incomp')

Helper to get, set, or reset package-global options for statistics

Description

Internal utility to manage global options for the package via the "stats.dRiftDM" option slot.

Usage

stats.options(...)

Arguments

...

input, see Details below

Details

Usage patterns:

stats.options(): Returns the full list of currently stored options.
stats.options(name): Returns the value of a specific option (must be a single unnamed string).
stats.options(name = value, ...): Sets (or updates) named option(s).
stats.options(NULL): Resets (clears) the entire option list.

This function is intended for internal use only. It behaves similarly to options() and keeps all package-specific options in a single named list under getOption("stats.dRiftDM").

Setting an argument can only be done once with this function, any additional attempts to modify an option will not work (unless this argument is explicitly set to NULL).

Value

Depending on usage:

Full list of options (if no input),
A specific option value (if string input),
Invisibly NULL (if setting or resetting options).

Get Quantiles/CAFs from PDFs and stats_agg_info

Description

Internal helper to compute predicted summary statistics from condition-wise PDFs, following the condition order. Calculates summary statistics according to the information in stats_agg_info. Currently supports either quantiles or conditional accuracy functions (CAFs).

Usage

stats_from_pdfs_agg_info(pdfs, t_vec, dt, stats_agg_info = NULL, what)

Arguments

pdfs

named list of PDFs per condition, each containing elements pdf_u and pdf_l.

t_vec

numeric time vector.

dt

numeric time step.

stats_agg_info

list with information needed to compute summaries (e.g., quantile probabilities or CAF bin counts; optional).

what

character, one of "quantiles" or "cafs", selecting which statistic to compute.

Value

A list of numeric vectors, one per condition, containing the predicted quantiles or CAFs. In case of failure, NA_real_ values are returned.

Resample Observed and Predicted Statistics for Interval Estimation

Description

Internal methods to generate bootstrap-like intervals for descriptive statistics derived from either observed data or model predictions. These methods support both drift_dm objects and data.frames containing a single participant's observed data.

Usage

stats_resample_dm(object, conds, type, b_coding, ..., R, interval_level)

## S3 method for class 'drift_dm'
stats_resample_dm(
  object,
  conds,
  type,
  b_coding,
  ...,
  R = 100,
  interval_level = 0.95,
  n_sim = NULL
)

## S3 method for class 'data.frame'
stats_resample_dm(
  object,
  conds,
  type,
  b_coding,
  ...,
  R = 100,
  interval_level = 0.95,
  progress = 0,
  level
)

## S3 method for class 'fits_ids_dm'
stats_resample_dm(
  object,
  conds,
  type,
  b_coding,
  ...,
  R = 100,
  interval_level = 0.95,
  progress = 0,
  level
)

Arguments

object

a drift_dm object (for model-based resampling) or a data.frame with observed data for a single participant. drift_dm_stats_types("sum_dists"), such as "quantiles" or "cafs".

conds

a character vector indicating the condition(s) for which the statistics should be resampled.

type

a character string, specifying the type of statistic to calculate

b_coding

a list, specifying the boundary coding, required when calculating the statistics.

...

additional arguments passed to stats_resample_wrapper() and simulate_data(). Must contain type and b_coding

R

an integer, number of replications (default is 100).

interval_level

a numeric between 0 and 1, controlling the width of the interval (default is 0.95).

n_sim

an optional vector, providing the trial numbers for simulating synthetic data under the model. Only relevant when no observed data is attached to the model via the obs_data entry of the model.

progress

an integer, specifying if a progress bar shall be displayed (1) or not (0).

level

a character string, specifying at which level resampling shall take place. "individual" will lead to resampling of an individual's data. "group" will lead to resampling of the entire participant.

Details

The stats_resample_dm() generic dispatches to class-specific methods. For drift_dm objects, it generates synthetic data sets under the model. For raw data, it resamples observations with replacement (i.e, it performs a bootstrap). In both cases, statistics from the resampled/generated data are used to compute intervals for the requested statistic.

Resampling is done for each condition separately.

The function stats_resample_dm() is called within calc_stats().

Value

A stats_dm object with added column Estimate indicating whether the row represents the lower interval bound, the original value ("orig"), or the upper interval bound. The interval level can be controlled via the interval_level argument.

Reads Info file

Description

Requires a path to the info file of a fit procedure (see estimate_model_ids) and turns its main information into a string

Usage

summarize_drift_dm_info(full_name_to_file, detailed_info)

Arguments

full_name_to_file

path to the info file (.RDS)

detailed_info

logical, if detailed info shall be provided or not

Value

a string with infos about the fit procedure name, last call, and, if detailed_info = TRUE, model, individuals, lower/upper, and seed.

Summary for `coefs_dm` Objects

Description

Summary and corresponding printing methods for coefs_dm objects. These objects result from a call to coef.fits_ids_dm() (i.e., when calling coef() with an object of type fits_ids_dm).

Usage

## S3 method for class 'coefs_dm'
summary(object, ..., round_digits = drift_dm_default_rounding())

## S3 method for class 'summary.coefs_dm'
print(x, ..., show_header = TRUE)

Arguments

object

an object of type coefs_dm.

...

additional arguments passed forward.

round_digits

integer, specifying the number of decimal places for rounding the summary of the underlying data.frame. Default is 3.

x

an object of class summary.coefs_dm.

show_header

logical. If TRUE, a header specifying the type of statistic will be displayed.

Details

summary.coefs_dm() summarizes coefs_dm objects, returning the type, a summary of the underlying data.frame (summary_dataframe), and the number of unique IDs (n_ids).

Value

For summary.coefs_dm() a summary object of class summary.coefs_dm.

For print.summary.coefs_dm(), the supplied object is returned invisibly.

Examples

# get a fits_ids object for demonstration purpose
fits_ids <- get_example_fits("fits_ids_dm")
coefs <- coef(fits_ids)
summary(coefs)

Summary for `drift_dm` objects

Description

summary and corresponding printing methods for objects of class drift_dm, created by a call to drift_dm().

Usage

## S3 method for class 'drift_dm'
summary(object, ...)

## S3 method for class 'summary.drift_dm'
print(x, ..., round_digits = drift_dm_default_rounding())

Arguments

object

an object of class drift_dm.

...

additional arguments passed forward (currently not used).

x

an object of class summary.drift_dm.

round_digits

integer, specifying the number of decimal places for rounding in the printed summary. Default is 3.

Details

summary.drift_dm() constructs a summary list with information about the drift_dm object. The returned list has class summary.drift_dm and can include the following entries:

class: Class vector of the drift_dm object.
summary_flex_prms: Summary of the flex_prms object in the model (see summary.flex_prms).
prms_solve: Parameters used for solving the model (see prms_solve).
solver: Solver used for generating model predictions.
b_coding: Boundary coding for the model (see b_coding).
obs_data: Summary table of observed response time data, if available, by response type (upper/lower boundary). rows correspond to upper first then lower responses; row names are prefixed by the boundary names from b_coding. columns (all lower-case) are: min, ⁠1st qu.⁠, median, mean, ⁠3rd qu.⁠, max, and n.
cost_function: Name (or descriptor) of the cost function used during estimation.
fit_stats: Fit statistics, if available. we return a named atomic vector created via unlist(unpack_obj(calc_stats(..., type = "fit_stats"))).
estimate_info: Additional information about the estimation procedure.

print.summary.drift_dm() displays this summary in a formatted way.

Value

summary.drift_dm() returns a list of class summary.drift_dm (see details for the entries).

print.summary.drift_dm() returns invisibly the summary.drift_dm object.

Examples

# get a pre-built model for demonstration
a_model <- dmc_dm()
sum_obj <- summary(a_model)
print(sum_obj, round_digits = 2)

# more information is provided when we add data to the model
obs_data(a_model) <- dmc_synth_data # (data set comes with dRiftDM)
summary(a_model)

# fit indices are added once we evaluate the model
a_model <- re_evaluate_model(a_model)
summary(a_model)

Summarizing Flex Parameters

Description

summary method for class "flex_prms".

Usage

## S3 method for class 'flex_prms'
summary(object, ...)

## S3 method for class 'summary.flex_prms'
print(
  x,
  ...,
  round_digits = drift_dm_default_rounding(),
  dependencies = TRUE,
  cust_parameters = TRUE
)

Arguments

object

an object of class flex_prms, resulting from a call to flex_prms.

...

additional arguments passed forward to the respective method

x

an object of class summary.flex_prms; a result of a call to summary.flex_prms().

round_digits

integer, indicating the number of decimal places (round) to be used (default is 3).

dependencies

logical, controlling if a summary of the special dependencies shall be printed (see the "special dependency instruction" in the details of flex_prms)

cust_parameters

logical, controlling if a summary of the custom parameters shall be printed (see the "additional/custom parameter instruction" in the details of flex_prms)

Details

The summary.flex_prms() function creates a summary object containing:

prms_matrix: All parameter values across all conditions.
unique_matrix: A character matrix, showing how parameters relate across conditions.
depend_strings: Special Dependencies, formatted as a string.
cust_prms_matrix: (if they exist), a matrix containing all custom parameters.

The print.summary.flex_prms() function displays the summary object in a formatted manner.

Value

summary.flex_prms() returns a list of class summary.flex_prms (see the Details section summarizing each entry of this list).

print.summary.flex_prms() returns invisibly the summary.flex_prms object.

Examples

# create a flex_prms object
flex_obj <- flex_prms(c(a = 1, b = 2), conds = c("foo", "bar"))

sum_obj <- summary(flex_obj)
print(sum_obj)

# the print function for the summary object is identical to the print
# function of the flex_prms object
print(flex_obj)

Summary for `mcmc_dm` Objects

Description

Summary and corresponding print methods for objects of the class mcmc_dm, resulting from a call to estimate_bayesian(). mcmc_dm objects contain MCMC samples for Bayesian parameter estimation of drift_dm() objects. The summary includes basic parameter statistics, quantiles, Gelman-Rubin diagnostics, and effective sample sizes.

Usage

## S3 method for class 'mcmc_dm'
summary(object, ..., id = NULL)

## S3 method for class 'summary.mcmc_dm'
print(
  x,
  ...,
  round_digits = drift_dm_default_rounding(),
  show_statistics = TRUE,
  show_quantiles = FALSE,
  show_gr = TRUE,
  show_eff_n = TRUE
)

Arguments

object

an object of class mcmc_dm, as returned by estimate_bayesian()

...

additional arguments passed forward to coda::summary.mcmc.list().

id

optional single numeric or character, specifying one or more participant IDs to subset object in the hierarchical case. Note that id will be converted to character, because dimension names of the chains stored in object are character. If NULL, then the function is applied to group-level parameters.

x

an object of class summary.mcmc_dm, as returned by summary.mcmc_dm().

round_digits

an integer, defining the number of digits for rounding the output.

show_statistics

a logical, if TRUE, print basic parameter statistics (means, SDs, standard errors).

show_quantiles

a logical, if TRUE, print quantile summary.

show_gr

a logical; if TRUE, print Gelman-Rubin convergence diagnostics for each parameter.

show_eff_n

a logical, if TRUE, print effective sample sizes for each parameter.

Details

The summary and diagnostic statistics of the MCMC chains are obtained using the R package coda.

Value

summary.mcmc_dm() returns an object of class summary.mcmc_dm, which is a list with the following entries:

general: General information about the MCMC run.
statistics: Basic parameter summary statistics.
quantiles: Quantiles for each parameter.
gr: Gelman-Rubin diagnostics.
eff_n: Effective sample sizes.

print.summary.mcmc_dm() prints selected summary components and returns the input object invisibly.

Examples

mcmc_obj <- get_example_fits("mcmc_dm")
print(mcmc_obj)
summary(mcmc_obj)

Summary for `stats_dm` Objects

Description

Summary and corresponding printing methods for objects of the classes stats_dm, basic_stats, cafs, quantiles, delta_funs, fit_stats, sum_dist, and stats_dm_list. These object types result from a call to calc_stats().

Usage

## S3 method for class 'stats_dm'
summary(object, ..., round_digits = drift_dm_default_rounding())

## S3 method for class 'basic_stats'
summary(object, ...)

## S3 method for class 'cafs'
summary(object, ...)

## S3 method for class 'quantiles'
summary(object, ...)

## S3 method for class 'delta_funs'
summary(object, ...)

## S3 method for class 'fit_stats'
summary(object, ...)

## S3 method for class 'sum_dist'
summary(object, ...)

## S3 method for class 'stats_dm_list'
summary(object, ...)

## S3 method for class 'summary.stats_dm'
print(x, ..., show_header = TRUE, drop_cols = NULL)

## S3 method for class 'summary.basic_stats'
print(x, ...)

## S3 method for class 'summary.cafs'
print(x, ...)

## S3 method for class 'summary.quantiles'
print(x, ...)

## S3 method for class 'summary.delta_funs'
print(x, ...)

## S3 method for class 'summary.fit_stats'
print(x, ...)

## S3 method for class 'summary.sum_dist'
print(x, ...)

## S3 method for class 'summary.stats_dm_list'
print(x, ...)

Arguments

object

an object of the respective class

...

additional arguments passed forward.

round_digits

integer, specifying the number of decimal places for rounding the summary of the underlying data.frame. Default is 3.

x

an object of the respective class.

show_header

logical. If TRUE, a header specifying the type of statistic will be displayed.

drop_cols

character vector, specifying which columns of the table summarizing the underlying data.frame should not be displayed.

Details

summary.stats_dm(): Summarizes stats_dm objects, returning the type, a summary of the underlying data.frame (summary_dataframe), and, if possible, the number of unique IDs (n_ids).
summary.sum_dist(): Extends summary.stats_dm() with additional information about the source (source).
summary.basic_stats(): Extends summary.sum_dist() with additional information about the conditions (conds).
summary.cafs(): Extends summary.sum_dist() with additional information about the bins (bins) and conditions (conds).
summary.quantiles(): Extends summary.sum_dist() with additional information about the quantile levels (probs) and conditions (conds).
summary.delta_funs(): Extends summary.sum_dist() with additional information about the quantile levels (probs).
summary.fit_stats(): Identical to summary.stats_dm.
summary.stats_dm_list(): Applies the summary function to each element of the list and returns a list of the respective summary objects.

Note the following class relationships and properties:

basic_stats, cafs, quantiles, and delta_funs are all inheriting from sum_dist.
All sum_dist and fit_stats objects are inheriting from stats_dm.
Each stats_dm_list object is just a list containing instances of stats_dm.

Value

For ⁠summary.*()⁠ methods, a summary object of class corresponding to the input class.

For ⁠print.*()⁠ methods, the respective object is returned invisibly

Examples

# get a model with data for demonstration purpose
a_model <- dmc_dm()
obs_data(a_model) <- dmc_synth_data

# now get some statistics and call the summary functions
some_stats <- calc_stats(a_model, type = c("quantiles", "fit_stats"))
summary(some_stats) # summary.stats_dm_list
summary(some_stats$quantiles) # summary.quantiles

Summary for traces_dm and traces_dm_list Objects

Description

Summary and corresponding printing methods for traces_dm and traces_dm_list objects, resulting from a call to simulate_traces(). Here, traces_dm objects are entries of the returned list.

Usage

## S3 method for class 'traces_dm'
summary(object, ...)

## S3 method for class 'summary.traces_dm'
print(x, ..., round_digits = drift_dm_default_rounding())

## S3 method for class 'traces_dm_list'
summary(object, ...)

## S3 method for class 'summary.traces_dm_list'
print(x, ..., round_digits = drift_dm_default_rounding())

Arguments

object

an object of class traces_dm or traces_dm_list.

...

additional arguments passed forward.

x

an object of type summary.traces_dm or summary.traces_dm_list.

round_digits

integer, specifying the number of decimal places for rounding in the printed summary. Default is 3.

Details

The summary.traces_dm() function constructs a summary list with information about the traces_dm object, including:

k: The number of traces in the object.
add_x: A logical, indicating whether starting values were added.
orig_model_class: The class label of the original model.
orig_prms: The parameters with which the traces were simulated (for the respective condition)
prms_solve: The solver settings with which the traces were simulated.
fpt_desc: A summary of the first passage times, including mean, standard deviation, and response probabilities for upper and lower boundaries.

The summary.traces_dm_list() function constructs a summary list with information about the traces_dm_list object, including:

k: A numeric vector, providing the number of traces per condition.
add_x: A logical vector, indicating whether starting values were added for each condition.
orig_prms: A matrix, containing the original parameter values per condition, with which the traces were simulated.
orig_model_class: The class label of the original model
prms_solve: A matrix of solver settings per condition.
fpt_desc: A summary of the first passage times per condition, including mean, standard deviation, and response probabilities for the upper or lower boundary.

The print.summary.traces_dm() and print.summary.traces_dm_list() functions display the summary in a formatted way.

Value

summary.traces_dm() returns a list of class summary.traces_dm (see the Details section summarizing each entry of this list).

summary.traces_dm_list() returns a list of class summary.traces_dm_list (see the Details section summarizing each entry of this list).

print.summary.traces_dm() returns the summary.traces_dm object invisibly.

print.summary.traces_dm_list() returns the summary.traces_dm_list object invisibly.

Examples

# get a couple of traces a cross conditions
traces <- simulate_traces(dmc_dm(), k = c(5, 10))
summary(traces)

# get a single traces object
one_traces_obj <- traces[[1]]
summary(one_traces_obj)

Convert Character Digits to Numeric Digits

Description

This internal function casts a character vector to integer, if the character vector only contains digits for entries. If the input vector is not of type character or if any entry contains a non-digit, then the vector is returned unmodified.

Usage

try_cast_integer(values)

Arguments

values

a vector of values to attempt conversion to integer.

Value

an integer vector if conversion succeeds; otherwise, the original vector.

Exemplary Flanker Data

Description

Data of the Flanker task collected in the course of the study by Ulrich et al. (2015).

Usage

ulrich_flanker_data

Format

A data.frame with 16 individuals and the following columns:

ID: Individual IDs
RT: Response Times
Error: Error Coding (Error Response = 1; Correct Response = 0)
Cond: Condition ('comp' and 'incomp')

Exemplary Simon Data

Description

Data of the Simon task collected in the course of the study by Ulrich et al. (2015).

Usage

ulrich_simon_data

Format

A data.frame with 16 individuals and the following columns:

ID: Individual IDs
RT: Response Times
Error: Error Coding (Error Response = 1; Correct Response = 0)
Cond: Condition ('comp' and 'incomp')

Unpack/Destroy dRiftDM Objects

Description

When calling simulate_traces(), calc_stats, or coef.fits_ids_dm the returned objects will be custom objects (e.g., subclasses of list or data.frame). The respective subclasses were created to provide convenient plotting and printing, but they don't really provide any additional functionality.

The goal of unpack_obj() is to provide a convenient way to strip away the attributes of the respective objects (revealing them as standard arrays, data.frames, or lists).

Usage

unpack_obj(object, ...)

## S3 method for class 'traces_dm'
unpack_obj(object, ..., unpack_elements = TRUE)

## S3 method for class 'traces_dm_list'
unpack_obj(object, ..., unpack_elements = TRUE, conds = NULL)

## S3 method for class 'stats_dm'
unpack_obj(object, ..., unpack_elements = TRUE)

## S3 method for class 'stats_dm_list'
unpack_obj(object, ..., unpack_elements = TRUE, type = NULL)

## S3 method for class 'coefs_dm'
unpack_obj(object, ..., unpack_elements = TRUE)

Arguments

object

an object of type stats_dm, stats_dm_list, traces_dm, traces_dm_list, or coefs_dm

...

further arguments passed on to the respective method.

unpack_elements

logical, indicating if the traces_dm, stats_dm, or coefs_dm objects shall be unpacked. Default is TRUE.

conds

optional character vector, indicating specific condition(s). The default NULL will lead to conds = conds(object). Thus, per default all conditions are addressed

type

optional character vector, indicating specific type(s) of statistics. The default NULL will access all types of statics.

Details

unpack_obj() is a generic function to strip away the custom information and class labels of stats_dm, stats_dm_list, traces_dm, traces_dm_list, and coefs_dm objects. These objects are created when calling simulate_traces(), calc_stats, or coef.fits_ids_dm.

For traces_dm_list, unpack_obj() returns the requested conditions (see the argument conds). The result contains objects of type traces_dm if unpack_elements = FALSE. For unpack_elements = TRUE, the result contains the plain arrays with the traces.

For stats_dm_list, unpack_obj() returns the requested statistics (see the argument type). The result contains objects of type stats_dm if unpack_elements = FALSE. For unpack_elements = TRUE, the result contains the plain data.frames with the statistics.

Value

For traces_dm_list, the returned value is a list, if conds specifies more than one condition. For example, if conds = c("foo", "bar"), then the returned value is a list with the two (named) entries "foo" and "bar". If the returned list would only have one entry (either because the traces_dm_list has only one condition, see conds, or because a user explicitly requested only one condition), then the underlying array or traces_dm object is returned directly.

For stats_dm_list, the returned value is a list, if type specifies more than one condition. If the returned list would only have one entry, then the underlying data.frame or stats_dm object is returned directly.

For traces_dm, unpack_obj() returns an array with the traces, if unpack=TRUE. If unpack=FALSE, the unmodified object is returned.

For stats_dm, unpack_obj() returns a data.frame with the respective statistic, if unpack=TRUE. If unpack=FALSE, the unmodified object is returned.

For coefs_dm, unpack_obj() returns a data.frame with the parameters, if unpack=TRUE. If unpack=FALSE, the unmodified object is returned.

Examples

# get a pre-built model to demonstrate the function
my_model <- dmc_dm()

# get some traces ...
some_traces <- simulate_traces(my_model, k = 2, seed = 1)
some_traces <- some_traces$comp
class(some_traces)
# ... unpack them to get the underlying arrays
class(unpack_obj(some_traces))

# get some statistics ...
some_stats <- calc_stats(my_model, type = "cafs")
class(some_stats)
class(unpack_obj(some_stats))

# get some parameters ...
some_coefs <- coef(get_example_fits("fits_ids_dm"))
class(some_coefs)
class(unpack_obj(some_coefs))

Unpack/Destroy Traces Objects

Description

unpack_traces() is deprecated. Please use the more general unpack_obj() function.

Usage

unpack_traces(object, ...)

## S3 method for class 'traces_dm'
unpack_traces(object, ..., unpack = TRUE)

## S3 method for class 'traces_dm_list'
unpack_traces(object, ..., unpack = TRUE, conds = NULL)

Arguments

object

an object of type traces_dm or traces_dm_list (see simulate_traces())

...

further arguments passed on to the respective method.

unpack

logical, indicating if the traces_dm objects shall be unpacked. Default is TRUE.

conds

optional character, indicating specific condition(s). The default NULL will lead to conds = conds(object). Thus, per default all conditions are accessed.

Details

unpack_traces() was a generic function to strip away the "unnecessary" information of traces_dm_list and traces_dm objects. These objects are created when calling simulate_traces().

For traces_dm_list, unpack_traces() returns the requested conditions (see the argument conds). The result contains objects of type traces_dm if unpack = FALSE. For unpack = TRUE, the result contains the plain arrays with the traces.

Value

For traces_dm, unpack_traces() returns an array with the traces, if unpack=TRUE. If unpack=FALSE, the unmodified object is returned.

Update all prms

Description

This function takes a flex_prms object and updates the prms_matrix according to the special instructions in internal_list and the custom parameters cust_prms

Usage

update_special_values(flex_prms_obj)

Arguments

flex_prms_obj

a flex_prms object

Value

the modified flex_prms_obj (i.e,. with the updated prms_matrix and the updated cust_prms$values)

Update aggregated statistics in a `drift_dm` object

Description

Internal function that creates or updates the aggregated statistics (stats_agg and stats_agg_info) in a drift_dm object, depending on the specified cost function. For maximum likelihood estimation ("neg_log_like"), aggregated statistics are removed, because the raw RTs are used directly.

Usage

update_stats_agg(
  drift_dm_obj,
  which_cost_function,
  probs = NULL,
  n_bins = NULL
)

Arguments

drift_dm_obj

a drift_dm object.

which_cost_function

a character string, indicating which cost function is used. Must be one of drift_dm_cost_functions().

probs

optional numeric vector of probabilities for quantile calculation. If NULL, defaults are taken from drift_dm_default_probs().

n_bins

an optional integer, giving the number of bins for the CAFs. If NULL, defaults are taken from drift_dm_default_n_bins().

Value

the input drift_dm_obj, with its stats_agg and stats_agg_info entries updated or removed, depending on the cost function and availability of observed data

Note

This function is called by obs_data() and cost_function()

Validate a DDM object

Description

Performs basic checks to ensure everything is as expected with the model. This function should be called whenever modifying a ddm object!

Usage

validate_drift_dm(drift_dm_obj)

Arguments

drift_dm_obj

the ddm object

Details

Checks:

The flex_prms_object via validate_flex_prms()
The prms_solve (that it is a named numeric vector check_if_named_numeric_vector() with the expected entries) and that nt, nx make sense. This may adjust t_max if t_max is smaller than max(RT) of the observed data
The solver string (only a single string and that it refers to something that is actually implemented). If im_zero, then check if dirac delta on 0.
The cost_function string (only a single string and that it refers to something that is actually implemented).
If cost_function is a summary statistic, that the objects stats_agg exists and has the correct structure.
That the list comp_funs only contains functions and that each function provides the expected arguments
If PDFs exist, the names, lengths and data type
If cost_val exists, if it is a single numeric
If obs_data exists, the data type, names, and structure
The b_coding (column, u_name_value and l_name_value).

Value

the ddm object, after it passed all checks. Usually, it will be unmodified. The only exception is when the observed RTs are larger than t_max. Then, the returned ddm object has a new t_maxthat covers the largest RTs.

Validate a an Object of Type fits_ids_dm

Description

checks if all the information are in the fits_ids_dm (see estimate_model_ids) object and ensures that nothing obviously fishy is going on with the individual model fits.

Usage

validate_fits_ids(fits_ids, progress)

Arguments

fits_ids

an object of type fits_ids_dm

progress

numeric, 0, no progress, 1 basic output, 2 progress bars

Details

Checks:

if all names are in the info file
and if the respective entries make sense
if the flex_prms object of the all saved models and the overall model is the same (except for the differences in the prm values).
if the class, prms_solve and solver of the saved models and the overall model is the same
if the estimated parameters are in the parameter space
for same b_coding and same functions
if the number of fits matches with the number of individuals in the info file
if the data in each fitted model matches with the observed data in the info file

Value

the unmodified fits_ids objects after passing all tests.

checks if a flex_prms_object is parameterized in a reasonable way

Description

checks internal_list
checks liner_internal_list
checks the parameter matrix

Usage

validate_flex_prms(flex_prms_obj)

Arguments

flex_prms_obj

and object of type flex_prms

Value

the unmodified flex_prms_obj object; only stop/warning statements

Validate a stats_dm Object

Description

validate_stats_dm is an internal (i.e., not exported) generic function to ensure that stats_dm objects, as well as their specific subclasses (basic_stats, cafs, quantiles, delta_funs, sum_dist, and fit_stats), meet the necessary structural and column requirements. Each method performs class-specific validation checks.

Usage

validate_stats_dm(stat_df)

## S3 method for class 'basic_stats'
validate_stats_dm(stat_df)

## S3 method for class 'cafs'
validate_stats_dm(stat_df)

## S3 method for class 'quantiles'
validate_stats_dm(stat_df)

## S3 method for class 'delta_funs'
validate_stats_dm(stat_df)

## S3 method for class 'densities'
validate_stats_dm(stat_df)

## S3 method for class 'sum_dist'
validate_stats_dm(stat_df)

## S3 method for class 'fit_stats'
validate_stats_dm(stat_df)

## S3 method for class 'stats_dm'
validate_stats_dm(stat_df)

Arguments

stat_df

A data.frame of class stats_dm, basic_stats, cafs, quantiles, delta_funs, sum_dist, or fit_stats containing the calculated statistics to be validated.

Details

The validation process checks for required columns and structure based on the class of stat_df. Each class has specific requirements:

validate_stats_dm.stats_dm: Ensures stat_df is a data.frame.
validate_stats_dm.basic_stats: Checks for the presence of "Cond", exactly two columns with prefix ⁠"Mean_⁠, and exactly one column prefixed with "P_"
validate_stats_dm.cafs: Checks for the presence of "Bin", "Cond", and exactly one column prefixed with "P_"
validate_stats_dm.quantiles: Requires "Prob", "Cond", and exactly two columns prefixed with "Quant_"
validate_stats_dm.delta_funs: Ensures "Prob" exists, at least two columns prefixed with "Quant_", and at least one column each Avg_ and Delta_
validate_stats_dm.delta_funs: Ensures "Cond", "Time", and "Stat" exists, and at least two column with "Dens_".
validate_stats_dm.sum_dist: Checks for a "Source" column. Here, it is also checked whether cell combinations appear equally often.
validate_stats_dm.fit_stats: Checks for if the fit statistics summarize a log-likelihood cost function or the RMSE statistic. In the former case, the columns "Log_Like", "Neg_Log_Like", "AIC", and "BIC" are expected. In the latter case, the columns "RMSE_ms" and "RMSE_s" are expected.

Value

Returns the unmodified stat_df for convenience.

Update the parameter matrix for vector inputs (internal docu)

Description

This function takes a numeric vector and maps the values to the parameter matrix using the linearized internal list. This will also lead to an update of the values for which special dependencies were set

Usage

x2prms_vals(x, flex_prms_obj)

Arguments

x

a numeric vector with new values to set

flex_prms_obj

a flex_prms with the (linearized) internal list and the parameter matrix

Details

Does not perform input checks!

Value

a flex_prms_obj with updated parameter matrix

Beta-Shaped Starting Point Distribution Centered Around Zero

Description

Beta-Shaped Starting Point Distribution Centered Around Zero

Usage

x_beta(prms_model, prms_solve, x_vec, one_cond, ddm_opts)

Arguments

prms_model

the model parameters; prm name "alpha" required

prms_solve

solver settings

x_vec

evidence space

one_cond

one condition

ddm_opts

optional arguments attached to an object

Value

returns the PDF of a beta-shaped distribution for x_vec, centered around zero and with a shape parameter "alpha".

Constant Starting Point at Zero

Description

A dirac delta on zero, to provide no bias and a constant starting point

Usage

x_dirac_0(prms_model, prms_solve, x_vec, one_cond, ddm_opts)

Arguments

prms_model

the model parameters; no prm name required

prms_solve

solver settings

x_vec

evidence space

one_cond

one condition

ddm_opts

optional arguments attached to an object

Value

a vector of the same length as x_vec with zeros, except for the element in the middle of the vector

Uniform Starting Point Distribution Centered Around Zero

Description

Uniform Starting Point Distribution Centered Around Zero

Usage

x_uniform(prms_model, prms_solve, x_vec, one_cond, ddm_opts)

Arguments

prms_model

the model parameters; prm name "range_start" required

prms_solve

solver settings

x_vec

evidence space

one_cond

one condition

ddm_opts

optional arguments attached to an object

Value

returns the PDF of a uniform distribution for x_vec, centered around zero and with a range of "range_start".

dRiftDM: Estimating (Time-Dependent) Drift Diffusion Models

Description

Author(s)

See Also

Convolute the First Passage Times with the Non-Decision Time Distribution

Description

Usage

Arguments

Value

Aggregate Statistics Across ID

Description

Usage

Arguments

Details

Value

See Also

The Coding of the Boundaries

Description

Usage

Arguments

Details

Value

See Also

Examples

Constant Boundary

Description

Usage

Arguments

Value

Collapsing Boundary - Hyperbolic Ratio Function

Description

Usage

Arguments

Details

Value

Collapsing Boundary - Weibull Function

Description

Usage

Arguments

Details

Value

Calculate Basic Statistics for Response Times or Probability Densities

Description

Usage

Arguments

Details

Value

See Also

Calculate Basic Mean, Standard Deviations, and Percentages for Observed and Predicted Data

Description

Usage

Arguments

Details

Value

Calculate CAFs

Description

Usage

Arguments

Details

Value

See Also

Calculate CAFs

Description

Usage

Arguments

Details

Value

Calculate delta function(s)

Description

Usage

Arguments

Details

Value

See Also

Calculate and Combine Density Estimates for Observed and Predicted Data

Description

Usage

Arguments

Details

Value