Type: | Package |
Title: | Language Model Agents in R for AI Workflows and Research |
Version: | 0.3.0 |
Maintainer: | Kwadwo Daddy Nyame Owusu Boakye <kwadwo.owusuboakye@outlook.com> |
Description: | Provides modular, graph-based agents powered by large language models (LLMs) for intelligent task execution in R. Supports structured workflows for tasks such as forecasting, data visualization, feature engineering, data wrangling, data cleaning, 'SQL', code generation, weather reporting, and research-driven question answering. Each agent performs iterative reasoning: recommending steps, generating R code, executing, debugging, and explaining results. Includes built-in support for packages such as 'tidymodels', 'modeltime', 'plotly', 'ggplot2', and 'prophet'. Designed for analysts, developers, and teams building intelligent, reproducible AI workflows in R. Compatible with LLM providers such as 'OpenAI', 'Anthropic', 'Groq', and 'Ollama'. Inspired by the Python package 'langagent'. |
License: | MIT + file LICENSE |
Encoding: | UTF-8 |
RoxygenNote: | 7.3.2 |
URL: | https://github.com/knowusuboaky/LLMAgentR, https://knowusuboaky.github.io/LLMAgentR/ |
BugReports: | https://github.com/knowusuboaky/LLMAgentR/issues |
Depends: | R (≥ 4.1.0) |
Imports: | plotly, stats, utils, DBI, RSQLite, dplyr, glue, httr, officer, purrr, timetk, pdftools, parsnip, recipes, workflows, rsample, modeltime.ensemble, modeltime, xml2 |
Suggests: | testthat (≥ 3.0.0), roxygen2, jsonlite, magrittr, rlang, tidyr, ggplot2, usethis, prophet, forcats, kernlab, xgboost, xfun, modeltime.resample, tidymodels, tibble, lubridate, methods, tesseract, rvest, fastDummies, stringr |
NeedsCompilation: | no |
Packaged: | 2025-05-20 20:05:52 UTC; kwadw |
Author: | Kwadwo Daddy Nyame Owusu Boakye [aut, cre] |
Repository: | CRAN |
Date/Publication: | 2025-05-20 20:20:02 UTC |
Build an R Code-Generation Agent
Description
Constructs an LLM-powered agent for generating, debugging, explaining, or optimizing R code. **Two calling patterns are supported**:
**Builder pattern** – omit 'user_input'. The function returns a reusable coder-agent closure. Call that closure with different queries whenever you need code help.
**One-shot pattern** – provide 'user_input'. The function executes immediately and returns the result once.
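Every example in this manual passes a user-supplied 'llm' function (shown as 'my_llm_wrapper'); any function that maps a character prompt to a character response satisfies the contract. A minimal offline stub for experimenting with either calling pattern (purely illustrative, not a real LLM client):

```r
# Offline stand-in for `my_llm_wrapper` (illustrative only).
# The contract: accept a character prompt, return a character response.
my_llm_wrapper <- function(prompt, verbose = FALSE) {
  if (verbose) message("Received prompt of ", nchar(prompt), " characters")
  paste("Echo:", substr(prompt, 1, 40))
}
```

In practice this function would wrap an API client for 'OpenAI', 'Anthropic', 'Groq', or 'Ollama'; the agents only ever call it as 'llm(prompt)'.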
Arguments
llm |
A function that accepts a character 'prompt' and returns an LLM response (optionally accepts 'verbose'). |
system_prompt |
Optional system-level instructions that override the built-in default prompt. |
user_input |
The coding task/query (e.g., '"Write function to filter NAs"'). **Default 'NULL'** – omit to obtain a reusable agent. |
max_tries |
Maximum LLM retry attempts (default '3'). |
backoff |
Seconds to wait between retries (default '2'). |
verbose |
Logical flag to show progress messages (default 'TRUE'). |
Details
The agent automatically retries failed LLM calls (with exponential back-off) and always returns a structured result.
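The retry logic follows the usual exponential back-off pattern; a hedged sketch of the idea (not the package's internal implementation):

```r
# Illustrative retry loop with exponential back-off (not the package internals).
with_retries <- function(f, max_tries = 3, backoff = 2) {
  for (attempt in seq_len(max_tries)) {
    result <- tryCatch(f(), error = function(e) e)
    if (!inherits(result, "error")) {
      return(list(value = result, success = TRUE, attempts = attempt))
    }
    if (attempt < max_tries) Sys.sleep(backoff * 2^(attempt - 1))  # 2 s, 4 s, ...
  }
  list(value = conditionMessage(result), success = FALSE, attempts = max_tries)
}
```

Note how the sketch always returns a structured list with 'success' and 'attempts' fields, mirroring the Value section below.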
Value
If 'user_input' is 'NULL': a function (the coder-agent closure).
Otherwise: a list with the fields
- input
The original user query.
- llm_response
The LLM output (or error message).
- system_prompt
Prompt actually sent.
- success
Logical; did the call succeed?
- attempts
Number of attempts made.
Examples
## Not run:
## ------------------------------------------------------------------
## 1) Builder pattern – create a reusable coder agent
## ------------------------------------------------------------------
coder <- build_code_agent(
llm = my_llm_wrapper, # your own wrapper around the LLM API
max_tries = 3,
backoff = 2,
verbose = FALSE
)
# Use the agent multiple times
res1 <- coder("Write an R function that z-score standardises all numeric columns.")
res2 <- coder("Explain what `%>%` does in tidyverse pipelines.")
## ------------------------------------------------------------------
## 2) One-shot pattern – run a single request immediately
## ------------------------------------------------------------------
one_shot <- build_code_agent(
llm = my_llm_wrapper,
user_input = "Create a ggplot2 bar chart of mpg by cyl in mtcars.",
max_tries = 3,
backoff = 2,
verbose = FALSE
)
## End(Not run)
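For the first query above, a correct answer from the agent would resemble this hand-written version (shown only as a reference point; the LLM's actual reply will vary):

```r
# Hand-written reference for "z-score standardise all numeric columns".
zscore_numeric <- function(df) {
  num <- vapply(df, is.numeric, logical(1))
  df[num] <- lapply(df[num], function(x) {
    (x - mean(x, na.rm = TRUE)) / sd(x, na.rm = TRUE)
  })
  df
}
```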
Build a Data Cleaning Agent
Description
Constructs a multi-step agent workflow to recommend, generate, fix, execute, and explain robust R code for data cleaning tasks using LLMs and user-defined data.
Arguments
model |
A function that accepts a prompt and returns a text response (e.g., OpenAI, Claude). |
data_raw |
A raw data.frame (or list convertible to data.frame) to be cleaned. |
human_validation |
Logical; whether to include a manual review step. |
bypass_recommended_steps |
Logical; whether to skip LLM-based cleaning step suggestions. |
bypass_explain_code |
Logical; whether to skip explanation of the generated code. |
verbose |
Logical; whether to print progress messages (default: TRUE) |
Value
A compiled graph-based cleaning agent function that accepts and mutates a state list.
Examples
## Not run:
# 1) Load the data
data <- read.csv("tests/testthat/test-data/churn_data.csv")
# 2) Create the agent
data_cleaner_agent <- build_data_cleaning_agent(
model = my_llm_wrapper,
human_validation = FALSE,
bypass_recommended_steps = FALSE,
bypass_explain_code = FALSE,
verbose = FALSE
)
# 3) Define the initial state
initial_state <- list(
data_raw = data,
user_instructions = "Don't remove outliers when cleaning the data.",
max_retries = 3,
retry_count = 0
)
# 4) Run the agent
final_state <- data_cleaner_agent(initial_state)
## End(Not run)
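The code the agent generates for a task like this typically chains standard imputation and type fixes; a small base-R illustration of one such step (not the agent's actual output):

```r
# One representative cleaning step: median-impute missing numeric values.
df <- data.frame(tenure = c(1, NA, 24), charges = c(29.85, 56.95, NA))
for (col in names(df)) {
  if (is.numeric(df[[col]])) {
    df[[col]][is.na(df[[col]])] <- median(df[[col]], na.rm = TRUE)
  }
}
```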
Build a Data Wrangling Agent
Description
Constructs a state graph-based agent that recommends, generates, executes, fixes, and explains data wrangling transformations based on user instructions and dataset structure. The resulting function handles list or single data frame inputs and produces a cleaned dataset.
Arguments
model |
A function that takes a prompt string and returns LLM-generated output. |
human_validation |
Logical; whether to enable manual review step before code execution. |
bypass_recommended_steps |
Logical; skip initial recommendation of wrangling steps. |
bypass_explain_code |
Logical; skip final explanation step after wrangling. |
verbose |
Logical; whether to print progress messages (default: TRUE) |
Value
A callable agent function that mutates a provided 'state' list by populating:
- data_wrangled
The final cleaned data frame.
- data_wrangler_function
The code used.
- data_wrangler_error
Any execution error (if one occurred).
- wrangling_report
LLM-generated explanation (if 'bypass_explain_code = FALSE').
Examples
## Not run:
# 1) Simulate multiple data frames with a common ID
df1 <- data.frame(
ID = c(1, 2, 3, 4),
Name = c("John", "Jane", "Jim", "Jill"),
stringsAsFactors = FALSE
)
df2 <- data.frame(
ID = c(1, 2, 3, 4),
Age = c(25, 30, 35, 40),
stringsAsFactors = FALSE
)
df3 <- data.frame(
ID = c(1, 2, 3, 4),
Education = c("Bachelors", "Masters", "PhD", "MBA"),
stringsAsFactors = FALSE
)
# 2) Combine into a list
data <- list(df1, df2, df3)
# 3) Create the agent
data_wrangling_agent <- build_data_wrangling_agent(
model = my_llm_wrapper,
human_validation = FALSE,
bypass_recommended_steps = FALSE,
bypass_explain_code = FALSE,
verbose = FALSE
)
# 4) Define the initial state
initial_state <- list(
data_raw = data,
user_instructions = "Merge the data frames on the ID column.",
max_retries = 3,
retry_count = 0
)
# 5) Run the agent
final_state <- data_wrangling_agent(initial_state)
## End(Not run)
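The merge this example asks for can be sanity-checked by hand with base R; the agent's generated code should be behaviourally equivalent to (illustrative only):

```r
df1 <- data.frame(ID = 1:4, Name = c("John", "Jane", "Jim", "Jill"))
df2 <- data.frame(ID = 1:4, Age  = c(25, 30, 35, 40))
df3 <- data.frame(ID = 1:4, Education = c("Bachelors", "Masters", "PhD", "MBA"))
# Fold the list of frames into a single frame on the shared key.
merged <- Reduce(function(x, y) merge(x, y, by = "ID"), list(df1, df2, df3))
```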
Build a Document Summarizer Agent
Description
Creates an LLM-powered document summarization workflow that processes PDF, DOCX, PPTX, TXT, or plain text input and returns structured markdown summaries.
Usage
build_doc_summarizer_agent(
llm,
summary_template = NULL,
chunk_size = 4000,
overlap = 200,
verbose = TRUE
)
Arguments
llm |
A function that accepts a character prompt and returns an LLM response. |
summary_template |
Optional custom summary template in markdown format. |
chunk_size |
Maximum character length for document chunks (default: 4000). |
overlap |
Character overlap between chunks (default: 200). |
verbose |
Logical controlling progress messages (default: TRUE). |
Value
A function that accepts file paths or text input and returns:
summary - The generated markdown summary
metadata - Document metadata if available
chunks - Number of processing chunks used
success - Logical indicating success
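'chunk_size' and 'overlap' control a sliding character window over the document; the scheme can be illustrated in base R (an illustration of what the parameters mean, not the package's internal splitter):

```r
# Illustrative overlapping character chunking (not the package internals).
chunk_text <- function(text, chunk_size = 4000, overlap = 200) {
  starts <- seq(1, max(nchar(text) - overlap, 1), by = chunk_size - overlap)
  vapply(starts, function(s) substr(text, s, s + chunk_size - 1), character(1))
}
```

Each chunk repeats the last 'overlap' characters of its predecessor, so sentences that fall on a boundary appear whole in at least one chunk.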
Examples
## Not run:
# Build document summarizer agent
summarizer_agent <- build_doc_summarizer_agent(
llm = my_llm_wrapper,
summary_template = NULL,
chunk_size = 4000,
overlap = 200,
verbose = FALSE
)
# Summarize document
final_state <- summarizer_agent(
"https://github.com/knowusuboaky/LLMAgentR/raw/main/tests/testthat/test-data/scrum.docx"
)
## End(Not run)
Build a Feature Engineering Agent
Description
Constructs a graph-based feature engineering agent that guides the process of: recommending, generating, executing, fixing, and explaining feature engineering code.
Arguments
model |
A function that accepts a prompt and returns an LLM-generated response. |
human_validation |
Logical; include a manual review node before code execution. |
bypass_recommended_steps |
Logical; skip the LLM-based recommendation phase. |
bypass_explain_code |
Logical; skip final explanation step. |
verbose |
Logical; whether to print progress messages (default: TRUE) |
Value
A callable agent function that executes feature engineering via a state graph.
Examples
## Not run:
# 1) Load the data
data <- read.csv("tests/testthat/test-data/churn_data.csv")
# 2) Create the feature engineering agent
feature_engineering_agent <- build_feature_engineering_agent(
model = my_llm_wrapper,
human_validation = FALSE,
bypass_recommended_steps = FALSE,
bypass_explain_code = FALSE,
verbose = TRUE
)
# 3) Define the initial state
initial_state <- list(
data_raw = data,
target_variable = "Churn",
user_instructions = "Inspect the data. Make any new features and transformations
that you think will be useful for predicting the target variable.",
max_retries = 3,
retry_count = 0
)
# 4) Run the agent
final_state <- feature_engineering_agent(initial_state)
## End(Not run)
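A frequently recommended step for a categorical target like 'Churn' is one-hot encoding of factor predictors; a base-R illustration of that single step (the agent's generated code will differ, and may use 'fastDummies' or 'recipes' instead):

```r
# One-hot encode a categorical column with model.matrix(); "- 1" drops the
# intercept so every level gets its own indicator column.
df <- data.frame(Contract = c("Monthly", "Yearly", "Monthly"))
onehot <- model.matrix(~ Contract - 1, data = df)
```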
Build a Time Series Forecasting Agent
Description
Constructs a state graph-based forecasting agent that: recommends forecasting steps, extracts parameters, generates code, executes the forecast using 'modeltime', fixes errors if needed, and explains the result. It leverages multiple models including Prophet, XGBoost, Random Forest, SVM, and Prophet Boost, and combines them in an ensemble.
Arguments
model |
A function that takes a prompt and returns an LLM-generated result. |
bypass_recommended_steps |
Logical; skip initial step recommendation. |
bypass_explain_code |
Logical; skip the final explanation step. |
mode |
Visualization mode for forecast plots. One of '"light"' or '"dark"'. |
line_width |
Line width used in plotly forecast visualization. |
verbose |
Logical; whether to print progress messages. |
Value
A callable agent function that mutates the given 'state' list.
Examples
## Not run:
# 1) Prepare the dataset
my_data <- walmart_sales_weekly
# 2) Create the forecasting agent
forecasting_agent <- build_forecasting_agent(
model = my_llm_wrapper,
bypass_recommended_steps = FALSE,
bypass_explain_code = FALSE,
mode = "dark", # dark or light
line_width = 3,
verbose = FALSE
)
# 3) Define the initial state
initial_state <- list(
user_instructions = "Forecast sales for the next 30 days, using `id` as the grouping variable,
a forecasting horizon of 30, and a confidence level of 90%.",
data_raw = my_data
)
# 4) Run the agent
final_state <- forecasting_agent(initial_state)
## End(Not run)
Build an Interpreter Agent
Description
Constructs an LLM-powered agent that explains plots, tables, text, or other outputs for both technical and non-technical audiences.
Arguments
llm |
A function that accepts a character 'prompt' and returns an LLM response. |
interpreter_prompt |
Optional template for the prompt (default supplied). |
code_output |
The output to interpret (plot caption, table text, model summary, etc.). **Default 'NULL'** – omit to obtain a reusable agent. |
max_tries |
Max LLM retry attempts (default '3'). |
backoff |
Seconds between retries (default '2'). |
verbose |
Logical; print progress (default 'TRUE'). |
Details
**Two calling patterns**
**Builder pattern** – omit 'code_output'; a reusable interpreter-agent closure is returned.
**One-shot pattern** – provide 'code_output'; the function runs immediately and returns the interpretation.
Value
If 'code_output' is 'NULL': a function (the interpreter-agent closure).
Otherwise: a list with
- prompt
The full prompt sent to the LLM.
- interpretation
The LLM’s explanation (or error).
- success
Logical; did it succeed?
- attempts
Number of attempts made.
Examples
## Not run:
## 1) Builder pattern --------------------------------------------
interp <- build_interpreter_agent(llm = my_llm_wrapper, verbose = FALSE)
table_txt <- "
| Region | Sales | Profit |
| North | 2000 | 300 |
| South | 1500 | 250 |"
res1 <- interp(table_txt)
res2 <- interp("R² = 0.87 for the fitted model …")
## 2) One-shot pattern -------------------------------------------
build_interpreter_agent(
llm = my_llm_wrapper,
code_output = table_txt,
verbose = FALSE
)
## End(Not run)
Build a Web Researcher Agent
Description
Constructs an LLM-powered research agent that performs web searches (via Tavily API) and generates structured responses based on search results. The agent handles different question types (general knowledge, comparisons, controversial topics) with appropriate response formats.
Arguments
llm |
A function that accepts a character prompt and returns an LLM response. (It must accept 'prompt' and optionally 'verbose'.) |
tavily_search |
Tavily API key as a string or NULL to use 'Sys.getenv("TAVILY_API_KEY")'. |
system_prompt |
Optional custom system prompt for the researcher agent. |
max_results |
Number of web search results to retrieve per query (default: 5). |
max_tries |
Maximum number of retry attempts for search or LLM call (default: 3). |
backoff |
Initial wait time in seconds between retries (default: 2). |
verbose |
Logical flag to control progress messages (default: TRUE). |
Value
A function that accepts a user query string and returns a list with:
query - The original research query.
prompt - The full prompt sent to the LLM.
response - The generated LLM response.
search_results - Raw search results (if any were found).
success - Logical indicating if research succeeded (both search and LLM).
Examples
## Not run:
# Initialize researcher agent
researcher_agent <- build_researcher_agent(
llm = my_llm_wrapper,
tavily_search = NULL,
system_prompt = NULL,
max_results = 5,
max_tries = 3,
backoff = 2,
verbose = FALSE
)
# Perform research
result <- researcher_agent("Who is Messi?")
## End(Not run)
Build a SQL Agent Graph
Description
This function constructs a full SQL database agent using a graph-based workflow. It supports step recommendation, SQL code generation, error handling, optional human review, and automatic explanation of the final code.
Arguments
model |
A function that accepts prompts and returns LLM responses. |
connection |
A DBI connection object to the target SQL database. |
n_samples |
Number of candidate SQL plans to consider (used in prompt). |
human_validation |
Whether to include a human review node. |
bypass_recommended_steps |
If TRUE, skip the step recommendation node. |
bypass_explain_code |
If TRUE, skip the final explanation step. |
verbose |
Logical indicating whether to print progress messages (default: TRUE). |
Value
A compiled SQL agent function that runs via a state machine (graph execution).
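The agent requires a live 'DBI' connection; for experimentation without a database file, an in-memory SQLite database works (both 'DBI' and 'RSQLite' are in Imports; the table and column names here are made up for illustration):

```r
library(DBI)
# Throwaway in-memory database with a toy table.
conn <- dbConnect(RSQLite::SQLite(), ":memory:")
dbWriteTable(conn, "orders", data.frame(
  Region    = c("North", "South", "North"),
  UnitPrice = c(10, 20, 15),
  Quantity  = c(3, 5, 2)
))
# The kind of aggregate query the agent would generate for such a task:
res <- dbGetQuery(conn,
  "SELECT Region, SUM(UnitPrice * Quantity) AS TotalSales
   FROM orders GROUP BY Region")
dbDisconnect(conn)
```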
Examples
## Not run:
# 1) Connect to the database
conn <- DBI::dbConnect(RSQLite::SQLite(), "tests/testthat/test-data/northwind.db")
# 2) Create the SQL agent
sql_agent <- build_sql_agent(
model = my_llm_wrapper,
connection = conn,
human_validation = FALSE,
bypass_recommended_steps = FALSE,
bypass_explain_code = FALSE,
verbose = FALSE
)
# 3) Define the initial state
initial_state <- list(
user_instructions = "Identify the Regions (or Territories) with the highest
CustomerCount and TotalSales.
Return a table with columns: Region, CustomerCount, and TotalSales.
Hint: (UnitPrice × Quantity).",
max_retries = 3,
retry_count = 0
)
# 4) Run the agent
final_state <- sql_agent(initial_state)
## End(Not run)
Build Visualization Agent
Description
Creates a data visualization agent with configurable workflow steps.
Arguments
model |
The AI model function to use for code generation |
human_validation |
Whether to include human validation step (default: FALSE) |
bypass_recommended_steps |
Skip recommendation step (default: FALSE) |
bypass_explain_code |
Skip explanation step (default: FALSE) |
function_name |
Name for generated visualization function (default: "data_visualization") |
verbose |
Whether to print progress messages (default: TRUE) |
Value
A function that takes state and returns visualization results
Examples
## Not run:
# 1) Load the data
data <- read.csv("tests/testthat/test-data/churn_data.csv")
# 2) Create the visualization agent
visualization_agent <- build_visualization_agent(
model = my_llm_wrapper,
human_validation = FALSE,
bypass_recommended_steps = FALSE,
bypass_explain_code = FALSE,
verbose = FALSE
)
# 3) Define the initial state
initial_state <- list(
data_raw = data,
target_variable = "Churn",
user_instructions = "Create a clean and visually appealing box plot to show
the distribution of Monthly Charges across Churn categories.
Use distinct colors for each Churn group,
add clear axis labels, a legend, and a meaningful title.",
max_retries = 3,
retry_count = 0
)
# 4) Run the agent
final_state <- visualization_agent(initial_state)
## End(Not run)
Build a Weather Agent
Description
Constructs an LLM-powered weather assistant that fetches data from OpenWeatherMap and generates user-friendly reports. Handles location parsing, API calls, caching, and LLM-based summarization.
Arguments
llm |
A function that accepts a character prompt and returns an LLM response. |
location_query |
Free-text location query (e.g., "weather in Toronto"). |
system_prompt |
Optional LLM system prompt for weather reporting. |
weather_api_key |
OpenWeatherMap API key (defaults to OPENWEATHERMAP_API_KEY env var). |
units |
Unit system ("metric" or "imperial"). |
n_tries |
Number of retry attempts for API/LLM calls (default: 3). |
backoff |
Base seconds to wait between retries (default: 2). |
endpoint_url |
OpenWeatherMap endpoint URL. |
verbose |
Logical controlling progress messages (default: TRUE). |
Value
A list containing:
success - Logical indicating if operation succeeded
location - Cleaned location string
weather_raw - Raw API response
weather_formatted - Formatted weather string
llm_response - Generated weather report
timestamp - Time of response
cache_hit - Logical indicating cache usage
attempts - Number of tries made
Examples
## Not run:
# Get weather information
weather_agent <- build_weather_agent(
llm = my_llm_wrapper,
location_query = "Tokyo, Japan",
system_prompt = NULL,
weather_api_key = NULL,
units = "metric", # metric or imperial
n_tries = 3,
backoff = 2,
endpoint_url = NULL,
verbose = FALSE
)
## End(Not run)