Code and Data for Empirical and Synthetic Experiments on Formula L Optimization

Name: Code and Data for Empirical and Synthetic Experiments on Formula L Optimization
Creator: Raman Marozau
License: https://creativecommons.org/licenses/by/4.0/
Keywords: Artificial Intelligence

Citation Author(s): Raman Marozau
Submitted by: Raman Marozau
Last updated: Tue, 01/21/2025 - 23:18
DOI: 10.21227/d481-zv33
Data Format: *.json;*.py;*.txt;*.zip

Categories:

Artificial Intelligence

Keywords:

large language models

Multi-agent systems

Formula L

Semantic Task Allocation

artificial intelligence

Contextual Deviation

Dynamic Adaptation

Autonomous Systems.

Abstract

This dataset provides the foundational resources for evaluating and optimizing Formula L, a novel mathematical framework for semantic-driven task allocation in multi-agent systems (MAS) powered by large language models (LLMs). The dataset includes Python code and both empirical and synthetic data, specifically designed to validate the effectiveness of Formula L in improving task distribution, contextual relevance, and dynamic adaptation within MAS.

The dataset comprises:

Empirical data derived from real-world task allocation scenarios, demonstrating the practical application of Formula L in domains such as logistics and autonomous systems.
Synthetic data generated to simulate diverse and complex scenarios for benchmarking the scalability and robustness of the proposed framework.
Python code implementing Formula L, along with detailed experiments and evaluation scripts.
JSON files containing structured task definitions, historical context, and relevance metrics for both empirical and synthetic cases.

This dataset supports reproducibility, further development, and comparative analysis for researchers and practitioners exploring task allocation strategies in MAS-LLM environments. By integrating semantic evaluation and optimization principles, it offers a robust foundation for advancing autonomous decision-making systems.

Instructions:

# Documentation for LLM Framework Experimental Section

## Overview

This project provides the experimental section of an LLM framework for analyzing and optimizing task prioritization and contextual dependencies using a novel mathematical formulation, the **L-function**. The system is designed to improve performance in Multi-Agent Systems (MAS), with emphasis on resource allocation, historical context, and task-specific deviations.

## Prerequisites

1. Python 3.11 or higher.

2. Required Python packages (install using `pip install -r requirements.txt`):

- `matplotlib`

- `numpy`

- `pandas`

- `sentence_transformers`

- ...
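
A quick way to confirm the environment before running any experiment is to check the interpreter version and whether the listed packages can be imported. The sketch below is not part of the dataset: the helper names `missing_packages` and `python_ok` are hypothetical, and it covers only the four packages named above (`requirements.txt` may list more).

```python
import importlib.util
import sys

# Packages named in the list above; requirements.txt may contain more.
REQUIRED = ["matplotlib", "numpy", "pandas", "sentence_transformers"]


def missing_packages(names):
    """Return the names that cannot be found as importable top-level modules."""
    return [name for name in names if importlib.util.find_spec(name) is None]


def python_ok(minimum=(3, 11)):
    """Return True if the running interpreter meets the minimum version."""
    return sys.version_info[:2] >= minimum


if __name__ == "__main__":
    if not python_ok():
        print("Python 3.11 or higher is required.")
    for name in missing_packages(REQUIRED):
        print(f"Missing package: {name}")
```

If the script prints nothing, the interpreter version and the four named packages are in place.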

# Project Structure

This section outlines the directory structure of the project, providing an overview of the organization and purpose of each folder and file.

## Root Directory

The root directory contains the main source folder (`src`) where all experiments and utility scripts are located.

### `src/` Directory

The `src/` folder is the main working directory, structured as follows:

```
src/
├── experiments/
│   ├── .data/
│   ├── a.task_priority/
│   │   ├── experimental_data/
│   │   ├── a.a.task_priority.histograms.py
│   │   ├── a.b.task_priority.histograms.analysis.py
│   │   ├── a.c.task_priority.heatmaps.py
│   ├── b.deviation_task/
│   │   ├── experimental_data/
│   │   ├── b.a.deviation_task.synth.py
│   │   ├── b.b.deviation_task.experiment.empirical.py
│   ├── c.deviation_historical/
│   │   ├── experimental_data/
│   │   ├── c.a.deviation_historical.model.by_agent.py
│   │   ├── c.b.deviation_historical.analysis.py
│   ├── d.llm_formula/
│   │   ├── experimental_data/
│   │   ├── d.a.llm_formula.empirical.py
├── utils/
│   ├── files.py
│   ├── objects.py
│   ├── strings.py
```

---

### `experiments/` Directory

This directory contains all experimental modules grouped by specific experiment types. Each subdirectory contains scripts and `experimental_data` folders for data storage.

#### `.data/`

- Holds initial datasets required for experiments.

#### `a.task_priority/`

- Focuses on task prioritization experiments and visualization.

- **Subdirectories:**

  - `experimental_data/`: Stores priority-related data produced by the experiments.

- **Key Scripts:**

  - `a.a.task_priority.histograms.py`: Generates histograms for the priority coefficients (`\alpha`, `\beta`, `\gamma`).

  - `a.b.task_priority.histograms.analysis.py`: Analyzes the generated histogram data.

  - `a.c.task_priority.heatmaps.py`: Creates heatmaps to visualize the relationships between priority coefficients.

#### `b.deviation_task/`

- Handles experiments related to task-specific deviations.

- **Subdirectories:**

  - `experimental_data/`: Stores task-related data produced by the experiments.

- **Key Scripts:**

  - `b.a.deviation_task.synth.py`: Generates synthetic data for task-specific experiments.

  - `b.b.deviation_task.experiment.empirical.py`: Executes empirical validation experiments for task deviations.

#### `c.deviation_historical/`

- Focuses on analyzing historical context deviations.

- **Subdirectories:**

  - `experimental_data/`: Stores data produced by the historical deviation experiments.

- **Key Scripts:**

  - `c.a.deviation_historical.model.by_agent.py`: Simulates historical deviation models for agents.

  - `c.b.deviation_historical.analysis.py`: Performs data analysis for historical context deviations.

#### `d.llm_formula/`

- Dedicated to the experimental validation of the **L-formula**.

- **Subdirectories:**

  - `experimental_data/`: Stores the empirical datasets produced by the experiments.

- **Key Scripts:**

  - `d.a.llm_formula.empirical.py`: Runs empirical analysis for L-formula experiments.

---

### `utils/` Directory

Utility scripts for shared functionality across experiments.

- `files.py`: Manages file-related operations.

- `objects.py`: Defines reusable object structures and methods.

- `strings.py`: Handles string manipulations and formatting.

---

## Notes on Directory Organization

- **Alphabetical Order**: File and directory names are alphabetically prefixed (e.g., `a.`, `b.`, `c.`) to define the order of execution, ensuring consistency across operating systems.

- **Experimental Data**: Each experiment has a dedicated `experimental_data` folder to store datasets used during execution.

---

This structure ensures modularity, clarity, and ease of navigation for contributors to the project.

## Getting Started

### Task Prioritization with `lambda`

1. **Generate Priority Histograms**:

File: `a.a.task_priority.histograms.py`

- Generates histograms and initial datasets for priority coefficients `\alpha`, `\beta`, `\gamma`.

- Default values: `[0.01, 0.2, 0.5, 0.9, 1]`.

- Run:

```bash

python src/experiments/a.task_priority/a.a.task_priority.histograms.py

```

2. **Analyze Histogram Data**:

File: `a.b.task_priority.histograms.analysis.py`

- Processes histogram datasets to produce heatmaps of dependencies.

- Run:

```bash

python src/experiments/a.task_priority/a.b.task_priority.histograms.analysis.py

```

3. **Create Priority Heatmaps**:

File: `a.c.task_priority.heatmaps.py`

- Generates heatmaps visualizing the relationship between priority coefficients.

- Run:

```bash

python src/experiments/a.task_priority/a.c.task_priority.heatmaps.py

```
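
The three steps above all work over combinations of the priority coefficients. As a rough illustration of the size of that parameter space, the sketch below enumerates every (`\alpha`, `\beta`, `\gamma`) triple from the default values listed in step 1. It is an assumption that the scripts sweep the full Cartesian product; the function name `coefficient_grid` is hypothetical and does not come from the dataset's code.

```python
from itertools import product

# Default coefficient values stated in step 1.
DEFAULT_VALUES = [0.01, 0.2, 0.5, 0.9, 1]


def coefficient_grid(values=DEFAULT_VALUES):
    """Return every (alpha, beta, gamma) triple drawn from `values`."""
    return list(product(values, repeat=3))


grid = coefficient_grid()
print(len(grid))  # 125 triples for five values per coefficient
print(grid[0])    # (0.01, 0.01, 0.01)
```

Under that assumption, a heatmap over any two coefficients would be a 5 x 5 slice of this grid with the third coefficient held fixed.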

### Task-Specific Deviation Experiments

1. **Independent Task Experiment**:

File: `b.a.deviation_task.synth.py`

- Generates random values for the experimental conditions to test the formula independently.

- Output includes deviation metrics and alignment values.

- Run:

```bash

python src/experiments/b.deviation_task/b.a.deviation_task.synth.py

```

2. **Synchronized Task Experiment**:

File: `b.b.deviation_task.experiment.empirical.py`

- `lambda` values are defined as sinusoidal dependencies, with model outputs aligned to cosine patterns for maximal consistency.

- Run:

```bash

python src/experiments/b.deviation_task/b.b.deviation_task.experiment.empirical.py

```
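
The two input regimes described above can be sketched as follows: independent random draws for the first experiment, and a sine-driven `lambda` paired with a cosine-shaped model output for the second. This only illustrates the input patterns. The function names, the sample count, and the single-period range are assumptions, and the deviation metric itself is not reproduced here because its definition lives in the dataset's scripts.

```python
import math
import random


def independent_inputs(n, seed=0):
    """Draw lambda and model-output values independently and uniformly from [0, 1)."""
    rng = random.Random(seed)
    lam = [rng.random() for _ in range(n)]
    out = [rng.random() for _ in range(n)]
    return lam, out


def synchronized_inputs(n):
    """Pair a sinusoidal lambda with a cosine-shaped output over one period (n >= 2)."""
    steps = [2.0 * math.pi * i / (n - 1) for i in range(n)]
    return [math.sin(t) for t in steps], [math.cos(t) for t in steps]


lam, out = synchronized_inputs(100)
# sin^2 + cos^2 = 1 at every step, so the two series stay phase-locked.
print(all(abs(a * a + b * b - 1.0) < 1e-9 for a, b in zip(lam, out)))
```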

### Historical Context Deviation Experiments

1. **Progressive Experiment with Fibonacci Sequences**:

File: `c.a.deviation_historical.model.by_agent.py`

- Evaluates deviations using context windows based on Fibonacci numbers to assess argumentation validity.

- Run:

```bash

python src/experiments/c.deviation_historical/c.a.deviation_historical.model.by_agent.py

```

2. **Mean Historical Context Deviation Experiment by Window Size**:

File: `c.b.deviation_historical.analysis.py`

- Evaluates the minimum, maximum, and average deviations using context windows based on Fibonacci numbers to assess argumentation validity.

- Run:

```bash

python src/experiments/c.deviation_historical/c.b.deviation_historical.analysis.py

```
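
The window-based evaluation described above can be sketched under two assumptions: that "based on Fibonacci numbers" means the window sizes are the Fibonacci numbers 1, 2, 3, 5, 8, ..., and that deviation can be stood in for by the absolute difference between a value and the mean of the preceding window. Neither is confirmed by the dataset; the function names and the toy history are hypothetical.

```python
from statistics import mean


def fibonacci_windows(limit):
    """Return the distinct Fibonacci numbers 1, 2, 3, 5, ... not exceeding `limit`."""
    sizes, a, b = [], 1, 2
    while a <= limit:
        sizes.append(a)
        a, b = b, a + b
    return sizes


def window_deviation_stats(history, window):
    """Min, max, and mean of |value - mean(previous `window` values)| over `history`.

    This absolute-difference deviation is a placeholder, not the dataset's metric.
    """
    deviations = [
        abs(history[i] - mean(history[i - window:i]))
        for i in range(window, len(history))
    ]
    return min(deviations), max(deviations), mean(deviations)


history = [float(i % 4) for i in range(30)]  # toy sequence, not dataset values
for w in fibonacci_windows(13):
    print(w, window_deviation_stats(history, w))
```

The history must be longer than the largest window, otherwise there are no deviations to summarize.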

### L-Formula Optimization Experiments

1. **Experimental Parameter Tuning**:

File: `d.a.llm_formula.empirical.py`

- Optimizes `L` values by defining dynamic adjustment windows for key parameters.

- The L function experiments involve:

  - Managing the optimization window to determine whether the current value is optimal.

  - Testing the formula's minimums and scalability across the system's value ranges.

- The theoretical principles for balancing response brevity, task-specific alignment, and contextual relevance are applied directly here.

- Run:

```bash

python src/experiments/d.llm_formula/d.a.llm_formula.empirical.py

```
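
One way to picture the optimization window mentioned above is a fixed-size buffer of recent `L` values in which the newest value counts as optimal when it is the smallest in the buffer. This is a minimal sketch assuming that lower `L` is better and that the window is a simple sliding buffer; the class name `OptimizationWindow`, the window size, and the sample values are all hypothetical and not taken from `d.a.llm_formula.empirical.py`.

```python
from collections import deque


class OptimizationWindow:
    """Track the most recent L values and flag when the newest one is the window minimum."""

    def __init__(self, size):
        # deque(maxlen=size) silently drops the oldest value once the window is full.
        self.values = deque(maxlen=size)

    def update(self, value):
        """Add `value` and return True if it is the smallest value in the window."""
        self.values.append(value)
        return value <= min(self.values)


window = OptimizationWindow(size=3)
flags = [window.update(v) for v in [5.0, 4.0, 6.0, 3.0, 3.5]]
print(flags)  # [True, True, False, True, False]
```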

## Appendix

### Notes

- File names are alphabetically ordered to ensure consistent execution across different operating systems.

- The order of execution is critical. Follow the alphabetical file names to ensure proper dependency resolution.

- Experimental datasets are stored in the `src/experiments/.data` directory. These include base values for testing scenarios and parameter ranges.
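
Because execution order follows the alphabetical file names, the full pipeline can in principle be driven by sorting the script paths. The sketch below assumes it is run from the project root, that every `.py` file under `src/experiments` is meant to be executed, and that each script runs without arguments; the helper names `ordered_scripts` and `run_all` are hypothetical.

```python
import subprocess
import sys
from pathlib import Path


def ordered_scripts(root):
    """Return every .py file under `root`, sorted by its path relative to `root`."""
    root = Path(root)
    return sorted(root.rglob("*.py"), key=lambda p: p.relative_to(root).as_posix())


def run_all(root="src/experiments"):
    """Run each script in alphabetical order, stopping at the first failure."""
    for script in ordered_scripts(root):
        print(f"Running {script}")
        # check=True raises CalledProcessError if a script exits with a non-zero status.
        subprocess.run([sys.executable, str(script)], check=True)
```

Calling `run_all()` would then execute `a.a.task_priority.histograms.py` first and `d.a.llm_formula.empirical.py` last.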

### Contribution

Contributions are welcome! Please submit a pull request with a detailed description of your changes.

---

## License

This project is open-source and available under the [Apache License 2.0](LICENSE).
