ORAN | IEEE DataPort

Accuracy, latency and overhead test in Federated Deep Reinforcement Learning, L2FPPO in Focus

With the advent of the 6G Open-RAN architecture, multiple operational services can run simultaneously in the RAN, leveraging the near-Real-Time RAN Intelligent Controller (near-RT-RIC) and real-time (RT) nodes. The architecture provides an ideal platform for Federated Learning (FL): the xApp hosted in the near-RT-RIC performs global aggregation, whereas the Open Radio Unit (ORU) allocates power in real time to users so that they can participate in FL. This paper identifies power and latency optimization as critical factors for enhancing FL in a stochastic environment.

Categories:

Machine Learning

Benchmark Dataset for Generative AI on Edge Devices

The benchmarking dataset, GenAI on the Edge, contains performance metrics from evaluating Large Language Models (LLMs) on edge devices, using a distributed testbed of Raspberry Pi devices orchestrated by Kubernetes (K3s). It includes performance data collected from multiple runs of prompt-based evaluations with various LLMs, using Prometheus and the Llama.cpp framework. The dataset captures key metrics such as resource utilization, token generation rate (throughput), and detailed inference timing for stages such as Sample, Prefill, and Decode.

