Framework Design


2025-06-13

Overview

DataFlex is an advanced dynamic training framework built on LlamaFactory. It intelligently schedules data during training, supporting dynamic sample selection, domain ratio adjustment, and dynamic weight allocation to improve training efficiency and final model performance.

Design Philosophy

The core design philosophy of DataFlex is data-centric, intelligent training scheduling. Traditional training typically uses a fixed data order and fixed ratios, whereas DataFlex lets the data usage strategy change during training based on the model's current state, with the aim of more efficient learning. It is designed to integrate seamlessly with LlamaFactory, giving researchers and developers more flexible and powerful control over training.

Data selection often requires embedding, inference, and gradient computation over data samples. DataFlex is designed to manage these three operations (embedding, large model inference, and gradient computation) within a single unified framework.

Core Architecture

Overall Architecture Diagram

┌───────────────────────────────────────────────────────────────────────────────┐
│                           LlamaFactory Framework                              │
├───────────────────────────────────────────────────────────────────────────────┤
│                  Model Management · Data Processing · Optimizers              │
├───────────────────────────────────────────────────────────────────────────────┤
│            Training Layer (DataFlex replaces LlamaFactory trainer)            │
│  ┌────────────────────────┬────────────────────────┬────────────────────────┐ │
│  │      Select Trainer    │       Mix Trainer      │     Weight Trainer     │ │
│  │   (Dynamic Selection)  │      (Dynamic Ratio)   │     (Dynamic Weights)  │ │
│  ├────────────────────────┼────────────────────────┼────────────────────────┤ │
│  │  Selector Components   │    Mixer Components    │   Weighter Components  │ │
│  │  ┌──────────────────┐  │  ┌──────────────────┐  │  ┌───────────────────┐ │ │
│  │  │  Loss Selector   │  │  │   Random Mixer   │  │  │   Loss Weighter   │ │ │
│  │  │  LESS Selector   │  │  │   Custom Mixer   │  │  │  Custom Weighter  │ │ │
│  │  │   Custom ...     │  │  │       ...        │  │  │        ...        │ │ │
│  │  └──────────────────┘  │  └──────────────────┘  │  └───────────────────┘ │ │
│  └────────────────────────┴────────────────────────┴────────────────────────┘ │
└───────────────────────────────────────────────────────────────────────────────┘

Component Hierarchy

DataFlex adopts a modular design with the following layers:

Base Layer (LlamaFactory): Provides model management, data processing, optimizers, and other basic components
Trainer Layer (DataFlex Trainers): Replaces LlamaFactory's original trainer, implementing three dynamic training modes
Strategy Component Layer (Components): Provides specific data processing strategies (Selector/Mixer/Weighter)
Registry System: Manages component registration and loading

Key Feature: DataFlex does not add a new layer on top of LlamaFactory. Instead, it replaces LlamaFactory's training layer, preserving the original functionality while extending training capabilities.
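To make the Registry System concrete, here is a minimal sketch of how strategy components could be registered by name and built on demand. It is an illustration only: the `Registry` class, the `register` and `build` methods, and the `LossSelector` signature are all hypothetical names chosen for this example, not DataFlex's actual API.

```python
from typing import Callable, Dict, List, Sequence, Type


class Registry:
    """Hypothetical name-based registry (not DataFlex's real implementation)."""

    def __init__(self) -> None:
        # Maps component kind ("selector", "mixer", ...) to {name: class}.
        self._store: Dict[str, Dict[str, Type]] = {}

    def register(self, kind: str, name: str) -> Callable[[Type], Type]:
        """Return a class decorator that records the class under kind/name."""
        def decorator(cls: Type) -> Type:
            bucket = self._store.setdefault(kind, {})
            if name in bucket:
                raise KeyError(f"{kind}/{name} is already registered")
            bucket[name] = cls
            return cls
        return decorator

    def build(self, kind: str, name: str, **kwargs):
        """Look up a registered class and instantiate it with kwargs."""
        try:
            cls = self._store[kind][name]
        except KeyError:
            raise KeyError(f"unknown {kind} component: {name!r}") from None
        return cls(**kwargs)


REGISTRY = Registry()


@REGISTRY.register("selector", "loss")
class LossSelector:
    """Illustrative selector: keeps the top_k samples with the highest loss."""

    def __init__(self, top_k: int = 2) -> None:
        self.top_k = top_k

    def select(self, losses: Sequence[float]) -> List[int]:
        ranked = sorted(range(len(losses)), key=lambda i: losses[i], reverse=True)
        return ranked[: self.top_k]


# A trainer could then load a component from a config string.
selector = REGISTRY.build("selector", "loss", top_k=2)
```

With this pattern, adding a custom component means defining a class and decorating it; the trainer code that calls `build` does not need to change.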

Three Core Trainer Concepts

DataFlex provides three core trainers that can seamlessly integrate into LlamaFactory's training pipeline:

Select Trainer (Dynamic Selection Trainer): During training, dynamically selects a subset of samples from the dataset according to a predefined strategy (Selector) and trains on that subset, e.g., prioritizing samples the model currently finds difficult.
Mix Trainer (Dynamic Ratio Trainer): Supports dynamic adjustment of mixing ratios for data from different sources or domains during training.
Weight Trainer (Dynamic Weighting Trainer): Supports dynamic adjustment of per-sample weights during backpropagation, so that samples favored by the weighting strategy (Weighter) contribute more to each update.
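The three modes can be illustrated with a small, framework-free sketch. The function names below (`select_hardest`, `sample_domains`, `weighted_loss`) are hypothetical and operate on plain Python lists. They only show the idea behind each mode, under the assumption that per-sample losses are already available. They are not DataFlex's implementation.

```python
import random
from typing import Dict, List, Sequence


def select_hardest(losses: Sequence[float], k: int) -> List[int]:
    """Selection: return indices of the k samples with the highest loss."""
    order = sorted(range(len(losses)), key=lambda i: losses[i], reverse=True)
    return order[:k]


def sample_domains(ratios: Dict[str, float], batch_size: int,
                   rng: random.Random) -> List[str]:
    """Mixing: draw the source domain of each batch slot from the current ratios."""
    names = list(ratios)
    return rng.choices(names, weights=[ratios[n] for n in names], k=batch_size)


def weighted_loss(losses: Sequence[float], weights: Sequence[float]) -> float:
    """Weighting: weighted mean of per-sample losses, normalized by total weight."""
    total = sum(weights)
    if total <= 0:
        raise ValueError("weights must sum to a positive value")
    return sum(l * w for l, w in zip(losses, weights)) / total


# Example usage with made-up numbers.
rng = random.Random(0)
hard_idx = select_hardest([0.2, 1.3, 0.7], k=2)
domains = sample_domains({"code": 0.7, "math": 0.3}, batch_size=8, rng=rng)
loss = weighted_loss([1.0, 3.0], [1.0, 3.0])
```

In a real trainer the ratios and weights would be recomputed periodically from the model's state; here they are fixed inputs so that each function stays easy to inspect.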