DeitaQualitySampleEvaluator

About 267 wordsLess than 1 minute

2025-10-09

DeitaQualitySampleEvaluator is an operator designed to evaluate the quality of instruction-response pairs. It utilizes the hkust-nlp/deita-quality-scorer model to generate a quality score ranging from 1 to 6 for each sample.

`init` function

def __init__(self, device='cuda', model_cache_dir='./dataflow_cache', max_length=512)

Parameter	Type	Default	Description
device	str	'cuda'	The device to run the model on, e.g., 'cuda' or 'cpu'.
model_cache_dir	str	'./dataflow_cache'	The directory to cache the downloaded Hugging Face model.
max_length	int	512	The maximum sequence length for the model input.

Prompt Template Descriptions

Prompt Template Name	Primary Use	Applicable Scenarios	Feature Description

`run` function

def run(self, storage: DataFlowStorage, input_instruction_key: str = 'instruction', input_output_key: str = 'output', output_key: str = 'DeitaQualityScore')

Parameter	Type	Default	Description
storage	DataFlowStorage	Required	The DataFlow storage instance for reading the input DataFrame and writing the results.
input_instruction_key	str	'instruction'	The column name in the input DataFrame that contains the instruction text.
input_output_key	str	'output'	The column name in the input DataFrame that contains the response text.
output_key	str	'DeitaQualityScore'	The column name to store the generated quality score in the output DataFrame.

🧠 Example Usage

from dataflow.operators.text_sft.eval import DeitaQualitySampleEvaluator
from dataflow.utils.storage import FileStorage

# Prepare storage with instruction-output pairs
storage = FileStorage(first_entry_file_name="sft_data.jsonl")

# Initialize and run the evaluator
evaluator = DeitaQualitySampleEvaluator(
    device="cuda",
    model_cache_dir="./dataflow_cache",
    max_length=512,
)
evaluator.run(
    storage.step(),
    input_instruction_key="instruction",
    input_output_key="output",
    output_key="DeitaQualityScore",
)

🧾 Default output format (Output Format)

Field	Type	Description
instruction	str	The input instruction text.
output	str	The input response text.
DeitaQualityScore	float	The generated quality score, a value between 1 and 6.

eval

generate

eval

generate

eval

filter

generate

eval

filter

generate

generate

eval

filter

refine

generate

generate

generate

eval

filter

refine

generate

generate

eval

filter

generate

eval

filter

generate

eval

generate

filter

eval

filter

generate

refine

DeitaQualitySampleEvaluator

__init__ function

Prompt Template Descriptions

run function

🧠 Example Usage

🧾 Default output format (Output Format)

`init` function

`run` function