TreeinstructSampleEvaluator

About 297 wordsLess than 1 minute

2025-10-09

📘 Overview

The TreeinstructSampleEvaluator is an operator designed to measure the complexity of a given instruction by analyzing the size of its syntax tree. It leverages a large language model (LLM) to generate a syntax tree for the instruction and then calculates a complexity score based on the number of nodes in that tree. A higher node count signifies a more complex instruction.

`init`

def __init__(self, llm_serving: LLMServingABC = None)

Parameter	Type	Default	Description
llm_serving	LLMServingABC	None	An instance of an LLM serving class that handles model inference.

Prompt Template Descriptions

Prompt Template Name	Primary Purpose	Applicable Scenarios	Feature Description

`run`

def run(self, storage: DataFlowStorage, input_instruction_key: str, output_key: str='TreeinstructScore')

Parameter	Type	Default	Description
storage	DataFlowStorage	Required	DataFlow storage instance for reading and writing data.
input_instruction_key	str	Required	The column name in the input DataFrame that contains the instruction text.
output_key	str	"TreeinstructScore"	The column name for the generated complexity score.

🧠 Example Usage

from dataflow.operators.text_sft.eval import TreeinstructSampleEvaluator
from dataflow.utils.storage import FileStorage
from dataflow.utils.llm_serving import APILLMServing_request

# Prepare storage with instruction-only data
storage = FileStorage(first_entry_file_name="sft_instructions.jsonl")

# Initialize LLM serving
llm_serving = APILLMServing_request(
    api_url="http://<your_llm_api_endpoint>",
    model_name="<your_model_name>",
)

# Initialize and run the evaluator
evaluator = TreeinstructSampleEvaluator(llm_serving=llm_serving)
evaluator.run(
    storage.step(),
    input_instruction_key="instruction",
    output_key="TreeinstructScore",
)

🧾 Default Output Format

Field	Type	Description
(original_fields)	any	The original fields from the input data are preserved.
TreeinstructScore	float	The calculated complexity score for the instruction.

Example Input:

{
  "instruction": "Can you provide a list of healthy habits to maintain a healthy lifestyle? Please format your response as an HTML page with bullet points."
}

Example Output:

{
  "instruction": "Can you provide a list of healthy habits to maintain a healthy lifestyle? Please format your response as an HTML page with bullet points.",
  "TreeinstructScore": 11.0
}

eval

generate

eval

generate

eval

filter

generate

eval

filter

generate

generate

eval

filter

refine

generate

generate

generate

eval

filter

refine

generate

generate

eval

filter

generate

eval

filter

generate

eval

generate

filter

eval

filter

generate

refine

TreeinstructSampleEvaluator

📘 Overview

__init__

Prompt Template Descriptions

run

🧠 Example Usage

🧾 Default Output Format

`init`

`run`