FormatChatGenerationSFT¶

Format the output of a ChatGeneration task for Supervised Fine-Tuning (SFT) following the

standard formatting from frameworks such as axolotl or alignment-handbook.

`FormatChatGenerationSFT` is a `Step` that formats the output of a `ChatGeneration` task for
Supervised Fine-Tuning (SFT) following the standard formatting from frameworks such as `axolotl`
or `alignment-handbook`. The output of the `ChatGeneration` task is formatted into a chat-like
conversation with the `instruction` as the user message and the `generation` as the assistant
message. Optionally, if the `system_prompt` is available, it is included as the first message
in the conversation.

Input & Output Columns¶

Inputs¶

system_prompt (str, optional): The system prompt used within the LLM to generate the generation, if available.
instruction (str): The instruction used to generate the generation with the LLM.
generation (str): The generation produced by the LLM.

Outputs¶

prompt (str): The instruction used to generate the generation with the LLM.
prompt_id (str): The SHA256 hash of the prompt.
messages (List[Dict[str, str]]): The chat-like conversation with the instruction as the user message and the generation as the assistant message.