diff --git a/torchtune/datasets/_alpaca.py b/torchtune/datasets/_alpaca.py index 4254cb7cb6..a881c149b0 100644 --- a/torchtune/datasets/_alpaca.py +++ b/torchtune/datasets/_alpaca.py @@ -51,7 +51,7 @@ def alpaca_dataset( :class:`~torchtune.data.AlpacaToMessages` to the new column names in the dataset. Keys should be "instruction", "input", and "output" and values should be the actual column names. If None, uses the default column names ``"instruction``, ``"input"``, and ``"output"`` in ``tatsu-lab/alpaca``. - train_on_input (bool): Whether the model is trained on the prompt or not. Default is False. + train_on_input (bool): Whether the model is trained on the prompt or not. Default is True. packed (bool): Whether or not to pack the dataset to ``max_seq_len`` prior to training. Default is False. filter_fn (Optional[Callable]): callable used to filter the dataset prior to any pre-processing. See the Hugging Face `docs `_ for more