Easy_Rec Config
===============

Dataset parameters 
-----------------------------------------
- ``name (str)`` – Name of the dataset to use, e.g. `ml-1m`, `amazon_beauty`, `behance`. Default to **ml-100k**.
- ``data_folder (str)`` – Path to the raw dataset folder. Default to **../data/raw/**.
- ``min_rating (int or float)`` – Minimum rating threshold. Interactions below this value will be filtered out. Default to **0**.
- ``min_items_per_user (int)`` – Minimum number of items a user must have interacted with. Users with fewer interactions are filtered out. Default to **5**.
- ``min_users_per_item (int)`` – Minimum number of users that must have interacted with an item. Items with fewer interactions are filtered out. Default to **5**.
- ``densify_index (bool)`` – If ``True``, user and item indices will be re-mapped to a contiguous range. Default to **True**.
- ``split_method (str)`` – Method used to split the dataset into train/val/test sets. Default to **leave_n_out**, which is currently the only option.
- ``test_sizes (list of int or null)`` – Number of final interactions kept for validation and test sets. Default to **[1, 1]**.
- ``dataset_params``
    - ``split_keys (dict)`` – Keys used to group data for splitting. Default to `{train: [sid, uid], val: [sid, uid], test: [sid, uid]}`.
- ``collator_params``
    - ``sequential_keys (list of str)`` – Keys used to identify sequence order. Default to **[sid]**.
    - ``padding_value (int)`` – Value used to pad sequences. Default to **0**.
    - ``lookback (int)`` – Number of past items to include in each training sample, i.e. the length of the input sequence. Default to **200**.
    - ``lookforward (int)`` – Number of future items to predict. Default to **1**.
    - ``simultaneous_lookforward (int)`` – Number of future steps included in a single prediction step. Default to **1**.
    - ``out_seq_len (dict)`` – Output sequence length per split.
        - ``train`` – Default to **null**.
        - ``val`` – Default to **1**.
        - ``test`` – Default to **1**.
    - ``num_negatives (dict)`` – Number of negative samples per positive example, set per split. To include all negatives for a split, set its value to `1.`.
        - ``train`` – Default to **1**.
        - ``val`` – Default to **100**.
        - ``test`` – Default to **100**.

    - ``negatives_distribution (str)`` – Strategy for sampling negatives. Default to **uniform**, which is currently the only option.
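
As an illustration, the dataset parameters above could be written out as follows. This is only a sketch that assumes the config is a YAML file and that these keys sit under ``data_params`` (the prefix used by the interpolation paths elsewhere on this page); the values are the documented defaults.

.. code-block:: yaml

    # Sketch only: the file layout and the enclosing `data_params` key are assumptions.
    data_params:
      name: ml-100k
      data_folder: ../data/raw/
      min_rating: 0
      min_items_per_user: 5
      min_users_per_item: 5
      densify_index: true
      split_method: leave_n_out      # currently the only option
      test_sizes: [1, 1]
      dataset_params:
        split_keys:
          train: [sid, uid]
          val: [sid, uid]
          test: [sid, uid]
      collator_params:
        sequential_keys: [sid]
        padding_value: 0
        lookback: 200
        lookforward: 1
        simultaneous_lookforward: 1
        out_seq_len:
          train: null
          val: 1
          test: 1
        num_negatives:
          train: 1
          val: 100
          test: 100
        negatives_distribution: uniform   # currently the only option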


Loader parameters
-----------------------------------------
- ``batch_size (int)`` – Number of samples processed in each training batch. Default to **128**.
- ``drop_last (bool)`` – If `True`, discards the last batch if it contains fewer than `batch_size` samples. Default to **True**. For more details, see https://discuss.pytorch.org/t/usage-of-drop-last-on-data-loader/66741.
- ``num_workers (int)`` – The number of workers processing the data. Default to **1**.
- ``shuffle (bool)`` – If `True`, shuffles the dataset at every epoch. Default to **True**.
- ``persistent_workers (bool)`` – If `True`, worker processes remain active between epochs to improve loading speed. Default to **False**.
- ``pin_memory (bool)`` –  If `True`, allocates tensors in pinned memory, which speeds up GPU transfers. Default to **False**.
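
A loader block with the defaults listed above might look like the sketch below. The enclosing key ``loader_params`` is a hypothetical name chosen for illustration; this page does not state where these options live in the config.

.. code-block:: yaml

    # Sketch only: `loader_params` is a hypothetical enclosing key.
    loader_params:
      batch_size: 128
      drop_last: true
      num_workers: 1
      shuffle: true
      persistent_workers: false   # spelling follows torch.utils.data.DataLoader
      pin_memory: false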


Training parameters
-----------------------------------------
- ``accelerator (str)`` – Specifies the hardware accelerator to use. Options in [`cpu`, `cuda`, `mps`]. Default to **cpu**.
- ``enable_checkpointing (bool)`` – Enables or disables automatic checkpoint saving during training. Default to **True**.
- ``max_epochs (int)`` – Maximum number of training epochs. Default to **600**.
- ``log_every_n_steps (int)`` – Number of steps between logging events. Default to **1**.
- ``callbacks``
    - ``ModelCheckpoint``
        - ``dirpath (str)`` – Directory path where checkpoints are saved. Example: `${__exp__.project_folder}out/models/${__exp__.name}/`.
        - ``filename (str)`` – Base name for saved checkpoint files. Default to **best**.
        - ``save_top_k (int)`` – Number of best models to keep. Default to **1**.
        - ``save_last (bool)`` – Whether to save the last checkpoint regardless of performance. Default to **True**.
        - ``monitor (str)`` – Metric to monitor for saving best checkpoints. Example: `val_NDCG_@10/dataloader_idx_0`.
        - ``mode (str)`` – Whether to maximize or minimize the monitored metric. Options in [`min`, `max`]. Default to **max**.
        - ``enable_version_counter (bool)`` – If **False**, overwrites the best model instead of creating versions. Default to **False**.
- ``logger``
    - ``name (str)`` – Logger type to use, e.g. `CSVLogger`, `WandbLogger`. Default to **CSVLogger**.
    - ``save_dir (str)`` – Directory where logs are saved. Example: `${__exp__.project_folder}out/log/${__exp__.name}/`.
    - ``version (int)`` – Logger version. If **0**, overwrites existing logs. Default to **0**.
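
The training options above can be combined as in the following sketch. The enclosing key ``trainer_params`` is a hypothetical name, and the nesting of ``callbacks`` and ``logger`` simply mirrors the list structure on this page; the paths and the monitored metric are the examples given above, not required values.

.. code-block:: yaml

    # Sketch only: `trainer_params` is a hypothetical enclosing key.
    trainer_params:
      accelerator: cpu               # one of cpu, cuda, mps
      enable_checkpointing: true
      max_epochs: 600
      log_every_n_steps: 1
      callbacks:
        ModelCheckpoint:
          dirpath: "${__exp__.project_folder}out/models/${__exp__.name}/"
          filename: best
          save_top_k: 1
          save_last: true
          monitor: "val_NDCG_@10/dataloader_idx_0"
          mode: max
          enable_version_counter: false
      logger:
        name: CSVLogger
        save_dir: "${__exp__.project_folder}out/log/${__exp__.name}/"
        version: 0                   # 0 overwrites existing logs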


Model parameters
-----------------------------------------
Common features
~~~~~~~~~~~~~~~~
- ``name (str)`` – Name of the recommender model. Options in [`BERT4Rec`, `Caser`, `CORE`, `CosRec`, `GRU4Rec`, `HGN`, `LightGCN`, `Mamba4Rec`, `NARM`, `NCF`, `SASRec`].
- ``emb_size (int)`` – Size of the item and position embeddings. Default to **64**.

BERT4Rec
~~~~~~~~~~~~~~~~
- ``bert_num_blocks (int)`` – Number of Transformer blocks in the encoder of BERT4Rec. Default to **2**.
- ``bert_num_heads (int)`` – Number of attention heads in the Transformer model of BERT4Rec. Default to **4**.
- ``dropout_rate (float)`` – Dropout rate for regularization. Default to **0.1**.

Caser
~~~~~~~~~~~~~~~~
- ``lookback (int)`` – Length of the input sequence (number of past time steps considered). Typically sourced from `${data_params.dataset_params.lookback}`.
- ``num_ver_filters (int)`` – Number of vertical convolutional filters. Default to **2**.
- ``num_hor_filters (int)`` – Number of horizontal convolutional filters. Default to **2**.
- ``act_conv (str)`` – Activation function used in the convolutional layers. Default to **Tanh**.
- ``act_fc (str)`` – Activation function used in the fully connected layers. Default to **Tanh**.
- ``drop_rate (float)`` – Dropout rate for regularization. Default to **0.5**.
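
As a sketch, a Caser model block could combine the common features with the parameters above. The enclosing key ``model`` is a hypothetical name; the ``lookback`` interpolation path is copied verbatim from the description above.

.. code-block:: yaml

    # Sketch only: `model` is a hypothetical enclosing key.
    model:
      name: Caser
      emb_size: 64
      lookback: "${data_params.dataset_params.lookback}"   # path as documented above
      num_ver_filters: 2
      num_hor_filters: 2
      act_conv: Tanh
      act_fc: Tanh
      drop_rate: 0.5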

CORE
~~~~~~~~~~~~~~~~
- ``sess_dropout_rate (float)`` – Dropout rate applied to session representations for regularization. Default to **0.2**.
- ``item_dropout_rate (float)`` – Dropout rate applied to item representations for regularization.  Default to **0.2**.

CosRec
~~~~~~~~~~~~~~~~
- ``block_dims (list of int)`` – Dimensions of convolutional or processing blocks. Default to **[128, 256]**.
- ``fc_dim (int)`` – Dimension of the fully connected layer.  Default to **150**.
- ``act_fc (str)`` – Activation function used in the fully connected layer. Default to **Tanh**.
- ``dropout_rate (float)`` – Dropout rate for regularization. Default to **0.5**.

GRU4Rec
~~~~~~~~~~~~~~~~
- ``num_layers (int)`` – Number of GRU layers. Default to **1**.
- ``dropout_hidden (float)`` – Dropout rate applied to the hidden layers of the GRU. Default to **0.0**.
- ``dropout_input (float)`` – Dropout rate applied to the input embeddings. Default to **0.2**.


HGN
~~~~~~~~~~~~~~~~
- ``lookback (int)`` – Length of the input sequence (number of past time steps considered). Typically sourced from `${data_params.dataset_params.lookback}`.

LightGCN
~~~~~~~~~~~~~~~~
- ``num_layers (int)`` – Number of graph convolution layers applied in LightGCN. Default to **1**.


Mamba4Rec
~~~~~~~~~~~~~~~~
- ``d_model (int)`` – Dimensionality of the model layers and embeddings. Default to **64**.
- ``ssm_cfg.d_model (int)`` – Dimensionality used within the SSM (State Space Model) configuration. Inherits the value from `d_model`.

NARM
~~~~~~~~~~~~~~~~
- ``hidden_size (int)`` – Number of hidden units in the GRU layer. Default to **50**.

- ``n_layers (int)`` – Number of GRU layers used in the attention-based session encoder. Default to **1**.

- ``emb_dropout (float)`` – Dropout rate applied to the input embeddings for regularization. Default to **0.25**.

- ``ct_dropout (float)`` – Dropout rate applied to the context vector or attention mechanism. Default to **0.5**.

NCF
~~~~~~~~~~~~~~~~
- ``mlp_emb_size (int)`` – Embedding size used for the MLP (Multi-Layer Perceptron) component of the model. Default to **8**.

- ``mf_emb_size (int)`` – Embedding size used for the MF (Matrix Factorization) component of the model. Default to **8**.

- ``layers (list of int)`` – List specifying the number of units in each hidden layer of the MLP. Default to **[32, 16, 8]**.

SASRec
~~~~~~~~~~~~~~~~
- ``num_blocks (int)`` – Number of transformer blocks (stacked self-attention + feed-forward layers). Default to **1**.
- ``num_heads (int)`` – Number of attention heads in the multi-head self-attention layer. Default to **1**.
- ``dropout_rate (float)`` – Dropout rate for regularization. Default to **0.2**.


Global Data Parameters
~~~~~~~~~~~~~~~~~~~~~~
These parameters declare global defaults or shared values used across the whole config structure.

- ``data_params.collator_params.lookforward (int)`` – Number of future items to look ahead when generating training instances. Default to **0**.
- ``data_params.collator_params.mask_prob (float)`` – Probability of masking each item in the sequence during training. Default to **0.15**.
- ``data_params.collator_params.keep_last.train (int)`` – Number of last interactions to keep in each training session. Default to **1**.
- ``data_params.collator_params.keep_last.val (int or null)`` – Number of last interactions to keep for validation. If `null`, no filtering is applied. Default to **null**.
- ``data_params.collator_params.keep_last.test (int or null)`` – Number of last interactions to keep for test. If `null`, all test interactions are kept. Default to **null**.
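
Expanded into nested form, the dotted paths above correspond to the structure sketched below. This assumes dotted keys and nested YAML mappings are equivalent in the config; where this block is placed (for example, inside a model's config file) is not specified on this page.

.. code-block:: yaml

    # Sketch only: nested equivalent of the dotted paths listed above.
    data_params:
      collator_params:
        lookforward: 0
        mask_prob: 0.15
        keep_last:
          train: 1
          val: null     # no filtering
          test: null    # all test interactions are kept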

Step Routing Parameters 
~~~~~~~~~~~~~~~~~~~~~~~
These parameters define how data flows through the model during training, validation and test.

- ``model_input_from_batch (list of str)`` – Specifies which keys from the input batch are passed as inputs to the model. Default to `[in_sid, out_sid]`, where `in_sid` refers to the input sequence IDs, and `out_sid` refers to the target sequence IDs.
- ``loss_input_from_model_output (dict)`` – Defines which inputs to the loss function come from the model's output or from the batch.

   - ``input: null`` indicates that the model output is directly used for loss computation without additional inputs from the batch.
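
For illustration, the two routing parameters could be written as in this sketch. It uses the default and the ``input: null`` case described above; the enclosing section of the config is not specified here, so the keys are shown without a parent.

.. code-block:: yaml

    # Sketch only: the parent key of these entries is not documented on this page.
    model_input_from_batch: [in_sid, out_sid]   # input and target sequence IDs
    loss_input_from_model_output:
      input: null    # model output is used directly by the loss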