Config YAML Format#
Using a configuration file lets you manage complex settings, version changes, and avoid long command-line arguments.
Available fields are:
- architecture: Name of the model architecture to use (e.g., - minicpm).
- model_path: HuggingFace model identifier for the pre-trained model. 
- model: List of model-specific args while initializing the model (e.g., - torch_dtype).
- forward: List of model-specific args while doing a forward pass (e.g., - max_new_tokens).
- output_db: Database name where results will be saved. 
- input_dir: Directory containing the input data. 
- prompt: Prompt for the model. 
- modules: List of model layers to extract. 
Example:
architecture: minicpm
model_path: openbmb/MiniCPM-o-2_6
model:
    - torch_dtype: auto
    - trust_remote_code: True
forward:
    - max_new_tokens: 1
output_db: minicpm-o.db
input_dir: ./data/
prompt: "Describe the color in this image in one word."
modules:
    - llm.lm_head
    - vpm.encoder.layers.26