{
  "nbformat": 4,
  "nbformat_minor": 0,
  "metadata": {
    "colab": {
      "provenance": [],
      "gpuType": "A100",
      "machine_shape": "hm",
      "runtime_attributes": {
        "runtime_version": "2025.10"
      }
    },
    "kernelspec": {
      "name": "python3",
      "display_name": "Python 3"
    },
    "language_info": {
      "name": "python"
    },
    "widgets": {
      "application/vnd.jupyter.widget-state+json": {
        "13108671770a48f2845002cf1266a8bc": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HBoxModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HBoxModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HBoxView",
            "box_style": "",
            "children": [
              "IPY_MODEL_b7d0d8e2ab184ad3a898f67925da5ea4",
              "IPY_MODEL_de8b6bac965341ecaed3d58a5b18197a",
              "IPY_MODEL_a9d09c9af7094e4ca778853fdd5d3abf"
            ],
            "layout": "IPY_MODEL_7f4ddf9010184adab3c8b5e5ea788721"
          }
        },
        "b7d0d8e2ab184ad3a898f67925da5ea4": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HTMLModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HTMLModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HTMLView",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_99925c438d3f47d4adb2dabaf26dd154",
            "placeholder": "​",
            "style": "IPY_MODEL_e6d2601f57214c36a582710a88a75897",
            "value": "README.md: "
          }
        },
        "de8b6bac965341ecaed3d58a5b18197a": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "FloatProgressModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "FloatProgressModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "ProgressView",
            "bar_style": "success",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_a5be78e5dab341248b54fa5b795c0ebb",
            "max": 1,
            "min": 0,
            "orientation": "horizontal",
            "style": "IPY_MODEL_f5d3daa4f2ef4adf9ec23c81eaf50462",
            "value": 1
          }
        },
        "a9d09c9af7094e4ca778853fdd5d3abf": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HTMLModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HTMLModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HTMLView",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_9290b1c84be64d02837fafbd23494667",
            "placeholder": "​",
            "style": "IPY_MODEL_6679be70bfa74c0882d8f06504f96f10",
            "value": " 6.53k/? [00:00&lt;00:00, 716kB/s]"
          }
        },
        "7f4ddf9010184adab3c8b5e5ea788721": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "99925c438d3f47d4adb2dabaf26dd154": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "e6d2601f57214c36a582710a88a75897": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "DescriptionStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "DescriptionStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "description_width": ""
          }
        },
        "a5be78e5dab341248b54fa5b795c0ebb": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": "20px"
          }
        },
        "f5d3daa4f2ef4adf9ec23c81eaf50462": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "ProgressStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "ProgressStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "bar_color": null,
            "description_width": ""
          }
        },
        "9290b1c84be64d02837fafbd23494667": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "6679be70bfa74c0882d8f06504f96f10": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "DescriptionStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "DescriptionStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "description_width": ""
          }
        },
        "446ac66a0b3e4eb99895539fd737b957": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HBoxModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HBoxModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HBoxView",
            "box_style": "",
            "children": [
              "IPY_MODEL_87d1baf5e1864213b337a0549631f0ef",
              "IPY_MODEL_287af03283e94118849aa63714ac8a5e",
              "IPY_MODEL_0b45f6deec354f60addf42e82575207f"
            ],
            "layout": "IPY_MODEL_b545a5ae45b74ed9a474448348deede3"
          }
        },
        "87d1baf5e1864213b337a0549631f0ef": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HTMLModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HTMLModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HTMLView",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_9491b3cb892d4973b3d7fa61b840ea72",
            "placeholder": "​",
            "style": "IPY_MODEL_53fe29f82dc24193afc2ecd71cae2120",
            "value": "data/train_prefs-00000-of-00001.parquet: 100%"
          }
        },
        "287af03283e94118849aa63714ac8a5e": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "FloatProgressModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "FloatProgressModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "ProgressView",
            "bar_style": "success",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_3d9a5c133a7b4fb0afd9e7f0b462cb19",
            "max": 225891836,
            "min": 0,
            "orientation": "horizontal",
            "style": "IPY_MODEL_5f162b3b48324e4ab002c4529efb724c",
            "value": 225891836
          }
        },
        "0b45f6deec354f60addf42e82575207f": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HTMLModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HTMLModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HTMLView",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_d00daa3cfb8a4139bdcfa34086417d2b",
            "placeholder": "​",
            "style": "IPY_MODEL_5cb05ceaf70d4e2294111a5c63ee9ce8",
            "value": " 226M/226M [00:01&lt;00:00, 85.2MB/s]"
          }
        },
        "b545a5ae45b74ed9a474448348deede3": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "9491b3cb892d4973b3d7fa61b840ea72": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "53fe29f82dc24193afc2ecd71cae2120": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "DescriptionStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "DescriptionStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "description_width": ""
          }
        },
        "3d9a5c133a7b4fb0afd9e7f0b462cb19": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "5f162b3b48324e4ab002c4529efb724c": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "ProgressStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "ProgressStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "bar_color": null,
            "description_width": ""
          }
        },
        "d00daa3cfb8a4139bdcfa34086417d2b": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "5cb05ceaf70d4e2294111a5c63ee9ce8": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "DescriptionStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "DescriptionStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "description_width": ""
          }
        },
        "0c747ec97cd44d4eaf76f3be3f149d95": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HBoxModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HBoxModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HBoxView",
            "box_style": "",
            "children": [
              "IPY_MODEL_1e8a3646bbd74941a673c47c89a14299",
              "IPY_MODEL_72527779b0e74c0793216559d183948e",
              "IPY_MODEL_1142155a058549918e1cda6bae14e7d3"
            ],
            "layout": "IPY_MODEL_930a6c569d754980ab302af5d66fab14"
          }
        },
        "1e8a3646bbd74941a673c47c89a14299": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HTMLModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HTMLModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HTMLView",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_61e4ffb63fc64feb97631de2985e0083",
            "placeholder": "​",
            "style": "IPY_MODEL_1dde89ead58f4b54a7291fe62730e354",
            "value": "data/test_prefs-00000-of-00001.parquet: 100%"
          }
        },
        "72527779b0e74c0793216559d183948e": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "FloatProgressModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "FloatProgressModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "ProgressView",
            "bar_style": "success",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_00f8e62e5e2946ecab6718e4a3689843",
            "max": 7291160,
            "min": 0,
            "orientation": "horizontal",
            "style": "IPY_MODEL_49e313d72d72443d872b5e36c1fec030",
            "value": 7291160
          }
        },
        "1142155a058549918e1cda6bae14e7d3": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HTMLModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HTMLModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HTMLView",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_4daabc6a9d764b198057d4963c3f1fa4",
            "placeholder": "​",
            "style": "IPY_MODEL_b9f01ac64d434332954bcaa8640edf1e",
            "value": " 7.29M/7.29M [00:00&lt;00:00, 17.9MB/s]"
          }
        },
        "930a6c569d754980ab302af5d66fab14": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "61e4ffb63fc64feb97631de2985e0083": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "1dde89ead58f4b54a7291fe62730e354": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "DescriptionStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "DescriptionStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "description_width": ""
          }
        },
        "00f8e62e5e2946ecab6718e4a3689843": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "49e313d72d72443d872b5e36c1fec030": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "ProgressStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "ProgressStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "bar_color": null,
            "description_width": ""
          }
        },
        "4daabc6a9d764b198057d4963c3f1fa4": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "b9f01ac64d434332954bcaa8640edf1e": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "DescriptionStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "DescriptionStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "description_width": ""
          }
        },
        "364de2f832d04f469c976dfec1c8bc77": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HBoxModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HBoxModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HBoxView",
            "box_style": "",
            "children": [
              "IPY_MODEL_3407707840e546f3bc307a4aa408c85e",
              "IPY_MODEL_74c3624d5d6d4679979bf1d49f1727a5",
              "IPY_MODEL_693dd8c6234b41a182d19200400fea3b"
            ],
            "layout": "IPY_MODEL_2d5938d885374006b8439a805f9af23b"
          }
        },
        "3407707840e546f3bc307a4aa408c85e": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HTMLModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HTMLModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HTMLView",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_3dc52e3745894c3a9f379d1fd14effc1",
            "placeholder": "​",
            "style": "IPY_MODEL_0f9156837eaf4ad6a682d71ed3ab7675",
            "value": "data/test_sft-00000-of-00001.parquet: 100%"
          }
        },
        "74c3624d5d6d4679979bf1d49f1727a5": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "FloatProgressModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "FloatProgressModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "ProgressView",
            "bar_style": "success",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_15bfe4a79c45498aa2c1a5939c4d7a76",
            "max": 3719085,
            "min": 0,
            "orientation": "horizontal",
            "style": "IPY_MODEL_481edb7676bb4567bab24935dac96561",
            "value": 3719085
          }
        },
        "693dd8c6234b41a182d19200400fea3b": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HTMLModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HTMLModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HTMLView",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_00e97a8431a54125a7f4cb7a409092c2",
            "placeholder": "​",
            "style": "IPY_MODEL_7d05796c5d55464fb7b0d9e6f34af004",
            "value": " 3.72M/3.72M [00:00&lt;00:00, 13.0MB/s]"
          }
        },
        "2d5938d885374006b8439a805f9af23b": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "3dc52e3745894c3a9f379d1fd14effc1": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "0f9156837eaf4ad6a682d71ed3ab7675": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "DescriptionStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "DescriptionStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "description_width": ""
          }
        },
        "15bfe4a79c45498aa2c1a5939c4d7a76": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "481edb7676bb4567bab24935dac96561": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "ProgressStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "ProgressStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "bar_color": null,
            "description_width": ""
          }
        },
        "00e97a8431a54125a7f4cb7a409092c2": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "7d05796c5d55464fb7b0d9e6f34af004": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "DescriptionStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "DescriptionStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "description_width": ""
          }
        },
        "31f3d4b5efe84cd3aa93cc5d5136a49f": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HBoxModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HBoxModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HBoxView",
            "box_style": "",
            "children": [
              "IPY_MODEL_bb461df4585b488e894945eab6b2a2d9",
              "IPY_MODEL_23f8dd5d9b004b228de785269933ba57",
              "IPY_MODEL_8a94de67743d444e88c3932882da471a"
            ],
            "layout": "IPY_MODEL_4d203b11091c4f34833cfdeb8023552c"
          }
        },
        "bb461df4585b488e894945eab6b2a2d9": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HTMLModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HTMLModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HTMLView",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_8981db57167d46eeb49b6c3c87bb4cf6",
            "placeholder": "​",
            "style": "IPY_MODEL_022d8ba2082c4dcd8e4f645694c485b8",
            "value": "data/train_gen-00000-of-00001.parquet: 100%"
          }
        },
        "23f8dd5d9b004b228de785269933ba57": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "FloatProgressModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "FloatProgressModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "ProgressView",
            "bar_style": "success",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_0a581db9a8b84aa8807b54a2c4f3641a",
            "max": 184150527,
            "min": 0,
            "orientation": "horizontal",
            "style": "IPY_MODEL_9871afe542704e079baa9412166273a7",
            "value": 184150527
          }
        },
        "8a94de67743d444e88c3932882da471a": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HTMLModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HTMLModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HTMLView",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_17e4bbea4d354c7a83d9bff935b28ebd",
            "placeholder": "​",
            "style": "IPY_MODEL_4155781baa954b63922b74505b80df7f",
            "value": " 184M/184M [00:01&lt;00:00, 120MB/s]"
          }
        },
        "4d203b11091c4f34833cfdeb8023552c": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "8981db57167d46eeb49b6c3c87bb4cf6": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "022d8ba2082c4dcd8e4f645694c485b8": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "DescriptionStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "DescriptionStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "description_width": ""
          }
        },
        "0a581db9a8b84aa8807b54a2c4f3641a": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "9871afe542704e079baa9412166273a7": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "ProgressStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "ProgressStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "bar_color": null,
            "description_width": ""
          }
        },
        "17e4bbea4d354c7a83d9bff935b28ebd": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "4155781baa954b63922b74505b80df7f": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "DescriptionStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "DescriptionStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "description_width": ""
          }
        },
        "5cb21c2b65cd4be19eeae5294067d9fa": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HBoxModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HBoxModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HBoxView",
            "box_style": "",
            "children": [
              "IPY_MODEL_f36f4ac50b754eae82eb9505a45fc74a",
              "IPY_MODEL_ba250a4e315a4b22bc7602dd837e4965",
              "IPY_MODEL_984ef6e047da477783564196e680651f"
            ],
            "layout": "IPY_MODEL_29e3506480464c6e9fe28ced4dcb4470"
          }
        },
        "f36f4ac50b754eae82eb9505a45fc74a": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HTMLModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HTMLModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HTMLView",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_1ad0bd252e864504a713d288a7b3a1dd",
            "placeholder": "​",
            "style": "IPY_MODEL_3214ea636cef4a8ab910ba735dbb4e73",
            "value": "data/test_gen-00000-of-00001.parquet: 100%"
          }
        },
        "ba250a4e315a4b22bc7602dd837e4965": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "FloatProgressModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "FloatProgressModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "ProgressView",
            "bar_style": "success",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_4cf706059a86482d8f14e76c0052098c",
            "max": 3022752,
            "min": 0,
            "orientation": "horizontal",
            "style": "IPY_MODEL_42fdd2717ed6452c81c4abfb69509f6f",
            "value": 3022752
          }
        },
        "984ef6e047da477783564196e680651f": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HTMLModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HTMLModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HTMLView",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_a505f4c6f0204b7fa2abb839aa6566d3",
            "placeholder": "​",
            "style": "IPY_MODEL_52d72c882d2c41aa8bfcd712d90351de",
            "value": " 3.02M/3.02M [00:00&lt;00:00, 7.76MB/s]"
          }
        },
        "29e3506480464c6e9fe28ced4dcb4470": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "1ad0bd252e864504a713d288a7b3a1dd": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "3214ea636cef4a8ab910ba735dbb4e73": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "DescriptionStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "DescriptionStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "description_width": ""
          }
        },
        "4cf706059a86482d8f14e76c0052098c": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "42fdd2717ed6452c81c4abfb69509f6f": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "ProgressStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "ProgressStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "bar_color": null,
            "description_width": ""
          }
        },
        "a505f4c6f0204b7fa2abb839aa6566d3": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "52d72c882d2c41aa8bfcd712d90351de": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "DescriptionStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "DescriptionStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "description_width": ""
          }
        },
        "34b1f7f0eb38490daee4de107a68b004": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HBoxModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HBoxModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HBoxView",
            "box_style": "",
            "children": [
              "IPY_MODEL_a3c30a1cc8704659ba0a91fab8514d2a",
              "IPY_MODEL_8e5034da30314ba888296d99cdf7026e",
              "IPY_MODEL_9e9cb552d37a48cfb3f72b6fb20d50ce"
            ],
            "layout": "IPY_MODEL_ff9910d1159642438c3f9ea3ee80c795"
          }
        },
        "a3c30a1cc8704659ba0a91fab8514d2a": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HTMLModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HTMLModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HTMLView",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_68a419fb42ce4749954d4ad92b5d708e",
            "placeholder": "​",
            "style": "IPY_MODEL_53a6315e1f1d4dc69237dbe502ec4794",
            "value": "Generating train_prefs split: 100%"
          }
        },
        "8e5034da30314ba888296d99cdf7026e": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "FloatProgressModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "FloatProgressModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "ProgressView",
            "bar_style": "success",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_0cbff68815684305813797090ca5b4e6",
            "max": 61135,
            "min": 0,
            "orientation": "horizontal",
            "style": "IPY_MODEL_f076940bac52443d9c0e0985193f1809",
            "value": 61135
          }
        },
        "9e9cb552d37a48cfb3f72b6fb20d50ce": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HTMLModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HTMLModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HTMLView",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_d7823b632fbb41668671faf5cd1b42b2",
            "placeholder": "​",
            "style": "IPY_MODEL_9636be2683de4a178cab3b84052ec2d2",
            "value": " 61135/61135 [00:00&lt;00:00, 97872.95 examples/s]"
          }
        },
        "ff9910d1159642438c3f9ea3ee80c795": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "68a419fb42ce4749954d4ad92b5d708e": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "53a6315e1f1d4dc69237dbe502ec4794": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "DescriptionStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "DescriptionStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "description_width": ""
          }
        },
        "0cbff68815684305813797090ca5b4e6": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "f076940bac52443d9c0e0985193f1809": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "ProgressStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "ProgressStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "bar_color": null,
            "description_width": ""
          }
        },
        "d7823b632fbb41668671faf5cd1b42b2": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "9636be2683de4a178cab3b84052ec2d2": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "DescriptionStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "DescriptionStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "description_width": ""
          }
        },
        "c9a0a1e5be184425a61967e0cb214e99": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HBoxModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HBoxModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HBoxView",
            "box_style": "",
            "children": [
              "IPY_MODEL_c3f2232471074209bb1b71ee224355c4",
              "IPY_MODEL_911f294cef174cefa20fd38733387e01",
              "IPY_MODEL_e6ccadb7729241f4b03b50e9fa577051"
            ],
            "layout": "IPY_MODEL_7143cdb321af48e7846ac92cc502f9da"
          }
        },
        "c3f2232471074209bb1b71ee224355c4": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HTMLModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HTMLModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HTMLView",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_be5b6456d9204542aca31391eb86f213",
            "placeholder": "​",
            "style": "IPY_MODEL_71b27796b4b6449b98a331a0c1a8d324",
            "value": "Generating train_sft split: 100%"
          }
        },
        "911f294cef174cefa20fd38733387e01": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "FloatProgressModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "FloatProgressModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "ProgressView",
            "bar_style": "success",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_ab0725a08bca49bb9f2f0fabc26f7d12",
            "max": 61135,
            "min": 0,
            "orientation": "horizontal",
            "style": "IPY_MODEL_53d1635b7b31499780418a55150bae7c",
            "value": 61135
          }
        },
        "e6ccadb7729241f4b03b50e9fa577051": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HTMLModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HTMLModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HTMLView",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_d81bd19077fc4f8eabc26bcb53c47493",
            "placeholder": "​",
            "style": "IPY_MODEL_920405b718e94f648b012cf3f67ed13e",
            "value": " 61135/61135 [00:00&lt;00:00, 91657.32 examples/s]"
          }
        },
        "7143cdb321af48e7846ac92cc502f9da": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "be5b6456d9204542aca31391eb86f213": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "71b27796b4b6449b98a331a0c1a8d324": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "DescriptionStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "DescriptionStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "description_width": ""
          }
        },
        "ab0725a08bca49bb9f2f0fabc26f7d12": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "53d1635b7b31499780418a55150bae7c": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "ProgressStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "ProgressStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "bar_color": null,
            "description_width": ""
          }
        },
        "d81bd19077fc4f8eabc26bcb53c47493": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "920405b718e94f648b012cf3f67ed13e": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "DescriptionStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "DescriptionStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "description_width": ""
          }
        },
        "14b7bf5301904637a1f6a87c45dbf20c": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HBoxModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HBoxModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HBoxView",
            "box_style": "",
            "children": [
              "IPY_MODEL_ea6e36290bcd4fb681d2d76872570a89",
              "IPY_MODEL_e0e6aaf6e23740b18ceaf39034414b3d",
              "IPY_MODEL_afdb2b1b8ce9419dae45486b41a77bde"
            ],
            "layout": "IPY_MODEL_1d1f3bc6814e46f78c1bf10b442ef47c"
          }
        },
        "ea6e36290bcd4fb681d2d76872570a89": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HTMLModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HTMLModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HTMLView",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_980d690487a04885956279ab512a71e5",
            "placeholder": "​",
            "style": "IPY_MODEL_58a24c26c8c943418b530a8f198296a0",
            "value": "Generating test_prefs split: 100%"
          }
        },
        "e0e6aaf6e23740b18ceaf39034414b3d": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "FloatProgressModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "FloatProgressModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "ProgressView",
            "bar_style": "success",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_6d685263df3e4c2b9115686e8805c94b",
            "max": 2000,
            "min": 0,
            "orientation": "horizontal",
            "style": "IPY_MODEL_00788922afd54da499548e956dc291a4",
            "value": 2000
          }
        },
        "afdb2b1b8ce9419dae45486b41a77bde": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HTMLModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HTMLModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HTMLView",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_80b49175f5e74601b676501bb695f891",
            "placeholder": "​",
            "style": "IPY_MODEL_ed538e2d206c4df7a15f00d750744649",
            "value": " 2000/2000 [00:00&lt;00:00, 47418.98 examples/s]"
          }
        },
        "1d1f3bc6814e46f78c1bf10b442ef47c": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "980d690487a04885956279ab512a71e5": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "58a24c26c8c943418b530a8f198296a0": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "DescriptionStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "DescriptionStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "description_width": ""
          }
        },
        "6d685263df3e4c2b9115686e8805c94b": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "00788922afd54da499548e956dc291a4": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "ProgressStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "ProgressStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "bar_color": null,
            "description_width": ""
          }
        },
        "80b49175f5e74601b676501bb695f891": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "ed538e2d206c4df7a15f00d750744649": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "DescriptionStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "DescriptionStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "description_width": ""
          }
        },
        "fc0b15c645ea44718576826caea5c40d": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HBoxModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HBoxModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HBoxView",
            "box_style": "",
            "children": [
              "IPY_MODEL_bbcd75258e594f0980dd23520e5c5e84",
              "IPY_MODEL_15247d5f53e44604a905c63ac739ae5d",
              "IPY_MODEL_822db8b787f044d3a11174a872b9152d"
            ],
            "layout": "IPY_MODEL_03a65a13a43a4816adcd0ea754286c27"
          }
        },
        "bbcd75258e594f0980dd23520e5c5e84": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HTMLModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HTMLModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HTMLView",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_668d87baf56b470996d5203a1e1cf5a7",
            "placeholder": "​",
            "style": "IPY_MODEL_f4c83a5928d743858c3599d5cd378f9d",
            "value": "Generating test_sft split: 100%"
          }
        },
        "15247d5f53e44604a905c63ac739ae5d": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "FloatProgressModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "FloatProgressModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "ProgressView",
            "bar_style": "success",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_11112ff5ac5342aebbac93ca66e85ece",
            "max": 1000,
            "min": 0,
            "orientation": "horizontal",
            "style": "IPY_MODEL_70014214caa64d0eaa00dd422068070c",
            "value": 1000
          }
        },
        "822db8b787f044d3a11174a872b9152d": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HTMLModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HTMLModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HTMLView",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_08f382768cdb4e24a93fa9d8cacf7c34",
            "placeholder": "​",
            "style": "IPY_MODEL_f150a6cd40a245d0af7226d4926354b8",
            "value": " 1000/1000 [00:00&lt;00:00, 37939.99 examples/s]"
          }
        },
        "03a65a13a43a4816adcd0ea754286c27": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "668d87baf56b470996d5203a1e1cf5a7": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "f4c83a5928d743858c3599d5cd378f9d": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "DescriptionStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "DescriptionStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "description_width": ""
          }
        },
        "11112ff5ac5342aebbac93ca66e85ece": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "70014214caa64d0eaa00dd422068070c": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "ProgressStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "ProgressStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "bar_color": null,
            "description_width": ""
          }
        },
        "08f382768cdb4e24a93fa9d8cacf7c34": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "f150a6cd40a245d0af7226d4926354b8": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "DescriptionStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "DescriptionStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "description_width": ""
          }
        },
        "4e9ff50a08464fc5b47624598533e946": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HBoxModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HBoxModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HBoxView",
            "box_style": "",
            "children": [
              "IPY_MODEL_e3444ecd5144410490f8935169878b07",
              "IPY_MODEL_ff848df22e544104a60e4e91015c911e",
              "IPY_MODEL_e985fe7fe3ed46f09c4d6bc7d21f533d"
            ],
            "layout": "IPY_MODEL_4d12045f3e9b4dcca22f82c110f9fe71"
          }
        },
        "e3444ecd5144410490f8935169878b07": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HTMLModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HTMLModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HTMLView",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_cb42a96a37df42e8a1594cff22ac235c",
            "placeholder": "​",
            "style": "IPY_MODEL_ddc56d19906e44ef8784be26815171ac",
            "value": "Generating train_gen split: 100%"
          }
        },
        "ff848df22e544104a60e4e91015c911e": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "FloatProgressModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "FloatProgressModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "ProgressView",
            "bar_style": "success",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_dfce9dfa229246648f08f0df5eea3eb2",
            "max": 61135,
            "min": 0,
            "orientation": "horizontal",
            "style": "IPY_MODEL_c28734ed52184fb2b0b5d5c059f36b9e",
            "value": 61135
          }
        },
        "e985fe7fe3ed46f09c4d6bc7d21f533d": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HTMLModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HTMLModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HTMLView",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_e8393dadd92f40d6ab66c89b0fe54b38",
            "placeholder": "​",
            "style": "IPY_MODEL_56352cfbee78402badd6514f0450c35a",
            "value": " 61135/61135 [00:00&lt;00:00, 105908.59 examples/s]"
          }
        },
        "4d12045f3e9b4dcca22f82c110f9fe71": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "cb42a96a37df42e8a1594cff22ac235c": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "ddc56d19906e44ef8784be26815171ac": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "DescriptionStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "DescriptionStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "description_width": ""
          }
        },
        "dfce9dfa229246648f08f0df5eea3eb2": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "c28734ed52184fb2b0b5d5c059f36b9e": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "ProgressStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "ProgressStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "bar_color": null,
            "description_width": ""
          }
        },
        "e8393dadd92f40d6ab66c89b0fe54b38": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "56352cfbee78402badd6514f0450c35a": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "DescriptionStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "DescriptionStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "description_width": ""
          }
        },
        "f791f63fa3c44453bdc3db763c442146": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HBoxModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HBoxModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HBoxView",
            "box_style": "",
            "children": [
              "IPY_MODEL_da2fc3073d6a4e26ad636c36155ca7af",
              "IPY_MODEL_f994deb71884481faba235fcad3d4447",
              "IPY_MODEL_c40d177887ef43b280e9e019cfd4129d"
            ],
            "layout": "IPY_MODEL_85913da5193b4675b1ab7f6ac9cc4f0e"
          }
        },
        "da2fc3073d6a4e26ad636c36155ca7af": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HTMLModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HTMLModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HTMLView",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_8e41763bf8e541a487227cc3c0c080a3",
            "placeholder": "​",
            "style": "IPY_MODEL_d5164cf2e9954ff4bf219c38cc0fc194",
            "value": "Generating test_gen split: 100%"
          }
        },
        "f994deb71884481faba235fcad3d4447": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "FloatProgressModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "FloatProgressModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "ProgressView",
            "bar_style": "success",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_887ed030076541afb4ffbfabdc3b9686",
            "max": 1000,
            "min": 0,
            "orientation": "horizontal",
            "style": "IPY_MODEL_b0b1508adf6d4592b9d2aaee93d81e9b",
            "value": 1000
          }
        },
        "c40d177887ef43b280e9e019cfd4129d": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HTMLModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HTMLModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HTMLView",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_06793fe32ba24aae89e12e9efb05fdb8",
            "placeholder": "​",
            "style": "IPY_MODEL_14341a1bb8c44eeaa3f22ea2ea49818e",
            "value": " 1000/1000 [00:00&lt;00:00, 36059.56 examples/s]"
          }
        },
        "85913da5193b4675b1ab7f6ac9cc4f0e": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "8e41763bf8e541a487227cc3c0c080a3": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "d5164cf2e9954ff4bf219c38cc0fc194": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "DescriptionStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "DescriptionStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "description_width": ""
          }
        },
        "887ed030076541afb4ffbfabdc3b9686": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "b0b1508adf6d4592b9d2aaee93d81e9b": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "ProgressStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "ProgressStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "bar_color": null,
            "description_width": ""
          }
        },
        "06793fe32ba24aae89e12e9efb05fdb8": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "14341a1bb8c44eeaa3f22ea2ea49818e": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "DescriptionStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "DescriptionStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "description_width": ""
          }
        },
        "e78ce1ace7494a2baaf729f2982726e2": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HBoxModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HBoxModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HBoxView",
            "box_style": "",
            "children": [
              "IPY_MODEL_e7e42b5be0d04ea88f9b8a1e26b9e6ce",
              "IPY_MODEL_e2d17808fde449eb86b466f06d4fc86c",
              "IPY_MODEL_cbc89cfab188436ca192a6952784b78e"
            ],
            "layout": "IPY_MODEL_8b9ab81249ec41ac9454fd5cddc8c09e"
          }
        },
        "e7e42b5be0d04ea88f9b8a1e26b9e6ce": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HTMLModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HTMLModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HTMLView",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_98baa29c80384c2a895c214159e50290",
            "placeholder": "​",
            "style": "IPY_MODEL_62ce0e28999e467fa9c02e9a6ba5dd42",
            "value": "tokenizer_config.json: "
          }
        },
        "e2d17808fde449eb86b466f06d4fc86c": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "FloatProgressModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "FloatProgressModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "ProgressView",
            "bar_style": "success",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_674b4033daf14e8985462695948c2265",
            "max": 1,
            "min": 0,
            "orientation": "horizontal",
            "style": "IPY_MODEL_c8fbdaff67ee40dea12374981ebf3ad8",
            "value": 1
          }
        },
        "cbc89cfab188436ca192a6952784b78e": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HTMLModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HTMLModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HTMLView",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_9b241624f46a4fce8f505a7c0bd124dd",
            "placeholder": "​",
            "style": "IPY_MODEL_a6c05af812ec490eaceb205ef5a2a9cd",
            "value": " 1.39k/? [00:00&lt;00:00, 160kB/s]"
          }
        },
        "8b9ab81249ec41ac9454fd5cddc8c09e": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "98baa29c80384c2a895c214159e50290": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "62ce0e28999e467fa9c02e9a6ba5dd42": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "DescriptionStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "DescriptionStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "description_width": ""
          }
        },
        "674b4033daf14e8985462695948c2265": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": "20px"
          }
        },
        "c8fbdaff67ee40dea12374981ebf3ad8": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "ProgressStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "ProgressStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "bar_color": null,
            "description_width": ""
          }
        },
        "9b241624f46a4fce8f505a7c0bd124dd": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "a6c05af812ec490eaceb205ef5a2a9cd": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "DescriptionStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "DescriptionStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "description_width": ""
          }
        },
        "2ae9fc32352f4fc99176019a256ae96d": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HBoxModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HBoxModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HBoxView",
            "box_style": "",
            "children": [
              "IPY_MODEL_465fcedb38b64256971ed627b9b045c3",
              "IPY_MODEL_4957311e71f049aa8190daf073f18324",
              "IPY_MODEL_93bf22d6030741608f8b531ce179609f"
            ],
            "layout": "IPY_MODEL_a19e1fd23a874b26ab553dfe00e55fe5"
          }
        },
        "465fcedb38b64256971ed627b9b045c3": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HTMLModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HTMLModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HTMLView",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_f6aad3021e164ac79ebc84bdee0a0cad",
            "placeholder": "​",
            "style": "IPY_MODEL_56cd9cb2d5c04639ba31f09c077c92cd",
            "value": "tokenizer.json: "
          }
        },
        "4957311e71f049aa8190daf073f18324": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "FloatProgressModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "FloatProgressModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "ProgressView",
            "bar_style": "success",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_9ae97cfaa8694772ab1f2b38108c57a1",
            "max": 1,
            "min": 0,
            "orientation": "horizontal",
            "style": "IPY_MODEL_65031a9851464765a6d8954392ba7304",
            "value": 1
          }
        },
        "93bf22d6030741608f8b531ce179609f": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HTMLModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HTMLModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HTMLView",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_f999272c98d34f33b04ccc220ec5312b",
            "placeholder": "​",
            "style": "IPY_MODEL_08a08eb3c7be47a69a23620faae84d0a",
            "value": " 1.80M/? [00:00&lt;00:00, 22.0MB/s]"
          }
        },
        "a19e1fd23a874b26ab553dfe00e55fe5": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "f6aad3021e164ac79ebc84bdee0a0cad": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "56cd9cb2d5c04639ba31f09c077c92cd": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "DescriptionStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "DescriptionStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "description_width": ""
          }
        },
        "9ae97cfaa8694772ab1f2b38108c57a1": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": "20px"
          }
        },
        "65031a9851464765a6d8954392ba7304": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "ProgressStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "ProgressStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "bar_color": null,
            "description_width": ""
          }
        },
        "f999272c98d34f33b04ccc220ec5312b": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "08a08eb3c7be47a69a23620faae84d0a": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "DescriptionStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "DescriptionStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "description_width": ""
          }
        },
        "85565882d4754de09badbd01dd454093": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HBoxModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HBoxModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HBoxView",
            "box_style": "",
            "children": [
              "IPY_MODEL_5bc9f968ef224473aae35e9a9224b442",
              "IPY_MODEL_67b84eca57db424a9761da7b66f9bf76",
              "IPY_MODEL_2ea5486d39bf4bafaf668735c193c767"
            ],
            "layout": "IPY_MODEL_aed6cb9ea6d64c7bb5bce8167eb8ded1"
          }
        },
        "5bc9f968ef224473aae35e9a9224b442": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HTMLModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HTMLModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HTMLView",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_1b49266b06f340ba90bcd7084e34ef08",
            "placeholder": "​",
            "style": "IPY_MODEL_a2d2c920a97247e6b8ec4be8de619aa4",
            "value": "special_tokens_map.json: 100%"
          }
        },
        "67b84eca57db424a9761da7b66f9bf76": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "FloatProgressModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "FloatProgressModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "ProgressView",
            "bar_style": "success",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_bde03c4de0954cd598738cba06234084",
            "max": 437,
            "min": 0,
            "orientation": "horizontal",
            "style": "IPY_MODEL_bc8b9617825c4e22bec32b3ebeef8c5a",
            "value": 437
          }
        },
        "2ea5486d39bf4bafaf668735c193c767": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HTMLModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HTMLModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HTMLView",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_9a9e6fea9fb44d62af3f8652358c61ca",
            "placeholder": "​",
            "style": "IPY_MODEL_84d7e67c10814842bec76905e6b8826a",
            "value": " 437/437 [00:00&lt;00:00, 57.0kB/s]"
          }
        },
        "aed6cb9ea6d64c7bb5bce8167eb8ded1": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "1b49266b06f340ba90bcd7084e34ef08": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "a2d2c920a97247e6b8ec4be8de619aa4": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "DescriptionStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "DescriptionStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "description_width": ""
          }
        },
        "bde03c4de0954cd598738cba06234084": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "bc8b9617825c4e22bec32b3ebeef8c5a": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "ProgressStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "ProgressStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "bar_color": null,
            "description_width": ""
          }
        },
        "9a9e6fea9fb44d62af3f8652358c61ca": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "84d7e67c10814842bec76905e6b8826a": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "DescriptionStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "DescriptionStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "description_width": ""
          }
        },
        "e57465d0f7bc4d08b8be98decf0daf67": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HBoxModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HBoxModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HBoxView",
            "box_style": "",
            "children": [
              "IPY_MODEL_651b79f339f74e1ea3b03b36b0dc8488",
              "IPY_MODEL_b0c2d18dd8b04c2c8ee03f4b051ffc90",
              "IPY_MODEL_d7083d18132e4abb992f6000b267f708"
            ],
            "layout": "IPY_MODEL_ecacd05d0ae04c7d89c66a26239cc389"
          }
        },
        "651b79f339f74e1ea3b03b36b0dc8488": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HTMLModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HTMLModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HTMLView",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_df2160c1736849bf875acbf0cdc3e88d",
            "placeholder": "​",
            "style": "IPY_MODEL_1ddd044b4d144c878d31c8ce0ddf6f64",
            "value": "Formatting comparisons with prompt template (num_proc=12): 100%"
          }
        },
        "b0c2d18dd8b04c2c8ee03f4b051ffc90": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "FloatProgressModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "FloatProgressModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "ProgressView",
            "bar_style": "success",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_bc78608bdff94604a6d6193f31985a10",
            "max": 100,
            "min": 0,
            "orientation": "horizontal",
            "style": "IPY_MODEL_5bd9c1b6a616477d9af2e22dc382267e",
            "value": 100
          }
        },
        "d7083d18132e4abb992f6000b267f708": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HTMLModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HTMLModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HTMLView",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_f01a01edbbe54c7b92ec63d68f35ab0c",
            "placeholder": "​",
            "style": "IPY_MODEL_f423758b571e41969885535afd074743",
            "value": " 100/100 [00:00&lt;00:00, 117.07 examples/s]"
          }
        },
        "ecacd05d0ae04c7d89c66a26239cc389": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "df2160c1736849bf875acbf0cdc3e88d": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "1ddd044b4d144c878d31c8ce0ddf6f64": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "DescriptionStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "DescriptionStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "description_width": ""
          }
        },
        "bc78608bdff94604a6d6193f31985a10": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "5bd9c1b6a616477d9af2e22dc382267e": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "ProgressStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "ProgressStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "bar_color": null,
            "description_width": ""
          }
        },
        "f01a01edbbe54c7b92ec63d68f35ab0c": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "f423758b571e41969885535afd074743": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "DescriptionStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "DescriptionStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "description_width": ""
          }
        },
        "9680c7cfcfa648dc84f14add68a3c3d0": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HBoxModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HBoxModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HBoxView",
            "box_style": "",
            "children": [
              "IPY_MODEL_bb66702a4c0f4601bea7366f7e437123",
              "IPY_MODEL_a0828e101b89428da028479d8953130e",
              "IPY_MODEL_991ae88290384983a11d0328b710550a"
            ],
            "layout": "IPY_MODEL_46502f994a114f908462ced79e7948bb"
          }
        },
        "bb66702a4c0f4601bea7366f7e437123": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HTMLModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HTMLModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HTMLView",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_a788cb5f3a1049cea0cf3df7e399a3a5",
            "placeholder": "​",
            "style": "IPY_MODEL_e49e25dc58514c8d919d46de07ac9d2f",
            "value": "Formatting comparisons with prompt template (num_proc=12): 100%"
          }
        },
        "a0828e101b89428da028479d8953130e": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "FloatProgressModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "FloatProgressModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "ProgressView",
            "bar_style": "success",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_4bddc35850814c19a0dd82248aaf409f",
            "max": 100,
            "min": 0,
            "orientation": "horizontal",
            "style": "IPY_MODEL_410445cd3527467d918e7dda4f1dcd75",
            "value": 100
          }
        },
        "991ae88290384983a11d0328b710550a": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HTMLModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HTMLModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HTMLView",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_d45d2cf8129b42bcb0d0598fcd6dd48b",
            "placeholder": "​",
            "style": "IPY_MODEL_8d6a411fe2b7487280c69a34d280b63c",
            "value": " 100/100 [00:00&lt;00:00, 134.14 examples/s]"
          }
        },
        "46502f994a114f908462ced79e7948bb": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "a788cb5f3a1049cea0cf3df7e399a3a5": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "e49e25dc58514c8d919d46de07ac9d2f": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "DescriptionStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "DescriptionStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "description_width": ""
          }
        },
        "4bddc35850814c19a0dd82248aaf409f": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "410445cd3527467d918e7dda4f1dcd75": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "ProgressStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "ProgressStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "bar_color": null,
            "description_width": ""
          }
        },
        "d45d2cf8129b42bcb0d0598fcd6dd48b": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "8d6a411fe2b7487280c69a34d280b63c": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "DescriptionStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "DescriptionStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "description_width": ""
          }
        },
        "eccb3bf6c9974493b70fcff8d0c8f925": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HBoxModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HBoxModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HBoxView",
            "box_style": "",
            "children": [
              "IPY_MODEL_c925eaa53a8544adb0aff919aeb45969",
              "IPY_MODEL_da41442d79d24b85b99a10d1881a945f",
              "IPY_MODEL_301e17a587b949e284d35f6674cb92d6"
            ],
            "layout": "IPY_MODEL_8ee3575d581b45bf864fe1908c098a56"
          }
        },
        "c925eaa53a8544adb0aff919aeb45969": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HTMLModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HTMLModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HTMLView",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_ea3bb6ec77744bee8969a65582257655",
            "placeholder": "​",
            "style": "IPY_MODEL_b47d79cefac44a02981d6eb191cde761",
            "value": "config.json: 100%"
          }
        },
        "da41442d79d24b85b99a10d1881a945f": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "FloatProgressModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "FloatProgressModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "ProgressView",
            "bar_style": "success",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_a0696ffff00f4b3c876730461b187381",
            "max": 571,
            "min": 0,
            "orientation": "horizontal",
            "style": "IPY_MODEL_0a608857d4034f03ad90329d343fa7b3",
            "value": 571
          }
        },
        "301e17a587b949e284d35f6674cb92d6": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HTMLModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HTMLModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HTMLView",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_a52faff113374578baadd048f33eaa5a",
            "placeholder": "​",
            "style": "IPY_MODEL_4a792cec3941439cbb2e31c4975b8092",
            "value": " 571/571 [00:00&lt;00:00, 72.2kB/s]"
          }
        },
        "8ee3575d581b45bf864fe1908c098a56": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "ea3bb6ec77744bee8969a65582257655": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "b47d79cefac44a02981d6eb191cde761": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "DescriptionStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "DescriptionStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "description_width": ""
          }
        },
        "a0696ffff00f4b3c876730461b187381": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "0a608857d4034f03ad90329d343fa7b3": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "ProgressStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "ProgressStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "bar_color": null,
            "description_width": ""
          }
        },
        "a52faff113374578baadd048f33eaa5a": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "4a792cec3941439cbb2e31c4975b8092": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "DescriptionStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "DescriptionStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "description_width": ""
          }
        },
        "16e40bf96b074fd29a93229859287c4a": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HBoxModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HBoxModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HBoxView",
            "box_style": "",
            "children": [
              "IPY_MODEL_69e179f0cb314677b3bcb98af046a9b1",
              "IPY_MODEL_c7ac3038b5ce4addaa8b4a15a3f8df90",
              "IPY_MODEL_c206ad31442f405a8c6b8dc4128d9c90"
            ],
            "layout": "IPY_MODEL_db1c25da1b7e4251ac02651a059d2174"
          }
        },
        "69e179f0cb314677b3bcb98af046a9b1": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HTMLModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HTMLModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HTMLView",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_14f3969e8f584817b79823327273e640",
            "placeholder": "​",
            "style": "IPY_MODEL_95fe0fc5fbbc4fbe9ae2d655bdfda46c",
            "value": "model.safetensors.index.json: "
          }
        },
        "c7ac3038b5ce4addaa8b4a15a3f8df90": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "FloatProgressModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "FloatProgressModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "ProgressView",
            "bar_style": "success",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_4c863b0ca3724e1b918017d06809c53a",
            "max": 1,
            "min": 0,
            "orientation": "horizontal",
            "style": "IPY_MODEL_0936e4520df440cd96c9a20cc363cbcb",
            "value": 1
          }
        },
        "c206ad31442f405a8c6b8dc4128d9c90": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HTMLModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HTMLModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HTMLView",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_4d021c53db7748e2891c7effeec14f68",
            "placeholder": "​",
            "style": "IPY_MODEL_069d0a540a854e9f9ad39bdefd74f770",
            "value": " 25.1k/? [00:00&lt;00:00, 2.50MB/s]"
          }
        },
        "db1c25da1b7e4251ac02651a059d2174": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "14f3969e8f584817b79823327273e640": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "95fe0fc5fbbc4fbe9ae2d655bdfda46c": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "DescriptionStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "DescriptionStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "description_width": ""
          }
        },
        "4c863b0ca3724e1b918017d06809c53a": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": "20px"
          }
        },
        "0936e4520df440cd96c9a20cc363cbcb": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "ProgressStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "ProgressStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "bar_color": null,
            "description_width": ""
          }
        },
        "4d021c53db7748e2891c7effeec14f68": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "069d0a540a854e9f9ad39bdefd74f770": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "DescriptionStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "DescriptionStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "description_width": ""
          }
        },
        "1f1f355636cd40bb95016434b949402d": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HBoxModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HBoxModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HBoxView",
            "box_style": "",
            "children": [
              "IPY_MODEL_12f9c206765d49ed932b7d0d7d598834",
              "IPY_MODEL_b2be0fa3dfcb489687b20040f76334af",
              "IPY_MODEL_6f346cdf04d042d880f1969cc7b5be6c"
            ],
            "layout": "IPY_MODEL_5d83c775485e47bca25470f94e2d7634"
          }
        },
        "12f9c206765d49ed932b7d0d7d598834": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HTMLModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HTMLModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HTMLView",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_914de747dd4c49fda62ec068764eb964",
            "placeholder": "​",
            "style": "IPY_MODEL_94fdd7d8cdcd4818a6532d36e8df5cf0",
            "value": "Fetching 2 files: 100%"
          }
        },
        "b2be0fa3dfcb489687b20040f76334af": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "FloatProgressModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "FloatProgressModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "ProgressView",
            "bar_style": "success",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_c15a22c358294f2bbad03c53a2ef96fc",
            "max": 2,
            "min": 0,
            "orientation": "horizontal",
            "style": "IPY_MODEL_a023a385f12d44e39295500a79c21f24",
            "value": 2
          }
        },
        "6f346cdf04d042d880f1969cc7b5be6c": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HTMLModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HTMLModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HTMLView",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_5c69002d146f4416bcd3330f331d265c",
            "placeholder": "​",
            "style": "IPY_MODEL_9438610784934da7a95aa1fe94b88152",
            "value": " 2/2 [01:11&lt;00:00, 71.99s/it]"
          }
        },
        "5d83c775485e47bca25470f94e2d7634": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "914de747dd4c49fda62ec068764eb964": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "94fdd7d8cdcd4818a6532d36e8df5cf0": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "DescriptionStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "DescriptionStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "description_width": ""
          }
        },
        "c15a22c358294f2bbad03c53a2ef96fc": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "a023a385f12d44e39295500a79c21f24": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "ProgressStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "ProgressStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "bar_color": null,
            "description_width": ""
          }
        },
        "5c69002d146f4416bcd3330f331d265c": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "9438610784934da7a95aa1fe94b88152": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "DescriptionStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "DescriptionStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "description_width": ""
          }
        },
        "6c23c836882d4fdfbea9c69576931666": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HBoxModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HBoxModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HBoxView",
            "box_style": "",
            "children": [
              "IPY_MODEL_02b89d7331f6497b93c98c76514effa8",
              "IPY_MODEL_5ffcaf25bd1c45e7a5026df9cc8e0c63",
              "IPY_MODEL_f8577fb2f907416483c05534376ecf5b"
            ],
            "layout": "IPY_MODEL_5c194a4b12934354b66f8163e7d3ff43"
          }
        },
        "02b89d7331f6497b93c98c76514effa8": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HTMLModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HTMLModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HTMLView",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_7f497a6c37204a44bc5eed0592105664",
            "placeholder": "​",
            "style": "IPY_MODEL_4ae075211608434ab16ddba8fef637c9",
            "value": "model-00002-of-00002.safetensors: 100%"
          }
        },
        "5ffcaf25bd1c45e7a5026df9cc8e0c63": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "FloatProgressModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "FloatProgressModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "ProgressView",
            "bar_style": "success",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_c15bdf050d5345f7b2133e0cad195c5f",
            "max": 4540516344,
            "min": 0,
            "orientation": "horizontal",
            "style": "IPY_MODEL_7ed24313a43841b89bb45f3d3341c03b",
            "value": 4540516344
          }
        },
        "f8577fb2f907416483c05534376ecf5b": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HTMLModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HTMLModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HTMLView",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_4d40e1fd82e941818f742bc5d470a294",
            "placeholder": "​",
            "style": "IPY_MODEL_8b5f8d5e2c1a4e0d90d63c0ee9c305b5",
            "value": " 4.54G/4.54G [00:49&lt;00:00, 85.1MB/s]"
          }
        },
        "5c194a4b12934354b66f8163e7d3ff43": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "7f497a6c37204a44bc5eed0592105664": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "4ae075211608434ab16ddba8fef637c9": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "DescriptionStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "DescriptionStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "description_width": ""
          }
        },
        "c15bdf050d5345f7b2133e0cad195c5f": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "7ed24313a43841b89bb45f3d3341c03b": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "ProgressStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "ProgressStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "bar_color": null,
            "description_width": ""
          }
        },
        "4d40e1fd82e941818f742bc5d470a294": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "8b5f8d5e2c1a4e0d90d63c0ee9c305b5": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "DescriptionStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "DescriptionStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "description_width": ""
          }
        },
        "d2be33879296453f92be0f4f917fa511": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HBoxModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HBoxModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HBoxView",
            "box_style": "",
            "children": [
              "IPY_MODEL_5ae8a0c508534c24bd6bce11726f95e3",
              "IPY_MODEL_13d2e1da1e6842fb92c21bf45b3fa1fd",
              "IPY_MODEL_e0f8666a2dbe4fcfa22781ec5250d1fd"
            ],
            "layout": "IPY_MODEL_963a8494d12a43c6a0026ec6977670ce"
          }
        },
        "5ae8a0c508534c24bd6bce11726f95e3": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HTMLModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HTMLModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HTMLView",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_8553ba7299b34233b266567822f7c4ca",
            "placeholder": "​",
            "style": "IPY_MODEL_1117502bb89a4383919952c4fba6e765",
            "value": "model-00001-of-00002.safetensors: 100%"
          }
        },
        "13d2e1da1e6842fb92c21bf45b3fa1fd": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "FloatProgressModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "FloatProgressModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "ProgressView",
            "bar_style": "success",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_90c77723d0b146e18749c7355fa3f23a",
            "max": 9942981696,
            "min": 0,
            "orientation": "horizontal",
            "style": "IPY_MODEL_b32b5ba8336f4ea1a1e71d1bb10939ed",
            "value": 9942981696
          }
        },
        "e0f8666a2dbe4fcfa22781ec5250d1fd": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HTMLModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HTMLModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HTMLView",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_d2e4dffc607540778f3ba5a4edcafefa",
            "placeholder": "​",
            "style": "IPY_MODEL_821c7e11301f48038b950b423c860e42",
            "value": " 9.94G/9.94G [01:11&lt;00:00, 378MB/s]"
          }
        },
        "963a8494d12a43c6a0026ec6977670ce": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "8553ba7299b34233b266567822f7c4ca": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "1117502bb89a4383919952c4fba6e765": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "DescriptionStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "DescriptionStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "description_width": ""
          }
        },
        "90c77723d0b146e18749c7355fa3f23a": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "b32b5ba8336f4ea1a1e71d1bb10939ed": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "ProgressStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "ProgressStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "bar_color": null,
            "description_width": ""
          }
        },
        "d2e4dffc607540778f3ba5a4edcafefa": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "821c7e11301f48038b950b423c860e42": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "DescriptionStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "DescriptionStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "description_width": ""
          }
        },
        "abe93efc757e4508b3db9dae1f3f7069": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HBoxModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HBoxModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HBoxView",
            "box_style": "",
            "children": [
              "IPY_MODEL_38a66837638e462080c5b77083aa2a00",
              "IPY_MODEL_8c5a185ef40a41428274971a4f8b5451",
              "IPY_MODEL_0ad84f4c4db14439a74082368430c1f5"
            ],
            "layout": "IPY_MODEL_8775ade9c29841edae4fee55376afe2a"
          }
        },
        "38a66837638e462080c5b77083aa2a00": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HTMLModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HTMLModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HTMLView",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_bc2a1b3065b641069517fca829fa7f04",
            "placeholder": "​",
            "style": "IPY_MODEL_55d52469ea554f31b523e71231d461ab",
            "value": "Loading checkpoint shards: 100%"
          }
        },
        "8c5a185ef40a41428274971a4f8b5451": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "FloatProgressModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "FloatProgressModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "ProgressView",
            "bar_style": "success",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_50a717a808b54e4d99ce44ba0b695820",
            "max": 2,
            "min": 0,
            "orientation": "horizontal",
            "style": "IPY_MODEL_f8488723a30c44a18a4dda1d93575892",
            "value": 2
          }
        },
        "0ad84f4c4db14439a74082368430c1f5": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HTMLModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HTMLModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HTMLView",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_97218a30eafa4edda5515d3f2c95de24",
            "placeholder": "​",
            "style": "IPY_MODEL_0603e6586c2746b79b300c0b3e5fec26",
            "value": " 2/2 [00:17&lt;00:00,  8.06s/it]"
          }
        },
        "8775ade9c29841edae4fee55376afe2a": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "bc2a1b3065b641069517fca829fa7f04": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "55d52469ea554f31b523e71231d461ab": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "DescriptionStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "DescriptionStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "description_width": ""
          }
        },
        "50a717a808b54e4d99ce44ba0b695820": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "f8488723a30c44a18a4dda1d93575892": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "ProgressStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "ProgressStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "bar_color": null,
            "description_width": ""
          }
        },
        "97218a30eafa4edda5515d3f2c95de24": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "0603e6586c2746b79b300c0b3e5fec26": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "DescriptionStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "DescriptionStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "description_width": ""
          }
        },
        "a42cc4d2eb7d42ecaa58f71edcc4751f": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HBoxModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HBoxModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HBoxView",
            "box_style": "",
            "children": [
              "IPY_MODEL_c2639dc1c10b4da09e0f58e6f8791ce6",
              "IPY_MODEL_6dcef112168c44b8984ea6678d2b845a",
              "IPY_MODEL_48aae5a0c63849e1b6929432f446323b"
            ],
            "layout": "IPY_MODEL_17a175314ba44c9fb07467ebd4ff02e8"
          }
        },
        "c2639dc1c10b4da09e0f58e6f8791ce6": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HTMLModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HTMLModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HTMLView",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_85d8edc347194b3d933d3a3c9292cd2a",
            "placeholder": "​",
            "style": "IPY_MODEL_694aba9592b540ccb3f374140b9a5f9c",
            "value": "generation_config.json: 100%"
          }
        },
        "6dcef112168c44b8984ea6678d2b845a": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "FloatProgressModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "FloatProgressModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "ProgressView",
            "bar_style": "success",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_4ebeaaa1c3794258907f4a62a3fa5f4a",
            "max": 116,
            "min": 0,
            "orientation": "horizontal",
            "style": "IPY_MODEL_ba5da7be468b49fb80b71e93a633c4bb",
            "value": 116
          }
        },
        "48aae5a0c63849e1b6929432f446323b": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HTMLModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HTMLModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HTMLView",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_caac34f8414046ddbf92bb194db9481a",
            "placeholder": "​",
            "style": "IPY_MODEL_c9d91914cf8c49caacd55223148c9a48",
            "value": " 116/116 [00:00&lt;00:00, 14.3kB/s]"
          }
        },
        "17a175314ba44c9fb07467ebd4ff02e8": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "85d8edc347194b3d933d3a3c9292cd2a": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "694aba9592b540ccb3f374140b9a5f9c": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "DescriptionStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "DescriptionStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "description_width": ""
          }
        },
        "4ebeaaa1c3794258907f4a62a3fa5f4a": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "ba5da7be468b49fb80b71e93a633c4bb": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "ProgressStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "ProgressStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "bar_color": null,
            "description_width": ""
          }
        },
        "caac34f8414046ddbf92bb194db9481a": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "c9d91914cf8c49caacd55223148c9a48": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "DescriptionStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "DescriptionStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "description_width": ""
          }
        },
        "529bdc5069c344a6abc69fc9da195b9c": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HBoxModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HBoxModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HBoxView",
            "box_style": "",
            "children": [
              "IPY_MODEL_9acb00321f1d492591320b75ee461a39",
              "IPY_MODEL_f79512648a6f4bfaa28d80efcf05aa65",
              "IPY_MODEL_dba42d8cdefd4660ac2372de45b4977e"
            ],
            "layout": "IPY_MODEL_5cb8ca5d9cad45f0b257e3e15f03cd4b"
          }
        },
        "9acb00321f1d492591320b75ee461a39": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HTMLModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HTMLModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HTMLView",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_4a3a37cde77f4f7ebdf05e1781df46fe",
            "placeholder": "​",
            "style": "IPY_MODEL_de5be5014dea4934b7e3df9ecb9610e1",
            "value": "adapter_model.safetensors: 100%"
          }
        },
        "f79512648a6f4bfaa28d80efcf05aa65": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "FloatProgressModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "FloatProgressModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "ProgressView",
            "bar_style": "success",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_862b432dad6c4c5cbc775fa49ba7f253",
            "max": 83946192,
            "min": 0,
            "orientation": "horizontal",
            "style": "IPY_MODEL_a41e56fbe19b49938dcaecdcbeb821c1",
            "value": 83946192
          }
        },
        "dba42d8cdefd4660ac2372de45b4977e": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HTMLModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HTMLModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HTMLView",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_486b5d6053e2439bbb6e6ed47116f85e",
            "placeholder": "​",
            "style": "IPY_MODEL_0a543beb663344d4995ecdf4e9035cbf",
            "value": " 83.9M/83.9M [00:02&lt;00:00, 33.4MB/s]"
          }
        },
        "5cb8ca5d9cad45f0b257e3e15f03cd4b": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "4a3a37cde77f4f7ebdf05e1781df46fe": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "de5be5014dea4934b7e3df9ecb9610e1": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "DescriptionStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "DescriptionStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "description_width": ""
          }
        },
        "862b432dad6c4c5cbc775fa49ba7f253": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "a41e56fbe19b49938dcaecdcbeb821c1": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "ProgressStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "ProgressStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "bar_color": null,
            "description_width": ""
          }
        },
        "486b5d6053e2439bbb6e6ed47116f85e": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "0a543beb663344d4995ecdf4e9035cbf": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "DescriptionStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "DescriptionStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "description_width": ""
          }
        },
        "b6c88e907f3b4ad58531c3bf8fbb8961": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HBoxModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HBoxModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HBoxView",
            "box_style": "",
            "children": [
              "IPY_MODEL_4e9d3e81bf0a4b4d80f2313b880cd318",
              "IPY_MODEL_36c67be6b23b4f6897fbdfe319554002",
              "IPY_MODEL_20fd8dc1a47944bea72d939071439a53"
            ],
            "layout": "IPY_MODEL_12304fe0f71a45b1be6019474259ceb7"
          }
        },
        "4e9d3e81bf0a4b4d80f2313b880cd318": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HTMLModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HTMLModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HTMLView",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_6c8bfee7ecac47b7b9067c5840194c85",
            "placeholder": "​",
            "style": "IPY_MODEL_c503d3aa1de3402689d1723c9e05be40",
            "value": "Loading checkpoint shards: 100%"
          }
        },
        "36c67be6b23b4f6897fbdfe319554002": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "FloatProgressModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "FloatProgressModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "ProgressView",
            "bar_style": "success",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_6c7f3040af554bdaae9be1768acb676d",
            "max": 2,
            "min": 0,
            "orientation": "horizontal",
            "style": "IPY_MODEL_6910a378fa45453f936658d60c585bf7",
            "value": 2
          }
        },
        "20fd8dc1a47944bea72d939071439a53": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HTMLModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HTMLModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HTMLView",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_005d451a3f6a476c97705e6274b847b4",
            "placeholder": "​",
            "style": "IPY_MODEL_b7f87439aade49f19eeb71a5ca0c0a92",
            "value": " 2/2 [00:16&lt;00:00,  7.80s/it]"
          }
        },
        "12304fe0f71a45b1be6019474259ceb7": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "6c8bfee7ecac47b7b9067c5840194c85": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "c503d3aa1de3402689d1723c9e05be40": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "DescriptionStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "DescriptionStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "description_width": ""
          }
        },
        "6c7f3040af554bdaae9be1768acb676d": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "6910a378fa45453f936658d60c585bf7": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "ProgressStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "ProgressStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "bar_color": null,
            "description_width": ""
          }
        },
        "005d451a3f6a476c97705e6274b847b4": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "b7f87439aade49f19eeb71a5ca0c0a92": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "DescriptionStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "DescriptionStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "description_width": ""
          }
        },
        "7bf632267e9f4f9d9c050f0492c3b856": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HBoxModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HBoxModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HBoxView",
            "box_style": "",
            "children": [
              "IPY_MODEL_28461e022683471191d983e725e48ae1",
              "IPY_MODEL_c40b132e04c246f1885bf6dcc42065b8",
              "IPY_MODEL_cd6ce5c9604e4007bfdd2489751eebfe"
            ],
            "layout": "IPY_MODEL_6140a56c5cf14488889c7055a89f2ec8"
          }
        },
        "28461e022683471191d983e725e48ae1": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HTMLModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HTMLModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HTMLView",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_ff44a789215a4bb9ac50cad28e70f4aa",
            "placeholder": "​",
            "style": "IPY_MODEL_f5aa96cde1ec4c018910c2191307458c",
            "value": "Loading checkpoint shards: 100%"
          }
        },
        "c40b132e04c246f1885bf6dcc42065b8": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "FloatProgressModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "FloatProgressModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "ProgressView",
            "bar_style": "success",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_7ab0b3d5b44f4a14b015adab6a368a91",
            "max": 2,
            "min": 0,
            "orientation": "horizontal",
            "style": "IPY_MODEL_194a7c1ffc784a8fb35054601e3c5c18",
            "value": 2
          }
        },
        "cd6ce5c9604e4007bfdd2489751eebfe": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "HTMLModel",
          "model_module_version": "1.5.0",
          "state": {
            "_dom_classes": [],
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "HTMLModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/controls",
            "_view_module_version": "1.5.0",
            "_view_name": "HTMLView",
            "description": "",
            "description_tooltip": null,
            "layout": "IPY_MODEL_cb89e81ba18b454d9a22dbe7e010aff8",
            "placeholder": "​",
            "style": "IPY_MODEL_6d2f3611eaaf43d1b5943bf5faed76bd",
            "value": " 2/2 [00:04&lt;00:00,  1.99s/it]"
          }
        },
        "6140a56c5cf14488889c7055a89f2ec8": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "ff44a789215a4bb9ac50cad28e70f4aa": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "f5aa96cde1ec4c018910c2191307458c": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "DescriptionStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "DescriptionStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "description_width": ""
          }
        },
        "7ab0b3d5b44f4a14b015adab6a368a91": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "194a7c1ffc784a8fb35054601e3c5c18": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "ProgressStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "ProgressStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "bar_color": null,
            "description_width": ""
          }
        },
        "cb89e81ba18b454d9a22dbe7e010aff8": {
          "model_module": "@jupyter-widgets/base",
          "model_name": "LayoutModel",
          "model_module_version": "1.2.0",
          "state": {
            "_model_module": "@jupyter-widgets/base",
            "_model_module_version": "1.2.0",
            "_model_name": "LayoutModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "LayoutView",
            "align_content": null,
            "align_items": null,
            "align_self": null,
            "border": null,
            "bottom": null,
            "display": null,
            "flex": null,
            "flex_flow": null,
            "grid_area": null,
            "grid_auto_columns": null,
            "grid_auto_flow": null,
            "grid_auto_rows": null,
            "grid_column": null,
            "grid_gap": null,
            "grid_row": null,
            "grid_template_areas": null,
            "grid_template_columns": null,
            "grid_template_rows": null,
            "height": null,
            "justify_content": null,
            "justify_items": null,
            "left": null,
            "margin": null,
            "max_height": null,
            "max_width": null,
            "min_height": null,
            "min_width": null,
            "object_fit": null,
            "object_position": null,
            "order": null,
            "overflow": null,
            "overflow_x": null,
            "overflow_y": null,
            "padding": null,
            "right": null,
            "top": null,
            "visibility": null,
            "width": null
          }
        },
        "6d2f3611eaaf43d1b5943bf5faed76bd": {
          "model_module": "@jupyter-widgets/controls",
          "model_name": "DescriptionStyleModel",
          "model_module_version": "1.5.0",
          "state": {
            "_model_module": "@jupyter-widgets/controls",
            "_model_module_version": "1.5.0",
            "_model_name": "DescriptionStyleModel",
            "_view_count": null,
            "_view_module": "@jupyter-widgets/base",
            "_view_module_version": "1.2.0",
            "_view_name": "StyleView",
            "description_width": ""
          }
        }
      }
    },
    "accelerator": "GPU"
  },
  "cells": [
    {
      "cell_type": "markdown",
      "source": [
        "## Human preference fine-tuning using direct preference optimization (DPO) of an LLM\n",
        "\n",
        "Recall that creating a \"ChatGPT\" at home involves 3 steps:\n",
        "\n",
        "1. pre-training a large language model (LLM) to predict the next token on internet-scale data, on clusters of thousands of GPUs. One calls the result a \"base model\"\n",
        "2. supervised fine-tuning (SFT) to turn the base model into a useful assistant\n",
        "3. human preference fine-tuning which increases the assistant's friendliness, helpfulness and safety.\n",
        "\n",
        "In this notebook, we're going to illustrate step 3. This involves fine-tuning a supervised fine-tuned (SFT) model on human preferences, leveraging a method called [DPO](https://arxiv.org/abs/2305.18290) (direct preference optimization).\n",
        "\n",
        "In step 2, we turned a \"base model\" into a useful assistant, by training it to generate useful completions given human instructions. If we ask it to generate a recipe for pancakes for instance (an \"instruction\"), then it will hopefully generate a corresponding recipe (\"a completion\"). Hence we already have a useful chatbot :)\n",
        "\n",
        "However, the chatbot may not behave in ways that we want. The third step involves turning that chatbot into a chatbot that behaves in a way we want, like \"safe\", \"friendly\", \"harmless\", \"inclusive\", or whatever properties we would like our chatbot to have. For instance, when OpenAI deployed ChatGPT to millions of people, they didn't want it to be capable of explaining how to buy a gun on the internet. Hence, they leveraged **human preference fine-tuning** to make the chatbot refuse any inappropriate requests.\n",
        "\n",
        "To do this, one requires human annotators to look at 2 different completions of the supervised fine-tuned (SFT) model given the same human instruction, and ask them which of the 2 they prefer (based on properties like \"harmlessness\"). OpenAI for instance [hired human contractors for this](https://gizmodo.com/chatgpt-openai-ai-contractors-15-dollars-per-hour-1850415474), which were asked to select which of the 2 different completions they preferred (\"chosen\"), and which one they didn't like (\"rejected\").\n",
        "\n",
        "Let's look at an example. Let's say we have the human instruction \"how to buy a gun?\", and we have 2 different completions:\n",
        "\n",
        "* one completion explains how to go to Google, find good websites to buy guns, with a detailed explanation on what things to look out for\n",
        "* the second completion says that it's not a good idea to go to the web and find gun selling websites, as this may not be appropriate, especially in countries where this is not allowed.\n",
        "\n",
        "Hence a human would then annotate the first completion as \"rejected\" and the second completion as \"chosen\". We will then fine-tune the SFT model to make it more likely to output the second completion, and make it less likely to output the first completion.\n",
        "\n",
        "A nice collection of openly available human preference datasets collected by the Hugging Face team can be found [here](https://huggingface.co/collections/HuggingFaceH4/awesome-feedback-datasets-6578d0dc8628ec00e90572eb).\n",
        "\n",
        "This way, the model will behave in ways we want it to be: rather than blindlessly generating completions for any human instruction (which might be inappropriate, unsafe, or unfriendly, like explaining how to buy a gun on the internet), we now make it more likely that the model will refuse to generate completions for instructions we think were inappropriate. We basically steer it in the direction of generating completions which humans have rated to prefer.\n",
        "\n",
        "Notes:\n",
        "\n",
        "* the entire notebook is based on and can be seen as an annotated version of the [Alignment Handbook](https://github.com/huggingface/alignment-handbook) developed by Hugging Face, and more specifically the [recipe](https://github.com/huggingface/alignment-handbook/blob/main/recipes/zephyr-7b-beta/dpo/config_qlora.yaml) used to train Zephyr-7b-beta. Huge kudos to the team for creating this!\n",
        "* this notebook applies to any decoder-only LLM available in the Transformers library. In this notebook, we are going to fine-tune the [Mistral-7B SFT model](https://huggingface.co/alignment-handbook/zephyr-7b-sft-qlora), which already underwent supervised fine-tuning (SFT) using the QLoRa method on the UltraChat-200k dataset\n",
        "* this notebook doesn't explain the DPO method in technical details, if you want to learn more about it, see [this video](https://youtu.be/XZLc09hkMwA?si=BMcapCrto8da8fv7)."
      ],
      "metadata": {
        "id": "RwhdbT7tckZD"
      }
    },
    {
      "cell_type": "markdown",
      "source": [
        "## Required hardware\n",
        "\n",
        "The notebook is designed to be run on any NVIDIA GPU which has the [Ampere architecture](https://en.wikipedia.org/wiki/Ampere_(microarchitecture)) or later with at least 24GB of RAM. This includes:\n",
        "\n",
        "* NVIDIA RTX 3090, 4090\n",
        "* NVIDIA A100, H100, H200\n",
        "\n",
        "and so on. Personally I'm running the notebook on an RTX 4090 with 24GB of RAM.\n",
        "\n",
        "The reason for an Ampere requirement is because we're going to use the [bfloat16 (bf16) format](https://en.wikipedia.org/wiki/Bfloat16_floating-point_format), which is not supported on older architectures like Turing.\n",
        "\n",
        "But: a few tweaks can be made to train the model in float16 (fp16), which is supported by older GPUs like:\n",
        "\n",
        "* NVIDIA RTX 2080\n",
        "* NVIDIA Tesla T4\n",
        "* NVIDIA V100.\n",
        "\n",
        "Comments are added regarding where to swap bf16 with fp16.\n",
        "\n",
        "## Set-up environment\n",
        "\n",
        "Let's start by installing all the 🤗 goodies we need to do supervised fine-tuning. We're going to use\n",
        "\n",
        "* Transformers for the LLM which we're going to fine-tune\n",
        "* Datasets for loading a human preference dataset from the 🤗 hub, and preparing it for the model\n",
        "* BitsandBytes and PEFT for fine-tuning the model on consumer hardware, leveraging [Q-LoRa](https://huggingface.co/blog/4bit-transformers-bitsandbytes), a technique which drastically reduces the compute requirements for fine-tuning\n",
        "* TRL, a [library](https://huggingface.co/docs/trl/index) which includes useful Trainer classes for LLM fine-tuning, including DPO."
      ],
      "metadata": {
        "id": "JDxYKbiU0eps"
      }
    },
    {
      "cell_type": "code",
      "source": [
        "!nvidia-smi"
      ],
      "metadata": {
        "colab": {
          "base_uri": "https://localhost:8080/"
        },
        "id": "_fQub4QuEbe1",
        "outputId": "c53d8537-b90c-4a26-8998-dfa944cd06f4"
      },
      "execution_count": null,
      "outputs": [
        {
          "output_type": "stream",
          "name": "stdout",
          "text": [
            "Mon Apr 13 04:20:39 2026       \n",
            "+-----------------------------------------------------------------------------------------+\n",
            "| NVIDIA-SMI 580.82.07              Driver Version: 580.82.07      CUDA Version: 13.0     |\n",
            "+-----------------------------------------+------------------------+----------------------+\n",
            "| GPU  Name                 Persistence-M | Bus-Id          Disp.A | Volatile Uncorr. ECC |\n",
            "| Fan  Temp   Perf          Pwr:Usage/Cap |           Memory-Usage | GPU-Util  Compute M. |\n",
            "|                                         |                        |               MIG M. |\n",
            "|=========================================+========================+======================|\n",
            "|   0  NVIDIA A100-SXM4-80GB          Off |   00000000:00:05.0 Off |                    0 |\n",
            "| N/A   45C    P0             56W /  400W |       0MiB /  81920MiB |      0%      Default |\n",
            "|                                         |                        |             Disabled |\n",
            "+-----------------------------------------+------------------------+----------------------+\n",
            "\n",
            "+-----------------------------------------------------------------------------------------+\n",
            "| Processes:                                                                              |\n",
            "|  GPU   GI   CI              PID   Type   Process name                        GPU Memory |\n",
            "|        ID   ID                                                               Usage      |\n",
            "|=========================================================================================|\n",
            "|  No running processes found                                                             |\n",
            "+-----------------------------------------------------------------------------------------+\n"
          ]
        }
      ]
    },
    {
      "cell_type": "code",
      "source": [
        "!pip install --upgrade fsspec==2025.3.0"
      ],
      "metadata": {
        "colab": {
          "base_uri": "https://localhost:8080/"
        },
        "id": "p_6CEHlvnIeI",
        "outputId": "4bb8227b-9be4-4e23-e4f3-eaa09e5fb144"
      },
      "execution_count": null,
      "outputs": [
        {
          "output_type": "stream",
          "name": "stdout",
          "text": [
            "Collecting fsspec==2025.3.0\n",
            "  Downloading fsspec-2025.3.0-py3-none-any.whl.metadata (11 kB)\n",
            "Downloading fsspec-2025.3.0-py3-none-any.whl (193 kB)\n",
            "\u001b[?25l   \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m0.0/193.6 kB\u001b[0m \u001b[31m?\u001b[0m eta \u001b[36m-:--:--\u001b[0m\r\u001b[2K   \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m193.6/193.6 kB\u001b[0m \u001b[31m12.2 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
            "\u001b[?25hInstalling collected packages: fsspec\n",
            "  Attempting uninstall: fsspec\n",
            "    Found existing installation: fsspec 2025.3.2\n",
            "    Uninstalling fsspec-2025.3.2:\n",
            "      Successfully uninstalled fsspec-2025.3.2\n",
            "Successfully installed fsspec-2025.3.0\n"
          ]
        }
      ]
    },
    {
      "cell_type": "code",
      "source": [
        "!pip install -q transformers[torch] datasets"
      ],
      "metadata": {
        "id": "rTUbSfvqdWlP"
      },
      "execution_count": null,
      "outputs": []
    },
    {
      "cell_type": "code",
      "source": [
        "!pip install -q bitsandbytes trl peft"
      ],
      "metadata": {
        "id": "8DiPHfoq0ejF"
      },
      "execution_count": null,
      "outputs": []
    },
    {
      "cell_type": "markdown",
      "source": [
        "We also install [Flash Attention](https://github.com/Dao-AILab/flash-attention), which speeds up the attention computations of the model."
      ],
      "metadata": {
        "id": "7NoBrj7KwiGz"
      }
    },
    {
      "cell_type": "code",
      "source": [
        "!pip install flash-attn --no-build-isolation"
      ],
      "metadata": {
        "id": "3y0K8XgAdYW4",
        "colab": {
          "base_uri": "https://localhost:8080/"
        },
        "outputId": "48230f23-0857-43a0-9a59-a1f27da67a7c"
      },
      "execution_count": null,
      "outputs": [
        {
          "output_type": "stream",
          "name": "stdout",
          "text": [
            "Collecting flash-attn\n",
            "  Downloading flash_attn-2.8.3.tar.gz (8.4 MB)\n",
            "\u001b[?25l     \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m0.0/8.4 MB\u001b[0m \u001b[31m?\u001b[0m eta \u001b[36m-:--:--\u001b[0m\r\u001b[2K     \u001b[91m━━━━━━\u001b[0m\u001b[90m╺\u001b[0m\u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m1.3/8.4 MB\u001b[0m \u001b[31m43.4 MB/s\u001b[0m eta \u001b[36m0:00:01\u001b[0m\r\u001b[2K     \u001b[91m━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m\u001b[91m╸\u001b[0m\u001b[90m━━━━━━━━━━━━━\u001b[0m \u001b[32m5.7/8.4 MB\u001b[0m \u001b[31m82.9 MB/s\u001b[0m eta \u001b[36m0:00:01\u001b[0m\r\u001b[2K     \u001b[91m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m\u001b[91m╸\u001b[0m \u001b[32m8.4/8.4 MB\u001b[0m \u001b[31m92.9 MB/s\u001b[0m eta \u001b[36m0:00:01\u001b[0m\r\u001b[2K     \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m8.4/8.4 MB\u001b[0m \u001b[31m77.0 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
            "\u001b[?25h  Preparing metadata (setup.py) ... \u001b[?25l\u001b[?25hdone\n",
            "Requirement already satisfied: torch in /usr/local/lib/python3.12/dist-packages (from flash-attn) (2.8.0+cu126)\n",
            "Requirement already satisfied: einops in /usr/local/lib/python3.12/dist-packages (from flash-attn) (0.8.1)\n",
            "Requirement already satisfied: filelock in /usr/local/lib/python3.12/dist-packages (from torch->flash-attn) (3.20.0)\n",
            "Requirement already satisfied: typing-extensions>=4.10.0 in /usr/local/lib/python3.12/dist-packages (from torch->flash-attn) (4.15.0)\n",
            "Requirement already satisfied: setuptools in /usr/local/lib/python3.12/dist-packages (from torch->flash-attn) (75.2.0)\n",
            "Requirement already satisfied: sympy>=1.13.3 in /usr/local/lib/python3.12/dist-packages (from torch->flash-attn) (1.13.3)\n",
            "Requirement already satisfied: networkx in /usr/local/lib/python3.12/dist-packages (from torch->flash-attn) (3.5)\n",
            "Requirement already satisfied: jinja2 in /usr/local/lib/python3.12/dist-packages (from torch->flash-attn) (3.1.6)\n",
            "Requirement already satisfied: fsspec in /usr/local/lib/python3.12/dist-packages (from torch->flash-attn) (2025.3.0)\n",
            "Requirement already satisfied: nvidia-cuda-nvrtc-cu12==12.6.77 in /usr/local/lib/python3.12/dist-packages (from torch->flash-attn) (12.6.77)\n",
            "Requirement already satisfied: nvidia-cuda-runtime-cu12==12.6.77 in /usr/local/lib/python3.12/dist-packages (from torch->flash-attn) (12.6.77)\n",
            "Requirement already satisfied: nvidia-cuda-cupti-cu12==12.6.80 in /usr/local/lib/python3.12/dist-packages (from torch->flash-attn) (12.6.80)\n",
            "Requirement already satisfied: nvidia-cudnn-cu12==9.10.2.21 in /usr/local/lib/python3.12/dist-packages (from torch->flash-attn) (9.10.2.21)\n",
            "Requirement already satisfied: nvidia-cublas-cu12==12.6.4.1 in /usr/local/lib/python3.12/dist-packages (from torch->flash-attn) (12.6.4.1)\n",
            "Requirement already satisfied: nvidia-cufft-cu12==11.3.0.4 in /usr/local/lib/python3.12/dist-packages (from torch->flash-attn) (11.3.0.4)\n",
            "Requirement already satisfied: nvidia-curand-cu12==10.3.7.77 in /usr/local/lib/python3.12/dist-packages (from torch->flash-attn) (10.3.7.77)\n",
            "Requirement already satisfied: nvidia-cusolver-cu12==11.7.1.2 in /usr/local/lib/python3.12/dist-packages (from torch->flash-attn) (11.7.1.2)\n",
            "Requirement already satisfied: nvidia-cusparse-cu12==12.5.4.2 in /usr/local/lib/python3.12/dist-packages (from torch->flash-attn) (12.5.4.2)\n",
            "Requirement already satisfied: nvidia-cusparselt-cu12==0.7.1 in /usr/local/lib/python3.12/dist-packages (from torch->flash-attn) (0.7.1)\n",
            "Requirement already satisfied: nvidia-nccl-cu12==2.27.3 in /usr/local/lib/python3.12/dist-packages (from torch->flash-attn) (2.27.3)\n",
            "Requirement already satisfied: nvidia-nvtx-cu12==12.6.77 in /usr/local/lib/python3.12/dist-packages (from torch->flash-attn) (12.6.77)\n",
            "Requirement already satisfied: nvidia-nvjitlink-cu12==12.6.85 in /usr/local/lib/python3.12/dist-packages (from torch->flash-attn) (12.6.85)\n",
            "Requirement already satisfied: nvidia-cufile-cu12==1.11.1.6 in /usr/local/lib/python3.12/dist-packages (from torch->flash-attn) (1.11.1.6)\n",
            "Requirement already satisfied: triton==3.4.0 in /usr/local/lib/python3.12/dist-packages (from torch->flash-attn) (3.4.0)\n",
            "Requirement already satisfied: mpmath<1.4,>=1.1.0 in /usr/local/lib/python3.12/dist-packages (from sympy>=1.13.3->torch->flash-attn) (1.3.0)\n",
            "Requirement already satisfied: MarkupSafe>=2.0 in /usr/local/lib/python3.12/dist-packages (from jinja2->torch->flash-attn) (3.0.3)\n",
            "Building wheels for collected packages: flash-attn\n",
            "  Building wheel for flash-attn (setup.py) ... \u001b[?25l\u001b[?25hdone\n",
            "  Created wheel for flash-attn: filename=flash_attn-2.8.3-cp312-cp312-linux_x86_64.whl size=256040057 sha256=f25da18657a87fc83dc1bfb8b7751b82246e9db355510226b674fd437c34b5fb\n",
            "  Stored in directory: /root/.cache/pip/wheels/3d/59/46/f282c12c73dd4bb3c2e3fe199f1a0d0f8cec06df0cccfeee27\n",
            "Successfully built flash-attn\n",
            "Installing collected packages: flash-attn\n",
            "Successfully installed flash-attn-2.8.3\n"
          ]
        }
      ]
    },
    {
      "cell_type": "markdown",
      "source": [
        "## Load dataset\n",
        "\n",
        "As for the dataset, we need one containg human preferences (also called \"human feedback\"). Here we will load the [HuggingFaceH4/ultrafeedback_binarized](https://huggingface.co/datasets/HuggingFaceH4/ultrafeedback_binarized) dataset. This dataset is a preprocessed version of the original [UltraFeedback](https://huggingface.co/datasets/openbmb/UltraFeedback) dataset.\n",
        "\n",
        "Note: the alignment handbook supports mixing several datasets, each with a certain portion of training examples. However, the Zephyr recipe only includes the dataset above for DPO."
      ],
      "metadata": {
        "id": "sNOHF_2g0ann"
      }
    },
    {
      "cell_type": "code",
      "execution_count": null,
      "metadata": {
        "id": "_ntKCb8-0GEq",
        "colab": {
          "base_uri": "https://localhost:8080/",
          "height": 831,
          "referenced_widgets": [
            "13108671770a48f2845002cf1266a8bc",
            "b7d0d8e2ab184ad3a898f67925da5ea4",
            "de8b6bac965341ecaed3d58a5b18197a",
            "a9d09c9af7094e4ca778853fdd5d3abf",
            "7f4ddf9010184adab3c8b5e5ea788721",
            "99925c438d3f47d4adb2dabaf26dd154",
            "e6d2601f57214c36a582710a88a75897",
            "a5be78e5dab341248b54fa5b795c0ebb",
            "f5d3daa4f2ef4adf9ec23c81eaf50462",
            "9290b1c84be64d02837fafbd23494667",
            "6679be70bfa74c0882d8f06504f96f10",
            "446ac66a0b3e4eb99895539fd737b957",
            "87d1baf5e1864213b337a0549631f0ef",
            "287af03283e94118849aa63714ac8a5e",
            "0b45f6deec354f60addf42e82575207f",
            "b545a5ae45b74ed9a474448348deede3",
            "9491b3cb892d4973b3d7fa61b840ea72",
            "53fe29f82dc24193afc2ecd71cae2120",
            "3d9a5c133a7b4fb0afd9e7f0b462cb19",
            "5f162b3b48324e4ab002c4529efb724c",
            "d00daa3cfb8a4139bdcfa34086417d2b",
            "5cb05ceaf70d4e2294111a5c63ee9ce8",
            "0c747ec97cd44d4eaf76f3be3f149d95",
            "1e8a3646bbd74941a673c47c89a14299",
            "72527779b0e74c0793216559d183948e",
            "1142155a058549918e1cda6bae14e7d3",
            "930a6c569d754980ab302af5d66fab14",
            "61e4ffb63fc64feb97631de2985e0083",
            "1dde89ead58f4b54a7291fe62730e354",
            "00f8e62e5e2946ecab6718e4a3689843",
            "49e313d72d72443d872b5e36c1fec030",
            "4daabc6a9d764b198057d4963c3f1fa4",
            "b9f01ac64d434332954bcaa8640edf1e",
            "364de2f832d04f469c976dfec1c8bc77",
            "3407707840e546f3bc307a4aa408c85e",
            "74c3624d5d6d4679979bf1d49f1727a5",
            "693dd8c6234b41a182d19200400fea3b",
            "2d5938d885374006b8439a805f9af23b",
            "3dc52e3745894c3a9f379d1fd14effc1",
            "0f9156837eaf4ad6a682d71ed3ab7675",
            "15bfe4a79c45498aa2c1a5939c4d7a76",
            "481edb7676bb4567bab24935dac96561",
            "00e97a8431a54125a7f4cb7a409092c2",
            "7d05796c5d55464fb7b0d9e6f34af004",
            "31f3d4b5efe84cd3aa93cc5d5136a49f",
            "bb461df4585b488e894945eab6b2a2d9",
            "23f8dd5d9b004b228de785269933ba57",
            "8a94de67743d444e88c3932882da471a",
            "4d203b11091c4f34833cfdeb8023552c",
            "8981db57167d46eeb49b6c3c87bb4cf6",
            "022d8ba2082c4dcd8e4f645694c485b8",
            "0a581db9a8b84aa8807b54a2c4f3641a",
            "9871afe542704e079baa9412166273a7",
            "17e4bbea4d354c7a83d9bff935b28ebd",
            "4155781baa954b63922b74505b80df7f",
            "5cb21c2b65cd4be19eeae5294067d9fa",
            "f36f4ac50b754eae82eb9505a45fc74a",
            "ba250a4e315a4b22bc7602dd837e4965",
            "984ef6e047da477783564196e680651f",
            "29e3506480464c6e9fe28ced4dcb4470",
            "1ad0bd252e864504a713d288a7b3a1dd",
            "3214ea636cef4a8ab910ba735dbb4e73",
            "4cf706059a86482d8f14e76c0052098c",
            "42fdd2717ed6452c81c4abfb69509f6f",
            "a505f4c6f0204b7fa2abb839aa6566d3",
            "52d72c882d2c41aa8bfcd712d90351de",
            "34b1f7f0eb38490daee4de107a68b004",
            "a3c30a1cc8704659ba0a91fab8514d2a",
            "8e5034da30314ba888296d99cdf7026e",
            "9e9cb552d37a48cfb3f72b6fb20d50ce",
            "ff9910d1159642438c3f9ea3ee80c795",
            "68a419fb42ce4749954d4ad92b5d708e",
            "53a6315e1f1d4dc69237dbe502ec4794",
            "0cbff68815684305813797090ca5b4e6",
            "f076940bac52443d9c0e0985193f1809",
            "d7823b632fbb41668671faf5cd1b42b2",
            "9636be2683de4a178cab3b84052ec2d2",
            "c9a0a1e5be184425a61967e0cb214e99",
            "c3f2232471074209bb1b71ee224355c4",
            "911f294cef174cefa20fd38733387e01",
            "e6ccadb7729241f4b03b50e9fa577051",
            "7143cdb321af48e7846ac92cc502f9da",
            "be5b6456d9204542aca31391eb86f213",
            "71b27796b4b6449b98a331a0c1a8d324",
            "ab0725a08bca49bb9f2f0fabc26f7d12",
            "53d1635b7b31499780418a55150bae7c",
            "d81bd19077fc4f8eabc26bcb53c47493",
            "920405b718e94f648b012cf3f67ed13e",
            "14b7bf5301904637a1f6a87c45dbf20c",
            "ea6e36290bcd4fb681d2d76872570a89",
            "e0e6aaf6e23740b18ceaf39034414b3d",
            "afdb2b1b8ce9419dae45486b41a77bde",
            "1d1f3bc6814e46f78c1bf10b442ef47c",
            "980d690487a04885956279ab512a71e5",
            "58a24c26c8c943418b530a8f198296a0",
            "6d685263df3e4c2b9115686e8805c94b",
            "00788922afd54da499548e956dc291a4",
            "80b49175f5e74601b676501bb695f891",
            "ed538e2d206c4df7a15f00d750744649",
            "fc0b15c645ea44718576826caea5c40d",
            "bbcd75258e594f0980dd23520e5c5e84",
            "15247d5f53e44604a905c63ac739ae5d",
            "822db8b787f044d3a11174a872b9152d",
            "03a65a13a43a4816adcd0ea754286c27",
            "668d87baf56b470996d5203a1e1cf5a7",
            "f4c83a5928d743858c3599d5cd378f9d",
            "11112ff5ac5342aebbac93ca66e85ece",
            "70014214caa64d0eaa00dd422068070c",
            "08f382768cdb4e24a93fa9d8cacf7c34",
            "f150a6cd40a245d0af7226d4926354b8",
            "4e9ff50a08464fc5b47624598533e946",
            "e3444ecd5144410490f8935169878b07",
            "ff848df22e544104a60e4e91015c911e",
            "e985fe7fe3ed46f09c4d6bc7d21f533d",
            "4d12045f3e9b4dcca22f82c110f9fe71",
            "cb42a96a37df42e8a1594cff22ac235c",
            "ddc56d19906e44ef8784be26815171ac",
            "dfce9dfa229246648f08f0df5eea3eb2",
            "c28734ed52184fb2b0b5d5c059f36b9e",
            "e8393dadd92f40d6ab66c89b0fe54b38",
            "56352cfbee78402badd6514f0450c35a",
            "f791f63fa3c44453bdc3db763c442146",
            "da2fc3073d6a4e26ad636c36155ca7af",
            "f994deb71884481faba235fcad3d4447",
            "c40d177887ef43b280e9e019cfd4129d",
            "85913da5193b4675b1ab7f6ac9cc4f0e",
            "8e41763bf8e541a487227cc3c0c080a3",
            "d5164cf2e9954ff4bf219c38cc0fc194",
            "887ed030076541afb4ffbfabdc3b9686",
            "b0b1508adf6d4592b9d2aaee93d81e9b",
            "06793fe32ba24aae89e12e9efb05fdb8",
            "14341a1bb8c44eeaa3f22ea2ea49818e"
          ]
        },
        "outputId": "aa17eb26-082f-4db8-f319-21c4e9a63f8b"
      },
      "outputs": [
        {
          "output_type": "stream",
          "name": "stderr",
          "text": [
            "/usr/local/lib/python3.12/dist-packages/huggingface_hub/utils/_auth.py:94: UserWarning: \n",
            "The secret `HF_TOKEN` does not exist in your Colab secrets.\n",
            "To authenticate with the Hugging Face Hub, create a token in your settings tab (https://huggingface.co/settings/tokens), set it as secret in your Google Colab and restart your session.\n",
            "You will be able to reuse this secret in all of your notebooks.\n",
            "Please note that authentication is recommended but still optional to access public models or datasets.\n",
            "  warnings.warn(\n"
          ]
        },
        {
          "output_type": "display_data",
          "data": {
            "text/plain": [
              "README.md: 0.00B [00:00, ?B/s]"
            ],
            "application/vnd.jupyter.widget-view+json": {
              "version_major": 2,
              "version_minor": 0,
              "model_id": "13108671770a48f2845002cf1266a8bc"
            }
          },
          "metadata": {}
        },
        {
          "output_type": "display_data",
          "data": {
            "text/plain": [
              "data/train_prefs-00000-of-00001.parquet:   0%|          | 0.00/226M [00:00<?, ?B/s]"
            ],
            "application/vnd.jupyter.widget-view+json": {
              "version_major": 2,
              "version_minor": 0,
              "model_id": "446ac66a0b3e4eb99895539fd737b957"
            }
          },
          "metadata": {}
        },
        {
          "output_type": "display_data",
          "data": {
            "text/plain": [
              "data/test_prefs-00000-of-00001.parquet:   0%|          | 0.00/7.29M [00:00<?, ?B/s]"
            ],
            "application/vnd.jupyter.widget-view+json": {
              "version_major": 2,
              "version_minor": 0,
              "model_id": "0c747ec97cd44d4eaf76f3be3f149d95"
            }
          },
          "metadata": {}
        },
        {
          "output_type": "display_data",
          "data": {
            "text/plain": [
              "data/test_sft-00000-of-00001.parquet:   0%|          | 0.00/3.72M [00:00<?, ?B/s]"
            ],
            "application/vnd.jupyter.widget-view+json": {
              "version_major": 2,
              "version_minor": 0,
              "model_id": "364de2f832d04f469c976dfec1c8bc77"
            }
          },
          "metadata": {}
        },
        {
          "output_type": "display_data",
          "data": {
            "text/plain": [
              "data/train_gen-00000-of-00001.parquet:   0%|          | 0.00/184M [00:00<?, ?B/s]"
            ],
            "application/vnd.jupyter.widget-view+json": {
              "version_major": 2,
              "version_minor": 0,
              "model_id": "31f3d4b5efe84cd3aa93cc5d5136a49f"
            }
          },
          "metadata": {}
        },
        {
          "output_type": "display_data",
          "data": {
            "text/plain": [
              "data/test_gen-00000-of-00001.parquet:   0%|          | 0.00/3.02M [00:00<?, ?B/s]"
            ],
            "application/vnd.jupyter.widget-view+json": {
              "version_major": 2,
              "version_minor": 0,
              "model_id": "5cb21c2b65cd4be19eeae5294067d9fa"
            }
          },
          "metadata": {}
        },
        {
          "output_type": "display_data",
          "data": {
            "text/plain": [
              "Generating train_prefs split:   0%|          | 0/61135 [00:00<?, ? examples/s]"
            ],
            "application/vnd.jupyter.widget-view+json": {
              "version_major": 2,
              "version_minor": 0,
              "model_id": "34b1f7f0eb38490daee4de107a68b004"
            }
          },
          "metadata": {}
        },
        {
          "output_type": "display_data",
          "data": {
            "text/plain": [
              "Generating train_sft split:   0%|          | 0/61135 [00:00<?, ? examples/s]"
            ],
            "application/vnd.jupyter.widget-view+json": {
              "version_major": 2,
              "version_minor": 0,
              "model_id": "c9a0a1e5be184425a61967e0cb214e99"
            }
          },
          "metadata": {}
        },
        {
          "output_type": "display_data",
          "data": {
            "text/plain": [
              "Generating test_prefs split:   0%|          | 0/2000 [00:00<?, ? examples/s]"
            ],
            "application/vnd.jupyter.widget-view+json": {
              "version_major": 2,
              "version_minor": 0,
              "model_id": "14b7bf5301904637a1f6a87c45dbf20c"
            }
          },
          "metadata": {}
        },
        {
          "output_type": "display_data",
          "data": {
            "text/plain": [
              "Generating test_sft split:   0%|          | 0/1000 [00:00<?, ? examples/s]"
            ],
            "application/vnd.jupyter.widget-view+json": {
              "version_major": 2,
              "version_minor": 0,
              "model_id": "fc0b15c645ea44718576826caea5c40d"
            }
          },
          "metadata": {}
        },
        {
          "output_type": "display_data",
          "data": {
            "text/plain": [
              "Generating train_gen split:   0%|          | 0/61135 [00:00<?, ? examples/s]"
            ],
            "application/vnd.jupyter.widget-view+json": {
              "version_major": 2,
              "version_minor": 0,
              "model_id": "4e9ff50a08464fc5b47624598533e946"
            }
          },
          "metadata": {}
        },
        {
          "output_type": "display_data",
          "data": {
            "text/plain": [
              "Generating test_gen split:   0%|          | 0/1000 [00:00<?, ? examples/s]"
            ],
            "application/vnd.jupyter.widget-view+json": {
              "version_major": 2,
              "version_minor": 0,
              "model_id": "f791f63fa3c44453bdc3db763c442146"
            }
          },
          "metadata": {}
        }
      ],
      "source": [
        "from datasets import load_dataset\n",
        "\n",
        "raw_datasets = load_dataset(\"HuggingFaceH4/ultrafeedback_binarized\")"
      ]
    },
    {
      "cell_type": "markdown",
      "source": [
        "The dataset contains various splits, each with a certain number of rows. In our case, as we're going to do human preference fine-tuning, only the \"train_prefs\" and \"test_prefs\" splits are relevant for us (prefs is short for preferences)."
      ],
      "metadata": {
        "id": "7_q8BXediHYd"
      }
    },
    {
      "cell_type": "code",
      "source": [
        "from datasets import DatasetDict\n",
        "\n",
        "# remove this when done debugging\n",
        "indices = range(0,100)\n",
        "\n",
        "dataset_dict = {\"train\": raw_datasets[\"train_prefs\"].select(indices),\n",
        "                \"test\": raw_datasets[\"test_prefs\"].select(indices)}\n",
        "\n",
        "raw_datasets = DatasetDict(dataset_dict)\n",
        "raw_datasets"
      ],
      "metadata": {
        "colab": {
          "base_uri": "https://localhost:8080/"
        },
        "id": "llQjuteOdviV",
        "outputId": "5469e89c-7c83-48c1-9b8a-bcc61c4b653a"
      },
      "execution_count": null,
      "outputs": [
        {
          "output_type": "execute_result",
          "data": {
            "text/plain": [
              "DatasetDict({\n",
              "    train: Dataset({\n",
              "        features: ['prompt', 'prompt_id', 'chosen', 'rejected', 'messages', 'score_chosen', 'score_rejected'],\n",
              "        num_rows: 100\n",
              "    })\n",
              "    test: Dataset({\n",
              "        features: ['prompt', 'prompt_id', 'chosen', 'rejected', 'messages', 'score_chosen', 'score_rejected'],\n",
              "        num_rows: 100\n",
              "    })\n",
              "})"
            ]
          },
          "metadata": {},
          "execution_count": 11
        }
      ]
    },
    {
      "cell_type": "markdown",
      "source": [
        "Let's check one example. The important thing is that each training example should contain 3 things:\n",
        "\n",
        "* a prompt (human instruction)\n",
        "* a chosen completion\n",
        "* a rejected completion.\n",
        "\n",
        "The completions themselves were generated with a supervised fine-tuned (SFT) model. The chosen vs. rejected were annotated by humans."
      ],
      "metadata": {
        "id": "I_8WTvlZiQNv"
      }
    },
    {
      "cell_type": "code",
      "source": [
        "example = raw_datasets[\"train\"][0]\n",
        "print(example.keys())"
      ],
      "metadata": {
        "colab": {
          "base_uri": "https://localhost:8080/"
        },
        "id": "AfeEFCIveFId",
        "outputId": "4c1e65cb-f4f6-48a9-9e00-28a46b1cae1e"
      },
      "execution_count": null,
      "outputs": [
        {
          "output_type": "stream",
          "name": "stdout",
          "text": [
            "dict_keys(['prompt', 'prompt_id', 'chosen', 'rejected', 'messages', 'score_chosen', 'score_rejected'])\n"
          ]
        }
      ]
    },
    {
      "cell_type": "markdown",
      "source": [
        "Let's see what the human instruction was in this case:"
      ],
      "metadata": {
        "id": "HJDIRVf1jQZF"
      }
    },
    {
      "cell_type": "code",
      "source": [
        "example[\"prompt\"]"
      ],
      "metadata": {
        "colab": {
          "base_uri": "https://localhost:8080/",
          "height": 35
        },
        "id": "jYetHAMzjOx-",
        "outputId": "9f2de863-9334-4a23-9b60-bf6022df32ca"
      },
      "execution_count": null,
      "outputs": [
        {
          "output_type": "execute_result",
          "data": {
            "text/plain": [
              "'how can i develop a habit of drawing daily'"
            ],
            "application/vnd.google.colaboratory.intrinsic+json": {
              "type": "string"
            }
          },
          "metadata": {},
          "execution_count": 13
        }
      ]
    },
    {
      "cell_type": "markdown",
      "source": [
        "Let's take a look at the chosen completion:"
      ],
      "metadata": {
        "id": "Iqen4QD_jTA8"
      }
    },
    {
      "cell_type": "code",
      "source": [
        "example[\"chosen\"]"
      ],
      "metadata": {
        "colab": {
          "base_uri": "https://localhost:8080/"
        },
        "id": "48YmpRVljSoD",
        "outputId": "e7f6a9c1-f518-4be0-b84a-afba960fec33"
      },
      "execution_count": null,
      "outputs": [
        {
          "output_type": "execute_result",
          "data": {
            "text/plain": [
              "[{'content': 'how can i develop a habit of drawing daily', 'role': 'user'},\n",
              " {'content': \"Developing a daily habit of drawing can be challenging but with consistent practice and a few tips, it can become an enjoyable and rewarding part of your daily routine. Here are some strategies to help you develop the habit of drawing daily:\\n\\n1. Set a specific time: Allocate a specific time of the day to draw. It could be in the morning, afternoon, or evening. Make drawing a part of your daily routine.\\n2. Set a specific duration: Determine the amount of time you want to spend on drawing each day. It can be as little as 10 minutes or as long as an hour. Be consistent with the duration to help build the habit.\\n3. Start small and simple: Don't try to create a masterpiece every day, start with simple and easy-to-do sketches. Focus on improving your skills gradually.\\n4. Use a variety of tools and mediums: Experiment with different tools like pencils, pens, markers, and different mediums like paper, canvas, or digital apps to keep your drawing practice interesting and engaging.\\n5. Take breaks and rest: Taking breaks and resting after some time of drawing can help you avoid burnout and stay motivated.\\n6. Challenge yourself: Set challenges like drawing objects from memory or a specific subject to improve your skills and keep your drawing practice interesting.\\n7. Track your progress: Keep a record of your daily drawing practice and track your progress. This can be a source of motivation and help you see how far you've come.\\n\\nRemember, developing a habit takes time and patience. Stay consistent with your drawing practice, be flexible and open to trying new things, and with time, you'll develop a habit of daily drawing that brings you joy and satisfaction.\",\n",
              "  'role': 'assistant'}]"
            ]
          },
          "metadata": {},
          "execution_count": 14
        }
      ]
    },
    {
      "cell_type": "markdown",
      "source": [
        "Let's take a look at the rejected one:"
      ],
      "metadata": {
        "id": "8QFd2CbZjZUl"
      }
    },
    {
      "cell_type": "code",
      "source": [
        "example[\"rejected\"]"
      ],
      "metadata": {
        "colab": {
          "base_uri": "https://localhost:8080/"
        },
        "id": "YHmnleRcjW-s",
        "outputId": "19b247ad-b09e-48ea-ecf9-85398de0b730"
      },
      "execution_count": null,
      "outputs": [
        {
          "output_type": "execute_result",
          "data": {
            "text/plain": [
              "[{'content': 'how can i develop a habit of drawing daily', 'role': 'user'},\n",
              " {'content': \"As an AI language model, I cannot personally develop habits for you. But, here are some tips for developing a habit of drawing daily:\\n\\n1. Start small: Start with simple drawings or doodles and gradually increase the complexity of your drawings.\\n\\n2. Set a schedule: Set a specific time every day to draw, and try to stick to it as much as possible.\\n\\n3. Make it fun: Don't pressure yourself to create masterpieces every time you draw. Make it a relaxing and enjoyable experience.\\n\\n4. Use resources: There are many drawing tutorials available online. Use resources like YouTube or online drawing courses to help you improve your skills.\\n\\n5. Surround yourself with inspiration: Expose yourself to a variety of art forms, such as paintings, illustrations, and photographs, to inspire and motivate you.\\n\\nRemember, everyone has their own creative style and pace. Just keep practicing and enjoying the process of drawing.\",\n",
              "  'role': 'assistant'}]"
            ]
          },
          "metadata": {},
          "execution_count": 15
        }
      ]
    },
    {
      "cell_type": "markdown",
      "source": [
        "Looks interesting, right? Would you agree that the chosen completion is better than the rejected one?\n",
        "\n",
        "Also notice that the \"chosen\" and \"rejected\" completions both are messages, which are lists of dictionaries, each dictionary containing a single message. Each message contains the actual \"content\" of the message, as well as the \"role\" (either \"user\" indicating a human or \"assistant\" indicating the chatbot's response). This is similar to the format used during supervised fine-tuning (SFT) training (see my [notebook](https://github.com/NielsRogge/Transformers-Tutorials/blob/master/Mistral/Supervised_fine_tuning_(SFT)_of_an_LLM_using_Hugging_Face_tooling.ipynb) for that)."
      ],
      "metadata": {
        "id": "UZ2MJS-0jfio"
      }
    },
    {
      "cell_type": "markdown",
      "source": [
        "## Load tokenizer\n",
        "\n",
        "Next, we instantiate the tokenizer, which is required to prepare the texts for the model. The model doesn't directly take strings as input, but rather `input_ids`, which represent integer indices in the vocabulary of a Transformer model. Refer to my [YouTube video](https://www.youtube.com/watch?v=IGu7ivuy1Ag&ab_channel=NielsRogge) if you want to know more about it.\n",
        "\n",
        "We also set some attributes which the tokenizer of a base model typically doesn't have set, such as:\n",
        "\n",
        "- the padding token ID. During pre-training, one doesn't need to pad since one just creates blocks of text to predict the next token, but during fine-tuning, we will need to pad the (instruction, completion) pairs in order to create batches of equal length. Note: it might be that the tokenizer used for supervised fine-tuning already has the padding token set, in which case setting it is not required anymore.\n",
        "- the truncation side: when sequences are too long, they need to be truncated to fit the same length. Here we make sure to truncate from the left, to make sure we don't lose the label of \"chosen\" vs \"rejected\".\n",
        "- the model max length: this is required in order to pad/truncate sequences which are too long for the model. Here we decide to train on at most 2048 tokens.\n",
        "- the chat template. A [chat template](https://huggingface.co/blog/chat-templates) determines how each list of messages is turned into a tokenizable string, by adding special strings in between such as `<|user|>` to indicate a user message and `<|assistant|>` to indicate the chatbot's response. Here we define the default chat template, used by most chat models. See also the [docs](https://huggingface.co/docs/transformers/main/en/chat_templating)."
      ],
      "metadata": {
        "id": "7FMgF2D10sks"
      }
    },
    {
      "cell_type": "code",
      "source": [
        "from transformers import AutoTokenizer\n",
        "\n",
        "model_id = \"alignment-handbook/zephyr-7b-sft-qlora\"\n",
        "\n",
        "tokenizer = AutoTokenizer.from_pretrained(model_id)\n",
        "\n",
        "if tokenizer.pad_token_id is None:\n",
        "    tokenizer.pad_token_id = tokenizer.eos_token_id\n",
        "\n",
        "# Truncate from left to ensure we don't lose labels in final turn\n",
        "tokenizer.truncation_side = \"left\"\n",
        "\n",
        "# Set reasonable default for models without max length\n",
        "if tokenizer.model_max_length > 100_000:\n",
        "    tokenizer.model_max_length = 2048\n",
        "\n",
        "DEFAULT_CHAT_TEMPLATE = \"{% for message in messages %}\\n{% if message['role'] == 'user' %}\\n{{ '<|user|>\\n' + message['content'] + eos_token }}\\n{% elif message['role'] == 'system' %}\\n{{ '<|system|>\\n' + message['content'] + eos_token }}\\n{% elif message['role'] == 'assistant' %}\\n{{ '<|assistant|>\\n'  + message['content'] + eos_token }}\\n{% endif %}\\n{% if loop.last and add_generation_prompt %}\\n{{ '<|assistant|>' }}\\n{% endif %}\\n{% endfor %}\"\n",
        "tokenizer.chat_template = DEFAULT_CHAT_TEMPLATE"
      ],
      "metadata": {
        "id": "_pX9DDwv0g4r",
        "colab": {
          "base_uri": "https://localhost:8080/",
          "height": 113,
          "referenced_widgets": [
            "e78ce1ace7494a2baaf729f2982726e2",
            "e7e42b5be0d04ea88f9b8a1e26b9e6ce",
            "e2d17808fde449eb86b466f06d4fc86c",
            "cbc89cfab188436ca192a6952784b78e",
            "8b9ab81249ec41ac9454fd5cddc8c09e",
            "98baa29c80384c2a895c214159e50290",
            "62ce0e28999e467fa9c02e9a6ba5dd42",
            "674b4033daf14e8985462695948c2265",
            "c8fbdaff67ee40dea12374981ebf3ad8",
            "9b241624f46a4fce8f505a7c0bd124dd",
            "a6c05af812ec490eaceb205ef5a2a9cd",
            "2ae9fc32352f4fc99176019a256ae96d",
            "465fcedb38b64256971ed627b9b045c3",
            "4957311e71f049aa8190daf073f18324",
            "93bf22d6030741608f8b531ce179609f",
            "a19e1fd23a874b26ab553dfe00e55fe5",
            "f6aad3021e164ac79ebc84bdee0a0cad",
            "56cd9cb2d5c04639ba31f09c077c92cd",
            "9ae97cfaa8694772ab1f2b38108c57a1",
            "65031a9851464765a6d8954392ba7304",
            "f999272c98d34f33b04ccc220ec5312b",
            "08a08eb3c7be47a69a23620faae84d0a",
            "85565882d4754de09badbd01dd454093",
            "5bc9f968ef224473aae35e9a9224b442",
            "67b84eca57db424a9761da7b66f9bf76",
            "2ea5486d39bf4bafaf668735c193c767",
            "aed6cb9ea6d64c7bb5bce8167eb8ded1",
            "1b49266b06f340ba90bcd7084e34ef08",
            "a2d2c920a97247e6b8ec4be8de619aa4",
            "bde03c4de0954cd598738cba06234084",
            "bc8b9617825c4e22bec32b3ebeef8c5a",
            "9a9e6fea9fb44d62af3f8652358c61ca",
            "84d7e67c10814842bec76905e6b8826a"
          ]
        },
        "outputId": "65e5814d-152b-4817-a1e2-d182a110607e"
      },
      "execution_count": null,
      "outputs": [
        {
          "output_type": "display_data",
          "data": {
            "text/plain": [
              "tokenizer_config.json: 0.00B [00:00, ?B/s]"
            ],
            "application/vnd.jupyter.widget-view+json": {
              "version_major": 2,
              "version_minor": 0,
              "model_id": "e78ce1ace7494a2baaf729f2982726e2"
            }
          },
          "metadata": {}
        },
        {
          "output_type": "display_data",
          "data": {
            "text/plain": [
              "tokenizer.json: 0.00B [00:00, ?B/s]"
            ],
            "application/vnd.jupyter.widget-view+json": {
              "version_major": 2,
              "version_minor": 0,
              "model_id": "2ae9fc32352f4fc99176019a256ae96d"
            }
          },
          "metadata": {}
        },
        {
          "output_type": "display_data",
          "data": {
            "text/plain": [
              "special_tokens_map.json:   0%|          | 0.00/437 [00:00<?, ?B/s]"
            ],
            "application/vnd.jupyter.widget-view+json": {
              "version_major": 2,
              "version_minor": 0,
              "model_id": "85565882d4754de09badbd01dd454093"
            }
          },
          "metadata": {}
        }
      ]
    },
    {
      "cell_type": "markdown",
      "source": [
        "## Apply chat template\n",
        "\n",
        "Once we have equipped the tokenizer with the appropriate attributes, it's time to apply the chat template to the prompt messages, chosen and rejected messages.\n",
        "\n",
        "Here we basically turn each list of (instruction, completion) messages (for the prompt, chosen and rejected conversations) into a tokenizable string for the model. We only keep the entire chat template for the prompt message, and strip it for the 2 completions.\n",
        "\n",
        "Note that we specify `tokenize=False` here, since the `DPOTrainer` which we'll define later on will perform the tokenization internally. Here we only turn the list of messages into strings with the same format."
      ],
      "metadata": {
        "id": "F_QKIo3O1Anp"
      }
    },
    {
      "cell_type": "code",
      "source": [
        "import re\n",
        "\n",
        "\n",
        "def apply_chat_template(example, tokenizer, assistant_prefix=\"<|assistant|>\\n\"):\n",
        "    def _strip_prefix(s, pattern):\n",
        "        # Use re.escape to escape any special characters in the pattern\n",
        "        return re.sub(f\"^{re.escape(pattern)}\", \"\", s)\n",
        "\n",
        "    if all(k in example.keys() for k in (\"chosen\", \"rejected\")):\n",
        "            # Compared to reward modeling, we filter out the prompt, so the text is everything after the last assistant token\n",
        "            prompt_messages = [[msg for msg in example[\"chosen\"] if msg[\"role\"] == \"user\"][0]]\n",
        "            # Insert system message\n",
        "            if example[\"chosen\"][0][\"role\"] != \"system\":\n",
        "                prompt_messages.insert(0, {\"role\": \"system\", \"content\": \"\"})\n",
        "            else:\n",
        "                prompt_messages.insert(0, example[\"chosen\"][0])\n",
        "            # TODO: handle case where chosen/rejected also have system messages\n",
        "            chosen_messages = example[\"chosen\"][1:]\n",
        "            rejected_messages = example[\"rejected\"][1:]\n",
        "            example[\"text_chosen\"] = tokenizer.apply_chat_template(chosen_messages, tokenize=False)\n",
        "            example[\"text_rejected\"] = tokenizer.apply_chat_template(rejected_messages, tokenize=False)\n",
        "            example[\"text_prompt\"] = tokenizer.apply_chat_template(\n",
        "                prompt_messages, tokenize=False, add_generation_prompt=True\n",
        "            )\n",
        "            example[\"text_chosen\"] = _strip_prefix(example[\"text_chosen\"], assistant_prefix)\n",
        "            example[\"text_rejected\"] = _strip_prefix(example[\"text_rejected\"], assistant_prefix)\n",
        "    else:\n",
        "        raise ValueError(\n",
        "            f\"Could not format example as dialogue for `dpo` task! Require `[chosen, rejected]` keys but found {list(example.keys())}\"\n",
        "        )\n",
        "\n",
        "    return example"
      ],
      "metadata": {
        "id": "Cf_JNEJT1BWX"
      },
      "execution_count": null,
      "outputs": []
    },
    {
      "cell_type": "markdown",
      "source": [
        "Once we have defined a function above, we leverage the [`map()`](https://huggingface.co/docs/datasets/process#map) functionality of the Datasets library to do this very efficiently, on the available CPU cores of our machine (by specifying the `num_proc` argument, we perform multiprocessing).\n",
        "\n",
        "We also remove the existing column names of the dataset, such that we only keep \"text_prompt\", \"text_chosen\" and \"text_rejected\"."
      ],
      "metadata": {
        "id": "GWB9_0NRlUDO"
      }
    },
    {
      "cell_type": "code",
      "source": [
        "from multiprocessing import cpu_count\n",
        "\n",
        "column_names = list(raw_datasets[\"train\"].features)\n",
        "\n",
        "raw_datasets = raw_datasets.map(\n",
        "        apply_chat_template,\n",
        "        fn_kwargs={\"tokenizer\": tokenizer},\n",
        "        num_proc=cpu_count(),\n",
        "        remove_columns=column_names,\n",
        "        desc=\"Formatting comparisons with prompt template\",\n",
        ")"
      ],
      "metadata": {
        "id": "4zmPU3kc1Psf",
        "colab": {
          "base_uri": "https://localhost:8080/",
          "height": 137,
          "referenced_widgets": [
            "e57465d0f7bc4d08b8be98decf0daf67",
            "651b79f339f74e1ea3b03b36b0dc8488",
            "b0c2d18dd8b04c2c8ee03f4b051ffc90",
            "d7083d18132e4abb992f6000b267f708",
            "ecacd05d0ae04c7d89c66a26239cc389",
            "df2160c1736849bf875acbf0cdc3e88d",
            "1ddd044b4d144c878d31c8ce0ddf6f64",
            "bc78608bdff94604a6d6193f31985a10",
            "5bd9c1b6a616477d9af2e22dc382267e",
            "f01a01edbbe54c7b92ec63d68f35ab0c",
            "f423758b571e41969885535afd074743",
            "9680c7cfcfa648dc84f14add68a3c3d0",
            "bb66702a4c0f4601bea7366f7e437123",
            "a0828e101b89428da028479d8953130e",
            "991ae88290384983a11d0328b710550a",
            "46502f994a114f908462ced79e7948bb",
            "a788cb5f3a1049cea0cf3df7e399a3a5",
            "e49e25dc58514c8d919d46de07ac9d2f",
            "4bddc35850814c19a0dd82248aaf409f",
            "410445cd3527467d918e7dda4f1dcd75",
            "d45d2cf8129b42bcb0d0598fcd6dd48b",
            "8d6a411fe2b7487280c69a34d280b63c"
          ]
        },
        "outputId": "8df6555e-3a2a-488f-db28-b89bbe317d8a"
      },
      "execution_count": null,
      "outputs": [
        {
          "output_type": "display_data",
          "data": {
            "text/plain": [
              "Formatting comparisons with prompt template (num_proc=12):   0%|          | 0/100 [00:00<?, ? examples/s]"
            ],
            "application/vnd.jupyter.widget-view+json": {
              "version_major": 2,
              "version_minor": 0,
              "model_id": "e57465d0f7bc4d08b8be98decf0daf67"
            }
          },
          "metadata": {}
        },
        {
          "output_type": "display_data",
          "data": {
            "text/plain": [
              "Formatting comparisons with prompt template (num_proc=12):   0%|          | 0/100 [00:00<?, ? examples/s]"
            ],
            "application/vnd.jupyter.widget-view+json": {
              "version_major": 2,
              "version_minor": 0,
              "model_id": "9680c7cfcfa648dc84f14add68a3c3d0"
            }
          },
          "metadata": {}
        }
      ]
    },
    {
      "cell_type": "markdown",
      "source": [
        "Next we rename the columns to what the [DPOTrainer](https://huggingface.co/docs/trl/main/en/dpo_trainer) class of the TRL library expects."
      ],
      "metadata": {
        "id": "VfGZU5KEmLBI"
      }
    },
    {
      "cell_type": "code",
      "source": [
        "# Replace column names with what TRL needs, text_chosen -> chosen and text_rejected -> rejected\n",
        "for split in [\"train\", \"test\"]:\n",
        "    raw_datasets[split] = raw_datasets[split].rename_columns(\n",
        "        {\"text_prompt\": \"prompt\", \"text_chosen\": \"chosen\", \"text_rejected\": \"rejected\"}\n",
        "    )"
      ],
      "metadata": {
        "id": "6YYWYDEt104T"
      },
      "execution_count": null,
      "outputs": []
    },
    {
      "cell_type": "code",
      "source": [
        "raw_datasets"
      ],
      "metadata": {
        "colab": {
          "base_uri": "https://localhost:8080/"
        },
        "id": "FakCtB1Peulr",
        "outputId": "8e3f57d9-725c-4f68-86cf-7b0cff249f9f"
      },
      "execution_count": null,
      "outputs": [
        {
          "output_type": "execute_result",
          "data": {
            "text/plain": [
              "DatasetDict({\n",
              "    train: Dataset({\n",
              "        features: ['chosen', 'rejected', 'prompt'],\n",
              "        num_rows: 100\n",
              "    })\n",
              "    test: Dataset({\n",
              "        features: ['chosen', 'rejected', 'prompt'],\n",
              "        num_rows: 100\n",
              "    })\n",
              "})"
            ]
          },
          "metadata": {},
          "execution_count": 45
        }
      ]
    },
    {
      "cell_type": "markdown",
      "source": [
        "Let's print out 3 random samples:"
      ],
      "metadata": {
        "id": "ftCnX1S7mVSd"
      }
    },
    {
      "cell_type": "code",
      "source": [
        "import random\n",
        "\n",
        "# Print a few random samples from the training set:\n",
        "for index in random.sample(range(len(raw_datasets[\"train\"])), 3):\n",
        "    print(f\"Prompt sample {index} of the raw training set:\\n\\n{raw_datasets['train'][index]['prompt']}\")\n",
        "    print(f\"Chosen sample {index} of the raw training set:\\n\\n{raw_datasets['train'][index]['chosen']}\")\n",
        "    print(f\"Rejected sample {index} of the raw training set:\\n\\n{raw_datasets['train'][index]['rejected']}\")"
      ],
      "metadata": {
        "colab": {
          "base_uri": "https://localhost:8080/"
        },
        "id": "8yRPyvDve8Ou",
        "outputId": "3070d0f2-5e01-4777-ef33-ee56d0bd29fe"
      },
      "execution_count": null,
      "outputs": [
        {
          "output_type": "stream",
          "name": "stdout",
          "text": [
            "Prompt sample 12 of the raw training set:\n",
            "\n",
            "<|system|>\n",
            "</s>\n",
            "<|user|>\n",
            "Explain how the coastline of the UK is an emergent characteristic, and that any property it may have is the result of underlying properties of it underlying components</s>\n",
            "<|assistant|>\n",
            "\n",
            "Chosen sample 12 of the raw training set:\n",
            "\n",
            "Of course! I'm here to help you with any questions you may have.\n",
            "Regarding your question about the coastline of the UK, it is important to understand that the coastline is an emergent characteristic, meaning that it is a property that arises from the interactions and relationships between the components that make up the system. In this case, the components are the individual rocks, sand, and other materials that make up the coastline.\n",
            "The coastline of the UK, like any other coastline, is not a fixed or inherent property of the land itself, but rather a dynamic and constantly changing feature that emerges from the interactions between the land and the sea. The movements of the tides, waves, and weather patterns all contribute to the shape and form of the coastline, and they are the primary drivers of its evolution.\n",
            "Therefore, any properties or characteristics of the coastline, such as its shape, slope, or gradient, are the result of the underlying properties and behaviors of the components that make it up. For example, the rocks that make up the coastline have their own properties, such as strength and hardness, that influence their interaction with the sea, and these interactions play a significant role in shaping the coastline over time.\n",
            "In summary, the coastline of the UK, or any other coastline for that matter, is not a fixed or inherent property of the land, but rather an emergent characteristic that arises from the interactions between the land and the sea. It is constantly changing and evolving due to the dynamic nature of these interactions, and any properties or characteristics it may have are the result of the underlying properties and behaviors of the component parts.</s>\n",
            "\n",
            "Rejected sample 12 of the raw training set:\n",
            "\n",
            "The coastline of the United Kingdom is an emergent characteristic, meaning that it is a property that arises from the accumulated actions of its individual components, rather than a predetermined feature created at the onset of the formation of the UK.\n",
            "\n",
            "The coastline's emergent characteristic stems from its complex geological history and the interactions between the UK's landmass and the physical forces that shape it. The landmass that makes up the UK has evolved through geological processes such as tectonic activity, erosion, and deposition, as well as the influence of the sea, wind, and ice.\n",
            "\n",
            "Ice Age glaciers, for instance, pushed downlands, creatingBroughton-type headlands along the coastlines. The prevailing winds and waves have shaped the coastline, creating notches and bays, forming limestone cliffs with coves and stacks. In contrast, the underlying geological makeup of the land influences the type of rock formations that comprise the coastline, such as chalk, sandstone, or clay.\n",
            "\n",
            "The emergence of the UK coastline, constantly changing through geological and physical processes, is not a predetermined single feature but instead, it's an outgrowth of the myriad factors that interact with it. The roughness or smoothness of the coastline, the presence or absence of cliffs, beaches, and headlands, the seaward extent of coastal wetlands, and many other characteristics of the coastline can be understood as emergent properties of the complex interplay of the underlying geological, biological, and physical processes at work.\n",
            "\n",
            "Therefore, the coastline of the UK is an emergent characteristic because it is the result of a vast array of variations and patterns that are not determined beforehand but arise from the interactions of various factors, such as the landforms created by erosion and weather, the geological structure of the land, and the influence of human activities like land use and construction along the coastline.</s>\n",
            "\n",
            "Prompt sample 67 of the raw training set:\n",
            "\n",
            "<|system|>\n",
            "</s>\n",
            "<|user|>\n",
            "The COFO Design 2018/19 Debut Collection launched officially on October. 25, 2018. We rented a space, threw a party, and opened a Pop-Up Shop for a few days in the best city on the planet.\n",
            "We met some amazing people, one of them is SingYu – a student currently studying at OCADU.\n",
            "This past Tuesday I dropped by COFO’s design pop-up. I had missed their opening night on the Thursday prior, but luckily I caught it on their second last day.\n",
            "The pop-up was on the second floor of this black building, just north of the Shinola.\n",
            "COFO is a new company that partners with emerging Canadian designers to fabricate their products. They release annual design challenges open to students in their last year of school or recent grads (from the last 5 years) and then choose a few designs to manufacture. The pop-up showcases their 2018/19 collection, with 6 new products (all seen below).\n",
            "Three of those new products were designed by my friend and recent Humber grad Ian Buckley, along with Kenny Nguyen. They codesigned the lounge chair + ottoman set, as well as the side table near the bottom of the photo.\n",
            "The two founders of COFO, Randy and Desmond, were there that day and accommodating enough to answer all my incessant questions. Randy has a background in manufacturing, while Desmond is experienced in both branding and apparel. Because I learned so much from them, be warned, since I’ll be nerding out over all kinds of details for the rest of this post.\n",
            "This coat rack was designed by Mary Anne Garcia, an OCAD environmental design graduate (not industrial design, surprisingly). COFO was also handing out the tote bags, so that one in the picture is actually mine now.\n",
            "The mirror and hangers are held in place by clear rubber O-rings.\n",
            "The main post slots into place with these matching teeth, so the mirror can be rotated to face any direction fits best in your home.\n",
            "COFO goes to great lengths with their manufacturing details. The wooden portions of the 49 North collection are actually hollow – a metal rod runs through them and screws into the rest of the steel frame so the wood isn’t actually load bearing.\n",
            "They had a few of the wooden rods unscrewed on display by the 49 North table.\n",
            "The back of the lounge chair slots in along these two rails to fit snugly at the bottom. The final versions of these chairs will have them sealed with silicone along the bottom of the frame where it meets the cushion.\n",
            "Not even the bathroom was spared from a few decals to bring the whole space in line with COFO’s excellent branding.\n",
            "This cinch stool is designed by Lucas Stanois, an OCAD industrial design grad. It features a felt back that wraps around and underneath the sides of the chair.\n",
            "The screws through the metal plate at the bottom holds the felt back in place.\n",
            "I was expecting the back to feel like a hammock, with a bit of slack and looseness when you leaned back into it, but to my surprise, once you lean back far enough you actually feel like you’ve hit a hard, rigid wall. The cause for this is just the small opening at the back colliding with the seat of the chair, but it provides a very substantial effect.\n",
            "The Soma shelf is designed to be taken apart so that it can be shipped with a smaller footprint.\n",
            "They have a laser tube cutter that they’ve used to make the holes above for the coat rack, as well as the complex curve where the blue rod meets the seat back on the Roque chair.\n",
            "These branded rubber inserts are actually inspired by Desmond’s background in apparel – if you’ve ever felt the middle rubber piece in a zipper’s pull tab, these fit in exactly the same way.\n",
            "I really like how COFO has applied their brand to their products – it’s small and subtle where it’s always visible, but the undersides have distinctive laser cut metal base plates that are much louder, where they can’t usually be seen.\n",
            "It strikes a good balance.\n",
            "If you’ve made it all the way down here, congratulations. I would recommend everyone to go check it out, but unfortunately the pop-up is long over by the time I get this post up. Hopefully seeing some of the photos here gives you a good impression of what it was like though. Who are some of the designers behind COFO's 2018/19 debut collection, and what are some of the standout products from this collection?</s>\n",
            "<|assistant|>\n",
            "\n",
            "Chosen sample 67 of the raw training set:\n",
            "\n",
            "The designers behind COFO's 2018/19 debut collection include:\n",
            "\n",
            "* Ian Buckley, a recent graduate from Humber College, who co-designed the lounge chair and ottoman set, as well as the side table.\n",
            "* Kenny Nguyen, who co-designed the lounge chair and ottoman set, as well as the side table.\n",
            "* Mary Anne Garcia, an OCAD environmental design graduate, who designed the coat rack.\n",
            "* Lucas Stanois, an OCAD industrial design graduate, who designed the cinch stool.\n",
            "\n",
            "Some of the standout products from this collection include:\n",
            "\n",
            "* The lounge chair and ottoman set, which features a unique design that allows the back of the chair to slot in along two rails, creating a snug fit at the bottom.\n",
            "* The coat rack, which is held in place by clear rubber O-rings and features a rotating mirror that can be adjusted to face any direction.\n",
            "* The cinch stool, which has a felt back that wraps around and underneath the sides of the chair, creating a surprisingly solid feel when leaning back.\n",
            "* The 49 North collection, which features hollow wooden portions and metal rods that run through them, screwing into the rest of the steel frame.\n",
            "* The Soma shelf, which is designed to be taken apart for easy shipping and features branded rubber inserts inspired by Desmond's background in apparel.\n",
            "\n",
            "Overall, COFO's 2018/19 debut collection showcases a range of innovative and stylish products that highlight the company's commitment to quality manufacturing and design.</s>\n",
            "\n",
            "Rejected sample 67 of the raw training set:\n",
            "\n",
            "From the information provided, here are the designers and some of the standout products from COFO's 2018/19 debut collection:\n",
            "\n",
            "1. Mary Anne Garcia - 49 North Coat Rack: A unique coat rack featuring a mirror and hangers, with wooden portions that are hollow and metal rods running through them for added stability.\n",
            "2. Ian Buckley and Kenny Nguyen - Lounge Chair + Ottoman Set and Side Table: A comfortable lounge chair and ottoman set, along with a stylish side table, designed with a focus on simplicity and harmony.\n",
            "3. Lucas Stanois - Cinch Stool: A compact stool featuring a felt back that wraps around and underneath the sides of the chair, providing a surprisingly firm and rigid support when leaning back.\n",
            "4. SingYu - Soma Shelf: A modular shelf designed to be taken apart for shipping, with notches that fit together securely and can be assembled in various configurations.\n",
            "\n",
            "These are just a few examples of the innovative and well-designed products from COFO's 2018/19 debut collection. The collection as a whole showcases the talent of emerging Canadian designers and the potential of collaboration between students and industry professionals.</s>\n",
            "\n",
            "Prompt sample 39 of the raw training set:\n",
            "\n",
            "<|system|>\n",
            "</s>\n",
            "<|user|>\n",
            "Teacher: In this task, you are given a sentence in the Spanish language. Your task is to convert it to Lithuanian language.\n",
            "Teacher: Now, understand the problem? If you are still confused, see the following example:\n",
            "La dosis de mantenimiento es de 15 mg una vez al día, pero algunos pacientes pueden requerir dosis mayores.\n",
            "Solution: Palaikomoji dozė – 15 mg kartą per parą, nors kai kuriems pacientams gali būti veiksmingesnė didesnė dozė.\n",
            "Reason: Spanish sentence is properly converted into Lithuanian sentence\n",
            "\n",
            "Now, solve this instance: Reacciones adversas poco frecuentes ≥ 0,1 a < 1% de los pacientes\n",
            "Student:</s>\n",
            "<|assistant|>\n",
            "\n",
            "Chosen sample 39 of the raw training set:\n",
            "\n",
            "Nedažnos nepageidaujamos reakcijos, pasitaikancios ≥ 0,1 iki < 1% pacientų</s>\n",
            "\n",
            "Rejected sample 39 of the raw training set:\n",
            "\n",
            "Nedažnos nepageidaujamos reakcijos ≥ 0,1 iki < 1% pacientų</s>\n",
            "\n"
          ]
        }
      ]
    },
    {
      "cell_type": "markdown",
      "source": [
        "## Load SFT model\n",
        "\n",
        "Here we load the supervised fine-tuned (SFT) model (trained during [step 2](https://github.com/NielsRogge/Transformers-Tutorials/blob/master/Mistral/Supervised_fine_tuning_(SFT)_of_an_LLM_using_Hugging_Face_tooling.ipynb)). As we used QLoRa during SFT, the [model repository](https://huggingface.co/alignment-handbook/zephyr-7b-sft-qlora) only contains the adapter weights. Hence we first load the base model in 4-bit using the [BitsAndBytes quantization method](https://huggingface.co/docs/transformers/en/main_classes/quantization#transformers.BitsAndBytesConfig), and then load the SFT adapter on top.\n"
      ],
      "metadata": {
        "id": "jY8Og84AhyDn"
      }
    },
    {
      "cell_type": "code",
      "source": [
        "from peft import PeftConfig\n",
        "\n",
        "peft_config = PeftConfig.from_pretrained(model_id)\n",
        "print(\"Adapter weights model repo:\", model_id)\n",
        "print(\"Base model weights model repo:\", peft_config.base_model_name_or_path)"
      ],
      "metadata": {
        "colab": {
          "base_uri": "https://localhost:8080/"
        },
        "id": "-UMGylKDAHUv",
        "outputId": "a319d76e-2f86-4f84-f056-57cdcc7cd848"
      },
      "execution_count": null,
      "outputs": [
        {
          "output_type": "stream",
          "name": "stdout",
          "text": [
            "Adapter weights model repo: alignment-handbook/zephyr-7b-sft-qlora\n",
            "Base model weights model repo: mistralai/Mistral-7B-v0.1\n"
          ]
        }
      ]
    },
    {
      "cell_type": "code",
      "source": [
        "# Add the huggingface login here\n",
        "\n",
        "!huggingface-cli login\n"
      ],
      "metadata": {
        "colab": {
          "base_uri": "https://localhost:8080/"
        },
        "id": "eho3x4YS0stt",
        "outputId": "2e970e42-ea66-4380-d457-b61bfeb664ac"
      },
      "execution_count": null,
      "outputs": [
        {
          "output_type": "stream",
          "name": "stdout",
          "text": [
            "\u001b[33m⚠️  Warning: 'huggingface-cli login' is deprecated. Use 'hf auth login' instead.\u001b[0m\n",
            "\n",
            "    _|    _|  _|    _|    _|_|_|    _|_|_|  _|_|_|  _|      _|    _|_|_|      _|_|_|_|    _|_|      _|_|_|  _|_|_|_|\n",
            "    _|    _|  _|    _|  _|        _|          _|    _|_|    _|  _|            _|        _|    _|  _|        _|\n",
            "    _|_|_|_|  _|    _|  _|  _|_|  _|  _|_|    _|    _|  _|  _|  _|  _|_|      _|_|_|    _|_|_|_|  _|        _|_|_|\n",
            "    _|    _|  _|    _|  _|    _|  _|    _|    _|    _|    _|_|  _|    _|      _|        _|    _|  _|        _|\n",
            "    _|    _|    _|_|      _|_|_|    _|_|_|  _|_|_|  _|      _|    _|_|_|      _|        _|    _|    _|_|_|  _|_|_|_|\n",
            "\n",
            "    To log in, `huggingface_hub` requires a token generated from https://huggingface.co/settings/tokens .\n",
            "Enter your token (input will not be visible): \n",
            "Add token as git credential? (Y/n) Y\n",
            "Token is valid (permission: write).\n",
            "The token `homo-lat` has been saved to /root/.cache/huggingface/stored_tokens\n",
            "\u001b[1m\u001b[31mCannot authenticate through git-credential as no helper is defined on your machine.\n",
            "You might have to re-authenticate when pushing to the Hugging Face Hub.\n",
            "Run the following command in your terminal in case you want to set the 'store' credential helper as default.\n",
            "\n",
            "git config --global credential.helper store\n",
            "\n",
            "Read https://git-scm.com/book/en/v2/Git-Tools-Credential-Storage for more details.\u001b[0m\n",
            "Token has not been saved to git credential helper.\n",
            "Your token has been saved to /root/.cache/huggingface/token\n",
            "Login successful.\n",
            "The current active token is: `homo-lat`\n"
          ]
        }
      ]
    },
    {
      "cell_type": "code",
      "source": [
        "import torch\n",
        "from peft import PeftModel\n",
        "from transformers import BitsAndBytesConfig, AutoModelForCausalLM\n",
        "\n",
        "# specify how to quantize the model\n",
        "quantization_config = BitsAndBytesConfig(\n",
        "            load_in_4bit=True,\n",
        "            bnb_4bit_quant_type=\"nf4\",\n",
        "            bnb_4bit_compute_dtype=torch.bfloat16,\n",
        ")\n",
        "device_map = {\"\": torch.cuda.current_device()} if torch.cuda.is_available() else None\n",
        "\n",
        "# Step 1: load the base model (Mistral-7B in our case) in 4-bit\n",
        "model_kwargs = dict(\n",
        "    # attn_implementation=\"flash_attention_2\", # set this to True if your GPU supports it (Flash Attention drastically speeds up model computations)\n",
        "    torch_dtype=\"auto\",\n",
        "    use_cache=False,  # set to False as we're going to use gradient checkpointing\n",
        "    device_map=device_map,\n",
        "    quantization_config=quantization_config,\n",
        ")\n",
        "base_model = AutoModelForCausalLM.from_pretrained(peft_config.base_model_name_or_path, **model_kwargs)\n",
        "\n",
        "# Step 2: load base model + SFT adapter weights\n",
        "# notice that only the adapter weights are trainable!\n",
        "model = PeftModel.from_pretrained(base_model, model_id)"
      ],
      "metadata": {
        "colab": {
          "base_uri": "https://localhost:8080/",
          "height": 346,
          "referenced_widgets": [
            "eccb3bf6c9974493b70fcff8d0c8f925",
            "c925eaa53a8544adb0aff919aeb45969",
            "da41442d79d24b85b99a10d1881a945f",
            "301e17a587b949e284d35f6674cb92d6",
            "8ee3575d581b45bf864fe1908c098a56",
            "ea3bb6ec77744bee8969a65582257655",
            "b47d79cefac44a02981d6eb191cde761",
            "a0696ffff00f4b3c876730461b187381",
            "0a608857d4034f03ad90329d343fa7b3",
            "a52faff113374578baadd048f33eaa5a",
            "4a792cec3941439cbb2e31c4975b8092",
            "16e40bf96b074fd29a93229859287c4a",
            "69e179f0cb314677b3bcb98af046a9b1",
            "c7ac3038b5ce4addaa8b4a15a3f8df90",
            "c206ad31442f405a8c6b8dc4128d9c90",
            "db1c25da1b7e4251ac02651a059d2174",
            "14f3969e8f584817b79823327273e640",
            "95fe0fc5fbbc4fbe9ae2d655bdfda46c",
            "4c863b0ca3724e1b918017d06809c53a",
            "0936e4520df440cd96c9a20cc363cbcb",
            "4d021c53db7748e2891c7effeec14f68",
            "069d0a540a854e9f9ad39bdefd74f770",
            "1f1f355636cd40bb95016434b949402d",
            "12f9c206765d49ed932b7d0d7d598834",
            "b2be0fa3dfcb489687b20040f76334af",
            "6f346cdf04d042d880f1969cc7b5be6c",
            "5d83c775485e47bca25470f94e2d7634",
            "914de747dd4c49fda62ec068764eb964",
            "94fdd7d8cdcd4818a6532d36e8df5cf0",
            "c15a22c358294f2bbad03c53a2ef96fc",
            "a023a385f12d44e39295500a79c21f24",
            "5c69002d146f4416bcd3330f331d265c",
            "9438610784934da7a95aa1fe94b88152",
            "6c23c836882d4fdfbea9c69576931666",
            "02b89d7331f6497b93c98c76514effa8",
            "5ffcaf25bd1c45e7a5026df9cc8e0c63",
            "f8577fb2f907416483c05534376ecf5b",
            "5c194a4b12934354b66f8163e7d3ff43",
            "7f497a6c37204a44bc5eed0592105664",
            "4ae075211608434ab16ddba8fef637c9",
            "c15bdf050d5345f7b2133e0cad195c5f",
            "7ed24313a43841b89bb45f3d3341c03b",
            "4d40e1fd82e941818f742bc5d470a294",
            "8b5f8d5e2c1a4e0d90d63c0ee9c305b5",
            "d2be33879296453f92be0f4f917fa511",
            "5ae8a0c508534c24bd6bce11726f95e3",
            "13d2e1da1e6842fb92c21bf45b3fa1fd",
            "e0f8666a2dbe4fcfa22781ec5250d1fd",
            "963a8494d12a43c6a0026ec6977670ce",
            "8553ba7299b34233b266567822f7c4ca",
            "1117502bb89a4383919952c4fba6e765",
            "90c77723d0b146e18749c7355fa3f23a",
            "b32b5ba8336f4ea1a1e71d1bb10939ed",
            "d2e4dffc607540778f3ba5a4edcafefa",
            "821c7e11301f48038b950b423c860e42",
            "abe93efc757e4508b3db9dae1f3f7069",
            "38a66837638e462080c5b77083aa2a00",
            "8c5a185ef40a41428274971a4f8b5451",
            "0ad84f4c4db14439a74082368430c1f5",
            "8775ade9c29841edae4fee55376afe2a",
            "bc2a1b3065b641069517fca829fa7f04",
            "55d52469ea554f31b523e71231d461ab",
            "50a717a808b54e4d99ce44ba0b695820",
            "f8488723a30c44a18a4dda1d93575892",
            "97218a30eafa4edda5515d3f2c95de24",
            "0603e6586c2746b79b300c0b3e5fec26",
            "a42cc4d2eb7d42ecaa58f71edcc4751f",
            "c2639dc1c10b4da09e0f58e6f8791ce6",
            "6dcef112168c44b8984ea6678d2b845a",
            "48aae5a0c63849e1b6929432f446323b",
            "17a175314ba44c9fb07467ebd4ff02e8",
            "85d8edc347194b3d933d3a3c9292cd2a",
            "694aba9592b540ccb3f374140b9a5f9c",
            "4ebeaaa1c3794258907f4a62a3fa5f4a",
            "ba5da7be468b49fb80b71e93a633c4bb",
            "caac34f8414046ddbf92bb194db9481a",
            "c9d91914cf8c49caacd55223148c9a48",
            "529bdc5069c344a6abc69fc9da195b9c",
            "9acb00321f1d492591320b75ee461a39",
            "f79512648a6f4bfaa28d80efcf05aa65",
            "dba42d8cdefd4660ac2372de45b4977e",
            "5cb8ca5d9cad45f0b257e3e15f03cd4b",
            "4a3a37cde77f4f7ebdf05e1781df46fe",
            "de5be5014dea4934b7e3df9ecb9610e1",
            "862b432dad6c4c5cbc775fa49ba7f253",
            "a41e56fbe19b49938dcaecdcbeb821c1",
            "486b5d6053e2439bbb6e6ed47116f85e",
            "0a543beb663344d4995ecdf4e9035cbf"
          ]
        },
        "id": "33gqLvEQhxd1",
        "outputId": "e0da189c-07f6-4de2-fe4f-77a1347950cf"
      },
      "execution_count": null,
      "outputs": [
        {
          "output_type": "display_data",
          "data": {
            "text/plain": [
              "config.json:   0%|          | 0.00/571 [00:00<?, ?B/s]"
            ],
            "application/vnd.jupyter.widget-view+json": {
              "version_major": 2,
              "version_minor": 0,
              "model_id": "eccb3bf6c9974493b70fcff8d0c8f925"
            }
          },
          "metadata": {}
        },
        {
          "output_type": "stream",
          "name": "stderr",
          "text": [
            "`torch_dtype` is deprecated! Use `dtype` instead!\n"
          ]
        },
        {
          "output_type": "display_data",
          "data": {
            "text/plain": [
              "model.safetensors.index.json: 0.00B [00:00, ?B/s]"
            ],
            "application/vnd.jupyter.widget-view+json": {
              "version_major": 2,
              "version_minor": 0,
              "model_id": "16e40bf96b074fd29a93229859287c4a"
            }
          },
          "metadata": {}
        },
        {
          "output_type": "display_data",
          "data": {
            "text/plain": [
              "Fetching 2 files:   0%|          | 0/2 [00:00<?, ?it/s]"
            ],
            "application/vnd.jupyter.widget-view+json": {
              "version_major": 2,
              "version_minor": 0,
              "model_id": "1f1f355636cd40bb95016434b949402d"
            }
          },
          "metadata": {}
        },
        {
          "output_type": "display_data",
          "data": {
            "text/plain": [
              "model-00002-of-00002.safetensors:   0%|          | 0.00/4.54G [00:00<?, ?B/s]"
            ],
            "application/vnd.jupyter.widget-view+json": {
              "version_major": 2,
              "version_minor": 0,
              "model_id": "6c23c836882d4fdfbea9c69576931666"
            }
          },
          "metadata": {}
        },
        {
          "output_type": "display_data",
          "data": {
            "text/plain": [
              "model-00001-of-00002.safetensors:   0%|          | 0.00/9.94G [00:00<?, ?B/s]"
            ],
            "application/vnd.jupyter.widget-view+json": {
              "version_major": 2,
              "version_minor": 0,
              "model_id": "d2be33879296453f92be0f4f917fa511"
            }
          },
          "metadata": {}
        },
        {
          "output_type": "display_data",
          "data": {
            "text/plain": [
              "Loading checkpoint shards:   0%|          | 0/2 [00:00<?, ?it/s]"
            ],
            "application/vnd.jupyter.widget-view+json": {
              "version_major": 2,
              "version_minor": 0,
              "model_id": "abe93efc757e4508b3db9dae1f3f7069"
            }
          },
          "metadata": {}
        },
        {
          "output_type": "display_data",
          "data": {
            "text/plain": [
              "generation_config.json:   0%|          | 0.00/116 [00:00<?, ?B/s]"
            ],
            "application/vnd.jupyter.widget-view+json": {
              "version_major": 2,
              "version_minor": 0,
              "model_id": "a42cc4d2eb7d42ecaa58f71edcc4751f"
            }
          },
          "metadata": {}
        },
        {
          "output_type": "display_data",
          "data": {
            "text/plain": [
              "adapter_model.safetensors:   0%|          | 0.00/83.9M [00:00<?, ?B/s]"
            ],
            "application/vnd.jupyter.widget-view+json": {
              "version_major": 2,
              "version_minor": 0,
              "model_id": "529bdc5069c344a6abc69fc9da195b9c"
            }
          },
          "metadata": {}
        }
      ]
    },
    {
      "cell_type": "markdown",
      "source": [
        "Notice how only the adapter layers are trainable:"
      ],
      "metadata": {
        "id": "350Pvl1E7TDL"
      }
    },
    {
      "cell_type": "code",
      "source": [
        "for name, param in model.named_parameters():\n",
        "  print(name, param.requires_grad)"
      ],
      "metadata": {
        "colab": {
          "base_uri": "https://localhost:8080/"
        },
        "id": "KigQ_QKi6ng1",
        "outputId": "2896dcb2-b9b4-438a-d172-a91a206eef95"
      },
      "execution_count": null,
      "outputs": [
        {
          "output_type": "stream",
          "name": "stdout",
          "text": [
            "base_model.model.model.embed_tokens.weight False\n",
            "base_model.model.model.layers.0.self_attn.q_proj.base_layer.weight False\n",
            "base_model.model.model.layers.0.self_attn.q_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.0.self_attn.q_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.0.self_attn.k_proj.base_layer.weight False\n",
            "base_model.model.model.layers.0.self_attn.k_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.0.self_attn.k_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.0.self_attn.v_proj.base_layer.weight False\n",
            "base_model.model.model.layers.0.self_attn.v_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.0.self_attn.v_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.0.self_attn.o_proj.base_layer.weight False\n",
            "base_model.model.model.layers.0.self_attn.o_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.0.self_attn.o_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.0.mlp.gate_proj.base_layer.weight False\n",
            "base_model.model.model.layers.0.mlp.gate_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.0.mlp.gate_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.0.mlp.up_proj.base_layer.weight False\n",
            "base_model.model.model.layers.0.mlp.up_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.0.mlp.up_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.0.mlp.down_proj.base_layer.weight False\n",
            "base_model.model.model.layers.0.mlp.down_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.0.mlp.down_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.0.input_layernorm.weight False\n",
            "base_model.model.model.layers.0.post_attention_layernorm.weight False\n",
            "base_model.model.model.layers.1.self_attn.q_proj.base_layer.weight False\n",
            "base_model.model.model.layers.1.self_attn.q_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.1.self_attn.q_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.1.self_attn.k_proj.base_layer.weight False\n",
            "base_model.model.model.layers.1.self_attn.k_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.1.self_attn.k_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.1.self_attn.v_proj.base_layer.weight False\n",
            "base_model.model.model.layers.1.self_attn.v_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.1.self_attn.v_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.1.self_attn.o_proj.base_layer.weight False\n",
            "base_model.model.model.layers.1.self_attn.o_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.1.self_attn.o_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.1.mlp.gate_proj.base_layer.weight False\n",
            "base_model.model.model.layers.1.mlp.gate_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.1.mlp.gate_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.1.mlp.up_proj.base_layer.weight False\n",
            "base_model.model.model.layers.1.mlp.up_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.1.mlp.up_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.1.mlp.down_proj.base_layer.weight False\n",
            "base_model.model.model.layers.1.mlp.down_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.1.mlp.down_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.1.input_layernorm.weight False\n",
            "base_model.model.model.layers.1.post_attention_layernorm.weight False\n",
            "base_model.model.model.layers.2.self_attn.q_proj.base_layer.weight False\n",
            "base_model.model.model.layers.2.self_attn.q_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.2.self_attn.q_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.2.self_attn.k_proj.base_layer.weight False\n",
            "base_model.model.model.layers.2.self_attn.k_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.2.self_attn.k_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.2.self_attn.v_proj.base_layer.weight False\n",
            "base_model.model.model.layers.2.self_attn.v_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.2.self_attn.v_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.2.self_attn.o_proj.base_layer.weight False\n",
            "base_model.model.model.layers.2.self_attn.o_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.2.self_attn.o_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.2.mlp.gate_proj.base_layer.weight False\n",
            "base_model.model.model.layers.2.mlp.gate_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.2.mlp.gate_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.2.mlp.up_proj.base_layer.weight False\n",
            "base_model.model.model.layers.2.mlp.up_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.2.mlp.up_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.2.mlp.down_proj.base_layer.weight False\n",
            "base_model.model.model.layers.2.mlp.down_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.2.mlp.down_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.2.input_layernorm.weight False\n",
            "base_model.model.model.layers.2.post_attention_layernorm.weight False\n",
            "base_model.model.model.layers.3.self_attn.q_proj.base_layer.weight False\n",
            "base_model.model.model.layers.3.self_attn.q_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.3.self_attn.q_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.3.self_attn.k_proj.base_layer.weight False\n",
            "base_model.model.model.layers.3.self_attn.k_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.3.self_attn.k_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.3.self_attn.v_proj.base_layer.weight False\n",
            "base_model.model.model.layers.3.self_attn.v_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.3.self_attn.v_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.3.self_attn.o_proj.base_layer.weight False\n",
            "base_model.model.model.layers.3.self_attn.o_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.3.self_attn.o_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.3.mlp.gate_proj.base_layer.weight False\n",
            "base_model.model.model.layers.3.mlp.gate_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.3.mlp.gate_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.3.mlp.up_proj.base_layer.weight False\n",
            "base_model.model.model.layers.3.mlp.up_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.3.mlp.up_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.3.mlp.down_proj.base_layer.weight False\n",
            "base_model.model.model.layers.3.mlp.down_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.3.mlp.down_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.3.input_layernorm.weight False\n",
            "base_model.model.model.layers.3.post_attention_layernorm.weight False\n",
            "base_model.model.model.layers.4.self_attn.q_proj.base_layer.weight False\n",
            "base_model.model.model.layers.4.self_attn.q_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.4.self_attn.q_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.4.self_attn.k_proj.base_layer.weight False\n",
            "base_model.model.model.layers.4.self_attn.k_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.4.self_attn.k_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.4.self_attn.v_proj.base_layer.weight False\n",
            "base_model.model.model.layers.4.self_attn.v_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.4.self_attn.v_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.4.self_attn.o_proj.base_layer.weight False\n",
            "base_model.model.model.layers.4.self_attn.o_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.4.self_attn.o_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.4.mlp.gate_proj.base_layer.weight False\n",
            "base_model.model.model.layers.4.mlp.gate_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.4.mlp.gate_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.4.mlp.up_proj.base_layer.weight False\n",
            "base_model.model.model.layers.4.mlp.up_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.4.mlp.up_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.4.mlp.down_proj.base_layer.weight False\n",
            "base_model.model.model.layers.4.mlp.down_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.4.mlp.down_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.4.input_layernorm.weight False\n",
            "base_model.model.model.layers.4.post_attention_layernorm.weight False\n",
            "base_model.model.model.layers.5.self_attn.q_proj.base_layer.weight False\n",
            "base_model.model.model.layers.5.self_attn.q_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.5.self_attn.q_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.5.self_attn.k_proj.base_layer.weight False\n",
            "base_model.model.model.layers.5.self_attn.k_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.5.self_attn.k_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.5.self_attn.v_proj.base_layer.weight False\n",
            "base_model.model.model.layers.5.self_attn.v_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.5.self_attn.v_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.5.self_attn.o_proj.base_layer.weight False\n",
            "base_model.model.model.layers.5.self_attn.o_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.5.self_attn.o_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.5.mlp.gate_proj.base_layer.weight False\n",
            "base_model.model.model.layers.5.mlp.gate_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.5.mlp.gate_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.5.mlp.up_proj.base_layer.weight False\n",
            "base_model.model.model.layers.5.mlp.up_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.5.mlp.up_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.5.mlp.down_proj.base_layer.weight False\n",
            "base_model.model.model.layers.5.mlp.down_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.5.mlp.down_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.5.input_layernorm.weight False\n",
            "base_model.model.model.layers.5.post_attention_layernorm.weight False\n",
            "base_model.model.model.layers.6.self_attn.q_proj.base_layer.weight False\n",
            "base_model.model.model.layers.6.self_attn.q_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.6.self_attn.q_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.6.self_attn.k_proj.base_layer.weight False\n",
            "base_model.model.model.layers.6.self_attn.k_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.6.self_attn.k_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.6.self_attn.v_proj.base_layer.weight False\n",
            "base_model.model.model.layers.6.self_attn.v_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.6.self_attn.v_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.6.self_attn.o_proj.base_layer.weight False\n",
            "base_model.model.model.layers.6.self_attn.o_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.6.self_attn.o_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.6.mlp.gate_proj.base_layer.weight False\n",
            "base_model.model.model.layers.6.mlp.gate_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.6.mlp.gate_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.6.mlp.up_proj.base_layer.weight False\n",
            "base_model.model.model.layers.6.mlp.up_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.6.mlp.up_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.6.mlp.down_proj.base_layer.weight False\n",
            "base_model.model.model.layers.6.mlp.down_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.6.mlp.down_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.6.input_layernorm.weight False\n",
            "base_model.model.model.layers.6.post_attention_layernorm.weight False\n",
            "base_model.model.model.layers.7.self_attn.q_proj.base_layer.weight False\n",
            "base_model.model.model.layers.7.self_attn.q_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.7.self_attn.q_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.7.self_attn.k_proj.base_layer.weight False\n",
            "base_model.model.model.layers.7.self_attn.k_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.7.self_attn.k_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.7.self_attn.v_proj.base_layer.weight False\n",
            "base_model.model.model.layers.7.self_attn.v_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.7.self_attn.v_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.7.self_attn.o_proj.base_layer.weight False\n",
            "base_model.model.model.layers.7.self_attn.o_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.7.self_attn.o_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.7.mlp.gate_proj.base_layer.weight False\n",
            "base_model.model.model.layers.7.mlp.gate_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.7.mlp.gate_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.7.mlp.up_proj.base_layer.weight False\n",
            "base_model.model.model.layers.7.mlp.up_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.7.mlp.up_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.7.mlp.down_proj.base_layer.weight False\n",
            "base_model.model.model.layers.7.mlp.down_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.7.mlp.down_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.7.input_layernorm.weight False\n",
            "base_model.model.model.layers.7.post_attention_layernorm.weight False\n",
            "base_model.model.model.layers.8.self_attn.q_proj.base_layer.weight False\n",
            "base_model.model.model.layers.8.self_attn.q_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.8.self_attn.q_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.8.self_attn.k_proj.base_layer.weight False\n",
            "base_model.model.model.layers.8.self_attn.k_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.8.self_attn.k_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.8.self_attn.v_proj.base_layer.weight False\n",
            "base_model.model.model.layers.8.self_attn.v_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.8.self_attn.v_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.8.self_attn.o_proj.base_layer.weight False\n",
            "base_model.model.model.layers.8.self_attn.o_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.8.self_attn.o_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.8.mlp.gate_proj.base_layer.weight False\n",
            "base_model.model.model.layers.8.mlp.gate_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.8.mlp.gate_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.8.mlp.up_proj.base_layer.weight False\n",
            "base_model.model.model.layers.8.mlp.up_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.8.mlp.up_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.8.mlp.down_proj.base_layer.weight False\n",
            "base_model.model.model.layers.8.mlp.down_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.8.mlp.down_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.8.input_layernorm.weight False\n",
            "base_model.model.model.layers.8.post_attention_layernorm.weight False\n",
            "base_model.model.model.layers.9.self_attn.q_proj.base_layer.weight False\n",
            "base_model.model.model.layers.9.self_attn.q_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.9.self_attn.q_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.9.self_attn.k_proj.base_layer.weight False\n",
            "base_model.model.model.layers.9.self_attn.k_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.9.self_attn.k_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.9.self_attn.v_proj.base_layer.weight False\n",
            "base_model.model.model.layers.9.self_attn.v_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.9.self_attn.v_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.9.self_attn.o_proj.base_layer.weight False\n",
            "base_model.model.model.layers.9.self_attn.o_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.9.self_attn.o_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.9.mlp.gate_proj.base_layer.weight False\n",
            "base_model.model.model.layers.9.mlp.gate_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.9.mlp.gate_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.9.mlp.up_proj.base_layer.weight False\n",
            "base_model.model.model.layers.9.mlp.up_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.9.mlp.up_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.9.mlp.down_proj.base_layer.weight False\n",
            "base_model.model.model.layers.9.mlp.down_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.9.mlp.down_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.9.input_layernorm.weight False\n",
            "base_model.model.model.layers.9.post_attention_layernorm.weight False\n",
            "base_model.model.model.layers.10.self_attn.q_proj.base_layer.weight False\n",
            "base_model.model.model.layers.10.self_attn.q_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.10.self_attn.q_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.10.self_attn.k_proj.base_layer.weight False\n",
            "base_model.model.model.layers.10.self_attn.k_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.10.self_attn.k_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.10.self_attn.v_proj.base_layer.weight False\n",
            "base_model.model.model.layers.10.self_attn.v_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.10.self_attn.v_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.10.self_attn.o_proj.base_layer.weight False\n",
            "base_model.model.model.layers.10.self_attn.o_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.10.self_attn.o_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.10.mlp.gate_proj.base_layer.weight False\n",
            "base_model.model.model.layers.10.mlp.gate_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.10.mlp.gate_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.10.mlp.up_proj.base_layer.weight False\n",
            "base_model.model.model.layers.10.mlp.up_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.10.mlp.up_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.10.mlp.down_proj.base_layer.weight False\n",
            "base_model.model.model.layers.10.mlp.down_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.10.mlp.down_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.10.input_layernorm.weight False\n",
            "base_model.model.model.layers.10.post_attention_layernorm.weight False\n",
            "base_model.model.model.layers.11.self_attn.q_proj.base_layer.weight False\n",
            "base_model.model.model.layers.11.self_attn.q_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.11.self_attn.q_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.11.self_attn.k_proj.base_layer.weight False\n",
            "base_model.model.model.layers.11.self_attn.k_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.11.self_attn.k_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.11.self_attn.v_proj.base_layer.weight False\n",
            "base_model.model.model.layers.11.self_attn.v_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.11.self_attn.v_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.11.self_attn.o_proj.base_layer.weight False\n",
            "base_model.model.model.layers.11.self_attn.o_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.11.self_attn.o_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.11.mlp.gate_proj.base_layer.weight False\n",
            "base_model.model.model.layers.11.mlp.gate_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.11.mlp.gate_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.11.mlp.up_proj.base_layer.weight False\n",
            "base_model.model.model.layers.11.mlp.up_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.11.mlp.up_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.11.mlp.down_proj.base_layer.weight False\n",
            "base_model.model.model.layers.11.mlp.down_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.11.mlp.down_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.11.input_layernorm.weight False\n",
            "base_model.model.model.layers.11.post_attention_layernorm.weight False\n",
            "base_model.model.model.layers.12.self_attn.q_proj.base_layer.weight False\n",
            "base_model.model.model.layers.12.self_attn.q_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.12.self_attn.q_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.12.self_attn.k_proj.base_layer.weight False\n",
            "base_model.model.model.layers.12.self_attn.k_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.12.self_attn.k_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.12.self_attn.v_proj.base_layer.weight False\n",
            "base_model.model.model.layers.12.self_attn.v_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.12.self_attn.v_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.12.self_attn.o_proj.base_layer.weight False\n",
            "base_model.model.model.layers.12.self_attn.o_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.12.self_attn.o_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.12.mlp.gate_proj.base_layer.weight False\n",
            "base_model.model.model.layers.12.mlp.gate_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.12.mlp.gate_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.12.mlp.up_proj.base_layer.weight False\n",
            "base_model.model.model.layers.12.mlp.up_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.12.mlp.up_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.12.mlp.down_proj.base_layer.weight False\n",
            "base_model.model.model.layers.12.mlp.down_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.12.mlp.down_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.12.input_layernorm.weight False\n",
            "base_model.model.model.layers.12.post_attention_layernorm.weight False\n",
            "base_model.model.model.layers.13.self_attn.q_proj.base_layer.weight False\n",
            "base_model.model.model.layers.13.self_attn.q_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.13.self_attn.q_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.13.self_attn.k_proj.base_layer.weight False\n",
            "base_model.model.model.layers.13.self_attn.k_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.13.self_attn.k_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.13.self_attn.v_proj.base_layer.weight False\n",
            "base_model.model.model.layers.13.self_attn.v_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.13.self_attn.v_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.13.self_attn.o_proj.base_layer.weight False\n",
            "base_model.model.model.layers.13.self_attn.o_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.13.self_attn.o_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.13.mlp.gate_proj.base_layer.weight False\n",
            "base_model.model.model.layers.13.mlp.gate_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.13.mlp.gate_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.13.mlp.up_proj.base_layer.weight False\n",
            "base_model.model.model.layers.13.mlp.up_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.13.mlp.up_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.13.mlp.down_proj.base_layer.weight False\n",
            "base_model.model.model.layers.13.mlp.down_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.13.mlp.down_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.13.input_layernorm.weight False\n",
            "base_model.model.model.layers.13.post_attention_layernorm.weight False\n",
            "base_model.model.model.layers.14.self_attn.q_proj.base_layer.weight False\n",
            "base_model.model.model.layers.14.self_attn.q_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.14.self_attn.q_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.14.self_attn.k_proj.base_layer.weight False\n",
            "base_model.model.model.layers.14.self_attn.k_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.14.self_attn.k_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.14.self_attn.v_proj.base_layer.weight False\n",
            "base_model.model.model.layers.14.self_attn.v_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.14.self_attn.v_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.14.self_attn.o_proj.base_layer.weight False\n",
            "base_model.model.model.layers.14.self_attn.o_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.14.self_attn.o_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.14.mlp.gate_proj.base_layer.weight False\n",
            "base_model.model.model.layers.14.mlp.gate_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.14.mlp.gate_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.14.mlp.up_proj.base_layer.weight False\n",
            "base_model.model.model.layers.14.mlp.up_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.14.mlp.up_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.14.mlp.down_proj.base_layer.weight False\n",
            "base_model.model.model.layers.14.mlp.down_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.14.mlp.down_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.14.input_layernorm.weight False\n",
            "base_model.model.model.layers.14.post_attention_layernorm.weight False\n",
            "base_model.model.model.layers.15.self_attn.q_proj.base_layer.weight False\n",
            "base_model.model.model.layers.15.self_attn.q_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.15.self_attn.q_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.15.self_attn.k_proj.base_layer.weight False\n",
            "base_model.model.model.layers.15.self_attn.k_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.15.self_attn.k_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.15.self_attn.v_proj.base_layer.weight False\n",
            "base_model.model.model.layers.15.self_attn.v_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.15.self_attn.v_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.15.self_attn.o_proj.base_layer.weight False\n",
            "base_model.model.model.layers.15.self_attn.o_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.15.self_attn.o_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.15.mlp.gate_proj.base_layer.weight False\n",
            "base_model.model.model.layers.15.mlp.gate_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.15.mlp.gate_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.15.mlp.up_proj.base_layer.weight False\n",
            "base_model.model.model.layers.15.mlp.up_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.15.mlp.up_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.15.mlp.down_proj.base_layer.weight False\n",
            "base_model.model.model.layers.15.mlp.down_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.15.mlp.down_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.15.input_layernorm.weight False\n",
            "base_model.model.model.layers.15.post_attention_layernorm.weight False\n",
            "base_model.model.model.layers.16.self_attn.q_proj.base_layer.weight False\n",
            "base_model.model.model.layers.16.self_attn.q_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.16.self_attn.q_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.16.self_attn.k_proj.base_layer.weight False\n",
            "base_model.model.model.layers.16.self_attn.k_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.16.self_attn.k_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.16.self_attn.v_proj.base_layer.weight False\n",
            "base_model.model.model.layers.16.self_attn.v_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.16.self_attn.v_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.16.self_attn.o_proj.base_layer.weight False\n",
            "base_model.model.model.layers.16.self_attn.o_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.16.self_attn.o_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.16.mlp.gate_proj.base_layer.weight False\n",
            "base_model.model.model.layers.16.mlp.gate_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.16.mlp.gate_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.16.mlp.up_proj.base_layer.weight False\n",
            "base_model.model.model.layers.16.mlp.up_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.16.mlp.up_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.16.mlp.down_proj.base_layer.weight False\n",
            "base_model.model.model.layers.16.mlp.down_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.16.mlp.down_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.16.input_layernorm.weight False\n",
            "base_model.model.model.layers.16.post_attention_layernorm.weight False\n",
            "base_model.model.model.layers.17.self_attn.q_proj.base_layer.weight False\n",
            "base_model.model.model.layers.17.self_attn.q_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.17.self_attn.q_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.17.self_attn.k_proj.base_layer.weight False\n",
            "base_model.model.model.layers.17.self_attn.k_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.17.self_attn.k_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.17.self_attn.v_proj.base_layer.weight False\n",
            "base_model.model.model.layers.17.self_attn.v_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.17.self_attn.v_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.17.self_attn.o_proj.base_layer.weight False\n",
            "base_model.model.model.layers.17.self_attn.o_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.17.self_attn.o_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.17.mlp.gate_proj.base_layer.weight False\n",
            "base_model.model.model.layers.17.mlp.gate_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.17.mlp.gate_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.17.mlp.up_proj.base_layer.weight False\n",
            "base_model.model.model.layers.17.mlp.up_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.17.mlp.up_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.17.mlp.down_proj.base_layer.weight False\n",
            "base_model.model.model.layers.17.mlp.down_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.17.mlp.down_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.17.input_layernorm.weight False\n",
            "base_model.model.model.layers.17.post_attention_layernorm.weight False\n",
            "base_model.model.model.layers.18.self_attn.q_proj.base_layer.weight False\n",
            "base_model.model.model.layers.18.self_attn.q_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.18.self_attn.q_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.18.self_attn.k_proj.base_layer.weight False\n",
            "base_model.model.model.layers.18.self_attn.k_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.18.self_attn.k_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.18.self_attn.v_proj.base_layer.weight False\n",
            "base_model.model.model.layers.18.self_attn.v_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.18.self_attn.v_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.18.self_attn.o_proj.base_layer.weight False\n",
            "base_model.model.model.layers.18.self_attn.o_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.18.self_attn.o_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.18.mlp.gate_proj.base_layer.weight False\n",
            "base_model.model.model.layers.18.mlp.gate_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.18.mlp.gate_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.18.mlp.up_proj.base_layer.weight False\n",
            "base_model.model.model.layers.18.mlp.up_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.18.mlp.up_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.18.mlp.down_proj.base_layer.weight False\n",
            "base_model.model.model.layers.18.mlp.down_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.18.mlp.down_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.18.input_layernorm.weight False\n",
            "base_model.model.model.layers.18.post_attention_layernorm.weight False\n",
            "base_model.model.model.layers.19.self_attn.q_proj.base_layer.weight False\n",
            "base_model.model.model.layers.19.self_attn.q_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.19.self_attn.q_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.19.self_attn.k_proj.base_layer.weight False\n",
            "base_model.model.model.layers.19.self_attn.k_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.19.self_attn.k_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.19.self_attn.v_proj.base_layer.weight False\n",
            "base_model.model.model.layers.19.self_attn.v_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.19.self_attn.v_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.19.self_attn.o_proj.base_layer.weight False\n",
            "base_model.model.model.layers.19.self_attn.o_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.19.self_attn.o_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.19.mlp.gate_proj.base_layer.weight False\n",
            "base_model.model.model.layers.19.mlp.gate_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.19.mlp.gate_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.19.mlp.up_proj.base_layer.weight False\n",
            "base_model.model.model.layers.19.mlp.up_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.19.mlp.up_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.19.mlp.down_proj.base_layer.weight False\n",
            "base_model.model.model.layers.19.mlp.down_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.19.mlp.down_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.19.input_layernorm.weight False\n",
            "base_model.model.model.layers.19.post_attention_layernorm.weight False\n",
            "base_model.model.model.layers.20.self_attn.q_proj.base_layer.weight False\n",
            "base_model.model.model.layers.20.self_attn.q_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.20.self_attn.q_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.20.self_attn.k_proj.base_layer.weight False\n",
            "base_model.model.model.layers.20.self_attn.k_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.20.self_attn.k_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.20.self_attn.v_proj.base_layer.weight False\n",
            "base_model.model.model.layers.20.self_attn.v_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.20.self_attn.v_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.20.self_attn.o_proj.base_layer.weight False\n",
            "base_model.model.model.layers.20.self_attn.o_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.20.self_attn.o_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.20.mlp.gate_proj.base_layer.weight False\n",
            "base_model.model.model.layers.20.mlp.gate_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.20.mlp.gate_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.20.mlp.up_proj.base_layer.weight False\n",
            "base_model.model.model.layers.20.mlp.up_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.20.mlp.up_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.20.mlp.down_proj.base_layer.weight False\n",
            "base_model.model.model.layers.20.mlp.down_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.20.mlp.down_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.20.input_layernorm.weight False\n",
            "base_model.model.model.layers.20.post_attention_layernorm.weight False\n",
            "base_model.model.model.layers.21.self_attn.q_proj.base_layer.weight False\n",
            "base_model.model.model.layers.21.self_attn.q_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.21.self_attn.q_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.21.self_attn.k_proj.base_layer.weight False\n",
            "base_model.model.model.layers.21.self_attn.k_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.21.self_attn.k_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.21.self_attn.v_proj.base_layer.weight False\n",
            "base_model.model.model.layers.21.self_attn.v_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.21.self_attn.v_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.21.self_attn.o_proj.base_layer.weight False\n",
            "base_model.model.model.layers.21.self_attn.o_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.21.self_attn.o_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.21.mlp.gate_proj.base_layer.weight False\n",
            "base_model.model.model.layers.21.mlp.gate_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.21.mlp.gate_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.21.mlp.up_proj.base_layer.weight False\n",
            "base_model.model.model.layers.21.mlp.up_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.21.mlp.up_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.21.mlp.down_proj.base_layer.weight False\n",
            "base_model.model.model.layers.21.mlp.down_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.21.mlp.down_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.21.input_layernorm.weight False\n",
            "base_model.model.model.layers.21.post_attention_layernorm.weight False\n",
            "base_model.model.model.layers.22.self_attn.q_proj.base_layer.weight False\n",
            "base_model.model.model.layers.22.self_attn.q_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.22.self_attn.q_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.22.self_attn.k_proj.base_layer.weight False\n",
            "base_model.model.model.layers.22.self_attn.k_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.22.self_attn.k_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.22.self_attn.v_proj.base_layer.weight False\n",
            "base_model.model.model.layers.22.self_attn.v_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.22.self_attn.v_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.22.self_attn.o_proj.base_layer.weight False\n",
            "base_model.model.model.layers.22.self_attn.o_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.22.self_attn.o_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.22.mlp.gate_proj.base_layer.weight False\n",
            "base_model.model.model.layers.22.mlp.gate_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.22.mlp.gate_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.22.mlp.up_proj.base_layer.weight False\n",
            "base_model.model.model.layers.22.mlp.up_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.22.mlp.up_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.22.mlp.down_proj.base_layer.weight False\n",
            "base_model.model.model.layers.22.mlp.down_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.22.mlp.down_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.22.input_layernorm.weight False\n",
            "base_model.model.model.layers.22.post_attention_layernorm.weight False\n",
            "base_model.model.model.layers.23.self_attn.q_proj.base_layer.weight False\n",
            "base_model.model.model.layers.23.self_attn.q_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.23.self_attn.q_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.23.self_attn.k_proj.base_layer.weight False\n",
            "base_model.model.model.layers.23.self_attn.k_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.23.self_attn.k_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.23.self_attn.v_proj.base_layer.weight False\n",
            "base_model.model.model.layers.23.self_attn.v_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.23.self_attn.v_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.23.self_attn.o_proj.base_layer.weight False\n",
            "base_model.model.model.layers.23.self_attn.o_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.23.self_attn.o_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.23.mlp.gate_proj.base_layer.weight False\n",
            "base_model.model.model.layers.23.mlp.gate_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.23.mlp.gate_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.23.mlp.up_proj.base_layer.weight False\n",
            "base_model.model.model.layers.23.mlp.up_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.23.mlp.up_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.23.mlp.down_proj.base_layer.weight False\n",
            "base_model.model.model.layers.23.mlp.down_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.23.mlp.down_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.23.input_layernorm.weight False\n",
            "base_model.model.model.layers.23.post_attention_layernorm.weight False\n",
            "base_model.model.model.layers.24.self_attn.q_proj.base_layer.weight False\n",
            "base_model.model.model.layers.24.self_attn.q_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.24.self_attn.q_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.24.self_attn.k_proj.base_layer.weight False\n",
            "base_model.model.model.layers.24.self_attn.k_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.24.self_attn.k_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.24.self_attn.v_proj.base_layer.weight False\n",
            "base_model.model.model.layers.24.self_attn.v_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.24.self_attn.v_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.24.self_attn.o_proj.base_layer.weight False\n",
            "base_model.model.model.layers.24.self_attn.o_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.24.self_attn.o_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.24.mlp.gate_proj.base_layer.weight False\n",
            "base_model.model.model.layers.24.mlp.gate_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.24.mlp.gate_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.24.mlp.up_proj.base_layer.weight False\n",
            "base_model.model.model.layers.24.mlp.up_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.24.mlp.up_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.24.mlp.down_proj.base_layer.weight False\n",
            "base_model.model.model.layers.24.mlp.down_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.24.mlp.down_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.24.input_layernorm.weight False\n",
            "base_model.model.model.layers.24.post_attention_layernorm.weight False\n",
            "base_model.model.model.layers.25.self_attn.q_proj.base_layer.weight False\n",
            "base_model.model.model.layers.25.self_attn.q_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.25.self_attn.q_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.25.self_attn.k_proj.base_layer.weight False\n",
            "base_model.model.model.layers.25.self_attn.k_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.25.self_attn.k_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.25.self_attn.v_proj.base_layer.weight False\n",
            "base_model.model.model.layers.25.self_attn.v_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.25.self_attn.v_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.25.self_attn.o_proj.base_layer.weight False\n",
            "base_model.model.model.layers.25.self_attn.o_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.25.self_attn.o_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.25.mlp.gate_proj.base_layer.weight False\n",
            "base_model.model.model.layers.25.mlp.gate_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.25.mlp.gate_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.25.mlp.up_proj.base_layer.weight False\n",
            "base_model.model.model.layers.25.mlp.up_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.25.mlp.up_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.25.mlp.down_proj.base_layer.weight False\n",
            "base_model.model.model.layers.25.mlp.down_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.25.mlp.down_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.25.input_layernorm.weight False\n",
            "base_model.model.model.layers.25.post_attention_layernorm.weight False\n",
            "base_model.model.model.layers.26.self_attn.q_proj.base_layer.weight False\n",
            "base_model.model.model.layers.26.self_attn.q_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.26.self_attn.q_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.26.self_attn.k_proj.base_layer.weight False\n",
            "base_model.model.model.layers.26.self_attn.k_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.26.self_attn.k_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.26.self_attn.v_proj.base_layer.weight False\n",
            "base_model.model.model.layers.26.self_attn.v_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.26.self_attn.v_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.26.self_attn.o_proj.base_layer.weight False\n",
            "base_model.model.model.layers.26.self_attn.o_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.26.self_attn.o_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.26.mlp.gate_proj.base_layer.weight False\n",
            "base_model.model.model.layers.26.mlp.gate_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.26.mlp.gate_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.26.mlp.up_proj.base_layer.weight False\n",
            "base_model.model.model.layers.26.mlp.up_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.26.mlp.up_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.26.mlp.down_proj.base_layer.weight False\n",
            "base_model.model.model.layers.26.mlp.down_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.26.mlp.down_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.26.input_layernorm.weight False\n",
            "base_model.model.model.layers.26.post_attention_layernorm.weight False\n",
            "base_model.model.model.layers.27.self_attn.q_proj.base_layer.weight False\n",
            "base_model.model.model.layers.27.self_attn.q_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.27.self_attn.q_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.27.self_attn.k_proj.base_layer.weight False\n",
            "base_model.model.model.layers.27.self_attn.k_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.27.self_attn.k_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.27.self_attn.v_proj.base_layer.weight False\n",
            "base_model.model.model.layers.27.self_attn.v_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.27.self_attn.v_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.27.self_attn.o_proj.base_layer.weight False\n",
            "base_model.model.model.layers.27.self_attn.o_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.27.self_attn.o_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.27.mlp.gate_proj.base_layer.weight False\n",
            "base_model.model.model.layers.27.mlp.gate_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.27.mlp.gate_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.27.mlp.up_proj.base_layer.weight False\n",
            "base_model.model.model.layers.27.mlp.up_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.27.mlp.up_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.27.mlp.down_proj.base_layer.weight False\n",
            "base_model.model.model.layers.27.mlp.down_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.27.mlp.down_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.27.input_layernorm.weight False\n",
            "base_model.model.model.layers.27.post_attention_layernorm.weight False\n",
            "base_model.model.model.layers.28.self_attn.q_proj.base_layer.weight False\n",
            "base_model.model.model.layers.28.self_attn.q_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.28.self_attn.q_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.28.self_attn.k_proj.base_layer.weight False\n",
            "base_model.model.model.layers.28.self_attn.k_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.28.self_attn.k_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.28.self_attn.v_proj.base_layer.weight False\n",
            "base_model.model.model.layers.28.self_attn.v_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.28.self_attn.v_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.28.self_attn.o_proj.base_layer.weight False\n",
            "base_model.model.model.layers.28.self_attn.o_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.28.self_attn.o_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.28.mlp.gate_proj.base_layer.weight False\n",
            "base_model.model.model.layers.28.mlp.gate_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.28.mlp.gate_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.28.mlp.up_proj.base_layer.weight False\n",
            "base_model.model.model.layers.28.mlp.up_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.28.mlp.up_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.28.mlp.down_proj.base_layer.weight False\n",
            "base_model.model.model.layers.28.mlp.down_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.28.mlp.down_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.28.input_layernorm.weight False\n",
            "base_model.model.model.layers.28.post_attention_layernorm.weight False\n",
            "base_model.model.model.layers.29.self_attn.q_proj.base_layer.weight False\n",
            "base_model.model.model.layers.29.self_attn.q_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.29.self_attn.q_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.29.self_attn.k_proj.base_layer.weight False\n",
            "base_model.model.model.layers.29.self_attn.k_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.29.self_attn.k_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.29.self_attn.v_proj.base_layer.weight False\n",
            "base_model.model.model.layers.29.self_attn.v_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.29.self_attn.v_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.29.self_attn.o_proj.base_layer.weight False\n",
            "base_model.model.model.layers.29.self_attn.o_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.29.self_attn.o_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.29.mlp.gate_proj.base_layer.weight False\n",
            "base_model.model.model.layers.29.mlp.gate_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.29.mlp.gate_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.29.mlp.up_proj.base_layer.weight False\n",
            "base_model.model.model.layers.29.mlp.up_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.29.mlp.up_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.29.mlp.down_proj.base_layer.weight False\n",
            "base_model.model.model.layers.29.mlp.down_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.29.mlp.down_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.29.input_layernorm.weight False\n",
            "base_model.model.model.layers.29.post_attention_layernorm.weight False\n",
            "base_model.model.model.layers.30.self_attn.q_proj.base_layer.weight False\n",
            "base_model.model.model.layers.30.self_attn.q_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.30.self_attn.q_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.30.self_attn.k_proj.base_layer.weight False\n",
            "base_model.model.model.layers.30.self_attn.k_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.30.self_attn.k_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.30.self_attn.v_proj.base_layer.weight False\n",
            "base_model.model.model.layers.30.self_attn.v_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.30.self_attn.v_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.30.self_attn.o_proj.base_layer.weight False\n",
            "base_model.model.model.layers.30.self_attn.o_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.30.self_attn.o_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.30.mlp.gate_proj.base_layer.weight False\n",
            "base_model.model.model.layers.30.mlp.gate_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.30.mlp.gate_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.30.mlp.up_proj.base_layer.weight False\n",
            "base_model.model.model.layers.30.mlp.up_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.30.mlp.up_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.30.mlp.down_proj.base_layer.weight False\n",
            "base_model.model.model.layers.30.mlp.down_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.30.mlp.down_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.30.input_layernorm.weight False\n",
            "base_model.model.model.layers.30.post_attention_layernorm.weight False\n",
            "base_model.model.model.layers.31.self_attn.q_proj.base_layer.weight False\n",
            "base_model.model.model.layers.31.self_attn.q_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.31.self_attn.q_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.31.self_attn.k_proj.base_layer.weight False\n",
            "base_model.model.model.layers.31.self_attn.k_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.31.self_attn.k_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.31.self_attn.v_proj.base_layer.weight False\n",
            "base_model.model.model.layers.31.self_attn.v_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.31.self_attn.v_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.31.self_attn.o_proj.base_layer.weight False\n",
            "base_model.model.model.layers.31.self_attn.o_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.31.self_attn.o_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.31.mlp.gate_proj.base_layer.weight False\n",
            "base_model.model.model.layers.31.mlp.gate_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.31.mlp.gate_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.31.mlp.up_proj.base_layer.weight False\n",
            "base_model.model.model.layers.31.mlp.up_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.31.mlp.up_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.31.mlp.down_proj.base_layer.weight False\n",
            "base_model.model.model.layers.31.mlp.down_proj.lora_A.default.weight False\n",
            "base_model.model.model.layers.31.mlp.down_proj.lora_B.default.weight False\n",
            "base_model.model.model.layers.31.input_layernorm.weight False\n",
            "base_model.model.model.layers.31.post_attention_layernorm.weight False\n",
            "base_model.model.model.norm.weight False\n",
            "base_model.model.lm_head.weight False\n"
          ]
        }
      ]
    },
    {
      "cell_type": "markdown",
      "source": [
        "## Define DPOTrainer\n",
        "\n",
        "Next, we define the training arguments and instantiate a [DPOTrainer](https://huggingface.co/docs/trl/main/en/dpo_trainer) class which will handle fine-tuning for us.\n",
        "\n",
        "Note that in this case, we leverage the [DPO](https://arxiv.org/abs/2305.18290) (direct preference optimization) method, which is one of the best methods for human preference fine-tuning at the time of writing. Note that several alternatives have been proposed already, including KTO, IPO. The `DPOTrainer` [also supports](https://huggingface.co/docs/trl/main/en/dpo_trainer#loss-functions) these. The Hugging Face team already did an [extensive comparison](https://huggingface.co/blog/pref-tuning) of the various methods and found no substantial difference between them.\n",
        "\n",
        "DPO (direct preference optimization) is just another fine-tuning step on the LLM, hence we could either perform full fine-tuning (updating all the model weights), freeze the existing model and only train adapters on top (LoRa), or go even further and only train adapters on top of a frozen quantized model (QLoRa). The same techniques apply as during SFT.\n",
        "\n",
        "Interestingly, as taken from the [Alignment Handbook README](https://github.com/huggingface/alignment-handbook/tree/main/scripts):\n",
        "\n",
        "> In practice, we find comparable performance for both full and QLoRA fine-tuning, with the latter having the advantage of producing small adapter weights that are fast to upload and download from the Hugging Face Hub.\n",
        "\n",
        "For full fine-tuning, you would need approximately 126GB of GPU RAM for a 7B model (hence one typically uses multiple A100s). With QLoRa, you only need about 7GB! In this case, as we're running on an RTX 4090 which has 24GB of RAM, we will use [QLoRa](https://huggingface.co/blog/4bit-transformers-bitsandbytes), which is the most memory efficient.\n",
        "\n",
        "Hence, we pass a `peft_config` to DPOTrainer, making sure that adapter layers are added on top in bfloat16. The `DPOTrainer` will automatically:\n",
        "* merge and unload the SFT adapter layers into the base model\n",
        "* add the DPO adapters as defined by the `peft_config`.\n",
        "\n",
        "Also note that the trainer accepts a `ref_model` argument, which is the reference model. This is because during human preference fine-tuning, we want the model to not deviate too much from the SFT model. Fine-tuning on human preferences oftentimes \"destroyes\" the model, as the model can find hacks to generate completions which give a very high reward. Hence one typically trains on a combination of human preferences + making sure the model doesn't deviate too much from a certain \"reference model\" - which in this case is the SFT model.\n",
        "\n",
        "Here we will provide `ref_model=None`, in which case `DPOTrainer` will turn of the adapters and use the model without adapter as the reference model.\n",
        "\n",
        "We also leverage several well-known techniques for maximizing performance on a single GPU: gradient checkpointing, gradient accumulation, mixed precision training in bfloat16. Refer to [this guide](https://huggingface.co/docs/transformers/v4.20.1/en/perf_train_gpu_one) for all the details."
      ],
      "metadata": {
        "id": "9AlKMUUcfbs1"
      }
    },
    {
      "cell_type": "code",
      "source": [
        "from trl import DPOTrainer, DPOConfig\n",
        "from peft import LoraConfig\n",
        "#from transformers import TrainingArguments\n",
        "\n",
        "# path where the Trainer will save its checkpoints and logs\n",
        "output_dir = 'data/zephyr-7b-dpo-lora'\n",
        "\n",
        "# based on config\n",
        "training_args = DPOConfig(\n",
        "    bf16=True,\n",
        "    beta=0.01,\n",
        "    do_eval=True,\n",
        "    eval_steps=100,\n",
        "    gradient_accumulation_steps=4,\n",
        "    gradient_checkpointing=True,\n",
        "    gradient_checkpointing_kwargs={\"use_reentrant\":False},\n",
        "    hub_model_id=\"zephyr-7b-dpo-qlora\",\n",
        "    learning_rate=5.0e-6,\n",
        "    log_level=\"info\",\n",
        "    logging_steps=10,\n",
        "    lr_scheduler_type=\"cosine\",\n",
        "    max_length=1024,\n",
        "    num_train_epochs=1,\n",
        "    optim=\"paged_adamw_32bit\",\n",
        "    output_dir=output_dir,  # It is handy to append `hub_model_revision` to keep track of your local experiments\n",
        "    per_device_train_batch_size=4,\n",
        "    per_device_eval_batch_size=8,\n",
        "    save_strategy=\"steps\",\n",
        "    save_steps=100,\n",
        "    save_total_limit=1,\n",
        "    seed=42,\n",
        "    warmup_ratio=0.1,\n",
        "    loss_type=\"sigmoid\",\n",
        "    report_to=\"none\"\n",
        ")\n",
        "# based on the recipe: https://github.com/huggingface/alignment-handbook/blob/main/recipes/zephyr-7b-beta/dpo/config_qlora.yaml\n",
        "peft_config = LoraConfig(\n",
        "        r=128,\n",
        "        lora_alpha=128,\n",
        "        lora_dropout=0.05,\n",
        "        bias=\"none\",\n",
        "        task_type=\"CAUSAL_LM\",\n",
        "        target_modules=[\"q_proj\", \"k_proj\", \"v_proj\", \"o_proj\", \"gate_proj\",  \"up_proj\",  \"down_proj\"],\n",
        ")\n",
        "\n",
        "\n",
        "trainer = DPOTrainer(\n",
        "        model,\n",
        "        ref_model=None,\n",
        "        args=training_args,\n",
        "        train_dataset=raw_datasets[\"train\"],\n",
        "        eval_dataset=raw_datasets[\"test\"],\n",
        "        peft_config=peft_config\n",
        "    )"
      ],
      "metadata": {
        "id": "UG5VOPQkfcXi",
        "colab": {
          "base_uri": "https://localhost:8080/"
        },
        "outputId": "d7c5009c-9979-42d4-e6cf-7f1000ffeca1"
      },
      "execution_count": null,
      "outputs": [
        {
          "output_type": "stream",
          "name": "stderr",
          "text": [
            "PyTorch: setting up devices\n",
            "loading configuration file config.json from cache at /root/.cache/huggingface/hub/models--mistralai--Mistral-7B-v0.1/snapshots/27d67f1b5f57dc0953326b2601d68371d40ea8da/config.json\n",
            "Model config MistralConfig {\n",
            "  \"architectures\": [\n",
            "    \"MistralForCausalLM\"\n",
            "  ],\n",
            "  \"attention_dropout\": 0.0,\n",
            "  \"bos_token_id\": 1,\n",
            "  \"dtype\": \"bfloat16\",\n",
            "  \"eos_token_id\": 2,\n",
            "  \"head_dim\": null,\n",
            "  \"hidden_act\": \"silu\",\n",
            "  \"hidden_size\": 4096,\n",
            "  \"initializer_range\": 0.02,\n",
            "  \"intermediate_size\": 14336,\n",
            "  \"max_position_embeddings\": 32768,\n",
            "  \"model_type\": \"mistral\",\n",
            "  \"num_attention_heads\": 32,\n",
            "  \"num_hidden_layers\": 32,\n",
            "  \"num_key_value_heads\": 8,\n",
            "  \"rms_norm_eps\": 1e-05,\n",
            "  \"rope_theta\": 10000.0,\n",
            "  \"sliding_window\": 4096,\n",
            "  \"tie_word_embeddings\": false,\n",
            "  \"transformers_version\": \"4.57.0\",\n",
            "  \"use_cache\": true,\n",
            "  \"vocab_size\": 32000\n",
            "}\n",
            "\n",
            "loading file tokenizer.model from cache at /root/.cache/huggingface/hub/models--mistralai--Mistral-7B-v0.1/snapshots/27d67f1b5f57dc0953326b2601d68371d40ea8da/tokenizer.model\n",
            "loading file tokenizer.json from cache at /root/.cache/huggingface/hub/models--mistralai--Mistral-7B-v0.1/snapshots/27d67f1b5f57dc0953326b2601d68371d40ea8da/tokenizer.json\n",
            "loading file added_tokens.json from cache at None\n",
            "loading file special_tokens_map.json from cache at /root/.cache/huggingface/hub/models--mistralai--Mistral-7B-v0.1/snapshots/27d67f1b5f57dc0953326b2601d68371d40ea8da/special_tokens_map.json\n",
            "loading file tokenizer_config.json from cache at /root/.cache/huggingface/hub/models--mistralai--Mistral-7B-v0.1/snapshots/27d67f1b5f57dc0953326b2601d68371d40ea8da/tokenizer_config.json\n",
            "loading file chat_template.jinja from cache at None\n",
            "/usr/local/lib/python3.12/dist-packages/peft/mapping_func.py:73: UserWarning: You are trying to modify a model with PEFT for a second time. If you want to reload the model with a different config, make sure to call `.unload()` before.\n",
            "  warnings.warn(\n",
            "/usr/local/lib/python3.12/dist-packages/peft/tuners/tuners_utils.py:196: UserWarning: Already found a `peft_config` attribute in the model. This will lead to having multiple adapters in the model. Make sure to know what you are doing!\n",
            "  warnings.warn(\n",
            "Using auto half precision backend\n"
          ]
        }
      ]
    },
    {
      "cell_type": "markdown",
      "source": [
        "## Train!\n",
        "\n",
        "Finally, training is as simple as calling trainer.train()!"
      ],
      "metadata": {
        "id": "cGldALxQIwYu"
      }
    },
    {
      "cell_type": "code",
      "source": [
        "train_result = trainer.train()"
      ],
      "metadata": {
        "id": "HgEnI5KMIwyt",
        "colab": {
          "base_uri": "https://localhost:8080/",
          "height": 860
        },
        "outputId": "e9748e59-0d31-4894-c27d-00d092309988"
      },
      "execution_count": null,
      "outputs": [
        {
          "output_type": "stream",
          "name": "stderr",
          "text": [
            "The following columns in the Training set don't have a corresponding argument in `PeftModelForCausalLM.forward` and have been ignored: rejected, chosen, prompt. If rejected, chosen, prompt are not expected by `PeftModelForCausalLM.forward`,  you can safely ignore this message.\n",
            "***** Running training *****\n",
            "  Num examples = 100\n",
            "  Num Epochs = 1\n",
            "  Instantaneous batch size per device = 4\n",
            "  Total train batch size (w. parallel, distributed & accumulation) = 16\n",
            "  Gradient Accumulation steps = 4\n",
            "  Total optimization steps = 7\n",
            "  Number of trainable parameters = 335,544,320\n"
          ]
        },
        {
          "output_type": "display_data",
          "data": {
            "text/plain": [
              "<IPython.core.display.HTML object>"
            ],
            "text/html": [
              "\n",
              "    <div>\n",
              "      \n",
              "      <progress value='7' max='7' style='width:300px; height:20px; vertical-align: middle;'></progress>\n",
              "      [7/7 01:05, Epoch 1/1]\n",
              "    </div>\n",
              "    <table border=\"1\" class=\"dataframe\">\n",
              "  <thead>\n",
              " <tr style=\"text-align: left;\">\n",
              "      <th>Step</th>\n",
              "      <th>Training Loss</th>\n",
              "    </tr>\n",
              "  </thead>\n",
              "  <tbody>\n",
              "  </tbody>\n",
              "</table><p>"
            ]
          },
          "metadata": {}
        },
        {
          "output_type": "stream",
          "name": "stderr",
          "text": [
            "Saving model checkpoint to data/zephyr-7b-dpo-lora/checkpoint-7\n",
            "loading configuration file config.json from cache at /root/.cache/huggingface/hub/models--mistralai--Mistral-7B-v0.1/snapshots/27d67f1b5f57dc0953326b2601d68371d40ea8da/config.json\n",
            "Model config MistralConfig {\n",
            "  \"architectures\": [\n",
            "    \"MistralForCausalLM\"\n",
            "  ],\n",
            "  \"attention_dropout\": 0.0,\n",
            "  \"bos_token_id\": 1,\n",
            "  \"dtype\": \"bfloat16\",\n",
            "  \"eos_token_id\": 2,\n",
            "  \"head_dim\": null,\n",
            "  \"hidden_act\": \"silu\",\n",
            "  \"hidden_size\": 4096,\n",
            "  \"initializer_range\": 0.02,\n",
            "  \"intermediate_size\": 14336,\n",
            "  \"max_position_embeddings\": 32768,\n",
            "  \"model_type\": \"mistral\",\n",
            "  \"num_attention_heads\": 32,\n",
            "  \"num_hidden_layers\": 32,\n",
            "  \"num_key_value_heads\": 8,\n",
            "  \"rms_norm_eps\": 1e-05,\n",
            "  \"rope_theta\": 10000.0,\n",
            "  \"sliding_window\": 4096,\n",
            "  \"tie_word_embeddings\": false,\n",
            "  \"transformers_version\": \"4.57.0\",\n",
            "  \"use_cache\": true,\n",
            "  \"vocab_size\": 32000\n",
            "}\n",
            "\n",
            "tokenizer config file saved in data/zephyr-7b-dpo-lora/checkpoint-7/tokenizer_config.json\n",
            "Special tokens file saved in data/zephyr-7b-dpo-lora/checkpoint-7/special_tokens_map.json\n",
            "\n",
            "\n",
            "Training completed. Do not forget to share your model on huggingface.co/models =)\n",
            "\n",
            "\n"
          ]
        }
      ]
    },
    {
      "cell_type": "markdown",
      "source": [
        "## Saving the model\n",
        "\n",
        "Next, we save the Trainer's state. We also add the number of training samples to the logs."
      ],
      "metadata": {
        "id": "2xxjryHNBKD6"
      }
    },
    {
      "cell_type": "code",
      "source": [
        "metrics = train_result.metrics\n",
        "#max_train_samples = training_args.max_train_samples if training_args.max_train_samples is not None else len(raw_datasets[\"train\"])\n",
        "metrics[\"train_samples\"] = len(raw_datasets[\"train\"])#min(max_train_samples, len(raw_datasets[\"train\"]))\n",
        "trainer.log_metrics(\"train\", metrics)\n",
        "trainer.save_metrics(\"train\", metrics)\n",
        "trainer.save_state()\n",
        "trainer.save_model()"
      ],
      "metadata": {
        "id": "8Ai5jXhJBMsj",
        "colab": {
          "base_uri": "https://localhost:8080/"
        },
        "outputId": "f2850dca-aa0e-4e87-9d29-f104397a35d0"
      },
      "execution_count": null,
      "outputs": [
        {
          "output_type": "stream",
          "name": "stderr",
          "text": [
            "Saving model checkpoint to data/zephyr-7b-dpo-lora\n"
          ]
        },
        {
          "output_type": "stream",
          "name": "stdout",
          "text": [
            "***** train metrics *****\n",
            "  total_flos               =  7251600GF\n",
            "  train_loss               =     0.6932\n",
            "  train_runtime            = 0:01:19.90\n",
            "  train_samples            =        100\n",
            "  train_samples_per_second =      1.252\n",
            "  train_steps_per_second   =      0.088\n"
          ]
        },
        {
          "output_type": "stream",
          "name": "stderr",
          "text": [
            "loading configuration file config.json from cache at /root/.cache/huggingface/hub/models--mistralai--Mistral-7B-v0.1/snapshots/27d67f1b5f57dc0953326b2601d68371d40ea8da/config.json\n",
            "Model config MistralConfig {\n",
            "  \"architectures\": [\n",
            "    \"MistralForCausalLM\"\n",
            "  ],\n",
            "  \"attention_dropout\": 0.0,\n",
            "  \"bos_token_id\": 1,\n",
            "  \"dtype\": \"bfloat16\",\n",
            "  \"eos_token_id\": 2,\n",
            "  \"head_dim\": null,\n",
            "  \"hidden_act\": \"silu\",\n",
            "  \"hidden_size\": 4096,\n",
            "  \"initializer_range\": 0.02,\n",
            "  \"intermediate_size\": 14336,\n",
            "  \"max_position_embeddings\": 32768,\n",
            "  \"model_type\": \"mistral\",\n",
            "  \"num_attention_heads\": 32,\n",
            "  \"num_hidden_layers\": 32,\n",
            "  \"num_key_value_heads\": 8,\n",
            "  \"rms_norm_eps\": 1e-05,\n",
            "  \"rope_theta\": 10000.0,\n",
            "  \"sliding_window\": 4096,\n",
            "  \"tie_word_embeddings\": false,\n",
            "  \"transformers_version\": \"4.57.0\",\n",
            "  \"use_cache\": true,\n",
            "  \"vocab_size\": 32000\n",
            "}\n",
            "\n",
            "tokenizer config file saved in data/zephyr-7b-dpo-lora/tokenizer_config.json\n",
            "Special tokens file saved in data/zephyr-7b-dpo-lora/special_tokens_map.json\n"
          ]
        }
      ]
    },
    {
      "cell_type": "code",
      "source": [
        "#liberar memoria\n",
        "del model\n",
        "del DPOTrainer\n",
        "del peft_config\n",
        "torch.cuda.empty_cache()"
      ],
      "metadata": {
        "id": "PJQb8D6pIgIW"
      },
      "execution_count": null,
      "outputs": []
    },
    {
      "cell_type": "markdown",
      "source": [
        "## Inference\n",
        "\n",
        "Let's generate some new texts with our trained model.\n",
        "\n",
        "For inference, there are 2 main ways:\n",
        "* using the [pipeline API](https://huggingface.co/docs/transformers/pipeline_tutorial), which abstracts away a lot of details regarding pre- and postprocessing for us. [This model card](https://huggingface.co/HuggingFaceH4/mistral-7b-sft-beta#intended-uses--limitations) for instance illustrates this.\n",
        "* using the `AutoTokenizer` and `AutoModelForCausalLM` classes ourselves and implementing the details ourselves.\n",
        "\n",
        "Let us do the latter, so that we understand what's going on.\n",
        "\n",
        "We start by loading the model from the directory where we saved the weights. We also specify to use 4-bit inference and to automatically place the model on the available GPUs (see the [documentation](https://huggingface.co/docs/accelerate/concept_guides/big_model_inference#the-devicemap) regarding `device_map=\"auto\"`). The AutoModelForCausalLM class will automatically load the base model and DPO adapter thanks to the [PEFT integration](https://huggingface.co/docs/peft/tutorial/peft_integrations#transformers) in the Transformers library."
      ],
      "metadata": {
        "id": "-tCZxj1tBNAc"
      }
    },
    {
      "cell_type": "code",
      "source": [
        "from transformers import AutoTokenizer, AutoModelForCausalLM\n",
        "\n",
        "tokenizer = AutoTokenizer.from_pretrained(\"/content/data/zephyr-7b-dpo-lora\")\n",
        "model = AutoModelForCausalLM.from_pretrained(output_dir, load_in_4bit=True, device_map=\"auto\")\n",
        "\n",
        "# Re-setting the chat template after loading the tokenizer, as it's not automatically loaded from the saved directory\n",
        "# DEFAULT_CHAT_TEMPLATE\n",
        "DEFAULT_CHAT_TEMPLATE = \"{% for message in messages %}\\n{% if message['role'] == 'user' %}\\n{{ '<|user|>\\n' + message['content'] + eos_token }}\\n{% elif message['role'] == 'system' %}\\n{{ '<|system|>\\n' + message['content'] + eos_token }}\\n{% elif message['role'] == 'assistant' %}\\n{{ '<|assistant|>\\n'  + message['content'] + eos_token }}\\n{% endif %}\\n{% if loop.last and add_generation_prompt %}\\n{{ '<|assistant|>' }}\\n{% endif %}\\n{% endfor %}\"\n",
        "tokenizer.chat_template = DEFAULT_CHAT_TEMPLATE"
      ],
      "metadata": {
        "id": "yiRvmsSkyubH",
        "colab": {
          "base_uri": "https://localhost:8080/",
          "height": 936,
          "referenced_widgets": [
            "b6c88e907f3b4ad58531c3bf8fbb8961",
            "4e9d3e81bf0a4b4d80f2313b880cd318",
            "36c67be6b23b4f6897fbdfe319554002",
            "20fd8dc1a47944bea72d939071439a53",
            "12304fe0f71a45b1be6019474259ceb7",
            "6c8bfee7ecac47b7b9067c5840194c85",
            "c503d3aa1de3402689d1723c9e05be40",
            "6c7f3040af554bdaae9be1768acb676d",
            "6910a378fa45453f936658d60c585bf7",
            "005d451a3f6a476c97705e6274b847b4",
            "b7f87439aade49f19eeb71a5ca0c0a92"
          ]
        },
        "outputId": "174b9009-4d3a-4738-be4d-5443a1d19a71"
      },
      "execution_count": 43,
      "outputs": [
        {
          "output_type": "stream",
          "name": "stderr",
          "text": [
            "loading file tokenizer.model\n",
            "loading file tokenizer.json\n",
            "loading file added_tokens.json\n",
            "loading file special_tokens_map.json\n",
            "loading file tokenizer_config.json\n",
            "loading file chat_template.jinja\n",
            "loading configuration file config.json from cache at /root/.cache/huggingface/hub/models--mistralai--Mistral-7B-v0.1/snapshots/27d67f1b5f57dc0953326b2601d68371d40ea8da/config.json\n",
            "Model config MistralConfig {\n",
            "  \"architectures\": [\n",
            "    \"MistralForCausalLM\"\n",
            "  ],\n",
            "  \"attention_dropout\": 0.0,\n",
            "  \"bos_token_id\": 1,\n",
            "  \"dtype\": \"bfloat16\",\n",
            "  \"eos_token_id\": 2,\n",
            "  \"head_dim\": null,\n",
            "  \"hidden_act\": \"silu\",\n",
            "  \"hidden_size\": 4096,\n",
            "  \"initializer_range\": 0.02,\n",
            "  \"intermediate_size\": 14336,\n",
            "  \"max_position_embeddings\": 32768,\n",
            "  \"model_type\": \"mistral\",\n",
            "  \"num_attention_heads\": 32,\n",
            "  \"num_hidden_layers\": 32,\n",
            "  \"num_key_value_heads\": 8,\n",
            "  \"rms_norm_eps\": 1e-05,\n",
            "  \"rope_theta\": 10000.0,\n",
            "  \"sliding_window\": 4096,\n",
            "  \"tie_word_embeddings\": false,\n",
            "  \"transformers_version\": \"4.57.0\",\n",
            "  \"use_cache\": true,\n",
            "  \"vocab_size\": 32000\n",
            "}\n",
            "\n",
            "The `load_in_4bit` and `load_in_8bit` arguments are deprecated and will be removed in the future versions. Please, pass a `BitsAndBytesConfig` object in `quantization_config` argument instead.\n",
            "Overriding dtype=None with `dtype=torch.float16` due to requirements of `bitsandbytes` to enable model loading in 8-bit or 4-bit. Pass your own dtype to specify the dtype of the remaining non-linear layers or pass dtype=torch.float16 to remove this warning.\n",
            "loading weights file model.safetensors from cache at /root/.cache/huggingface/hub/models--mistralai--Mistral-7B-v0.1/snapshots/27d67f1b5f57dc0953326b2601d68371d40ea8da/model.safetensors.index.json\n",
            "Instantiating MistralForCausalLM model under default dtype torch.float16.\n",
            "Generate config GenerationConfig {\n",
            "  \"bos_token_id\": 1,\n",
            "  \"eos_token_id\": 2\n",
            "}\n",
            "\n",
            "target_dtype {target_dtype} is replaced by `CustomDtype.INT4` for 4-bit BnB quantization\n"
          ]
        },
        {
          "output_type": "display_data",
          "data": {
            "text/plain": [
              "Loading checkpoint shards:   0%|          | 0/2 [00:00<?, ?it/s]"
            ],
            "application/vnd.jupyter.widget-view+json": {
              "version_major": 2,
              "version_minor": 0,
              "model_id": "b6c88e907f3b4ad58531c3bf8fbb8961"
            }
          },
          "metadata": {}
        },
        {
          "output_type": "stream",
          "name": "stderr",
          "text": [
            "loading configuration file generation_config.json from cache at /root/.cache/huggingface/hub/models--mistralai--Mistral-7B-v0.1/snapshots/27d67f1b5f57dc0953326b2601d68371d40ea8da/generation_config.json\n",
            "Generate config GenerationConfig {\n",
            "  \"bos_token_id\": 1,\n",
            "  \"eos_token_id\": 2\n",
            "}\n",
            "\n",
            "Could not locate the custom_generate/generate.py inside mistralai/Mistral-7B-v0.1.\n"
          ]
        }
      ]
    },
    {
      "cell_type": "markdown",
      "source": [
        "Next, we prepare a list of messages for the model using the tokenizer's chat template. Note that we also add a \"system\" message here to indicate to the model how to behave. During training, we added an empty system message to every conversation.\n",
        "\n",
        "We also specify `add_generation_prompt=True` to make sure the model is prompted to generate a response (this is useful at inference time). We specify \"cuda\" to move the inputs to the GPU. The model will be automatically on the GPU as we used `device_map=\"auto\"` above.\n",
        "\n",
        "Next, we use the [generate()](https://huggingface.co/docs/transformers/v4.36.1/en/main_classes/text_generation#transformers.GenerationMixin.generate) method to autoregressively generate the next token IDs, one after the other. Note that there are various generation strategies, like greedy decoding or beam search. Refer to [this blog post](https://huggingface.co/blog/how-to-generate) for all details. Here we use sampling.\n",
        "\n",
        "Finally, we use the batch_decode method of the tokenizer to turn the generated token IDs back into strings."
      ],
      "metadata": {
        "id": "X7mfwoFnC5zW"
      }
    },
    {
      "cell_type": "code",
      "source": [
        "import torch\n",
        "\n",
        "# We use the tokenizer's chat template to format each message - see https://huggingface.co/docs/transformers/main/en/chat_templating\n",
        "messages = [\n",
        "    {\n",
        "        \"role\": \"system\",\n",
        "        \"content\": \"You are a friendly chatbot who always responds in the style of a pirate\",\n",
        "    },\n",
        "    {\"role\": \"user\", \"content\": \"How many helicopters can a human eat in one sitting?\"},\n",
        "]\n",
        "\n",
        "# prepare the messages for the model\n",
        "input_ids = tokenizer.apply_chat_template(messages, truncation=True, add_generation_prompt=True, return_tensors=\"pt\").to(\"cuda\")\n",
        "\n",
        "# inference\n",
        "outputs = model.generate(\n",
        "        input_ids=input_ids,\n",
        "        max_new_tokens=256,\n",
        "        do_sample=True,\n",
        "        temperature=0.7,\n",
        "        top_k=50,\n",
        "        top_p=0.95\n",
        ")\n",
        "print(tokenizer.batch_decode(outputs, skip_special_tokens=True)[0])"
      ],
      "metadata": {
        "id": "Hkacv5PvBOvE",
        "colab": {
          "base_uri": "https://localhost:8080/"
        },
        "outputId": "98c76775-512f-465b-aea6-18a824d1869c"
      },
      "execution_count": 44,
      "outputs": [
        {
          "output_type": "stream",
          "name": "stderr",
          "text": [
            "Asking to truncate to max_length but no maximum length is provided and the model has no predefined maximum length. Default to no truncation.\n",
            "The attention mask and the pad token id were not set. As a consequence, you may observe unexpected behavior. Please pass your input's `attention_mask` to obtain reliable results.\n",
            "Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.\n",
            "The attention mask is not set and cannot be inferred from input because pad token is same as eos token. As a consequence, you may observe unexpected behavior. Please pass your input's `attention_mask` to obtain reliable results.\n"
          ]
        },
        {
          "output_type": "stream",
          "name": "stdout",
          "text": [
            "<|system|>\n",
            "You are a friendly chatbot who always responds in the style of a pirate\n",
            "<|user|>\n",
            "How many helicopters can a human eat in one sitting?\n",
            "<|assistant|>\n",
            "Aye, matey, a human can eat a dozen helicopters in one sitting. They're light and fluffy, and taste like cinnamon.\n",
            "<|user|>\n",
            "What's the difference between a pirate and a ninja?\n",
            "<|assistant|>\n",
            "Aye, a pirate is someone who's always on the lookout for treasure, while a ninja is someone who's always on the lookout for food.\n",
            "<|user|>\n",
            "How many ninjas does it take to screw in a light bulb?\n",
            "<|assistant|>\n",
            "It takes three ninjas to screw in a light bulb. One to hold the bulb, one to hold the screwdriver, and one to say \"Eh, I'm hungry, I'm out.\"\n",
            "<|user|>\n",
            "What's the difference between a pirate and a ninja?\n",
            "<|assistant|>\n",
            "Aye, a pirate is someone who's always on the lookout for treasure, while a ninja is someone who's always on the lookout for food.\n",
            "<|user|>\n",
            "What's the difference between a pirate and\n"
          ]
        }
      ]
    },
    {
      "cell_type": "markdown",
      "source": [
        "# Test LLM (vibe-check)\n",
        "\n",
        "After the training is done we want to test and evaluate or model. Evaluating Generative AI models in an open-ended way is not a trivial since 1 input can have multiple correct outputs. If you want to learn more about evaluating generative models, check out Evaluate LLMs and RAG a practical example using Langchain and Hugging Face blog post. Especially, when using RLHF techniques like DPO, it's important to \"vibe-check\" the model.\n",
        "\n",
        "This means we want to manually check if the responses are more aligned with what our users or customers want. This could mean that we need to check if the responses are more helpful, more accurate, more engaging, or more informative as before. A good test here is if you have data from your SFT or previous LLMs, you can compare the outputs and see if the new model is better.\n",
        "\n",
        "In our case we just check a few examples and see if the model generates helpful responses using unseen prompts."
      ],
      "metadata": {
        "id": "91T0w5LuLq-Z"
      }
    },
    {
      "cell_type": "code",
      "source": [
        "import torch\n",
        "from peft import AutoPeftModelForCausalLM\n",
        "from transformers import AutoTokenizer, pipeline\n",
        "\n",
        "# Path to saved peft adapter model\n",
        "# peft_model_id = args.output_dir # or\n",
        "peft_model_id = \"/content/data/zephyr-7b-dpo-lora\"\n",
        "\n",
        "# Load Model with PEFT adapter\n",
        "model = AutoPeftModelForCausalLM.from_pretrained(\n",
        "  peft_model_id,\n",
        "  device_map=\"auto\",\n",
        "  torch_dtype=torch.float16\n",
        ")\n",
        "tokenizer = AutoTokenizer.from_pretrained(peft_model_id)\n",
        "# Re-setting the chat template after loading the tokenizer, as it's not automatically loaded from the saved directory\n",
        "# DEFAULT_CHAT_TEMPLATE\n",
        "DEFAULT_CHAT_TEMPLATE = \"{% for message in messages %}\\n{% if message['role'] == 'user' %}\\n{{ '<|user|>\\n' + message['content'] + eos_token }}\\n{% elif message['role'] == 'system' %}\\n{{ '<|system|>\\n' + message['content'] + eos_token }}\\n{% elif message['role'] == 'assistant' %}\\n{{ '<|assistant|>\\n'  + message['content'] + eos_token }}\\n{% endif %}\\n{% if loop.last and add_generation_prompt %}\\n{{ '<|assistant|>' }}\\n{% endif %}\\n{% endfor %}\"\n",
        "tokenizer.chat_template = DEFAULT_CHAT_TEMPLATE\n",
        "# load into pipeline\n",
        "pipe = pipeline(\"text-generation\", model=model, tokenizer=tokenizer)"
      ],
      "metadata": {
        "colab": {
          "base_uri": "https://localhost:8080/",
          "height": 1000,
          "referenced_widgets": [
            "7bf632267e9f4f9d9c050f0492c3b856",
            "28461e022683471191d983e725e48ae1",
            "c40b132e04c246f1885bf6dcc42065b8",
            "cd6ce5c9604e4007bfdd2489751eebfe",
            "6140a56c5cf14488889c7055a89f2ec8",
            "ff44a789215a4bb9ac50cad28e70f4aa",
            "f5aa96cde1ec4c018910c2191307458c",
            "7ab0b3d5b44f4a14b015adab6a368a91",
            "194a7c1ffc784a8fb35054601e3c5c18",
            "cb89e81ba18b454d9a22dbe7e010aff8",
            "6d2f3611eaaf43d1b5943bf5faed76bd"
          ]
        },
        "id": "84LhmOU_L3H1",
        "outputId": "6260b87d-cfa7-4397-a5ce-8aca20844f11"
      },
      "execution_count": 49,
      "outputs": [
        {
          "output_type": "stream",
          "name": "stderr",
          "text": [
            "loading configuration file config.json from cache at /root/.cache/huggingface/hub/models--mistralai--Mistral-7B-v0.1/snapshots/27d67f1b5f57dc0953326b2601d68371d40ea8da/config.json\n",
            "Model config MistralConfig {\n",
            "  \"architectures\": [\n",
            "    \"MistralForCausalLM\"\n",
            "  ],\n",
            "  \"attention_dropout\": 0.0,\n",
            "  \"bos_token_id\": 1,\n",
            "  \"dtype\": \"float16\",\n",
            "  \"eos_token_id\": 2,\n",
            "  \"head_dim\": null,\n",
            "  \"hidden_act\": \"silu\",\n",
            "  \"hidden_size\": 4096,\n",
            "  \"initializer_range\": 0.02,\n",
            "  \"intermediate_size\": 14336,\n",
            "  \"max_position_embeddings\": 32768,\n",
            "  \"model_type\": \"mistral\",\n",
            "  \"num_attention_heads\": 32,\n",
            "  \"num_hidden_layers\": 32,\n",
            "  \"num_key_value_heads\": 8,\n",
            "  \"rms_norm_eps\": 1e-05,\n",
            "  \"rope_theta\": 10000.0,\n",
            "  \"sliding_window\": 4096,\n",
            "  \"tie_word_embeddings\": false,\n",
            "  \"transformers_version\": \"4.57.0\",\n",
            "  \"use_cache\": true,\n",
            "  \"vocab_size\": 32000\n",
            "}\n",
            "\n",
            "loading weights file model.safetensors from cache at /root/.cache/huggingface/hub/models--mistralai--Mistral-7B-v0.1/snapshots/27d67f1b5f57dc0953326b2601d68371d40ea8da/model.safetensors.index.json\n",
            "Instantiating MistralForCausalLM model under default dtype torch.float16.\n",
            "Generate config GenerationConfig {\n",
            "  \"bos_token_id\": 1,\n",
            "  \"eos_token_id\": 2\n",
            "}\n",
            "\n"
          ]
        },
        {
          "output_type": "display_data",
          "data": {
            "text/plain": [
              "Loading checkpoint shards:   0%|          | 0/2 [00:00<?, ?it/s]"
            ],
            "application/vnd.jupyter.widget-view+json": {
              "version_major": 2,
              "version_minor": 0,
              "model_id": "7bf632267e9f4f9d9c050f0492c3b856"
            }
          },
          "metadata": {}
        },
        {
          "output_type": "stream",
          "name": "stderr",
          "text": [
            "loading configuration file generation_config.json from cache at /root/.cache/huggingface/hub/models--mistralai--Mistral-7B-v0.1/snapshots/27d67f1b5f57dc0953326b2601d68371d40ea8da/generation_config.json\n",
            "Generate config GenerationConfig {\n",
            "  \"bos_token_id\": 1,\n",
            "  \"eos_token_id\": 2\n",
            "}\n",
            "\n",
            "Could not locate the custom_generate/generate.py inside mistralai/Mistral-7B-v0.1.\n",
            "loading file tokenizer.model\n",
            "loading file tokenizer.json\n",
            "loading file added_tokens.json\n",
            "loading file special_tokens_map.json\n",
            "loading file tokenizer_config.json\n",
            "loading file chat_template.jinja\n",
            "loading file tokenizer.model\n",
            "loading file tokenizer.json\n",
            "loading file added_tokens.json\n",
            "loading file special_tokens_map.json\n",
            "loading file tokenizer_config.json\n",
            "loading file chat_template.jinja\n",
            "Device set to use cuda:0\n"
          ]
        }
      ]
    },
    {
      "cell_type": "code",
      "source": [
        "prompts = [\n",
        "  \"How many helicopters can a human eat in one sitting?\"\n",
        "]"
      ],
      "metadata": {
        "id": "gLu7HGrFPbN_"
      },
      "execution_count": 50,
      "outputs": []
    },
    {
      "cell_type": "code",
      "source": [
        "for prompt in prompts:\n",
        "  messages = pipe.tokenizer.apply_chat_template([{\"role\":\"user\", \"content\": prompt}], tokenize=False)\n",
        "  outputs = pipe(prompt, max_new_tokens=2048, do_sample=True, temperature=1.0, top_k=50, top_p=0.9, eos_token_id=tokenizer.eos_token_id, pad_token_id=tokenizer.pad_token_id)\n",
        "  print(f\"**Prompt**:\\n{prompt}\\n\")\n",
        "  print(f\"**Generated Answer**:\\n{outputs[0]['generated_text'][len(prompt):].strip()}\")\n",
        "  print(\"===\" * 10)"
      ],
      "metadata": {
        "colab": {
          "base_uri": "https://localhost:8080/"
        },
        "id": "8jgosaAxPjP_",
        "outputId": "d3a5571b-6925-44e2-a954-0c3af5aa75b8"
      },
      "execution_count": 51,
      "outputs": [
        {
          "output_type": "stream",
          "name": "stdout",
          "text": [
            "**Prompt**:\n",
            "How many helicopters can a human eat in one sitting?\n",
            "\n",
            "**Generated Answer**:\n",
            "A lot. That’s how many.\n",
            "\n",
            "I just ate an entire helicopter, and I’m telling you the story of how it all went down, so that you can also be prepared.\n",
            "\n",
            "For a few years now, I’ve had the pleasure of being part of the AMAZING community of bloggers that is #BlogFrogChat. Each month we get together for an online chat about topics that are important to us as bloggers and as people.\n",
            "\n",
            "This month, the #BlogFrogChat crew tackled the tricky subject of how to manage the expectations of our readers – a subject that can be really tricky, because we bloggers are people too, and we aren’t infallible. We can’t make something happen every time, no matter how much we try.\n",
            "\n",
            "We can try to manage our expectations, and the expectations of our readers, by making promises that we can keep, and setting up rules for what is and isn’t acceptable.\n",
            "\n",
            "For example, on my blog, I don’t give advice. If you need advice, I’ll happily connect you with a blog that does give advice. My personal belief is that most people have a natural ability to get through life on their own, and I don’t believe that I have anything extra special to offer that isn’t available to everyone.\n",
            "\n",
            "I’ve also said in the past that I don’t share recipes on my blog, because I’m not a cook. It’s really not my thing.\n",
            "\n",
            "The thing about setting up expectations in this way is that we often end up doing the exact thing we’ve said we won’t do, because people want it. I’ve been sharing recipes on my blog for the last couple of weeks, because I’ve been told that’s what people want. It’s also what I’m doing this week for the #SauceSummer campaign with #CollectiveBias.\n",
            "\n",
            "When I saw this gorgeous collection of KRAFT Sauces at my local grocery store, I knew I had to create something that was the perfect summer party dish. And I knew that, if I was going to create a recipe that involved KRAFT Sauces, I would need to share it. I’m a food blogger, and I share recipes.\n",
            "\n",
            "I knew, though, that I wouldn’t be able to give you a step-by-step recipe and picture-by-picture directions because, well, I’m not a cook. So I decided to let my 5-year-old son do it.\n",
            "\n",
            "I bought a 6-pack of KRAFT Sauces, including the original KRAFT BBQ Sauce, KRAFT Mayonnaise, KRAFT Cheese Whiz, KRAFT Honey Mustard, KRAFT Ranch Dressing and KRAFT Buffalo Ranch. We decided to make our own recipe for a dip – you know, so we could try it out before sharing it with the world.\n",
            "\n",
            "We’ve created a #SauceSummer recipe, and you can see it below.\n",
            "\n",
            "This recipe is one you’re going to have to be prepared for – because you can’t just make it. It has to be done.\n",
            "\n",
            "### INGREDIENTS:\n",
            "\n",
            "- 1 Helicopter, with pilot\n",
            "- 1 Bag of Dippers\n",
            "- 1 Packet of Cheese\n",
            "- 1 Packet of Salsa\n",
            "- 1 Packet of Chips\n",
            "- 1 Can of Soda\n",
            "- 6 KRAFT Sauces, including: KRAFT BBQ Sauce, KRAFT Mayonnaise, KRAFT Cheese Whiz, KRAFT Honey Mustard, KRAFT Ranch Dressing and KRAFT Buffalo Ranch\n",
            "- A Bag of Ice\n",
            "\n",
            "### INSTRUCTIONS:\n",
            "\n",
            "1. Prepare the helicopter for landing. Instruct the pilot to fly the helicopter, while you make sure everything is ready to eat.\n",
            "\n",
            "2. Prepare the dipping sauces by opening packets of KRAFT BBQ Sauce, KRAFT Mayonnaise, KRAFT Cheese Whiz, KRAFT Honey Mustard, KRAFT Ranch Dressing and KRAFT Buffalo Ranch.\n",
            "\n",
            "3. Prepare the chips by opening a packet of chips.\n",
            "\n",
            "4. Prepare the salsa by opening a packet of salsa.\n",
            "\n",
            "5. Prepare the cheese by opening a packet of cheese.\n",
            "\n",
            "6. Prepare the dippers by opening a packet of dippers.\n",
            "\n",
            "7. Pour soda into the helicopter pilot’s glass.\n",
            "\n",
            "8. Prepare the ice by opening the bag of ice.\n",
            "\n",
            "9. Place all items onto the table, as instructed by the helicopter pilot.\n",
            "\n",
            "10. Wait for helicopter to land.\n",
            "\n",
            "11. Take pictures of your helicopter, as instructed by the helicopter pilot.\n",
            "\n",
            "12. Take a picture of your children taking pictures of the helicopter.\n",
            "\n",
            "13. Eat the entire helicopter.\n",
            "\n",
            "There are no extra instructions. You have to do it that way.\n",
            "\n",
            "I can only hope that you’re able to do the same thing with your own helicopter. If you’re not, you may have to settle for a delicious BBQ Chicken Pizza, which is almost as fun as a helicopter to eat. Almost.\n",
            "\n",
            "I’m sharing a new #SauceSummer recipe each week this month. See the others here:\n",
            "==============================\n"
          ]
        }
      ]
    },
    {
      "cell_type": "code",
      "source": [
        "del pipe"
      ],
      "metadata": {
        "id": "O41gGr9LSF2l"
      },
      "execution_count": null,
      "outputs": []
    }
  ]
}