Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Kmantric 's Collections
grpo-training

grpo-training

updated Sep 11
Upvote
-

  • meta-llama/Llama-3.2-1B-Instruct

    Text Generation • 1B • Updated Oct 24, 2024 • 3.43M • • 1.19k

  • meta-llama/Llama-3.1-8B

    Text Generation • 8B • Updated Oct 16, 2024 • 731k • • 1.96k

  • epfl-llm/meditron-7b

    Text Generation • 7B • Updated Dec 7, 2023 • 13.8k • 301

  • medalpaca/medalpaca-7b

    Text Generation • 7B • Updated Apr 2, 2024 • 3.83k • 90

  • ArGen: Auto-Regulation of Generative AI via GRPO and Policy-as-Code

    Paper • 2509.07006 • Published Sep 6 • 1
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs