agents-course/course-certificates-of-excellence Viewer • Updated about 2 hours ago • 3.89k • 564 • 7
huggingface-projects/Deep-RL-Course-Certification Viewer • Updated about 14 hours ago • 1.62k • 2.18k • 16
NEW: @EssentialAI just released Rnj-1, their first 8B model. You can easily fine-tune it with GRPO using TRL to add reasoning capabilities to a compact model. Free Colab link: https://colab.research.google.com/github/huggingface/trl/blob/main/examples/notebooks/grpo_rnj_1_instruct.ipynb More free TRL notebooks: https://huggingface.co/docs/trl/main/en/example_overview#notebooks
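For readers who want a feel for what GRPO fine-tuning with TRL looks like before opening the notebook, here is a minimal sketch. It is not taken from the linked notebook: the Hub id `EssentialAI/rnj-1-instruct`, the `trl-lib/tldr` prompt dataset, the output directory, and the `<think>` formatting reward are all assumptions chosen for illustration, and the completions are assumed to be plain strings rather than chat messages.

```python
import re

# Hypothetical reward: 1.0 when a completion contains non-empty reasoning
# wrapped in <think>...</think>, else 0.0. The tag format is an assumption.
_THINK = re.compile(r"<think>.+?</think>", re.DOTALL)


def format_reward(completions, **kwargs):
    """Return one score per completion (plain-text completions assumed)."""
    return [1.0 if _THINK.search(text) else 0.0 for text in completions]


def build_trainer(model_id="EssentialAI/rnj-1-instruct"):  # assumed Hub id
    # Imported lazily so the reward function can be checked without TRL installed.
    from datasets import load_dataset
    from trl import GRPOConfig, GRPOTrainer

    # Placeholder dataset with a "prompt" column; swap in your own prompts.
    dataset = load_dataset("trl-lib/tldr", split="train")
    args = GRPOConfig(output_dir="rnj-1-grpo")  # hypothetical output directory
    return GRPOTrainer(
        model=model_id,
        reward_funcs=format_reward,
        args=args,
        train_dataset=dataset,
    )

# Usage (downloads the model and dataset, needs a GPU):
#   build_trainer().train()
```

The reward function is the part you would normally tailor to the reasoning behaviour you want; GRPO samples several completions per prompt and favours the ones that score higher relative to the rest of their group.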