Stanford AI

university

https://www.ai.stanford.edu

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

adityatadimeti authored a paper 7 days ago

LFM2 Technical Report

yuegao authored a paper 7 months ago

Aligning Pretraining for Detection via Object-Level Contrastive Learning

nouamanetazi authored a paper 8 months ago

SmolVLM: Redefining small and efficient multimodal models

View all activity

Papers

Intelligence per Watt: Measuring Intelligence Efficiency of Local AI

In-the-Flow Agentic System Optimization for Effective Planning and Tool Use

View all Papers

KingNish

posted an update about 6 hours ago

Post

128

Muon vs MuonClip vs Muon+Adamw

Muon has gone from an experiment to a mainstream optimizer, but does it hold up for fine‑tuning? We ran head‑to‑head tests on Qwen3‑4B (10k+ high‑quality instruction rows) to find out.

Short story: Pure Muon converged fastest at the start, but its gradient‑norm spikes made training unstable. MuonClip (Kimi K2’s clipping) stabilizes long pretraining runs, yet in our small‑scale fine‑tune it underperformed, lower token accuracy and slower convergence. The winner was the hybrid: Muon for 2D layers + AdamW for 1D layers. It delivered the best balance of stability and final performance and even beat vanilla AdamW.

Takeaway: for small-scale fine-tuning, hybrid = practical and reliable.

Next Step: scale to larger models/datasets to see if Muon’s spikes become catastrophic or if clipping wins out.

Full Blog Link: https://huggingface.co/blog/KingNish/optimizer-part1

KingNish

posted an update 2 days ago

Post

2318

I tested Muon vs MuonClip vs Muon+AdamW for fine-tuning LLMs
Just published a blog on that, Read here 👉 https://huggingface.co/blog/KingNish/optimizer-part1

1 reply

AnSungJae3489

posted an update 3 months ago

Post

2601

ShareGPT? How about ShareGPT-X?

We release **92K** Human with LLM conversations as a refresh and update over the original ShareGPT Dataset.

DSULT-Core/ShareGPT-X

KingNish

posted an update 4 months ago

Post

2152

Wan 2.2 fast upto 10x faster than original wan 2.2

Model: FastVideo/FastWan2.2-TI2V-5B-FullAttn-Diffusers

Space: KingNish/wan2-2-fast

KingNish

posted an update 6 months ago

Post

1179

What's currently the biggest gap in Open Source Datasets ??

5 replies

yuegao

authored a paper 7 months ago

Aligning Pretraining for Detection via Object-Level Contrastive Learning

Paper • 2106.02637 • Published Jun 4, 2021

Kameshr

updated a dataset 9 months ago

Stanford/Compiled_COT

Viewer • Updated Mar 15 • 2.23M • 83 • 2

Kameshr

published a dataset 9 months ago

Stanford/Compiled_COT

Viewer • Updated Mar 15 • 2.23M • 83 • 2

yuegao

authored a paper 9 months ago

FluidNexus: 3D Fluid Reconstruction and Prediction from a Single Video

Paper • 2503.04720 • Published Mar 6 • 1

KingNish

posted an update about 1 year ago

Post

11062

Realtime Whisper Large v3 Turbo Demo:
It transcribes audio in about 0.3 seconds.

KingNish/Realtime-whisper-large-v3-turbo

2 replies

KingNish

posted an update about 1 year ago

Post

8315

Exciting news! Introducing super-fast AI video assistant, currently in beta. With a minimum latency of under 500ms and an average latency of just 600ms.

DEMO LINK:
KingNish/Live-Video-Chat

1 reply

KingNish

posted an update about 1 year ago

Post

4109

A super good and fast image inpainting demo is here.
Its' super cool and realistic.

Demo by @OzzyGT (Must try):
OzzyGT/diffusers-fast-inpaint

KingNish

posted an update about 1 year ago

Post

3631

Mistral Nemo is better than many models in 1st grader level reasoning.

KingNish

posted an update about 1 year ago

Post

3963

I am experimenting with Flux and trying to push it to its limits without training (as I am GPU-poor 😅).
I found some flaws in the pipelines, which I resolved, and now I am able to generate an approx similar quality image as Flux Schnell 4 steps in just 1 step.
Demo Link:
KingNish/Realtime-FLUX

1 reply

KingNish

posted an update about 1 year ago

Post

1957

I am excited to announce a major speed updated in Voicee, a superfast voice assistant.

It has now achieved latency <250 ms.
While its average latency is about 500ms.
KingNish/Voicee

This become Possible due to newly launched @sambanovasystems cloud.

You can also use your own API Key to get fastest speed.
You can get on from here: https://cloud.sambanova.ai/apis

For optimal performance use Google Chrome.

Please try Voicee and share your valuable feedback to help me further improve its performance and usability.
Thank you!

KingNish

posted an update over 1 year ago

Post

3625

Introducing Voicee, A superfast voice fast assistant.
KingNish/Voicee
It achieved latency <500 ms.
While its average latency is 700ms.
It works best in Google Chrome.
Please try and give your feedbacks.
Thank you. 🤗

3 replies

KingNish

posted an update over 1 year ago

Post

5918

Introducing OpenCHAT mini: a lightweight, fast, and unlimited version of OpenGPT 4o.

KingNish/OpenCHAT-mini2

It has unlimited web search, vision and image generation.

Please take a look and share your review. Thank you! 🤗

7 replies

KingNish

posted an update over 1 year ago

Post

15165

OpenGPT 4o now features WEB SEARCH

This feature enhances the capabilities of OpenGPT 4o, allowing it to fetch and integrate the latest information from the web directly into its responses.
Try Now: KingNish/OpenGPT-4o

With WEB SEARCH, OpenGPT 4o becomes an even more versatile and dynamic AI, ready to assist with up-to-date data retrieval and analysis.

30 replies

KingNish

posted an update over 1 year ago

Post

6497

I am pleased to announce 2 amazing AI demos:

1. Chat with Google Agent - This includes three AI models that allow you to converse with an AI, which provides answers by searching Google.
Demo Link: poscye/google-go

2. HelpingAI 9B - A model that surpassed all top AIs with the highest EQ benchmark score of 89.23. It specializes in understanding human emotions and responding in human style.
Demo Link: https://huggingface.co/spaces/Abhaykoul/HelpingAI-9B
Model Link: https://huggingface.co/OEvortex/HelpingAI-9B
Blog Link: https://huggingface.co/blog/KingNish/helpingai-9b

2 replies

KingNish

posted an update over 1 year ago

Post

3770

ChatGPT made Custom GPTs Free for Everyone.

Yes, you can use them but...
with limitations like
You can't use DallE 😥,
You can't make Custom GPTs
And chat limit also😥.
But...
We already have an open-source alternative like Hugging Chat, where you can create your custom assistant, generate, edit images, without any chat limit.

Try both of them from here:
https://chatgpt.com/gpts
https://huggingface.co/chat

and don't forget to Give your review here 👇:

4 replies

AI & ML interests

Recent Activity

Papers

Team members 443

Stanford's activity