AI & ML interests

None defined yet.

Recent Activity

KingNish 
posted an update about 6 hours ago
view post
Post
128
Muon vs MuonClip vs Muon+Adamw

Muon has gone from an experiment to a mainstream optimizer, but does it hold up for fine‑tuning? We ran head‑to‑head tests on Qwen3‑4B (10k+ high‑quality instruction rows) to find out.

Short story: Pure Muon converged fastest at the start, but its gradient‑norm spikes made training unstable. MuonClip (Kimi K2’s clipping) stabilizes long pretraining runs, yet in our small‑scale fine‑tune it underperformed, lower token accuracy and slower convergence. The winner was the hybrid: Muon for 2D layers + AdamW for 1D layers. It delivered the best balance of stability and final performance and even beat vanilla AdamW.

Takeaway: for small-scale fine-tuning, hybrid = practical and reliable.

Next Step: scale to larger models/datasets to see if Muon’s spikes become catastrophic or if clipping wins out.

Full Blog Link: https://huggingface.co/blog/KingNish/optimizer-part1
KingNish 
posted an update 2 days ago
AnSungJae3489 
posted an update 3 months ago
view post
Post
2601
ShareGPT? How about ShareGPT-X?

We release **92K** Human with LLM conversations as a refresh and update over the original ShareGPT Dataset.

DSULT-Core/ShareGPT-X
KingNish 
posted an update 4 months ago
KingNish 
posted an update 6 months ago
view post
Post
1179
What's currently the biggest gap in Open Source Datasets ??
·
KingNish 
posted an update about 1 year ago
KingNish 
posted an update about 1 year ago
view post
Post
8315
Exciting news! Introducing super-fast AI video assistant, currently in beta. With a minimum latency of under 500ms and an average latency of just 600ms.

DEMO LINK:
KingNish/Live-Video-Chat
  • 1 reply
·
KingNish 
posted an update about 1 year ago
KingNish 
posted an update about 1 year ago
view post
Post
3631
Mistral Nemo is better than many models in 1st grader level reasoning.
KingNish 
posted an update about 1 year ago
view post
Post
3963
I am experimenting with Flux and trying to push it to its limits without training (as I am GPU-poor 😅).
I found some flaws in the pipelines, which I resolved, and now I am able to generate an approx similar quality image as Flux Schnell 4 steps in just 1 step.
Demo Link:
KingNish/Realtime-FLUX

  • 1 reply
·
KingNish 
posted an update about 1 year ago
view post
Post
1957
I am excited to announce a major speed updated in Voicee, a superfast voice assistant.

It has now achieved latency <250 ms.
While its average latency is about 500ms.
KingNish/Voicee

This become Possible due to newly launched @sambanovasystems cloud.

You can also use your own API Key to get fastest speed.
You can get on from here: https://cloud.sambanova.ai/apis

For optimal performance use Google Chrome.

Please try Voicee and share your valuable feedback to help me further improve its performance and usability.
Thank you!
KingNish 
posted an update over 1 year ago
view post
Post
3625
Introducing Voicee, A superfast voice fast assistant.
KingNish/Voicee
It achieved latency <500 ms.
While its average latency is 700ms.
It works best in Google Chrome.
Please try and give your feedbacks.
Thank you. 🤗
·
KingNish 
posted an update over 1 year ago
view post
Post
5918
Introducing OpenCHAT mini: a lightweight, fast, and unlimited version of OpenGPT 4o.

KingNish/OpenCHAT-mini2

It has unlimited web search, vision and image generation.

Please take a look and share your review. Thank you! 🤗
·
KingNish 
posted an update over 1 year ago
view post
Post
15165
OpenGPT 4o now features WEB SEARCH

This feature enhances the capabilities of OpenGPT 4o, allowing it to fetch and integrate the latest information from the web directly into its responses.
Try Now: KingNish/OpenGPT-4o

With WEB SEARCH, OpenGPT 4o becomes an even more versatile and dynamic AI, ready to assist with up-to-date data retrieval and analysis.
·
KingNish 
posted an update over 1 year ago
view post
Post
6497
I am pleased to announce 2 amazing AI demos:

1. Chat with Google Agent - This includes three AI models that allow you to converse with an AI, which provides answers by searching Google.
Demo Link: poscye/google-go

2. HelpingAI 9B - A model that surpassed all top AIs with the highest EQ benchmark score of 89.23. It specializes in understanding human emotions and responding in human style.
Demo Link: https://huggingface.co/spaces/Abhaykoul/HelpingAI-9B
Model Link: https://huggingface.co/OEvortex/HelpingAI-9B
Blog Link: https://huggingface.co/blog/KingNish/helpingai-9b
  • 2 replies
·
KingNish 
posted an update over 1 year ago
view post
Post
3770
ChatGPT made Custom GPTs Free for Everyone.

Yes, you can use them but...
with limitations like
You can't use DallE 😥,
You can't make Custom GPTs
And chat limit also😥.
But...
We already have an open-source alternative like Hugging Chat, where you can create your custom assistant, generate, edit images, without any chat limit.

Try both of them from here:
https://chatgpt.com/gpts
https://huggingface.co/chat

and don't forget to Give your review here 👇:
·