A 125M custom GPT-X2 language model with RoPE, SwiGLU, grouped-query attention, and curriculum-trained code/math normalization.
Visualize Any Hugging Face Model.
Paste a Hugging Face URL or repo name and jump straight into a clear interactive model view.