-
Notifications
You must be signed in to change notification settings - Fork 45
/
Copy pathmain.py
27 lines (19 loc) · 1.21 KB
/
main.py
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
"""
# Mini-NanoGPT
Based on karpathy/nanoGPT with a GUI and extended features that make GPT model training intuitive and accessible. Currently implements native GPT training, with GPT-2 fine-tuning in development.
- 🚀 One-click data processing, training and inference
- 🎨 Real-time training visualization and logging
- 🔧 Character-level and GPT-2 tokenizer support
- 💾 Checkpoint resume and model evaluation
- 🌏 Multi-language interface (English/Chinese)
- 📊 Rich learning rate scheduling options
- 🎛️ Visual configuration for all parameters, no code editing needed
Compared to the original nanoGPT, this project adds more practical features and flexible configuration options, giving you complete control over the training process. Whether for learning or experimentation, it helps you explore GPT models more easily.
Built with PyTorch and Gradio, featuring clean code structure, perfect for deep learning beginners and researchers.
"""
import torch._dynamo
torch._dynamo.config.suppress_errors = True
from modules.interface import build_app_interface
if __name__ == "__main__":
demo = build_app_interface()
demo.launch(server_name="0.0.0.0", server_port=7860, share=False)