Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Updated Feb 6, 2025 - Python
Unattended Lightweight Text Classifiers with LLM Embeddings
An NLP project that detects duplicate questions on Quora using transformer-based embeddings, LSTM architectures, and ensemble models, reaching 88% accuracy 🧠💬.
A demo from the blog post comparing MiniLM-based models, using song lyrics and Milvus for vector similarity search. The approach works for any text content.