build-vocabulary-from-text-corpus
maintained by ECNU-ICALK
star
57
account_tree
5
verified_user
MIT License
Automatically builds a vocabulary file from a text dataset by tokenizing, counting frequencies, filtering by minimum frequency, sorting by frequency, and optionally prepending special tokens. Use when you need to generate vocab.txt for NLP models from raw text.
Key Features
- Comprehensive skill evaluation and performance tracking
- Community-driven ratings and reviews
- Easy integration with Claude Code
- Regular updates and maintenance
Quick Start
TopRank Skills install ECNU-ICALK/build-vocabulary-from-text-corpus
chat Comments (0)
Sign in to join the discussion and leave a comment.
Skill Details
GitHub Stars
57
GitHub Forks
5
Created
Mar 2026
Last Updated
3 months ago
tools
tools automation tools
Related Skills
Build your own?
Join 12,000+ developers contributing to the Claude ecosystem.
No comments yet. Be the first to share your thoughts!