build-vocabulary-from-text-corpus | Skill Performance & Reviews | TopRankSkills

TopRank Skills

Home / Skills / tools / build-vocabulary-from-text-cor...

build-vocabulary-from-text-corpus

maintained by ECNU-ICALK

star 57 account_tree 5 verified_user MIT License
bolt View GitHub

Automatically builds a vocabulary file from a text dataset by tokenizing, counting frequencies, filtering by minimum frequency, sorting by frequency, and optionally prepending special tokens. Use when you need to generate vocab.txt for NLP models from raw text.

Key Features

  • Comprehensive skill evaluation and performance tracking
  • Community-driven ratings and reviews
  • Easy integration with Claude Code
  • Regular updates and maintenance

Quick Start

TopRank Skills install ECNU-ICALK/build-vocabulary-from-text-corpus

chat Comments (0)

chat_bubble_outline

No comments yet. Be the first to share your thoughts!

Skill Details

GitHub Stars 57
GitHub Forks 5
Created Mar 2026
Last Updated 3 months ago
tools tools automation tools

Related Skills

specs-gen
chevron_right
glm-coding-agent
chevron_right
creating-pr
chevron_right
writing-skills
chevron_right
reviewing-pr
chevron_right

Build your own?

Join 12,000+ developers contributing to the Claude ecosystem.