Hermetic Word Frequency Counter is a dedicated Windows-based desktop utility developed by Hermetic Systems that automates the extraction, counting, and statistical sorting of words and phrases within electronic texts.

Core Functionality
The software works like a graphical front end for text-mining and grep-style utilities. It scans unstructured text files, identifies every unique word, and tallies how often each one occurs.
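The counting step described above can be illustrated with a short Python sketch. This is not the program's own code: the function names `count_words` and `ranked` are hypothetical, and the definition of a word (a run of letters, case-insensitive) is a simplifying assumption.

```python
import re
from collections import Counter

def count_words(text):
    """Tally case-insensitive occurrences of each word in the text."""
    # Assumption: a word is a run of ASCII letters. The real tool lets
    # the user configure which characters may appear inside a word.
    return Counter(re.findall(r"[a-z]+", text.lower()))

def ranked(counts):
    """Return (rank, word, count) rows, most frequent first.

    Ties are broken alphabetically so the order is deterministic.
    """
    ordered = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return [(rank, word, n) for rank, (word, n) in enumerate(ordered, start=1)]

sample = "The cat sat on the mat. The mat was flat."
counts = count_words(sample)

rows = ranked(counts)                 # sorted by frequency
alphabetical = sorted(counts.items())  # sorted by word

for rank, word, n in rows:
    print(rank, word, n)
```

Sorting once by frequency and once by word mirrors the two orderings a frequency counter typically offers; the rank is simply the position in the frequency-sorted list.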
Input Formats: It processes Microsoft Word DOCX files, plain text (.txt), and code/web files like HTML, XML, and PHP. When reading code, it automatically ignores syntax elements like tags while evaluating the actual content.
Sorting Capabilities: Users can sort the final metrics alphabetically or by total frequency. It details both the rank and precise count for every word found.
No File Limits: The application places no limit on the size of the input text file, so it can handle very large data sets.

Granular Text Filtering Settings
The software stands out from basic web-based counters because of its highly specific configuration parameters:
Custom Word Rules: You can explicitly instruct the program to allow or disallow numbers, hyphens, apostrophes, underscores, or colons inside a word string. This permits complex terms—like email addresses, URLs, or chemical compounds—to be counted as a single coherent word.
Lemmatization and Verbs: It includes a setting that converts plural words to their singular forms or maps third-person verb forms to the first-person singular, so that variants of the same word are counted together.
Ignore Lists (Stop Words): It strips out common, uninformative words (such as “the”, “this”, or “an”). The software detects the language automatically (supporting major European languages like English, French, German, Spanish, and Italian) and applies the corresponding ignore file.

Basic vs. Advanced Version
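The filtering settings listed above (custom word rules, plural folding, and ignore lists) can be pictured with a minimal Python sketch. Everything here is an assumption made for illustration: `STOP_WORDS`, `build_word_pattern`, `fold_plural`, and `filtered_counts` are hypothetical names, the stop list is a tiny stand-in for a real ignore file, and the plural rule is a deliberately naive heuristic rather than true lemmatization.

```python
import re
from collections import Counter

# Hypothetical stand-in for an ignore file; real lists are much longer.
STOP_WORDS = {"the", "this", "an", "a", "of", "and", "to"}

def build_word_pattern(allow_digits=False, joiners=""):
    """Compile a pattern for words, optionally allowing digits and joiners.

    Joiners (for example "-" or "'") may only appear between word
    characters, so "well-known" is one word but a stray "-" is not.
    """
    core = "A-Za-z" + ("0-9" if allow_digits else "")
    if joiners:
        return re.compile(rf"[{core}]+(?:[{re.escape(joiners)}][{core}]+)*")
    return re.compile(rf"[{core}]+")

def fold_plural(word):
    """Naive folding: drop a final "s" (cats -> cat, runs -> run)."""
    if len(word) > 3 and word.endswith("s") and not word.endswith(("ss", "us", "is")):
        return word[:-1]
    return word

def filtered_counts(text, pattern, stop_words=STOP_WORDS, fold=True):
    """Count words that match the pattern, skipping stop words."""
    counts = Counter()
    for match in pattern.finditer(text):
        word = match.group().lower()
        if word in stop_words:
            continue
        counts[fold_plural(word) if fold else word] += 1
    return counts

text = "The well-known cats don't chase the cat. This cat runs; dogs run."
plain = filtered_counts(text, build_word_pattern())
custom = filtered_counts(text, build_word_pattern(joiners="-'"))
```

With the default pattern, "well-known" is split into "well" and "known"; once hyphens and apostrophes are allowed as joiners, it is counted as one word. A production tool would need a proper lemmatizer and per-language ignore files rather than these shortcuts.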
The software is split into two primary tiers depending on the depth of analysis required:

| Feature | Basic Version (WFC) | Advanced Version (WFCA) |
| --- | --- | --- |
| Scope | Processes one single file at a time. | Processes multiple files across entire folder trees simultaneously. |
| Phrases | Restricted to single-word counting. | Extracts and counts multi-word phrases or n-grams. |
| Pattern Matching | Exact word matches only. | Supports Regular Expressions (Regex) for complex wildcard searches. |

Output and Data Integration
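Two of the Advanced-version features in the comparison above, phrase counting and regular-expression matching, can be approximated in a few lines of Python. This is only a sketch under stated assumptions: `tokenize`, `ngram_counts`, and `matching_words` are hypothetical helpers, and the program's actual phrase and pattern rules are not documented here.

```python
import re
from collections import Counter

def tokenize(text):
    """Split text into lowercase words (assumption: letters only)."""
    return re.findall(r"[a-z]+", text.lower())

def ngram_counts(tokens, n):
    """Count every run of n consecutive tokens as one phrase.

    Note: this simple version lets phrases span sentence boundaries.
    """
    grams = zip(*(tokens[i:] for i in range(n)))
    return Counter(" ".join(gram) for gram in grams)

def matching_words(counts, pattern):
    """Keep only the entries whose whole word matches the pattern."""
    rx = re.compile(pattern)
    return {word: n for word, n in counts.items() if rx.fullmatch(word)}

text = "New York is big. New York is busy. York is old."
tokens = tokenize(text)

bigrams = ngram_counts(tokens, 2)
b_words = matching_words(Counter(tokens), r"b.*")  # words starting with "b"
```

Here "new york" is counted twice as a phrase, and the pattern `b.*` selects only "big" and "busy" from the single-word counts.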
Once the analysis finishes, the results can be written directly to an external file. The output is delimiter-formatted, so it loads into spreadsheet tools such as Microsoft Excel, where researchers or marketers can chart trends and compile more detailed reports.
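As an illustration of delimiter-formatted output, the sketch below writes a ranked word list with Python's standard `csv` module. The function name `write_report`, the column names, and the choice of a tab delimiter are assumptions for the example, not the program's actual file layout.

```python
import csv
import io
from collections import Counter

def write_report(counts, handle, delimiter=","):
    """Write rank, word, and count rows to an open text handle."""
    writer = csv.writer(handle, delimiter=delimiter, lineterminator="\n")
    writer.writerow(["rank", "word", "count"])
    # Most frequent first; ties broken alphabetically.
    ordered = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    for rank, (word, n) in enumerate(ordered, start=1):
        writer.writerow([rank, word, n])

counts = Counter({"the": 3, "mat": 2, "cat": 1})

# An in-memory buffer keeps the example self-contained.
buffer = io.StringIO()
write_report(counts, buffer, delimiter="\t")
report = buffer.getvalue()
print(report)
```

To write a real file instead, the same function can be given a handle opened with `open(path, "w", newline="", encoding="utf-8")`; a tab- or comma-delimited file of this shape opens directly in a spreadsheet.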