The original Brill tagger (in the AI repository)
Scan day: 17 February 2014 UTC
Description: Eric Brill's original trainable rule-based part-of-speech tagger, which is based on error-driven transformation-based learning (TBL). Comes with a model for English. Written in C (with some Perl code).
Package: areas/nlp/parsing/taggers/brill/
CMU Artificial Intelligence Repository
Brill: Trainable Part of Speech Tagger
This directory contains Eric Brill's trainable rule-based part-of-speech tagger. The tagger is based on transformation-based error-driven learning, a technique that has been effective in a number of natural language applications, including part-of-speech and word-sense tagging, prepositional phrase attachment, and syntactic parsing.
The code includes a tokenizer for ASCII English, an English lexicon induced from the Brown corpus, a table of mappings from word suffixes to likely ambiguity classes, and an HMM trained on the odd-numbered sentences in the Brown corpus. For more information, see chapter 6 of Brill's thesis.
Size: 756 chars
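As a rough illustration of what "transformation-based error-driven learning" means, here is a minimal Python sketch. It is not Brill's C code and does not use the package's file formats: the toy sentences, the Penn Treebank-style tags, the function names, and the single rule template ("change tag A to B when the previous tag is C") are all assumptions made for this example. The idea shown is the general loop: start from a most-frequent-tag baseline, propose rules from the remaining errors, keep the rule with the largest net error reduction, and repeat until no rule helps.

```python
from collections import Counter

# Hypothetical toy corpus: each sentence is a list of (word, gold_tag) pairs.
TRAIN = [
    [("the", "DT"), ("can", "NN"), ("rusted", "VBD")],
    [("we", "PRP"), ("can", "MD"), ("run", "VB")],
    [("they", "PRP"), ("can", "MD"), ("swim", "VB")],
    [("a", "DT"), ("run", "NN"), ("ended", "VBD")],
    [("you", "PRP"), ("run", "VB"), ("fast", "RB")],
]


def initial_tags(sentences):
    """Baseline: tag every word with its most frequent gold tag."""
    counts = {}
    for sent in sentences:
        for word, tag in sent:
            counts.setdefault(word, Counter())[tag] += 1
    best = {w: c.most_common(1)[0][0] for w, c in counts.items()}
    return [[best[w] for w, _ in sent] for sent in sentences]


def apply_rule(rule, tags):
    """Apply (frm, to, prev): change frm to to where the previous tag is prev.

    Contexts are read from the input sequence, not from tags already
    rewritten in this pass (a simplification chosen for this sketch).
    """
    frm, to, prev = rule
    out = list(tags)
    for i in range(1, len(tags)):
        if tags[i] == frm and tags[i - 1] == prev:
            out[i] = to
    return out


def count_errors(tagged, sentences):
    """Number of positions where the current tag differs from the gold tag."""
    return sum(
        t != g
        for tags, sent in zip(tagged, sentences)
        for t, (_, g) in zip(tags, sent)
    )


def learn(sentences, max_rules=10):
    """Greedily learn an ordered list of transformation rules."""
    current = initial_tags(sentences)
    rules = []
    for _ in range(max_rules):
        base = count_errors(current, sentences)
        # Error-driven: only propose rules that would fix an observed error.
        candidates = set()
        for tags, sent in zip(current, sentences):
            for i in range(1, len(tags)):
                gold = sent[i][1]
                if tags[i] != gold:
                    candidates.add((tags[i], gold, tags[i - 1]))
        # Score each candidate by net error reduction over the whole corpus.
        best, best_gain = None, 0
        for rule in sorted(candidates):
            retagged = [apply_rule(rule, t) for t in current]
            gain = base - count_errors(retagged, sentences)
            if gain > best_gain:
                best, best_gain = rule, gain
        if best is None:
            break  # no remaining rule reduces the error count
        rules.append(best)
        current = [apply_rule(best, t) for t in current]
    return rules, current


if __name__ == "__main__":
    learned, tagged = learn(TRAIN)
    for frm, to, prev in learned:
        print(f"change {frm} to {to} when the previous tag is {prev}")
    print("remaining errors:", count_errors(tagged, TRAIN))
```

On this toy corpus the baseline mis-tags "can" and "run" after a determiner, and the loop learns two rules that retag them as nouns in that context. A real transformation-based tagger uses many more rule templates (neighbouring words, tags two or three positions away, and so on) and a much larger training corpus.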
WEBSITE Info
Page title: Package: areas/nlp/parsing/taggers/brill/
IP address: 128.2.217.13
WHOIS Info
Name servers: NSAUTH1.NET.CMU.EDU (128.2.1.8), NSAUTH2.NET.CMU.EDU (128.237.148.168)
Dates: activated 24-Apr-1985; last updated 16-Sep-2010; expires 31-Jul-2014