lemmatizer

Returns an array of possible lemmas for each token

Cadmium::Lemmatizer

Work in progress. Until Cadmium::POS_Tagger is ready, this lemmatizer returns an array of possible lemmas for a string input. English data is included; data for other languages is available at cadmiumcr/languages.

Installation

  1. Add the dependency to your shard.yml:

    dependencies:
      cadmium_lemmatizer:
        github: cadmiumcr/lemmatizer
    
  2. Run shards install

Usage

require "cadmium_lemmatizer"

Cadmium::Lemmatizer.new.lemmatize("zoomed") # => ["zoom"]
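
Since the lemmatizer takes a single string, a sentence has to be split into tokens before lookup. The following is a minimal sketch of that, not part of the shard itself: it assumes a naive whitespace split is good enough and uses only the `lemmatize` call shown above. The sample sentence and the variable names are illustrative.

```crystal
require "cadmium_lemmatizer"

# Reuse one lemmatizer instance for every token.
lemmatizer = Cadmium::Lemmatizer.new

# Assumption: naive whitespace tokenization; a real tokenizer would
# also handle punctuation and casing.
tokens = "the cats zoomed".split

tokens.each do |token|
  # lemmatize returns an array of candidate lemmas for the token.
  lemmas = lemmatizer.lemmatize(token)
  puts "#{token}: #{lemmas.join(", ")}"
end
```

Because no part-of-speech information is available yet, each token may yield several candidates, and the caller has to choose among them.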

Contributing

  1. Fork it (https://github.com/cadmiumcr/lemmatizer/fork)
  2. Create your feature branch (git checkout -b my-new-feature)
  3. Commit your changes (git commit -am 'Add some feature')
  4. Push to the branch (git push origin my-new-feature)
  5. Create a new Pull Request

License

MIT License
