FrequencyWords Dictionary Data
==============================

The OCR dictionary files in app/src/main/assets/dictionaries/ are derived from:

Project: FrequencyWords
Source:  https://github.com/hermitdave/FrequencyWords
Author:  Hermit Dave

License: Creative Commons Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
         https://creativecommons.org/licenses/by-sa/4.0/

The dictionary data is based on word frequency lists extracted from Wikipedia.

-----------------------------------------------------------
Creative Commons Attribution-ShareAlike 4.0 International
-----------------------------------------------------------

You are free to:

  Share — copy and redistribute the material in any medium or format
  Adapt — remix, transform, and build upon the material for any purpose,
          even commercially

Under the following terms:

  Attribution — You must give appropriate credit, provide a link to the
                license, and indicate if changes were made. You may do so
                in any reasonable manner, but not in any way that suggests
                the licensor endorses you or your use.

  ShareAlike — If you remix, transform, or build upon the material, you
               must distribute your contributions under the same license
               as the original.

  No additional restrictions — You may not apply legal terms or
                               technological measures that legally restrict
                               others from doing anything the license permits.

Full license text: https://creativecommons.org/licenses/by-sa/4.0/legalcode

-----------------------------------------------------------
Included Dictionary Files
-----------------------------------------------------------

The following language dictionaries are included:

- ara.txt.gz    (Arabic)
- ces.txt.gz    (Czech)
- chi_sim.txt.gz (Chinese Simplified)
- chi_tra.txt.gz (Chinese Traditional)
- dan.txt.gz    (Danish)
- deu.txt.gz    (German)
- eng.txt.gz    (English)
- fas.txt.gz    (Persian)
- fra.txt.gz    (French)
- hun.txt.gz    (Hungarian)
- ita.txt.gz    (Italian)
- nld.txt.gz    (Dutch)
- nor.txt.gz    (Norwegian)
- pol.txt.gz    (Polish)
- por.txt.gz    (Portuguese)
- ron.txt.gz    (Romanian)
- rus.txt.gz    (Russian)
- slk.txt.gz    (Slovak)
- spa.txt.gz    (Spanish)
- swe.txt.gz    (Swedish)
- tha.txt.gz    (Thai)

Each file contains the 50,000 most frequent words for the respective language.
