| 1 | {{{ |
| 2 | #!html |
| 3 | |
| 4 | <h1 style="font-size: xx-large"><span style="color: #D11E11">No</span>Sketch Engine</h1> |
| 5 | <style type="text/css"> |
| 6 | p,h2,a,pre.wiki,td { |
| 7 | font-size: medium; |
| 8 | } |
| 9 | h1 { |
| 10 | font-size: large; |
| 11 | } |
| 12 | </style> |
| 13 | }}} |
| 14 | Welcome to !NoSketch Engine, an open-source project combining Manatee and Bonito and Crystal into a powerful and free corpus management system. !NoSketch Engine is a limited version of the software empowering the famous [https://www.sketchengine.eu/ Sketch Engine] service, a commercial variant offering word sketches, thesaurus, keyword computation, user-friendly corpus creation and many other excellent features. |
| 15 | |
| 16 | Try [https://auth.sketchengine.eu/#register Sketch Engine trial account] - word sketches, thesaurus, keywords, online corpus building and space for your corpora, online availability and technical support. See overview of [https://www.sketchengine.eu/nosketch-engine/ Sketch Engine versus NoSketch Engine]. |
| 17 | |
| 18 | = News = |
| 19 | |
| 20 | For receiving updates about new versions and futures, please subscribe to the [https://groups.google.com/a/sketchengine.co.uk/forum/#!forum/noske NoSketch Engine Google group]. You are free to use the [https://www.sketchengine.eu/documentation/ documentation available for commercial Sketch Engine]. |
| 21 | |
| 22 | = !NoSketch Engine packages = |
| 23 | == manatee == |
| 24 | Manatee is a corpus management and query system. License: GPLv2+. |
| 25 | |
| 26 | == bonito == |
| 27 | Bonito is an API interface for the Manatee corpus management system. License: GPLv2+. |
| 28 | |
| 29 | == gdex == |
| 30 | GDEX (Good Dictionary Examples) is a Bonito module for sorting concordances according to their suitability as dictionary examples. License: GPLv3. |
| 31 | |
| 32 | == crystal == |
| 33 | Crystal is a web interface for Sketch Engine. License: GPLv3. |
| 34 | |
| 35 | == third party packages == |
| 36 | Bonito wants [https://github.com/seveas/python-prctl python-prctl] and [https://launchpad.net/python-signalfd python-signalfd] (required) to be installed. |
| 37 | |
| 38 | = Downloads = |
| 39 | == Latest stable release == |
| 40 | |
| 41 | '''You should always download the latest versions of all components. |
| 42 | |
| 43 | || ||= '''manatee-open''' =||= '''bonito-open''' =||= '''gdex''' =||= '''crystal-open''' =||= '''sample corpus''' =|| |
| 44 | || tar.gz || [http://corpora.fi.muni.cz/noske/current/src/manatee-open-2.167.10.tar.gz manatee-open-2.167.10.tar.gz] || [http://corpora.fi.muni.cz/noske/current/src/bonito-open-4.24.6.tar.gz bonito-open-4.24.6.tar.gz] || [http://corpora.fi.muni.cz/noske/current/src/gdex-3.12.tar.gz gdex-3.12.tar.gz] || [http://corpora.fi.muni.cz/noske/current/src/crystal-open-2.14.tar.gz crystal-open-2.14.tar.gz] || [http://corpora.fi.muni.cz/noske/current/src/susanne-example-source.tar.bz2 susanne-example-source.tar.bz2] || |
| 45 | || rpm (Centos 7) || [http://corpora.fi.muni.cz/noske/current/centos7/manatee-open/ 2.167.10] || [http://corpora.fi.muni.cz/noske/current/centos7/bonito-open/ 4.24.6] || [http://corpora.fi.muni.cz/noske/current/centos7/gdex/ 3.12] || [http://corpora.fi.muni.cz/noske/current/centos7/crystal-open/ 2.14] || [https://corpora.fi.muni.cz/noske/current/centos7/manatee-open/ 2.167.10] || |
| 46 | |
| 47 | == Older releases == |
| 48 | |
| 49 | Older releases can be downloaded from the [http://corpora.fi.muni.cz/noske/archive archive]. |
| 50 | |
| 51 | = Build and installation = |
| 52 | == manatee == |
| 53 | |
| 54 | {{{ |
| 55 | tar xzvf manatee-open-<version>.tar.gz |
| 56 | cd manatee-open-<version> |
| 57 | ./configure PYTHON=python2 --with-pcre |
| 58 | make |
| 59 | sudo make install |
| 60 | ldconfig |
| 61 | }}} |
| 62 | == bonito == |
| 63 | |
| 64 | {{{ |
| 65 | tar xzvf bonito-open-<version>.tar.gz |
| 66 | cd bonito-open-<version> |
| 67 | ./configure |
| 68 | make |
| 69 | sudo make install |
| 70 | sudo ./setupbonito <CGIPATH> <DATAPATH> |
| 71 | # where CGIPATH is the your webserver CGI directory and DATAPATH is a data directory writable by the webserver |
| 72 | }}} |
| 73 | |
| 74 | == gdex == |
| 75 | |
| 76 | {{{ |
| 77 | tar xzvf gdex-<version>.tar.gz |
| 78 | cd gdex-<version> |
| 79 | sudo python2 setup.py install |
| 80 | }}} |
| 81 | |
| 82 | == crystal == |
| 83 | |
| 84 | {{{ |
| 85 | tar xzvf crystal-open-<version>.tar.gz |
| 86 | cd crsytal-open-<version> |
| 87 | make |
| 88 | sudo make install VERSION=<version> |
| 89 | }}} |
| 90 | |
| 91 | == Installation from RPM packages == |
| 92 | !NoSketch Engine packages |
| 93 | {{{ |
| 94 | rpm -ivh crystal-open-*.el7.noarch bonito4-open-*.el7.noarch manatee-open-*.el7.x86_64 manatee-open-python-*.el7.x86_64 |
| 95 | }}} |
| 96 | |
| 97 | sample corpora |
| 98 | |
| 99 | {{{ |
| 100 | rpm -ivh manatee-open-susanne-*.el7.noarch |
| 101 | }}} |
| 102 | |
| 103 | |
| 104 | = Configuration = |
| 105 | == Apache (httpd) configuration == |
| 106 | |
| 107 | {{{ |
| 108 | # all corpora are accessible for every one |
| 109 | Alias /crystal /var/www/crystal |
| 110 | |
| 111 | Alias /bonito /var/www/bonito |
| 112 | |
| 113 | <Directory /var/www/bonito> |
| 114 | AllowOverride All |
| 115 | Options +ExecCGI |
| 116 | AddHandler cgi-script .cgi |
| 117 | </Directory> |
| 118 | }}} |
| 119 | |
| 120 | {{{ |
| 121 | # authentication for access all corpora required |
| 122 | Alias /crystal-auth /var/www/crystal-auth |
| 123 | <Location "/crystal-auth"> |
| 124 | AuthType Basic |
| 125 | AuthName "Secure Content" |
| 126 | AuthUserFile /var/lib/bonito/htpasswd |
| 127 | Require valid-user |
| 128 | </Location> |
| 129 | |
| 130 | Alias /bonito-auth /var/www/bonito |
| 131 | <Location "/bonito-auth"> |
| 132 | AuthType Basic |
| 133 | AuthName "Secure Content" |
| 134 | AuthUserFile /var/lib/bonito/htpasswd |
| 135 | Require valid-user |
| 136 | </Location> |
| 137 | }}} |
| 138 | |
| 139 | == Bonito (run.cgi) configuration == |
| 140 | Bonito configuration file is `run.cgi` you may run multiple instances just by copying this file and changing the configuration. |
| 141 | |
| 142 | To enable authentication set `_anonymous = False` |
| 143 | |
| 144 | |
| 145 | == Crystal (config.js) configuration == |
| 146 | {{{ |
| 147 | # set URL to run.cgi script of bonito |
| 148 | URL_BONITO: "https://no.sketchengine.co.uk/bonito/run.cgi/", |
| 149 | }}} |
| 150 | |
| 151 | = Credits = |
| 152 | Finlib, Manatee and Bonito2 have been crafted by [http://www.fi.muni.cz/~pary/ Pavel Rychlý], starting with his [http://www.fi.muni.cz/~pary/dis.pdf PhD thesis]. All the components are still being developed in collaboration with the Sketch Engine development team. When using !NoSketch Engine for research purposes, please cite: |
| 153 | |
| 154 | ''' ''Rychlý, Pavel. !Manatee/Bonito - A Modular Corpus Manager. In 1st Workshop on Recent Advances in Slavonic Natural Language Processing. Brno : Masaryk University, 2007. p. 65-70. ISBN 978-80-210-4471-5.'' ''' |
| 155 | |
| 156 | This software was partly supported within the projects LC536 and 2C06009. |
| 157 | |
| 158 | For a list of related publications, please refer to the Sketch Engine [https://www.sketchengine.eu/bibliography-of-sketch-engine/ publications page]. |
| 159 | |
| 160 | = Testing installation = |
| 161 | !NoSketch Engine plain installation: |
| 162 | * https://no.sketchengine.co.uk/crystal |
| 163 | |
| 164 | !NoSketch Engine with enabled authentication (test/t): |
| 165 | * https://no.sketchengine.co.uk/crystal-auth |
| 166 | |
| 167 | = (No)Sketch Engine installations over the world = |
| 168 | {{{ |
| 169 | #!html |
| 170 | <iframe src="https://www.google.com/maps/d/u/0/embed?mid=1NkpLRybzQmKUg5FC1U2WlVoa-Tc" width="640" height="480"></iframe> |
| 171 | }}} |
| 172 | |
| 173 | |