1 Introduction
Copyright 2005, 2006, 2007, 2008, 2009, 2010 by Damir Ćavar, Petar Garžina, Larisa Grčić, Tanja Gulan, Damir Kero, Robert Paleka, Franjo Pehar, Pavle Valerjev
(require nltk) |
The S-NLTK library is under continuous development. The current release is considered Alpha, meaning that the coding of desired functionality and specific procedures is not yet completed, but nevertheless, the included procedures and data structures are fully functional, if the specified conditions are respected.
As the library is mainly intended for Scheme implementations that support the R6RS standard, and tested on the PLT-Scheme and Larceny platforms, we also recommend these implementations for learning and development purposes. These Scheme development platforms are available for common hardware environments and operating systems, including Linux and other Unix variants, as well as Mac OS X and Microsoft Windows.
The project S-NLTK was inspired by the impressive Python NLTK [Bird:ea:2009]. The S-NLTK library is not a Scheme implementation of the Python NLTK, it rather brings together procedures, algorithms and utilities for NLP, coded in Scheme R6RS, in the same way as the NLTK does for Python.
We are grateful for feedback and bug reports, as well as for suggestions related to improvements and new functionality, or even the need for more examples and documentation of specific functionalities. If you have code or linguistic data that you might want to contribute to the library under the specified licensing conditions, we would be glad to integrate it. Please submit your contributions, comments and suggestions to the Schemers in Zadar.
Note:
Parts of the S-NLTK library are based on code that was developed for research purposes in the project Semantic-nets and computational lexicology, funded by the Ministry of Research, Education and Sports of the Republic of Croatia, grant number 2120920-0930. Other parts were developed for, and presented at the 18th European Summer School in Logic, Language and Information (ESSLLI 2006) course Introduction to symbolic and statistical NLP in Scheme. Most of the code was written and translated into R6RS by the members of Schemers in Zadar in their spare time.
1.1 Overview
The goal of the Scheme Natural Language ToolKit (S-NLTK) is to provide common functionality for natural language processing (NLP) tasks in Scheme, that can be used for educational purposes, as well as for development of NLP components in productive environments.
The documentation consists of three components:
Description of the library and functionality
Documented implementation of the procedures and algorithms in the library
Documented example applications
!!! TODO !!!
1.2 License
This documentation is part of the Scheme Natural Language Toolkit (S-NLTK).
The Scheme Natural Language Toolkit (S-NLTK) is free software: you can redistribute it and/or modify it under the terms of the GNU Lesser General Public License as published by the Free Software Foundation, either version 3 of the License, or (at your option) any later version.
The Scheme Natural Language Toolkit (S-NLTK) is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU Lesser General Public License for more details.
You should have received a copy of the GNU Lesser General Public License along with the Scheme Natural Language Toolkit (S-NLTK). If not, see http://www.gnu.org/licenses/.