Source: ucto
Section: science
Priority: extra
Maintainer: Ubuntu Developers <ubuntu-devel-discuss@lists.ubuntu.com>
XSBC-Original-Maintainer: Debian Science Team <debian-science-maintainers@lists.alioth.debian.org>
Uploaders: Joost van Baal-Ilić <joostvb@debian.org>, Ko van der Sloot <ko.vandersloot@uvt.nl>
Build-Depends: cdbs, debhelper (>= 7), pkg-config, libxml2-dev, libicu-dev, libfolia-dev, libticcutils2-dev, dh-autoreconf
Standards-Version: 3.9.3
Homepage: http://ilk.uvt.nl/
Vcs-Svn: svn://svn.debian.org/svn/debian-science/packages/ucto/trunk
Vcs-Browser: http://svn.debian.org/wsvn/debian-science/packages/ucto/trunk/

Package: ucto
Architecture: any
Depends: ${shlibs:Depends}, ${misc:Depends}
Description: Unicode Tokenizer
 Ucto can tokenize UTF-8 encoded text files (i.e. separate words from
 punctuation, split sentences, generate n-grams), and  offers several other
 basic preprocessing steps (change case, count words/characters and reverse
 lines) that make your text suited for further processing such as indexing,
 part-of-speech tagging, or machine translation.
 .
 Ucto is a product of the ILK Research Group, Tilburg University (The
 Netherlands).
 .
 If you are interested in machine parsing of UTF-8 encoded text files, e.g. to
 do scientific research in natural language processing, ucto will likely be of
 use to you.

Package: libucto-dev
Section: libdevel
Architecture: any
Depends: libucto2 (= ${binary:Version}), ${misc:Depends}
Conflicts: libucto1-dev
Replaces: libucto1-dev
Description: Unicode Tokenizer - development
 Ucto can tokenize UTF-8 encoded text files (i.e. separate words from
 punctuation, split sentences, generate n-grams), and  offers several other
 basic preprocessing steps (change case, count words/characters and reverse
 lines) that make your text suited for further processing such as indexing,
 part-of-speech tagging, or machine translation.
 .
 Ucto is a product of the ILK Research Group, Tilburg University (The
 Netherlands).
 .
 This package provides the ucto header files required to compile C++ programs
 that use ucto.

Package: libucto2
Section: libs
Architecture: any
Depends: ${shlibs:Depends}, ${misc:Depends}
Description: Unicode Tokenizer - runtime
 Ucto can tokenize UTF-8 encoded text files (i.e. separate words from
 punctuation, split sentences, generate n-grams), and  offers several other
 basic preprocessing steps (change case, count words/characters and reverse
 lines) that make your text suited for further processing such as indexing,
 part-of-speech tagging, or machine translation.
 .
 Ucto is a product of the ILK Research Group, Tilburg University (The
 Netherlands).
 .
 This package provides the runtime files required to run programs that use
 ucto.

