Table of Contents
sentences - splits text into separate sentences deimited by newlines.
sentences
The sentences utility splits input into
separate sentences delimited by newlines. If sentences are delimited by
more than one line in the input, they will be delimited by exactly two
in the output.
Single-line page feed characters (C character ’\f’) and paragraphs
are preserved in the output. Input must end with a ’\f’.
- Command:
echo -e "Hello, world! - Hello to you, too.\n\nThis is a new paragraph.\n\f" | sentences
- Output:
Hello, world!
- Hello to you too.
This is a new paragraph.
^L
Autocorpus was written by Maciej Pacula (maciej.pacula@gmail.com).
The project website is http://mpacula.com/autocorpus
autocorpus(7)
,
ngrams(1)
, ngrams(5)
, ngrams-freq-filter(1)
, ngrams-sort(1)
, tokenize(1)
,
wiki-articles(1)
, wiki-textify(1)
,
Table of Contents