Project info for CEM

Share This Created 19 Sep 2001 at 04:22 UTC by loam, last modified 19 Sep 2001 at 23:17 UTC by loam.



CEM stands for ``Colloquial Entropy Markup.'' A colloquial language or dialect is defined as ``[b]elonging to common speech; characteristic of or proper to ordinary conversation, as distinguished from formal or elevated language.'' [OED]

CEM is a java package for text mining by text markup of Unicode text. Released under the GNU General Public License.

License: GPL

This project has the following developers:

New Advogato Features

New HTML Parser: The long-awaited libxml2 based HTML parser code is live. It needs further work but already handles most markup better than the original parser.

Keep up with the latest Advogato features by reading the Advogato status blog.

If you're a C programmer with some spare time, take a look at the mod_virgule project page and help us with one of the tasks on the ToDo list!

Share this page