Sha256: 0d74078cc2129877397b047e431355c725b7314c42aec9e2653f000c90530aa8

Contents?: true

Size: 1.23 KB

Versions: 1

Compression:

Stored size: 1.23 KB

Contents

---
layout: post
title:  "Under the hood"
date:   2017-03-07 10:34:13
categories: wordcram update
---

### The required libraries

- cue.language.jar

  Created by Jonathan Feinberg

  cue.language is a small library of Java code and resources that provides the following basic natural-language processing capabilities:

  *  Tokenizing natural language text into individual words
  *  Tokenizing natural language text into sentences
  *  Tokenizing natural language text into n-grams (sequences of 2 or more words that appear next to each other in a sentence)
  *  Counting strings
  *  Detecting which script (alphabet, writing system) is required to represent a text
  *  Guessing what language a text is in
  *  Customizable "stop word" detection for a variety of languages


- WordCram.jar

  Created by Dan Bernier
    WordCram lets you generate word clouds in Processing. It does the heavy lifting – text analysis, collision detection – for you, so you can focus on making your word clouds as beautiful, as revealing, or as silly as you like.


- jsoup-1.7.2.jar

  jsoup is a Java library for working with real-world HTML. It provides a very convenient API for extracting and manipulating data, using the best of DOM, CSS, and jquery-like methods.

Version data entries

1 entries across 1 versions & 1 rubygems

Version Path
ruby_wordcram-2.0.0 docs/_posts/2017-03-07-under_the_hood.md