2012-02-02

Future work: Computational literacy


Future work: Computational literature

I don't know what to call this kind of approach to natural language; I might say computational literacy, or something like that. As far as I know, there is some research that takes a similar approach. For instance, some spam filters use an entropy-based approach, and some people use statistical methods to identify the author of a document. In a science fiction novel, Asimov wrote a scene in which a politician talks a lot, but an information analysis reveals that the talk contains no information at all (the Foundation series).

We can extend the presented method in a more systematic way. For example, we can analyze famous, widely available books, e.g., the Bible, some of Shakespeare's works, IKEA catalogs, and so on. Also, translations of the Bible have changed throughout history, and I would like to see how the information in them has changed over time. If you know of any research along these lines, please leave a comment.

Appendix 1: person + tree = ?

The Kanji that combines a person (人) with a tree (木) means ``rest (休).'' When I was an elementary school student, the explanation of this character was ``a person is resting under a tree.'' Many Kanji are composed of several basic Kanji. A tree (木) is a tree, two trees (林) mean woods, and three trees (森) mean a forest. There is no four-tree character, but if one existed, it would mean jungle.

Appendix 2: Information theory, entropy, and compression algorithms

Some readers may not be familiar with information theory, entropy, or compression algorithms. In this article, I could not explain why we can measure the entropy of a document by compressing it. If you want to know more, the following Wikipedia article is a good start: http://en.wikipedia.org/wiki/Information_theory
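As a rough illustration of the link between compression and entropy, here is a minimal Python sketch. It is not the method used in this article: the choice of zlib as the compressor, the function names char_entropy_bits and compressed_bits_per_char, and the two sample strings are all assumptions made for the example. The underlying idea is that a lossless compressor cannot, on average, encode a source in fewer bits than its entropy, so the compressed size gives an upper-bound estimate of the information in a text.

```python
import math
import random
import zlib
from collections import Counter


def char_entropy_bits(text: str) -> float:
    """Zero-order Shannon entropy of the character frequencies, in bits per character."""
    if not text:
        raise ValueError("text must not be empty")
    total = len(text)
    counts = Counter(text)
    return -sum((n / total) * math.log2(n / total) for n in counts.values())


def compressed_bits_per_char(text: str) -> float:
    """Size of the zlib-compressed UTF-8 text, in bits per character (hypothetical helper)."""
    if not text:
        raise ValueError("text must not be empty")
    compressed = zlib.compress(text.encode("utf-8"), 9)
    return 8 * len(compressed) / len(text)


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so the sample text is reproducible
    repetitive = "ab" * 2000
    scrambled = "".join(rng.choice("abcdefghijklmnopqrstuvwxyz") for _ in range(4000))
    for name, sample in [("repetitive", repetitive), ("scrambled", scrambled)]:
        print(name,
              round(char_entropy_bits(sample), 3),
              round(compressed_bits_per_char(sample), 3))
```

The repetitive sample compresses to far fewer bits per character than its character-frequency entropy suggests, because the compressor exploits the repeated pattern while plain character counts ignore the order of characters. The scrambled sample has no such pattern, so it compresses much less.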

Acknowledgments

This article is based on many party discussions, in which many of my friends participated. They started when I lived in Saarbrücken and continued until the Grünkohl party. The ideas, for instance, applying the compression method to the Bible and tracking the history of the compressed size of the Bible, were born from these discussions with my friends. I thank all my friends who participated in them.
