Rethinking Superintelligence without Language
Language disguises the thought; so that from the external form of the clothes one cannot infer the form of the thought they clothe, because the external form of the clothes is constructed with quite another object than to let the form of the body be recognized. — Ludwig Wittgenstein, Tractatus Logico-Philosophicus.
Despite all the hype generated by big corps trying to rise billions on the idea that “scaling LLMs is all you need” to achieve AGI/ASI/etc. — whatever that means and whatever will be the benefits — let me mind-wander about language being the conveyer of the said intelligence.
Language is a model of the world which evolved because of the need to collaborate. Our complex language system evolved
as an information compression mechanism, first transmissible through acoustic waves, then made long-lasting and
immutable via written forms. It allowed us to communicate from bits of information to complete cultural identities,
enabling knowledge sharing, coordination of actions, and many other evolutionary advantages. But as impressive as this
may seem, language is a lossy compression algorithm, as Wittgenstein said: language disguises the thought
1.
Beyond its lossy nature, there exists a language-cognition interplay. Linguistic determinism states that language fully determines thought, meaning we cannot conceive of ideas for which our language lacks words or structures. A slightly weaker version of this theory, linguistic relativity, states that language influences thought, making certain concepts easier or harder to grasp depending on linguistic features.
There are clear examples of this interplay. As an example, speakers of the Australian language Guugu Yimithirr using cardinal, absolute positional systems (similar to our north/south/east/west) and not relative ones (left/right/up/down) 2, are “predominantly coding nonverbal memory in “absolute” concepts congruent with their language, while a comparative sample of Dutch speakers do so in “relative” concepts”3.
We start to see that language is a part of our Brain’s Operating System which models the world, but all models are
wrong, even if some are useful. Training a large language model is like distilling our own culture, the lens with which
we see and interact with the world. To better model the world, we should go beyond the bottleneck that is language,
because language is not all we need — to quote Wittgenstein, the limits of my language means the limits of my world
.
-
Wittgenstein, L., Pears, D. F., Russell, B., & Wittgenstein, L. (1995). Tractatus logico-philosophicus (1. publ., reprinted). Routledge. ↩︎
-
Haviland, J. B. (1998). Guugu Yimithirr Cardinal Directions. Ethos, 26(1), 25–47. https://doi.org/10.1525/eth.1998.26.1.25 ↩︎
-
Levinson, S. C. (1997). Language and Cognition: The Cognitive Consequences of Spatial Description in Guugu Yimithirr. Journal of Linguistic Anthropology, 7(1), 98–131. https://doi.org/10.1525/jlin.1997.7.1.98 ↩︎