Jayarava's Raves

08 May 2015

What can the Turing Test Tell Us?

Alan Turing's contribution to mathematics, cryptography and computer science was inestimable. Not only did he shorten World War Two, saving thousands of lives, he advanced us onto the path of digital computers. His suicide after being coerced into hormone treatment is a massive blot on the intellectual landscape in Britain. It is an enduring source of shame. Turing's work remained classified for decades because of the fear that war might break out again and knowing how to break the complex codes used by the Germans was too valuable an advantage to throw away. Nowadays, cryptography has advanced to the point where keeping Turing's work a secret no longer confers much advantage.

Turing was prescient in many ways. Not only did he set the paradigm for how digital computers work, but he understood that one day such machines might become so sophisticated that they were indistinguishable from intelligent beings. He was the first person to consider artificial intelligence (AI). Thinking about AI led him to construct one of the most famous thought experiments ever proposed. The Turing Test is not only a way to distinguish intelligence, it is actually a way of thinking about intelligence without getting bogged down in the details of how intelligence works. For Turing and many of us, the argument is that if a machine can communicate in a way that is it indistinguishable from a human being, then we must assume that it is intelligent, however it achieves this. It's a pragmatic definition of intelligence and one that leads to a practical threshold, beyond which all AI researchers wish to pass.

However underpinning the test are some assumptions about communication, language, and intelligence that I wish to examine. The first is that all human beings all seem to be considered good judges for the Turing Test. I think a good case can be made for considering this a false assumption. The second is the assumptions that mere word use is how we define not only intelligence, but language. Both of these are demonstrably false. If the assumptions the test is built on are false, then we need to rethink what the test is measuring, and whether we still feel this is a sufficient measure of intelligence.

Turing Judges.

The idea of the Turing Test is that a person sits at a teletype machine that prints texts and allows the operator to type text. The human and the test subject sit in different rooms and use the teletype machines to communicate. A machine can be said to pass the Turing Test if a human operator of the teletype cannot tell that the subject is not human. This puts word use at the forefront of Turing's definition of what it means to be intelligent.

Human beings use of language is indeed one of our defining features. Animals use faculties that hint at a proto-language facility. No animal uses language in the sense that we do. At best animals show one or two of the target properties that define language. They might for example have several grunts that indicate objects (often types of predator), but no syntax or grammar. There has been significant interest in programs that sought to teach apes to use language either as symbols or gestures. But most of this research has been discredited. Koko the gorilla was supposedly one of the most sophisticated language uses, but her "language" in fact consisted of rapidly cycling through the repertoire of signs, with the handler picking the signs that made most sense to them. In other experiments subtle cues from handlers told the animals what signs to use. More rigorous experiments show that chimps can understand some language, particularly nouns, but then so can grey parrots, some dogs, and other animals. Crucially they don't use language to communicate. In fact a far more impressive demonstration of intelligence is the ability of crows to improvise tools to retrieve food, or the coordinated pack hunting of aquatic mammals like orca and dolphins. So animals do not use language, but are none the less intelligent.

Humans are all at different levels when it comes to language use. Some of us are extraordinarily gifted with language and others struggle with the basics. The distinctions are magnified when we restrict language to just written words. This restriction alone is doubtful. Language as written language, even if used for a dialogue, is only small part of what language use consists of. A great deal of what we communicate in language is conveyed by tone of voice, facial expression, hand gestures, or body posture. Those people who can use written language well are rare. So a Turing judge is not simply distinguishing a machine from a human, but is placing a machine on a scale that includes novelists and football hooligans. What happens when the subject responds to any question by chanting "Oi, oi, oi, Come on you reds!"? Intelligence, particularly as measured by word use, is not a simple proposition.

The Turing Test using text alone would be more interesting if we could define in advance what elements would convince us that the generator of the text was human. To the best of my knowledge this has never been achieved. We don't know what criteria constitute a valid or successful test. We just assume that any generic human being is a good judge. There's no reason to believe that this is true. As I've mentioned many times now, individuals are actually quite poor at solo reasoning tasks (See An Argumentative Theory of Reason). Reason does not work the way they we thought it did. Mercier & Sperber have argued that at least one of the many fallacies that we almost inevitably fall prey to—confirmation bias—is a feature of reason, rather than a bug. M&S argue that this is because reason evolved to help small groups make decisions and those who make proposals think and argue differently to those who critique them. On this account, any given individual would most likely be a poor Turing judge.

Humans beings evolved to use language. Almost without exception, we all use it without giving it much thought. Certain disorders or diseases may prevent language use, but these stand out against the background of general language use: from the Amazon jungles to the African veldt, humans speak. The likelihood is that we've been using language for tens of thousands of years (See When Did Language Evolve?). But writing is another story. Writing is unusual amongst the world's languages, in that only a minority of living languages are written, or were before contact with Europe. Writing was absent from the Americas, from the Pacific, from Australia and New Guinea. The last two have hundreds of languages each. Unlike speaking, writing is something that we learn with difficulty. No child spontaneously begins to communicate in writing. Writing co-opts skills evolved for other purposes. And as a consequence our ability to use writing to express ourselves is extremely variable. Most people are not very good at it. Those who are, are usually celebrated as extraordinary individuals. Writers and their oeuvre are very important in literary cultures.

So to chose writing as the medium of a test for intelligence is an extremely doubtful choice. We don't expect intelligent human beings to be good at writing. Many highly intelligent people are lousy writers. We don't even expect people who are gifted speakers to be good at writing, which is why politicians do not write their own speeches! Writing is not a representative skill. Indeed it masks our inherent verbal skill.

In fact it might be better to use another skill altogether, i.e. tool making. A crow can modify found objects (specifically bending wire into a hook) to retrieve food items. Another important manifestation of intelligence is the ability to work in groups. Some orca, for example, coordinate their movements to create a bow-wave that can knock a seal off an ice-flow. This is a feat that involves considerable ability at abstract thought, and they pass this acquired knowledge onto to their offspring. The ability to fashion a tool or coordinate actions to achieve a goal are at least as interesting as manifestations of intelligence as language is.

Language and Recognition.

My landlady talks to her cats as though they understand her. She has one-sided conversations with them. Explains to them narratively when their behaviour causes her discomfort, as though they might understand and desist (they never do). She's not peculiar in this. Many people feel their pets are intelligent and can understand them even if they cannot speak. Why is this? Well, at least in part, it's because we recognise certain elements of posture in animals corresponding to emotions. The basic emotions are not so different in our pets that we cannot accurately understand their disposition: happy, content, excited, tired, frightened, angry, desire. With a little study we can even pick up nuances. A dog that barks with ears pinned back is saying something different to one that has its ears forward. A wagging tail or a purr can be a different signal depending on circumstances. A lot of it has to do with displays of and reception of affection.

Intelligence is not simply about words or language. Depending on our expectations the ability to follow instructions (dogs) or the ability to ignore instructions (cats) can be judged intelligent. The phrase emotional intelligence is now something of a cliché, but it tells us something very important about what intelligence is. A dog that responds to facial expressions, to posture and tone of voice is displaying intelligence of the kind that has a great deal of value to us. Some people value relationships with animals precisely because the communication is stuck at this level. A dog does not try to deceive or communicate in confusingly abstract terms. An animal broadcasts its own disposition ("emotions") without filtering and it responds directly to human dispositions. Many people would say that this type of relationship is more honest.

There's a terrible, but morbidly fascinating, neurological condition called Capgras Syndrome. In this condition a person can recognise the physical features of humans, but their ability to connect those features with emotions is compromised. Usually when one sees a familiar face there is an accompanying emotion that tells us what our relationship with the person is. If we feel disgust or anger on recognition, then we know them to be enemies, perhaps dangerous and we act to avoid or perhaps confront them. If the emotion is joy or love then we know it's a friend or loved one. In Capgras the emotional resonance is absent. With loved ones the absence of that emotion is so strange that the most plausible explanation often seems to be that these are mere replicas of loved ones, or lookalikes. The lack of emotion in response to a known face can be incapacitating in the sense of disrupting every existing relationship. In the classic novel, The Echo Maker, by Richard Powers, the man with Capgras is able to recognise and respond to his sister's voice on the telephone, but does not feel anything when he sees her. The same is true for his home and even his dog. The only way he can explain it is that they are all substitutes cleverly recreated to fool him. Only he isn't "fooled" which creates a nightmarish situation for him.

The problem, then, with the Turing Test is that it is rooted in the old Victorian conceit about reason being our highest faculty. Reason was, until quite recently, considered to float above the mere bodily processes of emotion. In other words it was very much caught up in Cartesian mind/body dualism and the metaphors associated with matter and spirit (See Metaphors and Materialism). Reason is associated, by default, with spirit, since it seems to be distinct from emotion. We now know that nothing could be further from the truth. Cut off from emotions our minds cannot function properly. We cannot make decisions, cannot assess information, and cannot take responsibility for our actions. The Turing test assumes that intelligence is an abstract quality, separable from the body. But these assumptions are demonstrably false.

What Kind of Intelligence?

I've already pointed out that language is more than words. I've expanded the idea of language to include the prosody, gesture and posture associated with the words (which as we know shapes the meaning of the words). An ironic eyebrow lift can make words mean something quite different than their face value. The ability to use and detect irony depends on non-verbal cues. This is why, for example, irony seldom works on Twitter. Text tends to be taken on face value, and attempts at irony simply cause misunderstanding. This is true in all text based media. In the absence of emotional cues we are forced to try to interpolate the disposition of the interlocutor. Getting a computer to work with irony would be an interesting test of intelligence!

Indeed trying to assess the internal disposition of the hidden interlocutor is a key aspect of the Turing Test. Faced with a Turing Test subject I suspect that most of us would ask questions designed to evoke emotional responses. This is because we intuit that what makes us human is not the words we use, but the feelings we communicate. Someone who acts without remorse is routinely referred to as "inhuman". In most cases humans are not good at making empathetic connections using text - which is why text-based online forums seem to be populated with borderline, if not outright, sociopaths. It's the medium, not the message. Personally I find that doing a lot of online communication produces a profound sense of alienation and brings out my underlying psycho-pathology. Writing an essay however is far more productive exercise than trying to dialogue in text. Even the telephone, with it's limited frequency range, is better for communicating, because tone of voice and inflection communicates sufficient to establish an empathetic connection.

So if a computer can play chess better than a human being (albeit with considerable help from a team of programmers) then that is impressive, but not intelligent. The computer plays well because it does not feel anything, does not have to respond to its environment (internal or external), and does not have any sense of having won or lost. It has nothing for us to relate to. Similarly, even if a computer ever managed to use language with any kind of facility, i.e. if it could form grammatically and idiomatically correct sentences, it would probably still seem inhuman because it would not share our concerns and values. It would not empathise with us, nor us with it.

I suppose that in the long run a computer might be able to simulate both language and an interest in our values so that in text form it might fool a human being. But would this constitute intelligence? I think not. A friendly dog would be more intelligent by far. Which is not to say that such a computer would not be a powerful tool. But we'd be better off using it to predict the weather or model a genome than trying to simulate what any of us, or any dog, can do effortlessly.

An argument against this point of view is that our minds are tuned to over-estimate intelligence or emotions in objects we see. So we see faces in clouds and agency in inanimate objects. So an approximation of intelligence would not have to be all that sophisticated to stimulate the emotions in us that would make us judge it intelligent. For example, in movies robots are often given a minimal ability to emote in order to make them sympathetic characters. The robot, Number five, in the film Short Circuit has "eyebrows" and an emotionally expressive voice and this is enough for us to empathise with it. So perhaps we will be easily fooled into believing in machine intelligence. But this means that simulation of intelligence is insufficiently impressive because people are easily fooled.

This point is brilliantly made in the movie Blade Runner. The Voight-Kampff test is designed to distinguish "replicants" from humans based on subtle differences in emotional responses. The replicants are otherwise indistinguishable from humans. The test of Rachael is particularly difficult because she has been raised to believe she is human (the logic of the movie breaks down to some extent because we do not learn by Deckard persists in asking 100 questions if Rachael is answering satisfactorily). Ridley Scott has muddied the waters further by suggesting that the blade runner, Deckard, is himself a replicant, though based on the original story and the context of the film this seems an unlikely twist.

So there are two major problems here: what makes a good Turing test; and who makes a good Turing judge. The whole set up seems under-defined and poorly thought out at present. My impression is that passing the Turing test as it is usually specified is a trivial matter that would tell us nothing about artificial intelligence or humanity that we do not already know.

Conclusion

It seems to me that we have many reasons to rethink the Turing Test. It seems to be rooted in a series of assumptions that are untenable in light of contemporary knowledge. As a test for intelligence the Turing Test no longer seems reasonable. On one hand the way that it defines intelligence is far too limited. The definition of intelligence it uses is rooted in Cartesian Dualism which sees intelligence as an abstract quality, not rooted in physicality, not embodied. And this is simply false. Emotions, as felt in the body, for example, play a key role in how we process information and make decisions.

As much as anything our decision on whether or not an entity is intelligent or not, will be based on how we feel about it, how interacting with it feels to us. We will compare the feeling of interacting with the unknown entity, to how it feels to interact with an intelligent being. And until it feels right we will not judge that entity intelligent.

In Turing's day we simply did not understand how decision making worked. We still thought of abstract reasoning as a detachable mental function unrelated to being embodied. We still saw reason as the antithesis of emotion. Now we know that emotion is an indivisible part of the process. We must now consider that reason itself may not have evolved for seeking truth, but merely for optimising decision making in small groups. At the very least, the lone teletype operator needs to be replaced with a group of people; and mere words must be replaced by tasks that involve creativity and cooperation. A machine ought to show the ability to cooperate with a human being to achieve a shared goal before being judged "intelligent". The idea that we can judge intelligence at arms length, rationally, dispassionately has little interest or value any more. We judge intelligence through interaction, physical interaction as much as anything.

As George Lakoff and his colleagues have shown, abstract thought is rooted in metaphors deriving from how we physically interact with the world. Our intelligence is embodied and the idea of disembodied intelligence is no longer tenable. As interesting as the idea may appear, there is no ghost in the machine that can be extracted or instantiated and maintained apart from the body. Any attempts to create disembodied intelligence will only result in a simulacrum, not in intelligence that we can recognise as such.

Buddhists will often smugly claim this as their own insight, though most Buddhists I know are crypto-dualists (most believe in life after death and karma for example). I've argued at length that the Buddha's insight was into the nature of experience and that he avoided drawing ontological conclusions. Thus, although we read the texts as being a critique of doctrines involving souls, the methods of Buddhism were always different from the methods of Brahmanism. The Brahmins sought to experience the ātman as a reality, and from the Upaniṣadic description ātman could be experienced as a sense of oneness or connection with everything in the world (oceanic boundary loss). Buddhists deconstructed experience itself to show that nothing in experience persisted and that therefore, even if there was a soul we must either always experience it, or it could never be experienced, and since we start off not experiencing it, no permanent soul can ever be experienced (which is not a comment on whether or not such a soul exists!). Therefore the experiences of the Brahmins are of something other than ātman. Only after Buddhists had started down the road of misguided ontological speculation did this become an opinion about the existence of a soul. So the superficial similarities between ancient Buddhist and modern scientific views is an accident of a philosophical wrong turn on the part of Buddhists. They got it partly right by accident, which is not really worth being smug over.

History shows that we must proceed with real caution here. Our Western views on intelligence have been subject to extreme bias in the past and this has led to some horrific consequences for those people who failed our tests for completely bogus reasons. We must constantly subject our views on intelligence to the most rigorous criticism and scepticism we are capable of. Our mistakes in this field ought to haunt us and make us extremely uncomfortable. This is yet another reason why tests for intelligence ought to require more interactivity. If we do create intelligence we need to know we can get along with it, and it with us. And we know that we have a poor record on this score.

The Turing Test seems not to have been updated to take account of what we know about ourselves nowadays. The test itself is anachronistic. The method is faulty, because it is based on a faulty understanding of intelligence and decision making. We are not even asking the correct question about intelligence. With all due respect to Alan Turing, he was a man of his time, a glorious pioneer, but we're moved on since he came up with this idea and it's had its day.

~~oOo~~

See also: Why Artificial Intelligences Will Never Be Like Us and Aliens Will Be Just Like Us. (27 June 2014)

01 May 2015

Yāmagaṇḍika: Telling the Time in Ancient India

Revision 2.0 - 3 May 2015

My Pāḷi reading group has been working through the commentary to the Kāraṇiya Metta Sutta which I translated for this blog some years ago (11 Jun 2010). In this text we come across an unusual term that has no counterpart in the suttas. In picturing some bhikkhus zealously meditating in the forest it describes them as yāmagaṇḍikaṃ koṭṭetvā. This is a curious expression and in this essay I'll attempt to elucidate what it means. The compound yāmagaṇḍika occurs only twice, both times in commentarial texts (Paramatthajotikā SnA 1.193; Papañcasūdaniyā, MNA 1.122) and these should be enough to allow us to gain some clarity. We'll see that the commentator does not see his own time in context, but wrongly assumes that his milieu reflects that of the Buddha some centuries earlier.

The gerund koṭṭetvā must come from the verb koṭṭeti (from a rare root √kuṭ or kuṭṭ) 'to beat, crush, pound'. For example it is the action associated with a mortar (udukkhale koṭṭetvā DN ii.341) and with pounding grain (dhaññaṃ koṭṭenti Thī 117). It has other minor senses in PED, but these don't seem relevant here. The compound yāmagaṇḍika combines yāma and gaṇḍikā. We'll take these one at a time.

According to PED gaṇḍikā derives from gaṇḍa 'a swelling; a stalk or shaft' + -ikā. The formation gaṇḍikā means 'a stalk or shaft', particularly 'the trunk of a tree' and by association 'a block of wood'. However there is a potential confusion here with ghaṇṭā 'bell' or ghaṭikā 'gong'. As we will see the CST edition of the text is quite unreliable and this means we must allow for errors. In the Digital Pāḷi Reader version of this text, we find yāmaghaṇḍikaṃ koṭṭetvā. The spelling -ghaṇḍikaṃ occurs in the Majjhima Ṭīkā (the sub-commentary on MNA 1.121) "yāmaghaṇṭikaṃ paharati" (MNṬ 1.196) though the Aṭṭhakāthā has -ga-. The Khuddaka Nikāya Commentary—which parallels the Suttanipata commentary—also has -gh-.

Pañjaranatha Mahākāla
with gaṇḍikā

DOP sv ghaṇṭī/ghaṇḍī, suggests a confusion with gaṇḍi and ghaṇṭā. If it does mean 'block' then it must refer to a resonant gong-like block that is 'pounded' (√koṭṭ) as a time signal. Buddhadatta's Concise Pali-English Dictionary defines gaṇḍikā as "(f.) a hollowed block of wood which is used to serve the purpose of a bell; a gong." A gaṇḍī or gaṇḍikā is the characteristic implement of a form of Mahākāla known as Pañjaranatha. In the image on the right he holds it across his body (thanks to Maitiu for pointing this out).

Yāma is complicated because it has homonyms that derive from different verbs. From √yam 'hold, hold back' + -a we get yāma 'restraint'; and from √yā 'go' + -ma we get yāma 'motion, going, progress'. The latter is used figurative to mean 'a watch of the night'. We frequently read in Pāḷi of the three watches of the night (tiyāmā): paṭhamayāma, majjhimayāma, and pacchimayāma (first, middle, and last watches). The practice of dividing the night in particular in watches was common in the ancient world. The Latin name for these periods was vigilia, whence English 'vigil'. Incidentally yāma can also be a collective noun for people or things related to the God of the afterlife, Yama, in this case his name means 'twin', from √yam 'combine'.

The compound, yāmagaṇḍika, can really only be a tatpuruṣa so it must mean something like 'the block of restraint', or 'the gong of the watches'. The context is that the monks are resolute night and day, devoted to wise attention, and sitting at the foot of trees meditating. It may be that 'beating the block of restraint' is a metaphor that we no longer understand, similar to the Buddha saying to Upaka the Ājīvaka in the Ariyapariyesana Sutta, āhañchaṃ amatadundubhiṃ 'I beat the drum of the deathless' (MN i.171). It's not entirely obvious what this means since drums are primarily for entertainment in our society.

However, I believe that here we must read yāma as 'watch of the night' and the phrase means 'beating the block or pounding the gong that marks the watches'. For confirmation we can look at the second of the two occurrences of yāmagaṇḍika at MNA 1.122 (already mentioned above):

Ajagaravihārepi kāḷadevatthero antovasse yāmagaṇḍikaṃ paharati, āciṇṇametaṃ therassa. Na ca yāmayantanāḷikaṃ payojeti, aññe bhikkhū payojenti. Atha nikkhante paṭhame yāme there muggaraṃ gahetvā ṭhitamatteyeva ekaṃ dve vāre paharanteyeva vā yāmayantaṃ patati,

We immediately strike a problem in that ajagara probably means 'python' or some other large snake and doesn't fit the context, and the spelling of the next word (with -tth-) is suspect. Consulting the Dictionary of Pāli Names we find an entry for a Thera named Kāḷadeva:

"...incumbent of Vajagaragiri-vihāra. He is mentioned as having known the exact passage of time without the help of an "hour-glass" (yāmayantanālika). MA.i.100f

This is in fact, a reference to the passage we are about to analyse. It's thus apparent that the CST (Burmese) edition is incorrect here and we must amend it to:

Vajagara[giri]vihārepi kāḷadevathero antovasse yāmagaṇḍikaṃ paharati, āciṇṇametaṃ therassa. Na ca yāmayantanāḷikaṃ payojeti, aññe bhikkhū payojenti. Atha nikkhante paṭhame yāme there muggaraṃ gahetvā ṭhitamatte yeva ekaṃ dve vāre paharante yeva ca* yāmayantaṃ patati.

The Elder Kāḷadeva of Vajagaragiri Monastery, performs this striking of the block of the watches till the end of the rains. And he does not use a measuring device as other monks did. At the end of the first watch the Elder takes up the hammer (muggara) and strikes twice for every measure of time, just as the watch-mechanism falls.

* The text has vā, but I think this must also be wrong, and have amended to ca.

My translation of this passage is a little rough, but the main points are clear. For our purposes two things are important. It is entirely clear that yāma must refer to 'a watch of the night' rather than 'restraint'. Secondly we read that Kāḷadeva did not yāmayantanāḷikaṃ payojeti, that he used a hammer (muggara) to strike the block, and then yāmayantaṃ patati. And this helps to fill out what the author of the Metta Sutta commentary was thinking.

One of the problems of living a regular life is keeping time. The early forest monks had no way of telling the time apart from the sun, moon and stars. Pāḷi distinguishes day (diva) from night (ratti) and we read of monks doing things in the morning-time (pubbaṇhasamaya) or evening-time (sāyaṇhasamaya). We know that the phases of the moon—full moon (puṇṇacanda) and new moon (navacanda)—were important for organising the lives of monks. The moon takes on a magical significance for some Buddhists as a result of this. The watches of the night, however, are far more difficult to determine. How did monks, living in a forest, know when the watches began and end. Presumably the first watch started at dusk and the last ended at dawn, but what marked the other boundaries? Presumably one versed in astronomy would be able to keep track of when certain stars were due to rise and set, but the three month retreat is during the rain season when the skies are perpetually cloudy.

The simple answer is that the first monks almost certainly did not keep accurate track of the time and that the watches were assessed subjectively. And we can point out that no references to time keeping apart from observing the sun and moon are referenced in the suttas. The texts we are dealing with here, however, are from 5th century Sri Lanka and from an environment of highly organised, large scale, urban monasteries.

If we now look at the phrase yāmayantanāḷikaṃ payojeti we can see that the DOPN glosses it is as "the help of an hour-glass". Now an hour-glass is anachronistic here, they did not exist in this time or place. But yanta does mean 'mechanism' and nāḷika 'a tube or measure'. So we know that Kāḷideva did not, as other monks did, employ (pa√yuj) a measure/tube device for the watches (yāma-yanta-nāḷika). This suggests some kind of clock, but is the idea plausible? I had a dig around in some horological books and apparently it is plausible to think that in first millennium India there were water-clocks.

Water-clocks come in two forms: a vessel with a hole that allows water to leak out slowly, and the slight more sophisticated sinking bowl, in which a bowl with a hole in it gradually sinks into a container of water. The books suggest that the sinking-bowl water-clock was common in India by medieval times and so accurate that it probably delayed the introduction of mechanical clocks. Importantly the attendant of water-clock announced the end of the time period by striking a 'gong'. The Gujarati word for which was ghaḍiyār. There's an outside possibility that this word is related to gaṇḍikā or ghaṭika.

Persian Water Clock.

We do know that the Achaemenid Persians possessed just such water clocks, from the records of Alexander's conquests in India by Callisthenes of Olynthus. We know that similar water-clocks were employed to mark the passage of time in monasteries in North India by the 7th century. This information comes from the records of Yijing (義淨 aka I-Tsing; ) a Chinese monk who lived 635–713 CE, and spend 25 years travelling, taking the southern sea route to India. Yijing's account (see translation by Takakusu 1896: 142-6) is widely recycled in a variety of other sources, for example Misra (1998) simply quotes Takakusu at length, while Sharfe (2002) paraphrases and the Wikipedia article on water-clocks cites Sharfe. Yijing records the use of sinking bowl water clocks in several monasteries, with each using slightly different measures and signalling conventions. The bowls were made of copper and were very expensive, generally being the gift of a king to a monastery. Such clocks were also used by the ancient Britons.

Sri Lankan
water-clock bowl
McGill.

So it seems at least plausible that urban monks in fifth century Sri Lanka measured the hours of the day using a water-clock and marked the increments by striking some kind of gong (probably wooden given how expensive metal was). And what our commentators have done is imagine that this is also what monks did in the Buddha's time. Thus when they tell the story of the Metta Sutta they project this technology backwards. And we know that they have done in this other ways as well. For example they projected South Indian kinship patterns familiar, to them in Sri Lanka, onto the family tree of the Buddha and his family, even though these patterns were out of place in North India (See Attwood 2012). But it is extremely unlikely that forest monks in the fifth century BC uses anything so elaborate to measure time.

One little loose end is that having struck the gong with the hammer, yeva ca yāmayantaṃ patati. Now, patati comes from √pat 'fall, fly' and it's not usually a transitive verb. Yanta being a neuter noun we can read this as 'and just as the watch-mechanism falls'. If the yāmayanta falls at the end of the time period, then this is consistent with a sinking bowl style water clock.

It is fascinating how a short phrase like this one can open a window into history. And while here we are not talking about the time of the Buddha, but of the period of the Sri Lankan commentators, it is still a glimpse of history. It reinforces the point that the commentaries reflect their own time rather than any earlier time. They are apt to project their own culture and technology backwards onto the past, making them unreliable guides to the past. Thus when we consult the Pāḷi commentaries for insights into the suttas we must be cautious in drawing historical conclusions. The commentators were no doubt sincere, but they had a vested interest in trying to establish that the past was reflected in the present because it was one way of establishing their legitimacy as bearers of the tradition. It shows how very tenuous lineage is as a guide to legitimacy or authenticity.

~~oOo~~

Bibliography

Attwood, Jayarava. (2012) 'Possible Iranian Origins for Sākyas and Aspects of Buddhism.' Journal of the Oxford Centre for Buddhist Studies, 3.

Misra B.N. (1998) Nālandā: Vol. 1. Sources and Background. B.R. Publishing Corporation.

Sharfe, Harmut. (2002) Education in Ancient India. Brill 2002.

Takakusu, J. trans. (1896) I-Tsing, A Record of the Buddhist Religion : As Practised in India and the Malay Archipelago (A.D. 671-695), Clarendon Press 1896. Reprint. New Delhi, AES, 2005.

24 April 2015

Avalokiteśvara & The Heart Sutra

Avalokiteśvara
Huntington Archive

Avalokiteśvara (aka Guānyīn, Kannon, Chenrezik) is probably the best known Buddhist deity after the Buddha. Avalokiteśvara makes his first appearance in Buddhist literature as one of two bodhisattvas flanking Amitāyus in the Sukhāvatīvyūha Sūtras and continues to play roles associated with the qualities of Amitābha, particularly karuṇā, or compassion. He is one of the first mythic figures who has no discernible basis in an historical person, but emerges as a Buddhist value (karuṇā) personified. He (or, indeed, She in China) was and continues to be one of the most important figures in the Buddhist pantheon, both in Asia and in the West.

This essay will be particularly concerned with the name Avalokiteśvara. We most commonly read that this name means something like 'Lord Who Looks Down'. This is how Conze reads the name in his Heart Sutra commentary and it's also a feature of the commentarial literature on the Heart Sutra . We'll see that the name changed, perhaps in the 6th or 7th century, and that the etymology alone is insufficient to fully understand what the name means and how to translate it.

Sanskrit

In Sanskrit we find two forms of the name: Avalokitasvara (avalokita-svara) and Avalokiteśvara (avalokita-īśvara). The name Avalokitasvara does not appear in any complete Sanskrit manuscript, but is found on fragments of an old manuscript Saddharmapuṇḍarikā Sūtra (Studholme 53). The form is confirmed by the Chinese translation with 音 yīn which means 'sound' (discussed in more detail below).

The first part of the name avalokita is usually interpreted as something like 'looked down'. This is a deceptively literal reading of the etymology. Avalokita is a passive past participle from ava + the verbal root √lok. The root does mean 'look', and the prefix ava- can mean 'down'. A quirk of Sanskrit is that past participles such as avalokita can take on an active meaning (Studholme 2002: 55). Thus we can understand how translators such as Conze get "looks down" as the translation. This is often how the tradition has understood the name. As I comment in my forthcoming article on the Heart Sutra (JOCBS 8):

This is confirmed, for example, by the Indian commentaries preserved in Tibetan, viz. “Because he looks down on all sentient beings at all times and in all ways with great love and compassion, he is the one who looks down (avalokita)” (Lopez 1988: 43); “Because he is superior and is the lord who looks down, he is called the ‘Noble Lord Who Looks Down (āryāvalokiteśvara)” (Vimalamitra in Lopez 1996: 52). Looking down on the world and its inhabitants is one of the prominent characteristics of this figure in Buddhist mythology.

Studholme suggests that the name might be understood as "sound viewer", or "sound perceiver" which he ties to the mythology of Avalokiteśvara, the one who responds to the cries of the suffering (55-56). This is a theme in the myth of Amitābha as well, with whom Avalokiteśvara is closely connected: calling his name results in an intervention, usually at death, so that the supplicant is reborn in Sukhāvati, the Pure Land of Amitābha. This practice is known as nāmānusmṛti 'recollection of the name'. In the Kāraṇḍavyūha Sūtra, the efficacy of the mantra oṃ maṇipadme hūṃ is explained as a form of nāmānusmṛti, since maṇipadma is a coded form of the name of Avalokiteśvara, though here the supplicant is reborn in one of the worlds which occupy the hair pores on Avalokiteśvara's body.

As my forthcoming article says, the closely related verb vyavalokayati (vi + ava + √lok) does not mean 'look down' but 'examine', still with a visual connotation. Which suggests to me that the ava in ava√lok does not mean 'in a downwards direction', but more like 'to look closely', 'to narrow down one's field of view', either by concentrating or by physically getting close to the object.

The problem here is that we have a mixed metaphor, a jumbling of sensory modes. The idea of seeing sounds is not found in early Buddhist texts which assert that only the eye can see forms and only the ear can hear sounds. Studholm also notes this synaesthesia and does what we all do, he changes the sense of avalokita from a visual one to a general sensory perception. While this certainly solves the problem, I'm not convinced that it is justified because ava√lok is specifically a visual verb. However, we have no better explanation and the Buddhist tradition has also used this solution.

There is another potential solution. Peter Alan Roberts (2012: 236-7) points out that the word avalokita has a different meaning in the Mahāvastu, which contains two sub-texts both called Avalokita Sutra (See Jones Vol. II: 242-253) [I'm grateful to Richard Gombrich for pointing out this article to me]. Curiously, amongst the audience for the texts are two devas called Īśvara and Maheśvara, two epithets traditionally associated with Śiva. According to Roberts, because the Mahāvastu is the product of the Lokavattarin branch of the Mahāsaṅghika sect, it may well represent a kind of proto-Mahāyāna view of what the word means.

"In the Avalokita Sūtras, avalokita does not refer to a being, but means that which has been seen by those who have crossed over saṃsāra, and is therefore a synonym for enlightenment." (237)

Roberts' observation helps a bit with the earlier form of the name: avalokita-svara where svara means 'sound, noise' and the whole must mean something like 'the sounds perceived by the enlightened'. Unfortunately I don't quite see why Roberts thinks avalokita means "that which has been seen by those who have crossed over saṃsāra". I have looked at the Avalokita Sūtras and as far as I can tell they don't actually comment on this issue, they merely contain episodes in the biography that makes up the Mahāvastu.

The situation improves somewhat with the change of the bodhisattva's name. As Studholme discusses, in his study of the Kāraṇḍavūyha Sūtra, Avalokiteśvara converts the god Śiva to Buddhism and in the process seems to assimilate some of Śiva's iconography, including especially the epithet īśvara 'Lord' (Studholme 2002: 37ff.). For a Lokottaravādin, according to Roberts, "whatever the actual etymological origin of the name may be, it would inescapably have had the resonance of meaning 'Lord of Enlightenment'." (2012: 237). It may be that reading the Mahāvastu in Sanskrit reveals something about the word avalokita that the translation does not, but since it is 100 pages of translation, the reading becomes a fairly major project in itself for little reward.

The association with the Avalokita Sūtras, however, opens up the possibility of another way of understanding avalokita-īśvara. It might mean 'Īśvara of the Avalokita'; i.e., the Īśvara who was in the audience of the Avalokita Sūtra. But this may be too simple and obvious to appeal to many people.

Translations of the name into other languages, particularly Chinese, shed further light on the name. The Chinese forms are particularly useful because texts in which Avalokiteśvara appears were translated from early on, which, in this case, means from around the late 2nd Century CE onwards.

Chinese & Tibetan

As in Sanskrit, there are two forms of the name in Chinese. Avalokiteśvara is known in Chinese by the name 觀世音 Guānshìyīn. Literally 'look-world-sound' or 'watching the sounds of the world'. This is apparently a translation/interpretation of the name Avalokita-svara. Note that the Chinese translators preserve the synaesthetic idea of seeing sounds.

Although 觀世音 was used by earlier translators, it was the translations of Kumārajīva, in the early 5th Century CE, which popularised this form of the name. The name is regularly shortened to 觀音 Guānyīn, though there is no evidence for doing so until around the sixth century (Studholme 2002: 53). It is this shortened form of the name by which Avalokiteśvara is known in China down to the present. The shortening is sometimes said to be because of the death of the Emperor Taizong of Tang Dynasty (唐太宗; 599-649) to avoid uttering one of the characters in his personal name 李世民 Lǐ Shìmín. This is a traditional form of Chinese taboo, but that it applies in this case is disputed. Indeed, the word 世 'world' is so common it would be hard to avoid it completely.

Buddhist Chinese routinely abbreviates words, so that prajñāpāramitā is transcribed as 般若波羅蜜多 bōrěbōluómìduō, but just as often, and routinely by Kumārajīva, the last syllable is dropped. So in some respects 觀音 is an unexpected form of the name. If it were an abbreviation in this style, we might expect 觀世. Studholm, apparently following an argument made by Lokesh Chandra, seems to suggest (2002: 57) that 觀音 might have been the original form of the name in Chinese, since there is no Sanskrit equivalent of 觀世音 containing the word 世, which in Sanskrit is loka; i.e., we do not find the form avalokita-loka-svara. (this claim is repeated without caveats on Wikipedia). However, this is not entirely convincing because it is not backed up by evidence for the existence of earlier texts without 世.

It might be more plausible to suggest that 觀世 conveyed avalokita by combining a word meaning 'to see' with one that suggested 'loka'. Digital Dictionary of Buddhism associates 觀 more with verbs from the root √paś and √īkṣ than with √lok. In other words, 世 was not intended as a standalone character, but as one which modifies 觀 phonetically. The principle of phonetic and semantic radicals pervades the construction of complex Chinese characters from simpler elements. Indeed, the character 觀 guān is made up from two radicals: 雚 guàn is a phonetic element which suggests how the word is pronounced and 見 jiàn 'see' is a semantic element suggesting what the word means. This trend continues with Modern Mandarin frequently employing two characters to both avoid ambiguous homonyms and to expand the range of meaning carried by single characters.

In my other writing about the Heart Sutra, I've noted that the first sentence in Sanskrit contains two visual verbs meaning roughly 'to look' and 'to see', the first being vyavalokayati and the second being paśyati. In Chinese, these both tend be covered by 見 and related words. So, in the Chinese Heart Sutra, instead of Avalokiteśvara looking and seeing as he does in the Sanskrit, we find the puzzling phrase 照見. This is variously translated as "illuminated and saw" or "illuminatingly saw/clearly saw", since 照 means 'illuminate, shine' and it is ambiguous as to whether it is intended as a second verb (illuminated) or as an adverb (illuminatingly). Since we now know that the Chinese preceded the Sanskrit, and we can infer that the first translator of the Heart Sutra was better informed about Chinese than Sanskrit, we can assume that, for that translator, 照見 conveyed both looking and seeing, since that is how they chose to translate it. The shift of perspective provided by Nattier (1992) provides us with valuable insights into these small textual or linguistic problems.

The form 觀自在, Guānzìzài ('watching one's existence'), was introduced by Xuánzàng and used, for instance, in the translation of the Heart Sutra attributed to him. My friend Maitiu has written in to point out that:

"自在 means 'free', 'unrestrained' or 'independent'. It has the sense of 'sovereignty' and it's used to translate īśvara more generally than just Avalokiteśvara's name."

Thus, 觀自在 ought to mean 'Watching Lord'. Studholme suggests that the timing of this new form coincides with the change of the last element of the name from svara to īśvara (2002:56-57). However, though Xuánzàng's translations are acknowledged to be more faithful to the Sanskrit, where a translation by Kumārajīva exists it has always remained more popular that Xuánzàng's (with the sole exception of the Heart Sutra and this is taken as evidence to doubt the attribution). And so it is with the name. In fact, even Xuánzàng's followers, and his biographer Huili, continued to use the older form of the name.

The Tibetan version of the name is སྤྱན་རས་གཟིགས i.e. spyan ras gzigs (pronounced Chenrezik). Romanisations for this name vary and I have adopted that used by the Dictionary of the Tibetan & Himalayan Library. The name is translated literally as 'sees with eyes'. The word spyan means 'eye' and is frequently used to translate words related to Sanskrit cakṣu or sometimes netra both meaning 'eye'; and spyan ras can mean "penetrating vision, observation". Gzigs means 'to see, gaze, perceive, realise', etc., and is used to translate Sanskrit words from √īkṣ 'to see' and √paś 'to see'. As we can see, the Tibetans resolved the difficulty of the different sensory modes in their translation of the name in favour of the visual sense. This in itself is interesting, since it must have been a source of cognitive dissonance for the Tibetan translators, who are usually very faithful to the Sanskrit. Studholme suggests that spyan ras gzigs is "an honorific form of the Sanskrit Avalokita" (2002: 58).

The Heart Sutra

One of the differences between the two short versions of the Heart Sutra in Chinese, T250 and T251, is the name they use for Avalokiteśvara. The former uses 觀世音, consistent with being translated by Kumārajīva; while the latter uses 觀自在, consistent with being translated by Xuánzàng. As we recently saw (Chinese Heart Sutra: Dates and Attributions), the attribution of these translations to these translators is now plausibly disputed, because the facts of history, such as they are, conflict with the traditional authorship. Now that we also know the text was composed in China, it also alters the landscape. The scholarly consensus is that Kumārajīva did not translate or compose T250. Nattier makes a good case for Xuánzàng not being the translator/composer of T251 (See Nattier 1992: 184 ff.). Both texts seem to be later creations, based on some earlier text, that have been edited to look like authentic productions of the two famous translators. Indeed, Nattier shows that T250 has most likely been altered to look more like《大智度論》Dàzhìdù lùn (*Mahā-prajñā-pāramitopadeśa; T1509 ), a commentary on Pañcaviṃśati Prajñāpāramitā Sūtra attributed to Nāgārjuna, also translated by Kumārajīva.

However, since 觀世音 as the translation of Avalokiteśvara both predates Kumārajīva and is the standard form he used in his translation of Prajñāpāramitā texts (from which the Heart Sutra certainly draws its core), then we can assume that the ur-text of the Heart Sutra used this form of the name also.

In his translation of the Heart Sutra, Edward Conze takes the odd step of carving up the name Āryāvalokiteśvara into its constituent parts: ārya, avalokita, and īśvara. He then takes īśvara to be an epithet like ārya and translates "Avalokita, the Holy Lord". There is simply no way to construe ārya as qualifying īśvara here. Although it is true that some texts, notably the Bodhicaryāvatāra, use the name Avalokita, Avalokiteśvara can only be read as a compound with an implied syntactic relationship between the two words, because avalokita is undeclined. Ārya then qualifies the whole name. Indeed, in this period of Buddhism it was typical to add ārya to names of people and texts as a mark of special status. Perhaps "holiness" is not too far from the mark, though we can refine it to mean 'connected with awakening'.

In China, Avalokiteśvara took on a female form, partly through syncretisation with the myth of Miaoshan (妙善) (Guang 2011). Though scholars differ on when the sex-change took place, it seems to have begun to manifest by the Tang Dynasty. This covers the likely period of composition of the Heart Sutra. However, as far as the Sanskrit text is concerned, Avalokiteśvara is a grammatically masculine name, as is the alternate Avalokitasvara. We can assume, therefore that at least the translator from Chinese into Sanskrit thought of the deity as masculine.

The question is frequently raised as to what a deity associated with compassion is doing in a sutra about wisdom. Various theories have been put forward to explain this disparity. However, given what we now know and can deduce about the history of the text, it was composed or collated by an early medieval Chinese monk who saw nothing strange about worshipping Guānyīn and studying Prajñāpāramitā. By the seventh century the geography of Chinese Buddhism was very different from its Indian forms. Boundaries shifted or disappeared. Elements that might have seemed distinct—Pure Land Buddhism and Prajñāpāramitā Buddhism—became mixed and recombined to form native Chinese sects. Indeed, Xuánzàng dealt with the Prajñāpāramitā texts from a Yogācāra perspective as did his main disciples 窺基 Kuījī (632–682) and 圓測 Woncheuk (613-696).

Thus, even if the association of Avalokiteśvara still strikes us as incongruent, we must accept that it did not seem so to the author/composer. The failing is on our part. Perhaps because of monotheism or perhaps because European Christian churches dealt with heresy so viciously, for so long, we find syncretism difficult to fit within our paradigms of religion. At best, "syncretism" is pejorative, at worst, dismissive. However, it was the norm in both ancient India and China and their culture spheres in Central and South-East Asia. Synthesis is just as common and accepted as schism is. I've explored this also in my critiques of the tree as a metaphor for evolution. Which is not to say that there are no arguments over orthodoxy and orthopraxy, only that these arguments seldom seemed to generate quite the hostility that we find in European religion. And this situation has changed in modern India, perhaps under the influence of European values, certainly in reaction to colonialism and its aftermath.

It is a curious feature of history that some details seem to become so well known that we stop explaining them, and they are subsequently lost without possibility of recovery. If the early Mahāyāna Buddhists puzzled over the name Avalokiteśvara, they did not record their thoughts. Nor does any justification for the name change survive. Attempts to reconstruct ancient knowledge from minimal clues is a fascinating endeavour and I'm grateful to the people at the coal face, the various experts, whose work makes my kind of writing possible.

~~oOo~~

Bibliography

Guang Xing (2011). 'Avalokiteśvara in China.' The Indian International Journal of Buddhist Studies 12, 2011

Jones. J. J. (1952) Mahavastu. (Trans.) Vol. II. Luzac & co.

Nattier, Jan (1992). ‘The Heart Sūtra: a Chinese apocryphal text?’ Journal of the International Association of Buddhist Studies. 15 (2) 153-223. Online: http://archiv.ub.uni-heidelberg.de/ojs/index.php/jiabs/article/view/8800/2707

Roberts, Peter Alan. (2012) ‘Translating Translation: An Encounter with the Ninth-Century Tibetan Version of the Kāraṇḍavyūha-sūtra.’ Journal of the Oxford Centre for Buddhist Studies. 2: 224-242.

Senart, Émile (1882-1897) Mahavastu-Avadana. 3 vols. Paris.

Studholme, Alexander. (2002) The Origins of Oṃ Maṇipadme Hūṃ: A Study of the Kāraṇḍavyūha Sūtra. State University of New York.

Note 8 Jun 2015

Who Composed the Mahāyāna Scriptures? ––– The Mahāsāṃghikas and Vaitulya Scriptures.
Seishi KARASHIMA

ARIRIAB XVIII (2015), 113–162.

"An illustrative example of this sort of misunderstanding is Avalokitasvara and Avalokiteśvara. There are at least eight old Sanskrit fragments from Central Asia which bear the name Avalokitasvara, as well as one fragment from Kizil, which has (Apa)lokidasvara. These older forms agree with the early Chinese renderings “One, who observes sounds” and “One, who observes sounds of the world” (闚音, 現音聲, 光世音, 觀世音), which were made between the 2nd and 5th centuries, [114] while the newer form Avalokiteśvara, which first appears in a Mathurā inscription of the Gupta year 148 (467/468 C.E.)¹ and later in the Gilgit manuscript of the Lotus Sutra, dating back to the 7th century, agrees with the newer Chinese renderings “One who observes the sovereignty of the world” and “One who observes sovereignty” (觀世自在,觀自在) from the 6th century onwards. We cannot say for certain that the older forms are “corruptions” of the newer ones.²"

¹ Cf. IBInsc I 686~687.

² The most recent example of this misunderstanding is found in Saitō 2015. I assume that, in the language (probably Gāndhārī), in which the verses of the Samantamukha Chapter of the Lotus Sutra had been composed originally, svara (or śpara) might have meant both “sound” and “thinking” (= Skt. smara), and the composer of the verses himself may have understood *Avalokitasvara (or Avalokitaśpara, *Olokitaśpara or the like) as “One, who Observes Thinking”. Much later, when this -svara (or -śpara) was no longer understood as meaning “thinking; memory”, people probably began to regard it literally as “sound”. Thus, the composer of the prose portion of the same chapter understood the Bodhisattva’s name in this way, which was shared also by the early Chinese translators. I assume, also, that the Gāndhārī form *Avalokitaśpara could have been incorrectly sanskritised later to Avalokiteśvara by somebody who knew the development Skt. īśvara > Gā iśpara. Cf. Karashima 1999 and 2014a.

Subscribe to: Comments ( Atom )

Recently Published

Attwood, Jayarava. (2025) "Remembering Sue Hamilton (1950–2016)." Buddhist Studies Review 42(1): 2-11.

Attwood Jayarava (2024). "Revised Editions of the Prajñāpāramitāhṛdaya and Bānrěbōluómìduō xīn jīng «般若波羅蜜多心經»." Asian Literature and Translation. 11(1). 52-92. https://alt.cardiffuniversitypress.org/articles/10.18573/alt.63.

Attwood, Jayarava. (2024). “Heart to Heart: A Comparative Study of and Commentary on the Chinese and Sanskrit Heart Sutra Texts”. Buddhist Studies Review 40 (2): 159-88. https://doi.org/10.1558/bsrv.25438. [Email me to obtain a copy]

(2022) "The Cessation of Sensory Experience and Prajñāpāramitā Philosophy" IJBTC 32(1):111-148. IJBTC Website. [free download]

Books by Jayarava

Karma and Rebirth Reconsidered

A thorough reconsideration of the myths of the afterlife (rebirth) and just world (karma) in Buddhism. The book shows that from an evolutionary point of view such myths are understandable, but that Buddhists have never produced a coherent theory of either that can stand scrutiny. Indeed we now know that neither myth is realistic and that there is no afterlife and the universe is not fair. However, as human beings we can live on in memory or leave works, and we can live ethically.

Buy from Lulu now.

Talking to the Kālāmas

A new translation of the Kālāma Sutta along with a commentary which explores the meaning of the text, and explodes some myths about it. 34 p. £4.50

Buy from Lulu now.

Visible Mantra

A resource for visualising and calligraphy of Buddhist mantras. Commons mantras and seed-syllables in various fonts and scripts, including Devanāgarī, Tibetan Uchen, and Siddham.