The graphic shows how the word 'apple' originated from the transfer of an action (plucking or holding an apple) into a representing gesture. The gesture, by neural motor equivalence of arm movement and motor programming of the tongue and other articulatory organs, then generated the sound structure of the word for the fruit.
Similar graphics can be prepared for typical words from other major language groups. The different word structures for the same fruit in other languages result from the different arm-movement elements used to form the same final gesture (similarly to the relation between different words for 'head' and the final pointing gesture to the head: see http://www.percepp.demon.co.uk/nouns.htm
This has a relation to the ideas advanced by Rizzolatti, Gallese and Arbib in their papers on "Language within our Grasp" [Rizzolatti, G, and Arbib, M.A., 1998, Trends in Neuroscience, 21(5):188-194.] and the significance of 'mirror neurons'.