WHAT IS MODELING IN GENERAL?

In natural sciences, we usually consider the system A to be a model of the system B if A is similar to B in some important properties and exhibits somewhat similar behavior in similar circumstances. According to this definition, it is unnecessary the physical nature of the modeling system A be the same as of the modeled system B. For example, B can be a technological aggregation in chemistry, while A can be a set of differential equations, i.e., a conceptual system of a mathematical nature. A mathematical model is usually the best if it really ensures sufficient approximation.

In the linguistic domain, we need a modeling system that, after receiving information of a linguistic nature in its input, exhibits in its output an acceptable degree of similarity to the results of natural language activity of the human brain. Such a definition follows the idea of Alan Turing mentioned above.

NEUROLINGUISTIC MODELS

Neurolinguistic models investigate the links between any external speech activity of human beings and the corresponding electrical and humoral activities of nerves in their brain.

It would be interesting to investigate in detail what part of the brain is activated when a human is preparing and producing an utterance or is trying to understand an utterance just heard. Unfortunately, the problem to discover the way people think while speaking or understanding is tremendously difficult. Indeed, the unique objective way for a researcher to reveal the ways of the human thinking by neurophysiological methods is to synchronously investigate electrical and humoral activities in a multiplicity of places in the human brain.

The brain of any human consists of a tremendous number of neural cells, or neurons. The neurons are not elementary units by themselves, since each of them is connected with hundreds of other neurons by their dendrites and axons in a very complicated manner.

The dimensions of neurons, dendrites, and axons are so small that the techniques of modern neurophysiology provide no practical opportunity to observe each neuron separately. What is more, the intrusion of observing tools inside the brain evidently changes the activity under observation. To understand in detail the internal functioning of the brain, it would be necessary to observe vast groups of neurons activated in the multistage process and trace back a grandiose multiplicity of causal chains.

Then, after this tracing, the researcher would be able to express the detected traces as rules and algorithms presumably used by the human brain. This is what might be called a neurolinguistic model of language processing. However, the extremely large number of possible combinations of signals traced in such a way is well beyond the capability of any computer to handle.

Leaving aside the inadequacy of the modern neurophysical techniques, the mathematical tools for such a modeling are insufficient either. We cannot even hope to get close to the information activity of neuron groups in our brain with rather simple mathematical models. According to the modern point of view, the neurons are complicated elements of logical type. Thus, a modeling system should contain elements of the switching circuitry. However, neither deterministic nor stochastic theory of approximation by such a circuitry has been developed well nowadays.

About 30 years ago, neural networks were proposed as a tool of artificial intelligence research. They consist of elements referred to as formal neurons. These are standard logical computational units with rather limited abilities. There have been numerous studies attempting to apply neural networks to various tasks of artificial intelligence and particularly to language processing. If there are some statistical trends in phenomena under observation and modeling, these trends can be recognized and taken into account by this technique.

However, for computational linguistics the problem of how to choice the input and output signals for modeling of the brain activity remains unclear. If we propose some specific inner representations for these purposes, then the formal neurons will only model our inventions rather than real language processes.

For this reason, without revolutionary changes in the research techniques and new approaches for treatment of observable data, neurolinguistic models of natural language understanding are unlikely to give good results in the nearest future. At present, only very crude features of brain activity can be observed effectively, such as which areas within the brain show neuronal activity associated with human memory, general reasoning ability, and so forth.

PSYCHOLINGUISTIC MODELS

Psycholinguistics is a science investigating the speech activity of humans, including perception and forming of utterances, via psychological methods. After creating its hypotheses and models, psycholinguistics tests them through psychological experiments. Therefore, psycholinguistics is similar to linguistics in its objects of investigation and similar to psychology in its methods.

Here is an example of a psycholinguistic experiment. The subject, i.e., the person under observation, is given, one by one, a sequence of specially selected words as input stimuli. Then the subject is prompted to give the response to each word as any other word first coming to his or her mind. The pairs “stimulus—response” are recorded: rosaroja, padremadre, manopie, etc. Based on such experiments, psycholinguists put forward a hypothesis that various psychological types of people demonstrate specific types of associative choices, so that they can give a true identification of the personality under investigation based exclusively on such tests.

In another experiment, the objectives can be nearer to linguistics as such. The series of utterances with some syntactic ambiguities are given to the human subjects. The time required to the person under test for disambiguation is measured, when he or she selects at last one of possible hypotheses. On the grounds of experimental statistics, a hypothesis is advanced how a human can understand such constructions and what is the humans’ way for the disambiguation.

Psycholinguistics tries also to describe the teaching of native and not native language, social influence of a speech on humans, and so forth. In such a way, psycholinguistics aims to explain some purely psychological features of people concerning their speech behavior. Psychological features, in their turn, are closely related to the social behavior of a person. Two kinds of activities, both of very high complexity, are studied in parallel, and each aspect of the science exploits techniques and suggestions from the other.

Therefore, psycholinguistics usually does not have linguistic ideas of its own. It adopts them from various branches of linguistics and then uses them for its own purposes without much critical review and feedback. Purely psychological methods well adapted to the problems of linguistics as such have not yet been proposed, so that these methods give little to generalized (holistic) models of language.