The current study examined a sensorimotor strategy under time constraints in a situation where there was uncertainty about the value of the target. In the main task, there were two alternative targets, and target values were presented along with an auditory stimulus that signaled the start of the trial, followed by a value if the cursor passes the target within a time constraint. When the time constraint is short, it is desirable to adopt a simple-reaction strategy because the attempt to obtain information about the target’s values will not result in reaching the target in time. In contrast, if there is enough time to recognize the value of the target, it is preferable to accurately recognize the target information and choose the target that has a higher value (i.e., a choice-reaction strategy). Thus, time constraint is a key factor that determines the desired strategies for the participants. Moreover, in the current task, the participants were free to initiate actions at any time after the auditory go-signal, so they could start movements after identifying the target values or before identifying them. This setup allows us to evaluate the optimality of the strategy and examine the characteristics of the selected movement patterns.
By analyzing the modulations in spatiotemporal movement patterns according to time constraints, we obtained two main findings. First, the modulation of the initial reach direction and the total length of trajectories under the time constraint revealed that different motor patterns were selected between the double-target and single-target conditions when the time constraint became relatively long. Second, participants used the choice-reaction strategy even in the tightly time-constrained condition and their performance was lower than that in the condition with a single goal. The fact that the temporal performance (i.e., whether or not reached in time) was significantly lower suggests that this negative effect of multiple alternatives was caused by the choice reaction of not being able to meet the time constraint.
Comparing the cursor trajectories between the double-target and single-target conditions, we found that the initial reach direction was more centered in the double-target condition than in the single-target condition. It is generally known that in a reaching movement, the trajectory is almost straight to the target 21,22. In fact, a trajectory of straight movement was often observed in the single-target condition, and if the movement was initiated after the determination of the target, it was likely that the movement would also follow a straight trajectory in the double-target condition. However, we found that the initial reach direction was closer to the middle direction of the two targets under a relatively long time constraint, suggesting that the valuation process between the targets may have interfered with the movement trajectory.
The question that arises here is whether this interference is due to unintentional or intentional control, but a view that supports each is possible. Many previous studies have reported that when initiating a movement in the presence of multiple potential targets, the initial movements are directed toward the weighted average of the given targets 19,23−28. An unintentional averaging output of discrete motor plans corresponding to each potential target has been proposed as one of the causes of such motor patterns. This is supported by neurological evidence that motor plans for each target are represented simultaneously when there are multiple competing movement targets in the reaching-related areas 29–33.
Most of the studies that provided this evidence used go-before-you-know situations, in which participants were forced to initiate a movement for multiple competing targets and then reach a final goal presented after the movement had begun19,23,25–27,34−38. However, the present study was different from the go-before-you-know situation in that the participants were able to initiate the movement at their own timing. Therefore, it was also possible to start a movement after the movement target had been specified by the participants. Even in such a situation, we observed the centering (albeit imperfect) of the initial movement, which may be attributed to the existence of some advantage of online control 39–45.
One of the possible reasons for the centering of the initial movement is that the participants may have intended to shorten the reaction time since the participants were required to reach the target within a given time constraint and thus, there is an advantage of starting earlier. As shown in a previous study, an overlap between motor planning and execution reduces the reaction time when there are multiple motor goals 46. In addition, reaction time and movement time can be reduced if participants pre-plan the kinematic components of the movements commonly required for both targets before determining the targets 47. Thus, it is possible that the centralized initial movement is a goal-oriented action generation that increases the time available to determine the movement target 42.
Another possibility of the centering tendency of the initial movement is to maintain the capability to respond to target changes during movement execution. Accounting for possible movement corrections that may occur later is an important aspect of motor planning 43, and the state of the movement affects whether or not the goal is changed 44,45. Because the current task requires instantaneous decisions, errors in value processing may occur, and changing the target during movement execution may be necessary. A centering tendency in movement trajectory is advantageous when dealing with a changing target. Therefore, the centered initial movement could reduce the cost of possible later movement corrections, including temporal performance, reach accuracy, and biomechanical costs.
When the time constraint became longer, the reaction time was longer in the double-target condition than in the single-target condition, while the reaction times were close to an intermediate value between the conditions under severe time constraints. Accompanying this change, the probability of reaching the target with the highest score (Phigh−value) increased as the time constraint increased. Therefore, when the reaction time was short, the participant started to move in the predetermined direction as a simple reaction, but when the reaction time was long, the participant started to move after acquiring the value information as a choice reaction. These results showed that the participants varied their strategy to obtain information about the target depending on the given time constraint.
The question then is whether such a strategy switch is optimal or biased in terms of objective reward maximization. When we compared the mean scores according to the level of time constraint between the single-target and double-target conditions, interestingly, we found that participants performing the single-target condition scored higher under the relatively severe time constraint, and this tendency was consistent across participants. This can be attributed to two factors. The first is the difference in temporal performance. In the double-target condition, the probability of reaching the target in time was lower than that in the single-target condition under the relatively more severe time constraint. This reflects, in the double-target condition, the longer reaction time due to the time spent obtaining information about the target, and the lack of a corresponding reduction in the movement time. Second, the difference in the expected value between simple and choice reactions is theoretically less than 10 points. In the case of a simple reaction, the expected value of the target that the participant aims at is 50 points. If the participant performs a choice reaction and selects and heads for the higher scoring of the two targets with perfect accuracy, the expected value of the target is 60 points. Therefore, if the success probability of the double-target condition is less than 5/6 against the success probability of the single-target condition, rather than benefiting from making a choice, a loss is incurred.
Interestingly, the preference of the choice-reaction strategy over the simple-reaction strategy, even when the benefit of the strategy is small, is a cognitive bias similar to the tendency of our previous study to have longer reaction times than the optimal strategy 10. In the present study, avoiding the uncertainty of the outcome possibly caused the preference for the choice-reaction strategy. In the presence of two different values, it is undesirable to start a movement with uncertainty about which value target to go for, and this may lead to a cognitive bias in choosing the better alternative.
Many previous studies have found that risk-seeking decision-making is more likely to occur in motor tasks 6,7,48−50, which can be interpreted from two perspectives: distortion of subjective and objective values (i.e., preference for higher scores) and overestimation of one’s own ability. The distortion of subjective and objective values has been widely confirmed in economic decision-making 51 and suggested to be present in motor tasks as well, although it was shown to be approximated by different functions 5. There is a tendency to choose the highest value that exists in the place. In such cases, it is acceptable to use a choice strategy by selecting the option with the higher reward. In particular, motor tasks may be prone to such preferences, because daily movement decisions require instantaneous decisions.
Another possibility, as mentioned above, is the existence of a bias in the estimation of ability. In the current task, it is necessary to correctly estimate the relationship between reaction time and choice accuracy, but it is not clear whether humans are able to represent the relationship accurately. In the current task, it is necessary to correctly estimate the relationship between reaction time and accuracy of choice, as well as the relationship between the movement time and the reach accuracy; however, it is not clear whether humans can represent them correctly. Given the representation that it is possible to make accurate choices even with a shorter reaction time than actual, it is understandable that the choice-reaction strategy is used frequently in a suboptimal manner.
Presently, the major challenge is the difficulty in separating the effects of distortion of subjective and objective values and misestimation of one's own abilities. It is possible that only one influence is strongly at work, or that both are at work, or that different factors are at work depending on the decision-maker. In our previous study 10, we found that different strategies were selected between conditions in which the scores changed and conditions in which the probabilities changed, even when the expected values were equal. Thus, in future studies, manipulating value from various factors and examining the modulation of the strategy accordingly may contribute to the separation of the two possibilities.
The current study investigated a sensorimotor strategy according to a time constraint in a situation where there was uncertainty about the value of the target. We obtained two main findings. First, the modulation of the movement kinematic patterns under on time constraints revealed that different motor patterns were selected between the double-target condition and single-target conditions when the time constraint became relatively long. This modulation could be due to both unintentional and intentional control, but more importantly, it can be interpreted as a beneficial and adaptive action. Second, we found that participants frequently used the choice-reaction strategy even in the tightly time-constrained condition, and that performance was consistently lower than that in the single-goal condition across participants. The results suggest that there is a consistent cognitive bias among individuals to choose a higher value in situations where there are multiple alternatives with different values. Future studies are required to clarify the causes of this bias.