Abstract
Human multisensory grasping movements (i.e., seeing and feeling a handheld object while grasping it with the contralateral hand) are superior to movements guided by each separate modality. This multisensory advantage might be driven by the integration of vision with either the haptic position cue only or with both position and size cues. To contrast these two hypotheses, we manipulated visual uncertainty (central vs peripheral vision) and the availability of haptic cues during multisensory grasping. We showed a multisensory benefit regardless of the degree of visual uncertainty, suggesting that the integration process involved in multisensory grasping can be flexibly modulated by the contribution of each modality. Increasing visual uncertainty revealed the role of the distinct haptic cues. The haptic position cue was sufficient to promote multisensory benefits, as evidenced by faster actions with smaller grip apertures, whereas the haptic size cue was fundamental in fine-tuning the grip aperture scaling. These results support the hypothesis that, in multisensory grasping, vision is integrated with all haptic cues, with the haptic position cue playing the key part. Our findings highlight the important role of nonvisual sensory inputs in sensorimotor control and hint at the potential contributions of the haptic modality in developing and maintaining visuomotor functions.
Significance Statement
The longstanding view that vision is the primary sense we rely on to guide grasping movements relegates the equally important haptic inputs, such as touch and proprioception, to a secondary role. Here, we show that, when visual uncertainty increases during visuo-haptic grasping, the central nervous system exploits distinct haptic inputs about the object position and size to optimize grasping performance. Specifically, we demonstrate that haptic inputs about the object position are fundamental to support vision in enhancing grasping performance, whereas haptic size inputs can further refine hand shaping. Our results provide strong evidence that nonvisual inputs serve an important, previously underappreciated, functional role in grasping.
Introduction
A large proportion of grasping actions are directed toward objects we can sense with multiple modalities. For instance, when grasping with one hand an object we already hold in the other hand, the properties of the object, such as its size and position in space, are provided by both vision and haptics (touch and proprioception). The integration of these redundant sensory cues fosters a consistently superior grasping performance compared with when movements are guided by each modality alone (Camponogara and Volcic, 2019a,b). Even more intriguingly, the same superior grasping performance is achieved when the haptic size cue is not provided and vision is complemented by only the haptic position cue (Camponogara and Volcic, 2021b).
The elusive effect of the haptic size cue in the multisensory integration process might result from two different causes. The superior performance in multisensory grasping might arise from visual and haptic integration at the level of the position cues only, which would reduce the uncertainty about the position of the object in space (Carey and Allan, 1996; Battaglia et al., 2010; Sperandio et al., 2013; Chen et al., 2018). As a consequence, the object size estimation would be solely determined by vision (Camponogara and Volcic, 2021b). Alternatively, the visuo-haptic integration might occur both at the level of the position cues and at the level of the size cues, but the dominance of the more reliable visual size cue would completely overshadow the haptic size cue, making it hard to determine whether the multisensory size information is truly integrated.
The main aim of this study was to contrast these two alternative explanations by disrupting visual information during multisensory grasping. The quality of visual information was manipulated by modulating the participants' gaze direction so that the grasping actions were executed in either central (foveal) or peripheral vision. Because visual acuity sharply declines with retinal eccentricity (Strasburger et al., 2011; Rosenholtz, 2016), estimates of object size and position are noticeably impaired in peripheral compared with central vision (Collier, 1931; Newsome, 1972; Schneider et al., 1978; Thompson and Fowler, 1980; Bock, 1993; Goodale and Murphy, 1997; Brown et al., 2005; Baldwin et al., 2016). Moreover, multisensory integration studies in perception have shown that, as the quality of visual information gradually declines, object size estimation shifts toward more haptically based perceptual judgments (Derrick and Dewar, 1970; Heller, 1983; Ernst and Banks, 2002; Gepshtein and Banks, 2003; Helbig and Ernst, 2007; Van Doorn et al., 2010). It might thus be expected that increasing visual uncertainty through peripheral vision should let the haptic size cue effect emerge also in multisensory grasping.
Compared with movements in central vision, grasping movements in peripheral vision are generally slower, with larger grip apertures and poorer grip aperture scaling (Sivak and MacKenzie, 1990, 1992; Goodale and Murphy, 1997; Watt et al., 2000; Brown et al., 2005; Schlicht and Schrater, 2007; Hesse et al., 2012). Introducing additional haptic cues might thus refine grasping movements in several ways, depending on the contribution of haptic position and size cues. The integration of the haptic position cue would reduce the overall positional uncertainty, which would translate into faster movements and narrower grip apertures. Analogously, the contribution of the haptic size cue would diminish the uncertainty about the object size and would be revealed by an improved grip aperture scaling. However, if the haptic size cue is not part of the integration process, the sensitivity to changes in object size should remain unaffected.
We tested these predictions in two experiments. In the first experiment, we contrasted grasping performance under peripheral vision conditions, with (pVH) or without (pV) additional haptic cues, along with the central vision conditions (V, VH) and a haptic only (H) condition. In the second experiment, we further teased apart the contribution of haptic cues when grasping handheld objects in peripheral vision by selectively withdrawing the haptic size cue and providing the haptic position cue only (pVHP).
Experiment 1
Materials and methods
Participants
Eighteen participants took part in this experiment (four male, age 25.3 ± 8.2). All had normal or corrected-to-normal vision and no known history of neurologic disorders. All of the participants were naive to the purpose of the experiment and were provided with a subsistence allowance. The experiment was undertaken with the understanding and informed written consent of each participant and the experimental procedures were approved by the Institutional Review Board of New York University Abu Dhabi.
Apparatus
The set of stimuli consisted of three 3D-printed rectangular cuboids with depths of 40, 50, and 60 mm, all of the same height (120 mm) and width (25 mm). A chin rest was positioned at the edge of the experimental table and its height was adjusted such that the participants' eyes were 440 mm above the table surface. During the experiment, the three target objects were positioned 350 mm in the sagittal direction with respect to the table's edge. Thus, in the peripheral vision condition, the top of the objects was at ∼45° of eccentricity with respect to the participants' gaze (Fig. 1A). This eccentricity allowed us to increase the visual uncertainty without completely eliminating the availability of visual cues (Goodale and Murphy, 1997; Schlicht and Schrater, 2007). A custom-made eye-tracker was attached to the left rod of the chin rest with a locking arm (JB01291-BWW). The eye-tracker consisted of a modified webcam (Vivitar V49252) with a sampling frequency of 30 Hz. An array of 25 infrared LEDs was positioned on the table, 40 cm in front of and 30 cm to the left of the participant. The activation and deactivation of the LEDs were controlled by an Arduino Yún board via a custom MATLAB (MathWorks Inc) program, which also computed the pupil coordinates from the sampled eye images. The start position of the right hand was defined by a 5-mm-high rubber bump with a diameter of 9 mm attached at the edge of the table, 450 mm to the right of the participants' midline. The experiment was conducted in a dark room with the experimental table illuminated by an LED desk light (5 W) positioned on the left side of the participant.
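The reported eccentricity follows from the setup geometry. As a rough check, here is a minimal R sketch; the assumption that gaze is directed horizontally at eye height (as toward the fixation point described below) is ours:

```r
# Approximate eccentricity of the object's top relative to a horizontal gaze.
# All values are taken from the apparatus description above.
eye_height    <- 440   # mm, eyes above the table surface
object_dist   <- 350   # mm, sagittal distance of the object
object_height <- 120   # mm, height of the cuboids

drop <- eye_height - object_height            # vertical drop to object top: 320 mm
ecc  <- atan2(drop, object_dist) * 180 / pi   # ~42.4 deg, i.e., roughly 45 deg
```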
A black panel (600 mm wide, 500 mm high) was positioned 450 mm from the participants' position (i.e., behind the object). A small white square (5 × 5 mm) was positioned at the center of the panel, at a height of 440 mm, and acted as the fixation point in the peripheral vision block. A cardboard panel (400 mm wide, 300 mm high) was used to prevent vision of the workspace (but not of the board with the fixation point) between trials in the central and peripheral vision blocks, whereas a pair of occlusion goggles (Red Scientific) was used to prevent vision in the Haptic condition. A 1000-Hz pure tone of 100-ms duration was used to signal the start of the trial, while a 600-Hz tone of the same duration was used to signal its end.
Index, thumb, and wrist movements were acquired on-line at 200 Hz with submillimeter resolution by using an Optotrak Certus system (Northern Digital Inc.). The position of the tip of each digit was calculated during the system calibration phase with respect to two rigid bodies defined by three infrared-emitting diodes attached on each distal phalanx (Nicolini et al., 2014). An additional marker was attached on the styloid process of the radius to monitor the movement of the wrist. The Optotrak system was controlled by the MOTOM toolbox (Derzsi and Volcic, 2018).
Procedure
Participants sat comfortably at the table with their torso touching its edge. All the trials started with the thumb and index digit of the right hand positioned on the start position, the left hand positioned on the left side of the chin rest and the head on the chin rest (Fig. 1A). The height of the chair was adjusted to keep the eyes at a fixed height to maintain the object at a fixed visual angle. Participants were required to perform a precision grip with their right thumb and index digit along the depth axis of the stimulus.
Before each trial, the cardboard panel was placed in front of the participant to cover the workspace, and the object was placed in its position 350 mm in front of the participant. The experimenter then removed the cardboard panel and, after a variable period, the start tone was delivered. The participant had to perform a right-handed reach-to-grasp action toward the object at a natural speed. No reaction time constraints were imposed. Three seconds after the start tone, the end sound was delivered, and the participant had to move the right hand back to the start position. The cardboard was then placed in front of the participant, the object was set to the new required size, and the next trial started.
Five different conditions (Fig. 1B) were performed: Haptic (H), Visual (V), Visuo-Haptic (VH), Peripheral Vision (pV), and Peripheral Vision plus Haptic (pVH). In the H condition, vision was prevented for the whole duration of the condition. Before each trial, the experimenter signaled to the participant to hold the object with their left hand along its depth axis at its base (i.e., sense its size and position by means of touch and proprioception). In the V condition, as soon as the cardboard was removed, the experimenter instructed the participant to look at the object, which was in the central visual field (the left hand was kept on the table close to the chin rest). In the VH condition, the participant had to hold the object at its base with their left hand and look at the object. The pV and pVH conditions were identical to the V and VH conditions except that participants were instructed to look at the fixation point instead of foveating the object, so that the target object was always in the visual periphery (Fig. 1A). Whereas in the pV condition only peripheral vision was available, in the pVH condition, participants were asked to also hold the object at its base with their left hand. Eye fixations in these two conditions were monitored with the eye-tracker, which started sampling as soon as the experimenter placed the cardboard panel between the participant and the object (the cardboard height was lower than the fixation point, but high enough to cover the target object), and stopped when the end-of-trial sound was delivered. If the algorithm detected an eye movement of ∼10 mm (∼1.3° of visual angle) in the horizontal or vertical direction from the fixation point, the trial was discarded and repeated later in the condition. The five conditions were divided into two main experimental blocks. The H, V, and VH conditions were part of the Central vision block, whereas the pV and pVH conditions were part of the Peripheral vision block.
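For illustration, a minimal sketch of such a fixation check in R (the actual implementation ran online in MATLAB; the variable and function names here are hypothetical):

```r
# Deviation threshold: ~10 mm on the fixation panel corresponds to ~1.3 deg
# of visual angle at the 450 mm panel distance reported above.
panel_dist    <- 450                                          # mm
threshold_mm  <- 10
threshold_deg <- atan2(threshold_mm, panel_dist) * 180 / pi   # ~1.27 deg

# Flag a trial if gaze deviates beyond the threshold in either direction.
# gaze_xy and fixation_xy: 2-element vectors of horizontal/vertical position (mm).
fixation_broken <- function(gaze_xy, fixation_xy, limit = threshold_mm) {
  any(abs(gaze_xy - fixation_xy) > limit)
}
```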
The Central and Peripheral vision blocks were performed in sequence, while the order of the conditions (H, V, VH, pV, and pVH) was randomized within blocks and across participants. The differently sized objects were presented in a random order and ten repetitions were performed for each object size and condition, which led to a total of 150 trials per participant. To get accustomed to the task, participants underwent a training session of ten trials before each condition, for a total of 50 trials.
Data analysis
Kinematic data were analyzed in R (R Core Team, 2020). The raw data were smoothed and differentiated with a third-order Savitzky–Golay filter with a window size of 21 points. These filtered data were then used to compute velocities and accelerations in three-dimensional space for each digit and the wrist. Movement onset was defined as the moment of the lowest, nonrepeating wrist acceleration value before the continuously increasing wrist acceleration values (Volcic and Domini, 2016; Camponogara and Volcic, 2019b), while the end of the grasping movement was defined on the basis of the Multiple Sources of Information method (Schot et al., 2010). We used the criteria that the grip aperture is close to the size of the object, that the grip aperture is decreasing, that the second derivative of the grip aperture is positive, and that the velocities of the wrist, thumb, and index finger are low. Moreover, the probability of a moment being the end of the movement decreased over time to capture the first instance in which the above criteria were met. Trials in which the end of the movement was not captured correctly or in which missing marker samples could not be reconstructed using interpolation were discarded from further analysis. The exclusion of these trials (158 trials, 5.8% in total) left us with 2542 trials.
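As an illustration of this pipeline, here is a minimal R sketch. It assumes 200-Hz position data and uses the filter settings reported above; the per-criterion probability mappings in the movement-end function are illustrative placeholders in the spirit of the Multiple Sources of Information method, not the authors' exact formulation:

```r
library(signal)  # provides sgolayfilt()

fs <- 200  # Optotrak sampling rate (Hz)

# Smooth (m = 0) or differentiate (m = 1, 2) a trace with a third-order
# Savitzky-Golay filter over a 21-sample window; fs^m rescales to seconds.
sg <- function(x, m = 0) sgolayfilt(x, p = 3, n = 21, m = m) * fs^m

# xyz: matrix with one row per sample and columns x, y, z (in mm)
speed_3d <- function(xyz) sqrt(rowSums(apply(xyz, 2, sg, m = 1)^2))
grip_aperture <- function(thumb, index)
  sqrt(rowSums((apply(thumb, 2, sg) - apply(index, 2, sg))^2))

# Movement end: soft criteria are combined multiplicatively and the most
# probable sample is taken as the movement end (Schot et al., 2010).
movement_end <- function(ga, wrist_speed, obj_size) {
  ga_vel <- sg(ga, m = 1)
  ga_acc <- sg(ga, m = 2)
  n <- length(ga)
  p <- plogis(-(abs(ga - obj_size) - 5)) *   # grip aperture close to object size
       plogis(-ga_vel / 10) *                # grip aperture decreasing
       plogis(ga_acc / 100) *                # second derivative positive
       plogis(-(wrist_speed - 50) / 25) *    # low velocity
       (n:1) / n                             # earlier moments more probable
  which.max(p)
}
```

The dependent variables defined below then follow directly, e.g., the peak grip aperture as max(grip_aperture(thumb, index)) and the peak velocity as max(speed_3d(wrist)) up to the movement end.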
We focused our analyses on two dependent variables: the peak grip aperture, defined as the maximum Euclidean distance between the thumb and the index finger, and the peak velocity of the hand movement, defined as the highest wrist velocity along the movement. We analyzed the data using Bayesian linear mixed-effects models, estimated using the brms package (Bürkner, 2017), which implements Bayesian multilevel models in R using the probabilistic programming language Stan (Carpenter et al., 2017). The models included as fixed-effects (predictors) the categorical variable Condition (H, V, VH, pV, and pVH) in combination with the continuous variable Size. The latter was centered before being entered in the models; thus, the estimates of the Condition parameters (βCondition) correspond to the average performance of each Condition. The estimates of the parameter Size (βSize) correspond instead to the change in the dependent variables as a function of the object size. All models included independent random (group-level) effects for subjects. Models were fitted considering weakly informative prior distributions for each parameter to provide information about their plausible scale. We used Gaussian priors for the Condition fixed-effect predictor (peak grip aperture βCondition: mean = 90 and SD = 40; peak velocity βCondition: mean = 1100 and SD = 200). For the Size fixed-effect predictors, we used a Cauchy prior distribution centered at 0 with a scale parameter of 2.5. For the group-level standard deviation parameters and sigmas, we used Student t-distribution priors (peak grip aperture, all SD parameters and sigma: df = 3, scale = 10; peak velocity, all SD parameters and sigma: df = 3, scale = 170). Finally, we set a prior over the correlation matrix that assumes that smaller correlations are slightly more likely than larger ones (LKJ prior set to 2).
For each model we ran four Markov chains simultaneously, each for 16,000 iterations (1000 warm-up samples to tune the MCMC sampler) with the adapt_delta parameter set to 0.9, for a total of 60,000 postwarm-up samples. Chain convergence was assessed using the R̂ statistic.
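A minimal sketch of what such a model specification could look like in brms, using the peak grip aperture priors reported above (the formula, cell-means coding, and variable names are our assumptions, not the authors' code; the peak velocity model would swap in its own Gaussian and Student-t scales):

```r
library(brms)

# d: one row per trial, with pga (peak grip aperture, mm), condition (factor
# with levels H, V, VH, pV, pVH), size_c (object size, centered), subject (id).
f <- bf(pga ~ 0 + condition + condition:size_c +
          (0 + condition + condition:size_c | subject))

priors <- c(
  set_prior("normal(90, 40)", class = "b"),             # condition averages
  # Cauchy(0, 2.5) on each Size slope (one such line per condition level):
  set_prior("cauchy(0, 2.5)", class = "b", coef = "conditionH:size_c"),
  set_prior("student_t(3, 0, 10)", class = "sd"),       # group-level SDs
  set_prior("student_t(3, 0, 10)", class = "sigma"),    # residual SD
  set_prior("lkj(2)", class = "cor")                    # correlation matrix
)

m_pga <- brm(f, data = d, prior = priors,
             chains = 4, iter = 16000, warmup = 1000,
             control = list(adapt_delta = 0.9))
```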
The posterior distributions we obtained represent the probabilities of the parameters conditional on the priors, model, and data; they quantify our belief that the "true" parameter lies within some interval with a given probability. We summarize these posterior distributions by computing their medians and 95% highest density intervals (HDIs). The 95% HDI specifies the interval that includes the true value of a specific parameter with 95% probability. To evaluate the differences between parameters of two conditions, we subtracted the posterior distributions of the βCondition and βSize weights between the specific conditions. The resulting distributions are denoted as credible difference distributions and are again summarized by computing the medians and the 95% HDIs.
For statistical inferences about βSize, we assessed the overlap of the 95% HDI with zero. A 95% HDI that does not span zero indicates that the predictor has an effect on the dependent variable. For statistical inferences about the differences in the model parameters, βCondition and βSize, between conditions, we applied an analogous approach. A 95% HDI of the credible difference distribution that does not span zero is taken as evidence that the model parameters in the two conditions differ from each other. Data and code are available at https://osf.io/dfycg/.
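In R, these summaries can be sketched as follows (coefficient names follow brms defaults for the hypothetical model above; the hdi() helper from the HDInterval package is one of several equivalent options):

```r
library(posterior)   # as_draws_df()
library(HDInterval)  # hdi()

draws <- as_draws_df(m_pga)

# Median and 95% HDI of the average peak grip aperture in the VH condition
median(draws$b_conditionVH)
hdi(draws$b_conditionVH, credMass = 0.95)

# Credible difference distribution: VH versus V
diff_vh_v <- draws$b_conditionVH - draws$b_conditionV
median(diff_vh_v)
hdi(diff_vh_v, credMass = 0.95)  # an HDI excluding zero -> credible difference
```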
Results and discussion
Based on previous results (Camponogara and Volcic, 2019a,b, 2021b), we predict that the multisensory condition in central vision (VH) should exhibit faster grasping movements with smaller peak grip apertures than the V and H unisensory conditions. Likewise, we expect the peripheral vision conditions (pV, pVH) to show a decline in performance with respect to their corresponding central vision conditions (V, VH), because peripheral vision is characterized by a higher visual uncertainty. However, two main scenarios are considered for the peripheral vision conditions. If haptic size is largely involved in the control of grasping, we expect faster movements, with narrower peak grip apertures and a better grip aperture scaling in pVH compared with pV. If haptic size does not play a relevant role, we expect actions in pVH to be faster and with narrower peak grip apertures than in pV, but with no improvement in grip aperture scaling, that is, the sensitivity to changes in object size would be equivalent to the pV condition.
We confirmed that movements performed in central vision were faster and with a narrower peak grip aperture in the multisensory compared with each unisensory condition (Fig. 2A,C; Camponogara and Volcic, 2019a,b, 2021b). Interestingly, the same pattern of results was found also in peripheral vision (Fig. 2B,D), confirming that haptics and vision are integrated also when vision is degraded. As expected, actions were slower and were performed with a wider grip aperture in peripheral compared with central vision, in both unisensory and multisensory conditions (V vs pV and VH vs pVH; Fig. 2). Interestingly, while the peak grip aperture scaled similarly in V and VH (Fig. 2C), the scaling was stronger in pVH compared with pV (Fig. 2D), suggesting that haptics supports actions differently in central and peripheral vision.
Central vision
In central vision, the peak velocity was modulated according to the available sensory information (Fig. 3A), with an advantage of multisensory over unisensory grasping, and of vision over haptics. The peak velocity was credibly higher in VH compared with V and H, and tended to be credibly higher in V compared with H (Fig. 3B). The peak velocity was not affected by changes in object size in any of the conditions, with slope values ranging between –0.1 and –0.65, corresponding to minimal variations in peak velocity between the smallest and the largest object (∼10 mm/s difference, equivalent to ∼1% of the average peak velocity).
The peak grip aperture was also clearly affected by the available sensory inputs (Fig. 3C). Peak grip aperture was credibly smaller in the VH condition compared with the H condition, and in V compared with the H condition (Fig. 3D). Also, the peak grip aperture in VH tended to be smaller than in the V condition. These results replicate previous findings and further corroborate that the simultaneous availability of visual and haptic inputs leads to a multisensory advantage (Camponogara and Volcic, 2019b, 2021b).
The peak grip aperture scaled with object size in all conditions (Fig. 3E). The scaling was equivalent in the VH and V conditions, and stronger compared with the H condition (Fig. 3F). This can be considered as a sign that, in central vision, the peak grip aperture modulation in multisensory grasping is mainly based on the visual size cue, as suggested by previous studies (Camponogara and Volcic, 2021b).
Comparisons between central and peripheral vision
Additional haptic inputs affected both peak velocity and peak grip aperture also in peripheral vision (Fig. 3A,C). As observed for central vision, holding the object with the contralateral hand facilitated faster movements and reduced grip apertures, again highlighting the beneficial role of haptics.
The concurrent availability of peripheral vision and haptics enabled faster movements compared with when either of the two modalities was presented in isolation (Fig. 3B, pV–pVH and H–pVH comparisons). As expected, reach-to-grasp actions toward peripherally seen objects were slower than those toward centrally seen objects. The peak velocity was credibly lower in pVH compared with VH, and there was a tendency for a credibly lower peak velocity in pV compared with V (Fig. 3B, pV–V and pVH–VH comparisons).
The peak grip aperture credibly increased when the object was in peripheral compared with central vision, both with or without the support of concurrent haptic information (Fig. 3D, pV–V and pVH–VH comparisons). However, the switch from central to peripheral vision increased peak grip apertures more strongly when the grasping behavior was not supported by additional haptic information (pV–V vs pVH–VH). The effect of adding haptic information to peripheral vision resulted in credibly narrower peak grip apertures (Fig. 3D, pV–pVH comparison), whereas adding peripheral vision to haptics led to only a minor improvement (Fig. 3D, H–pVH comparison).
The availability of concurrent haptic size and position cues also partially prevented the typical worsening of the scaling of the grip aperture when grasping is guided only by peripheral vision (Fig. 3E). Object size scaling was credibly weaker in peripheral compared with central vision (Fig. 3F, pV–V and pVH–VH comparisons), but the scaling of the peak grip aperture was credibly stronger in the pVH condition compared with the pV condition (Fig. 3F, pV–pVH comparison), which was, in turn, identical to the H condition (Fig. 3F, H–pVH comparison). It is interesting to note that while in central vision the grip aperture scaled similarly in the unisensory visual and in the multisensory conditions (Fig. 3F, V–VH), the grip aperture in visual periphery scaled more strongly in the multisensory compared with the unisensory visual condition (Fig. 3F, pV–pVH). This suggests that haptic object position and size information are flexibly used according to the quality of visual information.
Figure 4A summarizes all the conditions in terms of peak velocity, peak grip aperture, and the scaling of the peak grip aperture as a function of object size. Conditions from worst (larger grip apertures and lower velocity) to best (smaller grip apertures and higher velocity) grasping performance lie along the diagonal line connecting the top-left to the bottom-right corner, with dot sizes, from smaller to larger, indicating their respective peak grip aperture slopes. Two aspects are again evident here. First, both conditions with peripheral vision (pV and pVH) are inferior to their respective central vision conditions (V and VH). Second, complementing peripheral vision with haptic inputs leads to superior grasping performance compared with when actions are guided only by peripheral vision (pVH vs pV). Interestingly, in peripheral vision, haptics improved the grip aperture, the peak velocity, and the overall scaling of the peak grip aperture to a greater extent than in central vision.
Figure 4B represents the covariation of the wrist velocity and the grip aperture from the start to the end of the movement. The highest value reached by each curve along the horizontal axis represents the point of the movement trajectory at which the peak velocity occurred, and, similarly, the highest value of each curve along the vertical axis represents the peak grip aperture. Just after movement start, the curves clustered into two groups, one including the conditions with haptic information (H, VH, and pVH) and one including those without haptics (V and pV); a sign that the initial movement velocity and grip aperture in multisensory conditions were mainly under haptic control. These groups dissolved before the curves reached the peak velocity and the evolution of each curve was affected by the available sensory information. In contrast, the curves representing the changes in the scaling of the peak grip aperture formed three groups of conditions which stayed separated until movement end (Fig. 4C). The slopes were similar between H and pVH, and between V and VH, with flatter slopes for the first (H, pVH) than for the second group (V, VH). Instead, the pV condition showed a distinct slope profile with very weak scaling which persisted almost until movement end.
Experiment 2
The results of experiment 1 show that, as in central vision, actions toward handheld objects in peripheral vision are performed faster and with narrower grip apertures than those toward only (peripherally) seen objects. This suggests that visual and haptic inputs are successfully integrated even when vision is disrupted. However, the partially restored grip aperture scaling observed in peripheral multisensory grasping could have two different origins, one that incorporates haptic size cues and one that does not. If the haptic size cue is critical for hand shaping in peripheral multisensory grasping, we expect that its removal would yield a peak grip aperture and scaling resembling those observed in the pV condition. Instead, if hand shaping is mainly determined by visual size cues which are improved by the availability of haptic positional information, as seen in central vision (Camponogara and Volcic, 2021b), the haptic position cue should be sufficient to attain the same level of peak grip aperture and scaling as when all haptic cues are provided. As long as the haptic position cue is available, the presence or absence of the haptic size cue should not affect peak velocities, which should be higher than when only peripheral vision is available. To tease apart the relative contribution of these haptic inputs, we systematically manipulated the availability of the haptic size cue. In the Peripheral Vision plus Haptic Position condition (pVHP), we introduced a new set of objects which were identical to those used in experiment 1, but had the lower half replaced by a post whose size did not co-vary with the size of the objects (Fig. 5A). Thus, in the pVHP condition, participants were holding the post with their left hand (Fig. 5B), which provided only haptic positional but no relevant size information, while simultaneously seeing the object in the periphery. This pVHP condition was performed by a new group of participants together with the pV and pVH conditions, which were the same as in experiment 1.
Materials and methods
Participants
Eighteen new participants took part in experiment 2 (six male, age 20.7 ± 3.5). All had normal or corrected-to-normal vision and no known history of neurologic disorders. All of the participants were naive to the purpose of the experiment and were provided with a subsistence allowance. The experiment was undertaken with the understanding and informed written consent of each participant and the experimental procedures were approved by the Institutional Review Board of New York University Abu Dhabi.
Apparatus
The experimental setup was the same as in experiment 1 (Fig. 1A), except that two sets of stimuli were used: the first set was the same as in the first experiment (Fig. 1A), whereas the second set consisted of three rectangular cuboids of 60-mm height supported by a 60-mm-high post which was 10 mm deep and 25 mm wide (Fig. 5A). The upper part of these stimuli was identical to the first set of stimuli and thus varied in depth across trials. The post supporting the upper part had instead a fixed depth.
Procedure
The procedure was the same as for the Peripheral vision block of experiment 1. In the pV and pVH conditions, the first set of objects was presented (Fig. 1A). In the pVHP condition, the second set of objects was used (Fig. 5A). In this case, participants held the base of the post that supported the target object with their left hand (Fig. 5B). Thus, while in the pVH condition haptic inputs were informative of both the object size and position, in the pVHP condition haptic inputs provided only positional object information. Therefore, peripheral vision was the only source of object size information.
The order of the conditions (pV, pVH, pVHP) was randomized across participants. Object sizes were randomized within each condition and 15 trials were performed for each object size and condition, which led to a total of 135 trials per participant. Before each condition, participants underwent a training session of ten trials to get accustomed to the task, for a total of 30 trials.
Data analysis
The raw data processing and the statistical analyses were identical to those of experiment 1. Based on the same exclusion criteria, a total of 276 trials (11.3% in total) were excluded, which left us with 2154 trials for the final analysis. As in experiment 1, we focused our analyses on the peak grip aperture and the peak velocity of the hand movement.
Results and discussion
Results showed that movements were performed faster and with a narrower grip aperture in the multisensory conditions (pVH, pVHP) compared with the unisensory (pV) condition. Interestingly, movements were equally fast and with a similar grip aperture either with (pVH) or without (pVHP) the haptic size cue (Fig. 5C,D). However, removing the haptic size cue considerably weakened the grip aperture scaling compared with when both the size and position haptic cues were available (Fig. 5D).
Movements supported by haptic inputs were faster than in the unisensory visual condition (Fig. 6A), with a credibly higher peak velocity in pVH and in pVHP compared with pV (Fig. 6B). Interestingly, as we have observed for central vision (Camponogara and Volcic, 2021b), no differences in peak velocity were found between the pVH and pVHP conditions, confirming that the integration of vision and haptics is mainly concerned with the position of the object. As in experiment 1, peak velocity was insensitive to changes in object size. The size effect spans the [0, –0.13] range, which corresponds to a variation in peak velocity of at most 2.6 mm/s from the smallest to the largest object (∼0.2% of the average peak velocity).
The analysis of the peak grip aperture reaffirmed the advantage of multisensory over unisensory conditions (Fig. 6C). Peak grip aperture was credibly larger in the pV condition than in the pVH condition (Fig. 6D). The peak grip aperture was also credibly larger in pV compared with pVHP, and similar between the pVH and pVHP conditions. Most importantly, providing only haptic positional information was not sufficient to accurately scale the grip aperture according to the object size (Fig. 6E). We found that the peak grip aperture increased credibly less as a function of object size in pV and pVHP compared with pVH, and it was similar between the pV and pVHP conditions (Fig. 6F). Thus, in degraded visual conditions, haptic positional information speeds up movements and decreases grip aperture, but haptic size is essential to modulate the grip aperture according to the object size (Fig. 7A). Notably, the grip apertures and the movement velocities in the pVH and pVHP conditions were almost indistinguishable from the beginning to the end of the movement and clearly separated from the pV condition, emphasizing the specific role of the haptic position cue in improving action performance (Fig. 7B). However, as can be seen in Figure 7C, the haptic size cue was crucial to refine the hand shaping around the object by improving the grip aperture scaling along the whole movement trajectory.
Discussion
There are two key findings of the present research. First, we found that the integration of visual and haptic object features for multisensory guided grasping occurs not only when vision is superior to haptics, but also when vision is disrupted to the extent that it becomes the less reliable modality. Second, we found that the integration of vision and haptics for multisensory guided grasping comprises both position and size cues, with the greater benefits gained by the contribution of the haptic position cue.
Visually guided grasping in central vision clearly outperformed haptically guided grasping, but it was severely degraded when vision was only peripheral. Irrespective of the quality of visual information, we observed pronounced improvements when both vision and haptics were simultaneously available. Multisensory guided movements were faster than movements in the fastest of the unisensory conditions, and grip apertures tended to be smaller than in the smallest of the unisensory conditions. These findings show that the process of multisensory integration for grasping actions obeys the same rules observed in studies on visuo-haptic reaching (Camponogara and Volcic, 2021a) and visuo-haptic perception (Derrick and Dewar, 1970; Heller, 1983; Ernst and Banks, 2002; Gepshtein and Banks, 2003; Helbig and Ernst, 2007; Wijntjes et al., 2009; Van Doorn et al., 2010). Thus, there is evidence in both perception and action that multisensory integration is not a rigid process in which vision simply dominates over haptics (Rock and Victor, 1964; Hay et al., 1965; Rock and Harris, 1967; Power and Graham, 1976), but is instead a flexible process balancing the contributions of vision and haptics depending on the quality of each source of information.
With regard to the role of the separate haptic cues, we found that enriching peripheral visual information with only the haptic position cue was sufficient to increase movement velocity and reduce grip aperture as much as when also the haptic size cue was available. It is known that the localization of objects can be strongly impaired when they are placed in visually eccentric (peripheral) positions (Bock, 1993; Henriques et al., 1998; Henriques and Crawford, 2000; Bartolo et al., 2018). This increased positional uncertainty could be the primary cause of the worsened grasping performance usually observed when only peripheral vision is available (Sivak and MacKenzie, 1990, 1992; Goodale and Murphy, 1997; Watt et al., 2000; Brown et al., 2005; Schlicht and Schrater, 2007; Hesse et al., 2012). Our results clearly support the view that visual and haptic position cues are integrated to reduce the overall positional uncertainty, which positively influences the quality of grasping movements even when visual information is severely degraded (Chen et al., 2018; Camponogara and Volcic, 2021b). This does not exclude though that the uncertainty about object size also affects grasping movements.
The role of the haptic size cue was indeed revealed by how the grip aperture scaled according to object size. When both the haptic position and haptic size cues were provided together with peripheral vision, the scaling of the grip aperture improved with respect to the peripheral vision only condition and was comparable to the scaling observed in the haptics only condition. This could have been an indication that the refined scaling resulted either from a reduced uncertainty about the object size driven by the availability of the haptic size cue, or from a reduced uncertainty about the object position driven by the availability of the haptic position cue. Our results exclude the latter explanation. Providing only the haptic position cue with peripheral vision was not sufficient to induce the level of scaling observed when the haptic size cue was also available. Thus, the haptic size cue played a necessary role, because its removal weakened the scaling of the grip aperture to the level of the peripheral vision only condition. The contributing role of the haptic size cue in reducing the overall size uncertainty is further reinforced by observing the evolution of grip aperture scaling along the whole movement trajectory. Scaling along the trajectory in multisensory peripheral vision conditions was identical to the haptics only condition when the haptic size cue was present and identical to the peripheral vision only condition when the haptic size cue was absent.
An additional aspect worth commenting on concerns the relationship between the peak grip aperture and its scaling. The fact that peak grip aperture scales reliably with changes in object size (with a slope of ∼0.7) is an established property of normal grasping movements (Marteniuk et al., 1990; Jakobson and Goodale, 1991). It has also been shown that in degraded visual conditions (e.g., by removing visual feedback or by switching from binocular to monocular vision) the peak grip aperture increases and the grip aperture scaling weakens (Churchill et al., 2000; Watt and Bradshaw, 2000; Melmoth and Grant, 2006; Keefe and Watt, 2009; Hesse et al., 2016; Keefe et al., 2019). All our results conform with this behavior except for the multisensory condition in which only the haptic position cue was provided together with peripheral vision. Here, the grip aperture scaling decreased heavily without a parallel increase of the peak grip aperture. This means that, if needed, the grip aperture and its scaling can be controlled independently according to the demands of a specific situation, leading to grasping movements of generally higher quality in which collisions with objects are strategically avoided.
The interpretation of the present results is based on the idea that an estimate of object size is necessary for the formation of reach-to-grasp movements. An alternative view, the digit-in-space framework, posits that grasping kinematics follow from the movements of the individual digits toward specific positions in space, which correspond to the grasping points of the digits on the object (Smeets and Brenner, 1999; Verheij et al., 2012; Smeets et al., 2019). Variations in grasping movement execution should thus be expected if haptics, vision, and peripheral vision provide estimates of grasping points that differ in accuracy and/or precision. And, when more than one sense is simultaneously available, movement execution should be expected to improve compared with movements guided by each modality alone. Additionally, when the haptically sensed grasping points are closer to each other than those sensed by vision, the jointly estimated grasping points should be drawn toward the center of the object, making the differences in object size appear less distinct than they actually are, which should directly affect the emerging peak grip aperture and its scaling. Thus, the results presented here are also compatible with the digit-in-space framework. However, Camponogara and Volcic (2021b) previously reported an instance in which the results do not seem to be fully captured by this line of reasoning: the observed benefits on grasping movements in central vision were equal regardless of the congruence between the positions of the haptic and visual grasping points. A further element to be considered is that the improvements observed in multisensory grasping could also be a consequence of more effective sensorimotor transformations (Tagliabue and McIntyre, 2014; Kuling et al., 2016, 2017). Future studies will need to single out edge conditions in multisensory grasping for which these views predict different outcomes.
The associations between the visual and the haptic modality are not innate, but rather characterized by a high degree of plasticity. Vision and haptics achieve calibration during development through constant cross-sensory comparisons (Gori et al., 2008). Moreover, studies on cataract-treated participants showed that the restoration of visual object recognition (Held et al., 2011; Chen et al., 2016) and the acquisition of multisensory integration (Senna et al., 2021) are possible within a brief period after surgery by exploiting the cross-modal interactions between vision and touch. The haptic and visual recalibration is also visible in adults following visuomotor adaptation tasks (Volcic et al., 2013; Wiesing et al., 2021), which might be related to the strong couplings that exist between the senses and movement control (Steinbach and Held, 1968; Bock, 1987; Maiello et al., 2018). Our results complement these findings and raise the intriguing possibility that the haptic modality available during sensorimotor interactions with the environment could be effective in learning or restoring visuomotor functions during development and throughout the lifespan.
In sum, our results clearly support the view that visuo-haptic integration for grasping occurs both at the level of the position cues and at the level of the size cues, confirming the hypothesis that, in optimal visual conditions, the effect of the haptic size cue is usually masked by the dominance of the more reliable visual size cue. When vision was disrupted, both haptic position and haptic size cues played a relevant role in shaping the grasping movements. It is, however, important to note that most of the advantages in multisensory grasping stem from the contribution of the haptic position cue. As previously suggested (Camponogara and Volcic, 2021b), a sensorimotor system can achieve greater robustness if it relies on the integration of visuo-haptic object features that systematically co-occur (e.g., position) rather than on features that can frequently differ between the two sensory modalities because of variations in object shape (e.g., size).
Footnotes
The authors declare no competing financial interests.
This work was supported by the New York University Abu Dhabi (NYUAD) Research Enhancement Fund Grant RE183 and the NYUAD Center for Artificial Intelligence and Robotics, funded by Tamkeen under the NYUAD Research Institute Award CG010.
This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International license, which permits unrestricted use, distribution and reproduction in any medium provided that the original work is properly attributed.