Neuromorphic perception systems inspired by biology have tremendous potential in efficiently processing multi-sensory signals from the physical world, but a highly efficient hardware element capable of sensing and encoding multiple physical signals is still lacking. Here, we report a spike-based neuromorphic perception system consisting of calibratable artificial sensory neurons based on epitaxial VO2, where the high crystalline quality of VO2 leads to significantly improved cycle-to-cycle uniformity. A calibration resistor is introduced to optimize device-to-device consistency, and to adapt the VO2 neuron to different sensors with varied resistance level, a scaling resistor is further incorporated, demonstrating cross-sensory neuromorphic perception component that can encode illuminance, temperature, pressure and curvature signals into spikes. These components are utilized to monitor the curvatures of fingers, thereby achieving hand gesture classification. This study addresses the fundamental cycle-to-cycle and device-to-device variation issues of sensory neurons, therefore promoting the construction of neuromorphic perception systems for e-skin and neurorobotics.