Perceptual Algorithm Engineer (Segmentation and Detection) (J17154)

UBTECH ROBOTICS CORP LTD.

Location

China

Vacancy for

Human

Employment type

Full-time

Necessary education

Higher

Employer provided salary

0¥ per year

Posted at 11.11.2025

Description

Requirements

Our company is actively hiring for a Computer Vision Algorithm Engineer position in Shenzhen, offering exceptional employment opportunities in humanoid robot perception technology. We are looking for talented professionals to develop real-time visual detection and semantic segmentation algorithms for humanoid robot scene understanding. This position provides the chance to work on cutting-edge VSLAM systems and build lightweight semantic maps for navigation systems. If you're searching for impactful careers in robotics perception, this represents one of the most exciting job openings in the humanoid robot industry.

The ideal candidate should meet the following requirements:

 

1. Proficient in semantic/instance segmentation network architecture (DeepLabV3+, Mask2Former, SAM, yolo adaptation model), master multi-scale feature fusion and attention mechanism optimization skills.
2. Familiar with small sample/weakly supervised segmentation schemes (such as Mask-CLIP, FocalClick).
3. In-depth understanding of the 2D/3D detection framework (DETR series, YOLO-v8/v9, PointPillars), and master the optimization methods of rotating frames and intensive scenes.
4. Be familiar with missed detection/false detection suppression strategies (such as timing consistency filtering) in dynamic scenarios.
5. Master the whole process of model training: data enhancement (CutMix/MixUp), loss function design (Focal Loss, Dice Loss), evaluation indicators (mAP, mIoU, PQ).
6. Proficient in using Python and CV libraries (OpenCV, PIL), you must master PyTorch (ability to customize layer/Dataset).

Extra points:
1. Be familiar with near-earth perspective data sets (such as HOMERobot, Habitat-Matterport 3D) or self-built simulation environments.
2. Study the coupling problem of binaural movement and perception: the influence of gait phase on image quality and compensation algorithm.
3. Master RGB-D point cloud segmentation (PointRend, PointGroup) or monocular depth estimation (AdaBins, ZoeDepth).
4. Develop geometric-assisted detection of multiple viewing angles (such as the use of motion parallax to improve the detection of small objects).
5. The visual basic model (VLMs) has been used to improve the generalization ability of zero samples.
6. Those who have top papers are preferred.

 

 

Check out the full job listings here

Needed key skills

  • Algorithms
  • Collaboration
  • Problem-solving skills
  • Python

Bonuses

1. Work on the Core of Embodied AI. 2. Tackle Challenging Real-World Perception Problems. 3. Full-Stack Development from Algorithm to Embedded Deployment.

Share with friends