Our General Models
DINO-XSeek
Referring object detection model based on a multimodal large language model, designed to precisely locate objects based on user-input natural language descriptions.
DINO-X
Supports open-world object detection and segmentation with text, visual, and prompt-free modes for precise outputs like bounding boxes, masks, keypoints, and captions.
Grounding DINO
Leverages text prompts to intelligently identify object locations and confidence, simplifying detection across common, long-tail, and dense scenarios.
T-Rex
Achieves object detection and counting through visual prompts without training, adaptable to various scenarios and excelling in dense and overlapping scenes.
Our Custom Services
By training on a small number of sample images, high-quality visual embeddings can be generated to enable accurate recognition and detection of specific targets. This capability is ideal for complex scenarios such as long-tail category identification, industrial customization, and non-standard object detection, empowering efficient business validation and deployment.
Our Application Scenarios
Clothing Color Change
Supports precise clothing color changes in images, allowing you to quickly generate updated images by specifying target areas and RGB values.
DINO-X Platform - Computer Vision Models for developers and enterprises