Intel/dpt-hybrid-midas

Indexed: 2025-06-02

Model Details: DPT-Hybrid

Dense Prediction Transformer (DPT) model trained on 1.4 million images for monocular depth estimation.
It was introduced in the paper Vision Transformers for Dense Prediction by Ranftl et al. (2021) and first released in this repository.
DPT uses the Vision Transformer (ViT) as backbone and adds a neck + head on top for monocular depth estimation.
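
A minimal sketch of running depth estimation with this checkpoint follows. It assumes the Hugging Face `transformers` library (its `DPTImageProcessor` and `DPTForDepthEstimation` classes), PyTorch, and Pillow are installed. The helper names `estimate_depth` and `normalize_to_uint8` are hypothetical, and scaling the output to 0..255 is only a visualization choice, not something the model card prescribes.

```python
def normalize_to_uint8(depth_rows):
    """Scale a 2D list of relative depth values to integers in 0..255."""
    flat = [value for row in depth_rows for value in row]
    lo, hi = min(flat), max(flat)
    span = hi - lo
    if span == 0:
        # A constant map carries no contrast; return all zeros.
        return [[0 for _ in row] for row in depth_rows]
    return [[round(255 * (value - lo) / span) for value in row] for row in depth_rows]


def estimate_depth(image_path, model_id="Intel/dpt-hybrid-midas"):
    """Return a 0..255 relative depth map (2D list) for the image at image_path."""
    # Heavy dependencies are imported lazily so the helper above works without them.
    import torch
    from PIL import Image
    from transformers import DPTForDepthEstimation, DPTImageProcessor

    image = Image.open(image_path).convert("RGB")
    processor = DPTImageProcessor.from_pretrained(model_id)
    model = DPTForDepthEstimation.from_pretrained(model_id)
    model.eval()

    inputs = processor(images=image, return_tensors="pt")
    with torch.no_grad():
        # predicted_depth has shape (batch, height, width)
        predicted_depth = model(**inputs).predicted_depth

    # Resize the prediction to the input resolution; PIL's size is (width, height).
    resized = torch.nn.functional.interpolate(
        predicted_depth.unsqueeze(1),
        size=image.size[::-1],
        mode="bicubic",
        align_corners=False,
    )
    return normalize_to_uint8(resized[0, 0].tolist())
```

Calling `estimate_depth("photo.jpg")` (a placeholder path) downloads the weights on first use. The returned values are relative rather than metric, so they are suited to visualization or ordering comparisons within a single image.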

This repository hosts the “hybrid” version of the model described in the paper. DPT-Hybrid differs from DPT in that it uses ViT-hybrid as its backbone and takes some of its activations from that backbone.
This model card was written jointly by the Hugging Face team and Intel.

Model Detail | Description
Model Authors – Company | Intel
Date | December 22, 2022
Version | 1
Type | Computer Vision – Monocular Depth Estimation
Paper or Other Resources | Vision Transformers for Dense Prediction and GitHub Repo
License | Apache 2.0
Questions or Comments | Community Tab and Intel Developers Discord
