Published onNovember 6, 2023Object detection: Owlv2-base-patch16-ensemble vs Kosmos-2-Patch14-224GoogleArtificial-IntelligenceObject-DetectionVisual-Question-Answering-(VQA)Owlv2-base-patch16-ensemble: Powerful multi-modal object detection from Google. This post contains an application build and a comparison between this model and Kosmos-2-Patch14-224.
Published onNovember 3, 2023Kosmos-2-Patch14-224: Multi-modal Object DetectionArtificial-IntelligenceMicrosoftObject-DetectionMulti-ModalComprehensive guide to build an object detection application with bounding boxes using the Kosmos-2 model from Microsoft and FastAPI.