Machinelearning

Technology, programming, neural networks. A channel with the latest and most relevant news from the world of IT.

3 years ago
🔄 Caption Anything: Interactive Image Description with Diverse Multimodal Controls

Caption-Anything is a versatile tool that combines image segmentation (SAM), visual captioning, and ChatGPT to generate captions tailored to user preferences through a range of controls. It can produce a descriptive caption for any object in an image.

🖥 Github: https://github.com/ttengwang/caption-anything
⏩ Paper: https://arxiv.org/abs/2305.02677v1
📌 Dataset: https://paperswithcode.com/dataset/cityscapes-3d
🖥 Colab: colab.research.google.com/github/…al.ipynb

@ai_machinelearning_big_data