...

Eran Feit Blog posts

How to Use Grounding DINO with Segment Anything Tutorial

grounding dino segment anything tutorial

Introduction In the world of AI-powered computer vision, combining detection, segmentation, and creative editing in a single pipeline is a major breakthrough. The grounding dino segment anything tutorial introduces precisely such a workflow — allowing you to detect arbitrary objects described in text, segment them precisely, and even manipulate them (for example via inpainting or […]

How to Use Grounding DINO with Segment Anything Tutorial Read More »

Subtitle Generator Guide: Transform Speech into Text

Subtitle Generator Guide

Introduction A subtitle generator has become an essential tool for anyone working with audio or video content. As digital communication continues to grow, subtitles help bridge gaps in accessibility, comprehension, and language diversity. Whether you’re creating educational videos, online tutorials, podcasts, or professional presentations, accurately transforming speech into text ensures your message reaches more people.

Subtitle Generator Guide: Transform Speech into Text Read More »

AI Object Removal Using Python: A Practical Guide

AI Object Removal Using Python

Introduction AI Object Removal Using Python has quickly become one of the most practical and exciting applications of modern computer vision. It allows anyone—from developers to digital artists—to automatically erase unwanted objects from images without manually editing pixels. Instead of relying on traditional tools like clone stamping or masking, deep learning models can now detect

AI Object Removal Using Python: A Practical Guide Read More »

Image Captioning using PyTorch and Transformers in Python

Image Captioning using PyTorch and Transformers in Python

Image captioning python is all about teaching a computer to look at a picture and describe it in natural language. Instead of manually writing alt-text or descriptions for every image, you use deep learning models to generate sentences automatically. With a few lines of code in Python, you can load a pre-trained vision–language model, pass

Image Captioning using PyTorch and Transformers in Python Read More »

Object Detection Heatmap for Tracking Moving Dogs

object detection heatmap

Object detection heatmap is a simple idea with a lot of power behind it.Instead of just drawing bounding boxes around objects, you aggregate all those detections into a colorful map that shows where activity is concentrated.Each new detection slightly “warms up” the corresponding region of the frame, so after processing many frames you get a

Object Detection Heatmap for Tracking Moving Dogs Read More »

Automatic Image Annotation with Autodistill and YOLOv8

Automatic Image Annotation with Autodistill and YOLOv8

Automatic image annotation is all about teaching machines to tag images for us.Instead of a human drawing every bounding box and typing every label, models learn to recognize patterns and automatically assign classes like horse, car, or person to each object in a picture or video frame.This drastically reduces the manual work needed to build

Automatic Image Annotation with Autodistill and YOLOv8 Read More »

How to Train Yolo Nas on Custom Dataset (Aquarium)

Yolo-Nas

When you train yolo nas on custom dataset, you’re taking an object detection model that was designed as a strong, general-purpose detector and adapting it to your specific problem. Instead of relying only on COCO-style everyday scenes, you can teach YOLO-NAS to recognize underwater creatures like fish, jellyfish, sharks, or penguins in noisy, low-contrast aquarium

How to Train Yolo Nas on Custom Dataset (Aquarium) Read More »

Eran Feit