Questions tagged [yolo]

YOLO, or You Only Look Once, is a real-time object detection algorithm

YOLO, or You Only Look Once, is a real-time object detection algorithm and special type of CNN (Convolutional Neural Network).

The original 2015 paper describing YOLO can be found here

115 questions
14
votes
3 answers

what is darknet and why is it needed for YOLO object detection?

what is darknet and why is it needed for YOLO object detection ? I read that its a neural network written in C , but why is it needed for YOLO object detection when we have lot of machine learning framework,api like tensorflow,keras,pytorch . Im…
star
  • 1,521
  • 7
  • 20
  • 31
10
votes
2 answers

How does the bounding box regressor work in Fast R-CNN?

In the fast R-CNN paper (https://arxiv.org/abs/1504.08083) by Ross Girshick, the bounding box parameters are continuous variables. These values are predicted using regression method. Unlike other neural network outputs, these values do not represent…
9
votes
1 answer

How does YOLO algorithm detect objects if the grid size is way smaller than the object in the test image?

In YOLO algorithm how do these grids output a prediction if some grids only see a small black portion of the car if the model was trained on datasets with full images?
8
votes
3 answers

Should there be a flat layer in between the conv layers and dense layer in YOLO?

Should there be a flat layer in between the conv layers and dense layer in YOLO? It's something not specified in the paper, but I see most implementations of YOLO on github do this. In my implementation, I do not flatten the 7*7*1024 feature map and…
5
votes
1 answer

Which is the "BEST" deep learning model for "Custom" object detection for images & real time. YOLO v3, v4, v5, EfficientDet?

Whenever I look for object detection model, I find YOLO v3 most of the times and that might be due to the fact that it is the last version created by original authors and also more stable. In 2020, a new author released unofficial version called…
4
votes
1 answer

Preparing ground truth labels for YOLO3

I want to train YOLO3 for a custom dataset that has raw labels in JSON format. Each bounding box in JSON is specified as [x1, y1, x2, y2]. So far, I have converted [x1, y1, x2, y2] to [cx, cy, pw, ph], that is, center x, center y of the bounding…
3
votes
1 answer

Basic doubt regarding "training" of a YOLO model

So I have just recently started exploring machine learning, and for a project I was required to train the YOLO v5 model. I first tried it on the coco128 dataset:https://www.kaggle.com/ultralytics/coco128.. repository of the yolo…
3
votes
1 answer

How to train YOLOV4?

I am going to write yolov4 real-time object detection, and I have to do it for car then vehicle plate number, but it does not have to find plate number if there is no car, first car then number on car, is that possible? Is it okay to use the darknet…
3
votes
1 answer

YOLOv1 algorithm - how to determine predictor responsibility

I am researching Yolo detector, and have read the original paper, but still have some confusion and a few questions regarding the assignment of bounding box predictors to ground truth at training time (in particular calculating the confidence). Some…
monolith
  • 92
  • 5
3
votes
3 answers

predicted bounding boxes that stretch beyond grid cell (Andrew NG CNN course)?

I was following Pr. Andrew Ng course on Course about Convolutional neural network and I have a doubt regarding one of the points he mentions in the Yolo algorithm. In one of the slides he mentioned two key points: 1) For each grid in our $3 \times…
Anjith
  • 961
  • 2
  • 11
  • 20
3
votes
1 answer

How to label overlapping objects for deep learning model training

I am training yolov3 to detect a custom object (chickens). In a lot of my training images I have overlapping chickens (can only see a partial chicken etc). Is there a common practice for how to label the data (bounding box) in these cases? Should…
Sjoseph
  • 163
  • 1
  • 6
3
votes
1 answer

YOLO algorithm - understanding training data

I am taking "Convolutional Neural Networks" on Coursera and it is taught by Andrew Ng. I am in week 3 and confused about YOLO algorithm. I checked the course forums on coursera but I am still not clear and it seems that many people are confused…
user2543622
  • 147
  • 1
  • 11
3
votes
1 answer

Algorithms to do a CTRL+F (find object) on an image

We all know the CTRL+F "Find text..." feature in text editors / browsers. I'd like to study the available algorithms to do something similar on an image. Example of UI/UX: let's say you have an electronic schematic. You draw a rectangle around a…
2
votes
0 answers

Can Darknet be Integrated with Tensorflow and tfx?

Tensorflow and Darknet are deep learning frameworks that work and are configured. Can Darknet be Integrated differently? Is there any framework or way to integrate Darknet within a tfx pipeline, for both inferencing and training models? I do work…
pentanol
  • 161
  • 6
2
votes
1 answer

Training a YOLO-style object detector

tl;dr I'm trying to train a small CNN (two conv layers and two connected layers) to find humans in the COCO dataset. Is my network big enough, and if so, roughly how many epochs of training will it need (there are 64115 training images)? I am trying…
Qwertiops
  • 123
  • 3
1
2 3 4 5 6 7 8