Each cell produces 4 default boxes with different sizes and aspect ratios. Every box predicts 4 class scores (cat, dog, person, background) and 4 bounding box offsets. Loss: Cross-entropy for ...