
YOLO algorithm improvement, backbone series (1): FcaFormer

Author: Nuist Object Detection

At present, one of the main research directions for designing more efficient vision transformers is to reduce the computational cost of the self-attention module through sparse attention or local attention windows. In contrast, we take a different approach that aims to improve the performance of transformer-based architectures through denser attention patterns. Specifically, we propose forward cross attention for FcaFormer, i.e., reusing the tokens of previous blocks within the same stage. To achieve this, FcaFormer relies on two innovative components: Learnable Scale Factors (LSFs) and a Token Merge and Enhancement module (TME). The LSFs enable efficient processing of cross tokens, while the TME generates representative cross tokens. By integrating these components, the proposed FcaFormer strengthens the interaction between tokens from blocks with potentially different semantics and encourages more information to flow downstream.
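The role of the LSFs can be illustrated with a minimal PyTorch sketch (the class name, initial value, and shapes here are assumptions for illustration, not the paper's code): cross tokens kept from an earlier block are scaled per channel by a learnable factor and appended to the current block's key/value tokens, densifying the attention pattern.

```python
import torch
import torch.nn as nn

class LearnableScaleFactors(nn.Module):
    """Hypothetical sketch: per-channel learnable scaling applied to
    cross tokens from an earlier block before they join attention."""
    def __init__(self, dim, init=1e-2):
        super().__init__()
        # one learnable factor per channel, initialized small
        self.scale = nn.Parameter(init * torch.ones(dim))

    def forward(self, cross_tokens):
        # (B, N, C) * (C,) broadcasts over batch and token dimensions
        return cross_tokens * self.scale

lsf = LearnableScaleFactors(dim=64)
prev_tokens = torch.randn(2, 49, 64)  # tokens saved from a previous block
cur_tokens = torch.randn(2, 49, 64)   # tokens of the current block
# dense attention would attend over current tokens plus scaled cross tokens
kv = torch.cat([cur_tokens, lsf(prev_tokens)], dim=1)
print(kv.shape)  # torch.Size([2, 98, 64])
```

Because the factors start small, the cross tokens initially perturb the attention only slightly and their influence is learned during training.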

Based on forward cross attention (Fca), we design a series of FcaFormer models that achieve the best trade-offs between model size, computational cost, memory cost, and accuracy. For example, without relying on knowledge distillation to boost training, our FcaFormer achieves 83.1% top-1 accuracy on ImageNet with only 16.3 million parameters and about 3.6 billion MACs. Compared with the distilled EfficientFormer, this saves nearly half the parameters and a small amount of computation while improving accuracy by 0.7%.

The overall structure of the FcaFormer model is shown below:

[Figure: overall architecture of the FcaFormer model]

Tutorial for adding the model as a backbone in a YOLOv5 project:

(1) In models/yolo.py of the YOLOv5 project, modify the parse_model function and the _forward_once function of BaseModel

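The typical reason step (1) is needed is that a whole-backbone module returns several feature maps at once, while the stock _forward_once saves one output per layer. As a rough, hypothetical sketch of that kind of change (the toy module and function below are illustrations, not YOLOv5's actual code), list outputs can be flattened so every feature map gets its own slot:

```python
import torch
import torch.nn as nn

class TinyBackbone(nn.Module):
    """Toy stand-in for a backbone that returns several feature maps."""
    def forward(self, x):
        return [x, x.mean(dim=-1, keepdim=True)]

def forward_once(modules, x):
    # Sketch of the _forward_once idea: if a module returns a list of
    # feature maps, store each one separately so later head layers can
    # index them by position; the last map feeds the next module.
    y = []
    for m in modules:
        x = m(x)
        if isinstance(x, list):
            y.extend(x)
            x = x[-1]
        else:
            y.append(x)
    return x, y

mods = [TinyBackbone(), nn.Identity()]
out, saved = forward_once(mods, torch.randn(1, 8))
print(len(saved))  # 3: two backbone maps plus the identity output
```

In the real BaseModel the saved outputs are additionally filtered by self.save; the sketch keeps everything for simplicity.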

(2) Create a new fcaformer.py in the models/backbone directory and add the following code:

[Screenshot: fcaformer.py model code]
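As a minimal, hypothetical sketch of the kind of block fcaformer.py would contain (class name, dimensions, and layer choices are assumptions, not the paper's implementation): a standard transformer block whose attention also consumes scaled cross tokens from earlier blocks in the stage.

```python
import torch
import torch.nn as nn

class FcaBlock(nn.Module):
    """Sketch of a forward cross attention block: tokens kept from
    earlier blocks in the same stage are scaled by learnable factors
    (LSF) and appended to the key/value sequence."""
    def __init__(self, dim, heads=4):
        super().__init__()
        self.norm1 = nn.LayerNorm(dim)
        self.norm2 = nn.LayerNorm(dim)
        self.attn = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.lsf = nn.Parameter(1e-2 * torch.ones(dim))  # learnable scale factors
        self.mlp = nn.Sequential(nn.Linear(dim, 4 * dim), nn.GELU(),
                                 nn.Linear(4 * dim, dim))

    def forward(self, x, cross_tokens=None):
        q = self.norm1(x)
        # queries come from the current block; keys/values also include
        # the scaled cross tokens, giving the denser attention pattern
        if cross_tokens is None:
            kv = q
        else:
            kv = torch.cat([q, cross_tokens * self.lsf], dim=1)
        x = x + self.attn(q, kv, kv, need_weights=False)[0]
        x = x + self.mlp(self.norm2(x))
        return x

blk = FcaBlock(dim=32)
x = torch.randn(2, 16, 32)
prev = torch.randn(2, 16, 32)  # tokens saved from an earlier block
out = blk(x, prev)
print(out.shape)  # torch.Size([2, 16, 32])
```

The paper's TME module for producing representative cross tokens is omitted here; this sketch only shows how cross tokens would enter the attention.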

(3) Import the model in models/yolo.py (import the file first) and modify the parse_model function as follows:

[Screenshot: parse_model modification]
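A hypothetical sketch of what the parse_model branch usually looks like for a custom backbone (the placeholder class and helper below are illustrations, not YOLOv5's actual code): the input channel count is prepended to the args read from the YAML before the module is instantiated.

```python
import torch
import torch.nn as nn

class FcaFormer(nn.Module):
    """Placeholder standing in for `from models.backbone.fcaformer import FcaFormer`."""
    def __init__(self, c1, c2):
        super().__init__()
        self.proj = nn.Conv2d(c1, c2, 1)

    def forward(self, x):
        return self.proj(x)

def build_layer(m, c1, args):
    """Sketch of the branch added inside parse_model: custom backbone
    modules get the input channels prepended to their YAML args."""
    if m is FcaFormer:
        c2 = args[0]        # output channels declared in the YAML
        args = [c1, *args]  # instantiated as FcaFormer(c1, c2, ...)
    return m(*args), c2

layer, c2 = build_layer(FcaFormer, 3, [64])
print(c2)  # 64
```

In the real parse_model this would be an extra `elif m is FcaFormer:` case alongside the existing Conv/C3 handling.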

(4) Create a new configuration file under the models directory: yolov5_fcaformer.yaml

[Screenshot: yolov5_fcaformer.yaml]
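The exact contents of the config are not reproduced in the text; as a hypothetical sketch following the layout of yolov5s.yaml (the FcaFormer entry and its args are assumptions), the Conv/C3 backbone table would be replaced by a single backbone entry:

```yaml
# Hypothetical sketch of yolov5_fcaformer.yaml
nc: 80               # number of classes
depth_multiple: 0.33
width_multiple: 0.50

backbone:
  # [from, number, module, args]
  [[-1, 1, FcaFormer, [512]]]   # single entry replacing the Conv/C3 stack

# head: reuse the head section from yolov5s.yaml, updating its `from`
# indices to match the feature maps returned by the modified backbone
```

The key point is that the head's `from` indices must line up with however many feature maps the modified _forward_once saves.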

(5) Run a quick check: in models/yolo.py, point the --cfg parameter at the newly created yolov5_fcaformer.yaml (or pass it on the command line, e.g. `python models/yolo.py --cfg models/yolov5_fcaformer.yaml`) and run the file.

