A Homegrown Standout: UniFormer, a Highly Rated Spatiotemporal Representation Learning Model
2022-07-15 16:37:00 【Zilliz Planet】
Produced by: the Towhee technical team
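A joint effort by the Chinese Academy of Sciences, the University of Chinese Academy of Sciences, Shanghai AI Laboratory, SenseTime, and The Chinese University of Hong Kong, the SoTA model UniFormer (UNIFIED TRANSFORMER) performs well on mainstream benchmarks: it reaches 82.9% / 84.8% top-1 accuracy on Kinetics-400 / Kinetics-600, and 60.9% / 71.2% top-1 accuracy on Something-Something V1 / V2. The paper was rated highly on submission and was ultimately accepted at ICLR 2022 (initial review scores of 8, 8, 6, 8, averaging 7.5).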

UniFormer Architecture
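UniFormer proposes a Transformer architecture that integrates 3D convolution with spatiotemporal self-attention, striking a balance between computational cost and accuracy. Unlike conventional Transformers, which use self-attention in every layer, the relation aggregator proposed in the paper handles the redundancy and the dependencies in video separately. In shallow layers, the aggregator uses a small learnable matrix to learn local relations, aggregating token information from a small 3D neighborhood, which greatly reduces computation. In deep layers, the aggregator learns global relations through similarity comparison, so it can flexibly build long-range dependencies between tokens from distant video frames.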
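The contrast between the shallow-layer and deep-layer aggregators can be illustrated with a small pure-Python sketch. This is not the authors' implementation: the function names (`local_aggregate`, `global_aggregate`, `affinity_counts`), the dictionary-based token layout, and the omission of query/key/value projections and multiple heads are all simplifying assumptions made for illustration.

```python
import math
from itertools import product


def local_aggregate(tokens, weights, radius=1):
    """Shallow-layer sketch: mix each token with its small 3D neighborhood.

    tokens  maps a (t, h, w) position to a list of floats (assumed layout).
    weights maps a relative offset (dt, dh, dw) to a scalar. The same table
    is shared by every position and stands in for the small learnable matrix.
    """
    offsets = list(product(range(-radius, radius + 1), repeat=3))
    out = {}
    for (t, h, w), x in tokens.items():
        acc = [0.0] * len(x)
        for dt, dh, dw in offsets:
            nb = (t + dt, h + dh, w + dw)
            if nb in tokens:  # neighbors outside the clip are skipped
                a = weights[(dt, dh, dw)]
                acc = [s + a * v for s, v in zip(acc, tokens[nb])]
        out[(t, h, w)] = acc
    return out


def global_aggregate(tokens):
    """Deep-layer sketch: weight every token by softmax dot-product similarity.

    Projections are omitted (an assumption), so each token acts as its own
    query, key, and value.
    """
    keys = list(tokens)
    dim = len(tokens[keys[0]])
    out = {}
    for i in keys:
        scores = [
            sum(a * b for a, b in zip(tokens[i], tokens[j])) / math.sqrt(dim)
            for j in keys
        ]
        top = max(scores)  # subtract the max for numerical stability
        exps = [math.exp(s - top) for s in scores]
        total = sum(exps)
        acc = [0.0] * dim
        for e, j in zip(exps, keys):
            acc = [s + (e / total) * v for s, v in zip(acc, tokens[j])]
        out[i] = acc
    return out


def affinity_counts(shape, radius=1):
    """Return (local, global) token-pair counts for a T x H x W token grid.

    The local count is an upper bound because border tokens have fewer
    neighbors than interior ones.
    """
    t, h, w = shape
    n = t * h * w
    return n * (2 * radius + 1) ** 3, n * n
```

For a hypothetical 8 x 14 x 14 token grid with a 3 x 3 x 3 neighborhood, `affinity_counts((8, 14, 14))` gives at most 42,336 local token pairs against 2,458,624 global ones. That gap is why restricting the early, high-resolution layers to local aggregation saves so much computation.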
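References:
Model use case: [action-classification/video-swin-transformer]
Paper: [UNIFORMER: UNIFIED TRANSFORMER FOR EFFICIENT SPATIOTEMPORAL REPRESENTATION LEARNING]
Further reading:
[Highly rated paper! UniFormer: a unified Transformer for efficient spatiotemporal representation learning]
[ICLR2022 UniFormer: seamlessly integrating Transformers for a more efficient spatiotemporal representation learning framework]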
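For more project updates and details, follow our project (https://github.com/towhee-io/towhee/blob/main/towhee/models/README_CN.md). Your support is what keeps this labor of love going, so stars, forks, and joining us on Slack are all welcome :)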