How do we process videos to feed to a Deep Learning model and train it? Can we borrow concepts from image and text models and combine those to train a video classification model? Yes, we can.
My latest example on keras.io shows you how:
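A common first step in any such recipe is turning a variable-length video into a fixed-length stack of frames before the model sees it. Here is a minimal, stdlib-only sketch of uniform frame sampling (the function name and the repeat-last-frame padding strategy are my own illustration, not necessarily what the keras.io example does):

```python
def sample_frame_indices(total_frames, num_samples):
    """Pick num_samples evenly spaced frame indices from a clip.

    Short clips (fewer frames than num_samples) are padded by
    repeating the last frame index.
    """
    if total_frames <= num_samples:
        idx = list(range(total_frames))
        idx += [total_frames - 1] * (num_samples - total_frames)
        return idx
    step = (total_frames - 1) / (num_samples - 1)
    return [round(i * step) for i in range(num_samples)]

# A 100-frame clip reduced to 10 evenly spaced frames:
print(sample_frame_indices(100, 10))  # [0, 11, 22, 33, 44, 55, 66, 77, 88, 99]
```

The sampled frames can then be run through an image backbone frame-by-frame, and the resulting feature sequence fed to a sequence model, which is the "borrow from image and text models" idea in the post.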
Nice work Sayak!! Added to my to-read list!!
A Transformer variant is coming soon. Stay tuned, Gus!
Recently we have added MoViNets for Action Recognition on Mobile:
https://github.com/tensorflow/models/tree/master/official/vision/beta/projects/movinet
Here’s the one: Video Classification with Transformers
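For readers curious what the Transformer buys here: instead of processing frame features strictly in order, each frame's features can attend to every other frame's. A toy NumPy sketch of single-head self-attention over per-frame feature vectors (random weights and dimensions are hypothetical; the actual example builds this with Keras layers):

```python
import numpy as np

def softmax(x, axis=-1):
    # numerically stable softmax
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def self_attention(frames, rng):
    """Single-head scaled dot-product self-attention.

    frames: (num_frames, d) matrix of per-frame feature vectors.
    Returns a (num_frames, d) matrix where each row mixes
    information from all frames, weighted by attention scores.
    """
    n, d = frames.shape
    # random projection weights for query/key/value (illustration only)
    Wq, Wk, Wv = (rng.standard_normal((d, d)) / np.sqrt(d) for _ in range(3))
    q, k, v = frames @ Wq, frames @ Wk, frames @ Wv
    attn = softmax(q @ k.T / np.sqrt(d))  # (n, n) attention weights
    return attn @ v

rng = np.random.default_rng(0)
feats = rng.standard_normal((10, 16))  # 10 frames, 16-dim features each
out = self_attention(feats, rng)
print(out.shape)  # (10, 16)
```

Mean-pooling the output rows and adding a dense softmax head would give a simple clip-level classifier on top of this.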