LCVSL

Official code for Local Compressed Video Stream Learning for Generic Event Boundary Detection.

Libo Zhang, Xin Gu, Congcong Li, Tiejian Luo and Heng Fan

Introduction

In this work, we propose an end-to-end compressed video representation learning method for GEBD. Specifically, we convert the video input into successive frames and use the Gaussion kernel to preprocess the annotations. Meanwhile, we design a spatial-channel attention module (SCAM) to make full use of the motion vectors and residuals to learn discriminative feature representations for P-frames with bidirectional information flow. After that, we propose a temporal contrastive module that use local frames bag as representation to model the temporal dependency between frames and generate accurate event boundaries with group similarity. Extensive experiments have conducted on the Kinetics-GEBD and TAPOS datasets demonstrate that the proposed method performs favorably against the state-of-the-art methods.

Architecture

Usage

Our proposed method is implemented with PyTorch.

1. Environment

pip3 install torch==1.12.1+cu113 torchvision==0.13.1+cu113 torchaudio==0.12.1 --extra-index-url https://download.pytorch.org/whl/cu113
pip3 install -i https://pypi.tuna.tsinghua.edu.cn/simple opencv-python
pip3 install ffmpeg

2. Installation

Clone this repo:

git git@github.com:GX77/LCVSL.git
cd LCVSL

3. Download datasets

4. Training

python3 train.py --config-file config/end_to_end_sidedata_mv_res.yaml /
                 --test-only False /
                 --all-thres False

5. Testing

python3 train.py --config-file config/end_to_end_sidedata_mv_res.yaml /
                 --test-only True /
                 --resume Model_path /
                 --all-thres True

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
.idea		.idea
config		config
datasets		datasets
figures		figures
modeling		modeling
solver		solver
utils		utils
.DS_Store		.DS_Store
README.md		README.md
inference.py		inference.py
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LCVSL

Introduction

Architecture

Usage

1. Environment

2. Installation

3. Download datasets

4. Training

5. Testing

About

Releases

Packages

Languages

GX77/LCVSL

Folders and files

Latest commit

History

Repository files navigation

LCVSL

Introduction

Architecture

Usage

1. Environment

2. Installation

3. Download datasets

4. Training

5. Testing

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages