nano-JEPA: Democratizing Video Understanding with Personal Computers

The Video Joint Embedding Predictive Architecture (V-JEPA) has shown great promise in self-supervised video representation learning. However, its substantial computational demands, often necessitates powerful GPU clusters, limit accessibility for many researchers. We introduce nano-JEPA, a streamlin...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autores principales: Rostagno, Adrián, Iparraguirre, Javier, Ermantraut, Joel, Tobio, Lucas, Foissac, Segundo, Aggio, Santiago, Friedrich, Guillermo Rodolfo
Formato: Objeto de conferencia
Lenguaje:Inglés
Publicado: 2024
Materias:
Acceso en línea:http://sedici.unlp.edu.ar/handle/10915/176281
Aporte de:

Ejemplares similares