Kamino666

🎯

Focusing

Kamino Kamino666

🎯

Focusing

一位涉猎广泛的小白杨

47 followers · 4 following

Achievements

academic-page Public

HTML MIT License Updated Feb 2, 2026
kamino666.github.io Public

HTML Updated Feb 2, 2026
LaGoVAD-PreVAD Public

[ICLR 26] This repository contains the code and dataset for our paper: Language-guided Open-world Video Anomaly Detection under Weak Supervision (https://arxiv.org/abs/2503.13160)

Python 9 Apache License 2.0 Updated Feb 1, 2026
stagehand-python Public
Forked from browserbase/stagehand-python

The AI Browser Automation Framework

Python Updated Nov 3, 2025
RethinkingVAD Public

This repository contains the codes and datasets for the ArXiv paper: Rethinking Metrics and Benchmarks of Video Anomaly Detection (https://arxiv.org/abs/2505.19022)

Jupyter Notebook 7 MIT License Updated Oct 30, 2025
Adaptive-BLIP2-MM24 Public

This is official implementation of our MM'24 paper: Adaptively Building a Video-Language Model For Video Captioning and Retrieval without Massive Video Pretraining

Python 5 1 Updated Feb 17, 2025
vidat Public
Forked from anucvml/vidat

Video Annotation Tool

Vue MIT License Updated Apr 4, 2024
PEL4VAD Public
Forked from yujiangpu20/PEL4VAD

Official code for "Learning Prompt-Enhanced Context features for Weakly-Supervised Video Anomlay Detection"

Jupyter Notebook MIT License Updated Jul 5, 2023
cifar-pytorch-learning Public
Forked from blindwang/cifar-pytorch-learning

LeNet5、AlexNet、VGG、GoogleNet、ResNet不同网络结构的尝试

Python Updated May 7, 2023
LAVIS-MMVCT Public
Forked from salesforce/LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Python BSD 3-Clause "New" or "Revised" License Updated Feb 25, 2023
video_features Public
Forked from v-iashin/video_features

Extract video features from raw videos using multiple GPUs. We support RAFT and PWC flow frames as well as I3D, R(2+1)D, VGGish, ResNet, CLIP features.

Python 6 1 GNU General Public License v3.0 Updated Nov 10, 2022
watermark-tracer Public

一个基于可视水印检测识别的数字媒体溯源应用系统，是我的大作业项目，包含这个系统以及一个开源的大规模常见水印图像数据集（Large-scale Common Watermark Dataset, LCWD）。输入一个带有可视水印的图片或视频，系统会检测定位到水印所在的区域，然后将其提取出来，然后借助百度AI开放平台的OCR和logo识别以及Bing搜索引擎，溯源到这个图片或视频的源头。

object-detection watermark copyright-protection yolov5 visible-watermark

Python 158 20 GNU General Public License v3.0 Updated Nov 7, 2022
vatex-downloader Public

A simple vatex dataset downloader. 一个简单的VATEX数据集（或其他YouTube视频数据集）的下载器，特别为国内网络环境优化（其实就是断点下载和加上代理的参数）。

downloader vatex

Python 1 Updated Oct 25, 2022
pycocoevalcap Public
Forked from salaniz/pycocoevalcap

Python 3 support for the MS COCO caption evaluation tools

Python Other Updated Jul 22, 2022
mmselfsup Public
Forked from open-mmlab/mmselfsup

OpenMMLab Self-Supervised Learning Toolbox and Benchmark

Python Apache License 2.0 Updated Jun 22, 2022
wx-challenge Public
Forked from WeChat-Big-Data-Challenge-2022/challenge

微信大赛baseline

Python Updated May 25, 2022
learn_cv Public

Python Updated May 16, 2022
learn_cryptography Public

The Python3 implementation of MD5, SHA1 algorithms. Used for learning cryptography.

Python Updated Apr 21, 2022
Video-Captioning-Transformer Public

这是一个基于Pytorch平台、Transformer框架实现的视频描述生成 (Video Captioning) 深度学习模型。视频描述生成任务指的是：输入一个视频，输出一句描述整个视频内容的文字（前提是视频较短且可以用一句话来描述）。本repo主要目的是帮助视力障碍者欣赏网络视频、感知周围环境，促进“无障碍视频”的发展。

pytorch transformer video-captioning

Python 99 18 Apache License 2.0 Updated Mar 12, 2022
dangdang-analyse Public

爬取、分析当当网的图书评论数据，用来做大作业的

Python 3 Updated Dec 5, 2021
torchvggish Public
Forked from harritaylor/torchvggish

Pytorch port of Google Research's VGGish model used for extracting audio features.

Python Apache License 2.0 Updated Nov 3, 2021
S2VT-video-caption Public

An implementation of paper "Sequence to Sequence – Video to Text". This implementation uses the S2VT model to do video captioning(or video description) task.

Python 7 Other Updated Jul 13, 2021
torch_videovision Public
Forked from hassony2/torch_videovision

Transforms for video datasets in pytorch

Python GNU General Public License v3.0 Updated Jun 7, 2021
mmt Public
Forked from gabeur/mmt

Multi-Modal Transformer for Video Retrieval

Python Apache License 2.0 Updated May 10, 2021
CBIR Public
Forked from pochih/CBIR

🏞 A content-based image retrieval (CBIR) system

Python Updated May 10, 2021
Machine-Learning-Notes Public

入门机器学习的笔记库

Jupyter Notebook Updated Mar 27, 2021
pytorch-book Public
Forked from chenyuntc/pytorch-book

PyTorch tutorials and fun projects including neural talk, neural style, poem writing, anime generation (《深度学习框架PyTorch：入门与实战》)

Jupyter Notebook MIT License Updated Dec 22, 2020
CreationEngine Public
Forked from BuleStorm/CreationEngine

C++ OpenGL 模仿我的世界，内容相对完善，随机地图，支持双人联机，代码注释多

C++ GNU General Public License v3.0 Updated Oct 20, 2020
jxpt.cuc.edu-spider Public

Updated Aug 19, 2020
a-PyTorch-Tutorial-to-Image-Captioning Public
Forked from sgrvinod/a-PyTorch-Tutorial-to-Image-Captioning

Show, Attend, and Tell | a PyTorch Tutorial to Image Captioning

Python MIT License Updated Aug 3, 2020

Kamino Kamino666

Achievements

Achievements

academic-page Public

Uh oh!

kamino666.github.io Public

Uh oh!

LaGoVAD-PreVAD Public

Uh oh!

stagehand-python Public

Uh oh!

RethinkingVAD Public

Uh oh!

Adaptive-BLIP2-MM24 Public

Uh oh!

vidat Public

Uh oh!

PEL4VAD Public

Uh oh!

cifar-pytorch-learning Public

Uh oh!

LAVIS-MMVCT Public

Uh oh!

video_features Public

Uh oh!

watermark-tracer Public

Uh oh!

vatex-downloader Public

Uh oh!

pycocoevalcap Public

Uh oh!

mmselfsup Public

Uh oh!

wx-challenge Public

Uh oh!

learn_cv Public

Uh oh!

learn_cryptography Public

Uh oh!

Video-Captioning-Transformer Public

Uh oh!

dangdang-analyse Public

Uh oh!

torchvggish Public

Uh oh!

S2VT-video-caption Public

Uh oh!

torch_videovision Public

Uh oh!

mmt Public

Uh oh!

CBIR Public

Uh oh!

Machine-Learning-Notes Public

Uh oh!

pytorch-book Public

Uh oh!

CreationEngine Public

Uh oh!

jxpt.cuc.edu-spider Public

Uh oh!

a-PyTorch-Tutorial-to-Image-Captioning Public

Uh oh!