Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View anitarau's full-sized avatar

Block or report anitarau

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Jupyter Notebook 6 Updated Nov 21, 2025

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 24,399 2,717 Updated Aug 12, 2024

Implementation for the NEJM AI original article "Artificial Intelligence Identifies Factors Associated with Blood Loss and Surgical Experience in Cholecystectomy".

Python 2 1 Updated Feb 27, 2024

[CVPR 2025] MicroVQA eval and 🤖RefineBot code for "MicroVQA: A Multimodal Reasoning Benchmark for Microscopy-Based Scientific Research" code for MicroVQA benchmark and RefineBot method

Python 31 Updated Nov 25, 2025

[ICLR 2025] Video Action Differencing

Python 49 3 Updated Jul 3, 2025
HTML 1 Updated Mar 27, 2025
Python 41 2 Updated Sep 9, 2025

[CVPR 2025] BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific Literature

Python 89 6 Updated Mar 22, 2025

[MedIA'25] Learning multi-modal representations by watching hundreds of surgical video lectures

Python 79 3 Updated Sep 14, 2025

Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks

Python 3,762 617 Updated Jan 15, 2026

Downstream-Dino-V2: A GitHub repository featuring an easy-to-use implementation of the DINOv2 model by Facebook for downstream tasks such as Classification, Semantic Segmentation and Monocular dept…

Jupyter Notebook 269 15 Updated Jun 21, 2023

MedLSAM: Localize and Segment Anything Model for 3D Medical Images

Python 514 25 Updated Apr 30, 2024

Official repository for the ICCV2023 paper "Kick Back & Relax: Learning to Reconstruct the World by Watching SlowTV"

Python 84 5 Updated Mar 5, 2024

Code for "Deconstructing Monocular Depth Reconstruction: The Design Decisions that Matter" (https://arxiv.org/abs/2208.01489)

Python 119 15 Updated Jul 20, 2023
Python 269 26 Updated Jan 14, 2026

[CVPR 2021] Self-supervised depth estimation from short sequences

Python 655 84 Updated Aug 9, 2023

TRI-ML Monocular Depth Estimation Repository

Python 1,273 245 Updated Jul 16, 2023

Official code repository for "Using deep learning to identify the recurrent laryngeal nerve during thyroidectomy", Scientific Reports 2021.

Python 2 Updated Oct 17, 2021

Tensorflow port of Image-to-Image Translation with Conditional Adversarial Nets https://phillipi.github.io/pix2pix/

JavaScript 5,091 1,295 Updated Feb 2, 2021

Visualize Camera's Pose Using Extrinsic Parameter by Plotting Pyramid Model on 3D Space

Python 316 34 Updated Feb 17, 2025

VR-Caps: A Virtual Environment for Active Capsule Endoscopy

C# 202 50 Updated Jun 23, 2022

Official repo for the work titled "SharinGAN: Combining Synthetic and Real Data for Unsupervised GeometryEstimation"

Python 27 11 Updated May 4, 2023

Extremely stupid LabVIEW game (shameless ripoff of the flash helicopter game from the early 2000s)

LabVIEW 1 Updated Nov 4, 2021

PyTorch implementation for 3D Bounding Box Estimation Using Deep Learning and Geometry

Python 489 94 Updated Jan 30, 2024

EndoSLAM Dataset and an Unsupervised Monocular Visual Odometry and Depth Estimation Approach for Endoscopic Videos: Endo-SfMLearner

Python 297 52 Updated Jun 21, 2022

Collection of generative models, e.g. GAN, VAE in Pytorch and Tensorflow.

Python 7,491 2,026 Updated Mar 24, 2024

A technical report on convolution arithmetic in the context of deep learning

TeX 14,608 2,322 Updated Jun 8, 2023