-
Southeast University
- Naijing
- [email protected]
- https://orcid.org/0000-0002-5616-9994
- https://zl-he.com
Highlights
- Pro
Lists (17)
Sort Name ascending (A-Z)
Stars
CV/Resume Overleaf Template for PhD Application
A curated list of state-of-the-art research in embodied AI, focusing on vision-language-action (VLA) models, vision-language navigation (VLN), and related multimodal learning approaches.
Release repo for our SLAM Handbook
[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
Monitor Gaussian Splatting additional real-time viewable and differentiable outputs
Ericsii / FAST_LIO_ROS2
Forked from hku-mars/FAST_LIOROS2 version of FAST_LIO2. Welcome to the technical communication discord server discord: https://discord.gg/U3B65MGH8m
FAST-LIVO2: Fast, Direct LiDAR-Inertial-Visual Odometry
NMPC, WBC, state estimation, and sim2real framework for legged robots based on OCS2 and ros-controls
RL Extension Library for Robots, Based on IsaacLab.
Release for Improved Denoising Diffusion Probabilistic Models
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
Unified framework for robot learning built on NVIDIA Isaac Sim
VMamba: Visual State Space Models,code is based on mamba
[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
✨✨Latest Papers on Vision Mamba and Related Areas
The official dataset repository of "MarineInst: A Foundation Model for Marine Image Analysis with Instance Visual Description". ECCV [Oral] 2024.
Graphic notes on Gilbert Strang's "Linear Algebra for Everyone"
An open source Drone Suite for Aerial Robot
hardware and software design of the 250mm autonomous drone
Official implementation of FouriScale (ECCV2024)
This is a curated list of "Embodied AI or robot with Large Language Models" research. Watch this repository for the latest updates! 🔥
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Project Page for "LISA: Reasoning Segmentation via Large Language Model"
A fast and robust global registration library for outdoor LiDAR point clouds.