Pinned Loading
-
Video-LLaMA
Video-LLaMA PublicForked from DAMO-NLP-SG/Video-LLaMA
Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
Python
-
VisionLLM
VisionLLM PublicForked from OpenGVLab/VisionLLM
VisionLLM: Large Language Model is also an Open-Ended Decoder for Vision-Centric Tasks
-
Voice-Identification
Voice-Identification PublicForked from AKBoles/Voice-Identification
Project to explore Speaker and Voice Identification. To follow will be further Speech Recognition tasks.
Jupyter Notebook 1
-
whisper
whisper PublicForked from openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
Python
-
YOLOX
YOLOX PublicForked from Megvii-BaseDetection/YOLOX
YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/
Python
If the problem persists, check the GitHub status page or contact support.