Skip to content
View kq-chen's full-sized avatar

Highlights

  • Pro

Organizations

@shikras

Block or report kq-chen

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

tool for turning many repos into a meta repo. why choose many repos or a monolithic repo, when you can have both with a meta repo?

JavaScript 2,106 101 Updated Feb 22, 2025

A generative world for general-purpose robotics & embodied AI learning.

Python 24,062 2,075 Updated Feb 26, 2025

Creation of annotated datasets from scratch using Generative AI and Foundation Computer Vision models

Python 111 6 Updated Feb 25, 2025

A collection of projects designed to help developers quickly get started with building deployable applications using the Anthropic API

TypeScript 7,718 1,306 Updated Feb 24, 2025

[IROS 2024] Learning Human-to-Humanoid Real-Time Whole-Body Teleoperation. [CoRL 2024] OmniH2O: Universal and Dexterous Human-to-Humanoid Whole-Body Teleoperation and Learning

Python 443 28 Updated Feb 21, 2025

SAM with text prompt

Python 1,984 223 Updated Feb 16, 2025

Image augmentation for machine learning experiments.

Python 14,520 2,455 Updated Jul 30, 2024

Official Implementation of SynthTIGER (Synthetic Text Image Generator), ICDAR 2021

Python 509 102 Updated Jun 14, 2024

Infinite Photorealistic Worlds using Procedural Generation

Python 6,245 506 Updated Jan 8, 2025

Recommended based on comfyui node pictures:Joy_caption + MiniCPMv2_6-prompt-generator + florence2

Python 517 29 Updated Feb 6, 2025

Official implementation of the Law of Vision Representation in MLLMs

Python 150 7 Updated Nov 17, 2024

open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.

Python 3,171 275 Updated Nov 5, 2024

SuperPrompt is an attempt to engineer prompts that might help us understand AI agents.

5,829 542 Updated Dec 1, 2024

A lightweight library for PyTorch training tools and utilities

Python 1,685 282 Updated Feb 21, 2025

[CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale

Python 1,096 86 Updated Oct 21, 2024

Long Context Transfer from Language to Vision

Python 362 19 Updated Nov 20, 2024

Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.

Python 2,135 210 Updated Feb 26, 2025

The Memory layer for AI Agents

Python 24,892 2,310 Updated Feb 26, 2025

[CoRL 2024] Open-TeleVision: Teleoperation with Immersive Active Visual Feedback

Python 776 76 Updated Sep 27, 2024

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 14,240 1,457 Updated Dec 25, 2024

MINT-1T: A one trillion token multimodal interleaved dataset.

798 20 Updated Jul 31, 2024

RecordRTC is WebRTC JavaScript library for audio/video as well as screen activity recording. It supports Chrome, Firefox, Opera, Android, and Microsoft Edge. Platforms: Linux, Mac and Windows.

JavaScript 6,679 1,770 Updated May 13, 2024

Android ViewServer and ADB client

Python 1,642 347 Updated Nov 30, 2024
Python 50 8 Updated Jun 13, 2024

A Gradio web UI for Large Language Models with support for multiple inference backends.

Python 42,648 5,513 Updated Feb 18, 2025

Tensor library for machine learning

C++ 11,965 1,145 Updated Feb 25, 2025
Python 21 2 Updated Apr 13, 2024

A pytorch template for beginners based on pytorch_lightning

Python 42 5 Updated Feb 1, 2024

Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Python 3,742 211 Updated Feb 26, 2025

[ICLR 2024 & ECCV 2024] The All-Seeing Projects: Towards Panoptic Visual Recognition&Understanding and General Relation Comprehension of the Open World"

Python 476 17 Updated Aug 9, 2024
Next
Showing results