Skip to content
@FoundationVision

FoundationVision

Bytedance's opensource FoundationVision models

Hi there 👋

This is FoundationVision official website repo

Popular repositories Loading

  1. VAR VAR Public

    [NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…

    Jupyter Notebook 6.7k 440

  2. LlamaGen LlamaGen Public

    Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

    Python 1.6k 70

  3. GLEE GLEE Public

    [CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale

    Python 1.1k 86

  4. Infinity Infinity Public

    Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

    Python 981 40

  5. VNext VNext Public

    Next-generation Video instance recognition framework on top of Detectron2 which supports InstMove (CVPR 2023), SeqFormer(ECCV Oral), and IDOL(ECCV Oral))

    Python 610 54

  6. Groma Groma Public

    [ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization

    Python 544 61

Repositories

Showing 10 of 17 repositories
  • Liquid Public

    Liquid: Language Models are Scalable Multi-modal Generators

    FoundationVision/Liquid’s past year of commit activity
    65 MIT 0 4 0 Updated Feb 25, 2025
  • Infinity Public

    Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

    FoundationVision/Infinity’s past year of commit activity
    Python 981 MIT 40 25 1 Updated Feb 23, 2025
  • FlashVideo Public

    FlashVideo: Flowing Fidelity to Detail for Efficient High-Resolution Video Generation

    FoundationVision/FlashVideo’s past year of commit activity
    Python 376 Apache-2.0 23 10 (1 issue needs help) 1 Updated Feb 15, 2025
  • UniRef Public

    [ICCV2023] Segment Every Reference Object in Spatial and Temporal Spaces

    FoundationVision/UniRef’s past year of commit activity
    Python 235 MIT 15 4 0 Updated Feb 14, 2025
  • Goku Public

    Goku: Generative Flow Kit for Unified Image-Video Creation

    FoundationVision/Goku’s past year of commit activity
    JavaScript 11 0 1 0 Updated Feb 11, 2025
  • FoundationVision/flashvideo-page’s past year of commit activity
    HTML 0 0 0 0 Updated Feb 10, 2025
  • Autoregressive-Models-in-Vision-Survey Public Forked from ChaofanTao/Autoregressive-Models-in-Vision-Survey

    The paper collections for the autoregressive models in vision.

    FoundationVision/Autoregressive-Models-in-Vision-Survey’s past year of commit activity
    5 14 0 0 Updated Jan 12, 2025
  • VAR Public

    [NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

    FoundationVision/VAR’s past year of commit activity
    Jupyter Notebook 6,709 MIT 440 46 0 Updated Jan 12, 2025
  • FoundationVision/infinity.project’s past year of commit activity
    HTML 0 0 0 0 Updated Dec 24, 2024
  • GLEE Public

    [CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale

    FoundationVision/GLEE’s past year of commit activity
    Python 1,096 MIT 86 41 2 Updated Oct 21, 2024

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Most used topics

Loading…