Popular repositories

  1. micro_diffusion Public

    Official repository for our work on micro-budget training of large-scale diffusion models.

    Python 1.6k 55

  2. simba Public

    Jupyter Notebook 122 6

  3. COALA Public

    COALA: A Practical and Vision-Centric Federated Learning Platform, accepted to ICML'24

    Python 119 4

  4. Woosh Public

    Public release of the Sound Effect Foundation model by Sony AI.

    Python 115 5

  5. raw_image_denoising Public

    Noise Modeling in One Hour: Minimizing Preparation Efforts for Self-supervised Low-Light RAW Image Denoising

    Python 75 6

  6. RAW-Diffusion Public

    [WACV 2025] Official implementation of "RAW-Diffusion: RGB-Guided Diffusion Models for High-Fidelity RAW Image Generation"

    Python 63 4

Repositories

Showing 10 of 59 repositories
  • PAVAS Public

    [CVPR 2026 (Oral)] PAVAS: Physics-Aware Video-to-Audio Synthesis

    0 0 0 0 Updated Apr 14, 2026
  • Woosh Public

    Public release of the Sound Effect Foundation model by Sony AI.

    Python 115 Apache-2.0 5 1 0 Updated Apr 13, 2026
  • LLM2Fx Public

    Large Language Models for Music Post Production

    Python 38 4 1 0 Updated Mar 31, 2026
  • OpenVocabularySELD Public

    [TASLP] Open-Vocabulary Sound Event Localization and Detection with Joint Learning of CLAP Embedding and Activity-Coupled Cartesian DOA Vector

    Python 8 MIT 1 0 0 Updated Mar 25, 2026
  • VibeToken Public

    [CVPR 2026] VibeToken: Scaling 1D Image Tokenizers and Autoregressive Models for Dynamic Resolution Generations

    Python 4 MIT 0 0 0 Updated Feb 25, 2026
  • CoherentAVEdit Public

    Python 3 1 0 0 Updated Feb 12, 2026
  • SAVGBench Public

    SAVGBench: Benchmarking Spatially Aligned Audio-Video Generation

    Python 5 2 0 0 Updated Feb 4, 2026
  • DataCleaning4MSS Public

    Official Repository for "Towards blind data cleaning: A case study in music source separation"

    0 0 0 0 Updated Jan 26, 2026
  • MEGAMI Public

    Accompanying repository for the paper "Automatic Music Mixing Using a Generative Model of Effect Embeddings"

    Python 33 3 0 0 Updated Jan 18, 2026
  • soundreactor_v2a_eval Public

    Evaluation toolkit for video-to-audio generation on SoundReactor.

    Python 2 MIT 0 0 0 Updated Dec 23, 2025
