👋 Open to Work

Yasunori Ozaki PRO

alfredplpl

·

https://alfredplpl.github.io/en/index.html

AI & ML interests

Computer Vision, LLM

Recent Activity

updated a model 1 day ago

aidealab/AnimeGen-I2V

updated a Space 2 days ago

aidealab/AnimeGen-Frame-Interpolation

updated a Space 2 days ago

aidealab/AnimeGen-I2V

View all activity

Organizations

upvoted a collection 2 days ago

AnimeGen

AnimeGen Demos and Models • 5 items • Updated 2 days ago • 1

upvoted a collection 5 days ago

FLUX.2

Our second generation of FLUX • 21 items • Updated Apr 6 • 258

upvoted a collection 6 days ago

LingBot-Video

3 items • Updated 6 days ago • 17

upvoted a paper about 2 months ago

MONET: A Massive, Open, Non-redundant and Enriched Text-to-image dataset

Paper • 2605.21272 • Published May 20 • 4

upvoted 4 collections about 2 months ago

MONET - Massive Open Non-redundant, Enriched, Text-to-image

A curated, deduped & recaptioned open image–text dataset of 104.9M samples released under the Apache2.0 licence. https://huggingface.co/blog/jasperai/ • 4 items • Updated May 28 • 11

Bonsai Image

6 items • Updated Jun 4 • 91

Jagle

Jagle: Building a Large-Scale Japanese Multimodal Post-Training Dataset for Vision–Language Models • 5 items • Updated Apr 12 • 2

MobileCLIP2

MobileCLIP2: Mobile-friendly image-text models with SOTA zero-shot capabilities trained on DFNDR-2B • 30 items • Updated Apr 23 • 64

upvoted a paper about 2 months ago

L2P: Unlocking Latent Potential for Pixel Generation

Paper • 2605.12013 • Published May 12 • 36

upvoted 6 papers 2 months ago

Asymmetric Flow Models

Paper • 2605.12964 • Published May 13 • 23

Qwen-Image-VAE-2.0 Technical Report

Paper • 2605.13565 • Published May 13 • 62

Qwen-Image-2.0 Technical Report

Paper • 2605.10730 • Published May 11 • 117

STARFlow2: Bridging Language Models and Normalizing Flows for Unified Multimodal Generation

Paper • 2605.08029 • Published May 8 • 12

Continuous-Time Distribution Matching for Few-Step Diffusion Distillation

Paper • 2605.06376 • Published May 7 • 27

Continuous Latent Diffusion Language Model

Paper • 2605.06548 • Published May 7 • 85

upvoted 2 collections 2 months ago

GenLIP

Model weights of paper "Let ViT Speak: Generative Language-Image Pre-training" • 6 items • Updated May 5 • 8

imabari-dialect-models

今治弁モデル • 6 items • Updated Apr 23 • 2

upvoted a paper 3 months ago

World-R1: Reinforcing 3D Constraints for Text-to-Video Generation

Paper • 2604.24764 • Published Apr 27 • 119

upvoted a collection 3 months ago

MiMo-V2.5

4 items • Updated Apr 27 • 90

upvoted a paper 3 months ago

AVControl: Efficient Framework for Training Audio-Visual Controls

Paper • 2603.24793 • Published Mar 25 • 30