[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.

Python 950 54 Updated Aug 5, 2025

djghosh13 / geneval

GenEval: An object-focused framework for evaluating text-to-image alignment

HTML 439 31 Updated Mar 3, 2025

bethgelab / DataTypeIdentification

Code for the ICLR'24 paper: "Visual Data-Type Understanding does not emerge from Scaling Vision-Language Models"

13 Updated Jan 17, 2024

microsoft / LoRA

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python 13,401 894 Updated Dec 17, 2024

RAIVNLab / sugar-crepe

[NeurIPS 2023] A faithful benchmark for vision-language compositionality

Python 91 11 Updated Feb 13, 2024

facebookresearch / genecis

Code and Models for "GeneCIS A Benchmark for General Conditional Image Similarity"

Python 61 4 Updated Jun 12, 2023

tgxs002 / HPSv2

Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis

Jupyter Notebook 659 28 Updated May 24, 2024

navervision / CompoDiff

Official Pytorch implementation of "CompoDiff: Versatile Composed Image Retrieval With Latent Diffusion" (TMLR 2024)

Python 88 3 Updated Feb 2, 2025

ExplainableML / ImageSelect

Code for the paper "If at First You Don't Succeed, Try, Try Again: Faithful Diffusion-based Text-to-Image Generation by Selection"

Python 27 1 Updated Jul 10, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Shyamgopal Karthik sgk98

Achievements

Achievements

Highlights

Block or report sgk98

Stars

naderAsadi / simFlow

ExplainableML / HyperNoise

lucasdegeorge / T2I-ImageNet

AaltoML / BayesVLM

xie-lab-ml / awesome-alignment-of-diffusion-models

aisagarw / awesome-explainable-cv

jialuli-luka / SELMA

ExplainableML / EgoCVR

bethgelab / CiteME

linzhiqiu / t2v_metrics

RockeyCoss / SPO

ExplainableML / ReNO

wangbohan97 / MPS

naver / dust3r

Stability-AI / StableCascade

ExplainableML / Vision_by_Language

mehdidc / compositionality-datasets-merge

VIRL-Platform / VIRL

OpenDriveLab / DriveLM

oripress / CCC

roboflow / awesome-openai-vision-api-experiments

mbzuai-oryx / groundingLMM