int8 model consumes the same GPU memory as default model.

#15

by Iamexperimenting - opened May 19, 2023

May 19, 2023

•

edited May 29, 2023

Hi team, when I'm trying the load flan-t5-xl model I see the same GPU memory is getting consumed. Could you please help me here, I'm sagemaker studio with ml.g4dn.xlarge

for default - it consumes - 11448MiB/15109Mib
for float 16 - it consumes - 7532MiB/15109Mib
for int8 - it consumes - 11448MiB/15109Mib

Thanks

limcheekin

May 25, 2023

I just share a model that might be helpful to you.
https://huggingface.co/limcheekin/flan-t5-xl-ct2

AndyChing

Jun 13, 2023

Hi good day, may i how it consums on RAM when it use on cpu？

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment