Update model card with latest A/B test results and llama.cpp.python evaluation 91a2f09 verified zapabobouj commited on Jan 7