This code on GitHub trains Sentence-BERT on an NLI dataset. The evaluation section currently uses STS. Is it possible to change this to accuracy (the correct answer rate for the NLI data)? If so, I would appreciate specific guidance on how to implement this change.