AWQ in vLLM (Reddit). I wonder how it does with tensor parallelism on a 70B model vs. llama.