I believe ChatGPT generally gives accurate answers to most questions. Certainly, it produces answers that are more reliably true than those of a random average person. Obviously it cannot yet do advanced programming tasks, but generally it answers questions accurately.
Prove my position wrong.
What can I ask it that will produce factually incorrect answers?
As a side quest, a much easier one, what can I ask it that would cause it to produce extremely biased answers that fail to do justice to the truth of things?


It gets medical questions wrong 15% of the time.
The problem with your question is that there’s never going to be a question it gets wrong every time, because it’s probabilistic. You might as well ask “what question can I ask my dice that will reliably produce a wrong answer?”
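To put rough numbers on the dice analogy, here is a minimal sketch. It assumes every attempt at a question is independent and has the same error rate, which is a simplification of mine (repeated answers to the same prompt are likely correlated). The function names are made up for illustration, and the 0.15 figure is just the medical error rate quoted above.

```python
def prob_all_wrong(error_rate: float, attempts: int) -> float:
    """Chance that every one of `attempts` independent tries is wrong."""
    return error_rate ** attempts


def prob_at_least_one_wrong(error_rate: float, attempts: int) -> float:
    """Chance that at least one of `attempts` independent tries is wrong."""
    return 1 - (1 - error_rate) ** attempts


if __name__ == "__main__":
    error_rate = 0.15  # assumed per-attempt error rate, taken from the figure above
    for attempts in (1, 3, 10):
        print(
            attempts,
            round(prob_all_wrong(error_rate, attempts), 6),
            round(prob_at_least_one_wrong(error_rate, attempts), 6),
        )
```

Under those assumptions, the chance of getting the same question wrong every time shrinks quickly as you re-ask it, while the chance of seeing at least one wrong answer grows. That is why "a question it always gets wrong" is the wrong thing to look for.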
The article states: “ChatGPT-4o performed best with 84.6% validity”
It is reasonable to assume that GPT 5.5 in thinking mode has significantly reduced the error rate.
It is also worth noting that the diagnostic error rate among real doctors is estimated to be around 5%.
Admittedly a quite old study: Singh, H., Meyer, A. N. D., & Thomas, E. J. (2014). The frequency of diagnostic errors in outpatient care: Estimations from three large observational studies involving US adult populations. BMJ Quality & Safety, 23(9), 727–731. https://doi.org/10.1136/bmjqs-2013-002627
In response to your point: I am mainly interested in probabilistic reliability. If it gives the correct answer 99.9% of the time, it is clearly superior to the vast majority of human beings (with, perhaps, the exception of the best specialists in the most obscure niches), especially given the sheer breadth of topics it can reliably answer questions on.
Interestingly, my question “What was India like before the British arrived?” produces consistently biased and misleading answers, though I haven’t tried it on the new model.