I'm really confused that people don't understand this. It's just predicting the most likely next text token and its trained on most internet text, so why would we expect anything at all different?
How do we know this is regurgitation and not something like an AI summary of top hits ala Bing chat? Is there a reference to source links, if not highly questionable