Your guide to the Thanksgiving table
So lately I've been getting a kick out of asking DALL-E3 for images labeled with text. They're just good enough to be legible, but yet:
The food that gets duplicated seems to vary from spread to spread.
I also asked DALL-E 3 to do the dessert table.
When I asked ChatGPT-4 (which was sending the image queries to DALL-E3) what the labels in the above image say, it answered:
I asked it if the labels were correctly spelled in the image, and it said that it couldn't tell me, that it could only read the text it gave as a prompt, not the text in the image itself. (This is probably the correct answer). But when I asked again, it would say that "Yes, the labels on the desserts in the image are spelled correctly. Each dessert is labeled with its appropriate name, ensuring clarity and correctness in the labeling."
Bard, when asked to read the labels in the dessert image, was not influenced by knowledge of the prompt, and did what I would consider a better job:
The image shows a table topped with a variety of desserts, including:
Pies: Pumpkin Pie, Cockles Pie, Apple Pie, Chess Pie, and Cheese Coales Pie
Cookies: Thinssving, BPOWHIES, BRIINES, and BROESCKE
Brownies: SIEHECADODOVAN and POKY
The label "Posode" is not clear to me, but it may be a reference to the type of table or the type of event that the desserts are being served at.
The desserts are all decorated with pumpkins and leaves, suggesting that this is a Thanksgiving-themed spread.
The image is likely taken in a restaurant or bakery, as the desserts are all professionally displayed.
Here is a faithful reading of all the labels in the image:
Punpkan Cockes Apple
This may not be quite correct (or even nearly correct; I think it pulled Posode out of the blurry background), but it does at least capture the spirit of the original. I do like the way the weird spacing and random code block convey chaos. Maybe Bard does text recognition first and then builds the scene around that, rather than starting by recognizing the scene and inferring probable text. (Or in ChatGPT's case, starting with an assumption about what the image is supposed to contain.)