OpenAI made a series of important announcements yesterday, and hidden within them was something that has implications for 3D printing.
The announcement was of a new AI system, GPT-4o. Yes, the ‘o’ is an ‘o’ and not a ‘0’. It’s short for ‘omni’, which is meant to imply the new model’s capabilities are ‘omnipresent’: it apparently is able to handle multiple media formats all at once.
OpenAI published a couple dozen short videos this week showing examples of the incredible feats possible with GPT-4o. If you have time, you should watch them, and be amazed.
However, buried deep within their announcement page, they provide considerably more detail about GPT-4o. There’s a pop-up menu under ‘Explorations of Capabilities’ where you can select from among 17 different applications enabled by the new technology.
In there you’ll see examples of GPT-4o generating new fonts, making movie posters from supplied images, understanding different voices in a meeting, generating handwriting, and many more astonishing applications.
One of them is called ‘3D object synthesis’.
Could this mean that GPT-4o can generate 3D objects? I took a further look at their example to see what’s going on.
Their example starts with generating a 3D rendering via a text prompt. They used ‘A realistic looking 3D rendering of the OpenAI logo with “OpenAI” shown below.’ I presume they uploaded an image of their logo for this prompt.
GPT-4o then produced a nice 3D rendering of the logo.
But then there are ‘4 hidden steps’. We don’t know what these steps were, but I can guess, as you will see.
Later in the sequence they present an animated ‘3D reconstruction from 6 generated images’.
So it seems that the conversation with GPT-4o generated four separate 3D renderings of the logo. These were then combined into a 3D reconstruction. They repeated the process using a design of a seal sitting on a cylindrical base. Both seemed quite successful.
I tried replicating this conversation in my own ChatGPT-4o session, but was unsuccessful. After some research, it turns out that OpenAI has not quite rolled out all the features to everyone yet. While their demos show incredible image generations, they haven’t updated the image generator in ChatGPT yet. I guess we will have to wait a bit more to try this.
However, there is still a question: if it can generate a 3D reconstruction, could it then export the 3D model in a usable format? I’m thinking of a prompt like ‘export this 3D model in STL format’. That might be a hard thing to do, and it is quite possible that errors would be introduced into the export.
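To give a sense of what an STL export involves, here is a minimal Python sketch. It is not anything OpenAI has shown: the tetrahedron mesh and the function names (`facet_normal`, `to_ascii_stl`, `is_watertight`) are my own illustrative assumptions. It writes a mesh out in the ASCII flavor of STL, which is simply a list of triangles with a normal each, and then runs one basic sanity check of the kind a slicer cares about: in a closed ("watertight") surface, every edge is shared by exactly two triangles.

```python
from collections import Counter

# Hypothetical example mesh: a tetrahedron standing in for a reconstructed model.
VERTICES = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (0.0, 1.0, 0.0), (0.0, 0.0, 1.0)]
# Each face lists three vertex indices, ordered counter-clockwise seen from outside.
FACES = [(0, 2, 1), (0, 1, 3), (1, 2, 3), (0, 3, 2)]


def facet_normal(a, b, c):
    """Unit normal of triangle a-b-c, from the cross product (b - a) x (c - a)."""
    ux, uy, uz = (b[i] - a[i] for i in range(3))
    vx, vy, vz = (c[i] - a[i] for i in range(3))
    nx, ny, nz = uy * vz - uz * vy, uz * vx - ux * vz, ux * vy - uy * vx
    length = (nx * nx + ny * ny + nz * nz) ** 0.5
    if length == 0.0:
        # Degenerate (zero-area) triangle: no meaningful normal.
        return (0.0, 0.0, 0.0)
    return (nx / length, ny / length, nz / length)


def to_ascii_stl(vertices, faces, name="model"):
    """Serialize an indexed triangle mesh as ASCII STL text."""
    lines = [f"solid {name}"]
    for face in faces:
        a, b, c = (vertices[i] for i in face)
        normal = facet_normal(a, b, c)
        lines.append("  facet normal {:e} {:e} {:e}".format(*normal))
        lines.append("    outer loop")
        for vertex in (a, b, c):
            lines.append("      vertex {:e} {:e} {:e}".format(*vertex))
        lines.append("    endloop")
        lines.append("  endfacet")
    lines.append(f"endsolid {name}")
    return "\n".join(lines) + "\n"


def is_watertight(faces):
    """True if every edge is shared by exactly two triangles (a closed surface)."""
    edges = Counter()
    for i, j, k in faces:
        for p, q in ((i, j), (j, k), (k, i)):
            edges[frozenset((p, q))] += 1
    return all(count == 2 for count in edges.values())


if __name__ == "__main__":
    print(to_ascii_stl(VERTICES, FACES, name="tetra"))
    print("watertight:", is_watertight(FACES))
```

Dropping a single triangle from `FACES` makes `is_watertight` return `False`. That is one example of the kind of export error I have in mind: the file would still load, but it would describe a surface with a hole in it rather than a printable solid.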
Nevertheless, it appears that OpenAI has taken a big step towards 3D model generation, and at some point in the future they (or someone else) will refine the technology to the point where it can be commonly used.
Via OpenAI