Apple researchers have created an AI model that reconstructs a 3D object from a single image, while keeping reflections, highlights, and other effects consistent across different viewing angles. Here ...
Alibaba's new AI model called RynnBrain is focused on powering robots. One video released by Alibaba's DAMO Academy shows a robot identifying fruit and putting it in a basket. Nvidia and Google are ...
The original version of this story appeared in Quanta Magazine. Here’s a test for infants: Show them a glass of water on a desk. Hide it behind a wooden board. Now move the board toward the glass. If ...
Artificial intelligence models don’t have souls, but one of them does apparently have a “soul” document. A person named Richard Weiss was able to get Anthropic’s latest large language model, Claude ...
Abstract: Traditional object detection models often lose the detailed outline information of the object. To address this problem, we propose the Fourier Series Object Detection (FSD). It encodes the ...
Mere hours after OpenAI updated its flagship foundation model GPT-5 to GPT-5.1, promising reduced token usage overall and a more pleasant personality with more preset options, Chinese search giant ...
Abstract: Estimating the poses of new objects is a challenging problem. Although many methods have been developed for instance-level object pose estimation, they often struggle when faced with ...
A monthly overview of things you need to know as an architect or aspiring architect. Unlock the full InfoQ experience by logging in! Stay updated with your favorite authors and topics, engage with ...
Andrew Ng’s startup LandingAI wants to make agentic AI the backbone of enterprise document processing with ADE DPT-2. (Photo by Mark RALSTON / AFP) (Photo credit should read MARK RALSTON/AFP via Getty ...
When Donald Trump published an August 12 letter addressed to the secretary of the Smithsonian Institution, informing him of “a comprehensive internal review” of the shows and explanatory materials at ...
IBM is releasing Granite-Docling-258M, an ultra-compact and cutting-edge open-source vision-language model (VLM) for converting documents to machine-readable formats while fully preserving their ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results