Spatial Reasoning Test

Opinion

1hOpinion

AI is failing ‘Humanity’s Last Exam’. So what does that mean for machine intelligence?

The 2,500 questions that make up the exam are specifically designed to probe the outer limits of what today’s AI systems cannot do.

17hon MSN

12 logic puzzles that only smarty pants can solve

It might not seem like there's enough information to solve these logic puzzles—but that's part of the fun!

China Daily Global Edition

Chinese researchers score breakthrough in general artificial intelligence logical reasoning

In performance and functional diversity, the system, TongGeometry, has fully outperformed international benchmarks, including DeepMind's AlphaGeometry. This represents a major step forward in ...

Psychology Today

Is It Time to See Dyslexia as a Superpower?

A new documentary challenges the medical paradigm, framing dyslexia not as a disorder but a distinct cognitive style with its ...

13d

If You Can Handle These Kind Of Problems, You’re Officially “Hyper-Cognitive”

For most people, solving a problem is the reward—the relief of being done, the achievement of having figured it out.

14d

Is Your IQ Off The Charts? If You Know These Answers You Might Be A Genius

Psychological research shows that intolerance of uncertainty limits reasoning ability. Highly intelligent individuals tend to ...

GitHub

A set of prompts for testing LLMs spatial reasoning. The LLMs are wordcels and we need a shape rotator.

TLDR; the LLMs are great at math in N-dimensions (we tested 1, 2, 3, 4, & 5). BUT when it stops being raw math and starts getting physical and visual, they start to ...

Miami Herald

GRE Score Percentiles: What They Mean for Admissions

Each GRE verbal or quantitative reasoning test produces a total score from 130-170 in 1-point increments, where the analytical writing test receives a score between 0 and 6 in half-point increments.

VentureBeat

Google unveils Gemini 3 claiming the lead in math, science, multimodal and agentic AI benchmarks

After more than a month of rumors and feverish speculation — including Polymarket wagering on the release date — Google today unveiled Gemini 3, its newest proprietary frontier model family and the ...

Forbes

AI Has Mastered Words And Images. Now It’s Entering The Physical World

Forbes contributors publish independent expert analyses and insights. Dr. Gerui Wang writes about AI, society, media, and culture. Fei-Fei Li, a recipient of the 2025 Queen Elizabeth Prize for ...

GitHub

VLA 2: Empowering Vision-Language-Action Models with an Agentic Framework for Unseen Concept Manipulation

VLA-2/ ├── experiments/ # Main experimental codes │ ├── robot/ # Core VLA-2 implementation │ │ ├── openvla_utils.py # OpenVLA utility functions │ │ ├── robot_utils.py # Robot interaction utilities │ │ ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results