> But why ignore a huge body of research in how to write scientific tests of intelligence and cognition?
I'm not saying to ignore it, but we are not dealing with humans. Those tests may give misleading results because you're proposing to use them outside their design envelope. This is an area of research in itself.
Smells like linear algebra exceptionalism.
Is ARC AGI really the "simplest, most basic assessment of fluid intelligence possible"?