Discussion about this post

Rob F.

"It was trivially easy to specify and run these models and to output the results."

I have 3 technical degrees from MIT and don't know enough statistics to follow your explanation.

Give yourself more credit.

SB

How about creating an LLM-based tool that reads papers and flags specific or potential issues? It might not be perfect, but it could be a way to do a little stress testing at scale.
