Samsung has launched TRUEBench, a new benchmark designed to measure how well AI systems handle real workplace tasks instead of narrow academic tests. Covering 2,485 scenarios across ten categories and twelve languages, it evaluates everything from quick prompts to long document processing. The scoring is strict, requiring models to meet every condition, which makes the results demanding but more realistic.