DeepSeek's new approach combines inference-time scaling, reinforcement learning, and reward modeling. Its AI judge, DeepSeek-GRM, evaluates responses with detailed written critiques. The upcoming DeepSeek R2 model aims to rival top models such as Meta's Llama 4 and could potentially outperform GPT-4.
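
To make the idea of inference-time scaling with a critique-writing judge more concrete, here is a minimal sketch in Python. It is not DeepSeek's implementation: `toy_judge`, `judge_with_scaling`, and `best_of_n` are hypothetical names, and the word-overlap scoring is a stand-in for a real reward model. The only point it illustrates is that sampling the judge several times and aggregating the scores spends more compute at inference time to get a steadier verdict.

```python
import random
from statistics import mean

def toy_judge(prompt, response, rng):
    """Hypothetical stand-in for a generative reward model.

    Returns a (critique, score) pair with the score clamped to 1..10.
    """
    # Toy heuristic (an assumption, not a real reward model):
    # reward word overlap with the prompt, plus a little sampling noise.
    overlap = len(set(prompt.lower().split()) & set(response.lower().split()))
    score = max(1, min(10, 3 + 2 * overlap + rng.choice([-1, 0, 1])))
    critique = f"Shares {overlap} word(s) with the prompt; score {score}/10."
    return critique, score

def judge_with_scaling(prompt, response, k, seed=0):
    """Sample the judge k times and average the scores (inference-time scaling)."""
    rng = random.Random(seed)
    samples = [toy_judge(prompt, response, rng) for _ in range(k)]
    avg_score = mean(score for _, score in samples)
    critiques = [critique for critique, _ in samples]
    return avg_score, critiques

def best_of_n(prompt, candidates, k=8):
    """Return the candidate response with the highest averaged judge score."""
    scored = [(judge_with_scaling(prompt, c, k)[0], c) for c in candidates]
    return max(scored)[1]

if __name__ == "__main__":
    prompt = "what is the capital of France"
    candidates = ["I do not know", "the capital of France is Paris"]
    print(best_of_n(prompt, candidates))
```

Raising `k` costs more judge calls per response but averages out the noise in any single critique, which is the trade-off the term "inference-time scaling" refers to here.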