AI Tag

2025

02-24

狂读论文：DeepSeek-R1 Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

02-22

狂读论文：Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters

01-23

狂读论文：Test-time Computing- from System-1 Thinking to System-2 Thinking

01-16

狂读论文：blog-scaling-test-time-compute

2024

12-23

最优化导论总结

0%