A Comparative Study on Reasoning Patterns of OpenAI's o1 Model • Paper • 2410.13639 • Published Oct 17, 2024 • 19
Early External Safety Testing of OpenAI's o3-mini: Insights from the Pre-Deployment Evaluation • Paper • 2501.17749 • Published Jan 29, 2025 • 14
A Case Study of Web App Coding with OpenAI Reasoning Models • Paper • 2409.13773 • Published Sep 19, 2024 • 7
LLMs Still Can't Plan; Can LRMs? A Preliminary Evaluation of OpenAI's o1 on PlanBench • Paper • 2409.13373 • Published Sep 20, 2024 • 3
Quantization for OpenAI's Whisper Models: A Comparative Analysis • Paper • 2503.09905 • Published Mar 12, 2025 • 7
Performance Comparison of Large Language Models on VNHSGE English Dataset: OpenAI ChatGPT, Microsoft Bing Chat, and Google Bard • Paper • 2307.02288 • Published Jul 5, 2023 • 1
Is GPT-OSS Good? A Comprehensive Evaluation of OpenAI's Latest Open Source Models • Paper • 2508.12461 • Published Aug 17, 2025 • 2
H-CoT: Hijacking the Chain-of-Thought Safety Reasoning Mechanism to Jailbreak Large Reasoning Models, Including OpenAI o1/o3, DeepSeek-R1, and Gemini 2.0 Flash Thinking • Paper • 2502.12893 • Published Feb 18, 2025 • 1
Scaling Down to Scale Up: A Cost-Benefit Analysis of Replacing OpenAI's LLM with Open Source SLMs in Production • Paper • 2312.14972 • Published Dec 20, 2023 • 1
On The Planning Abilities of OpenAI's o1 Models: Feasibility, Optimality, and Generalizability • Paper • 2409.19924 • Published Sep 30, 2024 • 1
Can language agents be alternatives to PPO? A Preliminary Empirical Study On OpenAI Gym • Paper • 2312.03290 • Published Dec 6, 2023 • 1
Vector Search with OpenAI Embeddings: Lucene Is All You Need • Paper • 2308.14963 • Published Aug 29, 2023