We are pleased to announce Phi-4-reasoning-vision-15B, a 15-billion-parameter open-weight multimodal reasoning model, available through Microsoft Foundry, HuggingFace, and GitHub. Phi-4-reasoning-vision-15B is a broadly capable model that can be used for a wide array of vision-language tasks, such as captioning images, answering questions about images, reading documents and receipts, helping with homework, inferring changes across sequences of images, and much more. Beyond these general capabilities, it excels at math and science reasoning and at understanding and grounding elements on computer and mobile screens. In particular, our model offers appealing value relative to popular open-weight models, pushing the Pareto frontier of the tradeoff between accuracy and compute cost. It performs competitively with much slower models that require ten times or more compute time and tokens, and it is more accurate than similarly fast models, particularly on math and science reasoning.
A fire broke out at an oil depot in a Russian city as a result of a UAV attack (02:39)
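Peace and safety means "peace and safety under the rule of law." This important thesis reveals the essential connection between the rule of law and security, points to the core path of safeguarding long-term stability through the rule of law while coordinating development and security, and identifies a key focus for building a Peaceful China at a higher level.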
Sharpening the Stone
Ways to improve the experience, notable deficiencies, workarounds, and notes about incorporating the software into modern workflows (if possible).