Microsoft's compact language model (SLM) Phi Silica can now understand not just text but also images, making it multimodal. This is a significant update as the model is integrated into Windows 11 on Copilot+ PCs and runs on the local Neural Processing Unit (NPU) built into the processor. Currently, the model only works with Snapdragon X processors, but support for AMD and Intel chips is coming soon.
Rather than adding a separate bulky model for image analysis, Microsoft engineers combined existing components. The foundation remains the already deployed Phi Silica model. For computer vision, they're using the Florence image encoder, which is also used in the preview versions of Windows Recall and AI-enhanced search. Finally, they developed a small additional "Projector" module (just 80 million parameters) that converts Florence's visual data into a format Phi Silica can understand.
This approach allowed Microsoft to avoid deploying another large model on users' devices, saving disk space and reducing computational load. Reusing existing components also reduced training costs and time.
The primary practical application of this new technology is generating image descriptions for blind or visually impaired users. The multimodal Phi Silica can create image descriptions at varying levels of detail for the Narrator feature.
Phi Silica generates brief image descriptions (135 characters on average) in about 4 seconds, while more detailed descriptions (400-500 characters) take about 7 seconds. The current version is optimized for English, but support for other languages is planned.
Brief description:
Detailed description:
Additional technical information about the Phi Silica model update can be found in Microsoft's official blog at this link.
SourceNo comments.
Users of Guests are not allowed to comment this publication.
Microsoft Windows 11, 10, 8.1, 7, XP, Server -
10-10-2024, 09:31PC motherboard schematics
3-05-2024, 12:29Laptop Motherboard Schematics
3-05-2024, 12:29Now MSNEWS.RU application is available in Google
13-03-2025, 17:34April cumulative updates for Windows 11 and
10-04-2025, 11:34Did you find a mistake?
You can report it to the administration.
Press CTRL+Enter while highlighting text