6 posts tagged with "extension"

Enabling OpenVINO Inference in Podman AI Lab

May 21, 2025 · 3 min read

Engineering Manager

Introduction to Podman AI Lab

Podman AI Lab is an open-source platform designed to simplify the deployment, management, and experimentation of AI workloads using container technology. It provides a user-friendly interface for running, testing, and scaling AI models locally or in the cloud, leveraging the power and flexibility of Podman containers.

What is OpenVINO?

OpenVINO™ (Open Visual Inference and Neural Network Optimization) is an open-source toolkit developed by Intel to accelerate AI inference on a variety of hardware, including CPUs, GPUs, and specialized accelerators. It optimizes deep learning models for fast, efficient inference, making it a popular choice for edge and cloud AI applications.

Using OpenVINO in Podman AI Lab

Podman AI Lab now supports OpenVINO as an inference provider. This means you can:

Select an OpenVINO compatible model when starting an inference server or playground.
Benefit from hardware-accelerated inference on supported Intel devices.
Easily switch between different inference providers (e.g., llama-cpp, OpenVINO) for benchmarking and compatibility testing.

warning

This feature is only available on Intel based systems, as OpenVINO is optimized for Intel hardware. If you are using a non-Intel system, you will not be able to use OpenVINO as an inference provider.

How to use:

Launch Podman AI Lab and navigate to the model deployment or playground section.
When configuring your model, choose an OpenVINO compatible model.
Start the inference server or playground.

Starting an OpenVINO inference server

Click the Podman AI Lab icon in the navigation bar.
In the Podman AI Lab navigation bar, click Models > Services menu item.
Click the New Model Service button on the top right.
Select an OpenVINO compatible model in the list (e.g. OpenVINO/mistral-7B-instruct-v0.2-int4-ov) in the Model list and click the Create Service button.
The inference server for the model is being started and after a while, click on the Open service details button.

OpenVINO inference server details

Using the terminal shell, execute the given curl command and see the inference result output.

Starting a playground with an OpenVINO compatible model

Click the Podman AI Lab icon in the navigation bar.
In the Podman AI Lab navigation bar, click Models > Playgrounds menu item.
Click the New Playground button on the top right.
Select an OpenVINO compatible model in the list (e.g. OpenVINO/mistral-7B-instruct-v0.2-int4-ov) in the Model list and click the Create playground button.
The playground for the model is being started and after a while, a chat interface is displayed.

Initial playground on OpenVINO model

Enter 'What is OpenVINO ?' in the prompt and click the Send button. The OpenVINO model will respond with an answer.

OpenVINO model response in the playground

Consistency with OpenShift AI + OpenVINO

One of the key advantages of using OpenVINO in Podman AI Lab is the consistency it brings when transitioning workloads to OpenShift AI. Both platforms now support OpenVINO, ensuring that:

Models tested and optimized locally in Podman AI Lab will behave the same way when deployed to OpenShift AI.
You can maintain a unified workflow from development to production, reducing surprises and integration issues.
Performance optimizations and hardware acceleration are preserved across environments.

Conclusion

By enabling OpenVINO as an inference provider, Podman AI Lab empowers users to leverage high-performance AI inference both locally and in the cloud, with a consistent experience across platforms like OpenShift AI. This integration streamlines the AI development lifecycle and unlocks new possibilities for deploying efficient, scalable AI solutions.

Supercharge Your Container Development in VS Code with Podman and Podman Desktop

May 5, 2025 · 5 min read

Matt Demyttenaere

Product Manager

Developing containerized applications can sometimes feel complex, but with the right tools, it can be a smooth and efficient process. In this blog post, we'll explore how to leverage the power of Visual Studio Code (VS Code) together with Podman and Podman Desktop to streamline your container development workflow. We'll cover setting up and using two VS Code extensions that integrate with Podman.

VS Code: Your IDE for Container Development

VS Code is a popular and versatile code editor that can be extended to enhance its functionality. For container development, several excellent extensions integrate seamlessly with Podman.

Prerequisites

Before we begin, ensure you have the following installed:

Podman: Follow the installation instructions on the official website.
Podman Desktop: Download and install Podman Desktop from the official website.
Visual Studio Code: Download and install VS Code from the official website.

VS Code Extensions

To integrate VS Code with Podman, we have 2 extensions as options:

Microsoft’s Container Tools extension: The "Container Tools" extension provides excellent support for container-related tasks, including building images, managing containers, and working with Containerfiles and Dockerfiles. Microsoft recently announced that they will be evolving the Docker extension into the Container Tools extension to support other tools like Podman!
Pod Manager: This extension was created by one of the members of our community and is completely open source. It is designed to help you manage Podman containers, images, volumes, and networks directly from the VS Code interface.

While it's unlikely that you would use both extensions simultaneously, comparing them will help you understand the strengths and weaknesses of each, ultimately making it easier to select the one that best fits your needs. So for this blog post, we'll install them one by one.

Option 1: Microsoft’s Container Tools extension

To install the extension:

Open VS Code.
Click on the Extensions icon in the Activity Bar (or press Ctrl+Shift+X or Cmd+Shift+X).
Search for "Container Tools" and install the extension by Microsoft.

Configuring VS Code for Podman

The Container Tools extension usually automatically detects Podman if Docker isn't running, by looking at the DOCKER_HOST environment variable. In Podman Desktop navigate to Settings > Docker Compatibility > Third-Party Tool Compatibility and make sure the option is enabled. Learn more about the Docker Compatibility in our documentation.

enabling docker compatibility in the settings

Option 2: Pod Manager

To install the extension:

Open VS Code.
Click on the Extensions icon in the Activity Bar (or press Ctrl+Shift+X or Cmd+Shift+X).
Search for "Pod Manager" and install the extension by dreamcatcher45.

Using the VS Code Extensions

Now that we have the extensions installed and configured, let's see how to use them.

Working with Containerfiles and Dockerfiles

Both extensions provide syntax highlighting, code completion, and linting for Containerfiles and Dockerfiles. Open a Containerfile in VS Code, and you'll immediately benefit from these features.

You can also build images directly from VS Code:

Right-click on the Containerfile in the Explorer view.
Select "Build Image".
VS Code will prompt you for an image name and tag.
The extension will then build the image using Podman.
After that you will see the built image in the sidebar.

If you are using the CLI commands to build images, you will also see them here.

building a Containerfile in vs code using the microsoft extension

Managing Containers

The extensions also allow you to manage containers directly from VS Code. You can start, stop, restart, and remove containers, as well as view their logs and inspect their configuration.

To view the container logs:

Click on the Container icon in the Activity Bar.
You'll see a list of your containers, images, and networks.
Right-click on a container to perform actions.

using Container Tools extension to view the logs of the container

Similarly using the Pod manager extension we can visually inspect containers, images, and volumes.

Click on the Pod manager icon in the Activity Bar.
You'll see a list of your containers, images, and networks.

using podmanager to view all the running containers, images and volumes

and manage the container lifecycle.

using podmanager to manage the lifecycle of the container

and of course troubleshoot issues with a visual interface.

using podmanager to enter the container

Conclusion

In comparison, both extensions provide a nearly identical set of features, so it is really up to you which UI you prefer. Personally I will stick with the Pod Manager because the logo is a seal 🦭. Remember if you encounter any issues using these tools or with Podman Desktop let us know by starting a discussion or creating an issue.

By combining the power of VS Code, Podman, and Podman Desktop, you can create a streamlined and efficient container development workflow. The VS Code extensions provide excellent integration with Podman, allowing you to manage containers, build images, and work with Containerfiles directly from your code editor. We are excited to see that Microsoft is embracing Podman and building support into their ecosystem. Podman Desktop complements this with a visual interface for managing your container environment and will help to move from your development environment to a production Kubernetes environment. Embrace these tools and elevate your container development experience!

Podman Quadlets with Podman Desktop

January 29, 2025 · 4 min read

Axel Stefanini

Software Engineer

Containers are typically deployed in Kubernetes clusters. However, for smaller-scale use cases such as on a single-node server or during development, Kubernetes can be overkill.

What’s a more lightweight solution for running autonomous applications with multiple interacting containers?

In this blog, we'll dive into what Quadlets are, their benefits, and how to use them within Podman Desktop.

Podman Desktop BootC extension 1.6 Release

January 7, 2025 · 3 min read

Charlie Drage

Software Engineer

BootC extension 1.6 Release! 🎉

BootC (Bootable Container) is an extension for Podman Desktop that builds bootable container disk images. Go from a standard container image to a full bootable-on-a-usb-stick OS!

You can update or install the extension via the Podman Desktop extension catalog.

This release introduces exciting new features and improvements:

Detailed example pages: Each example now has a dedicated page with detailed instructions on how to use it.
Interactive build configuration creator: Easily create your build configuration through a fillable form directly in the GUI.
Experimental Linux VM support: Added support for running Linux VMs on generated images.

Release Details

Examples now have detail pages

Each example now includes a dedicated detail page! Click on More Details in the Examples section to view step-by-step instructions for each example.

example details

Interactive build config creator

No need to manually create a custom build config. Use our interactive build configuration creator to easily generate your own build config through a user-friendly form.

build config interactive

Experimental Linux VM support

Linux support is now available for running virtual machines on generated images! Look for the new Virtual Machine (Experimental) tab or the dedicated VM launch button on the Disk Images page.

linux support

Detailed release changelog

Features 💡

feat: add example details page by @cdrage in #1017
feat: add build config configurator by @cdrage in #1026
feat: add Linux VM experimental support by @cdrage in #1102

Chores 🛠️

chore: remove yarn references by @deboer-tim in #969
chore: update to latest UI library by @deboer-tim in #971
chore: add release process by @deboer-tim in #970
chore: delete packages/backend/yarn.lock by @benoitf in #1001
chore: rename team in CODEOWNERS by @benoitf in #999
chore: refresh dependencies to update to latest versions by @benoitf in #1003
chore: add telemetry for examples by @cdrage in #1098
chore: update bootc-image-builder image by @cdrage in #1078
chore: remove HVF acceleration from AMD64 VM command by @cdrage in #1089
chore: add READMEs to each example by @cdrage in #1014
chore: rename section by @cdrage in #1015
chore: revert back to Vite 5 and update Vitest by @cdrage in #1116

Fixes 🔨

fix: E2E tests workflow failure to install PNPM by @dgolovin in #1085
fix: E2E main workflow node setup step by @dgolovin in #1103
fix: navigation to webview by @cbr7 in #1052
fix: bootc E2E tests by @cbr7 in #998

Documentation 📚

docs: update release doc by @cdrage in #1115

Podman AI Lab - For developers to build AI Applications with LLMs running locally

November 26, 2024 · 9 min read

Philippe Martin

Principal Software Engineer

Red Hat provides an extension to Podman Desktop, Podman AI Lab, which lets developers discover examples of applications by using large language models (LLMs), and gives them a framework to create their own AI-based applications and share them with their team.

We will discover, through this article, the different steps to create our first AI application, and to add it to the catalog of recipes of Podman AI Lab.

For our first experiment, we will work on a micro-service for the podman-desktop.io website. The micro-service would receive the search terms from the website, and would ask the model to find the best matching pages, before returning the result to the website.

my first app

Preparing Podman Desktop and Podman AI Lab

If you haven't done it yet, first install Podman Desktop and its extension Podman AI Lab.

To have a better experience, it is recommended to use the GPU acceleration to serve the model. If you have such a GPU on your machine, you will need to create a Podman machine with the LibKrun provider (on MacOS). More details on the GPU support for Podman AI Lab.

At the time of writing, the GPU support is still experimental on Podman AI Lab. You will need to enable the option on the Preferences to enable it.

a podman machine running using libkrun

GPU support for inference servers preference is enabled

Testing a prompt with a model

Podman AI Lab provides a catalog of open source models that can be used locally. You can go to the Models > Catalog page to download the model of your choice. For this article, we will use the Mistral-7B-instruct model.

Mistral model is downloaded

Once a model is downloaded, we can test and interact with this model to try to find the best prompt for our application. For chat models, Podman AI Lab provides a Playground, so we can test different prompts and validate that the responses of the model are adequate.

Let's start a new playground (from the Models > Playgrounds menu), and send our first prompt:

Give me a list of pages in the website podman-desktop.io related to "build an image"

The model should reply with some list of pages, in a human-readable form (see the screenshot below, for the response we received).

a first prompt with human-readable output

The problem is that the response is in human-readable form, but we don't want the API to return this response as is. We want to have the name and the url of the pages, and send them to the website, so the website can display these pages with its preferred format.

For this, we can try to ask the model to reply with a structured response, with the following prompt:

Give me a list of pages in the website podman-desktop.io related to "build an image" as JSON output as an array of objects with 2 fields name and url

This time, we received a response in JSON format, which is more suitable for our needs.

a prompt with structured output

We don't expect the user to ask such a precise question, and we would prefer to send to the model the exact question of the user, without modifying it in real time. To achieve this, chat models provide a system prompt feature. The system prompt can be defined at the beginning of the chat session.

Podman AI Lab supports this feature, let's restart a Playground session with the following system prompt:

Give me a list of pages in the website podman-desktop.io related to the request as JSON output as an array of objects with 2 fields name and url

Then, send the prompt build an image, to simulate a realistic search input of a user.

We can see in the screenshot below that the model still returns a response suitable for our use case.

a session with a system prompt

Please note that this section was not a course on writing the best prompt, I'm sure you will find much more efficient prompts for this purpose. The purpose of this section is to demonstrate how you can iterate with Podman AI Lab to refine the prompts you want to use for your application.

Testing a recipe

Now that we have a suitable prompt to use for our application, it is time to start our application itself.

Many developers prefer to have a working example of application to start with, and Podman AI Lab provides such examples with a catalog of recipes, visible in the page AI Apps > Recipe Catalog.

Let's select the Chatbot recipe (More details link on the Chatbot card), and start it with the Mistral model (by pressing the Start button and filling the form).

Once the application is started, we can access the list of running apps in the AI Apps > Running page, and we can access the app's UI by clicking on the Open AI App link.

We can test again by typing our prompt (not the one with a system prompt, as the recipe does not support providing a system prompt), and see that the response is very similar to the one received from the playground.

a session on the Chatbot recipe

Back to the recipe's details page, we can access the sources of the recipe by clicking on the Open in VSCode button, the respository's link or the link Local clone.

Structure of a recipe

The entrypoint of a recipe is the file ai-lab.yaml present in the repository of the recipe.

Let's examine the content of this file (the syntax of the file is specified in this documentation) for the chatbot example.

version: v1.0
application:
  type: language
  name: ChatBot_Streamlit
  description: Chat with a model service in a web frontend.
  containers:
    - name: llamacpp-server
      contextdir: ../../../model_servers/llamacpp_python
      containerfile: ./base/Containerfile
      model-service: true
      backend:
        - llama-cpp
      arch:
        - arm64
        - amd64
      ports:
        - 8001
      image: quay.io/ai-lab/llamacpp_python:latest
    - name: streamlit-chat-app
      contextdir: app
      containerfile: Containerfile
      arch:
        - arm64
        - amd64
      ports:
        - 8501
      image: quay.io/ai-lab/chatbot:latest

The file defines two containers, one for the inference server and one for the application itself.

The first container, for the inference server, is generic and can be reused for any app using a chat model.

The second one is the one we are particularly interested in. It defines how the container's image for the application is built. It points to the Containerfile used to build the image, on which we can find the source code for the app: in the app/chatbot_ui.py file.

Looking at the Python source file, we can see that the application uses the streamlit framework for the UI part, and the langchain framework for discussing with the model.

We can adapt this source code, by replacing the UI part with a framework to make the app a REST service, and keep the langchain part.

An interesting part of the source code is that the recipe does not expose to the user the system prompt, but defines one internally (You are world class technical advisor):

prompt = ChatPromptTemplate.from_messages([
    ("system", "You are world class technical advisor."),
    MessagesPlaceholder(variable_name="history"),
    ("user", "{input}")
])

This is exactly what we want to do in our application, we will be able to indicate here the system prompt we have found earlier.

Creating our own app

Adapting the source code for the purpose of our application is out of the scope of this article, let's see the result in our app repository.

As discussed in the previous section, we have replaced the streamlit part with the flask framework to create a REST API with two endpoints: one for the health check on / necessary for Podman AI Lab, and another one on /query, which will be the endpoint on which the micro-service's user will send requests.

We have also indicated our own system prompt:

prompt = ChatPromptTemplate.from_messages([
    ("system", """
        reply in JSON format with an array of objects with 2 fields name and url
        (and with no more text than the JSON output),
        with a list of pages in the website https://www.podman-desktop.io related to my query
    """),
    MessagesPlaceholder(variable_name="history"),
    ("user", "{input}")
])

Testing my own app locally

To iterate during the development of our app, we can test our app locally in our host system, while using the model served by Podman AI Lab. For this, we need to start a new model service from the page Models > Services, by clicking the New Model Service, then choosing the appropriate model (Mistral-7B-instruct in our case), and specifying a port number (let's say 56625).

a running inference server with Mistral model

Then, we can run our app, by specifying through the MODEL_ENDPOINT environment variable how to access the model service.

my app running locally

Finally, we can send a request to this app running locally, and listening in the port 5000, and we can check in the screenshot below that the response is, as expected, a list of pages (name and url) in JSON format:

a request to the micro-service

Creating a recipe

The last step is to add this application to the Podman AI Lab recipe catalog.

Podman AI Lab provides a way for a user to extend the provided catalog with its own recipes. This can be done by adding a file in a specific directory, as described in this documentation.

{
  "version": "1.0",
  "recipes": [
    {
      "id": "search-podman-desktop-io",
      "description": "Search on Podman-desktop.io website",
      "name": "Search Podman-desktop.io",
      "repository": "https://github.com/redhat-developer/podman-desktop-demo",
      "ref": "main",
      "icon": "natural-language-processing",
      "categories": ["natural-language-processing"],
      "basedir": "ai-lab-demo/recipe",
      "readme": "",
      "recommended": ["hf.TheBloke.mistral-7b-instruct-v0.2.Q4_K_M"],
      "backend": "llama-cpp"
    }
  ]
}

By creating the file $HOME/.local/share/containers/podman-desktop/extensions-storage/redhat.ai-lab/user-catalog.json with the content above, you should now be able to see a new recipe Search Podman-desktop.io in the recipe catalog of Podman AI Lab, and run it as any other recipe. And, of course, you can share this file with your colleagues to share with them your latest experiment.

Introduction to Podman Desktop extensions

October 29, 2024 · 4 min read

Charlie Drage

Software Engineer

programming

Extensions are a powerful tool to customize and extend the functionality of Podman Desktop. Whether you want to add new container management features, streamline current workflows, or create custom UI elements specific to your tech stack, building extensions allows you to tailor the Podman Desktop experience to your specific needs.

In this guide, we'll introduce how you can build your own Podman Desktop extension, with links to detailed documentation that covers each part of the process.

Introduction to extensions

Extensions are abundant in Podman Desktop and can be found in the Extensions -> Catalog section.

extension catalog

Each extension expands on Podman Desktop, such as providing Kubernetes development clusters with Minikube or even analyzing your image layers.

Below is an example of the layers explorer extension and how it integrates into Podman Desktop:

layers_explorer

Getting started with your project

The first step in creating your extension is setting up the project environment. To learn how to configure the project and add basic components, check out the Templates for creating an extension guide, which walks you through initializing your project from an official template.

Adding UI components

One of the most common tasks when creating an extension is adding a user interface. Whether it’s adding buttons, panels, or icons, UI components help make your extension more interactive and accessible. Adding a UI component is totally optional and an extension can be ran without UI components. Learn more about this in the Adding UI components documentation, where you’ll find instructions on creating and integrating components into the application’s UI.

Working with icons

Icons are a great way to make your extension more visually unique. You can learn how to add and style custom icons by following the Adding icons documentation.

Below is an example of how the bootc extension added icons to the image list within Podman Desktop:

icons

Extensions often integrate with existing menus and navigation to offer users easy access to new commands and features. If you want to add items to the context menu, explore the Menu configuration documentation, which explains how to add commands to menus and control when they are displayed using When Clauses.

Below is an example of how the bootc extension added a new menu command to image list:

menus

Adding and configuring commands

Commands are the backbone of most extensions, allowing users to interact with the application and trigger specific actions.

If you need to define and register custom commands, the Commands guide will show you how to create commands that respond to user actions or input, and tie them into your extension’s workflow.

You can also configure these commands to appear in different contexts. Check out the When clause Contexts documentation to learn more about restricting commands to specific scenarios.

Commands are heavily influenced by VS Code commands and can be configured similarly. See our commands guide for more information.

Setting up onboarding workflows

Creating a smooth onboarding experience is essential to help users get started with your extension. This includes steps for CLI binary installations or other initial setup values.

You can provide guidance, tutorials, or initial setup steps using the Onboarding workflow guide.

Below is an example of how the built-in compose extension adds onboarding for the compose CLI binary installation:

compose

Configuration settings

Once you’ve built your components and commands, you may want to setup configuration settings for advanced usage of your extension.

The Configuration documentation outlines the configuration file structure and how to link everything together to use user-specific values.

Publishing your extension

Publishing enables users to install your extension, you can compile your extension into a container image for users to easily consume. Follow the Publishing guide to learn how to distribute your extension.

Conclusion

Creating an extension opens up endless possibilities to customize Podman Desktop to your specific needs.

It is also easy to package and publish your extension for others to use.

Have fun exploring our documentation on how to create an extension and happy coding!

Introduction to Podman AI Lab​

What is OpenVINO?​

Using OpenVINO in Podman AI Lab​

Starting an OpenVINO inference server​

Starting a playground with an OpenVINO compatible model​

Consistency with OpenShift AI + OpenVINO​

Conclusion​

VS Code: Your IDE for Container Development​

Prerequisites​

VS Code Extensions​

Option 1: Microsoft’s Container Tools extension​

Configuring VS Code for Podman​

Option 2: Pod Manager​

Using the VS Code Extensions​

Working with Containerfiles and Dockerfiles​

Managing Containers​

Conclusion​

Release Details​

Examples now have detail pages​

Interactive build config creator​

Experimental Linux VM support​

Detailed release changelog​

Features 💡​

Chores 🛠️​

Fixes 🔨​

Documentation 📚​

Preparing Podman Desktop and Podman AI Lab​

Testing a prompt with a model​

Testing a recipe​

Structure of a recipe​

Creating our own app​

Testing my own app locally​

Creating a recipe​

Introduction to extensions​

Getting started with your project​

Adding UI components​

Working with icons​

Menus and navigation​

Adding and configuring commands​

Setting up onboarding workflows​

Configuration settings​

Publishing your extension​

Conclusion​

Introduction to Podman AI Lab

What is OpenVINO?

Using OpenVINO in Podman AI Lab

Starting an OpenVINO inference server

Starting a playground with an OpenVINO compatible model

Consistency with OpenShift AI + OpenVINO

Conclusion

VS Code: Your IDE for Container Development

Prerequisites

VS Code Extensions

Option 1: Microsoft’s Container Tools extension

Configuring VS Code for Podman

Option 2: Pod Manager

Using the VS Code Extensions

Working with Containerfiles and Dockerfiles

Managing Containers

Conclusion

Release Details

Examples now have detail pages

Interactive build config creator

Experimental Linux VM support

Detailed release changelog

Features 💡

Chores 🛠️

Fixes 🔨

Documentation 📚

Preparing Podman Desktop and Podman AI Lab

Testing a prompt with a model

Testing a recipe

Structure of a recipe

Creating our own app

Testing my own app locally

Creating a recipe

Introduction to extensions

Getting started with your project

Adding UI components

Working with icons

Menus and navigation

Adding and configuring commands

Setting up onboarding workflows

Configuration settings

Publishing your extension

Conclusion