What You'll Build

A computer vision-powered self-checkout. You'll create and train a custom model based on your favorite drinks and combine it with a vision service. You'll also build a simple checkout kiosk interface using Viam's TypeScript SDK. The finished machine can then be deployed to a Raspberry Pi or a spare laptop and used as a checkout!

The CV-powered beverage detector in Viam: a beverage shown to the camera, with a bounding box and confidence level for the detection.

A simple kiosk interface that implements the CV-powered beverage detector, built with Viam's TypeScript SDK.

Prerequisites

What You'll Need

What You'll Learn

Watch the video

See a demonstration of the CV Checkout (beverage detection):

First, let's set up a machine in Viam:

  1. In the Viam app under the LOCATIONS tab, create a machine by typing in a name and clicking Add machine.
  2. Click View setup instructions.
  3. To install viam-server on your device, select the operating system you are running. For example, I'll be using a MacBook Air as my device, so I'll select Mac.
  4. Follow the instructions that are shown for your platform.
  5. The setup page will indicate when the machine is successfully connected.
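
Optionally, you can sanity-check the connection from code using Viam's TypeScript SDK. Here's a minimal sketch; it assumes you've already generated an API key and API key ID for your machine (the placeholder values below are hypothetical, and you'll set up real credentials for the kiosk app later anyway):

import * as VIAM from '@viamrobotics/sdk';

const machine = await VIAM.createRobotClient({
  host: '<your-machine-address>', // e.g. my-machine-main.abc123.viam.cloud
  credentials: {
    type: 'api-key',
    payload: '<your-api-key>',       // hypothetical placeholder
    authEntity: '<your-api-key-id>', // hypothetical placeholder
  },
  signalingAddress: 'https://app.viam.com:443',
});

// A connected machine will list its configured resources here.
console.log(await machine.resourceNames());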

With a machine configured, we now need a way to capture images. Let's add a webcam next!

  1. In the Viam app, find the CONFIGURE tab.
  2. Click the + icon in the left-hand menu and select Component.
  3. Select camera, and find the webcam module. Leave the default name camera-1 for now, then click Create. This adds the module for working with a standard USB camera or other webcam that streams camera data.
  4. Notice that adding this module adds the camera hardware component called camera-1. You'll see a collapsible card on the right, where you can configure the camera component, and the corresponding camera-1 part listed in the left sidebar.
  5. To configure the camera component, the video_path of the intended device needs to be set. You can quickly find which devices are connected to your machine by adding a discovery service. Click the Add webcam discovery service prompt that appears.
  6. Notice that this adds the discovery-1 service and find-webcams module to your machine in the left sidebar. Corresponding cards to these items also appear on the right.
  7. Click Save in the top right to save and apply your configuration changes.
  8. Expand the TEST panel of the discovery-1 card. Here, you'll find attributes of all discoverable cameras connected to your machine. Find the video_path of the device you'd like to use as your webcam, then copy the value. For example, I'll use my MacBook Air's built-in FaceTime camera, so I'll copy 3642F2CD-E322-42E7-9360-19815B003AA6.
  9. Paste the copied video_path value into your camera component's video_path input, which is in the Attributes section.
  10. Click Save in the top right once more to save and apply your configuration changes.
  11. Expand your camera component's TEST panel. If everything is properly configured, you should see video streaming from your camera.
  12. With your camera added and working, you can now delete the discovery-1 service and find-webcams module, as you'll no longer need them. Click the ... next to each item, then select Delete.
  13. When prompted to confirm the deletion, select Delete.
  14. Finally, Save your configuration changes.

Great, your machine now has eyes!
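
You can also grab a frame programmatically. Here's a quick sketch using the SDK's CameraClient, reusing the connected machine client from the earlier sketch and the camera-1 name configured above:

const camera = new VIAM.CameraClient(machine, 'camera-1');

// Request a single JPEG frame; the SDK returns the raw image bytes.
const frame = await camera.getImage('image/jpeg');
console.log(`Received a frame of ${frame.length} bytes`);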

To train a custom model on your beverages, you'll need a dataset in a format the LiteRT framework (previously known as TensorFlow Lite) can use. In this step, you'll create a dataset and add images of your beverages using the webcam you configured in the last step. (Note that the rest of this codelab still refers to TensorFlow Lite in some places.)

  1. In the Viam app, find the DATASETS tab.
  2. Click the + Create dataset button and give your dataset a name, like beverages. Click the Create dataset button again to save.
  3. Switch back to your machine in the Viam app. You can navigate back by going to Fleet > All Machines Dashboard, then clicking on the name of your machine.
  4. Expand your camera component's TEST panel. Here, you'll see the live feed of your camera as well as the Add image to dataset icon, which looks like a camera.
  5. Using the live feed as a viewfinder, position your webcam so that you can place one of your beverages fully in the frame. The less visual clutter in the background, the better!
  6. When you are happy with the image, click the Add image to dataset button.
  7. In the list of datasets that appears, select the dataset you wish to add your captured image to. For example, beverages.
  8. Confirm the dataset you've selected, then click Add.
  9. A success message will appear at the top right once your image is added to the selected dataset.
  10. Repeat steps 5-8 to capture at least 10 images of each beverage you want to detect. Be sure to vary angles and positions! (If you'd rather script these captures, see the sketch after the example images below.)

Example dataset images: San Pellegrino peach, Topo Chico blueberry, and Spindrift, each captured from several angles.
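
If capturing images one by one gets tedious, you could script a burst of captures and add the saved files to your dataset afterwards. A rough sketch, assuming a Node.js environment with the machine connection established as in the earlier sketch (the file names are hypothetical):

import * as fs from 'node:fs';

const camera = new VIAM.CameraClient(machine, 'camera-1');

for (let i = 0; i < 10; i++) {
  // Grab a JPEG frame and write it to disk.
  const jpeg = await camera.getImage('image/jpeg');
  fs.writeFileSync(`beverage_${i}.jpg`, jpeg);

  // Pause a few seconds so you can vary the beverage's angle and position.
  await new Promise((resolve) => setTimeout(resolve, 3000));
}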

Phew, that was a lot, but your custom model will thank you! Let's make our images smarter by annotating them in the next step.

Having images to train a model is a good start. However, they won't be useful unless TensorFlow has a bit more information to work with. In this step, you'll draw bounding boxes around your beverages and label them accordingly.

  1. In the Viam app, find the DATASETS tab.
  2. Click on the name of your dataset, for example beverages.
  3. Here, you'll see all of the images you've captured, neatly grouped in one place.
  4. Select one image from the dataset. A side panel will appear on the right-hand side showing details about the image, such as any annotated objects, associated tags, and which datasets the image belongs to. Click the Annotate button in this panel.
  5. The selected image opens in a larger view. To detect an object within an image, it must be given a label. Create an appropriate label for the beverage you have selected, for example spindrift_pog.
  6. With the appropriate label chosen, hold the Command (macOS) or Windows key down while you use your mouse to draw a bounding box around your beverage.
  7. In the OBJECTS panel on the right, you'll see your beverage listed with an object count of 1. If you hover over this item, the spindrift_pog label appears in the image and the bounding box fills with color.
  8. Repeat this for the rest of the images that match the label. You can quickly navigate between images by pressing the > (right arrow) or < (left arrow) keys on your keyboard.
  9. Once you get to a new beverage, create another descriptive label, draw the bounding box, and repeat for the rest of the images of the same beverage. Double-check that each image has only one label and detects the correct beverage! (Multiple labels, and therefore multiple bounding boxes, are allowed and make sense for more complex detections. Since we are just trying to accurately detect one beverage at a time, one label per image is recommended for this codelab.)
  10. When you are finished annotating all of your images, exit the annotation editor by clicking the X in the top-left corner. Notice that a breakdown of your bounding box labels is calculated and displayed.
  11. Be sure that all your images are labeled, that no Unlabeled images remain, and that there are at least 10 images of each beverage you plan to detect.

Great work annotating all of that (so... many... beverages...). Your model will be the better for it. Let's finally train your custom model!

  1. In your dataset overview, click Train model, located within the left-hand panel.
  2. Select your model training options. For now, leave the default selections of New model and Built-in (TensorFlow Lite). Confirm that the correct dataset is selected. Click Next steps.
  3. Give your model a name, for example beverage-detector.
  4. Select Object detection as the Task type. Notice that the labels you've created are auto-detected from the images in your dataset and selected in the Labels* section.
  5. Click Train model. This kicks off the training job for your custom model.
  6. If you click on the ID of your training job, you can view more details on the job's overview, including any relevant logs while the job runs.
  7. Wait until your training job is complete. It may take up to 15 minutes, so feel free to open up one of your beverages! Once it is finished, you'll see the status of your job change to Completed.

Well done, you've just created your own custom model tailored to your favorite drinks! Let's add it to our machine.

  1. In the Viam app, find the CONFIGURE tab.
  2. Click the + icon in the left-hand menu and select Service.
  3. Select ML model, and find the TFLite CPU module. Click Add module. Leave the default name mlmodel-1 for now, then click Create. This adds support for running TensorFlow Lite models on resource-constrained devices.
  4. Notice that adding this module adds the ML Model service called mlmodel-1 and the tflite_cpu module from the Viam registry. You'll see configurable cards on the right and the corresponding parts listed in the left sidebar.
  5. In the Configure panel of the mlmodel-1 service, leave the default deployment selection of Deploy model on machine. In the Model section, click Select model.
  6. Find and select the custom model you've just trained, for example beverage-detector. Notice that you can select from any custom models you create (located within the My Organization tab) or from the Viam registry (located within the Registry tab). Click Select.
  7. Confirm that the correct model is selected, then click Select.
  8. Click Save in the top right to save and apply your configuration changes.
  9. Your custom model is now configured and will be used by the ML model service. Notice that a Version option is also configurable: if you train new versions of your beverage model, you can pin a specific version based on your needs.

With your model deployed, let's add a vision service that uses it for detections.
  1. In the Viam app, find the CONFIGURE tab.
  2. Click the + icon in the left-hand menu and select Service.
  3. Select vision, and find the ML Model module. Give your vision service a more descriptive name, for example beverage-vision-service. Click Create. While the camera component lets you access what your machine sees, the vision service interprets the image data.
  4. Notice that your service is now listed in the left sidebar and a corresponding configuration card is added on the right.
  5. In the Configure panel of your vision service, set the ML Model to your ML Model service, for example mlmodel-1.
  6. Move the Minimum confidence threshold slider to 0.5. This sets the vision service to only show results where its beverage detection confidence is at least 50%.
  7. Find and select your camera component in the Depends on section, for example camera-1.
  8. Click Save in the top right to save and apply your configuration changes. This might take a few moments.
  9. Expand the TEST panel of your vision service. You'll see a live feed of your configured webcam and a Labels section. Test out your CV-powered checkout! Try showing your beverages to your webcam. When one is detected, an item appears with the object the vision service thinks it is seeing and its confidence level. You can also try showing multiple beverages and see how well your model detects them!
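
You can also query the vision service from code. As a short sketch using the SDK's VisionClient, again reusing the connected machine client, with the service and camera names from this codelab:

const vision = new VIAM.VisionClient(machine, 'beverage-vision-service');

// Run detection against the configured camera's latest frame.
const detections = await vision.getDetectionsFromCamera('camera-1');

for (const d of detections) {
  console.log(`${d.className}: ${(d.confidence * 100).toFixed(1)}% confidence`);
}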

Congratulations! You've just built a working beverage detector using a custom model trained on your favorite beverages! Let's integrate our vision service into a simple kiosk web app next.

Review the web app code

We can easily integrate the output of our vision service into a web app. In this codelab, we'll walk through the source code of a kiosk built with Viam's TypeScript SDK. In it, we implemented a simple kiosk that shows our camera feed and detections (including bounding boxes and labels!), the beverage detected, and a price. This app can be deployed to any static hosting provider, run locally on your laptop, or packaged as a module to be deployed to your machine running viam-server.

src/main.ts

Let's break down the highlights of the src/main.ts file to see how this app works.

To get the necessary credentials to connect to viam-server, we grab them from a .env file.

const MACHINE_ADDRESS = import.meta.env.VITE_MACHINE_ADDRESS;
const API_KEY = import.meta.env.VITE_API_KEY;
const API_KEY_ID = import.meta.env.VITE_API_KEY_ID;
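
For reference, a matching .env file might look like the following; the values shown are placeholders, and note that Vite only exposes variables prefixed with VITE_ to client code:

VITE_MACHINE_ADDRESS=my-machine-main.abc123.viam.cloud
VITE_API_KEY=your-api-key
VITE_API_KEY_ID=your-api-key-id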

We initialize a kiosk view canvas where we'll output the vision service's image and detections.

// Module-level handles for the kiosk canvas and its 2D drawing context,
// shared with renderDetectedBeverage() below.
let kioskCanvas: HTMLCanvasElement | null = null;
let kioskCtx: CanvasRenderingContext2D | null = null;

// Sets up the kiosk view by creating a single, reusable canvas. Call on startup.
function initializeKioskView() {
  const imageContainer = document.getElementById('kioskView');
  if (imageContainer) {
    // Create the canvas element
    kioskCanvas = document.createElement('canvas');
    
    // Get the context for drawing
    kioskCtx = kioskCanvas.getContext('2d');
    
    // Append the canvas to the container. This is the LAST time
    // we will manipulate the DOM for the canvas.
    imageContainer.innerHTML = ''; // Clear any placeholders
    imageContainer.appendChild(kioskCanvas);
  } else {
    console.error("Kiosk container 'kioskView' not found!");
  }
}

For now, our data store is a simple object. It uses the labels from our custom model as the keys and returns an object of the beverage name and price we want to display as the values.

// For demo purposes only. This would be replaced with a call to your data source.
// Alternatively, if you'd also like to use as-is for a demo with your own custom model and labels,
// you can replace the products object below with your own labels and corresponding data you want returned.
function translateBeverageInfo(label: string) {
  const products = {
    "spindrift_pog": {item: "Spindrift Pog Sparkling Water", price: "1.50"},
    "bottle_coke": {item: "Bottle Coke", price: "2.50"},
    "topo_chico_blueberry":  {item: "Topo Chico Blueberry Sparking Water", price: "1.50"},
    "bottle_fanta_orange": {item: "Bottle Fanta Orange", price: "2.50"},
    "bottle_topo_chico": {item: "Bottle Topo Chico", price: "2.75"},
  };

  return products[label as keyof typeof products];
}

Here is the main function that renders the vision service data into the kiosk format we've structured. It first checks for the initialized canvas element, then clears any existing drawing before rendering the new frame. The new drawing renders the image from the vision service and draws a bounding box for each detection within the image. Finally, the name and price of the detected beverage are rendered.

async function renderDetectedBeverage(visionServiceData: any) {
  // Use the globally available context and canvas.
  // If they don't exist, we can't draw, so we exit early.
  if (!kioskCanvas || !kioskCtx) {
    console.error("Kiosk canvas is not initialized.");
    return;
  }
  
  // Reference kioskCtx for all drawing operations.
  const ctx = kioskCtx;
  
  if (visionServiceData.image) {
    // Create image element to load the base64 data
    const img = new Image();
    const base64string = await convertToBase64String(visionServiceData.image.image);
    
    img.onload = () => {
      // Set canvas dimensions to match new image from stream
      if (kioskCanvas) {
        kioskCanvas.width = img.width;
        kioskCanvas.height = img.height;

        // Clear old drawing
        ctx.clearRect(0, 0, kioskCanvas.width, kioskCanvas.height);
      }

      // Draw the original image
      ctx.drawImage(img, 0, 0);
      
      // Draw bounding boxes and labels for each detection
      visionServiceData.detections.forEach((detection) => {
        // Convert coordinates to numbers explicitly
        const xMin = Number(detection.xMin || 0);
        const yMin = Number(detection.yMin || 0);
        const xMax = Number(detection.xMax || 0);
        const yMax = Number(detection.yMax || 0);
        const width = xMax - xMin;
        const height = yMax - yMin;
        
        // Draw bounding box
        ctx.strokeStyle = '#0000EA'; // Viam blue accent
        ctx.lineWidth = 2;
        ctx.strokeRect(xMin, yMin, width, height);
        
        // Draw label background
        const label = `${detection.className} (${(detection.confidence * 100).toFixed(1)}%)`;
        ctx.font = '16px Arial';
        const textMetrics = ctx.measureText(label);
        const textHeight = 20;
        
        ctx.fillStyle = 'rgb(0, 0, 234)';
        ctx.fillRect(xMin, yMin - textHeight, textMetrics.width + 8, textHeight);
        
        // Draw label text
        ctx.fillStyle = '#000000';
        ctx.fillText(label, xMin + 4, yMin - 4);

        // Render item name and price
        const beverage = translateBeverageInfo(detection.className);
        const beverageName = document.getElementById('beverageLabel');
        
        if (beverageName) {
          beverageName.innerText = beverage?.item || 'Unknown';
        }

        const beveragePrice = document.getElementById('priceLabel');
        if (beveragePrice) {
          beveragePrice.innerText = beverage?.price || 'Unknown';
        }
      });
    };

    img.src = `data:image/jpeg;base64,${base64string}`;
  }
}
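
Two helpers used above, getEverythingFromVisionService() and convertToBase64String(), aren't shown in this excerpt. As a sketch of how they might look, assuming the former simply wraps the SDK's captureAllFromCamera() call with the camera name from this codelab:

// Fetch the latest frame plus detections from the vision service in one call.
async function getEverythingFromVisionService(vision: VIAM.VisionClient) {
  return vision.captureAllFromCamera('camera-1', {
    returnImage: true,
    returnClassifications: false,
    returnDetections: true,
    returnObjectPointClouds: false,
  });
}

// Convert raw JPEG bytes into a base64 string usable in a data: URL.
function convertToBase64String(bytes: Uint8Array): Promise<string> {
  const blob = new Blob([bytes], { type: 'image/jpeg' });
  return new Promise((resolve) => {
    const reader = new FileReader();
    // reader.result looks like "data:image/jpeg;base64,<data>"; keep the data part.
    reader.onloadend = () => resolve((reader.result as string).split(',')[1]);
    reader.readAsDataURL(blob);
  });
}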

Finally, we have our main function. Here, we instantiate our connection to our machine in Viam, set up our canvas and vision service dependency, and start the loop that continuously polls for data from our vision service. This polling allows us to render a near real-time feed from our camera. If there is something detected, we call our renderDetectedBeverage() method.

const main = async () => {
  const host = MACHINE_ADDRESS;

  const machine = await VIAM.createRobotClient({
    host,
    credentials: {
      type: "api-key",
      payload: API_KEY,
      authEntity: API_KEY_ID,
    },
    signalingAddress: "https://app.viam.com:443",
  });
  
  initializeKioskView();
  
  const vision = new VIAM.VisionClient(machine, 'beverage-vision-service');

  // To get a "stream" of sorts, need to continuously poll for frames from vision service.
  while (true) {
    try {
      const visionServiceData = await getEverythingFromVisionService(vision);
      console.log(visionServiceData);

      await renderDetectedBeverage(visionServiceData);

    } catch (error) {
      console.error('Vision stream error:', error);
    }

    // Artificial delay, used to still get somewhat "real-time" feed of vision service
    // while balancing number of calls being made to grab new vision service data.
    await new Promise<void>((resolve) => setTimeout(() => resolve(), 200));
  }
};

main();

index.html

The structure of our kiosk app is straightforward and is contained within the sole index.html file. A div holds our canvas and shows the "stream" from our vision service, and a few labels render the name and price of the detected beverage. And of course, a nice "Powered by Viam" logo 😉

<!doctype html>
<html>
  <head>
    <title>CV Checkout</title>
    <link rel="icon" href="favicon.ico" />
    <link rel="preconnect" href="https://fonts.googleapis.com">
    <link rel="preconnect" href="https://fonts.gstatic.com" crossorigin>
    <link href="https://fonts.googleapis.com/css2?family=Public+Sans:ital,wght@0,100..900;1,100..900&family=Space+Mono:ital,wght@0,400;0,700;1,400;1,700&display=swap" rel="stylesheet">
    <link rel="stylesheet" href="styles.css" />
  </head>
  <body>
    <h1 class="centered">CV Checkout Kiosk Demo</h1>
    <div id="main" class="flex-row-center">
      <div id="kioskView"></div>
      <div id="itemInfo">
        <p class="beverage-label">Beverage Detected:</p>
        <p id="beverageLabel"></p>
        <p class="price-label">Price: $<span id="priceLabel"></span></p>
      </div>
    </div>
    <div class="viam-logo" alt="Viam logo"></div>
    <script type="module" src="src/main.ts"></script>
  </body>
</html>

Once you have built your interface and configured your Viam machine to use it, you'll have a fully CV-powered checkout kiosk! If you followed along with our demo app, the kiosk will look like this:

GIF preview of CV checkout kiosk in action

Congratulations! You've just built a computer vision-powered checkout! 🥳 Using your own images of your favorite drinks and the built-in TensorFlow Lite framework, you've created a custom model that detects the beverages that make you smile and can be deployed anywhere. Through Viam's modular platform, you combined your custom model with a vision service to enable a CV-powered checkout. Lastly, you used Viam's TypeScript SDK to create a kiosk interface that displays your vision service's detections along with a clear beverage name and price label. Do let me know if you've built this!

What You Learned

  • How to configure a camera in the Viam platform
  • How to capture images and create your own training dataset
  • How to build and train a custom model using TFLite
  • How to implement your custom model in your machine
  • How to create and connect a simple interface to your Viam machine using Viam's TypeScript SDK

Real-world applications for CV-powered checkout

This project is a great way to learn about combining different components to produce something useful, and it has practical applications as well:

  • Handling items without barcodes or damaged barcodes
  • Quickly identifying and pricing items that have several varieties (for example, detecting a Granny Smith apple versus a Honeycrisp apple more quickly than finding the PLU, or Price Look-Up, code)
  • Scanning oddly-shaped items, where orienting a barcode is difficult or awkward
  • Real-time inventory and stock monitoring, with items detected at checkout updating stock levels to the second
  • Specialized retail environments where setting up full point-of-sale systems is prohibitive or cumbersome

Specifically for beverages, some real-world use cases can include:

  • Automated vending machines with image confirmation of the correct drink being dispensed, resulting in fewer errors or customer disputes
  • Drive-thru visual confirmation, where prepared orders can be visually inspected and confirmed for the correct drinks
  • Retrofitting your refrigerator to manage home drink inventory or to automate billing of drinks in shared living spaces

Extend your CV-checkout with Viam

Right now, you can detect your favorite drinks using your custom model. But there are other things you can do! As an example, you could:

  • Integrate with an existing Point of Sale system to enable CV-powered self-checkout.
  • Incorporate automatic pricing and calculations based on detected items.
  • Add a piezo buzzer to play a tone when a drink is detected.
  • Send a text notification when your drink has been removed from the fridge!

Related Resources