Easy automation for busy people. UI.Vision Kantu automates web and desktop apps on Windows, Mac and Linux. Kantu “lives” in the web browser as a browser extension, but it can automate desktop applications as well. The Kantu core is open-source and guarantees Enterprise-Grade Security. Your data never leaves your machine.

UI Vision Kantu for Chrome (link goes to Chrome Web Store)
UI Vision Kantu for Firefox (link goes to Firefox Add-Ons)

(1) Visual Web Automation and UI Testing

Kantu's computer-vision UI testing commands let you write automated visual tests. This makes Kantu the first and only Chrome and Firefox extension (and Selenium IDE) that has "👁👁 eyes". A huge benefit of visual tests is that you are not checking just one or two elements at a time; you’re checking a whole section or page in one visual assertion.
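To illustrate the idea of a single visual assertion over a whole region, here is a minimal Python sketch. It is not Kantu's implementation: the naive pixel-difference comparison, the function names and the 2% tolerance are all assumptions chosen for illustration.

```python
from typing import List

Image = List[List[int]]  # grayscale pixels (0-255), stored as rows of values


def diff_ratio(baseline: Image, current: Image, pixel_tolerance: int = 10) -> float:
    """Return the fraction of pixels that differ by more than pixel_tolerance."""
    if len(baseline) != len(current) or any(
        len(a) != len(b) for a, b in zip(baseline, current)
    ):
        raise ValueError("images must have the same dimensions")
    total = sum(len(row) for row in baseline)
    if total == 0:
        return 0.0
    changed = sum(
        1
        for row_a, row_b in zip(baseline, current)
        for a, b in zip(row_a, row_b)
        if abs(a - b) > pixel_tolerance
    )
    return changed / total


def visually_equal(baseline: Image, current: Image, max_diff: float = 0.02) -> bool:
    """One check covering the whole region instead of element-by-element checks."""
    return diff_ratio(baseline, current) <= max_diff
```

A test would capture a baseline screenshot once and then call `visually_equal(baseline, current)` on every run, so any layout change inside the region is caught without naming individual elements.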

The visual UI testing and browser automation commands of Kantu help web designers and developers to verify and validate the layout of websites and canvas elements. Kantu can read and recognize images and text inside canvas elements, images and videos.

Kantu can resize the browser window to emulate various resolutions. This is particularly useful for testing layouts at different browser resolutions and for visually validating mobile, web, and native apps.

(2) Visual Desktop Automation for Windows, Mac and Linux

Kantu can see and automate more than what is inside the web browser. It uses the same image and text recognition technology to automate your desktop as well (Robotic Process Automation, RPA). Kantu’s eyes can read images and words on your desktop and Kantu’s hands can click, move, drag & drop and type.

The desktop automation feature requires the installation of the free Kantu Extension Modules (XModules). This is separate software available for Windows, Mac and Linux. It adds the “eyes” and “hands” to Kantu.

(3) Selenium IDE++ for hybrid web automation

Kantu includes standard Selenium IDE commands for general web automation, web testing, form filling & web scraping. But Kantu has a different design philosophy than the classic Selenium IDE. Like the classic Selenium IDE, it is a record & replay tool for automated testing, but even more it is a "swiss army knife" for general web automation such as screen scraping, automating file uploads and form autofill. So it has many features that the classic IDE does not (want to) have. For example, you can run your macros directly from the browser as bookmarks or even embed them on your website. If there’s an activity you have to do repeatedly, just create a web macro for it. The next time you need to do it, the entire macro will run at the click of a button and do the work for you.

This short screencast demos how to automate form filling on our online OCR website with Kantu. We record the macro, manually insert a PAUSE (3 seconds) command and then replay the macro twice.

Kantu is an open-source alternative to iMacros and Selenium IDE, and supports all important Selenium IDE commands. When you invest the time to learn Kantu, you learn Selenium IDE at the same time.

In addition, Kantu includes new web automation commands that are not found in the classic Selenium IDE, such as the ability to write and read CSV files (data-driven testing), visual checks, file download automation, PDF testing and the ability to take full page and desktop screenshots.
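As a rough illustration of the data-driven side, the Python sketch below prepares a CSV file with one row of input values per test run and reads it back. The function names, file name and column layout are hypothetical; how a macro actually reads the file is described in the user manual, not here.

```python
import csv
from pathlib import Path
from typing import Iterable, List, Sequence


def write_test_data(path: Path, rows: Iterable[Sequence[str]]) -> int:
    """Write one CSV line per test case and return the number of lines written."""
    count = 0
    with path.open("w", newline="", encoding="utf-8") as handle:
        writer = csv.writer(handle)
        for row in rows:
            writer.writerow(row)
            count += 1
    return count


def read_test_data(path: Path) -> List[List[str]]:
    """Read the CSV back, one list of column values per test case."""
    with path.open(newline="", encoding="utf-8") as handle:
        return [row for row in csv.reader(handle)]
```

Each row then drives one replay of the same macro, for example one form submission per line of names and email addresses.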

API: Integrate with your favorite tools via command line

One of the best Kantu features is its extensive command line API. This allows Kantu to integrate with any application (e.g. Jenkins or other CI/CD tools) and any programming or scripting language (e.g. Python or PowerShell). The API includes detailed error reporting for reliable non-stop operation.
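A call from Python could look roughly like the sketch below. It assumes that a macro is started by opening a local start page in the browser with URL parameters. The parameter names (`macro`, `direct`, `savelog`), the function names, the paths and the browser executable are assumptions for illustration and should be checked against the command line API documentation.

```python
import subprocess
from pathlib import PurePath
from urllib.parse import urlencode


def build_launch_url(start_page: PurePath, macro: str, log_file: str) -> str:
    """Build a file:// URL for the start page (must be an absolute path)."""
    # Parameter names are assumptions, not confirmed by this page.
    query = urlencode({"macro": macro, "direct": 1, "savelog": log_file})
    return f"{start_page.as_uri()}?{query}"


def run_macro(browser: str, url: str, timeout_s: int = 120) -> int:
    """Start the browser with the launch URL and return the browser's exit code.

    Raises subprocess.TimeoutExpired if the browser does not exit in time.
    """
    completed = subprocess.run([browser, url], timeout=timeout_s, check=False)
    return completed.returncode
```

The sketch only launches the browser. The browser's exit code says nothing about whether the macro succeeded, so a CI job would additionally inspect the log file written by the run.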


Kantu has Enterprise-Grade Security. Your data never leaves your machine.

With its strict open-source security approach, Kantu is more secure than any other Robotic Process Automation (RPA) solution on the market. Kantu and its XModules are designed to meet the highest security and data protection standards for Enterprise use. All processing is done locally on your machine. The XModules only communicate with the open-source Kantu browser extension.

Kantu does not send any data back to us or to any other place. You can easily verify this statement because all internet communication, such as loading websites in your browser, is done inside the open-source Kantu core. The fact that Kantu is open-source under an official open-source license guarantees you the freedom to run, study, share and modify the software.

The only exception to the "all data is processed locally" rule is the OCR screen scraping feature, which is why it is disabled by default. Only when you explicitly enable it on the OCR tab and then use an OCR command does it send images of text to the OCR.space cloud service for text recognition. A 100% local OCR option is available as part of our Kantu Enterprise plans.

User Quotes.

I'm very impressed - in a few hours I was able to fully replicate a web app test setup that had taken a week or so to build in Visual Studio Gui testing.
Christian Berndt, Deloitte, France - More user quotes

We selected your automation testing framework for its focus on simplicity and easy maintenance. Your software is perfect for what we need.
Darren Myatt, Sony Europe, UK

We use UI.Vision to automate the process of extracting data files from our financial system. This used to be a manual process which now takes place automatically over night. UI.Vision made the automation of the procedure very easy.
Ian Brown, UK

More information

For more information, please read the UI Vision Kantu user manual and meet us in the user forum.



Kantu in Key Points

  • Screenshot-Driven Automation
  • Works on the most complex websites
  • Visual Record & Replay
  • Automate File Downloads/Uploads
  • Data-driven testing with CSV file import

Kantu Resources

Subscribe to the a9t9 automation software newsletter. We'll send you updates on new releases that we're working on.