Kantu UI Vision
Get Kantu UI Vision. Open-Source Web Automation, Desktop Automation (RPA) and Selenium IDE++
Easy automation for busy people. Kantu UI Vision automates web and desktop apps on Windows, Mac and Linux. The Kantu core is open-source and “lives” in the web browser as a browser extension, but it can automate desktop applications as well.
Kantu UI Vision is a universal AI-powered task automation tool that combines state-of-the-art image and text recognition with the best of classical web automation. It offers Visual Web Automation, Visual Desktop Automation, Selenium IDE++ and an API to connect with other tools and scripts.
(1) Visual Web Automation and UI Testing
Kantu's computer-vision commands let you write automated visual UI tests. This makes Kantu the first and only Chrome and Firefox extension (and Selenium IDE) that has "👁👁 eyes". A big benefit of visual tests is that you are not checking just one or two elements at a time; you are checking a whole section or page in one visual assertion.
The visual UI testing and browser automation commands of Kantu help web designers and developers to verify and validate the layout of websites and canvas elements. Kantu can read and recognize images and text inside canvas elements, images and videos.
Kantu can resize the browser window to emulate various resolutions. This is particularly useful for testing layouts at different browser resolutions and for visually validating mobile, web, and native apps.
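To illustrate the idea of checking a whole section in one visual assertion, here is a minimal Python sketch of a naive pixel comparison between a reference screenshot and a current one. It is not Kantu's recognition algorithm; the function names, the pixel representation (nested lists of RGB tuples) and the thresholds are assumptions chosen for the example.

```python
from typing import List, Tuple

Pixel = Tuple[int, int, int]   # one RGB pixel
Image = List[List[Pixel]]      # rows of pixels (illustrative representation)


def mismatch_ratio(reference: Image, actual: Image, tolerance: int = 0) -> float:
    """Return the fraction of pixels that differ between two same-sized images."""
    if len(reference) != len(actual) or any(
        len(ref_row) != len(act_row) for ref_row, act_row in zip(reference, actual)
    ):
        raise ValueError("images must have identical dimensions")

    total = sum(len(row) for row in reference)
    if total == 0:
        return 0.0

    differing = 0
    for ref_row, act_row in zip(reference, actual):
        for ref_px, act_px in zip(ref_row, act_row):
            # A pixel counts as different if any channel deviates beyond the tolerance.
            if any(abs(r - a) > tolerance for r, a in zip(ref_px, act_px)):
                differing += 1
    return differing / total


def visually_matches(
    reference: Image, actual: Image, max_mismatch: float = 0.01, tolerance: int = 8
) -> bool:
    """One assertion for the whole region: pass if few enough pixels differ."""
    return mismatch_ratio(reference, actual, tolerance) <= max_mismatch
```

A single call to `visually_matches` covers every pixel in the region, so a shifted button, a missing image or a wrong color is caught without writing a separate check per element.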
(2) Visual Desktop Automation for Windows, Mac and Linux
Kantu can see and automate more than what is inside the web browser. It uses the same image and text recognition technology to automate your desktop as well (Robotic Process Automation, RPA). Kantu’s eyes can read images and words on your desktop and Kantu’s hands can click, move, drag & drop and type.
The desktop automation feature requires the installation of the free Kantu Extension Modules (XModules). This is separate software available for Windows, Mac and Linux. It adds the “eyes” and “hands” to Kantu.
(3) Selenium IDE++ for hybrid web automation
Kantu includes standard Selenium IDE commands for general web automation, web testing, form filling & web scraping. But Kantu has a different design philosophy than the classic Selenium IDE. It is a record & replay tool for automated testing just like the classic Selenium IDE, but even more it is a "Swiss army knife" for general web automation such as screen scraping, automating file uploads and filling out forms. So it has many features that the classic IDE does not (want to) have. For example, you can run your macros directly from the browser as bookmarks or even embed them on your website. If there’s an activity you have to do repeatedly, just create a web macro for it. The next time you need to do it, the entire macro will run at the click of a button and do the work for you.
This short screencast shows how to automate form filling on our online OCR website with Kantu. We record the macro, manually insert a PAUSE (3 seconds) command and then replay the macro twice.
Kantu is an open-source alternative to iMacros and Selenium IDE, and supports all important Selenium IDE commands. When you invest the time to learn Kantu, you learn Selenium IDE at the same time.
In addition, Kantu includes new web automation commands that are not found in the classic Selenium IDE, such as the ability to write and read CSV files (data-driven testing), visual checks, file download automation, PDF testing and the ability to take full page and desktop screenshots.
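For data-driven testing, the input data usually comes from another tool or script. The following minimal Python sketch prepares such a CSV file and reads it back, one row per test run. The column layout (email, name), the helper names and the absence of a header row are assumptions for illustration; check the Kantu documentation for the format its CSV commands expect.

```python
import csv
import io
from typing import Iterable, List, Sequence


def rows_to_csv(rows: Iterable[Sequence[str]]) -> str:
    """Serialize test data rows to CSV text (one row per test run)."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerows(rows)
    return buffer.getvalue()


def csv_to_rows(text: str) -> List[List[str]]:
    """Parse CSV text back into rows, e.g. to verify what a macro wrote."""
    return list(csv.reader(io.StringIO(text)))


if __name__ == "__main__":
    # Hypothetical test data: each row fills the form once.
    test_data = [
        ["alice@example.com", "Alice"],
        ["bob@example.com", "Bob"],
    ]
    csv_text = rows_to_csv(test_data)
    for email, name in csv_to_rows(csv_text):
        print(f"run form-fill macro with email={email}, name={name}")
```

Using the `csv` module rather than joining strings by hand keeps values that contain commas or quotes intact.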
API: Integrate with your favorite tools via command line
One of the best Kantu features is its extensive command line API. This allows Kantu to integrate with any application (e.g. Jenkins and other CI/CD tools) and any programming or scripting language (e.g. Python or PowerShell). The API includes detailed error reporting for reliable non-stop operation.
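As a rough sketch of such an integration, the Python code below builds a launch URL, starts the browser with it and waits for it to exit. The launcher page, the query parameter names (`macro`, `direct`, `savelog`), the browser executable and the helper names are all assumptions for illustration, not a confirmed part of the Kantu API; consult the command line documentation for the actual parameters.

```python
import subprocess
from typing import List
from urllib.parse import urlencode


def build_launch_url(launcher_url: str, macro: str, log_file: str) -> str:
    """Build the URL that asks the extension to run a macro (parameter names assumed)."""
    query = urlencode({"macro": macro, "direct": 1, "savelog": log_file})
    return f"{launcher_url}?{query}"


def build_command(browser_executable: str, url: str) -> List[str]:
    """Return the argument list used to start the browser with the launch URL."""
    return [browser_executable, url]


def run_macro(browser_executable: str, url: str, timeout_seconds: int = 300) -> int:
    """Start the browser and wait for it to exit; returns the process exit code.

    Raises subprocess.TimeoutExpired if the browser does not exit in time.
    """
    completed = subprocess.run(
        build_command(browser_executable, url), timeout=timeout_seconds
    )
    return completed.returncode


if __name__ == "__main__":
    # Hypothetical paths and macro name; adjust to your installation.
    url = build_launch_url("file:///home/user/kantu.html", "FillForm", "log.txt")
    print(build_command("google-chrome", url))
```

A CI job could call `run_macro` and then inspect the log file the macro run leaves behind to decide whether the build step passed.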
Kantu has Enterprise-Grade Security. Your data never leaves your machine.
With its strict open-source security approach, Kantu is more secure than any other Robotic Process Automation (RPA) tool on the market: Kantu and its XModules are designed to meet the highest security and data protection standards for Enterprise use. All processing is done locally on your machine. The XModules only communicate with the open-source Kantu browser extension.
Kantu does not send any data back to us or any other place. You can easily verify this statement because all internet communication - like loading websites in your browser - is done inside the open-source Kantu core. The fact that Kantu is open-source under an official Open-Source license guarantees you the freedom to run, study, share and modify the software.
The only exception to the "all data is processed locally" rule is the OCR screen scraping feature, which is why it is disabled by default. Only when you explicitly enable it on the OCR tab and use an OCR command does it send images of text to the OCR.space cloud service for text recognition. A 100% local OCR option is available as part of our Kantu Enterprise plans.