Introduction to Sikuli GUI Automation Tool (Automate Anything You See on Screen) – Sikuli Tutorial #1

As always we try to bring in new things to learn for our readers. Today let’s explore an interesting GUI automation tool – Sikuli.

“Automate anything you see” with the Sikuli Graphical User Interface (GUI) automation tool. This is a complete beginner’s guide to quickly setting up and using the Sikuli Script tool, the first in a series of in-depth Sikuli tutorials.

Sikuli automates anything you see on the screen, using image recognition to identify GUI elements. A Sikuli script drives GUI interactions from screenshots of those elements.

List Of Tutorials In This Sikuli Series

We have divided this series into 3 parts:

Tutorial #1: How It Works, How To Create A Simple Sikuli Project.
Tutorial #2: How Sikuli Can Be Used With Selenium Web Driver To Automate Webpages.
Tutorial #3: Automating Flash Based Applications Using Sikuli Tool

Sikuli GUI Automation Tool

Let’s start with the 1st part of this series.

Sikuli is a tool that automates Graphical User Interfaces (GUIs) using the “Visual Image Match” method. Every GUI element we want to interact with must be captured as an image and stored inside the project. Each Sikuli method takes such an image as a parameter, looks for a visual match on the screen, and triggers the GUI interaction on the matched area.
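To get a feel for what a visual match involves, here is a minimal JDK-only sketch that searches a larger image for an exact, pixel-for-pixel copy of a smaller one. It is only an illustration under simplifying assumptions: the class VisualMatchSketch and its find method are hypothetical names made up for this example, they are not part of Sikuli, and Sikuli’s own matcher is more tolerant than this exact comparison.

```java
import java.awt.Point;
import java.awt.image.BufferedImage;

// Hypothetical illustration only; this is NOT Sikuli's matching code.
final class VisualMatchSketch {

    // Returns the top-left corner of the first exact occurrence of pattern
    // inside screen (scanning row by row), or null when there is none.
    static Point find(BufferedImage screen, BufferedImage pattern) {
        int maxX = screen.getWidth() - pattern.getWidth();
        int maxY = screen.getHeight() - pattern.getHeight();
        for (int y = 0; y <= maxY; y++) {
            for (int x = 0; x <= maxX; x++) {
                if (matchesAt(screen, pattern, x, y)) {
                    return new Point(x, y);
                }
            }
        }
        return null;
    }

    // True when every pixel of pattern equals the screen pixel at the given offset.
    private static boolean matchesAt(BufferedImage screen, BufferedImage pattern, int offsetX, int offsetY) {
        for (int y = 0; y < pattern.getHeight(); y++) {
            for (int x = 0; x < pattern.getWidth(); x++) {
                if (screen.getRGB(offsetX + x, offsetY + y) != pattern.getRGB(x, y)) {
                    return false;
                }
            }
        }
        return true;
    }

    public static void main(String[] args) {
        // A black 20x20 "screen" with a red 3x3 "button" painted at (5, 7).
        BufferedImage screen = new BufferedImage(20, 20, BufferedImage.TYPE_INT_RGB);
        BufferedImage button = new BufferedImage(3, 3, BufferedImage.TYPE_INT_RGB);
        for (int y = 0; y < 3; y++) {
            for (int x = 0; x < 3; x++) {
                button.setRGB(x, y, 0xFF0000);
                screen.setRGB(5 + x, 7 + y, 0xFF0000);
            }
        }
        Point hit = find(screen, button);
        System.out.println("Match at " + hit.x + "," + hit.y); // prints "Match at 5,7"
    }
}
```

An exact comparison like this breaks as soon as a single pixel differs, which is one reason a real tool needs a more forgiving match.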

Sikuli is particularly useful for automating Flash objects, which do not have an ID or name. It works best when the GUI is stable, i.e. when the appearance of the GUI components does not change.

Windows-based applications can also be automated using Sikuli. Sikuli provides Sikuli-script.jar, which can easily be used together with Selenium WebDriver. We can even automate Adobe video/audio players and Flash games on a website using Sikuli. Its simple API makes coding easier.

Practical Uses

  • Sikuli can be used to automate Flash objects and Flash websites.
  • It can automate Windows-based applications: whatever we can see on the screen, we can automate.
  • It provides a simple API, i.e. all methods are accessed through a Screen class object.
  • It can easily be integrated with Selenium and other tools.
  • Using Sikuli, we can automate desktop applications.
  • Many automation testing tools (e.g. Selenium) do not support Flash-object automation. Sikuli provides extensive support for it.
  • It uses the “Visual Match” mechanism to automate desktop and Flash objects.


  • It is an open-source tool.
  • One of the biggest advantages of Sikuli is that it can easily automate Flash objects.
  • It makes it easy to automate Windows applications.
  • When you’re testing an application under development and you don’t know the ID/name of the elements, you can use Sikuli. It checks the appearance of the image and, if a match is found, interacts with the matched element accordingly.


Before getting started, we need to download and install the following software:

  • Any screenshot capturing tool (For Example, DuckCapture, or qSnap)
  • JDK
  • Eclipse (detailed steps here to install JDK and Eclipse)

Steps To Create The Sikuli Java Project

Step #1: Sikuli Download – Download Sikuli from here.

Step #2: Extract the zip file which you’ve downloaded. It will contain the Sikuli-script.jar file. Save this extracted file in your local file system.

Step #3: Open Eclipse.

Step #4: Create a Java project: File -> New -> Java Project

Step #5:

  1. Right Click on the project
  2. Go to Build Path-> Configure Build Path
  3. Switch to Libraries tab
  4. Click the “Add External JARs” button and add Sikuli-script.jar to the build path.
  5. Click “OK”


Sikuli-script.jar will be added to your project build path. You’re done. Now you can start writing Sikuli scripts inside this project.

Some Sikuli Methods

#1) Creating Object for Screen Class

Screen is a base class provided by Sikuli. We need to create an object of the Screen class first; only then can we access the methods provided by Sikuli.

Screen s = new Screen();

#2) Click On An Element

This method is used to click on a specific image present on the screen.

Syntax: s.click("<<image name>>");

For Example, s.click("test.png");

#3) Right Click On An Element

This method is used to right-click on a specific image present on the screen.

s.rightClick("<<image name>>");

For Example, s.rightClick("test.png");

#4) Find An Element

This method is used to find a specific element present on the screen. It throws a FindFailed exception when the image cannot be found.

s.find("<<image name>>");

For Example, s.find("test.png");

#5) Double Click On An Element

This method is used to trigger a double-click event on a specific image present on the screen.

s.doubleClick("<<image name>>");

For Example, s.doubleClick("test.png");

#6) Check Whether An Element Is Present On The Screen

This method is used to check whether the specified element is present on the screen. It returns null when the image is not found.

s.exists("<<image name>>");

For Example, s.exists("test.png");

#7) Type A String In A Text Box

This method is used to enter the specified text in the text box.

s.type("<<image name>>", "String to be typed");

For Example, s.type("test.png", "Hello");

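#8) Wheeling On A Particular Image

This method is used to scroll the mouse wheel over the element image. The direction argument selects up or down, and the steps argument sets how many wheel steps to scroll.

s.wheel("<<image name>>", <<int direction>>, <<int steps>>);

For Example, s.wheel("test.png", Button.WHEEL_DOWN, 3); (Button here is org.sikuli.script.Button.)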

#9) Drag And Drop An Image/Element

This method is used to drag the specified image from the source position and drop it at the target position.

s.dragDrop("<<source image name>>", "<<target image name>>");

For Example, s.dragDrop("source.png", "target.png");

#10) Hover Over A Particular Image

This method is used to move the mouse pointer over the specified image (a hover event).

s.hover("<<image name>>");

For Example, s.hover("test.png");

#11) Paste A String

This method is used to paste text into the specified text box.

s.paste("<<image name>>", "<<text to paste>>");

For Example, s.paste("test.png", "test");

Sikuli Examples

#1) YouTube Video – Pause And Play A Video

Step #1) Open a YouTube video link and capture images of the play and pause buttons using the screen capture tool.

Pause button (Note: filename is pause.png)


Play button (Note: filename is play.png)


Copy these images inside the project.
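Because the script refers to pause.png and play.png by file name, it can help to confirm that the files are where the program expects them before running it. Below is a minimal JDK-only sketch of such a check. The class ImageCheckSketch, the method missingImages, and the choice of the current working directory as the image folder are all assumptions made for illustration; none of them comes from Sikuli.

```java
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper for illustration; not part of Sikuli.
final class ImageCheckSketch {

    // Returns those names that are NOT regular files inside imageDir,
    // in the order they were passed in.
    static List<String> missingImages(Path imageDir, String... imageNames) {
        List<String> missing = new ArrayList<>();
        for (String name : imageNames) {
            if (!Files.isRegularFile(imageDir.resolve(name))) {
                missing.add(name);
            }
        }
        return missing;
    }

    public static void main(String[] args) {
        // Assumption: the images sit in the directory the program is started from.
        Path imageDir = Paths.get("").toAbsolutePath();
        List<String> missing = missingImages(imageDir, "pause.png", "play.png");
        if (missing.isEmpty()) {
            System.out.println("All images found in " + imageDir);
        } else {
            System.out.println("Missing images in " + imageDir + ": " + missing);
        }
    }
}
```

Running a check like this first gives a clear message about a misplaced file instead of a failure in the middle of the script.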

Step #2) Create a package inside the Sikuli Java project and, within it, create a class named “Youtube”.

Step #3) Type the following code inside that class.

package com.test;

import org.sikuli.script.FindFailed;
import org.sikuli.script.Screen;

public class Youtube {

    public static void main(String[] args) throws FindFailed {
        Screen s = new Screen();

        s.find("pause.png");  // identify pause button
        s.click("pause.png"); // click pause button
        System.out.println("pause button clicked");

        s.find("play.png");   // identify play button
        s.click("play.png");  // click play button
    }
}

Step #4) Right-click on the class and select Run As -> Java Application. Keep the video visible on the screen while the script runs.

#2) Open Notepad And Type Some Text

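Step #1) Capture the Notepad icon on the desktop (notepad_icon.png) and an image of the open Notepad window (notepad.png). These are the file names used in the code in Step #3.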

Step #2) Copy these images inside your project.

Step #3) Create a class named “NotepadExample” inside your project and type the following code.

package com.test;

import org.sikuli.script.FindFailed;
import org.sikuli.script.Screen;

public class NotepadExample {

    public static void main(String[] args) throws FindFailed {
        Screen s = new Screen();
        s.click("notepad_icon.png"); // click the Notepad icon
        s.type("notepad.png", "This is Nice Sikuli Tutorial!!!!"); // type into Notepad
    }
}


Step #4) Open the screen to be tested before executing the code.
Then execute the class: right-click on it and select Run As -> Java Application.

#3) Drag And Drop

Step #1) Take screenshots of the required items on the screen.

[Note: here, the downloads icon is “source.png” and the flower image is “target.png”]

Step #2) Put these pictures inside your Sikuli project.

Step #3) Create a class with the name “DragAndDrop” and write the following code.

package com.test;

import org.sikuli.script.FindFailed;
import org.sikuli.script.Screen;

public class DragAndDrop {

    public static void main(String[] args) throws FindFailed {
        Screen s = new Screen();

        s.find("source.png"); // throws FindFailed if the source is not on screen
        System.out.println("Source image found");

        s.find("target.png"); // throws FindFailed if the target is not on screen
        System.out.println("target image found");

        s.dragDrop("source.png", "target.png");
    }
}


Step #4) Execute this script: right-click on the class and select Run As -> Java Application.
After the script has run, the downloads icon will have been dragged and dropped onto the target image.


Drawbacks Of This Tool

  • Image matching is not guaranteed to be accurate. If two or more similar images are present on the screen, Sikuli may select the wrong one.
  • If the on-screen appearance differs from the stored image, for example in pixel size, the lookup fails with a “FindFailed” exception.
  • Taking and maintaining many screenshots is an overhead.
  • If any one of the screenshots is missing, the program cannot run correctly.
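The first drawback can be made concrete with a small JDK-only sketch that counts how many places on a screen image look exactly like a given button image. It assumes an exact, pixel-for-pixel comparison, which is a simplification, and the class AmbiguousMatchSketch and its findAll method are hypothetical names invented for this illustration rather than anything provided by Sikuli.

```java
import java.awt.Point;
import java.awt.image.BufferedImage;
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration of ambiguous matches; not Sikuli code.
final class AmbiguousMatchSketch {

    // Collects the top-left corner of every exact occurrence of pattern in screen.
    static List<Point> findAll(BufferedImage screen, BufferedImage pattern) {
        List<Point> hits = new ArrayList<>();
        for (int y = 0; y + pattern.getHeight() <= screen.getHeight(); y++) {
            for (int x = 0; x + pattern.getWidth() <= screen.getWidth(); x++) {
                if (matchesAt(screen, pattern, x, y)) {
                    hits.add(new Point(x, y));
                }
            }
        }
        return hits;
    }

    // True when every pixel of pattern equals the screen pixel at the given offset.
    private static boolean matchesAt(BufferedImage screen, BufferedImage pattern, int offsetX, int offsetY) {
        for (int y = 0; y < pattern.getHeight(); y++) {
            for (int x = 0; x < pattern.getWidth(); x++) {
                if (screen.getRGB(offsetX + x, offsetY + y) != pattern.getRGB(x, y)) {
                    return false;
                }
            }
        }
        return true;
    }

    public static void main(String[] args) {
        // Two identical green 2x2 "buttons" on a black 20x10 "screen".
        BufferedImage screen = new BufferedImage(20, 10, BufferedImage.TYPE_INT_RGB);
        BufferedImage button = new BufferedImage(2, 2, BufferedImage.TYPE_INT_RGB);
        for (int y = 0; y < 2; y++) {
            for (int x = 0; x < 2; x++) {
                button.setRGB(x, y, 0x00FF00);
                screen.setRGB(2 + x, 2 + y, 0x00FF00);
                screen.setRGB(12 + x, 2 + y, 0x00FF00);
            }
        }
        System.out.println("Candidates: " + findAll(screen, button).size()); // prints "Candidates: 2"
    }
}
```

With two candidates, the image alone cannot tell the script which button was meant, so capturing a larger or more distinctive screenshot is one way to reduce the ambiguity.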

More resources:

  • Simple Sikuli example here.
  • Detailed Sikuli documentation here.


Sikuli is very useful for automating Flash objects, and it can also be used to automate Windows-based applications. It is a great tool for interacting with elements on a screen based on how they look.

About the author: This is a guest post by Anitha Eswari. She is currently working as a senior test engineer having sound knowledge of manual and automation testing and various test management tools.

Next Tutorial: In the next part of this series let’s have a deep look at creating the Sikuli maven project and how to integrate Selenium with Sikuli.

Already using this tool? Please share your experience and tips. If you want to get started but have queries let us know.

73 thoughts on “Introduction to Sikuli GUI Automation Tool (Automate Anything You See on Screen) – Sikuli Tutorial #1”

  1. Hi Anitha, its nice information about mentioned methods in Sikuli and could you help me on below:

    1. how to write code for “keydown” characters/alphabets like D, C, V, B, N, S, X.. etc

    2. how to write code to select option after doing right click

    3. if we use robot and sikuli code then am getting error. so how can i use robot and sikuli combination

  2. I am using Sikuli to automate Flash objects. It is going well, but in one place I have two very similar images. Can anybody tell me how I can differentiate them?

  3. Is sikuli-api-1.1.0-sources.jar sufficient to run a sample program in Sikuli? I am getting an import statement error.

    If there are any more JAR files that need to be imported, please let me know.

  4. *** classpath dump
    0: /C:/Users/ronak/workspace/sikuliFirstTest/bin/
    1: /C:/Users/ronak/Desktop/sikuli/sikulixsetup-1.1.1.jar
    *** classpath dump end
    [error] RunTimeINIT: *** terminating: libs to export not found on above classpath: /sikulixlibs/windows/libs64

    Display this error when run sikuli script in eclipse

  5. Hi Guys,
    I have an issue when I want to get value of combobox.
    I took picture at combobox field and set TargetOffset for Combobox pattern. But I print out result, it doesn’t get value of combobox. Anyone here gets any idea for this issue. I’m using sikuli for implementation and sikuliX for capture pic

  6. I am new to this. Everyone else seems to be happy with your tutorial, but I am not: I tried to follow it and it is not helping; rather, it makes me doubt whether this is a great tool for automation. In your first example, the YouTube example, you do not show us how to get to YouTube, even though this tool automates things you see on screen. I have the Eclipse project open, so that is what I see, and when I run the script it does not find the pictures. Could you please revise that? I would like to know the following:
    How to work with sikuli if you have two screens?
    Where to put the pictures to be compared?
    Navigating to the page that needs testing and finding the pictures
    And you have the notepad example: tried to follow that but that’s not clear as well the icon is meant to be on my desk top to click and take a picture of that put it in my bin directory reference it to be compared I am confused can’t get to it or see it my eclipse environment is what I see .

    The same goes to the drag and drop example what am I dragging and dropping it where please explain what the source is and the destination.
    My understanding the source is where the picture to be dragged is and the destination where I want it to end up so please help clarify this

  7. Hi,

    Machine is not useable in parallel while SikuliX scripts or programs are running. Is there any better way to run the Sikuli script in backend? Please help me on this.

  8. Hi,

    Could you please post a tutorial on how to log in to Citrix where I do not want my credentials to be written down in a JAVA code,instead the user is asked to input the same during the execution of the code.

    Thanks in advance.

  9. Screen s=new Screen();

    I think this needs to be updated. I added
    to my project.

    The above line needs to implement a couple of methods before it can be used, it seems.
    Screen screen = new Screen() {
    public BufferedImage getScreenshot(int i, int i1, int i2, int i3) {
    return null;

    public Dimension getSize() {
    return null;

    I am not able to access any others, such as or any others.

  11. But I don’t know how to launch Youtube in the youtube example
    all those tests are after youtube is launched
    but who launches youtube app?

  12. hi,
    i am getting below error on creating object of screen

    Exception in thread “main” java.lang.ExceptionInInitializerError
    at org.sikuli.script.Region.(
    at org.sikuli.script.Screen.(
    at sachin.demo.main(
    Caused by: java.lang.StringIndexOutOfBoundsException: begin 2, end 3, length 2
    at java.base/java.lang.String.checkBoundsBeginEnd(
    at java.base/java.lang.String.substring(
    at org.sikuli.basics.Settings.(
    … 3 more

  13. SIKULI is an open-source GUI based test automation tool. It is mainly used for interacting with elements of web pages and handling windows based popups. Sikuli uses the technique of “Image Recognition” and “Control GUI” to interact with elements of web pages and windows popups. In Sikuli, all the web elements are taken as images and stored inside the project.

  14. As a beginner in using sikuli IDE, have launched the IDE and tried taking screen shot manually by clicking the ‘camera button’. it is taking me always to desktop page and not the one i would like to capture (chrome or any other screens).

    I want to capture some images from chrome browser for test. whenever i click image capture button in sikuli IDE (2.0.4) – java version 8, it is taking me to my desktop screen everytime. is there any setting i need to configure to make sure i can capture any screen i want?

  15. Very good article. I’m facing an issue: whenever the Firefox browser is upgraded, the button images get larger, so they no longer match the stored image size, and I have to replace the stored button images with the latest ones every time. Is there any way to handle this and avoid the manual replacement whenever a Firefox upgrade happens?

