WithSecureOpenSource
diff --git a/‎.gitignore
Lines changed: 1 addition & 0 deletions b/‎.gitignore
Lines changed: 1 addition & 0 deletions
diff --git a/‎LICENSE-murphy.txt
Lines changed: 13 additions & 0 deletions b/‎LICENSE-murphy.txt
Lines changed: 13 additions & 0 deletions
diff --git a/‎LICENSE-vncdotool.txt
Lines changed: 19 additions & 0 deletions b/‎LICENSE-vncdotool.txt
Lines changed: 19 additions & 0 deletions
diff --git a/‎LICENSE.txt
Lines changed: 11 additions & 0 deletions b/‎LICENSE.txt
Lines changed: 11 additions & 0 deletions
diff --git a/‎README.md
Lines changed: 4 additions & 0 deletions b/‎README.md
Lines changed: 4 additions & 0 deletions
diff --git a/‎docs/extractingmodels.html
Lines changed: 100 additions & 0 deletions b/‎docs/extractingmodels.html
Lines changed: 100 additions & 0 deletions
diff --git a/‎docs/how.html
Lines changed: 51 additions & 0 deletions b/‎docs/how.html
Lines changed: 51 additions & 0 deletions
diff --git a/‎docs/img/7zip-desc.png
16.9 KB b/‎docs/img/7zip-desc.png
16.9 KB
diff --git a/‎docs/img/7zip-flow.png
61.9 KB b/‎docs/img/7zip-flow.png
61.9 KB
diff --git a/‎docs/img/diff.png
46.8 KB b/‎docs/img/diff.png
46.8 KB
diff --git a/‎docs/img/diff2.png
69.8 KB b/‎docs/img/diff2.png
69.8 KB
diff --git a/‎docs/img/export.png
21.8 KB b/‎docs/img/export.png
21.8 KB
diff --git a/‎docs/img/inputs.png
31.6 KB b/‎docs/img/inputs.png
31.6 KB
diff --git a/‎docs/img/new_model.png
88.5 KB b/‎docs/img/new_model.png
88.5 KB
diff --git a/‎docs/img/old_model.png
69.2 KB b/‎docs/img/old_model.png
69.2 KB
diff --git a/‎docs/img/planning.png
95.5 KB b/‎docs/img/planning.png
95.5 KB
diff --git a/‎docs/img/realflow.png
88.8 KB b/‎docs/img/realflow.png
88.8 KB
diff --git a/‎docs/img/route.png
41.3 KB b/‎docs/img/route.png
41.3 KB
diff --git a/‎docs/img/runlog.png
47.8 KB b/‎docs/img/runlog.png
47.8 KB
diff --git a/‎docs/img/smallflow.png
37 KB b/‎docs/img/smallflow.png
37 KB
diff --git a/‎docs/install.html
Lines changed: 82 additions & 0 deletions b/‎docs/install.html
Lines changed: 82 additions & 0 deletions
diff --git a/‎docs/introduction.html
Lines changed: 36 additions & 0 deletions b/‎docs/introduction.html
Lines changed: 36 additions & 0 deletions
@@ -0,0 +1 @@
+*.pyc
@@ -0,0 +1,13 @@
+Copyright 2014 F-Secure Corporation.
+
+Licensed under the Apache License, Version 2.0 (the "License");
+you may not use this file except in compliance with the License.
+You may obtain a copy of the License at
+
+    http://www.apache.org/licenses/LICENSE-2.0
+
+Unless required by applicable law or agreed to in writing, software
+distributed under the License is distributed on an "AS IS" BASIS,
+WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+See the License for the specific language governing permissions and
+limitations under the License.
@@ -0,0 +1,19 @@
+Copyright (c) 2010 Marc Sibson
+
+Permission is hereby granted, free of charge, to any person obtaining a copy
+of this software and associated documentation files (the "Software"), to deal
+in the Software without restriction, including without limitation the rights
+to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
+copies of the Software, and to permit persons to whom the Software is
+furnished to do so, subject to the following conditions:
+
+The above copyright notice and this permission notice shall be included in
+all copies or substantial portions of the Software.
+
+THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
+IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
+FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
+AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
+LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
+OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN
+THE SOFTWARE.
@@ -0,0 +1,11 @@
+Murphy is licensed under the Apache 2.0 (APL 2.0) license, see
+LICENSE-murphy.txt
+
+Murphy includes a slightly modified version of vncdotool in 
+murphy\user_simulation\vnc\vncdotool:
+    vncdotool is a command line VNC client. It can be useful to automating
+    interactions with virtual machines or hardware devices that are otherwise
+    difficult to control.
+See LICENSE-vncdotool.txt for license information, website can be found at
+https://github.com/sibson/vncdotool
+
@@ -0,0 +1,4 @@
+murphy_sdk
+==========
+
+Murphy open source
@@ -0,0 +1,100 @@
+<html>
+<title>Your first model extraction scripts</title>
+<body>
+<h3>Starting from the samples</h3>
+<h4>Short explanation</h4>
+Create a dir 7zip_mine, copy extract_7zip_win_scraper.py inside of it.<br>
+In the line <pre>extractor = base_extractor.BaseExtractor('7zipWinScraper',</pre> change 7zipWinScraper for 7zip_mine<br>
+Add the new project to web_workbench/project.json files the following line:<br>
+<pre>,{"file": "../samples/7zip_mine/7zip_mine/7zip_mine.json"}</pre>
+Run extract_7zip_win_scraper.py<br>
+
+<h4>Longer explanation</h4>
+It is ofthen easier to start with an existing script and adjust rather than totally from scratch.<br>
+For that purpose, create a folder called 7zip_mine (in the directory where you had murphy, for example: c:\git_projects\murphy_sdk\samples\7zip_mine<br>
+Copy the file extract_7zip_win_scraper.py from the samples\7zip into the new folder.<br>
+<br>
+The code of that file looks like this:<br>
+<pre>
+from model_extraction import base_extractor, configuration
+from model_extraction.ui import window_scrap
+    
+if __name__ == '__main__':
+    test_files = configuration.get_default_config()["test files"]
+    extractor = base_extractor.BaseExtractor('7zipWinScraper',
+                                             '\\utils\\runurl.py ' + test_files + '7z920.exe')
+
+    #Ignore 'Browse For Folder' dialog as we dont care for windows native dialog
+    extractor.add_boundary_node('Browse For Folder')
+    
+    extractor.scrap_method = lambda node, world, scraper_hints, node_hints: window_scrap.custom_scraper(node, world)
+    
+    extractor.crawl_application()
+</pre>
+We care about this line:<br>
+<pre>
+    extractor = base_extractor.BaseExtractor('7zipWinScraper',
+                                             '\\utils\\runurl.py ' + test_files + '7z920.exe')
+</pre>
+The first parameter is the name we give to the model, so lets start by changing it to 7zip_mine<br>
+The second parameter is the 1st action to perform when murphy starts crawling the application, we're not going to change it now but all you need to know for now is that
+it will be a command that when build will look like: <pre>\utils\runurl.py http://192.168.56.1:8901/files/7z920.exe</pre>
+In essence, it will download that url and then execute 7z920.exe<br>
+Once we modified that line it should look like:<br>
+<pre>
+    extractor = base_extractor.BaseExtractor('7zip_mine',
+                                             '\\utils\\runurl.py ' + test_files + '7z920.exe')
+</pre>
+We're ready to run it now, however we wont see it in the web workbench yet, we have to configure it, for doing so, open the file config.json that is located in the web_workbench directory.<br>
+It should look like this:<br>
+<pre>
+[
+  {
+    "file": "../samples/7zip/7zip/7zip.json"
+  }, 
+  {
+    "file": "../samples/7zip/7zip_newer/7zip_newer.json"
+  }, 
+  {
+    "file": "../samples/7zip/7zipWinScraper/7zipWinScraper.json"
+  }
+]
+</pre>
+It is pretty self described, we need to copy the last entry and add our new model, you should modify it as:<br>
+<pre>
+[
+  {
+    "file": "../samples/7zip/7zip/7zip.json"
+  }, 
+  {
+    "file": "../samples/7zip/7zip_newer/7zip_newer.json"
+  }, 
+  {
+    "file": "../samples/7zip/7zipWinScraper/7zipWinScraper.json"
+  },
+  {
+    "file": "../samples/7zip_mine/7zip_mine.json"
+  }
+]
+</pre>
+If you dont have the workbench running, launch it now (for example: C:\git-projects\murphy_sdk\web_workbench>workbench.py)<br>
+Open a browser to http://127.0.0.1:8090 and enter your name, the project should appear in the dropdown list of models, of course it will still display a blank model.<br>
+(if you had a browser opened then reload the page so you'll get the new project)<br>
+So from a command line do:<br>
+<pre>C:\git-projects\murphy_sdk\samples\7zip_mine>extract_7zip_win_scraper.py</pre>
+In a few seconds you should see:
+<pre>
+2014-01-27 15:20:29,338 - root.model_extraction.crawler - INFO - Started exploration, current node is None, current edge is None
+2014-01-27 15:20:29,556 - root.model_extraction.crawler - INFO - Node created for state Node Node 0 (1)
+        ->Launch application->None
+2014-01-27 15:20:29,571 - root.model_extraction.ui.edge - INFO - Performing Edge Launch application (Node 0)
+2014-01-27 15:20:29,782 - root.murphy.user_simulation.helpers - INFO - Waiting for a stable screen
+...
+</pre>
+Go now to the web workbench page and select the model from the dropdown list, you may have to wait a little until first images are viewable.<br>
+<br>
+<h4>If something went wrong</h4>
+Make sure you had PYTHONPATH set (for example set PYTHONPATH=C:\git-projects\murphy_sdk).<br>
+Ensure that when you are going to execute the script (extract_7zip_win_scraper.py) the virtual machine IS NOT RUNNING in virtualbox.<br>
+</body>
+</html>
@@ -0,0 +1,51 @@
+<html>
+<title>How murphy works</title>
+<body>
+<h3>High level inner workings of murphy</h3>
+<h4>Extracting an application model</h4>
+Murphy will run in one machine (let's call it the controller) and will spawn a virtual machine<br>
+In the virtual machine it will launch the application to be analyzed<br>
+It will analyze what is seeing in the screen, what is the currently active window, what actions can the user do (buttons / links it can click, text inputs, selections from dropdown boxes, etc)<br>
+It will systematically try to do those actions and generate a model that describes such application<br>
+At any given point, murphy may decide to recreate the virtual machine to get back to a clean state (to prevent side effects from previously executed actions)<br>
+<h4>Separation of concerns</h4>
+Internally murphy has a few abstractions, amongst them:<br>
+<ul>
+<li>A scraper (a sorry name, but responsible to decompose what the user sees, akin to a DOM of the application)</li>
+<li>A crawler (responsible for 'walking around' the application for mapping it)</li>
+<li>A machine (handles the logic for creating, allocating and deallocating a virtual machine or device</li>
+<li>A user (simulates user actions like mouse clicks and keyboard strokes, also responsible for providing screenshots</li>
+</ul>
+<h4>How does murphy know what elements the user can interact with?</h4>
+The scraper component is a class responsible for doing that, there are at the moment 2 scrapers provided in murphy.<br>
+One uses windows API's for enumerating windows and controls, so any native application built on c, c++, mfc and many other technologies will work fine,
+it can be found in model_extraction_helpers/window_scraper.py<br>
+The other uses user actions (pressing the TAB key and hovering the mouse) and analyzes the changes in the UI, it then deduces the user interface elements based on that, it can be found in model_extraction/ui/scraper.py<br>
+The good side of scraper.py is that it works for most applications independently of the language used to develop them, the bad is that is unable to extract texts from the ui and that many applications are poorly written
+and many ui elements are not reachable while pressing the TAB key.<br>
+<br>
+It is relatively simple to create additional scrapers, in house we had to develop a custom one for applications based on QT 4, we also develop an experimental scraper for android applications and also another one
+based on the windows accessibility API (ui automation)<br>
+<h4>How does murphy know it has to input a serial key / activation code / etc in an input field?</h4>
+It doesnt, that's why the way to extract a model is to write a simple script and tell murphy the very specific, non guessable things of the application you're interested in.<br>
+<h4>Boundaries</h4>
+You dont want to crawl everything, suppose a case that there's a link in your application that opens a browser, im pretty sure you dont want murphy to crawl the whole internet.<br>
+Here's where the concept of boundary comes, in an extraction script, one of the things you tell murphy is what you're NOT interested to crawl, in other words what are the boundaries of the exploration<br>
+<h4>In simple words</h4>
+The idea is to tell only the essential things in an extraction script, some typical things to handle:<br>
+<ul>
+<li>launch x application (run x command)</li>
+<li>if the window is a help window, i'm not interested to crawl beyond it.</li>
+<li>if the window is the 'enter serial number' then use x value on y field</li>
+<li>if the window is 'license information' then the value of field 'y' must be ignored or custom handle (for example a timestamp)</li>
+</ul>
+<h4>Simplest model extraction script</h4>
+<pre>
+extractor = base_extractor.BaseExtractor('7zip', '\\utils\\runurl.py http://127.0.0.1:8901/files/7z920.exe')
+
+extractor.add_boundary_node('Browse For Folder')
+
+extractor.crawl_application()
+</pre>
+</body>
+</html>
@@ -0,0 +1,82 @@
+<html>
+<title>Installing murphy</title>
+<body>
+<br>
+<strong>WARNING</strong><br>
+Murphy will launch a web server in the machine it is installed and will listen for connections in port 8090, care has been taken as this to be as safe as possible, however it
+is recommended that you disable inbound connections to that port in your firewall unless you know and are sure of what you're doing.<br>
+<h3>Platforms</h3>
+Windows 7 (should work in Vista and windows 8, maybe xp, never tested with Metro apps)<br>
+There's interest in making it work under Linux and we're almost there, if you know linux you can probably make it work.<br>
+<h3>Dependencies</h3>
+Python 2.7.3 or newer (32 bits), wont probably work on python 3.x and I havent tested with pre 2.7.3. It can be found <a href="http://www.python.org/download/releases/2.7/">here</a><br>
+There's a small dll written in c and compiled for 32 bits platform so it wont work on python 64 bits, however the work needed to support it is minimal<br>
+Python PIL (be sure to install the one that matches your python version, including 32 or 64 bits...), it can be found <a href="http://www.pythonware.com/products/pil/">here</a><br>
+The included vnc library uses Twisted (download twisted for python), which can be found <a href="https://twistedmatrix.com">here</a><br>
+Twisted needs zope (4.0.6 seems to work fine), and can be found <a href="https://pypi.python.org/pypi/zope.interface">here</a><br>
+Graphviz, which is used for the flow graphs can be found <a href="http://www.graphviz.org/">here</a> (Note: latest versions has some issues with the generated SVG, version 2.32 is known to work fine)<br>
+You'll need Bottle, you can get it from <a href="http://bottlepy.org/">here</a>, just drop the file in the web_workbench directory (it is known to work with version 0.11)<br>
+VirtualBox, it can be found <a href="https://www.virtualbox.org/">here</a><br>
+Other virtualization tools like Kvm, VmWare, Dvmps, etc. can be used but need a small and simple adapter, take a look at virtualbox.py and rules_vbox.py in the model_extraction directory for that matter.<br>
+7zip is used in some examples / exercises, get 7z920.exe and put it in the directory model_extraction_helpers/files (newer versions may work but I haven't tested them)<br>
+
+
+<h3>Murphy</h3>
+Since you're reading this, you already have this or know how to get this.<br>
+You can copy murphy to any directory, you'll need to set your PYTHONPATH to that directory, so if you did a git clone to say c:\git_projects\murphy_sdk then do set PYTHONPATH=c:\git_projects\murphy_sdk<br>
+Note that PYTHONPATH is needed for the model extraction scripts, the web workbench doesn't need it.<br>
+
+<h3>Virtual machines</h3>
+Murphy uses virtual machines to run the applications it will crawl and extract their models, it will not run the applications locally (unless you modify murphy to do so)<br>
+See <a href="virtual machine.html">here</a> on instructions on how to create a virtual machine that murphy can use.<br>
+<h3>Checking the installation</h3>
+Last but not least, in the murphy folder, there's a file graphviz.py, change the line:
+<pre>
+_GRAPHVIZ_EXE = (r'\Program Files (x86)\Graphviz2.32\bin\dot.exe')
+</pre>
+As to point to the version of graphviz you installed<br>
+After taking care of the dependencies, creating and configuring a virtual machine, be sure the virtual machine image you're going to use is not running!<br>
+<br>
+Assuming that your installation directory is c:\git_projects\murphy_sdk, do:<br>
+<pre>
+c:\git_projects\murphy_sdk\web_workbench>python workbench.py
+Workbench running on pid 736
+Bottle v0.11.rc1 server starting up (using WSGIRefServer())...
+Listening on http://0.0.0.0:8090/
+Hit Ctrl-C to quit.
+</pre>
+Open the browser of your choice, I use chrome and every now and then test that it works in FF, no clue if it even works in IE.<br>
+Point your browser to http://127.0.0.1:8090/<br>
+When asked about your user name you can put whatever you want as it does not do any authentication, press the login button.<br>
+At the top-left corner you should see a dropdown with the models, choose one from the list.<br>
+After a few seconds you should see a statechart loaded in the browser, if not take a look a the console output as something went bad.<br>
+Launch the webserver that provides the test files:<br>
+<pre>C:\git-projects\murphy_sdk\model_extraction_helpers>python -m SimpleHTTPServer 8901</pre>
+<h4>Last but not least, test that the model extraction works</h4>
+From a console, do (remember to set the PYTHONPATH var!)<br>
+<pre>
+C:\git-projects\murphy_sdk\samples\7zip>extract_7zip_win_scraper.py
+</pre>
+After a few seconds you should see something similar to this:<br>
+<pre>
+C:\git-projects\murphy_sdk\samples\7zip>extract_7zip_win_scraper.py
+2014-01-26 12:59:07,364 - root.model_extraction.crawler - INFO - Started exploration, current node is None, current edge is None
+2014-01-26 12:59:07,641 - root.model_extraction.crawler - INFO - Node created for state Node Node 0 (1)
+        ->Launch application->None
+2014-01-26 12:59:07,654 - root.model_extraction.ui.edge - INFO - Performing Edge Launch application (Node 0)
+2014-01-26 12:59:07,897 - root.murphy.user_simulation.helpers - INFO - Waiting for a stable screen
+2014-01-26 12:59:15,072 - root.murphy.user_simulation.vnc.vncdotool.vnc_wrapper - INFO - Sending keystrokes {+ctrl}{esc}{-ctrl}
+2014-01-26 12:59:15,842 - root.murphy.user_simulation.vnc.vncdotool.vnc_wrapper - INFO - Sending keystrokes \utils\runurl.py http://192.168.56.1:8901/files/7z920.exe{enter}
+2014-01-26 12:59:31,237 - root.murphy.user_simulation.vnc.vncdotool.vnc_wrapper - INFO - moving mouse to 1 1
+2014-01-26 12:59:31,453 - root.model_extraction.virtualbox - INFO - Wait for remote machine to be idle...
+2014-01-26 12:59:31,456 - root.murphy.user_simulation.helpers - INFO - Waiting for a stable screen
+2014-01-26 12:59:39,292 - root.model_extraction.ui.node - DEBUG - Checking if i'm in Node 0
+...
+</pre>
+It will take something between 5 and 10 minutes to extract the model, depending on many many many factors but it should finish, if you notice
+in the log output that got stuck for more than a couple of minutes then something is definitively wrong.<br>
+After (or while) the model is extracted, you can check the progress in the workbench tool.<br>
+The models are already extracted in the git version so you can see the flows in the workbench but they wont work interactively or thru scripting if you dont rebuild them,
+now is a good time to build the other provided examples (or you're doomed to forget it :))<br>
+</body>
+</html>
@@ -0,0 +1,36 @@
+<html>
+<title>Murphy</title>
+<body>
+<h1>Murphy, a set of tools for automation.</h1>
+<br>
+<img src="img/smallflow.png"><br>
+<br>
+Some of the things that can be done with can be found <a href="overview.html">here</a>.<br>
+<br>
+<a href="why.html">Why?</a><br>
+<br>
+<a href="how.html">How?</a><br>
+<br>
+<a href="worthknowing.html">Things you should know...</a><br>
+<br>
+<a href="install.html">Installation & running the samples</a><br>
+<br>
+<a href="extractingmodels.html">Creating your first model extraction scripts</a><br>
+<br>
+<a href="quickref.html">Quick references</a><br>
+<br>
+More to come...<br>
+<br>
+<h3>References on the web</h3>
+A paper related to murphy was presented in EESSMod 2013 - Experiences and Empirical Studies in Software Modeling at <a href="http://ceur-ws.org/Vol-1078/">http://ceur-ws.org/Vol-1078/</a>, a direct
+link to the paper is <a href="http://ceur-ws.org/Vol-1078/paper6.pdf">here</a><br>
+Internal workshop of Ericsson in Stockholm, 2013 (sorry, no online material available at the moment)<br>
+Anoter presentation related to murphy was submitted to <a href="http://issta2014.org/">ISSTA 2014</a><br>
+<br>
+There's no official support, we may have support forums in the future if there's enough interest, in the meaintime (and I make no promises) you can try reach me at valkoinen.rapu at gmail.com.<br>
+Some basic info I've written can be found in <a href="http://valkoinenrapu.blogspot.com/2014/01/about-time.html">this blog</a><br>
+<br>
+<h3>Thanks to</h3>
+Everybody who made this possible including the vncdotool guys and Pekka Aho et al (who kindly published papers about the experiences of Murphy in F-Secure)<br>
+</body>
+</html>