UCB Flor¶
Flor (formerly known as Jarvis) is a system with a declarative DSL embedded in python for managing the workflow development phase of the machine learning lifecycle. Flor enables data scientists to describe ML workflows as directed acyclic graphs (DAGs) of Actions and Artifacts, and to experiment with different configurations by automatically running the workflow many times, varying the configuration. To date, Flor serves as a build system for producing some desired artifact, and serves as a versioning system that enables tracking the evolution of artifacts across multiple runs in support of reproducibility.
Install Flor¶
Clone or download the Flor repository.
You’ll need Anaconda, preferably version 4.4+
Please read this guide to set up a Python 3.6 environment inside Anaconda. Whenever you work with Flor, make sure the Python 3.6 environment is active.
Once the Python 3.6 environment in Anaconda is active, please run the following command (use the requirements.txt file in this repo):
pip install -r requirements.txt
Next, we will install RAY, a Flor dependency:
brew update
brew install cmake pkg-config automake autoconf libtool boost wget
pip install numpy funcsigs click colorama psutil redis flatbuffers cython --ignore-installed six
conda install libgcc
pip install git+https://github.com/ray-project/ray.git#subdirectory=python
Next, Add the directory containing this flor package (repo) to your PYTHONPATH.
Quickstart¶
Create a Python file named plate.py:
import flor
with flor.Experiment('plate_demo') as ex:
ex.groundClient('git')
ones = ex.literalForEach([1, 2, 3], "ones")
tens = ex.literalForEach([10, 100], "tens")
@flor.func
def multiply(x, y):
print(x*y)
return x*y
doMultiply = ex.action(multiply, [ones, tens])
product = ex.artifact('product.txt', doMultiply)
product.plot()
product.pull()
To run the file:
# Within a Python3.6 Anaconda environment
$ python plate.py
The expected output is as follows:
10
20
30
100
200
300