This is an external repository to build functionality for Arkouda with a focus on advanced graph processing. It is built with the same structure as arkouda-contrib to manage modules and easily swap between the production (arachne) and development (arachne_development) directories.
Before installing the prerequisites below, the following libraries must be installed on your system. This can be done with any package manager available for your distribution or, on an HPC cluster, by loading them as modules. At the time of writing, the following versions were confirmed to work.
- GCC 11.2.0 or later.
- CMake 3.26.3.
- OpenMPI GNU 4.1.4 (needed by CMake)
- Anaconda 2023.09-0
- jq 1.6 command-line JSON processor.
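A quick way to see which of the tools listed above are visible on your PATH is sketched below. It uses only the Python standard library. The executable names (for example, probing OpenMPI through `mpirun`) are assumptions, and the sketch does not check versions.

```python
import shutil

# Executable names are assumptions: e.g. OpenMPI is probed via `mpirun`.
PREREQUISITE_COMMANDS = {
    "GCC": "gcc",
    "CMake": "cmake",
    "OpenMPI": "mpirun",
    "Anaconda": "conda",
    "jq": "jq",
}


def find_prerequisites(commands=PREREQUISITE_COMMANDS):
    """Map each prerequisite name to its executable path, or None if absent."""
    return {name: shutil.which(exe) for name, exe in commands.items()}


if __name__ == "__main__":
    for name, path in find_prerequisites().items():
        print(f"{name:10s} {path or 'NOT FOUND'}")
```

On a cluster, run this after loading the relevant modules so that the PATH reflects them.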
We recommend following the installation instructions provided by the Arkouda development team. Most specifically, follow the Prerequisites section in its entirety, and only the Dependency Configuration section of the build instructions. The installation steps usually involve the following:
- Download Chapel from the Chapel downloads page. Use Chapel version 2.4.0.
- Alternatively, you may clone Chapel and switch to a given tagged version. The commands for these should look something like:
    git clone https://github.com/chapel-lang/chapel.git
    cd chapel
    git fetch --tags origin
    git checkout tags/2.4.0 --force
- Build Chapel by executing the commands below. This assumes you have installed all Chapel prerequisites. Note: We recommend using gcc/11.2.0 or later due to dependencies with VieCut.
    cd /path/to/chapel/
    source ./util/setchplenv.bash
    export CHPL_GMP=bundled
    export CHPL_HWLOC=bundled
    export CHPL_RE2=bundled
    export CHPL_LLVM=bundled
    make -j 4 # This value can be increased dependent on your device's number of processors
- Note: This installs single locale (shared-memory) Chapel. For multilocale (distributed-memory) Chapel please follow the documentation guide on Multilocale Chapel Execution. Arachne has its best performance on shared-memory. However, kernels such as breadth-first search and property graph querying have multilocale-optimized versions that require multilocale Chapel to be installed.
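After the build, it can be useful to confirm that the compiler on your PATH matches the recommended Chapel version. The sketch below parses a version triple out of a string. The sample string is a hypothetical stand-in for what `chpl --version` prints, and the real output may be formatted differently.

```python
import re

# Version recommended in the steps above.
EXPECTED_CHAPEL = (2, 4, 0)


def parse_version(text):
    """Return the first MAJOR.MINOR.PATCH triple found in text, or None."""
    match = re.search(r"(\d+)\.(\d+)\.(\d+)", text)
    return tuple(int(part) for part in match.groups()) if match else None


# Hypothetical sample; the real `chpl --version` output may differ.
sample = "chpl version 2.4.0"
print(parse_version(sample) == EXPECTED_CHAPEL)
```

To check a real installation, the text could instead come from `subprocess.run(["chpl", "--version"], capture_output=True, text=True).stdout`.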
- Download, but do not build, Arkouda. Use Arkouda version v2025.01.13. A specified version can be selected for download by clicking on Releases in the main GitHub page for Arkouda.
- Alternatively, you may clone Arkouda and switch to a given tagged version.
    git clone https://github.com/Bears-R-Us/arkouda.git
    cd arkouda
    git fetch --tags origin
    git checkout tags/v2025.01.13 --force
- Install Arkouda dependencies with Anaconda. An environment containing all dependencies can be installed from arkouda-env.yml within your Arkouda home directory.
- This can be done by executing the following command within your Arkouda directory:
    conda env create -f arkouda-env.yml
- Configure your Arkouda dependencies. This involves creating (or modifying) the Makefile.paths within your Arkouda home directory.
- Install constrained-clustering and compile the C++ object files required by Arachne by following the commands below. Constrained-clustering requires a C++ compiler that supports c++-20, such as clang++11, and cmake. These and other prerequisites should be covered by the prerequisites in items 1-5 above.
    cd /path/to/arkouda-njit/arachne/server/external_libs
    git clone https://github.com/MinhyukPark/constrained-clustering.git
    cd constrained-clustering
    ./setup.sh
    ./easy_build_and_compile.sh
    cd ../../../../arachne/server/viecut_helpers/
    source compileLogger.sh -f logger.cpp -o logger.cpp.o
    gcc -c -fPIC -I../external_libs/constrained-clustering/external_libs/VieCut/lib/ -I../external_libs/constrained-clustering/external_libs/VieCut/extlib/tlx/ computeMinCut.cpp -o computeMinCut.o
    cd ../../../
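For the Makefile.paths step in the list above, the following sketch renders the lines such a file might contain. It assumes the `$(eval $(call add-path,...))` form described in the Arkouda build instructions, so verify the exact syntax there before relying on it. The environment path is a hypothetical placeholder.

```python
def makefile_paths_lines(dependency_dirs):
    """Render one add-path line per dependency directory.

    The `add-path` form is an assumption taken from the Arkouda build
    instructions; confirm it against the version you are building.
    """
    return [f"$(eval $(call add-path,{d}))" for d in dependency_dirs]


# Hypothetical location of the conda environment created from arkouda-env.yml.
lines = makefile_paths_lines(["/path/to/anaconda3/envs/arkouda"])
print("\n".join(lines))
```

The printed lines can be pasted into Makefile.paths in the Arkouda home directory.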
Arachne is built by executing the module_configuration.py file. The complete path to the Arkouda directory must be specified through ak_loc, and the complete path to the Arachne directory must be specified through pkg_path.
    python module_configuration.py --ak_loc=/complete/path/to/arkouda/ --pkg_path=/complete/path/to/arkouda-njit/arachne/ | bash
The above command pipes the following three commands to the terminal. They install Arachne using pip, copy the Arkouda server modules configuration to a temporary file, and combine it with the Arachne server modules to build the enhanced arkouda_server.
    pip install -U /complete/path/to/arkouda-njit/arachne/client
    cp /complete/path/to/arkouda/ServerModules.cfg ~/TmpServerModules.cfg.1683320760
    ARKOUDA_SERVER_USER_MODULES=" /complete/path/to/arkouda-njit/arachne/server/BuildGraphMsg.chpl /complete/path/to/arkouda-njit/arachne/server/PropertyGraphMsg.chpl /complete/path/to/arkouda-njit/arachne/server/GraphInfoMsg.chpl /complete/path/to/arkouda-njit/arachne/server/BFSMsg.chpl /complete/path/to/arkouda-njit/arachne/server/TriCtrMsg.chpl /complete/path/to/arkouda-njit/arachne/server/TriCntMsg.chpl /complete/path/to/arkouda-njit/arachne/server/TrussMsg.chpl /complete/path/to/arkouda-njit/arachne/server/CCMsg.chpl" ARKOUDA_CONFIG_FILE=~/TmpServerModules.cfg.1683320760 ARKOUDA_SKIP_CHECK_DEPS=true make -C /complete/path/to/arkouda
For usage instructions of module_configuration.py, please execute the following.
    python module_configuration.py --help
If you are interested in installing the development version of Arachne, follow the same instructions as above, but set pkg_path to /complete/path/to/arkouda-njit/arachne_development/.
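To make the structure of the generated make command easier to follow, the sketch below shows how a space-separated ARKOUDA_SERVER_USER_MODULES value could be assembled from a package path and a list of module names. The helper `user_modules` is hypothetical and is not taken from module_configuration.py, whose actual logic may differ.

```python
from pathlib import PurePosixPath


def user_modules(pkg_path, module_names):
    """Join <pkg_path>/server/<name>.chpl entries with single spaces.

    Hypothetical helper for illustration only.
    """
    server_dir = PurePosixPath(pkg_path) / "server"
    return " ".join(str(server_dir / f"{name}.chpl") for name in module_names)


modules = user_modules(
    "/complete/path/to/arkouda-njit/arachne/",
    ["BuildGraphMsg", "BFSMsg"],
)
print(f'ARKOUDA_SERVER_USER_MODULES="{modules}"')
```

Passing the arachne_development path instead would point the same module names at the development directory.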
The server can be started as specified in the Arkouda documentation. In short, navigate to your Arkouda directory, where an executable named arkouda_server should exist, and execute it with the command below to start a server instance.
    ./arkouda_server # -nl X
The output should look something like the following.
********************************************************************************************************
********************************************************************************************************
* *
* server listening on tcp://n118.cluster.local:5555 *
* arkouda server version = v2024.06.21 *
* built with chapel version2.1.0 *
* memory limit = 973796998348 *
* bytes of memory used = 0 *
* *
********************************************************************************************************
********************************************************************************************************
To run the testing harness via pytest please proceed to the Arachne directory for those instructions.
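The host and port a client needs are printed in the "server listening on" line of the startup banner. A minimal sketch for extracting them from that line follows. It assumes the address always has the `tcp://host:port` shape seen in the sample output.

```python
import re


def parse_listen_address(banner):
    """Return (host, port) from a 'tcp://host:port' address, or None."""
    match = re.search(r"tcp://([^\s:]+):(\d+)", banner)
    if match is None:
        return None
    return match.group(1), int(match.group(2))


# Line copied from the sample banner.
line = "* server listening on tcp://n118.cluster.local:5555 *"
host, port = parse_listen_address(line)
print(host, port)
```

These two values are what a client would typically pass to Arkouda's `ak.connect(server=host, port=port)`.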
    import arkouda as ak
    import arachne as ar
    # code using arachne and arkouda below
- Issue: Unrecognized HDF5, Apache Arrow, etc. installations.
  Fix: Ensure Makefile.paths was properly added to the base Arkouda directory. More information can be found in the Arkouda build instructions.
- Issue: Arkouda or Arachne functions are not recognized when executing scripts.
  Fix: Make sure to run pip3 install -e . at both /complete/path/to/arkouda-njit/arachne/client/. and /complete/path/to/arkouda/.
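When diagnosing the second issue, it can help to first confirm whether the active Python environment can locate both packages at all. The sketch below uses only the standard library and imports neither package. The function name is hypothetical.

```python
import importlib.util


def missing_packages(names=("arkouda", "arachne")):
    """Return the top-level package names that cannot be located."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_packages()
    print("missing:", ", ".join(missing) if missing else "none")
```

If either name is reported missing, rerun the pip install step in the corresponding directory with the intended conda environment active.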