Customize Solvers¶
A more interesting thing to explore is extending GraphVite with new solvers. Generally, the core library is capable to perform any graph embedding variant that fits into the following paradigm.
The training samples are edges. There may be additional attributes (e.g. labels) to edge samples.
To support that, GraphVite provides a protocol interface and a series of abstract classes. We only need to declare the protocols for our parameters, and fill in the virtual member functions for the classes.
Let’s begin with the protocol interface. There are 3 main protocols for parameters.
head
tail
global
For each parameter matrix, it should be assigned one of these protocols.
head
means that the parameter matrix is indexed by head nodes in directed edges,
while tail
corresponds to tail nodes. Any other parameter matrix should be assigned
with global
.
There are also 2 optional protocols. One is in place
, which implies that the
parameter matrix takes in-place update and doesn’t need storage for gradients.
The other is shared
, which implies the matrix is shared with the previous one.
This may be used for tied weight case.
Each parameter matrix should also be specified with a shape. We can use auto
if the shape can be inferred from the protocol and the graph structure.
For example, knowledge graph embeddings take the following settings.
// head embeddings, tail embeddings, relation embeddings
protocols = {head | in place, tail | in place | shared, global};
shapes = {auto, auto, graph->num_relation};
If the learning routine also needs negative sampling, we should additionally specify a negative sampler protocol. For knowledge graph embedding, this is
negative_sampler_protocol = head | tail;
Given the protocols, GraphVite will automatically schedule the paramters and samples over multiple GPUs, using an algorithm called parallel negative sampling. For a more detailed explanation of the algorithm, see section 3.2 in GraphVite paper.
Note
Parallel negative sampling only takes place when at least one parameter matrix
is head
or tail
. If all parameters are global
, GraphVite will schedule
them by standard data parallel.
To implement a new solver, we need to implement get_protocols()
,
get_sampler_protocol()
and get_shapes()
as above. Some additional helper
functions may be required to complete the solver.
A solver also contains a sampler and a worker class. By default, the sampler samples
positive edges from the graph, with probability proportional to the weight of each
edge. We only need to specify the additional edge attributes in get_attributes()
.
For the worker, it will build the negative sampler according to the its protocol.
We need to specify the GPU implementation of models in train_dispatch()
. See
Customize Models for how to do that.
Finally, to get our new solver appeared in Python, add a Python declaration for it in
include/bind.h
, and instantiate it in src/graphvite.cu
.
See include/instance/*
for all solver instances.
Note
Functions in solver, sampler and worker can be overrided. For example,
GraphSolver
overrides edge sampling with
online augmentation.