pystruct.models.ChainCRF

class pystruct.models.ChainCRF(n_states=None, n_features=None, inference_method=None, class_weight=None, directed=True)[source]

Linear-chain CRF.

Pairwise potentials are symmetric and the same for all edges. This leads to n_classes parameters for unary potentials. If directed=True, there are n_classes * n_classes parameters for pairwise potentials, if directed=False, there are only n_classes * (n_classes + 1) / 2 (for a symmetric matrix).

Unary evidence x is given as array of shape (n_nodes, n_features), and labels y are given as array of shape (n_nodes,). Chain lengths do not need to be constant over the dataset.

Parameters:

n_states : int, default=None

Number of states for all variables. Inferred from data if not provided.

inference_method : string or None, default=None

Function to call do do inference and loss-augmented inference. Defaults to “max-product” for max-product belief propagation. As chains can be solved exactly and efficiently, other settings are not recommended.

class_weight : None, or array-like

Class weights. If an array-like is passed, it must have length n_classes. None means equal class weights.

directed : boolean, default=False

Whether to model directed or undirected connections. In undirected models, interaction terms are symmetric, so an edge a -> b has the same energy as b -> a.

Methods

batch_inference(X, w[, relaxed])
batch_joint_feature(X, Y[, Y_true])
batch_loss(Y, Y_hat)
batch_loss_augmented_inference(X, Y, w[, ...])
continuous_loss(y, y_hat)
inference(x, w[, relaxed, return_energy]) Inference for x using parameters w.
initialize(X, Y)
joint_feature(x, y) Feature vector associated with instance (x, y).
loss(y, y_hat)
loss_augmented_inference(x, y, w[, relaxed, ...]) Loss-augmented Inference for x relative to y using parameters w.
max_loss(y)
__init__(n_states=None, n_features=None, inference_method=None, class_weight=None, directed=True)[source]
inference(x, w, relaxed=False, return_energy=False)

Inference for x using parameters w.

Finds (approximately) armin_y np.dot(w, joint_feature(x, y)) using self.inference_method.

Parameters:

x : tuple

Instance of a graph with unary evidence. x=(unaries, edges) unaries are an nd-array of shape (n_nodes, n_states), edges are an nd-array of shape (n_edges, 2)

w : ndarray, shape=(size_joint_feature,)

Parameters for the CRF energy function.

relaxed : bool, default=False

Whether relaxed inference should be performed. Only meaningful if inference method is ‘lp’ or ‘ad3’. By default fractional solutions are rounded. If relaxed=True, fractional solutions are returned directly.

return_energy : bool, default=False

Whether to return the energy of the solution (x, y) that was found.

Returns:

y_pred : ndarray or tuple

By default an inter ndarray of shape=(width, height) of variable assignments for x is returned. If relaxed=True and inference_method is lp or ad3, a tuple (unary_marginals, pairwise_marginals) containing the relaxed inference result is returned. unary marginals is an array of shape (width, height, n_states), pairwise_marginals is an array of shape (n_states, n_states) of accumulated pairwise marginals.

joint_feature(x, y)

Feature vector associated with instance (x, y).

Feature representation joint_feature, such that the energy of the configuration (x, y) and a weight vector w is given by np.dot(w, joint_feature(x, y)).

Parameters:

x : tuple

Unary evidence.

y : ndarray or tuple

Either y is an integral ndarray, giving a complete labeling for x. Or it is the result of a linear programming relaxation. In this case, y=(unary_marginals, pariwise_marginals).

Returns:

p : ndarray, shape (size_joint_feature,)

Feature vector associated with state (x, y).

loss_augmented_inference(x, y, w, relaxed=False, return_energy=False)

Loss-augmented Inference for x relative to y using parameters w.

Finds (approximately) armin_y_hat np.dot(w, joint_feature(x, y_hat)) + loss(y, y_hat) using self.inference_method.

Parameters:

x : tuple

Instance of a graph with unary evidence. x=(unaries, edges) unaries are an nd-array of shape (n_nodes, n_features), edges are an nd-array of shape (n_edges, 2)

y : ndarray, shape (n_nodes,)

Ground truth labeling relative to which the loss will be measured.

w : ndarray, shape=(size_joint_feature,)

Parameters for the CRF energy function.

relaxed : bool, default=False

Whether relaxed inference should be performed. Only meaningful if inference method is ‘lp’ or ‘ad3’. By default fractional solutions are rounded. If relaxed=True, fractional solutions are returned directly.

return_energy : bool, default=False

Whether to return the energy of the solution (x, y) that was found.

Returns:

y_pred : ndarray or tuple

By default an inter ndarray of shape=(n_nodes) of variable assignments for x is returned. If relaxed=True and inference_method is lp or ad3, a tuple (unary_marginals, pairwise_marginals) containing the relaxed inference result is returned. unary marginals is an array of shape (n_nodes, n_states), pairwise_marginals is an array of shape (n_states, n_states) of accumulated pairwise marginals.