grl.agents

QGPOAgent

class grl.agents.QGPOAgent(config, model)[source]
Overview:

The agent for the QGPO algorithm.

Interface:

__init__, act

__init__(config, model)[source]
Overview:

Initialize the agent.

Parameters:
  • config (EasyDict) – The configuration.

  • model (Union[torch.nn.Module, torch.nn.ModuleDict]) – The model.

act(obs, return_as_torch_tensor=False)[source]
Overview:

Given an observation, return an action.

Parameters:
  • obs (Union[np.ndarray, torch.Tensor, Dict]) – The observation.

  • return_as_torch_tensor (bool) – Whether to return the action as a torch tensor.

Returns:

The action.

Return type:

action (Union[np.ndarray, torch.Tensor, Dict])
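A minimal sketch of the documented `act` contract, using a stand-in class and a toy linear model rather than the real QGPOAgent (its actual config schema and trained model are not shown here): by default the action comes back as a numpy array, and `return_as_torch_tensor=True` keeps it as a torch tensor.

```python
import numpy as np
import torch

class StandInQGPOAgent:
    """Stand-in mirroring the documented QGPOAgent interface (hypothetical)."""

    def __init__(self, config, model):
        self.config = config
        self.model = model

    def act(self, obs, return_as_torch_tensor=False):
        # Convert the observation to a tensor and run the policy model
        # without tracking gradients, as inference-time action selection.
        obs_t = torch.as_tensor(obs, dtype=torch.float32)
        with torch.no_grad():
            action = self.model(obs_t)
        return action if return_as_torch_tensor else action.numpy()

agent = StandInQGPOAgent(config={}, model=torch.nn.Linear(4, 2))
obs = np.zeros(4, dtype=np.float32)
print(type(agent.act(obs)))                               # numpy.ndarray
print(type(agent.act(obs, return_as_torch_tensor=True)))  # torch.Tensor
```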

SRPOAgent

class grl.agents.SRPOAgent(config, model)[source]
Overview:

The agent for the SRPO algorithm.

Interface:

__init__, act

__init__(config, model)[source]
Overview:

Initialize the agent.

Parameters:
  • config (EasyDict) – The configuration.

  • model (Union[torch.nn.Module, torch.nn.ModuleDict]) – The model.

act(obs, return_as_torch_tensor=False)[source]
Overview:

Given an observation, return an action.

Parameters:
  • obs (Union[np.ndarray, torch.Tensor, Dict]) – The observation.

  • return_as_torch_tensor (bool) – Whether to return the action as a torch tensor.

Returns:

The action.

Return type:

action (Union[np.ndarray, torch.Tensor, Dict])
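The agent is typically driven inside an environment rollout loop. The sketch below shows that pattern with a stub environment and a stand-in agent; the environment class and its step/reset signatures are assumptions for illustration, not part of grl.

```python
import numpy as np
import torch

class StubEnv:
    """Hypothetical minimal environment for illustrating the rollout loop."""

    def reset(self):
        return np.zeros(3, dtype=np.float32)

    def step(self, action):
        # Returns (next_obs, reward, done, info); terminates immediately.
        return np.zeros(3, dtype=np.float32), 0.0, True, {}

class StandInSRPOAgent:
    """Stand-in mirroring the documented SRPOAgent interface (hypothetical)."""

    def __init__(self, config, model):
        self.config, self.model = config, model

    def act(self, obs, return_as_torch_tensor=False):
        with torch.no_grad():
            action = self.model(torch.as_tensor(obs, dtype=torch.float32))
        return action if return_as_torch_tensor else action.numpy()

env = StubEnv()
agent = StandInSRPOAgent(config={}, model=torch.nn.Linear(3, 1))
obs, done, total_reward = env.reset(), False, 0.0
while not done:
    obs, reward, done, info = env.step(agent.act(obs))
    total_reward += reward
```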

GPAgent

class grl.agents.GPAgent(config, model)[source]
Overview:

The agent trained for generative policies. This class is designed to be used with the GMPGAlgorithm and GMPOAlgorithm.

Interface:

__init__, act

__init__(config, model)[source]
Overview:

Initialize the agent with the configuration and the model.

Parameters:
  • config (EasyDict) – The configuration.

  • model (Union[torch.nn.Module, torch.nn.ModuleDict]) – The model.

act(obs, return_as_torch_tensor=False)[source]
Overview:

Given an observation, return an action as a numpy array or a torch tensor.

Parameters:
  • obs (Union[np.ndarray, torch.Tensor, Dict]) – The observation.

  • return_as_torch_tensor (bool) – Whether to return the action as a torch tensor.

Returns:

The action.

Return type:

action (Union[np.ndarray, torch.Tensor, Dict])
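The documented `obs` type includes `Dict`, so a GPAgent-style `act` may receive a structured observation. The sketch below is a hypothetical stand-in that flattens a dict observation before calling the model; the real GPAgent's handling of dict inputs may differ.

```python
import numpy as np
import torch

class StandInGPAgent:
    """Stand-in mirroring the documented GPAgent interface (hypothetical)."""

    def __init__(self, config, model):
        self.config, self.model = config, model

    def act(self, obs, return_as_torch_tensor=False):
        if isinstance(obs, dict):
            # Flatten structured observations into a single vector
            # (illustrative choice, not the real grl behavior).
            obs = np.concatenate([np.ravel(v) for v in obs.values()])
        with torch.no_grad():
            action = self.model(torch.as_tensor(obs, dtype=torch.float32))
        return action if return_as_torch_tensor else action.numpy()

agent = StandInGPAgent(config={}, model=torch.nn.Linear(5, 2))
obs = {"position": np.zeros(3), "velocity": np.zeros(2)}
print(agent.act(obs).shape)  # (2,)
```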