torchdrivesim.rendering
#
Renderers used to visualize the state of the environments. Currently two backends are supported (pytorch3d and nvdiffrast), along with a dummy renderer generating black images.
Submodules#
Package Contents#
Classes#
Determines behavior of the renderer. |
|
For DummyRenderer. |
|
A renderer producing simple 2D birdview images based on static background meshes and rectangular agents. |
|
Produces a black image of the required size. Mostly used for debugging and benchmarking. |
|
Configuration of pytorch3d-based renderer. |
|
Renderer based on pytorch3d, using an orthographic projection and a trivial shader. |
|
Configuration of nvdiffrast-based renderer. |
|
Similar to PyTorch3DRenderer, and producing indistinguishable images, but sometimes faster. |
Functions#
|
Construct the selected renderer from config, by default using Pytorch3DRenderer. |
Attributes#
- class torchdrivesim.rendering.RendererConfig[source]#
Determines behavior of the renderer. Subclasses determine renderer class used.
- backend: str = 'default'#
- render_agent_direction: bool = True#
- left_handed_coordinates: bool = False#
- highlight_ego_vehicle: bool = False#
- class torchdrivesim.rendering.DummyRendererConfig[source]#
Bases:
RendererConfig
For DummyRenderer.
- backend: str = 'dummy'#
- class torchdrivesim.rendering.BirdviewRenderer(cfg: RendererConfig, device: torch.device | None = None, batch_size: int | None = None, static_mesh: torchdrivesim.mesh.BirdviewMesh | None = None, world_center: torch.Tensor | None = None, color_map: Dict[str, Tuple[int, int, int]] | None = None, rendering_levels: Dict[str, float] | None = None, res: torchdrivesim.utils.Resolution = Resolution(64, 64), fov: float = 35)[source]#
Bases:
abc.ABC
A renderer producing simple 2D birdview images based on static background meshes and rectangular agents. Currently only square resolutions are supported. The renderer always operates in batch mode, with a single batch dimension on the left.
- Parameters:
cfg – configuration object, usually subclassed
device – torch device used for rendering
batch_size – if road_mesh is not specified, this is used to determine batch size
static_mesh – BirdviewMesh object specifying drivable surface (empty mesh is used if not provided)
world_center – Bx2 float tensor, defaults to geometric centre of the road mesh
color_map – a dictionary of RGB tuples in 0-255 range specifying colors of different rendered elements
res – default resolution
- add_static_meshes(meshes: List[torchdrivesim.mesh.BirdviewMesh]) None [source]#
Includes additional static elements to render.
- expand(n: int)[source]#
Adds another dimension with size n on the right of the batch dimension and flattens them. Returns a new renderer, without modifying the current one.
- select_batch_elements(idx: torch.Tensor)[source]#
Selects given elements from the batch, potentially with repetitions. Returns a new renderer, without modifying the current one.
- Parameters:
idx – one-dimensional integer tensor
- render_static_meshes(camera_xy: torch.Tensor | None = None, camera_sc: torch.Tensor | None = None, res: torchdrivesim.utils.Resolution = None, fov: float = None) torch.Tensor [source]#
Render a single birdview of the static mesh only. Nc is the number of cameras. C=3 is the number of RGB channels.
- Parameters:
camera_xy – Ncx2 tensor of camera positions
camera_sc – Ncx2 tensor of camera orientations (sine and cosine of yaw angle)
res – Resolution, currently only square resolutions are supported
fov – Field of view in meters
- Returns:
birdview image tensor of shape NcxHxWxC
- transform(points: torch.Tensor, pose: torch.Tensor) torch.Tensor [source]#
Given points relative to a pose, produce absolute positions of the points. There can be zero or more batch dimensions.
- Parameters:
points – BxNx2 tensor
pose – Bx3 tensor of position (x,y) and orientation (yaw angle in radians)
- Returns:
Bx2 tensor of absolute positions
- make_actor_mesh(agent_state: Dict[str, torch.Tensor], agent_attributes: Dict[str, torch.Tensor]) torchdrivesim.mesh.BirdviewMesh [source]#
Creates a mesh representing given actors. Each vertex and each face corresponds to a unique agent and both vertices and faces for each agent are continuous in the resulting tensor to allow for subsequent masking. For each agent there are seven vertices and three faces, specifying its bounding box and direction. Direction vertices use the ‘direction’ category, while agent categories are copied from input dictionaries.
- render_frame(agent_state: Dict[str, torch.Tensor], agent_attributes: Dict[str, torch.Tensor], camera_xy: torch.Tensor | None = None, camera_sc: torch.Tensor | None = None, rendering_mask: Dict[str, torch.Tensor] = None, res: torchdrivesim.utils.Resolution | None = None, traffic_controls: Dict[str, torchdrivesim.traffic_controls.BaseTrafficControl] | None = None, fov: float | None = None, waypoints: torch.Tensor | None = None, waypoints_rendering_mask: torch.Tensor | None = None) torch.Tensor [source]#
Renders the agents and traffic controls on top of the static mesh. Cameras batch size is (B*Nc), which corresponds to using Nc cameras per batch element. This extra dimension is added on the right of batch dimension and flattened, to match the semantics of extend from pytorch3d. If cameras is None, one egocentric camera per agent is used, that is Nc = A.
- Parameters:
agent_state – maps agent types to state tensors of shape BxAxSt, where St >= 3 and the first three components are x coordinate, y coordinate, and orientation in radians
agent_attributes – maps agent types to static attributes tensors of shape BxAxAttr, where Attr >= 2 and the first two components are length and width of the agent
camera_xy – BxNcx2 tensor of camera positions, by default one camera placed on each agent
camera_sc – BxNcx2 tensor of camera orientations (sine and cosine), by default matching agent orientations
rendering_mask – BxNcxA tensor per agent type, indicating which cameras see which agents
res – resolution HxW of the resulting image, currently only square resolutions are supported
traffic_controls – traffic controls by type (traffic-light, yield, etc.)
fov – Field of view in meters
waypoints – BxNcxMx2 tensor of M waypoints per camera (x,y)
waypoints_rendering_mask – BxNcxM tensor of M waypoint masks per camera, indicating which waypoints should be rendered
- Returns:
tensor image of float RGB values in [0,255] range with shape shape (B*Nc)xAxCxHxW
- abstract render_mesh(mesh: torchdrivesim.mesh.BirdviewMesh, res: torchdrivesim.utils.Resolution, cameras: pytorch3d.renderer.FoVOrthographicCameras) torch.Tensor [source]#
Renders a given mesh, producing BxHxWxC tensor image of float RGB values in [0,255] range.
- construct_cameras(xy: torch.Tensor, sc: torch.Tensor, scale: float | None = None) pytorch3d.renderer.FoVOrthographicCameras [source]#
Create PyTorch3D cameras object for given positions and orientations. Input tensor dimensions should be Bx2.
- build_verts_faces_from_bounding_box(bbs: torch.Tensor, z: float = 2) Tuple[torch.Tensor, torch.Tensor] [source]#
Triangulates actors for rendering. Input is a tensor of bounding boxes of shape …xAx4x2, where A is the number of actors. Outputs are shaped …x4*Ax2 and …x2*Ax3 respectively.
- make_traffic_controls_mesh(traffic_controls: Dict[str, torchdrivesim.traffic_controls.BaseTrafficControl]) torchdrivesim.mesh.BirdviewMesh [source]#
Create a mesh showing traffic controls.
- make_direction_mesh(lenwid: torch.Tensor, pose: torch.Tensor, size: float = 0.3) torchdrivesim.mesh.BaseMesh [source]#
Create a mesh indicating the direction of each agent.
- Parameters:
lenwid – BxAx2 tensor specifying length and width of the agents
pose – Bx3 tensor of position (x,y) and orientation (yaw angle in radians)
size – determines the size of the triangle indicating direction
- make_waypoint_mesh(waypoints: torch.Tensor, radius: float = 2.0, num_triangles: int = 10) torchdrivesim.mesh.BirdviewMesh [source]#
Create a mesh of the given waypoints.
- Parameters:
waypoints – BxNcxMx3 tensor of M waypoints per camera (x,y,psi)
radius – float radius of the disc
num_triangles – int number of triangles used for the disc
- class torchdrivesim.rendering.DummyRenderer(cfg: RendererConfig, device: torch.device | None = None, batch_size: int | None = None, static_mesh: torchdrivesim.mesh.BirdviewMesh | None = None, world_center: torch.Tensor | None = None, color_map: Dict[str, Tuple[int, int, int]] | None = None, rendering_levels: Dict[str, float] | None = None, res: torchdrivesim.utils.Resolution = Resolution(64, 64), fov: float = 35)[source]#
Bases:
BirdviewRenderer
Produces a black image of the required size. Mostly used for debugging and benchmarking.
- render_mesh(mesh: torchdrivesim.mesh.BirdviewMesh, res: torchdrivesim.utils.Resolution, cameras: pytorch3d.renderer.FoVOrthographicCameras) torch.Tensor [source]#
Renders a given mesh, producing BxHxWxC tensor image of float RGB values in [0,255] range.
- class torchdrivesim.rendering.Pytorch3DRendererConfig[source]#
Bases:
torchdrivesim.rendering.base.RendererConfig
Configuration of pytorch3d-based renderer.
- backend: str = 'pytorch3d'#
- differentiable_rendering: RenderingBlend#
- class torchdrivesim.rendering.Pytorch3DRenderer(cfg: Pytorch3DRendererConfig, *args, **kwargs)[source]#
Bases:
torchdrivesim.rendering.base.BirdviewRenderer
Renderer based on pytorch3d, using an orthographic projection and a trivial shader. Works on both GPU and CPU, but CPU is very slow.
- render_mesh(mesh: torchdrivesim.mesh.BirdviewMesh, res: torchdrivesim.utils.Resolution, cameras: pytorch3d.renderer.FoVOrthographicCameras) torch.Tensor [source]#
Renders a given mesh, producing BxHxWxC tensor image of float RGB values in [0,255] range.
- class torchdrivesim.rendering.NvdiffrastRendererConfig[source]#
Bases:
torchdrivesim.rendering.base.RendererConfig
Configuration of nvdiffrast-based renderer.
- backend: str = 'nvdiffrast'#
- antialias: bool = False#
- opengl: bool = True#
- max_minibatch_size: int | None#
- class torchdrivesim.rendering.NvdiffrastRenderer(cfg: NvdiffrastRendererConfig, *args, **kwargs)[source]#
Bases:
torchdrivesim.rendering.base.BirdviewRenderer
Similar to PyTorch3DRenderer, and producing indistinguishable images, but sometimes faster. Note that nvdiffrast requires separate installation and is subject to its own license terms.
- render_mesh(mesh: torchdrivesim.mesh.BirdviewMesh, res: torchdrivesim.utils.Resolution, cameras: pytorch3d.renderer.FoVOrthographicCameras) torch.Tensor [source]#
Renders a given mesh, producing BxHxWxC tensor image of float RGB values in [0,255] range.
- torchdrivesim.rendering.renderer_from_config(cfg: base.RendererConfig, *args, **kwargs) base.BirdviewRenderer [source]#
Construct the selected renderer from config, by default using Pytorch3DRenderer. Additional arguments are passed to the constructor.