SmoothL1Loss
-
class torch.nn.SmoothL1Loss(size_average=None, reduce=None, reduction='mean', beta=1.0)[source] -
Creates a criterion that uses a squared term if the absolute element-wise error falls below beta and an L1 term otherwise. It is less sensitive to outliers than the
torch.nn.MSELossand in some cases prevents exploding gradients (e.g. seeFast R-CNNpaper by Ross Girshick). Omitting a scaling factor ofbeta, this loss is also known as the Huber loss:where is given by:
and arbitrary shapes with a total of elements each the sum operation still operates over all the elements, and divides by .
betais an optional parameter that defaults to 1.Note: When
betais set to 0, this is equivalent toL1Loss. Passing a negative value in forbetawill result in an exception.The division by can be avoided if sets
reduction = 'sum'.- Parameters
-
-
size_average (bool, optional) – Deprecated (see
reduction). By default, the losses are averaged over each loss element in the batch. Note that for some losses, there are multiple elements per sample. If the fieldsize_averageis set toFalse, the losses are instead summed for each minibatch. Ignored whenreduceisFalse. Default:True -
reduce (bool, optional) – Deprecated (see
reduction). By default, the losses are averaged or summed over observations for each minibatch depending onsize_average. WhenreduceisFalse, returns a loss per batch element instead and ignoressize_average. Default:True -
reduction (string, optional) – Specifies the reduction to apply to the output:
'none'|'mean'|'sum'.'none': no reduction will be applied,'mean': the sum of the output will be divided by the number of elements in the output,'sum': the output will be summed. Note:size_averageandreduceare in the process of being deprecated, and in the meantime, specifying either of those two args will overridereduction. Default:'mean' - beta (float, optional) – Specifies the threshold at which to change between L1 and L2 loss. This value defaults to 1.0.
-
size_average (bool, optional) – Deprecated (see
- Shape:
-
- Input: where means, any number of additional dimensions
- Target: , same shape as the input
- Output: scalar. If
reductionis'none', then , same shape as the input
© 2019 Torch Contributors
Licensed under the 3-clause BSD License.
https://pytorch.org/docs/1.8.0/generated/torch.nn.SmoothL1Loss.html