QConvInt8 Class — pytorch Architecture
Architecture documentation for the QConvInt8 class in Conv.cpp from the pytorch codebase.
Entity Profile
Source Code
aten/src/ATen/native/quantized/cudnn/Conv.cpp lines 368–384
class QConvInt8 final {
public:
static at::Tensor run(
at::Tensor act,
const c10::intrusive_ptr<ConvPackedParamsBase<kSpatialDim>>& packed_weight,
double output_scale,
int64_t output_zero_point) {
TORCH_CHECK(kSpatialDim == 1 || kSpatialDim == 2, "Error in quantized cudnn conv2d operator: "
"Expected kSpatialDim == 1 || kSpatialDim == 2; received kSpatialDim=", kSpatialDim);
// TODO: check all zero_points are zero/all tensors are symmetrically quantized
if (kReluFused) {
return packed_weight->apply_relu(act, output_scale, output_zero_point);
} else {
return packed_weight->apply(act, output_scale, output_zero_point);
}
}
};
Source
Analyze Your Own Codebase
Get architecture documentation, dependency graphs, and domain analysis for your codebase in minutes.
Try Supermodel Free