Deep Learning | 我的...

Observations on TensorRT's EntropyCalibrator

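Quantization of deep neural networks (DNNs) mainly refers to converting a model's weights and activations from single-precision floating point (FP32) to 8-bit integers (INT8). This conversion is expressed using a floating-point scale $s$ and an integer zero point $z$. In this article...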
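As a rough illustration of the scale and zero-point mapping mentioned in this excerpt, here is a minimal sketch in plain Python. It assumes the common affine convention $x \approx s\,(q - z)$ with a signed INT8 range of [-128, 127]. The function names and the min/max range selection are illustrative assumptions, not taken from the post or from TensorRT's API.

```python
def affine_params(x_min, x_max, qmin=-128, qmax=127):
    """Pick a scale s and zero point z so that [x_min, x_max] maps onto [qmin, qmax].

    Assumption: plain min/max range selection, used here only for illustration.
    """
    s = (x_max - x_min) / (qmax - qmin)
    z = round(qmin - x_min / s)
    return s, z


def quantize(x, s, z, qmin=-128, qmax=127):
    """Map a float x to an integer code: q = clip(round(x / s) + z)."""
    q = round(x / s) + z
    return max(qmin, min(qmax, q))


def dequantize(q, s, z):
    """Recover an approximate float value: x ~= s * (q - z)."""
    return s * (q - z)


if __name__ == "__main__":
    s, z = affine_params(0.0, 2.55)
    for x in (0.0, 1.0, 2.55, 10.0):
        q = quantize(x, s, z)
        print(x, q, dequantize(q, s, z))
```

Values outside the chosen range are clipped to the nearest representable code, and values inside it come back with an error of at most about half a scale step.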

[Reading] Training data-efficient image transformers & distillation through attention

These are my notes from reading the DeiT paper. Many ViT studies follow the DeiT training scheme. The paper was mentioned in ShiftViT [1], which I read recently, so I had been meaning to read it properly. Bibliographic information: @misc{touvron2021training, title={Training data-efficient image transformers & distillation through attention}, author={Hugo Touvron and Matthieu Cord and Matthijs Douze

"Deep Learning"
