'Quantifying Attention Flow in Transformers' 태그의 글 목록

Quantifying Attention Flow in Transformers 1

[Transformer] Quantifying Attention Flow in Transformers

Input Token의 상대적인 relevance에 따라 attention weight을 이용할때, attention weight, attention롤아웃 및 흐름을 고려해서 Input Token에 대한 attention 을 근사하는 2가지 방법을 제안하는 논문. 해당 방법은 정보의 흐름에 대한 보완적인 view를 제공하였음. 또한 오리지널 attention과 비교했을 때, ablation method 및 input gradient (두 방법 모두) 를 이용하여 얻은 Input Token의 importance score와 높은 상관관계를 생성했음을 보여주었음 $y=a_x$ 단수/복수 예측하는 verb number 예측 해당 task와 데이터셋은 arxiv.org/pdf/2005.00928.pdf

IT/Paper 2021.03.13

jin's blog

Endure

Fast R-CNN, They Are Features, TCAV, RL논문, smoothGrad, Never Give Up, Regularizing Trajectory Optimization with Denoising Autoencoders, Learning Directed Exploration Strategies, Interpretability Beyond Feature Attribution:Quantitative Testing with Concept Activation Vectors, R-CNN, vision transformer, Axiomatic Attribution for Deep Networks, intergrated gradient, Concept vector, XAI, Adversarial Examples Are Not Bugs, Paper리뷰, Quantifying Attention Flow in Transformers, CAV, Deconvolution Network,

Today :
Yesterday :

일	월	화	수	목	금	토
				1	2	3
4	5	6	7	8	9	10
11	12	13	14	15	16	17
18	19	20	21	22	23	24
25	26	27	28	29	30	31

Quantifying Attention Flow in Transformers 1

티스토리툴바