INDEX
Explanations
from_pretrained transformers
New Auto-Interp
Negative Logits
saucer
0.67
stressed
0.67
Kinect
0.67
Nilsson
0.65
Kona
0.63
గ్రహ
0.63
stressing
0.63
ດ້
0.62
amyg
0.62
KJ
0.62
POSITIVE LOGITS
Hug
1.34
hugging
1.26
transformers
1.24
transformers
1.23
🤗
1.22
Transformers
1.20
🤗
1.11
Transformers
1.10
tokenizer
1.05
tokenizer
0.99
Activations Density 1.154%