Explanations
exaggerated statements
words related to exaggeration
Negative Logits
NetMessage    -0.74
spot          -0.70
aining        -0.70
washer        -0.69
sector        -0.66
fighters      -0.66
abiding       -0.65
âĸ¬âĸ¬        -0.65
Ķ             -0.65
avis          -0.63
Positive Logits
exagger       1.21
exaggeration  1.07
exaggerated   0.89
mble          0.85
inflated      0.81
overest       0.80
tales         0.79
distortions   0.79
guiActiveUn   0.79
distort       0.78
Activations Density: 0.033%
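
The density figure above is not defined on the page. A minimal sketch, assuming it means the fraction of sampled tokens on which the feature's activation is above zero; the function name `activation_density`, the `threshold` parameter, and the sample values are hypothetical and used only for illustration:

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of activations strictly above `threshold`.

    Assumption: "density" means how often the feature fires across
    a sample of tokens. The dashboard's exact definition is not given.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Made-up sample: the feature fires on 3 of 10,000 tokens.
sample = [0.0] * 9997 + [1.2, 0.4, 3.1]
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.033% would correspond to the feature firing on roughly 33 of every 100,000 sampled tokens.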