INDEX
Explanations
mentions of being impressed by something or someone
expressions of admiration or being impressed
Negative Logits
yl        -0.62
race      -0.61
BD        -0.60
="#       -0.59
access    -0.58
most      -0.58
Place     -0.57
Thirty    -0.56
Ban       -0.56
Theft     -0.56
Positive Logits
impressed     3.59
amazed        1.98
intrigued     1.92
impress       1.81
pleased       1.80
amused        1.78
surprised     1.70
dazz          1.65
fascinated    1.64
astonished    1.61
Activations Density: 0.007%