INDEX
Explanations
instances where the word "really" is used intensively
New Auto-Interp
Negative Logits
antry
-0.75
tein
-0.70
ifully
-0.68
icipated
-0.68
raged
-0.64
theless
-0.63
iously
-0.62
illary
-0.61
ently
-0.60
newsletters
-0.60
POSITIVE LOGITS
bother
0.79
FTWARE
0.78
messed
0.78
bothering
0.75
hin
0.74
liked
0.73
appreciated
0.73
darn
0.72
appreciate
0.72
wanna
0.71
Activations Density 0.927%