Explanations
expressions of strong affection or appreciation toward someone or something
expressions of strong affection or appreciation
Negative Logits
trickle         -0.62
plur            -0.61
substituted     -0.59
Explore         -0.58
TRY             -0.54
randomized      -0.54
Somers          -0.54
Transcript      -0.54
transitional    -0.54
Agility         -0.53
Positive Logits
dearly          1.43
uncond          1.14
much            1.02
greatly         1.01
much            0.98
passionately    0.95
MUCH            0.93
deeply          0.86
immensely       0.84
more            0.84
Activations Density 0.173%