INDEX
Explanations
phrases related to actions and descriptions of characters in a story
phrases related to emotional states and social interactions
New Auto-Interp
Negative Logits
scholarship
-0.75
itutional
-0.75
internationally
-0.73
Semitic
-0.71
attribution
-0.70
Sources
-0.70
biography
-0.70
endorsement
-0.69
definition
-0.67
Scholarship
-0.67
POSITIVE LOGITS
gigg
1.10
stroll
1.09
chuck
1.08
smir
1.06
grabbed
1.05
Luckily
1.03
calmly
1.01
grinned
1.00
chuckled
0.99
luckily
0.98
Activations Density 0.740%