INDEX
Explanations
mentions of specific activities or events
New Auto-Interp
Negative Logits
ère
-0.88
bal
-0.80
si
-0.75
redo
-0.74
eka
-0.71
iere
-0.69
ifer
-0.67
distinguishing
-0.67
rez
-0.66
respond
-0.65
POSITIVE LOGITS
redients
1.09
tons
0.88
spree
0.79
bird
0.76
HAM
0.73
engagements
0.71
lass
0.71
AME
0.70
oneself
0.69
aids
0.69
Activations Density 3.510%