INDEX
Explanations
phrases about support, attention, and responses to political or social situations
Negative Logits
ovie        -0.65
igmat       -0.63
Tale        -0.61
sshd        -0.58
Schmidt     -0.57
apixel      -0.55
Ri          -0.55
lookout     -0.55
bey         -0.55
constant    -0.54
Positive Logits
fulness     0.97
flows       0.88
emanating   0.83
killers     0.82
abilities   0.81
ternity     0.79
from        0.78
worthy      0.77
elsewhere   0.77
shed        0.76
Activations Density 1.828%