INDEX
Explanations
adjectives related to emotional expression
terms related to emotional expression and societal dynamics
Negative Logits
ammy        -0.72
azo         -0.70
onga        -0.67
INS         -0.66
ANS         -0.64
Italians    -0.64
Garry       -0.63
Accuracy    -0.63
Viet        -0.63
Khe         -0.62
Positive Logits
outward     1.13
inward      1.07
ly          0.83
ward        0.81
heastern    0.79
worldly     0.77
robe        0.76
comings     0.75
angular     0.74
selves      0.74
Activations Density: 0.005%