INDEX
Explanations
emotional states conveyed through language
New Auto-Interp
Negative Logits
metic
-0.82
SPI
-0.78
societies
-0.75
mathemat
-0.72
democracy
-0.72
Gmail
-0.71
democracies
-0.70
Skydragon
-0.69
JPM
-0.69
deflation
-0.69
POSITIVE LOGITS
d
1.49
t
1.41
ve
1.29
ten
1.26
s
1.26
te
1.26
tre
1.25
re
1.20
c
1.20
sed
1.19
Activations Density 0.220%