INDEX
Explanations
discourse on governance and accountability
New Auto-Interp
Negative Logits
majority
-0.19
amenti
-0.17
onis
-0.16
erts
-0.16
jh
-0.15
Series
-0.15
sb
-0.14
ôle
-0.14
series
-0.14
enas
-0.14
POSITIVE LOGITS
lots
0.16
Tide
0.16
otle
0.15
grou
0.15
academia
0.15
æĮĩæķ°
0.14
gest
0.14
goose
0.14
رÙħز
0.14
Contrib
0.14
Activations Density 0.114%