INDEX
Explanations
direct quotes and statements made by individuals
Negative Logits
ãĥIJãĥ¼    -0.14
unya      -0.14
ially     -0.13
zbo       -0.13
itan      -0.13
ÅĤy       -0.13
Lyons     -0.13
ovaly     -0.13
kas       -0.13
askan     -0.13
Positive Logits
mach        0.15
invalidate  0.14
erland      0.14
ÄĽn         0.14
ey          0.14
åĭ          0.14
eker        0.14
elu         0.14
IDGET       0.13
<           0.13
Activations Density 0.057%