INDEX
Explanations
references to individuals in roles related to education or communication
New Auto-Interp
Negative Logits
ardy
-0.14
Swedish
-0.14
Baltic
-0.14
CrLf
-0.14
pied
-0.14
Jes
-0.14
qs
-0.13
kola
-0.13
enko
-0.13
TexCoord
-0.13
POSITIVE LOGITS
TV
0.25
TV
0.23
Attr
0.23
Attrs
0.22
-TV
0.22
Attr
0.20
tir
0.20
ND
0.19
tir
0.19
tv
0.18
Activations Density 0.004%