INDEX
Explanations
content related to historical figures and events
Negative Logits
vsp       -0.20
дина      -0.18
enheim    -0.17
ortho     -0.16
abi       -0.15
UMB       -0.15
ohana     -0.15
lus       -0.14
ulty      -0.14
Ñģел      -0.14
Positive Logits
vice      0.26
vice      0.23
proton    0.23
procur    0.23
Podesta   0.23
Vice      0.21
chamber   0.20
Chamber   0.19
vir       0.18
govern    0.18
Activations Density 0.042%
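The density figure above is not defined on this page. A minimal sketch of one common reading, assuming density means the fraction of sampled token positions on which the feature's activation is greater than zero, follows. The function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical and are not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds threshold.

    Assumption: "density" means the share of sampled token positions on
    which the feature fires at all; the dashboard may define it differently.
    """
    if not activations:
        raise ValueError("need at least one activation value")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 10,000 token positions, 4 of which are active.
sample = [0.0] * 9996 + [1.3, 0.7, 2.1, 0.4]
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.040%
```

Under this reading, a density of 0.042% would correspond to the feature firing on roughly 4 of every 10,000 sampled tokens.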