INDEX
Explanations
references to surnames and last names
New Auto-Interp
Negative Logits
igon
-0.78
cko
-0.74
seys
-0.72
Plex
-0.70
ifax
-0.69
kowski
-0.69
airo
-0.67
ipeg
-0.67
essee
-0.67
zin
-0.66
POSITIVE LOGITS
scenes
0.72
�
0.69
scene
0.68
い
0.66
eries
0.65
Hud
0.65
Offense
0.64
liberty
0.63
Narr
0.60
racket
0.60
Activations Density 1.174%