INDEX
Explanations
mentions of the filmmaker "Kojima"
occurrences of the name "Wojak" or similar variations
New Auto-Interp
Negative Logits
ples
-0.66
rade
-0.64
bleach
-0.64
Dominion
-0.64
esville
-0.63
mingham
-0.60
raped
-0.60
idy
-0.59
Solitaire
-0.59
pled
-0.58
POSITIVE LOGITS
ansky
0.80
owski
0.77
ÅĤ
0.71
jan
0.71
cies
0.70
inski
0.68
anski
0.68
zl
0.66
zzi
0.64
owicz
0.63
Activations Density 0.066%