INDEX
Explanations
names of individuals or locations with the letters "z" or "k"
occurrences of specific letters or characters in words
Negative Logits
priceless       -0.77
_-              -0.72
innocence       -0.64
paradise        -0.63
distraction     -0.60
Paradise        -0.60
wrath           -0.59
hate            -0.56
oppos           -0.56
supernatural    -0.55
Positive Logits
inski           1.05
anski           0.96
ynski           0.95
zynski          0.95
ety             0.91
owski           0.91
zb              0.87
owicz           0.87
oslav           0.85
ovsky           0.85
Activations Density: 0.124%