INDEX
Explanations
proper names of people
Negative Logits
*=-          -0.75
mercial      -0.68
eneg         -0.66
ModLoader    -0.64
Olympus      -0.61
disemb       -0.59
Anonymous    -0.59
letal        -0.59
LET          -0.59
JPEG         -0.58
Positive Logits
ocide        1.48
esis         1.35
uine         1.16
iuses        1.15
furt         1.02
etics        0.96
hardt        0.93
ius          0.92
uin          0.91
heimer       0.90
Activations Density 0.018%