INDEX
Explanations
instances of the word "Islam" at various strengths
references to a specific individual or name, particularly related to "Stanislav."
New Auto-Interp
Negative Logits
ModLoader
-0.85
lishing
-0.72
rule
-0.66
notes
-0.65
LIFE
-0.64
stration
-0.63
Solitaire
-0.63
block
-0.62
perty
-0.62
nces
-0.62
POSITIVE LOGITS
ocate
0.94
ipeg
0.93
ative
0.92
apse
0.86
umber
0.86
owsky
0.85
ocated
0.84
uggage
0.84
avery
0.84
iquid
0.84
Activations Density 0.026%