INDEX
Explanations
words related to political figures or events
occurrences of the substring "isl" in various contexts
New Auto-Interp
Negative Logits
ModLoader
-0.77
lishing
-0.71
notes
-0.70
cffffcc
-0.69
nces
-0.67
ccoli
-0.67
LIFE
-0.65
ilant
-0.64
ritic
-0.63
block
-0.63
POSITIVE LOGITS
ipeg
0.96
ature
0.92
owsky
0.88
uggage
0.88
atures
0.87
ifter
0.86
akes
0.85
ative
0.84
iquid
0.83
oud
0.82
Activations Density 0.020%