INDEX
Explanations
words related to proper nouns, specifically names of people
occurrences of the word "smol" and variations of it
New Auto-Interp
Negative Logits
Refuge
-0.68
Exit
-0.64
Cortex
-0.63
Triangle
-0.63
reservation
-0.61
ãĤµãĥ¼ãĥĨãĤ£ãĥ¯ãĥ³
-0.61
unfocusedRange
-0.60
dime
-0.59
tert
-0.59
Camer
-0.59
POSITIVE LOGITS
glers
0.94
rill
0.87
wear
0.83
vity
0.81
iland
0.81
chery
0.79
etsk
0.78
lein
0.75
cer
0.74
ety
0.74
Activations Density 0.070%