INDEX
Explanations
proper nouns, specifically names of people, places, and organizations
references to the letter "O" in various contexts
New Auto-Interp
Negative Logits
ãĥķ
-0.90
ãĥ¼ãĥĨãĤ£
-0.86
ãĥķãĤ©
-0.81
PID
-0.81
PLA
-0.76
ÑĮ
-0.75
unfocusedRange
-0.75
ãĤ¼ãĤ¦ãĤ¹
-0.74
ãĤµãĥ¼ãĥĨãĤ£ãĥ¯ãĥ³
-0.73
Wanted
-0.72
POSITIVE LOGITS
vernight
1.06
zzy
1.03
tto
1.03
OT
1.03
ugi
1.02
ogie
1.00
scill
0.99
lymp
0.98
culus
0.96
OM
0.96
Activations Density 0.018%