INDEX
Explanations
specific locations or settings
occurrences of the word "here," indicating a focus on local references or context
New Auto-Interp
Negative Logits
eer
-0.65
Thrones
-0.61
Decay
-0.59
IRC
-0.58
Spending
-0.57
Kaiser
-0.54
Meter
-0.54
CHAT
-0.54
ahime
-0.54
Shards
-0.53
POSITIVE LOGITS
tics
1.91
tical
1.82
abouts
1.64
tic
1.51
to
0.97
with
0.96
upon
0.88
ina
0.83
after
0.79
from
0.76
Activations Density 0.057%