Explanations
phrases related to consistent or routine actions
the frequency of actions or events occurring on a regular basis
Negative Logits
Token         Logit
Geh           -0.70
Mara          -0.69
Warden        -0.69
Millennium    -0.68
holes         -0.67
Warriors      -0.67
elo           -0.67
Lore          -0.67
lad           -0.66
Way           -0.65
Positive Logits
Token         Logit
spaced        1.05
theless       0.91
encountered   0.86
monitored     0.85
pract         0.84
consulted     0.84
entimes       0.84
encount       0.83
Asked         0.82
employed      0.81
Activations Density: 0.017%
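
The page does not say how the density figure is computed. A common reading, assumed here, is the fraction of sampled tokens on which the feature has a nonzero activation, so 0.017% would correspond to roughly 17 active tokens per 100,000. The sketch below illustrates that reading only. The function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical and do not come from the source.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: "density" means the share of tokens on which the feature fires.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 17 active tokens out of 100,000 (not real data).
sample = [0.0] * 99_983 + [1.2] * 17
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.017%
```

Under this assumption, a density this low means the feature is sparse: it fires on only a small fraction of tokens.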