INDEX
Explanations
phrases indicating a deep understanding or serious consideration of a concept or idea
occurrences of the word "taken" in various contexts
New Auto-Interp
Negative Logits
eers
-0.64
reinforcement
-0.63
Zucker
-0.62
glers
-0.59
SPD
-0.58
icing
-0.58
Split
-0.57
FTWARE
-0.57
ileaks
-0.57
Machina
-0.56
POSITIVE LOGITS
aback
1.56
advantage
1.21
care
1.14
seriously
1.08
hostage
1.05
orally
0.94
Seriously
0.90
apart
0.90
care
0.85
aways
0.84
Activations Density 0.039%