INDEX
Explanations
the word "so" when used emphatically to express a result or consequence
New Auto-Interp
Negative Logits
Hunters
-0.64
Lau
-0.63
aughed
-0.60
works
-0.60
Kang
-0.59
Concepts
-0.59
²
-0.58
Dreams
-0.58
Neighborhood
-0.58
Moving
-0.58
POSITIVE LOGITS
oner
1.03
anonymously
0.93
zin
0.82
cheaply
0.80
willingly
0.80
othe
0.79
voluntarily
0.77
legally
0.77
reluctantly
0.76
----------------------------------------------------------------
0.75
Activations Density 0.019%