INDEX
Explanations
repeated uses of the word "great" in various contexts and expressions of significance
Negative Logits
Token      Logit
trace      -0.16
ÅĦ         -0.15
alen       -0.15
/topics    -0.15
ian        -0.14
ä¿         -0.14
isch       -0.14
Effects    -0.14
opro       -0.14
Effects    -0.14
Positive Logits
Token      Logit
-grand     0.24
atsby      0.19
eur        0.18
(est       0.18
emale      0.17
big        0.15
wheel      0.15
-hearted   0.14
iosity     0.14
wheel      0.14
Activations Density: 0.045%