INDEX
Explanations
prominent figures or quotes from literature and historical references
Negative Logits
next         -0.15
opposite     -0.14
ương         -0.14
ounced       -0.13
which        -0.13
cola         -0.13
Clo          -0.13
situation    -0.13
ocol         -0.13
ianne        -0.13
Positive Logits
quoted       0.25
quoted       0.25
quote        0.22
quotes       0.19
Quotes       0.18
æijĺ         0.18
~-~-~-~-     0.18
-quote       0.18
speaking     0.17
via          0.17
Activations Density: 0.068%