Explanations
repetitive phrases used for comparison or context in a text
occurrences of the word "same."
Negative Logits
;;;;        -0.72
*=-         -0.70
ËĪ          -0.69
ãĤ´ãĥ³      -0.68
urally      -0.68
export      -0.67
icum        -0.66
Khe         -0.65
rend        -0.64
their       -0.64
Positive Logits
thing       0.90
vein        0.88
applies     0.86
caveats     0.84
principle   0.84
kind        0.80
principles  0.79
reasoning   0.78
exact       0.77
fate        0.77
Activations Density: 0.042%