INDEX
Explanations
the word "rit" with varying activation values
occurrences of the term "Pritchard."
New Auto-Interp
Negative Logits
¶ħ
-0.85
ĨĴ
-0.82
©¶æ
-0.75
ĻĤ
-0.75
«ĺ
-0.74
ŃĶ
-0.71
instantaneous
-0.68
é¾
-0.67
¥ŀ
-0.65
Merit
-0.62
POSITIVE LOGITS
rit
1.10
ual
1.04
chard
0.90
ravel
0.89
krit
0.89
igi
0.83
ually
0.82
ika
0.81
sis
0.81
ions
0.80
Activations Density 0.007%