INDEX
Explanations
patterns of high confidence or intensity, specifically related to the word "mu," which often indicates a measurement, strength, or a high-value feature
New Auto-Interp
Negative Logits
O
-0.56
U
-0.56
G
-0.56
μ
-0.56
V
-0.55
I
-0.54
J
-0.54
X
-0.52
Humphrey
-0.52
T
-0.51
POSITIVE LOGITS
Monfieur
1.03
Efq
1.02
myſelf
0.93
itſelf
0.89
auffi
0.88
purpoſe
0.86
Jefus
0.84
Anſ
0.84
Diſ
0.83
ainfi
0.83
Activations Density 0.071%