INDEX
Explanations
repeated appearances of the name "Art" or its variants
New Auto-Interp
Negative Logits
myſelf
-0.75
Theſe
-0.74
auffi
-0.73
Monfieur
-0.69
uſed
-0.68
pleaſure
-0.68
Efq
-0.68
LabelTagHelper
-0.67
Reſ
-0.66
Beſ
-0.66
POSITIVE LOGITS
Ar
3.92
Ar
3.59
ar
2.98
Ар
2.17
AR
2.00
Ар
1.94
ар
1.84
arc
1.52
Arc
1.50
Ark
1.46
Activations Density 0.056%