INDEX
Explanations
instances of the word "alt" in various contexts
New Auto-Interp
Negative Logits
pleaſure
-0.81
sánchez
-0.72
Verſ
-0.71
itſelf
-0.69
LayoutPanel
-0.69
ſch
-0.69
ſche
-0.69
betweenstory
-0.69
bershka
-0.67
évaluateur
-0.66
POSITIVE LOGITS
&
0.52
if
0.50
فريبيس
0.47
"
0.45
6
0.44
C
0.43
3
0.43
1
0.42
de
0.42
0.41
Activations Density 0.035%