INDEX
Explanations
the occurrence of the word "only."
New Auto-Interp
Head Attr Weights
0:0.08
1:0.06
2:0.08
3:0.07
4:0.07
5:0.08
6:0.08
7:0.07
8:0.09
9:0.08
10:0.07
11:0.10
Negative Logits
mute
-3.20
crow
-3.08
Mandarin
-3.03
Yel
-2.73
Monk
-2.72
Rahman
-2.71
audible
-2.63
hawk
-2.60
subscrib
-2.59
bub
-2.59
POSITIVE LOGITS
idth
3.21
anton
3.01
toc
2.99
[/
2.97
groups
2.85
arget
2.72
breeding
2.65
abor
2.65
IDA
2.65
Compact
2.59
Activations Density 0.000%