INDEX
Explanations
instances of the word "only"
New Auto-Interp
Head Attr Weights
0:0.09
1:0.08
2:0.06
3:0.08
4:0.08
5:0.08
6:0.07
7:0.08
8:0.08
9:0.08
10:0.09
11:0.06
Negative Logits
Ankara
-3.27
Stories
-3.09
Kazakh
-3.00
Unreal
-2.98
Athe
-2.90
Azerbai
-2.87
Berserker
-2.87
Sham
-2.85
Isis
-2.85
Submission
-2.82
POSITIVE LOGITS
retire
3.20
erning
3.15
cellar
3.14
retirees
3.04
leases
2.98
lux
2.91
osit
2.80
wines
2.79
untled
2.75
laborers
2.74
Activations Density 0.000%