Explanations
instances of the character "l" in various forms
Head Attribution Weights
0:0.07
1:0.02
2:0.13
3:0.11
4:0.04
5:0.04
6:0.27
7:0.02
8:0.04
9:0.09
10:0.06
11:0.05
Negative Logits
neighb       -1.13
headlights   -1.12
Kitt         -1.10
Yo           -1.07
moth         -1.05
zu           -1.05
whe          -1.03
lightly      -1.03
uncond       -1.03
Jinn         -1.01
Positive Logits
rar          1.39
版           1.33
actual       1.31
actionDate   1.24
estic        1.22
anto         1.22
icio         1.22
urations     1.21
anton        1.19
PLEASE       1.18
Activations Density: 0.055%