INDEX
Explanations
repeated mentions of "of" in various contexts within the text
Head Attr Weights
0:0.01
1:0.01
2:0.11
3:0.05
4:0.12
5:0.02
6:0.03
7:0.39
8:0.03
9:0.02
10:0.09
11:0.06
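As a reading aid, the head attribution weights listed above can be ranked to see which attention head dominates. The following is a minimal sketch: the values are copied from the listing, the variable names are illustrative and not part of the dashboard, and nothing is assumed about how the weights were computed or normalized.

```python
# Head attribution weights copied from the listing above (head index -> weight).
head_attr_weights = {
    0: 0.01, 1: 0.01, 2: 0.11, 3: 0.05, 4: 0.12, 5: 0.02,
    6: 0.03, 7: 0.39, 8: 0.03, 9: 0.02, 10: 0.09, 11: 0.06,
}

# Rank heads from largest to smallest weight.
ranked = sorted(head_attr_weights.items(), key=lambda kv: kv[1], reverse=True)
top_head, top_weight = ranked[0]

# Sum of the listed (rounded) weights; rounding may keep this from being exactly 1.
total = sum(head_attr_weights.values())

print(f"top head: {top_head} (weight {top_weight:.2f})")
print(f"sum of listed weights: {total:.2f}")
```

Head 7 ranks first at 0.39, followed by heads 4 and 2 at 0.12 and 0.11.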
Negative Logits
scraps    -1.79
龍喚士    -1.76
ikes      -1.60
Crash     -1.46
fulness   -1.44
icc       -1.43
legates   -1.41
cffff     -1.35
verages   -1.34
ike       -1.33
Positive Logits
Pandora       1.71
Oracle        1.57
izon          1.52
anto          1.44
libel         1.44
AT            1.35
municip       1.34
doors         1.34
ranch         1.33
competition   1.33
Activations Density 0.007%