INDEX
Explanations
references to percentages
New Auto-Interp
Negative Logits
oner
-0.18
vik
-0.16
erno
-0.16
adder
-0.15
sb
-0.15
nap
-0.15
erator
-0.15
colspan
-0.14
ENE
-0.14
sville
-0.14
POSITIVE LOGITS
cent
0.53
-cent
0.41
Cent
0.39
cent
0.37
.cent
0.37
Cent
0.36
_cent
0.33
CENT
0.31
cents
0.31
CENT
0.28
Activations Density 0.006%