INDEX
Explanations
references to "status" and related indicators of condition or state
New Auto-Interp
Negative Logits
to
-0.15
age
-0.14
pel
-0.14
ucha
-0.14
lot
-0.14
erv
-0.14
ICI
-0.14
ewe
-0.14
tober
-0.14
inston
-0.14
POSITIVE LOGITS
quo
0.36
getStatus
0.20
=status
0.20
ses
0.19
sing
0.17
(Status
0.17
ler
0.17
craft
0.17
eldorf
0.17
utory
0.17
Activations Density 0.027%