INDEX
Explanations
references to education, community support structures, and the roles of different sectors in society
New Auto-Interp
Negative Logits
efe
-0.15
disadv
-0.14
408
-0.14
Overrides
-0.14
avanaugh
-0.14
.Override
-0.13
tainment
-0.13
ButtonDown
-0.13
mez
-0.13
qb
-0.13
POSITIVE LOGITS
plays
0.45
role
0.40
importance
0.40
play
0.39
Plays
0.38
played
0.37
plays
0.36
Role
0.34
play
0.33
role
0.33
Activations Density 0.366%