INDEX
Explanations
references to educational and gender equality topics
New Auto-Interp
Negative Logits
Include
-0.16
include
-0.16
Uses
-0.14
Includes
-0.14
Include
-0.14
include
-0.14
exclude
-0.14
INCLUDE
-0.14
jeme
-0.13
ëŀij
-0.13
POSITIVE LOGITS
plays
0.42
played
0.40
play
0.39
plays
0.32
played
0.31
playa
0.31
Plays
0.30
play
0.30
constitutes
0.28
ranks
0.28
Activations Density 0.501%