INDEX
Explanations
references to individuals and their actions, particularly in the context of teamwork and personal responsibility
New Auto-Interp
Negative Logits
ape
-0.16
ReadStream
-0.16
STEM
-0.15
aber
-0.15
stem
-0.15
Wolfe
-0.15
Stem
-0.15
Stevenson
-0.15
çª
-0.15
alet
-0.15
POSITIVE LOGITS
vla
0.17
841
0.16
ocrin
0.15
è¼Ķ
0.14
orama
0.14
elda
0.14
Carr
0.14
itle
0.14
onal
0.13
ÎľÎij
0.13
Activations Density 0.006%