INDEX
Explanations
words related to significant impact or importance
references to the concept of "role" in various contexts
New Auto-Interp
Negative Logits
Eyes
-0.61
affles
-0.59
Junk
-0.59
hound
-0.58
ocaust
-0.57
trap
-0.57
apest
-0.55
Flesh
-0.55
urses
-0.55
Results
-0.54
POSITIVE LOGITS
playing
0.95
in
0.95
therein
0.94
role
0.87
facilitating
0.81
roles
0.80
role
0.79
overseeing
0.78
helping
0.75
influencing
0.75
Activations Density 0.061%