INDEX
Explanations
quotes spoken by individuals
instances of the word "told" in reported speech
New Auto-Interp
Negative Logits
ILCS
-0.77
reci
-0.71
racted
-0.70
icons
-0.68
spir
-0.67
untled
-0.65
MIT
-0.64
immune
-0.63
tan
-0.61
osterone
-0.61
POSITIVE LOGITS
tale
0.98
ingly
0.83
him
0.77
me
0.76
tell
0.76
IPS
0.76
reporters
0.75
us
0.75
eret
0.74
mares
0.73
Activations Density 0.060%