INDEX
Explanations
specific instances of the word "the" within sentences
instances of the word "the."
New Auto-Interp
Negative Logits
CHAPTER
-0.79
thood
-0.77
EVA
-0.73
ictionary
-0.72
dinand
-0.71
kef
-0.69
alus
-0.68
lehem
-0.68
much
-0.67
renheit
-0.67
POSITIVE LOGITS
circumstances
0.93
desired
0.89
goal
0.89
opponent
0.86
slightest
0.84
conditions
0.84
situation
0.83
buyer
0.83
interviewer
0.83
criteria
0.83
Activations Density 0.119%