INDEX
Explanations
instances where procedures, policies, or instructions are emphasized or referenced
instances of the word "in" and its context within sentences
New Auto-Interp
Negative Logits
summed
-0.66
laughs
-0.64
%%
-0.62
pins
-0.61
grows
-0.60
CHA
-0.59
âĿ
-0.59
fame
-0.59
classmates
-0.59
alas
-0.58
POSITIVE LOGITS
lieu
1.48
accordance
1.45
conjunction
1.33
relation
1.22
appropriate
1.16
ordinate
1.15
regards
1.13
effic
1.11
favour
1.10
favor
1.10
Activations Density 0.400%