INDEX
Explanations
personal pronouns referring to actions or consequences
instances of the pronoun "it."
New Auto-Interp
Negative Logits
IDA
-0.65
Eighth
-0.63
Jaw
-0.62
IX
-0.57
Chair
-0.57
Missing
-0.56
Statement
-0.56
oided
-0.55
Major
-0.55
Reporting
-0.54
POSITIVE LOGITS
tends
1.42
becomes
1.40
varies
1.39
happens
1.36
depends
1.33
occurs
1.32
boils
1.31
slows
1.29
affects
1.28
takes
1.27
Activations Density 0.208%