Explanations
instances of the word "personally" and its variations
Negative Logits
Token        Logit
Definitions  -0.76
eland        -0.66
Mant         -0.65
period       -0.64
Ends         -0.63
iens         -0.63
Dispatch     -0.62
Deadline     -0.62
DAY          -0.62
ENCY         -0.62
Positive Logits
Token         Logit
identifiable  1.07
intervened    0.93
benefited     0.92
apologized    0.91
ised          0.91
vou           0.90
insulted      0.89
thanked       0.86
offended      0.84
owned         0.83
Activations Density: 0.008%