INDEX
Explanations
instances where someone is personally involved or mentioned
the word "personally."
New Auto-Interp
Negative Logits
iens
-0.85
Ends
-0.73
Tycoon
-0.70
Gates
-0.68
Landing
-0.66
Stall
-0.66
Weaver
-0.65
Conclusion
-0.65
xual
-0.64
Faster
-0.63
POSITIVE LOGITS
identifiable
1.24
ised
0.88
benefited
0.84
intervened
0.83
invested
0.83
speaking
0.82
minded
0.80
opposed
0.77
insulted
0.77
advising
0.76
Activations Density 0.012%