INDEX
Explanations
personal pronouns and words frequently associated with people
Negative Logits
and          -0.79
doing        -0.69
ChildIndex   -0.69
letting      -0.66
doing        -0.65
carrying     -0.65
speaking     -0.65
sharing      -0.65
showing      -0.65
driving      -0.64
Positive Logits
e            0.63
i            0.52
'            0.50
FileReader   0.50
EventArgs    0.50
ˈ            0.49
ever         0.49
ethe         0.48
’            0.48
et           0.47
Activations Density 12.798%