INDEX
Explanations
individuals with specific personal or professional attributes
references to time intervals and significant events in people's lives
New Auto-Interp
Negative Logits
"!
-0.65
tremend
-0.61
apest
-0.58
idden
-0.51
ãĥij
-0.50
ãĤ³
-0.49
Subtle
-0.49
Orig
-0.49
ãĤ¢ãĥ«
-0.48
nodd
-0.48
POSITIVE LOGITS
?,
0.96
Downloadha
0.83
,
0.82
,[
0.74
says
0.74
,
0.73
,...
0.72
believes
0.68
})
0.68
admits
0.68
Activations Density 0.495%