INDEX
Explanations
information related to political figures or polling statistics
New Auto-Interp
Negative Logits
oneself
-0.65
Yourself
-0.56
Rew
-0.56
ifestyle
-0.56
concludes
-0.53
externalActionCode
-0.53
clusive
-0.52
identities
-0.51
wealth
-0.50
collect
-0.49
POSITIVE LOGITS
likewise
1.02
similarly
0.92
meanwhile
0.90
theirs
0.67
Same
0.63
unaffected
0.61
similar
0.60
chim
0.60
hers
0.55
pts
0.52
Activations Density 1.011%