INDEX
Explanations
mentions of specific locations and activities involving people
references to personal experiences and motivations in relation to political or significant life events
Negative Logits
respectively    -0.71
their           -0.68
Their           -0.62
advertising     -0.61
present         -0.60
inct            -0.59
pieces          -0.59
idi             -0.58
imprint         -0.58
deems           -0.58
Positive Logits
myself          1.11
arij            0.74
Pastebin        0.73
yesterday       0.69
<[              0.68
onnaissance     0.67
intending       0.65
questionnaire   0.65
nesday          0.65
bnb             0.64
Activations Density 1.175%